0% found this document useful (0 votes)

94 views3 pages

Step-by-Step Machine Learning Guide

1. The document outlines a 12-step process for machine learning projects that includes setting objectives, obtaining data, exploring the data, selecting tools, training models, validating models on new data, testing models, building production systems, launching models, monitoring performance, and maintaining systems. 2. Key steps include setting measurable objectives, exploring data to understand patterns, evaluating models on held-out validation and test data to avoid overfitting, gradually increasing complexity of models, and continuously monitoring models in production. 3. Successful machine learning requires cross-functional teams including decision makers, domain experts, engineers, analysts, and reliability engineers working through each step of the process.

Uploaded by

koernj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

94 views3 pages

Step-by-Step Machine Learning Guide

Uploaded by

koernj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Introduction to Machine Learning
Step 1: Set Objective
Step 3: Split Data
Step 2: Get Data
Step 4: Explore Data
Step 6: Train
Step 5: Model Data
Step 7: Tune and Debug
Step 9: Test
Step 11: Launch
Step 10: Build
Step 8: Validate
Step 12: Monitor and Maintain

Step-by-Step Machine Learning Process

Cassie Kozyrkov (kozyr@[Link])

What is Machine Learning?
● Machine Learning (ML) is an approach to making many decisions that involves algorithmically f inding
patterns in old data and using these patterns to create models (recipes) for dealing correctly with brand new
situations.
● In a nutshell: ML is all about finding and using patterns in data to perform new tasks.

Jargon:
● Instances: examples, observations. (rows)
● Labels: targets, ground truth. (correct answers)
● Features: information about the instances, variables, attributes. (columns)

Check you need ML
● If you can’t even imagine what sort of decisions (labels) you’d like your ML system to make for you, stop. It
might be too early for you to consider ML.
● Try imagining that you’d get 1 million free but distracted human workers who would all work on similar small
tasks. What would you ask them to work on? What does good work look like? How would you know if they
were slacking off? How would you choose between these million or those million free human workers?
Imagine the work and imagine how you would assess its quality.

Step-by-Step Machine Learning

Step 1: Set Objective ( Decision Maker, Domain Expert)
● Steps:
○ Write down outputs/labels.
○ Consider mistakes.
○ How would you score one mistake vs another.
○ Create business performance metric (BPM) by stitching together individual outcome scores for all of
your individual outcomes together.
○ Look up some classic loss functions used for these types of outputs (e.g. loss functions for binary
outputs, for text outputs, for numerical outputs, etc.)
○ Compare the business performance metric with the loss function.
○ Set minimum performance criteria to productionize and to launch.
● Thinking carefully about what success means and picking a metric that captures business performance is
important! This is the decision-maker’s responsibility and without it, the ML process is doomed.
● Imagine all the labels are made by an imperfect human worker instead of an ML system. Just focus on output.
● In ML, the proof of the pudding (model) is always in the eating (performance on new data). Always evaluate
performance based on your business metric.

Step 2: Get Data (Engineer, Domain Expert)
○ Your ML system is only as good as the data that went into it.
○ Getting data involves lots of engineering effort.
○ Just focus on getting IDs and a few inputs (“features” or “variables”) per ID. These won’t be the right
ones, just a starting point for your analysts to work on in Step 4.

Step 3: Split Data (Engineer)
● Overfitting happens when you model noise instead of reality.
● If you don’t optimize for fit on totally fresh data, your models are no good to you.
● You need fresh data for checking performance!
● Split your data into:
○ Training dataset
○ Validation dataset
○ Test dataset
● Key Message: Your ML system is no good to you if it can’t deal with new data. It’s too easy to build a system
that’s really good at old data but fails miserably on new. Make sure you avoid this by evaluating performance
on fresh data.

Step 4: Explore (Analyst, Domain Expert)
● Plotting data is your secret weapon for machine learning.
● You’re only allowed to look in your training data!
● Don’t look in your validation and test datasets.
● Key Message: Your data are your most valuable resource. If you don’t explore your training dataset, you’re
missing out on taking full advantage of it.

Step 5: Get Tools (Engineer)
● The math is in service of:
1) Finding patterns (in old data).
2) Assessing models (in new data).
● Unless you’re designing brand new algorithms, you can get away with:
○ Sufficient computer skills to use ML tools others have built
○ Sufficient statistical skills to evaluate model performance
● Key Message: Start with a list of available tools and aggressively eliminate everything that obviously won’t
work, then just pick as many of the remaining tools and try them in parallel. Invest in the ability to try to run
many algorithms in parallel.
● Don’t worry about picking the “right” algorithm. Worry about giving as many of them as possible a chance.
The proof of the pudding is in the eating.

Step 6: Train ( Engineer, Analyst)
● In training, your goal is to run lots of algorithms in parallel on your data and assess the models they produce
on that same data.
● You’re making a shortlist of models that seem to work.
● Don’t worry about getting it right first time - it’ll take a few tries.
● Start simple and only build up the complexity if the simple solution doesn’t work.

Step 7: Tune and Debug (Engineer, Analyst)
● If you want the safest, most effective debugging strategy, then:
○ Run your algorithm (step 6) in some data.
○ Debug its performance by using it to label different data.
○ Since you’re not allowed to debug using validation or test data, you’re going to need to allocate a
separate dataset for tuning/debugging. You’ll have a 4th dataset in play for debugging.
● You can create the tuning/debugging dataset on the fly by allocating some of the training data for this.
● If you have hyperparameters (numerical settings you must choose before running algorithm), use this data to
tune them.

Step 8: Validate ( Analyst)
● Validation is all about checking if your model succeeds on a new dataset.
● Validation protects you from blindly overfitting. It keeps you safe, don’t skip it!
● Only view the final metric, not individual validation data points. Don’t debug in your validation data.
● Repeatedly validating erodes your protection. That’s why we have step 9 (testing).

Step 9: Test (Decision Maker, Analyst)
● Testing is the final frontier before you take your model live. This is where statistical rigor enters the picture.
● This is what the discipline of statistical inference is all about. Ask your statisticians for help.
● You only get one shot at this per test dataset.
● If testing fails, the only way to start again is to collect a pristine new test dataset.
● Never test on data that was involved in any way in training/tuning/debugging/testing.

Step 10: Build (Engineer)
● Your algorithm got you a favorite model (a model is just a recipe for turning inputs into outputs). The
engineering team’s job is to get this recipe into production.
● You can build it so it keeps itself updated automatically by building capabilities for retraining in production.
● Changing anything changes everything, so always test after a change.
● Don’t forget to think about:
○ Retraining data, speed, and frequency.
○ Ability to restrict retraining data inputs and detect fleeting changes.
○ Logging bugs and changes to logging.
○ Tracking and safety nets for outliers.
○ Plans for when retesting fails.
● Policy layers (logic that checks the model output) are a very good idea. Build them before launching in
production and make them easy to add case-by-case corrections to.

Step 11: Launch (Analyst, Decision Maker)
● You need to make sure the ML system is good enough for your business needs.
● Do an experiment to measure its impact and check that launching it at 100% is the right decision.
● Components of an experiment:
○ Hypothesis. (Performance of ML system good enough? This criterion was decided in Step 1.)
○ Different treatments. (ML system vs not ML system.)
○ Randomization to treatments. (Live traffic sent at random to ML system or old system.)

Step 12: Monitor and Maintain (Reliability Engineer, Analyst)
● Invest in:
○ Monitoring plan
○ Maintenance plan (and ensure there’s headcount for carrying it out)
○ Tracking dashboards
○ Good documentation

Common questions

The choice and quality of data critically impact the effectiveness of a machine learning system because the model relies on patterns found in historical data to make predictions. High-quality, relevant data enables the model to learn accurate patterns that generalize well to new, unseen data. Poor-quality data can lead to overfitting or underfitting, decrements in model performance, and erroneous predictions. Data quality and appropriateness affect every stage from training to testing, underscoring the importance of careful data collection and preprocessing .

Using separate datasets for training, validation, and testing in machine learning is necessary to ensure the model's ability to generalize to new data. The training dataset is used to build and tune the model. The validation dataset helps in fine-tuning and selecting the best-performing model variant by preventing overfitting. Finally, the testing dataset provides an unbiased evaluation of the final model's performance. This separation is crucial because it ensures that the model has not learned specific data patterns that don't generalize beyond the training and validation phases .

Exploration and understanding of data are pivotal in the machine learning process as they help identify underlying patterns, feature correlations, and potential data quality issues. Plotting and examining the training data can reveal insights into feature distributions and outliers, guiding feature selection and engineering processes. Exploring data informs the subsequent modeling steps and decisions, ensuring that models make informed predictions and mitigating errors caused by irrelevant or noisy data .

Randomization is important in the design of experiments for launching a machine learning system because it minimizes bias and ensures that differences in outcomes are due to the ML system itself rather than confounding variables. By randomly assigning traffic or cases to the ML system and the control system, one can more accurately assess the causal impact of the ML deployment on performance metrics, leading to more credible and actionable insights from the experimental results .

Machine learning models benefit from policy layers added before production as they enhance reliability and safety. These layers act as checks and balances, ensuring outputs align with business rules or regulations and providing mechanisms for manual adjustment of outputs based on real-world scenarios. Policy layers help manage risks by catching outliers or unexpected predictions and support compliance and trust by allowing human oversight and intervention when necessary .

Monitoring and maintenance of a machine learning system ensure its long-term effectiveness by allowing early detection of performance degradation, model drift, and infrastructure issues. Continuous monitoring helps track changes in data inputs and outputs, while regular maintenance updates the model based on new data, adapts to changing patterns, and fixes any emerging issues. Together, these practices keep the system aligned with business objectives and improving over time, ensuring sustained reliability and performance .

Setting a proper objective in the machine learning process is crucial because it guides the entire process, ensuring that the outcomes are aligned with business goals. A well-defined objective helps in determining what success looks like and choosing appropriate metrics that capture business performance accurately. Without a clear objective, the machine learning model is unlikely to achieve desired outcomes, as the process lacks direction and fails to measure meaningful performance .

Testing a machine learning model on previously used data compromises the model’s reliability because it can lead to overestimation of model performance. The model may have inadvertently learned the specifics of the training data rather than capturing generalizable patterns, resulting in a performance measure that does not reflect real-world utility. Testing on fresh, unused data provides a more accurate estimate of how the model will perform in practice, revealing any tendencies to overfit to the training data .

Skipping the validation step could lead to overfitting, where the model performs well on training data but poorly on new data because it fails to generalize. Validation acts as a safeguard by testing model performance on unseen data, ensuring the model's robustness and preventing overfitting. Without validation, the model might underperform in real-world applications, resulting in unreliable predictions and potentially costly errors when deployed .

When deciding whether to launch a machine learning model fully in production, factors such as model performance relative to predefined success metrics, potential business impact, alignment with business objectives, and reliability under various conditions should be considered. An experiment should be conducted to compare the ML system against existing systems, considering factors like system robustness, scalability, and the ease of implementing policy layers or corrective measures. Additionally, the readiness of monitoring and maintenance plans contributes significantly to the decision .

Step-by-Step Machine Learning Process
Cassie Kozyrkov (kozyr@google.com) (mailto:kozyr@google.com)

What is Machine

Step 2: Get Data (Engineer, Domain Expert)
○
Your ML system is only as good as the data that went into it.
○
Getting dat

●
If you have hyperparameters (numerical settings you must choose before running algorithm), use this data to
tune them.

Machine Learning in Data Science
No ratings yet
Machine Learning in Data Science
64 pages
Hadoop for Big Data Management
No ratings yet
Hadoop for Big Data Management
38 pages
Machine Learning Overview and Types
No ratings yet
Machine Learning Overview and Types
45 pages
Australian Gas Production Time Series Analysis
100% (19)
Australian Gas Production Time Series Analysis
29 pages
Unified Process and Use Case Diagrams
No ratings yet
Unified Process and Use Case Diagrams
62 pages
ML Algorithms for Lung Cancer Prognosis
No ratings yet
ML Algorithms for Lung Cancer Prognosis
11 pages
Introduction to Data Science Course
No ratings yet
Introduction to Data Science Course
25 pages
Understanding Object-Oriented Design
No ratings yet
Understanding Object-Oriented Design
66 pages
Ensemble Diabetes Prediction Project
100% (1)
Ensemble Diabetes Prediction Project
49 pages
Database Application Integration in VB
No ratings yet
Database Application Integration in VB
41 pages
Crime Pattern Detection Using Data Mining: Snath1@Fau - Edu
No ratings yet
Crime Pattern Detection Using Data Mining: Snath1@Fau - Edu
4 pages
Student Performance Analysis with BI Tools
No ratings yet
Student Performance Analysis with BI Tools
132 pages
Chicken Disease Classification Project Plan
No ratings yet
Chicken Disease Classification Project Plan
14 pages
Fuzzy Logic Applications in Engineering
No ratings yet
Fuzzy Logic Applications in Engineering
3 pages
Hard vs Soft Margin in SVM Explained
No ratings yet
Hard vs Soft Margin in SVM Explained
8 pages
Understanding Pattern Recognition
No ratings yet
Understanding Pattern Recognition
3 pages
Predicting Academic Performance with AugmentED
No ratings yet
Predicting Academic Performance with AugmentED
6 pages
Supervised vs. Unsupervised Learning Guide
No ratings yet
Supervised vs. Unsupervised Learning Guide
9 pages
Fuzzy Logic and Applications PDF
No ratings yet
Fuzzy Logic and Applications PDF
13 pages
Master C# .NET Programming Techniques
No ratings yet
Master C# .NET Programming Techniques
1,097 pages
SQL Server Database with Visual Basic
No ratings yet
SQL Server Database with Visual Basic
23 pages
Invocable Methods in Salesforce Apex
No ratings yet
Invocable Methods in Salesforce Apex
7 pages
Introduction to System Design & Implementation
No ratings yet
Introduction to System Design & Implementation
16 pages
Specialized Data in Predictive Analytics
No ratings yet
Specialized Data in Predictive Analytics
44 pages
Understanding Cloud Architecture
No ratings yet
Understanding Cloud Architecture
16 pages
Heart Attack Risk Detection via Retinal Imaging
No ratings yet
Heart Attack Risk Detection via Retinal Imaging
6 pages
Adaline and Madaline Neural Networks
No ratings yet
Adaline and Madaline Neural Networks
8 pages
Data Manipulation with ADO.NET in ASP.NET
No ratings yet
Data Manipulation with ADO.NET in ASP.NET
39 pages
Superposition in Series-Parallel Circuits
No ratings yet
Superposition in Series-Parallel Circuits
93 pages
Machine Learning Course Outline
No ratings yet
Machine Learning Course Outline
4 pages
Understanding ADO.NET Basics
No ratings yet
Understanding ADO.NET Basics
21 pages
Machine Learning Experiment Guidelines
No ratings yet
Machine Learning Experiment Guidelines
6 pages
Memory-Based Reasoning in Data Mining
100% (1)
Memory-Based Reasoning in Data Mining
19 pages
IPL Dashboard Insights 2023
No ratings yet
IPL Dashboard Insights 2023
21 pages
Unsupervised Clustering of Learning Behaviors
No ratings yet
Unsupervised Clustering of Learning Behaviors
7 pages
Machine Learning Project Checklist
No ratings yet
Machine Learning Project Checklist
6 pages
Magnetic Induction and AC Circuits Guide
No ratings yet
Magnetic Induction and AC Circuits Guide
10 pages
VB Projects for Mouse Events and Forms
No ratings yet
VB Projects for Mouse Events and Forms
40 pages
Loan Prediction System Overview
No ratings yet
Loan Prediction System Overview
5 pages
Clustering Techniques Overview
No ratings yet
Clustering Techniques Overview
67 pages
Predicting Student Performance with ML
0% (1)
Predicting Student Performance with ML
2 pages
Deep Learning Basics Overview
No ratings yet
Deep Learning Basics Overview
29 pages
Student Dropout Prediction in Higher Ed
No ratings yet
Student Dropout Prediction in Higher Ed
5 pages
Neural Networks Overview for JNTUK R20
No ratings yet
Neural Networks Overview for JNTUK R20
10 pages
Python Recommendation System Overview
No ratings yet
Python Recommendation System Overview
13 pages
Implement of Salary Prediction System To Improve Student Motivation Using Data Mining Technique PDF
No ratings yet
Implement of Salary Prediction System To Improve Student Motivation Using Data Mining Technique PDF
6 pages
Machine Learning Regression Techniques
No ratings yet
Machine Learning Regression Techniques
16 pages
Support Vector Machine Overview
No ratings yet
Support Vector Machine Overview
8 pages
Understanding Business Intelligence Systems
100% (1)
Understanding Business Intelligence Systems
25 pages
Multi Soft
No ratings yet
Multi Soft
14 pages
Student Academic Performance Prediction Under Various Machine Learning Classification Algorithms
No ratings yet
Student Academic Performance Prediction Under Various Machine Learning Classification Algorithms
19 pages
EQ's Role in TVET for 4IR in Bangladesh
No ratings yet
EQ's Role in TVET for 4IR in Bangladesh
18 pages
Understanding Neural Networks Basics
No ratings yet
Understanding Neural Networks Basics
27 pages
SVMs in Pattern Recognition Tutorial
No ratings yet
SVMs in Pattern Recognition Tutorial
37 pages
Perceptron Model Overview and Learning
No ratings yet
Perceptron Model Overview and Learning
33 pages
Smart Parking System with MERN Stack
No ratings yet
Smart Parking System with MERN Stack
6 pages
Test Suite Growth and Prioritization Techniques
No ratings yet
Test Suite Growth and Prioritization Techniques
28 pages
Salesforce Lightning CTI Setup Guide
No ratings yet
Salesforce Lightning CTI Setup Guide
15 pages
SQL Queries for Book Data Retrieval
No ratings yet
SQL Queries for Book Data Retrieval
29 pages
Machine Learning Life Cycle Overview
No ratings yet
Machine Learning Life Cycle Overview
10 pages
Essential Linux Commands Handbook
100% (17)
Essential Linux Commands Handbook
135 pages
TSI SW100 Cyber Security How SABSA Can Help Your Business 1
No ratings yet
TSI SW100 Cyber Security How SABSA Can Help Your Business 1
5 pages
Windows Group Policy
100% (1)
Windows Group Policy
185 pages
Windows 2000 Group Policy
No ratings yet
Windows 2000 Group Policy
186 pages
VOIPSA Threat Taxonomy 0.1
No ratings yet
VOIPSA Threat Taxonomy 0.1
36 pages
ENISA Activities on Resilience and Privacy
No ratings yet
ENISA Activities on Resilience and Privacy
20 pages
Energy-Efficient TinyML: Quantization & Pruning
No ratings yet
Energy-Efficient TinyML: Quantization & Pruning
8 pages
Data Science Resume of Ismail Moeen
No ratings yet
Data Science Resume of Ismail Moeen
1 page
Smart Drone Surveillance DSS in Agriculture
No ratings yet
Smart Drone Surveillance DSS in Agriculture
7 pages
SCM in BPO: Overview and Benefits
No ratings yet
SCM in BPO: Overview and Benefits
34 pages
AI ML Engineer Roadmap
No ratings yet
AI ML Engineer Roadmap
3 pages
Senior AI/ML Engineer Profile
No ratings yet
Senior AI/ML Engineer Profile
2 pages
ICT Interview Questions & Answers PDF
No ratings yet
ICT Interview Questions & Answers PDF
5 pages
Python Trading Simulation with ML
No ratings yet
Python Trading Simulation with ML
5 pages
Transformer-Based Active Learning For Multi-Class
No ratings yet
Transformer-Based Active Learning For Multi-Class
21 pages
GDP Prediction with Machine Learning
No ratings yet
GDP Prediction with Machine Learning
63 pages
Cellular Networks Optimization Resources
No ratings yet
Cellular Networks Optimization Resources
7 pages
ChatGPT in Medical Diagnosis Tools
No ratings yet
ChatGPT in Medical Diagnosis Tools
14 pages
Predictive Crime Analysis Web App
No ratings yet
Predictive Crime Analysis Web App
7 pages
Lightweight Self-Supervised Depth Estimation
No ratings yet
Lightweight Self-Supervised Depth Estimation
11 pages
Machine Learning for Family Wellness Analysis
No ratings yet
Machine Learning for Family Wellness Analysis
15 pages
Metal Price Forecasting with AI Models
No ratings yet
Metal Price Forecasting with AI Models
16 pages
Overview of Artificial Intelligence Systems
No ratings yet
Overview of Artificial Intelligence Systems
45 pages
Machine Learning in Blockchain Analysis
No ratings yet
Machine Learning in Blockchain Analysis
9 pages
Principles of Synthetic Biology
No ratings yet
Principles of Synthetic Biology
4 pages
AI Adoption Challenges in Finance
No ratings yet
AI Adoption Challenges in Finance
10 pages
Deep Learning for Traffic Accident Risk
No ratings yet
Deep Learning for Traffic Accident Risk
10 pages
Few-Shot Adaptation of Foundation Models
No ratings yet
Few-Shot Adaptation of Foundation Models
1 page
IJAISC: AI and Soft Computing Research
No ratings yet
IJAISC: AI and Soft Computing Research
2 pages
Deep Learning Model Management Guide
No ratings yet
Deep Learning Model Management Guide
8 pages
Deep Learning Exam Answers and Matrix G
No ratings yet
Deep Learning Exam Answers and Matrix G
20 pages
Integrative Frameworks for Cancer Driver Gene Prediction
No ratings yet
Integrative Frameworks for Cancer Driver Gene Prediction
16 pages
Machine Learning Approaches Explained
No ratings yet
Machine Learning Approaches Explained
6 pages
Importance of Statistics in Data Science
No ratings yet
Importance of Statistics in Data Science
3 pages
Learning Outcome 3: Perform Model Deployment
No ratings yet
Learning Outcome 3: Perform Model Deployment
36 pages
Enhancing Stock Forecasting Accuracy
No ratings yet
Enhancing Stock Forecasting Accuracy
30 pages

Step-by-Step Machine Learning Guide

Uploaded by

Step-by-Step Machine Learning Guide

Uploaded by

Step-by-Step Machine Learning Process

Cassie Kozyrkov (kozyr@[Link])

Common questions

How does the choice and quality of data impact the effectiveness of a machine learning system?

Why is it necessary to use separate datasets for training, validation, and testing in machine learning?

What role does the exploration and understanding of data play in the machine learning process?

Why is randomization important in the design of experiments for launching a machine learning system?

How do machine learning models benefit from having policy layers added before going into production?

Discuss how monitoring and maintenance of a machine learning system contribute to its long-term effectiveness.

Why is it important to focus on setting a proper objective in the initial step of the machine learning process?

Explain why testing a machine learning model on previously used data can compromise the model’s reliability.

What are the potential consequences of skipping the validation step in the machine learning workflow?

What factors should be considered when deciding whether to launch a machine learning model fully in production?

You might also like

Step-by-Step Machine Learning Guide

Uploaded by

Step-by-Step Machine Learning Guide

Uploaded by

Step-by-Step Machine Learning Process

Cassie Kozyrkov (​kozyr@[Link]​)

Common questions

How does the choice and quality of data impact the effectiveness of a machine learning system?

How does the choice and quality of data impact the effectiveness of a machine learning system?

Why is it necessary to use separate datasets for training, validation, and testing in machine learning?

Why is it necessary to use separate datasets for training, validation, and testing in machine learning?

What role does the exploration and understanding of data play in the machine learning process?

What role does the exploration and understanding of data play in the machine learning process?

Why is randomization important in the design of experiments for launching a machine learning system?

Why is randomization important in the design of experiments for launching a machine learning system?

How do machine learning models benefit from having policy layers added before going into production?

How do machine learning models benefit from having policy layers added before going into production?

Discuss how monitoring and maintenance of a machine learning system contribute to its long-term effectiveness.

Discuss how monitoring and maintenance of a machine learning system contribute to its long-term effectiveness.

Why is it important to focus on setting a proper objective in the initial step of the machine learning process?

Why is it important to focus on setting a proper objective in the initial step of the machine learning process?

Explain why testing a machine learning model on previously used data can compromise the model’s reliability.

Explain why testing a machine learning model on previously used data can compromise the model’s reliability.

What are the potential consequences of skipping the validation step in the machine learning workflow?

What are the potential consequences of skipping the validation step in the machine learning workflow?

What factors should be considered when deciding whether to launch a machine learning model fully in production?

What factors should be considered when deciding whether to launch a machine learning model fully in production?

You might also like

Cassie Kozyrkov (kozyr@[Link])