Evaluating Machine Learning Metrics

The document discusses the importance of evaluating machine learning models, highlighting different evaluation metrics for classification and regression models. Key classification metrics include accuracy, precision, recall, F1 score, and ROC curve, while regression metrics include R^2, mean absolute error, and mean squared error. It also provides guidance on which metrics to prioritize based on the specific characteristics of the data and the model's performance.

Uploaded by

Renato

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views2 pages

Evaluating Machine Learning Metrics

Uploaded by

Renato

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Machine Learning Model Evaluation

Evaluating the results of a machine learning model is as important as building

one.

But just like how different problems have different machine learning models,
different machine learning models have different evaluation metrics.

Below are some of the most important evaluation metrics you'll want to look
into for classification and regression models.

Classification Model Evaluation Metrics/Techniques

 Accuracy - The accuracy of the model in decimal form. Perfect accuracy
is equal to 1.0.
 Precision - Indicates the proportion of positive identifications (model
predicted class 1) which were actually correct. A model which produces
no false positives has a precision of 1.0.
 Recall - Indicates the proportion of actual positives which were correctly
classified. A model which produces no false negatives has a recall of 1.0.
 F1 score - A combination of precision and recall. A perfect model
achieves an F1 score of 1.0.
 Confusion matrix - Compares the predicted values with the true values
in a tabular way, if 100% correct, all values in the matrix will be top left to
bottom right (diagonal line).
 Cross-validation - Splits your dataset into multiple parts and train and
tests your model on each part then evaluates performance as an
average.
 Classification report - Sklearn has a built-in function
called classification_report() which returns some of the main classification
metrics such as precision, recall and f1-score.
 ROC Curve - Also known as receiver operating characteristic is a plot of
true positive rate versus false-positive rate.
 Area Under Curve (AUC) Score - The area underneath the ROC curve.
A perfect model achieves an AUC score of 1.0.

Which classification metric should you use?

 Accuracy is a good measure to start with if all classes are balanced (e.g.
same amount of samples which are labelled with 0 or 1).
 Precision and recall become more important when classes are
imbalanced.
 If false-positive predictions are worse than false-negatives, aim for higher
precision.

 If false-negative predictions are worse than false-positives, aim for higher

recall.
 F1-score is a combination of precision and recall.
 A confusion matrix is always a good way to visualize how a classification
model is going.

Regression Model Evaluation Metrics/Techniques

 R^2 (pronounced r-squared) or the coefficient of determination -
Compares your model's predictions to the mean of the targets. Values
can range from negative infinity (a very poor model) to 1. For example, if
all your model does is predict the mean of the targets, its R^2 value
would be 0. And if your model perfectly predicts a range of numbers it's
R^2 value would be 1.
 Mean absolute error (MAE) - The average of the absolute differences
between predictions and actual values. It gives you an idea of how wrong
your predictions were.
 Mean squared error (MSE) - The average squared differences between
predictions and actual values. Squaring the errors removes negative
errors. It also amplifies outliers (samples which have larger errors).

Which regression metric should you use?
 R2 is similar to accuracy. It gives you a quick indication of how well your
model might be doing. Generally, the closer your R2 value is to 1.0, the
better the model. But it doesn't really tell exactly how wrong your model
is in terms of how far off each prediction is.
 MAE gives a better indication of how far off each of your model's
predictions are on average.
 As for MAE or MSE, because of the way MSE is calculated, squaring the
differences between predicted values and actual values, it amplifies
larger differences. Let's say we're predicting the value of houses (which
we are).
 Pay more attention to MAE: When being $10,000 off is twice as bad
as being $5,000 off.
 Pay more attention to MSE: When being $10,000 off is more than
twice as bad as being $5,000 off.

For more resources on evaluating a machine learning model, be sure to check
out the following resources:

 Scikit-Learn documentation for metrics and scoring (quantifying the

quality of predictions)
 Beyond Accuracy: Precision and Recall by Will Koehrsen
 Stack Overflow answer describing MSE (mean squared error) and
RSME (root mean squared error)

Machine Learning Model Evaluation Metrics
No ratings yet
Machine Learning Model Evaluation Metrics
1 page
Key Performance Metrics for ML Models
No ratings yet
Key Performance Metrics for ML Models
43 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
24 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
19 pages
Model Evaluation Techniques Explained
No ratings yet
Model Evaluation Techniques Explained
18 pages
Evaluation Metrics for Machine Learning
No ratings yet
Evaluation Metrics for Machine Learning
14 pages
Performance Metrics for ML Algorithms
No ratings yet
Performance Metrics for ML Algorithms
13 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
24 pages
Evaluating Metrics for Model Performance
No ratings yet
Evaluating Metrics for Model Performance
40 pages
M3 Evaluation Metrics
No ratings yet
M3 Evaluation Metrics
20 pages
Key Metrics for Model Evaluation
No ratings yet
Key Metrics for Model Evaluation
7 pages
Puran Singh - Assignment - 05
No ratings yet
Puran Singh - Assignment - 05
8 pages
DL 1
No ratings yet
DL 1
14 pages
Model Evaluation and Performance Metrics
No ratings yet
Model Evaluation and Performance Metrics
16 pages
Measuring Performance in Regression Models
No ratings yet
Measuring Performance in Regression Models
7 pages
Model Evaluation and Performance Metrics
No ratings yet
Model Evaluation and Performance Metrics
15 pages
Lec 7,8,9 Performance Evaluation Metrics
No ratings yet
Lec 7,8,9 Performance Evaluation Metrics
62 pages
Evaluation Metrics in Machine Learning
No ratings yet
Evaluation Metrics in Machine Learning
6 pages
Machine Learning Evaluation Metrics Guide
No ratings yet
Machine Learning Evaluation Metrics Guide
16 pages
Understanding Machine Learning Metrics
No ratings yet
Understanding Machine Learning Metrics
32 pages
Key Metrics for Evaluating Regression Models
No ratings yet
Key Metrics for Evaluating Regression Models
6 pages
Model Evaluation Techniques and Metrics
No ratings yet
Model Evaluation Techniques and Metrics
35 pages
Machine Learning Model Training & Testing
No ratings yet
Machine Learning Model Training & Testing
23 pages
Model Evaluation Metrics Explained
No ratings yet
Model Evaluation Metrics Explained
23 pages
Model Evaluation Metrics Explained
No ratings yet
Model Evaluation Metrics Explained
3 pages
Performance Metrics
No ratings yet
Performance Metrics
6 pages
ML Chapter 3 - Evaluation Metrics
No ratings yet
ML Chapter 3 - Evaluation Metrics
23 pages
Machine Learning Model Evaluation Metrics
No ratings yet
Machine Learning Model Evaluation Metrics
40 pages
Performance Metrics Regression 1
No ratings yet
Performance Metrics Regression 1
6 pages
Chapter-7 - Evaluation
No ratings yet
Chapter-7 - Evaluation
4 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
24 pages
Machine Learning Evaluation Metrics
No ratings yet
Machine Learning Evaluation Metrics
16 pages
Machine Learning Evaluation Metrics Guide
No ratings yet
Machine Learning Evaluation Metrics Guide
20 pages
Key Evaluation Metrics for ML Models
No ratings yet
Key Evaluation Metrics for ML Models
6 pages
Deep Learning Model Evaluation Metrics
No ratings yet
Deep Learning Model Evaluation Metrics
11 pages
Ai Linear Regression
No ratings yet
Ai Linear Regression
2 pages
Intro to Model Evaluation Metrics
No ratings yet
Intro to Model Evaluation Metrics
24 pages
Evaluating Machine Learning Models
No ratings yet
Evaluating Machine Learning Models
14 pages
Linear Regression Metrics Explained
No ratings yet
Linear Regression Metrics Explained
8 pages
Common Metrics for Regression Evaluation
No ratings yet
Common Metrics for Regression Evaluation
3 pages
Performance Metrics
No ratings yet
Performance Metrics
6 pages
Machine Learning Evaluation Metrics Guide
No ratings yet
Machine Learning Evaluation Metrics Guide
43 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
24 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
30 pages
Exp 4 Ads
No ratings yet
Exp 4 Ads
8 pages
Machine Learning Model Evaluation Metrics
No ratings yet
Machine Learning Model Evaluation Metrics
29 pages
Unit 4
No ratings yet
Unit 4
15 pages
ML Model Evaluation Metrics Guide
No ratings yet
ML Model Evaluation Metrics Guide
33 pages
ROC Curve and Evaluation Metrics Guide
No ratings yet
ROC Curve and Evaluation Metrics Guide
5 pages
Measuring Model Effectiveness in Big Data
No ratings yet
Measuring Model Effectiveness in Big Data
59 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
28 pages
Regression Model Basics and Metrics
No ratings yet
Regression Model Basics and Metrics
3 pages
Supervised Machine Learning Overview
No ratings yet
Supervised Machine Learning Overview
73 pages
Regression Error Metrics Explained
No ratings yet
Regression Error Metrics Explained
5 pages
Model Evaluation Metrics in ML
No ratings yet
Model Evaluation Metrics in ML
9 pages
Machine Learning Model Evaluation Techniques
No ratings yet
Machine Learning Model Evaluation Techniques
32 pages
Regression Model Evaluation Metrics
No ratings yet
Regression Model Evaluation Metrics
6 pages
Logistic Regression Numerical Exercises
No ratings yet
Logistic Regression Numerical Exercises
3 pages
Advanced Econometrics Exam Paper
No ratings yet
Advanced Econometrics Exam Paper
4 pages
Evolution Simulation Lab Report
No ratings yet
Evolution Simulation Lab Report
8 pages
SPSS Regression Analysis Results
No ratings yet
SPSS Regression Analysis Results
7 pages
Multiple Regression Analysis in SPSS
No ratings yet
Multiple Regression Analysis in SPSS
11 pages
Understanding Heteroskedasticity and Detection
No ratings yet
Understanding Heteroskedasticity and Detection
4 pages
Understanding Statistical Inference and Estimators
No ratings yet
Understanding Statistical Inference and Estimators
37 pages
Simple Linear Regression Analysis and OLS Estimators
No ratings yet
Simple Linear Regression Analysis and OLS Estimators
17 pages
Regression Analysis Quiz
No ratings yet
Regression Analysis Quiz
5 pages
Linear Regression in Financial Econometrics
No ratings yet
Linear Regression in Financial Econometrics
61 pages
2024 Econometrics Summer School Details
No ratings yet
2024 Econometrics Summer School Details
13 pages
Monthly Product Demand Analysis 2021-2024
No ratings yet
Monthly Product Demand Analysis 2021-2024
6 pages
Demand Forecasting Techniques Overview
No ratings yet
Demand Forecasting Techniques Overview
65 pages
SRS Abridged Life Tables 2016-20
No ratings yet
SRS Abridged Life Tables 2016-20
68 pages
Descriptive Statistics and Regression Analysis
No ratings yet
Descriptive Statistics and Regression Analysis
7 pages
Newsletter 23 - Logit, Probit, Tobit (2P)
No ratings yet
Newsletter 23 - Logit, Probit, Tobit (2P)
2 pages
Econometric Methods Course Handout
No ratings yet
Econometric Methods Course Handout
4 pages
FAANG Data Scientist Interview Prep Guide
No ratings yet
FAANG Data Scientist Interview Prep Guide
3 pages
Business Statistics II Question Bank
No ratings yet
Business Statistics II Question Bank
2 pages
Understanding Quantitative Kriging Neighborhood Analysis
No ratings yet
Understanding Quantitative Kriging Neighborhood Analysis
10 pages
Triple Exponential Smoothing Guide
No ratings yet
Triple Exponential Smoothing Guide
31 pages
Rainfall Impact on Crop Production
No ratings yet
Rainfall Impact on Crop Production
2 pages
Building Regression Trees Explained
No ratings yet
Building Regression Trees Explained
17 pages
Biostatistics: Correlation & Regression
No ratings yet
Biostatistics: Correlation & Regression
52 pages
Understanding Linear Regression Concepts
No ratings yet
Understanding Linear Regression Concepts
3 pages
Statistical Analysis Techniques in Minitab
No ratings yet
Statistical Analysis Techniques in Minitab
56 pages
Regression Analysis and ANOVA Tutorial
No ratings yet
Regression Analysis and ANOVA Tutorial
3 pages
Simple Linear Regression Explained
No ratings yet
Simple Linear Regression Explained
4 pages
EBLUP and SEBLUP Methods in Small Area Estimation
No ratings yet
EBLUP and SEBLUP Methods in Small Area Estimation
47 pages
Understanding VIF: Definition & Importance
No ratings yet
Understanding VIF: Definition & Importance
5 pages