0% found this document useful (0 votes)

16 views21 pages

Model Evaluation Techniques in Machine Learning

Model evaluation is the process of assessing a model's performance on unseen data to ensure accuracy and generalization, preventing overfitting and underfitting. Key evaluation metrics include accuracy, error, precision, recall, and the confusion matrix, which help in understanding a model's effectiveness. Ethical concerns such as bias, transparency, and accountability are crucial in the evaluation process to ensure responsible AI usage.

Uploaded by

sunitabarate851

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views21 pages

Model Evaluation Techniques in Machine Learning

Uploaded by

sunitabarate851

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Unit 3: Evaluating Models

Model Evaluation
• Definition:
Model evaluation is the process of assessing how
well a model performs on unseen data.

• Explanation:
It helps determine the model’s accuracy,
robustness, and suitability for solving the
problem. It ensures that the model is not just
memorizing data (overfitting) but can generalize.
Importance of Model Evaluation
• Evaluation refers to the process of assessing a machine learning
model's performance on data that it hasn't seen before (usually
from a test dataset). The goal is to determine:
– How well the model is making predictions.
– Whether it is generalizing correctly beyond the training data.
– If the model is accurate, reliable, and fair in practical scenarios.

• This process is essential to identify:

– Overfitting (model performs well on training data but poorly on new
data).
– Underfitting (model fails to capture patterns even in training data).
Areas where the model can be improved before deployment.
– Evaluating a model helps determine its performance and reliability.
Splitting the Training Set

• It ensures the model generalizes well and is

not over fitted.
Train-Test Split
• Definition:
Train-Test Split is dividing the dataset into two parts –
training and testing.

• Explanation:
Training set is used to build the model; the test set is
used to evaluate it. It mimics real-world data to
measure performance realistically.
Need of Train-test split

❖ The train dataset is used to make the model learn

❖ The input elements of the test dataset are

provided to the trained model. The model makes
predictions, and the predicted values are
compared to the expected values

❖ The objective is to estimate the performance of the

machine learning model on new data: data not
used to train the model
What is Accuracy and Error?
• Accuracy:

• Accuracy is an evaluation metric that allows you

to measure the total number of
predictions a model gets right.

• The accuracy of the model and performance of

the model is directly proportional, and hence
better the performance of the model, the more
accurate are the predictions.
What is Error?

Error can be described as an action that is inaccurate

or wrong.

In Machine Learning, the error is used to see how

accurately our model can predict data it uses to learn
new, unseen data.

Based on our error, we choose the machine learning

model which performs best for a particular dataset
3.3 Accuracy and Error
Definitions:
• Accuracy = (Correct Predictions / Total
Predictions) × 100%
• Error = 1 - Accuracy

• Explanation:
High accuracy means fewer prediction mistakes.
Error helps track how much the model deviates
from the actual results.
Evaluation Metrics for Classification
• Definition:
Classification metrics help in understanding
model performance on classification
problems.
Classification Metrics

Popular metrics used for classification model

▪ Confusion matrix
▪ Classification accuracy
▪ Precision
▪ Recall
The confusion matrix

Confusion Matrix: Table to visualize performance.

• The confusion matrix is a handy presentation of the accuracy of a model with
two or more classes
• The table presents the actual values on the y-axis and predicted values on the x-
axis
• The numbers in each cell represents the number of predictions made by a
machine learning algorithm that falls into that particular category
True Positive and True Negative

1. True Positive: True Positive (TP) is the outcome of

the model correctly predicting the positive class.
Example: You had predicted that France
would win the world cup, and it won.

2. True Negative : True Negative (TN) is the outcome

of the model correctly predicting the negative
class.
Example: You had predicted that Germany
would not win, and it lost
False Positive and False Negative
False Positive: False Positive (FP) is the outcome of the
model wrongly predicting the negative class as positive
class
Example: You had predicted that Germany would win, but it
lost.
False Negative: False Negative (FN) is the outcome of the
model wrongly predicting the positive class as the negative
class.
Example: You had predicted that France would not win but
it won
Calculations
Precision from Confusion matrix

Precision is the ratio of the total number of

correctly classified positive examples and the
total number of predicted positive examples.

Precision = Correct positive predictions

Total positive predictions
TP
TP+FP
Recall from Confusion matrix

The recall is the measure of our model correctly identifying

True Positives

Recall = Correct positive predictions

Total actual positive values
TP
TP+FN
• Recall is also called as Sensitivity or True Positive Rate

• Recall is generally used for unbalanced dataset when

dealing with the False Negatives become important and
the model needs to reduce the FNs as much as possible.
Example

The case of predicting a good day based on

weather conditions to launch satellite.

Missing out on predicting a good weather day is

okay (low recall) but predicting the bad weather
day (Negative class) as a good weather
day (Positive class) to launch the satellite can be
disastrous.
Ethical Concerns in Model Evaluation
• Bias: A model may favor one group over
another unfairly.
• Transparency: Model logic should be
understandable.
• Accountability: Developers must ensure models
behave ethically.

• These concerns ensure responsible use of AI

systems.
F1 Score

F1-Score provides a way to combine both precisions

and recall into a single measure that captures both
properties

Used where the dataset is unbalanced, and we are

unable to decide whether FP is more important or FN,
we should use the F1 score as the suitable metric.

F1 Score = 2 x Precision x Recall

Precision + Recall
Ethical Concerns in Evaluation

• Bias: Ensuring fairness in model outcomes

• Transparency: Clear model operations
• Accountability: Responsibility for model
decisions

Understanding the F1 Score in AI Evaluation
100% (1)
Understanding the F1 Score in AI Evaluation
5 pages
Model Evaluation Techniques in AI
No ratings yet
Model Evaluation Techniques in AI
4 pages
Model Evaluation and Accuracy Metrics
No ratings yet
Model Evaluation and Accuracy Metrics
10 pages
Class 10 Chap 3
No ratings yet
Class 10 Chap 3
8 pages
Understanding AI Model Evaluation
No ratings yet
Understanding AI Model Evaluation
18 pages
Evaluating AI Model Performance Metrics
No ratings yet
Evaluating AI Model Performance Metrics
4 pages
Chapter 3 Evaluating Models Notes Edited
No ratings yet
Chapter 3 Evaluating Models Notes Edited
5 pages
Model Evaluation in Machine Learning
No ratings yet
Model Evaluation in Machine Learning
6 pages
Understanding False Positives in Model Evaluation
No ratings yet
Understanding False Positives in Model Evaluation
10 pages
AI Model Evaluation Metrics Guide
No ratings yet
AI Model Evaluation Metrics Guide
28 pages
Evaluating Models
No ratings yet
Evaluating Models
6 pages
AI PART B - UNIT 3 Class 10
No ratings yet
AI PART B - UNIT 3 Class 10
4 pages
Evaluating AI Model Performance Metrics
No ratings yet
Evaluating AI Model Performance Metrics
51 pages
AI Model Evaluation Metrics Guide
No ratings yet
AI Model Evaluation Metrics Guide
23 pages
Evaluating AI Models in Machine Learning
No ratings yet
Evaluating AI Models in Machine Learning
4 pages
Enhancing AI Model Evaluation Techniques
No ratings yet
Enhancing AI Model Evaluation Techniques
5 pages
Precision vs Recall in AI Evaluation
No ratings yet
Precision vs Recall in AI Evaluation
45 pages
Chapter-7 - Evaluation
No ratings yet
Chapter-7 - Evaluation
4 pages
Key Metrics for Machine Learning Evaluation
No ratings yet
Key Metrics for Machine Learning Evaluation
2 pages
Machine Learning Model Evaluation Guide
No ratings yet
Machine Learning Model Evaluation Guide
3 pages
Importance of Model Evaluation in ML
No ratings yet
Importance of Model Evaluation in ML
22 pages
AI Model Evaluation Metrics Guide
No ratings yet
AI Model Evaluation Metrics Guide
21 pages
Model Evaluation in AI Systems
100% (1)
Model Evaluation in AI Systems
16 pages
Class 10 Model Evaluation Notes
No ratings yet
Class 10 Model Evaluation Notes
7 pages
ReactNativeBlobUtilTmp 43t1uksk18iwj4u6mds4t
No ratings yet
ReactNativeBlobUtilTmp 43t1uksk18iwj4u6mds4t
75 pages
Class 10 AI Model Evaluation Notes
No ratings yet
Class 10 AI Model Evaluation Notes
52 pages
AI Model Evaluation Metrics Explained
No ratings yet
AI Model Evaluation Metrics Explained
39 pages
AI Model Evaluation Metrics Explained
No ratings yet
AI Model Evaluation Metrics Explained
37 pages
Evaluating AI Model Accuracy Metrics
No ratings yet
Evaluating AI Model Accuracy Metrics
5 pages
Key Notes - Evaluating Model
No ratings yet
Key Notes - Evaluating Model
4 pages
Evaluation Metrics in Machine Learning
No ratings yet
Evaluation Metrics in Machine Learning
2 pages
Machine Learning Evaluation Metrics Guide
No ratings yet
Machine Learning Evaluation Metrics Guide
57 pages
Understanding ML Model Evaluation Metrics
No ratings yet
Understanding ML Model Evaluation Metrics
2 pages
Evaluation 25 26
No ratings yet
Evaluation 25 26
73 pages
AI G10 N 05 EvaluatingModelling PDF
No ratings yet
AI G10 N 05 EvaluatingModelling PDF
7 pages
Model Evaluation in Machine Learning
No ratings yet
Model Evaluation in Machine Learning
6 pages
Evaluation 25-26-1
No ratings yet
Evaluation 25-26-1
70 pages
Evaluation
No ratings yet
Evaluation
7 pages
Machine Learning Model Evaluation Metrics
No ratings yet
Machine Learning Model Evaluation Metrics
29 pages
Unit 3
No ratings yet
Unit 3
5 pages
What Is Evaluating Models?: Why We Need Evaluation Model?
No ratings yet
What Is Evaluating Models?: Why We Need Evaluation Model?
11 pages
Evaluating Models Class 10 Notes
No ratings yet
Evaluating Models Class 10 Notes
14 pages
Classification Metrics Overview
No ratings yet
Classification Metrics Overview
43 pages
2005677evaluation - L1.Pdf - Evaluation - L1
No ratings yet
2005677evaluation - L1.Pdf - Evaluation - L1
31 pages
Model Evaluation and Accuracy Metrics
No ratings yet
Model Evaluation and Accuracy Metrics
119 pages
Understanding AI Model Evaluation
No ratings yet
Understanding AI Model Evaluation
9 pages
AI Model Evaluation Techniques
No ratings yet
AI Model Evaluation Techniques
10 pages
Intro to Model Evaluation Metrics
No ratings yet
Intro to Model Evaluation Metrics
24 pages
Understanding Model Evaluation Metrics
No ratings yet
Understanding Model Evaluation Metrics
50 pages
AI Model Evaluation Techniques Explained
No ratings yet
AI Model Evaluation Techniques Explained
20 pages
Model Evaluation Techniques in AI
No ratings yet
Model Evaluation Techniques in AI
74 pages
Machine Learning Evaluation Metrics Guide
No ratings yet
Machine Learning Evaluation Metrics Guide
43 pages
Model Evaluation in AI Development
No ratings yet
Model Evaluation in AI Development
54 pages
Machine Learning Model Evaluation Techniques
No ratings yet
Machine Learning Model Evaluation Techniques
32 pages
Evaluation Class X Artificial Intelligence Class 10
No ratings yet
Evaluation Class X Artificial Intelligence Class 10
5 pages
AI Model Evaluation and Metrics Guide
No ratings yet
AI Model Evaluation and Metrics Guide
6 pages
Machine Learning for Student Performance
No ratings yet
Machine Learning for Student Performance
12 pages
Confusion Matrix & Accuracy Explained
No ratings yet
Confusion Matrix & Accuracy Explained
4 pages
AI Model Evaluation Explained
No ratings yet
AI Model Evaluation Explained
25 pages
Understanding Natural Language Processing
No ratings yet
Understanding Natural Language Processing
26 pages
PGPR Isolation from Gujarat Lignite Mines
No ratings yet
PGPR Isolation from Gujarat Lignite Mines
10 pages
Numpy Basics: Installation and Usage Guide
No ratings yet
Numpy Basics: Installation and Usage Guide
8 pages
PGPR Isolation from Gujarat Lignite Mines
No ratings yet
PGPR Isolation from Gujarat Lignite Mines
8 pages
AI Project Cycle Overview and Steps
No ratings yet
AI Project Cycle Overview and Steps
12 pages
Ship Control System Project Report
No ratings yet
Ship Control System Project Report
8 pages
Linear Algebra in Healthcare Applications
No ratings yet
Linear Algebra in Healthcare Applications
20 pages
Multi-Floor Facility Layout Optimization
No ratings yet
Multi-Floor Facility Layout Optimization
24 pages
Job Application Tips and Strategies
No ratings yet
Job Application Tips and Strategies
3 pages
Flood Mitigation Review: Qoa Dhamota Dam
No ratings yet
Flood Mitigation Review: Qoa Dhamota Dam
8 pages
Computing Neural Network Gradients
No ratings yet
Computing Neural Network Gradients
67 pages
6.manutenção Periodica-GR3505T3III
No ratings yet
6.manutenção Periodica-GR3505T3III
17 pages
A Detailed Overview of Flash and Flash Management Techniques
No ratings yet
A Detailed Overview of Flash and Flash Management Techniques
18 pages
Compression Testing Machine Quotation
No ratings yet
Compression Testing Machine Quotation
1 page
Calypso Java Card Command Error
No ratings yet
Calypso Java Card Command Error
4 pages
Aseptic Risk Assessment Simplified
No ratings yet
Aseptic Risk Assessment Simplified
3 pages
CPU Lesson Plan for Students
No ratings yet
CPU Lesson Plan for Students
2 pages
C++ Midterm Exam: Vector & Set Classes
No ratings yet
C++ Midterm Exam: Vector & Set Classes
8 pages
Karnataka Short News Highlights
No ratings yet
Karnataka Short News Highlights
19 pages
Privacy Backdoors in Pretrained Models
No ratings yet
Privacy Backdoors in Pretrained Models
35 pages
LED Displays Classification Under Customs
No ratings yet
LED Displays Classification Under Customs
1 page
Otto Group: B2B Ecommerce Challenges
No ratings yet
Otto Group: B2B Ecommerce Challenges
8 pages
UE4 Prerequisite Setup for MX2021
No ratings yet
UE4 Prerequisite Setup for MX2021
5 pages
Music Notes and Rests Activity Sheet
No ratings yet
Music Notes and Rests Activity Sheet
11 pages
TMC 132
No ratings yet
TMC 132
61 pages
JavaScript Asynchronous vs Synchronous Concepts
No ratings yet
JavaScript Asynchronous vs Synchronous Concepts
10 pages
GTJZ0608E 0808E Parts Manual A
No ratings yet
GTJZ0608E 0808E Parts Manual A
111 pages
Transhuman Space Teralogos News - 2100, Fourth Quarter
100% (1)
Transhuman Space Teralogos News - 2100, Fourth Quarter
8 pages
GLOCK Gen 5 Flyer With Specs
No ratings yet
GLOCK Gen 5 Flyer With Specs
2 pages
Non-Destructive Testing Methods Guide
No ratings yet
Non-Destructive Testing Methods Guide
7 pages
Identifying Failures in Multimodal Systems
No ratings yet
Identifying Failures in Multimodal Systems
31 pages
Cross-Border Crypto Asset Arbitration Guide
No ratings yet
Cross-Border Crypto Asset Arbitration Guide
4 pages
Quantum Levitation and Superconductivity
No ratings yet
Quantum Levitation and Superconductivity
14 pages
ITP for Building Material Quality Control
No ratings yet
ITP for Building Material Quality Control
38 pages
Set-Linearizable Implementations with Multiplicity
No ratings yet
Set-Linearizable Implementations with Multiplicity
19 pages

Model Evaluation Techniques in Machine Learning

Uploaded by

Model Evaluation Techniques in Machine Learning

Uploaded by

Unit 3: Evaluating Models

• This process is essential to identify:

• It ensures the model generalizes well and is

❖ The train dataset is used to make the model learn

❖ The input elements of the test dataset are

❖ The objective is to estimate the performance of the

• Accuracy is an evaluation metric that allows you

• The accuracy of the model and performance of

Error can be described as an action that is inaccurate

In Machine Learning, the error is used to see how

Based on our error, we choose the machine learning

Popular metrics used for classification model

Confusion Matrix: Table to visualize performance.

1. True Positive: True Positive (TP) is the outcome of

2. True Negative : True Negative (TN) is the outcome

Precision is the ratio of the total number of

Precision = Correct positive predictions

The recall is the measure of our model correctly identifying

Recall = Correct positive predictions

• Recall is generally used for unbalanced dataset when

The case of predicting a good day based on

Missing out on predicting a good weather day is

• These concerns ensure responsible use of AI

F1-Score provides a way to combine both precisions

Used where the dataset is unbalanced, and we are

F1 Score = 2 x Precision x Recall

• Bias: Ensuring fairness in model outcomes

You might also like