Open navigation menu

Scribd

0% found this document useful (0 votes)

20 views20 pages

Model Evaluation Techniques in ML

The document presents a PowerPoint presentation on Machine Learning by Dr. P. Siva Prasad, focusing on model evaluation techniques, including metrics like accuracy, precision, recall, F1 score, and AUC-ROC. It discusses cross-validation methods, specifically K Fold and Holdout, and provides insights into regression evaluation metrics such as Mean Absolute Error (MAE) and Mean Squared Error (MSE). Additionally, the document includes practical examples using Python libraries for data handling and model training.

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views20 pages

Model Evaluation Techniques in ML

The document presents a PowerPoint presentation on Machine Learning by Dr. P. Siva Prasad, focusing on model evaluation techniques, including metrics like accuracy, precision, recall, F1 score, and AUC-ROC. It discusses cross-validation methods, specifically K Fold and Holdout, and provides insights into regression evaluation metrics such as Mean Absolute Error (MAE) and Mean Squared Error (MSE). Additionally, the document includes practical examples using Python libraries for data handling and model training.

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

DEPARTMENT OF

Computer Science & Engineering

(S.O.C & I)
PPT Presentation on
Machine Learning

Dr. P. SIVA PRASAD

Department of Computer Science & Engineering

School of Computing and Informatics

11/05/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 1

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

Machine Learning Model Evaluation

Machine Learning Model does not require hard-coded algorithms. We feed a large amount of data to the model
and the model tries to figure out the features on its own to make future predictions. So we must also use some
techniques to determine the predictive power of the model.
Machine Learning Model Evaluation
Model evaluation is the process that uses some metrics which help us to analyze the performance of the model.
As we all know that model development is a multi-step process and a check should be kept on how well the
model generalizes future predictions.
Therefore evaluating a model plays a vital role so that we can judge the performance of our model. The
evaluation also helps to analyze a model’s key weaknesses.
There are many metrics like
Accuracy, Precision,
Recall, F1 score,
Area under Curve, Confusion Matrix, and Mean Square Error(MSE),Mean Absolute
Error(MSE),R^2 value etc
Cross Validation is one technique that is followed during the training phase and it is a model evaluation
technique as well.
2

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 2

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)
Cross Validation and Holdout
Cross Validation is a method in which we do not use the whole dataset for training. In this technique, some part
of the dataset is reserved for testing the model.

There are many types of Cross-Validation out of which K Fold Cross Validation is mostly used. In K Fold Cross
Validation the original dataset is divided into k subsets. The subsets are known as folds. This is repeated k times
where 1 fold is used for testing purposes. Rest k-1 folds are used for training the model. So each data point acts as
a test subject for the model as well as acts as the training subject. It is seen that this technique generalizes the
model well and reduces the error rate.

Holdout is the simplest approach. It is used in neural networks as well as in many classifiers. In this technique,
the dataset is divided into train and test datasets. The dataset is usually divided into ratios like 70:30 or 80:20.
Normally a large percentage of data is used for training the model and a small portion of the dataset is used for
testing the model.

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 3

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

Evaluation Metrics for Classification Task

In this Python code, we have imported the iris dataset which has features like the length and width
of sepals and petals. The target values are Iris setosa, Iris virginica, and Iris versicolor. After
importing the dataset we divided the dataset into train and test datasets in the ratio 80:20. Then we
called Decision Trees and trained our model. After that, we performed the prediction and
calculated the accuracy score, precision, recall, and f1 score. We also plotted the confusion matrix.
Importing Libraries and Dataset
Python libraries make it very easy for us to handle the data and perform typical and complex tasks
with a single line of code.
Pandas – This library helps to load the data frame in a 2D array format and has multiple functions
to perform analysis tasks in one go.
Numpy – Numpy arrays are very fast and can perform large computations in a very short time.
Matplotlib/Seaborn – This library is used to draw visualizations.
Sklearn – This module contains multiple libraries having pre-implemented functions to perform
tasks from data preprocessing to model development and evaluation.

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 4

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 5

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

Now let’s load the toy dataset iris flowers from the sklearn. datasets library and then split it into
training and testing parts (for model evaluation) in the 80:20 ratio.

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 6

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)
Now, let’s train a Decision Tree Classifier model on the training data, and then we will move on to the
evaluation part of the model using different metrics.

Accuracy
Accuracy is defined as the ratio of the number of correct predictions to the total number of predictions. This is the
most fundamental metric used to evaluate the model. The formula is given by

However, Accuracy has a drawback. It cannot perform well on an imbalanced dataset. Suppose a model
classifies that the majority of the data belongs to the major class label. It yields higher accuracy. But in
general, the model cannot classify on minor class labels and has poor performance.

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 7

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)
Precision and Recall
Precision is the ratio of true positives to the summation of true positives and false positives. It
basically analyses the positive predictions.

The drawback of Precision is that it does not consider the True Negatives and False Negatives.
Recall is the ratio of true positives to the summation of true positives and false negatives. It basically
analyses the number of correct positive samples.

The drawback of Recall is that often it leads to a higher false positive rate.

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 8

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

F1 score
The F1 score is the harmonic mean of precision and recall. It is seen that during the precision-
recall trade-off if we increase the precision, recall decreases and vice versa. The goal of the F1
score is to combine precision and recall.

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 9

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

Confusion Matrix
A confusion matrix is an N x N matrix where N is the number of target classes. It represents the
number of actual outputs and the predicted outputs. Some terminologies in the matrix are as follows:
True Positives: It is also known as TP. It is the output in which the actual and the predicted values
are YES.
True Negatives: It is also known as TN. It is the output in which the actual and the predicted values
are NO.
False Positives: It is also known as FP. It is the output in which the actual value is NO but the
predicted value is YES.
False Negatives: It is also known as FN. It is the output in which the actual value is YES but the
predicted value is NO.

10

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 10

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

11

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 11

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)
AUC-ROC Curve
AUC (Area Under Curve) is an evaluation metric that is used to analyze the classification model at different
threshold values. The Receiver Operating Characteristic(ROC) curve is a probabilistic curve used to
highlight the model’s performance. The curve has two parameters:
TPR: It stands for True positive rate. It basically follows the formula of Recall.
FPR: It stands for False Positive rate. It is defined as the ratio of False positives to the summation of false
positives and True negatives.
This curve is useful as it helps us to determine the model’s capacity to distinguish between different classes.
Let us illustrate this with the help of a simple Python example

AUC score is a useful metric to evaluate the model. It basically highlights a model’s capacity to separate the classes. In the
above code, 0.75 is a good AUC score. A model is considered good if the AUC score is greater than 0.5 and approaches 1.
A poor model has an AUC score of 0. 12

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 12

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)
Evaluation Metrics for Regression Task
Regression is used to determine continuous values. It is mostly used to find a relation between a dependent
and an independent variable.
For classification, we use a confusion matrix, accuracy, f1 score, etc. But for regression analysis, since we
are predicting a numerical value it may differ from the actual output. So we consider the error calculation as
it helps to summarize how close the prediction is to the actual value. There are many metrics available for
evaluating the regression model.
In this Python Code, we have implemented a simple regression model using the Mumbai weather CSV file.
This file comprises Day, Hour, Temperature, Relative Humidity, Wind Speed, and Wind Direction. The link
for the dataset is here.
 We are basically interested in finding a relationship between Temperature and Relative Humidity. Here
Relative Humidity is the dependent variable and Temperature is the independent variable. We performed the
Linear Regression and used the metrics to evaluate the performance of our model. To calculate the metrics
we make extensive use of sklearn library.

13

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 13

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)
Now let’s load the data into the panda’s data frame and then split it into training and testing parts (for
model evaluation) in the 80:20 ratio.

Now, let’s train a simple linear regression model. On the training data and we will move to the
evaluation part of the model using different metrics.

14

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 14

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

Mean Absolute Error(MAE)

This is the simplest metric used to analyze the loss over the whole dataset. As we all know the error is
basically the difference between the predicted and actual values. Therefore MAE is defined as the
average of the errors calculated. Here we calculate the modulus of the error, perform the summation
and then divide the result by the number of data points. It is a positive quantity and is not concerned
about the direction. The formula of MAE is given by

15

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 15

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)
Mean Squared Error(MSE)
The most commonly used metric is Mean Square error or MSE. It is a function used to calculate the
loss. We find the difference between the predicted values and the truth variable, square the result and
then find the average over the whole dataset. MSE is always positive as we square the values. The
small the MSE better is the performance of our model. The formula of MSE is given:

16

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 16

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

Root Mean Squared Error(RMSE)

RMSE is a popular method and is the extended version of MSE(Mean Squared Error). This method is
basically used to evaluate the performance of our model. It indicates how much the data points are
spread around the best line. It is the standard deviation of the Mean squared error. A lower value means
that the data point lies closer to the best fit line.

17

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 17

DEPARTMENT OF
Computer Science & Engineering

(S.O.C & I)

Mean Absolute Percentage Error (MAPE)

MAPE is basically used to express the error in terms of percentage. It is defined as the difference
between the actual and predicted value. The error is then divided by the actual value. The results are
then summed up and finally, we calculate the average. Smaller the percentage better the performance
of the model. The formula is given by

18

11/02/2024 Dr. P. SIVA PRASAD,Associate Professor,[Link] CSE 18

DEPARTMENT OF
MATHAMATICS & STATISTICS

(S.A.S & H)
Questions, Suggestions and Comments

19

11/05/2024 [Link] Prasad 21

DEPARTMENT OF
MATHAMATICS & STATISTICS

(S.A.S & H)

20

11/05/2024 [Link] [Link] 22

You might also like

Machine Learning Model Evaluation Techniques
No ratings yet
Machine Learning Model Evaluation Techniques
11 pages
Machine Learning Model Evaluation Guide
No ratings yet
Machine Learning Model Evaluation Guide
12 pages
Machine Learning Evaluation Techniques
No ratings yet
Machine Learning Evaluation Techniques
121 pages
Machine Learning Model Evaluation Metrics
No ratings yet
Machine Learning Model Evaluation Metrics
17 pages
Machine Learning Evaluation Techniques
No ratings yet
Machine Learning Evaluation Techniques
19 pages
Machine Learning Model Evaluation Metrics
No ratings yet
Machine Learning Model Evaluation Metrics
37 pages
Unit 2 Machine Learning
No ratings yet
Unit 2 Machine Learning
46 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
24 pages
Machine Learning Model Evaluation Guide
No ratings yet
Machine Learning Model Evaluation Guide
31 pages
Deep Learning Model Evaluation Metrics
No ratings yet
Deep Learning Model Evaluation Metrics
21 pages
Machine Learning Model Training & Testing
No ratings yet
Machine Learning Model Training & Testing
23 pages
Machine Learning Model Evaluation Metrics
No ratings yet
Machine Learning Model Evaluation Metrics
36 pages
Machine Learning Evaluation Metrics Guide
No ratings yet
Machine Learning Evaluation Metrics Guide
43 pages
Machine Learning Model Evaluation Metrics
No ratings yet
Machine Learning Model Evaluation Metrics
40 pages
Model Evaluation and Performance Metrics
No ratings yet
Model Evaluation and Performance Metrics
16 pages
Machine Learning Overview: Python vs C++
No ratings yet
Machine Learning Overview: Python vs C++
24 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
30 pages
DL 1
No ratings yet
DL 1
14 pages
ML Model Evaluation Metrics Guide
No ratings yet
ML Model Evaluation Metrics Guide
33 pages
Evaluation Metrics in Machine Learning
No ratings yet
Evaluation Metrics in Machine Learning
6 pages
Machine Learning Evaluation Metrics Guide
No ratings yet
Machine Learning Evaluation Metrics Guide
17 pages
Intro to Model Evaluation Metrics
No ratings yet
Intro to Model Evaluation Metrics
24 pages
Machine Learning Classifier Evaluation Guide
No ratings yet
Machine Learning Classifier Evaluation Guide
61 pages
Module 6 - Evaluation Metrics
No ratings yet
Module 6 - Evaluation Metrics
23 pages
Machine Learning Algorithm Evaluation Guide
No ratings yet
Machine Learning Algorithm Evaluation Guide
26 pages
Machine Learning Basics and Evaluation
No ratings yet
Machine Learning Basics and Evaluation
48 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
24 pages
Model Evaluation Metrics in Machine Learning
No ratings yet
Model Evaluation Metrics in Machine Learning
49 pages
Deep Learning Model Evaluation Metrics
No ratings yet
Deep Learning Model Evaluation Metrics
11 pages
Module 5
No ratings yet
Module 5
10 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
43 pages
Unit 4
No ratings yet
Unit 4
15 pages
Model Evaluation Metrics in Machine Learning
No ratings yet
Model Evaluation Metrics in Machine Learning
27 pages
Performance Metrics in Deep Learning
100% (1)
Performance Metrics in Deep Learning
36 pages
Model Validation and Interpretability Techniques
No ratings yet
Model Validation and Interpretability Techniques
33 pages
Classification, Evaluation Metrics & Cross Validation
No ratings yet
Classification, Evaluation Metrics & Cross Validation
56 pages
Evaluating Machine Learning Models
No ratings yet
Evaluating Machine Learning Models
46 pages
Elements of Machine Learning Models LJ
No ratings yet
Elements of Machine Learning Models LJ
42 pages
Machine Learning: Performance Evaluation: CSC 640: Advanced Software Engineering
No ratings yet
Machine Learning: Performance Evaluation: CSC 640: Advanced Software Engineering
27 pages
Model Evaluation and Performance Metrics
No ratings yet
Model Evaluation and Performance Metrics
15 pages
K-Fold Cross Validation in Python
No ratings yet
K-Fold Cross Validation in Python
11 pages
Evaluating Machine Learning Performance
No ratings yet
Evaluating Machine Learning Performance
42 pages
Machine Learning Model Evaluation Guide
No ratings yet
Machine Learning Model Evaluation Guide
22 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
19 pages
Hyperparameter Tuning and Overfitting
No ratings yet
Hyperparameter Tuning and Overfitting
17 pages
Machine Learning Classification Overview
No ratings yet
Machine Learning Classification Overview
20 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
50 pages
Unit IIITYCSDS
No ratings yet
Unit IIITYCSDS
52 pages
Unit III ML Model Deployment
No ratings yet
Unit III ML Model Deployment
31 pages
Understanding Classification in ML
No ratings yet
Understanding Classification in ML
38 pages
Unit 4
No ratings yet
Unit 4
4 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
24 pages
DL 2 Unit 3
No ratings yet
DL 2 Unit 3
22 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
40 pages
Model Evaluation Biological Insights Week 4
No ratings yet
Model Evaluation Biological Insights Week 4
20 pages
Machine Learning Classification Overview
No ratings yet
Machine Learning Classification Overview
214 pages
Key Performance Metrics in ML
No ratings yet
Key Performance Metrics in ML
12 pages
Cross Validation Techniques in ML
No ratings yet
Cross Validation Techniques in ML
52 pages
3.2 Class - Dm..
No ratings yet
3.2 Class - Dm..
37 pages
Lexical Analyzer in Compiler Design
No ratings yet
Lexical Analyzer in Compiler Design
34 pages
Overview of the LEX Tool in Compiler Design
No ratings yet
Overview of the LEX Tool in Compiler Design
17 pages
Token Specification in Compiler Design
No ratings yet
Token Specification in Compiler Design
20 pages
Hypothesis Testing in Machine Learning
No ratings yet
Hypothesis Testing in Machine Learning
37 pages
Machine Learning Model Selection Techniques
No ratings yet
Machine Learning Model Selection Techniques
24 pages
OS Record Management and Scripts
No ratings yet
OS Record Management and Scripts
50 pages
BBA First Year Course Materials PDF
No ratings yet
BBA First Year Course Materials PDF
30 pages
MTP2955V Power MOSFET 12 Amps, 60 Volts: P-Channel TO-220
No ratings yet
MTP2955V Power MOSFET 12 Amps, 60 Volts: P-Channel TO-220
8 pages
B.Sc. Mathematics Algebra Exam Paper
No ratings yet
B.Sc. Mathematics Algebra Exam Paper
4 pages
Airport Hangar Truss Design Analysis
No ratings yet
Airport Hangar Truss Design Analysis
22 pages
Class 6 Basic Geometry Concepts
No ratings yet
Class 6 Basic Geometry Concepts
63 pages
Understanding Rational Functions and Asymptotes
No ratings yet
Understanding Rational Functions and Asymptotes
39 pages
Cryptography Standards in Quantum Time:: New Wine in An Old Wineskin?
No ratings yet
Cryptography Standards in Quantum Time:: New Wine in An Old Wineskin?
7 pages
Creeping Flow in Periodic Channels
No ratings yet
Creeping Flow in Periodic Channels
20 pages
Integral Calculus
No ratings yet
Integral Calculus
174 pages
Understanding Algebraic Structures
No ratings yet
Understanding Algebraic Structures
8 pages
Arcminute Definition and Applications
No ratings yet
Arcminute Definition and Applications
2 pages
JSU Matematik Tingkatan 1 & 4A
No ratings yet
JSU Matematik Tingkatan 1 & 4A
9 pages
Search Algorithms and Machine Learning Models
No ratings yet
Search Algorithms and Machine Learning Models
4 pages
UAV Trajectory Planning with Reinforcement Learning
No ratings yet
UAV Trajectory Planning with Reinforcement Learning
5 pages
Hyperspectral Segmentation of Retina
No ratings yet
Hyperspectral Segmentation of Retina
5 pages
Image Processing in Delphi
100% (1)
Image Processing in Delphi
42 pages
Isometric Drawing Tool
No ratings yet
Isometric Drawing Tool
1 page
Analysis of Whitworth Quick Return Mechanism
No ratings yet
Analysis of Whitworth Quick Return Mechanism
8 pages
Spring 2014 D.I.Y. Weekly Newsletter
No ratings yet
Spring 2014 D.I.Y. Weekly Newsletter
6 pages
M.Tech in AI & ML at BITS Pilani
No ratings yet
M.Tech in AI & ML at BITS Pilani
19 pages
JEE Mains Syllabus and Analysis 2014-2018
100% (1)
JEE Mains Syllabus and Analysis 2014-2018
13 pages
Acid-Base Titration Experiment Guide
No ratings yet
Acid-Base Titration Experiment Guide
6 pages
Understanding Circle Properties and Angles
No ratings yet
Understanding Circle Properties and Angles
11 pages
Evolution of Structural Engineering History
No ratings yet
Evolution of Structural Engineering History
11 pages
Ultrasonic Vocalizations in Mice Research
No ratings yet
Ultrasonic Vocalizations in Mice Research
24 pages
Common Catenary and Equilibrium Problems
No ratings yet
Common Catenary and Equilibrium Problems
4 pages
Importance of Mathematical Models in Chemistry
No ratings yet
Importance of Mathematical Models in Chemistry
8 pages
Understanding the Ranque-Hilsch Vortex Tube
No ratings yet
Understanding the Ranque-Hilsch Vortex Tube
10 pages
GATE 2011 Metallurgy Study Resources
No ratings yet
GATE 2011 Metallurgy Study Resources
9 pages
Introduction To Complex Analysis Priestley Solutions
No ratings yet
Introduction To Complex Analysis Priestley Solutions
16 pages