Evaluating AI Models: Key Metrics and Risks

The document discusses the importance of evaluating AI models before deployment, highlighting potential consequences of not doing so, such as incorrect predictions and financial losses. It explains key concepts like train-test split, classification accuracy, and the significance of understanding error and accuracy for model improvement. Additionally, it provides case studies for applying evaluation metrics like precision, recall, and confusion matrices.

Uploaded by

arushmittal4u

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views3 pages

Evaluating AI Models: Key Metrics and Risks

Uploaded by

arushmittal4u

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Class X

Subject: AI
Topic: Evaluation
Worksheet with Solution

Q1. What will happen if you deploy an AI model without evaluating it with known test set
data?
Test sets simulate real-world scenarios in a controlled way. If we deploy AI model without evaluating then
the following may be the consequences:
1. We won’t know how well the model performs on unseen data.
2. This can lead to incorrect predictions, low reliability, and poor user experience.
3. Without testing, biases in training data may go unnoticed.
4. Businesses may face financial loss or reputational damage.

Q2. Do you think evaluating an AI model is that essential in an AI project cycle?

Yes, in essence, model evaluation is like giving your AI model a report card. It helps you understand its
strengths, weaknesses, and suitability for the task at hand. This feedback loop is essential for building
trustworthy and reliable AI systems.

Q3. Explain train-test split with an example.

-The train-test split is a technique for evaluating the performance of a machine learning algorithm
-It can be used for any supervised learning algorithm
- The procedure involves taking a dataset and dividing it into two subsets: The training dataset and the
testing dataset
-The train-test procedure is appropriate when there is a sufficiently large dataset available
Example:

Q4. “Understanding both error and accuracy is crucial for effectively evaluating and
improving AI models.” Justify this statement.
The statement “Understanding both error and accuracy is crucial for effectively evaluating and improving
AI models” is absolutely justified because error and accuracy are two sides of the same coin—they provide
a balanced and complete view of model performance. Accuracy is an evaluation metric that allows you to
measure the total number of predictions a model gets right. Error refers to the difference between a
model's prediction and the actual outcome. It quantifies how often the model makes mistakes. The goal is
to minimize error and maximize accuracy.
Q5. What is classification accuracy? Can it be used all times for evaluating AI models?
Classification accuracy is the number of correct predictions made as a ratio of all predictions made.

In case of imbalanced datasets (e.g., 95% "no disease", 5% "disease"), a model can achieve 95% accuracy
by always predicting “no disease” but fails the 5% who need diagnosis. Hence Accuracy cannot be used
always.

Case study-based questions:

Q1. Identify which metric (Precision or Recall) is to be used in the following cases and
why?
a) Email Spam Detection
b) Cancer Diagnosis
c) Legal Cases (Innocent until proven guilty)
d) Fraud Detection
e) Safe Content Filtering (like Kids YouTube)

False Positive is more costly, and hence Precision False Negative is more costly hence Recall metric
metric will be used will be used
a) Email Spam Detection b) Cancer Diagnosis
c) Legal Cases (Innocent until proven guilty) d) Fraud Detection

e) Safe Content Filtering (like Kids YouTube)

Q2. Examine the following case studies. Draw the confusion matrix and calculate metrics
such as accuracy, precision, recall, and F1-score for each one of them.
a. Case Study 1: A spam email detection system is used to classify emails as either spam
(1) or not spam (0). Out of 1000 emails: -
-150 emails were correctly classified as spam.
- 50 emails were incorrectly classified as spam.
- 750 emails were correctly classified as not spam.
- 50 emails were incorrectly classified as not spam
Confusion Matrix
Reality/Prediction→ Yes No

Yes 150 (TP) 50 (FN)

No 50 (FP) 750 (TN)
(TP+TN)
Classification Accuracy%= 𝑇𝑃+𝑇𝑁+𝐹𝑃+𝐹𝑁 𝑥100= (150+750)x100/1000=90%
(TP)
Precision=𝑇𝑃+𝐹𝑃= 150/(150+50)=0.75
(TP)
Recall=𝑇𝑃+𝐹𝑁=150/(150+50)=0.75
PrecisionxRecall
F1 Score=𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛+𝑅𝑒𝑐𝑎𝑙𝑙 𝑥2= 2x(0.75x0.75)/(0.75+0.75)=1.125/1.5=0.75

Class 10 AI: Evaluating Models Guide
No ratings yet
Class 10 AI: Evaluating Models Guide
9 pages
AI Project Cycle Overview for Class 10
No ratings yet
AI Project Cycle Overview for Class 10
7 pages
AI Learning Types and Applications Worksheet
No ratings yet
AI Learning Types and Applications Worksheet
3 pages
AI Project Cycle & Ethical Frameworks
100% (1)
AI Project Cycle & Ethical Frameworks
42 pages
Nationalism in India: Class 10 Notes
No ratings yet
Nationalism in India: Class 10 Notes
4 pages
Understanding India's Federalism: Class 10 Notes
No ratings yet
Understanding India's Federalism: Class 10 Notes
3 pages
Advanced AI Modeling Concepts for Class 10
67% (3)
Advanced AI Modeling Concepts for Class 10
4 pages
Resources and Development Overview
No ratings yet
Resources and Development Overview
4 pages
Nationalism in India: Class 10 Notes
No ratings yet
Nationalism in India: Class 10 Notes
5 pages
Python Programs for AI Practical Class 10
No ratings yet
Python Programs for AI Practical Class 10
3 pages
Probability Distribution in Data Science
No ratings yet
Probability Distribution in Data Science
22 pages
Analytical Paragraph Examples for Class 10
No ratings yet
Analytical Paragraph Examples for Class 10
4 pages
Geography All Chapter Notes Class 10
No ratings yet
Geography All Chapter Notes Class 10
23 pages
Causes of Water Scarcity in India
No ratings yet
Causes of Water Scarcity in India
12 pages
Class 10 IT Sample Paper Solutions
No ratings yet
Class 10 IT Sample Paper Solutions
6 pages
Money and Credit in Economic Development
100% (1)
Money and Credit in Economic Development
2 pages
AI Pre-Board Exam Paper - Class X
No ratings yet
AI Pre-Board Exam Paper - Class X
9 pages
Class 10 AI Introduction Notes
No ratings yet
Class 10 AI Introduction Notes
7 pages
Class 10 Advanced AI Modeling Concepts
No ratings yet
Class 10 Advanced AI Modeling Concepts
17 pages
Development Goals Beyond Income
No ratings yet
Development Goals Beyond Income
10 pages
Water Scarcity: Causes and Solutions
No ratings yet
Water Scarcity: Causes and Solutions
4 pages
Power Sharing: Concepts and Importance
No ratings yet
Power Sharing: Concepts and Importance
11 pages
Understanding Development Goals
0% (1)
Understanding Development Goals
10 pages
Understanding Consumer Rights
No ratings yet
Understanding Consumer Rights
20 pages
Class 10 Water Resources Overview
100% (1)
Class 10 Water Resources Overview
12 pages
Class 10 AI Model Evaluation Notes
No ratings yet
Class 10 AI Model Evaluation Notes
8 pages
Types of Farming in India: Class 10 Notes
No ratings yet
Types of Farming in India: Class 10 Notes
10 pages
Class 10 AI: Key Concepts and Notes
No ratings yet
Class 10 AI: Key Concepts and Notes
8 pages
Class 10 Economics: Indian Economy Sectors
100% (1)
Class 10 Economics: Indian Economy Sectors
5 pages
Class X Ratio Analysis Overview
No ratings yet
Class X Ratio Analysis Overview
15 pages
Nationalism in India: Class X History Notes
No ratings yet
Nationalism in India: Class X History Notes
9 pages
Class 10 Geography Practice Paper: Agriculture
No ratings yet
Class 10 Geography Practice Paper: Agriculture
4 pages
Sectors of the Indian Economy Explained
No ratings yet
Sectors of the Indian Economy Explained
24 pages
Class 10 Geography: Water Resources Notes
No ratings yet
Class 10 Geography: Water Resources Notes
25 pages
Class 9 Social Science Curriculum 2025-26
No ratings yet
Class 9 Social Science Curriculum 2025-26
55 pages
Evolution of Print in China and Beyond
No ratings yet
Evolution of Print in China and Beyond
7 pages
Outcomes and Merits of Democracy
100% (1)
Outcomes and Merits of Democracy
23 pages
Indian Economy Sectors Overview
100% (2)
Indian Economy Sectors Overview
8 pages
Class 10 Geography: Resources & Development
No ratings yet
Class 10 Geography: Resources & Development
13 pages
Maths Lab Activity (Class - 10)
No ratings yet
Maths Lab Activity (Class - 10)
7 pages
Class 10 AI Facilitator Handbook
No ratings yet
Class 10 AI Facilitator Handbook
60 pages
AI Model Evaluation Metrics Quiz
No ratings yet
AI Model Evaluation Metrics Quiz
3 pages
Class 10 Economics: Development Notes
No ratings yet
Class 10 Economics: Development Notes
2 pages
Intro to Economics: Key Concepts
No ratings yet
Intro to Economics: Key Concepts
2 pages
Flora and Fauna Conservation in India
No ratings yet
Flora and Fauna Conservation in India
4 pages
Sectors of Indian Economy: Class 10 Notes
0% (1)
Sectors of Indian Economy: Class 10 Notes
94 pages
Class 10 Political Parties Overview
No ratings yet
Class 10 Political Parties Overview
32 pages
Class 10 Consumer Rights Overview
No ratings yet
Class 10 Consumer Rights Overview
6 pages
Key Features of Federalism Explained
No ratings yet
Key Features of Federalism Explained
6 pages
Summary of "His First Flight"
100% (1)
Summary of "His First Flight"
38 pages
Accounting: Meaning, Objectives & Scope
No ratings yet
Accounting: Meaning, Objectives & Scope
36 pages
Class 10 Notes on Indian Economy Sectors
No ratings yet
Class 10 Notes on Indian Economy Sectors
49 pages
Resource Management and Conservation Issues
100% (1)
Resource Management and Conservation Issues
2 pages
Advanced AI Modeling Concepts for Class 10
No ratings yet
Advanced AI Modeling Concepts for Class 10
50 pages
AI Evaluating Models
No ratings yet
AI Evaluating Models
8 pages
Model Evaluation Metrics and Examples
No ratings yet
Model Evaluation Metrics and Examples
5 pages
AI Model Evaluation Techniques Explained
No ratings yet
AI Model Evaluation Techniques Explained
7 pages
AI Model Evaluation Techniques Explained
No ratings yet
AI Model Evaluation Techniques Explained
5 pages
Confusion Matrix Evaluation Metrics
No ratings yet
Confusion Matrix Evaluation Metrics
7 pages
Focal Length Measurement of Mirrors & Lenses
No ratings yet
Focal Length Measurement of Mirrors & Lenses
5 pages
Auditions for Arush Mittal's Theatre
No ratings yet
Auditions for Arush Mittal's Theatre
1 page
Nationalism in 19th Century Europe
No ratings yet
Nationalism in 19th Century Europe
2 pages
Print P00137648 Invoice
No ratings yet
Print P00137648 Invoice
1 page
Ws 13 Trigonometry
No ratings yet
Ws 13 Trigonometry
2 pages
POE and Resident Satisfaction in Kanhapur
No ratings yet
POE and Resident Satisfaction in Kanhapur
15 pages
Z Test
No ratings yet
Z Test
26 pages
Senior Research Project Guidelines
100% (1)
Senior Research Project Guidelines
72 pages
Service Innovation's Impact on Firm Performance
No ratings yet
Service Innovation's Impact on Firm Performance
26 pages
URICA Change Assessment Scale
No ratings yet
URICA Change Assessment Scale
2 pages
Introduction to Research Fundamentals
No ratings yet
Introduction to Research Fundamentals
7 pages
Media Planning Functions and Challenges
No ratings yet
Media Planning Functions and Challenges
104 pages
Research Problem and Methodology Overview
No ratings yet
Research Problem and Methodology Overview
28 pages
Atterberg Limits of Soil Analysis
No ratings yet
Atterberg Limits of Soil Analysis
6 pages
Civic Knowledge and Engagement Study
No ratings yet
Civic Knowledge and Engagement Study
12 pages
Short Stories Boost Vocabulary Retention
No ratings yet
Short Stories Boost Vocabulary Retention
1 page
Frequency Analysis of Health Variables
No ratings yet
Frequency Analysis of Health Variables
12 pages
Evaluating Cognizant Flowsource Feasibility
No ratings yet
Evaluating Cognizant Flowsource Feasibility
9 pages
Project Scope and Recommendations Document
100% (2)
Project Scope and Recommendations Document
12 pages
Quality Assurance and Control Standards
No ratings yet
Quality Assurance and Control Standards
10 pages
Evaluating Scholarship Impact on Student Success
No ratings yet
Evaluating Scholarship Impact on Student Success
8 pages
Library Service Quality and User Satisfaction
No ratings yet
Library Service Quality and User Satisfaction
9 pages
IFRA Standard for 4-Phenyl-3-buten-2-ol
No ratings yet
IFRA Standard for 4-Phenyl-3-buten-2-ol
3 pages
Youth Alcoholism and Advertising Impact
No ratings yet
Youth Alcoholism and Advertising Impact
3 pages
Literature Review on Taxi Service Quality
No ratings yet
Literature Review on Taxi Service Quality
42 pages
Lotte India Products Overview
No ratings yet
Lotte India Products Overview
91 pages
Effective Remedial Worksheets for Math
No ratings yet
Effective Remedial Worksheets for Math
17 pages
Data Science: Understanding Data Types and Distribution
No ratings yet
Data Science: Understanding Data Types and Distribution
46 pages
Qualitative Research Methodology Overview
No ratings yet
Qualitative Research Methodology Overview
4 pages
Sales Forecasting Methods Overview
No ratings yet
Sales Forecasting Methods Overview
34 pages
Bma3203 Survival Analysis Reg Main
No ratings yet
Bma3203 Survival Analysis Reg Main
3 pages
Questionnaire Design and Ethics Guide
No ratings yet
Questionnaire Design and Ethics Guide
36 pages
Lips Hitz 2001
No ratings yet
Lips Hitz 2001
22 pages
Pictorial Techniques in Projective Research
No ratings yet
Pictorial Techniques in Projective Research
4 pages

Evaluating AI Models: Key Metrics and Risks

Uploaded by

Evaluating AI Models: Key Metrics and Risks

Uploaded by

Class X

Q2. Do you think evaluating an AI model is that essential in an AI project cycle?

Q3. Explain train-test split with an example.

Case study-based questions:

e) Safe Content Filtering (like Kids YouTube)

Yes 150 (TP) 50 (FN)

You might also like