X
([Link] ([Link]
[Link]@[Link]
NPTEL ([Link] » Python for Data Science (course)
Click to register for
Certification exam
Week 4 : Assignment 4
([Link]
If already The due date for submitting this assignment has passed.
registered, click to Due on 2026-02-18, 23:59 IST.
check your
payment status Assignment submitted on 2026-02-09, 11:57 IST
1) Which of the following are regression problems? Assume that appropriate data is given. 1 point
Predicting the house price.
Course outline
Predicting whether it will rain or not on a given day.
About NPTEL () Predicting the maximum temperature on a given day.
Predicting the sales of the ice-creams.
How does an Yes, the answer is correct.
NPTEL online Score: 1
course work? () Accepted Answers:
Predicting the house price.
Week 0 () Predicting the maximum temperature on a given day.
Predicting the sales of the ice-creams.
Week 1 ()
2) Which of the following are multiclass classification problems? 1 point
Week 2 ()
Classifying emails as spam or not spam.
Week 3 () Classifying a person’s blood type as A, B, AB, or O.
Predicting the price of a second-hand car.
Week 4 ()
Classifying a movie genre into Drama, Comedy, Action, or Thriller.
Introduction to Yes, the answer is correct.
Classification Score: 1
Case Study (unit? Accepted Answers:
unit=56&lesson=5 Classifying a person’s blood type as A, B, AB, or O.
7) Classifying a movie genre into Drama, Comedy, Action, or Thriller.
Case Study on
Classification Part
I (unit? 3) If a linear regression model achieves zero training error, can we say that all the data points lie on 1 point
unit=56&lesson=5 a straight line in the feature space?
8)
Yes
Case Study on
Classification Part No
II (unit?
Yes, the answer is correct.
unit=56&lesson=5 Score: 1
9) Accepted Answers:
Yes
Introduction to
Regression Case
Study (unit?
Read the information given below and answer the questions from 4 to 6:
unit=56&lesson=6
0)
Data Description:
Case Study on
Regression Part I An automotive service chain is launching its new grand service station this weekend. They offer to service a
(unit? wide variety of cars. The current capacity of the station is to check 315 cars thoroughly per day. As an
unit=56&lesson=6
inaugural offer, they claim to freely check all cars that arrive on their launch day, and report whether they need
1)
servicing or not! Unexpectedly, they get 450 cars. The servicemen will not work longer than the working
Case Study on hours, but the data analysts have to! Can you save the day for the new service station? How can a data
Regression Part II scientist save the day for them? He has been given a data set, ‘[Link]
(unit? ([Link] that contains
unit=56&lesson=6
some attributes of the car that can be easily measured and a conclusion that if a service is needed or not.
2)
Now for the cars they cannot check in detail, they measure those attributes and store them in ‘[Link]
Case Study on ([Link]
Regression Part
III (unit? Problem Statement:
unit=56&lesson=6
3)
Use machine learning techniques to identify whether the cars require service or not.
Data sets (unit?
unit=56&lesson=6 Read the given datasets ‘[Link]
4) ([Link] and
‘[Link] ([Link]
Case Study
usp=sharing)’ as train data and test data respectively and import all the required packages for
codes (unit?
unit=56&lesson=6 analysis.
5)
4) Which of the following machine learning techniques would NOT be appropriate to solve the 1 point
Week 4 Feedback problem given in the problem statement?
Form : Python for
Data Science!! kNN
(unit? Random Forest
unit=56&lesson=1
Logistic Regression
16)
Linear regression
Quiz: Week 4 :
Assignment 4 Yes, the answer is correct.
Score: 1
(assessment?
name=187)
Accepted Answers:
Linear regression
Practice: Week 4 :
Practice
Assignment 4 Prepare the data by following the steps given below, and answer questions 5 and 6.
(assessment? • Encode categorical variable, Service - Yes as 1 and No as 0 for both the train and test datasets.
name=188) • Split the set of independent features and the dependent feature on both the train and test datasets.
• Set random_state for the instance of the logistic regression class as 0.
5) After applying logistic regression, what is/are the correct observations from the resultant 1 point
Supporting
confusion matrix?
material for
Week 4 () True Positive = 29, True Negative = 94
True Positive = 94, True Negative = 29
Download
Videos () False Positive = 5, True Negative = 94
None of the above
Books () Yes, the answer is correct.
Score: 1
Text Accepted Answers:
Transcripts () True Positive = 29, True Negative = 94
False Positive = 5, True Negative = 94
Problem
Solving 6) The logistic regression model built between the input and output variables is checked for its 1 point
Session - Jan prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test
2026 () data?
60 - 79
90 - 95
30 – 59
80 – 89
Yes, the answer is correct.
Score: 1
Accepted Answers:
90 - 95
7) How are categorical variables preprocessed before model building? 1 point
Standardization
Dummy variables
Correlation
None of the above
Yes, the answer is correct.
Score: 1
Accepted Answers:
Dummy variables
8) A regression model with the function y = 80 + 4.5x was built to understand the impact of 1 point
temperature x on ice cream sales y. The temperature this month is 10 degrees more than the previous
month. What is the predicted difference in ice cream sales?
56 units
45 units
80 units
None of the above
Yes, the answer is correct.
Score: 1
Accepted Answers:
45 units
9) X and Y are two variables that have a strong linear relationship. Which of the following 1 point
statements are incorrect?
There cannot be a negative relationship between the two variables.
The relationship between the two variables is purely causal.
One variable may or may not cause a change in the other variable.
The variables can be positively or negatively correlated with each other.
Yes, the answer is correct.
Score: 1
Accepted Answers:
There cannot be a negative relationship between the two variables.
The relationship between the two variables is purely causal.
The Global Happiness Index report contains the Happiness Score data with multiple features (namely
the Economy, Family, Health, and Freedom) that could affect the target variable value.
Prepare the data by following the steps given below, and answer question 10.
• Split the set of independent features and the dependent feature on the given dataset
• Create training and testing data from the set of independent features and dependent feature by splitting
the original data in the ratio 3:1 respectively, and set the value for random_state of the training/test split
method’s instance as 1
10) A multiple linear regression model is built on the Global Happiness Index dataset ‘GHI 1 point
[Link] ([Link]
What is the RMSE of the baseline model?
2.00
0.50
1.06
0.75
Yes, the answer is correct.
Score: 1
Accepted Answers:
1.06