0% found this document useful (0 votes)

10 views7 pages

InterView Questions

The document provides a comprehensive overview of machine learning and deep learning concepts, including model evaluation metrics, handling overfitting, and the differences between various algorithms. It covers topics such as activation functions, regression techniques, and the importance of feature transformation. Additionally, it discusses the use of different models and methods for classification, clustering, and dimensionality reduction.

Uploaded by

marwanabbas418

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views7 pages

InterView Questions

Uploaded by

marwanabbas418

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Model Answers

Machine Learning
Q1
By examining the learned weights, we can gain insight into how the model represents the
data. In linear models, each weight reflects the importance of its corresponding feature:
ŷ = wT x + b

Q2
The normal equation provides a direct, closed-form solution for the optimal weights in linear
regression:
w = (X T X)−1 X T y

Q3
A poorly chosen model generally results in large prediction errors and fails to predict the
target accurately.

Q4
R2 measures how much variance in the data is explained by the model:
(y − ŷ)2
P
2
R =1− P
(y − ȳ)2
Mean Squared Error (MSE) measures the average squared prediction error:
1X
MSE = (y − ŷ)2
n

Q5
Non-linear relationships can be handled by feature transformation. For example:
z = x2
The model becomes linear in z:
y = wz + b

1
Q6
Overfitting occurs when the model fits training data very well but fails on unseen data.
Underfitting happens when the model performs poorly on both training and test sets.

Q7
Overfitting can be reduced using regularization:
X
L2 (Ridge): λ w2

or by increasing training data.

Q8
Lasso regression (L1 ) drives some weights exactly to zero:
X
λ |w|

Ridge regression (L2 ) shrinks weights without eliminating them:

X
λ w2

Q9
Mutual Information measures dependency between feature X and target Y :
X p(x, y)
I(X; Y ) = p(x, y) log
p(x)p(y)

Q10
The sigmoid activation function maps values to [0, 1]:
1
σ(z) =
1 + e−z

Q11
The Receiver Operating Characteristic (ROC) curve is a classification evaluation tool that
illustrates the trade-off between the True Positive Rate (TPR) and the False Positive Rate
(FPR) at different decision thresholds. It helps assess how well a model distinguishes between
classes regardless of class imbalance.
The True Positive Rate (also known as Recall) is defined as:

TP
TPR =
TP + FN

2
The False Positive Rate is defined as:
FP
FPR =
FP + TN
The Area Under the ROC Curve (AUC) provides a single scalar value that summarizes
the model’s performance. An AUC of 1 indicates perfect classification, while an AUC of 0.5
corresponds to random guessing.

Q12
Dimensionality reduction can be achieved using PCA:

Xreduced = XW

Q13
Multiclass classification can be solved using one-vs-rest binary logistic classifiers.

Q14
The elbow method or bias–variance analysis can be used to select k in KNN.

Q15
Hard margin enforces a perfect decision boundary, which can lead to overfitting and higher
sensitivity to noise, while soft margin allows a small margin for misclassification.

Q16
The bias–variance trade-off describes the effect of model complexity: reducing complexity
lowers overfitting but increases bias, leading to underfitting, while more complex models
reduce bias but increase variance.

Q17
Random Forest improves performance by aggregating multiple decision trees:
T
1X
ŷ = ŷt
T t=1

Q18
For imbalanced data, suitable metrics include Precision, Recall, F1-score, and AUC.

3
Q19
Boosting trains models sequentially, where each new model focuses more on the samples
with higher errors from previous models, improving performance step by step. It is generally
faster and more targeted than stacking and bagging. Bagging trains multiple models (usually
decision trees) on different bootstrap samples of the data (with replacement) and combines
their predictions by averaging or voting. Stacking combines different types of models by
training a meta-model on their outputs to make the final prediction

Q20
In bagging, each model is trained on a bootstrap sample drawn with replacement.

Q21
Silhouette score measures how close a point is to other points in its own cluster compared
to points in other clusters. A higher value indicates better clustering. Davies–Bouldin
index measures how similar clusters are to each other. Lower values indicate better cluster
separation.

Q22
DBSCAN works by grouping points that are close to each other based on a distance threshold
and a minimum number of points; it forms dense regions as clusters and labels sparse points
as noise.

Q23
A high learning rate usually causes poor validation performance because the model over-
shoots the optimal weights, leading to instability and failure to generalize.

Q24
With Stochastic Gradient Descent, the training accuracy per epoch will fluctuate and look
noisy instead of smoothly increasing because updates are based on individual samples or
small batches.

Q25
Dimensionality reduction can be achieved using PCA.

4
Q26
MSE is very sensitive to outliers because it squares the errors, so a few large errors can
dominate the loss and distort training:
1X
MSE = (y − ŷ)2
n
MAE is less sensitive to outliers but has a constant gradient, which makes optimization
slower and less stable, especially near the minimum:
1X
MAE = |y − ŷ|
n
We can solve these problems by using loss functions like Huber Loss, which behaves like
MSE for small errors and like MAE for large errors, or by handling outliers through data
preprocessing.

Q27
An AUC of 0.5 indicates random guessing.

Q28
In d dimensions, the decision boundary has dimension d − 1.

Q29
First, perform EDA to understand the data, identify useful features, check linearity or non-
linearity, determine preprocessing needs, and detect class imbalance. Then, evaluate the
models selected from this step using appropriate metrics or ensemble methods like voting to
choose the best one.

Deep Learning
Q1
The sigmoid function causes the vanishing gradient problem because its gradients become
very small for large positive or negative inputs, which makes learning very slow in deep
networks. It is also not zero-centered, so gradient updates can be inefficient, and it saturates
easily, causing neurons to stop learning.

Q2
The main problem of ReLU is the “dying ReLU” issue, where neurons output zero for all
inputs if they receive large negative values, causing their gradients to become zero and
stopping learning. ReLU can also be sensitive to large learning rates, which may push many

5
neurons into this inactive state. This can be solved by using variants like Leaky ReLU or
Parametric ReLU, which allow a small negative slope, or by using proper weight initialization
and smaller learning rates.

Q3
For regression tasks, the best activation function for the output layer is usually a linear
(identity) activation, because it allows the model to predict any real-valued number without
restriction.

Q4
The main difference between CNN and MLP is in how they handle data. An MLP connects
every neuron to all neurons in the next layer, so it ignores spatial structure and has a large
number of parameters. A CNN uses convolutional layers with shared weights and local
connections, which allows it to capture spatial patterns like edges and textures, makes it
more efficient, and works especially well for images and grid-like data.

Q5
We use convolution when the data has spatial or local structure, such as images, audio
signals, or time-series. Convolutions are useful because they focus on local patterns (like
edges in images), use shared weights which greatly reduce the number of parameters, and
preserve spatial relationships. This makes models more efficient, faster to train, and better
at generalizing compared to fully connected layers.

Q6
Convolution on images works by sliding a small matrix called a filter (or kernel) over the
image. At each position, the filter is placed on top of a small region of the image, the values
are multiplied element by element, and then summed to produce one number. This number
becomes a pixel in a new image called a feature map. By moving the filter across the whole
image, the network can detect local patterns such as edges, corners, or textures. Different
filters learn to detect different patterns.

Q7
right but high number of hidden layers can cause overfitting risk

Q8
RNNs work by processing data step by step while keeping a memory of previous inputs. At
each time step, the RNN takes the current input and the hidden state from the previous
step, combines them, and produces a new hidden state. This hidden state carries information
from the past, which allows RNNs to model sequences like text, speech, or time-series data.

6
Q9
There is no absolute “better” one; it depends on the use case. PyTorch is generally preferred
for research and learning because it is more intuitive, uses dynamic computation graphs, and
is easier to debug. TensorFlow is often preferred in production and deployment because it
has strong tools for scaling, mobile and web deployment, and long-term production support.
In short, PyTorch is better for flexibility and experimentation, while TensorFlow is better
for large-scale and production-ready systems.

Q10
We use activation functions to introduce non-linearity into neural networks. Without ac-
tivation functions, the network would behave like a simple linear model no matter how
many layers it has. Activation functions allow the network to learn complex patterns, make
decisions, and approximate complicated relationships in data.

Bias vs. Variance in Machine Learning
100% (1)
Bias vs. Variance in Machine Learning
5 pages
Supervised vs Unsupervised Learning Guide
No ratings yet
Supervised vs Unsupervised Learning Guide
18 pages
Key Concepts in Machine Learning
No ratings yet
Key Concepts in Machine Learning
24 pages
Ai Overview
No ratings yet
Ai Overview
7 pages
Essential ML Interview Questions 2024
No ratings yet
Essential ML Interview Questions 2024
13 pages
Top 25 Machine Learning Interview Questions 1
No ratings yet
Top 25 Machine Learning Interview Questions 1
10 pages
DL Study Guide Complete
No ratings yet
DL Study Guide Complete
29 pages
ML Assignment2 Answers
No ratings yet
ML Assignment2 Answers
5 pages
Key Machine Learning Algorithms Explained
No ratings yet
Key Machine Learning Algorithms Explained
67 pages
Supervised vs. Unsupervised Learning Explained
No ratings yet
Supervised vs. Unsupervised Learning Explained
8 pages
Machine Learning - 1
No ratings yet
Machine Learning - 1
25 pages
AIML
No ratings yet
AIML
3 pages
Machine Learning Overview: Types & Applications
No ratings yet
Machine Learning Overview: Types & Applications
13 pages
Key ML Questions for End Semester Exam
No ratings yet
Key ML Questions for End Semester Exam
18 pages
Machine Learning Q&A for Data Scientists
No ratings yet
Machine Learning Q&A for Data Scientists
21 pages
Machine Learning Lab Viva Questions
No ratings yet
Machine Learning Lab Viva Questions
8 pages
Machine Learning Concepts and Techniques
No ratings yet
Machine Learning Concepts and Techniques
19 pages
Machine Learning Q&A: Concepts & Techniques
No ratings yet
Machine Learning Q&A: Concepts & Techniques
57 pages
Top 25 Machine Learning Interview Q&A
No ratings yet
Top 25 Machine Learning Interview Q&A
11 pages
Machine Learning Viva Questions Bank
No ratings yet
Machine Learning Viva Questions Bank
10 pages
Vanishing Gradients in Neural Networks
No ratings yet
Vanishing Gradients in Neural Networks
82 pages
Supervised Learning
No ratings yet
Supervised Learning
16 pages
Machine Learning Concepts and Algorithms
No ratings yet
Machine Learning Concepts and Algorithms
14 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
36 pages
Machine Learning Techniques Explained
No ratings yet
Machine Learning Techniques Explained
4 pages
Correlation, ML Algorithms, and Bias-Variance
No ratings yet
Correlation, ML Algorithms, and Bias-Variance
9 pages
JRF ML DL RL Basics Qna
No ratings yet
JRF ML DL RL Basics Qna
21 pages
MLT ST1 Solution
No ratings yet
MLT ST1 Solution
14 pages
Machine Learning Concepts and Techniques
No ratings yet
Machine Learning Concepts and Techniques
10 pages
Machine Learning Concepts Overview
No ratings yet
Machine Learning Concepts Overview
5 pages
AML Detailed Notes
No ratings yet
AML Detailed Notes
4 pages
100 Machine Learning Interview Q&A
No ratings yet
100 Machine Learning Interview Q&A
24 pages
Machine Learning Viva Prep Guide
No ratings yet
Machine Learning Viva Prep Guide
7 pages
Deep Learning Exam Questions Guide
No ratings yet
Deep Learning Exam Questions Guide
32 pages
Gen Ai Interview Qa Guide
No ratings yet
Gen Ai Interview Qa Guide
9 pages
Essential ML Interview Questions
No ratings yet
Essential ML Interview Questions
12 pages
AI Learning Models Comparison Guide
No ratings yet
AI Learning Models Comparison Guide
15 pages
ML Viva EasyLanguage
No ratings yet
ML Viva EasyLanguage
13 pages
Deep Learning Assignment 1 Overview
No ratings yet
Deep Learning Assignment 1 Overview
5 pages
V2V AAM Super 25
No ratings yet
V2V AAM Super 25
51 pages
MLViva
No ratings yet
MLViva
7 pages
Discriminant Functions and Learning Algorithms
No ratings yet
Discriminant Functions and Learning Algorithms
12 pages
Deep Learning in Image Classification
No ratings yet
Deep Learning in Image Classification
68 pages
Supervised Learning in AI & ML
No ratings yet
Supervised Learning in AI & ML
35 pages
Gen Ai Interview QnAs
No ratings yet
Gen Ai Interview QnAs
13 pages
Top Data Scientist Interview Questions
No ratings yet
Top Data Scientist Interview Questions
33 pages
Gradient Descent and Machine Learning Concepts
No ratings yet
Gradient Descent and Machine Learning Concepts
29 pages
Machine Learning Concepts Explained
100% (1)
Machine Learning Concepts Explained
13 pages
Supervised vs Unsupervised Learning Explained
No ratings yet
Supervised vs Unsupervised Learning Explained
4 pages
Machine Learning Viva Questions and Answers
No ratings yet
Machine Learning Viva Questions and Answers
6 pages
Overview of Neural Networks and Algorithms
No ratings yet
Overview of Neural Networks and Algorithms
16 pages
ML 1
No ratings yet
ML 1
32 pages
Data Science Interview Insights
100% (1)
Data Science Interview Insights
68 pages
Machine Learning Concepts and Applications
No ratings yet
Machine Learning Concepts and Applications
10 pages
Supervised and Unsupervised Learning Overview
No ratings yet
Supervised and Unsupervised Learning Overview
8 pages
Machine Learning Interview Question Bank
No ratings yet
Machine Learning Interview Question Bank
105 pages
Dopamine Hypothesis in Schizophrenia
No ratings yet
Dopamine Hypothesis in Schizophrenia
2 pages
Clonal Degradation in Cannabis Plants
No ratings yet
Clonal Degradation in Cannabis Plants
9 pages
Od Notes
No ratings yet
Od Notes
2 pages
Dry Mix Mortar: Innovations & Impact
100% (1)
Dry Mix Mortar: Innovations & Impact
16 pages
Calibration Procedure for Dial Indicators
No ratings yet
Calibration Procedure for Dial Indicators
2 pages
Automotive Engineering Career Aspirations
No ratings yet
Automotive Engineering Career Aspirations
1 page
Occlusal Considerations for RPDs
No ratings yet
Occlusal Considerations for RPDs
17 pages
IT Skills in Business Management
No ratings yet
IT Skills in Business Management
7 pages
Business Studies Grade 10 Term 2 Week 3 - 2020
No ratings yet
Business Studies Grade 10 Term 2 Week 3 - 2020
4 pages
Proper Bedding for PVC Pipe Systems
No ratings yet
Proper Bedding for PVC Pipe Systems
5 pages
Injection Pump Parts List
No ratings yet
Injection Pump Parts List
5 pages
Criminology 4: Ethics and Conduct Standards
50% (2)
Criminology 4: Ethics and Conduct Standards
5 pages
Key Insights on Death of a Salesman
No ratings yet
Key Insights on Death of a Salesman
19 pages
Inorganic Trends & Exceptions For NEET
No ratings yet
Inorganic Trends & Exceptions For NEET
14 pages
Positive Discipline in Teaching
No ratings yet
Positive Discipline in Teaching
33 pages
LS40M51B11 Limit Switch Specifications
No ratings yet
LS40M51B11 Limit Switch Specifications
3 pages
Insights on Bangladesh's Mutual Fund Sector
No ratings yet
Insights on Bangladesh's Mutual Fund Sector
23 pages
Character Alienation in The Zoo Story
No ratings yet
Character Alienation in The Zoo Story
10 pages
Elon Musk's Twitter Acquisition Saga
No ratings yet
Elon Musk's Twitter Acquisition Saga
4 pages
Overview of Astrophysics Coordinate Systems
No ratings yet
Overview of Astrophysics Coordinate Systems
2 pages
Gujarat UG Admission Application 2024
No ratings yet
Gujarat UG Admission Application 2024
2 pages
Torm Tankers Interview Questions
No ratings yet
Torm Tankers Interview Questions
11 pages
Cpacc Msds and Non DG
No ratings yet
Cpacc Msds and Non DG
6 pages
Operations Improvement Strategies Explained
No ratings yet
Operations Improvement Strategies Explained
41 pages
Grade 9 Animal Production Lesson Plan
No ratings yet
Grade 9 Animal Production Lesson Plan
3 pages
Synchronous Learning Action Research Example
100% (1)
Synchronous Learning Action Research Example
4 pages
Vajiram & Ravi IAS Course Fees 2025-2027
No ratings yet
Vajiram & Ravi IAS Course Fees 2025-2027
6 pages
CA Taxation Mock Test 2024-25
No ratings yet
CA Taxation Mock Test 2024-25
51 pages
Effective Reading for Dyslexia
No ratings yet
Effective Reading for Dyslexia
2 pages
Mechanical Systems: Types & Modeling
No ratings yet
Mechanical Systems: Types & Modeling
28 pages

InterView Questions

Uploaded by

InterView Questions

Uploaded by

Model Answers

or by increasing training data.

Ridge regression (L2 ) shrinks weights without eliminating them:

You might also like