0% found this document useful (0 votes)

147 views5 pages

Key Machine Learning Questions & Answers

This document contains a question bank for machine learning concepts including: 1. It provides definitions and questions to test understanding of fundamental machine learning concepts like supervised vs unsupervised learning, classification vs regression, and linear vs non-linear models. 2. It also includes questions about specific algorithms like decision trees, naive Bayes, KNN, linear regression, logistic regression, and SVMs. Example questions cover how to implement the algorithms, calculate key metrics, and evaluate their effectiveness. 3. The last part focuses on questions for decision trees, including how to calculate information gain and entropy, detect overfitting, and determine the stopping criteria.

Uploaded by

manisha mudgal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

147 views5 pages

Key Machine Learning Questions & Answers

Uploaded by

manisha mudgal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Question Bank for Section A
Questions on Linear and Logistic Regression and SVM
Questions on KNN Algorithm
Questions on Decision Trees
Questions on Naïve Bayes Classifier

QUESTION BANK FOR SECTION A

Basic Ml questions:

a) Define Machine learning? Briefly explain the types of learning.

c) What are the issues in decision tree induction?

[Link]
[Link] and hypothesis space
[Link] space\feature matrix
[Link] gain
[Link] index
[Link] does this change with positive and negative data values of a feature
[Link] curve
[Link] and hard margin in SVM
[Link]
[Link] vector
[Link] pruning
[Link] pruning
[Link]
[Link] probability.

[Link] between
[Link] and unsupervised machine learning
[Link] and regression
[Link] v/s all and one v/s one multiclass classification
[Link] and logistic regression
[Link] learning and traditional programming
[Link] and non linear separable data.
Q. What are the elements of reinforcement learning?

Q. How to classify mixed data?

[Link] short notes on

a) Logistic regression
b) Back propogation algorithm
c) issues in machine Learning
[Link] one is the best supervised machine learning algorithm out of decision tree, naive
bayes,K-NN and SVM for classifying large PDF documents?

Questions on Distance based method/nearest neighbour/knn

[Link] is “K” in KNN algorithm?

2. How do we decide the value of "K" in KNN algorithm?

3. Why is the odd value of “K” preferable in KNN algorithm?

4. What is the difference between Euclidean Distance and Manhattan distance? What is the
formula of Euclidean distance and Manhattan distance?

5. Why is KNN algorithm called Lazy Learner?

6. Why should we not use KNN algorithm for large datasets?

7. What are the advantages and disadvantages of KNN algorithm?

8. A dealer has a warehouse that stores a variety of fruits and vegetables. When fruit is
brought to the warehouse, various types of fruit may be mixed together. The dealer wants a
model that will sort the fruit according to type. Justify with reasons how machine learning
model is efficient compared to feature based classification technique.
[Link] the K nearest neighbor recognition what would be the best distance metric to
implement for a handwritten digit recognizer?
[Link] to combine and code SVM and KNN for image classification?
[Link] can we increase the acuuracy of KNN ?

Questions on linear and logistic regression and SVM.

[Link] linear regression with example.

1. What is a logistic function? What is the range of values of a logistic function?
2. Why is logistic regression very popular?
3. What is the formula for the logistic regression function?
4. How can the probability of a logistic regression model be expressed as conditional
probability?
5. What are the outputs of the logistic model and the logistic function?
[Link] can’t linear regression be used in place of logistic regression for binary classification?
7. Is the decision boundary linear or nonlinear in the case of a logistic regression model?
8. What is the likelihood function?
9. What is the Maximum Likelihood Estimator (MLE)?
10. Why can’t we use Mean Square Error (MSE) as a cost function for logistic regression?
11. Why is accuracy not a good measure for classification problems?
12. Which algorithm is better at handling outliers logistic regression or SVM?
13. How will you deal with the multiclass classification problem using logistic regression?
[Link] to choose the best fit [Link] with example.
[Link] the working of SVM with diagram.
[Link] values of independent variable x and dependent value y are given below:
Find the least square regression line y=ax+b. Estimate the value of y when x is 10.

[Link] is the goal of the support vector machine (SVM)? How to compute the margin
[Link] decision tree to classify students based on their academic [Link] with
example.
[Link] that we want to build a neural network that classifies two dimensional data (i.e., X
= [x1, x2]) into two classes: diamonds and crosses. We have a set of training data that is
plotted as follows:

Draw a network that can solve this classification problem. Justify your choice of the number
of nodes and the architecture. Draw the decision boundary that your network can find on the
diagram.
[Link] is kernel trick in [Link] with the help of [Link] explain the types of
kernel.
[Link] SVM is efficient than logistic regression for classification?
[Link] will you apply SVM to detect credit card fraud?

Questions on Decision trees

[Link] the following dataset for predicting a outcome of a tennis match

Write the formula for information gain for an [Link] information gain for
all [Link] is selected as the root node?

[Link] are entropy and information gain related vis-a-vis decision trees?
[Link] do you calculate the entropy of children nodes after the split based on on a feature?
[Link] overfitting problem in decision trees.
[Link] is the stopping criteria in decision tress?
[Link] is a decision tree ? How a decision tree is constructed explain with example.

Questions on Naïve Bayes classifier

[Link] naïve bayes classifier in context with Bayes theorem

[Link] Bayes theorem.
[Link] is Naive Bayes naive?

Common questions

Accuracy may not be suitable for imbalanced datasets because it doesn't consider the class distribution. A model might achieve high accuracy by simply predicting the majority class, which can be misleading when the minority class is the true interest of analysis. In such cases, metrics like precision, recall, and the F1-score provide better insights into the model's performance on each class, emphasizing the success on the minority class .

Decision trees can easily overfit the training data by capturing noise and small fluctuations in the dataset. To mitigate this, techniques such as pruning (pre-pruning and post-pruning) are used. Pre-pruning stops the tree's growth early before it becomes too complex, while post-pruning removes branches that have little importance after the tree is fully grown. These techniques help to simplify the model and enhance its generalization to new data .

The kernel trick in SVM is a method that allows the algorithm to operate in a higher-dimensional space without explicitly calculating the coordinates in that space, by using a kernel function. This capability is particularly useful for dealing with nonlinear data. Common kernel types include the linear kernel, polynomial kernel, and radial basis function (RBF) kernel. Each kernel transforms the input space differently, enabling the SVM to create non-linear decision boundaries .

Entropy is a measure of the disorder or impurity in a dataset, with higher values indicating greater disorder. In decision tree construction, information gain is used to evaluate the effectiveness of an attribute in classifying the data. It is calculated by measuring the reduction in entropy after the dataset is split based on the attribute. An optimal split is one that achieves the highest information gain, indicating a more homogeneous division of data post-split, leading to clearer distinctions between nodes .

Mean Square Error (MSE) is not suitable for logistic regression because the predictions are probabilistic and bounded between 0 and 1, which can result in non-convex minimization problems. Instead, the logistic regression model uses the log-likelihood function as its cost function, whereby the Maximum Likelihood Estimation (MLE) technique is employed to find the optimal parameters. This results in a convex optimization problem that can be efficiently solved .

The hyperplane in a Support Vector Machine (SVM) serves as the decision boundary that separates different classes in the feature space. The goal of an SVM is to find the optimal hyperplane that maximizes the margin between the data points of different classes, ensuring that the classification is as robust as possible. This maximization of the margin reduces the model's susceptibility to overfitting and increases its ability to generalize to unseen data .

Support Vector Machines (SVM) generally handle outliers better than logistic regression, primarily because SVM focuses on maximizing the margin around the decision boundary and not all data points, particularly those outside the margin, influence its positioning. Logistic regression, however, assumes a linear relationship with all data points and is sensitive to outliers that can skew the decision boundary away from its optimal position .

Multiclass classification using logistic regression can be approached with 'one vs. all' where a separate binary classifier is trained for each class to determine whether data belongs to that class or not. Alternatively, 'one vs. one' involves training classifiers for every pair of classes. While 'one vs. all' is computationally simpler, 'one vs. one' often provides more accurate results as the classifiers focus on separating only two classes at a time, leading to simpler decision boundaries .

Supervised learning involves training a model on labeled data, where the outcomes are known, allowing the model to learn the mapping from inputs to outputs. Unsupervised learning, in contrast, deals with unlabeled data, where the model tries to identify patterns and structures, such as clustering or association. Distinguishing between these types is crucial because it influences the choice of algorithms and the approach to problem-solving. Supervised learning is suitable for tasks where outcomes are known and precise prediction is required, while unsupervised learning is used for exploratory data analysis and discovering hidden patterns .

Linear regression is used for predicting continuous outcome variables and assumes a linear relationship between features and target. Logistic regression, on the other hand, is used for predicting binary outcomes and leverages the logistic function to estimate probabilities, resulting in outputs bounded between 0 and 1. While linear regression provides direct numerical predictions, logistic regression provides class probabilities, making it essential for classification tasks .

QUESTION BANK FOR SECTION A
Basic Ml questions:
a) Define Machine learning? Briefly explain the types of learning.
c) What ar

1.What is “K” in KNN algorithm?
2. How do we decide the value of "K" in KNN algorithm?
3. Why is the odd value of “K” prefera

11. Why is accuracy not a good measure for classification problems?
12. Which algorithm is better at handling outliers logist

Q.What is kernel trick in SVM.Explain with the help of example.Also explain the types of
kernel.
Q.Why SVM is efficient than

ANN Quiz - PDF - Artificial Neural Network - Computational Science
No ratings yet
ANN Quiz - PDF - Artificial Neural Network - Computational Science
17 pages
Machine Learning Question Bank Module
No ratings yet
Machine Learning Question Bank Module
7 pages
Neural Network Concepts and Misconceptions
No ratings yet
Neural Network Concepts and Misconceptions
13 pages
Machine Learning Question Bank 2023
No ratings yet
Machine Learning Question Bank 2023
11 pages
Deep Learning Quiz for JNTUK Students
No ratings yet
Deep Learning Quiz for JNTUK Students
3 pages
ML Question Paper
No ratings yet
ML Question Paper
2 pages
Perceptron and Adaline Calculations
No ratings yet
Perceptron and Adaline Calculations
7 pages
Neural Networks and Learning Techniques
50% (2)
Neural Networks and Learning Techniques
21 pages
ECE528 Machine Learning Midterm Exam
No ratings yet
ECE528 Machine Learning Midterm Exam
4 pages
Machine Learning MCQs and Concepts
100% (1)
Machine Learning MCQs and Concepts
10 pages
Deep Learning Exam Questions & Format
No ratings yet
Deep Learning Exam Questions & Format
2 pages
RNN Exam Questions and Answers
No ratings yet
RNN Exam Questions and Answers
15 pages
Machine Learning Question Bank PDF
No ratings yet
Machine Learning Question Bank PDF
7 pages
Machine Learning Techniques Question Bank
No ratings yet
Machine Learning Techniques Question Bank
13 pages
Basis Function Regression Explained
0% (1)
Basis Function Regression Explained
3 pages
Overview of Recurrent Neural Networks
No ratings yet
Overview of Recurrent Neural Networks
2 pages
CISC 867 Deep Learning Assignment 1
No ratings yet
CISC 867 Deep Learning Assignment 1
3 pages
Genetic Algorithms and Back Propagation MCQs
No ratings yet
Genetic Algorithms and Back Propagation MCQs
3 pages
Applied Machine Learning Question Bank
No ratings yet
Applied Machine Learning Question Bank
2 pages
Neural Networks Question Bank
No ratings yet
Neural Networks Question Bank
42 pages
Understanding Output Variables in ML
100% (1)
Understanding Output Variables in ML
9 pages
NLP MCQs on RNNs and Sentiment Analysis
No ratings yet
NLP MCQs on RNNs and Sentiment Analysis
13 pages
M.Tech Machine Learning Exam Paper 2023
No ratings yet
M.Tech Machine Learning Exam Paper 2023
2 pages
NPTEL Machine Learning MCQ Assignment
No ratings yet
NPTEL Machine Learning MCQ Assignment
26 pages
k-NN Algorithm: Key Concepts and Questions
No ratings yet
k-NN Algorithm: Key Concepts and Questions
28 pages
GATE Question Bank: ML Decision Trees & Bias
No ratings yet
GATE Question Bank: ML Decision Trees & Bias
37 pages
AI Neural Networks MCQ Practice Guide
No ratings yet
AI Neural Networks MCQ Practice Guide
13 pages
CS 189 Midterm Exam Guidelines
No ratings yet
CS 189 Midterm Exam Guidelines
106 pages
Binning and Normalization Techniques
No ratings yet
Binning and Normalization Techniques
4 pages
Deep Learning Viva Questions Guide
No ratings yet
Deep Learning Viva Questions Guide
7 pages
RTMNU Machine Learning Exam Paper 2024
100% (1)
RTMNU Machine Learning Exam Paper 2024
4 pages
Machine Learning Exam Questions
100% (1)
Machine Learning Exam Questions
2 pages
AL3451 Machine Learning Question Bank
100% (1)
AL3451 Machine Learning Question Bank
12 pages
Machine Learning Mid-Sem Exam Questions
No ratings yet
Machine Learning Mid-Sem Exam Questions
11 pages
Deep Learning Course Notes - IIT Ropar
No ratings yet
Deep Learning Course Notes - IIT Ropar
1 page
Decision Trees in Machine Learning
No ratings yet
Decision Trees in Machine Learning
3 pages
Python ML and Clustering Quiz Answers
No ratings yet
Python ML and Clustering Quiz Answers
2 pages
MCQ On Genetic Algorithms 5eea6a0e39140f30f369e524
No ratings yet
MCQ On Genetic Algorithms 5eea6a0e39140f30f369e524
21 pages
Decision Tree Concepts and CHAID Model
No ratings yet
Decision Tree Concepts and CHAID Model
4 pages
Deep Learning MCQs for Knowledge Assessment
No ratings yet
Deep Learning MCQs for Knowledge Assessment
4 pages
AMT 305 Machine Learning Exam Paper
No ratings yet
AMT 305 Machine Learning Exam Paper
3 pages
Data Mining Final Exam Solutions
No ratings yet
Data Mining Final Exam Solutions
5 pages
BPUT COA 4th Sem Question Paper 2017-18
No ratings yet
BPUT COA 4th Sem Question Paper 2017-18
2 pages
Parallel and Distributed IR Architectures
No ratings yet
Parallel and Distributed IR Architectures
33 pages
Machine Learning Assignment MCQs
No ratings yet
Machine Learning Assignment MCQs
34 pages
BAI701: Deep Learning Overview
No ratings yet
BAI701: Deep Learning Overview
25 pages
Deep Learning Question Bank 2024-25
No ratings yet
Deep Learning Question Bank 2024-25
2 pages
AI & ML Techniques: Question Bank
No ratings yet
AI & ML Techniques: Question Bank
2 pages
Deep Learning MCQs and Answers
No ratings yet
Deep Learning MCQs and Answers
4 pages
Machine Learning Question Paper - Anna University
No ratings yet
Machine Learning Question Paper - Anna University
4 pages
Machine Learning Question Bank PDF
No ratings yet
Machine Learning Question Bank PDF
4 pages
Bagging vs Boosting Trees Explained
100% (1)
Bagging vs Boosting Trees Explained
12 pages
Anna University ML Question Paper 2024
No ratings yet
Anna University ML Question Paper 2024
5 pages
Reinforcement Learning Assignment Solutions
No ratings yet
Reinforcement Learning Assignment Solutions
50 pages
Deep Learning Quiz 1: Concepts & Questions
No ratings yet
Deep Learning Quiz 1: Concepts & Questions
5 pages
CS771A End-Semester Exam 2016
100% (1)
CS771A End-Semester Exam 2016
8 pages
Practice Assignment Overview
No ratings yet
Practice Assignment Overview
26 pages
ML Student Revision Sheet (No Answer)
No ratings yet
ML Student Revision Sheet (No Answer)
10 pages
ML Viva Questions
No ratings yet
ML Viva Questions
8 pages
Overview of Machine Learning Concepts
No ratings yet
Overview of Machine Learning Concepts
7 pages
SQL WHILE Loop Syntax and Examples
No ratings yet
SQL WHILE Loop Syntax and Examples
11 pages
SQL Procedures: Creation and Examples
No ratings yet
SQL Procedures: Creation and Examples
8 pages
SQL Looping and Conditional Statements
No ratings yet
SQL Looping and Conditional Statements
15 pages
Overview of DOS Commands
No ratings yet
Overview of DOS Commands
12 pages
Data Analytics Life Cycle Insights
No ratings yet
Data Analytics Life Cycle Insights
41 pages
Child Marriage Effects on Health in Bangladesh
No ratings yet
Child Marriage Effects on Health in Bangladesh
43 pages
COVID-19 Vaccine Adoption in Canada
No ratings yet
COVID-19 Vaccine Adoption in Canada
9 pages
Influencing Factors of Understanding COVID-19 Risks and Coping Behaviors Among The Elderly Population
No ratings yet
Influencing Factors of Understanding COVID-19 Risks and Coping Behaviors Among The Elderly Population
16 pages
LIBROJ. S. Cramer - Logit Models From Economics and Other Fields-Cambridge University Press (2003)
100% (1)
LIBROJ. S. Cramer - Logit Models From Economics and Other Fields-Cambridge University Press (2003)
185 pages
Machine Learning for Cancer Prediction
No ratings yet
Machine Learning for Cancer Prediction
70 pages
Alcohol Use Among Adolescents in Liberia
No ratings yet
Alcohol Use Among Adolescents in Liberia
10 pages
BECS-184 Data Analysis Overview
No ratings yet
BECS-184 Data Analysis Overview
32 pages
Global Flourishing Study Methodology
No ratings yet
Global Flourishing Study Methodology
28 pages
Flood Fragility Models for Bridges
No ratings yet
Flood Fragility Models for Bridges
9 pages
ANOVA and Regression Analysis Concepts
No ratings yet
ANOVA and Regression Analysis Concepts
8 pages
Non-GAAP Earnings and Restatement Risks
No ratings yet
Non-GAAP Earnings and Restatement Risks
29 pages
Racial Composition and Workplace Attachment
No ratings yet
Racial Composition and Workplace Attachment
14 pages
Diabetes Prediction via Data Mining
No ratings yet
Diabetes Prediction via Data Mining
52 pages
Zarei 2020
No ratings yet
Zarei 2020
22 pages
Statistical Methods in Social Sciences
No ratings yet
Statistical Methods in Social Sciences
11 pages
Business Analytics Quiz for MBA Students
No ratings yet
Business Analytics Quiz for MBA Students
7 pages
Higher Education Expansion in China: Inequality Insights
No ratings yet
Higher Education Expansion in China: Inequality Insights
22 pages
Essentials of Business Analytics An Introduction To The Methodology and Its Applications ISBN 9783319688367, 3319688367 Full Download
100% (9)
Essentials of Business Analytics An Introduction To The Methodology and Its Applications ISBN 9783319688367, 3319688367 Full Download
16 pages
Fraud Diamond & Beneish M-Score Analysis
No ratings yet
Fraud Diamond & Beneish M-Score Analysis
17 pages
Logistic Regression & Scorecard PDF
No ratings yet
Logistic Regression & Scorecard PDF
100 pages
Developmental Delay in Ethiopian Children
No ratings yet
Developmental Delay in Ethiopian Children
15 pages
Classification vs. Regression in ML
No ratings yet
Classification vs. Regression in ML
51 pages
Bank-Fintech Partnerships for Inclusion
No ratings yet
Bank-Fintech Partnerships for Inclusion
32 pages
Complicated Grief Prevalence in Japan
No ratings yet
Complicated Grief Prevalence in Japan
7 pages
MoE Economics Exit Exam Answers 2023
100% (1)
MoE Economics Exit Exam Answers 2023
118 pages
Thesis Abstracts from Wolaita Sodo University
No ratings yet
Thesis Abstracts from Wolaita Sodo University
393 pages
Understanding Customer Churn in Banking
100% (1)
Understanding Customer Churn in Banking
20 pages
Katz Diagram
No ratings yet
Katz Diagram
16 pages
1471 Adasaasdasdas2458 2 9
No ratings yet
1471 Adasaasdasdas2458 2 9
6 pages

Key Machine Learning Questions & Answers

Uploaded by

Key Machine Learning Questions & Answers

Uploaded by

QUESTION BANK FOR SECTION A

a) Define Machine learning? Briefly explain the types of learning.

Q. How to classify mixed data?

[Link] short notes on

Questions on Distance based method/nearest neighbour/knn

2. How do we decide the value of "K" in KNN algorithm?

3. Why is the odd value of “K” preferable in KNN algorithm?

5. Why is KNN algorithm called Lazy Learner?

6. Why should we not use KNN algorithm for large datasets?

7. What are the advantages and disadvantages of KNN algorithm?

Questions on linear and logistic regression and SVM.

[Link] linear regression with example.

Questions on Decision trees

[Link] the following dataset for predicting a outcome of a tennis match

Questions on Naïve Bayes classifier

[Link] naïve bayes classifier in context with Bayes theorem

Common questions

Why might accuracy not be a suitable evaluation measure for classification problems, particularly in imbalanced datasets?

How does a decision tree handle issues related to overfitting, and what techniques are implemented to mitigate these effects?

Describe the concept of the kernel trick in SVM and provide examples of different kernel types.

Discuss the concepts of entropy and information gain in decision tree construction and how they determine an optimal splitting of nodes.

In the context of logistic regression, why is the cost function not based on Mean Square Error (MSE), and what alternative is used instead?

Explain the role of the hyperplane in a Support Vector Machine (SVM) and how it influences the classification of data.

Evaluate the efficiency of logistic regression and support vector machines (SVM) when it comes to handling outliers in data.

How do you address a multiclass classification problem using logistic regression, and what are the differences between 'one vs. all' and 'one vs. one' approaches?

What are the key differences between supervised and unsupervised machine learning, and why is it important to distinguish them when selecting a model for a specific task?

Compare and contrast linear and logistic regression, particularly in terms of their applicability to different types of prediction tasks.

You might also like