0% found this document useful (0 votes)
79 views2 pages

Key Machine Learning Concepts and Questions

The document outlines various topics in machine learning across five units, covering challenges, types of systems, and the importance of statistics. It discusses methods such as KNN, decision trees, ensemble learning, clustering techniques, and artificial neural networks. Additionally, it includes practical implementations and examples using tools like TensorFlow and Keras.

Uploaded by

cheetohunter516
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
79 views2 pages

Key Machine Learning Concepts and Questions

The document outlines various topics in machine learning across five units, covering challenges, types of systems, and the importance of statistics. It discusses methods such as KNN, decision trees, ensemble learning, clustering techniques, and artificial neural networks. Additionally, it includes practical implementations and examples using tools like TensorFlow and Keras.

Uploaded by

cheetohunter516
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd

UNIT-I

Can you name four of the main challenges in Machine Learning?


What are different types of machine learning systems.
Write short note on AI, ML, & DL.
Write the importance of statistics in Supervise learning and unsupervised learning.
Write a note on Training loss Vs Testing loss
Write about different risk statistics that you need to encounter while working with
Machine Learning.
What are different Tradeoffs in Statistical Learning? Explain.
How to estimate risk statistics? how to Minimize Empirical Risk ? Explain.
Write down the procedure for estimating sampling distribution of an estimator .

UNIT-II

Write a shot note of various Distance based Methods of classification / regression.


With an example, explain KNN.
What is decision tree? Explain the procedure to construct decision tree.
What are the appropriate problems for decision tree learning.
How to identify best splitting attribute in decision tee construction?
Explain Naive Bayes classification with example.
Explain linear and logistic regression with examples.
With an example explain binary classification in machine learning.

UNIT-III

What do you mean by Ensemble learning? What are its main challenges for developing?
What is the difference between hard and soft voting classifiers?
Differences between bagging and Boosting
What is the benefit of out-of-bag evaluation?
Explain about AdaBoost ensemble and Gradient Boosting ensemble.
With an example, explain the working of random forest.
Differences between decision tree and random forest.
What is stacking? Explain working of stacking as Ensemble learning?
What are SVM? Explain linear and non-linear SVM
Write a note on SVM Regression.
Write about Naive Bayes classifiers Vs SVM in Text classification.

UNIT-IV

With an example, explain K-means clustering. Also write limitations of K-means.


How clustering is used in image segmentation, preprocessing, and sem-supervised
learning?
Write about DBSCAN and Gaussian Mixtures.
What do you mean by cure of dimensionality? What solutions do you propose for this?
Write down the Main Approaches for Dimensionality Reduction
What is PCA? How it works as Dimensionality Reduction technique? Explain with
example.
How to implement PCA using Sci-kit learn?
Write a note on Randomized PCA and Kernel PCA

UNIT-V

Write down the biological motivation behind ANN


What is Perception? Explain Perception training with example.
What is gradient descent? Derive delta rules with algorithm.
Write a short note on Multi-layer networks and back propagation.
What are the various ways to implement MLPs with Keras? Explain.
Write about different ways to Installing TensorFlow 2.
With an example program, explain the procedure of Loading and Preprocessing Data
with TensorFlow.

Common questions

Powered by AI

The curse of dimensionality can be mitigated by employing dimensionality reduction techniques such as PCA, where the data is transformed into a lower-dimensional space while preserving most of the variance . Techniques like Random Projection and Kernel PCA can also be effective by focusing more on preserving the distances and structure respectively. Regularization techniques that penalize the complexity of the model could help manage overfitting, as well as feature selection processes that identify and retain only relevant features. Lastly, domain-specific knowledge often aids in creating tailored solutions to address high dimensionality's negative impacts .

Decision trees have several benefits for classification, including their ease of interpretation and the ability to handle both numerical and categorical data without needing extensive data preprocessing . They can model complex decision boundaries by partitioning the feature space. However, they are prone to overfitting, especially when the tree is deep, and can create overly complex models that do not generalize well beyond the training data. Pruning and setting constraints on the maximum depth are common techniques to mitigate overfitting . Additionally, they exhibit high variance, meaning that small data changes can significantly alter the model .

KNN is effective for regression in scenarios where the decision boundaries are non-linear and the dataset is not overwhelmingly large. It is simple to implement, versatile due to its non-parametric nature, and can model arbitrary functions. However, its effectiveness diminishes with high dimensions due to the 'curse of dimensionality' (leading to sparse data points) and is computationally expensive as it requires calculating the distance to many points during prediction . The choice of 'k' is crucial; too small a 'k' can lead to high variance while a large 'k' can result in high bias, affecting the model's ability to generalize .

Empirical risk minimization is fundamental in supervised learning as it forms the basis for optimizing model performance. The process involves minimizing the average loss over the training data, thus ensuring that the model's predictions closely align with the actual outcomes . The empirical risk acts as a proxy for the expected risk, as we attempt to approximate the true risk which is typically unknown. By focusing on minimizing empirical risk, adjustments in the model parameters are made iteratively (e.g., using gradient descent), ultimately leading to a model that performs well on the training dataset and has the potential to generalize effectively if correctly tuned .

The trade-offs in statistical learning, primarily the bias-variance trade-off, influence model selection significantly. Models with high bias (e.g., linear models) often underfit the training data, failing to capture the underlying trend adequately. Conversely, high-variance models (e.g., complex neural networks) may overfit the data, capturing noise along with the trend. Selecting the appropriate model involves finding a balance where both bias and variance are minimized to achieve optimal generalization performance on the test data . This balance impacts decisions regarding the complexity of the chosen model, the amount and method of data preprocessing, and hyperparameter tuning .

Bagging (Bootstrap Aggregating) and Boosting are ensemble techniques that improve model predictions by combining multiple models. Bagging reduces model variance by training each model on a random subset of the data and averaging their predictions, leading to better performance in unstable models like decision trees . Boosting, on the other hand, focuses on reducing bias by sequentially training models, each attempting to correct the errors of its predecessor. This sequential dependence typically leads to models that are harder to train but potentially more accurate . Weighting is also a distinguishing factor; boosting assigns higher weights to misclassified instances to focus subsequent iterations on difficult cases .

SVM is highly effective in text classification scenarios due to its ability to find the hyperplane that maximizes the margin between different classes, which leads to a robust decision boundary for high-dimensional data like text. In contrast, Naive Bayes, relying on the conditional independence assumption, is simple and fast, making it attractive for text tasks that have relatively few features compared to instances . SVM tends to achieve higher accuracy, especially in datasets where classes are linearly separable, but can be computationally intensive. Naive Bayes, with its probabilistic approach, often performs well even with little training data and can be more interpretable .

K-means clustering, while useful in image segmentation for its simplicity and efficiency, has several limitations. Firstly, K-means assumes spherical clusters which can be inappropriate for the complex shape of most real-world data clusters . It also requires the number of clusters (k) to be predefined, which is often not known in advance and influences the quality of segmentation significantly. Its reliance on initial centroid positions can lead to suboptimal clustering by converging to local minima, making results sensitive to the initial state. K-means is also susceptible to noise and outliers, which can drastically skew results, complicating the process of achieving precise segmentation .

Stacking improves predictive performance by leveraging the strengths of multiple models to create a meta-model that typically predicts better than any individual model alone. It involves training base classifiers with different algorithms and then using their outputs as features for training a meta-learner model . This approach allows stacking to capture a variety of patterns and interactions learned by the base models, thus enhancing overall prediction capability. The meta-learner uses cross-validation outputs from base learners to prevent overfitting, ensuring that ensemble prediction remains robust and generalizable across different datasets .

Estimating risk statistics is crucial as it directly affects the confidence we have in our machine learning models' predictions. If these statistics are poorly estimated, the model may either underfit or overfit the data, leading to unreliable predictions. For instance, improper risk estimation can lead to an enlarged generalization error where the model does not perform well on unseen data . Calculating empirical risk accurately involves considering various factors such as the variance and bias trade-offs, which directly affect model performance. Hence, understanding and minimizing empirical risk through correct estimation is vital for enhancing predictive accuracy .

You might also like