CART Decision Tree: Overview & Techniques

The document discusses the CART decision tree algorithm, which can handle both classification and regression tasks using the Gini index for decision points. It highlights the limitations of the ID3 algorithm, the importance of hyperparameter tuning to prevent overfitting, and methods such as pruning to improve decision tree performance. Additionally, it introduces the concept of Random Forests, which applies decision trees to subsets of data for better accuracy.

Uploaded by

rkr201759

Artificial Intelligence and Machine Learning

Decision Tree – CART

Dr. Rajashree Nayak
DECISION TREE – CART
• CART is an alternative decision tree building algorithm that can handle both classification and regression tasks.

• For classification tasks, it uses a metric named the Gini index to create decision points.

• Select the feature with the lowest Gini index for splitting.
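The selection rule above can be sketched in a few lines. This is a minimal illustration, not the deck's own code: it assumes the usual definition Gini = 1 - sum(p_k^2) over the class proportions p_k, and scores a nominal feature by the sample-weighted Gini of the groups it creates (a multiway split, for simplicity; CART itself is usually described with binary splits). The toy rows and the names `gini`, `weighted_gini`, and `best_feature` are hypothetical.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels: 1 - sum(p_k^2)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def weighted_gini(rows, feature, target):
    """Sample-weighted Gini of the groups produced by splitting on `feature`."""
    groups = {}
    for row in rows:
        groups.setdefault(row[feature], []).append(row[target])
    n = len(rows)
    return sum(len(g) / n * gini(g) for g in groups.values())

def best_feature(rows, features, target):
    """Pick the feature whose split has the lowest weighted Gini."""
    return min(features, key=lambda f: weighted_gini(rows, f, target))

# Hypothetical toy data, made up for illustration only.
rows = [
    {"outlook": "sunny", "windy": "no", "play": "no"},
    {"outlook": "sunny", "windy": "yes", "play": "no"},
    {"outlook": "overcast", "windy": "no", "play": "yes"},
    {"outlook": "overcast", "windy": "yes", "play": "yes"},
    {"outlook": "rain", "windy": "no", "play": "yes"},
    {"outlook": "rain", "windy": "yes", "play": "no"},
]

chosen = best_feature(rows, ["outlook", "windy"], "play")
print(chosen)  # "outlook": its groups are purer than those of "windy"
```

Here "outlook" wins because two of its three groups are pure (Gini 0), while both "windy" groups are mixed.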
OBSERVATION
• ID3 is the most common conventional decision tree algorithm, but it has bottlenecks: attributes must be nominal values, the dataset must not include missing data, and the algorithm tends to overfit.
WHEN TO STOP SPLITTING?

• Real-world datasets usually have a large number of features, which leads to a large number of splits and, in turn, a huge tree.

• Such trees take time to build and can lead to overfitting: the tree gives very good accuracy on the training dataset but poor accuracy on test data.
HYPERPARAMETER TUNING
• There are many ways to tackle this problem through hyperparameter tuning.

• We can set the maximum depth of the decision tree using the max_depth parameter. The larger the value of max_depth, the more complex the tree will be.

• Another way is to set the minimum number of samples required to split a node, denoted by min_samples_split.

• For example, with min_samples_split set to 10, a node that has fewer than 10 samples is not split further and becomes a leaf node.
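As a rough sketch of how these two parameters act during tree growth, the check below decides whether a node should become a leaf. The function name `should_stop` and the default values are assumptions for illustration; only the parameter names max_depth and min_samples_split come from the slides.

```python
def should_stop(labels, depth, max_depth=3, min_samples_split=10):
    """Return True if the node should become a leaf instead of being split.

    labels: class labels of the training samples that reach this node
    depth:  depth of this node (the root is at depth 0)
    """
    if len(set(labels)) <= 1:
        return True  # node is already pure (or empty), nothing to split
    if max_depth is not None and depth >= max_depth:
        return True  # depth limit reached
    if len(labels) < min_samples_split:
        return True  # too few samples to justify another split
    return False

# A node with 9 mixed samples is below min_samples_split=10, so it becomes a leaf.
small_node = ["yes"] * 5 + ["no"] * 4
print(should_stop(small_node, depth=1))

# A node with 20 mixed samples at depth 1 may still be split.
large_node = ["yes"] * 10 + ["no"] * 10
print(should_stop(large_node, depth=1))
```

The same parameter names are accepted by scikit-learn's DecisionTreeClassifier, for example DecisionTreeClassifier(max_depth=3, min_samples_split=10).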
DECISION TREE – PROS AND CONS
OVERFITTING
• A hypothesis h is said to overfit the training data if there is another hypothesis h’ such that h has a smaller error than h’ on the training data but a larger error than h’ on the test data.

[Figure: accuracy on training data and on testing data versus complexity of tree]
OVERFITTING
• Outlook = Sunny
• Temp = Hot
• Humidity = Normal
• Wind = Strong
• label: NO
• this example doesn’t exist in the tree

Outlook
  Sunny (1,2,8,9,11; 2+,3-): Humidity
    High: No
    Normal: Yes
  Overcast (3,7,12,13; 4+,0-): Yes
  Rain (4,5,6,10,14; 3+,2-): Wind
    Strong: No
    Weak: Yes
OVERFITTING
• Outlook = Sunny
• Temp = Hot
• Humidity = Normal
• Wind = Strong
• label: NO
• this example doesn’t exist in the tree

Extended tree (this can always be done – may fit noise or other coincidental regularities):

Outlook
  Sunny (1,2,8,9,11; 2+,3-): Humidity
    High: No
    Normal: Wind
      Strong: No
      Weak: Yes
  Overcast (3,7,12,13; 4+,0-): Yes
  Rain (4,5,6,10,14; 3+,2-): Wind
    Strong: No
    Weak: Yes
REASONS FOR OVERFITTING
• Too much variance in the training data
  • Training data is not a representative sample of the instance space
  • We split on features that are actually irrelevant

• Too much noise in the training data
  • Noise = some feature values or class labels are incorrect
  • We learn to predict the noise

• In both cases, it is a result of our will to minimize the empirical error when we learn, and the ability to do it (with DTs)
PREVENTING OVERFITTING
PRUNING

• Pruning is another method that can help us avoid overfitting. It improves the performance of the decision tree by cutting nodes or sub-nodes that are not significant, and it removes branches that have very low importance.
• There are mainly 2 ways of pruning:
  • Pre-pruning – we stop growing the tree early, which means we can prune/remove/cut a node if it has low importance while growing the tree.
  • Post-pruning – once the tree is built to its full depth, we start pruning nodes based on their significance.
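One common way to do post-pruning is reduced-error pruning, sketched below. The slides do not name a specific method, so this is a swapped-in illustration under stated assumptions: a subtree is considered "not significant" when replacing it with a majority-class leaf does not lower accuracy on a held-out validation set. The tree layout (nested dicts with "feature", "children", and "majority" keys), the helper names, and the validation rows are all hypothetical.

```python
def predict(tree, row):
    """Follow the branches for `row`; a leaf is stored as a plain label string."""
    while isinstance(tree, dict):
        child = tree["children"].get(row[tree["feature"]])
        if child is None:
            return tree["majority"]  # unseen feature value: fall back to majority
        tree = child
    return tree

def accuracy(tree, rows, target):
    """Fraction of rows predicted correctly (1.0 when there are no rows)."""
    if not rows:
        return 1.0
    return sum(predict(tree, r) == r[target] for r in rows) / len(rows)

def prune(tree, val_rows, target):
    """Bottom-up reduced-error pruning against validation rows."""
    if not isinstance(tree, dict):
        return tree
    feature = tree["feature"]
    # Prune each child using only the validation rows that reach it.
    children = {
        value: prune(child, [r for r in val_rows if r[feature] == value], target)
        for value, child in tree["children"].items()
    }
    candidate = {**tree, "children": children}
    leaf = tree["majority"]
    # Collapse to a leaf when that is at least as accurate on validation data.
    if accuracy(leaf, val_rows, target) >= accuracy(candidate, val_rows, target):
        return leaf
    return candidate

wind = {"feature": "wind", "majority": "yes",
        "children": {"strong": "no", "weak": "yes"}}
overfit_tree = {
    "feature": "outlook", "majority": "yes",
    "children": {
        "sunny": {"feature": "humidity", "majority": "no",
                  "children": {"high": "no", "normal": dict(wind)}},
        "overcast": "yes",
        "rain": dict(wind),
    },
}

# Hypothetical validation rows, made up for illustration only.
val_rows = [
    {"outlook": "sunny", "humidity": "normal", "wind": "strong", "play": "yes"},
    {"outlook": "sunny", "humidity": "normal", "wind": "weak", "play": "yes"},
    {"outlook": "sunny", "humidity": "high", "wind": "weak", "play": "no"},
    {"outlook": "overcast", "humidity": "high", "wind": "strong", "play": "yes"},
    {"outlook": "rain", "humidity": "normal", "wind": "strong", "play": "no"},
    {"outlook": "rain", "humidity": "high", "wind": "weak", "play": "yes"},
]

pruned = prune(overfit_tree, val_rows, "play")
print(pruned["children"]["sunny"]["children"]["normal"])
```

With these rows, the extra Wind test under Sunny and Normal humidity is collapsed back into a single leaf, while the Wind test under Rain is kept because it still helps on the validation data.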
DECISION TREE TO RANDOM FOREST

• Instead of applying the decision tree algorithm to the whole dataset, the dataset is separated into subsets and the decision tree algorithm is applied to each subset.

• The final decision is made by majority vote: the result returned by the highest number of subset trees wins.
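The two steps above can be sketched as follows. This is a simplified illustration, not a full random forest: it assumes the subsets are bootstrap samples (drawn with replacement, the usual choice in random forests), it omits the random feature selection that random forests normally add, and it stands in hand-written rules for the trained trees. All names and data here are hypothetical.

```python
import random
from collections import Counter

def bootstrap_subsets(rows, n_subsets, seed=0):
    """Draw n_subsets samples of len(rows) rows each, with replacement."""
    rng = random.Random(seed)
    return [[rng.choice(rows) for _ in rows] for _ in range(n_subsets)]

def majority_vote(predictions):
    """Return the label predicted by the largest number of trees."""
    return Counter(predictions).most_common(1)[0][0]

def forest_predict(trees, row):
    """Ask every tree for a prediction and combine them by majority vote."""
    return majority_vote([tree(row) for tree in trees])

rows = [{"outlook": o, "wind": w} for o in ("sunny", "overcast", "rain")
        for w in ("strong", "weak")]
subsets = bootstrap_subsets(rows, n_subsets=3)

# Stand-ins for trees trained on each subset (hypothetical hand-written rules).
trees = [
    lambda r: "yes" if r["outlook"] == "overcast" else "no",
    lambda r: "yes" if r["wind"] == "weak" else "no",
    lambda r: "yes" if r["outlook"] != "sunny" else "no",
]

print(forest_predict(trees, {"outlook": "rain", "wind": "weak"}))
```

In this example two of the three stand-in trees answer "yes", so the forest's decision is "yes".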
GRAPHICAL ABSTRACT
HOW DOES IT WORK ?
ADVANTAGES
