Class 3

A Decision Tree is a decision-making tool used in machine learning for classification and prediction, structured with a root node, branches, internal nodes, and leaf nodes. It operates by asking yes/no questions to split data based on features, with common splitting criteria including Gini Impurity and Entropy, while techniques like pruning help prevent overfitting. Decision Trees are versatile and interpretable, finding applications in fields such as banking, healthcare, education, and finance, while Random Forest Regression enhances predictions by averaging results from multiple decision trees.

Uploaded by

devikalyan2012

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views6 pages

Class 3

Uploaded by

devikalyan2012

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Decision Tree

A Decision Tree helps us to make decisions by mapping out different choices and their
possible outcomes. It’s used in machine learning for tasks like classification and prediction.
In this article, we’ll see more about Decision Trees, their types and other core concepts.
A Decision Tree helps us make decisions by showing different options and how they are
related. It has a tree-like structure that starts with one main question called the root node
which represents the entire dataset. From there, the tree branches out into different
possibilities based on features in the data.
 Root Node: Starting point representing the whole dataset.
 Branches: Lines connecting nodes showing the flow from one decision to another.
 Internal Nodes: Points where decisions are made based on data features.
 Leaf Nodes: End points of the tree where the final decision or prediction is made.
Decision Tree
A Decision Tree also helps with decision-making by showing possible outcomes clearly. By
looking at the "branches" we can quickly compare options and figure out the best choice.
There are mainly two types of Decision Trees based on the target variable:
1. Classification Trees: Used for predicting categorical outcomes like spam or not
spam. These trees split the data based on features to classify data into predefined
categories.
2. Regression Trees: Used for predicting continuous outcomes like predicting house
prices. Instead of assigning categories, it provides numerical predictions based on the
input features.
How Decision Trees Work?
1. Start with the Root Node: It begins with a main question at the root node which is
derived from the dataset’s features.
2. Ask Yes/No Questions: From the root, the tree asks a series of yes/no questions to split the
data into subsets based on specific attributes.
3. Branching Based on Answers: Each question leads to different branches:
 If the answer is yes, the tree follows one path.
 If the answer is no, the tree follows another path.
4. Continue Splitting: This branching continues through further decisions helps in reducing
the data down step-by-step.
5. Reach the Leaf Node: The process ends when there are no more useful questions to ask
leading to the leaf node where the final decision or prediction is made.
Let’s look at a simple example to understand how it works. Imagine we need to decide
whether to drink coffee based on the time of day and how tired we feel. The tree first checks
the time:
1. In the morning: It asks “Tired?”
 If yes, the tree suggests drinking coffee.
 If no, it says no coffee is needed.
2. In the afternoon: It asks again “Tired?”
 If yes, it suggests drinking coffee.
 If no, no coffee is needed.

Example
Splitting Criteria in Decision Trees
In a Decision Tree, the process of splitting data at each node is important. The splitting
criteria finds the best feature to split the data on. Common splitting criteria include Gini
Impurity and Entropy.
 Gini Impurity: This criterion measures how "impure" a node is. The lower the Gini
Impurity the better the feature splits the data into distinct categories.
 Entropy: This measures the amount of uncertainty or disorder in the data. The tree
tries to reduce the entropy by splitting the data on features that provide the most
information about the target variable.
These criteria help decide which features are useful for making the best split at each decision
point in the tree.
Pruning in Decision Trees
 Pruning is an important technique used to prevent overfitting in Decision Trees.
Overfitting occurs when a tree becomes too deep and starts to memorize the training
data rather than learning general patterns. This leads to poor performance on new,
unseen data.
 This technique reduces the complexity of the tree by removing branches that have
little predictive power. It improves model performance by helping the tree generalize
better to new data. It also makes the model simpler and faster to deploy.
 It is useful when a Decision Tree is too deep and starts to capture noise in the data.
Advantages of Decision Trees
 Easy to Understand: Decision Trees are visual which makes it easy to follow the
decision-making process.
 Versatility: Can be used for both classification and regression problems.
 No Need for Feature Scaling: Unlike many machine learning models, it don’t require
us to scale or normalize our data.
 Handles Non-linear Relationships: It capture complex, non-linear relationships
between features and outcomes effectively.
 Interpretability: The tree structure is easy to interpret helps in allowing users to
understand the reasoning behind each decision.
 Handles Missing Data: It can handle missing values by using strategies like
assigning the most common value or ignoring missing data during splits.
Disadvantages of Decision Trees
 Overfitting: They can overfit the training data if they are too deep which means they
memorize the data instead of learning general patterns. This leads to poor
performance on unseen data.
 Instability: It can be unstable which means that small changes in the data may lead to
significant differences in the tree structure and predictions.
 Bias towards Features with Many Categories: It can become biased toward
features with many distinct values which focuses too much on them and potentially
missing other important features which can reduce prediction accuracy.
 Difficulty in Capturing Complex Interactions: Decision Trees may struggle to
capture complex interactions between features which helps in making them less
effective for certain types of data.
 Computationally Expensive for Large Datasets: For large datasets, building and
pruning a Decision Tree can be computationally intensive, especially as the tree depth
increases.
Applications of Decision Trees
Decision Trees are used across various fields due to their simplicity, interpretability and
versatility lets see some key applications:
1. Loan Approval in Banking: Banks use Decision Trees to assess whether a loan
application should be approved. The decision is based on factors like credit score,
income, employment status and loan history. This helps predict approval or rejection
helps in enabling quick and reliable decisions.
2. Medical Diagnosis: In healthcare they assist in diagnosing diseases. For example,
they can predict whether a patient has diabetes based on clinical data like glucose
levels, BMI and blood pressure. This helps classify patients into diabetic or non-
diabetic categories, supporting early diagnosis and treatment.
3. Predicting Exam Results in Education: Educational institutions use to predict
whether a student will pass or fail based on factors like attendance, study time and
past grades. This helps teachers identify at-risk students and offer targeted support.
4. Customer Churn Prediction: Companies use Decision Trees to predict whether a
customer will leave or stay based on behavior patterns, purchase history, and
interactions. This allows businesses to take proactive steps to retain customers.
5. Fraud Detection: In finance, Decision Trees are used to detect fraudulent activities,
such as credit card fraud. By analyzing past transaction data and patterns, Decision
Trees can identify suspicious activities and flag them for further investigation.
Random Forest Regression in Python
A random forest is an ensemble learning method that combines the predictions from multiple
decision trees to produce a more accurate and stable prediction. It can be used for both
classification and regression tasks. In a regression task, we can use the Random Forest
Regression technique for predicting numerical values. It predicts continuous values by
averaging the results of multiple decision trees.
Working of Random Forest Regression
Random Forest Regression works by creating multiple of decision trees each trained on a
random subset of the data. The process begins with Bootstrap sampling where random rows
of data are selected with replacement to form different training datasets for each tree. After
this we do feature sampling where only a random subset of features is used to build each
tree ensuring diversity in the models.
After the trees are trained each tree make a prediction and the final prediction for regression
tasks is the average of all the individual tree predictions and this process is called
as Aggregation.

Random Forest Regression Model Working

This approach is beneficial because individual decision trees may have high variance and are
prone to overfitting especially with complex data. However by averaging the predictions
from multiple decision trees Random Forest minimizes this variance leading to more accurate
and stable predictions and hence improving generalization of model.

Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
12 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
29 pages
Analyssi 5
No ratings yet
Analyssi 5
1 page
Decision Trees: Classification & Regression Guide
No ratings yet
Decision Trees: Classification & Regression Guide
38 pages
Decision Tree
No ratings yet
Decision Tree
5 pages
ML Unit 3
No ratings yet
ML Unit 3
38 pages
Decision Tree and Random Forest Documentation
No ratings yet
Decision Tree and Random Forest Documentation
15 pages
Decision Tree: Splitting and Stopping Criteria
No ratings yet
Decision Tree: Splitting and Stopping Criteria
6 pages
Understanding Decision Trees in Depth
No ratings yet
Understanding Decision Trees in Depth
7 pages
Unit 3
No ratings yet
Unit 3
36 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
14 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
27 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
5 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
12 pages
Decision Trees and Random Forests Explained
No ratings yet
Decision Trees and Random Forests Explained
16 pages
Understanding Decision Trees Explained
No ratings yet
Understanding Decision Trees Explained
4 pages
Decision Tree Classification Explained
No ratings yet
Decision Tree Classification Explained
4 pages
Decision Tree Splitting Techniques
No ratings yet
Decision Tree Splitting Techniques
56 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
5 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
24 pages
Pa Unit-3 Part1
No ratings yet
Pa Unit-3 Part1
49 pages
Understanding Internal Nodes in Decision Trees
No ratings yet
Understanding Internal Nodes in Decision Trees
2 pages
Understanding Decision Trees in AI
No ratings yet
Understanding Decision Trees in AI
13 pages
Understanding Decision Tree Structure
No ratings yet
Understanding Decision Tree Structure
57 pages
Understanding Decision Trees in Data Science
No ratings yet
Understanding Decision Trees in Data Science
6 pages
Understanding Decision Tree Algorithms
100% (1)
Understanding Decision Tree Algorithms
57 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
52 pages
Beginner's Guide to Decision Trees
No ratings yet
Beginner's Guide to Decision Trees
10 pages
Decision Trees in Machine Learning
No ratings yet
Decision Trees in Machine Learning
30 pages
Decision Tree Overview and Assignment
No ratings yet
Decision Tree Overview and Assignment
15 pages
Understanding Decision Tree Algorithms
No ratings yet
Understanding Decision Tree Algorithms
29 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
35 pages
Introduction to Decision Trees in ML
No ratings yet
Introduction to Decision Trees in ML
12 pages
Decision Trees in Machine Learning
No ratings yet
Decision Trees in Machine Learning
15 pages
Machine Learning Algorithms - Decision Tree
No ratings yet
Machine Learning Algorithms - Decision Tree
2 pages
ML 3
No ratings yet
ML 3
21 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
10 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
15 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
67 pages
Decision Tree Learning Overview
No ratings yet
Decision Tree Learning Overview
22 pages
Understanding Decision Trees in CSE
No ratings yet
Understanding Decision Trees in CSE
5 pages
Decision Trees: Types, Overfitting & Pruning
No ratings yet
Decision Trees: Types, Overfitting & Pruning
20 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
10 pages
Decision Trees for Number Prediction
No ratings yet
Decision Trees for Number Prediction
4 pages
Understanding Decision Trees in AI
No ratings yet
Understanding Decision Trees in AI
3 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
43 pages
Understanding Decision Trees in Data Analysis
No ratings yet
Understanding Decision Trees in Data Analysis
9 pages
Classification Techniques Overview
No ratings yet
Classification Techniques Overview
70 pages
Decision Tree Algorithms in Machine Learning
No ratings yet
Decision Tree Algorithms in Machine Learning
54 pages
Decision Tree Uodated Final
No ratings yet
Decision Tree Uodated Final
28 pages
Decision Trees in AI: Overview & Uses
No ratings yet
Decision Trees in AI: Overview & Uses
28 pages
Decision Trees in Classification Systems
No ratings yet
Decision Trees in Classification Systems
25 pages
Overview of Decision Trees
No ratings yet
Overview of Decision Trees
5 pages
Understanding Decision Trees
No ratings yet
Understanding Decision Trees
45 pages
Decision Tree Learning Overview
No ratings yet
Decision Tree Learning Overview
33 pages
Decision Trees in Machine Learning Explained
No ratings yet
Decision Trees in Machine Learning Explained
13 pages
Understanding Decision Trees in ML
No ratings yet
Understanding Decision Trees in ML
11 pages
Clustering and Decision Trees Explained
No ratings yet
Clustering and Decision Trees Explained
34 pages
Decision Tree
No ratings yet
Decision Tree
17 pages
Hybrid Gesture Recognition Framework
No ratings yet
Hybrid Gesture Recognition Framework
6 pages
Psychoanalytic & Feminist Insights on Eliot
No ratings yet
Psychoanalytic & Feminist Insights on Eliot
8 pages
Indibidwal na Talaan ng Mag-aaral
No ratings yet
Indibidwal na Talaan ng Mag-aaral
1 page
Intersubjectivity in Philosophy Lesson 12
No ratings yet
Intersubjectivity in Philosophy Lesson 12
4 pages
Connectivism and Networked Learning Handouts
No ratings yet
Connectivism and Networked Learning Handouts
2 pages
Factors Influencing Literary Interest
91% (11)
Factors Influencing Literary Interest
3 pages
IBPS Clerk Exam Syllabus 2025 Overview
No ratings yet
IBPS Clerk Exam Syllabus 2025 Overview
4 pages
Change Management Strategies Explained
No ratings yet
Change Management Strategies Explained
2 pages
Lesson Justification for BTEC Sport Unit
No ratings yet
Lesson Justification for BTEC Sport Unit
9 pages
Goffman's Dramaturgical Approach Explained
No ratings yet
Goffman's Dramaturgical Approach Explained
20 pages
The Role of Education in Society
No ratings yet
The Role of Education in Society
2 pages
Class 12 Bio-Botany Practical Manual
No ratings yet
Class 12 Bio-Botany Practical Manual
31 pages
Sion School Annual Report 2024-2025
No ratings yet
Sion School Annual Report 2024-2025
7 pages
Avenor College Academic Calendar 2025
No ratings yet
Avenor College Academic Calendar 2025
3 pages
Daily Life Notes and Ideas
No ratings yet
Daily Life Notes and Ideas
8 pages
Lesson Plan: Coral Reef Ecosystems
No ratings yet
Lesson Plan: Coral Reef Ecosystems
2 pages
Analyzing Effects of Enjoyment and Item Experience Intention To Purchase Mobile Games Content
No ratings yet
Analyzing Effects of Enjoyment and Item Experience Intention To Purchase Mobile Games Content
20 pages
Discriminant Analysis in Business Research
No ratings yet
Discriminant Analysis in Business Research
29 pages
Corporate Public Relations Course Overview
No ratings yet
Corporate Public Relations Course Overview
8 pages
1BLIB1 Library and Society 1st Sem
No ratings yet
1BLIB1 Library and Society 1st Sem
5 pages
DOH Cybersecurity Training Plan 2024
No ratings yet
DOH Cybersecurity Training Plan 2024
3 pages
Creating a Fictional Country Guide
No ratings yet
Creating a Fictional Country Guide
22 pages
TypeScript Mastery: A Step-by-Step Guide
100% (12)
TypeScript Mastery: A Step-by-Step Guide
133 pages
Civics and Government Course Syllabus
No ratings yet
Civics and Government Course Syllabus
4 pages
Hostel Facilities and Regulations Overview
No ratings yet
Hostel Facilities and Regulations Overview
2 pages
2021 Scheme 7th and 8th Scheme and Syllabus-1
No ratings yet
2021 Scheme 7th and 8th Scheme and Syllabus-1
37 pages
Geometry PDF
100% (2)
Geometry PDF
285 pages
JJM Medical College Davangere Overview
No ratings yet
JJM Medical College Davangere Overview
10 pages
Self-Compassion: Key to Kindness
No ratings yet
Self-Compassion: Key to Kindness
3 pages
NUS Graduate Admission Handbook 2020
No ratings yet
NUS Graduate Admission Handbook 2020
39 pages

Class 3

Uploaded by

Class 3

Uploaded by

Decision Tree

Random Forest Regression Model Working

You might also like