AIFinance 2a - Introduction To Supervised Learning

The document provides an overview of supervised learning, detailing parametric models like linear and logistic regression, as well as non-parametric models such as decision trees. It explains the objectives of supervised algorithms, including classification and regression tasks, and discusses the training process and optimization techniques like gradient descent. Additionally, it covers the decision-making process in building decision trees, focusing on information gain and the challenges of selecting attributes and thresholds for splits.

Uploaded by

phiklongk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views42 pages

AIFinance 2a - Introduction To Supervised Learning

Uploaded by

phiklongk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Dr Ho Diep

Date: 27-Aug-2025

AIFinance 2a:
Introduction to
Supervised
Learning
1
Intro Class: Overview

● Principles of Supervised Learning

● Parametric Models: Linear regression

● Parametric Models: Logistic regression

● Non-Parametric Models: Decision Trees and Random Forests

Introduction to the principles of Supervised Learning -
Categorisation of Supervised models
● In some instances, all explanatory
features are considered on the same
footing. This is typically the case with
regressions, and related ones such as
Logit (Parametric models).
● Alternatively, the explanatory features
may be ordered in a successive manner in
order to refine the selection effort (non
Parametric models).
● In the first instance, fitting a model means
finding the optimal weights applied to
each feature. In the second instance
finding the optimal model means ordering
the most relevant features and finding the
best cut-offs at each step.
Parametric Models
Supervised Learning – Parametric Models -
Setting up the Objective
● Supervised Learning is the process of learning a function which maps input data to an output
based on several input-output pairs. Let's detail the process:
○ First, we have a dataset of pairs {features, target} = {(Xi , Yi )1<i<n } over𝙓 x 𝗬
○ Typically : 𝙓 = RD and 𝗬 = {0, 1}.
○ The pairs {(Xi , Yi )1<i<n } are assumed to be independent and identically distributed (i.i.d.)
following an unknown distribution. It is important to mention here that we assume no
sequentiality in the data.
● Example:
○ Let’s consider this small dataset: We try to predict whether a student will fail or pass the
final exam based on some feature values.
○ Yi = 1 if the student pass, Yi = 0 if he fails.
○ For each Xi , the first coordinate represents the number of hours spent on the course, the
second coordinate is the average intermediary quiz mark and the third coordinate is the
number of hours spent on the coursework.
Setting up the Objective

● A Supervised Algorithm is an algorithm that aims at building a predictor (i.e, a function

which minimizes an error, based on the dataset.

● In the previous example, our objective was to predict a discrete value : pass or fail (1 or 0). This
supervised task is called classification.
● We can also try to predict a continuous value: the final exam mark for instance. In that case, the
task is called regression.
Setting up the Objective

● To define the error, we need first to define a loss function (i.e, a function which
measures the "distance" between the the output of the predictor and the true labels (Yi )1<i<n ).
● For the loss function, we usually choose for all pairs (output, true label),

● We then define the following error l risk associated to the predictor , the aggregate loss over the
train set, which follows an unknown distribution :

● Our objective is to find the optimal predictor among all the possible functions defined by the
modeler (e.g. Logit, linear regression, etc) :

● Since is unknown, we optimize (minimize) the empirical cumulative risk :

Linear Regression
Linear Regression

● Let us start with the simplest regression model : Linear Regression.

● Consider the following (fake) dataset representing the salary (in the x axis) of some (fake)
employees according to the number of years of experience (in the y axis).
Setting up the Objective:

● We would like to find a way to define the red line from the pairs (experience, salary) represented
by the blue points.
● In that way, we could assign an estimated salary to each value of the experience variable.
Linear Regression: a Mathematical Perspective
The training process:
The training process:

● In optimization matters, we usually prefer to minimize functions instead of maximizing them.

● Thus, we transform the likelihood maximization problem into the equivalent cost minimization
problem, where the cost is the following negative log-likelihood:

● The training problem can then be written as the following equivalent minimization problem:
Matrix Notation and Optimization:
Using a Gradient Descent for Optimization
Using a Gradient Descent for Optimization
Logistic Regression
Logistic Regression
Introduction:
● The Logistic Regression is one of the easiest classification models to implement. It also
performs very well on linearly separable classes.
● We call decision boundary the hypersurface separating the space of input data between two
subsets, one for each class. The classifier will classify all the points belonging in one side of the
decision boundary as belonging in one class and all those on the other side as belonging in the
other class.
● In the case of a Logistic Regression, the decision boundary is a hyperplane.
● The following scatterplot of the public Iris dataset shows a linear decision boundary associated
with Logistic Regression.
Presenting the logit function:
The sigmoid function and the Logistic Regression
Model
The prediction phase after training:
The Training Process: finding the optimal w
Using a Gradient Descent for Optimization
Non Parametric Models
A high level description of the algorithm

● The DT algorithm is basically just a bunch of nested if-statements on the input features (also
called attributes) in the training dataset.
● The decision algorithm:
○ We start at the tree root (with the whole dataset)
○ Then we split the dataset on the attribute that results in the largest Information Gain (IG).
○ We iterate the splitting procedure at each child node until the leaves are pure (which means
that the samples at the leaves belong to the same class)
○ A very deep tree is prone to overfitting. To avoid that, we set a limit for the maximal depth
of the tree.
Building Decision Trees

● First, we need to define an objective function that we want to optimize (Information Gain).
● Then, at each iteration, two challenges arise when trying to choose the best split.
○ How do we choose the best attribute responsible for the split ?
○ How do we choose the threshold when splitting based on the "best attribute" ?
Summary

Linier & Logistic
No ratings yet
Linier & Logistic
15 pages
Supervised Learning and Classification Basics
No ratings yet
Supervised Learning and Classification Basics
34 pages
Machine Learning Concepts Explained
No ratings yet
Machine Learning Concepts Explained
26 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
15 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
48 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
18 pages
Understanding Classification Algorithms
No ratings yet
Understanding Classification Algorithms
46 pages
Regression and Logistic Models Explained
No ratings yet
Regression and Logistic Models Explained
46 pages
Unit 1
No ratings yet
Unit 1
82 pages
Supervised Learning: Linear Regression Guide
No ratings yet
Supervised Learning: Linear Regression Guide
9 pages
Machine Learning Classification Overview
No ratings yet
Machine Learning Classification Overview
214 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
47 pages
Understanding Supervised Machine Learning
No ratings yet
Understanding Supervised Machine Learning
44 pages
Unit 2 Final
No ratings yet
Unit 2 Final
94 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
20 pages
Unit IV Supervised Machine Learning For Financial Data Analysis
No ratings yet
Unit IV Supervised Machine Learning For Financial Data Analysis
60 pages
Comparing ML Algorithms and Loss Functions
No ratings yet
Comparing ML Algorithms and Loss Functions
14 pages
Machine Learning: Linear Regression Guide
No ratings yet
Machine Learning: Linear Regression Guide
36 pages
Training Linear Regression Models Guide
No ratings yet
Training Linear Regression Models Guide
52 pages
Machine Learning: Model Building Complete Study Notes
No ratings yet
Machine Learning: Model Building Complete Study Notes
18 pages
Key Concepts in Machine Learning
No ratings yet
Key Concepts in Machine Learning
47 pages
Advanced Machine Learning Techniques
No ratings yet
Advanced Machine Learning Techniques
136 pages
Unit 3
No ratings yet
Unit 3
77 pages
Overview of Supervised Learning
No ratings yet
Overview of Supervised Learning
24 pages
Machine Learning Overview and Techniques
No ratings yet
Machine Learning Overview and Techniques
7 pages
Linear Regression and Machine Learning Techniques
No ratings yet
Linear Regression and Machine Learning Techniques
86 pages
Lec1-Introduction To Machine Learning
No ratings yet
Lec1-Introduction To Machine Learning
53 pages
Tom Mitchell's Machine Learning Definition
No ratings yet
Tom Mitchell's Machine Learning Definition
10 pages
Training Machine Learning Models Overview
No ratings yet
Training Machine Learning Models Overview
83 pages
Capstone Project
No ratings yet
Capstone Project
20 pages
Introduction to Cognitive Science and Machine Learning
No ratings yet
Introduction to Cognitive Science and Machine Learning
14 pages
ML - Supervised and Unsupervised Learning
No ratings yet
ML - Supervised and Unsupervised Learning
146 pages
Supervised Learning & Logistic Regression Guide
No ratings yet
Supervised Learning & Logistic Regression Guide
10 pages
Understanding Machine Learning Concepts
No ratings yet
Understanding Machine Learning Concepts
117 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
12 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
47 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
8 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
7 pages
Supervised vs Unsupervised Learning Explained
No ratings yet
Supervised vs Unsupervised Learning Explained
11 pages
Supervised Machine Learning: Linear Models and Fundamentals
No ratings yet
Supervised Machine Learning: Linear Models and Fundamentals
49 pages
Logistic Regression in Python Guide
No ratings yet
Logistic Regression in Python Guide
94 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
127 pages
Foundational Machine Learning Concepts
No ratings yet
Foundational Machine Learning Concepts
22 pages
Module 5
No ratings yet
Module 5
19 pages
Supervised Learning: Classification & Regression
No ratings yet
Supervised Learning: Classification & Regression
307 pages
Supervised Learning: Regression Explained
No ratings yet
Supervised Learning: Regression Explained
155 pages
Regression, Loss and Cost Functions
No ratings yet
Regression, Loss and Cost Functions
38 pages
ML Lec2 Regression and SL
No ratings yet
ML Lec2 Regression and SL
17 pages
Logistic Regression Overview and Methods
No ratings yet
Logistic Regression Overview and Methods
62 pages
Overview of Logistic Regression & Decision Trees
No ratings yet
Overview of Logistic Regression & Decision Trees
27 pages
Independence of Events in Machine Learning
No ratings yet
Independence of Events in Machine Learning
39 pages
Machine Learning Concepts Explained
No ratings yet
Machine Learning Concepts Explained
34 pages
Machine Learning Algorithms Explained
No ratings yet
Machine Learning Algorithms Explained
77 pages
Introduction to Machine Learning Basics
No ratings yet
Introduction to Machine Learning Basics
46 pages
Machine Learning Basics in Drug Discovery
No ratings yet
Machine Learning Basics in Drug Discovery
6 pages
Linear Regression in Machine Learning
No ratings yet
Linear Regression in Machine Learning
91 pages
Machine Learning Overview and Applications
No ratings yet
Machine Learning Overview and Applications
22 pages
AI Fundamentals Exam Questions
No ratings yet
AI Fundamentals Exam Questions
22 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
26 pages
AI in Acoustic Wildlife Monitoring Review
No ratings yet
AI in Acoustic Wildlife Monitoring Review
20 pages
AWS Machine Learning Essentials Guide
No ratings yet
AWS Machine Learning Essentials Guide
7 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
35 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
44 pages
Machine Learning Model Evaluation Methods
No ratings yet
Machine Learning Model Evaluation Methods
28 pages
Machine Learning Exam Insights
No ratings yet
Machine Learning Exam Insights
3 pages
Data Science Interview Insights
100% (1)
Data Science Interview Insights
68 pages
Intro to Machine Learning Course 4350702
No ratings yet
Intro to Machine Learning Course 4350702
11 pages
Neural Networks: Activation Functions & Models
No ratings yet
Neural Networks: Activation Functions & Models
25 pages
Statistical Tests for Classifier Comparison
No ratings yet
Statistical Tests for Classifier Comparison
30 pages
AI in Financial Statement Audits
No ratings yet
AI in Financial Statement Audits
91 pages
Neural Networks Course Overview
No ratings yet
Neural Networks Course Overview
36 pages
Voice Gender Detection via ML Algorithms
No ratings yet
Voice Gender Detection via ML Algorithms
16 pages
SPE-195068-MS When Petrophysics Meets Big Data: What Can Machine Do?
No ratings yet
SPE-195068-MS When Petrophysics Meets Big Data: What Can Machine Do?
25 pages
Supervised Learning Algorithms Cheat Sheet
No ratings yet
Supervised Learning Algorithms Cheat Sheet
20 pages
Land Cover Classification in Iran
No ratings yet
Land Cover Classification in Iran
9 pages
Machine Learning Tutorial For Beginners
No ratings yet
Machine Learning Tutorial For Beginners
15 pages
Enhancing Food Integrity Through Artificial Intelligence and Machine Learning: A Comprehensive Review
No ratings yet
Enhancing Food Integrity Through Artificial Intelligence and Machine Learning: A Comprehensive Review
28 pages
R Programming for Data Science Models
No ratings yet
R Programming for Data Science Models
15 pages
Dynamic Load Modeling - PSSE - Gyawali - S - T - 2020
No ratings yet
Dynamic Load Modeling - PSSE - Gyawali - S - T - 2020
96 pages
Data Science Fundamentals Overview
100% (1)
Data Science Fundamentals Overview
31 pages
Deep Learning for Asset Health Prognostics
No ratings yet
Deep Learning for Asset Health Prognostics
74 pages
Aleatoric vs Epistemic Uncertainty in ML
No ratings yet
Aleatoric vs Epistemic Uncertainty in ML
59 pages
ML Techniques for Depression Prediction
No ratings yet
ML Techniques for Depression Prediction
76 pages