0% found this document useful (0 votes)

3 views53 pages

Lec1-Introduction To Machine Learning

The document provides an introduction to machine learning, defining it as the ability of computers to learn from experience without explicit programming. It outlines various types of machine learning algorithms, including supervised, unsupervised, semi-supervised, and reinforcement learning, along with their applications in regression and classification tasks. Additionally, it discusses techniques such as gradient descent, feature scaling, and regularization to improve model performance and address issues like overfitting.

Uploaded by

daredevil039512

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views53 pages

Lec1-Introduction To Machine Learning

Uploaded by

daredevil039512

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Introduction to machine

learning
Kh. Aghajani
[Link]@[Link]
Machine Learning definition
• Arthur Samuel (1959). Machine Learning: Field of study
that gives computers the ability to learn without being
explicitly programmed.
• Tom Mitchell (1998) Well-posed Learning Problem: A
computer program is said to learn from experience E with
respect to some task T and some performance measure P, if
its performance on T, as measured by P, improves with
experience E.
Machine learning algorithms:
- Supervised learning
- Unsupervised learning
- Semi supervised
- Reinforcement learning
Supervised learning
• Supervised learning, also known as supervised machine learning, is a subcategory
of machine learning and artificial intelligence. It is defined by its use of labeled
datasets to train algorithms that to classify data or predict outcomes accurately.
Unsupervised learning
• Unsupervised learning, also known as unsupervised machine learning, uses
machine learning algorithms to analyze and cluster unlabeled datasets. These
algorithms discover hidden patterns or data groupings without the need for
human intervention.
Semi-supervised learning
Semi-supervised learning is a broad category of machine learning that uses labeled data to
ground predictions, and unlabeled data to learn the shape of the larger data distribution.
Reinforcement learning
Reinforcement learning is a machine learning training method based on rewarding desired behaviors and
punishing undesired ones. In general, a reinforcement learning agent -- the entity being trained -- is able to
perceive and interpret its environment, take actions and learn through trial and error.
Supervised learning-regression
• Purpose: Regression is used when the output variable (also known as the
dependent variable) is continuous. It predicts a numerical value or a real
number. For example, predicting house prices, temperature, stock prices, or
a person's age.
• Output: The output of a regression model is a continuous range of values. It
can be any real number, and the prediction typically falls within a specific
numerical range.
• Algorithms: Algorithms commonly used for regression tasks include linear
regression, polynomial regression, decision trees, support vector regression,
and neural networks, among others.
• Evaluation: Regression models are evaluated using metrics like Mean
Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Error
(MAE), which measure the accuracy of the model's numerical predictions.
Supervised learning- classification
• Purpose: Classification is used when the output variable is categorical. It predicts the class
or category to which a data point belongs. Common examples include email spam
classification (spam or not spam), image recognition (cat or dog), or disease diagnosis
(positive or negative).
• Output: The output of a classification model is a discrete category or label. It assigns data
points to predefined classes or categories.
• Algorithms: Algorithms used for classification tasks include logistic regression, decision
trees, support vector machines, k-nearest neighbors, Naive Bayes, and various deep
learning models such as convolutional neural networks (CNNs) and recurrent neural
networks (RNNs).
• Evaluation: Classification models are evaluated using metrics such as accuracy, precision,
recall, F1-score, and the area under the Receiver Operating Characteristic (ROC-AUC) curve,
depending on the specific problem and the desired trade-offs between true positives, true
negatives, false positives, and false negatives.
Linear regression

Housing Prices problem

Price
(in 1000s of dollars)

Size (feet2)

Supervised Learning
Regression Problem
Given the “right answer” for
each example in the data. Predict real-valued output
Size in feet2 (x) Price ($) in 1000's (y)

Training set of 2104 460

housing prices 1416 232
1534 315
852 178
Notation: … …
m = Number of training examples
x’s = “input” variable / features
y’s = “output” variable / “target” variable
Hypothesis:

Parameters:

Cost Function:

Goal:
(for fixed , this is a function of x) (function of the parameters )
(for fixed , this is a function of x) (function of the parameters )
(for fixed , this is a function of x) (function of the parameters )
Gradient descent method
Have some function
Want
Outline:
• Start with some
• Keep changing to reduce until we hopefully
end up at a minimum
The impact of initial point
at local optima

Current value of
The impact of learning rate

If α is too small, gradient descent can be slow.

If α is too large, gradient descent can overshoot the minimum. It may
fail to converge, or even diverge.
Linear regression-example
Linear regression with Multiple features (variables)
Size (feet2) Number of Number of Age of home Price ($1000)
bedrooms floors (years)

2104 5 1 45 460
1416 3 2 40 232
1534 3 2 30 315
852 2 1 36 178
… … … … …
Notation:
= number of features
= input (features) of training example.
= value of feature in training example.
Multivariate linear regression.

Hypothesis:
Parameters:
Cost function:

Gradient descent:
Repeat

(simultaneously update for every )

New algorithm :
Gradient Descent
Repeat
Previously (n=1):
Repeat
simultaneously update for

(simultaneously update )
Feature Scaling
Idea: Make sure features are on a similar scale.
E.g. = size (0-2000 feet2) size (feet2)

= number of bedrooms (1-5)

number of bedrooms
Feature Scaling

Get every feature into approximately a range.

Replace with (x i  i ) /  i to make features have approximately zero mean

(Do not apply to ).
Polynomial regression
training examples, features.
Gradient Descent Normal Equation
• Need to choose . • No need to choose .
• Needs many iterations. • Don’t need to iterate.
• Works well even when • Need to compute
is large.
• Slow if is very large.
Logistic Regression
(Classification)
Classification

Email: Spam / Not Spam?

Online Transactions: Fraudulent (Yes / No)?
Tumor: Malignant / Benign ?

0: “Negative Class” (e.g., benign tumor)

1: “Positive Class” (e.g., malignant tumor)
Logistic Regression Model
Interpretation of Hypothesis Output
= estimated probability that y = 1 on input x

Example: If

Tell patient that 70% chance of tumor being malignant

Decision Boundary
x2

x2
3 -1 1 x1
2 -1

1
Predict “ “ if
1 2 3
x1

Predict “ “ if
Training set:

m examples

How to choose parameters ?

Logistic regression - cost function
Logistic regression cost function

To fit parameters :

To make a prediction given new :

Output
Gradient Descent

Want :
Repeat

(simultaneously update all )

Linear regression : ℎ𝜃 𝑥 = 𝜃 𝑇 𝑥

Logistic regression :
Cross-entropy derivative
Multi-class classification:
One-vs-all
Binary classification: Multi-class Classification:

x2 x2

x1 x1
x2
One-vs-all (one-vs-rest):

x1
x2 x2

x1 x1
x2
Class 1:
Class 2:
Class 3:
x1
One-vs-all

Train a logistic regression classifier for each

class to predict the probability that .
On a new input , to make a prediction, pick the
class that maximizes
Regularization : Example: regression (housing prices)
Price

Price

Price
Size Size Size

Overfitting: If we have too many features, the learned hypothesis

may fit the training set very well ( ), but
fail to generalize to new examples (predict prices on new examples).
Example: Logistic regression

x2 x2 x2

x1 x1 x1

( = sigmoid function)
Addressing overfitting:
size of house
no. of bedrooms
no. of floors
age of house
average income in neighborhood
kitchen size
Cost function with considering Regularization term

Price
Price

Size of house Size of house

Suppose we penalize and make , really small.

+100𝜃32 + 100𝜃42
In regularized linear regression, we choose to minimize

What if is set to an extremely large value (perhaps for too large

for our problem, say )?
Price

Size of house
Gradient descent
Repeat
Regularized logistic regression.

x1
Cost function:

𝑛
𝜆
+ 𝜃𝑗2
2𝑚
𝑗=1

Logistic Regression Overview and Techniques
No ratings yet
Logistic Regression Overview and Techniques
55 pages
Unit 1
No ratings yet
Unit 1
82 pages
Machine Learning: Linear Regression Guide
No ratings yet
Machine Learning: Linear Regression Guide
36 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
15 pages
Introduction to Cognitive Science and Machine Learning
No ratings yet
Introduction to Cognitive Science and Machine Learning
14 pages
Understanding Linear Regression Basics
No ratings yet
Understanding Linear Regression Basics
101 pages
Supervised Machine Learning Basics
No ratings yet
Supervised Machine Learning Basics
6 pages
Machine Learning: Supervised & Unsupervised
No ratings yet
Machine Learning: Supervised & Unsupervised
75 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
19 pages
Supervised vs Unsupervised Learning Guide
No ratings yet
Supervised vs Unsupervised Learning Guide
22 pages
Machine Learning Basics: Supervised vs Unsupervised
No ratings yet
Machine Learning Basics: Supervised vs Unsupervised
38 pages
Understanding Linear Regression Basics
No ratings yet
Understanding Linear Regression Basics
34 pages
Gradient Descent for Multivariable Regression
No ratings yet
Gradient Descent for Multivariable Regression
101 pages
Supervised Learning: Regression vs. Classification
No ratings yet
Supervised Learning: Regression vs. Classification
10 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
8 pages
Supervised Learning and Classification Basics
No ratings yet
Supervised Learning and Classification Basics
34 pages
Machine Learning: Supervised & Unsupervised
No ratings yet
Machine Learning: Supervised & Unsupervised
31 pages
Understanding Linear Regression in ML
No ratings yet
Understanding Linear Regression in ML
37 pages
Linear Regression and Classification Techniques
No ratings yet
Linear Regression and Classification Techniques
42 pages
Supervised Learning Overview and Types
No ratings yet
Supervised Learning Overview and Types
31 pages
Linear Regression Techniques in Python
No ratings yet
Linear Regression Techniques in Python
25 pages
Machine Learning Notes Dtu Unit 1
No ratings yet
Machine Learning Notes Dtu Unit 1
130 pages
Key Concepts in Regression Models
No ratings yet
Key Concepts in Regression Models
26 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
7 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
48 pages
Machine Learning Concepts and Techniques
No ratings yet
Machine Learning Concepts and Techniques
155 pages
01B DL2023 LinearModels
No ratings yet
01B DL2023 LinearModels
47 pages
Lasso Regression in Logistic Models
No ratings yet
Lasso Regression in Logistic Models
43 pages
Lec 02
No ratings yet
Lec 02
36 pages
Types of Machine Learning Explained
No ratings yet
Types of Machine Learning Explained
50 pages
Linier & Logistic
No ratings yet
Linier & Logistic
15 pages
Overfitting in Linear Regression Explained
No ratings yet
Overfitting in Linear Regression Explained
8 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
10 pages
E-Note 47072 Content Document 20251208100443AM
No ratings yet
E-Note 47072 Content Document 20251208100443AM
187 pages
Overview of Supervised Learning
No ratings yet
Overview of Supervised Learning
24 pages
Understanding Linear Regression in ML
No ratings yet
Understanding Linear Regression in ML
41 pages
Machine Learning: Regression Techniques
No ratings yet
Machine Learning: Regression Techniques
24 pages
Decision Tree Predictions in ML
No ratings yet
Decision Tree Predictions in ML
86 pages
Machine Learningggggggggggggggg
No ratings yet
Machine Learningggggggggggggggg
14 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
17 pages
Overview of Regression Techniques
No ratings yet
Overview of Regression Techniques
48 pages
Understanding Linear Regression Basics
No ratings yet
Understanding Linear Regression Basics
68 pages
8 Machine Learning Models Overview
No ratings yet
8 Machine Learning Models Overview
31 pages
Analytical Solution in Linear Regression
No ratings yet
Analytical Solution in Linear Regression
21 pages
Linear and Logistic Regression Basics
No ratings yet
Linear and Logistic Regression Basics
60 pages
Understanding Linear Regression Techniques
No ratings yet
Understanding Linear Regression Techniques
54 pages
Supervised Machine Learning Techniques Guide
No ratings yet
Supervised Machine Learning Techniques Guide
131 pages
Machine Learning Course Syllabus
No ratings yet
Machine Learning Course Syllabus
56 pages
Regression and Logistic Models Explained
No ratings yet
Regression and Logistic Models Explained
46 pages
Machine Learning Module 2 Notes
No ratings yet
Machine Learning Module 2 Notes
12 pages
Linear Regression Explained: Models & Methods
No ratings yet
Linear Regression Explained: Models & Methods
52 pages
Machine Learning: Feature Scaling & Regression
No ratings yet
Machine Learning: Feature Scaling & Regression
33 pages
Understanding Supervised Machine Learning
No ratings yet
Understanding Supervised Machine Learning
44 pages
Tom Mitchell's Machine Learning Definition
No ratings yet
Tom Mitchell's Machine Learning Definition
10 pages
Understanding Regression in Machine Learning
No ratings yet
Understanding Regression in Machine Learning
137 pages
Application for Maths and Science Teacher
No ratings yet
Application for Maths and Science Teacher
2 pages
25 Super-Fun Spelling Games
100% (3)
25 Super-Fun Spelling Games
64 pages
Understanding IVLE for Effective Communication
No ratings yet
Understanding IVLE for Effective Communication
34 pages
Syntax Tree Analysis in Poetry
No ratings yet
Syntax Tree Analysis in Poetry
7 pages
Mobile Apps and Vocabulary Learning
No ratings yet
Mobile Apps and Vocabulary Learning
6 pages
Kaggle Competition Participation Guide
100% (1)
Kaggle Competition Participation Guide
74 pages
Insights on Sexual Morality and Self
No ratings yet
Insights on Sexual Morality and Self
3 pages
Descriptive Writing with Sensory Details
No ratings yet
Descriptive Writing with Sensory Details
6 pages
Effective Study and Communication Skills
No ratings yet
Effective Study and Communication Skills
9 pages
Impact of Tech on Accounting Learning
No ratings yet
Impact of Tech on Accounting Learning
6 pages
Katie Hong: Passionate Educator Profile
No ratings yet
Katie Hong: Passionate Educator Profile
1 page
English Debate Lesson Plan for Class 3/3
No ratings yet
English Debate Lesson Plan for Class 3/3
4 pages
Analog IP Cores for ASIC Design Reuse
No ratings yet
Analog IP Cores for ASIC Design Reuse
2 pages
Usability Experience
No ratings yet
Usability Experience
1,026 pages
Nursing Education Philosophy Explained
No ratings yet
Nursing Education Philosophy Explained
15 pages
ECCE Guia de Ejemplo Del Examen
No ratings yet
ECCE Guia de Ejemplo Del Examen
59 pages
Deep Generative Models Overview
No ratings yet
Deep Generative Models Overview
49 pages
AI-Based Career Counseling System
No ratings yet
AI-Based Career Counseling System
4 pages
Key Factors in Multilingual Success
100% (5)
Key Factors in Multilingual Success
16 pages
Understanding African Literature Insights
No ratings yet
Understanding African Literature Insights
6 pages
AI Programming Course Overview
No ratings yet
AI Programming Course Overview
1 page
Human Performance Training Overview
No ratings yet
Human Performance Training Overview
116 pages
Problem Solving Agents in AI
100% (1)
Problem Solving Agents in AI
11 pages
Grade 2 Daily Lesson Log: January 2023
No ratings yet
Grade 2 Daily Lesson Log: January 2023
7 pages
Unlocking The Potential of ChatGPT
96% (25)
Unlocking The Potential of ChatGPT
45 pages
2023 Summer Learning Recovery Plan Template
No ratings yet
2023 Summer Learning Recovery Plan Template
5 pages
Enhancing Behavioral Safety Culture
No ratings yet
Enhancing Behavioral Safety Culture
18 pages
William James' Theory of the Self
No ratings yet
William James' Theory of the Self
8 pages
Constructivism in CS Education Analysis
No ratings yet
Constructivism in CS Education Analysis
20 pages
Fuzzy Logic Edge Detection in ImageJ
No ratings yet
Fuzzy Logic Edge Detection in ImageJ
15 pages

Lec1-Introduction To Machine Learning

Uploaded by

Lec1-Introduction To Machine Learning

Uploaded by

Introduction to machine

Housing Prices problem

Training set of 2104 460

If α is too small, gradient descent can be slow.

(simultaneously update for every )

= number of bedrooms (1-5)

Get every feature into approximately a range.

Replace with (x i  i ) /  i to make features have approximately zero mean

Email: Spam / Not Spam?

0: “Negative Class” (e.g., benign tumor)

Tell patient that 70% chance of tumor being malignant

How to choose parameters ?

To make a prediction given new :

(simultaneously update all )

Train a logistic regression classifier for each

Overfitting: If we have too many features, the learned hypothesis

Size of house Size of house

Suppose we penalize and make , really small.

What if is set to an extremely large value (perhaps for too large

You might also like