0% found this document useful (0 votes)

5 views23 pages

Understanding Machine Learning Types

The document discusses the concept of learning in agents, emphasizing the importance of adaptability in unpredictable environments and the limitations of pre-programmed solutions. It outlines various forms of learning, including supervised, unsupervised, and reinforcement learning, detailing their methodologies, advantages, and disadvantages. Additionally, it covers techniques like gradient descent and Hebbian learning, highlighting their roles in optimizing machine learning models.

Uploaded by

prabhatgt421

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views23 pages

Understanding Machine Learning Types

Uploaded by

prabhatgt421

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

LEARNING

➢ An agent is learning if it improves its performance on future tasks

after making observations about the world.

Why learning?

Why would we want an agent to learn? If the design of the agent can
be improved, why wouldn’t the designers just program in that
improvement to begin with?

There are three main reasons:

➢ First, the designers cannot anticipate all possible situations that

the agent might find itself in. For example, a robot designed to
navigate mazes must learn the layout of each new maze it
encounters.
➢ Second, the designers cannot anticipate all changes over time.
For example, a program designed to predict tomorrow’s stock
market prices must learn to adapt when conditions change from
boom to bust.
➢ Third, sometimes human programmers have no idea how to
program a solution themselves. For example, most people are
good at recognizing the faces of family members, but even the
best programmers are unable to program a computer to
accomplish that task, except by using learning algorithms.
FORMS OF LEARNING

➢ Any component of an agent can be improved by learning from data.

The improvements, and the techniques used to make them, depend
on four major factors:
• Which component is to be improved
• What prior knowledge the agent already has.
• What representation is used for the data and the component.
• What feedback is available to learn from.
➢ Most of current machine learning research covers inputs that form a
factored representation (a vector of attribute values) and outputs
that can be either a continuous numerical value or a discrete value.
➢ Learning a (possibly incorrect) general function or rule from
specific input–output pairs is called inductive learning.
➢ Analytical or deductive learning: going from a known general rule
to a new rule that is logically entailed, but is useful because it allows
more efficient processing.

Feedback to learn from

There are three types of feedback that determine the four main types
of learning:
1. Supervised learning
2. Un-supervised learning
3. Reinforcement learning
4. Semi-Supervised learning

1. Supervised Learning
➢ Supervised learning is an ML method in which a model learns
from a labeled dataset containing input-output pairs.
➢ Each input in the dataset has a corresponding correct output (the
label), and the model's task is to learn the relationship between the
inputs and outputs.
➢ This enables the model to make predictions on new, unseen data
by applying the learned mapping.

Example of Supervised Learning

Predicting house prices: The input might be house features such as

size, location, and number of bedrooms, and the output would be the
house price. The supervised learning model would learn the
relationship between these features and house prices from historical
data, and then it could predict prices for new houses entering the
market.
The task of supervised learning is this:

Given a training set of N example input–output pairs

(x1, y1),(x2, y2),...(xN , yN )

where each yj was generated by an unknown function y = f(x),

discover a function h that approximates the true function f.

Here x and y can be any value; they need not be numbers. The
function h is a hypothesis. Learning is a search through the space of
possible hypotheses for one that will perform well, even on new
examples beyond the training set.

We say a hypothesis generalizes well if it correctly predicts the value

of y. Sometimes the function f is stochastic (i.e. it is not strictly a
function of x), and what we have to learn is a conditional probability
distribution, P(Y | x).
Categories of Supervised Learning

• Regression: When y is a number (such as tomorrow’s

temperature), the learning problem is called regression.

When dealing with real-valued output variables like "price" or

"temperature," several popular Regression algorithms come into
play, such as the Simple Linear Regression Algorithm, Multivariate
Regression Algorithm, Decision Tree Algorithm, and Lasso
Regression.

• Classification: When the output y is one of a finite set of values

(such as sunny, cloudy or rainy), the learning problem is called
classification, and is called Boolean or binary classification if there
are only two values.

In instances where the output variable is a category, like

distinguishing between 'spam' and 'not spam' in email filtering,
several widely-used classification algorithms come into play.
These encompass the following algorithms: Random Forest,
Decision Tree, Logistic Regression, and Support Vector Machine.
Advantages

• Gathers previous data, which helps in learning from past

mistakes.
• It is a powerful tool of AI that can perform plenty of business
functions single-handedly.
• It is a more trustworthy algorithm.

Disadvantages

• Difficult to classify huge data sets.

• It requires a certain level of expertise to operate.
• It is time intensive.

2. Un-Supervised Learning :-
➢ In unsupervised learning the agent learns patterns in the input even
though no explicit feedback is supplied.
➢ These algorithms discover hidden patterns or data groupings
without the need for human intervention.
➢ Un-supervised learning builds a concise representation of the data
and generate imaginative content from it.
Types of Unsupervised Learning
Unsupervised learning can be broken down into three main tasks:
i. Clustering
ii. Association rules
iii. Dimensionality reduction.

Clustering
➢ Clustering is a data mining technique which groups unlabeled
data based on their similarities or differences.
➢ Clustering algorithms are used to process raw, unclassified data
objects into groups represented by structures or patterns in the
information.
➢ Clustering algorithms can be categorized into different types of
clustering; for example:
• Exclusive clustering: Data is grouped such that a single
data point exclusively belongs to one cluster.
• Overlapping clustering: A soft cluster in which a single
data point may belong to multiple clusters with varying
degrees of membership.
• Hierarchical clustering: A type of clustering in which
groups are created such that similar instances are within
the same group and different objects are in other groups.
• Probalistic clustering: Clusters are created using
probability distribution.

Association Rule Mining

➢ Association rule mining algorithms have been popularized
through market basket analyses, leading to different
recommendation engines for music platforms and online
retailers.
➢ They are used within transactional datasets to identify frequent
item sets, or collections of items, to identify the likelihood of
consuming a product given the consumption of another
product.
➢ The most widely used algorithm for association rule learning is
the Apriori algorithm. However, other algorithms are used for
this type of unsupervised learning, such as the Eclat and FP-
growth algorithms.

Dimensionality reduction
➢ While more data generally yields more accurate results, it can
also impact the performance of machine learning algorithms
(e.g. overfitting) and it can also make it difficult to visualize
datasets.
➢ Dimensionality reduction is a technique used when the number
of features, or dimensions, in a given dataset is too high.
➢ It reduces the number of data inputs to a manageable size while
also preserving the integrity of the dataset as much as possible.
It is commonly used in the preprocessing data stage.

Advantages of Unsupervised Learning

• Uncovering hidden patterns and structures in data without

needing labeled examples.
• Ability to explore and discover insights from large and
complex datasets.
• Flexibility in handling diverse data types and domains.
• Useful for exploratory data analysis and feature engineering.
• Can be applied in scenarios where labeled data is scarce or
unavailable.

Disadvantages of Unsupervised Learning

• Lack of clear objective metrics for evaluating model

performance.
• Difficulty in interpreting and validating the learned patterns or
clusters.
• Sensitivity to noise and outliers in the data, leading to
potentially misleading results.
• Potential scalability issues with large datasets and high-
dimensional feature spaces.

Reinforcement learning
➢ In reinforcement learning the agent learns from a series of
reinforcements (rewards or punishments).
➢ Reinforcement learning problems involve learning what to
do—how to map situations to actions—so as to maximize a
numerical reward signal.
➢ Moreover, the learner is not told which actions to take, as in
many forms of machine learning, but instead must discover
which actions yield the most reward by trying them out.

Markov decision process

➢ The reinforcement learning agent learns about a problem by

interacting with its environment. The environment provides
information on its current state. The agent then uses that
information to determine which actions(s) to take.
➢ If that action obtains a reward signal from the surrounding
environment, the agent is encouraged to take that action again
when in a similar future state. This process repeats for every
new state thereafter.
➢ The task of reinforcement learning is to use observed rewards
to learn an optimal (or nearly optimal) policy for the
environment. An optimal policy is a policy that maximizes the
expected total reward.

For example, the lack of a tip at the end of the journey gives the
taxi agent an indication that it did something wrong. The two points
for a win at the end of a chess game tells the agent it did something
right. It is up to the agent to decide which of the actions prior to the
reinforcement were most responsible for it.

Exploration-exploitation trade-off

➢ One of the challenges that arise in reinforcement learning, and

not in other kinds of learning, is the trade-off between
exploration and exploitation.
➢ To obtain a lot of reward, a reinforcement learning agent must
prefer actions that it has tried in the past and found to be
effective in producing reward.
➢ But to discover such actions, it has to try actions that it has not
selected before. The agent has to exploit what it already knows
in order to obtain reward, but it also has to explore in order to
make better action selections in the future. T
➢ he dilemma is that neither exploration nor exploitation can be
pursued exclusively without failing at the task.
➢ The agent must try a variety of actions and progressively favor
those that appear to be best.

Components of reinforcement learning

Beyond the agent-environment-goal, four principal sub-elements
characterize reinforcement learning problems.
- Policy. This defines the RL agent’s behavior by mapping
perceived environmental states to specific actions the agent must
take when in those states.

- Reward signal. This designates the RL problem’s goal. Each of

the RL agent’s actions either receives a reward from the
environment or not. The agent’s only objective is to maximize its
cumulative rewards from the environment.

- Value function. Reward signal differs from value function in that

the former denotes immediate benefit while the latter specifies long-
term benefit. Value refers to a state’s desirability per all of the states
(with their incumbent rewards) that are likely to follow.

- Model. This is an optional sub-element of reinforcement learning

systems. Models allow agents to predict environment behavior for
possible actions.

Benefits

• Ability to Learn Optimal Strategies Through Trial and Error

• Scalability to Complex Decision-Making Problems

• Flexibility in Adapting to New Information

• Potential for High Autonomy and Reduced Human
Supervision

• Efficiency in Handling Long-Term Sequential Decision-

Making

Limitations

• Susceptibility to High Variance and Instability

• Dependency on Large Amounts of Environmental Interaction

Data

• Difficulty in Specifying Reward Functions

• Limited Transferability Between Different Tasks

• Ethical and Safety Concerns in Autonomous Decision-

Making
Gradient Descent Learning
➢ Gradient descent is an optimization algorithm that’s used when
training a machine learning model. It’s based on a convex
function and tweaks its parameters iteratively to minimize a
given function to its local minimum.
➢ It trains machine learning models by minimizing errors
between predicted and actual results.

What is a Gradient?

➢ A gradient simply measures the change in all weights with

regard to the change in error. You can also think of a gradient
as the slope of a function.
➢ The higher the gradient, the steeper the slope and the faster
a model can learn. But if the slope is zero, the model stops
learning. In mathematical terms, a gradient is a partial
derivative with respect to its inputs.

How Does Gradient Descent Work?

➢ Instead of climbing up a hill, think of gradient descent as
hiking down to the bottom of a valley. The equation below
describes what the gradient descent algorithm does:

𝑏 = 𝑎 − 𝛾 ∇𝑓(𝑎)
Where,

a = Current position if climber

b = next position of climber

 = the learning rate

f(a) = the gradient of the loss function with respect to

the parameters

➢ This formula basically tells us the next position we need to go,

which is the direction of the steepest descent.

Types of Gradient Descent

Batch Gradient Descent

➢ Batch gradient descent, also called vanilla gradient descent,

calculates the error for each example within the training
dataset, but it only gets updated after all training examples
have been evaluated.
Stochastic Gradient Descent

➢ Stochastic gradient descent (SGD) does this for each training

example within the dataset, meaning it updates the parameters
for each training example one by one.

𝑏𝑡+1 = 𝑏𝑡 − 𝛾 ∇𝑓(𝑏𝑡 ; 𝑥𝑖 )

Mini-Batch Gradient Descent

➢ Mini-batch gradient descent is the go-to method since it’s a

combination of the concepts of SGD and batch gradient
descent. It simply splits the training dataset into small batches
and performs an update for each of those batches.
Hebbian learning
➢ The neuroscientific concept of Hebbian learning was
introduced by Donald Hebb in his 1949 publication of The
Organization of Behaviors. Also known as Hebb’s Rule or Cell
Assembly Theory,
➢ The basis of the theory is when our brains learn something
new, neurons are activated and connected with other neurons,
forming a neural network. These connections start off weak,
but each time the stimulus is repeated, the connections grow
stronger and stronger, and the action becomes more intuitive.
➢ Hebb or Hebbian learning rule comes under Artificial Neural
Network (ANN) which is an architecture of a large number of
interconnected elements called neurons.
➢ These neurons process the input received to give the desired
output. The nodes or neurons are linked by inputs
(x1,x2,x3…xn), connection weights (w1,w2,w3…wn),
and activation functions(a function that defines the output of a
node).
➢ This network is suitable for bipolar data. The Hebbian learning
rule is generally applied to logic gates.
The weights are updated as:

W (new) = w (old) + x*y

Training Algorithm For Hebbian Learning Rule

The training steps of the algorithm are as follows:

• Initially, the weights are set to zero, i.e. w =0 for all inputs i =1
to n and n is the total number of input neurons.
• Let s be the output. The activation function for inputs is
generally set as an identity function.
• The activation function for output is also set to y= t.
• The weight adjustments and bias are adjusted to:

• The steps 2 to 4 are repeated for each input vector and output.
Competitive Learning
➢ Competitive learning is a form of unsupervised learning in
artificial neural networks, in which nodes compete for the right
to respond to a subset of the input data.
➢ Models and algorithms based on the principle of competitive
learning include winner-take-all nets, vector quantization, self-
organizing maps, etc.

Architecture of Competitive Learning

Implementation of Competitive Learning
• Competitive learning is usually implemented with neural
networks that contain a hidden layer which is commonly
known as "competitive layer".
• Every competitive neuron is described by a vector or weights
𝑤𝑖 = (𝑤𝑖1 , … , 𝑤𝑖𝑑 )𝑇 , 𝑖 = 1, … , M and calculates the similarity
measure between the input data 𝑋 𝑛 = (𝑥𝑛1 , … , 𝑥𝑛𝑑 )𝑇 ∈ 𝑅𝑑
and the weight vector 𝑤𝑖 .
• For every input vector, the competitive neurons "compete"
with each other to see which one of them is the most similar to
that particular input vector.
• The winner neuron m sets its output 𝑜𝑚 = 1 and all other
competitive neurons set their output 𝑜𝑖 = 0, i = 1, ..., M, i ≠ m.

FIXED-WEIGHT COMPETITIVE NETS

➢ Many neural nets use the idea of competition among neurons
to enhance the contrast in activations of the neurons.
➢ In the most extreme situation, often called Winner-Take-All,
only the neuron with the largest activation is allowed to remain
"on".
➢ Examples of fixed-weight competitive nets include Maxnet,
Mexican Hat, Hamming net, etc.

Supervised vs Unsupervised Learning Guide
No ratings yet
Supervised vs Unsupervised Learning Guide
5 pages
Ai Module5
No ratings yet
Ai Module5
38 pages
Machine Learning Concepts and Methods
No ratings yet
Machine Learning Concepts and Methods
103 pages
AI Modelling: Techniques & Approaches
No ratings yet
AI Modelling: Techniques & Approaches
10 pages
Machine Learning Techniques Overview
No ratings yet
Machine Learning Techniques Overview
78 pages
Types of Machine Learning Algorithms
No ratings yet
Types of Machine Learning Algorithms
30 pages
Understanding Learning in AI Systems
100% (1)
Understanding Learning in AI Systems
36 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
124 pages
Notes of AI and ML
No ratings yet
Notes of AI and ML
9 pages
FOAI Unit 5-3
No ratings yet
FOAI Unit 5-3
10 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
4 pages
Machine Learning Algorithms Overview
No ratings yet
Machine Learning Algorithms Overview
29 pages
Understanding Learning Agents in AI
No ratings yet
Understanding Learning Agents in AI
10 pages
Types of AI Learning Explained
No ratings yet
Types of AI Learning Explained
11 pages
Machine Larning (Lecture 2)
No ratings yet
Machine Larning (Lecture 2)
25 pages
Inductive vs. Deductive Learning in AI
No ratings yet
Inductive vs. Deductive Learning in AI
17 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
5 pages
Machine Learning vs Human Learning
No ratings yet
Machine Learning vs Human Learning
34 pages
Understanding Machine Learning Types
No ratings yet
Understanding Machine Learning Types
50 pages
Overview of Machine Learning Types
No ratings yet
Overview of Machine Learning Types
26 pages
Machine Learning Fundamentals and Types
No ratings yet
Machine Learning Fundamentals and Types
44 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
27 pages
Machine Learning Lab Overview and Questions
50% (2)
Machine Learning Lab Overview and Questions
9 pages
Lecture1 1 1
No ratings yet
Lecture1 1 1
4 pages
Machine Learning Overview and Types
No ratings yet
Machine Learning Overview and Types
21 pages
MLF - Unit 1
No ratings yet
MLF - Unit 1
54 pages
Understanding Machine Learning Concepts
No ratings yet
Understanding Machine Learning Concepts
12 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
29 pages
Types of Machine Learning Explained
100% (1)
Types of Machine Learning Explained
21 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
11 pages
Deep Learning Overview and Techniques
No ratings yet
Deep Learning Overview and Techniques
30 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
67 pages
Machine Learning Types Explained
No ratings yet
Machine Learning Types Explained
5 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
14 pages
Types of Machine Learning Algorithms
No ratings yet
Types of Machine Learning Algorithms
12 pages
Types of Machine Learning Explained
No ratings yet
Types of Machine Learning Explained
11 pages
Overview of Machine Learning Types
No ratings yet
Overview of Machine Learning Types
20 pages
Four Types of Machine Learning
No ratings yet
Four Types of Machine Learning
12 pages
UNIT-3 Course Material
No ratings yet
UNIT-3 Course Material
8 pages
Machine Learning Techniques Overview
No ratings yet
Machine Learning Techniques Overview
30 pages
Types and Benefits of Machine Learning
No ratings yet
Types and Benefits of Machine Learning
27 pages
Machine Learning Overview and Techniques
No ratings yet
Machine Learning Overview and Techniques
58 pages
Machine Learning Algorithms Overview
No ratings yet
Machine Learning Algorithms Overview
9 pages
Machine Learning Approaches Explained
100% (1)
Machine Learning Approaches Explained
11 pages
Overview of Machine Learning Types
No ratings yet
Overview of Machine Learning Types
17 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
31 pages
Machine Learning Basics and Workflow
No ratings yet
Machine Learning Basics and Workflow
38 pages
AI in Student Application Evaluation
No ratings yet
AI in Student Application Evaluation
96 pages
Reinforcement Learning in AI Models
No ratings yet
Reinforcement Learning in AI Models
13 pages
ML AssigANS
No ratings yet
ML AssigANS
23 pages
Machine Learning Modeling Process Overview
No ratings yet
Machine Learning Modeling Process Overview
16 pages
Machine Learning: Supervised vs Unsupervised
No ratings yet
Machine Learning: Supervised vs Unsupervised
19 pages
Introduction to Machine Learning Types
No ratings yet
Introduction to Machine Learning Types
20 pages
Machine Learning Techniques Overview
100% (1)
Machine Learning Techniques Overview
113 pages
Machine Learning: Problems & Solutions
No ratings yet
Machine Learning: Problems & Solutions
9 pages
Reels Bundle 2025-2026
67% (3)
Reels Bundle 2025-2026
6 pages
Canva Pro 2
No ratings yet
Canva Pro 2
1 page
Effective Software Review Techniques
No ratings yet
Effective Software Review Techniques
12 pages
Software Requirements Overview and Analysis
No ratings yet
Software Requirements Overview and Analysis
26 pages
Project Schedule Essentials Explained
No ratings yet
Project Schedule Essentials Explained
58 pages
Hebbian Learning and Gradient Descent Learning: Neural Computation: Lecture 5
No ratings yet
Hebbian Learning and Gradient Descent Learning: Neural Computation: Lecture 5
20 pages
Understanding Neural Network Architecture
No ratings yet
Understanding Neural Network Architecture
74 pages
Hebbian Learning Rule Explained
No ratings yet
Hebbian Learning Rule Explained
35 pages
Surya Ganguli: Academic Profile
No ratings yet
Surya Ganguli: Academic Profile
9 pages
Neuroplasticity in Counselling Practice
No ratings yet
Neuroplasticity in Counselling Practice
7 pages
Digit Span Test in Experimental Psychology
No ratings yet
Digit Span Test in Experimental Psychology
49 pages
Biopsychology Exam Questions & Answers
No ratings yet
Biopsychology Exam Questions & Answers
6 pages
Synaptic Organization in Neocortex
No ratings yet
Synaptic Organization in Neocortex
6 pages
Dr. Md. Aminul Haque on Neural Networks
100% (1)
Dr. Md. Aminul Haque on Neural Networks
82 pages
Brain-Inspired Learning in ANNs Review
No ratings yet
Brain-Inspired Learning in ANNs Review
14 pages
Neuromorphic Computing Seminar Report
No ratings yet
Neuromorphic Computing Seminar Report
33 pages
Machine Learning Unit Overview
100% (1)
Machine Learning Unit Overview
15 pages
Hebbian Plasticity for Adaptive Learning
No ratings yet
Hebbian Plasticity for Adaptive Learning
16 pages
Understanding Soft Computing Concepts
No ratings yet
Understanding Soft Computing Concepts
87 pages
Supervised Hebbian Learning Overview
No ratings yet
Supervised Hebbian Learning Overview
14 pages
Aristotle's Memory Insights & Neural Networks
No ratings yet
Aristotle's Memory Insights & Neural Networks
93 pages
Donald Hebb: Pioneer of Behavioral Psychology
No ratings yet
Donald Hebb: Pioneer of Behavioral Psychology
5 pages
Freud's Neurobiology and Psychology Insights
No ratings yet
Freud's Neurobiology and Psychology Insights
17 pages
Hebbian Theory
No ratings yet
Hebbian Theory
5 pages
Physiology of Memory Explained
No ratings yet
Physiology of Memory Explained
21 pages
Learning Rules for Neural Networks
No ratings yet
Learning Rules for Neural Networks
9 pages
Expert Systems in Artificial Intelligence
No ratings yet
Expert Systems in Artificial Intelligence
39 pages
Neural Network Learning Rules Explained
No ratings yet
Neural Network Learning Rules Explained
7 pages
Overview of Expert Systems
No ratings yet
Overview of Expert Systems
16 pages
Biological vs. Artificial Neural Networks
No ratings yet
Biological vs. Artificial Neural Networks
41 pages
Neural Networks For Beginners
86% (7)
Neural Networks For Beginners
72 pages
Enhancing AI with Episodic Memory
No ratings yet
Enhancing AI with Episodic Memory
12 pages
Principles of Neural Science 5th Edition Kandel E.R.
No ratings yet
Principles of Neural Science 5th Edition Kandel E.R.
77 pages
Hebb Neural Network Design and Applications
No ratings yet
Hebb Neural Network Design and Applications
12 pages
(Solution Manual) Astronomy Today 8th Edition by Eric Chaisson Full
100% (5)
(Solution Manual) Astronomy Today 8th Edition by Eric Chaisson Full
101 pages

Understanding Machine Learning Types

Uploaded by

Understanding Machine Learning Types

Uploaded by

LEARNING

➢ An agent is learning if it improves its performance on future tasks

There are three main reasons:

➢ First, the designers cannot anticipate all possible situations that

➢ Any component of an agent can be improved by learning from data.

Feedback to learn from

Example of Supervised Learning

Predicting house prices: The input might be house features such as

Given a training set of N example input–output pairs

(x1, y1),(x2, y2),...(xN , yN )

where each yj was generated by an unknown function y = f(x),

We say a hypothesis generalizes well if it correctly predicts the value

• Regression: When y is a number (such as tomorrow’s

When dealing with real-valued output variables like "price" or

• Classification: When the output y is one of a finite set of values

In instances where the output variable is a category, like

• Gathers previous data, which helps in learning from past

• Difficult to classify huge data sets.

Association Rule Mining

Advantages of Unsupervised Learning

• Uncovering hidden patterns and structures in data without

Disadvantages of Unsupervised Learning

• Lack of clear objective metrics for evaluating model

Markov decision process

➢ The reinforcement learning agent learns about a problem by

➢ One of the challenges that arise in reinforcement learning, and

Components of reinforcement learning

- Reward signal. This designates the RL problem’s goal. Each of

- Value function. Reward signal differs from value function in that

- Model. This is an optional sub-element of reinforcement learning

• Ability to Learn Optimal Strategies Through Trial and Error

• Scalability to Complex Decision-Making Problems

• Flexibility in Adapting to New Information

• Efficiency in Handling Long-Term Sequential Decision-

• Susceptibility to High Variance and Instability

• Dependency on Large Amounts of Environmental Interaction

• Difficulty in Specifying Reward Functions

• Limited Transferability Between Different Tasks

• Ethical and Safety Concerns in Autonomous Decision-

➢ A gradient simply measures the change in all weights with

How Does Gradient Descent Work?

a = Current position if climber

b = next position of climber

 = the learning rate

f(a) = the gradient of the loss function with respect to

➢ This formula basically tells us the next position we need to go,

Types of Gradient Descent

Batch Gradient Descent

➢ Batch gradient descent, also called vanilla gradient descent,

➢ Stochastic gradient descent (SGD) does this for each training

Mini-Batch Gradient Descent

➢ Mini-batch gradient descent is the go-to method since it’s a

W (new) = w (old) + x*y

Training Algorithm For Hebbian Learning Rule

Architecture of Competitive Learning

FIXED-WEIGHT COMPETITIVE NETS

You might also like