0% found this document useful (0 votes)

6 views51 pages

Understanding Neural Networks Basics

Neural networks have emerged as a leading approach in AI, excelling in various applications such as pattern recognition and prediction, often surpassing previous benchmarks. They are modeled after the human brain's interconnected neurons and can perform complex computations through multi-layer perceptrons, enabling them to classify data and represent continuous functions. The document also highlights the potential of neural networks to model memory and probability distributions, indicating their versatility in AI tasks.

Uploaded by

baslahmad2002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views51 pages

Understanding Neural Networks Basics

Uploaded by

baslahmad2002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Neural Networks

1. Introduction

1
Neural Networks are taking over!
• Neural networks have become one of the main
approaches to AI

• They have been successfully applied to various pattern

recognition, prediction, and analysis problems
• In many problems they have established the state of
the art
– Often exceeding previous benchmarks by large margins
– Sometimes solving problems you couldn’t solve using
earlier ML methods

7
Breakthroughs with neural networks

8
Breakthrough with neural networks

9
Image segmentation and recognition

10
Image recognition

[Link] 11
Breakthroughs with neural networks

12
Success with neural networks

• Captions generated entirely by a neural

network 13
Breakthroughs with neural networks
[Link] uses AI
to generate endless fake faces

– [Link]
fake-people-portraits-thispersondoesnotexist-stylegan 14
Successes with neural networks
• And a variety of other problems:
– From art to astronomy to healthcare..
– and even predicting stock markets!

15
So what are neural networks??

Voice Image
[Link] Transcription [Link] Text caption
signal

Game
[Link] Next move
State

• What are these boxes?

16
So what are neural networks??

• It begins with this..

17
The magical capacity of humans
• Humans can
– Learn
– Solve problems
– Recognize patterns
– Create
– Cogitate
– …
Dante!
• Worthy of emulation
• But how do humans “work“?
19
Cognition and the brain..
• “If the brain was simple enough to be
understood - we would be too simple to
understand it!”
– Marvin Minsky

20
Observation: The Brain

• Mid 1800s: The brain is a mass of

interconnected neurons

28
Brain: Interconnected Neurons

• Many neurons connect in to each neuron

• Each neuron connects out to many neurons
29
Modelling the brain
• What are the units?
• A neuron: Soma

Dendrites
Axon

• Signals come in through the dendrites into the Soma

• A signal goes out via the axon to other neurons
– Only one axon per neuron
• Factoid that may only interest me: Neurons do not undergo cell
division
– Neurogenesis occurs from neuronal stem cells, and is minimal after
birth 45
McCulloch and Pitts

• The Doctor and the Hobo..

– Warren McCulloch: Neurophysiologist
– Walter Pitts: Homeless wannabe logician who
arrived at his door
46
Perceptron: Simplified model

• Number of inputs combine linearly

– Threshold logic: Fire if combined input exceeds threshold

60
The Universal Model
• Originally assumed could represent any Boolean circuit and
perform any logic
– “the embryo of an electronic computer that [the Navy] expects
will be able to walk, talk, see, write, reproduce itself and be
conscious of its existence,” New York Times (8 July) 1958
– “Frankenstein Monster Designed by Navy That Thinks,” Tulsa,
Oklahoma Times 1958

61
Perceptron
X 1
-1
2 X 0
1

Y
X 1

1
1

Y Values shown on edges are weights,

numbers in the circles are thresholds

• Easily shown to mimic any Boolean gate

• But…
63
Perceptron

No solution for XOR!

Not universal!
X ?

?
?

• Minsky and Papert, 1968

64
A single neuron is not enough

• Individual elements are weak computational elements

– Marvin Minsky and Seymour Papert, 1969, Perceptrons:
An Introduction to Computational Geometry

• Networked elements are required

65
Multi-layer Perceptron!
X 1

1
-1 1

2
1
1

-1
-1

Y
Hidden Layer
• XOR
– The first layer is a “hidden” layer
– Also originally suggested by Minsky and Papert 1968
66
A more generic model

21
1 1
01 1
1 -1 1 1

21 21 1 21
1 1 1 -1 1 -1
1 1
1

X Y Z A

• A “multi-layer” perceptron
• Can compose arbitrarily complicated Boolean functions!
– In cognitive terms: Can compute arbitrary Boolean functions over
sensory input
– More on this in the next class
67
Story so far
• Neural networks began as computational models of the brain
• Neural network models are connectionist machines
– The comprise networks of neural units
• McCullough and Pitt model: Neurons as Boolean threshold units
– Models the brain as performing propositional logic
– But no learning rule
• Hebb’s learning rule: Neurons that fire together wire together
– Unstable
• Rosenblatt’s perceptron : A variant of the McCulloch and Pitt neuron with
a provably convergent learning rule
– But individual perceptrons are limited in their capacity (Minsky and Papert)
• Multi-layer perceptrons can model arbitrarily complex Boolean functions

68
But our brain is not Boolean

• We have real inputs

• We make non-Boolean inferences/predictions
69
The perceptron with real inputs

x1
x2
x3

• x1…xN are real valued

• w1…wN are real valued
• Unit “fires” if weighted input matches (or exceeds)
a threshold
70
The perceptron with real inputs

x1
x2
x3

• Alternate view:
– A threshold “activation” operates on the weighted sum of inputs
plus a bias
• An affine function of the inputs
– outputs a 1 if z is non-negative, 0 otherwise
• Unit “fires” if weighted input matches or exceeds a threshold
71
The perceptron with real inputs
and a real output
b
x1
x2
x3
sigmoid

• x1…xN are real valued

• w1…wN are real valued
• The output y can also be real valued
– Sometimes viewed as the “probability” of firing
72
The “real” valued perceptron
b
x1
x2
f(sum)
x3

• Any real-valued “activation” function may operate on the affine

function of the input
– We will see several later
– Output will be real valued
• The perceptron maps real-valued inputs to real-valued outputs
• Is useful to continue assuming Boolean outputs though, for interpretation
73
A Perceptron on Reals

1
x1

x3
x2 w1x1+w2x2=T

xN
0
x1

• A perceptron operates on x2
x1
real-valued vectors
– This is a linear classifier 74
Boolean functions with a real
perceptron
0,1 1,1 0,1 1,1 0,1 1,1

x1 x1 x1

0,0 x2 1,0 0,0 x2 1,0 0,0 x2 1,0

• Boolean perceptrons are also linear classifiers

– Purple regions have output 1 in the figures
– What are these functions
– Why can we not compose an XOR?
75
Composing complicated “decision”
boundaries

x2 Can now be composed into

“networks” to compute arbitrary
classification “boundaries”

• Build a network of units with a single output

that fires if the input is in the coloured area

76
Booleans over the reals

x2 x1

• The network must fire if the input is in the

coloured area
77
Booleans over the reals

x2 x1

• The network must fire if the input is in the

coloured area
78
Booleans over the reals

x2 x1

• The network must fire if the input is in the

coloured area
79
Booleans over the reals

x2 x1

• The network must fire if the input is in the

coloured area
80
Booleans over the reals

x2 x1

• The network must fire if the input is in the

coloured area
81
Booleans over the reals
3

x2
4
4 AND
3 3
5
y1 y2 y3 y4 y5
4 x1
4

3 3
4 x2 x1

• The network must fire if the input is in the

coloured area
82
More complex decision boundaries

AND AND

x1 x1 x2
• Network to fire if the input is in the yellow area
– “OR” two polygons
– A third layer is required 83
Complex decision boundaries

• Can compose very complex decision boundaries

– How complex exactly? More on this in the next class

84
Complex decision boundaries

784 dimensions
(MNIST)
784 dimensions

• Classification problems: finding decision boundaries in

high-dimensional space
– Can be performed by an MLP
• MLPs can classify real-valued inputs 85
Story so far
• MLPs are connectionist computational models
– Individual perceptrons are computational equivalent of neurons
– The MLP is a layered composition of many perceptrons

• MLPs can model Boolean functions

– Individual perceptrons can act as Boolean gates
– Networks of perceptrons are Boolean functions

• MLPs are Boolean machines

– They represent Boolean functions over linear boundaries
– They can represent arbitrary decision boundaries
– They can be used to classify data

86
But what about continuous valued
outputs?

• Inputs may be real-valued

• Can outputs be continuous-valued too?
88
Other things MLPs can do
• Model memory
– Loopy networks can “remember” patterns
• Proposed by Lawrence Kubie in 1930, as a
model for memory in the CNS

• Represent probability distributions

– Over integer, real and complex-valued
domains
– MLPs can model both a posteriori and a
priori distributions of data
• A posteriori conditioned on other variables
– MLPs can generate data from complicated,
or even unknown distributions

• They can rub their stomachs and pat their

heads at the same time..
93
NNets in AI
• The network is a function
– Given an input, it computes the function layer
wise to predict an output
• More generally, given one or more inputs, predicts one
or more outputs

94
These tasks are functions

Voice Image
[Link] Transcription [Link] Text caption
signal

Game
[Link] Next move
State

• Each of these boxes is actually a function

– E.g f: Image  Caption
95
These tasks are functions

Voice Image
Transcription Text caption
signal

Game
State Next move

• Each box is actually a function

– E.g f: Image  Caption
– It can be approximated by a neural network 96
Story so far
• Multi-layer perceptrons are connectionist
computational models
• MLPs are classification engines

• MLP can also model continuous valued

functions
• Interesting AI tasks are functions that can be
modelled by the network
97
Next Up

• More on neural networks as universal

approximators
– And the issue of depth in networks

Understanding Neural Networks Basics
No ratings yet
Understanding Neural Networks Basics
90 pages
Deep Learning Course Introduction Slides
No ratings yet
Deep Learning Course Introduction Slides
63 pages
Understanding Perceptrons and Neural Networks
No ratings yet
Understanding Perceptrons and Neural Networks
89 pages
Deep Learning Fundamentals & CNNs Overview
No ratings yet
Deep Learning Fundamentals & CNNs Overview
156 pages
Introduction to Neural Networks Basics
No ratings yet
Introduction to Neural Networks Basics
36 pages
Overview of Neural Network Fundamentals
No ratings yet
Overview of Neural Network Fundamentals
136 pages
Deep Learning: Foundations & Applications
No ratings yet
Deep Learning: Foundations & Applications
18 pages
Introduction to Neural Networks Overview
No ratings yet
Introduction to Neural Networks Overview
36 pages
Overview of Neural Networks Concepts
No ratings yet
Overview of Neural Networks Concepts
46 pages
Introduction to Artificial Neural Networks
No ratings yet
Introduction to Artificial Neural Networks
49 pages
Understanding Artificial Neural Networks
No ratings yet
Understanding Artificial Neural Networks
96 pages
Introduction to Artificial Neural Networks
No ratings yet
Introduction to Artificial Neural Networks
76 pages
Breakthroughs in Neural Networks AI
No ratings yet
Breakthroughs in Neural Networks AI
18 pages
Lecture 02 ANN
No ratings yet
Lecture 02 ANN
18 pages
Lecture 01
No ratings yet
Lecture 01
52 pages
Understanding Neuron Activation Functions
No ratings yet
Understanding Neuron Activation Functions
46 pages
Intelligent Computing Paradigms Overview
No ratings yet
Intelligent Computing Paradigms Overview
76 pages
Understanding Artificial Neural Networks
No ratings yet
Understanding Artificial Neural Networks
65 pages
Overview of Neural Networks and Learning
No ratings yet
Overview of Neural Networks and Learning
129 pages
Introduction to Neural Networks in CMSC475
No ratings yet
Introduction to Neural Networks in CMSC475
19 pages
Overview of Artificial Neural Networks
No ratings yet
Overview of Artificial Neural Networks
51 pages
Foundations of Artificial Intelligence
No ratings yet
Foundations of Artificial Intelligence
60 pages
Neural Ver2 Updated
No ratings yet
Neural Ver2 Updated
142 pages
Overview of Industrial Revolutions and Deep Learning
No ratings yet
Overview of Industrial Revolutions and Deep Learning
48 pages
Understanding Artificial Neural Networks
No ratings yet
Understanding Artificial Neural Networks
3 pages
Introduction to Neural Networks Basics
No ratings yet
Introduction to Neural Networks Basics
35 pages
Understanding Artificial Neural Networks
No ratings yet
Understanding Artificial Neural Networks
74 pages
Understanding Neural Networks Basics
100% (1)
Understanding Neural Networks Basics
119 pages
Neural Networks: Basics and Functions
No ratings yet
Neural Networks: Basics and Functions
77 pages
Introduction to Neural Networks Basics
No ratings yet
Introduction to Neural Networks Basics
140 pages
McCulloch-Pitts Neurons Overview
No ratings yet
McCulloch-Pitts Neurons Overview
92 pages
Understanding Machine Learning Concepts
No ratings yet
Understanding Machine Learning Concepts
40 pages
Understanding Deep Learning Concepts
No ratings yet
Understanding Deep Learning Concepts
126 pages
Understanding Neural Networks Basics
No ratings yet
Understanding Neural Networks Basics
70 pages
Introduction to Neural Networks Basics
No ratings yet
Introduction to Neural Networks Basics
85 pages
Introduction to Artificial Neural Networks
No ratings yet
Introduction to Artificial Neural Networks
25 pages
Perceptron Design and Functionality
No ratings yet
Perceptron Design and Functionality
100 pages
Multilayer Perceptron Overview
No ratings yet
Multilayer Perceptron Overview
24 pages
History and Concepts of Deep Learning
No ratings yet
History and Concepts of Deep Learning
17 pages
Understanding Neural Networks in Deep Learning
No ratings yet
Understanding Neural Networks in Deep Learning
48 pages
Neural Networks: Basics and MLP Overview
No ratings yet
Neural Networks: Basics and MLP Overview
74 pages
Excitatory and Inhibitory Neuron Inputs
No ratings yet
Excitatory and Inhibitory Neuron Inputs
16 pages
Neural Networks: Structure & Learning
No ratings yet
Neural Networks: Structure & Learning
25 pages
Neural Networks: Basics and Applications
No ratings yet
Neural Networks: Basics and Applications
36 pages
Deep Learning: Concepts and Models
No ratings yet
Deep Learning: Concepts and Models
33 pages
Neural Networks and Deep Learning Overview
No ratings yet
Neural Networks and Deep Learning Overview
9 pages
Unit 1 Deep Learning
No ratings yet
Unit 1 Deep Learning
9 pages
Machine Learning: Regression & Bayesian Methods
No ratings yet
Machine Learning: Regression & Bayesian Methods
84 pages
Unit-2 Ait Notes
No ratings yet
Unit-2 Ait Notes
23 pages
Deep Learning Fundamentals and Models
No ratings yet
Deep Learning Fundamentals and Models
117 pages
AI and Machine Learning in Business
No ratings yet
AI and Machine Learning in Business
44 pages
Deep Learning Basics and Applications
No ratings yet
Deep Learning Basics and Applications
23 pages
Deep Learning: Artificial Neural Networks
No ratings yet
Deep Learning: Artificial Neural Networks
91 pages
Introduction to Neural Networks Concepts
No ratings yet
Introduction to Neural Networks Concepts
54 pages
Introduction to Artificial Neural Networks
No ratings yet
Introduction to Artificial Neural Networks
65 pages
Deep Learning and Neural Networks Overview
No ratings yet
Deep Learning and Neural Networks Overview
48 pages
Ann PPT 1
No ratings yet
Ann PPT 1
114 pages
Neural Network Architecture Overview
No ratings yet
Neural Network Architecture Overview
58 pages
Understanding Neural Networks Explained
No ratings yet
Understanding Neural Networks Explained
24 pages
Entanglement in NV Center Chains
No ratings yet
Entanglement in NV Center Chains
11 pages
Rover VMS 1000 Manual
No ratings yet
Rover VMS 1000 Manual
74 pages
Control Center and Data Exchange Requirements: PJM Manual 01
No ratings yet
Control Center and Data Exchange Requirements: PJM Manual 01
67 pages
PLC Communication Systems Overview
No ratings yet
PLC Communication Systems Overview
6 pages
Samsung Refrigerator Model SR-42/43 Guide
No ratings yet
Samsung Refrigerator Model SR-42/43 Guide
26 pages
Hydro Jetting Safety Analysis Report
No ratings yet
Hydro Jetting Safety Analysis Report
23 pages
EdiQue Solutions: Immersive Education Tech
No ratings yet
EdiQue Solutions: Immersive Education Tech
3 pages
9fm0 3a Que 20190614
No ratings yet
9fm0 3a Que 20190614
32 pages
STRUCTURALISMClass
No ratings yet
STRUCTURALISMClass
15 pages
Cultivating a Security Culture in Organizations
No ratings yet
Cultivating a Security Culture in Organizations
2 pages
Software Development Career Training
No ratings yet
Software Development Career Training
2 pages
Primer Design Essentials for PCR
No ratings yet
Primer Design Essentials for PCR
8 pages
Understanding the Balanced Scorecard
No ratings yet
Understanding the Balanced Scorecard
14 pages
Zero HVAC Energy Optimization Strategies
No ratings yet
Zero HVAC Energy Optimization Strategies
5 pages
2007 Corolla Key and Remote Guide
100% (1)
2007 Corolla Key and Remote Guide
24 pages
Inventory Management Impact on Customer Service
No ratings yet
Inventory Management Impact on Customer Service
24 pages
Digital Governance Overview and Practices
No ratings yet
Digital Governance Overview and Practices
18 pages
Week 6 Assignment: Python Pandas Quiz
No ratings yet
Week 6 Assignment: Python Pandas Quiz
3 pages
(8th) James Stewart - Student Solutions Manual, Chapters 1-11 For Stewart's Single Variable Calculus, 8th-Brooks Cole (2015)
100% (1)
(8th) James Stewart - Student Solutions Manual, Chapters 1-11 For Stewart's Single Variable Calculus, 8th-Brooks Cole (2015)
2 pages
Aseptic Risk Assessment Simplified
No ratings yet
Aseptic Risk Assessment Simplified
3 pages
Interior Designer Position at Hippo Stores
No ratings yet
Interior Designer Position at Hippo Stores
2 pages
EGRZVS4-65D-R6N43 Product Specifications
No ratings yet
EGRZVS4-65D-R6N43 Product Specifications
6 pages
L-861T LED Taxiway Edge Light Specs
No ratings yet
L-861T LED Taxiway Edge Light Specs
2 pages
Superresolution Imaging Techniques Overview
No ratings yet
Superresolution Imaging Techniques Overview
54 pages
Amazon Order Summary for $6.45
No ratings yet
Amazon Order Summary for $6.45
1 page
Rethinking Career Mobility Strategies
100% (7)
Rethinking Career Mobility Strategies
14 pages
AI-Based Facial Emotion Detection
100% (1)
AI-Based Facial Emotion Detection
6 pages
Management Information Systems Assessments
No ratings yet
Management Information Systems Assessments
10 pages
Software Testing Methods Explained
No ratings yet
Software Testing Methods Explained
12 pages
HGL and EGL in Fluid Mechanics
No ratings yet
HGL and EGL in Fluid Mechanics
8 pages

Understanding Neural Networks Basics

Uploaded by

Understanding Neural Networks Basics

Uploaded by

Neural Networks

• They have been successfully applied to various pattern

• Captions generated entirely by a neural

• What are these boxes?

• It begins with this..

• Mid 1800s: The brain is a mass of

• Many neurons connect in to each neuron

• Signals come in through the dendrites into the Soma

• The Doctor and the Hobo..

• Number of inputs combine linearly

Y Values shown on edges are weights,

• Easily shown to mimic any Boolean gate

No solution for XOR!

• Minsky and Papert, 1968

• Individual elements are weak computational elements

• Networked elements are required

• We have real inputs

• x1…xN are real valued

• x1…xN are real valued

• Any real-valued “activation” function may operate on the affine

0,0 x2 1,0 0,0 x2 1,0 0,0 x2 1,0

• Boolean perceptrons are also linear classifiers

x2 Can now be composed into

• Build a network of units with a single output

• The network must fire if the input is in the

• The network must fire if the input is in the

• The network must fire if the input is in the

• The network must fire if the input is in the

• The network must fire if the input is in the

• The network must fire if the input is in the

• Can compose very complex decision boundaries

• Classification problems: finding decision boundaries in

• MLPs can model Boolean functions

• MLPs are Boolean machines

• Inputs may be real-valued

• Represent probability distributions

• They can rub their stomachs and pat their

• Each of these boxes is actually a function

• Each box is actually a function

• MLP can also model continuous valued

• More on neural networks as universal

You might also like