
XOR Backpropagation Neural Network

The document provides a Python implementation of a Back Propagation Network to solve the XOR function using binary inputs and outputs. It includes the definition of the sigmoid activation function, the training process over 10,000 epochs, and displays the network's predictions, weights, biases, and accuracy after training. The final accuracy achieved by the model is 100%.

Uploaded by shankar

7. Write a program to show Back Propagation Network for XOR function with Binary Input and Output

import numpy as np

np.random.seed(42)  # fix the seed so the initial weights are reproducible

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_deriv(y):
    # y is already sigmoid(x)
    return y * (1.0 - y)

# XOR truth table: four samples, two binary inputs, one binary target
X = np.array([[0, 0],
              [0, 1],
              [1, 0],
              [1, 1]], dtype=float)
Y = np.array([[0], [1], [1], [0]], dtype=float)

input_size = 2
hidden_size = 2
output_size = 1
lr = 0.5        # learning rate
epochs = 10000  # number of training iterations

W1 = np.random.uniform(-1.0, 1.0, (input_size, hidden_size))
B1 = np.zeros((1, hidden_size))
W2 = np.random.uniform(-1.0, 1.0, (hidden_size, output_size))
B2 = np.zeros((1, output_size))

for epoch in range(epochs):
    # Forward pass
    Z1 = np.dot(X, W1) + B1   # (4, hidden_size)
    A1 = sigmoid(Z1)          # hidden activations
    Z2 = np.dot(A1, W2) + B2  # (4, 1)
    A2 = sigmoid(Z2)          # output activations
    loss = np.mean(0.5 * (Y - A2) ** 2)

    # Backward pass
    dA2 = A2 - Y                   # derivative of MSE wrt A2
    dZ2 = dA2 * sigmoid_deriv(A2)  # (4,1)
    dW2 = np.dot(A1.T, dZ2) / X.shape[0]
    dB2 = np.mean(dZ2, axis=0, keepdims=True)
    dA1 = np.dot(dZ2, W2.T)        # (4, hidden_size)
    dZ1 = dA1 * sigmoid_deriv(A1)
    dW1 = np.dot(X.T, dZ1) / X.shape[0]
    dB1 = np.mean(dZ1, axis=0, keepdims=True)

    # Gradient descent update
    W2 -= lr * dW2
    B2 -= lr * dB2
    W1 -= lr * dW1
    B1 -= lr * dB1

    if epoch % 1000 == 0 or epoch == epochs - 1:
        print(f"Epoch {epoch:5d} Loss: {loss:.6f}")

print("\nTrained predictions on XOR inputs:")
Z1 = np.dot(X, W1) + B1
A1 = sigmoid(Z1)
Z2 = np.dot(A1, W2) + B2
A2 = sigmoid(Z2)
print(np.hstack((X, Y, A2, np.round(A2))))

print("\nWeights and biases:")
print("W1:\n", W1)
print("B1:\n", B1)
print("W2:\n", W2)
print("B2:\n", B2)

preds = np.round(A2)
accuracy = np.mean(preds == Y)
print(f"\nAccuracy (after rounding): {accuracy * 100:.1f}%")

Output:

Epoch 0 Loss: 0.143150
Epoch 1000 Loss: 0.124974
Epoch 2000 Loss: 0.124862
Epoch 3000 Loss: 0.124477
Epoch 4000 Loss: 0.121065
Epoch 5000 Loss: 0.070273
Epoch 6000 Loss: 0.018014
Epoch 7000 Loss: 0.008040
Epoch 8000 Loss: 0.004882
Epoch 9000 Loss: 0.003431
Epoch 9999 Loss: 0.002619

Trained predictions on XOR inputs:
[[0. 0. 0. 0.07022023 0. ]
 [0. 1. 1. 0.92223364 1. ]
 [1. 0. 1. 0.92231458 1. ]
 [1. 1. 0. 0.06270003 0. ]]

Weights and biases:
W1:
 [[-5.50637624 5.67926358]
 [ 5.69407248 -5.46834499]]
B1:
 [[2.81493644 2.78774586]]
W2:
 [[-6.15130337]
 [-6.15441065]]
B2:
 [[9.01782245]]

Accuracy (after rounding): 100.0%

Explanation:

import numpy as np
Imports the NumPy library and gives it the short name np, so you can use
NumPy functions and arrays with the np. prefix.

np.random.seed(42)
Sets the random-number generator seed to 42 so any subsequent random
numbers (like initial weights) are reproducible every run.
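
As a small illustration of what the seed does, the sketch below reseeds the generator and draws the same (2, 2) weight matrix twice. The names first, second and same are illustrative only and do not appear in the program.

```python
import numpy as np

# Draw a (2, 2) matrix, reseed with the same value, and draw again.
np.random.seed(42)
first = np.random.uniform(-1.0, 1.0, (2, 2))

np.random.seed(42)
second = np.random.uniform(-1.0, 1.0, (2, 2))

# Identical seeds give identical draws, so the comparison is True.
same = np.array_equal(first, second)
print(same)
```

Without the second call to np.random.seed, the two draws would continue the same random stream and would be expected to differ.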

def sigmoid(x):
Starts the definition of a function named sigmoid that takes one argument
x.

return 1.0 / (1.0 + np.exp(-x))

Computes the sigmoid activation σ(x) = 1 / (1 + e^{-x}) elementwise for
input x; maps real values into the range (0,1).
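
A quick check of the range claim, as a minimal sketch: the sample points -5, 0 and 5 are arbitrary choices made for this illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Three arbitrary sample points: strongly negative, zero, strongly positive.
x = np.array([-5.0, 0.0, 5.0])
s = sigmoid(x)

print(s)                # every value lies strictly between 0 and 1
print(s + sigmoid(-x))  # sigmoid(x) + sigmoid(-x) equals 1 up to rounding
```

sigmoid(0) is exactly 0.5, which is why rounding the network output later amounts to thresholding the pre-activation at zero.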

def sigmoid_deriv(y):
Starts the definition of a function named sigmoid_deriv that expects y,
which should already be sigmoid(x).

return y * (1.0 - y)
Returns the derivative of sigmoid with respect to its input, using the
identity σ'(x) = σ(x) * (1 - σ(x)). This expects y = σ(x).
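
The identity can be checked numerically. The sketch below compares sigmoid_deriv with a central finite difference; the grid of test points and the step size h are assumptions chosen for the illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_deriv(y):
    # y is already sigmoid(x)
    return y * (1.0 - y)

x = np.linspace(-3.0, 3.0, 7)  # test points, including x = 0
h = 1e-5                       # finite-difference step (illustrative choice)

analytic = sigmoid_deriv(sigmoid(x))
numeric = (sigmoid(x + h) - sigmoid(x - h)) / (2.0 * h)

max_err = np.max(np.abs(analytic - numeric))
print(max_err)  # tiny: the two derivatives agree closely
```

Note that the function must be given the activation, not the pre-activation: sigmoid_deriv(x) on a raw input would silently return the wrong value.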

X = np.array([[0, 0],
              [0, 1],
              [1, 0],
              [1, 1]], dtype=float)
Creates the input matrix X as a NumPy array with four rows (samples) and
two columns (features). Each row is one XOR input pair. dtype=float
ensures numeric (floating point) math.

Y = np.array([[0], [1], [1], [0]], dtype=float)

Creates the target/output column vector Y with four rows corresponding to
XOR outputs. It has shape (4,1) and is float for gradient math.

input_size = 2
Stores the number of input neurons/features (2) in a variable used for
shaping weights.

hidden_size = 2
Stores the number of hidden neurons (2). Two hidden units are sufficient
to represent XOR.
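
To see that two hidden units really are enough, the following sketch runs a forward pass with the trained weights and biases printed in the Output section. The values are copied from that printout (so they are rounded to eight decimals); nothing here is retrained.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
Y = np.array([[0], [1], [1], [0]], dtype=float)

# Values copied from the "Weights and biases" printout in the Output section.
W1 = np.array([[-5.50637624, 5.67926358],
               [5.69407248, -5.46834499]])
B1 = np.array([[2.81493644, 2.78774586]])
W2 = np.array([[-6.15130337],
               [-6.15441065]])
B2 = np.array([[9.01782245]])

A1 = sigmoid(np.dot(X, W1) + B1)   # hidden activations, shape (4, 2)
A2 = sigmoid(np.dot(A1, W2) + B2)  # network outputs, shape (4, 1)

print(np.round(A1, 3))
print(np.round(A2, 3))
```

In this particular solution each hidden unit switches off for one of the two mixed inputs ([1, 0] and [0, 1]), and the output unit fires unless both hidden units are on. Other seeds may converge to a different but equivalent arrangement.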

output_size = 1
Stores the number of output neurons (1) — the network predicts a single
scalar per input.

lr = 0.5 # learning rate

Sets the learning rate lr to 0.5; this scales the size of each gradient descent
update. The trailing comment labels it.

epochs = 10000 # number of training iterations
Sets the number of training iterations (full passes over the dataset) to
10,000. The comment explains its meaning.

W1 = np.random.uniform(-1.0, 1.0, (input_size, hidden_size))

Initializes the input-to-hidden weight matrix W1 with random values
uniformly drawn from the half-open interval [-1.0, 1.0). Its shape is (2,2): rows correspond to
input features, columns to hidden neurons.

B1 = np.zeros((1, hidden_size))
Initializes the hidden-layer bias B1 as a row vector of zeros with shape
(1,2). This will broadcast across the 4 samples when added.

W2 = np.random.uniform(-1.0, 1.0, (hidden_size, output_size))

Initializes the hidden-to-output weight matrix W2 randomly in [-1, 1) with
shape (2,1): rows are hidden units, column is the single output unit.

B2 = np.zeros((1, output_size))
Initializes the output-layer bias B2 as zeros with shape (1,1).

for epoch in range(epochs):

Begins the training loop that will run epochs times; epoch counts from 0 to
epochs-1. Each iteration performs one forward and backward pass over
the full dataset (batch gradient descent).

Z1 = np.dot(X, W1) + B1 # (4, hidden_size)

Computes the pre-activation of the hidden layer: Z1 = X · W1 + B1.
np.dot(X, W1) multiplies shape (4,2) × (2,2) → (4,2); adding B1 (1,2) uses
broadcasting to add the bias to every sample.
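
The broadcasting step is easier to see with numbers that are simple to follow by eye. In this sketch the weights and bias are placeholders (all-ones weights, bias 10 and 20), chosen only to make the arithmetic obvious; they are not the trained values.

```python
import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)

W1 = np.ones((2, 2))            # placeholder weights: each column sums the inputs
B1 = np.array([[10.0, 20.0]])   # placeholder bias row, shape (1, 2)

# (4, 2) x (2, 2) -> (4, 2); the (1, 2) bias row is then added to every row.
Z1 = np.dot(X, W1) + B1

print(Z1.shape)
print(Z1)
```

Each row of Z1 is the input sum plus the same bias pair, which is exactly what adding a per-neuron bias to every sample means.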

A1 = sigmoid(Z1) # hidden activations

Applies the sigmoid activation elementwise to Z1, producing hidden-layer
activations A1 with shape (4,2).

Z2 = np.dot(A1, W2) + B2 # (4, 1)

Computes the pre-activation of the output layer: Z2 = A1 · W2 + B2.
Shapes: (4,2) × (2,1) → (4,1); add bias B2 (1,1) via broadcasting.

A2 = sigmoid(Z2) # output activations

Applies sigmoid to Z2 to get the network's predicted outputs A2 (4,1),
values in (0,1).

loss = np.mean(0.5 * (Y - A2) ** 2)

Computes the scalar loss, a halved mean squared error: for each sample compute
0.5*(target - output)^2, then take the mean across samples. The 0.5
cancels the factor of 2 that appears when the square is differentiated.

dA2 = A2 - Y # derivative of MSE wrt A2
Computes the derivative of the per-sample loss w.r.t. the network output A2. For
0.5*(Y-A2)^2, the derivative is A2 - Y. Shape (4,1). The averaging over the
samples is applied later, when the weight and bias gradients are formed.
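
Written out, with N = 4 samples, targets y_n and outputs a_n (symbols introduced here for the derivation), the loss and its derivative are:

```latex
L = \frac{1}{N}\sum_{n=1}^{N} \tfrac{1}{2}\,\bigl(y_n - a_n\bigr)^{2},
\qquad
\frac{\partial L}{\partial a_n} = \frac{1}{N}\,\bigl(a_n - y_n\bigr)
```

The code stores only the factor (a_n - y_n) in dA2. The remaining 1/N does not get lost: it reappears as the division by the number of samples in the weight gradients and as the mean over samples in the bias gradients.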

dZ2 = dA2 * sigmoid_deriv(A2) # (4,1)

Applies the chain rule: derivative w.r.t. pre-activation Z2 equals dA2 *
σ'(Z2). Since sigmoid_deriv expects the sigmoid output, we pass A2.
Elementwise multiply gives shape (4,1).

dW2 = np.dot(A1.T, dZ2) / X.shape[0]

Computes the gradient of the loss w.r.t. W2: A1^T · dZ2 yields shape
(2,1). Dividing by X.shape[0] (4) averages the gradient across samples
(batch gradient).

dB2 = np.mean(dZ2, axis=0, keepdims=True)

Computes gradient of the loss w.r.t. bias B2 by averaging dZ2 across
samples, resulting in shape (1,1). keepdims=True preserves 2D shape for
broadcasting consistency.
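
The effect of keepdims can be seen on a small stand-in array. The dZ2 values below are made up for the illustration.

```python
import numpy as np

# Made-up per-sample gradients with the same shape as dZ2 in the program.
dZ2 = np.array([[0.1], [-0.2], [0.3], [0.2]])

with_keep = np.mean(dZ2, axis=0, keepdims=True)
without_keep = np.mean(dZ2, axis=0)

print(with_keep.shape)     # (1, 1): same shape as the bias B2
print(without_keep.shape)  # (1,): the sample axis has been dropped
```

Keeping the reduced axis means the gradient always has the same shape as the bias it updates, which makes shape mistakes easier to spot.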

dA1 = np.dot(dZ2, W2.T) # (4, hidden_size)

Backpropagates the gradient to the hidden activations: dA1 = dZ2 ·
W2^T. Shapes: (4,1) × (1,2) → (4,2). This represents how changes in
hidden activations change loss.

dZ1 = dA1 * sigmoid_deriv(A1)

Applies elementwise multiplication with the derivative of the sigmoid to
get gradient w.r.t. pre-activation Z1. sigmoid_deriv(A1) returns shape
(4,2), so dZ1 is (4,2).

dW1 = np.dot(X.T, dZ1) / X.shape[0]

Computes gradient w.r.t. W1 as X^T · dZ1 with shapes (2,4) × (4,2) →
(2,2), then divides by 4 to average over samples.

dB1 = np.mean(dZ1, axis=0, keepdims=True)

Computes gradient w.r.t. hidden bias B1 by averaging dZ1 across samples,
returning shape (1,2).
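
One way to gain confidence in the backward-pass formulas is a gradient check: compare the analytic dW1 with a finite-difference estimate of the loss slope. The sketch below is not part of the original program. The helper forward_loss, the step eps, and the use of np.random.default_rng(0) for the test weights are all assumptions made for this illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def forward_loss(X, Y, W1, B1, W2, B2):
    # Hypothetical helper: forward pass plus the same loss as the program.
    A1 = sigmoid(np.dot(X, W1) + B1)
    A2 = sigmoid(np.dot(A1, W2) + B2)
    return np.mean(0.5 * (Y - A2) ** 2), A1, A2

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
Y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(0)  # arbitrary test weights, not the trained ones
W1 = rng.uniform(-1.0, 1.0, (2, 2))
B1 = np.zeros((1, 2))
W2 = rng.uniform(-1.0, 1.0, (2, 1))
B2 = np.zeros((1, 1))

# Analytic gradient, using the same formulas as the training loop.
_, A1, A2 = forward_loss(X, Y, W1, B1, W2, B2)
dZ2 = (A2 - Y) * A2 * (1.0 - A2)
dZ1 = np.dot(dZ2, W2.T) * A1 * (1.0 - A1)
dW1 = np.dot(X.T, dZ1) / X.shape[0]

# Numerical gradient: central difference on each entry of W1.
eps = 1e-6
num_dW1 = np.zeros_like(W1)
for i in range(W1.shape[0]):
    for j in range(W1.shape[1]):
        W_plus, W_minus = W1.copy(), W1.copy()
        W_plus[i, j] += eps
        W_minus[i, j] -= eps
        loss_plus = forward_loss(X, Y, W_plus, B1, W2, B2)[0]
        loss_minus = forward_loss(X, Y, W_minus, B1, W2, B2)[0]
        num_dW1[i, j] = (loss_plus - loss_minus) / (2.0 * eps)

max_diff = np.max(np.abs(dW1 - num_dW1))
print(max_diff)  # should be very small if the formulas are right
```

The same loop can be pointed at W2, B1 or B2 to check the other three gradients.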

W2 -= lr * dW2
Updates the output weights W2 by subtracting the learning-rate-scaled
gradient (gradient descent step).

B2 -= lr * dB2
Updates the output bias B2 similarly.

W1 -= lr * dW1
Updates input-to-hidden weights W1 with gradient descent.

B1 -= lr * dB1
Updates hidden bias B1 with gradient descent.
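
The four updates all follow the same rule: parameter minus learning rate times gradient. A one-parameter toy problem, assumed here purely for illustration, shows the rule in isolation: minimizing 0.5 * (w - 3)^2, whose gradient is w - 3.

```python
# Toy problem (not from the program): minimize 0.5 * (w - 3)**2.
w = 0.0
lr = 0.5

for step in range(10):
    grad = w - 3.0   # derivative of 0.5 * (w - 3)**2 with respect to w
    w -= lr * grad   # same update form as W1 -= lr * dW1

print(w)  # close to the minimizer 3.0
```

With lr = 0.5 the distance to the minimum halves on every step. The XOR loss surface is far less regular, which is consistent with the long plateau near 0.125 in the printed losses before the drop around epoch 5000.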
if epoch % 1000 == 0 or epoch == epochs - 1:
Checks whether to print progress: either every 1000 epochs, or the very
last epoch.

print(f"Epoch {epoch:5d} Loss: {loss:.6f}")

If the condition is met, prints the current epoch number and the loss
formatted to 6 decimal places so you can observe training progress.

print("\nTrained predictions on XOR inputs:")

After training finishes, prints a header line announcing that final
predictions follow.

Z1 = np.dot(X, W1) + B1
Recomputes hidden pre-activations using the final trained W1 and B1.

A1 = sigmoid(Z1)
Computes final hidden activations.

Z2 = np.dot(A1, W2) + B2
Computes final output pre-activations.

A2 = sigmoid(Z2)
Computes the final network outputs for the training inputs (raw values in
(0,1)).

print(np.hstack((X, Y, A2, np.round(A2))))

Horizontally stacks and prints the input X, target Y, raw output A2, and
rounded output np.round(A2) (0 or 1). This shows inputs, expected
outputs, predicted probabilities, and final discrete predictions side-by-side.
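
A small sketch of the stacking, using rounded stand-ins for the network outputs rather than the exact trained values:

```python
import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
Y = np.array([[0], [1], [1], [0]], dtype=float)
A2 = np.array([[0.07], [0.92], [0.92], [0.06]])  # rounded stand-ins for the outputs

# Column layout: x1, x2, target, raw output, rounded output.
table = np.hstack((X, Y, A2, np.round(A2)))

print(table.shape)
print(table)
```

All four arrays must have the same number of rows (4) for the stack to work; the column counts 2 + 1 + 1 + 1 add up to the 5 columns of the result.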

print("\nWeights and biases:")

Prints a header announcing that the learned weights and biases will be
displayed.

print("W1:\n", W1)
Prints the final W1 matrix (input→hidden weights).

print("B1:\n", B1)
Prints the final B1 bias row for the hidden layer.

print("W2:\n", W2)
Prints the final W2 matrix (hidden→output weights).

print("B2:\n", B2)
Prints the final B2 bias for the output layer (a 1×1 array).

preds = np.round(A2)
Rounds the final raw outputs A2 to 0 or 1 and stores them in preds.

accuracy = np.mean(preds == Y)
Computes accuracy as the mean of the boolean array preds == Y.
Booleans convert to 1/0, so the mean is the fraction of correct predictions.
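
To make the boolean-mean idea concrete, the sketch below uses made-up outputs in which the third prediction is deliberately wrong.

```python
import numpy as np

Y = np.array([[0], [1], [1], [0]], dtype=float)
A2 = np.array([[0.07], [0.92], [0.40], [0.06]])  # made up: third output is wrong

preds = np.round(A2)
correct = preds == Y          # boolean array, True where the prediction matches
accuracy = np.mean(correct)   # True counts as 1, False as 0

print(correct.ravel())
print(f"{accuracy * 100:.1f}%")  # three of four correct
```

One detail worth knowing: NumPy rounds exact halves to the nearest even value, so an output of exactly 0.5 would round to 0 rather than 1.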

print(f"\nAccuracy (after rounding): {accuracy * 100:.1f}%")

Prints the accuracy as a percentage with one decimal place.
