Data Preparation for Neural Networks

The document outlines the process of building a simple artificial neural network (ANN) using the Iris dataset. It details data preparation, model structure definition, parameter initialization, forward propagation, and cost computation. Key steps include removing unnecessary data, defining input and hidden layer sizes, and implementing the sigmoid activation function and cross-entropy cost function.

Uploaded by

ghulamfiaz6

In [124]: import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
import pandas as pd
import seaborn as sns
import numpy as np
import math
from sklearn import datasets

Prerequisite - Prepare the Data

Remove null values
Remove data without labels
Remove columns not needed for training
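
The three clean-up steps above can be sketched with pandas as follows. This is only an illustration on a hypothetical four-row frame: the `sample_id` column, the injected NaN values, and the names `raw` and `clean` are assumptions made for the example and do not come from the notebook.

```python
import numpy as np
import pandas as pd

# Hypothetical toy frame (not the Iris data): one row lacks a label,
# one row has a missing feature, and `sample_id` is not needed for training.
raw = pd.DataFrame({
    "sepal length (cm)": [5.1, 4.9, np.nan, 6.3],
    "sepal width (cm)": [3.5, 3.0, 3.2, 2.9],
    "sample_id": ["a", "b", "c", "d"],
    "target": [0.0, np.nan, 0.0, 1.0],
})

clean = raw.dropna(subset=["target"])      # remove data without labels
clean = clean.dropna()                     # remove remaining null values
clean = clean.drop(columns=["sample_id"])  # remove columns not needed for training

print(clean)
```

Dropping unlabeled rows first keeps the intent of each step visible, although a single `dropna()` would remove both kinds of rows in this toy case.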

In [125]: iris_data = load_iris()

df = pd.DataFrame(data=iris_data.data,
                  columns=iris_data.feature_names)
df['target'] = iris_data.target

In [126]: df.head()

Out[126]:
   sepal length (cm)  sepal width (cm)  petal length (cm)  petal width (cm)  target
0                5.1               3.5                1.4               0.2       0
1                4.9               3.0                1.4               0.2       0
2                4.7               3.2                1.3               0.2       0
3                4.6               3.1                1.5               0.2       0
4                5.0               3.6                1.4               0.2       0

In [127]: df.drop(df[df.target == 2].index, inplace=True)

In [128]: sns.scatterplot(data=df, x="sepal length (cm)", y="sepal width (cm)", hue="targ

Out[128]: <Axes: xlabel='sepal length (cm)', ylabel='sepal width (cm)'>

In [129]: # build a simple ANN to predict the target based on "sepal length (cm)" and "se
# X contains two features ("sepal length (cm)", "sepal width (cm)")
# Y contains the labels (blue:0, orange:1)

In [130]: x1 = (df[["sepal length (cm)", "sepal width (cm)"]]).to_numpy()

y1 = (df['target']).to_numpy()
m = x1.shape[0]
y1 = y1.reshape([m, 1])

print ('The shape of X is: ' + str(x1.shape))
print ('The shape of Y is: ' + str(y1.shape))
print ('I have m = %d training examples!' % (m))

The shape of X is: (100, 2)

The shape of Y is: (100, 1)
I have m = 100 training examples!

!!! Note: the data should have the following shapes !!!
X[number of features in the input, number of samples in the training set]
Y[number of output nodes, number of samples in the training set]

Apply np.transpose if this is not the case.

In [131]: X = np.transpose(x1)
Y = np.transpose(y1)

In [132]: print ('The shape of X is: ' + str(X.shape))
print ('The shape of Y is: ' + str(Y.shape))
print ('I have m = %d training examples!' % (m))

The shape of X is: (2, 100)

The shape of Y is: (1, 100)
I have m = 100 training examples!
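
A quick way to guard against the wrong layout is a small check before training. The sketch below is illustrative only: `check_layout` and the zero-filled stand-in arrays are hypothetical names introduced here, not part of the notebook.

```python
import numpy as np

def check_layout(X, Y):
    """Return (n_x, n_y, m) if X is (n_x, m) and Y is (n_y, m), else raise ValueError."""
    if X.ndim != 2 or Y.ndim != 2:
        raise ValueError("X and Y must both be 2-D arrays")
    if X.shape[1] != Y.shape[1]:
        raise ValueError(f"X has {X.shape[1]} examples but Y has {Y.shape[1]}")
    return X.shape[0], Y.shape[0], X.shape[1]

# Stand-ins with the same shapes as x1 and y1 (samples in rows).
x_rows = np.zeros((100, 2))
y_rows = np.zeros((100, 1))

# After transposing, examples run along the columns.
n_x_demo, n_y_demo, m_demo = check_layout(x_rows.T, y_rows.T)
print(n_x_demo, n_y_demo, m_demo)
```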

Neural Network model

1. Define the neural network structure ( # of input units, # of hidden units, etc).
2. Initialize the model's parameters
3. Loop:
Implement forward propagation
Compute loss
Implement backward propagation to get the gradients
Update parameters (gradient descent)

Step 1: Define the Model Structure

X -- input dataset of shape (input size, number of examples)
H -- number of hidden layers
Y -- labels of shape (output size, number of examples)
n_x -- the size of the input layer
n_h -- the number of neurons in the hidden layer
n_y -- the size of the output layer
m -- The number of examples in the training set

In [133]: h = 1 # we are setting the number of hidden layers in the model to 1

n_x = X.shape[0]
m = X.shape[1]
n_h = 4 # we are setting the number of neurons in the hidden layer to 4
n_y = Y.shape[0]

print("INPUT PARAMETERS")
print("Size of the input layer - ", n_x)
print("Number of examples in the training set - ", m)

print("HIDDEN LAYER PARAMETERS")
print("Number of hidden layers - ", h)
print("Number of neurons in the hidden layer - ", n_h)

print("OUTPUT LAYER")
print("Size of the output layer - ", n_y)

INPUT PARAMETERS
Size of the input layer - 2
Number of examples in the training set - 100
HIDDEN LAYER PARAMETERS
Number of hidden layers - 1
Number of neurons in the hidden layer - 4
OUTPUT LAYER
Size of the output layer - 1
Step 2: Initialize Model Parameters

Initialise the following parameters in the model:

W1 -- weight matrix of shape (n_h, n_x)
b1 -- bias vector of shape (n_h, 1)
W2 -- weight matrix of shape (n_y, n_h)
b2 -- bias vector of shape (n_y, 1)

The weights ("W") are initialised to very small random values so that W starts out close to the centre of
the tanh or sigmoid function, where the slope is steepest. Starting with large values of W would push the
activations into the flat, saturated regions of these functions and slow down learning. The biases ("b")
are initialised to 0.

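
To make the saturation argument concrete, the sketch below evaluates the tanh slope, 1 - tanh(z)^2, for the same random weights at three different scales. The helper name `tanh_grad`, the chosen scales, and the random generator seed are assumptions for illustration; the input is the first Iris row shown in the table above.

```python
import numpy as np

def tanh_grad(z):
    """Derivative of tanh at z: 1 - tanh(z)^2."""
    return 1.0 - np.tanh(z) ** 2

rng = np.random.default_rng(0)
x = np.array([5.1, 3.5])          # sepal length and width of the first sample
w = rng.standard_normal(2)        # hypothetical unscaled weights

for scale in (0.01, 1.0, 10.0):
    z = float(np.dot(scale * w, x))
    # A slope near 1 means gradients pass through; a slope near 0 means saturation.
    print(f"scale={scale:<5} z={z:+.4f} slope={tanh_grad(z):.6f}")
```

With the 0.01 factor the pre-activation stays close to zero, so the slope stays close to its maximum of 1.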
In [134]: def initialize_parameters(n_x, n_h, n_y):
    """
    Argument:
    n_x -- size of the input layer
    n_h -- size of the hidden layer
    n_y -- size of the output layer

    Returns:
    params -- python dictionary containing your parameters:
                    W1 -- weight matrix of shape (n_h, n_x)
                    b1 -- bias vector of shape (n_h, 1)
                    W2 -- weight matrix of shape (n_y, n_h)
                    b2 -- bias vector of shape (n_y, 1)
    """
    # Small random weights, zero biases
    W1 = np.random.randn(n_h, n_x) * 0.01
    b1 = np.zeros((n_h, 1))
    W2 = np.random.randn(n_y, n_h) * 0.01
    b2 = np.zeros((n_y, 1))

    parameters = {"W1": W1,
                  "b1": b1,
                  "W2": W2,
                  "b2": b2}

    return parameters

In [135]: np.random.seed(2)
parameters = initialize_parameters(n_x, n_h, n_y)

print("W1 = " + str(parameters["W1"]))
print("b1 = " + str(parameters["b1"]))
print("W2 = " + str(parameters["W2"]))
print("b2 = " + str(parameters["b2"]))

# Note - matrix structure W[output_node,input_node] and b[output_node,input_nod
# W1 [ 1,1 1,2
#      2,1 2,2
#      3,1 3,2
#      4,1 4,2]
# b1 [ 1,1
#      2,1
#      3,1
#      4,1]

W1 = [[-0.00416758 -0.00056267]
[-0.02136196 0.01640271]
[-0.01793436 -0.00841747]
[ 0.00502881 -0.01245288]]
b1 = [[0.]
[0.]
[0.]
[0.]]
W2 = [[-0.01057952 -0.00909008 0.00551454 0.02292208]]
b2 = [[0.]]

Step 3: Forward Propagation

Move from the input layer to the output layer.

Activation function

$$a_j^{[l]} = g^{[l]}\left(\sum_k w_{jk}^{[l]} \, a_k^{[l-1]} + b_j^{[l]}\right)$$

Where
l - lth layer
j - jth node in layer l
k - kth node in layer l-1

# activation for node 1 in layer 1 can be computed as follows,
a1 = tanh(w1[1,1] * x1 + w1[1,2] * x2 + b1[1,1])
# activation for node 2 in layer 1 can be computed as follows,
a2 = tanh(w1[2,1] * x1 + w1[2,2] * x2 + b1[2,1])
# activation for node 3 in layer 1 can be computed as follows,
a3 = tanh(w1[3,1] * x1 + w1[3,2] * x2 + b1[3,1])
# activation for node 4 in layer 1 can be computed as follows,
a4 = tanh(w1[4,1] * x1 + w1[4,2] * x2 + b1[4,1])

Alternatively, the same can be computed using the following matrix operations:

$$Z^{[1]} = W^{[1]} X + b^{[1]}$$
$$A^{[1]} = \tanh(Z^{[1]})$$
$$Z^{[2]} = W^{[2]} A^{[1]} + b^{[2]}$$
$$\hat{Y} = A^{[2]} = \sigma(Z^{[2]})$$

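
As a sanity check, the node-by-node sum and the matrix form of the hidden layer should give identical activations. The sketch below compares the two for a single sample; the random weights, the seed, and the names `a_loop` and `a_mat` are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(1)
W1_demo = rng.standard_normal((4, 2)) * 0.01   # (n_h, n_x), hypothetical values
b1_demo = np.zeros((4, 1))                     # (n_h, 1)
x_demo = np.array([[5.1], [3.5]])              # one sample as a column, (n_x, 1)

# Node by node: a_j = tanh(sum_k w_jk * x_k + b_j)
a_loop = np.zeros((4, 1))
for j in range(4):
    z_j = sum(W1_demo[j, k] * x_demo[k, 0] for k in range(2)) + b1_demo[j, 0]
    a_loop[j, 0] = np.tanh(z_j)

# Matrix form: A1 = tanh(W1 X + b1)
a_mat = np.tanh(W1_demo @ x_demo + b1_demo)

print(np.allclose(a_loop, a_mat))
```

The matrix form also handles all m samples at once, because b1 broadcasts across the columns of W1 X.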
In [136]: def sigmoid(z):
    """
    Compute the sigmoid of z

    Arguments:
    z -- A scalar or numpy array of any size.

    Return:
    s -- sigmoid(z)
    """
    s = 1 / (1 + np.exp(-z))
    return s

In [137]: def forward_propagation(X, parameters):
    """
    Argument:
    X -- input data of size (n_x, m)
    parameters -- python dictionary containing your parameters (output of initi

    Returns:
    A2 -- The sigmoid output of the second activation
    cache -- a dictionary containing "Z1", "A1", "Z2" and "A2"
    """
    # Retrieve each parameter from the dictionary "parameters"
    W1 = parameters["W1"]
    b1 = parameters["b1"]
    W2 = parameters["W2"]
    b2 = parameters["b2"]

    # Implement Forward Propagation to calculate A2 (probabilities)
    Z1 = np.dot(W1, X) + b1    # hidden layer pre-activation, shape (n_h, m)
    A1 = np.tanh(Z1)           # hidden layer activation
    Z2 = np.dot(W2, A1) + b2   # output layer pre-activation, shape (n_y, m)
    A2 = sigmoid(Z2)           # output probabilities

    cache = {"Z1": Z1,
             "A1": A1,
             "Z2": Z2,
             "A2": A2}

    return A2, cache

In [138]: A2, cache = forward_propagation(X, parameters)

In [139]: print(A2)

[[0.49990974 0.49995621 0.49991974 0.49992521 0.4998915 0.49988153

0.49988965 0.4999152 0.49993615 0.49994436 0.49990522 0.49990243
0.49994982 0.49991792 0.49989526 0.49984151 0.49988153 0.49990974
0.49991255 0.4998742 0.49994074 0.49988605 0.49986595 0.49993343
0.49990243 0.49996259 0.4999152 0.49991613 0.49992797 0.49991974
0.49993797 0.49994074 0.49984505 0.4998524 0.49994436 0.49993889
0.49993529 0.49988511 0.4999243 0.49992158 0.49990335 0.50001364
0.49990059 0.49990335 0.4998742 0.49994982 0.4998742 0.49991336
0.49989883 0.49992705 0.50006651 0.50002825 0.50007195 0.50007736
0.50008192 0.50003092 0.50001004 0.50002729 0.50007647 0.50001088
0.50008104 0.50002001 0.50012102 0.5000446 0.50001271 0.5000592
0.50000087 0.50004913 0.50013374 0.50006006 0.49999634 0.50005642
0.50010464 0.50005642 0.50006372 0.50006465 0.50010103 0.50007102
0.50003822 0.50005459 0.50006552 0.50006552 0.50004913 0.50006188
0.49998811 0.49997906 0.5000592 0.50012829 0.50000087 0.50005368
0.50004185 0.50003277 0.50006097 0.50004551 0.50003638 0.50000725
0.50001909 0.50005097 0.50002819 0.50003092]]

Step 3.1: Compute the Cost function

A cost function measures the performance of a machine learning model on a data set. It quantifies the
error between predicted and expected values and presents that error as a single real number.

Logistic regression
It models the relationship between input features and the probability of an event occurring, where the
event is typically represented by a binary outcome (0 or 1). The logistic function is also known as the
sigmoid function.

$$\sigma(z) = \frac{1}{1 + e^{-z}}$$
e is the base of the natural logarithm, approximately equal to 2.718

Cost function for logistic regression

For linear regression, the conventional cost function is the Mean Squared Error.

$$MSE = \frac{1}{2m} \sum_{i=1}^{m} (\hat{y}_i - y_i)^2$$

i = index of sample
ŷ = predicted value
y = expected value
m = number of samples in the data set

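The same cannot be used for logistic regression: the sigmoid function is a nonlinear transformation,
and evaluating it inside the Mean Squared Error formula results in a non-convex cost function that
can have multiple local minima.

For logistic regression, the cost of a single example is defined as:

$$\text{cost}(\hat{y}, y) = \begin{cases} -\log(\hat{y}) & \text{if } y = 1 \\ -\log(1 - \hat{y}) & \text{if } y = 0 \end{cases}$$

Putting it together and averaging over the m examples, we get the cross-entropy cost:

$$J = -\frac{1}{m} \sum_{i=1}^{m} \left[ y_i \log(\hat{y}_i) + (1 - y_i) \log(1 - \hat{y}_i) \right]$$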

In [140]: def compute_cost(A2, Y):
    """
    Computes the cross-entropy cost given in equation

    Arguments:
    A2 -- The sigmoid output of the second activation, of shape (1, number of e
    Y -- "true" labels vector of shape (1, number of examples)

    Returns:
    cost -- cross-entropy cost
    """
    m = Y.shape[1]
    # Per-example log-likelihood terms, then the negative mean over m examples
    logprobs = np.multiply(Y, np.log(A2)) + np.multiply(1 - Y, np.log(1 - A2))
    cost = -1/m * np.sum(logprobs)

    cost = float(np.squeeze(cost))

    return cost

In [141]: cost = compute_cost(A2, Y)

print("cost = ",cost)

cost = 0.6930099444143113
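
A cost of roughly 0.693 is what the cross-entropy gives when every prediction is 0.5, since -log(0.5) = ln 2 ≈ 0.6931, and the freshly initialised network outputs values very close to 0.5. The sketch below illustrates this on four hypothetical labels; `binary_cross_entropy` and the demo arrays are names introduced here for illustration only.

```python
import math
import numpy as np

def binary_cross_entropy(a, y):
    """Mean cross-entropy for predictions a and labels y, both of shape (1, m)."""
    m = y.shape[1]
    return float(-np.sum(y * np.log(a) + (1 - y) * np.log(1 - a)) / m)

y_demo = np.array([[0, 0, 1, 1]])
uninformed = np.full((1, 4), 0.5)                    # "no idea" predictions
confident_right = np.array([[0.1, 0.2, 0.9, 0.8]])   # leaning towards the true labels
confident_wrong = 1 - confident_right                # leaning the wrong way

print("uninformed     :", binary_cross_entropy(uninformed, y_demo))
print("ln 2           :", math.log(2))
print("confident right:", binary_cross_entropy(confident_right, y_demo))
print("confident wrong:", binary_cross_entropy(confident_wrong, y_demo))
```

Training should therefore drive the cost well below ln 2; a cost that stays near 0.693 suggests the outputs are still hovering around 0.5.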

Step 4: Back Propagation

Backpropagation is the essence of neural net training. It is the practice of fine-tuning the weights of a
neural net based on the error rate (i.e. loss) obtained in the previous epoch (i.e. iteration.) Proper tuning
of the weights ensures lower error rates, making the model reliable by increasing its generalization.

Limitations of backpropagation

Training data can impact the performance of the model, so high-quality data is essential.
Noisy data can also affect backpropagation, potentially tainting its results.
It can take a while to train backpropagation models and get them up to speed.
Backpropagation requires a matrix-based approach, which can lead to other issues.

Deep learning neural networks are trained using the stochastic gradient descent algorithm.
Stochastic gradient descent is an optimization algorithm that estimates the error gradient for the current
state of the model using examples from the training dataset, then updates the weights of the model using
the back-propagation of errors algorithm, referred to simply as backpropagation.

The lower the cost, the closer the predicted output is to the actual output. So, to minimize the cost
function we use gradient descent to move the parameters towards a minimum.

For this network, backpropagation is computed using the following formulas, where * is the element-wise
product and each sum runs over the m training examples (the columns of dZ):

$$dZ^{[2]} = A^{[2]} - Y$$
$$dW^{[2]} = \frac{1}{m} \, dZ^{[2]} A^{[1]T}$$
$$db^{[2]} = \frac{1}{m} \sum_{i=1}^{m} dZ^{[2](i)}$$
$$dZ^{[1]} = W^{[2]T} dZ^{[2]} * \left(1 - (A^{[1]})^2\right)$$
$$dW^{[1]} = \frac{1}{m} \, dZ^{[1]} X^{T}$$
$$db^{[1]} = \frac{1}{m} \sum_{i=1}^{m} dZ^{[1](i)}$$

In [142]: def backward_propagation(parameters, cache, X, Y):
    """
    Implement the backward propagation using the instructions above.

    Arguments:
    parameters -- python dictionary containing our parameters
    cache -- a dictionary containing "Z1", "A1", "Z2" and "A2".
    X -- input data of shape (2, number of examples)
    Y -- "true" labels vector of shape (1, number of examples)

    Returns:
    grads -- python dictionary containing your gradients with respect to differ
    """
    m = X.shape[1]
    W1 = parameters["W1"]
    W2 = parameters["W2"]
    A1 = cache['A1']
    A2 = cache['A2']

    # Output layer gradients (sigmoid + cross-entropy)
    dZ2 = A2 - Y
    dW2 = (1./m) * np.dot(dZ2, np.transpose(A1))
    db2 = (1./m) * np.sum(dZ2, axis=1, keepdims=True)
    # Hidden layer gradients; 1 - A1^2 is the derivative of tanh
    dZ1 = np.dot(np.transpose(W2), dZ2) * (1 - np.power(A1, 2))
    dW1 = (1./m) * np.dot(dZ1, np.transpose(X))
    db1 = (1./m) * np.sum(dZ1, axis=1, keepdims=True)

    grads = {"dW1": dW1,
             "db1": db1,
             "dW2": dW2,
             "db2": db2}

    return grads

In [143]: grads = backward_propagation(parameters, cache, X, Y)

print ("dW1 = "+ str(grads["dW1"]))
print ("db1 = "+ str(grads["db1"]))
print ("dW2 = "+ str(grads["dW2"]))
print ("db2 = "+ str(grads["db2"]))

dW1 = [[ 0.00245638 -0.00173974]

[ 0.00205219 -0.00151646]
[-0.00124116 0.00090428]
[-0.0053392 0.00376265]]
db1 = [[-2.58938841e-07]
[-9.10664254e-06]
[ 3.68058997e-06]
[-2.08060949e-06]]
dW2 = [[ 0.0008762 0.00762958 0.00274315 -0.00321645]]
db2 = [[-1.91004228e-05]]
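
One way to gain confidence in analytic gradients like these is a finite-difference check: nudge each parameter by a small amount and compare the resulting change in cost with the analytic gradient. The sketch below is a self-contained illustration on random toy data, assuming the same tanh hidden layer, sigmoid output, and cross-entropy cost; the helper names (`cost_fn`, `analytic_grads`, `numeric_grads`), the data, and the step size `eps` are assumptions, not the notebook's code.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost_fn(params, X, Y):
    """Cross-entropy cost of the 2-layer network for the given parameters."""
    A1 = np.tanh(params["W1"] @ X + params["b1"])
    A2 = sigmoid(params["W2"] @ A1 + params["b2"])
    m = Y.shape[1]
    return float(-np.sum(Y * np.log(A2) + (1 - Y) * np.log(1 - A2)) / m)

def analytic_grads(params, X, Y):
    """Gradients from the backpropagation formulas."""
    m = X.shape[1]
    A1 = np.tanh(params["W1"] @ X + params["b1"])
    A2 = sigmoid(params["W2"] @ A1 + params["b2"])
    dZ2 = A2 - Y
    dZ1 = (params["W2"].T @ dZ2) * (1 - A1 ** 2)
    return {
        "W1": dZ1 @ X.T / m,
        "b1": dZ1.sum(axis=1, keepdims=True) / m,
        "W2": dZ2 @ A1.T / m,
        "b2": dZ2.sum(axis=1, keepdims=True) / m,
    }

def numeric_grads(params, X, Y, eps=1e-6):
    """Central-difference estimate of the same gradients."""
    grads = {}
    for name, value in params.items():
        g = np.zeros_like(value)
        for idx in np.ndindex(value.shape):
            old = value[idx]
            value[idx] = old + eps
            plus = cost_fn(params, X, Y)
            value[idx] = old - eps
            minus = cost_fn(params, X, Y)
            value[idx] = old                      # restore the parameter
            g[idx] = (plus - minus) / (2 * eps)
        grads[name] = g
    return grads

# Hypothetical toy problem: 2 features, 5 examples, 4 hidden units.
rng = np.random.default_rng(0)
X_demo = rng.standard_normal((2, 5))
Y_demo = np.array([[0, 1, 0, 1, 1]], dtype=float)
params_demo = {
    "W1": rng.standard_normal((4, 2)) * 0.5,
    "b1": np.zeros((4, 1)),
    "W2": rng.standard_normal((1, 4)) * 0.5,
    "b2": np.zeros((1, 1)),
}

ana = analytic_grads(params_demo, X_demo, Y_demo)
num = numeric_grads(params_demo, X_demo, Y_demo)
max_diff = max(float(np.max(np.abs(ana[k] - num[k]))) for k in ana)
print("largest analytic/numeric difference:", max_diff)
```

A difference many orders of magnitude smaller than the gradients themselves suggests the formulas and their implementation agree.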

Step 5: Update Parameters

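The amount by which the weights are updated during training is referred to as the step size or the “learning
rate”. It is a configurable hyperparameter used in the training of neural networks that has a small positive
value, often in the range between 0.0 and 1.0. Gradient descent updates each parameter by subtracting the
learning rate times the corresponding gradient, for example W1 = W1 - learning_rate * dW1.
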
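
The effect of the learning rate is easiest to see on a toy problem. The sketch below runs gradient descent on f(w) = w^2, whose gradient is 2w; the function `descend`, the starting point, and the three rates are assumptions chosen for illustration. Whether a particular rate is too large depends on the curvature of the cost being minimised, so these numbers do not transfer directly to the network above.

```python
def descend(learning_rate, steps=20, w=1.0):
    """Run `steps` gradient descent updates on f(w) = w**2 and return the final w."""
    for _ in range(steps):
        grad = 2.0 * w                    # derivative of w**2
        w = w - learning_rate * grad      # gradient descent update rule
    return w

for lr in (0.1, 0.5, 1.2):
    print(f"learning_rate={lr}: final w = {descend(lr)}")
```

Here each update multiplies w by (1 - 2 * learning_rate), so a small rate shrinks w towards the minimum at 0, while a rate above 1.0 makes the magnitude of w grow at every step.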
In [122]: def update_parameters(parameters, grads, learning_rate = 1.2):
    """
    Updates parameters using the gradient descent update rule given above

    Arguments:
    parameters -- python dictionary containing your parameters
    grads -- python dictionary containing your gradients

    Returns:
    parameters -- python dictionary containing your updated parameters
    """

    W1 = parameters['W1']
    b1 = parameters['b1']
    W2 = parameters['W2']
    b2 = parameters['b2']
    dW1 = grads['dW1']
    db1 = grads['db1']
    dW2 = grads['dW2']
    db2 = grads['db2']

    # Move each parameter against its gradient
    W1 = W1 - learning_rate * dW1
    b1 = b1 - learning_rate * db1
    W2 = W2 - learning_rate * dW2
    b2 = b2 - learning_rate * db2

    parameters = {"W1": W1,
                  "b1": b1,
                  "W2": W2,
                  "b2": b2}

    return parameters

In [123]: print("FORWARD PROPOGATION")
print("W1 = " + str(parameters["W1"]))
print("b1 = " + str(parameters["b1"]))
print("W2 = " + str(parameters["W2"]))
print("b2 = " + str(parameters["b2"]))
cost = compute_cost(A2, Y)
print("cost = ",cost)

new_parameters = update_parameters(parameters, grads)

print("BACK PROPOGATION")
print("W1 = " + str(new_parameters["W1"]))
print("b1 = " + str(new_parameters["b1"]))
print("W2 = " + str(new_parameters["W2"]))
print("b2 = " + str(new_parameters["b2"]))

A2, cache = forward_propagation(X, new_parameters)
cost = compute_cost(A2, Y)
print("cost = ",cost)

FORWARD PROPOGATION
W1 = [[-0.00416758 -0.00056267]
[-0.02136196 0.01640271]
[-0.01793436 -0.00841747]
[ 0.00502881 -0.01245288]]
b1 = [[0.]
[0.]
[0.]
[0.]]
W2 = [[-0.01057952 -0.00909008 0.00551454 0.02292208]]
b2 = [[0.]]
cost = 0.6930099444143113
BACK PROPOGATION
W1 = [[-0.00711524 0.00152502]
[-0.02382459 0.01822246]
[-0.01644496 -0.0095026 ]
[ 0.01143585 -0.01696806]]
b1 = [[ 3.10726609e-07]
[ 1.09279710e-05]
[-4.41670797e-06]
[ 2.49673138e-06]]
W2 = [[-0.01163097 -0.01824557 0.00222276 0.02678182]]
b2 = [[2.29205074e-05]]
cost = 0.6928296515387732

Putting it all together

Building your first neural network model
In [148]: def nn_model(X, Y, n_h, num_iterations = 10000, print_cost=True):
    """
    Arguments:
    X -- dataset of shape (2, number of examples)
    Y -- labels of shape (1, number of examples)
    n_h -- size of the hidden layer
    num_iterations -- Number of iterations in gradient descent loop
    print_cost -- if True, print the cost every 1000 iterations

    Returns:
    parameters -- parameters learnt by the model. They can then be used to pred
    """

    np.random.seed(3)
    n_x = X.shape[0]
    n_y = Y.shape[0]
    parameters = initialize_parameters(n_x, n_h, n_y)

    for i in range(0, num_iterations):

        # Forward propagation.
        A2, cache = forward_propagation(X, parameters)
        # Cost function.
        cost = compute_cost(A2, Y)
        # Backpropagation.
        grads = backward_propagation(parameters, cache, X, Y)

        # Gradient descent parameter update.
        parameters = update_parameters(parameters, grads)

        if print_cost and i % 1000 == 0:
            print ("Cost after iteration %i: %f" %(i, cost))

    return parameters

In [149]: nn_model(X,Y,4)

Cost after iteration 0: 0.693156

Cost after iteration 1000: 0.693145
Cost after iteration 2000: 0.693142
Cost after iteration 3000: 0.693148
Cost after iteration 4000: 0.693147
Cost after iteration 5000: 0.693147
Cost after iteration 6000: 0.693146
Cost after iteration 7000: 0.693145
Cost after iteration 8000: 0.693143
Cost after iteration 9000: 0.693118

Out[149]: {'W1': array([[ 1.17690894, -0.02226817],

[ 1.62165152, -1.00347165],
[-1.1327999 , 0.17950776],
[ 1.20505408, -0.09474608]]),
'b1': array([[ 0.1130491 ],
[ 0.01616223],
[-0.08719592],
[ 0.10884186]]),
'W2': array([[-0.07967349, 0.41647047, -0.06831913, -0.03922782]]),
'b2': array([[-0.36577288]])}

Model Prediction
Predict with your model by building predict(). Use forward propagation to predict results.
Reminder: predictions = $y_{prediction} = \mathbb{1}\{\text{activation} > 0.5\} = \begin{cases} 1 & \text{if activation} > 0.5 \\ 0 & \text{otherwise} \end{cases}$

In [150]: def predict(parameters, X):
    """
    Using the learned parameters, predicts a class for each example in X

    Arguments:
    parameters -- python dictionary containing your parameters
    X -- input data of size (n_x, m)

    Returns
    predictions -- vector of predictions of our model (blue: 0 / orange: 1)
    """

    A2, cache = forward_propagation(X, parameters)

    # Threshold the output probabilities at 0.5 (boolean array of shape (1, m))
    predictions = (A2 > 0.5)
    return predictions

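In [151]: # Note: `parameters` is the global dictionary created in In [135];
# the dictionary returned by nn_model in In [149] was not assigned to a variable.
predictions = predict(parameters, X)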

print("Predictions: " + str(predictions))

Predictions: [[False False False False False False False False False False False False
False False False False False False False False False False False False
False False False False False False False False False False False False
False False False False False True False False False False False False
False False True True True True True True True True True True
True True True True True True True True True True False True
True True True True True True True True True True True True
False False True True True True True True True True True True
True True True True]]

In [152]: print ('Accuracy: %d' % float((np.dot(Y, predictions.T) + np.dot(1 - Y, 1 - pre

Accuracy: 96%
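
Accuracy is the percentage of examples whose predicted class matches the label. The sketch below shows two equivalent ways to compute it on five hypothetical examples: a direct comparison, and a dot-product form that counts matching ones and matching zeros separately. The function names and demo arrays are assumptions for illustration and are not a reconstruction of the truncated cell above.

```python
import numpy as np

def accuracy_percent(Y, predictions):
    """Share of matching entries, in percent. Y and predictions have shape (1, m)."""
    return float(np.mean(predictions.astype(int) == Y) * 100)

def accuracy_percent_dot(Y, predictions):
    """Same quantity via dot products: matching ones plus matching zeros."""
    P = predictions.astype(int)
    hits = np.dot(Y, P.T) + np.dot(1 - Y, (1 - P).T)   # shape (1, 1)
    return float(hits[0, 0]) / Y.shape[1] * 100

Y_demo = np.array([[0, 0, 1, 1, 1]])
pred_demo = np.array([[False, True, True, True, False]])

print(accuracy_percent(Y_demo, pred_demo))
print(accuracy_percent_dot(Y_demo, pred_demo))
```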
