0% found this document useful (0 votes)
7 views4 pages

Overview of Artificial Neural Networks

Uploaded by

Aryan
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views4 pages

Overview of Artificial Neural Networks

Uploaded by

Aryan
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

National Conference on Latest Trends in Computer Science and Engineering (NCLTCSE-24)

Organized by CSE Department, Vaish College of Engineering, Rohtak

Artificial Neural Network: Overview and its


learning
Neha Taneja1, Aryan2
1
Assistant Professor, Department of CSE, Vaish College of Engineering, Rohtak, Haryana, India
Email: Nehataneja89@[Link]
2
[Link] Student, Department of CSE, Vaish College of Engineering, Rohtak, Haryana, India
Email: aryansaini2403@[Link]

Abstract provide electrochemical impulses to the dendrite,


Artificial intelligence in the modern day is a which then reaches the cell body. [1]The nucleus and
commodity used in various experimental fields. other chemical components needed for the cell's
Artificial intelligence is used in neural network and sustenance are found in the soma, or cell body. The
machine learning, which enables computers to learn signal from one neuron to another is carried by an
from historical data. ANN i.e. Artificial Neural axon. The connection between the dendrites of two
Network, is a computational model which is inspired neurons is known as a synapse. This synaptic gap
by the structure and functioning of biological neuron. between two neurons plays an important role.
ANNs are typically used in machine learning and
artificial intelligence applications to learn complex
patterns and relationships in data. They excel in tasks
such as classification, clustering, pattern recognition,
regression and optimization. This paper sheds light on
ANN and two learning strategies: Supervised Learning
and Unsupervised Learning.

Keywords: ANN, Neuron, Weights, Learning,


Activation Function

Figure 1: Diagram of Biological Neuron


1. Introduction
Via its dendrites, the neuron receives impulses
An artificial neural network (ANN) is a from other neurons. When the signal strength
nonlinear data modeling system which exceeds a threshold, this neuron starts to send
establishes models or patterns in complex out its own signal. Synapses then allow the
interactions between inputs and outputs. It signal to go along the axon to the next cell. Other
debuted at the beginning of the 1940s. They have neurons are then triggered by signals sent to
just lately started to play a big role in artificial them by synapses, and the cycle is repeated.[2]
intelligence In this way information is transmitted from one
neuron to another and decisions are made and
this is what is call as human intelligence.
1.1 Biological Neuron
1.2 Artificial Neuron
ANN is basically imitation of human brain.
Neurons in our brain are responsible for human
An Artificial Neural Network consists of
intelligence and so the same idea is copied to
network of artificial neurons connected to each
generate artificial neuron to achieve Artificial
other. An artificial neuron tries to mimic the
Intelligence. Artificial neural networks (ANNs)
biological neuron's structure and functions. A
mimic real neural networks by connecting their
neuron has one output (a synapses via an axon)
artificial neurons in a manner akin to that of the
and one input (dendrites). There is a function
brain network.[1]
called Activation function that determines
The components of a biological neuron include
whether a neuron will fire or not.
the cell body, axon, and dendrite. Other neurons

1
National Conference on Latest Trends in Computer Science and Engineering (NCLTCSE-24)
Organized by CSE Department, Vaish College of Engineering, Rohtak

Figure 2: Diagram of Artificial Neural


Network Figure 3: ANN architecture

Here, x1,x2,.….,xn are the set of inputs to the The weight value determines the input
neuron. b, here is bias, a constant which is strength. A weight value may be zero,
usually initialized to 1. w0,w1,…..,wn are weights positive, or negative. A signal that has a
associated with the inputs. Then, come the negative weight is weaker or more
Summation part, ∑ which is equal to xiwi. And, suppressed. A weight of zero indicates that
then is the role of Activation Function. the two neurons are not connected. The
Various activation functions are available to weights are modified in order to acquire the
achieve the desired output. Sigmoid Function is desired result. To get the desired output,
one of the most widely used activation functions, there are algorithms for changing the ANN's
which squeezes the value received from weights. This weight-adjusting procedure is
summation in range 0 to 1. Then, a threshold referred to as training or learning.
value can easily determine whether a neuron will After the network is trained, comes the
fire or not. testing part.
σ(x) = 1/(1+exp(-x))
This paper is structured as follows: 3. Learning in Neural Network
Section 1 presents the structure of biological
neuron, artificial neuron, activation function. A system that can automatically acquire and
Section 2 reviews the ANN architecture. integrate knowledge is referred to as
Learning in Neural Network is presented in machine learning. Learning is basically a
Section 3. Section 4 reviews the difference process of analyzing input-output pair and
between Supervised and Unsupervised Learning. predicts the output for a new input, or,
Conclusions are discussed in Section 5. improving knowledge by observing
environment.
2. Artificial Neural Network Two major categories can be used to
categorize ANN learning methodologies.
Architecture One is Supervised Learning and other is
Unsupervised Learning.
ANN architecture is composed of:
a) Input Layer : which is used to receive
Learning
input
b) Hidden Layer (s) : this layer is between
input and output layer and it can be
more than one depending on the
complexity of problem Supervised Unsupervised
c) Output Layer : which gives output Learning Learning
which usually range between 0 and 1

2
National Conference on Latest Trends in Computer Science and Engineering (NCLTCSE-24)
Organized by CSE Department, Vaish College of Engineering, Rohtak

In case of Supervised Learning computational


3.1 Supervised Learning: complexity of the problem is less while in
In Supervised Learning, with a given set of Unsupervised Learning it is more complex.
input data and, under the presumption that In Supervised Learning, the categories are
there is a relationship between the input and known, while in Unsupervised Learning, the
the result, we already know what our correct categories are unknown.
output should look like. In Supervised Learning, the desired output is
There are two categories of supervised learning given as input- output pair is used to train the
tasks: "classification" problems and system, while in case of Unsupervised Learning,
"regression" assignments. the desired output is not given.
In case of regression, the prediction is about Supervised Learning is known as Classification,
outcomes within a continuous form (value while Unsupervised Learning is also known as
based), which implies to translate the input clustering.
values to a continuous fashion. Rather, the goal
in case of classification challenge is to predict 5. Conclusion
outcomes in a discrete output. Stated
differently, our goal is to classify the input ANN, Artificial Neural Networks commonly
variables. known as neural networks, serve as the
foundation for machine learning methodologies.
3.2 Unsupervised Learning: Their form and function are inspired by the
By grouping the data according to the structure of neuron in the human brain, and they
correlations between the variables, this try to mimic the way that biological neurons
structure may be obtained. communicate with each other. ANN, Artificial
In case of Unsupervised Learning, there is no neural networks, or are highly parallel systems
supervisor to correct results of prediction composed of numerous interconnected core
because there is no feedback based system on processors. This piece of writing talks about
the predicted results. There is no input-output artificial neural networks and the basic learning
pair given to system to train itself. Only input types of them. The fundamental neuron, the
is given. It goes beyond just categorizing. artificial computer model, and the logic behind
building artificial neural networks (ANNs) are
4. Comparison between Supervised all covered in this review.
Learning and Unsupervised Learning
One major difference between Supervised
References
Learning and Unsupervised Learning algorithm [1] [Link] [Link] [Link]
is on the basis of the training data set used to ,[Link], “AN INTRODUCTION TO
train the system. Supervised Learning algorithm ARTIFICIAL NEURAL NETWORK AND
uses labeled data set of input-output pair to train THEIR APPLICATION”, Volume 07, ISSN-
the system, while in case of Unsupervised 23495162, February 2020.
Learning the input is given and output is not [2] Ritu Sharma , Kavya Sharma , Apurva
known. Supervised learning methods make use Khanna, “Study of Supervised Learning and
of each training instance's class membership Unsupervised Learning”, Volume 08, ISSN-
information. Using this data, supervised learning 2321-9653, June 2020.
algorithms are able to identify pattern [3] Yegnanarayana, B., Artificial Neural
misclassifications and provide self-repair Network, PHI, New Delhi, 1999.
feedback. Algorithms for unsupervised learning [4] E. H. Chan, “Applications of neural network
employ unlabelled instances. They either process computing in intelligent arm processing,” Proc.
them heuristically or mindlessly. Compared to IEEE PICA Conf., pp. 246-251, Seattle, WA,
supervised learning algorithms, unsupervised USA, 1989.
learning techniques frequently have lower [5] Debar, H. & Dorizzi, B. (1992). An
computing complexity and accuracy. Application of a Recurrent Network to an
In Supervised Learning, the data that we feed to Intrusion Detection System. In Proceedings of
system is labelled, while in case of Unsupervised the International Joint Conference on Neural
Learning the input data is unknown. Networks.

3
National Conference on Latest Trends in Computer Science and Engineering (NCLTCSE-24)
Organized by CSE Department, Vaish College of Engineering, Rohtak

[6] Kishan Mehrotra, Chilkuri K. Mohan, Sanjay


Ranka, “Elements of Artificial Neural
Networks”, Penram International Publishing
(India), (1997), Volume 1, Page no.- 1-41.
[7] Dr. R. R. Srikanth. “Application of ANN in
Condition Monitoring: A Case Study”,
(Conference proceeding “Condition Monitoring
of Mechanical System”) (2009), Gitam
University, Vishakhapattanam, Page no. 31-44.
[8] [Link], [Link], Amit [Link],
“Introduction to Artificial Neural Network”,
Volume 02, ISSN- 2277-3754, July 2012.

Common questions

Powered by AI

Hidden layers in an artificial neural network serve as intermediate layers situated between the input and output layers. They are crucial for capturing and transforming input signals into complex patterns and representations. Hidden layers enable the network to perform hierarchical feature learning, which involves progressively extracting higher-level features from raw inputs. The presence of one or more hidden layers allows the ANN to model non-linear functions efficiently, making it possible to solve complex problems such as image recognition or natural language processing. Having an adequate number of hidden layers enhances the network's capacity to generalize from input data .

Artificial neural networks have a wide range of potential applications as highlighted in the sources. They are particularly suited for tasks such as classification, clustering, pattern recognition, regression, and optimization. These networks are used in diverse fields such as intelligent signal processing, where they aid in methods like intrusion detection systems and condition monitoring of mechanical systems. Their versatility in modeling complex, non-linear relationships makes them invaluable in machine learning and artificial intelligence applications .

Supervised and unsupervised learning algorithms in ANNs differ in computational complexity due to distinct data handling and problem-solving approaches. Supervised learning algorithms use labeled datasets, making the learning process less computationally intensive, as the model benefits from defined guidance to match input-output pairs. Conversely, unsupervised learning doesn't have labeled examples to guide the learning, requiring the system to independently discover patterns or groupings in the input data. This lack of labeled data increases complexity since various potential clustering approaches and evaluations must be considered, often entailing higher computational demands and iterative computation to derive meaningful insights .

Weights and activation functions are critical to how artificial neural networks operate. Weights represent the strength of connections between neurons, affecting the input's influence on the neuron's output. They can be adjusted during the learning process to minimize error in predictions. The activation function determines whether a neuron should 'fire' by computing a weighted sum of input signals and applying a threshold function. Common activation functions like the sigmoid function squash input values to a specific range, typically between 0 and 1, thereby allowing the network to introduce non-linearity into the model .

An artificial neuron functions as the fundamental unit of an ANN and plays a specific role in each layer. In the input layer, neurons receive raw data inputs directly from the external environment. The subsequent hidden layers transform the inputs into an intermediate form, performing computations to extract features or patterns. The output layer is responsible for producing the final result of the network, whether it's a classification, regression, or another output type. Each layer's interactions are adjusted during training by altering the weights, enabling the network to learn complex functions by capturing non-linear relationships between input and output .

Artificial neural networks (ANNs) mimic the structure and function of biological neurons by creating networks of artificial neurons that function similarly to the brain's network. In biological neurons, the cell body, axon, and dendrite play crucial roles where electrochemical impulses are received and transmitted by neurons. Similarly, artificial neurons have inputs (akin to dendrites), a computation function (like summation and activation), and outputs (similar to axons). The activation function in artificial neurons determines whether a neuron will fire, similar to synapses in biological neurons that transmit signals based on a threshold .

Weight modification and training are critical processes that significantly affect an artificial neural network's performance. The weights are adjusted during training to minimize the difference between the network's predictions and actual outcomes (error). This adaptive modification process involves algorithms such as gradient descent, which iteratively updates weights to reach an optimal configuration. Effective training enables the network to generalize well from training data to unseen data, enhancing its accuracy and reliability in predictive tasks. Inadequate training or incorrect weight adjustment can lead to issues like overfitting or convergence to local minima, negatively impacting performance .

The threshold value in an ANN's activation function is crucial in determining a neuron's behavior because it sets the criteria for whether a neuron 'fires' or not. If the input weighted sum exceeds this threshold, the neuron activates, thus passing a signal to the succeeding layer of neurons. For instance, using a sigmoid activation function, the output is only significant when inputs cross a specified threshold value, effectively allowing the model to make decisions by distinguishing between important and irrelevant signals. This decision-making process is vital for tasks such as feature selection and noise reduction in data .

Supervised learning tasks within artificial neural networks can be categorized mainly into 'classification' and 'regression' tasks. Classification involves mapping input variables to a discrete set of categories, often using networks to discern distinct patterns and assign input data to one of these predefined categories. Regression, on the other hand, deals with predicting continuous values, translating input information into a continuous variable, often for function estimation or trend analysis. Both tasks require paired input-output data for effective training under supervised learning paradigms .

The fundamental difference between supervised and unsupervised learning lies in data labeling and structure. Supervised learning utilizes labeled datasets with known input-output pairs to train the system, allowing for classification and regression tasks. This type of learning involves feedback to correct predictions and is often less computationally complex. In contrast, unsupervised learning deals with unlabeled data, aiming to identify patterns or groupings (clustering) in the data without any feedback mechanism. Consequently, unsupervised learning is more complex and does not rely on predefined outcomes, making it less accurate in pattern recognition compared to supervised learning .

You might also like