ARTIFICIAL NEURAL NETWORK
B.YOGAPRABHU
CSE ‘B’ III
REG.NO:10108104088
AALIM MUHAMMED SALEGH
COLLEGE OF ENGINEERING
Artificial neural network
An artificial neural network (ANN), usually called a neural network (NN), is a mathematical or computational model inspired by the structure and/or functional aspects of biological
neural networks. A neural network consists of an interconnected group of artificial neurons, and it
processes information using a connectionist approach to computation. In most cases an ANN is an
adaptive system that changes its structure based on external or internal information that flows through the
network during the learning phase. Modern neural networks are non-linear statistical data modeling tools.
They are usually used to model complex relationships between inputs and outputs or to find patterns in
data.
An artificial neural network is an interconnected group of nodes, akin to the vast network of
neurons in the human brain.
Background
The original inspiration for the term Artificial Neural Network came from examination of central
nervous systems and their neurons, axons, dendrites and synapses which constitute the processing
elements of biological neural networks investigated by neuroscience. In an artificial neural network,
simple artificial nodes, variously called "neurons", "neurodes", "processing elements" (PEs) or "units",
are connected together to form a network of nodes mimicking the biological neural networks — hence
the term "artificial neural network".
Because neuroscience is still full of unanswered questions and since there are many levels of
abstraction and therefore, many ways to take inspiration from the brain, there is no single formal
definition of what an artificial neural network is. Most would agree that it involves a network of simple
processing elements which can exhibit complex global behavior determined by the connections between
the processing elements and element parameters. While an artificial neural network does not have to be
adaptive per se, its practical use comes with algorithms designed to alter the strength (weights) of the
connections in the network to produce a desired signal flow.
These networks are also similar to the biological neural networks in the sense that functions are
performed collectively and in parallel by the units, rather than there being a clear delineation of subtasks
to which various units are assigned (see also connectionism). Currently, the term Artificial Neural
Network (ANN) tends to refer mostly to neural network models employed in statistics, cognitive
psychology and artificial intelligence. Neural network models designed with emulation of the central
nervous system (CNS) in mind are a subject of theoretical neuroscience and computational neuroscience.
In modern software implementations of artificial neural networks, the approach inspired by biology has
been largely abandoned for a more practical approach based on statistics and signal processing. In some
of these systems, neural networks or parts of neural networks (such as artificial neurons) are used as
components in larger systems that combine both adaptive and non-adaptive elements. While the more
general approach of such adaptive systems is more suitable for real-world problem solving, it has far less
to do with the traditional artificial intelligence connectionist models. What they do have in common,
however, is the principle of non-linear, distributed, parallel and local processing and adaptation.
Models
Neural network models in artificial intelligence are usually referred to as artificial neural
networks (ANNs); these are essentially simple mathematical models defining a function f : X → Y or a distribution over X or both X and Y, but sometimes models are also intimately associated with a particular learning algorithm or learning rule. A common use of the phrase ANN model really means the definition
of a class of such functions (where members of the class are obtained by varying parameters, connection
weights, or specifics of the architecture such as the number of neurons or their connectivity).
Network function
The word network in the term 'artificial neural network' refers to the inter–connections between
the neurons in the different layers of each system. The most basic system has three layers. The first layer
has input neurons which send data via synapses to the second layer of neurons and then via more
synapses to the third layer of output neurons. More complex systems have more layers of neurons, some with additional layers of input neurons and output neurons. The synapses store parameters called "weights" which are used to manipulate the data in the calculations.
The layers are connected through the mathematics of the network's algorithms. The network function f(x) is defined as a composition of other functions g_i(x), which can further be defined as compositions of other functions. This can be conveniently represented as a network structure, with arrows depicting the dependencies between variables. A widely used type of composition is the nonlinear weighted sum, f(x) = K(Σ_i w_i g_i(x)), where K (commonly referred to as the activation function[1]) is some predefined function, such as the hyperbolic tangent. It will be convenient in what follows to refer to a collection of functions g_i as simply a vector g = (g_1, g_2, ..., g_n).
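To make the weighted-sum composition concrete, here is a minimal Python sketch (added for illustration; the function and variable names are this write-up's own), computing a single unit's output with tanh as the activation K:

    import math

    def neuron_output(weights, inputs, activation=math.tanh):
        # f(x) = K(sum_i w_i * g_i(x)): nonlinear weighted sum passed through K.
        weighted_sum = sum(w * g for w, g in zip(weights, inputs))
        return activation(weighted_sum)

    # A unit with three inputs: tanh(0.5*1.0 - 0.2*2.0 + 0.1*3.0) = tanh(0.4)
    print(neuron_output([0.5, -0.2, 0.1], [1.0, 2.0, 3.0]))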
Learning
What has attracted the most interest in neural networks is the possibility of learning. Given a specific task to solve, and a class of functions F, learning means using a set of observations to find f* ∈ F which solves the task in some optimal sense.
This entails defining a cost function C : F → ℝ such that, for the optimal solution f*, C(f*) ≤ C(f) for all f ∈ F (i.e., no solution has a cost less than the cost of the optimal solution).
The cost function is an important concept in learning, as it is a measure of how far away a
particular solution is from an optimal solution to the problem to be solved. Learning algorithms search
through the solution space to find a function that has the smallest possible cost.
For applications where the solution is dependent on some data, the cost must necessarily be a
function of the observations, otherwise we would not be modelling anything related to the data. It is
frequently defined as a statistic to which only approximations can be made. As a simple example,
consider the problem of finding the model f which minimizes C = E[(f(x) − y)²], for data pairs (x, y) drawn from some distribution D. In practical situations we would only have N samples from D and thus, for the above example, we would only minimize Ĉ = (1/N) Σ_i (f(x_i) − y_i)². Thus, the cost is minimized over a sample of the data rather than the entire data set.
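For illustration, the sampled cost above is the familiar mean squared error; a minimal Python sketch, assuming the model f is any callable:

    def sample_cost(f, pairs):
        # Empirical cost: (1/N) * sum over the N observed pairs of (f(x_i) - y_i)^2.
        return sum((f(x) - y) ** 2 for x, y in pairs) / len(pairs)

    # A linear model f(x) = 2x evaluated on three noisy observations.
    print(sample_cost(lambda x: 2 * x, [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2)]))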
When N → ∞, some form of online machine learning must be used, where the cost is partially minimized as each new example is seen. While online machine learning is often used when D is fixed, it is most useful in the case where the distribution changes slowly over time. In neural network methods, some form of online machine learning is frequently used for finite datasets.
While it is possible to define some arbitrary, ad hoc cost function, frequently a particular cost will
be used, either because it has desirable properties (such as convexity) or because it arises naturally from a
particular formulation of the problem (e.g., in a probabilistic formulation the posterior probability of the
model can be used as an inverse cost). Ultimately, the cost function will depend on the desired task. An
overview of the three main categories of learning tasks is provided below.
Learning paradigms
There are three major learning paradigms, each corresponding to a particular abstract learning
task. These are supervised learning, unsupervised learning and reinforcement learning.
Supervised learning
In supervised learning, we are given a set of example pairs (x, y), x ∈ X, y ∈ Y, and the aim is to find a function f : X → Y in the allowed class of functions that matches the examples. In other words, we wish to
infer the mapping implied by the data; the cost function is related to the mismatch between our mapping
and the data and it implicitly contains prior knowledge about the problem domain.
A commonly used cost is the mean-squared error, which tries to minimize the average squared
error between the network's output, f(x), and the target value y over all the example pairs. When one tries
to minimize this cost using gradient descent for the class of neural networks called multilayer
perceptrons, one obtains the common and well-known backpropagation algorithm for training neural
networks.
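As a hedged sketch of this idea (the one-hidden-layer architecture, tanh activation and scalar input/output are choices made here for brevity, not a prescribed implementation), a multilayer perceptron trained by gradient descent on the mean-squared error:

    import math, random

    def train_mlp(pairs, hidden=4, lr=0.1, epochs=1000):
        # One-hidden-layer perceptron for scalar x -> scalar y.
        # Hidden weights/biases (w1, b1) and output weights/bias (w2, b2).
        w1 = [random.uniform(-1, 1) for _ in range(hidden)]
        b1 = [0.0] * hidden
        w2 = [random.uniform(-1, 1) for _ in range(hidden)]
        b2 = 0.0
        for _ in range(epochs):
            for x, y in pairs:
                # Forward pass: tanh hidden layer, linear output.
                h = [math.tanh(w1[j] * x + b1[j]) for j in range(hidden)]
                out = sum(w2[j] * h[j] for j in range(hidden)) + b2
                # Backward pass: derivative of (out - y)^2 w.r.t. each parameter.
                d_out = 2 * (out - y)
                for j in range(hidden):
                    d_hidden = d_out * w2[j] * (1 - h[j] ** 2)  # tanh' = 1 - tanh^2
                    w2[j] -= lr * d_out * h[j]
                    w1[j] -= lr * d_hidden * x
                    b1[j] -= lr * d_hidden
                b2 -= lr * d_out
        return lambda x: sum(w2[j] * math.tanh(w1[j] * x + b1[j])
                             for j in range(hidden)) + b2

    # Fit y = x^2 on three points, then query the trained network.
    f = train_mlp([(0.0, 0.0), (0.5, 0.25), (1.0, 1.0)])
    print(f(0.5))

Each update moves every weight a small step against its contribution to the squared error, which is the backpropagation rule for this architecture.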
Tasks that fall within the paradigm of supervised learning are pattern recognition (also known as
classification) and regression (also known as function approximation). The supervised learning paradigm
is also applicable to sequential data (e.g., for speech and gesture recognition). This can be thought of as
learning with a "teacher," in the form of a function that provides continuous feedback on the quality of
solutions obtained thus far.
Unsupervised learning
In unsupervised learning, some data x is given together with a cost function to be minimized, which can be any function of the data x and the network's output f.
The cost function is dependent on the task (what we are trying to model) and our a priori
assumptions (the implicit properties of our model, its parameters and the observed variables).
As a trivial example, consider the model f(x) = a, where a is a constant, and the cost C = E[(x − f(x))²]. Minimizing this cost will give us a value of a that is equal to the mean of the data. The
cost function can be much more complicated. Its form depends on the application: for example, in
compression it could be related to the mutual information between x and y, whereas in statistical
modeling, it could be related to the posterior probability of the model given the data. (Note that in both of
those examples those quantities would be maximized rather than minimized).
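A minimal numerical sketch of the trivial example above (illustrative only): gradient descent on C drives the constant a toward the sample mean:

    data = [1.0, 2.0, 3.0, 6.0]
    a, lr = 0.0, 0.1
    for _ in range(200):
        # dC/da for C = (1/N) * sum (x - a)^2 is -(2/N) * sum (x - a).
        grad = -2.0 * sum(x - a for x in data) / len(data)
        a -= lr * grad
    print(a)  # converges to mean(data) = 3.0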
Tasks that fall within the paradigm of unsupervised learning are in general estimation problems;
the applications include clustering, the estimation of statistical distributions, compression and filtering.
Reinforcement learning
In reinforcement learning, data are usually not given, but generated by an agent's interactions
with the environment. At each point in time t, the agent performs an action y_t and the environment generates an observation x_t and an instantaneous cost c_t, according to some (usually unknown) dynamics.
The aim is to discover a policy for selecting actions that minimizes some measure of a long-term cost;
i.e., the expected cumulative cost. The environment's dynamics and the long-term cost for each policy are
usually unknown, but can be estimated.
More formally, the environment is modeled as a Markov decision process (MDP) with states s_1, ..., s_n ∈ S and actions a_1, ..., a_m ∈ A and with the following probability distributions: the instantaneous cost distribution P(c_t | s_t), the observation distribution P(x_t | s_t) and the transition P(s_{t+1} | s_t, a_t), while a policy is defined as the conditional distribution over actions given the observations. Taken together, the two define a Markov
chain (MC). The aim is to discover the policy that minimizes the cost; i.e., the MC for which the cost is
minimal.
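As one concrete instance of this paradigm (the original text does not name a specific algorithm; tabular Q-learning is used here purely for illustration, with a hypothetical two-state MDP), the cost-minimizing policy can be estimated from interaction alone; in larger problems an ANN would replace the table as the function approximator:

    import random

    # Hypothetical 2-state, 2-action MDP (costs and transitions invented
    # here purely for illustration).
    cost = {(0, 0): 1.0, (0, 1): 0.0, (1, 0): 0.0, (1, 1): 1.0}
    nxt = {(0, 0): 0, (0, 1): 1, (1, 0): 0, (1, 1): 1}

    Q = {(s, a): 0.0 for s in (0, 1) for a in (0, 1)}  # expected long-term cost
    lr, gamma, s = 0.1, 0.9, 0
    for _ in range(5000):
        a = random.choice((0, 1))  # explore actions uniformly
        c, s2 = cost[(s, a)], nxt[(s, a)]
        # Q-learning update: move toward c + gamma * (minimal) next-state cost.
        Q[(s, a)] += lr * (c + gamma * min(Q[(s2, 0)], Q[(s2, 1)]) - Q[(s, a)])
        s = s2
    # Greedy policy: in each state, choose the action with the lowest cost.
    print({st: min((0, 1), key=lambda a: Q[(st, a)]) for st in (0, 1)})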
ANNs are frequently used in reinforcement learning as part of the overall algorithm.
Tasks that fall within the paradigm of reinforcement learning are control problems, games and other
sequential decision making tasks.
Learning algorithms
Training a neural network model essentially means selecting one model from the set of allowed
models (or, in a Bayesian framework, determining a distribution over the set of allowed models) that
minimizes the cost criterion. There are numerous algorithms available for training neural network
models; most of them can be viewed as a straightforward application of optimization theory and
statistical estimation. Recent developments in this field use particle swarm optimization and other swarm
intelligence techniques.
Most of the algorithms used in training artificial neural networks employ some form of gradient
descent. This is done by simply taking the derivative of the cost function with respect to the network
parameters and then changing those parameters in a gradient-related direction.
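A minimal sketch of this procedure (illustrative only; the finite-difference gradient here stands in for the exact derivatives that backpropagation would compute):

    def gradient_descent(cost, params, lr=0.01, steps=1000, eps=1e-6):
        # Minimize cost(params) by stepping against a numerical gradient estimate.
        params = list(params)
        for _ in range(steps):
            for i in range(len(params)):
                # Central-difference estimate of the partial derivative.
                params[i] += eps
                up = cost(params)
                params[i] -= 2 * eps
                down = cost(params)
                params[i] += eps  # restore the parameter
                params[i] -= lr * (up - down) / (2 * eps)
        return params

    # Example: minimize (p0 - 3)^2 + (p1 + 1)^2; the minimum is at [3, -1].
    print(gradient_descent(lambda p: (p[0] - 3) ** 2 + (p[1] + 1) ** 2, [0.0, 0.0]))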
Temporal perceptual learning relies on finding temporal relationships in sensory signal streams.
In an environment, statistically salient temporal correlations can be found by monitoring the arrival times
of sensory signals. This is done by the perceptual network.
Employing artificial neural networks in practice requires attention to three broad considerations:
Choice of model: This will depend on the data representation and the application. Overly complex models tend to lead to problems with learning.
Learning algorithm: There are numerous trade-offs between learning algorithms. Almost any
algorithm will work well with the correct hyperparameters for training on a particular fixed data
set. However, selecting and tuning an algorithm for training on unseen data requires a significant
amount of experimentation.
Robustness: If the model, cost function and learning algorithm are selected appropriately, the resulting ANN can be extremely robust.
With the correct implementation, ANNs can be used naturally in online learning and large-data-set applications. Their simple implementation and the mostly local dependencies exhibited in their structure allow for fast, parallel implementations in hardware.
Applications
The utility of artificial neural network models lies in the fact that they can be used to infer a
function from observations. This is particularly useful in applications where the complexity of the data or
task makes the design of such a function by hand impractical.
Real-life applications
The tasks to which artificial neural networks are applied tend to fall within the following broad
categories:
Function approximation, or regression analysis, including time series prediction, fitness
approximation and modeling.
Classification, including pattern and sequence recognition, novelty detection and sequential
decision making.
Data processing, including filtering, clustering, blind source separation and compression.
Robotics, including directing manipulators and computer numerical control.
Application areas include system identification and control (vehicle control, process control), quantum
chemistry,[2] game-playing and decision making (backgammon, chess, racing), pattern recognition (radar
systems, face identification, object recognition and more), sequence recognition (gesture, speech,
handwritten text recognition), medical diagnosis, financial applications (automated trading systems), data
mining (or knowledge discovery in databases, "KDD"), visualization and e-mail spam filtering.
Neural networks and neuroscience
Theoretical and computational neuroscience is the field concerned with the theoretical analysis and computational modeling of biological neural systems. Since neural systems are intimately related to cognitive processes and behavior, the field is closely related to cognitive and behavioral modeling.
The aim of the field is to create models of biological neural systems in order to understand how
biological systems work. To gain this understanding, neuroscientists strive to make a link between
observed biological processes (data), biologically plausible mechanisms for neural processing and
learning (biological neural network models) and theory (statistical learning theory and information
theory).
Types of models
Many models are used in the field, defined at different levels of abstraction and modeling different aspects of neural systems. They range from models of the short-term behavior of individual neurons, through models of how the dynamics of neural circuitry arise from interactions between individual neurons, to models of how behavior can arise from abstract neural modules that represent complete subsystems.
These include models of the long-term and short-term plasticity of neural systems and their relation to learning and memory, from the individual neuron to the system level.
Current research
While initial research had been concerned mostly with the electrical characteristics of neurons, a
particularly important part of the investigation in recent years has been the exploration of the role of
neuromodulators such as dopamine, acetylcholine, and serotonin on behavior and learning.
Biophysical models, such as BCM theory, have been important in understanding mechanisms for
synaptic plasticity, and have had applications in both computer science and neuroscience. Research is
ongoing in understanding the computational algorithms used in the brain, with some recent biological
evidence for radial basis networks and neural backpropagation as mechanisms for processing data.
Computational devices have been created in CMOS for both biophysical simulation and neuromorphic
computing. More recent efforts show promise for creating nanodevices for very large scale principal
components analyses and convolution. If successful, these efforts could usher in a new era of neural computing that is a step beyond digital computing, because it depends on learning rather than programming and because it is fundamentally analog rather than digital, even though the first instantiations may in fact be with CMOS digital devices.