0% found this document useful (0 votes)

37 views41 pages

Machine Learning: Neural Networks Slides Mostly Adapted From Tom Mithcell, Han and Kamber

Neural networks are computational models inspired by the human brain that are made up of simple processing units called neurons. Knowledge is stored in the synaptic connection strengths between neurons and is acquired through a learning process. Neural networks have been widely used for pattern recognition, function approximation, and associative memory. The backpropagation algorithm allows neural networks to be trained on labeled data samples to learn patterns and make predictions.

Uploaded by

MOHD ASIF ALI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views41 pages

Machine Learning: Neural Networks Slides Mostly Adapted From Tom Mithcell, Han and Kamber

Uploaded by

MOHD ASIF ALI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 41

Machine Learning

Neural Networks

Slides mostly adapted from Tom

Mithcell, Han and Kamber
Artificial Neural Networks
● Computational models inspired by the human
brain:
● Algorithms that try to mimic the brain.

● Massively parallel, distributed system, made up of

simple processing units (neurons)

● Synaptic connection strengths among neurons are

used to store the acquired knowledge.

● Knowledge is acquired by the network from its

environment through a learning process
History
● late-1800's - Neural Networks appear as an
analogy to biological systems
● 1960's and 70's – Simple neural networks appear
● Fall out of favor because the perceptron is not
effective by itself, and there were no good algorithms
for multilayer nets
● 1986 – Backpropagation algorithm appears
● Neural Networks have a resurgence in popularity
● More computationally expensive
Applications of ANNs

● ANNs have been widely used in various domains

for:
● Pattern recognition
● Function approximation
● Associative memory
Properties
● Inputs are flexible
● any real values
● Highly correlated or independent
● Target function may be discrete-valued, real-valued, or
vectors of discrete or real values
● Outputs are real numbers between 0 and 1
● Resistant to errors in the training data
● Long training time
● Fast evaluation
● The function produced can be difficult for humans to
interpret
When to consider neural networks
● Input is high-dimensional discrete or raw-valued
● Output is discrete or real-valued
● Output is a vector of values
● Possibly noisy data
● Form of target function is unknown
● Human readability of the result is not important
Examples:
● Speech phoneme recognition
● Image classification
● Financial prediction
A Neuron (= a perceptron)
- t
x0 w0

∑
x1 w1
f
output y
xn wn

Input weight weighted Activation

vector x vector w sum function

● The n-dimensional input vector x is mapped into variable y by

means of the scalar product and a nonlinear function mapping

November 25, 2021 Data Mining: Concepts and Techniques 7

Perceptron
● Basic unit in a neural network
● Linear separator
● Parts
● N inputs, x1 ... xn
● Weights for each input, w1 ... wn
● A bias input x0 (constant) and associated weight w0
● Weighted sum of inputs, y = w0x0 + w1x1 + ... + wnxn
● A threshold function or activation function,
● i.e 1 if y > t, -1 if y <= t
Artificial Neural Networks (ANN)
● Model is an assembly of
inter-connected nodes
and weighted links

● Output node sums up

each of its input value
according to the weights
of its links Perceptron Model

or
● Compare output node
against some threshold t
Types of connectivity

output units
● Feedforward networks
● These compute a series of
transformations hidden units
● Typically, the first layer is the input
and the last layer is the output.
input units
● Recurrent networks
● These have directed cycles in their
connection graph. They can have
complicated dynamics.
● More biologically realistic.
Different Network Topologies
● Single layer feed-forward networks
● Input layer projecting into the output layer

Single layer
network

Input Output
layer layer
Different Network Topologies
● Multi-layer feed-forward networks
● One or more hidden layers. Input projects only from
previous layers onto a layer.

2-layer or
1-hidden layer
fully connected
network
Input Hidden Output
layer layer layer
Different Network Topologies
● Multi-layer feed-forward networks

Input Hidden Output

layer layers layer
Different Network Topologies
● Recurrent networks
● A network with feedback, where some of its inputs
are connected to some of its outputs (discrete time).

Recurrent
network

Input Output
layer layer
Algorithm for learning ANN
● Initialize the weights (w0, w1, …, wk)

● Adjust the weights in such a way that the output

of ANN is consistent with class labels of training
examples
● Error function:

● Find the weights wi’s that minimize the above error

function
● e.g., gradient descent, backpropagation algorithm
Optimizing concave/convex function

● Maximum of a concave function = minimum of a

convex function
Gradient ascent (concave) / Gradient descent (convex)

Gradient ascent rule

Decision surface of a perceptron

● Decision surface is a hyperplane

● Can capture linearly separable classes
● Non-linearly separable
● Use a network of them
Multi-layer Networks
● Linear units inappropriate
● No more expressive than a single layer
● Introduce non-linearity
● Threshold not differentiable
● Use sigmoid function
Backpropagation
● Iteratively process a set of training tuples & compare the network's
prediction with the actual known target value
● For each training tuple, the weights are modified to minimize the mean
squared error between the network's prediction and the actual target
value
● Modifications are made in the “backwards” direction: from the output
layer, through each hidden layer down to the first hidden layer, hence
“backpropagation”
● Steps
● Initialize weights (to small random #s) and biases in the network

● Propagate the inputs forward (by applying activation function)

● Backpropagate the error (by updating weights and biases)

● Terminating condition (when error is very small, etc.)

November 23, 2021 Data Mining: Concepts and Techniques 31

How A Multi-Layer Neural Network Works?
● The inputs to the network correspond to the attributes measured for
each training tuple
● Inputs are fed simultaneously into the units making up the input layer
● They are then weighted and fed simultaneously to a hidden layer
● The number of hidden layers is arbitrary, although usually only one
● The weighted outputs of the last hidden layer are input to units making
up the output layer, which emits the network's prediction
● The network is feed-forward in that none of the weights cycles back to
an input unit or to an output unit of a previous layer
● From a statistical point of view, networks perform nonlinear regression:
Given enough hidden units and enough training samples, they can
closely approximate any function

November 23, 2021 Data Mining: Concepts and Techniques 33

Defining a Network Topology
● First decide the network topology: # of units in the input
layer, # of hidden layers (if > 1), # of units in each hidden
layer, and # of units in the output layer
● Normalizing the input values for each attribute measured in
the training tuples to [0.0—1.0]
● One input unit per domain value, each initialized to 0
● Output, if for classification and more than two classes, one
output unit per class is used
● Once a network has been trained and its accuracy is
unacceptable, repeat the training process with a different
network topology or a different set of initial weights

November 23, 2021 Data Mining: Concepts and Techniques 34

Backpropagation and Interpretability
● Efficiency of backpropagation: Each epoch (one interation through the
training set) takes O(|D| * w), with |D| tuples and w weights, but # of
epochs can be exponential to n, the number of inputs, in the worst case
● Rule extraction from networks: network pruning
● Simplify the network structure by removing weighted links that have the
least effect on the trained network
● Then perform link, unit, or activation value clustering

● The set of input and activation values are studied to derive rules
describing the relationship between the input and hidden unit layers
● Sensitivity analysis: assess the impact that a given input variable has on a
network output. The knowledge gained from this analysis can be
represented in rules

November 23, 2021 Data Mining: Concepts and Techniques 35

Neural Network as a Classifier
● Weakness
● Long training time
● Require a number of parameters typically best determined empirically,
e.g., the network topology or “structure.”
● Poor interpretability: Difficult to interpret the symbolic meaning behind
the learned weights and of “hidden units” in the network
● Strength
● High tolerance to noisy data
● Ability to classify untrained patterns
● Well-suited for continuous-valued inputs and outputs
● Successful on a wide array of real-world data
● Algorithms are inherently parallel
● Techniques have recently been developed for the extraction of rules from
trained neural networks

November 23, 2021 Data Mining: Concepts and Techniques 36

Artificial Neural Networks (ANN)
Learning Perceptrons
A Multi-Layer Feed-Forward Neural Network
Output vector

Output layer

Hidden layer

wij

Input layer

Input vector: X
November 23, 2021 Data Mining: Concepts and Techniques 40
General Structure of ANN

Training ANN means learning

the weights of the neurons

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
4/5 (6436)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (642)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brene Brown
4/5 (1174)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
4.5/5 (997)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1854)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
4/5 (650)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4102)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (628)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
4/5 (1018)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
4.5/5 (361)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
4.5/5 (581)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (298)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1138)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
4.5/5 (5144)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Toibin
3.5/5 (2133)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
3.5/5 (463)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
4.5/5 (279)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
4/5 (4360)
Yes Please
From Everand
Yes Please
Amy Poehler
4/5 (2010)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
4/5 (1090)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
4.5/5 (2033)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
3.5/5 (2788)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2884)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
4.5/5 (141)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
4/5 (4088)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
4/5 (78)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (835)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (919)
Electrical Manual of GSK980TDb
100% (1)
Electrical Manual of GSK980TDb
18 pages
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
Progress Reporting Procedure Draft
100% (2)
Progress Reporting Procedure Draft
9 pages
Sales Script Template PDF
0% (1)
Sales Script Template PDF
2 pages
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (815)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
4/5 (278)
Computer Architecture Question Bank
No ratings yet
Computer Architecture Question Bank
7 pages
Hanna - Effects of The Wide Use of Technology in The Academic Performance of Senior High School Students in Aroroy National High School
No ratings yet
Hanna - Effects of The Wide Use of Technology in The Academic Performance of Senior High School Students in Aroroy National High School
49 pages
The Glamorous CookBook of Maharashtrian Recipes
No ratings yet
The Glamorous CookBook of Maharashtrian Recipes
93 pages
22.6 DLC153V - BGC-I01A - Markazia - 71
No ratings yet
22.6 DLC153V - BGC-I01A - Markazia - 71
1 page
Isdb T Soft Opm e 6 0
No ratings yet
Isdb T Soft Opm e 6 0
316 pages
Curriculum Vitae Stelica GHEORGHITA: Personal Profile
No ratings yet
Curriculum Vitae Stelica GHEORGHITA: Personal Profile
6 pages
D 3 Plot
No ratings yet
D 3 Plot
733 pages
Brochure OS8000
No ratings yet
Brochure OS8000
4 pages
IEEE 802.16 Network Architecture: Wireless Communication Networks
100% (3)
IEEE 802.16 Network Architecture: Wireless Communication Networks
10 pages
2024 Researchable Dissertation Topics in Digital Imaging
No ratings yet
2024 Researchable Dissertation Topics in Digital Imaging
14 pages
Autocad Pid 2011 Frequently Asked Questions
No ratings yet
Autocad Pid 2011 Frequently Asked Questions
3 pages
Time Series Models Manfred Deistler Wolfgang Scherrer download
No ratings yet
Time Series Models Manfred Deistler Wolfgang Scherrer download
80 pages
How To Update Thesis 1.8 To 2.0
100% (2)
How To Update Thesis 1.8 To 2.0
7 pages
ICT and The Future of Industry
No ratings yet
ICT and The Future of Industry
16 pages
Amici Introduction - To - Industrial - Security - Concepts
No ratings yet
Amici Introduction - To - Industrial - Security - Concepts
154 pages
Forrest Gump Essay
100% (2)
Forrest Gump Essay
6 pages
Competence Controls V1 ENG 01 2014 Web
No ratings yet
Competence Controls V1 ENG 01 2014 Web
24 pages
UserManual_ViewLine_52mm_Instruments_ML
No ratings yet
UserManual_ViewLine_52mm_Instruments_ML
43 pages
Rotograph EVO
No ratings yet
Rotograph EVO
11 pages
Circuit Terminology
No ratings yet
Circuit Terminology
12 pages
Working With Word Problems - Turn Words Into Mathematics: Look For Key Words and Relationships
No ratings yet
Working With Word Problems - Turn Words Into Mathematics: Look For Key Words and Relationships
2 pages
SAP Process - Cash Sales and Rush Order
No ratings yet
SAP Process - Cash Sales and Rush Order
2 pages
Convex Cardinality Optimization
No ratings yet
Convex Cardinality Optimization
26 pages
Lesson 1-2 History of Philippine Internet
No ratings yet
Lesson 1-2 History of Philippine Internet
4 pages
Min Chen, Yixue Hao, Kai Hwang, Fellow, IEEE, Lu Wang, and Lin Wang
No ratings yet
Min Chen, Yixue Hao, Kai Hwang, Fellow, IEEE, Lu Wang, and Lin Wang
7 pages
Inner Circle Trader2 PDF Free
No ratings yet
Inner Circle Trader2 PDF Free
4 pages
ACQ580-31 Drives: Hardware Manual
No ratings yet
ACQ580-31 Drives: Hardware Manual
242 pages

Machine Learning: Neural Networks Slides Mostly Adapted From Tom Mithcell, Han and Kamber

Uploaded by

Machine Learning: Neural Networks Slides Mostly Adapted From Tom Mithcell, Han and Kamber

Uploaded by

Machine Learning

Slides mostly adapted from Tom

● Massively parallel, distributed system, made up of

● Synaptic connection strengths among neurons are

● Knowledge is acquired by the network from its

● ANNs have been widely used in various domains

Input weight weighted Activation

● The n-dimensional input vector x is mapped into variable y by

November 25, 2021 Data Mining: Concepts and Techniques 7

● Output node sums up

Input Hidden Output

● Adjust the weights in such a way that the output

● Find the weights wi’s that minimize the above error

● Maximum of a concave function = minimum of a

Gradient ascent rule

● Decision surface is a hyperplane

● Propagate the inputs forward (by applying activation function)

● Backpropagate the error (by updating weights and biases)

● Terminating condition (when error is very small, etc.)

November 23, 2021 Data Mining: Concepts and Techniques 31

November 23, 2021 Data Mining: Concepts and Techniques 33

November 23, 2021 Data Mining: Concepts and Techniques 34

November 23, 2021 Data Mining: Concepts and Techniques 35

November 23, 2021 Data Mining: Concepts and Techniques 36

Training ANN means learning

You might also like