Generalization and Network
Design Strategies
Y. le Cun
Department of Computer Science
University of Toronto
Technical Report CRG-TR-89-4
June 1989
Send requests to:
The CRG technical report secretary
Department of Computer Science
University of Toronto
10 King's College Road
Toronto M5S 1A4
CANADA
INTERNET: [email protected]
UUCP: uunet!utai!carol
BITNET: carol@utorgpu
This work has been supported by a grant from the Fyssen Foundation, and a grant from the Sloan Foundation to Geoffrey Hinton. The author wishes to thank Geoff Hinton, Mike Mozer, Sue Becker and Steve Nowlan for helpful discussions, and John Denker and Larry Jackel for useful comments. The Neural Network simulator SN is the result of a collaboration between Leon-Yves Bottou and the author. Y. le Cun's present address is Room 4G-332, AT&T Bell Laboratories, Crawfords Corner Rd, Holmdel, NJ 07733.

Y. le Cun. Generalization and network design strategies. Technical Report CRG-TR-89-4, University of Toronto Connectionist Research Group, June 1989. A shorter version was published in Pfeifer, Schreter, Fogelman and Steels (eds), 'Connectionism in Perspective', Elsevier, 1989.
Generalization and Network Design
Strategies

Yann le Cun*

Department of Computer Science, University of Toronto
Toronto, Ontario, M5S 1A4, CANADA

Abstract
An interesting property of connectionist systems is their ability to learn from examples. Although most recent work in the field concentrates on reducing learning times, the most important feature of a learning machine is its generalization performance. It is usually accepted that good generalization performance on real-world problems cannot be achieved unless some a priori knowledge about the task is built into the system. Back-propagation networks provide a way of specifying such knowledge by imposing constraints both on the architecture of the network and on its weights. In general, such constraints can be considered as particular transformations of the parameter space.

Building a constrained network for image recognition appears to be a feasible task. We describe a small handwritten digit recognition problem and show that, even though the problem is linearly separable, single-layer networks exhibit poor generalization performance. Multilayer constrained networks perform very well on this task when organized in a hierarchical structure with shift-invariant feature detectors.

These results confirm the idea that minimizing the number of free parameters in the network enhances generalization.
1 Introduction
Connectionist architectures have drawn considerable attention in recent years because of their interesting learning abilities. Among the numerous learning algorithms that have been proposed for complex connectionist networks, Back-Propagation (BP) is probably the most widespread. BP was proposed in (Rumelhart et al., 1986), but had been developed before by several independent groups in different contexts and for different purposes (Bryson and Ho, 1969; Werbos, 1974; le Cun, 1985; Parker, 1985; le Cun, 1986). Reference (Bryson and Ho, 1969) was in the framework of optimal control and system identification, and one could argue that the basic idea behind BP had been used in optimal control long before its application to machine learning was considered (le Cun, 1988).

*Present address: Room 4G-352, AT&T Bell Laboratories, Crawfords Corner Rd, Holmdel, NJ 07733.
Two performance measures should be considered when testing a learning algorithm: learning speed and generalization performance. Generalization is the main property that should be sought; it determines the amount of data needed to train the system such that a correct response is produced when it is presented with patterns outside of the training set. We will see that learning speed and generalization are closely related.
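The distinction between fitting the training set and generalizing beyond it is conventionally measured on a held-out test set. The following is a minimal sketch of such a measurement (the synthetic task, the single logistic unit, and all names are illustrative assumptions, not material from this report):

    import numpy as np

    rng = np.random.default_rng(0)

    # Illustrative synthetic task: 200 patterns, 16 inputs, linearly separable labels.
    X = rng.normal(size=(200, 16))
    y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(float)

    # Hold out half the patterns; generalization is accuracy outside the training set.
    X_train, y_train = X[:100], y[:100]
    X_test, y_test = X[100:], y[100:]

    w, b, lr = np.zeros(16), 0.0, 0.1
    for epoch in range(50):
        p = 1.0 / (1.0 + np.exp(-(X_train @ w + b)))  # logistic unit output
        grad = p - y_train                            # cross-entropy gradient
        w -= lr * X_train.T @ grad / len(y_train)
        b -= lr * grad.mean()

    def accuracy(X, y):
        return np.mean(((X @ w + b) > 0) == (y > 0.5))

    print("training accuracy:      ", accuracy(X_train, y_train))
    print("generalization accuracy:", accuracy(X_test, y_test))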
Although various successful applications of BP have been described in the literature, the conditions in which good generalization performance can be obtained are not understood. Considering BP as a general learning rule that can be used as a black box for a wide variety of problems is, of course, wishful thinking. Although some moderate-sized problems can be solved using unstructured networks, we cannot expect an unstructured network to generalize correctly on every problem. The main point of this paper is to show that good generalization performance can be obtained if some a priori knowledge about the task is built into the network. Although in the general case specifying such knowledge may be difficult, it appears feasible on some highly regular tasks such as image and speech recognition.
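One concrete form such a priori knowledge can take, anticipating the shift-invariant feature detectors mentioned in the abstract, is weight sharing: the same small kernel of weights is replicated at every position of the image, so the number of free parameters is set by the kernel size rather than the image size. The sketch below is illustrative only (it is not the paper's SN simulator; the function and variable names are assumptions):

    import numpy as np

    def shift_invariant_feature_map(image, kernel, bias):
        """Slide one shared kernel over the image; every output unit
        reuses the same weights, giving a shift-invariant detector."""
        kh, kw = kernel.shape
        h, w = image.shape[0] - kh + 1, image.shape[1] - kw + 1
        out = np.empty((h, w))
        for i in range(h):
            for j in range(w):
                out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel) + bias
        return np.tanh(out)  # squashing nonlinearity

    rng = np.random.default_rng(1)
    image = rng.normal(size=(16, 16))
    kernel = rng.normal(size=(5, 5))
    fmap = shift_invariant_feature_map(image, kernel, 0.0)
    # 12 x 12 = 144 output units, yet only 5*5 + 1 = 26 free parameters.
    print(fmap.shape, kernel.size + 1)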
Tailoring the network architecture to the task can be thought of as a way of reducing the size of the space of possible functions that the network can generate, without overly reducing its computational power. Theoretical studies (Denker et al., 1987; Patarnello and Carnevali, 1987) have shown that the likelihood of correct generalization depends on the size of the hypothesis space (total number of networks being considered), the size of the solution space (set of networks that give good generalization), and the number of training examples. If the hypothesis space is too large and/or the number of training examples is too small, then there will be a vast number of networks which are consistent with the training data, only a small proportion of which will lie in the true solution space, so poor generalization is to be expected. Conversely, if good generalization is required, then as the generality of the architecture is increased, the number of training examples must also be increased. Specifically, the required number of examples scales like the logarithm of the number of functions that the network architecture can implement.
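This logarithmic scaling can be made concrete with the standard counting argument for a finite hypothesis space (the bound below is the textbook result for consistent learners, not a formula taken from this report). If the architecture can implement $|\mathcal{H}|$ distinct functions, then with probability at least $1 - \delta$, every network consistent with $m$ training examples has generalization error below $\epsilon$ provided

    m \;\ge\; \frac{1}{\epsilon} \left( \ln |\mathcal{H}| + \ln \frac{1}{\delta} \right),

so constraining the architecture, which shrinks $|\mathcal{H}|$, directly reduces the number of examples needed for the same generalization guarantee.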