Neural Networks: Tricks of the Trade

Neural Networks: Tricks of the Trade [electronic resource] / edited by Genevieve B. Orr, Klaus-Robert Müller. - 1st ed. 1998. - VIII, 432 p., online resource. - (Lecture Notes in Computer Science, ISSN 1611-3349 ; 1524).

Speeding Learning -- Efficient BackProp -- Regularization Techniques to Improve Generalization -- Early Stopping - But When? -- A Simple Trick for Estimating the Weight Decay Parameter -- Controlling the hyperparameter search in MacKay’s Bayesian neural network framework -- Adaptive Regularization in Neural Network Modeling -- Large Ensemble Averaging -- Improving Network Models and Algorithmic Tricks -- Square Unit Augmented Radially Extended Multilayer Perceptrons -- A Dozen Tricks with Multitask Learning -- Solving the Ill-Conditioning in Neural Network Learning -- Centering Neural Network Gradient Factors -- Avoiding roundoff error in backpropagating derivatives -- Representing and Incorporating Prior Knowledge in Neural Network Training -- Transformation Invariance in Pattern Recognition — Tangent Distance and Tangent Propagation -- Combining Neural Networks and Context-Driven Search for Online, Printed Handwriting Recognition in the Newton -- Neural Network Classification and Prior Class Probabilities -- Applying Divide and Conquer to Large Scale Pattern Recognition Tasks -- Tricks for Time Series -- Forecasting the Economy with Neural Nets: A Survey of Challenges and Solutions -- How to Train Neural Networks.

It is our belief that researchers and practitioners acquire, through experience and word-of-mouth, techniques and heuristics that help them successfully apply neural networks to difficult real world problems. Often these "tricks" are theoretically well motivated. Sometimes they are the result of trial and error. However, their most common link is that they are usually hidden in people’s heads or in the back pages of space-constrained conference papers. As a result, newcomers to the field waste much time wondering why their networks train so slowly and perform so poorly. This book is an outgrowth of a 1996 NIPS workshop called Tricks of the Trade whose goal was to begin the process of gathering and documenting these tricks. The interest that the workshop generated motivated us to expand our collection and compile it into this book. Although we have no doubt that there are many tricks we have missed, we hope that what we have included will prove to be useful, particularly to those who are relatively new to the field. Each chapter contains one or more tricks presented by a given author (or authors). We have attempted to group related chapters into sections, though we recognize that the different sections are far from disjoint. Some of the chapters (e.g., 1, 13, 17) contain entire systems of tricks that are far more general than the category they have been placed in.

9783540494300

10.1007/3-540-49430-8 doi


Computer science.
Artificial intelligence.
Microprocessors.
Computer architecture.
Pattern recognition systems.
Dynamics.
Nonlinear theories.
Theory of Computation.
Artificial Intelligence.
Processor Architectures.
Automated Pattern Recognition.
Applied Dynamical Systems.

QA75.5-76.95

004.0151