Papperoni

1986

Learning Internal Representations by Error Propagation

D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams

citations

Cite Score

AI summary

This paper introduces the generalized delta rule for training multilayer networks with hidden units, showcasing its ability to learn complex mappings and internal representations, achieving results on XOR, encoding, and symmetry problems.

Main Contributions

Introduces the generalized delta rule, an extension of the delta rule for training multilayer feedforward networks with semilinear units.
Demonstrates the ability of the generalized delta rule to learn internal representations in hidden units.
Shows that the method implements a gradient descent in weight space.
Applies the learning rule to a variety of problems including XOR, parity, encoding, symmetry, and binary addition.
Discusses how the generalized delta rule can be extended to sigma-pi units and recurrent networks.

Abstract

We now have a rather good understanding of simple two-layer associative networks in which a set of input patterns arriving at an input layer are mapped directly to a set of output patterns at an output layer. Such networks have no hidden units. They involve only input and output units. In these cases there is no internal representation. The coding provided by the external world must suffice. These networks have proved useful in a wide variety of applications (cf. Chapters 2, 17, and 18). Perhaps the essential character of such networks is that they map similar input patterns to similar output patterns. This is what allows these networks to make reasonable generalizations and perform reasonably on patterns that have never before been presented. The similarity of patterns in a PDP system is determined by their overlap. The overlap in such networks is determined outside the learning system itself-by whatever produces the patterns. The constraint that similar input patterns lead to similar outputs can lead to an inability of the system to learn certain mappings from input to output. Whenever the representation provided by the outside world is such that the similarity structure of the input and output patterns are very different, a network without internal representations (i.e., a

Citation Graph

Loading graph...

References [11]

Sort:

Filter:

[1]Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position

Kunihiko Fukushima - 1980

8 papers in library cite

Google Scholar

Good read! First conv net? :)

[2]Une Procédure D'Apprentissage pour Réseau à Seuil Asymétrique

Yann Lecun - 1985

5 papers in library cite

Google Scholar

Meh. Really did not bring much to the table. Describes back-prop but mentions that it comes from Hinton.

[3]Perceptrons

M. Minsky, S. Papert - 1969

12 papers in library cite

Google Scholar

Book, 500 pages

[4]Parallel Distributed Processing

D. E. Rumelhart, J. L. Mcclelland, P. R. Group - 1986

15 papers in library cite

Google Scholar

[5]Learning-Logic

D. B. Parker - 1985

8 papers in library cite

Google Scholar

[6]Adaptive Switching Circuits

B. Widrow, H. E. Hoff - 1960

5 papers in library cite

Google Scholar

[7]An Interactive Activation Model of Context Effects in Letter Perception

J. Mcclelland - 1981

3 papers in library cite

Google Scholar

[8]A Learning algorithm for Asymmetric Neural Networks

Barto - 1985

1 paper in library cites

Google Scholar

[9]An Information-Theoretic Solution to the Aperture Problem

Ackley, Hinton, Sejnowski - 1985

1 paper in library cites

Google Scholar

Missing year

[10]Boltzmann Machines

Hinton, Sejnowski

1 paper in library cites

Google Scholar

[11]Studies of Learning Automata With Pattern Recognizing Units Using Computer Simulation

Kienker, Sejnowski, Hinton, Schumacher - 1985

1 paper in library cites

Google Scholar

Cited by

papers in your library

Cites

papers in your library

Read

on June 24, 2025

I expected very little of this, but was so good in explaining concepts! Very good read. It gets a bit boring when it starts explaining things by the end of the chapter, but good nonetheless.