Papperoni

1986

Learning Process in an Asymmetric Threshold Network

Yann LeCun

AI summary

This paper introduces a hierarchical learning machine (HLM) and presents a method to describe the evolution of weights through an energy function, addressing the credit assignment problem (CAP). The method is applied to a hierarchical associative memory model, demonstrating the potential for learning and generalization through simulations on alphabetic character recognition.

Main Contributions

Introduces a method to describe the evolution of weights through an energy function.
Presents a solution to the credit assignment problem (CAP).
Applies the method to a hierarchical associative memory model (HLM).
Demonstrates learning and generalization capabilities through simulations.
Achieves correct classification with a 6-layer network in character recognition tasks.

Abstract

Threshold functions and related operators are widely used as basic elements of adaptive and associative networks [Nakano 72, Amari 72, Hopfield 82]. There exist numerous learning rules for finding a set of weights to achieve a particular correspondence between input-output pairs. But early works in the field have shown that the number of threshold functions (or linearly separable functions) in N binary variables is small compared to the number of all possible boolean mappings in N variables, especially if N is large. This problem is one of the main limitations of most neural network models where the state is fully specified by the environment during learning: they can only learn linearly separable functions of their inputs. Moreover, a learning procedure which requires the outside world to specify the state of every neuron during the learning session can hardly be considered a general learning rule, because in real-world conditions only partial information on the "ideal" network state for each task is available from the environment. It is possible to use a set of so-called "hidden units" [Hinton, Sejnowski, Ackley 84], without direct interaction with the environment, which can compute intermediate predicates. Unfortunately, the global response depends on the output of a particular hidden unit in a highly non-linear way; moreover, the nature of this dependence is influenced by the states of the other cells. Thus, it is difficult to decide whether the output of a hidden unit is wrong for a particular input and, consequently, how to modify its weights. This last problem has been referred to as the "credit assignment problem" (CAP) in [Hinton & al. 84]. Attempts to find a learning rule taking into account hidden units and generating high-order predicates failed until recently, which could explain the decrease of interest in this field over the past 15 years [Minsky & Papert 68].
In this paper, we consider learning and associative memorization as dynamic processes and show how to describe the evolution of the weights through an "energy" function. This method will be used to solve the CAP and applied to a model of hierarchical associative memory called HLM (Hierarchical Learning Machine).
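
The linear-separability limit and the role of hidden units described in the abstract can be illustrated with a small sketch. This is not code from the paper: the function names, the weight and bias grid, and the hand-picked hidden-unit weights are all assumptions made for illustration. It enumerates every truth table a single threshold unit can produce on two binary inputs, then builds XOR from two hidden threshold units.

```python
from itertools import product


def threshold_unit(weights, bias, inputs):
    """Fire (1) when the weighted input sum plus bias is strictly positive."""
    return int(sum(w * x for w, x in zip(weights, inputs)) + bias > 0)


# All four input patterns of two binary variables, in a fixed order.
PATTERNS = list(product((0, 1), repeat=2))

# Assumed search grid (not from the paper): small, but enough to reach
# every threshold function of two binary inputs.
WEIGHT_VALUES = (-1, 0, 1)
BIAS_VALUES = (-1.5, -0.5, 0.5, 1.5)


def single_unit_truth_tables():
    """Collect every distinct truth table one threshold unit produces on the grid."""
    tables = set()
    for weights in product(WEIGHT_VALUES, repeat=2):
        for bias in BIAS_VALUES:
            tables.add(tuple(threshold_unit(weights, bias, p) for p in PATTERNS))
    return tables


def xor_with_hidden_units(x1, x2):
    """XOR built from two hidden units (OR and AND) feeding one output unit."""
    h_or = threshold_unit((1, 1), -0.5, (x1, x2))    # intermediate predicate: x1 OR x2
    h_and = threshold_unit((1, 1), -1.5, (x1, x2))   # intermediate predicate: x1 AND x2
    return threshold_unit((1, -1), -0.5, (h_or, h_and))  # OR and not AND


XOR_TABLE = tuple(a ^ b for a, b in PATTERNS)
tables = single_unit_truth_tables()

print(len(tables), "of 16 boolean functions reachable by one unit")
print("XOR reachable by one unit:", XOR_TABLE in tables)
print("XOR via hidden units:", [xor_with_hidden_units(a, b) for a, b in PATTERNS])
```

With two inputs, a single unit reaches 14 of the 16 boolean functions; XOR and its complement are the two it misses. The hidden-unit weights above are set by hand, which sidesteps exactly the question the abstract raises: how a learning rule should decide which hidden unit to adjust.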

References [11]

[1]Neural Networks and Physical Systems With Emergent Collective Computational Abilities

J. J. Hopfield - 1982

8 papers in library cite

Very theory heavy and doesn't seem to add much... Famous because it introduced the "Hopfield network" - "At a time when research in neural networks was facing a lull, Hopfield's paper injected a fresh perspective by demonstrating how a network of simple, interconnected units could exhibit complex computational abilities as a collective, emergent property."

[2]A Learning Scheme for Asymmetric Threshold Networks

Yann LeCun - 1985

5 papers in library cite

Meh. Really did not bring much to the table. Describes back-prop but mentions that it comes from Hinton.

[3]Perceptrons

M. Minsky, S. Papert - 1969

12 papers in library cite

Book, 500 pages

[4]Pattern Classification and Scene Analysis

R. O. Duda, P. E. Hart - 1973

9 papers in library cite

[5]Adaptive Switching Circuits

B. Widrow, H. E. Hoff - 1960

5 papers in library cite

[6]Boltzmann Machines, Constraint Satisfaction Networks That Learn

Geoffrey Hinton, T. Sejnowski, D. Ackley - 1984

3 papers in library cite

[7]Self-Organization and Associative Memory

T. Kohonen - 1988

3 papers in library cite

[8]An Adaptive Associative Memory Principle

T. Kohonen - 1974

1 paper in library cites

[9]Associatron, a Model of Associative Memory

K. Nakano - 1972

1 paper in library cites

[10]Learning Patterns and Patterns Sequences by Self-Organizing Net of Threshold Elements

S. I. Amari - 1972

1 paper in library cites

[11]Representation of Associated Data by Matrix Operators

T. Kohonen, M. Ruohonen - 1973

1 paper in library cites

Read on June 22, 2025

Meh. Really did not bring much to the table. Describes back-prop but mentions that it comes from Hinton.