1986

Learning Process in an Asymmetric Threshold Network

Yann Lecun

citations

Cite Score

20

AI summary

This paper introduces a hierarchical learning machine (HLM) and presents a method to describe the evolution of weights through an energy function, addressing the credit assignment problem (CAP). The method is applied to a hierarchical associative memory model, demonstrating the potential for learning and generalization through simulations on alphabetic character recognition.

Main Contributions

  • Introduces a method to describe the evolution of weights through an energy function.
  • Presents a solution to the credit assignment problem (CAP).
  • Applies the method to a hierarchical associative memory model (HLM).
  • Demonstrates learning and generalization capabilities through simulations.
  • Achieves correct classification with a 6-layer network in character recognition tasks.

Abstract

Threshold functions and related operators are widely used as basic elements of adaptive and associative networks [Nakano 72, Amari 72, Hopfield 82]. There exist numerous learning rules for finding a set of weights to achieve a particular correspondence between input-output pairs. But early works in the field have shown that the number of threshold functions (or linearly separable functions) in N binary variables is small compared to the number of all possible boolean mappings in N variables, especially if N is large. This problem is one of the main limitations of most neural network models where the state is fully specified by the environment during learning: they can only learn linearly separable functions of their inputs. Moreover, a learning procedure which requires the outside world to specify the state of every neuron during the learning session can hardly be considered a general learning rule because, in real-world conditions, only partial information on the "ideal" network state for each task is available from the environment. It is possible to use a set of so-called "hidden units" [Hinton, Sejnowski, Ackley 84], without direct interaction with the environment, which can compute intermediate predicates. Unfortunately, the global response depends on the output of a particular hidden unit in a highly non-linear way; moreover, the nature of this dependence is influenced by the states of the other cells. Thus, it is difficult to decide whether the output of a hidden unit is wrong for a particular input, and, consequently, how to modify its weights. This last problem has been referred to as the "credit assignment problem" (CAP) in [Hinton & al. 84]. Attempts to find a learning rule taking into account hidden units and generating high order predicates failed until recently, which could explain the decrease of interest in this field for the past 15 years [Minsky & Papert 68].
In this paper, we consider learning and associative memorization as dynamic processes and show how to describe the evolution of the weights through an "energy" function. This method will be used to solve the CAP and applied to a model of hierarchical associative memory called HLM (Hierarchical Learning Machine).
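
The linear-separability limitation described in the abstract can be illustrated with a small Python sketch. It is not the paper's method. It assumes a 0/1 threshold unit that fires when its weighted sum is strictly positive, and it searches a small integer weight grid (a hypothetical choice made only for illustration, not a proof). For two binary inputs, the search finds 14 of the 16 possible boolean mappings, and XOR is not among them. The hidden-unit weights at the end are hand-picked rather than learned, simply to show that intermediate predicates make XOR computable.

```python
from itertools import product

def threshold_unit(weights, bias, inputs):
    """Binary threshold function: 1 if the weighted sum plus bias is > 0, else 0."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if total > 0 else 0

# All four input patterns for two binary variables, in a fixed order.
INPUTS = list(product([0, 1], repeat=2))  # (0,0), (0,1), (1,0), (1,1)

def realizable_truth_tables(grid):
    """Collect every 2-input truth table that a single threshold unit computes
    when its two weights and bias range over `grid` (illustrative assumption)."""
    tables = set()
    for w1, w2, b in product(grid, repeat=3):
        tables.add(tuple(threshold_unit((w1, w2), b, x) for x in INPUTS))
    return tables

GRID = range(-2, 3)            # hypothetical search range: -2 .. 2
TABLES = realizable_truth_tables(GRID)
XOR = (0, 1, 1, 0)             # truth table of XOR over INPUTS

def xor_with_hidden_units(x):
    """XOR built from two hand-chosen hidden threshold units (not learned)."""
    h_or = threshold_unit((1, 1), 0, x)     # fires if at least one input is 1
    h_and = threshold_unit((1, 1), -1, x)   # fires only if both inputs are 1
    # Output unit fires when OR is on and AND is off.
    return threshold_unit((1, -1), 0, (h_or, h_and))

if __name__ == "__main__":
    print(len(TABLES), "of", 2 ** len(INPUTS), "mappings found on the grid")
    print("XOR realizable by one unit on this grid:", XOR in TABLES)
    print("XOR via hidden units:", [xor_with_hidden_units(x) for x in INPUTS])
```

The sketch shows only that hidden units extend what threshold networks can represent. Deciding how to adjust the hidden weights automatically is the credit assignment problem that the paper addresses.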

References [11]

J. J. Hopfield - 1982
Yann Lecun - 1985
M. Minsky, S. Papert - 1969
R. O. Duda, P. E. Hart - 1973
B. Widrow, H. E. Hoff - 1960
G. Hinton, T. Sejnowski, D. Ackley - 1984
T. Kohonen - 1988
T. Kohonen - 1974
K. Nakano - 1972
S. I. Amari - 1972
T. Kohonen, M. Ruohonen - 1973

Paper Aliases

Une Procédure D'Apprentissage pour Réseau à Seuil Asymétrique
A Learning Scheme for Asymmetric Threshold Networks
Learning Processes in an Asymmetric Threshold Network