Papperoni

1996

Training MLPs Layer by Layer Using an Objective Function for Internal Representations

Thierry Denoeux

Cite Score: 5

AI summary

The paper introduces a constructive algorithm for designing and training multilayer perceptrons layer by layer by optimizing an objective function for internal representations. For discrimination problems the objective is a measure of class separability; for continuous function approximation it is the sample coefficient of multiple determination in the last hidden layer. Simulations show improved performance and robustness compared with back-propagation.

Main Contributions

Introduces a novel constructive algorithm for training multilayer perceptrons.
Optimizes an objective function for internal representations without needing network output computation.
Proposes two objective functions: a measure of class separability for discrimination problems, and the sample coefficient of multiple determination for continuous function approximation.
Demonstrates improvements over back-propagation in terms of performance and robustness through simulations.
Presents a layer-by-layer training scheme until self-encoding of pattern categories is achieved.
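
To make the class-separability idea in the list above concrete, here is a minimal sketch that scores a hidden layer's activations with a standard discriminant criterion, the trace of pinv(S_T) S_B (total and between-class scatter). This is an assumption for illustration: the summary does not state which separability measure the paper uses, and the function name `class_separability` is hypothetical.

```python
import numpy as np

def class_separability(hidden, labels):
    """Return tr(pinv(S_T) @ S_B) for activations of shape (n_samples, n_units).

    Illustrative criterion only; not necessarily the measure used in the paper.
    """
    hidden = np.asarray(hidden, dtype=float)
    labels = np.asarray(labels)
    overall_mean = hidden.mean(axis=0)
    centered = hidden - overall_mean
    total_scatter = centered.T @ centered  # S_T
    between_scatter = np.zeros_like(total_scatter)  # S_B
    for cls in np.unique(labels):
        members = hidden[labels == cls]
        diff = (members.mean(axis=0) - overall_mean)[:, None]
        between_scatter += len(members) * (diff @ diff.T)
    return float(np.trace(np.linalg.pinv(total_scatter) @ between_scatter))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two tight, well-separated clusters of hidden activations.
    cluster_a = rng.normal(0.0, 0.1, size=(50, 2))
    cluster_b = rng.normal(0.0, 0.1, size=(50, 2)) + 10.0
    labels = np.array([0] * 50 + [1] * 50)
    print(class_separability(np.vstack([cluster_a, cluster_b]), labels))
```

A score like this depends only on the hidden activations and the class labels, so it can be evaluated without computing the network's outputs, which is the property the contribution list highlights.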

Abstract

A new constructive algorithm for designing and training multilayer perceptrons is proposed. This algorithm involves the optimization of an objective function for internal representations, which does not require any computation of the network's outputs. Coupled with a strategy for recruiting units during the learning process, this concept provides a scheme for training a multilayer network layer by layer, until self-encoding of the pattern categories is achieved in the final, highest-level representations. Two objective functions are proposed. For discrimination problems, recent experimental and theoretical results concerning back-propagation training of networks with one hidden layer and linear outputs suggest the introduction of a particular measure of class separability. For problems involving the approximation of a continuous function, we show that the minimization of the mean squared output error can be achieved by maximizing a statistical measure (the sample coefficient of multiple determination) in the last hidden layer. Simulations are used to illustrate the process of network construction, and to demonstrate the improvements brought by this approach over back-propagation in terms of performance and robustness.
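
The link between the mean squared output error and the sample coefficient of multiple determination can be checked numerically. The sketch below assumes a single linear output unit with a bias, fitted by ordinary least squares on fixed hidden activations; the function name `r_squared_and_mse` and the toy data are hypothetical. Under that assumption the best achievable MSE equals (1 - R²) times the target variance, so a hidden layer with a higher R² admits a lower output error.

```python
import numpy as np

def r_squared_and_mse(hidden, target):
    """Fit a linear output (with bias) by least squares; return (R^2, MSE)."""
    hidden = np.asarray(hidden, dtype=float)
    target = np.asarray(target, dtype=float)
    # Append a column of ones so the linear output has a bias term.
    design = np.column_stack([hidden, np.ones(len(hidden))])
    weights, *_ = np.linalg.lstsq(design, target, rcond=None)
    residual = target - design @ weights
    deviation = target - target.mean()
    mse = float(np.mean(residual ** 2))
    r2 = 1.0 - float(residual @ residual) / float(deviation @ deviation)
    return r2, mse

if __name__ == "__main__":
    x = np.linspace(-1.0, 1.0, 50)
    target = np.sin(3.0 * x)
    # Two candidate one-unit hidden layers (toy features, not from the paper).
    for name, hidden in [("tanh(3x)", np.tanh(3.0 * x)[:, None]),
                         ("x^2", (x ** 2)[:, None])]:
        r2, mse = r_squared_and_mse(hidden, target)
        print(name, r2, mse, (1.0 - r2) * target.var())
```

The last two printed values agree for each candidate, which is the identity that lets R² in the last hidden layer stand in for the output error.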

References [29]

[1]Learning Internal Representations by Error Propagation

D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986

46 papers in library cite

I expected very little of this, but it was so good at explaining concepts! Very good read. It gets a bit boring when it starts explaining things by the end of the chapter, but good nonetheless.

[2]The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain

F. Rosenblatt - 1958

3 papers in library cite

Very nice intro to NNs!

[3]Connectionist Learning Procedures

Geoffrey E. Hinton - 1987

11 papers in library cite

It's a very good overview of everything that was happening in 1987! A bit too long though, but a good start nonetheless.

[4]Pattern Classification and Scene Analysis

R. O. Duda, P. E. Hart - 1973

9 papers in library cite

[5]The Cascade-Correlation Learning Architecture

S. E. Fahlman, C. Lebiere - 1989

6 papers in library cite

[6]What Size Net Gives Valid Generalization?

E. B. Baum, D. Haussler - 1989

3 papers in library cite

[7]Estimation of Dependences Based on Empirical Data

V. N. Vapnik - 1983

2 papers in library cite

[8]Generalization by Weight-Elimination With Application to Forecasting

A. S. Weigend, D. E. Rumelhart, B. A. Huberman - 1991

2 papers in library cite

[9]Learning by Choice of Internal Representations

T. Grossman, R. Meir, E. Domany - 1989

2 papers in library cite

[10]Quantifying Inductive Bias: AI Learning Algorithms and Valiant's Learning Framework

D. Haussler - 1988

2 papers in library cite

[11]A Convergence Theorem for Sequential Learning in Two-Layer Perceptrons

M. Marchand, M. Golea, P. Rujan - 1990

1 paper in library cites

[12]A Cost Function for Internal Representations

A. Krogh, G. I. Thorbergsson, J. A. Hertz - 1990

1 paper in library cites

[13]A Distributed Search Algorithm for Hard Optimization

P. Courrieu - 1991

1 paper in library cites

[14]A Manual of Standardized Terminology, Techniques and Scoring System for Sleep Stages of Human Subjects

A. Rechtschaffen, A. Kales - 1968

1 paper in library cites

[15]An Approximation of Nonlinear Discriminant Analysis by Multilayer Neural Networks

H. Asoh, N. Otsu - 1990

1 paper in library cites

[16]Approximation Capabilities of Multilayer Feedforward Networks

Kurt Hornik - 1991

1 paper in library cites

[17]Back-Propagation Algorithm Which Varies the Number of Hidden Units

Y. Hirose, K. Yamashita, S. Hijiya - 1991

1 paper in library cites

[18]Constructing Hidden Units Using Examples and Queries

E. B. Baum, Kevin J. Lang - 1991

1 paper in library cites

[19]Fast Generating Algorithm for a General Three-Layer Perceptron

R. Zollner, H. J. Schmitz, F. Wunsch, U. Krey - 1992

1 paper in library cites

[20]Introduction to Statistical Pattern Recognition

K. Fukunaga - 1972

1 paper in library cites

[21]Learning in Feedforward Layered Networks: The Tiling Algorithm

M. Mezard, J. P. Nadal - 1989

1 paper in library cites

[22]Multilayer Perceptrons and Data Analysis

P. Gallinari, S. Thiria, F. F. Soulie - 1988

1 paper in library cites

[23]Neural Network Model: Application to Automatic Analysis of Human Sleep

N. Schaltenbrand, R. Lengelle, J. P. Macher - 1993

1 paper in library cites

[24]On the Relations Between Discriminant Analysis and Multilayer Perceptrons

P. Gallinari, S. Thiria, F. Badran, F. F. Soulie - 1991

1 paper in library cites

[25]Optimizing Multilayer Networks Layer Per Layer Without Back-Propagation

R. Lengelle, Thierry Denoeux - 1992

1 paper in library cites

[26]Reliable Evaluation of Neural Networks

R. M. Burton, W. G. Farris - 1991

1 paper in library cites

[27]Remarks on Interpolation and Recognition Using Neural Nets

E. D. Sontag - 1991

1 paper in library cites

[28]Statistical Pattern Recognition With Neural Networks: Benchmarking Studies

T. Kohonen, R. Chrisley, G. Barna - 1989

1 paper in library cites

[29]The Optimised Internal Representation of Multilayer Classifier Networks Performs Non Linear Discriminant Analysis

A. R. Webb, D. Lowe - 1990

1 paper in library cites

Cited by: 2 papers in your library

Cites: 3 papers in your library

Read on June 28, 2025

There's nothing really wrong with this paper, I just found it VERY uninteresting.
