

COMPUTING ALL INVARIANT STATES OF A NEURAL NETWORK. Bart De Moor, Lieven Vandenberghe, Joos Vandewalle. ESAT, El. Eng. Dept., Katholieke Universiteit Leuven, K. Mercierlaan 94, 3030 Heverlee, Belgium.

Conventional methods for the training of neural networks (e.g. the outer-product rule or the spectral algorithm) do not succeed in avoiding the occurrence of undesired ("spurious") invariant states, and this can result in bad convergence. The problem of finding all invariant states of a given neural net is therefore an important design problem. In this presentation it will be shown how this question can be resolved both for continuous neural networks of the type dv(t)/dt = -Av(t) + B F(Tv(t) + s) and for discrete-time models Av(k + 1) = B F(Tv(k) + s), where v denotes the state vector, s the input, and F is an arbitrary piecewise-linear function. This general form includes the popular McCulloch-Pitts model [1]. The invariant states of the models given above are the solutions to the set of piecewise-linear equations Av = B F(Tv + s).
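To make the piecewise-linear fixed-point equation concrete, the following sketch (an illustration only, not the algorithm of [2]) assumes a discrete-time model with A = I and takes the piecewise-linear function, written F here, to be the componentwise saturation of its argument to [-1, 1] — one common choice. F then has 3^n linear regions (each unit saturated low, linear, or saturated high), and in each region the fixed-point condition v = F(Tv + s) is an ordinary linear system, so all invariant states of a small net can be found by brute-force enumeration:

```python
import itertools

def solve_linear(A, b, eps=1e-12):
    """Solve A x = b by Gauss-Jordan elimination; returns None if A is singular."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[piv][col]) < eps:
            return None
        M[col], M[piv] = M[piv], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                for c in range(col, n + 1):
                    M[r][c] -= f * M[col][c]
    return [M[i][n] / M[i][i] for i in range(n)]

def invariant_states(T, s, tol=1e-9):
    """All v with v = F(Tv + s), where F clamps each component to [-1, 1].

    Enumerate the 3^n linear regions of F: each unit is saturated low (-1),
    linear (0), or saturated high (+1).  In each region the fixed-point
    condition is linear; solve it and keep solutions that actually lie in
    the assumed region.
    """
    n = len(s)
    found = set()
    for region in itertools.product((-1, 0, 1), repeat=n):
        # Linear unit i:     v_i - sum_j T[i][j] v_j = s_i.
        # Saturated unit i:  v_i = +/-1.
        A = [[(1.0 if i == j else 0.0) - (T[i][j] if region[i] == 0 else 0.0)
              for j in range(n)] for i in range(n)]
        b = [s[i] if region[i] == 0 else float(region[i]) for i in range(n)]
        v = solve_linear(A, b)
        if v is None:
            continue
        u = [sum(T[i][j] * v[j] for j in range(n)) + s[i] for i in range(n)]
        consistent = all(
            (-1 - tol <= u[i] <= 1 + tol) if region[i] == 0
            else (u[i] >= 1 - tol if region[i] == 1 else u[i] <= -1 + tol)
            for i in range(n))
        if consistent:
            found.add(tuple(round(x, 6) + 0.0 for x in v))
    return found

# A symmetric 2-unit net with zero input: the origin plus two saturated states.
print(sorted(invariant_states([[0.0, 2.0], [2.0, 0.0]], [0.0, 0.0])))
# -> [(-1.0, -1.0), (0.0, 0.0), (1.0, 1.0)]
```

The enumeration grows as 3^n and is only workable for small nets; the point of the GLCP reformulation below is precisely to handle this combinatorial structure systematically.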

Next, the equivalence between this set of nonlinear equations and the Generalized Linear Complementarity Problem (GLCP) will be established. The formulation of this GLCP reads: given real matrices M, N ∈ R^(p×q) and a vector z ∈ R^p, find all q × 1 vectors v, w and scalars α such that Mv + Nw = zα and v ≥ 0, w ≥ 0, α ≥ 0, v^T w = 0. This problem originates from mathematical programming and has also been applied to the analysis of piecewise-linear resistive circuits. Moreover, an algorithm for finding all solutions to this problem has recently been proposed [2]. The related problem of finding all invariant states that share a prespecified amount of partial information can be solved with the same techniques.
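A hedged sketch of the idea behind such enumeration (not the algorithm of [2]): normalize α = 1, i.e. consider only solutions with α > 0. The condition v^T w = 0 with v, w ≥ 0 forces, for each index i, either v_i = 0 or w_i = 0; picking column i of M when v_i may be nonzero and column i of N otherwise gives a square linear system per complementarity pattern, and every nonnegative solution of such a system is a GLCP solution. The 2 × 2 instance below is a hypothetical example encoding the small LCP w = M0 v + q0 (with M0 = [[-1, 0], [0, 2]], q0 = (1, -2)) via M = -M0, N = I, z = q0; it has two solutions, the kind of multiplicity that corresponds to spurious states:

```python
from itertools import product

def solve2(B, z, eps=1e-12):
    """Solve the 2x2 system B x = z by Cramer's rule; None if B is singular."""
    det = B[0][0] * B[1][1] - B[0][1] * B[1][0]
    if abs(det) < eps:
        return None
    return [(z[0] * B[1][1] - B[0][1] * z[1]) / det,
            (B[0][0] * z[1] - z[0] * B[1][0]) / det]

def glcp_all(M, N, z, tol=1e-9):
    """All (v, w) with M v + N w = z, v >= 0, w >= 0, v^T w = 0 (q = 2, alpha = 1).

    For each complementarity pattern, build a basis matrix from column i of M
    (where v_i may be nonzero) or of N (where w_i may be nonzero), solve, and
    keep the nonnegative solutions.
    """
    q = len(z)
    sols = set()
    for pattern in product((False, True), repeat=q):   # True: v_i free, w_i = 0
        B = [[M[r][i] if pattern[i] else N[r][i] for i in range(q)]
             for r in range(q)]
        x = solve2(B, z)
        if x is None or any(xi < -tol for xi in x):
            continue
        v = tuple(round(x[i], 6) if pattern[i] else 0.0 for i in range(q))
        w = tuple(0.0 if pattern[i] else round(x[i], 6) for i in range(q))
        sols.add((v, w))
    return sols

# Encode the LCP  w = M0 v + q0  with M0 = [[-1, 0], [0, 2]], q0 = (1, -2):
M = [[1.0, 0.0], [0.0, -2.0]]   # -M0
N = [[1.0, 0.0], [0.0, 1.0]]    # identity
z = [1.0, -2.0]
for v, w in sorted(glcp_all(M, N, z)):
    print("v =", v, " w =", w)
```

Checking all 2^q patterns this way is exponential in q; the algorithm of [2] is the systematic treatment, and the degenerate case α = 0 (dropped by the normalization above) would need separate handling.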

These results have a theoretical and conceptual importance as well. The GLCP has been studied in connection with optimization problems, and can therefore be expected to yield insight into the variational formulation of the dynamical behaviour of a neural net. Another interesting issue from the GLCP literature is the extensive body of knowledge characterizing the number of solutions in terms of the classes of matrices appearing in the GLCP equation. These results might be useful in studying the information capacity of a neural net and in resolving the problem of designing a net without spurious states.

References

[1] Grossberg, S. (1988). Nonlinear neural networks: principles, mechanisms, and architectures. Neural Networks, 1, 17-61.

[2] De Moor, B., Vandenberghe, L., and Vandewalle, J. The generalized linear complementarity problem and an algorithm to find all its solutions. Submitted to Mathematical Programming.
