
Slide 1

Radial Basis Function Networks
M.W. Mak

1. Introduction

2. Finding RBF Parameters

3. Decision Surface of RBF Networks

4. Comparison between RBF and BP

Slide 2

1. Introduction

MLPs are highly non-linear in the parameter space, so gradient-descent training suffers from local minima. RBF networks avoid this problem by dividing the learning into two independent processes: the basis-function parameters are found first, and the output weights w are then obtained by linear regression.

Slide 3

RBF networks implement the function

    s(x) = w_0 + \sum_{i=1}^{M} w_i \phi(\| x - c_i \|)

The weights w_i and the centers c_i can be determined separately.

Because the two sets of parameters are decoupled, learning is fast.

Basis function types (sketched in code below):

    \phi(r) = r^2 \log(r)                      (thin-plate spline)
    \phi(r) = \exp(-r^2 / (2\sigma^2))         (Gaussian)
    \phi(r) = (r^2 + \sigma^2)^{1/2}           (multiquadric)
    \phi(r) = (r^2 + \sigma^2)^{-1/2}          (inverse multiquadric)
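As an illustration (not part of the slides), these four basis functions might be written as follows; the width parameter sigma and the function names are assumptions for the sketch.

```python
import numpy as np

def gaussian(r, sigma=1.0):
    # phi(r) = exp(-r^2 / (2 sigma^2))
    return np.exp(-np.square(r) / (2.0 * sigma ** 2))

def multiquadric(r, sigma=1.0):
    # phi(r) = (r^2 + sigma^2)^(1/2)
    return np.sqrt(np.square(r) + sigma ** 2)

def inverse_multiquadric(r, sigma=1.0):
    # phi(r) = (r^2 + sigma^2)^(-1/2)
    return 1.0 / np.sqrt(np.square(r) + sigma ** 2)

def thin_plate_spline(r):
    # phi(r) = r^2 log(r), with phi(0) = 0 by convention
    r = np.asarray(r, dtype=float)
    out = np.zeros_like(r)
    mask = r > 0
    out[mask] = r[mask] ** 2 * np.log(r[mask])
    return out
```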

Slide 4

For Gaussian basis functions,

    s(x_p) = w_0 + \sum_{i=1}^{M} w_i \phi(\| x_p - c_i \|)
           = w_0 + \sum_{i=1}^{M} w_i \exp\left( -\sum_{j=1}^{n} \frac{(x_{pj} - c_{ij})^2}{2\sigma_{ij}^2} \right)

Assuming the variances are equal across all dimensions,

    s(x_p) = w_0 + \sum_{i=1}^{M} w_i \exp\left( -\frac{1}{2\sigma_i^2} \sum_{j=1}^{n} (x_{pj} - c_{ij})^2 \right)

Slide 5

To write this in matrix form, let a_{pi} = \phi(\| x_p - c_i \|), so that

    s(x_p) = \sum_{i=0}^{M} w_i a_{pi},   where a_{p0} = 1.

Stacking the N training patterns,

    \begin{bmatrix} s(x_1) \\ s(x_2) \\ \vdots \\ s(x_N) \end{bmatrix}
    =
    \begin{bmatrix} 1 & a_{11} & a_{12} & \cdots & a_{1M} \\
                    1 & a_{21} & a_{22} & \cdots & a_{2M} \\
                    \vdots &    &        &        & \vdots \\
                    1 & a_{N1} & a_{N2} & \cdots & a_{NM} \end{bmatrix}
    \begin{bmatrix} w_0 \\ w_1 \\ \vdots \\ w_M \end{bmatrix}

i.e. s = A w, so w = A^{-1} s when A is square and invertible (the general least-squares solution is derived on slide 11).
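As an illustration (not from the slides), the design matrix A for Gaussian basis functions can be built as follows; function and variable names are illustrative, and the widths sigma_i correspond to the equal-variance form on slide 4.

```python
import numpy as np

def design_matrix(X, centers, sigmas):
    """X: (N, n) inputs; centers: (M, n); sigmas: (M,) widths."""
    # squared distances ||x_p - c_i||^2, shape (N, M)
    sq_dist = np.square(X[:, None, :] - centers[None, :, :]).sum(axis=-1)
    phi = np.exp(-sq_dist / (2.0 * np.square(sigmas)[None, :]))
    # prepend the bias column a_{p0} = 1
    return np.hstack([np.ones((phi.shape[0], 1)), phi])

# The network outputs for all N patterns are then simply:  s = A @ w
```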

Slide 6

2. Finding the RBF Parameters

Use the K-means algorithm to find the centers c_i: each center is placed at the mean of the training samples assigned to its cluster.

Slide 7

K-means Algorithm

Step 1: K initial cluster centers are chosen randomly from the samples to form K groups.
Step 2: Each sample is added to the group whose mean is closest to that sample.
Step 3: The mean of each group is adjusted to take the new points into account.
Step 4: Repeat steps 2 and 3 until the distance between the old means and the new means of all clusters is smaller than a predefined tolerance. (A code sketch of this procedure is given below.)
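A minimal sketch of this procedure in its batch form, assuming numpy; the slides describe a per-sample (online) variant, but the stopping rule of step 4 is the same. The function and parameter names are illustrative.

```python
import numpy as np

def k_means(X, K, tol=1e-4, max_iter=100, seed=None):
    """X: (N, n) samples; returns the (K, n) cluster means and the sample labels."""
    rng = np.random.default_rng(seed)
    # step 1: choose K initial centers randomly from the samples
    centers = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(max_iter):
        # step 2: assign each sample to the group whose mean is closest
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=-1)
        labels = dist.argmin(axis=1)
        # step 3: recompute the mean of each group (keep the old mean if a group is empty)
        new_centers = np.array([X[labels == k].mean(axis=0)
                                if np.any(labels == k) else centers[k]
                                for k in range(K)])
        # step 4: stop when the means move less than the tolerance
        if np.linalg.norm(new_centers - centers) < tol:
            return new_centers, labels
        centers = new_centers
    return centers, labels
```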

Slide 8

Outcome: there are K clusters, and the mean of each cluster is its centroid.

Advantages: (1) it is a fast and simple algorithm; (2) it reduces the effect of noisy samples.

Slide 9

Use the K-nearest-neighbor rule to find the function width \sigma_i:

    \sigma_i^2 = \frac{1}{K} \sum_{k=1}^{K} \| c_k - c_i \|^2

where c_k is the k-th nearest neighbor of c_i. The objective is to cover the training points so that a smooth fit of the training samples can be achieved.
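A minimal numpy sketch of this width heuristic; rbf_widths and its arguments are illustrative names, not from the slides.

```python
import numpy as np

def rbf_widths(centers, K=2):
    """sigma_i from the K nearest neighboring centers of each center c_i."""
    # pairwise distances between centers, shape (M, M)
    dist = np.linalg.norm(centers[:, None, :] - centers[None, :, :], axis=-1)
    # sort each row; column 0 is the zero distance from a center to itself
    nearest = np.sort(dist, axis=1)[:, 1:K + 1]
    sigma_sq = np.mean(np.square(nearest), axis=1)   # sigma_i^2 as defined above
    return np.sqrt(sigma_sq)
```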

Slide 10

[Figure: centers and widths found by K-means and K-nearest neighbors.]

Slide 11

Determining the weights w using the least-squares method: minimize

    E = \sum_{p=1}^{N} \left( d_p - \sum_{j=0}^{M} w_j \phi(\| x_p - c_j \|) \right)^2

where d_p is the desired output for pattern p. In matrix form,

    E = (d - Aw)^T (d - Aw)

Setting \partial E / \partial w = 0 gives

    w = (A^T A)^{-1} A^T d
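Taken literally, the formula above can be evaluated as in the following minimal numpy sketch (solving the normal equations A^T A w = A^T d rather than forming the explicit inverse). The next slides explain why this direct approach is numerically fragile.

```python
import numpy as np

def solve_weights_normal_equations(A, d):
    """w from the normal equations A^T A w = A^T d (A: (N, M+1), d: (N,))."""
    # np.linalg.solve raises LinAlgError when A^T A is singular
    return np.linalg.solve(A.T @ A, A.T @ d)
```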

Slide 12

Let E be the total squared error between the actual output and the target output, with d = (d_1, d_2, ..., d_N)^T:

    E = (d - Aw)^T (d - Aw)
      = d^T d - d^T A w - w^T A^T d + w^T A^T A w

    \frac{\partial E}{\partial w} = -A^T d - A^T d + 2 A^T A w = -2 A^T d + 2 A^T A w = 0

    \Rightarrow A^T A w = A^T d \Rightarrow w = (A^T A)^{-1} A^T d

Slide 13

Note the matrix-calculus identities used in the derivation:

    \frac{\partial}{\partial x}(x^T y) = y, \qquad
    \frac{\partial}{\partial x}(x^T A y) = A y, \qquad
    \frac{\partial}{\partial x}(x^T A x) = (A + A^T) x

Problems with the solution w = (A^T A)^{-1} A^T d:

(1) It is susceptible to round-off error.
(2) There is no solution if A^T A is singular.
(3) If A^T A is close to singular, we get very large components in w.

Slide 14

Reasons:

(1) Inaccuracy in forming A^T A.
(2) If A is ill-conditioned, a small change in A introduces a large change in (A^T A)^{-1}.
(3) If A^T A is close to singular, dependent columns exist in A^T A; geometrically, the equations correspond to two (nearly) parallel straight lines in the (x, y) plane.

Slide 15

Singular matrix:

    \begin{bmatrix} 1 & 2 \\ 2 & 4 \end{bmatrix}
    \begin{bmatrix} x \\ y \end{bmatrix}
    =
    \begin{bmatrix} 0 \\ 1 \end{bmatrix}

The two rows describe parallel lines, so there is no solution. If the lines are only nearly parallel, they intersect at a point where x and y are extremely large (one tending to +\infty and the other to -\infty). So the magnitude of the solution becomes very large; hence overflow will occur. The effect of the large components could cancel out only if the machine precision were infinite.

Slide 16

If the machine precision is finite, we get a large error. For example, when the matrix entry 4 above is perturbed to 4.000001, the computed solution components grow to the order of \pm 10^{38}; with finite machine precision, the terms that should cancel exactly instead leave a residual of the order of 10^{33} rather than 0.

Solution: Singular Value Decomposition
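A minimal numpy sketch of this remedy: solve s = A w through the SVD of A, zeroing the reciprocals of negligible singular values instead of letting them blow up. The rcond threshold and the function name are illustrative choices.

```python
import numpy as np

def solve_weights_svd(A, d, rcond=1e-10):
    """Least-squares weights via the SVD of A, discarding tiny singular values."""
    U, S, Vt = np.linalg.svd(A, full_matrices=False)
    keep = S > rcond * S.max()
    S_inv = np.zeros_like(S)
    S_inv[keep] = 1.0 / S[keep]          # reciprocals of the retained singular values
    return Vt.T @ (S_inv * (U.T @ d))    # w = V S^+ U^T d

# Equivalently: w = np.linalg.pinv(A) @ d, or np.linalg.lstsq(A, d, rcond=None)[0]
```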

Slide 17

RBF learning process:

    x_p                 --K-means-->             centers c_i
    c_i                 --K-nearest neighbor-->  widths \sigma_i
    x_p, c_i, \sigma_i  --basis functions-->     design matrix A
    A, d                --linear regression-->   weights w
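Putting the stages of this diagram together, a minimal self-contained sketch might look as follows. It assumes scikit-learn's KMeans for the K-means step and scipy's cdist for the distance computations; the function names (train_rbf, predict_rbf) and default settings are illustrative, not from the slides.

```python
import numpy as np
from scipy.spatial.distance import cdist
from sklearn.cluster import KMeans

def train_rbf(X, d, M, knn=2):
    # K-means -> centers c_i
    centers = KMeans(n_clusters=M, n_init=10).fit(X).cluster_centers_
    # K nearest neighboring centers -> widths sigma_i
    nn = np.sort(cdist(centers, centers), axis=1)[:, 1:knn + 1]
    sigmas = np.sqrt(np.mean(nn ** 2, axis=1))
    # basis functions -> design matrix A (with a bias column)
    A = np.hstack([np.ones((len(X), 1)),
                   np.exp(-cdist(X, centers) ** 2 / (2 * sigmas ** 2))])
    # linear regression -> weights w
    w = np.linalg.lstsq(A, d, rcond=None)[0]
    return centers, sigmas, w

def predict_rbf(X, centers, sigmas, w):
    A = np.hstack([np.ones((len(X), 1)),
                   np.exp(-cdist(X, centers) ** 2 / (2 * sigmas ** 2))])
    return A @ w
```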

Slide 18

RBF learning by gradient descent. Let

    e(x_p) = d(x_p) - s(x_p),
    \phi_i(x_p) = \exp\left( -\frac{1}{2} \sum_{j=1}^{n} \frac{(x_{pj} - c_{ij})^2}{\sigma_{ij}^2} \right),
    E = \frac{1}{2} \sum_{p=1}^{N} e^2(x_p).

We then compute \partial E / \partial w_i, \partial E / \partial c_{ij} and \partial E / \partial \sigma_{ij}, and apply gradient descent.

Slide 19

We have the following update equations (a code sketch follows below):

    w_i(t) = w_i(t-1) + \eta_w \sum_{p=1}^{N} e(x_p)\, \phi_i(x_p), \qquad i = 1, 2, \ldots, M

    w_i(t) = w_i(t-1) + \eta_w \sum_{p=1}^{N} e(x_p), \qquad \text{when } i = 0

    \sigma_{ij}(t) = \sigma_{ij}(t-1) + \eta_\sigma \sum_{p=1}^{N} e(x_p)\, w_i\, \phi_i(x_p)\, \frac{(x_{pj} - c_{ij})^2}{\sigma_{ij}^3(t-1)}

    c_{ij}(t) = c_{ij}(t-1) + \eta_c \sum_{p=1}^{N} e(x_p)\, w_i\, \phi_i(x_p)\, \frac{x_{pj} - c_{ij}}{\sigma_{ij}^2(t-1)}

where \eta_w, \eta_c and \eta_\sigma are the learning rates.
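A minimal numpy sketch of one batch update with these equations; the learning-rate names (eta_w, eta_c, eta_s) and the function name are illustrative.

```python
import numpy as np

def rbf_gd_step(X, d, w0, w, C, S, eta_w=0.01, eta_c=0.01, eta_s=0.01):
    """One batch update. X: (N, n); d: (N,); w0: bias; w: (M,); C, S: (M, n)."""
    diff = X[:, None, :] - C[None, :, :]                             # x_pj - c_ij, (N, M, n)
    phi = np.exp(-0.5 * np.sum(diff ** 2 / S[None] ** 2, axis=-1))   # phi_i(x_p), (N, M)
    e = d - (w0 + phi @ w)                                           # e(x_p), (N,)
    w0_new = w0 + eta_w * e.sum()                                    # bias update (i = 0)
    w_new = w + eta_w * (e @ phi)                                    # weight updates (i >= 1)
    c_new = C + eta_c * np.einsum('p,i,pi,pij->ij', e, w, phi, diff / S[None] ** 2)
    s_new = S + eta_s * np.einsum('p,i,pi,pij->ij', e, w, phi, diff ** 2 / S[None] ** 3)
    return w0_new, w_new, c_new, s_new
```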

Slide 20

Elliptical Basis Function networks:

    \phi_j(x_p) = \exp\left\{ -\frac{1}{2} (x_p - \mu_j)^T \Sigma_j^{-1} (x_p - \mu_j) \right\}

    y_k(x_p) = \sum_{j=0}^{J} w_{kj}\, \phi_j(x_p)

\mu_j: function centers
\Sigma_j: covariance matrices

[Network diagram: inputs x_1, ..., x_n feed basis functions \phi_1, ..., \phi_M plus a bias node; outputs y_1(x), ..., y_K(x).]

Slide 21

K-means and sample covariance.

K-means: assign x to cluster j if \| x - \mu_j \| \le \| x - \mu_k \| for all k \ne j, and set

    \mu_j = \frac{1}{N_j} \sum_{x \in \text{cluster } j} x

Sample covariance:

    \Sigma_j = \frac{1}{N_j} \sum_{x \in \text{cluster } j} (x - \mu_j)(x - \mu_j)^T

Alternatively, the EM algorithm can be used to estimate \mu_j and \Sigma_j.
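A minimal numpy sketch of these estimates and of the resulting elliptical basis functions. Assumptions: every cluster is non-empty, np.cov's 1/(N_j - 1) normalization stands in for the 1/N_j above, and a small ridge term (not on the slides) keeps \Sigma_j invertible; all names are illustrative.

```python
import numpy as np

def ebf_parameters(X, labels, M, ridge=1e-6):
    """Cluster means mu_j and sample covariances Sigma_j from K-means labels."""
    mus, covs = [], []
    for j in range(M):
        Xj = X[labels == j]                      # assumes every cluster is non-empty
        mus.append(Xj.mean(axis=0))
        # np.cov normalizes by (N_j - 1); the slide uses 1/N_j
        covs.append(np.cov(Xj, rowvar=False) + ridge * np.eye(X.shape[1]))
    return np.array(mus), np.array(covs)

def ebf_design_matrix(X, mus, covs):
    """phi_j(x_p) = exp(-0.5 (x_p - mu_j)^T Sigma_j^{-1} (x_p - mu_j))."""
    cols = []
    for mu, cov in zip(mus, covs):
        diff = X - mu                                         # (N, n)
        maha = np.sum(diff @ np.linalg.inv(cov) * diff, axis=1)
        cols.append(np.exp(-0.5 * maha))
    return np.column_stack(cols)   # (N, J); a bias column can be prepended as on slide 5
```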

Slide 22

EBF vs. RBF networks

[Figure: decision regions for a two-class problem (Class 1, Class 2) over the square [-3, 3] x [-3, 3]. Left: RBFN with 4 centers. Right: EBFN with 4 centers.]

Slide 23

Elliptical Basis Function Networks

[Figure: output 1 of an EBF network (bias, no rescale, gamma = 1) plotted as a surface over [-3, 3] x [-3, 3]; data file 'nxor.ebf 4.Y.N.1.dat'.]

Slide 24

RBFN for Pattern Classification

MLP: decision boundaries formed by hyperplanes. RBF: decision boundaries formed by kernel functions.

The probability density function (also called the conditional density function or likelihood) of the k-th class is defined as p(x | C_k).

Slide 25

According to Bayes' theorem, the posterior probability is

    P(C_k | x) = \frac{p(x | C_k)\, P(C_k)}{p(x)}

where P(C_k) is the prior probability and

    p(x) = \sum_r p(x | C_r)\, P(C_r).

It is possible to use a common pool of M basis functions, labeled by an index j, to represent all of the class-conditional densities, i.e.

    p(x | C_k) = \sum_{j=1}^{M} p(x | j)\, P(j | C_k).

Slide 26

[Diagram: the class-conditional density p(x | C_k) is a mixture of the shared basis densities p(x | 1), p(x | 2), ..., p(x | M), with mixing coefficients P(1 | C_k), ..., P(M | C_k).]

    p(x | C_k) = \sum_{j=1}^{M} p(x | j)\, P(j | C_k)

Slide 27

The unconditional density of x becomes

    p(x) = \sum_k p(x | C_k)\, P(C_k)
         = \sum_k \sum_{j=1}^{M} p(x | j)\, P(j | C_k)\, P(C_k)
         = \sum_{j=1}^{M} p(x | j)\, P(j),
    \qquad \text{where } P(j) = \sum_k P(j | C_k)\, P(C_k).

Substituting into Bayes' theorem,

    P(C_k | x) = \frac{\sum_{j=1}^{M} p(x | j)\, P(j | C_k)\, P(C_k)}{\sum_{j'=1}^{M} p(x | j')\, P(j')}.

Slide 28

Rearranging, the posterior becomes a weighted sum of normalized basis-function outputs:

    P(C_k | x) = \sum_{j=1}^{M} \frac{P(j | C_k)\, P(C_k)}{P(j)} \cdot \frac{p(x | j)\, P(j)}{\sum_{j'=1}^{M} p(x | j')\, P(j')}
               = \sum_{j=1}^{M} w_{kj}\, \phi_j(x)

Hidden node's output \phi_j(x) = P(j | x): the posterior probability of the j-th set of features being present in the input x.

Weight w_{kj} = P(C_k | j): the posterior probability of class membership, given the presence of the j-th set of features.

There is no bias term.
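A minimal numpy sketch of this probabilistic reading, assuming Gaussian basis densities p(x | j) with a shared spherical width (an assumption for illustration; the slides leave the basis densities general). priors[j] stands for P(j) and W[k, j] for P(C_k | j); all names are illustrative.

```python
import numpy as np

def class_posteriors(x, mus, sigma, priors, W):
    """P(C_k|x) for one input. x: (n,); mus: (M, n); priors: (M,) = P(j); W: (K, M) = P(C_k|j)."""
    n = x.shape[0]
    # basis densities p(x|j): spherical Gaussians with shared width sigma
    sq_dist = np.sum((x - mus) ** 2, axis=1)
    px_j = np.exp(-sq_dist / (2 * sigma ** 2)) / (2 * np.pi * sigma ** 2) ** (n / 2)
    # hidden-node outputs phi_j(x) = P(j|x) = p(x|j) P(j) / sum_j' p(x|j') P(j')
    phi = px_j * priors
    phi /= phi.sum()
    # P(C_k|x) = sum_j P(C_k|j) P(j|x); note there is no bias term
    return W @ phi
```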

Slide 29

Comparison of RBF and MLP

                          RBF networks                          MLP
Learning speed            Very fast                             Very slow
Convergence               Almost guaranteed                     Not guaranteed
Response time             Slow                                  Fast
Memory requirement        Very large                            Small
Hardware implementation   IBM ZISC036, Nestor Ni1000            Voice Direct 364
                          (www-5.ibm.com/fr/cdlab/zisc.html)    (www.sensoryinc.com)
Generalization            Usually better                        Usually poorer

To learn more about NN hardware, see http://www.particle.kth.se/~lindsey/HardwareNNWCourse/home.html
