from last time: pr methods

37
From last time: PR Methods Feature extraction + Pattern classification Training, testing, overfitting, overtraining Minimum distance methods Discriminant Functions • Linear • Nonlinear (e.g, quadratic, neural networks)

Upload: dacian

Post on 07-Jan-2016

24 views

Category:

Documents


0 download

DESCRIPTION

From last time: PR Methods. Feature extraction + Pattern classification Training, testing, overfitting, overtraining Minimum distance methods Discriminant Functions Linear Nonlinear (e.g, quadratic, neural networks) -> Statistical Discriminant Functions. Statistical Pattern Recognition. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: From last time: PR Methods

From last time:PR Methods

• Feature extraction + Pattern classification

• Training, testing, overfitting, overtraining

• Minimum distance methods• Discriminant Functions• Linear• Nonlinear (e.g, quadratic, neural

networks)• -> Statistical Discriminant

Functions

Page 2: From last time: PR Methods

Statistical Pattern Recognition

• Many sources of variability in speech signal

• Much more than known deterministic factors

• Powerful mathematical foundation• More general way of handling

discrimination

Page 3: From last time: PR Methods

Statistical Discrimination Methods

• Minimum error classifier and Bayes rule

• Gaussian classifiers• Discrete density estimation• Mixture Gaussians• Neural networks

Page 4: From last time: PR Methods
Page 5: From last time: PR Methods
Page 6: From last time: PR Methods
Page 7: From last time: PR Methods

we decide x is in class 2

we decide x is in class 1

Page 8: From last time: PR Methods
Page 9: From last time: PR Methods
Page 10: From last time: PR Methods
Page 11: From last time: PR Methods
Page 12: From last time: PR Methods

How to approximate a Bayes classifier• Parametric form with single pass

estimation

• Discretize, count co-occurrences

• Iterative training (mixture Gaussians, ANNs)

• Kernel estimation

Page 13: From last time: PR Methods
Page 14: From last time: PR Methods
Page 15: From last time: PR Methods
Page 16: From last time: PR Methods

Minimum distance classifiers• If Euclidean distance used,

optimum if:• Gaussian

• Equal priors

• Uncorrelated features

• Equal variance per feature

• If different variances per feature, correlated features, MD could be better

Page 17: From last time: PR Methods
Page 18: From last time: PR Methods

•Then the discriminant function can be

Di(x) = wiTx + wi0

•Where

Wi = Σi-1μi

•Andwi0 = - ½ (μi

T Σi-1μi) + log p(ωi)

•This is a linear classifier

Page 19: From last time: PR Methods

General Gaussian case

•Unconstrained covariance matrices per class

•Then the discriminant function is

Di(x) = xTWix + wiTx + wi0

•This is a quadratic classifier

•Gaussians are completely specified by 1st and 2nd order statistics

•Is this enough for general populations of data?

Page 20: From last time: PR Methods
Page 21: From last time: PR Methods

log p(x |ωi) + log p (ωi )

A statistical discriminant function

Page 22: From last time: PR Methods
Page 23: From last time: PR Methods
Page 24: From last time: PR Methods
Page 25: From last time: PR Methods
Page 26: From last time: PR Methods

P(a|b) = P(a,b)/P(b)

P(a,b) = P(a|b)P(b) = P(b|a)P(a)

Remember:

Page 27: From last time: PR Methods
Page 28: From last time: PR Methods
Page 29: From last time: PR Methods
Page 30: From last time: PR Methods
Page 31: From last time: PR Methods
Page 32: From last time: PR Methods
Page 33: From last time: PR Methods
Page 34: From last time: PR Methods
Page 35: From last time: PR Methods
Page 36: From last time: PR Methods
Page 37: From last time: PR Methods

Upcoming quiz etc.• Monday, 1st the guest talk on

“deep” neural networks

• Then the quiz. Topics: ASR basics, pattern recognition overview. Typical questions are multiple choice plus short explanation. Aimed at a 30 minute length.

• There will be one more HW, one more quiz, then all oriented towards project.