
Introduction to Machine Learning

Multivariate Methods

Name: 李政軒

Multivariate Data

Multiple measurements: d inputs/features/attributes (d-variate); N instances/observations/examples.

$$\mathbf{X} = \begin{bmatrix} X_1^1 & X_2^1 & \cdots & X_d^1 \\ X_1^2 & X_2^2 & \cdots & X_d^2 \\ \vdots & \vdots & \ddots & \vdots \\ X_1^N & X_2^N & \cdots & X_d^N \end{bmatrix}$$

Multivariate Parameters

Mean: $E[\mathbf{x}] = \boldsymbol{\mu} = [\mu_1, \ldots, \mu_d]^T$

Covariance: $\sigma_{ij} \equiv \operatorname{Cov}(X_i, X_j)$

Correlation: $\operatorname{Corr}(X_i, X_j) \equiv \rho_{ij} = \dfrac{\sigma_{ij}}{\sigma_i \sigma_j}$

$$\boldsymbol{\Sigma} \equiv \operatorname{Cov}(\mathbf{x}) = E\left[(\mathbf{x} - \boldsymbol{\mu})(\mathbf{x} - \boldsymbol{\mu})^T\right] = \begin{bmatrix} \sigma_1^2 & \sigma_{12} & \cdots & \sigma_{1d} \\ \sigma_{21} & \sigma_2^2 & \cdots & \sigma_{2d} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_{d1} & \sigma_{d2} & \cdots & \sigma_d^2 \end{bmatrix}$$

Parameter Estimation

Sample mean $\mathbf{m}$:

$$m_i = \frac{\sum_{t=1}^{N} x_i^t}{N}, \quad i = 1, \ldots, d$$

Covariance matrix $\mathbf{S}$:

$$s_{ij} = \frac{\sum_{t=1}^{N} (x_i^t - m_i)(x_j^t - m_j)}{N}$$

Correlation matrix $\mathbf{R}$:

$$r_{ij} = \frac{s_{ij}}{s_i s_j}$$
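As a concrete illustration, a minimal NumPy sketch of these three estimators (the random data and the names `X`, `m`, `S`, `R` are mine, not from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))   # N=100 instances, d=3 features

N = X.shape[0]
m = X.mean(axis=0)              # sample mean, m_i = sum_t x_i^t / N
D = X - m                       # centered data
S = D.T @ D / N                 # covariance, s_ij = sum_t (x_i^t - m_i)(x_j^t - m_j) / N
s = np.sqrt(np.diag(S))         # per-feature standard deviations
R = S / np.outer(s, s)          # correlation, r_ij = s_ij / (s_i * s_j)

# np.cov(X.T, bias=True) and np.corrcoef(X.T) reproduce S and R
```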

Estimation of Missing Values

What to do if certain instances have missing attributes?

Ignore those instances: not a good idea if the sample is small.

Use 'missing' as an attribute: it may give information.

Imputation: fill in the missing value.

Mean imputation: use the most likely value (e.g., the attribute mean).

Imputation by regression: predict the missing value based on the other attributes.
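A small NumPy sketch of both imputation strategies, with NaN marking a missing value (the toy data is mine):

```python
import numpy as np

X = np.array([[1.0, 2.0],
              [2.0, np.nan],   # second attribute missing
              [3.0, 6.0],
              [4.0, 8.0]])

# Mean imputation: replace a missing value with the attribute's mean
col_mean = np.nanmean(X[:, 1])
X_mean = X.copy()
X_mean[np.isnan(X_mean[:, 1]), 1] = col_mean

# Imputation by regression: predict x2 from x1 using the complete rows
ok = ~np.isnan(X[:, 1])
w1, w0 = np.polyfit(X[ok, 0], X[ok, 1], deg=1)   # least-squares line
X_reg = X.copy()
miss = np.isnan(X_reg[:, 1])
X_reg[miss, 1] = w0 + w1 * X_reg[miss, 0]
```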

Multivariate Normal Distribution

$$p(\mathbf{x}) = \frac{1}{(2\pi)^{d/2} |\boldsymbol{\Sigma}|^{1/2}} \exp\left[-\frac{1}{2}(\mathbf{x} - \boldsymbol{\mu})^T \boldsymbol{\Sigma}^{-1} (\mathbf{x} - \boldsymbol{\mu})\right]$$

$$\mathbf{x} \sim \mathcal{N}_d(\boldsymbol{\mu}, \boldsymbol{\Sigma})$$

Mahalanobis distance: $(\mathbf{x} - \boldsymbol{\mu})^T \boldsymbol{\Sigma}^{-1} (\mathbf{x} - \boldsymbol{\mu})$

It measures the distance from x to μ in terms of Σ (normalizing for differences in variances and correlations).
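For example, the squared Mahalanobis distance can be computed as follows (a sketch; the helper name `mahalanobis2` is mine):

```python
import numpy as np

def mahalanobis2(x, mu, Sigma):
    """Squared Mahalanobis distance (x - mu)^T Sigma^{-1} (x - mu)."""
    d = x - mu
    return d @ np.linalg.solve(Sigma, d)   # solve instead of an explicit inverse

mu = np.array([0.0, 0.0])
Sigma = np.array([[1.0, 0.8],
                  [0.8, 2.0]])
x = np.array([1.0, 1.0])
print(mahalanobis2(x, mu, Sigma))
```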

Bivariate: d = 2

$$\boldsymbol{\Sigma} = \begin{bmatrix} \sigma_1^2 & \rho\sigma_1\sigma_2 \\ \rho\sigma_1\sigma_2 & \sigma_2^2 \end{bmatrix}$$

Multivariate Normal Distribution

With standardized variables $z_i = (x_i - \mu_i)/\sigma_i$, the bivariate density is

$$p(x_1, x_2) = \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1-\rho^2}} \exp\left[-\frac{1}{2(1-\rho^2)}\left(z_1^2 - 2\rho z_1 z_2 + z_2^2\right)\right]$$

Bivariate Normal

The probability contour plot of the bivariate normal distribution: its center is given by the mean, and its shape and orientation depend on the covariance matrix.

Independent Inputs: Naive Bayes

If the $x_i$ are independent, the off-diagonals of $\boldsymbol{\Sigma}$ are 0, and the Mahalanobis distance reduces to a weighted (by $1/\sigma_i$) Euclidean distance:

$$\sum_{i=1}^{d}\left(\frac{x_i - \mu_i}{\sigma_i}\right)^2$$

If the variances are also equal, this reduces to the Euclidean distance.

Writing the joint density as the product of the univariate densities:

$$p(\mathbf{x}) = \prod_{i=1}^{d} p_i(x_i) = \frac{1}{(2\pi)^{d/2} \prod_{i=1}^{d} \sigma_i} \exp\left[-\frac{1}{2}\sum_{i=1}^{d}\left(\frac{x_i - \mu_i}{\sigma_i}\right)^2\right]$$
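This factorization can be checked numerically: the product of the d univariate densities matches the d-variate normal with diagonal covariance (a sketch assuming SciPy is available):

```python
import numpy as np
from scipy.stats import norm, multivariate_normal

mu = np.array([0.0, 1.0, -1.0])
sigma = np.array([1.0, 0.5, 2.0])
x = np.array([0.3, 0.8, -2.0])

# Product of the d univariate densities p_i(x_i)
p_prod = np.prod(norm.pdf(x, loc=mu, scale=sigma))

# d-variate normal with diagonal covariance
p_mvn = multivariate_normal.pdf(x, mean=mu, cov=np.diag(sigma**2))

print(p_prod, p_mvn)   # identical up to floating-point error
```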

Parametric Classification

If $p(\mathbf{x} \mid C_i) \sim \mathcal{N}_d(\boldsymbol{\mu}_i, \boldsymbol{\Sigma}_i)$:

$$p(\mathbf{x} \mid C_i) = \frac{1}{(2\pi)^{d/2} |\boldsymbol{\Sigma}_i|^{1/2}} \exp\left[-\frac{1}{2}(\mathbf{x} - \boldsymbol{\mu}_i)^T \boldsymbol{\Sigma}_i^{-1} (\mathbf{x} - \boldsymbol{\mu}_i)\right]$$

Discriminant functions:

$$g_i(\mathbf{x}) = \log p(\mathbf{x} \mid C_i) + \log P(C_i) = -\frac{d}{2}\log 2\pi - \frac{1}{2}\log|\boldsymbol{\Sigma}_i| - \frac{1}{2}(\mathbf{x} - \boldsymbol{\mu}_i)^T \boldsymbol{\Sigma}_i^{-1} (\mathbf{x} - \boldsymbol{\mu}_i) + \log P(C_i)$$

Estimation of Parameters

From a labeled sample, with $r_i^t = 1$ if $\mathbf{x}^t \in C_i$ and 0 otherwise:

$$\hat{P}(C_i) = \frac{\sum_t r_i^t}{N}$$

$$\mathbf{m}_i = \frac{\sum_t r_i^t \mathbf{x}^t}{\sum_t r_i^t}$$

$$\mathbf{S}_i = \frac{\sum_t r_i^t (\mathbf{x}^t - \mathbf{m}_i)(\mathbf{x}^t - \mathbf{m}_i)^T}{\sum_t r_i^t}$$

Substituting these estimates into the discriminant (and dropping constants):

$$g_i(\mathbf{x}) = -\frac{1}{2}\log|\mathbf{S}_i| - \frac{1}{2}(\mathbf{x} - \mathbf{m}_i)^T \mathbf{S}_i^{-1} (\mathbf{x} - \mathbf{m}_i) + \log \hat{P}(C_i)$$
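In code, these estimates amount to per-class averaging; a minimal sketch (function and variable names are mine, assuming `X` is an N×d array and `y` a vector of class labels):

```python
import numpy as np

def estimate_class_params(X, y):
    """Return priors P̂(C_i), means m_i, and covariances S_i per class."""
    classes = np.unique(y)
    N = len(y)
    priors, means, covs = {}, {}, {}
    for c in classes:
        Xc = X[y == c]                 # instances with r_i^t = 1
        priors[c] = len(Xc) / N        # P̂(C_i) = sum_t r_i^t / N
        means[c] = Xc.mean(axis=0)     # m_i
        D = Xc - means[c]
        covs[c] = D.T @ D / len(Xc)    # S_i (ML estimate, divides by N_i)
    return priors, means, covs
```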

Different Si

With a different covariance estimate $\mathbf{S}_i$ per class, expanding the discriminant gives a quadratic discriminant:

$$g_i(\mathbf{x}) = \mathbf{x}^T \mathbf{W}_i \mathbf{x} + \mathbf{w}_i^T \mathbf{x} + w_{i0}$$

where

$$\mathbf{W}_i = -\frac{1}{2}\mathbf{S}_i^{-1}$$

$$\mathbf{w}_i = \mathbf{S}_i^{-1}\mathbf{m}_i$$

$$w_{i0} = -\frac{1}{2}\mathbf{m}_i^T \mathbf{S}_i^{-1}\mathbf{m}_i - \frac{1}{2}\log|\mathbf{S}_i| + \log \hat{P}(C_i)$$
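A sketch of the same quadratic form in NumPy, reusing the per-class estimates above (helper names are mine):

```python
import numpy as np

def quadratic_discriminant(m_i, S_i, prior_i):
    """Return W_i, w_i, w_i0 for g_i(x) = x^T W_i x + w_i^T x + w_i0."""
    S_inv = np.linalg.inv(S_i)
    W = -0.5 * S_inv
    w = S_inv @ m_i
    w0 = (-0.5 * m_i @ S_inv @ m_i
          - 0.5 * np.linalg.slogdet(S_i)[1]   # log|S_i|, numerically stable
          + np.log(prior_i))
    return W, w, w0

def g(x, W, w, w0):
    return x @ W @ x + w @ x + w0   # choose the class with the largest g_i(x)
```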

[Figure: class likelihoods, the posterior for C1, and the discriminant P(C1|x) = 0.5.]

Common Covariance Matrix S

Shared common sample covariance:

$$\mathbf{S} = \sum_i \hat{P}(C_i)\,\mathbf{S}_i$$

The discriminant reduces to

$$g_i(\mathbf{x}) = -\frac{1}{2}(\mathbf{x} - \mathbf{m}_i)^T \mathbf{S}^{-1} (\mathbf{x} - \mathbf{m}_i) + \log \hat{P}(C_i)$$

which is a linear discriminant.

Expanding gives the linear form

$$g_i(\mathbf{x}) = \mathbf{w}_i^T \mathbf{x} + w_{i0}$$

where

$$\mathbf{w}_i = \mathbf{S}^{-1}\mathbf{m}_i$$

$$w_{i0} = -\frac{1}{2}\mathbf{m}_i^T \mathbf{S}^{-1}\mathbf{m}_i + \log \hat{P}(C_i)$$
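A corresponding sketch for the shared-covariance case, pooling the class covariances and forming the linear parameters (names are mine):

```python
import numpy as np

def linear_discriminant(means, covs, priors):
    """Shared S = sum_i P̂(C_i) S_i; returns (w_i, w_i0) per class."""
    classes = list(means)
    S = sum(priors[c] * covs[c] for c in classes)   # pooled covariance
    S_inv = np.linalg.inv(S)
    params = {}
    for c in classes:
        w = S_inv @ means[c]                        # w_i = S^{-1} m_i
        w0 = -0.5 * means[c] @ S_inv @ means[c] + np.log(priors[c])
        params[c] = (w, w0)
    return params

# g_i(x) = w_i^T x + w_i0; pick the class that maximizes it
```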

Common Covariance Matrix S

Covariances may be arbitrary but shared by both classes.

Diagonal S

When the $x_j$, $j = 1, \ldots, d$, are independent, $\boldsymbol{\Sigma}$ is diagonal: $p(\mathbf{x} \mid C_i) = \prod_j p(x_j \mid C_i)$ (the naive Bayes assumption).

Classify based on the weighted Euclidean distance (in $s_j$ units) to the nearest mean:

$$g_i(\mathbf{x}) = -\frac{1}{2}\sum_{j=1}^{d}\left(\frac{x_j - m_{ij}}{s_j}\right)^2 + \log \hat{P}(C_i)$$

Diagonal S

variances may be different

Nearest mean classifier: classify based on the Euclidean distance to the nearest mean.

Each mean can be considered a prototype or template, so this is template matching.
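Template matching then needs only a few lines; a sketch assuming equal priors (names mine):

```python
import numpy as np

def nearest_mean(x, means):
    """Assign x to the class whose mean is closest in Euclidean distance."""
    classes = list(means)
    dists = [np.linalg.norm(x - means[c]) for c in classes]
    return classes[int(np.argmin(dists))]
```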

Diagonal S, equal variances

$$g_i(\mathbf{x}) = -\frac{\lVert \mathbf{x} - \mathbf{m}_i \rVert^2}{2s^2} + \log \hat{P}(C_i) = -\frac{1}{2s^2}\sum_{j=1}^{d}(x_j - m_{ij})^2 + \log \hat{P}(C_i)$$

x

Diagonal S, equal variances


All classes have equal, diagonal covariance matrices of equal variances on both dimensions.

Model Selection

As we increase complexity (a less restricted S), bias decreases and variance increases. Assume simple models (allow some bias) to control variance (regularization).

Different cases of the covariance matrices fitted to the same data lead to different boundaries.

Discrete Features

Binary features: $p_{ij} \equiv p(x_j = 1 \mid C_i)$

If the $x_j$ are independent (naive Bayes'):

$$p(\mathbf{x} \mid C_i) = \prod_{j=1}^{d} p_{ij}^{x_j}\,(1 - p_{ij})^{(1 - x_j)}$$

and the discriminant is linear:

$$g_i(\mathbf{x}) = \log p(\mathbf{x} \mid C_i) + \log P(C_i) = \sum_j \left[x_j \log p_{ij} + (1 - x_j)\log(1 - p_{ij})\right] + \log P(C_i)$$

Estimated parameters:

$$\hat{p}_{ij} = \frac{\sum_t x_j^t r_i^t}{\sum_t r_i^t}$$
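A sketch of these estimates and the linear discriminant for binary features; the Laplace smoothing term `alpha` is my addition to avoid log 0, not part of the slides:

```python
import numpy as np

def fit_bernoulli_nb(X, y, alpha=1.0):
    """X is N x d binary; returns log-priors and smoothed p̂_ij per class."""
    classes = np.unique(y)
    log_prior = {c: np.log(np.mean(y == c)) for c in classes}
    p = {c: (X[y == c].sum(axis=0) + alpha) / ((y == c).sum() + 2 * alpha)
         for c in classes}
    return log_prior, p

def g(x, c, log_prior, p):
    """g_i(x) = sum_j [x_j log p_ij + (1 - x_j) log(1 - p_ij)] + log P(C_i)."""
    return np.sum(x * np.log(p[c]) + (1 - x) * np.log(1 - p[c])) + log_prior[c]
```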

Discrete Features

Multinomial (1-of-$n_j$) features: $x_j \in \{v_1, v_2, \ldots, v_{n_j}\}$

Define $z_{jk} = 1$ if $x_j = v_k$ and 0 otherwise, and let

$$p_{ijk} \equiv p(z_{jk} = 1 \mid C_i) = p(x_j = v_k \mid C_i)$$

If the $x_j$ are independent:

$$p(\mathbf{x} \mid C_i) = \prod_{j=1}^{d} \prod_{k=1}^{n_j} p_{ijk}^{z_{jk}}$$

$$g_i(\mathbf{x}) = \sum_j \sum_k z_{jk} \log p_{ijk} + \log P(C_i)$$

Estimated parameters:

$$\hat{p}_{ijk} = \frac{\sum_t z_{jk}^t r_i^t}{\sum_t r_i^t}$$

Multivariate Regression

$$r^t = g(\mathbf{x}^t \mid w_0, w_1, \ldots, w_d) + \varepsilon$$

Multivariate linear model:

$$g(\mathbf{x}^t) = w_0 + w_1 x_1^t + w_2 x_2^t + \cdots + w_d x_d^t$$

$$E(w_0, w_1, \ldots, w_d \mid \mathcal{X}) = \frac{1}{2}\sum_t \left(r^t - w_0 - w_1 x_1^t - \cdots - w_d x_d^t\right)^2$$

Multivariate polynomial model: define new higher-order variables, e.g. $z_1 = x_1$, $z_2 = x_2$, $z_3 = x_1^2$, $z_4 = x_2^2$, $z_5 = x_1 x_2$, and use the linear model in this new z-space.
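A sketch of fitting both models by least squares with NumPy (the synthetic data and names are mine):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 2))                 # N=50, d=2
r = 1.0 + 2.0 * X[:, 0] - 3.0 * X[:, 1] + 0.1 * rng.normal(size=50)

# Linear model: augment with a column of ones for w_0, solve least squares
A = np.column_stack([np.ones(len(X)), X])
w, *_ = np.linalg.lstsq(A, r, rcond=None)    # [w0, w1, w2]

# Polynomial model: z1=x1, z2=x2, z3=x1^2, z4=x2^2, z5=x1*x2
Z = np.column_stack([X[:, 0], X[:, 1],
                     X[:, 0]**2, X[:, 1]**2, X[:, 0] * X[:, 1]])
Az = np.column_stack([np.ones(len(Z)), Z])
wz, *_ = np.linalg.lstsq(Az, r, rcond=None)  # linear fit in z-space
```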
