cmu scs 15-826(c) c. faloutsos and j-y pan (2013)#1 15-826: multimedia databases and data mining...

42
CMU SCS 15-826 (c) C. Faloutsos and J-Y Pan (2013) #1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu Pan and Christos Faloutsos

Upload: rosemary-whicker

Post on 14-Jan-2016

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #1

15-826: Multimedia Databasesand Data Mining

Lecture #21:

Independent Component Analysis (ICA)

Jia-Yu Pan and Christos Faloutsos

Page 2: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #2

Must-read Material

• AutoSplit: Fast and Scalable Discovery of Hidden Variables in Stream and Multimedia Databases, Jia-Yu Pan, Hiroyuki Kitagawa, Christos Faloutsos and Masafumi Hamamoto, PAKDD 2004, Sydney, Australia

Page 3: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #3

Outline

• Motivation

• Formulation

• PCA and ICA

• Example applications

• Conclusion

Page 4: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #4

Motivation: (Q1) Find patterns in data

• Motion capture data: broad jumps

Left Knee

Right KneeEnergy exerted

Take-offLanding

Energy exerted

Page 5: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #5

Motivation: (Q1) Find patterns in data

• Human would say– Pattern 1: along

diagonal– Pattern 2: along

vertical axis

• How to find these automatically?

Each point is the measurement at a time tick (total 550 points).

Take-off

LandingR:L=60:1

R:L=1:1

Page 6: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #6

Motivation: (Q2) Find hidden variables

Hidden variables (=‘topics’ = concepts)

“Internet bubble”

Alcoa

American Express

Boeing

Citi Group

“General trend”

Stock prices

Page 7: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #7

Motivation:(Q2) Find hidden variables

“General trend”

“Internet bubble”

0.940.030.63

0.64

Caterpillar

Intel

Hidden variables

Page 8: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #8

Motivation:(Q2) Find hidden variables

Hidden variable 1 Hidden variable 2

B1,CAT

B1,INTC B2,CAT

B2,INTC?

? ?

Caterpillar

Intel

Page 9: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #9

Motivation:Find hidden variables

• There are two sound sources in a cocktail party…

=“blind source separation”(= we don’t know the sources, nor their mixing)

Page 10: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #10

Outline

• Motivation

• Formulation

• PCA and ICA

• Example applications

• Conclusion

Page 11: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #11

Formulation: Finding patterns

Given n data points, each with m attributes.

Find patterns that describe data properties the best.

Page 12: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #12

Linear representation

• Find vectors that describe the data set the best.

• Each point: linear combination of the vectors (patterns):

22,1, bbx 1i

ii hh

Page 13: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #13

Patterns as data “vocabulary”

Good pattern ≈ sparse coding

(Q) Given data xi’s, compute hi,j’s and bi’s that are “sparse”?

Only b1 is needed to describe xi.

Page 14: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #14

Patterns in motion capture data

b2

b1

n=550 ticks

2

2,1,

2,21,2

2,11,1

nxX

nn xx

xx

xx

Data matrix

Left Right

222

2

2,1,

2,21,2

2,11,1

xnx

1

BH

b

b

nn hh

hh

hh

Basis vectors

Hidden variables

? ?

Sparse ~ non-Gaussian ~ “Independent”

“Independent”: e.g., minimize mutual information.

Page 15: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #15

Patterns in motion capture data

2

2,1,

2,21,2

2,11,1

nxX

nn xx

xx

xx

Data matrix

222

2

2,1,

2,21,2

2,11,1

xnx

1

BH

b

b

nn hh

hh

hh

Basis vectors

Hidden variables

? ?

X = (U ) VT

Sparse ~ non-Gaussian ~ “Independent”

Page 16: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #16

Outline

• Motivation

• Formulation

• PCA and ICA

• Example applications– Find topics in documents– Hidden variables in stock prices

• Conclusion

Page 17: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #17

Pattern discovery with ICA: AutoSplit [PAKDD 04][WIRI 05]

Step 1: Data points (matrix)

Step 2:Compute patterns

Step 3:Interpret patterns

or

or

Data mining(Case studies)

(Q) What pattern?

(Q) Different modalities

(Q) How?

Video frames

Stock prices

Text documents

Page 18: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #18

Finding patterns in high-dimensional data

PCA finds the hyperplane. ICA finds the correct patterns.

Dimensionality reduction

details

Page 19: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #19

Outline• Motivation• Formulation• PCA and ICA• Example applications

– Find topics in documents– Hidden variables in stock prices– Visual vocabulary for retinal images

• Conclusion

Page 20: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #20

Topic discovery on text streams

• Data: CNN headline news (Jan.-Jun. 1998)

• Documents of 10 topics in one single text stream – Documents are sorted by date/time

– Subsequent documents may have different topics

Date/TimeTopic 1 Topic 3 Topic 1…

Page 21: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #21

Topic discovery on text streams

• Data: CNN headline news (Jan.-Jun. 1998)

• Documents of 10 topics in one single text stream – FIND: the document boundaries– AND: the terms of each topic

Date/Time

??

Page 22: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #22

Topic discovery on text streams

• Known: number of topics = 10

• Unknown: (1) topic of each document (2) topic description

Date/TimeTopic 1 Topic 3 Topic 1…

Page 23: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #23

Topic discovery in documents

X[nxm]

xi = [1, 5, …, 0]

Step 1

Windowing

Step 2

X[nxm’] = H[nxm’] B[m’xm’]

(1) Find hyperplane (m’=10)

(2) Find patterns

aaronzoo

m=3887 (dictionary size)

(n=1659)New stories

(30 words)

Step 3

b’i = [0, 0.7, …, 0.6]aaron

zooanim

al

B’[10x3887]

'

'

'

10

2

1

b

b

b

(Q) What does b’i mean?

Page 24: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #24

Step 3: Interpret the patterns

b’i = [0, 0.7, …, 0.6]

aaronzooanim

al

Topicsfound

A hidden topic!

Top words : “animal”, “zoo”, …

ID Sorted word list

A Mckinne Sergeant sexual Major Armi

B bomb Rudolph Clinic Atlanta Birmingham

C Winfrei Beef Texa Oprah Cattl

D Viagra Drug Impot Pill Doctor

E Zamora Graham Kill Former Jone

F Medal Olymp Gold Women Game

G Pope Cube Castro Cuban Visit

H Asia Economi Japan Econom Asian

I Super Bowl Game Team Re

J Peopl Tornado Florida Re bomb

General idea: related to the data attributes

'

'

'

10

2

1

b

b

b

m=3887 (dictionary size)

Page 25: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #25

Step 3: Evaluate the patternsID True Topic

1 Sgt. Gene Mckinney is on trial for alleged sexual misconduct

2 A bomb explodes in a Birmingham, AL abortion clinic

3 The Cattle Industry in Texas sues Oprah Winfrey for defaming beef

4 New impotency drug Viagra is approved for use

5 Diane Zamora is convicted of helping to murder her lover’s girlfriend

6 1998 Winter Olympic games

7 The Pope’s historic visit to Cube in Winter 1998

8 The economic crisis in Asia

9 Superbowl XXXII

10 Tornado in Florida

AutoSplit finds correct topics.

ID Sorted word list

A mckinne sergeant sexual major armi

B bomb rudolph clinic atlanta birmingham

C winfrei beef texa oprah cattl

D viagra drug Impot pill doctor

E zamora graham kill former jone

Page 26: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #26

Step 3: Evaluate the patternsID AutoSplit

A mckinne sergeant sexual major armi

B bomb rudolph clinic atlanta birmingham

C winfrei beef texa oprah cattl

D viagra drug Impot pill doctor

E zamora graham kill former jone

AutoSplit’s topics are better than PCA.

ID PCA

A’ mckinne bomb women sexual sergeant

B’ bomb mckinne rudolph clinic atlanta

C’ winfrei viagra texa beef oprah

D’ viagra winfrei drug texa beef

E’ zamora viagra winfrei graham olymp

ID PCA

A’ mckinne bomb women sexual sergeant

B’ bomb mckinne rudolph clinic atlanta

C’ winfrei viagra texa beef oprah

D’ viagra winfrei drug texa beef

E’ zamora viagra winfrei graham olymp

Page 27: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #27

Step 3: Evaluate the patternsAutoSplit

A

B

C

D

E

AutoSplit’s topics are better than PCA.

PCA

A’

B’

C’

D’

E’PCA vectors mix the topics.

Topic 1

Topic 2

Page 28: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #28

Outline

• Motivation

• Formulation

• PCA and ICA

• Example applications– Find topics in documents– Hidden variables in stock prices

• Conclusion

Page 29: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #29

Find hidden variables (DJIA stocks)

• Weekly DJIA closing prices– 01/02/1990-08/05/2002, n=660 data points– A data point: prices of 29 companies at the time

Alcoa

American Express

Boeing

Caterpillar

Citi Group

Page 30: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #30

Formulation: Find hidden variables

AA1, …, XOM1

AAn, …, XOMn

H11, H12, …, H1m

Hn1, Hn2, …, Hnm

=

B11, B12, …, B1m

Bm1, Bm2, …, Bmm ? ?

Date Hidden variable

Date

Page 31: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #31

Characterize hidden variable by the companies it influences

“General trend” “Internet bubble”

0.940.63 0.03

0.64

Caterpillar

Intel

B1,CAT

B1,INTC B2,CAT

B2,INTC

Page 32: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #32

Companies related to hidden variable 1

B1,j

Highest Lowest

Caterpillar 0.938512 AT&T 0.021885

Boeing 0.911120 WalMart 0.624570

MMM 0.906542 Intel 0.638010

Coca Cola 0.903858 Home Depot 0.647774

Du Pont 0.900317 Hewlett-Packard 0.658768

“General trend”

Page 33: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #33

Companies related to hidden variable 1

B1,j

Highest Lowest

Caterpillar 0.938512 AT&T 0.021885

Boeing 0.911120 WalMart 0.624570

MMM 0.906542 Intel 0.638010

Coca Cola 0.903858 Home Depot 0.647774

Du Pont 0.900317 Hewlett-Packard 0.658768

All companies are affected by the “general trend” variable (with weights 0.6~0.9), except AT&T.

Page 34: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #34

General trend (and outlier)

“General trend”

AT&T

United Technologies

Walmart

Exxon Mobil

Page 35: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #35

Companies related to hidden variable 2

B2,j

Highest Lowest

Intel 0.641102 Philip Morris -0.194843

Hewlett-Packard 0.621159 International Paper -0.089569

GE 0.509164 Caterpillar 0.031678

American Express 0.504871 Procter and Gamble 0.109576

Disney 0.490529 Du Pont 0.133337

2000-2001 “Internet bubble”

Tech company

Page 36: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #36

Companies related to hidden variable 2

B2,j

Highest Lowest

Intel 0.641102 Philip Morris -0.194843

Hewlett-Packard 0.621159 International Paper -0.089569

GE 0.509164 Caterpillar 0.031678

American Express 0.504871 Procter and Gamble 0.109576

Disney 0.490529 Du Pont 0.133337

Companies affected by the “internet bubble” variable (with weights 0.5~0.6) are tech-related.Other companies are un-related (weights < 0.15).

Tech company

Page 37: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #37

Outline• Motivation• Formulation• PCA and ICA• Example applications

– Find topics in documents– Hidden variables in stock prices– Visual vocabulary for retinal images

• Conclusion

Page 38: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #38

Conclusion• ICA: more flexible than PCA in finding

patterns.

• Many applications– Find topics and “vocabulary” for images– Find hidden variables in time series (e.g., stock

prices)– Blind source separation

ICA

PCA

Page 39: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #39

Citation

• AutoSplit: Fast and Scalable Discovery of Hidden Variables in Stream and Multimedia Databases, Jia-Yu Pan, Hiroyuki Kitagawa, Christos Faloutsos and Masafumi Hamamoto

PAKDD 2004, Sydney, Australia

Page 40: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #40

References

• Jia-Yu Pan, Andre Guilherme Ribeiro Balan, Eric P. Xing, Agma Juci Machado Traina, and Christos Faloutsos. Automatic Mining of Fruit Fly Embryo Images. In Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2006.

• Arnab Bhattacharya, Vebjorn Ljosa, Jia-Yu Pan, Mark R. Verardo, Hyungjeong Yang, Christos Faloutsos, and Ambuj K. Singh. ViVo: Visual Vocabulary Construction for Mining Biomedical Images. In Proceedings of the Fifth IEEE International Conference on Data Mining (ICDM), 2005.

• Masafumi Hamamoto, Hiroyuki Kitagawa, Jia-Yu Pan, and Christos Faloutsos. A Comparative Study of Feature Vector-Based Topic Detection Schemes for Text Streams. In Proceedings of International Workshop on Challenges in Web Information Retrieval and Integration (WIRI), 2005, pp.125-130.

• Jia-Yu Pan, Hiroyuki Kitagawa, Christos Faloutsos, and Masafumi Hamamoto. AutoSplit: Fast and Scalable Discovery of Hidden Variables in Stream and Multimedia Databases. In Proceedings of the The Eighth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2004.

Page 41: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #41

References

• Aapo Hyvärinen, Juha Karhunen, Erkki Oja: Independent Component Analysis, John Wiley & Sons, 2001

Page 42: CMU SCS 15-826(c) C. Faloutsos and J-Y Pan (2013)#1 15-826: Multimedia Databases and Data Mining Lecture #21: Independent Component Analysis (ICA) Jia-Yu

CMU SCS

15-826 (c) C. Faloutsos and J-Y Pan (2013) #42

Software

• Open source software: ‘fastICA’ http://research.ics.tkk.fi/ica/fastica/

• Or ‘autosplit’:

www.cs.cmu.edu/~jypan/software/autosplit_cmu.tar.gz