causal inference in time series via supervised learning … · 2018-12-01 · 2015.04 – now:...

50
Copyright©2018 NTT corp. All Rights Reserved. Causal Inference in Time Series via Supervised Learning (IJCAI2018, to appear) Yoichi Chikahara, Akinori Fujino NTT Communication Science Laboratories Kyoto, Japan

Upload: others

Post on 20-May-2020

4 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Copyright©2018 NTT corp. All Rights Reserved.

Causal Inference in Time Series

via Supervised Learning

(IJCAI2018, to appear)

Yoichi Chikahara, Akinori Fujino

NTT Communication Science Laboratories

Kyoto, Japan

Page 2: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

2 Copyright©2018 NTT corp. All Rights Reserved.

A bit about myself

• Name: Yoichi Chikahara (近原 鷹一)

• Contact: [email protected]

• Education:

2013.03: B. Sc. from Keio University

2015.03: M. Info. Sci. & Tech. from University of Tokyo

DNA information analysis lab. (Miyano lab.) in Dept. of Computer Science

2015.04 – Now: Researcher @ NTT Communication Science

Laboratories

• Research:

Machine Learning, Bioinformatics / Systems Biology

Page 3: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Causal Inference

in Time Series

Page 4: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

4 Copyright©2018 NTT corp. All Rights Reserved.

Causal inference in time series

• Given time series data

• Infer causal relationships between variables

Input: Time Series Data

X

Y

X Y

cause effect

Output: Causal Relationships

Page 5: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

5 Copyright©2018 NTT corp. All Rights Reserved.

Application 1: Economics

• Finding that R&D expenditures influences

total sales is useful for companies

total sales

R&D

expenditures

cause effect

Page 6: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

6 Copyright©2018 NTT corp. All Rights Reserved.

Application 2: Bioinformatics

• Discovering gene regulatory relationships is

useful for drug discovery

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

Page 7: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

What is “causal relationship”?

How can we define causal

relationships between variables?

Page 8: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

8 Copyright©2018 NTT corp. All Rights Reserved.

A definition of temporal causality

X is the cause of Y

if the past values of X are helpful in predicting

the future values of Y

Clive W. J. Granger (1934-2009)

Granger causality [Granger1969]

Page 9: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

9 Copyright©2018 NTT corp. All Rights Reserved.

Existing approach:

Compare prediction errors with/without using values of X

If errors are significantly reduced

by using values of X,

X Y

cause effect

Y

X

Y

t

(Two) Regression

Models

Y

X

Y

Predicted Values Prediction Errors

Page 10: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

10 Copyright©2018 NTT corp. All Rights Reserved.

In Summary,

Regression

Models X Y Y

X

Page 11: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

11 Copyright©2018 NTT corp. All Rights Reserved.

Weakness: Model selection problem

X Y

Data1

Which regression model

should I use ?

X Y

X Y

Data2

Data3

VAR

Model

Gaussian

Processes

GAM

Page 12: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

12 Copyright©2018 NTT corp. All Rights Reserved.

Weakness: Model selection problem

• Problem

Selecting appropriate regression models is difficult

(needs a deep understanding of data analysis)

It is known that existing approach does not work

when regression models cannot be well fitted to data

Page 13: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

13 Copyright©2018 NTT corp. All Rights Reserved.

Our approach:

Causal inference via classification

X Y

Data1

Same

Classifier

X Y

X Y

Data2

Data3

No need to select

regression models!

Page 14: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

14 Copyright©2018 NTT corp. All Rights Reserved.

Related work [ICML15, JMLR15, CVPR17]: Causal inference from i.i.d. data via classification

• In fact, in case of i.i.d. data, there are several

existing methods based on classification

Copyright©2016 NTT corp. All Rights Reserved .

X→Y

X←Y

C , -‐‐‐ 2 05 5  0 1 .

Classifier

i.i.d. data

Page 15: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

15 Copyright©2018 NTT corp. All Rights Reserved.

Related work [ICML15, JMLR15, CVPR17]:

1) Train a classifier

Copyright©2016 NTT corp. All Rights Reserved .

X→Y

X←Y

C , -‐‐‐ 2 05 5  0 1 .

Classifier

Copyright©2016 NTT corp. All Rights Reserved .

X→Y

X←Y

C , -‐‐‐ 2 05 5  0 1 .

...

Training data

(Data where causal relationships are known)

Page 16: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

16 Copyright©2018 NTT corp. All Rights Reserved.

Copyright©2016 NTT corp. All Rights Reserved .

X→Y

X←Y

C , -‐‐‐ 2 05 5  0 1 .

Test Data

Related work [ICML15, JMLR15, CVPR17]: 2) Infer causal relationship by using trained classifier

Trained

Classifier

(Data where causal relationships

are unknown)

Page 17: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

17 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Test Data

Training Data

...

Our approach:

Causal inference from time series data via supervised learning

Page 18: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Classification approach seems good,

but how can we solve

Granger causality identification problem

via classification?

Page 19: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Classification approach seems good,

but how can we solve

Granger causality identification problem

via classification?

Page 20: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

20 Copyright©2018 NTT corp. All Rights Reserved.

• Granger causality assumes that

(Our method also uses the assumption)

X

Y X Y

At any time point t, the causal direction is the same

Revisiting assumption of Granger causality:

Causal direction never changes over time

Page 21: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

21 Copyright©2018 NTT corp. All Rights Reserved.

Revisiting definition of Granger causality

if the following holds:

X Y

cause effect

X

Y

t

at any time point t

Page 22: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

22 Copyright©2018 NTT corp. All Rights Reserved.

Revisiting definition of Granger causality

if the following holds:

X Y

cause effect

X

Y

t

Distribution of Yt+1

given past values of Y

Distribution of Yt+1

given past values of Y and X ≠

SX is useful in prediction!

Page 23: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

23 Copyright©2018 NTT corp. All Rights Reserved.

Revisiting definition of Granger causality

if

X Y

X Y

if

Page 24: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

24 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Building a classifier for Granger causality identification

If

If

If

X Y

Y X

X Y

Y X

X Y

Y X

, then assign

, then assign

, then assign

Label Assignment Rules

Page 25: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

25 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Building a classifier for Granger causality identification

If

If

If

X Y

Y X

X Y

Y X

X Y

Y X

, then assign

, then assign

, then assign

Label Assignment Rules

Test Data

Page 26: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

26 Copyright©2018 NTT corp. All Rights Reserved.

Building a classifier for Granger causality identification

If

If

If

then

then

then

Label Assignment Rules

Page 27: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Key information lies in distributions

-> To determine whether or not

the two distributions are identical,

how do we obtain feature vectors

for classification?

Page 28: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Key information lies in distributions

-> To determine whether or not

the two distributions are identical,

how do we obtain feature vectors

for classification?

Page 29: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

29 Copyright©2018 NTT corp. All Rights Reserved.

Representing features of distributions

E[Z]

to represent mean

Page 30: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

30 Copyright©2018 NTT corp. All Rights Reserved.

Representing features of distributions

E[Z]

E[Z2]

to represent

mean & variance

Page 31: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

31 Copyright©2018 NTT corp. All Rights Reserved.

Feature Space

(RKHS)

Representing features of distributions

• Kernel mean embedding: map a distribution

to a point in feature space called RKHS

When using Gaussian kernel,

Page 32: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

32 Copyright©2018 NTT corp. All Rights Reserved.

• By mapping distributions to points,

label assignment rules can be rephrased as

Feature Space

Reformulating label assignment rules

If

If

If

then

then

then

Feature Space

Page 33: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

33 Copyright©2018 NTT corp. All Rights Reserved.

Feature representation

• We only have to determine whether or not

the two points are equal over time t

• We obtain feature vectors by using the distance between the points (called maximum mean discrepancy (MMD) [Gretton+ NIPS2007]

in kernel method community)

Page 34: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

34 Copyright©2018 NTT corp. All Rights Reserved.

Feature representation

... (Since MMDs are finite sample estimates,

they cannot become exactly zero)

• By utilizing MMDs, we can obtain feature vectors

that are sufficiently different depending on Granger

causality

Page 35: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

35 Copyright©2018 NTT corp. All Rights Reserved.

Classifier (Random Forest)

Experiments

Test Data

Training Data

• linear time series from VAR model

• Nonlinear time series from VAR + sigmoid

...

Page 36: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

36 Copyright©2018 NTT corp. All Rights Reserved.

Experiment 1: Synthetic test data

• Prepare 300 pairs of bivariate time series

• Evaluate the number of time series whose causal

relationships are correctly inferred (i.e., Test Accuracy)

Linear Test Data

-- generated from VAR model

Nonlinear Test Data

-- generated from

Page 37: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

37 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Linear Test Data Nonlinear Test Data

Page 38: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

38 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Linear Test Data Nonlinear Test Data

Existing Granger causality methods

Test accuracy strongly depends on the regression model

Page 39: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

39 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Linear Test Data Nonlinear Test Data

GCKER < GCGAM

Kernel regression cannot be well fitted since time series are too short

Page 40: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

40 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Linear Test Data Nonlinear Test Data

Proposed > Existing classification approach for i.i.d. data

Our feature representation is effective

Page 41: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

41 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Real-world

Test Data

Synthetic

Training Data

...

e.g., River Runoff

X: Precipitation

Y: River runoff

(※truth: )

Experiment 2: Real-world test data

True causal directions are given in literatures

Page 42: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

42 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Our Proposed sufficiently worked better

than other methods

Page 43: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

How can we extend proposed approach

to multivariate time series?

Page 44: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

44 Copyright©2018 NTT corp. All Rights Reserved.

Granger causality definition for

multivariate time series

• Conditional Granger causality [Geweke JASA1984]:

compare two conditional distributions given past

values of the third variable Z

if X Y

X Y if

Z

Z

Page 45: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

45 Copyright©2018 NTT corp. All Rights Reserved.

Feature representation

• Similarly, we map conditional distributions to points

in feature spaces and measure the distance

• By using additional MMDs, we formulate feature

representation for multivariate time series

Page 46: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

46 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Real-world

Test Data

Synthetic

Training Data

Yeast cell cycle gene expression data [Spellman+ 1998]

14 variables (genes)

Experiment 3: Multivariate real-world data

True causal directions are given in database

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value X Y

Z ...

Page 47: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

47 Copyright©2018 NTT corp. All Rights Reserved.

Macro F1 score and micro F1 score

※Higher is better

Page 48: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

48 Copyright©2018 NTT corp. All Rights Reserved.

Macro F1 score and micro F1 score

※Higher is better

Proposed with extended feature representation

worked better

Page 49: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

49 Copyright©2018 NTT corp. All Rights Reserved.

Conclusion

• Classification approach to Granger causality identification

Requires no selection of regression models

Performs sufficiently better than existing model-based approach

Can be applied to multivariate time series

• Future work:

Addressing more complicated setting

e.g., causal direction changes over time t

Page 50: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Questions ?