conquering the complexity of time: machine learning for big...

30
Conquering the Complexity of Time: Machine Learning for Big Time Series Data Yan Liu Computer Science Department University of Southern California Mini-Workshop on Theoretical Foundations of Cyber-Physical Systems April 14, 2017 Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 1 / 25

Upload: others

Post on 07-Aug-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Conquering the Complexity of Time:Machine Learning for Big Time Series Data

Yan Liu

Computer Science DepartmentUniversity of Southern California

Mini-Workshop on Theoretical Foundations of Cyber-Physical Systems

April 14, 2017

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 1 / 25

Page 2: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

A human being is a part of a whole, called by us“universe”, a part limited in time and space.

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 2 / 25

Page 3: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Research Synopsis: Machine Learning Models

Developing scalable and effective solutions by leveraging recent progressesacross disciplines

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 3 / 25

Page 4: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Research Synopsis: Applications

• Doctor AI - Healthcare Analytics

• Social Network Analysis

He-et-al. HawkesTopic ICML-2015

Result:-ArXiv

16/17

• Computational sustainability

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 4 / 25

Page 5: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Google Alpha Go Beats Human

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 5 / 25

Page 6: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

After AlphaGo, what’s next for AI?

Googles DeepMind AI group unveils health care ambitions

http://www.theverge.com/2016/3/14/11219258/google-deepmind-alphago-go-challenge-ai-future

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 6 / 25

Page 7: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Properties of Health Care Data

How are health care data different from the data from existing applicationsof deep learning?

• Privacy, privacy!

• Heterogeneity

• Lots lots of missing data

• Big small data

• Worst of all: doctors do notbelieve anything they cannotunderstand no matter how cooland how deep they are!!

Example 1:

Example 2:

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 7 / 25

Page 8: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Recurrent Neural Networks for Time Series Data

Our Contributions [KDD 2015, AMIA 2015, 2016, arXiv 2016, ICLR 2017]

• Multi-modal Deep Neural Networks

• Handling Missing Data in DNN

• Big Small Data Solution via DNN

• Interpretation of DNN

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 8 / 25

Page 9: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Explainable Artificial Intelligence: Mimic learningFramework

Deep

Learning

ModelX

y

𝑿𝒏𝒏

𝒚𝒏𝒏Mimic

ModelX 𝒚𝒎Input

Target

Output

𝒚𝒏𝒏

Main idea:

• Use Gradient Boosting Trees (GBT) to mimic the performance of deeplearning models

Benefits:

• Good performance from complex deep networks – Mimic model keeps it

• Easy overfitting in original decision tree methods – Mimic model avoids it

• Interpretations are hard to get in original models – Mimic model provides it

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 9 / 25

Page 10: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Experiment Result

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 10 / 25

Page 11: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Experiment Result

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 11 / 25

Page 12: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Experiment Result

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 12 / 25

Page 13: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Research Synopsis: Applications

• Doctor AI - Healthcare Analytics

• Social Network Analysis

He-et-al. HawkesTopic ICML-2015

Result:-ArXiv

16/17

• Computational sustainability

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 13 / 25

Page 14: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Diffusion Analysis

• Adoption of innovation: new treatment, new technology

• Marketing: word of mouth effect, viral marketing

• Public opinion surveillance: detection of rumors

• Media analysis: modeling news dynamics

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 14 / 25

Page 15: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Network Inference

Network Inference: inferring the latent diffusion network from observedcascades

Our contribution [ICML 2015, NIPS 2016, WSDM submission]

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 15 / 25

Page 16: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Hawkes-Topic Models: Joint Inference of DiffusionNetworks and Topics

1 Generate all the events and the event times via the MultivariateHawkes Process

2 For each topic k: draw βk ∼ Dir(α).3 For each event e of node v:

1 If e is a spontaneous event: ηe ∼ N(αv, σ2I). Otherwise

ηe ∼ N(ηparent[e], σ2I).

2 For each word n:ze,n ∼ Discrete(π(ηe)), we,n ∼ Discrete(βze,n).

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 16 / 25

Page 17: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Experiment Results: EventRegistry

He-et-al. HawkesTopic ICML-2015

Result:-EventRegistry

Hawkes Hawkes?LDA Hawkes?CTM HTM

Component"1 0.622 0.669 0.673 0.697Component"2 0.670 0.704 0.716 0.730Component"3 0.666 0.665 0.669 0.700

LDA CTM HTM

Component"1 P42945 P42458 ?42325Component"2 P22558 P22181 P22164

Component"3 P17574 P17574 P17571

Network&Inference&accuracy:&10% improvement&

Topic&modeling&accuracy:&

13/17

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 17 / 25

Page 18: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Experiment Results: EventRegistry

He-et-al. HawkesTopic ICML-2015

Result:-EventRegistry

14/17Early bird report agency : sunherald.com, miamiherald.com and in.reuters.comNews gathering and re-distribution agency : www.reuters.com

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 18 / 25

Page 19: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Research Synopsis: Applications

• Doctor AI - Healthcare Analytics

• Social Network Analysis

He-et-al. HawkesTopic ICML-2015

Result:-ArXiv

16/17

• Computational sustainability

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 19 / 25

Page 20: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Spatiotemporal Data Analysis [NIPS 2104 Spotlight Presentation,

ICML 2015, ICML 2016]

Two key principles in designing spatial-temporal models

Local smoothness: features in thesame neighborhood share similarvalue

Global latent structure: the datalie on a lower dimensional latentstructure

0 50 1000

50

100

150

200

250

300

350

Sin

gula

r va

lues

Location

0 50 100 1500

50

100

150

200

250

300

350Time

0 5 10 150

50

100

150

200

250

300Variable

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 20 / 25

Page 21: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Spatiotemporal Data Analysis [NIPS 2104 Spotlight Presentation,

ICML 2015, ICML 2016]

Two key principles in designing spatial-temporal models

Local smoothness: features in thesame neighborhood share similarvalue

Global latent structure: the datalie on a lower dimensional latentstructure

0 50 1000

50

100

150

200

250

300

350

Sin

gula

r va

lues

Location

0 50 100 1500

50

100

150

200

250

300

350Time

0 5 10 150

50

100

150

200

250

300Variable

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 20 / 25

Page 22: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Tensor Representation for Spatial Temporal Data

Spatial temporal data can naturally be represented as tensors

Time

Sp

ati

al L

oc

ati

on

s

Simple solutions based on tensor formulation:

• Local smoothness can be achieved by Laplacian regularizer

• Global latent structures can be achieved by low-rank constraint

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 21 / 25

Page 23: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Tensor Representation for Spatial Temporal Data

Spatial temporal data can naturally be represented as tensors

Time

Sp

ati

al L

oc

ati

on

s

Simple solutions based on tensor formulation:

• Local smoothness can be achieved by Laplacian regularizer

• Global latent structures can be achieved by low-rank constraint

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 21 / 25

Page 24: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Application I: Cokriging

Task description Jointly predicting the value of multiple features atunknown locations by taking advantage of the observations from knownlocations.

Given the complete data X ∈ RP×T×M and partial observations at asubset of locations indexed by Ω ⊂ 1, . . . , P, we need to estimateW ∈ RP×T×M so that WΩ = XΩ.

Our formulation

W = argminW

Loss term︷ ︸︸ ︷‖WΩ −XΩ‖2F +

Local consistency︷ ︸︸ ︷µ

M∑m=1

tr(W>:,:,mLW:,:,m)

subject to rank(W) ≤ ρ︸ ︷︷ ︸

Global consistency

.

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 22 / 25

Page 25: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Application I: Cokriging

Task description Jointly predicting the value of multiple features atunknown locations by taking advantage of the observations from knownlocations.Given the complete data X ∈ RP×T×M and partial observations at asubset of locations indexed by Ω ⊂ 1, . . . , P, we need to estimateW ∈ RP×T×M so that WΩ = XΩ.

Our formulation

W = argminW

Loss term︷ ︸︸ ︷‖WΩ −XΩ‖2F +

Local consistency︷ ︸︸ ︷µ

M∑m=1

tr(W>:,:,mLW:,:,m)

subject to rank(W) ≤ ρ︸ ︷︷ ︸

Global consistency

.

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 22 / 25

Page 26: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Application I: Cokriging

Task description Jointly predicting the value of multiple features atunknown locations by taking advantage of the observations from knownlocations.Given the complete data X ∈ RP×T×M and partial observations at asubset of locations indexed by Ω ⊂ 1, . . . , P, we need to estimateW ∈ RP×T×M so that WΩ = XΩ.

Our formulation

W = argminW

Loss term︷ ︸︸ ︷‖WΩ −XΩ‖2F +

Local consistency︷ ︸︸ ︷µ

M∑m=1

tr(W>:,:,mLW:,:,m)

subject to rank(W) ≤ ρ︸ ︷︷ ︸

Global consistency

.

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 22 / 25

Page 27: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Experiments on Climate Datasets

CokrigingDataset Tensor-F Tensor-O ADMM Simple MTGP

USHCN 0.7594 0.7210 0.8051 0.8760 1.0007CCDS 0.5555 0.4532 0.8292 0.7634 1.0296

ForecastingDataset Tensor-F Tensor-O Tucker ADMM OrthoNL Trace MTLl1 MTLdirty

USHCN 0.9171 0.9069 0.8975 0.9227 0.9175 0.9273 0.9528 0.9735CCDS 0.8810 0.8325 0.9438 0.8448 0.8555 0.8632 0.9105 1.0950

Map of most predictive regionsanalyzed by the greedy algorithmusing CCDS dataset.

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 23 / 25

Page 28: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Experiments on Climate Datasets

CokrigingDataset Tensor-F Tensor-O ADMM Simple MTGP

USHCN 0.7594 0.7210 0.8051 0.8760 1.0007CCDS 0.5555 0.4532 0.8292 0.7634 1.0296

ForecastingDataset Tensor-F Tensor-O Tucker ADMM OrthoNL Trace MTLl1 MTLdirty

USHCN 0.9171 0.9069 0.8975 0.9227 0.9175 0.9273 0.9528 0.9735CCDS 0.8810 0.8325 0.9438 0.8448 0.8555 0.8632 0.9105 1.0950

Map of most predictive regionsanalyzed by the greedy algorithmusing CCDS dataset.

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 23 / 25

Page 29: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Experiments on Scalability

Running time (in seconds) for cokriging and forecasting

Cokriging Forecasting

Dataset USHCN CCDS FSQ USHCN CCDS FSQ

Tensor-O 93.03 16.98 91.51 75.47 21.38 37.70ADMM 791.25 320.77 720.40 235.73 45.62 33.83MTGP ∞

Other Development

• Online prediction (ICML 2015)

• Memory efficient prediction (ICML 2016)

• Connections to Gaussian Processes (Upcoming)

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 24 / 25

Page 30: Conquering the Complexity of Time: Machine Learning for Big …cci.usc.edu/wp-content/uploads/2017/06/ml-retreat.pdf · 2017-06-23 · Conquering the Complexity of Time: Machine Learning

Thank you!

For more information: USC Melady Grouphttp://www-bcf.usc.edu/~liu32/melady.html

Yan Liu (USC) Mining from Large-scale Time Series Data April 14, 2017 25 / 25