spotting culprits in epidemics: how many and which...

26
Spotting Culprits in Epidemics: How many and Which ones? B. Aditya Prakash Virginia Tech Jilles Vreeken University of Antwerp Christos Faloutsos Carnegie Mellon University IEEE ICDM Brussels December 11, 2012

Upload: others

Post on 12-Aug-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Spotting Culprits in Epidemics: How many and Which ones?

B. Aditya Prakash Virginia Tech

Jilles Vreeken University of Antwerp

Christos Faloutsos Carnegie Mellon University

IEEE ICDM Brussels December 11, 2012

Page 2: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Virus Propagation

• Susceptible-Infected (SI) Model [AJPH 2007]

CDC data: Visualization of the first 35 tuberculosis (TB) patients and their 1039 contacts

Diseases over contact networks

Prakash, Vreeken, Faloutsos 2012

Page 3: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Outline

• Motivation---Introduction • Problem Definition • Intuition • MDL • Experiments • Conclusion

Prakash, Vreeken, Faloutsos 2012

Page 4: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Culprits: Problem definition

2-d grid Q: Who started it?

Prakash, Vreeken, Faloutsos 2012

Page 5: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Culprits: Problem definition

Prakash, Vreeken, Faloutsos 2012

Prior work: [Lappas et al. 2010, Shah et al. 2011]

2-d grid Q: Who started it?

Page 6: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Outline

• Motivation---Introduction • Problem Definition • Intuition • MDL • Experiments • Conclusion

Prakash, Vreeken, Faloutsos 2012

Page 7: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Culprits: Exoneration

Prakash, Vreeken, Faloutsos 2012

Page 8: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Culprits: Exoneration

Prakash, Vreeken, Faloutsos 2012

Page 9: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Who are the culprits

• Two-part solution – use MDL for number of seeds – for a given number:

• exoneration = centrality + penalty

• Running time = – linear! (in edges and nodes)

Prakash, Vreeken, Faloutsos 2012

NetSleuth

Page 10: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Outline

• Motivation---Introduction • Problem Definition • Intuition • MDL

– Construction – Opitimization

• Experiments • Conclusion

Prakash, Vreeken, Faloutsos 2012

Page 11: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Modeling using MDL

• Minimum Description Length Principle == Induction by compression

• Related to Bayesian approaches • MDL = Model + Data • Model

– Scoring the seed-set

Number of possible |S|-sized sets En-coding integer |S|

Prakash, Vreeken, Faloutsos 2012

Page 12: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Modeling using MDL

• Data: Propagation Ripples Original

Graph Infected Snapshot

Ripple R2 Ripple R1 Prakash, Vreeken, Faloutsos 2012

Page 13: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Modeling using MDL

• Ripple cost

• Total MDL cost

How the ‘frontier’ advances How long is the ripple

Ripple R

Prakash, Vreeken, Faloutsos 2012

Page 14: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Outline

• Motivation---Introduction • Problem Definition • Intuition • MDL

– Construction – Opitimization

• Experiments • Conclusion

Prakash, Vreeken, Faloutsos 2012

Page 15: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

How to optimize the score?

• Two-step process – Given k, quickly identify high-quality set – Given these nodes, optimize the ripple R

Prakash, Vreeken, Faloutsos 2012

Page 16: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Optimizing the score

• High-quality k-seed-set – Exoneration

• Best single seed: – Smallest eigenvector of Laplacian sub-matrix – Analyze a Constrained SI epidemic

• Exonerate neighbors • Repeat

Prakash, Vreeken, Faloutsos 2012

Page 17: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Optimizing the score

• Optimizing R – Get the MLE ripple!

• Finally use MDL score to tell us the best set

• NetSleuth: Linear running time in nodes and edges

Prakash, Vreeken, Faloutsos 2012

Ripple R

Page 18: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Outline

• Motivation---Introduction • Problem Definition • Intuition • MDL • Experiments • Conclusion

Prakash, Vreeken, Faloutsos 2012

Page 19: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Experiments

• Evaluation functions: – MDL based

– Overlap based

(JD == Jaccard distance)

Closer to 1 the better

Prakash, Vreeken, Faloutsos 2012

How far are they?

Page 20: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Experiments: # of Seeds

One Seed Two Seeds

Three Seeds

Page 21: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Experiments: Quality (MDL and JD)

Prakash, Vreeken, Faloutsos 2012

Ideal = 1

One Seed Two Seeds

Three Seeds

Page 22: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Experiments: Quality (Jaccard Scores)

Prakash, Vreeken, Faloutsos 2012

Closer to diagonal, the better

True

Net

Sleu

th

One Seed Two Seeds

Three Seeds

Page 23: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Experiments: Scalability

Prakash, Vreeken, Faloutsos 2012

Page 24: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Outline

• Motivation---Introduction • Problem Definition • Intuition • MDL • Experiments • Conclusion

Prakash, Vreeken, Faloutsos 2012

Page 25: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

Conclusion • Given: Graph and Infections • Find: Best ‘Culprits’

• Two-part solution

– use MDL for number of seeds – for a given number:

exoneration = centrality + penalty

• NetSleuth: – Linear running time in nodes and edges

Prakash, Vreeken, Faloutsos 2012

Page 26: Spotting Culprits in Epidemics: How many and Which ones?eda.mmci.uni-saarland.de/pres/icdm12-culprits-pres.pdf · IEEE ICDM Brussels December 11, 2012 . Virus Propagation •Susceptible-Infected

B. Aditya Prakash http://www.cs.vt.edu/~badityap

Any Questions?

Prakash, Vreeken, Faloutsos 2012