malaria transmission: modeling & inferencebhadra/presentations/ab_eindhoven.pdf · of the...

29
Malaria transmission: Modeling & Inference Anindya Bhadra University of Michigan, Ann Arbor June 8, 2009 Anindya Bhadra Malaria transmission: Modeling & Inference

Upload: ngoque

Post on 14-Mar-2018

220 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Malaria transmission: Modeling & Inference

Anindya Bhadra

University of Michigan, Ann Arbor

June 8, 2009

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 2: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Overview

Malaria data and model

Inference via Iterated Filtering

Results

(Joint work with Karina Laneri, Ed Ionides and Mercedes Pascualof the University of Michigan and Menno Bouma of the LondonSchool of Hygiene and Tropical Medicine. Data provided byRamesh Dhiman and Rajpal Yadav, National Malaria ResearchInstitute, India)

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 3: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Introduction

Malaria: 300 - 500 million cases each year, resulting in nearly1 million deaths

Common in Sub-Saharan Africa, parts of Asia, central andsouth America

Caused by the Plasmodium parasite, carried by femaleAnopheles mosquitoes

Estimated economic impact USD 12 billion/year

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 4: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Our Goal

Fit and compare mechanistic models of the diseasetransmission via maximum likelihood

Analyze the role of climate covariates, especially rainfall

Treat issues of parameter identifiability

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 5: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

The Data

1990 1995 2000 2005

050

010

0015

0020

0025

0030

0035

00

Time

Case

s

Monthly clinical case data from Kutch, a district in the stateof Gujarat, western India, 1987-2006.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 6: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

The Data

2 4 6 8 10 12

01

00

20

03

00

40

05

00

Month

Ra

in

Monthly rain (mm)

2 4 6 8 10 12

15

10

50

10

05

00

MonthC

ase

s

Monthly cases (log scale)

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 7: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Challenges with malaria modeling

Malaria is a complex disease with incomplete immunity,mosquito -human interaction and an intial symptom ofnon-specific fever.

Previously developed methodology, e.g. the one for measles,have been unable to model malaria successfully.

Mechanistic inclusion of climate covariates (e.g. rainfall) havenot been successful. Lags? Integrals? Threshold effects?

MCMC and Stochastic EM are known not to work.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 8: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Partially Observed Markov Processes (POMP)

xt is an unobserved vector valued stochastic process (discreteor continuous time).

It is assumed to be Markovian.reasonable if all important dynamic processes are modeled aspart of the system.

yt is a vector of the available observations (discrete time),assumed to be conditionally independent given xt (a standardmeasurement model, which can be relaxed).

θ is a vector of unknown parameters.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 9: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Disease Transmission Model

S E I

QR

-µSE -µEI

?

µIQ

�µQR

6µRS

6µRQ

?

µIS

Figure: Flow diagram for the SEIQR model with superinfection

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 10: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Model Equations

State Model

dS

dt= µBSN − µSE S + µIS I + µRSR − δS

dE

dt= µSE S − µEI E − δE

dI

dt= µEI E − µIS I − µIQ I − δI

dQ

dt= µIQ I + µRQR − µQRQ − δQ

dR

dt= −µRSR + µQR I − µRQR − δR

Observation Model

Mn = ρ

∫ tn

tn−1

dNEI (s)

Yn|Mn ∼ Negbin(mean = Mn, var = Mn + σ2obsM2

n)

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 11: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Modeling the force of infection µSE (t)

µSE (t) =

∫ ∞0

f (s)p(t − s) ds

f (t) is the effective human-human transmission rate

p(t) is a delay distribution describing the vector survival

f (t) =I (t) + qQ(t)

N(t)exp

{ k∑i=1

βi si (t) + βtt + βcC (t)}dΓ

dt

∫ t0+∆

t0

dtdt ∼ Gamma(∆/σ2, σ2)

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 12: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Coupled system of Levy SDEs

We consider this a special case of POMP, so our generalstatistical framework applies.

Our inference method is based on numerical solution to LevySDEs which are similar to more standard Gaussian SDEs.

Theoretical possibility of an infinite jump in in a Levy processdoes not appear to be a problem in practice.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 13: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Plug-and-play inference

Statistical methods for pomps are plug-and-play if theyrequire simulation from the dynamic model but not explicitlikelihood ratios.

Bayesian plug-and-play:1. Artificial parameter evolution (Liu and West, 2001)2. Approximate Bayesian computation (“Sequential MonteCarlo without likelihoods,” Sisson et al, PNAS, 2007)

Non-Bayesian plug-and-play:3. Simulation-based prediction rules (Kendall et al, Ecology,1999)4. Maximum likelihood via iterated filtering (Ionides et al,PNAS, 2006)

Plug-and-play is a VERY USEFUL PROPERTY forinvestigating scientific models.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 14: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

The cost of plug-and-play

Approximate Bayesian methods and simulated momentmethods lead to a loss of statistical efficiency.

In contrast, iterated filtering enables (almost) exactlikelihood-based inference.

Improvements in numerical efficiency may be possible whenanalytic properties are available (at the expense ofplug-and-play). But many interesting dynamic models areanalytically intractable.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 15: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Iterated Filtering

Filtering: the conditional distribution of the unobserved statevector xn given the observations up to that time,y1, y2, . . . , yn.

Iterated filtering: an algorithm which uses a sequence ofsolutions to the filtering problem to maximize the likelihoodfunction over unknown model parameters.(Ionides, Breto & King. PNAS, 2006)

If the filter is plug-and-play (e.g. using standard sequentialMonte Carlo methods) this is inherited by iterated filtering.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 16: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Key ideas of iterated filtering

Bayesian inference for time-varying parameters becomes asolveable filtering problem. Set θ = θn to be a random walkwith

E [θn|θn−1] = θn−1

Var(θn|θn−1) = σ2

The limit σ → 0 can be used to maximize the likelihood forfixed parameters.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 17: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Theoretical basis for iterated filtering

Theorem 1. (Ionides, Breto & King, PNAS, 2006)Suppose θ0, C and y1:N are fixed and define

θFn = E [θn|y1:n]

V Pn = Var(θn|y1:n−1)

Assuming sufficient regularity conditions for a Taylor series

expansion, limσ→0∑N

n=1 (V Pn )−1

(θFn − θF

n−1) = ∇`(θ)

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 18: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Theoretical basis for iterated filtering

Theorem 2. (Ionides, Bhadra & King, (arXiv:0902.0347v1))Let θF

n,m and V Pn be the sequential Monte Carlo estimates of θF

n,m

and V Pn respectively with number of particles Jm. If τm → 0 and

τmJm →∞, then under suitable regularity assumptions

limm→∞

E

[ N∑n=1

(V Pn,m)−1(θF

n,m − θFn−1,m)

]= ∇`(θ)

limm→∞

τm2JmVar

( N∑n=1

(V Pn,m)−1(θF

n,m − θFn−1,m)

)< ∞

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 19: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Theoretical basis for iterated filtering

Theorem 3. (Ionides, Bhadra & King, (arXiv:0902.0347v1))Let τm → 0 and τmJm →∞,

∑m am =∞, am → 0 and∑

m a2mJ−1

m τ2m <∞. define a recursion by,

θm+1 = θm + am

N∑n=1

(V Pn,m)−1(θF

n,m − θFn−1,m)

Then under suitable regularity assumptions for stochasticapproximation (Kushner and Clark, 1978), limm→∞ θm = θ, theMLE, with probability 1.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 20: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Debate over the role of climate factors

(Hay et al., Nature, 2002) think the epidemic cycles aregenerated by the disease dynamics itself, without externalforcing.

(Zhou et al., PNAS, 2004, Pascual et al., Proc. Royal SocietyInterface, 2007) think external drivers such as temperatureand rainfall play a role.

Previous analyses have been based on more limited data fromEastern African highlands, which are desert fringes.

The data from India also come from a desert region and helpus answer these scientific questions.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 21: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Role of rainfall

Recall the force of infection term-

µSE (t) =

∫ ∞0

f (s)p(t − s) ds

f (t) =I (t) + qQ(t)

N(t)exp

{ k∑i=1

βi si (t) + βtt + βcC (t)}dΓ

dt

For us, C (t) = max(R(t)− 200, 0), where R(t) is theaccumulated rainfall at time t over past 6 months.

This introduces a threshold effect of rainfall, i.e. rainfall overa certain threshold is conducive to malaria transmission.

Observed correlation in the data: correlation between Rainfall(summed over May to August) and Cases (summed overSeptember to December) = 0.78

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 22: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Application of iterated filtering to the malaria model :Model comparison

model log-likelihood

SEIR -1275SEIR with rainfall -1268

SEIQR with superinfection -1274SARIMA (1, 0, 1)X (1, 0, 1)12 -1330

Table: Table of log-likelihoods of the fitted model

The full model seems to be unidentifiable. From now we dealwith just the first two.

Including rainfall properly in the model improves thelikelihood.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 23: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Simulations from the model with rainfall

1990 1995 2000 2005

010

0020

0030

0040

0050

00

mal

aria

, mon

thly

rep

orte

d ca

ses

Cases, deterministic skeleton (ODE) from fit with rainfallas a covariate, 10th and 90th percentiles of SDE model.

Simulations from the initial values for a long interval of 20years look remarkably similar to the data

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 24: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Simulations from the model without rainfall

1990 1995 2000 2005

010

0020

0030

0040

0050

00

mal

aria

, mon

thly

rep

orte

d ca

ses

Cases, deterministic skeleton (ODE) from fitted modelwithout rainfall, 10th and 90th percentiles of SDE model.

Simulations, in particular the timing of the peaks, look verydifferent from the data

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 25: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Capturing the observed correlation in the simulations

−1.0 −0.5 0.0 0.5 1.0

0.00.5

1.01.5

2.0

correlation

Dens

ity

Density plot of correlation between Rainfall (summed overMay to August) and Cases (summed over September toDecember) from the model with rainfall and without rainfall

Broken black line is the observed correlation in the data

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 26: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Duration of immunity

0.2 0.5 1.0 2.0 5.0 10.0

−127

2.0−1

271.0

−127

0.0−1

269.0

immunity(years)

loglik

The profile plot suggests an average duration of immunity of 1year.

Possibly too short to generate interannual variability

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 27: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Conclusion

Plug-and-play statistical methodology permits likelihood-basedanalysis of stochastic dynamic models. This enables inferenceon a very flexible class of models that was impossible so far.

General-purpose statistical software for partially observedMarkov processes is available in the pomp package for R (onCRAN).

This technique allows us to investigate the role of climatecovariates on malaria while taking into account intrinsicdisease dynamics.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 28: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

References

1 Ionides, E. L., Breto, C., and King, A. A. (2006). Inference fornonlinear dynamical systems. Proceedings of the National Academyof Sciences of the USA, 103 : 18438-18443.

2 Ionides, E. L. Bhadra A., and King, A. A. (2009). Iterated Filtering,arXiv:0902.0347v1.

3 Liu, J. and West, M. (2001). Combining parameter and stateestimation in simulation-based filtering. In Doucet, A., de Freitas,N., and Gordon, N. J., editors, Sequential Monte Carlo Methods inPractice, pages 197-224. Springer, New York.

4 Kendall, B. E., Briggs, C. J., Murdoch, W. W., Turchin, P., Ellner,S. P., McCauley, E., Nisbet, R. M., and Wood, S. N. (1999). Whydo populations cycle? A synthesis of statistical and mechanisticmodeling approaches. Ecology, 80 : 1789-1805.

5 Pascual, M., B. Cazelles, M.J. Bouma, L.F. Chaves, and K. Koelle.(2008). Shifting patterns: malaria dynamics and climate variabilityin an East African highland. Proc. R. Soc. London B, 275(1631):123-132.

6 Hay, S. I. et al. (2002) Nature 415, 905-909.

Anindya Bhadra Malaria transmission: Modeling & Inference

Page 29: Malaria transmission: Modeling & Inferencebhadra/presentations/ab_eindhoven.pdf · of the University of Michigan and Menno Bouma of the ... is a delay distribution describing the

Thank you!

These slides are available athttp://www.sitemaker.umich.edu/anindya

Anindya Bhadra Malaria transmission: Modeling & Inference