machine learning for healthcare - github pages · machine learning for healthcare hst.956, 6.s897...

47
Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Upload: others

Post on 22-May-2020

29 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

MachineLearningforHealthcareHST.956,6.S897

Lecture14:CausalInferencePart1

DavidSontag

Page 2: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Courseannouncements• Pleasefilloutmid-semestersurvey• Projectproposals

– Youwillreceivee-mail feedback thisweek– OfficehoursnextTuesday,10-11:30am

• Problemsets– PS1-4graded(seeStellar)– PS5outtonight,duenextTuesday,April9– Lastproblem set,PS6,released in~2weeks

• Recitationthisweekwillbeadiscussionof– Bratetal.,Postsurgicalprescriptions foropioidnaïvepatients

andassociationwithoverdoseandmisuse,BMJ2018– Bertsimasetal.,Personalized diabetesmanagement using

electronicmedical records,DiabetesCare2017

Page 3: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Doesgastricbypasssurgerypreventonsetofdiabetes?

• InLecture4&PS2weusedmachinelearningforearlydetectionofType2diabetes

• Healthsystemdoesn’twanttoknowhowtopredictdiabetes– theywanttoknowhowtopreventit

• Gastricbypasssurgeryisthehighest negativeweight(9thmostpredictivefeature)– Doesthismeanitwouldbeagoodintervention?

1994 2000

<4.5%4.5%–5.9%6.0%–7.4%7.5%–8.9%>9.0%

2013

Page 4: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

• Suchpredictivemodelswidelyusedtostagepatients.Shouldweinitiatetreatment?Howaggressive?

• Whatcouldgowrongifwetrainedtopredictsurvival,andthenusedtoguidepatientcare?

Mammography(86Ksubjects)

Competitive Period Launch: Nov 18, 2016Competitive Period Close: May 9, 2017

Outof1000womenscreened,only5willhavebreastcancer

Goal:developalgorithmsforriskstratificationofscreeningmammogramsthatcanbeusedtoimprovebreastcancerdetection

Whatisthelikelihoodthispatient,withbreastcancer,willsurvive5years?

𝑿𝒀

Diagnosis Death Time

“Mary”

Treatment

Alongsurvivaltimemaybebecauseoftreatment!

Page 5: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

• Peopleresponddifferentlytotreatment• Goal:usedatafromotherpatientsandtheirjourneystoguidefuturetreatmentdecisions

• Whatcouldgowrongifwetrainedtopredict(past)treatmentdecisions?

Whattreatmentshouldwegivethispatient?Expansion Pathology

with DNA-FISH and Protein-IF

Blue =HER2ProteinRed =HER2AmpliconGreen =Centromeric probe

NegativeforHER2Amplification HER2Amplified

Expansionpathology(imagefromAndyBeck)

“David” TreatmentA

TreatmentA“Juana”“John” TreatmentB

Bestthiscandoismatchcurrentmedicalpractice!

Page 6: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

• Doingarandomizedcontroltrialisunethical• CouldwesimplyanswerthisquestionbycomparingPr(lungcancer|smoker)vsPr(lungcancer|nonsmoker)?

• No!Answeringsuchquestionsfromobservationaldataisdifficultbecauseofconfounding

Doessmokingcauselungcancer?

Page 7: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Toproperlyanswer,needtoformulateascausal questions:

Intervention, 𝑇

(e.g. medication, procedure)

Outcome, 𝑌

Patient, 𝑋

(including allconfoundingfactors)

?

Highdimensional Observationaldata

Page 8: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

PotentialOutcomesFramework(Rubin-Neyman CausalModel)

• Eachunit(individual)𝑥' hastwopotentialoutcomes:– 𝑌((𝑥') isthepotentialoutcomehadtheunitnotbeentreated:“controloutcome”

– 𝑌+(𝑥') isthepotentialoutcomehadtheunitbeentreated:“treatedoutcome”

• Conditionalaveragetreatmenteffectforunit𝑖:𝐶𝐴𝑇𝐸 𝑥' = 𝔼23~5(23|78)[𝑌+|𝑥'] − 𝔼2=~5(2=|78)[𝑌(|𝑥']

• AverageTreatmentEffect:𝐴𝑇𝐸:= 𝔼 𝑌+ − 𝑌( = 𝔼7~5(7) 𝐶𝐴𝑇𝐸 𝑥

Page 9: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

PotentialOutcomesFramework(Rubin-Neyman CausalModel)

• Eachunit(individual)𝑥' hastwopotentialoutcomes:– 𝑌((𝑥') isthepotentialoutcomehadtheunitnotbeentreated:“controloutcome”

– 𝑌+(𝑥') isthepotentialoutcomehadtheunitbeentreated:“treatedoutcome”

• Observedfactualoutcome:𝑦' = 𝑡'𝑌+ 𝑥' + 1 − 𝑡' 𝑌((𝑥')

• Unobservedcounterfactualoutcome:𝑦'CD = (1 − 𝑡')𝑌+ 𝑥' + 𝑡'𝑌((𝑥')

Page 10: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Thefundamental problemofcausal inference“Thefundamental problemof

causal inference”

Weonlyeverobserveoneofthetwooutcomes

Page 11: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Treated

𝑥 = 𝑎𝑔𝑒

𝑦 =𝑏𝑙𝑜𝑜𝑑_𝑝𝑟𝑒𝑠.

𝑌+ 𝑥

𝑌( 𝑥

Example– Bloodpressureandage

Page 12: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Treated

𝑥 = 𝑎𝑔𝑒

𝑦 =𝑏𝑙𝑜𝑜𝑑_𝑝𝑟𝑒𝑠.

𝑌+ 𝑥

𝑌( 𝑥

Bloodpressureandage

𝐶𝐴𝑇𝐸(𝑥)

Page 13: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Treated

𝑥 = 𝑎𝑔𝑒

𝑦 =𝑏𝑙𝑜𝑜𝑑_𝑝𝑟𝑒𝑠.

𝑌+ 𝑥

𝑌( 𝑥

Bloodpressureandage

𝐴𝑇𝐸

Page 14: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Treated

𝑥 = 𝑎𝑔𝑒

𝑦 =𝑏𝑙𝑜𝑜𝑑_𝑝𝑟𝑒𝑠.

𝑌+ 𝑥

𝑌( 𝑥

Bloodpressureandage

Treated

Control

Page 15: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Treated

𝑥 = 𝑎𝑔𝑒

𝑦 =𝑏𝑙𝑜𝑜𝑑_𝑝𝑟𝑒𝑠.

𝑌+ 𝑥

𝑌( 𝑥

Bloodpressureandage

Treated

Control

Counterfactualtreated

Counterfactualcontrol

Page 16: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

(age,gender,exercise,treatment)

Sugarlevelshadtheyreceived

medicationA

Sugarlevelshadtheyreceived

medicationB

Observedsugarlevels

(45,F,0,A) 6 5.5 6(45,F,1,B) 7 6.5 6.5(55,M,0,A) 7 6 7(55,M,1,B) 9 8 8(65,F,0,B) 8.5 8 8(65,F,1,A) 7.5 7 7.5(75,M,0,B) 10 9 9(75,M,1,A) 8 7 8

(ExamplefromUriShalit)

Page 17: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

(age,gender,exercise)

Sugarlevelshadtheyreceived

medicationA

Sugarlevelshadtheyreceived

medicationB

Observedsugarlevels

(45,F,0) 6 5.5 6(45,F,1) 7 6.5 6.5(55,M,0) 7 6 7(55,M,1) 9 8 8(65,F,0) 8.5 8 8(65,F,1) 7.5 7 7.5(75,M,0) 10 9 9(75,M,1) 8 7 8

(ExamplefromUriShalit)

Page 18: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

(age,gender,exercise)

Y0:Sugarlevelshadtheyreceived

medicationA

Y1:Sugarlevelshadtheyreceived

medicationB

Observedsugarlevels

(45,F,0) 6 5.5 6(45,F,1) 7 6.5 6.5(55,M,0) 7 6 7(55,M,1) 9 8 8(65,F,0) 8.5 8 8(65,F,1) 7.5 7 7.5(75,M,0) 10 9 9(75,M,1) 8 7 8

(ExamplefromUriShalit)

Page 19: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

(age,gender,exercise)

Sugarlevelshadtheyreceived

medicationA

Sugarlevelshadtheyreceived

medicationB

Observedsugarlevels

(45,F,0) 6 5.5 6

(45,F,1) 7 6.5 6.5

(55,M,0) 7 6 7

(55,M,1) 9 8 8

(65,F,0) 8.5 8 8

(65,F,1) 7.5 7 7.5

(75,M,0) 10 9 9

(75,M,1) 8 7 8

mean(sugar|medication B)–mean(sugar|medicaton A)=?

mean(sugar|had theyreceived B)–mean(sugar|had theyreceived A)=?

(ExamplefromUriShalit)

Page 20: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

(age,gender,exercise)

Sugarlevelshadtheyreceived

medicationA

Sugarlevelshadtheyreceived

medicationB

Observedsugarlevels

(45,F,0) 6 5.5 6

(45,F,1) 7 6.5 6.5

(55,M,0) 7 6 7

(55,M,1) 9 8 8

(65,F,0) 8.5 8 8

(65,F,1) 7.5 7 7.5

(75,M,0) 10 9 9

(75,M,1) 8 7 8

mean(sugar|medication B)–mean(sugar|medicaton A)=7.875- 7.125=0.75

mean(sugar|had theyreceived B)–mean(sugar|had theyreceived A)=7.125- 7.875=-0.75

(ExamplefromUriShalit)

Page 21: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Typicalassumption– nounmeasuredconfounders

𝑌(, 𝑌+:potentialoutcomesforcontrolandtreated𝑥:unitcovariates(features)T:treatmentassignment

Weassume:(𝑌(, 𝑌+) ⫫ 𝑇|𝑥

Thepotentialoutcomesareindependentoftreatmentassignment,conditionedoncovariates𝑥

Page 22: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Typicalassumption– nounmeasuredconfounders

𝑌(, 𝑌+:potentialoutcomesforcontrolandtreated𝑥:unitcovariates(features)T:treatmentassignment

Weassume:(𝑌(, 𝑌+) ⫫ 𝑇|𝑥

Ignorability

Page 23: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

covariates(features)

treatment

Potentialoutcomes

𝑻𝒙

𝒀𝟏𝒀𝟎

Ignorability

(𝑌(, 𝑌+) ⫫ 𝑇|𝑥

Page 24: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

𝑻𝒙

𝒀𝟏𝒀𝟎

anti-hypertensivemedication

bloodpressureaftermedicationA

age,gender,weight,diet,heartrateatrest,…

bloodpressureaftermedicationB

Ignorability

(𝑌(, 𝑌+) ⫫ 𝑇|𝑥

Page 25: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

𝒙

𝒀𝟏𝒀𝟎bloodpressureaftermedicationA

age,gender,weight,diet,heartrateatrest,…

bloodpressureaftermedicationB

𝒉

NoIgnorability

diabetic𝑻

anti-hypertensivemedication

(𝑌(, 𝑌+) ⫫ 𝑇|𝑥

Page 26: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Typicalassumption– commonsupport

Y(, 𝑌+:potentialoutcomesforcontrolandtreated𝑥:unitcovariates(features)𝑇:treatmentassignment

Weassume:𝑝 𝑇 = 𝑡 𝑋 = 𝑥 > 0∀𝑡, 𝑥

Page 27: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Framingthequestion

1. Wherecouldwegotofordatatoanswerthesequestions?

2. WhatshouldX,T,andYbetosatisfyignorability?3. Whatisthespecificcausalinferencequestionthat

weareinterestedin?4. Areyouworriedaboutcommonsupport?

Page 28: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Outlineforlecture

• Howtorecognizeacausalinferenceproblem• Potentialoutcomesframework– Averagetreatmenteffect(ATE)– Conditionalaveragetreatmenteffect(CATE)

• AlgorithmsforestimatingATEandCATE

Page 29: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

AverageTreatmentEffect

Theexpectedcausaleffectof𝑇 on𝑌:ATE := E [Y1 � Y0]

Page 30: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

AverageTreatmentEffect–theadjustmentformula

• Assumingignorability,wewillderivetheadjustment formula (Hernán &Robins2010,Pearl2009)

• Theadjustmentformulaisextremelyusefulincausalinference

• AlsocalledG-formula

Page 31: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

AverageTreatmentEffect

Theexpectedcausaleffectof𝑇 on𝑌:ATE := E [Y1 � Y0]

Page 32: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

AverageTreatmentEffect

Theexpectedcausaleffectof𝑇 on𝑌:ATE := E [Y1 � Y0]

E [Y1] =

Ex⇠p(x)

⇥EY1⇠p(Y1|x) [Y1|x]

⇤=

Ex⇠p(x)

⇥EY1⇠p(Y1|x) [Y1|x, T = 1]

⇤=

Ex⇠p(x) [E [Y1|x, T = 1]]

lawoftotalexpectation

Page 33: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

AverageTreatmentEffect

Theexpectedcausaleffectof𝑇 on𝑌:ATE := E [Y1 � Y0]

E [Y1] =

Ex⇠p(x)

⇥EY1⇠p(Y1|x) [Y1|x]

⇤=

Ex⇠p(x)

⇥EY1⇠p(Y1|x) [Y1|x, T = 1]

⇤=

Ex⇠p(x) [E [Y1|x, T = 1]]

ignorability(𝑌(, 𝑌+) ⫫ 𝑇|𝑥

Page 34: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

AverageTreatmentEffect

Theexpectedcausaleffectof𝑇 on𝑌:ATE := E [Y1 � Y0]

E [Y1] =

Ex⇠p(x)

⇥EY1⇠p(Y1|x) [Y1|x]

⇤=

Ex⇠p(x)

⇥EY1⇠p(Y1|x) [Y1|x, T = 1]

⇤=

Ex⇠p(x) [E [Y1|x, T = 1]] shorternotation

Page 35: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

AverageTreatmentEffect

Theexpectedcausaleffectof𝑇 on𝑌:ATE := E [Y1 � Y0]

E [Y0] =

Ex⇠p(x)

⇥EY0⇠p(Y0|x) [Y0|x]

⇤=

Ex⇠p(x)

⇥EY0⇠p(Y0|x) [Y0|x, T = 1]

⇤=

Ex⇠p(x) [E [Y0|x, T = 0]]

Page 36: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Quantitieswecanestimate

fromdata

Theadjustmentformula(

E[Y1|x,T=1]

E[Y0|x,T=0](E [Y1|x, T = 1]

E [Y0|x, T = 0]

ATE = E [Y1 � Y0] =

Ex⇠p(x)[ E [Y1|x, T = 1]�E [Y0|x, T = 0] ]

Undertheassumptionofignorability,wehavethat:

Page 37: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Quantitieswecannotdirectly

estimatefromdata

Theadjustmentformula(

E[Y1|x,T=1]

E[Y0|x,T=0]ATE = E [Y1 � Y0] =

Ex⇠p(x)[ E [Y1|x, T = 1]�E [Y0|x, T = 0] ]

Undertheassumptionofignorability,wehavethat:

E [Y0|x, T = 1]

E [Y1|x, T = 0]

E [Y0|x]E [Y1|x]

Page 38: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Quantitieswecanestimate

fromdata

Theadjustmentformula(

E[Y1|x,T=1]

E[Y0|x,T=0](E [Y1|x, T = 1]

E [Y0|x, T = 0]

ATE = E [Y1 � Y0] =

Ex⇠p(x)[ E [Y1|x, T = 1]�E [Y0|x, T = 0] ]

Empiricallywehavesamplesfrom𝑝(𝑥|𝑇 = 1) or𝑝 𝑥 𝑇 = 0 .Extrapolate to 𝑝(𝑥)

Undertheassumptionofignorability,wehavethat:

Page 39: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Manymethods!

CovariateadjustmentPropensityscore re-weightingDoublyrobustestimatorsMatching…

Page 40: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Covariateadjustment

• Explicitlymodeltherelationshipbetweentreatment,confounders,andoutcome

• Alsocalled“ResponseSurfaceModeling”• UsedforbothITEandATE• Aregressionproblem

Page 41: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

𝑥+

𝑥\

𝑥]

𝑇

… 𝑓(𝑥, 𝑇)

𝑦

Regressionmodel

OutcomeCovariates(Features)

Page 42: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

𝑥+

𝑥\

𝑥]

𝑇

𝑦

NuisanceParameters

Regressionmodel

Outcome

Parameterofinterest

𝑓(𝑥, 𝑇)

Page 43: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Covariateadjustment(parametricg-formula)

• Explicitlymodeltherelationshipbetweentreatment,confounders,andoutcome

• Underignorability,theexpectedcausaleffectof𝑇 on𝑌:𝔼7~5 7 𝔼 𝑌+ 𝑇 = 1, 𝑥 − 𝔼 𝑌( 𝑇 = 0, 𝑥

• Fitamodel𝑓 𝑥, 𝑡 ≈ 𝔼 𝑌 𝑇 = 𝑡, 𝑥

𝐴𝑇𝐸a =1𝑛c𝑓 𝑥', 1 − 𝑓(𝑥', 0)d

'e+

Page 44: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Covariateadjustment(parametricg-formula)

• Explicitlymodeltherelationshipbetweentreatment,confounders,andoutcome

• Underignorability,theexpectedcausaleffectof𝑇 on𝑌:𝔼7~5 7 𝔼 𝑌+ 𝑇 = 1, 𝑥 − 𝔼 𝑌( 𝑇 = 0, 𝑥

• Fitamodel𝑓 𝑥, 𝑡 ≈ 𝔼 𝑌 𝑇 = 𝑡, 𝑥

𝐶𝐴𝑇𝐸a 𝑥' = 𝑓 𝑥', 1 − 𝑓(𝑥', 0)

Page 45: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Treated

𝑥 = 𝑎𝑔𝑒

𝑦 =𝑏𝑙𝑜𝑜𝑑_𝑝𝑟𝑒𝑠.

𝑌+ 𝑥

𝑌( 𝑥

Covariateadjustment

Treated

Control

Page 46: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Treated

𝑥 = 𝑎𝑔𝑒

𝑦 =𝑏𝑙𝑜𝑜𝑑_𝑝𝑟𝑒𝑠.

𝑌+ 𝑥

𝑌( 𝑥

Covariateadjustment

Treated

Control

Counterfactualtreated

Counterfactualcontrol

𝒇

Page 47: Machine Learning for Healthcare - GitHub Pages · Machine Learning for Healthcare HST.956, 6.S897 Lecture 14: Causal Inference Part 1 David Sontag

Exampleofhowcovariateadjustmentfailswhenthereisnooverlap

TreatedTreated

Control 𝑥 = 𝑎𝑔𝑒

𝑦 =𝑏𝑙𝑜𝑜𝑑_𝑝𝑟𝑒𝑠.

𝑌+ 𝑥

𝑌( 𝑥