[10pt] adjusting for unmeasured spatial confounding with ...adjusting for unmeasured spatial...

24
Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with Christine Choirat and Cory Zigler OSU, June 4, 2019 G. Papadogeorgou Spatial Causal Inference 1 / 16

Upload: others

Post on 27-Mar-2020

25 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Adjusting for unmeasured spatial confounding withdistance adjusted propensity score matching

Georgia Papadogeorgou

with Christine Choirat and Cory Zigler

OSU, June 4, 2019

G. Papadogeorgou Spatial Causal Inference 1 / 16

Page 2: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Causal inference and unmeasured structured confounding

Causal inference formalizes the notion of an effect, and providesidentifiability assumptions

One often invoked assumption is the no unmeasured confoundingassumption (+ positivity = ignorability)

It cannot be tested but sensitivity of results to violations of thisassumption can be evaluated [Rosenbaum, 2002]

Can we use unmeasured confounders’ structure to adjust for them?

Spatial structure: spatial variables vary continuously over space

G. Papadogeorgou Spatial Causal Inference 2 / 16

Page 3: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Causal inference and unmeasured structured confounding

Causal inference formalizes the notion of an effect, and providesidentifiability assumptions

One often invoked assumption is the no unmeasured confoundingassumption (+ positivity = ignorability)

It cannot be tested but sensitivity of results to violations of thisassumption can be evaluated [Rosenbaum, 2002]

Can we use unmeasured confounders’ structure to adjust for them?

Spatial structure: spatial variables vary continuously over space

G. Papadogeorgou Spatial Causal Inference 2 / 16

Page 4: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Spatial data and causal inference in air pollution research

The scientific questions are causal

Do emissions cause pollution?

What effect does an intervention onpolluting sources have on air pollutionconcentrations?

The data are spatial

Spatially-indexed

Exposure, outcome, and covariates arespatially structured

Unmeasured confounders are spatial!

Integration of spatial data and causal inference

https://kcstormfront.wordpress.com/2015/01/11/2014-in-review/

G. Papadogeorgou Spatial Causal Inference 3 / 16

Page 5: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Spatial data and causal inference in air pollution research

The scientific questions are causal

Do emissions cause pollution?

What effect does an intervention onpolluting sources have on air pollutionconcentrations?

The data are spatial

Spatially-indexed

Exposure, outcome, and covariates arespatially structured

Unmeasured confounders are spatial!

Integration of spatial data and causal inference

https://kcstormfront.wordpress.com/2015/01/11/2014-in-review/

G. Papadogeorgou Spatial Causal Inference 3 / 16

Page 6: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Spatial data and causal inference in air pollution research

The scientific questions are causal

Do emissions cause pollution?

What effect does an intervention onpolluting sources have on air pollutionconcentrations?

The data are spatial

Spatially-indexed

Exposure, outcome, and covariates arespatially structured

Unmeasured confounders are spatial!

Integration of spatial data and causal inference

https://kcstormfront.wordpress.com/2015/01/11/2014-in-review/

G. Papadogeorgou Spatial Causal Inference 3 / 16

Page 7: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Air pollution regulations and their impact

Regulations such as the Clear Air Act enforce stricter rules on emissionsaiming to reduce ambient air pollution

Source-specific emissions like power plants and motor vehicles

Power plants follow various strategies to comply to these regulations

We focus on the installation of NOx emission reduction controltechnologies

Are SCR/SNCR more effective than alternative strategies in reducingambient ozone concentrations?

NOx: Nitric oxide and nitrogen dioxides, precursors of ozone, reacting with othercompounds in the presence of sunlight to create ozone

SCR/SNCR: Selective Catalytic/Non-Catalytic Reduction technology

G. Papadogeorgou Spatial Causal Inference 4 / 16

Page 8: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Air pollution regulations and their impact

Regulations such as the Clear Air Act enforce stricter rules on emissionsaiming to reduce ambient air pollution

Source-specific emissions like power plants and motor vehicles

Power plants follow various strategies to comply to these regulations

We focus on the installation of NOx emission reduction controltechnologies

Are SCR/SNCR more effective than alternative strategies in reducingambient ozone concentrations?

NOx: Nitric oxide and nitrogen dioxides, precursors of ozone, reacting with othercompounds in the presence of sunlight to create ozone

SCR/SNCR: Selective Catalytic/Non-Catalytic Reduction technology

G. Papadogeorgou Spatial Causal Inference 4 / 16

Page 9: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Comparative effectiveness of power plant NOx emission reductiontechnologies

SCR/SNCR systems are the most effective in reducing NOx

Ozone is a secondary pollutant

NOx reacts with volatile organic compounds (VOCs) and carbonmonoxide in the presence of sunlight to create ozone

VOCs, sunlight might be spatial confounders and they are unmeasured

NOx: Nitric oxide and nitrogen dioxides, precursors of ozoneG. Papadogeorgou Spatial Causal Inference 5 / 16

Page 10: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Notation

For unit i

Treatment Zi ∈ {0, 1}Potential outcomes {Yi(0), Yi(1)}Observed outcome Yi = Yi(Zi)

Covariates Xi = (Xi1, Xi2, . . . , Xip)

Average treatment effect on the treated

ATT = E[Y (1)− Y (0)|Z = 1]

G. Papadogeorgou Spatial Causal Inference 6 / 16

Page 11: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Identifiability and estimation of the ATT

Common identifiability assumptions

Positivity: p(Z = z|X) > 0, z ∈ ZNo unmeasured confounding: Y (z) ⊥⊥ Z|X

Estimate the average potential outcome via propensity score methods,outcome regression, or combinations

Confounders L = (C,U), C are observed, U are unobserved

If U varies spatially, can we adjust for it?

Matched pairs should be similar in terms of

1 Observed covariates2 Unmeasured spatial covariates (small geographical distance)

G. Papadogeorgou Spatial Causal Inference 7 / 16

Page 12: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Identifiability and estimation of the ATT

Common identifiability assumptions

Positivity: p(Z = z|X) > 0, z ∈ ZNo unmeasured confounding: Y (z) ⊥⊥ Z|X

Estimate the average potential outcome via propensity score methods,outcome regression, or combinations

Confounders L = (C,U), C are observed, U are unobserved

If U varies spatially, can we adjust for it?

Matched pairs should be similar in terms of

1 Observed covariates2 Unmeasured spatial covariates (small geographical distance)

G. Papadogeorgou Spatial Causal Inference 7 / 16

Page 13: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Identifiability and estimation of the ATT

Common identifiability assumptions

Positivity: p(Z = z|X) > 0, z ∈ ZNo unmeasured confounding: Y (z) ⊥⊥ Z|X

Estimate the average potential outcome via propensity score methods,outcome regression, or combinations

Confounders L = (C,U), C are observed, U are unobserved

If U varies spatially, can we adjust for it?

Matched pairs should be similar in terms of

1 Observed covariates2 Unmeasured spatial covariates (small geographical distance)

G. Papadogeorgou Spatial Causal Inference 7 / 16

Page 14: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Distance Adjusted Propensity Score Matching

Propensity score model using measured variables C:P (Zi = 1|Ci) = expit

(CT

i β)

For a treated unit i and a control unit j define

DAPSij = w|PSi − PSj |+ (1− w) ∗Distij , w ∈ [0, 1]

where PS are propensity score estimates, and Dist represents spatialproximity

Small value of DAPSij means:

Similar propensity scores

Points in close geographical distance (similar values of U !)

w: relative importance of the observed and unobserved confounders

High values of w - priority to observed covariates

Low values of w - priority to spatial proximity

G. Papadogeorgou Spatial Causal Inference 8 / 16

Page 15: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Choosing w

1 Match treated to control units for various values of w

2 Assess balance of the observed covariates

3 Choose the smallest value of w that achieves observed covariate balance

G. Papadogeorgou Spatial Causal Inference 9 / 16

Page 16: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Data

Coal and natural gas power plantsduring June-August 2004

Z = 1 if at least half of facility heatinput is used by units with installedSCR/SNCR technologies, Z = 0otherwise

152 treated facilities, 321 controls

SnCR

Treated

Control

Treatment Assignment

Y : NOx emissions / 4th maximum ambient ozone concentration

Covariates: Power plant characteristics, demographics, weather

Publicly available data sources: Air Markets Program Data, 2000 Census, EPA monitoring sitesG. Papadogeorgou Spatial Causal Inference 10 / 16

Page 17: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Observed covariate balance as a function of w

Absolute standardized difference of means as a function of w

Weight

AS

DM

0.0

0.1

0.2

0.3

0.4

Full−Data 0 0.1 0.21 0.33 0.46 0.59 0.72 0.85 0.97

% Operating CapacityARP Phase 2Heat InputGas facilitySmall sized facilityMedium sized facility

Power plant characteristics

Weight

AS

DM

0.0

0.1

0.2

0.3

0.4

Full−Data 0 0.1 0.21 0.33 0.46 0.59 0.72 0.85 0.97

4th Max Temp% Urban% White% Black% Hispanic% High SchoolHousehold Income% Poor% Occupied% 5−year residentsHouse ValuePopulation / square mile

Area level characteristics

G. Papadogeorgou Spatial Causal Inference 11 / 16

Page 18: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Matches

Naive pairs DAPSm pairs

Average distance of matched pairs

Naıve: 1066 miles

DAPSm: 141 miles

G. Papadogeorgou Spatial Causal Inference 12 / 16

Page 19: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Results

-400

-300

-200

-100

0

Naive Distance Caliper Keele et al DAPSm

Estim

ate

SCR/SNCR on NOx emissions

-0.002

0.000

0.002

Naive Distance Caliper Keele et al DAPSm

Estim

ate

SCR/SNCR on 4th maximum ozone

Reduction by 205 tons of NOx emissions (95% CI: 4 – 406)

−0.27 (95% CI: −2.1 to 1.56) parts per billion in ambient ozone

– The national ambient air quality standard for ozone is 70 parts per billion.– Keele et al. [2015]

G. Papadogeorgou Spatial Causal Inference 13 / 16

Page 20: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Evaluating the presence of unmeasured spatial confounding

-600

-400

-200

0

0.00 0.25 0.50 0.75 1.00

Weight

Estim

ate

Estimates with 95% confidence intervals for NOx analysis

-0.002

0.000

0.002

0.00 0.25 0.50 0.75 1.00

Weight

Estim

ate

Estimates with 95% confidence intervals for Ozone analysis

G. Papadogeorgou Spatial Causal Inference 14 / 16

Page 21: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Conclusions

SCR/SNCR control technologies lead toReductions in NOx emissions

Their effect on ozone is not significant

When interference between units is accounted for, SCR/SNCR leads toreductions in ambient ozone concentrations [Papadogeorgou et al., 2019]

Approaches like DAPSm are not immediately compatible with spatialmodels

Bridging the two strands of literature (Patrick Schnell’s talk yesterday)

Unmeasured confounding is one of the main criticisms of air pollutionepidemiology

We can address this usingSensitivitiy analysis

Analysis mitigating bias by unmeasured structured confounders

Papadogeorgou, Choirat, and Zigler [2018]G. Papadogeorgou Spatial Causal Inference 15 / 16

Page 22: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Conclusions

SCR/SNCR control technologies lead toReductions in NOx emissions

Their effect on ozone is not significant

When interference between units is accounted for, SCR/SNCR leads toreductions in ambient ozone concentrations [Papadogeorgou et al., 2019]

Approaches like DAPSm are not immediately compatible with spatialmodels

Bridging the two strands of literature (Patrick Schnell’s talk yesterday)

Unmeasured confounding is one of the main criticisms of air pollutionepidemiology

We can address this usingSensitivitiy analysis

Analysis mitigating bias by unmeasured structured confounders

Papadogeorgou, Choirat, and Zigler [2018]G. Papadogeorgou Spatial Causal Inference 15 / 16

Page 23: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

Conclusions

SCR/SNCR control technologies lead toReductions in NOx emissions

Their effect on ozone is not significant

When interference between units is accounted for, SCR/SNCR leads toreductions in ambient ozone concentrations [Papadogeorgou et al., 2019]

Approaches like DAPSm are not immediately compatible with spatialmodels

Bridging the two strands of literature (Patrick Schnell’s talk yesterday)

Unmeasured confounding is one of the main criticisms of air pollutionepidemiology

We can address this usingSensitivitiy analysis

Analysis mitigating bias by unmeasured structured confounders

Papadogeorgou, Choirat, and Zigler [2018]G. Papadogeorgou Spatial Causal Inference 15 / 16

Page 24: [10pt] Adjusting for unmeasured spatial confounding with ...Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching Georgia Papadogeorgou with

References

https://github.com/gpapadog/DAPSm

Luke Keele, Rocio Titiunik, and Jose R Zubizarreta. Enhancing a geographic regressiondiscontinuity design through matching to estimate the effect of ballot initiatives on voterturnout. Journal of the Royal Statistical Society: Series A (Statistics in Society), 178(1):223–239, 2015.

Georgia Papadogeorgou, Christine Choirat, and Corwin M Zigler. Adjusting for unmeasuredspatial confounding with distance adjusted propensity score matching. Biostatistics, 20(2):256–272, 2018.

Georgia Papadogeorgou, Fabrizia Mealli, and Corwin M Zigler. Causal inference with interferingunits for cluster and population level treatment allocation programs. Biometrics, 2019.

Paul R. Rosenbaum. Sensitivity to Hidden Bias, pages 105–170. Springer New York, New York,NY, 2002. ISBN 978-1-4757-3692-2.

G. Papadogeorgou Spatial Causal Inference 16 / 16