advanced quantitative research methodology, lecture...gary king (harvard, iqss) advanced...

126
Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual Inference 1 Gary King March 26, 2016 1 GaryKing.org, c Copyright 2016 Gary King, All Rights Reserved. Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: March 26, 2016 1 / 23

Upload: others

Post on 30-Dec-2020

50 views

Category:

Documents


6 download

TRANSCRIPT

Page 1: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Advanced Quantitative Research Methodology, LectureNotes: Model Dependence in Counterfactual Inference1

Gary King

March 26, 2016

1GaryKing.org, c©Copyright 2016 Gary King, All Rights Reserved.Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 1 / 23

Page 2: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

References

King, Gary and Langche Zeng. “The Dangers of ExtremeCounterfactuals,” Political Analysis, 14, 2, (2007): 131-159.

King, Gary and Langche Zeng. “When Can History be Our Guide?The Pitfalls of Counterfactual Inference,” International StudiesQuarterly, 2006, 51 (March, 2007): 183–210.

Related Software: WhatIf, MatchIt, Zelig, CEM

http://j.mp/causalinference

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 2 / 23

Page 3: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

References

King, Gary and Langche Zeng. “The Dangers of ExtremeCounterfactuals,” Political Analysis, 14, 2, (2007): 131-159.

King, Gary and Langche Zeng. “When Can History be Our Guide?The Pitfalls of Counterfactual Inference,” International StudiesQuarterly, 2006, 51 (March, 2007): 183–210.

Related Software: WhatIf, MatchIt, Zelig, CEM

http://j.mp/causalinference

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 2 / 23

Page 4: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

References

King, Gary and Langche Zeng. “The Dangers of ExtremeCounterfactuals,” Political Analysis, 14, 2, (2007): 131-159.

King, Gary and Langche Zeng. “When Can History be Our Guide?The Pitfalls of Counterfactual Inference,” International StudiesQuarterly, 2006, 51 (March, 2007): 183–210.

Related Software: WhatIf, MatchIt, Zelig, CEM

http://j.mp/causalinference

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 2 / 23

Page 5: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

References

King, Gary and Langche Zeng. “The Dangers of ExtremeCounterfactuals,” Political Analysis, 14, 2, (2007): 131-159.

King, Gary and Langche Zeng. “When Can History be Our Guide?The Pitfalls of Counterfactual Inference,” International StudiesQuarterly, 2006, 51 (March, 2007): 183–210.

Related Software: WhatIf, MatchIt, Zelig, CEM

http://j.mp/causalinference

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 2 / 23

Page 6: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactuals

Three types:

1 Forecasts Will the U.S. be in Afghanistan in 2016?2 Whatif Questions What would have happened if the U.S. had not

invaded Iraq?3 Causal Effects What is the causal effect of the Iraq war on U.S.

Supreme Court decision making? (a factual minus a counterfactual)

Counterfactuals are some part of most social science research

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 3 / 23

Page 7: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactuals

Three types:

1 Forecasts Will the U.S. be in Afghanistan in 2016?2 Whatif Questions What would have happened if the U.S. had not

invaded Iraq?3 Causal Effects What is the causal effect of the Iraq war on U.S.

Supreme Court decision making? (a factual minus a counterfactual)

Counterfactuals are some part of most social science research

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 3 / 23

Page 8: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactuals

Three types:1 Forecasts Will the U.S. be in Afghanistan in 2016?

2 Whatif Questions What would have happened if the U.S. had notinvaded Iraq?

3 Causal Effects What is the causal effect of the Iraq war on U.S.Supreme Court decision making? (a factual minus a counterfactual)

Counterfactuals are some part of most social science research

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 3 / 23

Page 9: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactuals

Three types:1 Forecasts Will the U.S. be in Afghanistan in 2016?2 Whatif Questions What would have happened if the U.S. had not

invaded Iraq?

3 Causal Effects What is the causal effect of the Iraq war on U.S.Supreme Court decision making? (a factual minus a counterfactual)

Counterfactuals are some part of most social science research

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 3 / 23

Page 10: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactuals

Three types:1 Forecasts Will the U.S. be in Afghanistan in 2016?2 Whatif Questions What would have happened if the U.S. had not

invaded Iraq?3 Causal Effects What is the causal effect of the Iraq war on U.S.

Supreme Court decision making? (a factual minus a counterfactual)

Counterfactuals are some part of most social science research

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 3 / 23

Page 11: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactuals

Three types:1 Forecasts Will the U.S. be in Afghanistan in 2016?2 Whatif Questions What would have happened if the U.S. had not

invaded Iraq?3 Causal Effects What is the causal effect of the Iraq war on U.S.

Supreme Court decision making? (a factual minus a counterfactual)

Counterfactuals are some part of most social science research

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 3 / 23

Page 12: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 13: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 14: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.

finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 15: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.

sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 16: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.

run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 17: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.

run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 18: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.

run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 19: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.

run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 20: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.

run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 21: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.

end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 22: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.

put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 23: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 24: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 25: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.

Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 26: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence in Practice

How do you conduct empirical analyses?

collect the data over many months or years.finish recording and merging.sit in front of your computer with nobody to bother you.run one regression.run another regression with different control variables.run another regression with different functional forms.run another regression with different measures.run yet another regression with a subset of the data.end up with 100 or 1000 different estimates.put 1 or maybe 5 regression results in the paper.

What’s the problem?

Some specification is designated as the “correct” one, only afterlooking at the estimates.Is this a true test of an ex ante hypothesis or merely a demonstrationthat it is possible to find results consistent with your favoritehypothesis?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 4 / 23

Page 27: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Which model would you choose? (Both fit the data well.)

Compare prediction at x = 1.5 to prediction at x = 5

How do you choose a model?

R2? Some “test”? “Theory”?

The bottom line: answers to some questions don’t exist in the data.

Same for what if questions, predictions, and causal inferences

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 5 / 23

Page 28: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Which model would you choose? (Both fit the data well.)

Compare prediction at x = 1.5 to prediction at x = 5

How do you choose a model?

R2? Some “test”? “Theory”?

The bottom line: answers to some questions don’t exist in the data.

Same for what if questions, predictions, and causal inferences

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 5 / 23

Page 29: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Which model would you choose? (Both fit the data well.)

Compare prediction at x = 1.5 to prediction at x = 5

How do you choose a model?

R2? Some “test”? “Theory”?

The bottom line: answers to some questions don’t exist in the data.

Same for what if questions, predictions, and causal inferences

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 5 / 23

Page 30: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Which model would you choose? (Both fit the data well.)

Compare prediction at x = 1.5 to prediction at x = 5

How do you choose a model? R2?

Some “test”? “Theory”?

The bottom line: answers to some questions don’t exist in the data.

Same for what if questions, predictions, and causal inferences

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 5 / 23

Page 31: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Which model would you choose? (Both fit the data well.)

Compare prediction at x = 1.5 to prediction at x = 5

How do you choose a model? R2? Some “test”?

“Theory”?

The bottom line: answers to some questions don’t exist in the data.

Same for what if questions, predictions, and causal inferences

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 5 / 23

Page 32: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Which model would you choose? (Both fit the data well.)

Compare prediction at x = 1.5 to prediction at x = 5

How do you choose a model? R2? Some “test”? “Theory”?

The bottom line: answers to some questions don’t exist in the data.

Same for what if questions, predictions, and causal inferences

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 5 / 23

Page 33: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Which model would you choose? (Both fit the data well.)

Compare prediction at x = 1.5 to prediction at x = 5

How do you choose a model? R2? Some “test”? “Theory”?

The bottom line: answers to some questions don’t exist in the data.

Same for what if questions, predictions, and causal inferences

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 5 / 23

Page 34: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Which model would you choose? (Both fit the data well.)

Compare prediction at x = 1.5 to prediction at x = 5

How do you choose a model? R2? Some “test”? “Theory”?

The bottom line: answers to some questions don’t exist in the data.

Same for what if questions, predictions, and causal inferences

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 5 / 23

Page 35: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence Proof

Model Free Inference

To estimate E (Y |X = x) at x , average many observed Y with value x

Assumptions (Model-Based Inference)

1 Definition: model dependence at x is the difference between predictedoutcomes for any two models that fit about equally well.

2 The functional form follows strong continuity (think smoothness,although it is less restrictive)

Result

The maximum degree of model dependence: solely a function of thedistance from the counterfactual to the data

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 6 / 23

Page 36: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence Proof

Model Free Inference

To estimate E (Y |X = x) at x , average many observed Y with value x

Assumptions (Model-Based Inference)

1 Definition: model dependence at x is the difference between predictedoutcomes for any two models that fit about equally well.

2 The functional form follows strong continuity (think smoothness,although it is less restrictive)

Result

The maximum degree of model dependence: solely a function of thedistance from the counterfactual to the data

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 6 / 23

Page 37: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence Proof

Model Free Inference

To estimate E (Y |X = x) at x , average many observed Y with value x

Assumptions (Model-Based Inference)

1 Definition: model dependence at x is the difference between predictedoutcomes for any two models that fit about equally well.

2 The functional form follows strong continuity (think smoothness,although it is less restrictive)

Result

The maximum degree of model dependence: solely a function of thedistance from the counterfactual to the data

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 6 / 23

Page 38: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence Proof

Model Free Inference

To estimate E (Y |X = x) at x , average many observed Y with value x

Assumptions (Model-Based Inference)

1 Definition: model dependence at x is the difference between predictedoutcomes for any two models that fit about equally well.

2 The functional form follows strong continuity (think smoothness,although it is less restrictive)

Result

The maximum degree of model dependence: solely a function of thedistance from the counterfactual to the data

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 6 / 23

Page 39: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence Proof

Model Free Inference

To estimate E (Y |X = x) at x , average many observed Y with value x

Assumptions (Model-Based Inference)

1 Definition: model dependence at x is the difference between predictedoutcomes for any two models that fit about equally well.

2 The functional form follows strong continuity (think smoothness,although it is less restrictive)

Result

The maximum degree of model dependence: solely a function of thedistance from the counterfactual to the data

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 6 / 23

Page 40: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence Proof

Model Free Inference

To estimate E (Y |X = x) at x , average many observed Y with value x

Assumptions (Model-Based Inference)

1 Definition: model dependence at x is the difference between predictedoutcomes for any two models that fit about equally well.

2 The functional form follows strong continuity (think smoothness,although it is less restrictive)

Result

The maximum degree of model dependence: solely a function of thedistance from the counterfactual to the data

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 6 / 23

Page 41: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence Proof

Model Free Inference

To estimate E (Y |X = x) at x , average many observed Y with value x

Assumptions (Model-Based Inference)

1 Definition: model dependence at x is the difference between predictedoutcomes for any two models that fit about equally well.

2 The functional form follows strong continuity (think smoothness,although it is less restrictive)

Result

The maximum degree of model dependence: solely a function of thedistance from the counterfactual to the data

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 6 / 23

Page 42: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence Proof

Model Free Inference

To estimate E (Y |X = x) at x , average many observed Y with value x

Assumptions (Model-Based Inference)

1 Definition: model dependence at x is the difference between predictedoutcomes for any two models that fit about equally well.

2 The functional form follows strong continuity (think smoothness,although it is less restrictive)

Result

The maximum degree of model dependence: solely a function of thedistance from the counterfactual to the data

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 6 / 23

Page 43: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Detecting Model Dependence

Randomly select a large number of infants

Randomly assign them to 0,6,8,10,12,16 years of education

Assume 100% compliance, and no measurement error, omittedvariables, or missing data

Regress cumulative salary in year 17 on education

We find a coefficient of β = $1, 000, big t-statistics, narrowconfidence intervals, and pass every test for auto-correlation, fit,normality, linearity, homoskedasticity, etc.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 7 / 23

Page 44: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Detecting Model DependenceA (Hypothethical) Research Design

Randomly select a large number of infants

Randomly assign them to 0,6,8,10,12,16 years of education

Assume 100% compliance, and no measurement error, omittedvariables, or missing data

Regress cumulative salary in year 17 on education

We find a coefficient of β = $1, 000, big t-statistics, narrowconfidence intervals, and pass every test for auto-correlation, fit,normality, linearity, homoskedasticity, etc.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 7 / 23

Page 45: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Detecting Model DependenceA (Hypothethical) Research Design

Randomly select a large number of infants

Randomly assign them to 0,6,8,10,12,16 years of education

Assume 100% compliance, and no measurement error, omittedvariables, or missing data

Regress cumulative salary in year 17 on education

We find a coefficient of β = $1, 000, big t-statistics, narrowconfidence intervals, and pass every test for auto-correlation, fit,normality, linearity, homoskedasticity, etc.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 7 / 23

Page 46: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Detecting Model DependenceA (Hypothethical) Research Design

Randomly select a large number of infants

Randomly assign them to 0,6,8,10,12,16 years of education

Assume 100% compliance, and no measurement error, omittedvariables, or missing data

Regress cumulative salary in year 17 on education

We find a coefficient of β = $1, 000, big t-statistics, narrowconfidence intervals, and pass every test for auto-correlation, fit,normality, linearity, homoskedasticity, etc.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 7 / 23

Page 47: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Detecting Model DependenceA (Hypothethical) Research Design

Randomly select a large number of infants

Randomly assign them to 0,6,8,10,12,16 years of education

Assume 100% compliance, and no measurement error, omittedvariables, or missing data

Regress cumulative salary in year 17 on education

We find a coefficient of β = $1, 000, big t-statistics, narrowconfidence intervals, and pass every test for auto-correlation, fit,normality, linearity, homoskedasticity, etc.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 7 / 23

Page 48: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Detecting Model DependenceA (Hypothethical) Research Design

Randomly select a large number of infants

Randomly assign them to 0,6,8,10,12,16 years of education

Assume 100% compliance, and no measurement error, omittedvariables, or missing data

Regress cumulative salary in year 17 on education

We find a coefficient of β = $1, 000, big t-statistics, narrowconfidence intervals, and pass every test for auto-correlation, fit,normality, linearity, homoskedasticity, etc.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 7 / 23

Page 49: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Detecting Model DependenceA (Hypothethical) Research Design

Randomly select a large number of infants

Randomly assign them to 0,6,8,10,12,16 years of education

Assume 100% compliance, and no measurement error, omittedvariables, or missing data

Regress cumulative salary in year 17 on education

We find a coefficient of β = $1, 000, big t-statistics, narrowconfidence intervals, and pass every test for auto-correlation, fit,normality, linearity, homoskedasticity, etc.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 7 / 23

Page 50: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

What Inferences Would You Be Willing to Make?

A Factual Question: How much salary would someone receive with 12years of education (a high school degree)?

The model-free estimate: mean(Y ) among those with X = 12.

The model-based estimate: Y = X β = 12× $1, 000 = $12, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 8 / 23

Page 51: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

What Inferences Would You Be Willing to Make?

A Factual Question: How much salary would someone receive with 12years of education (a high school degree)?

The model-free estimate: mean(Y ) among those with X = 12.

The model-based estimate: Y = X β = 12× $1, 000 = $12, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 8 / 23

Page 52: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

What Inferences Would You Be Willing to Make?

A Factual Question: How much salary would someone receive with 12years of education (a high school degree)?

The model-free estimate: mean(Y ) among those with X = 12.

The model-based estimate: Y = X β = 12× $1, 000 = $12, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 8 / 23

Page 53: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

What Inferences Would You Be Willing to Make?

A Factual Question: How much salary would someone receive with 12years of education (a high school degree)?

The model-free estimate: mean(Y ) among those with X = 12.

The model-based estimate: Y = X β = 12× $1, 000 = $12, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 8 / 23

Page 54: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactual Inferences with Interpolation

How much salary would someone receive with 14 years of education(an Associates Degree)?

Model free estimate: impossible

Model-based estimate: Y = X β = 14× $1, 000 = $14, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 9 / 23

Page 55: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactual Inferences with Interpolation

How much salary would someone receive with 14 years of education(an Associates Degree)?

Model free estimate: impossible

Model-based estimate: Y = X β = 14× $1, 000 = $14, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 9 / 23

Page 56: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactual Inferences with Interpolation

How much salary would someone receive with 14 years of education(an Associates Degree)?

Model free estimate: impossible

Model-based estimate: Y = X β = 14× $1, 000 = $14, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 9 / 23

Page 57: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactual Inferences with Interpolation

How much salary would someone receive with 14 years of education(an Associates Degree)?

Model free estimate: impossible

Model-based estimate: Y = X β = 14× $1, 000 = $14, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 9 / 23

Page 58: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactual Inference with Extrapolation

How much salary would someone receive with 24 years of education(a Ph.D.)?

Y = X β = 24× $1, 000 = $24, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 10 / 23

Page 59: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactual Inference with Extrapolation

How much salary would someone receive with 24 years of education(a Ph.D.)?

Y = X β = 24× $1, 000 = $24, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 10 / 23

Page 60: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Counterfactual Inference with Extrapolation

How much salary would someone receive with 24 years of education(a Ph.D.)?

Y = X β = 24× $1, 000 = $24, 000

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 10 / 23

Page 61: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Another Counterfactual Inference with Extrapolation

How much salary would someone receive with 53 years of education?Y = X β = 53× $1, 000 = $53, 000Recall: the regression passed every test and met every assumption;identical calculations worked for the other questions.What’s changed? How would we recognize it when the example is lessextreme or multidimensional?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 11 / 23

Page 62: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Another Counterfactual Inference with Extrapolation

How much salary would someone receive with 53 years of education?

Y = X β = 53× $1, 000 = $53, 000Recall: the regression passed every test and met every assumption;identical calculations worked for the other questions.What’s changed? How would we recognize it when the example is lessextreme or multidimensional?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 11 / 23

Page 63: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Another Counterfactual Inference with Extrapolation

How much salary would someone receive with 53 years of education?Y = X β = 53× $1, 000 = $53, 000

Recall: the regression passed every test and met every assumption;identical calculations worked for the other questions.What’s changed? How would we recognize it when the example is lessextreme or multidimensional?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 11 / 23

Page 64: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Another Counterfactual Inference with Extrapolation

How much salary would someone receive with 53 years of education?Y = X β = 53× $1, 000 = $53, 000Recall: the regression passed every test and met every assumption;identical calculations worked for the other questions.

What’s changed? How would we recognize it when the example is lessextreme or multidimensional?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 11 / 23

Page 65: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Another Counterfactual Inference with Extrapolation

How much salary would someone receive with 53 years of education?Y = X β = 53× $1, 000 = $53, 000Recall: the regression passed every test and met every assumption;identical calculations worked for the other questions.What’s changed? How would we recognize it when the example is lessextreme or multidimensional?

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 11 / 23

Page 66: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with One Explanatory Variable

Suppose Y is starting salary; X is education in 10 categories.

To estimate E (Y |X ): we need 10 parameters, E (Y |X = xj),j = 1, . . . , 10.

Model-free method: average 50 observations on Y for each value of X

Model-based method: regress Y on X , summarizing 10 parameterswith 2 (intercept and slope).

The difference between the 10 we need and the 2 we estimate withregression is pure assumption.

(If X were continuous, we would be reducing ∞ to 2, also byassumption)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 12 / 23

Page 67: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with One Explanatory Variable

Suppose Y is starting salary; X is education in 10 categories.

To estimate E (Y |X ): we need 10 parameters, E (Y |X = xj),j = 1, . . . , 10.

Model-free method: average 50 observations on Y for each value of X

Model-based method: regress Y on X , summarizing 10 parameterswith 2 (intercept and slope).

The difference between the 10 we need and the 2 we estimate withregression is pure assumption.

(If X were continuous, we would be reducing ∞ to 2, also byassumption)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 12 / 23

Page 68: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with One Explanatory Variable

Suppose Y is starting salary; X is education in 10 categories.

To estimate E (Y |X ): we need 10 parameters, E (Y |X = xj),j = 1, . . . , 10.

Model-free method: average 50 observations on Y for each value of X

Model-based method: regress Y on X , summarizing 10 parameterswith 2 (intercept and slope).

The difference between the 10 we need and the 2 we estimate withregression is pure assumption.

(If X were continuous, we would be reducing ∞ to 2, also byassumption)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 12 / 23

Page 69: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with One Explanatory Variable

Suppose Y is starting salary; X is education in 10 categories.

To estimate E (Y |X ): we need 10 parameters, E (Y |X = xj),j = 1, . . . , 10.

Model-free method: average 50 observations on Y for each value of X

Model-based method: regress Y on X , summarizing 10 parameterswith 2 (intercept and slope).

The difference between the 10 we need and the 2 we estimate withregression is pure assumption.

(If X were continuous, we would be reducing ∞ to 2, also byassumption)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 12 / 23

Page 70: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with One Explanatory Variable

Suppose Y is starting salary; X is education in 10 categories.

To estimate E (Y |X ): we need 10 parameters, E (Y |X = xj),j = 1, . . . , 10.

Model-free method: average 50 observations on Y for each value of X

Model-based method: regress Y on X , summarizing 10 parameterswith 2 (intercept and slope).

The difference between the 10 we need and the 2 we estimate withregression is pure assumption.

(If X were continuous, we would be reducing ∞ to 2, also byassumption)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 12 / 23

Page 71: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with One Explanatory Variable

Suppose Y is starting salary; X is education in 10 categories.

To estimate E (Y |X ): we need 10 parameters, E (Y |X = xj),j = 1, . . . , 10.

Model-free method: average 50 observations on Y for each value of X

Model-based method: regress Y on X , summarizing 10 parameterswith 2 (intercept and slope).

The difference between the 10 we need and the 2 we estimate withregression is pure assumption.

(If X were continuous, we would be reducing ∞ to 2, also byassumption)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 12 / 23

Page 72: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with One Explanatory Variable

Suppose Y is starting salary; X is education in 10 categories.

To estimate E (Y |X ): we need 10 parameters, E (Y |X = xj),j = 1, . . . , 10.

Model-free method: average 50 observations on Y for each value of X

Model-based method: regress Y on X , summarizing 10 parameterswith 2 (intercept and slope).

The difference between the 10 we need and the 2 we estimate withregression is pure assumption.

(If X were continuous, we would be reducing ∞ to 2, also byassumption)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 12 / 23

Page 73: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory Variables

How many parameters do we now need to estimate?

20? Nope. Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 74: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate?

20? Nope. Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 75: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate?

20? Nope. Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 76: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate? 20?

Nope. Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 77: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate? 20? Nope.

Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 78: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate? 20? Nope. Its10× 10 = 100.

This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 79: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate? 20? Nope. Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 80: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate? 20? Nope. Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 81: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate? 20? Nope. Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 82: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Two Explanatory VariablesVariables: X (education) and Z , parent’s income, both with 10 categories

How many parameters do we now need to estimate? 20? Nope. Its10× 10 = 100. This is the curse of dimensionality: the number ofparameters goes up geometrically, not additively.

If we run a regression, we are summarizing 100 parameters with 3 (anintercept and two slopes).

But what about including an interaction? Right, so now we’resummarizing 100 parameters with 4.

The difference: an enormous assumption based on convenience, notevidence or theory.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 13 / 23

Page 83: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Many Explanatory Variables

Suppose: 15 explanatory variables, with 10 categories each.

need to estimate 1015 (a quadrillion) parameters with how manyobservations?Regression reduces this to 16 parameters; quite an assumption!

Suppose: 80 explanatory variables.

1080 is more than the number of atoms in the universe.Yet, with a few simple assumptions, we can still run a regression andestimate only 81 parameters.

The curse of dimensionality introduces huge assumptions, oftenrecognized.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 14 / 23

Page 84: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Many Explanatory Variables

Suppose: 15 explanatory variables, with 10 categories each.

need to estimate 1015 (a quadrillion) parameters with how manyobservations?Regression reduces this to 16 parameters; quite an assumption!

Suppose: 80 explanatory variables.

1080 is more than the number of atoms in the universe.Yet, with a few simple assumptions, we can still run a regression andestimate only 81 parameters.

The curse of dimensionality introduces huge assumptions, oftenrecognized.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 14 / 23

Page 85: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Many Explanatory Variables

Suppose: 15 explanatory variables, with 10 categories each.

need to estimate 1015 (a quadrillion) parameters with how manyobservations?

Regression reduces this to 16 parameters; quite an assumption!

Suppose: 80 explanatory variables.

1080 is more than the number of atoms in the universe.Yet, with a few simple assumptions, we can still run a regression andestimate only 81 parameters.

The curse of dimensionality introduces huge assumptions, oftenrecognized.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 14 / 23

Page 86: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Many Explanatory Variables

Suppose: 15 explanatory variables, with 10 categories each.

need to estimate 1015 (a quadrillion) parameters with how manyobservations?Regression reduces this to 16 parameters; quite an assumption!

Suppose: 80 explanatory variables.

1080 is more than the number of atoms in the universe.Yet, with a few simple assumptions, we can still run a regression andestimate only 81 parameters.

The curse of dimensionality introduces huge assumptions, oftenrecognized.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 14 / 23

Page 87: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Many Explanatory Variables

Suppose: 15 explanatory variables, with 10 categories each.

need to estimate 1015 (a quadrillion) parameters with how manyobservations?Regression reduces this to 16 parameters; quite an assumption!

Suppose: 80 explanatory variables.

1080 is more than the number of atoms in the universe.Yet, with a few simple assumptions, we can still run a regression andestimate only 81 parameters.

The curse of dimensionality introduces huge assumptions, oftenrecognized.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 14 / 23

Page 88: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Many Explanatory Variables

Suppose: 15 explanatory variables, with 10 categories each.

need to estimate 1015 (a quadrillion) parameters with how manyobservations?Regression reduces this to 16 parameters; quite an assumption!

Suppose: 80 explanatory variables.

1080 is more than the number of atoms in the universe.

Yet, with a few simple assumptions, we can still run a regression andestimate only 81 parameters.

The curse of dimensionality introduces huge assumptions, oftenrecognized.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 14 / 23

Page 89: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Many Explanatory Variables

Suppose: 15 explanatory variables, with 10 categories each.

need to estimate 1015 (a quadrillion) parameters with how manyobservations?Regression reduces this to 16 parameters; quite an assumption!

Suppose: 80 explanatory variables.

1080 is more than the number of atoms in the universe.Yet, with a few simple assumptions, we can still run a regression andestimate only 81 parameters.

The curse of dimensionality introduces huge assumptions, oftenrecognized.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 14 / 23

Page 90: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Model Dependence with Many Explanatory Variables

Suppose: 15 explanatory variables, with 10 categories each.

need to estimate 1015 (a quadrillion) parameters with how manyobservations?Regression reduces this to 16 parameters; quite an assumption!

Suppose: 80 explanatory variables.

1080 is more than the number of atoms in the universe.Yet, with a few simple assumptions, we can still run a regression andestimate only 81 parameters.

The curse of dimensionality introduces huge assumptions, oftenrecognized.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 14 / 23

Page 91: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .Assume E(Y |X ) is (minimally) smooth in XNo need to specify models (or a class of models), estimators, ordependent variables.Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 92: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .Assume E(Y |X ) is (minimally) smooth in XNo need to specify models (or a class of models), estimators, ordependent variables.Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 93: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .Assume E(Y |X ) is (minimally) smooth in XNo need to specify models (or a class of models), estimators, ordependent variables.Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 94: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .Assume E(Y |X ) is (minimally) smooth in XNo need to specify models (or a class of models), estimators, ordependent variables.Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 95: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .Assume E(Y |X ) is (minimally) smooth in XNo need to specify models (or a class of models), estimators, ordependent variables.Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 96: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .

Assume E(Y |X ) is (minimally) smooth in XNo need to specify models (or a class of models), estimators, ordependent variables.Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 97: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .Assume E(Y |X ) is (minimally) smooth in X

No need to specify models (or a class of models), estimators, ordependent variables.Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 98: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .Assume E(Y |X ) is (minimally) smooth in XNo need to specify models (or a class of models), estimators, ordependent variables.

Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 99: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

We Ask: How Factual is your Counterfactual?

Readers have the right to know: is your counterfactual close enoughto data so that statistical methods provide empirical answers?

If not, the same calculations will be based on indefensible modelassumptions. With the curse of dimensionality, its too easy to fall intothis trap.

A good existing approach: Sensitivity testing, but this requires theuser to specify a class of models and then to estimate them all andcheck how much inferences change

An alternative “Convex Hull” approach:

Specify your explanatory variables, X .Assume E(Y |X ) is (minimally) smooth in XNo need to specify models (or a class of models), estimators, ordependent variables.Results of one run apply to the class of all models, all estimators, andall dependent variables.

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 15 / 23

Page 100: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Interpolation vs Extrapolation in one Dimension

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 16 / 23

Page 101: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Interpolation or Extrapolation in One and Two Dimensions

Figure: The Convex Hull

Interpolation: Inside the convex hull

Extrapolation: Outside the convex hull

Works mathematically for any number of X variables

Software to determine whether a point is in the hull (which is all weneed) without calculating the hull (which would take forever), so itsfast; see http://GKing.harvard.edu/whatif

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 17 / 23

Page 102: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Interpolation or Extrapolation in One and Two Dimensions

Figure: The Convex Hull

Interpolation: Inside the convex hull

Extrapolation: Outside the convex hull

Works mathematically for any number of X variables

Software to determine whether a point is in the hull (which is all weneed) without calculating the hull (which would take forever), so itsfast; see http://GKing.harvard.edu/whatif

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 17 / 23

Page 103: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Interpolation or Extrapolation in One and Two Dimensions

Figure: The Convex Hull

Interpolation: Inside the convex hull

Extrapolation: Outside the convex hull

Works mathematically for any number of X variables

Software to determine whether a point is in the hull (which is all weneed) without calculating the hull (which would take forever), so itsfast; see http://GKing.harvard.edu/whatif

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 17 / 23

Page 104: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Interpolation or Extrapolation in One and Two Dimensions

Figure: The Convex Hull

Interpolation: Inside the convex hull

Extrapolation: Outside the convex hull

Works mathematically for any number of X variables

Software to determine whether a point is in the hull (which is all weneed) without calculating the hull (which would take forever), so itsfast; see http://GKing.harvard.edu/whatif

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 17 / 23

Page 105: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Interpolation or Extrapolation in One and Two Dimensions

Figure: The Convex Hull

Interpolation: Inside the convex hull

Extrapolation: Outside the convex hull

Works mathematically for any number of X variables

Software to determine whether a point is in the hull (which is all weneed) without calculating the hull (which would take forever), so itsfast; see http://GKing.harvard.edu/whatif

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 17 / 23

Page 106: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull:

0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 107: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull:

0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 108: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull:

0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 109: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull:

0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 110: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull:

0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 111: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull:

0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 112: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull:

0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 113: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull: 0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 114: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Replication: Doyle and Sambanis, APSR 2000

Data: 124 Post-World War II civil wars

Dependent variable: peacebuilding success

Treatment variable: multilateral UN peacekeeping intervention (0/1)

Control variables: war type, severity, and duration; developmentstatus; etc...

Counterfactuals: UN intervention switched (0/1 to 1/0) for eachobservation

Percent of counterfactuals in the convex hull: 0%

Thus, without estimating any models, we know inferences will bemodel dependent; for illustration, let’s find an example. . . .

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 18 / 23

Page 115: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Doyle and Sambanis, Logit Model

Original Model Modified ModelVariables Coeff SE P-val Coeff SE P-valWartype −1.742 .609 .004 −1.666 .606 .006Logdead −.445 .126 .000 −.437 .125 .000Wardur .006 .006 .258 .006 .006 .342Factnum −1.259 .703 .073 −1.045 .899 .245Factnum2 .062 .065 .346 .032 .104 .756Trnsfcap .004 .002 .010 .004 .002 .017Develop .001 .000 .065 .001 .000 .068Exp −6.016 3.071 .050 −6.215 3.065 .043Decade −.299 .169 .077 −0.284 .169 .093Treaty 2.124 .821 .010 2.126 .802 .008UNOP4 3.135 1.091 .004 .262 1.392 .851Wardur*UNOP4 — — — .037 .011 .001Constant 8.609 2.157 0.000 7.978 2.350 .000N 122 122Log-likelihood -45.649 -44.902Pseudo R2 .423 .433

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 19 / 23

Page 116: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Doyle and Sambanis: Model Dependence

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 20 / 23

Page 117: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Biases in Causal Inference: A New Decomposition

d = mean(Y |D = 1)−mean(Y |D = 0)

bias ≡ E (d)− θ = ∆o + ∆p + ∆i + ∆e

∆o Omitted variable bias (ignorability) (you know this!)

∆p Post-treatment bias (check this with theory!)

∆i Interpolation bias (Usually not so bad; use models or matching)

∆e Extrapolation bias (check this with data!)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 21 / 23

Page 118: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Biases in Causal Inference: A New Decomposition

d = mean(Y |D = 1)−mean(Y |D = 0)

bias ≡ E (d)− θ = ∆o + ∆p + ∆i + ∆e

∆o Omitted variable bias (ignorability) (you know this!)

∆p Post-treatment bias (check this with theory!)

∆i Interpolation bias (Usually not so bad; use models or matching)

∆e Extrapolation bias (check this with data!)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 21 / 23

Page 119: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Biases in Causal Inference: A New Decomposition

d = mean(Y |D = 1)−mean(Y |D = 0)

bias ≡ E (d)− θ

= ∆o + ∆p + ∆i + ∆e

∆o Omitted variable bias (ignorability) (you know this!)

∆p Post-treatment bias (check this with theory!)

∆i Interpolation bias (Usually not so bad; use models or matching)

∆e Extrapolation bias (check this with data!)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 21 / 23

Page 120: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Biases in Causal Inference: A New Decomposition

d = mean(Y |D = 1)−mean(Y |D = 0)

bias ≡ E (d)− θ = ∆o + ∆p + ∆i + ∆e

∆o Omitted variable bias (ignorability) (you know this!)

∆p Post-treatment bias (check this with theory!)

∆i Interpolation bias (Usually not so bad; use models or matching)

∆e Extrapolation bias (check this with data!)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 21 / 23

Page 121: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Biases in Causal Inference: A New Decomposition

d = mean(Y |D = 1)−mean(Y |D = 0)

bias ≡ E (d)− θ = ∆o + ∆p + ∆i + ∆e

∆o Omitted variable bias (ignorability) (you know this!)

∆p Post-treatment bias (check this with theory!)

∆i Interpolation bias (Usually not so bad; use models or matching)

∆e Extrapolation bias (check this with data!)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 21 / 23

Page 122: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Biases in Causal Inference: A New Decomposition

d = mean(Y |D = 1)−mean(Y |D = 0)

bias ≡ E (d)− θ = ∆o + ∆p + ∆i + ∆e

∆o Omitted variable bias (ignorability) (you know this!)

∆p Post-treatment bias (check this with theory!)

∆i Interpolation bias (Usually not so bad; use models or matching)

∆e Extrapolation bias (check this with data!)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 21 / 23

Page 123: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Biases in Causal Inference: A New Decomposition

d = mean(Y |D = 1)−mean(Y |D = 0)

bias ≡ E (d)− θ = ∆o + ∆p + ∆i + ∆e

∆o Omitted variable bias (ignorability) (you know this!)

∆p Post-treatment bias (check this with theory!)

∆i Interpolation bias (Usually not so bad; use models or matching)

∆e Extrapolation bias (check this with data!)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 21 / 23

Page 124: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Biases in Causal Inference: A New Decomposition

d = mean(Y |D = 1)−mean(Y |D = 0)

bias ≡ E (d)− θ = ∆o + ∆p + ∆i + ∆e

∆o Omitted variable bias (ignorability) (you know this!)

∆p Post-treatment bias (check this with theory!)

∆i Interpolation bias (Usually not so bad; use models or matching)

∆e Extrapolation bias (check this with data!)

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 21 / 23

Page 125: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Interpolation vs Extrapolation Bias

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 22 / 23

Page 126: Advanced Quantitative Research Methodology, Lecture...Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes:March 26, 2016 2 / 23Model Dependence in Counterfactual

Causal Effect of Multidimensional UN PeacekeepingOperations

Gary King (Harvard, IQSS) Advanced Quantitative Research Methodology, Lecture Notes: Model Dependence in Counterfactual InferenceMarch 26, 2016 23 / 23