multiple regression - massachusetts institute of...

15
Multiple Regression Why to Control March 18, 2004 17.871

Upload: dodieu

Post on 20-Mar-2018

227 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Multiple Regression

Why to ControlMarch 18, 2004

17.871

Page 2: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Gore Likeability Example

• Suppose:– Gore’s* likeability is a function of Clinton’s

likeability and not directly a function of party– Clinton’s likeability is a function of one’s

partisan identification plus other factors

– What would the regression of Gore likeability on Clinton likeability look like?

Party ID Clinton Likeability

GoreLikeability

e1 e2

*This example probably works better if we’re predicting the likeability of Socks the cat.

Page 3: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Democratic picture

Clinton thermometer

Page 4: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Independent picture

Clinton thermometer

Page 5: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Republican picture

Clinton thermometer

Page 6: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Combined data picture

Clinton thermometer

Page 7: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Combined data picture with regression

Clinton thermometer

Page 8: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Combined data picture with “true” regression lines overlaid

Clinton thermometer

Page 9: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Tempting yet wrong normalizations

Clinton thermometer

Clinton thermometer

Page 10: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Summary: Why we control• Remove confounding effects

• Improve efficiency

Party

Clintonlikeability

Gorelikeability

e

e

Party

Clintonlikeability

Gorelikeability

e

Clinton thermometer

Page 11: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Look at actual data

Gore thermometer

0

100

0 100

0 100

Clintonthermometer

0

100

-1 1-1

1

Party (3 pointscale)

graph7 clinton gore party3, matrix

Page 12: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Look at actual data (jitter)

Gore thermometer

0

100

0 100

0 100

Clintonthermometer

0

100

-1 1-1

1

Party (3 pointscale)

graph7 gore clinton party3, matrix jitter(5)

Page 13: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Another way to deal with the mass point problem

Clin

ton

ther

mom

eter

Gore thermometer0 100

0

100 . collapse (count) n=party3,by(clinton gore)

. graph7 clinton gore [fweight=n]

Page 14: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Gore vs. ClintonG

ore

ther

mom

eter

Graphs by Party (3 point scale)Clinton thermometer

Gore thermometer pyp Fitted values

party3==-1

0

100

party3==0

0 100party3==1

0 1000

100 Overall

Within party

Rep. Ind.

Dem.

Page 15: Multiple Regression - Massachusetts Institute of Technologydspace.mit.edu/.../contents/lecture-notes/multiple_regres2.pdf · Multiple Regression Why to Control March 18, 2004 17.871

Gore vs. partyG

ore

ther

mom

eter

Graphs by clinton_levelParty (3 point scale)

Gore thermometer Fitted values py_party3_all

clinton_level==-1

0

100

clinton_level==0

-1 1clinton_level==1

-1 10

100

Overall

Within party

Clinton low Clinton med.

Clinton high