2011 10 18 moderation mediation
TRANSCRIPT
-
8/9/2019 2011 10 18 Moderation Mediation
1/21
1
Moderator and MediatorAnalysis
Marijtje van Duijn
October 18, 2011
Seminar General Statistics
October 18, 2011 Moderator and mediator analysis 2
Overview
What is moderation and mediation?
What is their relation to statistical
concepts? Example(s)
-
8/9/2019 2011 10 18 Moderation Mediation
2/21
2
October 18, 2011 Moderator and mediator analysis 3
Mediation and Moderation
X1
X2
Y
October 18, 2011 Moderator and mediator analysis 4
Examples
Y : test score
X1 : sex, SES, etc.
X2 : ability (IQ score)
Y : test score
X1 : brain volume, SES parents
X2 : ability
More?
-
8/9/2019 2011 10 18 Moderation Mediation
3/21
3
October 18, 2011 Moderator and mediator analysis 5
Multiple regression
Goal: to explain variation in Y using Xs
Assumptions Independent observations
Normality (of residuals) and constantvariance
Linearity (of relationship Y and Xs)
iiii EXXY +++=
22110
October 18, 2011 Moderator and mediator analysis 6
Regression Model
X1
X2
1
Y
2
E
-
8/9/2019 2011 10 18 Moderation Mediation
4/21
4
October 18, 2011 Moderator and mediator analysis 7
Explained variance in regression
Y
X1 X2
Circles represent variancesX1 andX2 explain different parts of Y and are independent
October 18, 2011 Moderator and mediator analysis 8
Multicollinearity
Y
X2X1
competition between variables for explaining Y
Degree depends on correlation between XsVariance inflation factor (VIF): worse (= less precise)
parameter estimation
-
8/9/2019 2011 10 18 Moderation Mediation
5/21
-
8/9/2019 2011 10 18 Moderation Mediation
6/21
6
October 18, 2011 Moderator and mediator analysis 11
Mediation as a special case ofmulticollinearity and/or model selection
Causal model defines direction of arrows
X2 is the mediator (M)
Also: intervening or process variable
Or: indirect causal relationship
Relations between all variables are assumed
to be positive Question is whether direct effect betweenX1
and Y disappears when M is added to theregression equation
October 18, 2011 Moderator and mediator analysis 12
Mediation(Baron & Kenny, 1986),
http://davidakenny.net/cm/mediate.htm)
X Yc
ba
c
M
-
8/9/2019 2011 10 18 Moderation Mediation
7/21
7
October 18, 2011 Moderator and mediator analysis 13
Mediation as prescribed byBaron and Kenny (1986)
Estimate regression of Yon only X1 Estimated parameter c
Estimate regression of Mon X1 Estimated parameter a
Estimate effect of Mon Y, together with X1
Estimated parameters b and c Complete mediation: c=0
Partial mediation: c< c(can be tested)
October 18, 2011 Moderator and mediator analysis 14
Testing of change in c
Amount of mediation c-c
Theoretically equal to ab (indirect path)
Standard error of ab is approximately squareroot of
b2sa2 + a2sb2 (Sobel test)
see (do) http://quantpsy.org/sobel/sobel.htmNote: neither c nor c are needed!
Some eye-balling also possible
Or: nonparametric tests (based on bootstrapping)
-
8/9/2019 2011 10 18 Moderation Mediation
8/21
8
October 18, 2011 Moderator and mediator analysis 15
Example (Miles and Shevlin)
Y: read, a measure of the number of books thatpeople have read.
X: enjoy, scale score to measure how much peopleenjoy reading books
M: buy, a measure of how many books people havebought in the previous 12 months
Idea: how much people enjoy reading books -> the
number of books bought -> the number of books readBut: is the number of books a complete mediator.?
(People could go to the library or borrow books fromfriends.)
October 18, 2011 Moderator and mediator analysis 16
Descriptive Statistics
MeanStd.
Deviation Nread 8,85 3,563 40
buy 15,73 8,165 40
enjoy 9,28 5,354 40
Correlations
read buy enjoyPearsonCorrelation
read 1,000 ,747 ,732
buy ,747 1,000 ,644
enjoy ,732 ,644 1,000
Coefficientsa
Model
Unstandardized
Coefficients
Standardized
Coefficients
t Sig.B Std. Error Beta1 (Constant) 4,331 ,785 5,517 ,000
enjoy ,487 ,074 ,732 6,625 ,000
a. Dependent Variable: read
Step 1
-
8/9/2019 2011 10 18 Moderation Mediation
9/21
9
October 18, 2011 Moderator and mediator analysis 17
Step 2Coefficientsa
Model
Unstandardized
Coefficients
Standardiz
ed
Coefficients
t Sig.B Std. Error Beta1 (Constant) 6,616 2,020 3,274 ,002
enjoy ,982 ,189 ,644 5,190 ,000
a. Dependent Variable: buy
Step 3Coefficientsa
Model
Unstandardized
Coefficients
Standardized
Coefficients
t Sig.
Correlations
B Std. Error BetaZero-order Partial Part
1 (Constant) 2,973 ,765 3,887 ,000
buy ,205 ,054 ,471 3,786 ,001 ,747 ,528 ,360
enjoy ,286 ,083 ,429 3,452 ,001 ,732 ,494 ,328
a. Dependent Variable: read
October 18, 2011 Moderator and mediator analysis 18
Examples
Baron/Kenny + Sobel a=.982 sa =.189
b=.487 sb =.074
ab=.478, sab =(.4872*.1892+.9822*.0742)= .12
z-test 4.08; p
-
8/9/2019 2011 10 18 Moderation Mediation
10/21
10
October 18, 2011 Moderator and mediator analysis 19
Model choice is important
Many other patterns of association arepossible Arrows between X1 and X2 may be
reversed not always clear which variablemediates
No causal relation, just association
Explicit assumption of positiveassociations and ordering of (semi-)partial correlations. Not guaranteed
October 18, 2011 Moderator and mediator analysis 20
Moderation
X1
X2
Y
-
8/9/2019 2011 10 18 Moderation Mediation
11/21
11
October 18, 2011 Moderator and mediator analysis 21
Moderation is interaction
The effect of X2depends on the value of X1(or vice versa) different slopes for different folks
can be a way to tackle non-linearity in regression
Interpretation depends on type of variable
Interaction usually (but not necessarily)
product of X1 and X2 X1 and X2may be correlated
Importance of model formulation
October 18, 2011 Moderator and mediator analysis 22
Regression model
Centering is (usually) advised
Facilitates interpretation Reduces the inevitable multicollinearity
incurred with interaction terms
iiiiii EXXXXY ++++=
21322110
i
c
i
c
i
c
i
c
ii EXXXXY ++++=
21322110
-
8/9/2019 2011 10 18 Moderation Mediation
12/21
12
October 18, 2011 Moderator and mediator analysis 23
X1 dichotomous andX2 continuous
X1 = 0 orX1 = 1 (e.g. man/woman; control/experimental group)
Y =0 +1X1 + 2X2 +3X1X2 X1 = 0: Y = 0 +2X2 X1 = 1: Y = (0 +1) + (2 +3)X2
so: intercept and regressioneffect of X2change interaction effect represents the change in effect ofX2 or,
the difference in the effect ofX2 between the two groups interpretation of 1 General formula: Y = (0 + 1X1) + (2+3X1)X2
October 18, 2011 Moderator and mediator analysis 24
X1 andX2 dichotomous
X1 = 0 orX1 = 1 (e.g. man/woman)
X2 = 0 orX2 = 1 (e.g. control/experimental grp)
Y =0 +1X1 +2X2 +3X1X2 X1 = 0, X2 = 0 : Y = 0 X1 = 1, X2 = 0 : Y = 0 + 1 X1 = 0, X2 = 1 : Y = 0 + 2 X1 = 1, X2 = 1 : Y = 0 + 1+2+3
so: defines four groups with their own mean
Interaction defines extra effect ofX1 =1 andX2 =1
-
8/9/2019 2011 10 18 Moderation Mediation
13/21
13
October 18, 2011 Moderator and mediator analysis 25
X1 nominal andX2 continuous
X1 takes on more than 2 (c) values (e.g. age groups, control/exp1/exp2)
Make dummies, for each contrast, wrt 1reference group (e.g. controls) Leads to c-1 dichotomous variables
And also c-1 interaction terms Also possible: dummies (indicators) for
each group leaving out the intercept
October 18, 2011 Moderator and mediator analysis 26
c=3; group 1 reference group
group D1 D2
1 0 0
2 1 0
3 0 1
Y =0 +1dD1 +2dD2 +2X2 +3D1X2 +4D2X2 group1: Y =0 +2X2 group2: Y = (0 +1d) + (2 +3)X2 group3: Y = (0 +2d) + (2 +4)X2
-
8/9/2019 2011 10 18 Moderation Mediation
14/21
14
October 18, 2011 Moderator and mediator analysis 27
X1 andX2 continuous
Y =0 +1X1 +2X2 +3X1X2
Y = (0 +1X1) + (2+3X1)X2
Y = (0 +2X2) + (1+3X2)X1
1X
3
2
*
2
2X
3
1
*
1
2X1
X3
2
X2
1
X1
0
*
0
2c1c32c
*
1c
**
222c
111c
XXXXY
XXX
XXX
210
=
=
+=
+++=
=
=
1c
**
22
2c
**
11
XY:XX
XY:XX
10
20
+==
+==
October 18, 2011 Moderator and mediator analysis 28
Example (Miles and Shevlin)
Y: grade in statistics course
X1: number of books read (0-4)
X2: number of classes attended (0-20)Descriptive Statistics
Mean Std. Deviation Ngrade 63,5500 16,70552 40
books 2,00 1,432 40
attend 14,10 4,278 40
-
8/9/2019 2011 10 18 Moderation Mediation
15/21
15
October 18, 2011 Moderator and mediator analysis 29
October 18, 2011 Moderator and mediator analysis 30
Coefficientsa
Model
Unstandardized
Coefficients
Standardiz
ed
Coefficient
s
t Sig.
Correlations
Collinearity
Statistics
B Std. Error Beta
Zero-
order Partial Part
Toleran
ce VIF1 (Constant) 63,422 2,223 28,534 ,000
booksc 4,037 1,753 ,346 2,303 ,027 ,492 ,354 ,310 ,803 1,245
attendc 1,283 ,587 ,329 2,187 ,035 ,482 ,338 ,295 ,803 1,245
2 (Constant) 61,469 2,320 26,494 ,000
booksc 4,081 1,677 ,350 2,433 ,020 ,492 ,376 ,314 ,803 1,245
attendc 1,333 ,562 ,341 2,372 ,023 ,482 ,368 ,306 ,802 1,247
bookscxatte
ndc
,735 ,349 ,271 2,104 ,042 ,241 ,331 ,271 ,997 1,003
a. Dependent Variable: grade
-
8/9/2019 2011 10 18 Moderation Mediation
16/21
16
October 18, 2011 Moderator and mediator analysis 31
Other example
(missed) interaction:non-linearity?
3 groups witheach a distinctlinear relation
October 18, 2011 Moderator and mediator analysis 32
-
8/9/2019 2011 10 18 Moderation Mediation
17/21
17
October 18, 2011 Moderator and mediator analysis 33
October 18, 2011 Moderator and mediator analysis 34
Model Summaryb
.405a .164 .134 2.57780
Model1
R R Square
Adjusted
R Square
Std. Error of
the Estimate
Predictors: (Constant), xa.
Dependent Variable: yb.
Coefficientsa
5.724 1.029 5.560 .000
.163 .070 .405 2.341 .027 .405 .405 .405 1.000 1.000
(Constant)
x
Model1
B Std. Error
Unstandardized
Coefficients
Beta
Standardized
Coefficients
t Sig. Zero-order Partial Part
Correlations
Tolerance VIF
Collinearity Statistics
Dependent Variable: ya.
Residuals Statisticsa
5.8864 9.7927 7.8667 1.12042 30
-4.88636 3 .16046 .00000 2.53297 30
-1.767 1.719 .000 1.000 30
-1.896 1.226 .000 .983 30
Predicted Value
Residual
Std. Predicted Value
Std. Residual
M in imum Maximum Mean S td . Deviat ion N
Dependent Variable: ya.
20-2
Regression Standardized Residual
6
4
2
0
Frequency
Mean =-2,78E-17Std. Dev. =0,983
N =30
Histogram
Dependent Variable: y
1,00,80,60,40,20,0
Observed Cum Prob
1,0
0,8
0,6
0,4
0,2
0,0
ExpectedCumProb
Normal P-P Plot of Regression Standardized Residual
Dependent Variable: y
-
8/9/2019 2011 10 18 Moderation Mediation
18/21
18
October 18, 2011 Moderator and mediator analysis 35
Model Summaryb
.980a .960 .957 .57477
Model1
R R SquareAdjustedR Square
Std. Error ofthe Estimate
Predictors: (Constant), xkwadraat, xa.Dependent Variable: yb.
Coefficientsa
8.512 .259 32.839 .000
.140 .016 .349 9.042 .000 .405 .867 .348 .996 1.004
-.054 .002 -.894 -23.156 .000 -.916 -.976 -.892 .996 1.004
(Constant)
x
xkwadraat
Model1
B Std. Error
Unstandardized
Coefficients
Beta
Standardized
Coefficients
t Sig. Zero-order Partial Part
Correlations
Tolerance VIF
Collinearity Statistics
Dependent Variable: ya.
Residuals Statisticsa
.5888 10.4412 7.8667 2.71361 30
-.82773 1.16910 .00000 .55460 30
-2.682 .949 .000 1.000 30-1.440 2.034 .000 .965 30
Predicted Value
Residual
Std. Predicted ValueStd. Residual
M in imum Maximum Mean S td . Deviat ion N
Dependent Variable: ya.
20-2
Regression Standardized Residual
6
4
2
0
Frequency
Mean =-3,61E-16Std. Dev. =0,965
N =30
Histogram
Dependent Variable: y
15,0010,005,000,00-5,00-10,00-15,00
x
2,00
0,00
-2,00
y
Partial Regression Plot
Dependent Variable: y
October 18, 2011 Moderator and mediator analysis 36
Model Summaryb
.410a .168 .107 2.61796
Model1
R R SquareAdjustedR Square
Std. Error ofthe Estimate
Predictors: (Constant), groep, xa.
Dependent Variable: yb.
Coefficientsa
6.021 1.301 4.629 .000
.220 .166 .548 1.329 .195 .405 .248 .233 .181 5.515
-.528 1.375 -.158 -.384 .704 .337 -.074 -.067 .181 5.515
(Constant)
x
groep
Model1
B Std. Error
Unstandardized
Coefficients
Beta
Standardized
Coefficients
t Sig. Zero-order Partial Part
Correlations
Tolerance VIF
Collinearity Statistics
Dependent Variable: ya.
Residuals Statisticsa
5.7131 9.9467 7.8667 1.13588 30
-4.71313 3 .17007 .00000 2.52607 30
-1.896 1.831 .000 1.000 30
-1.800 1.211 .000 .965 30
Predicted Value
Residual
Std. Predicted Value
Std. Residual
M in imum Maximum Mean S td . Deviat ion N
Dependent Variable: ya.
20-2
Regression Standardized Residual
6
4
2
0
Frequency
Mean =5,69E-16Std. Dev. =0,965
N =30
Histogram
Dependent Variable: y
4,002,000,00-2,00-4,00
x
2,50
0,00
-2,50
-5,00
y
Partial Regression Plot
Dependent Variable: y
-
8/9/2019 2011 10 18 Moderation Mediation
19/21
19
October 18, 2011 Moderator and mediator analysis 37
Model Summaryb
.750a .562 .512 1.93502
Model1
R R SquareAdjustedR Square
Std. Error ofthe Estimate
Predictors: (Constant), groep3, groep2, xa.Dependent Variable: yb.
Coefficientsa
4.556 .912 4.994 .000
.172 .123 .427 1.396 .174 .405 .264 .181 .180 5.552
3.476 1.310 .602 2.653 .013 .645 .462 .344 .327 3.057
-.326 2.038 -.056 -.160 .874 -.030 -.031 -.021 .135 7.394
(Constant)
x
groep2
groep3
Model1
B Std. Error
UnstandardizedCoefficients
Beta
StandardizedCoefficients
t Sig. Zero-order Partial Part
Correlations
Tolerance VIF
Collinearity Statistics
Dependent Variable: ya.
Residuals Statisticsa
4.7273 1 1.1227 7.8667 2.07709 30
-3.72727 3 .72727 .00000 1.83220 30
-1.511 1.568 .000 1.000 30-1.926 1.926 .000 .947 30
Predicted Value
Residual
Std. Predicted ValueStd. Residual
M in imum Maximum Mean S td . Deviat ion N
Dependent Variable: ya.
20-2
Regression Standardized Residual
8
6
4
2
0
Frequency
Mean=3,47E-16Std.Dev. =0,947
N =30
Histogram
Dependent Variable: y
4,002,000,00-2,00-4,00
x
4,00
2,00
0,00
-2,00
-4,00
y
Partial Regression Plot
Dependent Variable: y
October 18, 2011 Moderator and mediator analysis 38
Model Summaryb
.997a .993 .992 .25050
Model1
R R Square
Adjusted
R Square
Std. Error of
the Estimate
Predictors: (Constant), intxgr3, intxgr2, x, groep2,
groep3
a.
Dependent Variable: yb.
Coefficientsa
4.97E-014 .171 .000 1.000
1.000 .028 2.485 36.259 .000 .405 .991 .609 .060 16.657
10.145 .417 1.756 24.309 .000 .645 .980 .408 .054 18.505
18.000 .596 3.116 30.201 .000 -.030 .987 .507 .026 37.737
-.985 .039 -2.378 -25.250 .000 .626 -.982 -.424 .032 31.455
-1.500 .039 -5.401 - 38.458 .000 -.081 -.992 -.646 .014 69.919
(Constant)
x
groep2
groep3
intxgr2
intxgr3
Model1
B Std. Error
Unstandardized
Coefficients
Beta
Standardized
Coefficients
t Sig. Zero-order Partial Part
Correlations
Tolerance VIF
Collinearity Statistics
Dependent Variable: ya.
Residuals Statisticsa
1.0000 1 0.4182 7.8667 2.76031 30
-.41818 .65758 .00000 .22789 30
-2.488 .924 .000 1.000 30
-1.669 2.625 .000 .910 30
Predicted Value
Residual
Std. Predicted Value
Std. Residual
M in imum Maximum Mean S td . Deviat ion N
Dependent Variable: ya.
20-2
Regression Standardized Residual
12
10
8
6
4
2
0
Frequency
Mean =2,81E-15Std.Dev.=0,91
N =30
Histogram
Dependent Variable: y
4,002,000,00-2,00-4,00
x
4,00
2,00
0,00
-2,00
-4,00
y
Partial Regression Plot
Dependent Variable: y
-
8/9/2019 2011 10 18 Moderation Mediation
20/21
20
October 18, 2011 Moderator and mediator analysis 39
Model choice important
Missing interaction may result in
Violations of linearity assumption
non-constant variance (heterogeneity)
Incorrect model choice and interpretation
October 18, 2011 Moderator and mediator analysis 40
Conclusion
Model choice and selection crucial indetecting mediation and moderation
Substantive / theoretical considerationsshould guide the model selectionprocess!
-
8/9/2019 2011 10 18 Moderation Mediation
21/21
October 18, 2011 Moderator and mediator analysis 41
Other methods/models
Mediation sometimes too simple
More refined path analysis
Regression analysis sometimes too simple
More elaborate models for causalmodeling
Structural Equation Models