reading and interpreting quantitative intervention research syntheses: an introduction

52
Reading and interpreting quantitative intervention research syntheses: an introduction Steve Higgins, Durham University Robert Coe, Durham University Mark Newman, EPPI Centre, IoE, London University James Thomas, EPPI Centre, IoE, London University Carole Torgerson, IEE, York University Part 2

Upload: gwidon

Post on 13-Jan-2016

22 views

Category:

Documents


0 download

DESCRIPTION

Reading and interpreting quantitative intervention research syntheses: an introduction. Steve Higgins, Durham University Robert Coe, Durham University Mark Newman, EPPI Centre, IoE, London University James Thomas, EPPI Centre, IoE, London University Carole Torgerson, IEE, York University - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Reading and interpreting quantitative intervention research syntheses: an introduction

Reading and interpreting quantitative intervention research syntheses: an

introduction

Steve Higgins, Durham University

Robert Coe, Durham University

Mark Newman, EPPI Centre, IoE, London University

James Thomas, EPPI Centre, IoE, London University

Carole Torgerson, IEE, York University

Part 2

Page 2: Reading and interpreting quantitative intervention research syntheses: an introduction

Acknowledgements

• This presentation is an outcome of the work of the ESRC-funded Researcher Development Initiative: “Training in the Quantitative synthesis of Intervention Research Findings in Education and Social Sciences” which ran from 2008-2011.

• The training was designed by Steve Higgins and Rob Coe (Durham University), Carole Torgerson (Birmingham University) and Mark Newman and James Thomas, Institute of Education, London University.

• The team acknowledges the support of Mark Lipsey, David Wilson and Herb Marsh in preparation of some of the materials, particularly Lipsey and Wilson’s (2001) “Practical Meta-analysis” and David Wilson’s slides at: http://mason.gmu.edu/~dwilsonb/ma.html (accessed 9/3/11).

• The materials are offered to the wider academic and educational community community under a Creative Commons licence: Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License

• You should only use the materials for educational, not-for-profit use and you should acknowledge the source in any use.

Page 3: Reading and interpreting quantitative intervention research syntheses: an introduction

Session 2

1.30 pm Reading and interpreting a meta-analysisOverview of challenges to effective meta-analysis

3.00 pm Break

3.15 pm Summary, conclusions and evaluation

4.00 pm Finish

Page 4: Reading and interpreting quantitative intervention research syntheses: an introduction

Recap

• Should be conducted as part of a systematic (or at least transparent) review

• Meta-analysis is the statistical combination of research study findings to answer a specific question

• Uses a common metric - effect size - to aggregate and explore the findings across studies

Page 5: Reading and interpreting quantitative intervention research syntheses: an introduction

Stages of synthesis

What is the question?Theories and assumptions in the review question

What is the result?

What new research questions emerge?

What data are available?By addressing review question according to conceptual framework

How does integrating the data answer the question?To address the question (including theory testing or development).

What does the result mean? (conclusions)

How robust is the synthesis?For quality, sensitivity, coherence & relevance.

Cooper, H.M. (1982) Scientific Guidelines for Conducting Integrative Research Reviews Review Of Educational Research 52; 291See also: Popay et al. (2006) Guidance on the Conduct of Narrative Synthesis in Systematic Reviews. Lancaster: Institute for Health Research, Lancaster University. http://www.lancs.ac.uk/fass/projects/nssr/research.htm

What are the patterns in the data?Including study, intervention, outcomes and participant characteristicsCan the

conceptual framework be developed?

Page 6: Reading and interpreting quantitative intervention research syntheses: an introduction

Procedures for a meta analysis

• Key question• Search/ retrieval

strategy• Inclusion/ exclusion

criteria• Coding• Analysis• Synthesis

What is the question?

What data are available?

How robust is the synthesis?

How does integrating the data answer the question?

What are the conclusions?

What patterns are in the data?

What us the result?

Page 7: Reading and interpreting quantitative intervention research syntheses: an introduction

Overview

• Interpreting a meta-analysis– Forest plots and other forms of data presentation

• Issues in meta-analysis– Research designs and quality– Heterogeneity– Models for pooling the results

Page 8: Reading and interpreting quantitative intervention research syntheses: an introduction

Forest plots

• Effective way of presenting results– Studies, effect sizes, confidence intervals– Provides an overview of consistency of

effects– Summarises an overall effect (with

confidence interval)

• Useful visual model of a meta-analysis

Page 9: Reading and interpreting quantitative intervention research syntheses: an introduction

Anatomy of a forest plot

Studies

N of study

Line of no effect

C.I

Study effect size

Pooled effect size

Pooled effect size

Study effect size (with C.I.)

Weighting of study in meta-analysis

Page 10: Reading and interpreting quantitative intervention research syntheses: an introduction

Exercise

1. What is the effect size in the Bletchman et al. study

2. Is the effect size in the Kelley et al. study bigger or smaller than in the Patrick & Marsh study?

3. How many subjects were in the Patrick and Marsh Study?

4. What is the 95% confidence interval of the pooled effect size?

5. What is the weighting given to the Bletchman study in the meta-analysis?

6. How does the confidence interval differ between the Kelley et al. study and the other two?

Page 11: Reading and interpreting quantitative intervention research syntheses: an introduction

A Systematic Review of the Research Literature on the Use of

Phonics in the Teaching of Reading and Spelling

Torgerson, Brooks and Hall, 2006 Department for Education and Skills (DfES)

commissioned the Universities of York and Sheffield to conduct a systematic review of experimental research on the use of phonics instruction in the teaching of reading and spelling. This review is based on evidence from randomised controlled trials (RCTs).

Page 12: Reading and interpreting quantitative intervention research syntheses: an introduction

Interpreting a forest plot

Have a look at the forest plot from the meta-analysis of phonics interventions on the handout. These are RCTs with a separate analysis for lower attaining (Cluster 0) and normally attaining pupils (Cluster 1)– What do you notice?– Work in a pair or small group to ‘read’ it to each

other– What questions can you raise about the meta-

analysis?

Page 13: Reading and interpreting quantitative intervention research syntheses: an introduction

Forest plot

Standardised mean difference

Favours Control Favours Phonics

-3.7709 0 3.77098

Study Standardised mean difference (95% CI) % Weight

Ability==0 Greaney 0.30 (-0.36, 0.95) 6.8 Lovett89 0.22 (-0.14, 0.57) 23.1 Lovett90 -0.20 (-0.85, 0.46) 6.9 Martinussen 0.46 (-0.30, 1.21) 5.2 O'Connor 0.57 (-0.59, 1.73) 2.2 Torgesen99 0.07 (-0.34, 0.48) 17.3 Torgesen01 -0.31 (-0.87, 0.24) 9.5 Umbach 2.77 ( 1.77, 3.77) 2.9

Subtotal 0.21 ( 0.01, 0.41) 73.9

Ability==1 Haskell 0.07 (-0.73, 0.87) 4.6 Johnston 0.97 ( 0.43, 1.51) 10.1 Leach 0.84 (-0.08, 1.75) 3.5 Skailand -0.17 (-0.78, 0.44) 8.0

Subtotal 0.45 ( 0.11, 0.78) 26.1

Overall 0.27 ( 0.10, 0.45) 100.0

Page 14: Reading and interpreting quantitative intervention research syntheses: an introduction

Issues to consider

Is it reasonable to combine the results of the individual studies ?i) Study design/ quality

ii) Are the studies too different (heterogeneity)• Methodological heterogeneity • Educational heterogeneity• Statistical heterogeneity

Page 15: Reading and interpreting quantitative intervention research syntheses: an introduction

Assessing between study heterogeneity

• When effect sizes differ consistent with chance error, the effect size estimate is considered to be homogeneous (unique true effect).

• When the variability in effect sizes is greater than expected by chance, the effects are considered to be heterogeneous

• Presence of heterogeneity affects the process of the meta-analysis

Page 16: Reading and interpreting quantitative intervention research syntheses: an introduction

Methodological quality

• Traditional reviews privilege methodological rigour– Low quality studies have higher effect sizes (Hattie

Biggs & Purdie, 1996)

– No difference (Marzano, 1998)

– High quality studies, higher effect sizes (Lipsey & Wilson, 1993)

• Depends on your definition of quality– Assessing quality– Dimensions of quality– Exploring its impact

Page 17: Reading and interpreting quantitative intervention research syntheses: an introduction

Methodological quality

• What about ‘low quality’ studies?– All studies are likely to have weaknesses

(methodological quality is on a range or continuum)

– Exclusivity restricts the scope and scale of the analysis and generalizability

– Inclusivity may weaken confidence in the findings– Some methodological quality is in the “eye-of-the-

beholder”– Needs a balance appropriate to the key research

question

Page 18: Reading and interpreting quantitative intervention research syntheses: an introduction

Which designs?

• RCTs only?

• RCTs plus rigorously controlled experimental and quasi-experimental designs?

• All RCTs, and experimental designs?

• All pre-post comparisons?

Page 19: Reading and interpreting quantitative intervention research syntheses: an introduction

Task: Diamond Ranking

• Have a look at the different descriptions of research– Which do you think it would

be most appropriate for a meta-analysis?Which would be the least appropriate?

– Can you place or rank the others?

Most importantMost important

Least importantLeast important

Page 20: Reading and interpreting quantitative intervention research syntheses: an introduction

Methodological heterogeneity

• Study design

• Sample characteristics

• Assessment (measures, timing)

Page 21: Reading and interpreting quantitative intervention research syntheses: an introduction
Page 22: Reading and interpreting quantitative intervention research syntheses: an introduction

Educational heterogeneity

• ‘Clinical’or ‘pedagogical’ heterogeneity

• Systematic variation in response to the intervention– Teacher level effects– Pupil level effects

Page 23: Reading and interpreting quantitative intervention research syntheses: an introduction

Statistical

• Due to chance

• Unexplainable

Page 24: Reading and interpreting quantitative intervention research syntheses: an introduction

Statistical methods to identify heterogeneity

• Presence– Q statistic (Cooper & Hedges, 1994)

• Significance level (p-value) 2

2

• Extent– I2

(Higgins & Thompson, 2002)• If it exceeds 50%, it may be advisable not to combine the studies

All have low power with a small number of studies (Huedo-Medina et al. 2006)

Page 25: Reading and interpreting quantitative intervention research syntheses: an introduction

Exploring heterogeneity

• In a meta-analysis, exploring heterogeneity of effect can be as important as reporting averages

• Exploring to what extent the variation can be explained by factors in the coding of studies (age, gender, duration of intervention etc)

• Forming sub-groups with greater homogeneity

• Identifying the extent of the variation through further analysis

Page 26: Reading and interpreting quantitative intervention research syntheses: an introduction

Coding for exploration

• Factors which may relate to variation– The intervention

• E.g. duration, intensity, design, implementation

– The sample• E.g. age, gender, ethnicity, particular needs

– The research• E.g. design (RCT, quasi-experimental), quality,

tests/outcomes, comparison group

Page 27: Reading and interpreting quantitative intervention research syntheses: an introduction

Pooling the results

• In a meta-analysis, the effects found across studies are combined or ‘pooled’ to produce a weighted average effect of all the studies-the summary effect.

• Each study is weighted according to some measure of its importance.

• In most meta-analyses, this is achieved by giving a weight to each study in inverse proportion to the variance of its effect.

Page 28: Reading and interpreting quantitative intervention research syntheses: an introduction

Fixed effect model

• The difference between the studies is due to chance– Observed study effect = Fixed effect + error

Page 29: Reading and interpreting quantitative intervention research syntheses: an introduction

Fixed effect model

Each study is seen as being a sample from a distribution of studies, all estimating the same overall effect, but differing due to random error

Page 30: Reading and interpreting quantitative intervention research syntheses: an introduction

Random effects model

Assumes there are two component of variation

1. Due to differences within the studies (e.g. different design, different populations, variations in the intervention, different implementation, etc.)

2. Due to sampling error

Page 31: Reading and interpreting quantitative intervention research syntheses: an introduction

Random effects model

There are two separable effects that can be measured

1. The effect that each study is estimating

2. The common effect that all studies are estimating

Observed study effect = study specific (random) effect + error

Page 32: Reading and interpreting quantitative intervention research syntheses: an introduction

Random effects model

• Each study is seen as representing the mean of a distribution of studies

• There is still a resultant overall effect size

Page 33: Reading and interpreting quantitative intervention research syntheses: an introduction

• “Random effects” model assumes a different underlying effect for each study.

• This model gives relatively more weight to smaller studies and wider confidence intervals than fixed effect models.

• The use of this model is recommended if there is heterogeneity between study results.

• Usually recommended in education

Which model?

Page 34: Reading and interpreting quantitative intervention research syntheses: an introduction

Degrees of freedom

Significance level

Study effect size (with C.I.)

Extent (on % scale)

Page 35: Reading and interpreting quantitative intervention research syntheses: an introduction

Other forms of data presentation

• Box plots • Mean and range

Page 36: Reading and interpreting quantitative intervention research syntheses: an introduction

Presenting Results

• Stem and Leaf Plot

-.4 4 -.3 6 -.2 5 2 -.1 7 4 1 -.0 9 2 0 0 .0 0 5 8 9 .1 0 0 1 2 2 7 8 8 .2 0 3 3 8 .3 0 0 4 4 4 5 7 8 8 .4 0 6 6 9 9 .5 3 4 8 .6 6 7 .7 .8 .9 1

Page 37: Reading and interpreting quantitative intervention research syntheses: an introduction

Interpreting meta-analysis results

• Conceptual• Scope and scale -

searches• Robustness of

evidence• Wider applicability

What is the question?

How robust is the synthesis?

How does integrating the data answer the question?

What does the result mean?

Page 38: Reading and interpreting quantitative intervention research syntheses: an introduction

How many studies?

• How many studies and of what quality would be needed to make a ‘strong recommendation’ or for ‘strong evidence of effect’?

• On what scale?– How many participants/ sites, 350, 500?

• Is there an empirical answer?

Page 39: Reading and interpreting quantitative intervention research syntheses: an introduction

Issues and challenges in meta-analysis

• Conceptual– Comparability – Reductionist– Atheoretical

• Technical – Heterogeneity– Methodological quality – Publication bias

Page 40: Reading and interpreting quantitative intervention research syntheses: an introduction

Comparability• Apples and oranges

– Same test– Different measures of the same construct– Different measures of different constructs– What question are you trying to answer?– How strong is the evidence for this?

“Of course it mixes apples and oranges; in the study of fruit, nothing else is sensible; comparing apples and oranges is the only endeavor worthy of true scientists; comparing apples to apples is trivial” (Glass, 2000).

Page 41: Reading and interpreting quantitative intervention research syntheses: an introduction

Reductionist or ‘flat earth’ critique

The “flat earth” criticism is based on Lee Cronbach’s assertion that a meta-analysis looks at the “big picture” and provides only a crude average.

According to Cronbach: “… some of our colleagues are beginning to sound like a Flat Earth Society.

They tell us that the world is essentially simple: most social phenomena are adequately described by linear relations; one-parameter scaling can discover coherent variables independent of culture and population; and inconsistencies among studies of the same kind will vanish if we but amalgamate a sufficient number of studies…The Flat Earth folk seek to bury any complex hypothesis with an empirical bulldozer…” (Cronbach, 1982, in Glass, 2000).

Over simplification - the answer is .42?

Page 42: Reading and interpreting quantitative intervention research syntheses: an introduction

Empirical … so not theoretical?

• What is your starting point?

• Conceptual/ theoretical critique– Marzano, 1998– Hattie, 2008– Sipe and Curlette, 1997

• Theory testing

• Theory generating

Page 43: Reading and interpreting quantitative intervention research syntheses: an introduction

Remaining technical issues

• Interventions

• Publication bias

• (Methodological quality)

• (Homogeneity/ heterogeneity)

Page 44: Reading and interpreting quantitative intervention research syntheses: an introduction

Interventions

• “Super-realisation bias” (Cronbach & al. 1980)

– Small-scale interventions tend to get larger effects

– Enthusiasm, attention to detail, quality of personal relationships

Page 45: Reading and interpreting quantitative intervention research syntheses: an introduction

Publication bias• The ‘file drawer problem’

– Statistically significant (positive) findings– Smaller studies need larger effect size to reach

significance– Large studies tend to get smaller effect sizes

• Replications difficult to get published• Sources of funding

Page 46: Reading and interpreting quantitative intervention research syntheses: an introduction

Dealing with publication bias

• Trim and fill techniques

• ‘Funnel plot’ sometimes used to explore this Scatterplot of the effects from individual studies

(horizontal axis) against a study size (vertical axis)

Page 47: Reading and interpreting quantitative intervention research syntheses: an introduction
Page 48: Reading and interpreting quantitative intervention research syntheses: an introduction

Dealing with heterogeneity

• Tackle variation in effect sizes• Investigate to find clusters (moderator

variables)• Explore against coded variables• Evaluate whether a pooled result is an

appropriate answer to the question.42?

Page 49: Reading and interpreting quantitative intervention research syntheses: an introduction

Summary

• “Replicable and defensible” method for synthesizing findings across studies (Lipsey & Wilson, 2001)

• Identifies gaps in the literature, providing a sound basis for further research

• Indicates the need for replication in education• Facilitates identification of patterns in the

accumulating results of individual evaluations• Provides a frame for theoretical critique

Page 50: Reading and interpreting quantitative intervention research syntheses: an introduction

Some useful websites

EPPI, Institute of Education, Londonhttp://eppi.ioe.ac.uk/

The Campbell Collaborationhttp://www.campbellcollaboration.org/

Best Evidence Encyclopedia, Johns Hopkinshttp://www.bestevidence.org/

Best Evidence Synthesis (BES), NZhttp://www.educationcounts.govt.nz/themes/BES

Institute for Effective Education (York)http://www.york.ac.uk/iee/research/#reviews

Google Scholarhttp://scholar.google.com/ Keyword(s) + meta-analysis

Page 51: Reading and interpreting quantitative intervention research syntheses: an introduction

Evaluation

• Please complete the evaluation sheet

• Feedback will be used:– to inform other sessions/ revise materials

for independent use– For ESRC evaluation

Page 52: Reading and interpreting quantitative intervention research syntheses: an introduction

Acknowledgements

• This presentation is an outcome of the work of the ESRC-funded Researcher Development Initiative: “Training in the Quantitative synthesis of Intervention Research Findings in Education and Social Sciences” which ran from 2008-2011.

• The training was designed by Steve Higgins and Rob Coe (Durham University), Carole Torgerson (Birmingham University) and Mark Newman and James Thomas, Institute of Education, London University.

• The team acknowledges the support of Mark Lipsey, David Wilson and Herb Marsh in preparation of some of the materials, particularly Lipsey and Wilson’s (2001) “Practical Meta-analysis” and David Wilson’s slides at: http://mason.gmu.edu/~dwilsonb/ma.html (accessed 9/3/11).

• The materials are offered to the wider academic and educational community community under a Creative Commons licence: Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License

• You should only use the materials for educational, not-for-profit use and you should acknowledge the source in any use.