handling mobility in cluster- randomized cohort trials

52
Handling Mobility in Cluster-Randomized Cohort Trials

Upload: mabel-lindsey-french

Post on 03-Jan-2016

232 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Handling Mobility in Cluster- Randomized Cohort Trials

Handling Mobility in Cluster-Randomized Cohort Trials

Page 2: Handling Mobility in Cluster- Randomized Cohort Trials

Extent of Mobility

• Cities, neighborhoods, schools, hospitals, clinics• 12% of total US population moved during 2008• 24% of those below the poverty line• 36% changed schools between K and 3rd grade• Annual student mobility varies widely– 0.3% to 66.7% in Chicago City schools in 2001-02

• Even with 30% annual mobility, only 49% are the same students after 3 yrs and only 34% after 3 yrs

Page 3: Handling Mobility in Cluster- Randomized Cohort Trials

Mobility Threatens Validity

• Internal validity – Differential mobility/attrition – amount of by type

• Statistical Conclusion validity– ITT analysis may not be possible

• External validity– Effects generalizable only to those who stay

• Construct validity– “Non-compliance” with treatment

• Those who leave or enter get only part of the treatment

Page 4: Handling Mobility in Cluster- Randomized Cohort Trials

Strategies to Address Mobility

• Two common approaches– Do all you can to minimize mobility/attrition– E.g., Selective sampling or costly retention efforts

• But selective sampling limits external validity

• Statistical Adjustments– Multiple imputation– Potential outcome models– Maximum likelihood estimation– Growth curve analysis– Propensity scores

Page 5: Handling Mobility in Cluster- Randomized Cohort Trials

Our Proposal

• Systematic consideration of:– Internal validity– External validity

• Cross-classification of:– Focus on person or cluster• Assesses all randomized units at end point, regardless

of whether or not they complied with the intervention

– ITT or non-compliance analysis• Take into account whether units complied or not

Page 6: Handling Mobility in Cluster- Randomized Cohort Trials

ITT Analysis

• Is always important – the “primary” analysis– At the level of the unit of assignment– Yields an unbiased estimate of the effect of random

assignment to receive an intervention• In presence of attrition or non-compliance,

intervention effect estimate is composite of:– Actual effect on those who complied with it– Effect on non-compliers even though they were assigned

to receive it– Effect on those who dropped out before receiving the

complete treatment

Page 7: Handling Mobility in Cluster- Randomized Cohort Trials

Two major designs

• Repeated cross-section design– New samples of group members are measured at each time of

assessment– No issues of person mobility or attrition

• Though could still have unit (e.g., schools) attrition

– Outcomes assessed as rates at the cluster level• Cohort (or panel) design

– A cohort or panel of persons is followed over time– Outcomes and covariates are assessed at the individual level

• But analyzed hierarchically

– Moderators at the individual level can be examined• Few studies to date have included both design features

Page 8: Handling Mobility in Cluster- Randomized Cohort Trials

Handling of late entrants/joiners

• Historically not included in analysis– Instead much effort made to track leavers

• In many contexts, including joiners helps to maintain internal and external validity– In schools, leavers and joiners are often similar

• But this needs to be checked

– Helps statistical conclusion validity by maintaining sample size and power

– Helps generalizability to other similar schools• Differential dosage is a form of non-compliance

with treatment

Page 9: Handling Mobility in Cluster- Randomized Cohort Trials

Missingness

• Missing data at later waves for leavers• Missing data at earlier waves for joiners• Both types can be handled with– Imputation– Analytical approaches

• All need careful consideration of covariates and predictors of leaving or joining

• Leavers and joiners may be different under some conditions– E.g., changing neighborhoods

Page 10: Handling Mobility in Cluster- Randomized Cohort Trials

Multilevel classification of approaches to mobility

Focus of the Trial

Nature of the Analysis

Intent To Treat Compliance

PERSON 1. Person-focused ITT approach

3. Person-focused compliance approach

CLUSTER 2. Cluster-focused ITT approach

4. Cluster-focused compliance approach

Page 11: Handling Mobility in Cluster- Randomized Cohort Trials

1. Person-focused ITT approach• Question Answered: What is the impact of intervention

assignment on persons when implemented under real world conditions (including mobility)? – Uses data from all persons originally in the clusters

assigned to conditions.– Focus is on estimating the person level of program effect.– Persons who leave clusters (e.g., schools) during the trial

are followed (i.e., Leavers are retained)– Persons who enter research clusters during the trial are

not assessed (or analyzed) (i.e., Joiners are NOT added).

Page 12: Handling Mobility in Cluster- Randomized Cohort Trials

2. Cluster-focused ITT approach

• Question answered: What is the impact of intervention assignment on clusters of persons? – Uses data from all clusters originally assigned to

conditions.– Focus is on estimating the cluster-level intervention

effect – i.e., prevalence not incidence.– Requires assessing persons who enter research

clusters during the trial. Joiners are added.– Persons who leave research clusters during the trial

are not followed. Leavers are dropped.

Page 13: Handling Mobility in Cluster- Randomized Cohort Trials

3. Person-focused compliance approach

• Question answered: What is the impact of the intervention on those persons who complied with it to receive an adequate “dosage?” – Focus is estimating the effect on persons who

comply with the intervention. CACE analysis is preferred to “as-treated “ analysis.

– Dropouts are not followed.– Late entrants are not assessed (or analyzed).

Page 14: Handling Mobility in Cluster- Randomized Cohort Trials

4. Cluster-focused compliance approach

• Question answered: What is the impact of the intervention on clusters that complied with it? – Focus is only on those clusters that stay in the trial

or in which the intervention is fully implemented.– Joiners and leavers handled similar to Option 2:• Requires assessing persons who enter research clusters

during the trial. Joiners are added.• Persons who leave research clusters during the trial are

not followed. Leavers are dropped.

Page 15: Handling Mobility in Cluster- Randomized Cohort Trials

Flay & Collins (2005) Historical review of

school-based randomized trials

Page 16: Handling Mobility in Cluster- Randomized Cohort Trials

Improvements in School-based RCTs

• approaches to the randomization of whole schools• the choice of appropriate comparison or control groups• solutions when randomization breaks down• limiting and handling of variation in integrity of the

intervention received• limiting biases introduced by data collection• awareness of the effects of intensive and long-term data

collection• limiting and analysis of subject attrition and other

missing data

Page 17: Handling Mobility in Cluster- Randomized Cohort Trials

Improvements in School-based RCTs

• approaches to obtaining parental consent for children to engage in research

• design and analysis issues when only small numbers of schools are available or can be afforded

• the choice of the unit of analysis• phases of research• optimizing and extending the reach of interventions• differential effects in subpopulations

Page 18: Handling Mobility in Cluster- Randomized Cohort Trials

Six Important Issues

• Sequence planning • Time• Keeping up with, and being open to,

methodological advances• Publication of all results• Accumulation of knowledge• The devil is in the details

Page 19: Handling Mobility in Cluster- Randomized Cohort Trials

More on Sampling:What About Random Sampling?

• Sampling models are often ignored in intervention research in education and public health

BUT:• Sampling is where the randomness comes from

in education and PH research• Sampling therefore has profound consequences

for statistical analysis and research designs

Page 20: Handling Mobility in Cluster- Randomized Cohort Trials

Sampling Models

Simple random samples are rare in field research

Many populations are hierarchically nested:

• Students in classrooms in schools• Schools in districts in states• People in communities/neighborhoods• Patients in clinics

We usually exploit the population structure to sample people (e.g., students) by first sampling places (e.g., schools)

Even then, most samples are not probability samples, but they are intended to be representative (of some population)

Page 21: Handling Mobility in Cluster- Randomized Cohort Trials

Sampling Models

Survey research calls this strategy multistage (multilevel) clustered sampling

We often sample clusters (schools) first then individuals within clusters (students within schools)

This is a two-stage (two-level) cluster sample

We might sample schools, then classrooms, then students

This is a three-stage (three-level) cluster sample

Page 22: Handling Mobility in Cluster- Randomized Cohort Trials

Precision of Estimates Depend on the Sampling Model

Suppose the total population variance is σT2 and ICC is ρ

Consider two samples of size N = mn

A simple random sample or stratified sample

The variance of the mean is σT2/mn

A clustered sample of n students from each of m schools

The variance of the mean is (σT2/mn)[1 + (n – 1)ρ]

The inflation factor [1 + (n – 1)ρ] is called the design effectFor example, if n = 10 and ρ = .1, design effect = 1.9, that is, variance is inflated almost 2 times = almost double what it would be with 100 students who were not clustered in m schools

Page 23: Handling Mobility in Cluster- Randomized Cohort Trials

Precision of Estimates Depend on the Sampling Model

Suppose the population variance is σT2

School level ICC is ρS, class level ICC is ρC

Consider two samples of size N = mpn

A simple random sample or stratified sample

The variance of the mean is σT2/mpn

A clustered sample of n students from p classes in m schools

The variance is (σT2/mpn)[1 + (pn – 1)ρS + (n – 1)ρC]

The three level design effect is [1 + (pn – 1)ρS + (n – 1)ρC]

Page 24: Handling Mobility in Cluster- Randomized Cohort Trials

Precision of Estimates Depend on the Sampling Model

Treatment effects in experiments and quasi-experiments are mean differences

Therefore precision of treatment effects and statistical power will depend on the sampling model

Page 25: Handling Mobility in Cluster- Randomized Cohort Trials

Sampling Models in ED & PH Research

The fact that the population is structured does not mean the sample must be a clustered sample

Whether it is a clustered sample depends on:

• How the sample is drawn (e.g., are schools sampled first then individuals randomly within schools)

• What the inferential population is (e.g., is the inference these schools studied or a larger population of schools)

Page 26: Handling Mobility in Cluster- Randomized Cohort Trials

Sampling Models in ED & PH Research

A necessary condition for a clustered sample is that it is drawn in stages using population subdivisions

• schools then students within schools

• schools then classrooms then students

However, if all subdivisions in a population are present in the sample, the sample is not clustered, but stratified

Stratification has different implications than clustering

Whether there is stratification or clustering depends on the definition of the population to which we draw inferences (the inferential population)

Page 27: Handling Mobility in Cluster- Randomized Cohort Trials

Sampling Models in ED & PH Research

The clustered/stratified distinction matters because it influences the precision of statistics estimated from the sample

If all population subdivisions are included in every sample, there is no sampling (or exhaustive sampling) of subdivisions

• therefore differences between subdivisions add no uncertainty to estimates

If only some population subdivisions are included in the sample, it matters which ones you happen to sample

• thus differences between subdivisions add to uncertainty

Page 28: Handling Mobility in Cluster- Randomized Cohort Trials

Inference PopulationInference Models

Fixed and Random EffectsStatistical Power

Unit of Randomization

Courtesy of Larry V. Hedges, Northwestern UniversityIES Summer Research Training Institute June 18 – 29, 2007

Page 29: Handling Mobility in Cluster- Randomized Cohort Trials

Inferential Population and Inference Models

The inferential population or inference model has implications for analysis and therefore for the design of experiments

Do we make inferences to the schools in this sample or to a larger population of schools?

Inferences to the schools or classes in the sample are called conditional inferences

Inferences to a larger population of schools or classes are called unconditional inferences

Page 30: Handling Mobility in Cluster- Randomized Cohort Trials

Inferential Population and Inference Models

Note that the inferences (what we are estimating) are different in conditional versus unconditional inference models

• In a conditional inference, we are estimating the mean (or treatment effect) in the observed schools

• In unconditional inference we are estimating the mean (or treatment effect) in the population of schools from which the observed schools are sampled

We are still estimating a mean (or a treatment effect) but they are different parameters with different uncertainties

Page 31: Handling Mobility in Cluster- Randomized Cohort Trials

Fixed and Random Effects

When the levels of a factor (e.g., particular blocks included) in a study are sampled and the inference model is unconditional, that factor is called random and its effects are called random effects

When the levels of a factor (e.g., particular blocks included) in a study constitute the entire inference population and the inference model is conditional, that factor is called fixed and its effects are called fixed effects

Page 32: Handling Mobility in Cluster- Randomized Cohort Trials

Comparing Fixed and Mixed Effects Statistical Procedures (Hierarchical Design)

Conditional and unconditional inference models • estimate different treatment effects• have different contaminating factors that add uncertainty

Mixed procedures are good for unconditional inference

The fixed procedures are not generally recommended

The fixed procedures have higher power

Page 33: Handling Mobility in Cluster- Randomized Cohort Trials

Comparing Hierarchical Designs to Randomized Block Designs

Randomized block designs usually have higher power, but assignment of different treatments within schools or classes may be

• practically difficult• politically infeasible• theoretically impossible

It may be methodologically unwise because of potential for

• Contamination or diffusion of treatments• compensatory rivalry or demoralization

Page 34: Handling Mobility in Cluster- Randomized Cohort Trials

Applications to Experimental Design

We will address the two most widely used experimental designs in education

• Randomized blocks designs with 2 levels

• Randomized blocks designs with 3 levels

• Hierarchical designs with 2 levels

• Hierarchical designs with 3 levels

We also examine the effect of covariates

Hereafter, we generally take schools to be random

Page 35: Handling Mobility in Cluster- Randomized Cohort Trials

Precision of the Estimated Treatment Effect

Precision is the standard error of the estimated treatment effect

Precision in simple (simple random sample) designs depends on:

• Standard deviation in the population σ

• Total sample size N

The precision is

SE N

Page 36: Handling Mobility in Cluster- Randomized Cohort Trials

Precision of the Estimated Treatment Effect

Precision in complex (clustered sample) designs depends on: • The (total) standard deviation σT

• Sample size at each level of sampling (e.g., m clusters, n individuals per cluster)

• Intraclass correlation structure

It is a little harder to compute than in simple designs, but important because it helps you see what matters in design

Page 37: Handling Mobility in Cluster- Randomized Cohort Trials

Precision in Two-level Hierarchical DesignWith No Covariates

The standard error of the treatment effect

SE decreases as m (number of schools) increases

SE deceases as n increases, but only up to point

SE increases as ρ increases

2 1 ( 1)T

n ρSE

m n

Page 38: Handling Mobility in Cluster- Randomized Cohort Trials

Statistical PowerPower in simple (simple random sample) designs depends on: • Significance level

• Effect size

• Sample size

Look power up in a table for sample size and effect size

Page 39: Handling Mobility in Cluster- Randomized Cohort Trials

Fragment of Cohen’s Table 2.3.5

               

d

n 0.10 0.20 … 0.80 1.00 1.20 1.40

8 05 07 … 31 46 60 73

9 06 07 … 35 51 65 79

10 06 07 … 39 56 71 84

11 06 07 … 43 63 76 87

               

Page 40: Handling Mobility in Cluster- Randomized Cohort Trials

Computing Statistical PowerPower in complex (clustered sample) designs depends on: • Significance level

• Effect size δ

• Sample size at each level of sampling (e.g., m clusters, n individuals per cluster)

• Intraclass correlation structure

This makes it seem a lot harder to compute

Page 41: Handling Mobility in Cluster- Randomized Cohort Trials

Computing Statistical PowerComputing statistical power in complex designs is only a little

harder than computing it for simple designs

Compute operational effect size (incorporates sample design information) ΔT

Look power up in a table for operational sample size and operational effect size

This is the same table that you use for simple designs

Page 42: Handling Mobility in Cluster- Randomized Cohort Trials

Randomized Block DesignsIn randomized block designs, as in hierarchical designs, the

intraclass correlation has an impact on precision and power

However, in randomized block designs designs there is also a parameter reflecting the degree of heterogeneity of treatment effects across schools

We define this heterogeneity parameter ωS in terms of the amount of heterogeneity of treatment effects relative to the heterogeneity of school means

Thus

ωS = σTxS2/σS

2

Page 43: Handling Mobility in Cluster- Randomized Cohort Trials

Precision in Two-level Randomized Block DesignWith No Covariates

The standard error of the treatment effect

SE decreases as m (number of schools) increases

SE deceases as n increases, but only up to point

SE increases as ρ increases

SE increases as ωS = σTxS2/σS

2 increases

1 ( 1)2 S ST

n ρSE

m n

Page 44: Handling Mobility in Cluster- Randomized Cohort Trials

Power in Two-level Randomized Block DesignWith No Covariates

Basic Idea:Operational Effect Size = (Effect Size) x (Design Effect)

ΔT = δ x (Design Effect)

For the two-level hierarchical design with no covariates

Operational sample size is number of schools (clusters)

1 1

T n

n ρ

/ 2

1 1T

S S

n

n ρ

Page 45: Handling Mobility in Cluster- Randomized Cohort Trials

What Unit Should Be Randomized?(Schools, Classrooms, or Students)

Experiments cannot estimate the causal effect on any individual

Experiments estimate average causal effects on the units that have been randomized

• If you randomize schools the (average) causal effects are effects on schools

• If you randomize classes, the (average) causal effects are on classes

• If you randomize individuals, the (average) causal effects estimated are on individuals

Page 46: Handling Mobility in Cluster- Randomized Cohort Trials

What Unit Should Be Randomized?(Schools, Classrooms, or Students)

Theoretical Considerations

Decide what level you care about, then randomize at that level

Randomization at lower levels may impact generalizability of the causal inference (and it is generally a lot more trouble)

Suppose you randomize classrooms, should you also randomly assign students to classes?

It depends: Are you interested in the average causal effect of treatment on naturally occurring classes or on randomly assembled ones?

Page 47: Handling Mobility in Cluster- Randomized Cohort Trials

What Unit Should Be Randomized?(Schools, Classrooms, or Students)

Relative power/precision of treatment effect

Assign Schools(Hierarchical Design)

Assign Classrooms(Randomized Block)

Assign Students(Randomized Block)

1 1 ( 1)S Cpn ρ n

pn

1 1 ( 1)S S C Cpn ρ n

pn

1 1 ( 1)S Cpn ρ n

pn

Page 48: Handling Mobility in Cluster- Randomized Cohort Trials

What Unit Should Be Randomized?(Schools, Classrooms, or Students)

Precision of estimates or statistical power dictate assigning the lowest level possible

But the individual (or even classroom) level will not always be feasible or even theoretically desirable

Page 49: Handling Mobility in Cluster- Randomized Cohort Trials

Class Exercise:

Questions About Design

Divide into 3 groups4 questions each

Discuss then present to class

Page 50: Handling Mobility in Cluster- Randomized Cohort Trials

Questions About Assigning Schools

1. I assigned treatments to schools and am not using classes in the analysis. Do I have to take them into account in the design? Do I have to include classes as a nested factor?

2. My schools all come from two districts, but I am randomly assigning the schools. Do I have to take district into account some way?

3. I didn’t really sample the schools in my experiment (who does?). Do I still have to treat schools as random effects? What population can I generalize to anyway?

4. I am using a randomized block design with fixed effects. Do you really mean I can’t say anything about effects in schools that are not in the sample?

Page 51: Handling Mobility in Cluster- Randomized Cohort Trials

Questions About Failed Randomization1. We randomly assigned, but our assignment

was corrupted by treatment switchers. What do we do?

2. We randomly assigned, but our assignment was corrupted by attrition. What do we do?

3. We randomly assigned but got a big imbalance on characteristics we care about (gender, race, language, SES). What do we do?

4. We randomly assigned but when we looked at the pretest scores of our DV, we see that we got a big imbalance (a “bad randomization”). What do we do?

Page 52: Handling Mobility in Cluster- Randomized Cohort Trials

Questions About Treatment Effects

1. We care about treatment effects, but we really want to know about mechanism. How do we find out if implementation impacts treatment effects?

2. We want to know where (under what conditions) the treatment works. Can we analyze the relation between conditions and treatment effect to find this out?

3. We have a randomized block design and find heterogeneous treatment effects. What can we say about the main effect of treatment in the presence of interactions?

4. I have heard of using “school fixed effects” to analyze a randomized block design. Is it a good alternative to use ANOVA or HLM?