nonparametric thresholding methods (fwe inference w/ snpm)

47
1 Nonparametric Thresholding Methods (FWE inference w/ SnPM) Thomas Nichols, Ph.D. Assistant Professor Department of Biostatistics University of Michigan http://www.sph.umich.edu/~nichols SPM Course May 13, 2004

Upload: ashlyn

Post on 18-Jan-2016

54 views

Category:

Documents


2 download

DESCRIPTION

Nonparametric Thresholding Methods (FWE inference w/ SnPM). Thomas Nichols, Ph.D. Assistant Professor Department of Biostatistics University of Michigan http://www.sph.umich.edu/~nichols SPM Course May 13, 2004. Overview. Multiple Comparisons Problem - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

1

Nonparametric Thresholding Methods

(FWE inference w/ SnPM)

Thomas Nichols, Ph.D.Assistant Professor

Department of Biostatistics University of Michigan

http://www.sph.umich.edu/~nichols

SPM CourseMay 13, 2004

Page 2: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

2

Overview

• Multiple Comparisons Problem– Which of my 100,000 voxels are “active”?

• SnPM– Permutation test to find threshold– Control chance of any false positives (FWER)

Page 3: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

3

Hypothesis Testing in fMRI

• Massively Univariate Modeling– Fit model at each voxel– Create statistic images of effect

• Which of 100,000 voxels are significant? =0.05 5,000 false positives!

t > 0.5 t > 1.5 t > 2.5 t > 3.5 t > 4.5 t > 5.5 t > 6.5

Page 4: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

4

• Null Hypothesis H0

• Test statistic T– t observed realization of T

level– Acceptable false positive rate

– P( T>u | H0 ) =

• P-value– Assessment of t assuming H0

– P( T > t | H0 )• Prob. of obtaining stat. as large

or larger in a new experiment

– P(Data|Null) not P(Null|Data)

Hypothesis Testing

u

Null Distribution of T

t

P-val

Null Distribution of T

Page 5: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

5

Multiple Comparisons Problem (MCP)

• Standard Hypothesis Test– Controls Type I error of each test,

at say 5%

– But what if I have 100,000 voxels?• 5,000 false positives on average!

• Must control false positive rate– What false positive rate?

– Chance of 1 or more Type I errors

– Chance of 50 or more?

5%0

Page 6: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

6

MCP Solutions:Measuring False Positives

• Familywise Error Rate (FWER)– Familywise Error

• Existence of one or more false positives

– FWER is probability of familywise error

• False Discovery Rate (FDR)– R voxels declared active, V falsely so

• Observed false discovery rate: V/R

– FDR = E(V/R)

Page 7: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

7

FWER MCP Solutions

• Bonferroni

• Maximum Distribution Methods– Random Field Theory– Permutation

Page 8: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

8

FWER MCP Solutions

• Bonferroni

• Maximum Distribution Methods– Random Field Theory– Permutation

Page 9: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

9

FWER MCP Solutions: Controlling FWER w/ Max

• FWER & distribution of maximum

FWER= P(FWE)= P(One or more voxels u |

Ho)= P(Max voxel u | Ho)

• 100(1-)%ile of max distn controls FWERFWER = P(Max voxel u | Ho)

u

Page 10: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

10

FWER MCP Solutions

• Bonferroni

• Maximum Distribution Methods– Random Field Theory– Permutation

Page 11: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

11

Nonparametric Inference:Permutation Test

• Assumptions– Null Hypothesis Exchangeability

• Method– Compute statistic t

– Resample data (without replacement), compute t*

– {t*} permutation distribution of test statistic

– P-value = #{ t* > t } / #{ t* }

• Theory– Given data and H0, each t* has equal probability

– Still can assume data randomly drawn from population

Page 12: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

12

Nonparametric Inference• Parametric methods

– Assume distribution ofstatistic under nullhypothesis

– Needed to find P-values, u

• Nonparametric methods– Use data to find

distribution of statisticunder null hypothesis

– Any statistic!

5%

Parametric Null Distribution

5%

Nonparametric Null Distribution

Page 13: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

13

Permutation TestToy Example

• Data from V1 voxel in visual stim. experimentA: Active, flashing checkerboard B: Baseline, fixation6 blocks, ABABAB Just consider block averages...

• Null hypothesis Ho – No experimental effect, A & B labels arbitrary

• Statistic– Mean difference

A B A B A B

103.00 90.48 99.93 87.83 99.76 96.06

Page 14: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

14

Permutation TestToy Example

• Under Ho

– Consider all equivalent relabelings

AAABBB ABABAB BAAABB BABBAA

AABABB ABABBA BAABAB BBAAAB

AABBAB ABBAAB BAABBA BBAABA

AABBBA ABBABA BABAAB BBABAA

ABAABB ABBBAA BABABA BBBAAA

Page 15: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

15

Permutation TestToy Example

• Under Ho

– Consider all equivalent relabelings– Compute all possible statistic values

AAABBB 4.82 ABABAB 9.45 BAAABB -1.48 BABBAA -6.86

AABABB -3.25 ABABBA 6.97 BAABAB 1.10 BBAAAB 3.15

AABBAB -0.67 ABBAAB 1.38 BAABBA -1.38 BBAABA 0.67

AABBBA -3.15 ABBABA -1.10 BABAAB -6.97 BBABAA 3.25

ABAABB 6.86 ABBBAA 1.48 BABABA -9.45 BBBAAA -4.82

Page 16: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

16

Permutation TestToy Example

• Under Ho

– Consider all equivalent relabelings– Compute all possible statistic values– Find 95%ile of permutation distribution

AAABBB 4.82 ABABAB 9.45 BAAABB -1.48 BABBAA -6.86

AABABB -3.25 ABABBA 6.97 BAABAB 1.10 BBAAAB 3.15

AABBAB -0.67 ABBAAB 1.38 BAABBA -1.38 BBAABA 0.67

AABBBA -3.15 ABBABA -1.10 BABAAB -6.97 BBABAA 3.25

ABAABB 6.86 ABBBAA 1.48 BABABA -9.45 BBBAAA -4.82

Page 17: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

17

Permutation TestToy Example

• Under Ho

– Consider all equivalent relabelings– Compute all possible statistic values– Find 95%ile of permutation distribution

AAABBB 4.82 ABABAB 9.45 BAAABB -1.48 BABBAA -6.86

AABABB -3.25 ABABBA 6.97 BAABAB 1.10 BBAAAB 3.15

AABBAB -0.67 ABBAAB 1.38 BAABBA -1.38 BBAABA 0.67

AABBBA -3.15 ABBABA -1.10 BABAAB -6.97 BBABAA 3.25

ABAABB 6.86 ABBBAA 1.48 BABABA -9.45 BBBAAA -4.82

Page 18: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

18

Permutation TestToy Example

• Under Ho

– Consider all equivalent relabelings– Compute all possible statistic values– Find 95%ile of permutation distribution

0 4 8-4-8

Page 19: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

19

Permutation TestStrengths

• Requires only assumption of exchangeability– Under Ho, distribution unperturbed by permutation– Allows us to build permutation distribution

• Subjects are exchangeable– Under Ho, each subject’s A/B labels can be flipped

• fMRI scans not exchangeable under Ho– Due to temporal autocorrelation

Page 20: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

20

Permutation TestLimitations

• Computational Intensity– Analysis repeated for each relabeling– Not so bad on modern hardware

• No analysis discussed below took more than 3 hours

• Implementation Generality– Each experimental design type needs unique

code to generate permutations• Not so bad for population inference with t-tests

Page 21: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

21

Controlling FWER: Permutation Test

• Parametric methods– Assume distribution of

max statistic under nullhypothesis

• Nonparametric methods– Use data to find

distribution of max statisticunder null hypothesis

– Again, any max statistic!

5%

Parametric Null Max Distribution

5%

Nonparametric Null Max Distribution

Page 22: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

22

Permutation Test& Exchangeability

• Exchangeability is fundamental– Def: Distribution of the data unperturbed by permutation

– Under H0, exchangeability justifies permuting data

– Allows us to build permutation distribution

• Subjects are exchangeable– Under Ho, each subject’s A/B labels can be flipped

• Are fMRI scans exchangeable under Ho?– If no signal, can we permute over time?

Page 23: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

23

Permutation Test& Exchangeability

• fMRI scans are not exchangeable– Permuting disrupts order, temporal autocorrelation

• Intrasubject fMRI permutation test– Must decorrelate data, model before permuting– What is correlation structure?

• Usually must use parametric model of correlation

– E.g. Use wavelets to decorrelate• Bullmore et al 2001, HBM 12:61-78

• Intersubject fMRI permutation test– Create difference image for each subject– For each permutation, flip sign of some subjects

Page 24: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

24

Permutation TestOther Statistics

• Collect max distribution– To find threshold that controls FWER

• Consider smoothed variance t statistic– To regularize low-df variance estimate

Page 25: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

25

Permutation TestSmoothed Variance t

• Collect max distribution– To find threshold that controls FWER

• Consider smoothed variance t statistic

t-statisticvariance

mean difference

Page 26: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

26

Permutation TestSmoothed Variance t

• Collect max distribution– To find threshold that controls FWER

• Consider smoothed variance t statistic

SmoothedVariancet-statistic

mean difference

smoothedvariance

Page 27: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

27

Permutation TestExample

• fMRI Study of Working Memory – 12 subjects, block design Marshuetz et al (2000)

– Item Recognition• Active:View five letters, 2s pause,

view probe letter, respond

• Baseline: View XXXXX, 2s pause,view Y or N, respond

• Second Level RFX– Difference image, A-B constructed

for each subject

– One sample, smoothed variance t test

...

D

yes

...

UBKDA

Active

...

N

no

...

XXXXX

Baseline

Page 28: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

28

Permutation TestExample

• Permute!– 212 = 4,096 ways to flip 12 A/B labels– For each, note maximum of t image.

Permutation DistributionMaximum t

Maximum Intensity Projection Thresholded t

Page 29: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

29

Permutation TestExample

• Compare with Bonferroni = 0.05/110,776

• Compare with parametric RFT– 110,776 222mm voxels– 5.15.86.9mm FWHM smoothness– 462.9 RESELs

Page 30: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

30

t11 Statistic, RF & Bonf. Thresholdt11 Statistic, Nonparametric Threshold

uRF = 9.87uBonf = 9.805 sig. vox.

uPerm = 7.67

58 sig. vox.

Smoothed Variance t Statistic,Nonparametric Threshold

378 sig. vox.

Test Level vs. t11 Threshold

Page 31: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

Does this Generalize?RFT vs Bonf. vs Perm.

t Threshold (0.05 Corrected)

df RF Bonf Perm Verbal Fluency 4 4701.32 42.59 10.14 Location Switching 9 11.17 9.07 5.83 Task Switching 9 10.79 10.35 5.10 Faces: Main Effect 11 10.43 9.07 7.92 Faces: Interaction 11 10.70 9.07 8.26 Item Recognition 11 9.87 9.80 7.67 Visual Motion 11 11.07 8.92 8.40 Emotional Pictures 12 8.48 8.41 7.15 Pain: Warning 22 5.93 6.05 4.99 Pain: Anticipation 22 5.87 6.05 5.05

Page 32: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

RFT vs Bonf. vs Perm. No. Significant Voxels

(0.05 Corrected) t SmVar t df RF Bonf Perm Perm

Verbal Fluency 4 0 0 0 0 Location Switching 9 0 0 158 354 Task Switching 9 4 6 2241 3447 Faces: Main Effect 11 127 371 917 4088 Faces: Interaction 11 0 0 0 0 Item Recognition 11 5 5 58 378 Visual Motion 11 626 1260 1480 4064 Emotional Pictures 12 0 0 0 7 Pain: Warning 22 127 116 221 347 Pain: Anticipation 22 74 55 182 402

Page 33: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

33

Monte Carlo Evaluations

• What’s going wrong?– Normality assumptions?– Smoothness assumptions?

• Use Monte Carlo Simulations– Normality strictly true– Compare over range of smoothness, df

• Previous work– Gaussian (Z) image results well-validated– t image results hardly validated at all!

Page 34: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

34

Monte Carlo EvaluationsChallenges

• Accurately simulating t images– Cannot directly simulate smooth t images– Need to simulate smooth Gaussian images

( = degrees of freedom)

• Accounting for all sources of variability– Most M.C. evaluations use known smoothness– Smoothness not known– We estimated it residual images

Page 35: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

35

Monte Carlo Evaluations

• Simulated One Sample T test– 32x32x32 Images (32767 voxels)– Smoothness: 0, 1.5, 3, 6,12 FWHM– Degrees of Freedom: 9, 19, 29– Realizations: 3000

• Permutation– 100 relabelings– Threshold: 95%ile of permutation distn of maximum

• Random Field– Threshold: { u : E(u | Ho) = 0.05 }

• Also Gaussian

Autocorrelation Function

FWHM

Page 36: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

36

Inf. df

29df

more

FamilywiseErrorThresholds

• RFT valid but conservative

• Gaussian not so bad (FWHM >3)

• t29 somewhat worse

Page 37: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

37

Inf df

29 df

more

FamilywiseRejectionRates

• Need > 6 voxel FWHM

Page 38: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

38

• RF & Perm adapt to smoothness

• Perm & Truth close

• Bonferroni close to truth for low smoothness

19 df

9 df

more

FamilywiseErrorThresholds

Page 39: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

39

FamilywiseRejectionRates

• Bonf good on low df, smoothness

• Bonf bad for high smoothness

• RF only good for high df, high smoothness

• Perm exact

19 df

9 df

more

Page 40: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

40

FamilywiseRejectionRates

• Smoothness estimation is not (sole) problem

19 df

9 df

cont

Page 41: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

41

Performance Summary

• Bonferroni– Not adaptive to smoothness– Not so conservative for low smoothness

• Random Field– Adaptive– Conservative for low smoothness & df

• Permutation– Adaptive (Exact)

Page 42: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

42

Understanding Performance Differences

• RFT Troubles– Multivariate Normality assumption

• True by simulation

– Smoothness estimation• Not much impact

– Smoothness• You need lots, more at low df

– High threshold assumption• Doesn’t improve for 0 less than 0.05 (not shown)

HighThr

Page 43: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

43

Conclusions

• t random field results conservative for– Low df & smoothness

– 9 df & 12 voxel FWHM; 19 df & < 10 voxel FWHM

• Bonferroni surprisingly satisfactory for low smoothness

• Nonparametric methods perform well overall• More data and simulations needed

– Need guidelines as to when RF is useful

– Better understand what assumption/approximation fails

Page 44: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

44

References• TE Nichols and AP Holmes.

Nonparametric Permutation Tests for Functional Neuroimaging: A Primer with Examples. Human Brain Mapping, 15:1-25, 2002.

• http://www.sph.umich.edu/~nichols

MC ThrRslt EstSmCfMC P RsltData ThrRslt

Page 45: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

45

Page 46: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

46

Permutation TestExample

• Permute!– 212 = 4,096 ways to flip A/B labels

– For each, note max of smoothed variance t image

.

Permutation DistributionMax Smoothed Variance t

Maximum Intensity Projection Threshold Sm. Var. t

Page 47: Nonparametric Thresholding Methods (FWE inference w/ SnPM)

47Back