contemporary data analysis and its communication

52
Significant No More: Contemporary Data Analysis and its Communication Richard A. Erickson Upper Midwest Environmental Sciences Center Barnett A. Rattner Eastern Ecological Science Center 1

Upload: others

Post on 28-Dec-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Contemporary Data Analysis and its Communication

Significant No More: Contemporary Data Analysis

and its Communication

Richard A. EricksonUpper Midwest Environmental Sciences Center

Barnett A. RattnerEastern Ecological Science Center

1

Page 2: Contemporary Data Analysis and its Communication

Who are we and contact information?

Richie Erickson, Quantitative Ecologist

• Invasive species modeling• “other duties as assigned”

• Statistical consultant UMESC• Statistical reviews at UMESC

• Editorial Board Member ET&C

Barnett Ratter, Physiologist

• Ecotoxicology• Editor for ET&C

• SETAC Fellow

[email protected]@usgs.gov 2

Page 3: Contemporary Data Analysis and its Communication

If you need to use statistics, then you should design a better experiment.

Ernest Rutherford

3

Page 4: Contemporary Data Analysis and its Communication

Equations are the devil’s sentences.

Stephen Colbert

4

Page 5: Contemporary Data Analysis and its Communication

Statistics is the grammar of science.

Karl Pearson

5

Page 6: Contemporary Data Analysis and its Communication

Statistics plays a vital role in today's society by providing the foundation for sound decisions

Robert A. Donnelly inThe Complete Idiot's Guide to Statistics

6

Page 7: Contemporary Data Analysis and its Communication

Sound Principles of Ecotoxicology …..ERA and NRDA ? (Harris et al. 2014)

• Adequate planning, good study design• Define the baseline• Include appropriate controls• Use appropriate exposure routes and concentrations• Understand your tools• Think about statistical analysis when designing study• Consider dose-response• Replicate study• Consider confounding factors• Consider weight of evidence• Report findings in an unbiased manner

7

Page 8: Contemporary Data Analysis and its Communication

8

Page 9: Contemporary Data Analysis and its Communication

I. What is a p-value?

9

Page 10: Contemporary Data Analysis and its Communication

When you took statistics, or when first exposed to significance testing, you were taught that a “statistically significant” outcome with a p-value of 0.05 means…

Michael Newman

10

Page 11: Contemporary Data Analysis and its Communication

Possible answers

1. The null hypothesis is disproven 2. The p-value is the estimated probability of the null

hypothesis being true3. You proven that an effect is present4. You can deduce the probability of the effect actually

being present.5. You know the probability that you would make a

wrong decision if you rejected the null hypothesis.6. You know the probability of getting these [or more

extreme] data if the null hypothesis were true.7. If you repeated the test many times, you would

obtain a significant effect in 95 out of 100 trials.

11

Page 12: Contemporary Data Analysis and its Communication

You know the probability of getting these [or more extreme] data if the null hypothesis were true.

12

Page 13: Contemporary Data Analysis and its Communication

American Statistical Association

“Informally, a p-value is the probability under a specified statistical model that a statistical summary of the data (e.g., the sample mean difference between two compared groups) would be equal to or more extreme than its observed value.”

13

Page 14: Contemporary Data Analysis and its Communication

II. Problems with p-values?

14

Page 15: Contemporary Data Analysis and its Communication

xkcd jellybean example

https://xkcd.com/882/

15

Page 16: Contemporary Data Analysis and its Communication

https://xkcd.com/882/

1 out of 20!

16

Page 17: Contemporary Data Analysis and its Communication

https://xkcd.com/882/17

Page 18: Contemporary Data Analysis and its Communication

18

Page 19: Contemporary Data Analysis and its Communication

2016

2019

19

Page 20: Contemporary Data Analysis and its Communication

Principles (ASA 2016)• p-values can indicate how incompatible the data are with a specified

statistical model.

• p-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone.

• Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold.

• Proper inference requires full reporting and transparency.

• A p-value, or statistical significance, does not measure the size of an effect or the importance of a result.

• By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis.

20

Page 21: Contemporary Data Analysis and its Communication

III. Use of the word "significant" and the phrase "statistically significant"

21

Page 22: Contemporary Data Analysis and its Communication

We conclude, based on our review of the articles in this special issue and the broader literature, that...it is time to stop using the term “statistically significant” entirely.Nor should variants such as “significantly different,” “p < 0.05,” and “nonsignificant” survive, whether expressed in words, by asterisks in a table, or in some other way.

Wasserstein et al. 201922

Page 23: Contemporary Data Analysis and its Communication

23

Page 24: Contemporary Data Analysis and its Communication

What is better....Blood clotting time was prolonged in warfarin treated kestrels compared to controls (log transformed values, one-way ANOVA F3,16 = 22.1, p = 0.003 and Tukey’s HSD p ≤ 0.0045; Figure 1).

Blood clotting time was significantly different in warfarin treated kestrels compared to controls (p < 0.05).

24

Page 25: Contemporary Data Analysis and its Communication

IV. Alternatives to NHST

25

Page 26: Contemporary Data Analysis and its Communication

What you will NOT find … is one solution that majestically replaces the outsized role that statistical significance has come to play.

Wasserstein et al. 2019

26

Page 27: Contemporary Data Analysis and its Communication

Some alternatives to NHST

• Better/correct use of “null hypothesis”• Effect size estimation• Confidence intervals, Credible intervals and

Reference Intervals (aside: detection limits)• Model selection (multiple competing hypotheses)• Appropriate hybrid methods• Structural equation modeling• Bayesian belief networks

27

Page 28: Contemporary Data Analysis and its Communication

Some examples

28

Page 29: Contemporary Data Analysis and its Communication

Blood Hg in Loons

Erickson and Rattner (2020)

29

Page 30: Contemporary Data Analysis and its Communication

Is difference important?

30

Page 31: Contemporary Data Analysis and its Communication

Confidence intervals with loonsMale loons had higher average blood mercury concentrations than female loons (0.263 μg/g, 95% CI 0.044–0.482; p = 0.0192, t = −2.40, df = 70.97).• Point estimate of effect size, 0.263 μg/g• Uncertainty 95% CI 0.044–0.482• Optionally, t-test results including p, test-statistics, and

degrees of freedom• Biological/Hazard/Risk standpoint...Is 0.263 μg/g important?Biologically... YES...mercury transferred to eggHazard/Risk ... No Simple Answer... approaching level affecting reproduction in loons

31

Page 32: Contemporary Data Analysis and its Communication

Reference Interval (8.3 to 16.5 sec, n=49 AMKE)

32

Page 33: Contemporary Data Analysis and its Communication

Reference Interval (8.3 to 16.5 sec, n=49 AMKE)

33

Page 34: Contemporary Data Analysis and its Communication

Reference Interval (8.3 to 16.5 sec, n=49 AMKE)

34

Page 35: Contemporary Data Analysis and its Communication

Limits of detection for selenium

Erickson and Rattner (2020)

35

Page 36: Contemporary Data Analysis and its Communication

Seven Perilous Errors in Environmental Statistics

Dennis R. Helsel, PracticalStats (practicalstats.com)

Error #7 – Substituting ½ the detection limitsfor nondetects works fine!

Summary: No it doesn't!

Produces a false signal ?

Precarious at low exposure levels36

Page 37: Contemporary Data Analysis and its Communication

Model selection

• Build competing models• Each model is a hypothesis• Compare models using selection• For example, which curve best describes a dose-

response?

37

Page 38: Contemporary Data Analysis and its Communication

Textbook example curves

Erickson and Rattner (2020)

38

Page 39: Contemporary Data Analysis and its Communication

AIC results

39

Page 40: Contemporary Data Analysis and its Communication

Clotting Time Exceeding Baseline vs Chlorophacinone ConsumptionDiet-Based Toxicity Reference Values

40

Page 41: Contemporary Data Analysis and its Communication

Habitat of endangered bats

Erickson et al (2016)

41

Page 42: Contemporary Data Analysis and its Communication

42

Page 43: Contemporary Data Analysis and its Communication

Time-to-event analysis

Historically known as survival analysis

Erickson and Rattner (2020)

43

Page 44: Contemporary Data Analysis and its Communication

Regression analysis

• Model Y ~ βX• If β “different” than 0, X associated with Y• “corrected for” covariates: Y ~ βX + age + site …• Many flavors of regression

• Generalized Linear Model (GLM): e.g., Logistic, Poisson• Generalized Additive Models (GAM): e.g., splines• Hierarchical/nested/mixed-effect/random-

effect/repeated measures models/LMERs/GLMMs• Even ANOVA is a special case of regression!

44

Page 45: Contemporary Data Analysis and its Communication

Example adding context to results

• Field study shows a 10% decrease in recruitment• Does this matter?• Existing knowledge of system, population modeling

45

Page 46: Contemporary Data Analysis and its Communication

46

Energetics and migration modelto assess oil spill effects in birds

Page 47: Contemporary Data Analysis and its Communication

Structural equation modeling

• Construct causal diagram

• Jim Grace, USGS• Example: What

predicts eDNA in a lake (Amberg et al. 2019)

Note inappropriate use of “significant” by Erickson

47

Page 48: Contemporary Data Analysis and its Communication

Bayesian belief networks

• Focused on correlation rather than cause• Example: Hydrology model by Fienen et al.

48

Page 49: Contemporary Data Analysis and its Communication

General approach

• Define what is important (hypothesis, damage, etc.)• Get data to measure important value• Estimate what is important with statistics • Include uncertainty

49

Page 50: Contemporary Data Analysis and its Communication

ATOM

Accept uncertainty.Be thoughtful, open, and modest

Guiding principles

• No “line in the sand”• Understanding uncertainty is key• Think about “why”

50

Page 51: Contemporary Data Analysis and its Communication

How can we help?

• Study plans including proposed statistics• Statistical reviews (e.g., UMESC review study plans)• Reproducible results and workflows• Matching study designs to statistical methods• Comparison of statistical approaches• Other ideas?

51

Page 52: Contemporary Data Analysis and its Communication

Resources• Erickson and Rattner (2020) Moving beyond p<-.05 in ecotoxicology: a

guide for practitioners. DOI: 10.1002/etc.4800 • Code for Erickson and Rattner (2020) Alternatives_NHST · main · UMESC /

introduction_to_r · GitLab (usgs.gov)• Harris et al. (2014) Principles of sound ecotoxicology doi:

10.1021/es4047507• ASA 2016 Statement DOI: 10.1080/00031305.2016.1154108• Wasserstein et al. 2019 https://doi.org/10.1080/00031305.2019.1583913• Helsel (2005) Nondetects and data analysis Statistics for censored

environmental data. John Wiley & Sons, Inc.• Deck and Johnson (2015) Methods for derivation of wildlife toxicity values

for use in ecological risk assessments. https://doi.org/10.1016/B978-0-12-800020-5.00002-8

52