power1

36
1 Power, Power Curves and Sample Size

Upload: pptech

Post on 04-Jan-2016

212 views

Category:

Documents


0 download

DESCRIPTION

aas

TRANSCRIPT

Page 1: power1

1

Power, Power Curves and Sample Size

Page 2: power1

2

Planning Reliable and Efficient Tests

For any job, you need a tool that offers the right amount of power for the task at hand. You wouldn't use a telescope to examine a stamp collection, or a handheld magnifying glass to search for new galaxies, because neither would provide you with meaningful observations. To complicate matters, if detecting a galaxy really was your goal, the cost of gaining the necessary power might be more than you can afford.

Page 3: power1

3

Planning Reliable and Efficient Tests

Anyone using statistical tests faces the same issues. You must consider the precision you need to meet your goals (should your test detect subtle effects or massive shifts?), and balance it against the cost of sampling your population (are you testing toothpicks or jet engines?).

Page 4: power1

4

Planning Reliable and Efficient Tests

You also want the confidence in your results that's appropriate for your situation (testing seat belts demands a greater degree of certainty than testing shampoo). We measure this certainty with statistical power – the probability your test will detect an effect that truly exists.

Page 5: power1

5

Planning Reliable and Efficient Tests

Minitab's Power and Sample Size tools, with Power Curves, help you balance these issues that may compete for your limited resources. Here are three examples of how a quick Power and Sample Size test can help you save time and money getting results you can trust.

Page 6: power1

6

Don't Leave Success to Chance

A paper clip manufacturer wants to detect significant changes in clip length. They sample thousands of clips because it is cheap and quick to do. But this huge sample makes the test too sensitive: the broken line (next slide) shows it will sound the alarm if the average length differs by a trivial amount (0.05).

Page 7: power1

7

Don't Leave Success to Chance

0.300.250.200.150.100.050.00

1.0

0.8

0.6

0.4

0.2

0.0

Difference

Pow

er

Alpha 0.05StDev 1Alternative >

Assumptions

1002000

SizeSample

Power Curve for 1-Sample t Test

Page 8: power1

8

Don't Leave Success to Chance

This Power Curve shows they are wasting resources on excessive precision. A sample size of just 100 will detect meaningful differences (0.25) without “crying wolf” at every negligible blip.

Page 9: power1

9

Don't Leave Success to Chance

Page 10: power1

10

Don't Leave Success to Chance

An aerospace company is designing an experiment to test a new rocket. Each rocket is very expensive, so it is critical to test no more than necessary.

Page 11: power1

11

Don't Leave Success to Chance

1.00.50.0-0.5-1.0

1.0

0.8

0.6

0.4

0.2

0.0

Effect

Pow

er

Alpha 0.05StDev 1# Factors 3# Corner Pts 8# Blocks none# Terms Omitted 0

Assumptions

6, 0Ctr Pts Per BlkReps,

Power Curve for 2-Level Factorial Design

Page 12: power1

12

Don't Leave Success to Chance

This Power Curve confirms an experiment with 6 replicates will give researchers the power they need without spending more than they must.

Page 13: power1

13

Don't Leave Success to Chance

Page 14: power1

14

Don't Leave Success to Chance

“We've always done it this way.” That's why a lumber company would sample 10 beams to test whether their strength meets the target.

Page 15: power1

15

Don't Leave Success to Chance

1.00.50.0-0.5-1.0

1.0

0.8

0.6

0.4

0.2

0.0

Difference

Pow

er

Alpha 0.05StDev 1Alternative Not =

Assumptions

1034

SizeSample

Power Curve for 1-Sample t Test

Page 16: power1

16

Don't Leave Success to Chance

According to the Power Curve, this small sample size made their test incapable of detecting important effects. They must sample 34 beams to detect meaningful differences (0.50).

Page 17: power1

17

Don't Leave Success to Chance

Page 18: power1

18

See the Big Picture

A power analysis helps you weigh your resources against your demands, and quantifies a test's ability to answer your question. It can expose design problems, like the lumber company's insufficient sample size. It can also reveal design solutions you hadn't considered.

Page 19: power1

19

See the Big Picture

Take for instance the packaging plant of a snack company. Customers complain that the company's pretzel bags are sealed with glue that's too strong, so researchers use One-Way ANOVA to compare their current glue with three potential replacements.

Page 20: power1

20

See the Big Picture

Differences in seal strength less than 10 are undetectable to most people, so their test only needs to detect a difference of 10. A power value of 80% is acceptable, but 90% is ideal.

What sample size meets their needs?

Page 21: power1

21

See the Big Picture

30 samples of each glue ensure the test detects a difference of 10 with 90% power.

Page 22: power1

22

See the Big Picture

Or, they could detect the same difference with 23 samples and 80% power. If this represents considerable savings, the researchers may consider using the smaller sample.

Page 23: power1

23

See the Big Picture

The Power Curve illustrates this information, but it also charts every other combination of power and difference for a given sample size.

Page 24: power1

24

See the Big Picture

20151050

1.0

0.8

0.6

0.4

0.2

0.0

Maximum Difference

Pow

er

Alpha 0.05StDev 10# Levels 4

Assumptions

2330

SizeSample

Power Curve for One-way ANOVA

Page 25: power1

25

See the Big Picture

The solid line indicates researchers can attain 90% power with just 23 samples if they are willing to seek a difference of 12 instead of 10. This might just be the ideal choice.

Page 26: power1

26

See the Big Picture

Page 27: power1

27

How to Create Power Curves in Minitab

Performing power analyses with Power Curves couldn't be simpler. You supply the factors you know, and Minitab calculates the one you omit.

Page 28: power1

28

How to Create Power Curves in Minitab

Suppose a trainer wants to compare two training courses for forklift operators.

She will use a two-sample t-test to compare the average scores that operators from each course earned on the final exam.

Page 29: power1

29

How to Create Power Curves in Minitab

She knows she must detect a difference of 5 in either direction between the two courses with 80% power, and historical data suggest a standard deviation of 5.

But how many participants must she sample from each course?

Page 30: power1

30

How to Create Power Curves in Minitab

Choose Stat > Power and Sample Size

> 2-Sample t In Differences, type -5, 5 In Power values, type 0.80 In Standard deviation, type 5 Click OK

Page 31: power1

31

How to Create Power Curves in Minitab

Page 32: power1

32

How to Create Power Curves in Minitab

6420-2-4-6

1.0

0.8

0.6

0.4

0.2

0.0

Difference

Pow

er

Alpha 0.05StDev 5Alternative Not =

Assumptions

17Size

Sample

Power Curve for 2-Sample t Test

Page 33: power1

33

How to Create Power Curves in Minitab

According to this Power Curve, sampling 17 participants from each class enables her test to detect the difference she seeks with 80% power.

Page 34: power1

34

How to Create Power Curves in Minitab

Page 35: power1

35

Putting Power Curves to Use

Without knowing the power of your test, it's hard to know if you can trust your results: your test could be too weak to answer your question, or too strong for your needs. Minitab's Power Curves (available for many common statistical procedures) help you balance your resources against your goals and design a test you can trust that costs no more than necessary.

Page 36: power1

36

Putting Power Curves to Use

Power Curves graph the dynamic relationships that define power, revealing the big picture and ensuring no option escapes your consideration. And perhaps most importantly, they make power analysis an easier and more accessible part of every project. Empower your test. Trust your results.