chapter 14 tests of hypotheses based on count data 14.2 tests concerning proportions (large...

46
Chapter 14 Tests of Hypotheses Based on Count Data 14.2 Tests concerning proportions (large samples) 14.3 Differences between proportions 14.4 The analysis of an r x c table

Post on 22-Dec-2015

223 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Chapter 14 Tests of Hypotheses Based on Count Data

14.2 Tests concerning proportions (large samples)

14.3 Differences between proportions 14.4 The analysis of an r x c table

Page 2: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

14.2 Tests concerning proportions (large samples)

np>5; n(1-p)>5 n independent trials; X=# of successes p=probability of a success Estimate:

n

Xp ˆ

Page 3: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Tests of Hypotheses

Null H0: p=p0

Possible Alternatives:

HA: p<p0

HA: p>p0

HA: pp0

Page 4: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Test Statistics

Under H0, p=p0, and

Statistic:

is approximately standard normal under H0 .

Reject H0 if z is too far from 0 in either direction.

n

pp

n

ppp

)1()1( 00ˆ

npp

ppppz

p )1(

ˆˆ

00

0

ˆ

0

Page 5: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Rejection Regions

Alternative Hypotheses

HA: p>p0 HA: p<p0 HA: pp0

Rejection Regions

z>z z<-z z>z/2 or

z<-z/2

Page 6: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Equivalent Form:

0 0

0 0 0 0

ˆ*

(1 ) (1 )

p p X npnz

np p np p

n

Page 7: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Example 14.1

H0: p=0.75 vs HA: p0.75

=0.05 n=300 x=206 Reject H0 if z<-1.96 or z>1.96

68667.0300

206ˆ

n

Xp

Page 8: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Observed z value

Conclusion: reject H0 since z<-1.96 P(z<-2.5 or z>2.5)=0.0124<reject H0.

0.68667 0.752.5

0.75(1 0.75)300

206 2252.5

300(0.75)(1 0.75)

z

or

z

Page 9: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Example 14.2

Toss a coin 100 times and you get 45 heads Estimate p=probability of getting a head

Is the coin balanced one? =0.05

Solution:

H0: p=0.50 vs HA: p0.50

45.0100

45ˆ p

Page 10: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Enough Evidence to Reject H0?

Critical value z0.025=1.96 Reject H0 if z>1.96 or z<-1.96

Conclusion: accept H0

15

5

)50.01)(50.0(100

)50.0(10045

z

Page 11: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Another example

The following table is for a certain screening test

91010Results Negative

80140Result Positive

Cancer Absent

Cancer Present

FNA status

Truth = surgical biopsy

Total

220

920

Total 150 990 1140

True positive 140sensitivity 0.93

True Positives False Negatives 150

Page 12: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Test to see if the sensitivity of the screening test is less than 97%.

Hypothesis

Test statistic

0 0

0

: .97

: .97

H p p

Ha p p

0 0

ˆ 0 0

estimated proportion-prestated proportion

standard error of the estimated proportion

ˆ ˆ 140 150 .972.6325

(1 ) .97 (1 .97)150

p

z

p p p p

SE p p

n

Page 13: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Check p-value when z=-2.6325, p-value = 0.004

Conclusion: we can reject the null hypothesis at level 0.05.

What is the conclusion?

Page 14: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

One word of caution about sample size: If we decrease the sample size by a factor of 10,

911Results Negative

814Result Positive

Cancer Absent

Cancer Present

FNA status

Truth = surgical biopsy

Total

22

92

Total 15 99 114

True positive 14sensitivity 0.93

True Positives False Negatives 15

Page 15: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

And if we try to use the z-test,

0 0

ˆ 0 0

estimated proportion-prestated proportion

standard error of the estimated proportion

ˆ ˆ 14 15 .970.8324

(1 ) .97 (1 .97)15

p

z

p p p p

SE p p

n

P-value is greater than 0.05 for sure (p=0.2026). So we cannot reach the same conclusion.

And this is wrong!

Page 16: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

So for test concerning proportions

We want

np>5; n(1-p)>5

Page 17: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

14.3 Differences Between Proportions

Two drugs (two treatments) p1 =percentage of patients recovered after

taking drug 1 p2 =percentage of patients recovered after

taking drug 2 Compare effectiveness of two drugs

Page 18: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Tests of Hypotheses

Null H0: p1=p2 (p1-p2 =0)

Possible Alternatives:

HA: p1<p2

HA: p1>p2

HA: p1p2

Page 19: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Compare Two Proportions

Drug 1: n1 patients, x1 recovered

Drug 2: n2 patients, x2 recovered

Estimates: Statistic for test:

If we did this study over and over and drew a histogram of the resulting values of , that histogram or distribution would have standard deviation

1 2ˆ ˆp p

2

22

1

11 ˆ;ˆ

n

xp

n

xp

21 ˆˆ

21 ˆˆ

pp

ppz

1 2ˆ ˆp p

Page 20: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Estimating the Standard Error

Under H0, p1=p2=p. So

Estimate the common p by

)11

)(1(21

)1()1(2ˆ

2ˆˆ 2

22

1

11

2121

nnpp

npp

npp

pppp

21

21

21

2211 ˆˆˆ

nn

xx

nn

pnpnp

Page 21: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

So put them together

1 2

1 2

ˆ ˆ

1 1ˆ ˆ(1 )( )

p pz

p pn n

Page 22: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Example 12.3

Two sided test: H0: p1=p2 vs HA: p1p2

n1=80, x1=56

n2=80, x2=38

7.0ˆ1 p 475.0ˆ 2 p 5875.08080

3856ˆ

p

Page 23: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Two Tailed Test

Observed z-value:

Critical value for two-tailed test: 1.96 Conclusion: Reject H0 since |z|>1.96

88.2078.0

225.0

)801

801

(4125.05875.0

475.07.0

z

Page 24: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Rejection Regions

Alternative Hypotheses

HA: p1>p2

HA: p1<p2

HA: p1p2

Rejection Regions

z>z z<-z z>z/2 or

z<-z/2

Page 25: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

P-value of the previous example

P-value=P(z<-2.88)+P(z>2.88)=2*0.004

So not only we can reject H0 at 0.05 level, we can also reject at 0.01 level.

Page 26: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

14.4 The analysis of an r x c table

Recall Example 12.3 Two sided test; H0: p1=p2 vs HA: p1p2

n1=80, x1=56 n2=80, x2=38

We can put this into a 2x2 table and the question now becomes is there a relationship between treatment and outcome? We will come back to this example after we introduce 2x2 tables and chi-square test.

4224

3856recover

Not recover

trt1 trt2

80 80

94

66

Page 27: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

2x2 Contingency Table

The table shows the data from a study of 91 patients who had a myocardial infarction (Snow 1965). One variable is treatment (propranolol versus a placebo), and the other is outcome (survival for at least 28 days versus death within 28 days).

Treatment

1729

738

OUTCOME

Survival for at least 28 days Death

Propranolol

Placebo

Total

45

46

91Total 67 24

Page 28: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Hypotheses for Two-way TablesHypotheses for Two-way Tables

The hypotheses for two-way tables are very “broad stroke”.

• The null hypothesis H0 is simply that there is no association between the row and column variable.

• The alternative hypothesis Ha is that there is an association between the two variables. It doesn’t specify a particular direction and can’t really be described as one-sided or two-sided.

Page 29: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Hypothesis statement in Our Example

Null hypothesis: the method of treating the myocardial infarction patients did not influence the proportion of patients who survived for at least 28 days.

The alternative hypothesis is that the outcome (survival or death) depended on the treatment, meaning that the outcomes was the dependent variable and the treatment was the independent variable.

Page 30: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Calculation of Expected Cell Count

To test the null hypothesis, we compare the observed cell counts (or frequencies) to the expected cell counts (also called the expected frequencies)

The process of comparing the observed counts with the expected counts is called a goodness-of-fit test. (If the chi-square value is small, the fit is good and the null hypothesis is not rejected.)

TotalStudy

TotalColumnTotalRowE

111,1

Page 31: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Observed cell counts

Treatment

1729738

OUTCOMESurvival for at least 28 days

Death

Propranolol

Placebo

Total

45

4691Total 67 24

Treatment

12.1333.87

11.8733.13

OUTCOMESurvival for at least 28 days

Death

Propranolol

Placebo

Total

45

4691Total 67 24

Expected cell counts

Page 32: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

The Chi-Square ( The Chi-Square ( 22) Test Statistic) Test Statistic

The chi-square statistic is a measure of how much the observed cell counts in a two-way table differ from the expected cell counts. It can be used for tables larger than 2 x 2, if the average of the expected cell counts is > 5 and the smallest expected cell count is > 1; and for 2 x 2 tables when all 4 expected cell counts are > 5. The formula is:

2 = (observed count – expected count)2/expected count

Degrees of freedom (df) = (r –1) x (c – 1)

Where “observed” is an observed sample count and “expected” is the computed expected cell count for the same cell, r is the number of rows, c is the number of columns, and the sum () is over all the r x c cells in the table (these do not include the total cells).

Page 33: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

The Chi-Square ( The Chi-Square ( 22) Test Statistic) Test Statistic

Page 34: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

.infarction myocardial of

outcome affect the does lpropranolo that hypothesis ealternativ accept the

and ceindependen of hypothesis null reject thecan or investigat that the

wrong)being of 50about in chance 1(only probablehighly isit is,That

t.significanlly statistica are table22 in this noted results The

:tionInterpreta

025.0

1)1)(1(

38.596.17.0272.013.12

72.23

87.33

72.23

87.11

72.23

13.33

72.2313.12

)87.4(

87.33

)87.4(

87.11

)87.4(

13.33

)87.4(

13.12

)13.1217(

87.33

)87.3329(

87.11

)87.117(

13.33

)13.3338(

)(

value)( square-chi theofn Calculatio

2222

2222

22

2

valuep

CRdf

E

EO

Adobe Acrobat 7.0 Document

Page 35: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Example: Patient Compliance w/ RxExample: Patient Compliance w/ Rx

TreatmentCompliance 10 mg Daily 5 mg bid Total

95%+ 46 40 86

< 95% 4 10 14

Total 50 50100

In a study of 100 patients with hypertension, 50 were randomly allocated to a group prescribed 10 mg lisinopril to be taken once daily, while the other 50 patients were prescribed 5 mg lisinopril to be taken twice daily. At the end of the 60 day study period the patients returned their remaining medication to the research pharmacy. The pharmacy then counted the remaining pills and classified each patient as < 95% or 95%+ compliant with their prescription. The two-way table for Compliance and Treatment was:

Page 36: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Example: Patient Compliance w/ RxExample: Patient Compliance w/ Rx

TreatmentCompliance 10 mg Daily 5 mg bid Total95%+ 460 400 860< 95% 40 100 140Total 500 5001000

TreatmentCompliance 10 mg Daily 5 mg bid Total95%+ 430 430 860< 95% 70 70 140Total 500 5001000

The expected cell counts were:

2 = 29.9, df = (2-1)*(2-1) = 1, P-value <0.001

Page 37: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

If we use the two sample test for proportion

1 2

1 2

1 2

2

2 2(1)

460 400 860ˆ ˆ ˆ, ,

500 500 1000ˆ ˆ

5.468121 1

ˆ ˆ(1 )( )

5.46812 29.9

p p p

p pz

p pn n

z

Page 38: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

The The 22 and and zz Test Statistics Test StatisticsThe comparison of the proportions of “successes” in two populations leads to a 2 x 2 table, so the population proportions can be compared either using the 2 test or the two-sample z test . It really doesn’t matter, because they always give exactly the same result, because the 2 is equal to the square of the z statistic and the chi-square with one degree of freedom 2

(1) critical values are equal to the squares of the corresponding z critical values.• A P-value for the 2 x 2 2 can be found by calculating the square

root of the chi-square, looking that up in Table for P(Z > z) and multiplying by 2, because the chi-square always tests the two-sided alternative.

• For a 2 x 2 table with a one-sided alternative hypothesis the two-sample z statistic would need to be used.

• To test more than two populations the chi-square must be used• The chi-square is the one most often seen in the literature

Page 39: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Summary: Computations for Two-way TablesSummary: Computations for Two-way Tables

1. create the table, including observed cell counts, column and row totals.

2. Find the expected cell counts.

• Determine if a 2 test is appropriate

• Calculate the 2 statistic and number of degrees of freedom

3. Find the approximate P-value

• use Table III chi-square table to find the approximate P-value

• or use z-table and find the two-tailed p-value if it is 2 x 2.

4. Draw conclusions about the association between the row and column variables.

Page 40: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Yates Correction for Continuity

The chi-square test is based on the normal approximation of the binomial distribution (discrete), many statisticians believe a correction for continuity is needed.

It makes little difference if the numbers in the table are large, but in tables with small numbers it is worth doing.

It reduces the size of the chi-square value and so reduces the chance of finding a statistically significant difference, so that correction for continuity makes the test more conservative.

E

EOYates

22 )5.0|(|

Page 41: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

What do we do if the expected values in any of the cells in a 2x2

table is below 5?For example, a sample of teenagers might be divided into male and female on the one hand, and those that are and are not currently dieting on the other. We hypothesize, perhaps, that the proportion of dieting individuals is higher among the women than among the men, and we want to test whether any difference of proportions that we observe is significant. The data might look like this:

men women total

dieting 1 9 10

not dieting

11 3 14

totals 12 12 24

Page 42: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

The question we ask about these data is: knowing that 10 of these 24 teenagers are dieters, what is the probability that these 10 dieters would be so unevenly distributed between the girls and the boys? If we were to choose 10 of the teenagers at random, what is the probability that 9 of them would be among the 12 girls, and only 1 from among the 12 boys?

--Hypergeometric distribution!

--Fisher’s exact test uses hypergeometric distribution to calculate the “exact” probability of obtaining such set of the values.

Page 43: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Fisher’s exact test

Before we proceed with the Fisher test, we first introduce some notation. We represent the cells by the letters a, b, c and d, call the totals across rows and columns marginal totals, and represent the grand total by n. So the table now looks like this:

men women total

dieting a b a + b

not dieting

c d c + d

totals a + c b + d n

Page 44: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

Fisher showed that the probability of obtaining any such set of values was given by the hypergeometric distribution:

                                                                     

men women total

dieting a b a + b

not dieting c d c + d

totals a + c b + d n

Page 45: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

In our example

10!14!12!12!0.00134

24!1!9!11!3!p

Recall that p-value is the probability of observing data as extreme or more extreme if the null hypothesis is true. So the p-value is this problem is 0.00137.

10!14!12!12!0.00003

24!0!10!12!2!p

241212totals

14311not dieting

1091dieting

totalwomenmen

241212totals

14212not dieting

10100dieting

totalwomenmen

As extreme

as observed

More extreme than

observed

Page 46: Chapter 14 Tests of Hypotheses Based on Count Data  14.2 Tests concerning proportions (large samples)  14.3 Differences between proportions  14.4 The

The fisher Exact Probability Test Used when one or more of the expected counts in a contingency

table is small (<2). Fisher's Exact Test is based on exact probabilities from a

specific distribution (the hypergeometric distribution). There's really no lower bound on the amount of data that is

needed for Fisher's Exact Test. You can use Fisher's Exact Test when one of the cells in your table has a zero in it. Fisher's Exact Test is also very useful for highly imbalanced tables. If one or two of the cells in a two by two table have numbers in the thousands and one or two of the other cells has numbers less than 5, you can still use Fisher's Exact Test.

Fisher's Exact Test has no formal test statistic and no critical value, and it only gives you a p-value.