© 2003 prentice-hall, inc.chap 8-1 business statistics: a first course (3 rd edition) chapter 8...

65
© 2003 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

Upload: dale-horn

Post on 19-Jan-2016

217 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-1

Business Statistics: A First Course

(3rd Edition)

Chapter 8Hypothesis Tests for Numerical

Data from Two or More Samples

Page 2: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-2

Chapter Topics

Comparing Two Independent Samples Independent samples Z Test for the

difference in two means Pooled-variance t Test for the difference in

two means F Test for the Difference in Two Variances Comparing Two Related Samples

Paired-sample Z test for the mean difference Paired-sample t test for the mean difference

Page 3: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-3

The Completely Randomized Design: One-Way Analysis of Variance

ANOVA Assumptions F Test for Difference in More than Two

Means The Tukey-Kramer Procedure

Chapter Topics(continued)

Page 4: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-4

Comparing Two Independent Samples

Different Data Sources Unrelated Independent

Sample selected from one population has no effect or bearing on the sample selected from the other population

Use the Difference between 2 Sample Means

Use Z Test or Pooled-Variance t Test

Page 5: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-5

Independent Sample Z Test (Variances Known)

Assumptions Samples are randomly and independently

drawn from normal distributions Population variances are known

Test Statistic

1 2 1

2 2

1 2

( ) ( )X XZ

n n

Page 6: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-6

Independent Sample (Two Sample) Z Test in EXCEL

Independent Sample Z Test with Variances Known Tools | Data Analysis | z-test: Two Sample

for Means

Page 7: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-7

Pooled-Variance t Test (Variances Unknown)

Assumptions Both populations are normally distributed Samples are randomly and independently

drawn Population variances are unknown but

assumed equal If both populations are not normal, need

large sample sizes

Page 8: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-8

Developing the Pooled-Variance

t Test

Setting Up the Hypotheses

H0: 1 2

H1: 1 > 2

H0: 1 -2 = 0

H1: 1 - 2

0

H0: 1 = 2

H1: 1 2

H0: 1

2

H0: 1 - 2 0

H1: 1 - 2 > 0

H0: 1 - 2

H1: 1 -

2 < 0

OR

OR

OR Left Tail

Right Tail

Two Tail

H1: 1 < 2

Page 9: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-9

Developing the Pooled-Variance

t Test

Calculate the Pooled Sample Variance as an Estimate of the Common Population Variance

(continued)

2 22 1 1 2 2

1 2

21

21 2

22

( 1) ( 1)

( 1) ( 1)

: Pooled sample variance : Size of sample 1

: Variance of sample 1 : Size of sample 2

: Variance of sample 2

p

p

n S n SS

n n

S n

S n

S

Page 10: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-10

Developing the Pooled-Variance

t Test

Compute the Sample Statistic

(continued)

1 2 1 2

2

1 2

2 21 1 2 22

1 2

1 1

1 1

1 1

p

p

X Xt

Sn n

n S n SS

n n

Hypothesized

difference1 2 2df n n

Page 11: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-11

Pooled-Variance t Test: Example

You’re a financial analyst for Charles Schwab. Is there a difference in dividend yield between stocks listed on the NYSE & NASDAQ? You collect the following data:

NYSE NASDAQNumber 21 25Sample Mean 3.27 2.53Sample Std Dev 1.30 1.16

Assuming equal variances, isthere a difference in average yield (= 0.05)?

© 1984-1994 T/Maker Co.

Page 12: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-12

Calculating the Test Statistic

1 2 1 2

2

1 2

2 22 1 1 2 2

1 2

2 2

3.27 2.53 02.03

1 11 1 1.51021 25

1 1

1 1

21 1 1.30 25 1 1.16 1.502

21 1 25 1

p

p

X Xt

Sn n

n S n SS

n n

Page 13: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-13

Solution

H0: 1 - 2 = 0 i.e. (1 = 2)

H1: 1 - 2 0 i.e. (12)

= 0.05

df = 21 + 25 - 2 = 44

Critical Value(s):

Test Statistic:

Decision:

Conclusion:

Reject at = 0.05

There is evidence of a difference in means.

t0 2.0154-2.0154

.025

Reject H0 Reject H0

.025

2.03

3.27 2.532.03

1 11.502

21 25

t

Page 14: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-14

p -Value Solution

p-Value 2

(p-Value is between .02 and .05) < ( = 0.05). Reject.

02.03

Z

Reject

2.0154

is between .01 and .025

Test Statistic 2.03 is in the Reject Region

Reject

-2.0154

=.025

Page 15: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-15

Pooled-Variance t Test in PHStat and Excel

If the Raw Data are Available Tools | Data Analysis | t-Test: Two Sample

Assuming Equal Variances If only Summary Statistics are Available

PHStat | Two-Sample Tests | t Test for Differences in Two Means...

Page 16: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-16

Solution in EXCEL

Excel Workbook that Performs the Pooled-Variance t Test

Microsoft Excel Worksheet

Page 17: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-17

Confidence Interval Estimate for of Two Independent Groups

Assumptions Both populations are normally distributed Samples are randomly and independently

drawn Population variances are unknown but

assumed equal If both populations are not normal, need large

sample sizes Confidence Interval Estimate:

1 2

100 1 %

1 2

21 2 / 2, 2

1 2

1 1n n pX X t S

n n

Page 18: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-18

Example

© 1984-1994 T/Maker Co.

You’re a financial analyst for Charles Schwab. You collect the following data:

NYSE NASDAQNumber 21 25Sample Mean 3.27 2.53Sample Std Dev 1.30 1.16

You want to construct a 95% confidence interval for the difference in population average yields of the stocks listed on NYSE and NASDAQ.

Page 19: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-19

Example: Solution

1 2

21 2 / 2, 2

1 2

1 1n n pX X t S

n n

1 13.27 2.53 2.0154 1.502

21 25

1 20.0088 1.4712

2 21 1 2 22

1 2

2 2

1 1

1 1

21 1 1.30 25 1 1.16 1.502

21 1 25 1

p

n S n SS

n n

Page 20: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-20

Solution in Excel

An Excel Spreadsheet with the Solution:

Microsoft Excel Worksheet

Page 21: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-21

F Test for Difference in Two Population Variances

Test for the Difference in 2 Independent Populations

Parametric Test Procedure Assumptions

Both populations are normally distributed Test is not robust to this violation

Samples are randomly and independently drawn

Page 22: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-22

The F Test Statistic

= Variance of Sample 1

n1 - 1 = degrees of freedom

n2 - 1 = degrees of freedom

F 0

21S

22S = Variance of Sample 2

2122

SF

S

Page 23: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-23

Hypotheses H0:1

2 = 22

H1: 12 2

2 Test Statistic

F = S12 /S2

2

Two Sets of Degrees of Freedom df1 = n1 - 1; df2 = n2 - 1

Critical Values: FL( ) and FU( )

FL = 1/FU* (*degrees of freedom switched)

Developing the F Test

Reject H0

Reject H0

/2/2Do NotReject

F 0 FL FU

n1 -1, n2 -1 n1 -1 , n2 -1

Page 24: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-24

F Test: An Example

Assume you are a financial analyst for Charles Schwab. You want to compare dividend yields between stocks listed on the NYSE & NASDAQ. You collect the following data:

NYSE NASDAQNumber 21 25Mean 3.27 2.53Std Dev 1.30 1.16

Is there a difference in the variances between the NYSE & NASDAQ at the 0.05 level?

© 1984-1994 T/Maker Co.

Page 25: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-25

F Test: Example Solution

Finding the Critical Values for = .05

20,24 24,20

20,24

1/ 1/ 2.41 .415

2.33

L U

U

F F

F

1 1

2 2

1 21 1 20

1 25 1 24

df n

df n

Page 26: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-26

F Test: Example Solution

H0: 12 = 2

2

H1: 12 2

2

.05 df1 20 df2 24

Critical Value(s):

Test Statistic:

Decision:

Conclusion:

Do not reject at = 0.05

There is insufficient evidence to prove a difference in variances.

0 F2.330.415

.025

Reject Reject

.025

2 212 22

1.301.25

1.16

SF

S

1.25

Page 27: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-27

F Test in PHStat

PHStat | Two-Sample Tests | F Test for Differences in Two Variances

Example in Excel Spreadsheet

Microsoft Excel Worksheet

Page 28: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-28

F Test: One-Tail

H0: 12 2

2

H1: 12 2

2

H0: 12 2

2

H1: 12 > 2

2

Reject

.05

F0 F0

Reject

.05

= .05

or

Degrees of freedom switched

1 2

2 1

1, 11, 1

1L n n

U n n

FF

1 21, 1U n nF 1 21, 1L n nF

Page 29: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-29

Comparing Two Related Samples

Test the Means of Two Related Samples Paired or matched Repeated measures (before and after) Use difference between pairs

Eliminates Variation between Subjects 1 2i i iD X X

Page 30: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-30

Z Test for Mean Difference (Variance Known)

Assumptions Both populations are normally distributed Observations are paired or matched Variance Known

Test Statistic

D

D

DZ

n

1

n

ii

DD

n

Page 31: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-31

t Test for Mean Difference (Variance Unknown)

Assumptions Both populations are normally distributed Observations are matched or paired Variance unknown If population not normal, need large samples

Test Statistic

D

D

Dt

S

n

2

1

( )

1

n

ii

D

D DS

n

1

n

ii

DD

n

Page 32: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-32

Existing System (1) New Software (2) Difference Di

9.98 Seconds 9.88 Seconds .109.88 9.86 .029.84 9.75 .099.99 9.80 .199.94 9.87 .079.84 9.84 .009.86 9.87 - .0110.12 9.98 .149.90 9.83 .079.91 9.86 .05

Paired-Sample t Test: Example

Assume you work in the finance department. Is the new financial package faster (=0.05 level)? You collect the following processing times:

2

.072

1 .06215

i

iD

DD

n

D DS

n

Page 33: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-33

Paired-Sample t Test: Example Solution

Is the new financial package faster (0.05 level)?

.072D =

H0: D H1: D

Test Statistic

Critical Value=1.8331 df = n - 1 = 9

Reject

1.8331

Decision: Reject H0

t Stat. in the rejection zone.Conclusion: The new software package is faster.

3.66

t

.072 03.66

/ .06215/ 10D

D

Dt

S n

Page 34: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-34

Paired-Sample t Test in EXCEL

Tools | Data Analysis… | t-test: Paired Two Sample for Means

Example in Excel Spreadsheet

Microsoft Excel Worksheet

Page 35: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-35

General Experimental Setting

Investigator Controls One or More Independent Variables Called treatment variables or factors Each treatment factor contains two or more

groups (or levels) Observe Effects on Dependent Variable

Response to groups (or levels) of independent variable

Experimental Design: The Plan Used to Test Hypothesis

Page 36: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-36

Completely Randomized Design

Experimental Units (Subjects) are Assigned Randomly to Groups Subjects are assumed homogeneous

Only One Factor or Independent Variable With 2 or more groups (or levels)

Analyzed by One-way Analysis of Variance (ANOVA)

Page 37: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-37

Factor (Training Method)

Factor Levels

(Groups)

Randomly Assigned

Units

Dependent Variable

(Response)

21 hrs 17 hrs 31 hrs

27 hrs 25 hrs 28 hrs

29 hrs 20 hrs 22 hrs

Randomized Design Example

Page 38: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-38

One-way Analysis of VarianceF Test

Evaluate the Difference among the Mean Responses of 2 or More (c ) Populations E.g. Several types of tires, oven temperature

settings Assumptions

Samples are randomly and independently drawn This condition must be met

Populations are normally distributed F Test is robust to moderate departure from

normality Populations have equal variances

Less sensitive to this requirement when samples are of equal size from each population

Page 39: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-39

Why ANOVA? Could Compare the Means One by One

using Z or t Tests for Difference of Means Each Z or t Test Contains Type I Error The Total Type I Error with k Pairs of

Means is 1- (1 - ) k

E.g. If there are 5 means and use = .05 Must perform 10 comparisons Type I Error is 1 – (.95) 10 = .40 40% of the time you will reject the null

hypothesis of equal means in favor of the alternative when the null is true!

Page 40: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-40

Hypotheses of One-Way ANOVA

All population means are equal No treatment effect (no variation in means

among groups)

At least one population mean is different

(others may be the same!) There is a treatment effect Does not mean that all population means are

different

0 1 2: cH

1 : Not all are the sameiH

Page 41: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-41

One-way ANOVA (No Treatment Effect)

The Null Hypothesis is True

0 1 2: cH

1 : Not all are the sameiH

1 2 3

Page 42: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-42

One-way ANOVA (Treatment Effect Present)

The Null Hypothesis is

NOT True

0 1 2: cH

1 : Not all are the sameiH

1 2 3 1 2 3

Page 43: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-43

One-way ANOVA(Partition of Total Variation)

Variation Due to Treatment SSA

Variation Due to Treatment SSA

Variation Due to Random Sampling SSW

Variation Due to Random Sampling SSW

Total Variation SSTTotal Variation SST

Commonly referred to as: Within Group Variation Sum of Squares Within Sum of Squares Error Sum of Squares

Unexplained

Commonly referred to as: Among Group Variation Sum of Squares Among Sum of Squares Between Sum of Squares Model Sum of Squares

Explained Sum of Squares

Treatment

= +

Page 44: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-44

Total Variation

2

1 1

1 1

( )

: the -th observation in group

: the number of observations in group

: the total number of observations in all groups

: the number of groups

the over

j

j

nc

ijj i

ij

j

nc

ijj i

SST X X

X i j

n j

n

c

X

Xn

all or grand mean

Page 45: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-45

Total Variation(continued)

2 2 2

11 21 cn cSST X X X X X X

X

Response, X

Group 1 Group 2 Group 3

Page 46: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-46

2

1

( )c

j jj

SSA n X X

Among-Group Variation

Variation Due to Differences Among Groups.

1

SSAMSA

c

i j

: The sample mean of group

: The overall or grand mean

jX j

X

Page 47: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-47

Among-Group Variation(continued)

2 2 2

1 1 2 2 c cSSA n X X n X X n X X

X1X 2X

3X

Response, X

Group 1 Group 2 Group 3

Page 48: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-48

Summing the variation within each group and then adding over all groups.

: The sample mean of group

: The -th observation in group

j

ij

X j

X i j

Within-Group Variation

2

1 1

( )jnc

ij jj i

SSW X X

SSWMSW

n c

j

Page 49: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-49

Within-Group Variation(continued)

22 2

11 1 21 1 cn c cSSW X X X X X X

1X 2X3X

X

Response, X

Group 1 Group 2 Group 3

Page 50: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-50

Within-Group Variation(continued)

2 2 21 1 2 2

1 2

( 1) ( 1) ( 1)

( 1) ( 1) ( 1)c c

c

SSWMSW

n c

n S n S n S

n n n

For c = 2, this is the pooled-variance in the t-Test.

•If more than 2 groups, use F Test.

•For 2 groups, use t-Test. F Test more limited.

j

Page 51: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-51

One-way ANOVAF Test Statistic

Test Statistic

MSA is mean squares among MSW is mean squares within

Degrees of Freedom

MSAF

MSW

1 1df c 2df n c

Page 52: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-52

One-way ANOVA Summary Table

Source of

Variation

Degrees of

Freedom

Sum ofSquares

Mean Squares

(Variance)

FStatistic

Among(Factor)

c – 1 SSAMSA =

SSA/(c – 1 )MSA/MSW

Within(Error)

n – c SSWMSW =

SSW/(n – c )

Total n – 1SST =SSA + SSW

Page 53: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-53

Features of One-way ANOVA F Statistic

The F Statistic is the Ratio of the Among Estimate of Variance and the Within Estimate of Variance The ratio must always be positive df1 = c -1 will typically be small df2 = n - c will typically be large

The Ratio Should be Close to 1 if the Null is True

Page 54: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-54

Features of One-way ANOVA F Statistic

If the Null Hypothesis is False The numerator should be greater than the

denominator The ratio should be larger than 1

(continued)

Page 55: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-55

One-way ANOVA F Test Example

As production manager, you want to see if 3 filling machines have different mean filling times. You assign 15 similarly trained and experienced workers, 5 per machine, to the machines. At the .05 significance level, is there a difference in mean filling times?

Machine1 Machine2 Machine325.40 23.40 20.0026.31 21.80 22.2024.10 23.50 19.7523.74 22.75 20.6025.10 21.60 20.40

Page 56: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-56

One-way ANOVA Example: Scatter Diagram

27

26

25

24

23

22

21

20

19

••

•••

•••••

••••

Time in SecondsMachine1 Machine2 Machine325.40 23.40 20.0026.31 21.80 22.2024.10 23.50 19.7523.74 22.75 20.6025.10 21.60 20.40

1 2

3

24.93 22.61

20.59 22.71

X X

X X

1X

2X

3X

X

Page 57: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-57

One-way ANOVA Example Computations

Machine1 Machine2 Machine3

25.40 23.40 20.0026.31 21.80 22.2024.10 23.50 19.7523.74 22.75 20.6025.10 21.60 20.40

2 2 25 24.93 22.71 22.61 22.71 20.59 22.71

47.164

4.2592 3.112 3.682 11.0532

/( -1) 47.16 / 2 23.5820

/( - ) 11.0532 /12 .9211

SSA

SSW

MSA SSA c

MSW SSW n c

1

2

3

24.93

22.61

20.59

22.71

X

X

X

X

5

3

15

jn

c

n

Page 58: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-58

Summary Table

Source of

Variation

Degrees of

Freedom

Sum ofSquares

Mean Squares

(Variance)

FStatistic

Among(Factor)

3-1=2 47.1640 23.5820MSA/MSW

=25.60

Within(Error)

15-3=12

11.0532 .9211

Total15-

1=1458.2172

Page 59: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-59

One-way ANOVA Example Solution

F0 3.89

H0: 1 = 2 = 3

H1: Not All Equal = .05df1= 2 df2 = 12

Critical Value(s):

Test Statistic:

Decision:

Conclusion:

Reject at = 0.05

There is evidence that at least one i differs from the rest.

= 0.05

FMSA

MSW

23 5820

921125 6

.

..

Page 60: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-60

Solution In EXCEL

Use Tools | Data Analysis | ANOVA: Single Factor

EXCEL Worksheet that Performs the One-way ANOVA of the example

Microsoft Excel Worksheet

Page 61: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-61

The Tukey-Kramer Procedure

Tells which Population Means are Significantly Different e.g., 1 = 2 3

2 groups whose means may be significantly different

Post Hoc (a posteriori) Procedure Done after rejection of equal means in ANOVA

Pairwise Comparisons Compare absolute mean differences with

critical range

X

f(X)

1 = 2

3

Page 62: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-62

The Tukey-Kramer Procedure: Example

1. Compute absolute mean differences:Machine1 Machine2 Machine3

25.40 23.40 20.0026.31 21.80 22.2024.10 23.50 19.7523.74 22.75 20.6025.10 21.60 20.40

1 2

1 3

2 3

24.93 22.61 2.32

24.93 20.59 4.34

22.61 20.59 2.02

X X

X X

X X

2. Compute Critical Range:

3. All of the absolute mean differences are greater than the critical range. There is a significance difference between each pair of means at the 5% level of significance.

( , )'

1 1Critical Range 1.618

2U c n cj j

MSWQ

n n

Page 63: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-63

Solution in PHStat

Use PHStat | c-Sample Tests | Tukey-Kramer Procedure …

EXCEL Worksheet that Performs the Tukey-Kramer Procedure for the Previous example

Microsoft Excel Worksheet

Page 64: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-64

Chapter Summary Compared Two Independent Samples

Performed Z test for the differences in two means

Performed t test for the differences in two means

Addressed F Test for Difference in two Variances

Compared Two Related Samples Performed paired sample Z tests for the mean

difference Performed paired sample t tests for the mean

difference

Page 65: © 2003 Prentice-Hall, Inc.Chap 8-1 Business Statistics: A First Course (3 rd Edition) Chapter 8 Hypothesis Tests for Numerical Data from Two or More Samples

© 2003 Prentice-Hall, Inc. Chap 8-65

Chapter Summary

Described The Completely Randomized Design: One-way Analysis of Variance ANOVA Assumptions F Test for Difference in c Means The Tukey-Kramer Procedure

(continued)