chapter 10 statistical inferences about means and proportions with two populations

Chap 11-1

Chapter 10

Statistical Inferences About Meansand Proportions with Two Populations

Statistics for Business and Economics

9th Edition

Chap 10-2

Two Sample Tests

Two Sample Tests

Population Means,

Independent Samples

Population Means,

Matched Pairs

Group 1 vs. independent Group 2

Same group before vs. after treatment

Examples:

Population Proportions

Proportion 1 vs. Proportion 2

IntervalInterval

EstimationEstimation

Hypothesis Hypothesis TestsTests

Chap 10-3

Difference Between Two Means

Population means, independent

samples

Test statistic is a z value

Test statistic is a a value from the Student’s t distribution

σx2 and σy

2 assumed equal

σx2 and σy

2 known

σx2 and σy

2 unknown

σx2 and σy

2 assumed unequal

小样本

大样本

Chap 10-4



samples

Goal: Form a confidence interval for the difference between two population means, μx – μy

Different data sources Unrelated Independent

Sample selected from one population has no effect on the sample selected from the other population

Chap 10-5


samples

σx2 and σy

2 Known

Assumptions:

Samples are randomly and independently drawn

both population distributions are normal

Population variances are known

*σx2 and σy

2 known

σx2 and σy

2 unknown

Chap 10-6

Estimating the Difference BetweenEstimating the Difference BetweenTwo Population MeansTwo Population Means

Let Let 11 equal the mean of population 1 and equal the mean of population 1 and 22 equalequal

the mean of population 2.the mean of population 2. The difference between the two population The difference between the two population means ismeans is 11 - - 22.. To estimate To estimate 11 - - 22, we will select a simple , we will select a simple randomrandom

sample of size sample of size nn11 from population 1 and a from population 1 and a simplesimple

random sample of size random sample of size nn22 from population 2. from population 2. Let equal the mean of sample 1 and Let equal the mean of sample 1 and

equal theequal the

mean of sample 2.mean of sample 2.

x1x1 x2x2

The point estimator of the difference between The point estimator of the difference between thethe

means of the populations 1 and 2 is .means of the populations 1 and 2 is .x x1 2x x1 2

Chap 10-7

Expected ValueExpected Value

Sampling Distribution of Sampling Distribution of x x1 2x x1 2

E x x( )1 2 1 2 E x x( )1 2 1 2

Standard Deviation (Standard Error)Standard Deviation (Standard Error)

x x n n1 2

12

1

22

2

x x n n1 2

12

1

22

2

where: where: 1 1 = standard deviation of population 1 = standard deviation of population 1

2 2 = standard deviation of population 2 = standard deviation of population 2

nn1 1 = sample size from population 1= sample size from population 1

nn22 = sample size from population 2 = sample size from population 2

Chap 10-8

Interval Estimate

Interval Estimation of 1 - 2: 1 and 2 Known

2 21 2

1 2 / 21 2

x x zn n

2 21 2

1 2 / 21 2

x x zn n

where:where:

1 - 1 - is the confidence coefficient is the confidence coefficient

Chap 10-9



samples

Test statistic is a z value

Test statistic is a a value from the Student’s t distribution

σx2 and σy

2 assumed equal

σx2 and σy

2 known

σx2 and σy

2 unknown

σx2 and σy

2 assumed unequal

(continued)

小样本

大样本

Chap 10-10


samples

σx2 and σy

2 Known

Assumptions:

Samples are randomly and independently drawn

both population distributions are normal

Population variances are known

*σx2 and σy

2 known

σx2 and σy

2 unknown

Chap 10-11


samples

…and the random variable

has a standard normal distribution

When σx2 and σy

2 are known and

both populations are normal, the variance of X – Y is

y

2y

x

2x2

YX n

σ

n

σσ

(continued)

*

Y

2y

X

2x

YX

n

σ

nσ

)μ(μ)yx(Z

σx2 and σy

2 known

σx2 and σy

2 unknown

σx2 and σy

2 Known

Chap 10-12


samples

Test Statistic, σx

2 and σy2 Known

*σx2 and σy

2 known

σx2 and σy

2 unknown

y

2y

x

2x

0

n

σ

nσ

Dyxz

The test statistic for

μx – μy is:

Chap 10-13

Interval Estimate

Interval Estimation of 1 - 2: 1 and 2 Known

2 21 2

1 2 /21 2

x x zn n

2 21 2

1 2 /21 2

x x zn n

where:where:

1 - 1 - is the confidence coefficient is the confidence coefficient

Chap 10-14

Hypothesis Tests forTwo Population Means

Lower-tail test:

H0: μx μy

H1: μx < μy

i.e.,

H0: μx – μy 0H1: μx – μy < 0

Upper-tail test:

H0: μx ≤ μy

H1: μx > μy

i.e.,

H0: μx – μy ≤ 0H1: μx – μy > 0

Two-tail test:

H0: μx = μy

H1: μx ≠ μy

i.e.,

H0: μx – μy = 0H1: μx – μy ≠ 0

Two Population Means, Independent Samples

Chap 10-15

Two Population Means, Independent Samples, Variances Known

Lower-tail test:

H0: μ1 – μ2 0H1: μ1 – μ2 < 0

Upper-tail test:

H0: μ1 – μ2 ≤ 0H1: μ1 – μ2 > 0

Two-tail test:

H0: μ1 – μ2= 0H1: μ1 – μy ≠ 0

/2 /2

-z -z/2z z/2

Reject H0 if z < -z Reject H0 if z > z Reject H0 if z < -z/2

or z > z/2

Decision Rules

Chap 10-16

Interval Estimation of Interval Estimation of 11 - - 22:: 1 1 and and 2 2 Known Known

In a test of driving distance using a In a test of driving distance using a mechanicalmechanical

driving device, a sample of Par golf balls wasdriving device, a sample of Par golf balls was

compared with a sample of golf balls made by compared with a sample of golf balls made by Rap,Rap,

Ltd., a competitor. The sample statistics appear Ltd., a competitor. The sample statistics appear on theon the

next slide.next slide.

Par, Inc. is a manufacturerPar, Inc. is a manufacturer

of golf equipment and hasof golf equipment and has

developed a new golf balldeveloped a new golf ball

that has been designed tothat has been designed to

provide “extra distance.”provide “extra distance.”

Example: Par, Inc.

Chap 10-17

Example: Par, Inc.


Sample SizeSample Size

Sample MeanSample Mean

Sample #1Sample #1Par, Inc.Par, Inc.

Sample #2Sample #2Rap, Ltd.Rap, Ltd.

120 balls120 balls 80 balls80 balls

275 yards 258 yards275 yards 258 yards

Based on data from previous driving distanceBased on data from previous driving distancetests, the two population standard deviations aretests, the two population standard deviations areknown with known with 1 1 = 15 yards and = 15 yards and 2 2 = 20 yards. = 20 yards.

Chap 10-18


Example: Par, Inc.

Let us develop a 95% confidence interval Let us develop a 95% confidence interval estimateestimate

of the difference between the mean driving of the difference between the mean driving distances ofdistances of

the two brands of golf ball.the two brands of golf ball.

Chap 10-19

Estimating the Difference BetweenEstimating the Difference BetweenTwo Population MeansTwo Population Means

mm11 – – 22 = difference between= difference between the mean distancesthe mean distances

xx11 - - xx22 = Point Estimate of = Point Estimate of mm11 – – 22

Population 1Population 1Par, Inc. Golf BallsPar, Inc. Golf Balls

11 = mean driving = mean driving distance of Pardistance of Par

golf ballsgolf balls

Population 1Population 1Par, Inc. Golf BallsPar, Inc. Golf Balls

11 = mean driving = mean driving distance of Pardistance of Par


Population 2Population 2Rap, Ltd. Golf BallsRap, Ltd. Golf Balls

22 = mean driving = mean driving distance of Rapdistance of Rap


Population 2Population 2Rap, Ltd. Golf BallsRap, Ltd. Golf Balls

22 = mean driving = mean driving distance of Rapdistance of Rap


Simple random sampleSimple random sample of of nn22 Rap golf balls Rap golf balls

xx22 = sample mean distance = sample mean distance for the Rap golf ballsfor the Rap golf balls

Simple random sampleSimple random sample of of nn22 Rap golf balls Rap golf balls

xx22 = sample mean distance = sample mean distance for the Rap golf ballsfor the Rap golf balls

Simple random sampleSimple random sample of of nn11 Par golf balls Par golf balls

xx11 = sample mean distance = sample mean distance for the Par golf ballsfor the Par golf balls

Simple random sampleSimple random sample of of nn11 Par golf balls Par golf balls

xx11 = sample mean distance = sample mean distance for the Par golf ballsfor the Par golf balls

Chap 10-20

Point Estimate of Point Estimate of 11 - - 22

Point estimate of Point estimate of 11 2 2 ==x x1 2x x1 2

where:where:

11 = mean distance for the population = mean distance for the population of Par, Inc. golf ballsof Par, Inc. golf balls

22 = mean distance for the population = mean distance for the population of Rap, Ltd. golf ballsof Rap, Ltd. golf balls

= 275 = 275 258 258

= 17 yards= 17 yards

Chap 10-21

x x zn n1 2 212

1

22

2

2 2

17 1 9615120

2080

/ .( ) ( )

x x zn n1 2 212

1

22

2

2 2

17 1 9615120

2080

/ .( ) ( )

Interval Estimation of 1 - 2:1 and 2 Known

We are 95% confident that the difference betweenWe are 95% confident that the difference betweenthe mean driving distances of Par, Inc. balls and Rap,the mean driving distances of Par, Inc. balls and Rap,Ltd. balls is 11.86 to 22.14 yards.Ltd. balls is 11.86 to 22.14 yards.

17 17 ++ 5.14 or 11.86 yards to 22.14 yards 5.14 or 11.86 yards to 22.14 yards

Chap 10-22

Example: Par, Inc.Example: Par, Inc.

Hypothesis Tests About 1 2: 1 and 2 Known

Can we conclude, usingCan we conclude, using

= .01, that the mean driving= .01, that the mean driving

distance of Par, Inc. golf ballsdistance of Par, Inc. golf balls

is greater than the mean drivingis greater than the mean driving

distance of Rap, Ltd. golf balls?distance of Rap, Ltd. golf balls?

Chap 10-23

HH00: : 1 1 - - 22 << 0 0

HHaa: : 1 1 - - 22 > 0 > 0where: where: 11 = mean distance for the population = mean distance for the population of Par, Inc. golf ballsof Par, Inc. golf balls22 = mean distance for the population = mean distance for the population of Rap, Ltd. golf ballsof Rap, Ltd. golf balls

1. Develop the hypotheses.1. Develop the hypotheses.

pp –Value and Critical Value Approaches –Value and Critical Value Approaches


2. Specify the level of significance.2. Specify the level of significance. = .01= .01

Chap 10-24

3. Compute the value of the test statistic.3. Compute the value of the test statistic.


pp –Value and Critical Value Approaches –Value and Critical Value Approaches

1 2 0

2 21 2

1 2

( )x x Dz

n n

1 2 0

2 21 2

1 2

( )x x Dz

n n

2 2

(235 218) 0 17 6.49

2.62(15) (20)120 80

z

2 2

(235 218) 0 17 6.49

2.62(15) (20)120 80

z

Chap 10-25

p p –Value Approach–Value Approach

4. Compute the 4. Compute the pp–value.–value.

For For zz = 6.49, the = 6.49, the pp –value < .0001. –value < .0001.


5. Determine whether to reject 5. Determine whether to reject HH00..

Because Because pp–value –value << = .01, we reject = .01, we reject HH00..

At the .01 level of significance, the sample At the .01 level of significance, the sample evidenceevidenceindicates the mean driving distance of Par, Inc. indicates the mean driving distance of Par, Inc. golfgolfballs is greater than the mean driving distance balls is greater than the mean driving distance of Rap,of Rap,Ltd. golf balls.Ltd. golf balls.

Chap 10-26


5. Determine whether to reject 5. Determine whether to reject HH00..

Because Because zz = 6.49 = 6.49 >> 2.33, we reject 2.33, we reject HH00..

Critical Value ApproachCritical Value Approach

For For = .01, = .01, zz.01.01 = 2.33 = 2.33

4. Determine the critical value and rejection rule.4. Determine the critical value and rejection rule.

Reject Reject HH00 if if zz >> 2.33 2.33

The sample evidence indicates the mean The sample evidence indicates the mean drivingdrivingdistance of Par, Inc. golf balls is greater than distance of Par, Inc. golf balls is greater than the meanthe meandriving distance of Rap, Ltd. golf balls.driving distance of Rap, Ltd. golf balls.

Chap 10-27


samples

σx2 and σy

2 Unknown,

*大样本

σx2 and σy

2 known

σx2 and σy

2 unknown

小样本

Chap 10-28

*

Test Statistic, σx

2 and σy2 Unknown, Equal

σx2 and σy

2 assumed equal

σx2 and σy

2 unknown

σx2 and σy

2 assumed unequal

大样本

（ nx>=30 且 ny>=30 ）

小样本（ nx<30 或 ny<30 ）

22

/21 2

yxx y

ssx x Z

n n 22

/21 2

yxx y

ssx x Z

n n

Chap 10-29

*

Test Statistic, σx


σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

Assumptions: Samples are randomly and independently drawn

Populations are normally distributed

Population variances are unknown but assumed equal

Chap 10-30

*

Test Statistic, σx


σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

Forming interval estimates:

The population variances are assumed equal, so use the two sample standard deviations and pool them to estimate σ

use a t value with (nx + ny – 2) degrees of freedom

Chap 10-31

*

Test Statistic, σx


σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

2nn

1)s(n1)s(ns

yx

2yy

2xx2

p

σ 2 的合并估计量：

Chap 10-32

*

Test Statistic, σx


σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

2nn

1)s(n1)s(ns

yx

2yy

2xx2

p

Where t has (n1 + n2 – 2) d.f.,

and

yx

2p

yx

n1

n1

S

μμt

yx


μx – μy is:

Chap 10-33

*

Interval Estimation, σx


σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

2nn

1)s(n1)s(ns

yx

2yy

2xx2

p

Where t has (n1 + n2 – 2) d.f.,

and

Chap 10-34

*

σx2 and σy

2 Unknown,Assumed Unequal

σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

Assumptions: Samples are randomly and independently drawn

Populations are normally distributed

Population variances are unknown and assumed unequal

Chap 10-35

*

σx2 and σy

2 Unknown,Assumed Unequal

σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

Forming interval estimates:

The population variances are assumed unequal, so a pooled variance is not appropriate

use a t value with degrees of freedom, where

1)/(nn

s1)/(n

ns

)n

s()

ns

(

y

2

y

2y

x

2

x

2x

2

y

2y

x

2x

v

Chap 10-36

*

Test Statistic, σx

2 and σy2 Unknown, Unequal

σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

1)/(nn

s1)/(n

ns

)n

s()

ns

(

y

2

y

2y

x

2

x

2x

2

y

2y

x

2x

vWhere t has degrees of freedom:


μx – μy is:

Y

2y

X

2x

0

n

σ

nσ

D)yx(t

Chap 10-37

*

Interval Estimation, σx

2 and σy2 Unknown, Unequal

σx2 and σy

2 assumed equal

且小样本σx

2 and σy2 unknown

σx2 and σy

2 assumed unequal

1)/(nn

s1)/(n

ns

)n

s()

ns

(

y

2

y

2y

x

2

x

2x

2

y

2y

x

2x

vWhere t has degrees of freedom:

Y

2y

X

2x

0

n

σ

nσ

D)yx(t

Chap 10-38

Lower-tail test:

H0: μx – μy 0H1: μx – μy < 0

Upper-tail test:

H0: μx – μy ≤ 0H1: μx – μy > 0

Two-tail test:

H0: μx – μy = 0H1: μx – μy ≠ 0

Decision Rules

/2 /2

-t -t/2t t/2

Reject H0 if t < -tn-1, Reject H0 if t > tn-1, Reject H0 if t < -tn-1 ,

or t > tn-1 , Where t has n - 1 d.f.

Two Population Means, Independent Samples, Variances Unknown

Chap 10-39

Pooled Variance t Test: Example

You are a financial analyst for a brokerage firm. Is there a difference in dividend yield between stocks listed on the NYSE & NASDAQ? You collect the following data:

NYSE NASDAQNumber 21 25Sample mean 3.27 2.53Sample std dev 1.30 1.16

Assuming both populations are approximately normal with equal variances, isthere a difference in average yield ( = 0.05)?

Chap 10-40

Calculating the Test Statistic

1.5021

1)25(1)-(21

1.161251.30121

1)n()1(n

S1nS1nS

22

21

222

2112

p

2.040

251

211

5021.1

02.533.27

n1

n1

S

μμXXt

21

2p

2121

The test statistic is:

Chap 10-41

Solution

H0: μ1 - μ2 = 0 i.e. (μ1 = μ2)

H1: μ1 - μ2 ≠ 0 i.e. (μ1 ≠ μ2)

= 0.05

df = 21 + 25 - 2 = 44Critical Values: t = ± 2.0154

Test Statistic: Decision:

Conclusion:

Reject H0 at = 0.05

There is evidence of a difference in means.

t0 2.0154-2.0154

.025

Reject H0 Reject H0

.025

2.040

2.040

251

211

5021.1

2.533.27t

Chap 10-42

Two Sample Tests

Two Sample Tests

Population Means,

Independent Samples

Population Means,

Matched Pairs



Examples:



IntervalInterval



Chap 10-43

Matched Pairs

Tests Means of 2 Related Populations Paired or matched samples Repeated measures (before/after) Use difference between paired values:

Assumptions: Both Populations Are Normally Distributed

Matched Pairs

di = xi - yi

Chap 10-44

The test statistic for the mean difference is a t value, with n – 1 degrees of freedom:

n

sDd

td

0

Test Statistic: Matched Pairs

WhereD0 = hypothesized mean differencesd = sample standard dev. of differencesn = the sample size (number of pairs)

Matched Pairs

Chap 10-45

Lower-tail test:

H0: μx – μy 0H1: μx – μy < 0

Upper-tail test:

H0: μx – μy ≤ 0H1: μx – μy > 0

Two-tail test:

H0: μx – μy = 0H1: μx – μy ≠ 0

Paired Samples

Decision Rules: Matched Pairs

/2 /2

-t -t/2t t/2

Reject H0 if t < -tn-1, Reject H0 if t > tn-1, Reject H0 if t < -tn-1 ,

or t > tn-1 ,

Wheren

sDd

td

0

has n - 1 d.f.

Chap 10-46

Assume you send your salespeople to a “customer service” training workshop. Has the training made a difference in the number of complaints? You collect the following data:

Matched Pairs Example

Number of Complaints: (2) - (1)Salesperson Before (1) After (2) Difference, di

C.B. 6 4 - 2 T.F. 20 6 -14 M.H. 3 2 - 1 R.K. 0 0 0 M.O. 4 0 - 4 -21

d = di

n

5.67

1n

)d(dS

2i

d

= - 4.2

Chap 10-47

Has the training made a difference in the number of complaints (at the = 0.01 level)?

- 4.2d =

1.6655.67/

04.2

n/s

Ddt

d

0

H0: μx – μy = 0H1: μx – μy 0

Test Statistic:

Critical Value = ± 4.604 d.f. = n - 1 = 4

Reject

/2

- 4.604 4.604

Decision: Do not reject H0

(t stat is not in the reject region)

Conclusion: There is not a significant change in the number of complaints.

Matched Pairs: Solution

Reject

/2

- 1.66 = .01

Chap 10-48

Two Sample Tests

Two Sample Tests

Population Means,

Independent Samples

Population Means,

Matched Pairs



Examples:



IntervalInterval



Chap 10-49

Two Population Proportions

Goal: Test hypotheses for the difference between two population proportions, P1 – P2

Population proportions

Assumptions: Both sample sizes are large,

Chap 10-50

Two Population Proportions


Chap 10-51

Interval Estimation forTwo Population Proportions


Chap 10-52

Test Statistic forTwo Population Proportions



H0: P1 – P2 = 0 is a z value:

1 2

0 0 0 0

1 2

ˆ ˆp pz

ˆ ˆ ˆ ˆp (1 p ) p (1 p )

n n

1 1 2 20

1 2

ˆ ˆn p n pp̂

n n

Where

Chap 10-53

Decision Rules: Proportions


Lower-tail test:

H0: p1 – p2 0H1: p1 – p2 < 0

Upper-tail test:

H0: p1 – p2 ≤ 0H1: p1 – p2 > 0

Two-tail test:

H0: p1– p2 = 0H1: p1 – p2 ≠ 0

/2 /2

-z -z/2z z/2

Reject H0 if z < -z Reject H0 if z > z Reject H0 if z < -z or z > z

Chap 10-54

Example: Two Population Proportions

Is there a significant difference between the proportion of men and the proportion of women who will vote Yes on Proposition A?

In a random sample, 36 of 72 men and 31 of 50 women indicated they would vote Yes

Test at the .05 level of significance

Chap 10-55

The hypothesis test is:H0: PM – PW = 0 (the two proportions are equal)

H1: PM – PW ≠ 0 (there is a significant difference between

proportions) The sample proportions are:

Men: = 36/72 = .50

Women: = 31/50 = .62

.549122

67

5072

50(31/50)72(36/72)

nn

pnpnp

yx

yyxx0

ˆˆ

ˆ

The estimate for the common overall proportion is:


(continued)

Mp̂

Wp̂

Chap 10-56

The test statistic for PM – PW = 0 is:


(continued)

.025

-1.96 1.96

.025

-1.31

Decision: Do not reject H0

Conclusion: There is not significant evidence of a difference between men and women in proportions who will vote yes.

1.31

50.549)(1.549

72.549)(1.549

.62.50

n)p(1p

n)p(1p

ppz

2

00

1

00

WM

ˆˆˆˆ

ˆˆ

Reject H0 Reject H0

Critical Values = ±1.96For = .05

Chap 10-57

Chapter Summary

Compared two dependent samples (paired samples) Performed paired sample t test for the mean

difference Compared two independent samples

Performed z test for the differences in two means Performed pooled variance t test for the differences

in two means Compared two population proportions

Performed z-test for two population proportions

chapter 10 statistical inferences about means and proportions with two populations

Documents

y2 knownx2

population distributions

mean of population

y2 unknown x2

y2 knownassumptions

y2 knownchap

y2 unknown chap

normal population variances