chapter 13: comparing two population parameters
DESCRIPTION
Chapter 13: Comparing Two Population Parameters. 13.1 – Comparing Two Means. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/1.jpg)
Chapter 13: Comparing Two Population Parameters
![Page 2: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/2.jpg)
13.1 – Comparing Two Means
![Page 3: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/3.jpg)
Comparative studies are more convincing than single-sample investigations, so one-sample inference is not as common as comparative (two-sample) inference. In a comparative study, we may want to compare two treatments, or we may want to compare two populations. In either case, the samples must be chosen randomly and independently in order to perform statistical inference.
How is this different than a matched pairs design?A matched pairs design is when you compare two similar things given the same treatment. This is when you are comparing two sets of samples given different treatments!
![Page 4: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/4.jpg)
Two-Sample inference:
Compare two treatments or two populations. The null hypothesis is that there is no difference between the two parameters.
1 2: or oH 1 2: 0oH
![Page 5: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/5.jpg)
Review:
How do you subtract two means?
1 2–
How do you subtract two standard deviations?
1 2+2 2
Add their variances and take their square roots!
![Page 6: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/6.jpg)
Two Sample Z:
is known
Two Sample T:
is not known
SRS SRS
Normality Normality•Population approx normal• n1 + n2 30 by CLT
•Population approx normal• n1 + n2 30 by CLT• n1 + n2 < 30 and data doesn’t have strong skewness
Independence IndependenceN 10nThe two samples are independent
N 10nThe two samples are independent
![Page 7: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/7.jpg)
Note!The t* statistic does not have an exact t-distribution.
The degrees of freedom are calculated differently.22 2
1 2
1 22 22 2
1 2
1 1 2 2
1 11 1
s sn n
dfs s
n n n n
Your calculator will do this for you!
![Page 8: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/8.jpg)
Two Sample Z:
Two Sample T:
Confidence Interval:
2 21 2
1 21 2
*x x Zn n
2 21 2
1 21 2
*dfs sx x tn n
estimate test statistic sd
![Page 9: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/9.jpg)
Two Sample Z:
Two Sample T:
Hypothesis Test:
1 2 1 22 21 2
1 2
x xZ
n n
test statistic = estimate – hypothesized valuestandard deviation of statistic
1 2 1 22 21 2
1 2
x xt
s sn n
![Page 10: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/10.jpg)
Two Sample Z: Two Sample T:STAT-TESTS- 2-SampZtest STAT-TESTS- 2-SampTtest
STAT-TESTS- 2-SampZInt STAT-TESTS- 2-SampTInt
Note: The only time you pool is when the standard deviations are the same. This almost never happens, so just don’t do it!
Calculator Tip!
![Page 11: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/11.jpg)
Example #1Patients with heart-attack symptoms arrive at an emergency room either by ambulance or self-transportation provided by themselves, family, or friends. When a patient arrives at the emergency room, the time of arrival is recorded. The time when the patient’s diagnostic treatment begins is also recorded. An administrator of a large hospital wanted to determine whether the mean wait time (time between arrival and diagnostic treatment) for patients with heart-attack symptoms differ according to the mode of transportation. A random sample of 150 patients with heart-attack symptoms who had reported to the emergency room was selected. For each patient, the mode of transportation and wait time were recorded. Summary statistics for each mode of transportation are shown in the table below.
![Page 12: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/12.jpg)
Mode of Transportation
Sample Size
Mean Wait Time (in minutes)
Standard Deviation of Wait Time
(in minutes)
Self 73 8.30 5.16
Ambulance 77 6.04 4.30
a. Use a 99% confidence interval to estimate the difference between the mean wait times for ambulance transported patients and self-transported patients at this emergency room.
![Page 13: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/13.jpg)
P: μS = mean wait time for diagnostic treatment if traveled by self-transportation
μA = mean wait time for diagnostic treatment if traveled by ambulance
μD = μA - μS = Difference in wait times
![Page 14: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/14.jpg)
A:
SRS (says so)
Normality73 + 77 ≥ 30
150 ≥ 30
By the CLT, ok to assume normality
Independence(More than 1500 people with heart-attack symptoms)
Self-transported patients shouldn’t influence the wait time in ambulance transported patients
N: Two-Sample t-interval
nA + nS 30
![Page 15: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/15.jpg)
I: 22
* SAA S df
A S
ssx x tn n
222
22 2211 1
1 1
SA
A S
S
A A S S
ssn n
dfss
n n n n
22 2
2 22 2
5.16 4.3073 77
1 5.16 1 4.373 1 73 77 1 77
0.365860.00185 0.0007587
140.3717611
![Page 16: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/16.jpg)
![Page 17: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/17.jpg)
22
100 * SAA S
A S
ssx x tn n
4.30 5.166.04 8.30 2.62677 73
2.26 2.626 0.3557
4.302, 0.218
2.26 0.93409
![Page 18: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/18.jpg)
Note: Using the calculator!
22
* SAA S df
A S
ssx x tn n
*140.37
4.30 5.166.04 8.3077 73
t
4.2910, 0.2291
![Page 19: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/19.jpg)
I am 99% confident the true mean difference of wait time of ambulance and self-transported patients is between –4.2910 and –0.2291 minutes
C:
b. Based only on this confidence interval, do you think the difference in the mean wait times is statistically significant? Justify your answer.
Since 0 is not in the confidence interval, we can say that the ambulance wait times are statistically significantly shorter than the wait times for self-transported patients at the 99% confidence level.
![Page 20: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/20.jpg)
Example #2: The following is a list of salary rates (per hour in dollars) for men and women with a high school diploma.
Women Men8 10.6 7.5 11.98.25 10.8 8.5 11.959 11 8.5 129.25 11.5 9.85 129.35 11.9 10.5 129.8 12.25 10.5 12.59.95 12.5 10.5 1310 12.5 10.9 13.710 12.95 10.95 13.7510 13.9 11 14.510.25 13.95 11 14.7510.5 14.45 11.65 1510.5 14.8 11.9 15.5
![Page 21: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/21.jpg)
If the two samples are independent and are taken randomly, is there significant evidence that the men make more money than the women? Assume that in past experience = 1.99 dollars for men and = 2.01 for women.
P: μM = mean dollars per hour for men with high school diploma
μW = mean dollars per hour for women with high school diploma
μD = μM - μW = Difference in dollars per hour
![Page 22: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/22.jpg)
H:
: or : 0o M W o M WH H
: or : 0A M W A M WH H
![Page 23: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/23.jpg)
A:
SRS (says so)
Normality26 + 26 ≥ 30
52 ≥ 30
By the CLT, ok to assume normality
Independence(More than 520 people with engineer degree)
Men’s salaries shouldn’t influence the salaries of women with high school diploma. Also, says independent
N: Two-Sample Z-Test
nM + nW 30
![Page 24: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/24.jpg)
T:
22
( ) ( )M W M W
WM
M W
x xZ
n n
2 2
(11.76153 11.075) (0)
1.99 2.0126 26
0.686530.5547
1.2376
![Page 25: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/25.jpg)
1.24
P(Z > 1.24) = 1 – P(Z < 1.24) =
O:
![Page 26: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/26.jpg)
![Page 27: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/27.jpg)
1.24
P(Z > 1.24) = 1 – P(Z < 1.24) = 1 – 0.8925 = 0.1075
![Page 28: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/28.jpg)
M:
____ p 0.1075 0.05
>
Accept the Null
![Page 29: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/29.jpg)
There is not enough evidence to say that men with a high school diploma make more money per hour than women.
S:
![Page 30: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/30.jpg)
13.2 – Comparing Two Proportions
![Page 31: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/31.jpg)
If we want to compare two populations or compare the responses to two treatments from independent samples, we look at a two-sample proportion:
21: ppHo 0: 21 ppHoor
![Page 32: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/32.jpg)
Conditions for Proportion Interval:
SRS
Normality
IndependenceN 10(n1 + n2)The two samples are independent
1 1ˆ 5n p
1 1ˆ1 5n p 2 2ˆ 5n p
2 2ˆ1 5n p
![Page 33: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/33.jpg)
2
22
1
1121
)ˆ1(ˆ)ˆ1(ˆ*ˆˆ
npp
nppzpp
Confidence Interval:
estimate test statistic sd
![Page 34: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/34.jpg)
Conditions for Proportion Test:
SRS
Normality
IndependenceN 10(n1 + n2)The two samples are independent
1 ˆ 5Cn p
1 ˆ1 5Cn p 2 ˆ 5Cn p
2 ˆ1 5Cn p
count of success in both samplesˆcount of individuals from both samplesCp 1 2
1 2
x xn n
![Page 35: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/35.jpg)
Hypothesis Test:
test statistic = estimate – hypothesized valuestandard deviation of statistic
1 2
1 2
ˆ ˆ
1 1ˆ ˆ(1 )c c
p pz
p pn n
![Page 36: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/36.jpg)
Confidence Interval: Hypothesis Test
STAT-TESTS- 2-PropZInt STAT-TESTS- 2-PropZTest
Note: The only time you pool is when the standard deviations are the same. This almost never happens, so just don’t do it!
Calculator Tip!
![Page 37: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/37.jpg)
Example #1An election is bitterly contested between two rivals. In a poll of 750 potential voters taken 4 weeks before the election, 420 indicated a preference for candidate Grumpy over candidate Dopey. Two weeks later, a new poll of 900 randomly selected potential voters found 465 who plan to vote for Grumpy. Dopey immediately began advertising that support for Grumpy was slipping drastically and that he was going to win the election. Statistically speaking (at the 0.05 level), how happy should Dopey be?
P: p1 = true proportion of people who want Grumpy to win in 1st poll
p2 = true proportion of people who want Grumpy to win in 1st poll
pD = p1 - p2 = Difference in proportion of people in 1st poll and second
![Page 38: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/38.jpg)
21: ppHo 0: 21 ppHoor
1 2:AH p p 1 2: 0AH p p or
H:
![Page 39: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/39.jpg)
SRS
Normality
1 ˆ 5Cn p
1 ˆ1 5Cn p
2 ˆ 5Cn p
2 ˆ1 5Cn p
count of success in both samplesˆcount of individuals from both samplesCp 1 2
1 2
x xn n
(Says in second one only. Must assume the first)
885 0.5361650
402.27 5(750)(0.536) 5
(750) 1 0.536 5 347.73 5
(900)(0.536) 5482.73 5
(900) 1 0.536 5
417.27 5
![Page 40: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/40.jpg)
Independence
Safe to assume there were more than 10(750+900), or 16,500 voters
The first poll might have influenced the second poll, proceed with caution!
N: 2-PropZTest
![Page 41: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/41.jpg)
T:
1 2
1 2
ˆ ˆ
1 1ˆ ˆ(1 )c c
p pz
p pn n
0.56 0.5167
1 10.536(1 0.536)750 900
0.04330.02466
1.7576
![Page 42: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/42.jpg)
1.75
P(Z > 1.75) = 1 – P(Z < 1.75) =
O:
![Page 43: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/43.jpg)
![Page 44: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/44.jpg)
1.24
P(Z > 1.24) = 1 – P(Z < 1.24) = 1 – 0.9599 = 0.0401
Or, by calculator:
P(Z > 1.24) = 0.03941
![Page 45: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/45.jpg)
M:
____ p 0.03941 0.05
<
Reject the Null
![Page 46: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/46.jpg)
There is enough evidence to say that the proportion of voters that support Grumpy has dropped from the 1st poll to the second.
S:
Dopey should be very happy!
![Page 47: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/47.jpg)
Example #2Two groups of 40 randomly selected students were selected to be part of a study on drop-out rates. One group was enrolled in a counseling program designed to give them skills needed to succeed in school and the other group received no special counseling. Fifteen of the students who received counseling dropped out of school, and 23 of the students who did not receive counseling dropped out. Construct a 90% confidence interval for the true difference between the drop-out rates of the two groups. P: pC = true proportion of students who drop out with
counseling
pN = true proportion of students who drop out without any counseling
pD = pC - pD = Difference in proportion of students who drop out with counseling vs. without
![Page 48: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/48.jpg)
SRS
Normality
1 1ˆ 5n p
1 1ˆ1 5n p
2 2ˆ 5n p
2 2ˆ1 5n p
A: (says in both groups)
(40)(0.375) 5
15 5
40 1 0.375 5
25 5
40 0.575 5
40 1 0.575 5
23 5
17 5
![Page 49: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/49.jpg)
Independence
Safe to assume there were more than 10(40+40), or 800 students
The drop out rate of the group with counseling might influence the group without counseling. Proceed with caution!
N: 2-PropZInt
![Page 50: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/50.jpg)
I:
2
22
1
1121
)ˆ1(ˆ)ˆ1(ˆ*ˆˆ
npp
nppzpp
0.375(1 0.375) 0.575(1 0.575)0.375 0.575 1.64540 40
0.2 1.645 0.1094
0.3799, 0.0201
![Page 51: Chapter 13: Comparing Two Population Parameters](https://reader036.vdocuments.mx/reader036/viewer/2022062400/56813554550346895d9cb6da/html5/thumbnails/51.jpg)
I am 90% confident the true difference in the proportion of dropouts with counseling vs. without counseling is between –0.3799 and –0.0201.
C:
It appears that drop out rates are lower with the group that got counseling than without it.