complement: trapezoid - katy...

12
Homework Chapter 5 1. Use the graph below 2. Use the graph below c) Find the proportion of observations within each interval i 0.6 < x < 0.8 ii 0 < x < 0.4 iii 0 < x < 0.2 3) The sketch below contains three normal curves; think of them as approximating the distribution of exam scores for three different classes. One (call it A) has a mean of 70 and a standard deviation of 5; another (call it B) has a mean of 70 and a standard deviation of 10; the third (call it C) has a mean of 50 and a standard deviation of 10. Identify which is which by labeling each curve with its appropriate letter. a) Why is the total area under this curve equal to 1? b) What percent of the observations lie above 0.8? c) What percent of the observations lie between 0.25 and 0.75? d) What is the mean of this distribution? 0 0.2 0.4 0.6 0.8 3 2 1 a) Verify that the graph is a valid density curve. b) The median of this density curve is a point between X = 0.2 and X = 0.4. Explain why. Name: 1 Rectangle; A = LW A = 1(1) = 1 1 - .8 = .2; A = LW A = 1(.2) = .2; 20% .75 - .25 = .5; A = LW A = 1(.5) = .5; 50% = .5 All on or above horizontal axis; Rectangle + Triangle1(. 8) + 1 2 (1). 4) = 1 0 ≤ ≤ .2 = .35 . 4 ≤ ≤ .8 = .4; .5 ℎ .2 .4 . 8 − .6 = .2 1(. 2) = .2 Complement: 0.4 ≤ ≤ 0.8 = .4 1 − .4 = .6 Trapezoid: = 1 2 (. 2)(2 + 1.5) = .35 A B C

Upload: duonganh

Post on 15-Jun-2018

213 views

Category:

Documents


0 download

TRANSCRIPT

Homework Chapter 5

1. Use the graph below

2. Use the graph below

c) Find the proportion of observations within each interval

i 0.6 < x < 0.8 ii 0 < x < 0.4 iii 0 < x < 0.2

3) The sketch below contains three normal curves; think of them as approximating the distribution

of exam scores for three different classes. One (call it A) has a mean of 70 and a standard

deviation of 5; another (call it B) has a mean of 70 and a standard deviation of 10; the third

(call it C) has a mean of 50 and a standard deviation of 10. Identify which is which by labeling

each curve with its appropriate letter.

a) Why is the total area under this curve equal to 1?

b) What percent of the observations lie above 0.8?

c) What percent of the observations lie between

0.25 and 0.75?

d) What is the mean of this distribution?

0 0.2 0.4 0.6 0.8

3

2

1

a) Verify that the graph is a valid density curve.

b) The median of this density curve is a point

between X = 0.2 and X = 0.4. Explain why.

Name:

1 Rectangle; A = LW A = 1(1) = 1

1 - .8 = .2; A = LW A = 1(.2) = .2; 20%

.75 - .25 = .5; A = LW A = 1(.5) = .5; 50%

𝜇 = .5

All on or above horizontal axis; Rectangle + Triangle1(. 8) +1

2(1). 4) = 1

0 ≤ 𝑥 ≤ .2 = .35 𝑎𝑛𝑑. 4 ≤ 𝑥 ≤ .8 = .4; .5 ℎ𝑎𝑠 𝑡𝑜 𝑏𝑒 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 .2 𝑎𝑛𝑑 .4

. 8 − .6 = .2 1(. 2) = .2

Complement: 0.4 ≤ 𝑥 ≤ 0.8 = .4

1 − .4 = .6

Trapezoid:

𝐴 =1

2(. 2)(2 + 1.5) = .35

A

B C

4) For each of the following normal curves, identify (as accurately as you can from the graph) the

mean and standard deviation of the distribution.

5) Suppose the average height of women collegiate volleyball players is 5’9”, with a standard

deviation of 2.1”. Assume that heights among these players follow a mound-shaped distribution.

a) Draw the curve and label it.

b) According to the empirical rule, about 95% of women collegiate volleyball players have

heights between what two values?

c) What does the empirical rule say about the proportion of players who are between 62.7

inches and 75.3 inches?

d) Reasoning from the empirical rule, what is the tallest we would expect a woman collegiate

volleyball player to be?

e) About what percent of women collegiate volleyball players are taller than 71.1 inches?

f) About what percent of women collegiate volleyball players are shorter than 64.8 inches?

g) Find the percentiles for the following heights: 64.8

69

73.2

75.3

𝜇 = 50 𝜎 = 5

𝜇 = −10 𝜎 = 40

𝜇 = 1100 𝜎 = 300

𝜇 = 225 𝜎 = 75

62.7 64.8 66.9 69 71.1 73.2 75.3

64.8 to 73.2

99.7% of players are between 62.7 and 75.3 inches tall

75.3 inches would cover all but .15%

16%

2.5%

2.5th 50th 97.5th 99.85th

6) Given an approximately normal distribution, N(175, 37).

a. Draw a normal curve and label 1, 2, and 3 standard deviations on both sides of the mean.

b. What percent of values are within the interval (138, 212)?

c. What percent of values are within the interval (101, 249)?

d. What percent of values are within the interval (64, 286)?

e. What percent of values are outside the interval (138, 212)?

f. What percent of values are outside the interval (64, 286)?

7) The incubation time for Rhode Island Red chicks is normally distributed with a mean 21 days and

standard deviation approximately 1 day.

a. Draw a normal curve and label 1, 2, and 3 standard deviations on both sides of the mean.

If 1000 eggs are being incubated, how many chicks do we expect will hatch in each of the following

situations?

b. In 19 to 23 days c. In 18 days or more d. In 22 days or less

8) Use the table of standard normal probabilities to determine the proportion of the normal curve

that falls within:

a. one standard deviation of its mean (in other words, between z-scores of –1 and 1).

b. two standard deviations from the mean

c. three standard deviations from the mean.

d. Compare these values to the values obtained from the empirical rule.

18 19 20 21 22 23 24

64 101 138 175 212 249 286

68%

95%

99.7%

32%

0.3%

95% of 1000 = 950

99.85% of 1000 = 999

84% of 1000 = 840

.8413 - .1587 = .6826

They are all very close to the 68 – 95 – 99.7 proportions described by the Empirical Rule.

.9772 - .0228 = .9544

.9987 - .0013 = .9974

9) Use the table of standard normal probabilities to determine the proportion of observations

from a standard normal distribution that satisfies each of the following statements. For each,

sketch a standard normal curve and shade the area under the curve that is the answer.

a. z < 2.85 b. z > 2.85 c. z > -1.66 d. -1.66 < z < 2.85

10) Use the table of standard normal probabilities to find the value z of a standard normal variable

that satisfies each of the following statements. For each sketch a standard normal curve and

mark your value of z on the axis.

a. 25% of observations fall below it b. 40% of observations fall above it

c. proportion less than it is 0.8 d. 90% of observations are greater than it

11) What are the quartiles (all three) of a standard normal distribution?

12) The deciles of any distribution are the points that mark off the lowest 10% and the highest

10%. What are the deciles of the standard normal distribution?

.2500 .4000

.8000 .9000

𝑃(𝑧 < 2.85) = .9978 𝑃(𝑧 > 2.85) = 1 − .9978 = .0022

𝑃(𝑧 > −1.66) = 1 − .0485 = .9515

𝑃(−1.66 < 𝑧 < 2.85) = . 9978 − .0485 = .9493

𝑧 = −.67 𝑧 = .25

𝑧 = .84 𝑧 = −1.28

𝑄1: 𝑧 = −.67 𝑄2: 𝑧 = 0

𝑄3: 𝑧 = .67

𝑧 = −1.28 𝑎𝑛𝑑 𝑧 = 1.28

13) Which of the following are true statements?

a. The area under a normal curve is always equal to 1, no matter what the mean and

standard deviation are.

b. The smaller the standard deviation of a normal curve, the higher and narrower

the graph.

c. Normal curves with different means are centered around different numbers.

i. I and II

ii. I and III

iii. II and III

iv. I, II, and III

v. None of the above gives the complete set of true responses.

14) Which of the following are true statements?

a. The area under the standard normal curve between 0 and 2 is twice the area between 0

and 1.

b. The area under the standard normal curve between 0 and 2 is half the area between –2

and 2.

c. For the standard normal curve, the interquartile range is approximately 3.

i. I and II

ii. I and III

iii. II only

iv. I, II and III

v. None of the above gives the complete set of true responses.

15) Populations P1 and P2 are normally distributed and have identical means. However, the standard

deviation of P1 is twice the standard deviation of P2. What can be said about the percentage

of observations falling within two standard deviations of the mean for each population?

i. The percentage for P1 is twice the percentage for P2.

ii. The percentage for P1 is greater, but not twice as great, as the percentage for

P2.

iii. The percentage for P2 is twice the percentage for P1.

iv. The percentage for P2 is greater, but not twice as great, as the percentage for

P1.

v. The percentages are identical.

16) Suppose that a college admissions office needs to compare scores of students who take the

SAT with those who take the ACT. Suppose that among the college’s applicants who take the

SAT, scores have a mean of 896 and a standard deviation of 174. Further suppose that among

the college’s applicants who take the ACT, scores have a mean of 20.6 and a standard deviation

of 5.2.

a) If applicant Bobby scored 1080 on the SAT, how many points above the SAT mean did he

score?

b) If applicant Kathy scored 28 on the ACT, how many points above the ACT mean did she

score?

c) Is it sensible to conclude that since your answer to (a) is greater than your answer to b),

Bobby outperformed Kathy on the admissions test? Explain.

d) Determine how many standard deviations above the mean Bobby scored by dividing your

answer to (a) by the standard deviation of the SAT scores. Do the same for Kathy’s score.

e) Which applicant has the higher z-score for his or her admissions test score?

f) Explain in your own words which applicant performed better on his or her admissions test.

g) Calculate the z-score for applicant Peter, who scored 740 on the SAT, and for applicant

Kelly, who scored 19 on the ACT.

h) Which of Peter and Kelly has the higher z-score?

i) Under what conditions does a z-score turn out to be negative?

1080 – 896; 184 points above the SAT mean.

28 – 20.6; 7.4 points above the ACT mean.

No, they are different tests with different scoring scales.

Bobby: 𝑧 =184

174; 𝑧 = 1.06

Kathy: 𝑧 =7.4

5.2; 𝑧 = 1.42

Kathy had the higher z-score, 1.42 > 1.06

Kathy did better. Since she is more standard deviations above the mean she would be in a higher percentile than Bobby.

Peter: 𝑧 =740−896

174; 𝑧 = −.90

Kelly: 𝑧 =19−20.6

5.2; 𝑧 = −.31

Kelly has the higher z-score, -.31 > -.90:

A z-score will be negative when a value is below the mean.

17) Data from the National Vital Statistics Report reveal that the distribution of the duration of

human pregnancies (i.e., the number of days between conception and birth) is approximately

normal with mean = 270 and standard deviation = 15. Use this normal model to determine

the probability that a given pregnancy comes to term in:

a. less than 244 days (which is about 8 months).

b. more than 275 days (which is about 9 months).

c. over 300 days.

d. between 260 and 280 days.

e. Data from the National Vital Statistics Report reveal that of 3,880,894 births in

the US in 1997, the number of pregnancies that resulted in a preterm delivery,

defined as 36 or fewer weeks since conception, was 436,600. Compare this to the

prediction that would be obtained from the model.

18) Suppose that the IQ scores of students at a certain college follow a normal distribution with

mean 115 and standard deviation 12.

a. Use the normal model to determine the proportion of students with an IQ score below

100.

b. Find the proportion of these undergraduates having IQs greater than 130.

c. Find the proportion of these undergraduates having IQs between 110 and 130.

d. With his IQ of 75, what would the percentile of Forrest Gump’s IQ be?

f. Determine how high one’s IQ must be in order to be in the top 1% of all IQs at this

college.

.0418

.3707

.0228

.4972

446691 vs. 436600; pretty close

.1056

.1056

.5559

.04th percentile

142.92

19) Suppose that Professors Wells and Zeddes have final exam scores that are approximately

normally distributed with mean 75. The standard deviation of Wells’ scores is 10, and that

of Zeddes’ scores is 5.

a. With which professor is a score of 90 more impressive? Support your answer with

appropriate probability calculations and with a sketch.

b. With which professor is a score of 60 more discouraging? Again support your answer

with appropriate probability calculations and with a sketch.

20) Suppose that the wrapper of a certain candy bar lists its weight as 2.13 ounces. Naturally,

the weights of individual bars vary somewhat. Suppose that the weights of these candy

bars vary according to a normal distribution with mean = 2.2 ounces and standard deviation

= 0.04 ounces.

a. What proportion of candy bars weigh less than the advertised weight?

b. What proportion of candy bars weigh more than 2.25 ounces?

c. What proportion of candy bars weigh between 2.2 and 2.3 ounces?

d. If the manufacturer wants to adjust the production process so that only 1 candy bar

in 1000 weighs less than the advertised weight, what should the mean of the actual

weights be (assuming that the standard deviation of the weights remains 0.04

ounces)?

Zeddes; z-score is 3 which is farther above the mean than Wells where the z-score is .33

Zeddes; z-score is -3 which is farther below the mean than Wells where the z-score is -.33

.0401

.1056

.4938

𝜇 = 2.2532

21) A trucking firm determines that its fleet of trucks averages a mean of 12.4 miles per gallon

with a standard deviation of 1.2 miles per gallon on cross-country hauls. What is the

probability that one of the trucks averages fewer than 10 miles per gallon?

22) A factory dumps an average of 2.43 tons of pollutants into a river every week. If the

standard deviation is 0.88 tons, what is the probability that in a week more than 3 tons are

dumped?

23) An electronic product takes an average of 3.4 hours to move through an assembly line. If

the standard deviation is 0.5 hour, what is the probability that an item will take between 3

and 4 hours?

24) The mean score on a college placement exam is 500 with a standard deviation of 100.

Ninety-five percent of the test takers score above what?

25) The average noise level in a restaurant is 30 decibels with a standard deviation of 4

decibels. Ninety-nine percent of the time it is below what value?

26) The mean income per household in a certain state is $9500 with a standard deviation of

$1750. The middle 95% of incomes are between what two values?

27) Jay Olshansky from the University of Chicago was quoted in Chance News as arguing that

for the average life expectancy to reach 100, 18% of people would have to live to 120.

What standard deviation is he assuming for this statement to make sense (assuming life

expectancies are normally distributed)?

With a z-score of -2, the probability that the trucks will average fewer than 10 mpg is 0.0228.

With a z-score of 0.65, the probability that in a week more than 3 tons of pollutants are dumped is 0.2578.

With z-scores of -0.8 and 1.2, the probability that an item will take between 3 and 4 hours to move through an assembly line is 0.6730.

95% of the test takers score above 335.5.

99% of the time the average noise level in a restaurant is less than 39.32 decibels.

With a z-score of -2, the probability that the trucks will average fewer than 10 mpg is 0.0228.

Jay Olshansky is assuming a standard deviation of approximately 21.7.

28) Cucumbers grown on a certain farm have weights with a standard deviation of 2 ounces.

What is the mean weight if 85% of the cucumbers weigh less than 16 ounces?

29) A coffee machine can be adjusted to deliver any fixed number of ounces of coffee. If the

machine has a standard deviation in delivery equal to 0.4 ounce, what should be the mean

setting so that an 8-ounce cup will overflow only 0.5% of the time?

30) If 75% of all families spend more than $75 weekly for food, while 15% spend more than

$150, what is the mean weekly expenditure and what is the standard deviation?

31) Three landmarks of baseball achievement are Ty Cobb’s batting average of .420 in 1911,

Ted Williams’s .406 in 1941, and George Brett’s .390 in 1980. These batting averages

cannot be compared directly because the distribution of major league batting averages has

changed over the decades. The distributions are quite symmetric and reasonably normal.

While the mean batting average has been held roughly constant by rule changes and the

balance between hitting and pitching, the standard deviation has dropped over time. Here

are the facts

0317.261.1970

0326.267.1940

0371.266.1910

..

s

s

s

DevStdMeanDecade

Comparatively, who ranked highest amongst his peers? Justify your answer.

For an 8 oz cup to overflow 0.5% of the time, the mean setting should be 6.97 oz.

The mean weekly food expenditure is $93.54 with a standard deviation of $27.68.

Ted Williams batting average of .406 is 1941 would be ranked highest amongst his peers. Ted was 4.26 standard deviations above the mean compared to 4.15 for Ty Cobb and 4.06 for George Brett.

The mean weight is 13.92 ounces if 85% of cucumbers weigh less than 16 ounces.

32) The following graph and data output are from the 108 years of rainfall in Austin. Let the

calculated mean and standard deviation from the data represent µ and σ for the

calculations.

a) What proportion of rainfall was between 30 and 40 inches?

b) Comment on the accuracy of your calculations.

33) A person with too much time on his hands collected 1000 pennies that came into his

possession in 1999 and calculated the age (as of 1999) of each. The distribution has mean

12.264 years and standard deviation 9.613 years. Knowing these summary statistics but

without seeing the distribution, can you comment on whether the normal distribution is

likely to provide a reasonable model for these penny ages? Explain.

34) Use the following data set.

154 109 137 115 152 140 154 178 200

103 126 126 137 165 165 129 200 148

Draw a stemplot, boxplot and normal probability plot to assess normality.

Annual Rainfall in Austin

Rainfall in33.6518

in9.67722

S1 = Rainfallmean

S2 = RainfallstdDev

Since the z-score for 30 inches is -0.38 and the z-score for 40 inches is 0.66, the approximate proportion of rainfall between 30 and 40 inches is 0.3934.

The calculations should be fairly accurate. The histogram of the distribution of rainfall for Austin appears to be approximately Normal with no obvious outliers. Therefore, it is appropriate to use standard Normal probabilities.

Data appears to be fairly normally distributed

It is probably not normally distributed. 12.264 – 3(9.613) is well below 0 (a new penny). This distribution is most likely skewed to the right as there are more newer pennies in circulation.

35) Use the following data set:

32 31 29 10 30 33 22 25 32 20

30 20 24 24 31 30 15 32 23 23

Is this data set best modeled by using a normal distribution? Create and draw a histogram,

boxplot and normal probability plot to decide. Explain your results.

The histogram and box-plot both show that the data is skewed to the left. The normal probability plot has too much curvature to be considered normal.