6.2: the normal distribution - shippensburg...

6.2: The normal distribution

Normal Distributions q  Characterized by symmetric, bell-shaped (mound-shaped) curve.

q  Heights, weights, standardized test scores

q  A particular normal distribution is determined by q  The mean q  The standard deviation

µ�

Normal Distribution and Deviation from the Mean

Example: Adult Heights

q  95% of female adult heights are between 58 and 72 inches

q  95% of male adult heights are between 62 and 78 inches

Z-scores (revisited)

q  The multiples 1, 2, and 3 or the number of standard deviations from the mean are denoted by z.

q  For a particular observation, x, its z-score is computed by

q  For each fixed number z, the probability within z standard

deviations of the mean is the area under the normal curve between and

z =x� µ

�

µ� z� µ + z�

Finding Probabilities for the Normal Distribution q  What if we want the probability within 1.43 standard

deviations of the mean? q  For normal distributions there is a table we can use (Table A

in back of the book). It tabulates the normal cumulative probability falling below the point

µ + z�

Finding Probabilities for the Normal Distribution: P(-1.43< z < 1.43) To find P(-1.43 < z < 1.43) we do it in 3 steps:

1)  We first find P(z < 1.43), using Table A or calculator

2)  We then also know P(z > 1.43), which by symmetry means we know P(z < -1.43).

3)  P(1.43 < z < 1.43) = P(z < 1.43) - P(z < -1.43)

To use Table A: �  Find the corresponding z-score.

�  Look up the closest standardized score (z)

in the table. ◦  First column gives z to the first decimal place. ◦  First row gives the second decimal place of z.

�  The corresponding probability found in the

body of the table gives the probability of falling below the z-score.

Part of Table A

The Probability Less Than 1.43 Standard Deviations

P(height < 70)=P(z<1.43) = 0.9236

Example 1: Mensa

Mensa is a society of high-IQ people with IQ test scores at the 98th percentile or higher. The Stanford-Binet IQ test scores that are used for admission are approximately normally distributed with a mean of 100 and a standard deviation of 16.

q  How many standard deviations above the mean is the 98th percentile?

q  What is the IQ score for that percentile?

Example 1: Mensa Solution

Example 1: Mensa Solution

q  98th percentile corresponds to a z-score of 2.05

q  So a person needs an IQ score of at least 100 + 2.05(16) = 133.

Example 2: SAT and ACT scores

q  SAT and ACT exams are the two primary college entrance exams. Both have a mathematics component.

q  The scores for the SAT range from 200 to 800 and are normally distributed with a mean of 500 and a standard deviation of 100.

q  The scores for the ACT range from 1 to 36 and are normally distributed with a mean of 21 and a standard deviation of 4.7.

u  Which is better, a 650 on the SAT or a 30 on the ACT? We will answer by looking at percentiles. Begin by finding z-scores!

Example 2: SAT and ACT Solution

q  650 on the SAT (mean is 500, std. dev. is 100)


q  650 on the SAT (mean is 500, std. dev. is 100) q  z-score is (650-500)/100 = 1.50. From Table A, this is in the

93rd percentile. In other words, 7% of people scored above 650.




q  30 on the ACT (mean is 21, std. dev. is 4.7) q  z-score is (30-21)/4.7 = 1.91. From Table A, this is in the 97th

percentile. In other words, 3% of people scored above 30.




q  30 on the ACT (mean is 21, std. dev. is 4.7) q  z-score is (30-21)/4.7 = 1.91. From Table A, this is in the 97th

percentile. In other words, 3% of people scored above 30.

q  Thus, a 30 on the ACT is better than a 650 on the SAT

Finding Probabilities on TI-83/84

q  Normalcdf(low,high,mean,std. dev.) q  For calculating P(a < X < b) when X has a normal distribution of

mean mu and standard deviation sigma. q  On Calculator: “2nd” “DISTR” “2” “a,b,mu,sigma) ENTER”

Normalcdf

q  Invnorm(% to left,mean,std. dev.) q  For finding value a so that P(X ≦ a) = p, when X has normal

distribution of mean mu and standard deviation sigma. q  On Calculator: “2nd” “DISTR” “3” “p,mu,sigma) ENTER”

Invnorm

Finding Probabilities on TI-83/84 q  Normalcdf(low,high,mean,std. dev.)

q  What percent of women are between 65 and 70 inches?

q  On Calculator: “2nd” “DISTR” “2” “65,70,65,3.5) ENTER”

q  Invnorm(% to left,mean,std. dev.) q  How tall does a woman need to be to be in the top 10%?

q  On Calculator: “2nd” “DISTR” “3” “.9,65,3.5) ENTER”

Finding Probabilities on TI-83/84 q  Normalcdf(low,high,mean,std. dev.)

q  What percent of women are between 65 and 70 inches?

q  Answer: 42.34%


q  Answer: 69.5 inches, or 5’9.5”

Building an Interval that Contains a Certain Percentage of the Data q  Suppose we have a normal distribution. q  We want the interval that contains 95% of the data (in

terms of z values, i.e., between –z* and z*). The Emperical Rule told us “about 2 standard deviations” but we want to be more precise.

q  This means that 5% of the data must not be between, and of this amount 2.5% will be to the left of –z*. Since 2.5% is 0.0250, we look in Table A for a z-score with an entry of 0.0250. This gives us -1.96.

q  We conclude that 95% of the data lies between -1.96 and 1.96.

Building an Interval that Contains a Certain Percentage of the Data (cont.) q  For normal distributions, 95% of the data has z-score

between -1.96 and 1.96.

q  Recall that female adult heights are normally distributed with a mean of 65 inches and a standard deviation of 3.5 inches.

q  We can convert the z-scores into heights. q  We conclude that 95% of adult women have a height

between 58.14 inches and 71.86 inches.

“Unusual” Observations

Adult male heights are normally distributed with a mean of 70 inches and a standard deviation of 4 inches. q  Consider these two

q  Sam is 79 inches tall (z-score is 2.25; corresponds to 0.9878 in Table A)

q  Joe is 61 inches tall (z-score is -2.25; corresponds to 0.0122 in Table A)

q  For a given person, we can think of “unusual” in two ways q  Sam is unusually tall, he is in the rarest 1.22% of tall people. q  Joe is unusually short, he is in the rarest 1.22% of short

people. q  Both have unusual height, they are in the rarest 2.44%

P-Values

q  The P-value is a measure of just how unusual the data is, in terms of what percentage of the data is even more unusual than the given data.

q  Recall that q  Sam is unusually tall, he is in the rarest 1.22% of tall people. q  Joe is unusually short, he is in the rarest 1.22% of short

people. q  Both have unusual height, they are in the rarest 2.44%

q  This can be restated as q  Sam’s one-tail (right-tail) P-value is 0 .0122 q  Joe’s one-tail (left-tail) P-value is 0.0122 q  Both have a two-tail P-value of 0.0244

Graphical Depiction of P-Values

Other Types of Distributions We will also work with other distributions. Some will not be symmetric. For a distribution like this, we are only interested in one-tail (right-tail) P-values.

q  Commuting time of 45 minutes has a P-value of 0.15.

Finding Probabilities on TI-83/84

q  Normalcdf(low,high) q  What is the percentage of data between 1 and 1.75 standard

deviations? Normalcdf(1,1.75)

q  Normalcdf(low,high,mean,std. dev.) q  What is the percentage of women between 62 and 70 inches?

Normalcdf(62,70,65,3.5)

q  Invnorm(% to left) q  What is the z-score for data in the top 10%? Invnorm(0.9)


Invnorm(0.9,65,3.5)

6.2: the normal distribution - shippensburg...

Documents