normal distributions 2/27/12
DESCRIPTION
Normal Distributions 2/27/12. Normal Distribution Central Limit Theorem Normal distributions for confidence intervals Normal distributions for p-values Standard Normal Corresponding Sections: 5.1, 5.2. Exam 1 Grades. Bootstrap and Randomization Distributions. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/1.jpg)
Normal Distributions 2/27/12
• Normal Distribution• Central Limit Theorem• Normal distributions for confidence intervals• Normal distributions for p-values• Standard Normal
Corresponding Sections: 5.1, 5.2
![Page 2: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/2.jpg)
Exam 1 Grades
![Page 3: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/3.jpg)
slope (thousandths)-60 -40 -20 0 20 40 60
Measures from Scrambled RestaurantTips Dot Plot
r-0.6 -0.4 -0.2 0.0 0.2 0.4 0.6
Measures from Scrambled Collection 1 Dot Plot
Nullxbar98.2 98.3 98.4 98.5 98.6 98.7 98.8 98.9 99.0
Measures from Sample of BodyTemp50 Dot Plot
Diff-4 -3 -2 -1 0 1 2 3 4
Measures from Scrambled CaffeineTaps Dot Plot
xbar26 27 28 29 30 31 32
Measures from Sample of CommuteAtlanta Dot Plot
Slope :Restaurant tips
Correlation: Malevolent uniforms
Mean :Body Temperatures
Diff means: Finger taps
Mean : Atlanta commutes
phat0.3 0.4 0.5 0.6 0.7 0.8
Measures from Sample of Collection 1 Dot PlotProportion : Owners/dogs
What do you notice?
All bell-shaped distributions!
Bootstrap and Randomization Distributions
![Page 4: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/4.jpg)
• The symmetric, bell-shaped curve we have seen for almost all of our bootstrap and randomization distributions is called a normal distribution
Normal Distribution
Freq
uenc
y
-3 -2 -1 0 1 2 3
050
010
0015
00
![Page 5: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/5.jpg)
Central Limit Theorem!
For a sufficiently large sample size, the distribution of sample
statistics for a mean or a proportion is normal
http://onlinestatbook.com/stat_sim/sampling_dist/index.html
![Page 6: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/6.jpg)
Central Limit Theorem
• The central limit theorem holds for ANY original distribution, although “sufficiently large sample size” varies
• The more skewed the original distribution is (the farther from normal), the larger the sample size has to be for the CLT to work
![Page 7: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/7.jpg)
Central Limit Theorem
• For distributions of a quantitative variable that are not very skewed and without large outliers, n ≥ 30 is usually sufficient to use the CLT
• For distributions of a categorical variable, counts of at least 10 within each category is usually sufficient to use the CLT
![Page 8: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/8.jpg)
• The normal distribution is fully characterized by it’s mean and standard deviation
Normal Distribution
mean,standard deviationN
![Page 9: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/9.jpg)
Normal Distribution
0.523,0.048N
![Page 10: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/10.jpg)
Bootstrap DistributionsIf a bootstrap distribution is approximately normally distributed, we can write it as
a) N(parameter, sd)b) N(statistic, sd)c) N(parameter, se)d) N(statistic, se)sd = standard deviation of variablese = standard error = standard deviation of statistic
![Page 11: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/11.jpg)
Confidence Intervals
If the bootstrap distribution is normal:
To find a P% confidence interval , we just need to find the middle P% of the distribution
N(statistic, SE)
![Page 12: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/12.jpg)
Best Picture
What proportion of visitors to www.naplesnews.com thought The Artist should win best picture?
ˆ .15p
???SE
![Page 14: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/14.jpg)
Area under a Curve• The area under the curve of a normal distribution is equal to the proportion of the distribution falling within that range
• Knowing just the mean and standard deviation of a normal distribution allows you to calculate areas in the tails and percentiles
http://davidmlane.com/hyperstat/z_table.html
![Page 15: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/15.jpg)
Best Picturehttp://davidmlane.com/hyperstat/z_table.html
![Page 16: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/16.jpg)
Best Picture
![Page 17: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/17.jpg)
For a normal sampling distribution, we can also use the formula
to give a 95% confidence interval.
Confidence Intervals
sample statistic 2 SE
2 .03
0.096.15
,0 66
1
.2
![Page 18: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/18.jpg)
For normal bootstrap distributions, the formula
gives a 95% confidence interval.
How would you use the N(0,1) normal distribution to find the appropriate multiplier for other levels of confidence?
Confidence Intervals
sample statistic 2 SE
![Page 19: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/19.jpg)
For a P% confidence interval, use
where P% of a N(0,1) distribution is between –z* and z*
Confidence Intervals
*sample statistic z SE
![Page 20: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/20.jpg)
z*-z*
95%
Confidence Intervals
![Page 21: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/21.jpg)
Confidence Intervals
Find z* for a 99% confidence interval.
http://davidmlane.com/hyperstat/z_table.html
z* = 2.576
![Page 22: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/22.jpg)
News Sources“A new national survey shows that the majority (64%) of American adults use at least three different types of media every week to get news and information about their local community”
The standard error for this statistic is 1%
Find a 99% confidence interval for the true proportion.Source: http://pewresearch.org/databank/dailynumber/?NumberID=1331
![Page 23: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/23.jpg)
News Sources*sample statistic z SE
2.5760.64 0 0. 1
0 0.6 64 .02
0.614,0.666
![Page 24: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/24.jpg)
Confidence Interval Formula
*sample statistic z SE
From original data
From bootstrap
distribution
From N(0,1)
![Page 25: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/25.jpg)
First Born Children
• Are first born children actually smarter?
• Based on data from last semester’s class survey, we’ll test whether first born children score significantly higher on the SAT
• From a randomization distribution, we find SE = 37
first born not first born 30.26X X
![Page 26: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/26.jpg)
First Born Children
What normal distribution should we use to find the p-value?
a) N(30.26, 37)b) N(37, 30.26)c) N(0, 37)d) N(0, 30.26)
first born not first born 30.26, 37SX X E
![Page 27: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/27.jpg)
Hypothesis TestingDistribution of Statistic Assuming Null
Statistic
-3 -2 -1 0 1 2 3
Observed Statistic
Distribution of Statistic Assuming Null
Statistic
-3 -2 -1 0 1 2 3
Distribution of Statistic Assuming Null
Statistic
-3 -2 -1 0 1 2 3
Observed Statistic
p-value
![Page 28: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/28.jpg)
p-valuesIf the randomization distribution is normal:
To calculate a p-value, we just need to find the area in the appropriate tail(s) beyond the observed statistic of the distribution
N(null value, SE)
![Page 29: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/29.jpg)
First Born ChildrenN(0, 37)
http://davidmlane.com/hyperstat/z_table.html
p-value = 0.207
![Page 30: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/30.jpg)
First Born Children
![Page 31: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/31.jpg)
Standard Normal• Sometimes, it is easier to just use one normal distribution to do inference
• The standard normal distribution is the normal distribution with mean 0 and standard deviation 1
0,1N
Distribution of Statistic Assuming Null
Statistic
-3 -2 -1 0 1 2 3
![Page 32: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/32.jpg)
Standardized Test Statistic
• The standardized test statistic is the number of standard errors a statistic is from the null value
• The standardized test statistic (also called a z-statistic) is compared to N(0,1)
sample statistic null valueSE
z
![Page 33: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/33.jpg)
p-value
1) Find the standardized test statistic:
2) The p-value is the area in the tail(s) beyond z for a standard normal distribution
sample statistic null valueSE
z
![Page 34: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/34.jpg)
First Born Children
1) Find the standardized test statistic
sample statistic null valueSE
30.26 037
0.818
z
![Page 35: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/35.jpg)
First Born Children
2) Find the area in the tail(s) beyond z for a standard normal distribution
p-value = 0.207
![Page 36: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/36.jpg)
z-statistic
• Calculating the number of standard errors a statistic is from the null value allows us to assess extremity on a common scale
![Page 37: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/37.jpg)
Formula for p-values
From randomization
distribution
From H0
sample statistic null valueSE
z
From original data
Compare z to N(0,1) for p-value
![Page 38: Normal Distributions 2/27/12](https://reader035.vdocuments.mx/reader035/viewer/2022062302/56816764550346895ddc4263/html5/thumbnails/38.jpg)
Standard Error
• Wouldn’t it be nice if we could compute the standard error without doing thousands of simulations?
• We can!!!
• Or rather, we’ll be able to on Wednesday!