07_business statistics:sampling and sampling distributions
TRANSCRIPT
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
1/44
Chapter 8
Confidence Interval Estimation
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc.Chap 8-1
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
2/44
Learning ObjectivesIn this chapter, you will learn:
To construct and interpret confidence intervalestimates for the mean and the proportion
How to determine the sample size necessaryto develop a confidence interval for the meanor proportion
How to use confidence interval estimates inauditing
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-2
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
3/44
Chapter Outline1. Confidence Intervals for the Population
Mean,
when Population Standard Deviation is Known
when Population Standard Deviation is Unknown
2. Confidence Intervals for the PopulationProportion,
3. Determining the Required Sample Size
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-3
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
4/44
Point Estimates Apoint estimate is a single number. For thepopulation mean (and population standarddeviation), a point estimate is the sample mean(and sample standard deviation).
A confidence interval provides additionalinformation about variability
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-4
Point Estimate
Lower Confidence
Limit
Upper Confidence
Limit
Width ofconfidence interval
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
5/44
Confidence Interval EstimatesA confidence interval gives a range estimate of
values:
Takes into consideration variation in samplestatistics from sample to sample
Based on all the observations from 1 sample
Gives information about closeness to unknownpopulation parameters
Stated in terms of level of confidence
Ex. 95% confidence, 99% confidence
Can never be 100% confident
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-5
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
6/44
Confidence Interval Estimates The general formula for all
confidence intervals is:
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-6
Point Estimate (Critical Value) (Standard Error)
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
7/44
Confidence Level Confidence Level
Confidence in which the interval will containthe unknown population parameter
A percentage (less than 100%)
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-7
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
8/44
Confidence Level Suppose confidence level = 95%
Also written (1 - ) = .95
A relative frequency interpretation: In the long run, 95% of all the confidence
intervals that can be constructed will containthe unknown true parameter
A specific interval either will contain or willnot contain the true parameter
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-8
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
9/44
Confidence Interval for
( Known)Assumptions Population standard deviation is known
Population is normally distributed
If population is not normal, use large sampleConfidence interval estimate:
(where Z is the standardized normal distribution criticalvalue for a probability of/2 in each tail)
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-9
n
ZX
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
10/44
Finding the Critical Value, ZConsider a 95% confidence interval:
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-10
Z= -1.96 Z= 1.96
.951
.0252
.025
2
LowerConfidenceLimit
UpperConfidenceLimit
Z units:
X units: Point Estimate
0
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
11/44
Finding the Critical Value, ZCommonly used confidence levels are 90%, 95%, and99%
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-11
Confidence
Level
Confidence
Coefficient Z value
1.28
1.645
1.96
2.33
2.58
3.08
3.27
.80
.90
.95
.98
.99
.998
.999
80%
90%
95%
98%
99%
99.8%
99.9%
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
12/44
Intervals and Level of Confidence
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-12
x
Confidence Intervals
Intervalsextend from
to
(1-)x100%
of intervals
constructedcontain ;
()x100% do
not.
Sampling Distribution
of the Mean
n
ZX
n
ZX
x
x1
x2
/2 /21
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
13/44
Confidence Interval for
( Known) ExampleA sample of 11 circuits from a large normal
population has a mean resistance of 2.20
ohms. We know from past testing that thepopulation standard deviation is .35 ohms.
Determine a 95% confidence interval for the
true mean resistance of the population.
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-13
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
14/44
Confidence Interval for
( Known) Example
2.4068),(1.9932
.20682.20
)11(.35/1.962.20
n
ZX
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-14
We are 95% confident that the true mean resistance is between
1.9932 and 2.4068 ohms
Although the true mean may or may not be in this interval, 95%
of intervals formed in this manner will contain the true mean
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
15/44
Confidence Interval for
( Unknown) If the population standard deviation is
unknown, we can substitute the sample
standard deviation, S This introduces extra uncertainty, since S is
variable from sample to sample
So we use the t distribution instead of the
normal distribution
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-15
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
16/44
Confidence Interval for
( Unknown)Assumptions Population standard deviation is unknown
Population is normally distributed
If population is not normal, use large sample
Use Students t Distribution
Confidence Interval Estimate:
(where t is the critical value of the t distribution withn-1 d.f. and an area of/2 in each tail)
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-16
n
StX
1-n
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
17/44
Students t Distribution The t value depends on degrees of freedom (d.f.)
Number of observations that are free to vary aftersample mean has been calculated
d.f. = n - 1
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-17
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
18/44
Degrees of FreedomIdea: Number of observations that are free to varyafter sample mean has been calculated
Example: Suppose the mean of 3 numbers is 8.0
Let X1 = 7
Let X2 = 8
What is X3?
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-18
If the mean of these three
values is 8.0,
then X3 must be 9
(i.e., X3 is not free to vary)
Here, n = 3, so degrees of freedom = n1 = 31 = 2
(2 values can be any numbers, but the third is not free to vary for a
given mean)
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
19/44
Students t Distribution
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-19
t0
t (df= 5)
t (df= 13)t-distributions are bell-shapedand symmetric, but havefatter tails than the normal
Standard
Normal(t with df = )
Note: t Z as n increases
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
20/44
Students t Table
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-20
Upper Tail Area
df .25 .10 .05
1 1.000 3.078 6.314
2 0.817 1.886 2.920
3 0.765 1.638 2.353
t0 2.920
The body of the table contains t
values, not probabilities
Let: n = 3
df = n - 1 = 2
= .10/2 =.05
/2 = .05
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
21/44
Confidence Interval for
( Unknown) ExampleA random sample of n = 25 has X = 50 and S = 8. Form a
95% confidence interval for
d.f. = n 1 = 24, so
The confidence interval is
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-21
25
8(2.0639)50
n
S1-n/2, tX
(46.698 , 53.302)
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
22/44
Confidence Intervals for the
Population Proportion, An interval estimate for the population
proportion ( ) can be calculated by adding
an allowance for uncertainty to the sampleproportion ( p )
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-22
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
23/44
Confidence Intervals for the
Population Proportion, Recall that the distribution of the sample proportion isapproximately normal if the sample size is large, withstandard deviation
We will estimate this with sample data:
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-23
n
p)p(1
n)(1p
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
24/44
Confidence Intervals for the
Population Proportion, Upper and lower confidence limits for the populationproportion are calculated with the formula
where
Z is the standardized normal value for the level ofconfidence desired
p is the sample proportion
n is the sample sizeStatistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-24
np)p(1Zp
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
25/44
Confidence Intervals for the
Population Proportion, ExampleA random sample of 100 people shows that 25 haveopened IRAs this year. Form a 95% confidence intervalfor the true proportion of the population who have
opened IRAs.
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-25
00.25(.75)/196.125/100
p)/np(1p
Z
0.3349),(0.1651
(.0433)1.96.25
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
26/44
Confidence Intervals for the
Population Proportion, ExampleWe are 95% confident that the true
percentage of left-handers in the populationis between 16.51% and 33.49%. Although theinterval from .1651 to .3349 may or may notcontain the true proportion, 95% of intervalsformed from samples of size 100 in thismanner will contain the true proportion.
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-26
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
27/44
Determining Sample Size The required sample size can be found to reach a
desired margin of error (e) with a specified level ofconfidence (1 - )
The margin of error is also called sampling error
the amount of imprecision in the estimate of thepopulation parameter
the amount added and subtracted to the point estimateto form the confidence interval
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-27
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
28/44
Determining Sample Size To determine the required sample size for the
mean, you must know:
The desired level of confidence (1 - ), whichdetermines the critical Z value
The acceptable sampling error (margin oferror), e
The standard deviation,
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-28
n
Ze
2
22
e
Zn Now solve
for n to get
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
29/44
Determining Sample SizeIf = 45, what sample size is needed to estimatethe mean within 5 with 90% confidence?
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-29
219.195
(45)(1.645)
e
Zn
2
22
2
22
So the required sample size is n = 220
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
30/44
Determining Sample Size If unknown, can be estimated when using
the required sample size formula
Use a value for that is expected to be at leastas large as the true
Select a pilot sample and estimate with the
sample standard deviation, S
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-30
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
31/44
Determining Sample SizeTo determine the required sample size for the proportion,you must know:
The desired level of confidence (1 - ), which
determines the critical Z value The acceptable sampling error (margin of error), e
The true proportion of successes,
can be estimated with a pilot sample, if necessary (or
conservatively use = .50)
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-31
2
2 )1(
e
Zn
Now solve
for n to getn
Ze)1(
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
32/44
Determining Sample SizeHow large a sample would be necessary toestimate the true proportion defective in a largepopulation within 3%, with 95% confidence?
(Assume a pilot sample yields p = .12)
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-32
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
33/44
Determining Sample Size
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-33
Solution:
For 95% confidence, use Z = 1.96
e = .03
p = .12, so use this to estimate
So use n = 451
450.74(.03)
.12)(.12)(1(1.96))1(2
2
2
2
e
Z
n
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
34/44
Applications in Auditing Six advantages of statistical sampling in
auditing
Sample result is objective and defensible
Based on demonstrable statistical principles
Provides sample size estimation in advance onan objective basis
Provides an estimate of the sampling error
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-34
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
35/44
Applications in Auditing Can provide more accurate conclusions on the
population Examination of the population can be time consuming and
subject to more nonsampling error
Samples can be combined and evaluated by differentauditors Samples are based on scientific approach
Samples can be treated as if they have been done by a single
auditor Objective evaluation of the results is possible
Based on known sampling error
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-35
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
36/44
Population Total Amount Point estimate:
Confidence interval estimate:
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-36
XNtotalPopulation
1N
nN
n
S
)t(NXN 1n
(This is sampling without replacement, so use the finite population correction
in the confidence interval formula)
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
37/44
Population Total Amount A firm has a population of 1000 accounts and
wishes to estimate the total population value.
A sample of 80 accounts is selected with averagebalance of $87.6 and standard deviation of $22.3.
Find the 95% confidence interval estimate of thetotal balance.
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-37
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
38/44
Population Total Amount
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-38
The 95% confidence interval for the population total balance
is $82,837.52 to $92,362.48
4,762.4887,600
11000
801000
80
22.3905)(1000)(1.96)(1000)(87.
1NnN
nS)(tNXN 1n
22.3S87.6,X80,n1000,N
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
39/44
Confidence Interval for
Total Difference Point estimate:
Where the mean difference, , is:
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-39
DNDifferenceTotal
valueoriginal-valueD
D
i
1
auditedwhere
n
D
n
i
i
D
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
40/44
Confidence Interval for
Total DifferenceConfidence interval estimate:
where
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-40
1N
nN
n
S)t(NDN D1n
1n
)DD(
S
n
1i
2i
D
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
41/44
One Sided Confidence IntervalsApplication: find the upper bound for the proportion ofitems that do not conform with internal controls
where Z is the standardized normal value for the level of
confidence desired p is the sample proportion of items that do not conform n is the sample size N is the population size
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-41
1NnN
np)p(1ZpboundUpper
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
42/44
Ethical IssuesA confidence interval (reflecting sampling
error) should always be reported along with apoint estimate
The level of confidence should always bereported
The sample size should be reported
An interpretation of the confidence intervalestimate should also be provided
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-42
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
43/44
Chapter Summary Introduced the concept of confidence intervals
Discussed point estimates
Developed confidence interval estimates
Created confidence interval estimates for themean ( known)
Determined confidence interval estimates for the
mean ( unknown)
Created confidence interval estimates for theproportion
Statistics for Managers Using Microsoft Excel, 5e
2008 Pearson Prentice-Hall, Inc. Chap 8-43
In this chapter, we have
-
8/4/2019 07_Business Statistics:Sampling and Sampling Distributions
44/44
Chapter Summary Determined required sample size for mean and
proportion settings
Developed applications of confidence intervalestimation in auditing
Confidence interval estimation for populationtotal
Confidence interval estimation for total difference
in the population One sided confidence intervals
Addressed confidence interval estimation andethical issues
Statistics for Managers Using Microsoft Excel, 5e
In this chapter, we have