week1-2 chap 3 descri data
TRANSCRIPT
-
8/8/2019 Week1-2 Chap 3 Descri Data
1/44
Describing Data: Numerical MeasuresDescribing Data: Numerical Measures
GOALS
When you have completed this chapter, you will be able to:
ONE
Calculate the arithmetic mean, median, mode, weighted
mean, and the geometric mean.
TWO
Explain the characteristics, uses, advantages, and
disadvantages of each measure of location.THREE
Identify the position of the arithmetic mean, median,
and mode for both a symmetrical and a skewed
distribution.
Goals
3- 1
-
8/8/2019 Week1-2 Chap 3 Descri Data
2/44
FOUR
Compute and interpret the range, the mean deviation, the
variance, and the standard deviation of ungrouped data.
Describing Data: Numerical MeasuresDescribing Data: Numerical Measures
FIVE
Explain the characteristics, uses, advantages, and
disadvantages of each measure of dispersion.
SIX
Understand Chebyshevs theorem and the Empirical Rule as
they relate to a set of observations.
Goals
Chapter Three3- 2
-
8/8/2019 Week1-2 Chap 3 Descri Data
3/44
Characteristics of the Mean
It is calculated bysumming the values
and dividing by thenumber of values.
It requires the interval scale.
All values are used.It is unique.
The sum of the deviations from the mean is 0.
The Arithmetic MeanArithmetic Meanis the most widely used
measure of location andshows the central value of the
data.
The major characteristics of the mean are: A era eJoe
3- 3
-
8/8/2019 Week1-2 Chap 3 Descri Data
4/44
Population Mean
N
X!Q
where
is the population mean
Nis the total number of observations.
Xis a particular value.
7 indicates the operation of adding.
For ungrouped data, the
Population MeanPopulation Mean is thesum of all the populationvalues divided by the total
number of population
values:
3- 4
-
8/8/2019 Week1-2 Chap 3 Descri Data
5/44
Example 1
500484
0007300056!
!!
N
XQ
Find the mean mileage for the cars.
A ParameterParameter is a measurable characteristic of apopulation.
The Kiers
family owns
four cars. The
following is thecurrent mileage
on each of the
four cars.
56,000
23,000
42,000
73,000
3- 5
-
8/8/2019 Week1-2 Chap 3 Descri Data
6/44
Sample Mean
nXX 7!
where n is the total number ofvalues in the sample.
For ungrouped data, the sample mean is
the sum of all the sample values dividedby the number of sample values:
3- 6
-
8/8/2019 Week1-2 Chap 3 Descri Data
7/44
Example 2
.1.1....1
!!
!7
!
n
XX
A statisticstatistic is a measurable characteristic of a sample.
A sample offive
executives
received the
following
bonus last
year ($ ):
1 .0,1 .0,
15.0,15.0,
17.0,17.0,16.0,16.0,
15.015.0
3- 7
-
8/8/2019 Week1-2 Chap 3 Descri Data
8/44
Properties of the Arithmetic Mean
Every set of interval-level and ratio-level data has amean.
All the values are included in computing the mean.
A set of data has a unique mean.
The mean is affected by unusually large or small
data values.
The arithmetic mean is the only measure of locationwhere the sum of the deviations of each value from
the mean is zero.
PropertiesProperties of the Arithmetic Mean3- 8
-
8/8/2019 Week1-2 Chap 3 Descri Data
9/44
Example
? A!!7 XX
Consi er t e set of al es: , , and .
T e meanmean is . Ill strating t e fiftropert
3- 9
-
8/8/2019 Week1-2 Chap 3 Descri Data
10/44
Weighted Mean
n
nnw
www
www
!
The Weighted MeanWeighted Mean of a set of
numbers X1, X2, ..., Xn, withcorresponding weights w1, w2,
...,wn, is computed from the
following formula:
3- 10
-
8/8/2019 Week1-2 Chap 3 Descri Data
11/44
Example
9.$.$
111)1.1($1)9.($1).($1).($
!!
!wX
During a one hour period on a
hot Saturday afternoon cabanaboy Chris served fifty drinks.
He sold five drinks for $0.50,
fifteen for $0.75, fifteen for
$0.90, and fifteen for $1.10.Compute the weighted mean of
the price of the drinks.
3- 11
-
8/8/2019 Week1-2 Chap 3 Descri Data
12/44
The Median
There are as manyvalues above themedian as below it inthe data array.
For an even set of values, the median will be the
arithmetic average of the two middle numbers and isfound at the (n+1)/2 ranked observation.
The MedianMedian is themidpoint of the values afterthey have been ordered from
the smallest to the largest.
3- 12
-
8/8/2019 Week1-2 Chap 3 Descri Data
13/44
The ages for a sample of five college students are:
1, 5, 19, 0, .
Arranging the data
in ascending ordergives:
19, 0, 1, , 5.
Thus the median is1.
The median (continued)
3- 13
-
8/8/2019 Week1-2 Chap 3 Descri Data
14/44
Example
Arranging the data in
ascending order gives:
7 , 75, 76, 80
Thus the median is 75.5.
The heights of four basketball players, in inches,
are: 76, 7 , 80, 75.
The median is found
at the (n+1)/ =( +1)/ = .5th data
point.
3- 14
-
8/8/2019 Week1-2 Chap 3 Descri Data
15/44
Properties of the Median
There is a unique median for each data set.
It is not affected by extremely large or smallvalues and is therefore a valuable measure of
location when such values occur.
It can be computed for ratio-level, interval-level, and ordinal-level data.
It can be computed for an open-endedfrequency distribution if the median does notlie in an open-ended class.
Properties of the Median
3- 15
-
8/8/2019 Week1-2 Chap 3 Descri Data
16/44
The Mode: Example
Example 6Example 6:: The exam scores for ten students are:1, 9 , , , , , 1, , 1, . Because the score
of 1 occurs the most often, it is the mode.
Data can have more than one mode. If it has twomodes, it is referred to as bimodal, three modes,
trimodal, and the like.
The ModeMode is another measure of location and
represents the value of the observation that appearsmost frequently.
3- 16
-
8/8/2019 Week1-2 Chap 3 Descri Data
17/44
Symmetric distributionSymmetric distribution: A distribution having thesame shape on either side of the center
Skewed distributionSkewed distribution: One whose shapes on eitherside of the center differ; a nonsymmetrical distribution.
Can be positively or negatively skewed, or bimodal
The Relative Positions of the Mean, Median, and Mode
3- 17
-
8/8/2019 Week1-2 Chap 3 Descri Data
18/44
The Relative Positions of the Mean, Median, and Mode:
Symmetric Distribution
Zero skewness Mean
=Median
=Mode
Mode
Median
Mean
3- 18
-
8/8/2019 Week1-2 Chap 3 Descri Data
19/44
The Relative Positions of the Mean, Median, and Mode:
Right Skewed Distribution
Positively skewed: Mean and median are to the right of the
mode.
Mean>Median>Mode
ode
edia
ea
3- 19
-
8/8/2019 Week1-2 Chap 3 Descri Data
20/44
Negatively Skewed: Mean and Median are to the left of the Mode.
Mean
-
8/8/2019 Week1-2 Chap 3 Descri Data
21/44
Geometric Mean
GM X X X X nn! ( )( )( )...( )1 2 3
The geometric mean is used to
average percents, indexes, and
relatives.
The Geometric MeanGeometric Mean
(GM) of a set of n numbersis defined as the nth root
of the product of the n
numbers. The formula is:
3- 21
-
8/8/2019 Week1-2 Chap 3 Descri Data
22/44
Example
The interest rate on three bonds were 5, 1, and
percent.
The arithmetic mean is (5+ 1+ )/ =10.0.
The geometric mean is
!!GM
The GMgives a more conservative
profit figure because it is notheavily weighted by the rate of
21percent.
3- 22
-
8/8/2019 Week1-2 Chap 3 Descri Data
23/44
Geometric Mean continued
eriodoe inninatVa ue
eriodoendatVa ue nGM
Another use of the
geometric mean is to
determine the percent
increase in sales,
production or other
business or economicseries from one time
period to another.
Grow th in Sales 1999-2004
0
10
20
30
40
50
1999 2000 2001 2002 2003 2004
Year
SalesinMillions($)
3- 23
-
8/8/2019 Week1-2 Chap 3 Descri Data
24/44
Example
!!GM
The total number of females enrolled in American
colleges increased from 755,000 in 199 to 8 5,000 in000. That is, the geometric mean rate of increase is
1. 7%.
3- 24
-
8/8/2019 Week1-2 Chap 3 Descri Data
25/44
DispersionDispersion
refers to thespread or
variability in
the data.
Measures of dispersion include the following: range,range,
mean deviation, variance, and standardmean deviation, variance, and standard
deviationdeviation.
RangeRange = Largest value Smallest value
Measures of Dispersion
0
5
10
15
20
25
30
0 2 4 6 8 10 12
3- 25
-
8/8/2019 Week1-2 Chap 3 Descri Data
26/44
The following represents the current years Return on
Equity of the 2 companies in an investors portfolio.
-8.1 3.2 5.9 8.1 12.3
-5.1 4.1 6.3 9.2 13.3
-3.1 4.6 7.9 9.5 14.0
-1.4 4.8 7.9 9.7 15.0
1.2 5.7 8.0 10.3 22.1
Example 9
Highest value: 22.1 Lowest value: - .1
Range = Highest value lowest value
= 22.1-(- .1)
= .2
3- 26
-
8/8/2019 Week1-2 Chap 3 Descri Data
27/44
MeanMean
DeviationDeviationThe arithmetic
mean of the
absolute values
of thedeviations from
the arithmetic
mean.
The main features of the
mean deviation are: All values are used in the
calculation.
It is not unduly influenced bylarge or small values.
The absolute values are
difficult to manipulate.
Mean Deviation
MD =7 X - X
n
3- 27
-
8/8/2019 Week1-2 Chap 3 Descri Data
28/44
The weights of a sample of crates containing booksfor the bookstore (in pounds ) are:
1 , 9 , 1 1, 1 , 1Find the mean deviation.
X = 10
The mean deviation is:
.11
11...11
!
!
!
7
! n
XX
MD
Example 1
3- 28
-
8/8/2019 Week1-2 Chap 3 Descri Data
29/44
VarianceVariance:: the
arithmetic meanof the squared
deviations from
the mean.
Standard deviationStandard deviation: The squareroot of the variance.
ariance and standard Deviation
3- 29
-
8/8/2019 Week1-2 Chap 3 Descri Data
30/44
Not influenced by extreme values.
The units are awkward, the square of theoriginal units.
All values are used in the calculation.
The major characteristics of the
Population VariancePopulation Variance are:
Population ariance
3- 30
-
8/8/2019 Week1-2 Chap 3 Descri Data
31/44
Population VariancePopulation Variance formula:
7 (X - Q)2
NW =
X isthevalueofanobservationinthepopulation
m isthearithmeticmeanofthepopulation
N isthenumberofobservationsinthepopulation
W!
Population Standard DeviationPopulation Standard Deviation formula:
WVariance and standard deviation
3- 31
-
8/8/2019 Week1-2 Chap 3 Descri Data
32/44
(-8. -6.62)2 (-5. -6.62)2 ... (22. -6.62)2
25W!
W
W!
2.227
=6. 8
In Example 9, the variance and standard deviation are:
7(X - Q)2
N
=
Example9continued
3- 32
-
8/8/2019 Week1-2 Chap 3 Descri Data
33/44
Sample variance (sSample variance (s ))
s2 =7( - )2
n-1
Sample standard deviation (s)Sample standard deviation (s)
2ss !
Sample variance and standard deviation
3- 33
-
8/8/2019 Week1-2 Chap 3 Descri Data
34/44
40.5
3!!
7!
n
XX
30.5
15
2.21
15
4.76...4.77
1
222
2
!
!
!
7!
n
XXs
Example 11
The hourly wages earned by a sample of five students are:
$7, $ , $11, $ , $ .
Find the sample variance and standard deviation.
.. !!! ss
3- 34
-
8/8/2019 Week1-2 Chap 3 Descri Data
35/44
Chebyshevs theoremChebyshevs theorem:: For any set of
observations, the minimum proportion of the valuesthat lie within kstandard deviations of the mean is at
least:
where kis any constant greater than 1.
211k
Chebyshevs theorem
3- 35
-
8/8/2019 Week1-2 Chap 3 Descri Data
36/44
Empirical RuleEmpirical Rule: For any symmetrical, bell-shaped
distribution:
About % of the observations will lie within 1s the
mean
About 9 % of the observations will lie within 2sof
the mean
Virtually all the observations will be within s of the
mean
Interpretation and Uses of the
Standard Deviation
3- 36
-
8/8/2019 Week1-2 Chap 3 Descri Data
37/44
Bell -Shaped Curve showing the relationship between and .W Q
QW QW QW Q QW QW QW
68%
9 %
99.7%
Interpretation and Uses ofthe Standard Deviation
3- 37
-
8/8/2019 Week1-2 Chap 3 Descri Data
38/44
The Mean of Grouped Data
n
XfX
7!
The MeanMean of a sample of data
organized in a frequencydistribution is computed by the
following formula:
3- 38
-
8/8/2019 Week1-2 Chap 3 Descri Data
39/44
Example 12
A sample of ten
movie theaters
in a large
metropolitan
area tallied the
total number ofmovies showing
last week.
Compute the
mean number ofmovies
showing.
Movies
showing
frequency
f
class
midpoint
X
(f)(X)
1 up to 3 1 2 2
3 up to 5 2 4 8
5 up to 7 3 6 18
7 up to 9 1 8 8
9 up to
11
3 10 30
Total 10 66
6.61
66!!
7!
n
XX
3- 39
-
8/8/2019 Week1-2 Chap 3 Descri Data
40/44
The Median of Grouped Data
)(2 if
CF
LMedia
!
where L is the lower limit of the median class, CF is the
cumulative frequency preceding the median class, fis
the frequency of the median class, and i is the median
class interval.
TheMedianMedian of a sa ple ofdata organized in a
fre encydistri tion is computed y:
3- 40
-
8/8/2019 Week1-2 Chap 3 Descri Data
41/44
Finding t e edianClass
To determine t e median class forgrouped
data
Construct a cumulative frequency distribution.
Divide t e total number ofdata values by 2.
Determine w ic class will contain t is value. Forexample, ifn= , /2 = 2 , t endetermine w ic
class will contain t e 2 t value.
3- 41
-
8/8/2019 Week1-2 Chap 3 Descri Data
42/44
Example 12 continued
o ie
o in
Fre uenc umulati e
Fre uenc1 up to 3 1 1
3 up to 5 2 3
5 up to 3 6
up to 1
up to 11 3 10
3- 42
-
8/8/2019 Week1-2 Chap 3 Descri Data
43/44
Example 12 continued
.6)2(21
)(2 !
!
! i
f
CFn
LMedian
From the table, L= , n=1 , f= , i=2, CF=
3- 43
-
8/8/2019 Week1-2 Chap 3 Descri Data
44/44
The Mode of Grouped Data
The modes in example 12 are 6 and 1
and so is bimodal.
The ModeMode for grouped data isapproximated by the midpoint of the
class with the largest class frequency.
3- 44