statistic copy 1

Post on 06-Jul-2018


  • 8/18/2019 Statistic Copy 1

    1/69

    Note: Most of the slides were taken from Elementary Statistics: A Handbook of Slide Presentation, prepared by Z.V.J. Albacea, C.E. Reano, R.V. Collado, L.N. Comia, and N.A. Tandang in 2005 for the Institute of Statistics, CAS, UP Los Baños.

    Training on Teaching

    Basic Statistics for

    Tertiary Level Teachers

    Summer 2008

    INTRODUCTION TO

    STATISTICS AND

    STATISTICAL INFERENCE

  • 8/18/2019 Statistic Copy 1

    2/69

    Session 1.2

    TEACHING BASIC STATISTICS ….


    Areas of Statistics

    Descriptive statistics: methods concerned with collecting, describing, and analyzing a set of data without drawing conclusions (or inferences) about a large group.

    Inferential statistics: methods concerned with the analysis of a subset of data leading to predictions or inferences about the entire set of data.

    Example of Descriptive Statistics

    Present the Philippine population by constructing a graph indicating the total number of Filipinos counted during the last census, by age group and sex.

    Example of Inferential Statistics

    A new milk formulation designed to improve the psychomotor development of infants was tested on randomly selected infants. Based on the results, it was concluded that the new milk formulation is effective in improving the psychomotor development of infants.

    Inferential Statistics

    [Diagram: a Smaller Set (n units/observations) drawn from a Larger Set (N units/observations), linked by Inferences and Generalizations.]

    Key Definitions

    A universe is the collection of things or observational units under consideration.
    A variable is a characteristic observed or measured on every unit of the universe.
    A population is the set of all possible values of the variable.

    Key Definitions

    Parameters are numerical measures that describe the population or universe of interest. They are usually denoted by Greek letters: µ (mu), σ (sigma), ρ (rho), λ (lambda), τ (tau), θ (theta), α (alpha), and β (beta).

    Statistics are numerical measures of a sample.

    Types of Variables

    Qualitative variable: non-numerical values
    Quantitative variable: numerical values
      a. Discrete: countable
      b. Continuous: measurable
      c. Constant

    Levels of Measurement

    1. Nominal: numbers or symbols used to classify.
    2. Ordinal scale: accounts for order, with no indication of distance between positions. Used in ranking; no meaningful numerical statements can be made about differences between categories.
    3. Interval scale: equal intervals, no absolute zero.
    4. Ratio scale: has an absolute zero.

    NOMINAL: Gender, Political Party, Religion, Automobile Ownership
    ORDINAL: Teacher's Performance, Movie Classification, Faculty Rank, Hotel Ratings, Student Class Designation
    INTERVAL: Temperature (Celsius or Fahrenheit)
    RATIO: Weight, Age, Salary

    Methods of Collecting Data

    Objective Method
    Subjective Method
    Use of Existing Records

    Methods of Presenting Data

    Textual
    Tabular
    Graphical

    Summary Measures

    Location: Maximum, Minimum, Central Tendency (Mean, Median, Mode), Percentile, Quartile, Decile
    Variation: Range, Interquartile Range, Variance, Standard Deviation, Coefficient of Variation
    Skewness
    Kurtosis

    Measures of Location

    A Measure of Location summarizes a data set by giving a "typical value" within the range of the data values that describes its location relative to the entire data set.

    Some Common Measures:
      Minimum, Maximum
      Central Tendency
      Percentiles, Deciles, Quartiles

    Maximum and Minimum

    Minimum is the smallest value in the data set, denoted as MIN.
    Maximum is the largest value in the data set, denoted as MAX.

    Measure of Central Tendency

    A single value that is used to identify the "center" of the data
      it is thought of as a typical value of the distribution
      precise yet simple
      most representative value of the data

    Mean

    Most common measure of the center
    Also known as the arithmetic average

    Sample Mean: $\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$
    Population Mean: $\mu = \frac{1}{N}\sum_{i=1}^{N} X_i$

    Properties of the Mean

    may not be an actual observation in the data set
    can be applied in at least interval level
    easy to compute
    every observation contributes to the value of the mean

    Properties of the Mean

    subgroup means can be combined to come up with a group mean
    easily affected by extreme values

    [Figure: two dot plots. On a 0 to 10 scale, Mean = 5; on a 0 to 14 scale, Mean = 6.]
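The slide does not say how subgroup means are combined. A minimal sketch, assuming the usual approach of weighting each subgroup mean by its subgroup size; the function name `combined_mean` and the numbers are hypothetical, chosen only for illustration:

```python
def combined_mean(sizes, means):
    """Weighted mean of subgroup means, each weighted by its subgroup size."""
    if not sizes or len(sizes) != len(means):
        raise ValueError("sizes and means must be non-empty and of equal length")
    total = sum(sizes)
    return sum(n * m for n, m in zip(sizes, means)) / total


# Hypothetical subgroups: 10 scores averaging 80 and 30 scores averaging 90.
overall = combined_mean([10, 30], [80.0, 90.0])
print(overall)  # 87.5
```

Simply averaging 80 and 90 would give 85, which ignores that the second subgroup is three times larger.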

    Median

    Divides the ordered observations into two equal parts
    If the number of observations is odd, the median is the middle number.
    If the number of observations is even, the median is the average of the 2 middle numbers.
    The sample median is denoted as $\tilde{x}$, while the population median is denoted as $\tilde{\mu}$.

    Properties of a Median

    may not be an actual observation in the data set
    can be applied in at least ordinal level
    a positional measure
    not affected by extreme values

    [Figure: two dot plots, on 0 to 10 and 0 to 14 scales; Median = 5.]
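The contrast between the mean and the median under an extreme value can be checked directly. The two data sets below are hypothetical, picked so that their summaries match the values shown on the slides (means of 5 and 6, median of 5); the actual plotted points are not recoverable from the transcript.

```python
from statistics import mean, median

# Hypothetical data sets; only the summary values come from the slides.
base = [1, 3, 5, 7, 9]
with_extreme = [1, 3, 5, 7, 14]  # same data, largest value pushed out to 14

print(mean(base), median(base))                  # mean 5, median 5
print(mean(with_extreme), median(with_extreme))  # mean 6, median 5
```

The extreme value pulls the mean up by one unit, while the median, being positional, stays where it was.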

    Mode

    occurs most frequently
    nominal average
    may or may not exist

    [Figure: dot plot on a 0 to 14 scale, Mode = 9; dot plot on a 0 to 6 scale, No Mode.]

    Properties of a Mode

    can be used for qualitative as well as quantitative data
    may not be unique
    not affected by extreme values
    can be computed for ungrouped and grouped data
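A short sketch of these properties using Python's standard `statistics.multimode`, which returns every most-frequent value. The color list is a hypothetical example of qualitative data. Note that when every value occurs equally often, `multimode` returns all of them; treating that case as "no mode" is the convention on the slide, not something the function decides.

```python
from statistics import multimode

# Hypothetical qualitative data: the mode need not be unique.
colors = ["red", "blue", "red", "green", "blue"]
print(multimode(colors))  # ['red', 'blue']

# Every value occurs once, so all values tie (the "No Mode" situation).
flat = [0, 1, 2, 3, 4, 5, 6]
ties = multimode(flat)
has_mode = len(ties) < len(set(flat))
print(has_mode)  # False
```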

    Mean, Median & Mode

    Use the mean when:
      sampling stability is desired
      other measures are to be computed

    Use the median when:
      the exact midpoint of the distribution is desired
      there are extreme observations

    Use the mode when:
      the "typical" value is desired
      the dataset is measured on a nominal scale

    Percentiles

    Numerical measures that give the relative position of a data value relative to the entire data set.
    Divide an array (raw data arranged in increasing or decreasing order of magnitude) into 100 equal parts.
    The jth percentile, denoted as $P_j$, is the data value in the data set that separates the bottom j% of the data from the top (100 - j)%.

    EXAMPLE

    Suppose LB was told that, relative to the other scores on a certain test, his score was the 95th percentile.

    This means that 95% of those who took the test had scores less than or equal to LB's score, while 5% had scores higher than LB's.

    Deciles

    Divide an array into ten equal parts, each part having ten percent of the distribution of the data values, denoted by $D_j$.
    The 1st decile is the 10th percentile; the 2nd decile is the 20th percentile, and so on.

    Quartiles

    Divide an array into four equal parts, each part having 25% of the distribution of the data values, denoted by $Q_j$.
    The 1st quartile is the 25th percentile; the 2nd quartile is the 50th percentile, also the median; and the 3rd quartile is the 75th percentile.
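The relationship among percentiles, deciles, quartiles, and the median can be illustrated with `statistics.quantiles` from the Python standard library. This is only a sketch: the function's default "exclusive" interpolation is one of several conventions, and textbooks (including possibly this handbook) may use another that gives slightly different cut points. The data are the eight values used in the deck's standard deviation example.

```python
from statistics import median, quantiles

data = [10, 12, 14, 15, 17, 18, 18, 24]

quartiles = quantiles(data, n=4)      # 3 cut points: Q1, Q2, Q3
deciles = quantiles(data, n=10)       # 9 cut points: D1 ... D9
percentiles = quantiles(data, n=100)  # 99 cut points: P1 ... P99

print(quartiles)  # [12.5, 16.0, 18.0] under the default method

# Q2, D5, and P50 all coincide with the median.
print(quartiles[1], deciles[4], percentiles[49], median(data))
```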

    Measures of Variation

    A measure of variation is a single value that is used to describe the spread of the distribution.
    A measure of central tendency alone does not uniquely describe a distribution.

    A look at dispersion

    [Figure: three dot plots on a common scale from 11 to 21.]
    Data A: Mean = 15.5, s = 3.338
    Data B: Mean = 15.5, s = 0.9258
    Data C: Mean = 15.5, s = 4.57

    Two Types of Measures of Dispersion

    Absolute Measures of Dispersion:
      Range
      Inter-quartile Range
      Variance
      Standard Deviation

    Relative Measure of Dispersion:
      Coefficient of Variation

    Range (R)

    The difference between the maximum and minimum value in a data set, i.e.,
    R = MAX - MIN

    Example: Pulse rates of 15 male residents of a certain village
    [Data listing garbled in the transcript; the values run from 54 to 85.]

    R = 85 - 54 = 31

    Some Properties of the Range

    The larger the value of the range, the more dispersed the observations are.
    It is quick and easy to understand.
    A rough measure of dispersion.


    Inter

    Some Properties of IQR

    Reduces the influence of extreme values.
    Not as easy to calculate as the Range.

    Variance

    important measure of variation
    shows variation about the mean

    Population variance: $\sigma^2 = \frac{\sum_{i=1}^{N} (X_i - \mu)^2}{N}$
    Sample variance: $s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}$

    Standard Deviation (SD)

    most important measure of variation
    square root of Variance
    has the same units as the original data

    Population SD: $\sigma = \sqrt{\frac{\sum_{i=1}^{N} (X_i - \mu)^2}{N}}$
    Sample SD: $s = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}}$

    Computation of Standard Deviation

    Data: 10 12 14 15 17 18 18 24
    n = 8, Mean = 16
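The worked steps of the slide are not preserved in the transcript. A minimal sketch of the computation, assuming the sample formula (divisor n - 1) is the one intended, and cross-checked against `statistics.stdev`:

```python
from math import sqrt
from statistics import stdev

data = [10, 12, 14, 15, 17, 18, 18, 24]

n = len(data)                                       # 8
xbar = sum(data) / n                                # 16.0
sum_sq_dev = sum((x - xbar) ** 2 for x in data)     # 130.0
s = sqrt(sum_sq_dev / (n - 1))                      # sample SD, divisor n - 1

print(round(s, 4))             # 4.3095
print(round(stdev(data), 4))   # same value from the standard library
```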

    Remarks on Standard Deviation

    If there is a large amount of variation, then on average the data values will be far from the mean. Hence, the SD will be large.

    If there is only a small amount of variation, then on average the data values will be close to the mean. Hence, the SD will be small.

    Comparing Standard Deviation

    [Figure: three dot plots on a common scale from 11 to 21.]
    Data A: Mean = 15.5, s = 3.338
    Data B: Mean = 15.5, s = 0.9258
    Data C: Mean = 15.5, s = 4.57

    Comparing Standard Deviation

    Example: Team A - Heights of five marathon players in inches
    65", 65", 65", 65", 65"
    Mean = 65", s = 0

    Example: Team B - Heights of five marathon players in inches
    62", 67", 66", 70", 60"
    Mean = 65", s = 4.0"
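The two teams share a mean but differ in spread, which can be verified with the standard library. This sketch assumes the sample standard deviation (divisor n - 1), which reproduces the value shown for Team B.

```python
from statistics import mean, stdev

team_a = [65, 65, 65, 65, 65]   # heights in inches
team_b = [62, 67, 66, 70, 60]

for name, heights in (("Team A", team_a), ("Team B", team_b)):
    print(name, "mean =", mean(heights), "s =", stdev(heights))
# Both means are 65; s is 0 for Team A and 4.0 for Team B.
```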

    Properties of Standard Deviation

    It is the most widely used measure of dispersion. (Chebyshev's Inequality)
    It is based on all the items and is rigidly defined.
    It is used to test the reliability of measures calculated from samples.
    The standard deviation is sensitive to the presence of extreme values.
    It is not easy to calculate by hand (unlike the range).

    Chebyshev's Rule

    It permits us to make statements about the percentage of observations that must be within a specified number of standard deviations from the mean.

    The proportion of any distribution that lies within k standard deviations of the mean is at least $1 - \frac{1}{k^2}$, for any k > 1.

     

    Chebyshev's Rule

    For any data set with mean (µ) and standard deviation (SD), the following statements apply:
      At least 75% of the observations are within 2SD of its mean.
      At least 88.9% of the observations are within 3SD of its mean.

    Illustration

    [Figure: distribution with the region within 2SD of the mean marked "At least 75%".]

    At least 75% of the observations are within 2SD of its mean.

    Example

    The midterm exam scores of 100 STAT 1 students last semester had a mean of 65 and a standard deviation of 8 points.

    Applying Chebyshev's Rule, we can say that:
      1. At least 75% of the students had scores between 49 and 81.
      2. At least 88.9% of the students had scores between 41 and 89.
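The two statements in the example follow from the bound 1 - 1/k² with k = 2 and k = 3. A small sketch, assuming the example's figures are a mean of 65 and an SD of 8 (the values implied by the stated intervals); the function names are hypothetical:

```python
def chebyshev_bound(k):
    """Minimum proportion of observations within k SDs of the mean (k > 1)."""
    if k <= 1:
        raise ValueError("the bound is only informative for k > 1")
    return 1 - 1 / k**2


def chebyshev_interval(mean, sd, k):
    """Interval from mean - k*sd to mean + k*sd."""
    return mean - k * sd, mean + k * sd


for k in (2, 3):
    low, high = chebyshev_interval(65, 8, k)
    print(f"k={k}: at least {chebyshev_bound(k):.1%} of scores lie between {low} and {high}")
# k=2: at least 75.0% of scores lie between 49 and 81
# k=3: at least 88.9% of scores lie between 41 and 89
```

The bound holds for any distribution, which is why it is weaker than what a specific distribution such as the normal would give.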

    Coefficient of Variation (CV)

    measure of relative variation
    usually expressed in percent
    shows variation relative to mean
    used to compare 2 or more groups

    Formula: $CV = \frac{S}{\text{Mean}} \times 100\%$

    Comparing CVs

    Stock A: Average Price = P50
      SD = P5
      CV = 10%

    Stock B: Average Price = P100
      SD = P5
      CV = 5%
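The comparison can be reproduced in a few lines. This sketch assumes the stock figures are an average price of 50 with SD 5 for Stock A and an average price of 100 with SD 5 for Stock B; `cv_percent` is a hypothetical helper name.

```python
def cv_percent(sd, mean):
    """Coefficient of variation, expressed in percent."""
    if mean == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return 100 * sd / mean


cv_a = cv_percent(sd=5, mean=50)    # Stock A
cv_b = cv_percent(sd=5, mean=100)   # Stock B
print(cv_a, cv_b)  # 10.0 5.0
```

Both stocks have the same SD, but Stock B varies less relative to its price level.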

    Measure of Skewness

    Describes the degree of departure of the distribution of the data from symmetry.
    The degree of skewness is measured by the coefficient of skewness, denoted as SK and computed as

    $SK = \frac{3(\text{Mean} - \text{Median})}{SD}$
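A minimal sketch of this coefficient in Python. Two assumptions are made here: the SD is taken to be the sample standard deviation, and the data sets are hypothetical, chosen so the arithmetic is easy to follow.

```python
from statistics import mean, median, stdev


def pearson_skewness(data):
    """Coefficient of skewness: 3 * (mean - median) / SD, using the sample SD."""
    sd = stdev(data)
    if sd == 0:
        raise ValueError("skewness is undefined when all values are equal")
    return 3 * (mean(data) - median(data)) / sd


symmetric = [1, 3, 5, 7, 9]       # mean = median = 5
right_tail = [1, 3, 5, 7, 14]     # mean 6, median 5, sample SD 5

print(pearson_skewness(symmetric))   # 0.0
print(pearson_skewness(right_tail))  # 0.6, a positive value
```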

    What is Symmetry?

    A distribution is said to be symmetric about the mean if the distribution to the left of the mean is the "mirror image" of the distribution to the right of the mean. Likewise, a symmetric distribution has SK = 0, since its mean is equal to its median and its mode.

    Measure of Skewness

    SK > 0: positively skewed
    SK < 0: negatively skewed

    Measure of Kurtosis

    Describes the extent of peakedness or flatness of the distribution of the data.
    Measured by the coefficient of kurtosis (K), computed as

    $K = \frac{\sum_{i=1}^{N} (X_i - \mu)^4}{N \sigma^4} - 3$
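A sketch of the coefficient as reconstructed here, that is, the average fourth power of the standardized deviations minus 3, using the population mean and population SD. The data sets are hypothetical.

```python
from statistics import fmean, pstdev


def kurtosis_coefficient(data):
    """K = sum((x - mu)^4) / (N * sigma^4) - 3, with population mu and sigma."""
    n = len(data)
    mu = fmean(data)
    sigma = pstdev(data)
    if sigma == 0:
        raise ValueError("kurtosis is undefined when all values are equal")
    return sum((x - mu) ** 4 for x in data) / (n * sigma**4) - 3


flat = [1, 3, 5, 7, 9]                         # evenly spread values
peaked = [0, 0, 0, 0, 0, 0, 0, 0, -10, 10]     # mass at the center, two far values

print(kurtosis_coefficient(flat))    # about -1.3 (K < 0)
print(kurtosis_coefficient(peaked))  # about 2.0 (K > 0)
```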

    Measure of Kurtosis

    K = 0: mesokurtic
    K > 0: leptokurtic
    K < 0: platykurtic


    Box

    The diagram is made up of a box which lies between the first and third quartiles.
    The whiskers are the straight lines extending from the ends of the box to the smallest and largest values that are not outliers.

    Steps to Construct a Box

    Step 2: Place marks at distances 1.5 IQR from either end of the box.
    1.5 IQR = 15

    [Figure: box with Q1 = 75, median = 78, and Q3 = 85; marks at 60 and 100, each 1.5 IQR from the box.]

    Step 3: Draw the horizontal line segments known as the "whiskers" from each end of the box to the largest and smallest values in the data set that are not outliers.
    (An observation more than 1.5 IQR beyond either end of the box is an outlier.)
    If the largest and smallest values in the data set are outliers, extend the whiskers until 1.5 IQR from either end of the box.

    Step 4: For every outlier, draw a dot. If two or more dots have the same values, draw the dots side by side.

    [Figure: completed box plot with Q1 = 75, median = 78, and Q3 = 85; marks at 60 and 100, each 1.5 IQR from the box; the values 55 and 98 are labeled.]
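The marks, whiskers, and outliers in Steps 2 to 4 can be sketched in code. The quartiles Q1 = 75 and Q3 = 85 are read from the figure; the data list and the function names are hypothetical, since the original data set is not preserved in the transcript.

```python
def fences(q1, q3):
    """Marks placed 1.5 IQR below Q1 and 1.5 IQR above Q3."""
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


def whiskers_and_outliers(data, q1, q3):
    """Whisker ends (extreme non-outlier values) and the sorted outliers."""
    low, high = fences(q1, q3)
    inside = [x for x in data if low <= x <= high]
    outliers = sorted(x for x in data if x < low or x > high)
    return min(inside), max(inside), outliers


# Quartiles as read from the slide's figure; data values are hypothetical.
scores = [55, 70, 75, 76, 78, 80, 85, 90, 98]
print(fences(75, 85))                         # (60.0, 100.0)
print(whiskers_and_outliers(scores, 75, 85))  # (70, 98, [55])
```

Here 55 falls below the lower mark at 60, so it is drawn as a dot, and the whiskers stop at 70 and 98.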