descriptive statistics - ftmsΒ Β· 2019-02-20Β Β· interquartile range ... semi-interquartile range...

25
DESCRIPTIVE STATISTICS Ms Nurazrin Jupri

Upload: others

Post on 30-Jun-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

DESCRIPTIVE

STATISTICS

Ms Nurazrin Jupri

Page 2: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

SKEWNESS

Skewness measures the lack of symmetry in a data

distribution.

The skewed portion is the long and thin part of the curve.

A skewed distribution: the data are sparse at one end of

distribution but piled up at the other end.

Ms Nurazrin Jupri

Page 3: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

SKEWNESS IN RELATION TO

BETWEEN MEAN, MEDIAN & MODE

Mode : the highest point of the curve

Median : the middle value

Mean : located somewhere towards the tail of the

distribution

Affected by all values, including extreme values

Bell-shaped / normal distribution has NO SKEWNESS

Mean = Median = Mode

Ms Nurazrin Jupri

Page 4: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

MODE < MEDIAN < MEAN

Positively skewed

Skewed to the right

Ms Nurazrin Jupri

Page 5: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

MODE = MEDIAN = MEAN

Symmetrical

Zero-Skewness

Evenly or normally distributed

Ms Nurazrin Jupri

Page 6: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

MEAN < MEDIAN < MODE

Negatively skewed

Skewed to the left

Ms Nurazrin Jupri

Page 7: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

MEASURE OF SKEWNESS

To determine the difference between the mean and the mode of

the distribution

Mean – Mode = +ve distribution is right or positively

skewed

Mean – Mode = -ve distribution is left or negatively

skewed

Mean – Mode = 0 distribution is symmetrical

𝑷𝒆𝒂𝒓𝒔𝒐𝒏 π’„π’π’†π’‡π’‡π’Šπ’„π’Šπ’†π’π’• 𝒐𝒇 π’”π’Œπ’†π’˜π’π’†π’”π’” =πŸ‘(π’Žπ’†π’‚π’ βˆ’ π’Žπ’†π’…π’Šπ’‚π’)

𝒔𝒕𝒂𝒏𝒅𝒂𝒓𝒅 π’…π’†π’—π’Šπ’‚π’•π’Šπ’π’

Ms Nurazrin Jupri

Page 8: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXERCISE 1

1. What is the relationship between mean, median and

mode?

Find the mean median and mode of:

1, 2, 2, 3, 3, 3, 4, 4, 4, 4, 4, 4, 5, 5, 5, 6, 6, 7

β€’ Mean is 4.

β€’ Median is 4.

β€’ Mode is 4.

Ms Nurazrin Jupri

Page 9: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXERCISE 1 (CONT.)

What is the relationship between mean, median

and mode?

Find the mean, median and mode of:

0, 5, 10, 20, 40, 45, 45, 50, 50, 50, 60, 60, 60, 60, 60, 60, 70,

70, 70, 70, 70, 70, 70, 70

β€’ The mean is 51.5.

β€’ The median is 60.

β€’ The mode is 70.

Ms Nurazrin Jupri

Page 10: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXERCISE 1 (CONT.)

What is the relationship between mean, median

and mode?

β€’ Find the mean, median, and mode of:

20, 20, 20, 20, 20, 20, 20, 20, 30, 30, 30, 30, 30, 30, 45, 45, 45, 50, 50, 60,

70, 90

β€’ The mean is 36.1.

β€’ The median is 30.

β€’ The mode is 20.

Ms Nurazrin Jupri

Page 11: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

QUARTILE

Normally used to describe positional values of large sets

of numerical data.

First quartile (Q1)

Second quartile (Q2)

Third quartile (Q3)

Ms Nurazrin Jupri

Page 12: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

FIRST QUARTILE (Q1)

Is a positional value where :

25% of the observations are smaller

75% of the observation are larger

Step 1: Find first quartile position

π‘ΈπŸ =𝒏 + 𝟏

πŸ’

Step 2: Arrange data

Step 3: Find first quartile value which correspond with first

quartile position.

Ms Nurazrin Jupri

Page 13: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

THIRD QUARTILE (Q3)

Is a positional value where :

75% of the observations are smaller

25% of the observation are larger

Step 1: Find third quartile position

π‘ΈπŸ‘ =πŸ‘(𝒏 + 𝟏)

πŸ’

Step 2: Arrange data

Step 3: Find third quartile value which correspond with

third quartile position.

Ms Nurazrin Jupri

Page 14: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXAMPLE 1

The 3 year annual returns of 14 low-risk funds are given as

follows.

9.77 11.35 12.46 13.80 15.47 17.48 18.37

18.47 18.61 20.72 21.49 22.47 31.50 38.16

Find the first and third quartile.

𝑄1(π‘π‘œπ‘ π‘–π‘‘π‘–π‘œπ‘›) =14 + 1

4= 3.75

Approximately, the forth position of data : 13.80

𝑄3(π‘π‘œπ‘ π‘–π‘‘π‘–π‘œπ‘›) =3(14 + 1)

4= 11.25

Approximately, the eleventh position of data : 21.49

Ms Nurazrin Jupri

Page 15: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

QUARTILES FOR GROUPED DATA

Step 1: Obtain the cumulative frequencies

Step 2: Identify the first and third quartile position by using

formula quartile position.

Step 3: Identify the first and third quartile classes.

β€’ Quartile position

β€’ Cumulative frequencies

Step 4: Find the first and third quartile by using formula.

π‘ΈπŸ =𝒏

πŸ’ π‘ΈπŸ‘ =

πŸ‘π’

πŸ’

Ms Nurazrin Jupri

Page 16: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

QUARTILES FORMULA

First Quartile

Third Quartile

LCB = Lower class boundary

n = Number of observations

CF = Cumulative frequency before the quartile class

f = Frequency for quartile class

C = Class size

π‘ΈπŸ = 𝑳π‘ͺπ‘©πŸ + (π‘ͺ𝟏)

π’πŸ’ βˆ’ π‘ͺπ‘­πŸ

π’‡πŸ

π‘ΈπŸ‘ = 𝑳π‘ͺπ‘©πŸ‘ + (π‘ͺπŸ‘)

πŸ‘π’πŸ’ βˆ’ π‘ͺπ‘­πŸ‘

π’‡πŸ‘

Ms Nurazrin Jupri

Page 17: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXAMPLE 2

Table shows the distribution of test scores obtained by 42

students in Statistics class. Calculate Q1 and Q3.

Scores obtained Number of students

80-90 1

90-100 2

100-110 5

110-120 10

120-130 15

130-140 7

140-150 2

Total 42

Ms Nurazrin Jupri

Page 18: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXAMPLE 2 (CONT.)

Scores obtained Number of students Cumulative frequency

80-90 1 1

90-100 2 3

100-110 5 8

110-120 10 18

120-130 15 33

130-140 7 40

140-150 2 42

π‘ΈπŸ‘ =πŸ‘(πŸ’πŸ)

πŸ’= πŸ‘πŸ. πŸ“ π‘ΈπŸ =

πŸ’πŸ

πŸ’= 𝟏𝟎. πŸ“

Ms Nurazrin Jupri

Page 19: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXAMPLE 2 (CONT.)

Scores obtained Number of students Cumulative frequency

80-90 1 1

90-100 2 3

100-110 5 8

110-120 10 18

120-130 15 33

130-140 7 40

140-150 2 42

π‘ΈπŸ = 𝑳π‘ͺπ‘©πŸ + (π‘ͺ𝟏)

π’πŸ’ βˆ’ π‘ͺπ‘­πŸ

π’‡πŸ

= 110 + 120 βˆ’ 11010.5βˆ’8

10

= 110 + 2.5

= 112.50

π‘ΈπŸ‘ = 𝑳π‘ͺπ‘©πŸ‘ + (π‘ͺπŸ‘)

πŸ‘π’πŸ’ βˆ’ π‘ͺπ‘­πŸ‘

π’‡πŸ‘

= 120 + 130 βˆ’ 12031.5 βˆ’ 18

15

= 120 + 9

= 129

Ms Nurazrin Jupri

Page 20: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

INTERQUARTILE RANGE

The difference between the third and first quartiles in a set

of data.

One of dispersion measurement

π‘°π’π’•π’†π’“π’’π’–π’‚π’“π’•π’Šπ’π’† π’“π’‚π’π’ˆπ’† = π‘ΈπŸ‘ βˆ’ π‘ΈπŸ

Ms Nurazrin Jupri

Page 21: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

SEMI-INTERQUARTILE RANGE

Known as Quartile Deviation

One of dispersion measurement

π‘Έπ’–π’‚π’“π’•π’Šπ’π’† π‘«π’†π’—π’Šπ’‚π’•π’Šπ’π’ = π‘ΈπŸ‘ βˆ’ π‘ΈπŸ

𝟐

Ms Nurazrin Jupri

Page 22: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXAMPLE 3

Refer to example 2 , find the interquartile and semi

interquartile range.

π‘°π’π’•π’†π’“π’’π’–π’‚π’“π’•π’Šπ’π’† π’“π’‚π’π’ˆπ’† = πŸπŸπŸ— βˆ’ 𝟏𝟎𝟐. πŸ“ = πŸπŸ”. πŸ“

π‘Ίπ’†π’Žπ’Š π’Šπ’π’•π’†π’“π’’π’–π’‚π’“π’•π’Šπ’π’† π’“π’‚π’π’ˆπ’† =πŸπŸ”. πŸ“

𝟐= πŸπŸ‘. πŸπŸ“

Ms Nurazrin Jupri

Page 23: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

COEFFICIENT OF VARIATION

Used while comparing distributions of different means and

variances

Gives the ratio of standard deviation to mean expressed

as percent.

π‘ͺ𝑽 =𝒔𝒕𝒂𝒏𝒅𝒂𝒓𝒅 π’…π’†π’—π’Šπ’‚π’•π’Šπ’π’

π’Žπ’†π’‚π’Γ— 𝟏𝟎𝟎

Ms Nurazrin Jupri

Page 24: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXAMPLE 4

Typist Ani can type 40 words per minutes with standard

deviation of 5 while typist Jura can type 160 words per

minutes with standard deviation of 10. which typist is more

consistent in her work?

Standard deviation of Jura is twice than Ani

Ani can type four times the speed of Jura

π‘ͺπ‘½π‘¨π’π’Š =πŸ“

πŸ’πŸŽΓ— 𝟏𝟎𝟎 = 𝟏𝟐. πŸ“%

π‘ͺ𝑽𝑱𝒖𝒏𝒂 =𝟏𝟎

πŸπŸ”πŸŽΓ— 𝟏𝟎𝟎 = πŸ”. πŸπŸ“%

It shows that the typing ability of typist Jura is more consistent than

typist Ani

Ms Nurazrin Jupri

Page 25: Descriptive Statistics - FTMSΒ Β· 2019-02-20Β Β· INTERQUARTILE RANGE ... SEMI-INTERQUARTILE RANGE Known as Quartile Deviation One of dispersion measurement 𝒂 π’Š π’Šπ’‚ π’Š

EXERCISE 2

The investments of Karu and Kamal are given as below:

Whose investment is considered to be more consistent?

Karu Kamal

Profit (RM) 250 250

Standard Deviation 8.16 238.05

Ms Nurazrin Jupri