quantitative methods of decision making

29
. . . . . . . . . Quantitative Methods of Decision Making Slide presentations Jerzy Mycielski CMT 2008 Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 1 / 208

Upload: others

Post on 04-May-2022

12 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Quantitative Methods of Decision Making

. . . . . .

.

.

. ..

.

.

Quantitative Methods of Decision MakingSlide presentations

Jerzy Mycielski

CMT

2008

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 1 / 208

Page 2: Quantitative Methods of Decision Making

. . . . . .

Using statistical techniques in business

The role of statistics is to collect, summarize and analyze the data

Two branches of statistics:

Descriptive statistics

Describes the collections of objects (e.g. persons, products, firms) withrespect to their characteristics

Inferential statistics

Techniques which make possible to make conclusions about largecollection of objects (all Poles, all employees in the firm) on the basisof small portion of this collection

The first part of this lecture will describe the methods of descriptivestatistics and the second part will cover the inferential statistics

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 2 / 208

Page 3: Quantitative Methods of Decision Making

. . . . . .

Population and sample

Population is the collection of objects which is of interest of a giveninvestigation

population can be finite (e.g. population of firms in Poland) or infinite(e.g. possible values of a price index)

Sample is the part of the population which is used in investigation

sample is always finite

Sample frame is the list of all the population members

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 3 / 208

Page 4: Quantitative Methods of Decision Making

. . . . . .

Random sample

Simple random sample is sample selected in such a way that eachelement of the population has equal chance of being selected.

We say that sample is selected without replacement if an element ofthe population can only be selected once to the sample

Sampling is almost always done without replacement

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 4 / 208

Page 5: Quantitative Methods of Decision Making

. . . . . .

Relationship between probability and statistics

.Definition..

.

. ..

.

.

Parameter is a descriptive measure computed from or used to describe thepopulation

Parameter is nonrandom

Value of parameter is our object of interest

However, it is too difficult/costly to collect data in order to calculatethe value parameter

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 5 / 208

Page 6: Quantitative Methods of Decision Making

. . . . . .

Relationship between probability and statistics

.Definition..

.

. ..

.

.

Statistic is a descriptive measure computed from or used to describe thesample

But: statistic is random as sample is randomly chosen

Statistic is either used to estimate (approximate) the parameter or tomake inference about the properties of the parameter

As statistic is random it is necessary to use the laws of probability toinvestigate properties of statistic

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 6 / 208

Page 7: Quantitative Methods of Decision Making

. . . . . .

Data sources

Three main sources of data for the researchers and managers:

sample surveysdesigned experimentsroutine operation

With sample surveys and designed experiments we collect exactly thedata which is needed but they are usually costly

Routine operation data does not often include the information ofinterest and it is difficult to gather

Almost always, the construction of the database is the most costlypart of the statistical investigation

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 7 / 208

Page 8: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical tablesGrouped data

Ordered array: observations ordered according to their values

class intervals: contiguous, nonoverlapping and exhaustive

usually class intervals are of equal width

grouped data: frequency of the occurrence in class intervals

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 8 / 208

Page 9: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical tables

.Definition (Frequency distribution)..

.

. ..

.

.

Frequency distribution is any device which shows the values of the variabletogether with the frequency of occurrence of the values

cumulative frequency distribution function: cumulated frequenciesfrom the first class interval through the preceding interval, inclusive

relative frequencies: proportion of observations within certain classinterval

Using relative frequencies we may construct cumulative relativefrequency distribution

It is important to define class intervals in such a way to obtainsufficient number of observations in each interval

To obtain the sufficient numbers of observations in class intervals it issometimes necessary to define the class intervals with unequal width

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 9 / 208

Page 10: Quantitative Methods of Decision Making

. . . . . .

Example: Analyzing employment structure

September survey of the wage structure classified by profession

Year 2004, Poland

Population: firms, organizations and individuals employing 10 personsor more

Exceptions: individual farms, NG0s, political parties, trade unions

Sample frame: REGON (National register of economic entities inPoland)

Sampling: all the entities are obliged to fill questionnaires but on thefirm level the employed are sampled

Number of employed included in the sample depends on employmentin the entity but not in the linear way

Sample is not fully random as probability of being included in thesample depends on entity size

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 10 / 208

Page 11: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical tablesHistogram - grouped data table

Lower Upper FrequencyRelative

frequencyCumulative frequency

0 - 1000 567 0.093 0.0931000 - 1500 1199 0.197 0.2901500 - 2000 1313 0.216 0.5062000 - 2500 1002 0.165 0.6702500 - 3000 672 0.110 0.7813000 - 3500 460 0.076 0.8563500 - 4000 268 0.044 0.9004000 - 4500 159 0.026 0.9264500 - 5000 108 0.018 0.9445000 - 5500 79 0.013 0.9575500 - 6000 53 0.009 0.9666000 - 6500 33 0.005 0.9716500 - 7000 37 0.006 0.9777000 - 7500 26 0.004 0.9827500 - 8000 16 0.003 0.9848000 - 8500 18 0.003 0.9878500 - 9000 13 0.002 0.9899000 - 9500 10 0.002 0.9919500 - 10000 9 0.001 0.992

10000 - 46 0.008 1.000Total 6088

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 11 / 208

Page 12: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical graphsHistogram - frequencies

0

200

400

600

800

1000

1200

1400

1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000 6500 7000 7500 8000 8500 9000 9500

Salary

Fre

qu

ency

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 12 / 208

Page 13: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical graphsHistogram - frequencies

0

50

100

150

200

250

600

850

1100

1350

1600

1850

2100

2350

2600

2850

3100

3350

3600

3850

4100

4350

4600

4850

5100

5350

5600

5850

6100

6350

6600

6850

7100

7350

7600

7850

8100

8350

8600

8850

9100

9350

9600

9850

Salary

Fre

qu

ency

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 13 / 208

Page 14: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical graphsHistogram - relative frequencies

0.000

0.050

0.100

0.150

0.200

0.250

1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000 6500 7000 7500 8000 8500 9000 9500 10000

Salary

Rel

ativ

e fr

equ

ency

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 14 / 208

Page 15: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical graphsHistogram - frequency polygon

0

200

400

600

800

1000

1200

1400

0 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000 6500 7000 7500 8000 8500 9000 9500

Salary

Fre

qu

ency

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 15 / 208

Page 16: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical graphsHistogram - ogive

0.000

0.100

0.200

0.300

0.400

0.500

0.600

0.700

0.800

0.900

1.000

0 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000 6500 7000 7500 8000 8500 9000 9500 10000

Salary

Cu

mu

lati

ve f

req

uen

cy

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 16 / 208

Page 17: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical graphsPie charts

1%

0%

2%

30%

4%

3%

9%1%7%

3%

8%

7%

14%

9%2%

FarmingFishery

MiningIndustryEnergy

ConstructionTradeTourism

TransportationFinanceServices to firms

Administration and militaryEducationHealth

Employed by housholds

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 17 / 208

Page 18: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical graphsPivot table

Man WomenFarming 70.00% 30.00%Fishery 80.00% 20.00%Mining 87.38% 12.62%Industry 65.59% 34.41%Energy 81.40% 18.60%Construction 88.24% 11.76%Trade 53.82% 46.18%Tourism 36.23% 63.77%Transportation 69.08% 30.92%Finance 26.04% 73.96%Services to firms 57.26% 42.74%Administration and military 26.35% 73.65%Education 22.94% 77.06%Health 18.69% 81.31%Employed by housholds 55.28% 44.72%

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 18 / 208

Page 19: Quantitative Methods of Decision Making

. . . . . .

Constructing statistical graphsBar charts

0.00% 10.00% 20.00% 30.00% 40.00% 50.00% 60.00% 70.00% 80.00% 90.00% 100.00%

Farming

Fishery

Mining

Industry

Energy

Construction

Trade

Tourism

Transportation

Finance

Services to firms

Administration and military

Education

Health

Employed by housholds

Men Women

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 19 / 208

Page 20: Quantitative Methods of Decision Making

. . . . . .

Measures of central tendency

.Definition (Sample mean)..

.

. ..

.

.

x =

∑ni=1 xi

n

Symbol∑n

i=1 means ”summation from i = 1 to i = n)

for a given sample, the value of the mean is unique

the sum of deviations of observations from the sample mean is equalto zero

mean is affected by the magnitude of each observation

mean is additive: the mean of the sum of two characteristics is equalto the sum of means of these characteristics

Note: sample mean is also called arithmetic average

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 20 / 208

Page 21: Quantitative Methods of Decision Making

. . . . . .

Measures of central tendency

.Definition (Sample median).... ..

.

.Median is the value above which lie the half of the values of observation

for a given sample median can always be calculated

if the number of observation is even that it is calculated as a mean oftwo observations

the median is not affected by the magnitude of the extremeobservations

median can also be used to characterize the qualitative data

median is not additive

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 21 / 208

Page 22: Quantitative Methods of Decision Making

. . . . . .

Measures of central tendency

.Definition (Sample mode).... ..

.

.

Mode for ungrouped discrete data is the value that occurs most frequently

for some samples mode does not exist (e.g. all values for observationsdifferent)

it can happen that mode is not unique

mode is not additive

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 22 / 208

Page 23: Quantitative Methods of Decision Making

. . . . . .

Example: Analyzing wages of employees

Mean Median Mode

All 2425.3 1986.7 824.0Men 2634.0 2124.5 824.0Women 2204.1 1855.0 824.0

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 23 / 208

Page 24: Quantitative Methods of Decision Making

. . . . . .

Dispersion

By dispersion we mean the degree to which values in a set varyaround their mean

Other terms for the same concept are variation, scatter, spread

When values in a set are concentrated around the mean we say thatthe dispersion is small

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 24 / 208

Page 25: Quantitative Methods of Decision Making

. . . . . .

Measures of dispersion

.Definition (Range)..

.

. ..

.

.

Range is defined as the difference between the largest and the smallestvalues in a data set

The range is usually unsatisfactory measure of dispersion as it isdetermined only by two most extreme values in the dataset.

Notice that mean deviation is always equal to zero (see properties ofthe mean)

Negative and positive deviation should be treated the same

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 25 / 208

Page 26: Quantitative Methods of Decision Making

. . . . . .

Measures of dispersion

.Definition (Mean absolute deviation)..

.

. ..

.

.

MAD (x) =

∑ni=1 |xi − x |

n

The mean absolute deviation is an intuitive measure of variation butit is not popular because of troublesome mathematical properties

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 26 / 208

Page 27: Quantitative Methods of Decision Making

. . . . . .

Measures of dispersion

.Definition (Sample variance)..

.

. ..

.

.

s2x =

∑ni=1 (xi − x)2

n − 1

Properties of the variance

variance of yi = axi is equal to s2y = a2sx so that the change of units of

x results in change of variance which is proportional to the square of a

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 27 / 208

Page 28: Quantitative Methods of Decision Making

. . . . . .

Measures of dispersion

.Definition (Sample standard deviation)..

.

. ..

.

.

sx =√

s2x

The main advantage of the standard deviation over variance is thatfor yi = axi , the standard deviation sy = asx .

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 28 / 208

Page 29: Quantitative Methods of Decision Making

. . . . . .

Measures of dispersion

.Definition (Coefficient of variation)..

.

. ..

.

.

cv =sxx

Range, mean absolute deviation, variance and standard deviation alldepend on the units

Therefore these measures cannot be to compare dispersion of thecharacteristics expressed in different units

As coefficient of variation for yi = axi and xi are equal the coefficientof variation is dimensionless number

Therefore it can be used for comparisons of dispersion of variables

Coefficient of variation should not be used if the mean x is close zero

Jerzy Mycielski (CMT) Quantitative Methods of Decision Making 2008 29 / 208