quantitative data (graphical)

Post on 06-Feb-2016

116 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

Quantitative Data (Graphical). Quantitative Data (Graphical). This is numerical data We may describe quantitative data using the same methods as qualitative by breaking our numerical data into classes. That is 20-30, 30-40, 40-50, 50-60. Quantitative Data (Graphical). This is numerical data - PowerPoint PPT Presentation

TRANSCRIPT

1

Quantitative Data (Graphical)

2

Quantitative Data (Graphical)• This is numerical data

• We may describe quantitative data using the same methods as qualitative by breaking our numerical data into classes. That is 20-30, 30-40, 40-50, 50-60.

3

Quantitative Data (Graphical)• This is numerical data

• We may describe quantitative data using the same methods as qualitative by breaking our numerical data into classes. That is 20-30, 30-40, 40-50, 50-60.

• Histograms, stem and leaf plots and dot plots are other common methods of displaying quantitative data.

4

Histograms

• A histogram is a bar graph where you use intervals for your data class.

• The following histogram summarizes the NBA payroll. You should note that the are adjacent to one another.

5

NBA Payroll

teams

ofNumber

dollars of millionsin Payroll

6

Stem and Leaf, and Dot Plots

• Notice in the histogram on the previous page we lose some information. That is we don’t know exactly what each team is paying in salary just how many are paying in the range of 1.885 million dollars.

7

Stem and Leaf, and Dot Plots • Notice in the histogram on the previous page we

lose some information. That is we don’t know exactly what each team is paying in salary just how many are paying in the range of 1.885 million dollars.

• A stem and leaf plot is a graphical device which uses numbers so that no information is lost.

8

Stem and Leaf, and Dot Plots

• A stem and leaf plot is a graphical device which uses numbers so that no information is lost.

• The technique separates each data point into two numbers, the stem (the leading digit) and the leaves.

9

Stem and Leaf, and Dot Plots • The technique separates each data point into two

numbers, the stem (the leading digit) and the leaves.

• In a dot plot we start with a number line of all possible values for the data. Each data point is represented with a dot above the appropriate number. If a number appears more than once in your data you build a tower of dots above that point.

10

Example • Here is a list of exam scores:

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

Construct a histogram (with interval size 10 starting at 24), a stem and leaf diagram and a dot plot .

11

Histogram of Exam Scores

Frequency

12

Stem and Leaf Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

13

Stem and Leaf Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

3

4

5

6

7

8

9

10

14

Stem and Leaf Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

3

4

5

6

7

8 8

9

10

15

Stem and Leaf Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

3

4

5

6

7

8 8 2

9

10

16

Stem and Leaf Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

3

4

5

6

7

8 8 2 9

9

10

17

Stem and Leaf Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

3

4

5

6

7 0

8 8 2 9

9

10

18

Stem and Leaf Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

3

4

5

6

7 0

8 2 8 9

9

10

19

Stem and Leaf Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

3 4 9

4

5

6 3 4 7

7 0 5 6

8 1 2 4 5 6 8 9 9

9 0 6 6

10 0

20

Dot Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

21

Dot Plot of Exam Scores

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

30 40 50 60 70 80 90 100

22

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

Dot Plot of Exam Scores

30 40 50 60 70 80 90 100

23

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

Dot Plot of Exam Scores

30 40 50 60 70 80 90 100

24

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

Dot Plot of Exam Scores

30 40 50 60 70 80 90 100

25

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

Dot Plot of Exam Scores

30 40 50 60 70 80 90 100

26

88, 82, 89, 70, 85, 63, 100, 86, 67, 39, 90, 96, 76, 34, 81, 64, 75, 84, 89, 96

Dot Plot of Exam Scores

30 40 50 60 70 80 90 100

Quantitative (in contrast to graphical) methods

• Measures of central tendency• Mean

• Median

• Mode

• Measures of dispersion• Range

• Standard deviation

• Mode

• Median Stndard deviatin

27

28

Summation Notation

Here is a typical (small) data set:

2 7 1 32

29

Summation Notation

Here is a typical (small) data set:

2 7 1 32

So we can talk about a general data set we let:,21 x ,72 x ,13 x ,34 x 25 x

30

Summation Notation

So we can talk about a general data set we let:

In general for a sample of n points of data we call them, in order:

,21 x ,72 x ,13 x ,34 x 25 x

nxxxxx ,...,,,, 4321

31

Summation NotationIn general for a sample of n points of data we call them, in order:

When we wish to sum (add them up), we use the notation:

This is called summation notation.

nxxxxx ,...,,,, 4321

n

n

ii xxxxxx

...43211

32

Summation NotationIn statistics, sometimes the i is not included in the sum since it is implied that we are summing over all points in our data set. That is you may see the following:

n

iixx

1

n

iixx

1

22

33

Descriptive Statistics

• Qualitative Variables– Graphical Methods

• Quantitative Variables– Graphical Methods– Numerical Methods

34

Numerical descriptive measures

Two types of measures we look for:

1) Ones which tell us about the central tendency of measurements

2) Ones which tell us about the variability or spread of the data.

35

Numerical Measures of Central Tendency

Three Measures

a) Mean

b) Median

c) Mode

Problem

36

Mean

The mean of a data set is the average or expected value of the readings in the data.

Problem: I wish to talk about the mean of the population and the mean of the sample separately. Therefore we need to introduce two different notations.

37

Mean

Sample: the size of the sample is usually denoted with n, and the mean of the sample (sample mean) is denoted with

Population: the size of the population is usually denoted N and the population mean is denoted µ.

.x

38

Mean

The mean is given by

n

xx

n

xx

n

ii

OR 1

39

Example

4 ,1 ,4 ,4 ,2 ,6 ,1 ,3Given the sample:

Find the mean.

40

Example

4 ,1 ,4 ,4 ,2 ,6 ,1 ,3Given the sample:

Find the mean.

n

xx

41

Example

4 ,1 ,4 ,4 ,2 ,6 ,1 ,3Given the sample:

Find the mean.

8

41442613

n

xx

42

Example

4 ,1 ,4 ,4 ,2 ,6 ,1 ,3Given the sample:

Find the mean.

8

41442613

n

xx

125.3

43

However, given the sample:

we find the mean is quite different from 3.125.

Example

40 ,1 ,4 ,4 ,2 ,6 ,1 ,3

44

However, given the sample:

we find the mean is quite different from 3.125.

This is not a good indication of the center of the sample.

Example

40 ,1 ,4 ,4 ,2 ,6 ,1 ,3

8

401442613

n

xx

625.7

45

Mean

Usually the sample mean is used to estimate the population mean µ.

The accuracy of this estimate tends to be effected by:

– The size of the sample

– Variability or spread of the data

x

46

Median

The median of a quantitative data set is the middle number in the set.

For example in the following data the median is 10.

10000 ,41 ,21 ,01 ,4 ,4 ,1

47

Median

The sample median is denoted M.

If n is even, take the average of the two middle numbers.

48

ExamplesFind the median in the following two data sets:

4 ,1 ,4 ,4 ,2 ,6 ,1 ,3 a)

40 ,1 ,4 ,4 ,2 ,6 ,1 ,3 b)

49

ExamplesFind the median in the following two data sets:

In both cases we found M=3.5.

The median is sometimes a better estimate of the population mean µ than the sample mean because it puts less emphasis on outliers.

4 ,1 ,4 ,4 ,2 ,6 ,1 ,3 a)

40 ,1 ,4 ,4 ,2 ,6 ,1 ,3 b)

x

50

What the median and mean tell you

A data set is skewed if one tail of the distribution has more extreme observations than the other.

http://www.shodor.org/interactivate/activities/SkewDistribution/

51

What the median and mean tell you

This data set is skewed to the right. Notice the mean is to the right of the median.

xM

52

What the median and mean tell you

Skewed to the right: The mean is bigger than the median.

xM

53

What the median and mean tell you

This data set is skewed to the left. Notice the mean is to the left of the median.

x M

54

What the median and mean tell you

Skewed to the left: The mean is less than the median.

x M

55

What the median and mean tell you

When the mean and median are equal, the data is symmetric

Mx

56

Mode

The mode is the measurement which occurs most frequently

4 ,1 ,4 ,4 ,2 ,6 ,1 ,3 a)

40 ,1 ,4 ,4 ,2 ,6 ,1 ,3 b)

57

Mode

The mode is the measurement which occurs most frequently

a) mode= 4

b) mode= 4, 1

4 ,1 ,4 ,4 ,2 ,6 ,1 ,3 a)

40 ,1 ,4 ,4 ,2 ,6 ,1 ,3 b)

58

Mode

When dealing with histograms or qualitative data, the measurement with the highest frequency is called the modal class.

top related