organizing and presenting data

20
Organizing and Presenting Data GTECH 201 Session 11

Upload: teagan-moss

Post on 30-Dec-2015

21 views

Category:

Documents


0 download

DESCRIPTION

Organizing and Presenting Data. GTECH 201 Session 11. Terminology. Classes Categories for grouping data Frequency Number of observations that fall in a class (frequency is a count) Frequency Distribution A listing of all classes along with their frequencies Relative Frequency - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Organizing  and Presenting Data

Organizing and Presenting Data

GTECH 201Session 11

Page 2: Organizing  and Presenting Data

Terminology Classes

Categories for grouping data Frequency

Number of observations that fall in a class (frequency is a count) Frequency Distribution

A listing of all classes along with their frequencies Relative Frequency

The ratio of the frequency of a class to the total number of observations

Relative Frequency Distribution A listing of all classes along with their relative frequencies

Width/Class Interval The difference between the upper and lower cut points (breaks) of a

class

Page 3: Organizing  and Presenting Data

Organizing Data Classification Rules

Aim is to create categories or classes First step is to compute range Range = Largest Value – Smallest Value

Interval or Ratio Scale data only

Class Intervals Width of Class Interval

Equal based on range Unequal based on range Quantile (Quartile or Quintile) Natural

Page 4: Organizing  and Presenting Data

Classification Methods

Natural breaks

Equal interval

Quantile Manual

Page 5: Organizing  and Presenting Data

How to Decide(on a classification

scheme)

Rule of thumb: 3 - 7 classes Classification histogram (see later

today)

Page 6: Organizing  and Presenting Data

How to Decide, part IIClassification

methodWhen to use

How many classes to have

Natural breaks When attributes are distributed unevenly across the overall range of values

Look for natural groups

Equal interval When you want all classes to have the same range

Easily understood interval, such as 2, 50, 1000, etc.

Quantile When attributes are distributed in a linear fashion

Determined by purpose of the map

Manual When you want classes to break at specific values

Page 7: Organizing  and Presenting Data

0

50

100

150

200

250

1 4 7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52

Graphs

Line graph

Bar graph

Scatterplots0.0

20.0

40.0

60.0

80.0

100.0

0 50 100 150 200 250

Page 8: Organizing  and Presenting Data

Creating a Line Graph

Year Population Growth1960 15001970 50001980 146701990 189232000 24000

The growth of the population of students at a Midwestern university is as follows

Page 9: Organizing  and Presenting Data

Line GraphPopulation Growth

0

5000

10000

15000

20000

25000

30000

1 2 3 4 5

Population Growth

Page 10: Organizing  and Presenting Data

Bar Graphs

All fields 29.5Biological Sciences 26Eduation 44.2Engineering 3.8Physical Sciences 12.5Psychology 41.7

Here are data on the percent of females among people earning doctoral degrees in 1990, in several different fields of study

Page 11: Organizing  and Presenting Data

Bar Graph

05

1015

2025

3035

4045

50

All fields BiologicalSciences

Eduation Engineering PhysicalSciences

Psychology

Page 12: Organizing  and Presenting Data

Scatter Plots Graph bi-variate data when both

variables are measured in an interval/ratio or ordinal scale

Units for one variable are marked on the horizontal axis

Independent variable should always go on the horizontal, x axis

Page 13: Organizing  and Presenting Data

ScatterplotsCalories in Common Foods

-50

0

50

100

150

200

250

300

350

400

450

0 2 4 6 8 10 12

Survey of 3368 people asking them to estimate number of calories in common foods.

Page 14: Organizing  and Presenting Data

Example

0 3 0 0 3 02 2 00 1 2 10 0 1 2 4 04 2 1 0 1 00 2 0 1 3 2

A city planner collected data on the number of school age children in each of 30 families.

Construct a grouped data table using classes based on a single value

Page 15: Organizing  and Presenting Data

Computing Frequency

Number of Children

Frequency Relative Frequency

0 12 0.4001 6 0.2002 7 0.2333 3 0.1004 2 0.067

30 1.000

There are three ways you can create classes

a < but not equal to bb < but not equal to

c a – b, c – d, e - f single value

grouping

Page 16: Organizing  and Presenting Data

Distributions

Histograms Difference between histograms and bar

graphs Bars in a histogram are always vertical Base scale is marked off in equal units;

there is no base scale in a bar graph Width of bars in a histogram have meaning Bars in a histogram touch each other

Page 17: Organizing  and Presenting Data

Constructing a HistogramC lass Intervals Frequency

Relative Frequency

Midpoint

30 -39 3 0.075 3540-49 1 0.025 4550-59 8 0.2 5560-69 10 0.25 6570-79 7 0.175 7580-89 7 0.175 8590-99 4 0.1 95

40 1

Histogram – height of bar equal to frequency of class represented

Bar extends from lowest value to highest value of the class

Page 18: Organizing  and Presenting Data

Histogram Chart

0

5

10

15

20

90

10

9

12

9

14

8

16

8

18

7

20

7

Mo

re

Fre

qu

en

cy

0%

50%

100%

150%

Frequency

Cumulative %

Page 19: Organizing  and Presenting Data

Frequency Polygons Similar to a histogram

Midpoint of the class is indicated

Points connected by straight lines

Cumulative frequency polygon, ogive

Page 20: Organizing  and Presenting Data

1 6

11

16

21

26

31

36

41

46

51

S1

0

50

100

150

200

250