displaying & describing categorical data chapter 3

21
Displaying & Describing Categorical Data Chapter 3

Upload: talon-laws

Post on 14-Dec-2015

257 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Displaying & Describing Categorical Data Chapter 3

Displaying & Describing Categorical

DataChapter 3

Page 2: Displaying & Describing Categorical Data Chapter 3

Objective

Look at different types of data and checking a condition before plugging ahead.

Writing clear explanations in context.Independence and the importance to

statistics.Simpson’s Paradox: think deeply or be

mislead!!

Page 3: Displaying & Describing Categorical Data Chapter 3

Frequency Tables

Number 1 Rule of Data Analysis? Draw a picture!!!

Frequency Table: records totals and categories names!!

Page 4: Displaying & Describing Categorical Data Chapter 3

Frequency Tables

Counts are useful but sometimes we want to know the fraction or proportion of data in each category.

These counts are expressed as percentages.

Relative Frequency Table: displays percentages rather than counts

Page 5: Displaying & Describing Categorical Data Chapter 3

Frequency Tables

Both these types of tables describe the distribution across categories.

Page 6: Displaying & Describing Categorical Data Chapter 3

Bar Charts

A bar chart displays the distribution of a categorical variable, showing the counts for each category next to each other for easy comparison.

Page 7: Displaying & Describing Categorical Data Chapter 3

Bar Charts

If we are interested in the relative proportion we could replace the counts with percentages and use a relative frequency bar chart.

Page 8: Displaying & Describing Categorical Data Chapter 3

Who Owns an MP3 Player?

Portable MP3 music players, such as the Apple iPod, are popular-but not equally popular with people of all ages. Here are the percents of people in various groups who own a portable MP3 player, according to an Arbitron survey of 1112 randomly selected people.

Page 9: Displaying & Describing Categorical Data Chapter 3

Area Principle

The area principle says that the area occupied by a part of the graph should correspond to the magnitude of the value it represents.

Violations of this principle are a common way to lie with statistics.

Page 10: Displaying & Describing Categorical Data Chapter 3

Who Buys iMacs?

Page 11: Displaying & Describing Categorical Data Chapter 3

Pie Charts

Pie Charts show the whole group of cases as a circle . They slice the circle into pieces whose sizes are proportional to the fraction of the whole in each category.

Page 12: Displaying & Describing Categorical Data Chapter 3

What should you pick!!

Think before you draw!!Always check the categorical condition.If you want to make a relative frequency

bar chart or a pie chart, make sure the categories do not overlap.

If they overlap you can still make the chart but your percentages will not add up to 100.

Page 13: Displaying & Describing Categorical Data Chapter 3

Contingency Tables

To look at two categorical variables together, we often arrange the counts in a two-way table.

A contingency table displays counts and, sometimes percentages of individuals falling into named categories on two or more variables. The table categorizes the individuals on all variables at once to reveal possible patterns in one variable that may be contingent on the category of the other.

Page 14: Displaying & Describing Categorical Data Chapter 3

Contingency Tables

In a contingency table, the distribution of either variable alone is called the marginal distribution. The counts or percentages are the totals found in the margins (last row or column) of the table.

Page 15: Displaying & Describing Categorical Data Chapter 3

Marginal Distribution

In a contingency table, the distribution of either variable alone is called the marginal distribution. The counts or percentages are the totals found in the margins (last row or column) of the table.

Page 16: Displaying & Describing Categorical Data Chapter 3

Example

A survey of 4826 randomly selected young adults (aged 19-25) asked, “What do you think are the chances you will have much more than a middle-class income at age 30?” The table below shows the responses, omitting a few people who refused to respond or who said they were already rich.

Page 17: Displaying & Describing Categorical Data Chapter 3

Example

Page 18: Displaying & Describing Categorical Data Chapter 3

Conditional Distributions

The distribution of a variable restricting the “Who” to consider only a smaller group of individuals is called a conditional distribution.

In a contingency table, when the distribution of one variable is the same for all categories of another, we say that the variables are independent.

Page 19: Displaying & Describing Categorical Data Chapter 3

Segmented Bar Charts

Segmented Bar Charts treat each bar as the “whole” and divides it proportionally into segments corresponding to the percentage in each group.

Page 20: Displaying & Describing Categorical Data Chapter 3

Simpson’s Paradox

When averages are taken across different groups, they can appear to contradict the overall averages.

Page 21: Displaying & Describing Categorical Data Chapter 3

Example

It’s the last inning of the important game. Your team is a run down with bases loaded and two outs. The pitcher is due up, so you’ll be sending in a pinch hitter. There are 2 batters available on the bench. Whom should you send to bat?

Player Overall Vs LHP Vs RHP

Wade 33 for 103 28 for 81 5 for 22

Lee 45 for 151 12 for 32 33 for 119