chapter 3: displaying and describing categorical data kennesaw state university math 1107
TRANSCRIPT
CHAPTER 3:Displaying and Describing
Categorical Data
KENNESAW STATE UNIVERSITY
MATH 1107
EXAMPLE: Titanic Data What kind of table is this?
ID Survival Age Sex Class
0001 Dead Adult Male Third0002 Dead Adult Male Crew0003 Dead Adult Male Third
. . . . .
. . . . .
. . . . .2200 Alive Adult Female First2201 Dead Adult Male Third
EXAMPLE: Frequency Table
EXAMPLE: Relative Frequency Table
EXAMPLE: Frequency Table
CLASS Frequency PercentCumulative Frequency
Cumulative Percent
First 325 14.77 325 14.77Second 285 12.95 610 27.71Third 706 32.08 1316 59.79Crew 885 40.21 2201 100
The Area Principle
EXAMPLE: Bar Chart
EXAMPLE: Pie Chart
EXAMPLE: Contingency Table
EXAMPLE: Joint Distribution of Survival & Class
EXAMPLE (1 of 3): Marginal Distribution of Survival
Survival Frequency
Alive 711Dead 1490
Total 2201
EXAMPLE (2 of 3): Marginal Distribution of Class
Class Frequency
First 325Second 285Third 706Crew 885
Total 2201
EXAMPLE (3 of 3): Marginal Distribution of Class
First Second Third Crew Total
325 285 706 885 2201
Class
Conditional Distribution of Class | Survival = ‘Alive’
Graphically Displaying Conditional Distributions: Pie Charts
Graphically Displaying Conditional Distributions: Segment Bar Charts
EXAMPLE: Heart Disease DataAre these variables independent?
Yes No Total
Males 17 64 81Females 7 56 63
Total 24 120 144
Diagnosis
Gen
der
EXAMPLE: Heart Disease DataAre these variables independent?
Yes No Total
Males 21.0% 79.0% 56.3%Females 11.1% 88.9% 43.8%
Total 16.7% 83.3% 100.0%
Diagnosis
Gen
der
Class Activity:Just Checking (p. 27)
Blue Brown Green/Hazel/Other Total
Males 6 20 6 32Females 4 16 12 32
Total 10 36 18 64
Eye Color
Gen
der
Class Activity:In Preparation for HW7:
Consider the following situation:– The Centers for Disease Control estimates the
frequency of the top 5 causes of death in the United States during 1999. Of a sample of 5000, 1515 died of heart disease, 1150 of cancer, 420 of circulatory disease and stroke, 325 of respiratory disease, and 205 of accidents. Find the relative frequency distribution of the causes of death and write a sentence describing it.
Class Activity:In Preparation for HW7:
1) Construct a frequency table:
Cause of Death Frequency
Heart Disease 1515Cancer 1150Circulatory 420Respiratory 395Accidents 205
Total 5000
Class Activity:In Preparation for HW7:
2) Construct a relative frequency table:
Cause of Death Proportion Percent
Heart Disease 1515/5000 = 0.303 *100 = 30.3Cancer 1150/5000 = 0.230 *100 = 23Circulatory 420/5000 = 0.084 *100 = 8.4Respiratory 395/5000 = 0.079 *100 = 7.9Accidents 205/5000 = 0.041 *100 = 4.1
Total 5000 1.000 *100 = 100.0
Class Activity:In Preparation for HW7:
3) Display the final relative frequency table:
Cause of Death Percent
Heart Disease 30.3Cancer 23Circulatory 8.4Respiratory 7.9Accidents 4.1
Total 100.0
Class Activity:In Preparation for HW7:
3) Write a sentence describing the distribution:
– Of the sample of 5,000 people, 30.3% died of heart disease, 23% of cancer, 8.4% of circulatory diseases and stroke, 7.9% of respiratory diseases, and 4.1% of accidents in 1999.