simple and cross tabulation
TRANSCRIPT
-
8/13/2019 Simple and Cross Tabulation
Simple and Cross Tabulation
Abu Bashar
-
Data exploration
Graphical plots of the data: to get a first overview of the main characteristics of the data set, especially the distribution of the original variables across the whole sample and for sub-samples
Univariate descriptive statistics and one-way tabulation: to synthesize the main characteristics of each of the variables in the data set
Multivariate descriptive statistics and cross-tabulation: to get a first understanding of the relationships existing between different variables, enabling the joint examination of two or more variables
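As a minimal sketch of one-way tabulation, the snippet below counts how often each category of a single variable occurs and expresses each count as a percentage of the sample. The function name `one_way_table` and the household sizes are assumptions made up for illustration; they are not taken from the survey data.

```python
from collections import Counter

def one_way_table(values):
    """Return (category, count, percent) rows sorted by category."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(cat, n, 100.0 * n / total) for cat, n in sorted(counts.items())]

# Hypothetical household sizes, invented for illustration only.
household_sizes = [1, 2, 2, 3, 1, 2, 4, 2, 1, 3]

for category, count, percent in one_way_table(household_sizes):
    print(f"{category:>4} {count:>5} {percent:6.1f}%")
```

The same rows can feed a bar chart or pie chart of a qualitative or discrete variable.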
-
Graphs
Univariate plots of qualitative or discrete data
Univariate plots of quantitative data
Bivariate and multivariate plots of quantitative data
Bivariate and multivariate plots of quantitative versus qualitative data
-
Univariate qualitative or discrete data
[Bar chart: Number of sampled households by household size. x-axis: household size (1, 2, 3, 4, 5, 6, 9); y-axis: count (0 to 200)]
[Line chart: Sampled households by size. x-axis: household size (1, 2, 3, 4, 5, 6, 9); y-axis: count (0 to 200)]
[Pie chart: Sampled households by household size. Slices show counts: n=149, n=177, n=67, n=70, n=24, n=12, n=1]
-
Univariate continuous data (1)
[Histogram: Number of sampled households by household income. x-axis: anonymised household income + allowances; y-axis: count]
[Box-whiskers diagram: Weekly household income plus allowances. Annotated with the minimum, lower quartile, median, upper quartile and maximum]
[Error bar chart: Household income by quartile (low, medium-low, medium-high and high income). Error bars show mean +/- 1.0 SD]
-
Univariate continuous data (2)
[Pareto chart: Total Revenues by Income Quartile. Bars of anonymised household income + allowances by income band, in decreasing order: High income 170054.19; Medium-high income 78867.66; Medium-low income 45640.06; Low income 23488.32. Right axis: cumulative percent]
[Normal Q-Q plot of EFS: Total Food & non-alcoholic beverage. x-axis: observed value; y-axis: expected normal value]
Pareto charts
Bars ordered in decreasing order of the frequencies they represent
The line indicates the cumulative proportion
Useful for quality control (ANALYZE/QUALITY CONTROL in SPSS)
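The two ingredients of a Pareto chart, bars in decreasing order and a cumulative-proportion line, can be sketched as follows. The four totals are the values printed on the slide's Pareto chart; pairing them with the income bands in decreasing order is an assumption, and `pareto_table` is a hypothetical helper name.

```python
def pareto_table(totals):
    """Sort categories by decreasing value and add the cumulative share."""
    ordered = sorted(totals.items(), key=lambda item: item[1], reverse=True)
    grand_total = sum(value for _, value in ordered)
    rows, running = [], 0.0
    for category, value in ordered:
        running += value
        rows.append((category, value, running / grand_total))
    return rows

# Totals as printed on the slide; the category pairing is an assumption.
revenues = {
    "Low income": 23488.32,
    "Medium-low income": 45640.06,
    "Medium-high income": 78867.66,
    "High income": 170054.19,
}

pareto_rows = pareto_table(revenues)
for category, value, cumulative in pareto_rows:
    print(f"{category:<20} {value:>12.2f} {cumulative:7.1%}")
```

The second column gives the bar heights and the third column gives the points of the cumulative line.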
Q-Q plots
Compare the empirical (observed) data distribution with some theoretical distribution
When the observed distribution is close to the theoretical one, the plotted values tend to lie on a straight line.
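A rough sketch of how the points of a normal Q-Q plot could be computed: each sorted observation is paired with the value expected at the same rank under a normal distribution fitted to the sample. The plotting positions (i - 0.5) / n are one common convention and not necessarily the one SPSS uses; the helper names and both samples are invented for illustration.

```python
from statistics import NormalDist, mean, stdev

def normal_qq_points(sample):
    """Pair each sorted observation with its expected value under a fitted normal."""
    ordered = sorted(sample)
    n = len(ordered)
    fitted = NormalDist(mean(ordered), stdev(ordered))
    # Plotting positions (i - 0.5) / n: an assumed convention, others exist.
    return [(x, fitted.inv_cdf((i - 0.5) / n)) for i, x in enumerate(ordered, start=1)]

def pearson_r(points):
    """Correlation between observed and expected values (near 1 = close to a line)."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

# Hypothetical samples: one roughly symmetric, one strongly right-skewed.
symmetric = [38.2, 41.5, 45.0, 47.3, 49.9, 52.4, 55.1, 58.0, 61.7, 66.3]
skewed = [5, 6, 7, 8, 9, 10, 12, 15, 40, 120]

r_symmetric = pearson_r(normal_qq_points(symmetric))
r_skewed = pearson_r(normal_qq_points(skewed))
print(f"symmetric sample: r = {r_symmetric:.3f}")
print(f"skewed sample:    r = {r_skewed:.3f}")
```

The correlation is only a convenient summary of how straight the plotted points are; the plot itself remains the main diagnostic.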
-
Bivariate and multivariate plots
[Clustered bar chart: Average household expenditure for selected categories by income range. x-axis: anonymised household income + allowances (banded: low, medium-low, medium-high and high income); y-axis: mean (0.0 to 120.0); series: EFS: Total Food & non-alcoholic beverage; EFS: Total Clothing and Footwear; EFS: Total Recreation; EFS: Total Restaurants and Hotels]
[Multi-variable pie chart: Household expenditure by category, cases weighted by annual weight. Slice labels: 11.84%, 3.36%, 6.16%, 11.24%, 9.42%, 1.35%, 15.55%, 2.88%, 16.89%, 1.93%, 10.16%, 9.22%. Legend entries shown: EFS: Total Food & non-alcoholic beverage; EFS: Total Alcoholic Beverages, Tobacco; EFS: Total Clothing and Footwear; EFS: Total Housing, Water, Electricity; EFS: Total Furnishings, HH Equipment, Carpets; EFS: Total Health expenditure; EFS: Total Transport costs]
[Multi-variable line chart: Mean Weekly Household Expenditure by Category with Confidence Intervals. Value axis: 0.0 to 75.0; categories: EFS: Total Alcoholic Beverages, Tobacco; EFS: Total Clothing and Footwear; EFS: Total Communication; EFS: Total Education; EFS: Total Food & non-alcoholic beverage; EFS: Total Furnishings, HH Equipment, Carpets; EFS: Total Health expenditure; EFS: Total Housing, Water, Electricity; EFS: Total Recreation; EFS: Total Restaurants and Hotels]
[Scatterplot: Beer and sausage expenditure. x-axis: sausages (0.000 to 5.000); y-axis: beer and lager (brought home) (0.000 to 75.000)]
-
Bivariate and multivariate plots
[Clustered bar chart: Alcohol expenditure away from home, means by income quartile. x-axis: anonymised household income + allowances (banded: low, medium-low, medium-high and high income); y-axis: mean (0.000 to 8.000); series: Wine from grape or other fruit (away from home); Ciders and Perry (away from home); Beer and lager (away from home)]
[Stacked Pareto chart: Soft Drink and Fruit Juice Consumption. x-axis: number of children (0, 2, 1, 3, 4, 5, 6); left axis: count (0 to 1,400); right axis: cumulative percent; series: soft drinks, fruit juices; data labels shown: 381, 196, 116, 81, 11, 8, 281, 88, 44]
[Pareto chart: Total Weekly Expenditure for Selected Categories. Bars in decreasing order: EFS: Total Food & non-alcoholic beverage (22,725); EFS: Total Restaurants and Hotels (18,110); EFS: Total Alcoholic Beverages, Tobacco (6,335). Left axis: count (0 to 50,000); right axis: cumulative percent]
-
Descriptive statistics
Variables:
(1) In a typical week how much fresh or frozen chicken do you buy for your household consumption (Kg.)?
(2) In a typical week how much do you spend on fresh or frozen chicken (Euro)?
(3) Age
Statistic             (1)        (2)        (3)
N Valid               446        443        500
N Missing             54         57         0
Mean                  1.0582     5.6677     45.582
Std. Error of Mean    .06843     .19640     .7100
Median                .9100      5.0000     45.000
Mode                  1.00       3.00       45.0
Std. Deviation        1.44514    4.13383    15.8763
Variance              2.088      17.089     252.055
Minimum               .00        .00        18.0
Maximum               25.03      30.00      87.0
Percentile 25         .5000      3.0000     32.000
Percentile 50         .9100      5.0000     45.000
Percentile 75         1.3600     7.5000     57.000
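The statistics in this table can be reproduced with a few lines of code. The sketch below is a minimal version using only the Python standard library; `describe` is a hypothetical helper, the ages are invented, and the quartiles from `statistics.quantiles` may differ slightly from the percentile method SPSS applies. The last lines check one relationship on the slide's own figures: the standard error of the mean equals the standard deviation divided by the square root of the number of valid cases.

```python
import math
import statistics

def describe(values):
    """Summary statistics for the non-missing (not None) observations."""
    valid = [v for v in values if v is not None]
    sd = statistics.stdev(valid)  # sample standard deviation (n - 1 denominator)
    return {
        "valid": len(valid),
        "missing": len(values) - len(valid),
        "mean": statistics.mean(valid),
        "se_mean": sd / math.sqrt(len(valid)),
        "median": statistics.median(valid),
        "mode": statistics.mode(valid),
        "sd": sd,
        "variance": statistics.variance(valid),
        "min": min(valid),
        "max": max(valid),
        "quartiles": tuple(statistics.quantiles(valid, n=4)),
    }

# Hypothetical ages with two missing answers, invented for illustration.
ages = [23, 45, None, 31, 45, 67, 52, None, 38, 45]
summary = describe(ages)
for name, value in summary.items():
    print(f"{name:<10} {value}")

# Check on the slide's figures for the chicken quantity variable (Kg.):
# Std. Deviation 1.44514 with 446 valid cases.
se_chicken_kg = 1.44514 / math.sqrt(446)
print(f"SE of mean: {se_chicken_kg:.5f}")
```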
-
Cross-tabulation
Food & non-alcoholic beverage (Binned) * Anonymised hhold inc + allowances (Banded) Crosstabulation
Rows: Food & non-alcoholic beverage (Binned). Columns: Anonymised hhold inc + allowances (Banded), by income band.
                             Low          Medium-low   Medium-high  High         Total
20 or less      Count        47           19           18           4            88
                % of Total   9.4%         3.8%         3.6%         .8%          17.6%
From 20 to 40   Count        57           48           24           22           151
                % of Total   11.4%        9.6%         4.8%         4.4%         30.2%
From 40 to 60   Count        17           31           45           40           133
                % of Total   3.4%         6.2%         9.0%         8.0%         26.6%
More than 60    Count        4            27           38           59           128
                % of Total   .8%          5.4%         7.6%         11.8%        25.6%
Total           Count        125          125          125          125          500
                % of Total   25.0%        25.0%        25.0%        25.0%        100.0%
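To illustrate how such a cross-tabulation is built, the sketch below starts from one (food spending bin, income band) record per household, counts each combination, and derives the row totals, column totals and percentages of the grand total. The cell counts are the ones reported in the slide's crosstabulation; expanding them back into individual records and the helper name `crosstab` are assumptions made for the example.

```python
from collections import Counter

ROWS = ["20 or less", "From 20 to 40", "From 40 to 60", "More than 60"]
COLS = ["Low income", "Medium-low income", "Medium-high income", "High income"]

# Cell counts from the slide (rows: food spending bin, columns: income band).
COUNTS = [
    [47, 19, 18, 4],
    [57, 48, 24, 22],
    [17, 31, 45, 40],
    [4, 27, 38, 59],
]

# Expand the counts into one record per household, the shape raw survey data would have.
records = [
    (row_label, col_label)
    for row_label, row in zip(ROWS, COUNTS)
    for col_label, n in zip(COLS, row)
    for _ in range(n)
]

def crosstab(pairs):
    """Count each (row, column) pair and return the cells, both margins and the total."""
    cells = Counter(pairs)
    row_totals = Counter(r for r, _ in pairs)
    col_totals = Counter(c for _, c in pairs)
    return cells, row_totals, col_totals, len(pairs)

cells, row_totals, col_totals, grand_total = crosstab(records)
for r in ROWS:
    shares = [f"{100 * cells[(r, c)] / grand_total:5.1f}%" for c in COLS]
    print(f"{r:<14}", *shares, f"{100 * row_totals[r] / grand_total:5.1f}%")
```

Because every income band is a quartile, each column total is 125 households, or 25% of the 500 sampled.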
-
3-variable frequency table
Children, income and age of HRP
Table % by Number of children (Banded): No children, One child, Two children, More than two children, Total.
Empty cells are not shown, so each row lists its non-empty cell percentages in column order, followed by the row total.
Anonymised hhold inc + allowances (Banded)   Age of HRP - anonymised (Binned)   Cell percentages            Total
Low income                                   Less than 30 years                 1.4%  .8%  .2%              2.4%
                                             From 30 to 55 years                2.6%  1.0%  .4%  .2%        4.2%
                                             More than 55 years                 18.4%                       18.4%
Medium-low income                            Less than 30 years                 1.0%  .2%  .2%              1.4%
                                             From 30 to 55 years                4.8%  2.2%  2.6%  1.6%      11.2%
                                             More than 55 years                 12.0%  .4%                  12.4%
Medium-high income                           Less than 30 years                 .6%  .4%  .8%  .2%          2.0%
                                             From 30 to 55 years                7.6%  2.6%  3.8%  2.4%      16.4%
                                             More than 55 years                 6.4%  .2%                   6.6%
High income                                  Less than 30 years                 1.6%  .4%                   2.0%
                                             From 30 to 55 years                9.2%  2.6%  4.6%  1.6%      18.0%
                                             More than 55 years                 4.8%  .2%                   5.0%
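A three-variable frequency table extends the same counting idea to triples. The sketch below computes table percentages for each (income band, age band, children band) combination and then sums over the children bands to obtain the row totals. All records and the helper names are hypothetical and chosen only to keep the arithmetic easy to follow.

```python
from collections import Counter, defaultdict

def table_percentages(records):
    """Share of all records (in percent) for each (income, age, children) combination."""
    counts = Counter(records)
    total = len(records)
    return {combo: 100.0 * n / total for combo, n in counts.items()}

def row_totals(cell_percentages):
    """Sum the cell percentages over the children bands for each (income, age) row."""
    totals = defaultdict(float)
    for (income, age, _children), pct in cell_percentages.items():
        totals[(income, age)] += pct
    return dict(totals)

# Hypothetical households, invented for illustration only.
households = (
    [("Low income", "More than 55 years", "No children")] * 3
    + [("Low income", "From 30 to 55 years", "Two children")]
    + [("High income", "From 30 to 55 years", "No children")] * 2
    + [("High income", "From 30 to 55 years", "One child")]
    + [("High income", "Less than 30 years", "No children")]
)

cell_pct = table_percentages(households)
row_pct = row_totals(cell_pct)
for (income, age), pct in sorted(row_pct.items()):
    print(f"{income:<12} {age:<20} {pct:5.1f}%")
```

Combinations that never occur simply do not appear in the result, which corresponds to the empty cells of the table.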
-
Quantitative by categorical
Mean (Standard Deviation) by Age of HRP - anonymised (Binned)
                             Less than 30 years   From 30 to 55 years   More than 55 years
Books                        .589 (1.59)          2.701 (8.26)          1.112 (3.78)
Ice cream                    .000 (.00)           .067 (.39)            .010 (.11)
Internet subscription fees   .263 (.96)           .396 (1.80)           .139 (.84)
Cinemas                      1.240 (5.78)         .644 (3.37)           .107 (.84)
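A quantitative-by-categorical table of this kind reports the mean and standard deviation of a numeric variable within each category. A minimal sketch, assuming (group, value) records and the sample standard deviation; the helper name `mean_sd_by_group` and the spending figures are invented for illustration.

```python
from collections import defaultdict
from statistics import mean, stdev

def mean_sd_by_group(records):
    """Return {group: (mean, sample standard deviation)} for (group, value) records."""
    groups = defaultdict(list)
    for group, value in records:
        groups[group].append(value)
    # stdev needs at least two observations per group.
    return {group: (mean(values), stdev(values)) for group, values in groups.items()}

# Hypothetical weekly cinema spending by age band of the HRP.
cinema_spending = [
    ("Less than 30 years", 0.0),
    ("Less than 30 years", 4.0),
    ("Less than 30 years", 2.0),
    ("More than 55 years", 0.0),
    ("More than 55 years", 0.0),
    ("More than 55 years", 1.5),
]

by_age = mean_sd_by_group(cinema_spending)
for group, (group_mean, group_sd) in by_age.items():
    print(f"{group:<20} {group_mean:.3f} ({group_sd:.2f})")
```

A standard deviation well above the mean, as in several cells of the slide's table, is typical of spending variables where most households report zero and a few report large amounts.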