univariate eda. quantitative univariate edaslide #2 exploratory data analysis univariate eda –...

14
Univariate EDA

Upload: junior-king

Post on 06-Jan-2018

234 views

Category:

Documents


4 download

DESCRIPTION

Categorical Univariate EDASlide #3 Quantitative vs. Categorical Do NOT describe shape, center, dispersion, or outliers with CATEGORICAL data. Identify the most outstanding characteristics.

TRANSCRIPT

Page 1: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Univariate EDA

Page 2: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Quantitative Univariate EDA Slide #2

Exploratory Data Analysis• Univariate EDA – Describe the distribution

– Distribution is concerned with what values a variable takes and how often it takes each value

• Univariate EDA (for quantitative data)– Graphically– Numerically– Model

Page 3: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA Slide #3

Quantitative vs. Categorical• Do NOT describe shape, center, dispersion, or

outliers with CATEGORICAL data.

• Identify the most outstanding characteristics.

Page 4: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA Slide #4

Numerical Summaries

RedBlondeBlondeBrunetteBlondeRedBlondeRed

Hair Color Freq

Blonde

Brunette

Red

• Frequency Table

Hair Color Perc

Blonde

BrunetteRed

• Percentages Table

4

13

50.012.5

37.5

Page 5: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA Slide #5

Graphical Summaries• Bar chart

– Bars over category label– Height is frequency of individuals in that category

blonde brunette redHair Color

Freq

uenc

y0

12

34

Hair Color Freq

Blonde 4

Brunette 1

Red 3

Page 6: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA Slide #6

• Bar chart• Pie chart

– Circle with pieces proportional to category frequencies

Graphical Summaries

blonde

brunette

red

Hair Color Freq

Blonde 4

Brunette 1

Red 3

Page 7: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA

no, No, NO!!!

Slide #7

Page 8: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA

no, No, NO!!!

Slide #8

Page 9: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA

no, No, NO!!!

Slide #9

Page 10: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA

no, No, NO!!!

Slide #10

Page 11: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Categorical Univariate EDA Slide #11

Overall Summary• Identify most outstanding characteristic(s)

• Most student were blondes and very few were brunettes.

Hair Color Freq

Blonde 4

Brunette 1

Red 3blonde brunette red

Hair Color

Freq

uenc

y0

12

34

Page 12: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Quantitative Univariate EDA

• Describe a univariate EDA for the data in Figure 4.Slide #12

Figure 4. Bar chart of the number of KNOWN species by organism type.

Page 13: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Quantitative Univariate EDA

• Describe a univariate EDA for the data in Figure 5.Slide #13

Figure 5. Bar chart of the types of organizations that received funding by the Invasive Alien Species Partnership Program (Canada), 2005-2010.

Page 14: Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with

Quantitative Univariate EDA

Categorical Univariate EDA in R• Examine Handout

– xtabs()– percTable()– barplot()

Slide #14