Download - Chapter 1 Section 1.1
![Page 1: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/1.jpg)
Displaying Distributions with Graphs
Chapter 1Section 1.1
![Page 2: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/2.jpg)
the science of collecting, analyzing, and drawing conclusions from data
Statistics
![Page 3: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/3.jpg)
Descriptive
Inferential
Main Types of Statistics
![Page 4: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/4.jpg)
the methods of organizing & summarizing data
Descriptive Statistics
![Page 5: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/5.jpg)
involves making generalizations from a sample to a population
Inferential Statistics
![Page 6: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/6.jpg)
the entire collection of individuals or objects about which information is desired
Population
![Page 7: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/7.jpg)
a subset of the population, selected for study in some prescribed manner
Sample
![Page 8: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/8.jpg)
any characteristic whose value may change from one individual to another
Variable
![Page 9: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/9.jpg)
observations on single variable or simultaneously on two or more variables
Data
![Page 10: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/10.jpg)
Who?
What?
Why?
When you see a set of data, ask:
![Page 11: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/11.jpg)
What individuals do the data describe?
How many individuals appear in the data?
Who?
![Page 12: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/12.jpg)
How many variables?
What are the definitions of these variables?
What units?
What?
![Page 13: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/13.jpg)
What is the reason data were gathered?
Why?
![Page 14: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/14.jpg)
Types of Variables
![Page 15: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/15.jpg)
Also called “qualitative”
Identify basic differentiating characteristics of the population
Categorical Variables
![Page 16: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/16.jpg)
Also called “numerical”Observations or measurements that take on numerical values
Makes sense to average these values
Two types-discrete & continuous
Quantitative Variables
![Page 17: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/17.jpg)
Listable set of values
Usually counts of items
Discrete (numerical)
![Page 18: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/18.jpg)
Data can take on any value in the domain of the variable
Usually measurements of something
Continuous (numerical)
![Page 19: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/19.jpg)
Univariate-data that describes a single characteristic of the population
Bivariate-data that describes two characteristics of the population
Multivariate-data that describes more than two characteristics
Classifications by the number of variables
![Page 20: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/20.jpg)
Which category of variables would the following be?
GenderAgeHair colorSmokerSystolic blood pressureNumber of girls in class
Categorical or Quantitative
![Page 21: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/21.jpg)
Types of Distributions4 Common Types
![Page 22: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/22.jpg)
Tells us what values the variable takes and how often it takes these values
One variable may take values that are very close together while others might be spread out
Distribution
![Page 23: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/23.jpg)
Refers to data in which both sides are (more or less) the same when the graph is folded vertically down the middle
Bell-shaped is a special typeHas a center mound with sloping tails
Symmetrical
![Page 24: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/24.jpg)
Refers to data in which every class has equal or approximately equal frequency
Uniform
![Page 25: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/25.jpg)
Refers to data in which one side (tail) is longer than the other side
The direction of the skewness is on the side of the longer tail
Skewed (left or right)
Skewed right Skewed left
![Page 26: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/26.jpg)
Refers to data in which two or more classes have the largest frequency and are separated by at least one other class
Bimodal (multi-modal)
![Page 27: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/27.jpg)
How to describe a graph
![Page 28: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/28.jpg)
Normal, SymmetricalSkewedUniformBimodal
Shape of Graphs
![Page 29: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/29.jpg)
Where the middle of the data falls
3 types of central tendencyMean, median, mode
Center of Graphs
![Page 30: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/30.jpg)
Shows how spread out the data is
Refers to the variability of the data
Range, standard deviation, IQR
Spread
![Page 31: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/31.jpg)
Outliers-value that lies away from the rest of the data
Clusters
Gaps
Unusual Occurrences
![Page 32: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/32.jpg)
1. Bar2. Pie Chart3. Dotplot4. Stem-and-Leaf5. Histogram6. Relative cumulative frequency graph7. Time Plot8. Box & Whisker9. Scatter
Types of Graphs
![Page 33: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/33.jpg)
Must label the axes and title the graph
Scale your axesDraw vertical bar above each category name to a height that corresponds to the count in that category
1. Bar Graphs
![Page 34: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/34.jpg)
Bar Graph (example)The graph below shows the proportion of the female labor force aged 25 and older in the United States that falls into various educational categories. The coding used in the plot is as follows:
1. none-8th Grade 6. bachelor’s degree2. 9th grade-11th grade 7. master’s degree3. high school graduated 8. professional degree4. some college, no degree 9. doctorate degree5. associate degree
Proportion
Educational Attainment (women)
![Page 35: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/35.jpg)
Must include all categories that make up a whole.
2. Pie Chart
![Page 36: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/36.jpg)
3. Dotplot
0 100 200 300 400
Age (months)
Tally marks are made for each set of data.
The data below is graphed on the dotplot on the right.
195 194204 199204 204192 204192 192193 214209 222209
![Page 37: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/37.jpg)
A stem and leaf plot displays data like a bar graph, but we break the data apart into stem and leaf plots.
The data point 30 is broken up into 3 0
stem leaf
The data point for 304 is broken up into 30 4
stem leaf
4. Stem and Leaf Plot
![Page 38: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/38.jpg)
The data set {30, 27, 34, 28, 45, 31, 34, 40, 29}
is represented by:
2: 7 8 93: 0 1 4 44: 0 5
The 2 and 9 represents the data point 29. The stems are the tens place and the leaves are the ones digits. Notice numbers are listed in order from smallest to largest.
Stem and Leaf Plot
![Page 39: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/39.jpg)
Draw a bar graph that represents the count in each class. Leave no horizontal space (unlike a bar graph).
The data below is graphed on the dotplot on the next slide.
195 194 204 199 204 204 192204 192 192 193 214 209 222 209
5. Histogram
![Page 40: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/40.jpg)
Histogram (example)
192 198 204 210 216 222
0
1
2
3
4
5
15 Students
Freq
uenc
y
Age (Months)
![Page 41: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/41.jpg)
Also called an Ogive.
A relative frequency histogram has the same shape and the same horizontal scale as the corresponding frequency histogram. The difference is that the vertical scale measures the relative frequencies (measured as a percentage), not frequencies.
6. Relative Cumulative Frequency Graph
![Page 42: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/42.jpg)
Relative Cumulative Frequency Graph
192 198 204 210 216 222
0
1
2
3
4
5
15 Students
Freq
uenc
yAge (Months)
The frequency needs to be changed to percentages.
![Page 43: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/43.jpg)
Place the time on the x axis. Time is the explanatory variable.
Place the observations on the y axis. The observations represent the response variable.
7. Time Plot
![Page 44: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/44.jpg)
Time Plot (example)
![Page 45: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/45.jpg)
These will be discussed later.Box and Whisker-Section 1.2Scatter Plot-Section 3.1
8. Box & Whisker/9. Scatter Plot
![Page 46: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/46.jpg)
The distribution of a variable tells us what values it takes and how often it takes these values.
To describe a distribution, begin with a graph. Bar graphs and pie charts display the
distributions of categorical variables. These graphs use the counts or percents of the categories.
Stem & Leaf plots and histograms display the distributions of qualitative variables. Stem & Leaf Plots separate each observation into a stem and a one-digit leaf. Histograms plot the frequencies (counts) or percents of equal-width classes of values.
Summary
![Page 47: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/47.jpg)
When examining a distribution, look for the shape, center and spread, and for clear deviations from the overall shape. Some distributions have simple shapes, such as symmetric or skewed. Others may be bimodal (more than one major peak).
Outliers are observations that lie outside the overall pattern of a distribution. Always look for outliers and try to explain them.
Summary continued…
![Page 48: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/48.jpg)
A relative frequency graph (ogive) is a good way to see the relative standing of an observation.
When observations on a variable are taken over time, make a time plot that graphs horizontally and the values of the variable vertically. A time plot can reveal trends (patterns) or other changes over time.
Summary continued
![Page 49: Chapter 1 Section 1.1](https://reader035.vdocuments.mx/reader035/viewer/2022062222/568165f8550346895dd92293/html5/thumbnails/49.jpg)
On page 64 in your textbook, complete exercises 1.13 and 1.16.
Assignment