![Page 1: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/1.jpg)
BASIC STATISTICAL TOOLS
![Page 2: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/2.jpg)
What is Statistics
Statistics refers to the collection, presentation, analysis, and utilization of numerical data
to make inferences and reach decisions in the face of uncertainty in economics, business, and other social and physical sciences.
Statistics is subdivided into descriptive and inferential.
![Page 3: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/3.jpg)
Descriptive statistics
Descriptive statistics : Methods of organizing, summarizing, and presenting data in an informative way.
EXAMPLE : According to Consumer Reports, Whirlpool washing machine owners reported 9 problems per 100 machines during 1995. The statistic 9 describes the number of problems out of every 100 machines.
![Page 4: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/4.jpg)
Inferential statistics
Inferential statistics is the process of reaching generalizations about the whole (called the population) by examining a portion (called the sample).
A population is a collection of all possible individuals, objects, or measurements of interest.
A sample is a portion, or part, of the population of interest.
![Page 5: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/5.jpg)
EXAMPLE
Suppose that we have data on the incomes of 1000 U.S. families. This body of data can be summarized by finding the average family income and the spread of these family incomes above and below the average. The data also can be described by constructing a table, chart, or graph of the number or proportion of families in each income class. This is descriptive statistics.
If these 1000 families are representative of all U.S. families, we can then estimate and test hypotheses about the average family income in the United States as a whole. This is statistical inference.
![Page 6: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/6.jpg)
TYPES OF DATA
There are three types of data that are generally available for empirical analysis.
1. Time series2. Cross-sectional3. Pooled (A combination of time series and
cross-sectional)
![Page 7: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/7.jpg)
TIME SERIES DATA
Collected over a period of time, such as the data on:
GDP, employment, unemployment, money supply, government deficit.
Such data may be collected at regular intervals:Daily (e.g. Stock prices)Weekly (e.g. Money supply)Monthly (e.g. Unemployment rate)Quarterly (e.g. GDP)Annually (e.g. Government budget)This is called the frequency of the data.
![Page 8: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/8.jpg)
TIME SERIES DATA
These data may be quantitative in nature (e.g. Prices, income, money supply)
Or qualitative in nature (e.g. Male or female, employed or unemployed, married or unmarried, white or black)
Qualitative variables are also called dummy or categorical variables.
![Page 9: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/9.jpg)
CROSS-SECTIONAL DATA
These are data on one or more variables collected at one point in time
For example GDP of European Union Countries in 2010.
Government budget deficit of BRIC countries.
![Page 10: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/10.jpg)
POOLED DATA
In Pooled data we have elements of both time series and cross-sectional data.
For example Unemployment rate for 10 countries for a
period of 20 years. (Pooled data)Data on the unemployment rate for each
country for the 20 year period (Time series)Data on the unemployment rate for the 10
countries for any single year (Cross-sectional)
![Page 11: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/11.jpg)
Frequency Distribution
Frequency distribution: A grouping of data into categories showing the number of observations in each category.
The number of classes is usually between 5 and 15.
2-2
![Page 12: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/12.jpg)
Frequency Distribution
Class mark (midpoint): A point that divides a class into two equal parts. This is the average between the upper and lower class limits.
Class interval: For a frequency distribution having classes of the same size, the class interval is obtained by subtracting the lower limit of a class from the lower limit of the next class.
2-4
![Page 13: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/13.jpg)
Frequency Distribution
Class mark (midpoint): A point that divides a class into two equal parts. This is the average between the upper and lower class limits.
Class interval: For a frequency distribution having classes of the same size, the class interval is obtained by subtracting the lower limit of a class from the lower limit of the next class.
2-4
![Page 14: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/14.jpg)
EXAMPLE 1
Dr. Tillman is the dean of the school of business and wishes to determine the amount of studying business school students do. He selects a random sample of 30 students and determines the number of hours each student studies per week:
15.0, 23.7, 19.7, 15.4, 18.3, 23.0, 14.2, 20.8, 13.5, 20.7, 17.4, 18.6, 12.9, 20.3, 13.7, 21.4, 18.3, 29.8, 17.1, 18.9, 10.3, 26.1, 15.7, 14.0, 17.8, 33.8, 23.2, 12.9, 27.1, 16.6.
Organize the data into a frequency distribution.
2-5
![Page 15: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/15.jpg)
EXAMPLE 1 continued
Hours studying Frequency, f 8-12 1 13-17 12 18-22 10 23-27 5 28-32 1 33-37 1
2-6
Consider the classes 8-12 and 13-17. The class marks are 10 and 15. The class interval is 5 (13-8).
![Page 16: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/16.jpg)
Suggestions on Constructing a Frequency Distribution
The class intervals used in the frequency distribution should be equal.
Determine a suggested class interval by using the formula: i = (highest value-lowest value)/number of classes.
2-7
![Page 17: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/17.jpg)
Suggestions on Constructing a Frequency Distribution
Use the computed suggested class interval to construct the frequency distribution. Note: this is a suggested class interval; if the computed class interval is 97, it may be better to use 100.
Count the number of values in each class.
2-8
![Page 18: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/18.jpg)
Relative Frequency Distribution
The relative frequency of a class is obtained by dividing the class frequency by the total frequency.
The sum of the relative frequencies equals 1.
Frequency,f
RelativeFrequency
8-12 1 1/30=.0333
13-17 12 12/30=.400
18-22 10 10/30=.333
23-27 5 5/30=.1667
28-32 1 1/30=.0333
33-37 1 1/30=.0333
TOTAL 30 30/30=1
T
Hours
2-9
![Page 19: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/19.jpg)
EXAMPLE 2
The cans in a sample of 20 cans of fruit contain net weights of fruit ranging from 19.3 to 20.9 oz, as given in the Table. If we want to group these data into 6 classes, we get class intervals of 0.3 oz
[21,0 – 19,2/6 ]= 0,3 oz. The weights given in the Table can be arranged into the frequency distributions given in the next Table.
![Page 20: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/20.jpg)
Frequency Distribution of Weights
![Page 21: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/21.jpg)
Stem-and-Leaf Displays
Stem-and-Leaf Display: A statistical technique for displaying a set of data. Each numerical value is divided into two parts: the leading digits become the stem and the trailing digits the leaf.
Note: An advantage of the stem-and-leaf display over a frequency distribution is we do not lose the identity of each observation.
2-10
![Page 22: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/22.jpg)
EXAMPLE 3
Colin achieved the following scores on his twelve accounting quizzes this semester: 86, 79, 92, 84, 69, 88, 91, 83, 96, 78, 82, 85. Construct a stem-and-leaf chart for the data.
stem leaf
6 9
7 8 9
8 2 3 4 5 6 8
9 1 2 6
2-11
![Page 23: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/23.jpg)
Graphic Representation of a Frequency Distribution
The three commonly used graphic forms are histograms, frequency polygons, and cumulative frequency distribution.Histogram: A graph in which the classes are
marked on the horizontal axis and the class frequencies on the vertical axis. The class frequencies are represented by the heights of the bars and the bars are drawn adjacent to each other.
2-12
![Page 24: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/24.jpg)
Histogram for Hours Spent Studying
0
2
4
6
8
10
12
14
10 15 20 25 30 35
Hours spent studying
Fre
qu
ency
2-14
![Page 25: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/25.jpg)
Histogram of Weights
![Page 26: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/26.jpg)
Frequency Polygon
A frequency polygon consists of line segments connecting the points formed by the class midpoint and the class frequency.
![Page 27: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/27.jpg)
Frequency Polygon for Hours Spent Studying
2-15
0
2
4
6
8
10
12
14
10 15 20 25 30 35
Hours spent studying
Fre
qu
ency
![Page 28: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/28.jpg)
Cumulative Frequency Distribution
A cumulative frequency distribution is used to determine how many or what proportion of the data values are below or above a certain value.
![Page 29: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/29.jpg)
Cumulative Frequency Distribution For Hours Studying
0
5
10
15
20
25
30
35
10 15 20 25 30 35
Hours Spent Studying
Frequency
2-16
![Page 30: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/30.jpg)
Bar Chart
A bar chart can be used to depict any of the levels of measurement (nominal, ordinal, interval, or ratio).
EXAMPLE 3: Construct a bar chart for the number of unemployed people per 100,000 population for selected cities.
2-17
![Page 31: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/31.jpg)
EXAMPLE continued
City Number of unemployedper 100,000 population
Atlanta, GA 7300Boston, MA 5400Chicago, IL 6700
Los Angeles, CA 8900New York, NY 8200
Washington, D.C. 8900
2-18
![Page 32: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/32.jpg)
Bar Chart for the Unemployment Data
7300
5400
6700
89008200
8900
0
2000
4000
6000
8000
10000
1 2 3 4 5 6
Cities
# u
nem
plo
yed
/100
,000
AtlantaBostonChicagoLos AngelesNew YorkWashington
2-19
![Page 33: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/33.jpg)
Pie Chart
A pie chart is especially useful in displaying a relative frequency distribution. A circle is divided proportionally to the relative frequency and portions of the circle are allocated for the different groups.
EXAMPLE 4: A sample of 200 runners were asked to indicate their favorite type of running shoe.
2-20
![Page 34: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/34.jpg)
EXAMPLE continued
Draw a pie chart based on the following information.
Type of shoe # of runners
Nike 92
Adidas 49
Reebok 37
Asics 13
Other 9
2-21
![Page 35: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/35.jpg)
Pie Chart for Running Shoes
Nike
Adidas
ReebokAsics
Other
Nike
Adidas
ReebokAsics
Other
2-22
![Page 36: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/36.jpg)
MEASURES OF CENTRAL TENDENCY
Central tendency refers to the location of a distribution. The most important measures of central
tendency are (1) the mean, (2) the median, and (3) the mode.
![Page 37: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/37.jpg)
The Mean
![Page 38: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/38.jpg)
The Median
The median for ungrouped data is the value of the middle item when all the items are arranged in either ascending or descending order in terms of values:
where N refers to the number of items in the population (n for a sample).
![Page 39: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/39.jpg)
The Mode
The mode is the value that occurs most frequently in the data set.
The mean is the most commonly used measure of central tendency. The mean, however, is affected by extreme values in the data set, while the median and the mode are not.
Other measures of central tendency are the weighted mean, the geometric mean, and the harmonic mean
![Page 40: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/40.jpg)
EXAMPLE
A student received the following grades (measured from 0 to 10) on the 10 quizzes he took during a semester: 6, 7, 6, 8, 5, 7, 6, 9, 10, and 6.
Find the mean, median and mode for the population on the 10 quizzes.
![Page 41: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/41.jpg)
EXAMPLE
To find the median for the ungrouped data, we first arrange the 10 grades in ascending order: 5, 6, 6, 6, 6, 7, 7, 8, 9,10.
Then we find the grade of the (N+1)/2 or (10+1)/2= 5,5th item. Thus the median is the average of the 5th and 6th item in the array, or (6+7)/2=6,5
The mode for the ungrouped data is 6 (the value that occurs most frequently in the data set).
![Page 42: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/42.jpg)
Example : Mean for Grouped Data
estimate the mean for the grouped data given in the Table below.
![Page 43: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/43.jpg)
MEASURES OF DISPERSION
Dispersion refers to the variability or spread in the data. The most important measures of dispersion are (1) the average deviation,
(2) the variance, and (3) the standard deviation. We will measure these for populations and
samples,
![Page 44: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/44.jpg)
Average deviation
The average deviation (AD), also called the mean absolute deviation (MAD), is given by
where the two vertical bars indicate the absolute value, or the values omitting the sign.
![Page 45: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/45.jpg)
Variance
The population variance the Greek letter sigma squared) and the sample variance s2 for ungrouped data are given by
2
![Page 46: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/46.jpg)
Standard deviation
The population standard deviation and sample standard deviation s are the positive square roots of their respective variances. For ungrouped data
![Page 47: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/47.jpg)
EXAMPLE
Calculate the Average Deviation, Variance and Standart deviation by using the data for quiz grades.
![Page 48: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/48.jpg)
EXAMPLE continued
![Page 49: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/49.jpg)
SHAPE OF FREQUENCY DISTRIBUTIONS
The shape of a distribution refers to (1) its symmetry or lack of it (skewness) and (2) its peakedness (kurtosis).
![Page 50: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/50.jpg)
Skewness
A distribution has zero skewness if it is symmetrical about its mean. For a symmetrical (unimodal) distribution, the mean, median, and mode are equal.
A distribution is positively skewed if the right tail is longer. Then, mean > median > mode.
A distribution is negatively skewed if the left tail is longer. Then, mode > median > mean
![Page 51: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/51.jpg)
Kurtosis
A peaked curve is called leptokurtic, as opposed to a flat one (platykurtic), relative to one that is mesokurtic.
The kurtosis for a mesokurtic curve is 3.
![Page 52: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/52.jpg)
0
1
2
3
4
5
6
500 1000 1500 2000 2500
Series: X2Sample 1960 1982Observations 23
Mean 1035.065Median 843.3000Maximum 2478.700Minimum 397.5000Std. Dev. 617.8470Skewness 0.962455Kurtosis 2.818835
Jarque-Bera 3.582342Probability 0.166765
![Page 53: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/53.jpg)
correlation coefficient
A correlation coefficient is a number that summarizes the degree to which two variables move together.
Correlations range in value from -1 to +1. When the coefficient is 1 (either -1 or +1), the two variables are perfectly "in sync" with each other - a unit change in one is accompanied by a unit change in the other.
If the variables are moving in opposite directions (one increases as the other decreases), it is a negative relationship.
![Page 54: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/54.jpg)
correlation coefficient
We indicate a negative relationship by using a minus sign before the coefficient.
If the variables are moving in the same direction (both are increasing or both are decreasing together), we denote that by reporting the coefficient as a positive number.
When the coefficient is 0, there is no relationship between the two variables.
Typically, coefficients fall somewhere between no relationship (0) and a perfect relationship (+/—1).
![Page 55: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/55.jpg)
Correlation Matrix
![Page 56: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/56.jpg)
The scatterplot
The scatterplot is the visual complement for the correlation coefficient. It visually displays whether there's any connection between the movements of two variables.
One variable is displayed on the X axis while the other variable is displayed on the Y axis.
The values on either axis might be expressed in absolute numbers, percentages, rates, or scores.
![Page 57: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/57.jpg)
Scatterplot
35
40
45
50
55
60
65
70
75
0 500 1000 1500 2000 2500
X2
X3
X3 vs. X2
![Page 58: BASIC STATISTICAL TOOLS. What is Statistics Statistics refers to the collection, presentation, analysis, and utilization of numerical data to](https://reader036.vdocuments.mx/reader036/viewer/2022062322/56649d2f5503460f94a06845/html5/thumbnails/58.jpg)
Time Series Graph
40
80
120
160
200
240
60 62 64 66 68 70 72 74 76 78 80 82
X4 X5