1-1 what is meant by statistics? statistics is the science of collecting, organizing, presenting,...
TRANSCRIPT
1-1
What is Meant by Statistics?
Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting numerical data to assist in making more effective decisions.
COPAID
1-5
Types of Statistics
Descriptive Statistics Inferential Statistics
1-6
Descriptive Statistics Descriptive Statistics deals with methods of
Organizing,
Summarizing, &
Presenting data* in an informative way.
Typically, descriptive statistics include Mean, Mode, Median, Variance, Deviation, Skewness,Charts – histogram, bar, pie
(we will see these in later chapters)
* past/current data but not estimated future data
1-7
Excel Example Output of Descriptive Statistics
1-8
PopulationPopulation – all possible individuals, objects.
SampleSample – part of the population of interest
Inferential StatisticsInferential Statistics: : methods used tomethods used to determine something about a population on the basis determine something about a population on the basis
ofof a sample.
1-10
Why sample?
•Time & cost are prohibitive
•Physical impossibility of checking all items in population(eg. Checking quality of product if they are made in the millions)
•Destructive nature of some tests
•Sample results are adequate for decision-making
1-11
Types of Variables
1-12
Types of Variables
G enderE ye
C olor
For a Qualitative Qualitative or Attribute VariableAttribute Variable the characteristic being studied is non-numeric. It can only be labeled. (sometimes also called Categorical variable)
T ype of car
1-13
Types of Variables
Number of children in a family
In a Quantitative VariableQuantitative Variable information is reported numerically.
Balance in your checking account
Minutes remaining in class
1-14Types of Quantitative Variables
Discrete Variables:Discrete Variables: can only assume certain values
-there are usually “gaps” between values
- usually “counted”
Example: the number of bedrooms in a house, or the number of hammers sold at the local Home Depot (1,2,3,…,etc).
Quantitative variables can be classified as either
DiscreteDiscrete or ContinuousContinuous.
1-15
The height or weight of students The height or weight of students in a class.in a class.
A Continuous VariableContinuous Variable can assume any value within a specified range (“no gaps”).
The pressure in a tireThe pressure in a tire
1-16
Chart to remember
1-17
Nominal Nominal OrdinalOrdinalIntervalInterval
RatioRatio
The level of measurement dictates the kind of calculations you can do on the data.
Eg. If one student’s major is Accounting and another’s IS, we cannot calculate the average major.On the other hand, we can average their heights, weights, etc.
1-18
Nominal levelNominal level
Data that is classified into categories.
Can be arranged in any order.
Measurement consists only of counts.
G ender
E yeC olor
1-19
Other examples: religion, major, gender, ethnicity, …
In this example, Country or Region is Nominal Level data
1-20
Mutually exclusiveMutually exclusive
An individual, object, or measurement is included in only one category.
Nominal level data must be:
Exhaustive Exhaustive Each individual, object, or measurement must appear in one of the categories.
1-21
During a taste test of 4 soft drinks, Coca Cola was ranked number 1, Dr. Pepper number 2, Pepsi number 3, and Root Beer number 4.
Ordinal levelOrdinal level - involves data arranged in some order
- magnitude of differences between data values cannot be determined.
1
2
3
4
1-22
Also, see the example table in page 12 (Homeland Security Advisory System)
Example of an Ordinal level variable
1-23
Eg.•Temperature on the Fahrenheit scale.•Difference between 10°F - 15°F is same as between 50°F& - 55°F •0° does not represent absence of temperature
Interval levelInterval level - similar to the ordinal level- amounts of differences between data values is of equal size- there is no natural zero point.
1-24
M onthly incomeof surgeons
M iles trav eled by salesrepresentativ e in a month
Ratio level Ratio level (“highest” level of measurement)(“highest” level of measurement)
-- zero value means “absence”- differences and ratios are meaningful for this level of measurement(A person with $2Million is twice as rich as another with $1Million)
Traveled ‘0’ miles means did not travel at all
Monthly income ‘0’ means did not make any money
1-25
N-O-I-R ( Nerd Of India Rocks! )
1-26
Ethical Considerations
Statistics can be used to mislead decision makers Don’t do it!
•Keep taking different samples until you get the result you want
•Quote ‘average’ to hide wide range of data values
•Misleading graphical outputs
•Make unwarranted conclusions on variable relationships
1-27
Ethical Considerations
The cost/year doubled in 5 years. But the graph appears to depict more than that.
1-28
Ethical Considerations
By changing the x-y scale, the rate of change in unemployment appears different.
1-29
Also, a statistical association between two variables does not automatically imply ‘causation’. More in Chapter 13.
Eg.
•Consumption of peanuts is correlated with aspirin consumption (eating peanuts gives headaches)
Ethical Considerations