don’t cry because it is all over, smile because it happened

Don’t cry because it is all over, smile because it

happened

Potential Problems in Potential Problems in SamplingSampling

Poor Sampling Frame

Cost of Sampling

Built -In Bias

Cost of SamplingCost of Sampling

Money

Time

Wide Geographic Region

Major Errors in Major Errors in SamplingSampling

Bias:

Consistent, repeated divergence in the same direction of a sample statistic from its associated population parameter.

Lack of Precision:

Large theoretical variation in a sample statistic

Sampling ErrorSampling Error

The difference between the sample statistic and its corresponding population

parameter.

Population:

97, 103, 96, 99, 105

(Mean = 100)

Non-Sampling ErrorsNon-Sampling Errors

Survey Timing

Survey Mode

Interviewer – Subject Relationship

Survey Topic

Question Wording

Question Sequence

Statistical SignificanceStatistical Significance

An observed effect so large that it would rarely occur by

chance.

Hypothesis TestingHypothesis Testing

What is a Hypothesis?

A statement about the value of a population parameter developed for the

purpose of testing.

Hypothesis TestingHypothesis TestingWhat is Hypothesis Testing?

A procedure, based on sample evidence and probability theory, used to determine whether the

hypothesis is a reasonable statement and should not be

rejected or is unreasonable and should be rejected.


Examples of hypotheses made about a population parameter are:

• The mean monthly income for systems analysts is $3,625.

• Twenty percent of all juvenile offenders are caught and sentenced to prison.


Null Hypothesis H0:

A statement about the value of a population parameter.

Alternative Hypothesis H1:

A statement that is accepted if the sample data provide evidence that the null hypothesis is false.


Level of Significance:

The probability of rejecting the null hypothesis when it is actually true.


Statistical testing is often done by testing a hypothesis that you expect

to reject.

Null HypothesisNull HypothesisNull Hypothesis H0: A statement about

the value of a population parameter. Stating the current fact(s).

PopulationPopulation

Graphic RepresentationGraphic Representationof the Populationof the Population

Alternative HypothesisAlternative HypothesisAlternative (Research) Hypothesis H1:

A statement that is accepted if the sample data provide evidence that the null hypothesis is false.

SampleSample

Graphic Representation Graphic Representation of a Large Sampleof a Large Sample

Graphic Representation of Graphic Representation of the Population & Samplethe Population & Sample

Sample

ZPopulationZ

Statistics! Statistics! Statistics! Finish the Maze and we get to take a break!

Testing a HypothesisTesting a Hypothesis

Tail Tail

Testing a HypothesisTesting a Hypothesis.05 level of significance.05 level of significance

One tailed test: More than; greater than; larger than; etc…

Critical Z

1.645ZCritical Value

Critical Region.05 Area

Null Hypothesis Area

One Tailed Test, .05Smaller than; less than, etc.

Critical Z

1.645 ZZ value

Critical Region


Two Tailed Test, .05Not Equal to; Different Than

Critical Z Critical Z

1.96 ZZ value

1.96 ZZ value

+

Critical RegionCritical Region


Graphic RepresentationGraphic Representation of Hypothesis Test Resultsof Hypothesis Test Results

This maze is longer than I thought.

Go Ahead and take a break!

Hypothesis TestingHypothesis TestingState null and alternative hypothesis

Select a level of significance

Formulate a decision rule

Identify the test statistic

Take a sample, arrive at a decision(Reject or fail to reject the null)

Test for Sample Means

X = Sample meanμ = Hypothesized population mean

s = Sample standard deviationN = Sample size

S

One Sample Mean Problem

A recent article in Vitality magazine reported that the mean amount of leisure time per

week for American men is 40.0 hours. You believe this figure is too large and decide to conduct your own test. In a random sample

of 60 men, you find that the mean is 37.8 hours of leisure per week with a standard

deviation of 12.2 hours. Can you conclude that the data in the article is too large? Use

the .05 significance level.

Step 1

State the null and alternative hypothesis.

H0: Mean = 40.0 hours

H1: Mean < 40.0 hours

Step 2

Select a level of significance.

This will be given to you. In this problem, it is .05.

Step 3Establish critical region by converting level of

significance to a Z score..5000 - .0500 = .4500 = 1.64z

If the test statistic falls below -1.64z, the null hypothesis will be rejected.

Step 4Identify the test statistic.

Z = 37.8 – 40.0 12.2 / 7.75

Z = -2.2 / 1.57

Test Statistic: Z = -1.40

Step 5Arrive at a decision.

The test statistic falls in the null hypothesis region. Therefore, we fail to

reject the null.

Test for Two Sample Means

Xi = Mean for group i

Si = Standard deviation for group i

ni = Number in group i

Two Sample Means ProblemThe board of directors at the Anchor Pointe Marina

is studying the usage of boats among its members. A sample of 30 members who have boats 10 to 20 feet in length showed that they

used their boats an average of 11 days last July. The standard deviation of the sample was 3.88

days. For a sample of 40 member with boats 21 to 40 feet in length, the average number of days

they used their boats in July was 7.67 with a standard deviation of 4.42 days. At the .02 significance level, can the board of directors

conclude that those with the smaller boats use their crafts more frequently?

Step 1

State the null and alternative hypothesis.

H0: Large boat usage = small boat usage

H1: Smaller boat usage > large boat usage

Step 2

Select a level of significance.

This will be given to you. In this problem it is .02.

Step 3

Formulate a decision rule.

.5000 - .0200 = .4800 = 2.05z

Step 4Identify the test statistic.

11 – 7.67 = 3.35z

3.882 + 4.422

30 40

Step 5

Arrive at a decision.

The test statistic falls in the critical region, therefore we reject the null.

p-Value in Hypothesis Testing

• p-Value: The probability, assuming that the null hypothesis is true, of getting a value of the test statistic at least as extreme as the computed value for the test.

• If the p-value area is smaller than the significance level, H0 is rejected.

• If the p-value area is larger than the significance level, H0 is not rejected.

Statistical Significance

p-Value: The probability of getting a sample outcome as far from what we would expect to get if the null hypothesis is true.

The stronger that p-value, the stronger the evidence that the null hypothesis is false.

Statistical Significance

P-values can be determined by

- computing the z-score

- using the standard normal table

The null hypothesis can be rejected if the p-value is small enough.

P-Value

1.64 Z 2.05Z

don’t cry because it is all over, smile because it happened

Documents

hypothesis testingwhat

hypothesis testingstate

alternative hypothesis

areanull hypothesis

sample evidence

sample data

sample meansx

sample graphic representation