one way analysis of variance - cba · 2018. 5. 7. · one-way analysis of variance evaluate the...

32
Business Statistics Department of Quantitative Methods & Information Systems Dr. Mohammad Zainal QMIS 220 Chapter 11 One Way analysis of Variance

Upload: others

Post on 25-Jul-2021

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Business Statistics

Department of Quantitative Methods & Information Systems

Dr. Mohammad Zainal QMIS 220

Chapter 11

One Way analysis of Variance

Page 2: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Chapter Goals

After completing this chapter, you should be able

to:

Recognize situations in which to use analysis of variance

Perform a single-factor hypothesis test and interpret results

Page 3: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Chapter Overview

Analysis of Variance (ANOVA)

F-test

F-test Tukey-

Kramer

test Fisher’s Least

Significant

Difference test

One-Way

ANOVA

Randomized

Complete

Block ANOVA

Two-factor

ANOVA

with replication

Page 4: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

General ANOVA Setting

Investigator controls one or more independent

variables

Called factors (or treatment variables)

Each factor contains two or more levels (or

categories/classifications)

Observe effects on dependent variable

Response to levels of independent variable

Experimental design: the plan used to test

hypothesis

Page 5: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Way Analysis of Variance

Evaluate the difference among the means of three or more populations

Examples: ● Accident rates for 1st, 2nd, and 3rd shift

● Expected mileage for five brands of tires

Assumptions

Populations are normally distributed

Populations have equal variances

Samples are randomly and independently drawn

Page 6: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Completely Randomized Design

Experimental units (subjects) are assigned

randomly to treatments

Only one factor or independent variable

With two or more treatment levels

Analyzed by

One-factor analysis of variance (one-way ANOVA)

Called a Balanced Design if all factor levels

have equal sample size

Page 7: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Hypotheses of One-Way ANOVA

All population means are equal

i.e., no treatment effect (no variation in means among

groups)

At least one population mean is different

i.e., there is a treatment effect

Does not mean that all population means are different

(some pairs may be the same)

k3210 μμμμ:H

same the are means population the of all Not:HA

Page 8: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Factor ANOVA

All Means are the same:

The Null Hypothesis is True

(No Treatment Effect)

k3210 μμμμ:H

same the are μ all Not:H iA

321 μμμ

Page 9: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Factor ANOVA

At least one mean is different:

The Null Hypothesis is NOT true

(Treatment Effect is present)

k3210 μμμμ:H

same the are μ all Not:H iA

321 μμμ 321 μμμ

or

(continued)

Page 10: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Partitioning the Variation

Total variation can be split into two parts:

SST = Total Sum of Squares

SSB = Sum of Squares Between

SSW = Sum of Squares Within

SST = SSB + SSW

Page 11: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Partitioning the Variation

Total Variation (SST) = the aggregate dispersion of the

individual data values across the various factor levels

Within-Sample Variation (SSW) = dispersion that exists

among the data values within a particular factor level

Between-Sample Variation (SSB) = dispersion among the

factor sample means

SST = SSB + SSW

(continued)

Page 12: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Partition of Total Variation

Variation Due to

Factor (SSB) Variation Due to Random

Sampling (SSW)

Total Variation (SST)

Commonly referred to as:

Sum of Squares Within

Sum of Squares Error

Sum of Squares Unexplained

Within Groups Variation

Commonly referred to as:

Sum of Squares Between

Sum of Squares Among

Sum of Squares Explained

Among Groups Variation

= +

Page 13: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Total Sum of Squares

k

i

n

j

ij

i

xxSST1 1

2)(

Where:

SST = Total sum of squares

k = number of populations (levels or treatments)

ni = sample size from population i

xij = jth measurement from population i

x = grand mean (mean of all data values)

SST = SSB + SSW

Page 14: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Total Variation (continued)

Group 1 Group 2 Group 3

Response, X

22

12

2

11 )xx(...)xx()xx(SSTkkn

x

Page 15: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Sum of Squares Between

Where:

SSB = Sum of squares between

k = number of populations

ni = sample size from population i

xi = sample mean from population i

x = grand mean (mean of all data values)

2

1

)xx(nSSB i

k

i

i

SST = SSB + SSW

Page 16: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Between-Group Variation

Variation Due to

Differences Among Groups

i j

2

1

)xx(nSSB i

k

i

i

1

k

SSBMSB

Mean Square Between =

SSB/degrees of freedom

Page 17: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Between-Group Variation (continued)

Group 1 Group 2 Group 3

Response, X

22

22

2

11 )xx(n...)xx(n)xx(nSSB kk

3x

1x 2xx

Page 18: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Sum of Squares Within

Where:

SSW = Sum of squares within

k = number of populations

ni = sample size from population i

xi = sample mean from population i

xij = jth measurement from population i

2

11

)xx(SSW iij

n

j

k

i

j

SST = SSB + SSW

Page 19: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Within-Group Variation

Summing the variation

within each group and then

adding over all groups

i

kn

SSWMSW

T

Mean Square Within =

SSW/degrees of freedom

2

11

)xx(SSW iij

n

j

k

i

j

Page 20: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Within-Group Variation (continued)

Group 1 Group 2 Group 3

Response, X

22

212

2

111 )xx(...)xx()xx(SSW kknk

3x

1x 2x

Page 21: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Way ANOVA Table

Source of

Variation df SS MS

Between

Samples SSB MSB =

Within

Samples nT - k SSW MSW =

Total nT - 1 SST =

SSB+SSW

k - 1 MSB

MSW

F ratio

k = number of populations

nT = sum of the sample sizes from all populations

df = degrees of freedom

SSB

k - 1

SSW

nT - k

F =

Page 22: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Formula can be used

SSB(SSF)

SSB = SST-SSW

SSB = MSB (k-1)

SSB = 𝐓𝟏𝟐

𝐧𝟏+

𝐓𝟐𝟐

𝐧𝟐+⋯+

𝐓𝐤𝟐

𝐧𝐤−

𝐓𝟐

𝐧

SSB = 𝐧𝐢 𝐱 𝐢 − 𝐱 𝟐

SSW(SSE)

SSW = SST-SSB

SSW = MSW (n-k)

SSW = 𝐧𝟏 − 𝟏 𝐒𝟏𝟐 + 𝐧𝟐 − 𝟏 𝐒𝟐

𝟐 +⋯

+ 𝐧𝐤 − 𝟏 𝐒𝐤𝟐 = (𝐧𝐢−𝟏)𝐒𝐢

𝟐

SSW= 𝐱𝟐 −𝐓𝟏𝟐

𝐧𝟏+

𝐓𝟐𝟐

𝐧𝟐+⋯+

𝐓𝐤𝟐

𝐧𝐤

SST

SST = SSB+SSW

SST = 𝐱𝟐 −𝐓𝟐

𝐧

QMIS 220, by Dr. M. Zainal 22

Page 23: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Notes

k: number of samples/ groups/ populations.

𝒏 = 𝒏𝟏 + 𝒏𝟐 +⋯+ 𝒏𝒌 (𝒕𝒐𝒕𝒂𝒍 𝒔𝒂𝒎𝒑𝒍𝒆 𝒔𝒊𝒛𝒆).

𝑻𝟏 = 𝒙𝟏, 𝑻𝟐 = 𝒙𝟐 , … , 𝑻𝒌 = 𝒙𝒌 𝒐𝒓 𝑻𝟏 = 𝒏𝟏𝒙 𝟏 , 𝑻𝟐= 𝒏𝟐𝒙 𝟐, … , 𝑻𝒌 = 𝒏𝒌𝒙 𝒌

𝑻 = 𝑻𝟏 + 𝑻𝟐 +⋯+ 𝑻𝒌

𝑺𝟏𝟐, 𝑺𝟐

𝟐, … , 𝑺𝒌𝟐

𝒙𝟐 = 𝒙𝟏𝟐 + 𝒙𝟐

𝟐 +⋯+ 𝒙𝒌𝟐

𝒙𝟐 ≠ 𝒙 𝟐

𝑻𝒊𝟐 ≠ 𝒙𝒊

𝟐

𝑻𝒊𝟐 = 𝒙𝒊

𝟐

𝑴𝑺𝑬 = 𝑺𝑷𝟐 =

𝒏𝟏−𝟏 𝑺𝟏𝟐+ 𝒏𝟐−𝟏 𝑺𝟐

𝟐+⋯+ 𝒏𝒌−𝟏 𝑺𝒌𝟐

𝒏−𝒌

QMIS 220, by Dr. M. Zainal 23

Page 24: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Factor ANOVA F Test Statistic

Test statistic

MSB is mean squares between variances

MSW is mean squares within variances

Degrees of freedom

df1 = k – 1 (k = number of populations)

df2 = nT – k (nT = sum of sample sizes from all populations)

MSW

MSBF

H0: μ1= μ2 = … = μ k

HA: At least two population means are different

Page 25: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Interpreting One-Factor ANOVA F Statistic

The F statistic is the ratio of the between estimate of variance and the within estimate of variance The ratio must always be positive

df1 = k -1 will typically be small

df2 = nT - k will typically be large

The ratio should be close to 1 if H0: μ1= μ2 = … = μk is true The ratio will be larger than 1 if H0: μ1= μ2 = … = μk is false

Page 26: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Factor ANOVA F Test Example

You want to see if three

different golf clubs yield

different distances. You

randomly select five

measurements from trials on

an automated driving

machine for each club. At the

.05 significance level, is there

a difference in mean

distance?

Club 1 Club 2 Club 3

254 234 200

263 218 222

241 235 197

237 227 206

251 216 204

Page 27: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Factor ANOVA Example: Scatter Diagram

Club 1 2 3

Page 28: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Factor ANOVA Example Computations

Club 1 Club 2 Club 3

254 234 200

263 218 222

241 235 197

237 227 206

251 216 204

Page 29: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

One-Factor ANOVA Example Solution

Page 30: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

SUMMARY

Groups Count Sum Average Variance

Club 1 5 1246 249.2 108.2

Club 2 5 1130 226 77.5

Club 3 5 1029 205.8 94.2

ANOVA

Source of Variation

SS df MS F P-value F crit

Between

Groups 4716.4 2 2358.2 25.275 4.99E-05 3.885

Within

Groups 1119.6 12 93.3

Total 5836.0 14

ANOVA -- Single Factor: Excel Output

EXCEL: tools | data analysis | ANOVA: single factor

Page 31: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Chapter Summary

Described one-way analysis of variance

The logic of ANOVA

ANOVA assumptions

F test for difference in k means

Page 32: One Way analysis of Variance - CBA · 2018. 5. 7. · One-Way Analysis of Variance Evaluate the difference among the means of three or more populations Examples: Accident rates for

Copyright

The materials of this presentation were mostly

taken from the PowerPoint files accompanied

Business Statistics: A Decision-Making Approach,

7e © 2008 Prentice-Hall, Inc.

QMIS 220, by Dr. M. Zainal Chap 10-32