guide to using excel 2007 for basic statistical applications to accompany business statistics: a...

42
Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple Regression and Model Building By Groebner, Shannon, Fry, & Smith Prentice-Hall Publishing Company Copyright, 2011

Upload: bilal-lovins

Post on 14-Dec-2015

221 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Guide to Using Excel 2007 For Basic Statistical Applications

Guide to Using Excel 2007 For Basic Statistical Applications

To Accompany

Business Statistics: A Decision Making Approach, 8th Ed.

Chapter 15:

Multiple Regression and Model Building

By

Groebner, Shannon, Fry, & SmithPrentice-Hall Publishing Company

Copyright, 2011

Page 2: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Chapter 15 Excel Examples

Multiple Regression First City Real Estate

Multiple Regression –Dummy Variable First City Real Estate

Curvilinear RegressionAshley Investment Services

Interaction Effects Ashley Investment Services

More Examples

Page 3: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Chapter 15 Excel Examples

Residual Analysis First City Real Estate

Page 4: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

Issue: First City management wishes to build a model that can be used to predict sales prices for residential property.

Objective: Use Excel 2007 to build a multiple regression model relating sales price to a set of measurable variables.

Data file is FirstCity.xls

Page 5: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

• Open the file FirstCity.xls, Sheet Homes-Sample 1

• Select the Data Tab• Select Data Analysis• Select Correlation• OK

Page 6: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

• Input Range: A1:G320• Grouped by: Columns• New Worksheet Ply: corr-

1• OK

Page 7: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

This cell shows the correlation, 0.7477, between Price and Square Feet

Page 8: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

Return to Homes-Sample 1• Select the Data tab• Select Data Analysis• Select Regression• OK

Page 9: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

• Input Y Range: A1:A320• Input X Range: B1:F320• Select labels• New Worksheet Ply:

regress-1• OK

Page 10: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

Excel produces the Regression Model

Page 11: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

To determine if multicollinearity is a problem:• Select the Add-Ins tab• Select PHStat• Select Regression• Select Multiple Regression

Page 12: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

• Y variable … : A1:A320• X variable … : B1:F320• Select First Cells in both …• Select Regression Statistics

…• Select ANOVA and …• Select Variance Inflation … • OK

Page 13: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression – First City Real Estate

PHStat will find Variance Inflation Factors for all independent variables. This slide shows the VIF for Garage # and all other X variables. Click on the sheet tabs X4,X3,X2,X1 to see the VIF for the other variables.

Page 14: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Issue: First City managers wish to improve the model by adding a location variable for the area.

Objective: Use Excel 2007 to improve a regression model by adding a dummy variable for the area either foothills or flatland.

Data file is First City.xls

Multiple Regression - Dummy Variable -First City Real EstateMultiple Regression - Dummy Variable -First City Real Estate

Page 15: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression- Dummy Variable – First City Real Estate

• Select the Home-Sample 2 sheet

• Select the Add-In tab• Select PHStat• Select Regression• Select Multiple

Regression

Page 16: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression- Dummy Variable – First City Real Estate

• Cut Bedrooms and Bathrooms columns and paste in cell G1. Delete columns C and D so that all data is in connected columns

• Select Data tab• Select Regression• Input Y Range: A1 – A320• Input X Range B1-E320 • Select Labels• New Worksheet: Regress-2• OK

Page 17: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Multiple Regression- Dummy Variable – First City Real Estate

All variables are significant and have the expected sign

Page 18: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships - Ashley Investment ServicesCurvilinear Relationships - Ashley Investment Services

Issue: The director of personnel is trying to determine whether there is a relationship between employee burnout and time spent socializing with co-workers.

Objective: Use Excel 2007 to determine whether the relationship between the two measures is statistically significant.

Data file is Ashley.xls

Page 19: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

Open the file Ashley.xls

Page 20: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

• Select A1:B21• Select the Insert Tab• Select Scatter Plot• Select the Scatter Plot desired

Page 21: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

Page 22: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

• Select the Data tab• Select Data Analysis• Select Regression

Page 23: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

• Y Range A1:A21• X Range B1:B21• Labels• New Worksheet Ply:

Ashley-1• OK

Page 24: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

Page 25: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

To develop a nonlinear model, return to the data file. Create a new variable Socialization Squared

Page 26: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

• Select the Data tab• Select Data Analysis• Select Regression• OK

Page 27: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

• Input Y Range: A1:A21• Input X Range: B1:C21• Select Labels• New Worksheet Ply: Ashley-2

Page 28: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Curvilinear Relationships – Ashley Investment Services

The output shows the R Square value and the Regression Coefficients.

Page 29: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Interaction Effects - Ashley Investment ServicesInteraction Effects - Ashley Investment Services

Issue: The director of personnel is trying to determine whether the model can be improved by separating observations between those taken from men and women.

Objective: Use Excel 2007 to determine whether the relationship between the measures can be improved.

Data file is Ashley-2.xls

Page 30: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Interaction Effects – Ashley Investment Services

• Open file Ashley-2.xls• Insert a new column C as

(Socialization Squared) which is Column B squared

• Add Column E as Columns B * D

• Add Column F as Columns C * D

Page 31: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Interaction Effects – Ashley Investment Services

Using The Insert tab and Chart tools set up a Scatter Plot for one gender

Page 32: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Interaction Effects – Ashley Investment Services

Add the second gender

Add exponential trend line for male and female

Page 33: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Interaction Effects – Ashley Investment Services

The regression for the curvilinear model.

Page 34: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Issue: The company is interested in analyzing the residuals of the regression model to determine whether the assumptions of multiple regression are satisfied.

Objective: Use Excel 2007 to analyze residuals from a regression model.

Data file is First City-3.xls

Residual Analysis - First City Real Estate

Residual Analysis - First City Real Estate

Page 35: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Residual Analysis – First City Real Estate

Open the file FirstCity3.xls

Since Excel requires independent variables to be in adjacent columns – cut and paste these columns.

We will be using:• Price• Sq. Feet• Bedrooms• Garage #• Log of Lot Size

• Note: I will swap Lot Size and Log Lot Size to simplify the operation

Page 36: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Residual Analysis – First City Real Estate

• Select the Data tab

• Select Data Analysis

• Select Regression

• OK

Page 37: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Residual Analysis – First City Real Estate

Define the range for X and Y variables.Excel gives several options for Residual Analysis but does not have as complete a set as Minitab

Page 38: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Residual Analysis – First City Real Estate

This is the Residual Plot for Square Feet

Page 39: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Residual Analysis – First City Real Estate

While Excel will not automatically generate a histogram of the standardized residuals , one can be created.

Page 40: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Residual Analysis – First City Real Estate

Define Bin values for the Histogram

Page 41: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Residual Analysis – First City Real Estate

• Select the Data tab

• Select Data Analysis

• Select Histogram• On the Histogram

Chart identify the data and bins

• Select Chart output

Page 42: Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple

Residual Analysis – First City Real Estate