do now read pages 222 – 224 read pages 222 – 224 stop before “goals of re-expression” stop...

24
DO NOW Read Pages 222 – 224 •Stop before “Goals of Re- expression” Answer the following questions: •What is a purpose of re-expressing data? •What can we check to see that the re-expression makes the linear model appropriate?

Upload: mavis-spencer

Post on 05-Jan-2016

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

DO NOW

• Read Pages 222 – 224

• Stop before “Goals of Re-expression”

• Answer the following questions:

• What is a purpose of re-expressing data?

• What can we check to see that the re-expression makes the linear model appropriate?

Page 2: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:
Page 3: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

 Previou

s Average

Current Average

Total Change

Failures Thanksgiving EC Points

# of Students w/

HW EC

3rd 78% 83% 5 pts 1 145 11

4th 74% 77% 3 pts 3 99 5

5th 74% 81% 7 pts 1 197 12

Page 4: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

HW CHECK - #7

7. Movie Dramas.

a) The units for the slopes of these lines are millions of dollars per minutes of running time.

b) The slopes of the regression lines are the same. Dramas and movies from other genres have costs for longer movies that increase at the same rate.

c) The regression line for dramas has a lower y-intercept. Regardless of running time, dramas cost about 20 million dollars less than other genres of movies of the same running time.

Page 5: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

HW CHECK - #8

8. Movie Ratings.

a) The slopes of the regression lines are approximately the same. The costs increase at about the same rate for all genres as the movies get longer.

b) Although the costs per minute are about the same, it costs about 20 million dollars less to make an R-rated movies than a movie of the other rating type with the same running time.

c) Omitting King Kong would make the slope for the PG-13 movies steeper. We would conclude that the cost per minute of PG-13 movies was greater than the cost per minute of movies with other rating.

Page 6: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

HW CHECK - #12 D

1) The point has high leverage and a small residual.

2) The point is not influential. It has the potential to be influential, because its position far from the mean of the explanatory variable gives it high leverage. However, the point is not exerting much influence, because it reinforces the association.

3) If the point were removed, the correlation would become weaker. The point heavily reinforces the association. Removing it would weaken the association.

4) The slope would remain roughly the same, since the point is not influential.

Page 7: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

HW CHECK - #16

Suppose that researchers find a moderately strong positive correlation between the amount of time that a child spends playing

computer games and the aggressiveness they display at school.

16. What’s the effect?

1) Playing computer games may make kids more violent.

2) Violent kids may like to play computer games.

3) Playing computer games and violence may both be caused by a lurking variable such as the child’s home life or a genetic predisposition to aggressiveness.

Page 8: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 8

STRAIGHT TO THE POINT (CONT.)

The relationship between fuel efficiency (in miles per gallon) and weight (in pounds) for late model cars looks fairly linear at first:

Page 9: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 9

STRAIGHT TO THE POINT (CONT.)

A look at the residuals plot shows a problem:

Page 10: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

CONVERTING UNITS

• 3 feet = ____ inches

• 30 inches = ____ feet

• 50 yards = ____ feet

• Does changing the units change the meaning of the quantity?

Page 11: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 11

STRAIGHT TO THE POINT (CONT.)

We can re-express fuel efficiency as gallons per hundred miles (a reciprocal) and eliminate the bend in the original

scatterplot:

Page 12: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 12

STRAIGHT TO THE POINT (CONT.)

A look at the residuals plot for the new model seems more reasonable:

Page 13: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 13

STRAIGHT TO THE POINT

• We cannot use a linear model unless the relationship between the two variables is linear. Often re-expression can save the day, straightening bent relationships so that we can fit and use a simple linear model.

• Two simple ways to re-express data are with logarithms and reciprocals.

• Re-expressions can be seen in everyday life—everybody does it.

Page 14: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 14

WHAT CAN GO WRONG?

• Beware of multiple modes.

• Re-expression cannot pull separate modes together.

• Watch out for scatterplots that turn around.

• Re-expression can straighten many bent relationships, but not those that go up then down, or down then up.

Page 15: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 15

GOALS OF RE-EXPRESSION

Goal 1: Make the distribution of a variable (as seen in its histogram, for example) more symmetric.

Page 16: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 16

GOALS OF RE-EXPRESSION (CONT.)

Goal 2: Make the spread of several groups (as seen in side-by-side boxplots) more alike, even if their centers

differ.

Page 17: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 17

GOALS OF RE-EXPRESSION (CONT.)

Goal 3: Make the form of a scatterplot more nearly linear.

Page 18: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

Slide 10 - 18

GOALS OF RE-EXPRESSION (CONT.)

Goal 4: Make the scatter in a scatterplot spread out evenly rather than thickening at one end.

• This can be seen in the two scatterplots we just saw with Goal 3:

Page 19: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

TEXTBOOK #1Suppose you have fit a linear model to some data and now

take a look at the residuals. For each of the following possible residuals plots, tell whether you would try a re-

expression and, if so, why.

Page 20: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

TEXTBOOK #3Here is the residual plot for a linear model describing the trend

in the number of passengers departing from the Oakland (CA) airport each month since the start of 1997.

1. Can you account for the pattern shown here?

2. Would a re-expression help us deal with this pattern? Explain.

Page 21: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

TEXTBOOK #7One of the important factors determining a car’s Fuel

Efficiency is its Weight. Let’s examine this relationship again, for 11 cars.

Describe the association between these variables shown in the scatterplot.

Page 22: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

TEXTBOOK #7One of the important factors determining a car’s Fuel

Efficiency is its Weight. Let’s examine this relationship again, for 11 cars.

The linear model for this data isFuel Efficiency = 47.96 – 7.65Weight.

What does the slope of the line say about this relationship?

Page 23: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

TEXTBOOK #7One of the important factors determining a car’s Fuel

Efficiency is its Weight. Let’s examine this relationship again, for 11 cars.

The linear model for this data isFuel Efficiency = 47.96 – 7.65Weight.

Let’s examine the residuals plot for this linear regression. Is this model appropriate?

Page 24: DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:

TEXTBOOK #7Let’s re-express the variable Fuel Consumption (gal/100 mi)

to examine the fuel efficiency of the 11 cars.

The revised linear regression isFuel Efficiency = 1.77 + 0.62 Weight.

Explain why this model appears to be better than the linear model.

Interpret the slope of this line.