taming the ap statistics investigative task daren starnes the lawrenceville school...

32
Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School [email protected] g CMC South Oct 2015

Upload: amberly-rose

Post on 17-Jan-2016

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Taming the AP Statistics Investigative Task

Daren Starnes

The Lawrenceville School

[email protected]

CMC South Oct 2015

Page 2: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

AP Statistics Teachers Meeting

• In this room at 12:30.• Grab a bite to eat and come join us for a

panel discussion and some fabulous give-aways from the publishers.

Page 3: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Who are we?• Please raise your hand if you…

– Are an AP Statistics teacher– Are a high school, non-AP statistics teacher– Are a college intro stats instructor– Have taught statistics for 0-1 years– Have taught statistics for 2-4 years– Have taught statistics for 5-9 years– Have taught statistics for 10 or more years– Have graded AP Statistics exams– Want to get this workshop started!

Page 4: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

The Investigative Task“In the free-response section of the AP Statistics Exam, students are asked to answer five questions and complete an investigative task. Each question is designed to be answered in approximately 12 minutes. The longer investigative task is designed to be answered in approximately 30 minutes. The purpose of the investigative task is not only to evaluate the student’s understanding in several content areas but also to assess his or her ability to integrate statistical ideas and apply them in a new context or in a nonroutine way.”

AP Statistics Course Description

Page 5: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Why are we here?

• The Investigative Task counts for 25% of a student’s Free Response section score.

• How can we prepare our students to successfully do the kind of outside-the-box thinking that is required on the investigative task?

Page 6: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

2011 Exam (Form B) Question 66. Grass buffer strips are grassy areas that are planted between bodies of water and agricultural fields. These strips are designed to filter out sediment, organic material, nutrients, and chemicals carried in runoff water. The figure below shows a cross-sectional view of a grass buffer strip that has been planted along the side of a stream.

Page 7: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

2011 Exam (Form B) Question 6

A study in Nebraska investigated the use of buffer strips of several widths between 5 feet and 15 feet. The study results indicated a linear relationship between the width of the grass strip (x), in feet, and the amount of nitrogen removed from the runoff water (y), in parts per hundred. The following model was estimated.

Page 8: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (a)

(a)Interpret the slope of the regression line in the context of this question.

For each additional foot that is added to the width of the grass buffer strip, an additional 3.6 parts per hundred of nitrogen is removed on average from the runoff water.

Page 9: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (b)

(b) Would you be willing to use this model to predict the amount of nitrogen removed for grass buffer strips with widths between 0 feet and 30 feet? Explain why or why not.

No. This is extrapolation beyond the range of data from the experiment. Buffer strips narrower than 5 feet or wider than 15 feet were not investigated.

Page 10: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Now things get interesting…

Suppose the scientist decides to use buffer strips of width 6 feet at each of four locations and buffer strips of width 13 feet at each of the other four locations. Assume the model, , estimated from the Nebraska study is the true regression line in California and the observations at the different locations are normally distributed with standard deviation of 5 parts per hundred.

Page 11: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

The Big Idea behind parts (c) and (d)

Page 12: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (c)(c) Describe the sampling distribution of the sample mean of the observations on the amount of nitrogen removed by the four buffer strips with widths of 6 feet.

Because the distribution of nitrogen removed for any particular buffer strip width is normally distributed with a standard deviation of 5 parts per hundred, the sampling distribution of the mean of four observations when the buffer strips are 6 feet wide will be normal with mean 33.8 + 3.6 × 6 = 55.4 parts per hundred and a standard deviation of parts per hundred.

Page 13: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (d)(d) Using your result from part (c), show how to construct an interval that has probability 0.95 of containing the sample mean of the observations from four buffer strips with widths of 6 feet.The distribution of the sample mean is normal, so the interval that has probability 0.95 of containing the mean nitrogen content removed from four buffer strips of width 6 feet extends from 55.4 - 1.96 × 2.5 = 50.5 parts per hundred to 55.4 + 1.96 × 2.5 = 60.3 parts per hundred.

Page 14: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

And for the next twist…For the study plan being implemented by the scientist in California, the graph on the left below displays intervals that each have probability 0.95 of containing the sample mean of the four observations for buffer strips of width 6 feet and for buffer strips of width 13 feet. A second possible study plan would use buffer strips of width 8 feet at four of the eight locations and buffer strips of width 10 feet at the other four locations. Intervals that each have probability 0.95 of containing the mean of the four observations for buffer strips of width 8 feet and for buffer strips of width 10 feet, respectively, are shown in the graph on the right below.

Page 15: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

If data are collected for the first study plan, a sample mean will be computed for the four observations from buffer strips of width 6 feet and a second sample mean will be computed for the four observations from buffer strips of width 13 feet. The estimated regression line for those eight observations will pass through the two sample means. If data are collected for the second study plan, a similar method will be used.

Page 16: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (e)

(e) Use the plots above to determine which study plan, the first or the second, would provide a better estimator of the slope of the regression line. Explain your reasoning.

Page 17: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (e) Solution

Page 18: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (f)(f) The previous parts of this question used the assumption of a straight-line relationship between the width of the buffer strip and the amount of nitrogen that is removed, in parts per hundred. Although this assumption was motivated by prior experience, it may not be correct. Describe another way of choosing the widths of the buffer strips at eight locations that would enable the researchers to check the assumption of a straight-line relationship.

Page 19: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (f) Solution

To assess the linear relationship between width of

the buffer strip and the amount of nitrogen removed from runoff water, more widths should be used. To detect a nonlinear relationship, it would be best to use buffer widths that were spaced out over the entire range of interest. For example, if the range of interest is 6 to 13 feet, eight buffers with widths 6, 7, 8, 9, 10, 11, 12 and 13 feet could be used.

Page 20: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Intermission?!

What are some strategies we can use to help our students develop competence and confidence with investigative tasks like this one?

Page 21: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

2009 Exam Question 6

A consumer organization was concerned that an automobile manufacturer was misleading customers by overstating the average fuel efficiency (measured in miles per gallon, or mpg) of a particular car model. The model was advertised to get 27 mpg. To investigate, researchers selected a random sample of 10 cars of that model. Each car was then randomly assigned a different driver. Each car was driven for 5,000 miles, and the total fuel consumption was used to compute mpg for that car.

Page 22: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (a)

(a)Define the parameter of interest and state the null and alternative hypotheses the consumer organization is interested in testing.

The parameter of interest is µ = population mean miles per gallon (mpg) of a particular car model. The null and alternative hypotheses are as follows:

Page 23: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

One condition for conducting a one-sample t-test in this situation is that the mpg measurements for the population of cars of this model should be normally distributed. However, the boxplot and histogram shown below indicate that the distribution of the 10 sample values is skewed to the right.

Page 24: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (b)

(b) One possible statistic that measures skewness is the ratio .

What values of that statistic (small, large, close to one) might indicate that the population distribution of mpg values is skewed to the right? Explain.If the distribution is right-skewed, one would expect the mean to be greater than the median. Therefore the ratio should be large (greater than 1).

Page 25: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (c)

Even though the mpg values in the sample were skewed to the right, it is still possible that the population distribution of mpg values is normally distributed and that the skewness was due to sampling variability. To investigate, 100 samples, each of size 10, were taken from a normal distribution with the same mean and standard deviation as the original sample.

Page 26: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (c)For each of those 100 samples, the statistic was calculated. A dotplot of the 100 simulated statistics is shown below.

In the original sample, the value of the statistic sample mean sample median was 1.03.

Page 27: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (c)Based on the value of 1.03 and the dotplot above, is it plausible that the original sample of 10 cars came from a normal population, or do the simulated results suggest the original population is really skewed to the right? Explain.

Page 28: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (c) Solution

Because we are testing for right-skewness, the estimated p-value will be the proportion of the simulated statistics that are greater than or equal to the observed value of 1.03. The dotplot shows that 14 of the 100 values are more than 1.03. Because this simulated p-value (0.14) is larger than any reasonable significance level, we do not have convincing evidence that the original population is skewed to the right and conclude that it is plausible that the original sample came from a normal population.

Page 29: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (d)The table below shows summary statistics for mpg measurements for the original sample of 10 cars.

Choosing only from the summary statistics in the table, define a formula for a different statistic that measures skewness. What values of that statistic might indicate that the distribution is skewed to the right? Explain.

Page 30: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

Part (d) Solution

One possible statistic is

If the distribution is right-skewed, one would expect the distance from the median to the maximum to be larger than the distance from the median to the minimum; thus the ratio should be greater than 1.

Page 31: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

0 1 2 3

Send your text message to this Phone Number: 37607

Strongly Disagree

Strongly Agree

Disagree Agree

Speaker was well-prepared and

knowledgeable (0-3)

Speaker was engaging and an effective presenter (0-3)

Session matched title and description in

program book (0-3)

Other comments, suggestions, or

feedback (words)

___ ___ ___ ___________ __4285___

Example: 4285 323 Inspiring, good content

poll code for this session

(no spaces)

Non-Example: 4285 3 2 3 Inspiring, good content

(1 space) (1 space)

Non-Example: 4285 3-2-3Inspiring, good content

Page 32: Taming the AP Statistics Investigative Task Daren Starnes The Lawrenceville School dstarnes@lawrenceville.org CMC South Oct 2015

How Did We Do?

• Questions and answers• Parting thoughts

E-mail me with comments or questions:

[email protected]