ap statistics review two population proportions stoney pryor

52
AP Statistics Review Two Population Proportions Stoney Pryor

Upload: blaze-bridges

Post on 17-Dec-2015

231 views

Category:

Documents


0 download

TRANSCRIPT

AP Statistics Review

Two Population Proportions

Stoney Pryor

Stoney Pryor

• Husband for 18 years, father of 3• Teacher in CSISD for 20 years• Taught AP Statistics since 1998• Only 6 sophomores two years ago, 14 students

last year, about 90 this year• Varsity football coach for 17 years, including 6

years as offensive coordinator• Head girls soccer coach since 1999.

Two Population Proportions

Multiple Choice

Experimental Design

• Some experiments have a control group (a group of experimental units that receive no treatment or receive only a placebo), but this is not necessary for a well-designed experiment.

Experimental Units

• Experimental units are the smallest independent “objects” to which treatments are assigned and on which a response is measured. – Consider an experiment that is designed to determine

which of several types of fish food will result in the greatest weight gain for fish. If tanks contain several fish, and food is added to the water in the tank, then the tank is the experimental unit (not the individual fish), since the fish in a tank are not independent of one another, but tanks are independent of one another.

Replication

• Replication refers to having multiple experimental units in each treatment group (repeating the treatment), not to repeating the entire experiment.

1. Which of the following is a key distinction between well designed experiments and observational studies?

A. More subjects are available for experiments than for observational studies. B. Ethical constraints prevent large-scale observational studies. C. Experiments are less costly to conduct than observational studies. D. An experiment can show a direct cause-and-effect relationship, whereas an observational study cannot. E. Tests of significance cannot be used on data collected from an observational study.

2. A new medication has been developed to treat sleep-onset insomnia (difficulty in falling asleep). Researchers want to compare this drug to a drug that has been used in the past by comparing the length of time it takes subjects to fall asleep. Of the following, which is the best method for obtaining this information? A. Have subjects choose which drug they are willing to use, then compare the results. B. Assign the two drugs to the subjects on the basis of their past sleep history without randomization, then compare the results. C. Give the new drug to all subjects on the first night. Give the old drug to all subjects on the second night. Compare the results. D. Randomly assign the subjects to two groups, giving the new drug to one group and no drug to the other group, then compare the results. E. Randomly assign the subjects to two groups, giving the new drug to one group and the old drug to the other group, then compare the results.

3. A researcher wishes to test a new drug developed to treat hypertension (high blood pressure). A group of 40 hypertensive men and 60 hypertensive women is to be used. The experimenter randomly assigns 20 of the men and 30 of the women to the placebo and assigns the rest to the treatment. The major reason for separate assignment for men and women is that A. it is a large study with 100 subjects B. the new drug may affect men and women differently C. the new drug may affect hypertensive and nonhypertensive people differently D. this design uses matched pairs to detect the new-drug effect E. there must be an equal number of subjects in both the placebo group and the treatment group

4. Automobile brake pads are either metallic or nonmetallic. An experiment is to be conducted to determine whether the stopping distance is the same for both types of brake pads. In previous studies, it was determined that car size (small, medium, large) is associated with stopping distance, but car type (sedan, wagon, coupe) is not associated with stopping distance. This experiment would best be done A. by blocking on car size B. by blocking on car type C. by blocking on stopping distance D. by blocking on brake pad type E. without blocking

5. A group of students has 60 houseflies in a large container and needs to assign 20 to each of three groups labeled A, B, and C for an experiment. They can capture the flies one at a time when the flies enter a side chamber in the container that is baited with food. Which of the following methods will be most likely to result in three comparable groups of 20 houseflies each? A. Label the first 20 flies caught as group A, the second 20 caught as group B, and the third 20 caught as group C.

B. Write the letters A, B, and C on separate slips of paper. Randomly pick one of the slips of paper and assign the first 20 flies caught to that group. Pick another slip and assign the next 20 flies to that group. Assign the remaining flies to the remaining group.

C. When each fly is caught, roll a die. If the die shows an even number, the fly is labeled A. If the die shows an odd number, the fly is labeled B. When 20 flies have been labeled A and 20 have been labeled B, the remaining flies are then labeled C.

5. A group of students has 60 houseflies in a large container and needs to assign 20 to each of three groups labeled A, B, and C for an experiment. They can capture the flies one at a time when the flies enter a side chamber in the container that is baited with food. Which of the following methods will be most likely to result in three comparable groups of 20 houseflies each? D. Place each fly in its own numbered container (numbered from 1 to 60) in the order that it was caught. Write the number from 1 to 60 on slips of paper, put the slips in a jar, and mix them well. Pick 20 numbers out of the jar. Assign the flies in the containers with those numbers to group A. Pick 20 more numbers and assign the flies in the containers with those numbers to group B. Assign the remaining 20 flies to group C.

E. When each fly is caught, roll a die. If the die shows a 1 or 2, the fly is labeled A. If the die shows a 3 or 4, the fly is labeled B. If the die shows a 5 or 6, the fly is labeled C. Repeat this process for all 60 flies.

Free Response

6. When a tractor pulls a plow through an agricultural field, the energy needed to pull that plow is called the draft. The draft is affected by environmental conditions such as soil type, terrain, and moisture. A study was conducted to determine whether a newly developed hitch would be able to reduce draft compared to the standard hitch. (A hitch is used to connect the plow to the tractor.) Two large plots of land were used in this study. It was randomly determined which plot was to be plowed using the standard hitch. As the tractor plowed that plot, a measurement device on the tractor automatically recorded the draft at 25 randomly selected points in the plot. After the plot was plowed, the hitch was changed from the standard one to the new one, a process that takes a substantial amount of time. Then the second plot was plowed using the new hitch. Twenty-five measurements of draft were also recorded at randomly selected points in this plot.

(a) What was the response variable in this study? The response variable was the amount of draft. Identify the treatments. The two treatments were the standard hitch and the new hitch. What were the experimental units? The experimental units were the two large plots of land.

(b) Given that the goal of the study is to determine whether a newly developed hitch reduces draft compared to the standard hitch, was randomization used properly in this study? Justify your answer.

Yes, the two hitches (treatments) were randomly assigned to the two plots (experimental units).

(c) Given that the goal of the study is to determine whether a newly developed hitch reduces draft compared to the standard hitch, was replication used properly in this study? Justify your answer.

No, each treatment (type of hitch) was applied to only one experimental unit (plot of land). Replication is used to repeat the treatments on different experimental units so general patterns can be observed. There is no replication in this study.

(d) Plot of land is a confounding variable in this experiment. Explain why.

Although 25 measurements were taken at different locations in the two plots, each hitch was used in one plot (experimental unit) only. Thus, if a difference in the draft is observed we will not know whether the difference is due to the hitch or the plot. In statistical language, the treatments (hitches) are confounded with the plots.

7. Because of concerns about employee stress, a large company is conducting a study to compare two programs (tai chi or yoga) that may help employees reduce their stress levels. Tai chi is a 1,200 year old practice, originating in China, that consists of slow, fluid movements. Yoga is a practice, originating in India, that consists of breathing exercises and movements designed to stretch and relax muscles. The cpmany has assembled a group of volunteer employees to participate in the study during the first half of their lunch hour each day for a 10 week period. Each volunteer will be assigned at random to one of the two programs. Volunteers will have their stress levels measured just before beginning the program and 10 weeks later at the completion of it.

(a) A group of volunteers who work together ask to be assigned to the same program so that they can participate in that program together. Give an example of a problem that might arise if this is permitted. Explain to this volunteer group why random assignment to the two programs will address this problem.

For example, a deadline in the department where the group of volunteers works has been moved back, lowering the stress levels of those working in the department. If the volunteers from this department were all in the same treatment group, this change in stress level could mistakenly be attributed to the treatment. Without random assignment of volunteers to the two programs, it is possible that the two treatment groups could differ in some way that affects the outcome of the experiment. Randomization "evens out" the possible effects of potentially confounding variables.

(b) Someone proposes that a control group be included in the design as well. The stress level would be measured for each volunteer assigned to the control group at the start of the study and again 10 weeks later. What additional information, if any, would this provide about the effectiveness of the two programs?

Without the control group, the company could compare the two treatments, but would not be able to say whether the observed reduction in stress was attributable to participation in the programs. For example, a change in the work environment during this period might have reduced the stress level of all employees. The addition of a control group would enable the company to assess the magnitude of the mean reduction attributable to each treatment, as opposed to just determining if the two programs differ.

(c) Is it reasonable to generalize the findings of this study to all employees of this company? Explain.

It is not reasonable to generalize the findings of this study to all employees, because the participants in this experiment were volunteers and volunteers may not be representative of the population OR the participants were not randomly selected from the company employees.

In search of a mosquito repellent that is safer than the ones that are currently on the market, scientists have developed a new compound that is rated as less toxic than the current compound, thus making a repellent that contains this new compound safer for human use. Scientists also believe that a repellent containing the new compound will be more effective than the ones that contain the current compound. To test the effectiveness of the new compound versus that of the current compound, scientists have randomly selected 100 people from a state. Up to 100 bins, with an equal number of mosquitoes in each bin, are available for use in the study. After a compound is applied to a participant’s forearm, the participant will insert his or her forearm into a bin for 1 minute, and the number of mosquito bites on the arm at the end of that time will be determined.

(a) Suppose this study is to be conducted using a completely randomized design. Describe a randomization process and identify an inference procedure for the study.

Each of the 100 selected people will be assigned a unique random number using a random number generator. A list of names and numbers will be created and sorted from smallest to largest by the assigned numbers (and carrying along the name). The first 50 people on the list will be asked to apply the new compound to their right arm. The compounds will be put in identical, unmarked tubes so neither the participants nor the researchers know which compound is being applied. The analysts will be the only people who know which participants received which compound. Each person will be randomly assigned to bins by assigning random numbers to bins using a random number generator. The first person on the list will be assigned to the bin with the smallest number, the second person on the list will be assigned to the bin with the second smallest number, and so on. After each person inserts his or her right arm into the assigned bin for one minute, the number of mosquito bites will be counted. The mean number of mosquito bites for the two compounds will be compared using a two-sample t-test and/or a confidence interval for the difference in means for two independent samples.

(b) Suppose this study is to be conducted using a matched-pairs design. Describe a randomization process and identify an inference procedure for the study.

Each participant will be randomly assigned to a bin as describe in part (a). The researchers will distribute two identical tubes, one labeled 1 and the other labeled 2, to each participant. One of those tubes will contain the new compound and the other will contain the current compound. Neither the researchers nor the participants will know which compound is in which tube. Only the analyst will have this information. Each participant will apply one compound to one arm and the other compound to the other arm. The assignment of the compounds to the arms is completed using randomization. A random number will be generated for each participant, and the participants with the 50 smallest random numbers will apply tube 1 to their right arm, and the remaining 50 participants will apply tube 2 to their left arm. Each participant will insert both arms into the assigned bin at the same time for one minute, and the number of mosquito bites will be counted on each arm. The analyst will compute the difference in the number of bites (new – current) for each of the 100 participants and use a one-sample t-test and/or construct a confidence interval for the mean of the differences to test the null hypothesis that the mean difference is zero.

(c) Which of the designs, the one in part (a) or the one in part (b), is better for testing the effectiveness of the new compound versus that of the current compound? Justify your answer.

The matched-pairs design in part (b) is better because one potential source of variation, person-to-person variability in susceptibility to mosquito bites, is controlled.