randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(d)...

22
STAT 100, Section 4 Sample Final Exam Questions Fall 2008 The following questions are similar to the types of questions you will see on the final exam. The actual final will consist of 70 to 80 multiple-choice questions. NOTE: On the actual exam, the choices will not require calculator math. On this sample exam, some of the choices may require a calculator. Chapter 1 Question 1. The importance of randomized experiments is that they: (A) Require only small sample sizes. (B) Allow the statistician to assign units to the treatment group. (C) Allow the inference of causation. (D) None of the above. Question 2. In a statistical study, the sample is: (A) The group of people or objects for which conclusions are to be made. (B) A subset of people in the United States. (C) The subset of the population on which the study collects data. (D) The collection of data in sample surveys. Question 3. To estimate the unemployment rate, the Bureau of Labor Statistics contacts about 60,000 households chosen randomly from a list of all known households. The Bureau asks each adult in every sampled household whether they are in the labor force, i.e., employed or seeking employment. The population of interest is: (A) All known or unknown households. (B) All adults who are in the labor force. (C) All employed adults. (D) All adults who refused to participate in the survey. Question 4. In a statistical study, the population is: (A) The group of people from whom data cannot be collected. (B) The people or objects studied in the sample survey. (C) The group of people or objects for which conclusions are to be made. (D) All people in the United States. Chapter 3 1

Upload: others

Post on 01-Jan-2021

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

STAT 100, Section 4 Sample Final Exam Questions Fall 2008

The following questions are similar to the types of questions you will see on the finalexam. The actual final will consist of 70 to 80 multiple-choice questions.

NOTE: On the actual exam, the choices will not require calculator math.On this sample exam, some of the choices may require a calculator.

Chapter 1

Question 1. The importance of randomized experiments is that they:

(A) Require only small sample sizes.(B) Allow the statistician to assign units to the treatment group.(C) Allow the inference of causation.(D) None of the above.

Question 2. In a statistical study, the sample is:

(A) The group of people or objects for which conclusions are to be made.(B) A subset of people in the United States.(C) The subset of the population on which the study collects data.(D) The collection of data in sample surveys.

Question 3. To estimate the unemployment rate, the Bureau of Labor Statistics contactsabout 60,000 households chosen randomly from a list of all known households. TheBureau asks each adult in every sampled household whether they are in the laborforce, i.e., employed or seeking employment. The population of interest is:

(A) All known or unknown households.(B) All adults who are in the labor force.(C) All employed adults.(D) All adults who refused to participate in the survey.

Question 4. In a statistical study, the population is:

(A) The group of people from whom data cannot be collected.(B) The people or objects studied in the sample survey.(C) The group of people or objects for which conclusions are to be made.(D) All people in the United States.

Chapter 3

1

Page 2: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 5. An advertiser of No-Pain aspirin claims it is the pain-killer most preferredby consumers. This claim was based on a consumer survey in which the choices were:Lavid, Acinna, No-Pain, and Yellnot. This is an example of:

(A) An easy question.(B) An open question.(C) A difficult question.(D) A closed question.

Question 6. When measured with extreme accuracy, the variable “height of a building”is:

(A) A discrete quantitative variable(B) A nominal categorical variable(C) A continuous quantitative variable(D) An ordinal categorical variable

Question 7. Which of the following measures is valid and categorical?

(A) Time on a clock that is always 10 minutes fast(B) The sale price of a house(C) Sex (male or female)(D) Weight of an individual on a scale that is sometimes 5 pounds too light, sometimes 5

pounds too heavy

Question 8. In a study of car ownership in Pennsylvania, the variable “Brand of carowned” is:

(A) A discrete quantitative variable(B) A nominal categorical variable(C) A continuous quantitative variable(D) An ordinal categorical variable

Chapter 4

Question 9. The Gallup Poll, a well-known polling group, regularly surveys the pub-lic on many issues. The number of interviews on which their surveys are based is,approximately:

(A) 60,000 – 120,000(B) 60–120(C) 600,000 – 1,200,000(D) The entire U.S. population.(E) 600 – 1,200

2

Page 3: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 10. To survey the opinions of its customers, a supermarket grouped its cus-tomers by the days when they did most of their shopping. The supermarket randomlyselected two such groups, and asked all customers in those two groups to complete asurvey. This (biased) method of sampling is called:

(A) Stratified random sampling.(B) Systematic sampling.(C) Random digit dialing.(D) Cluster sampling.

Question 11. A radio advertiser wishes to choose a sample of size 100 from a populationof 5000 listeners. He divides the population into five separate groups and then selectsa simple random sample from each group. This method of sampling is called:

(A) Simple random sampling.(B) Systematic random sampling.(C) Volunteer sampling.(D) Stratified random sampling.

Question 12. To obtain a margin of error of 1% we need a sample of size:

(A) 1,000(B) 100,000(C) 10,000(D) 100

Question 13. A simple random sample of ten subjects from a population is one in which:

(A) Any group of ten subjects has the same chance of being the selected sample.(B) A simple way was found to choose ten subjects at random from the population.(C) Most, but not all, groups of size ten have the same chance of being selected.(D) We record the data provided by the first ten subjects who respond to the survey.

Question 14. A radio advertiser wishes to choose a sample of size 100 from a populationof 5000 listeners. He divides the population into a large number of groups. He selectsa simple random sample of the groups and then surveys every subject in each of thegroups selected. This method of random sampling is called:

(A) Cluster sampling.(B) Random digit dialing.(C) Stratified random sampling.(D) Systematic random sampling.

3

Page 4: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 15. A supermarket manager wants to know if customers would pay slightlyhigher prices to have computers available throughout the store to help them locateitems. An interviewer is posted at the entrance door and asked to collect a sample of100 opinions by asking questions of the next person who came to the door each timeshe had completed an interview. This method of sampling is called:

(A) Cluster sampling.(B) Stratified random sampling.(C) Random digit dialing.(D) Convenience sampling.(E) Systematic sampling.

Question 16. A radio advertiser wishes to choose a sample of size 100 from a populationof 5000 listeners. After observing that 5, 000÷ 50 = 100, he first selects a subject atrandom from the first 50 names in the sampling frame, and then he selects every 50thsubject listed after that one. This method of sampling is called:

(A) Stratified random sampling.(B) Simple random sampling.(C) Cluster random sampling.(D) Systematic random sampling.

Question 17. In a study of PSU students’ awareness of world issues, a television crewsampled students sitting in the Hub at 12:00 p.m. (noon). This survey is an exampleof:

(A) A simple random sample, because it is simple to find students at the Hub in a randommanner.

(B) A haphazard (or convenience) sample, because the Hub is a convenient place to findstudents.

(C) A stratified random sample, because students are stratified by gender and they arefound at the Hub in a random manner.

(D) A volunteer response sample because, to be seen on the evening news, students willeagerly volunteer their responses.

Question 18. If a simple random sample of 1,600 subjects is chosen then an approximatemargin of error for making inferences about a percentage of the entire population is:

(A) 1/1, 600, or 0.06%.(B) Impossible to calculate without knowing the percentage observed in the sample.(C) Very high, because the sample size is a tiny percentage of the population size.(D) Approximately zero because the sample size is so large.

(E) 1/√

1, 600, or 2.5%.

Chapter 5

4

Page 5: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 19. To test the effects of sleepiness on driving performance, twenty volunteerstook a simulated driving test under each of three conditions: Well-rested, Sleepy, andExhausted. The order in which each volunteer took the three tests was randomized,and an evaluator rated their driving accuracy without knowing the condition of thevolunteer. This type of experiment is a:

(A) Single-blind, matched-pair experiment.(B) Single-blind, block design experiment.(C) Retrospective observational study.(D) Double-blind, matched-pair experiment.

Question 20. The term “control” means that:

(A) There must be a basis for making comparisons.(B) We control carefully the cost of performing the experiment.(C) We control carefully the observed outcomes of the experiment.(D) There is a need to control the subjects of the experiment.

Question 21. It has been observed that participants in a statistical experiment sometimesrespond differently than they otherwise would because they know that they are in anexperiment. This phenomenon is called the:

(A) Interacting effect.(B) Confounding effect.(C) Placebo effect.(D) Hawthorne effect.

Question 22. To test the effects of sleepiness on driving performance, twenty volunteerstook a simulated driving test under each of three conditions: Well-rested, Sleepy, andExhausted. The order in which each volunteer took the three tests was randomized,and an evaluator rated their driving accuracy without knowing the condition of thevolunteer. The explanatory variable is:

(A) The order in which the volunteers took the tests.(B) The condition of sleepiness.(C) The evaluator’s rating of the driver’s condition.(D) The evaluator’s rating of driving accuracy.

Question 23. The term “randomization” means that we:

(A) We study only a random subset of all observed outcomes of the experiment.(B) Allow subjects to assign themselves randomly to the placebo or treatment group.(C) Use a chance mechanism to assign subjects to the treatment and control groups.(D) Assign the subjects to the random experiment in a systematic manner.

5

Page 6: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 24. A researcher asks 1,600 randomly chosen doctors whether or not they takeaspirin regularly. She also asks them to estimate the number of headaches they havehad in the past six months, and compares the number of headaches reported by thosewho take aspirin regularly to the number of headaches reported by those who do nottake aspirin regularly. In this study, the number of headaches is a:

(A) Response variable.(B) Explanatory variable.(C) Confounding variable.(D) None of the above.

Question 25. In 1982, 490,000 subjects were asked about their drinking habits. Re-searchers tracked subjects’ death rates until 1991, and found that adults who regularlyhad one alcoholic drink daily had a lower death rate than those who did not drink.Most of the subjects were middle-class, married, and college-educated.

This experiment was:

(A) A randomized experiment; subjects were assigned in a randomized manner to haveone alcoholic drink each day.

(B) An observational study; it would not be ethical for the researchers to randomly assignsubjects to drink alcohol or not.

(C) Based on a stratified random sample; subjects were stratified randomly into overlap-ping groups according to whether or not they had one drink daily.

(D) Accurate; there is fundamentally solid anecdotal evidence that people’s health willimprove if they have one drink daily.

Question 26. In 1982, 490,000 subjects were asked about their drinking habits. Re-searchers tracked subjects’ death rates until 1991, and found that adults who regu-larly had one alcoholic drink daily had a lower death rate than those who did notdrink. Most of the subjects were middle-class, married, and college-educated. Thisexperiment is an example of:

(A) A block design; many groups were compared, e.g., married vs. not married, middle-class vs. not middle-class.

(B) A prospective study; after recording their drinking habits, subjects were studied intothe future.

(C) A matched pairs design; two groups were compared, those who had one drink dailyand those who did not.

(D) A retrospective study; after the nine-year period, survivors were asked about theirdrinking habits.

Question 27. A researcher asks 1,600 randomly chosen doctors whether or not they takeaspirin regularly. She also asks them to estimate the number of headaches they havehad in the past six months, and compares the number of headaches reported by thosewho take aspirin regularly to the number of headaches reported by those who do nottake aspirin regularly. This type of study design is a:

(A) Retrospective observational study.(B) Randomized experiment.(C) Census.(D) Prospective study.

6

Page 7: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 28. It has been observed that participants in a statistical experiment sometimesrespond positively to a placebo, a substance which has no active ingredients. Thisphenomenon is called the:

(A) Placebo effect.(B) Interacting effect.(C) Hawthorne effect.(D) Confounding effect.

Chapter 7

Question 29. For any data set, the proportion of the data that falls at or above themedian is:

(A) 95%(B) 99.7%(C) 25%(D) 50%(E) Dependent on the sample which was drawn from the population.

Question 30. When a distribution is greatly skewed to the right (i.e., has a long righttail but a comparitively short left tail), the median will usually be:

(A) In no relation whatsoever to the mean.(B) Smaller than the mean.(C) Larger than the mean.(D) Exactly equal to the mean.

Question 31. A sample of 28 temperature measurements in ◦F, all taken at 12:00 p.m.,was collected in a coastal town in NC. A second sample of 28 temperature readingsin a GA coastal town was also collected. Each GA measurement was recorded at thesame time and date as the NC data value, and turned out to be exactly 5◦F higherthan the corresponding NC measurement. We calculate the standard deviation (S.D.)of the two data sets and conclude that:

(A) The two data sets have the same standard deviations.(B) The S.D. of the NC data exceeds the S.D. of the GA data by 5◦F.(C) The S.D. of the GA data exceeds the S.D. of the NC data by 5◦F.(D) There is not enough information to determine the relationship between the two S.D.’s.

Question 32. For any data set, the largest number in the five-number summary is the:

(A) Third quartile.(B) Maximum.(C) Standard deviation.(D) Outlier.(E) Interquartile range.

7

Page 8: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 33. In a boxplot (oriented vertically, with the maximum at the top and theminimum at the bottom), the proportion of the data falling between the top edge andthe bottom edge of the box itself is

(A) 25%(B) 99.7%(C) 50%(D) Impossible to say without more information(E) 68%

Question 34. A sample of 28 temperature measurements in ◦F, all taken at 12:00 p.m.,was collected in a coastal town in NC. The data are given in the following stemplot:

Stemplot of Temperature Readings

3 25 56 0 1 2 4 4 87 3 5 5 6 8 8 9 98 0 0 2 3 4 5 89 0 2 3 5 8

For the data given in this stemplot, the five-number summary is:

(A) 55, 66, 78.5, 84.5, 95(B) 32, 68, 79, 88, 98(C) 32, 66, 78.5, 84.5, 98(D) 32, 66, 78, 84, 98

Question 35. For any data set, the standard deviation is:

(A) A measure of central tendency.(B) The average of the deviations.(C) A measure of the spread or variability of the data.(D) The average of the sample mean and quartiles.

Question 36. If a particular dataset has a five-number summary given by 10, 20, 30, 40,50, then the interquartile range is:

(A) 30− 5 = 25(B) 40− 20 = 20(C) (10 + 20 + 30 + 40 + 50)/5 = 30(D) 50− 10 = 40

8

Page 9: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 37. In any large data set, the proportion of data falling at or below Q3, thethird quartile, is:

(A) 25%(B) 75%(C) Dependent on the sample drawn from the population.(D) 68%(E) 99.7%

Chapter 8

Question 38. If a data set is normally distributed then:

(A) The mean is smaller than the median or the mode.(B) The mean is larger than the median but not larger than the mode.(C) The mean, median and mode are the same.(D) None of the above.

Question 39. As measured by the Stanford-Binet test, IQ scores are approximately nor-mally distributed with mean 100 and standard deviation 16. Mensa is an organizationwhose members have IQ scores in the top 2%of the population. To be admitted toMensa, a person’s Stanford-Binet IQ score is at least:

(A) 100 + (16× 0.98) = 115.68.(B) 100− (16× 2.05) = 67.20.(C) 100 + (16× 2.05) = 132.80.(D) 100− (16× 0.98) = 84.32.

Question 40. The Empirical Rule states, in part, that if a data set is approximatelynormally distributed (or bell-shaped) then:

(A) At most 90% of all observations fall within three standard deviations of the mean.(B) About 37% of all observations fall within two standard deviations of the mean.(C) About 68% of all observations fall within one standard deviation of the mean.(D) None of the above.

Question 41. As measured by the Stanford-Binet test, IQ scores are approximatelynormally distributed with mean 100 and standard deviation 16. A student who scores108 on the Stanford-Binet test has a standardized score of:

(A) (108− 100)/16 = 0.50.(B) Cannot be calculated because we are given insufficient information.(C) (108− 16)/100 = 0.92.

(D) (108− 100)/√

100 = 0.80.

9

Page 10: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 42. As measured by the Stanford-Binet test, IQ scores are approximately nor-mally distributed with mean 100 and standard deviation 16. Because of the EmpiricalRule (68-95-99.7 Rule), we can conclude that approximately 68% of all IQ scores willfall between:

(A) 32 and 168.(B) 84 and 116.(C) 0 and 200.(D) 68 and 132.

Question 43. As measured by the Stanford-Binet test, IQ scores are approximatelynormally distributed with mean 100 and standard deviation 16. A score of 108 on theStanford-Binet test falls in the:

(A) 69th percentile.(B) 92nd percentile.(C) 50th percentile.(D) 80th percentile.

Chapter 10

Question 44.In an example in the textbook, the correlation between wives’ and husbands’ heights, in

millimeters, was 0.36. If heights are measured in inches then the correlation will:

(A) Be unchanged, because correlation does not depend on the units of measurement.(B) Become zero; for there is no correlation between the two variables.(C) Decrease, because 1 millimeter is shorter than 1 inch.(D) Increase, because 1 inch is longer than 1 millimeter.(E) None of the above.

Question 45. The variables x (temperature in ◦C) and y (temperature in ◦F) are relatedby the formula y = 32 + 1.8x. Therefore the correlation between x and y will be:

(A) 32, because if y = 32 then x = 0.(B) −1, because if x decreases then y decreases.(C) 1.8, because if x increases by 1◦C then y increases by 1.8◦F.(D) 0, because if x = 0 then y = 32.(E) 1, because the variables have a deterministic, linear, increasing relationship.

10

Page 11: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 46. Suppose that in a particular sample, we find that 80% of female collegestudents use cell phones while 75% of male college students use cell phones. Whichof the following is true?

(A) This difference between males and females is less likely to be statistically significantif the sample is larger.

(B) This difference between males and females is more likely to be statistically significantif the sample is larger.

(C) This difference between males and females is more likely to be practically significantif the sample is larger.

(D) This difference between males and females is less likely to be practically significant ifthe sample is larger.

Question 47. In a study of the relationship between handspan (in centemeters) and height(in inches) it was found that the correlation is about .80 and the regression equationis

handspan = −3 + 0.35 height

What is the predicted handspan for someone 5 feet tall?

(A) 22.2 centimeters(B) 3 centimeters(C) 60 centimeters(D) 18 centimeters

Question 48. Consider the following two variables: Weight of a car and its gas mileage(the number of miles it can drive on a gallon of gasoline). We would expect thecorrelation to be

(A) negative(B) positive(C) zero(D) one

Question 49. A dataset contains yearly measurements since 1960 of the divorce rate(per 100,000 population) in the United States and the yearly number of people (per100,000 population) sent to prison for drug offenses in the United States. We observea strong correlation of .67. Each of the following statements is true EXCEPT

(A) Years with higher divorce rates have tended to be years with higher lockup rates.(B) Higher divorce rates lead to higher rates of drug lockups.(C) Higher divorce rates are associated with higher rates of drug lockups.(D) A positive correlation between divorce rates and drug lockups exists.

Chapter 11

11

Page 12: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 50. True or False: Outliers in a scatter plot can increase the correlation.

(A) True(B) False

Question 51. The following table presents experts’ and students’ rankings of the risksof eight activities. (Hint: Sketch the scatterplot of these rankings with the experts’rankings on the horizontal axis.)

The Eight Greatest RisksActivity or Technology Experts’ Rank (x) Students’ Rank (y)Motor Vehicles 1 5Smoking 2 3Alcoholic Beverages 3 7Handguns 4 4Surgery 5 11Motorcycles 6 6X-rays 7 17Pesticides 8 2

If “Pesticides” were deleted from the data above, then we would expect that:

(A) The correlation and the slope of the regression line both will decrease.(B) The correlation will increase and the slope will decrease.(C) The correlation and the slope of the regression line both will remain unchanged.(D) The correlation and the slope of the regression line both will increase.(E) None of the above.

Chapters 12 and 13Hypothetical research question, asked of a random sample of students: Do you own a

pet? (Data and related output below.)

Rows: Gender Columns: Pet

no yes AllFemale 26 50 76Male 17 18 35All 43 68 111

Chi-sq = 0.40 + 0.25 +0.87 + xxxx = 2.08

Question 52. In the debate between the research advocate and the skeptic, who wins inthis case involving the pet question above?

(A) Skeptic(B) Neither(C) Research advocate(D) Both

12

Page 13: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 53. In the table above about the pet data, the expected number correspondingto 18 is about:

(A) 17(B) 111(C) 21(D) 68(E) 14

Question 54. If all the counts in the table above about the pet data were multipled by10, the chi-squared value would

(A) change to .208(B) change to 20.8(C) change to 208(D) stay the same

Chapter 16

Question 55. A fair die if rolled repeatedly until the first six appears. What is theprobability that the first six appears on the fourth roll?

(A) (56)6 = .3349

(B) 3× 56 ×

16 = .4167

(C) 56 ×

56 ×

56 ×

16 = .0965

(D) 46 = .6667

Question 56.

No Ticket Ticket TotalFemale 52 25 77Male 17 19 36Total 69 44 113

The data show clearly that males received traffic tickets at a higher rate than females. Fromthese census data we conclude that the events “Female” and “Ticket” are:

(A) Independent(B) Mutually exclusive(C) Neither mutually exclusive nor independent(D) Both mutually exclusive and independent

Question 57. Suppose you are playing a game and you have a 0.3 probability of winning.Suppose you decide to play repeatedly until you win. The games are independent.What is the probability that you win on the first or second try?

(A) .60(B) .30(C) .64(D) .51

13

Page 14: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 58. Suppose the probability that a child lives with his or her mother as thesole parent is .216, and the probability that a child lives with his or her father as soleparent is .031. Then the probability that a child either lives with both or with neitherparent is:

(A) 1− .216 = .784(B) 1− .031 = .969(C) .216 + .031 = .247(D) 1− (.216 + .031) = .753(E) None of the above

Question 59. Suppose you repeatedly toss a coin for which the probability of tossingheads is .3, or 30%. The probability that you do not toss heads on any of your firstfour tosses is:

(A) 1− (.7)4 = .76(B) .7 + .7 + .7 + .7 = 2.8(C) .7

(D) (.7)4 = .24(E) None of the above

Chapter 17

Question 60. Suppose a door prize winner is selected at random from all the peopleattending an Italian electrical engineering conference. Your roommate believes thatthe winner is more likely to be a black-haired male than a male in general. Yourroommate has committed

(A) the gambler’s fallacy.(B) the anchoring fallacy.(C) no fallacy at all; it is well-known that men with black hair have more fun.(D) the conjunction fallacy.

Chapter 18

Question 61. Suppose that 1%of the population has hepatitis. Suppose we have a testfor the disease that has 80% sensitivity and 90% specificity. What is the probabilityof a false positive?

(A) .99(B) .10(C) .20(D) .01

14

Page 15: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 62. Suppose that 1%of the population has hepatitis. Suppose we have a test forthe disease that has 80% sensitivity and 90% specificity. What is Pr(hepatitis giventhat the test is positive)?

(A) .01(B) .14(C) .80(D) .075

Chapter 19

Question 63. In a random sample of 1,600 PSU students, 64% reported a preferencefor root beer (over birch beer). In estimating the population proportion of all PSUstudents who prefer root beer, we measure the precision of the sample proportion by:

(A) The square of the sample size: 1, 6002, or 960, 000.(B) The sample size as a proportion of all PSU students: 1, 600/82, 000, or 1.95%.(C) The sample proportion: 64%.

(D) The margin of error: 1/√

1, 600, or 2.5%.

Question 64. The histogram of sample means from a large number of equally-sizedrandom samples of size 50 will be shaped approximately like a:

(A) Semi-circle.(B) Skewed histogram, with a long right tail.(C) Triangle.(D) Normal (bell-shaped) curve.

Question 65. The mean of a large number of sample proportions from equally-sizedrandom samples will be approximately:

(A) The area below the normal curve and between -1.96 and +1.96.(B) A proportion of the population which is never sampled.(C) The square root of: (true proportion)× (1− true proportion)/(sample size).(D) The square root of: (true proportion)× (sample size)/(1− true proportion).(E) The true proportion of the population.

Question 66. It is known that in the presidential elections of 1992, 56% of all eligibleadults actually voted. If we collect a large number of simple random samples each ofsize 1,600 adults then, about 95% of the time, the sample proportion of adults whovoted will fall between:

(A) .56± 4×√

.56(1− .56)/1600, or 0.512 and 0.608.

(B) .56± 1×√

.56(1− .56)/1600, or 0.548 and 0.572.

(C) .56± 2×√

.56(1− .56)/1600, or 0.536 and 0.584.

(D) .56± 3×√

.56(1− .56)/1600, or 0.524 and 0.596.

15

Page 16: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 67. The standard deviation of the histogram of a large number of sample meansis, approximately:

(A) The area below the normal curve and between -2 and +2.(B) The mean of a proportion of the population which is never sampled.(C) The true mean of the population.

(D) (population standard deviation)/√

sample size.

Question 68. The mean of a large number of sample means from equally-sized randomsamples will be approximately:

(A) (population standard deviation)/√

sample size.(B) The mean of a proportion of the population which is never sampled.(C) The area below the normal curve and between -1.96 and +1.96.(D) The true mean of the population.

Question 69. The standard deviation of the histogram of a large number of sampleproportions is, approximately:

(A) The area below the normal curve and between -1.96 and +1.96.(B) The square root of: (true proportion)× (1− true proportion)/(sample size).(C) A proportion of the population which is never sampled.(D) The square root of: (true proportion)× (sample size)/(1− true proportion).(E) The true proportion of the population.

Chapter 20

Question 70. A confidence interval for a population mean is given as “Sample mean ±2.33×SEM.” The corresponding level of confidence is:

(A) 98%(B) 99%(C) 95%(D) 64%

Question 71. A confidence interval for a population proportion is a range of numberswhich:

(A) Has a 90% probability of containing the population proportion.(B) Increases in width as the sample size increases.(C) Is certain to contain the population proportion.(D) Is a plausible range of values for the population proportion.

16

Page 17: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 72. To calculate a 99% confidence interval for a population proportion, we use:

(A) Sample proportion ± 1.64× S.D.(B) Sample proportion ± 2× S.D.(C) Sample proportion ± 2.33× S.D.(D) Sample proportion ± 2.576× S.D.

Question 73. All other things remaining constant, if the population size quadruples from10 million to 40 million then the width of a confidence interval will:

(A) Increase by a factor of two.(B) Decrease by half.(C) Remain unchanged.(D) Increase and then decrease.

Question 74. All other things remaining constant, increasing the confidence coefficientcauses the width of a confidence interval to:

(A) Remain unchanged.(B) Increase.(C) Decrease.(D) Increase for a while and then decrease.

Question 75. In a random sample of 400 PSU graduates, 64% stated that they preferroot beer (over birch beer). Therefore, a 90% confidence interval for the proportionof all PSU graduates who prefer root beer is:

(A) 0.64± 1.64×√

.64/400

(B) 0.64± 2×√

.64× .36/400

(C) 0.64± 2×√

.64/400

(D) 0.64± 1.64×√

.64× .36/400

Question 76. All other things remaining constant, which of the following sample propor-tions will result in the widest confidence interval:

(A) .2(B) .5(C) .1(D) .4(E) .3

Chapter 21

17

Page 18: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 77. In a study of farmers’ caffeine levels, a random sample of 25 farmersyielded a sample mean of 22 and a sample standard deviation (S.D.) of 4. Therefore,the standard error of the mean (SEM) is:

(A) 22/√

25

(B) 4×√

25

(C) 4/√

25

(D) 22×√

25

Question 78. A random sample of 25 farmers was examined in a study of caffeine levels.From the data collected, a 95% confidence interval for the population mean caffeinelevel was calculated to be 21.5 to 23.0. We can conclude that:

(A) If we repeatedly collect random samples of size 25 and calculate the correspondingconfidence intervals then, over the long run, 95% of these intervals will capture thepopulation mean and 5% will fail to capture the population mean.

(B) For any random sample of 25 farmers, the resulting sample mean will always fallbetween 21.5 and 23.0.

(C) If we repeatedly sample the entire population then, about 95% of the time, thepopulation mean will fall between 21.5 and 23.0.

(D) None of the above.(E) 95% of all farmers have caffeine levels between 21.5 and 23.0.

Question 79. In a study of caffeine levels of farmers and doctors, suppose that a 95%confidence interval for the difference in the means did not contain the value zero. Wecould infer that the population means of farmers’ and doctors’ caffeine levels:

(A) Are significantly different from each other.(B) Are close to each other; there is no significant difference between them.(C) Have no relationship to each other; we have insufficient evidence to make an inference

about their relative location.(D) Are equal to each other; we are 95% confident that they are equal.

Question 80. A study of caffeine levels of farmers and doctors reported the followingdata:

Farmers DoctorsSample Size 25 25Sample Mean 13 10Sample S.D. 4 3

The standard error of the difference between the two sample means is:

(A) The standard deviation of the data in the combined samples.

(B) (Sample Mean1 − Sample Mean2)/√

SD1 − SD2 = (13− 10)/√

4− 3 = 3.0

(C)√

(Sample Mean1)2/SD1 + (Sample Mean2)2/SD2 =√

( 13√25

)2 + ( 10√25

)2 = 3.28

(D)√

(SEM1)2 + (SEM2)2 =√

( 4√25

)2 + ( 3√25

)2 = 1.0

18

Page 19: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 81. All other things remaining constant, if the sample size increases by a factorof nine then the confidence interval for the population mean will:

(A) Triple in width.(B) Become one-ninth as wide.(C) Become nine times as wide.(D) Become one-third as wide.

Question 82. A researcher repeatedly collects random samples of size 1,600 and computesa 95% confidence interval for the population mean using each sample. Over the longrun, the proportion of confidence intervals which will fail to capture the populationmean is:

(A) None of the above.

(B) 1/√

1, 600, or 1/40, 2.5%(C) 95%(D) 5%

Question 83. A random sample of 25 farmers was examined in a study of caffeine levels.From the data collected, a 95% confidence interval for the population mean caffeinelevel was calculated to be 21.5 to 23.0. We can conclude that:

(A) 23.3 is a plausible value for the population mean.(B) 21.8 is not a plausible value for the population mean.(C) 22.3 is a plausible value for the population mean.(D) 19.7 is a plausible value for the population mean.

Question 84. A student claims that, for any data set of size two or more, the standarderror of the mean (SEM) is smaller than the sample standard deviation. This claimis:

(A) Always true.(B) Not always true; it depends on the actual numbers in the sample.(C) Always false.(D) Not always false; it depends on the sample size.

19

Page 20: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 85.A study of caffeine levels of farmers and doctors reported the following data:

Farmers DoctorsSample Size 25 25Sample Mean 13 10Sample S.D. 4 3

A 95% confidence interval for the difference in population mean caffeine levels (farmers’mean minus doctors’ mean) is:

(A) (13− 10)± 1.64× (SE of the difference ).(B) (4− 3)± 2× (SE of the difference ).(C) (4− 3)± 2× (SE of the difference ).(D) (13− 10)± 2× (SE of the difference ).

Question 86. All other things remaining constant, if the sample size decreases then thestandard error of the sample mean:

(A) Decreases.(B) Increases, levels off, and then increases again.(C) Increases.(D) Will remain unchanged.(E) Decreases and then increases.

Question 87. All other things remaining constant, if the sample size increases by a factorof 25 then the standard error of the mean:

(A) Becomes one twenty-fifth as large.(B) Becomes five times as large.(C) Becomes one fifth as large.(D) Becomes twenty-five times as large.(E) Will remain unchanged.

Chapter 22

Question 88. Consider the research hypothesis: Working at least 5 hours per day at acomputer contributes to deterioration of eyesight. The null hypothesis is:

(A) working at least 5 hours per day does not affect your eyesight(B) cannot determine the null hypothesis(C) working at least 5 hours per day improves your eyesight(D) working at least 5 hours per day contributes to the deterioration of eyesight

20

Page 21: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 89. Suppose after viewing the results of a study you decide, on the basis of thereported p-value, to support the skeptic. Then

(A) It is impossible for you to have committed either a Type 1 or Type 2 error(B) It is impossible for you to have committed a Type 1 error(C) You must have committed a Type 1 error(D) It is impossible for you to have committed a Type 2 error

Question 90. In a randomized experiment, if the p-value is small when comparing thetreatment and control groups, we can infer that the treatment caused the difference:

(A) True(B) False

Question 91. An experiment was conducted to see if adding Quaker State motor oil toCoppertone suntan lotion enhances tanning. A Type 1 error is:

(A) claim QS does not enhance tanning(B) claim QS enhances tanning(C) claim QS does not enhance tanning when it does(D) claim QS enhances tanning when it does not

Question 92. A p-value is a probability that is computed under what assumption?

(A) The alternative hypothesis is true(B) A type 2 error has been committed(C) A type 1 error has been committed(D) The null hypothesis is true

Question 93. In a study to determine if putting newborn babies in an incubator con-tributed to claustrophobia in adult life, the researchers found a p-value of .023. Thisstudy supports:

(A) both the skeptic and the research advocate(B) the research advocate(C) the skeptic(D) neither the skeptic nor the research advocate

Question 94. A statistical study considers the question of whether the presence of plantsin an office might lead to fewer sick days. In this study, the null hypothesis is:

(A) The presence of sick people in an office leads to fewer plants.(B) Insufficient information is given to allow us to determine the null hypothesis.(C) The presence of plants in an office leads to fewer sick days.(D) The presence of plants in an office does not lead to fewer sick days.

21

Page 22: randomized experiments samplepersonal.psu.edu/drh20/100/fall2008/exams/final/first3midterms.pdf(D) Systematic random sampling. Question 17. In a study of PSU students’ awareness

Question 95. A sudy of PSU students failed ( p-value = .32) to find a difference in pulserates between men and women. Which of the following is true?

(A) The research hypothesis is supported(B) It is possible that they committed a Type 2 error(C) The null hypothesis is rejected(D) It is possible that they committed a Type 1 error

Question 96. A research experiment found that ABC bubblegum retains its flavor longerthan XYZ bubblegum. We then know that:

(A) the p-value was small(B) they must have committed a Type 1 error(C) the p-value was large(D) they must have committed a Type 2 error

22