chapter 2prepared by samantha gaies, m.a.1 example: based on a stress questionnaire given to n =151...

12
Chapter 2 Prepared by Samantha Gaies, M.A. 1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question: How stressed have you been in the last 2½ weeks, on a scale of 0 to 10 (0 = not at all stressed, 10 = as stressed as possible)? We could list all 151 scores, but: Chapter 2: Frequency Distributions and Graphs

Upload: theodore-davidson

Post on 13-Dec-2015

213 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

1Chapter 2 Prepared by Samantha Gaies, M.A.

• Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:– How stressed have you been in

the last 2½ weeks, on a scale of 0 to 10 (0 = not at all stressed, 10 = as stressed as possible)?

– We could list all 151 scores, but:• this is not very efficient• patterns are hard to detect

Chapter 2: Frequency Distributions and Graphs

Page 2: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

2Chapter 2 Prepared by Samantha Gaies, M.A.

Stress Rating Frequency

10 149 158 267 316 135 184 163 122 31 10 2

• Simplest Solution: Regular Frequency Distribution

• Shows how often each score occurs (i.e., its frequency)

• Allows us to see patterns

Page 3: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

Prepared by Samantha Gaies, M.A.

3Chapter 2

• Cumulative Frequency Distribution: sums the frequencies at and below a particular value

– Can determine the number of scores at or below your score

– E.g., the cf for a stress rating of 7 is 96, so someone with that rating is reporting as much or more stress than 96 of the 151 students.

Stress Rating Frequency Cf

10 14 1519 15 1378 26 1227 31 966 13 655 18 524 16 343 12 182 3 61 1 30 2 2

Page 4: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

4Chapter 2 Prepared by Samantha Gaies, M.A.

• Grouped Frequency Distribution– Example: Moorehouse and Sanders

(1992): Children’s attitudes toward learning and work (N = 153)

– Interval width (i) = 10– Number of intervals = 11

Page 5: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

5Chapter 2 Prepared by Samantha Gaies, M.A.

Percentiles and Percentile Ranks

• Definitions– Percentile Rank (of a score): a single

number that gives the percent of cases in the specific reference group scoring at or below that score. It can be found by con-verting the cf of the score to a percentage (e.g., the PR corresponding to a score of 7 in the stress study is: 96/156 ×100 = 61.5).

– Percentile: the score at or below which a given percent of the cases lie. First, find the cf corresponding to the desired percen-tile, and then find the score that is close to that cf (e.g., the cf for the 43rd percentile is: .43 × N, where N is the total number of scores; for the stress study, N = 151, so the desired cf is .43 × 151 = 64.9. Thus, the 43rd percentile corresponds approximately to a score of 6, which has a cf of 65).

Page 6: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

6Chapter 2 Prepared by Samantha Gaies, M.A.

Percentiles and Percentile Ranks for Grouped Distributions

• PR for an Upper Real Limit– e.g., the upper real limit of the 80–89 interval

in the Moorehouse & Sanders table is 89.5, and the corresponding cf is 72. Therefore, the PR for 89.5 is 72/153 × 100 = 47 (i.e., the 47th percentile is 89.5). The PR for a score of 85 is about 43, because its cf is about 66 (about halfway between cfs of 59 and 72 in the table), and 66/153 ×100 = 43.1

• Deciles and Quartiles– The percentiles of greatest interest are the

deciles and quartiles. The fifth decile (or second quartile) is called the median.

– Finding the median means finding the score that corresponds to a cf of .5N. For the M & S data, the cf for the median = .5 × 153 = 76.5, which falls between cfs of 72 and 95 in the table, which correspond to the upper real limits of 89.5 and 99.5. Thus, the score at the median is about 92.

Page 7: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

7Chapter 2 Prepared by Samantha Gaies, M.A.

Graphic Representations:• Bar Chart: The heights of the bars indicate

frequency of occurrence; the bars do not touch, because the variable represented on the horizontal axis consists of distinct categories.

• Example: Different types of family house-holds in U.S. in 1994 (see bar chart below).

Frequency equals the proportion multiplied by the total sample size (68.5 million) (U.S. Census Bureau)

Page 8: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

8Chapter 2 Prepared by Samantha Gaies, M.A.

Histogram: The heights of the bars indicate frequency of occurrence; the bars do touch, because the variable represented on the horizontal axis is continuous, and each interval extends from one upper real limit to the next.

Example: Based on the stress rating data of Aron, Paris, and Aron (1995). See histogram below:

Page 9: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

9Chapter 2 Prepared by Samantha Gaies, M.A.

Regular Frequency Polygon:

Points are placed above the midpoints of the intervals in the histogram of the previous slide, corresponding to the frequencies of the intervals (based on the data from Aron, Paris, & Aron,1995).

Page 10: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

10Chapter 2 Prepared by Samantha Gaies, M.A.

Cumulative Frequency Polygon: Points are placed above the upper real limits of the intervals in the histogram of the previous slide, and correspond to the cfs of those intervals (based on the data from Aron, Paris, & Aron,1995)

Page 11: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

11Chapter 2 Prepared by Samantha Gaies, M.A.

Stem-and-Leaf Display:

Example: The ratings range from 25 to 120; the stems represent the tens place for each number in the plot below (based on the data of Moorehouse & Sanders,1992).

You can see the pattern of the distribution by looking at the plot sideways.

Stemplot of Teacher Ratings2 53 024 0234555 03345679996 123345555677888997 000123334444444456778898 01224456667789 00112222344555566777789

10 00113334444556677777888899911 00011112222333444455555577889912 0

Page 12: Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:

12Chapter 2 Prepared by Samantha Gaies, M.A.

Shapes of Frequency Distributions

– Symmetry vs. Skewness• Easy exams tend to produce negatively

skewed distributions (scores are bunched at the high end and trail off at the low end).

• Hard exams tend to produce positively skewed distributions (scores are bunched at the low end and trail off at the high end).

• Exams that are medium in difficulty tend to produce symmetrical distributions

– Modality• Distributions with one distinct peak

are called unimodal. • Distributions with two distinct peaks

are called bimodal.• Distributions with more than two

distinct peaks are unusual.