measurement. scales of measurement stanley s. stevens’ five criteria for four scales nominal...

31
Measurement

Upload: buddy-watson

Post on 20-Jan-2016

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Measurement

Page 2: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Scales of Measurement

• Stanley S. Stevens’ Five Criteria for Four Scales

• Nominal Scales– 1. numbers are assigned to objects

according to rules– can establish equivalence or nonequivalence

of measured objects

Page 3: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Ordinal Scales

2. m(O1 ) m(O2 ) only if t(O1 ) t(O2 )

3. m(O1 ) > m(O2 ) only if t(O1 ) > t(O2 )

Page 4: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

– equivalence or nonequivalence– order: which has more of the measured

attribute– cannot establish equivalence of differences

Page 5: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Interval Scales

• letting Xi stand for t(Oi )

• 4. m(Oi ) = a + bXi, b > 0

• t(O1 ) ‑ t(O2 ) = t(O3 ) ‑ t(O4 )if m(O1 ) ‑ m(O2 ) = m(O3 ) ‑ m(O4 )

Page 6: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Ratio Scales

• 5. a = 0, that is m(Oi ) = bXi,

• true zero point• m(O1 ) m(O2 ) = bX1 bX2 = X1 X2

• Remember gas law problems?

Page 7: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Reliability

• Repeatedly measure unchanged things.• Do you get the same measurements?• Charles Spearman, Classical

Measurement Theory• If perfectly reliable, then corr between true

scores and measurements = +1.• r < 1 because of random error.• error symmetrically distributed about 0.

Page 8: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Reliability is the proportion of the variance in the measurement scores that is due to differences in the true scores rather than due to random error.

222

2

2

2

TMET

T

M

TXX rr

Page 9: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

• Systematic error– not random– measuring something else, in addition to the

construct of interest• Reliability cannot be known, can be

estimated.

Page 10: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Test-Retest Reliability

• Measure subjects at two points in time.• Correlate ( r ) the two sets of

measurements.• .7 OK for research instruments• need it higher for practical applications

and important decisions.• M and SD should not vary much from Time

1 to Time 2, usually.

Page 11: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Alternate/Parallel Forms

• Estimate reliability with r between forms.• M and SD should be same for both forms.• Pattern of corrs with other variables should

be same for both forms.

Page 12: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Split-Half Reliability

• Divide items into two random halves.• Score each half.• Correlate the half scores.• Get the half-test reliability coefficient, rhh

• Correct with Spearman-Brown

hh

hhsb r

rr

12

Page 13: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Cronbach’s Coefficient Alpha

• Obtained value of rsb depends on how you split the items into haves.

• Find rsb for all possible pairs of split halves.

• Compute mean of these.• But you don’t really compute it this way.• This is a lower bound for the true reliability.• That is, it underestimates true reliability.

Page 14: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Maximized Lambda4

• This is the best estimator of reliability.• Compute rsb for all possible pairs of split

halves.• The largest rsb = the estimated reliability.

• If more than a few items, this is unreasonably tedious.

• But there are ways to estimate it.

Page 15: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Intra-Rater Reliability

• A single person is the measuring instrument.

• Rate unchanged things twice.• Correlate the ratings.

Page 16: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Inter-Rater Reliability: Categorical Judgments

• Have 2 or more judges or raters.• Want to show that the scores are not

much affected by who the judge is.• With a categorical variable, could use

percentage of agreement.• But there are problems with this.

Page 17: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Rater 2

Rater 1 No Fight Aggressor Victim marginal

No Fight 70 (54.75) 3 2 75

Aggressor 2 6 (2.08) 5 13

Victim 1 7 4 (1.32) 12

marginal 73 16 11 100

• % agreement = 80%• agree on whether is fighting or not• but not on whether is aggressor or victim

Page 18: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Cohen’s Kappa

• Corrects for tendency to get high % just because one category is very often endorsed by both judges.

• For each cell in main diagonal, compute E– E = (row total)(column total) / table total– upper left cell, E = 73(75) / 100 = 54.75

Page 19: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

EN

EO

52.085.4185.21

32.108.275.5410032.108.275.544670

Page 20: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

• Here kappa is .82

Rater 2

Rater 1 No Fight Aggressor Victim marginal

No Fight 70 (52.56) 0 2 72

Aggressor 2 13 (2.40) 1 16

Victim 1 2 9 (1.44) 12

marginal 73 15 12 100

Page 21: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Inter-Rater Reliability: Ranks

• Two raters ranking things– Spearman’s rho– Kendall’s tau

• Three or more raters ranking things– Kendall’s coefficient of concordance

Page 22: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Inter-Rater Reliability: Continuous Scores

• Two raters– could use Pearson r

• Two or more raters– better to use intraclass correlation coefficient– scores could be highly correlated and show good

agreement

Subject 1 2 3 4 5 6 7 8 9 10 Judge 1 10 9 8 7 6 5 4 3 2 1 Judge 2 9 10 8 7 5 6 4 3 1 2

Page 23: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Inter-Rater Reliability: Continuous Scores

– or scores could be highly correlated but show little agreement

– r = .964 for both pairs of judges.– ICC = .967 for first pair, .054 for second pair

Subject 1 2 3 4 5 6 7 8 9 10 Judge 4 10 9 8 7 6 5 4 3 2 1 Judge 5 90 100 80 70 50 60 40 30 10 20

Page 24: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Construct Validity

• To what extent are we really measuring/manipulating the construct of interest?

• Face Validity – do others agree that it sounds valid?

Page 25: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Content Validity

• Detail the population of things (behaviors, attitudes, etc.) that are of interest.

• Consider our operationalization of the construct as a sample of that population.

• Is our sample representative of the population – ask experts.

Page 26: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Criterion-Related Validity

• Established by demonstrating that our operationalization has the expected pattern of correlations with other variables.

• Concurrent Validity – demonstrate the expected correlation with other variables measured at the same time.

• Predictive Validity – demonstrate the expected correlation with other variables measured later in time.

Page 27: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

• Convergent Validity – demonstrate the expected correlation with measures of other constructs.

• Discriminant Validity – demonstrate the expected lack of correlation with measures of other constructs.

Page 28: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

Threats to Construct Validity

• Inadequate Preoperational Explication– the population of things defining the construct

was not adequately detailed• Mono-Operation Bias

– have used only one method of manipulating the construct

• Mono-Method Bias– have used only one method of measuring the

construct

Page 29: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

• Interaction of Different Treatments– effect of manipulation of one construct is

altered by the previous manipulation of another construct

• Testing x Treatment Interaction– in pretest posttest design, did taking the

pretest alter the effect of the treatment?

Page 30: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

• Restricted Generalizability Across Constructs– experimental treatment might affect constructs

we did not measure– so we can’t describe the full effects of the

treatment• Confounding Constructs with Levels of

Constructs– Would our manipulated construct have a

different effect if we used different levels of it?

Page 31: Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules

• Social Threats– Hypothesis Guessing

• Good guy effect• Screw you effect

– Evaluation Apprehension– Expectancy Effects

• Experimenter expectancy• Participant expectancy• Blinding and double blinding