kevin s. mcgrew, phd institute for applied … · institute for applied psychometrics &...

71
“Intelligent” intelligence testing with the WJ IV cognitive battery Kevin S. McGrew, PhD Institute for Applied Psychometrics & University of Minnesota © Institute for Applied Psychometrics, Dr. Kevin S. McGrew, 12-28-15 Links to more complete sets of handouts and/or PPT slides will be provided the day of the workshop

Upload: letram

Post on 23-Aug-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

“Intelligent” intelligence testing with the WJ IV cognitive battery

Kevin S. McGrew, PhD

Institute for Applied Psychometrics& University of Minnesota

© Institute for Applied Psychometrics, Dr. Kevin S. McGrew, 12-28-15

Links to more complete sets of handouts and/or PPT slides will be

provided the day of the workshop

• Institute for Applied Psychometrics (IAP)-Director

• University of Minnesota - Visiting Professor (Educ. Psych.)

• Interactive Metronome - Director of Research and Science (External Consultant) *

• Darhma Berkmana Foundation (YDB; Indonesia) –Intelligence expert for development of first Indonesian CHC-based intelligence battery for children

* Conflict of interest disclosure: Financial relationship and interest in IM; Coauthor of WJ III and WJ IV (royalty interest)

Kevin S. McGrew, PhD.

Affiliations and Disclosures

© Institute for Applied Psychometrics, Dr. Kevin S. McGrew, 012314

• General introduction and workshop logistics

• Intelligence testing in the “big picture” context

• Brief overview of Kaufman’s “intelligent” testing approach

• Foundational empirical knowledge– “romancing the stones” (tests)

• The WJ IV/CHC Periodic Table of Cognitive Test Elements• WJ IV variation and comparison procedures - brief• Test/cluster score difference (% base rate) rules-of-thumb

© Institute for Applied Psychometrics, Dr. Kevin S. McGrew, 12-28-15

“Intelligent” intelligence testing with the WJ IV cognitive battery

• WJ IV published & new supplemental/clinical test groupings

• WJ IV assessment trees

• Within-CHC domain assessment trees (“drilling down”)• Academic domain referral-focused assessment trees

• Miscellaneous topics and tidbits

• Conclusions and Q/A

“Intelligent” intelligence testing with the WJ IV cognitive battery

© Institute for Applied Psychometrics, Dr. Kevin S. McGrew, 12-28-15

Will be covered

concurrentlywith aid of case study

Waves Of Intelligence Test Interpretation

(Kamphaus et al., 1997)

• Wave 1 - Quantification of a General Level (g)

• Wave 2 - Clinical Profile Analysis

• Wave 3 - Psychometric Profile Analysis

• Wave 4 - Applying Theory to Intelligence Test Interpretation

Cohen, J. (1959). The factorial structure of the WISC at ages 7-6, 10-6, and 13-6, Journal of Consulting Psychology, 23, 285-299.

Wave 3:Psychometric Profile

Analyses

Kaufman, A. S. (1979). Intelligent testing with the WISC-R. New York: Wiley-Interscience.

Wave 4:Applying Theory to Test Interpretation

(and research & development)

PASS CAS/CAS II

Sim/Suc KABC/KABC-II

CHC (Gf-Gc) WJ-R/III/IV CHC (Gf-Gc) SB5

Gf-Gc KAITCHCKABC-II

Sample ITDR summary page from McGrew & Flanagan (1997)

Sample ITDR summary page

The intent of the intelligent testing model was and remains to “bring together

empirical data, psychometrics, clinical acumen, psychological theory, and careful

reasoning to build an assessment of an individual leading to the derivation of an

intervention to improve the life circumstances of the subject” (Reynolds,

2007, p. 1133) – in Fletcher-Janzen (2009)

-The gold standard for clinical-psychometric test interpretation

-Incorporates both quantitative and qualitative analysis

-The first system of test interpretation that followed scientific principles and at the same tame overtly sought to reduce inappropriate use of obtained test scores

-Demands a very high standard of clinical expertise

-The central point of intelligent testing is that the clinician’s judgement regarding the patient is the central point

Intelligent Testing: Bridging the Gap between Classical and Romantic Science in Assessment (Elaine Fletch-Janzen, 2009)

“ Tests do not think for themselves, nor do they directly communicate with patients. Like a stethoscope, a blood pressure gauge, or an MRI scan, a psychological test is a

dumb tool, and the worth of the tool cannot be separated from the sophistication of the clinician who draws inferences from it and then communicates with

patients and professionals”

Meyer et al. (2001). Psychological testing and psychological assessment. American Psychologist

If you give a monkey a Stradivarius violin and you get bad music……

You don’t blame the violin !!!!

We are the instrument !!!!

“Intelligent” intelligence testing and interpretationrequires … knowing thy instruments

An “intelligent” clinician understands and “romances the stones (tests)” which have different and multiple facets

CHC ability factor classifications

Neuropsych. interpretation

External criterion relations

Info. Proc. stimulus & response characteristics (e.g., BIS)

Cognitive operations

Level/type of cognitive processing (Type 1 v Type 2)

Exec. Functions/Attentional control

Error variance (reliability)

Uniqueness (specificity)

g-loading

Degree of cultural loading

Degree of linguistic demand

Ability domain cohesion

Degree of cognitive complexity

...most disciplines have a common set of termsand definitions (i.e., a standard nomenclature)that facilitates communication among professionalsand guards against misinterpretations. In chemistry,this standard nomenclature is reflected in the Tableof Periodic Elements. Carroll (1993a) has providedan analogous table for intelligence…..

(Flanagan & McGrew, 1998)

The importance of taxonomies and classification in science

© Institute for Applied Psychometrics (IAP) Dr. Kevin McGrew 4-11-14

A Good Taxonomy

The Cattell-Horn-Carroll (CHC) model is the contemporary consensus taxonomy of human

cognitive abilities

PMA1

T2 T3 T4 T5 T6 T7 T8 T9T1 T12T10 T11

PMA2 PMA3 PMA4 …etc

…etc

G1 G2 G3

…etc

g ?

(Consensus Cattell-Horn-Carroll Hierarchical Three-Stratum Model)

Richard Snow (1993):“John Carroll has done a magnificent thing. He has reviewed and

reanalyzed the world’s literature on individual differences in cognitive abilities…no one else could have done it… it defines the taxonomy of cognitive differential psychology for many years to come.”

John Horn (1998):A “tour de force summary and integration” that is the “definitive foundation for current theory” (p. 58). Horn compared Carroll’s summary to “Mendelyev’s first presentation of a periodic table of elements in chemistry” (p. 58).

The verdict is unanimous re: the importance of Carroll’s (1993) work

© Institute for Applied Psychometrics (IAP) Dr. Kevin McGrew 4-11-14

Gh

Go OM

Ga

Gv Vz SR MV CS SS CF IM PI LE IL PN

Sen

sory

-Mo

tor

Do

mai

n-

Spec

ific

Ab

iliti

es*

Sensory

The Cattell-Horn-Carroll (CHC) Periodic Table of Human AbilitiesGf I RG RQ

Gwm WM MS AC

Gps R3 PT MT

Gs P N R9

Gt R1 R2 R4 R7 IT

Glr MA MM M6 FI FA FE SP F0 NA FW LA FF FX

Do

mai

n-I

nd

epen

den

t C

apac

itie

s*

Gq KM A3

Gc LD VL K0 LS CM MY

Gkn KL K1 A5 MK KF LP BC

Grw V RD RC RS WA SG EU WS

Acq

uir

ed K

no

wle

dge

Syst

ems* K2

Gk

Gp PI P2 P3 P4 P6 P7 P8

PC US UM U8 URU1U9

UP ULU1U9

UP UL

Motor

A1

Glr-Learning efficiency

Glr-Retrieval fluency

Broad ability

Narrow ability

* Three major domain categories based on Schneider & McGrew (2012)

(No well supported cognitive Gh & Gk narrow

abilities have been identified)

Ideas Words Figures

The WJ IV Cattell-Horn-Carroll (CHC) periodic table of COG/OL test elements: Ages 6 to 19

© Institute for Applied Psychometrics (IAP) Dr. Kevin McGrew 01-21-16

CO

G C

HC

clu

ster

s an

d g

-lo

adin

gs

.15

.18

.50

.29 .26

.18 .30

.29 Gq

.32 GrwCOG

ACH

OL

NonWRpPC,UM/MS

.90 .58 M .18

.34 .21 .41

NumSer

(RQ)

.93 .63 H .80

.63 .73 .64#H

ConFrm

(I)

.93 .65 M .66

.44 .47 .35H

AnlSyn

(RG)

.92 .62 M .62

.25 .43 .34

H

VrbAtn(WM/AC)

.86 .66 H .76

.42 .42 .50

HNumRev(WM/AC)

.86 .62 M .36

.41 .44 .42

#HObNmSq

(WM)

.89 .70 M .74

.32 .42 .35

HMemWrd

(MS)

.82 .63 M .58

.27 .28 .32

H

VsAdLg(MA)

.96 .52 L .48

.30 .29 .30

HStrRec

(MM)

.93 .59 M .54

.31 .39 .42

H

LtPMat(P)

.91 .56 M .77

.46 .55 .48

HPairCn(P/AC)

.89 .49 L .60

.30 .44 .28H

NmPMat(P)

.85 .53 M .80

.53 .57 .61

H

OrlVoc(LD/VL)

.89 .75 H .86

.56 .51 .49

MGenInf

(K0)

.84 .59 M .78

.35 .38 .24H

Visual(Vz)

.83 .60 M .70

.38 .49 .38H

PicRec(MV)

.71 .47 L .50

.36 .25 .36H

PhnPro(PC)

.83 .73 H .59

.52 .51 .53H H

SenRep(MS/LS)

.83 .60 M .48

.43 .35 .41

HUndDir(WM)

.86 .66 M .64

.45 .43 .39

H

RpdPNm(NA)

.85 .51 L .24

.34 .22 .22H

OralCmp(LS)

.82 .69 M .76

.52 .42 .48

LPicVoc

(VL)

.81 .65 M .82

.40 .32 .33

L

SndAwr(PC)

.71 .68 H .52

.56 .48 .55L

Segmnt(PC)

.93 .60 M .74

.49 .42 .44H

SndBld(PC)

.88 .53 L .62

.27 .31 .28H

.80 .57 M .42

RetFlu(FI)

.35 .37 .35H

.80 .57 M .42

GIA cluster tests

CHC COG cluster tests

.29

(Secondary CHC factor loading)

Relative degree of cognitive complexity

High Medium (M/M) Low

NumSer

(RQ)

.93 .63 H .80

.63 .73 .64#H

g-loading*

Reliability*

Specificity*

CHC broad factor

loading

Test name abbreviation

CHC narrow

ability code

BIS content or stimulus

characteristic

Median correlation with WJ IV Rdg, Math & WrLng clusters

[* high, med, low – as per Kaufman (1979) & McGrew & Flanagan (1998)]

• Reliability: The degree to which a test score is free from errors of measurement. Test score precision.

• Specificity: The portion of a test’s score variance that is reliable and unique to the test.

• g-loading: A test’s loading on the first unrotated factor or component in factor or principal component analysis.

• Cognitive complexity: The relative degree of cognitive information processing load (e.g., resource demands on working memory, attentional control, executive functions) demanded by a test.

• CHC narrow ability code: (see back)• BIS content characteristic:

Information based on analysis of the WJ IV 6-19

year old norm sample

Additional resources available at www.themindhub.com (MindHubTM)

#Verbal

Figural-visual

Speed-fluency

Quant.-numeric

Auditory

Relative degree of cognitive complexity:• High Medium (M/M) Low

CHC broad factor loading

Test name abbreviation

CHC narrow ability code(s)

All information based on analysis of WJ IV norm data from ages 6 thru 19

BIS content/stimulus characteristic

NmSeries

(RQ)

.93 .63 H .80

g-loading*

Reliability*

#

.63 .73 .64Median correlations with R, M, W clusters

H

Specificity*

[* high, med, low – as per Kaufman (1979) & McGrew & Flanagan (1998)]

The WJ IV Cattell-Horn-Carroll (CHC) periodic table of COG/OL test elements: Ages 6 to 19

© Institute for Applied Psychometrics (IAP) Dr. Kevin McGrew 01-20-16

CO

G C

HC

clu

ster

s an

d g

-lo

adin

gs

.15

.18

.50

.29 .26

.18 .30

.29 Gq

.32 GrwCOG

ACH

OL

NonWRpPC,UM/MS

.90 .58 M .18

.34 .21 .41

NumSer

(RQ)

.93 .63 H .80

.63 .73 .64#H

ConFrm

(I)

.93 .65 M .66

.44 .47 .35H

AnlSyn

(RG)

.92 .62 M .62

.25 .43 .34

H

VrbAtn(WM/AC)

.86 .66 H .76

.42 .42 .50

HNumRev(WM/AC)

.86 .62 M .36

.41 .44 .42

#HObNmSq

(WM)

.89 .70 M .74

.32 .42 .35

HMemWrd

(MS)

.82 .63 M .58

.27 .28 .32

H

VsAdLg(MA)

.96 .52 L .48

.30 .29 .30

HStrRec

(MM)

.93 .59 M .54

.31 .39 .42

H

LtPMat(P)

.91 .56 M .77

.46 .55 .48

HPairCn(P/AC)

.89 .49 L .60

.30 .44 .28

HNmPMat

(P)

.85 .53 M .80

.53 .57 .61

H

OrlVoc(LD/VL)

.89 .75 H .86

.56 .51 .49

MGenInf

(K0)

.84 .59 M .78

.35 .38 .24M/H

Visual(Vz)

.83 .60 M .70

.38 .49 .38H

PicRec(MV)

.71 .47 L .50

.36 .25 .36H

PhnPro(PC)

.83 .73 H .59

.52 .51 .53L/H H

SenRep(MS/LS)

.83 .60 M .48

.43 .35 .41

HUndDir(WM)

.86 .66 M .64

.45 .43 .39

H

RpdPNm(NA)

.85 .51 L .24

.34 .22 .22H

OralCmp(LS)

.82 .69 M .76

.52 .42 .48

LPicVoc

(VL)

.81 .65 M .82

.40 .32 .33

L

SndAwr(PC)

.71 .68 H .52

.56 .48 .55L

Segmnt(PC)

.93 .60 M .74

.49 .42 .44H

SndBld(PC)

.88 .53 L .62

.27 .31 .28H

.80 .57 M .42

RetFlu(FI)

.35 .37 .35H

.80 .57 M .42

GIA cluster tests

CHC COG cluster tests

Additional resources available at www.themindhub.com (MindHubTM)

Matrix Reasoning (I)Figure Weights (RQ)Picture Concepts (I)Arithmetic (RQ; Gq)

Similarities (VL/LD)Vocabulary (VL)Information (K0)Comprehension (LD/K0)

Block Design (Vz)Visual Puzzles (Vz/SR?)

Digit Span (MS,WM)Letter-Number Seq. (WM)Picture Span (WM/MS; Gv-MV?)Arithmetic (Gwm-WM)

Coding (R9/MA?)Symbol Search (P/R9; Gv-SS?)Cancellation (P/R9)

Naming Speed Literacy (NA)Naming Speed Quantity (NA)Immediate Symbol Translation (MA)Delayed Symbol Translation (MA)Recognition Symbol Translation (MA)

WISC-V tests & tentative CHC

classifications (based on

multiple sources)

Italic font designates Canivez,

Watkins & Dombrowski’s (2015)

conclusion that Gf and Gv are not

separate factors-instead combine

as perceptual reasoning

No Ga tests

WISC-V related measures

2-1 0 1-2

-1

0

1

2

ORLVOC

NUMSER

VRBATN

LETPAT

PHNPRO

VALVISUAL

GENINF

CONFRM

NUMREVNUMPAT

NWDREP

STYREC

PICREC

ANLSYN

OBJNUM

PAIRCN

MEMWRD

PICVOC

ORLCMP

SEGMNT

RPCNAM

SENREP

UNDDIR

SNDBLN

RETFLU

SNDAWR APPROB

SPELLPSGCMP

CALC

WRTSMP

ORLRDG

SNRDFL

MTHFLU

SNWRFL

RDGREC

NUMMAT

EDIT

WRDFLU

SPLSND

SCI SOCHUM

Exploratory MDS of WJ IV norm

subjects ages 6-19

© Institute for Applied Psychometrics; Kevin

McGrew 12-14-15

-2 -1 0 1

-2

-1

0

1

2

ORLVOC

NUMSER

VRBATN

LETPAT

PHNPRO

VALVISUAL

GENINF

CONFRM

NUMREV NUMPAT

NWDREP

STYREC

PICREC

ANLSYN

OBJNUM

PAIRCN

MEMWRD

PICVOC

ORLCMP

SEGMNT

RPCNAM

SENREP

UNDDIR

SNDBLN

RETFLU

SNDAWR APPROB

SPELLPSGCMP

CALC

WRTSMP

ORLRDG

MTHFLU

SNWRFL

RDGREC

NUMMAT

EDIT

WRDFLU

SPLSND

SCI SOCHUM

WJ IV test 2D MDS (Ages 6 to 19; n =

4,082)

www.iapsych.com/mimap.pdf

Relative degree of cognitive complexity:• High Medium (M/M) Low

CHC broad factor loading

Test name abbreviation

CHC narrow ability code(s)

All information based on analysis of WJ IV norm data from ages 6 thru 19

BIS content/stimulus characteristic

NmSeries

(RQ)

.93 .63 H .80

g-loading*

Reliability*

#

.63 .73 .64Median correlations with R, M, W clusters

H

Specificity*

[* high, med, low – as per Kaufman (1979) & McGrew & Flanagan (1998)]

Reliability

High

Medium

Low

The degree to which a test score is free from errors of measurement. Test score precision.

Coefficients of .90 or above.

Coefficients from .80 to .89 inclusive.

Coefficients below .80.

Important for making accurate educational and/or diagnostic decisions.

Test scores are sufficiently reliable and can be used to make diagnostic decisions.

Test scores are moderately reliable and can be used to make screening decisions or can be combined with other tests to form a composite with “high” reliability.

Test scores are not sufficiently reliable and cannot be used to make important screening or diagnostic decisions. Need to be combined with other tests to form a composite with “medium” or “high” reliability.

Specificity

High

Medium

Low

The portion of a test’s score variance that is reliable and unique to the test.

A test’s unique reliable variance is equal to or above 25% of the total test variance and it exceeds error variance (1-reliability).

When a test meets only one of the criteria for High.

When a test does not meet either of the criteria for High.

A test with high specificity may be interpreted as measuring an ability distinct within a battery of tests.

A test with medium specificity should be interpreted cautiously as measuring an ability distinct within a battery of tests).

A test with low specificity should not be interpreted as representing a unique ability but may prove useful in interpretation when it is considered as part of a composite or cluster of other similar tests.

Reliability

NmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

H

Spec

ific

ity

Specificity –reliable unique

variance

Reliable variance

Reliability

NmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

H

Spec

ific

ity

g-loading

High

Medium

Low

Each test’s loading on the first unrotated factor or component in principal factor or component analysis with all other tests from a specific intelligence battery.

General factor or g loading of .70 or higher.

A loading of .51 to .69.

A loading of .50 or lower.

Important indicator of the degree to which a test of an individual battery measures general intelligence. Aids in determining the extent to which a test score can be expected to vary from other scores within a profile.

Tests with high g loadings are not expected to vary greatly from the mean of the profile and are considered good indicators of general intelligence.

Tests with medium g loadings may vary from the mean of the profile as tests with this classification are considered fair indicators of general intelligence.

Tests with low g loadings can be expected to vary from the mean of the profile as tests with this classification are considered poor indicators of general intelligence.

T3 T4 T5 T6 T7 T8 T9 T10T1 T2

High g

Low g

Intelligence test batterytest g (general intelligence)

loadings (weights)

Derived from factor analysis

Think of a general intelligence pole that is saturated with more g-ness (like

magnetism) at the top and less g-ness at the bottom

Factor analysis orders the tests on the pole based on their saturation of g-ness

IQ test battery subtestg-loadings or saturation

Subtests

General intelligence (g)

T2 T3 T4 T5 T6 T7 T8 T9T1 T12T10 T11

g

(1a) Spearman’s general Factor model

© Institute for Applied Psychometrics; Kevin McGrew 1-18-15

Visual-Auditory Learning 0.62 0.38Nonword Repetition 0.62 0.38Symbol Search 0.62 0.38Analysis-Synthesis 0.61 0.37Number-Pattern Matching 0.59 0.35Story Recall 0.58 0.34Pair Cancellation 0.58 0.34Visualization 0.55 0.30

Picture Recognition 0.49 0.24Letter-Pattern Matching 0.48 0.23Coding 0.47 0.22

Cancellation 0.42 0.18

g h2

g h2

First (unrotated) principal component

for WJ IV COG & WISC-IV tests (n=173)

Arithmetic 0.81 0.66Phonological Processing 0.81 0.66Vocabulary 0.80 0.64Oral Vocabulary 0.80 0.64Information 0.79 0.62Concept Formation 0.78 0.61

Matrix Reasoning 0.75 0.56Similarities 0.74 0.55Verbal Attention 0.73 0.53Block Design 0.71 0.50General Information 0.71 0.50

Number Series 0.70 0.49Numbers Reversed 0.69 0.48Comprehension 0.69 0.48Letter-Number Sequencing 0.68 0.46Digit Span 0.65 0.42Object-Number Sequencing 0.64 0.41Picture Concepts 0.63 0.40

g-loading

NmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

H

CHC narrow factor classification

NmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

H

Ages 9-13 in technical manual

CHC broad factor loading (average)

NmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

H

What is (relational) cognitive complexity?

Two major classes of cognitive complexity theories (Bertling, 2012)

• Empirical: Post-hoc purely data-driven theories (e.g., Marshalek, Lohman & Snow, 1983)• g-loadings• Proximity to center of MDS spatial maps

• Cognitive theories: Working memory theories and the constraints placed on reasoning (e.g., Gf). Increasing processing demands results in an increase (demand/load) on cognitive resources

Load placed on working memory by a task

• Focus is on the sheer number of elements or element relations in a task

Relational Complexity theory (RC): The relational complexity of a task (e.g., Birney et al., 2006); Halford, 1993; Halford et al., 1988; Just & Carpenter, 1992)

• Focus is on the complexity of the interrelated elements (pieces of information) that need to be processed in parallel

Two types of cognitive theories regarding cognitive complexity

Example of task with high relational cognitive complexity (RC)

The processing load (demand on resources) imposed by interacting components of a task can be captured with the concept or relational complexity (Bertling, 2012; Birney et al., 2006; Halford et al., 1998)

• The key is the number of interacting variables (elements; arguments) that must be represented in parallel to implement the process

• Conceptually RC is similar to the number of factors in an experimental design

More on relational cognitive complexity (RC) theory

Processing complexity

• May depend on executive functions. • The strategy used by a person may differ across people or within

the same person at different times• The optimal strategy may not be the one that is best theoretically or

as generated by an artificial intelligence (AI) algorithm• Individuals operate in ways that are different from theoretically

optimal algorithms

More on relational cognitive complexity (RC) theory

• Increase the information processing demands of the tests within a specific narrow CHC domain.

• Design tests that place greater demands on:

• Cognitive information processing (cognitive load)• Greater allocation of key cognitive resources (working

memory or attentional control)• The involvement of more cognitive control or executive

functions

WJ IV cognitive complexity design approach based on work of Lohman & Larkin (2011)

© Institute for Applied Psychometrics; Kevin McGrew 1-18-15

ORLVOCVRBATN

LTPMAT

GENINF

NUMREVNMPMAT

NONWRP

STRREC

PICREC

ANLSYN

PAIRCN

MEMWRD

ORLCMP

SEGMNT

RPDPNM

SENREP

UNDDIR

SNDBLD

RETFLU

SPELL

CALC

ORLRDG

MTHFLU

SNWRFL

NUMMAT

EDIT

WRDFLU

SPLSND

SOCHUM

Speed-fluency

Reading-writing

WJ IV test 2D MDS (Ages 6 to 19; n = 4,082)

Quantitative-numeric

Figural-visual

Auditory

Verbal

#

© Institute for Applied Psychometrics (IAP) Dr. Kevin McGrew 01-20-16

Tests closest to the center

are considered more cognitively

complex

-2 -1 0 1

-2

-1

0

1

2

VRBATN

NUMREV

OBJNUM

MEMWRD

SENREP

UNDDIR

H

MM

L

Cognitive complexity interpretation aid

Gwm tests

H = High M = Moderate M = Low Mod.L = Low

© Institute for Applied Psychometrics; Kevin McGrew 01-06-16

-2 -1 0 1

-2

-1

0

1

2

NUMSER

VRBATNPHNPRO

VAL

VISUAL

GENINF

NUMREV

NWDREP

STYREC

PICREC

ANLSYN

OBJNUM

PAIRCN

MEMWRD

ORLCMP

SEGMNT

RPCNAM

SENREP

UNDDIR

SNDBLN

RETFLU

SNDAWR

Achievement tests removed for readability

H

MM

L

Cognitive complexity by CHC domain

interpretation aid

H = High M = Moderate M = Low Mod.L = Low

© Institute for Applied Psychometrics; Kevin McGrew 01-06-16

© Institute for Applied Psychometrics; Kevin

McGrew 12-14-15

Low

Hypothesized degree of

linguistic or language-domain

demand

Sentence Repetition

High

HighLow

Nonword Repetition

Memory for

Words

Numbers

Reversed

Understanding

Directions

Object-Number

Sequencing

Verbal

Attention

Gwm A - tasks that make greater use of the

articulatory rehearsal maintenance mechanism

(Camos, 2015)

Gwm B - tasks that make greater use the of

attentional refreshing maintenance mechanism

(Camos, 2015)

Distances betweentests intended to

reflect relative hypothesized

differences (not quantified) along two

axis

Linguistic/language dimension

classifications based on inspection of correlations with

other WJ IV tests of Gc and Ga and

Flanagan & Ortiz (2015) linguistic

demand classifications Hypothesized degree of central-executive control network

(cog. load; attentional control; degree of relational cognitive complexity)

Verbal/linguistic working memory

Attentionial control working memory

Cognitive complexity

NmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

H

ORLVOC

NUMSER

VRBATN

LETPAT

PHNPRO

STYREC

VISUAL

GENINF

NUMREVNUMPAT

NWDREP

VAL

PICREC

ANLSYN

OBJNUM

PAIRCN

SEGMNT

RPCNAM

SNDBLN

RETFLU

BD

SIM

DS

PCCN

COD

VOC

LNSQ

MR

CMP

SS

CAN INFAR

CONFRM

H = High M = Moderate M = Low Mod.L = Low

WJ IV and WISC-IV 2D MDS solutions (n=173)

© Institute for Applied Psychometrics; Kevin McGrew 01-06-16

0.4 0.5 0.6 0.7 0.8g-loading

-0.10

0.45

1.00

1.55

2.10

MD

S r

ela

tive

CC

(rC

C)

ORLVOC-Gc

SNDAWR-Ga

ORLCMP-Gc

VRBATN-Gwm

PHNPRO-Ga

OBJNUM-Gwm

NUMSER-GfCONFRM-GfUNDDIR-Gwm

NUMREV-Gwm

PICVOC-Gc

STYREC-Glr

SENREP-Gwm

SEGMNT-Ga

ANLSYN-GF

MEMWRD-Gwm

SNDBLN-Ga

VISUAL-Gv

RETFLU-GlrLETPAT-Gs

NWDREP-GaNUMPAT-Gs

PAIRCN-Gs

RPCNAM-Glr

PICREC-Gv

Relationship between g-loadings and MDS-based relative cognitive complexity (rCC) for WJ IV COG and OL tests

Correlation = .93; but correspondencediverges as test become higher in g and

cognitive complexity

© Institute for Applied Psychometrics; Kevin McGrew 06-19-14

Median test correlations with R, M, & W

clusters for ages 6-19 (not reported in technical

manual)

NmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

H

0 50 100 150 200

Number Series

0

50

100

150

200

Bro

ad

Re

ad

ing

For example, correlation of Number Series with Broad Reading cluster

External criterion relations validity

NmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

HMedian correlation with WJ IV reading, math, and written

language achievement clusters

(ages 6-19)

Faceted models of intelligence

Auditory has recently been proposed to be added

Cognitive operations and content dimensions

There has been an explosion of research on auditory abilities since Carroll’s (1993) seminal work (Schneider & McGrew, 2012). A wide-ranging collection of Ga characteristics have been related to disorders of reading, speech, and language. For example, Gaabilities are now recognized as playing a pivotal scaffolding role in the development of language and general cognitive abilities

(Conway, Pisoni, & Kronenberger, 2009).

© Institute for Applied Psychometrics; Kevin McGrew 06-18-14© Institute for Applied Psychometrics; Kevin McGrew 06-18-14

Verbal Auditory Figural Numeric

Reasoning

Memory

Creativity

Speed

Stimulus Content

Co

gnit

ive

Op

erat

ion

s

The BIS intelligence framework (2016)

#

Auditory (A)

Speed-fluency (SF)

Quantitative-numeric

#

Reading-writing (RW)

Verbal (V)

Figural-visual (FV)

2-1 0 1-2

-1

0

1

2

ORLVOC

NUMSER

VRBATN

LETPAT

PHNPRO

VALVISUAL

GENINF

CONFRM

NUMREV NUMPAT

NWDREP

STYREC

PICREC

ANLSYN

OBJNUM

PAIRCN

MEMWRD

PICVOC

ORLCMP

SEGMNT

RPCNAM

SENREP

UNDDIR

SNDBLN

RETFLU

SNDAWR APPROB

SPELLPSGCMP

CALC

WRTSMP

ORLRDG

MTHFLU

SNWRFL

RDGREC

NUMMAT

EDIT

WRDFLU

SPLSND

SCI SOCHUM

Gwm

Gv

Glr

Glr-LA

Gs-COG

Gs-ACH

Gq

Grw

Gc

Another perspective:

BIS framework: The stimulus

content dimension

Ages 6-19

Ga

Auditory-linguistic

Speed-fluency (SF)

Quantitative-numeric

#

Reading-writing (RW)

Figural-visual (FV)

2-1 0 1-2

-1

0

1

2

ORLVOC

NUMSER

VRBATN

LETPAT

PHNPRO

VALVISUAL

GENINF

CONFRM

NUMREV NUMPAT

NWDREP

STYREC

PICREC

ANLSYN

OBJNUM

PAIRCN

MEMWRD

PICVOC

ORLCMP

SEGMNT

RPCNAM

SENREP

UNDDIR

SNDBLN

RETFLU

SNDAWR APPROB

SPELLPSGCMP

CALC

WRTSMP

ORLRDG

MTHFLU

SNWRFL

RDGREC

NUMMAT

EDIT

WRDFLU

SPLSND

SCI SOCHUM

Gwm

Gv

Glr

Glr-LA

Gs-COG

Gs-ACH

Gq

Grw

Gc

Another perspective:

BIS framework: The stimulus

content dimension

Ages 6-19

Ga

-2 -1 0 1

-2

-1

0

1

2

ORLVOC

NUMSER

VRBATN

LETPAT

PHNPRO

VALVISUAL

GENINF

CONFRM

NUMREV NUMPAT

NWDREP

STYREC

PICREC

ANLSYN

OBJNUM

PAIRCN

MEMWRD

PICVOC

ORLCMP

SEGMNT

RPCNAM

SENREP

UNDDIR

SNDBLN

RETFLU

SNDAWR APPROB

SPELLPSGCMP

CALC

WRTSMP

ORLRDG

MTHFLU

SNWRFL

RDGREC

NUMMAT

EDIT

WRDFLU

SPLSND

SCI SOCHUM

Speed-fluency

Reading-writing

WJ IV test 2D MDS (Ages 6 to 19; n =

4,082)

Quantitative-numeric

Figural-visual

AuditoryVerbalVerbal

#

BIS content/stimulus characteristicNmSeries

(RQ)

.93 .63 H .80

#

.63 .73 .64

H

It’s a pleasure when you use the correct measure

How to evaluate the unusualness (base rate) of WJ IV cluster or test score differences

Kevin McGrew, PhD.Educational/School Psychologist

DirectorInstitute for Applied Psychometrics (IAP)

© Institute for Applied Psychometrics; Kevin McGrew 11-23-15

© Institute for Applied Psychometrics; Kevin McGrew 11-23-15

CorrelationSD(diff) 1.50 (≈ 13 % base rate)SD(diff) 1.65 (≈ 10% base rate)

Oral Vocabulary

General Information

Number Series

Concept Formation

Verbal Attention

Number Reversed

Story Recall

Vis-Auditory Learning

Visualization

Picture Recogntion

Let-Pattern Matching

Pair Cancellation

PhonologicalProcessing

Nonword Repetition

.71/≈18/≈20

Gc-Ext

Gc

.97/≈5/≈6

.47/≈24/≈27

Gf-Ext

.94/≈8/≈9

Gf

.47/≈24/≈27

Gwm-Ext

.94/≈8/≈9

Gwm Glr

.34/≈27/≈30 .43/≈25/≈28 .37/≈27/≈29 .60/≈21/≈24

Gv Ga Gs

Select WJ IV COG cluster/test score significance values (ages 6-19) *

GIA (7 tests)

SAPT’s (4 tests)

Gf-Gc (4 tests)

BIA (2 tests)

.87/≈12/≈13

.86/≈12/≈13

.94/≈8/≈9

* Rounded values

calculated in WJ IV norm data(ages 6 to 19)

….and…..”intelligent” intelligent testing is complex….and important

We are the instrument !!!!In the remainder of this

presentation I will model “intelligent” intelligence

test interpretation for the WJ IV COG+OL

Will provide you with some aids and templates to organize thinking and

test data

Not to be used as cookbooks