future of assessment in india challenges and solutions hariharan swaminathan university of...

81
FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Upload: marshall-victor-gardner

Post on 19-Jan-2016

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS

HARIHARAN SWAMINATHANUNIVERSITY OF CONNECTICUT

Page 2: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing: A Brief History

1. Testing is part of human nature and has a long history. In the old testament, “Mastery Testing” was used in the old testament to “classify” people into two groups:

• The Gileadites defeated the Ephraimites, and when the Ephraimites tried to escape, they were given a high stakes test;

• The test was to pronounce the word “shibboleth”. Failure had a drastic consequence.

2. Civil service exams were used in China more than 3000 years ago.

Page 3: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing: A Brief History

3. Testing in India may go even further back in time

4. In the West, testing has a mixed history – its use has waxed and waned.

5. In the US, Horace Mann argued for written exam (objective type) and the first was introduced in Boston in 1845

6. Tests were used for grade-to-grade promotion

7. This testing practice fell into disrepute because of teaching to the test.

Page 4: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing: A Brief History

8. Grade promotion based on testing was abolished in Chicago in 1881.

9. Binet introduced mental testing in 1901 (became the Stanford Binet Test).

10. The issue of fairness that “everyone should get the same test” was not relevant to Binet.

11. Binet rank ordered the items in order of difficulty and targeted items to the child’s ability.

12. The first “Adaptive Test” was born.

Page 5: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

• Adaptive testing was the primary mode of testing before the notion of group testing was introduced.

• With the advent of group testing, individualized (adaptive) testing went to the back burner.

• Primarily because it was impossible to administer to large groups.

• Group based adaptive testing was not feasible, Until……

• We will return to adaptive testing later.

Individualized Testing

Page 6: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing in India

• Testing in India has a long history

Knowledge/Skill testing was common in ancient India

• Rama was subjected to competency testing by Sugriva

• Yudhisthira was tested with a 133 item, high stakes test (Yaksha Prashna)

• According to the Bhagavatam, the name Parikshit (Examiner) was given to the successor of Yudhisthira

Testing in the form of puzzles and games was often used for entertainment

Page 7: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing in India

India is a country of superlatives

• India can boast of probably the longest tradition of education, stretching over at least two and half millennia

• Taxasila is considered the oldest seat of learning in the world

• Nalanda perhaps the oldest university in the world

• This tradition of learning and the value placed on education continues in India

Page 8: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing in India

Population Distribution in India

Country Age Range NumberIndia 0-4 128 Million

5-9 128 Million 10-15 128 Million 0-15 384 Million

USA 0-15 32 Million USA Total Population 352 Million

Page 9: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing in India

• These numbers are expected to decline slightly over the next twenty years, but nevertheless, will far exceed that of any other country in the world.

• The tradition of learning, the value placed on education, and the population explosion has created considerable stress on the Indian education system.

• It is not surprising that assessment and testing procedures in India have focused on selection and certification .

Page 10: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing in India

• Some of the selection examinations in India are perhaps the most grueling and most selective in the world.

-- IAS Examination: of the 450,000 candidates, 1200 selected (.3% )

-- IIT Joint Entrance examination: of the 500,000 applicants only about 10,000 are selected for admission (2%)

• According to the scientific advisor to the previous prime minister, C.N.R. Rao, “India has an examination system but not an education system.”

• Another criticism levelled against the examination system is that they promote intensive coaching with little regard for a properly grounded knowledge base.

Page 11: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Testing in India

• Needless to say, these criticisms are not unlike the criticism levelled against testing in the US.

• However, intensive testing in schools in the US is now being directed more towards assessment of learning and growth for accountability purposes rather than on assessment of student achievement for the purposes of certification and promotion.

• Although assessment and testing play an important role in Indian education, assessment practices in India do not seem to have kept pace with the modern approaches and trends in testing and assessment.

Page 12: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Test Uses in the U.S.

1. Management of Instruction

2. Placement and Counseling

3. Selection

4. Licensure and Certification

5. Accountability

Page 13: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

1. Management of Instruction• Classroom and standardized tests for daily

management of instruction (formative and diagnostic evaluation)

• Classroom and standardized tests for grading (summative evaluation)

2. Placement and Counseling• Standardized tests for transition from one level of

school to another or from school to work

Test Uses in the U.S.

Page 14: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

3. Selection (Entry Decisions)

• standardized achievement and aptitude tests for admission to college, graduate school, and special programs

4. Licensure and Certification

• Standardized tests for determining qualification for entry into a profession

• School graduation requirement

Test Uses in the U.S.

Page 15: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

5. Accountability

• Standardized tests to show satisfactory achievement or growth of students in schools receiving public money, often required by state and federal legislation. If students in schools do not show adequate progress, sanctions are imposed on the schools and on the state.

Test Uses in the U.S.

Page 16: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

• For all these purposes we need to design tests for measuring the student’s “ability” or “proficiency”

• The use of the ability/proficiency test scores must be appropriate for the intended use

• Tests are measurement instruments and when we use them to measure, like with all measurement devises, we make measurement errors

Theoretical Framework for Tests

Page 17: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

• The objective of measurement is to measure what we want to measure appropriately and do so with minimum error.

• The construction of tests and the determination of an examinee’s proficiency level/ability scores are carried out within one of two theoretical frameworks:

• Classical Test Theory

• Modern Test Theory or Item Response Theory (IRT)

Theoretical Framework for Tests

Page 18: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Classical Test Theory Model

X = T + E

Observed Score True Score Error

= Observed Score Variance

= True Score Variance

= Error Variance

2T2E

2X

Page 19: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Indices Used In Traditional Test Construction

• Item and Test Indices:

Item difficulty

Item discrimination

Test score reliability

Standard Error of Measurement

• Examinee Indices:

Test score

Page 20: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Classical Item Indices: Item Difficulty

• Item difficulty : Proportion of examinees answering the item correctly

• It is an index of how difficult an item is.

• If the value is low, it indicates that the item is very difficult. Only a few examinees will respond correctly to this item.

• If the value is high, the item is easy as many examinees will respond correctly to it.

Page 21: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Classical Item Indices: Item Discrimination

• Item discrimination: correlation between item score and total score

• A value close to 1 indicates that examinees with high scores (ability) answer this item correctly, while examinees with low ability will respond incorrectly

• A low value implies that there is hardly any relationship between ability and how examinees respond to this item

• Such items are not very useful for separating high ability examinees from low ability examinees

• Items with high values of discrimination are very useful for ADAPTIVE TESTING

Page 22: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Standard Error Of Measurement

• Indicates the amount of error to be expected in the test scores

• Arguably, the most important quantity

• Depends on the scale, and therefore difficult to assess its magnitude

• Can be re-expressed in terms of reliability which varies between 0 and 1.

Page 23: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Test Score Reliability

• Reliability Index, is defined as the correlation between true scores on “parallel” tests.

• It will take on value between 0 and 1, with 0 denoting totally unreliable test scores and 1 perfectly reliable test scores.

• It is related to the Standard Error of Measurement according to the expression

• If the test scores are perfectly reliable, =1 and = 0.

Page 24: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Reliability

• Error in scores is due to factors such as testing conditions, fatigue, guessing, emotional or physical condition of student, etc.

• Different types of reliability coefficients reflect different interpretations of error

Page 25: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Reliability

1. Reliability refers to consistency of test scores

• over time

• over different sets of test items

2. Reliability refers to test results, not the test itself

3. A test can have more than one type of reliability coefficient

4. Reliability is necessary but not sufficient for validity

Page 26: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Shortcomings of the Indices based on Classical test Theory

• They are group DEPENDENT

i.e., they change as the groups change.

• Reliability and hence Standard Error of Measurement are defined in terms of Parallel Tests, almost impossible to realize in practice.

Page 27: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

And what’s wrong with that?

• We cannot compare item characteristics for items whose indices were computed on different groups of examinees

• We cannot compare the test scores of individuals who have taken different sets of test items

Page 28: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Wouldn’t it be nice if ….?

• Our item indices did not depend on the characteristics of the individuals on which the item data was obtained

• Our examinee measures did not depend on the characteristics of the of items that was administered

Page 29: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

ITEM RESPONSE THEORY solves the problem!*

* Certain conditions apply. Individual results may vary. IRT is not for everyone, including those with small samples. Side effects include nausea, drowsiness, and difficulty swallowing. If symptoms persist, consult a psychometrician. For more information, see Hambleton and Swaminathan (1985), and Hambleton, Swaminathan and Rogers (1991).

Page 30: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Item Response Theory

• Based on the postulate that the probability of a correct response to an item depends on the ability of the examinee and the characteristics of the item

Page 31: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

• The mathematical relationship between the probability of a response, the ability of the examinee, and the characteristics of the item is specified by the

ITEM RESPONSE MODEL

The Item Response Model

Page 32: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Item Characteristics

• An item may be characterized by its

DIFFICULTY level (usually denoted as b),

DISCRIMINATION level (usually denoted by a),

“PSEUDO-CHANCE” level (usually denoted as c).

Page 33: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

-5 -4 -3 -2 -1 0 1 2 3 4 50.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

Item Response Model

Theta (Proficiency)

Pro

bab

ilit

y o

f C

orr

ect

Resp

on

se

a = 0.5b = -0.5c = 0.0

Page 34: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

-5 -4 -3 -2 -1 0 1 2 3 4 50.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

Item Response Model

Theta (Proficiency)

Pro

bab

ilit

y o

f C

orr

ect

Resp

on

se

a = 2.0b = 0.0c = .25

a = 0.5b = -0.5c = 0.0

Page 35: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

-5 -4 -3 -2 -1 0 1 2 3 4 50.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

Item Response Model

Theta (Proficiency)

Pro

bab

ilit

y o

f C

orr

ect

Resp

on

se

a = 2.0b = 0.0c = .25

a = 0.8b = 1.5c = 0.1

a = 0.5b = -0.5c = 0.0

Page 36: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

IRT Item Difficulty b

• Differs from classical item difficulty

• b is the θ value at which the probability of a correct response is .5

• The harder the item, the higher the b

• b is on the same scale as θ and does not depend on the characteristics of the group of test takers

Page 37: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

IRT Item Discrimination a

• Differs from classical item discrimination

• a is proportional to the slope of the ICC at θ = b

• The slope indicates how much the probability of a correct response changes for individuals with slightly different θ values, i.e., how well the item discriminates between them

Page 38: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

IRT Item Guessing Parameter

• No analog in classical test theory

• c is the probability that an examinee with very low θ will answer the item correctly

• b is now the θ value at which the probability of a correct response is (1 + c)/2

Page 39: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Item Response Models

( )

( )(correct response )

1

b

b

eP

e

• The One-Parameter Model (Rasch Model)

• The Two-Parameter Model( )

( )(correct response )

1

a b

a b

eP

e

• The Three-Parameter Model( )

( )(correct response ) (1 )

1

a b

a b

eP c c

e

Page 40: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

How Is IRT Used In Practice?

• Test construction

• Equating of test forms

• Vertical scaling (for growth assessment)

• Detection of differential item functioning

• Adaptive testing

Page 41: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Test Construction

• Traditional approach: select items with p-values in the .2 - .8 range and as highly discriminating as possible

• We cannot, however, design a test that has the required reliability, SEM, and score distribution.

• We cannot design a test with pre-specified characteristics.

Page 42: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Test Construction (cont.)

• IRT approach: INFORMATION FUNCTIONS

• The information provided by the test about an examinee with given ability is directly related to the Standard Error of Measurement

• We CAN assemble a test that has the characteristics we want, impossible to accomplish this in a classical framework

Page 43: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Test Construction (cont.)

• The TEST INFORMATION FUNCTION specifies the information provided by the test across the θ range

• Test information is a sum of the information provided by each item

• Because of this property, we can combine items to obtain a pre-specified test information

Page 44: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Test Construction (cont.)

• Items can be selected to maximize information in desired θ regions depending on test purpose

• Test can be constructed of minimal length to keep standard errors below a specified maximum

• By selecting items that have optimal properties, we can create a shorter test that have the same degree of precision as a longer test

Page 45: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Item Information Functions

• Bell-shaped

• Peak is at or near difficulty value b: item provides greatest information at θ values near b

• Height depends on discrimination; more discriminating items provide greater information over a narrow range around b

• Items with low c provide most information

Page 46: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Test And Item Information Functions

-2.5 -2.0 -1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 2.0 2.50

1

2

3

4

5

6

7

8

Theta

Info

rma

tio

n

Page 47: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

And what do we get for all this?

• The proficiency level (ability) of an examinee is not tied to the specific items we administer

• We CAN compare the ability scores of examinees who have taken different sets of test items

• We can therefore match items to examinee’s ability level and measure ability more precisely with shorter tests

Page 48: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

And what do we get for all this?

• We can create a bank of items by administering different items to different groups of examinees at different times

• This will allow us to administer comparable tests or individually tailored tests to examinees

• By administering different items to different individuals or groups we can improve test security and minimize cheating

Page 49: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

And what do we get for all this?

• We can ensure the fairness of tests by making sure the test and test items are functioning in the same way across different groups

• For assessment of learning, we can give different SHORT tests made up of items that measure the entire domain of skills; otherwise such coverage will require a very long test, and be unmanageable.

Page 50: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Equating

• Purpose of equating is to place scores from one form of a test on the scale of another

• The goal of equating is for scores to be exchangeable; it should not matter to examinees which form of the test they take

• True equating is not strictly possible using traditional procedures

Page 51: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

IRT “Equating”

• Under IRT, equating is not necessary• But we have to undo the scaling; when we

calibrate a test, i.e., estimate the ability scores and item parameters, we commonly “standardize” the ability scores. We have to undo this scaling to place the parameters on a common scale.

• IRT “equating” is simply rescaling. • We will describe the scaling procedure in detail

later.

Page 52: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Differential Item Functioning (DIF)

• When we design a test to measure a construct of interest, that test should not measure something else that is irrelevant.

• For example, if we are measuring reading comprehension, we should not include items that have mathematical content (This would be silly unless this skill is a relevant and required skill).

Page 53: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Differential Item Functioning (DIF)

• Similarly, we should not include items that have a heavy reading component if our purpose is to measure the mathematical ability of the student.

• Such items will favor one group over another.

• Construct irrelevant items adversely affect validity.

Page 54: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Differential Item Functioning (DIF)

• IRT provides a natural framework for defining and assessing DIF

Definition:An item shows DIF if two examinees at the same ability level but from different groups do not have the same probability of answering the item correctly.

Page 55: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Differential Item Functioning (DIF)

• Detecting and eliminating these items are carried out routinely in all testing programs.

• It is a crtical part in the validation process, in determining if construct irrelevant variables pose threats to the intended uses of the test.

Page 56: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Tailored Testing

• Fred Lord introduced Item Response Theory in the early 50’s and with it, the notion of Tailored Testing.

• Without computers, tailored testing was not feasible.

• To overcome this problem, Lord developed “FLEXILEVEL TESTING”

• Flexilevel test follows Binet’s idea; only the difficulty level of the item is used in routing

Page 57: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Computerized Adaptive Testing (CAT)

• Adaptive testing is the process of tailoring the test items administered to the best current estimate of an examinee’s trait level

• Items are most informative when their difficulty is close to the examinee’s θ value

• Different examinees take different tests

• Only through IRT can items be appropriately selected, trait values estimated after each item administered, and the resulting test scores compared

Page 58: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Advantages Of CAT

• Testing time can be shortened

• Examinees’ trait values can be estimated with a desired degree of precision

• Scoring and reporting can be immediate

• Scoring errors and loss of data are reduced

• Test security is preserved (in theory)

• Paper use is eliminated

• Need for supervision is reduced

Page 59: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Adaptive Testing

• Items are pre-calibrated and stored in a bank

• Item bank should be large and varied in difficulty

• Examinee is administered one or more items of moderate difficulty to obtain initial trait estimate

Page 60: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Computerized Adaptive Testing (CAT)

• Once an initial θ estimate is obtained, items are selected for administration based on their information functions – the most informative item at the current estimate of θ is selected, taking into account content considerations

• The trait estimate is updated after each item response

• Testing is terminated after a fixed number of items or when the standard error of the estimate is at a desired level

Page 61: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Computerized Adaptive Testing (CAT)

• The Office of Naval Research, the Army, and the Air Force funded research for advancing CAT during the 70s.

• David Weiss and his team at University of Minnesota were funded for developing operational procedures for implementing CAT

• I was funded for the development of Bayesian estimation procedures so that we can estimate item parameters more accurately

• All these activities were motivated because of the large volume of test takers in the armed forces

Page 62: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Operationalizing CAT

• I was on the Board of Directors of GRE in the mid 80s.

• The Board authorized and funded research for implementing GRE-CAT

• GRE CAT was operational in the early 90s. GMAT followed suit soon after.

Page 63: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Theory v. Practice

• First clash between theory and practice occurred in the implementation of CAT

• As Personal Computers were not common GRE had to contract with a delivery system provider

• “Seat-time” was the major obstacle.

• In theory, testing should continue until the stopping criterion, prescribed standard error, was reached. Instead the time i.e., test length, had to be fixed.

• Examinees had to complete 80% of items

Page 64: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Issues

• Item Bank: A large item bank is needed and maintained well. In developing item banks items from paper and pencil administration should not be used without careful investigation.

• Exposure Control: In high stakes testing, exposure control is critical

• Content Specification and Balancing: This is a critical issue and must be addressed early on in the development of item bank and item selection criteria

Page 65: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Issues (cont’d)

• CAT algorithm: Unchecked, a CAT algorithm will be greedy and choose items with the highest information. Algorithms for selecting items with content balancing must be in place.

• Item parameter shift : Over time item parameter values will change because of instruction, targeted instruction, and exposure of items. Item parameters must be re-estimated and items that show large drifts must be eliminated.

Page 66: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Issues (Cont’d)

• Item BIAS (Differential Item Functioning): Performance of subgroups on items must be examined to determine if subgroups are performing differentially on items . This analysis must be carried out not only in the development of the item bank but also during operational administrations.

Page 67: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

VALIDITY

"The concept of validity refers to the appropriateness, meaningfulness, and usefulness of the specific inferences made from test scores. Test validation is the process of accumulating evidence to support such inference."

Standards for Educational and Psychological Testing

AERA/APA/NCME

Page 68: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

VALIDITY

-- refers to the appropriateness of interpretations of test scores

-- is not a property of the test itself

-- is a matter of degree

-- is specific to a particular use of the test

Page 69: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

VALIDITY

is assessed using evidence from three categories:

content-related

criterion-related

construct-related

Page 70: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Validity of Ancient Tests

• Was Sugriva’s Test of Rama’s Skills Valid for the intended use?

• Sugriva could have given Rama a general knowledge test to see if Rama knew of Vali’s strength and skill in battle

Page 71: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Validity of Ancient Tests

• Sugriva could have given Rama a math test to determine if Rama could compute the angle to shoot the arrow to reach the target.

• Sugriva could have given Rama a Physics test to determine if he could calculate the force necessary to penetrate the giant trees.

Page 72: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Validity of Ancient Tests

• But then anyone with a degree of general knowledge, and knowledge of mathematics and physics could have answered the question, without even knowing how to use a Bow and Arrow.

• Sugriva, chose to give Rama an “authentic” test to determine his ability.

• The test was clearly appropriate for the use and therefore Valid.

Page 73: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Validity of Ancient Tests

• The God of Death, Yama, could have given Yudhisthira a general knowledge test.

• But that would not have served his purpose. The God Yama designed a test that was valid for its

intended purpose.

Page 74: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

Innovative Item Formats

• Sugriva and the God Yama have shown how to choose the right item format to conduct their examinations.

• With the aid of computers, we can design tests with innovative item formats.

• These item formats will permit authentic assessments of the skills we intend to measure.

• These formats are currently being used in credentialing examinations.

Page 75: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

In closing….

I have provided a very general overview of:

• How tests are used in the US

• How testing is being viewed in the US

• The Classical and the modern IRT frameworks that underpin the development of measurements

Page 76: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

In closing….

I have provided a very general overview of:

• The advantages IRT offers over the classical framework, i.e., how IRT

Provides item characteristics that are invariant over subgroups

Provides proficiency scores that are not dependent on the sets of items

Enables the construction of tests that have pre-specified standard error of measurement and reliability

Page 77: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

In closing….

how IRT

Enables the determination of proficiency with the desired accuracy at critical points on the proficiency continuum

Enables the delivery of individualized and targeted tests for the efficient determination of proficiency scores

Page 78: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

In closing….

• Are any of these innovations and approaches to testing relevant or applicable in India?

• Assessment is a tradition in India.

• By necessity, assessment has been employed primarily for selection

• These assessments, while necessary, can be streamlined and made shorter and targeted through CAT

• Advances in technology have made innovative item formats possible. Higher order skills and creativity can be assessed through these innovative item formats.

Page 79: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

In closing….

• India can most certainly benefit from the advances made in assessment.

• The weak link in Indian assessment system is that not much attention seems to have been paid to the issue of validity.

• The current system of examinations has had some negative side effects. It has promoted extensive coaching and cheating, two factors that can lead to inequity in education and stifle creativity.

Page 80: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

In closing….

• Assessment can play a critical role in the education system, and lead to improvements in learning.

• Cognitive Diagnostic Assessment is receiving considerable attention in the US.

• Cognitive diagnostic assessment can be applied successfully to identify misconceptions, especially in the mathematics and science areas, and through feedback to students, student learning and instructional techniques can be enhanced.

Page 81: FUTURE OF ASSESSMENT IN INDIA CHALLENGES AND SOLUTIONS HARIHARAN SWAMINATHAN UNIVERSITY OF CONNECTICUT

India Has Got Talent!

• India has the talent and manpower to lead the way in the use of technology in instruction and assessment.

• We have world class experts, such Professor Dhande, on the panel here to provide leadership in this area.

• Indian students, if given a fair chance, can become leading world class scholars, a fact evidenced by the achievements of Indian students who have gone abroad to seek education.

• India can most certainly benefit from the advances made in assessment. By judiciously applying assessments, it can transform itself from a country of examination systems to a country of education systems.