stages of test development

Stages of Test Development

By Lily Novita - 69090007

Make a full and clear statement of the testing ‘problem’.

Write complete specifications for the test.

Write and moderate items.

Trial the items informally on native speakers and reject or modify problematic ones as necessary.

Trial the test on a group of non-native speakers similar to those for whom the test is intended.

Analyse the results of the trial and make any necessary changes.

Calibrate scales.

Validate.

Write handbooks for test takers, test users and staff.

Train any necessary staff (interviewers, raters, etc.).

1. Stating the Problem1. Stating the Problem

The essential initial step in any testing is to make oneself perfectly clear what one wants to know and for what purpose

What kind of test it is constructed for? What is the precise purpose? What abilities are to be tested? How detailed must the results be? How accurate must the results be? How important is backwash? What constraints are set by unavailability of

expertise, facilities, time ? (for construction, administration and scoring)

2. Writing 2. Writing specifications specifications

for the testfor the test Content

Operations Types of text Addresses of texts Length of text(s) Topics Readability Structural range Vocabulary Range Dialect, accent, style Speed of processing


for the testfor the testStructure , timing, medium/channel and techniques

Test structure Number of items Medium / channel Timing Techniques


for the testfor the test Criterial levels of performance

AccuracyAppropriacyRangeFlexibilitySize


for the testfor the test Scoring procedures

Subjectivity Achievement of high reliability

& validity in scoring Rating scale to be used? No. of people rating each piece

of work? Solutions on disagreements

between raters

3. Writing and 3. Writing and moderating itemsmoderating items

SamplingWriting itemsModerating items

4. Informal trialling of 4. Informal trialling of items on native items on native

speakersspeakersModeration of grammar test Ye

sNo

1. Is the English grammatically correct?

2. Is the English natural and acceptable?

3. Is the English in accordance with the specifications?

4. Does the item test what it is supposed to test, as specified?

5. The correct response cannot be obtained without the appropriate knowledge of grammar (other than random sampling)

6. Is the item economical?

7. (a) Multiple choice – is there just one correct response? (b) Gap filling – are there just one or two correct responses?

8. Multiple choice : Are all the distractors likely to distract?

9. Is the key complete and correct?

5. Trialling of the test on a group of non-5. Trialling of the test on a group of non-

native speakers similar to those for whom the native speakers similar to those for whom the

test is intendedtest is intended trials are designed to help ensure that

the items function appropriately and are not confusing for the students.

this is accomplished by embedding field test items in the operational test, to ensure that the items are taken by a representative group of motivated students under standard conditions.

6. Analysis of the results of 6. Analysis of the results of the trial – making of any the trial – making of any

necessary changesnecessary changes 2 kinds of analysis should be carried

out : Statistical analysis : reveals qualities

(reliability) as a whole and individual items – how difficult they are , how well they discriminate between stronger and weaker candidates.

Qualitative analysis : responses are examined to discover misinterpretations, unanticipated but possibly correct answers and indicators of other faulty items.

7. Calibration of scales7. Calibration of scales

It means collecting samples of performance which cover the full range of the scales.

A calibration test is a procedure in which an instrument, tool, or device is tested to confirm that it conforms with the standard. Calibration is very important, as it ensures that objects are working properly. There are a number of reasons to conduct a calibration test, ranging from concerns that something is not working as it should to preparations for an event in which very precise calibration is desired, and there are a number of ways to perform a calibration.

8. Validation 8. Validation

Essential validation – for high stakes or published tests

Small-scale validation – for low stakes used within an institution

9. Writing handbooks for test 9. Writing handbooks for test takers, test users and staffs takers, test users and staffs

(contents)(contents) The rationale for the test;

An account of how the test was developed and validated

A description of the test

Sample items

Advice on preparing for taking the test

An explanation of how test scores are to be interpreted

Training materials

Details of test administration

10. Training Staff10. Training Staff

All staffs who will be involved in the test process should be trained : interviewers, raters, scorers, computer operators, and invigilators.

stages of test development

Documents

item test

test users

test takers

operational test

field test items

purposewhat kind of

individual items

moderate items