karthik muralidharan on research on achieving universal quality primary education in india
DESCRIPTION
A presentation by Prof. Karthik Muralidharan on research on achieving universal quality primary education in India. This was presented at the Commission for Science and Technology (COSTECH) in Dar es Salaam, Tanzania, on June 19, 2014, to an audience of researchers.TRANSCRIPT
Karthik MuralidharanUC San Diego, NBER, BREAD, and J-PAL
COSTECHDar Es Salaam, 19 July 2014
Achieving universal quality primary education in India
Lessons from the Andhra Pradesh Randomized Evaluation Studies (AP RESt)
Agenda
Background / motivationDesign of APREStResultsPolicy implications
3
There have been sharp improvements in various
measures of school quality in the past decade
Source: Kremer et al (2005) for 2003 data; Muralidharan et al (2013) for 2010 data; Enrollment data from World Bank (2003) and ASER (2010)
Enrollm
ent
Drinking W
ater
Toilets
Electricit
y
Mid-Day M
eals
Teachers
are Paid Regularly
Recognition Sch
eme Exists
Inspecte
d in La
st 3 m
onths
PTA Met in
Last
3 months
PTR0
102030405060708090
100
20032010
4
Despite improvements in inputs, learning levels are alarmingly
low
Source: ASER 2012
Basic Arithmetic
Children in class 1 who can't subtract
Children aged 6-14 who can't subtract
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
96%
59%
Basic Reading
Children in class 1 who can't read at grade level
Children aged 6-14 who cannot read a 2nd-class level
paragraph
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
93%
62%
5
Increased expenditure alone is unlikely to improve learning
Source: Spending – Accountability Initiative (www.accountabilityindia.in); Outcomes - ASER www.asercentre.org.
2005-06 2010-11 2011-12 2012-20130
20,000
40,000
60,000
80,000
100,000
120,000
140,000
160,000
0
2000
4000
6000
8000
10000
12000
14000Elementary Education Budget
Rs. C
rore
s
Spen
ding
per
stu
dent
in R
s.
2007 2008 2009 2010 2011 20120%
10%
20%
30%
40%
50%
60%
70%
58.7%56.2%
52.8% 53.4%
48.2% 46.8%
42.4%
37.0% 38.0%35.9%
27.6%24.8%
Children in Class 5 who...
Read a simple paragraph Do a simple division
6
Broad objectives of AP RESt(Andhra Pradesh Randomized
Evaluation Studies)
• Measure and document levels and trajectories of student learning• Imperative that policy be based on outcomes – very narrow window
for ‘demographic dividend’ (10-15 years at most)
Move the focus of education policy from outlays to outcomes
Focus systematically on teacher motivation and effectiveness
• Strong suggestive evidence that teachers are the main lever of education policy in improving learning outcomes
• In India, over 90% of non-capital spending goes to teacher salaries
Improve the empirical orientation of education policy making by:
• Rigorous evaluations of what works and relative effectiveness of different policy options
• Critical in a world of limited resources• Budgetary increases must translate to improved outcomes
1
2
3
7
APRESt is a multi-stakeholder partnership
• Government of Andhra Pradesh (GoAP)- Main client – project initiated at request of Principal Secretary, Education- All relevant letters of permission and administrative support- Financial contribution (cost of contract teachers; direct contribution)
• Azim Premji Foundation- Main counterpart to MoU with GoAP- Fully responsible for all aspects of project implementation, school communications, test
administration, and data collection8 Over 50 full time project staff and 750 part-time evaluators8 Continuous engagement with government8 Financial contribution as well
• World Bank- Technical support- Financial support (mainly through DFID)- Institutional continuity with government (6 secretaries in 6 years!)
• Educational Initiatives- Test design and scoring, diagnostic and gain reports to schools
Background / motivationDesign of APREStResultsPolicy implications
Agenda
9
How do you evaluate the impact of large social sector programs?
Let’s use mid-day meals as our example:What has been the impact of the mid-day meal program?
3: Compare to appropriate control
1: Define outcomes
2: Measure outcomes
• The control and treatment groups are similar in all other ways except for the program
• The difference in the outcome measure between the two is a measure of the impact of the mid-day meals program
• Often, even this first step is not undertaken• Let’s assume it is, and we define some outcomes, e.g. nutrition, attendance
and learning
• Is this a valid measure of the impact of the program?• No, because there are many other things that have
changed at the same time• Need a meaningful comparison group
2008
2003
Out
com
e
2008
2003
Out
com
e
Treatment Control
We use a randomised evaluation methodology: the “gold standard” in social science research
10
We tested five specific interventions, with a mix of input- and incentive-based
policies
Contract teachers (mix input-incentive)
Block grants (input only)
Performance pay ×2 (incentive only)
Feedback + monitoring(input only)
• Schools provided with additional teacher (on contract)
• Schools provided cash grants for student inputs
• Existing teachers provided with detailed feedback on students and subject to low-stakes monitoring
• Teachers eligible for bonuses based on improved student performance (either in own class or whole school)
MOTIVATION INTERVENTION
• One reason learning levels may be low is teachers don’t know how to help students
• Can better information help?
• Use of contract teachers is widespread, but highly controversial
• Are contract teachers effective?
• Significant amounts of money committed under RTE.
• What is the effectiveness of such spending?
• Teacher salaries are the largest component of education spending in India, but a poor predictor of outcomes
• Can linking pay to performance improve outcomes?
11
Location of study
• Andhra Pradesh (AP)- 5th most populous state in India
8 Population of 80 million - 23 Districts (2-4 million each)
• Close to All-India averages on many measures of human development
India APGross Enrollment (6-11) (%)
95.9 95.3
Literacy (%) 64.8 60.5
Teacher Absence (%) 25.2 25.3
Infant Mortality (per 1,000)
63 62
12
Randomization was stratified at the sub-district level
1. First, we chose 5 districts across three distinct ‘regions’ within AP2. Then, within each district we randomly chose 10 mandals (blocks)3. Then, within each mandal we randomly chose 12 schools4. Finally, of these, we assigned 2 to each treatment and 2 to control
13
Summary of Experimental Design
• Study conducted across a representative sample of 600 primary schools in AP
• Conduct baseline tests in these schools (June/July 05) [process pilots in 04-05]
• Stratified random allocation of 100 schools to each treatment (2 schools in each mandal to each treatment) (August 05)
• Monitor process variables over the course of the year via unannounced monthly tracking surveys (Sep 05 – Feb 06)
• Conduct 2 rounds of endline tests to assess the impact of various interventions on learning outcomes (March/April 06)
• Interview teachers after program but before outcomes are communicated to them (July 06)
• Continue interventions for measuring 2-year impact (July/August 06)
14
Review of Key Steps
1
2
Define the research question(s)! Why does it matter? What is the likely mechanism of impact?
Identify the evaluation methodology. Internal & external validity. Why did an experiment make sense in this case?
Fine tune the details: pilot and refine measurement instruments, power and sample size calculations, get feedback on design
Making it happen: Identify sites, implementation partners and structure, permissions, funding, key personnel
3
4
Conduct baseline (is this always necessary)? Do randomization, implement treatments, monitor process and outcomes
5
Data cleaning & management, analysis, writing papers/reports, presenting for feedback, refine, peer-review, disseminate
6
Agenda
Background / motivationDesign of APREStResults- Feedback + MonitoringPolicy implications
16
Teachers in feedback + monitoring schools appeared to perform better on measures of
teaching activityDifference between feedback + monitoring and comparison
schools on various measures of teaching activity
*Statistically significant difference
Reads fro
m textb
ook*
Active
ly teach
ing*
Active
blackboard
usage*
Assigned homework*
Clean & ord
erly cl
assroom
Asks questi
ons*
Provid
es help
Childre
n using te
xtbook*
Encoura
ges parti
cipation
0%
2%
4%
6%
8%
10%
12%
14%
16%
17
However, there was no difference in test scores between students in
treatment and comparison schools
Teaching Activity Student Learning0
0.02
0.04
0.06
0.08
0.1
0.12 0.107
0.00200000000000001
Effec
t Size
Outcomes for treatment schools relative to comparison schools
The lack of impact on test scores, despite enhanced teaching activity, suggests that teachers temporarily changed behavior when observed, but did not actively use the feedback reports in their teaching.
Agenda
Background / motivationDesign of APREStResults- Block grantPolicy implications
19
Schools spend most of the grant on non-durables – similar
pattern in both years
Year 1 Year 20
2,000
4,000
6,000
8,000
10,000
12,000
Textbooks Practice booksClassroom materials Child StationaryChild Durable Materials Sports Goods + Others
INR
• Nearly half the grant allocation was spent on child stationary (notebooks, slates, chalks)
• Close to another 40% was spent on classroom materials (such as charts, maps and toys) and practice books (such as workbooks, exercise books, etc)
• Small amounts were allocated to durable materials and sports goods
Average school annual grant allocation pattern
20
Impact of the program is lower after 2 years than after 1 year
Y1 Y2
-140
-120
-100
-80
-60
-40
-20
0
-40
-138
Change in HH spending in response to school spending
INR Unanticipated
Y1 Y20
0.01
0.02
0.03
0.04
0.05
0.06
0.07
0.08
0.09
0.0880000000000001
0.0490000000000001
Student test scores (normalized)
Effec
t Si
ze
Household spending fell significantly when the grant was anticipated
Student learning improved in the first year, but not the second
Anticipated
Agenda
Background / motivationDesign of APREStResults- Contract teacherPolicy implications
22
Contract teachers are significantly different to
regular teachers
Regular Teachers (RTs)
Contract Teachers (CTs)
Significantly different?
Proportion male 63.1% 31.8%
Average age 40.35 25.81
College degree or higher 84.3% 45.5% Formal teacher training degree or certificate 98.3% 9.1%
Received any training in last twelve months 93.5% 54.5%
From the same village 7.2% 81.8%
Distance to school (km) 11.9 1.1
Average salary (Rs./month) 8,698 1,250
CTs are hired by school committees and typically tend to be young females, with no formal teacher training qualification and from the same village as the school in
which they teach. CTs are paid significantly less than RTs.
23
There have been several concerns with respect to
contract teachers
Two main questions:
1) “What is the impact of an extra CT” hired in a “business as usual” way?2) How would reducing PTR with a CT compare with doing so with an RT?
3
1
2
• CTs are exploited as a result of being paid significantly less than RTs
• Using untrained and less qualified CT’s will not improve learning
• Decentralizing hiring will lead to local elite capture of the teacher post
24
Not only did extra CTs enhance student learning, there were
found to be no less effective than RTs
One year Two years0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
0.09
0.141
Effec
t Si
ze
Students in extra CT schools significantly outperform students
in comparison schools
Improving student learning from adding an extra teacher to school
LHS: effect sizes are statistically significant. RHS: difference is not statistically significant.
Extra CT Extra RT0
0.05
0.1
0.15
0.2
0.25
0.3
0.350.32
0.22
Effec
t Si
ze
25
Evidence also suggests that CTs are not exploited vis-à-vis the
market
Agenda
Background / motivationDesign of APREStResults- Performance pay
Policy implications
27
Performance Pay : Background and Research Questions
1. Can teacher performance-pay improve test scores?2. What, if any, are the negative consequences?3. How do group and individual incentives compare?4. How does teacher behaviour change in response to the bonuses?5. Do different types of teachers respond differentially to the bonuses?6. What is teacher opinion on performance pay?
• Lack of differentiation by performance is a major demotivator for teachers− Teachers with highest job satisfaction were most absent
• Program was designed to recognise and reward good performance
Motivation
Key questions addressed
28
Potential concerns with such a program are addressed pro-actively in the study design
Potential concern How addressed
Teaching to the test
• Test design is such that you cannot do well without deeper knowledge / understanding
• Less of a concern given extremely low levels of learning• Research shows that the process of taking a test can enhance learning
Threshold effects/ Neglecting weak kids
• Minimized by making bonus a function of average improvement of all students, so teachers are not incentivized to focus only on students near some target;
• Drop outs assigned low scores
Cheating / paper leaks• Testing done by independent teams from Azim Premji Foundation,
with no connection to the school
Reduction of intrinsic motivation
• Recognize that framing matters• Program framed in terms of recognition and reward for outstanding
teaching as opposed to accountability
29
Incentive schools perform better across the board
Outcomes for bonus schools relative to control schools
• Students in bonus schools do better for all major subgroups, including: all five grades (1-5); both subjects; all five project districts; and levels of question difficulty
• No significant difference by most student demographic variables, including household literacy, caste , gender, and baseline score
• Lack of differential treatment effects is an indicator of broad-based gains
Y1 on Y0 Y2 on Y00
0.05
0.1
0.15
0.2
0.25
0.153
0.217
Effec
t Si
ze
Overall, almost every child in an incentive school performed significantly better than comparable children in control schools
30
Incentives have broad-based impact
Y1 Y20
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
0.18
0.2
0.14
0.17
0.14
0.18
Mechanical Conceptual
Effec
t Si
ze
Normalized by mechanical / conceptual distribution in control schoolsAll figures statistically significant
Y1 Y20
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
0.18
0.11 0.11
0.14
0.18
Science Social studies
Effec
t Si
ze
Normalized endline scores grades 3-5 onlyAll figures statistically significant
True learning: Bonus students perform better on conceptual, not just mechanical questions
Spillovers: And they also perform better on non-incentive subjects
31
Individual incentives versus group incentives
• The theory on group- versus individual-level incentives is ambiguous
− On the one hand, group incentives may induce less effort due to free-riding
− On the other, if there are gains to cooperation, then it is possible that group incentives might yield better results
• Both group and individual incentive programs had significantly positive impacts on test scores in both years
• In the first year, they were equally effective, but in the second year, the individual incentives do significantly better
• Both were equally cost-effectiveY1 Y2
0
0.05
0.1
0.15
0.2
0.25
0.3
0.16
0.27
0.150.16
Individual Group
Effec
t Sei
ze
In theory…
Our findings…
32
Teacher absence did not change, but effort intensity went up
Incentive teachers did no better under observation…
… But report undertaking various forms of special preparation
Extra homework
Extra classwork
Extra classes Practice tests Focus on weaker children
0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
50%
42%
47%
16%
30%
20%20%23%
5%
14%
7%
Incentive Control
Absence Actively teaching0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
Incentive Control
33
Teacher opinion on performance pay is overwhelmingly positive
• It is easy to support a program when it only offers rewards and no penalties
• However, teachers also support performance pay under an overall wage-neutral expectation
Increase
d motiva
tion as a re
sult o
f PP
Favo
rable opinion of P
P
Govern
ment should co
nsider i
mplementing PP0%
10%20%30%40%50%60%70%80%90% 75%
85%
67%
Strong teacher support for performance pay
• Significant positive correlation between teacher performance and the extent of performance pay desired beforehand
− Suggests that effective teachers know who they are and there are likely to be sorting benefits from performance pay
Agenda
Background / motivationDesign of APREStResults- Summary
Policy implications
35
Overall, bonuses condition on performance had a larger impact
than unconditional provision of inputs…
• Pure incentives (individual and group bonuses) are most effective
• The mixed input-incentive program (contract teachers) is next most effective
• Pure inputs (block grants and diagnostic feedback) are least effective
Individual bonuses
Group bonuses
Contract teacher
Block grant Diagnostic feedback
0
0.05
0.1
0.15
0.2
0.25
0.3
0.160.15
0.09 0.09
0.00
0.27
0.160.14
0.05
Combined impact (Maths and Telugu)
Y1 on Y0 Y2 on Y0
Effec
t Si
ze
Agenda
Background / motivationDesign of APREStResultsPolicy implications
37
There are four key policy messages from our study
1
2
The education system has to focus on learning outcomes- You get what you measure, and if you want learning you have to measure it
Provide high-quality remedial instruction in early schooling years- Students start school at different levels and unless you set different bars or
extend number of school years, need remedial education
Focus on teacher performance measurement and management- Teachers are the highest potential lever at the policymaker’s disposal- System has to have a meaningful career ladder based on performance
Use contract teachers to focus on remedial education- Plenty of evidence to support the effectiveness of such programs- Provide credit for performance/service as a CT during RT selection
3
4
38
Bibliography
• Abhijit Banerjee et al: “Remedying Education: Evidence from Two Randomized Experiments in India”
• Michael Kremer, Karthik Muralidharan, Nazmul Chaudhury, Jeffrey Hammer, F. Halsey Rogers: “Teacher Absence in India: A Snapshot”
• Karthik Muralidharan, Michael Kremer: “Private Schools in Rural India: Some Facts”• Eric Hanushek and Ludger Woessman: “The Role of Education Quality for Economic Growth”• Jishnu Das and Tristan Zajonc: “India Shining and Bharat Drowning”• Jishnu Das, Stefan Dercon, James Habyarimana, Pramila Krishnan, Karthik Muralidharan and
Venkatesh Sundararaman: “School Inputs, Household Substitution, and Test Scores”• Karthik Muralidharan and Venkatesh Sundararaman: “The Impact of Diagnostic Feedback to
Teachers on Student Learning: Experimental Evidence from India”• Karthik Muralidharan and Venkatesh Sundararaman: “Contract Teachers: Experimental Evidence
from India”• Karthik Muralidharan and Venkatesh Sundararaman: “Teacher Performance Pay: Experimental
Evidence from India”• Karthik Muralidharan and Venkatesh Sundararaman: “Teacher Opinions on Performance Pay:
Evidence from India”