impact of a school district's science reform effort on the achievement and attitudes of third-...

20
JOURNAL OF RESEARCH IN SCIENCE TEACHING VOL. 41, NO. 8, PP. 771–790 (2004) Impact of a School District’s Science Reform Effort on the Achievement and Attitudes of Third- and Fourth-Grade Students James A. Shymansky, 1 Larry D. Yore, 2 John O. Anderson 2 1 Regional Institute for Science Education, Suite 7, RCEW Building, 8001 Natural Bridge Road, University of Missouri, St. Louis, Missouri 53121-4499 2 Faculty of Education, University of Victoria, Victoria, British Columbia, Canada V8W 3N4 Received 25 November 2003; Accepted 6 February 2004 Abstract: This article is about one school district’s effort to reform its elementary science curriculum through a program of professional development called Science, Parents, Activities and Literature (Science PALs). The differential exposure of the district’s K–6 teachers to Science PALs and differences in how well teachers implemented Science PALs-type inquiry strategies allowed us to conduct a quasi-experimental study of the impact of Science PALs on student achievement and attitudes. We measured achievement with an instrument based on items taken from the Third International Mathematics and Science Study (TIMSS; International Association for the Evaluation of Educational Achievement, 1997) and selected attitudes about science with the Student Perceptions of Classroom Climate (SPOCC; Yore et al., 1998), an instrument that we designed. Our analyses of student attitude scores as a function of years of teacher participation in Science PALs and supervisor’s rating of a teacher’s implementation of the project’s instructional approaches showed a significant overall positive impact on student attitudes toward school science. Student TIMSS scores on multiple-choice items or constructed-response items did not improve significantly when analyzed by the number of years a student’s teacher was involved in the Science PALs effort or by the supervisor’s rating of that implementation. We found no significant differences in attitude or achievement scores among students taught by a series of teachers rated high, medium, or low in quality of imple- mentation by the district’s science supervisor. We discuss possible explanations for the lack of clear and positive connections between Science PALs and student performance in light of the increased focus on accountability in reform projects. ß 2004 Wiley Periodicals, Inc. J Res Sci Teach 41: 771–790, 2004 Professional development of teachers in the areas of science and mathematics continues to receive strong support from the National Science Foundation (NSF) because of persisting evidence that large numbers of untrained and undertrained teachers still enter the classroom each year (National Commission on Teaching and America’s Future, 1996). The NSF support, Correspondence to: J.A. Shymansky; E-mail: [email protected] DOI 10.1002/tea.20025 Published online 25 August 2004 in Wiley InterScience (www.interscience.wiley.com). ß 2004 Wiley Periodicals, Inc.

Upload: james-a-shymansky

Post on 06-Jul-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

JOURNAL OF RESEARCH IN SCIENCE TEACHING VOL. 41, NO. 8, PP. 771–790 (2004)

Impact of a School District’s Science Reform Effort on the Achievementand Attitudes of Third- and Fourth-Grade Students

James A. Shymansky,1 Larry D. Yore,2 John O. Anderson2

1Regional Institute for Science Education, Suite 7, RCEW Building, 8001 Natural Bridge Road,

University of Missouri, St. Louis, Missouri 53121-4499

2Faculty of Education, University of Victoria, Victoria, British Columbia, Canada V8W 3N4

Received 25 November 2003; Accepted 6 February 2004

Abstract: This article is about one school district’s effort to reform its elementary science curriculum

through a program of professional development called Science, Parents, Activities and Literature (Science

PALs). The differential exposure of the district’s K–6 teachers to Science PALs and differences in how well

teachers implemented Science PALs-type inquiry strategies allowed us to conduct a quasi-experimental

study of the impact of Science PALs on student achievement and attitudes. We measured achievement with

an instrument based on items taken from the Third International Mathematics and Science Study (TIMSS;

International Association for the Evaluation of Educational Achievement, 1997) and selected attitudes

about science with the Student Perceptions of Classroom Climate (SPOCC; Yore et al., 1998), an instrument

that we designed. Our analyses of student attitude scores as a function of years of teacher participation

in Science PALs and supervisor’s rating of a teacher’s implementation of the project’s instructional

approaches showed a significant overall positive impact on student attitudes toward school science. Student

TIMSS scores on multiple-choice items or constructed-response items did not improve significantly when

analyzed by the number of years a student’s teacher was involved in the Science PALs effort or by the

supervisor’s rating of that implementation. We found no significant differences in attitude or achievement

scores among students taught by a series of teachers rated high, medium, or low in quality of imple-

mentation by the district’s science supervisor. We discuss possible explanations for the lack of clear and

positive connections between Science PALs and student performance in light of the increased focus on

accountability in reform projects. � 2004 Wiley Periodicals, Inc. J Res Sci Teach 41: 771–790, 2004

Professional development of teachers in the areas of science and mathematics continues to

receive strong support from the National Science Foundation (NSF) because of persisting

evidence that large numbers of untrained and undertrained teachers still enter the classroom each

year (National Commission on Teaching and America’s Future, 1996). The NSF support,

Correspondence to: J.A. Shymansky; E-mail: [email protected]

DOI 10.1002/tea.20025

Published online 25 August 2004 in Wiley InterScience (www.interscience.wiley.com).

� 2004 Wiley Periodicals, Inc.

however, is not without its critics. In their review of teacher-enhancement programs, Frechtling,

Sharp, Carey, and Vaden-Kierman (1995) noted that ‘‘most program evaluations either ignore

student achievement or provide unconvincing . . . anecdotal teacher reports of positive student

outcomes’’ (p. 18). There is strong pressure on those who provide professional development

to show evidence that the efforts are impacting student achievement (Education Commission of

the States, 1997). And as Loucks-Horsley and Matsumoto (1999) stated in their review of the

professional-development research, despite this mounting pressure, ‘‘there is relatively little

research addressing this connection’’ (p. 258).

Slavin (2003) explained this lack of ‘‘scientifically based research’’ on student achievement:

‘‘In addition to what a teacher does in the classroom, student achievement is affected by other

things such as socioeconomic status, school administrators’ and parents’ knowledge and

commitment, school district and state policies regarding standards and testing, . . . and the list goes

on’’ (p. 12). These complexities make the design and execution of true experimental studies that

can isolate and measure the specific effects of professional development on student achievement

both difficult and expensive. But the question of accountability is not likely to disappear just

because persuasive evidence is difficult and expensive to collect.

This article is about one school district’s effort to reform its elementary science curriculum

and the effect of that reform on student achievement and attitudes. We measured achievement

with an instrument based on items taken from the Third International Mathematics and Science

Study (TIMSS; International Association for the Evaluation of Educational Achievement,

1997) and selected attitudes about science with the Student Perceptions of Classroom Climate

(SPOCC; Yore et al., 1998), an instrument that we designed. The district already had an extensive

hands-on curriculum in which science kits were circulated among teachers during the school year.

The curriculum was supported by a half-time science supervisor and a full-time materials dis-

tribution and maintenance center employee. Each year, the science supervisor provided about

2 days of professional development that focused mainly on activity mechanics and little on any

theoretical underpinnings for inquiry-based learning. Students and teachers in the district seemed

to enjoy doing the science activities contained in the kits; however, despite the support of the

district and the enthusiasm for doing activities, most teachers admitted to having little under-

standing of the science ideas being explored in the kits or the rationale for doing inquiry-based

instruction and, more important, they felt they had no real evidence on how their science

instruction was impacting students. There was a strong sense among teachers, administrators, and

parents that their K–6 science program was good, but they also felt that students could be learning

even more.

Working with university experts and with special funding from the NSF and Howard Hughes

Medical Foundation, the district launched a reform effort called ‘‘Science Parents, Activities and

Literature’’ (Science PALs). The goal of Science PALs was simple: Increase the kit-based

program’s capacity to promote student conceptual growth and positive attitudes toward science

without dampening the enthusiasm that the kits seemed to generate among teachers, students, and

parents. Science PALs set out to provide a sustained program of professional development focused

on a constructivist theory-based, four-part teaching strategy to support inquiry science learning,

ways to involve parents in the science instruction, and strategies to connect science to other areas

in the curriculum. Though the summer workshop setting served as the primary vehicle in Science

PALs, the workshops were anything but traditional, utilizing ‘‘reform structures’’ described by

Garet, Porter, Desimone, Birman, and Yoon (2001). Special K–6 teacher ‘‘advocates’’ and Grades

7–12 ‘‘science partners’’ served as mentors for kit-focused collaborative groups. University

science and science education experts worked, for the most part, through the advocate–mentor

teams in supporting roles rather than as workshop instructors.

772 SHYMANSKY, YORE, AND ANDERSON

The need to initially and continually elicit student’s ideas and then challenge and extend those

ideas is the cornerstone of all varieties of constructivism. In Science PALs, two unique strategies

for eliciting student ideas were emphasized: (a) using children’s literature with story lines

containing embedded science ideas (both scientifically valid and invalid ideas) and (b) using

parents to help identify student ideas. During the funded phase of the reform (Years 1–4) effort,

73% of all elementary science teachers in the district participated in 1 to 3 years of Science PALs

professional development while 27% declined any involvement. All district teachers, however,

continued to use the same kits of science materials (discussed later). Moreover, we recognized that

both participating and nonparticipating teachers were using Science PALs-type inquiry strategies

to varying degrees, though none had received any formal instruction in constructivist learning

theory or its related teaching models. The differential exposure to Science PALs professional

development and differential utilization of Science PALs-type inquiry strategies combined with

the use of common science kits, however, provided an opportunity to conduct a quasi-

experimental study focusing on the following research questions:

� How does number of years of teacher exposure to Science PALs professional-

development activities impact student attitudes toward school science, attitudes about

careers in science, and achievement in science?

� Does level of teacher exposure to or quality of utilization of strategies learned in Science

PALs professional-development activities differentially affect boys’ and girls’ attitudes

toward school science, attitudes about careers in science, and achievement in science?

� How does the quality of teacher utilization of Science PALs professional-development

activities during the current year of school affect student attitudes toward school science,

attitudes about careers in science, and achievement in science?

� How does the cumulative quality of teacher utilization of Science PALs professional-

development activities over 3 years of school affect student attitudes toward school

science, attitudes about careers in science, and achievement in science?

This study adds to the professional development and reform literature in two important ways

identified by Loucks-Horsley and Matsumoto (1999): (a) It provides data linking a sustained

program of professional development (Science PALs) to student achievement and selected atti-

tudes, and (b) it addresses the problem of disentangling the complex professional development–

student performance connection by accounting for differences in exposure to and quality of

implementation of practices and strategies targeted in (Science PALs) professional development.

Background

Several documents provided guidance and underpinning for the structure and content of the

Science PALs reform effort: Science for All Americans (American Association for the

Advancement of Science, 1990), Benchmarks for Science Literacy (American Association for

the Advancement of Science, 1993), the National Science Education Standards (National

Research Council, 1996), and the Report of the National Commission on Teaching and America’s

Future (Darling-Hammond, 1996). Collectively, the documents provide a vision of what science

should be taught and how science should be taught.

Concerning ‘‘what science should be taught,’’ reform-document authors speak of science

literacy, being able to engage intelligently in public discourse and debate about issues that involve

science and technology, and sharing in the excitement, personal fulfillment, and positive feelings

that can come from understanding and learning about the natural world (National Research

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 773

Council, 1996). Authors of Science for All Americans, Benchmarks for Science Literacy, and the

National Science Education Standards contend that all students should be challenged to develop

an understanding of a few powerful, unifying science ideas and that inquiry should be the

foundation for learning science. Norris and Phillips (2003) described this duality of science

literacy as being composed of a fundamental sense involving the use of language in and about

science and a derived sense of understanding the big ideas of science.

On the issue of ‘‘how science should be taught,’’ reform-document authors speak of teachers

having theoretical and practical knowledge about science, learning, and science teaching

(National Research Council, 1996). This knowledge includes being able to plan, design, and

manage curriculum that promotes the procedures and the intellectual rigor of scientific inquiry and

to assess their own teaching and student learning. The vision consistently described in the

collective reform documents is of science teaching that engages all students in a quest for science

literacy involving the abilities, critical thinking, and habits-of-mind to construct understanding of

the big ideas and unifying concepts of science, and the communications to share with and persuade

other people about these ideas (Ford, Yore, & Anthony, 1997).

When these changing emphases in teaching are considered in the context of science as inquiry

and technology as design and the epistemology described by the nature of science as a way of

knowing and seeking the best possible explanations based on empirical standards, logical

arguments, and skepticism, it becomes apparent that an ‘‘interactive-constructivist’’ perspective is

supported by the National Science Education Standards (National Research Council, 1996). An

interactive-constructive model utilizes an ecology metaphor to illustrate learning in which

dynamic interactions of prior knowledge, concurrent sensory experiences, belief systems, and

other people in a sociocultural context lead to multiple interpretations that are verified against

evidence and privately integrated into a person’s knowledge network (Shymansky et al., 1997).

Knowledge, in this model, is perceived as individualistic conceptions of reality that have been

verified by the epistemic traditions of a community of learners.

Reform efforts of the past 15 years and the resultant ‘‘standards’’ documents have succeeded

in defining what and how science should be taught, but the science education community

continues to struggle with programs of professional development that will establish a clear link

to student outcomes. Loucks-Horsley and Matsumoto (1999) proposed a model of ‘‘influences

on the relationship between professional development and student learning’’ (p. 260) that iden-

tifies school, cultural, parental, policy, and leadership factors embedded in state and national

contexts as critical elements in the teaching–learning equation. To this equation we would add

‘‘international’’ as a context factor, as evidenced by the worldwide interest generated by the

TIMSS results.

Compounding the problem of doing research that directly connects professional development

to student achievement is the fact that effects of professional development on classroom practice

do not often manifest themselves immediately. Shields, Marsh, and Adelman (1998) and Weiss,

Montgomery, Ridgway, and Bond (1998) suggested that the depth of teacher change is directly

related to the duration of professional development efforts. Supovitz and Turner (2000) found that

teachers experiencing less than 40 hr of professional development often reported inquiry-based

practices and investigative classroom cultures that were less positive than teachers receiving no

professional development at all. They found that it was not until teachers received more than about

80 hr of professional development did their reported inquiry practices and investigative culture

significantly improve. Assuming that it would take an extended time for students to adapt to and

resonate with changes in classroom practice and culture, short-term research studies seeking to

establish a link between improved inquiry practices and student achievement would almost

certainly fail to reveal the link.

774 SHYMANSKY, YORE, AND ANDERSON

Theoretical Framework

The teacher-enhancement approach utilized in Science PALs was based on a situated learning

model of professional development relying heavily on authentic teacher work and professional

mentorship (Lavoie & Roth, 2001). The authentic teacher work provided opportunity and context

for preparing, adapting, teaching, reflecting, and revising a specific science unit associated with

the participating teacher’s professional responsibility and grade-level assignment. A common

difficulty with changing teachers’ vision of effective science teaching and facilitating the

associated practice in their classrooms is their lack of direct experience with the desired approach,

awareness of the instructional resources, and pedagogical-content knowledge associated with the

science unit (Smith, 1999; Zembal, Starr, & Krajcik, 1999). Science PALs addressed these

problems by using the desired teaching approach—interactive-constructivist science inquiry

teaching—as the instructional approach for the professional-development activities. By using the

science kit to be taught as the conceptual environment and physical context for grade-specific

activities and the prior knowledge, curricular priorities, and pedagogical strengths of the generalist

elementary teachers as the starting point for instruction and for selecting project goals, the

fundamental and derived senses of science literacy, interactive-constructivist teaching common to

language arts, mathematics, and social studies reforms, and cross-curricular connections were

woven prominently into all aspects of Science PALs.

Frequently in professional-development projects, teachers are introduced to innovations and

concepts but are not given experience in how to incorporate and apply these new ideas to their

elementary classrooms. The transfer problems were reduced in Science PALS by establishing a

mentorship among a participating teachers, an expert pedagogical lead elementary teacher, and

a science-content-expert secondary science teacher or project staff member. The mentorship

provided a supportive climate for the implementation of the science unit and addressed day-to-

day logistical problems, content difficulties, teaching issues, assessment concerns, and cross-

curricular connections in the context of real classroom science teaching as the problems arose,

providing just-in-time professional development on an individualized basis. The general pattern

of the mentorship involved preparing the science unit during a summer workshop, supervisory

support during the teaching of the unit in the late fall or early spring, and guidance during the

reflection on the teaching experience and the revision of the science unit based on the evaluation

of the science instruction experience in late spring. This cycle was repeated in each year of

participation with a different science unit.

Setting

The Science PALs project was funded as a teacher-enhancement partnership between a level-

one research university and a single school district. In the 5 years of the project (Years 1–4 were

externally funded, and Year 5 was supported by the school district.), 238 K–6 teachers and 15

Grades 7–12 teachers from the school district participated in summer and school-year inservice

activities. In addition to the teachers, approximately 3,400 parents participated in special training

sessions designed to integrate them into the K–6 science program. Across the 5 years of Science

PALs, teachers received an average of 110 hr of professional development designed to enhance

their science content and pedagogical-content knowledge.

At the beginning of the Science PALs Project, the participating school district had a kit-based

elementary school science program that contained exemplary NSF-supported materials, such as

the Full Option Science System (FOSS; Carolina Biological), Science and Technology for

Children (STC; National Science Resources Center), and INSIGHTS (Educational Development

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 775

Center). The kits were delivered to the teacher on a rotating basis with minimal professional

development.

The first year of the Science PALs Project began with 16 elementary school teachers

designated as science advocates—one from each elementary school in the district. These teachers

were selected in part for their willingness to serve as science leaders in their schools as well as their

interest in participating in the teacher enhancement project. The science advocates began the

project by attending a special problem-centered summer workshop similar to the Focus on

Children’s Ideas in Science project (FOCIS; Shymansky et al., 1993). The FOCIS project utilized

middle school science teachers’ interest in children’s misconceptions and their sincere desire to

promote conceptual change in their students as an authentic problem focus for the summer

workshop and multiyear collaboration with a science-content mentor. The focus on children’s

ideas served as the ‘‘straw man’’ since enhancement of the teachers’ science-content knowledge

and pedagogical-content knowledge were the actual goals of the project. The FOCIS project was

effective in achieving meaningful science and science pedagogical learning among the middle

school teachers on science topics of their choice.

The Science PALs workshops in that first year were designed to help teachers explore selected

curriculum units (NSF-supported versions) and activities using students’ ideas again as the ‘‘straw

man.’’ The workshops matched science-content consultants with small groups of advocates to

explore the science concepts in specific units and to promote an interactive-constructivist teaching

strategy among the teachers. In the workshop and the ensuing school-year inservice sessions,

various strategies were employed to have the science advocates articulate their alternative

frameworks for the science concepts related to the school district’s science units, and additional

extension activities to challenge these understandings were implemented. The ultimate objective

was to address the advocates’ personal misconceptions and have them rethink their under-

standings to develop more accurate scientific conceptions critical to teaching the unit. These

science advocates then supplemented the specific FOSS, STC, and INSIGHTS units with under-

standings of the science reforms, misconception literature, additional science activities, children’s

literature, and interdisciplinary connections to produce teacher resource binders (TRBs) for each

science unit.

The 16 science advocates field tested the enriched units (field-test versions) in their own

classrooms in the fall and attended three 1-day workshops during and after teaching the units.

The field-test experiences were shared with colleagues and science-content consultants to

further clarify science understandings and explore other activities to challenge student mis-

conceptions uncovered while teaching the unit. These insights were used to revise the TRBs for

each science unit (final version) and to develop home science-activity bags. The activity bags

consisted of a children’s literature selection related to the central science topic of the unit, a

simple science equipment, and a parent interview and activity guide. Parents and children read

the story together and explored various science challenges in the story as they occurred, using

the activity guide and equipment provided in the activity bags. Feedback from parents was

used to make adjustments to the science instruction that more accurately reflected their students’

prior knowledge. Parent orientation meetings were developed to introduce parents to the Science

PALs project and activity bags. A Science PALs project newsletter was published to keep the

parent community informed about the project’s progress and to maintain contact with students’

families.

The cascading leadership design of Science PALs involved a progression of participating

teachers and an evolution of their specific leadership roles. Early months of the project focused on

recruiting and working with the 16 science advocates. Twelve of the original advocates continued

to serve in the advocate capacity after external funding for the project ceased. Year-2 activities

776 SHYMANSKY, YORE, AND ANDERSON

focused on recruiting and working with 24 lead teachers to complement and share leadership

responsibilities with the advocates in a school. Year-3 activities focused on 37 additional teachers

recruited as a cohort to join the 40 advocates and lead teachers. In the final year (Year 4) of funding,

the advocate, lead teacher, and teacher group consisted of 140 teachers, bringing the total active

K–6 teacher cohort in PALs across the 4 funded years of the project to 195 different teachers. This

number represents about 73% of the 267 K–6 teachers in the school district and about 90% of the

216 who taught science on a regular basis. The cascading leadership model used meant that

advocates and lead teachers progressively assumed greater responsibility for the summer

workshops, professional development activities, and science decisions. Table 1 shows the layout

of a typical year of professional development in Science PALs.

Research Design

The study that we conducted to determine the impact of Science PALs on student performance

is best described as a posttest only, nonrandomized, comparative-group quasi-experiment. Years

of teacher participation or exposure to the Science PALs professional-development activities and

the level of teachers’ utilization of Science PALs strategies were used as the main independent

variables in our study. The exposure variable represents nothing more than amount of teacher

involvement in the summer workshop and school-year inservice activities in a 12-month period. It

was chosen for study because of its prevalence in the reform literature. The utilization variable

represents the effectiveness with which a teacher used student ideas and selected children’s

literature and parent partners to plan and implement instructional activities promoted in the

professional development/reform effort. The utilization variable was chosen for study because it

simultaneously addresses the problems of fidelity (participating teachers not embracing and

implementing the selected reform ideas and strategies) and of history and contamination

(nonparticipating teachers using the same selected reform ideas and strategies that were learned

previously or indirectly from participating teachers).

The science supervisor’s ratings of the teachers’ utilization of Science PALs strategies

(described later) allowed us to examine utilization by the students’ current teacher as well as

Table 1

A 1-year cycle of Science Pals professional development activities

School Year Activity Focus

May 2-day retreat of science advocates, project staff, and district administrators toclarify project goals and critical features

June–July Summer workshop—select and explore target science; develop cross-curricularconnections, parent bags, and TRBs

August Share unit plans and organize implementation; plan parent orientation meetings andin-service schedules

September Parent orientation: Conduct parent meetings, distribute activity bags for first fallunit, collect parent activity data

October–December Explore additional cross-curricular connections and alternative assessments; workon new science units for fall and spring

January–April Follow-up on parent activity data, assessment ideas, children’s literature, sciencecontent, applications of technology, etc.

May Science advocates’ retreat—leadership responsibilitiesJune–July Introduce new teachers and develop new science units

Note. TRBs¼ teacher resource binders.

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 777

another variable: the cumulative effect of Science PALs on student performance over a 3-year

period. The cumulative-effect factor was determined by combining the science supervisor ratings

of a given student’s current-year teacher and his or her teachers of the previous 2 years.

Selected student attitudes toward school science and student achievement were chosen as the

dependent variables for study. The attitude measures that we report focused on students’ percep-

tions of the nature of science and interest in a possible career in science. Student achievement was

measured with multiple forms developed from released science items from the TIMSS for Grades

3/4 (International Association for the Evaluation of Educational Achievement, 1997). Student

gender was added as a secondary independent variable to examine any differential effects of the

Science PALs strategies on girls and boys.

Instruments

Teacher Ratings

The school-district science supervisor determined the degree to which teachers utilized

interactive-constructivist teaching methods generally and Science PALs strategies specifically in

their science teaching. All K–6 teachers in the district were rated regardless of how or where these

attributes were acquired. The science supervisor rated each teacher (Science PALs and Non-

Science PALs) on a five-dimension rubric developed to assess the unique features of the Science

PALs approach and the global features of an interactive-constructivist approach. The rubric

required the supervisor to assess the degree of compliance on each dimension (1¼ very weak,

2¼weak, 3¼ satisfactory, 4¼ strong, 5¼ very strong). Thus, an individual teacher’s Science

PALs rating could range from 1 to 5 on the overall rating (rubric Item 5) or 4 to 20 on the sum of the

specific Science PALs strategies (rubric Items 1–4) as shown in Table 2.

The substantive validity, external validity, structural validity, and reliability of the rubric were

established by a series of inquiries (Messick, 1989). First, the substantive validity was considered

by insuring that the dimensions in the rating scale matched the theoretical and practical

assumptions and goals of the project and that the Science PALs features were associated with

interactive-constructivist science teaching. The results of Dimensions 1 to 4 (Science PALs

features) were correlated with Dimension 5 (global indicator of interactive-constructivist

teaching) as individual pairs (Correlations of the Year-3 ratings were 0.68–0.95, and correlations

of the Year-4 ratings were 0.78–0.95.) and as a unified cluster (Correlation of the Year-3 ratings

was 0.94, and correlation of the Year-4 ratings was 0.96.) The external validity was explored by

t-test analyses of 128 teachers’ ratings over a complete Science PALs cycle. Comparisons of

the clustered dimensions (Science PALs approach) and the global dimension (interactive-

Table 2

Science PALs rating rubric

Rating Science PALs Teaching Practice

1–5 Use of strategies to access and utilize information on student ideas in planning instruction1–5 Use of strategies to challenge student ideas and to have them reflect on and integrate those ideas

into their thinking1–5 Use of strategies that routinely and continuously use children’s literature and personal

experiences as context for learning science1–5 Use of strategies that promote ongoing, substantive parent involvement in the science

instruction1–5 Overall rating as a constructivist teacher, as defined in the goals of the Science PALs project

778 SHYMANSKY, YORE, AND ANDERSON

constructivist approach) revealed significant predicted improvements in the ratings, clustered

dimension: t¼ 5.0, p< 0.001, global dimension: t¼ 4.2, p< 0.001.

The structural validity was checked by a series of factor analyses utilizing one- and two-factor

solutions. Both solutions were supportive (component loadings of 0.80–0.97), indicating that the

five dimensions were acting as a unified factor or that the first four dimensions could be considered

a factor, with Dimension 5 acting as a separate, single-item factor. Reliability was supported by the

correlation results and by a rate-rerate analysis of a random sample of 12 teachers. Measures

of internal consistency for the four-dimension cluster were 0.96 (Year-3 ratings) and 0.97 (Year-4

ratings). The science supervisor’s rating consistency was explored by asking her to rate

216 teachers who teach science in the 16 elementary schools. One week later, a 5% random sample

of 12 teachers was rerated by the science supervisor on the ploy that their rating results were lost.

The correlation of these paired ratings was 0.95.

Student Achievement

The released items from the TIMSS for Grades 3/4 were collected into six tests to assess

science achievement (International Association for the Evaluation of Educational Achievement,

1997). Each form of the science achievement test consisted of 32 items: 25 multiple-choice items

and 7 constructed-response items (short-answer and extended-response) that were categorically

aligned by a representative group of school-district teachers who judged the items as being

relevant to the science topics addressed in the district’s K–6 science framework (Webb, 1999). All

forms of the test had nine common multiple-choice items and three common constructed-response

items. Various combinations of the forms had other common constructed-response items. The

validity was assumed to be reasonable since the items were selected from a pool of items used in a

major international assessment project (International Association for the Evaluation of Educa-

tional Achievement, 1997). The reliability of the six forms was measured by the internal

consistencies of each form. The consistency values ranged from 0.73 to 0.77.

The six forms were randomly distributed across students in each classroom tested so that

some students were assessed by each of the six forms of the TIMSS test in every teacher’s

classroom. Statistical analysis of the results on the common items across the six subgroups taking

the various forms of the achievement test revealed no significant differences. Based on this

evidence, it was assumed that the test forms were not biased or of different difficulties and that the

results across all forms of the test could be combined using standard scores (Z¼ 50þ 10z score,

SD¼ 10SDz) to adjust results for the uncommon composition. This allowed us to examine the

Z scores for the multiple-choice and constructed-response subscales (short-answer and extended-

response) as measures of lower level and higher level knowledge to further inform the research

questions.

Multiple-choice items on the TIMSS were optically scanned and scored against a master key.

Constructed-response items were hand scored by an independent expert rater using scoring rubrics

developed by the TIMSS project staff. The full set of one constructed-response item was scored

before moving to the next item. A random sample of 10 responses per 100 responses for each of the

constructed-response items was double scored to monitor scorer reliability. Rater agreement for

the constructed-response items across all items averaged 0.95.

Student Attitudes

An instrument containing Likert-type items designed to assess students’ agreement, absence

of opinion, or disagreement on a five-position response scale to statements describing students’

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 779

affective stance toward science and science careers was used to measure students’ perceptions and

attitudes (Dunkhase, Hand, Shymansky, & Yore, 1997; Shymansky, Yore, Dunkhase, & Hand,

1998; Yore et al., 1998). Validity and reliability of this attitude and awareness survey was explored

using expert analysis, factor analyses, and internal consistency. The construct validity of the

instrument was investigated by having experts examine the items selected from established item

pools or constructed by science educators for the project. Factor analyses were conducted on the

Year-2 results from 722 students in Grades 3 and 4 taking the original pool of items. The final

version of the instrument retained only those items that had a > 0.30 loading value. The items

retained were analyzed again to insure that the resulting factors matched the design features of the

instrument. The two factors retained were the attitudes toward school science (nine items with a

total score range of 9–45) and awareness of science careers (three items with a total score range of

3–15). Internal consistency was 0.79 for the combined scales, 0.74 for the attitudes toward school

science subscale, and 0.72 for the science careers subscale. The same instrument administered to

456 students the next year revealed internal consistencies of 0.75, 0.74, and 0.69 for the respective

combined scales and the two subscales on which our study focused.

Results

The project staff and the school district teachers, parents, and administrators were interested

in learning how the trait variable of years of Science PALs professional development or exposure

impacted student attitudes and achievement in science. Table 3 contains descriptive statistics for

the analyses exploring the effect of years of professional development on student attitudes and

achievement of students in Grades 3 and 4 without regard to student gender.

Of interest in Table 3 are (a) the decline in student attitudes toward school science coupled

with an increased interest in science careers among students in classrooms of teachers with 2 years

of PALs training and (b) the drop in performance on the constructed-response portion of the

TIMSS test among students in classes of teachers with 1 to 2 years of PALs training. On all

measures, the student scores showed a tendency to increase in some cases, but also to decrease in

other cases as a function of teacher exposure to Science PALs professional development. Analyses

of variance and pairwise comparisons of the student scores showed that these undulations were

significant in several instances, as shown in Table 4.

We also analyzed the data to determine any differential effects of years of Science PALs

professional development on girls’ and boys’ achievement and attitudes. Boys significantly

(p� 0.05) outscored girls on both the TIMSS multiple-choice items and the total TIMSS test

regardless of the number of years of Science PALs professional development experienced by their

Table 3

Summary statistics for Grade 3/4 student attitudes and achievement (Z score) as a function of teacher years

of Science PALs professional development

Years

Science Careers TIMSS-MC TIMSS-CR TIMSS-TS

n M SD n M SD n M SD n M SD n M SD

0 154 27.0 3.0 159 7.9 2.2 241 51.3 9.6 225 51.8 8.9 208 51.3 9.61 185 27.7 2.6 185 7.9 2.0 308 50.2 9.7 294 49.9 10.2 270 49.4 9.92 98 27.3 3.2 98 8.6 2.4 112 50.1 10.5 123 47.9 11.0 97 50.0 10.43 139 27.9 3.0 139 7.6 2.5 203 49.5 10.3 189 50.7 9.3 172 49.6 10.1

Note. TIMSS-MC¼Third International Mathematics and Science Study–Multiple Choice; CR¼Constructed Response;

TS¼Teacher Scores.

780 SHYMANSKY, YORE, AND ANDERSON

teacher. There were no significant (p> 0.05) differences between boys and girls on the TIMSS

constructed-response items or either of the attitude scales. Descriptive statistics for the student

gender analyses are presented in Table 5.

Analysis of the differential effects of a given teacher’s utilization of the Science PALs

teaching approaches on his or her students’ attitudes and achievement involved the science

supervisor’s ratings. Two ‘‘quality of implementation’’ scores were created for each teacher in

each of the last 3 years of the project: (a) a ‘‘utilization of Science PALs strategies’’ score

represented by the combined science supervisor’s ratings of a teacher’s use of student ideas, level

of parental involvement in the science classroom, and use of children’s literature in the science

instruction (Items 1–4, Table 2) and (b) a ‘‘utilization of constructivist strategies’’ score repre-

sented by the science supervisor’s rating of a teacher’s overall use of interactive-constructivist

strategies (Item 5, Table 2). Composite ratings of the Science PALs strategies of 16 or greater were

considered ‘‘high,’’ composite ratings of 9 to 15 ‘‘medium,’’ and composite ratings of 8 or less

‘‘low’’ for analyses (Table 6). Overall ratings of the interactive-constructivist strategies of 4 or

5 were considered ‘‘high’’ ratings, of 3 ‘‘medium,’’ and ratings of 2 and 1 ‘‘low’’ for analyses

(Table 7).

Analyses of variance were conducted on the students’ attitudes, awareness, and achievement

for the supervisor’s composite and overall ratings of their current teacher’s science instruction.

These analyses revealed significant main effects for the supervisor’s composite ratings of

implemented Science PALs features, F¼ 7.20, df¼ 2, 603, p< 0.001, and for supervisor’s overall

rating of constructivist teaching, F¼ 9.21, df¼ 2, 603, p< 0.001, on students’ attitude toward

school science. No other significant main effects were found for attitudes toward science careers,

TIMMS multiple-choice items, TIMSS constructed-response items, and TIMSS total test scores.

Pairwise comparisons of the significant main effects for attitudes toward school science revealed

significant (p< 0.05) differences between all pairs for the ratings on ‘‘current teacher’s utilization

of Science PALs strategies,’’ Low<Medium (effect size, h¼ 0.20), Low<High (h¼ 0.56), and

Medium<High (h¼ 0.36); for ratings of ‘‘current teacher’s utilization of overall constructivist

strategies,’’ Low<Medium (h¼ 0.27), Low<High (h¼ 0.54), and Medium<High (h¼ 0.27).

The last analysis that we performed examined the long-term impact of Science PALs

professional development on student achievement and attitudes. We were able to study this impact

Table 4

One-way analyses of variance and the significant pairwise comparisons of Grade 3/4 student achievement

and attitudes as a function of years of Science PALs professional development

Student MeasureMain Effect

p Value Pairwise Comparisons

Attitude–School Science 0.037 Year 1>Year 0; Years 3>Year 0Attitude–Science Careers 0.013 Year 2>Years 0, 1, 3TIMSS-CR (Constructed Response) 0.004 Year 0>Years 1, 2; Years 3>Years 2

Table 5

Summary of Z-score statistics for Grade 3/4 boys’ and girls’ attitudes and achievement across all years of

Science PALs professional development

Gender

Science Careers TIMSS-MC TIMSS-CR TIMSS-TS

n M SD n M SD n M SD n M SD n M SD

Male 293 27.7 3.0 293 8.0 2.5 465 51.3 9.9 443 50.2 9.9 399 51.1 10.0Female 290 27.3 2.9 290 7.9 2.0 448 48.9 9.7 434 50.0 9.9 388 48.9 9.8

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 781

by summing the science supervisor’s ratings of a given student’s current-year teacher and the

student’s teachers from the previous 2 years—for Grade-3 students, their current-year teacher and

their Grade-1 and Grade-2 teacher; for Grade-4 students, their current-year teacher and their

Grade-2 and Grade-3 teacher. The supervisor’s ratings of three teachers were combined to yield a

composite teacher-utilization score for each student on the Science PALs subscale [the sum of the

ratings of Items 1–4 on the Science PALs rating rubric (see Table 2) and on the ‘‘overall

constructivist’’ rating (Item 5 on the Science PALs rating rubric)]. The 3-year combined score

could range from 15 to 60 on the Science PALs subscale and from 3 to 15 on the ‘‘overall

constructivist’’ item.

Students taught by a sequence of teachers across a 3-year period with a combined Science

PALs utilization rating of 46 to 60 were considered as having a ‘‘high’’ quality Science PALs

learning experience; students taught by teachers with a combined utilization rating of 30 to 45

were considered as having an experience of ‘‘medium’’ quality; and students taught by teachers

with a combined utilization rating of 12 to 29 were considered as having an experience of ‘‘low’’

quality. In similar fashion, students taught by a sequence of teachers across a 3-year period with a

combined rating of 12 to 15 were considered as having an ‘‘overall constructivist’’ experience of

‘‘high’’ quality; 7 to 11 as ‘‘medium’’ quality; and 3 to 6 as ‘‘low’’ quality.

When we analyzed the student achievement and attitude data as a function of the quality of

the Science PALs teaching experienced by students across 3 years and the quality of the overall

constructivist teaching experienced by students across 3 years, we found no significant (p> 0.05)

differences. The descriptive statistics for the quality of Science PALs and the quality of the overall

constructivist teaching experienced by students across 3 years are presented in Tables 8 and 9,

respectively.

Discussion

Ongoing professional development of teachers is an accepted practice in the United States

that is supported by myriad statistics showing that many teachers are underprepared for their

Table 6

Summary statistics for Grade 3/4 student attitudes and achievement (Z score) as a function of the quality of

their current teacher’s utilization of Science PALs teaching strategies

Rating

Science Careers TIMSS-MC TIMSS-CR TIMSS-TS

n M SD n M SD n M SD n M SD n M SD

Low 36 26.7 2.5 36 8.7 1.7 61 51.3 9.3 66 52.9 6.9 57 51.0 9.5Medium 377 27.2 3.1 377 7.9 2.4 541 49.9 9.7 521 49.9 10.3 466 49.7 9.9High 193 28.1 2.8 193 7.8 2.1 261 51.0 10.4 243 50.6 9.5 223 51.3 10.1

Table 7

Summary statistics for Grade 3/4 student attitudes and achievement (Z score) as a function of the quality of

their current teacher’s utilization of the overall constructivist approach

Ratings

Science Careers TIMSS-MC TIMSS-CR TIMSS-TS

n M SD n M SD n M SD n M SD n M SD

Low 113 26.6 2.6 113 8.1 2.2 204 51.0 9.2 203 51.1 9.2 181 50.8 9.4Medium 253 27.3 3.2 253 7.8 2.3 341 49.8 9.9 333 49.6 10.6 296 49.4 10.1High 240 28.0 2.8 240 7.9 2.3 318 50.6 10.3 294 50.6 9.5 269 50.9 10.1

782 SHYMANSKY, YORE, AND ANDERSON

teaching assignments (National Commission on Teaching and America’s Future, 1996). Until

recently however, it was presumed that time and money invested in professional development

would lead to enhanced teacher quality and would eventually lead to improved student learning.

This link between professional development and student learning has gone essentially untested in

the area of science education for most of the last 40 years for two reasons beyond the cost and

difficulty of conducting such research: Professional-development efforts have rarely been applied

systemically and rarely over an extended period of time. The systemic factor is critical because

student learning and subsequent performance on high-stakes tests cannot be linked to any one

teacher, especially in the elementary school years. Elementary schools are complex systems of

interacting subsystems of students, teachers, and other factors such as parent involvement,

financial support for schools and programs, and administrative policies (Guskey & Sparks, 1996).

The extended time factor is critical because teacher behaviors learned and practiced over several

years and deficiencies in content and pedagogical-content knowledge cannot be undone or

meaningfully learned in a short period of time nor should the effects of improved teacher

background or teaching practices be expected to take hold immediately (Supovitz & Turner,

2000).

In our study of the effectiveness of Science PALs as a professional-development effort, we

were able to meet some of the challenges of trying to link the effort to student outcomes. Science

PALs was systemic. It targeted an entire teaching staff, eventually involving about 90% of the K–6

teachers who teach science. It also involved parents. Teachers developed special ‘‘take-home’’

activity bags and conducted yearly information and training sessions for parents—sessions that

enjoyed 90% parent participation (Shymansky, Yore, & Hand, 2000). The effort was sustained.

Teachers had the opportunity to participate in 40-hr summer workshops and 20 hr of professional

development during the school year for 3 consecutive years. System-wide, the mean number of

hours of professional development for the K–6 teaching staff was 1 full year, or 60 hr. The school

district enjoyed significant support for instructional materials. Science kits used by the K–6

teachers were housed in a separate facility, and prepared and maintained by a paraprofessional

staff. Finally, school-building principals, the superintendent, and the school-district board all

Table 8

Summary statistics for Grade 3/4 student attitudes and achievement (Z score) as a function of the quality of

Science PALs teaching experienced by students across 3 years

Quality

Science Careers TIMSS-MC TIMSS-CR TIMSS-TS

n M SD n M SD n M SD n M SD n M SD

High 58 27.7 2.9 58 7.7 2.6 78 51.0 10.8 74 51.8 9.3 64 51.5 10.6Medium 458 27.5 2.8 458 8.0 2.2 431 51.1 9.6 422 51.1 9.4 377 51.3 9.5Low 34 27.9 3.3 34 8.0 2.7 94 50.7 9.1 89 52.5 8.3 81 50.7 8.9

Table 9

Summary statistics for Grade 3/4 student attitudes and achievement (Z score) as a function of the quality of

the overall constructivist teaching experienced by students across 3 years

Quality

Science Careers TIMSS-MC TIMSS-CR TIMSS-TS

n M SD n M SD n M SD n M SD n M SD

High 239 27.5 2.8 239 7.7 2.4 266 57.3 9.9 252 51.2 9.1 227 51.6 9.7Medium 274 27.5 2.9 274 8.0 2.2 274 50.7 9.6 269 51.2 9.5 239 50.8 9.7Low 37 28.0 2.7 37 8.3 2.4 63 51.3 8.5 64 52.2 8.7 56 51.4 7.9

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 783

supported science as a curricular area and the Science PALs effort specifically carrying on with the

interactive-constructivist teaching approach, curriculum resource center, and the summer

workshops beyond the duration of the external funding. In short, it is reasonable to say that the

Science PALs professional-development effort appears to have been implemented in such a way

and under such conditions that our research questions about the impact of the effort on student

attitudes and achievement could be answered. So what did the analyses of our student attitude and

achievement data reveal?

Analyses of student attitude scores as a function of how many years their teachers participated

in the Science PALs professional development showed a significant overall positive effect in terms

of attitude toward school science (Year 3>Year 0), but a decline in attitudes about careers in

science after a significant spike in scores for students taught by teachers with 2 years of Science

PALs participation (Year 2>Years 0, 1, 3). These data suggest that students may have been

reacting positively to their teachers’ enhanced content knowledge, pedagogical-content knowl-

edge, and strategies learned in the Science PALs professional-development effort.

Analyses of student achievement data, however, tell a different story. Student scores on the

multiple-choice items of the TIMSS did not improve as a function of the number of years their

teachers were involved in the Science PALs effort and even declined slightly, though statistically

insignificantly. The same pattern was observed in the analysis of the total TIMSS score. On the

constructed-response items in the TIMSS, student scores actually declined in classrooms taught

by teachers participating in 1 to 2 years of Science PALs professional development before they

started to rebound (Year 0>Years 1, 2; Year 3>Year 2). This pattern is very similar to the one

described in Supovitz and Turner’s (2000) study. Their analyses showed that teachers’ reported

inquiry-based practice and investigative classroom climate declined initially and did not start

rebounding until the teachers had participated in about 80 hr of professional development.

Two years of Science PALs participation equals about 120 hr of professional development,

seemingly enough time for changes in classroom practices to take effect, but apparently not

enough time for those changes to show impact on student performance on multiple-choice or

constructed-response assessment items. The lack of improvement in the student scores on

multiple-choice items is not too great a concern, but the lack of improvement in the student-

generated response items is disappointing since the interactive-constructivist approach stressed

depth of understanding, which is the central focus of the higher level constructed-response items

and which may not be detected by the lower level multiple-choice items.

The results of analyses of student attitude and achievement as a function of years of

participation of teachers in Science PALs professional development were not surprising. There

were no significant differences in boys’ and girls’ scores on the attitude scales, but boys scored

significantly higher on the TIMSS multiple-choice items and the total TIMSS score regardless of

years of teacher participation in Science PALs. The nonsignificant difference in scores on the

TIMSS constructed-response items can be viewed as a positive gender result for the Science PALs

effort in light of the multiple-choice and total-score results.

The supervisor ratings of all teachers in the school district presented an opportunity for us to

link teachers individually and collectively to their students’ performance. Our idea was to look

specifically at how the ‘‘quality of implementation’’ of content knowledge, pedagogical-content

knowledge, and interactive-constructivist practices presented in the Science PALs professional-

development effort (quality as defined by the supervisor rating scale shown in Table 2) was related

to students’ attitudes and achievement. The ‘‘quality of implementation’’ allowed us to go beyond

an examination of the effect of ‘‘seat time’’ or mere ‘‘participation’’ in professional development

and to look at how teachers were actually using knowledge learned through Science PALs in their

classrooms. The extended term of our project further allowed us to study the impact that a series of

784 SHYMANSKY, YORE, AND ANDERSON

teachers might have on student attitudes and achievement: Would students who are taught by a

series of teachers rated ‘‘high’’ in quality of implementation by the district supervisor exhibit more

positive attitudes and higher achievement scores than students taught by a series of teachers

rated ‘‘medium’’ or ‘‘low’’ in quality of implementation? Unfortunately, we found no significant

differences.

The results of the students’ current teacher’s implementation of the Science PALs features

and the overall constructivist approach revealed the same significant main effects, consistencies,

and variations as described for years of Science PALs experiences. The analyses of the imple-

mentation quality of the students’ teachers over the last 3 years revealed no significant main effects

but similar patterns in the data as the other teacher factors.

Why did we find so few significant results in the various analyses performed? Teacher,

administrator, and parent reactions to the Science PALs program were all very positive (Yore,

Shymansky, & Anderson, in press). Everyone felt that the program was very successful, and

indeed, the program continues to receive strong support still today. Why do the attitude and

achievement data contradict or at least not support these positive feelings? The extended pattern of

argument encourages researchers to explicitly consider and rebut counterarguments and alterna-

tive interpretations for the claims, evidence, backings, or warrants presented. The consideration of

alternative claims and theoretical frames are often done during the research proposal, research

implementation, data collection, data interpretation, and report-writing stages. This process

continues naturally into the peer-review process of a manuscript where objective experts analyze

and evaluate the research reports and offer suggestions for conceptual and editorial improvements.

This has been the case during the coauthoring, review, and revision process for this article. There

has been divergent interpretations concerning the lack of success in documenting student-learning

effects as a function of professional development. We have offered our best interpretation of the

lack of significant linkages between the teacher-enhancement activities and student achievement

on a knowledge test and an attitude inventory, but there are several other contextual factors and

possible explanations that need to be considered. We describe them in the following paragraphs,

not necessarily in their order of importance or their validity as rival hypotheses.

The school district in which the Science PALs effort was implemented is not an ordinary

district. It is located in a medium-sized university town that boasts a relatively high concentration

of professional people with above-average education and an above-average median family

income. The K–6 teaching staff, indeed the full K–12 teaching staff, is fully certified and teaching

in field with most teachers of 5 or more years of experience holding master’s degrees and beyond.

The student dropout rate is very low, and the percentage who go on to postsecondary school is very

high. The school district ranks in the top one to three school districts in a state that ranks in the top

one to three states in standardized test scores and at or above the 90th percentile on the number of

National Merit Scholars in most years. All this information is a way of saying that students in the

school district do very well. In light of this, it is very likely that we encountered a kind of ‘‘ceiling

effect’’ in some of our test data, particularly on the achievement scores.

This ceiling-effect argument is supported by data in a report prepared for the local school

district administrative staff and school board (Table 10). The report shows the scores of students in

the school district compared to the national and international scores on the TIMSS. Grade 3 or

Grade 4 students in the school district equaled or outscored the national students on all but one of

the items and outscored the international students on all items tested. The comparisons among

district, national, and international performance are made crystal clear when attention is focused

on the constructed-response items. Since our research design and analyses focused on com-

parisons of teachers and their students within the school district, we were trying to find differences

among students from a group that was uniformly very strong overall. Had the Science PALs effort

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 785

been implemented in a school district where student achievement scores were distributed more

normally, the results might have been radically different from what we found.

Another possible explanation for the apparent lack of the Science PALs’ impact on student

attitude and achievement is the possibility of contamination of the untreated teachers in the district

(those who chose not to participate fully or not to participate at all). All teachers in the district had

access to the same science kits prior to the start of the Science PALs project and during the 4-year

term of the project. Teachers not participating in the Science PALs professional development did

not have direct access to the resource materials generated by participating teachers as part of the

professional-development activities (i.e., TRBs, special parent-activity bags, reading and writing

in science activities, assessments, cross-curricular activities, etc.). However, there was no attempt

to vigorously guard the special resource materials; nonparticipating teachers as well as those who

committed to less than the 60 hr of professional development each year had ample opportunity to

borrow and use materials and strategies from Science PALs colleagues. As a result, data analyses

based on level of involvement may have been compromised considerably.

Then there is the critical question of how well the TIMSS instruments that we used aligned

with the science units being taught. As mentioned earlier, the TIMSS items were aligned only

categorically, the most superficial level of alignment suggested by Webb (1997, 1999) and

currently being promoted by the American Association for the Advancement of Science (2001).

Not considered by teachers who were asked to align the TIMSS with the district’s science

curriculum was the range or depth of knowledge required by the various items or their demand in

terms of cognitive complexity (Impara, 2001). However, it was simply not feasible to train the

teachers to perform the more sophisticated alignment strategies even though a more rigorous

alignment of the TIMSS with the district’s K–6 science curriculum may well have produced a

fairer test of student achievement and more valid results for our study.

Table 10

How district Grades 3 and 4 students compare to U.S. and international students

on selected TIMSS items (average percent correct for Grade 3/4 students)

International U.S. District

Multiple Choice1. (moon-shine) 64/67 71/75 66/722. (temperature) 32/44 37/54 49/613. (line cycle) 83/85 96/97 95/984. (sunscreen) 65/76 75/83 80/905. (fruits & veggies) 58/65 50/62 56/656. (bicycle on hill) 45/52 54/60 50/587. (floating block) 30/34 25/31 30/368. (inference bag) 34/43 43/58 59/679. (seed experiment) 29/36 43/61 49/62

Constructed Response1. (river in flood plain-a) 48/62 66/83 66/832. (river in flood plain-b) 16/23 21/28 28/453. (snow on mountains) 31/46 36/53 61/734. (animal protection-a) 46/60 61/77 87/925. (animal protection-b) 29/42 48/64 48/646. (candle in jar) 49/64 52/62 80/857. (sugar masses) 27/37 28/43 61/758. (air pollution-a) 31/48 45/59 52/679. (air pollution-b) 21/34 37/48 35/51

10. (oil spills) 16/27 27/46 51/64

786 SHYMANSKY, YORE, AND ANDERSON

The science supervisor’s rating of the school-district teachers using the rubric described in

Table 2 constituted yet another threat to our study’s validity. The supervisor was given free reign in

assigning the ratings. For practical reasons of an overcrowded schedule and a reluctance to

jeopardize trust between the supervisor and her staff, we could not impose a strict set of guidelines

in applying the rating rubric. Thus, some ratings were based on extensive interaction and ob-

servation data while other ratings were based on very limited data. A more rigid set of rating

guidelines and a demand for more exhaustive evidence on which to base ratings may well have

produced a more variable and different set of quality of implementation ratings.

It may be questionable to believe that inquiry science teaching of any kind can uniformly

stimulate the desired achievement gains in elementary school students with any or all generalist

teachers. Inquiry science teaching may place too great of conceptual, pedagogical, logistical, and

resource demands on generalist teachers in the current elementary school environment. If inquiry

science teaching is an ineffective instructional approach in this context, one would not expect to

detect the associated gains in conceptual and affective growth of all students involved. The 1980

meta-analyses and the research base for the science education reform in the United States did not

suggest such an interpretation (American Association for the Advancement of Science, 1993;

Shymansky, Kyle, & Alport, 2003). We continue to believe that the interactive-constructivist

approach (modified learning cycle) to inquiry teaching with cross-curricular connections to

language arts, mathematics, and social studies has unrealized potential for self-contained

classrooms involving generalist teachers.

Teacher workshops focused on inquiry modules specific to their teaching responsibilities and

developing associated instructional resources to adapt the science kits to their classroom and

students may not be an effective professional-development strategy. These instructional activities

do not fully reflect the complete challenge for teachers of implementing a new instructional

resource into their teaching that involves changing pedagogical-content knowledge, beliefs about

science and science teaching, and classroom practices, and opening the implicit beliefs of their

students to such innovative curriculum and instruction. This alternative interpretation may well

have merit, but the NSF with its call for proposals for the reported research projects appears to have

disregarded the likelihood of such an interpretation. The professional-development activities of

Science PALs were designed around the authentic work that elementary teachers do outside of

their classrooms in preparing to teach a specific science topic and providing the ongoing support

of a science advocate in their school and a science partner in their school district to help with

the actual classroom implementation. We believe the publication of research reports such as this

may influence funding agencies to have a much more open mind to what constitutes effective

professional development and to recognize the complexities and difficulties of systemic change.

Finally, perhaps no significant student learning effects were found because the Science PALs

program itself is not a robust and intense approach and the research design was not sensitive to the

changes that did occur. We considered these interpretations at the outset of the research design and

during the data analysis and interpretation. Qualitative responses reported elsewhere indicate that

teachers believe and support the parental involvement, school–home activities, and literacy

approach to teaching elementary science (Shymansky et al., 2000; Yore et al., in press). There are

convincing data that Science PALs itself works in some circumstances and that in this instance, its

failure to have a significant impact may be related to the difficulties teachers have enacting their

newly developed understanding into classroom practice, the lack of sensitivity of the measures

of science understanding and attitudes, or the quasi-experimental approach did not capture the

positive effects in some situations. But, like many controlled implementations of teaching stra-

tegies and instructional materials in selective contexts and participant samples that demonstrate

positive effects, similar effects are not generally found when a larger clientele pool is involved in

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 787

the implementation. This may be the case in the 16 elementary schools of the host school district.

Exemplar cases of selected data from some schools, teachers, and students could tell a much more

exciting story, and such a case study of successful subsystems within the larger system might well

provide insights for future research and teacher enhancement projects. Unfortunately, this study

was about systemic change involving all participating elementary schools and classrooms.

Conclusion

The results of our study support the Supovitz and Turner (2000) claim that teacher effects of

professional development should not be expected until teachers have experienced a minimum of

about 80 hr of work and suggest that student effects should not be expected for at least 3 or more

years after that. Our results also reinforce the conclusions of Loucks-Horsley and Matsumoto

(1999), who pointed out how difficult it is to study, let alone clearly establish, direct cause–effect

connections between professional development and student performance. Yet, we have no choice

as professional educators but to continue to study those connections, lest we admit that we are only

‘‘stabbing in the dark’’ with our professional-development promises and practices. Without solid

evidence that our professional-development activities do result in enhanced student performance,

we can only claim ‘‘artistic license’’ for what we do in teacher education. Despite the difficulties

and cost of studying the connection between professional development and student performance,

we have to persevere.

Notes

This article is based upon research supported by National Science Foundation Grant ESI-9911857.

Any opinions, findings, and conclusions or recommendations expressed in this article are those of the

authors and do not necessarily reflect the views of the National Science Foundation.

References

American Association for the Advancement of Science. (1990). Science for all Americans.

New York: Oxford University Press.

American Association for the Advancement of Science. (1993). Benchmarks for science

literacy. New York: Oxford University Press.

American Association for the Advancement of Science. (2001). Atlas of science literacy.

Washington, DC: Project 2061.

Darling-Hammond, L. (1996). What matters most: Teaching for America’s future (Summary

Report). New York: National Commission on Teaching and America’s Future.

Dunkhase, J., Hand, B., Shymansky, J.A., & Yore, L.D. (1997, November). The effects of a

teacher enhancement project designed to promote interactive-constructive teaching strategies in

elementary school science on students’ perceptions and attitudes. Paper presented at the Annual

School Science and Mathematics Association Conference, Milwaukee, WI.

Education Commission of the States. (1997). Investing in teacher professional development:

A look at 16 school districts. Denver, CO: Author.

Ford, C., Yore, L.D., & Anthony, R. (1997, March). Reforms, visions, and standards: A cross-

cultural curricular view from an elementary school perspective. Paper presented at the annual

meeting of the National Association for Research in Science Teaching, Oak Brook, IL.

Frechtling, J., Sharp, L., Carey, N., & Vaden-Kiernan, N. (1995, June). Teacher enhancement

programs: A Perspective on the last four decades. Washington, DC: National Science Foundation.

788 SHYMANSKY, YORE, AND ANDERSON

Garet, M., Porter, A., Desimone, L., Birman, B., & Yoon, K. (2001). What makes professional

development effective? Results from a national sample of teachers. American Educational

Research Journal, 38, 915–945.

Guskey, T. & Sparks, D. (1996). Exploring the relationship between staff development and

improvements in student learning. Journal of Staff Development, 17, 34–38.

Impara, J.C. (2001, April). Alignment: One element of an assessment’s instructional utility.

Paper presented at the National Council on Measurement in Education, Seattle, WA.

International Association for the Evaluation of Educational Achievement. (1997). TIMMS

science items (third and fourth grades). Chestnut Hill, MA: Author.

Lavoie, D.R. & Roth, W.-M. (Eds.). (2001). Models of science preparation: Theory into

practice. Dordrecht, The Netherlands: Kluwer.

Loucks-Horsley, S. & Matsumoto, C. (1999). Research on professional development for

teachers of mathematics and science: The state of the scene. School Science and Mathematics, 99,

258–271.

Messick, S. (1989). Validity. In R.L. Lenn (Ed.), Education measurement (3rd ed., pp. 13–

103). New York: Macmillan.

National Commission on Teaching and America’s Future. (1996). What matters most:

Teaching for America’s future. New York: Author.

National Research Council. (1996). National science education standards. Washington, DC:

National Academy Press.

Norris, S.P. & Phillips, L.M. (2003). How literacy in its fundamental sense is central to

scientific literacy. Science Education, 87, 224–240.

Shields, P.M., Marsh, J.A., & Adelman, N.E. (1998). Evaluation of NSF’s Statewide

Systemic Initiatives (SSI) Program: The SSI’s impacts on classroom practice. Menlo Park, CA:

SRI.

Shymansky, J.A., Kyle, W.C., Jr., & Alport, J.F. (2003). The effects of new science curricula

on student performance. Journal of Research in Science Teaching, 40, S68–S85.

Shymansky, J.A., Woodworth, G., Norman, O., Dunkhase, J., Matthews, C., & Liu, C.T.

(1993). A study of changes in middle school teachers’ understanding of selected ideas in science as

a function of an inservice program focusing on student preconceptions. Journal of Research in

Science Teaching, 30, 737–755.

Shymansky, J.A., Yore, L.D., Dunkhase J.A., & Hand, B.M. (1998, January). Science,

parents, activities, and literature: Overview, results, and reflections. Paper presented at the Inter-

national Conference of the Association for the Education of Teachers in Science, Minneapolis,

MN.

Shymansky, J.A., Yore, L.D., & Hand, B.M. (2000). Empowering families in hands-on

science programs. School Science and Mathematics, 100, 48–56.

Shymansky, J.A., Yore, L.D., Treagust, D.F., Thiele, R.B., Harrison, A., Waldrip, B.G.,

Stocklmayer, S.M., & Venville, G. (1997). Examining the construction process: A study of

changes in level 10 students’ understanding of classical mechanics. Journal of Research in Science

Teaching, 34, 571–593.

Slavin, R.E. (2003). A reader’s guide to scientifically based research. Educational

Leadership, 60(5), 12–16.

Smith, D.C. (1999). Changing our teaching: The role of pedagogical content knowledge in

elementary science. In J. Gess-Newsom & N.G. Lederman (Eds.), Examining pedagogical content

knowledge (pp. 163–197). Dordrecht, The Netherlands: Kluwer.

Supovitz, J.A. & Turner, H.M. (2000). The effects of professional development on science

teaching practices and classroom culture. Journal of Research in Science Teaching, 37, 963–980.

IMPACT OF A SCHOOL DISTRICT’S SCIENCE REFORM 789

Webb, N.L. (1997). Criteria for alignment of expectations and assessments in mathematics

and science education [Research Monograph No. 6]. Washington, DC: Council of Chief State

School Officers.

Webb, N.L. (1999). Alignment of science and mathematics standards and assessments in four

states [Research Monograph No. 18]. Madison, WI: National Institute for Science Education.

Weiss, I.R., Montgomery, D.L., Ridgway, C.J., & Bond, S.L. (1998). Local systemic change

through teacher enhancement: Year Three Cross-Site Report. Chapel Hill, NC: Horizon Research.

Yore, L.D., Shymansky, J.A., & Anderson, J.O. (in press). Sensing the impact of elementary

school science reform: A study of stakeholder perceptions of implementation, constructivist

strategies and school–home collaboration. Journal of Science Teacher Education.

Yore, L.D., Shymansky, J.A., Henriques, L., Hand, B.M., Dunkhase, J.A., & Lewis, J.O.

(1998, January). Students’ perceptions of science teaching and attitudes toward science learning

and teachers’ self-report of using children’s ideas, applications of science, and use of print

resources as indicators of classroom teaching. Paper presented at the International Conference of

the Association for the Education of Teachers in Science, Minneapolis, MN.

Zembal, C., Starr, M., & Krajcik, J. (1999). Constructing a framework for elementary science

teaching using pedagogical content knowledge. In J. Gess-Newsom & N.G. Lederman (Eds.),

Examining pedagogical content knowledge (pp. 237–256). Dordrecht, The Netherlands: Kluwer.

790 SHYMANSKY, YORE, AND ANDERSON