Professional Documents
Culture Documents
RoutledgeHandbooks 9780203961063 Chapter9
RoutledgeHandbooks 9780203961063 Chapter9
143
On: 16 Sep 2023
Access details: subscription number
Publisher: Routledge
Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: 5 Howick Place, London SW1P 1WG, UK
Publication details
https://www.routledgehandbooks.com/doi/10.4324/9780203961063.ch9
Dan Goldhaber
Published online on: 26 Nov 2007
How to cite :- Dan Goldhaber. 26 Nov 2007, Teachers Matter, But Effective Teacher Quality Policies
Are Elusive from: Handbook of Research in Education Finance and Policy Routledge
Accessed on: 16 Sep 2023
https://www.routledgehandbooks.com/doi/10.4324/9780203961063.ch9
This Document PDF may be used for research, teaching and private study purposes. Any substantial or systematic reproductions,
re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly forbidden.
The publisher does not give any warranty express or implied or make any representation that the contents will be complete or
accurate or up to date. The publisher shall not be liable for an loss, actions, claims, proceedings, demand or costs or damages
whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material.
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
9
Teachers Matter, But Effective Teacher
Quality Policies Are Elusive
Dan Goldhaber
For most, it comes as no surprise that research has consistently shown that teachers can have pro-
found effects on their students. Hanushek (1992), for instance, finds that the quality of a teacher
can make the difference of a full year’s learning growth. Research dating all the way back to the
“Coleman Report” (Coleman, 1966) has shown teacher quality to be the most important school-
ing factor influencing student achievement.1 Recent findings, based on better data and more rigor-
ous statistical methods, have confirmed that the quality of the teacher in the classroom remains
the most important schooling factor predicting student outcomes, and have furthermore shown
that there is considerable variation in quality between teachers (Goldhaber, 2007; Rivkin, Ha-
nushek, & Kain, 2005; Rockoff, 2004; Sanders & Horn, 1998; Sanders & Rivers, 1996; Wright,
Horn, & Sanders, 1997).2
Teacher quality is a common term that may be clear in concept, but has been used to mean
many different things. Some measure quality based on the credential or set of credentials that
a teacher holds. For instance, the No Child Left Behind (NCLB) Act specifically denotes that
“highly qualified teachers” are those who hold a bachelor’s degree, obtain full certification from
their state, and demonstrate competency in each subject they teach (as defined by the state).3
Others focus on teacher practices in the classroom to gauge teacher quality or use the behavior
or learning (measured in a variety of ways) of students as a metric for teacher quality. Likewise,
teacher quality has been studied and measured using a variety of both qualitative and quantitative
approaches from case study observations of teachers, to teacher reports of practices, to “value-
added production function” studies that attempt to use statistical methods to isolate the contribu-
tions that teachers make toward student achievement and/or relate particular teacher attributes
to student achievement.4 Teachers no doubt contribute in many different ways to students’ intel-
lectual and emotional growth, implying that their quality could be measured along any number
of dimensions. While admittedly a narrow conception of what teachers do, in this chapter I pri-
marily focus on research that employs a production function approach to assess the relationship
between teacher characteristics and student achievement. Throughout the chapter, I use the term
“teacher quality” to refer to a teacher’s value-added contribution in producing measurable gains
for students on standardized exams (or other easily quantifiable student outcomes, such as high
school graduation). I focus on this measure of teacher quality with the belief that effective teach-
146
9. TEACHERS MATTER; EFFECTIVE TEACHER QUALITY POLICIES ARE ELUSIVE 147
ers may have a range of attributes, strategies, and skills, some quite different from one another,
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
that contribute to student achievement on tests, which does in fact represent an important mea-
sure of genuine learning.
Unfortunately for policy makers, a large body of research suggests that teacher quality (as
defined above), while an important predictor of student success, is not strongly associated with
readily observable teacher characteristics like degree and experience levels (Hanushek, 1986,
1997). This point is well-illustrated by some research I conducted with colleagues (Goldhaber,
Brewer, & Anderson, 1999) that first estimated the overall impact of teachers on students, then
parsed out which part could be explained by specific teacher characteristics and which part was
attributable to less-easily quantifiable teacher attributes (e.g., whether they are enthusiastic and
hard working).5 We found that teachers account for approximately 8.5 percent of the variation in
students’ tenth grade achievement.6 But, the impact of unobservable teacher attributes (i.e., those
that are not included in the dataset) dwarf that of objective, observable, easily measured attributes
such as degree and experience levels: less than 5 percent of the variation in teacher quality is
explained by such quantifiable characteristics.
Different studies reach varying conclusions as to the extent to which teacher quality can be
predicted by readily observable teacher characteristics.7 A nuanced reading of the research litera-
ture suggests that some readily identifiable characteristics do predict success in the classroom. In
particular, measures of academic proficiency or cognitive ability, such as a teacher’s performance
on standardized tests (e.g., licensure tests or the SAT or the selectivity of the colleges she gradu-
ated from), and subject specific training (e.g., a degree in mathematics) in a teacher’s specialty
area appear to be predictors of teacher quality.8 However, an important theme emerging from
the newest, most sophisticated educational production function studies is that, when it comes to
teachers, there is far more variation in the effectiveness of teachers who hold a particular creden-
tial than there is between teachers with different credentials. In other words, even when a particu-
lar characteristic does predict teacher quality in a statistical sense, there are greater differences
between teachers who share a common characteristic or credential than there are differences
between teachers with different characteristics or credentials.
In this chapter, I briefly explore the research linking teachers and students and the policy im-
plications of this research. I focus on several pre-service and in-service teacher policy interven-
tions: (1) the growing use of alternative certification, (2) efforts to “professionalize” the teacher
workforce through the voluntary certification of teachers by the National Board for Professional
Teaching Standards (NBPTS), and (3) experimentation with alternative compensation plans (e.g.,
merit pay).
The chapter is organized as follows. In the next section, I provide a brief overview of research
findings that link teacher quality to readily observable teacher credentials and characteristics. In
the third section, I discuss the “more variation within than between” theme in greater detail,
explore what this might mean for state and local policy makers who play a role in shaping the
teacher workforce, and set it in the context of the major trends in educational policies designed
to enhance teacher quality through manipulation of pre-service requirements (“teacher gateway
policies”). I conclude by specifically examining the implications of the research for policies de-
signed to affect teachers who are in the workforce (“teacher workforce policies”).
Teacher quality does not appear to be strongly correlated across educational contexts (e.g., subjects,
grade-levels, etc.) with any readily observable and easily quantifiable attributes, such as licensure
148 GOLDHABER
status or degree level. This is not an uncontroversial assertion: education researchers have come
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
to different conclusions based on different studies (and sometimes based on reviews of the same
studies), and one can generally find educational research “evidence” in support of a particular
position on the relationship between teachers and students. Furthermore, it is clear that there are
some significant contextual gaps in terms of what we know about the relationships between vari-
ous teacher characteristics and student achievement. For example, most of the studies on teacher
content knowledge tend to focus on student math and reading achievement at the secondary level
(Goldhaber, 2004). With these caveats as a backdrop, I offer a brief summary of the educational
production function literature linking teacher characteristics and student achievement.
Teacher experience, and its value as a predictor of teacher effectiveness in the classroom, has been
studied extensively. For example, a 1986 review article by Eric Hanushek reports findings from
109 studies that include teacher experience as an explanatory variable. Of the 109 studies, 40 find
experience to be a statistically significant predictor of teacher effectiveness and of these, 33 find
a significant positive effect. While it is plausible that a positive finding on experience results not
from a causal relationship between experience and student outcomes, but from the tendency of
more-experienced teachers to be matched with classes composed of higher-achieving students
(e.g., due to seniority transfer policies) (Hanushek, 1986). In fact, recent empirical work utilizing
more-sophisticated econometric approaches to account for this potential problem tends to confirm
that teachers do become more effective as they gain experience early on in their careers.9
A problem with the great majority of studies of teacher experience is that they implicitly,
through their modeling techniques, treat every year of teaching experience as though it is the
same. They do this despite the fact that there may be substantial differences between, for instance,
the impact of experience gained between the first and second year of teaching and experience
gained between the twenty-fifth to the twenty-sixth year of teaching. Indeed, some more nuanced
educational productivity studies have found that not every year of teacher experience has the
same impact on teacher effectiveness as every other year. Early studies, such as Murnane (1975)
and Murnane and Phillips (1981) show non-linear experience impacts on teacher productivity
with teachers showing significant productivity benefits from early career experience gains that
tend to taper off after five or so years. This finding has grown more convincing as increasingly
detailed longitudinal data has become available, allowing researchers to examine the impacts of
experience gains at different points in a teacher’s career while accounting for the fact that teach-
ers of varying experience levels are likely not randomly assigned to students of varying abilities
(Clotfelter, Ladd, & Vigdor, 2006; Goldhaber, 2007; Kane, Rockoff, & Staiger, 2006; Rivkin et
al., 2005; Rockoff, 2004).10
The magnitude of the teacher-experience effect differs from one study to the next, but most
studies that account for non-random match of teachers and students, and that allow for non-lin-
earities in the relationship between teacher experience and student achievement, suggest that a
teacher tends to become significantly more effective in each of the first 3 to 5 years of his or her
career, with the gains from experience being larger in math.11 The positive findings on teacher
experience span different grade-levels and subjects and thus appear to be robust to different
teaching contexts.
There is relatively little evidence that teacher degree level per se is a good proxy for teacher
quality. Hanushek’s 1986 review found the expected statistically significant, positive relationship
9. TEACHERS MATTER; EFFECTIVE TEACHER QUALITY POLICIES ARE ELUSIVE 149
between teacher degree level and student achievement in only about 5 percent of reviewed studies
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
and about the same number showed a statistically significant, negative effect—roughly what one
might expect simply by chance.12
Were it the case that teacher degree-level has a large effect on teacher quality it likely would
not be so difficult to detect it. Furthermore, since more-credentialed teachers tend to be assigned
to higher-achieving students (Clotfelter et al., 2006; Lankford Loeb, & Wyckoff, 2002), one
would expect that if a bias exists in the estimates of the effect of teacher degree it would be an
upward bias, implying that many studies likely overstate the findings on experience.
There is evidence that a teacher’s degree level may matter in some specific contexts. In par-
ticular, Goldhaber and Brewer’s analyses present a more nuanced portrait of the impact of teacher
degrees in those grades (1997a, 1997b). They find that a teacher’s advanced degree is not gener-
ally associated with increased student learning from the eighth to the tenth grade, but that having
an advanced degree in math and science for math and science teachers does appear to influence
students’ achievement (1997b).13 This was not, however, true for English and history degrees
when focusing on student achievement in English/reading and history.
Goldhaber and Brewer’s findings in math and science are broadly consistent with research
by Monk (1994) and Monk and King (1994). They find that subject-matter training in math and
science predicts teacher quality. Specifically, they find that each additional course a teacher has
taken in math improves student mathematics achievement by about three quarters of one percent
of a standard deviation, and similar science preparation increases student achievement in science
by up to two thirds of one percent of a standard deviation. Assuming a math or science major
requires ten to eleven courses in subject, this would roughly be the equivalent to seven to eight
percent of the standard deviation in math achievement and around six to seven percent in science,
very similar in size to Goldhaber and Brewer’s findings.
Importantly, Monk and King find the impact of subject-specific training depends on the con-
text of the classes taught. Specifically, their results suggest that the number of math and science
courses taken by teachers while in college had an impact on sophomores’ and juniors’ achieve-
ment in math and science, but the effect is non-linear and teacher coursework had a larger impact
if a teacher is teaching a more-advanced course.
There is relatively little empirical evidence suggesting a strong relationship between subject-
matter training of teachers and student achievement at the elementary level, and the existing
evidence is less conclusive. Eberts and Stone (1984) address this relationship and find no statisti-
cally significant relationship between the number of math courses taken by teachers and their 4th
grade students’ achievement in math. Similarly, Hill, Rowan, and Ball (2005) do not find a rela-
tionship between the number of math methods and content courses (they group these together)
a teacher has completed and student gains in mathematics (in the 1st and 3rd grades). They do,
however, find that teachers who have more demonstrated mathematical knowledge (based on
their performance on a subject-matter test) produce larger gains. They estimate that a 1 standard
deviation increase in teacher math knowledge increases student achievement by 2 to 4 points on
the Terra Nova test, which, expressed as a function of average monthly student growth, is equiva-
lent to about a half month’s worth of learning gains.14 In contrast, they do not find any significant
effect for a teacher’s reading knowledge on student performance in reading.
States regulate who may become a teacher through their licensure policies. The primary purpose
of these policies is to assure the public that individuals in the teaching profession have at least the
minimal standard of teaching competence to be qualified to begin practicing in the profession.15
Most licensure system require that prospective teachers complete a standard set of college-level
150 GOLDHABER
courses in pedagogy and/or in the subject they wish to teach, and for them to demonstrate com-
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
petence by passing one or more standardized tests. However, there is also great variation among
states in the specific requirements to enter teaching, either through both traditional and alterna-
tive routes. Furthermore, there is likely to be considerable within state variation in the quality and
content of teacher training programs as these programs can differ substantially from one another
in terms of educational philosophies and approaches (Wilson, Floden, & Ferrini-Mundy, 2001).
Recent reviews of licensure studies have reached very different conclusions and have helped
to spur a public debate over the role of the state in regulating the teacher labor market. Walsh
(2001) investigates over 200 studies on licensure and finds the process to be deeply flawed. Walsh
concludes the research on the teacher attributes correlated with teacher quality “does not show
that certified teachers are more effective teachers than uncertified teachers” (p. iv). Darling-Ham-
mond (2002) refutes Walsh’s conclusions writing that: “Given the crudeness of the measure [cer-
tification], it is perhaps remarkable that so many studies have found significant effects of teacher
certification.”16
Notwithstanding the more than 200 studies cited in these reviews, there is relatively little
high-quality empirical research on the effectiveness of teacher preparation programs and licen-
sure due primarily to a lack of good data for such studies. The overwhelming majority of these
cited studies do not directly link teacher licensure status or pedagogical preparation to student
achievement at the teacher-student level (instead they all rely on aggregated data or link licen-
sure to outcomes such as teacher beliefs, teaching style, or behavior in the classroom). In fact,
in a recent comprehensive review of empirical papers relating teacher characteristics to student
achievement, Wayne and Youngs (2003) found only two papers on the impacts of teacher licen-
sure that were rigorous enough (i.e., they were peer-reviewed, used longitudinal student-level
achievement data, and controlled for prior student achievement and student SES) to meet the
requirements for inclusion in their review: Goldhaber and Brewer (1997a) and Goldhaber and
Brewer (2000).17
Both Goldhaber and Brewer studies find evidence at the high school level of positive student
achievement effects of in math of having a teacher who is certified in math, as opposed to be-
ing certified in another area. They do not, however, find statistically significant effect of science
teacher certification on science achievement (they did not examine certification in other areas).18
But, Goldhaber and Brewer do not find significant differences in achievement between students
with teachers who hold standard state certification in math, and thus have received pedagogical
preparation, and those with emergency certification in math (or other subjects), who likely do
not receive pedagogical training.19 A more recent study (Dee & Cohodes, 2005) focusing on 8th
grade students finds results that are almost identical to those of Goldhaber and Brewer’s at the
10th and 12th grades: assignment to a math teacher who is certified in her subject is predicted
to increase student achievement by about 10 percent of a standard deviation, but there are no
similarly statistically significant findings for science, social studies, or English. Croninger et al.’s
study (2007) of elementary school teachers suggests that certification status is not predictive of
teacher quality in math or reading.
More recently, several studies have focused on differences between traditionally certified
teachers and a prominent alternative certification program: Teach For America (TFA). One of the
goals of TFA is to recruit candidates who may not otherwise consider teaching. TFA recruitment
focuses on academic aptitude and leadership experience and does not require that applicants
complete education training in college. In fact, TFA explicitly states, “A degree in or coursework
in education has no bearing on a candidate’s chances of admission [to TFA]” (http://www.teach-
foramerica.org/tfa/FAQ).20
Several studies have examined TFA and non-TFA teachers in the Houston Independent School
9. TEACHERS MATTER; EFFECTIVE TEACHER QUALITY POLICIES ARE ELUSIVE 151
District (Darling-Hammond et al., 2005; Raymond and Fletcher, 2002) and reached somewhat
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
different conclusions from one another. Raymond and Fletcher use results on the TAAS exam
(the former Texas student achievement test) to compare the performance of students with TFA
teachers to students with traditional teachers. They consistently find that TFA teachers perform
comparably in reading with other new teachers (both certified and uncertified), and outperform
other new teachers in math by roughly 10 percent of a standard deviation. They also find that the
distribution of student scores with TFA teachers is more narrowly centered around the mean than
for non-TFA teachers, and conclude from this that TFA teachers have more predictable classroom
ability.
Darling-Hammond et al. find more mixed evidence on TFA teachers versus other teachers in
general (including both certified and uncertified teachers), but strongly conclude that certification
makes a difference for teacher quality. In particular, they note that uncertified TFA teachers (as
most are initially) are less effective than fully certified teachers (and about as effective as other
uncertified teachers), but TFA teachers who become certified do about as well as, and in some
cases outperform, other certified teachers.21
Neither study accounts for the potential for non-random sorting of teachers across class-
rooms, nor do they allow for non-linear experience effects—both important considerations.22
Less-than-fully credentialed teachers are likely to be teaching in more difficult work settings,
so failure to account for this could bias the estimates of the effects of these teachers downward.
Furthermore, the effects of the certification variables in the model (e.g., uncertified, TFA status,
etc.) may well be confused with the impact of teacher experience: the way the models are speci-
fied opens the possibility that the positive effects of early career experience could inappropriately
be attributed to the effects of a teacher becoming certified.23
One of the most methodologically sound studies of TFA is conducted by Glazerman, Mayer,
and Decker (2006). What makes this study unique and the findings convincing is its experimental
design: students are randomly assigned to TFA and non-TFA teachers.24 Glazerman et al. find
that students with TFA teachers do no worse in reading and do better in math than the students
of non-TFA teachers. Specifically, in the TFA—non-TFA comparison (where non-TFA includes
both certified and uncertified teachers), the authors find statistically significant positive effects
in the growth of math scores of 15 percent of a standard deviation, but no significant effects in
the growth of reading scores. More striking is the finding that these results held whether the
comparison teachers were novice or experienced, and certified or uncertified, with statistically
significant effect sizes ranging from 9 to 26 percent of a standard deviation in mathematics, de-
pending on the comparison. None of these effects were statistically significant for reading in any
of the comparisons.
Another study that accounts for the potential nonrandom match of teachers to schools is
that of Boyd et al. (2005a), which examines the performance of teachers in New York City (in
grades 3–8, in math and English language arts) who enter the teacher workforce though various
alternative pathways relative to that of teachers who obtained a traditional teacher license. The
alternative pathways Boyd et al. consider include Teach For America, the NYC Teaching Fellows
program, and the Teaching Opportunity Program (two alternative programs that are specific to
NYC). Overall, students of TFA teachers were found to have gains similar to those of traditionally
certified teachers, but students of teachers who entered through one of the other alternate path-
way had slightly lower gains. The authors also provide evidence that the effects of having a TFA
teacher are different across grades, with TFA teachers being relatively more effective in higher
grade levels (the differences in the effect sizes are small and not consistently statistically signifi-
cant, so not worth reporting).25
One of the major fault lines in the research on the effectiveness of TFA teachers (and other
152 GOLDHABER
alternative licensure programs) has been over defining an appropriate comparison group to TFA
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
teachers. While some argue that the only appropriate comparison to make is TFA teachers to
traditionally (fully) licensed teachers, others believe that this does not paint an accurate picture
because it is unrealistic to assume that TFA teachers are the relevant alternative to traditionally
licensed teachers and vice versa. The TFA program, by design, tends to place teachers in schools
that have great difficulty hiring traditionally licensed teachers and, in the absence of other policy
changes (e.g., increases in teacher salaries), hiring officials for these types of schools often have
to make choices between various types of alternatively licensed teachers—and sometimes be-
tween alternatively licensed teachers and other options, such as long-term substitutes. Thus, my
own view is that, while an interesting comparison, the TFA or alternatively licensed teacher
versus the traditionally licensed teacher comparison is not really the relevant one for making
policy decisions. Rather, the other teachers, certified or not, who are also employed in the schools
employing TFA teachers are the appropriate comparison group.
More importantly, new research suggests that there is far greater variation between teach-
ers who enter the workforce with the same credential than between teachers who hold different
credentials, which implies there may be a benefit to focusing on in-service teacher policies as
opposed to pre-service policies, like certification. This issue is discussed in greater detail below.
assign them to more advanced classes.26 Educational production function studies tend to find far
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
stronger relationships between resources and student achievement the more aggregated the data
and, as Hanushek Rivkin, and Taylor (1996) show, these studies are more likely to suffer from
“aggregation bias” due to problems such as the correlation between the effect in question and
omitted factors (e.g., state education policies).
Several more recent papers that explore the relationship between teacher performance on
licensure exams and student achievement using disaggregated data find a significant but smaller
relationship between teacher performance on licensure exams (the Praxis tests used in North
Carolina) and student achievement and, importantly, show that the distribution of teachers across
schools and students can greatly influence the magnitude of estimated effects. 27 Clotfelter et al.
(2006) analyze this relationship for fifth-grade students and find that a 1 standard deviation in-
crease in a teacher’s average test score is predicted to increase students’ reading achievement by
1.1 percent of a standard deviation, and to increase math achievement by 1.8 percent of a standard
deviation.28 Furthermore, they convincingly show that the distribution of teachers across schools
greatly influences the estimates of the effects of teacher characteristics like their licensure scores.
Goldhaber (2007) focuses on elementary grades (4th–6th ) and estimates the value of licensure
tests as a signal of teacher effectiveness. He finds consistent evidence that teachers who pass the
licensure tests are, on average, more effective, but the difference between those who pass and
those who fail is relatively small: increasing student achievement by 4 to 6 percent of a standard
deviation. However, licensure tests appear to provide information about teacher quality beyond
the simple pass/fail signal, as there is a positive relationship between a teacher’s performance
on licensure tests and student achievement even among teachers who have all met the state stan-
dards for passing the exam. Like Clotfelter et al., Goldhaber’s study confirms the importance of
accounting for teacher sorting by showing that the estimates of the relationship between teacher
licensure and student achievement are far higher when one does not account for the nonrandom
distribution of teachers across schools and classrooms.
Several new research studies (Cavalluzzo, 2004; Goldhaber and Anthony, 2007; Vandevoort et
al. (2004) suggest that the National Board for Professional Teaching Standards (NBPTS) cer-
tification provides an indicator of teacher quality. All three find positive effects of having an
NBPTS-certified teacher in both math and reading tests, though the magnitude of the estimated
effects differs. Cavalluzzo, who focuses on the mathematics achievement of high school students
in the 9th and 10th grades, finds that, all else equal, students with an NBPTS-certified teacher
gain 12 percent of a standard deviation more than other students on a year-end exam. Goldhaber
and Anthony study students in grades 3–5, and find that having an NBPTS-certified teacher is
predicted to improve student achievement (outside of the year that a teacher becomes certified)
by about 5 percent of a standard deviation (in both subjects). Since research suggests that NBPTS
certified teachers, on average, perform substantially better on licensure tests than non-NBPTS
teachers and tend to be teaching in more advantaged educational settings (Goldhaber, Anthony,
& Perry, 2004), one might hypothesize that the NBPTS effect is simply a reflection of non-
random student-teacher matching or that all the information provided by NBPTS certification
status could be garnered by knowing a teacher’s licensure test performance. This turns out not
to be the case. Goldhaber and Anthony’s study suggests that the estimated effect is influenced
by student-teacher matching, but not solely attributed to it, and that the NBPTS credential pro-
vides information beyond that conveyed by other objective teacher attributes, including licensure
test performance. Finally, the Vandervoort, Amrein-Beardsley, and Berliner study (2004), which
154 GOLDHABER
focuses on students in grades 2 through 6, suggests that, when averaged across grade and subject
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
area (math, reading, and language), students with an NBPTS certified teacher gain 12 percent of
a standard deviation more than other students.29
Not all the research on NBPTS teachers shows such positive effects. A more recent large-
scale quantitative study (Sanders, Alston, & Wright 2005) generally finds little evidence of sta-
tistically significant, positive, NBPTS effects.30 The differences in findings between this and the
aforementioned NBPTS studies may be explained by differences in datasets, methodology em-
ployed, or the fact that NBPTS itself has gone through some changes (e.g., in 2001 there was a
change in the weighting of different assessments used to judge NBPTS candidates). But, what is
interesting about the Sanders et al. study is the emphasis on the fact that there is greater variation
in teachers who receive the NBPTS credential than between NBPTS and non-NBPTS teachers.
As Sanders et al. (2005) state, “the variation among teachers within the same certification status
was significantly large such that whatever small average differences there were between teach-
ers in different certification status categories were rather meaningless in comparison.” (p. 2). I
explore the implication of this emerging theme in research findings in the next two sections of
the chapter.
The findings discussed above suggest that a teacher’s readily observable characteristics (e.g.,
licensure status) are a relatively weak screen for teacher quality. This makes crafting policies de-
signed to ensure a basic level of teacher quality based on pre-service credentials difficult: simply
put, there do not appear to be readily quantifiable paper credentials that strongly predict teacher
quality. Several recent papers aptly illustrate this conclusion. The aforementioned Goldhaber
(2007) paper on teacher licensure tests shows that there is far more variation between teachers
who either pass or fail the North Carolina licensure test than there is between all those who pass
and all those who fail. Kane et al. (2006) investigate the relationship between a teacher’s route
into the classroom (e.g., traditional certification, TFA, Teaching Fellows Program) and student
achievement, and find that there is much more variation in teacher quality within a route to the
classroom than there is between routes.
The finding that paper credentials, such as certification, do not sharply delineate good teach-
ers from bad is quite prevalent in research. In fact, it is perfectly consistent with the research
previously cited (Goldhaber et al., 1999; Nye et al., 2004) showing that the portion of teacher
quality that is explained by commonly measured attributes such as degree and experience levels
is dwarfed by that explained by any number of more subtle, and less-easily quantified teacher
attributes. Even when a particular teacher attribute meets the standard of being “statistically sig-
nificant,” it is far from a guarantee that a teacher holding that attribute would be considered an
effective teacher.
Teachers, of course, come in “package form,” implying that they may tend to have groups of
attributes or credentials. And, as Clotfelter et al. (forthcoming) show, even if individual teacher
credentials don’t appear to accurately predict teacher quality, it is conceivable that groups of cre-
dentials do. Still, one logical implication flowing out of the finding that paper credentials do not
accurately predict teacher quality is that policies ought to deemphasize the certification filter of
individuals into the teacher workforce and place far greater emphasis on teacher performance in
the classroom, based on actual classroom performance (I discuss this in greater detail in the next
section). This suggestion, which has been made by a number of researchers (Ballou & Podgur-
sky, 1998; Gordon, Kane, & Staiger, 2006; Hanushek & Rivkin, 2003), is consistent with states’
9. TEACHERS MATTER; EFFECTIVE TEACHER QUALITY POLICIES ARE ELUSIVE 155
growing reliance on alternative certification programs that permit individuals with divergent pre-
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
service experiences to move into the teaching profession, at least briefly (many programs require
teachers to obtain “full” state certification within a few years of teaching).31 The idea is that by
relaxing the gateway (state-level) requirements for entering teachers, local school districts will
be provided the opportunity to evaluate a greater number of teachers based on their actual per-
formance, thus allowing districts to make more informed decisions about whether to retain them
and how much compensation to provide them.
While a relaxation of the certification gateway may seem a sensible shift in policy emphasis
in light of the research findings described in this chapter, it is important to note that changes in
pre-service requirements have ambiguous impacts on the quality of teachers in the workforce,
and there are likely important distributional impacts (Boyd et al., 2007; Goldhaber, 2004). For
example, a relaxation of the certification gateway might ultimately improve the average quality of
the teacher workforce, but allow more very low-quality teachers into the workforce, at least for a
time, in the process.32 Is this a worthy tradeoff? Clearly this is a judgment call that will depend on
how one values the potential harm done by having more low-quality teachers in the workforce as
compared to the potential benefit of having high-quality teachers in the workforce.
Furthermore, a relaxation of the certification gateway raises questions about an important
mediating factor in determining the quality of the teacher workforce: the link between teacher
certification policies and the quality of the teacher workforce in the local school system at the
point of hire. After certification, local school systems act as a second gateway in determining the
composition of the teacher workforce; therefore, how apt they are in selecting teachers is an im-
portant mediating factor in determining the quality of the teacher workforce (Goldhaber, 2004).
In theory at least, local hiring officials have the capacity to garner extensive information about
teacher candidates—information that may go a long way toward explaining the significant varia-
tion researchers observe for teachers who are in the labor force. Unfortunately, school system
hiring processes are largely a black box operation to researchers, at least in terms of the available
systemic information about the efficacy of the specific processes schools (or districts) use in se-
lecting among teacher applicants (DeArmond & Goldhaber, 2005).
The little evidence that does exist suggests that the hiring processes could be better. Ballou
(1996), for instance, conducts research on the likelihood that teachers with various attributes are
hired, and finds that districts do not appear to value measures of academic proficiency: teacher
applicants who attended “above average” colleges are found to be significantly less likely to be
hired than were applicants who had attended “below average colleges.” Furthermore, Ballou
finds that teacher attributes such as undergraduate GPA and subject specialties have only a small
effect on an applicant’s probability of being hired. The available empirical evidence suggests that
public schools tend to rely on local labor pools to find teachers (Boyd et al., 2005b) and, though
they do have preferences when it comes to hiring teacher candidates (Boyd et al., 2003), they rely
on the presence of teacher licensure as a primary means of screening among applicants. They
then look to graduation from a state-approved teacher education program as a secondary means
of winnowing out the applicant field (U.S. Department of Education, 1996).
Research suggests that new teachers have relatively few interactions with school-based
personnel during the hiring process (Liu & Johnson, 2006; Strauss et al., 2000). Few school
systems—less than 10 percent in a recent study (Liu, 2003)—require prospective teachers to
demonstrate their craft through the teaching of a sample lesson. Instead, districts rely on tran-
scripts, letters of reference, and resumes (Balter & Duncombe, forthcoming) and many—perhaps
as many as 2,000 school districts, though the actual number is unknown (Metzger & Wu, in
press)—rely on relatively short structured interviews to help make hiring decisions. There does
not appear to be any independent analysis suggesting that these teacher interview instruments
156 GOLDHABER
(e.g., the Urban Teacher Selection “Star” Teacher Interview and Gallup’s Teacher Perceiver
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
Shaping a high-quality teacher workforce through legislative and regulatory policies has proven
thus far to be an elusive goal. It is difficult if not impossible to address the teacher quality issue
through federal or state mandates given the apparent weak link between paper credentials and
student outcomes, which makes choosing the “right” teachers an imprecise endeavor. Illustrate
this point with an example from my own work on NBPTS certification. As I described above,
work with a colleague (Goldhaber & Anthony, 2007) finds that a teacher being NBPTS-certified
has a statistically significant impact on student achievement and the effect, about 5 percent of a
standard deviation, is non-trivial. But, what does this mean for NBPTS status as a predictor of
teacher quality? To answer this question, in follow-up work (Goldhaber, 2006), I estimate the to-
tal impact that teachers in the study have on their students and divide the sample of teachers into
those who are NBPTS-certified and those who are not.34 This simulation shows that teachers who
become NBPTS-certified are likely to have larger teacher impacts than the average teacher in the
sample, but not terribly consistently: 59 percent of NBPTS-certified teachers have above-average
effects in math, and 56 percent have above-average effects in reading—certainly more than what
one would expect from a teacher taken from the workforce at random (50 percent), but far from
an overwhelming percentage of the NBPTS-certified population as a whole.
One conclusion that could be drawn from the considerable heterogeneity in teacher effec-
tiveness is that schools ought to craft and emphasize policies that evaluate teachers based on their
actual in-service practices and performance. There are various types of reforms along these lines.
For example, it is not too controversial to suggest that the type of professional development of-
fered to teachers be tailored to individual teacher needs. More controversial are calls for changes
in teacher-tenure policies that diminish the up-front (often after three years) teacher job security
that exists in many districts (Johnson & Donaldson, 2006). Also contentious is the notion that pay
systems ought to be restructured, either so they are more closely aligned to teacher productivity,
so-called “merit pay” (Ballou & Podgursky, 1993; Hanushek & Rivkin, 2003; Teaching Com-
mission, 2006), or so that they encourage teachers to acquire key teaching skills, “knowledge and
skills pay” (Odden & Wallace, 2007).35
The NBPTS model probably represents the most prominent and significant of these teacher
workforce policies. Supporters of NBPTS believe it plays an important role in professionaliz-
ing teaching and ultimately will increase students’ learning by helping to change the culture in
schools and spread productive teaching practices. As of 2005 there were over 47,500 NBPTS-cer-
tified teachers. The cost to society of these teachers depends on the extent of support and services
they receive from states or school districts. Rice (forthcoming), in a case study of four NBPTS
support programs, estimates that the participant cost—including, for instance, the payment of
the application fee, administrative infrastructure and mentoring costs—of these programs ranges
from about $18,000 to $31,000 per participant. And, Goldhaber and Anthony (2007) estimate that
the investment to-date in the NBPTS model, either through application fees or direct government
or foundation grant funding, has exceeded $400 million.
The financial support for NBPTS shows that it has been broadly accepted as a workable
9. TEACHERS MATTER; EFFECTIVE TEACHER QUALITY POLICIES ARE ELUSIVE 157
and productive educational policy. But, as I described above, it is clear that, at least as measured
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
and localities have clearly taken the position that the devil they don’t know is better than the devil
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
they do, at least in the context of teacher compensation. Denver and Houston are examples of
localities that have embarked on high-profile experiments with pay-for-performance, Minnesota
and Florida are moving in that direction at the state level, and the new Teacher Incentive Fund
from the U. S. Department of Education encourages these types of experiments. Ideally, policy
makers, regardless of their position on these particular reforms, will encourage research on their
effects so we can learn from these experiments. This would move the debate about workforce
reforms from one that is largely based on inferences about the lack of efficacy of pre-service poli-
cies to one based on demonstrated effects of specific teacher-workforce policies and programs.
NOTES
1. Here and henceforth the terms “student achievement” and “student outcomes” should be treated as
synonymous with “value-added gains on achievement tests”. The term “teacher quality” refers to the
ability of teachers to produce such gains on achievement tests.
2. Both Rivkin et al. (2005) and Rockoff (2004) find that a one-standard deviation increase in teacher
quality is estimated to raise student achievement in reading and math by about 10 percent of a standard
deviation. Furthermore, this effect swamps other educational resource interventions— being equiva-
lent to a reduction in class size of 10 to 13 students (Rivkin et al., 2005).
3. For specific legal definition of “highly qualified teachers” see No Child Left Behind Act, Title IX, Sec.
901, Part A, Sec. 9101.23.
4. For more background on educational production function methods and studies, see Goldhaber and
Brewer (1997a, 1997b), Hanushek (1979), or Todd and Wolpin (2003).
5. Specifically, we estimated the contribution of unobservable teacher effects by calculating the incre-
mental contribution to explained variation of adding teacher fixed effects to an educational production
function model that includes school fixed effects and observable student and class covariates.
6. This estimate falls squarely within the range of estimates reported by Nye, Konstantopoulos, and
Hedges (2004) from 18 different studies.
7. Rivkin et al. (2005), for instance, finds about 10 percent is explained by such characteristics. For
a summary of this literature see, for example, Darling-Hammond (2000), Rice (2003), Wayne and
Youngs (2003) or Wilson, Floden, and Ferrini-Mundy (2001).
8. Non-random attrition from the teacher labor force could also lead to biased estimates of the coefficient
on teacher experience.
9. The fact that teachers of varying experience levels are likely not randomly assigned to students of
varying abilities can lead researchers to misattribute the impact of student-teacher matching to teacher
experience (or any other observable teacher characteristic), but this issue can be dealt with using ap-
propriate statistical techniques given a particular data structure.
10. Specifically, Rivkin et al. (2005) estimate that elementary and middle school students (in 4th–7th
grades) of teachers with no prior experience are about 3–5 percent of a standard deviation worse off in
both math and reading than those with teachers who have one year of experience. In contrast, the gain
from the increased experience achieved by going from the 1st to the 2nd year of teaching is smaller
(and borderline statistically significant) at around 2 to 3 percent of a standard deviation. Rivkin et al.
find little gain in experience beyond year two of teaching. Clotfelter et al. (2006; forthcoming) find
similar results. In math they find a gain between the first and second year of teaching of between 5
to 7 percent of a standard deviation, and in reading the gain ranges from 2 to 4 percent of a standard
deviation. The Rivkin et al. study includes student and school fixed effects, while the Clotfelter et al.
includes student lagged test scores.
11. Follow-up studies by Hanushek reach similar conclusions about the effects of teacher degree and ex-
perience levels. As an example, in Hanushek’s 1989 review, teachers’ advanced degrees predicted
higher levels of student achievement in only 13 of the 183 studies reviewed, and there was a negative
9. TEACHERS MATTER; EFFECTIVE TEACHER QUALITY POLICIES ARE ELUSIVE 159
relationship in 6 of the 13 studies. Even the Greenwald et al. (1996) review, which reaches far more
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
positive conclusions about the ability to predict teacher quality based on teacher attributes, only finds
a statistically significant, positive association between teacher degree and student achievement in 15
percent of the cases (and a statistically significant, negative effect in 13 percent of the cases).
12. Specifically, they find that a teacher having an undergraduate major in math increases students’ math
achievement by about 5 percent of a standard deviation and having a Masters degree in math increases
students’ math achievement by an additional 4 percent of a standard deviation. A teacher having an
undergraduate science major increases students’ achievement in science by 9 percent of a standard
deviation (they did not find a statistically significant effect for advanced degrees in science). The same
results were not found to be true for teachers of English or history, where the authors do not find col-
lege major (or subject area of a Masters degree) to influence student achievement.
13. Hill et al. did not report the student post-test standard deviation so it is not possible to present conven-
tional effect sizes that are comparable to those specified elsewhere in this report.
14. I use the terms “licensure” and “certification” interchangeably (except when discussing NBPTS certi-
fication) as is the practice in education research.
15. Darling-Hammond (2002) wrote that Walsh’s 2001 report “dismissed or misreported much of the exist-
ing evidence in order to argue that teacher education makes no difference to teacher performance or
student learning, and that students would be better off without state efforts to regulate entry into teach-
ing or to provide supports for teachers’ learning” (p. 60). A rejoinder by Walsh to Darling-Hammond
refutes this assertion.
16. For examples of other reviews, see Wilson et al. (2001) and Clift and Brady (2005).
17. They find that there is roughly a 10 percent of a standard deviation increase in math scores for high
school students whose math teacher is fully licensed in math.
18. Emergency certification can be issued to teachers who have not satisfied all of the requirements neces-
sary to obtain a standard certificate.
19. Upon selection to the TFA program, TFA corps members must complete a five-week summer training
institute at one of the regional centers. Following the summer institute, teachers must comply with any
other state and/or local requirements for certification such as passing required certification exams or
continuing coursework during their teaching assignment. The exact requirements vary depending on
the local agreements that districts have made with TFA. See http://www.teachforamerica.org/tfa/certi-
fication.html for more information.
20. The effect sizes (relative to traditionally certified teachers) for uncertified TFA teachers range from
–0.03 to –0.30 of a standard deviation in reading and math, while the effect sizes for certified TFA
teachers range from 0 to 0.11 of a standard deviation.
21. Nor do they account for clustering of students within classes, which is likely to influence the level of
statistical significance of their findings (i.e., the precision of their estimates are likely overstated).
22. Aside from concerns about their effectiveness, a second concern about TFA teachers is their high rate
of attrition out of the teacher labor market. A number of studies highlight issues associated with the
attrition rate out of teaching. Teacher turnover has received substantial attention in research because
of the costs associated with finding and hiring teachers as well as potential impacts on student perfor-
mance. Besides the relatively short (formal two–year) time commitment that TFA teachers make to
teach, there are other reasons to expect that TFA teachers would have high turnover rates. For instance,
research has found that teachers have higher rates of attrition if they are teaching in urban schools
(Levinson, 1988; Hanushek et al., 2004), if they are from strong academic undergraduate institutions
(Murnane & Olsen, 1989, 1990), if they are secondary science teachers (Murnane & Olsen, 1989,
1990), and if their level of specific human capital (i.e., they have training specifically applicable for
teaching, such as an education major) is low relative to their general human capital (Grissmer & Kirby,
1992).
23. Because these students are randomly assigned, one would not expect the match between teachers and
students to result in any systematic differences in the estimates of student achievement growth.
24. Kane et al. (2006) is another recent study (using New York City data) that compares TFA teachers to
teachers who enter the workforce through alternate routes. The findings from this study are similar to
160 GOLDHABER
those in Boyd et al.: differences in teacher performance by route (TFA or other) are generally quite
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
small. In fact, this research shows that the differences between routes are small relative to the differ-
ences between teachers within each of these routes.
25. Aggregation also limits the ability to detect differential teacher effects on different types of students.
26. A few individual student-level studies focus on other measures of academic proficiency. Ehrenberg
and Brewer (1994), for instance, include a measure of the selectivity of a teacher’s undergraduate
institution (based on the Barron’s college ratings) in their model and find that students learn more
from teachers who attended more-selective undergraduate institutions. They found that an increase
of one selectivity category would increase white students’ gain scores by about 0.4 points and black
students’ scores by about 1.5 points. This is to be compared with a mean white student gain score in
the sample of 0.56 points. Rowan, Chiang, and Miller (1997), using individual-level student data from
the National Educational Longitudinal Study of 1988 (NELS), find that whether a teacher correctly
answers the single mathematics question on the NELS survey predicts their students’ performance on
a mathematics test. Answering correctly is estimated to increase student achievement by 2 percent of a
standard deviation.
27. Goldhaber (forthcoming) focuses on elementary grades (3rd–5th) and finds roughly similar results: a 1
standard deviation change in teacher test-score performance is predicted to increase student test scores
by slightly less than 1 percent of a standard deviation in reading and slightly less than 4 percent of a
standard deviation in math.
28. A number of studies on NBPTS-certified teachers and their impacts on student learning are ongoing
at the time of this writing. The results of these writings should be available as they are published at
http://www.nbpts.org/research/index.cfm.
29. NBPTS-certified teachers tend to have positive effects, but they are small and generally do not reach
conventional levels of statistical significance.
30. For more information on the growing use of alternative certification and changes in eligibility require-
ments to teach, see, for instance, NASDTEC Manual (2002), The Teaching Commission (2006), or
National Center for Alternative Certification website (http;//www.teach-now.org).
31. The implication of this is that the variation in teacher quality is increased as a consequence of the re-
laxation of certification requirements.
32. Based on a review of the evidence, Metzger and Wu (in press) conclude that there is little empirical
evidence that these instruments are related to (any measure of) teacher effectiveness.
33. To measure the total teacher impact, I estimate a model that includes teacher fixed effects and then
predict the size of each teacher’s fixed effect. The findings from this simulation should be interpreted
with caution, since the “teacher effect” is based only on a three-year panel of matched teachers and
students, and the simulation discussed below does not account for either the size of individual teacher
effects nor the statistical precision to which they are measured.
34. Calls to reform teacher compensation are not new; in fact, most of the Nation at Risk recommendations
called for the link between teacher pay and performance (Odden & Wallace, 2007).
35. NBPTS and its supporters likely would suggest that NBPTS certified teachers do far more than just
increase student learning on standardized tests.
36. As Kane and Staiger (2002) illustrate in the case of measuring school-level performance, there is a sub-
stantial amount of “noise” resulting from test measurement error or the luck of the draw in students.
37. See, for instance, Aaronson, Barrow, and Sander (2003), Ballou et al. (2004), and Koedel and Betts
(2005), as recent studies that focus on some of these issues.
38. New research suggests there may be alternatives to using student tests as the primary benchmark for
teacher performance. Research by Jacob and Lefgren (2006), for instance, suggests that principals
generally do a good job of distinguishing the most- and least-effective teachers. Regardless of the
accuracy of measuring teacher performance, simply knowing that there exists some type of teacher
performance-based accountability may itself change behavior.
39. Research on professional development has focused both on the form or delivery that in-service training
takes (e.g., who leads in-service training, whether the training is a one-shot workshop or more time
intensive with follow-up) and on the content of the training (e.g., subject-matter, working with special
9. TEACHERS MATTER; EFFECTIVE TEACHER QUALITY POLICIES ARE ELUSIVE 161
education students, or classroom management techniques). There are a tremendous number of stud-
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
ies on these issues, but most of them are either small-sample case studies or they focus on the impact
of training on teacher attitudes or instructional practices. Those studies that do focus on measures of
student achievement tend to be methodologically weak. For example, few studies include the types of
school, teacher, and student background controls that are common in the educational production func-
tion studies.
REFERENCES
Aaronson, D., Barrow, L., & Sander, W. (2003). Teachers and student achievement in the chicago public
high schools. Federal Reserve Bank of Chicago, Working Paper Series: WP-02–28.
Ballou, D. (1996). Do public schools hire the best applicants? Quarterly Journal of Economics, 111(1),
97–133.
Ballou, D. (2001).Pay for performance in public and private schools. Economics of Education Review, 20,
51–61.
Ballou, D., & Podgursky, M. (1993). Teachers’ attitudes toward merit pay: Examining conventional wis-
dom. Industrial and Labor Relations Review, 47(1), 50–61.
Ballou, D., & Podgursky, M. (1998). The case against teacher certification. Public Interest, 132, 17–29.
Ballou, D., Sanders, W., & Wright, P. (2004). Controlling for student background in value-added assessment
of teachers. Journal of Educational and Behavioral Studies, 29(1), 37–65.
Balter, D., & Duncombe, W. (forthcoming). Recruiting highly qualified teachers: Do district recruitment
practices matter? Public Finance Review.
Boyd, D., Lankford, H., Loeb, S., & J. Wyckoff. (2003). Analyzing the determinants of the matching pub-
lic school teachers to jobs: Estimating compensating differentials in imperfect labor markets. NBER
Working Paper No. 9878.
Boyd, D., Grossman, P., Lankford, H., Loeb, S., & Wyckoff, J. (2005a). How changes in entry requirements
alter the teacher workforce and affect student achievement. Retrieved September 11. 2007, from the
New York State School Finance Reform online archive: http://finance.tc-library.org/Content.asp?abst
ract=true&uid=1771.
Boyd, D., Lankford, H., Loeb, S., & Wyckoff, J. (2005b). The draw of home: How teachers’ preferences for
proximity disadvantage urban schools. Journal Analysis of Policy Analysis and Management, 24(1),
113–132.
Boyd, D., Goldhaber, D., Lankford, H., & Wyckoff, J. (2007). The effect of certification and preparation on
teacher quality. Future of Children, 17(1), 45–68.
Cavalluzzo, L. C. (2004). Is national board certification an effective signal of teacher quality? The CNA Cor-
poration. Retrieved September 11, 2007 from http://www.cna.org/documents/CavaluzzoStudy.pdf.
Clift, R. T., & Brady, P. (2005). Research on methods courses and field experiences. In M. Cochran-Smith &
K. Zeichner (Eds.), Studying teacher education: The report of the AERA panel on research and teacher
education (pp. 309–424). Mahwah, NJ: Erlbaum.
Clotfelter, C. T., & Ladd. H. F.. (1996). Recognizing and rewarding success in public schools. In H. F. Ladd
(Ed.), Holding schools accountable: Performance-based reform in education (pp. 23–63). Washington,
D.C.: The Brookings Institution
Clotfelter, C. T., Ladd, H. F., & Vigdor, J. (2006). Teacher-student matching and the assessment of teacher
effectiveness. Journal of Human Resources, 41(4), 778–820.
Clotfelter, C. T., Ladd, H. F., & Vigdor, J. (forthcoming). Teacher credentials and student achievement:
Longitudinal analysis with student fixed effects.” Economics of Education Review. (A longer version
of this paper is also available as NBER Working Paper 12828, How and why do teacher credentials
matter for student achievement?”)
Cohen, D. K., & Hill, C. (2000). Instructional policy and classroom performance: The mathematics reform
in California. Teachers College Record, 102(2), 294–343.
162 GOLDHABER
Coleman, J. S. (1966). Equality of educational opportunity. Washington, D.C.: U.S. Dept. of Health, Educa-
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
Goldhaber, D., & Anthony, E. (2007). Can teacher quality be effectively assessed? National Board Certifica-
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
tion as a signal of effective teaching. Review of Economics and Statistics, 89(1), 134–150.
Goldhaber, D., Anthony, E., & Perry, D. (2004). NBPTS certification: Who applies and what factors are as-
sociated with success? Educational Evaluation and Policy Analysis, 26(4), 259–280.
Goldhaber, D., & Brewer, D. J. (1997a). Why don’t schools and teachers seem to matter? Assessing the
impact of unobservables on educational productivity. Journal of Human Resources, 32(3), 505–523.
Goldhaber, D., & Brewer, D. J. (1997b). Evaluating the effect of teacher degree level on educational perfor-
mance. In J. W. Flower (ed.), Developments in school finance 1996 (pp. 197–210). Washington, D.C.,
National Center for Education Statistics..
Goldhaber, D., & Brewer, D. J. (2000). Does teacher certification matter? High school teacher certification
status and student achievement. Educational Evaluation and Policy Analysis, 22(2), 129–145.
Goldhaber, D., Brewer, D. J., Anderson, D. (1999). A three-way error components analysis of educational
productivity. Education Economics, 7(3), 199–208.
Goldhaber, D., Player, D., DeArmond, M., & Choi, H. J. (2006). Why do so few public school districts use
merit pay? Working paper.
Gordon, R., Kane, T. J., & Staiger, D. O. (2006). Identifying effective teachers using performance on the
job. The Hamilton Project: Discussion Paper 2006–01. Washington, D.C.: Brookings Institution. Re-
trieved September 11, 2007, from http://www.brook.edu/views/papers/200604hamilton_1.pdf.
Greenwald, R., Hedges, L. V., & Laine, R. D. (1996). The effect of school resources on student achievement.
Review of Educational Research, 66(3), 361–396.
Grissmer, D. W. & Kirby, S. N. (1992). Patterns of attrition among Indiana teachers, 1965–1987 (Report No.
R-4076-LE). Santa Monica, CA, The RAND Corporation.
Hanushek, E. A. (1979). Conceptual and empirical issues in the estimation of education production func-
tions. Journal of Human Resources, 14(3), 351–388.
Hanushek, E.A. (1981). Throwing money at schools. Journal of Policy Analysis and Management, 1, 19–41.
Hanushek, E. A. (1986). The economics of schooling: Production and efficiency in public schools. Journal
of Economic Literature, 24(3), 1141–1177.
Hanushek, E. A. (1989). The impact of differential expenditures on school performance. Educational Re-
searcher, 18(4), 45–51,62.
Hanushek, E. A. (1992). The trade-off between child quantity and quality. Journal of Political Economy,
100(1), 84–117.
Hanushek, E. A. (1997). Assessing the effects of school resources on student performance: An update. Edu-
cational Evaluation and Policy Analysis, 19(2), 141–164.
Hanushek, E.A., & Rivkin, S. G. (2003). How to improve the supply of high quality teachers. Brookings
Papers on Education Policy.
Hanushek, E. A., Rivkin, S. G., & Taylor, L. L. (1996). Aggregation and the estimated effects of school
resources. Review of Economics and Statistics, 78(4), 611–627.
Hanushek, E. A., Kain, J. F., & Rivkin, S. G. (2004). Why public schools lose teachers. Journal of Human
Resources, 39(2), 326–354.
Hatry, H. P., Greiner, J. M., & Ashford, B. G. (1994). Issues and case studies in teacher incentive plans (2nd
ed.). Washington, D.C.: The Urban Institute Press.
Hess, F. M., Maranto, R., & Milliman, S. (2000). Resistance in the trenches: What shapes teachers’ attitudes
toward school choice? Educational Policy, 14(2), 195–213.
Hill, H. C., Rowan, B., & Ball, D. L. (2005). Effects of teachers’ mathematical knowledge for teaching on
student achievement. American Educational Research Journal, 42(2), 371–406.
Jacob, B., & Lefgren, L. (2006). When principals rate teachers. Education Next, 6(2): 59–64.
Johnson, S. M., & Donaldson, M. L. (2006). The effects of collective bargaining on teacher quality. In J.
Hannaway & A. J. Rotherham (Eds.), Collective bargaining in education (pp. 111–140). Cambridge,
MA: Harvard Education Press.
Johnson, S. M., & Kardos, S. M. (2000). Reform bargaining and its promise for school improvement. In T.
Loveless (Ed.), Conflicting missions? Teachers unions and educational reform (pp. 7–46). Washington,
D.C.: Brookings Institution Press.
164 GOLDHABER
Kane, T. J., & Staiger, D. O. (2002) The promise and pitfalls of using imprecise school accountability mea-
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9
Raymond, M. E., & Fletcher, S. (2002). Teach for America: An evaluation of teacher differences and student
Downloaded By: 10.3.97.143 At: 14:51 16 Sep 2023; For: 9780203961063, chapter9, 10.4324/9780203961063.ch9