Professional Documents
Culture Documents
The Changing State of Assessment - EdWeek Spotlight
The Changing State of Assessment - EdWeek Spotlight
edweek.org
2013
Table of CONTENTS:
1 One Districts Common-Core
Bet: Results Are In
5 New NAEP Demands
Application of Knowledge
Commentary:
10 Waving the Flag for
Formative Assessment
11 Gaining a New Appreciation
for Assessments
Resources:
13 Resources on The Changing
State of Assessment
By Catherine Gewertz
Washington
In a meeting at Watkins
Elementary School last
month, teachers from
Stuart-Hobson Middle
School comb through
data from last spring's
DC CAS test.
edweek.org
Promoting a Class
The few months before the August data sessions had been intense ones for Ms. McNair-Lee.
In late April, she battened down hard with her
8th graders to prepare for the year-end tests. In
the weeks that followed, she battled their exhaustionand her owntrying to keep them
focused on the material that had to be covered
before graduation.
Finally, they were done. On a mid-June evening, the teenagers flooded the polished StuartHobson hallways, a river of girls in white dresses,
with pinned-up hair, and a stream of boys in crisp
collared shirts and snazzy shoes. Just the sight
of them dissolved Ms. McNair-Lees promise not
to cry. She had taught them for two years, called
them her babies, and knew all too well that
while some would soar when they left her, others
would stumble.
In an auditorium packed with family members,
the students listened to speeches and songs, then
rose from their seats and filed across the stage for
the last time. It was Ms. McNair-Lee who called
each of them by name as they came up the stairs
and made their way through a line of handshakes. When it was Mikels turn, he crossed the
stage with a serious face, then broke into a big
grin when social studies teacher Sean McGrath
grabbed him for a hug.
Principal Dawn Clemens took the microphone
and said: By the power vested in me, I pronounce each and every one of you a high school
student. The room erupted in cheers and camera flashes, then emptied, slowly, into hallway
gridlock. Without a tissue, Ms. McNair-Lee kept
wiping her cheeks as swarms of girls locked her
in group hugs.
On the front steps of the school, Mikel found
a moment for a farewell hug from his English
teacher. Then he turned his back to the school
to join his family, and they drifted off down
the sidewalk in the fading evening light.
Making Headway
Six weeks later, at a middle school a few
miles away, districtwide test scores were announced. At a celebratory event featuring
District of Columbia Mayor Vincent C. Gray
and Schools Chancellor Kaya Henderson, the
school district lauded its progress.
While city schools are far from where they
need to be, they said, with just under half of
students reading on grade level, theyve come
a long way: Reading scores are 4 percentage
points higher than in 2012, and 13 points
higher than six years ago. (Scores in math,
science, and composition rose, also.)
Reflecting on the scores in an interview over
the summer, Ms. Henderson said they show
that the districts investments in curricular
materials, professional development, and good
edweek.org
basic, proficient, and advancedare now divided into seven. That allows a school to earn
20 points on the districts accountability index
when a child moves up even within one of
those bands, from low below basic to high
below basic, for instance.
The change is meant to encourage teachers
to focus not just on the bubble kids poised
to move into proficiency, but on all students
regardless of their place on the performance
spectrum.
The August data sessions flowed into a
schoolwide improvement plan for StuartHobson that includes the goal of reaching 70
percent proficiency in reading this school year.
Better coordinating the work of the three
English/language arts teachers and working
with teachers across the curriculum on techniques to help students master complex text
will be key strategies in reaching that goal.
A New Focus
The District of Columbia systems focus on
the common standards in reading now moves
into year 3, but layered on top is a push into
the writing standards. That new priority was
front and center in a late-August professionaldevelopment day.
Spread across classrooms on two floors of a
high school, secondary-level teachers hunker
down with instructional coaches to work on
sentence composition.
Stuart-Hobsons Matt Foster is here, with
other 6th grade teachers, in a session co-led
by Sarah Hawley, Stuart-Hobsons assigned
instructional coach. Mr. Plaisted joins 7th
grade teachers across the hall, and Ms. Green
does likewise with her 8th grade group.
Echoing the session leaders in the other
rooms, Ms. Hawley guides her 6th grade
teachers in an exercise about subordinating
conjunctions. Theyre learning how to work
this kind of instruction into a class study of a
text, instead of teaching it in isolation. Theyre
exploring how to scaffold the ideas, so all
students can grasp them.
In the coming months, other professionaldevelopment sessions will focus on composing
sentences and building paragraphs. Working
with their own coaches at their schools this
year, teachers will bring samples of their students work to analyze and to inform their
instructional plans.
As Ms. Hawley begins shaping this years
coaching plans for the teachers, she factors
in a complex blend of teachers and students
needs, test-score data, last years emphasis
on text complexity and close reading, and this
years move into the writing standards.
Two of the three English/language arts
teachers are new to Stuart-Hobson this year,
so new working relationships must be formed.
edweek.org
Advertisement
COMMON CORE
Assessments
2013 Measured Progress. All rights reserved. Measured Progress is a registered trademark and Measured Progress COMMON CORE and its logo are trademarks of Measured Progress, Inc.
edweek.org
New Direction
The TEL represents a significant shift for
the battery of tests commonly dubbed the
nations report card. It will be NAEPs first
entirely computer-based test and the first to
use a majority of interactive scenario-based
questions.
International assessments, in particular the Program for International Student
Assessment, already gauge proficiency in
more comprehensive and applied-science
questions, which is in part why experts say
American students performance tends to lag
behind that of students in other countries on
PISA.
More than 2,000 engineering and technology professionals from around the United
States contributed to the development of
the tests framework, which covers three interconnected areas: the design process and
principles of dealing with technology in daily
life; information and communication-systems
technology, such as computer networks and
mobile devices; and the social and ethical
implications of technologys effects in the
natural world.
Were pretty good at assessing students in
science, but how do we assess the difference
between a scientific solutionsome sort of
global, perfect universal solutionand engineering, which is a lot more about trade-offs
and constraints in a given situation to get a
solution that works? Mr. Buckley said.
The solution, he said, is to include much
more complex and higher-order-thinking
Testing Scenarios
Roughly 20 percent of the tests questions
will cover concrete facts and information. The
rest will use a new kind of question, which requires students to interact in engineering or
technology scenarios, to apply ways of critical thinking and problem-solving that are associated with engineering.
Each scenario is 10, 20, or 30 minutes long
and gauges a students mastery of engineering practices, such as systematically using
technology, tools, and skills to solve a problem
or achieve a specific goal, or using technology
to communicate and collaborate with a team
and consult experts.
For example, a student may be asked to collaborate with a simulated boss via videoconference to improve the consumer life cycle
of a toaster.
NAEP is not alone in the world of largescale standardized assessment in trying to
come up with ways to better assess how people work collaboratively, Mr. Buckley said.
Moreover, the test will also begin to use
student-activity data to report and evaluate
how the student solves each problem. For
example, NAEPs writing test collects information about how students used the in-test
word-processing software to check spelling
and edit sentences, but does not use that information to evaluate students performance.
In this assessment, a student might get more
points for answering a problem efficiently and
making the best use of the tools available.
The intent is to be much more authentic
and closer to a real project, Mr. Buckley said.
During the tests administration, the NCES
will also collect data on students access to
technology at home and teachers use of technology in the classroom.
Coverage of science, technology, engineering, and
mathematics education is supported by a grant
from the Noyce Foundation, at www.noycefdn.org.
edweek.org
Pleasantly Surprised
When it comes to childrens ability to take
a test on a computer rather than with paper
and pencil, many district officials around the
country say their students had no problems adapting to the new format.
I was pleasantly surprised that these
third graders were able to maneuver from
problem to problem much better than I
had anticipated, says Kent Henson, the
assistant superintendent for instructional services for the West Ottawa public
schools in Holland, Mich. About 240 students in the 7,200-student district took
tests in the Smarter Balanced pilot this
spring.
Henson says: They had to drag and
drop, to highlight, and they had to compare and contrast. They had to write a
letter. They had to watch a video, which
meant putting on headphones. They had
to fill in boxes on a table. There were a lot
of different mouse-manipulation tasks.
The pilot test questions were a mix of
multiple-choice questions, problem solving, short-answer responses, and other
tasks. Students had to drag and drop answers into different boxes.
Some districts in Michigan experienced
technical difficulties with the pilot testing, he says.
I heard about schools that had issues
with servers that werent working. Some
kids were kicked out [of the system] or it
wouldnt accept their login, says Henson.
I heard that in one-to-one [computing]
schools where kids were taking these
tests on laptops of various kinds, they had
[technical] problems.
McKernan of the Albuquerque district
says joining in the PARCC pilot was eyeopening for teachers because they could
see how the common standards will be
assessed.
It makes things more concrete. It
leaves less room for each of us to interpret the standards in our own way, she
says. It isnt about assessment driving
instruction. Its about assessment articulating the expectations in a more concrete
fashion. That was very helpful for our
teachers.
Matthews of Californias LarkspurCorte Madera schools also says the pilot
was helpful for teachers to know in what
ways students will be asked to show what
they know.
Common-core standards are more
rigorous, with more of an emphasis on
critical thinking and problem-solving, he
says. So this pilot helped to inform their
instruction because teachers saw how
learning is going to be measured in the
future.
Some students who were comfortable
with the technology itself struggled with
the actual content of the test, according to
some district officials.
They loved doing math on the computer, and they are very quick with the
mouse, Loughrey, the assessment manager in Albuquerque, says. But after observing a 6th grade class taking the test,
he asked the teacher about how she felt
her students handled the material.
She said that while they may say that
they did fine, her sense was that a lot of
them struggled with the material, he
says. The problems were rigorous. They
pushed the kids.
edweek.org
Busting Up
Misconceptions
About Formative
Assessment
By Catherine Gewertz
edweek.org
People still
talk about
delivering
instruction to
children, as if
theyre mailboxes
into which
learning will be
delivered.
MArgaret Heritage
Assistant Director for Professional
Development, National Center for
Research on Evaluation, Standards
and Student Testing
edweek.org
Pellegrino, who led the session discussion with Herman, told participants that
the crafters of the criteria have a suite of
assessments in mind, not just one summative test. Formative strategies and tools to
help teachers gauge and guide learning as
it happens are an important part of that picture, as are interim tests, he said. But what
appears on large-scale summative tests is
important, he said, because of its power to
shape what happens in the classroom. So
statewide summative tests must include
and encourage the types of activities that
are complex and meaningful learning activities, he said.
One education researcher in the audience,
Fritz Mosher, noted that embracing largescale summative exams with the qualities
outlined in the report would suggest that
states and districts have embraced curricula
defined by similar criteria. Chuckling that
Mosher had brought the C word into the
conversation, Pellegrino noted that most
curricula and tests dont reach that level.
We have a long way to go in our curriculum
development process, he said.
Participants brainstormed about how
to keep states and districts focused on the
potential benefits of higher-quality tests.
Particularly important, some said, was including a number of performance tasks that
require students to engage in longer, more
complex activities to demonstrate their
understanding of the materials. But those
items are more costly to build and score.
How to build crucial buy-in for good assessments as brushfires of anti-testing sentiment keep cropping up in states is a crucial
question for which there were no easy answers.
Experience in recent decades with largescale testing has left the public disillusioned
by fairly sterile multiple-choice exams, Pellegrino said, so the task of convincing people
that new tests will be fundamentally differentand betteris a tough one.
Part of the backlash is legitimate, he
said. [People ask], Why are my kids spending time answering meaningless questions?
We have a ways to go to demonstrate that
the questions we are designing are relevant
and valuable.
Advertisement
COMMON CORE
Assessments
Web: MeasuredProgress.org
Advertisement
COMMON CORE
Assessments
Interim
Benchmarks
Benchmarks
Instruction
& Testlets
Interim
Instruction
& Testlets
Benchmarks
Instruction
& Testlets
Benchmarks
Interim
Summative
Instruction
& Testlets
Benchmarks may be administered every five to nine weeks and allow for in-depth coverage of more than one set
of standards used to inform instructional pacing.
Testlets support classroom formative assessment practices by assessing a targeted set of standards within
Common Core domains and clusters.
Items from the Item Bank can be searched by standard and chosen to create customized assessments for daily
instruction. Teachers can also author their own items, including drag-and-drop, hot spot, and other technologyenhanced items.
Interim Assessments are general achievement measures used to evaluate the full scope of Common Core
knowledge and skills relative to a whole academic year.*
Summative assessments (such as consortia or state summative assessments) serve as a year-end
accountability measure.
1. Student-Friendly Interface
Students can type, draw, and interact with various item types,
giving them the ability to mark up passages, type in answers
to constructed-response items, and draw graphs or work out
math problems right beside an item or passage.
Web: MeasuredProgress.org
Advertisement
COMMON CORE
Assessments
3. Real-Time Results
Web: MeasuredProgress.org
2013 Measured Progress. All rights reserved. Measured Progress is a registered trademark and Measured Progress COMMON CORE and its logo are trademarks of Measured Progress, Inc.
edweek.org
10
Commentary
very educator knows what a teachable moment is. Its the brief period
of time when events serendipitously
conspire to teach students something
that otherwise might be difficult for them to
learn. Teachable moments are really quite
special, and they dont come along all that
often. A teacher who wastes a teachable moment, therefore, commits a pedagogical sin of
omission.
Interestingly, American educators are now
on the cusp of a different sort of special moment. In this instance, it stems from a unique
historical occasion during which teachers
adoption of the formative-assessment process
should be advocated with both honesty and
unparalleled zeal. Yes, this is formative assessments advocatable moment.
First off, it is important to recognize that
formative assessment works. Thats right:
Ample research evidence is now at hand to indicate emphatically that when the formativeassessment process is used, students learn
betterlots better. This should come as no
surprise, for the essence of formative assessment is surely commonsensical.
Formative assessment is simply a planned
process wherein teachers, or their students,
use assessment-elicited evidence of student
learning to decide whether to make changes
in what theyre currently doing. Teachers
find out if they need to adjust their ongoing
instruction. Students find out if they need to
alter the ways in which theyre trying to learn.
Formative assessment is, at bottom, an endsmeans process in which teachers and/or students rely on assessment consequences (the
ends) to decide whether any adjustments are
warranted in what theyre doing (the means).
Its really not surprising that formative assessment works so well.
What is surprising is how few U.S. teachers
use the process.
It does work, and it can make teachers more
effective. Yet, although considerable rhetoric
has been expended in recent years calling for
teachers to employ formative assessment, its
usage in our classrooms is meager. Nonetheless, two events now taking place in American
education provide us with a unique opportunity to remedy this shortcoming. In fact, they
set the stage for a special moment when education leaders of all stripes can legitimately
advocate the use of formative assessment.
Lets briefly consider what they are.
The first event stems from adoption of the
Common Core State Standards by almost all
our states. Not surprisingly, commercial publishers are inundating U.S. educators with
instructional materials ostensibly directed at
promoting student mastery of the standards.
But lets be honest, we really wont know the
true nature of the common cores success until
the two assessment consortiathe Partnership for Assessment of Readiness for College
and Careers, or PARCC, and the Smarter Balanced Assessment Consortium, or SBAC
complete their test-building, by the spring
of 2015. Only then will U.S. educators know
with certainty how the common core has been
operationalized, and whether students have
mastered the content.
And heres where formative assessment can
prove beneficial to the nations teachers.
Remember, formative assessment helps
students master curricular targets. Rather
than asking teachers to guess about what
the common core really means, shouldnt we
urge teachers to sharpen their instructional
skills through the use of formative assessment? Then, when the assessment-consortia
tests are released, those teachers can focus
their more potent instruction on the skills
and knowledge the tests are measuring. The
choice for educators shouldnt be between curricular guessing and becoming more instructionally skilled.
A second event thats setting the stage for
full-on advocacy of formative assessment is
the installation of more-stringent teacherevaluation procedures throughout the United
States. Spurred by federal incentives, including the Race to the Top grants, state officials
have recently adopted teacher-evaluation
systems in which student growth must be
a significant factor. Indeed, in many states,
fully 50 percent of a teachers evaluation will
edweek.org
11
First Person
lot of time. Assessments are intended to provide important information about a student,
but at a certain point you begin to wonder if
they may actually be hurting student learning.
Return on My Investment
It took me two whole months to finish my
basic literacy assessments in Mexico City.
Based on the results, I was able to group
students for work on particular skills, strategies, and reading levels. With only a few
months left of school, I could now focus on
supporting the specific needs of these students.
Was it worth the time? At first I couldnt
help but wonder if it would have been more
effective to just dive into working with stu-
edweek.org
12
WEB
Links
edweek.org
13
Resources on
The Changing State
of Assessment
Now featuring interactive hyperlinks.
Just click and go.
resources
Algebra
Assessment
E-Learning
Gifted Education
Uses for
1 Schools Find
Data Techniques
Predictive
for Real-Time
4 Leading the Charge
Data
rivacy Rules
6 Proposed Data-P
States
for
Timely
Seen as
on
Swift Progress
7 States Make
logy
Student-Data Techno
Crash
8 Surviving a Data
n
Tractio
Gains
9 Data Mining
in Education
Y:
COMMENTAR
s
of Data Analysi
11 My Nine Truths
a Data-Driven
12 Education as
Enterprise
Information Poor
13 Data Rich But
RESOURCES:
30, 2011
Published June
in Education
Week
es for
Schools Find Us
Techniques
ta
Da
ive
ict
ed
Pr
the
stand ard in
long been a
and
t They ve
h credit scores
tic tools to predic
ess worldbot
ated
he use of analy
ing busin
ums are calcul
mance is explod
surance premi
student perfor
Yet they have
s say car-in
analytic tools.
tion, and expert
with predictive
in higher educa
in education.
for K-12
hold
se
take
promi
to
slower
even more
looking anthe tools show
r place- been
ts are great at
hing from teache
School distric
ative assessschools, in everyt
, doing summ
t prevention.
nually at things
few are
ment to dropou
iques is
g back, but very
statistical techn
ments and lookin
Erlendson, the
Use of such
s, however,
rd, said Bill
legiate school
looking forwa
32,000-stuhindered in precol
d to help
ntendent for the
traine
superi
nt
ct in
rchers
d School Distri
ing assista
by a lack of resea
San Jos Unifie our economy surof the data, accord
dent
sense
g
make
iderin
districts
to
California. Cons
ers.
ics, its amazing
to education watch
array of vives on predictive analyt
tive analytics
ics include an
me that predic
Predictive analyt
edudata
public
as
ds, such
dont drive
e in
statistical metho
catio n. Mayb
mode ling,
minin g and
ify
ident
to
used
the factors that
pred ict the
likelihood of
a specific
result.
By Sarah D. Sparks
Reinventing
Special
Technology
2011
ing
n Decision Mak
On Data-Drive
CONTENTS:
INTERACTIVE
Math Instruction
Access to quality
Editors Note:
with
district leaders
data provides
ed
to make inform
the opportunity
management
instructional and
Spotlight
decisions. This
ial risks and
examines the potent
and
s
system
data
advantages of
in which data can
the various ways
e learning.
be used to improv
Homework
Reading Instruction
Differentiated
Teacher Evaluation
Response to Intervention
l
Motivation
Parental Involvement
Professional Development
in the Classroom
On Teacher Evaluation
Editors Note: Assessing teacher
performance is a complicated
issue, raising questions of how to
best measure teacher
effectiveness. This Spotlight
examines ways to assess teaching
and efforts to improve teacher
evaluation.
INTERACTIVE CONTENTS:
1 Wanted: Ways to Assess
the Majority of Teachers
4 Gates Analysis Offers Clues
to Identification of Teacher
Effectiveness
5 State Group Piloting Teacher
Prelicensing Exam
6 Report: Six Steps for Upgrading
Teacher Evaluation Systems
7 Peer Review Undergoing
Revitalization
COMMENTARY:
10 Moving Beyond Test Scores
12 My Students Help Assess
My Teaching
13 Taking Teacher Evaluation
to Extremes
15 Value-Added: Its Not Perfect,
But It Makes Sense
RESOURCES:
WEEK Spotl
ight
on imple ment
ing comm
on
StandardS
edweek.org
2012
On Implementi
ng Common Sta
ndards
Editors Note:
In order to
implement the
Common Core
State Standards,
educators
need instructional
materials and
assessments.
But not all states
are moving at
the same pace,
and some district
s are finding
common-core
resources in
short supply. This
Spotlight
highlights the
curricu
professional develo lum,
pment, and
online resources
available to
help districts prepar
e for the
common core.
InteractIve
cOntentS:
1 Educators in
Search of
Common-Core
Resources
4 Higher Ed. Gets
Voting
Rights on Assessm
ents
6 Common Cores
Focus on
Close Reading
Stirs Worries
7 Few States Cite
Full Plans
for Carrying Out
Standards
8 Common Core
Poses
Challenges for
Preschools
10 Common Core
Raises PD
Opportunities,
Questions
cOmmentar
y:
11 Standards: A
Golden
Opportunity for
K-16
Collaboration
PAGE 2>
12 The Commo
n-Core
Contradiction
Data-Driven
15 Resources on
Decision Making
reSOurceS:
14 Resources on
Common Core
iStock/ olandesina
iStock /123render
Educ ation
Published Februa
ry
Educators in Searc
h
of Common-Core
Resources
By Catherine Gewe
rtz
??
Principals
Bullying
Data-Driven Decisionmaking
Autism
iStock/kyoshino??
www.edweek.org/go/spotlights