Ann Brown

Design Experiments: Theoretical and Methodological Challenges in Creating Complex
Interventions in Classroom Settings

Author(s): Ann L. Brown
Source: The Journal of the Learning Sciences, Vol. 2, No. 2 (1992), pp. 141-178
Published by: Taylor & Francis, Ltd.
Stable URL: http://www.jstor.org/stable/1466837 .
Accessed: 07/03/2011 03:59
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at .
http://www.jstor.org/action/showPublisher?publisherCode=taylorfrancis. .
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Taylor & Francis, Ltd. is collaborating with JSTOR to digitize, preserve and extend access to The Journal of
the Learning Sciences.
http://www.jstor.org
THE JOURNALOF THE LEARNINGSCIENCES,2(2), 141-178
CopyrightO 1992, LawrenceErlbaumAssociates,Inc.
Design Experiments:Theoretical
and Methodological Challengesin
Creating Complex Interventions
in Classroom Settings
Ann L. Brown
Universityof California-Berkeley
The lion's shareof my currentresearchprogramis devotedto the study of

learningin the blooming, buzzingconfusion of inner-cityclassrooms.My
high-levelgoal is to transformgrade-schoolclassroomsfrom work sites
where studentsperformassignedtasks underthe managementof teachers
into communitiesof learning (Bereiter& Scardamalia,1989; Brown &
Campione, 1990)and interpretation(Fish, 1980),wherestudentsare given
significantopportunityto take chargeof theirown learning.In my current
work, I conduct what Collins (in press) refers to as design experiments,
modeled on the proceduresof design sciences such as aeronauticsand
artificialintelligence.As a designscientistin my field, I attemptto engineer
innovativeeducationalenvironmentsand simultaneouslyconduct experi-
mentalstudiesof those innovations.This involvesorchestratingall aspects
of a periodof dailylife in classrooms,a researchactivityfor whichI was not
trained.My trainingwas that of a classiclearningtheoristpreparedto work
with "subjects"(rats, children,sophomores),in strictlycontrolledlabora-
tory settings. The methods I have employedin my previouslife are not
readilytransportedto the researchactivitiesI overseecurrently.
In Figure 1, I illustratethe critical aspects of my current classroom
research. Central to the enterpriseis that the classroom must function
smoothly as a learningenvironmentbefore we can study anythingother
than the myriadpossibleways that things can go wrong. Classroomlife is
synergistic:Aspects of it that are often treated independently,such as
teachertraining,curriculumselection, testing, and so forth actuallyform
Requestsfor reprintsshouldbe sentto Ann L. Brown,Universityof California- Berkeley,

Graduate School of Education, Education in Mathematics,Science, and Technology,
Berkeley,CA 94720.
-A
DESIGN EXPERIMENT
Contributions Cl
to LearningTheory Te
Cu
TA
Engineering a Working
Environment
Output
Assessment of the right things
Accountability
FIGURE1 The complexfeaturesof design exper

DESIGNEXPERIMENTS 143
partof a systemicwhole. Just as it is impossibleto changeone aspectof the

systemwithoutcreatingperturbationsin others,so too it is difficultto study
any one aspect independentlyfrom the whole operatingsystem. Thus, we
are responsiblefor simultaneouschangesin the system,concerningthe role
of studentsand teachers,the type of curriculum,the place of technology,
and so forth. These are all seen as inputs into the workingwhole.
Similarly,we are concernedwith outputsfrom the system,a concernthat
leads us to look at new formsof assessment.It is essentialthat we assessthe
aspectsthat our learningenvironmentwas set up to foster, suchas problem
solving, criticalthinking,and reflectivelearning.Assessmentalso allowsus
to be accountablefor the resultsof our workto the childrenthemselves,to
parents,to teachers,to local authorities,and, last but not least, to fellow
scientists.
Another criticaltension in our goals is that between contributingto a
theoryof learning,a theoreticalaim that has alwaysbeen a keystoneof our
work, and contributingto practice.This is interventionresearchdesignedto
inform practice. For this to be true, we must operate always under the
constraintthat an effectiveinterventionshouldbe ableto migratefrom our
experimentalclassroomto averageclassroomsoperatedby and for average
students and teachers, supportedby realistictechnologicaland personal
support.
In the first part of the paper, I trace briefly how I moved from the
classicalpsychologicalposition of concentratingon a theoreticalstudy of
the learningprocessesof individualstudentsto a concentrationon concep-
tual changein teachersas well as students,to settingup a classroomethos
that wouldfosterself-reflectivelearning,and finallyto biggerconcernswith
technology, curriculumissues, and assessment.
My change in focus was a gradual evolution rather than an unpre-
meditatedleap to instruction(Brown& Campione,1986).Even thoughthe
researchsetting has changed dramatically,my goal remainsthe same: to
work toward a theoreticalmodel of learningand instructionrooted in a
firm empirical base. I regard classroom work as just as basic as my
laboratoryendeavors,although the situatednature of the researchlends
itself most readily to practicalapplication. In the classroom and in the
laboratory, I attempt to engineer interventionsthat not only work by
recognizablestandardsbut are also based on theoreticaldescriptionsthat
delineatewhy they work, and thus renderthem reliableand repeatable.
In the first part of this paper, I describehow I arrivedat the current
juncture, that is, attempting to restructureclassrooms at many levels
includingdesigningcurricula,introducingnew roles for teachers,students,
and researchers,and reconceptualizingassessment.The problemwith such
design experimentsis that changes made in one part of the system
reverberate throughout, and one is often forced to conduct the multiply
144 BROWN
confoundedexperimentsthat are a nightmarefor an experimentalpsychol-

ogist. This discussionprovidesthe groundworkfor the major part of the
article,in which I addressthe theoreticaland methodologicalproblemsof
workingin the rich, complex, and constantlychangingenvironmentof the
classroom.
LEARNINGTHEORY:A PERSONAL HISTORY

In this section,I tracemy own shift towardstudyinglearningin classrooms,
althoughthe steps I took reflectchangesin the field in general.Beginning
in the 1970s, and hence avoiding psychology'smajor shift away from
behavioristlearningtheories,I concentrateon the more subtlechangesthat
took place in learning theory after the so-called cognitive revolution.
Although the theoreticalperturbationsof this period were less traumatic
than in the previousera, therewas nothingshortof a sea changein theories
of learning,with a correspondingrevolutionin how learningwas conceived,
observed, and fostered. There was also a dramaticchange in what "sub-
jects"wererequiredto learn,even in laboratorysettings,and an awakening
to the fact that real-lifelearninginevitablytakes place in a social context,
one such setting being the classroom. Psychologistsare creaturesof their
time, and the methodsthey use to attacksuchdurableproblemsas learning
must be reconsideredin the light of theorychange.The main point of this
article is that learning theory has undergonemajor modifications even
within the cognitivist period of the last 20 years, and methodological
changes are needed to reflect these developments.I trace the sea change
througha personalhistory.
Throughoutmy career,I have been interestedin the problemof learning
to learn, a topic with an honorablehistory in psychologyand education.
Educationalists have been interested in the problem because of two
stumblingblocksto lastinglearning:(a) inertknowledge(Whitehead,1916):
studentsacquirefacts that they cannotaccessand use appropriately;and (b)
passivelearning:studentsdo not readilyengagein intentional,self-directed
action. Thesetwo problemsarecommonlyregardedas diseasesof schooling
(Brown,1977).Attemptsto studythesephenomenaled to a researchagenda
that saw fundamentalchanges in the focus of developmentaltheory, an
agendathat focusedon activestrategiesfor learning,whatit meansto learn
(rote learningvs. understanding),the content that learnersare to acquire,
and the context in whichthey are to acquireit.
Strategies, Capacity, and the Training Study

Like many experimentaland developmentalpsychologistsoperatingduring
the 1970s, my research focused on human memory: We avoided the term
learningbecause of its associationwith behavioristtheories. During this

period, there was an importantshift in how we characterizedchildren's
purportedlypoorer memories;we progressedfrom blaming poor perfor-
mance on limited memorycapacityper se to concentratingon the role of
active memory strategies (Brown, 1974, 1975; Chi, 1976, 1978). This
controversyled to the introductionof the trainingstudyas a theoreticaltool
to tease apartthe relativecontributionsof capacityand strategicactivity.
The argumenttook the form of a question:Do young childrenfail to use
strategiesbecause they do not think to do so, or do they suffer from a
mediationaldeficiencysuchthat strategicinterventionwould not help them
anyway?The responseto this theoreticalquestionwas to train childrenin
the use of strategies to see if their performancewould be mediated
effectively once they applied appropriatestrategies.In informationpro-
cessing terms the distinction was between voluntary control processes
(strategies)and structuralfeatures (memorycapacity), a distinctionthat
also led to attemptsto distinguishthe two throughtraining(Atkinson&
Shiffrin, 1968; Brown, 1975). Note, however, that this was an issue of
theory, an attemptto unravelmysteriesof the memorysystem. The work
had nothing to do with theoriesabout how to make a betterlearner.The
playershad absolutelyno educationalrelevancein mind.
The main findings of 10 years' work can be summarizedin two
statements:
1. Trainingworked, even given the pathetic amounts providedin the
typical study. Childrencould be trainedto use simplestrategies,and
when they did use them, memoryimproved.
2. Trainingdidn'twork,becausetherewas little evidenceof maintenance
in the absenceof the experimenter'spromptingand less evidenceof
independenttransfer(Brown, 1978;Brown& Campione, 1978).
Importantmethodologicalissuessprangfromthis changeof focus. When
dealing with capacity, one simply asked how much was learned;this was
true even when taking into accountsuch complexproblemsas "chunking"
(Chi, 1976;Miller, 1956).But cognitivestrategiesaretypicallyconductedin
the privacy of one's mind, and therefore it became necessaryto devise
creative means of externalizingthese critical mental events. Extremely
cleverwaysweredevisedto tricksubjectsinto showinghow they wentabout
remembering,not just how much they retained. The focus on internal-
memoryprocessesratherthan externalmemoryproductsraisedinteresting
measurement questions that remain problematic (Brown, Bransford,
Ferrara,& Campione, 1983). And a recurringtheme in this articleis that
theoreticalchangesduringthe last 20 years broughtwith them increasing
demands for more subtle methods to render overt the human thought
processes that are usually covert.
146 BROWN
Metacognition
A major impetus to my early work was a review in which a leading

theoreticiancharacterizedthe vast body of memoryresearchas trivial at
best, and pointed out that psychologistsdo not study the most interesting
thing about human memory, the fact that people have knowledge and
beliefs about it (Tulving& Madigan, 1970). This topic became known as
metamemory(Flavell, 1971), and it was enthusiasticallystudiedby devel-
opmentalpsychologistsbecausethe problemof inertknowledgeandpassive
learningappearedto be more prevalentin the young.
My researchagendain the 1970swas governedby these questions:Why
is it that young childrendo not use strategies?Is it that they do not know
that such activitiesare useful, or that they do not care, or a little of both?
Entermetacognition:knowledgeabout and controlof one's own learning.
Studies of metacognitiondominated the latter half of the 1970s, even
though it was agreedthat the termwas fuzzy and ill defined(Brown, 1975,
1978;Flavell& Wellman, 1977;Wellman, 1983).Again, to summarizethe
outcomescrudely:Childrendo not use a wholevarietyof learningstrategies
becausethey do not know much about: (a) remembering(they know little
about the strategiesand tacticsof overcomingmemorylimitations)and (b)
monitoring(theydo not thinkto orchestrate,oversee,plan, and revisetheir
own learningactivities). The terms remembering,monitoring, strategies,
and metacognitionwidely replacedmemory, denotingan importanttheo-
retical shift from passive to active metaphorsof learning.This shift was
attendedby a need to developnew methodsto captureactive strategicand
monitoringprocesses(Brown& Campione, 1979;Schoenfeld, 1985).
Again, along with others, I took the next step in this saga and attempted
to train metacognition.The reasoningwas as follows. If metacognitionis
the missinglink, let's trainit. To cut a long story short, it turnsout not to
be easy to train a learner to be strategic, to select cognitive activities
intelligently,to plan, to monitor,to be cognitivelyvigilant,economical,and
effective!And it is particularlydifficultto do so in arbitrarycontextswhere
a learneris attackingmeaninglessmaterialfor no purposeother than to
please an experimenter. I believe it is for this reason that the
decontextualizedapproachto metacognitivetrainingwas largelyunsuccess-
ful, and issuesof the contentand contextof learningcameto dominatethe
1980s.
Content and contexts of learning. I will gloss over the history of

these developments and merely point out that, concomitant with the
emphasison activelearning,therewere far-reachingmodificationsin what
it was that childrenwere requiredto learn, togetherwith the contexts of
learning in which they were observed. These developmentsalso had a

fundamentalinfluenceon the natureof learningtheory.
Duringthe 1970s,psychologistsinterestedin learningand memorybegan
shifting from the almost exclusivestudy of the learningof lists of words,
pictures,and pairedassociatesto a concentrationon coherentcontent.They
began also to look at the acquisitionof expertisewithina domain- gained
over long periods of time via concentrated, and often self-motivated,
learning (chess, for example). More recently, learning theorists have
consideredthe acquisitionof disciplinedbodiesof knowledge,characteristic
of academicsubjectareas (mathematics,programming,physics, and even
some areasof social science).Learningtheorieshad to changeto reflectthis
(Glaser& Bassok, 1989).My own researchalso reflecteda shift to the study
of learning,remembering,and understandingcomplextexts, whichin turn
led to studiesof readingcomprehensionand comprehensionmonitoringin
specific content areas.
These changeswere accompaniedby the need for novel methodologies.
When studying list learning, one simply asked how many items were
recalled,or, moresubtlywhatorganizationis inherentin the orderin which
items are recalled.This organizationwas taken as a measureof conceptual
level, preference,or proclivity. But when dealing with understandingof
complex texts, the problem became one of how to capture degrees of
understanding,nuances of meaning, acceptablealternativeviewpoints, a
problemthat has kept the field busy for severalyears.
Duringthe sameperiod,therewerealso dramaticchangesin the contexts
in whichstudentswereaskedto learn.For example,let us tracebrieflywhat
happenedto the training study. The prototypicaltrainingstudy had the
following features:
1. Trainingwas absurdlycursory;few studiesincludedmore than a day

or two of intervention.One might ask, what major cognitivechange
could be accomplishedin a day or two?
2. The form of instructionwas rarelydiscussedas a theoreticalissue, and
in generalit was straightdidacticteaching.
3. The interventionsweretypicallyone-on-one.Littlethoughtwas given
to the social context of learning,or to collaborativecognition, which
is so typicalof everydaylearning,includingthat of the classroomand
the workplace.
To think of learningin this widersensemakeslife moredifficult, and again

introduces nontrivial methodological problems. But changing views of
learningin the 1980shave made that messy leap inevitable.
148 BROWN
READING COMPREHENSION
These changes in learning theory led me down the slippery slope to

educationaltheory and practice.Togetherwith my colleagues, I began a
series of studies concentratingon guided instructionand assessmentin
social contexts (Brownet al., 1983;Brown, Campione,Reeve, Ferrara,&
Palincsar,1991).In this articleI concentrateon the most extensiveof these
programs,reciprocalteaching,a proceduredesignedto foster comprehen-
sion and cognitive monitoringwhile reading (Brown & Palincsar, 1982;
Palincsar & Brown, 1984), solving mathematicsproblems (Campione,
Brown,& Connell, 1988;Reeve, Gordon, Campione,& Brown, 1990),and
learningscience(Brown& Campione, 1990).
The primarywork on reciprocalteachingcenteredon strategicreading.
Although the focus was still theoretical- what was the role of strategies,
metacognition,content, and context on learningand understanding - the
applied value of the work was more apparent. The basic method was
modeled after studies of Socraticor inquiryteaching, and theories about
plausiblereasoning,explanation,and analogy (Brown& Palincsar, 1989;
Collins& Stevens,1982).The procedurewas designedto createa forumfor
externalizingsimplecomprehension-monitoring activitiesand to providea
repetitivestructureto scaffold studentdiscourse.(See Brown& Palincsar,
1989, and Palincsar& Brown, 1984, for details of the procedures.)
What happens in a typical reciprocal teaching session is that the
participantsin a group take turns leading a discussionabout a text. The
participantsdividethemselvesup into a learningleader(adultor child)and
learninglisteners/criticsfor each segmentof text. All groupmembersget a
turn as learning leader, who begins by asking a question and ends by
summarizingthe gist of what has been read. The group rereads and
discussespossibleproblemsof interpretationwhen necessary.Questioning
providesthe impetusto get the discussiongoing. Summarizingat the end of
a periodof discussionhelpsstudentsestablishwherethey are in preparation
for tacklinga new segmentof text. Attemptsto clarifyany comprehension
problemsthat might occur arise opportunistically,and the leadersask for
predictionsabout future content. These four activities-questioning, clar-
ifying, summarizing,and predicting- wereselectedto bolsterthe discussion
becausethey are excellentcomprehension-monitoring devices;for example,
if one cannot summarizewhat one has read, it is an indication that
understandingis not proceedingsmoothlyand that remedialactionis called
for. The strategiesalso providethe repeatablestructurenecessaryto get a
discussion going, a structurethat can be faded out when students are
experiencedin the discoursemode.
Over time our studies of reciprocalteachingof readingcomprehension
became more complex as the method became increasingly incorporated into
the dynamicsof a functioningclassroom. We began workingone-on-one

with childrenwho werereadingunconnectedpassagesin laboratorysettings
(Brown& Palincsar,1982)and progressedto studyingchildrenin groupsin
resourcerooms outside the classroom(Palincsar& Brown, 1984), then to
consideringnaturallyoccurringreadinggroupsin the classroom(Brown&
Palincsar,1989),and finallyto studyingreadingcomprehensiongroupsthat
were fully integratedinto scienceclassrooms(Brown& Campione, 1990).
We began by concentratingon a few constrainedstrategiesand proceeded
to studycomplexexplanation,argument,and discussionforms. Initiallywe
looked at students reading unconnectedpassages, proceededto look at
studentsreadingcoherentcontent(Brownet al., in press;Palincsar,Brown,
& Campione,1990),and we are now observingreadingcomprehensionas it
takes placein social groupswho are reading,discussing,and arguingabout
cohesivematerialthat they have preparedthemselvesover long periodsof
time, thus acquiringownershipof that knowledge.Finally, the students
themselves appropriatedreciprocalteaching as a tool to enhance their
comprehension.Faced with importantmaterialthey had difficulty under-
standing,they would call for a reciprocalteachingsession to enhanceand
monitortheirown comprehension.Similarly,beforeteachingtheirmaterial
to others, they would conductreciprocalteachingsessions on it to ensure
that it was comprehensible.Thus, in our currentclassrooms, reciprocal
teachingsessionsare initiatedby childrenopportunisticallywhenneededto
ensurecomprehension.
COMMUNITIESOF LEARNERS
In our currentwork, reciprocalteaching is only one component of the

designexperimentintendedto encouragedistributedexpertisein a commu-
nity of learners(Brown et al., in press). The settings are intact science
classroomswheresixth-,seventh-,and eighth-gradestudentsareresponsible
for doing collaborativeresearch and sharing their expertise with their
colleagues.
In orderto createa communityof learners,we must set up a classroom
ethos that differs from that found in traditionalclassrooms. The main
differencesare indicatedin Table 1. Although presentedas dichotomous,
the distinctionsshould be regardedas ends of continua. As bald dichoto-
mies, they representstereotypes.
In the traditional classroom, students are seen as relatively passive
receiversof wisdomdispensedfrom teachers,textbooks,or othermedia.In
the intentionallearningclassroom(Brown& Campione,1990;Scardamalia
& Bereiter, 1991), students are encouragedto engage in self-reflective
learning and critical inquiry. They act as researchers responsible to some
150 BROWN
TABLE1
Changes in Classroom Philosophy
IntentionalLearning
Role TraditionalClassroom Environment
Students Passiverecipientsof Studentsas researchers,teachers,
incominginformation and monitorsof progress
Teachers Didacticteaching Guideddiscovery
Classroommanager Modelof activeinquiry
Content Basicliteracycurriculum Thinkingas basic literacy
Lowervs. higherskills
Contentcurriculum Contentcurriculum
Breadth Depth
Fragmented Recurrentthemes
Fragmented Explanatorycoherence
Fact retention Understanding
Computers Drill and practice Tools for intentionalreflection
Programming Learningand collaboration
Assessment Fact retention Knowledgediscoveryand utilization
Traditionaltests Performance
Projects
Portfolio
extent for definingtheirown expertise.Teachers'roles also changedramat-

icallyin that they are expectedto serveas activerole modelsof learningand
as responsiveguidesto students'discoveryprocesses.Theyteach on a need-
to-know basis responsiveto students'needs, ratherthan on a fixed scope
and sequenceschedule,or accordingto an inflexiblelesson plan (Brown&
Campione, 1990, in press). The content of the curriculumthroughwhich
childrenare guided featuresa few recurringthemesratherthan breadthof
coverage,themesthat studentscome to understanddeeplyand recognizeat
increasinglydeeperlevels of explanatorycoherenceand theoreticalgener-
ality. The technologicalenvironmentis not designed to foster drill and
practice or even programming,but to encourage intentional learning,
reflectionand communication.Finally,methodsof assessmentfocus on the
students'ability to discoverand use knowledge,ratherthan just retainit.
On-line dynamic measures of performanceare as important as static
measuresof product(for moredetailsof the actualclassroomactivitiesand
outcomes, see Brown& Campione, 1990, in press;Brownet al., in press).
In order to foster a community of learnersthat features students as
designers of their own learning, we encourage students to be partially
responsiblefor creatingtheir own curriculum.The two major forms of
cooperative learning used to accomplish this are the Jigsaw method
(Aronson, 1978)and reciprocalteaching.Studentsare assignedcurriculum
themes (e.g., animal defense mechanisms,endangeredspecies, changing

populations, food chains, etc.) each divided into five subtopics (e.g.,
changingpopulations = extinct, endangered,artificial, assisted, and ur-
banizedpopulations;or food chains = producing,consuming,recycling,
distributing,and energy exchange). Students form five researchgroups,
each assignedresponsibilityfor one of the five subtopics. These research
groups prepareteaching materials using state-of-the-artcomputer tech-
nology (Campione, Brown, & Jay, in press). Then, using the Jigsaw
method, the studentsregroupinto learninggroupsin whicheach studentis
expertin one subtopic,holding 5sof the information.Each one-fifth needs
to be combined with the remainingfifths to make a whole unit, hence
"jigsaw."The experton each subtopicis responsiblefor guidingreciprocal
teachingseminarsin his or her area.Thus, the choiceof a discussionleader
is now based on expertiseratherthan randomselection,as was the case in
the originalreciprocalteachingwork. All childrenin a learninggroup are
experton one partof the material,teachit to others,and preparequestions
for the test that all will take on the complete unit. During this cyclical
process, students both acquire content expertiseand learn how to learn
from texts and other media (Brown& Campione, 1990).
The studentsare involved in (a) extensivereadingin order to research
theirtopic, (b) writingand revisionto producebookletsfrom whichto teach
and to publishin-classbooks coveringthe entiretopic, and (c) computeruse
to publish, illustrate,and edit their booklets. In addition, a great deal of
cognitive monitoring must take place in order for the students to set
prioritiesconcerningwhat to includein theirbooks, what to teach, what to
test, how to explainmechanisms,and so forth. They read, write, discuss,
revise, set priorities,and use computersall in the serviceof learning.And
they generatedata!
I do not have spaceto describethe multipletypes of data we gather.It is
sufficient to say that in additionto relativelystandardoutcome measures
involvingreading,writing,content knowledge,and computercompetence,
all of which improve significantly(Brown & Campione, in press), this
projectgeneratesa vast amountof informationthat is not readilysubjected
to standard measurementdevices. We collect transcriptsof children's
planning, revising, and teaching sessions. We collect observations of
teachers'coachingand responsiveteaching, as well as their directinstruc-
tion. We haverecordsof studentportfolios,includingindividualand group
long-termprojectsthat requirethem to exploitaccumulatingknowledgein
novel ways. We score electronic mail queries to peers, teachers, and
collaboratorsin the universitycommunity.Ethnographicobservationsof
cooperative,or not so cooperative,interactions,such as groupdiscussions,
planning sessions, help seeking, peer tutoring and so forth are taken
152 BROWN
routinely, together with extensivevideo and audio taping of individuals,

groups, and whole classroomsettings.In fact, we have no room to storeall
the data, let alone time to score it.
In the remainderof this articleI concentrateon the methodologicalissues
that arise when one attemptsto situate a researchagenda in a classroom
settingthat one is simultaneouslyinvolvedin designingunderconditionsof
continuousflux.
METHODOLOGICALISSUES IN EVALUATING
COMPLEXINTERVENTIONSTUDIES
In this section, I considersome of the persistentproblemsassociatedwith

attemptsto assess conceptualchange in situ. I will concentrateon three
main methodologicalissues: (a) The relationshipbetweenlaboratoryand
classroom work, (b) Idiographicversus nomothetic approaches, or the
grain-size issue, and (c) The Bartlett Effect, or the problem of data
selection.
Classroom Versus Laboratory Contexts
The first issue I addressis the tensionbetweenand relativecontributionsof

classroom and laboratorystudies. For most of my professionalcareer, I
have studied childrenlearningin laboratorysettings, usually in isolation
from other children.Graduallyover the years I have increasinglysituated
my studyof learningin classrooms,firstin suchlab-likesettingsas pull-out
time (for readinggroups, etc.), then in socially sanctionedsettingsin the
classroom (reading group), and finally orchestrating,some might say
disrupting,the entireclassroomactivityfor at least one houra day. Making
this shift involvesan increasingtrade-offbetweenexperimentalcontroland
richness and reality. The classroom is not the natural habitat of many
experimentalpsychologists, and our methods did not evolve to capture
learningin situ. Indeed, the first grant proposal I ever had rejectedwas
about 10 yearsago, when anonymousreviewersaccusedme of abandoning
my experimentaltrainingand conducting"Pseudo-experimental researchin
quasi-naturalisticsettings!" This was not a flatteringdescriptionof what I
took to be microgenetic/observational studiesof learningin the classroom.
As a personalresearchstrategy,I find that switchingbackand forth from
both types of researchsettingsenrichesmy understandingof a particular
phenomenon.I give but one example.I have studiedchildren'sanalogical
reasoning and explanation strategies under tight experimentalcontrol
(Brown & Kane, 1988) and as it occurs spontaneously in classroom
DESIGN EXPERIMENTS 153
discussions(Brownet al., in press).My observationsgo in both directions;

my laboratorywork informsmy classroomobservations- and vice versa.
My laboratoryexperiencesenable me to see a developmentalpattern
emerging in classroom dialogues similar to that found in laboratory
problemsolving;for example,there is a progressionfrom merelynoticing
the occurrenceof analogiesto productiveuse of analogyto solve problems
(Brown& Kane, 1988;Gick& Holyoak, 1980).Similarly,thereis a familiar
progressionfrom using surfacesimilaritiesto a relianceon deep analogies
(Gentner, 1983, 1989). For example, childreninitially notice the analogy
betweentwo passageswherethe protagonists(ladybugand lacewing)look
alike, are both insects,and are examplesof a commontheme, for example,
naturalpest control. Next they notice the deep similaritybetweena crested
rat and a viscountbutterfly(verydissimilaranimalswho are examplesof a
commontheme, visualmimicry).Similarly,childreninitiallythink the best
analogy between a car and a human body is the eyes and the headlights
(surface)whereaslatertheythinkthat a betteranalogyis betweenthe engine
and the heart (deep). With increasingknowledgechildrenprogressfrom
acceptingsuperficialanalogyto usingdeep analogyto explainmechanisms.
They question initiallyacceptablesurface analogies(such as "plantstems
are like straws")and come to prefer analogies based on deeper under-
standingof underlyingbiologicalmechanisms("plantsare food factories").
This progression, reflecting the increasinglycoherent and mechanistic
nature of their biological theories, is shown in Figure2. Although in the
laboratorythis developmentfrom noticing to using, and from surfaceto
deep, was thoughtto be age-dependent,the classroomwork suggeststhat
the shift is knowledge-based,occurringmicrogeneticallywithin a year as
readilyas cross-sectionallyacross severalyears.
Classroomwork can motivate laboratorypracticealso: Trendsdiscov-
ered in spontaneousclassroomdiscussionscan be tested in the laboratory
under more controlled conditions. For example, analysis of classroom
dialoguessuggeststhat the conditionsof spontaneoususe of explanation
may be developmentallysensitive.First, impasse-drivenexplanationoccurs
in the face of breakdownsin comprehension.This is followedby the use of
explanationto help resolveannoyinginconsistencies.Finally, spontaneous
explanationis used in the absence of comprehensionfailure or obvious
inconsistencies as learners continually revise and deepen their under-
standingof complex causal mechanisms.This microgeneticprogressionis
shown in Figure3.
Facedwith this apparenttrendin the spontaneousdialogues,our routine
procedureis to set up controlledlaboratorystudiesto evaluatewhetherthe
developmentaltrend can be reproducedunderexperimentalcontrol. Simi-
larly, facedwith an experimentalfindingin the laboratory,we are sensitized
to watch for its occurrence in the morass of classroom discourse. This
154 BROWN
Productionof Analogyin Discourse
100.
80,
SQ
a.
E 60
a
x
us
w
Simple
c. Advanced
- 40
o
o
2
40
0
0
Unit 1 Unit 2 Unit 3
9 months
FIGURE 2 Production of analogy in spontaneous discourse of 5th and 6th graders

during reciprocal teaching.
cross-fertilizationbetweensettingsenrichesour understandingof the devel-

opmentalphenomenonin question.I wouldlike to point out, however,that
I regardneitheraspectof the workas basicor applied.Theoreticaladvances
can emergefrom both the laboratoryand classroomsettings.They are just
that, differentsettingswhose featuresmustbe includedin the descriptionof
the data they produce.
IdiographicVersus Nomothetic Approaches
One of the major decisionspsychologistsmust make is to decide on the

"grainsize"of the phenomenathey wishto observe.The classicdistinctions
are laid out neatlyin introductorytextbooks.Developmentalpsychologists
are informedof the strengthsand weaknessesof severalsuch options. First
thereis the choice betweenan idiographicand a nomotheticapproach,that
is, few or many. Does one wantto studya "singlevariablein manysubjects
for the purposeof discoveringgenerallaws or principlesof behavior"or
Explanations
80 -
Impasse Driven
CL
i-
Resolving Inconsistencies
Deeper Mechanisms
60
40 "
0
0 0
0 20
0.
0
0.
Unit 1 Unit 2 Unit 3
9 Months
FIGURE3 Productionof explanationin the spontaneousdiscourseof 5th and 6th

gradersduringreciprocalteaching.
adopt the idiographicapproach,"thethorough study of individualcases,

with emphasison each subject'scharacteristictraits."
Althoughnomotheticstudiesare moretypicalin developmentalpsychol-
ogy, case studieshave an honorablehistorytoo, particularlyin the studyof
very young children.The most famous of these studiesis Darwin's(1877)
delightfuldaily recordof his eldestson. "So-called"diarystudiesrecording
infant developmentwerepopularat the turn of the century(Kessen,1965);
primarilyconductedby mothers, they generateda great deal of the early
languagedevelopmentdata. More famous was the work of fathers, with
Piaget (1952) studying his three children, Laurent, Lucienne, and
Jacqueline,following the lead of Binet's(Binet, 1903)observationsof his
daughters, Madeleine and Alice. Large scale nomothetic studies were
largelyan Americaninvention.
The second major variablehas to do with change over time, a critical
elementfor all learningtheoristsand especiallyfor developmentalpsychol-
ogists. One majoroption is betweencross-sectionaldesigns,wheredata are
taken from, say, 7-, 10-, and 14-year-olds,and inferencesabout develop-
156 BROWN
mentaltrajectoriesmade;and longitudinaldesigns,where,for example,all

4-year-oldswho watched "SesameStreet"in 1969, 1971, and 1973 are
followed up for a period of years. Another option is the microgenetic
design, an importanttool in the psychologist'skit that is receivingrenewed
interest. Here childrenare observedover a relativelyshort period of time
(days, weeks) as they acquire a certain form of understanding.This
approachis most often takenwith veryyoungchildrenwho are at a stageof
rapid learning (DeLoache, Brown, & Kane, 1985) but also with older
children learning a particular scientific concept (Karmiloff-Smith&
Inhelder,1974-1975;Kuhn, Amsel, & O'Loughlin,1988).
The thirdmajordecisionis the type of analysisconducted,qualitativeor
quantitative.This decisionis often partiallydictatedby the prior choices:
Complex quantitative analyses are more often applied to large scale
longitudinal and cross-sectionaldata bases but not always. Although
theoristsas clever as Binet, Piaget, and Skinnernever used statistics,it is
perfectlypossibleto subjectcase studiesto statisticalanalysisif one should
choose to do so.
Increasingly,I find that in the interest of convergingoperations, and
because of the multifacetednature of my data base, I prefer a mixed
approach, suiting the method to the particulardata. I mix and match
qualitativeand quantitativemethodologiesin orderto describethe phenom-
ena, a mixturethat is becomingcommonplacein the journals,reflectingthe
increasinglycomplex issues that psychologistsnow address. In my own
work, I routinelycombine a concentrationon large scale data bases with
in-depthmicrogeneticanalysesof a few childrenor perhapsa group(Brown
et al., in press;Campione,Rutherford,Gordon,& Brown,in press).At any
one stagemy laboratoryis engagedin studyingthreeto four classroomsin
depthfor a periodof at least a year(a 3-yearlongitudinalstudyis currently
underway).This means that typicallywe are workingwith approximately
100 studentsper year. For each student,we generatean enormousamount
of data. Ourroutinepracticeis to take fairlytraditionalpretestand posttest
data from all the experimentaland control studentsand combinethat with
in-depth analyses of some of the students, complementedwith a few
selectedcase studies.For example,in the initialstudyof reciprocalteaching
(Palincsar& Brown, 1984),we providedpretestand posttestdata on the 37
participatingstudents, mini case studies on six children, together with
transcriptsfrom two childrenwho differed considerablyin how quickly
they picked up the reciprocalteaching procedure,a precedentwe have
followed in more recent work (Brown & Palincsar, 1987, 1989). This
approachenablesus to see the magnitudeof the effect in termsof outcome
measures and to get a feel for the phenomenonitself by looking at a
particularchild or groupin depth. (Brownet al., in press;Campioneet al.,
in press).
To givejustoneexampleof thismixedapproach,considerourtreatment
of contentknowledge acquisition. Wegiveshortanswerquizzeson a pretest
andposttestbasison all unitsof coveredcontent.An exampleof suchdata
is shownin Figure4. Herethe performance of approximately 60 fifth-and
sixth-grade children in the experimental researchclassrooms is compared
witha partialcontrolgroup(PCont)thatwastreatedexactlythesameasthe
researchclassroomforthe firstsemester(Unit1), andthentaughtenviron-
mentalscienceby theirregularscienceteacher.Thepartialcontrolgroup
hadexactlythesameaccessto books,videos,computers, andso forthasdid
theresearch classrooms. As canbe seen,thetwogroupsdidnotdifferfrom
eachotherin Unit 1, wheretheywere,in effect,bothexperimental groups,
treatedexactlythe same;but the childrenin the researchclassrooms
outperformed thepartialcontrolgroupon bothUnits2 and3. A readonly
control(ROC)group,whoreadthe keymaterialsbutdidno research,did
poorlythroughout.This traditionaluse of staticpretestsand posttests,
combinedwithappropriate controldata,providesus withclearevidenceof
the effectivenessof the interventionand is easy to sharewith school
personnelas wellas fellowscientists.
A richerpictureof knowledgeacquisition,however,comesfrommore
in-depthprobingof a subsetof students.Forexample,weconsiderstudents'
responsesto clinicalinterviewsandtransfertests. Studentsdifferin their
levelof understanding andin theconfidence withwhichtheyholdopinions;
in the clinicalinterviewit is the skilledinterviewer'sjob to posequestions
that are sensitiveto the bandwidthof competencewithinwhicheach
individualstudentcan navigate(Brown& Reeve,1987).Thisis a formof
dynamicassessment(Campione& Brown, 1987, 1990)that we use to
measureemergent competence in general(seeCampioneet al., in press,for
dynamic assessment of computeruse in the communityof learners).The
point is to map the zone of proximaldevelopment (Vygotsky,1978),the
region competence a studentcantraversewithandwithoutaid.
of that
To mapthiswindowof opportunity for learning,theinterviewer raisesa
seriesof keyquestionsconcerning, forexample,thefoodchain,adaptation,
and so forth. First,the interviewer elicitsbasic expositoryinformation
(Whatdoesthestudentknowaboutphotosynthesis?). If thestudentcannot
answer,theinterviewer provideshintsandexamplesas necessary to testthe
student'sreadinessto learnthat concept.If, however,the studentseems
initiallyknowledgeable, the experimenter mightquestionthatunderstand-
ingby introducing counterexamples to thestudent's beliefs(Isa mushroom
a plant?Whataboutyeast?),and,againif appropriate, he or shemightask
the studentto engagein thoughtexperimentsthat demandnovel uses of the
information.For example,whena studenthas sortedpicturesperfectlyinto
herbivoresand carnivores,and provideda good descriptionof the differ-
ence, he or she may be asked, "What would happen on the African plain if
KNOWLEDGE
TEST. UNITI
80
POST
70 POST
60
W 50
0
Q 40
30
20
10
RESEARCH PCont FCC
GROUP
KNOWLEDGE
TEST,UNITII
80
PFE
POST
70
60
0
ch 50
40
30
RESEARCH PCont FC
GROUP
KNOWLEDGE
TEST,UNITIII
80 FIGURE4 Pre- and posttest
short answerknowledgetests for
70 POST three curriculumunits of envir-
onmental science. The data are
60 from "RestructuringGradeSchool
w 50 Learning Environmentsto Pro-
mote Scientific Literacy"in Re-
40 structuringLearning:Analysisand
Recommendationsof the Council
30
of Chief State School Officersby
A. L. Brownand J. C. Campione,
20
in press,Washington,DC:Council
10 of Chief State School Officers.
RESEARCH PCont FDC Copyright by Ann L. Brown.
GROUP Adaptedby permission.
158
there were no gazelles or other meat for cheetahsto eat? Could they eat
grain?" Some students are surprisinglyuncertain about this, and may
suggestthat cheetahscould eat grainundercertaincircumstances,although
they would not live happily. Some even entertaina criticalperiodhypoth-
esis-that the cheetah could change if it were forced to eat grain from
infancy, but once it reachedadolescence,it would be too set in its ways to
change.Onlya few invokenotionsof form and function,suchas properties
of the digestive tract, to support the assertion that cheetahs could not
changewithin their life span. These extensionactivitiesof thoughtexperi-
ments and counterexamplesare far more revealingof the currentstate of
students' knowledge than their first unchallengedanswers, which often
providean overly optimisticpictureof their knowledge.
For example, Katy, a sophisticatedseventh grader, gave a textbook-
perfect descriptionof photosynthesisthat would certainlybe taken as an
indicationthat she fully understoodthe basic mechanisms.She was then
asked, "What would happen if there were 30% less sunlight?"Katy's
responseneverincludedthe criticalinformationthat as plants make food
with the sun'senergy,seriousreductionin the availabilityof sunlightwould
disrupt the entire food chain-no sun, no plants, no food! Instead she
concentratedon light to see with:
That would kill off the plants, beetles, and, um, nocturnalthings would be
OK.Thedayturnalthings- snakes,rabbits,hares- wouldbe alright,couldbe
nocturnal.But the dayturnalthingswouldneed sunlightto see- couldn'tfind
theirfoodinthedarkandwouldeventually
starveto death.Hawkswouldalso
die out, but owls are nocturnal-would be able to see at night and, um,
raccoonswould probablybe near the top of the food chain.
Katyclearlyhad not understoodthe basic place of photosynthesisas the

mainspringof life. She can repeat back the mechanismsand form food
chains when asked directly, but she cannot yet reason flexibly with her
newfoundknowledge.
Using these thought experiments,we can track not only retentionof
knowledge, but also how fragile-robustit is and how flexibly it can be
applied.Considerthe followingexcerptsfrom John, a sixthgrader.During
the pretest interview,John mentionedspeed, body size, mouth size, and
tearingteeth as functionalphysicalcharacteristicsof carnivores.He seemed
to have the carnivore/herbivoredistinctiondown pat. But when asked the
cheetahthoughtexperiment(Couldthey surviveif therewereno game?),he
mused, ". .. well I mean if people can, like, are vegetarians, I mean I think
a cheetah could change. . . ." When asked how this might happen, he said,
Well... justto switchoff, but,um,it wouldbeeasierforthemto changeon

to plantsthan for me; if I had been eatingmeat . .. becausetherewouldstill
160 BROWN
be meat aroundfor me to eat, but for them therewouldn'tbe ... so if they

wantedto survive,they'regoing to have to eat grass.
When asked if it would be easier for a baby cheetah to eat grass, his
response was:
Well, if it was a baby, it wouldbe easierbecauseit could eat it ... it would
be rightthere, it wouldjust have to walk a little bit to get it . . . but I think
it would be easier ... but then if it does happenfor a long time, then the
animalscome back [the gazelles return],then the cheetahprobablywould
have lost its speed, becausethey wouldn'thave to run. . . . Yeah, and they'd
get usedto the grassandnot careaboutthe gazelle,becausealongthe linethey
would forget.
During a subsequent interview 6 months later, when asked about

herbivores and carnivores, John made complex analogies to the cow's
intestinal system, and argued that herbivore digestive tracts are more
complicated than those of carnivores. By knowing an animal's diet, he
argued, he would be able to predict its digestive tract length and how long
digestion might take, and vice versa.
This time, when confronted with a variant of the cheetah thought
experiment, John responded, "No ... no, their digestive system isn't good
enough ... it's too uncomplicated to digest grasses and also their teeth
wouldn't be able to chew, so then the grass would overpopulate . . and the
cheetah dies. . . ." When asked if the baby cheetah could survive by eating
grass, John asserted that they would be the first to die.
These responses are in distinct contrast to those given to the same
questions during the pretest. John dropped personification (Hatano &
Inagaki, 1987) as a justification ("humans can do it so cheetahs can too")
and replaced it with a form-function response. Thrown a twist on the old
question - would deer be able to eat meat if there were no longer grass - the
newly confident John favored the interviewer with a broad smile and said,
"Nice try-the digestive tract of the deer is too complicated and also the
teeth wouldn't be able to grind meat."
Students in the research classroom show considerable improvement on
such clinical interviews over the course of a year's study, in contrast to
students taking a more traditional course that touches on the same materials
(the partial control group).
We see the same benefit of being in the research classroom on application
questions, such as "design an animal to fit the following habitat" (desert,
tundra, or rain forest habitat described). Data from two replications of this
test (at 6 months and 1 year) are shown in Figure 5. The sixth graders in the
experimental classroom outperformed control students in the number of
biologically-appropriate mechanisms that they included in their animals
(such as mechanisms of defense, mechanisms of reproductive strategies,
etc.). Over time the experimental subjects introduced more novel (untaught
Use of Biology Principal:HabitatTask

Design an Animal Version
EXPERIMENTAL CONTROL
5
POST1
POST2
4
Co
z
O
w
O
P-u
UJ
z
~Iz
1
TAUGHT TAUGHT/NOVEL NOVEL ALLTYPES
GROUPS
FIGURE5 Use of taughtversusnovel biologicalprinciplesin the designan animal
transfertest.
mechanisms)and more novel variations of mechanismsthey had been

taught. For example, the class had discussedthe notion of mimicryas a
defense mechanism.In a responsescored taught/novel, one student said
that the eggs of his animalwereplacedin a line and the markingsmadethe
eggs look like a full growncobra,a novel use of the mimicryprinciplewhich
to our knowledgedoes not exist in nature.
In a variety of such clinical interviewsand tests of application and
transfer,the researchchildrenoutperformed.control groupsin theirability
to apply biologicalprinciplesto novel tasks, such as designinga habitatto
supportan animal,or predictingsystemicchangesgivenone disruptionin a
food chainor ecosystem,activitiesthat tap theiraccumulatingknowledgein
novel ways (Brown, et al., in press). Responsesto clinical interviewsand
applicationactivities are subjectedto qualitativeas well as quantitative
analysesand tell us a greatdeal about the statusof the child'saccumulating
knowledge and ability to reason on the basis of incompleteknowledge.
162 BROWN
The Bartlett Effect
The third methodologicalissue, which goes beyond that of qualitative

versus quantitativeanalyses, is that of data selection, particularlyacute
when portions of edited transcriptsor clinical interviewsare selected to
illustratea theoreticalpoint, or whendescriptionsof planningsessions,peer
tutoring, or teacher coaching are culled from a vast array of potential
examples.Withaccessto dailyethnographicnotes, teacher'slogs, andvideo
and audio tapes, it is clearthat we must select a very small samplefrom a
large data base, and that selection is obviously going to buttress our
theoreticalstance. This selectionissue is nontrivial.The problemis how to
avoid misrepresentingthe data, howeverunintentionally.
I referto the problemof dataselectionas the BartlettEffect. In his classic
work, Remembering,Bartlett (1932) undertookthe study of systematic
changes in the memorytrace. One method he used, the method of serial
reproduction,examinedsubjects'retentionof folk storiesselectedbecause
of their unusual content, the most famous story being The War of the
Ghosts. One of Bartlett'spredictionswas that changesin the recallof this
story over multiple retellings would initially distort and then convert
unfamiliarcontentinto familiarmaterial.And indeedthis is exactlywhathe
found. However,Bartlettselectedthe recallprotocolsto illustratehis point.
Bartlettviewed the protocols through the eyes of his theory. Subsequent
studies that scored all recalls somewhatmore objectively,although repli-
catingthe mainBartletteffect, found a greatdeal morevarietyin the recalls
than Bartlettwould have liked. It is particularlyironic that Bartlettwas
tryingto proveexactlythis point that we see the worldthroughour existing
knowledgeand beliefs.
This problemof the theoristselectingthose segmentsthat provehis or her
point is endemicin researchthat dependson transcriptsor protocolsculled
from a largedata base. One potentialsolutionis illustratedin Schoenfeld's
(1992)article,wherehe arguesthat the data base and scoringcriteriashould
be made availableto the field. Although we applaud this position, it is
hardlypractical,even in Schoenfeld'scase, for it took his team a consid-
erable period of time to analyze the 7 hours of one subject'slearning
sessions. One can scarcelyexpectthe field to devotethe necessarytime. For
archivalreasons, though, Schoenfeldis right and we continueto keep all
notes, audio and videotapes,and transcriptson file so that our selection
bias can be checkedby a laterindefatigablesleuth, even though we believe
that this is unlikelyto occur. The influencethat the introductionof archival
video recordingswill have on the field has hardlybegun to be felt.
In orderto make my task somewhatmanageable,I now have observers,
ethnographers,teachers, and even childrenindicatingin their field notes
when an interesting interaction occurs, so that we can transcribe "just those
events of interest."This is, of course, selectionbefore selection. But it is a

nontrivialtask to capturethe rich social and intellectuallife of a classroom
with a level of analysisthat would permit one to look at real conceptual
changetaking place over time.
LESSONS LEARNED FROM THE HISTORY

OF EDUCATIONALDESIGN EXPERIMENTS
Settingasidepurelymethodologicalissues,thereis anotherimportantset of
concernsthat face the designerof educationalinterventions.Thereis a long
history of attemptsto reconstructlearningand work environments.Con-
temporaryattempts cannot be viewed in an historical vacuum. In this
sectionI look at lessonsto be learnedfrom some of the most famousdesign
experiments.I referto these as: (a) The HawthorneEffect (Roethlisberger
& Dickson, 1939)-the nature of control in multifacetedinterventions,
(b) The DeweyEffect- the romanticizationof the processof discovery,and
(c) The Reality Principle-the problem of the shelf life of successful
interventions.
The Hawthorne Effect
The HawthorneEffect, as it appearsin standardtexts, refersto the fact that

any interventiontends to have positive effects merely because of the
attentionof the experimentalteam to the subjects'welfare. The infamous
Hawthorneeffect has been dogging my trail for a long time. Reciprocal
teaching'ssuccesshas beencalled"onlya Hawthorneeffect."The successof
my interactiveclassroomsis "merelyan exampleof the Hawthorneeffect."
EverywhereI go I can predictthat someonewill tell me that my resultsare
just a Hawthorneeffect. So just what is the Hawthorneeffect? Received
wisdomtells us that in an experimentconductedat the Hawthorneplant of
WesternElectricin the 1920s, psychologistsexaminedthe workingcondi-
tions of plant workersdoing repetitivetasks.
The major finding quoted in secondarysources is that irrespectiveof
what one does to improveor degradeconditions,productivitygoes up. The
usual example given in secondary sources is variation in light. If light
conditionsimproved,so did productivity;however,when light conditions
weredegraded,productivityalso improved.The standardinterpretationof
these findings is that the mere presenceof a researchteam will lead to
enhancedperformancebecauseof the motivationaleffect of the attention
receivedby the "subjects."
I have nevertakenthe Hawthornecriticismof my workseriouslybecause
of the very specific nature of the improvements obtained. If I were creating
164 BROWN
a true Hawthorneeffect, I would not be able to predictwhichperformance

wouldimprove.But in fact we see a close couplingof the cognitiveactivities
practicedand the type of improvementsshown. In reciprocalteaching,only
certainforms of comprehensionactivitiesincrease;in the students'writing,
only certain forms of structuresevolve, and those are clearly predictable
from the kinds of cognitive practice that the students engage in. For
example,the developmentof hierarchicalstructuresin the students'writing
exactlyparalleledthe developmentof competencein usingthe file structure
on the Macintoshinterface.In orderto find theirnotes on, for instance,the
crestedrat, studentshad to know that this animalis an exampleof animal
defense mechanismsunder which topic they needed to enter the file on
mimicry,knowthat it is necessaryto referto the file on visualmimicry,and
only then would they find the animalin question,one that visuallymimics
a skunk to defend itself. Forcedto organizeinformationinto files within
files withinfiles, the childrenregularlytraceda routethrougha hierarchical
organization,an organizationstructurethat was appropriatedinto their
writing(Brownet al., in press). The very specificityof cognitiveimprove-
ment (i.e., certainpracticedactivitiesresultin predictableimprovementsin
some areas and not in others) sets findings like ours apart from the
Hawthorneeffect. A classic exampleof the extremespecificityof practice
and improvementis Scribnerand Cole's (1981) work on the cognitive
consequencesof various forms of literacyamong the Vai, but there are
many examples of this specificity and predictabilityin the patterns of
improvementin the literature,and whenthey occurthey rule out simplistic
recourseto a Hawthorneeffect.
To satisfy my own curiosity, however, I decidedto revisitthe original
Hawthorne work (see Roethlisberger& Dickson, 1939, for a complete
overviewof the multiple-yearprojectthat began in 1927). The result was
illuminating.First, the famous light experimentwas only one (and one of
the worstdocumented)of approximately35 experimentsinvolvingnot only
repetitiveworkers(relay assemblers,mica splitters)but also foremen and
otherdecisionmakers.And the resultswerenot quiteas simpleas one might
think from readingonly secondarysources.
The studiescan be dividedinto two main types. The first concernedthe
physicalconditionsof work (light, air, hours, rest pauses, etc.) and their
physicaland psychologicaleffects on productivity(fatigue,monotony).The
second set concentratedon the psychologicalconditionsof work, such as
perceivedstatus,and the effects of suchactivitiesas askingworkersto serve
as consultantsor collaborators.To summarizebriefly, the main findings
were:
1. All manipulationsdid not resultin improvement.

2. When improvements occurred, they did so under three general condi-
tions: (a) Workers perceived there to be improvements in the conditions

being manipulated, whether or not this was so; (b) workers perceived the
changes to be in their interest; and (c) workers perceived that they were in
control of their own conditions of work, that is, that they were truly
consultants or coinvestigators in the research endeavor. For example, the
provision of rest pauses was beneficial only if the workers thought that
management had their interest at heart. If the impression was that rest
pauses were provided only as a Machiavellian method of increasing
productivity, improvements were less apparent. The best results occurred
when workers felt that they were consulted about the effects of rest pauses
on both worker satisfaction and productivity.
But note that this illusion of control, or real control, is one of the things
I want to happen in my classrooms. I want students to act as consultants,
to be coinvestigators of their own learning (Scardamalia & Bereiter, 1983),
to take charge of their own learning environment to the extent possible
(Brown, 1985). On the ground that the best defense is a good offense, I
argue that, redefined in this way, the Hawthorne effect is exactly what I am
aiming for in my classrooms. The Hawthorne experiments have a great deal
to say about worker and student responsibility and satisfaction.
The Hawthorne effect also has a great deal to say about learning in
complex social environments. Consider an interesting point: Why was the
intervention perceived to be a failure; why does it serve as a cautionary tale
in secondary sources? Actually, only the experimenters thought it was a
failure. Management was delighted with the improved esprit de corps,
productivity, reduced drop-out rate, and so forth. The experimenters'
feeling of failure is illustrated nicely in the following quotation
(Roethlisberger & Dickson, 1939):
The difficulty, however, went much deeper than the personal feelings of
failureof the investigators.Theywereentertainingtwo incompatiblepointsof
view. On the one hand, they weretryingto maintaina controlledexperiment
in which they could test for the effects of single variableswhile holding all
other factorsconstant.On the otherhand, theyweretryingto createa human
situationwhichremainedunaffectedby theirown activities.It becameevident
that in human situationsnot only was it practicallyimpossibleto keep all
other factors constant, but trying to do so in itself introducedthe biggest
change of all; in other words, the investigatorshad not been studyingan
ordinaryshopsituationbut a sociallycontrivedsituationof theirown making.
With this realization,the inquirychangedits character.No longerwere the
investigatorsinterestedin testing for the effects of single variables.In the
place of a controlled experiment,they substitutedthe notion of a social
situation which needed to be describedand understood as a system of
interdependent elements. (p. 185)
166 BROWN
This reflectsthe majorproblemof tryingto conductdesignexperiments

consistingof many interwovenaspects. Componentsare rarelyisolatable,
the whole reallyis more than the sum of its parts. The learningeffects are
not even simple interactions, but highly interdependentoutcomes of a
complexsocial and cognitiveintervention.And this presentsa methodolog-
ical headache for traditional psychology, allergic as it is to multiply
confoundedexperiments.
To illustratethistensionbetweennormallaboratorypracticeandstudying
learningin situ, take the exampleof groupcompositionin laboratoryversus
classroomstudies. Normal operatingprocedureis to select subjectseither
randomlyor accordingto preexistingcriteriaand, if the latter,to matchthem
closelywiththose who will servein the controlgroup.The experimenteruses
standardizedproceduresand has control over time on task and other per-
tinentvariables.If variablesare aggregated,componentanalysisstudiescan
be conductedto disaggregatethe effects as partof a programmaticresearch
effort. This was the case in the originalreciprocalteachingwork (Palincsar
& Brown, 1984).Studentswereselectedfor the readinginterventionbecause
theymetstringentcriteria,theirdecodingratewasat gradelevelbuttheywere
2 or moreyearsbehindin comprehensionscores.Considerablescreeningwas
needed to generatethese students. Control group students also met the
criteria.The experimenterwas eitherone of the authorsor an experienced
teachercarefullyselectedand trained;care was also taken to ensurethat
everyonefollowedthe procedurecorrectly- time on task was carefullycon-
trolled across experimentaland control groups. When reciprocalteaching
was successful,the relativecontributionsof componentstrategies,proce-
dures, materials,etc. were examinedin a series of follow-up studiesthat
teased the componentsapart (Brown& Palincsar,1987, 1989).
In contrast, subsequentreciprocalteaching studies have taken place in
naturally occurringgroups situated in classroom settings. The students
varied considerablyin their ability, teacherswere no longer experienced
volunteers,and they had less controlof time; proceduralreliabilitybecame
a constant problem. Control groups were difficult to engineerbecauseof
resourcelimitations, and even moral problems. It is difficult to expend
limited resourceson control settings if they involve the testing of whole
classrooms,and thereis the moralproblemof excludingchildrenwho need
the intervention.These "naturalistic"studies are inherentlymultiplycon-
founded, and it would take enormousresourcesto unconfoundthem, even
if this were hypotheticallypossible.
In the light of this problem, consider the design experimentthat my
researchteam is currentlytryingto engineerin the classroom.This includes
effecting basic change in the role of students and teachers, modifying
assessment,introducinga novel curriculum,establishinga technologically
rich environment, setting up cooperative learning situations, establishing a
classroom ethos where individual responsibility and group collaboration are
the norm, and so forth. In short, we intervene in all aspects of the

environment.Our interventionsare deliberatelydesignedto be multiply
confounded. Although I was taught to avoid such messy things like the
plague, I do not see an alternative.Of course one alwaystries to "control
for" the obvious (as illustratedin Figure4). One can effect the classroom
changes without computer support, one can introduce the computer
environmentwithout attendingto the curriculum,and so forth. Giventhe
systemicnatureof the environment,however,simplecontrolscan neverbe
entirely satisfactory;but they can provide insights into the operation of
some of the major variables. Nonetheless, my major defense is that a
"Hawthorneeffect"is what I want: improvedcognitiveproductivityunder
the control of the learners,eventuallywith minimalexpense, and with a
theoreticalrationalefor why things work.
The Dewey Effect
Having attemptedto defend against the accusationof merely creatinga

Hawthorneeffect, let us turn to the next most common criticism,that the
work is merely a recapitulationof Dewey. After all, we have projects,
discoverylearning,child-centeredactivity,readinessto learn- what'snew?
Although it is undoubtedlytrue that the research shares philosophical
commonalitieswith Dewey'sapproach,as it also does with Binet'smental
orthopedics(Binet, 1909; Brown, 1985), Bruner'sseminal writingsabout
education(Bruner,1963, 1969), and the philosophybehindthe curriculum
reformsof the 1960s(Schwab,1963),thereare also some subtleand not so
subtle differences.I illustratethis point by consideringthree interrelated
tenets of Dewey's(1900, 1929)faith: readinessto learn, discoverylearning,
and the curriculumand society.
Readiness to learn. One of the central themes of Dewey's (1929)

philosophyis the notion of readinessto learn,readinessthat is definedboth
in termsof the child'scognitivelevel and his or her currentplacein society.
It has become a truismthat educationalpracticeshould take into consid-
eration the child's existing level of competence,knowledge,and interest.
But Dewey seemed to imply that all education should be situatedin the
child's realm. Although consideringthe child's level is now conventional
wisdom,whenit is honoredit is often used as justificationfor not teaching
something,just as a superficialknowledgeof Piagetianstagetheoryis often
taken as license not to teach because a certainchild has not yet reached
Stage 2, is not yet readyto learnprocessX, Y, or Z.
AlthoughDewey was undoubtedlycorrectwhen he arguedfor diagnosis
of a child's capacities, interests, and habits as a launching point for
instruction, Bruner (1969) also made a telling point when he counterargued
that "a point of departure is not an itinerary" (p. 116).
168 BROWN
It is just as mistakento sacrificethe adultto the childas to sacrificethe child

to the adult. It is sentimentalismto assumethat the teachingof life can be
fitted alwaysto the child'sinterestsjust as it is emptyformalismto force the
child to parrot the formulasof adult society. Interestscan be createdand
stimulated.In this sphereit is not far fromthe truthto say that supplycreates
demand,that the provocationof whatis availablecreatesresponse.One seeks
to equip the child with deeper,more gripping,and subtlerways of knowing
the world and himself. (Bruner,1969, pp. 117-118)
My own work on readiness has been influenced by the Vygotskian notion

of a Zone of Proximal Development (Brown & French, 1979; Brown &
Reeve, 1987; Campione, 1989; Campione & Brown, 1990), the distance
between current levels of comprehension and levels that can be achieved in
collaboration with other people or powerful artifacts. Vygotsky introduced
this developmental theory partially in response to his responsibilities as
director of Moscow's Institute of Defectology (Vygotsky, 1978). Contra-
dicting the prevailing notion (still alive today) that retarded children should
be taught through concrete, simplified means because that is what they are
"ready for," he argued that one should go beyond the current level of
competence, stretch the limits, and take children to the upper boundaries of
their potentiality. Vygotsky championed an introduction to the abstract,
complex academic ideas that do not necessarily make contact with lived
experience. Far from readiness as a notion to block off certain avenues of
learning, Vygotsky argued for measures of readiness that would indicate
upper levels of competence through which a child could navigate with help.
Furthermore, these upper boundaries are not seen as immutable, rather they
are constantly changing as the child becomes independently competent at
preceding levels. This optimistic concept of ever expanding capabilities lies
behind both our teaching and assessment methodologies (Brown,
Campione, Webber, & McGilly, 1992; Campione, 1989). Too often Dewey's
position has been interpreted in the negative, blocking sense rather than the
optimistic, stretching sense.
Discovery learning. A second populartenet of Dewey's(1929)peda-

gogical creed is the concept of discovery learning. The children learn best
when discovering for themselves the verities of life. This notion has been
fully incorporated into constructivist theories of learning, and one can
surely have little critical to say about it. Discovery learning, when success-
ful, has much to recommend it. The motivational benefits of generating
knowledgecannot be overestimated,and the sense of ownershipthat this
creates is a powerful reward. Successful discovery learning is clearly a
desired feature of our communities of learners, where students are encour-
aged to discover, own, and share expertise.
Discovery learning is often contrasted with didactic instruction, and given
that choice, I vote for discovery. There is considerable evidence that
didacticteachingleadsto passivelearning,but by the sametoken, unguided

discovery can be dangeroustoo. Children"discovering"in our biology
classrooms are quite adept at inventing scientific misconceptions. For
example,they readilybecomeLamarckians,believingthat acquiredcharac-
teristicsof individualsare passedon and that all things exist for a purpose
(teleological reasoning). They overdeterminecause, thus blinding them-
selves to essentialnotions of randomness,spontaneity,and chance.
Although it is commonplacefor teachersto be called upon to foster
"discovery,"the role of the teacherin discoverylearningclassroomsis still
largelyuncharted.Invokingcomfortablemetaphorssuch as the teacheras
coach does not tell us how and whenthe teachershouldintervene.We know
that challenging students' assumptions, providing them with counter-
examplesto their own rules, and so forth are good instructionalploys. But
how intrusiveshouldteachersbe, whenshouldthey guide,whenshouldthey
teach? When should they leave well enough alone? In short, how can
teachersfoster discoveryand at the same time furnishguidance?
We believe that the middle ground of guided discovery is the most
appropriatefor our classrooms, but this role is difficult to maintain.
Considerthe positionof teachersthat know somethingthat the studentsdo
not. They are in the position of makinga judgmentcall about whetherto
interveneor not: How long should they let the students flounder?They
must decide whetherthe problem surroundsan importantprinciplethat
needs work or involves only a trivialerrorthat they can let pass for now.
Considerthe case of teacherswho do not know the answer, or who may
sharethe students'puzzlementor misconception.In this case they are first
required to recognize this fact (which might not be easy) and, after
admittingpuzzlementor confusion, find waysto remedyit, for exampleby
seeking help. This is not an easy role for many teachers; it demands
competence and confidence. The provision to our classrooms of an
electronicmail systemthat links the teachersand studentsto a community
of scholars(biologists, etc.) helps teachershandle the lack of knowledge
problem.
Guidedlearningis easierto talk aboutthan do. It takesclinicaljudgment
to know when to intervene.Successfulteachersmust engagecontinuallyin
on-line diagnosis of student understanding.They must be sensitive to
overlappingcurrentzones of proximaldevelopment,wherestudentsareripe
for new learning.Guideddiscoveryplaces a great deal of responsibilityin
the hands of teachers,who must model, foster, and guide the "discovery"
processinto formsof disciplinedinquirythat wouldnot be reachedwithout
expertguidance.
The curriculum and society. Finally,Dewey (1929)stressedthe need

to situate curriculum activity in the lives of children. Curricula should
reflect the child's lived experience and provide continuity with the family
170 BROWN
and communitylife. The idea was that if childrenwereinvolvedin projects

wherethey ran farms, built treehouses,and so forth, they would learnnot
only mathematics, for example, but they would understandbetter the
purpose of mathematics.Project type interventionswere popular for a
while until the inevitableconservativebacklash(Cremin,1961)and we are
seeing a great deal of renewed interest today. Indeed, Dewey himself
denounceda greatdeal of project-typework purportedlyconductedin the
spiritof his Chicagoschool innovation(Dewey, 1931;Kliebard,1987).
The spiritof Dewey'spositionwas intendedto counteractthe isolationof
much of school learningfrom the familiarhabits of childhoodon the one
hand, and adultoccupationson the other (Cole & Bruner,1971);however,
there is another side to the story. Schools came into existenceto expose
childrento knowledgeoutside the realm of lived experience,an exposure
rarelypossible in preliteratesociety. As Bruner(1969) argued, to under-
stand somethingis to relinquishpriorways of thinking,and he is skeptical
about the practice of situating learning exclusively in the child's lived
experienceand social activities.For example,he arguedthat:
The significanceof the concept of commutativityin mathematicsdoes not
derivefrom the social insightthat two houseswith fourteenpeoplein each is
not the sameas fourteenhouseswith two peoplein each. Ratherit inheresin
the powerof the idea to createa way of thinkingabout numberthat is lithe
and beautiful and immensely generative.. ... Without the idea of
commutativity,algebrawould be impossible. If set theory ... had to be
justified in terms of its relationto immediateexperienceand social life, it
would not be worth teaching. Yet set theory lays a foundation for the
understandingof order and numberthat could never be achievedwith the
social arithmeticof interest rates and bales of hay at so much per bale.
(Bruner,1969, p. 121)
"Litheand beautifuland immenselygenerative"ideas are assuredlywhat

one is searching for as a basis for a curriculum. And it is unreasonable to
expect childrento reinventpivotal ideas for themselves.Providingexpert
guidancein the form of teachers,or books and otherartifacts,is one of the
primeresponsibilitiesof schooling.Immenselygenerativeideasmaybe few,
and the idea behindeducationis to point childrenin the rightdirectionso
that they may discoverand rediscovertheseideascontinuously.This notion
of a spiralingcurriculum(Bruner,1969)is one that we try to embodyin our
classrooms.Centralthemesareintroducedby the teacherearlyand revisited
often. In the environmentalscienceclassroom,these themesmight include
notions of interdependenceand balance,competitionand cooperation,and
adaptationthat are centralto an understandingof ecosystems.
Coupledwiththeserecurrentthemesarehabitsof mindby whichchildren
are encouraged to extrapolate, refine, and use these underlying themes so
DESIGN
EXPERIMENTS171
that they can discover new commonalties for themselves. As Bruner argued
(1969), education:
should be an invitationto generalize,to extrapolate,to make a tentative

intuitiveleap, evento builda tentativetheory.The leap frommerelearningto
using what one has learnedin thinkingis an essentialstep in the use of the
mind. Indeed, plausibleguessing, the use of the heuristichunch, the best
employment of necessarily insufficientevidence-theseare the activitiesin
whichthe child needspracticeand guidance.They are amongthe great
antidotesto passivity.(p. 124)
Note that this is not untrammeleddiscoverylearning,but learningclearly

underthe expertguidanceof a gifted teacher.
Althoughrecognizingan intellectualdebt to Dewey, it is also true that a
greatdeal of theoreticaland methodologicalworkis still neededif we areto
renderthese ideal educationaltypes into anythingclose to reality. Collins
(in press) argued that educational theorists (Plato, Rousseau, Dewey,
Bruner,and Illich) have addressedthemselvesto the processof education
for many years. The contemporaryagendais for experimentaliststo fulfill
this legacy with concrete design experiments. Such experimentsmust
providea level of descriptionthat wouldafford the opportunityto uncover
mechanismsof learning that are not capturedin preexistingtheoretical
descriptions.So in responseto the criticism,"It'sall Dewey, what'snew?"
the answeris, nothingand everything.
The Reality Principle
The final criticalissue concerningeducationalreformI addressis whether

or not it has any lasting effect. In the light of the poor long-termtrack
record of the laboratory schools of Binet, Dewey, and others, one must
considerthe shelf life of educationalinterventionswarily.Receivedwisdom
tells us that such innovations as the Dewey school and the science
curriculumreforms of the 1960s were experimentsthat were tried and
failed. But it is not clear what was meant by tried or failed, nor is it clear
what the criteriawere againstwhich successor failureis measured(Dow,
1991; Shymansky,1989). Historianstend to be pessimisticabout educa-
tional reform(Cuban, 1984, 1986, 1990),an attitudeof "laplus ga change"
(Cohen, 1989)prevails.The argumentis that successfulinterventionsare a
chimeraor at least are extremelyfleetingand fragile,not readilytransport-
able to settingsoutsidethe innovator'scontrol. Becauseof this skepticism,
it is extremelyimportantfor the design experimenterto considerdissemi-
nation issues. It is not sufficientto arguethat a reasonableend point is an
existence proof, although this is indeed an important first step.
172 BROWN
In this light, the long-term track record of reciprocal teaching is

encouraging;the method enjoys widespreaddisseminationcomparedwith
other innovativeinstructionaldesigns with which I have been involved. I
believe that the reasons for this are (a) it looks deceptively easy to
implement, (b) the term reciprocal teaching has been picked up by
researchers,teachers,and textbook publishersand has become part of the
discourseof the readingcommunity,and (c) most important,the procedure
slots neatlyinto a hallowedclassroomniche(Cuban, 1984).Readinggroup
has a long-standingplace in the school day- teachersare used to arranging
their classes to accommodatereadinggroup; all (all!) that is needed is to
redefinethe activitiesthat take place in a sociallysanctionedniche.
The prognosis for the widespreaddisseminationof communities of
learningis pessimistic.The desiredparticipantstructuresof this program
(Brownet al., in press)would requirefundamentalchangesin the roles of
both studentsand teachers, disrupting"practiceas usual," and we know
that historicallyteachershave been resistantto such disruptions.
To say the least, it is a cautionarynote for contemporarydesignersthat
Dewey (1901)a centuryago warnedthat educationalreformwould not be
easy to engineer.Dewey'sdescriptionof cyclesof innovationand resistance
sounds uncannily like Cuban's (1984, 1990) contemporaryCassandra
bulletins.Firstcomes unrestconcerningthe schools and how they operate,
followedby ferventclaimsand promisesfromreformers.Intensiveresearch
by the convertedis then carriedout in a small set of classroomsrich with
human and, today, technological resources. "The victory is won and
everybody- unlessit is somealreadyoverburdenedand distractedteacher-
congratulateseverybody else that such advanced steps can be taken"
(Dewey, 1901, p. 334). But then come the frustratedattemptsby ordinary
teachersto adopt the new methodsin the absenceof support,followed by
the inevitabledeclinein use, andthe eventualabandonmentof the program.
As Dewey argued,"withina shorttime, complaintsare heardthat children
do not readas well,""ora publicoutcrycallsfor the reformsto be rescinded
in favor of the status quo." One major question facing contemporary
designersis how to avoid repeatingthe Cuban-Deweycycle: exhilaration,
followed by scientificcredibility,followed by disappointmentand blame.
I see the problemas analogousto the alpha, beta, and gammaphasesof
software development.The alpha, or developmental,phase is under the
control of the advocate,and by definitionit must work for thereto be any
later phases. It works, though, under ideal supportiveconditions. Next
comes the beta phase, tryouts at carefullychosen sites with less, but still
considerable,support. Criticalis the gamma stage, widespreadadoption
with minimalsupport. If this stage is not attempted,the shelf life of any
interventionmust be called into question.
My classroom research agenda now is far less that of the frustrated
tryingto unconfoundvariablesfor explanatory

Hawthorneexperimenter
power, although I still care about control issues. My agenda is more like
that of a designeror engineer.I needto unconfoundvariables,not only for
theoreticalclarity, but also so that necessaryand sufficientaspects of the
interventioncan be disseminated.The question becomes, what are the
absolutelyessential featuresthat must be in place to cause change under
conditionsthat one can reasonablyhope to exist in normalschool settings?
In orderto effect this last stage, I again need additionalexpertise,more
methods if you like, that were not part of my training. I need to know a
great deal more about school restructuring,teachertrainingand support,
and teachersas researchers.I need to use ethnographicmethodologiesto
study which aspects of the programare readily adopted and which are
resisted. I need to know the conditionsfavorablefor adoption. I need to
study the sociology of dissemination.I need to know about public policy
issues, and so forth. Again changes in theory and practice demand
concomitantadvances in methodology for the conduct, documentation,
and disseminationof research.
SEX, LIES, AND VIDEOTAPES
I couch my final commentson methodologyin termsof this facetiousbut

not entirely inappropriatesubtitle. Designers of successful classroom
interventionsmust make sure that they are engaging enough to seduce
childreninto the worldof learning,hence"sex"in the title. Onceensnared,
it may be possibleto guidestudentstowardthe intrinsicrewardsthat follow
from self-initiateddisciplinedinquiry.As a designscientist,it is necessary
to tease apartthe majorfeaturesof enticinglearningenvironments:the role
of teachers,students,and researchers;the actualcontributionof curricula
and computersupport;methodsby which distributedexpertiseand shared
meaningare engineered,and so forth. Thereis a constanttension between
designing an exciting classroom for happy campers and maintaining
researchstandardsof control and prediction.
The excitementof teachersand studentsis palpablewhen it all works,
when they experiencewhat one of my teacherscalls "goldenmoments."
However,in reportingthesemoments,it is importantto realizethatthey are
selected from a streamof events that are not so encouraging.One must
separatethe gold fromthe dross. But how does one representthe gold-dross
ratio?Hence"lies"in the title;thereis a tendencyto romanticizeresearchof
this natureand rest claims of successon a few engaginganecdotesor par-
ticularlyexcitingtranscripts.One of the majormethodologicalproblemsis
to establishmeans of conveyingnot only the selectiveand not necessarily
representative, but also the more important general, reliable, and repeatable.
174 BROWN
Finally, why "videotapes."Severalyearsago I was writinga reportfor a

governmentagencyabout major advancesin developmentalpsychologyin
the precedingdecade,and I wroteto colleaguesaskingfor theircandidates.
One colleaguerepliedwith a singleword:videotapes.My first reactionwas
that he was beingtypicallyfacetious,but on furtherthought,I realizedthat
this response was profound. The advantage that modern designers of
classroominnovationshaveis that theymakevideotaperecordsfor archival
purposes.Dewey, Binet, and other earlierdesignershad to dependon the
selectivereportingof the innovatorand his disciples.But today videotape
recordsservemanyfunctions:Theycan be reviewedin the lightof changing
perspectives.Theycan be usedto look backon the performanceof children
or groups of childrenwho are interestingexamplesof a certainlearning
phenomenonand so forth. Tapes are invaluablefor documentingconcep-
tual change, in teachersas well as students, and they provide a common
data base for discussionand reflectiveaction on the part of teachersand
researchers.Today videotaperecordsof successfulteacherinterventions,
peer tutoring, discussions, plans, insights, and generativeideas can be
preservedso that they can be analyzedsubsequentlyby those with radically
differenttheoreticalspectacles.As Schoenfeld(1992) arguedin this issue,
these recordscan be made availableto the field, availablefor continual
reanalysisas new and more powerfultheoriesof learningand concomitant
methodologiesemerge.
CONCLUSION
In this article, I have describedmy currentattemptsto engage in design

experimentsintendedto transformclassroomsfrom academicwork facto-
ries to learning environmentsthat encourage reflective practice among
students,teachers,and researchers.I tracedbrieflythe historyof my own
progressionfrom the study of laboratorylearningto classroomobserva-
tions and experimentation.The needfor newand complexmethodologiesto
capture the systemic nature of learning, teaching, and assessmentwas
described,as was the need to consider the history of prior attemptsto
reorganizeschool and work environments.
ACKNOWLEDGMENTS
An earlier version of this paper was presentedat the meetings of the
AmericanEducationalResearchAssociation, Chicago, 1991 in a sympo-
sium entitled"Onparadigmsand methods:What do you do whenthe ones
you know don't do what you want them to?"
The preparationof this paperwas supportedby grantsfromthe JamesS.

McDonnelland AndrewW. MellonFoundations,and fromthe EvelynLois
Corey ResearchFund.
I expressmy appreciationto my colleagues, notably Joseph Campione
and AnnemariePalincsar,who arejointly responsiblefor a greatdeal of the
work described in this article, and to Alan Schoenfeld who provided
importantfeedbackon an earlydraftof thisarticle.
REFERENCES
Aronson, E. (1978). The jigsaw classroom. Beverly Hills, CA: Sage.

Atkinson, R. C., & Shiffrin, R. M. (1968). Human memory: A proposed system and its control
processes. In R. W. Spence & J. T. Spence (Eds.), The psychology of learning and
motivation (Vol. 2, pp. 89-195). New York: Academic.
Bartlett, F. C. (1932). Remembering:A study in experimentaland social psychology.
Cambridge: Cambridge University Press.
Bereiter, C., & Scardamalia, M. (1989). Intentional learning as a goal of instruction. In L. B.
Resnick(Ed.), Knowing,learning,and instruction:Essaysin honor of RobertGlaser(pp.
361-392). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Binet, A. (1903). L' Etude experimentali de l'intelligence [The experimental study of
intelligence]. Paris: Schleicher Freres.
Binet, A. (1909). Les idees modernes sur les infants [Modern ideas about children]. Paris:
Ernest Flammarion.
Brown, A. L. (1974). The role of strategic behavior in retardate memory. In N. R. Ellis (Ed.),
Internationalreviewof researchin mentalretardation(Vol. 7, pp. 55-111). New York:
Academic.
Brown, A. L. (1975). The development of memory: Knowing, knowing about knowing, and
knowing how to know. In H. W. Reese (Ed.), Advances in child development and behavior
(Vol. 10, pp. 103-152). New York: Academic.
Brown, A. L. (1977). Development, schooling and the acquisition of knowing about
knowledge. In R. C. Anderson, R. J. Spiro, & W. E. Montague (Eds.), Schooling and the
acquisition of knowledge (pp. 241-253). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Brown, A. L. (1978). Knowing when, where, and how to remember: A problem of
metacognition. In R. Glaser (Ed.), Advances in instructional psychology (Vol. 1, pp.
77-165). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Brown, A. L. (1985). Mental orthopedics: A conversation with Alfred Binet. In S. Chipman,
J. Segal, & R. Glaser(Eds.), Thinkingand learningskills: Currentresearchand open
questions (Vol. 2, pp. 319-337). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Brown, A. L., Ash, D., Rutherford, M., Nakagawa, K., Gordon, A., & Campione, J. C. (in
press). Distributed expertise in the classroom. In G. Salomon (Ed.), Distributed cognitions.
New York: Cambridge University Press.
Brown, A. L., Bransford, J. D., Ferrara, R. A., & Campione, J. C. (1983). Learning,
remembering, and understanding. In J. H. Flavell & E. M. Markman (Eds.), Handbook of
child psychology (4th ed.): Cognitive development (Vol. 3, pp. 77-166). New York: Wiley.
Brown, A. L., & Campione, J. C. (1978). Permissible inferences from cognitive training
studiesin developmentalresearch.QuarterlyNewsletterof the Institutefor Comparative
HumanBehavior,2(3), 46-53.
176 BROWN
Brown, A. L., & Campione, J. C. (1979). The effects of knowledge and experience on the
formation of retrieval plans for studying from texts. In M. M. Gruneberg, P. E. Morris, &
R. N. Sykes (Eds.), Practical Aspects of Memory (pp. 378-384). London: Academic.
Brown, A. L., & Campione, J. C. (1986). Psychological theory and the study of learning
disabilities. American Psychologist, 41(10), 1059-1068.
Brown, A. L., & Campione, J. C. (1990). Communities of learning and thinking, or a context
by any othername. HumanDevelopment,21, 108-125.
Brown, A. L., & Campione, J. C. (in press). Restructuring grade school learning environments
to promotescientificliteracy.In Restructuring
learning:Analysisand recommendations
of
the Council of Chief State School Officers. Washington, DC: Council of Chief State School
Officers.
Brown, A. L., Campione, J. C., Reeve, R. A., Ferrara, R. A., & Palincsar, A. S. (1991).
Interactive learning and individual understanding: The case of reading and mathematics. In
L. T. Landsmann(Ed.), Culture,schoolingandpsychologicaldevelopment(pp. 136-170).
Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Brown, A. L., Campione, J. C., Webber, L. S., & McGilly, K. (1992). Interactive learning
environments - a new look at assessment and instruction. In B. R. Gifford & M. C.
O'Connor (Eds.), Future assessment:Changing views of aptitude, achievementand
instruction (pp. 121-213). Boston: Academic.
Brown, A. L., & French, L. A. (1979). The zone of potential development: Implications for
intelligence testing in the year 2000. Intelligence, 3, 253-271.
Brown, A. L., & Kane, M. J. (1988). Preschool children can learn to transfer: Learning to
learn and learning from example. Cognitive Psychology, 20, 493-523.
Brown, A. L., & Palincsar, A. S. (1982). Inducing strategic learning from texts by means of
informed, self-control training. Topics in Learning and Learning Disabilities, 2(1), 1-17.
Brown, A. L., & Palincsar, A. S. (1987). Reciprocal teaching of comprehension strategies: A
natural history of one program for enhancing learning. In J. D. Day & J. G. Borkowski
(Eds.), Intelligenceand exceptionality:New directionsfor theory,assessment,and instruc-
tional practice (pp. 81-132). Norwood, NJ: Ablex.
Brown, A. L., & Palincsar, A. S. (1989). Guided, cooperative learning and individual
knowledge acquisition. In L. B. Resnick (Ed.), Knowing, learning, and instruction: Essays
in honor of Robert Glaser (pp. 393-451). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Brown, A. L., & Reeve, R. A. (1987). Bandwidths of competence: The role of supportive
contexts in learning and development. In L. S. Liben (Ed.), Development and learning:
Conflict or congruence? (pp. 173-223). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Bruner, J. S. (1963). The process of education. Cambridge, MA: Harvard University Press.
Bruner, J. S. (1969). On knowing: Essays for the left hand. Cambridge, MA: Harvard
University Press.
Campione, J. C. (1989). Assisted assessment: A taxonomy of approaches and an outline of
strengthsand weaknesses.Journalof LearningDisabilities,22, 151-165.
Campione, J. C., & Brown, A. L. (1987). Linking dynamic assessment with school achieve-
ment. In C. S. Lidz (Ed.), Dynamic assessment (pp. 82-115). New York: Guilford.
Campione, J. C., & Brown, A. L. (1990). Guided learning and transfer: Implications for
approaches to assessment. In N. Frederiksen, R. Glaser, A. Lesgold, & M. Shafto (Eds.),
Diagnosticmonitoringof skill and knowledgeacquisition(pp. 141-172). Hillsdale, NJ:
Lawrence Erlbaum Associates, Inc.
Campione, J. C., Brown, A. L., & Connell, M. L. (1988). Metacognition: On the importance
of understanding what you are doing. In R. I. Charles & E. A. Silver (Eds.), Research
agendafor mathematicseducation:The teachingand assessingof mathematicalproblem
solving (pp. 93-114). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Campione, J. C., Brown, A. L., & Jay, M. (in press). Computers in a community of learners.
In E. De Corte, M. Linn, H. Mandl, & L. Verschaffel (Eds.), Computer-based learning
environments and problem solving. (NATO ASI Series. F: Computer and Systems Series).
Berlin: Springer.
Campione, J. C., Rutherford, M., Gordon, A., & Brown, A. L. (in press). Communities of
learners: Zones of development for those at risk. In N. C. Jordan & J. Goldsmith-Phillips
(Eds.), Learningdisabilities:New directionsfor assessmentand intervention.Needham
Heights, MA: Allyn and Bacon.
Chi, M. T. H. (1976). Short-term memory limitations in children: Capacity or processing
deficits?Memoryand Cognition,4, 559-572.
Chi, M. T. H. (1978). Knowledge structures and memory development. In R. S. Siegler (Ed.),
Children's thinking: What develops? (pp. 73-96). Hillsdale, NJ: Lawrence Erlbaum
Associates, Inc.
Cohen, D. (1989). Teaching practice: Plus ga change ... In P. Jackson (Ed.), Contributing to
educationalchange: Perspectiveson researchand practice (pp. 27-84). Berkeley,CA:
McCutchan. (Also published by the National Center for Research on Teacher Education.
Michigan: Michigan State University, 88-3, September 1988.)
Cole, M., & Bruner, J. S. (1971). Cultural differences and inferences about psychological
processes. American Psychologist, 26, 867-876.
Collins, A. (in press). Toward a design science of education. In E. Scanlon & T. O'Shea (Eds.),
New directionsin educationaltechnology.New York:Springer-Verlag.
Collins, A., & Stevens, A. (1982). Goals and strategies of inquiry teachers. In R. Glaser (Ed.),
Advances in instructional psychology (Vol. 2, pp. 65-119). Hillsdale, NJ: Lawrence
Erlbaum Associates, Inc.
Cremin,L. A. (1961). Thetransformation
of theschool:Progressivismin Americaneducation
1876-1957. New York: Knopf.
Cuban, L. (1984). How teacherstaught: Constancyand change in Americanclassrooms,
1890-1980. New York: Longman.
Cuban,L. (1986).Teachersandmachines:Theclassroomof technologysince1920.NewYork:
Teachers College Press.
Cuban, L. (1990). Reforming again, again and again. Educational Researcher, 13, 3-13.
Darwin, C. (1877). A biographical sketch of an infant. Mind, 2, 286-294.
DeLoache, J. S., Brown, A. L., & Kane, M. J. (1985, April). Young children's error correction
procedures in manipulative play. Paper presented at the meeting of the Society for Research
in Child Development, Toronto.
Dewey, J. (1900). The school and society. Chicago: University of Chicago Press.
Dewey, J. (1901). The situation as regards the course of study. Journal of Proceedings and
Addressesof theFortiethAnnualMeetingof theNationalEducationAssociation,332-348.
Dewey, J. (1929). My pedagogical creed. Washington, DC: Progressive Education Association.
Dewey, J. (1931). The way out of educational confusion. Cambridge, MA: Harvard University
Press.
Dow, P. (1991). Schoolhouse politics. Cambridge, MA: Harvard University Press.
Fish, S. (1980). Is there a text in this class? The authorityof interpretivecommunities.
Cambridge, MA: Harvard University Press.
Flavell, J. H. (1971). First discussant's comments: What is memory development the
development of? Human Development, 14, 272-278.
Flavell, J. H., & Wellman, H. M. (1977). Metamemory. In R. V. Kail, Jr. & J. W. Hagen
(Eds.),Perspectiveson thedevelopmentof memoryandcognition(pp. 3-33). Hillsdale,NJ:
Lawrence Erlbaum Associates, Inc.
Gentner, D. (1983). Structure-mapping: A theoretical framework for analogy. Cognitive
Science,7, 155-170.
Gentner, D. (1989). The mechanisms of analogical learning. In S. Vosniadou & A. Ortony
178 BROWN
(Eds.), Similarity and analogical reasoning. Cambridge: Cambridge University Press.

Gick, M. L., & Holyoak, K. J. (1980). Analogical problem solving. Cognitive Psychology, 12,
306-355.
Glaser, R., & Bassok, M. (1989). Learning theory and the study of instruction. Annual Review
of Psychology,40, 631-666.
Hatano, G., & Inagaki, K. (1987). Everyday biology and school biology: How do they
interact?TheNewsletterof the Laboratoryof ComparativeHumanCognition,9, 120-128.
Karmiloff-Smith, A., & Inhelder, B. (1974). If you want to get ahead, get a theory. Cognition,
3, 195-212.
Kessen, W. (1965). The child. New York: Wiley.
Kintsch, W., & van Dijk, T. A. (1978). Toward a model of text comprehension and
production. Psychological Review, 85, 363-394.
Kliebard,H. M. (1987). The strugglefor the Americancurriculum1893-1958.New York:
Routledge.
Kuhn, D., Amsel, E., & O'Loughlin, M. (1988). The development of scientific thinking skills.
New York: Academic.
Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our
capacity for processing information. Psychological Review, 63, 81-97.
Palincsar, A. S., & Brown, A. L. (1984). Reciprocal teaching of comprehension-fostering and
monitoring activities. Cognition and Instruction, 1(2), 117-175.
Palincsar, A. S., Brown, A. L., & Campione, J. C. (1990, April). Discourse as a mechanism
for acquiring process and knowledge. Paper presented at the American Educational
Research Association Conference, Boston.
Piaget, J. (1952). The origins of intelligence in children. New York: International Universities
Press.
Reeve, R. A., Gordon, A., Campione, J. C., & Brown, A. L. (1990, April). Enhancing and
predictingthe mathperformanceof elementaryschool childrenusing reciprocalteaching
and dynamic assessment. Paper presented at the American Educational Research Associa-
tion Conference, Boston.
Roethlisberger, F. S., & Dickson, W. J. (1939). Management and the worker. Cambridge,
MA: Harvard University Press.
Scardamalia, M., & Bereiter, C. (1983). Child as coinvestigator: Helping children gain insight
into their own mental processes. In S. Paris, G. Olson, & H. Stevenson (Eds.), Learning and
motivation in the classroom (pp. 61-82). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
Scardamalia, M., & Bereiter, C. (1991). Higher levels of agency for children in knowledge
building: A challenge for the design of new knowledge media. The Journal of Learning
Sciences,1, 37-68.
Schoenfeld, A. H. (1985). Mathematical problem solving. New York: Academic.
Schwab, J. J. (1963). Biology teacher's handbook. New York: Wiley.
Scribner, S., & Cole, M. (1981). The psychology of literacy. Cambridge, MA: Harvard
University Press.
Shymansky, J. A. (1989). About ESS, SCIS, and SPAP. Science and Children, 7(26), 33-35.
Tulving, E., & Madigan, S. A. (1970). Memory and verbal learning. Annual Review of
Psychology,21, 437-484.
Vygotsky,L. S. (1978).Mind in society: The developmentof higherpsychologicalprocesses
(M. Cole, V. John-Steiner, S. Scribner, & E. Souberman, Eds. and Trans.). Cambridge,
MA: Harvard University Press.
Wellman, H. M. (1983). Metamemory revisited. In M. Chi (Ed.), What is memory develop-
ment the development of? A look after a decade. Human Development (pp. 31-51). Basel,
Switzerland: S. Karger.
Whitehead, A. N. (1916). The aims of education. Address to the British Mathematical Society,
Manchester, England.

Ann Brown

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Ann Brown

Uploaded by

Copyright:

Available Formats

Design Experiments: Theoretical and Methodological Challenges in Creating Complex

Interventions in Classroom Settings

The lion's shareof my currentresearchprogramis devotedto the study of

Requestsfor reprintsshouldbe sentto Ann L. Brown,Universityof California- Berkeley,

FIGURE1 The complexfeaturesof design exper

partof a systemicwhole. Just as it is impossibleto changeone aspectof the

confoundedexperimentsthat are a nightmarefor an experimentalpsychol-

LEARNINGTHEORY:A PERSONAL HISTORY

Strategies, Capacity, and the Training Study

learningbecause of its associationwith behavioristtheories. During this

A major impetus to my early work was a review in which a leading

Content and contexts of learning. I will gloss over the history of

learning in which they were observed. These developmentsalso had a

1. Trainingwas absurdlycursory;few studiesincludedmore than a day

To think of learningin this widersensemakeslife moredifficult, and again

These changes in learning theory led me down the slippery slope to

the dynamicsof a functioningclassroom. We began workingone-on-one

In our currentwork, reciprocalteaching is only one component of the

extent for definingtheirown expertise.Teachers'roles also changedramat-

themes (e.g., animal defense mechanisms,endangeredspecies, changing

routinely, together with extensivevideo and audio taping of individuals,

In this section, I considersome of the persistentproblemsassociatedwith

Classroom Versus Laboratory Contexts

The first issue I addressis the tensionbetweenand relativecontributionsof

discussions(Brownet al., in press).My observationsgo in both directions;

Productionof Analogyin Discourse

Unit 1 Unit 2 Unit 3

FIGURE 2 Production of analogy in spontaneous discourse of 5th and 6th graders

cross-fertilizationbetweensettingsenrichesour understandingof the devel-

IdiographicVersus Nomothetic Approaches

One of the major decisionspsychologistsmust make is to decide on the

FIGURE3 Productionof explanationin the spontaneousdiscourseof 5th and 6th

adopt the idiographicapproach,"thethorough study of individualcases,

mentaltrajectoriesmade;and longitudinaldesigns,where,for example,all

Katyclearlyhad not understoodthe basic place of photosynthesisas the

Well... justto switchoff, but,um,it wouldbeeasierforthemto changeon

be meat aroundfor me to eat, but for them therewouldn'tbe ... so if they

During a subsequent interview 6 months later, when asked about

Use of Biology Principal:HabitatTask

mechanisms)and more novel variations of mechanismsthey had been

The Bartlett Effect

The third methodologicalissue, which goes beyond that of qualitative

events of interest."This is, of course, selectionbefore selection. But it is a

LESSONS LEARNED FROM THE HISTORY

The Hawthorne Effect

The HawthorneEffect, as it appearsin standardtexts, refersto the fact that

a true Hawthorneeffect, I would not be able to predictwhichperformance

1. All manipulationsdid not resultin improvement.

tions: (a) Workers perceived there to be improvements in the conditions

This reflectsthe majorproblemof tryingto conductdesignexperiments

the norm, and so forth. In short, we intervene in all aspects of the

The Dewey Effect

Having attemptedto defend against the accusationof merely creatinga

Readiness to learn. One of the central themes of Dewey's (1929)

It is just as mistakento sacrificethe adultto the childas to sacrificethe child

My own work on readiness has been influenced by the Vygotskian notion

Discovery learning. A second populartenet of Dewey's(1929)peda-

didacticteachingleadsto passivelearning,but by the sametoken, unguided

The curriculum and society. Finally,Dewey (1929)stressedthe need

and communitylife. The idea was that if childrenwereinvolvedin projects

"Litheand beautifuland immenselygenerative"ideas are assuredlywhat

should be an invitationto generalize,to extrapolate,to make a tentative

Note that this is not untrammeleddiscoverylearning,but learningclearly

The Reality Principle