Professional Documents
Culture Documents
Perry CredibilityIncredulityMilgrams 2020
Perry CredibilityIncredulityMilgrams 2020
Perry CredibilityIncredulityMilgrams 2020
Author(s): Gina Perry, Augustine Brannigan, Richard A. Wanner and Henderikus Stam
Source: Social Psychology Quarterly , MARCH 2020, Vol. 83, No. 1 (MARCH 2020), pp. 88-
106
Published by: American Sociological Association
REFERENCES
Linked references are available on JSTOR for this article:
https://www.jstor.org/stable/10.2307/48588966?seq=1&cid=pdf-
reference#references_tab_contents
You may need to log in to JSTOR to access the linked references.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
American Sociological Association is collaborating with JSTOR to digitize, preserve and extend
access to Social Psychology Quarterly
Experiments: A Reanalysis
of an Unpublished Test
Abstract
This article analyzes variations in subject perceptions of pain in Milgram’s obedience experi-
ments and their behavioral consequences. Based on an unpublished study by Milgram’s assis-
tant, Taketo Murata, we report the relationship between the subjects’ belief that the learner
was actually receiving painful electric shocks and their choice of shock level. This archival
material indicates that in 18 of 23 variations of the experiment, the mean levels of shock
for those who fully believed that they were inflicting pain were lower than for subjects who
did not fully believe they were inflicting pain. These data suggest that the perception of
pain inflated subject defiance and that subject skepticism inflated their obedience. This anal-
ysis revises our perception of the classical interpretation of the experiment and its putative
relevance to the explanation of state atrocities, such as the Holocaust. It also raises the issue
of dramaturgical credibility in experiments based on deception. The findings are discussed in
the context of methodological questions about the reliability of Milgram’s questionnaire data
and their broader theoretical relevance.
Keywords
dramaturgical credibility, experimental deception, methodological dilemmas in assessing
obedience, obedience to authority, Stanley Milgram
many current scholarly narratives tend to assistant. These interviews were semi-
accept Milgram’s assumption that we are structured and were designed to probe
all capable of torture and murder at the the subjects’ reactions to their experience
behest of an authority figure (Russell and to try to expose the reasons for their
2018), a conclusion that is transferred behavior. Hollander and Turowetz
uncritically to the general public and pro- (2017) analyzed 117 recordings that
spective students through the textbook were selected from condition 2 (voice feed-
industry (Griggs and Whitehead 2015). back), condition 3 (proximity), condition
In this article, we explore the degree to 20 (women as subjects), condition 23
which subjects believed the experimental (Bridgeport), and condition 24 (intimate
scenario and the influence of such beliefs relationships). From the 117 cases, they
on the experimental results. Central to settled on 91 recordings where the sub-
the claims Milgram made about uncover- jects gave at least one discernible account
ing an innate tendency to follow orders is or explanation of their behavior. Hol-
the assumption that his subjects, under lander and Turowetz’s analysis focused
the guise of a memory test, and presum- on the commonsense reasoning given by
ably hoodwinked by the experimental the subjects to account for their reactions
deception, continued to follow the com- in the immediate aftermath of the experi-
mands of an experimenter to administer ments. The 91 recordings included 46
what were presented as increasingly cases where the subjects had been classi-
painful electric shocks to a person who fied as ‘‘obedient’’ (i.e., completed the
seemed to be suffering. entire escalation of shocks) and 45 cases
defined as ‘‘defiant’’ (i.e., discontinued
HOLLANDER AND TUROWETZ’S the regimen at any point).
ANALYSIS OF THE Among those classified as obedient,
POSTEXPERIMENTAL INTERVIEWS there were four major accounts identified.
In some cases, more than one account was
Milgram’s very first probes regarding the offered for why the subjects continued
effectiveness of his deceptive cover story with the experiment. The most prevalent
occurred in the conversations conducted explanation was that the subjects did not
immediately after the termination of the believe the learner was actually being
experiments. These were designed to harmed. This finding was astonishing in
switch the definition of the situation light of Milgram’s earlier assurances of
away from the bogus learning task to the credibility of the cover story. This
the subjects’ reactions to the administra- was offered in 33 out of the 76 accounts
tion of shocks. Archival materials indi- uncovered. ‘‘The most common ‘obedient’
cate that Milgram systematically audio- explanation is L [learner] not being
taped all cases that were tested through harmed, as 33 (out of 46 total)
some 23 different iterations of the obedi- ‘obedient’-outcome participants used it
ence experiment, although tapes for two at least once (72%)’’ (Hollander and Turo-
key experiments (5 and 6) are unaccount- wetz 2017:660). This was followed by
ably missing. The audiotapes continued accounts of ‘‘following instructions’’ of
after the experiment ended and captured the scientist (27 cases), the ‘‘importance
the reflections of the subjects in an infor- of the experiment’’ (11 cases), and the
mal interview with either the ‘‘scientist,’’ existence of a tacit ‘‘contract’’ after having
John Williams; Milgram himself; or agreed to participate in the experiment
Alan Elms, Milgram’s young research (5 cases).
completed the experiment. That meant he subjects may well have entered ‘‘a pact
had no opportunity to question the major- of ignorance,’’ in which the subject main-
ity of the subjects about their belief in the tains a mask of naivete to safeguard the
cover story prior to administration of the integrity of the experiment. In fact, Hall
questionnaire. In a public debate with (1963) suggested that Milgram ‘‘should
Martin Orne in 1969, Milgram argued get post-experimental interviews to find
that the subjects’ reflections on the exper- out how Ss perceived various features of
iment contained in the questionnaire the experimental condition.’’ Orne and
were a more valid measure of their reac- Holland argued that in social psychologi-
tions than the postexperimental conver- cal experiments using deception, it was
sations because subjects were so ‘‘worked not clear who was deceiving whom. They
up’’ immediately after the experiment questioned the plausibility of the experi-
(Milgram 1969). By the time they ment and gave credit to subjects as ‘‘sen-
received the questionnaire months later, tient’’ beings (1968:285) actively attuned
they were ‘‘more detached’’ and presum- to cues about how they were expected to
ably better able to reflect on their experi- behave and alert to incongruities in the
ence. Describing a typical subject, Mil- experimental setup. This was an opportu-
gram said, ‘‘He’s got time to see his nity for Milgram to reply with a presenta-
participation in perspective. . . maybe he tion of his questionnaire data on what
can speak a little more honestly at that subjects said they believed during the
time. As a matter of fact, you get tremen- experiment. He chose not to do so. Mil-
dous conclusions from the time factor. . . I gram received subject feedback, including
think the subjects were basically honest phone calls and letters, from many former
at that point’’ (Milgram 1969). participants that detailed a wide variety
In total, the results of six items from of triggers for suspicion. The dog-eared
the ten-item questionnaire have been check given to the learner, the cries ema-
published. Milgram (1964) published nating from what sounded like a speaker,
three reports on the theme of subject the impassivity of the experimenter in the
well-being and one on subject skepticism face of the learner’s supposed pain, the
(Milgram 1974). In support of Milgram’s experimenters’ refusal of teachers’ offers
claim that no harm was done, Blass to change places with the learner, suspi-
(2004) published additional data from cions that the laboratory mirror was
two more questionnaire items that asked two-way, and the unlikelihood of Yale
subjects how they felt during the experi- conducting an experiment involving tor-
ment and whether they were bothered ture were variously identified by subjects
by it afterward. The partial and piece-
as incongruent elements in the experi-
meal publication of these data and the
mental setup (Perry, 2013a).
fact that five of the six questionnaire
items have been on the theme of subject
MURATA’S ANALYSIS
well-being has tended to keep the focus
on ethical as opposed to methodological With the questionnaire data complete, in
issues in the obedience experiments. late summer of 1962, Milgram instructed
But methodological misgivings did not his research assistant, Taketo Murata, to
abate. Echoing the NSF, Orne and Hol- take a closer look at the questionnaire
land (1968:285) questioned whether the data on the credibility of the deception
subjects were ‘‘taken in’’ by the experi- and its effect on his subjects. This was
ment and concluded that, without exam- the only time in the project when Milgram
ining this in detail, Milgram and his linked the questionnaire data on an
individual level with the subjects’ behavior means resulted in an equal distribution of
in the experiment. Milgram had obtained positive and negative effects (as assumed
five-level scale responses from his ques- by the null hypothesis). On the evidence
tionnaire concerning the degree of convic- (18 negatives, 3 positives), he rejected the
tion that subjects had during the experi- null hypothesis (Murata 1962).
ment about how real the shocks were,
varying from fully believing to fully not AN EXTENDED ANALYSIS
believing. He asked Murata to compare
The sign test may provide a quick deter-
the degrees of obedience between those
mination of significant differences in
subjects who said they were doubtful that
binomial distributions, but it discards
the shocks were painful and those who
the data where the means are equivalent
were certain they were. The report was
and fails to incorporate other relevant
titled ‘‘Reported Beliefs in Shocks and
data, such as the differences in the exper-
Level of Obedience’’ (Murata 1962). Mur-
imental designs. It is possible, however,
ata (1962:1) wrote, ‘‘The following is a
to provide a robust estimate of any signif-
condition-by-condition analysis to deter-
icant difference between means by incor-
mine whether shock level reached was
porating the number of observations on
affected by the extent to which the subject
which the means in each of the 23 experi-
believed that the learner was actually
receiving shock.’’ Murata listed conditions ments are based. In Table 2, we estimate
1 through 24, although there is no shock two ordinary least squares (OLS) regres-
associated with condition 21. This condi- sion models in which mean number of
tion contained the estimates given by var- shocks constitutes the dependent vari-
ious groups (psychiatrists, college stu- able, weighting the results for the num-
dents, and middle-class adults) of how ber of cases in each experimental condi-
persons would be expected to act in such tion.1 In addition to a bivariate model
an experiment. This condition is omitted with level of belief as a predictor (Model
in our analysis. The report’s main results 1), we added a set of effect-coded varia-
were a comparison of the mean shock lev- bles for the experimental conditions using
els administered by subjects who had the ‘‘intimate relationship’’ condition as
‘‘fully believed’’ (FB) in the reality of the reference category (Model 2). In
shocks versus those classified as having effect, coding the coefficients is inter-
‘‘not fully believed’’ (NFB). The mean preted as deviating from the unweighted
shock levels varied from 1 to 30, represent- grand mean of the dependent variable.
ing the gradations in shock from 15 to 450 The effect of the left-out category
volts in 15-volt increments. Table 1 dis- equals the negative of the sum of all
plays the mean shock level for each group other coefficients.2 We did not include
(FB vs. NFB) as well as the numbers in
1
each experiment. Murata found that in We checked the distribution of the residuals
18 of 23 experiments, those subjects who from these models for normality by graphing
them against a generated normal distribution
fully believed the learner was getting pain- and with a quantile-quantile plot of studentized
ful shocks gave lower levels of shock than residuals against the cumulative normal. Both
subjects who doubted the shocks were plots indicate that the assumption of normality
real. In two conditions, the shock levels is met by these data.
2
were equal; and in three conditions, they For a detailed explanation of effect coding,
see Alkharusi (2012). Unlike indicator (dummy)
were higher. Murata conducted a nonpara- coding, this method involves generating separate
metric sign test to determine whether sub- variables for each value of a categorical variable
tracting the NFB means from the FB by assigning 11, 0, or 21 to each value.
Experimental condition
Condition designation FB mean FB n NFB mean NFB n Sign
Note: FB refers to subjects who ‘‘fully believed’’ in the reality of the shocks; NFB refers to those who had
‘‘not fully believed’’ in it. Sign refers to the difference between the FB mean and the NFB mean.
Level of belief
Did not fully believe (reference category)
Fully believed –2.664 –2.158
Experimental condition
Carte blanche –13.985
Benign experimenter –9.834
Double authority (common man shocked) –9.553
Authority as victim –8.943
Intimate relationship –3.925
Groups for disobedience –3.597
Common-man authority 2 –3.208
Group choice –2.592
Touch proximity –1.618
No experimenter –0.230
Proximity 1.136
Double authority (experimenter shocked) 2.214
Bridgeport replication 2.488
Qualifying volunteer 2.524
Coronary tape: Elges & Tracy 2.790
Experimenter departs 3.069
Common-man authority 4.236
Coronary tape: Williams & McDonough 4.868
Voice feedback 5.693
Groups for obedience 5.791
Women 5.850
Remote feedback 7.706
Nontrigger 9.120
Constant 21.723 28.403
Observations 656 656
R2 0.047 0.968
Note: Regressions weighted by number of cases in each condition. Experimental conditions use effect
coding with conditions ranked on size of effect. All coefficients significant beyond p \ .001 except ‘‘no
experimenter present.’’
Belief dichotomized
Belief high (belief low = reference category) 2.571***
Degree of belief (ordinal)
Belief highest = 1 3.640*
2 2.435
3 2.689
4 1.311
Belief lowest = 5 (reference category)
Observations 618 658
Pseudo R2 0.020 0.024
Likelihood-ratio x2 16.78*** 21.38***
Note: Dichotomized level of belief does not include level 3. Coefficients are odds ratios.
*p \ .05. **p \ .01. ***p \ .000.
less convinced of pain, their defiance a high level of belief that the shocks
decreased and their obedience increased. were real were 2.57 times more likely to
Also, Milgram’s original table obscured be defiant than those who had a low level
the observation that across all conditions, of belief. No matter whether one focuses
the majority—368 out of the 658 subjects on obedience or defiance, the effects of
(56 percent)—were defiant, not obedient. levels of belief are highly significant,
The cross-tabulation in Table 3, in both statistically and substantively.
which the chi-square statistic testing the Model 2 shows that compared to those
association is significant, shows in per- in the baseline category (belief level 5),
centage terms that defiance generally the level of defiance increases by 3.7
declines significantly from the highest times when the subjects fully believed
level of belief in pain to the lowest and, the learner was being shocked compared
conversely, that obedience increases to being certain that the learner was not
with skepticism of pain. It is difficult to being shocked. Even though the effects
identify with precision an effect size on of the intermediate degrees of belief
obedience/defiance across levels of belief are not statistically significant, this pro-
in such a cross-tabulation. In order to vides even more compelling evidence
determine the effect size, we estimated that belief in the reality of the shocks
frequency-weighted logistic regressions strongly affected subjects’ obedience or
to identify changes in defiance across lev- defiance. More generally, this means
els of belief. The models shown in Table 4 that variations in dramaturgical credibil-
report the effects of level of belief and ity resulted in dramatic variations in the
experimental condition. In Model 1, we levels of obedience and defiance.
dichotomized the belief variable used by The effect of level of belief on the num-
Milgram to contrast the top two versus ber of shocks administered, shown in
the bottom two measures of belief, while Table 2, and the strong effect of level of
omitting the middle category, to provide belief on defiance, shown in Table 4, are
an overall estimate of effect of belief on complementary in their implications.
defiance. This resulted in an odds ratio They both indicate that those who were
of 2.57, suggesting that those who had stronger in their belief that the shocks
were real both administered fewer shocks across the 23 experiments, the majority
and were more likely to be defiant. The of subjects perceived that the learner
effect of belief on obedience/defiance, was suffering and defied the experi-
however, is much stronger than the effect menter. Those who were less successfully
on number of shocks. Of course, we can- convinced by the cover story were more
not make a direct comparison, since one obedient.
effect is captured by an odds ratio and There are several caveats that should
the other by an OLS regression coeffi- temper the interpretation of results.
cient. Nevertheless, the effect on number First, while 85 percent of the question-
of shocks, –2.664, while statistically sig- naires on which Murata’s analysis was
nificant, is relatively small compared to based were returned, it is impossible to
the overall mean number of shocks across determine whether there was a system-
all designs, 19.05. The odds ratio associ- atic bias in the return rate and how this
ated with the dichotomous version of might have influenced the results. This
belief, however, indicates that those is unlikely, however, given the high
with a high level of belief were over two- return rate.
and-a-half times more likely to express Second, there are differences in the
defiance, a very large, significant effect. conclusions reached by Murata and Mil-
An even larger effect is observed when gram due in part to the different
we compare those who fully believed approaches to their data analyses. Mur-
that the shocks were real to those who ata employed only two levels of belief
were certain that the learners were not (full belief versus everything else), while
experiencing shocks. Milgram used all five levels of belief,
from full belief to full skepticism. Simi-
DISCUSSION larly, Murata used the actual mean score
levels of shocks, while Milgram used com-
This analysis, like other studies based on plete obedience versus any level of defi-
the Milgram archives, raises critical ance. As a result, the mean difference
questions about the significance and reported by Murata is modest but signifi-
meaning of the obedience studies. Like cant (2.66), and the explained variance is
Hollander and Turowetz’s (2017) analysis modest (4.53 percent). The effect size in
of postexperimental admissions, it sug- Milgram, however, with the larger vari-
gests that both the perception of pain ance in the measure of belief, yields
and skepticism about inflicting pain a highly substantive measure of the
appear to have been significantly associ- impact of belief on defiance/obedience
ated with the behavior of the subjects. reflected in the changing odds ratios
Miller (2014:216) says that ‘‘understand- (from the baseline to the highest level of
ing precisely why different specific indi- belief, an increase to a factor of over 3.6).
viduals in the same experimental condi- There is a third issue, which concerns
tion decide to obey or withdraw remains the very idea of belief, what it represents,
a major unanswered question in the obe- and when and if it can be measured reli-
dience story.’’ The results reported here, ably. This is a more intractable problem.
which control for different individuals in Hollander and Turowetz (2017) take the
the same conditions, suggest that one sig- view that the immediate open-ended
nificant explanation for their behavior experimental conversations probably
was whether they thought the experi- yield relatively reliable and valid meas-
ment was actually painful to anyone. If ures of the subjects’ perceptions of what
we examine the research in its entirety had transpired in the previous hour,
when their memories were vivid. Their influenced the subsequent perception of
conclusions, however, based on a pains- the experiment generally and the
taking methodology, rely on a sample of responses to questions about perceptions
about 10 percent of all the subjects. In of harm in particular (see Turowetz and
contrast, in a public debate with Orne, Hollander 2018). Among the 25 percent
Milgram (1969) argues that the question- of cases that experienced the full post
naire results, which are based on a plural- hoc oral debriefing were the ‘‘women’’ con-
ity of all subjects, are more valid because dition (which had one of the highest levels
the passage of time had allowed the of obedience) and the ‘‘intimate relation-
nerves to cool and the experience to be ship’’ condition (which had one of the low-
put in some perspective. He claims at est). Any differences in belief in pain that
other times, however, that denial of pain might be attached to the debriefing pro-
is not an automatic token of belief, since cess would be significantly confounded
it may reflect a defense mechanism to by differences in the actual experimental
manage shame. In a short preface to Mur- conditions as identified in Table 2.
ata’s (1962) paper, he says that ‘‘probably A separate issue is the potential incon-
some of it is a real relationship and some sistency between the post hoc interviews
of it is defensive,’’ although he never and the later questionnaire data. This
explains how one would establish the dif- remains a thorny problem not only for
ference. This disparity highlights the dif- us but for anyone who wishes to advance
ficulty of drawing definitive conclusions an explanation for differences in subject
from either type of evidence. behavior based on inferences about their
Gibson (2013b) and Gibson et al. mental states, including Milgram. It was
(2017) raise the situated nature of Milgram whose research raised the issue
accounts containing the expressions of of obedience and its responsiveness to
belief and what these utterances achieve perceptions of harm. If his premise was
as speech acts in specific settings. If we correct, then our work has raised an
adopt their perspective, it follows that ironic linkage between belief and obedi-
the social scientist cannot arbitrarily ence that was unexplored by Milgram
privilege one context of utterance and and the obedience literature. One of the
expression over others in the search for potentially redeeming elements of our
evidence of ‘‘real belief.’’ The positions revisiting the Murata paper, and accept-
expressed in the structured question- ing, even provisionally, the validity of
naires may or may not have correlated the questionnaire responses, is that it
with the postexperimental utterances re-centers the issue of responsibility and
offered in explanations for the subjects’ agency that Milgram’s work tended to
choices. This was not measured. One overlook. Milgram’s work appears to
line of inquiry that might cast light on have been in denial of two salient facts.
this would be to probe expressions of On the one side, many subjects were not
belief over time (after the experiment, in effectively hoaxed, as noted by Orne and
a later interview, and in a subsequent Holland (1968) and Baumrind (1964).
questionnaire) to establish consistency, On the other side, when the studies are
but such repeated intrusions would them- taken as an integrated paradigm of
selves be subject to question for inadver- research, the most prevalent event that
tently coaching, if not implanting, recol- Milgram recorded, contrary to what he
lections of experience. reported, was defiance. Even if the quan-
In addition, it remains to be deter- tum of influence presented here is not enor-
mined whether differences in debriefing mous, a significant contributor to subject
defiance appears to be the reported percep- harm was the predominant response in
tion of pain that restrained aggression, obedient subjects and (b) fear of hurting
a factor that the obedience literature has the learner was a most salient account
failed to fully appreciate. for defiance.
On the basis of their analysis of
THEORETICAL RELEVANCE responses to Milgram’s collection of sub-
jects’ comments obtained from the ques-
Our analysis offers little support for the
tionnaire, Haslam et al. (2015) conclude
theory that Milgram’s subjects, despite
that most subjects were ‘‘happy to have
the usual dictates of conscience, became
been of service’’ and ‘‘the majority pro-
inattentive functionaries who inflicted
fessed that they were happy about their
painful shocks against an innocent per-
participation’’ (2015:79). This is an impor-
son indifferent to the resulting conse-
tant foundation for the engaged follower-
quences (see Fenigstein 2015). This per-
ship perspective. How does one resolve
spective is associated with Milgram’s this discrepancy between Murata’s quan-
initial speculation about the agentic state titative analysis of data from the same
and has become largely discredited (Bran- questionnaire? Haslam et al.’s analysis
nigan and Perry 2016). The dominant drew comments from one subset (section
alternative explanation is the engaged 13) of 1,057 transcribed comments that
followership perspective advanced by Milgram had classified into 17 thematic
Haslam and Reicher (2012) in a remark- groups. The 140 comments Haslam et al.
able series of publications that cover the employed had been labeled by Milgram
work of Zimbardo as well as that of Mil- as ‘‘thoughts about the value of having
gram. These scholars suggest that obedi- participated in the research.’’ It is
ent subjects identify with the experi- unclear, however, how many individuals
menter rather than the learner, and made comments that appeared in section
follow the experimenter’s directions, 13, let alone whether they represented
believing themselves to be ‘‘contributing the majority of subjects. In addition, Mil-
to a moral, worthy, and progressive gram notably included section 6, which
cause’’ (Haslam et al. 2015:60). Teachers dealt with ‘‘feelings and suspicions during
administer shocks in the service of this study.’’ This included 131 entries. This
larger cause. Even though they may sus- may have included feelings of suspicion
pect that what they are doing is wrong, that later eventuated in doubt. Again,
they ‘‘believe it to be right’’ (Haslam et al. we have the problem that these are counts
2015:78). Our analysis of the Murata of comments, not individuals, and that
(1962) evidence is largely inconsistent individuals may have made comments
with this conclusion, given that the that ended up in various thematic sec-
majority of subjects defied the ‘‘scientific’’ tions. In our view, relying exclusively on
pressure to conform and did so in section 13 to make the case for engaged
response to the perceived suffering of followership seems to dismiss prema-
the learner. For those who continued, in turely the potential relevance of the
what was at times a highly ambiguous majority of comments. It is also regretta-
and stressful situation, at least some sus- ble that the comments were not linked
pected that there was deception at work systematically to the subjects’ record of
and proceeded on the basis that the obedience, as was the case in Murata’s
learner was not being harmed. This is (1962) employment of questionnaire data.
consistent with the findings of Hollander Our findings suggest a degree of sub-
and Turowetz (2017) that (a) disbelief in ject agency that challenges the power
hierarchy implied by the binary of leader– over and above those arising from the
follower and the emphasis on the attach- prestige of science.
ment to the scientist upon which the The recent exchange between Haslam
engaged followership model is based. It and Reicher (2018) and Turowetz and
opens up a multiplicity of potential moti- Hollander (2018) was preoccupied with
vations for subject behavior heterogeneity obedience. One contribution in this article
in both belief and the situational struc- is to shift emphasis to the 56 percent of
ture. This was the conclusion drawn by subjects who resisted the experimenter
Hollander and Turowetz (2018:307) and exercised a degree of self-control
when they advocated ‘‘a variety of rea- and independence by breaking off. The
sons’’ for persons acting in obedience-con- majority of subjects disobeyed the experi-
straining situations, including a commit- menter, and such resistance was corre-
ment to scientific objectives, that is, lated with a belief that the learner
engaged followership. Turowetz and Hol- seemed to be in pain, among other things.
lander similarly argued that ‘‘no single This suggests that Milgram’s framing of
social psychological process uniquely suf- the experiments as an explanation for
fices to explain [subject] actions but atrocities of the Holocaust has con-
rather that compliance resulted from mul- strained discussion and understanding
tiple processes involving a complex inter- of the subject behavior in terms of aggres-
play of situational forces and individual sion. Our position points in contrast to
dispositions’’ (2018:89). A problem with notions of empathy and altruism as dom-
the engaged followership perspective is inant subject reactions to the experiment.
that acknowledgement of its broad reach,
when widened beyond ‘‘the epistemic cap- PERENNIAL QUESTIONS ABOUT
ital of science’’ (Haslam and Reicher EXPERIMENTATION, MILGRAM, AND
2018:294) to include the attachments THE ARCHIVES
related to religion, professional obliga-
tions, concern with living up to a contract, In the late 1960s and early 1970s, the obe-
and fear of legal liability, tends to lose its dience study was ineluctably drawn into
specific traction as a factor encouraging the so-called crisis in social psychology
obedience. Haslam et al. (2015:67) men- (Stam, Radke, and Lubek 2000). Milgram’s
tion the case of the lawyer who broke off work was challenged in the first instance
because he thought he would be held to for its questionable ethics but later
a higher legal standard. Another individ- attracted doubt as social psychology ques-
ual claimed that his military training tioned the epistemological status of the
made him reluctant to break off since experiments. As the crisis dissipated and
that was not what professional soldiers social psychologists moved on from decep-
are in the habit of doing. Other comments tive experimentation, the obedience experi-
concerned the contractual payment and ments continued to live in the textbooks of
the implied obligation to complete the social psychology and in popular represen-
assignment. In 1961, the average salary tations, and the mythology around the
in Connecticut was about $2.40 per hour, experiments only grew. In the early
so $4.50 for participation and transporta- 2000s, as partial replications were under-
tion was probably a very attractive source taken and the archives became available
of income for many workers (U.S. Bureau to scrutiny by historians, Milgram’s
of the Census 1976:380). By implication, research once again became the focus of
the subjects were pulled by a number of critique (Burger, Girgis, and Manning
strands of prior institutional engagements 2011). Milgram’s was among a vast swath
History of the Behavioral Sciences major work was Beyond the Banality of
36(4):365–82. Evil: Genocide and Criminology (2013),
Turowetz, Jason, and Matthew H. Hollander.
2018. ‘‘From ‘Ridiculous’ to ‘Glad to Have based in large part on his fieldwork on
Helped’: Debriefing News Delivery and the Rwandan genocide. Earlier notable
Improved Reactions to Science in Milgram’s books include The Rise and Fall of Social
‘Obedience’ Experiments.’’’ Social Psychol- Psychology: The Use and Misuse of the
ogy Quarterly 81(1):71–93. https://doi.org/ Experimental Method (2004) and The
10.1177/0190272518759968.
U.S. Bureau of the Census. (1976). Statistical Social Basis of Scientific Discoveries
Abstract of the United States 1976. Wash- (1981).
ington, DC: Government Printing Office.
https://babel.hathitrust.org/cgi/pt?id=mdp Richard A. Wanner is professor emeri-
.39015021301612; view=1up;seq=3. tus in the sociology at the University of
Yale. 2017. Guide to the Stanley Milgram Calgary and academic director of the
Papers. MS 1406, July 1993, revised April Prairie Regional Research Data Centre.
2017. Yale University Library Manuscripts His recent research has focused on public
and Archives. http://hdl.handle.net/10079/
fa/mssa.ms.1406. perceptions of the Olympics, the effects of
immigration on the Canadian economy,
BIOS internal migration, and trends in social
stratification. His work has appeared in
Gina Perry is an Australian writer and many scholarly journals, including the
author of Behind the Shock Machine: Journal of Urban Affairs, International
The Untold Story of the Notorious Mil- Migration Review, Canadian Journal of
gram Psychology Experiments (2012) Sociology, and Sociology.
and The Lost Boys: Inside Muzafer Sher-
Henderikus Stam is a professor of psy-
if’s Robber Cave Experiment (2018). Both
chology at the University of Calgary,
works draw on extensive archival
and an adjunct professor in the Depart-
research and interviews with experimen-
ment of History. His recent work has
tal participants. She completed her PhD
focused on contemporary theoretical
at the University of Melbourne, where
problems in psychology and the historical
she is an associate in the School of Cul-
foundations of twentieth-century psychol-
ture and Communication.
ogy. In 2015 he was honored with the
Augustine Brannigan is professor American Psychological Foundation’s
emeritus of sociology at the University Joseph Gittler Award for his contribu-
of Calgary in Canada. His most recent tions to the philosophy of psychology.