Professional Documents
Culture Documents
Taylor & Francis, Ltd. and American Statistical Association Are Collaborating With JSTOR To Digitize, Preserve and Extend Journal of The American Statistical Association
Taylor & Francis, Ltd. and American Statistical Association Are Collaborating With JSTOR To Digitize, Preserve and Extend Journal of The American Statistical Association
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://www.jstor.org/page/
info/about/policies/terms.jsp
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content
in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship.
For more information about JSTOR, please contact support@jstor.org.
Taylor & Francis, Ltd. and American Statistical Association are collaborating with JSTOR to digitize, preserve and extend
access to Journal of the American Statistical Association.
http://www.jstor.org
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
BenjaminPeirceand theHowlandWil
PAULMEIERand SANDY ZABELL*
The Howlandwillcase is possiblythe earliestinstancein American that the earlier will should be recognized.But this the
law of the use of probabilistic
and statisticalevidence.Identifying
in the signatureofSylviaAnnHowland,Benjamin Executor, Thomas Mandell, declined to do. Not only
30 downstrokes
Peirceattemptedto showthat a contestedsignatureon a willhad did he claim that the later will governed,but he also
been tracedfromanotherand genuinesignature.He arguedthat contendedthat two of the threesignatureson the earlier
theiragreementin all 30 downstrokes
was improbablein the extremeunder a binomialmodel. Peirce supportedhis model by will were tracings from the third, thus providing an
providinga graphicaltest of goodnessof fit.We give a critique independent ground for invalidating that will (see
of Peirce's modeland discussthe use and abuse of the "product Appendix 1).
rule"formultiplying
probabilities
ofindependent
events.
1. INTRODUCTION
On July2, 1865, Sylvia Ann Howland of New Bedford,
Massachusetts, died, leaving an estate of $2,025,000,
and a single heir-her niece Hetty H. Robinson, who
had lived withher forsome years. Two milliondollars is
a respectable sum, even by today's inflatedstandards,
and in 1865 it was certainlya prize worthcontesting.
The Howland will, dated September 1, 1863, and a
codicil of November 18, 1864, left half the estate to a
number of individuals and institutions. It provided
that the residuewas to be held in trustforthe benefitof
Hetty Robinsonand, at the time of Hetty's death, to be
distributedto the lineal descendentsof Hetty's paternal
grandfather,Gideon Howland, Sr. Hetty's father had
died a monthearlier,leaving her $910,000 outrightand
the income froma trust of $5,000,000. AlthoughHetty
Robinson was thus already a wealthy woman, she
claimed a rightto inheritoutrightthe entire estate of
her aunt, providingas evidence forthis claim an earlier
will,dated January11, 1862, whichleftthe entireestate
to Hetty with instructionsthat no later will was to be
honored.
Family feudsare relevanthere,as is a claimed agreement between Hetty and her aunt to leave mutual wills
excludingHetty's father.For our purposes,it is enough
to observe that Hetty Robinson had an arguable claim
* Paul Meieris Professor
ofChicago,
ofStatistics,The University
Chicago, IL 60637, and Sandy Zabell is AssociateProfessorof
Mathematics, NorthwesternUniversity,Evanston, IL 60201.
Supportfor this researchwas providedin part by NSF Grant
SOC 76-80389.The authorsare gratefulto a numberof friends
and colleagueswith whomthey discussedaspects of this caseP. Diaconis, S. Fienberg,J. Langbein,P. McCullagh,J. Tukey,
and H. Zeisel, among others.Special thanksare due to David
the inference
Wallace forextensivediscussionconcerning
problem
that arose,and StephenStiglerforhelpfulcorrespondence
on the
The staffofthe New BedfordPublicLibrary
historical
background.
were especiallyaccommodating
in locatingand arrangingforthe
copyingof unique source documentsand exhibits,and to them
theauthorsexpresstheirspecialgratitude.
1 December,1865. Testimonywas taken in 1867, decisionrendered October 1868 (Robinsonv. Mandell, 20 Fed. Cas. 1027).
A copyofthetrialtranscript
is availablein theNew BedfordPublic
Libraryand the testimony
in the textand Appendixes2 and 3 is
extractedfromthat source.
497
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
498
Journal
of the American
Statistical
Association,
September
1980
2. THE PROBLEM
O
t.^:Fg.v..*~~~~4-
w.
r4
,72
tjtt..
,-.fA
jy?
?o
o. 4s
Zn9Sy0,f
.
r.
4/J7/ i14-L
49
od-
At' KX
'7 4.
p0r.zzo.
2/6to
/is 7$6./CA
i*rt
. /K.
A1
4Z. o{e
+
it,..,,X
? *;
;- ie1*
<a0t
nr?
j/&4%4J
~ ~ ~ ~ ~ ~ ~ ~ ~
,:
V%z,
A,
#g
4 i0
/v8, g{F$vAo.
.,,,ta'47'
r%,r
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
?0f'A'
-
A
,ie<
W#uiVtt;
LW4zEtv;
499
ofCharlesSandersPeirce
2.1 Testimony
2
3
4
5
6
7
8
9
10
11
12
13
Number of Pairs
Number of Strokes
0
15
97
131
147
143
99
88
55
34
17
15
30
291
524
735
858
693
704
495
340
187
180
20
288
861
5325
30_
added:
the
One mighttry to improveon this a littleby considering
This numberfar transcendshuman experience.So vast an
probabilitythat out of 44 signatures,a given pair will have the
Such evanescent
is practicallyan impossibility.
improbability
greatestindex of coincidences,that is, (4Y4)1 = .0011. (Strictly
of probabilitycannotbelongto actual life.They are
shadows
speaking,we are not given the numberof coincidencesbetween
whichthelawcares
lessthanthoseleastthings
unimaginably
eithersignature#1 or #10 on the one hand, and any of the 42
notfor.
othersignatureson the other,but presumablynone of these even
of such
approachesthe 30 of #1 and #10.) The appropriateness
When asked the purpose of his son's display of the
issue, but one that we will
test is an interesting
a significance
distributionof the number of coincidences,Peirce exnot touchon.
2
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
of the American
Journal
StatisticalAssociation,
1980
September
500
A graphical display of the comparisonwas also provided, which is reconstructedin Figure C accordingto FREQUE14CY
the descriptiongiven in the trial record (the actual 200
display was omittedfromthe trial record).
of Coincidences
C. Distribution
---
OBSERVED FREQUENCY
EXPECTED FREQUENCY
3. CRITIQUEOF PEIRCE'SMODEL
ofFit
3.1 Goodness
How convincingis Peirce's model and his evidence
for it? No formaltheory of goodness-of-fit
testing existed at the time,and Peirce can scarcelybe faultedfor
failingto provide one. Still, the quality of fit can reasonably be questioned. Peirce's graph shows only the
middle range,and even here thereappears to be a consistenttrendin the deviations.What the graph does not
show is the substantialabsence of reallypoor matches-
150100-
I~~~
~NME
OF
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
GREMNT
501
not cross-examinedat all on the numericaland mathe- dependence,on the other hand, remains in full force.)
matical parts of his testimony,let alone made to list the Thattest,however,as notedbefore,yieldsonlya strongly
20 highvalues (see Appendix3).
suggestive significanceprobability,but nothing at all
as compellingas Peirce's calculation.
For all that one can raise questions about Peirce's
3.2 Model Assumptions
model and analysis, it remainsto considerthe extentto
The frequentparalysisof the law beforea demonstra- which the evidence available really contradictsit. Altion of mathematicshas long been noted. An extreme thoughthe conventionalchi-squaredtest clearlyrejects
view, expressedby Laurence Tribe in his 1971 Harvard the binomial model, it is not at all clear that the chiLaw Reviewarticle"Trial by Mathematics" (Tribe 1971) squared statistic should have even approximatelythe
is that laymen are too readily impressedand misled by familiar distribution.Under Peirce's model the probsuch presentationsand that they should thereforebe ability that a random pair of true signatures should
excluded-at least in jury trials. Whatever one thinks match in k stokes is indeed binomial, as claimed, and
of this position as a generality,Peirce's model, as well thus the expectedfractionof pairs with k agreementsis
as his data analysis, deserves closer examinationthan it also binomial. However, the (42) = 861 pairs are generated fromonly 42 signaturesand are not at all the same
receivedduringthe trial.
One might,for example, question the assumption- as 861 independenttrialsfroma binomialpopulation. It
that the relativefrequencies
required for the binomial model-that the probability is to be expected,therefore,
ofa matchshouldbe the same at each ofthe 30 positions. will show variabilitygreaterthan multinomialand that
One can easily imagine that certain strokes-perhaps the chi-squared statistic will also exhibit greater varithose early in the signature-would have a higher ability. To assess more rigorouslythe possible lack of
probabilityof matchingthan do others.The assumption fit of Peirce's model we turn to considerationof the
of equal probabilitycould readily have been checked rather large excess of pairs with over 12 matches (20
in the course of C.S. Peirce's work,but it seems never observed, only 2.7 expected). The Markov inequality
to have been questioned. (Of course, the effectof such (e.g., see Billingsley1979, p. 65), of which the familiar
inequalitywould be to reducethe variance in the number Chebyshev inequality is a direct corollary,states that
of matches, and thus to make the probability of 30 forany nonnegativerandomvariable Z, one has
matcheseven smallerthan that quoted.)
Pr{Z > t} < t-'E{Z} .
Far more serious is the implicitassumption of independence. One would expect that agreement in, say, Taking Z = the numberof pairs out of the 861 showing
positions1 and 3, would make far morelikelyan agree- 13 or more agreements,and t = 20, we findPrIZ > t}
mentin position2. The effectof such dependencewould < 2.7/20 = .14-not in itselfsufficient
reason to reject
be to increasethe variance and thus increase the prob- the Peirce model.A morerefinedanalysis,however,gives
ability of 30 matches over that quoted, quite possibly fairlyconvincingevidence that the Peirce model does
by orders of magnitude. To the extent that the data not fitthe data (see Appendix4).
bears on this point,it does indeed suggestthe possibility
andtheProduct
Rule
of dependenceby exhibitingwhat appear to be far too 3.3 Independence
many cases with over 12 matches to agree with the
Peirce's unargued and possibly unwarrantedassumpbinomialmodel.
tion of independence should occasion no surprise. It
A finalcritiqueof Peirce's discussionis the absence of was often
taken,for granted by thoughtfuland astute
any commenton correlationbetween signatures made scientistsof that era and later ones-not entirelyexclose in time. It is at least plausible that signatureslate ceptingthe scientificcommunityof today. For
example,
in life would differsubstantiallyfromthose made at a Simon Newcomb wrote in The Universal
Cyclopedia
youngerage, and also plausiblethat signaturesmade at a (1900, s.v. "Probability,Theory of"):
single sitting would be more similar than those made
The mathematicalrule for determining
probabilityin such
days or weeks apart. (Similar problems arise in the
a case [when the concurrence
of a large numberof circum,
statistical analysis of authorship,where both style and
stancesis necessaryto the productionof an event] is that
vocabulary may depend on the writer'ssubject matter
the probabilityof the concurrence
of all the eventsis equal
to the continuedproductoftheprobabilities
ofall theseparate
and mayvaryoverhis lifetime;see Mostellerand Wallace
events. As one example,suppose that a law requiringthe
1964, pp. 18-22, 195-199, 265-266.) Since many of the
concurrence
of the two housesof Congressand the President
42 signaturesused in the study were on leases and other
wereas likelyas not to be rejectedby any one of them,and
formedhis own opinion
that each one of the threeauthorities
dated documents,this question could also have been
of the othertwo. Then the probabilityof its
independently
examined,but thereis no evidencethat it was considered.
passingall threewouldbe a X a X 2 - SIn part these challengesto the fairnessof Peirce's test
can be met by the nonparametricproceduresuggested The product rule is misstatedhere, independencebeing
earlier.The questionsof uniformprobabilityof matching only introducedsubsequentlyin the context of an exin each of the 30 positions,and independencebetween ample. And while Newcomb has the mathematicsright
(The problem of time in his example,the application,althoughit can be given
them, would then be irrelevrant,.
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
502
Journal
of the American
StatisticalAssociation,
1980
September
is exceedinglyapt to be given
the correctinterpretation,
an incorrectone. For let us agree that each branch of
governmentarrivesat its own opinion independentlyof
the other two on each law, where "independently"is
given the common meaningof decidingin ignoranceof
the deliberationsor decisionof the others.It does not at
all followfromthat that the decisionswillbe statistically
independent.Thus, if a law on capital punishmentis
under consideration,all might be influencedby public
opinion,a varietyof lobbies,or the recentreportof some
heinous crime. The failureof each to directlyinfluence
one another does not protect them from being simultaneouslyinfluencedby externalevents.
The confusionof this ordinarymeaningof "independence" with "statistical independence" remains a common errortoday and, although Newcomb himselfmay
have been clear about the matter,his examplehereseems
especiallyunfortunate.
For whileNewcomb mayjust have been careless,skimmingovera pointobviousto himself,
thestatisticallayman
who read him could easily go astray. This was certainly
the case fora noted experton forgery,AlbertS. Osborn
(1929, pp. 225-234), whose text QuestionedDocuments
cites "ProfessorNewcomb's rule" withapprobation,but
whose illustrationsof it show no awareness of the need
forindependence.In one instance,Osborn supposed an
individualsoughtwho was 5 feet114 inchestall, withblue
eyes and brown hair, had lost his left thumb and the
lower part of his rightear, and had a distinctivemole,
tattoo, and scar. Osborn assumed for the sake of argument that the firstof these traits had a probabilityof
0, thesecond 3, thethird4, and all therest1/200,and concluded that the probabilityof theirjoint occurrencewas
1 in 38,400,000,000,000.There is, of course,no reason to
suppose a priorithat height,eye color, and hair color
are independenttraits,althoughin this instancepossible
mutual dependence might not be too serious. Surely,
however,the occurrencesofa tattoo,a scar,and a missing
thumb and ear would be highly dependent. (A more
recent and notoriousinstance is the celebrated Collins
case.4)
significanceprobability
Pr (evidenceIalternative)
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
4. AFTERMATH
Peirce was only one among many witnessesfor both
sides, but fromour point of view the next most interesting witness was J.C. Crossman, an engraver.Crossman testifiedfor the plaintiff,essentially in rebuttal
to Peirce. Peirce had said that such agreementas found
betweensignature#1 and #10 could not occurexceptby
design,no matterhow regularthe habits of the writer.
Crossman,however,gave testimonyabout the regularity
of the signatures of several individuals, most notably
those of the sixth Presidentof the United States, John
Quincy Adams (who signed his name "J.Q. Adams").
In each of these cases, Crossman testified,pairs of
signatureshad been foundin whichthe matchingwas as
good as or better than the matchingof#1 and #10. Unfortunately,the testimonydoes not suggest the correspondingvalue of p (.2 for Sylvia Ann Howland),
which powerfully influences the final probabilityespeciallysince the numberof strokesto be matched in
this case is considerablyless than 30.
How convincingwas the Peirce testimonyat the time?
Or what of Crossman's, supplementingthe microscopic
examinationofProfessorsHolmes and Agassiz? The court
did not say directly,but its decision can be viewed as
suggestingthat the courtwas at least leftin doubt about
the validity of the contested signatures. In brief,the
legal situationwas as follows.
As a general matter,a will may be revoked by the
testatorat any time until his death, althoughhe can by
contractbind himselfnot to exercisehis powerof revocation. Thus one mustlook forthe groundson whichHetty
Robinson asked that the earlier will and not the later
one prevail. She mighthave soughtto discreditthe later
will on the ground of undue influence.(Indeed, in an
earlier proceeding,Hetty had done exactly that, but
then withdrewthe case.) Alternatively,if the termsof a
will do evidence a contractualobligation-as if A puts
B's childrenthroughcollege, and B agrees to leave A
his house in return-a situation may arise in which a
testatoris not freeunilaterallyto change the terms.It
was Hetty Robinson's claim, supported with the 1862
will and otherevidence,that she and her aunt had contractedto makemutualwills,specifically
excludingHetty's
fatherEdward Robinson, so that in no case could he
inheritthe Howland fortune.However, Massachusetts
law at the time treated a beneficiaryunder a will as
disqualifiedon account of self-interestfrom testifying
about the circumstancessurroundingthe will, unless
503
called to testifyby the opposite party or by the court.5
Without Hetty's testimony,the evidence was judged
insufficient
to support her claim that the 1862 will was
in fulfillment
of a contract. On this ground the court
foundforthe defendant,Thomas Mandell, and it can only
be conjectured whetherit might have ruled otherwise
had it consideredthe contested signaturesto be valid.
4.1 DidSheorDidn'tShe?
So ends the case-without a findingon the question
of most interestto the outsider.Clearly if the contested
signatureswereforgeries,
Hetty Robinsonwas responsible
for them. Did she trace her aunt's signatureor didn't
she? Here the amateur findshimselfon shiftingsands.
Peirce includes in his testimonysome furtherbits of
evidence that point toward forgery,such as the position
of the signatureon the page, and the level of each letter.
Osborn,a specialistin the fieldof questioneddocuments
and a student of this case in particular,had no doubt
whatever that the codicil signature was a forgeryby
tracing. However, he makes no comment on the J.Q.
Adams signaturesthat appear to defythe usual tests of
forgery.Examination of those of the 42 comparison
signaturesstillextantshowsa greatdeal ofregularity,
but
one does not findthe same sortof identityas between#1
and #10. But the originaland both the contestedsignatures were said to have been writtenon the same day,
and thus mightwellbe morealike than signatureswritten
many days apart. The tendency of all three of the
signaturesat issue to followone line fromS throughw,
and then slope offforthe last threelettersin a slightly
differentdirection,looks at firstcompelling-until we
noticethat preciselythis patternis characteristicof most
of the 42 comparisonsignatures.This is, perhaps,as far
as an amateur can go, and no unequivocal conclusion
seems possible.
5. EPILOGUE
Hetty Robinson Green-she had recentlymarriedlost this case, and other legal actions subsequently.
However, far frombeing weak in those skills necessary
to the managementof so large a sum-the ostensible
reason forputtingthe moneyin trustin the firstplaceHetty Green turned out to be a dynamic and effective
operator in finance.According to contemporarynewspaper accounts she devoted herselfto the multiplication
of her fortuneand became one of the most colorful
figuresWall Street had known for many years. Despite
her enormous wealth, she lived for many years in an
inexpensiveflat in Hoboken and took the ferryto work
in New York City each morning.All this was part of a
consciously projected image of penury, which even
extendedto her wearingold and worn clothing.Viewed
as eccentricby many, she had a complex personality
I The American Law Review (1870, Vol. 4, pp. 656-663), discussesthisaspectofthe case in considerable
detail.
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
504
Journal
of the American
StatisticalAssociation,
September
1980
CHRONOLOGYOF EVENTSCONNECTED
WITH THE WILL
APPENDIX2: CROSS-EXAMINATION
OF
OLIVERWENDELLHOLMES,SR.
6NotableAmericanWomen(1971) gives a brief,excellent,and for his side, at times perhaps too assured. It is instrucobjectiveaccountof HettyGreen'slifewithfurther
bibliography; tive to contrastthe firstpart of his testimonywith the
a somewhatmoresensationalist
accountappearsin JohnT. Flynn's
inappropriately
titledMen of Wealth(1941). Much betterthanits finalportionof his cross-examination.
title would suggestis Boyden Sparkes and Samuel T. Moore's
book-length
biographyHettyGreen:A WomanWhoLovedMoney Q. Your life has been spent, has it not, for the most
(1931), reprinted
in 1948 as HettyGreen:The Witchof Wall Street.
part in mathematical studies, and in teaching the
This case, along withotherinstancesof the use of probabilityin
has it not?
same,
legal cases, is describedby Elmer Mode, Journalof theAmerican
StatisticalAssociation(1963). Unfortunately,
his presentationis A. It has.
compressedand erroneousin importantparticulars.The later Q. You are not an instructorin writingand do not
vicissitudesand ultimatedissolutionof the Howlandfortuneafter
professany peculiarskill in handwriting?
Hetty's death are describedin ArthurH. Lewis's The Day They
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
__
.0232 - .0031
___
___
__
___
_-
l (.0031) (.9969))I
861
10.61
505
(or equivalently,
12 = cov(E (o (X) X2) IX1)E(9I(X1,X3)IX1))
(See, e.g., Lehmann 1975, p. 368). Note that
E(&(xi,
k(Xi, X,)
Then
{0
PI 1j-Ql
(.0201)2
(.0031) (.9969)
(861) (.0201)2
=
l<i<j?n
(Xi,Xi)
where
?ri2 =
.0089
and Peirce's model is rejected at the one percentsignificance level. For example, we mightsuppose that each
one of the 30 downstrokesin Sylvia Ann Howland's
signaturecan be in one of five equally possible states.
Then
-B
Q(X1) -E((X1,
X2)iX1)
JX1= x0)
X2)
2,
var[E(ik(X1,X2)1X1)J
1.0232 - .00311
X 10
X
< 1.96
+ 8O12)-
861/
> .033 .
[Received
March1979.RevisedOctober
1979.]
REFERENCES
Billingsley,Patrick (1979), Probability
and Meaure, New York:
JohnWiley& Sons.
and
Fairley,WilliamB., and Mosteller,Frederick(1977), Statistics
PublicPolicy,New York: Addison-Wesley
PublishingCo.
Flynn, John T. (1941), Men of Wealth,New York: Simon &
Schuster.
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions
506
of the American
Journal
StatisticalAssociation,
September
1980
This content downloaded from 200.12.134.94 on Mon, 03 Aug 2015 15:08:46 UTC
All use subject to JSTOR Terms and Conditions