Professional Documents
Culture Documents
Statistics Hacks - by Bruce Frey
Statistics Hacks - by Bruce Frey
By Bruce Frey
...............................................
Publisher: O'Reilly
Pub Date: May 2006
Print ISBN-10: 0-596-10164-3
Print ISBN-13: 978-0-59-610164-0
Pages: 356
T his book is ideal for anyone who likes puzzles , brainteas ers ,
games , gambling, magic tric ks , and thos e who want to apply
math and s c ienc e to everyday c irc ums tanc es . Several hac ks in
the firs t c hapter alone- s uc h as the "c entral limit theorem,",
whic h allows you to know everything by knowing jus t a little-
s erve as s ound approac hes for marketing and other bus ines s
objec tives . U s ing the tools of inferential s tatis tic s , you c an
unders tand the way probability works , dis c over relations hips ,
predic t events with unc anny ac c urac y, and even make a little
money with a well- plac ed wager here and there.
Statis tics Hacks pres ents us eful tec hniques from s tatis tic s ,
educ ational and ps yc hologic al meas urement, and experimental
res earc h to help you s olve a variety of problems in bus ines s ,
games , and life. You'll learn how to:
P lay s mart when you play Texas H old 'E m, blac kjac k,
roulette, dic e games , or even the lottery
Whether you're a s tatis tic s enthus ias t who does c alc ulations in
your s leep or a c ivilian who is entertained by c lever s olutions to
interes ting problems , Statis tics Hacks has tools to give you an
edge over the world's s lim odds .
Statistics Hacks
By Bruce Frey
...............................................
Publisher: O'Reilly
Pub Date: May 2006
Print ISBN-10: 0-596-10164-3
Print ISBN-13: 978-0-59-610164-0
Pages: 356
credits Credits
Preface
Chapter 1. The Basics
Hack 1. Know the Big Secret
Hack 2. Describe the World Using Just Tw o
Numbers
Hack 3. Figure the Odds
Hack 4. Reject the Null
Hack 5. Go Big to Get Small
Hack 6. Measure Precisely
Hack 7. Measure Up
Hack 8. Pow er Up
Hack 9. Show Cause and Effect
Hack 10. Know Big When You See It
Chapter 2. Discovering Relationships
Hack 11. Discover Relationships
Hack 12. Graph Relationships
Hack 13. Use One Variable to Predict Another
Hack 14. Use More Than One Variable to Predict
Another
Hack 15. Identify Unexpected Outcomes
Hack 16. Identify Unexpected Relationships
Hack 17. Compare Tw o Groups
Hack 18. Find Out Just How Wrong You Really Are
Hack 19. Sample Fairly
Hack 20. Sample w ith a Touch of Scotch
Hack 21. Choose the Honest Average
Hack 22. Avoid the Axis of Evil
Chapter 3. Measuring the World
Hack 23. See the Shape of Everything
Hack 24. Produce Percentiles
Hack 25. Predict the Future w ith the Normal Curve
Hack 26. Give Raw Scores a Makeover
Hack 27. Standardize Scores
Hack 28. Ask the Right Questions
Hack 29. Test Fairly
Hack 30. Improve Your Test Score While Watching
Paint Dry
Hack 31. Establish Reliability
Hack 32. Establish Validity
Hack 33. Predict the Length of a Lifetime
Hack 34. Make Wise Medical Decisions
Chapter 4. Beating the Odds
Hack 35. Gamble Smart
Hack 36. Know When to Hold 'Em
Hack 37. Know When to Fold 'Em
Hack 38. Know When to Walk Aw ay
Hack 39. Lose Slow ly at Roulette
Hack 40. Play in the Black in Blackjack
Hack 41. Play Smart When You Play the Lottery
Hack 42. Play w ith Cards and Get Lucky
Hack 43. Play w ith Dice and Get Lucky
Hack 44. Sharpen Your Card-Sharping
Hack 45. Amaze Your 23 Closest Friends
Hack 46. Design Your Ow n Bar Bet
Hack 47. Go Crazy w ith Wild Cards
Hack 48. Never Trust an Honest Coin
Hack 49. Know Your Limit
Chapter 5. Playing Games
Hack 50. Avoid the Zonk
Hack 51. Pass Go, Collect $200, Win the Game
Hack 52. Use Random Selection as Artificial
Intelligence
Hack 53. Do Card Tricks Through the Mail
Hack 54. Check Your iPod's Honesty
Hack 55. Predict the Game Winners
Hack 56. Predict the Outcome of a Baseball Game
Hack 57. Plot Histograms in Excel
Hack 58. Go for Tw o
Hack 59. Rank w ith the Best of Them
Hack 60. Estimate Pi by Chance
Chapter 6. Thinking Smart
Hack 61. Outsmart Superman
Hack 62. Demystify Amazing Coincidences
Hack 63. Sense the Real Randomness of Life
Hack 64. Spot Faked Data
Hack 65. Give Credit Where Credit Is Due
Hack 66. Play a Tune on Pascal's Triangle
Hack 67. Control Random Thoughts
Hack 68. Search for ESP
Hack 69. Cure Conjunctionitus
Hack 70. Break Codes w ith Etaoin Shrdlu
Hack 71. Discover a New Species
Hack 72. Feel Connected
Hack 73. Learn to Ride a Votercycle
Hack 74. Live Life in the Fast Lane (You're Already
In)
Hack 75. Seek Out New Life and New Civilizations
Colophon
Index
Credits
About the Author
Bruc e Frey, P h.D ., is a c omic book c ollec tor and film buff. I n his
s pare time, he teac hes s tatis tic s to graduate s tudents and
c onduc ts res earc h in his s ec ret identity as an as s is tant
profes s or in E duc ational P s yc hology and Res earc h at the
U nivers ity of Kans as . H e is an award- winning teac her, and his
s c holarly res earc h interes ts are in the areas of teac her- made
tes ts and c las s room as s es s ment, the meas urement of
s pirituality, and program evaluation methods . Bruc e's honors
inc lude taking third plac e in the Kans as M onopoly C hampions hip
as a teenager, s ec ond plac e in the Kans as Film Fes tival as a
c ollege s tudent, and a res pec table third- plac e finis h in the
L awrenc e, Kans as , Texas H old 'E m P oker Tournament as a
middle- aged man. H e is proudes t of two ac c omplis hments : his
marriage to his s weet wife, and his purc has e of a low- grade c opy
of Showcas e #4 , a c omic book wherein the "Silver A ge Flas h firs t
appears ," whatever that means .
Contributors
T he following people c ontributed their hac ks , writing, and
ins piration to this book:
Acknowledgments
I 'd like to thank all the c ontributors to this book, both thos e who
are lis ted in the "C ontributors " s ec tion and thos e who helped
with ideas , reviewed the manus c ript, and provided s ugges tions
of s ourc es and res ourc es . T hanks in this c apac ity es pec ially go
to T im L angdon, neon bender, whos e gift of H arry Blac ks tone,
J r.'s paperbac k book There's One Born Every Minute (J ove
P ublic ations ) provided great ins piration for many of the hac ks
herein.
I 'd like to thank N eil Salkind, s tatis tic s writer s upreme, for his
help with many fac ets of my profes s ional life and this book.
T his book is a c ollec tion of s tatis tic al tric ks and tools . Statis tics
Hacks pres ents us eful tools from s tatis tic s , of c ours e, but als o
from the realms of educ ational and ps yc hologic al meas urement
and experimental res earc h des ign. I t provides s olutions to a
variety of problems in the world of s oc ial s c ienc e, but als o in the
worlds of bus ines s , games , and gambling.
I f you are already a top s c ientis t and do s tatis tic al c alc ulations
in your s leep, you'll enjoy this book and the c reative
applic ations it finds for thos e rus ty old tools you know s o well. I f
you jus t like the s c ientific approac h to life and are entertained
by c ool ideas and c lever s olutions to interes ting problems , don't
worry. Statis tics Hacks was written with the nons c ientis t in mind,
too, s o if that is you, you've c ome to the right plac e. I t's written
for the nons tatis tic ian as well, s o if this s till des c ribes you,
you'll feel s afe here.
I f, on the other hand, you are taking a s tatis tic s c ours e or have
s ome interes t in the ac ademic nature of the topic , you might find
this book a pleas ant c ompanion to the textbooks typic ally
required for thos e s orts of c ours es . T here won't be any
c ontradic tions between your textbook and this book, s o hearing
about real- world applic ations of s tatis tic al tools that s eem only
theoretic al won't hurt your development. I t's jus t that there are
s ome pretty c ool things that you c an do with s tatis tic s that
s eem more like fun than like work.
C hapter 1 , T he Bas ic s
I talics
Constant width
Gray type
You s hould pay s pec ial attention to notes s et apart from the text
with this ic on:
T his is a tip, s ugges tion, or general note.
I t c ontains us eful s upplementary
information about the topic at hand.
Safari Enabled
When you s ee a Safari® E nabled ic on on the c over of your
favorite tec hnology book, that means the book is available
online through the O 'Reilly N etwork Safari Books helf.
How to Contact Us
We have tes ted and verified the information in this book to the
bes t of our ability, but you may find that the rules or
c harac teris tic s of a given s ituation are different than des c ribed
here. A s a reader of this book, you c an help us to improve future
editions by s ending us your feedbac k. P leas e let us know about
any errors , inac c urac ies , mis leading or c onfus ing s tatements ,
and typos that you find anywhere in this book.
O 'Reilly M edia, I nc .
1 0 0 5 G ravens tein H wy N .
Sebas topol, C A 9 5 4 7 2
8 0 0 - 9 9 8 - 9 9 3 8 (in the U .S. or C anada)
7 0 7 - 8 2 9 - 0 5 1 5 (international/loc al)
7 0 7 - 8 2 9 - 0 1 0 4 (fax)
T he web s ite for Statis tics Hacks lis ts examples , errata, and
plans for future editions . You c an find this page at:
http://www.oreilly.c om
Got a Hack?
To explore H ac ks books online or to c ontribute a hac k for future
titles , vis it:
http://hac ks .oreilly.c om
Chapter 1. The Basics
T here's only a s mall group of tools that s tatis tic ians us e to
explore the world, ans wer ques tions , and s olve problems . I t is
the way that s tatis tic ians us e probability or knowledge of the
normal dis tribution to help them out in different s ituations that
varies . T his c hapter pres ents thes e bas ic hac ks .
Samples are numeric values that you have gathered together and
c an s ee in front of you that repres ent s ome larger population of
s c ores that you have not gathered together and c annot s ee in
front of you. Bec aus e thes e values are almos t always numbers
that indic ate the pres enc e or level of s ome c harac teris tic ,
meas urement folks c all thes e values s cores . A probability
s tatement is a s tatement about the likelihood of s ome event
oc c urring.
So, traditionally, the teac hing of s tatis tic s inc ludes at leas t
s ome time s pent on the bas ic rules of probability: the methods
for c alc ulating the c hanc es of various c ombinations or
permutations of pos s ible outc omes . M ore c ommon applic ations
of s tatis tic s , however, are the us e of des criptive s tatis tics to
des c ribe a group of s c ores , or the us e of inferential s tatis tics to
make gues s es about a population of s c ores us ing only the
information c ontained in a s ample of s c ores . I n s oc ial s c ienc e,
the s c ores us ually des c ribe either people or s omething that is
happening to them.
I t turns out, then, that res earc hers and meas urers (the people
who are mos t likely to us e s tatis tic s in the real world) are c alled
upon to do more than c alc ulate the probability of c ertain
c ombinations and permutations of interes t. T hey are able to
apply a wide variety of s tatis tic al proc edures to ans wer
ques tions of varying levels of c omplexity without onc e needing
to c ompute the odds of throwing a pair of s ix- s ided dic e and
getting three 7 s in a row.
For example, if you know that a quiz jus t adminis tered in a c las s
you are taking res ulted in a dis tribution of s c ores in whic h 2 5
perc ent of the c las s got 1 0 points , then I might s ay, without
knowing you or anything about you, that there is a 2 5 perc ent
c hanc e that you got 1 0 points . I c ould als o s ay that there is a
7 5 perc ent c hanc e that you did not get 1 0 points . A ll I have
done is taken known information about the dis tribution of s ome
values and expres s ed that information as a s tatement of
probability. T his is a tric k. I t is the s ec ret tric k that all
s tatis tic ians know. I n fac t, this is mos tly all that s tatis tic ians
ever do!
Statis tic ians take known information about the dis tribution of
s ome values and expres s that information as a s tatement of
probability. T his is worth repeating (or, tec hnic ally, threepeating,
as I firs t s aid it five s entenc es ago). Statis tic ians take known
information about the dis tribution of s ome values and expres s
that information as a s tatement of probability.
T he key, of c ours e, is how one knows the dis tribution of all thes e
exotic types of values that might interes t a s tatis tic ian. H ow c an
one know the dis tribution of average differenc es or the
dis tribution of the s ize of a relations hip between two s ets of
variables ? C onveniently, pas t res earc hers and mathematic ians
have developed or dis c overed formulas and theorems and rules
of thumb and philos ophies and as s umptions that provide us with
the knowledge of the dis tributions of thes e c omplex values mos t
often s ought by res earc hers . T he work has been done for us .
Mean
Typic ally, the bes t meas ure of c entral tendenc y, for a variety of
reas ons , is the mean [H ac k #2 1 ]. T he mean is the arithmetic
average of all the s c ores and is c alc ulated by adding together all
the values in a group, and then dividing that total by the number
of values . T he mean provides more information about all the
s c ores in a group than other c entral tendenc y options (s uc h as
reporting the middle s c ore, the mos t c ommon s c ore, and s o on).
So What?
Bec aus e we know the proportion of values that res ide all along
the normal c urve [H ac k #2 3 ], and the C entral L imit T heorem
tells me that thes e s ummary values are normally dis tributed, I
c an plac e probabilities on eac h s tatis tic al outc ome. I c an us e
thes e probabilities to indic ate the level of s tatis tic al
s ignific anc e (the level of c ertainty) I have in my c onc lus ions and
dec is ions . Without the C entral L imit T heorem, I c ould hardly
ever make s tatements about s tatis tic al s ignific anc e. A nd what a
drab, s ad life that would be.
I need s ome data from my s c outs to figure all this out. Table 1 - 1
s hould provide s ome good information.
M ean I Q = 9 1 .7 5
Standard deviation = 4 .5 3
So, I know in my s ample that mos t s c ores are within about 4 1/2
I Q points of 9 1 .7 5 . I t is the invis ible population they c ame from,
though, that I am mos t interes ted in. T he C entral L imit T heorem
allows me to es timate the population's mean, s tandard
deviation, and, mos t importantly, how far s ample means will
likely s tray from the population mean:
Mean I Q
A fuzzy vers ion of the C entral L imit T heorem points out that:
D ata that are affec ted by lots of random forc es and unrelated
events end up normally dis tributed.
Will I win the lottery? Will I get struck by lightning and hit by a
bus on the same day? Will my basketball team have to meet our
hated rival early in the NCA A tournament? A t its core, statistics
is all about determining the likelihood that something will
happen and answering questions like these. The basic rules f or
calculating probability allow statisticians to predict the f uture.
Additive rule
T hes e two tools will be enough to ans wer mos t of your everyday
"What are the c hanc es ? " ques tions .
When a s tatis tic ian s ays s omething like "a 1 out of 1 0 c hanc e of
happening," s he has jus t made a predic tion about the future. I t
might be a hypothetic al s tatement about a s eries of events that
will never be tes ted, or it might be an hones t- to- goodnes s
s tatement about what is about to happen. E ither way, s he's
making a s tatis tic al s tatement about the likelihood of an
outc ome, whic h is jus t about all s tatis tic ians ever s ay [H ac k
#1 ].
A s a perc entage
A s odds
A s a proportion
.0 8 3 + .0 5 6 + .0 2 8 = .1 6 7
What about when the probability ques tion is whether more than
one independent event will happen? T his ques tion is us ually
as ked when you want to know whether a s equenc e of s pec ific
events will oc c ur. T he order of the events us ually does n't matter.
.0 8 3 x.0 5 6 x.0 2 8 = .0 0 0 1 3
T his very s pec ific outc ome is very unlikely. I t will happen les s
than .1 perc ent, or 1 /1 0 of 1 perc ent of the time. T he
multiplic ative rule is us ed here bec aus e you are interes ted in
whether all of s everal independent events will happen.
Analytic view
Hypothesis Testing
T here are four pos s ible outc omes when s c ientis ts tes t
hypothes es by c ollec ting data. Table 1 - 3 s hows the pos s ible
outc omes for this dec is ion- making proc es s .
U nlike res earch hypothes is tes ting, with s tatis tical hypothes is
tes ting, the probability s tatement that a s tatis tic ian makes at
the end of the hypothes is tes ting proc es s is not related to the
likelihood that the res earc h hypothes is is true. Statis tic ians
produc e probability s tatements about the likelihood that the
res earc h hypothes is is fals e. To be more tec hnic ally ac c urate,
s tatis tic ians make a s tatement about whether a hypothes is
oppos ite to the res earc h hypothes is is likely to be c orrec t. T his
oppos ite hypothes is is typic ally a hypothes is of no relations hip
among variables , and is c alled the null hypothes is . A generic
s tatis tic ian's null hypothes is looks like this : there is no
relations hip between variable X and variable Y in the population
of interes t.
T he res earc h and null hypothes es c over all the bas es . T here
either is or is not a relations hip among variables . E s s entially,
when having to c hoos e between thes e two hypothes es ,
c onc luding that one is fals e provides s upport for the other.
L ogic ally, then, this approac h is jus t as s ound as the more
intuitive approac h pres ented earlier and utilized naturally by
humans every day. T he preferred outc ome by res earc hers
c onduc ting null hypothes is tes ting is a bit different than the
general hypothes is - tes ting approac h pres ented in Table 1 - 3 .
Why It Works
Statis tic ians tes t the null hypothes is gues s the oppos ite of what
they hope to findfor s everal reas ons . Firs t, proving s omething to
be true is really, really tough, es pec ially if the hypothes is
involves a s pec ific value, as s tatis tic al res earc h often does . I t
is muc h eas ier to prove that a prec is e gues s is wrong than prove
that a prec is e gues s is true. I c an't prove that I am 2 9 years
old, but it would be pretty eas y to prove I am not.
Whenever res earc hers are playing around with s amples ins tead
of whole populations , they are bound to make s ome mis takes .
Bec aus e the bas ic tric k of inferential s tatis tic s is to meas ure a
s ample and us e the res ults to make gues s es about a population
[H ac k #2 ], we know that there will always be s ome error in our
gues s es about the values in thos e populations . T he good news
is that we als o know how to make the s ize of thos e errors as
s mall as pos s ible. T he s olution is to go big.
Improving Accuracy
Why It Works
L et's look at this important s tatis tic al princ iple from s everal
different angles . I 'll s tate the law us ing three different
approac hes , beginning with the gambler's c onc erns , moving on
to the is s ue of error, and ending with the implic ations for
gathering a repres entative s ample. A ll of the entries in this lis t
are the exac t s ame rule, jus t s tated differently.
Gambling
Error
Implications
Samples are more repres entative of the population from whic h
they are drawn when they inc lude many people than when they
inc lude fewer people. T he number of important c harac teris tic s in
the population repres ented in a s ample inc reas es , as does the
prec is ion of their es timates , as the s ample s ize gets larger.
For the las t 3 0 0 years , though, all of s oc ial s c ienc e has made
us e of this elegant tool to es timate how ac c urately s omething we
s ee des c ribes s omething we c annot s ee. T hanks , J ake!
See Also
"Find O ut J us t H ow Wrong You Really A re" [H ac k #1 8 ]
Hack 6. Measure Precisely
Clas s ical tes t theory, or reliability theory, examines the c onc ept of
a tes t s c ore. T hink of the obs erved s c ore (the s c ore you got) on
a tes t you took s ometime. C las s ic al tes t theory defines that
s c ore as being made up of two parts and pres ents this
theoretic al equation:
O bs erved Sc ore = True Sc ore + E rror Sc ore
O bs erved s c ore
True s c ore
E rror Sc ore
E ven though random errors s hould c anc el eac h other out ac ros s
tes ting s ituations , les s than perfec t reliability is a c onc ern
bec aus e, of c ours e, dec is ions are almos t always made bas ed on
s c ores from a s ingle tes t adminis tration. I t does n't do you any
good to know that in the long run, your performanc e would reflec t
your true s c ore if, for example, you jus t bombed your SAT tes t
bec aus e the pers on next to you wore dis trac ting c ologne.
Reliability es timates for s c ores from this tes t are typic ally
around .9 2 , whic h indic ates very high reliability. I f you rec eive a
s c ore of 5 2 0 when you take this exam, c ongratulations , you
performed higher than average. 5 2 0 was your obs erved s c ore,
though, and your performanc e was s ubjec t to random error. H ow
c los e is 5 2 0 to your true s c ore? U s ing the s tandard error of
meas urement formula, our c alc ulations look like this :
1. 1 - .9 2 = .0 8
2. T he s quare root of .0 8 is .2 8
3. 1 0 0 x.2 8 = 2 8
T he s tandard error of meas urement for the G RE is about 2 8
points , s o your s c ore of 5 2 0 is mos t likely within 2 8 points of
what you would s c ore on average if you took the tes t many
times .
I f you are the pers on who took the tes t, then you c an be pretty
s ure that your true s c ore is within a c ertain range. T his might
enc ourage you to take the tes t again with s ome reas onable
expec tation of how muc h better you are likely to do by c hanc e
alone. With your s c ore of 5 2 0 on the G RE , you c an be 9 5
perc ent s ure that if you take the tes t again right away, your new
s c ore c ould be as high as 5 7 6 . O f c ours e, it c ould drop and be
as low as 4 6 4 the next time, too.
Hack 7. Measure Up
When you dec ide to meas ure s omething, you mus t c hoos e the
rules by whic h you as s ign s c ores very c arefully. T he level of
meas urement determines whic h s orts of s tatis tic al analys es are
appropriate, whic h will work, and whic h will be meaningful.
I f you are planning to us e s c ores to indic ate only that the things
belong to different groups , meas ure at the nominal level. T he
nominal level of meas urement us es numbers only as names :
labels for various c ategories (nominal means "in name only").
Take, for example, your rank order in your high s c hool c las s . T he
valedic torian is us ually the pers on who rec eived a s c ore of 1
when grade point averages are c ompared. N otic e that you c an
c ompare s c ores to eac h other, but you don't know anything about
the dis tanc e between the s c ores . I n a footrac e, the firs t- plac e
finis her might have been jus t a s ec ond ahead of the s ec ond-
plac e runner, while the s ec ond- plac e runner might have been 3 0
s ec onds ahead of the runner who c ame in third plac e.
Whic h level of meas urement is right for you? Bec aus e of the
advantages of moving to at leas t the interval level, mos t s oc ial
s c ientis ts prefer to meas ure at the interval or ratio level. A t the
interval level, you c an s afely produc e des c riptive s tatis tic s and
c onduc t inferential s tatis tic al analys es , s uc h as t tes ts ,
analys es of varianc e, and c orrelational analys es . Table 1 - 6
provides a s ummary of the s trengths and weaknes s es of eac h
level of meas urement.
Controversial Tools
A majority c ons ens us in the res earc h literature is that if you are
at leas t at the ordinal level and believe that you c an make
meaning out of interval- level s tatis tic al analys es , then you c an
s afely perform inferential s tatis tic al analys es on this type of
data. I n the real world of res earc h, by the way, almos t everybody
c hoos es this approac h (whether they know it or not).
T here are two potential pitfalls when c onduc ting s tatis tic ally
bas ed res earc h. Sc ientis ts might dec ide that they have found
s omething in a population when it really exis ts only in their
s ample. C onvers ely, s c ientis ts might find nothing in their s ample
when, in reality, there was a beautiful relations hip in the
population jus t waiting to be found.
Power
P ower is not the c hanc e of finding a s ignific ant res ult; it is the
c hanc e of finding that relations hip if it is there to find. T he
formula for power c ontains three c omponents :
Sample s ize
I magine that you think the ac tual differenc e in your gender- and-
s leep s tudy will be real, but s mall. A differenc e of about .2
s tandard deviations between groups in t tes t analys es is
c ons idered s mall, s o you might expec t a .2 effec t s ize. To find
that s mall of an effec t s ize, you need 4 0 0 people in each group!
A s the effec t s ize inc reas es , the nec es s ary s ample s ize gets
s maller. I f the population effec t s ize is 1 .0 (a very large effec t
s ize and a big differenc e between the two groups ), 2 0 people per
group would s uffic e.
To have a good c hanc e of reac hing a s tatis tic ally s ignific ant
finding, one c ondition beyond your c ontrol mus t be true. T he null
hypothes is mus t be fals e, or your c hanc es of "finding"
s omething are s lim. A nd, if you do "find" s omething, it's not
really there, and you will be making a big errora Type I error.
T here mus t ac tually be a relations hip among your res earc h
variables in the population for you to find it in your s ample data.
So, fate dec ides whether you wind up in the c olumn on the right
in Table 1 - 8 . Power is the c hanc e of moving to the top of that
c olumn onc e you get there. I n other words , power is the c hanc e
of c orrec tly rejec ting the null hypothes is when the null
hypothes is is fals e.
Why It Works
T his relations hip between effec t s ize and s ample s ize makes
s ens e. T hink of an animal hiding in a hays tac k. (T he animal is
the effect s ize; jus t work with me on this metaphor, pleas e.) I t
takes fewer obs ervations (handfuls of hay) to find a big ol' effec t
s ize (like an elephant, s ay) than it would to find a tiny animal
(like a c ute baby otter, for ins tanc e). T he number of people
repres ents the number of obs ervations , and big effec t s izes
hiding in populations are eas ier to find than s maller effec t s izes .
Soc ial s c ienc e res earc h that us es s tatis tic s operates under a
c ouple of broad goals . O ne goal is to c ollec t and analyze data
about the world that will s upport or rejec t hypothes es about the
relations hips among variables . T he s ec ond goal is to tes t
hypothes es about whether there are c aus e- and- effec t
relations hips among variables . T he firs t goal is a breeze
c ompared to the s ec ond.
T here are all s orts of relations hips between things in the world,
and s tatis tic ians have developed all s orts of tools for finding
them, but the pres enc e of a relations hip does n't mean that a
partic ular variable c aus es another. A mong humans , there is a
pretty good pos itive c orrelation [H ac k #1 1 ] between height and
weight, for example, but if I los e a few pounds , I won't get
s horter. O n the other hand, if I grow a few inc hes , I probably will
gain s ome weight.
Group 1
T hirty c ollege fres hmen, who I would rec ruit from the
population of the M idwes tern univers ity where I work.
T his group would be the experimental group; I would
inc reas e their weight and meas ure whether their height
inc reas es .
Group 2
T hirty c ollege fres hmen, who I would rec ruit from the
population of the M idwes tern univers ity where I work.
T his group would be the control group; I would not
manipulate their weight at all and would then meas ure
whether their height c hanges .
Bec aus e this des ign matc hes the c riteria for experimental
des igns , we c ould interpret any relations hips found as evidenc e
of c aus e and effec t.
Res earc h c onc lus ions fall into two types . T hey have to do with
the c aus e- and- effec t c laim and whether any s uc h c laim, onc e it
is es tablis hed, is generalizable to whole populations or outs ide
the laboratory. Table 1 - 9 dis plays the primary types of validity
c onc erns when interpreting res earc h res ults . T hes e c onc erns
are the hurdles that mus t be c ros s ed by res earc hers .
E ven when res earc hers have c hos en a true experimental des ign,
they s till mus t worry that any res ults might not really be due to
one variable affecting another. A c aus e- and- effec t c onc lus ion
has many threats to its validity, but fortunately, jus t by thinking
about it, res earc hers have identified many of thes e threats and
have developed s olutions .
His tory
Maturation
Selection
I ns trumentation
H awthorne E ffec t
T he validity of res earc h des ign and the validity of any c laims
about c aus e and effec t are s imilar to c laims of validity in
meas urement [H ac k #2 8 ]. Suc h arguments are open and
unending, and validity c onc lus ions res t on a reas oned
examination of the evidenc e at hand and c ons ideration for what
s eems reas onable.
See Also
What you need to know to make dec is ions about your life and
reality in light of any new s c ientific report is the s ize of the
relations hip that has jus t been brought to light. H ow much better
is brand A than brand B? H ow big is that SAT differenc e between
boys and girls in meaningful terms ? I s it worth it to take that half
an as pirin a day, every day, to lower your ris k of a heart attac k?
H ow muc h lower is that ris k anyway?
Statis tic al res earc h has always been interes ted in relations hips
among variables . T he c orrelation c oeffic ient, for example, is an
index of the s trength and direc tion of relations hips between two
s ets of s c ores [H ac k #1 1 ]. L es s obvious , but s till valid,
examples of s tatis tic al proc edures that meas ure relations hips
inc lude t tes ts [H ac k #1 7 ] and analys is of varianc e, a proc edure
for c omparing more than two groups at one time.
When effec t s izes are reported, they are typic ally one of three
types . T hey differ depending on the proc edure us ed and the way
that proc edure quantifies the information of interes t. I n eac h
c as e, the effec t s ize c an be interpreted as es timates of the "s ize
of the relations hip between variables ." H ere are the three typic al
types of effec t s izes :
Eta-s quared
T he s tandards for big, medium, and little are bas ed, for the mos t
part, on the effec t s izes that are normally found in real- world
res earc h. I f a given effec t s ize is s o large as to be rarely found in
publis hed res earc h, it is c ons idered to be big. I f the effec t s ize is
tiny and eas y to find in real- life res earc h, then it is c ons idered to
be s mall.
You s hould dec ide yours elf, though, how big an effec t s ize is of
interes t to you when interpreting res earc h res ults . I t all depends
on the area of inves tigation. Table 1 - 1 0 provides the rules of
thumb for how big is big.
Remember our example of whether you s hould dec ide to take half
an as pirin eac h day to c ut down your c hanc es of having a heart
attac k? A well- public ized s tudy in the late 1 9 8 0 s found a
s tatis tic ally s ignific ant relations hip between thes e two
variables . O f c ours e, you s hould talk with your doc tor before you
make any s ort of dec is ion like this , but you s hould als o have as
muc h information as pos s ible to help you make that dec is ion.
L et's us e effec t s ize information to help us interpret thes e s orts
of findings .
Why It Works
A res earc her c an ac hieve s ignific ant res ults , but s till not have
found anything for anyone to get exc ited about. T his is bec aus e
s ignificance tells you only that your s ample res ults probably did
not oc c ur by c hanc e. T he res ults are real and likely exis t in the
population. I f you have found evidenc e of a s mall relations hip
between two variables or between the us e of a drug and s ome
medic al outc ome, the relations hip might be s o s mall that no one
is really interes ted in it. T he effec t of the drug might be real, but
weak, s o it's not worth rec ommending to patients . T he
relations hip between A and B might be greater than zero, but s o
tiny as to do little to help unders tand either variable.
M odern res earc hers are s till interes ted in whether there is
s tatis tic al s ignific anc e in their findings , but they s hould almos t
always report and dis c us s the effec t s ize. I f the effec t s ize is
reported, you c an interpret it. I f it is not reported, you c an often
dig out the information you need from publis hed reports of
s c ientific findings and c alc ulate it yours elf. T he c ool part is that
you might then know more about the importanc e of the dis c overy
than the media who reported the findings and, maybe, even the
s c ientis ts thems elves .
Chapter 2. Discovering
Relationships
T here are invis ible webs of relations hips around us . Variable A
c aus es Variable B, whic h influenc es Variable C , whic h is entirely
independent of Variable D , unles s Variable E c omes into play.
T he hac ks in this c hapter allow you to dis c over thes e
c onnec tions and des c ribe them ac c urately. T hes e are the hac ks
that reveal the hidden reas ons for why people do the things they
do and why things are the way they are.
Bec aus e we c annot meas ure every example of a pers on, fis h, or
pine tree that we might be interes ted in, we mus t rely on
repres entative s amples [H ac k #1 9 ] to provide our obs ervations .
Sampling c an mis lead us [H ac k #1 8 ], however, or it c an work in
s urpris ingly c ool ways [H ac k #2 0 ].
P ac k thes e tools in your tool belt and head out to find whatever
there is to find.
Hack 11. Discover Relationships
Regardles s of what s c ienc e c alls it, you probably do it. You might
make thes e gues s es about as s oc iations between attitudes and
behavior or between attitudes and attitudes or behaviors and
behaviors . You might do it informally as you s eek to unders tand
people in the world around you, or you might need to do it as a
marketing s pec ialis t to unders tand your c us tomer, or you might
be a s truggling ps yc hology graduate s tudent who needs to
c omplete a c las s as s ignment that requires s tatis tic al analys is
of the relations hip between s elf- es teem and depres s ion.
2. Sees how dis tant that s c ore is from the mean of that
c olumn
3. I dentifies the dis tanc e from the mean of its matc hing
s c ore in the other c olumn
Z refers to a Z-s core, whic h is the dis tanc e of a s c ore from the
mean. T hes e dis tanc es are then divided by the s tandard
deviation for that dis tribution. So, Zx means all the Z- s c ores
from the firs t c olumn, and Zy means all the Z- s c ores from the
s ec ond c olumn. ZxZy means multiply them together. T he S
s ymbol means add up. So, the equation s ays to multiply together
all the pairs of Z- s c ores and add thos e c ros s - produc ts together.
T hen, divide by the number (N) of pairs of s c ores minus 1 .
I f only there were s ome way to magic ally know how good
bus ines s will be on any given day. A s an amateur s tatis tic ian,
you as s ume there mus t be a s c ientific way to gues s how many
c ones will s ell without having to ac tually open for bus ines s and
tes t the market for the day. You're in luc k. T here is a way to
make es timates of the value or s c ore on s ome variable (s uc h as
ic e c ream s ales ) by us ing other information.
"What if? " is a fun game to play with regres s ion lines . E nter a
value in one end and a gues s c omes out the other end; you c an
get an ans wer even for unrealis tic s c enarios . T hrow s ome c razy
value onto the line, s uc h as 2 0 0 degrees , and you c an s till get
an es timate for c one s ales : 1 1 5 !
T he regres s ion equation for this relations hip would des c ribe a
line that c ould be drawn to s how this relations hip vis ually. With
real data, the relations hip is s eldom as c lear as it is in our
example. (T he c orrelation for our s mall fic tional data s et is a
perfec t 1 .0 .)
Why It Works
Sec ond, the relations hip mus t be at leas t s ort of linear. A s in our
ic e c ream c one example, if the nature of the relations hip
c hanges s omewhere along the regres s ion line, the regres s ion
line will mis s s ome of the data. Fortunately, mos t relations hips in
the natural world are linear or at leas t c los e to it.
M any profes s ionals , both in and outs ide of the s oc ial s c ienc es ,
often need to predic t how a pers on will perform on s ome tas k or
s c ore on s ome variable, but they c annot meas ure the c ritic al
variable direc tly. T his is a c ommon need when making admis s ion
dec is ions into c ollege, for example. A dmis s ions offic ers want to
predic t c ollege performanc e (perhaps grade point average or
years until c ompletion). H owever, bec aus e the pros pec tive
s tudent has not ac tually gone to c ollege yet, admis s ions offic ers
mus t us e whatever information they c an get now to gues s what
the future holds .
Simple linear regres s ion is the proc edure that produc es all the
values we need to c ook up the magic formula that will predic t the
future. T his proc edure produc es a regres s ion line that we c an
graph to determine what the future holds [H ac k #1 2 ], but onc e
we have the formula, we don't ac tually need to do any graphing
to make our gues s es .
Firs t, examine the rec ipe for c reating the formula (s ee the
"Regres s ion Formula Rec ipe" s idebar), and then we'll s ee how to
us e it with real data. You c an c lip this rec ipe out and keep it in
the kitc hen drawer.
Regression Formula Recipe
Ingredients
2 s ample means
Container
Directions
Serves
By plac ing all this information into the regres s ion equation
format, we get this formula for predic ting fres hman G P A us ing
A C T s c ores :
Predicting Scores
U s ing the regres s ion equation we have built, there would be two
different predic tions for thes e folks ' eventual grade point
averages :
For Melis s a
P redic ted G P A = - .2 4 + 4 .1 6
P redic ted G P A = 3 .9 0
For Bruce
P redic ted G P A = - .2 4 + 2 .2 4
P redic ted G P A = 2 .0 0
I hope, for Bruc e's s ake, there is more than one s pot available.
Why It Works
Regres s ion is helpful in ans wering res earc h ques tions beyond
making predic tions . Sometimes , s c ientis ts jus t want to
unders tand a variable and how it operates or how it is dis tributed
in a population. T hey c an do this by looking at how that variable
is related to another variable that they know more about.
T here will be error in predic tions under three c irc ums tanc es .
Firs t, if the c orrelation is les s than perfec t between two
variables , the predic tion will not be perfec tly ac c urate. Sinc e
there are almos t never really large relations hips between
predic tors and c riteria, let alone perfec t 1 .0 c orrelations , real-
world applic ations of regres s ion make lots of mis takes . I n the
pres enc e of any c orrelation at all, though, the predic tion is more
ac c urate than blind gues s ing. You c an determine the s ize of your
errors with the s tandard error of es timate [H ac k #1 8 ].
Sec ond, linear regres s ion as s umes that the relations hip is linear.
T his is dis c us s ed in "G raph Relations hips " [H ac k #1 2 ] in
greater detail, but if the s trength of the relations hip varies at
different points along the range of s c ores , the regres s ion
predic tion will make large errors in s ome c as es .
O ften, real- life res earc hers want to make us e of the information
found in a bunc h of variables , not jus t one variable, to make
predic tions or es timate s c ores . When they want greater
ac c urac y, s c ientis ts attempt to find s everal variables that all
appear to be related to the c riterion variable of interes t (the
variable you are trying to predic t). T hey us e all this information
to produc e a multiple regres s ion equation.
T his s ingle predic tor produc ed a regres s ion equation with output
that c orrelated .5 5 with the c riterion. P retty good, and pretty
ac c urate, but it c ould be better.
I magine our adminis trator dec ides s he's unhappy with the level
of prec is ion s he c ould get us ing the regres s ion line or equation
s he had built, and wants to do a better job. She c ould get a more
ac c urate res ult if s he c ould find more variables that c orrelate
with c ollege grades . L et's imagine that our amateur s tatis tic ian
found two other predic tor variables that c orrelated with c ollege
performanc e:
T he quality of a written es s ay
P erhaps performanc e on a c ollege attitude s urvey is c ollec ted
by the c ollege (s c ores range between 2 0 and 1 0 0 ), and is found
to have s ome c orrelation with future G P A . A dditionally, a s c ore
of 1 to 5 on a pers onal es s ay c ould c orrelate with c ollege G P A
and might be inc luded in the multiple regres s ion equation.
L et's look firs t at the abs trac t format of the regres s ion equation
in general. T hen, we'll apply the tool to the tas k at hand. H ere is
the bas ic regres s ion equation us ing jus t one predic tor variable:
U s ing realis tic data that we might find with three predic tors that
c orrelate with the c riterion, as well as c orrelate with eac h other
s omewhat, we might produc e a regres s ion equation with values
like this :
P redic ted G P A = 3 .0 1 +
(A C T Sc orex.0 2 ) +
(A ttitude Sc orex.0 0 7 ) +
(E s s ay Sc orex.0 2 5 )
C ompare the weight for A C T s c ore with the weight for attitude
s c ore, for example. T he weight of .0 2 for A C T is larger than the
weight of .0 0 7 for attitude, but don't be fooled into thinking that
A C T s c ores play a larger role in predic ting G P A than attitude.
Remember, G P A s c ores range from about 1 .0 to 4 .0 , whereas
attitude s c ores range from 2 0 to 1 0 0 . A s maller weight for
attitude ac tually res ults in a bigger jump on the c riterion than
does the larger weight for A C T s c ores .
C omputer program res ults for multiple regres s ion analys es often
provide information in the format s hown in Table 2 - 4 .
Why It Works
T his bit of c omplexity is nec es s ary bec aus e if the predic tors are
related to eac h other, they s hare s ome information. T hey aren't
really independent s ourc es of predic tion if they c orrelate with
eac h other. To make the regres s ion equation as ac c urate as
pos s ible, s tatis tic al proc edures remove the s hared information
from eac h predic tor in the equation. T his produc es independent
predic tors that c ome at the c riterion from different angles ,
produc ing the bes t predic tion pos s ible.
H ave you ever notic ed that s omething s eemed to be going on, but
weren't s ure if it was jus t your imagination? D o a greater number
of hippies s hop at the loc al c ommunity merc antile than would be
expec ted by c hanc e? I f the ans wer is yes , and you are looking to
meet hippies , you s hould s tart hanging out there.
I magine you are res pons ible for s c heduling the polic e offic ers in
your town. T he problem is that you don't know whether to
s c hedule the s ame amount of offic ers for every s hift or whether
more c rime might oc c ur during partic ular s hifts . I f one s hift is
likely to be bus ier, you s hould probably as s ign more offic ers . O f
c ours e, another reas on to as s ign more offic ers during that time
is that their patrolling might c ut c rime down a bit.
We c ould c ompute a c hi- s quare for this data. I f the c hi- s quare is
really big, then the 1 2 0 c rimes is unus ually larger than the other
two c rime periods . H ow big "really big" needs to be is an
important ques tion that we will explore later in this hac k.
L et's c alc ulate a c hi- s quare for this data. T he obs erved
frequenc y for eac h c ategory is given. T he expec ted frequenc y for
eac h c ell would be 3 0 0 divided by three c ategories , or 1 0 0 :
Why It Works
D ata for c hi- s quare analys es are laid out in a way in whic h the
obs erved number of things in eac h c ategory c an be c ompared
with the expec ted number of things in eac h c ategory. T he
"expec ted number of things in eac h c ategory" is us ually defined
as an equal number. I f nothing is going on (i.e., if the c ategory
makes no differenc e), we expec t an equal number of things in
eac h c ategory.
I f there are two or more c ategories and the res earc her wants to
know whether the ac tual dis tribution ac ros s thes e c ategories is
what would be expec ted by c hanc e alone, then the c hi- s quare is
an appropriate tes t. T he ac tual value that is tes ted is the
differenc e between what the res earc her expec ts to find and what
ac tually oc c urs .
Statis tic ians know the s ize of normal fluc tuations in obs erved
frequenc ies c ompared to expec ted frequenc ies . With this
knowledge, they c an c ompute the likelihood that any obs erved
deviation from the expec ted oc c urs by c hanc e or bec aus e
s omething els e is going on.
For example, if tes t developers are c onc erned about item bias
(that one item might work differently for one identifiable group
over anothers uc h as rac es , genders , and s o on), they will c hec k
whether the patterns of ans wer options meet c ertain
expec tations regardles s of whic h group generated the data. T he
c hi- s quare analys is c ompares the expec tations to ac tual tes t
performanc e.
See Also
You probably already have s ome as s umption about this , but how
would you go about c hec king the ac c urac y of s uc h an
as s umption?
O ur initial ques tion inc luded only Republic ans , s o while politic al
party might have s eemed like a variable in our firs t analys is , it
was really jus t a des c ription of the population; it did not vary at
all. We c an add party to our analys is , though, by adding another
c ategoryDemocrat, for example and rec ruiting 7 5 more
partic ipants , and s uddenly we have data with two variables .
I magine frequenc y data as s hown in Table 2 - 7 .
We do the s ame with the two- way c hi- s quare. T he expec ted
frequenc y in eac h c ell is equal to the number of people in that
c ell's row multiplied by the number of people in that c ell's
c olumn and then divided by the total s ample s ize. U s ing the data
in Table 2 - 7 , the c alc ulations for expec ted frequenc ies are
s hown in Table 2 - 8 .
T hus , the two- way c hi- s quare c alc ulations look like this :
Statis tic ians know that the c ritic al c hi- s quare value for 2 x2
tables (like the c hi- s quare we jus t c omputed) is 3 .8 4 . C hi-
s quare values greater than 3 .8 4 are found by c hanc e about 5
perc ent of the time or les s [H ac k #1 5 ].
Bec aus e our c hi- s quare value was 3 .2 4 and that is les s than the
key 5 perc ent value of 3 .8 4 , we know that s uc h a fluc tuation c an
oc c ur by c hanc e s omewhat greater than 5 perc ent of the time.
We c annot c laim s tatis tic al s ignific anc e here, and s o we mus t
c onc lude that though our s ample s eemed to s how a relations hip
between the two c ategoric al variables of party affiliation and s ex,
it might have oc c urred bec aus e of c hanc e s ampling error. I n the
population from whic h the s ample was drawn, there might not be
any relations hip.
Why It Works
A two- way c hi- s quare ans wers this relations hip ques tion by
looking at differenc es . T his might s eem c ounterintuitive,
bec aus e mos t s tatis tic s look for differenc es in order to s how,
well, a differenc e, not to s how s imilarities . But here's the
thinking:
See Also
T he larger the t value, the les s likely that any differenc es found
between your s ample groups oc c urred by c hanc e. Typic ally, t
values larger than about 2 are big enough to reac h the
c onc lus ion that the differenc es exis t in the whole population, not
jus t in your s amples .
T he t formula s hown here works bes t when
both groups have the s ame number of
people in them. A s imilar formula that
averages the varianc e information is us ed
when there are unequal s ample s izes .
I f we plac e thos e key values into our t formula, it looks like this :
Statis tic s is all about prec is ion, though, s o let's explore our
1 .4 2 a little further. H ow big, exac tly, would it need to be for us to
c onc lude that U nc le Frank is ac tually right?
Why It Works
Suc h des igns produc e two s ets of s c ores , and thos e s ets of
values often differ, at leas t in the s amples us ed. Res earc hers
(and I , too, when it c omes to proving U nc le Frank wrong) are
more interes ted in whether there would be differenc es in the
populations repres ented by the two s amples .
I ns tead of examining all the grain, a beer s tatis tic ian (how's that
for a dream job? ) wanted a method that requires looking at a
s mall s ample only, randomly drawn from the larger population of
grain. T he res t is his tory, and s o we c an s ay today that muc h of
the work done by s tatis tic al res earc hers is literally driven by
beer.
Hack 18. Find Out Just How Wrong You
Really Are
Statis tic ians are perhaps the only profes s ionals who not only
proudly admit that their ans wers are probably wrong, but will go
to great lengths to tell you exac tly how wrong they are. When
you c onduc t a s urvey, rec ord obs ervations , or c onduc t s ome
s ort of experiment, your res ults des c ribe only your s amplethe
c us tomers , patients , s tudents , goldfis h, or piec es of Kryptonite
that you have in front of you. I nferential s tatis tic s us es values
c omputed for a s ample to es timate what that value would be for
the population it is meant to repres ent. For example, the mean of
a s ample is a pretty good gues s for the mean of the population.
T he problem is knowing whether to trus t your res ults .
A s the s ample s ize inc reas es , the c los er the s ample mean is to
the true population mean. T his makes s ens e if you think of
s ample s ize as the number of independent obs ervations ; the
more looks you get at s omething, the more ac c urate your
des c ription will be.
Proportion Estimates
When the c alc ulations are made, then, the further the s ample
proportion is from .5 0 , the s maller the s tandard error of the
proportion. A nother point of interes t is that the top part of the
formula is an indic ation of the amount of variability in the
s ample. (proportion)(1 - proportion) is the s tandard deviation for
proportions s quared.
N otic e with this formula that the larger the c orrelation, the
s maller the s tandard error of the es timate. T his makes s ens e,
bec aus e if there is a lot of informational overlap between two
variables , you c an get a good s ens e of the s c ore on one variable
by looking at the other.
Standard
error of the 15 30 100 2
mean
Standard
error of the 15 60 100 1
mean
Standard
error of the .25 30 .50 .0
proportion
Standard
error of the .25 60 .50 .0
proportion
Standard
error of the 15 30 100 1
estimate
Standard
error of the 15 60 100 1
estimate
T he formulas are c ons truc ted in s uc h a way that if you have little
or no information about the population, then the s ize of the error
in your s ample es timate is about the s ize of the s tandard
deviation of the population.
L ook what happens with the s tandard error of the mean or the
s tandard error of the proportion when the s ample s ize is 1 , or
what happens with the s tandard error of the es timate when the
c orrelation is 0 .0 0 . I ntuitively, a good formula for figuring the
s tandard error s ize s hould produc e s maller errors when more is
known about the population.
Hack 19. Sample Fairly
I nferenc e is pretty eas y to do, but it works well only when the
s ample is good. C ons truc ting a good s ample is the key.
G eneral univers e
Sampling unit
Sampling frame
Tec hnic ally, the term random des c ribes a s ampling proc es s that
gives every member of a population an equal and independent
c hanc e of being s elec ted. Equal means that every s ampling unit
in the s ampling frame has as good a c hanc e as anyone els e.
I ndependent means that a pers on's or thing's c hanc es of being
s elec ted are unrelated to whether any other partic ular pers on or
thing has been s elec ted.
C onvenienc e s ampling
Stratified s ampling
J udgment s ampling
I f you are able to c ons truc t a good s ample, as we have defined it,
even a s mall s ample c an be effec tive. A s with c hoc olate c hip
c ookies , though, bigger is better. T he larger the s ample, the
more repres entative of the population it is . C ons equently, the
obs ervations are more generalizable and you c an better trus t
their ac c urac y.
See Also
A Sampling Problem
T he s tatis tic ian s tarts with two glas s es of the s ame s ize. O ne
glas s (the firs t glas s ) has two ounc es of Sc otc h in it; the other
(the s ec ond glas s ) has two ounc es of water in it. H e s tarts by
pouring an ounc e of water from the water glas s into the Sc otc h.
H e apparently already s c rewed up, bec aus e he c hanges his mind
and pours an ounc e of the new mixture (three ounc es of Sc otc h
and water mixed up) bac k into the water glas s . Both glas s es now
have two ounc es of liquid in them, but the liquid in eac h glas s is
s ome mix of water and Sc otc h.
N ervous ly, the s tatis tic ian attempts to s tart all over, but his
department c hair s tops him. She s ays :
I have a propos ition for you. We c an't pos s ibly know the exac t
proportion of Sc otc h and water in eac h glas s right now, bec aus e
we c an't know how mixed up everything is . But if you c an ans wer
the following ques tion c orrec tly, I 'll write a s trong letter of
s upport to your tenure c ommittee. I f not, well, I 'm s ure s omeone
with your qualific ations s hould have no trouble finding work in
the hotel/motel or food s ervic e indus try. H ere's the ques tion:
right now, does the firs t glas s have more water in it, or does the
s ec ond glas s have more Sc otc h in it?
T hink of the ques tion as a s ampling is s ue. D oes the firs t s ample,
the liquid in the firs t glas s , have more water in it, or does the
s ec ond s ample, the liquid in the s ec ond glas s , have more Sc otc h
in it? Bec aus e both Sc otc h and water are made up of really s mall
partic les , it is diffic ult to pic ture how muc h of eac h liquid is
repres ented in eac h s ample. E ven proportionately, we c an't be
s ure how many water partic les (or s ampled s c ores that equal
"water") are mixed into the s ample of "Sc otc h" s c ores , bec aus e
who knows how muc h water drifted down into the bottom of the
firs t glas s and would have remained there as the top part of the
liquid near the s urfac e was poured bac k into the s ec ond glas s .
A n intuitive ans wer is c alled for. U nfortunately, it is wrong.
Sample 1
Sample 2
5 0 white marbles
1 0 0 blue marbles
Sample 2
1 0 0 white marbles
Sample 1
Sample 2
Sample 1
Try this with any mix of marbles you wis h, but remember you
have to draw out a total of 5 0 marbles (to duplic ate the one
ounc e, or half, of the water originally mixed up).
N otic e that any mixture you try res ults in 1 0 0 marbles in eac h
glas s at the end. A ls o, mos t importantly, notic e that the ratio of
blue to white marbles in the firs t glas s at the end is always equal
to the ratio of white to blue marbles in the s ec ond glas s . A ny
blue marble that is not in the s ec ond glas s mus t be in the firs t
glas s , and any white marble that is not in the firs t glas s mus t be
in the s ec ond glas s .
Real- life polling c ompanies , who make their living and s take
their reputations on the ac c urac y of elec tion predic tions , are
als o primarily c onc erned with the proportion of s amples who are
in eac h of s everal c ruc ial c ategories . I f people have jus t voted
and there are two c andidates , anyone who did not vote for
c andidate A voted for c andidate B. T heir abs enc e in one
c ategory guarantees their pres enc e in the other. Reporting
predic tions as perc entages c reates the potential for greater
ac c urac y. I t als o allows for greater error, as a voter predic ted to
be in c ategory A who ends up in c ategory B has therefore
produc ed error in both c ategories .
When s tatis tic al s oc ial s c ienc e res earc hers want to be
c onvinc ed that their s ample is repres entative of its population,
their primary c onc ern is always the proportions of
c harac teris tic s in their s ample, not the number of people with
thos e c harac teris tic s . What matters mos t is that the proportions
of eac h s c ore for the key res earc h variables are the s ame in
both s amples and their populations .
Hack 21. Choose the Honest Average
When mos t people hear a s tatement like "the average pric e for a
hous e in this town is $ 2 9 0 ,0 0 0 " (whic h might s ound low, high, or
jus t right, depending on where you c all home), they imagine that
this figure was determined by adding up all of the s ales pric es
from all of the hous es in the town, and then dividing that s um by
the number of hous es . But s tatis tic ians know there is more than
one way to determine the "average," and s ometimes one kind is
better than another.
N ot even the median will always be hones t, though. C ons ider the
following s c enario. Say you're a yoga ins truc tor, and half of the
s tudents in your c las s are between 2 5 and 3 5 years old, and the
other half are between 5 0 and 6 0 . H ow would you des c ribe the
average age of your s tudents ?
T he problem in s ituations like thes e is that neither the mean nor
the median will adequately des c ribe the group of individuals .
What to do? T he mos t hones t c hoic e for an average in this
s ituation is to report the mode, whic h is s imply the mos t
frequently oc c urring value in a s ample of data, as s hown in the
example in Figure 2 - 7 .
I n this c as e, there are two modes : one at 3 0 years old and the
other at 5 4 years old. Reporting both of thes e values is the bes t
way to c hoos e the hones t average. T he mean and median both
mis lead for thes e s orts of data.
So, when is the mean the hones t average? Bas ic ally, the mean is
the bes t c hoic e when there is only one mode and the dis tribution
is s ymmetric , whic h means that there is no obvious s kew in
either direc tion. I f your yoga c las s were attended by your 2 5 - to
3 5 - year- old s tudents only, the mean would be the hones t
average.
When all is s aid and done, how do you c hoos e the mos t
appropriate average? Following thes e three s imple rules will
keep you hones t if you are reporting s ummaries , and will help
you make informed c hoic es if you are the one making dec is ions
bas ed on the data:
C hoos e the mode if there are two or more "trends " in the
data (i.e., two or more areas of high- frequenc y values ),
and report one mode for eac h trend.
William Skorups ki
Hack 22. Avoid the Axis of Evil
Typic al graphs have two axes , bec aus e they des c ribe two
different variables . A xes are the lines along the bottom, c alled
the X-axis , and along the s ide, c alled the Y-axis .
Bar c hart
H is togram
L ine c hart
Graphic Violence
A fter all, to an avid inves tor who c hanges her portfolio often and
buys and s ells frequently, a drop of 2 .8 perc ent is s erious
bus ines s . A graph des igned to make s mall c hanges look s erious
might be the mos t valid for that reader. I f an inves tor is one of
thos e "in it for the long haul" types , a relatively s mall c hange is
meaningles s , however.
To get the mos t meaning out of graphs like thes e, always c hec k
the bottom value on the Y- axis . T his way, you c an get a s ens e of
the real differenc es on the X- axis as you c rawl from bar to bar. I f
you are making graphs like thes e, think about the mos t hones t
way to pres ent the information. You want to inform, not dec eive
(probably).
See Also
T here are a few mirac les in the world of s tatis tic s . T here are at
leas t three tools three dis c overies that are s o c ool and magic al
that onc e s tudents of s tatis tic s learn about them and begin to
c omprehend their beauty, they frequently explode.
T he normal c urve
Sinc e we've dis c us s ed the us es of the firs t two mirac les in other
hac ks , let's s pend our time now getting to know the s hape and
us es of the third: the normal curve. I am pleas ed to pres ent the
normal c urve, the normal dis tribution, the bell- s haped c urve, the
whole world, as s hown in Figure 3 - 1 .
Statis tic ians have defined the normal c urve very s pec ific ally.
U s ing both c alc ulus and hundreds of years of real- world data
c ollec tion, the two methods have reac hed the s ame s et of
c onc lus ions about the exac t s hape of the normal dis tribution.
Figure 3 - 2 s hows the important c harac teris tic s of the normal
c urve. T he mean is in the middle, and there is room for fewer and
fewer s c ores as you move away from that c enter.
Rec all the c laim I made earlier that anything you meas ure will
dis tribute its elf as a normal c urve. By implic ation, then, anything
we meas ure will have mos t of the s c ores c los e to the mean and
only a few s c ores far from the mean. M eas ure enough people and
you will get the oc c as ional extreme s c ore very far from the
mean, but s c ores far from the mean will be rare. T he expec ted
proportion of people getting any partic ular s c ore gets s maller as
that s c ore moves away from the mean.
T hat next tes t you take? I don't know the tes t or anything about
you, but I am willing to wager that you will get a s c ore c los e to
the mean. I predic t your s c ore will be average. You might get
above average or below average, but the normal c urve tells me
that you will likely be pretty c los e to the mean.
H ere are s ome key fac ts about the c urve that you c an us e to
predic t performanc e:
Setting standards
N otic e als o that the normal c urve is s ymmetric al: you c ould fold
it in half and one s ide would perfec tly c over the other. T he other
c harac teris tic of the normal c urve that is important to know is
that it goes on forever. I t is a theoretic al c urve, s o the two ends
of the c urve will never touc h the bas eline.
I f you have been s hown a perc entile rank that repres ents your
tes t performanc e, it won't be us eful unles s you know what it
means . O n the flip s ide, if you have to explain s omeone's tes t
performanc e and you s how the tes t taker a raw s c ore only, you
aren't really being very helpful. Being able to build or interpret
perc entile ranks is a us eful s kill for both s ides of the tes ting
game.
I f you are a c las s room teac her, human res ourc es manager, or
anyone who has to report tes t res ults to others , being able to
report a perc entile rank ins tead of a raw s c ore will help tes t
takers unders tand how well they performed and als o help
dec is ion makers unders tand the c ons equenc es of s etting
various s tandards of performanc e.
P roduc ing perc entiles begins with organizing all your tes t
s c ores . For a s mall data s et, it is fairly s imple to build a
frequency table, whic h ans wers all s orts of ques tions in addition
to providing perc entile ranks . H ere is a s ample dis tribution for
3 0 s c ores on a c las s room tes t (arranged from lowes t to highes t)
in whic h 1 0 0 points was the highes t pos s ible s c ore:
59, 65, 72, 75, 75, 75, 80, 83, 83, 85, 85, 85, 85, 85,
85, 86, 86, 86, 86, 88, 88, 88, 90, 90, 90, 90, 90, 92,
94, 97
For effic ienc y's s ake, this data c an be dis played and the
frequenc y of eac h s c ore c an be c omputed, as s hown in Table 3 -
1.
I magine that you are s itting down with your guidanc e c ouns elor
and have been told that your perc entile rank is 9 3 . So, what
does this mean? Well, the mos t direc t interpretation is that 9 3
perc ent of all people who took the tes t s c ored les s than you did.
I t is als o c orrec t to s ay that 7 perc ent of people s c ored equal to
you or higher. We c an als o think of perc entile ranks as s aying
how far the s c ore is from normal. T he mean perc entile rank is
us ually around the 5 0 th perc entile and will be exac tly that if
s c ores are normally dis tributed, as they (ahem) normally are. So,
we c ould als o s ay that the 9 3 rd perc entile is pretty far above
average.
D on't make the mis take that many otherwis e s avvy s tat-
hac kers s ometimes make. E arlier in this hac k, we us ed an
example of a tes t s c ore in whic h you got 1 5 items c orrec t out of
2 0 on a quiz and half the c las s got fewer c orrec t than you. Your
perc entile rank in that example was 5 0 . N otic e that in that
example, your perc ent c orrec t is 7 5 perc ent (1 5 /2 0 ), but your
perc entile rank is 5 0 . D on't c onfus e the two! Knowing your
perc entile rank does not tell you how many ques tions you got
right.
See Also
A variety of hac ks in this book c apitalize on s tatis tic ians ' c los e
pers onal relations hip with the normal curve. "See the Shape of
E verything" [H ac k #2 3 ] s hows how to us e the normal c urve to
predic t tes t performanc e in a general way. We c an do better than
that, though.
So muc h is known about the exac t s hape of this mys tic al c urve
that we c an make exac t predic tions about the probability that
s c ores in a c ertain range will be obtained. T here are many other
types of ques tions that c an be as ked related to tes t
performanc e, and s tatis tic s c an help us to ans wer thes e s orts of
ques tions before we ever take the tes t!
For example:
What are the c hanc es that you will s c ore between any
two given s c ores ?
H ow many people will s c ore between thos e two s c ores ?
What are the c hanc es that you will pas s your next tes t?
For thes e types of ques tions , a prec is e tool is needed. T his hac k
provides that tool: a table of areas under the normal curve.
A fter unders tanding the s implific ations made to the table, the
firs t s tep toward us ing it to make probability predic tions about
performanc e or ans wer s tatis tic al ques tions is to unders tand the
four c olumns .
The z column
You c ould als o des c ribe the area between any given z
and a z of 4 .0 0 , or the end of the c urve.
Why It Works
See Also
When you jus t tell s omeone a raw s c ore, very little real
information has been s hared. You don't know if 1 6 is good. You
don't know if 1 6 is relatively high or low. D id mos t people get a
1 6 or higher, or did mos t people get s omething les s than 1 6 ?
E ven if we know the range of s c ores on that tes t and the points
pos s ible and s o on, we s till c an't c ompare performanc e on that
tes t to performanc e on the pas t tes t or the next tes t or a tes t on
s ome other s ubjec t. Raw s c ores are virtually meaningles s .
D on't fret! You c an s till unders tand your performanc e and the
performanc es of others . You c an s till make s elec tion dec is ions
and c ompare performanc e ac ros s people and ac ros s tes ts .
T here is s till hope!
Raw s c ores c an be c hanged into a new number that does all the
things that that 9 7 - pound weakling, the raw s c ore, c ould never
do. Raw s c ores c an be trans formed into a s uper number: a z
s core. U nlike a raw s c ore, a z tells you whether the performanc e
is above or below average, and how far above or below average it
is . A z als o allows you to c ompare performanc e ac ros s tes ts and
oc c as ions , and even between people.
Calculating z Scores
Understanding Performance
Why It Works
N ever fear. H ere are the tools you need to both interpret thes e
s trange s tandardized s c ores and, if you want, even c reate your
own (for when you report s c ores to other people from your own
weirdo tes t that is jus t about to s weep the nation and make you
as ric h as M r. A C T or M s . I Q or whoever makes money from our
tes t- bas ed s oc iety).
Problems with z Scores
M eas urement folks have s earc hed for and found other
s tandardized s c ales to report tes t performanc e that have more
pleas ing properties . T he tric k is to s tart with a z s c ore, and then
c onvert it onto s ome other s c ale with a mean and s tandard
deviation that is friendlier.
For fun, you c an c reate your own s tandardized s c ore dis tribution
with any mean and s tandard deviation you wis h. D on't like your
SAT s c ore of 3 5 0 ? Trans form it into a s c ore within a dis tribution
of your c hoos ing.
For meas uring knowledge quic kly and effic iently, it is hard to
beat the multiple-choice item as a ques tion format.
A s you s ee, eac h part of the ques tion has a name. T he c orrec t
ans wer is c alled the correct ans wer (how's that for s c ientific
jargon? ), and wrong ans wers are c alled dis tractors .
N ot muc h, but s ome real- world res earc h has been done on the
c harac teris tic s of multiple- c hoic e items and how to write good
ones . To write good multiple- c hoic e items , follow the following
c ritic al item- writing guidelines from this res earc h:
I nclude 3 to 5 ans wer options
=======================================
====================
====================
====================
====================
A s mart fellow and educ ational res earc her, Benjamin Bloom,
writing in the 1 9 5 0 s , s ugges ted a way of thinking about
ques tions and the level of unders tanding required to res pond
c orrec tly. H is c las s ific ation s ys tem has bec ome known as
Bloom's Taxonomy, a c las s ific ation s ys tem of educ ational
objec tives bas ed on the level of unders tanding nec es s ary for
ac hievement or mas tery. Bloom and c olleagues have s ugges ted
s ix different c ognitive s tages in learning. T hey are, in order from
lowes t to highes t:
1 . Knowledge
2 . C omprehens ion
3 . A pplic ation
4 . A nalys is
5 . Synthes is
You s hould not worry too muc h about the fine dis tinc tions
between the s ix levels as defined by Bloom. For example,
comprehens ion and application are c ommonly treated as
s ynonymous , as it is the ability to apply what is learned that
indic ates c omprehens ion. M os t tes ting theoris ts and c las s room
teac hers today pay the mos t attention to the dis tinc tion between
the knowledge level and all the res t of the levels . M os t teac hers ,
exc ept at introduc tory s tages of brand new areas , prefer to teac h
and meas ure to objec tives that are above the knowledge level.
See Also
C onc erned teac hers c ons tantly work to improve their tes ts , but
they are often working in the dark without s olid data to guide
them. What c an a s mart, c aring teac her do to improve his tes ts
or improve the validity of his grading? A family of s tatis tic al
methods c alled item analys is c an provide direc tion to teac hers
as they s eek to develop fair as s es s ments and grading.
Item Analysis
I tem analys is is the proc es s of examining c las s room
performanc e on individual tes t items . A c las s room teac her might
want to examine performanc e on parts of a tes t s he has written,
to s ee what areas are being mas tered by her s tudents and what
areas need more review. A c ommerc ial tes t developer produc ing
exams for nurs ing c ertific ation might want to know whic h items
on his tes t are the mos t valid and whic h s eem to meas ure
s omething els e and s hould therefore be removed.
I f you are a c las s room teac her worried about your own
as s es s ments , there are three different types of ques tions that
you probably need to ans wer. Fortunately, there are three item-
analys is tools that will provide you with the three different types
of information you need.
T he diffic ulty of any s pec ific tes t ques tion c an be c alc ulated
fairly eas ily us ing the formula for the difficulty index. You c an
produc e a diffic ulty index for a tes t item by c alc ulating the
proportion of s tudents taking the tes t that got that item c orrec t.
T he larger the proportion, the more tes t takers who know the
information meas ured by the item.
H ow hard is too hard? You get to dec ide that yours elf. Some
teac hers treat diffic ulty indic es at .5 0 or below as too hard
bec aus e mos t people mis s ed the item. You might have higher
s tandards . I f you believe that mos t s tudents s hould have
learned the material and your diffic ulty index for an item
s ugges ts that a s ubs tantial portion of your c las s mis s ed it, it
might be too hard.
To eliminate educ ated gues s es that res ult in c orrec t ans wers
purely by c hanc e, teac hers and tes t developers want as many
plaus ible dis trac tors as is feas ible. A nalys es of res pons e
options allow teac hers to fine- tune and improve items they
might want to us e again with future c las s es .
H ere are the proc edures for the c alc ulations involved in item
analys is , us ing data for an example item. For this example,
imagine a c las s room of 2 5 s tudents who took a tes t that
inc luded the item in Table 3 - 6 (keep in mind, though, that even
large- s c ale s tandardized tes t developers us e the s ame
proc edures for tes ts taken by hundreds of thous ands of people).
1. C ount the number of people who got the c orrec t ans wer.
1 6 / 2 5 = .6 4
2. For eac h group, c alc ulate a diffic ulty index for the item.
3. Subtrac t the diffic ulty index for the low s c ores group
from the diffic ulty index for the high s c ores group.
.7 7 - .5 0 = .2 7
I f the dis c rimination index is negative, that means that, for s ome
reas on, s tudents who s c ored low on the tes t were more likely to
get the ans wer c orrec t. T his is a s trange s ituation, and it
s ugges ts poor validity for an item or that the ans wer key was
inc orrec t. Teac hers us ually want eac h item on the tes t to tap
into the s ame knowledge or s kill as the res t of the tes t.
T his makes gues s ing c orrec tly more likely, whic h hurts the
validity of an item. A teac her might interpret this data as
evidenc e that mos t s tudents make the c onnec tion between The
Great Gats by and Fitzgerald, and that the s tudents who don't
make this c onnec tion c an't differentiate between Faulkner and
H emingway very well.
I f you as a teac her have c onc erns about tes t fairnes s , you c an
c hange the way you teac h, c hange the way you tes t, or c hange
the way you grade the tes ts :
I f s ome items are too hard, you c an adjus t the way you
teac h. E mphas ize unlearned material or us e a different
ins truc tional s trategy. You might s pec ific ally modify
ins truc tion to c orrec t a c onfus ing mis unders tanding
about the c ontent.
T hes e c onc erns that teac hers have about the quality of their
tes ts are not muc h different than the res earc h ques tions that
s c ientis ts as k. J us t like s c ientis ts , teac hers c an c ollec t data in
their c las s room, analyze the data, and interpret res ults . T hey
c an then dec ide, bas ed on their own pers onal philos ophies , how
to ac t on thos e res ults .
Hack 30. Improve Your Test Score While
Watching Paint Dry
If you don't like the score you just got on that important high-
stakes test, maybe you should take the test again. Or should
you?
O h, and for this tes t, you don't have to s ee the ac tual ques tions
thems elves . Sc ores will c hange on this tes t without any c hange
in the c ons truc t that is being meas ured [H ac k #3 2 ]. So, all you
c an do on this quiz is gues s . Bec aus e they are true/fals e
ques tions , you will have a 5 0 perc ent c hanc e of getting any
ques tion c orrec t, and the average performanc e for your group of
1 0 tes t takers (or 1 0 0 if you are really s erious about this ...c an
you do at leas t 3 0 maybe? ...anyone? ) s hould be a s c ore of 5 out
of 1 0 .
A dminis ter the Advanced Quantum Phys ics Quiz to all the people
you were able to get. A nd when you and the others take this quiz,
don't c heat by looking at the ans wer key, even though it is only
inc hes away from your eyes right now (in Table 3 - 9 )!
C ollec t the c ompleted tes ts (make s ure they put their names on
them) and s c ore them up, us ing the ans wer key in Table 3 - 9 .
N ow, pic k your highes t s c orer (this repres ents s omeone like
you, perhaps , who s c ores higher than average on s tandardized
tes ts s uc h as the SAT ) and the lowes t s c orer (this repres ents
s omeone not like you, perhaps , who s c ores lower than average).
G ive thes e two people the quiz again (without them s eeing the
c orrec t ans wers ) and s c ore them again.
H ere's where regres s ion to the mean kic ks in. I am pretty
s urewithout knowing you or your friends or what their ans wers
areof two things :
Why It Works
What we expec t to happen with the two s c ores is that all the tes t
s c ores that are below 5 (or whatever your tes t mean was ) would
move up toward the mean, and thos e s c ores above 5 would move
down toward the mean. T his may or may not have happened with
your two s c ores , but it is the mos t probable outc ome.
Why would more extreme s c ores bec ome les s extreme with
repeated tes ting? L ook at the likelihood of getting two extreme
s c ores (s uc h as a s c ore of 2 and then another s c ore of 2 ) vers us
getting a s c ore of 2 (probability = .0 4 4 ), and then a s c ore of 4
(probability = .2 0 5 ). I t's almos t five times as likely that a
pers on with a 2 the firs t time will s c ore a 4 on a s ec ond
adminis tration. I t is almos t 9 5 perc ent c ertain that he will s c ore
higher than 2 (1 - .0 4 4 - .0 1 0 - .0 0 1 = .9 4 5 ).
With this tes t, in whic h s c ores were entirely due to c hanc e, there
is a 6 5 .6 perc ent c hanc e of s c oring at or very near the mean
(c ombining probabilities of s c ores 4 , 5 , and 6 ). With mos t tes ts ,
whic h have a greater number of items and produc e normal
dis tributions , you have a 6 8 perc ent c hanc e of s c oring at or near
the mean [H ac k #2 3 ].
Predicting the Likelihood of a Higher
Score
T his is all very interes ting, but how will it help you dec ide
whether it is worth it to take a tes t a s ec ond time? Bac k to our
original dilemma. Taking thes e important tes ts (s uc h as c ollege
admis s ions tes ts ) a s ec ond time takes more money, time,
s tres s , and, perhaps , preparation, s o one needs to be s trategic
in dec iding when to try again.
Tes t reliability
N eil Salkind
Hack 31. Establish Reliability
Some bas ic s , firs t, about tes t reliability and why you s hould
s eek out reliability evidenc e for important tes ts you take. Tes ts
and other meas urement ins truments are expec ted to behave
c ons is tently, both internally (meas uring the s ame c ons truc t
behaving in s imilar ways ) and externally (providing s imilar
res ults if they are adminis tered again and again over time).
T hes e are is s ues of reliability.
Reliability is meas ured s tatis tic ally, and a s pec ific number c an
be c alc ulated to repres ent a tes t's level of c ons is tenc y. M os t
indic es of reliability are bas ed on c orrelations [H ac k #1 1 ]
between res pons es to items within a tes t or between two s ets of
s c ores on a tes t given or s c ored twic e.
I nternal reliability
Calculating Reliability
Internal reliability
I f you took a tes t and s plit it in halfthe odd items in one half and
the even items in the other, for exampleyou c ould c alc ulate the
c orrelation between the two halves . T he formula for s plit- half
c orrelations is the c orrelation c oeffic ient formula [H ac k #1 1 ]
and is a traditional method for es timating reliability, though it is
c ons idered a bit old- fas hioned thes e days .
Test-retest reliability
Inter-rater reliability
I nternal reliability
Before you take a high- s takes tes t that c ould determine whic h
roads are open to you, make s ure that the tes t has ac c epted
levels of reliability. T he type of reliability you'd like to s ee
evidenc e of depends on the purpos e of the tes t.
T he eas ies t way to ens ure a high c oeffic ient alpha or any other
reliability c oeffic ient is to inc reas e the length of your tes t. T he
more items as king about the s ame c onc ept and the more
opportunities res pondents have to c larify their attitudes or
dis play knowledge, the more reliable a total s c ore on that tes t
would be. T his makes s ens e theoretic ally, but als o inc reas es
reliability mathematic ally bec aus e of the formula us ed to
c alc ulate reliability.
L ook bac k at the equation for c oeffic ient alpha. A s the length of
a tes t inc reas es , the variability for the total tes t s c ore inc reas es
at a greater rate than the total variability ac ros s items . I n the
formula, this means that the value in the parenthes es gets larger
as a tes t gets longer. T he n/n-1 portion als o inc reas es as the
number of items inc reas es . C ons equently, longer tes ts tend to
produc e higher reliability es timates .
Why It Works
Validity is not s omething that a tes t s c ore either has or does not
have. Validity is an argument that is made by the tes t des igner,
thos e relying on the tes t's res ults , or anyone els e who has a
s take in the ac c eptanc e of the tes t and its res ults .
D o the items on the tes t fairly repres ent the items that
c ould be on the tes t? I f a tes t is meant to c over s ome
well- defined domain of knowledge, do the ques tions fairly
s ample from that domain?
Content-Based Arguments
I f you dec ide to meas ure a c onc ept, there are many as pec ts of
that c onc ept and many different ques tions that c an be as ked on
a tes t. Some demons tration that the items you c hoos e for your
tes t repres ent all pos s ible items would be a c ontent- bas ed
argument for validity.
T his s ounds like a daunting requirement. Traditionally, this s ort
of evidenc e has been c ons idered more important for tes ts of
ac hievement. I n areas of ac hievementmedic ine, law, E nglis h,
mathematic s there are fairly well- defined domains and c ontent
areas from whic h a valid tes t s hould s ample. A c las s room
teac her als o, pres umably, has defined a s et of objec tives or
c ontent areas that a tes t s hould meas ure. Suc h c onc is ely
defined as pec ts of a s ubjec t are rarely available, however, when
tes ting a range of behaviors , knowledge, or attitudes .
C ons equently, making a reas onable argument that you have
s elec ted ques tions that are repres entative of s ome imaginary
pool of all pos s ible ques tions is diffic ult.
Criterion-Based Arguments
Construct-Based Arguments
We further hope that the tes t takers ac c urately and hones tly
res pond to tes t items . I n prac tic e, tes t res ults are often treated
as a direc t meas ure of a c ons truc t, but we s houldn't forget that
they are educ ated gues s es only. T he s uc c es s of this whole
proc es s depends on another s et of as s umptions : that we have
c orrec tly defined the c ons truc t we are trying to meas ure and
that our tes t mirrors that definition.
C ons truc t evidenc e, then, often inc ludes both a defens e of the
defined c ons truc t its elf and a c laim that the ins trument us ed
reflec ts that definition. E videnc e pres ented for c ons truc t validity
c an inc lude a demons tration that res pons es behave as theory
would expec t res pons es to behave. C ons truc t validity evidenc e
c ontinues to ac c umulate whenever a s urvey or tes t is us ed, and,
like all validity arguments , it c an never be fully c onvinc ing. I n a
s ens e, c ons truc t validity arguments inc lude both c ontent and
c riterion validity arguments , bec aus e all validity evidenc e s eeks
to es tablis h a link between a c onc ept and the ac tivity that
c laims to meas ure it.
For your s pelling tes t, there might be res earc h on the nature of
s pelling ability as a c ognitive ac tivity or pers onality trait or
s ome other well- defined entity. I f you c an define what you mean
by s pelling ability and demons trate that your tes t's s c ores
behave as your definition would expec t, then you c an c laim
c ons truc t- bas ed validity evidenc e. D oes theory s ugges t that
better readers are better s pellers ? Show that relations hip,
perhaps with a c orrelation c oeffic ient [H ac k #1 1 ], and you have
pres ented validity evidenc e that might c onvinc e others .
Consequences-Based Arguments
U ntil the las t dec ade or two, meas urement folks interes ted in
es tablis hing validity were c onc erned only with demons trating
that the tes t s c ore reflec ted the c ons truc t. Bec aus e of
inc reas ing c onc erns that c ertain tes ts might unfairly penalize
whole groups of people, plus other c onc erns about the s oc ial
c ons equenc es of the c ommon us e of tes ts , polic y makers and
meas urement philos ophers now look at the c ons equenc es
experienc ed by the tes t taker bec aus e of taking a tes t.
For your s pelling tes t, the key negative c ons equenc es you want
to rule out involve tes t bias . I f your theory of s pelling ability
expec ts no differenc es ac ros s gender, rac e, or s oc io- ec onomic
s tatus , then s pelling s c ores s hould be equal between thos e
groups . P roduc e evidenc e of s imilar s c ores between groups ,
perhaps with a t tes t [H ac k #1 7 ], and you will be well on your
way to es tablis hing that your tes t is fair and valid.
P hys ic is t J . Ric hard G ott I I I has s o far c orrec tly predic ted when
the Berlin Wall would fall and c alc ulated the duration of 4 4
Broadway s hows .1 C ontrovers ially, he has predic ted that the
human rac e will probably exis t between 5 ,1 0 0 and 7 .8 million
more years , but no longer. H e argues that this is a good reas on
to c reate s elf- s us taining s pac e c olonies : if the human rac e puts
s ome eggs in other nes ts , we might extend the life s pan of our
s pec ies in c as e of an as teroid s trike or nuc lear war on the home
planet.2
In Action
How It Works
In Real Life
Minimum
Maximum
Minimum
Maximum
Minimum
Maximum
I f the pers on does not have the dis eas e, how likely is the
pers on to s c ore a negative tes t res ult? T his likelihood is
s pecificity. O f thos e people in c olumn B, what perc ent
will rec eive a negative tes t res ult?
We are firs t interes ted in ans wering the ques tions regarding the
s ens itivity and s pec ific ity of breas t c anc er s c reening. With that
information and knowledge of the bas e rate for breas t c anc er, we
c an ans wer the mos t important ques tion:
By as king your doc tor or doing s ome res earc h, you might
dis c over that s ens itivity for mammograms is about 9 0 perc ent.
Spec ific ity is about 9 2 perc ent.
L et's return now to the third ques tion in our lis t of important
ques tions to as k before interpreting the res ults of a medic al
tes t. I f a pers on s c ores a pos itive tes t res ult, how likely is the
pers on to have the dis eas e? O ut of 1 0 ,0 0 0 women who have a
breas t c anc er s c reening, 8 9 8 will rec eive a pos itive s c ore. For
7 9 0 of thos e women, the s c ore is wrong; they do not ac tually
have breas t c anc er. For 1 0 8 of thos e women, the tes t was right;
they do have c anc er. I n other words , if a pers on s c ores a
pos itive res ult, it is only 1 2 perc ent likely that they have the
dis eas e. T he mos t c ommon res ult for follow- up tes ting to a
pos itive mammogram is that the patient is , in fac t, c anc er free.
Why It Works
To ans wer the all- important ques tion in our breas t c anc er
example ("I f a woman s c ores a pos itive tes t res ult, how likely is
s he to have breas t c anc er? "), the mammogram equation takes
on thes e values :
Making Informed Decisions
M edic al tes ts are not perfec tly reliable and valid. Tes t res ults
c an be wrong. T here are four pos s ibilities for anyone who
undergoes medic al tes ting. A patient might have the dis eas e
and the tes t indic ates this , or the patient does not have the
dis eas e and the tes t finds no pres enc e of it. I n thes e c as es , the
tes t worked right and the s c ores are valid.
C onvers ely, the tes t res ults might reflec t the oppos ite of the
true medic al c ondition, with a pos itive res ult wrongly indic ating
pres enc e of a dis eas e that is not there, or a negative res ult
wrongly indic ating that the patient is dis eas e- free. I n thes e
c as es , the tes t did not work right and the res ults are not valid.
T his table of outc omes is s imilar to the pos s ibilities when one
ac c epts or rejec ts a hypothes is in s tatis tic al dec is ion making
[H ac k #4 ].
See Also
A s lot mac hine that has n't paid out in a while is due.
A poker player who has had nothing but bad hands all
evening will s oon get a s uper c olos s al hand to even
things out.
A los ing bas eball team that has los t the las t three
games is more likely to win the fourth.
Bec aus e rolling dic e and getting three 7 s in a row is
unlikely to oc c ur, rolling a fourth after having jus t rolled
three s traight mus t be bas ic ally impos s ible.
C as inos are not required to report the ac tual money they take in
from table games , however. Bas ed on the depth of the s hag
c arpet at L um's Travel I nn of L aughlin, N evada (my favorite
c as ino), though, I 'm gues s ing c as inos do okay. T he general
gambler's hac k here is to walk away after a c ertain period of
time, whether you are ahead or behind. I f you are luc ky enough
to get far ahead before your time runs out, c ons ider running out
of the c as ino.
Systems
Bec aus e s ome c ards are fac e up, players have information. T hey
als o know whic h c ards they have in their own hands , whic h is
more information. T hey als o know the dis tribution of all c ards in
a s tandard 5 2 - c ard dec k. A ll this information about a known
dis tribution of values [H ac k #1 ] makes Texas H old 'E m a good
opportunity to s tat hac k all over the plac e [H ac ks #3 6 and
#3 8 ].
O ne partic ularly c ruc ial dec is ion point is the round of betting
right after the flop. T here are two more c ards to c ome that might
or might not improve your hand. I f you don't already have the
nuts (the bes t pos s ible hand), it would be nic e to know what the
c hanc es are that you will improve your hand on the next two
c ards . T he rule of four allows you to eas ily and fairly ac c urately
es timate thos e c hanc es .
How It Works
Example 1
So, nine c ards will help you. T he rule of four es timates that you
have a 3 6 perc ent c hanc e of making that flus h on either the turn
or the river (9 x4 = 3 6 ). So, you have about a one out of three
c hanc e. I f you c an keep playing without ris king too muc h of your
s tac k, you s hould probably s tay in the hand.
Example 2
Why It Works
For the firs t example, the rule of four es timates a 3 6 perc ent
c hanc e of making that flus h. T he ac tual probability is 3 5
perc ent. I n fac t, the es timated and ac tual perc ent c hanc e us ing
the rule of four tends to differ by a c ouple perc entage points in
either direc tion.
N otic e that this method als o works with jus t one c ard left to go,
but in that c as e, the rule would be c alled the rule of two. A dd up
the c ards you want and multiply by two to get a fairly ac c urate
es timate of your c hanc es with jus t the river remaining. T his
es timate will be off by about two perc entage points in mos t
c as es , s o s tatis tic ally s avvy poker players c all this the rule of
two plus two.
To prove this to yours elf without doing the c alc ulations , imagine
that there are 2 5 c ards (out of 4 7 ) that c ould help you. T hat's a
great s pot to be in (and right now I c an't think of a s c enario that
would produc e s o many outs ), but the rule of four s ays that you
have a 1 0 0 perc ent c hanc e of drawing one of thos e c ards . You
know that's not right. A fter all, there are 2 2 c ards you c ould
draw that don't help you at all. T he real c hanc e is 7 9 perc ent. O f
c ours e, making a mis c alc ulation in this s ituation is unlikely to
hurt you. U nder either es timate, you'd be nuts to fold.
Hack 37. Know When to Fold 'Em
When the pot odds are right, you s hould c all a hand even when
the odds are that you will los e. So, what are pot odds and why
would I ever put more money into a pot that I am likely to los e?
Pot Odds
Table 4 - 4 s hows the us e of pot odds after the flop. T helma has a
pair of ac es to s tart and hits the third ac e on the flop.
C ons equently, s he begins eac h round by betting. T he other
players who have yet to hit anything mus t dec ide whether to
s tic k around and hope to improve their hands into s trong, likely
winners .
P ot odds c ome into play primarily when making the dec is ion
whether to s tic k around or fold. L ouis e needs a five to make her
s traight, and s he es timates a 1 6 perc ent c hanc e of getting that
5 s omewhere in the next two c ards . H owever, with that pot
c urrently at $ 2 5 0 and a $ 5 0 rais e from T helma, whic h s he would
have to c all, L ouis e would have to pay 2 0 perc ent of the pot.
T his is a 2 0 perc ent c os t c ompared with a 1 6 perc ent c hanc e of
winning the pot. T he ris k is greater than the payoff, s o L ouis e
folds . M ike and V inc e, however, have more outs , s o pot odds
dic tate that they s tic k around.
L et's as s ume that the players are us ing only pot odds to make
their dec is ions , ignoring for the s ake of illus tration that they are
probably trying to get a read on the other players (e.g., who c ould
bluff, rais e, and s o on). By the way, players are c alc ulating the
c hanc e that they will get a c ard to improve their hand us ing the
rule of four and the rule of 2 + 2 [H ac k #3 6 ].
Why It Works
I nteres tingly, if V inc e had been betting into a s lightly larger pot
that c ontained M ike's c all, the pot odds for V inc e's 1 0 0 - c hip
c all would then have dropped to 1 8 perc ent and V inc e might
have c alled. I n fac t, if M ike were a s uper genius - type player, he
well c ould have c alled on the turn knowing that would c hange the
pot odds for V inc e and therefore enc ourage him to c all. Real- life
profes s ional poker players who are really, really goodreally do
think that way s ometimes .
Remember that pot odds are bas ed on the as s umption that you
will be playing poker for an infinite amount of time. I f you are in a
no- limit tournament format, though, where you c an't dig into your
poc kets , you might not be willing to ris k all or mos t of your c hips
on your faith about what will happen in the long run.
T he other problem with bas ing life and death dec is ions on pot
odds is that you are treating a "really good hand" as if it were a
guaranteed winner. O f c ours e, it's not. T he other players may
have really good hands , too, that are better than yours .
Hack 38. Know When to Walk Away
In Texas Hold 'Em, when you are "short-stacked," you have only
a couple of choices: go all-in right now or go all-in very soon. A s
you might have guessed, knowing when to make your last stand
is all about the odds.
I t does n't feel eas y, though, does it, when you are s hort- s tac ked
and have to go all-in (bet everything you have)? I t feels very,
very hard for two reas ons :
You are probably not going to win the tournament. You
realize that you are down to very few c hips and would
have to double up s everal times to get bac k in the game.
Realis tic ally, you doubt that you have muc h of a c hanc e.
T hat's depres s ing, and any dec is ion you make when you
are s ad is diffic ult.
O ne mis take and you are out. T here is little margin for
error, and it is hard to pull the trigger in s uc h a high-
s takes s ituation.
A pplying s ome bas ic s tatis tic al princ iples to the dec is ion might
help make you feel better. A t leas t you'll have s ome
nonemotional guidelines to follow. When you los e (and you s till
probably will; you're s hort- s tac ked, after all), now you c an blame
me, or the fates , and not yours elf.
T hough you c ould play quite a while longer without running out of
c hips , you will want to bet on any dec ent hand. You hope to win
s ome blinds here. T he more blinds you win, the longer you c an
wait for killer hands . I f you are rais ed, at leas t c ons ider
res ponding with an all- in.
I n any pos ition, whether you are on the button, in the big blind,
or the firs t to bet, c ons ider announc ing all- in with any top- 1 0
hand. You s till have enough c hips here to s c are off s ome players ,
es pec ially thos e with s imilarly s ized s tac ks .
You are s tarting to get low enough, though, that you really want
to be c alled. I f you c an play s ome low pairs c heaply, try it, but
bail out if you don't get three of a kind in the flop. You need to
keep as many big blinds as you c an to c oas t on until you get
that all-in opportunity.
H ere are the 1 0 hands that are the mos t likely to double you up:
A t this point, you need to go all- in, even on hands that have a
more than 5 0 perc ent c hanc e of los ing. P urpos efully making a
bad wager s eems c ounterintuitive, but you are fighting agains t
the ever- s hrinking bas e amount you hope to double up. I f you
wait and wait until you have c los e to a s ure thing, whatever
s tac k remains will have to be doubled a few extra times to get
you bac k.
I 'm going to group 5 0 dec ent, playable s tarting Texas H old 'E m
poker hands , hands that give you a c hanc e to win agains t a
s mall number of opponents . I 'll be us ing three groupings , s hown
in Tables 4 - 6 , 4 - 7 , and 4 - 8 . While different poker experts might
quibble a bit about whether a given hand is good or jus t okay,
mos t would agree that thes e hands are all at leas t playable and
s hould be c ons idered when s hort- s tac ked.
Finally, imagine that you have jus t a few hands left bec aus e the
blinds are s hrinking your s tac k down to nothing. You look down
and s ee a dec ent hand, an okay hand, s uc h as 8 - 7 in the s ame
s uit. Table 4 - 9 allows you to ans wer the big ques tion: is it likely
that your very next hand will be better than this one? T here is
about an 1 1 perc ent c hanc e of getting a good or great hand next.
So, no, it is unlikely you will improve. Stake your future on this
hand.
Be realis tic
Take control
O f c ours e, he might get luc ky and win s ome money, whic h would
be dandy, but the L aw of Big N umbers [H ac k #2 ] mus t be
obeyed. I n the long run, he is mos t likely to have les s money
than if he had never played at all. I n fac t, if he plays an infinite
amount of time, he is guaranteed to los e money. (M os t roulette
players play for a period of time s omewhat les s than infinity, of
c ours e.) To extend your amount of playing time, there is
important s tatis tic al information you s hould know about this
game with the s pinning wheel, the orbiting ball, and the blac k and
red layout.
Basic Wagers
T here are 3 8 s pac es on the wheel and, bec aus e all 3 8 pos s ible
outc omes are equally likely, the c alc ulations are fairly
s traightforward. Table 4 - 1 0 s hows the types of bets players c an
make, the information nec es s ary to c alc ulate the odds of winning
for a s ingle s pin of the wheel and a one- dollar bet, the ac tual
amounts the c as ino pays out, and the hous e advantage.
Why It Works
Statis tic ians c an play this game wis ely by us ing two s ourc es of
information: the dealer's fac e- up c ard and the knowledge of
c ards previous ly dealt. Bas ic s trategies bas ed on probability will
let s mart players play almos t even agains t the hous e without
having to pay muc h attention or learn c omplic ated s ys tems .
M ethods of taking into ac c ount previous ly dealt c ards are
c ollec tively c alled counting cards , and us ing thes e methods
allows players to have a s tatis tic al advantage over the hous e.
U .S. c ourts have ruled that c ard c ounting
is legal in c as inos , though c as inos wis h
you would not do it. I f they dec ide that you
are c ounting c ards , they might as k you to
leave that game and play s ome other
game, or they might ban you from the
c as ino entirely. I t is their right to do this .
Basic Strategy
T he remaining four c olumns pres ent the typic al options and what
the dealer's c ard s hould be for you to c hoos e eac h option. A s
you c an s ee, for mos t hands there are only a c ouple of options
that make any s tatis tic al s ens e to c hoos e. T he table s hows the
bes t move, but not all c as inos allow you to double- down on jus t
any hand. M os t, however, allow you to s plit any matc hing pair of
c ards .
Why It Works
8 /1 3 = .6 1 6
O f c ours e, even though the s ingle bes t gues s is that the dealer
has a 1 0 down, there is ac tually a better c hanc e that the dealer
does not have a 1 0 down. A ll the other pos s ibilities (9 /1 3 ) add
up to more than the c hanc es of a 1 0 (4 /1 3 ).
N ow, imagine that you have a 1 6 agains t the dealer's down c ard
of 6 . Your c hanc e of bus ting when you take a c ard is 6 2 perc ent.
C ompare that 6 2 perc ent c hanc e of an immediate los s to the
dealer's c hanc e of beating a 1 6 , whic h is 5 8 perc ent. Bec aus e
there is a greater c hanc e that you will los e by hitting than that
you will los e by not hitting (6 2 is greater than 5 8 ), you s hould
s tay agains t the 6 , as Table 4 - 1 2 indic ates .
A ll the branc hing pos s ibilities for all the different permutations
of s tarting hands vers us dealers ' up c ards res ult in the
rec ommendations in Table 4 - 1 2 .
Sucker Bet
E laborate and very s ound (s tatis tic ally s peaking) methods exis t
for keeping trac k of c ards previous ly dealt. I f you are s erious
about learning thes e tec hniques and dedic ating yours elf to the
life of a c ard c ounter, more power to you. I don't have the s pac e
to offer a c omplete, c omprehens ive s ys tem here, though. For the
res t of us , who would like to dabble a bit in ways to inc reas e our
odds , there are a few c ounting proc edures that will improve your
c hanc es without you having to work partic ularly hard or
memorize many c harts and tables .
Counting Aces
You get even money for all wins , exc ept when you are dealt a
blac kjac k. You get a 3 - to- 2 payout (e.g., $ 1 5 for every $ 1 0 bet)
when a blac kjac k c omes your way. C ons equently, when there is a
better- than- average c hanc e of getting a blac kjac k, you would
like to have a larger- than- average wager on the line.
4 /1 3 x4 /5 1 = .0 2 4 1
Getting an Ace firs t and then a 10-card
1 /1 3 x1 6 /5 1 = .0 2 4 1
O bvious ly, you c an't get a blac kjac k unles s there are A c es in
the dec k. When they are gone, you have no c hanc e for a
blac kjac k. When there are relatively few of them, you have les s
than the normal c hanc e of a blac kjac k. With one dec k, a
previous ly dealt A c e lowers your c hanc es of hitting a blac kjac k
to .0 3 6 2 (about 3 .6 perc ent). D ealing a quarter of the dec k with
no A c es s howing up inc reas es your c hanc es of a blac kjac k to
about 6 .5 perc ent.
E ven if you jus t us e thes e s imple s ys tems , you will improve your
c hanc es of winning money at the blac kjac k tables . Remember,
though, that even with thes e s orts of s ys tems , there are other
pitfalls awaiting you in the c as ino, s o be s ure to always follow
other good gambling advic e [H ac k #3 5 ] as well.
Hack 41. Play Smart When You Play the
Lottery
O f c ours e, if I don't play, I c an't win. Buying a lottery tic ket is n't
nec es s arily a bad bet, and if you are going to play, there are a
few things you c an do to inc reas e the amount of money you will
win (probably) and inc reas e your c hanc es of winning (pos s ibly).
Whoever bought the winning $ 3 4 0 million tic ket in J ac ks onville,
O regon, that O c tober day likely followed a few of thes e winning
s trategies , and you s hould too.
O f c ours e, c orrec tly pic king all the winning numbers is hard to
do, but it's what you need to do to win the jac kpot. I n P owerball,
you c hoos e five numbers and then a s ixth number: the red
powerball. T he regular white numbers c an range from 1 to 5 5 ,
and the powerball c an range from 1 to 4 2 . Table 4 - 1 4 s hows the
different c ombinations of matc hes that res ult in a prize, the
amount of the prize, and the odds and probability of winning the
prize.
Powerball Payoff
A rmed with all the wis dom you likely now have as a s tatis tic ian
(unles s this is the firs t hac k you turned to in this book), you
might have already made a few interes ting obs ervations about
this payoff s c hedule.
Easiest prize
T he eas ies t prize to win is the powerball only matc h, and even
then there are s lim c hanc es of winning. I f you matc h the
powerball (and no other numbers ), you win $ 3 . T he c hanc es of
winning this prize are about 1 in 6 9 .
So, after having matc hed the powerball, you have a little better
than a third c hanc e of hitting at leas t one white ball as well. E ven
s o, your expec ted payoff is about $ 3 .3 9 for every $ 6 9 you throw
down that rat hole (I mean, s pend on the lottery), whic h is s till
not a good bet.
Powerball only
T he odds for the powerball only matc h don't s eem quite right. I
s aid there were 4 2 different numbers to c hoos e from for the
powerball, s o s houldn't there be 1 out of 4 2 c hanc es to matc h it,
not 1 in 6 9 ?
Yes , but remember this s hows the c hanc es of hitting that prize
only and not doing better (by matc hing s ome other balls ). Your
odds of winning s omething, anything, if you c ombine all the
winning permutations together are 1 in 3 7 , about 3 perc ent. Still
not a good bet.
Grand prize
T he odds for the grand prize don't s eem quite right either. (O kay,
okay, I don't really expec t you to have "notic ed" that. I didn't
either until I did a few c alc ulations .)
For P owerball and its number of balls and their range of values ,
1 4 6 ,1 0 7 ,9 6 2 is the magic number. T he idea that your c hanc es
of winning haven't c hanged but the payoff amount has inc reas ed
to a level where playing is worthwhile is s imilar to the c onc ept of
pot odds in poker [H ac k #3 7 ].
O ne important hint about dec iding when to buy lottery tic kets
has to do with determining the actual magic number, the prize
amount, whic h triggers your buying s pree. T he amount that is
advertis ed as the jac kpot is not, in fac t, the jac kpot. T he
advertis ed "jac kpot" is the amount that the winner would get
over a period of years in a regular s eries of s maller portions of
that amount. T he real jac kpotthe amount you s hould identify as
the payout in the gambling and s tatis tic al s ens eis the amount
that you would get if you c hos e the one lump s um option. T he one
lump s um is typic ally a little les s than half of the advertis ed
jac kpot amount.
So, if you have determined that your lottery has grown a jac kpot
amount that s ays it is now s tatis tic ally a good time to play, how
many tic kets s hould you buy? Why not buy one of eac h? Why not
s pend $ 1 4 6 ,1 0 7 ,9 6 2 and buy every pos s ible c ombination? You
are guaranteed to win. I f the jac kpot is greater than that amount,
then you'll make money, guaranteed, right? Well, ac tually not.
O therwis e, I 'd be ric h and I would never s hare this hac k with
you. Why wouldn't you be guaranteed to win? T he probably is
that you might be forc ed to...wait for it...s plit the prize! A rgh!
See the next s ec tion...
Firs t off, I 'm working under the as s umption that the winning
number is randomly c hos en. I tend not to be a c ons pirac y
theoris t, nor do I believe that G od has the time or inc lination to
affec t the drawing of winning lottery numbers , s o I 'm going to not
lis t any s trategy that would work only if there were not
randomnes s in the drawing of lottery numbers . H ere are s ome
more reas onable tips to c ons ider when pic king your lottery
numbers :
T hough your odds of winning a giant lottery prize are s lim, you
c an follow s ome s tatis tic al princ iples and do a few things to
ac tually c ontrol your own des tiny. (T he word for des tiny in
I talian, by the way, is lotto.) O h, and one more thing: buy your
tic ket on the day of the drawing. I f too muc h time pas s es
between your purc has e and the announc ement of the winning
numbers , you have a greater likelihood of being hit by lightning,
drowning in the bathtub, or being s truc k by a minivan than you
do of winning the jac kpot. T iming is everything, and I 'd hate for
you to mis s out.
Hack 42. Play with Cards and Get Lucky
P eople, es pec ially c ard players , and es pec ially poker players ,
feel pretty good about their level of unders tanding of the
likelihood that different c ombination of c ards will appear. T heir
experienc e has taught them the relative rarity of pairs , three- of-
a- kind, flus hes , and s o on. G eneralizing that intuitive knowledge
to playing- c ard ques tions outs ide of game s ituations is diffic ult,
however.
The bet
A little flus h (oops , s orry; I mean li'l flus h) is any two c ards of
the s ame s uit. Frank has a wager that he almos t always wins
that has to do with finding two c ards of the s ame s uit in your
hand. A gain, bec aus e of time c ons traints , his poker hands have
only four c ards , not five.
Why it works
N ext, imagine the firs t two c ards of a four- c ard hand. T hes e will
matc h only .2 3 5 2 of the time (1 2 c ards of the s ame s uit remain
out of a 5 1 - c ard dec k). So, about one- and- a- half million four-
c ard deals will find a flus h in the firs t two c ards . T hey won't
matc h another .7 6 4 8 of the time. T his leaves 4 ,9 6 8 ,6 0 1 hands
with two differently s uited firs t two c ards .
O f that number of hands , how many will not rec eive a third c ard
that does not s uit up with either of the firs t two c ards ? T here are
5 0 c ards remaining, and 2 6 of thos e have s uits that have not
appeared yet. So, 2 6 /5 0 (5 2 perc ent) of the time, the third c ard
would not matc h either s uit.
You have a dec k of c ards . I have a dec k of c ards . T hey are both
s huffled (or, perhaps , s ouffl\x8 e d, as my s pell c hec k s ugges ted
I meant to s ay). I f we dealt them out one at a time and went
through both dec ks one time, would they ever matc h? I mean,
would they ever matc h exactly, with the exac t s ame c ardfor
example, us both turning up the J ac k of C lubs at the s ame time?
The bet
Why it works
H ere's how to think about this problem s tatis tic ally. Bec aus e the
dec ks are s huffled, one c an as s ume that any two c ards that are
flipped up repres ent a random s ample from a theoretic al
population of c ards (the dec k). T he probability of a matc h for any
given s ample pair of c ards c an be c alc ulated. Bec aus e you are
s ampling 5 2 times , the c hanc e of getting a matc h s omewhere in
thos e attempts inc reas es as you s ample more and more pairs of
c ards . I t is jus t like getting a 7 on a pair of dic e: on any given
roll, it is unlikely, but ac ros s many rolls , it bec omes more likely.
To c alc ulate the probability of hitting the outc ome one wis hes
ac ros s a s eries of outc omes , the math is ac tually eas ier if one
c alc ulates the c hanc es of not getting the outc ome and
multiplying ac ros s attempts . For any given c ard, there is a 1 out
of 5 2 c hanc e that the c ard in the other dec k is an exac t matc h.
T he c hanc es of that not happening are 5 1 out of 5 2 , or .9 8 0 8 .
You are trying to make a matc h more than onc e, though; you are
trying 5 2 times . T he probability of not getting a matc h ac ros s
5 2 attempts , then, is .9 8 0 8 multiplied by its elf 5 2 times . For
you math types , that's .9 8 0 8 52.
Wait a s ec ond and I 'll c alc ulate that in my head (.9 8 0 8 times
.9 8 0 8 times .9 8 0 8 and s o on for 5 2 times is ...about...0 .3 6 4 3 ).
O K, s o the c hanc e that it won't happen is .3 6 4 3 . To get the
c hanc e that it will happen, we s ubtrac t that number from 1 and
get .6 3 5 7 .
You'll find at leas t one matc h between two dec ks about two-
thirds of the time! Remarkable. G o forth and win that free
lemonade.
Hack 43. Play with Dice and Get Lucky
Here are some honest wagers using honest dice. Just because
you aren't cheating, though, doesn't mean you won't win.
H ere are a few ways to win a bet us ing any fair pair of dic e.
Firs t, let's get ac quainted with the pos s ibilities of two dic e rolled
onc e. You'll rec all that mos t dic e have s ix s ides (my fantas y
role- playing friends and I c all thes e s ix-s ided dice) and that the
values range from 1 to 6 on eac h c ube.
C alc ulating the pos s ible outc omes is a matter of lis ting and
c ounting them. Figure 4 - 2 s hows all pos s ible outc omes for
rolling two dic e.
T his dis tribution res ults in the frequenc ies s hown in Table 4 - 1 5 .
I n fac t, if you didn't have to be s pec ific , you c ould wager that a 6
or an 8 will c ome up before a 7 does . O f all totals that c ould be
s howing when thos e dic e are done rolling, more than one- fourth
of the time (about 2 8 perc ent) the dic e will total 6 or 8 . T his is
s ubs tantially more likely than a 7 , whic h c omes up only one-
s ixth of the time.
Sometimes , old Frankie would wager that on any one roll of a pair
of dic e, there would be a 6 or a 1 s howing. T hough, at firs t
thought, there would s eem to be at leas t a les s than 5 0 perc ent
c hanc e of this happening, the truth is that a 1 or 6 will be
s howing about 5 6 perc ent of the time. T his is the s ame
probability for any two different numbers , by the way, s o you
c ould us e an attrac tive s tranger's birthday to pic k the digits and
maybe s tart a c onvers ation, whic h c ould lead to marriage,
c hildren, or both.
A B
2 or 12 3
2, 3, or 4 7
5, 6, or 7 8, 9, 10, 11, or 12
Why It Works
For the bets pres ented in this hac k, here are the c alc ulations
demons trating the probability of winning:
Number
of
Wager Calculation Resulti
winning
outcomes
5 or 9
8 versus 6 8/14 .571
versus 7
1 or 6
20 20/36 .556
showing
2 or 12
2 versus 2 2/4 .500
versus 3
2, 3 or 4
6 versus 6 6/12 .500
versus 7
5, 6 or 7
versus 8 15 versus 15 15/30 .500
or higher
T he "Wager" c olumn pres ents the two c ompeting outc omes (e.g.,
will a 5 or 9 c ome up before a 7 ? ). T he "N umber of winning
outc omes " c olumn indic ates number of different dic e rolls that
would res ult in either s ide of the wager (e.g., 8 c hanc es of getting
a 5 or 9 vers us only 6 c hanc es of getting a 7 ). T he "Res ulting
proportion" c olumn indic ates your c hanc es of winning.
In Texas Hold 'Em and other poker games, there are a f ew basic
preliminary skills and a bit of basic knowledge about probability
that will immediately push you f rom absolute beginner to the
more comf ortable level of knowing just enough to get into
trouble as a card sharp.
Q uic kly identifying the pos s ible better hands that c ould
be held by other players
T his hac k pres ents s ome tips and tools for moving from novic e
to s emi- pro. T hes e are s ome s imple hunks of knowledge and
quic k rules of thumb for making dec is ions . L ike the other poker
hac ks in this book, they provide s trategy tips bas ed purely on
s tatis tic al probabilities , whic h as s ume a random dis tribution of
c ards in a s tandard 5 2 - c ard dec k.
H alf the time, you will get a pair or better in Texas H old 'E m. I 'll
repeat that bec aus e it is s o important in unders tanding the
game. H alf the time (a little under 5 2 perc ent ac tually), if you
s tay in long enough to s ee s even c ards (your two c ards plus all
five c ommunity c ards ), you will have at leas t one pair. I t might
have been in your hand (a pocket or wired pair), it might be made
up of one c ard in your hand and one from the c ommunity c ards ,
or your pair might be entirely in the c ommunity c ards for
everyone to c laim.
I f for the majority of the time the average player will have a pair
when dealt s even c ards , then s tic king around until the end with a
low pair means you areonly s tatis tic ally s peaking, of
c ours elikely to los e. I n other words , there is a greater than 5 0
perc ent c hanc e that the other player has at leas t a pair, and that
pair will probably be 8 s or higher (only s ix out of thirteen pairs
are 7 s or lower.)
Probabilities
D ec is ions about s taying in or rais ing your bet in an attempt to
lower the number of opponents you have to beat c an be made
more wis ely if you know s ome of the c ommon probabilities for
s ome of the c ommonly hoped- for outc omes . Table 4 - 1 6
pres ents the probability of drawing a c ard that helps you at
various s tages in a hand. T he probabilities are c alc ulated bas ed
on how many c ards are left in the dec k, how many different c ards
will help you (your outs ), and how many more c ards will be drawn
from the dec k. For example, if you have an A c e- King and hope to
pair up, there are s ix c ards that c an make that happen; in other
words , you have s ix outs . I f you have only an A c e high but hope
to find another A c e, you have three outs . I f you have a poc ket
pair and hope to find a powerful third in the c ommunity c ards ,
you have jus t two outs .
Implications
H ere are a few quic k obs ervations and implic ations to etc h in
your mind, bas ed on the dis tribution des c ribed in Table 4 - 1 6 .
H alf the time, you will pair up. T his is true for high c ards , s uc h as
Big Slick (A c e- King) or low c ards , s uc h as 2 - 7 . You c an even
pic k from the two c ards you have and pair that one up 2 8
perc ent of the time. I mplic ation: when low on c hips in
tournament play, go all- in as s oon as you get that A c e.
I f you don't hit the third c ard, you need to turn a pair into a s et
(three of a kind) on the flop, and there is only an 8 perc ent
c hanc e you will hit it down the road. I mplic ation: don't s pend too
muc h money waiting around for your low pair to turn into a
gangbus ter hand.
Some events that s eem logic ally unlikely c an ac tually turn out
to be quite probable in s ome c as es . O ne s uc h example is
determining the probability that at leas t two people in a group
s hare a birthday. M any people are s hoc ked to learn that as long
as there are at leas t 2 3 people in the group, there is a better
than 5 0 perc ent c hanc e that at leas t 2 of them will have the
s ame birthday! By us ing a few s imple rules of probability, you
c an figure out the likelihood of this event oc c urring for groups of
any s ize, and then amaze your friends when your predic tions
c ome true.
Getting Started
T his is not nec es s arily perfec tly true, but it's c los e enough for
us to s till trus t our res ults . H owever, there is one birthday for
whic h this is definitely not true: February 2 9 , whic h oc c urs only
every four years on L eap Year. T he good news is that few enough
people are born on February 2 9 that it is eas y for us to jus t
ignore it and s till get ac c urate es timates .
I n our problem, there are only two mutually exclus ive pos s ible
outc omes :
Sinc e one of thes e two things mus t oc c ur, the s um of the two
probabilities will always be equal to one. Statis tic ians c all this
the Law of Total Probability, and it c omes in handy for this
problem.
A s imple c oin- flipping example c an help pic ture how this works .
With a fair c oin, the probability of getting a heads is 0 .5 , jus t as
the probability of getting a tails is 0 .5 (whic h is another example
of mutually exc lus ive events , bec aus e the c oin c an't c ome up
heads and tails in the s ame flip! ). O nc e you flip the c oin, one of
two things has to happen. I t mus t land either heads up or tails
up, s o the probability of heads or tails oc c urring is 1 (0 .5 + 0 .5 ).
C onvers ely, we c an think of the probability of heads as one
minus the probability of tails (1 - 0 .5 = 0 .5 ), and vic e vers a.
So, when you flip a c oin twic e, one out of every four times the
res ults will be two heads in a row. I f you wanted to know the
probability of flipping three heads in a row, the ans wer is 0 .1 2 5
(0 .5 x0 .5 x0 .5 ), whic h means that three heads in a row happens
only one time out of every eight.
To review, no matter how many people are in our group, only one
of two mutually exclus ive events c an oc c ur: at leas t two people
s hare a birthday or no one s hares a birthday. Bec aus e of the L aw
of Total P robability, we know that we c an determine the
probability of no one s haring a birthday, and one minus that
value will be equal to the probability that at leas t two s hare a
birthday. L as tly, we als o know that eac h pers on's birthday is
independent of the other group members . G ot all that? G ood, let's
proc eed!
T he firs t frac tion is the probability that the s ec ond pers on won't
s hare a birthday with the firs t pers on. T he s ec ond frac tion is the
probability that the third pers on won't s hare a birthday with the
firs t two. T he third frac tion is the probability that the fourth
pers on won't s hare a birthday with the firs t three, and s o on. T he
ninth and final frac tion is the probability that the tenth pers on
won't s hare a birthday with any of the other nine.
A s the group s ize inc reas es , it bec omes inc reas ingly more likely
that at leas t two people will s hare a birthday. T his makes perfec t
s ens e, but what s urpris es mos t people is how quic kly the
probability inc reas es as the group gets bigger. Figure 4 - 3
illus trates the rate at whic h the probability goes up when you
add more and more people.
When all is s aid and done, this is a pretty neat tric k that never
c eas es to s urpris e people. But remember to make the bar bet
only if you have at leas t 2 3 people in the room (and you're
willing to ac c ept 5 0 /5 0 odds ). I t works even better with more
people, bec aus e your c hanc es of winning go up dramatic ally
every time another pers on is added. To have a better than 9 0
perc ent c hanc e of winning your bet, you'll need 4 1 people in the
room (probability of at leas t 2 people s haring a birthday =
0 .9 0 3 ). With 5 0 people, there's a 9 7 perc ent c hanc e you'll win
your money. O nc e you have 6 0 people or more, you are
prac tic ally guaranteed to have at leas t 2 people in the room who
s hare a birthday and, of c ours e, if you have 3 6 6 people pres ent,
there is a 1 0 0 perc ent c hanc e of at leas t 2 people s haring a
birthday. T hos e are great odds if you c an get s omeone to take
the bet!
William Skorups ki
Hack 46. Design Your Own Bar Bet
Principle 1
T he res t of this hac k will s how you how to us e thes e princ iples to
your advantage in your own c us tom- made bar bets .
Principle 1
What about the probability that any one of two people reading
this book has the s ame birth month as me? I ntuitively, that
s hould be a bit more likely than 1 out of 1 2 . T he formula to figure
this out is not quite as s imple as one would like, unfortunately. I t
is not 1 /1 2 times its elf, for example. T hat would produc e a
s maller probability than we began with (i.e., 1 /2 4 ). N or is the
formula 1 /1 2 + 1 /1 2 . T hough 2 /1 2 s eems to have promis e as
the right ans werbec aus e it is bigger than 1 /1 2 , indic ating a
greater likelihood than beforethes e s orts of probabilities are not
additive. To prove to yours elf that s imply adding the two
frac tions together won't work, imagine that you had 1 2 people in
the problem. T he c hanc e of finding a matc h with my birth month
among the 1 2 is obvious ly not 1 2 /1 2 , bec aus e that would
guarantee a matc h.
Principle 2
For this wager, I 'll pic k five letters of the alphabet. I bet that if I
c hoos e s ix people and as k them to randomly pic k any s ingle
letter, one or more of them will matc h one of my five letters .
H ere's how the bet plays out:
Number of attempts
6
.8 0 8 6 = .2 7 8
1 - .2 7 8 = .7 2 2
10
9 0 10 = .3 4 9
1 - .3 4 9 = .6 5 1
On your own
C opy the s teps and c alc ulations jus t s hown to develop your own
original party tric ks . N one of thes e demons trations require any
props , jus t a willing and hones t volunteer.
I t is fun to play with others , but you never know when you will get
c aught in s omeone els e's c lever s tatis tic s trap. For ins tanc e,
remember that 1 - out- of- 1 2 c hanc e that you have the s ame birth
month as me? I fooled you! I was born in February. T here are
fewer days in that month than the others , s o your c hanc es of
being born in that month are ac tually les s than 1 out of 1 2 .
T here are 2 8 .2 5 days in February (an oc c as ional February 2 9
ac c ounts for the .2 5 ) and 3 6 5 .2 5 days in the year (the
oc c as ional L eap Year ac c ounted for again). T he c hanc e that you
were born in the s ame month as me is 2 8 .2 5 /3 6 5 .2 5 , or 7 .7 3
perc ent, not the 8 .3 3 perc ent that is 1 out of 1 2 .
So, you are les s likely to have the s ame birth month as me.
C ome to think of it, the rec ords of my birth, my birth c ertific ate,
and s o on were los t in a fire many years ago. So, the original
data about my birth is now mis s ing.
Before we talk about the problem with playing with wild cards
(c ards , often jokers , that c an take on any value the holder
wis hes ), let's review the ranking of poker hands . Table 4 - 1 7
s hows the probability that a given hand will oc c ur in any random
five c ards , as well as eac h hand's relative rarity when c ompared
to the hand ranked jus t below it in the table.
T his is all very interes ting, but what does it have to do with the
us e of wild c ards ? Well, adding wild c ards to the dec k s c rews up
all of thes e time- tes ted probabilities . A s s uming that the holder
of a wild c ard wis hes to make the bes t hand pos s ible, and als o
as s uming that one wild c ard, a joker, has been added to the dec k,
Table 4 - 1 8 s hows the new probabilities , c ompared to the
traditional ones .
Why It Works
But as it turns out, if you s pin a c oin, es pec ially a brand- new
c oin, it might land tails up more often than heads up.
You know the look and feel of a brand- new, mint- c ondition
penny? I t's s o bright that it looks fake. I t's s o detailed and
s harp around the edges that you have to be c areful not to c ut
yours elf.
Well, get yours elf one of them bright, s harp little fellas and s pin it
1 0 0 times or s o. C ollec t data on the heads - or- tails res ults , and
prepare to be amazed bec aus e tails is likely to c ome up more
than 5 0 times . I f our unders tanding of the fairnes s of c oins is
c orrec t, a c oin s hould c ome up tails more than half the time les s
than half the time. (Say that las t s entenc e out loud and it makes
more s ens e.) N ot with the s pin of a new penny, though.
Binomial Expectations
T he pos s ible exis tenc e of a bottle-cap effect pres ents a tes table
hypothes is :
N otic e that the power of this analys is really inc reas es as the
s ample s ize gets big [H ac k #8 ]. You need only a s light
fluc tuation from the expec ted to s upport your hypothes is if you
s pin that c oin 5 0 0 or 1 ,0 0 0 times . With 1 0 0 s pins , you need to
s ee a proportion of tails at or above .5 8 to believe that there
really is an advantage for tails with a newly minted penny.
T he dis tanc e of the obs erved proportion from the expec ted
proportion is expres s ed as a z s c ore [H ac k #2 6 ]. H ere's the
equation that produc es z s c ores and generated the data in Table
4 -1 9 :
O nc e you prove to yours elf that this tail advantage is real, heed
this reminder before you go running off to win all s orts of c razy
wagers . You mus t s pin the c oin! D on't flip it. Say it with me:
s pin, don't flip.
See Also
T he s tandard dec is ion- making proc es s for s tatis tic ally s avvy
gamblers involves figuring the average payoff for a hypothetic al
wager and the c os t to play, and then determining whether they
are likely to break even or, better yet, make a boatload of money.
T hough one c ould produc e dozens of s tatis tic al analys es of
gambling all about when a pers on s hould and s houldn't play, the
ps yc hology of the human mind s ometimes takes over, and people
will refus e to take a wager bec aus e it jus t does n't feel right.
2. Flip a c oin. I f it c omes up heads , you win and I 'll pay you
$2.
4. Suppos ing heads s till has n't c ome up, we flip again.
H eads on this third flip, and I pay you 2 3 ($ 8 ).
So far, it s ounds pretty good and more than fair for you. But it
gets better. We keep flipping until heads c omes up. When it
eventually arrives , I pay you $ 2 n, where n is the number of flips
it took to get heads .
G reat game, at leas t from your pers pec tive. But here's the killer
ques tion: how muc h would you pay to play?
You als o probably would gladly pay a full $ 2 . You will win the $ 2
bac k half the time, and the other half of the time you will get
muc h more than that! T his is a game you are guaranteed to win
eventually, s o it's not a ques tion of winning. When you don't get
heads the firs t time, you have guaranteed yours elf at leas t $ 4
bac k, and pos s ibly morepos s ibly muc h more.
Statistical Analysis
Some s oc ial s c ienc e res earc hers s ugges t that mos t people
would play this game for s omething around four buc ks , maybe a
little more. Few would pay muc h more. What about s tatis tic ally,
though? What is the mos t you s hould pay?
Proportion
Flips Likelihood Winning
of games
1 1 out of 2 .50 $2
2 1 out of 4 .25 $4
3 1 out of 8 .125 $8
4 1 out of 16 .0625 $16
5 1 out of 32 .03125 $32
6 1 out of 64 .015625 $64
N otic e a few things about this s eries of flips and how the
c hanc es drop at the s ame rate as the winnings go up:
T he dec is ion rule among us Stats Fan C lub members for whether
to play a gambling game is whether the expected value of the
game is more than the c os t of playing. E xpec ted value is
c alc ulated by adding up the expec ted payoff for all pos s ible
outc omes .
You'll rec all that the expec ted payoff for eac h pos s ible trial is
$ 1 . T here are an infinite number of pos s ible outc omes , bec aus e
that c oin c ould jus t keep flipping forever. To get the expec ted
value, we s um this infinite s eries of $ 1 and get a huge total. T he
expected value for this game is infinite dollars . Sinc e you s hould
play any game where the c os t of playing is les s than the
expec ted value, you s hould play this game for any amount of
money les s than infinity.
O f c ours e, in real life, people won't pay muc h more than $ 2 for
s uc h a game, even if they knew all the s tatis tic s . N o one really
knows for s ure why s mart people turn their nos es up at paying
very muc h money for s uc h a pros pec t, but here are s ome
theories .
Infinite is a lot
E ven if you ac c ept in s pirit that the game is fair over the long run
and would oc c as ionally pay off really big if you played it many,
many times , that "long run" is infinitely long, whic h is an awfully
long time. Few people have the patienc e or deep enough poc kets
to play a game that relies on s o muc h patienc e and demands
s uc h a large fee.
Some philos ophers would argue that people do not ac c ept the
c onc ept of infinity as a c onc rete reality. A ny s ales pitc h to
enc ourage people to play this game by promoting the infinity
as pec ts would be les s than c ompelling.
T his might be why I don't buy lottery tic kets . I don't play the
lottery bec aus e my odds of winning are inc reas ed only s lightly
by ac tually playing. I n my mind, the odds of me winning are
infinitely s mall, or c los e enough to it that I don't treat the
pos s ibility of winning as real.
See Also
Sinc e his tory is always our bes t guide to the future, your bes t
predic tions will require various ways to trac k, vis ualize [H ac k
#5 7 ], and rank [H ac k #5 9 ] the performanc e of teams and
players .
O f c ours e, if you have the heart of a true s tatis tic s hac ker, then
you think that s ome s tatis tic al games s uc h as building a learning
c omputer out of c oc onuts [H ac k #5 2 ], doing c ard tric ks through
the mail [H ac k #5 3 ], keeping your iP od hones t [H ac k #5 4 ], or
es timating the value of pi purely by c hanc e [H ac k #6 0 ]are fun
all by thems elves .
Hack 50. Avoid the Zonk
I magine, if you will, that you are traveling with your U nc le Frank
through an unc harted region of Tonganoxie, Kans as . You c ome to
a fork in the road that branc hes out into three pos s ible paths : A ,
B, and C . You don't know whic h will lead you to your des tination,
the fabled world's larges t ball of twine (in C awker C ity, Kans as ).
A n old pros pec tor is res ting with his burro at the c ros s roads .
"Say, old timer," you s ay, "whic h road leads to the world's larges t
ball of twine? "
"Well," s ays he, "I know, but I won't tell you. What I will do,
though, is tell you that one road is the c orrec t road. Two are
wrong and lead to c ertain dis as ter (or at leas t poorly maintained
res trooms ). G o ahead and take your pic k, c ity s lic ker. A s you
drive off, look bac k at me. I won't s ignal whether you are right or
wrong, but I will point at one of the other two roads . T he one I
point at will be a wrong road. You s till won't know for s ure
whether you gues s ed right or not, of c ours e, but I guarantee that
I 'll point at one of the two roads you are not on and it will be a
wrong road."
You ac c ept the s trange man's offer (what c hoic e do you really
have? ) and you as k U nc le Frank, the experienc ed gambler
among you, to pic k a road. H e does s o randomly and you head off
optimis tic ally down one of the three paths let's s ay A . A s you
look bac k, the kindly pros pec tor points to one of the other
roads let's s ay B. I mmediately, you s lam on the brakes and bac k
the c ar up. O ver the objec tions of U nc le Frank, you head down
the remaining road, C , with the peddle to the metal, fairly
c onfident that you are now on the right path.
C razy, are you? Suffering from white- line fever? N o, you've jus t
applied the s tatis tic al s olution to what is known as the Monty Hall
problem and c hos en the road among the three that has the
greates t c hanc e of being c orrec t. H ard to believe? Read on, my
friend, and prepare to win ric hes beyond your wildes t dreams .
I n our example with the three roads and the pros pec tor, there is ,
in fac t, a two- thirds (about 6 7 perc ent) c hanc e that C is the
c orrec t road. To apply this odd s trategy to a more realis tic
s ituation, think of c ontes tants on game s hows or gamblers in
any game in whic h prizes are hidden in boxes or behind doors .
A s typic ally dis c us s ed among game s how theoris ts and c ranky
s tatis tic ians , the problem is pres ented as a fairly c ommon
ac tual s ituation on the game s how Let's Make a Deal (whic h had
its heyday in the 1 9 6 0 s and 1 9 7 0 s ), but it is a s ituation s till
s een today in T V game s hows . T he hos t of Let's Make a Deal was
M onty H all, s o the problem c arries his name.
A s with the three roads problem, the ans wer is yes , you s hould
s witc h. T he ans wer jus t never s eems right the firs t time one
hears it. But, if you want to inc reas e your odds of winning the c ar,
you s hould now s witc h.
T hink of the probability of you gues s ing the c orrec t c urtain. L et's
as s ume that it is a random gues s none of this "I notic e that one
c urtain moved, s o I figured there was a donkey behind it" s tuff.
T hree c urtains , with only one c urtain being a winner, means there
is a 1 out of 3 c hanc e that you will gues s right and win the c ar.
T hat's about 3 3 perc ent. O n that firs t gues s , with no additional
information, you are likely to be wrong; in fac t, you have a 2 out
of 3 c hanc e of being wrong. I n other words , there is about a 6 7
perc ent c hanc e that the c ar is s omewhere behind the two
c urtains you did not pic k.
O nc e you know that one of thos e other two c urtains does not
have the c ar, that does n't c hange the original probability that the
c ar is 6 7 perc ent likely to be s omewhere behind thos e two
uns elec ted c urtains . Remember, M onty will always have a wrong
c urtain he c an open, no matter whic h one you c hoos e. T he 6 7
perc ent c hanc e that the c ar is behind B or C remains true, even
after B is revealed to not be hiding the c ar. T he 6 7 perc ent
likelihood now trans fers to c urtain C . T hat's why you s hould
always s witc h to the other c urtain.
I n any s ituation like this , you s hould s witc h. You might be wrong,
of c ours e, but you have a better s hot of winning that c ar or
whatever other prize you are playing for if you ac c ept any offers
to s witc h. T his is always the bes t s trategy, if a few c riteria are
met:
D on't be too c onc erned if the c orrec tnes s of this s olution is n't
immediately apparent. Really s mart people often firs t view the
new odds as being 5 0 /5 0 between the two unopened c urtains
and, therefore, it does n't matter if you s witc h. T he key to
remember, though, is that your original c hanc e of pic king the
c orrec t door, 3 3 .3 perc ent, c annot c hange no matter what
happens after you make your c hoic e. E ven experts s ometimes
dis agree about the bes t way to view this ques tion. E ven people
as wis e as the old pros pec tor you met out in Tonganoxie that
s tarted our dis c us s ion don't always know the right ans wer to the
M onty H all problem. H ow do you think he won that burro?
Hack 51. Pass Go, Collect $200, Win the
Game
A s two s ix- s ided dic e (and a randomly s huffled pile of c ards ) are
the primary determinants for dec iding what s quare you land on,
luc k pays more than jus t a s mall role in the outc ome.
C ompetitive s tatis tic ians s uc h as you and me (or, at leas t, me)
are drawn to any game in whic h probability plays a key part
bec aus e, by applying a few probability bas ic s , we s hould win
more often than your average, run- of- the- mill railroad baron.
Key Properties
Electric
13 2.61 percent
Company
Indiana
14 2.56 percent
Avenue
St. Charles
15 2.56 percent
Place
Atlantic
16 2.54 percent
Avenue
Pacific Avenue 17 2.52 percent
Ventnor
18 2.52 percent
Avenue
Boardwalk 19 2.48 percent
North
Carolina 20 2.47 percent
Avenue
You c an draw s ome important tac tic al c onc lus ions from this
data:
Without a s tatis tic al analys is , it might not be s o c lear the c ruc ial
role that the J ail and "G o to J ail" s quares play in the overall true
value of real es tate. O ne wis hes it was for s ale. P layers will s tart
or end their turn on the J ail s quare more often than they will land
on any monopoly on the board. A c ons tant s tream of releas ed
pris oners flood ac ros s one s ide of the board, inc reas ing the
opportunity to c ollec t rents on properties all the way up to
I llinois .
J ail c an als o provide a welc ome res pite from having to travel the
s treets paying rent to other players , though early in the game,
J ail c an prevent you from buying up your dream properties . A
final obs ervation on the importanc e of J ail: there is only one
s quare that you c an never end your turn on. C an you name it? Go
to Jail.
See Also
Bill Butler runs another web s ite that pres ents the
probabilities as s oc iated with M onopoly at
http://www.durangobill.c om/M onopoly.html. A mong other
things , the s ite hos ts a dis c us s ion of the inc redible
c alc ulation diffic ulties involved when one wis hes to
inc lude every real- life detail of M onopoly play, s uc h as
keeping trac k of whether a partic ular C hanc e or
C ommunity C hes t c ard has been drawn already.
You won't need a volleyball like Tom H anks 's buddy in Cas taway,
and it won't have muc h pers onality, but your c omputer will be
able to play games with you, and it will even learn and get
s marter. T he driving forc es behind the learning algorithm are
c hanc e and the power of random s elec tion.
Trial-and-Error Learning
T his mac hine learns : the more times you play agains t it, the
better it will be. T he game this mac hine plays is T ic -Tac -Toe,
but theoretic ally, you c ould build a devic e for any two- pers on
s trategy game us ing the s ame princ iples . T ic -Tac -Toe is s imple
enough that it demons trates well the methods of des ign,
c ons truc tion, and operation.
287 matchboxes
I f the c omputer los es , "punis h" it by taking the pebbles that you
drew randomly from the c oc onuts and throwing them into the
oc ean.
I f the mac hine wins or draws the game, return the pebbles to the
c oc onuts from whic h they c ame and "reward" it by adding an
additional pebble of the s ame c olor.
Why It Works
X O X
O
You probably rec ognize that the bes t movereally, the only move
to c ons ideris for the c omputer to bloc k your impending win by
putting its X in the bottom c enter s pac e. T he c omputer, though,
rec ognizes s everal pos s ibilities . I t c ons iders any legal move.
Two moves that it would c ons ider (whic h means , literally, that it
would allow to be drawn randomly out of the c oc onut s hell) are
the bes t move and a bad move:
X O X X O X
O O
X X
When the c omputer firs t s tarts playing the game, both thes e
moves (or behaviors ) are equally likely. O ther moves are als o
pos s ible in this s ituation, and they are als o equally likely. T he
move on the left probably won't res ult in a los s , at leas t not
immediately, s o as pebbles repres enting that move are added to
the c oc onut, the relative probability of that move inc reas es
c ompared to other moves . T he move on the right probably ends
in a los s (exc ept agains t G illigan, maybe), s o the c hanc e of that
move being s elec ted next time mathematic ally dec reas es , as
there are fewer pebbles of that c olor to be randomly s elec ted.
I magine you rec eive a thic k, mys terious envelope in the mail.
Rather than having it dis pos ed of by the neares t domes tic
s ec urity offic ers , you open it and find an ordinary dec k of c ards
and the following s et of ins truc tions :
1. C ut the dec k.
6. Remove the top c ard of the dec k, write it down, and plac e
it anywhere in the dec k.
10. M ail this dec k bac k to the enc los ed addres s (a pos t
offic e box in Tonganoxie, Kans as , or s ome other plac e
with a name that c onjures up wonder and whims y).
You follow all thes e ins truc tions (while wearing protec tive rubber
gloves ) and return the dec k. A bout a week later, a s maller
envelope arrives . I n it is your c hos en c ard! (T here als o might be
a reques t for $ 3 0 0 and an offer to predic t your future, but you
jus t throw the offer away.)
A mazing, yes ? I mpos s ible, you s ay? T hanks to the known likely
dis tribution of s huffled c ards , it is more than pos s ible, and even
a budding s tatis tic ian like you c an do it. N o enrollment in
H ogwarts nec es s ary.
How It Works
Statis tic ians have analyzed thes e patterns and publis hed them
in s c ientific journals . T he work is s imilar to that whic h res ulted
in the groundbreaking s ugges tion that one s hould s huffle a dec k
of c ards exac tly s even times to attain the bes t mix before
dealing the next round of hands for poker, s pades , or bridge.
P ic ture a dec k of c ards in s ome order. A fter one s huffle, if the
s huffle is perfec t, the original order would s till be vis ible within
the now s uppos edly mixed dis tribution of c ards . I n fac t, there
would be two original s equenc es now overlapping eac h other, and
by taking the alternate c ards , you c ould rec ons truc t the original
overall order.
I f you knew the s tarting order of thes e 1 2 c ards , you c ould pic k
it out fairly eas ily by jus t looking at every other c ard in the new
grouping. T hes e s ubpatterns are c harac terized as ris ing
s equences : the c ards ris e in value as you move along the
s equenc e. I f c ards begin in one long ris ing s equenc e (or a group
of four, bec aus e there are four s uits ), riffle s huffles will maintain
thes e ris ing s equenc es ; they will jus t be interwoven together.
T hes e groupings of ris ing s equenc es will remain, even after
many s huffles .
I f at any time during the s huffling and c utting of the dec k, a c ard
is taken from the dec k and purpos efully plac ed anywhere els e in
the dec k, it will appear "out of plac e" c ompared to the overall
pattern of ris ing s equenc es . T his , of c ours e, is exac tly what the
c ard tric k's ins truc tions demand, and it explains how your
mys terious magic ian (or you when you as s ume that role) c ould
s pot what c ard has been moved.
3. Three of
8. Eight of Clubs
Clubs
4. Four of
2. Two of Clubs
Clubs
5. Five of
3. Three of Clubs
Clubs
6. Six of
9. Nine of Clubs
Clubs
7. Seven of
10. Ten of Clubs
Clubs
8. Eight of
5. Five of Clubs
Clubs
9. Nine of
4. Four of Clubs
Clubs
10. Ten of
11. Jack of Clubs
Clubs
11. Jack of
6. Six of Clubs
Clubs
12. Queen of
12. Queen of Clubs
Clubs
Probability of Success
Bec aus e the exac t nature of the s c rambling of the dec k c annot
be known, the magic ian c an identify a c ard as out of s equenc e
only bec aus e the s huffles were les s than perfec t. A ls o, the tric k
is muc h more likely to be s uc c es s ful (only one c ard is out of
s equenc e) if the ins truc tions do not allow anymore c utting or
s huffling after the c ard is taken from the top of the dec k and
plac ed in the middle.
Statis tic ians from C olumbia and H arvard U nivers ity, D ave Bayer
and P ers i D iac onis , have c onduc ted a mathematic al exploration
of the pos s ible outc omes of a dec k of c ards s huffled and mixed
in the ways des c ribed for this magic tric k. (P res umably, the
fac ulty at thes e ins titutions has a lot of free time on its hands ? )
T hey developed a mathematic al formula for identifying the one
c ard out of plac e and ran a million c omputer s imulations to tes t
the ac c urac y of gues s es by their c yber- s orc erer as to the
c hos en c ard. T heir analys is as s umed perfec t dovetail s huffles .
T hey found that with only a c ouple of s huffles , the tric k works
pretty well, but the odds of s uc c es s dec reas e quic kly as more
s huffles are allowed.
To play it s afe, you might do the tric k with at leas t three people.
T hen, as s uming 8 0 perc ent likelihood for eac h pers on, the
c hanc es that you will amaze at leas t one of thos e three people
inc reas es to 9 8 .4 perc ent, whic h is almos t a c ertainty. I f you are
wrong on all three, jus t never s peak or write to thos e people
again, c los e your pos t offic e box, and c onc entrate on more
important things in life. A fter all, with hard work, you might get
into C olumbia or H arvard s omeday and do really important
things .
See Also
Find out how random your iPod's "random" shuf f le really is.
A fter hearing one artis t played over and over during a s huffled
play of your entire mus ic library in iTunes , you might think your
player has a preferenc e of its own. A pple, though, c laims the
iTunes 's s huffle algorithm is c ompletely random. T he s huffle
algorithm c hoos es s ongs without replacement. I n other words ,
muc h like going through a s huffled dec k of c ards , you will hear
eac h s ong only onc e until you have heard them all (or until you
have s topped the player or s elec ted a different playlis t).
A fter res etting the play c ount to zero, I hit P lay and left my des k
for the weekend. I ran the s ame s ongs twic e: onc e s elec ting
random (P arty Shuffle) and onc e s elec ting both random and the
"P lay higher rated s ongs more often" option. Table 5 - 8 s hows
the play c ounts , as of M onday morning.
Subs c ripts in this expres s ion indic ate the s ong rating. T he
c hanc e of a s ong being c hos en is bas ed on x (number of s ongs
with eac h rating) and P (the proportional weight as s igned by the
iTunes algorithm for eac h rating).
A lthough the higher- rated s ongs are given preferenc e, you will
not definitively hear more five- s tar rated s ongs than all other
s ongs . L et's as s ume mos t people follow a normal dis tribution for
their ratings [H ac k #2 3 ], with the three- s tar rating being the
mos t c ommon. Table 5 - 9 dis plays a hypothetic al iTunes library
with this bell- s haped c urve for the rating s ong c ount.
I us e the "P lay higher rated s ongs more often" option, but thes e
were eac h middle- of- the- road 3 - s tar s ongs , and my s ong library
has nearly 4 ,0 0 0 s ongs . T he odds might s eem outrageous at
firs t, but you have to realize jus t how many s ongs you hear
throughout a workday. I f I average 1 0 hours at work eac h day
and average a 3 1/2- minute s ong duration, odds s ay I s hould
hear a c ons ec utive repeat in les s than a month.
I t's s imply the mind's tendenc y to find a pattern that makes you
think iTunes has a preferenc e.
See Also
Brian H ans en
Hack 55. Predict the Game Winners
But our everyday world c ons is ts of way more than only one
variable predic ting another. I n fac t, in mos t c as es there are
s everal or multiple variables that predic t a partic ular outc ome.
H ere we are not dealing with the predic tion of jus t one variable
from another, but the predic tion of one variable from s everal.
T his tool is c alled multiple regres s ion (bec aus e there is more
than one predic tor variable).
T he firs t s tep is to build your model (the predic tors and their
weights that you will us e to make your predic tion). For football,
there are dozens of s tatis tic s kept and available about teams '
pas t performanc es and player c harac teris tic s . Some make
s ens e as predic tors of future performanc e (e.g., pas t
performanc e), while others do not (e.g., c utenes s of the mas c ot).
T he c hanc e to win money, though, is a powerful motivator, s o I
would take the time and effort to c ollec t jus t about every
s tatis tic I c ould find about every team and every game. T he key
is to find variables that on their own c orrelate pretty well with
winning the Super Bowl.
L et's pretend that you have done your res earc h and found s ix
variables that c orrelate with whether a team wins or los es . Some
make s ens e; s ome do not. You are interes ted in getting the mos t
ac c urate real- life predic tion you c an get, s o you are willing to
inc lude the kitc hen s ink if it will make a differenc e. To be c lear,
you took eac h year that a team was in a Super Bowl and then
gathered data for that team from that year.
When you do this analys is with real data, you'll likely find a
different mix of potential predic tors .
Entering the Data into a Spreadsheet
You might remember from your high s c hool days that the formula
for a s imple s traight line looks s omething like this :
Y'
But when we have more than one predic tor variable, things get
more interes ting and more fun. T here is a longer s eries of
predic tors (many Xs ) and weights (many bs ).
T here are two ways to get s tatis tic al regres s ion info
us ing E xc el. Firs t, you c an us e the SLOPE and INTERCEPT
func tions , whic h you c an find on the I ns ert Func tion
menu. Selec t the func tion and enter the argument (the
c ells where the data is loc ated), and E xc el returns
thes e values , allowing you to plug in known values and
predic t others . T his method works bes t when there is
jus t one predic tor.
I f we c ombine all of thes e into one big equation for predic ting
Super Bowl outc omes , here's the model we get:
So, for eac h of the predic tors (variables X 1 through X 5), there is
s pec ific weight (the bs in the formula or the c oeffic ients in the
res ults ).
N ow, the s ame formula in E nglis h:
I magine us ing this equation with all the rows of data you entered
into your s preads heet. T here would be a pretty high c orrelation
between the ac tual Super Bowl outc omes and the predic ted
outc ome. I know this bec aus e of the "M ultiple R" part of the
output s hown in Table 5 - 1 1 , whic h s hows a pretty high
c orrelation. 0 .8 4 is c los e to 1 , whic h is the highes t c orrelation
you c ould get.
So, let's s ay that this year's D enver C annonballs has the data
points s hown in Table 5 - 1 3 .
P lugging this data into the equation s hown earlier, here's what we
get for a predic tor of Y:
N eil Salkind
Hack 56. Predict the Outcome of a
Baseball Game
L ook, I 'm a bus y guy. I 'm always looking for a way to s ave time
on the les s important things in life, s uc h as following my loc al
bas eball team, s o I 'll have more time to s pend on the important
things in lifefriends , family, debating the logic of the H olms '
s equential Bonferroni proc edure as the appropriate follow- up
method to analys is of varianc e, and s o on. A c as e in point
happened jus t the other day. Wanting to know whether the
Kans as C ity Royals would win a bas eball game that was in
progres s , I hardly had time to wait until the game was over. I
wanted to know right now!
How It Works
D uring the firs t c ouple hours of a bas eball game, turn on the
radio broadc as t of that game. L is ten jus t long enough to identify
the team that is at bat. T hat team has a greater than 5 0 perc ent
c hanc e of winning that game.
Why It Works
Bas eball is a game where the longer you are on offens e, the more
points you c an s c ore. A s more batters c ome to bat in a s ingle
inning, the c hanc es of moving runners along the bas e paths and
ac ros s home plate inc reas es . A nother way to look at it is to
imagine the end of an inning that was huge for one team. I f a
team s c ored a lot of runs , they had to have us ed c ons iderably
more than the minimum of three batters in that inning and,
c ons equently, been at bat a proportionately longer length of time
than the other team. O ver the c ours e of a game, the team that is
at bat longes t is more likely to s c ore more (or have more
produc tive innings ).
Figure 5 - 4 s ugges ts a pos s ible dis tribution of at- bat time for a
regulation nine- inning game. I n this example, the winning team
was on offens e for 5 8 perc ent of the time. I n retros pec t, a
random tuning in to the broadc as t had a 5 8 perc ent c hanc e of
finding the winning team at bat.
Proving It Works
While the team with the fewes t at- bats s ometimes did win the
C hic ago C ubs games , the larger the dis c repanc y between at-
bats , the more likely the team with the mos t at- bats was to win
the game. When the mos t- at- bats teams won, they averaged
4 .1 4 more at- bats than the los er. When the leas t- at- bats teams
won, they averaged only 2 .8 8 at- bats les s than the los er.
T his pres ents an interes ting variation of this predic tion method
that applies only to the ninth inning. Turn on the game in the
ninth inning; if your team is batting, things don't look s o good.
T he data pres ented for the C hic ago C ubs that found the winning
team oc c as ionally having fewer at- bats than their opponent c an
be partly explained by the fac t that the winning team s ometimes
bats in only eight innings .
T his method does n't work for all s ports . I n bas ketball, for
example, time of pos s es s ion wouldn't be expec ted to pos itively
c orrelate with points s c ored and, in the c as e of high- energy,
fas t- s c oring teams , might even negatively c orrelate. I n football,
on the other hand, time of pos ition is c ons idered a key indic ator
of quality performanc e and us ually c orrelates with a win.
Hack 57. Plot Histograms in Excel
T here is s ome truth to the c lic h\x8 e "a pic ture is worth a
thous and words ." A pic ture is often the bes t way to unders tand
1 ,0 0 0 numbers . P eople are vis ually oriented. We're good at
looking at a pic ture and obs erving different c harac teris tic s ;
we're bad at looking at a lis t of 1 ,0 0 0 numbers .
L et's think of eac h range as a buc ket. E very player- s eas on goes
into a buc ket. For example, in 1 9 5 9 , H ank A aron had a .3 5 4
average, s o we'll put that s eas on in the .3 5 0 - .3 5 5 buc ket. So,
here's our plan: we'll put eac h player- s eas on into a buc ket,
c ount the number of player- s eas ons in eac h buc ket, and draw a
graph s howing (in as c ending order) the number of players in
eac h buc ket. T his s ingle diagram is a his togram.
The Code
CONCATENATE(...)
I f you want to take this to the next level, you c an replac e the bin
s ize with a named value. (For example, name c ell A 1 bin_size.)
T his makes it eas y to c hange the bin s ize dynamic ally and
experiment with different numbers of bins .
J os eph A dler
Hack 58. Go for Two
A few years bac k, I was enjoying watc hing my loc al profes s ional
football team as they were los ing a c los e game. I was n't
entertained by my team's dis mal performanc e as muc h as I was
delighted by my team's befuddled c oac h as he attempted to read
and unders tand a two-point convers ion chart.
T he likelihood that their field goal kic ker will make the
field goal
A s for pos s es s ions left, however, this is exac tly the type of
information that dec is ion s ys tems bas ed on probability need to
take into ac c ount. Bas ed on a proc es s of working bac kward from
the ending of a hypothetic al football game that takes the
probability of s uc c es s on either option (9 8 perc ent for one- point
plays and 4 0 perc ent for two- point plays ) into ac c ount,
s tatis tic ians have produc ed a c hart bas ed on not only on the
c urrent s c ore, but als o on the total number of pos s es s ions
remaining for both teams .
T his two- point c onvers ion c hart is bas ed on the branc hing
pos s ibilities s tarting at different points in the game and
as s uming bas ic probabilities of s uc c es s for either an extra point
or a two- point c onvers ion. A n average N FL quarter s ees s ix
pos s es s ions in total, s o think of this c hart as being mos t us eful
in the fourth quarter. Sac krowitz als o as s umes a 5 0 perc ent
c hanc e for overtime vic tories .
How It Works
Whic h c hart s hould you us e the next time you find yours elf
c oac hing in a c ruc ial football game with a key dec is ion to make?
T hat's up to you, but jus t remember that befuddled football
c oac h I watc hed on T V a few years ago. N ot only was he
replac ed the next year by D ic k Vermeil, c ons idered one of the
brighter football c oac hes around, but it was Vermeil who helped
develop the U C L A two- point c onvers ion c hart s hown in Table 5 -
1 4 . N ow you know the res t of the s tory!
Hack 59. Rank with the Best of Them
There are many ways to use data to make judgments about who
is best in any sport. A ll the intuitive ways to compare
perf ormance in individual sports have validity concerns,
however.
Firs t, let's take a look at the nature of the data I had to analyze.
Your data will likely be s imilar, whether you run your weekly home
M onopoly game or you run the P rofes s ional G olf A s s oc iation.
T hough poker is not a s port, any organized c ompetitive endeavor
provides data for rankings . Table 5 - 1 6 s hows the res ults from
eight tournaments in my own s ummer poker league.
T he points under eac h player's name indic ate the order in whic h
they went out. I f there are s ix players and you go out firs t, you
get one point for taking las t plac e. I f you are the winner among
s ix players , you get s ix points for taking firs t.
H ow, then, to rank players in the poker league? H ere are three
c ommon s olutions , all of whic h work to s ome extent.
Total points
Mean performance
Total wins
A ll three s c oring s ys tems make s ens e. But the ques tion about
who is the bes t has a different ans wer under eac h of the three
s ys tems ! T his is c ertainly a frus trating finding for a poker
s c ientis t like me. Bec aus e one c ould defend any of the three
methods as the "bes t" way to rank, it is a bit of a paradox that
eac h method produc es a different "bes t" poker player. Table 5 -
1 8 s hows how the rankings differ under eac h s c oring method.
Real- life profes s ional s ports organizations have dealt with the
advantages and dis advantages of eac h s ys tem by c reating
c ompos ite point s ys tems . Some of the tinkering to improve
ranking s ys tems in tennis and golf (and tournament poker, too)
inc ludes :
Before s howing how to es timate the value of pi, I 'll begin our
dis c us s ion by pres enting a c ouple of bas ic fac ts from geometry.
D on't panic ; I don't know muc h about geometry, s o we won't
s pend a lot of time on this . I 'll jus t c over the bas ic s we need to
apprec iate the magic of this hac k.
Pi
T his is one of thos e problems that s eem impos s ible to s olve the
firs t time you hear it, but it is s olvable. T here's no need to s pend
any time c alc ulating the s olution here, though I c ertainly could
do it, I as s ure you. Really, I c ould. Really. T he s olution has to do
with geometry, and it takes into ac c ount two key c omponents of
information. T he keys to any given random landing pos ition are:
D efining the random pos ition of the needle with thes e two bits of
information allows for s ome general obs ervations that help to
s implify the problem:
T his is the ans wer to the problem. L et's try it quic kly with s ome
real numbers , jus t to c hec k L ec lerc 's work. I magine a needle
three inc hes long falling randomly on a s ewing table with a
pattern on the grain s uc h that there are two parallel lines four
inc hes apart. What proportion of the time will the needle touc h
one of the two lines ? H ere are the nec es s ary c omputations :
Probability and Pi
Set up an area with two horizontal lines , drop s ome needles , and
keep trac k. M eas ure the dis tanc e between your lines and the
length of your needle, and let the random whims of c hanc e do all
the c ognitive heavy lifting. C ollec t a large s ample of data from
many needle drops to get a probability that is prec is e to s everal
plac es pas t the dec imal, perhaps a thous and drops or s o. G ood
luc k and keep c areful rec ords .
L et's s ay that you drew two lines that were 8 inc hes apart and
us ed a knitting needle about 7 inc hes long. I f you us ed this
equipment for a large number of drops , you would likely find that
the needle touc hed a line s omewhere between 5 0 and 6 0
perc ent of the time. L et's s ay it was 5 5 perc ent. To us e this data
to c alc ulate pi, you would apply the math like this :
T his c hapter c onc entrates on hac ks that help you to think more
c learly, c leverly, or c reatively. Start out by us ing the rules of
probability and proving yours elf s marter than a s uperhero [H ac k
#6 1 ]. Keep feeling s mart by mas tering s tatis tic al s hortc uts
[H ac k #6 6 ] and the ability to detec t fraud [H ac k #6 4 ].
C ontinue impres s ing yours elf and others by tapping into your
s keptic al s ide: demys tify amazing c oinc idenc es [H ac k #6 2 ] and
hac k your way to the truth about weird phenomena [H ac k #6 3 ].
A fter dis proving (or perhaps proving) the exis tenc e of E SP
[H ac k #6 8 ], your friends will be amazed when you read their
minds [H ac k #6 7 ].
Superman pres ents what he s ays are the odds for the fantas tic
feats that L ois performs , but the author of the s tory (anonymous )
does not provide the c alc ulations . L et's review the random
gues s es that L ois makes , do our own c alc ulations , and c hec k
the M an of Steel's math. For determining the probability of this
s eries of independent events , we will apply the multiplic ative
rule [H ac k #2 5 ].
The Guesses
She finally fails , after Superman has res c ued her, to gues s the
number of jellybeans in a jar. A s Superman explains to M s . L ane
that s he is not ps yc hic , he s ugges ts that the odds of her making
thes e four c orrec t gues s es by c hanc e are 3 2 6 ,4 5 4 ,8 3 9 ,0 4 7 to
1 , or 1 out of 3 2 6 ,4 5 4 ,8 3 9 ,0 4 8 .
"I s ee, Superman! " s he s ays . "I was luc ky enough to hit that
'one c hanc e'." "Yes ," s ays Superman, "after all, s omeone always
wins big lotteries , too" (or s ome s uc h nons ens e to that effec t).
T hat number c alc ulated by Superman or his Superc omputer
c ertainly is big, whic h s eems right, but I don't think it is c los e to
being c orrec t. M y gues s is that this outc ome is even more
mirac ulous .
The Calculations
L et's work through our own c alc ulations . For gues s es 1 and 4 ,
we c an figure pretty c los e to the odds of gues s ing the ans wer to
that problem independently. For gues s es 2 and 3 , we'll have to
make s ome as s umptions .
H ere again are the gues s es L ois made and real c alc ulations of
the odds for eac h one, taken by thems elves .
Guess 1
Guess 2
So far, s o good. I t gets very tric ky here, though, bec aus e of the
c ombination L ois ac tually s ugges ts . She predic ts "1 1 right...1 3
left...5 left...bac k to 8 ...forward to 1 5 ." T his is a very odd
c ombination. Firs t, a c ombination is us ually read in a different
order: left 13, ins tead of 13 left. Sec ond, what c an it pos s ibly
mean to go left twice in a row! Surely you have to c hange
direc tion of the dial to loc k in eac h number in the s equenc e.
A fter all, the dial pas s es over many numbers on its way left
every time. H ow does it know whether to c ount eac h number it
pas s es as a part of the c ombination s equenc e? I 'm going to jus t
pretend that the s equenc e is mis reported s lightly by the
anonymous author; otherwis e, I 'd have to paus e here in an
endles s loop of c onfus ion, with my fingers over the keyboard,
never able to c ontinue.
Finally, why does L ois s tart s aying "bac k" and "forward" ins tead
of left and right? T his jus t makes her direc tions unc lear (perhaps
to c over hers elf in c as e of failure? ). A gain, I 'm going to as s ume
s he us es the terms to mean a c hange in direc tion, even though
back probably means left and forward probably means right,
whic h would jus t c omplic ate things more. A c ons ervative s et of
probabilities for this gues s , then, is
1 /2 x1 /5 x1 /1 0 0 x1 /1 0 0 x1 /1 0 0 x1 /1 0 0 x1 /1 0 0 . T hat's 1 out of
1 0 0 ,0 0 0 ,0 0 0 ,0 0 0 .
Guess 3
L ois als o gues s es the unlis ted phone number to the ric hes t
pers on in town. T here are a c ouple of ways to figure this .
Firs t, if L ois were a bit naïve (and, no offens e to L ois 's fans , but
I 'm gues s ing s he is ), s he might s et only the parameters that the
phone number had to have s even digits and not s tart with 0 .
U nder thes e rules , there are 9 ,0 0 0 ,0 0 0 pos s ible phone
numbers . T his as s umes that we s tart with 1 0 ,0 0 0 ,0 0 0 pos s ible
s even- digit numbers (9 ,9 9 9 ,9 9 9 is the highes t s even- digit
number, plus add one for the number 0 ,0 0 0 ,0 0 0 ).
Guess 4
Final Probability
So, the c hanc es of gues s ing c orrec tly on thes e four problems in
a row, giving L ois all s orts of benefits of the doubt for knowing all
s orts of things about s afes and telephone numbering s ys tems , is
1 /5 x1 /1 0 0 ,0 0 0 ,0 0 0 ,0 0 0 x1 /5 0 0 ,0 0 0 x1 /2 0 ,0 0 0 . T he c hanc es
of this s equenc e of luc ky gues s es oc c urring is , c ons ervatively, 1
out of 5 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0 even more remarkable
than the already hard to believe 1 out of 3 2 6 ,4 5 4 ,8 3 9 ,0 4 8 .
"I s ee, Superman! I was luc ky enough to hit that one c hanc e,"
L ois c onc ludes . I ndeed. O f c ours e, the odds were even wors e
that Superman would propos e to L ois s omeday, and that
happened. So, who am I to rain on M r. and M rs . Superman's
parade?
Hack 62. Demystify Amazing
Coincidences
I f you play poker with any regularity (and, if you are a minor
H ollywood c elebrity, you apparently play all the time), you know
that you rarely s ee a royal flus h: a five- c ard hand with the 1 0 ,
J ac k, Q ueen, King, and A c e all of one s uit. I f your opponent were
dealt a royal flus h, would that be remarkable? Would you s us pec t
c heating? I t all depends on how many poker hands you have
s een in your lifetime, I gues s , or perhaps in rec ent memory.
1. Bec aus e there are only four pos s ible royal flus hes , one
for eac h s uit, we divide this number of pos itive outc omes
(4 ) by the total number of pos s ible outc omes
(2 ,5 9 8 ,9 6 0 ), for a probability of .0 0 0 0 0 1 5 3 9 , or 1 out
of 6 4 9 ,7 4 0 .
A s you bec ome more and more aware of the role that c hanc e
plays in the world around you, and begin to habitually s tat- hac k
your way through everyday s ituations , you might bec ome overly
s ens itive to patterns that don't s eem right. D on't abus e your
newfound powers , though, and treat probabilities as c ertainties .
A dditionally, don't make the mis take of expec ting events that
are s uppos ed to be random to look random.
Looking random and being random are not the s ame things . When
events have s everal pos s ible and equally likely outc omes , any
of them c an happen. T he way the human mind works , though,
many people think that the pattern of outc omes of events with
s everal equally likely outc omes ought to look a c ertain way, a
way that s omehow looks random (whatever that means ).
For example, real- world res earc h has found that people tend to
believe that, when flipping c oins , the mos t probable outc omes
are thos e that look the mos t mixed up. To illus trate this idea,
look at Table 6 - 2 . (A void looking at Table 6 - 3 until you have
read a bit more.) Whic h exac t s equenc e of c oin flips do you think
is mos t likely to oc c ur?
M any people give the ans wer "A ." M aybe you did, too. When
as ked to explain why A s eems the mos t likely outc ome, the
ans wers inc lude s tatements like thes e:
"T he others are too ordered."
E ven though you know that c oin flipping is random (as s uming the
c oin is n't weighted), looking random does n't make s omething
more probable. A ll of thes e patterns of c oin flips are ac tually
equally probable, as s hown by the math in Table 6 - 3 .
I f you are as ked a ques tion about whic h outc ome is the mos t
likely, or whether a given outc ome c ould have oc c urred by
c hanc e, firs t determine whether you are being as ked about
c ombinations (the total number of H eads and Tails in any order,
for example, or the number of different ways of drawing five
playing c ards of the s ame s uit) or about the permutations that
are pos s ible. H ere are the important dis tinc tions between the
two:
Combinations
A c ombination is the total number of ways that one c ould
end up with a partic ular number of values when drawing
randomly from s ome population. C oin flips are s amples
drawn from a theoretic ally infinitely large population
made up of 5 0 perc ent H eads and 5 0 perc ent Tails . T he
number of c ombinations varies , depending on the
number of a c ertain value one is interes ted in. I n other
words , with five draws or flips , there are more ways to
draw out three heads than there are ways to draw out
five heads . So, drawing three heads is likelier than five
heads .
Permutations
So, the number of ways to get three H eads out of five c oin flips
is :
H H H H H T H H H H H H H H T THHHT HHTTH T H T T H H H T T T
T H T T T H H H T H THHTH HHHTT T H H T T H H T H H THTHH
HHTHT T H T H T H T H H H TTHHH HTHHT T T H H T H T T T H
T T T T H H T H T T T T H T T HTTHH T T T H H H T T T T T T T T T
HTHTH T T H T H H T T H T T T T H T
When to Be Suspicious
You get to dec ide for yours elf, though, when you want to ac c us e
s omeone of being a c heat. G ood luc k on making that dec is ion! I t
s hould res ult in fis t fights les s than 5 perc ent of the time.
I n the 1 9 th c entury, long before the age of elec tronic c alc ulators ,
s c ientis ts us ed tables publis hed in books to find values of
logarithms . A partic ularly obs ervant 1 9 th- c entury as tronomer
and mathematic ian, Simon N ewc omb, notic ed that the pages of
logarithm tables were more worn in the firs t pages than in the
las t pages . N ewc omb c onc luded that numbers beginning with 1
oc c ur more frequently than numbers beginning with 2 , numbers
beginning with 2 oc c ur more frequently than numbers beginning
with 3 , and s o on.
How It Works
To demons trate Benford's law, I 'll c ons ider two examples that
you c an verify yours elf.
Street addresses
1 0.301 0.301
2 0.167 0.176
3 0.133 0.125
4 0.095 0.097
5 0.082 0.079
6 0.071 0.067
7 0.055 0.058
8 0.045 0.051
9 0.049 0.046
Benford's law does not apply to the firs t nonzero digit only, but
als o inc ludes probabilities of other digits . O nc e again, following
the treatment dis c us s ed earlier, let D 2(x) denote the s ec ond
bas e- 1 0 s ignific ant digit of a number x. For example, D 2(9 1 0 8 )
= 1 , D 2(9 0 1 8 ) = 0 , and D 1(0 .0 2 5 1 0 8 ) = 5 . N otic e that, unlike
the firs t s ignific ant digit, the s ec ond s ignific ant digit c an be zero.
0 0.11968
1 0.11389
2 0.10882
3 0.10433
4 0.10031
5 0.09668
6 0.09337
7 0.09035
8 0.08757
9 0.08500
Scale invariance
Base invariance
Sufficient variability
Random s ampling
Why It Works
A lthough the proof of Benford's law is quite tec hnic al, there are
s ome ins ightful and intuitive explanations for this mathematic al
princ iple. O ne s uc h explanation that I find partic ularly attrac tive
has been provided by M ark N igrini (1 9 9 9 ).
T he real world is a bit more c omplic ated, but this does help to
explain why 1 is a more c ommon firs t digit than larger digits .
A nother intuitive explanation is that there are more s mall towns
than large c ities , and there are more s hort rivers than long
rivers .
Benford's law is les s likely to apply in data s ets with ins uffic ient
variability or data s ets that are nonrandomly s elec ted. For
example, c omputer files s izes approximately follow Benford's
law, but only if no res tric tion is plac ed on the type of files
s elec ted.
To illus trate this , I found the frequenc ies of the firs t digit of the
file s izes on an A pple P owerBook G 4 . T he res ults s hown in
Figure 6 - 3 and Table 6 - 1 1 exhibit the Benford's law pattern.
See Also
Bec ker, T. J . (2 0 0 0 ). "Sorry, wrong number: C entury- old
math rule ferrets out modern- day digital dec eption,"
Georgia Tech Res earch Horizons ,
http://gtres earc hnews .gatec h.edu/res hor/rh-
f0 0 /math.html.
E rnes t E . Rothman
Hack 65. Give Credit Where Credit Is
Due
Fortunately, the good doc tor of philos ophy had many years of
experienc e and was wis er than his adjunc t pos ition at State
C ommunity C ollege and Truc king Sc hool might have s ugges ted.
A mong other obs c ure s tatis tic al hobbies , D r. H owe- M utc h
dabbled in the art of s tylometry, a s tatis tic al method for
c ategorizing the s tyle of written works . T he method c an als o be
us ed to identify anonymous authors . I t works bes t when there
are a c ouple of pos s ibilities or s us pec ts to c hoos e from, and
when the typic al writing s tyles of the s us pec ts are known and
have been quantified. L et's watc h as the broken- hearted
profes s or applies thes e tec hniques to find the true author.
Building a Model
For the firs t analys is , all the words written by the two writers are
kept together. D r. H owe- M utc h c ounts the frequenc y with whic h
eac h word is us ed and identifies the 5 0 to 1 0 0 mos t c ommonly
us ed words in the c ombined databas e. T hes e words bec ome the
items or key variables that s upply the data for a factor analys is .
Fac tor analys is is a s tatis tic al proc es s that looks at the
c orrelations [H ac k #1 1 ] among groups of variables and
identifies c lus ters of variables that c orrelate better among
thems elves than they do with other variables . Whatever thes e
grouped- together variables have in c ommon is as s umed to be a
fac tor, c omponent, or dimens ion that they all s hare.
For the s ake of our s tory, I 'll s how only 1 0 of the words that D r.
H owe- M utc h identified as mos t c ommon ac ros s both writers '
works . Table 6 - 1 4 s hows the words and their frequenc y of us e.
When looking at all the words P aul and L is a wrote, the was us ed
4 .2 perc ent of the time, weas el was us ed 1 perc ent of the time,
and s o on.
T he data for D r. H owe- M utc h's fac tor analys is are s upplied by
eac h s ec tion of 5 0 0 words in the writing s amples . E ac h s ec tion
rec eives a s c ore on eac h of the word variables . T he s c ore will be
the number of times the word is us ed in that paragraph. Table 6 -
1 5 s hows examples of the data M r. H owe- M utc h c ollec ts .
Factor Analysis
When the fac tor s c ores from eac h of P aul and L is a's s ample
papers are c omputed, it bec omes c lear that the two authors
have different s tyles . L is a tends to us e the word weas el more
frequently than P aul; her papers s c ore high on Fac tor 2 . L is a's
papers als o tend toward the high us e of artic les and rec eive
fairly high Fac tor 1 s c ores . P aul's papers , on the other hands ,
tend to avoid the us e of the word weas el and tend toward the
prepos ition end of Fac tor 1 .
T his is diffic ult to gras p us ing words alone, s o an illus tration will
help draw a pic ture to demons trate the plac ement of the s ample
texts . Figure 6 - 5 s hows the two fac tors , the word us age that
defines them, and where the different writing s amples loaded on
the two fac tors . For the s ake of c onvenienc e for this dis c us s ion,
Figure 6 - 5 dis plays only a few of the writing s amples and maps
only the 1 0 words in Table 6 - 1 4 and Table 6 - 1 5 . A ls o inc luded
in the figure is the plac ement of the dis puted paper in this
theoretic al dimens ional s pac e.
See Also
N ow, you need to know the c hanc es that you will be in the front
s eat and who you might be with. T he problem is c alc ulating how
many different c ombinations of relatives c ould be in that front
s eat. For both s imple wagers , s uc h as c oin flips , and life- and-
death s ituations , s uc h as long c ar trips , you c an us e a layout of
numbers c alled P as c al's Triangle to do the math for you.
Presenting Pascal's Triangle
2. 1 2 1
2. 1 6 15 20 15 6 1
1. C ount down s ix rows and ac ros s two entries , and you hit
the number 1 5 . T here are 1 5 pos s ible c ombinations of
two people drawn from s ix people.
Why It Works
For ins tanc e, the total pos s ible number of c oin flip c ombinations
for s ix c oin flips is ans wered on the triangle by totaling the
values in the s ixth row: 6 4 . You would have mathematic ally
derived that value by applying the general formula for number of
pos s ible outc omes for a c oin: 2 number of flips = 2 6 = 6 4 .
A s for the c hanc es that you would both be c hos en as one of two
people out of s ix and that the other pers on would be a s pec ific
one of the other people (our trip to the airport example), the
triangle s aid 1 out of 1 5 . But you als o c ould have figured it this
way:
So, when you have a c omplic ated- looking problem that involves
c ombinations and permutations and s o many pos s ibilities that it
makes your head s pin, let the s oothing mus ic of P as c al's
Triangle bring peac e to your troubled mind.
Hack 67. Control Random Thoughts
H ave you ever been talking to s omeone, and your mind wanders
off for a little while? T hen, you bring up whatever it was that you
were thinking about and, lo and behold, the other pers on was
thinking about the exac t s ame thing!
For example, if you want your friend to s tart thinking about lions
and tigers and bears , you might prime her thought proc es s with
words that are as s oc iated with that themewords s uc h as Wizard
of Oz, Dorothy, Toto, or even s tripes , s inc e s tripes and tigers are
highly as s oc iated with eac h other.
So, if you want to "c ontrol" s omeone's mind, the tric k is s imply
to know whic h things oc c ur mos t frequently together. T he more
frequent a word is , the more likely s omeone is to think it.
L ikewis e, the more frequently two words oc c ur together, the more
likely one is to think of both words when only one is s tated.
Probability and Word Association
I nformation like this is us eful for when you want your s ubjec t to
think of c ertain words or ideas . With s ex, for example, you will
have more luc k c ueing with condom than you will with bumpy.
By talking about words or topic s that are c los ely related to other
words or topic s , you begin a thought proc es s in your friend's
brain in whic h ac tivation of neurons s preads to neurons that
generally fire at the s ame time. Your brain has learned that
c ertain words and topic s almos t always oc c ur together, s o it
knows that when one of the as s oc iated words or topic s is
ac tivated, it s hould als o fire in the regions where thos e
as s oc iated words and topic s are ac tivated. T hat way, your
thought proc es s c an proc eed s moothly.
T his partic ular mind tric k has s ome ris k of failure, es pec ially
when the as s oc iations that you are relying on are low- probability
as s oc iations . H owever, you might jus t enjoy knowing that you
are s ec retly manipulating others and don't have to make a big
s how out of it.
You probably have been primed yours elf many times . When you
are lis tening to your favorite C D and one s ong ends , do you s tart
hearing the next s ong in your head before it even begins ? I f you
know what things s omeone as s oc iates with other things , it
bec omes relatively eas y to predic t s omeone's thoughts after
you've primed them. T his is partially why married people c an
often finis h eac h other's s entenc es .
I f s omeone does n't s hare your language bac kground, bec aus e
they s peak either a different language or a different dialec t, they
might not have the s ame word as s oc iations that you have.
I t als o might not work if a word has s everal equally likely word
as s oc iations . For example, if you prime s omeone with the word
hot, s ome people might s tart thinking about the weather (hot and
cold). Some might think about food (hot dogs ). O thers might s tart
thinking about people they admire (a hot babe).
What do you think of next when you s ee the word hot? I knew you
were going to s ay that!
Today, though, you don't really hear muc h about E SP, and mos t
s c ientis ts have c onc luded that s uc h a thing probably does not
exis t. M ore s pec ific ally, it has n't met the c riteria for s c ientific
ac c eptanc e that any other hypothes ized phenomenon is
expec ted to meet, s uc h as experimental evidenc e, replic ated
s tudies , and s o on. You c an add to the data, though, by
c onduc ting your own s tudies and identifying whether you or your
friend might be ps yc hic .
So, what would be expec ted by c hanc e? I f you are gues s ing for
2 5 c ards and there are five c ards of eac h type, c hanc e alone
would get about 5 c orrec t. I magine, for example, that you
gues s ed s tar every s ingle time ac ros s all 2 5 times . You would be
guaranteed 5 hits and 2 0 mis s es bec aus e you know s tar will
c ome up exac tly five times overall. I f you gues s ed randomly
eac h time among the 5 pos s ibilities , your average s uc c es s
would als o be 5 out of 2 5 , or 2 0 perc ent.
A s ample s ize of 2 5
A n obs erved proportion of .2 4
Without s howing the formula and c alc ulations for this partic ular
analys is , I 'll s how you the res ult. By c hanc e alone, with 2 5
gues s es , a s ubjec t will gues s at leas t 2 4 perc ent of the c ards
c orrec tly 3 1 perc ent of the time. A nother way of s aying that is
that out of 1 0 0 s ubjec ts going through your s tudy, 3 1 of them
will get this res ult or better. So, a hit rate of 2 4 perc ent is better
than average, but not s o unus ual that I would c all The National
Enquirer jus t yet.
What about other hit rates or if you tes t with more than 2 5
trials ? Table 6 - 1 7 s hows the c hanc e of gues s ing given
perc entages of c ards (or higher) c orrec tly. T his table as s umes
an expec ted hit rate of 2 0 perc ent.
O n your way home from the dinner party, your s pous e as ks you
about the man you were c hatting with before dinner. You s hare a
little bit about J ohn, but realize that you never learned what he
does for a living. I n fac t, as you realize, you really don't know
that muc h about him. Your s pous e dec ides to play a little mind
game with you and explains :
8. J ohn is married.
T here are probably more married men in the world than there are
c omputer s c ientis ts , c ar s ales man, former bas eball players ,
Republic ans , preac hers , marathon runners , and c larinet players .
T hus , it is mos t likely that J ohn is married. Where did you rank
that pos s ibility?
Bec aus e we probably don't really know the bas e rates of all the
other pos s ibilities , we c an us e the information we have about
J ohn to predic t whic h of the other pos s ibilities is mos t likely. We
do know that if we c ons ider the group of all former bas eball
players and the group of all c omputer s c ientis ts , there will
probably only be a s mall number of men who belong to both
groups . T hus , the likelihood of being in that group of c omputer
s c ientis ts who us ed to play bas eball mus t be s maller than the
likelihood of being in the group of c omputer s c ientis ts or of being
in the group of former bas eball players .
P erhaps the mos t well- known problem that demons trates the
c onjunc tion fallac y is the now- famous (at leas t in c ognitive
ps yc hology c irc les ) L inda P roblem:
L inda is 3 1 years old, s ingle, outs poken, and very bright. She
majored in P hilos ophy. A s a s tudent, s he was deeply c onc erned
with is s ues of dis c rimination and s oc ial jus tic e, and s he als o
partic ipated in antinuc lear demons trations .
1. C ut it out.
2. Stop.
3. D on't do that.
H it a home run
Strike out
See Also
J ill L ohmeier
Hack 70. Break Codes with Etaoin Shrdlu
Both the wear and tear on your keys and the plac ement of them
in a s tandard typewriter (a.k.a. Q WE RT Y, for the firs t s ix letters
on the top row) pattern are bas ed on their frequenc y of us e in
E nglis h. D ifferent letters in the alphabet are us ed with different
frequenc ies in the s pelling of words in a language. By applying
the known frequenc y of thes e letters , along with other s tatis tic al
tric ks , you c an quic kly dec ode c las s ified doc uments , whether
they are L eonardo da V inc i's diary, a puzzle in the news paper, or
big, bright letters being turned by Vanna White on T V.
T he pas s age looks like nons ens e, but with the key s hown in
Table 6 - 1 8 , anyone c ould eas ily replac e the nons ens e letters
with the original letters , c aus ing the opening s entenc e of the
s ec ond paragraph in C hapter Two of Tom Sawyer to reveal its elf.
Using Probability to Decode Substitution
Ciphers
ETAOIN SHRDLU
H ere's how you might us e thes e letter s tats in real life to dec ode
a s ec ret mes s age or s olve a puzzle. T his method works bes t if
the c oded text is lengthy, but it works s urpris ingly well even for
s horter pas s ages . C alc ulate the dis tribution of the c oded,
s ubs titute letters (the c ipher text), and then c ompare it to the
dis tribution s hown in Table 6 - 1 9 .
Figure 6 - 8 s hows how this proc es s might look graphic ally. O nly
the firs t 1 0 mos t c ommon letters are s hown, but the analys is
would us e all the letters . T his example pretends that there is a
lot of c oded text and that the s ubs titute c ipher s hown in Table 6 -
1 8 is being us ed.
Bec aus e the mos t c ommon s ubs titute letters are P, followed by
J , a good gues s for breaking the c ode would be to s ee whether P
c ould really be E and J c ould really be T. T hes e firs t gues s es c an
be made all the way down the line for eac h letter. By s tarting with
the mos t frequently appearing letters and moving down the lis t, a
c ode breaker c an quic kly s ee whether thes e firs t hypothes es are
right or wrong and c hange gues s es around until E nglis h words
s tart to appear.
Other Common Letter Patterns
Frequently appearing two- letter words inc lude of, to, in,
it, and is .
By far, the mos t c ommon three- letter words are the and
and. O ther c ommon three- letter words inc lude for, are,
and but.
See Also
A few years bac k, a new s pec ies , a type of pos s um, was
identified. T he new s pec ies was named trichos urus cunninghamii.
Trichos urus means , um...pos s um (I gues s ), and the cunninghamii
part refers to its dis c overer, Ros s C unningham, a s tatis tic ian at
A us tralian N ational U nivers ity. I f you'd like to have a s pec ies
named for you, here's how s tatis tic s c an help.
For s ome group of animals to tec hnic ally be a s eparate s pec ies ,
it mus t s hare a unique s et of biologic al c harac teris tic s that
make it dis tinc t from s imilar animals . Sure, animals within the
s ame family all look a little different from eac h other, but then,
people look a lot different from eac h other and we are all one
s pec ies (my U nc le Frank being perhaps the exc eption that
proves the rule).
T his pos s um exis ted in A us tralia near people for more than 2 0 0
years and no one notic ed. To be fair, it looked an awful lot like the
other pos s ums , the mos t c ommon of whic h was the trichos urus
caninus , now c alled the s hort-eared pos s um.
I t was as s umed for s ome time that there was really jus t this one
s pec ies of the little guys . P art of D r. C unningham's job was to
c ollec t and organize des c riptive data for the wildlife around him.
C ons equently, he had a ton of very s pec ific quantitative
des c riptions of various pos s um parts eyes , ears , nos e, and
throatand meas urements of other phys ic al c harac teris tic s .
Statis tic ians tes t hypothes es , s o you s hould begin your analys is
with a gues s about whether there is or is not a dis tinc tion
between the groups of partic ipates who s upplied your data.
H ead length
Skull width
E ye s ize
E ar length
Tail length
C hes t width
Foot length
I n this data, you c ould find two groups of pos s ums that differed
the mos t bas ed on ear length, tail length, and foot length. T he
amount of variability explained was s o large that, s tatis tic ally,
C unningham c onc luded that the mathematic ally identified
groupings were real. T he two groups of pos s ums found in the
data were ac tually two different s pec ies of pos s um, and the
s pec ies c ould be defined by their ear length and a c ouple of
other variables . T he larger the weights in the equation s hown
earlier, the more the two s pec ies differed on thes e body parts .
Table 6 - 2 0 s hows the offic ial des c riptions of the two pos s um
s pec ies firs t identified as s uc h by our s tatis tic ian and his
mathematic s . N otic e the names are even bas ed on the key
predic tors found in the s tatis tic al analys is !
So, s tart c ollec ting your own data on thos e odd, s tinky bugs you
find on your s c reen door and you are well on your way to
greatnes s and immortality. I s there one s pec ies of s tink bug or
two? You tell me.
See Also
I know a guy who knew a guy who us ed to work for the P res ident
of the U nited States . Small world, eh? I 'm not s aying I have
great c onnec tions , but I am jus t two hands hakes away from the
leader of the free world. Before you get too impres s ed, you
s hould know that you probably are jus t a few links away from
almos t anybody in the world.
T he phras e and c onc ept c ome from a s tudy that c ons idered the
s mall-world problem. H ave you ever been at a party or been
c hatting with a s tranger at a c offee s hop and dis c overed that you
both know the s ame pers on? Soc ial ps yc hologis t Stanley
M ilgram was c urious about this phenomenon in the late 1 9 6 0 s
(when there were a lot more c oc ktail parties than there are now).
H ow muc h overlap was there in s oc ial networks ? I f we c ould all
get together and lis t everyone we know, would there always be
s ome c onnec tion? P robably, eventually, as we explored further
and further out of the c enter of our web of ac quaintanc es , we
would find s ome c onnec tion with almos t everyone. But how many
links would it take?
Recruit participates
Train participates
M ilgram s ent a pac ket in the mail to eac h rec ruit. T he pac ket
c ontained ins truc tions des c ribing the s tudy and a letter for the
Bos ton broker. T hey were as ked to deliver the letter to our guy,
but only if they knew him pers onally. I f they did not know him
pers onally, they were as ked to rec ord s ome information, s uc h as
their name, and s end the pac ket on to s omeone who they did
know who they thought might have a better c hanc e of knowing
him. T hos e next people in the c hain rec eived the s ame pac ket
with the ins truc tions and the letter. T hey might have s ent it to
the broker if they knew him, or s ent it on to a third link in the
c hain, and s o on.
I n your own s tudy, make s ure to write the ins truc tions c learly
and s imply, and, thes e days , you might explain that this is
legitimate res earc h, not a c ommerc ial s olic itation and not a
c hain letter (though it literally is , I gues s ), and all the
dis c laimers you think will help. You s hould als o inc lude c ontac t
information for you if anyone has any ques tions about the
legitimac y of the projec t.
A fter a reas onable amount of time, c hec k with your target and
gather all the letters rec eived. O n eac h letter, c ount the number
of names that form the c hain. A verage all the different lengths of
c hains to determine the typic al number of c onnec tions . Find the
s malles t number nec es s ary to inc lude even the longes t c hain,
and you have the maximum dis tanc e.
Two more rec ent s tudies have c onfirmed that the average
number of c onnec tions between people in s oc ial networks is
about s ix or even a little les s .
Doing a Small Study
Throw a party
When hos ting a large party (M ilgram would have loved it if you
us ed a c oc ktail party, the ins piration for his original s tudy), hand
out s upplies to your gues ts . G ive them eac h a large index c ard
and a pen. A t the bottom of eac h c ard, lis t the name of a gues t at
the party. I f gues ts don't know the pers on lis ted below, they
s hould s ign their name at the top of the c ard and hand it to
s omeone els e who they think might know the pers on.
See Also
D emoc rac y works like that as well. When it is time to vote for
P res ident, or M ayor, or D ogc atc her, we us ually mus t c hoos e
between s everal alternatives . We might not be happy with any of
the options , but we vote anyway (at leas t s tatis tic ians do).
D id you ever leave the voting booth, though, and feel that
s omehow your real feelings weren't repres ented very well by
thos e c hoic es ? P olitic al s c ientis ts know that feeling. T hey have
analyzed the s ometimes uns atis fying outc omes of votes
between alternatives and dis c overed that s uc h a proc es s c an
res ult in outc omes in whic h no one is happy (exc ept the winner,
of c ours e).
Vote Cycling
How It Works
T here are the Apples who prefer red, the Elephants who favor
white, and the Jayhawks who like blue bes t. T he groups als o differ
on whic h c olor they like s ec ond bes t and whic h c olor they like
leas t. Table 6 - 2 2 s hows the three groups and their politic al
agendas .
To determine the will of the s c outs , you c ould hold a two- s tage
elec tion. Stage one pres ents two alternatives . T he winner of that
s tage then "c ompetes " with the third alternative to pic k a winner.
T he two s tages and res ults c ould look like this :
2. Red or Blue? I n this matc hup, Red rec eives 2 0 perc ent
of the vote and Blue wins with a huge 8 0 perc ent.
I f the des igners of the voting s ys tem are interes ted in the rank-
order preferenc es of voters , voters c ould be as ked to rank- order
all c andidates . T he lowes t mean rank wins . T his is a fairer
method that us es all the information available, but it c an lead to
c hoic es that no one is really thrilled about.
For elec tions in whic h there are many c andidates (in s ome
mayoral or governor elec tions , for example), there is often a
runoff elec tion in whic h the larger number of c andidates is
whittled down to a s maller number. T his does n't have the
weaknes s of vote c yc ling, bec aus e all alternatives are
c ons idered at the s ame time. I t als o eliminates the weaknes s of
the s ingle- trip- to- the polls approac h bec aus e it inc reas es the
likelihood of a winning c andidate with majority s upport.
Hack 74. Live Life in the Fast Lane
(You're Already In)
T he problem is that rec ent res earc h involving s tatis tic ally bas ed
c omputer s imulations s ugges ts that drivers will us ually judge
another lane is moving more quic kly than theirs , even if it is
ac tually moving at the s ame s peed! T his mis perc eption, s urvey
res earc h s hows , is enough for mos t drivers to try to c hange into
that other lane.
Traffic res earc hers c all the times when you are pas s ing other
c ars s kips and the times when other c ars are pas s ing you s lips .
Rec ent res earc h refers to s kips as pas s ing epochs and s lips as
being-overtaken epochs . I t probably does not s urpris e you that
drivers greatly prefer pas s ing epoc hs over being- overtaken
epoc hs .
I n addition to looking for fas ter lanes to move into, drivers have
another goal, whic h is to keep their own vehic le moving as
quic kly as pos s ible, or at leas t c los e to their target s peed (whic h
might be the s peed limit, for example). I f there are perc eived
gaps between thems elves and the vehic le in front of them, and
they are not c urrently moving at their target s peed, drivers will
ac c elerate to c los e the gap. I t is thes e burs ts of ac c eleration
that ac c ount for the s kips (periods of pas s ing other c ars ) and
s lips (periods of other c ars pas s ing them). We are likely to
experienc e more periods of time when we are being pas s ed than
periods when we are doing the pas s ing. I t is this perc eived
inequity that c an res ult in drivers c onc luding that they are in the
s low lane, even if both lanes are equally s low.
I magine two lanes of traffic s ide by s ide that are moving at the
s ame average s peed. G aps between c ars form randomly; more
ac c urately, they form s ys tematically, but bas ed on a random
s tarting c onfiguration. G aps are filled as they form, and when
gaps are filled, c ars have ac c elerated.
Sitting s till while watc hing other c ars ac c elerate to fill gaps
c reates the illus ion that our lane is moving more s lowly.
C anadian res earc hers D onald Redelmeier and Robert Tobs hirani,
who c onduc ted c omputer s imulations to determine the ac c urac y
of driver perc eptions of other lanes ' s peed, made s ome
as s umptions about traffic patterns that were bas ed on the
normal dis tribution [H ac k #2 3 ].
To mirror the reality that a partic ular pattern of s pac ing on a
c rowded highway has s everal c aus es (c onditions , exits and
entranc es , and s o on), they randomly as s igned intervals
between moving c ars bas ed on two normal dis tributions : 9 0
perc ent of intervals were about two meters apart, give or take a
1 0 th of a meter, while 1 0 perc ent of the intervals were 1 0 0
meters apart, give or take 5 meters . A t the s tart of eac h of
hundreds of s imulations , c ars were c reated and s pac ed following
this randomization plan.
T he res earc hers c reated data for two lanes of traffic moving in
the s ame direc tion at the s ame s peed, full of hundreds of
imaginary vehic les with typic al ac c eleration and braking
c apabilities . T hey programmed in a s afe driver s trategy of
moving up when there was s pac e in a lane, but not getting too
c los e. T heir s imulated drivers were not allowed to get too c los e
to another vehic le's tailgate. A ls o, they were not allowed to
c hange lanes , whic h mus t have been frus trating for the little
c omputer- c ontrolled drivers . N o ac c idents here.
Redelmeier and Tobs hirani found that 1 3 perc ent of the time,
c ars are either pas s ing or being pas s ed. M os t of the time, c ars
are running equal to eac h other. While there was a better c hanc e
that any partic ular driver was being pas s ed than that s he was
doing the pas s ing, when s he did pas s c ars , s he pas s ed a bunc h.
T he math worked out to a draw in terms of c ars pas s ed and the
number of c ars doing the pas s ing. T he total number of c ars
overtaken by our driver was equal to the number of c ars that
pas s ed her.
Keep this other lane is better mis perc eption in mind and
find better ways to judge the s peed of other lanes . P ic k a
unique c ar in the other lane, and after a few minutes
c ompare your pos ition to it. A fter all, there s ometimes
are fas ter lanes than others ; it's jus t that you c an't look
at pas s ing c ars as the bes t evidenc e for s peed.
See Also
The search f or extraterrestrial lif e is alive and well. You can use
statistical sampling and probability to f ocus the search.
When the equation was firs t introduc ed, only one of the terms
c ould be es timated with any c ons ens us among as tronomers . R,
the number of new s tars produc ed in our galaxy eac h year, is
believed to be about 1 0 .
O ne pos s ible outc ome of the D rake equation is that there are
only two planets in our galaxy with advanc ed intelligent
c ivilizations c apable of s ending and rec eiving radio waves . I f we
really have only one other potential c os mic pen pal, it will be
tough to find him or her or it in s uc h a large hays tac k of planets .
So, what to do?
T hey are s uns that s hare c harac teris tic s of our own.
T he data won't make s ens e to you when you get it, but your
c omputer will begin to us e s tatis tic al analys es to s ort through
the s ignal information, looking for the telltale nonrandom narrow
bandwidths that might mean another planet has reac hed the
level of s ophis tic ation to produc e s omething like Gomer Pyle or
Melros e Place. You c ould be the firs t to dis c over life on other
planets , s o get to work!
Colophon
O ur look is the res ult of reader c omments , our own
experimentation, and feedbac k from dis tribution c hannels .
D is tinc tive c overs c omplement our dis tinc tive approac h to
tec hnic al topic s , breathing pers onality and life into potentially
dry s ubjec ts .
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
! (exclamation point)
<Emphasis>d</> (effect size) 2nd
<Emphasis>t</> tests
comparing groups
establishing validity
measuring relationships
purpose
sample distribution
variance and
Z-test and
Index
[SYMBOL] [A] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
Aces
counting
z scores and
additive rule 2nd 3rd
Adler, Joseph
aliquot parts
all-in
analysis level (learning) 2nd
analysis of answ er options
analysis of variance
answer options
analysis of
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
bar bets
designing
dice and
li'l flushes
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
card games
card-sharping
getting lucky
probabilities and
w ild cards
card tricks
card-sharping
casinos
card counting in
profit on roulette
categorical measurement
categorical variables
cause-and-effect relationships
correlation and
show ing
beauty of
overview
central tendency, measures of
chi-square test
one-w ay
tw o-w ay
ciphers, decoding
classical test theory 2nd
cluster sampling
coefficient alpha 2nd 3rd
coin toss
heads or tails
possible outcomes
community cards
flop as
reading quickly
comparison groups
<Emphasis>t</> test
pretests and
comprehension level (learning)
CONCATENATE function
concurrent validity
conditional probabilities 2nd 3rd
confidence intervals
building 2nd 3rd 4th
continuous values
discrete values vs.
correlation
betw een variables 2nd
defined 2nd
direction of
negative 2nd
partial
variable groupings
variance and
z scores and
correlation coefficient
(<Emphasis>r</>)
beauty of
computing
conditions
establishing validity
formula
interpreting
counting method
counting cards 2nd
rule of four
craps (dice game)
criterion variables
defined
criterion-referenced scores
defined
functionality 2nd
Cronbach's alpha
Cunningham, Ross
cut score
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
dependent variables
<Emphasis>t</> tests and 2nd 3rd
defined
descriptive statistics 2nd
Diaconis, Persi 2nd
dice roll
gambler's fallacy about 2nd
overview
dichotomous variables
difficulty index 2nd
discrete values, continuous vs.
discriminant analysis
discrimination index 2nd 3rd 4th
distances
in distributions
chi-square values
defined
distances in
histograms and
letters in English alphabet
probability and
sample
standardized score
w ell-defined
z score
double dow n 2nd
dovetail shuffles 2nd
Dow ning, S.M
Drake equation
Drake, Frank
draw poker
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
effect size
applying interpretations
pow er and
sampling 2nd
trial and error
Type 1 error
Type I error
Excel (Microsoft)
DATAS softw are
histograms
experimental designs
comparison groups 2nd
defined
effective 2nd
validity of
experimental groups
extra-sensory perception (ESP)
extraterrestrial life
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
flop
defined
football
betting on
tw o-point conversion
fractions
frequency tables
dice rolls
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
Galton, Francis
gambler's fallacy 2nd
gambling
basic truths
blackjack
card-sharping
coin toss
lottery
playing cards
playing w ith dice
pot odds
roulette
rule of four
sharing birthdays
short-stacked
w ild cards
game playing
card tricks
estimating pi
histograms in Excel
iPods and
Monopoly
ranking players
strategies
tw o-point conversion
game show s 2nd
games of chance
fair payouts
Monopoly
roulette as
general universe
generalizations
cause-and-effect and 2nd
samples and
Gigerenzer, G.
Gilligan's Island 2nd
GMAT (Graduate Management Admission Test)
Golden Theorem
goodness-of-fit statistic 2nd
Gott's Principle
Gott, J. Richard, III 2nd 3rd
Grandparent Paradox
graphing relationships 2nd
GRE (Graduate Record Exam)
group designs 2nd 3rd
Guare, John
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
Haladyna, T.M.
Hale-Evans, Ron
Hall, P.
Hanks, Tom
Hansen, Brian
haphazard sampling
Hastings, J.T.
Haw thorne Effect
overview
randomness and
high scores
higher scores, likelihood of
Hill, Theodore 2nd 3rd 4th
histograms 2nd
hit the nuts
Hitzges, Norm
Hofferth, Jerrod
house edge 2nd 3rd
Huff, D.
statistical
hypothesis testing
errors in
interpreting findings
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
inferential statistics
controversial tools
defined 2nd
overview
populations and
relationships and
samples and
insurance in card games
intelligence tests 2nd
inter-rater reliability
INTERCEPT function
internal reliability 2nd
defined 2nd
negative numbers
pow er of
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
Jordan, C.T.
judgment sampling
Jung, Carl
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W]
[X] [Y ] [Z]
labels
lane-changing decisions
law of finite pocket size
Law of Large Numbers 2nd
Law of Total Probability
learning
cognitive levels of 2nd
trial-and-error
Leclerc, Georges-Louis 2nd
Let's Make a Deal
regression]
graphing relationships
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M ] [N] [O] [P] [Q] [R] [S] [T] [V ] [W]
[X] [Y ] [Z]
Madaus, G.F.
magic number, lotteries and
MANOVA (multivariate analysis of variance)
MCAT (Medical College Admission Test)
mean]
ACT
calculating
defined 2nd
effect size and
normal distribution
precision of
T scores
measurement]
<Emphasis>t</> tests
asking questions
categorical
defined
graphs and
levels of 2nd
normal distribution
percentile ranks
precise
probability characteristics
reliability of
testing fairly
median
central tendency and 2nd 3rd
defined
Microsoft Excel
DATAS softw are
histograms
mode
central tendency and 2nd
defined
models
building 2nd
defined
money
casinos and 2nd
infinite doubling of
Monopoly
Monty Hall problem
multiple regression
criterion variables and
defined
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
norm-referenced scoring
defined 2nd
percentile ranks
simplicity of
normal curve
Central Limit Theorem and
overview
precision of
predicting w ith
normal distribution
applying characteristics
overview
shape of
traffic patterns
null hypothesis
defined
errors in testing
possible outcomes
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
outcomes
blackjack 2nd
coin toss
identifying unexpected
likelihood of 2nd
mutually exclusive
predicting 2nd
spotting random
trial-and-error learning
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W]
[X] [Y ] [Z]
p-values
pairs of cards, counting by
parallel forms reliability
partial correlations
Party Shuffle (iTunes) 2nd 3rd 4th
Pascal's Triangle
Pascal, Blaise
passing epochs
payoffs
expected 2nd
sample estimates
of scores
percentile ranks 2nd
performance
criterion-based arguments
ranking players
permutations 2nd 3rd
Petersen, S.E.
Peyton, V.
Phye, G.D.
pi, estimating
pivot tables
plain text
pocket pair 2nd
Poe, Edgar Allen
point system
ranking players
Poisson, Sim\x8e on-Denis
w ild cards
populations
Central Limit Theorem
defined 2nd
positive correlation
defined
power
effect size and
precision
calibrating 2nd
game w inners
length of lifetime
outcome of events
regression analysis and
test performance
predictive validity
predictor variables
combining
defined
probability
additive rule
analytic view of
basic equation
of card distributions
decoding ciphers
defined
expressing distributions as
extraterrestrial life
li'l flushes
multiplicative rule
Pascal's Triangle
patterns of
pi and
sharing birthdays
traffic patterns
various digits
zonks and
probability) 2nd 3rd
proportions
level of measurement and
normal curve and 2nd
z score and
proxy variables
psychic abilities
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
questions
asking
difficulty index
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
Ramseyer, Gary
random data
random error 2nd 3rd 4th
random sampling
Benford's law 2nd
defined 2nd
random selection
game playing and
iTunes option
random shuffle
random thoughts
randomness of life
ranking
determining for players
raw scores
converting to z scores
relationships
averages in
comparing groups
discovering 2nd
discrete sampling
effect sizes
efficient sampling
graphing 2nd
identifying unexpected
predicting outcomes
statistical inference
statistical significance of
reliability
defined
research designs
categories of 2nd
threats to validity
research hypothesis 2nd 3rd 4th
response rate
Rhine, J. B.
riffle shuffles 2nd 3rd
rising sequences
river
Rodriguez, M.C.
Rothman, Ernest E.
roulette
fair payouts
overview
ROUND function 2nd
ROUNDDOWN function 2nd
ROUNDUP function 2nd
rule of four
rule of tw o
rule of tw o plus tw o
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
sample size
coin toss and
samples
<Emphasis>t</> tests and
cluster
defined 2nd
discrete/continuous objects in
efficient
extraterrestrial life
statistical significance of
sampling errors 2nd
sampling frame 2nd
sampling unit 2nd
SAS softw are
SAT 2nd
Saxbe, Darby
scale invariance
consistency in
correlation coefficient
defined
error
high
level of measurement
likelihood of higher
percentage of
predicting 2nd
standardized 2nd
T scores 2nd
validity of
Search for Extraterrestrial Intelligence (SETI)
sensitivity
serendipity, interpreting
SETI (Search for Extraterrestrial Intelligence)
Shadish, W.R.
shared variance 2nd
short-stacked
shuffling cards
standard deviation
ACT
defined 2nd
formula
T scores
z score
standard error
calibrating precision 2nd
defined
determining
overview
scores and
defined 2nd
determining
defined
statistical significance
Central Limit Theorem and
chi-square values
heads or tails
increasing pow er
judging importance
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W]
[X] [Y ] [Z]
T scores 2nd
table of areas under the normal curve
table of specifications
tax returns, fraudulent
test scores
establishing reliability
establishing validity
improving
norm-referenced scoring
precision for
predicting performance
statistical significance
test-retest reliability 2nd 3rd
testing
fairly
improving scores
validity in
tests of significance
pot odds
ranking players
rule of four
short-stacked
thoughts, random
Tibshirani, Robert 2nd 3rd
Tic-Tac-Toe
traffic patterns
trial-and-error learning
true score 2nd 3rd
true zero
turn
TV game show s 2nd
Tversky, Amos 2nd 3rd
tw o-point conversion (football)
tw o-tailed test
tw o-w ay chi-square test
Type 1 error 2nd
Type I error
Type II error 2nd
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
validity
establishing
scores and
threats to
variables
categorical
cause-and-effect relationships
dichotomous
discovering relationships
effect sizes
groupings in
measuring correlation
proxy
surrogate
variance
correlation and
defined 2nd
shared 2nd
Vermeil, Dick
Vos Savant, Marilyn
voting cycle paradox
Index
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
wagers
betting systems based on
dice and
increasing
roulette and
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[Y] [Z]
[SYMBOL] [A ] [B] [C] [D] [E] [F] [G] [H] [I] [J]
[K] [L] [M] [N] [O] [P] [Q] [R] [S] [T] [V ] [W] [X]
[ Y ] [ Z]
z score
coin toss and
defined
problems w ith