Professional Documents
Culture Documents
Article Role of Stories in Data Storytelling
Article Role of Stories in Data Storytelling
in data storytelling
by Shawn Callahan
As Edward R. Tufte published each of his beautiful What follows is a framework for how story
books on the visual display of complex data – The techniques can help the data analysis process.
Visual Display of Quantitative Information (1983)1, It is useful for any individual (or group) working
Envisioning Information (1990) 2 and Visual with data, whether you’re a scientist, a marketer,
Explanations (1997) 3 – I became more and more an engineer or a policy-maker. The challenge in
fascinated by the question of how to grab the each case is similar: how do you put yourself in
attention of decision-makers and inspire them the best position to make sense of a mass of data
to act based on the data. in order to gain insights, and then inspire people
to change based on the discoveries?
I remember an early success back at the start of
the 1990s. I was helping some councillors see We are starting to see things written about the
the need for a tree-planting program to provide role of stories in data analysis. For example, Brent
nature corridors across their shire. Before they Dykes over at Forbes5 has written a compelling
saw the results of our geographic systems analysis piece describing the need for data storytelling,
(what we called computerised mapping before showing that the emphasis today is on data
Google Maps), the councillors had told themselves manipulation and analysis tools and skills. He
the story of how, over the years, they’d invested in predicts there will be a shift to storytelling when
planting trees and that’s why their towns had such the gap widens between the analysts who are
leafy surrounds. They felt they’d done more than discovering the insight and the decision-makers
enough. So when we unfurled maps showing tiny who are learning about it in a way that helps them
and mostly unconnected stands of mature trees and care. Cole Nussbaumer Knaflic’s Storytelling with
a vast expanse of paddock, they were more than Data6 also points to the popularity of the idea.
surprised. After the initial shock and some resistance,
However, Nussbaumer Knaflic’s book is really
they agreed to fund the tree-planting project.
a guide to data visualisation, with only a single
Cognitive psychologist Gary Klein has said that chapter on storytelling. And while Dykes laments
‘insight is when you unexpectedly come to a the missing link between the analyst and the
better story’.4 These councillors had just had an decision-maker, his article doesn’t take the
insight, one that we helped them to have. opportunity to appreciate the wider role stories
play in data analysis, beyond just inspiring the
Back then, I was unaware of the role stories play
decision-maker.
in the process of making sense of the data and
communicating insights the analyst uncovers. Here, we will explore the three types of story
But over the last 15 years, my work in business work, the role stories play before, during and
story techniques and my interest in conveying the after the data analysis, and the various story
results of data analysis have merged, and I now patterns that could be employed to inspire a
see a strong role for story work beyond just telling decision-maker to take action.
the story of the results.
Figure 2
DATA E X P L A N AT I O N D I S C O V E RY TRIGGERING
S TO RY S TO RY STORY A NEW STORY
T IME SERIE S POINT POINT T IME SERIE S
DATA IN T IME IN T IME OR POINT
DATA DATA IN T IME DATA
Figure 3
1982-2000
correlation = 0%
Profits
Profits
1961-1982 1961-1982
correlation = correlation =
94% 94%
Sales Sales
Source: Sell with a Story, by Paul Smith Source: Sell with a Story, by Paul Smith
Source: Sell with a Story, by Paul Smith Source: Sell with a Story, by Paul Smith
Was it when commodity costs got out of control? The connection between cause and effect, how-
Paul chased down each hypothesis and in the end ever, doesn’t have to get down to root causes.
discovered that it was when the market reached Marketers have discovered that if you can uncover
full penetration. (Figure 5) a reliable correlation, then you can make decisions
– I’m sure this sends the scientists nuts, but for
Paul tells the story this way:
a business it can be a practical approach. For
“Before we launched disposable diapers in the example, large retailers collect masses of data
early ’60s, everyone used cloth diapers. But around loyalty cards, such as purchases, dates,
it’s not like once disposable diapers came out, times, geographies, shopper demographics
everybody switched from cloth immediately. It and so on. An analyst can explore this data for
took years for that to happen. In fact, it turns strong correlations, and once they are found,
out it took exactly 21 years.” predictions can be drawn.
“By 1983, the market for disposable diapers The following scenario, told by Charles Duhigg
had essentially reached 100 percent of in The New York Times,11 is from the retailer
households with kids who wore diapers, and Target. A woman aged 23 buys cocoa-butter
cloth diapers had almost entirely vanished from lotion, a purse large enough to double as a
the marketplace. Up to that point, everyone diaper bag, zinc and magnesium supplements,
making disposable diapers had rapidly growing and a bright blue rug. Target can predict she
sales numbers, and the rapidly growing profit has an 87% chance of being pregnant and
numbers to go with them. The cloth diaper will have her baby in five months. With that
makers, of course, were going out of business.” knowledge, they can then send brochures
directly to the woman, encouraging pregnancy-
“What that means is that the disposable diaper
related purchases.
business in the United States went from a
‘developing market’ to a ‘mature market’ in As Target discovers, though, you have to be
1983. And apparently, we (Procter & Gamble) careful with this knowledge. Their analysis results
failed to notice it. We’re still following the same in a high-school girl receiving advertisements
basic ‘sell more’ strategy we’ve been using of maternity clothing and nursery furniture.
during the developing market period.”10 Her father is enraged and complains to a Target
manager about the obvious mistake. A few days
Analysis is a battle of stories in a very Darwinian
later the manager calls to apologise again to the
fashion. The one with the best fit with the data
father. A little sheepishly, the dad admits he’s
wins. It’s the job of the analyst to explore the
had a conversation with his daughter and she
many possible stories that might explain what
is due in August.
they are seeing.
32
30
28
26
24
A data story is vulnerable in one way: it can be you could say it’s likely that on 28 August 1854,
usurped by a better story. For example, in relation the water pump on Broad Street became infected
to the Norway story, what if scientists discovered when the cesspool for the block overflowed
that when humans are under extreme stress, we because of broken brickwork. Houses on Broad
produce a chemical in our blood that reduces the Street fell first – as water was taken from the
likelihood of heart failure? Then the story could pump, you could see a fanning out of the disease
become something like the following. and the resulting deaths in a radial pattern.
Interestingly, there were no deaths at the nearby
Before the war, Norwegians were a relatively
brewery as everyone there drank beer to hydrate
relaxed population, and the incidence of heart
and they had their own well to take water from.
disease increased on a par with other Western
The few other unaffected households in the area
cultures. But when the war began, Norwegians’
were discovered to prefer the water from a pump
stress levels went through the roof. Their bodies
that was further afield and unaffected by cholera.
produced heaps of chemical X, and heart disease
almost disappeared. But when the war and its John Snow wrote extensively on how the disease
stresses ended, the Norwegians resumed their spread in London and explained all the anomalies
old stress-free lives, and rates of heart disease he found in the data. His story was a compelling
climbed again. account and, as I said earlier, it changed public
health policy in England. As Steve Johnson,
OK, it’s perhaps not the most compelling
author of The Ghost Map18 and my source for
alternative story, but you get my meaning. A key
John Snow’s epic data storytelling, has noted:
story principle is that you can’t beat a story with
“It was going to take more than body counts to
fact. You can only beat it with a better story.
prove that the pump was the culprit behind the
Stories of the past are often overtaken by new Broad Street epidemic. Snow was going to need
discoveries. Clearly, the founder of IBM couldn’t footprints too.”
have imagined the scale of future technology
The discovery story
when he predicted the world would only ever
need a handful of computers. Sometimes you have to explain how a discovery
was made for the audience to both appreciate
The explanation story the insight and understand how much work
When your analysis is not a time series, then your went into having it.
story could explain your insight. John Snow’s
Google recently completed a comprehensive
cholera map is a good example. On its own,
study19 into what makes a team productive.
without a story, the map lacks meaning. However,
1
The Visual Display of Quantitative Information
http://amzn.to/29Ks9V9 Shawn Callahan is the author of
2
Envisioning Information http://amzn.to/29W9dWy Putting Stories to Work and the
3
Visual Explanations http://amzn.to/29W81Ts Founder at Anecdote, the world’s
4
Klein, G. A. (2013). Seeing what others don’t: the largest business storytelling
remarkable ways we gain insights. New York, PublicAffairs.
company.
5
www.forbes.com/sites/brentdykes/2016/03/31/data-
storytelling-the-essential-data-science-skill-everyone-
You can get his book at
needs/#7acea305f0c8
www.puttingstoriestowork.com
6
Storytelling with Data http://amzn.to/2aorpHu
7
The facts presented here about John Snow and the cholera
He works with Global 1,000 companies
epidemic of 1854 are drawn from Steven Johnson’s terrific
2007 book, The Ghost Map: A Street, an Epidemic and the such as Shell, Danone, Microsoft, TESCO,
Hidden Power of Urban Networks (Penguin Books). Allianz and Bayer all around the world.
8
The Ultimate Guide to Anecdote Circles www.anecdote.
com/pdfs/papers/Ultimate_Guide_to_ACs_v1.0.pdf Other articles by Shawn and the
9
Paul Smith (2016). Sell with a Story: How to Capture Anecdote team at www.anecdote.com
Attention, Build Trust, and Close the Sale, AMACOM
http://amzn.to/2aEsSbV You can automatically receive Shawn’s
10
Paul Smith (2016), Sell with a Story: How to Capture
future articles via email by subscribing to
Attention, Build Trust, and Close the Sale, AMACOM
11
Charles Duhigg in The New York Times www.nytimes.
Anecdote’s newsletter at
com/2012/02/19/magazine/shopping-habits.html www.anecdote.com/subscribe
12
Putting Stories to Work http://amzn.to/29Y4Mrm
13
www.anecdote.com/2014/09/story-framework-infographic/ Contact Shawn at:
14
www.anecdote.com/2013/01/storytelling-tips-for- Phone: +61 1300 720 537
leaders-spotting-stories/
www.anecdote.com
15
A. Strom, R. A. Jensen, M. D. Oslo and M. D. Oslo (1951),
‘Mortality from Circulatory Diseases in Norway 1940–1945’,
shawn.callahan@anecdote.com
The Lancet, 257(6647): 126–9. twitter.com/shawncallahan
16
Mystery stories www.anecdote.com/2008/04/why-
should-we-care-about-mystery-stories/
17
My friend and fellow story practitioner Paul Smith pointed
out this approach to me.
18
The Ghost Map http://amzn.to/2aoyg3I
19
Comprehensive study https://rework.withgoogle.com/
blog/five-keys-to-a-successful-google-team/
I would like to thank my friends Paul Smith,
20
Duhigg’s article www.nytimes.com/2016/02/28/
magazine/what-google-learned-from-its-quest-to-build- Andrew Ford and Walter Adamson who helped
the-perfect-team.html me think through my ideas.