Professional Documents
Culture Documents
3 Essentials For Better Graphs
3 Essentials For Better Graphs
data visualization
Dr. Abby Benninghoff
3 Essentials
1
1 Use the right kind of
plot for your data
Pictures as figures
• Never assume that anyone knows what
is in a picture
– Use arrows, markers to identify features
– Include scale bar
– Specify meanings of colors
– Include key explanations in figure legend
2
Use diagrams to convey complex ideas
This works.
Particularly as an
overview in an
introduction section
or a review paper.
3
What type of graph?
• Line graphs for dynamic comparisons
– Dependent variable changes with respect to
time, dose, etc.
– Do not over-crowd with multiple (>4) lines
– Use curve-fitting when appropriate
(provide curve fit model)
• Scatter plot for correlations
– Most commonly made in two dimensions
– Line of best fit – linear regression
– Show r statistic, confidence interval lines or
other indication of robustness of the data
3 Essential steps for better data visualization. January 20, 2016
4
What type of graph? bc
c
• Box plot useful for descriptive 100
statistics ab
a
– Will highlight differences in
5
2 Graphs should elucidate
information, not obfuscate
25 a 25 a
b b
20 20
15 15
or inappropriately 10 10
emphasize or disguise 5
0
DIO
TWD
AIN93G
5
0
DIO
TWD
AIN93G
trends 2 4 6 8
Weeks of age
10 12 4 8
Weeks of age
12
30
26
break)
22
10
20
0 18
Male Female Male Female Male Female M F M F M F M F M F M F
AIN93G TWD DIO AIN93G TWD DIO
6
Including/excluding data
40
Variable B
30
blue influence the regression fit.
25 25
E2 E2
PFOS PFOS
0 10:2 FtOH 0 10:2 FtOH
-12 -11 -10 -9 -8 -7 -6 -5 -4 -3 -2 -1 0 -12 -11 -10 -9 -8 -7 -6 -5 -4 -3 -2 -1 0
Inhibitor concentration (Log M) Inhibitor concentration (Log M)
Response (units)
Response (units)
6 6
7
Graph style should reflect experiment design
Water Water
GT GT GT
• Data groupings should reflect the 8 58 5
8 8
Response (units)
(units)
Response (units)
Response (units)
Response (units)
6 6 6 6
statistical analysis performed. 3 3
Response
4 4 4 4
– This presentation suggests two 2 2
Response (units)
5 ac
appropriate or this 2x2 experiment 4 c
design. 3
b
8
Characteristics of well-designed figures
• Neatness – Image is clean and sharp, invites attention
• Readability – Eye can discern important information easily, quickly
• Font – Text is large enough to be read, placed well and limited
• Size – Graph is sized appropriately for anticipated reduction during printing.
• Aesthetics – Balanced graphs, good use
of white space, eyes drawn to most important
features
• Use of color – Distinguishes important
content, pleasing to eye
• Consistency – Similar graphs have same
stylistic scheme (line width, font type
and size, labeling, scales, etc.)
3 Essential steps for better data visualization. January 20, 2016
9
How can I fix it?
Scale is more appropriate;
subdivide Y-axis into big Use solid fills and big
Use large font for axis
units; don’t use minor ticks patterns to distinguish
titles, two sizes smaller
(unless log scale) bars
for axis labels; be
consistent in font sizes
10
Reformatted (not Excel) Keep the field behind the data clear of
any grid lines (unless needed to show
reference measurement, such as for
Follow previous normalized data)
recommendations for
font sizes, axes
thickness, etc. Also use
the same thickness for
symbol outline. Use big symbols, easy
to see when graph is
reduced for publication.
Use shading to help
distinguish symbols.
11
The 3D temptation
• Do not use 3D unless absolutely
necessary.
• Many other ways to show 3 levels of data
without using a rotated 3D graph
When 3D works
(Actually, these plots incorporate 4 levels of information; 3 different axes and color!)
3 Essential steps for better data visualization. January 20, 2016
12
Exceptions to the “rules”
• Sometimes, breaking the rules of figure formatting is necessary to make
the graph more clear and to emphasize the key findings
13
Exceptions to the “rules”
• Horizontal graphs are okay, if they aid in data interpretation
14
Black & White Color
15
Some figures require color
16
Better design
• Only need to show the legend once since it clearly applies to both panels
• A graph title is optional (depends on reviewer); use the letters to identify the
panels in the figure legend
• Mirror formatting, scale, colors, etc. between the panels when appropriate
• Don’t have to repeat Y-axis title for panel B since it is the same measurement
3 Essential steps for better data visualization. January 20, 2016
17
What is an
infographic?
• A visual image such
as a chart or diagram
used to represent
information or data
• Highly shareable!
• Makes information
accessible to non-
experts
18
Use elements of the infographic style
• Just as with regular figure, infographic
100
* No supplement
80 Tart Cherry
Incidence (%)
must be easily discernable within allowed 60
20
19
Software options
• Excel is just awful for making science graphs
– Especially bad for multi-panel figures
– Thinks too much for you
• Try out other software that specializes in scientific data presentation
– SigmaPlot http://www.sigmaplot.com/
• Haven’t worked with this one in years
• Advanced graphics with somewhat steep learning curve
– GraphPad Prism http://www.graphpad.com/
• Easy to use interface
• Integrates statistics with graphing
• Drawback – pricey for individuals ($100/yr); bulk licensing available
• 30-day trial available
Primer on Prism
• Organization of sheets by
folder structure
– Data tables
– Notes
– Analyses
– Charts
– Layouts
• Each data table is linked
to other “family” sheets
20
Advantages of Prism A
400
Food intake
B
1500
Energy intake
c
C
40
Final body weight
• Very customizable
300 b a 30
Grams
a a
1200 a
250 25
1100
200 1000 20
E
0
AIN TWD DIO MM
Fasting Glucose
VMM
F
AIN TWD DIO MM VMM
Glucose (mg/dL)
c a DIO
Glucose (mg/dL)
a 300 MM
VMM bc
VMM
20 ab 100
a
a 200
charts
10 50
100
0 0
0 2 4 6 8 10 12 14 16 AIN TWD DIO MM VMM
0 15 30 45 60 75 90 105 120
1.5 0.15 d
4.0 10 a 4
1.0 0.10
ac a
2.0 5 c 2
0.5 0.05
c c
0.0 0.0 0 0.00 0
AIN93G TWD DIO VMM MM AIN93G TWD DIO VMM MM
IN
IN
IN
D
IO
D
IO
D
IO
TW
TW
TW
A
A
D
D
3 Essential steps for better data visualization. January 20, 2016
Advantages of Prism
• Integrates basic statistics
– Students t-test
– One-way and two-way
ANOVA
– Histogram analyses
– Contingency analysis
• Includes interpretation of
results to help students
understand analysis results
• Excellent for curve fitting
21
Other tools for data visualization
Enrollment: College of Agriculture & Applied Sciences, Fall 2013
using MS PowerPoint
1
–
6 6
1
data 1
1
1
1
• Work with digital image data using Adobe About Tableau maps: www.tableausoftware.com/mapdata
software*
6
1
10
1
14
2 2
1 8 1 1
1
image-based data
2
1
2
7 136 3
11
4 1
1 1 1 2
2
3
1
1
5
1
22
Comments on sample figures
• Patterns of lines difficult to discern
– Is it important to be able to distinguish
these lines?
– Some colors are quite similar to each other.
I see two different green dotted lines that
are quite hard to distinguish
23
Revised example
Average monthly temperature
100
80
Temperature (°F)
60
40
20
1 2 3 4 5 6 7 8 9 10 11 12
Month
Phoenix Raleigh Minneappolis
24
Comments on sample figure
8 Cch1 (r2 = 0.994)
Cch3 (r2 = 0.999)
Cch4 (r2 = 0.996)
Cch5 (r2 = 0.990)
0
0 2 4 6 8
Differential Head over Levee (m)
I assumed connecting line was a regression in this example, and showed the r2 value
I also shortened the legend text, as the 1.47E-1 cm 2/s can easily go into the figure legend text.
But, if showing the figure as a presentation, the author may want to keep that info.
3 Essential steps for better data visualization. January 20, 2016
Abby D. Benninghoff
Department of Animal, Dairy and Veterinary Sciences
abby.benninghoff@usu.edu
435-797-8649
25