Misleading Graph Module 3
Contents
I-Introduction
V-Academia
I-Introduction
-A misleading graph, also known as a distorted graph, is a graph that misrepresents data, constituting a misuse of statistics, with the result that an incorrect conclusion may be derived from it.
-Graphs may be misleading through being excessively complex or poorly constructed.
-Even when constructed to accurately display the characteristics of their data, graphs can be subject to different interpretations.
-Misleading graphs may be created intentionally, to hinder the proper interpretation of data, or accidentally, due to unfamiliarity with graphing software, misinterpretation of data, or because the data cannot be accurately conveyed.
-Misleading graphs are often used in false advertising.
-One of the first authors to write about misleading graphs was Darrell Huff, author of the 1954 book How to Lie with Statistics.
-The field of data visualization describes ways to present information that avoid creating misleading graphs.
II-Misleading graph methods
"It [a misleading graph] is vastly more effective, however, because it contains no adjectives or adverbs to spoil the illusion of objectivity, there's nothing anyone can pin on you."
--How to Lie with Statistics (1954)
A-General Methods
1-Excessive usage
The use of graphs where they are not needed can lead to unnecessary confusion or interpretation.
Generally, the more explanation a graph needs,
the less the graph itself is needed.
Graphs do not always convey information better than tables.
2-Biased labeling
The use of biased or loaded words in the graph's title, axis labels, or caption may inappropriately prime the reader.
B-Methods related to design
1-Poor construction
Poorly constructed graphs can make data difficult to discern and thus to interpret.
2-Complexity
Graphs are designed to allow easier interpretation of statistical data. However, graphs with excessive complexity can obfuscate the data and make interpretation difficult.
3-Pie chart
Comparing pie charts of different sizes can be misleading, as people cannot accurately read the comparative area of circles.
Thin slices are hard to discern and may be difficult to interpret.
Using percentages as labels on a pie chart can be misleading when the sample size is small.
Making a pie chart 3D or adding a slant makes interpretation difficult because of the distorting effect of perspective.
Bar-charted pie graphs in which the height of the slices is varied may confuse the reader.[9]
3D Pie chart slice perspective
A perspective (3D) pie chart is used to give the chart a 3D look. Often used for aesthetic reasons, the third dimension does not improve the reading of the data; on the contrary, these plots are difficult to interpret because of the distorted effect of perspective associated with the third dimension.
The use of superfluous dimensions not used to display the data of interest is discouraged for charts in general, not only for pie charts.
In a 3D pie chart, the slices that are closer to the reader appear to be larger than those in the back due to the angle at which they are presented.
Comparison of pie charts
– Given their low data-density and failure to order numbers along a
visual dimension, pie charts should never be used.
4-Improper use of 3D
The use of a superfluous third dimension, which does not contain information,
is strongly discouraged, as it may confuse the reader.
Volatility: fluctuation
Steady: stable
Slope: incline
Plot: to sketch or draw
Indicates a break in the y-axis
3-Axis changes
Exaggerated: overstated
Destructive capacity: ability to destroy
Proportional: in direct proportion
Additionally, an improperly scaled pictogram may leave the reader with the
sense that the item itself has actually changed in size.
Logarithmic scaling
Logarithmic (often called log) scales can be a perfectly valid means of representing data. However, if a log scale is used without being clearly labelled as such, or is shown to a reader who is unfamiliar with the concept, the graph can be misleading.
Log scales express data values as a chosen number (the base of the log) raised to a particular power; the base is often e (2.71828...) or 10. For example, a base-10 log scale may give a height of 1 "unit" for a value of 10 in the data and a height of 6 "units" for a value of 1,000,000 (1x10^6).
Log scales are in common use in some fields, including the VEI (volcanic explosivity index) scale and the Richter scale for earthquakes. Magnitudes of stars in astronomy and the pH of acidic and alkaline solutions are also based on a form of log scale. Such scales can, however, make the data less immediately apparent to the eye.
A graph with a log scale that is not clearly labelled as such, or one presented to a viewer with no knowledge of logarithmic scales, will generally make data values look similar in size when they in fact differ widely in magnitude.
Misuse of a log scale
-can make vastly different values (such as 10 and 10 thousand) appear close together (on a base-10 log scale they would be only "1" and "4"),
-can make small values appear to be negative, because a logarithmic scale represents numbers smaller than 1 as negative values,
-may also cause relationships between quantities to appear linear when those relationships are in fact exponentials or power laws, which rise very rapidly towards higher values.
It has been stated, although mainly in a humorous way, that "anything looks linear on a log-log plot with thick marker pen".
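The effects listed above can be checked numerically. The following is a minimal sketch, assuming a base-10 log axis; the helper name log10_height and the sample values are illustrative assumptions, not part of the original notes. It shows how widely different values collapse to nearby heights, how values below 1 come out negative, and how an exponential series turns into evenly spaced (straight-line) steps.

```python
import math


def log10_height(value):
    """Height of a positive data value on a base-10 log axis (hypothetical helper)."""
    return math.log10(value)


# Widely different values end up only a few "units" apart.
heights = {v: log10_height(v) for v in (10, 10_000, 1_000_000)}

# A value below 1 is drawn below zero on the log axis.
below_one = log10_height(0.5)

# An exponential series (doubling each step) rises by a constant
# amount per step on a log axis, so it looks like a straight line.
growth = [2 ** x for x in range(1, 6)]
steps = [log10_height(b) - log10_height(a) for a, b in zip(growth, growth[1:])]

if __name__ == "__main__":
    for value, height in heights.items():
        print(f"value {value:>9} -> height {height:.1f}")
    print(f"value 0.5 -> height {below_one:.3f}")
    print("step sizes:", [round(s, 3) for s in steps])
```

A reader who misses the axis label would see 10 and 10,000 as three "units" apart rather than a thousandfold difference, which is why the label matters.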
2-Omitting data
Graphs created with omitted data remove information on which a conclusion could be based.
In financial reports, negative returns or data that do not correlate with a positive outlook may be excluded to create a more favorable visual impression.
III-Measuring distortion
Several methods have been developed to determine whether graphs are distorted
and to quantify this distortion.
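1-Lie factor
$$\text{Lie factor} = \frac{\text{size of effect shown in graphic}}{\text{size of effect shown in data}}$$
where
$$\text{size of effect} = \frac{\lvert \text{second value} - \text{first value} \rvert}{\text{first value}}$$
2-Graph discrepancy index
$$\text{GDI} = 100 \left( \frac{a}{b} - 1 \right)$$
where a is the percentage change depicted in the graph and b is the percentage change in the data.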
-The graph discrepancy index, also known as the graph distortion index (GDI),
was originally proposed by Paul John Steinbart in 1998.
-GDI is calculated as a percentage ranging from −100% to positive infinity, with zero percent indicating that the graph has been properly constructed and anything outside the ±5% margin considered to be distorted.
-Research into the usage of GDI as a measure of graphics distortion has found
it to be inconsistent and discontinuous,
making the usage of GDI as a measurement for comparisons difficult.
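Both measures can be computed from four numbers: the first and second data values, and the sizes at which they are drawn. The sketch below assumes the commonly cited definitions (lie factor = size of effect in the graphic divided by size of effect in the data, with size of effect = |second − first| / first; GDI = 100 × (a/b − 1), with a and b the percentage changes in the graphic and in the data). The function names and the bar-height example are hypothetical.

```python
def size_of_effect(first, second):
    """Relative change between two values: |second - first| / first."""
    return abs(second - first) / first


def lie_factor(graphic_first, graphic_second, data_first, data_second):
    """Size of effect shown in the graphic over size of effect in the data."""
    return (size_of_effect(graphic_first, graphic_second)
            / size_of_effect(data_first, data_second))


def gdi(graphic_first, graphic_second, data_first, data_second):
    """Graph discrepancy index as a percentage: 100 * (a / b - 1)."""
    a = 100 * (graphic_second - graphic_first) / graphic_first  # % change drawn
    b = 100 * (data_second - data_first) / data_first           # % change in data
    return 100 * (a / b - 1)


def is_distorted(gdi_value, margin=5.0):
    """Apply the +/-5% margin mentioned in the notes."""
    return abs(gdi_value) > margin


# Hypothetical example: the data rise from 100 to 110 (a 10% increase),
# but a truncated axis draws the bars 1.0 and 2.0 units tall (a 100% increase).
example_lie = lie_factor(1.0, 2.0, 100, 110)
example_gdi = gdi(1.0, 2.0, 100, 110)

if __name__ == "__main__":
    print("lie factor:", example_lie)
    print("GDI (%):", example_gdi)
    print("distorted:", is_distorted(example_gdi))
```

A faithfully drawn graph gives a lie factor of 1 and a GDI of 0%, so the two measures agree on the undistorted case even though they use different scales.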
3-Data density
The data density should be relatively high; otherwise a table may be better suited for displaying the data.
4-Data-ink ratio
The data-ink ratio should be relatively high; otherwise the chart may contain unnecessary graphics.
V-Academia
The perception of graphs is studied in psychophysics, cognitive psychology, and computational vision.