Professional Documents
Culture Documents
(10 Marks) : Sample Answer
(10 Marks) : Sample Answer
Please write down what you feel Principle: The overall design is based on a metaphor of a DNA molecule that consists of two
wrong with these designs. twisting helices. Likewise, our design visually represents two conflicting sides. 3. The Securities and Futures Commission of Hong Kong has built a database to store all the
news articles related to finance and stock. They once contacted us to develop a visualization
(a) [5 marks] Community strands: By default, we polarize the community strand between two sentiment poles system which can show the sentiments of different groups of people towards a finance policy
(e.g., positive and negative) and interpolate the data samples at different timestamps along a and whether the sentiments change over time. For example, from the following articles, we
horizontal timeline. Thus, a smooth curving strand is created. In our design, we encode the know that Meng Xiaoming is negative on the first policy but is positive on second policy.
sentiment information on each community strand to enhance the visual patterns driven by
sentiments. First, the sentiment information is quantitatively represented as the screen distance --------------------
between a community strand and a sentiment pole. Leaning toward one sentiment pole indicates
that most of the people in the community share that sentiment. Second, the sentiment information
Article 1: Financial industry in Hong Kong and China slams government debt swap
is also represented by a color gradient from green to red, pertaining to two sentiment poles. The programme
community size implies its influence and is encoded as the thickness of the strand. With this ….
encoding scheme, we are able to identify whether the sentiment change of a community is caused
by thousands of people or only by tens of people. “The government’s manner of enforcing the debt swap has been very crude,” said Meng
Xiaoning, chief executive of TF Securities in Hong Kong.
Event box: Inside the event box, keywords are represented as small bars. The size and color of the
bar encode the normalized frequency and the sentiment of the keyword, respectively. Within
different time windows, users may discuss the topics using different keywords, thus resulting in Article 2: China's watchdogs step in to avert Sinosteel bond default
various events under the same topic. By default, an event box’s size varies based on the distance
between two community strands. This design spontaneously assigns more space and forms a multi - Mainland regulators make rare intervention in bond market, with the company's debt holders
foc view for displaying the details when divergence is large. told to redeem their notes a month later
User group: A user group is visualized as a circle embedded within the community strand. The …
users within the group are represented as dots whose sizes and colors represent the users’ Meng Xiaoning, the chief executive of TF Securities, said he was cautiously optimistic on the
normalized activeness and their sentiments, respectively.
Ignoring conventions. At first glance, it looks like gun deaths are on the decline in Florida. drive to internationalise the country's bond market.
But a closer look shows that the y-axis is upside-down, with zero at the top and the maximum …
y-axis upside down, needs to show the full scale (Tufte design principle) The design can support the tasks like comparison between two group of opinions clearly: 5 marks value at the bottom. As gun deaths increase, the line slopes downward, violating a well-
Follow design principles & intuitive design: 5 marks established convention that y-values increase as we move up the page. -----------------
Good scalability: 3 marks
(b) [5 marks] Design rationale: 2 marks Please design visualizations:
Edge-bundling is widely used to reduce visual clutter in graphs. The following figures show the
The following figure shows the paths that Napoleon’s troops move to and retreat from
original graph and three different edge-bundling algorithms. You do not need to know the details Moscow: • Given a policy, visualize the sentiments of a group people toward this policy and whether
of these algorithms for answering this question. they change over time
• Given a person, visualize the sentiments of this person towards a set of policies and
whether they change over time
• Your system also needs to show the keyword summary of the newspapers related to the
sentiments like the one above with the request of users
Sample Answer
There are mainly four factors needed to consider in the visualization: policies, people,
sentiments, time. For the detail information in the third requirements, we can display them
during interaction.
1. One policy, group of people
unjustified 3D design
For a given policy, the sentiments changes of a group of people can be visualized as a theme
river.
4. Problem Solving [10 marks]
We can categorize the sentiment into positive, neutral and negative. And use the width of the
(a) The original graph with 1715 nodes and 9780 edges showing the immigration among different
Multi-dimension data is very common in many applications. Here is a sample dataset with five
states in the USA; (b) The edges are bundled using FDEB with inverse-linear model; (c) The edges
river to encode the strength / number of the sentiments.
attributes.
are bundled with GBEB; (d) The edges are bundled using FDEB with inverse-quadratic model.
A. Based on your comparing Fig. (a) and Fig. (b)(c)(d), in your opinion, what are the advantages Suppose we also know the exact date (year/month) for each dot in the map and the number of
and disadvantages of edge bundling? [5 marks]
soldiers at that date.
Advantage: reduce visual clutter and show the overall pattern (2.5 marks) Please design a visualization to encode the above information (i.e., time and number of
Disadvantage: lose detailed information (2.5 marks) soldiers).
B. You are asked to design a controlled experiment to evaluate the three edge-bundling
algorithms. We hope the evaluation should be quantitative and as rigorous as possible. Please write Answer 1: (10 marks)
down your detailed plan to conduct the evaluation. [15 marks]
Please refer to: https://robots.thoughtbot.com/analyzing-minards-visualization-of-napoleons-
Basic Info: with-in subject design, recruit a group of participants (say 20) (3 marks) 1812-march
Task: Track how many destination points an edge-bundle can split into (5 marks)
2. One person, set of policies
Dataset: 5 different real-world graph (as synthetic graphs may have a uniform trend for each edge
Similar river-based visualization can be used. We can visualize each policy as a river, then
direction)
Technique: 3 different edge bundling algorithms with the default parameters encode the sentiment with color.
Independent variables: Edge bundling techniques & datasets An advanced setting is multiple policies and groups of people. The river based visualization
Controlled variables: 1920*1080 resolution display, same keyboard & mouse
(5 marks for above design setting) can be extended to fulfill this requirement.
(a) Please design a visualization to show the data. [5 marks] For example, the following Opinion Flow is a good design.
Dependent variables: the number of destination points & response time
Multi-dimensional data visualization techniques such as parallel coordinates, scatter plot matrix Analysis: use the original graph as the baseline, using statistical methods such as ANOVA to
(b) From your visualization, what kind of tasks can be performed? [5 marks] check whether there are significant differences between different techniques
(2 marks)
1. Clustering Please analyze the color scheme used in the PG defense demo:
2. Outlier detection http://vis.cse.ust.hk/pqeDefenceVis/
3. Positive / negative relation between different attributes
...
(5 marks for two or above reasonable tasks) Other answers sampled from the submitted exercises.
3. Problem Solving [10 marks]
Answer 2: (10 marks)
In the class, the professor mentioned that visualization is also some kind of transl ation, i.e.,
Use the width of the trajectory to encode the number of soldiers.
translating data to visual forms. It should obey the same rules of language translations (e.g.,
translating English to Chinese). Do you agree or disagree with his metaphor? Specifically, please Use a sequential color scheme to encode the time
write down the similarity and difference between language translation (e.g., English -> Chinese)
and visual encoding (i.e., Data -> Visual Form). Answer 3: (6 marks) (Geo-Time)
Please first describe how you want to design the system, especially the principles to follow, and
then sketch some key visualization schemes in your system.
Please design a controlled user study to evaluate these three methods. You need to provide
the scheme of the user study (within- or between-subjects), the number of subjects you plan
Cumulative graphs. We can’t tell much from this graph. It’s moving up and to the right, so to invite and their background, the tasks you give, the data you collect, the hypothesis you
things must be going well! But the non-cumulative graph paints a different picture: want to verify, and how you plan to analyze the data.
Reference answer
The advantage of this visualization is that you can more easily detect patterns that involve 1) Subjects: about 20 subjects; 10 with knowledge in visualization, and 10 with little
knowledge in visual analysis. Their genders and ages should be uniformly distributed.
dimensions higher than two (which is difficult in parallel coordinates or scatterplot matrix).
Its disadvantage is that the visualization will become overwhelming when the dataset is too 2) Example Analytical tasks:
large. a) Identify the most popular state that people immigrate in and out;
b) For a given state, identify the state that most people migrate to/from;
3) Data to collect: for each participant, record (i) the correctness of the performed tasks. (ii)
See https://link.springer.com/chapter/10.1007%2F978-3-540-33037-0_8 for more the time used to perform each task.
information (you should be able to access this resource on campus) 4) Analysis of test data:
a) calculate statistics (average, standard deviation, etc.) of the performance data to
evaluate effectiveness, efficiency, etc.;
Now things are a lot clearer. Revenues have been declining for the past ten years! If we b) visualize the performance data (boxplot, bar chart, etc.) to more intuitively see the
scrutinize the cumulative graph, it’s possible to tell that the slope is decreasing as time goes dominance and stability of the two designs;
on, indicating shrinking revenue. However, it’s not immediately obvious, and the graph is
c) perform hypothesis test (suppose A is better than B on task and calculate p-value).
incredibly misleading.