Professional Documents
Culture Documents
Big Data Visualization An Empirical Study Highlighting Techniques Tools Future Research Challenges and Issues
Big Data Visualization An Empirical Study Highlighting Techniques Tools Future Research Challenges and Issues
x, 2022 1
Olfa Arfaoui
Computer Department, University Of Carthage, Tunisia
E-mail: olfa.arfaoui@—–
Abstract: Data is growing every day at a faster rate in practically every field. Indeed,
the traditional process of managing and processing ”classic” data cannot analyze these
massive volumes. Hence, the implementation of a Big Data Analytics system is therefore
essential to be able to take advantage of all these voluminous datasets and this ”gold
mine” of information. Therefore, with these new opportunities also appear new issues of
processing very high data volumes which continue to push companies to call on solutions
specialized in Big Data. Data need to be analyzed to make decision, and the data
visualization could be a way to lead to this goal. In this paper, we discusses about the
Big Data Visualization project, its characteristics, benefits, issues and we overview some
popular tools for beginners as well as experienced users.
arrive in the Big Data systems every second. Modern Big representations lead to decisions. Visuals are not
Data systems collect inherently complex data streams effective without context.
due to the 3 basic Vs (Gorodetsky 2014) which are But the solution is fairly simple: let the tools and
Volume, Velocity and Variety and to which added people do their work. As long as you use the right tools,
Veracity, Validity, Vulnerability, Volatility, Visualization and as long as the people doing the data analysis know
and Value and consequently give rise to the 10Vs where the data came from, who can consume it, and
(Manogaran et al. 2017) of Big Data. know how it will be processed and translated, the data
The well-designed Big Data systems must be able visualization will be on a much clearer path toward
to deal with all 10Vs effectively by creating a balance driving those big decisions.
between data processing objectives and cost (i.e., Every day it is discovered how important data
computational, financial, programming efforts) in Big visualization is in the field of business information. As
Data systems. Data collection and storage capabilities high-performance analytics tools that provide better
have enabled researchers in diverse domains to observe ways to analyze data faster than ever before, they have
and collect huge amount of data. However, large data the ability to not only provide meaningful data, but
sets present substantial challenges to existing data understand how to process it that ensures the company
analysis tools. stays competitive.
We will focus in our paper on one of the most In fact, Visualization is critical in today’s world.
important Big Data’ Vs which is data Visualization. Big data is difficult to visualize. Due to in-memory
Companies surrounded by a virtual environment technology limitations and low scalability (scaling
of data. Information of great value is constantly up), functionalities and development time response,
flowing, both internally and externally, so at least many visualization tools, current Big Data are faced with
companies are aware of this. But to show the value more technical challenges. Traditional graphs cannot be relied
clearly there has to be a way to collect the data and make upon to attempt to plot a billion data points. We
it understandable and meaningful. Through business therefore need different ways of representing data. If we
intelligence solutions, the company collects, organizes, take into account the multitude of variables resulting
analyzes and transforms data into actionable insights. from the variety and speed of Big Data and the complex
These strategies transform raw data into decisions, which relationships that unite, we can see that developing a
allow businesses to operate efficiently and competitively. visualization significant is not so easy.
But to turn that raw data into such decisions, it has to Using charts and graphs to visualize large amounts
be processed in order for the data to be understandable. of complex data is much more effective in conveying
Data visualization is one of the most important and meaning than spreadsheets and reports chock-full of
valuable tools for understanding business information. numbers and formulas. Current Big Data visualization
That’s why, one picture is worth a thousand words. tools face technical challenges due to limitations of in
Humans have been visually displaying data for hundreds memory technology and poor scalability, functionality,
of years. From maps to charts to graphs, we’ve taken and response time. You can not rely on traditional
data, arranged it, and formatted it, in this way it tells a graphs when trying to plot a billion data points, so
better and deeper story than it might alone. you need different ways of representing data such as
With the boom in technology, the data boom. data clustering or using tree maps, sunbursts, parallel
And this same technology has allowed us to process coordinates, circular network diagrams, or cone trees.
increasingly larger amounts of data at an ever-increasing If one combines this with the multitude of variables
speed. Trends, patterns, and other insights may not be resulting from Big Data’s variety and velocity and the
easily visible in the first text format, which is quickly complex relationships between them, and you can see
caught using data visualization software. that developing a meaningful visualization is not an easy
Once reports and dashboards replace them, the most matter (Khalid et al. 2021).
powerful approach becomes visual data displays because
they can convey large amounts of information in small
spaces. It can take a person hours, days, and weeks to 2 The DataViz and you: Presentations
delve into the long data sets and visual presentations
that allow for fast and efficient translation. 2.1 Definitions’Anthology
Thanks to the advanced technology, many data
visualization tools allow for interactive functions. This Many data scientits define data visualization in different
flexibility provides the ability to switch and change ways. Indeed, they agree that it is indeed a visual form
quickly, which helps the user to discover and learn about to visualize and this by facilitating access to it.
alternative viewpoints. This comprehensive, interactive • Visual form: it is a question of representing the
presentation can rarely be achieved quickly by processing data in a visual way, graphic;
raw data without visualization software.
The quantitative factor is a common and common • Facilitate access: The graphic representation is not
challenge facing information business. It takes a free, it is for the service objectives, the first of
good understanding of data to know that visual which is to provide greater access easily to the
Big Data visualization and you : An empirical study. 3
information conveyed by the data. It is indeed optimal analysis and to consider using all the data
easier and more pleasant to consider a graph than at your disposal. This allows you to cross-reference
a series of figures. information and thus bring out a more complete
analysis to better support your digital marketing
Another data experts’group agree that data visualization department(Rodrigues Jr et al. 2003).
is meaningless if it does not encompass understanding,
exploit and decision-making, speed and information
sharing. 2.2.3 Be innovative
2.2.2 Be relevant
Figure 1: Charles Joseph Minard’figurative Map.
At a time when Big Data is a central issue for
companies, several techniques to process this mass
of information in a relevant way must be put in Shortly after, nurse Florence Nightingale had the
place. idea of using graphic representation to allow her
Relevance is linked to interpretability. The data reader to compare facts with complex correlations
visualization must make it possible to answer (Magnello 2012).
questions in a defined context and aimed at specific She presented, for the attention of Queen Victoria,
objectives (Deller et al. 2007). the main causes of death of British soldiers engaged
Data sources must be reliable. Data integrity is in the Crimean War. His graphic support, dated
the basis for meaningful data visualization. You 1858, allowed him to eloquently highlight that
must ensure that your information is correct and epidemics were much more devastating on the
up to date. It is necessary to sort the data for workforce than the injuries suffered in combat.
4
This grammar was laid down by Jacques Bertin Finally, we cannot evoke contemporary thinkers
who in 1967 developed the real bases of graphic of Data Visualization without citing Ben
language. Shneiderman and Stephen Few.
One of the first uses of data visualization is The Dataviz takes the information a person
to contribute to more effective management of needs and presents it in a way that it is easily
activity and performance, oriented towards action. understandable.
6
Infographics are a mix between Dataviz, When I reflect for a few moments on the term
journalism and marketing. They use strategically “data”, I realize that it is particularly ambiguous
chosen data visualizations and lexicon to explain and vague. What does data represent in an
a complex story easily. organization? Figures, indicators, tables. . . It is
The confusion in terminology is understandable true that data is ubiquitous today and seems
however the terms are not interchangeable. Both obvious. However, they are very difficult to grasp.
turn data into easy-to-understand visualizations. The data that are thus available to professionals
These tools are extremely powerful when it comes to guide them in their decisions are increasingly
to explaining numbers in an educational way to numerous and multi-structured. But how not to be
people who are reluctant to analyze data. This is overwhelmed and make it a real tool for reflection
their only common point. Here is a definition for and decision-making? How to obtain answers to
both forms of presentations. fundamental questions whose answers are for the
Some distinction points between these two terms moment totally unknown?
can take place in Table 1. It is in the face of these requirements that
the dataviz takes on its full meaning. The
2.7 Main reasons for using DataViz representation of data in the form of images
makes it easier to understand them. There are
several definitions of dataviz in the academic
Three main reasons explain the use of data
and industrial state of the art. These different
visualization namely confirm or refuse hypotheses
definitions all converge on the fact that dataviz
on a market, educate and explore.
is a way to give meaning to data in order
– Confirm or refuse hypotheses on a market to extract information from it and therefore to
exploit it. Dataviz not only enables intellectual
∗ The DataViz can then take the form of
understanding, it transforms a set of raw data into
a dashboard, making it possible to make
actionable information. In addition, it accelerates
a decision while having a global vision of
the understanding, decision and action that we
the studied market.
have just mentioned. It is also a mode of
– Educate communication, allowing data not to remain
∗ Internally, companies use the DataViz for confined to the world of BI or statistics but
research work reporting or brainstorming to infuse the entire organization and become
sessions. a support for decision-making and collaborative
∗ It can be a good complement to creative work.
approaches such as ”gamification” The dataviz has many uses and leads to a
– Explore variety of benefits for the organization. First, it
contributes to a more effective management of
∗ This is the most futuristic aspect of activity and performance, oriented towards action.
data visualization, which certainly will This improvement in management is manifested by
develop. taking a step back in addition to other tools whose
∗ Dataviz can help build predictive models. horizon is in the shorter term. In other words, the
We are then in the field of data analysis dataviz can be used as a decision-making, strategic
tool, usable by a local manager to manage his
2.8 The DataViz: buzzword or real performance.
innovation? Another important use of dataviz is the reinvention
of customer service to improve its efficiency.
Over the past 3 years, there has been a very strong SFR, with the aim of improving the management
craze for dataviz. However, some companies and and understanding of their KPIs, uses dataviz
startups, who are thinking about turning their to identify causal relationships in their data
deluge of data into insights, are beginning to sources in order to find hidden patterns through
wonder if this is a buzzword commercial term or a their main sales channels. A good customer
real reality in companies. ? relationship is based on perfect knowledge of the
customer himself. What are their characteristics
One piece of information could, however, put us
and behaviors? How to segment and classify them?
on the alert in the answer to this question: the
The exploration capabilities in the data enabled
company Tableau Software raised $254m on the
by data visualization find their full meaning in
NASDAQ in May 2013 (Huang 2018). If behind
providing answers to these questions.
this company there was only a passing craze, I
think investors would have been hesitant to invest Another key point about dataviz is its ability
so much money. to foster innovation and its potential to get the
Big Data visualization and you : An empirical study. 7
business to consider new possibilities. In particular, visualization. These visual representations make
it is a testing ground for new modes of interaction it easier to understand raw data and thus help
with users. in decision-making. Big Data, is not just “more”
data. It is so much data, that is so mixed and
unstructured, and is accumulating so rapidly, that
3 The dataViz’s benefits traditional techniques and methodologies including
“normal” software do not really work (like Excel,
Crystal reports or similar).
In a context of ever-increasing and often highly
complex volumes of data, dataviz has many The DataViz makes it possible to make the most
advantages. comprehensible data important and what they
Data visualization is far from being an accessory mean, regardless of the audience concerned. Its
intended to embellish your website or your effectiveness is based on the fact that a majority
presentations. of us grasp and retain information better when
it is represented visually. The following image
In a synthetic way, we can say that the dataviz illustrates this fact, which was studied by an
improves: American psychologist.
– The data understanding;
By looking very quickly at this visual, we perceive
– The data communication; important information (the red dot) immediately,
– The decision-making; without no special effort on our part. Of course, for
this approach to be effective, the data visualization
– The ability to innovate.
must play on well thought out visual choices.
DataViz’benefits could be described more in the
figure 5.
With the Big Data advent, and the proliferation of The table shows us the values for each country, of
data sources, companies are increasingly using data course.
8
Lightness is involved in the technical resources – All the graphs do not make it possible to
required which are quite light. Example: present the same analyzes (distribution,
Recent technological developments, such as the evolution, decomposition. . . ). Hence the
development of json-type formats (DS.JS3), put importance for the designer to question
us in direct connection with data. Thanks to his intention.
these formats, we can recover varied data from 2. Who are we talking to?
all horizons, using standard applications. A real
human-data interface is thus being established. – Is he an expert or a layman? What should
he do with the information (e.g. retain
the information for later or make an
4.3 Success key factors immediate decision)?
3. In what context is the interlocutor?
To succeed in a data visualization project, it is
– The good reception of the graphic
necessary to bring together key success factors that
does not only depend on the graphic
can be classified into three categories.
itself, but also of the intellectual and
First of all, there are the classic good practices visual availability of the reader. All
of any project: ensuring preparation and planning, that the graphic designer can do is to
choosing the right scope, implementing the try to anticipate this greater or lesser
appropriate methodologies, etc. availability, in order to choose the most
suitable representation.
The second category is that a dataviz
project concerns data: their targeting, their
The difficulty increases when the data visualization
quality, respect for confidentiality and access
must address different audiences. It is then
authorizations, are therefore essential.
necessary to provide modes of representation
Finally, of course, ergonomics and graphic adapted to each of them. This is the case, for
intelligence play a key role in the acceptance of the example, of the Belgian FPS Economy, which
dataviz and its effective use (although, by the way, communicates with both the general public and
these aspects should be part of any IT project...) professionals.
10
Another good practice is to ”get your hands If certain good practices maximize the chances
dirty” on first restricted perimeters, if possible as of succeeding in your data visualization project,
controlled as possible. This allows you to move you can expect, as with any project, to encounter
forward, without too many risks, in trial and error difficulties:
mode until a first satisfactory solution is obtained.
1. The risk of overloading it with information
2. A General Management that does not
4.3.4 Ensure data quality at source
necessarily perceive the interest of data
visualization immediately
In Business Intelligence (BI), you reap what you
sow or, to put it more lapidary way: garbage 3. Skepticism about the performance of the tool
in, garbage out. In other words, if we want data 4. A pitfall to avoid: forget your classics : We
visualization to be able to communicate the right should not confuse simplicity with simplism
messages, to make it possible to make informed (Nielson 1995, Deng et al. 2005).
decisions, to explore unknown territories, there is
one condition to be met above all: to have quality
data entrance. In return, the DataViz improves the 4.5 The DataViz’s impact on the
quality of the data. First, because it compels a relationship between IT and business lines
certain discipline; then, because it also visualizes...
the non-quality of the data. Repeated outliers may Data visualization projects have the particularity,
appear at first glance as oddly placed dots, for as we saw previously, of offering great autonomy
example. to users. users. This is why, in this area, the
relationship between IT and the Professions are set
to evolve. It has happened that friction has arisen,
4.3.5 Focus on cooperation between several with the IT Department feeling deprived of some of
departments its prerogatives. These tensions have unfortunately
been maintained by some providers of dataviz
Another ingredient of success lies in the solutions by addressing only the Business Lines
cooperation between the actors of the project. without going through the IT Department to win
The DataViz thus contributes to breaking down contracts (Magee et al. 2016, Grant 2019).
the silos that may exist in the company and
Thus, we cannot speak of a loss of prerogatives of
contributes to greater cross-functionality.
the DSI or the BI teams. Simply, data visualization
raises new questions about how to represent data,
4.3.6 Train the teams about the distribution of roles and responsibilities
between the business lines and the IT department,
Data visualization does not really require side about how to conduct projects (Graessley et al.
training users. On the contrary, we can say that it 2019, Villars et al. n.d.).
is fully successful when it is immediately adopted. CIOs understand this. Even those who were
For this, simplicity and intuitiveness are essential. initially reluctant are realizing that data
But offering a simple rendering can be extremely visualization is not a threat, and are softening
complicated. This is why the training of those who their stance (Santolalla 2020, Howard 2013).
produce the visualizations is an undeniable plus.
In truth, data visualization is a chance for CIOs
and BI teams. On the one hand, it will relieve them
4.3.7 Using aesthetics as a lever for of time-consuming tasks, allowing them to focus
appropriating information on their missions with higher added value. On the
other hand, they even have a unique opportunity
Data visualization cannot be reduced to a to invent a new form of BI and relationship with
representation aesthetics of data. One can make business (CADENAZZI 2020).
pretty representations that are perfectly useless. What we can remember is that data visualization,
But that’s not to say that aesthetics don’t play even if it provides great autonomy to the
a role. Used well, it is an essential criterion business lines, is a question that should interest
of efficiency for the dataviz. In this concern to the IT department and the BI teams, quite
combine aesthetics and efficiency, companies have simply because it touches the data. Autonomy of
every interest in being imaginative and going businesses is useful if it is implemented in a smart
beyond traditional Excel-type charts, if that makes way and if it allows them to obtain even more value
sense. from the CIO and the BI (Schaeffer et al. 2017).
Big Data visualization and you : An empirical study. 11
Presumably, by accustoming the Professions to You can improve your Excel experience with
speaking the language of data, thanks to graphics, add-ins. These latter allow users to extend the
the dataviz will contribute to the taking of functionality of Microsoft Excel and help save their
awareness of the value of data. It can therefore play time and effort (Ali et al. 2016). Excel add-ins work
a role unifier, at the service of business creation like apps that you download or buy for your mobile
(Chatfield & Reddick 2018). phone or computer. They are mini software tools
that you can install in Microsoft Excel and add
many features such as shortcuts, tasks and time
saving options that you cannot find in a standalone
5 which tools for which data Excel application.
visualization?
Third parties create Excel add-ins to provide Excel
users with extended functionality and save their
With the advent of the Big Data era come new time and effort. Developing these add-ins requires
challenges for Information Visualization. First, coding expertise in languages such as XML and
the amount of data to be visualized exceeds the VBA and providing an easy-to-use interface that
available screen space. Second, the data cannot be complements Excel.
stored and processed on a conventional computer.
Table 2 overviews some Excel add-ins for dataviz.
To alleviate both of these problems, a Big Data
visualization system must provide perceptual and
performance scalability. Libre Office. “A picture says a thousand words.”
These days, data comes in a variety of forms,
In this section, we will focus on some data but cannot be interpreted easily most times. At
visualization tools for beginners as well as for this point, data visualisation becomes more and
experienced users. more important, as the human mind captures and
interprets data more easily through vision than
5.1 Tools accessible to beginners any other sense. Although spreadsheet applications
provide numerous ways to process data and adapt
it to the user’s needs, it is often easier to just look
tools for beginners are available to allow them to at a diagram to see trends or get a general overview
create dataviz without resorting to programming of a particular dataset.
or its basis and no expertise is required.
Libre Office is a free and open-source office suite,
derived from the OpenOffice.org project, created
5.1.1 office software and extensions and managed by The Document Foundation.
LibreOffice is notably supported by the Free
Excel. spreadsheets aren’t as popular as they Software Foundation and brings together a large
were ten years ago. While still great for entering part of the former “OpenOffice.org community”. It
and calculating data, all those cells and formulas supplies extensions to make different tasks such as
can be cumbersome (Lee et al. 2019, Oike et al. visualization.
2019). Despite the volume of large data, Excel still Libre Office extensions are software plug-ins that
remains the reference in many companies and is you install as an extra to the standard LibreOffice
the only way to process data (Patel 2022). Most suite and which add additional functionality to the
customers approach companies with spreadsheets suite, either to a particular application (Writer,
full of data, and we use the same analytics process Calc, Impress, ), or to all applications. In figure 9
for many of us to tell their data story, visually. We a dataviz example with the Libre Office using the
collaborate with customers from data to visualized chart type.
product, but sometimes you don’t have the time to
hire a vendor to do the job.
Excel remains one of the basic tools for data
visualization. The maximum number of values in
a column is about 1,999,999,997 (Hiljazi & Curtis
2018).
Some examples of Libre Office extensions dedicated Online, Teams and Yammer. The Office suite
to data visualization can be presented in the table allows work in offline mode like a perpetual suite,
3. which distinguishes it from Office Online, which
is used from a Web browser. The principle of
5.1.2 online office suites Microsoft 365 is to be updated as new versions of
Office are released (Wilson 2014).
Google Drive. When choosing between data Office 365 provides also some integration apps to
visualization tools, one option worth considering visualize you data in an interpretable, innovative
is Google Sheets. Google’s spreadsheet application and relevant way. Table 4 describes some ones
can be used to generate charts, tables and even highlighting their main functionalities.
maps that can be embedded on a website. They’re
easy to make and can be configured to update
automatically (Dougherty & Ilyankou 2021). 5.1.4 Simple online tools
It’s not for every visualization need. Some projects
Tableau. Tableau Public is a free platform
require more complicated data visualization
for exploring, creating, and publicly sharing
techniques and more customization than what
data visualizations online. Anyone can create
Google Sheets provides. An example can be
visualizations with our web authoring platform or
described in figure 10.
Tableau Desktop Public Edition which is available
for free. Users with Tableau Desktop Professional
Edition can also publish to Tableau Public for
free (Kennedy & Allen 2016). With millions
of inspiring data visualizations to discover and
explore, Tableau Public makes it easy to develop
your own data skills and build a portfolio of work
online. Join the Tableau Public community where
you can grow and learn from each other while
bringing data into your daily life. More advantages
are presented in the Figure 11.
– It is an enterprise decision management platform suitable for IT, government, legal, and
marketing teams;
– It can be adapted to any type of business and easily mapped to existing processes and
Coras governance;
– It allows every team member to stay on top of their tasks with the Coras workload
visualization feature which can be in the form of a mind map, Kanban board, Gantt
timeline, or a range of other visual displays.
– It is an all-in-one dashboard app that helps users monitor & analyze data scattered across
Cyfe all of your online services like Google Analytics, Salesforce, Google Ads, MailChimp,
Facebook, Twitter, and more from one single location in real-time.
– It is an ad-hoc reporting and business intelligence solution which provides businesses with
the tools to design and deploy custom reports on business metrics;
– It offers features including a report and dashboard designer, online report deployment, a
DBxtra report scheduler, and an excel reporting service;
– With DBxtra even non-technical users can generate interactive business intelligence reports
and dashboards, and deploy them across the web.
– It helps business people make faster, better business decisions, empowering them with
self-service tools to explore data and share insights in minutes;
MicroStrategy
Analytics – Simple drag-and-drop tools are paired with intuitive visualizations;
– Quick connections to any data source are combined with one-click sharing of any insight.
Plot.ly. Is knwons as ”A must know tool for Chartblocks. This is an online charting software.
building visualizations”. Plotly is an open source It helps to create basic charts quite quickly and to
visualization library for data visualization and import more data from different external sources.
analysis. Plotly provides many products including With this tool, you will have the possibility to
Dash, Chart Studio, a Python framework, R and export your visualization in SVG or PNG format
recently JULIA for building fast, easy and powerful and also add it to your website and then share it
analytical applications. It is a bookstore use for on social media platforms (Fahad & Yahya 2018).
data visualization. he takes in supports various
graphs such as scientific graphs, 3D graphs, charts
Periscope Data. Is a powerful platform
statistics. It gives the hand to draw several types
dedicated to Data-Analysis. It can gather all the
of graph such as 3D graphs, histograms, easy to
data of your company and create reports. With
use and handle, totally free and very interactive
this tool, you can easily convert your figures
and flexible. More advantages are presented in the
into an easy-to-understand graph or report. It is
Figure 12.
powerful, but quite expensive to buy (Pattanaik
& Wiegand 2021).
Periscope Data enables analysts to turn their
SQL queries into interactive dashboards, charts,
and reports for data consumers with frequent
data needs. Periscope Data’s breakthrough data
warehouse infrastructure quickly connects to your
databases to deliver incredibly fast, low-cost
query processing. Unlimited users and no query
limits remove workflow barriers and promote data
literacy across your organization (Richardson et al.
2020).
Piktochart. Piktochart is a web-based graphic making the visual representation of complex data
design tool and infographic maker. With it, you can easy for everyone. Primarily conceived as a tool
create bar charts, maps, line graphs, scatter plot, for designers and vis geeks, RAWGraphs aims
and more. Create interactive data visualizations at providing a missing link between spreadsheet
with an easy-to-use dashboard.When creating a applications (e.g. Microsoft Excel, Apple Numbers,
new file, users can start from a blank sheet or OpenRefine) and vector graphics editors (e.g.
choose from one of the 600 templates offered, Adobe Illustrator, Inkscape, Sketch) (Mauri et al.
arranged in a multitude of categories (Peddoju & 2017). Based on the svg format, visualizations can
Upadhyay 2020). be easily edited with vector graphics applications
The editor is intuitive and very easy to use. It for further refinements, or directly embedded into
is possible to add photos, videos, or icons to a web pages.
document via a simple ”drag and drop”. The fonts
and colors used are also modifiable. The tool also Wordle. Wordle is a tool for editing “word
allows you to synchronize with Google Spreadsheet clouds” based on the Wordle. The initial word
or SurveyMonkey to retrieve data and thus create cloud can be generated from the input text or
interactive graphs or tables. read from an existing one. You can re-font, re-
Another interesting feature is the ability to protect colore, resize, move, rotate, add and delete words
your work with a password if it involves sensitive to create custom visualizations (Viegas et al.
files for clients or colleagues. A formula dedicated 2009). Wordle’s main benefit is that it allows
to teams makes it possible to collaborate within a neighborhood-preserving editing process, which
the same platform and to have access to the keeps words at predictable and close locations
comments of the members involved. A very good during and after the editing process. Like Wordle,
tool for those looking for a simple and accessible the images you create with Wordle are yours to use
solution to create all types of presentations and however you like. You can save them to your own
communication media. desktop to use as you wish. An illustrative example
made with wordle for the DataViz context can be
infogram. Infogram is a web-based data presented in the figure 16.
visualization and infographics platform, created in
Riga, Latvia. It allows people to create and share
digital charts, infographics and maps.
Infogram helps create infographics, reports, and
maps for your organization. Infographics from
Infogram help you tell a compelling data-driven
story and present your content with easy to
understand data that grabs the viewers attention.
Infogram operates as a data visualization tool with
the goal of making your data easy to understand,
discovering unknown facts/outliers/trends, Figure 16: DataViz with Wordle.
visualize relationship patterns, and ask better
questions. More advantages are presented in the
Figure 15. Easel.ly. Easel.ly is where people can visualize
information easily through quick infographic
creation and data visualization. No design
background required! Easel.ly is an infographic
design tool and can create any visualized content
as any kind of information. Easel.ly provides many
templates, themes and any objects to edit your
detailed information in their designs (Weiner &
Lorber 2021).
– Best for teams and developers looking for basic charting requirements and an open-sourced
Charts.js product.
FusionCharts – It is best for web and enterprise application charting and data visualization requirements.
Pts.js – Best for composing objects as you perceive them with a basic level of abstraction as points.
Raphael.js – Best for creating detailed drawings and graphics with very few lines of code.
– Best for creating powerful user interface animation with support for all major modern
Anime.js browsers.
ReCharts – Best for teams looking to create charts for React-based web applications.
– Best for building advanced charts primarily for web-based Forex and stock-trading
TradingVue.js applications.
– Best for teams looking for an extensive charting library for supporting multiple platforms
HighCharts like web and mobile.
– Best for creating basic charts across multiple programming language libraries like Python,
ChartKick Ruby, JS, etc.
Pixi.js – Best for teams looking for JavaScript libraries to create digital content based on HTML5.
ZDog – Best for open-sourced doesn’t give creating and rendering 3-D images for canvas and SVG.
for modern web and mobile browsers. The Table interesting reports like this one (Serik et al. 2021).
5 including the top 15 libraries dedicated to Google Data Studio is essentially a streamlined
visualization will be able to describe them better. version of data visualization tools like Tableau and
Clickview. While you won’t have access to quite as
5.2.2 Dashboard builders many features or coding capabilities, the platform
is free and pretty easy to use.
Google Data Studio. Is an online data
visualization tool that helps users convert data into Unlike platforms like Google Analytics or
informative reports and interactive dashboards. HubSpot, Data Studio is not a data source.
It is a powerful tool to help marketers and business It doesn’t collect the data; rather, it combines
owners use their data effectively by creating data from different sources, analyzes it, and then
20
lets you create interactive reports, charts, and such as Google Chrome or Mozilla Firefox. Sharing
dashboards. documents becomes more flexible, thanks to the
recognition of CSL format and text files.
Toucan Toco. Is a cloud-based data
visualization tool. Intended for non-technical
business executives, the objective of this highly
configurable data visualization solution is to
6 DataViz’ issues
provide essential information and data for decision-
making. With the application studio integrated A good visualization illustrates the data so viewers
into the Toucan Toco solution, any structure can quickly extract meaning. One of the most
can create and display personalized BI (business common data visualization mistakes is including
intelligence) applications (Arruabarrena 2017). too much information. This makes it difficult
These applications can then be deployed on for viewers to formulate takeaways. Likewise,
different media and devices, and each application visualizations suffer when designers include too
can itself be integrated into a dashboard, a many visual effects.
graph, a PDF document. . . Connectable to other
applications, including those used in the daily
routine of companies , Toucan Toco also develops 6.1 Why are most visualization designs
APIs that allow it to integrate with other IT ineffective?
solutions, such as Cognos Analytics and Salesforce,
for example. To retrieve the data to be used and Data visualizations are often ineffective because
then displayed, this advanced reporting tool is they are designed for the wrong audience in mind.
thus able to connect to more than a hundred The perceived value of dashboards is lost due to
applications: Excel, Google Analytics, Microsoft poor communication with end users. The data
SQL ServerIn short, according to Charles Miglietti visualization design process begins with learning
one of the founding members, of a kind of the audience who will use the dashboard.
analytical application WordPress.
Data Hero. Is a data visualization software to 6.2 What mistakes should be avoided in data
grow your business. DataHero is a cloud computing visualization?
business intelligence software platform specializing
in data visualization and dashboards. Visualize Common errors include duplicate data, missed
and analyze data from all your cloud services. data, unmarked NA values, etc. For example, in
DataHero is the fastest and easiest way to get this pie chart, the three sectors of the pie chart
insights from your data. Create charts, reports, and add up to 193%, which makes no sense. Such errors
dashboards from your business data that you can in the data would render your final visualizations
easily share with teams and clients (Pedersen & useless.
Bossen 2021).
Looker. Is a cloud platform dedicated to data 6.3 Does data visualization require coding?
visualization. It belongs to Google, and is its
analytics and business intelligence service. Its use You don’t need to write any code to easily create
offers a better exploitation of professional resources an interactive data visualization. When it comes
for different sectors of activity. As a business to presenting data, spreadsheets and text-heavy
intelligence platform, Looker integrates several reports aren’t enough to explain what we found.
features, including many options dedicated to data This is when we need data visualization to present
visualization. This includes the use of reporting data in a way that helps everyone grasp difficult
tools, the creation of dashboards, multicloud concepts.
storage, not to mention the customization of
the database. Coders have the ability to use
the LookML language to program visualization 6.4 Why is misleading data bad?
parameters.
For professional use, Looker has many advantages. If there is too much data presented or irrelevant,
It is a cloud platform that does not require the audience may not see the relevant information.
the installation of any software or application. The more data displayed at once, the more difficult
The maintenance of the system is thus greatly it becomes to detect specific trends. Misleading
facilitated. In addition, its accessibility is with too much data is often used to mislead the
optimized for the vast majority of web browsers, public from small but relevant information.
Big Data visualization and you : An empirical study. 21
6.5 Can the data be misleading? Amer, A. M. & El-Hadi, M. M. (2019), ‘Tableau big
data visualization tool in the higher education
The data can be misleading due to the sampling institutions for sustainable development goals’,
method used to obtain the data. For example, the International Journal of Computer Science and
size and type of sample used in any statistic plays Mobile Computing (IJCSMC) .
an important role – many polls and questionnaires
Arruabarrena, B. (2017), ‘L’expert en dataviz, un
target certain audiences who provide specific
métier en transition’, I2D-Information, donnees
responses, resulting in small and biased sample
documents 54(3), 7–8.
sizes.
Atwood, T. P. & Reznik-Zellen, R. (2018), ‘Using
6.6 7 ways to spot bad data the visualization software evaluation rubric
to explore six freely available visualization
– Speeding. applications’, Journal of eScience Librarianship
– Open ends meaningless. 7(1).
– Choose all options for a screening question. Basole, R. C., Bellamy, M. A. & Park, H. (2017),
– Failed quality control questions. ‘Visualization of innovation in global supply
– Inconsistent numeric values. chain networks’, Decision Sciences 48(2), 288–
306.
– Straight line and patterns.
– Logically inconsistent answers. Brasseur, L. (2005), ‘Florence nightingale’s visual
rhetoric in the rose diagrams’, Technical
Communication Quarterly 14(2), 161–182.
7 Conclusion Burnett, C., Merchant, G. & Guest, I. (2021),
‘Destabilising data: The use of creative
Data visualization is a very important task data visualisation to generate professional
nowadays for the data scientist. the main reason dialogue’, British Educational Research Journal
for recourse is decision-making. An interpretable, 47(1), 105–127.
relevant and innovative visualization can lead
to a right decision for a company knowing that CADENAZZI, M. (2020), ‘Performance
this decision could be radical. Conventional measurement in non-profit cultural
visualization techniques cannot handle the organisations. fondazione brescia musei case
enormous volume, variety and velocity of data. study’.
To do this, several tools have emerged and are Cardona, J. A. S. & Garcia, D. A. A. (2017),
constantly evolving. ‘Evaluación y selección de herramientas de
Thus, among other things, modeling for big data analı́tica visual para su implementación en
is a valuable issue these days. In fact, data una institución de educación superior’, Revista
modeling is a process that enables organizations to IngEam 4(1), 1–20.
discover, design, visualize, standardize and deploy
high-quality data assets through an intuitive Chatfield, A. T. & Reddick, C. G. (2018),
graphical interface. Now, A proper data model ‘Customer agility and responsiveness through
serves as a blueprint for designing and deploying big data analytics for public value creation: A
databases that leverage higher-quality data sources case study of houston 311 on-demand services’,
to improve application development and make Government Information Quarterly 35(2), 336–
better decisions (Ribeiro et al. 2015, Patel 2019, 347.
Zais et al. n.d.). So,we will be interested in the Big
David, M. (2020), ‘How to design dashboard’.
Data modeling systems.
Deller, M., Ebert, A., Bender, M., Agne, S. &
Barthel, H. (2007), Preattentive visualization
References of information relevance, in ‘Proceedings of
the international workshop on Human-centered
Ahn, Y.-Y. Y. (2019), ‘Data visualization’. multimedia’, pp. 47–56.
Ali, S. M., Gupta, N., Nayak, G. K. & Lenka, Deng, F., Zhang, Z., Zhang, J. & Zhang, D.
R. K. (2016), Big data visualization: Tools (2005), Building extraction from multiple images
and challenges, in ‘2016 2nd International and lidar data, in ‘MIPPR 2005: SAR and
Conference on Contemporary Computing and Multispectral Image Processing’, Vol. 6043,
Informatics (IC3I)’, pp. 656–660. pp. 515–520.
22
Doshi, J., Goradia, A. & Mistry, D. (2014), Islam, M. & Jin, S. (2019), An overview of data
‘A review of google data visualization tools’, visualization, in ‘2019 International Conference
International Journal of Current Engineering on Information Science and Communications
and Technology 4(5), 3134–3138. Technologies (ICISCT)’, pp. 1–7.
Dougherty, J. & Ilyankou, I. (2021), Hands-On Jung, S., Xiao, R., Buruk, O. & Hamari, J. (2021),
Data Visualization. Designing gaming wearables: From participatory
design to concept creation, in ‘Proceedings of the
Fahad, S. A. & Yahya, A. E. (2018), Big data Fifteenth International Conference on Tangible,
visualization: allotting by r and python with Embedded, and Embodied Interaction’, pp. 1–
gui tools, in ‘2018 International Conference on 14.
Smart Computing and Electronic Enterprise
(ICSCEE)’, pp. 1–8. Kennedy, H. & Allen, W. (2016), ‘Data
visualisation as an emerging tool for online
Forkan, A. R. M., Kimm, G., Morshed, A., research’, The Sage handbook of online research
Jayaraman, P. P., Banerjee, A. & Huang, W. methods pp. 307–326.
(2019), Aqvision: A tool for air quality data
visualisation and pollution-free route tracking Khalid, Z. M., Zeebaree, S. R. et al. (2021), ‘Big
for smart city, in ‘2019 23rd International data analysis for data visualization: A review’,
Conference in Information Visualization–Part International Journal of Science and Business
II’, pp. 47–51. 5(2), 64–75.
Galletta, A., Carnevale, L., Bramanti, A. & Khan, S. (2021), ‘Data visualization to explore
Fazio, M. (2018), ‘An innovative methodology the countries dataset for pattern creation.’,
for big data visualization for telemedicine’, International Journal of Online & Biomedical
IEEE Transactions on Industrial Informatics Engineering 17(13).
15(1), 490–497. Kirk, A. (2012), Data Visualization: a successful
design process.
Gorodetsky, V. (2014), Big data: opportunities,
challenges and solutions, in ‘International Kirk, A. (2016), Data visualisation: A handbook for
Conference on Information and Communication data driven design.
Technologies in Education, Research, and
Industrial Applications’, Springer, pp. 3–22. Lee, L., Shifflett, E. & Downen, T. (2019),
‘Teaching excel shortcuts: A visualization and
Graessley, S., Suler, P., Kliestik, T. & Kicova, game-based approach’, Journal of Accounting
E. (2019), ‘Industrial big data analytics for Education 48, 22–32.
cognitive internet of things: wireless sensor
networks, smart computing algorithms, and Lensen, A., Xue, B. & Zhang, M. (2020),
machine learning techniques’, Analysis and ‘Genetic programming for evolving a front of
Metaphysics 18, 23–29. interpretable models for data visualization’,
IEEE transactions on cybernetics 51(11), 5468–
Grant, R. (2019), ‘Pretty persuasion: The 5482.
advantages of data visualisation’, Impact
2019(2), 19–23. Leung, C. K., Wen, Y., Zhao, C., Zheng, H.,
Jiang, F. & Cuzzocrea, A. (2021), A visual
Hiljazi, S. & Curtis, T. (2018), ‘Developing data science solution for visualization and
an introductory class in business intelligence visual analytics of big sequential data, in
(bi) using ms excel powerpivot.’, Association ‘2021 25th International Conference Information
Supporting Computer Users in Education . Visualisation (IV)’, pp. 229–234.
Holbrook, J. B. (2019), ‘Open science, open access, Magee, B., Sammon, D., Nagle, T. &
and the democratization of knowledge’, Issues in O’Raghallaigh, P. (2016), ‘Introducing data
Science and Technology 35(3), 26–28. driven practices into sales environments:
examining the impact of data visualisation on
Howard, R. (2013), ‘Big data hype cut down to user engagement and sales results’, Journal of
size’, Government News 33(5), 26–27. Decision systems 25(sup1), 313–328.
Huang, M. (2018), Bridging the Gap Between Magnello, M. E. (2012), ‘Victorian statistical
Silicon Valley and the Capital Beltway: Lessons graphics and the iconography of florence
Learned from the US Government’s Venture nightingale’s polar area graph’, BSHM Bulletin:
Capital and Startup Engagements in Intelligence Journal of the British Society for the History of
and Defense, PhD thesis. Mathematics 27(1), 13–37.
Big Data visualization and you : An empirical study. 23
Manogaran, G., Lopez, D., Thota, C., Abbas, Peddoju, S. K. & Upadhyay, H. (2020), Evaluation
K. M., Pyne, S. & Sundarasekar, R. (2017), Big of iot data visualization tools and techniques, in
data analytics in healthcare internet of things, ‘Data visualization’, pp. 115–139.
in ‘Innovative healthcare systems for the 21st
century’, pp. 263–284. Pedersen, A. M. & Bossen, C. (2021), Data work
in healthcare: An ethnography of a bi unit,
Mauri, M., Elli, T., Caviglia, G., Uboldi, G. & Azzi, in ‘Infrahealth 2021-Proceedings of the 8th
M. (2017), Rawgraphs: a visualisation platform International Conference on Infrastructures in
to create open outputs, in ‘Proceedings of the Healthcare 2019’.
12th biannual conference on Italian SIGCHI
chapter’, pp. 1–5. Poola, I. (n.d.), ‘Innovate and differentiate your
(bi) analytics product with intelligent narratives
McCosker, A. & Wilken, R. (2014), ‘Rethinking and deeper context of your data’.
‘big data’as visual knowledge: the sublime and
the diagrammatic in data visualisation’, Visual Raineri, P. & Molinari, F. (2021), Innovation in
Studies 29(2), 155–164. data visualisation for public policy making, in
‘The Data Shake’, pp. 47–59.
Morita, T. (2011), ‘Reflections on the works of
jacques bertin: From sign theory to cartographic Ribeiro, A., Silva, A., da Silva, A. R. et al. (2015),
discourse’, The Cartographic Journal 48(2), 86– ‘Data modeling and data analytics: a survey
91. from a big data perspective’, Journal of Software
Engineering and Applications 8(12), 617.
Nærland, T. U. & Engebretsen, M. (2021),
‘Towards a critical understanding of Richardson, J., Sallam, R., Schlegel, K., Kronz, A.
data visualisation in democracy: a & Sun, J. (2020), ‘Magic quadrant for analytics
deliberative systems approach’, Information, and business intelligence platforms’, Gartner ID
Communication & Society pp. 1–19. G00386610 .
Nielson, G. M. (1995), ‘Visualization takes its Rodrigues Jr, J. F., Traina, A. & Traina Jr, C.
place in the scientific community’, IEEE (2003), Enhancing data visualization techniques,
Transactions on Visualization & Computer in ‘Third IEEE Intl. Workshop on Visual Data
Graphics 1(02), 97–98. Mining-VDM@ ICDM03’, pp. 97–112.
Oike, H., Ogawa, Y. & Oishi, K. (2019), ‘Simple Santolalla, O. (2020), Dataviz, in ‘Rock the Tech
and quick visualization of periodical data using Stage’, pp. 33–49.
microsoft excel’, Methods and protocols 2(4), 81.
Sarica, S., Yan, B., Bulato, G., Jaipurkar,
Olivér, H. (2020), ‘Kı́sérleti gyártáshoz P. & Luo, J. (2019), Data-driven network
kapcsolódó adatvizualizációs fejlesztés a purtár visualization for innovation and competitive
rendszerben’, Multidiszciplináris Tudományok intelligence, in ‘Proceedings of the 52nd Hawaii
10(4), 238–252. International Conference on System Sciences’.
Organisciak, P., Schmidt, B. M. & Downie, J. S.
Schaeffer, C., Booton, L., Halleck, J., Studeny, J.
(2022), ‘Giving shape to large digital libraries
& Coustasse, A. (2017), ‘Big data management
through exploratory data analysis’, Journal of
in us hospitals: benefits and barriers’, The health
the Association for Information Science and
care manager 36(1), 87–95.
Technology 73(2), 317–332.
Serik, M., Nurbekova, G. & Mukhambetova,
Orlovskyi, D., Kopp, A. & Kondratiev, V. (2019),
M. (2021), ‘Optimal organisation of a big
‘Using dashboards for the business processes
data training course: big data processing with
status analysis’.
bigquery and setting up a dataproc hadoop
Patel, A. (2022), ‘Data visualization using framework’, World Trans. on Engng. and
tableau’. Technol. Educ 19(4), 417–422.
Patel, J. (2019), An effective and scalable data Shneiderman, B. & Plaisant, C. (1998),
modeling for enterprise big data platform, in ‘Treemaps for space-constrained visualization of
‘2019 IEEE International Conference on Big hierarchies’.
Data (Big Data)’, pp. 2691–2697.
Srivastava, G. & Venkataraman, R. (2022), ‘A
Pattanaik, S. N. & Wiegand, R. P. (2021), ‘Data review of the state of the art in business
visualization’, Handbook of Human Factors and intelligence software’, Enterprise Information
Ergonomics pp. 893–946. Systems 16(1), 1–28.
24
Torphy, K. T., Brandon, D. L., Daly, A. J., Wilson, K. (2014), Microsoft office 365, in ‘Using
Frank, K. A., Greenhow, C., Hu, S. & office 365’, pp. 1–14.
Rehm, M. (2020), ‘Social media, education,
and digital democratization’, Teachers College Wright, S. A. (2019), Privacy in iot blockchains:
Record 122(6), 1–7. with big data comes big responsibility, in ‘2019
IEEE International Conference on Big Data (Big
Tufte, E. R., McKay, S. R., Christian, W. & Matey, Data)’, pp. 5282–5291.
J. R. (1998), ‘Visual explanations: Images and
quantities, evidence and narrative’. Zais, C. M., Aspelund, K., Cesar, M. P., Gerum,
L. & Uberoi, M. N. (n.d.), ‘The basics of big
ur Rehman, M. H., Chang, V., Batool, A. data modeling’, Big Data for Generals. . . and
& Wah, T. Y. (2016), ‘Big data reduction Everyone Else over 40 p. 33.
framework for value creation in sustainable
enterprises’, International journal of Zhang, L., Vinodhini, B. & Maragatham, T.
information management 36(6), 917–928. (2021), ‘Interactive iot data visualization
for decision making in business intelligence’,
Usova, T. & Laws, R. (2021), ‘Teaching a Arabian Journal for Science and Engineering
one-credit course on data literacy and data pp. 1–11.
visualisation.’, Journal of Information Literacy
15(1).
Vashisht, V. & Dharia, P. (2020), Integrating
chatbot application with qlik sense business
intelligence (bi) tool using natural language
processing (nlp), in ‘Micro-Electronics and
Telecommunication Engineering’, pp. 683–692.
Vellido, A. (2020), ‘The importance of
interpretability and visualization in machine
learning for applications in medicine and
health care’, Neural computing and applications
32(24), 18069–18083.
Viegas, F. B., Wattenberg, M. & Feinberg,
J. (2009), ‘Participatory visualization with
wordle’, IEEE transactions on visualization and
computer graphics 15(6), 1137–1144.
Viljanen, I. (2020), ‘Improving solutions for
analytics services in a mid-sized insurance
company’.
Villars, R. L., Olofson, C. W. & Eastwood, M.
(n.d.), ‘Big data: What it is and why you should
care’, White paper, IDC 14, 1–14.
VIOREL, N. C. & LUCIA, N. (2019), ‘Analysis of
information on tourism in the european union
using the power bi business analysis service.’,
Agricultural Management/Lucrari Stiintifice
Seria I, Management Agricol 21(1).
Walter, M., Lovett, R., Maher, B., Williamson,
B., Prehn, J., Bodkin-Andrews, G. & Lee, V.
(2021), ‘Indigenous data sovereignty in the era
of big data and open data’, Australian Journal
of Social Issues 56(2), 143–156.
Weiner, A. & Lorber, K. (2021), Infographics: A
methodology for student research presentations
and other academic projects, in ‘Society for
Information Technology & Teacher Education
International Conference’, pp. 649–652.