Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 29

Data visualization-1

U-4 P-1
Visualization
• Visualization is a graphical representation of
quantitative information and data by using
visual elements like graphs, charts, and maps.
• There are 2 ways to visualize data:
1. Infographics
2. Data visualization
Infographic
• An infographic (information graphic) is a
representation of information in a graphic
format designed to make the data easily
understandable at a glance.
• People use infographics to quickly
communicate a message, simplify the
presentation of large amounts of data, see
data patterns and relationships, and monitor
changes in variables.
Data visualization
• It convert large and small data sets into
visuals, which is easy to understand and
process for humans.
• It provides accessible ways to understand
outliers, patterns, and trends in the data.
• It explores data in different angles.
• It finds key or interesting breakthroughs in a
large dataset.
• Data can be classified based on the following
three criteria.
• Method of creation: refers to the type of content
used while creating any graphical representation.
• Quantity of data displayed: refers to amount of
data which is represented.
• Degree of creativity applied: refers to the extent
to which the data is created graphically or
designed in a colorful way or it is just showing
some important data in black and white
diagrams.
Content types
• Graph: A representation in which X and Y axes are used to
depict the meaning of the information
• Diagram: A two-dimensional representation of information to
show how something works
• Timeline: A representation of important events in a sequence
with the help of self-explanatory visual material
• Template: A layout is a design for presenting information
• Checklist: A list of items for comparison and verification
• Flowchart: A representation of instructions that shows how
something works or a step-by-step procedure to perform a
task
• Mind map: A type of diagram which is used to visually
organize information
Techniques used for visual data representation

• Isoline: 2D representation of a curved line that


moves constantly on the surface of a graph.
• Isosurface
• An isosurface is a three-dimensional analog of
an isoline.
• It is a surface that represents points of a
constant value within a volume of space.
• Direct volume rendering:
• Method used for obtaining 2D projection for a
3D dataset.

• Streamline:
• A field line that results from the velocity
vector field description of the data flow.
• Map:
• Visual representation of locations within in
specific area.
• Parallel coordinate plot:
• A parallel coordinate plot maps each row in
the data table as a line

• Venn diagram:
• Illustrate the logical relationships between
two or more sets of items
• Timeline:
• A timeline is a display of a list of events in
chronological order.

• Euler diagram:
• An Euler diagram is a diagrammatic means of
representing sets and their relationships.
• Hyperbolic trees:
• It defines a graph
drawing method
inspired by
hyperbolic
geometry.
• Cluster diagram:
It displays clusters
or groups of information,
objects, or ideas.
• Ordinogram:
• It is used to analyze various sets of
multivariate objects.
• Types of Data Visualization:
• You already know that data can be visualized in many ways, such as in the
forms of 1D, 2D, or 3D structures. The following are the different types of
data visualizations:
1. 1D (Linear) data visualization:
In the linear data visualization, data is presented in the form of lists.
Hence, we cannot term it as visualization. It is rather a data organization
technique. Therefore, no tool is required to visualize data in a linear
manner. For example, a list of items organized in a predefined manner.
2. 2D (Planar) data visualization:
This technique presents data in the form of images, diagrams, or charts
on a plane surface. Some tools used to create 2D data visualization
patterns are GeoCommons, Google Fusion Tables, Google Maps API,
Polymaps, Tableau Public, etc. For example, choropleth, cartogram, dot
distribution map, and proportional symbol map.
3. 3D (Volumetric) data visualization:
In this method, data presentation involves exactly three
dimensions to show simulations, surface and volume
rendering, etc. Generally, it is used in scientific studies. To
create 3D visualizations, we use some visualization tools
that involve AC3D, AutoQ3D, TrueSpace, etc. For example,
3D computer models, surface rendering, volume
rendering, and computer simulations.
4. Temporal data visualization:
Sometimes, visualizations are time dependent. To visualize
the dependence of analyses on time, the temporal data
visualization is used, which includes Gantt chart, time
series, sanky diagram, etc. TimeFlow, Timeline JS, Excel,
Timeplot, TimeSearcher, Google Charts, Tableau Public,
and Google Fusion Tables, etc. are some tools used to
create temporal data visualization.
5. Multidimensional data visualization:
• In this type of data visualization, numerous dimensions are used to
present data. We have pie charts, histograms, bar charts, etc. to
exemplify multidimensional data visualization. Many Eyes, Google
Charts, Tableau Public, etc. are some tools used to create
multidimensional data visualization.
6. Tree/Hierarchical data visualization:
• Sometimes data relationships need to be shown in the form of
hierarchies. To represent such kind of relationships, we use tree or
hierarchical data visualizations. Examples of tree/hierarchical data
visualization include hyperbolic tree, wedge-stack graph, etc. Some
tools to create hierarchical data visualization are d3, Google Charts, and
Network Workbench/Sci2.
7. Network data visualization:
• It is used to represent data relations that are too complex to be
represented in the form of hierarchies. Some examples include matrix,
node link diagram, hive plot. Network data visualization tools are Pajek,
Gephi, NodeXL, VOSviewer, UCINET, GUESS, Network Workbench/Sci2,
sigma.js, d3/Protovis, Many Eyes, Google Fusion Tables, etc.
• Applications of Data Visualization:
• Data visualization tools and techniques are used in various applications.
Some of the areas in which we apply data visualization are as follows:
• Education:
• Visualization is applied to teach a topic that requires simulation or
modeling of any object or process. Have you ever wondered how difficult
it would be to explain any organ or organ system without any visuals?
Organ system or structure of an atom is best described with the help of
diagrams or animations.
• Information:
• Visualization is applied to transform abstract data into visual forms for easy
interpretation and further exploration.
• Production:
• Various applications are used to create 3D models of products for better
viewing and manipulation. Real estate, communication, and automobile
industry extensively use 3D advertisements to provide a better look and
feel to their products.
• Science:
• Every field of science, including fluid dynamics, astrophysics, or
medicine use visual representation of information. Isosurfaces
and direct volume rendering are typically used to explain
scientific concepts.
• Systems visualization:
• Systems visualization is a relatively new concept that integrates
visual techniques to better describe complex systems.
• Visual communication:
• Multimedia and entertainment industry use visuals to
communicate their ideas and information.
• Visual analytics:
• It refers to the science of analytical reasoning supported by the
interactive visual interface. The data generated by social media
interaction is interpreted using visual analytics techniques.
• Visualizing Big Data:
• Almost every organization today is struggling to tackle the huge amount of
data pouring in every day.
• Data visualization is a great way to reduce the turn-around time consumed
in interpreting Big Data.
• Traditional visualization techniques are not efficient enough to capture or
interpret the information that Big Data possesses.
• For example, such techniques are not able to interpret videos, audios, and
complex sentences. Apart from the type of data, the volume and the
speed with which it is generating pose a great challenge.
• Big Data comprises both structured as well as unstructured form of data
collected from various sources. Heterogeneity of data sources, data
streaming, and real-time data are also difficult to handle using traditional
tools.
• Traditional tools are developed using relational models that work best on
static interaction.
• Big Data is highly dynamic in function and therefore most traditional tools
are not able to generate quality results. The response time of traditional
tools is quite high, making it unfit for quality interaction.
• Deriving Business Solutions:
• The most common notation used for Big Data is 3Vs—volume, velocity, and
variety.
• Big Data generated through social media sites is a valuable source of
information to understand consumer sentiments and demographics.
• Almost every company nowadays is working with Big Data and facing the
following challenges:
1. Most data is in unstructured form
2. Data is not analyzed in real-time
3. The amount of data generated is huge
4. There is a lack of efficient tools and techniques
• Considering all these factors, IT companies are focusing more on the
research and development of robust algorithms, software, and tools to
analyze the data that is scattered in the Internet space. Tools such as Hadoop
are providing state-of-the-art technology to store and process Big Data.
• Analytical tools are now able to produce interpretations on smartphones and
tablets. It is possible because of the advanced visual analytics that is enabling
business owners and researchers to explore the data to find out trends and
patterns.
• Turning Data into Information:
• Visualization facilitates the identification of patterns in the form of graphs or charts, which
in turn helps to derive useful information. Data reduction and abstraction are generally
followed during data mining to get valuable information.
• Visual data mining also works on the same principle as simple data mining; however, it
involves the integration of information visualization and human-computer interaction.
• Visualization of data produces cluttered images that are filtered with the help of clutter-
reduction techniques. Uniform sampling and dimension reduction are two commonly used
clutter-reduction techniques.
• Visual data reduction process involves automated data analysis to measure density, outliers,
and their differences. These measures are then used as quality metrics to evaluate data-
reduction activity. Visual quality metrics can be categorized as:
1. Size metrics, (e.g. number of data points)
2. Visual effectiveness metrics, (e.g. data density, collisions)
3. Feature preservation metrics, (e.g. discovering and preserving data density
differences)
• In general, we can conclude that a visual analytics tool should be:
1. Simple enough so that even non-technical users can operate it
2. Interactive to connect with different sources of data
3. Competent to create appropriate visuals for interpretations
4. Able to interpret Big Data and share the information
• Apart from representing data, a visualization tool must be able to establish links between
different data values, restore the missing data, and polish data for further analysis
• Data Visualization Tools:
• Excel
• Last.Forward
• Digg.comm
• Pics
• Arc
• Google charts API
• TwittEarth
• Tag Galaxy
• D3
• Rootzmap Mapping the internet
Open Source Data Visualization Tools

1. VTK
2. Cave5D
3. ELKI
4. Tulip
5. Gephi
6. IBM OpenDX
7. Tableau public
8. Vis5D
Analytical techniques used in data visualization

1. Regression analysis: Used for prediction


• Least squares regression
• Logistic
• Hierarchical linear modeling
• Duration models
2. Grouping methods: Categorizing observation into blocks
3. Multiple equation models: Analyzes pathways from
independent to dependent variables.
• Path analysis
• Structural equation modeling
Tableau
• Tableau is a data visualization platform that
allows data analysts, scientists, statisticians,
and other professionals to analyze data and
form clear conclusions based on their findings.
• Tableau handles large amounts of data fast
and produce the necessary data visualization
result.
Tableau Products
• Tableau offers five main products to fit into diverse data visualization requirements for
professionals and organizations. They are:

• Tableau desktop:
• Tableau Desktop is a paid source, personal edition- $35 per month and professional edition-
$70 per month.
• Tableau desktop data source can connect to any data source file, including databases, web
applications, and more.
• Tableau desktop can also install on Window and Mac operating system.
• Data and Visualization are secured in Tableau desktop.
• In Tableau desktop, data can extract from various data sources and stored as Tableau extract
file.
• Tableau desktop uses the details at Professional and Enterprise level.

• Tableau public:
• Tableau Public is a free and open-source.
• Tableau public data source can connect to Excel and Text files.
• Tableau public can be installed on Window and Mac operating system.
• Data and Visualizations are not secured in the Tableau public because it is available in public.
• In Tableau public, data cannot be obtained from different data sources as it is limited to
connect only Excel and Text files.
• Tableau public uses the details at Personal level.
• Tableau reader:
• It is a free application that can be downloaded from tableau website.
• It can be used to analyze the workbooks that have been saved with .tbwx
extension.
• The only thing worth keeping in mind is that anyone who receives the
workbook can use the tableau reader to open it.
• So, essentially, there is no security.

• Tableau server:
• It is enterprise-level tableau software.
• You can publish dashboards with Tableau desktop and then share them with
others inside the organization with the help of a web-based tableau server.
Following are the components of tableau server:
1. Application server: It handles permission and browsing for web and mobile
interfaces of tableau server.
2. Vizsql server: After receiving a request from clients, Vizsql server sends the
query to data source and returns the result. The result set is then rendered as
image and finally is presented to the user.
3. Data server: manages and stores data sources and metadata.
Tableau public installation
• https://intellipaat.com/blog/tutorial/tableau-t
utorial/installation-tableau-desktop/#no2

You might also like