Data - Visualisation - Charts and Types of Data

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

CT UNIVERSITY

LUDHIANA

DATA VISUALISATION
23BTAI12E01

• Submitted By : • Submitted To :
NAME : HARMANPREET SINGH BIRDI
MS. RASHI
REGISTRATION NO. : 72313285

SUBJECT : DATA VISUALISATION WITH IBM

(AI & DS)


Data and Types of Data
Q1: Briefly Explain the concept of Data ?
Ans: Data is a collection of information gathered by observations, measurements, research or
analysis. They may consist of facts, numbers, names, figures or even description of things. Data is
organized in the form of graphs, charts or tables.

Q2: Explain types of Data with examples?


Ans: Data can be categorized in many ways, including:
Structured data

Data that has been org

anized into a database so it can be easily processed and analyzed. Structured data is usually tabular,
with rows and columns that clearly define data attributes.

Binary data

A type of data that can only have two possible values, such as 0 and 1, or "Yes" and "No". In statistics,
binary data is also called dichotomous data.

Textual data

Any information or message conveyed in written or printed form. This can include books, articles,
surveys, social media posts, chat conversations, and emails.

Numerical data

Data that is in the form of numbers, and not in any language or descriptive form. Numerical data is also
known as quantitative data. It's always measurable, and you can add it together.

Q3: Explain Numeric Data and Categorical Data with examples?


Ans: Categorical data examples include personal biodata information—full name, gender, phone
number, etc. Numerical data examples include CGPA calculator, interval sale, etc.

Numerical data is a type of data that is expressed in terms of numbers rather than natural language
descriptions. Similar to its name, numerical, it can only be collected in number form. Also known as
quantitative data, this numerical data type can be used as a form of measurement, such as a person’s
height, weight, IQ, etc.

Q4: Explain Structured and Unstructured Data and its


Characteristics?
Ans: Structured data is essentially data that conforms to a data model, has a well-defined structure,
and follows a consistent order. It should also be easily accessible and/or usable by people or computer
programs. Structured data is generally stored in well-defined schemas like databases. It tends to be
tabular and has rows and columns that define its attributes in a very clear manner.You will generally find
:Structured Query Language (SQL) being used to manage the structured data that is stored in databases.
The sources of unstructured data include: SQL Databases, Spreadsheets like Excel, OLTP Systems, Online
forms, Sensors like GPS or RFID tags, Network and Web server logs, Medical devices.

Unstructured data is qualitative data that does not follow a specific, pre-defined data model or is not
organized in a pre-defined manner. Examples of unstructured data include text documents, email text,
images, audio files, videos and free text answers within surveys. Essentially, this data is not designed
primarily to be analysed. It has no identifiable structure or data model. It has no obvious organization.

Q5: Difference between Structured Data and Unstructured


Data?
Ans:
Structured data Unstructured data
Data that fits in a predefined data model Data without an underlying model to
What is it?
or schema. discern attributes.
Basic example An Excel table. A collection of video files.
An associated collection of discrete, An associated collection of data, objects,
Best for short, non-continuous numerical and or files where the attributes change or are
text values. unknown.
Relational databases, graph databases,
File systems, DAM systems, CMSs,
Storage types spatial databases, OLAP cubes, and
version control systems, and more.
more.
Easier to organize, clean, search, and Can analyze data that can’t be easily
Biggest benefit
analyze. shaped into structured data.
Biggest All data must fit in the prescribed data
Can be difficult to analyze.
challenge model.
Main analysis
SQL queries. Varies.
technique
Q6: Define Semi-structure Data with examples?
Ans: Semi-structured data is a type of structured data that doesn't follow the tabular structure of data
models. It's also known as self-describing structure. Semi-structured data has tags or markers that
separate semantic elements and enforce hierarchies of records and fields. For example, in photos and
videos, meta tags may relate to the location, date, or by whom they were taken. HTML code. Examples:
Graphs and tables, E-mails, XML documents, Photos and videos, TCP/IP packets.

Q7: Advantages and Disadvantages of Structured and


Unstructured Data?
Ans: Structured Advantages: Requires less processing in comparison to unstructured data and is
easier to manage. Machine algorithms can easily crawl and use structured data which simplifies
querying. As an older format of data, there are several tools available for structured data that simplify
usage, management, and analysis.

Distadvantages: Limited usability because of its pre-defined structure/format. Structured data is stored
in data warehouses which are built for space saving but are difficult to change and not very
scalable/flexible.

Unstructures Advantages: Variety of native formats facilitate a greater number of use-cases and
applications. As there is no need to predefine data, unstructured data is collected quickly and easily.
Unstructured data is stored in on-premises or cloud data lakes which are highly scalable. Although
challenging, the greater volume of unstructured data provides better insights and more opportunities to
turn your data into a competitive advantage.

Disadvantages: The greater number of formats makes it equally challenging to analyze and leverage
unstructured data. The large volume and undefined formats make data management a challenge and
specialized tools a necessity.
Data Visualisation Charts
Q1: Explain what is data visualization charts.
Ans: Data visualization is the representation of data through use of common graphics, such as charts,
plots, infographics, and even animations. These visual displays of information communicate complex
data relationships and data-driven insights in a way that is easy to understand.

Q2: Give brief explanation about the basic Types of charts used
in data visualization.
Ans: In a bar chart, values are indicated by the length of bars, each of which corresponds with a
measured group. Bar charts can be oriented vertically or horizontally; vertical bar charts are sometimes
called column charts. Horizontal bar charts are a good option when you have a lot of bars to plot, or the
labels on them require additional space to be legible.

Line charts show changes in value across continuous measurements, such as those made over time.
Movement of the line up or down helps bring out positive and negative changes, respectively. It can also
expose overall trends, to help the reader make predictions or projections for future outcomes. Multiple
line charts can also give rise to other related charts like the sparkline or ridgeline plot.

A scatter plot displays values on two numeric variables using points positioned on two axes: one for
each variable. Scatter plots are a versatile demonstration of the relationship between the plotted
variables—whether that correlation is strong or weak, positive or negative, linear or non-linear. Scatter
plots are also great for identifying outlier points and possible gaps in the data.

A box plot uses boxes and whiskers to summarize the distribution of values within measured groups.
The positions of the box and whisker ends show the regions where the majority of the data lies. We
most commonly see box plots when we have multiple groups to compare to one another; other charts
with more detail are preferred when we have only one group to plot.

Q3: Show Uses of line chart and area chart.


Ans: Line Chart: A line chart supports monitoring behavior in a set of data. These charts are useful for
more than tracking change over time. They also help highlight differences and correlations within your
data. Furthermore, a line chart can help a viewer make predictions about what might happen next.

Area Chart: An area chart is a powerful data visualization tool that combines the simplicity of a line
chart with the emphasis on cumulative values. It effectively showcases the evolution of various data
series over time or categories, highlighting both individual trends and the overall pattern.
Q4: Brief description about the column chart , bar chart and
scatter plot graph.
Ans: A column chart is a method of displaying data with categories represented by a rectangle—
sometimes called vertical bar charts. They allow easy comparisons among a number of items and trends
analysis. In general, statistics and figures are difficult to understand when presented in tables or written
format.

A bar chart or bar graph is a chart or graph that presents categorical data with rectangular bars with
heights or lengths proportional to the values that they represent. The bars can be plotted vertically or
horizontally. A vertical bar chart is sometimes called a column chart.

Scatter Plots are described as one of the most useful inventions in statistical graphs. Originally, the
scatter plot was presented by an English Scientist, John Frederick W. Herschel, in the year 1833.
Herschel used it in the study of the orbit of the double stars. He plotted the positional angle of the
double star in relation to the year of measurement. The scatter plot was used to understand the
fundamental relationship between the two measurements.

Q5: Show the advantages of the charts used in our daily life.
Ans:
• Diagrams are easy to understand
• You can represent volumes of data in a simplified manner.
• They reveal hidden facts.
• They are quick to grasp and easy to compare.
• Diagrams have universal acceptability.
• Charts make the data more presentable and easy to understand. By looking
at the chart, one can draw certain inferences or analyses.
• It helps in summarizing data in a very crisp and easy manner.
• The chart helps in a better comparison of data.

Q6: Explain bullet graph , pie chart and it's use.

Ans: A bullet graph is a bar marked with extra encodings to show progress towards a goal or
performance against a reference line. Each bar focuses the user on one measure, bringing in
more visual elements to provide additional detail.
USE: Business analysts check if teams or departments are on track to meet their goals.
Healthcare systems use them to gauge capacity at specific locations. Nonprofits measure
progress towards fundraising goals.
A pie chart, sometimes called a circle chart, is a way of summarizing a set of nominal data or
displaying the different values of a given variable (e.g. percentage distribution). This type of
chart is a circle divided into a series of segments. Each segment represents a particular
category.
USE: A straightforward and easy-to-understand illustration.It visually portrays data as a
fraction of a whole, which can be an important communication tool for even the most
inexperienced audience. It allows the viewer to see a data comparison at a glimpse, allowing
them to do an immediate analysis or quickly comprehend details. The use of this chart
eliminates the need for readers to examine or quantify underlying numbers.

You might also like