
School of Computing Science and Engineering

Program: B.Tech Specialization


Course Code: BCSE3092
Course Name: Data Science

Course Outcomes:
• CO1 Acquire a sound introductory knowledge of the statistical
fundamentals used in data science. (K1)
• CO2 Develop the ability to apply algorithmic principles and programming
knowledge, using Python and R, to data science. (K2, K3)
• CO3 Develop the ability to visualise data for analysis. (K4)
• CO4 Apply and implement ML principles using probability and statistics. (K5)
• CO5 Understand and recommend statistical and machine learning
solutions. (K6)
• CO6 Gain research insights and learn the latest solutions provided by
researchers. (K6)
Program Name: BCSE3029 Program Code: Data Science

Course Prerequisites

• PYTHON BASICS
• STATISTICS
• LINEAR ALGEBRA


Syllabus


Recommended Books
Text books

Reference Book



Introduction to Data Visualization
• Data visualization is the graphical representation of
information and data.

• By using visual elements like charts, graphs, and maps,
data visualization tools provide an accessible way to see
and understand trends, outliers, and patterns in data.
• Data visualization is the process of acquiring, interpreting
and comparing data in order to clearly communicate
complex ideas, thereby facilitating the identification and
analysis of meaningful patterns.
Visualization process

• Filtering & processing
Refining and cleaning data to convert it into information
through analysis, interpretation, contextualization, comparison,
and research.
• Translation & visual representation
Shaping the visual representation by defining graphic
resources, language, context, and the tone of the
representation, all of which are adapted for the recipient.
• Perception & interpretation
Finally, the visualization becomes effective when it has a
perceptive impact on the construction of knowledge.
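
The filtering and processing stage can be illustrated with a minimal sketch. The raw readings and the helper names `clean` and `summarize` are hypothetical; they are not part of the course material.

```python
# Hypothetical raw readings as they might arrive from a form or a CSV export.
raw_readings = ["12.5", " 13.1 ", "", "n/a", "11.9", "120.0", "12.8"]


def clean(readings):
    """Filtering: keep only the entries that parse as numbers."""
    cleaned = []
    for item in readings:
        text = item.strip()
        try:
            cleaned.append(float(text))
        except ValueError:
            continue  # drop blanks and placeholders such as "n/a"
    return cleaned


def summarize(values):
    """Processing: turn cleaned values into information worth plotting."""
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": round(sum(values) / len(values), 2),
    }


values = clean(raw_readings)
summary = summarize(values)
print(summary)
```

The value 120.0 survives cleaning because it is a valid number. Whether it is an error or a genuine outlier is a question for the later stages, where a plot makes it visible.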
• Common general types of data visualization:
• Charts
• Tables
• Graphs
• Maps
• Infographics
• Dashboards

• More specific examples of methods to visualize data:
• Area Chart
• Bar Chart
• Box-and-whisker Plots
• Bubble Cloud
• Bullet Graph
• Cartogram
• Circle View
• Dot Distribution Map
• Gantt Chart
• Heat Map
• Highlight Table
• Histogram
Kinds of data
Data Relationship
• Why data visualization is such a powerful tool:

• Intuitive: Presenting a graph as a node-link structure instantly makes sense, even to
people who have never worked with graphs before.

• Fast: Our brains are great at identifying patterns, but only when data
is presented in a tangible format. Armed with visualization, we can spot trends and
outliers very effectively.

• Flexible: The world is densely connected, so as long as there is an interesting
relationship in your data somewhere, you will find value in graph visualization.

• Insightful: Exploring graph data interactively allows users to gain more in-depth
knowledge, understand the context and ask more questions, compared to static
visualization or raw data.
Principles of Data Visualization
• The role of data visualization in
communicating the complex insights hidden
inside data is vital.

• This is becoming more and more important
since the audience for data visualizations is
also expanding along with the size of data.
• Balance the design

• Emphasize the key areas

• Illustrate movement

• Smart use of patterns

• Proportion

• Proper rhythm

• Variety

• Theme
The first step in representing information is trying
to understand the data to be visualized. A common
sequence for exploring it:

1. Overview first

2. Zoom and filter

3. Details on demand
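
The three steps above can be sketched on a small table. The sales records and the function names are hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical records: (city, month, sales).
records = [
    ("Delhi", "Jan", 120), ("Delhi", "Feb", 135),
    ("Mumbai", "Jan", 200), ("Mumbai", "Feb", 180),
    ("Pune", "Jan", 90), ("Pune", "Feb", 310),
]


def overview(rows):
    """Step 1: one aggregate per city gives the whole picture at a glance."""
    totals = Counter()
    for city, _month, sales in rows:
        totals[city] += sales
    return dict(totals)


def zoom_and_filter(rows, city):
    """Step 2: narrow the view to the subset of interest."""
    return [row for row in rows if row[0] == city]


def details_on_demand(rows, city, month):
    """Step 3: return the single record the reader asked about."""
    for row in rows:
        if row[0] == city and row[1] == month:
            return row
    return None


print(overview(records))
print(zoom_and_filter(records, "Pune"))
print(details_on_demand(records, "Pune", "Feb"))
```

In an interactive dashboard the same three calls would typically correspond to the initial chart, a selection or filter control, and a tooltip or detail panel.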
Layout and design: communicative elements
• All visual representations begin with a blank
dimensional space that will eventually hold the
information which will be communicated.

• The process of spatial coding is a fundamental part
of visual representation because it is the medium in
which the results of our compositional decisions
and the meaning of our visual statement will be
visualized, thereby having an impact on the user.
Three different kinds of color schemes, based on
the nature of the data:
1. Monochromatic sequential palettes or their
analogue
2. Diverging palettes
3. Qualitative palettes
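
A minimal sketch of how the three kinds of palette differ, using only Python's standard `colorsys` module. The function names, default hues, and lightness ranges are assumptions made for illustration; plotting libraries ship carefully tuned palettes that are usually preferable in practice.

```python
import colorsys


def to_hex(r, g, b):
    """Convert RGB components in the range 0..1 to a hex color string."""
    return "#{:02x}{:02x}{:02x}".format(round(r * 255), round(g * 255), round(b * 255))


def sequential_palette(n, hue=0.6):
    """One hue, lightness running from light to dark: for ordered values."""
    steps = [0.9 - 0.6 * i / max(n - 1, 1) for i in range(n)]
    return [to_hex(*colorsys.hls_to_rgb(hue, light, 0.7)) for light in steps]


def diverging_palette(n, low_hue=0.0, high_hue=0.6):
    """Two hues fading to a pale midpoint: for data with a meaningful center."""
    colors = []
    for i in range(n):
        t = i / max(n - 1, 1)        # position from 0.0 to 1.0
        distance = abs(t - 0.5) * 2  # 0 at the center, 1 at either end
        hue = low_hue if t < 0.5 else high_hue
        light = 0.95 - 0.5 * distance
        colors.append(to_hex(*colorsys.hls_to_rgb(hue, light, 0.7)))
    return colors


def qualitative_palette(n):
    """Evenly spaced hues at equal lightness: for unordered categories."""
    return [to_hex(*colorsys.hls_to_rgb(i / n, 0.5, 0.7)) for i in range(n)]


print(sequential_palette(5))
print(diverging_palette(5))
print(qualitative_palette(6))
```

The choice follows the nature of the data: ordered magnitudes suit a sequential palette, values that deviate in two directions from a reference point suit a diverging one, and named categories suit a qualitative one.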
Histograms and Box Plots

• Histograms
Histograms display the distribution of a continuous
variable by dividing the range of scores into a
specified number of bins on the x-axis and displaying
the frequency of scores in each bin on the y-axis.
A histogram takes only one variable from the dataset
and shows how often its values fall in each bin. A
simple dataset is enough to see how a histogram helps
in understanding data.
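
The binning described above can be sketched in plain Python. The exam scores are a hypothetical dataset, and the `histogram` helper is an illustration of equal-width binning, not a library function.

```python
# Hypothetical exam scores for one continuous variable.
scores = [30, 42, 47, 51, 55, 58, 61, 63, 64, 67, 70, 72, 75, 78, 81, 88, 90]


def histogram(values, bins):
    """Split the range of values into equal-width bins and count each bin.

    Assumes the values are not all identical (the bin width would be zero).
    """
    low, high = min(values), max(values)
    width = (high - low) / bins
    counts = [0] * bins
    for v in values:
        # The maximum value would land one past the end, so clamp it
        # into the last bin.
        index = min(int((v - low) / width), bins - 1)
        counts[index] += 1
    edges = [low + i * width for i in range(bins + 1)]
    return edges, counts


edges, counts = histogram(scores, bins=6)

# Text rendering: bins run down the page, bar length is the frequency.
for left, right, count in zip(edges, edges[1:], counts):
    print(f"{left:5.1f} - {right:5.1f} | {'#' * count}")
```

Changing `bins` changes the shape of the picture, which is why the number of bins is worth trying at several values before drawing conclusions.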
Box plots

• Box plots
• A box-and-whiskers plot describes the distribution of a
continuous variable by plotting its five-number summary.
• Using parallel box plots to compare groups
• Box plots can be created for individual variables or for
variables by group.
• A boxplot shows the distribution of the data in more
detail. It makes outliers easy to spot and shows the
minimum, first quartile (Q1), median, third quartile (Q3),
maximum, and interquartile range (IQR).
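
A minimal sketch of the five-number summary behind a box plot, using the standard `statistics` module. The sample is hypothetical. Two choices here are assumptions: the "inclusive" quartile method (tools differ in how they interpolate quartiles) and the common 1.5 × IQR rule for flagging outliers.

```python
import statistics

# Hypothetical sample with one unusually large value.
data = [7, 9, 10, 12, 13, 14, 15, 16, 18, 40]


def five_number_summary(values):
    """Return minimum, Q1, median, Q3, and maximum."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {"min": min(values), "q1": q1, "median": median,
            "q3": q3, "max": max(values)}


def find_outliers(values):
    """Flag points farther than 1.5 * IQR beyond the quartiles."""
    s = five_number_summary(values)
    iqr = s["q3"] - s["q1"]
    lower = s["q1"] - 1.5 * iqr
    upper = s["q3"] + 1.5 * iqr
    return [v for v in values if v < lower or v > upper]


summary = five_number_summary(data)
outliers = find_outliers(data)
print(summary)
print("outliers:", outliers)
```

In a drawn box plot, the box spans Q1 to Q3, the line inside it marks the median, and points flagged by the rule are plotted individually beyond the whiskers.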
Code - Box plots
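
A minimal sketch of parallel box plots by group, assuming matplotlib is installed. The observations and the helper names `values_by_group` and `draw_box_plots` are hypothetical. The plotting import sits inside the function so the grouping step can run without matplotlib.

```python
from collections import defaultdict

# Hypothetical (group, value) observations.
observations = [
    ("A", 12), ("A", 15), ("A", 14), ("A", 19), ("A", 30),
    ("B", 22), ("B", 25), ("B", 21), ("B", 27), ("B", 24),
]


def values_by_group(rows):
    """Collect the values of each group so every group gets its own box."""
    groups = defaultdict(list)
    for group, value in rows:
        groups[group].append(value)
    return dict(groups)


def draw_box_plots(groups, path="boxplots.png"):
    """Draw one box per group side by side and save the figure to a file."""
    import matplotlib
    matplotlib.use("Agg")  # render to a file; no display needed
    import matplotlib.pyplot as plt

    names = list(groups)
    fig, ax = plt.subplots()
    ax.boxplot([groups[name] for name in names])
    # Boxes are drawn at positions 1..N by default, so label those ticks.
    ax.set_xticks(range(1, len(names) + 1))
    ax.set_xticklabels(names)
    ax.set_ylabel("value")
    fig.savefig(path)
    plt.close(fig)


groups = values_by_group(observations)
print(groups)
```

Calling `draw_box_plots(groups)` writes the figure to `boxplots.png`. Passing a single list inside the outer list gives a box plot for one variable.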
Notched box plots
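
A notched box plot narrows the box around the median to show a rough confidence interval for it. A commonly used approximation places the notch at median ± 1.57 × IQR / √n; the constant varies slightly between tools. The sketch below computes that interval for two hypothetical groups; `notch_interval` and `notches_overlap` are illustrative helpers. In matplotlib, passing `notch=True` to `boxplot` draws the notches.

```python
import math
import statistics


def notch_interval(values, factor=1.57):
    """Approximate interval around the median: median +/- factor * IQR / sqrt(n)."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    half_width = factor * (q3 - q1) / math.sqrt(len(values))
    return median - half_width, median + half_width


def notches_overlap(a, b):
    """True if the notch intervals of the two samples overlap."""
    low_a, high_a = notch_interval(a)
    low_b, high_b = notch_interval(b)
    return low_a <= high_b and low_b <= high_a


# Hypothetical groups to compare.
group_a = [12, 14, 15, 15, 16, 17, 18, 19, 20, 22]
group_b = [24, 25, 27, 28, 28, 29, 30, 31, 33, 35]

print(notch_interval(group_a))
print(notch_interval(group_b))
print("overlap:", notches_overlap(group_a, group_b))
```

When the notches of two boxes do not overlap, that is usually read as evidence that the medians differ. It is a visual guide, not a formal test.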
