Download as pdf or txt
Download as pdf or txt
You are on page 1of 37

BUSINESS ANALYTICS

SESSION 3
VISUALIZING AND
EXPLORING DATA
VISUALIZING AND
EXPLORING DATA

WHAT ARE WE
GOING TO LEARN?
1. Data Visualization Tools
Understand tools in Excel to create data visualization (column/bar charts, line/area charts, pie charts, etc.)

2. Other Data Visualization Tools


Understand other Excel data visualization tools (conditional formatting and sparklines)

3. Data Queries: Tables, Sorting, Filtering


Understand using Excel Table, Data Sorting, and Data Filtering for data exploration

4. Statistical Methods for Summarizing Data


Understand features in Excel for statistical analysis

Important Note: Use Microsoft Excel 2010 or newer version! Or use Excel Online using your UI Account!
Excel Office 365 (or Excel Online) version will be used throughout this course.
1. DATA VISUALIZATION BASIC TOOLS
DATA VISUALIZATION: OVERVIEW
Data visualization: the process of presenting data in a certain way to gain insights for
decision making.

Part of Descriptive Analytics, presenting past/historical or current data to


understand what happened or what is happening.

Essential for building decision models (e.g. linear or nonlinear) and


interpreting the results.

Dashboard: visual representation of important business measures.


DATA VISUALIZATION BASIC TOOLS
Column/Bar Charts
TYPES OF CHARTS

Column/bar charts are used for:


Vertical charts/
Column charts
1 Comparing values among categories

2 Comparing parts of a whole data

Horizontal charts/ 3 Comparing percentages of a whole


Bar charts
Line/Area Charts Pie Charts

To show data trends


over time
To show data in
partitions or in
proportion
Not recommended
To show data trends
due to difficulty in
over time, including its
comparing the
parts/proportion
relative sizes of
(line chart + pie chart)
areas.
Scatter Charts Hierarchy Charts

To provides a hierarchical
view of the data and to
To compare at least compare different levels
two sets of data of categorization.
To show
To display hierarchical
relationships
data and can be plotted
between sets of
when empty (blank) cells
values
exist within the hierarchal
structure.
To compare at least
three sets of data
To show
relationships
between sets of
values
Statistic Charts Combo Charts

To combine two or more


To show the
chart types
frequencies within a
distribution.

To show distribution of
data into quartiles,
highlighting the mean
and outliers.
Other Charts Other Charts
To find optimum
combinations between two
To show a running total of sets of data
your financial data as values
are added or subtracted.
To compare the aggregate
To show values across values of several data series.
multiple stages in a process.

To show fluctuations in stock


prices, daily rainfall or annual
temperatures.
EXAMPLE

Example 1: “Purchase Orders” Data Alternative: Using PivotChart

Create a chart to show the number of orders based on


order size and order status.

Number of Orders Based on Order Size and Order Status


30

25

20
Number of Orders

15

10

0
Very Small Small Big Very Big

Fast Normal Late


EXERCISE

Exercise 1: “Purchase Orders” Data

Create a combo chart to show both the number


and the value of orders based on order sizes.

Number of Orders vs Value of Orders


4,500,000 60

4,000,000
50
3,500,000

Number of orders
3,000,000 40
Value of Orders

2,500,000
30
2,000,000

1,500,000 20

1,000,000
10
500,000

- 0
Very Small Small Big Very Big
Order Size

Number of Orders Value of Orders


MAPS FOR GEOGRAPHICAL DATA
Maps Example 2: “GeoData” Set 1

Create a map to show the income group classification.


EXERCISE

Exercise 2: “GeoData” Set 2


Create a map to show the classification of
projected population number for each province
in Indonesia in 2035.

Range (in thousands) Category


<2,000 Very Low
2,000 – 4,999 Low
5,000 – 9,999 Medium
10.000 – 24,999 High
≥ 25,000 Very High
2. OTHER DATA VISUALIZATION TOOLS
CONDITIONAL FORMATTING
EXAMPLE

Example 3a: “Monthly Sales” Data

Create conditional formatting for monthly


sales data using data bars.

High bars → high values


Low bars → low values
EXAMPLE
Period Jakpus Jaktim Jakbar Jakut Jaksel
Example 3b: “Monthly Sales” Data
31-Jan-19 185,740 194,379 210,626 136,134 107,927
28-Feb-19 105,259 115,649 72,305 109,908 208,552
Create conditional formatting for monthly
31-Mar-19 82,289 180,242 65,417 153,460 174,444
sales data using color scales.
30-Apr-19 105,247 187,493 127,835 91,677 72,980
31-May-19 123,851 110,897 159,966 160,005 68,231
30-Jun-19 109,908 116,270 132,840 80,307 167,026
Green → high values 31-Jul-19 95,580 212,220 107,549 106,515 77,390
Yellow → middle values 31-Aug-19 98,133 114,108 205,635 130,608 87,121
Red → low values 30-Sep-19 82,587 116,222 147,006 76,455 102,440
31-Oct-19 86,658 118,446 78,026 205,389 152,442
30-Nov-19 112,217 168,584 161,548 159,665 179,145
31-Dec-19 155,213 150,830 116,020 173,613 121,918
31-Jan-20 113,177 171,760 159,082 136,916 179,842
29-Feb-20 56,293 43,376 46,769 54,178 40,634
31-Mar-20 43,390 55,905 48,783 35,747 56,107
30-Apr-20 56,948 41,658 53,130 44,931 44,943
31-May-20 61,667 56,527 36,522 46,821 58,321
30-Jun-20 36,195 40,476 50,281 33,306 43,828
EXAMPLE

Example 3c: “Monthly Sales” Data

Create conditional formatting for monthly


sales data using icon sets.

Up → high values (≥ 67 percentile)


Right → middle values (33-67 percentile)
Down → low values (< 33 percentile)
SPARKLINES

1 Line Sparkline

2 Column Sparkline Mini chart in a cell

3 Win/Loss Sparkline
EXAMPLE

Example 4a: “Monthly Sales” Data

Create line sparklines for monthly


sales data.

Shows trend line for each category over


time period.
EXAMPLE

Example 4b: “Monthly Sales” Data

Create column sparklines for monthly


sales data.

Shows comparison between categories


for each period.
EXAMPLE

Example 4c: “Monthly Sales” Data

Create win/loss sparklines for monthly


sales data.

Shows win (positive change) in blue and


loss (negative change) in red.
3. DATA QUERIES: TABLES, SORTING, FILTERING
TABLE

1 Block data (CTRL+SHIFT+Right Arrow and Down Arrow)

2 From Insert tab select Table

Order No. Supplier Item No. Item Description Unit Cost Quantity Order Date Arrival Date
2020010001 PT Hereford 1702 Hatch Decal - type 2 $1.02 950 2-Jan-20 17-Jan-20
2020010002 PT Canterbury 2003 Pressure Gauge - type 3 $114.00 705 2-Jan-20 19-Jan-20
2020010003 PT Canterbury 2203 Side Panel - type 3 $210.00 775 2-Jan-20 11-Jan-20
2020010004 PT Durham 2202 Side Panel - type 2 $234.00 710 2-Jan-20 7-Jan-20
2020010005 PT Hereford 1401 Door Decal $0.66 225 2-Jan-20 19-Jan-20
2020010006 PT Durham 2202 Side Panel - type 2 $234.00 1,020 2-Jan-20 12-Jan-20
2020010007 PT Hereford 1703 Hatch Decal - type 3 $1.20 725 2-Jan-20 8-Jan-20
2020010008 PT Hereford 1401 Door Decal $0.66 450 2-Jan-20 12-Jan-20
2020010009 PT Canterbury 2203 Side Panel - type 3 $210.00 1,050 3-Jan-20 18-Jan-20
2020010010 PT Canterbury 2003 Pressure Gauge - type 3 $114.00 410 3-Jan-20 9-Jan-20
2020010011 PT Flinshire 1904 O-Ring - type 4 $3.60 1,850 3-Jan-20 19-Jan-20
2020010012 PT Brighton 2201 Side Panel - type 1 $222.00 1,040 3-Jan-20 12-Jan-20
2020010013 PT Durham 2202 Side Panel - type 2 $234.00 230 6-Jan-20 11-Jan-20

Perform automatic formatting and updating when


adding a new entry to the table.
DATA SORTING

1 Block data

2 From Data tab select Sort

Add other sorting


criteria

Sort order
Sort basis
DATA FILTERING

1 Block data

2 From Data tab select Filter


DATA FILTERING

Filter with customized criteria

Filter with more criteria


4. STATISTICAL METHODS FOR SUMMARIZING
DATA
DATA ANALYSIS

1 File >> More… >> Options

2 Add-ins >> Analysis ToolPak >> Go >> OK


DATA ANALYSIS

Descriptive analysis tools:


• Descriptive Statistics
• Histogram
• Rank and Percentile
DATA ANALYSIS – DESCRIPTIVE STATISTICS

Show descriptive statististical measures for each data category

Descriptive Statistics
Quantity Order Date

Mean 6,422.30 Mean 18-Jan-20


Standard Error 750.62 Standard Error 1.08
Median 2,525 Median 20-Jan-20
Mode 5,000 Mode 30-Jan-20
Standard Deviation 7,277.49 Standard Deviation 10.48137327
Sample Variance 52,961,910 Sample Variance 109.8591855
Kurtosis 0 Kurtosis -1.419614877
Skewness 1 Skewness -0.311622712
Range 25405 Range 29
Minimum 195 Minimum 43832
Maximum 25600 Maximum 43861
Sum 603696 Sum 4121803
Count 94 Count 94
Largest(5) 23200 Largest(5) 31-Jan-20
Smallest(5) 410 Smallest(5) 2-Jan-20
Confidence Level(95.0%) 1,490.57 Confidence Level(95.0%) 2-Jan-00
DATA ANALYSIS – HISTOGRAM

Show frequency and cumulative distribution for each data category.

Frequency and Cumulative - Qty Pareto Analysis - Qty


Bin Frequency Cumulative % Bin Frequency Cumulative %
500 7 7.45% 2,000 22 23.40%
1,000 10 18.09% 5,000 21 45.74%
2,000 22 41.49% 20,000 12 58.51%
5,000 21 63.83% 1,000 10 69.15%
10,000 10 74.47% 10,000 10 79.79%
15,000 5 79.79% 500 7 87.23%
20,000 12 92.55% 15,000 5 92.55%
25,000 5 97.87% 25,000 5 97.87%
More 2 100.00% More 2 100.00%
DATA ANALYSIS – RANK AND PERCENTILE

Sort, rank, and determine the percentile of all the data entries.

Rank and Percentile for Quantity and Order Date


Point Quantity Rank Percent Point Order Date Rank Percent
73 25,600 1 100.00% 81 31-Jan-20 1 86.00%
16 25,200 2 98.90% 82 31-Jan-20 1 86.00%
92 23,900 3 97.80% 83 31-Jan-20 1 86.00%
90 23,300 4 96.70% 84 31-Jan-20 1 86.00%
85 23,200 5 95.60% 85 31-Jan-20 1 86.00%
91 22,400 6 94.60% 86 31-Jan-20 1 86.00%
94 20,400 7 93.50% 87 31-Jan-20 1 86.00%
46 18,700 8 92.40% 88 31-Jan-20 1 86.00%
86 18,600 9 91.30% 89 31-Jan-20 1 86.00%
89 18,300 10 90.30% 90 31-Jan-20 1 86.00%
63 18,100 11 89.20% 91 31-Jan-20 1 86.00%
19 17,700 12 88.10% 92 31-Jan-20 1 86.00%
27 17,600 13 87.00% 93 31-Jan-20 1 86.00%
61 17,200 14 86.00% 94 31-Jan-20 1 86.00%
75 16,300 15 84.90% 66 30-Jan-20 15 69.80%
67 15,800 16 83.80% 67 30-Jan-20 15 69.80%
BOX AND WHISKER CHART

From Insert tab select Charts and choose Box and Whisker
Thank You

You might also like