
Descriptive Statistics

#2
Niniet Indah A., MT
niniet@ie.its.ac.id
Industrial Engineering Department
Sepuluh Nopember Institute of Technology
INDONESIA
This chapter includes:
• Creating and interpreting bar charts, pie charts, and stem-and-leaf diagrams
• Constructing a frequency distribution
• Constructing and interpreting a histogram

Objectives:

1. How to describe the data in a useful way (precise, simple, but complete)
2. How to understand the pattern of the data
3. How to conclude the basic shape of the data
DATA PRESENTATION
• There are two ways of presenting data:
Thinking Challenge
"Our market share far exceeds all competitors!" - VP
[Figure: horizontal bar chart comparing Us with competitors X and Y; the horizontal axis runs from 30% to 36%]

But let us see: what is the problem with the chart?
DATA PRESENTATION
SUMMARY TABLE
1. Lists categories and the number of elements in each category
2. Obtained by tallying responses in each category
3. May show frequencies (counts), percentages, or both

NO.  MAJOR        COUNT
1    Accounting   130
2    Economics     20
3    Management    50
     Total        200

Each row is a category; each count is obtained from a tally (|||| ||||).
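DATA PRESENTATION
BAR CHART

NO.  MAJOR        COUNT
1    Accounting   130
2    Economics     20
3    Management    50
     Total        200

[Figure: horizontal bar chart of Major (Acct., Econ., Mgmt.) against Frequency; horizontal axis from 0 to 150]

• Horizontal bars for categorical variables
• Bar length shows frequency or %
• Equal bar widths
• Spacing between bars: 1/2 to 1 bar width
• The frequency axis starts at the zero point
• Percent may also be used instead of frequency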
Different shapes of bar charts are used when an analyst needs additional information from the chart.
What is the interpretation of each chart? What is the difference between them?

Pareto Diagram Example
[Figure: Pareto diagram of the categories Stocks, Bonds, Savings, and CD, shown in frequency order. The bar graph shows the % invested in each category (left axis, 0% to 45%); the line graph shows the cumulative % invested (right axis, 0% to 100%).]
Side-by-Side Chart Example
• Sales by quarter for three sales territories:

         1st Qtr  2nd Qtr  3rd Qtr  4th Qtr
East       20.4     27.4     59       20.4
West       30.6     38.6     34.6     31.6
North      45.9     46.9     45       43.9

[Figure: side-by-side bar chart of East, West, and North sales for each quarter; vertical axis from 0 to 60]

What is the interpretation?
DATA PRESENTATION
PIE CHART
1. Shows the breakdown of a total quantity into categories
2. Useful for showing relative differences
3. Angle size = (360°)(percent)

NO.  MAJOR        COUNT
1    Accounting   130
2    Economics     20
3    Management    50
     Total        200

[Figure: pie chart of Majors: Acct. 65%, Mgmt. 25%, Econ. 10% (36°)]
Example for Economics: (360°)(10%) = 36°
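The angle rule (360°)(percent) can be checked with a few lines of Python. This is only a minimal sketch using the counts from the majors table; the function name pie_angles is made up for this illustration.

```python
# Counts taken from the summary table of majors.
counts = {"Accounting": 130, "Economics": 20, "Management": 50}

def pie_angles(counts):
    """Return {category: (percent, angle_in_degrees)} for a pie chart."""
    total = sum(counts.values())
    result = {}
    for category, count in counts.items():
        share = count / total              # relative frequency between 0 and 1
        result[category] = (share * 100, share * 360)
    return result

angles = pie_angles(counts)
for category, (percent, angle) in angles.items():
    print(f"{category:<11} {percent:5.1f}%  {angle:6.1f} degrees")
```

The three angles add up to 360°, which is a quick sanity check on any pie chart.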
DATA PRESENTATION
DOT CHART

NO.  MAJOR        COUNT
1    Accounting   130
2    Economics     20
3    Management    50
     Total        200

[Figure: dot chart of Major (Acct., Econ., Mgmt.) against Frequency; horizontal axis from 0 to 150]

• Like a horizontal bar chart
• Horizontal lines for categorical variables
• Line length shows frequency or %
• Equal spacing between lines
• The frequency axis starts at the zero point
• Percent may also be used instead of frequency

THINKING CHALLENGE

You want to show the market shares held by Windows program manufacturers in 1992. Construct a bar chart, pie chart, and dot chart to describe the data.

Mfg.          Mkt. Share (%)
Lotus         15
Microsoft     60
WordPerfect   10
Others        15
BAR CHART SOLUTION*
[Figure: horizontal bar chart of Market Share (%) by Mfg. (Lotus, Microsoft, Wordperf., Others); horizontal axis from 0% to 60%]
PIE CHART SOLUTION*
[Figure: pie chart of Market Share: Microsoft 60%, Lotus 15%, Others 15%, Wordperf. 10%]
DOT CHART SOLUTION*
[Figure: dot chart of Market Share (%) by Mfg. (Lotus, Microsoft, Wordperf., Others); horizontal axis from 0% to 60%]
Line Charts and Scatter Diagrams

• Line charts show values of one variable vs. time
  – Time is traditionally shown on the horizontal axis

• Scatter diagrams show points for bivariate data
  – One variable is measured on the vertical axis and the other variable is measured on the horizontal axis
Line Chart Example

Year   Inflation Rate
1985   3.56
1986   1.86
1987   3.65
1988   4.14
1989   4.82
1990   5.40
1991   4.21
1992   3.01
1993   2.99
1994   2.56
1995   2.83
1996   2.95
1997   2.29
1998   1.56
1999   2.21
2000   3.36
2001   2.85
2002   1.58

[Figure: line chart "U.S. Inflation Rate"; horizontal axis Year (1984 to 2002), vertical axis Inflation Rate (%) from 0 to 6]
Scatter Diagram Example

Volume per day   Cost per day
23               125
26               140
29               146
33               160
38               167
42               170
50               188
55               195
60               200

[Figure: scatter diagram "Production Volume vs. Cost per Day"; horizontal axis Volume per Day (0 to 70), vertical axis Cost per Day (0 to 250)]
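Types of Relationships
• Linear relationships (simple linear regression)
[Figure: two scatter plots of Y against X showing linear patterns]

Types of Relationships (continued)
• Curvilinear relationships
[Figure: two scatter plots of Y against X showing curved patterns]

Types of Relationships (continued)
• No relationship
[Figure: two scatter plots of Y against X showing no pattern]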
DATA PRESENTATION
STEM-AND-LEAF DISPLAY
1. Divide each observation into a stem value and a leaf value
   – Stem value defines the class
   – Leaf value defines the frequency (count)
   – Example: the observation 26 has stem 2 and leaf 6

2. Data: 21, 24, 24, 26, 27, 27, 30, 32, 38, 41

   Stem   Leaves
   2      1 4 4 6 7 7
   3      0 2 8
   4      1
Example:
Data in ordered array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58

• Here, use the 10's digit for the stem unit:

                     Stem   Leaf
• 12 is shown as     1      2
• 35 is shown as     3      5
Example:
Data in ordered array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58

• Completed stem-and-leaf diagram:

Stem   Leaves
1      2 3 7
2      1 4 4 6 7 7
3      0 2 5 7 8
4      1 3 4 6
5      3 8
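The construction can be reproduced programmatically. The sketch below assumes whole-number data and uses the 10's digit as the stem, as in the example; the function name stem_and_leaf is chosen only for this illustration.

```python
from collections import defaultdict

def stem_and_leaf(values, stem_unit=10):
    """Split each value into (stem, leaf) and group the leaves by stem."""
    groups = defaultdict(list)
    for value in sorted(values):
        stem, leaf = divmod(value, stem_unit)   # e.g. 35 -> stem 3, leaf 5
        groups[stem].append(leaf)
    return dict(groups)

data = [12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38,
        41, 43, 44, 46, 53, 58]

display = stem_and_leaf(data)
for stem in sorted(display):
    leaves = " ".join(str(leaf) for leaf in display[stem])
    print(f"{stem} | {leaves}")
```

The number of leaves on each stem is the class frequency, so the display doubles as a rough histogram turned on its side.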
DATA PRESENTATION
FREQUENCY DISTRIBUTIONS
What is a Frequency Distribution?
• A frequency distribution is a list or a table containing the values of a variable and the corresponding frequencies with which each value occurs (or the frequencies with which data fall within each range).
• FREQUENCY DISTRIBUTION: a grouping of data into several categories that shows the number of observations in each category; no observation can be placed in two or more categories.
Why Use Frequency Distributions?
• A frequency distribution is a way to summarize data
• The distribution converts the raw data into a more useful form
• It allows for a quick visual interpretation of the data
FREQUENCY DISTRIBUTIONS
CONSTRUCTION STEPS:
1. Sort the data from smallest to largest (or the reverse)
2. Create the categories or classes
   2.1 Determine the number of categories/classes: 2^k ≥ N
       * Rule of thumb:
         < 25    : 5 - 6 classes
         25 - 50 : 7 - 14 classes
         > 50    : 15 - 20 classes
       * Sturges/Cramer rule (rounded):
         number of categories (k) = 1 + 3.322 log N
   2.2 Determine the category/class interval:
       Class interval = (largest value - smallest value) / number of classes
3. Tabulate: determine the class boundaries and place each observation in the appropriate class.
   (Class boundaries are used so that no observation falls on the edge of a class.)
FREQUENCY DISTRIBUTIONS
CASE STUDY: The following are the prices per share of several healthy companies. Construct the frequency distribution!

NO.  COMPANY                         PRICE PER SHARE
1    MUSTIKA RATU Tbk                550
2    KIMIA FARMA Tbk                 160
3    BANK BUANA NUSANTARA Tbk        650
4    HERO SUPERMARKET Tbk            875
5    BERLIAN LAJU TANGKER Tbk        500
6    HEXINDO ADI PERKASA Tbk         360
7    BANK LIPPO Tbk                  370
8    JAKARTA INT. HOTEL Tbk          450
9    INDOSIAR VISUAL MANDIRI Tbk     525
10   TIMAH Tbk                       700
11   BANK DANPAC Tbk                 500
12   UNITED TRACTOR Tbk              285
13   GREAT RIVER INT. Tbk            550
14   ASURANSI RAMAYANA Tbk           600
15   DANKOS LABORATORIES Tbk         405
16   ULTRA JAYA MILK Tbk             500
17   MATAHARI PUTRA PRIMA Tbk        410
18   LIPPO LAND DEVELOPMENT Tbk      575
19   BANK SWADESI Tbk                300
20   ADES ALFINDO Tbk                550
FREQUENCY DISTRIBUTIONS
STEP 1: Sort the data from smallest to largest (or the reverse)

NO.  COMPANY                         PRICE PER SHARE
2    KIMIA FARMA Tbk                 160   (cheapest share)
12   UNITED TRACTOR Tbk              285
19   BANK SWADESI Tbk                300
6    HEXINDO ADI PERKASA Tbk         360
7    BANK LIPPO Tbk                  370
15   DANKOS LABORATORIES Tbk         405
17   MATAHARI PUTRA PRIMA Tbk        410
8    JAKARTA INT. HOTEL Tbk          450
5    BERLIAN LAJU TANGKER Tbk        500
11   BANK DANPAC Tbk                 500
16   ULTRA JAYA MILK Tbk             500
9    INDOSIAR VISUAL MANDIRI Tbk     525
1    MUSTIKA RATU Tbk                550
13   GREAT RIVER INT. Tbk            550
20   ADES ALFINDO Tbk                550
18   LIPPO LAND DEVELOPMENT Tbk      575
14   ASURANSI RAMAYANA Tbk           600
3    BANK BUANA NUSANTARA Tbk        650
10   TIMAH Tbk                       700
4    HERO SUPERMARKET Tbk            875   (most expensive share)
FREQUENCY DISTRIBUTIONS

STEP 2: Create the categories or classes
- Determine the number of categories/classes (Sturges/Cramer rule):
  number of categories (k) = 1 + 3.322 log N
  k = 1 + 3.322 log 20
  k = 1 + 3.322 × 1.301
  k = 1 + 4.322
  k = 5.322, rounded to k = 5
  CHECK: 2^k ≥ N, that is 2^5 ≥ 20, that is 32 ≥ 20, so k = 5

- Determine the category/class interval:
  Class interval = (largest value - smallest value) / number of classes
                 = (875 - 160) / 5 = 143

So the number of classes is 5, each with a class interval of 143.
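The arithmetic of Step 2 can be verified with a short Python sketch. The function names are invented for this illustration, and rounding k to the nearest integer is an assumption that matches the rounding used in the worked example.

```python
import math

def sturges_classes(n):
    """Number of classes k = 1 + 3.322 * log10(n), rounded to the nearest integer."""
    return round(1 + 3.322 * math.log10(n))

def class_width(smallest, largest, k):
    """Class interval = (largest value - smallest value) / number of classes."""
    return (largest - smallest) / k

n = 20                                  # number of share prices in the case study
k = sturges_classes(n)
width = class_width(160, 875, k)

print("k =", k)                         # number of classes
print("2^k >= N:", 2 ** k >= n)         # the check used on the slide
print("class interval =", width)
```

For other data sets the quotient is usually not a whole number; rounding the interval up is a common choice so that the classes still cover the largest value.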
Frequency distribution table:

Class   Interval (class limits)   Frequency (fi)   Cumulative freq.   Relative frequency
1       … - …
2       … - …
3       … - …
4       … - …
5       … - …                                      (N)
TOTAL                                                                 1

Notes:
- Interval: lower class limit - upper class limit.
- Frequency: the number of observations in the class. Before being written as a number, it is usually recorded as a tally first.
- Cumulative frequency: the class frequency accumulated with those of the preceding classes; the cumulative frequency of the last class = N (the number of observations).
- Relative frequency = frequency / number of observations.
FREQUENCY DISTRIBUTIONS

Interval for each class (class boundaries not yet taken into account):

CLASS NO.   INTERVAL     NOTE
1           160 - 303    160 + 143 = 303
2           304 - 447    304 + 143 = 447
3           448 - 591    448 + 143 = 591
4           592 - 735    592 + 143 = 735
5           736 - 879    736 + 143 = 879

Data min = 160
Data max = 875

CLASS NO.   INTERVAL         NOTE
1           159.5 - 303.5    Lower boundary of class 1: (159 + 160) / 2 = 159.5
2           303.5 - 447.5    Upper boundary of class 1: (303 + 304) / 2 = 303.5
3           447.5 - 591.5    and so on
4           591.5 - 735.5
5           735.5 - 879.5
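The two tables above follow a simple pattern that can be sketched in Python. The helper names are hypothetical, and the half-unit shift assumes the data are whole numbers, as the share prices are.

```python
def class_limits(start, width, k):
    """Return k (lower, upper) limit pairs; each class starts one unit after the previous upper limit."""
    limits = []
    lower = start
    for _ in range(k):
        upper = lower + width
        limits.append((lower, upper))
        lower = upper + 1
    return limits

def class_boundaries(limits):
    """Shift each limit by half a unit so that whole-number data never lands on a boundary."""
    return [(lower - 0.5, upper + 0.5) for lower, upper in limits]

limits = class_limits(160, 143, 5)
boundaries = class_boundaries(limits)

for number, (limit, boundary) in enumerate(zip(limits, boundaries), start=1):
    print(number, limit, boundary)
```

Each upper boundary equals the next class's lower boundary, which is why the bars of the resulting histogram touch.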

FREQUENCY DISTRIBUTIONS
STEP 3: Tabulate

Sorted share prices (company names as in Step 1):
160, 285, 300, 360, 370, 405, 410, 450, 500, 500, 500, 525, 550, 550, 550, 575, 600, 650, 700, 875

The frequency is first recorded as a tally and then written as a number.

CLASS NO.   INTERVAL         TALLY         FREQUENCY
1           159.5 - 303.5    III           3
2           303.5 - 447.5    IIII          4
3           447.5 - 591.5    IIIII IIII    9
4           591.5 - 735.5    III           3
5           735.5 - 879.5    I             1
Total                                      20
FREQUENCY DISTRIBUTIONS

CLASS NO.   INTERVAL         FREQUENCY   RELATIVE FREQUENCY
1           159.5 - 303.5    3           15.00%
2           303.5 - 447.5    4           20.00%
3           447.5 - 591.5    9           45.00%
4           591.5 - 735.5    3           15.00%
5           735.5 - 879.5    1           5.00%
Total                        20          100.00%
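The tabulation can be double-checked by counting the twenty case-study prices against the class boundaries. The sketch below is one possible way to do it; the function name tabulate is invented here. Note that the price 300 lies below the boundary 303.5, so it is tallied in class 1.

```python
from itertools import accumulate

# Share prices from the case study (20 companies).
prices = [550, 160, 650, 875, 500, 360, 370, 450, 525, 700,
          500, 285, 550, 600, 405, 500, 410, 575, 300, 550]

# Class boundaries for five classes with a class interval of 143.
edges = [159.5, 303.5, 447.5, 591.5, 735.5, 879.5]

def tabulate(values, edges):
    """Count how many values fall strictly between each pair of boundaries."""
    freqs = [0] * (len(edges) - 1)
    for value in values:
        for i in range(len(freqs)):
            if edges[i] < value < edges[i + 1]:
                freqs[i] += 1
                break
    return freqs

freqs = tabulate(prices, edges)
cumulative = list(accumulate(freqs))            # running total; last entry equals N
relative = [f / len(prices) for f in freqs]     # frequency / number of observations

for i, (f, c, r) in enumerate(zip(freqs, cumulative, relative), start=1):
    print(f"{i}  {edges[i - 1]} - {edges[i]}  {f:>2}  {c:>2}  {r:.2%}")
```

Because the boundaries end in .5 and the prices are whole numbers, the strict comparisons never leave a value unassigned.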
HISTOGRAM
Draw the histogram by hand or with software (e.g. MS Excel, Minitab, etc.).
MS Excel: Insert - Chart - Column

[Figure: histogram with one bar per class (1: 159.5 - 303.5, 2: 303.5 - 447.5, 3: 447.5 - 591.5, 4: 591.5 - 735.5, 5: 735.5 - 879.5); the bars touch]
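Where no charting software is at hand, a rough text histogram is enough to see the shape of the distribution. The following sketch is an illustration rather than the method used on the slides: it recounts the case-study prices per class and prints one row of symbols per class. The function name text_histogram is hypothetical.

```python
# Share prices from the case study and the class boundaries (interval 143).
prices = [550, 160, 650, 875, 500, 360, 370, 450, 525, 700,
          500, 285, 550, 600, 405, 500, 410, 575, 300, 550]
edges = [159.5, 303.5, 447.5, 591.5, 735.5, 879.5]

def text_histogram(values, edges, symbol="#"):
    """Return one text row per class; the row length is the class frequency."""
    rows = []
    for low, high in zip(edges, edges[1:]):
        count = sum(1 for v in values if low < v < high)
        rows.append(f"{low:>5} - {high:<5} | {symbol * count} ({count})")
    return rows

rows = text_histogram(prices, edges)
print("\n".join(rows))
```

The longest row belongs to the class 447.5 - 591.5, so most share prices cluster in the middle of the range.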
Distorting the Truth with Descriptive Techniques
'Chart Junk'
Bad presentation: [Figure: Minimum Wage, labelled 1960: $1.00, 1970: $1.60, 1980: $3.10, 1990: $3.80]
Good presentation: [Figure: Minimum Wage ($), 1960 to 1990; vertical axis from 0 to 4]
Compressing Vertical Axis
Bad presentation: [Figure: Quarterly Sales ($), Q1 to Q4; vertical axis from 0 to 200]
Good presentation: [Figure: Quarterly Sales ($), Q1 to Q4; vertical axis from 0 to 50]
No Zero Point on Vertical Axis
Bad presentation: [Figure: Monthly Sales ($), J M M J S N; vertical axis from 36 to 45]
Good presentation: [Figure: Monthly Sales ($), J M M J S N; vertical axis from 0 to 60]
No Relative Basis
Bad presentation: [Figure: A's by Class (FR, SO, JR, SR) as frequencies; vertical axis from 0 to 300]
Good presentation: [Figure: A's by Class (FR, SO, JR, SR) as percentages; vertical axis from 0% to 30%]
Errors in Presenting Data
1. Using 'chart junk'
2. No relative basis in comparing data batches
3. Compressing the vertical axis
4. No zero point on the vertical axis
~ DISCUSSION TIME ~
A student observed the performance of a machine operator carrying out the setup each time a job change occurred.
The following are the raw data (in seconds) obtained by the student.

125   117
95    103
80    80
83    97
98    85
90    105
140   110
119   136
125   97
117   116
88    107
86    79
91    122
130   65
74    98

Questions:
1. Construct a frequency distribution table and a histogram from the raw data above. Analyze and interpret them!
   NOTE: add your own assumptions where necessary.
2. Grouped data presented with a small class interval and a large number of classes have advantages and disadvantages. Explain the advantages and the disadvantages!
