Download as pdf or txt
Download as pdf or txt
You are on page 1of 24

Introduction to statistics

statistics → a set of mathematical procedure for

summarizing and
organizing ,
,
interpreting

purpose -
organize and summarize

\ answering general question

sampling → sample
population → →
generalization

t i t i
individuals set of result from
all
sample is
representative sample
of interest selected
individuals generalize
from population into

population

value described population
parameter
statistics → value described sample
Frequency distributions

- an organized tabulations of the number of individuals

located in each category

how the individual scores are


-
a
picture of

distributed
order
unorganized score into

/ lowest )
function ( highest →

\ individuals scores into


having similar

same group

distribution
:

Table of frequency
data
b-YK

| :| /
datGf
20
to " fx =
f. ✗

45
g 5 f = how many

56 measures are
8 7
resulted
21 within the specific
7 3
score
.

Ef = N
O O
5
total selvrvh
N =
4 4
frekuensi

E 20 158
proportion and percentage

relate with the total


proportion =
show frequency

P =
F-
N

distribution could shown


Percentage = as

percentage

% P 100 %
¥
= =
100
.

÷ff÷÷
t)
✗ F F ✗ P %

.
10

45 0.25 25
9 5

35
8 7 56 0-35

15

:
21
7-

10
µ .
a .

.
°
5

4 0.05
5
1
4
hrs 100
158 hrs 1
20
Class Intervals

table of grouped scores

rules :

① consist of I 10 Class intervals

sebisa 9-11
mvngkin

② width of
the intervals should be simple

( 2 ,
5 ,
10,20 )

③ starting score for


each Cl is the multiplication

the width
of .

④ All intervals have the same width , cover the range

of possible scores ,
with no gap / overlap
example :

1- 215
range : 215.5 -

0.5 =
215

t i -

0.5
215-5
pake width yang 20
agar

215
= 10
To
. .
- .

I
11 class intervals

0 -

1g

20
39
depannyahanusbisa ÷
-

20

40 -

5g

60 -
79

80 -
gg

100 -

119

120 -

139

140 - 159

160 -
179

180 -199

200 -

219
real limits and limits
apparent

real limits → upper real limit


\
lower real limit

example : a score
of 165

upper real limit : 165.5

lower real limit =


164.5

also applies to class intervals

Cl : 40-44 → apparent limits

✗ = 40 ✗ = 94

40 → lower apparent limit

39.5 → lower real limit

44 limit
→ upper apparent

44.5 → upper real limit


Frequency distributions graphs

a) Interval and ratio

( limits ) → space
between 2
Histogram real no
-

consecutive bars
-

polygon ( midpoint )

b) nominal and ordinal

-
bar graph → equal space between every 2

pie chart consecutive bars


and
percentile rank
percentile

percentile =
score

percentile rank =
percentage
↳ rank / position

a. cumulative frequency

-
number of person who scored similarly or below

a specific score

addition of the frequency from the bvttop -

top group

b. cumulative percentage

Cp CF
=
-
.
100%
N

percentage of CF
Example

5 "÷

4 95 %

3 70 %

2 30 %

I 2
to %
Interpolation

the intermediate
method to
find value ,
which is

a score that is located between 2


specific score

a. Interpolation in un grouped data

:| i. {
::
Percentage
"
"
°

what is the T
23
g2%
percentile rank

for ✗ 7
60%
=

8 4
7.5 = 32%
-7 11
44%
⑦ -

5 20 %
6 4
/
G.g
5 1 I 4 %

tau width 7.5-6.5=1


① hrs →

② 7.5 -7 44 -
X
=
-
-

7. 5- 6.5 -20
44
t
width
✗ = 32%
b. Interpolation in grouped data

|I
""|"
✗ f- (f CP

" 2

15-19 3 "
90%

a. µ 3 15 %
9-5
5- g- " ° 60%
12
-4
? -50%
o g
%
-

2 ,,
,

what is the 50th percentile



score

9. 5 X
60--50
-

-
=

9.5 -
4.5 60 -10

✗ =
8.5
Central Tendency

single score
that defines the center
of a distribution

purpose -

describe population / sample


\
compare groups

?d%I
3 values

µ mode

nominal :
categories of things

ordinal low income medium income ,


high income
; ,

tapi tidak tahu high income nosing I


Orang berapa

Interval Koma not absolute /


:
bison ,
0

0 doesn't have meaning

ratio : 0 absolute ( o have meaning )


⑨ mean

arithmetic average

notation :

sample mean → M or I [ statistics )


population mean →
µ ( parameter )

used for interval or ratio data

rumors :
sumo
Sum Of f

score f fx

'

9 18
2

8
4 32

7
o o

6 I 6

I =

÷ = 8.25
weighted mean

a. non weight = do not consider the distribution

understand each score in details


b. weight =

=% µ
B combined
class A class

N= 102
N = 52
N = 50

{✗ = 50.75
= 3750 =
4056

M -75 M
78-06
-

M=zg
=

102

=
76.52
?⃝
④ median

a score that divides a


frequency

distribution into two equal part

median = Pso ( 50th percentile )


Used ordinal data
for

-
Data with odd N

median 10,11 12 13
=
7,8 g . i
, ,

-
Data with even N

median 13 14
=
7,8 , 9 ,
10 , 11 ,
12 ,
.

10¥ = 10.5
① Mode

that most often shows in the distribution


a score

( highest frequency )

the only measurement


can be apply to
any scale ,

data
applicable to nominal

types of mode

a. normal ( 1 mode )

b. bimodal ( 2 modes )

C. multimodal ( 72 modes )

modes similar frequency )


D. rectangular ( no =

2.3.415 age
discrete variable → legit I /
,

bvkah
variable → ada Koma ,
weight .

continuous
distribution shape

① symmetrical distributions

each
where the left and right side are mirroring
other

normal
c. rectangular
a.

# ÷! ← median
mean

bell -
shape mode
mean

b. bimodal

mode mode
f
2 mode
mean

median
② skewed distributions

An distribution where the


unsymmetrical ,

located in different positions


central tendencies

positively skewed

( mode < median < mean )

the right determined as


tail on so

most people receive


positively skewed →
lower scores

in
-
negatively skewed

( mode > median > mean )

the left determined as


tail on so

negatively skewed → most people receive

higher scores

a
Variability

better distributions the data


you can see the of

distribution out
which scores in the are spread

or
clustered together

all scores the same → no variability


small differences → variability small

Large differences → variability large


④ range

real limit of the largest score -

upper

the lower real limit of the smallest score

example :

3- 7- , 12 , 8,5 ,
I 0

✗ max = 12.5

2.5
✗ min =

= 12.5 -
2.5 = 10
range
⑤ standard deviation

-
only use if there is the mean

distance from each score of the mean


-

average

SD =
Fiance
-
2
V = E. ( X -
m ) =
{ / ✗ -

E)
T T

1) Is 1 ,
2 ,
2 ,
3 , Y

m = 14 2
=

g-

2+(1-2) -112-212+(2-2)<-1
' '
V :( 1- 2) + ( I -
2)

(3-25-114-25)
= 1.142

SD = 1. 069
Normal distributions

distributed in a bell-shaped manner

normal distribution area is identified with 2- score

7- score -
unit to explain normal distribution area

\
measures position

M = mean


2- =

0 = SD

the to the right / left → small


further
frequency

highest frequency → tengah


.

2- score distribution

body = bigger
tail = smaller

D= between
mean and 2-
2- score

proportion
=
nilai body , tail ,
between

probability =
proportion ✗ 100%
= di %

score = ✗

You might also like