Professional Documents
Culture Documents
LESSON 4 Data Management Schedule 2
LESSON 4 Data Management Schedule 2
LESSON 4 Data Management Schedule 2
D ATA
MANAGE ME NT
SCHEDULE-
S TAT I S T I C S
MODULE 4
J O N D E L S . I H A L A S
P A RT- T I M E I N S T R U C TO R
1.Uses a variety of statistical tools to process and manage
numerical data.
2.Use the methods of linear regression and correlations to
predict the value of a variable given certain conditions.
3.Advocate the use of statistical data in making important
decisions
45 55 55 55
65 65 65 65
65 65 65 75
75 75 85
STEM LEAF 6
4 5 5
5 5 5 5 4
6 5 5 5 5 5 5
3
5
7 5 5 5 2
8 5 1
35 45 55 65 75 85 95
From the stem-and-leaf plot or histogram, we can observe that the
mean, median and mode are the same
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
7
STEM LEAF 5
4 5
4
5 5 5 5
3
6 5 5 5 5 5 5
5 2
7 5 5 5
1
8 5
35 45 55 65 75 85 95
7
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
PROPERTIES OF THE NORMAL
PROBABILITY DISTRIBUTION
A standard normal
curve is a probability
distribution that has a
mean 𝜇 = 0 and a
standard deviation 𝜎 = 1.
11
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
THE Z-SCORE
The areas under the normal curve are given in terms of 𝑧-
values or scores.
12
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
13
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
14
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 1: GIVEN THE MEAN 𝜇 = 50 AND THE
STANDARD DEVIATION, 𝜎 = 4 OF A POPULATION
OF READING SCORES. FIND THE 𝑧-VALUE THAT
CORRESPONDS TO A SCORE X = 58.
Solution:
38 42 46 50 54 58 62
16
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 3: FIND THE AREA UNDER THE
STANDARD NORMAL CURVE OF THE FOLLOWING
𝑧-SCORES. (USE THE TABLE OF AREAS UNDER
THE NORMAL CURVE)
17
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
18
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 3: FIND THE AREA UNDER THE
STANDARD NORMAL CURVE OF THE FOLLOWING
𝑧-SCORES. (USE THE TABLE OF AREAS UNDER
THE NORMAL CURVE)
20
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
21
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 3: FIND THE AREA UNDER THE
STANDARD NORMAL CURVE BETWEEN 𝑧 = 0 AND
THE FOLLOWING 𝑧-SCORES. (USE THE TABLE OF
AREAS UNDER THE NORMAL CURVE)
22
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 4: FIND THE AREA UNDER THE
STANDARD NORMAL CURVE BOUNDED BY THE
FOLLOWING PAIRS OF 𝑧-SCORES. (USE THE
TABLE OF AREAS UNDER THE NORMAL
CURVE)
23
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
24
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
25
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 4: FIND THE AREA UNDER THE
STANDARD NORMAL CURVE BOUNDED BY THE
FOLLOWING PAIRS OF 𝑧-SCORES. (USE THE
TABLE OF AREAS UNDER THE NORMAL
CURVE)
26
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 5: FIND THE AREA OR PROPORTION
(PROBABILITY) INDICATED BY EACH ITEM.
(USE THE TABLE OF AREAS UNDER THE
NORMAL CURVE)
27
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
28
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 5: FIND THE AREA OR PROPORTION
(PROBABILITY) INDICATED BY EACH ITEM.
(USE THE TABLE OF AREAS UNDER THE
NORMAL CURVE)
P (𝑧 > -1)
=1-0.1587
P (𝑧 > -1)
=0.8413 OR 84.13%
29
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
Example 6. A coffee bending machine is set to
dispense amounts of coffee per cup that follows a
normal distribution with a mean of 200 ml and a
standard deviation of 10ml. Let the random variable
x be the amount (in ml) per cup
Given:
n = 100
μ = P73M = 73
X = P80M = 80
σ = P3.25M =
3.25
73M 80M
36
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
EXAMPLE 8: DG company has 100 branches nationwide. The annual
profit of DG company is normally distributed with a mean of Php 73 million
a year with a standard deviation of Php 3.25 million. How many branches
have a profit of Php 73 million to Php 80 million?
Number of branches
= (Area between 𝑧 = 0 & 𝑧 =
2.15)(n)
= (0.4842)(100)
= 48.42
≈ 48 branches
0.4842
Therefore, 48 branches have a
73M 80M profit of Php 73 M to Php 80 M.
37
Quartiles
a quartile is a type of quantile which divides the
number of data points into four parts, or quarters, of
more-or-less equal size. The data must be ordered
from smallest to largest to compute quartiles; as such,
quartiles are a form of order statistic.
14)45 15)45 16)48 17)48 18)49 19)50 20)50 21)53 22)56 23)58 24)62 25)73
Step 2: Identify the median
1)26 2)32 3)36 4)36 5)37 6)39 7)39 8)40 9)40 10)41 11)42 12)42 13)43
14)45 15)45 16)48 17)48 18)49 19)50 20)50 21)53 22)56 23)58 24)62 25)73
where:
𝑛 = sample size; 𝑋 = value of the independent variable; 𝑌 = value of the dependent
variable
• Large sample size requires a tedious manual process. Thus, statistical software like SPSS may be used.
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
LESSON 4 DATA MANAGEMENT SCHEDULE-STATISTICS
Example 1: A large industrial plant has seven divisions that do the same type of work. For their
safety report, a safety inspector visits each division of 20 workers quarterly.
a. Identify the independent and dependent variables;
b. test if there is a relationship between the variables.
c. Determine the equation that best fits the line
Number of work-hours lost
Number of work-hours devoted
Division due to industry-related
to safety training
accidents
1 10.0 80
2 19.5 65
3 30.0 68
4 45.0 55
5 50.0 35
6 65.0 10
7 80.0 12
Division X Y X2 Y2 XY
1 10.0 80 100 6400 800
2 19.5 65 380.25 4225 1267.5
3 30.0 68 900 4624 2040
4 45.0 55 2025 3025 2475
5 50.0 35 2500 1225 1750
6 65.0 10 4225 100 650
7 80.0 12 6400 144 960
69597.5−97337.5
=
115711.75− 89700.25 138201− 105625
≈ −0.9530
Physics (x) 3 9 10 12 7
Statistics (y) 5 8 10 9 8
X Y X2 Y2 XY
3 5 9 25 15
9 8 81 64 72
10 10 100 100 100
12 9 144 81 108
7 8 49 64 56
1755−1640
=
1915− 1681 1670−1600
≈ 0.8985
≈ 0.90
The computed r is 0.8985 or 0.90. The result show that there is a very
high or very strong positive association between the scores in Physics and Statistics.
This means that if a students has a high score in one subject it is expected that the
student also has a high score in the other subject. In addition, a low score in one subject
means a low score in the other subject.
The association between Physics (x) and Statistics (y) can be modelled
by the regression line 𝑌 = 0.491𝑋 + 3.97.
REFERENCES:
Priscilla S. Altares, A. R. (2012). Elementary Statistics with Computer
Application. Quezon City: Rex Printing Company, Inc.
https://www.questionpro.com/blog/interval-scale/
http://lsc.cornell.edu/wp-content/uploads/2016/01/Intro-to-
measurement-and-statistics.pdf