Download as pdf or txt
Download as pdf or txt
You are on page 1of 52

11

STATISTICS AND
PROBABILITY
Learning Activity Sheets
Quarter 3: Week 1-5

Division of Angeles City


Name of learner: ___________________________ Section: ___________________

WEEK 1: RANDOM VARIABLES AND PROBABILITY DISTRIBUTION


Background Information
In the study of basic probability, you have learned that an experiment is any activity
which can be done repeatedly under similar condition. The set of all possible outcomes of an
experiment is called the sample space. You have also learned how to mathematically list the
possible outcome of a given experiment. In tossing a coin, for instance, the possible outcomes
are turning up a head or a tail.
For you to begin let us all understand that probability distributions can be illustrated or
classified as discrete probability distributions or as continuous probability distributions,
depending on whether they define probabilities associated with discrete variables and
continuous variables.
A variable X whose value depends on the outcome of a random process is called a
RANDOM VARIABLE. A random variable is a variable whose value is a numerical outcome
of a random phenomenon. A random variable is denoted with a capital letter. The probability
distribution of a random variable X tells what the possible values of X are and how probabilities
are assigned to those values.
A random variable can be discrete or continuous.

A Discrete variable A Continuous variable


Is a numerical value associated with Has infinite numerical values
the desired outcomes. It is also either associated with any interval
a finite or infinite number of values on the number line system
but countable such as whole numbers without any gaps or breaks.
0,1,2,3.

For example: For example:


- number of Siblings - Height
- admissions in a school - Weight
- the number of teachers - Skull Circumference

A Random Variable is a numerical quantity that is derived from the outcomes of random
experiments.
The random variable in the activity above is a discrete random variable because the
set of possible outcomes is countable. The possible values of random variable X = {0, 1, 2, 3}

Example 1
Suppose we toss a fair coin 3 times. The sample space for this chance process is :
{ HHH HHT HTH THH HTT THT TTH TTT}

Since there are 8 possible outcomes then probability of each outcome is 1/8.

2
Define the variable X = the number of heads obtained.
What are the outcomes of possible heads?

X=0 TTT
X=1 HTT , THT , TTH
X=2 HHT , HTH , THH
X=3 HHH

PROBABILITY DISTRIBUTION
Probability distribution of a random variable tells all of its possible values along with
their associated probabilities.
Example 1: The random variable X is the number of tails in two tosses of a coin. What are
the probabilities of the possible outcomes? What is the probability distribution of X?

Solution:
Possible outcomes: {HH HT TH TT}
Each has probability ¼ so the probability distribution is:
X=No. of Tails 0 1 2

P(x ) 1 2 1
4 4 4

In Graphical Form:
We can use the probability distribution to answer questions about variable x. In symbols, we
want to find P ( x ≥1). We could add probabilities to find the answer:

P(x≥1) = P(x=1) + P(x=2)


1 2 3
= + =
4 4 4

P(x≥1) =1 – P(x < 1) =


1 3
1– =
4 4

3
FINDING POSSIBLE VALUES OF A RANDOM VARIABLE

To understand how one finds the possible values of random variables, do the activity
below...
Suppose two cellphones are tested at random. We want to find out the number of
defective cellphones. Thus, to each outcome in the sample space we shall assign a value.
These are 0, 1, or 2. If there is no defective cellphone, we assign the number 0; if there is 1
defective cellphone, we assign the number 1; and 2, if there are two defective cellphones. The
number of defective cellphones is a random variable.

The possible values of this random variable are 0, 1, and 2. Read and analyze the
given situation below.

Illustration
Let D represent the defective cellphones and N for the non-defective cellphones. If we let Y
be the random variable representing the number of defective cellphones, show the values of
the random variable Y. The table below to show the values of the random variable.

Possible Outcomes Value of the Random Variable


Y (number of defective
cellphone)
DD 2
NN 0
DN 1
ND 1

Example 2. Suppose two coins are tossed, let Z be the random variable representing the
number of heads that occur. Find the values of the random variable Z.

1. Determine the sample space. Let H represent head and T for tail

Solution: The sample space for this experiment is: S = {TT, TH, HH, HT}

2. Count the number of heads in each outcome in the sample space and assign
this number to this outcome.
Possible Outcomes Value of the Random
variable Z (Number of
heads)
TT 0

TH 1

HH 2

HT 1

So, the possible values of the random variable Z = {0, 1, 2 }

4
Illustrating a Probability Distribution for a Discrete Random Variable and its Properties

A random variable is a set whose elements are the numbers assigned to the outcomes
of an experiment. It is usually denoted by uppercase letters such as X, whose elements are
denoted by lower case letters x1, x2, x3 and so on.

Probability distribution of a discrete random variable specifies the probability of each


possible value of the random variable. The distribution functions of discrete random variables
are concentrated as a mass for a particular value, and generally known as Probability Mass
Function.

Properties of discrete probability distribution

1. The probability of each value of the random variable must be between or equal to
0 and 1. In symbol 0 < P(X) < 1.
2. The sum of all the probabilities of all values of the random variable must be equal
to 1. In symbol, we write it as ΣP(X) = 1

Probability Histogram

We can represent a probability distribution graphically by constructing a type of bar graph


called a probability histogram. This is constructed by displaying the possible distinct values of
the random variable along a horizontal axis. Above each value x of the random variable, we
draw vertical bar having height equal to the probability P(x).

Example 1 : Number of Green Balls


Write the probability distribution of a random variable Y representing the number of
green balls when 2 balls are drawn in succession without replacement from a jar containing
4 red and 5 green balls. Construct a histogram for this probability distribution.

STEPS:

1 Determine the sample space. Let R represent the red ball and G represent the
green ball.

Solution : The sample space for this experiment is: S= { RR, RG, GR, GG}
2.Count the number of green balls in each outcome in the sample space and assign this
number to this outcome.
Possible Value of the Random
Outcomes Variable Y (No. Of
Green Balls)

RR 0

RG 1

GR 1

GG 2

5
3. Assign probability values P(Y) to each value of the random variable.

 There are 4 possible outcomes, if no green ball occurs once, the probability that
we shall assign to the random variable 0 is ¼
 If 1 green ball occurs two times, the probability that we shall assign to the
random variable 1 is 2/4 or ½.
 If 2 green balls occur once, the probability that we shall assign to the random
variable 1 is ¼.
Number of Green Probability P(Y)
Balls Y

0 1/4

1 2/4 or ½

2 ¼

Histogram

Construct a histogram for this probability distribution. A histogram is a bar graph. To


construct a histogram for a probability distribution, follow these steps.
Plot the values of the random variable along the horizontal axis.

a) In plotting the probabilities along the vertical axis, you can change fractions to
decimals (ex. ¼=0.25 and ½= 0.5)

0.6

0.5

0.4
P(Y)

0.3

0.2

0.1

0
0 1 2
No. of Green Balls (Y)

Example 2
Let X be a random variable giving the number of girls in a randomly selected three-
child family. Assuming that boys and girls are equally likely , construct the probability
distribution of X and its corresponding histogram.

6
a.
Possible Outcomes Value of the Random Variable X
(No. of Girls)

BBB 0

BBG 1

BGB 1

GBB 1

BGG 2

GBG 2

GGB 2

GGG 3

b. No. of Girls 0 1 2 3

Probability P(X) 1/8 or 0.125 3/8 or 0.375 3/8 or 0.375 1/8 or 0.125

ΣP(x)= 1/8 +3/8 + 3/8 + 1/8 = 8/8 =1


c. Histogram
0.4
0.3
P(X)

0.2
0.1
0
0 1 2 3

No. of Girls (X)

Learning Competencies with code:

 Illustrate a random variable (discrete and continuous) M11/12SP – IIIa-1


 Distinguish between a discrete and a continuous random variable(M11/12SP-IIIa2)
 Find the possible values of a random variable. (M11/12SP-IIIa-3)
 Illustrate a probability distribution for a discrete random variable and its properties.
(M11/12SP-IIIa-4)

7
GENERAL INSTRUCTION: Write your answer on a separate paper
Practice A
Classify whether the given experiment implies a discrete random variable or a
continuous random variable. Write D if discrete and C if continuous.
_____ 1. The temperature of a solution in the laboratory
_____ 2. Collecting data about the heights of students in senior high schools.
_____ 3. Recording the distance travelled by the bus
_____ 4. Surveying about the number of cases due to Covid -19 pandemic in Angeles City
_____ 5. Number of promoted students at the end of school year

Practice B.
Complete the given table by finding the possible values of random variable.
1. In a box are 2 balls - one white and one yellow. Two balls are picked one at a time with
replacement. Let X be the random variable representing the number of white balls.
Find the values of the random variable X.
Possible Outcomes Value of the random variable X
(Number of white balls)

2. Four coins are tossed. Let Y be the random variable representing the number of tails
that occur. Find the values of the random variable Y.

Possible Outcomes Value of Random Variable Y


(number of tails)

8
Practice C.
Determine whether the given values can serve as the values of a probability distribution.
1. P(1) = 0.05, P(2)=1.01, P(3)= 0.2
2. P(1) = 3/20 , P(2)= 7/20 , P(3)= ½
3 P(1) = 0.35 , P(3) =0.25 , P(5) 0.28 , P(7)= 0.12

X 1 5 7 8
3.
P(x) 1/4 1/8 1/4 1/8

X 1 3 5 7
4.
P(x) 0.35 0.25 0.22 0.12

Practice D. Fill in the blanks with the correct answer.


1. It is the list of possible values of a random variable X and the corresponding probabilities of
the values. _______________________________.
2. What is the sum of all the probabilities of all values of the random variable?
______________________________________________________________.
3. What should be the probability of each value of the random variable? Write your answer in
symbol. __________________________________________.
4. How will you determine if the given distribution is a probability distribution?
__________________________________________________________.
5. Enumerate the steps on how to illustrate the probability distribution of a discrete random
variable by completing the statements given below.
Determine first the _____________space in the given experiment. Then, find the possible
values of the _____________________________. Illustrate the probability distribution by
assigning ____________________________ to each value of the random variable.

References
Albert, J. R. G. (2008). Basic Statistics for the Tertiary Level (ed. Roberto Padua,
Welfredo Patungan, Nelia Marquez), published by Rex Bookstore.

Handbook of Statistics 1 (1st and 2nd Edition), Authored by the Faculty of the
Institute of Statistics, UP Los Baños, College Laguna 4031

R Pierce, Rod. (3 Feb 2020). "Random Variables". Math Is Fun. Retrieved 24 May 2020
from http://www.mathsisfun.com/data/random-variables.html
Malate, Jose S. Statistics and Probability for Senior High School. Vicarisg Publications
and Trading, Inc, 2017.

Lim, Yvette F., Nocon, R., Nocon, E., Ruivivar, L. Math for Engaged Learning Statistics
and Probability. Sibs Publishing House, Inc. 2016.

Statistics and Probability 11, Pivot 4A Learners Material (N/A) Pages 4-53

9
ANSWER KEY
Practice A Practice B Practice C Practice D
1. 1. Probability
1 C 1. not a Distribution
2. 1
2 C Possible Value of probability 3. 0 ≤ P(x) ≤ 1
3 C Outcomes the distribution 4. ∑ P(x) = 1 and
4 D Random 2. probability 0 ≤ P(x) ≤ 1
5 D Variable Y distribution 5. Sample
3. probability Random variable
(No. Of
Probability values
white distribution or P(x)
balls) 4. not a
probability
WW 2 distribution
YY 0 5. not a
probability
WY 1 distribution
YW 1
Possible value of the random
X = { 0,1,2}

2.

Possible value of the random


X= { 0,1,2,3,4}

Prepared

JOANAH S. BAUTISTA
Secondary School Teacher I - FGNMHS

10
WEEK 2: COMPUTING PROBABILITY CORRESPONDING TO A GIVEN RANDOM
VARIABLE

Background Information

RANDOM VARIABLES

A Random Variable is a set of possible values from a random experiment.

Example: Tossing a coin: we could get Heads or Tails.

In short:
𝑋 = {0, 1}
Note:
We could choose Heads = 100 and
Tails= 150 or other values if we want! It
is our choice

So:
 We have an experiment such as tossing coin.
 We give values to each event.
 The set of values is a Random Variable.

A Random Variable has a whole set of values, and it could take on any of those values,
randomly.

We use a capital letter, like X or Y, to avoid confusion with the Algebra type of variable.

A Random Variable’s set of values is the Sample Space.


Example: Throw a die once
Random Variable X= “The score shown on the top face.”
X could be 1, 2, 3, 4, 5 or 6
So the sample space is {1, 2, 3, 4, 5, 6}

PROBABILITY
We can show the probability of any one value using this style:
P(X= value) = probability of that value

Example (continued): Throw a die once


X= {1, 2, 3, 4, 5, 6}
In this case they are all equally likely, so the probability of landing on any face when rolling a
die is 1/6
 P(x = 1)= 1/6
 P(x = 2)= 1/6
 P(x = 3)= 1/6
 P(x = 4)= 1/6
 P(x = 5)= 1/6
 P(x = 6)= 1/6

Note that the sum of the probabilities =1, as it should be.

11
Example:
How many heads do you get when we toss three coins?
X= “The number of the Heads” is the Random Variable
In this case, there could be 0 Heads (if all the coins land Tails
up), 1 Head, 2 Heads or 3 Heads.

So the Sample Space = {0, 1, 2, 3}

But this time the outcomes are NOT all equally likely.

The three coins can land in eight possible ways:

X= “Number of Heads”

Looking at the table we see just 1 case of Three


Heads, but 3 cases of Two Heads, 3 cases of
One Head, and 1 case of Zero Heads. So:

 P(x = 3)= 1/8


 P(x = 2)= 3/8
 P(x = 1)= 3/8
 P(x = 0)= 1/8

Example: two dice are tossed.


The random variable is X= “The sum of the scores on the two dice”.

Let’s make a table of all the possible values.

There are 6 x 6= 36 possible outcomes, and the


sample space (which is the sum of the scores on
the two dice) is {2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12}

12
Let’s count how often each value occurs, and work out the probabilities:
 2 occurs just once, so P(X = 2)= 1/36
 3 occurs twice, so P(X = 3)= 2/36 or 1/18
 4 occurs three times, so P(X = 4)= 3/36 or 1/12
 5 occurs four times, so P(X = 5)= 4/36 or 1/9
 6 occurs five times, so P(X = 6)= 5/36
 7 occurs six times, so P(X = 7)= 6/36 or 1/6
 8 occurs five times, so P(X = 8)= 5/36
 9 occurs 4 times, so P(X = 9)= 4/36 or 1/9
 10 occurs thrice, so P(X = 10)= 3/36 or 1/12
 11 occurs twice, so P(X= 11)= 2/36 or 1/18
 12 occurs just once, so P(X = 12)= 1/36

THE PROBABILITY SCALE

Probabilities are given on a scale of 0 to 1, as decimals or as fractions; sometimes probabilities


are expressed as percentages using a scale of 0% to 100%, particularly on weather forecasts.

0 This is the probability of something that is impossible.


1 This is the probability that something is certain.
½ This is the probability of something that is likely to happen as it is not to happen.

THE PROBABILITY OF A SINGLE EVENT

For equally likely events:


𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑤𝑎𝑦𝑠 𝑜𝑏𝑡𝑎𝑖𝑛𝑖𝑛𝑔 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
𝑝(𝑎 𝑝𝑎𝑟𝑡𝑖𝑐𝑢𝑙𝑎𝑟 𝑜𝑢𝑡𝑐𝑜𝑚𝑒) =
𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠

Example
1) A card is taken at random from a full pack of 52 playing cards. What is the probability
that the card taken is:
(a) Red card?
(b) a queen?
(c) a red ace?
(d) the seven of hearts?

Solution
As each card is equally likely to be drawn from the pack there are 52 equally likely outcomes.

(a) There are 26 red cards in the pack, so:


26 1
𝑝 (𝑟𝑒𝑑) = =
52 2
(b) There are 4 queens in the pack, so:
4 1
𝑝 (𝑞𝑢𝑒𝑒𝑛) = =
52 13
(c) There are 2 red aces in the pack, so:
2 1
𝑝 (𝑟𝑒𝑑 𝑎𝑐𝑒) = =
52 26
(d) There is only 1 seven of hearts in the pack, so:
1
𝑝 (𝑠𝑒𝑣𝑒𝑛 𝑜𝑓 ℎ𝑒𝑎𝑟𝑡𝑠) =
52

13
2) A packet of sweets contains 18 red sweets, 12 green sweets and 10 yellow sweets. A
sweet is taken at random from the packet. What is the probability that the sweet is:
(a) Red,
(b) Not green,
(c) Green or yellow?

Solution
The total number of sweets in the packet is 40, so there are 40 equally likely outcomes when
one is taken at random.
(a) There are 18 red sweets, so:
18 9
𝑝(𝑟𝑒𝑑) = =
40 20
(b) There are 28 sweets that are not green in the packet, so:
28 7
𝑝 (𝑛𝑜𝑡 𝑔𝑟𝑒𝑒𝑛) = =
40 10
(c) There are 22 sweets that are green or yellow in the packet, so:
22 11
𝑝 (𝑔𝑟𝑒𝑒𝑛 𝑜𝑟 𝑦𝑒𝑙𝑙𝑜𝑤) = =
40 20

3) You roll a fair dice 120 times. How many times would you expect to obtain:
(a) A6
(b) An even score
(c) A score of less than 5?

Solution:
1
(a) 𝑝 (6) =
6
1
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 6𝑠 = 𝑥 120 = 20
6
3 1
(b) 𝑝(𝑒𝑣𝑒𝑛 𝑛𝑢𝑚𝑏𝑒𝑟) = =
6 2
1
𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑒𝑣𝑒𝑛 𝑠𝑐𝑜𝑟𝑒𝑠 = 𝑥 120 = 60
2
4 2
(c) 𝑝(𝑠𝑐𝑜𝑟𝑒 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 5) = =
6 3
2
𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 5 = 𝑥 120 = 80
3

A RANGE OF VALUES
We could also calculate the probability that a Random Variable takes on a range of values.

Example:
You rolled two dice, what is the probability that the sum of the scores is 5, 6, 7, or 8?
In other words: What is 𝑃(5 ≤ 𝑥 ≤ 8)?
𝑃(5 ≤ 𝑥 ≤ 8)= P(X=5) + P(X=6) + P(X=7) + P(X=8)
= (4 + 5 + 6 + 5)/36
= 20/36
= 5/9

Expected Value (or Mean) of a Discrete Random Variable


For a discrete random variable, the expected value, usually denoted as µ or E(X), is calculated
using:
µ = 𝐸(𝑋) = ∑ 𝑥𝑖 𝑓 (𝑥𝑖 )
The formula means that we multiply each value, x, in the support by its respective probability,
f (x), and then add them all together. It can be seen as an average value but weighted by the
likelihood of the value.

14
Example:
x 0 1 2 3 4
f(x) 1/5 1/5 1/5 1/5 1/5
What is the expected value?

Answer
1 1 1 1 1
µ = 𝐸(𝑋) = ∑ 𝑥𝑓(𝑥) = 0 ( ) + 1 ( ) + 2 ( ) + 3 ( ) + 4 ( )
5 5 5 5 5

For this example, the expected value was equal to a possible value of X. this may not
always be the case. For example, if we flip a fair coin 9 times, how many heads should we
expect? We will explain how to find this later but we should expect 4.5 heads. The expected
value in this case is not a valid number of heads.

VARIANCE OF A DISCRETE RANDOM VARIABLE

The variance of a discrete random variable is given by:


σ² = Var(X) = ∑ 𝑥 2 𝑖 𝑓(𝑥𝑖 ) − 𝐸(𝑥)2 = ∑ 𝑥 2 𝑖 𝑓(𝑥𝑖 ) − µ2
The formula means that first, we add the square of each value times its probability then
subtract the square of the mean. We will use this form of the formula in all of our examples.

STANDARD DEVIATION OF A DISCRETE RANDOM VARIABLE


The standard deviation of a random variable, X, is the square root of the variance.
σ = SD(X) = √𝑉𝑎𝑟(𝑋) = √σ²

Example
x 0 1 2 3 4
f(x) 1/5 1/5 1/5 1/5 1/5

Find the variance and the standard deviation of X.


σ = SD(X) = √𝑉𝑎𝑟(𝑋) = √σ²
1 1 1 1 1
𝑉𝑎𝑟(𝑋) = [02 ( ) + 12 ( ) + 22 ( ) + 32 ( ) + 42 ( )] − 22 = 6 − 4 = 2
5 5 5 5 5
𝑆𝐷(𝑋) = √2 = 1.4142

Learning Competencies with code


 Compute probabilities corresponding to a given random variable. (M11/12SP-IIIa-6)
 Illustrate the mean and variance of a discrete random variable. (M11/12SP-IIIb-2)
 Calculate the mean and the variance of a discrete random variable. (M11/12SP-IIIb-2)

GENERAL INSTRUCTION: Write on your answer on a separate paper


Activity 1 : Solve for the probability of the following
1. A coin is tossed 125 times and head is obtained 64 times. Now, if a coin is tossed at
random, what is the probability of getting a head?
2. A coin is tossed 200 times and tails have obtained 128 times. Now, if a coin is tossed
at random, what is the probability of getting a tail?
3. In 60 throws of a dice, 6 is obtained 24 times. Now, in a random throw of a dice, what
is the probability of getting 6?
4. In 100 throws of a dice, 5 is obtained 42 times. In a random throw of a dice, what is
the probability of getting 5?
5. In 150 throws of a dice, 2 is obtained 76 times. In a random throw of a dice, what is
the probability of getting 2?

15
Activity 2: Practice calculating and interpreting the mean and standard deviation of a
discrete random variable.

1. Organizers of a concert are limiting tickets sales to a maximum of 444 tickets per
customer. Let T be the number of tickets purchased by a random customer. Here is
the probability distribution of T:
T=# of tickets 1 2 3 4
P(T) 0.1 0.3 0.2 0.4
Calculate the expected value of T.

2. Choose the best interpretation of the mean (or expected value) that you found in the
previous question.
a. Most customers will purchase 3 tickets.
b. A group of 10 customers will purchase exactly 29 tickets.
c. If we look at a large number of customers, then each customer, on average, will
have purchased about 2.9 tickets.

3. Calculate the standard deviation of T.


Round your answer to three decimal places.

Reference
https://online.stat.psu.edu/stat500/lesson/3/3.2/3.2.1

Answer Key
Activity 1
1. 64/125
2. 16/25
3. 2/5
4. 21/50
5. 38/75

Activity 2
1. 2.9 tickets
2. C
3. 1.09

Prepared by

APRIL JOY S. MANIALUNG


Teacher II

16
WEEK 3: MEAN AND VARIANCE OF A DISCRETE RANDOM VARIABLE

Background Information
The mean or expected value of a probability distribution tells the value of a random
variable that we expect to get if the experiment is done repeatedly. In other words, if the
experiment or process is repeated long enough, there is a likelihood or chance that the
average of the outcomes will begin to approach the expected value or the mean value.

The standard deviation of a probability distribution indicates how far, on the average,
is an observed value of a random variable X from its mean. The smaller the standard deviation,
the more likely that an observed value of the random variable will be close to its mean.

Use these concepts to solve problems involving the mean and the variance of
probability distribution.

Example 1:

The following data show the probability distribution of the number of computers sold
daily in a computer shop during the past several months.

Number of Computers X Probability P(X)


0 0.10
1 0.20
2 0.45
3 0.15
4 0.05
5 0.05

Questions:

1. What is the probability that on a given day ---


a. Fewer than three computers will be sold?
b. At most, two computers will be sold?
c. At least, four computers will be sold?
d. Exactly three computers will be sold?
Solution:

To find the probabilities---

a. To find the probability that fewer than three computers will be sold, use X = 0, 1, and 2.
𝑃(𝑋 < 3) = 𝑃(0) + 𝑃(1) + 𝑃(2)
= 0.10 + 0.20 + 0.45
𝑃(𝑋 < 3) = 0.75

17
b. To find the probability that, at most, two computers will be sold, use X = 0, 1, and 2.

𝑃(𝑋 ≤ 2) = 𝑃(0) + 𝑃(1) + 𝑃(2)


= 0.10 + 0.20 + 0.45
𝑃(𝑋 ≤ 2) = 0.75

c. To find the probability that, at least, four computers will be sold, use X = 4 and 5.
𝑃(𝑋 ≥ 4) = 𝑃(4) + 𝑃(5)
= 0.05 + 0.05
𝑃(𝑋 ≥ 4) = 0.10

d. To find the probability that exactly three computers will be sold, use X = 3.
𝑃(𝑋 = 3) = 0.15

2. The mean of the probability distribution is computed as follows:


X P(X) X  P(X)
0.10 0.00
0
0.20 0.20
1
0.45 0.90
2
0.15 0.45
3
0.05 0.20
4
0.05 0.25
5

𝜇 = ∑[𝑋  𝑃(𝑋)]

= 0.00 + 0.20 + 0.90 + 0.45 + 0.20 + 0.25

𝜇 = 2.00

The mean of the probability distribution is 2. This implies that in the long run,
we expect that the average number of computers that will be sold on a given day will
be two.

3. The standard deviation is computed as follows:

X P(X) X2 X2  P(X)
0 0.10 0 0.00
1 0.20 1 0.20
2 0.45 4 1.80
3 0.15 9 1.35
4 0.05 16 0.80
5 0.05 25 1.25

18
𝜎 = √[𝑋2 𝑃(𝑋)] − 𝜇2

𝜎 = √[0.00 + 0.020 + 1.80 + 1.35 + 0.80 + 1.25] − 22

= √5.40 − 4

= √1.40

𝜎 = 1.18

The standard deviation of the probability distribution is 1.18. This implies that
on the average, the number of computers sold is 1.18 units from the mean 2.

Example 4:

In a convenient store, the number of tellers (X) busy with customers at 12:00 noon
varies from day to day. Past records indicate that the probability distribution of X is as
follows:

Number of Tellers X Probability P(X)


0 0.029
1 0.049
2 0.078
3 0.155
4 0.212
5 0.262
6 0.215

Questions:

1. What is the probability that exactly four tellers are busy at 12:00 noon?
2. What is the probability that, at least, two tellers are busy at 12:00 noon?
3. What is the probability that, fewer than five tellers are busy at 12:00 noon?
4. What is the probability that at least two, but fewer than five tellers are busy at
12:00 noon?
5. What is the mean of the probability distribution?
6. What is the standard deviation of the probability distributions?

Solution:

1. To find the probability that exactly four tellers are busy, use X = 4.
𝑃(𝑋 = 4) = 0.212

2. To find the probability that at least two tellers are busy, use X = 2, 3, 4, 5, and 6.
𝑃(𝑋 ≥ 2) = 𝑃(2) + 𝑃(3) + 𝑃(4) + 𝑃(5) + 𝑃(6)
= 0.078 + 0.155 + 0.212 + 0.262 + 0.215
𝑃(𝑋 ≥ 2) = 0.922

19
3. To find the probability that fewer than five tellers are busy,
use X = 0, 1, 2, 3, and 4.
𝑃(𝑋 < 5) = 𝑃(0) + 𝑃(1) + 𝑃(2) + 𝑃(3) + 𝑃(4)

= 0.029 + 0.049 + 0.078 + 0.155 + 0.212

𝑃(𝑋 < 5) = 0.523

4. To find the probability that at least two, but fewer than five tellers are busy,
use X = 2, 3, and 4.
𝑃(2 ≤ 𝑋 < 5) = 𝑃(2) + 𝑃(3) + 𝑃(4)
= 0.078 + 0.155 + 0.212
𝑃(2 ≤ 𝑋 < 5) = 0.445

5. The mean of the probability distribution is computed as follows:

X P(X) X  P(X)
0 0.029 0.000
1 0.049 0.049
2 0.078 0.156
3 0.155 0.465
4 0.212 0.848
5 0.262 1.310
6 0.215 1.290

𝜇 = ∑[𝑋  𝑃(𝑋)]
= 0.00 + 0.49 + 0.156 + 0.465 + 0.848 + 1.310 + 1.290
𝜇 = 4.118
The mean of the probability distribution is 4.118. This implies that in the long
run, we expect that the average number of tellers who are busy at 12:00 noon
will be 4.118 or 4.

6. The standard deviation of the probability distribution is computed as follows:

X P(X) X2 X2  P(X)
0 0.029 0 0.000
1 0.049 1 0.049
2 0.078 4 0.312
3 0.155 9 1.395
4 0.212 16 3.392
5 0.262 25 6.550
6 0.215 36 7.740

20
𝜎 = √[𝑋2 𝑃(𝑋)] − 𝜇2

𝜎 = √[0.00 + 0.049 + 0.312 + 1.395 + 3.392 + 6.550 + 7.740] − (4.118)2

= √19.438 − 16.958

= √2.48

𝜎 = 1.57

Normal Probability Distribution

The probability distribution for a continuous random variable X is a bell-shaped curve.


The curve is a function of the random variable X and is sometimes called a probability density
function or probability distribution. The probability values under this curve enable us to make
probability statements about a random variable. Hence, it is usually used as a mathematical
model in inferential statistics.
The graphical form of the probability distribution for a continuous random variable X,
where X is the number of heads occurring, is a smooth bell-shaped curve. The graph is a
pictorial representation of discrete number. If we continue generating data for the random
variable X, we shall observe an emerging pattern.

Normal Probability Distribution is a probability distribution of continuous random


variables. It shows graphical representations of random variables obtained through
measurement like the height and weight of the students, the percentile ranks of the A&E
(Accreditation and Equivalency) Test result of the Alternative Learning System students, or
any data with infinite values.

It is used to describe the characteristics of populations and help us visualize the


inferences we make about the population. It also used to determine the probabilities and
percentile of the continuous random variables in the distribution.

For example, your grades in Mathematics is one of the scores in the distribution, you
can predict the location of that score in the distribution and interpret it with regards to the mean
and standard deviation.

Properties of the Normal Curve

The graphical representation of the normal distribution is popularly known as a normal


curve. The normal curve is described clearly by the following characteristics:

1. The normal curve is bell-shaped.


2. The curve is symmetrical about its center. This means that, if we draw a segment from
the peak of the curve down to the horizontal axis, the segment divides the normal curve
into two equal parts or areas.
3. The mean, median, and mode coincide at the center. This also means that in a normal
distribution, or a distribution described by a normal curve, the mean, median, and mode
are equal.

21
4. The width of the curve is determined by the standard deviation of the distribution.
5. The tails of the curve are plotted in both directions and flatten out indefinitely along the
horizontal axis. The tails are thus asymptotic to the baseline. A portion of the graph
that is asymptotic to a reference axis or another graph is called an asymptote, always
approaching another but never touching it.
6. The total area under a normal curve is 1. This means that the normal curve represents
the probability, or the proportion, or the percentage associated with specific sets of
measurement values.
To visualize and understand the properties/characteristics of a normal curve, please
refer to figure as shown below.

Graph of a NORMAL CURVE

A normally distributed random variable with a mean µ = 0 and standard deviation ơ =


1 is called a standard normal variable. It is presented using standard normal distribution where
the center of the curve is zero and add one unit from the center to the right and subtract one
unit from the center to the left. This is shown below in the figure.

The shape of a normal curve is based on the two given parameters, the mean and the
standard deviation of the distribution. When comparing two distributions each described by
the normal curve, the following are the three situations based on the said parameters

a. When the means are not equal, but the standard deviations are equal. (µ1 ≠ µ2 ; ơ1 =ơ2 ),
the curves have a similar shape but centered at different points, as shown in the figure.

22
b. When the means are equal, but the standard deviations are not equal. (µ1 = µ2 ; ơ1 ≠ ơ2
), the curves are centered at the same point but they have different height and spreads. as
shown in the figure.

c. When the means are different and the standard deviations are also different (µ1≠ µ2 ; ơ1 ≠
ơ2 ), the curves are centered at different points and vary in shapes, as shown in the figure.

Every normal curve (regardless of its mean or standard deviation) conforms to the
following "empirical rule"

The empirical rule is better known as 68% - 95% - 99.70% rule. This rule states that
the data in the distribution lies within one (1), two (2), and three (3) of the standard deviation
from the mean are approximately 68%, 95%, and 99.70%, respectively. Since the area of a
normal curve is equal to 1 or 100% as stated on its characteristics, there are only a few data
which is 0.30% falls outside the 3-standard deviation from the mean.

23
Example 1:

The scores of the Senior High School students in their Statistics and Probability quarterly
examination are normally distributed with a mean of 35 and a standard deviation of 5.

Questions:
a. What percent of the scores are between 30 to 40?
b. What scores fall within 95% of the distribution?

Solution:

Draw a standard normal curve and plot the mean at the center. Then, add the standard
deviation to the mean once and mark it to the right of the mean. Add twice the standard
deviation to the mean and put it to the right of the first sum. Then, add thrice the standard
deviation to the mean and mark it to the right of the second sum. Do the same to the left.
This time, instead of adding, subtract the standard deviation from the mean.

Answer:

a. The scores 30 to 40 falls within the first standard deviation from the mean.
Therefore, the scores that fall between 30 and 40 is approximately 68% of the
distribution.

b. Since 95% of the distribution lies within 2 standard deviations from the mean, then
the scores corresponding to this area of the distribution are scores from 25 up to
45.

Example 2:

The district nurse of “Ospital ning Angeles” needs to measure the BMI (Body Mass
Index) of the Alternative Learning System students. She found out that the heights of male
students are normally distributed with a mean of 160 cm and a standard deviation of 7 cm.
Find the percentage of male students whose height is within 153 cm to 174 cm.

24
The mean of this problem is 160, it is subtracted by 7 to the left (160 -7 = 153; 153-7
= 146; 146 – 7 = 139) and add 7 to the right. (e.g. 160 + 7 = 167; 167 + 7 = 174 + 7 = 181).

153 cm falls at 1 standard deviation from the mean to the left and the height of 174 cm
falls at 2 standard deviations from the mean to the right.

Therefore, it covers the whole 68% and 13.5%. of the distribution and the sum of it is 81.5%

ANSWER: 81.5% of the male students have a height between 153 cm to 174 cm.

Learning Competencies with code


 Interpret the mean and the variance of a discrete random variable. (M11/12SP-IIIb-3)
 Solve problems involving mean and variance of probability distributions. (M11/12SP-IIIb-4)
 Illustrate a normal random variable and its characteristics (M11/12SP-IIIc-1)

GENERAL INSTRUCTION: Write your answer on a separate paper


Activity 1 : Solve the following problem

The probabilities that a market inspector will discover violations of the public health code in a
public market are given in the following table.

Number of Violations X Probability P(X)


0 0.41
1 0.22
2 0.17
3 0.13
4 0.05
5 0.02

a. What is the probability that a market inspector will discover, at least, two violations
of the public health code?
b. What is the probability that a market inspector will discover, at most, three
violations of the public health code?

25
c. What is the probability that a market inspector will discover at least two, but fewer
than four violations of the public health code?
d. What is the mean of the probability distribution?
e. What are the variance and the standard deviation of the probability distribution?

Activity 2: Solve the following problem.

The following are the number of cars sold per day at ABC Motor Corporation during
the past 500 working days.

Number of Cars Sold Per Day (X) Frequency of Occurrence


0 40
1 100
2 142
3 66
4 36
5 30
6 26
7 20
8 16
9 14
10 8
11 2
Total 500

a. Construct the probability distribution of the random variable X.


b. What is the probability that exactly four cars will be sold at a given day?
c. What is the probability that, at most, four cars will be sold at a given day?
d. What is the probability that at least six, but at most ten cars will be sold at a given
day?
e. What is the probability that fewer than eight cars, but greater than three cars will be
sold at a given day?
f. What is the expected number of cars that will be sold at a given day? Explain the
meaning of this value.
g. What is the standard deviation of the number of cars that will be sold at a given day?

26
Activity 3: True or False. Write the word TRUE if the statement is correct and FALSE if the
statement is incorrect

_______1. The total area under the normal curve is 1.


_______2. Normal probability distribution is used to determine the probabilities of a discrete
random variable.
_______3. The mean, median and mode in a normal distribution have the same value.
_______4. The distance of the number in the horizontal line or the x-axis should always
proportional.
_______5 The normal probability distribution has a mean µ = 1 and standard deviation ơ = 0.

Activity 4

A. Use Empirical rule to complete the following table.


B. Write on the respective column the range or interval of the scores based on the given
parameters.
C. Illustrate the distribution through a diagram.

Mean Standard 68% 95% 99.7%


Deviation
Ex 50 3 47 to 53 44 to 56 41 to 59
1 135 28
2 87 5.5
3 213 15
4 567 20
5 785 29

Activity 5: Read and analyze the following problem to answer the question.

The government launched Enhanced Community Quarantine on March 15, 2020, as a


preventive measure to fight the spread of COVID-19 in the country. Many of us experienced
difficulties in getting our daily needs like food or getting money to pay for our bills and rentals.

Our government gave financial support and food, but it is not enough for the people.
Fortunately, we are blessed because the spirit of love and generosity is still present in the
heart of our fellow countrymen. They donated money, food, time, and other essential things.

Suppose you and your friends conducted a donation-drive in your barangay. Each of
the donors gave a certain amount with an average of Php 320.00 and a standard deviation of
Php 60.00. Assuming, it is normally distributed,

1. What percentage of the distribution that the amount falls between Php. 260.00 and
Php 380.00?
2. How much of the donations fall in 95% of the distributions?
3. How much of the donations fall in 99.70% of the distributions?

27
References

Department of Education. Statistics and Probability: Teacher’s Guide. pp. 117-181.

Department of Education. Statistics and Probability: Learner’s Material. pp. 119-131.


Ocampo, Jose M., and Wilmer G. Marquez. 2016. Conceptual Math and Beyond: Statistics
and Probability. Quezon City: Brilliant Creations Publishing Inc.

Belecina, Rene R., Elisa S. Baccay, and Efren B. Mateo. 2016. Statistics and Probability, 1st
ed. Manila: Rex Bookstore Inc.

Answer Key
Activity 1 Activity 2 Activity 3

a. P(X≥2) = 0.37 a. Tabulate 1. True


b. P(X≤3) = 0.93 b. P(X=4) = 0.072 2. False
c. P(2≤X≤4) = 0.30 c. P(X≤4) = 0.768 3. True
d. µ = 1.25 d. P(6≤X≤10) = 0.168 4. True

e. 2 = 2.12 e. P(3<X<8) = 0.224 5. False

 = 1.456 f. µ = 3.056

g. 2 = 12.352

 = 3.515

Activity 4 Activity 5

1. 68%
2. 200 to 440
3. 140 to 500

Prepared by:

ERVIN A. PEREZ
Malabanias Integrated School
28 SHS Department
WEEK 4: IDENTIFYING REGIONS UNDER NORMAL CURVE CORRESPONDS TO
DIFFERENT STANDARD NORMAL VALUES

Background Information

We have learned that the total area under the curve is equal to 1. Since the standard
normal distribution is defined as normally distributed random variable having a mean of zero (
µ= 0 ) and standard deviation of one (ơ = 1), the fifty (50) percent of the region under the curve
is below the mean and the other 50% is above the mean. Figure 1, as shown below illustrate
the standard normal curve.

Figure 1: Standard Normal Curve

You can easily identify the area of the regions under the normal curve by using the
Table of Areas under the Normal Curve which is also known as z-Table (see attachment at
the last page of this module). This table gives the area of any value of z from -3.99 to 3.99.
The value from this table will describe the area of the specific region of the curve to the left of
the given z-value.

How to use the Table of Areas under the Normal Curve?

Example: Find the area to the left of −1.69

Solution: First, split the given z-value into hundredths. We can find the whole
number and the tenths digit (-1.6) at the left side of the table while the hundredths (0.09)
located at the uppermost of the table. Then the intersection of these numbers will be the
area of the normal curve to the left of the z-value.

Answer: The area to the left of z= -1.69 is 0.0455

29
STEPS ON HOW TO FIND THE AREA THAT CORRESPONDS TO Z –VALUE

1. Draw/sketch a normal curve and locate the given z-value on the normal curve

2. Shade the region of the curve according to the condition of z-value whether it is below,
above, or between.

3. Use the table of the area under the normal curve to find the corresponding area.

4. Choose the appropriate operation based on step 2 and 3

4.1. When the z-value is to the left or any related terms (e.g. below, less than) just write the
value we obtained in step 3

4.2. When the z-value is to the right or any related terms (e.g. above, greater than), subtract
1 by the obtained value in step 3

4.3. When the shaded region is in between of the two z-value, subtract the biggest by the
smallest value obtained in step 3

5. Label the shaded region and draw a conclusion

Example 1: Find the area that corresponds below z = -1.35

Solution:

Step 1. Draw/sketch a normal curve and locate the given z-value.

Step 2: Shade the region of the curve according to the condition of z-value whether it is
below, above, or between.

30
Step 3. Use the table of the area under the normal curve to find the corresponding area

The intersection between -1.3 and 0.05 is 0.0885

Step 4. Choose the appropriate operation based on step 2 and 3

4.1 When the z-value is to the left or any related terms (e.g. below, less than) just write
the value we obtained in step 3 Since the shaded region of the curve is to the left of the
z= -1.35 and the intersection between -1.3 and 0.05 is 0.0885. Therefore, the area of
the shaded region is 0.0885

Step 5. Label the shaded region and draw a conclusion.

Therefore, the area that corresponds below z = -1.35 is 0.0885

31
0.9115
0.9115

0.9115

32
CONVERTING A NORMAL RANDOM VARIABLE TO A STANDARD NORMAL
VARIABLE AND VICE-VERSA

In solving real-life problems involving the normal curve, it is particularly important to


convert a random variable x to a standard normal variable or z-score. This procedure is
known as “standardizing” or “standardization” of a random variable, where a
standardized value is called a z-score. A z-score is a measure of the number of standard
deviations (σ) a particular data value is away from the mean (μ).

Suppose your score on a test in Probability and Statistics is 39 and the scores are
normally distributed with a mean of 33 and standard deviation of 3, then your score is exactly
2 standard deviations above the mean. If you scored 30, then it is exactly 1 standard deviation
below the mean. All values that are above the mean have positive z-scores and all values that
are below the mean have negative z-scores. If you obtained a z-score of -3, this means that
your score is 3 standard deviations below the mean, that is, 33 – 3(3) = 33 – 9 = 24.

Given any value x from a normal distribution with mean μ and standard deviation σ, to
convert x to a z-score (standard normal score), you need to;

(a) Subtract the mean μ from x.

(b) Divide this quantity, 𝒙 – 𝝁, by the standard deviation σ.

The formula used in converting a random variable x to a standard normal variable z


is:
𝑥−𝜇
𝑧 =
𝜎
where: 𝑧 – standard normal score or z – score
𝑥 – any data value in a normal distribution
𝜇 - mean
𝜎 - standard deviation

33
Example 1.
Suppose IQ scores are normally distributed with a mean of 100 and standard
deviation of 10. If your IQ is 85, what is your z-score? (Round off your answer to the nearest
hundredths)
Solution:

The z-score can be computed using the formula;


𝑥−𝜇
𝑧 =
𝜎

85 − 100 15
𝑧 = = − = −𝟏. 𝟓𝟎
10 10

Example 2.
On a nationwide placement test that is normally distributed, the mean was 125 and
standard deviation was 15. If you scored 149, what was your z-score? (Round off your
answer to the nearest hundredths)

Solution:

Using the formula:


𝑥−𝜇
𝑧 =
𝜎

149 − 125
𝑧 =
15

24
= = 𝟏. 𝟔𝟎
15

Example 3.
The heights of teachers in Sta. Catalina National High School are normally
distributed with a mean of 150 cm and standard deviation of 15 cm. The height of Sir Victor
has a z-score of 3.25. What is the actual height of Sir Victor? (Round off your answer to the
nearest hundredths)
Solution:
In this example, z-score is given while the normal random variable is unknown. To
solve for the normal random variable x, multiply the z-score (z) by the standard deviation σ,
then add the mean μ. Therefore;
𝑥 = zσ + 𝜇

𝑥 = 3.25(15 cm) + 150𝑐𝑚

𝑥 = 48.75 cm + 150 𝑐𝑚

𝒙 = 𝟏𝟗𝟖. 𝟕𝟓 𝐜𝐦

34
Example 4.
The time it takes for a cell to divide is normally distributed with an average of 60
minutes and standard deviation of 5 minutes. How long will it take for a given cell to divide if
its “mitosis” has a z-score of -1.35?
Solution:
𝑥 = zσ + 𝜇
𝑥 = −1.35(5 min) + 60𝑚𝑖𝑛
𝑥 = −6.75 min + 60 𝑚𝑖𝑛
𝑥 = 𝟓𝟑. 𝟐𝟓 𝐦𝐢𝐧𝐮𝐭𝐞𝐬

Example 5.
Suppose you have the population values 40 and 70 and that their corresponding z –
scores are -1 and 2, respectively.
1. Is it possible to determine the population’s mean and standard deviation?
2. If so, what are these values?

Solution:
To begin, let us first identify the given. Let
𝑥1 = 40 𝑥2 = 70
𝑧1 = −2 𝑧2 = 1
Use these given and substitute to the formula
𝑥1 = 𝑧1 σ + 𝜇 𝑥2 = 𝑧2 σ + 𝜇
40 = −2σ + 𝜇 70 = 1σ + 𝜇

Note: In this example, we do not have 𝜎1 and 𝜎2 , and 𝜇1 and 𝜇2 since we are
describing a single population. Thus, 𝜎1 = 𝜎2 = 𝜎 and 𝜇1 = 𝜇 2 = 𝜇. Furthermore, we need to
review our algebra specifically on systems of linear equation. So,

40 = −2σ + 𝜇 equation 1
70 = 1σ + 𝜇 equation 2
Subtract equation 1 from equation 2, we have

40 = −2σ + 𝜇 equation 1
70 = 1σ + 𝜇 equation 2
30 = 1σ − (−2 σ)
30 =3 σ the difference
30 3𝜎
=
3 3

10= σ
The standard deviation of the population is 10. To obtain the mean, substitute it to either
equation 1 or 2.
40 = −2σ + 𝜇 ⟹ 40 = −2(10) + 𝜇 ⟹ 60 = 𝜇
70 = 1σ + 𝜇 ⟹ 70 = 1(10) + 𝜇 ⟹ 60 = 𝜇

Both agrees that the mean of the population is 60.


Therefore,
1. It is possible to determine the population’s mean and standard deviation given only
the raw score (x) and its corresponding standard score (z).
2. The mean and standard deviation of the population is 60 and 10,
respectively.

35
COMPUTING PROBABILITIES AND PERCENTILES USING THE STANDARD NORMAL
TABLE

For any set of measurements (arranged in ascending or descending order), a


percentile is a measure of relative standing. It is a descriptive measure of the relationship of
a measurement to the rest of the data.
For example, in a test in Algebra, you got a score of 82 and you want to know how
fared in comparison with your classmates. If your teacher tells you that you scored at 90 th
percentile, it means that 90% of the grades were lower than yours and 10% were higher. If
the relative frequency histogram were used to show the scores, the 90th percentile would be
located at a point such that 90% of the total area under the relative frequency histogram lies
below the 90th percentile and 10% lies above.
In a previous lesson, we learned how to compute the z-scores based on means and
standard deviations for both sample and population. The resulting z – scores allowed us to
read the corresponding areas (or probabilities) using the standard normal table.

Suppose that we know the probability this time. What is the corresponding z-score?
For example, where is 95% under the normal curve?

There are three important things to remember when we are given probabilities and
we want to know their corresponding z-scores.

 First, a probability value corresponds to an area under the normal curve.


 Second, in the table of Areas Under the Normal Curve, the numbers in the
extreme left across the top are z-scores, which are the distances along the
horizontal scale. The number in the body of the table are areas or
probabilities.
 Third, the z-scores to the left of the mean are negative values.

Thus, when we are given the area and we wish to find the corresponding z-value, we locate
the given area at the body of the table. If the exact area is not available, we take the nearest
area. Then, we look up the corresponding z-value in the Table of Areas Under the Normal
Curve or z- Table.

A normal distribution curve can be used as a probability distribution curve for normally
distributed variables. The area under the standard normal distribution curve can also be
thought of as a probability. That is, if it's possible to select any 𝑧 value at random, the
probability of choosing one, say, below 1.45 would be the same as the area under the curve
at the left of 1.45. In this case, the area is 0.9265. Therefore, the probability of randomly
selecting a 𝑧 value below of 1.45 is 0.9265 or 92.65%. The problems involving probabilities
and percentiles are solved in the same manner as finding the areas under a normal curve.

In finding probabilities, the following notations will be used:

(a) 𝑃(𝑍 < 𝑧) – probability at the left of 𝑧


(b) 𝑃(𝑍 > 𝑧) = 1 – 𝑃(𝑍 < 𝑧) – probability at the right of 𝑧
(c) 𝑃(𝑎 < 𝑧 < 𝑏) - the probability of 𝑧 that is in between two other 𝑧 values, say 𝑎 and 𝑏
(d) 𝑃(𝑧 < 𝑎 ) ∪ 𝑃(𝑧 > 𝑏 ) – the probability of 𝑧 is in the opposite direction of two values,
say 𝑎 and 𝑏
(e) 𝑃(𝑋 < 𝑥) – probability at the left of a normal random variable 𝑥

36
(f) 𝑃(𝑋 > 𝑥) – probability at the right of a normal random variable 𝑥
(g) 𝑃(𝑎 < 𝑋 < 𝑏) – the probability of a normal random variable 𝑋 that is in between two
other normal random variables, say 𝑎 and 𝑏.
(h) 𝑃(𝑋 < 𝑎 ) ∪ 𝑃(𝑋 > 𝑏 ) – the probability of 𝑋 is in the opposite direction of two
values,
say 𝑎 and 𝑏

Example 1.
Find the probabilities for each of the following.
(a) 𝑃(𝑍 < 1.32)
(b) 𝑃(𝑍 < −1.05)
(c) 𝑃(−0.75 < 𝑍 < 1.56)
(d) 𝑃(𝑍 > −0.88)

Solution:
(a) The probability 𝑃(𝑍 < 1.32) means to find
the area under the normal curve at the left of
𝑧 = 1.32. From the table, find the intersection
on row 1.3 and column 0.02. Thus, the
probability is 𝟎. 𝟗𝟎𝟔𝟔 or 𝟗𝟎. 𝟔𝟔%.

(b) The probability 𝑃(𝑍 < −1.05) is the same as


the area under the normal curve at the left of 𝑧 =
−1.05. From the table, the area is 0.1469. Thus,
the probability is 𝟎. 𝟏𝟒𝟔𝟗 𝒐𝒓 𝟏𝟒. 𝟔𝟗%.

(c) The probability 𝑃(−0.75 < 𝑍 < 1.56) is the


same as the area under the normal curve between
𝑧 values -0.75 and 1.56. To get the area, subtract
the area of 𝒛 = −𝟎. 𝟕𝟓 from the area of 𝑧 = 1.56.
Thus,𝑃(−0.75 < 𝑍 < 1.56) = 𝑃(𝑍 <
1.56)– 𝑃(𝑍 < −0.75) = 0.9406 – 0.2266 =
𝟎. 𝟕𝟏𝟒𝟎 𝒐𝒓 𝟕𝟏. 𝟒𝟎%.

(d) The probability 𝑃(𝑍 > −0.88) is the same as


the area under the normal curve at the right of 𝑧
= −0.88. To get the area, subtract the area
of 𝑧 = −0.88 from 1. Thus
, 𝑃(𝑍 > −0.88) = 1 – 𝑃(𝑍 < −0.88) = 1 –
0.1894 = 𝟎. 𝟖𝟏𝟎𝟔 𝑜𝑟 𝟖𝟏. 𝟎𝟔%.

Example 2.
Let 𝑋 be a normal random variable with mean 𝜇 = 15 and standard deviation 𝜎 = 3. Find the
probabilities of the following:
(a) (𝑋 < 19)
(b) (10 < 𝑋 < 19)

37
Solution:
(a) To find the probability, convert the normal random variable 19 to a z-score.
𝑥−𝜇 19−15
Thus, (𝑋 < 19) = 𝑃 (𝑍 < ) = 𝑃 (𝑍 < )
𝜎 3
= (𝑍 < 4 3 ) = 𝑃(𝑍 < 1.33).

From the table, the probability at the left of 𝒛 = 𝟏. 𝟑𝟑 is equal to 𝟎. 𝟗𝟎𝟖𝟐 𝒐𝒓 𝟗𝟎. 𝟖𝟐%.
(b) To find the probability, convert the normal random variables 10 and 19 to z values.
10−15 19−15
Thus, 𝑃(10 < 𝑋 < 19) = 𝑃 ( < 𝑍 < )
3 3
= (−1.67 < 𝑍 < 1.33)
= (𝑍 < 1.33) – (𝑍 < −1.67).

From the table, we have 0.9082 – 0.0475 = 𝟎. 𝟖𝟔𝟎𝟕 𝒐𝒓 𝟖𝟔. 𝟎𝟕%. These probabilities can also
be applied to real-life problems.

Learning Competencies with code

 Identify regions under the normal curve corresponding to different standard normal
values. (M11/12SP-IIIc-3)
 Convert a normal random variable to a standard normal variable and vice versa
(M11/12SP-IIIc-4)
 Compute probabilities and percentiles using the standard normal table. (M11/12SP-IIIc-d-1)

Activity 1.
Identify the area under the normal curve of the following standard normal values
using the z table (refer to the last page of this module).
1. 𝑧= −3.25 _____________ 6. 𝑧 = −1.33 _________
2. 𝑧= 1.25 _____________ 7. 𝑧 = −2.5 _________
3. 𝑧= 0.56 _____________ 8. 𝑧 = 0.29 _________
4. 𝑧= 3 _____________ 9. 𝑧 = −0.13 _________
5. 𝑧= 1.96 _____________ 10. 𝑧 = −1.96 _________

Activity 2.
Find the area or region that corresponds to each of the following z values. Sketch
and label each curve.
1. to the right of z = 2.18
2. below z= -1.40
3. above z= 0.57
4. between z = -2.34 and z = 1.57
5. between z = 0.32 and z = 2.42

38
Activity 3.
Supply the needed information by solving the missing value.

1. 𝑥 = 25, 𝜇 = 40, σ = 6, 𝐳 = ?
2. 𝑧 = −2.25, 𝜇 = 218, σ = 36, 𝐱 = ?
3. 𝑥 = 39, 𝜇 = 30, σ = 3, 𝐳 = ?
4. 𝑥 = 125, 𝜇 = 105, σ = 8, 𝐳 = ?
5. 𝑧 = 1.11, 𝜇 = 145, σ = 20, 𝐱 = ?

Activity 4.
Solve the following problems.
1. The mean number of hours a Filipino worker spends on the computer is 3.1 hours per
workday. Assume the standard deviation is 0.5 hour and is normally distributed, how long
does a worker spend on the computer if his z-score is 1.2?
2. Each month, a Filipino household generates an average of 28 pounds of newspaper for
garbage or recycling. Assume the standard deviation is 2 pounds. Determine the z-score of
a household that generates 22 pounds of newspaper.
Activity 5.
A. Use the Table of Standard Normal distribution to determine corresponding
probabilities of the following:
1. 𝑃(𝑍 < −1.45)
2. 𝑃(0.5 < 𝑍 < 2.33)
3. 𝑃(𝑍 > 1.78)

B. 𝑋 is a normally distributed random variable with a mean of 60 and a standard


deviation of 8. Find the probabilities indicated by using the table.
4. 𝑃(𝑋 < 52)
5. 𝑃(48 < 𝑋 < 64)
6. 𝑃(𝑋 > 57)

References

Department of Education. Statistics and Probability-PIVOT: Learner’s Material. pp. 132-


168.

Belecina, Rene R., Elisa S. Baccay, and Efren B. Mateo. 2016. Statistics and Probability, 1st
ed. Manila: Rex Bookstore Inc.

39
Answer Key
Activity 1
1. 0.0006 6. 0.0918
2. 0.8944 7. 0.0062
3. 0.7123 8. 0.6141
4. 0.9987 9. 0.4483
5. 0.9750 10. 0.0250

Activity 2

1. to the right of z = 2.18 4. between z = -2.34 and z = 1.57

0.0146 0.9322

2. below z= -1.40

5. between z = 0.32 and z = 2.42


0.0808

0.3667

3. above z= 0.57

0.2843

`Activity 3 Activity 4 Activity 5


1. 3.7 hrs A.
2. -3
1. -2.5 1. 0.0735
2. 137 2. 0.2986
3. 3 3. 0.0375
4. 2.5
B.
5. 167.2
4. 0.1587
5. 0.6247
6. 0.6480

40
STANDARD NORMAL TABLE: Table Values Represent AREA to the LEFT of Z-score

41
Prepared by:

JELENE V. BASILIO
Sto. Domingo Integrated School
SHS Department

42
WEEK 5 : RANDOM SAMPLING

Background Information

Random Sampling is a sampling method of choosing representatives from the


population wherein every sample has an equal chance of being selected. Accurate data can
be collected using random sampling techniques.
A population includes all of its elements from a set of data. The size of the population
is the number of observations in the population. For example, if ABSCBN network has
11,000 employees having the required blood type in a certain study, then we have a
population of size 11,000.
Sample consists of one or more data drawn from the population. It is a subset, or an
incomplete set taken from a population of objects or observations. Taking samples instead of
the population is less time-consuming and cost-effective. Although sampling has
advantages, it can also be a source of bias and inaccuracy

Four Types of Random Sampling


1. Simple Random Sampling - is the basic
sampling technique where we select a
group of subjects (a sample) for study
from a larger group (a population). Each
individual is chosen entirely by chance
and each member of the population has
an equal chance of being included in the
sample. Every possible sample of a given
size has the same chance of selection.

2. Systematic Random Sampling - involves


choosing your sample based on a regular
interval, rather than a fully random selection.
It can also be used when you don’t have a
complete list of the population.

3. Stratified Random Sampling – is used when


you want to ensure that specific
characteristics are proportionally
represented in the sample. You split your
population into strata (for example, divided by
gender or race), and then randomly select
from each of these subgroups.

43
4. Cluster Random Sampling - is used when
you are unable to sample from the entire
population. You divide the sample into
clusters that approximately reflect the whole
population, and then choose your sample
from a random selection of these clusters.

PARAMETER AND STATISTICS

Parameter
A parameter is a measure that is used to describe the population while statistic is a
measure that is used to describe the sample. To understand more, let us discuss the two
measures. Population mean (µ), population variance (𝜎2) and population standard deviation
(𝜎) are what we called parameters.

Population Mean
The mean is the sum of the data divided by the number of data. The mean is used to
describe where the set of data tends to concentrate at a certain point.
Below are grades in Statistics of Grade 11 students during the third quarter.

94 85 88 79 78

75 89 91 84 77

Let us compute the population mean, population variance, and population standard
deviation.

Grades in Statistics of Grade 11 Students during the Third Quarter


Student’s number Grade (X)

1 94

2 85

3 88

4 79

5 78

6 75

7 89

8 91

9 84

10 77

N = 10 ∑X = 840

44
ΣX
To compute the population mean (𝜇) we can use this equation 𝜇 = whereas ∑X is the
𝑁
summation of all values of grades (x) and N is the total number of population therefore,
840
𝜇= = 84 the computer population mean (𝜇) is 84
10

POPULATION VARIANCE AND POPULATION STANDARD DEVIATION

Variance and standard deviation determine how to spread or to scatter each data on
the set from the mean.
To compute for the population variance, we apply the formula:
∑(𝑋 − 𝜇)2
𝜎2 =
𝑁
To compute for the population standard deviation, we use the formula:

∑(𝑋 − 𝜇)2
𝜎= √
𝑁

Whereas, X is the given data, µ is the population mean 𝑁 is the number of elements in the
population.
Consider the data given above, to solve for the population variance and population standard
deviation, we have this table:
Student’s number Grade (X) 𝑋−𝜇 (𝑋 − 𝜇)2

1 94 94 – 84 = 10 102 = 100

2 85 85 – 84 = 1 12 = 1

3 88 88 – 84 = 4 42 = 16

4 79 79 – 84 = -5 -52 = 25

5 78 78 – 84 = – 6 -62 = 36

6 75 75 – 84 = – 9 -92 = 81

7 89 89 – 84 = 5 52 = 25

8 91 91 – 84 = 7 72 = 49

9 84 84 – 84 = 0 02 = 0

10 77 77 – 84 = 7 72 = 49

N = 10 ∑X = 840 ∑(X − µ) 2 = 382

To solve for the population variance and population standard deviation we have to use these
equations:
Population Mean Population Variance Population Standard Deviation
ΣX ∑(𝑋 − 𝜇)2
𝜇= 𝜎2 = ∑(𝑋 − 𝜇)2
𝑁 𝑁 𝜎= √
𝑁

45
Solution:
ΣX 840
Population mean: 𝜇 = = = 84
𝑁 10
∑(𝑋− 𝜇)2 382
Population variance: 𝜎2 = = = 38.2
𝑁 10
∑(𝑋− 𝜇)2 382
Population Standard Deviation: 𝜎 = √ = √ = √38.2 ≈ 6.18
𝑁 10

STATISTIC

Statistic refers to the sample in a given population.


Sample mean (𝑥̅ ), sample variance (s2) and sample standard deviation (s) are what we call
statistic.

From the previous data of the population, suppose that we randomly select only 7 data
(numbers with check) out of the total 10 data in the population.
 94 85  88  79 78

75  89  91  84  77

Compute the sample mean, sample variance, and sample standard deviation. Here is the
result:

SAMPLE MEAN

The sample mean is the average of all the data of the samples. The symbol 𝑥̅ ,
(read as “x bar”) is used to represent the sample mean. To compute for the sample mean 𝑥̅
ΣX
we can use this equation: 𝑥̅ =
𝑛
Where: 𝑥̅ is the sample mean
∑x is the summation of x (sum of the measures)
𝑛 is number of elements in the sample

Grades in Statistics of Grade 11 Students during the Third Quarter


Population Sample
Student’s number
(Student’s grade) (Student’s grade)
1 94 94
2 85
3 88 88
4 79 79
5 78
6 75
7 89 89
8 91 91
9 84 84
10 77 77
N = 10; n = 7 ∑X = 840 ∑X = 602

46
To get the sample mean:
ΣX 602
We substitute to the formula 𝑥̅ = = = 86
𝑛 7
Our computed sample mean 𝑥̅ is 86.

SAMPLE VARIANCE AND SAMPLE STANDARD DEVIATION

Sample variance is the computed variance of the elements of the sample. s 2 is used to
represent sample variance.

Sample standard deviation is the computed standard deviation of the elements of the
sample. s is used to represent sample standard deviation.

To compute for the sample variance, we apply the formula:


∑(𝑥 − 𝑥̅ )2
𝑠2 =
𝑛−1
To compute for the sample standard deviation, we use the formula:
∑(𝑥 − 𝑥̅ )2
𝑠= √
𝑛−1
Where: x is the given data
𝑥̅ is the sample mean

𝑛 is number of elements in the sample

To apply this here’s another example using the previous example.


Population Sample
Student’s
(Student’s (Student’s 𝑥 − 𝑥̅ (𝑥 − 𝑥̅ )2
number
grade) grade)

1 94 94 94 – 86 = 8 82 = 64

2 85

3 88 88 88 – 86 = 2 22 =4

4 79 79 79 – 86 = – 7 -72 = 49

5 78

6 75

7 89 89 89 – 86 = 3 32 = 9

8 91 91 91 – 86 = 5 52 = 25

9 84 84 84 – 86 = – 2 -22 = 4

10 77 77 77 – 86 = – 9 -92 = 81

∑X = 840 ∑X = 602
∑(x − 𝑥̅)2 = 236

47
To solve for the sample variance and sample standard deviation we have to use these
equations:
Sample Mean Sample Variance Sample Standard Deviation
ΣX ∑(𝑥 − 𝑥̅ )2
𝑥̅ = 𝑠2 = ∑(𝑥 − 𝑥̅ )2
𝑛 𝑛−1 𝑠= √
𝑛−1

Solution:
ΣX 602
Sample mean: 𝑥̅ = = = 86
𝑛 7
∑(𝑥− 𝑥̅ )2 236 236
Sample Variance: 𝑠2 = = = ≈ 39.33
𝑛−1 7−1 6
∑(𝑥− 𝑥̅ )2 236
Sample Standard Deviation: 𝑠 = √ = √ = √39.33 = 6.27
𝑛−1 7−1

Remember that parameters are for population while statistics are for sample.

IDENTIFYING SAMPLING DISTRIBUTION OF STATISTICS

Sampling distribution of the sample means is a frequency distribution using the computed
sample mean from all the possible random samples of a particular sample size taken from
the given population.

Steps to follow in making a sampling distribution of the sample mean:


1. Determine the number of sets of all possible random samples that can be drawn from the
given population by using the formula, NCn, where N is the population size and n is the
sample size.
Example:
The given population is 1, 2, 3, 4, 5 and sample size of 3, using the formula NCn
𝑁! 5! 5! 5∙4∙3∙2∙1 20
NCn = 5C3 = [𝑛!(𝑁−𝑛)!] = = = (3∙2∙1)(2∙1∙) = = 10
3!(5−3)! 3!2! 2
2. List all the possible random samples and solve for the sample mean of each set of
samples.
Sample Mean
1,2,3 2
1,2,4 2.33
1,2,5 2.67
1,3,4 2.67
1,3,5 3
1,4,5 3.33
2,3,4 3
2,3,5 3.33
2,4,5 3.67
3,4,5 4

48
3. Construct a frequency and probability distribution table of the sample means indicating
its number of occurrence or the frequency and probability.
Probability P(x)
Sample Means Frequency 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦
𝑁
1
2 1 = .10
10
1
2.33 1 = .10
10
2
2.67 2 = .20
10
2
3 2 = .20
10
2
3.33 2 = .20
10
1
3.67 1 = .10
10
1
4 1 = .10
10

Learning Competencies with code:


 Illustrate random sampling. (M11/12SP-IIId-2)
 Distinguish between parameter and statistic. (M11/12SP-IIId-3)
 Identify sampling distributions of statistics (sample mean). (M11/12SP-IIId-4)

GENERAL INSTRUCTION: Write your answer on a separate paper


Practice A
Choose the letter of the best answer. Encircle the letter of the correct answer.
1. What do you call the set of all data that researchers want to investigate?
a. data b. population
c. random d. sample
2. Which of the following can be considered as the best sample?
a. Conveniently sample b. Selected at random
c. Purposively selected d. Self-selected
3. From the given populations, which of the following samples is most likely to be
representative of the population from which it is drawn?
a. population: SAP beneficiaries b. population: lawmakers
sample: car owners sample: students
c. population: online shoppers d. population: graduating students
sample: cellphone users sample: grade 8 students
4. In random sampling, the sample should be_______ the population
a. as large as b. different from
c. not taken from d. representative of
5. Which of the following is NOT a type of random sampling?
a. cluster b. purposive
c. simple d. systematic

49
6. Which of the following is random sampling?
a. convenience b. non-probability
c. purposive d. stratified
7. As a researcher, you ask the people of your barangay on how they spent their money
during the community quarantine. You decided to group them according to their
socio-economic status and then proportionately and randomly selecting samples
from these groups. Which type of sampling are you using?
a. convenience b. purposive
c. stratified d. systematic
8. As a researcher, you want to know the plans of graduating senior high school
students. You ask all by-passers to participate in your research. Which type of
sampling are you using?
a. convenience b. purposive
c. stratified d. systematic
9. You wish to examine the effects of community quarantine on your fellow students.
You decided to borrow a list from the guidance office and select every 10th student
on it. What sampling technique are you using?
a. convenience b. purposive
c. stratified d. systematic
10. You want to know the effects of the COVID-19 to the Filipino people and you will ask
the medical frontliners. Which type of sampling will you use?
a. convenience b. purposive
c. stratified d. systematic

Practice B
Choose the letter of the best answer. Encircle the letter of the correct answer.

1. What do you call a number that describes a sample characteristic?


a. data b. sample
c. parameter d. statistic

2. What do you call a number that describes a population characteristic?


a. data b. sample
c. parameter d. statistic

3. Which of the following is a statistic?


a. 𝜎2 b. 𝜎
c. µ d. s

For numbers 4-9, the following are scores of students in their quiz:
10, 8, 7, 20, 11, 8, 15, 10, 6, 14, 16, 20
4 – 5. What is the value of µ? (2 points)
a. 10.4 b. 11
c. 12.08 d. 14.4

6 – 7.What is the value of 𝜎? (2 points)


a. 4.6 b. 5
c. 6 d. 5.6

50
8– 9. What is the value of the population variance? (2 points)
a. 18.7 b. 25
c. 20 d. 21.5
For numbers 10-15, the following are the seven scores of the students above that are
randomly selected: 8, 7, 20, 15, 10, 14, 16

10 – 11. What is the value of the sample mean? (2 points)


a. 15.2 b. 12.9
c. 13.5 d. 14.7
12 – 13. What is the value of s2? (2 points)
a. 20.8 b. 19.7
c. 22.1 d. 25.4
14 – 15. What is the sample standard deviation? (2 points)
a. 6.2 b. 4.7
c. 8.3 d. 5.6

Practice C
Choose the letter of the best answer. Encircle the letter of the correct answer.

1. What distribution pertains to the frequency distribution of the sample mean from all
the possible random samples of a particular sample size n taken from the given
population?
a. frequency b. normal
c. population d. sampling
2. Which of the following is NOT a step-in creating sampling distribution of the sample
mean?
a. Determine the number of sets of all b. Compute for the standard
possible random samples. deviation and variance of the
samples.
c. Construct a frequency distribution d. List all the possible random
table of the sample means and samples and solve for the sample
probability. mean of each set of samples
3. Which of the following is the mean of sample 6, 10, 21, 25, and 28?
a. 15.17 b. 18
c. 21.2 d. 22
For numbers 4-9, refer to the following set of data of a population {5, 6, 7, 8}
4. How many different samples of size n = 2 can be drawn from the population?
a. 6 b. 5
c. 4 d. 3
5. Which of the following sample mean appears most frequent in the sampling
distribution?
a. 5 b. 5.5
c. 6 d. 6.5
6. What is the lowest value of the sample mean in this sampling distribution?
a. 5.5 b. 6.5
c. 7 d. 8

51
7. What is the frequency of the sample mean 7?
a. 0 b. 1
c. 2 d. 3
8. What is the probability of the sample mean 6 in the sampling distribution?
a. 1/6 b. 2/3
c. 1/2 d. 2/5
For numbers 9-10, refer to the following population consists of the data {3,6,9,12,15}.
9. How many different samples of size n = 3 can be drawn from the population?
a. 6 b. 12
c. 8 d. 10
10. Which of the following sample mean appears most frequent in the sampling
distribution?
a. 6 b. 7
c. 8 d. 12

References:
 Statistics and Probability 11, Pivot 4A Learners Material (N/A) Pages 172 - 206
 https://www.scribbr.com/methodology/simple-random-sampling/
 http://www.stat.yale.edu/Courses/199798/101/sample.htm#:~:text=Simple%20rando
m%20sampling%20is%20the,being%20included%20in%20the%20sample.

Answer Key:

Practice A: Practice B: Practice C:

1. b 1. d 1. d
2. b 2. c 2. b
3. c 3. d 3. b
4. d 4– 5. c 4. a
5. b 6-7. a 5. d
6. d 8-9. d 6. a
7. c 10-11. b 7. c
8. a 12-13. c 8. a
9. d 14-15. b 9. d
10. b 10. c

Prepared by:

Jeral S. Reyes

Teacher - II

52

You might also like