Professional Documents
Culture Documents
Notes 1 For Week 1 PDF
Notes 1 For Week 1 PDF
Notes 1 For Week 1 PDF
With a random phenomenon each attempt or trial to be observed produces an outcome that usually
varies from an attempt to another and it is not guaranteed that the same outcome occurs all the times.
Opposite to Certainty where outcomes repeats exactly when observed under the same conditions.
In real life, all phenomena are random. Certainty is observed through laws that assume given factors
having no effect on the condition which is subject to those factors.
Examples:
From real life:
Mobile app size in MB
Time in minutes is takes you back home from work or study in a given day
A friend answering your call at a given try
Number of new cases per day testing positive for Covid19 in Lebanon
1
Outcomes and Sample Spaces
A random Outcome is the result of the random phenomenon when observed on a subject or event.
The Sample Space is the set of all possible outcomes of a random phenomenon or random experiment.
Sample space: it is a range from the smallest size to the largest. [0.0024 MB , 500 MB] ?
2- Time in minutes is takes you back home from work or study in a given day
It depends on you. The sample space is [5 min, 100 min] ?
4- Number of new cases per day testing positive for Covid19 in Lebanon
Just check the updated chart today. The sample space is {0,1,2,3,….,27,…} you may want to have an
upper limit… OK you may, but say how many!
The other examples depend on the experimentations and observation. But at the end, a specified sample
space must be identified.
Remember, the sample space is attributed to all possible outcomes of the random phenomena.
Those outcomes are expected when the members of the population that are subject to the phenomena
are observed.
2
The Probability Measure (or simply Probability)
Definition: the probability is a real-number measure between 0 and 1 of the likelihood or the chance
for a set of possible outcomes, which is a subset of the sample space
The set of possible outcomes is usually characterized by a property defined as an Event.
So, an Event is specific condition of the random phenomena that can be described by a subset of
outcomes from the sample space.
Example:
You are attempting to answer a True/False question.
The outcome is either Correct or Wrong (assume no answer will result into a wrong outcome)
An event is for example {a correct answer}=C. The other event is obviously {a wrong answer}=W.
With an uneducated guess, you measure the likelihood for wither C or W is 50% (the 50/50 guess)
Hence, the probability for C or for W is 0.5.
This probability measure can vary if you have an educated knowledge for the possible correct answer.
So, if you studied for the exam, you may measure the likelihood of a correct answer for a given T/F
question to be like 0.95 (i.e. 95% chance it is correct)
Example:
The famous die experiment. Throw a die
An outcome can be obtaining the up-showing face (the face) with 3 dots, say 3.
The sample space is S={1,2,3,4,5,6}
Assume the die is fair in the outcome chances (all faces are equally likely to show up).
1
Then the probability for any outcome is 6
Consider the event A={obtaining an odd number of dots on the die face}={1,3,5}
3 1
The probability for A is therefore 3 chances out of 6, or 6 = 2.
3
The Population
The population consists of subjects or events where the random phenomena is observed.
The population must be clearly identified when a statistical analysis is conducted.
Example: Students’ cumulative GPAs (CGPA) in BAU at the end of the Spring semester.
An outcome is any value between 0 (theoretically) and 4.0. So, the sample space for the CGPA is the
interval of real numbers [0.0 , 4.0]
The population consists of all BAU active students where each student will end up with a given
CGPA…
Example: Number of new cases per day testing positive for Covid19 in Lebanon
The sample space is {0,1,2,3,….,27,…}
The population consists of all observed daily new cases since the beginning of the pandemic till
perpetually.
You may get many observations having the same outcome, e.g. “No new case” i.e. the value of 0.
4
How to determine the probability of an outcome
The occurrence of the given outcome when observed by trying many members (subjects or events)
from the population can be counted and a relative frequency of its occurrence can be computed using
the rule:
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑐𝑐𝑢𝑟𝑟𝑒𝑛𝑐𝑒𝑠 𝑜𝑓 𝑡ℎ𝑒 𝑒𝑣𝑒𝑛𝑡
𝑅𝑒𝑙𝑡𝑖𝑣𝑒 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑜𝑓 𝑜𝑐𝑐𝑢𝑟𝑒𝑛𝑐𝑒 =
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑟𝑖𝑎𝑙 𝑜𝑟 𝑒𝑥𝑝𝑒𝑟𝑖𝑚𝑒𝑛𝑡𝑠 𝑝𝑒𝑟𝑓𝑜𝑟𝑚𝑒𝑑
As the number of trials or experiments performed get larger and larger, then the relative frequency of
the outcome converges to a fixed value for the fraction.
Try it yourselves: Toss a coin and find the relative frequency of getting “heads” through subsequent
trials and record the relative frequency of heads every time. On the first flip say you get “tail”, the
proportion of heads is 0. Say on the next toss you get a “head”, then the proportion of heads goes up
to 0.5. Suppose you get on the next two flips just “tails”, now the proportion of heads in the four flips
is down to 0.25. Continue flipping the coin – 50 times, 100 times, 1000 times. Over the long term a
pattern emerges – the proportions hover around 0.5 – as can be seen in the following Figure.
Source: https://www.learner.org/wp-content/uploads/2019/03/AgainstAllOdds_StudentGuide_Unit18-Introduction-to-Probability.pdf
The Law of Large Numbers: the long-run relative frequency of repeated independent events
emerges to a given relative frequency as the number of trials increases.
MAT2
5
Sampling
Sampling is the process of selecting random members from a population to observe whether a specified
event is detected. Remember the event can be a single outcome or a range of outcomes that fulfill a
given characteristic or requirement.
6
Describing the sample
The sample can be described either numerically or graphically
Numeric description of the sample data
Measures of location
Measure of variability
Measures of location
Also called measures of central tendency.
Example:
The following measurements were recorded for the drying time, in hours, of a certain brand of latex
paint. 3.4, 2.5, 4.8, 2.9, 3.6, 2.8, 3.3, 5.6, 3.7, 2.8, 4.4, 4.0, 5.2, 3.0, 4.8. Assume that the measurements
are a simple random sample. What is the sample size for the above sample?
Solving manually:
However, use a scientific calculator to make your life easy! Using such a calculator is a must. The
procedure varies from a calculator brand/type to another. Have one and get used to how to use it for
statistical calculations.
7
Other measures of location
The data must be ordered first from smallest to largest then based on the sample size the appropriate
formula is used.
Example: continuing from before..
The Mode
The mode is the observation that occurs most often in the sample.
Example:
The following is the number of calls making orders from a supermarket for the last 20 days.
4, 2, 3, 4, 5, 3, 5, 6, 5, 5, 3, 4, 5, 6, 2, 3, 4, 3, 6, 3
Obviously, the mode is 5
8
Measures of variability
Sample Range
The sample range is the difference between the largest and the smallest observations in the sample.
𝑅𝑎𝑛𝑔𝑒 = 𝑋𝑚𝑎𝑥 − 𝑋𝑚𝑖𝑛
Example:
The data is: 3.4, 2.5, 4.8, 2.9, 3.6, 2.8, 3.3, 5.6, 3.7, 2.8, 4.4, 4.0, 5.2, 3.0, 4.8
The use of a calculator is imminent.
9
Graphical methods to describe the sample
1- The scatter plot
2- The histogram
3- The stem-and-leaf plot
4- The box plot
Note: drawing such plots usually takes time. Instead, using computer packages is required.
10