Professional Documents
Culture Documents
2021 EDA - Module 1 - DESCRIPTIVE STATISTICS Lecture Oct. 19
2021 EDA - Module 1 - DESCRIPTIVE STATISTICS Lecture Oct. 19
2021 EDA - Module 1 - DESCRIPTIVE STATISTICS Lecture Oct. 19
Descriptive Statistics
UC-CEA MODULE 1
ENGR. FM ESTEBAN
Network Gadgets LMS
In an experimental study, a variable of interest is first identified. Then one or more other variables are
identified and controlled so that data can be obtained about how they influence the variable of
interest. For example, a pharmaceutical firm might be interested in an experiment to learn how a new
drug affects blood pressure . Nowadays , Pharmaceutical companies study how vaccination of
individuals can be deterrent to the COVID-19 virus. Now it has been mandated that vaccination of
individuals is a must to control the spread of the virus. Of course the safety protocols of face masks
and social distancing remains.
Nonexperimental, or observational, statistical studies make no attempt to control the variables of
interest. A survey is perhaps the most common type of observational study. For instance, in a personal
interview survey, research questions are first identified. Then a questionnaire is designed and
administered to a sample of individuals. Some restaurants use observational studies to obtain data
about their customer’s opinion on the quality of food.
STATISTICS – SAMPLE DATA
DATA ACQUISITION ERRORS
Managers and engineers should always be aware of the possibility of statistical studies. Using
erroneous data can be worse than not using any data at all. An error in data acquisition occurs when
data value obtained is not equal to the true or actual value that would be obtained with a correct
procedure. For example, the person interviewing might make a recording error such as transposing
into writing (typo errors like age 24 instead of 42 etc.).
Experienced data analysts take care in collecting and recording data to ensure that errors are not
made.
Errors occur often during data acquisition. Misleading information can lead to bad judgments and
decisions.
STATISTICAL INFERENCE
Many situations require information about a large group of elements (individuals, companies, voters,
households, products, customers and so on. But because of time, cost and considerations, data can be
collected from a small portion of a group. The larger group of elements in a particular study is called
the population and the smaller group is called the sample.
The process of conducting a survey of the entire population is called a census. The process of
conducting a survey to collect data from a sample is called a sample survey.
As one of the major contributions, statistics uses data from a sample to make estimates and test
hypothesis about the characteristics of a population through a process referred to as statistical
inference.
STATISTICS – INTRODUCTION
“Workers must be equipped not simply with the technical know-how but the ability
to create, analyze, and transform information and to interact effectively with others”
… Dr. Alan Greenspan – Former chairman of the Federal reserve Board speaking before a National skills
summit.
Mr. Alan Greenspan understands the importance of statistical tools and techniques to provide accurate and
timely information to make public statements that have the power to make global stock markets and
influence political thinking..
STATISTICS – SAMPLE DATA
VARIABLES
Types of variables
Qualitative Quantitative
Brand of PC
Marital Status Discrete Continuous
Hair Color
Theorems:
1. The summation of the sum of two or more variables is
the sum of their summations.
𝑛 𝑛 𝑛 𝑛
𝑋𝑖 + 𝑌𝑖 + 𝑍𝑖 = 𝑋𝑖 + 𝑌𝑖 + 𝑍𝑖
𝑖=1 𝑖=1 𝑖=1 𝑖=1
2. If C is a constant, then
𝑛 𝑛
𝐶𝑋𝑖 = 𝐶 𝑋𝑖 𝑛
𝑖=1 𝑖=1 𝐶 = 𝑛𝐶
3. If C is a constant, then 𝑖=1
SUMMATION NOTATION, Σ
Sample Problems
1. Given X1 = - 2, X2 = 3, X3 = 1,Y1 = 4,Y2 = 0, Y3 = - 5, find the value of the
following
𝑎. 𝑋𝑖 𝑌𝑖 2
𝑏. 2𝑋𝑖 + 𝑌𝑖 − 3
𝑖=2
𝑐. 𝑋2 𝑌
SUMMATION NOTATION, Σ
𝑎. σ 𝑋𝑖 𝑌𝑖 2 = -7
Solution below
𝒙𝒊 𝒚𝒊 𝒚𝟐𝒊
𝑋𝑖 𝑌𝑖 2
1 -2 4 16 -32
2 3 0 0 0
3 1 -5 25 25
total -7
SUMMATION NOTATION, Σ
Sample Problems – solution by MS Excel
1. Given X1 = - 2, X2 = 3, X3 = 1,Y1 = 4,Y2 = 0, Y3 = - 5, find the
value of the following
𝑎. 𝑋𝑖 𝑌𝑖 2
x y y² ∑xy²
1 -2 4 16 -32
2 3 0 0 0
3 1 -5 25 25
Total -7
SUMMATION NOTATION, Σ
𝑎. 𝑋𝑖 𝑌𝑖 2
𝒙𝒊 𝒚𝒊 𝒚𝟐𝒊
𝑋𝑖 𝑌𝑖 2
1 -2 4 16 -32
2 3 0 0 0
3 1 -5 25 25
Total -7
SUMMATION NOTATION, Σ
Sample Problems
1. Given X1 = - 2, X2 = 3, X3 = 1,Y1 = 4,Y2 = 0, Y3 = - 5, find the value of
the following
3
𝑏. 2𝑋𝑖 + 𝑌𝑖 − 3 = −3
𝑖=2
Solution below by MS Excel
x y ∑(2x+y-3)
2 3 0 3
3 1 -5 -6
Total -3
SUMMATION NOTATION, ∑
𝑥𝑖 = 𝑥1 + 𝑥2 + ⋯ 𝑥𝑛
𝑖=1
For example: 𝑥1 = 5, 𝑥2 = 8, 𝑥3 = 14
σ3𝑖=1 𝑥𝑖 = 𝑥1 + 𝑥2 + 𝑥3
3
𝑥𝑖 = 5 + 8 + 14
𝑖=1
σ3𝑖=1 𝑥𝑖 = 27
SAMPLE OF SUMMATION NOTATION, ∑
𝑐 = 𝑐 + 𝑐 + ⋯ 𝑐 = 𝑛𝑐
𝑖=1
For example: 𝑐 = 5, 𝑛 = 10,
σ10
𝑖=1 5 = 10(5)
3
5 = 10 ∗ 5 = 50
𝑖=1
SAMPLE OF SUMMATION NOTATION, ∑
𝑐 = 𝑐 + 𝑐 + ⋯ 𝑐 = 𝑛𝑐
𝑖=1
𝑥ҧ = 𝑛𝑥ҧ
𝑖=1
SAMPLE OF SUMMATION NOTATION, ∑
For example:
𝑛
= 𝑐(𝑥1 +𝑥2 + ⋯ 𝑥𝑛 )
𝑖=1
𝑛
𝑐 𝑥𝑖
𝑖=1
𝑐 𝑥𝑖 = 2 𝑥𝑖 = 2 27 = 54
𝑖=1 𝑖=1
SAMPLE OF SUMMATION NOTATION, ∑
For example:
𝑛 𝑛 𝑛
(𝑎𝑥𝑖 + 𝑏𝑦𝑖 ) = 𝑎 𝑥𝑖 + 𝑏 𝑦𝑖
𝑖=1 𝑖=1 𝑖=1
𝑛 𝑛 𝑛
(2𝑥𝑖 + 4𝑦𝑖 ) = 2 𝑥𝑖 + 4 𝑦𝑖
𝑖=1 𝑖=1 𝑖=1
2 𝑥𝑖 + 4 𝑦𝑖 = 2 27 + 4( 18) = 54 + 72 = 126
𝑖=1 𝑖=1
SAMPLE OF DOUBLE SUMMATION
NOTATION, ∑
For example:
Consider the following data involving the variable 𝑥𝑖𝑗 , where 𝑖 is the subscript for row position and 𝑗 is the
subscript for column position
Column (j)
1 2 3
1 𝑥11 = 10 𝑥12 = 8 𝑥13 = 6
Row (i)
2 𝑥21 = 7 𝑥22 = 4 𝑥23 = 12
SAMPLE OF DOUBLE SUMMATION NOTATION, ∑
Definition
𝑛 𝑚
𝑥𝑖𝑗 = 𝑥11 + 𝑥12 … + 𝑥1𝑚 + 𝑥21 + 𝑥22 + ⋯ 𝑥2𝑚 + 𝑥31 + 𝑥32 + ⋯ + 𝑥3𝑚 + (𝑥𝑛1 + 𝑥𝑛2 + ⋯ + 𝑥𝑛 )
𝑖=1 𝑗=1
2 3
= 10 + 8 + 6 + 7 + 4 + 12 = 47
Definition
𝑛
𝑎. 𝑋𝑖 𝑌𝑖 2
𝑏. 2𝑋𝑖 + 𝑌𝑖 − 3
𝑖=2
𝑐. 𝑋2 𝑌
SUMMATION NOTATION, Σ
𝑥2 𝑥3 27𝑥4 16𝑥5
𝑏. + + +
9𝑦1 2𝑦2 25𝑦3 9𝑦4
STATISTICS – VIDEO CASE STUDIES
STATISTICS – VIDEO CASE STUDIES
STATISTICS – VIDEO CASE STUDIES
STATISTICS – VIDEO CASE STUDIES
STATISTICS – VIDEO CASE STUDIES
STATISTICS – VIDEO CASE STUDIES
THANK YOU
fmesteban@uc-bcf.edu.ph