2021 EDA - Module 1 - DESCRIPTIVE STATISTICS Lecture Oct. 19


EDA – MODULE 1

Descriptive Statistics
UC-CEA MODULE 1
ENGR. FM ESTEBAN
Network Gadgets LMS

TECH REQUIREMENTS FOR EDA ONLINE DISTANCE LEARNING


DESCRIPTIVE STATISTICS

Most of the statistical information in newspapers, magazines, company reports, and nowadays the internet consists of data summarized and presented in a form that is easy for the reader to understand.

Such summaries of data, which may be tabular, graphical, or numerical, are referred to as descriptive statistics.
COURSE OUTLINE

A. Basic Statistical Concept


1. Descriptive and Inferential Statistics
2. Definition of Terms
3. Summation Notations
STATISTICAL STUDIES

• In an experimental study, a variable of interest is first identified. Then one or more other variables are
identified and controlled so that data can be obtained about how they influence the variable of
interest. For example, a pharmaceutical firm might conduct an experiment to learn how a new
drug affects blood pressure. Nowadays, pharmaceutical companies study how vaccinating
individuals can act as a deterrent to the COVID-19 virus. Vaccination has since been mandated
as a means of controlling the spread of the virus, while safety protocols such as face masks
and social distancing remain in place.
• Nonexperimental, or observational, statistical studies make no attempt to control the variables of
interest. A survey is perhaps the most common type of observational study. For instance, in a personal
interview survey, research questions are first identified. Then a questionnaire is designed and
administered to a sample of individuals. Some restaurants use observational studies to obtain data
about their customers' opinions on the quality of food.
STATISTICS – SAMPLE DATA
DATA ACQUISITION ERRORS

• Managers and engineers should always be aware of the possibility of data errors in statistical studies. Using
erroneous data can be worse than not using any data at all. An error in data acquisition occurs when
the data value obtained is not equal to the true or actual value that would be obtained with a correct
procedure. For example, an interviewer might make a recording error, such as transposing digits
when writing down a value (recording an age of 24 instead of 42).
• Experienced data analysts take care in collecting and recording data to ensure that errors are not
made.
• Errors occur often during data acquisition. Misleading information can lead to bad judgments and
decisions.
STATISTICAL INFERENCE

• Many situations require information about a large group of elements (individuals, companies, voters,
households, products, customers, and so on). Because of time, cost, and other considerations, data can
often be collected from only a small portion of the group. The larger group of elements in a particular
study is called the population, and the smaller group is called the sample.
• The process of conducting a survey of the entire population is called a census. The process of
conducting a survey to collect data from a sample is called a sample survey.
• As one of its major contributions, statistics uses data from a sample to make estimates and test
hypotheses about the characteristics of a population through a process referred to as statistical
inference.
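
The distinction between a census and a sample survey can be sketched in Python. This is a minimal illustration under stated assumptions: the population of customer ages is made-up data generated for the sketch, and the sample size of 100 is arbitrary.

```python
import random
import statistics

random.seed(42)  # fixed seed so the sketch is repeatable

# Hypothetical population: ages of 10,000 customers (made-up data).
population = [random.randint(18, 80) for _ in range(10_000)]

# Census: use every element of the population.
population_mean = statistics.mean(population)

# Sample survey: use only a small random portion of the population.
sample = random.sample(population, 100)
sample_mean = statistics.mean(sample)

# Statistical inference: the sample mean serves as an estimate
# of the (usually unknown) population mean.
print(f"population mean: {population_mean:.2f}")
print(f"sample mean (n=100): {sample_mean:.2f}")
```

The two printed values will generally be close but not equal, which is why an estimate from a sample is normally reported together with a measure of its uncertainty.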
STATISTICS – INTRODUCTION

• “Workers must be equipped not simply with the technical know-how but the ability
to create, analyze, and transform information and to interact effectively with others.”

• … Dr. Alan Greenspan, former chairman of the Federal Reserve Board, speaking before a National Skills
Summit.
• Dr. Greenspan understands the importance of statistical tools and techniques to provide accurate and
timely information to make public statements that have the power to move global stock markets and
influence political thinking.
STATISTICS – SAMPLE DATA
VARIABLES
Types of variables

Qualitative
    Brand of PC
    Marital status
    Hair color

Quantitative
    Discrete
        Children in a family
        Strokes on a golf course
        TV sets owned
    Continuous
        Amount of income tax paid
        Weight of a student
        Yearly rainfall in Baguio
LEVELS OR SCALES OF MEASUREMENT
Levels of measurement

Level               Property                                          Examples
Nominal             Data may only be classified                       Jersey numbers of football players; make of car
Ordinal             Data are ranked                                   Your rank in class; team standings in intramurals
Interval or Range   Meaningful difference between values              Temperature; dress or shoe size
Ratio               Meaningful 0 point and ratio between values       Number of patients seen; number of sales calls; distance to class
SUMMATION NOTATION, ∑

Theorems:
1. The summation of the sum of two or more variables is
the sum of their summations.

$$\sum_{i=1}^{n} (X_i + Y_i + Z_i) = \sum_{i=1}^{n} X_i + \sum_{i=1}^{n} Y_i + \sum_{i=1}^{n} Z_i$$

2. If C is a constant, then

$$\sum_{i=1}^{n} C X_i = C \sum_{i=1}^{n} X_i$$

3. If C is a constant, then

$$\sum_{i=1}^{n} C = nC$$
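
The three theorems can be checked numerically. The following is a minimal Python sketch; the lists `x`, `y`, `z` and the constant `c` are made-up illustrative values, not data from the lecture.

```python
# Illustrative values only (assumed for this sketch, not from the lecture).
x = [2, 5, 7]
y = [1, 4, 6]
z = [3, 0, 8]
c = 4
n = len(x)

# Theorem 1: the sum of (x_i + y_i + z_i) equals the sum of the separate sums.
lhs_1 = sum(xi + yi + zi for xi, yi, zi in zip(x, y, z))
rhs_1 = sum(x) + sum(y) + sum(z)

# Theorem 2: a constant factor can be moved outside the summation.
lhs_2 = sum(c * xi for xi in x)
rhs_2 = c * sum(x)

# Theorem 3: summing a constant n times gives n * c.
lhs_3 = sum(c for _ in range(n))
rhs_3 = n * c

print(lhs_1 == rhs_1, lhs_2 == rhs_2, lhs_3 == rhs_3)  # True True True
```

A numerical check on one data set is not a proof, but it is a quick way to catch a misapplied rule before working a longer problem by hand.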
SUMMATION NOTATION, Σ

Sample Problems
1. Given $X_1 = -2$, $X_2 = 3$, $X_3 = 1$, $Y_1 = 4$, $Y_2 = 0$, $Y_3 = -5$, find the value of the
following:

a. $\sum_{i=1}^{3} X_i Y_i^2$

b. $\sum_{i=2}^{3} (2X_i + Y_i - 3)$

c. $\sum X^2 \sum Y$
SUMMATION NOTATION, Σ

Sample Problems – solution by tabulation and MS Excel

1. Given $X_1 = -2$, $X_2 = 3$, $X_3 = 1$, $Y_1 = 4$, $Y_2 = 0$, $Y_3 = -5$, find the value of the following:

a. $\sum_{i=1}^{3} X_i Y_i^2 = -7$

Solution:

i        x_i      y_i      y_i^2    x_i y_i^2
1        -2       4        16       -32
2        3        0        0        0
3        1        -5       25       25
Total                               -7
SUMMATION NOTATION, Σ
Sample Problems – solution by MS Excel
1. Given $X_1 = -2$, $X_2 = 3$, $X_3 = 1$, $Y_1 = 4$, $Y_2 = 0$, $Y_3 = -5$, find the
value of the following:

a. $\sum_{i=1}^{3} X_i Y_i^2$

i        x        y        y²       xy²
1        -2       4        16       -32
2        3        0        0        0
3        1        -5       25       25
Total (∑xy²)                        -7
SUMMATION NOTATION, Σ
Sample Problems
1. Given $X_1 = -2$, $X_2 = 3$, $X_3 = 1$, $Y_1 = 4$, $Y_2 = 0$, $Y_3 = -5$, find the value of
the following:

b. $\sum_{i=2}^{3} (2X_i + Y_i - 3) = -3$

Solution by MS Excel:

i        x        y        2x + y - 3
2        3        0        3
3        1        -5       -6
Total                      -3
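
The tabulated answers to parts (a) and (b) can also be reproduced in a few lines of Python, as an alternative to the spreadsheet. This is a sketch, not the lecture's method; the list names `X` and `Y` are chosen here for illustration.

```python
# Data from Sample Problem 1; index 0 holds X1/Y1, index 1 holds X2/Y2, and so on.
X = [-2, 3, 1]
Y = [4, 0, -5]

# (a) sum over i = 1..3 of X_i * Y_i^2
part_a = sum(x * y**2 for x, y in zip(X, Y))

# (b) sum over i = 2..3 of (2*X_i + Y_i - 3); slicing from index 1 skips i = 1
part_b = sum(2 * x + y - 3 for x, y in zip(X[1:], Y[1:]))

print(part_a)  # -7
print(part_b)  # -3
```

Note that Python lists are indexed from 0, so the lower limit i = 2 in part (b) corresponds to list index 1.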
SUMMATION NOTATION, ∑

$$\sum_{i=1}^{n} x_i = x_1 + x_2 + \cdots + x_n$$

For example, with $x_1 = 5$, $x_2 = 8$, $x_3 = 14$:

$$\sum_{i=1}^{3} x_i = x_1 + x_2 + x_3 = 5 + 8 + 14 = 27$$
SAMPLE OF SUMMATION NOTATION, ∑

$$\sum_{i=1}^{n} c = c + c + \cdots + c = nc$$

For example, with $c = 5$ and $n = 10$:

$$\sum_{i=1}^{10} 5 = 10(5) = 50$$
SAMPLE OF SUMMATION NOTATION, ∑

$$\sum_{i=1}^{n} c = c + c + \cdots + c = nc$$

For example, with $c = \bar{x}$:

$$\sum_{i=1}^{n} \bar{x} = n\bar{x}$$
SAMPLE OF SUMMATION NOTATION, ∑
For example:

$$\sum_{i=1}^{n} c x_i = c x_1 + c x_2 + \cdots + c x_n = c(x_1 + x_2 + \cdots + x_n) = c \sum_{i=1}^{n} x_i$$

For example, with $x_1 = 5$, $x_2 = 8$, $x_3 = 14$, $c = 2$:

$$c \sum_{i=1}^{n} x_i = 2 \sum_{i=1}^{3} x_i = 2(27) = 54$$
SAMPLE OF SUMMATION NOTATION, ∑

For example:

$$\sum_{i=1}^{n} (a x_i + b y_i) = a \sum_{i=1}^{n} x_i + b \sum_{i=1}^{n} y_i$$

$$\sum_{i=1}^{n} (2 x_i + 4 y_i) = 2 \sum_{i=1}^{n} x_i + 4 \sum_{i=1}^{n} y_i$$

For example, with $x_1 = 5$, $x_2 = 8$, $x_3 = 14$, $a = 2$, $b = 4$, $y_1 = 7$, $y_2 = 3$, $y_3 = 8$:

$$2 \sum_{i=1}^{3} x_i + 4 \sum_{i=1}^{3} y_i = 2(27) + 4(18) = 54 + 72 = 126$$
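
The worked example can be checked both ways in Python: by summing the combined terms directly and by factoring out the constants first. This is a minimal sketch that takes x = 5, 8, 14 (so that the sum of x is 27, as in the computation above) and y = 7, 3, 8.

```python
x = [5, 8, 14]   # sums to 27, matching the worked example
y = [7, 3, 8]    # sums to 18
a, b = 2, 4

# Left side: sum the combined terms a*x_i + b*y_i directly.
direct = sum(a * xi + b * yi for xi, yi in zip(x, y))

# Right side: factor the constants out and sum x and y separately.
factored = a * sum(x) + b * sum(y)

print(direct, factored)  # 126 126
```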
SAMPLE OF DOUBLE SUMMATION NOTATION, ∑
For example:

Consider the following data involving the variable $x_{ij}$, where $i$ is the subscript for row position and $j$ is the
subscript for column position.

                  Column (j)
                  1              2              3
Row (i)    1      x_11 = 10      x_12 = 8       x_13 = 6
           2      x_21 = 7       x_22 = 4       x_23 = 12
SAMPLE OF DOUBLE SUMMATION NOTATION, ∑

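Definition

$$\sum_{i=1}^{n} \sum_{j=1}^{m} x_{ij} = (x_{11} + x_{12} + \cdots + x_{1m}) + (x_{21} + x_{22} + \cdots + x_{2m}) + \cdots + (x_{n1} + x_{n2} + \cdots + x_{nm})$$

$$\sum_{i=1}^{2} \sum_{j=1}^{3} x_{ij} = x_{11} + x_{12} + x_{13} + x_{21} + x_{22} + x_{23} = 10 + 8 + 6 + 7 + 4 + 12 = 47$$

Definition

$$\sum_{i=1}^{n} x_{ij} = x_{1j} + x_{2j} + \cdots + x_{nj}$$

For the second column ($j = 2$):

$$\sum_{i=1}^{2} x_{i2} = x_{12} + x_{22} = 8 + 4 = 12$$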
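
The same data can be held as a list of rows in Python, which makes the double summation and the single-column summation easy to verify. This is a sketch; the variable name `x` and the nested-list layout are choices made here for illustration.

```python
# Rows are i = 1..2 and columns are j = 1..3, as in the data table.
x = [
    [10, 8, 6],
    [7, 4, 12],
]

# Double summation: add every x_ij over all rows and columns.
total = sum(x_ij for row in x for x_ij in row)

# Single summation over i with j fixed at 2: the second column (index 1).
col_2 = sum(row[1] for row in x)

print(total)  # 47
print(col_2)  # 12
```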
SUMMATION NOTATION, Σ

Sample Problems – Student to do


2. Write the summation notation of the following:

a. $\frac{5y_2}{3x_1^3} + \frac{3y_3}{2x_2^3} + \frac{7y_4}{5x_3^3} + \frac{4y_5}{3x_4^3}$

b. $\frac{x_2}{9y_1} + \frac{x_3}{2y_2} + \frac{27x_4}{25y_3} + \frac{16x_5}{9y_4}$
STATISTICS – VIDEO CASE STUDIES
THANK YOU
fmesteban@uc-bcf.edu.ph
