Professional Documents
Culture Documents
(M5-MAIN) Data Management
(M5-MAIN) Data Management
Data Management
About This Module
This module is will teach the student basic statics like
types of data, organizing and presenting data, statistical
measures and probability.
Substopics
Learn the basic terms and applications of statistics
Learn how to organize data in tables and graphs. Also
learn how to interpret tables and graphs.
Learn how to create a frequency distribution table from a
given set of raw data. Learn how to interpret values in
the frequency distribution table.
Learn how to compute basic statistical measures and
their meaning.
Learn how computer and interpret other measures of
location
Learn the basics of classical probability.
You may stop here and proceed
to the subtopic materials or you
may continue and read through
the full course materials.
Data Management
Introduction to Statistics
Intended Learning
Outcomes
• Understand the steps in conducting a statistical
investigation.
• Understand basic statistical terms
• Differentiate between descriptive and inferential
statistics
Statistics
stat
Numerical data - representing counts or measurements
stat
stat
stat
Statistical Inference comprises those methods concerned
with the analysis of a subset of data leading to predictions or
inferences about the entire set of data.
• Infer the expected amount of rain for July next year based
on the average precipitation data for July in the past 30
years.
[1]
Ex. Select a random sample of 15 out of 324 participants.
[1]
Data Management
Ex.
• Net value of different companies
• Number of subscribers of different networks
Number of fruits supplied to each store
Number of fruits supplied to each branch per week
store branch per week
180
Branch Apples Oranges Total 160
Fruits 140
120
1 125 75 200 100
2 118 84 202 80
60
3 164 72 236 40
Apples Oranges
A pie graph is used to show the distribution or
proportions of parts to a whole
Ex.
• National budget allocation
• Caloric distribution
Recommended Diet Distribution Recommended Diet
Food Recommended
Diet
Fruit 36%
Protein 28%
Vegetables 14%
Dairy 13%
Grain 9%
35%
Month Growth Profit
30%
January -3% -2% 25%
10%
April 3% 25%
5%
May 7% 40% 0%
Growth Profit
Data Management
Each class is defined by its class limits, which are the smallest and highest
data value that can be included in the class.
Class boundaries are numbers used to separate the classes so that there
are no gaps in the frequency distribution.
12 15 16 8 10 17 25 34 42 18
24 18 45 33 38 45 40 3 20 12
10 10 27 16 37 45 15 16 26 32
35 8 14 18 15 27 19 32 6 12
14 20 10 16 14 28 31 21 25 8
32 46 14 15 20 18 8 10 25 22
Class Frequency
Boundaries 20
1.5 – 10.5 11
Frequency
15
10.5 – 19.5 21
10
19.5 – 28.5 13
5
28.5 – 37.5 8
37.5 – 46.5 7 0
1.5 – 10.5 10.5 – 19.5 19.5 – 28.5 28.5 – 37.5 37.5 – 46.5
Class Boundaries
1.5 0 20
Frequency
6 11 15
15 21 10
24 13
5
33 8
0
42 7 1.5 6 15 24 33 42 46.5
46.5 0 Class Mark
Frequency Polygon is the line graph of the frequencies against the class marks.
Close the polygon at the lowest and highest class boundaries.
70
60
Cumulative Frequency
Class Cumulative
Boundaries Frequency 50
1.5 0 40
30
10.5 11
20
19.5 32
10
28.5 45
0
37.5 53 1.5 10.5 19.5 28.5 37.5 46.5
46.5 60 Class Boundaries
𝒙𝒊 = 𝒙𝟏 + 𝒙𝟐 + 𝒙𝟑 + ⋯ + 𝒙𝒏
𝒊=𝟏
𝒙𝒊 = 𝒙𝟏 + 𝒙𝟐 + 𝒙𝟑 + 𝒙𝟒 + 𝒙𝟓 = 𝟏 + 𝟓 + 𝟐 + −𝟑 + 𝟎
𝒊=𝟏
𝟑
𝒊 ⋅ 𝒙𝒊 = 𝟏 𝒙𝟏 + 𝟐 𝒙𝟐 + 𝟑 𝒙𝟑 = 𝟏 𝟏 + 𝟐 𝟓 + 𝟑(𝟐)
𝒊=𝟏
These values are used to represent a set of data.
• Mean
• Median
• Mode
Population Mean
Given a population of size 𝑁, the population mean is defined as
σ𝑁𝑖=1 𝑥𝑖
𝜇=
𝑁
where the 𝑥𝑖 ’s are the observation values
Sample Mean
Given a sample of size 𝑛, the sample mean is defined as
σ𝑛𝑖=1 𝑥𝑖
𝑥ҧ =
𝑛
where the 𝑥𝑖 ’s are the observation values
The median is the middle number when all observations are arranged in
increasing or decreasing order.
Median
Arrange the observations from lowest to highest.
Let 𝑛 be the number of observations.
𝑛+1
If 𝑛 is odd, let 𝑘 = .
2
Then the median is the 𝑘th observation, 𝑥𝑘 .
𝑛
If 𝑛 is even, let 𝑘 = .
2
Then the median is the average of the 𝑘th and 𝑘 + 1 th observations,
𝑥𝑘 + 𝑥𝑘+1
.
2
.
The mode of a set of observations is that value which occurs
most often with the greatest frequency.
Find the mean, median, and mode of the given set of data:
26 29 29 29 30 31 34 41 42 42 47 48
Find the mean, median, and mode of the given set of data:
26 29 29 29 30 31 34 41 42 42 47 48
26+29+29+29+30+31+34+41+42+42+47+48
Mean: = 35.67
12
Find the mean, median, and mode of the given set of data:
26 29 29 29 30 31 34 41 42 42 47 48
31+34
Thus the median is : = 32.5
2
Find the mean, median, and mode of the given set of data:
26 29 29 29 30 31 34 41 42 42 47 48
Mode : 29
Weighted Mean
When each data point, 𝑥𝑖 , contribute different weights, 𝑤𝑖 , on the mean.
σ𝑁
𝑖=1 𝑤𝑖 𝑥𝑖
𝑥ҧ = 𝑁
σ𝑖=1 𝑤𝑖
Examples:
Grade point average of subjects with varying units.
Average daily balance of a bank account.
Mean computed from the frequency distribution table
Compute for the weighted mean grade
• Range
• Variance
• Standard Deviation
Range
the difference between the largest and smallest number in the set.
Example:
26 29 29 29 30 31 34 41 42 42 47 48
𝑟𝑎𝑛𝑔𝑒 = 48 − 26 = 22
Sometimes the range is not enough to describe the
distribution of values
𝑥−𝜇
𝑧=
𝜎
Since for this problem, the best would be one with the lowest relative time. Thus the
applicant is most suitable for Accounting.
Data Management
Let 𝐼𝑘 be the integer part of 𝑘 and 𝐷𝑘 be it’s decimal part. Identify the 𝐼𝑘th and
𝐼𝑘 + 1 st observations.
85
𝑟 = 85 𝑘= 60 + 1 = 51.85
100
𝐷𝑘 = 0.85 𝐼𝑘 = 51 𝐼𝑘 + 1 = 52 𝑥51 = 34 𝑥52 = 35
Example: Find 𝐷6 and 𝑄1 from the data set in the previous slide.
Find the 6th decile or 𝐷6 = 𝑃60 in the given data set below.
3 6 8 8 8 8 10 10 10 10
10 12 12 12 14 14 14 14 15 15
15 15 16 16 16 16 17 18 18 18
18 19 20 20 20 21 22 24 25 25
25 26 27 27 28 31 32 32 32 33
34 35 37 38 40 42 45 45 45 46
60
𝑟 = 60 𝑘= 60 + 1 = 36.6
100
𝐷𝑘 = 0.6 𝐼𝑘 = 36 𝐼𝑘 + 1 = 37 𝑥36 = 21 𝑥37 = 22
𝐷6 = 21 + 0.6 22 − 21 = 21.6
Find the 1st quartile or 𝑄1 = 𝑃25 in the given data set below.
3 6 8 8 8 8 10 10 10 10
10 12 12 12 14 14 14 14 15 15
15 15 16 16 16 16 17 18 18 18
18 19 20 20 20 21 22 24 25 25
25 26 27 27 28 31 32 32 32 33
34 35 37 38 40 42 45 45 45 46
25
𝑟 = 25 𝑘= 60 + 1 = 15.25
100
𝐷𝑘 = 0.25 𝐼𝑘 = 15 𝐼𝑘 + 1 = 16 𝑥15 = 14 𝑥16 = 14
𝑄1 = 14 + 0.25 14 − 14 = 14
Data Management
Probability
Intended Learning
Outcomes
• Find basic probabilities
• Apply properties of probabilities in solving
problems.
Classical Probability
Basic Concepts
Definition
A probability experiment is a chance process that leads to well-defined results called
outcomes.
Examples
Rolling Dice
Die 2
Die 1 1 2 3 4 5 6
1 (1,1) (1,2) (1,3) (1,4) (1,5) (1,6)
2 (2,1) (2,2) (2,3) (2,4) (2,5) (2,6)
3 (3,1) (3,2) (3,3) (3,4) (3,5) (3,6)
4 (4,1) (4,2) (4,3) (4,4) (4,5) (4,6)
5 (5,1) (5,2) (5,3) (5,4) (5,5) (5,6)
6 (6,1) (6,2) (6,3) (6,4) (6,5) (6,6)
Gender of Children
Use a tree diagram to find the sample space for the gender fo the children if a family has three
children. Use B for boy and G for girl.
Classical Probability
Definition
An event consists of a set of outcomes of a probability experiment.
Equally likely events are events that have the same probability of occuring.
Classical probability assumes that all outcomes in the sample space are equally likely to
occur.
Example
Drawing Cards
A card is drawn from an ordinary deck. Find the probability that the card is a queen.
Example
Drawing Cards
A card is drawn from an ordinary deck. Find the probability that the card is a queen.
The sample space consists of all the 52 different cards in the deck.
The event, E, of drawing a queen consist of 4 different queens, queen of hearts, queen of
diamonds, queen of spade, and queen of clubs.
4 1
P(E) = =
52 13
Example
Gender of Children
If a family has three children, find the probability that two of the children are girls.
Example
Gender of Children
If a family has three children, find the probability that two of the children are girls.
E = {BGG,GBG, GGB}
Thus
3
P(E) =
8
Probability Rules
Probability Rules
Probability Rules
Probability Rules
Probability Rules
Examples
Examples
Examples
Examples
Examples
Examples
Examples
1
When a single die is rolled, find the probability of getting a 9. Ans: 0
2
When a single die is rolled, what is the probability of getting a number less than 7.
Ans: 1
3
In the event of drawing a letter from the english alphabet,
1 what is the complement of the event of drawing a vowel? Ans: draw a consonant
2 What is the probability of drawing a vowel?
Examples
Examples
Examples