Professional Documents
Culture Documents
602 Statistics - Tangean Marilou D
602 Statistics - Tangean Marilou D
602 Statistics - Tangean Marilou D
DEPARTMENT OF EDUCATION
CEBU TECHNOLOGICAL UNIVERSITY
CORNER MJ. CUENCO, R. PALMA STREET, CEBU CITY
WEEK 1
SELF INTRODUCTION
We are all gifted in a unique and important way. And it is our privilege and our adventure to
My name is Marilou D. Tangean. I live in Camp 8, Minglanilla, Cebu. I have always drawn
towards this profession. I’m always looking for means to improve my skills and acquire new
understandings of the needs of younger children not just in school but outside the school
setting. I often think of ideas on how I can give implements for the children to succeed,
Thus, by the help of pursuing and making time for the things that makes me happy , I am
very much honoured to continue this journey. For I know that I’m on the right path towards
the betterment of our education system and soon enough I’ll be close to where I want to be.
REPUBLIC OF THE PHILIPPINES
DEPARTMENT OF EDUCATION
CEBU TECHNOLOGICAL UNIVERSITY
CORNER MJ. CUENCO, R. PALMA STREET, CEBU CITY
WEEK 2
The importance of Statistics, Data, Population, Census, Sample, Parameter, Quantitative Data,
and Qualitative Data
STATISTICS- the practice or science of collecting and analyzing numerical data in large
quantities, especially for the purpose of inferring proportions in a whole from those in a
representative sample. The field of statistics is the science of learning from data. Statistical
knowledge helps you use the proper methods to collect the data, employ the correct
analyses, and effectively present the results. Statistics is a crucial process behind how we
make discoveries in science, make decisions based on data, and make predictions. Statistics
allows you to understand a subject much more deeply.
DATA- Data are the facts and figures that are collected, analyzed, and summarized for
presentation and interpretation. Data may be classified as either quantitative or qualitative.
POPULATION-In statistics the term “population” has a slightly different meaning from the
one given to it in ordinary speech. It need not refer only to people or to animate creatures.
In statistics, the term population is used to describe the subjects of a particular study—
everything or everyone who is the subject of a statistical observation. Populationscan be
large or small in size and defined by any number of characteristics, though these groups are
typically defined specifically rather than vaguely.
QUANTITATIVE DATA- Quantitative data is defined as the value of data in the form of counts
or numbers where each data-set has an unique numerical value associated with it. This data
is any quantifiable information that can be used for mathematical calculations and statistical
analysis, such that real-life decisions can be made based on these mathematical derivations.
Quantitative data is used to answer questions such as “How many?”, “How often?”, “How
much?”. This data can be verified and can also be conveniently evaluated using
mathematical techniques.
QUALITATIVE DATA- is defined as the data that approximates and characterizes. Qualitative
data can be observed and recorded. This data type is non-numerical in nature. This type of
data is collected through methods of observations, one-to-one interviews, conducting focus
groups, and similar methods. Qualitative data in statistics is also known as categorical data –
data that can be arranged categorically based on the attributes and properties of a thing or
a phenomenon. Qualitative data is important in determining the particular frequency of
traits or characteristics. It allows the statistician or the researchers to form parameters
through which larger data sets can be observed. Qualitative data provides the means by
which observers can quantify the world around them.
REPUBLIC OF THE PHILIPPINES
DEPARTMENT OF EDUCATION
CEBU TECHNOLOGICAL UNIVERSITY
CORNER MJ. CUENCO, R. PALMA STREET, CEBU CITY
WEEK 3
EXAMPLE:
2. LEVELS of MEASUREMENT
2.1 Nominal- The first level of measurement. In this level of measurement, the
numbers in the variable are used only to classify the data. In this level of
measurement, words, letters, and alpha-numeric symbols can be used. Suppose
there are data about people belonging to three different gender categories. In
this case, the person belonging to the female gender could be classified as F, the
person belonging to the male gender could be classified as M, and transgendered
classified as T. This type of assigning classification is nominal level of
measurement.
2.2 Ordinal- The second level of measurement. This level of measurement depicts
some ordered relationship among the variable’s observations. Suppose a student
scores the highest grade of 100 in the class. In this case, he would be assigned
the first rank. Then, another classmate scores the second highest grade of an 92;
she would be assigned the second rank. A third student scores a 81 and he would
be assigned the third rank, and so on. The ordinal level of measurement indicates
an ordering of the measurements.
2.3 Interval- The third level of measurement. The interval level of measurement not
only classifies and orders the measurements, but it also specifies that the
distances between each interval on the scale are equivalent along the scale from
low interval to high interval. For example, an interval level of measurement could
be the measurement of anxiety in a student between the score of 10 and 11, this
interval is the same as that of a student who scores between 40 and 41. A
popular example of this level of measurement is temperature in centigrade,
where, for example, the distance between 940C and 960C is the same as the
distance between 1000C and 1020C.
1. RANDOM SAMPLING- is a part of the sampling technique in which each sample has
an equal probability of being chosen. A sample chosen randomly is meant to be an
unbiased representation of the total population. If for some reasons, the sample
does not represent the population, the variation is called a sampling error. It is one of
the simplest forms of collecting data from the total population. Under random
sampling, each member of the subset carries an equal opportunity of being chosen as
a part of the sampling process. For example, the total workforce in organisations is
300 and to conduct a survey, a sample group of 30 employees is selected to do the
survey. In this case, the population is the total number of employees in the company
and the sample group of 30 employees is the sample.
7. NON-SAMPLING ERROR- is a statistical term that refers to an error that results during
data collection, causing the data to differ from the true values. Non-sampling errors
may be present in both samples and censuses in which an entire population is
surveyed. Non-sampling errors fall under two categories: random and systematic.
1. MEASURES OF DISPERSION
1.1 RANGE
The range is the difference between the largest and the smallest observation in the
data. The prime advantage of this measure of dispersion is that it is easy to calculate.
On the other hand, it has lot of disadvantages. It is very sensitive to outliers and does
not use all the observations in a data set. It is more informative to provide the
minimum and the maximum values rather than providing the range.
2. Find the interquartile range the following ages of the employees of an insurance
company:
48, 44, 59, 56, 61, 60, 65, 63, 72, 67, 79, 73
Before calculating you will need to arrange the various points of your data set.
To do this, begin by ordering your data points from least to greatest.
44, 48, 56, 59, 60, 61,63, 65, 67, 72, 73,79
Once your data points have been ordered like this, you can move onto the
next step.
Next, we have to split the data. First, take the 50% of the left side and next get
the the 50% of the right side.
44, 48, 56, 59, 60, 61 / 63, 65, 67, 72, 73, 79
Next, we have to get the median of the 1st group
56+59 = 115/2= 57.5-----Q1
67+72 = 139/2= 69.5----Q3
Q1-Q3= IQR
69.5--57.5= 12
Step 1: Find the mean value for the given data values
Step 2: Now, subtract mean value form each of the data value given
(Note: Ignore the minus symbol)
Here,
Σ represents the addition of values
2. A student took 5 exams in a class and had scores of 92, 75, 95, 90 and 98.
Find the mean deviation for his test scores.
92, 75, 95, 90, 98
= 92 + 75 + 95 + 90 + 98 = 450
= 450/5 = 90
REPUBLIC OF THE PHILIPPINES
DEPARTMENT OF EDUCATION
CEBU TECHNOLOGICAL UNIVERSITY
CORNER MJ. CUENCO, R. PALMA STREET, CEBU CITY
WEEK 10
1. VARIANCE
The variance is a measure of variability. It is calculated by taking the
average of squared deviations from the mean. It tells you the degree of
spread in your data set. The more spread the data, the larger the
variance is in relation to the mean. It is defined as the average of the
squared differences from the Mean.
The sample variance formula looks like this:
Formula Explanation
s2 = sample variance
Σ = sum of…
Χ = each value
x̄ = sample mean
n = number of values in the sample
X x- x̄ 2
(x- x̄)
8 -7.2 51.84
9 -6.2 38.44
10 -5.2 27.04
12 -3.2 10.24
17 1.8 3.24
18 2.8 7.84
18 2.8 7.84
19 3.8 14.44
20 4.8 23.04
21 5.8 33.64
2
x̄ = 15.2 Σ (x- x̄) = 217.6
2 --
=S 217.6 = 24.17
9
WEEK 11 & 12
1. STANDARD DEVIATION
The standard deviation is a summary measure of the differences of each observation from
the mean. If the differences themselves were added up, the positive would exactly balance
the negative and so their sum would be zero. Consequently the squares of the differences
are added. The sum of the squares is then divided by the number of observations minus one
to give the mean of the squares, and the square root is taken to bring the measurements
back to the units we started with. (The division by the number of observations minus one
instead of the number of observations itself to obtain the mean square is because “degrees
of freedom” must be used. In these circumstances they are one less than the total.
2. Compute for the variance and standard deviation of the given data.
Number of books delivered in XYZ Library from January to August 2020, respectively.
200, 300, 150, 400, 200, 350, 400, 250
X x- x̄ (x- x̄)2
200 -81.25 6, 601. 5625
300 18.75 351. 5625
150 -131.25 17, 226. 5625
400 118.75 14, 101. 5625
200 -81.25 6, 601. 5625
350 68.75 4, 726. 5625
400 118.75 18, 101. 5625
250 31.25 976. 5625
x̄ = 281.25
Variance 2 = 64, 687.5 = 9, 241.071
S
7
Standard Deviation S= 96.1304916692 or 96.13