602 Statistics - Tangean Marilou D

REPUBLIC OF THE PHILIPPINES
DEPARTMENT OF EDUCATION
CEBU TECHNOLOGICAL UNIVERSITY
CORNER MJ. CUENCO, R. PALMA STREET, CEBU CITY
Name: Marilou D. Tangean Subject: Educ. 602: Statistics

Professor: Dr. Ricky Yabo / Dr. Butch Ocba MAED: Major in Administration & Supervision
WEEK 1
SELF INTRODUCTION
We are all gifted in a unique and important way. And it is our privilege and our adventure to
discover and unlock our own special light.
My name is Marilou D. Tangean. I live in Camp 8, Minglanilla, Cebu. I have always drawn
towards this profession. I’m always looking for means to improve my skills and acquire new
understandings of the needs of younger children not just in school but outside the school
setting. I often think of ideas on how I can give implements for the children to succeed,
Thus, by the help of pursuing and making time for the things that makes me happy , I am
very much honoured to continue this journey. For I know that I’m on the right path towards
the betterment of our education system and soon enough I’ll be close to where I want to be.
Name:Marilou D. Tangean Subject: Educ. 602: Statistics

Professor: Dr. Ricky Yabo/ Dr. Butch Ocba MAED: Major in Administration & Supervision
WEEK 2
The importance of Statistics, Data, Population, Census, Sample, Parameter, Quantitative Data,
and Qualitative Data
STATISTICS- the practice or science of collecting and analyzing numerical data in large
quantities, especially for the purpose of inferring proportions in a whole from those in a
representative sample. The field of statistics is the science of learning from data. Statistical
knowledge helps you use the proper methods to collect the data, employ the correct
analyses, and effectively present the results. Statistics is a crucial process behind how we
make discoveries in science, make decisions based on data, and make predictions. Statistics
allows you to understand a subject much more deeply.
DATA- Data are the facts and figures that are collected, analyzed, and summarized for
presentation and interpretation. Data may be classified as either quantitative or qualitative.
POPULATION-In statistics the term “population” has a slightly different meaning from the
one given to it in ordinary speech. It need not refer only to people or to animate creatures.
In statistics, the term population is used to describe the subjects of a particular study—
everything or everyone who is the subject of a statistical observation. Populationscan be
large or small in size and defined by any number of characteristics, though these groups are
typically defined specifically rather than vaguely.
CENSUS- A census is a survey conducted on the full set of observation objects

belonging to a given population or universe. A census is the complete
enumeration of a population or groups at a point in time with respect to well
defined characteristics: for example, population, production, traffic on
particular roads. In some connection the term is associated with the data
collected rather than the extent of the collection so that the term sample census has a
distinct meaning.
SAMPLE- refers to a smaller, manageable version of a larger group. It is a subset containing

the characteristics of a larger population. Samples are used in statistical testing when
population sizes are too large for the test to include all possible members or observations. A
sample should represent the population as a whole and not reflect any bias toward a specific
attribute.
PARAMETER- Parameters in statistics is an important component of any statistical analysis.

In simple words, a parameter is any numerical quantity that characterizes a given population
or some aspect of it. This means the parameter tells us something about the whole
population.
QUANTITATIVE DATA- Quantitative data is defined as the value of data in the form of counts
or numbers where each data-set has an unique numerical value associated with it. This data
is any quantifiable information that can be used for mathematical calculations and statistical
analysis, such that real-life decisions can be made based on these mathematical derivations.
Quantitative data is used to answer questions such as “How many?”, “How often?”, “How
much?”. This data can be verified and can also be conveniently evaluated using
mathematical techniques.
QUALITATIVE DATA- is defined as the data that approximates and characterizes. Qualitative
data can be observed and recorded. This data type is non-numerical in nature. This type of
data is collected through methods of observations, one-to-one interviews, conducting focus
groups, and similar methods. Qualitative data in statistics is also known as categorical data –
data that can be arranged categorically based on the attributes and properties of a thing or
a phenomenon. Qualitative data is important in determining the particular frequency of
traits or characteristics. It allows the statistician or the researchers to form parameters
through which larger data sets can be observed. Qualitative data provides the means by
which observers can quantify the world around them.

WEEK 3
1. QUANTITATIVE TYPES OF DATA

1.1 Discrete- A discrete quantitative variable is one that can only take specific
numeric values (rather than any value in an interval), but those numeric values
have a clear quantitative interpretation. Examples of discrete quantitative
variables are number of needle punctures, number of pregnancies and number of
hospitalizations.
1.2 Continuous- continuous data is quantitative data that can be measured. It has an
infinite number of possible values within a selected range.
EXAMPLE:
2. LEVELS of MEASUREMENT
2.1 Nominal- The first level of measurement. In this level of measurement, the
numbers in the variable are used only to classify the data. In this level of
measurement, words, letters, and alpha-numeric symbols can be used. Suppose
there are data about people belonging to three different gender categories. In
this case, the person belonging to the female gender could be classified as F, the
person belonging to the male gender could be classified as M, and transgendered
classified as T. This type of assigning classification is nominal level of
measurement.
2.2 Ordinal- The second level of measurement. This level of measurement depicts
some ordered relationship among the variable’s observations. Suppose a student
scores the highest grade of 100 in the class. In this case, he would be assigned
the first rank. Then, another classmate scores the second highest grade of an 92;
she would be assigned the second rank. A third student scores a 81 and he would
be assigned the third rank, and so on. The ordinal level of measurement indicates
an ordering of the measurements.
2.3 Interval- The third level of measurement. The interval level of measurement not
only classifies and orders the measurements, but it also specifies that the
distances between each interval on the scale are equivalent along the scale from
low interval to high interval. For example, an interval level of measurement could
be the measurement of anxiety in a student between the score of 10 and 11, this
interval is the same as that of a student who scores between 40 and 41. A
popular example of this level of measurement is temperature in centigrade,
where, for example, the distance between 940C and 960C is the same as the
distance between 1000C and 1020C.


WEEK 4
1. OBSERVATIONAL STUDY- is a study in which the researcher simply observes

the subjects without interfering. That is, the researcher has no control over
any treatments the subjects may be given or which groups the subjects may
be separated into, etc. They just observe the subjects and record data based
on their observations.
A very simple example would be a survey of some sort. Consider someone on
the busy street of Cebu City neighbourhood asking random people that pass
by how many pets they have, then taking this data and using it to decide if
there should be more pet food stores in that area. This is an observational
study, because the researcher is simply observing the answers of the survey
without influencing the outcome in any way.
2. EXPIREMENTAL STUDY- is the one where researchers introduce an

intervention and study the effects. Experimental studies are usually
randomized, meaning the subjects are grouped by chance. An experiment is a
procedure carried out to support, refute, or validate a hypothesis.
Experiments provide insight into cause-and-effect by demonstrating what
outcome occurs when a particular factor is manipulated. Experiments vary
greatly in goal and scale, but always rely on repeatable procedure and logical
analysis of the results.
3. CROSS SECTIONAL STUDY- A cross-sectional study involves looking at data

from a population at one specific point in time. The participants in this type of
study are selected based on particular variables of interest. Cross-sectional
studies are often used in developmental psychology, but this method is also
used in many other areas, including social science and education.
Some of the key characteristics of a cross-sectional study include:
 The study takes place at a single point in time

 It does not involve manipulating variables
 It allows researchers to look at numerous characteristics at once (age,
income, gender, etc.)
 It's often used to look at the prevailing characteristics in a given
population
 It can provide information about what is happening in a current
population
4. RETROSPECTIVE STUDY- In a retrospective study, the outcome of interest has
already occurred at the time the study is initiated. A retrospective study
design allows the investigator to formulate ideas about possible associations
and investigate potential relationships, although casual statements usually
should not be made. A retrospective study looks backwards and examines
exposures to suspected risk or protection factors in relation to an outcome
that is established at the start of the study.
5. PROSPECTIVE STUDY- watches for outcomes, such as the development of a

disease, during the study period and relates this to other factors such as
suspected risk or protection factor(s). The study usually involves taking a
cohort of subjects and watching them over a long period. The outcome of
interest should be common; otherwise, the number of outcomes observed
will be too small to be statistically meaningful (indistinguishable from those
that may have arisen by chance). All efforts should be made to avoid sources
of bias such as the loss of individuals to follow up during the study.
Prospective studies usually have fewer potential sources of bias and
confounding than retrospective studies.


WEEK 5
1. RANDOM SAMPLING- is a part of the sampling technique in which each sample has
an equal probability of being chosen. A sample chosen randomly is meant to be an
unbiased representation of the total population. If for some reasons, the sample
does not represent the population, the variation is called a sampling error. It is one of
the simplest forms of collecting data from the total population. Under random
sampling, each member of the subset carries an equal opportunity of being chosen as
a part of the sampling process. For example, the total workforce in organisations is
300 and to conduct a survey, a sample group of 30 employees is selected to do the
survey. In this case, the population is the total number of employees in the company
and the sample group of 30 employees is the sample.
2. SYSTEMATIC SAMPLING- is defined as a probability sampling method where the

researcher chooses elements from a target population by selecting a random starting
point and selects sample members after a fixed ‘sampling interval.’
3. CONVENIENCE SAMPLING- is a type of non-probability sampling method where the

sample is taken from a group of people easy to contact or to reach. For example,
standing at a mall or a grocery store and asking people to answer questions would be
an example of a convenience sample. This type of sampling is also known as grab
sampling or availability sampling. There are no other criteria to the sampling method
except that people be available and willing to participate. In addition, this type of
sampling method does not require that a simple random sample is generated, since
the only criterion is whether the participants agree to participate.
4. STRATIFIED SAMPLING- is a type of sampling method in which the total population is

divided into smaller groups or strata to complete the sampling process. The strata is
formed based on some common characteristics in the population data. After dividing
the population into strata, the researcher randomly selects the sample
proportionally.
5. CLUSTER SAMPLING- is a two-step procedure. First, the entire population is selected

and separated into different clusters. Random samples are then chosen from these
subgroups. For example, a researcher may find it difficult to construct the entire
population of customers of a grocery store to interview. However, they may be able
to create a random subset of stores; this represents the first step in the process. The
second step is to interview a random sample of the customers of those stores.
6. SAMPLING ERROR- is a statistical error that occurs when an analyst does not select a
sample that represents the entire population of data and the results found in the
sample do not represent the results that would be obtained from the entire
population. Sampling is an analysis performed by selecting a number of observations
from a larger population, and the selection can produce both sampling errors and
non-sampling errors.
7. NON-SAMPLING ERROR- is a statistical term that refers to an error that results during
data collection, causing the data to differ from the true values. Non-sampling errors
may be present in both samples and censuses in which an entire population is
surveyed. Non-sampling errors fall under two categories: random and systematic.


WEEK 6 and WEEK 7
1. MEASURES OF DISPERSION
1.1 RANGE
The range is the difference between the largest and the smallest observation in the
data. The prime advantage of this measure of dispersion is that it is easy to calculate.
On the other hand, it has lot of disadvantages. It is very sensitive to outliers and does
not use all the observations in a data set. It is more informative to provide the
minimum and the maximum values rather than providing the range.
1.2 INTERQUARTILE RANGE

Interquartile range is defined as the difference between the 25th and 75th percentile
(also called the first and third quartile). Hence the interquartile range describes the
middle 50% of observations. If the interquartile range is large it means that the middle
50% of observations are spaced wide apart. The important advantage of interquartile
range is that it can be used as a measure of variability if the extreme values are not
being recorded exactly (as in case of open-ended class intervals in the frequency
distribution). Other advantageous feature is that it is not affected by extreme values.
The main disadvantage in using interquartile range as a measure of dispersion is that
it is not amenable to mathematical manipulation.
1.3 QUARTILE DEVIATION

The Quartile Deviation (QD) is the product of half of the difference between the
upper and lower quartiles. Mathematically we can define as:
Quartile Deviation = (Q3 – Q1) / 2.
2. Find the interquartile range the following ages of the employees of an insurance
company:
48, 44, 59, 56, 61, 60, 65, 63, 72, 67, 79, 73
 Before calculating you will need to arrange the various points of your data set.
To do this, begin by ordering your data points from least to greatest.
 44, 48, 56, 59, 60, 61,63, 65, 67, 72, 73,79
Once your data points have been ordered like this, you can move onto the
next step.
 Next, we have to split the data. First, take the 50% of the left side and next get
the the 50% of the right side.
44, 48, 56, 59, 60, 61 / 63, 65, 67, 72, 73, 79
Next, we have to get the median of the 1st group
 56+59 = 115/2= 57.5-----Q1
 67+72 = 139/2= 69.5----Q3
 Q1-Q3= IQR
 69.5--57.5= 12


WEEK 8 and WEEK 9

1. MEAN DEVIATION
The mean deviation is defined as a statistical measure which is
used to calculate the average deviation from the mean value of the
given data set. The mean deviation of the data values can be easily
calculated using the below procedure.
Step 1: Find the mean value for the given data values
Step 2: Now, subtract mean value form each of the data value given
(Note: Ignore the minus symbol)
Step 3: Now, find the mean of those values obtained in step 2.
Mean Deviation Formula

The formula to calculate the mean deviation for the given data set is
given below.
Mean Deviation = [Σ |X – µ|]/N
Here,
Σ represents the addition of values
X represents each value in the data set
Μ represents the mean value of the data set
N represents the number of data values
|| represents the absolute value, which ignores the “-” symbol
2. A student took 5 exams in a class and had scores of 92, 75, 95, 90 and 98.
Find the mean deviation for his test scores.
92, 75, 95, 90, 98
= 92 + 75 + 95 + 90 + 98 = 450
= 450/5 = 90

WEEK 10
1. VARIANCE
The variance is a measure of variability. It is calculated by taking the
average of squared deviations from the mean. It tells you the degree of
spread in your data set. The more spread the data, the larger the
variance is in relation to the mean. It is defined as the average of the
squared differences from the Mean.
The sample variance formula looks like this:
Formula Explanation
s2 = sample variance
Σ = sum of…
Χ = each value
x̄ = sample mean
n = number of values in the sample
2. Calculate the sample variance of the following data:

8, 9, 10, 12, 17, 18, 18, 19, 20, 21
X x- x̄ 2
(x- x̄)
8 -7.2 51.84
9 -6.2 38.44
10 -5.2 27.04
12 -3.2 10.24
17 1.8 3.24
18 2.8 7.84
18 2.8 7.84
19 3.8 14.44
20 4.8 23.04
21 5.8 33.64
2
x̄ = 15.2 Σ (x- x̄) = 217.6
2 --
=S 217.6 = 24.17
9


WEEK 11 & 12
1. STANDARD DEVIATION
The standard deviation is a summary measure of the differences of each observation from
the mean. If the differences themselves were added up, the positive would exactly balance
the negative and so their sum would be zero. Consequently the squares of the differences
are added. The sum of the squares is then divided by the number of observations minus one
to give the mean of the squares, and the square root is taken to bring the measurements
back to the units we started with. (The division by the number of observations minus one
instead of the number of observations itself to obtain the mean square is because “degrees
of freedom” must be used. In these circumstances they are one less than the total.
2. Compute for the variance and standard deviation of the given data.
 Number of books delivered in XYZ Library from January to August 2020, respectively.
200, 300, 150, 400, 200, 350, 400, 250
X x- x̄ (x- x̄)2
200 -81.25 6, 601. 5625
300 18.75 351. 5625
150 -131.25 17, 226. 5625
400 118.75 14, 101. 5625
200 -81.25 6, 601. 5625
350 68.75 4, 726. 5625
400 118.75 18, 101. 5625
250 31.25 976. 5625
x̄ = 281.25
Variance 2 = 64, 687.5 = 9, 241.071
S
7
Standard Deviation S= 96.1304916692 or 96.13

602 Statistics - Tangean Marilou D

Uploaded by

Copyright:

Available Formats

You might also like

602 Statistics - Tangean Marilou D

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

602 Statistics - Tangean Marilou D

Uploaded by

Copyright:

Available Formats

REPUBLIC OF THE PHILIPPINES

Name: Marilou D. Tangean Subject: Educ. 602: Statistics

discover and unlock our own special light.

Name:Marilou D. Tangean Subject: Educ. 602: Statistics

CENSUS- A census is a survey conducted on the full set of observation objects

SAMPLE- refers to a smaller, manageable version of a larger group. It is a subset containing

PARAMETER- Parameters in statistics is an important component of any statistical analysis.

Name: Marilou D. Tangean Subject: Educ. 602: Statistics

1. QUANTITATIVE TYPES OF DATA

REPUBLIC OF THE PHILIPPINES

Name: Marilou D. Tangean Subject: Educ. 602: Statistics

1. OBSERVATIONAL STUDY- is a study in which the researcher simply observes

2. EXPIREMENTAL STUDY- is the one where researchers introduce an

3. CROSS SECTIONAL STUDY- A cross-sectional study involves looking at data

Some of the key characteristics of a cross-sectional study include:

 The study takes place at a single point in time

5. PROSPECTIVE STUDY- watches for outcomes, such as the development of a

REPUBLIC OF THE PHILIPPINES

Name: Marilou D. Tangean Subject: Educ. 602: Statistics

2. SYSTEMATIC SAMPLING- is defined as a probability sampling method where the

3. CONVENIENCE SAMPLING- is a type of non-probability sampling method where the

4. STRATIFIED SAMPLING- is a type of sampling method in which the total population is

5. CLUSTER SAMPLING- is a two-step procedure. First, the entire population is selected

REPUBLIC OF THE PHILIPPINES

Name: Marilou D. Tangean Subject: Educ. 602: Statistics

1.2 INTERQUARTILE RANGE

1.3 QUARTILE DEVIATION

REPUBLIC OF THE PHILIPPINES

Name: Marilou D. Tangean Subject: Educ. 602: Statistics

WEEK 8 and WEEK 9

Step 3: Now, find the mean of those values obtained in step 2.

Mean Deviation Formula

Mean Deviation = [Σ |X – µ|]/N

X represents each value in the data set

Μ represents the mean value of the data set

N represents the number of data values

|| represents the absolute value, which ignores the “-” symbol

Name: Marilou D. Tangean Subject: Educ. 602: Statistics

2. Calculate the sample variance of the following data:

REPUBLIC OF THE PHILIPPINES

Name: Marilou D. Tangean Subject: Educ. 602: Statistics

You might also like