Professional Documents
Culture Documents
MMW Reviewer Data Management
MMW Reviewer Data Management
MMW Reviewer Data Management
Data Management________________________________
Introduction to Statistics
The second level is the Ordinal level. In Exercise 17: State the level of measurement
this stage, the words or numbers are of each of the following.
assigned to objects to represent the rank 1. Blood type
Page 2 of 12
2. Doctors’ salary 4. Temperature in Fahrenheit
3. Latin honors 5. Student number
SAMPLING AND SAMPLING TecHNIQUes numbers to select the numbers for the elements
to include in the sample.
Sampling is the process of choosing elements,
such as person, objects or groups from a known In Systematic Skip Sampling, elements are listed
population of interest to be included in a study in numerically and then every “kth” element from
order to generate a fair result. Sampling is done the list is selected from a randomly selected
to reduce cost since it is less expensive conduct starting point.
survey in a sample than in whole population.
Another advantage of using a sample instead of a
population is that in sampling, data can be Stratified Random Sampling is a method where
obtained faster. Also, greater scope and accuracy the population is divided into sub-groups (called
are expected since the volume of work in strata) base on some well-known characteristics of
encoding and computing will be reduced. the population, such as age, gender or socio-
There are two types of sampling techniques: economic status; then take a random sample from
probability sampling and non-probability each strata. The selection of elements is then
sampling.
made separately from within each strata, usually
by random or systematic sampling methods.
Probability sampling or random sampling gives
all members of the population a known and In stratified random sampling, the number of
equal chance of being part in the sample. In samples per strata may be equal or proportional.
other words, the selection of individuals does not
affect the chance of anyone else in the Example 24: A study is conducted to 1,000
Simple random sampling is also called the lottery of the study. How many samples are needed per
or the fishbowl method. Simple random sampling year level using equal distribution?
Page 2 of 12
DescRIPTIVe MeASURES 3. 12, 10, 15, 14, 11, 18
4. 1, 9, 10, 2, 9, 4, 2, 1
Measures of Central Tendency are descriptive
5. 3, 6, 4, 4, 6, 3, 6, 3, 4
measures that are used to describe the center of a
Remark 34: Best use of the mean, median and
set of data, arranged numerically. The three
mode.
different types of “average” will be discussed, the
mean, the median and the mode. The mean is computed if the values are in
interval or ratio scale. The mean is influenced by
outliers that may be at the extremes of the data set.
The most commonly used to measure the central The median is used for ordinal scale. Unlike the
tendency is the mean. It is also called the mean, the median is not influenced by outliers at
computed average. It is defined as the sum of the the extremes of the data set. The mode is practical
values divided by the total number of items. for nominal data. In such cases, the mode may
not exist or may not be very meaningful.
The median is the middle value in a set of data.
The value which divides the distribution into two Now, consider the given set of data:
equal parts, with one half of the values is lower
Set A: 9, 12, 13, 15, 15, 17, 24
than the median and other half are higher than the
median. Set B: 7, 11, 15, 15, 17, 19, 21
The third measure on central tendency is the Set C: 11, 11, 15, 15, 15, 18, 20
mode. It is easily found by inspection. It is a point Using the measures of central tendency, it seems
on the distribution in which the frequency is that the sets are equal (that is, 15). But obviously,
higher than any other value. the sets of data are different. Like, the values of
Set A are more disperse or scattered than of Set B
A distribution with only one mode is called
and C. Using only these measures are not enough
unimodal while f it has two modes, then it is
to describe a given set of data, we need to use
called bimodal. If it has more than two modes, the
other descriptive measures to further describe a
distribution is called multimodal. The mode does
distribution.
not exist in a distribution if no value is repeated
where 𝑥̅ is the mean, 𝑥 are the values and 𝑛 is Definition 44: In negatively skewed
number
distribution, the order of the measures of central
of values. tendency would be the opposite of the positively
skewed distribution, with the mean being smaller
Exercise 39: Compute the average deviation of set
A in the data above. than the median, which is smaller than the mode.
Variance
HYPOTHESIS TESTING
Page 13 of 12
Page 14 of 12