Professional Documents
Culture Documents
01 - Handout - 1 Stat PDF
01 - Handout - 1 Stat PDF
STATISTICS
Statistics is a science that involves the efficient use of numerical data relating to groups of individuals. It is
related to the collection, analysis, and interpretation of data, including data collection design in the form of
surveys and experiments. As widely known, statistics is defined as the science of collecting, organizing,
presenting, analyzing, and interpreting numerical data to help in the process of making decisions efficiently.
Statistics has wide contributions in different fields like marketing, accounting, quality control, sports,
education, politics, medicines, and others that require analyses, interpretations, and decision making.
Branches of Statistics
1. Descriptive Statistics – describes the information collected through numerical measurements,
charts, graphs, and tables. The main purpose of descriptive statistics is to provide an overview of
the information gathered.
2. Inferential Statistics – uses methods that take results obtained from a sample, extend them to the
population, and measures the reliability of the results.
Qualitative variables
Qualitative variables are often considered as the opposite of quantitative variables as it describes
certain types of information. For instance, qualitative variables provide items in a variety of qualities or
categories that may be ‘informal’ or even using features that are relatively obscure, such as warmth and
taste. Some examples of qualitative variables are; name, gender, address, religion, name of a school,
subject, and program.
Quantitative variables
A quantitative variable measures or identifies population or sample based on a numerical scale.
This type of variable can be analyzed using statistical methods; that is, the values obtained can be illustrated
using diagrams such as tables, graphs, and histograms. For example, a researcher will pose questions to
the respondents in the form of frequency, number, or percentage. The answers to these questions will be
in the form of numbers. Hence, researchers will then analyze the collected quantitative data to produce
relevant statistics.
Quantitative variables can be classified as either discrete or continuous. A discrete variable is a
variable that can assume finite, or, at most, countably infinite number of values, usually measured by
counting or enumeration. For example, the number of children in a family, the number of students in a class,
the maximum number of adults that can fit into a car, etc. typically, discrete variables results from counting.
A continuous variable is a variable that can assume infinitely many values corresponding to a line interval.
For example, time, temperature, weight, height, speed, etc. in other words, continuous variables result from
measuring something.
Levels of Measurements
The levels of measurement, also known as scales of measurement, refer to ways in which
variables/ quantities are defined or categorized. Each level of measurement has certain properties, which
in turn determines the appropriateness for the use of certain statistical analyses. There are four levels of
measurements: nominal, ordinal, interval, and ratio.
1. Nominal: This is a figurative labeling scheme in which the numbers serve only as labels or tags
for identifying and classifying objects. For example, the number assigned to the runner in a race
is nominal. Here each number is assigned to only one (1) runner, and the numbers are unique.
Another example is the Social Security Number. The numbers on a nominal scale do not reflect
the amount of the characteristic possessed by the object. For example, a person with a higher
SSN is not superior to those with lower value SSN. The only mathematical operation we can do is
counting on a nominal scale.
2. Ordinal: An ordinal scale is a ranking scale in which numbers(ranks) are assigned to objects to
indicate the relative extent to which the objects possess some characteristics. It indicates the
relative position, but it doesn’t indicate the magnitude of the difference between the objects.
Along with counting, we can calculate percentile, quartile, median, rank-order correlation, or other
summary statistics from ordinal data.
3. Interval: In an interval scale, the scale represents an equal distance between the values in the
characteristic being measured. The most important point is that on an interval scale, the location
of the zero point is not fixed. The difference between any two scale values is identical to the
difference between any other two adjacent values.
4. Ratio: Ratio scale possesses all the properties of nominal, ordinal, and interval scale and, in
addition, an absolute zero point. Common examples of ratio scale are height, weight, distance,
and age. All statistical techniques can be applied to ratio scale
Data Collection
Data collection is the process of gathering and measuring information on variables of interest in
an established systematic fashion that enables one to answer stated research questions, test hypotheses,
and evaluate outcomes.
There are two (2) sources of data, primary data, and secondary data. Primary data is the specific
information collected by the person who is doing the research. The researchers collect the data through
surveys, interviews, direct observations, and experiments. An example of primary data is the Census of
Population and Housing conducted by the Philippines Statistics Authority or market research projects
conducted by the private sector. Secondary data is any material that has been collected from published
records, such as newspapers, journals, research papers, and so on. Examples are details of imports and
exports that were compiled by the Philippines Statistics Authority and the Bureau of Customs for
declarations made by importers and exporters.
Secondary Data
Secondary data is data that has already been collected and is available from other sources. This data is
less costly and is easier to obtain than primary data. It may also be available if primary data cannot be
obtained for some reason.
Advantages of Secondary Data
• It is economical. It saves efforts and expenses.
• It is time-saving
• It provides a basis for comparison for the data that is collected by the researcher.
• It generates new insights from previous analyses of the primary data
Disadvantages of Secondary Data
• The accuracy of the data can be questionable.
• Data from secondary sources may not be appropriate to the needs of the user.
• Not all secondary data is readily available or inexpensive.
References:
Adhikari, S. (2018). Scales of measurements [Web log post]. Retrieved from
https://www.publichealthnotes.com/scales-of-measurement/
Adi Bhat. (2020). Data collection: Definition, methods, examples and design. Retrieved from Question Pro:
https://www.questionpro.com/blog/data-collection/
01 Handout 1 *Property of STI
student.feedback@sti.edu Page 4 of 5
IT2011
Adi Bhat. (2020). Open-ended questions: Definition, characteristics, examples and advantages. Retrieved
from Question Pro: https://www.questionpro.com/blog/what-are-open-ended-
questions/#Open_Ended_Questions_Definition
Formplus Blog. (2020). Primary vs. secondary data: 15 Key differences and similarities. Retrieved from:
https://www.formpl.us/blog/primary-secondary-data
Punzalan, J. (2018). Statistics and probability. Malaysia: Oxford Publishing.