Professional Documents
Culture Documents
Chapter2 - LectureNotes STA
Chapter2 - LectureNotes STA
Data or data set is a collection of observations on one or more variables. In other words,
variables produce data. The information in Table 2.1 below is called the data or a data set.
Aina 47.8
Zahara 50.2
Faliq 78.6
Suthan 65.5
Ali 67.8
Ahmad 60.4
Izara 52.5
Aishah 60.5
Naufal 80.8
Afiqah 55.6
Samantha 45.6
Aswad 90.8
Data can be gathered either primarily or secondarily. Primary data is the first hand
information which is gathered by the researcher himself or herself using methods of data
collection such as personal interviews, questionnaires (mailed or administered) or
observations. Secondary data is a second hand information that is already being collected,
recorded or published by someone other than the person who collected it. This data can
usually be found in magazines, bulletins, journals, companies’ reports, newspapers, books
and internet.
In the Table 2.2 below shows the advantages and disadvantages between primary and
secondary data.
STA111 | Fundamentals of Statistics
Table 2.2: Advantages and disadvantages for primary and secondary data
Primary Data Secondary Data
Advantages
Data gathered are more accurate. Less time, money and efforts required since
the data are already available.
No reproduction errors since data are Data helps to decide what further research
gathered the first time. needs to be done.
Disadvantages
Requires more time and effort to gather the Leads to reproduction error since data are
data. gathered the second time.
Most of primary data are expensive to gather. Some data are very difficult to be obtained
since some of them are confidential and not
to be disclosed.
Variables can be divided into two categories which are qualitative variable and quantitative
variable.
A variable that can be measured numerically or counted. The volume of water consumed per
day by a person is a quantitative variable. The number of passengers in a bus is a quantitative
variable. Quantitative variables can be divided into two which are discrete quantitative and
continuous quantitative.
STA111 | Fundamentals of Statistics
If the values of a quantitative variable are countable, the variable is a discrete quantitative.
The discrete variable can assume only certain values with no other sub-values in between
and often obtained by counting exactly (i.e countable and finite) such as number of vans
sold, number of cars pass through the Subang toll plaza in one hour, number of students in a
classroom.
Continuous variables can assume any numerical value within interval of values. It is a
variable with unlimited number of values that may be measured to some possible degree of
accuracy (i.e approximate result and infinite number). Thus, a person’s weight is a
continuous variable as it can take any values such as 47.5kg, 87.6kg, 75.8kg etc. Other
examples are distance travelled by a car, height of a building, length of a bridge, monthly
spending on fuel in RM.
Answers:
(a) Discrete quantitative
(b) Qualitative
(c) Continuous quantitative
(d) Discrete quantitative
(e) Continuous quantitative
All collected data are measured in some form. For example, we measure discrete
quantitative data such as the number of students in a class by the process of counting. Data
can be measured using four types of scales which are nominal scale, ordinal scale, interval
scale and ratio scale.
Nominal scale is known as categorical scale. It classifies data in the form of labels or
categories such as gender in which it is given as male or female, types of vehicles which are
bus, car, van or motorcycle. In general, nominal scale classifies a qualitative data. Usually,
each data will be assigned by a number to distinguish them from each other. However, since
this scale does not have any numeric value, any mathematical operations (addition,
subtraction, multiplication, division) cannot be conducted and are meaningless. For
example,
Note: In the context of the measurement scale theory, nominal scale can be classified as a
weakest tool of the variable measurement.
2.4.2 Ordinal Scale
The ordinal scale contains a data that can be placed in a rank or order system. We normally
use ranks to give ordering to the data. Sometimes, we can see that numbers are assigned to
the variable so that the comparisons of greater than and less than can be made, in addition
to equality and inequality. However, mathematical operations are still meaningless for this
type of scale. For example,
For interval scale, the zero value is an arbitrary zero point and not a true or a fixed point. This
means that the zero value has no meaning. Therefore, it allows the measurement scale on
either side of zero. In the context of mathematical operations, only addition and subtraction
can be performed. For example, we cannot say that there is no date for today and 0 degree
Celsius does not mean there is no temperature. Another example:
Ratio scale is an ordered or ranked scale with meaningful difference between measurements
and zero is true, a fixed point and has a meaning. Since zero is meaningful in ratio scale, this
scale allows the researchers to determine the smallest or highest value as well as their ratio
differences. In the context of mathematical operations, all operations (addition, subtraction,
multiplication and division) are applicable and can be performed. For example,
2. Weight
Explanation: 0kg of weight means no weight and 60kg of weight means that it is 30 units
heavy than 30kg of weight and 60kg of weight is twice as heavy as 30kg of weight.
Note: In the context of the measurement scale theory, ratio scale can be classified as a
strongest tool of the variable measurement.
Strongest
Weakest
Example:
Classify each of the following responses by type of data (qualitative, discrete quantitative
and continuous quantitative) and type of measurement scale (nominal, ordinal, interval,
ratio)