Professional Documents
Culture Documents
Week 2 Introduction To Statistics
Week 2 Introduction To Statistics
Week 2 Introduction To Statistics
Introduction to Statistics
by
Anath Rau Krishnan, PhD
Senior Lecturer at Labuan Faculty of International Finance, Universiti Malaysia Sabah
Learning outcome
By end of this lecture, students should be able to:
Define what statistics is.
Discuss the use of statistics in business.
Interpret the basic components of a data set.
Identify the measurement scales used to describe a variable.
Differentiate qualitative and quantitative variables.
Explain the difference between descriptive and inferential statistics.
List down the techniques of collecting data.
Use suitable sampling approach for the data collection purpose.
What is statistics?
Hotel
Breakf
ast
Room
size
Distance
(km)
Room
Temperat
ure
(Celsius)
Yes
Medium
No
Small
No
Large
0.5
Yes
Medium
0.8
-1
Yes
Very
large
Scales of Measurements
Nominal
Ordinal
Interval
Ratio
Nominal
The information on of variable is represented or described using labels, names, and categories
(linguistic terms)
Numeric codes may be used to represent the data
Example:
Students of a Labuan Faculty of International Finance are clustered by the management according to
their program, namely International Finance, Banking, Marketing, and so on.
Alternatively, a numeric code could be used for the school variable (e.g. 1 denotes International
Finance, 2 denotes Banking, 3 denotes Marketing, and so on).
Ordinal
The information of variable is represented or described using labels, names, and categories (linguistic terms), but the
ranking of these terms is meaningful.
Example:
Students of a university are classified by their position using a nonnumeric label such as:
Tutor < Lecturer < Senior Lecturer < Associate Professor < Professor
Alternatively, a numeric code could be used for the position variable (e.g. 1 denotes tutor, 2 denotes lecturer, 3 denotes
senior lecturer and so on).
Interval
The data have the properties of ordinal data, but the distance between two values has a meaningful distance.
Numeric values
Example: temperature;
difference between 40F and 50F = difference between 70F and 80F
Ratio
The data have all the properties of interval data and the ratio of two values is meaningful.
Variables such as distance, height, weight, and time use the ratio scale.
This scale must contain a zero value that indicates that nothing exists for the variable at the zero point.
Categorical Data
Labels or names used to identify an attribute of
each element
Often referred to as qualitative data
Use either the nominal or ordinal scale of
measurement
Can be either numeric or nonnumeric
Appropriate statistical analyses are rather limited
Scales of Measurement
Data
Categorical
Numeric
Nomina
l
Ordina
l
Quantitativ
e
Non-numeric
Numeric
Nominal Ordinal
Interval Ratio
Cross-Sectional Data
Cross-sectional data are collected at the same or
approximately the same point in time.
Example: data detailing the number of building
permits issued in November 2010 in each of the
counties of Ohio
Source:
Source: Energy
Energy Information
Information Administration,
Administration, U.S.
U.S. Department
Department of
of Energy,
Energy, May
May 2009.
2009.
Existing Sources /
Secondary data
Data Sources
Production records
Inventory records
Sales records
Credit records
Customer profile
Data Sources
Data Available From Selected Government
Agencies
Government
Agency
Some of the Data Available
Census Bureau
www.census.gov
Department of Commerce
www.doc.gov
Bureau of Labor Statistics
www.bls.gov
Data Sources
Statistical Studies Experimental
In experimental studies the variable of interest is
first identified. Then one or more other variables
are identified and controlled so that data can be
obtained about how they influence the variable of
interest.
The largest experimental study ever conducted is
believed to be the 1954 Public Health Service
experiment for the Salk polio vaccine. Nearly two
million U.S. children (grades 1- 3) were selected.
Data Sources
Descriptive statistics
Most of the statistical information in newspapers, magazines, company
reports, and other publications consists of data that are summarized and
presented in a form that is easy to understand.
Such summaries of data, which may be tabular, graphical, or numerical, are
referred to as descriptive statistics.
deal with statistical techniques that could be used to organized and
summarize the collected data
78
69
74
97
82
93
72
62
88
98
57
89
68
68
101
75
66
97
83
79
52
75
105
68
105
99
79
77
71
79
80
75
65
69
69
97
72
80
67
62
62
76
109
74
73
Tabular Summary:
Frequency and Percent Frequency
Percent
Frequency Frequency
4
2
50-59
26 (2/50)100
13
60-69
32
16
70-79
14
7
80-89
14
7
90-99
10
5
100-109
100
50
Graphical
Summary: Histogram
Example: Hudson Auto
18
16
Frequency
14
12
10
8
6
4
2
Parts
Cost ($)
5059 6069 7079 8089 9099 100-110
Inferential statistics
Population
the set of all elements of interest in a particular study
Sample
the process of using data obtained from a sample to make estimates and test
hypotheses about the characteristics of a population
Sampling approaches
Probability sampling
o Simple random sampling
o Systematic sampling
o Stratified sampling
o Cluster sampling
Non-probability sampling
o Convenient sampling
o Judgmental sampling
THE END