(Statistics - 101) : Chapter One
[Statistics|101]
Chapter One
Motivation
Statistics, for some time now, has been tagged as the sexiest field there is.
This is owed to the many disciplines the field reaches. With data valued so
highly, the field surely touches everyone. All of us need processed data,
called information, to analyze further, make plans, and ultimately decide.
"I keep saying the sexy job in the next ten years will be statisticians."
- Varian, 2009
[Statistics|101] Introduction
Learning Objectives
By the end of this module, each student should be able to:
DEFINITION OF STATISTICS
In its singular sense, In its plural sense,
DATA empowers!
[Figure: STATISTICS turns massive data volumes into pertinent information, linking Data Processing to Decision Making]
STATISTICS: SCIENCE AND ART
Statistics is the most important science in the whole world; for upon it
depends the practical application of every science and of every art; the
one science essential to all political and social administration, all
education, and all organization based on experience, for it only gives
results of our experience. - Florence Nightingale
Statistics is the art and science of uncertainty.
BRIEF HISTORY OF STATISTICS
etymology
The original meaning of the word statistics is "science of states," and in its early
existence it was also called political arithmetic. Although the term came into use
only in the 18th century, the practice of collecting and analyzing data
dates back to early biblical times.
The term was first used in statisticum collegium (Modern Latin), meaning a lecture
course on state affairs. The first statisticians were the statista (Italian), meaning
one skilled in statecraft, or politicians. In 1770, statistics was formally defined as a
science dealing with data about the condition of a state or a community
(political arithmetic). The German word Statistik was coined by the political scientist
Gottfried Achenwall (1719-1772) in his work Vorbereitung zur
Staatswissenschaft. At present, the applications of statistics have expanded
from political science to almost all fields of knowledge.
BRIEF HISTORY OF STATISTICS
probability theory
Probability theory was inspired mainly by games of chance. The first mathematical analyses of
games of chance were undertaken by Italian mathematicians in the 16th century. The main results
were initially obtained by Gerolamo Cardano (1501-1576), who recorded them in a book about 1565.
There were many wrong propositions in this book. Although Cardano later realized the errors and
eventually corrected them, he did not remove the false statements from the book. The book was
not published at the time because Cardano kept it secret. Accused of being a heretic in 1570, he
was arrested, dismissed, and denied the rights to lecture publicly and to have his books printed.
The theory of probability had already been discovered by Pierre de Fermat (1601-1665), Blaise
Pascal (1623-1662), and Christiaan Huyghens (1629-1695) by the time Cardano's book was
rediscovered almost a hundred years later. The Chevalier de Méré, a notorious gambler who was a
close friend of Pascal, prompted mathematicians to study probability theory by posing a gambling
problem that offered them a real challenge. This led to a lengthy correspondence
between Fermat and Pascal, which brought about not only a solution to the problem at hand but
also solutions to other, more general problems. In 1655, the young Dutchman Huyghens learned of
the results of Fermat and Pascal, though not their proofs and arguments, and he wrote a short
book on probability theory called How to Reason in Dice Games. It is fair to say that this book
represents the real beginning of probability theory as a mathematical subject.
BRIEF HISTORY OF STATISTICS
inferential statistics
The successful development of probability theory did not immediately lead to a
theory of inferential statistics. In 1346, the world faced the most infectious and lethal
disease of all time, the black plague. After that time, the plague recurred regularly
until 1712. Statistics (or political arithmetic as it was named in its first years of existence)
was then the art of deducing estimates and properties of quantities which cannot be
observed directly. The pioneer seems to be an English tradesman and haberdasher of
small wares, John Graunt (1620-1674). In 1661, he published a book called Natural and
Political Observations upon Bills of Mortality. He applied the bills to estimate birth
mortality, the number of inhabitants, the number of years to recover the former
population level after a plague epidemic, etc. His methods were bold but dubious, yet his results
agreed surprisingly well with later and more reliable observations. Soon the
mathematicians involved in the study of probability theory took up the challenge to
invent rigorous methods to estimate unknown quantities, particularly to compute
reliable life-tables, which became an important tool for the emerging life insurance
companies.
POPULATION VS. SAMPLE
Population is the collection of all
elements under consideration in a
statistical inquiry.
[Figure: population and sample]
example
A manufacturer of kerosene heaters wants to determine if customers are
satisfied with the performance of its heaters. Towards this goal, 5,000
of its 200,000 customers are contacted and each is asked, "Are you
satisfied with the performance of the kerosene heater you purchased?"
Another example
Research Problem:
What is the average expenditure of households in Metro Manila?
REMARKS
The elements of the population can be individuals, objects, animals,
geographic areas, and so on.
One may ask: "Why do we have to get a sample from the population
when the population is available?" Sampling is recommended, and in
some cases unavoidable, for at least two reasons: studying the whole
population is very costly in terms of time and resources, and sometimes
it is infeasible to study the whole population.
Given that we are only dealing with the sample, how can we
be sure that the inferences about the population are close to the true
value or scenario of the population? The statistical process is carried out
quantitatively and rigorously so that error is minimized or at least
measured. This is possible because Statistics has its foundations in the
theory of probability.
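As a rough illustration of the remark on sampling error, the sketch below draws a simple random sample from a made-up population of household expenditures and compares the sample mean with the population mean. The population values, the seeds, the sample size, and the function name `sample_mean_error` are all assumptions chosen for illustration; none of them come from this module.

```python
import random
import statistics


def sample_mean_error(population, sample_size, seed=0):
    """Return (population mean, sample mean, absolute error) for one simple random sample."""
    rng = random.Random(seed)
    sample = rng.sample(population, sample_size)  # sampling without replacement
    pop_mean = statistics.mean(population)
    samp_mean = statistics.mean(sample)
    return pop_mean, samp_mean, abs(samp_mean - pop_mean)


# Hypothetical population: monthly expenditures (in pesos) of 10,000 households.
_rng = random.Random(42)
population = [_rng.uniform(5_000, 50_000) for _ in range(10_000)]

pop_mean, samp_mean, error = sample_mean_error(population, sample_size=500)
print(f"population mean: {pop_mean:,.2f}")
print(f"sample mean:     {samp_mean:,.2f}")
print(f"absolute error:  {error:,.2f}")
```

Only 5% of the households are examined here, yet the sample mean typically lands close to the population mean. Re-running with other seeds or sample sizes shows how the error varies from sample to sample, which is what probability theory lets us quantify.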
VARIABLE, OBSERVATION, AND DATA
sex | height (in m) | no. of children | favorite color | degree program | age
F   | 1.63          | 0               | blue           | BS Stat        | 20
M   | 1.47          | 1               | red            | MS Stat        | 26
M   | 2.32          | 0               | pink           | BS Econ        | 19
F   | 2.06          | 0               | green          | BA History     | 20
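A minimal sketch of how the three terms on this slide fit together, using the four records recoverable from the figure: each dictionary key plays the role of a variable, each value is an observation of that variable, and the whole collection is the data. The key names (for example `height_m`) are labels chosen for illustration and are not part of the module.

```python
# One dictionary per person; keys are variables, values are observations.
data = [
    {"sex": "F", "height_m": 1.63, "children": 0,
     "favorite_color": "blue", "degree_program": "BS Stat", "age": 20},
    {"sex": "M", "height_m": 1.47, "children": 1,
     "favorite_color": "red", "degree_program": "MS Stat", "age": 26},
    {"sex": "M", "height_m": 2.32, "children": 0,
     "favorite_color": "pink", "degree_program": "BS Econ", "age": 19},
    {"sex": "F", "height_m": 2.06, "children": 0,
     "favorite_color": "green", "degree_program": "BA History", "age": 20},
]

# The variables are the characteristics recorded for every person.
variables = list(data[0].keys())

# The distinct observations seen for one variable across the data.
observed_sexes = sorted({row["sex"] for row in data})

print(variables)
print(observed_sexes)
```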
VARIABLE, OBSERVATION, AND DATA
Variable Possible Observations
EXAMPLE
The Office of Admissions is studying the relationship between the score
in the entrance examination during application and the general
weighted average (GWA) upon graduation among graduates of the
university from 2000 to 2005.
SUMMARY MEASURE
A summary measure is a single numeric
figure that describes a particular feature
of the whole collection.
PARAMETER VS. STATISTIC
Parameter is a summary measure describing
a specific characteristic of the population. It is
computed using population data.
[Figure: a parameter describes the population; a statistic describes the sample]
example
A manufacturer of kerosene heaters determined that 172,000 customers
out of the 200,000 were satisfied with the performance of their heaters. A
competitor wants to verify this claim by asking a sample of
10,000 customers the same question: "Are you satisfied with the
performance of the kerosene heater you purchased?" It was revealed that
6,450 customers were satisfied.
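The two summary measures in this example can be worked out directly. The sketch below is only an illustration: the helper name `proportion` is hypothetical, and the counts are the ones quoted in the example. The figure based on all 200,000 customers is a parameter, while the figure based on the competitor's 10,000 sampled customers is a statistic.

```python
def proportion(count, total):
    """Share of `total` elements that have the characteristic of interest."""
    return count / total


# Parameter: computed from the whole population of 200,000 customers.
parameter = proportion(172_000, 200_000)

# Statistic: computed from the competitor's sample of 10,000 customers.
statistic = proportion(6_450, 10_000)

print(f"population proportion satisfied (parameter): {parameter:.1%}")
print(f"sample proportion satisfied (statistic):     {statistic:.1%}")
```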
EXERCISES
Mr. Donaldo Chan, a candidate for Vice Mayor in Orion, Bataan,
wants to find out if there is a need to intensify his campaign efforts
against his opponents. He requested the services of a group of
students to interview 1,000 of the 3,000 registered voters of Orion,
Bataan. The survey results showed that 75% of the 1,000 voters in the
sample will vote for him as Vice Mayor.
EXERCISES
The average weekly allowance of students last year at a private high
school was Php 600.00, based on an enrollment of 1,080
students. The third-year students, who did not have this information,
interviewed 50 students and found their average weekly allowance
last year to be Php 550.00.
FIELDS OF STATISTICS
There are two major areas or fields of Statistics:
Mathematical Statistics and Applied Statistics.
MATHEMATICAL STATISTICS
Mathematical (or Theoretical) Statistics is
concerned with the development of the
mathematical foundations of the methods
used in Applied Statistics.
APPLIED STATISTICS
Applied Statistics is concerned with the
procedures and techniques used in the
collection, presentation, organization,
analysis, and interpretation of data.
ACTIVITY!!
In A Mu
What's your
favorite question?
DESCRIPTIVE STATISTICS
Descriptive Statistics comprises those
methods concerned with the collection,
description, and analysis of a set of data
without drawing conclusions or inferences
about a larger set.
EXAMPLE
Given the daily sales performance for a product for the
previous year, we can draw a line chart or a column
chart to emphasize the upward/downward movement of
the series. Likewise, we can use descriptive statistics to
calculate a quantity index per quarter to compare the
sales by quarter for the previous year.
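One way the quarterly comparison could be computed is sketched below. It is a rough illustration under stated assumptions: the sales figures are invented, monthly totals stand in for the daily series to keep the example short, and the "quantity index" is taken to be a simple quantity relative with the first quarter as the base (base = 100). The function names are hypothetical.

```python
from collections import defaultdict


def quarterly_totals(monthly_units):
    """Sum units sold per quarter from 12 monthly figures (January to December)."""
    totals = defaultdict(int)
    for month_index, units in enumerate(monthly_units):
        quarter = month_index // 3 + 1  # months 0-2 -> Q1, 3-5 -> Q2, ...
        totals[quarter] += units
    return dict(totals)


def quantity_index(totals, base_quarter=1):
    """Express each quarter's quantity relative to the base quarter (base = 100)."""
    base = totals[base_quarter]
    return {quarter: 100 * units / base for quarter, units in totals.items()}


# Hypothetical units sold per month in the previous year.
monthly_units = [120, 130, 150, 160, 170, 150, 140, 130, 120, 180, 200, 220]

totals = quarterly_totals(monthly_units)
index = quantity_index(totals)
for quarter in sorted(index):
    print(f"Q{quarter}: {totals[quarter]} units, index {index[quarter]:.1f}")
```

Everything here only describes the year that was recorded; no conclusion is drawn about any other period, which is what keeps the exercise within descriptive statistics.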
INFERENTIAL STATISTICS
Inferential Statistics comprises those methods
concerned with making predictions or
inferences about a larger set of data using
only the information gathered from a subset
of this larger set.
The main concern is not merely to describe but actually predict and
make inferences based on the information gathered.
EXAMPLE
Election polls make use of inferential statistics to predict
the winners for the coming election based on data
collected from a sample of registered voters.
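To make the idea of inferring from a sample concrete, the sketch below estimates a candidate's vote share from a hypothetical poll and attaches an approximate 95% interval. The poll counts are invented, the function name `proportion_interval` is hypothetical, and the interval uses the usual normal approximation, which assumes a simple random sample that is reasonably large.

```python
import math


def proportion_interval(successes, n, z=1.96):
    """Sample proportion with an approximate confidence interval (normal approximation)."""
    p_hat = successes / n
    standard_error = math.sqrt(p_hat * (1 - p_hat) / n)
    margin = z * standard_error
    return p_hat, p_hat - margin, p_hat + margin


# Hypothetical poll: 540 of 1,200 sampled registered voters favor the candidate.
p_hat, low, high = proportion_interval(successes=540, n=1200)

print(f"estimated vote share: {p_hat:.1%}")
print(f"approximate 95% interval: {low:.1%} to {high:.1%}")
```

The sample proportion on its own is a descriptive figure; the step to a statement about all registered voters, together with a measure of uncertainty, is the inferential part.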
DESCRIPTIVE VS. INFERENTIAL
Descriptive Statistics | Inferential Statistics
A bowler wants to find his bowling average for the past 12 games. | A bowler wants to estimate his chance of winning a game based on his current season averages and the averages of his opponents.
A politician wants to know the exact number of votes he received in the last election. | A politician would like to estimate, based on an opinion poll, his chance of winning in the upcoming election.
EXERCISES
Identify whether each of the following situations belongs to the field of
Descriptive or Inferential Statistics.
a. A badminton player wants to know his average score for the past 10 games.
b. Wincy wants to determine the variability of his six exam scores in Algebra.
c. Pat would like to forecast the average monthly electricity bill she will pay for
the next year based on her average monthly bill in the past year.
d. Novie wants to determine the proportion spent on transportation during the
past four months using the daily records of expenditure that she keeps.
e. Vinse wishes to determine the number of families not eating three times a
day in the sample used for their survey.
f. A politician wants to determine the total number of votes his rival obtained
in the past election based on his copies of the tally sheet of electoral returns.
g. A politician wants to determine the total number of votes his rival obtained
in the sample used in the exit poll.
APPLICATIONS OF STATISTICS
[Figure: Statistics at the center, linked to Natural Sciences, Social Sciences, Sports Sciences, Business and Economics, Tourism, and Engineering]
REMARKS
Statistics is used in virtually every field and is seen in everyday
life. Wherever there are data and numbers, there is Statistics.
There are always statistics in the news, survey results,
speeches made by politicians, stocks, and Peso-Dollar
exchange rates, to name a few. We see and hear numbers
almost constantly on billboards and in TV and radio advertisements.
In your bachelor's course, you will certainly encounter
data of some kind. The following presents the applications
of Statistics to various fields.
ACTIVITY!!
stat-is-sexy
Share some studies or
research in your field
where Statistics is used.
OBJECTIVES OF A STATISTICAL INQUIRY
[Figure: describe, compare, clarify, justify, determine, identify, forecast, predict, reveal]
1. describe the characteristic of the elements in the population under
study through the computation or estimation of a parameter such as
the proportion, total, and average;
6. reveal the natural groupings of the elements in the population
based on the values of a set of variables;
STEPS IN A STATISTICAL INQUIRY
Regardless of the complexity of the research problem at hand, a researcher
can complete any one of these inquiries by following these basic steps.
STEP 1: IDENTIFY THE PROBLEM.
Any statistical inquiry must begin with a clearly stated research problem.
The stated problem is the basis of all the actions that the researchers will
take in the other stages of the research process.
EXAMPLE
Suppose the researchers want to determine if there is an association between the
price and production of lumber. They would have to answer the following questions
first to obtain a precise statement of the problem.
What kind of lumber will be included in the study? Will all types of lumber be
included or just one specific type?
Will the study include the whole production of lumber or only lumber produced
for sale?
What price for lumber will be used, the market price or the factory price?
What is the scope of the study? Are all the regions of the country included or
just a specific region or province only?
What period is covered in the study?
After answering these questions, the researchers may finally state the problem as
"What is the relationship between the total mahogany production of Mindanao and
the market price of mahogany in the past 10 years?"
STEP 1: IDENTIFY THE PROBLEM.
The statement of the research problem is usually in the form of a question.
However, there are also other ways of stating the problem. It can also be in
the form of a statement.
After stating the problem, the researchers must list down all of the specific
objectives or the specific information needed that will help them answer
the stated problem.
EXAMPLE
In the form of a question
What are the factors affecting the job performance of an employee?
STEP 2: PLAN THE STUDY.
In coming up with a plan, the researchers need to consider all the outputs
in Step 1.
RESEARCH DESIGN
STEP 3: COLLECT THE DATA.
Here, the investigators carry out the plans specified in the
research design of data collection.
STEP 4: EXPLORE THE DATA.
Prior to data analysis, the investigators need to explore
and understand the essential features of their data.
STEP 5: ANALYZE THE DATA AND INTERPRET THE RESULTS
The investigators need to check that they have met all of the specified
objectives, answered the research problem, and given recommendations
on how the results can be useful in decision making.
The investigators also double-check any results that contradict existing
theories or the earlier hypothesis made. They may have committed errors
in data collection or analysis. If not, they would have to propose possible
explanations for these results or suggest future statistical inquiries that
could help explain the inconsistency.
STEP 6: PRESENT THE RESULTS.
After analyzing the data and interpreting the results,
the investigators must present these results in a clear
and concise manner to the users of the research.
MG ACTIVITY!!
Summation
Top 3 Learning Points
$\sum_{i=1}^{3} (i\text{th learning point})$