Professional Documents
Culture Documents
Basic Concept in Statistics-Biostat
Basic Concept in Statistics-Biostat
Biostatistics
1
Definition of Statistics
• Different authors have defined statistics differently. The best
definition of statistics is given by Croxton and Cowden
as the
according to whom statistics may be defined
science, which deals with collection,
presentation, analysis and interpretation
of numerical data.
• The science and art of dealing with variation in data through collection,
classification, and analysis in such a way as to obtain reliable results. —
(John M. Last, A Dictionary of Epidemiology )
• Branch of mathematics that deals with the collection, organization,
and analysis of numerical data and with such problems as experiment
design and decision making. —(Microsoft Encarta
2
Premium 2009)
Definition of Biostatistics=
Medical statistics
• Biostatistics may be defined as
application of statistical methods
to medical, biological and public
health related problems.
3
Definition of Biostatistics=
Medical statistics
• It is the scientific treatment given to the
medical data derived from group of
individuals or patients
➢Collection of data.
➢Presentation of the collected data.
➢Analysis and interpretation of the
results.
➢Making decisions on the basis of such
analysis
4
Role of Statistics in Clinical
Medicine
The main theory of statistics lies in the term
variability.
There is No two individuals are same. For example, blood
pressure of person may vary from time to time as well as from
person to person.
We can also have instrumental variability as well
as observers variability.
Methods of statistical inference provide largely objective
means for drawing conclusions from the data about the issue
under study. Medical science is full of uncertainties and
statistics deals with uncertainties.
5
Role of Statistics in Clinical
Medicine
Statistical methods try to quantify the uncertainties
present in medical science.
6
Role of Statistics in
Public Health and Community Medicine
Statistics finds an extensive use in Public Health and Community Medicine.
Statistical methods are foundations for public health administrators to
understand what is happening to the population under their care at
community level as well as
individual level. If reliable information regarding the disease is available,
the public health administrator is in a position to:
●● Assess community needs
●● Understand socio-economic determinants of health
●● Plan experiment in health research
●● Analyze their results
●● Study diagnosis and prognosis of the disease for
taking effective action
●● Scientifically test the efficacy of new medicines and
methods of treatment.
7
Why we need to study BioStatistics?
Three reasons:
(1) Basic requirement of medical research.
8
Role of statisticians
To guide the design of an experiment or survey
prior to data collection
9
1.1: What is Statistics?
Statistics: The science of collecting,
describing, and interpreting data.
• Statistics:
- A means by which a set of data may be
described and interpreted in a meaningful
way.
- A method by which data can be analyzed
and inferences and conclusions drawn.
.
10
Two areas of statistics:
11
Descriptive Statistics
• Descriptive statistics are methods for
organizing and summarizing data.
• For example, tables or graphs are used to
organize data, and descriptive values such
as the average score are used to
summarize data.
• A descriptive value for a population is
called a parameter and a descriptive
value for a sample is called a statistic.
12
Inferential Statistics
• Inferential statistics are methods for using
sample data to make general conclusions
(inferences) about populations.
• Because a sample is typically only a part of the
whole population, sample data provide only
limited information about the population. As a
result, sample statistics are generally imperfect
representatives of the corresponding population
parameters.
13
Example: A recent study examined the math and verbal
NAT scores of high school seniors. Which of the following
statements are descriptive in nature and which are
inferential.
14
Introduction to Basic Terms
15
Population
• The entire group of individuals is called the
population.
• For example, a researcher may be
interested in the relation between class
size (variable 1) and academic
performance (variable 2) for the population
of third-grade children.
16
Sample
• Usually populations are so large that a
researcher cannot examine the entire
group. Therefore, a sample is selected to
represent the population in a research
study. The goal is to use the results
obtained from the sample to help answer
questions about the population.
• Sample: A subset of the population
17
Variables
• A variable is a characteristic or condition
that can change or take on different
values.
• Most research begins with a general
question about the relationship between
two variables for a specific group of
individuals.
19
Types of Variables
• Variables can be classified as discrete or
continuous.
• Discrete variables (such as class size)
consist of indivisible categories
– done by counting
• continuous variables (such as time or
weight) are infinitely divisible into whatever
units a researcher may choose. For
example, time can be measured to the
nearest minute, second, half-second, etc.
- done by measuring
20
Data (singular): The value of the variable
associated with one element of a population or
sample. This value may be a number, a word, or
a symbol.
Data (plural): The set of values collected for the
variable from each of the elements belonging to
the sample.
Experiment: A planned activity whose results
yield a set of data.
Parameter: A numerical value summarizing all
the data of an entire population.
Statistic: A numerical value summarizing the
21
sample data.
Example: A college dean is interested in learning about the
average age of faculty. Identify the basic terms in this situation.
The population is the age of all faculty members at the college.
A sample is any subset of that population. For example, we
might select 10 faculty members and determine their age.
The variable is the “age” of each faculty member.
One data would be the age of a specific faculty member.
The data would be the set of values in the sample.
The experiment would be the method used to select the ages
forming the sample and determining the actual age of each
faculty member in the sample.
The parameter of interest is the “average” age of all faculty at the
college.
The statistic is the “average” age for all faculty in the sample.
22
Kinds of variables:
Qualitative, or Attribute, or Categorical,
Variable: A variable that categorizes or
describes an element of a population.
23
Kinds of variables:
Quantitative, or Numerical, Variable: A
variable that quantifies an element of a
population.
24
Example: Identify each of the following examples as
attribute (qualitative) or numerical (quantitative)
variables.
26
4 Types of Measurement Scales
3. An interval scale is an ordered series of equal-
sized categories. Interval measurements
identify the direction and magnitude of a
difference. The zero point is located arbitrarily
on an interval scale.
4. A ratio scale is an interval scale where a value
of zero indicates none of the variable. Ratio
measurements identify the direction and
magnitude of differences and allow ratio
comparisons of measurements.
27
Qualitative and quantitative variables may be further
subdivided:
Nominal
Qualitative
Ordinal
Variable
Interval
Quantitative
Ratio
Example: Identify each of the following as examples of (1)
nominal, (2) ordinal, (3) ratio, or (4) interval:
29