Professional Documents
Culture Documents
Biostat Lab and Lec Prelims
Biostat Lab and Lec Prelims
PROCESSING
1. DESCRIPTIVE
STATISTICS Methods of summarizing and
presenting data
An art of summarizing data
Computation of measures of central
Tool in decision making
tendency and variability
METHOD or DATA
Tabulation and graphical presentation
USES: Facilitate understanding, analysis, and
interpretation of data
Data reduction technique
Tool for analyzing research projects and 2. INFERENTIAL
clinical trials Methods of arriving at conclusions and
Tool for objective appraisal and generalizations about a target
evaluation of programs population based on information from a
Tool in decision-making process and sample
policy making Estimation of parameters and
2 AREAS OF STATISTICS hypotheses testing
”0” for male and “1” for female Number of code must be kept to
“1” for agree and “2” for disagree minimum (preferably < 8).
Codes should be exhaustive and
mutually exclusive
TYPES OF CODE Adopt coding convention for questions
with similar answers.
A. Field Code: actual value or information
given by the respondent CODING MANUAL - A document which contains
a record of all codes assigned to the responses
Example: to all questions in the data collection forms.
Age (yrs) – 30 years old Minimum information that must be included in
Weight (lbs) – 180 lbs a coding manual:
Height (cm) – 144 cm
Variable name
B. Bracket Code: recorded as range of Variable description
values rather than actual values Coding instructions
n = sample
e = margin of error or percentage of Sampling population
error
Sampling unit
𝑁
𝑛=
1 + 𝑁 (𝑒)2 Elementary unit/ element
Sampling frame
DISADVANTAGES
Easier to execute
Stratified sampling
PROBABILITY SAMPLING DESIGNS
Cluster sampling
Systematic Sampling (SYS)
Multi-stage sampling
Advantages
Simple random sampling
Less time consuming and easier to
Most basic type
perform
Every element in the population has an
Can sometimes result in representative
equal chance of being included in the
sample
sample
Disadvantages Clusters are usually of the same size
and the characteristics of units across
Units could Widely spread-out
clusters are homogenous or similar.
Systematic bias
MUTI-STAGE SAMPLING
Samples are selected from each stratum Sampling design: 4-stage stratified,
through SRS or SYS. systematic, cluster, simple sampling
design
PROCESS IN STRATIFIED SAMPLING
Multi-Stage Sampling
1. Divide sample population into strata
Advantages
2. Obtain the sampling frame for each
stratum Cost efficient design
3. Compute the sampling fraction, p=n/N Sampling frame for all elementary units
not required
4. Select random samples of p in each
stratum sample is easier to select
CLUSTER SAMPLING
GRAPHICAL PRESENTATION
TYPES:
Pie chart
Bar graph
Line graph
Scatter plot
GRAPH PRESENTATION
In a bar chart, the various categories vertical axis: variable values
into which the observations fall are
presented along a horizontal axis.
Histogram
bar is used to
depict
number or
A vertical bar is drawn above each
relative
category such that the height of the bar
frequencies
represents either the frequency or the
of data
relative frequency of observations
points falling
within that class.
into the
given class
Line Graph
plot of dots joined with lines over some vertical axis: number of relative
period of time in sequential series frequencies
Histograms on the other hand are usually used shows description of a large
to present “continuous data” that is data that quantitative data
represents measured quantity. The data would
include center, spread, shape, tail
then be collected into categories to present a
length, and outlying data points
histogram.
Can be presented horizontal or vertical
STEM-AND-LEAF PLOT
Graphical Presentation
ADVANTAGES
DISADVANTAGES
3
Frequency
(# occurrences) 2
1
65 70 75 80 85 90 95
Test Score
Normally Distributed Curve
Skewed Distributions
n DISPERSION
n How spread out is the distribution?
n What shape is it?
The MEASURES of Central Tendency
X = (3 + 5 + 10 + 4 + 3) = 25
5 5
X =5
Find the Mean
Q: 85, 87, 89, 91, 98, 100
A: 91.67
n Variance (σ2)
Why can’t the mean tell us everything?
X – Xi Abs. Dev.
1. Compute X (Average)
7–6 1 2. Compute X – X and take
7 – 10 3 the Absolute Value to get
Absolute Deviations
7–5 2 3. Sum the Absolute
7–4 3 Deviations
4. Divide the sum of the
7–9 2
absolute deviations by N
7–8 1
Total: 12 12 / 6 = 2
What Does it Mean?
n On Average, each value is two units away
from the mean.