Professional Documents
Culture Documents
Biostatistics and Research Methods in Pharmacy Pharmacy c479
Biostatistics and Research Methods in Pharmacy Pharmacy c479
BIOSTATISTICS AND
RESEARCH METHODS IN PHARMACY
Pharmacy C479
(4 quarter credits)
Assignment #1 .........................................................................................................................................46
Assignment #3 .........................................................................................................................................54
Assignment #4 .........................................................................................................................................59
Lesson Eight Confidence Estimation 60
Introduction to Statistical Estimation; Constructing a Confidence
Interval for a Mean; Comparing Two Means; Confidence Intervals for
an Odds Ratio
Assignment #5 .........................................................................................................................................64
Assignment #7 .........................................................................................................................................71
Assignment #8 .........................................................................................................................................76
Assignment #9 .........................................................................................................................................81
Supplemental Materials
You will need a calculator, preferably one that has scientific or statistical
functions. We do not recommend that you spend hundreds of dollars on a
calculator for this course. Before undertaking Lessons Six through
Fourteen, you may want to refer to a standard algebra refresher text to
refamiliarize yourself with the concepts of notation (e.g., Σ, σ, µ, α, β, ν),
formulas, expressions, equations, coefficients, plotting and coordinates,
slope, and intercept.
The course will cover most but not all of the material in the two required
textbooks. The fact that this is a tremendous amount of reading is not lost
on us. Without the benefit of lectures, you are limited to the information
contained in the textbooks and this guide. To make the textbooks more
practical and useful, we have endeavored to select the key concepts in
each chapter to review in this study guide. In addition, for the statistical
analysis sections we have selected problems for you to work that
accompany each of the key concepts. This study guide works well as a
companion to the textbooks and not as a substitute. Please do all the
readings before attempting the problems.
The practice problems for Lessons Six through Fourteen have been
selected from the problem sets in the Le and Boen book. The answers to
some these problems can be found in the book. You should make every
attempt to complete each of the problems even though many of the
answers appear in the back of the book. Hence, when grading your
assignments, we will be able to discern how you arrived at your answer
and give you feedback in areas where you may be having some difficulty.
In addition, you may see one or more of these problems on the
examinations.
Each written assignment will consist of between one and twenty problems
that you will submit for grading. These problems will come from the Le
and Boen book and other sources. Because these will be graded it is
important for you to write clearly and, in the case of statistics problems, to
show the steps you took to arrive at the solution. The midterms, Lessons
Five and Nine, and the final examination, Lesson Fourteen, should be
scheduled only after you have received all your corrected assignments
back. Please allow sufficient time (ten to fifteen days) for these to be
graded. The midterms cover the lessons since the last exam. The final is
comprehensive, covering all of the lessons. All exams are to be taken
without the aid of the textbooks or other materials, with two exceptions:
you are allowed to bring one page of formulae to help you on the exams,
and you are allowed to bring and use a calculator. To help complete the
exams, you will also be provided with a copy of appendixes A, B, and C
from the Le and Boen text. You will have ninety minutes to complete each
midterm exam and two hours to complete the final.
Grading Policy
The written assignments will collectively count for 25% of your grade,
each of the midterms will count 25% of your grade (for a total of 50%),
Pharmacy C479 5
and the final will count for the remaining 25% of the grade. Each
assignment and exam will be graded on a four-point scale.
It is best to always keep a copy of the work you submit. Mail can
occasionally get lost.
Deadlines
If you need to have your final course grade turned in to the registrar by a
particular date, plan ahead. Turn in all of the preceding assignments before
you schedule an exam and allow two weeks after submitting the final
exam for Distance Learning to process your grade. No special
accommodations can be made on this point.
In the course guide, you will find many of the key terms explained in the
commentary. The textbooks, however, have the added advantage of an
index that you can use to find many of the key terms used throughout the
texts. Please take the time to look up and memorize these terms.
A Word of Caution
Procrastination is your enemy in completing a self-study course. Without
the set schedule of a lecture course, some students may put off doing
lessons and assignments. This course can be completed in the time frame
of a ten-week quarter if you apply yourself. You will have to manage your
own time to successfully complete this course.
Do not read the text like a novel. While there is a rather important “story
line,” the books do not read like a novel. You may find it easiest to read
each chapter in sections, mastering each before proceeding to the next.
There are three possible reasons for the discrepancy between your answer
and the one in the book:
So what do you do? Quickly look over your work for clerical errors like
dropping a negative sign. Compare your answer with the book’s, and use
the similarities and differences to focus on the particular spot where you
disagree. If you still can’t find the error, look at other problems similar to
the one you just did. If your answers agree on four out of five problems,
you understand how to do the problems and you can safely forget this one.
On the other hand, if you answers agree with the book’s on zero out of
five problems, you are probably on the wrong track, and you should go
back and read some more.
Pharmacy C479 7
Optional Materials
The following books may help you with course material but are not
required and will not be referred to in the course.
Biostatistics
These two books are commonly used as introductory texts in biostatistics
courses for health professionals. You may find them helpful if you wish
further clarification or more in-depth discussion on the statistics topics.
¥ Norman and Streiner. Biostatistics: The Bare Essentials. Mosby-
Year, 1994.
For more information on setting up and using electronic mail and other
Internet resources, see the “Online Access” section of the Distance
Learning Student Handbook.
If you have any problems using electronic mail, you can get help by
calling Distance Learning. You can also send e-mail to Distance Learning
at distance@u.washington.edu or UW Computing and Communications at
help@cac.washington.edu.
Sean D. Sullivan
I am a native of west central California and I currently live in Kirkland,
Washington. I have been teaching clinical research methods and medical
technology assessment at the University of Washington since 1992. I
received a B.S. in pharmacy from Oregon State University, an M.S. in
pharmacy administration from the University of Texas, and a Ph.D. in
health economics and policy from the University of California at
Berkeley. As an instructor, I have tried to convey a sense of the practical
when it comes to medical statistics and clinical research design.
Holly Andrilla
I am a native of Seattle and attended the University of Washington as an
undergraduate, earning my B.S. in statistics. I continued at the UW and
completed my M.S. in biostatistics. I currently work as a biostatistician in
the Departments of Family Medicine, Pharmacy, and Biostatistics. I also
teach at North Seattle Community College.
Dave Blough
I have been pushing numbers around since receiving my doctorate in
statistics from Iowa State University in 1982. I left the corn and soybean
fields to teach and publish, and currently enjoy working with researchers
from various disciplines through my own consulting firm. When I moved
to Seattle in 1994, it was to work for the Department of Health Services at
the University of Washington. There I helped develop medical risk
adjustment models for the state of Washington. When not playing the
piano or backpacking in the mountains, I love to hunch over the computer
and derive great insights from masses of data. Even more than that, I enjoy
teaching others that statistics can be useful, interesting, and even fun.
Andy Stergachis
I am a lifelong resident of the Pacific Northwest. A graduate of the
Washington State University College of Pharmacy and the University of
Minnesota, I have held a variety of research and administrative positions
at the University of Washington School of Pharmacy and at Group Health
Cooperative of Puget Sound. My research has used several study designs,
ranging from experimental to epidemiologic methods. I am the recipient of
numerous research grants and awards in the area of pharmacoepidemiol-
Pharmacy C479 9
ogy and drug policy. My teaching methods emphasize real-life problems
facing pharmacists and other health-care professionals.
Objectives
At the end of this lesson, you should be able to
o identify the elements of a research protocol;
o formulate a set of primary and secondary clinical research
questions;
o differentiate between internal and external validity;
o critique clinical study enrollment (inclusion and exclusion)
criteria for appropriateness to the research question;
o select from among the possible approaches to sampling; and
o identify the characteristics of study variables.
Key Terms
research questions
design
selection of study subjects
measures
statistical evaluation
treatment group
internal validity
external validity
random error
systematic error
sampling error
measurement error
target population
accessible population
study sample
inclusion and exclusion criteria
sample
probability sampling
nonprobability sampling
consecutive sampling
convenience sampling
judgement sampling
The following sections in this course guide describe each of the elements
of the research protocol in more detail.
Research Questions
The research questions describe the primary and secondary objectives of
the study in a manner that is feasible for clinical investigation. There is
always at least one primary research question. There is often an additional
set of secondary research questions related to the primary objective of the
study. For example, a primary research question might be “Are patients
who receive calcium channel blockers for hypertension treatment at
greater risk of a myocardial infarction than those not receiving calcium
channel blockers?” A secondary research question might relate to the risk
of myocardial infarction in a special subset of hypertensive patients
receiving calcium channel blockers, or it might relate to the risk of MI in
patients taking the drug at a specific dose or taking the short-acting agents
and not the long-acting preparations. The research question is a focused
statement about the clinical problem to be addressed.
Pharmacy C479 13
Significance
The significance section of the protocol should state with clarity the
scientific rationale for the study. In other words, this section should
develop and answer the question “Why is this study important to
undertake?” To do this, the researcher must critically review and make
reference to previously published literature (including the investigator’s
own work) and the gaps in knowledge that exist. The primary and
secondary research questions should emerge from the significance section.
Research Design
Most clinical studies in pharmacy involve research related to new or
existing drugs, devices, or services. There are two basic types of research
designs for studying the effects of these interventions. Clinical studies are
frequently experimental in that the researcher elects to actively test the
effect of an intervention in one group of subjects (e.g., drug, device, or
program) when compared to a control or usual care group. The most
widely used experimental research design is the randomized trial, which is
often viewed as the standard by which all other clinical study designs are
compared. As noted in the text, not all situations require the use of a
randomized trial.
Study Subjects
The section of the research protocol on study subjects describes the
criteria for including and excluding potentially eligible patients. For
example, the investigator may be interested in studying the respiratory
outcomes of adolescent asthmatics who use peak flow meters as part of a
self-monitoring program. In this case, the inclusion criteria would reflect
an age restriction so that adolescents would be included in the study
sample and all others would be excluded. Another inclusion criteria may
relate to ability to use a peak flow meter or willingness to participate in an
asthma self-monitoring program. In any event, the inclusion and exclusion
criteria serve to select and standardize as much as possible the
characteristics of the population that is to be studied.
Measures
A major element of the research protocol is the description of data
elements or variables that are to be measured and collected as part of the
evaluation. The primary focus of most clinical intervention studies is on
the clinical outcome or primary clinical endpoint. The outcome is typically
collected from intervention and control subjects both before and after the
intervention. The group receiving the intervention is sometimes referred to
as the treatment group. Outcome variables for clinical studies of
pharmaceuticals are related to the potential benefits and risks of the drug
and can include measures of physiologic and symptomatic function such
as blood pressure, frequency of nocturnal awakenings, cholesterol level, or
intraocular pressure; health status variables such as measures of physical
and mental activity or health-related quality of life; and socioeconomic
measures such as time off from school or work and medical resource
consumption.
Statistical Plan
The final section of the research protocol contains a description of the
statistical evaluation strategy as proposed by the researchers. It is
important that the statistical plan is developed prior to data analysis. As
you will learn later in the course, the statistical evaluation plan relates
directly to the primary and secondary research questions, which are posed
in the form of research hypotheses. Without a clinically plausible
theoretical link from the research question to the analysis plan, the study is
left open to criticism from other researchers.
Validity
The purpose of conducting clinical research is so that new medical
technology, devices, procedures, drugs, or programs can be correctly and
accurately described in terms of effectiveness and safety. This is generally
accomplished by a statistical evaluation of the relevant clinical outcome
measures. Chance, bias, and confounding, however, can be alternative
explanations for positive statistical findings.
The validity of a study result is the extent to which the result observed in
the study is in fact the “true” result. Strictly speaking, the results of any
research study apply only to those subjects selected through the
inclusion/exclusion criteria and enrolled in the study. The extent to which
the results of a single clinical study accurately describe what happened
within the study is a measure of the study’s internal validity.
Pharmacy C479 15
The extent to which the findings of a single study can be applied to
settings and populations outside the immediate boundaries of the clinical
study is external validity or generalizability. Investigators strive to
maximize both internal and external validity. Sound internal validity
strengthens the interpretability of the statistical evaluation. External
validity is necessary for the acceptance of clinical trial results beyond the
confines of the clinical trial population. Figures 1.4 and 1.5 in Hulley and
Cummings illustrate the concepts of study validity.
Error
Error may interfere with inference within clinical studies and should be
minimized through appropriate research design. The two types of error
most common to clinical studies are random error and systematic error.
Random error results from chance or unknown influence on the
measurement of important clinical variables.
Systematic error results from bias or distortion where the source of the
bias is not random. Suppose that we wanted to estimate the average height
of all females in the United States, but there was only enough research
money to query a random sample of 100 females. If the random sample
were to be drawn from among female basketball players who visited a
sports medicine clinic, then an overestimate of the population or “true”
average height would surely result. The estimate would be systematically
biased in an upward direction and the source of the bias would be
sampling error.
Suppose that the height measurements were taken from among a truly
random sample of all females, and the height measuring device added
(mistakenly, of course) an extra six inches to the height of all subjects.
This would result in a substantial bias of the population average value.
Bias derived from a faulty measurement tool is termed measurement
error. This type of error has the potential to be common and devastating in
clinical research studies because of all of the measurement tools that are
used to collect health outcomes data. The entire field of laboratory
The origin of the main research question depends upon the past experience
of the researchers, knowledge of the literature, and key or controversial
topics. Most clinical research questions derive from the application or use
of new technologies. The development of new drugs leads to interesting
questions about their efficacy and safety in various diseases and special
populations.
There are many sources of inspiration for new research questions. The
most notable are the medical literature, journal clubs, national meetings
and other scientific forums, direct observation of patients in a clinical
practice, and interactions with students, residents, graduate students, and
fellows.
Pharmacy C479 17
The following section describes the process of establishing appropriate
research selection and sampling strategies in order to meet statistical
significance criteria, satisfy the clinical research objective, and implement
the study at a reasonable cost.
Populations
Every person with Type I diabetes alive today on the planet Earth
constitutes the population of Type I diabetics; every male alive today with
HIV-1 in the United States represents the population of U.S. male HIV-1
carriers; and so forth. A subset of the population, let’s say males with
HIV-1 in New York City, represents a sample of the population.
Sampling
It is not feasible for investigators to enroll the entire accessible population
in a research study because of geographic and temporal barriers. As a
consequence, a smaller number, or sample, can be selected from among
the accessible population whose characteristics closely represent the larger
population. The process of selecting the sample is called sampling.
The external validity of a study depends upon how well the measures in
the study represent the “true” clinical events or outcomes. How well does
systolic and diastolic blood pressure measurements accurately reflect
hypertension? How well does peak expiratory flow reflect clinical asthma?
The internal validity of a study rests, in part, on the ability of the
measurement tools to accurately and precisely represent the phenomenon
of interest to the investigator. Is the electronic blood pressure device
measuring the “true” blood pressure of the study subject? Is the
questionnaire on mental status describing “true” mental impairment?
Measurement Scales
Data or measures often are represented numerically. For analytic purposes,
these measures fall into one of two general classifications: (1) continuous
measures or (2) categorical measures. Table 4.1 in Hulley and Cummings
displays these measurement scales. These distinctions are important in that
Pharmacy C479 19
the choice of statistical tests depends upon the type of measure being
compared. (This will be discussed in detail later in the course.) Note that
the technical term for a measure often used by scientists is “variable.”
When data are not readily quantifiable (i.e., when they do not have innate
numeric values), researchers often reconfigure them into categories and
assign numerical values to these categories. For example, characterizing
the gender of study subjects is not something that can be represented
numerically in the absence of a categorization scheme. Measures in which
there are only two possible categories are referred to as dichotomous, or
binary, variables. Gender (male or female) is a perfect example of a
dichotomous variable.
Nominal variables are those with more than two categories for which the
ordering of the categories is not important. Ordinal variables are probably
quite familiar to you; they are measures with multiple categories for which
the ordering of the categories has meaning. For example, you are no doubt
familiar with the following type of question:
Q1. Please rate your overall health during the past two weeks:
1. Excellent
2. Very Good
3. Good
4. Fair
5. Poor
Pharmacy C479 21
random error. This is important for you as evaluators of the medical
literature in that any threats to the precise measurement of outcome or
predictor variables directly affect the statistical results and your
interpretation of the findings. When evaluating medical and
pharmaceutical papers, pay particular attention to the methods section and
the extent to which the investigators have employed the strategies listed in
table 4.2. Furthermore, note that all five approaches involve activities that
the research team can attend to prior to implementation of the study.
Outcome assessment tools can be refined and standardized through pilot
testing and previous research, observers and data collectors can be trained,
diagnostic tests can be automated and standardized across different sites,
and if necessary, measurements can be repeated to improve precision.
Accuracy
The accuracy of a measure refers to the degree to which the measure
actually represents what it is intended to. Recall the sleep quality study we
discussed in the previous section. If one of the sleep quality questionnaires
were to be written so that it focused less on the actual quality of sleep and
more on the duration of sleep, the measure could be said to lack accuracy
with respect to sleep quality measurement (and would probably be less
precise, too). Figure 4.2 in the text presents the relationship between
precision and accuracy in a basic and easily understood diagram. You
should study this chart carefully.
Unlike precision, for which there are quantifiable methods for assessing
the degree of bias (e.g., the standard deviation and coefficient of
variation), equivalent metrics are not available to determine the degree of
systematic error that ultimately affects accuracy. To do so, one would
need an estimate of the “true” value of a variable from which to compare
observed measures. These “gold standard” measures are rarely accessible.
Validity
Accuracy is more difficult to assess when the outcome measures are more
subjective, such as quality of life, pain, and cognitive function. These
measures are typically assessed not by mechanical tools but rather by
questionnaires. The accuracy of questionnaires is defined in terms of
external validity. That is, how well does the measurement tool (the
questionnaire) represent the phenomenon (e.g., quality of life, pain) in
which the researcher is interested? Has the instrument been validated?
Pharmacy C479 23
the time of a clinic visit reports asthma symptom scores that are also
suggestive of poor lung function, the symptom questionnaire is said to
have good criterion validity. This is because the results of the
symptom questionnaire correlate with the results of the spirometer
values. If the symptom scores were not related to the lung function
values, the symptom questionnaire could be said to be an invalid
measure of lung function, even though it still could be a very good
measure of asthma symptoms.
Conclusion
This concludes the discussion on research protocol design and
measurement. While complex, these issues are at the heart of a clinical
research study. If you have reached this point and lack a basic
understanding of any of these topics, please contact your instructor (by
electronic mail) for further explanation or additional readings. We will
move next into types of research designs for the purpose of answering
clinical questions. (Note that we skip chapters 5 and 6 in Hulley and
Cummings on purpose.)
Objectives
At the end of this lesson, you should be able to
o differentiate between a prospective and retrospective cohort
study;
o recount inherent biases in cohort studies and describe methods to
eliminate or reduce these biases; and
o define and calculate a relative risk ratio as a measure of
association between the risk factor and outcome measure.
Key Terms
cohort
cohort study
incidence
prospective cohort study
confounding factors
retrospective cohort study
relative risk ratio
Pharmacy C479 27
¥ The associations found in cohort studies can be erroneous if they
are due to confounding factors. Confounding factors are those that are
related to both the risk factor and health outcome of interest. The
example in the text regarding smoking as a confounding factor in the
relationship between exercise and CHD is particularly clear. As you
will learn in subsequent chapters, these confounding factors can be
measured and accounted for (controlled) through appropriate design
features and through the use of appropriate statistical methods.
¥ studying rapidly fatal disease (they are the only way to study such
disease);
Pharmacy C479 29
represented by a confidence interval. (These concepts will be developed
further in the statistical analysis section of this course.)
Table 2-1 shows how data from a cohort study may be presented and used
to estimate relative risk.
Examples illustrated in the text are the relationship between exercise and
the risk of coronary heart disease (example 7.1) or the association between
mitral-valve prolapse and the risk of death, cerebral embolus, and
endocarditis (example 7.2). However, simple relative risk ratios fail to
account for the effect (bias) of confounding factors. Multivariate methods
(e.g., logistic regression) and stratification can take account of the effect of
confounding factors and allow for calculation of an “adjusted” relative risk
ratio. For more on this approach, see chapter 10 in Hulley and Cummings.
Objectives
At the end of this lesson, you should be able to
o describe the basic features of a cross-sectional and a case-control
study;
o recognize the strengths and weaknesses of cross-sectional studies;
o discuss biases of particular concern in case-control studies, and
ways to minimize their influence; and
o define and calculate an odds ratio and interpret 95% confidence
intervals.
Key Terms
cross-sectional study
prevalence
case-control study
sampling bias
measurement bias
recall bias
confounding
odds ratio
confidence interval
Cross-Sectional Studies
Cross-sectional studies, sometimes called prevalence studies, are used to
determine the distribution of exposures, other risk factors, and/or diseases
in a sample of the population at a single point in time. Cross-sectional
studies tell us about the distribution of a disease or its risk factors in the
population, rather than its etiology. However, the distribution patterns may
suggest etiologic hypotheses that can then be tested by case-control and/or
cohort studies.
This example uses the term prevalence, which is defined as the proportion
of the population who have a disease at one point in time. One can also
The main statistic for expressing disease (or risk factor) frequency in a
cross-sectional study is prevalence. The example noted at the beginning of
the chapter reported that the overall prevalence of C. trachomatis in the
study population was 0.038 or 3.8%. The same study found that the
prevalence of C. trachomatis was much higher (13.7%) among women
aged 15–19 years of age. Contrary to the example in the text, this
particular study also reported that there was no association between
current use of oral contraceptives and the prevalence of C. trachomatis (a
prevalence odds ratio of 0.99, 95% confidence interval of 0.57–1.73).
Pharmacy C479 33
histories of cases and control are ascertained to compute measures of
association (e.g., odds ratio) between exposure and disease.
Example: You decide to assess the effect of current use of triphasic oral contraceptives
(OCs) on the risk of ovarian cyst development. You identify 106 women aged 15–59
years old with a primary diagnosis of ovarian cysts at an HMO. You also identify 255
control women without ovarian cysts who are randomly selected from the HMO,
matched to the cases for age. Pharmacy and medical records are then reviewed to
determine if cases and controls were using OCs at the time the case was diagnosed (or
at the same date for the controls).
Choosing Controls
The goal is to select individuals in whom the frequency of exposure would
be the same as that of the cases in the absence of an association between
the exposure and the disease.
• They should be selected from a population whose distribution of
the exposure is the same as the population at-risk for becoming a case.
• Almost always, sampling is used to select control subjects.
Sources of controls include third-party payors’ enrollment records,
hospital records, and household samples.
• Controls should be selected from among persons in whom the
presence of exposure and other characteristics can be measured in a
comparable manner.
• Population-based controls have several advantages, including the
degree to which they represent the general population. They allow for
calculating attributable risk as well as an estimate of the population
frequency of exposure.
• Hospital- or clinic-based controls may be relatively inexpensive
to identify but are more susceptible to several types of bias and results
may not be as generalizable.
Assessment of Exposures
The goal is to accurately assess the presence (and level) of exposure (e.g.,
the risk factor) for the period of time prior to the onset of the disease
during which the exposure would have acted as a causal factor.
Sampling Bias
Also referred to as a form of selection bias, sampling bias pertains to how
cases and controls get into a study. The separate sampling of cases and
controls makes it difficult to ensure that the controls represent the same
Pharmacy C479 35
underlying population. Selection bias occurs when noncomparable criteria
are used to enroll subjects in a study.
Measurement Bias
Because of the retrospective approach to measuring exposures, case-
control studies are susceptible to measurement bias. A type of
measurement bias referred to as recall bias is of particular concern. Recall
bias occurs when subjects report information in noncomparable ways. For
example, parents of infants with birth defect (i.e., the cases) may be more
likely than parents of normal infants (i.e., the controls) to report the use of
certain pharmaceuticals, because they will already have been concerned
about what might have caused the defect.
Confounding
Confounding occurs when a third factor, associated with both the exposure
and the disease, contributes independently to the observed association
between the exposure and the disease. Thus, one would have an indirect,
or confounded association. For example, a study of the effect of OCs on
the risk of myocardial infarction (MI) should also assess cigarette smoking
since OC users may be more likely to smoke and smoking increases the
risk of MI.
The measure of association between exposure and disease is the odds ratio
(OR) or relative odds. The OR is an estimate of the relative risk (RR),
particularly when the disease under study is rare. Often there is close
agreement between the OR and RR.
a c
a + c a × d
= a+c
case exposure odds
=
control exposure odds b d b×c
b + d b + d
ORs and RRs are usually reported with confidence intervals (CIs), which
indicate the range of the point estimates (e.g., OR or RR). For example, a
95% confidence interval around an OR means that we can be 95%
confident that the “true” OR lies in this range. If a 95% CI excludes 1.0,
the finding is considered to be statistically significant. For example, a
study yielding an OR of 3.0 (95% CI of 2.2 to 3.8) is clearly showing an
important increase in risk. An OR of 0.6 (95% CI of 0.4 to 0.8) provides
strong evidence of a reduction in risk. As a final example, an OR of 1.23
(95% CI of 0.5 to 3.3) suggests that an association is unlikely, since the
95% CI includes 1.0.
Pharmacy C479 37
LESSON FOUR
EXPERIMENTS
Reading Assignment
Objectives
At the end of this lesson, you should be able to
o identify and discuss the important aspects of the randomized trial
including placebo or active control, blinding, and randomization;
o describe the deficiencies inherent to nonrandomized studies with
regard to study design, analysis and clinical interpretation of the
results;
o show the importance of a cross-over design; and
Key Terms
experimental study
between-group design
within-group design
randomized control trial (RCT)
randomized blinded trial (RBT)
experimental/intervention group
control group
run-in RBT
crossover study
38 Lesson FourÑExperiments
subjects are then followed over time to observe the effect of the
intervention on various health outcomes. The purpose of the study is to
causally link the intervention to observed differences in health outcomes.
Experimental designs in clinical research are preferred over observational
studies such as case-control and cohort studies because they provide the
most rigorous test of the causal relationships between interventions and
health outcomes.
The randomized controlled trial (RCT) is the most widely used between-
group experimental design in clinical medicine. Graphically, the research
process for the RCT is depicted in figure 11.1 in Hulley and Cummings.
(Note that the book uses the term randomized blinded trial [RBT] to
describe the same method. For ease of clarity, through out the remaining
sections of this lesson we will refer to this design as the RBT. But be
aware that a preponderance of clinical literature refers to these types of
studies as RCTs.) The five steps in designing an RBT are as follows:
1. After considering the primary research question, investigators (a)
select an appropriate target population for sampling, (b) identify
necessary inclusion and exclusion criteria for subject identification,
(c) estimate the sample size requirement to answer the study question,
and (d) develop a recruitment plan. Recall from previous readings that
decisions about sample identification and the sampling procedure
determine, in part, the generalizability of the findings.
Pharmacy C479 39
the treatment groups, it is possible that the randomization procedure
was unsuccessful.
40 Lesson FourÑExperiments
may occur during the conduct of the study. The consequence of
unblinded studies is that unintended interventions (co-
intervention) can affect the between-group differences that the
investigator is attempting to measure. If complete blinding is
achieved, differential bias from any co-intervention that may
occur is minimized. It is important to remember that unblinded
studies suffer from this important limitation.
Pharmacy C479 41
regimen of insulin. Note that any difference in health outcomes
between the treatment and control groups in the DCCT study
would represent only the difference between these two
approaches to the treatment of insulin-dependent diabetes and not
the absolute efficacy of intense insulin therapy compared to no
treatment.
42 Lesson FourÑExperiments
CD-4 count. This biological marker of disease activity is
considered a surrogate measure. If CD-4 counts are improved, the
pharmaceutical agents are said to have “biological activity in
AIDS.” But the true outcome measure of interest in AIDS is
survival, an outcome that is difficult to study in short-term drug
trials. Recall that for a dichotomous variable like survival,
frequent events over a long period of time are required to provide
sufficient power for statistical testing.
Once the outcome measure has been measured and collected, the task of
statistical evaluation of the outcome data follows. In general, the
researcher aggregates the data by treatment group and performs the
necessary statistical tests, selected a priori, of the between-group
difference in the primary outcome measure. Generally, if the outcome
Pharmacy C479 43
measure is dichotomous the investigator will use a chi-square test or a test
of proportions. If the outcome measure is continuous, the researcher will
likely use a t-test if the data are normally distributed or a nonparametric
test if the data are not normally distributed. If the outcome is measured in
units of time (e.g., time to death or time to an event), survival analysis
techniques are used. If there is baseline or treatment-period confounding,
multiple regression techniques can be used to adjust for these differences.
The technical detail of these methods is described in the subsequent
section of this course.
You are advised to read appendix 11.b (pp. 212–14), which covers intent-
to-treat, subgroup investigations, and stopping rules. These are important
topics that deserve consideration. Note that the Medical Literature
Evaluation course (Pharmacy 493) discusses these issues in greater detail.
The run-in RBT is used frequently for drug studies. The goal is to increase
compliance with the intervention. Subjects are placed either on active or
placebo treatment for a brief period prior to the baseline assessment in
order to select only those patients who are likely to comply with the
treatments and protocol during the course of the study. Once selected,
patients are randomized in the usual fashion to treatment and control and
the study proceeds.
Another variation of the RBT is the crossover study. With this design,
persons randomized to the initial treatments (active or placebo) are
switched to the other treatment (“crossed over”) after a period of time.
This allows the investigator some flexibility. Fewer patients can be used,
time-dependent confounders can be avoided, and between-group and
within-group analyses can be performed. Recall that between-group
analysis is one in which treatment differences between the groups can be
assessed. Within-group analysis implies that the effects of treatments can
be assessed within the same patient or group. This design is frequently
used for dose ranging or dose finding studies of new pharmaceuticals.
Conclusion
Lessons Two through Four have endeavored to discuss the elements of
basic clinical research designs in sufficient detail to present an
understanding for those interested in evaluating the medical and
pharmaceutical literature. Any further discussions of research design are
better undertaken in the context of the review of actual studies, which are
presented in the literature evaluation course.
The next section of this course focuses on the descriptive, graphical, and
statistical analysis of data collected as part of a clinical research study.
44 Lesson FourÑExperiments
The lessons deal with the complex issues of presenting and depicting the
results of clinical studies. We encourage you to study these forthcoming
topics carefully and thoroughly; this material is the basis for much clinical
decision making. The statistical results of clinical studies have had far-
reaching effects on the practice of medicine and pharmacy. It is best that
you and your colleagues gain knowledge about the applications of
biostatistics in medical and pharmaceutical research.
Further Reading
If you are interested in further reading on the topics of clinical research
methods, you are encouraged to explore chapters 12 through 14. If you are
contemplating a clinical research career, you should read the rest of the
book. Even if you have no additional interest, you may want to make a cup
of coffee and spend an hour or so reading or skimming the contents of
chapter 14. This chapter addresses the very real issues of the ethics of
clinical research, such as placebo-control, financial compensation, and
other important topics. Mostly without your knowledge, many of your
patients are enrolled in clinical drug studies. If you work in an inpatient
facility, you may be aware of those that are. You are not likely to know if
you work in an ambulatory setting. After reading chapter 14, you are
likely to learn how your patients were approached for participation, given
informed consent, compensated, and encouraged to complete the study.
The more informed you are about the conduct of clinical research, the
better pharmaceutical care you are likely to give your patients.
Pharmacy C479 45
ASSIGNMENT #1
Following the outline presented in Lesson One, prepare a brief research
protocol of no more than three pages to address a research project that is
of interest to you in your current work environment. This can be a drug
study, a study of a pharmaceutical care program or service, or any other
medical or pharmaceutical topic. In the protocol, state the research
questions, the study sample, the selection criteria, the health outcomes,
and the intervention and nonintervention groups. Describe the research
design you selected and give the benefits and limitations of this design
over other possible designs.
46 Lesson FourÑExperiments
Pharmacy C479 47
LESSON FIVE
PREPARING FOR THE FIRST MIDTERM
EXAMINATION
This test covers chapters 1-4, 7, 8, and 11 of Hulley and Cummings.
Remember that it is a closed-book, closed-note test. However, you are
allowed to use one page of formulae to help you during the exam and also
a calculator. To help you complete the exam, a copy of appendices A, B,
and C from the Le and Boen text will be provided. You will have ninety
minutes to complete the examination.
Objectives
At the end of this lesson, you should be able to
o explain the difference between a continuous and discrete variable;
o compute a proportion and graphically display your results in a bar
or pie chart;
o construct a two-by-two table and from it calculate disease
prevalence, and the sensitivity and specificity of a test;
o compute a mean, variance, and standard deviation of a data set;
o read and understand a survival curve; and
o explain the concept of standardizing rates for comparison.
Key Terms
histogram
stem and leaf diagram
mean
median
variance
standard deviation
sensitivity
specificity
The mean and median are both statistics of any data set. The mean is
calculated by adding the values of all data points and dividing that sum by
the number of data points. Another name for the mean is the average. The
median is found by arranging the data in ascending order and taking the
middle value. When a data set has an odd number of observations N, the
median is the (N + 1)/2 observation when the data are ordered. If a data set
has an even number of observations the median is the average of the two
middle observations.
Example 1
Data Set A
1, 2, 3, 7, 9, 12, 22
Mean: 1 + 2 + 3 + 7 + 9 + 12 + 22 = 56/7 = 8
The mean of data set A is 8.
The median is 7 because half of the data points are greater than 7, and half are less
than 7.
Data Set B
2, 3, 5, 6, 8, 14
Mean: 2 + 3 + 5 + 6 + 8 + 12 = 36/6 = 6
The mean of data set B is 6.
The median is (5 + 6)/2 = 5.5.
The median is not affected by extreme values and doesn’t use information
about the spread of the data. When a data set is symmetric the mean and
median are quite similar to each other. Compare the mean and median of
data sets C and D.
In data set C, the median is not affected by the large size of the
observation 80 and 92. The mean is a more appropriate measure of
centrality. In data set D, which is symmetric, the mean and median both
provide a good measure of centrality.
Pharmacy C479 51
Another descriptive attribute of a data set is its variance, which is a
numeric value describing the range of the data. The larger the variance, the
more spread out the data set.
The data graphed as a solid line in the preceding figure have a larger
variance than those in the figure graphed as a dashed line.
The positive square root of the variance is called the standard deviation. It
is a useful measure because it is measured in the same units as the data.
Example 2a
Data Set E
0, 3, 4, 6, 8, 9
Mean: 0 + 3 + 4 + 6 + 8 + 9 = 30/6 = 5
Variance: (0 – 5)2 + (3 – 5)2 + (4 – 5)2 + (6 – 5)2 + (8 – 5)2 + (9 – 5)2
= 25 + 4 + 1 + 1 + 9 + 16 = 56
55/5 = 11.2.
The variance of data set E is 11.2. The standard deviation of Data Set E is the square
root of 11 or approximately 3.3.
(∑ x )
2
∑x 2
−
n
A quicker way to compute the variance is given by .
n −1
Example 2b
Data Data Squared
0 0
3 9
4 16
6 36
8 64
9 81
Total: 30 206
206 − 900 / 6
variance = = 11.2
5
Most scientific calculators will compute the mean and variance of a data
set. Check your manual to see if yours will. Try the previous example with
your calculator to insure you understand how to compute a variance by
hand and on your calculator.
When a data set is larger, calculating the Kaplan-Meier curve can be quite
computationally intensive. We can create a curve using the actuarial
method similar to a cumulative frequency distribution. Your text discusses
this process in detail at the end of chapter 2. The curve graphically depicts
the survival rate as a percent at any point in time. See figure 2.10 in your
text. For example, in figure 2.10, at t = 5 years approximately 30% of the
cancer patients in the study are still surviving. When comparing two
curves (to compare two treatments for instance), the better treatment
would have a higher percent of survivors over the course of the curve.
Pharmacy C479 53
ASSIGNMENT #3
Le and Boen, chapter 1: exercises 3, 4, 9, 16, 27
Le and Boen, chapter 2: exercises 2, 11, 13–15, 18, 20
Objectives
At the end of this lesson, you should be able to
o explain conditional probability;
o explain the meaning of independent events;
o express the sensitivity, specificity, positive predictive value and
the negative predictive value of a test as a conditional
probability;
o calculate a probability from a z-score;
o standardize statistics to a standard normal curve; and
o approximate a probability from a binomial distribution using a
normal approximation.
Key Terms
probability
simple random sampling (SRS)
conditional probability
positive predictive value (PPV)
negative predictive value (NPV)
z-score
binomial distribution
Introduction
This section is designed to introduce you to the tools you will need in
subsequent lessons. The chapter begins by defining and discussing
probability and expresses some ideas already introduced in terms of
probability. Additionally you will be introduced to the normal distribution.
The normal distribution is arguably the most powerful tool in statistical
experimentation. It is vital to interpreting literature and performing
research to understand its application.
Probability
Scientific conclusions are drawn after examining evidence from
experimentation. This is true is all fields. The evidence considered leads
us to believe our conclusion. Sometimes our conclusions are incorrect. We
formulate our conclusions based on probability. Probability can be defined
as
56 Lesson SevenÑDistributions
What we consider a positive outcome is defined by what we are studying.
If we were rolling a fair die, we could define a positive outcome as a 3 or
5 and the probability would be equal to 2/6 (which reduces to 1/3) because
there are two positive outcomes (3 and 5) and six possible outcomes (1, 2,
3, 4, 5, and 6). We could define a positive outcome as a disease-free state
or even as a disease-present state. In the case of a disease-outcome, the
probability is the number of persons diagnosed as having the disease
divided by the number of persons checked for the disease. If 1,000
randomly selected children were screened for Disease A, and 150 of them
were diagnosed as diseased, the probability of Disease A would be
denoted by Pr(Disease A) = 150/1000 = 0.15.
Simple random sampling (SRS) means each person has an equally likely
chance of being selected to the sample. Researchers often use random digit
dialing to select samples for studies. Practically speaking, most homes
have phones and therefore any household would have an equally likely
chance of being selected. Studies done using only volunteers must be
careful of the conclusions drawn and applied to the general population
because people willing to participate in a study may be fundamentally
different in some ways than those who refuse to participate.
Pharmacy C479 57
In order to draw conclusions from our studies, we make comparisons
between different treatments or applications. For example, if we want to
introduce a new drug treatment, it needs to be compared to and outperform
in some aspect (effectiveness, cost, side-effects) the current standard
treatment. In order to make this comparison we frequently use statistics
from our data and the normal distribution. In this lesson, you need to be
able to read a table of the area under a normal curve (appendix B).
Carefully study the examples in your book in section 3.2.2. Keep in mind
the total area under the curve is equal to 1. The curve is symmetric, and
therefore the area to the right of 0 equals 0.5, as does the area to the left of
0.
x−µ
z=
σ
The value of z, called the z-score, is the number of standard deviations our
measurement is from the mean. Important values to know are z = 0.99,
which cuts off 68% of observations, and z = 1.96, which cuts off 95% of
observations. This means that 68% of observations fall within one
standard deviation of the mean in either direction, and 95% of
observations fall within two standard deviations of the mean in either
direction. (See figure 3.4 on page 85.)
Example
Suppose that the mean mg/100mL of cholesterol is 275 with a standard deviation of
75. Find the probability that a person’s cholesterol is below 200.
Pr(X < 200) = Pr([X – 275] / 75 < [200 – 275] / 75) = Pr(Z < –1)
= .5 – Pr(–1 < Z < 0)
= .5 – .3413
= .1587
A person must fall into one of the two categories. If number of trials
performed (persons studied) is large enough, we can use a normal
distribution to approximate a binomial distribution. The mean is estimated
by nP and the variance is estimated by nP(1 – P). Once we have these
numbers, we apply the same procedures as we did when using the normal
distribution.
58 Lesson SevenÑDistributions
ASSIGNMENT #4
Le and Boen, chapter 3: exercises 1–3, 5, 7, 8, 10, 12, 19, 20
Pharmacy C479 59
LESSON EIGHT
CONFIDENCE ESTIMATION
Reading Assignment
Objectives
At the end of this lesson, you should be able to
o explain the concept of a random variable;
o explain how a statistic has a distribution of its own;
o estimate a population mean;
o compute a standard error;
o construct a confidence interval with varying measures of
certainty; and
o estimate a population proportion from a sample and construct a
confidence interval around it.
Key Terms
statistical estimation
confidence interval
Central Limit Theorem
standard error
Example 1
A simple random sample was selected from a population. The sample size, mean and
variance are given here. Calculate the 95% confidence interval around the mean.
sample size: 50
sample mean: 39.4
variance: 0.7
95% confidence interval: = 39.4 ±1.96√(0.7/50) = 39.4 ± 0.12
Independent Samples
If the data of the two samples are unrelated or independent, our approach
is slightly different. The mean for each group is computed and then the
difference of the means is estimated and its confidence interval calculated.
The standard error of the difference of two means is given by the
following:
SE( X1 − X2 ) = (s2
1 n1 ) + (s22 n2 )
Pharmacy C479 61
Sometimes the quantity we want to estimate about a population is a
proportion. If the outcome variable is categorical or dichotomous, this is
the case. We use the sample proportion p to estimate the population
proportion. We denote the population proportion with π, pronounced
“pie.” The variance of the sample proportion is [p(1 – p)]/n.
Example 2
A simple random sample of 100 people yields the following: 75 of the people have
brown hair and the remaining 25 have some hair color other than brown. From this we
can estimate p (the population parameter for brown hair) as 0.75. The variance of this
binomial is [(.75)(.25)]/100 = 0.001875. The standard error is √(0.001875) = 0.043,
The 95% confidence interval for the proportion of the population with brown hair is
1 1 1 1
Variance[ln(OR)] ≅ + + + ,
a b c d
The 95% confidence interval for the odds ratio is given by exponentiating both parts of
the following:
ln(1.43) ± 1.96(.1526)
eln(1.43) ±1..96(.1526) = (1.06 ⇒ 1.93)
Notice the confidence interval is quite narrow here and that it does not include 1. We
therefore can conclude an association between vitamin A intake as defined here and
breast cancer.
Pharmacy C479 63
ASSIGNMENT #5
Le and Boen, chapter 4: exercises 2–4, 6–8, 13, 15, 20
Objectives
At the end of this lesson, you should be able to
o explain the concept of a hypothesis test;
o state a research question as a null and alternative hypothesis;
o explain the meanings of Type I and Type II errors;
o differentiate between a one-tailed and two-tailed significance test
and explain when each is appropriate to use;
o perform a hypothesis test; and
o describe the relationship between a hypothesis test and a
confidence interval.
Key Terms
Type I error
Type II error
Errors
When we set up a study, we hope that we will draw the correct
conclusions from the data. However, this is not always the case. We can
make two kinds of errors when we make decisions about data. Consider
that there are only two possibilities in our drug trial: either the new drug
performs better than the current drug therapy or it does not. Given only
these options, we could correctly conclude that the drug performs better
than the current drug when it in fact does, or we could erroneously
conclude that it does not perform as well as the current drug. This is called
a Type II error: the mistake of not finding a difference when one exists. If
no difference existed between the two drugs, we could also draw two
different conclusions. We could correctly state there was no difference, or
we could make another type of error. The other error would be to conclude
Example 1
We want to investigate the study question “Mothers with a low socioeconomic status
(SES) deliver babies whose birth weights are lower than normal.” Let the normal mean
weight for newborns be 120 ounces. Our null hypothesis and alternative hypotheses
are as follows:
H0: µlw = µ0 (mean for low SES is the same as the average mean birth weight)
HA: µlw < µ0 (mean for low SES is less than the average mean birth weight)
Note that we are only interested in the birth weight from low socioeconomic status
women being less than normal so we will express our alternative hypothesis with only
that option and use a one-sided test. Suppose we observe a mean of 100 ounces in our
sample with a standard error of 10. What should we conclude?
100 − 120
We calculate a z-score: z= = 2.0
10
A z-score of 2.0 has a p-value of .045. This means that if the null hypothesis were true,
we would get a value this extreme (or different) only 4.5% of the time. This leads us to
reject the null hypothesis that there is no difference in the mean birth weights of the
two groups.
Pharmacy C479 69
A p-value can be thought of as a measure of the strength of the evidence in
favor of the null hypothesis. A large p-value supports the null hypothesis,
while a small p-value does not provide much support for the null
hypothesis. A p-value greater than .10 is called “not significant.” A p-
value less than .01 is called “significant.” A p-value between .01 and .10 is
interpreted in the context of the study, with a value less than .05
commonly considered “significant.”
In the preceding birth weight example, the 95% confidence interval for
birth weight would be
Our observed value of 100 did not fall in the range of the confidence
interval, and therefore we rejected the null hypothesis.
Pharmacy C479 71
LESSON ELEVEN
BASIC STATISTICAL TESTING
Reading Assignment
Objectives
At the end of this lesson, you should be able to
o compute a statistical test to compare two proportions;
o explain the concept of a matched sample;
o explain the concept of independent samples;
o compute a statistical test to compare several independent
samples;
o compute a statistical test to compare two means from matched
data; and
o compute a statistical test to compare two means from
independent samples.
Key Terms
population proportion
chi-square
McNemar’s chi-square test
Introduction
Once we have gathered the data for a study, our next step is to analyze the
data to discover what conclusions we can draw. Choosing an appropriate
statistical test is an important part of the analysis process. This unit will
introduce you to some of the most common statistical tests for a variety of
types of outcome variables.
p − π0
z=
π 0 (1 − π 0 )
n
When the data from which the proportions are being tested are matched,
our approach is a little bit different. A two-by-two table can be constructed
with each column and row entry being a combination of possible
outcomes.
For example,
Observatio
n Time 1
+ –
Observation + a b
Time 2
– c d
In this table, d represents the number of people who did not have the
characteristic of interest at either Time 1 or Time 2. People who had the
characteristic at Time 1 but not at Time 2 are represented by c, and so on.
To test if the proportions are the same at the two points in time, we can
compute a z-score using
(b − c)
z=
b+c
When a two-tailed test is used the above value is squared to give a statistic
called a chi-square (denoted χ 2). This test, called McNemar’s chi-square,
is quite common. When a one-tailed test is used, the preceding z-score is
used and rejected at 1.645 (α = .05). When a two- sided test is used, the
rejection region is χ 2 ≥ 3.84 (which is 1.962).
Pharmacy C479 73
If we are interested in comparing proportions from two independent
samples we calculate a z-score for their difference. Our null hypothesis is
stated as “H0: π1 – π2 = 0” or π 1 = π 2.
p2 − p1
z=
p(1 − p)(1 / n1 + 1 / n2 )
x−µ
t=
SE( x )
x1 − x2
t=
SE( x1 − x2 )
Limitations
When you are doing statistical tests, it is important to remember their
limitations. If enough tests are done, one will probably give a significant
result, by chance alone. For example, if you were testing 20 different
blood characteristics between two groups, and using a .05 significance
level, you could expect to find one significant result even if there were no
true differences. When we know we need to conduct a lot of tests, we
sometimes set more stringent levels for significance. For example, instead
of using a .05 cut point for significance, we might use .01. This allows us
to believe that a significant result really indicates a difference between
groups and is less likely to be a Type I.
Pharmacy C479 75
ASSIGNMENT #8
Le and Boen, chapter 6: exercises 1–3, 6–11, 17, 18, 20, 21, 24
Objectives
At the end of this lesson, you should be able to
o explain the meaning of a correlation coefficient;
o interpret different values of a correlation coefficient;
o perform a statistical test on a correlation coefficient;
o explain the concept of linear regression; and
o explain the meaning of the parameters generated using linear
regression.
Key Terms
correlation coefficient
regression
Correlation
We have discussed the methods for testing for an association between
discrete variables and in this unit will address the methods for testing for
an association between continuous variables. When we are interested in
determining a relationship between two continuous measures we compute
their correlation coefficient. A correlation is a numeric value describing
the degree of association. Correlation values can range from –1 to 1. If
two variables have a correlation coefficient equal to a large positive value
(e.g., 0.9) the implication is that as one variable increases so does the
other. A scatter plot of the data would increase from left to right, as in the
following diagram:
-1
-2
-3
-2 -1 0 1 2
-1
-2
-3
-2 -1 0 1 2
We use the correlation coefficient from sample data in much the same way
as we use our sample mean. That is, we use the sample correlation
coefficient (denoted by r) to estimate the true population correlation
coefficient ρ (“rho”). We can statistically test if ρ is equal to zero, which,
if true, would mean that there is no association between the variables of
interest. (This is similar to the statistical tests we do on a sample mean.)
We could state “H0: ρ = 0” as our null hypothesis and “HA: ρ ≠ 0” as our
alternate hypothesis. The statistical test we would use is
n−2
t=r ,
1 − r2
Regression
If we determine that ρ ≠ 0, we can use a technique called regression to
determine a relationship (described mathematically) between our two
variables. The goal in regression is to accurately predict the value of one
variable called Y or the dependent variable from the other called X or the
independent variable. Y is called the dependent variable because the value
we predict for Y depends on the corresponding value of X that we know.
Pharmacy C479 79
All statistical packages easily compute correlation coefficients and
perform regression analysis. It is most important that you understand the
concept of what is being done rather than the specific computations.
Pharmacy C479 81
LESSON THIRTEEN
STATISTICAL TESTING
Reading Assignment
Objectives
At the end of this lesson, you should be able to
o explain why nonparametric tests are useful and when it is
appropriate to use them;
o compute a Wilcoxon rank-sum test;
o compute a Wilcoxon sign test; and
o compute a Spearman’s rank correlation test.
Key Terms
Wilcoxon rank-sum test
Wilcoxon sign test
Spearman’s rank correlation test
Nonparametric Methods
We have considered many statistical methods and tests that are commonly
used. Many of these tests require certain assumptions to be made to be
valid, such as normality or near-normality of a distribution. We often are
willing to make these assumptions when we are not sure if they are in fact
true. In reality, most of the tests we have talked about are quite robust
(meaning fairly uninfluenced) to some non-normality in the data.
However, extreme non-normality may affect the validity of the statistical
tests used and the corresponding conclusions we draw from them.
It is with this problem in mind that we introduce some nonparametric
tests. Nonparametric tests, as the name suggests, do not involve a statistic
or parameter from the data. They require that no assumptions about
normality of the data be made. Usually the inferences drawn using
nonparametric tests are about the distribution as a whole rather than the
value of any particular statistic (the mean, for example).
n!( p) x (1 − p)n − x
P=
x!(n − x )!
Example 1
Under the null hypothesis of no difference between T 1 and T2, we would expect the
chance that the sign of the difference di is positive (or negative) to be 0.5. This p-value
provides little evidence in support of our null hypothesis (D = 0), and so we reject it.
R − µR
z= ,
σR
where R is the sum of the ranks from one of the original samples and µR =
n1(n1 + n2 + 1)/2. The standard deviation of R, which is denoted by σR, is
given by n1n2 (n1 + n2 + 1) / 12 . For sample sizes n1 and n2 of 10 or larger,
the statistic is approximately normal and the null hypothesis should be
rejected at the .05 level if z is larger than 1.96 or smaller than –1.96. There
are also tables of rank-sum probabilities to which one can refer in order to
come up with a p-value. These may be used for smaller sample sizes also.
Pharmacy C479 83
thereby diminishing the effect of any particular data point. The first
variable, call it X, is ranked from smallest to largest. The same thing is
done for the second variable, Y. Then we compute a correlation coefficient
of the ranks rather than of the data values as we have done previously.
This process is called the Spearman’s rank correlation.
Nonparametric tests are useful in any situation, but especially when little
is known about a distribution or when assumptions for other tests are
clearly not met. If a nonparametric test is used when in fact the data are
distributed normally, the resulting p-value will be very close to the p-value
obtained using a parameter-dependent test.
Pharmacy C479 85
LESSON FOURTEEN
PREPARING FOR THE FINAL EXAMINATION
This test covers all chapters covered in the course. Remember that it is a
closed-book, closed-note test. However, you will be allowed one page of
formulae to help you during the exam, and you may also bring a
calculator. To help you complete the exam, a copy of appendixes A, B,
and C from the Le and Boen text will be provided. You should find the
test problems to be similar to the homework problems. You will have two
hours to complete the examination.