Professional Documents
Culture Documents
Aba Lecture 2
Aba Lecture 2
Target Behavior
Learning Objectives
After studying this chapter, you will be able to:
Six-year-old Darren was extremely uncooperative and nonverbal instructions to his mother, such as pushing her
commanding with his parents. In the hope of learning how into a chair or saying such things as “You go over there
to deal more effectively with his excessive commanding and I’ll stay here” or “No, that’s wrong. Do it this way.”
behavior, Darren’s parents took him to the Gatzert Child Cooperative behavior was defined as any non-command-
Developmental Clinic at the University of Washington. ing statements, actions, or questions, such as “Do you
According to his parents, Darren virtually “ran the show” want to sit here?” while pointing to a chair. Darren showed
at home, deciding when he would go to bed, what foods a very low rate of cooperative behavior during the two
he would eat, when his parents could play with him, and observation sessions. His commanding behavior, on the
so on. To obtain direct observations of Darren’s behavior, other hand, occurred at an extremely high rate. Following
both cooperative and uncooperative, Dr. Robert Wahler the observation sessions, Darren’s mother was asked to
asked Darren’s mother to spend some time with Darren apply a treatment in which she was to be very positive
in a playroom at the clinic. The playroom was equipped and supportive of any instances of cooperative behavior
with adjoining observation rooms with one-way windows shown by Darren. At the same time, she was instructed to
for data recording. During the first two 20-minute completely ignore his commanding behavior. During two
observation sessions, Darren’s mother was instructed: treatment sessions, Darren’s cooperative behavior steadily
“Just play with Darren as you might at home.” Darren’s increased. During the same time, his commanding
commanding behavior was defined as any verbal or behavior decreased to near zero.
Throughout this book, numerous examples, like Darren’s case, illustrate the effectiveness of behavior modification
procedures. Many of these examples are accompanied by graphs showing the changes—increases or decreases—
that occurred in behavior when particular procedures were applied. Some of the graphs also include follow-up
observations indicating that the improvements were maintained after the programs had terminated. The graphs are
presented not just to make it easier for you to understand the material; precise records of behavior represent an
inseparable part of behavior modification procedures. Indeed, some have gone so far as to say that the major con-
tribution of behavior modification has been the insistence on accurately recording specific behaviors and making
decisions based on recorded data rather than merely on subjective impressions. As stated in Chapter 1, behavioral
Chapter 3 Defining, Measuring, and Recording Behavior 25
assessment involves the collection and analysis of information and data in order to (a) identify and describe tar-
get behavior; (b) identify possible causes of the behavior; (c) select appropriate treatment strategies to modify the
behavior; and (d) evaluate treatment outcome. After reviewing the “Minimal Phases of a Behavior Modification
Program” below, the rest of this chapter will focus on Part (a) of behavioral assessment.
Defining-the-Target-Behavior Phase
As stated in Chapter 1, behaviors to be improved in a behavior modification program are called target behaviors; these
are specific behaviors to decrease (i.e., behavioral excesses) or to increase (i.e., behavioral deficits). As we saw in Darren’s
case, his commanding behavior (a behavioral excess) and his cooperative behavior (a behavioral deficit) were specifi-
cally defined. Three of the many examples from later chapters include increasing an adult’s daily exercising, decreasing
a figure skater’s negative self-talk, and increasing a seventh-grader’s work rate in math class. Assessment procedures for
identifying and describing target behaviors are described later in this chapter.
that the problem is being helped. If a child is having difficulty in school, for example, the behavior modifier—most likely
an applied behavior analyst—would be especially interested in obtaining a baseline of specific behavioral excesses (e.g.,
disruptive behavior) or deficits (e.g., a reading deficiency) that constitute the problem. Further details about the sources of
information for the baseline phase are given later in this chapter.
Treatment Phase
After making a precise baseline assessment, an applied behavior analyst or behavior therapist will design and apply
a program, as in Darren’s case, to bring about the desired behavior change. In educational settings, such a program is
typically referred to as a training or teaching program. In community and clinical settings, the program is often called an
intervention strategy or a therapy program.
Behavior modification programs typically provide for frequent observation and monitoring of the target behav-
ior during training or treatment to a degree rarely found in other approaches. In addition, applied behavior analysts
and behavior therapists strongly emphasize changing the program if the measurements taken indicate that the desired
change in behavior is not occurring within a reasonable period of time.
Follow-up Phase
Finally, a follow-up phase is conducted to determine whether the improvements achieved during treatment are main-
tained after the program’s termination. When feasible, this will consist of observations in the natural environment—i.e.,
the home—or under circumstances in which the behavior is expected to occur.
TABLE 3.1 Examples of Questions an Applied Behavior Analyst or a Behavior Therapist Typically
Asks During an Intake Interview
the advantages of being convenient, not requiring an inordinate amount of time, and potentially providing informa-
tion about covert behaviors. However, they suffer from the disadvantages that those providing information might
not remember relevant observations accurately or have biases that would influence them to provide inaccurate data.
Questionnaires
A well-designed questionnaire provides information that might be useful in assessing a client’s problem and developing
a behavioral program tailored to the client. A large number of such questionnaires are available. Many of these, includ-
ing questionnaires for couples, families, children, and adults, can be found in a two-volume compendium complied by
Fischer and Corcoran (2007a, 2007b). Several types of questionnaires are popular with applied behavior analysts and
behavior therapists.
Life history questionnaires provide demographic data, such as marital status, vocational status, and religious affiliation,
and background data, such as sexual, health, and educational histories. A notable example is The Multimodal Life History
Inventory (Lazarus & Lazarus, 2005).
Self-report problem checklists have the client indicate from a detailed list those problems that apply to him or her.
Such questionnaires are particularly useful in helping the therapist completely specify the problem or problems for
which the client is seeking therapy.
An example of a self-report behavioral checklist is one developed by Martin and Thomson (2010) for helping
young figure skaters identify problems at practice that might require sport psychology consultation (see Figure 3.1).
Asher, Gordon, Selbst, and Cooperberg (2010) developed a variety of checklists for clinical work with children and
adolescents experiencing a wide range of diagnoses, including attention-deficit hyperactivity disorder (ADHD) and
mood disorders.
Survey schedules provide the therapist with information needed to conduct a particular therapeutic technique
with the client. The questionnaire shown in Table 6.2 provides information useful in applying positive reinforcement
procedures. Other types of survey schedules are designed to provide information preparatory to using other behavioral
procedures (for examples, see Asher et al., 2010).
Third-party behavioral checklists or rating scales allow a client’s significant others and professionals to subjectively
assess the frequency and quality of certain client behaviors. An example of such a checklist is the Discrete-Trials Teaching
Evaluation Form (Fazzio, Arnal, & Martin, 2010) which can be used to reliably evaluate the quality of one-on-one training
sessions conducted by a behavior modifier with a child with autism spectrum disorder (Jeanson et al., 2010).
28 Part I The Behavior Modification Approach
FIGURE 3.1 A questionnaire to assess areas in need of help during a seasonal sport psychology program for figure skaters
It should be noted that the term frequency of behavior can be used to refer to two related but different things.
It can refer to the number of instances of a behavior, as in the above example. It can also refer to the number of
instances of a behavior in a given time period, or rate, as indicated in Chapter 1. In this text, we will use frequency
as synonymous with rate unless otherwise indicated. (For further discussion of the distinction between the terms
frequency and rate in behavior modification, see Merbitz, Merbitz, & Pennypacker, 2016; Carr, Nosik, & Luke, 2018.)
Role-Playing
If it is not feasible for the therapist to observe the client in the actual situation in which the problem occurs, an alter-
native is to re-create that situation or certain crucial aspects of it in the therapist’s office. That, essentially, is the ratio-
nale behind role-playing—the client and therapist enact interpersonal interactions related to the client’s problem
(described in Chapter 20). For example, the client might enact being interviewed for a job with the therapist playing
the role of the interviewer. Role-playing is frequently used both in assessing a problem and in treating it (discussed
further in Chapters 20, 23, and 26; also see Spiegler & Guevremont, 2010, pp. 77–80).
Client Self-Monitoring
Self-monitoring—the direct observation by the client of his or her own behavior—is the next best thing to the thera-
pist’s direct observation. We mention it under indirect assessment procedures because the therapist does not observe
the behavior directly. Thus, as with the other indirect assessment procedures, the therapist cannot have as much
confidence in the observations as would be the case if a trained observer had made them.
Except for covert behavior, behaviors that might be self-monitored are the same as those that a trained observer
would observe directly. These are described later in this chapter. Self-monitoring might also aid the discovery of the
causes of the problem behavior, as discussed in Chapter 22. Additional examples of self-monitoring are provided in
Chapter 25.
Chapter 3 Defining, Measuring, and Recording Behavior 29
Topography of Behavior
The topography of a response is the specific movements involved in making the response. For example, Stokes, Luiselli,
and Reed (2010) analyzed the movements of effective tackling in high school football as 10 distinct components (e.g.,
head up, wrap arms around ball carrier’s thighs, etc.).
Picture prompts are sometimes useful for helping observers to identify variations in the topography of a response.
One of the authors developed detailed checklists with picture prompts for evaluating swimming strokes of young com-
petitive swimmers. See Figure 3.2 for the backstroke checklist.
Frequency of Behavior
Frequency or rate of behavior refers to the number of instances of a behavior that occur in a given period of time.
If you want to improve the practice performance of figure skaters, you might examine the frequency with which the
skater performed various jumps and spins. That was the approach taken by Michelle Hume, a figure skating coach at
St. Anne’s Figure Skating Club in Manitoba (Hume, Martin, Gonzales, Cracklen, & Genthon, 1985). Coach Hume first
defined jumps and spins in such a way that student observers could decide when either of those responses occurred.
A jump was defined as any occasion when a skater jumped in the air so that both skates left the ice, a minimum of one
complete revolution occurred in the air, and the skater landed on one foot, facing in the opposite direction without
falling. A spin was defined as spinning on one skate for a minimum of three revolutions while maintaining a balanced,
30 Part I The Behavior Modification Approach
stationary position. When the observers knew what behaviors to look for, Coach Hume’s next step was to take a baseline
of the number of jumps and spins each individual skater performed during several practices. The observers used the
data sheet shown in Figure 3.3.
The baseline performance of one of the figure skaters can be seen in Figure 3.4. This type of graph is called a fre-
quency graph. Each data point represents the total number of elements (jumps plus spins) completed by a skater during
a practice session. Following baseline, Coach Hume introduced a treatment program. A chart prepared for each skater
contained a checklist of all jumps and spins that he or she should be practicing. These charts were posted at the side of
the rink. Pointing to the charts, Coach Hume said to the skaters,
Each practice session, do the first three elements on your chart and then record them here. Then practice the
next three elements and record them. Continue in this way until you’ve practiced all the elements. Then go
Chapter 3 Defining, Measuring, and Recording Behavior 31
FIGURE 3.3 A sample data sheet for recording jumps and spins at figure skating practices
FIGURE 3.4 Frequency graph of the number of elements (jumps and spins) per session performed by a figure skater during
baseline and treatment (self-recording)
150
Elements Attempted
100
50
0
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35
Sessions
through the whole routine again until the end of practice. At the end of practice, I will check your charts to see
how you are doing.
The self-charting program combined with positive feedback from Coach Hume at the end of each practice was
effective in improving the number of jumps and spins performed (see Figure 3.4). Interestingly, when the treatment
was discontinued, performance decreased to near baseline levels. When the treatment was reinstated, performance
again improved.
It is sometimes convenient to design a recording sheet that also serves as a final graph. Consider the case of
a child who frequently swore at the teacher and teacher aides in the classroom. The teacher decided to record this
behavior using the chart shown in Figure 3.5. Each time the teacher or teacher aides observed an instance of swearing,
they were to ignore the child and go to the front desk and place an X in the appropriate place on the chart.
The instances of swearing were recorded up the side of the graph and the days of the program were recorded
across the bottom (see Figure 3.5). Each time an instance of swearing occurred, the staff would simply add an X for the
appropriate day to the number of Xs that were already on the chart for that particular day. The graph shows clearly that
many instances of swearing occurred during the first 10 days. Beginning on day 11, the teacher or teacher aide decided
to praise the child at the end of each 15-minute period in which swearing did not occur. The result can be clearly seen:
Swearing showed an immediate drop and eventually decreased to zero. (Note: These data are hypothetical and are
32 Part I The Behavior Modification Approach
FIGURE 3.5 Jackie’s swearing behavior. Each X represents one swear word.
12 × × × × × × × × × ×
11 × × × × × × × × × ×
10 × × × × × × × × × ×
9 × × × × × × × × × ×
8 × × × × × × × × × ×
7 × × × × × × × × × ×
6 × × × × × × × × × ×
5 × × × × × × × × × ×
4 × × × × × × × × × × × ×
3 × × × × × × × × × × × ×
2 × × × × × × × × × × × × × × × ×
1 × × × × × × × × × × × × × × × × × × × × ×
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
Days
presented to illustrate the plotting procedure.) This type of graph is useful for those who do not have the time to transfer
their data from their data sheet to a graph.
Each instance of a behavior that is recorded in terms of frequency, such as jumping or spinning as defined for
the figure skaters, is a separate, individually distinct behavior that is easy to tally in a given period. Behavior modifiers
have recorded the frequency of such behaviors as saying a particular word, swearing, throwing objects, completing
arithmetic problems, chewing mouthfuls of food, taking puffs on a cigarette, and exhibiting nervous twitches.
Each of these behaviors has the following two characteristics that make successive instances of the behavior rela-
tively easy to record: (1) they are relatively brief, and (2) the amount of time that it takes to perform them is about the
same from one occasion to the next.
Duration of Behavior
While frequency or rate is the most common measure of the amount of a given behavior, another common measure is
its duration. The duration of behavior is the length of time from the beginning to the end of an episode of behavior. In
dealing with a behavior such as a tantrum, you may be more concerned with its duration than with its frequency. In fact,
frequency can be ambiguous when trying to apply it to something like temper tantrums (Pear, 2004). What should be
counted as a separate response? Each cry, scream, or kick on the floor? Or should we count each episode of a tantrum as
a separate response? Because it is usually difficult to answer these questions, we can generally avoid them by focusing on
the duration of the behavior. Other examples of behavior for which duration may be more appropriate than frequency
are listening attentively, sitting in one’s seat in a classroom, watching television, talking on the telephone, and taking a
break. Duration of behavior is measured using timers, stopwatches, or clocks.
Intensity of Behavior
Sometimes we are concerned with measuring the intensity, magnitude, or force of a response. Assessments of intensity
often utilize instrumentation. For example, when voice loudness is the behavior of concern, decibel level can be mea-
sured by a device called a voice meter. To measure the strength of grip pressure—such as during a handshake—a device
called a dynamometer can be used. Measures of force are common in the skills involved in various sports. For example,
devices are available that assess the speed at which a pitcher can throw a baseball, a hockey player can shoot a puck, or
a tennis player can perform a serve (e.g., Robinson & Robinson, 2016). The speed of an object as determined by such
devices is used to deduce the force with which it was propelled.
Latency of Behavior
The latency of a behavior, often called reaction time, is the time between the occurrence of a particular event or cue
and the start of that behavior. For example, suppose a child in a classroom works effectively once started, but seems to
fool around forever before starting. This child has a long latency getting started. Like duration, latency is assessed using
timers, stopwatches, or clocks.
Quality of Behavior
Concern about the quality of a behavior is frequently encountered in everyday life. Teachers might describe the quality
of a child’s handwriting as good, average, or poor. In judgmental sports—i.e., sports in which human judgement regard-
ing the occurrence of the desired response is required—such as specific dives off a diving board, gymnastic movements,
and figure skating jumps, athletes receive points based on the quality of their performances. We all make resolutions to
do various activities “better.” But quality is not a characteristic additional to those mentioned previously. Rather, it is a
refinement of one or more of them. Sometimes differences in judgments of quality are based on topography, as when
a figure skating jump that is landed on one foot is considered better than one that lands on two feet. Sometimes it is a
combination of frequency and stimulus control. For example, someone who is considered a good student is likely to
show a high frequency of studying and answering test questions correctly. A child who is said to be “good” shows a high
frequency of following instructions from parents and teachers. In terms of latency, a runner who leaves the blocks very
quickly after the firing of the starter’s pistol might be considered to have a “good” start, while a runner who shows a lon-
ger latency had a “poor” start. Thus, quality of response is essentially an arbitrary—and often subjective—designation of
one or more of the previously mentioned characteristics of behavior that has functional or social value. An important
aim of behavior analysis is to make measures of behavior more objective, as the effectiveness of behavior modification
procedures depends critically on the precise specification of the behavior to be treated.
21. Define stimulus control. Give an example of it that is not in this chapter.
22. What do we mean by the latency of a response? Describe an example that is not in this chapter.
23. Using an example, explain how the quality of a behavior is a refinement of one or more of the other dimensions of
behavior.
Observer 1
Observation Intervals (10 seconds each)
Total
5 10 15 20 25 30 35 40 45 Behavior
Touching
/ / / / / / / / / / / / / / / / / / 18
Vocalizing
/ / / / / / / / / / / / /
Observer 2
Observation Intervals (10 seconds each) Total
5 10 15 20 25 30 35 40 45 Behavior
/ / / / / / / / / / / / / / / / / / 18
Touching
Vocalizing
/ / / / / / / / / / / / / / / /
Chapter 3 Defining, Measuring, and Recording Behavior 35
child might be concerned about the frequency of the child’s self-stimulation behavior, like rocking back and
forth while sitting. It might be useful to have records of when this behavior occurs and for how long it occurs
throughout the child’s waking hours, but usually this is not realistic. An alternative is for the parent to seek out
the child once every hour and make a note of whether the child shows any rocking behavior while sitting during
a 15-second observation interval. Each observation interval is separated from the next by approximately 1 hour.
This type of observational technique enables one observer to record one or more behaviors of one or more indi-
viduals even though the observer has many other commitments during the day. A special case of time sampling is
referred to as momentary time sampling, in which a behavior is recorded as occurring or not occurring at specific
points in time, such as every hour on the hour rather than during specific brief intervals (e.g., see McIver, Brown,
Pfieffer, Dowda, and Pate, 2009). (For a comparison of momentary time sampling and partial-interval recording,
see Meany-Daboul, Roscoe, Bourret, & Ahearn, 2007.) For a summary of the various recording strategies dis-
cussed above, see Table 3.2.
24. Define continuous recording. Describe an example that is not in this chapter.
25. Define interval recording. Distinguish between partial-interval recording and whole-interval recording.
26. When would one likely select to use continuous recording?
27. When would one likely select an interval recording system over continuous recording?
28. Define time-sampling recording. Describe an example that is not in this chapter.
29. Briefly describe momentary time-sampling recording.
Wearable activity trackers that operate on their own or can transfer data to smartphones have been developed to
automatically record physical activity, such as number of steps walked (e.g., see Loprinzi & Cardinal, 2011; Hendrikx
et al., 2017; Mercer, Li, Giangregorio, Burns, & Grindrod, 2016; Rye Hanton et al., 2017; Schrager et al., 2017). Mobile
phone apps have been developed for self-recording dietary intake by individuals desiring to lose weight (Carter, Bur-
ley, & Cade, 2017; Tay, Garland, Gorelik, & Wark, 2017), for self-monitoring blood glucose levels by individuals with
diabetes (Bonoto et al., 2017), for self-monitoring other physiological activity such as brain waves, cardiovascular
function, and breathing for individuals suffering from difficult-to-manage stress (Christmann, Hoffmann, & Bleser,
2017), and for self-recording adherence to medical prescriptions (Bain et al., 2017).
Such apps are in the process of being developed and improved and—given the strongly increasing trend of own-
ership of desktop computers, laptops, tablets, and smartphones by the general public (e.g., see Pew Research Center,
2017)—will probably be widely available in the near future (e.g., see Crutchfield, Mason, Chambers, Wills, & Mason,
2015; Heron & Smyth, 2010; Préfontaine, Lanovaz, McDuff, McHugh, & Cook, in press).
straightforward uncomplicated measure of IOA is needed and overall performance rather than individual instances
of a behavior is important.
Another IOA procedure is used with partial-interval recording. Recall that partial-interval-recording procedures
can record one and only one response during each brief period of time—usually 5 or 10 seconds—over an extended
observation period (see Table 3.2). If we have two independent observers recording the same behavior and each is using
a partial-interval-recording procedure, then the question is how their corresponding intervals compare with regard to
those that contain a response versus those that do not. Let’s suppose that two observers are recording two types of social
interaction for one child. The behaviors are defined as touching another child and vocalizing in the direction of the other
child. Their partial-interval recordings are shown in Figure 3.6.
As you can see, both observers counted the same number of instances of touching: 18. However, the two observers
agreed on only 16 of these 18 instances. Each counted two instances that the other missed, yielding a total of four disagree-
ments. If we used the procedure described previously, we would obtain an IOA of 100%. However, in the second procedure,
the IOA is obtained by dividing the number of intervals on which the two observers agree that the behavior occurred by
the total number of intervals on which either recorded a behavior—agreements divided by agreements plus disagreements
on the occurrence of a behavior—and multiplying by 100%. Thus, in this instance, the second procedure would yield an
IOA of 80%.
When individual instances of a behavior are important, point-by-point agreement measures are recommended in
order to better ensure that the agreements and disagreements being measured are about specific instances of the behav-
ior. For discussions on point-by-point agreement and ways to measure it, see Kazdin (2011, pp. 103–105) and Yoder and
Symons (2010, pp. 141–182).
Typically, by convention, IOA scores between 80% and 100% are considered acceptable in behavior modification
programs. Potential variation in computational procedures, however, renders the final IOA value potentially misleading
when considered by itself. We suggest that readers of behavior modification literature consider the response definitions,
observer-training procedures, recording system, method of calculating IOA, and the final IOA value as a total package
when judging the reliability of reported data. Defects in any of these make the results suspect (see Barrett, 2009).
36. Distinguish between a frequency ratio index (or total accuracy agreement) and point-by-point agreement.
37. What is an acceptable IOA in a behavior modification program?
38. What is procedural reliability or treatment integrity? What is an acceptable level of procedural reliability in a
behavior modification program?
FIGURE 3.7 Monitoring and charting performance can serve at least six functions. Can you name them?
coatroom was indicated along the side. Each day, a circle was marked on the chart to indicate the amount of time spent
in the coatroom in the morning, and a small paper rabbit was attached to the most recent circle. Using simple language,
the therapist explained the procedure to the child and concluded by saying, “Now let’s see if you can get the bunny
down to eat the carrots.” When the rabbit was down to the level of the carrots, the child was encouraged to keep it there.
“Remember, the longer the bunny stays in the carrot patch, the more it can eat.” A follow-up showed that the improved
behavior persisted over a period of one year.
Behavior modifiers were not the first to discover the usefulness of recording one’s behavior to help modify it. As
with many other supposedly “new” psychological discoveries, the real credit should perhaps go to the writers of great
literature. For example, novelist Ernest Hemingway used self-recording to help maintain his literary output (Plimpton,
1965). Author Irving Wallace also used self-recording even before he was aware that others had done the same. In a book
touching on his writing methods (1971, pp. 65–66), he made the following comment.
I maintained work charts while writing my first four published books. These charts showed the date I started
each chapter, the date I finished it, and number of pages written in that period. With my fifth book, I started
keeping a more detailed chart which also showed how many pages I had written by the end of every working
day. I am not sure why I started keeping such records. I suspect that it was because, as a free-lance writer, entirely
on my own, without employer or deadline, I wanted to create disciplines for myself, ones that were guilt-making
when ignored. A chart on the wall served as such a discipline, its figures scolding me or encouraging me.
Summary of Chapter 3
A successful behavior modification program typically involves five phases during which the target behavior is identi-
fied, defined, and recorded: (a) a screening or intake phase; (b) a defining the target behavior phase; (c) a pre-program
or baseline-assessment phase; (d) a treatment phase; and (e) a follow-up phase. Behavioral assessment procedures for col-
lecting information to define and monitor target behaviors fall into three categories: indirect, direct, and experimental.
Indirect assessment procedures are assessments for which the behavior modifier or trained observer does not directly
observe the target behavior in the actual situation in which the behavior occurs. Indirect assessment procedures include
interviews with the client and the client’s significant others, questionnaires, role-playing, and client self-monitoring.
Direct assessment procedures are assessments in which the behavior modifier or trained observer directly observes and
records the target behaviors in the actual settings in which the behavior occurs. Experimental assessment procedures
are used to clearly reveal the controlling variables—or causes—of the problem behavior.
Characteristics of behavior for direct assessment include (a) the topography of a response, which is the specific
movements involved in making the response; (b) the frequency or rate of behavior, which refers to the number of
instances of behavior that occur in a given time interval; (c) the duration of behavior, which is the length of time from
the beginning to the end that an instance of the behavior occurs within some period; (d) the intensity of behavior, which
refers to the magnitude or force of behavior; (e) the stimulus control of behavior, which refers to the degree of cor-
relation between a specific event and a subsequent response; (f) the latency of behavior, which is the time between the
occurrence of a specific event or cue and the start of that behavior; and (g) the quality of behavior, which is a refinement
of one or more the above characteristics.
Strategies for recording a behavior include (a) continuous recording or event recording, which is the recording of
every instance of the behavior during a designated observation period; (b) interval recording—either partial-interval or
whole-interval recording—which registers the behavior as either occurring or not occurring during short intervals of
equal duration; and (c) time-sampling recording, which registers a behavior as occurring or not occurring during very
brief observation intervals that are separated from each other by a much longer period of time.
Interobserver agreement (IOA) estimates, which compare the recording of behavior by two or more observ-
ers, are used to assess the accuracy of observations of behavior. There are five categories of error that can affect the
accuracy of observation and five sources of bias and artifact that can lead to inaccuracies when recording observa-
tions. The success of a behavior modification program depends on collecting accurate data during the assessment or
baseline phase and throughout the program. In addition to assessing IOA, it is also important to assess procedural
reliability or treatment integrity in a behavior modification program. If the treatment is not the one the behavior
modifier intended to give, no valid conclusion can be drawn about it, regardless of the accuracy of the data. In addi-
tion, if the treatment is not the intended treatment, it is unlikely to be as effective as the intended treatment or may
even be harmful.
Application Exercise
Self-Modification Exercise
Pick a behavioral excess of yours that you would like to decrease. Design a self-monitoring data sheet that you could use to
record instances of the behavior over a one-week period to serve as a baseline. On your data sheet, set it up so that you can
record antecedents of each instance of the behavior, the behavior itself, and consequences of each instance of the behavior.
1. Martin, Toogood, and Tkachuk (1997) developed a manual of self-report problem checklists for sport psychology
consultation. For example, the sport psychology checklist for basketball players asks such questions as: “Would
you say that you need to improve at tuning out negative thoughts, staying relaxed, and not getting too nervous
just before or during a game?” and “Do you need to improve at identifying and reacting to your opponents’
weaknesses and making adjustments as the game progresses?” The manual includes self-report problem checklists
Chapter 3 Defining, Measuring, and Recording Behavior 41
for 21 sports. Each checklist contains 20 items to identify areas in which an athlete might need to improve before
or during competitions, 5 items to identify areas in which an athlete might need to improve on postcompetition
evaluations, and 17 items to identify areas in which an athlete might need to improve at practices. These behavioral
checklists are not like traditional psychological tests such as the Wexler Adult Intelligence Scale (Wexler, 1981) or
the 16-Personality Factor Inventory (Cattell, Eber, & Tatsuoka, 1970). Behavioral checklists do not have norms,
nor are they designed to measure character or personality traits. Rather, such behavioral assessment tools provide
information necessary to design effective interventions for remediating deficits or excesses in specific situations with
individual athletes. While formal research on these tools has been limited, those checklists that have been formally
researched have shown high test–retest reliability and high face validity (Leslie-Toogood & Martin, 2003; Lines,
Schwartzman, Tkachuk, Leslie-Toogood, & Martin, 1999). Feedback from athletes and sport psychology consultants
who have used various checklists in the manual mentioned here has been uniformly positive concerning their value
in obtaining useful behavioral assessment information.
2. When the observational method affects the behaviors being observed, we say that the observation is obtrusive.
To record observations unobtrusively means that the observations do not cause those being observed to deviate
from their typical behavior. Our observations should not influence the behavior we are observing. You can ensure
that your observations are unobtrusive in several ways. One possibility is to observe the behavior from behind
a one-way window, as illustrated in the case of Darren at the beginning of this chapter. Another possibility is to
inconspicuously observe individuals from a distance, such as that used to study the drinking habits of patrons
in bars (Sommer, 1977). Another method is to have a confederate or co-observer make observations while
working side by side with a client in a normal work setting (Rae, Martin, & Smyk, 1990). Other alternatives
include recording the behavior with a hidden camera and evaluating products of the client’s behavior, such as
items littered in a public campground Osborne & Powers, 1980. However, such tactics raise a moral question: Is
it ethical to observe individuals without their consent? The American Psychological Association, in its Ethical
Principles of Psychologists and Code of Conduct, Including 2010 Amendments, has developed a set of ethical
guidelines governing all types of experiments by psychological researchers (see www.apa.org/ethics/code/index.
aspx?item=11#803). Of particular relevance to this note, Standard 8.03, titled Informed Consent for Recording
Voices and Images in Research, states,
Psychologists [must] obtain informed consent from research participants prior to recording their voices or
images for data collection unless (1) the research consists solely of naturalistic observations in public places,
and it is not anticipated that the recording will be used in a manner that could cause personal identification
or harm, or (2) the research design includes deception, and consent for the use of the recording is obtained
during debriefing.
Anyone contemplating recording the behavior of another person should consult the ethical guidelines of his or her
professional organization and the applicable laws pertaining to privacy and confidentiality. (See also Chapter 29 of
this text.)
3. The procedure that we have suggested for computing IOA during interval recording is that of dividing the number
of intervals on which observers agree that the behavior occurred by the total number of intervals on which either
observer recorded the occurrence of the behavior—agreements plus disagreements on the occurrence of the
behavior—and multiplying by 100%. Some researchers include instances in which the observers agreed that the
behavior did not occur in the measure of agreements—in other words, agreements on blank intervals. When very
few behaviors have been recorded, however, this can greatly inflate a reliability score. For example, consider the 45
observation intervals given in Figure 3.6. Let’s suppose that observer 1 recorded an instance of touching during
interval 5 and that observer 2 recorded an instance of touching during interval 6. No other instances of touching were
recorded. In such a case, the two observers would disagree completely on the occurrence of the behavior, and the IOA
would be zero if computed as agreement only on instances in which the behavior occurred. However, if agreements
on blank intervals are included, the IOA equals 43 agreements divided by 43 agreements plus 2 disagreements times
100%, which equals 95.6%. Because of this distortion, many researchers do not count agreements on blank intervals.
In other words, intervals in which neither observer scores the behavior as having occurred are ignored. An acceptable
exception to this would be when one is concerned with decreasing a behavior and it is important to have agreement
that the behavior did not occur. However, this requires making a decision that in many cases could be subjective and
arbitrary. Perhaps the best solution would be to routinely report IOA for both occurrences and non-occurrences
of the target behavior separately. See Bailey and Burch (2018, pp. 91–111) regarding the above points and other
comments on the complexity of computing IOA.
42 Part I The Behavior Modification Approach
Notes
1. This example is based on an article by Wahler, Winkel, Peterson, and Morrison (1965).
2. The authors are grateful to Nancy Staisey for providing us with the details of this procedure.