Professional Documents
Culture Documents
Understanding Preference Shifts - A Review and Alternate Explanation of Within-Trial Contrast and State-Dependent Valuation
Understanding Preference Shifts - A Review and Alternate Explanation of Within-Trial Contrast and State-Dependent Valuation
2 (Fall)
Stimuli that are followed by aver- ent mechanisms to explain the phe-
sive events may become less preferred nomenon and thus makes different
than stimuli that are followed by predictions (discussed later), in gen-
nonaversive events. It is also possible, eral both predict that organisms will
however, for stimulus preference to demonstrate a preference for stimuli
be altered by the events that precede that follow less preferred events rela-
the presentation of the stimulus. This tive to stimuli that follow more
phenomenon has been investigated preferred events.
under a variety of names, most com- In one of the first studies examining
monly, within-trial contrast (WTC) this phenomenon, Clement, Feltus,
and state-dependent valuation (SDV). Kaiser, and Zentall (2000) exposed
Both WTC and SDV are conceptual eight pigeons to two chain-schedule
models that essentially describe an conditions that differed by the
effect wherein exposure to a less amount of effort required.1 In the
preferred event increases preference first condition (see Figure 1), pigeons
for stimuli that follow that event. were exposed to a fixed-ratio (FR) 1
Although each model invokes differ- schedule with a lit center key (initial
179
180 JAMES N. MEINDL
Figure 1. Training trials adapted from Clement et al. (2000). A key peck to Circ produced
either an FR 1 or FR 20 schedule followed by discrimination trials. A key peck to S+ resulted in
food reinforcement. A key peck to S2 resulted in no food reinforcement.
or did not directly measure prefer- conceptual perspective was not iden-
ence but rather relied on preference tifiable for one study (Armus, 2001).
rating scales. Also excluded was Because the procedures used in these
research on the concorde fallacy and studies were quite similar despite
sunk cost effects (see Arkes & Ayton, different conceptual underpinnings,
1999, for discussion). Although these the articles will be collapsed and
concepts are somewhat related to WTC discussed together. (See Appendix
and SDV, the procedures are quite A for a breakdown of each article
dissimilar and may not examine the and to examine effect by individual
same behavior-change phenomenon. experiment.)
Subject types. Of the 22 experi-
RESULTS AND DISCUSSION ments that successfully demonstrated
the effect, 18 were conducted with
Generality of the Findings
various nonhumans including pi-
In total, 23 articles were identified geons, locusts, grasshoppers, rats,
containing a total of 38 separate banded tetras, and starlings. The
experiments. Of these 38 experiments, remaining four experiments demon-
22 (across 18 separate articles) dem- strated the effect with humans, both
onstrated an increase in preference children and adults. Of the 16 exper-
for a stimulus that followed a less iments that were unable to replicate
rather than more preferred event, and the findings, all were conducted with
16 (across 8 separate articles) failed nonhuman organisms including star-
to replicate the phenomenon. Note lings, pigeons, or rats. These data
that some articles contained both suggest that the phenomenon is
replications and failures to replicate. general and not a species-specific
Of the 23 total articles, six appeared characteristic.
to interpret the phenomenon from Middle components. Across the 22
the conceptual perspective of SDV experiments that successfully demon-
(Aw, Holbrook, Burt de Perera, & strated the effect, a variety of events
Kacelnik, 2009; Kacelnik & Marsh, have been programmed for or re-
2002; Marsh, Schuck-Paim, & Kacel- quired in the middle component. For
nik, 2004; Pompilio & Kacelnik, example, the effect has been docu-
2005; Pompilio, Kacelnik, & Behmer, mented when the preceding events
2006; Waite & Passino, 2006), and 15 were high versus low effort (Alessan-
interpreted the phenomenon from dri, Darcheville, Delevoye-Turrell, &
the perspective of WTC (Alessandri, Zentall, 2008; Aw et al., 2011; Cle-
Darcheville, Delevoye-Turrell, & ment et al., 2000; Clement & Zentall,
Zentall, 2008; Alessandri, Darche- 2002; Friedrich & Zentall, 2004;
ville, & Zentall, 2008; Arantes & Kacelnik & Marsh, 2002; Klein et
Grace, 2008; Clement et al., 2000; al., 2005), high versus low probabili-
Clement & Zentall, 2002; DiGian, ties of reinforcement (Clement &
Freidrich, & Zentall, 2004; Friedrich, Zentall, 2002; Gipson et al., 2009),
Clement, & Zentall, 2005; Friedrich short versus long delays to the termi-
& Zentall, 2004; Gipson, Miller, nal component Alessandri, Darche-
Alessandri, & Zentall, 2009; Klein, ville, Delevoye-Turrell, et al., 2008;
Bhatt, & Zentall, 2005; O’Daly, Alessandri, Darcheville, & Zentall,
Meyer, & Fantino, 2005; Singer, 2008; Clement et al., 2000; DiGian
Berry, & Zentall, 2007; Vasconcelos et al., 2004; O’Daly et al., 2005),
& Urcuioli, 2008, 2009; Vasconcelos, preferred versus less preferred sched-
Urcuioli, & Lionello-DeNolf, 2007). ules of reinforcement (Singer et al.,
One study directly compared empir- 2007), the absence or presence of
ical predictions from both conceptual reinforcement (Friedrich et al.,
frameworks (Aw, Vasconcelos, & 2005), and low versus high states of
Kacelnik, 2011). Finally, a clear food deprivation (Aw et al., 2009;
182 JAMES N. MEINDL
Marsh et al., 2004; Pompilio & they noted a tendency towards pref-
Kacelnik, 2005; Pompilio et al., erence change, this tendency was not
2006; Vasconcelos & Urcuioli, 2008). statistically significant. Second, an
Although the effect has been most effect has been found when using a
robustly documented with food dep- delay as the less preferred event (e.g.,
rivation, in each of the above cases, Alessandri, Darcheville, & Zentall,
preference was increased for stimuli 2008), which would seem to counter
that followed the less preferred event. both the argument concerning conti-
Thus, the effect does not appear to be guity between middle and terminal
limited to conditions that involve components and the argument that
differential response requirements energy expenditure is required. Final-
(or more generally, effort) alone. ly, researchers have used experimen-
Although the effect has been pro- tally naive pigeons and employed
duced with a variety of events in overtraining but still failed to replicate
middle components, there have also previous findings (e.g., Vasconcelos &
been many failures to replicate the Urcuioli, 2009).
effect. Studies that have failed to There are other possible explana-
produce the effect have examined tions for some replication failures.
middle components such as high Armus (2001), for example, conduct-
versus low effort (Arantes & Grace, ed experiments using two differently
2008; Armus, 2001; Friedrich & flavored pellets (grape and bacon) as
Zentall, 2004; Vasconcelos & Ur- terminal components following dif-
cuioli, 2009; Vasconcelos et al., ferent amounts of effort. Repeated
2007; Waite & Passino, 2006), long exposure to different food items may
versus short delay (Aw et al., 2011), increase preference (Wardle, Herrera,
and high versus low states of food Cooke, & Gibson, 2003), and it is
deprivation (Vasconcelos & Urcuioli, possible that this repeated exposure
2008). masked or influenced the results of
Explanations for replication failures. the experiment.
Explanations for these failures to
replicate the phenomenon have been Procedural Variations
numerous. Singer et al. (2007) sug- Across Experiments
gested that the effect is slow to
develop. Zentall (2008) suggested a There are a variety of procedural
variety of possible reasons for failure differences across the various exper-
to replicate, including that overtrain- iments that have examined the phe-
ing is required, that the terminal nomenon. It is possible that these
component must be contiguous with variations contribute to the differenc-
the middle component, and that prior es in experimental findings. The
exposure to lean schedules influences following sections describe several
the effect. Finally, Aw et al. (2011) procedural variations regarding the
suggested that the effect occurs only presentation of the initial and termi-
when the middle component requires nal components across the reviewed
energy expenditure. studies.
For each suggested explanation, Initial component. A stimulus that
however, there is at least one study precedes a less preferred event may
that demonstrates that explanation to become a conditioned aversive stim-
be insufficient. First, Arantes and ulus and function as a punisher (e.g.,
Grace (2008) provided an amount of Vorndran & Lerman, 2006). Because
training equal to that used in other a change in preference for stimuli
successful studies and were unable to presented in the terminal component
reproduce the results. Further, Vas- is presumed to be directly related to
concelos and Urcuioli (2009) provided preference for the event in the middle
extensive overtraining, but although component, it is necessary to be able to
UNDERSTANDING PREFERENCE SHIFTS 183
attribute the preference change to those 3 did not include initial components
two components alone. If the initial (Armus, 2011; Vasconcelos & Ur-
component is a less preferred or cuioli, 2008, Experiment 2; Waite &
aversive event, this may confound Passino, 2006).
interpretations by either adding or Of the 13 experiments that used
subtracting from the effect. It is there- identical stimuli in the initial compo-
fore important to distinguish between nents, four produced an effect and
studies that used distinct initial com- nine did not. Of the 13 experiments
ponents (in which differential prefer- that used different stimuli in the
ence could develop) and those that used initial components, 10 produced an
identical initial components. effect and three did not. Thus,
Of the experiments that successful- whether the stimuli used in the initial
ly demonstrated the phenomenon, component influence the contrast
10 used distinct initial components effect is currently unclear, but the
(Alessandri, Darcheville, Delevoye- data suggest the possibility that they
Turrell, et al., 2008, Phases 1 and 2; do. Studies directly examining this
Alessandri, Darcheville, & Zentall, possibility include, for example, Di-
2008; Aw et al., 2011, Experiment 3; Gian et al. (2004) who used different
Clement & Zentall, 2002, Experi- initial components (vertical or hori-
ments 1, 2, and 3; Friedrich et al., zontal lines) for one group of pigeons
2005; O’Daly et al., 2005, Experiment and identical initial components
2; Singer et al., 2007), 4 used identical (white keys) for a different group of
initial components (Clement et al., pigeons. Although both groups sig-
2000; Friedrich & Zentall, 2004, nificantly preferred the stimulus that
Experiment 1; Gipson et al., 2009; followed longer delays, the first
group (different initial components)
Klein et al., 2005), 2 used both
displayed a greater degree of prefer-
distinct and identical initial compo- ence change than the second group
nents for different groups (DiGian (identical initial components). This
et al., 2004; O’Daly et al., 2005, may indicate that the initial compo-
Experiment 1), and 6 did not include nent exerts some effect in the condi-
initial components (Aw et al., 2009; tioning process, although such a
Kacelnik & Marsh, 2002; Marsh et conclusion is speculative at this point.
al., 2004; Pompilio & Kacelnik, 2005; Terminal component. The presenta-
Pompilio et al., 2006; Vasconcelos & tion of terminal component stimuli
Urcuioli, 2008, Experiment 1). Those varied among studies primarily based
that did not include initial compo- on whether the researchers presented
nents investigated the effects of dif- a single stimulus after each condition
ferent levels of food deprivation, and or presented two stimuli together in a
an initial component was not feasible. discrimination task. In 12 of the 22
Of the 16 experiments that were experiments that demonstrated an
unable to replicate the phenomenon, effect, a discrimination task followed
3 used distinct initial components the middle component (Alessandri,
(Aw et al., 2011, Experiment 2; Darcheville, Delevoye-Turrell, et al.,
Vasconcelos & Urcuioli, 2009, Ex- 2008, Phases 1 and 2; Alessandri,
periments 1 and 2), 9 used identical Darcheville, & Zentall, 2008; Cle-
initial components (Arantes & Grace, ment et al., 2000; Clement & Zentall,
2008, Experiments 1 and 2; Aw et al., 2002, Experiments 1 through 3;
2011, Experiment 1; Friedrich & DiGian et al., 2004; Friedrich et al.,
Zentall, 2004, Experiment 2; Vascon- 2005; Gipson et al., 2009; Klein et al.,
celos et al., 2007, Experiments 1 2005; Singer et al., 2007, Experi-
through 5), 1 used both distinct and ment 2). The remaining 10 studies
identical initial components (Vascon- presented a single stimulus after each
celos et al., 2007, Experiment 6), and condition. Of the 16 experiments that
184 JAMES N. MEINDL
Figure 2. ‘‘A model based on change in relative hedonic value, proposed to account for
within-trial contrast effects. According to the model, trials begin with a relative hedonic state, H;
key pecking results in a negative change in hedonic state, H 2 DH1 for FR 1 and H 2 DH20 for
FR 20; obtaining a reinforcer results in a positive change in hedonic state, H + DHRf; the net
change in hedonic state depends on the difference between H + DHRf and H 2 DH1 on an FR 1
and between H + DHRf and H 2 DH20 on an FR 20 trial.’’ (Zentall, 2005, p. 280).
ule, very little attention is given to the Arantes, J., & Grace, R. C. (2008). Failure to
effect the schedule, and correspond- obtain value enhancement by within-trial
contrast in simultaneous and successive
ing work requirements, have on the discriminations. Learning & Behavior, 36,
value of the reinforcer. The studies 1–11.
reviewed here indicate that the work Arkes, H. R., & Ayton, P. (1999). The sunk
requirement may alter the value of cost and concorde effects: Are humans less
the corresponding reinforcer. Care, rational than lower animals. Psychological
Bulletin, 125, 591–600.
therefore, should be exerted when Armus, H. L. (2001). Effect of response effort
selecting and delivering reinforcers on the reward value of distinctively flavored
because the schedule requirements food pellets. Psychological Reports, 88,
may either increase or decrease the 1031–1034.
Aw, J. M., Holbrook, R. I., Burt de Perera, T.,
value of those stimuli. & Kacelnik, A. (2009). State-dependent
A second implication is related to valuation learning in fish: Banded tetras
the current understanding of the prefer stimuli associated with greater past
interaction between MOs and the deprivation. Behavioral Processes, 81, 333–
function-altering effect. The value- 336.
Aw, J. M., Vasconcelos, M., & Kacelnik, A.
altering effect of an establishing (2011). How costs affect preferences: Exper-
operation is understood as ‘‘an in- iments on state dependence, hedonic state
crease in the current [italics added] and within-trial contrast in starlings. Animal
effectiveness of some stimulus, object, Behaviour, 81, 1117–1128.
Clement, T. S., Feltus, J. R., Kaiser, D. H., &
or event as reinforcement’’ (Cooper, Zentall, T. R. (2000). ‘‘Work ethic’’ in
Heron, & Heward, 2007, p. 376). pigeons: Reward value is directly related to
Food deprivation is presumed to the effort or time required to obtain the
momentarily increase the effectiveness reward. Psychonomic Bulletin & Review, 7,
of food as a reinforcer. When the 100–106.
Clement, T. S., & Zentall, T. R. (2002).
level of food deprivation subsides, the Second-order contrast based on the expec-
effectiveness of food as a reinforcer tation of effort and reinforcement. Journal
diminishes. The research reviewed of Experimental Psychology, 28, 64–74.
here, however, suggests that the Cooper, J. O., Heron, T. E., & Heward, W. L.
(2007). Applied behavior analysis (2nd ed.).
function-altering effect interacts with Upper Saddle River, NJ: Pearson Education.
MOs. That is, the food is made more DiGian, K. A., Freidrich, A. M., & Zentall,
valuable when it is presented during T. R. (2004). Discriminative stimuli that
food deprivation, and that increase follow a delay have added value for pigeons.
in value persists into the future. Psychonomic Bulletin & Review, 11,
889–895.
The finding that less preferred Fantino, E. (1977). Conditioned reinforce-
events alter preference for subsequent ment: Choice and information. In W. K.
stimuli warrants further inquiry. Fu- Honig & J. E. R. Staddon (Eds.), Handbook
ture research into this phenomenon of operant behavior (pp. 313–363), Engle-
wood Cliffs, NJ: Prentice Hall.
may help explain how stimulus pref- Friedrich, A. M., Clement, T. S., & Zentall,
erences develop with greater preci- T. R. (2005). Discriminative stimuli that
sion than is currently available, and follow the absence of reinforcement are
may allow a greater understanding of preferred by pigeons over those that follow
how the environment and behavior reinforcement. Learning & Behavior, 33,
337–342.
interact and affect one another. Friedrich, A. M., & Zentall, T. R. (2004).
Pigeons shift their preference toward loca-
REFERENCES tions of food that take more effort to
obtain. Behavioral Processes, 67, 405–415.
Alessandri, J., Darcheville, J.-C., Delevoye- Gipson, C. D., Miller, H. C., Alessandri, J. J. D.,
Turrell, Y., & Zentall, T. R. (2008). & Zentall, T. R. (2009). Within-trial contrast:
Preference for rewards that follow greater The effect of probability of reinforcement in
effort and greater delay. Learning & Behav- training. Behavioral Processes, 82, 126–132.
ior, 36, 352–358. Kacelnik, A., & Marsh, B. (2002). Cost can
Alessandri, J., Darcheville, J.-C., & Zentall, increase preference in starlings. Animal
T. R. (2008). Cognitive dissonance in Behavior, 63, 245–250.
children: Justification of effort or contrast? Klein, E. D., Bhatt, R. S., & Zentall, T. R.
Psychonomic Bulletin & Review, 15, 673–677. (2005). Contrast and justification of effort.
UNDERSTANDING PREFERENCE SHIFTS 191
Psychonomic Bulletin & Review, 12, 335– test of within-trial contrast. Learning &
339. Behavior, 36, 12–18.
Marsh, B., Schuck-Paim, C., & Kacelnik, A. Vasconcelos, M., & Urcuioli, P. J. (2009).
(2004). Energetic state during learning Extensive training is insufficient to produce
affects foraging choices in starlings. Behav- the work-ethic effect in pigeons. Journal of
ioral Ecology, 15, 396–399. the Experimental Analysis of Behavior, 91,
Michael, J. (2004). Concepts and principles of 143–152.
behavior analysis. Kalamazoo, MI: Associ- Vasconcelos, M., Urcuioli, P. J., & Lionello-
ation for Behavior Analysis. DeNolf, K. M. (2007). Failure to replicate
O’Daly, M., Meyer, S., & Fantino, E. (2005). the ‘‘work ethic’’ effect in pigeons. Journal
Value of conditioned reinforcers as a of the Experimental Analysis of Behavior, 87,
function of temporal context. Learning and 383–399.
Motivation, 36, 42–59. Vorndran, C. M., & Lerman, D. C. (2006).
Pompilio, L., & Kacelnik, A. (2005). State- Establishing and maintaining treatment
dependent learning and suboptimal choice: effects with less intrusive consequences via
When starlings prefer long over short delays a pairing procedure. Journal of Applied
to food. Animal Behavior, 70, 571–578. behavior Analysis, 39, 35–48.
Waite, T. A., & Passino, K. M. (2006).
Pompilio, L., Kacelnik, A., & Behmer, S. T.
Paradoxical preferences when options are
(2006). State-dependent learned valuation
identical. Behavioral Ecology and Sociobiol-
drives choice in an invertebrate. Science, ogy, 59, 777–785.
311, 1613–1615. Wardle, J., Herrera, M. L., Cooke, L., &
Schlinger, H. D., & Blakely, E. (1994). A Gibson, E. L. (2003). Modifying children’s
descriptive taxonomy of environmental food preferences: The effects of exposure
operations and its imlpications for behav- and reward on acceptance of an unfamiliar
ior analysis. The Behavior Analyst, 17, vegetable. European Journal of Clinical
43–57. Nutrition, 57, 341–348.
Singer, R. A., Berry, L. M., & Zentall, T. R. Zentall, T. R. (2005). A within-trial contrast
(2007). Preference for a stimulus that effect and its implications for several so-
follows a relatively aversive event: Con- cial psychological phenomena. International
trast or delay reduction. Journal of the Journal of Comparative Psychology, 18,
Experimental Analysis of Behavior, 87, 273–297.
275–285. Zentall, T. R. (2008). Within-trial contrast:
Vasconcelos, M., & Urcuioli, P. J. (2008). When you see it and when you don’t.
Deprivation level and choice in pigeons: A Learning & Behavior, 36, 19–22.
192 JAMES N. MEINDL
APPENDIX
Initial Middle
Author Population component component
Alessandri, Phases 2–3 Phase 2 Preferred vs.
Darcheville, N 5 30 different stimuli nonpreferred effort
Delevoye-Turrell, undergraduate
and Zentall, 2008 students
Phase 3 10-s delay vs.
different stimuli nonpreferred effort
Alessandri, N 5 42 children Different Stimuli No delay vs. 8-s delay
Darcheville, and age 7–8 years
Zentall, 2008
Arantes and Experiment 1 Experiment 1 FR 1 vs. FR 20
Grace, 2008 N 5 10 pigeons identical stimuli
DiGian, Friedrich, N 5 16 pigeons Identical stimuli 0-s delay vs. 6-s delay
and Zentall, Different stimuli
2004
APPENDIX
Extended
MC initial Terminal component TC initial
preference assessed? discrimination task? pref assessed? Effect found?
Yes Yes; simultaneous No Yes for S+
APPENDIX
Continued
Initial Middle
Author Population component component
Gipson, Miller, N 5 16 pigeons Identical stimuli FR 1 vs. FR 30 50% or
Alessandri, and 100% reinforcement
Zentall, 2009
APPENDIX Extended,
Continued
No Experiments 1–5 No No
Yes; Simultaneous
No Yes; simultaneous No No
No No No Yes
No Yes; simultaneous No No
No No; color-coded No No
foraging
opportunities