Professional Documents
Culture Documents
Web Dev Research
Web Dev Research
Web Dev Research
MANOJ THOMAS
ELLIE J. KYUNG
1274
THOMAS AND KYUNG 1275
monetary constructs such as contingent valuation, willing- that consumers have stable valuations for only a few ba-
ness to pay (WTP), internal reference prices, and latitude sic things that they frequently encounter, and that for
of price acceptance (Adaval and Monroe 2002; Burson, most real-world situations they construct valuations dur-
Larrick, and Lynch 2009; Monroe 1971), but the extant lit- ing decision making. Second, we assume that people use
erature is silent on the effect of these response formats on a nonlinear mental number line to evaluate monetary
measurement. To measure these constructs, some research- values.
ers have used open-ended text boxes (Ward and Dahl
2014), some have used discrete semantic differential scales Constructing Price Judgments
(O’Guinn, Tanner, and Maeng 2015), and others have used
The constructivist account of valuations (Bettman,
continuous slider scales (Galak et al. 2014). We examined
Luce, and Payne 1998; Burson et al. 2009; Fischhoff
experiments measuring willingness to pay in the Journal of
1991) posits that people’s valuations of goods are subjec-
FIGURE 1
the mental number line. This mental number line does not Recalibrating the Mental Number Line
resemble the linear arithmetic ruler. Unlike a linear arith-
We propose that category limens on the mental number
metic ruler with evenly spaced markers, the mental ruler in
line are not only nonlinear but also context dependent—
our minds is not calibrated linearly or with uniformly
specifically, that the natural category limens on the mental
spaced markers. Instead, it is an unbounded line with
number line can unconsciously shift depending on response
diminishing discriminability: the same objective numerical
difference is seen as larger when closer to the starting point format. When people use a text box to submit a price re-
of the mental number line than if it is away from the start- sponse, they are likely to rely on the natural nonlinear num-
ing point. Just as Ernst Weber and his student Gustav ber line in their mind. Their judgments about the boundaries
Fechner postulated that the ability to recognize a just-no- between low, medium, and high prices are calibrated with
ticeable difference with sensory stimuli depends on the respect to the starting price, with responses anchored on this
magnitude of the initial stimuli (e.g., people easily detect starting point (see figure 1). For example, if a buyer can bid
the difference between a 15- vs. 30-watt lamp, but not the anywhere between $200 (starting price) and $1,000 (retail
difference between a 120- vs. 135-watt lamp), numerical price) for a laptop using an open-ended text box, then her
cognition researchers have posited that the mental number low bid might be around $300, her medium bid might be
line used to evaluate numeric stimuli follows a similar re- around $400, and her high bid around $600. Note that all of
sponse function. Specifically, Dehaene et al. (2008) sug- these values are assimilated toward the starting price on the
gested that our brain perceives the change between 10 and lefthand side of the mental number line.
20 as greater than that between 20 and 30, and the change In contrast, slider scales provide a visual representation
between 20 and 30 as greater than that between 30 and 40, of a line with salient starting and end points. We propose
even though the arithmetic differences are identical. Thus, that when using a slider scale, people judge price magni-
the psychological impact of increasing a price from $10 to tudes by assessing the visual distance from the starting as
$20 will be greater than that of increasing it from $30 to well as from the end point of the slider scale. Consistent
$40. This has implications for calibration of the mental with a dual-process model of price cognition (Monroe and
number line: the positions of the boundaries between eval- Lee 1999; Thomas and Morwitz 2009a, 2009b), we pro-
uative categories that serve as discrimination thresholds pose that while the reflective part of the brain deals with
between low, medium, and high values are not uniformly deliberative numeric comparisons, the reflexive part of the
spaced. To use a term from psychophysics (Johnson brain spontaneously assesses the visual distances from the
1944), these category limens are compressed toward the starting and end points of the slider scale. Price magnitude
starting value of the mental number line as depicted in judgments are influenced by the distance from the starting
figure 1. point (“how far is it from the lowest possible value”) as
THOMAS AND KYUNG 1277
FIGURE 2
well as the distance from the end point of the scale (“how H1a: In ascending payment formats, values elicited through
close is it to the highest possible value”). Thus, slider scales a slider scale will be higher than those elicited through a
change the boundaries between evaluative categories—the text box.
category limens—on the mental number line. Relative to Our theorizing does not suggest that slider scales will al-
the category limens elicited by a text box, the category ways elicit higher values; in fact, our theory posits that the
limens elicited by a slider scale are more stretched toward direction of the end point assimilation effect will reverse
the end point of the slider. Continuing with the illustrative for descending payment formats. In a descending payment
bidding example introduced earlier, if a buyer can bid any- format, where prices submitted are lower than a starting
where between $200 and $1,000 using a slider scale, then a price (e.g., Priceline), people adjust the prices downward
low bid might be around $300, a medium bid might be from the list or retail price. In such cases, the category
around $600, and a high bid around $800. limens on the mental number line will be marked relative
More generally, we propose that when people use a text to this highest possible response. As depicted in figure 2,
box for an ascending payment format, where price magni- when people use a text box for descending payment for-
tudes input are higher than some starting point, if Pmin is mats, if Pmax is the starting price, then the category limens
the lowest possible price that one can pay, then category Phigh, Pmedium, and Plow will be marked relative to Pmax
limens Plow, Pmedium, and Phigh will be marked relative to and assimilated toward Pmax. However, when people use a
Pmin, such that these limens are assimilated toward Pmin. slider scale, the visual distances from Pmax (starting point)
However, when people use a slider scale for an ascending and Pmin (end point) are salient, reducing the assimilation
payment format, the visual distances from both starting toward Pmax and increasing the assimilation toward Pmin.
point Pmin and end point Pmax are perceptually more sa- Thus, in a descending payment format, values elicited
lient, reducing the assimilation toward Pmin and increasing through a slider scale will be lower than those elicited
the assimilation toward Pmax. People submitting a high through a text box. Formally,
price response will consider not only whether the value is
far enough from Pmin but also whether it is close enough to H1b: In descending payment formats, values elicited
Pmax. This shift in category limens caused by the slider for- through a slider scale will be lower than those elicited
mat is depicted in figure 1. Therefore, in an ascending pay- through a text box.
ment format, where payment values are higher than the
starting point (e.g., eBay), people are likely to submit
higher values when they use a slider scale versus a text box
Proximity to the End Point
because the markers that identify categories are shifted to The novel insight that underlies our conceptualization is
the right toward the end point. We refer to this as the slider that slider scales create a bias in magnitude judgments by
scale end point assimilation effect. Formally, increasing the salience of the visual distance to the end
1278 JOURNAL OF CONSUMER RESEARCH
point of the response scale.1 If visual distance from the end length of the slider scale. It is this linear calibration of the
point is the root cause of more extreme responses on slider slider scales that makes it different from the mental num-
scales, then it follows that the extent of the bias will de- ber, which is inherently nonlinear in nature. This postulate
pend on proximity to the slider end point. Proximity to an suggests that a nonlinear slider scale wherein numerical
end point can make visual distance from that end point per- values are a convex function of distance from the starting
ceptually more salient. This postulate suggests that the end point should mimic the natural mental number line and at-
point assimilation effect will depend on the response mag- tenuate the slider scale end point assimilation effect.
nitude. For ascending payment formats (figure 1) category A second factor that can influence the visual distances
limens for higher price values, which are visually closer to from the end points of the slider is the response range. If
the end point (right), are more likely to assimilate toward the length of a slider scale in an ascending payment format
this perceptually salient end point than lower price values. remains the same but the value of the end point increases
Visual Distance from the End Point As we embarked on this research, in addition to the
metal number line recalibration account, we considered
The key difference between the slider scale and text box several alternative accounts for why slider scales might
formats is that with the former, people rely on visual dis- elicit different responses from text boxes. Two of them are
tance for magnitude judgments, whereas with the latter, particularly plausible and pertinent: the response momen-
they rely on numeric comparisons. When people use a text tum account and the asymmetric range awareness account.
box, they evaluate magnitudes by comparing values to the While the mental number line recalibration account is
starting point of the response range. This is a numeric com- based on the numerical cognition literature, the response
parison process. But when people use slider scales, they momentum account is inspired by the notion of psycholog-
consider visual distances from the starting point as well as ical momentum in human kinetics (Markman and Guenther
the end point of the response continuum as cues for magni- 2007), which suggests that slider scales might intensify
tude judgment. This postulate suggests that relative to text responses because of a natural inclination to continue the
box responses, slider scale responses should be more sus- physical sliding movement once it has started in a particu-
ceptible to contextual factors that alter the visual distances lar direction. People may end up pulling the slider knob
from the end point of the response range. We consider two farther than they intended relative to a starting point, and
such factors that can change the visual distance from end past a value they might indicate in a text box, because they
points and thus moderate the slider scale end point assimi- are caught up in the momentum of the movement itself.
lation effect. Like the mental number line recalibration account, the re-
The first factor is distance calibration of the slider sponse momentum account also predicts that responses eli-
scale—that is, the change in numeric response when the cited on a slider scale will be more extreme than those
slider knob moves by a unit distance. Note that slider elicited through a text box. But unlike the previous ac-
scales usually have a linear calibration, such that when the count, the response momentum account suggests that the
slider knob moves by one unit, the numeric response effect will be restricted to scales that entail a sliding re-
changes by a fixed amount. The change in numeric re- sponse and will not generalize to discrete scales that simu-
sponse per unit distance remains constant throughout the late line visualization, but do not entail a sliding response
(e.g., response scales with radio buttons).
1 Here the term bias refers to the difference between the perceived The asymmetric range awareness account suggests that
magnitude primed by the slider scale and that primed by the text box. the difference between slider scale and text box responses
THOMAS AND KYUNG 1279
stems not from the use of visual distance as magnitude medians by conditions using nonparametric methods.
cues, but instead from varying awareness of the response Condition medians and the results of these analyses are
range information. It is conceivable that when people use reported in table 2. These supplementary analyses demon-
text boxes, they might not be aware of the full response strate that the results of the studies are robust to modeling
range, particularly the end point. If the effect of response and distributional assumptions. In the following sections
format is caused only by such information asymmetry, then we discuss results from 11 studies. Because of space con-
making text box users mindful of the response range straints, we describe results from only seven studies in de-
should easily alleviate it. However, our conceptualization tail. The results of the remaining studies are summarized in
suggests that even when text box users are aware of the re- the meta-analysis and web appendixes G and H.
sponse range, responses elicited through slider scales can
be different from text box responses because of inherent EXPERIMENT 1A: MUG AUCTION
TABLE 1
both the conditions. They were asked, “Do you under- “Place Bid” button (see figure 3). Finally, participants
stand how to submit your bid?” (yes/no). Once they con- were asked, “In the instructions that you just read about
firmed they understood the instructions, they were shown placing your bid, you were told that you could enter any
the product information for the insulated mug and asked response between which two numbers?” ($0 to $100, $0
to enter their bid. The participants then entered their re- to $150, $0 to $200) and answered demographic
sponse on a slider scale or text box and clicked on the questions.
THOMAS AND KYUNG 1281
TABLE 2
Response magnitude Text box Convex slider Slider Wilcoxon two-sample tests
Text box vs. slider Text box vs. convex slider
Median Median Median Z p Z p
Low 260.0 269.5 287.5 –3.67 <.001 –.86 .20
Medium 350.0 373.0 425.0 –3.39 <.001 –1.27 .10
High 500.0 550.0 600.0 –1.81 .04 –.40 .34
6. Moderation by end point size (ascending payment format: wine)
Results format as the independent measure and bid price as the de-
pendent measure. It revealed a significant effect of re-
Bids. We conducted the analysis using PROC sponse format where participants in the slider condition bid
GLIMMIX specifying a Poisson distribution with response more on the insulated mug (M ¼ $11.40) than those in the
1282 JOURNAL OF CONSUMER RESEARCH
FIGURE 3 bid amount. The participant met with the experimenter and
exchanged $35 for the mug. However, because this was an
EXPERIMENT 1A STIMULI: TEXT BOX VERSUS SLIDER undergraduate student from the subject pool, after the
CONDITIONS
transaction was completed, we returned the money and let
the participant keep the mug as a token of goodwill.
Confound Check: Range Information. The vast major-
ity of participants (90%) correctly identified the range out-
lined in the experiment stimuli as $0 to $150, confirming
that attention to range information did not vary across the
two conditions. Furthermore, removing participants that in-
correctly identified the range did not change the results.
Method
Participants. Two hundred four US-based panelists on
Amazon Mechanical Turk (MTurk), verified by IP address,
participated in this study (52% female, Mage ¼ 36.3 years).
To avoid confounds from differences in screen size, partic-
ipants were able to complete the study only if they were on
a computer; mobile users were automatically blocked from
responding to the study. These procedures were followed
for all subsequent experiments.
Procedure. Participants recruited were told that they
would be paid 45 cents for completing a brief study about
text box condition (M ¼ $8.01, F(1, 125) ¼ 37.05, p < their interest in “donating to a particular charity for US veter-
.001, see table 1). ans.” The study was run on Veteran’s Day to make the char-
The results are robust to distributional assumptions and ity more relevant for all individuals. After starting the study,
the type of statistical test. The median values in the two they received the following additional instructions about re-
conditions and the nonparametric test are reported in ta- ceiving a $1 bonus amount, in addition to the 45 cents:
ble 2. The results from a similar analysis assuming normal
distribution are available in web appendix B. The means You will be paid 45 cents for completing this brief study.
without the outliers are reported in web appendix C. You will also be paid an additional $1 (as a bonus). This
The highest bid, $35, was submitted by two participants study is about your interest in donating to a particular char-
in the slider condition. One of these two participants was ity for US veterans. You will then be asked how much you
chosen as the winner and asked to buy the product for the are interested in donating to this charity.
THOMAS AND KYUNG 1283
They were then given a description of the Intrepid Fallen $0 and $150, and in experiment 1b participants knew that
Heroes Fund (see web appendix A.1B) and asked, “Please their donation value had to be between 0 and 100 cents. In
enter the number of cents from the $1 that you would like to addition, in experiment 1b all participants who chose to do-
donate to the Intrepid Fallen Heroes Fund. You can enter nate had to spend real money. Thus, the absence of range in-
any value from 0 to 100 cents.” All participants were also formation or inconsequentiality of the response is not a
given the average donation value: “In a previous version of necessary condition for the observed effect. In the next two
this study, the average donation was 17 cents.”2 Thus, the experiments, we take a closer look at the plausible mecha-
range information and the average donation value were nisms causing end point assimilation when using slider
made clear in both conditions. Underneath, participants scales—ruling out several potential alternative explana-
were shown either a text box or a slider scale to enter their tions—and use a more externally valid participative pricing
response. The end points of the slider scales were not num- context similar to that used by companies.
to elicit bids, then shoppers are likely to offer higher bids, movement, then the responses in both slider scale condi-
supporting hypothesis 1a. More importantly, the process tions should be lower than in the text box condition.
measures from this experiment suggest that we can rule out Note that in this study, we used the same process meas-
that the effect is caused by ease of use: participants found ures and replicated the results of experiment 2, but the ex-
both response formats—slider scales and text boxes— plication and results of these measures are included in web
equally easy to use. Furthermore, most participants were appendix D for the sake of brevity.
unaware of the surreptitious influence of response formats
on their bid values. Even when asked directly after making Method
their judgments whether response formats might have
Participants. Three hundred four US-based partici-
influenced their bids, only a small minority considered this
pants on MTurk participated in this computer experiment
possibility. This suggests that the psychological mecha-
in exchange for $1 (38% women; Mage ¼ 33.4 years). To
FIGURE 6
FIGURE 8 the same pattern (see table 2). However, this is the only
study in the set of studies wherein the results were contin-
EXPERIMENT 5: CALIBRATION OF THE TWO SLIDERS gent on distributional assumptions (see web appendix B).
We conducted follow-up analyses for this study to validate
the robustness of our results and also ran another post hoc
condition with a concave slider scale, details of which are
reported in web appendixes E and F, respectively.
Method
Participants. Four hundred thirteen US-based partici-
pants on MTurk participated in this experiment in ex-
change for 51 cents (51% women; Mage ¼ 37.1 years).
Procedures. The experiment used a 2 2 3 mixed-
as the dependent measure. The effects of response format factorial design with response format (text box, slider) and
(F(2, 259) ¼ 3.92, p ¼ .02), response magnitude (F(2, 518) response range (small, large) as randomly assigned
¼ 12, 157.10, p < .001), and the interaction between the between-subjects factors and response magnitude (low,
two (F(4, 518) ¼ 8.47, p < .001) were all significant (see medium, high) as a within-subjects factor. Participants
figure 9, table 1). The average bid in the linear slider con- were told that an online store has enabled consumers to bid
dition ($445) was higher than in the text box condition on wines and they could bid whatever they wanted, and
($403, t(259) ¼ 2.71, p < .01) and the convex slider condi- that sellers would honor bids that they find reasonable.
tion ($416, t(259) ¼ –1.96, p ¼ .05), while the average Several screens then explained the information they would
bids in the text box and convex slider conditions were not receive about the wines—this information included the
significantly different from each other (t(259) ¼ .76, p ¼ vintage of the wine, its name, the Wine Spectator rating,
.45). A nonparametric test of medians also demonstrated and a brief description with the starting bid price. Those in
1290 JOURNAL OF CONSUMER RESEARCH
the small range condition were told “You will see the start- FIGURE 10
ing bid for the wines. The starting bid is the amount sug-
gested by the seller to open the bid. Your bid has to be EXPERIMENT 6: MODERATION BY RESPONSE RANGE
higher than the starting bid. The maximum bid anyone can
submit is $500. So you can bid up to $500, if you want to.”
Those in the large range condition were given the exact
same information except they were told that the maximum
bid they could submit is $1,000. (See web appendix A.6
for screenshots). They were then told that they would be
asked to place a low, medium, and high bid for the de-
scribed wine. Next, they were presented with information
response than those in the small response range condition p < .001, indicating that this mean sample size effect is
(M ¼ 2.03, F(1, 409) ¼ 5.48, p ¼ .02). These results sug- statistically significant. The 95% confidence interval is .31
gest that participants in the text box conditions, who were to .46. The details around the descriptive statistics used in
presumably relying on deliberative numeric comparisons, the meta-analysis are summarized in web appendix G.
were conscious of the influence of the end points, whereas
those in the slider condition, who were influenced by auto- GENERAL DISCUSSION
matic visual distance comparisons, were oblivious to the
influence of the end points. Organizations and researchers have been using text
The results support hypotheses 2 and 3. Consistent with boxes and bounded scales—specifically slider scales—
hypothesis 3, the results show that slider scales, compared interchangeably to record consumers’ participative pricing
to text boxes, are more susceptible to the manipulation of decisions. The tacit assumption underlying this practice is
participative pricing decisions should be more mindful of experiments 1b, 4, 5, 6, and web appendix experiments A1,
the response format used for eliciting monetary payments. A2, A3, A4, and F, through the MTurk online panel. Data
In situations where consumers might consider offering were collected in 2015 through 2017. The first and second
higher prices, using a slider scale instead of a text box can authors separately and jointly analyzed the data from these
elicit higher payments from consumers. Retailers that auc- studies, confirming all results.
tion products, such as eBay or WineBid, are likely to see
higher revenues if they use a slider scale with a wide re- REFERENCES
sponse range. In a similar vein, service providers, such as
taxi services and restaurants, are likely to realize higher Adaval, Rashmi (2013), “The Utility of an Information Processing
revenues when they use a slider scale instead of a text box. Approach for Behavioral Price Research,” AMS Review, 3
However, in situations where consumers adjust their prices (3), 130–4.
Adaval, Rashmi and Kent B. Monroe (2002), “Automatic
Johnson, Donald M. (1944), “Generalization of a Scale of Values Raghubir, Priya and Joydeep Srivastava (2008), “Monopoly
by the Averaging of Practice Effects,” Journal of Money: The Effect of Payment Coupling and Form on
Experimental Psychology, 34 (6), 425–36. Spending Behavior,” Journal of Experimental Psychology:
Kahneman, Daniel (1992), “Reference Points, Anchors, Norms, Applied, 14 (3), 213–25.
and Mixed Feelings,” Organizational Behavior and Human Roster, Catherine A., Lorenzo Lucianetti, and Gerald Albaum
Decision Processes, 51 (2), 296–312. (2015), “Exploring Slider vs. Categorical Response Formats
Lembregts, Christophe and Mario Pandelaere (2013), “Falling in Web-Based Surveys,” Journal of Research Practice, 11
Back on Numbers: Quantitative Specifications as Sources of (1), 1–16.
Control,” paper presented at the 2013 Society for Consumer Schley, Dan R., Christophe Lembregts, and Ellen Peters (2017),
Psychology Annual Winter Conference, San Antonio, TX. “The Role of Evaluation Mode on the Unit Effect,” Journal
Lipsey, Mark W. and David B. Wilson (2001), Practical Meta- of Consumer Psychology, 27 (2), 278–86.
Analysis, Vol. 49, Thousand Oaks, CA: Sage. Soman, Dilip (2003), “The Effect of Payment Transparency on
Markman, Keith D. and Corey L. Guenther (2007), Consumption: Quasi-Experiments from the Field,”