Professional Documents
Culture Documents
Alemu 2018
Alemu 2018
Alemu 2018
1–34
doi:10.1093/erae/jby009
Abstract
In this paper, we test whether a Repeated Opt-Out Reminder (ROOR) can mitigate
hypothetical bias in stated discrete choice experiments (DCE). The data originate
from a field experiment concerning consumer preferences for a novel food product
made from cricket flour. Utilising a between-subject design with three treatments, we
find significantly higher marginal willingness-to-pay values in hypothetical than in
nonhypothetical settings, confirming the presence of hypothetical bias. Comparing
this to a hypothetical setting where the ROOR is introduced, we find that the ROOR
effectively mitigates hypothetical bias for one attribute and significantly reduces it
for the rest of the attributes.
1. Introduction
Since its advent in the early 1980s in marketing research, the stated discrete
choice experiment (DCE) method has been used extensively in the environ-
mental, agricultural, food, transport, health and marketing sectors to elicit the
value of goods and services. However, critics have continually questioned
the validity and the reliability of the results obtained from hypothetical
experiments including stated DCE. If results from hypothetical experiments
are to be useful for testing theories or to guide policy decisions, such
approaches should yield reliable results. Nevertheless, this seems to not be
generally the case (List and Gallet, 2001; Lusk and Schroeder, 2004; Ding,
Grewal and Liechty, 2005; Murphy et al., 2015a, 2015b), and one of the
© Oxford University Press and Foundation for the European Review of Agricultural Economics 2018;
all rights reserved. For permissions, please e-mail: journals.permissions@oup.com
Downloaded from https://academic.oup.com/erae/advance-article-abstract/doi/10.1093/erae/jby009/4973656
by University of Findlay user
on 22 August 2018
2 M. H. Alemu and S. B. Olsen
recurring issues tied to the stated DCE and stated preference methods in gen-
eral is the issue of hypothetical bias. Due to the hypothetical nature of the
choice questions asked, respondents may for various reasons overstate their
Willingness-To-Pay (WTP) since their answers have no real consequences in
terms of required payment for the good or service in question (e.g. Fox et al.,
1996; Lusk and Schroeder, 2004; Harrison, 2006; Murphy et al., 2015a,
2015b; Vossler, Doyon and Rondeau, 2012; Vossler and Watson, 2013).
There is a rich body of literature investigating hypothetical bias within the
areas of environmental valuations employing the contingent valuation (CV)
method to elicit the value of public goods (e.g. Carson et al., 1996; List and
Shogren, 1998; Cummings and Taylor, 1999; Carlsson and Martinsson,
2001; List and Gallet, 2001; List, 2001; Little and Berrens, 2004; Murphy
et al., 2015a, 2015b; Blumenschein et al., 2008). This issue is somewhat less
explored in the DCE literature even though attention to hypothetical bias has
increased in recent years. Given that people may behave differently depend-
ing on the type of the good and the experimental setup (Carson et al., 1996;
Carson and Groves, 2007), the conclusions from external validity tests on
CV studies based on public goods cannot be transferred directly to the DCE
context nor to a private goods context. Hence, similar tests should be applied
in DCE applications to thoroughly examine their validity. This is of particular
importance as DCEs are extensively being used to assess demand for new
products to guide policy development. In this regard, recent studies (e.g.
Lusk and Schroeder, 2004; Ding, Grewal and Liechty, 2005; Chang, Lusk
and Norwood, 2009; Collins and Vossler, 2009; Loomis et al., 2009;
Chowdhury et al., 2011; Vossler, Doyon and Rondeau, 2012; De-Magistris,
Gracia and Nayga, 2013; Grebitus, Lusk and Nayga, 2013; Moser, Raffaelli
and Notaro, 2014; Vossler and Watson, 2013) have examined the validity of
DCEs empirically as well as theoretically. Most of these studies suggest that
DCEs are indeed susceptible to hypothetical bias, even when they are used to
value marketable private goods.
Some investigations attempt to reduce hypothetical bias and increase
explanatory power of empirical models by combining data from stated choice
with reveal preference studies (e.g. Adamowicz, Louviere and Williams,
1994; Azevedo, Herriges and Kling, 2003; Brooks and Lusk, 2010). Others
focus on increasing the realism of stated DCE studies by either employing
hypothetical bias mitigation strategies (HBMS) (see Loomis, 2011, 2014 for
a review) or by shifting to nonhypothetical choice experiments in order to
reduce or completely eliminate hypothetical bias. Employing the latter, how-
ever, can be difficult since in addition to being costly and time consuming,
the products of interest may not be available (Fox et al. 1996; De-Magistris,
Gracia and Nayga, 2013). Therefore, HBMS may serve as important tools to
remove hypothetical bias, and one such strategy, which has been often used
in the literature, is the Cheap Talk (CT) script (Cummings and Taylor, 1999).
However, the existing evidence with respect to the effectiveness of the CT
script is mixed. While some have found that the CT script effectively miti-
gates hypothetical bias (Lusk, 2003; Carlsson, Frykblom and Lagerkvist,
2005; Murphy et al., 2015a, 2015b; List, Sinha and Taylor, 2006;
Chowdhury et al., 2011; Tonsor and Shupp, 2011), others have found it to be
effective only for certain sub-groups of respondents (List, 2001; Lusk, 2003;
Aadland and Caplan, 2003, 2006; Barrage and Lee, 2010; Ami et al., 2011).
Furthermore, the findings of Brown, Ajzen and Hrubes (2003) suggest that
the CT is bid range sensitive, and Nayga, Woodward and Aiew (2006),
Brummett, Nayga and Wu (2007) and Blumenschein et al. (2008) all found
that the CT did not have any influence on the WTP values.
A number of studies have tested the credibility of the CT script in DCE set-
tings and their results generally show that the CT is ineffective in removing the
hypothetical bias (List, Sinha and Taylor, 2006; Özdemir, Johnson and Hauber,
2009; Yue and Tong, 2009; Carlsson, García and Löfgren, 2010; Bosworth and
Taylor, 2012; Moser, Raffaelli and Notaro, 2014). This might be explained by
the fact that the CT script was originally developed for CV studies, and applying
it to the DCEs may not be appropriate as there are contextual and structural dif-
ferences between the two valuation approaches (Ladenburg and Olsen, 2014),
thus, responses may differ markedly (List and Shogren, 1998; Carson and
Groves, 2007). Given these inconsistencies surrounding the effectiveness of the
CT script, researchers have proposed other HBMS. For instance, Jacquemet
et al. (2013) implemented a Solemn Oath script, and De-Magistris, Gracia and
Nayga (2013) tested an Honesty priming approach. Both approaches were found
to be effective in removing hypothetical bias. Other researchers have instead
relied on calibrating hypothetical values using respondents’ stated certainty in
choice (Blumenschein et al., 2008).
More recently, along the same lines, Ladenburg and Olsen (2014) sug-
gested augmenting the CT script with an Opt-Out Reminder (OOR), which
reminds respondents that if the prices of the experimentally designed alterna-
tives are greater than what their household will pay, they should choose the
opt-out alternative. In a split sample setup, they found that when the CT was
applied with the OOR added, WTP estimates were significantly reduced com-
pared to using the CT without the OOR. Varela et al. (2014) also tested the
impact of presenting the CT with the OOR and contrary to Ladenburg and
Olsen (2014), the OOR was not found to influence WTP. A possible explan-
ation might be that Ladenburg and Olsen (2014) repeated the OOR before
each single choice set whereas Varela et al. (2014) only presented it once in
the scenario description. This seems to support Ladenburg and Olsen (2014)
who speculate that, given the repeated choice nature of DCE, it may be of
particular importance to repeat the reminder since respondents might other-
wise forget about the reminder as they progress through the choice tasks.
This article contributes to these developments in the literature in terms of
extending the work by Ladenburg and Olsen (2014). The aim is to properly
test the effectiveness of including a Repeated OOR (ROOR) in stated DCEs
by directly testing its ability to mitigate hypothetical bias. Ladenburg and
Olsen (2014) implemented the ROOR in conjunction with a CT script. In this
study, we include only the ROOR to avoid confounding the effects of the
ROOR and the CT. This allows for a more rigorous test of the pure effect of
2. Background
2.1. Edible insects as a sustainable source of food
The world population is projected to increase to 9 billion in 2050 (UN, 2015).
Faced with climate change and environmental sustainability concerns, the current
food production systems may not be able to meet the growing demand for food
which is estimated to rise by 60 per cent in 2050 (FAO, 2009, 2013). In this
respect, it is evident that the number of malnourished people in developing coun-
tries is (still) alarmingly high, although some nutritional improvements have been
observed in recent years (Braun, 2007). Specifically, despite a globally positive
change in the supply of food energy, total protein and animal protein in the past
few decades, a decreasing trend has been observed in many developing regions
(Pellet and Ghosh, 2004), especially in several sub-Saharan African (SSA) coun-
tries where food security has been and still is one of the biggest societal chal-
lenges. Increasing livestock production is one option to tackle the shortage of
animal protein but it contributes significantly to the emission of greenhouse gas-
ses (Steinfeld et al., 2006) while also requiring substantial inputs of water and
crop-based feed. Both these required production inputs are expected to decrease
in availability in SSA with future global warming. Increasing food security thus
requires that either the productivity of the current agricultural sector increases e.g.
through new technological developments or alternative sources of food that
increase food security in a sustainable manner must be identified through research
and development (Boland et al., 2013; van Huis, 2013).
In line with this, the concept of producing edible insects for food has
received substantially increasing attention recently (FAO, 2013). Insects can
be a good source of important nutrients for human consumption (Bukkens,
1997; Christensen et al., 2006). As shown by Oonincx et al. (2010) and
Halloran et al. (2017), edible insects can be considered environmentally
friendly as they emit significantly lower amounts of greenhouse gasses than
livestock production. Despite insects being traditionally consumed as food in
large parts of Africa, Asia and Latin America, and historically all over the
world, the many different aspects of eating insects, which has earlier been
labelled with the collective term ‘entomophagy’ (Evans et al., 2015), have
received rather limited attention in research (van Huis, 2013). Recently, how-
ever, the literature investigating e.g. food safety, nutritional value, consumer
attitude and legislative aspects of producing insects for food has begun to
increase very rapidly (e.g. Finke, 2013; van Huis, 2013; Looy, Dunkel and
Wood, 2014; Ruby, Rozin and Chan, 2015; Alemu et al., 2017b). Most of
the recent research focuses on such supply side issues. However, the demand
side, i.e. consumers’ preferences for insects as food, will ultimately be
equally important for a successful introduction of edible insects as a new and
substantial component of the future food production sector. Therefore, gath-
ering information about consumers’ preferences is necessary since decisions
concerning establishment of new insect production sectors as well as regula-
tory, standardisation and quality control schemes are likely to be suboptimal
if knowledge about potential demand in the market is not accounted for.
theory (Lancaster, 1966; McFadden, 1986) most commonly used, and they
rely on asking respondents to make choices in a manner that mimics their
choices in actual purchasing situations. This makes them more familiar to
consumers than experimental auctions (Alfnes et al., 2006). The application
of DCEs to elicit values of goods and services is currently gaining popularity
in developing countries (Bennett and Birol, 2010; Birol et al., 2015; Gibson
et al., 2016). Notwithstanding this, most DCE studies, particularly consumer
valuation of food attributes based on nonhypothetical DCEs have been con-
ducted in developed countries (Alphonce and Alfnes, 2017). This is related
to the challenges in conducting economic valuation studies in developing
countries. Durand-Morat, Wailes and Nayga (2016) and Whittington (1998,
2002) highlight that the ability of respondents to follow and process informa-
tion, the competences of local personnel, unavailability of proper population
sampling frames, limited coverage of infrastructures, and poor design and
implementation of survey scenarios are the main challenges. These issues
need to be considered when conducting DCE valuation studies in developing
countries. In this study, we exerted a great effort to obtain high-quality data
by directly addressing these challenges in the survey design. The problem of
hypothetical bias is a concern in any DCE, though it has not yet been thor-
oughly explored in developing country settings. To the authors’ knowledge,
only two studies addressing this issue are available in the DCE literature so
far. Aiming to assess the value of bottled mineral water, Ehmke, Lusk and
List (2008) found that hypothetical bias differs across locations and it is lar-
ger in developed than in developing countries. The authors attributed this dis-
parity to cultural differences. Chowdhury et al. (2011) found that Ugandan
consumers’ WTP for biofortified foods are significantly higher in hypothet-
ical treatments than in real treatments indicating the presence of hypothetical
bias. In the current study, we aim to contribute to the sparse DCE literature
in developing countries especially concerning hypothetical bias using data
from consumer valuation of novel food products in Kenya.
believe that their responses have real consequences in real situations (conse-
quentiality) (e.g. Carson and Groves, 2007), (ii) using solemn oath and honesty
priming approaches through which respondents are asked to swear to respond
honestly to the study questions (e.g. De-Magistris, Gracia and Nayga, 2013;
Jacquemet et al., 2013), (iii) using inferred valuation by asking respondents to
state what they think other people would be willing to pay for the good in
question (e.g. Lusk and Norwood, 2009), (iv) using the theory of cognitive dis-
sonance to enable respondents to act in a trustworthy and rational manner (e.g.
Alfnes, Yue and Jensen, 2010) and, finally (v) cheap talk (e.g. List, Sinha and
Taylor, 2006). Cheap talk, which is probably the most commonly used HBMS,
simply explains to respondents what hypothetical bias is. It was first applied by
Cummings and Taylor (1999) and other researchers followed suit (e.g. Lusk,
2003; Carlsson, Frykblom and Lagerkvist, 2005; Chowdhury et al., 2011;
Tonsor and Shupp, 2011). The ex-post strategies are used to mitigate hypothet-
ical bias after the completion of the survey. One can find a detailed account of
such approaches in Loomis (2014). The most common approach is to make
use of follow-up questions on response uncertainty to adjust for hypothetical
bias. Blumenschein et al. (2008) and Champ et al. (2009) showed that recod-
ing uncertain responses mitigated the hypothetical bias in their respective stud-
ies. In a DCE set up, Ready, Champ and Lawton (2010) similarly found that
calibrating choices using the uncertainty level of respondents resulted in hypo-
thetical bias being mitigated. In this study, we focus on testing a recently pro-
posed new ex-ante HMBS, namely the ROOR which until now has not been
properly assessed for effectiveness.
Kinyuru, Kenji and Njoroge (2009). Serving as attribute levels in the DCE, the
amounts of CF were varied at 0 per cent, 5 per cent and 10 per cent of the total
flour. The product development process, the recipe formulation and the charac-
teristics of the final bun products used for the field experiment are presented in
Appendix A.
Nonhypothetical DCEs involving new products require development of all
the product combinations with all the relevant attributes. This is, however, typ-
ically constrained by budget and other logistic complications. For this study, we
identified three attributes that would enable us to actually develop and produce
all possible product combinations for the DCE. The first attribute represents the
insect component in terms of the amount of CF in the buns (CF). The second
attribute describes whether a portion of the wheat flour is fortified or not
(Fortified), while the third attribute is the price of a bag of buns (Price). Insects
like crickets may be consumed in their whole form or ground into flour.
Besides representing the more realistic market entry strategy, we focus on the
flour form because low protein foods such as maize and sorghum, which are
popular in east Africa (Stevens and Winter-Nelson, 2008), can be easily
enhanced with CF to increase their nutritional value. In addition, CF is already
being produced on a rather large scale elsewhere in the world, hence, a similar
production would seem achievable in Kenya.
The Fortified attribute was included to assess Kenyan consumers’ prefer-
ences and WTP for fortified foods. The level for the fortified wheat flour was
arbitrarily set to be 40 per cent of the total flour used to bake the different
buns. Consumers in Kenya are familiar with food fortification because the
Kenyan government through the Kenya National Food Fortification Alliance
(KNFFA) has introduced a food fortification programme to reduce micronu-
trient deficiency. This programme covers foods including maize and wheat
(KNFFA, 2011). A secondary purpose with including the fortified flour attri-
bute was an attempt to (at least to some extent) separate out the pure effect of
nutritional improvements. In other words, we wanted to give participants the
option to choose buns that were nutritionally better than the standard alterna-
tive without necessarily forcing them to choose cricket flour. This way, if
participants strongly rejected the CF, they might still exhibit preferences for
better nutrition, thus avoiding confounding preferences for nutrition, which
we expected to be positive, with preferences for other aspects of eating
insects which might be negative, e.g. if respondents would find the idea of
eating insects disgusting (Looy, Dunkel and Wood, 2014).
The attributes and their levels are presented in Table 1. The amount of CF
has three levels: 0 g (Standard with no CF), 6.25 g (Medium CF) and 12.5 g
(High CF). This is equivalent to 0 per cent, 5 per cent and 10 per cent of the
total wheat flour used per bag of buns. The Fortified attribute has two levels
at 0 and 50 g. The total amount of flour is kept constant at 125 g per bag.
Thus, when CF and/or fortified wheat flour is added, the amount of wheat
flour is reduced accordingly. Besides the quality attributes, a Price attribute
is included with six levels. An initial investigation of actual market prices of
buns showed that a single bun would typically cost around 10 Kenyan
Variable
Attribute description Level name Coding
Shillings1 (KShs), rarely less than KShs 5. Since focus group discussions
indicated that respondents would not respond negatively to the other attri-
butes, it was thus decided to set the minimum price for a bag with three buns
at KShs 20. Focus group discussions further suggested that a suitable choke
price for the price range would be KShs 90, and the six price levels were sub-
sequently set at KShs 20, 25, 30, 40, 60 and 90.
Once the attributes and their levels had been determined, the DCE design was
produced. It was subjected to standard efficient design procedures in line with
state-of-the-art experimental design theory (Louviere, Hensher and Swait, 2000;
Ferrini and Scarpa, 2007; Scarpa and Rose, 2008; ChoiceMetrics, 2012). Three
alternatives are included in the design: two of them are deigned ones and the last
one is a specific alternative which is constant across choice sets. The three attri-
butes, i.e. the amount of CF, Fortified and Price, have three, two and six levels,
respectively. Based on this, a full factorial design results in 36 alternatives. To
avoid exceeding the cognitive capacity of respondents, a fractional factorial
design was employed. The design procedure involved two stages. First, we pro-
duced a design using zero priors and conducted two rounds of pilot studies with
42 respondents. Second, the data from the pilot studies were estimated to obtain
priors in terms of attribute parameter estimates. The priors were subsequently
used to inform and update the D-efficient fractional factorial design which was
generated using the Ngene software (ChoiceMetrics, 2012). The design pro-
duced 12 choice sets which were presented to participants. Figure 1 shows an
example of one of the choice sets used.
As is evident from Figure 1, we use a forced choice type of DCE by includ-
ing an alternative with a standard bag of buns which is constant across all
choice sets. This is motivated by the fact that we are dealing with a staple con-
venience consumer product, namely bread (Cova and Pace, 2006). Even though
the context of shopping for bread is a familiar everyday experience for most
consumers, the main attribute of interest in our survey, namely the addition of
cricket flour, is adding complexity and difficulty to the choice tasks. In this
case, including a no-buy option may prompt respondents to use it as a simplify-
ing strategy without actually making the required trade-offs to reveal their true
responses (see von Haefen, Massey and Adamowicz, 2005; Beshears et al.,
2008; Burton and Rigby, 2008; Boxall, Adamowicz and Moon, 2009; Rigby,
Alcon and Burton, 2010). Hence, we chose the forced choice setting to reduce
the risk of respondents resorting to such heuristics. While it is strictly speaking
always possible for consumers in the real world to not buy a specific product,
we argue that the majority of consumers in our case would in fact not consider
no-buy an option when shopping for bread in the real world. Focus group inter-
views supported this, and during the personal interviews, no respondents com-
plained about being forced to buy bread. Furthermore, while it may be more
common in food DCE studies to include a no-buy option, it is not uncommon
to use forced choice settings (e.g. Carlsson, Frykblom and Lagerkvist, 2007a,
2007b; Kallas et al., 2013).
2 As pointed out by a reviewer, consumers’ WTP for the novel insect-based food products could
be different in real world situations where they would have to pay directly from their own pock-
ets and not by some endowment offered by an interviewer. If the endowment is perceived as
windfall money, the REAL treatment may not be as incentive compatible as desired, and the
WTP estimates obtained in this sample may consequently be an overstatement of the ‘true’
WTP. We have not tested this in the present study as it was clear from the early stages of survey
planning, that asking respondents to bring money to the interview would be problematic from
an ethical point of view since the survey is conducted in relatively poor areas of Kenya.
3 We have not controlled for participants having or not having eaten bread at home before the
experiment. However, we tried to control for the hunger level of the participants and the taste
of previous consumed foods at breakfast or lunch time by placing the interviews between 10.00
and 12.00 in the morning and 14.00 and 16.00 in the afternoon.
4 This information was provided before consumers were subject to the DCE to mirror real market
situations in that consumers often process a range of information before they make purchase
decisions. However, we have not tested the effects of information provision on consumers’
responses as it falls beyond the scope of this paper.
buns they had chosen in the binding choice set and they had to pay the price
of that bag.
4. Econometric specification
The random utility theory (RUT) (McFadden, 1986) serves as a theoretical
framework to specify utility expressions in DCEs. In what follows, we follow
Train and Weeks (2005). According to RUT, the utility of individual n choosing
alternative k among J given alternatives in choice situation t can be expressed as
Unkt = − αn m nkt + βn′ xnkt + ϵnkt . (1 )
Equation (4) is called utility in WTP space. Train and Weeks (2005) men-
tioned that equations (3) and (4) are behaviourally equivalent. Nevertheless,
one can see, and as mentioned before, WTP calculation based on the prefer-
ence space specification is directly impacted by the distribution of λn and cn ,
and, in some cases, like normal distributions, the moments of the WTP distri-
bution may be undefined and furthermore the WTP distribution may suffer
from a fat tail problem causing identification problems.
Heterogeneity in preferences can be accounted for by specifying mixed
logit models, which can be derived in a number of different ways (Train,
2003). The most common specification is a random parameter logit (RPL)
model5 in which attribute parameters are treated as random parameters. The
random parameters may in principle follow any relevant distribution, but
often a normal distribution is used for quality attribute parameters and a log-
normal is used for the negative of the price attribute parameter (Train, 2003;
Train and Sonnier, 2005). We employ the RPL model as focus group inter-
views had indicated preference heterogeneity across participants. In line with
Scarpa, Thiene and Train (2008), we assume that λn = −exp (υn ), where υn is
the latent random factor of the price coefficient. In addition, let β represent
all the random parameters estimated in the WTP space model, then equa-
tion (4) can be rewritten as
where Vnk is the indirect utility function, which is a function of the attributes
of the alternatives, i.e. Vnk = βn′ Xnkt . The (negative of the) coefficient on price
(λn ) is assumed to follow a lognormal distribution in order to ensure a strictly
positive impact of income on utility. The focus group discussions and pilot
tests indicated that some people would have negative preferences for the
non-price attributes of the products while others would have positive. As a
result, the distributions of the coefficients for these attributes are assumed to
follow a normal distribution. We allow the scale, which is accommodated
through λn , to vary across decision makers by specifying all random para-
meters to be correlated with each other, i.e. no restrictions are placed on the
variance–covariance matrix. As the estimated parameters, β′, vary over indi-
viduals according to the specified distributions with density f (β ) (Train,
2003), the probability that an individual n chooses alternative k among J
given alternatives in choice situation t can be derived as
⎛ βn′ Xnkt ⎞
⎜ e ⎟ f (β ) d β .
Pnk = ∫ ⎜ ∑J e β′xntj ⎟
(6 )
⎝ j ⎠
5 This model can be extended to accommodate an error component representing potential correl-
ation between designed alternatives as proposed by Scarpa, Ferrini and Willis (2005), Scarpa,
Spalatro and Canavari (2007a) and Scarpa, Willis and Acutt (2007b). However, given that the
status quo option in the current experiment is defined as a standard alternative which in many
aspects is similar to the experimentally designed alternatives, it was decided to use the fairly
standard RPL model.
In line with, e.g. Thiene and Scarpa (2009) and Hole and Kolstad (2012),
equation (6), which is based on the WTP space specification, is estimated
using the MSL procedure by maximising the full simulated log-likelihood
over the sequence of choices and over the sample. We estimate the models
using the BIOGEME 2.2 software (Bierlaire, 2003), maximising the simu-
lated likelihood function with the CFSQP algorithm (Lawrence, Zhou and
Tits, 1997) to avoid the local optima problem (Scarpa, Thiene and Train,
2008). For the simulation procedure, 3,000 Modified Latin Hypercube
Sampling draws (Hess, Train and Polak, 2006) were used since this amount
of draws was found to produce stable parameter estimates.
In line with, e.g. Lusk and Schroeder (2004), Tonsor and Shupp (2011),
De-Magistris, Gracia, Nayga (2013) and Moser, Raffaelli and Notaro (2014),
we use the complete combinatorial method (Poe, Giraud and Loomis, 2005)
to test the different hypotheses outlined above. For this purpose, 1,000 WTP
estimates are bootstrapped for each attribute in each sample based on the
Krinsky and Robb (1986) method using 1,000 draws from multivariate nor-
mal distributions of the coefficient estimates and the variance–covariance
matrix of the random parameters. Differences between the WTP estimates
can also be tested by pooling data for the two treatments in a specific hypo-
thesis test as in De-Magistris, Gracia and Nayga (2013) and Bazzani et al.
(2017). One can then specify an extended utility function by interacting dum-
my variables representing treatment effects with attribute levels. Accordingly,
equation (4) can be extended as
Unkt = − λ n [m nkt + wn′xnkt + ωn (xnkt ∗ dtreatment )] + εnkt (7 )
Two dummy variables are created since there are three treatments. They
are denoted by dtreatment taking a value of 1 for the second treatment in a spe-
cific hypothesis test, 0 otherwise. Differences in WTP estimates between the
two treatments will be captured directly by the coefficient ωn, and its sign
will indicate the direction of the treatment effect.
5. Results
The summary of sample socio-demographics presented in Table 2 shows no
differences across treatments. Except the level of education between the
ROOR and REAL treatments, chi-square tests failed to reject that the socio-
demographic distributions differ across treatments, suggesting that partici-
pants in the three treatments are sufficiently similar in their characteristics to
rule this potential factor out as cause of any preference differences found.
As a first step in investigating possible preference differences across sample
splits, Table 3 provides a comparison of the choice distributions. Interestingly,
while the Standard buns were chosen in 14 per cent of the choices in the
HYPO treatment, they were chosen in 27 per cent and 31 per cent of the
choices in the ROOR and REAL treatments, respectively. These results are in
line with Lusk and Schroeder (2004) and Bazzani et al. (2017) who also
x2-tests, p-values
HYPO ROOR REAL
Variable (a) (b) (c) a vs. b a vs. c b vs. c
Age
18–34 years 38.5 45.9 40.5
35–54 years 55.0 41.3 47.4 0.107 0.431 0.665
55–64 years 5.5 8.3 9.5
Above 64 years 1 4.5 2.6
Household size
1–4 persons 56.9 48.6 53.4
More than four persons 43.1 51.4 46.6 0.531 0.823 0.782
Gender
Female 53.2 52.3 50.1 0.999 0.827 0.935
Male 46.8 47.7 49.9
Region
Rural 60.1 60.1 64 0.999 0.716 0.716
Urban 39.9 39.9 36
Monthly income in KShs
<15,000 66 63 58 0.658 0.315 0.697
15,000–50,000 25 30 37
>50,000 9 7 5
Education
Primary school 31.2 43.1 27.6
Secondary school 36.7 31.2 28.5 0.152 0.178 0.009
Tertiary level 11 14.7 15.5
University education 6.4 2.8 3.5
Othera 14.7 8.2 24.6
Number of participants 109 109 116
a
No education, and drop outs from primary and secondary schools.
x2-test, p-value
HYPO ROOR REAL
Alternative (1) % (2) % (3) % (1) vs. (2) (1) vs. (3) (2) vs. (3)
found participants to choose the opt-out option more frequently in a real set-
ting than in a hypothetical setting, though they used a ‘none of these’ opt-out.
The chi-square tests confirm that the choice frequencies differ significantly for
HYPO versus ROOR and HYPO versus REAL. However, there seems to be
6 The estimates of the Cholesky matrices, which were obtained from the finite difference hessian,
are reported in Appendix C.
than the HYPO treatment. Hence, we can also confirm hypothesis H3. For
the High CF attribute, the WTP obtained in the ROOR treatment is though
still significantly higher than in the REAL treatment, while for the Medium
CF, the difference is not significant, and for the Fortified attribute, the differ-
ence is only borderline significant. The ASC estimates are not significantly
different, but the fact that the ASC estimate is still significantly lower than
zero in the ROOR indicates that the negative status quo effect is not com-
pletely removed. Hence, the slightly stronger hypothesis H4 can only be
partly confirmed since hypothetical bias, despite being significantly reduced,
is not completely removed for all attributes. Notwithstanding this, presenting
the ROOR before each choice set appears to be highly effective, removing
87–98 per cent of the hypothetical bias in absolute terms across the range of
attributes as well as for the negative opt-out effect. All the above results
regarding hypothetical bias and the impact of the ROOR in reducing it are
graphically described in Figure 2. Probability density functions of the mean
marginal WTPs are plotted for each treatment using 10,000 draws from
multivariate normal distributions of the WTPs. The densities representing the
WTP distributions in REAL and ROOR treatments are tighter than those in
the HYPO treatment, confirming the results presented above.
An excerpt of the results from running an extended utility function model
on the pooled data is presented in Table 5 (see Appendix D for the full
results). The estimates pertaining to the extended part capturing the treatment
effects directly, i.e. the vector w in equation (7), generally confirm the find-
ings of Table 4. The signs of the estimated coefficients show negative effects
of the second treatments on WTP estimates in a specific hypothesis test. This
is in line with our alternative hypotheses.
6. Conclusions
Eliciting the value of goods based on approaches that reflect the real behav-
iour of decision makers is a central element of valuation studies be it for non-
market or market applications. Stated DCE approaches are widely used to
elicit the value of goods and their attributes, e.g. in order to predict demand
for new products. How to reduce or eliminate hypothetical bias remains an
important challenge for DCE researchers. Nonhypothetical DCEs are obvi-
ously the preferred option; however, in many cases, product unavailability
and the associated high costs of implementation impair researchers from rely-
ing on such approaches. As a result, hypothetical DCEs are still used exten-
sively. A range of different strategies, e.g. consequential survey questions,
honesty priming, inferred valuation, cognitive dissonance and solemn oath
scripts, have been suggested as amendments to hypothetical DCEs, aiming to
reduce or eliminate hypothetical bias (Carson and Groves, 2007; Lusk and
Norwood, 2009; Jacquemet et al., 2013; Alfnes, Yue and Jensen, 2010; De-
Magistris, Gracia and Nayga, 2013).
Inspired by previous studies by Ladenburg and Olsen (2014) and Varela
et al. (2014), we extend this line of research by investigating the effectiveness
0.030
0.004
REAL REAL
0.015
ROOR ROOR
HYPO HYPO
0.000
0.000
Fortified buns
Probability density function
0.12
0.08
REAL
ROOR
0.04
HYPO
0.00
one hypothetical with ROOR and one nonhypothetical incentivised DCE – are
tested in a between-subject design aimed at determining, firstly, the magnitude
of hypothetical bias and, secondly, the ability of the ROOR to remove it.
The results confirm that the mean marginal WTP values based on the
simple hypothetical DCE suffer severely from hypothetical bias with esti-
mates being up to several hundred percentages higher than the WTP esti-
mates obtained in the nonhypothetical treatment. These results are generally
in line with Ehmke, Lusk and List (2008) and Chowdhury et al. (2011)
which are the only studies where hypothetical bias has been investigated in
developing countries. The results further indicate that the ROOR consider-
ably reduces or even eliminates the hypothetical bias while also being cap-
able of reducing or removing negative opt-out effects. The results based on
the ROOR are considerably closer to the results based on the nonhypothetical
sample than the results based on the hypothetical sample without the ROOR.
The ROOR eliminates hypothetical bias for one attribute while strongly redu-
cing, though not completely eliminating it for others, highlighting the promis-
ing potential of introducing it in DCEs.
The results concerning the presence of hypothetical bias presented in this
paper are in contrast with Carlsson and Martinsson (2001), Cameron et al.
(2002), Lusk and Schroeder (2004) and Alfnes and Steine (2005) where the
hypothesis of equal marginal WTP values in real and hypothetical settings
could not be rejected. However, other authors reported results which are in
agreement with our results (see Johansson-Stenman and Svedsäter, 2008;
Chowdhury et al., 2011; Moser, Raffaelli and Notaro, 2014). It should be noted
that direct comparison of our results with these studies may not be appropriate
due to reasons associated with the nature of the goods being valued, the sam-
pling design and the nature of the experimental settings (Harrison and List,
2004; Harrison, 2006; Johansson-Stenman and Svedsäter, 2008). Both Carlsson
and Martinsson (2001) and Alfnes and Steine (2005) used a within-subject
design and they reported that there are no differences in marginal WTP values
between hypothetical and real treatments. In this study, we used a between-
subject design and find significant differences in the results in keeping with,
e.g. Johansson-Stenman and Svedsäter (2008) and Moser, Raffaelli and Notaro
(2014). Johansson-Stenman and Svedsäter (2008) found that hypothetical bias
is larger in a between-subject design than in a within-subject design, and they
concluded that it should be tested using the former type of design. However,
List and Gallet (2001) did not find any significant differences in hypothetical
bias when comparing between-subject and within-subject designs in a meta-
analysis. Considering the nature of the experimental setting, as opposed to our
field study, the similar food choice experiments by Lusk and Schroede (2004)
and Alfnes and Steine (2005) were conducted in laboratory settings. Since they
did not find substantial hypothetical bias, it is tempting to suggest that ex-ante
hypothetical mitigation measures are more important in field settings than in lab
settings. However, List and Gallet (2001) did not find any significant differ-
ences between lab and field settings in this regard. They did however find hypo-
thetical bias to generally be higher for public goods than private goods, which
was further confirmed by e.g. Alfnes and Steine (2005) and Johansson-Stenman
and Svedsäter (2012). Importantly, this does not mean that hypothetical bias
concerns can be disregarded when evaluating private goods. Our results
strongly underline this, as we find substantial hypothetical bias in a private
good context. This suggests that DCE analysts should consider employing
some kind of ex-ante (or ex-post) hypothetical bias mitigation measures regard-
less of the type of good. Providing economic incentives as we do in our REAL
treatment is of course the preferable way to avoid hypothetical bias in a private
good context as it clearly enhances valuation scenario consequentiality – in line
with the recommendations in Johnston et al. (2017). However, in most DCE
cases, e.g. when assessing private goods incorporating new product attributes
that are not yet available or when assessing public goods, such economic incen-
tivisation is not a possibility, and the analyst must consider other options. We
show that the ROOR is worth considering, as it seems to encourage truthful
responses, or at least responses that are quite well aligned with those obtained
in the ideal economically incentivised setting. We can of course only conjecture
that this result is transferable to public good settings where economic incentiv-
isation is very rarely possible.
Our results differ from Ladenburg and Olsen (2014) and Varela et al. (2014)
in four important ways. First, their studies considered preferences for public
goods whereas the current study considers a private good context in terms of
food choice. Second, these studies did not establish the extent to which hypo-
thetical bias existed, which may be seen as a major limitation since it essentially
precluded the authors from making any conclusions concerning the effective-
ness of the ROOR in terms of mitigating hypothetical bias. Our experimental
setup allows us to conclude that hypothetical bias is indeed present and of a
non-negligible magnitude. Third, while the ROOR did not influence responses
in the case of Varela et al. (2014), it significantly reduced the marginal WTP
estimates of only some of the attributes in Ladenburg and Olsen (2014).
Contrary to this, the results of the current study reveal that the ROOR signifi-
cantly reduces the hypothetical bias for all attributes and even completely elimi-
nates it for one attribute. Fourth, they implemented the ROOR together with a
CT script, which confounded the effects of the ROOR and the CT. Thus, com-
pared to these, our study provides a more strict experimental setup allowing a
much more rigorous test of the ROOR in isolation. It also provides a clearer
result in terms of the impact of the ROOR on hypothetical bias.
The results reported in this study have particular relevance to stated DCE
since framing them with the ROOR does not pose further challenges in terms
of monetary cost, product availability and or logistic burden that otherwise
might be the case in nonhypothetical DCEs. In sum, the main results of this
study show that not only financial incentives but also ROOR can lead to
plausible behavioural manifestations. Therefore, in situations where it is not
possible to employ nonhypothetical DCEs, framing hypothetical DCEs with
a ROOR would seem to be a promising strategy.
Our results, however, provide only first-hand evidence and are not conclu-
sive by themselves. Therefore, future studies are important in order to
replicate our results and provide further empirical evidence in relation to the
ability of the ROOR to improve the external validity of stated DCEs. In this
regard, several areas of research can be identified for further research. First,
future studies may consider conducting research in different experimental set-
tings and with different types of goods so as to validate whether the effect of
ROOR is context-specific or not. This includes investigating whether the
ROOR leads to similar results to the ones reported in this study in situations
where participants do not experience the attributes. Second, the actual word-
ing of the ROOR is not tested here. Future studies may test other wordings
of the ROOR, e.g. by focusing on the designed alternatives rather than on the
opt-out alternative. Third, further research could be testing whether the
ROOR leads to the same results as the results from the current study when it
is applied to experimental designs that include a no-choice alternative rather
than a forced choice. Fourth, unlike De-Magistris, Gracia and Nayga (2013),
we only considered the ROOR without including other ex-ante hypothetical
bias mitigation strategies to avoid a potential confounding effect. It can be
argued that examining the effectiveness of the ROOR as compared to other
strategies such as CT is necessary to draw a more general conclusion. This
may be accomplished using an experimental design that includes a wide
range of treatments with different combinations of the various ex-ante strat-
egies for mitigating hypothetical bias.
Acknowledgements
We thank the management of the Department of Food Science and
Technology at the Jomo Kenyatta University of Agriculture and Technology
for giving us the opportunity to bake the bun products at the Food Processing
Workshop Unit. Finally, we thank seminar participants at the university of
Copenhagen and University of Stirling for comments and suggestions. The
usual disclaimer applies for any remaining errors.
Funding
This research is part of the GREEiNSECT (13-06KU) project which was funded
by the Danish International Development Agency (DANIDA), Ministry of
Foreign Affairs of Denmark.
References
Aadland, D. and Caplan, A. D. (2003). Willingness to pay for curbside recycling with
detection and mitigation of hypothetical bias. American Journal of Agricultural
Economics 85: 492–502.
Aadland, D. and Caplan, A. D. (2006). Cheap talk reconsidered: new evidence from
CVM. Journal of Economic Behavior and Organization 60: 562–578.
Adamowicz, W., Louviere, J. and Williams, M. (1994). Combining revealed and stated
preference methods for valuing environmental amenities. Journal of Environmental
Economics and Management 26: 271–292.
Alemu, M. H., Olsen, S. B., Vedel, S. E., Kinyuru, J. N. and Pambo, K. O. (2017a). Can
insects increase food security in developing countries? An analysis of Kenyan con-
sumer preferences and demand for cricket flour buns. Food Security 9: 471–484.
Alemu, M. H., Olsen, S. B., Vedel, S. E., Pambo, K. O. and Owino, V. O. (2017b).
Combining product attributes with recommendation and shopping location attributes to
assess consumer preferences for insect-based food products. Food Quality and
Preference 55: 45–57.
Alfnes, F., Guttormsen, A. G., Steine, G. and Kolstad, K. (2006). Consumers’ willingness
to pay for the color of salmon: a choice experiment with real economic incentives.
American Journal of Agricultural Economics 88: 1050–1061.
Alfnes, F. and Steine, G. (2005). None-of-these bias in stated choice experiments.
Discussion Paper DP-06/05. Department of Economics and Resource Management,
Norwegian University of Life Sciences, Norway.
Alfnes, F., Yue, C. and Jensen, H. H. (2010). Cognitive dissonance as a means of redu-
cing hypothetical bias. European Review of Agricultural Economics 37: 147–163.
Alphonce, R. and Alfnes, F. (2017). Eliciting consumer WTP for food characteristics in a
developing context: application of four valuation methods in an African market.
Journal of Agricultural Economic 68: 123–142.
Ami, D., Aprahamian, F., Chanel, O. and Luchini, S. (2011). A test of cheap talk in differ-
ent hypothetical contexts: the case of air pollution. Environmental and Resource
Economics 50: 111–130.
Azevedo, C. D., Herriges, J. A. and Kling, C. L. (2003). Combining revealed and stated
preferences: consistency tests and their interpretations. American Journal of
Agricultural Economics 85: 525–537.
Balcombe, K., Chalak, A. and Fraser, I. (2009). Model selection for the mixed logit with
Bayesian estimation. Journal of Environmental Economics and Management 57:
226–237.
Barrage, L. and Lee, M. S. (2010). A penny for your thoughts: inducing truth-telling in
stated preference elicitation. Economics Letters 106: 140–142.
Bazzani, C., Caputo, V., Nayga, R. M. and Canavari, M. (2017). Testing commitment
cost theory in choice experiments. Economic Inquiry 55: 383–396.
Bennett, J. and Birol, E. (2010). Choice Experiments in Developing Countries:
Implementation, Challenges and Policy Implications. Cheltenham, UK; Northampton,
USA: Edward Elgar.
Beshears, J., Choi, J. J., Laibson, D. and Madrian, B. C. (2008). How are preferences
revealed? Journal of Public Economics 92: 1787–1794.
Bierlaire, M. (2003). BIOGEME: A Free Package for the Estimation of Discrete Choice
Models. Paper presented at the 3rd Swiss Transport Research Conference Monte
Verita, Ascona, March 2003.
Birol, E., Meenakshi, J. V., Oparinde, A., Perez, S. and Tomlins, K. (2015). Developing
country consumers’ acceptance of biofortified foods: a synthesis. Food Security 7:
555–568.
Blumenschein, K., Blomquist, G. C., Johannesson, M., Horn, N. and Freeman, P. (2008).
Eliciting willingness to pay without bias: evidence from a field experiment. The
Economic Journal 118: 114–137.
Boland, M. J., Rae, A. N., Vereijken, J. M., Meuwissen, M. P. M., Fischer, A. R. H., van
Boekel, M. A. J. S., Rutherfurd, S. M., Gruppen, H., Moughan, P. J. and Hendriks, W.
H. (2013). The future supply of animal-derived protein for human consumption.
Trends in Food Science and Technology 29: 62–73.
ChoiceMetrics (2012). Ngene 1.1.1 User Manual and Reference Guide, Australia.
Chowdhury, S., Meenakshi, J. V., Tomlins, K. I. and Owori, C. (2011). Are consumers in
developing countries willing to pay more for micronutrient-dense biofortified foods?
Evidence from a field experiment in Uganda. American Journal of Agricultural
Economics 93: 83–97.
Christensen, D. L., Orech, F. O., Mungai, M. N., Larsen, T., Friis, H. and Aagaard-
Hansen, J. (2006). Entomophagy among the Luo of Kenya: a potential mineral source?
International Journal of Food Sciences and Nutrition 57: 198–203.
Collins, J. P. and Vossler, C. A. (2009). Incentive compatibility tests of choice experiment
value elicitation questions. Journal of Environmental Economics and Management 58
(2): 226–235.
Cova, B. and Pace, S. (2006). Brand community of convenience products: new forms of
customer empowerment – the case ‘my Nutella The Community’. European Journal
of Marketing 40: 1087–1105.
Cummings, R. G. and Taylor, L. O. (1999). Unbiased value estimates for environmental
goods: a cheap talk design for the contingent valuation method. American Economic
Review 89: 649–665.
Cárdenas, J. C., De Roux, N., Jaramillo, C. R. and Martinez, L. R. (2014). Is it my money
or not? An experiment on risk aversion and the house-money effect. Experimental
Economics 17: 47–60.
De-Magistris, T., Gracia, A. and Nayga, R. M. (2013). On the use of honesty priming
tasks to mitigate hypothetical bias in choice experiments. American Journal of
Agricultural Economics 95: 1136–1154.
Ding, M., Grewal, R. and Liechty, J. (2005). Incentive-aligned conjoint analysis. Journal
of Marketing Research 42: 67–82.
Durand-Morat, A., Wailes, E. J. and Nayga, J. R. M. (2016). Challenges of conducting
contingent valuation studies in developing countries. American Journal of Agricultural
Economics 98: 597–609.
Ehmke, M. D., Lusk, J. L. and List, J. A. (2008). Is hypothetical bias a universal phenom-
enon? A multinational investigation. Land Economics 84: 489–500.
Evans, J., Alemu, M. H., Flore, R., Frøst, M. B., Halloran, A., Jensen, A. B., Maciel-
Vergara, G., Meyer-Rochow, V. B., Münke-Svendsen, C., Olsen, S. B., Payne, C.,
Roos, N., Rozin, P., Tan, H. S. G., van Huis, A., Vantomme, P. and Eilenberg, J.
(2015). ‘Entomophagy’: an evolving terminology in need of review. Journal of Insects
as Food and Feed 1: 293–305.
FAO (2009). The State of Food and Agriculture: Livestock in the Balance. Rome: Sales
and Marketing group, Communication Division.
FAO (2013). Edible Insects: Future Prospects for Food and Feed Security. Rome.
Ferrini, S. and Scarpa, R. (2007). Designs with a priori information for nonmarket valu-
ation with choice experiments: a Monte Carlo study. Journal of Environmental
Economics and Management 53: 342–363.
Finke, M. D. (2013). Complete nutrient content of four species of feeder insects. Zoo
Biology 32: 27–36.
Fox, J. A., Hayes, D. J., Shogren, J. F. and Kliebenstein, J. B. (1996). Experimental meth-
ods in consumer preference studies. Journal of Food Distribution Research 27: 1–7.
Gibson, J. M., Rigby, D., Polya, D. A. and Russell, N. (2016). Discrete choice experi-
ments in developing countries: willingness to pay versus willingness to work.
Environmental and Resource Economics 65: 697–721.
Gracia, A. (2014). Consumers’ preferences for a local food product: a real choice experi-
ment. Empirical Economics 47: 111–128.
Gracia, A., Loureiro, M. L. and Nayga Jr, R. M. (2011). Are valuations from nonhypothe-
tical choice experiments different from those of experimental auctions? American
Journal of Agricultural Economics 93: 1358–1373.
Grebitus, C., Lusk, J. L. and Nayga Jr, R. M. (2013). Explaining differences in real and
hypothetical experimental auctions and choice experiments with personality. Journal
of Economic Psychology 36: 11–26.
Halloran, A., Hanboonsong, Y., Roos, N. and Bruun, S. (2017). Life cycle assessment of
cricket farming in north-eastern Thailand. Journal of Cleaner Production 156: 83–94.
Harrison, G. (2006). Experimental evidence on alternative environmental valuation meth-
ods. Environmental and Resource Economics 34: 125–162.
Harrison, G. W. and List, J. A. (2004). Field experiments. Journal of Economic Literature
42: 1009–1055.
Hensher, D. A. and Greene, W. H. (2011). Valuation of travel time savings in WTP and
preference space in the presence of taste and scale heterogeneity. Journal of Transport
Economics and Policy 45: 505–525.
Hess, S., Train, K. E. and Polak, J. W. (2006). On the use of a Modified Latin Hypercube
Sampling (MLHS) method in the estimation of a Mixed Logit Model for vehicle
choice. Transportation Research Part B: Methodological 40: 147–163.
Hole, A. and Kolstad, J. (2012). Mixed logit estimation of willingness to pay distributions:
a comparison of models in preference and WTP space using data from a health-related
choice experiment. Empirical Economics 42: 445–469.
Jacquemet, N., Joule, R.-V., Luchini, S. and Shogren, J. F. (2013). Preference elicitation
under oath. Journal of Environmental Economics and Management 65: 110–132.
Johansson-Stenman, O. and Svedsäter, H. (2008). Measuring hypothetical bias in choice
experiments: the importance of cognitive consistency. The B.E. Journal of Economic
Analysis and Policy 8, (Article 41).
Johansson-Stenman, O. and Svedsäter, H. (2012). Self-image and valuation of moral
goods: stated versus actual willingness to pay. Journal of Economic Behavior &
Organization 84: 879–891.
Johnston, R. J., Boyle, K. J., Adamowicz, W., Bennett, J., Brouwer, R., Cameron, T. A.,
Hanemann, W. M., Hanley, N., Ryan, M., Scarpa, R., Tourangeau, R. and Vossler, C.
A. (2017). Contemporary guidance for stated preference studies. Journal of the
Association of Environmental and Resource Economists 4: 319–405.
Kallas, Z., Escobar, C. and Gil, J. M. (2013). Analysis of consumers’ preferences for a
special-occasion red wine: a dual response choice experiment approach. Food Quality
and Preference 30: 156–168.
Kinyuru, J., Kenji, G. and Njoroge, M. (2009). Process development, nutrition and sen-
sory qualities of wheat buns enriched with edible termites (Macrotermes subhylanus)
from Lake Victoria region, Kenya. African Journal of Food, Agriculture, Nutrition
and Development 9: 1739–1750.
Krinsky, I. and Robb, A. L. (1986). On approximating the statistical properties of elastici-
ties. The Review of Economics and Statistics 68: 715–719.
Ladenburg, J. and Olsen, S. B. (2014). Augmenting short Cheap Talk scripts with a
repeated Opt-Out Reminder in Choice Experiment surveys. Resource and Energy
Economics 37: 39–63.
Lancaster, K. J. (1966). A new approach to consumer theory. Journal of Political
Economy 74: 132–157.
Lawrence, C., Zhou, J. L. and Tits, A. L. (1997). User’s Guide for CFSQP Version 2.5: A
C Code for Solving (Large Scale) Constrained Nonlinear (Minimax) Optimization
Problems, Generating Iterates Satisfying All Inequality Constraints. Technical Report
Murphy, J., Stevens, T. and Weatherhead, D. (2005b). Is cheap talk effective at eliminat-
ing hypothetical bias in a provision point mechanism? Environmental and Resource
Economics 30: 327–343.
Mørkbak, M. R., Olsen, S. B. and Campbell, D. (2014). Behavioral implications of pro-
viding real incentives in stated choice experiments. Journal of Economic Psychology
45: 102–116.
Nayga, R. M., Woodward, R. and Aiew, W. (2006). Willingness to pay for reduced risk
of foodborne illness: a nonhypothetical field experiment. Canadian Journal of
Agricultural Economics/Revue canadienne d’agroeconomie 54: 461–475.
Oonincx, D. G. A. B., Itterbeeck, J. v., Heetkamp, M. J. W., van den Brand, H., van
Loon, J. J. A. and van Huis, A. (2010). An exploration on greenhouse gas and ammo-
nia production by insect species suitable for animal or human consumption. PLoS One
5: 1–7.
Özdemir, S., Johnson, F. R. and Hauber, A. B. (2009). Hypothetical bias, cheap talk, and
stated willingness to pay for health care. Journal of Health Economics 28: 894–901.
Pellett, P. L. and Ghosh, S. (2004). Lysine fortification: past, present, and future. Food
Nutr Bull 25: 107–113.
Poe, G. L., Giraud, K. L. and Loomis, J. B. (2005). Computational methods for measuring
the difference of empirical distributions. American Journal of Agricultural Economics
87: 353–365.
Ready, R. C., Champ, P. A. and Lawton, J. L. (2010). Using respondent uncertainty to
mitigate hypothetical bias in a stated choice experiment. Land Economics 86:
363–381.
Rigby, D., Alcon, F. and Burton, M. (2010). Supply uncertainty and the economic value
of irrigation water. European Review of Agricultural Economics 37: 97–117.
Ruby, M. B., Rozin, P. and Chan, C. (2015). Determinants of willingness to eat insects in
the USA and India. Journal of Insects as Food and Feed 1: 215–225.
Scarpa, R., Ferrini, S. and Willis, K. (2005). Performance of error component models for
status-quo effects in choice experiments. In: Scarpa R. and Alberini A. (eds),
Applications of Simulation Methods in Environmental and Resource Economics.
Netherlands: Springer, 247–273.
Scarpa, R. and Rose, J. M. (2008). Design efficiency for non-market valuation with choice
modelling: how to measure it, what to report and why. Australian Journal of
Agricultural and Resource Economics 52: 253–282.
Scarpa, R., Spalatro, F. and Canavari, M. (2007a). Investigating preferences for environ-
ment friendly production practices. In: Canavari M. and Olson K. (eds), Organic
Food: Consumers’ Choice and Farmers’ Opportunities. New York: Springer,
115–124.
Scarpa, R., Thiene, M. and Train, K. (2008). Utility in willingness to pay space: a tool to
address confounding random scale effects in destination choice to the Alps. American
Journal of Agricultural Economics 90: 994–1010.
Scarpa, R., Willis, K. G. and Acutt, M. (2007b). Valuing externalities from water supply:
Status quo, choice complexity and individual random effects in panel kernel logit ana-
lysis of choice experiments. Journal of Environmental Planning and Management 50:
449–466.
Sonnier, G., Ainslie, A. and Otter, T. (2007). Heterogeneity distributions of willingness-
to-pay in choice models. Quantitative Marketing and Economics 5: 313–331.
Steinfeld, H., Gerber, P., Wassenaar, T., Castel, V., Rosales, M. and De Haan, C. (2006).
Livestock’s Long Shadow. Rome: FAO.
Yue, C. and Tong, C. (2009). Organic or local? Investigating consumer preference for
fresh produce using a choice experiment with real economic incentives. HortScience
44: 366–371.
Appendix A
The buns were baked at the Food Processing Workshop Unit at the Jomo
Kenyatta University of Agriculture and Technology in Kenya by professional
technicians. Since Kinyuru, Kenji and Njoroge (2009) developed their recipe
for buns from termite flour, we chose to validate this by first producing trial
buns from CF. At this point, the product characteristics were assessed in
cooperation with a nutrition scientist and several technicians. Following this,
10 participants that included five males and five females were recruited to
evaluate the products for taste, texture, colour and smell. The recipe formula-
tion that was used for the main baking is presented in Table A1. And the final
bun products which were used for the field experiment are characterised as
shown in Table A2. One can see that as the amount of the CF increases, the
bun products become heavy, soft and brown. The increase in weight of the
buns can be linked to the fact that the increase in the amount of CF lead to an
increase in the amount of fat in the buns that can reduce water transpiration.
Table A1. Ingredient composition of the buns
Appendix B
Subject instruction for the REAL treatment
‘You will be provided with 12 different choice scenarios within which three
bags (Bags 1, 2 and 3) of buns are included. Bags 1 and 2 may contain buns
made from wheat flour mixed with cricket flour. Some portion of the wheat
flour can be fortified. Bag 3 contains only buns made from the wheat flour
which was not fortified. In each scenario, you should choose ONE of the
bags you would like to purchase (Bag 1 or 2) or you can choose Bag 3 if
you would not like to purchase Bag 1 or 2. After you complete all 12 shop-
ping scenarios, we will ask you to draw a number (1–12) from an envelope
to determine which shopping scenario will be binding. In the envelope, are
numbers 1–12. If the number 1 is drawn, then the first shopping scenario will
be binding, and so on. For the binding scenario, we will look at the product
you have chosen, give you your chosen product, and you will pay the listed
price in that scenario. You should use the 90 KShs for the purchase. The
most expensive alternatives cost 90 KShs. Although only one of the 12 shop-
ping scenarios will be binding there is an equal chance of any shopping scen-
ario being selected as binding, so think about each answer carefully.’
Appendix C
Table C1. HYPO Cholesky matrix from WTP space estimates
Appendix D
Table D2. Estimates of the ω parameters based on pooled data (two treatments at a time)
Parameter HYPO and REAL HYPO and ROOR ROOR and REAL