Professional Documents
Culture Documents
Olsen - 2015 - Citizen (Dis) Satisfaction An Experimental Equivalence Framing Study PDF
Olsen - 2015 - Citizen (Dis) Satisfaction An Experimental Equivalence Framing Study PDF
Abstract: This article introduces the importance of equivalence framing for understanding how satisfaction measures Asmus Leth Olsen is assistant profes-
sor in the Department of Political Science at
affect citizens’ evaluation of public services. Does a 90 percent satisfaction rate have a different effect than a logically the University of Copenhagen. His research
equivalent 10 percent dissatisfaction rate? Two experiments were conducted on citizens’ evaluations of hospital services focuses on the effects of performance
in a large, nationally representative sample of Danish citizens. Both experiments found that exposing citizens to a information, political and administrative
psychology, behavioral public administra-
patient dissatisfaction measure led to more negative views of public service than exposing them to a logically equivalent tion, and the application of experimental
satisfaction metric. There is some support for part of the shift in evaluations being caused by a negativity bias: dissatis- methods in public administration. His
faction has a larger negative impact than satisfaction has a positive impact. Both professional experience at a hospital work has appeared in journals such as
Political Behavior, Public Choice, and
and prior exposure to satisfaction rates reduced the negative response to dissatisfaction rates. The results call for further Judgment and Decision Making.
study of equivalence framing of performance information. E-mail: ajlo@ifs.ku.dk
Practitioner Points
• The valence (positive/negative) of performance information can have substantial effects on citizens’
perception of public services—even if the underlying performance is exactly the same.
• Presenting citizens with a dissatisfaction rate of 10 percent induces a much more negative evaluation of public
services than presenting them with a logically equivalent satisfaction rate of 90 percent.
• Policy makers must carefully consider how minor equivalent changes in the presentation of performance
information can induce large shifts in citizens’ perceptions of public service performance.
C
itizen and user satisfaction surveys have The article advances this research agenda along two
become a widespread performance informa- lines. First, it introduces the importance of equiva-
tion metric across countries, services, and lence framing for our understanding of how satisfac-
levels of government (Bouckaert, Van de Walle, and tion measures affect citizens’ evaluations of public
Kampen 2005; Stipak 1980). Citizen satisfaction is services (Druckman 2004; Levin, Schneider, and
seen as a key way of overcoming the many difficul- Gaeth 1998). Specifically, it points out that logi-
ties of measuring actual outcomes in the public sector cally equivalent changes to the valence of satisfaction
by applying more subjective user-centered measures measures can have large effects on citizens’ subsequent
rather than objective quality or output metrics evaluations of public services. The trivial substitu-
(Bouckaert and Van de Walle 2003; Folz 1996; tion of a 90 percent satisfaction rate for a 10 percent
Holzer and Yang 2004). At the same time, there has dissatisfaction rate may transform positive associations
been a focus on potential biases in citizen satisfaction and memories into negative ones for the exact same
as an indicator of service outcomes (Kravitz 1998; underlying information, which, in turn, shifts citizens’
Stipak 1979; Van de Walle and Van Ryzin 2011; evaluations of the data. The article tests this hypoth-
Van Ryzin 2013; Van Ryzin et al. 2004; Williams esis by employing two experimental studies using a
1994). However, there has been little to no focus on large-scale, nationally representative sample of Danish
the potential biases that publicly available satisfac- citizens (n = 3,443). It will highlight just how sensi-
tion measures induce in the attitudes and behaviors tive the effect of performance measures on citizens’
of citizens and policy makers. This article turns its perceptions of public services is to minor changes in
focus to how malleable citizens’ evaluations of public the descriptive valence.
services are when citizens are confronted with satisfac-
tion measures of different valence but with equivalent Second, the article offers an explanation of the
information content. That is, how does a positive potency of equivalency frames from the basis of Public Administration Review,
Vol. 75, Iss. 3, pp. 469–478. © 2015 by
versus a negative framing of the exact same level of a negativity bias. The negativity bias implies that The American Society for Public Administration.
satisfaction affect citizens’ perceptions about a service? “negative events are more salient, potent, dominant DOI: 10.1111/puar.12337.
treatment.” That is, for each set of respondents asked to evaluate a Satisfied frame
Density
ents with around 100 different treatment values under each frame.
0.01
The assignment of a large range of different values allows for testing
whether the framing of performance is dependent on the numerical
content of the frame. Numerical content has been found to affect
both citizens’ attitudes and the behaviors of policy makers in unex- 0
pected ways (Ansolabehere, Meredith, and Snowberg 2013; Olsen 0 25 50 75 100
2013b, 2013c). By randomizing the numerical content, the results Citizens’ Evaluation of Hospital Performance
cannot be driven by idiosyncratic artifacts of the numerical values
Notes: The x-axis represents the dependent variable of citizens’ evaluations of
that respondents were given. Therefore, it enhances the robustness hospital performance. Higher scores indicate a better evaluation.
of the findings if they hold for a large range of numerical values. For N = 3,443.
the outcome measure, the respondents were asked to provide their
Figure 1 Density Plot of the Distribution of Responses under
evaluation of the hospital. Their response was given on a 101-point
the Two Frames
sliding scale ranging from “very bad” (0) to “very good” (100).
Respondents could not choose to not respond or to provide a “don’t
know” response. Across all treatments, the average response was Empirical Results
55.6 (SD = 24.7) and the median response 57. The distributions of responses under the satisfied and dissatisfied
frames are reported in the density plot in figure 1. Under the satis-
It is important to note that the experiment did not allow us to fied frame, citizens gave the unnamed hospital an average score of
directly assess the question of a negativity bias. As outlined in the 65.9 (SD = 20.7). However, for the dissatisfied frame, the average
theory section, if we found a difference in hospital evaluations score was only 45.4 (SD = 24.1). On average, citizens evaluated
between the two frames, we would not be able to directly attribute hospitals under the satisfied frame as significantly better, with an
this difference solely to one of them specifically. Ideally, we would average difference of 20.5 points (p < .01). The effect is substantially
like to have some neutral point of satisfaction or dissatisfaction similar or even larger if medians or trimmed means are calculated.1
to which we could compare the two frames in order to measure The effect is substantial: the mean difference corresponds to about
which one of them has an asymmetrical impact on evaluations. a one-standard-deviation change in the dependent variable. This
Unfortunately, no such neutral category is easy to come up with for strongly indicates a valence-consistent shift in evaluations induced
satisfaction measures. by framing performance information as either satisfaction or
dissatisfaction.
Instead, as argued in the theory section, we used alternative
information sources as a moderator that could shed some light on In figure 2, the framing effect is shown across the numerical treat-
which frame was driving a potential effect. In terms of alternative ments. Table 3 reports further tests of the framing effect. From the
information sources, we relied on two different indicators. The first figure and table, we can see that citizens responded positively to both
was a dummy variable indicating whether the respondent had been higher satisfaction and lower dissatisfaction rates. On average, evalu-
to a hospital within the last year (17.8 percent). This indicated a ation improved significantly, by around 0.9 point for each percent-
personal experience with hospital services, which can be seen as age point improvement in satisfaction/dissatisfaction (see model B).
an important alternative source of performance information. The We can now compare the frame treatment and numerical treatment
second indicator was a dummy variable indicating whether the in magnitude. Doing so tells us that an approximately 22-point
respondent either currently worked or previously had worked at a improvement in the percentage of satisfied/dissatisfied has the same
hospital (10.6 percent). This indicator captured any type of profes- effect as changing the overall framing from dissatisfaction to satisfac-
sional work-related experience in a hospital setting. In summary, tion. This strongly indicates the potency of the valence framing effect.
both indicators captured the extent to which respondents had
alternative sources of hospital performance other than the perfor- We can compare differences in responses to changes in satisfac-
mance information they were provided in the experiment. We also tion and dissatisfaction rates. In table 3, model C, this is done by
included some controls in some specifications in order to make sure interacting the treatment frame with the numerical frame. The
that these two indicators did not simply capture some other factor interaction term is positive, which indicates a stronger response to
correlated with alternative information sources and evaluations. numerical changes in the dissatisfaction frame. However, the effect
The controls included age, gender, and region. We also included a is not significant. This implies that a difference in the numerical
dummy for respondents with private sector employment (32.6 per- magnitude of positive- and negative-valence information does not
cent) to capture differences between them and those in the public assert any influence. The only thing that matters is the valence of
sector or out of work. Finally, a dummy captured respondents with the information, not its magnitude.
an intent to vote for one of the four parties that were either in gov-
ernment or that supported the government at the time of the study The next step was to understand the underlying mechanisms
(36.5 percent). for the large difference in evaluations for the satisfaction and
Citizen (Dis)satisfaction: An Experimental Equivalence Framing Study 473
Treatment B: Pct. Dissatisfied with Hospital Services
25% 20% 15% 10% 5%
100
Dissatisfied
50
25
Notes: Ordinary least squares estimated slopes for the satisfied (black line and dots) and dissatisfied (gray line and dots) frames. The lower horizontal axis shows the
treatment percent received under the satisfied frame. The upper horizontal axis shows the treatment percent assigned under the dissatisfied frame.
N = 3,443.
dissatisfaction frame. Here we studied heterogeneous responses for Interestingly, the main effect of work experience is negative and
respondents with alternative information sources about hospital insignificant, which suggests that there is no difference under the
services. Table 3, models D–F report these results. In figure 3, satisfied frame. In other words, alternative sources of informa-
coefficients with 95 percent confidence intervals are shown for (1) tion matter for the effect of negative-valence information but do
whether the respondent had been hospitalized within the last year not alter the effect of positive-valence information. This result
or (2) whether the respondent currently worked or had worked at is substantially the same if we add a set of control variables (cf.
a hospital. The mean differences are based on the coefficients in model F). The finding is also substantially the same if we inter-
table 1, model F. act all control variables with the treatment frame in the same
model.2
For respondents with hospital work, the interaction effect is
positive and significant at 7.4 points. That is, in the dissatisfied In figure 3, we can directly compare the effects for the two frames.
frame, respondents with current or prior work experience at a The mean difference between the two frames amounts to a statisti-
hospital gave higher evaluations than those with no experience. cally significant effect of 9.1 points (p < .01) between those with and
474 Public Administration Review • May | June 2015
12
10
−2
−4
without hospital work experience. The other indicator of alternative Table 4 Experimental Design in Study 2
sources of information compared respondents that had been to a Information Source Experimental Variation
hospital in the past year with everybody else. The interaction effect Media coverage of the hospital is: Positive/negative
is also positive but not significant. We do not observe the same The proportion of former patients who were 90% satisfied/50% satisfied/
effect for this group. The mean difference-in-difference between the satisfied/dissatisfied with their treatment: 10% dissatisfied/50% dissatisfied
Your neighbors view of the hospital is: Positive/negative
two frames is 2.4 (p = .31).
Notes: Respondents were assigned to one of 16 possible combinations of values
for the three stimuli variables. The experiment was conducted with YouGov’s
In summary, the distance between evaluations under the positive Danish online panel (n = 3,443).
and negative frame is reduced by having alternative sources of infor-
mation, and this effect is driven solely by a less negative response
under the dissatisfied frame. Importantly, if the results were driven that . . .” Below the introduction was a box with three types of
by a generally higher degree of satisfaction with hospital services information about the hospital that varied in terms of the valence
among these groups, then we would expect them to respond more of the information provided. The three sources of information were
positively under both frames. However, this is not what the findings shown simultaneously. Information sources and the experimental
indicate. The fact that alternative information sources affect only variation are reported in table 4. The main source of information
evaluations under the dissatisfaction frame indicates that alterna- echoed that of study 1: participants were informed about satisfac-
tive information sources are likely to play a role in diminishing the tion/dissatisfaction performance measures for former patients of the
negativity bias.3 hospital.
Study 2: Experiment with Multiple Performance Cues As in study 1, the percentage of satisfied/dissatisfied was var-
The second study served two purposes: (1) we were able to assess ied, here with only four equivalent quantities. The two alterna-
whether the results of study 1 would hold up if the performance tive sources of information were media coverage and neighbors’
information on satisfaction and dissatisfaction was presented in a opinions. These could vary only in terms of being either positive
context of alternative sources of information, and (2) the sequence or negative. Importantly, these were not equivalents of different
of valence stimuli between study 1 and study 2 allowed us to obtain valence but stated actual substantive differences. Combining treat-
a better understanding of the underlying mechanism. Specifically, ment variations across the treatment types of information sources
the experiment in study 2 occurred after study 1, which allowed generated 16 different treatment combinations (2 x 2 x 4 full facto-
us to analyze the effects in study 2 conditional on the treatment rial design). This gave us approximately 215 respondents in each
received in study 1. Study 2 relied on the exact same participants as condition.
study 1.
Below the information box, the respondents were asked to report
Experimental Design and Procedure how likely they were to undergo the operation at the aforemen-
Study 2 had a between-subjects design. All respondents were con- tioned hospital. The response was provided on a scale similar to that
fronted with the following hypothetical scenario: “Imagine that you used in study 1, a 101-point sliding scale ranging from “not at all
have become sick and that you therefore need to have a nonemer- likely” (0) to “very likely” (100). Across all treatments, the average
gency operation made at a hospital. About the hospital, you know response was 49.0 (SD = 26.9), with a median of 51.
Citizen (Dis)satisfaction: An Experimental Equivalence Framing Study 475
Table 5 Ordinary Least Squares Results from Study 2 However, if respondents received a dissatisfaction frame in study
Satisfaction/dissatisfaction (ref. = 10% dissatisfied) (ref.) 2, their response depended in part on their treatment frame in
90% satisfied 15.62** study 1. Specifically, those who received a satisfaction frame in
(1.04) study 1 evaluated the dissatisfaction frame significantly better, by
50% satisfied −11.67** 4.7 points, in study 2 than those who had dissatisfaction frames in
(1.03)
50% dissatisfied −19.24**
both studies (p < .01). This did not, however, work in reverse. Those
(1.04) first exposed to a dissatisfaction frame did not rate the satisfaction
Media coverage (1 = positive) 12.97** frame any differently from those first exposed to a satisfaction frame
(0.73) (p = .83). The results are substantially the same or stronger if median
Neighbor opinion (1 = positive) 13.28**
(0.73) values are calculated.4 In other words, exposure to positive-valence
Adjusted R2 0.36 information diminished the (negative) response to subsequent
F-statistic 386.3 negative-valence information—but not the other way around. One
N 3443 interpretation is that respondents first exposed to the satisfaction
Notes: Ordinary least squares estimates with standard errors in parentheses. frame became aware of the equivalence and therefore viewed the dis-
Dependent variable: Likelihood of undergoing the operation at that hospital,
“not at all likely” (0) to “very likely” (100). Significance levels denote *p < .05
satisfaction frame more mildly (Druckman 2004). This “correction”
and **p < .01. then did not happen the other way around, which could indicate
that the satisfaction frame was viewed as the baseline or “regular”
Table 6 Mean Responses in Study 2 Conditional on Treatment in Study 1 metric. This interpretation supports the negativity bias in the sense
Study 2 that exposure to both forms leads to a calibration of judgment con-
Study 1 Satisfaction Frame Dissatisfaction Frame cerning the negative information and not the positive information.
Satisfaction frame 54.7 45.6
[52.9–56.5] [43.9–47.3] Conclusion
(n = 868) (n = 848) Citizens’ immediate responses to satisfaction metrics can be highly
Dissatisfaction frame 55.0 40.9
[53.2–56.8] [39.2–42.6] contingent on an arbitrary choice of a positive or negative label
(n = 855) (n = 872) description. This is not a trivial fact in a world in which citizens’ sat-
Notes: Dependent variable: Likelihood of undergoing the operation at that isfaction rates play a key role in informing the public about public
hospital, “not at all likely” (0) to “very likely” (100). Mean responses with 95% sector performance. This article has shown how equivalence framing
confidence intervals in brackets and sample sizes in parentheses. of citizen satisfaction measures can assert a huge impact on how citi-
zens evaluate public services. Two experiments in a large, nationally
Empirical Results representative sample of Danish citizens highlighted the following
The main results are reported in table 5. First, we can note that major findings on the potency of equivalence framing.
respondents valued positive media coverage and neighbors’ opinions
at the same order of magnitude. Changing Study 1 found that framing hospital perfor-
media coverage or neighbors’ opinions from Citizens’ immediate responses mance in terms of dissatisfaction instead of
negative to positive caused about a 13-point to satisfaction metrics can be satisfaction had a substantial negative impact
increase in the likelihood of choosing the hos- on citizens’ evaluations of hospital services.
pital for a nonemergency operation. Turning to
highly contingent on an arbi-
The effect was constant along a large inter-
the satisfaction/dissatisfaction frames, we find a trary choice of a positive or val of different numerical treatments for the
similarly large and significant effect as in study negative label description. percentage of patients being either satisfied
1. Shifting from a hospital with 90 percent or dissatisfied. Study 2 replicated the fram-
being satisfied to one with 10 percent being dis- ing effect in an experiment that also offered
satisfied changes the likelihood of participants choosing that hospital alternative sources of information with both negative and positive
by around 15 percentage points. For the lower level of 50 percent valence. Importantly, the change in effect attributable to the equiva-
satisfaction/dissatisfaction, the difference is around 7.5 points ( p < lence frame was comparable to valence changes for media coverage
.01). This highlights that study 1 may have induced an awareness of and neighbor opinions, which both were nonequivalence frames.
the equivalence between satisfaction and dissatisfaction that reduced
the overall framing effect in study 2 (Druckman 2011). Theoretically, we also aimed to interpret part of this effect to reflect
a negativity bias. That is, differences in responses to satisfaction and
This being said, it speaks to the power of the equivalency frame of dissatisfaction measures are driven by a disproportionate impact
satisfaction/dissatisfaction that the effects are similar for those found of negative-valence performance information—in this instance, a
for logically equivalent changes in valence compared with logically dis- dissatisfaction rate. Some additional findings support this inter-
similar changes in valence (e.g., positive versus negative media cover- pretation. First, respondents with current or prior work experi-
age). Finally, we combined the treatments in study 1 and study 2 ence at a hospital were less affected by the dissatisfaction frame
in order to test whether switching frame types changes the effect of but responded like everyone else to the satisfaction frame. We take
exposure to satisfaction and dissatisfaction percentages. The results this as an indication that alternative sources of information about
are reported in table 6. performance diminish the negativity bias. Second, by combining
treatments across studies, we were able to show that those exposed
For those who received the satisfaction frame in study 2, the to satisfaction in study 1 and dissatisfaction in study 2 were less
frame received in study 1 did not affect their mean evaluations. negative in their assessment than those exposed to other treatment
476 Public Administration Review • May | June 2015
combinations. We again take this to support a negativity bias either negative or positive response among citizens and, from there
interpretation of the main results because it highlights that prior on, the rest of the political-administrative system.
exposure to positive-valence information weakens the response to
negative-valence information—but not the other way around. Acknowledgments
The research was made possible by Grant No. 0602-02577B from the
A minor cautionary note on the limits of the findings should be Danish Council for Independent Research. An earlier version of the
made as a result of the hypothetical, low-incentive setting of the paper was presented at the 11th Public Management Research Confer-
experimental design. Performance information is often presented in ence, Madison, Wisconsin, June 20–23, 2013. I would like to thank
a more data-rich context of news reports, government Web pages, the conference participants for valuable comments. The manuscript
or official publications. This being said, the analysis points to the has also benefited greatly from the feedback from three anonymous
importance of equivalence framing and, in part, the negativity bias reviewers and the editors of PAR. Any remaining errors are my own.
for citizens’ response to satisfaction measures, and the findings were
very substantial and identified in a large representative sample. In Notes
addition, we could argue that the identified framing effect and nega- 1. The median responses are 72 for the satisfaction frame and 45 for the dissatisfac-
tivity bias could be even larger in a real-world setting, where citizens tion frame. This gives a median difference of 27 points. If we apply trimmed
have to make sense of a greater number of cues about performance. means, where the 10 percent most extreme observations in each tail are cut out,
This could leave them even more vulnerable to the effects found the results are 67.8 for the satisfaction frame and 44.9 for the dissatisfaction
across the two studies. frame. This gives a mean difference of 22.9 points (rounded).
2. The interaction effect of work experience is 6.4 (p < .05), and the main effect of
The study offers three broader implications for our understanding work experience is −1.0 (p = .45).
of citizens’ satisfaction measures and performance information more 3. Psychological research has found slower reaction times for processing nega-
generally: The first implication relates to our use and understanding tive information compared with positive information (Baumeister et al. 2001,
of the effects of citizen satisfaction measures. Today, both govern- 342). Some researchers have attributed this to the way that negative informa-
ments and researchers are focused on citizen satisfaction. We now tion initiates more complex and deeper processing (Rozin and Royzman 2001).
know that changing the discussion to dissatisfaction has the potential Respondents provided with the dissatisfied frame were about 4.7 seconds slower
to fundamentally affect citizens’ evaluations. This should remind in their response time than respondents under the satisfied frame (dissatisfied
both practitioners and researchers about the fragile nature of the frame, M = 25.1 seconds; satisfied frame, M = 20.4 seconds). If median response
performance measures we apply and how they affect the greater times are used, the difference is cut to about 1.4 seconds.
public. Small, arbitrary changes in the reporting of performance 4. Median values: dissatisfaction to satisfaction, 53; satisfaction to satisfaction, 52;
information have the potential to significantly shift the perception satisfaction to dissatisfaction, 49; dissatisfaction to dissatisfaction, 42.5.
of public services among those receiving the information in both
negative and positive directions. References
Ansolabehere, Stephen, Marc Meredith, and Erik Snowberg. 2013. Asking about
The second implication is a need to integrate our study of the Numbers: Why and How. Political Analysis 21(1): 48–69.
effects of performance information into the broader framing Baumeister, Roy F., Ellen Bratslavsky, Catrin Finkenauer, and Kathleen D. Vohs.
research agenda (James 2011b; Moynihan 2008). There are end- 2001. Bad Is Stronger than Good. Review of General Psychology 5(4): 323–70.
less ways to present performance information to citizens, and Behn, Robert D. 2003. Why Measure Performance? Different Purposes Require
shifting the valence of a metric’s descriptive label is just a very Different Measures. Public Administration Review 63(5): 586–606.
basic example. Other examples include logically equivalent ways Bloom, Howard S., and H. Douglas Price. 1975. Voter Response to Short-Run
to present performance information on different scales and with Economic Conditions: The Asymmetric Effect of Prosperity and Recession.
the aid of various reference points (Olsen 2013a). This opens up American Political Science Review 69(4): 1240–54.
a broader research agenda on the informational cues that affect Bouckaert, Geert, and Steven Van de Walle. 2003. Comparing Measures of Citizen
citizens’ attitudinal and behavioral response to performance Trust and User Satisfaction as Indicators of “Good Governance”: Difficulties in
information. Linking Trust and Satisfaction Indicators. International Review of Administrative
Sciences 69(3): 329–43.
The final implications of an equivalence framing effect and the Bouckaert, Geert, Steven Van de Walle, and Jarl K. Kampen. 2005. Potential for
underlying mechanism of a negativity bias point at downstream Comparative Public Opinion Research in Public Administration. International
effects for the broader political-administrative system. The impor- Review of Administrative Sciences 71(2): 229–40.
tance of performance information valence adds another example Boyne, George A., Oliver James, Peter John, and Nicolai Petrovsky. 2009.
to the list of ways in which individuals’ perceptions of blame are Democracy and Government Performance: Holding Incumbents Accountable in
potentially malleable (Marvel 2014). Maybe the apparent wide- English Local Governments. Journal of Politics 71(4): 1273–84.
spread use of positive-valence performance information is indicative Carpenter, Daniel P., and George A. Krause. 2012. Reputation and Public
of policy makers’ fear of backlash. Some have linked the negativity Administration. Public Administration Review 72(1): 26–32.
bias to the general blame avoidance observed at the political level Chingos, Matthew M., Michael Henderson, and Martin R. West. 2012. Citizen
(Carpenter and Krause 2012; Mortensen 2013; Weaver 1986). As Perceptions of Government Service Quality: Evidence from Public Schools.
Hood notes, a “key test of political power can be said to be the Quarterly Journal of Political Science 7(4): 411–45.
ability to overcome or counteract negativity bias” (2007, 199). It Druckman, James N. 2004. Political Preference Formation: Competition,
is therefore highly important that future studies aim to understand Deliberation, and the (Ir)relevance of Framing Effects. American Political Science
how the informational valence of satisfaction measures can evoke an Review 98(4): 671–86.