Professional Documents
Culture Documents
Deontic Modality-Oxford University Press (2016)
Deontic Modality-Oxford University Press (2016)
Deontic Modality
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Deontic Modality
edited by
Nate Charlow and
Matthew Chrisman
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
3
Great Clarendon Street, Oxford, OX DP,
United Kingdom
Oxford University Press is a department of the University of Oxford.
It furthers the University’s objective of excellence in research, scholarship,
and education by publishing worldwide. Oxford is a registered trade mark of
Oxford University Press in the UK and in certain other countries
© the several contributors
The moral rights of the authors have been asserted
First Edition published in
Impression:
All rights reserved. No part of this publication may be reproduced, stored in
a retrieval system, or transmitted, in any form or by any means, without the
prior permission in writing of Oxford University Press, or as expressly permitted
by law, by licence or under terms agreed with the appropriate reprographics
rights organization. Enquiries concerning reproduction outside the scope of the
above should be sent to the Rights Department, Oxford University Press, at the
address above
You must not circulate this work in any other form
and you must impose this same condition on any acquirer
Published in the United States of America by Oxford University Press
Madison Avenue, New York, NY , United States of America
British Library Cataloguing in Publication Data
Data available
Library of Congress Control Number:
ISBN ––––
Printed in Great Britain by
Clays Ltd, St Ives plc
Links to third party websites are provided by Oxford in good faith and
for information only. Oxford disclaims any responsibility for the materials
contained in any third party website referenced in this work.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Contents
Introduction
Nate Charlow and Matthew Chrisman
. Deontic Modals and Probabilities: One Theory to Rule Them All?
Fabrizio Cariani
. Decision Theory: Yes! Truth Conditions: No!
Nate Charlow
. Linguistic and Philosophical Considerations on Bayesian Semantics
Daniel Lassiter
. Contextualism about Deontic Conditionals
Aaron Bronfman and J. L. Dowell
. Objective and Subjective ‘Ought’
Ralph Wedgwood
. ‘Ought’: Out of Order
Stephen Finlay
. On a Shared Property of Deontic and Epistemic Modals
Jessica Rett
. Modalities of Normality
Seth Yalcin
. Extreme and Non-extreme Deontic Modals
Paul Portner and Aynat Rubinstein
. Rationalization and the Ross Paradox
Benj Hellie
. Dynamic Foundations for Deontic Logic
Malte Willer
. Dynamic Expressivism about Deontic Modality
William B. Starr
. Metanormative Theory and the Meaning of Deontic Modals
Matthew Chrisman
Index
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
List of Contributors
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Introduction
Nate Charlow and Matthew Chrisman
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
(but usually also capable of taking epistemic or metaphysical modal meaning). But
deontic modal notions can also be expressed with adjectives such as obligatory and
permissible, adverbs such as “necessarily” and “possibly”, and various constructions
such as “Have a drink!” and “Unless the proposed payout is very high, one is to
reject the offer.” One of the central topics of this volume is how to understand the
meaning of the bits of language used to express deontic modal notions. We focus
by default on English, but the lessons drawn are usually intended to extend to other
languages as well to expedite the search for an overall theory of deontic modal language
(syntax, semantics, pragmatics, metasemantics) capable of integrating this language
with overlapping bits of language used to express other modal notions while explaining
notable differences.
The study of deontic modal language is interesting and important in its own right
as a central linguistic phenomenon in need of explanation. However, this volume also
concerns more philosophical topics about the operation of deontic modal concepts and
the logic of deontic reasoning. For instance, regardless of how one models the meaning
of a word such as “ought”, we face questions about what application of the concept(s) it
expresses commits one to. In what sense, if any, does ought imply can? Does a belief
about what you yourself ought to do commit you to having an intention to do this
thing? Deontic necessities and possibilities do not seem to follow all of the same logical
patterns as metaphysical necessities and possibilities (e.g., must(p) doesn’t seem to
entail must(p or q), when the must is deontic). But which logical patterns do deontic
modalities follow?
It is an interesting psychological fact that humans engage in modal thought. We
might wonder what, in general, that ability does for us, but we might also wonder how
the more specific ability to displace from what is actually the case and think about what
may, could, should, or must be the case in light of various rules or practical laws does for
us. Notably, unlike metaphysical and epistemic necessities, a deontic necessity doesn’t
guarantee actuality; as everyday life repeatedly teaches, the fact that some action is
morally, legally, prudentially, and so on, obligatory doesn’t guarantee that it will be
performed. Also, deontic modal reasoning seems to be tied up crucially in our ability
to reason practically. A good theory of deontic modality would explain all of this.
A background assumption behind the organization of this volume is that this ability
is importantly connected to the way humans are subject to norms. So developing
an adequate theory of deontic modality will also—we think—be a crucial part of
understanding the elusive but all-important topic of normativity. Moreover, given that
humans do think with deontic modal concepts, philosophers and logicians face an
important challenge: explain the logic of this thought. What are the principles by
which we should reason when deploying deontic modal concepts? How are these
related to the principles by which we should reason when determining what to do?
Are the formal tools of modal logic adequate for representing these principles of
reasoning?
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
introduction
We take the study of deontically modal language to be a window into the phe-
nomenon of deontic modality proper (and vice versa). Deontic modality, we said,
is a phenomenon that encompasses normative concepts and thought (e.g. practi-
cal reasoning), the normative rules that govern deontically modal thought, and an
account of the role deontic modality plays in our cognitive and social lives. For
philosophers, this general line of thinking is familiar from the work of C. L. Stevenson,
R. M. Hare, H.-N. Castañeda, Wilfrid Sellars, G. H. von Wright, Christine Korsgaard,
Peter Railton, Allan Gibbard, Simon Blackburn, Ruth Millikan, and Sharon Street.
For linguists (and philosophers steeped in the methods of modern linguistics) it
is probably more unfamiliar. Such questions have perhaps not traditionally been at
the forefront of the minds of those who work on the meaning of deontically modal
language. Indeed, it might seem that those interested in the phenomena surrounding
deontic modality proper are interested in something altogether different from the
things that occupy the attention of linguistic work on deontic modality. Is that right?
In part, yes. Modern linguistic semantics tries to derive correct truth conditions for
a whole sentence as a function of the meanings and arrangement of that sentence’s
syntactic parts. The task of the linguist, faced with a sentence such as (), is straight-
forward.
() You must wash your hands.
We identify ()’s syntactic parts (and their mode of arrangement), and assign ‘must’
an interpretation that allows it to combine with the other parts of (), so that () is
assigned a truth condition of the appropriate sort.
In this vein, Angelika Kratzer (see, e.g., Kratzer , , ) has developed and
defended what we might call the “Standard” or “Orthodox” or “Classical” semantics
of deontically modal language. Kratzer’s theory—endorsed by acclaim in certain areas
of linguistic semantics—is subtle and multi-faceted (see Portner , chapter ). The
driving ideas are that, first, deontic modality is of the same species as, for example,
metaphysical and epistemic modality and, secondly, that any differences in meaning
between these species of modality can be explained by appeal to different ways of
determining a domain of quantification.1 The idea is that all types of modality involve
some type of quantification over a domain of possibilities that is determined contextu-
ally, by appeal to a contextually fixed range of relevant possibilities and a contextually
fixed way of ordering or organizing those relevant possibilities according to relevant
1 Neither this idea—that modal expressions exhibit something like polysemy—nor the treatment of the
modalities as quantifiers whose specific interpretation is resolved by context is due originally to Kratzer.
These ideas tend, however, to be most closely associated with Kratzer, and with good reason. Kratzer was
the first to develop a detailed and predictive account of the nature of this polysemy, the nature and properties
of the relevant context-sensitive parameters (her “modal base” and “ordering source”), and the relationship
between these parameters and the domain of possibilities over which modals are thought to quantify. The
fact that the account integrates seamlessly with Kratzer’s “Restrictor” account of indicative conditionals
(which is also effectively standard in certain circles in linguistic semantics) renders this achievement nothing
short of monumental.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
criteria. (Potentially relevant criteria can include, on the non-normative side of things,
comparative normality or likelihood; and on the normative side of things, how well,
comparatively speaking, a possibility does at realizing designated goals, manifesting
various values, satisfying particular preferences, or upholding moral ideals, etc.).
Hence, on Kratzer’s view, deontic modality is distinguished primarily by the fact that
its necessities and possibilities order the relevant possibilities according to normative
criteria: criteria that bear somehow on the preferences, motivations, plans, or intentions
of some relevant agent. So, according to Kratzer, a deontic necessity modal such as
() says, roughly, that the relevant possibilities that are best (according to the relevant
criteria) are all possibilities in which you wash your hands.
Kratzer’s is an elegant, explanatory, and predictive theory of deontically modal
language (and modal language more generally). By itself, however, it does not seem
to shed much light on the distinctive features of deontic modality proper. (Notice, for
example, that the “structural” characteristics of deontic modality are the same as those
associated with any other kind of modality.)
Several papers in this volume represent broadly critical reactions to this feature
of the Orthodox account of deontic modality. Fabrizio Cariani, Nate Charlow, and
Daniel Lassiter each propose to replace Kratzer’s theory with a theory that tries
to illuminate the connection between deontic modality and substantive theories of
practical rationality (such as, for example, normative decision theories). This would
seem a very natural thing to do with deontic modally language, insofar as a normal
utterance of a sentence such as () can be plausibly thought to express something
like a verdict of a theory of practical rationality (applied to a certain kind of decision
problem).
But, these authors argue, this is a fact that is not appropriately reflected in the
Orthodox account. Cariani argues that the truth condition of a deontic modal should
reflect the fact that whether a deontic modal such as () is true can depend on the
availability of certain probabilistic information, e.g., whether or not washing your
hands is likely to leave them cleaner than before. Charlow, on the other hand, argues
that the truth condition of a deontic modal should reflect the fact that whether a
deontically modal claim such as () is acceptable can depend on the sort of theory
of normative practical rationality that it is appropriate to deploy in evaluating that
sentence. (Ultimately he sees this fact as pushing us to an Expressivistic interpretation
of the “truth condition” that is computed by semantics.) Lassiter, extending the work
of his important dissertation (as well as Portner ), argues that the truth
condition of a deontic modal should model the gradability of deontic value—as
reflected in, e.g., the fact some action can be more required than another, or very
required—and suggests that the right kind of scalar structure is provided by expected
(deontic) value, roughly of the sort familiar from Bayesian decision theory.
Such theories raise important questions about whether (and to what degree) a
semantics for deontic modals should presuppose particular commitments in nor-
mative decision theory. For it’s debatable how far competent use of deontic modals
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
introduction
requires one to have specific views about which decision theory is right (to say nothing
of requiring endorsement of some specific decision theory or other). Cariani and
Charlow are each careful to avoid presupposing a certain substantive account of the
relationship between the information that is relevant for evaluating a deontic modal
and the truth of that deontic modal—that is to say, a substantive account of practical
rationality. Each provides his own reasons for this—Cariani’s of an empirical variety,
Charlow’s of a more theoretical variety. Lassiter is mostly unmoved by these: he argues
that there is no genuine obstacle, theoretical or empirical, to writing down a correct
theory of practical rationality in our semantics for deontic modals. Whether it is
appropriate to write down theories of practical rationality in one’s semantics (and, if
so, how to do so) represents a fundamental point of disagreement between Cariani
and Charlow, on the one hand, and Lassiter, on the other.
Though this inevitably cuts things too coarsely, we find it helpful to view Cariani,
Charlow, and Lassiter as trying to motivate different, more or less radical, alternatives
to Orthodoxy. On the other hand, it is helpful to read Aaron Bronfman/Janice Dowell
and Ralph Wedgwood as offering (qualified) defenses of certain important pieces of
Orthodoxy, while highlighting the degree to which accounts in the “Kratzerian” vein
can be adapted to challenges of the sort highlighted by Cariani, Charlow, and Lassiter
(among others).
Bronfman and Dowell set out to defend Kratzer’s own theory of the meaning
of deontic conditionals: conditionals that typically express an obligation conditional
on the obtaining of some relevant fact or other. Many recent articles take Kratzer’s
semantics to task for yielding apparently incorrect truth conditions for deontic
conditionals, in, for example, the Miner Puzzle of Kolodny and MacFarlane ().
Bronfman and Dowell continue the project2 of demonstrating how a subtler under-
standing of (i) the logical forms of deontic conditionals and (ii) the manner in
which the domain of quantification for a deontic modal is resolved in context can
yield a theoretically and empirically satisfying treatment of the meaning of such
constructions.
Both Wedgwood and Finlay, like Cariani and Lassiter, make use of broadly proba-
bilistic notions such as expectation to motivate alternative semantic analyses of deontic
modals; these authors agree, roughly, that deontic modals have a semantics that is, at
least in part, “probabilistic” in nature. Unlike Cariani and Lassiter, Wedgwood suggests
that it is possible to accommodate the probabilistic character of deontic modals in a
manner consistent with the “classical” semantics (by ordering the relevant possibilities
roughly according to their expected value). Though, Wedgwood suggests, it is essential
to semantically represent the (probabilistic) “epistemic perspective” of an agent—
to account, inter alia, for the meaning of the oft-discussed “subjective ‘ought”’—we
2 Recently pursued by Kratzer herself (see her ). See also von Fintel (), Dowell (), and
references therein.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
can nevertheless accomplish this while endorsing an account that retains the overall
structure of the standard semantics.3
Finlay represents a kind of return to first principles. Virtually all work on the
semantics of deontic modals has assumed, with Kratzer, that (i) the distinction
between relevant information and relevant priorities must be somehow represented in
the truth condition of a deontic modal, (ii) the truth condition for a deontic modal
must be doubly-relative—sensitive not only to relevant possibilities, but also to ways of
organizing or ordering these possibilities according to some body of relevant priorities.
Finlay rejects this particular piece of orthodoxy—en route to developing an account of
the contrast between so-called “strong” deontic modals such as “must” and so-called
“weak” deontic modals such as “ought”. For Finlay, a strong deontic necessity modal of
the form “mustE (p)” (read: in order that E, it must be that p) quantifies universally over
the relevant E-possibilities (i.e. asserts that these are all p-possibilities). A weak deontic
necessity modal of the form “oughtE (p)” (read: in order that E, it ought to be that p)
asserts that, given E, p is likelier than any alternative—roughly, that achieving end E
makes it probable that p. Finlay, like Bronfman and Dowell, aims to show how much
power a formally unornamented “classical” account can achieve, with a sufficiently
subtle understanding of the relevant empirical phenomena and the flexibility of the
semantic apparatus at hand.
These approaches are, we think, in accord regarding the assertion that an under-
standing of deontic modality proper should inform a semantics for expressions used
to express deontically modal opinions. Consider the the introduction of a notion
such as expectation into one’s semantics for deontic modals. The introduction of
such a notion cannot obviously be motivated on ordinary “compositional” grounds.
It is rather motivated by the consideration that what one ought to do depends on
appropriate expectation. Where these approaches disagree, primarily, is over how well
the Orthodox semantics performs in representing those factors that are relevant in
determining what one ought to (or must) do.
The techniques and methods of compositional semantics also enrich our under-
standing of deontic modality—a theme evident in the papers of Jessica Rett, Seth
Yalcin, and Paul Portner and Aynat Rubinstein.
Rett argues on linguistic grounds—having to do with the interaction of deontic
modals with exclamatives (e.g. “wow!”)—that deontic modals share with epistemic
modals something like an “evidentiality” requirement—the requirement, namely,
that their complements be inferred as the conclusion of an episode of reasoning.
(The reasoning is, roughly, practical in the case of deontic modals, theoretical in
the case of epistemic modals.) If right, this should bear on debates concerning the
epistemic status of deontically modal judgments. Like epistemically modal “states of
affairs”, representations of deontically modal “states of affairs” are not formed, e.g.,
3 Cariani’s account also strives to maintain this structure, but the theoretical departures seem to us much
more significant than those in Wedgwood’s proposal.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
introduction
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
introduction
roughly, given a set of premises that entails a deontic modal conclusion, adding a
further premise won’t necessarily preserve the entailment. If Jones promised to meet
Mary for lunch, then he ought to do so. However, if he promised and breaking the
promise is necessary to save a drowning child, then it’s not the case that he ought to
keep the promise. Willer argues that the nonmonotinicity of deontic logic extends
beyond the nonmonotinicity we are familiar with from reasoning with defeasible
generalizations. It also arises—he argues—owing to the violability of obligations and
the sensitivity of obligations to epistemic uncertainty. In response, he develops a
dynamic account of deontic logic capable of handling these additional sources of
nonmonotinicity. This generates some support for the methodology of dynamic logic
over the methodology of static logic.
In a similar vein (though with more focus on natural language), Starr’s paper
explores a dynamic semantics for a fragment of deontic modal language. The general
framework is one wherein one attempts to model the meaning of a sentence in terms
of a function specifying how accepting the sentence would change an agent’s overall
state of mind. Starr argues that accepting a deontic modal sentence primarily functions
to change one’s preferences rather than one’s beliefs about the world. In this sense,
he intends this semantics as an expressivist semantics for deontic modal language,
whereby the discourse function of this language is to coordinate on preference
orderings rather than to coordinate on beliefs about the world.
Charlow and Chrisman are motivated by some of the same intuitions as Hellie
and Starr about the preference/plan-coordinating, rather than world-representing
function of deontic modals. Accordingly, they agree with Hellie’s and Starr’s anti-
representationalist outlook regarding deontic modals. However, they develop this in
ways (different from one another) meant to be more friendly to truth-conditionalist
approaches to compositional semantics (largely leaving dynamic aspects of the mean-
ing of deontic modals at the pragmatics/semantics interface).
As mentioned above, Charlow proposes to enrich standard possible worlds seman-
tics with elements of decision theory in order to reflect the way deontic modals seem
inextricably tied up with the practical language of planning and action. Then he puts
this to service of a metasemantic view about why deontic modal language has the
semantics he predicts it to have. More specifically, he argues for an expressivistic gloss
on his semantics for deontic modals: the purpose of the semantics for deontic modals
is to represent a relation of competent acceptance between agent and deontic modal.
Chrisman, by contrast, outlines what he views as three metasemantic orientations
one could take towards truth-conditional semantics: the representationist view that
truth conditions articulate ways reality might be, the ideationalist view that truth con-
ditions articulate what one asserting a sentence ought to think, and the inferentialist
view that truth conditions articulate a place in a logical space of implications. He
argues that the fact that deontic modals are intensional operators encourages the view
that they function primarily not as devices for representing bits of reality but rather as
“metaconceptual” devices for performing various logical or quasi-logical operations
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
References
Dowell, J. L. () Contextualist solutions to three puzzles about practical conditionals. In
Shafer-Landau, R. (ed.) Oxford Studies in Metaethics. Volume . Oxford: Oxford University
Press.
Fintel, Kai von () The best we can (expect to) get? Challenges to the classic semantics
for deontic modals. [online] Available from: http: //mit.edu/fintel/fintel--apa-ought.pdf.
Manuscript. MIT. [Accessed Nov .]
Horty, John F. () Reasons as Defaults. Oxford: Oxford University Press.
Kolodny, Niko and John MacFarlane () Ifs and oughts. Journal of Philosophy. . pp. –.
Kratzer, Angelika () The notional category of modality. In Eikmeyer, H. and Rieser, H. (eds.)
Words, Worlds, and Contexts. pp. –. Berlin: De Gruyter.
Kratzer, Angelika () Modality. In von Stechow, A. and Wunderlich, D. (eds.) Semantics: An
International Handbook of Contemporary Research. pp. –. Berlin: De Gruyter.
Kratzer, Angelika () Modals and Conditionals. Oxford: Oxford University Press.
Portner, Paul () Modality. Oxford: Oxford University Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Deontic Modals and Probabilities:
One Theory to Rule Them All?
Fabrizio Cariani
Introduction
An important chapter of the semantics and pragmatics of modals concerns deontic
sentences such as:
() You should move that pawn.
() You may not move a pawn that way, you must move it this way.
() It is better to move a pawn than to eat it.
A theory of the meaning of these sentences must specify when they are acceptable,
the kind of content they express, how they contribute to more complex expressions
that embed them, and what systematic effects they may have on the evolution of a
conversation.
In this paper, I compare some ways of developing such a theory on the basis of
empirical considerations as well as philosophical and methodological ones. Philo-
sophically, the main theme of my discussion is the connection between the conven-
tional meanings of deontic vocabulary and substantive theories about what one ought
to do. Accounts of the meaning of deontic vocabulary are not alone in making pre-
dictions about deontic sentences. Such sentences may also be implied by substantive,
nonlinguistic theories. For example, () may follow, depending on the circumstances,
from (i) the rules of chess (ii) from a theory of tactically correct play or (iii) from
a more general theory of rational decision making. Say that a practical theory is any
theory that issues verdicts of the form,
() Given α’s circumstances, goals and information, α ought to ϕ
Given α’s circumstances, goals and information, α is allowed to ϕ.
What is the relationship between practical theories and the semantics of deontic
modals? Should a practical theory be built into the conventional meaning of deontic
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
modals? The hypothesis that has informed much research in formal semantics1 is
that the semantic theory should be, within certain limits, neutral between different
practical theories. This is the hypothesis I aim to clarify, articulate and defend here.
It is important to distinguish at the outset two different degrees of neutrality.
One may note that practical theories can conflict with one another about particular
sentences, and maintain that it is not up to the semantics to settle these conflicts.
Consider again: “You should move that pawn”. If my desire is to lose the game and go
home, a decision theory yields different verdicts from a theory of tactically correct play.
To be neutral in the first sense, a semantic theory should be above these differences.
A precise way of putting the requirement is that for any agents α, and predicate ϕ, such
that it is contingent whether ϕ applies to α, α should ϕ is consistent.
This degree of neutrality is usually achieved by adopting a flexible semantics
for modals. For example in the classical, contextualist ordering semantics, deontic
modals—and, in fact, modals in general—are analyzed relative to contextual param-
eters. Among these contextual parameters is an ordering of possibilities. In context,
practical theories can help fix this ordering, but that is the entire extent of their
contribution. However, there are no context-independent constraints on what can be
the source of this ordering. On this picture, there is a thin lexical meaning for deontic
modals that is largely independent of practical theories, but practical theories can
affect the sharpenings and connotations that individual deontic modals acquire in
context. Given some orderings, “You should move that pawn” is true; given others,
it is not.2
I will argue that we need a slightly stronger degree of neutrality. There are reasonable
practical theories that make simultaneous predictions about sets of sentences. If the
semantic theory is to be compatible with these theories, it seems, it should make those
sets of sentences consistent. Accordingly, the stronger degree of neutrality is this: for
every coherent practical theory P, if there are circumstances in which P entails a set
of deontic sentences S, then the semantic theory ought to imply that S is consistent.3
So, for example, if there is a coherent practical theory that entails “You should move
that pawn unless your king is threatened” and “If your king is threatened, you should
move the queen”, then these sentences ought to come out as jointly consistent. It
is important to note at this stage that even the stronger degree of neutrality does
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
not amount to the suggestion that the verdicts of every practical theory should be
consistent. Some practical theories are straightforwardly incoherent. No one claims
that their predictions ought to be made consistent.
My main negative thesis is that analyses that attempt to explain the meaning of
deontic modals in terms of expected value are in violation of this stronger, but not
universal, requirement of neutrality.4 The argument against expected value accounts
will not assume neutrality as an abstract principle. What I hope to show, instead, is
that violating it has specific problematic empirical consequences (§).
On the other hand, expected value accounts have some attractive features. They
systematically capture the idea that our dispositions to accept and reject deontic
statements are tightly linked with probabilistic judgments. Indeed, before criticizing
them, I will describe some facts concerning the interaction of probability operators
and deontic modals that seem to motivate expected value accounts (§).
The main positive thesis of the chapter is that it is possible to preserve the virtues of
expected value accounts while avoiding their problems. I show how to characterize a
probabilistic premise semantics that shares the features that motivate expected value
accounts without their problematic consequences (I develop an initial version of my
proposal in § and a refinement in §).5
In addition to these programmatic theses, the paper also has a more modest
ambition. It is to show, as the semantics of §§– does, that it is possible to merge the
insights of a classical premise semantics for deontic modals with the current explosion
of work on probabilistic semantic theories for epistemic vocabulary. In other words,
to show that it is a mistake to view the space of possible theoretical options as neatly
partitioned in two—with classical non-probabilistic theories on the one hand and
expected value analyses on the other.
4 Expected value accounts are developed, among other places in Goble (); Cariani (); Lassiter
(); Wedgwood (). This kind of view is also explored without endorsement in Yalcin (a). The
theoretical proposals in these references differ on several substantial points.
5 The proposal generalizes the theory of Cariani et al. (). A preliminary sketch has appeared in my
Cariani (b).
6 In this paper, metalinguistic variables A, B, C range over sentences, while A, B, C range over sets of
worlds. In informal discussion and within the same stretch of discourse the value of A is the set of worlds
at which A is true. In more formal discussion, I use the more standard notation [[A]] to denote A.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
theory counts as probabilistic because expected values are defined partly in terms
of probability functions. Another probabilistic theory is the end-relational analysis
(Finlay, , ). According to this view, should A is true in a context c just in
case the (contextually salient) probability of A given the contextually salient end E
is greater than the probability of the relevant alternatives given E. According to yet
another analysis (see Carr ) should A is true iff A is entailed by all the “best”
alternatives where “best” is calculated on the basis of contextually supplied decision
rule d, probability Pr, and value function v. To these theories, I will add a fourth
in §.
More precisely, these analyses appear to commit to:
Probabilize Deontic Semantics (in short: Probabilize):
(a) In defining an intepretation function [[·]]· for a fragment of natural language
including deontic modals, we should include a probabilistic parameter (e.g.
a probability function) among the parameters relative to which we interpret
sentences.
(b) Deontic sentences depend non-trivially on this probabilistic parameter (to
be precise this means that for some deontic sentence A there are parameter
assignments π and π ∗ that differ only in their Pr coordinate, such that
∗
[[A]]π = [[A]]π ).
What motivates this idea? In this section, I propose two arguments in its favor.
I emphasize that neither argument is intended as an unconditional proof of Prob-
abilize. In other words, neither argument aims to motivate every kind of theorist
regardless of their prior commitments. Instead, both arguments depend on substantive
assumptions that I do not intend to defend here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
() It is twice as likely that your opponents will attack you on the right flank
as not.
These are the kinds of facts we would like a joint theory of modals, conditionals, and
probability operators to explain.7
Accounting for the meanings of embeddings such as ()–(), I claim, motivates
Probabilize. The argument I advance here is that, given a natural view of the meanings
of their probabilistic antecedents, we must handle their consequents as sensitive to an
underlying probabilistic structure.
Let me first discuss the probabilistic antecedents. Much recent literature has devel-
oped reasons in favor of a probabilistic semantics for such sentences as:8
() It is likely that they will attack you on the right.
For example, Yalcin () shows that a probabilistic semantics is a natural fit for a
core of inferences involving probability operators.9 Swanson () notes that it seems
difficult, if not impossible, to reduce the quantitative aspect of probabilistic discourse
to a purely qualitative basis. On a simple probabilistic semantics, the semantic value
of the probability operator likely (short for ‘It is likely that. . .’) may be characterized
as in ().
() [[likely A]]c,Pr,w = T iff Pr(A) > .
If a probabilistic semantics such as () is on the right track, we should ask what
sort of local context is created by evaluating conditional antecedents of the form
If it is likely that A, . . . .
One way to get a grip on this problem is to think about what it is to suppose
that It is likely that A. One option is factualism—the view that in making such
suppositions one entertains a standard possible world proposition. Factualism is
compatible with the semantics in (), provided that one is a contextualist of a certain
kind. One can maintain that context determines a probability function Prc , and so
7 It is sometimes said that the goal of semantic theory is to predict which inferences are deemed
acceptable by competent speakers of a language (Chierchia and McConnell-Ginet, , p. ; Yalcin, c;
Holliday and Icard, , §). From this point of view, it may appear unclear exactly what needs to be
explained about ()–(). Surely, a minimal bar would be to explain why they are consistent. However, and
crucially, a semantic theory must deliver at least two more things: first, together with the metasemantics,
it must make predictions about speakers’ dispositions to accept sentences in context; secondly, it must
connect smoothly to an account of communication and conversation, so as to explain phenomena such as
disagreement, assertion, retraction and to form a basis for the calculation of implicatures. It is with respect
to the first of these two additional tasks that ()–() pose the biggest problems.
8 Yalcin (, , b); Swanson (); Lassiter (); Rothschild (); Moss (). In addition
to these authors, some think that probabilistic resources might be needed for the semantics of conditionals:
see Edgington (), Bennett () among others.
9 But see Holliday and Icard (, §), for an account that retrieves essentially the same core inferences
identified by Yalcin in a comparative probability setting. In reply, Lassiter (forthcoming) notes that there
might still be inferences involving sentences with ratio modifiers (A is twice as likely as B) that elude the
comparative probability approaches.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
It is likely that A expresses in c the proposition that is true at a world w just in case
Prc (A) > .. As for how context might determine such a probability function, one
option is to claim that Prc is the subjective credence of the speaker of c. More complex
accounts are possible (e.g. the collective credence of some salient group in c).
Despite the familiarity of this proposal, there are well-known reasons to doubt
it (Yalcin, , ; Swanson, ; Rothschild, ). For instance, Yalcin (,
pp. –) and Rothschild (, p. ) note that factualism seems to make
attitudes towards modal and probabilistic contents be about the wrong sorts of things.
To make a supposition that A is likely is to (provisionally) enter into a particular state of
uncertainty about the world. According to the factualist, however, when Dana makes
such a supposition, she enters into a state of certainty about a credence, either her own
or the group’s. This seems to be the wrong content for the supposition in question, and
it motivates a different model of attitudes towards probabilistic contents.
I don’t wish to debate the factualist here about whether her position has been
thoroughly refuted by these objections. These arguments are meant to open up new
lines of inquiry, not close off the old ones. They motivate a non-factualist model
of probabilistic discourse. Ultimately, the score can be settled only globally—by
considering a large variety of phenomena and fully fleshed-out theories.
The baseline non-factualist idea for probabilistic sentences is this: to suppose it will
likely rain is provisionally to adopt a state that supports the probability of rain. Moving
back to the case of conditionals, the basic idea will be to conceive of probabilistic
antecedents as operating on an epistemic (and specifically probabilistic) state that
is given as a coordinate of semantic evaluation. Evaluating an antecedent of the
form likely A involves creating a local context according to which A is likely. The
consequent of the conditional is evaluated relative to such a local context. The details
of this operation vary from semantic theory to semantic theory. However, a simple
preliminary model of how this might work (not the one I will end up adopting in §)
is as follows. Assume that probability operators are evaluated relative to a probability
function, as in () above. To evaluate conditionals of the form (if likely A)(B), shift the
probability function Pr, to some other function Pr that is suitably related to Pr and
makes A likely.
If this picture of the behavior of probabilistic antecedents is on the right track,
an account of the meaning of () and () requires a semantics for should on which
the deontic modal is sensitive to the probabilistic update introduced by likely A.
Otherwise, we would not be able to predict the consistency of ()–(). The same
holds for other modal expressions with deontic interpretations, such as may, must,
better, etc.
The first argument for Probabilize, then, is that it constitutes a natural extension
of a plausible picture concerning the meaning and conversational role of expressions
of probability. The extension is needed to make sense of what pairs such as ()–()
mean. It should be obvious that this argument for a probabilistic framework is only
as strong as the underlying picture of probability claims. I take this as a welcome
result: Probabilize does not seem to be an uncontroversial thesis, but it is supported
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
10 There is a complex debate to be had about what sort of state i is and how it gets fixed. Towards the end
of their (), Kolodny and MacFarlane hint towards relativism. According to the relativist, the knowledge
state of the assessor sets the initial value for i (other operators might shift it away from its initial value).
However, their compositional semantics is explicitly neutral on this issue: a contextualist and even a certain
kind of expressivist might deploy it. Likewise, the semantic theories presented here do not require a stand on
these issues. Relativism is more explicitly avowed in MacFarlane (, §.ff.), but again the compositional
semantics presented there does not require a relativist construal.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
() [[should A]]c,d,i,w = T iff ∀w ∈ d(i), [[A]]c,d,i,w = T
Here d is a deontic selection function that inputs the information state i and out-
puts a domain. Since Kolodny and MacFarlane assume that the job of conditional
antecedents is to shrink the information state i, deontic conditionals may quantify
over a significantly different domain from bare deontic claims. This, in turn, allows an
explanation of the consistency of (), () and (). It turns out to be pretty difficult
to explain how to derive all three verdicts at once if, instead of being information-
sensitive, deontic modals quantify over a domain that is fixed by context in a more
direct way.11
The move to an information-sensitive semantics is, in my view, well-motivated.12
Since the present aim is to develop the information-sensitive approach, rather than to
justify it, I do not discuss ways of resisting arguments for information-sensitivity (see
the references in footnote ). My focus here is to argue that if we adopt an information-
sensitive approach, there is reason to take an extra step and adopt a probabilistic model
of information.
Kolodny and MacFarlane’s semantics makes deontic modals responsive to qualita-
tive information states. Consider, for example, this variant on Rescue:
Knowledge: Everything is exactly as it is in Rescue but now you (the agent) as well as all the
participants in the conversation acquire conclusive and irrefutable evidence that you have the
red gene.
11 In addition to Kolodny and MacFarlane (), see also Charlow (); Cariani et al. ();
Silk () for developments of the pro-information sensitivity arguments. For defenses of the classical
semantics, see von Fintel (unpublished); Dowell ().
12 In my preferred account, the parameter i does not have to be tied to information. But whatever it is
tied to, the job of prioritizing possibilities must be executed as a function of i.
13 I am not claiming that the semantic theory must deliver this verdict of unacceptability for (). To do
so would be to say that risk-averse decision theories are false as a matter of meaning. The point in the main
text is not to exclude these theories. It is, instead, that, in context, there are natural practical theories that
reject () in Slanted.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
red pill (of course, if . is not enough we could choose a larger probability value short
of ).
On the basis of these cases, I advance the following argument:
(P) If the difference between Knowledge and Rescue is traced to the value of an
informational parameter in the semantics, then so is the difference between
Slanted and Rescue.
(P) The difference between Knowledge and Rescue is traced to the value of an
informational parameter in the semantics.
(P) If the difference between Slanted and Rescue is traced to the value of an
informational parameter in the semantics, then Probabilize holds.
(C) Probabilize holds.
I think (P) is intuitively plausible: the cases are not sufficiently different to warrant
altogether different treatments. Those who accept Kolodny and MacFarlane’s proposal
that deontic modals are information sensitive also accept (P). But I suppose that one
may—and Kolodny and MacFarlane () do—disagree with (P). As they point out,
they adopt:
an epistemic and non-probabilistic model of information states; it takes information states to
be sets of known facts. We have chosen this model because we think that what one ought to
do (relative to an information state) supervenes on what is known: mere differences in beliefs
(or partial beliefs) or perceptual states, unaccompanied by differences in what is known, cannot
make a difference to what an agent ought to do (footnote , p. ).
Armed with this picture, one may diagnose Slanted as involving acquisition of a piece
of non-probabilistic evidence which affects the probabilistic facts only via the relevant
supervenience function. Imagine, for example, that the context of Slanted arises after
some reliable test for the red gene has come back positive.
I do not know whether the supervenience claim Kolodny and MacFarlane endorse
is true. Luckily, I do not think we need to engage with the debate over its truth. Even
granting the supervenience claim, it still does not follow that we should theorize at
the level of the supervenience basis (i.e., qualitative information states). It is a familiar
point that there are supervenience structures in which the “higher” levels are explana-
torily autonomous. For example, chemical facts might supervene on physical facts
without undermining the autonomy of chemical explanations. Closer to our concerns,
the explanatory constraints on semantics dictate relativizations that are independent
of whatever supervenience relations might hold. For instance, one may relativize the
truth-conditions of “Someone owns a guitar” to worlds and assignments but that
is perfectly compatible with the claim that existential facts about guitar ownership
supervene on worlds alone. The case of the relationship between probabilistic states
and deontic judgments may be one such case.
In sum, Kolodny and MacFarlane’s argument does not motivate resistance to my
premise (P), and without such motivation, my argument for Probabilize stands.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
This quantity is the weighted average of the value of the worlds that belong to A,
weighted by their probability (conditional on A). Some versions of decision theory
evaluate the rationality of choices on the basis of what might appear to be a sharpening
of the above concept of expected value.15 The sharpening in question involves several
additional commitments. For example, v is identified with a quantity called utility—a
numerical representation of the subjective desirability of an outcome. It is also claimed
in these theories that the rationally permissible choices are exactly those choices that
maximize expected utility.
Neither of these commitments is essential to use expected values in semantics.
According to the best versions of this approach, the choice of value function is not
encoded by the semantics and is left up to context (hence the value function need
14 For different critical discussions of the approaches I discuss in this section, see Rubinstein (,
§..) and Yalcin (a, §IX).
15 When one looks closer, it turns out to be quite different. Before formulating a precise notion of
expected utility, a decision theorist will generally set up a sophisticated modeling discipline concerning
what counts as a decision problem (see, inter alia, Joyce , chapter ). This modeling discipline affects
how one formulates the relevant notion of expected value. For example, utility does not attach to individual
possible worlds, but to coarser objects called outcomes.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
not be identified with an agent’s utility function).16 Proponents emphasize that the
point of the expected value analysis is not to stipulate that deontic modals implement
a particular decision theory—after all, they accept the milder of our two degrees of
neutrality outlined in the introduction. It is, rather, that we must do semantics with
a finer set of resources and more structure. For similar reasons, the semantic analysis
is not wedded to the particular way of computing expected values I have identified
here. One may well develop a semantic theory based on a notion of expected value
that is closer to how causal decision theorists (e.g. Skyrms, ; Joyce, ) define
the notion.
Expected value analyses come in many different variants. On one variant, should-
sentences directly express comparisons of expected value: should A may be inter-
preted as meaning that A has higher expected value than the relevant alternatives;
or perhaps it may mean that A has an expected value higher than a contextually set
threshold.
Both of these approaches yield a non-classical deontic logic (for example, should is
not closed under logical consequence and does not agglomerate over conjunction).
For those who prefer more canonical logics,17 it is easy to sketch an expected value
analysis that generates a relatively classical deontic logic. One just needs to adopt two
ideas:
(i) should/ought are analyzed as universal quantifiers
and
(ii) the domain over which they quantify is the union of those salient alternatives
that maximize expected value (relative to the given contextual parameters).
All of this is to highlight an important point: favoring expected value analyses does not
require choosing a particular semantic account and it does not determine a particular
logic.
In the rest of this section, I discuss expected value accounts generally, regardless of
differences in the logics that they project. This makes my critical task more difficult,
because I restrict myself to raising objections that apply to all expected value theories
(and there are other objections that apply to some theories, but not to others). To
narrow down the field of opposing views, I limit myself to views that entail:
(FB) should A is true if (but not necessarily only if ) the set of salient A-worlds
has higher expected value than each of the salient alternatives.
16 This is why Wedgwood () helpfully suggests using the terms “value” and “expected value” rather
than “utility” and “expected utility”. I follow Wedgwood on this score.
17 Even though I have argued (Cariani, a) that closure under logical consequence is not a desidera-
tum in a logic for should and ought, I also proposed that agglomeration is desirable over conjunction—with
the possible exception of moral dilemmas. See Cariani (forthcoming).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
(FB) should A is false if (but not necessarily only if ) for some B, the set of salient
A-worlds and the set of salient B-worlds (a) are disjoint and (b) have the
same expected value.
In less formal terms, (FB) says that if o is better than every other option, then o is what
you should do. (FB) says that if two incompatible options have equal expected value,
then it cannot be that one of them is exactly what you should do. These constraints are
satisfied by all the expected value accounts I know of.
Expected value theories can easily account for Rescue and its variants (Lassiter, ,
§...). Consider again:
() You should refuse the pills.
() If you have the red gene, you should take the red pill.
() If you have the blue gene, you should take the blue pill.
Suppose that context determines the value function v as follows v(w) = the number of
people you rescue in w (it is an arbitrary choice, in the sense that we might have picked
a different v, but it will do as an illustration) and that Pr is a probability function that
satisfies the constraints stated in the description of Rescue (e.g., the probability of your
having the red gene is .). Then,
• () is predicted to be acceptable because refusing the pills has the highest
expected value among the alternatives.
• ()–() are also predicted to be acceptable. On one way of construing condi-
tionals, conditional antecents can update the salient probability function (see the
account of the conditional in the system in §, which is available to the expected
value theorist).18 In the local context created by the antecedent of (), the option
that maximizes expected value is taking the red pill. In the local context created
by the antecedent of (), the option that maximizes expected value is taking the
blue pill.
Expected value theories also predict the variants of Rescue I have considered. For
instance, the reversal on () in Slanted and Knowledge is predicted because the
expected value of taking the red pill increases as the probability of your having the red
gene increases. Once it increases far enough, taking the red pill might become the
option with the highest expected value.
Summing up: Rescue and its variants involve decision-theoretic verdicts. A theory
that mirrors the structure of Bayesian decision theory is well suited to explain them.
18 This assumes that one of the effects of evaluating a conditional is to conditionalize the underlying
probability function. I think it is plausible that this is one possible construal of deontic conditionals. But
I also think that there is another construal on which conditional antecedents restrict a covert necessity
modal, even when should is present. See my discussion in fn. .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
19 The analysis that follows is built on a modification of Hintikka’s account based on ideas by Stephenson
() and Yalcin (). The innovation is to model modal belief by shifting not just the world of evaluation
to match the belief worlds, but also the entire belief state i.
20 A possible complication is relevant here: plausibly, thinks(j, should(A)) should not switch the value
function v to one based on j’s relevant priorities, but rather to the value function that j assigns to whoever is
the agent of the deontic modal (if there is one). To address this point, Yalcin (a) formulates the expected
value theory by replacing v, with functions h(A, w) (modeled after the hyperplans of Gibbard ) that take
each agent A and world w to a value function. We could then say that thinks shifts the salient h-function,
to hj , i.e., the hyperplan of the subject of the attitude ascription. This all strikes me as correct, but it isn’t
necessary to cover the cases I deal with here: I assume, instead, that j’s value function in w is based on the
agent’s priorities.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
() For every probability function and value function compatible with John’s state,
refusing the pill has higher expected value than the alternatives.
At this point, the problem should be clear: () does not capture the meaning of ().
() is not an ascription to John of the content that refusing the pill maximizes expected
value. Rather, () is an ascription to John of a way of prioritizing alternatives relative
to which refusing the pill is the best option.
Expected value theorists might reply that the attitude ascription expresses a com-
parison among expected values relative to a value function that builds in John’s risk
aversion. After all, as I emphasized, they need not think that one’s value function
coincides with one’s utility in the decision theorist’s sense, and it may appear that my
argument above requires this.
Although I agree that this move is available, it does not solve this particular problem.
To see why, we just have to change some features of the case, by taking the option of
refusing the pill off the table. Consider this variant context:
Rescue− : Everything is exactly as it is in Rescue but now someone is going to force-feed you a
pill (either the red one or the blue one). You get to choose which pill you will be given.
21 Technically, it is possible to define a value function that violates this. For example:
v(w)= if, in w, you are certain that you have the red (viz. blue) gene and you take the red (viz. blue)
pill.
v(w)= if, in w, you are uncertain about your genetic state and take neither pill.
v(w)=– if, in w, you either take a pill in a state of uncertainty, or take the wrong pill.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Therefore, the expected-value theorist predicts, contrary to our judgments, that () is
unacceptable in Slanted− .
To avoid this result, the expected-value theorist might claim that, in updating from
Rescue− to Slanted− , we must also change the value function. I see two problems
with this reply. First, this is neither intuitive nor theoretically motivated: nothing in
the update to Slanted− suggested that we had to affect the value function. Secondly,
even in Rescue− , we can say things like:
() Even if it were more likely than not that you have the red gene, John would
still not think you should take the red pill.
It is implausible that the local context created by the antecedent of () involves a
change in value function.
Another reply that is available to the expected-value theorist might be to reject the
operator treatment of epistemic attitudes. In the literature on propositional attitudes,
the standard alternative to operator treatments is a relational account. Let C be a
context, however conceived. Then, the first stab at a relational semantics for thinks is
given by:
[[thinks(j, A)]]C,w = T iff in w, j stands in the thinks-relation to the structured
content of A in C.
There are two main reasons to adopt a relational analysis of thinks over an oper-
ator account. First, to slice contents more finely than is allowed by a possible-
worlds representation (e.g., if and are distinct tautologies, we may want
to accept John thinks that but not John thinks that ). Secondly, to block
problems of logical omniscience (e.g., if A is a contingent sentence, it need not follow
from John thinks that A that John thinks that ). The problem I raised for the
expected-value theories does not appear to be of either kind: it does not involve the
claim that the expected-value approaches slice contents too coarsely, and it it does not
appear related to issues of logical omniscience.
This observation leads to a more general point: the appeal to the relational analysis
is merely evasive, unless it is complemented by an account of what content is expressed
by a deontic sentence in a given context. And it is difficult to imagine an account of the
content of should A that (i) is plausible in light of the meaning that the expected-
value semantics assigns to deontic sentences and (ii) makes the right predictions
about () and all the related cases in which an agent has what I have been calling a
“non-Bayesian” attitude. Let me emphasize that this is in no way a criticism of the
This kind of value function is counter to the spirit of information-sensitivity theories. All these theories
separate the contribution of the information state from the contribution of whatever device we use to
prioritize possibilities. This objection has an empirical component, in that the expected-value approach,
with v as value function and the natural assignments to the other parameters, wrongly predicts that
(i) John thinks that if you have the red gene, you should take the red pill.
should come out false.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
relational analysis. My target is the claim that the relational analysis helps with the
problem. The point that supports it is that the kinds of facts that justify the relational
analysis seems orthogonal to the present argument.
My opponent may reply here that they are not orthogonal, after all—perhaps John
really does have inconsistent deontic beliefs. But there is no reason to accept this
claim (the points that follow are more extensively discussed in Cariani, , which
was written as an elaboration of this argument). As a preliminary point, the contents
of John’s beliefs are strictly speaking consistent even according to expected value
analyses: these analyses make them inconsistent only relative to background facts
about the context. More importantly, there is a clear intuitive difference between how
we judge John’s beliefs and how we judge the beliefs of someone who ranks alternatives
in a totally incoherent way (e.g., by minimizing the minimum value). There is no
pressure for the semantics to predict the consistency of the deontic beliefs of such
an agent. From this point of view, the “miniminer” is just like someone who has
incoherent views about, say, universal quantification. I am happy to rest my argument
on the judgment that not every deviation from the expectational paradigm is a form
of logical inconsistency.
22 As with the account of attitudes I entertained in the previous section, this is an extremely simplified
approach that is nonetheless sufficient to highlight a difficulty that remains problematic even when we
complicate our account of disagreement. For a much more nuanced story about normative disagreement,
see Plunkett and Sundell ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
D∞ ∼D∞
23 Note that this notion of disagreement concerns a sentence in context. In my view the most fundamen-
tal notion of disagreement concerns contents of speech acts and attitudes, but I intend to run the present
argument without adding to the expected-value framework controversial views about contents. Given this,
the notion in the text seems fairly intuitive (see Willer, , for a use of a similar notion in the context of
disagreement with epistemic modal sentences).
24 Dan Lassiter points out to me that this is not technically true on the account of Lassiter (): on his
view, should-sentences can be false if there are small differences in expected value among options. However,
I don’t think that this response avoids the problem, though it may patch some of its occurrences. Even
on Lassiter’s view, there are many contexts in which two options A and B have similar expected value
unconditionally, but have vastly different expected values if we shift or update the underlying probability
function.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
I concede everything except for the last sentence. Maybe we do often treat events such
as D∞ as if they had positive probability. However, the decision theorists are competent
speakers of English. Other people who are well acquainted with the relevant measure-
theoretic facts are also competent speakers of English. It seems doubtful that their
judgments must be based on surreptitiously assigning positive probability to D∞ . To
put the point another way: if the objection successfully blocked my use of () as a
constraint on deontic semantics, it would be just as forceful in blocking the original
decision-theoretic arguments. But it does not do the latter, so it does not do the former.
Examples such as Darts support a more general objection. Expected-value accounts
would gain plausibility if they could be supported by the claim that they approxi-
mate the correct normative theory of rationally permissible choice. But this is not
a concession that we should grant. The decision-theory literature supports the view
that an expectational analysis of rational decision-making makes correct predictions
in a large class of one-off choice problems, restricted to individual agents and given
probabilities that are (i) determinate and (ii) regular (no contingent proposition gets
probability ). Whether it generalizes beyond that to life long plans (e.g., being a
vegetarian), collective choices or non-regular probability models is a (series of) wide-
open question(s). Darts is one example in which it seems not to, but the general
point is that it is premature to think that a formal model that is intended to have a
very clearly delimited domain of application should be constitutive of the meaning of
deontic modals.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
attitude reports for non-Bayesian agents. I have used subjects who accept maximin as
a decision rule, but the point applies to any subject who does not rank alternatives
according to their expected-value. For similar reasons, expected value analyses do
not interface well with models of disagreement. Finally, they predict that we compare
expected values even in decision problems involving zero-probability states (cases in
which even Bayesian decision theorists are reluctant to go with expectations).
25 Carr (, ) also advances a probabilistic account that is motivated by considerations of
neutrality. For Carr, the semantic theory should have a “decision rule” parameter. I prefer my proposal on a
number of specific counts, but I will not make the comparisons explicit. There also appear to be similarities
between my framework and the explanatory approach of researchers in the program of inquisitive semantics
(Ciardelli et al., ). I defer the exploration of these similarities to separate works.
26 Schroeder () argues that ought is ambiguous, and that, at least in one sense, ought should not
be understood as denoting a function from propositions to truth-values, but rather as denoting a relation
between a subject and an action. The systems I describe here can be easily reformulated to fit Schroeder’s
view of the syntactic and semantic category of ought. For the opposing side of the argument, see Chrisman
(); Finlay and Snedegar ().
27 I adopt this “translational” view of logical form for its familiarity to the philosophical audience. The
alternative, which is prevalent among linguists and I think preferable, is the view that logical form is a level
of syntactic representation of the English sentences themselves, without mediation from an intermediate
formal language.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
28 This is an attempt to replicate Kratzer’s idea that conditional antecedents always restrict modals
in their consequents, without commitment to Kratzer’s (a) syntactic hypothesis that there are no
conditional operators at the level of logical form.
29 Two remarks. First, to define certain concepts of logical consequence, it would be useful to also have a
notion of proper point of evaluation (that is a triple whose index and world coordinate are directly assigned
by context), but we will not need this here. Secondly, implementing some deontic semantic theories might
require double indexing in the sense of Kamp (). An example of this is the actualism of Jackson ();
Jackson and Pargetter ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
30 See Schroeder () for some criteria that govern deliberative interpretations.
31 Further weakenings are also possible: in fact, my semantics does not require any probabilistic structure
at all. The design principle is to make it compatible with probabilistic structure, not to require it.
32 See Cariani (a, §) for my preferred way of understanding deliberative alternatives in deontic
semantics.
33 In Cariani et al. (), we have also urged this kind of analysis to model the information sensitivity
of modals. The theory I advance in this paper is a probabilistic generalization of the view in that article.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
alternatives: to keep as much of the structure of the classical semantics, I induce the
ordering by means of properties that determine binary preferences among alternatives.
Since alternatives are modeled as propositions, the simplest extension would be to
model priorities as sets of propositions (I call these elevated priorities).
Definition (Elevated Priorities and Ordering Sources).
(i) An elevated priority is a set of propositions whose extension is fixed relative to
a probability function Pr.34
(ii) An elevated ordering source o is a set of elevated priorities.
These moves open up the possibility of ranking alternatives on the basis of genuinely
probabilistic considerations. For example, the following is a property that an alterna-
tive A might have (relative to Pr):
(*) {A | given A, it is likely (according to Pr) that Joe will be less hungry}
In many ordinary contexts, the alternative Joe eats a sandwich has this property, while
the alternative Joe runs a marathon does not. Because Pr is a parameter, changes in Pr
may affect the composition of sets such as (*).
By moving to elevated priorities we can distinguish between (*) and:
(**) {A | given A, it is guaranteed that Joe will be less hungry}
(***) {A | A is compatible with Joe’s being less hungry}
As we will see, these distinctions play a crucial role in my account.
. The Semantics
Informed by the discussion in §., we can define our indices.
Definition (Indices). An index r is a quadruple r = i, Pr, Alt, o consisting of a set
of worlds i, a probability function Pr, a set of alternatives and an elevated ordering
source.
Notation: Given r, ir , Prr , Altr , or denote the respective parameters in r.
The non-modal part of the characterization of [[·]]· is straightforward: relative to r and
w, atomic sentences get T or F according to how things are in w. Boolean operators
are given their standard clauses. It follows that non-modal sentences only depend on
the world w in a point of evaluation r, w. Here are the entries for the modal part of the
vocabulary:
34 In other words, elevated priorities are functions from probability functions to sets of propositions.
I thank Malte Willer and a reviewer for this volume for making me notice that I need this complication.
The reviewer also wonders whether the fact that elevated priorities are such functions makes my view a
notational variant of the expected-value analysis. It does not: my resulting semantics is more permissive
than (i.e., lacks some of the validities of) all the expected-value analyses discussed above, as I verify towards
the end of §..
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Completing the entry for the conditional. The entry for if appeals to an operation
r + A we have not yet defined. To define it, intersect ir with the set of A-worlds and
conditionalize Pr on the resulting qualitative state (while leaving Alt and o unchanged).
Definition (Intersective Update). r + A = ir ∩ [[A]], Prr (· | ir ∩ [[A]]), Altr , or .
It might appear at first sight that conditionals with non-modal consequents are settled
by what happens at the world of evaluation w, but recall that we assumed that for such
conditionals we always add in must when translating.
Note also that there is independent evidence that we need another construal for
indicative conditionals (in addition to the one we just characterized). In particular, we
might need a construal on which the antecedent does not have this “conditionaliza-
tion” effect.35 The conditionalization reading, however, is the one that is salient here.
Completing the entry for should. The entry for should appeals to a domain selection
function Selected(·): this function inputs an index and outputs a domain for should (if
we want to change the system to make deontic sentences contingent, Selected might
have to be relativized to a world as well).
Following the Kratzerian playbook, Selected(·) is determined by the ordering source
(but recall that we use elevated ordering sources).
Definition (Elevated Preorder).
A
r B iff {π ∈ or | (A ∩ ir ) ∈ π } ⊇ {π ∈ or | (B ∩ ir ) ∈ π }
Informally, A is at least as good as B (relative to r) just in case A ∩ ir satisfies all of the
elevated priorities (in r) that are satisfied by B ∩ ir . If, in addition, A ∩ ir satisfies some
35 Within Kratzer’s framework, Frank (); Geurts (unpublished); von Fintel () all maintain that
there is an alternative reading for conditionals on which a covert must is always added to the logical form of
the salient conditionals, whether the consequent has an overt modal or not. If we allow each modal to have its
own modal base, this will have a significant truth-conditional difference: (if A)(should B) would have the
following truth condition: for every epistemically possible world w in which A is true, [[should B]]r,w = T.
This is different from the truth-condition in the text, partly because it does not update r to r + A, and hence
does not conditionalize Pr. There is some evidence for this reading, but it does not affect any of the cases I
discuss here. See the discussion of the interplay of conditionals and deontic modals in Cariani et al. (),
§...
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
priorities that are not satisfied by B ∩ ir , we can say that A is ranked strictly above B
in the preorder (relative to r).
Definition implements another central idea of my probabilistic premise semantics:
the relative ranking of an alternative A in the preorder depends in part on ir . This is
because when we evaluate whether a priority π applies to A, we do not check whether
it applies to the whole set of A-worlds. Rather, we check whether it applies to the
distinguished subset A ∩ ir . To make this intuitive, consider two deontic alternatives,
say flying to France and flying to Japan. The claim is that, to determine which alternative
is better (relative to some contextual priorities, say, making it probable that you see
cherry blossoms), we only consider the worlds in which you fly to France that belong
to ir and the worlds in which you fly to Japan that belong to ir . If it is early spring,
the elevated priority might favor flying to Japan; if it is late fall, it does not favor either
alternative. This idea is a crucial element of the account of Rescue, Knowledge and
Slanted to be given below (§.).
The preorder in Definition determines the domain for should:
Definition (Deontic Selection).
Selected(r) = {v ∈ W | ∃B ∈ Altr [∼ ∃A ∈ Altr (A r B & v ∈ B)]}36
The informal gloss on Definition is that the domain for should consists of the worlds
that belong to the maximally ranked alternatives. This completes my discussion of the
lexical entries.
The last thing we need to check are the predictions of the semantics relative to given
indices is a notion of acceptance at an index, which is defined as follows:
Definition . A is accepted in r iff for all w ∈ ir , [[A]]r,w .
Two quick remarks: although I described must and may as having epistemic inter-
pretations, the semantics can be modified to add entries for their deontic readings. As
in Kratzer’s semantics, there is no need to postulate lexical ambiguities (at least not on
the basis of anything I have said here). The so-called “strong” necessity modal must and
the possibility modal may get their own elevated ordering source o∗ which is a subset
of o (as proposed by von Fintel and Iatridou ) and selection function Selected∗ .
The entries are altered as follows:
[[must(A)]]r,w = T iff ∀w ∈ Selected∗ (r), [[A]]r,w = T
[[may(A)]]r,w = T iff ∃w ∈ Selected∗ (r), [[A]]r,w = T
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
In epistemic flavors, we could stipulate that o∗ is empty, and hence Selected∗ (r) = ir . In
deontic flavors, o∗ is a designated subset of the elevated priorities in o. This completes
my presentation of a probabilistic premise semantics in the context of a relatively
standard deontic logic.
A second, more radical modification is possible. The First Pass semantics validates
the logical schema known as Inheritance.37
Definition (Inheritance). If A | B, then should(A) | should(B).
In Cariani (a), I challenged this principle and gave a theory that invalidates
Inheritance while validating Agglomeration:
Definition (Agglomeration). should(A), should(B) | should(A & B).
It is easy to expand the present framework to implement that non-classical theory.
Since the data I discuss here are orthogonal to that debate, I stick to the Inheritance-
satisfying variant.
. Applying the Semantics
Let us check how the account handles information sensitivity. My first goal is to show
that there are natural assignments of values to the contextual parameters that deliver
the expected verdicts in Rescue, Slanted and the whole family of cases from §. In each
case, I identify indices that plausibly represent the salient conversational contexts and
show that they make the expected predictions. Recall that the target sentences are:
() You should refuse the pills.
() If you have the red gene, you should take the red pill.
() If you have the blue gene, you should take the blue pill.
For Rescue, consider an index r with:
i = the set containing any world that is compatible with the agent’s information
in the context specified in Rescue
Pr = any probability function that respects the constraints stated in the
description of the context of Rescue
In particular: Pr(red gene) = Pr(blue gene) = ..
Alt = {take the blue pill, taking the red pill, refuse the pills}
o ={ {A | given A, it is at least . likely that nine people are saved},
{A | given A, it is at least . likely that ten people are saved}}
37 Although there are debates on the appropriate notion of validity for modal systems such as mine
(Veltman, ; Yalcin, ; Kolodny and MacFarlane, ; Willer, ; Dowell, ), every notion
I know of entails that an argument with premise A and conclusion B is valid if there is no pair r, w such
that [[A]]r,w = T and [[B]]r,w = F. This sufficient condition is enough to derive Inheritance, given the
semantics I spelled out in this section.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
I have chosen the priorities somewhat arbitrarily here, but it is easy to see that there
are many other assignments that would work
Fact . In Rescue, r accepts (), (), and ().
I.e., for any w in ir , [[()]]r ,w = [[()]]r ,w = [[()]]r ,w = T.
In the initial state, taking the red pill fails to satisfy either of the elevated priorities in
r (conditional on taking the red pill you are only . likely to save a person; similarly
for taking the blue pill). By contrast, refusing the pills satisfies the first priority: it
guarantees saving nine people. So [[()]]r = T. When we evaluate the conditional
(), however, the deontic modal appears in an embedding. We must consider the
result of updating r —and in particular ir and Prr —with the proposition that you
have the red gene (as in Definition ). Relative to this updated index, refusing the
pill still satisfies the first elevated priority (it guarantees the rescue of nine people)
and nothing else (you cannot save ten if you refuse). However, taking the red pill
guarantees (and hence makes probable) the rescue of ten people, so satisfies both
elevated priorities. For this reason, [[()]]r ,w = T—and by parallel reasoning the
same holds of ().
It is important to appreciate the significance of this reasoning. In my notation, the
technical notion of serious information-dependence in Kolodny and MacFarlane ()
amounts to the following property:
Definition (Serious Information Dependence). Selected(·) is information-
dependent iff for some A and r, Selected(r) ∩ A Selected(r + A).
Fact . Selected is seriously information-dependent.
The reasoning in support of Fact establishes Fact . The witnesses for the existential
quantifier are (i) the proposition that you have the red gene (for A) and r (for r).
To model Slanted, let r be the result of changing Pr in r to:
Pr = some probability function that respects the constraints stated in the
description of the context of Slanted.
In particular, s.t. Pr(red gene) = .
This change suffices to select taking the red pill as the highest-ranked alternative,
without changing anything else in the index. To see this, note that the option of taking
the red pill satisfies both elevated priorities, while refusing can only make it probable
that nine people are saved.
Fact . In Slanted, r accepts () and (), the negation of () and in fact, it also
accepts “You should take the red pill”.
I.e., for any w in ir , [[()]]r ,w = F and [[()]]r ,w = [[()]]r ,w = T. Moreover,
[[should (take red)]]r ,w = T
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
The verification is routine and is omitted here. The broader conclusion is that there is a
natural assignment of values to contextual parameters such that, both unconditionally
and conditionally, you should take the red pill. Moreover, this natural assignment
results from the index in Rescue by modifying only the probabilistic coordinate of
context.
Fact . There is an index r that differs from r only in its elevated ordering source,
and that accepts (), () and ().
Suppose that r is obtained by replacing o in r with:
According to r , you ought to take neither pill, even in Slanted. Effectively, r involves
priorities that are distinctive of a more risk-averse evaluation of the deliberative
situation. Even though in Slanted there is . probability that you have the red gene,
that is not enough to trigger either priority.
This showcases a precise sense in which my framework is neutral. Different com-
binations of elevated priorities can match the verdicts of a large variety of decision
rules.
As a consequence, on this theory, we can correctly model:
() John thinks you should refuse the pill.
The effect of “John thinks” is to shift the index to one that is compatible with John’s state
(we write h(x, r, w) for the index containing x’s information state in w, x’s probability
function in w, Altr , and x’s priorities in w; we do not shift the alternative set).
[[thinks(x, A)]]r,w = T iff ∀w ∈ ih(x,r,w) , [[A]]h(x,r,w),w = T
Even if the initial context is Slanted, the embedded deontic claim in () gets inter-
preted relative to something like r . In particular, it gets interpreted relative to a state
whose elevated priorities reflect John’s risk aversion.
Finally, the probabilistic premise semantics allows a simple account for Darts. First,
include models in which W is uncountably infinite. Since in Darts we still have only
finitely many alternatives, no other changes to the semantics are needed. Now, consider
an index whose only elevated priority:
() {A | A is compatible with Jeff ’s winning $ million}.
This elevated priority applies, in context, to Jeff ’s raising his right hand, but not to Jeff ’s
raising his left hand, so that we can accurately derive the judgment on ().
This example also illustrates a particularly intuitive feature of my probabilistic
premise semantics: the theory allows us to model priorities that involve assigning low
probability to a certain goal. Consider this dialogue:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
deontic modals, but rather in the theory’s modeling of probabilistic antecedents. In this
section, I briefly sketch an approach to probabilistic antecedents that, when conjoined
with my account of deontics, can handle ()–()—I do not claim it to be the only
possible approach.
The approach I have chosen for this illustration is Yalcin’s account of conditionals
with probabilistic antecedents, which he formulates as an update semantics (in the
style of Veltman , but with several important and subtle modifications). I reformu-
late Yalcin’s system using the indices from § as states and using p as a metalinguistic
variable ranging over the atomic sentences of L. In this system, we restrict [[·]] to just
the atomic sentences and give a recursive characterization of an update operation [·]
on states.
Note that Prr (· | ∅) is not and cannot be a probability function. To remain consistent
with the treatment of ir , we can suppose that Prr (A | ∅) = for all A. This means
that we have to qualify the claim that the second coordinate of our indices is always a
probability function: this is only true if understood as restricted to non-degenerate
states. In degenerate states such as ∅, Prr (· | ∅), Altr , or we allow the function
Prr (· | ∅).
This system is designed for epistemic modalities, but we can add a clause for deontic
should by assuming that, like the other modals, it performs a test on the state r (in the
sense of Veltman ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
(ii) the sentences in set are jointly accepted in r iff for all A ∈ , A is accepted
in r.
The technical machinery is best illustrated by exploring some of its consequences.
Fact . When A is non-modal and non-probabilistic,
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
For reasons of space, I do not take up the detailed construction of R . However, the
idea should be obvious in light of the earlier facts. Let R contain some sharp states r
according to which the opponents are likely to attack on the right, and some according
to which the opponents are not likely to attack on the right. Then () is accepted in
R provided that all of the sharp states according to which the opponents are likely
to attack on the right pass the test for “you should concentrate the defense on the
right”. Moreover, () is accepted in R provided that all of the sharp states according to
which the opponents are not likely to attack on the right fail the test for “You should
concentrate the defense on the right”. It is easy to construct an elevated ordering source
that accomplishes this.
The upshot of this section is that the challenge posed by ()–() can be factored into
two separate problems. The first problem concerns update on probabilistic informa-
tion. The other concerns the interface between deontic judgments and probabilistic
states. The semantics of § solves the latter problem. This is all that it should be
reasonably asked to do. If we borrow an answer to the first problem, we can inject it
in the probabilistic premise semantics and explain the meanings of these troublesome
conditionals.
Conclusion
So, is there one theory to rule them all? Probably not. For one thing, the motivational
arguments that justify my proposal depend on significant assumptions about semantic
theories for the language of subjective uncertainty, about information-sensitivity for
deontic modals, about how to integrate deontic semantics with off-the-shelf treat-
ments of attitudes and so on. For another, even granting those assumptions, a few
different systems are compatible with the ideas I have developed.
Nonetheless, there is much to recommend the core ideas of the semantic account
I developed in §. The neutrality-based desiderata are plausible and can be reflected
in specific empirical predictions. My theory does as well as one can with respect to
these desiderata. On the one hand, it is comfortably able to handle the dependence
of deontics on probabilistic states. On the other, it can model judgments that track
orderings based on expected utility, maximin, dominance and what have you, without
assuming that these practical theories are built into the semantics. It strikes a solid
balance between the desire to develop a probabilistic semantics for modals and the
desire to keep the conventional meanings of deontic modals as thin and flexible as
possible.
Acknowledgments
Special thanks to Nate Charlow, Matthew Chrisman, Daniel Lassiter, Paolo Santorio, Ralph
Wedgwood, and Malte Willer for detailed feedback on previous versions of this paper, as
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
well as to Thony Gillies and Shyam Nair for public commentaries and to Magdalena and
Stefan Kaufmann for our collaboration, which helped me sharpen the ideas in this paper. I am
also indebted to conversations and e-mail exchanges with: Mike Caie, Janice Dowell, Kai von
Fintel, Michael Glanzberg, Jeff Horty, Graham Katz, Angelika Kratzer, Steve Kuhn, Manuel
Križ, Hanti Lin, Peter Ludlow, Dilip Ninan, Paul Portner, Aynat Rubinstein, Alex Silk, Seth
Yalcin, audiences at Georgetown Linguistics Colloquium, Maryland Philosophy Colloquium,
the Rutgers Semantics Workshop, Kai von Fintel and Sabine Iatridou’s MIT Graduate
Seminar on Deontic Modals and Imperatives, the Pacific APA, Northwestern Deontic
Modality Workshop and USC Deontic Modality Conference. Finally, thanks (again!) to both
Nate Charlow and Matthew Chrisman for editing this volume.
References
Bennett, Jonathan () A Philosophical Guide to Conditionals. Oxford: Oxford University
Press.
Briggs, Rachael () ‘Normative Theories of Rational Choice: Expected Utility’. In
E.N. Zalta (ed.) The Stanford Encyclopedia of Philosophy, Fall. [online] Available from:
http://plato.stanford.edu/archives/fall/entries/rationality-normative-utility/. [Accessed
Oct .]
Cariani, Fabrizio () The Semantics of ‘ought’ and the Unity of Modal Discourse. Ph.D. thesis.
UC Berkeley.
Cariani, Fabrizio (a) ‘Ought and Resolution Semantics’. Noûs. . . pp. –.
Cariani, Fabrizio (b) ‘Epistemic and Deontic Should’. Thought. . . pp. –.
Cariani, Fabrizio () ‘Attitudes, Deontics and Semantic Neutrality’. Pacific Philosophical
Quarterly. . . pp. –.
Cariani () ‘Consequence and Contrast in Deontic Semantics’. Journal of Philosophy.
Cariani, Fabrizio, Kaufmann, Magdalena, and Kaufmann, Stefan () ‘Deliberative Modality
under Epistemic Uncertainty’. Linguistics and Philosophy. . pp. –.
Carr, Jennifer () ‘Deontic Modals without Decision Theory’. Proceedings of Sinn und Bedeu-
tung deta: Chemla, E., Homer, V. and G. Winterstein, (eds.); . École normale supérieure,
Paris, – September. Semantics Archive. . pp. –.
Carr, Jennifer () ‘Subjective Ought’. Ergo. . . pp. –.
Charlow, Nate () ‘What We Know and What To Do’. Synthese. . pp. –.
Charlow, Nate () ‘Decision Theory: Yes! Truth-Conditions: No!’ In Charlow, N. and Chris-
man, M. (eds.). Deontic Modality. Oxford: Oxford University Press.
Chierchia, Gennaro and McConnell-Ginet, Sally () Meaning and Grammar: an Introduction
to Semantics. MA: MIT Press.
Chrisman, Matthew () “‘Ought” and Control’. Australasian Journal of Philosophy. . .
pp. –.
Ciardelli, Ivano, Groenendijk, Jeroem, and Roelofsen, Floris () ‘Inquisitive Semantics: a
New Notion of Meaning’. Language and Linguistics Compass. . . pp. –.
Dowell, Janice () ‘Contextualist Solutions to Three Puzzles about Practical Conditionals’.
In Shafer-Landau, Russ (ed.) Oxford Studies in Metaethics. volume . pp. –. Oxford:
Oxford University Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
Easwaran, Kenny () The Foundations of Conditional Probability. Ph.D. thesis. UC Berkeley.
Easwaran, Kenny () ‘Regularity and Hyperreal Credences’. Philosophical Review. .
pp. –.
Edgington, Dorothy () ‘On Conditionals’. Mind. . pp. –.
Finlay, Stephen () ‘Oughts and Ends’. Philosophical Studies. . . pp. –.
Finlay, Stephen () A Confusion of Tongues. Oxford: Oxford University Press.
Finlay, Stephen and Snedegar, Justin () ‘One Ought too Many’. Philosophy and Phenomeno-
logical Research. . . pp. –.
Frank, Annette () Context Dependence in Modal Constructions. Ph.D. thesis. University of
Stuttgart.
Geurts, Bart (unpublished) ‘On an Ambiguity in Quantified Conditionals’. Manuscript.
University of Njemegen.
Gibbard, Allan () Thinking How To Live. Cambridge, MA: Harvard University Press.
Goble, Lou () ‘Utilitarian Deontic Logic’. Philosophical Studies. pp. –.
Hájek, Alan (unpublished) ‘Staying Regular’. Australian National University.
Hintikka, Jaako () Knowledge and Belief: an Introduction to the Logic of the Two Notions.
Ithaca, NY: Cornell University Press.
Holliday, Wesley and Icard, Thomas () ‘Measure semantics and qualitative semantics for
epistemic modals’. Proceedings of SALT . pp. –.
Horty, John F. () Agency and Deontic Logic. Oxford: Oxford University Press.
Jackson, Frank () ‘On the Semantics and Logic of Obligation’. Mind. . . pp. –.
Jackson, Frank () ‘Decision Theoretic Consequentialism and the Nearest Dearest Objection’.
Ethics. . . pp. –.
Jackson, Frank and Pargetter, Robert () ‘Oughts, Options and Actualism’. The Philosophical
Review. . . pp. –.
Joyce, James M. () The Foundations of Causal Decision Theory. Cambridge: Cambridge
University Press.
Kamp, Hans () ‘Formal Properties of “Now”’. Theoria. . pp. –.
Kolodny, Niko and MacFarlane, John () ‘Ifs and oughts’. Journal of Philosophy. . .
pp. –.
Kratzer, Angelika () ‘What “Must” and “Can” Must and Can Mean’. Linguistics and Philoso-
phy. . . pp. –.
Kratzer, Angelika () ‘The Notional Category of Modality’. In Partee, B. and Portner, P. (eds.)
Formal Semantics: the Essential Readings. London: Blackwell.
Kratzer, Angelika (a) ‘Conditionals’. In von Stechow, A. and Wunderlich, D. (eds.), Seman-
tics: An International Handbook of Contemporary Research. Berlin: DeGruyter Mouton.
Kratzer, Angelika (b) ‘Modality’. In von Stechow, A. and Wunderlich, D. (eds.), Semantics:
An International Handbook of Contemporary Research. Berlin: De Gruyter Mouton.
Kratzer, Angelika () Modals and Conditionals. Oxford: Oxford University Press.
Lassiter, Daniel () Measurement and Modality: The Scalar Basis of Modal Semantics. Ph.D.
thesis. NYU.
Lassiter, Daniel (forthcoming) ‘Epistemic Comparisons, Models of Uncertainty, and the Dis-
junction Puzzle’. Journal of Semantics.
Lewis, David K. () ‘Reply to McMichael’. Analysis. . . pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Lewis, David K. () ‘Ordering Semantics and Premise Semantics for Counterfactuals’. Journal
of Philosophical Logic. . pp. –.
MacFarlane, John () Assessment Sensitivity. Oxford: Oxford University Press.
Moss, Sarah () ‘On the Semantics and Pragmatics of Epistemic Vocabulary’. Semantics &
Pragmatics. . . pp. –.
Parfit, Derek () On What Matters. Oxford: Oxford University Press.
Parfit, Derek (unpublished) ‘What We Together Do’. Manuscript. Oxford University.
Plunkett, David and Sundell, Tim () ‘Disagreement and the Semantics of Normative and
Evaluative Terms’. Philosophers Imprint. . pp. –.
Portner, Paul () Modality. Oxford: Oxford University Press.
Regan, Donald () Utilitarianism and Cooperation. Oxford: Oxford University Press.
Rothschild, Daniel () ‘Expressing Credences’. Proceedings of the Aristotelian Society. .
pp. –.
Rubinstein, Aynat () Roots of Modality. Ph.D. thesis. UMass, Amherst.
Schroeder, Mark () ‘Oughts, Agents and Actions’. The Philosophical Review. . . pp. –.
Silk, Alex () ‘Evidence Sensitivity in Deontic Modals’. Journal of Philosophical Logic. .
pp. –.
Skyrms, Brian () Causal Necessity. New Haren, CI: Yale Univeristy.
Skyrms, Brian () ‘Causal Decision Theory’. The Journal of Philosophy. . . pp. –.
Stephenson, Tamina () ‘Judge Dependence, Epistemic Modals, and Predicates of Personal
Taste’. Linguistics and Philosophy. . . pp. –.
Swanson, Eric () ‘How Not to Theorize About the Language of Subjective Uncertainty’. In
Egan, Andy and Weatherson, Brian, (eds.) Epistemic Modality. pp. –. Oxford: Oxford
University Press.
Veltman, Frank () ‘Defaults in Update Semantics’. Journal of Philosophical Logic. .
pp. –.
von Fintel, Kai () ‘Conditionals’. In von Heusinger, Klaus, Maienborn, Claudia and
Portner, Paul (eds.) Semantics: An international handbook of meaning. Berlin: DeGruyter
Mouton.
von Fintel, Kai (unpublished) ‘The best we can (expect to) get? Challenges to the classic
semantics for deontic modals’. Presented at the Central APA, Chicago, IL. Available from:
http://mit.edu/fintel/fintel--apa-ought.pdf. [Accessed October .]
von Fintel, Kai and Iatridou, Sabine () ‘How to Say Ought in Foreign: the Composition
of Weak Necessity Modals’. In Guéron, Jacqueline and Lecarme, Jacqueline (eds.) Time
and Modality. (Studies in Natural Language and Linguistic Theory ). pp. –. Dordrecht:
Springer.
Wedgwood, Ralph () ‘Subjective and Objective Ought’. In Charlow, N. and Chrisman, M.
(eds.) Deontic Modality. Oxford: Oxford University Press.
Willer, Malte () ‘A Note on Iffy Oughts’. Journal of Philosophy. . pp. –.
Willer, Malte () ‘Dynamics of Epistemic Modality’. The Philosophical Review. . pp. –.
Yalcin, Seth () ‘Epistemic Modals’. Mind. . . pp. –.
Yalcin, Seth () ‘Probability Operators’. Philosophy Compass. pp. –.
Yalcin, Seth () ‘Nonfactualism About Epistemic Modality’. In Egan, A. and Weatherson, B.
(eds.) Epistemic Modality. Oxford: Oxford University Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fabrizio cariani
Yalcin, Seth (a) ‘Bayesian Expressivism’. Proceedings of the Aristotelian Society. .
pp. –.
Yalcin, Seth (b) ‘Context Probabilism’. In Aloni, M. et al. (ed.) Logic, Language, and Meaning:
Proceedings of the th Amsterdam Colloquium. University of Amsterdam, pp. –. Berlin:
Springer.
Yalcin, Seth (c) ‘A Counterexample to Modus Tollens’. Journal of Philosophical Logic. .
pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Decision Theory: Yes! Truth
Conditions: No!
Nate Charlow
Introduction
This essay attempts to leverage a special theoretical pressure involved in accounting
for the meaning and function of language with loosely-speaking normative or action-
guiding meaning. A core case—and my focus here—is that of deontic modals such
as should and ought.1 Many authors claim that a “standard” semantics for deontic
modals—on which they express varieties of quantification over contextually deter-
mined domains of deontically admissible possibilities—is unviable. In particular, the
standard semantics is said to ignore—in virtue of lacking the structure to represent—
factors relevant to determining the truth of a deontic modal in a given situation. To
yield plausible truth-verdicts for claims expressing what should or may be done in
various kinds of situation, a semantics for such claims should enrich itself with the
fruits of the normative literature on how to determine and reason about what should
or may be done in such situations. By and large, existing accounts suggest the way to
fix this is by adding structure—much implicitly borrowed from normative theory—to
the standard semantics.
The literature has grown thick with proposals in this vein (and with theoretical and
empirical justifications for complications that would otherwise appear unparsimo-
nious). But it is comparatively thin on methodological and philosophical reflection,
1 Linguists used to standard ways of carving up modals in natural language will think “deontic modal” a
poor name for the focus of this chapter. Better would be “prioritizing modal,” a label used to cover modals
with a broadly action-guiding or advisory function (namely: proper deontic, bouletic, and teleological)
(Portner ). Philosophers tend to use “deontic modal” to cover roughly the same ground as the linguist’s
“prioritizing modal.” Although I tend to prefer “practical modal” here (see Charlow a), I follow the
philosophers here. (Most of the chapters of this volume do so too.) Allow me to also note that my primary
focus in this chapter is on so-called weak deontic necessity modals (such as should and ought) whose content
is directly paraphrasable in terms of comparative betterness (on which more below; cf. especially Kratzer
() on “weak necessity”). Strong deontic necessity modals (for example, must) are a more difficult case.
(Like most who have written on the distinction, I am inclined to treat strong deontic necessity as a special
case of weak deontic necessity.)
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
whether about the endeavor itself, where the significant theoretical divisions and
methodological defaults lie,2 or the principles—besides the constraint that requires a
correct semantics to match considered judgments of correctness and consequence—
that might govern this sort of theorizing. Should we augment our theorizing about the
semantics of deontic modals with the fruits of normative theory? If so, how should we
do it? Have past efforts gone wrong in important ways?
This chapter tries to address such questions. It first attempts to build the strongest
possible case for packing the fruits of the normative literature into the semantics
of deontic modals. Unfortunately, the case crumbles on examination. There are
metasemantic reasons to demand that semantics remain neutral on certain normative
matters—namely, all of them. We get closer to the truth about the semantics of deontic
modals by simply excising normative assumptions from our semantic theory. This,
however, seems to mean giving up on the project of matching the verdict of truth
delivered by a semantics to the one delivered by intuition (or so I will argue).
This is a pessimistic conclusion, but this chapter has an optimistic response in mind:
a view that might be clumsily called Nonpropositionalist, Minimalist Expressivism. I will
suggest that deontic modals support a reorientation of semantic theorizing, away from
the aim of delivering an account of a sentence’s truth-condition, toward the aim of
modeling the state of competently accepting a sentence (“Nonpropositionalism”). More
precisely: the correct semantics for deontic modals is one that lays bare the manifold
ways in which a normative view can be combined with linguistic competence with
respect to deontic modals to generate a concrete linguistic judgment of the form
A should do X. The core compositional semantics for deontic modals is exhausted
by the idea that deontic modals quantify over a domain determined by a sequence
of parameters (of basically any length) whose values are fixed by the context of use
(namely, by the specific normative view of a subject) (“Minimalism”). The project
of attempting to understand the “content” or “truth-condition” of a deontic modal
on an occasion of use by investigating its compositional semantics should, in large
part, be abandoned. (This, I argue, appears less of a cost when married to the right
pragmatics—an “Expressivist” pragmatics.)
What is the upshot, if I am right? Corresponding issues clearly arise for language
with broadly normative or action-guiding or advisory meaning—moral language,
imperatives, and so forth. If I am right, much of the work in these areas needs to be
reconsidered, on the grounds that its authors are engaged in normative theorizing,
rather than semantics. How far this kind of critique can be extended to recent,
“cognitively laden” work on, e.g., modals, probability operators, and conditionals is
a question I will not address here, but which will merit future attention.
2 A vignette: the type of amendment to Kratzer’s view suggested in, for example, Charlow (b);
Cariani () is variously portrayed as a “challenge” and a “conservative amendment” to the “classical
semantics” (see respectively von Fintel ; Carr ). This is a theoretically confounding state of affairs
(about which I will have more to say below). (Let me note, however, that Carr’s work is an increasingly
influential exception to the trend I am referencing here.)
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
3 I will remain silent on what it takes for a possibility to be a relevant alternative when interpreting a
DNM.
4 We will understand possible worlds as functions from atomic sentences to truth-values—i.e., as
something that settles whether grass is green, sheep scream, etc. It is open to us to understand it as a set
of more complicated entities—worlds centered on a time coordinate (or agent or whatever), relations on
worlds (i.e., actions), and so on. For a sense of the different options here, see Cariani (); Cariani et al.
(); Charlow (, §).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
Definition . The lifted strict ordering ≺ is the relation p bears to q iff p q and
q p.
The definition most important for our purposes here is Definition . According to
our innocent gloss, DNMs are interpreted relative to a Modal Base and Ordering. What
conditions does a DNM express on a Modal Base and Ordering? In other words, under
what conditions is a DNM p true at R, ? According to our gloss, p says that p is
better than ¬p, relative to R, . What does this mean? It would seem to mean that
there’s no ¬p possibility (∈ R) that cannot be improved on by some p-possibility (also
∈ R); p is better than ¬p relative to R, just if, if ¬p, it is always possible to better
(i.e., if p). Formally (letting [[φ]] give the set of possibilities φ maps to true):
(Q) ∀x ∈ [[¬p]] ∩ R : ∃y ∈ [[p]] ∩ R : y ≺ x
Informally: seeing to it that ¬p will always leave things sub-optimal. It is easy to see p
is better than ¬p, by the lights of Q, iff p ≺ ¬p, by the lights of Definition . Recall
that p ≺ ¬p iff p ¬p and ¬p p. Hence, iff each ¬p world is equaled or bettered by
some p world, but not vice versa (so that at least one p-world betters any ¬p-world).
In other words, it seems that a condition of the form X ≺ Y is a reasonable way of
encoding the content of a claim of the form X is better than Y.
5 Possibly, according to Kratzer, it is partial: for some w and v: w v and v w. Partiality introduces
complications that I do not wish to discuss in the main text—and its original motivation is frankly quite
weak. I will implicitly assume that the ordering is non-partial whenever I need it to be total (see, e.g., the
sort of case mentioned in Footnote ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Now suppose we make the Limit Assumption (Lewis ), so that there is always
some non-empty set of -best worlds in R:6
(Limit) ∃x ∈ R : ¬∃y ∈ R : y ≺ x
Signpost. The point of that was this: the Standard Semantics—on which deontic
necessity involves universal quantification (and deontic possibility existential quan-
tification) over min (R)—seems to follow from a very minimal understanding of
6 The Limit Assumption is reasonable insofar as the domain of relevant alternative possibilities should
be presumed to be finite (for its negation implies that the domain is infinite). I think the Limit Assumption
is reasonable, at least for the deontic case, and I will make it here. Nothing of ultimate importance will turn
on this, however.
7 I do not mean to suggest that this should be regarded as a revelation. However, standard reasons
for thinking so are arguably mistaken. von Fintel and Iatridou (), for example, write: “Kratzer ()
distinguishes between necessity and weak necessity as well. Her informal characterization is similar to ours:
p is a weak necessity iff p is a better possibility than not p. The technical implementation is different from ours
and crucially involves not accepting that there is always a set of most favored worlds (what is known as the
Limit Assumption in the trade). It appears to us that if one makes the Limit Assumption, Kratzer’s definitions
collapse, leaving no distinction between simple necessity and weak necessity” (p. n. , emphasis added).
Contra the view expressed in this passage, the Limit Assumption is not sufficient for the relevant definitions
to collapse. Consider a finite model (a fortiori satisfying the Limit Assumption) with R = {x, a, b} in which
p = {x, a} and in which a and b are equally good (a b and b a), but x is incomparable to both a
and b. Obviously, since a b, p ¬p. But since b x, ¬p p. Hence p ≺ ¬p. Obviously, however,
min (R) = R—since no world in R can be strictly improved on—hence min (R) p. The easiest way
around this is to assume away the troublesome forms of partiality in , and I have implicitly done so in the
main text. Thanks here to Fabrizio Cariani, Alex Silk, and Eric Swanson.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
what deontic modals are in the business of saying (together with some relatively
innocent assumptions about how to structure the formal semantic apparatus). This
fact could, moreover, very reasonably be taken as a kind of roundabout vindication for
the semantic value that, e.g., Kratzer (, ) assigns to necessity modals generally.
On Kratzer’s classic treatment, necessity modals exhibit something like polysemy. The
necessity modal is assigned a context-independent, doubly-parametrized semantic
value, that, in context, maps values for these two parameters—the Modal Base (analo-
gous to our relevant alternative space) and Ordering Source (analogous to our Order-
ing) into a universally quantificational truth-condition. DNMs are distinguished from
E(pistemic)NMs, not by quantificational force, rather by being interpreted in contexts
in which deontic-specific assignments of the relevant pair-values are appropriate. The
fact that an independently motivated Standard Semantics for DNMs can be derived
from the Kratzer semantics for NMs is no small point in favor of the latter.
Refinement
It would be natural to draw the following moral from the foregoing: accounting for
facts for which the Standard Semantics fails to account will mean refining the basic
apparatus of the Standard Semantics, often times in ways that derive insight and
plausibility from phenomena falling within the domain of practical rationality. Let us
consider some examples.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
This refinement seems to derive plausibility from reflection on the parallel character
of practical rationality—more precisely, on an aspect of practical rationality that the
unrefined semantics seems to ignore. Treating some goal as designated in practical rea-
soning, as evoked in the following sort of instruction, means treating it as something
that (in Dworkinian, Rawlsian, and Nozickian idioms, respectively) trumps, is lexically
prior to, or functions as a side-constraint on the pursuit of secondary considerations.
To illustrate, consider:
() Suppose my goal is to get to Harlem from Brooklyn. What should I do?
It is beside the point to note that you will get some enormous prize if you stay in
Brooklyn; the practical question what should I do? queries which ways of getting to
Harlem are best, while taking no note of rankings amongst the various ways of not
getting to Harlem. Similarly, notice that, no matter how highly an alternative w ∈ R
ranks according to , if w does comparatively poorly by the metric of , so that
w∈ / min (R), then necessarily w ∈
/ min (min (R)).
. Epistemic and deontic orderings
Here is another illustration.8 Consider the following impeccable practical inference.
() If you can wash your hands or can wipe them on your pants, you should wash
them. (if X ∨ Y)(A)
If you can wash your hands or can use hand sanitizer, you should wash them.
(if X ∨ Z)(A)
So, if you can wash your hands or can wipe them on your pants or can use
hand sanitizer, you should wash them. (if X ∨ (Y ∨ Z))(A)
() is apparently underwritten by the following truism of practical rationality: if p is
preferable to both q and r, given a choice between p and (q ∨ r), p rationally must
remain preferable. Relatedly, and in fact more generally, if p is preferable given all
the relevant alternatives individually, p rationally must remain preferable given all
the relevant alternatives collectively, as illustrated by, e.g., a case in which the relevant
alternatives are given by the circumstances of the decision problem:
() If your fellow prisoner defects, you should defect. (if X)(A)
If your fellow prisoner cooperates, you should defect. (if Y)(A)
So, whatever your fellow prisoner does, you should defect. (if X ∨ Y)(A)
8 Inspired by discussion in Lassiter () of Kratzer’s treatment of the better possibility than relation. A
difference worth noting is that Lassiter is not worried about inferences involving modals and conditionals in
the object language. He is rather worried about the fact that, regardless of whether better has a preferential
or probabilistic meaning, Kratzer’s treatment of the metalinguistic better possibility than relation validates
the inference from (i) A is a better possibility than B, (ii) A is a better possibility than C, to (iii) A is a better
possibility than B or C.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
Signpost. The natural spin on the cases we considered in this section is that they
motivate certain emendations to the Standard Semantics. More specifically, they seem
9 The relationship between Statewise Dominance and the Sure Thing Principle is in fact complicated
(as is the matter of their relationships to the examples mentioned here). Since none of the points I make will
turn on the fact that it is the Sure Thing Principle, as opposed to Statewise Dominance (or indeed another
dominance principle entirely), that is encoded within a semantics for DNMs, I will tend to speak loosely
about these relationships in the main text.
10 Lassiter () draws the lesson that epistemic and deontic should probably do not share a common
semantic core (and that these modals do not ultimately have a quantificational semantics). While I am
sympathetic to the second part of his conclusion, it is several steps removed from the bare data we are
considering here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Uncertainty
A key opportunity for refinement explored in the literature on DNMs concerns the
semantic role of uncertainty. Specifically, the Standard Semantics has a hard time
accommodating the manifest importance of uncertainty in determining what we
should do. While some—Dowell (forthcoming); von Fintel (), not to mention
Kratzer ()—claim the Standard Semantics can accommodate uncertainty as-is,
it is commonly accepted that some sort of refinement is needed (as I’ll explain).
A number of people have attempted to provide one (see Carr ; Cariani et al. ;
Charlow b; Silk ). Here I will review the major issues and build a case for, in
Kratzer’s phrase, “packing in” some of the fruits of decision theory into the semantics
of DNMs.
. MaxiMax
Here’s a simple schematic case. Possibilities where g is realized are preferred to
possibilities in which g is realized, which are in turn preferred to possibilities in which
g is realized:
g ≺ g ≺ g
p ¬p
A g g
B g g
In short, you prefer A-ing if p to B-ing, which you in turn prefer to A-ing if ¬p. Now:
what should you do in this sort of decision problem? We cannot, it seems, answer this
question without further information regarding, at least, the likelihood of p (as well
as information about how preferable g is to g and how preferable g is to g ). Absent
such information, what you should do is indeterminate.
The Standard Semantics—on a perhaps naïve reading (see §.)—seems, wrongly,
to say that the answer is determinate. Possibilities in which g is realized are strictly
preferred to possibilities where it is not. The best possibilities (according to the relevant
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
ordering ≺) are thus all possibilities where g is realized. But all the possibilities
where g is realized are possibilities where you A. And so all the best possibilities
are possibilities where you A: A ≺ B. As Carr puts it, the Standard Semantics seems
to encode the decision rule MaxiMax. “This is a straightforward consequence of the
more basic commitment...that we should always simply bring about the best possible
outcome in the modal background” (Carr , p. ). If we had to pick any decision
rule for our semantics for DNMs to encode, MaxiMax would be far down the list.
. Decision-theoretic quietism
I bring up this sort of case to introduce a possible response on behalf of the Standard
Semantics before approaching a more well-known case. The response is familiar from
the decision-theoretic dialectic surrounding, e.g., the Ellsberg Paradox: the decision
table—more precisely, its preferential dimension—is said to be under-described. In the
Ellsberg Paradox, the charge is that the payoffs explicitly represented in the decision
problem ignore the role of risk-aversion in informing the preferences of apparently
rational agents. Analogously, in this case, the charge might be that the relevant payoffs
should be enriched. What is (dis-)valuable outstrips what can be represented with
payoffs g –g . Also valuable is choosing an action that maximizes expected value (or a
suitable stand-in for expected value). More generally, agents manifestly value choices
that meet procedural conditions on how such choices are made (call such preferences
rational preferences). Rational preferences manifestly bear on the desirability of actions
that implement them (or fail to).11
Let’s take this a bit further. In the schematic case in §., it is natural to ascribe first-
order preferences—preferences that supervene on your preferences among outcomes—
of broad shape:
(A ∧ p) ≺FO B ≺FO (A ∧ ¬p)
Nevertheless, it is still clearly possible (for, e.g., an expected-utility maximizer) to have
considered preferences among actions of shape:
B ≺ATC A
Given Definition (and, indeed, any reasonable understanding of the properties of
≺) necessarily ≺FO = ≺ATC . So long, then, as we allow ≺ATC , rather than ≺FO , to
determine the domain of quantification for the DNM, we can avoid the problem posed
above.
This reply is decision-theoretically quietist, in the following sense. Contra what
was suggested in §., MaxiMax is in fact rejected as a “theorem” of the semantics.12
However, no replacement decision theory is encoded in its stead. Indeed, it would seem
11 Such an account is presented approvingly in an unpublished reply by Kratzer to Cariani et al. (),
and echoed in von Fintel (). For criticisms similar to the one developed here, see Carr (); Silk ().
12 Compare Cariani (): “A semantic theory might select the best worlds out of an ordering, but [this]
does not mean that it implements MaxiMax.”
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
that effectively any recipe for transforming ≺FO into ≺ATC must be invisible to the
compositional semantics of DNMs.
Kratzer’s unpublished argument for this sort of invisibility is a question whose
answer is intended as obvious: “Why pack information about rational decision making
into the meaning of modals?” A perhaps more perspicacious version of this question
would be as follows: why pack any more information about rational decision making
into the meaning of modals than we have to in order to make our semantics mini-
mally predictive? After all, Kratzer’s stipulations about the ordering encode certain
“axioms” of rational decision-making, including:
• Kratzer Dominance (p ≺ q ∧ p ≺ r) ⇒ p ≺ (q ∨ r)
• Acyclicity of ≺ p ≺ . . . ≺ p n ⇒ p n ≺ p
• Independence of Irrelevant Alternatives (see Charlow b):
(w ∈ min (R) ∧ w ∈ R ∧ R ⊆ R) ⇒ w ∈ min (R )
Any and all object-language expressions of these features of the preference ordering
are thus treated as theorems within Kratzer’s semantics. These, however, are intuitively
“good” things to treat as theorems within the relevant formal system (whereas it would
be intuitively bad to render, e.g., a particular decision rule such as Expected Utility
Maximization or MaxiMin a theorem).
Why pack any more into the compositional semantics of DNMs than we have to
in order to generate the sorts of theorems we regard as desirable? I will refer to the
proponents of the “obvious” answer to this rhetorical question as Decision-Theoretic
Quietists. In the next section I will apply Quietism to a well-known puzzle case in the
semantics of DNMs.
in_A in_B
block_A All live All die
block_B All die All live
¬(block_A ∨ block_B) Nine live Nine live
This is actually an elaboration of the sort of case described in the previous section.
Your first-order preferences among outcomes look like this:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
Now, what should we do? Absent information about the probability, e.g., that the
miners are in A, what we should do is indeterminate (or, perhaps, it is determinate
that we should block neither). But evidently the best possibilities are all possibilities
where all miners live. So the best possibilities are all possibilities where we block A or
block B. Apparently, then, according to the Standard Account, the following is true:
() We should either block A or block B. (block_A ∨ block_B)
Not only does this sound false (or at least not determinately true); it seems that, on
certain assignments of probabilities to the claim that the miners are in A (e.g., %),
the following is instead true:
() We should do nothing. ¬(block_A ∨ block_B)
As we discussed above, a fan of the Standard Account might reply: there are things,
other than the lives of the miners, but which we manifestly do care about (e.g., dealing
with uncertainty, managing risk), that the decision table fails to represent. In this vein,
von Fintel (), addressing cases specifically in which we judge () true, writes: “It
is not obvious that [a world where all ten miners are saved] is a better world than one
where nine miners survive because we choose not to run the risk.” Thus our considered
preferences are, in fact, of the following shape:
This, the claim goes, underwrites our judgment of the truth of () and the falsity of ().
Impressively, this sort of thought can be extended to explain why we tend
(“instantly,” in the phrase of von Fintel ()) to judge the following conditionals
true:
() If the miners are in A, we should block A. (if in_A)(block_A)
() If the miners are in B, we should block B. (if in_B)(block_B)
All we need do is combine it with the further thought—for which there is an ever-
expanding body of evidence (see, e.g., Yalcin ; Gillies ; Charlow a, b)—
that the if-clause of an indicative conditional functions, in part, to shift the information
relevant for evaluating information-sensitive expressions (such as DNMs). This allows
us to exploit this claim of Kolodny and MacFarlane ():
A world may be more ideal than another relative to one information state and less ideal than it
relative to another. For example, a world in which both shafts are left open may be more ideal
than one in which shaft A is closed relative to a less informed state, but less ideal relative to a
more informed state. (p. )
Let ≺ATC[φ] give our considered preferences given a hypothetical update with infor-
mation φ—i.e., the sort of update induced by the if-clause of an indicative conditional
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
of the form (if φ)(ψ). If we take on Kolodny and MacFarlane’s quoted claim, we have
a rationale for endorsing the following:
block_x ≺ATC[in_x] ¬(block_A ∨ block_B) ≺ATC[in_x] block_y
Bracketing details, it isn’t hard to use this sort of claim about the structure of the sup-
positional or hypothetical preference order to underwrite a semantics for conditional
DNMs on which () and () come out true, even while () comes out false (relevant
formal tools can be found in Charlow b; Cariani et al. ; Silk ).13
. Decision theory and truth-conditions
Here we have a Quietist account of the Miners Puzzle that seems, at a first pass,
adequate. Kratzer’s question presents itself with even greater force: “Why pack infor-
mation about rational decision making into the meaning of modals?” Here is a
response (also pursued in Charlow b).
A preliminary answer: in fact, it seems that a key motive for introducing orderings
into the semantics for DNMs is to recapitulate, if partially, the structure of practical
reasoning. Practical reasoning takes place against preferences over outcomes as well
as information about outcome-relevant circumstances. This explains, in part, why the
semantics of DNMs must (and does) distinguish between something analogous to
a preference-state and something analogous to an information-state. Note, further,
the fundamental theoretical purpose of a normative decision theory to bridge the
gap between, on the one hand, descriptions of the agent’s preferences over outcomes
(and the agent’s information about outcome-relevant circumstances) and, on the other
hand, an account of what that agent should decide, intend, do. (Important note
to the reader: by “decision theory” I will mean nothing more than something that
determines a choice function. A choice function is an object that, given a specification
of preferences over outcomes and information about outcome-relevant circumstances
as input, generates a set of permissible actions as its output.)
If the recapitulation claim is right, there is reason for the semantics of DNMs—
claims which serve to express verdicts about what agents should do—to represent
the normative role of a decision theory. Note that, in a non-semantic context, it
would be something like a category error to build the decision theorist’s conception
of the preferred way to bridge the gap between preference/information and action
into the description of what outcomes are preferable for the agent. Further, it is clear
that verdicts of what an agent should do, in light of a description of what outcomes
13 Kolodny and MacFarlane () argue that this means giving up modus ponens, since (in_A ∨ in_B)
together with () and () apparently entail (). But this view—note I include my concurrence in Charlow
(b)—is mistaken. As Willer () notes, if our notion of entailment is dynamic in nature—roughly,
φ , . . . , φn |Dyn ψ iff the result of updating an information-state on φ , . . . , φn yields an information-
state that accepts ψ—this does not follow. Given one very natural treatment of the conditional, on which a
state σ accepts (if φ)(ψ) iff the result of updating σ with φ accepts ψ—modus ponens is dynamically valid—
(if φ)(ψ), φ |Dyn ψ. But constructive dilemma is clearly not—(if φ)(χ ), (if ψ)(ρ), (φ ∨ψ) |Dyn (χ ∨ρ).
For some discussion, see Yalcin (b, n. ); Charlow (a, §.).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
are preferable for the agent (and the information to which she has access), must be
generated by appeal to some decision theory or other. What goes for decision theory
goes, then, for the theory of what the agent should do: take care to distinguish between
first-order and rational preferences. There is, I would think, strong pressure to say that
what goes for the theory of what the agent should do goes, in turn, for a semantic
theory for sentences that conventionally serve to express views about the verdicts of
the theory of what the agent should do. Regardless, then, of whether we are doing
decision theory or theorizing about the meaning of modals that express the considered
verdicts of a decision theory, it seems we should take care to avoid building rational
preferences into our description of first-order preferences. But it seems like this is
exactly the conflation Kratzer is encouraging us to make.
To put it differently, consider a decision problem for an agent with preferences
P , information I (which also specifies a range of available actions for the agent,
A = X , ..., Xn ); we will represent the decision problem as an ordered triple of the
form = P , I , A . Let D be the selected decision theory,14 and let CHD be the
choice function determined by D from decision problems into sets of D-admissible
actions in a decision problem. Now consider:
The Master Argument for “Packing In” Decision Theory
i. X (given ) iff X ∈ CH D (P , I , A )
ii. “X” is true at iff X (given )
iii. X (given ) iff ¬¬X (given )
iv. So “X” is true at iff ¬X ∈/ CHD (P , I , A )
Because which actions are permissible within a decision problem is in part a function
of a choice of decision theory (i), by an application of the T-Schema15 (ii) and the
metalinguistic duality of deontic necessity and possibility (iii), the truth-condition of
a sentence describing what an agent should do in a decision problem is also such a
function (iv).16
14 D can be any decision theory whatever—any recipe from transforming preferences over outcomes
and information into a choice function, including recipes that, e.g., treat information as irrelevant. D need
not be an expected utility calculus, or indeed something borrowed from the decision-theoretic literature
at all.
15 Instances of (ii) are not, strictly, instances of the T-Schema, which, as Fabrizio Cariani has pointed
out to me, concerns the application of a monadic, not relational, truth-predicate. Still: schema (ii) is correct
and clearly related to the T-Schema (and seems to me to be endorsed by Tarski himself when he discusses
monadic truth as a special case of relational satisfaction). Consider an index-sensitive truth-condition for
the sentence grass is green. This sentence is true at w iff [λv.grass is green at v](w) = iff grass is green at w.
Generalizing, φ is true at some sequence of indices S iff [λS .φ(S )](S) = iff φ(S).
16 I concede, of course, that work needs to be done to fill out this basic picture. Here I will mention two
outstanding issues. First, if it turns out that which things are chairs depends on the distribution of a certain
kind of fundamental physical particle P, this argument could thereby be used to show the need to represent P
in the semantics of a sentence such as that’s a chair. That would be a reductio of this argument. (I regret that I
cannot recall the name of the person who raised this worry to me.) Avoiding this sort of overgeneration is not
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Two related lessons can be drawn from this. First, if a kind of parameter—more
precisely, a kind of distinction between parameters—is constitutively relevant to the
determination of what the agent should do, it is thereby semantically relevant to the
truth-condition of a sentence describing what she should do. Second, a semantics that
refuses to distinguish between first-order and rational preferences cannot represent
the distinct contribution of first-order and rational preferences to the truth-condition
of a DNM—a state of affairs that is at once both theoretically embarrassing and liable
to lead to empirical difficulties. (I will outline an empirical difficulty below.)
This is the case for “pack[ing] information about rational decision making” into the
meanings of modals—at least into the meanings of modals that express verdicts about
rational decision making. Quoting myself, cases like the Miners (and the schematic
case on which it is based) . . .
. . . recommend the introduction of some sort of independent theoretical apparatus taking some
specification of (i) substantive, first-order ends and (ii) an information-state as its input, and
generating a modified ordering on possibilities as output. We could, of course, conflate what
is “best” or “ideal” with what is minimal with respect to the modified ordering [i.e. ≺ATC ].
Semantically, this would amount to restyling the deontic selection [min] as selecting, not worlds
that are best sans phrase [i.e. relative to ≺FO ], but with respect to some mixture of [first-order
and rational preferences] . . . [But we] should take care to avoid running together theoretical
notions (and pieces of theoretical apparatus) that, like actual and expected utility, a theory ought
not to run together. (Charlow b, pp. ff.)
To be clear, I am not arguing against allowing ≺ATC , rather than ≺FO , to determine
the DNM’s domain of quantification. I am arguing against a decision-theoretically
quietest semantics—one that refuses to represent anything more than ≺ATC in the
truth-condition for the DNM. Even if we allow ≺ATC to determine the domain of
quantification for the DNM, we must also represent the distinct contributions of (at
least) first-order preferences and rational preferences in determining ≺ATC . A story
that fails to represent the distinctive roles of first-order preferences and preferences
concerning ways of reasoning about what to do in the truth-conditions of claims
invoking DNMs cannot be the whole story about their meanings.
so hard, if we understand the premises of the argument as involving special equivalencies—e.g., conceptual
or analytic—that do not hold between chairs (and claims about chairs) and claims about Ps.
Second, CHD (P , I , A ) must be closed under conditions that are appropriate given the logic of DMs.
For instance, if X entails Y, then for any CHD and such that X ∈ CHD (P , I , A ), it must be that
Y ∈ CHD (P , I , A ). Giving a precise description of the operations under which CHD (P , I , A ) is
closed requires settling quite a lot of disputed issues in the logic of DMs—e.g., whether permission itself is
closed under arbitrary disjunction (i.e., the problem of Free Choice Permission)—and I will avoid it here.
I will note that imposing closure conditions on CHD (P , I , A ) will make it less natural to say that the
elements of CHD (P , I , A ) are actions—rather, they will be constructs out of the range of permissible
actions delivered directly by the relevant decision theory, often, but not always, corresponding themselves
to actions. In no way does this vitiate the argument of this section: a DM expresses a judgment about what
should be done, which in turn depends on which actions are permissible in light of the relevant decision
theory.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
17 To be clear: I do not really mean to assume that the preference-structure relevant for interpreting a
DNM is supplied by the agent who utters the DNM (hence the scare quotes around “in the mouth of ”).
Here I intend something much weaker: that when we (qua semantic theorists) evaluate DNMs relative to
orderings that are supplied by Fetishistic and ordinary preference-structures, we should be able to identify
a semantic distinction between them.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
one influential story about the semantics of strong DNMs (e.g., must) (see von Fintel
and Iatridou ),18 a claim of roughly the form in (). . .
() To implement the basic preferences, we must block neither shaft.
. . .is therefore true relative to a Rationally Fetishistic preference-structure. For the
ordinary agent, however, there are two preference-structures in play: first-order and
decision-theoretic (the former valuing things like miners’ lives, the latter perhaps
valuing things like risk-sensitivity in pursuit of the former). It is not a necessary
condition on implementing the “primary” preference-structure that neither shaft be
blocked (in fact, as we’ve seen, it is a necessary condition on implementing the
“primary” preference-structure that at least one shaft be blocked). Blocking neither
shaft is, however, the best way of implementing the primary preference-structure,
given risk-averse rational preferences. So, applying the von Fintel and Iatridou ()
account of the semantics of weak DNMs (e.g. should), a claim of roughly the form in
() is therefore true when evaluated relative to an ordinary preference-structure.
() To implement the basic preferences, we should block neither shaft.
However, a claim of the form in () is judged (appropriately, in my view) false.19
There is, then, a fairly clear semantic difference between DNMs “in the mouths
of ” Rational Fetishists and those “in the months of ” ordinary agents. To explain
this, it seems we should augment the Standard Semantics with some of the fruits
of normative decision theory. For instance, in Charlow (b), to account for our
ordinary judgments about the Miners scenario, I suggested making use of a truth-
condition inspired by the truth-condition in Q, repeated here:
18 Note: none of what I say here requires taking on the treatment of the strong/weak distinction in
von Fintel and Iatridou (), to which I will object below. My view is that on any adequate treatment
of the strong/weak distinction, () should come out true relative to a Rationally Fetishistic preference-
structure, false relative to an ordinary preference-structure. I will not attempt to defend any view of the
strong/weak distinction here.
19 For more on the importance of the strong/weak distinction in the Miners scenario, see Charlow
(b); Silk ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
making use of different decision theories, could easily be plugged into this basic
apparatus.
Signpost. I have argued that a semantics for DNMs should have, in the first instance,
the following sort of shape: X is true (given according to D) just if ¬X is
impermissible in by lights of D:
/ CHD (P , I , A )
X is true (given ) iff ¬X ∈
The truth-condition of a DNM will reference a choice function CHD , supplied by
the relevant decision theory D, and a decision problem = P , I , A , given
presumably at a context.
Before moving on, I wish to emphasize, briefly, that this in no sense involves a
departure from the suggested lesson of §. Rather, for all we have seen, it reinforces it.
The Standard Semantics is too coarse-grained to represent phenomena of normative
significance which seem to bear clearly on the truth-conditions of DNMs. The way
to fix this is to refine it—in particular, by augmenting it with the fruits of decision
theory.20
Nor does it necessarily involve any departure from the idea that DNMs have quan-
tificational truth-conditions (cf. §). We are free to pursue an equivalence between a
quantificational truth-condition and a decision-theoretic truth-condition of the form
given here (analogous to the equivalence demonstrated in §).21 (While I am doubtful
we can retain anything resembling Kratzer’s treatment of comparative betterness, as
encoded in Definitions and , the assumptions that endow these definitions with
predictive power are—as I explain in §.; see also Lassiter ()—independently
problematic. It is, in my view, a virtue of the account I am suggesting here that we
can move beyond these definitions. The account I advocate achieves predictiveness
without taking on problematic stipulations about the properties of the relevant order-
ing.)
Here we have what I take to be an intuitive, well-supported, and altogether fairly
conservative picture of how theorizing about the semantics of DNMs should proceed.
There are, however, deep difficulties in implementing it. I describe these in the next
section.
20 As I emphasized in Charlow (b), the representation of decision theories in the semantics of weak
DNMs is motivated by the same sorts of considerations that motivate the von Fintel and Iatridou analysis
of weak DNMs (§.). For this reason it is puzzling that von Fintel () portrays the kind of analysis
advocated in this section as a “challenge” to the classical semantics for DNMs (since he does not portray
his own understanding of weak DNMs similarly). The view I have defended does rely on a challenge to the
classical (Kratzer) semantics for indicative conditionals, but these are very different things.
21 How would that work? A choice function CH , when applied to a decision problem , selects a
D
set of alternatives that are permissible according to D. The “disjunction” of these alternatives CHD ()
represents something like the “sphere of permissibility” (compare Lewis b) for the agent in , which
is formally akin to an ordinary domain of quantification, and which can therefore be linked to claims of
comparative betterness using the sort of strategy pursued in §.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Packing It In
The difficulties begin with the observation that this schematic sort of truth-condition
cannot be, strictly speaking, correct. It ignores a range of data suggesting that DNMs
have context- and index-sensitive truth-conditions. Here I will connect this rather
prosaic fact about DNMs to Carr’s (, ) (in my view successful) objection that
all semantics for DNMs in the mold I have described objectionably write normative
assumptions into the semantics.22 The scope of Carr’s objection is, I shall argue, rather
greater than you might expect. I shall argue for this by attempting to describe a
proposal broadly along the lines she suggests (on which DNMs are parametrized, not
only to decision problems, but to decision theories). It is fairly clear that taking this sort
of proposal seriously will mean excising all “substantive” normative assumptions—
basically, I will argue, all the normative assumptions—from the semantics. This is a
delicate matter, since it is, I’ll argue, prima facie incompatible with the sort of research
program sketched in the prior sections.
. Disagreement
It is a truism that agents with different preferences and priorities will tend toward
disagreement about what is to be done. It is also a truism that such agents are generally
linguistically competent with respect to DNMs and the broadly normative concepts
they function to express (note: I will henceforth refer to this mixture of linguistic and
conceptual competence simply as competence with respect to DNMs). While it would
be too strong to say that it is the job of the semantic theorist to account for this sort of
disagreement—here I disagree with, e.g., MacFarlane () for reasons broadly like
those given by Plunkett and Sundell ()—it is at least incumbent on the semantic
theorist not to state a theory which renders the disagreement unintelligible (compare
Gibbard , chapter ).
Consider the Newcomb Problem. For the uninitiated, a brief description. There are
two boxes, A and B. You get to keep whatever is inside any box you open. There is a
powerful Predictor, which makes extremely reliable predictions about your actions,
which at the time of your decision has already acted thus:
i. It has put $, in box A.
ii. If it has predicted you will open just B, it has in addition put $,, in B.
iii. If it has predicted you will open both boxes, it has put nothing in box B.
22 I take myself here to be fleshing out an objection belonging to Carr (, ); this is also part of
the project of Cariani (), as I note at many points below. Carr’s main worry about writing a normative
assumption N into the semantics of DNMs is that it predicts a kind of linguistic incompetence in those who
use DNMs to express normative views distinct from N. (Intriguingly, she connects this to G. E. Moore’s
Open Question Argument against naturalistic analyses of “good”.) I quite agree with her. I only claim that
there is a lot worth saying about the issue beyond what Carr has said about it. I will try to say a small portion
of it here. I’ve no reason for thinking she would disagree with any of it.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
At the time of your decision, you know the Predictor has already acted thus. You are
given a choice between two actions: opening both boxes (two-boxing), or opening just
box B (one-boxing). The decision problem is given by roughly the following table:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
23 For differing views of how this “binding” happens, see von Fintel and Iatridou () and Charlow
(a). In the latter, I argue that the sort of doubly modalized account of these sorts of conditionals sketched
in von Fintel and Iatridou () cannot give a fully general account of the semantics of these conditionals,
roughly because it does not take the notion that this is a form of binding seriously enough. In a similar vein,
Yalcin (a); Cariani () cite the ability of probabilistic claims in antecedents to affect the interpretation
of modals in their consequents to motivate a treatment of probabilistic semantic parameters akin to the
treatment of the decision-theoretic semantic parameter for which I argue here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
24 Similarly, Cariani () argues against a “Fully Bayesian” attempt to write a decision theory into
the semantics on DNMs (on which, roughly, X says the expected value of X is sufficiently high), by
noting that a Fully Bayesian semantics for DNMs cannot intelligibly represent the disagreement between
someone whose choices are characterized by a risk-averse choice function and someone whose choices are
characterized by a Fully Bayesian choice function. (Cariani additionally offers a convincing response to any
attempt to represent the view of the risk-averse agent as Fully Bayesian.)
25 The positive proposal of Cariani () is an interesting direct challenge to this claim, since it does
not involve the use of any decision-theory parameter, but still claims to achieve the right kind of decision-
theoretic neutrality. Apart from the sort of neutrality Cariani is interested in, however—which is in his essay
limited to being able to represent attitudes of and disagreements between agents with different attitudes
toward risk—there is still the issue of accounting for the meaning of ()–(). On this matter: I think there is
likely a tension between Cariani’s use of a binding-type argument to motivate parametrization to probability
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
functions (see fn. ) and his decision not to parametrize to choice functions (since similar arguments can
be mustered for both).
26 One may object to the apparent assumption that the decision theory “of ” the speaker determines the
decision-theoretic parameter against which the speaker’s assertion of () is evaluated for truth. But there
is still the problem that evaluating what the EDT-er asserts, relative to CHEDT , yields a verdict of truth.
To avoid the conclusion that the EDT-er speaks truly, at, we need to prohibit evaluation of what the EDT-
er asserts relative to CHEDT . It is unclear on what grounds such a prohibition might rest. Phrased this
way, I suspect this is simply a vivid instance of a more general difficulty with model-theoretic semantics—
its difficulty deriving truth-conditions for false sentences on which they can be seen to come out false. As
Lepore (, p. ) puts it, “[F]rom a relativized truth-theory we cannot derive an absolute truth-theory.”
I do not want to be construed as endorsing this critique generally, but I think it is powerful in the case of
DNMs.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
No correct view? Perhaps there is no fact of the matter about what the correct deci-
sion theory is (perhaps owing to a meta-normative Error Theory, à la Mackie ).27
If this is right, then the supposition that CDT is correct is in some sense an ineligible
hypothetical.
This would come as a surprise to partisans of the CDT/EDT debate. It seems to
have the further consequence that commitments to and assertions of DNMs that are,
on the face of it, premised on an assumption of the correctness of either CDT or EDT—
() and ()—cannot be rationally maintained. I take this provisionally as a reductio
of the view under consideration. Whatever the meta-normative facts turn out to be,
the possibility of rational commitment to the correctness of a normative view (and
the ability to rationally assert this commitment) should not be relinquished.28 At the
least, the notion that recherché metasemantic considerations such as those in play here
would prompt us to relinquish it should not, absent much further argument, be taken
seriously.
Ecumenism? Perhaps, despite appearances, both EDT-ers and CDT-ers speak cor-
rectly (by which I mean something more than that they appropriately deploy their
competence with respect to DNMs). There are two main ways of making good on this
thought: Contextualism and Relativism.29
According to Contextualism, the EDT-er who asserts () makes a claim whose
content is glossed roughly as the choice function of EDT selects one-boxing, while the
CDT-er who asserts () makes a claim whose content is glossed roughly as the choice
function of CDT selects two-boxing. But, while I acknowledge that there are theorists
who would disagree, I will suppose that this does not correctly report the content of
the CDT-er’s claim. This would render the content of the CDT-er’s assertion of ()
trivial (likewise for the EDT-er’s assertion). It would thus render the purpose of any
article in which a CDT-er undertook to defend the correctness of such an assertion
27 This sort of response works better in cases involving disagreements with epistemically modal claims.
There is no fact of the matter about what the correct domain of quantification for epistemic modals is: there
is nothing over and above different bodies of evidence from whose vantage different epistemically modal
claims can be made. (A relatively more informative body of evidence is in some sense superior to a relatively
less informative body of evidence, but taking this idea seriously in a metasemantics for epistemic modals will
ultimately require us to shrink domains of quantification to maximally informative domains, i.e., singleton
sets of possibilities.) Similar remarks could be made for other sources of variability in epistemic claims. For
instance, assuming that knowledge-attributions are sensitive to stakes or to standards, most epistemologists
would agree that it is nonsensical to ask which stakes or standards are correct. The most we can say as theorists
is to describe which certain stakes or standards are operative in a context, by way of explaining the judgments
of truth and falsity that speakers who are competent with the relevant epistemic language tend to make.
(The skeptic, of course, disagrees with this general assessment, but this has earned the skeptic a fair share
of opprobrium in the contemporary epistemological literature on skepticism.) For reasons that will emerge
shortly, this is also why I think Contextualism about epistemic language remains viable in the face of this
sort of worry, while Contextualism about deontic modals does not.
28 For a defense of such commitments (within a broader program of anti-Realism about the normative)
that I take to be basically sound, see Blackburn (, ); Gibbard (, ).
29 There are, of course, many more than two ways, but I must confine my attention here to the main
contenders.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
mysterious. This is not exactly a palatable way to regard the work of your philosophical
colleagues.30
According to Relativism, however, the EDT-er who asserts () makes a claim
whose propositional content is glossed roughly as the correct choice function selects
one-boxing.31 This seems like exactly the right thing to say. For the Relativist, the
truth of this content is, however, said to be perspective-dependent. In particular, it is
dependent on a choice of choice function. The claim the correct choice function selects
one-boxing is true relative to CHEDT , false relative to CHCDT . Thus both the EDT-er
and the CDT-er speak correctly, since both assert contents that are true relative to the
choice functions supplied by their respective perspectives. So Relativism is a form of
Ecumenism.
Relativist Ecumenism seems to offer the best hope of a propositional semantics for
DNMs that is responsive to the worry developed in this section. In the next section,
I will explain why it is not a very good hope and go on to explain the implications of
this for theorizing about DNMs.
Metasemantics
In this last section, I will first explain my dissatisfaction with propositional semantics
for DNMs. I will suggest a semantics—really, a metasemantics—that, rather than
attempting to deliver “truth-conditions” for DNMs, simply attempts to model linguistic
competence with respect to DNMs: to represent (very roughly) what state of mind
an agent is in when she judges a DNM to be the case. Because such competence
can co-exist with effectively any normative view, the core semantics for DNMs must
be extremely minimal: the compositional semantics of a DNM of the form φ
is nothing more than a function from sequences of parameters P , . . . , Pn into a
30 One suspects Plunkett and Sundell () will deploy an alternative way of rationalizing the
assertion—e.g., as an attempt to achieve a mutually acceptable conversational scoreboard (compare Lewis
c) on which CHCDT is mutually agreed to partly determine the content of assertions of DNMs, or as an
attempt to persuade one’s interlocutors to modify their own decision-theoretic views. (Their preferred term
here is “negotiation.”) I am, as a rule, basically happy with these sorts of pragmatic stories, but, for reasons
I will describe below, I think it makes better sense to embed them in an Expressivist (Charlow b, ;
Yalcin a; Rothschild ) or Constraint-Theoretic (Swanson forthcoming) metasemantics.
I note that the most convincing defenders of Contextualism for epistemic modals—von Fintel and
Gillies—choose as their ur-semantics for epistemic modals an apparently non-truth-conditional dynamic-
semantics borrowed from Veltman () (see, e.g., von Fintel and Gillies ). Insofar as Contextualism
about epistemic modals is committed to the doctrine that epistemic modals conventionally express propo-
sitions, their view is simply not a Contextualist view. However, insofar as Contextualism about epistemic
modals is not committed to the doctrine that epistemic modals conventionally express propositions, with all
of the dubious pragmatic commitments that are normally thought to come along with such a commitment—
e.g., that the essential effect of accepting an epistemic modal involves adding the proposition it expresses to
one’s beliefs (against this, see esp. Yalcin )—I would not object to being called a Contextualist.
31 As this suggests, I am understanding Relativism to encompass both MacFarlane-style Assessor
Relativism and various forms of Nonindexical Contextualism. The differences between these views do not,
I think, matter for my purposes here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
32 It will in §. become clear that, by propositional content, I mean a representational content. A repre-
sentational content is an entity that encodes what I term a locational perspective: a property that agents can
self-ascribe by way of self-locating in a space of centered possible worlds (cf. Lewis a). The state-type
of self-location is functionally distinguished from motivational state-types such as preference or desire, as
well as from broadly representational state-types that nevertheless cannot be propositionally individuated
(e.g., assigning p a credence of . conditional on q). There is, to be sure, a different sense of “proposition”—
namely, content that can “bear” truth-values, can serve as the denotation of a that-clause, and can (thereby?)
serve as the object of an attitude-type which we might neutrally term acceptance (as well as attitudes such as
doubt, uncertainty, etc.). Propositions in this latter sense potentially admit both representational and non-
representational varieties and might be understood as the semantic values of DNMs without running up
against anything I say here (see Schroeder ). Here I claim only that DNMs fail to encode perspectives
that help an agent to self-locate in a space of centered worlds.
33 Some readers may see a bit of slippage here. Note that I explicitly do not suggest that the right view of
the relation between compositional semantic value and communicated content is identity (compare Ninan
). I explicitly invoke determination in lieu of identity. More broadly, I take the view of the subject matter
of semantic theorizing I am articulating here to be basically consonant with Yalcin (); indeed, it draws
inspiration from Yalcin’s own work on Expressivism.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
If I am right that DNMs lack truth-conditions in the relevant sense, it is fairly clear
that the standard Stalnakerian pragmatics, on which assertions of sentences propose
their truth-conditions for addition to the Common Ground or some other body of
information, is not the right pragmatics for DNMs. In its place I briefly describe an
Expressivistic understanding of pragmatics, on which (i) asserting a DNM expresses
that DNM’s satisfaction-condition (by which I will not mean that it asserts that the
satisfaction-condition is met), (ii) the perlocutionary force of expressing a satisfaction-
condition is to propose that one’s addressees adjust their cognitive state so that the
satisfaction-condition of the DNM is met.
. Against propositions
I have no decisive objection to Relativism to air. But I do have an objection, which
I will describe briefly—the purpose being less to convince than to cajole.34 The claim
that there is a Relativistic propositional content in play for DNMs is ill-motivated. Such
contents are not really fit to play a good number of the theoretical roles we normally
require propositional contents to play.
• It is really not the right sort of thing to be asserted in cases where disagreement is
known to obtain (since it is known to exclude the perspective of one’s addressee,
and so simply cannot be consistently accepted by the addressee) (compare Egan
).
• Nor is it the right sort of thing to ground an account of why two agents who assert
incompatible perspective-dependent claims disagree (Dreier ).
• Nor, most significantly, is it the right sort of thing to be believed. At least the
standard account of what it is to believe a perspective-dependent proposition
(Lewis a) does not seem to apply. On the face of things, an agent who accepts
() is not locating herself in one region of perspectival space as opposed to
another—the region of perspectival space occupied by fans of EDT. An agent
who accepts () is just an agent who has this kind of perspective, not one
who self-ascribes the property of having this kind of perspective.35 Similarly, an
agent who accepts a conditionalized DNM such as () is not self-ascribing a
34 I develop the sort of view I describe here further in Charlow (a, b, ). See also Yalcin
(a); Rothschild (); Moss (); Swanson (forthcoming).
35 Compare Yalcin () on the attitude constitutively involved in acceptance of an epistemic modal.
Compare also Cariani (), who (in making a different point) notes that a sentence such as John thinks
you should refuse the pill “is not an ascription to John of the content that refusing the pill maximizes expected
value. Rather [it] is an ascription to John of a way of prioritizing alternatives relative to which refusing the
pill is the best option.” Egan () would likely disagree about whether Relativists can give a compelling
account of the state of mind of accepting a DNM along the lines suggested here. His case rests largely on
the claim that the de se-ified state of mind typically has the right functional profile—closely, but defeasibly,
connected to motivation. I cannot argue against the de se understanding of the relevant state of mind here—
although I do find it prima facie incredible. I can, however, do a bit of dividing and conquering. If you are (as
I am) inclined to a more robust form of motivational internalism—on which sincere and competent moral
judgments motivate, not defeasibly, but necessarily—you will not be happy with Egan’s account here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
conditional proposition (whatever this would amount to); rather, her acceptance
consists in a fact about her preferences under the indicative supposition that the
conditional’s antecedent is true (Charlow a).
Summing up, accommodating the facts about competence and binding behavior
for DNMs seemed to require parametrizing DNMs to a choice of decision theory.
But this in turn makes it hard to see how to give an (in the phrase of Lepore )
“absolute truth-theory” for claims such as () and (). Not that I think there is
anything wrong with relativized truth per se. But none of the various ways of making
sense of a decision-theory-relative (and, more generally, preference-relative) notion
of truth for DNMs seems palatable. Relativism offers the best hope of a propositional
semantics for DNMs that is responsive to this sort of worry. I would, however, prefer
to look elsewhere.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
36 For some initial attempts, see the citations above. Charlow (a) and Starr (forthcoming) argue
that the sort of reconceptualization of semantic theorizing being entertained here can be motivated by
independent considerations having to do with imperatives. I would also suggest, tentatively, that this is
the right sort of metasemantics for the Dynamic Semantic program (see, e.g., Kamp ; Groenendijk and
Stokhof ; Veltman ).
37 This turns out to be basically trivial for all the sentences ordinarily (and helpfully) thought to express
propositions, while less trivial, but extremely illuminating, for sentences—conditionals and the language
of subjective uncertainty are key cases—often thought to confound the ambitions of truth-conditional
semantics.
38 A deflationary truth-predicate is a natural option, but I am of the view that it should be resisted in
light of the “Hiyo” objection (Dreier ). Charlow (c) develops my own view.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
CHD (P , I , A ). To be frank, this does not really tell us anything very substantial
about the pragmatics of DNMs (indeed, many would regard it as a truism). But it
is a good start. For it allows us to formulate the following question: why do agents
express judgments of this type? More precisely, what, in general, is the illocutionary
and perlocutionary function of expressing a judgment of this type?
Different answers are possible here. Here is a sketch of the answer I favor (developed
in more detail in Charlow a, b, see also Starr ). Speakers propose
concrete updates of the relevant state—understandable variously as the Conversational
Scoreboard (Lewis c), the cognitive states of their addressees, or perhaps some-
thing else. (I will assume here that states are just Representors.) There are two broad
kinds of updates that speakers propose: additive updates and tests. Additive updates, if
accepted, strengthen the Representor’s beliefs or preferences; imperatives and ordinary
declaratives are plausibly thought to involve such updates. Tests query the Representor
for the property of accepting a sentence; passing the test means that the sentence is
accepted, failing means that it is rejected. Modals of various stripes have long been
thought to be prime candidates for test-expressing sentences in natural language.
(Veltman , e.g., claims the only appropriate reaction to a modal is generally to
agree—if the modal is acceptable relative to a representation of your state of mind—or
to disagree—if not.)
Expressing a test might seem a trivial sort of speech act, since it does not propose
a change to the relevant state of mind. Not so. It is not hard to see how expressing
a test that yields disagreement, in particular, might set the stage for negotiation and,
ultimately, coordination about the relevant cognitive feature.39 There is much more to
say here, but this is a perfectly good start.
This is roughly what Expressivists (Gibbard) and Quasi-Realists (Blackburn) have
tended to say about these sorts of actions. I’ve chosen the Expressivist label because the
story here is directly inspired by the view of pragmatics articulated by Gibbard (espe-
cially in Gibbard ), who in turn seems to be drawing on Lewisian Conventionalism
(on which expressing a meaning is principally a means for achieving coordination in
attitude).
. Minimalism
No semantics for any sentence should encode presuppositions that rule out the
possibility of someone’s being competent with respect to a sentence, but substantively
mistaken in their acceptance or endorsement of it. For a sentence φ having any
normative, or action-guiding, “content” as part of its conventional meaning—roughly,
for a sentence that encodes any kind of normative view—the proper way to handle
39 This language is similar to that invoked by Plunkett and Sundell (). However, if this is the story we
tell about the pragmatics of DNMs, I see no reason to sustain the idea that the compositional semantic value
of a DNM is a proposition. The proposition qua proposition seems to be doing no explanatory work in a
theory of this sort; in fact, it is hard to make sense of a theory of DNMs in which the proposition supposedly
expressed by a DNM is given a central theoretical role qua proposition.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
this content in the semantics seems to be (i) to parametrize φ’s semantic value to
the relevant kind of normative view, (ii) to adopt a metasemantics on which the
fundamental satisfaction relation of the theory models competent acceptance of φ.
I take this to yield a kind of semantic minimalism about, not only DNMs, but any
kind of language functioning, even if only in part, to encode a normative view. For
sentences of such type, the job of the semantics is to lay bare the manifold ways in
which a normative viewpoint can competently ground acceptance of such sentences.
Whenever we try to say something substantive about the “content” of sentences of
such type, we will be stymied by the diversity of normative views compatible with
competent acceptance of such sentences.
To illustrate, return to the examples of §. These cases were designed to illustrate the
possibility of refining the basic semantics for DNMs to allow it to represent normative
phenomena of apparent significance for the content of DNMs. Neither of the sorts of
refinements I mentioned there seems compatible with the claim with which I began
this section.
The first case involved von Fintel and Iatridou’s () suggestion that weak DNMs
are sensitive to primary and secondary goals, the latter of which further refine the
ordering established by one’s primary goals—an insight borrowed from the theory
of practical rationality. Question: can there be competent use of weak DNMs by, for
instance, Rational Fetishists—agents who systematically fail to distinguish between
first-order and rational preferences? What about psychological value monists—agents
for whom the choice-worthiness of anything is a function of how much of a certain
quality (e.g., pleasure) it possesses, hence for whom it makes no psychological sense to
distinguish primary and secondary goals? On the face of it, (i) the answer to both ques-
tions ought to be yes, (ii) the von Fintel and Iatridou () framework seems to say no.
Simplifying somewhat, the second case involved the suggestion that the semantics
for DNMs be sensitive to differences between the is likelier than relation and the is
better than relation. In particular, the former does not, but the latter does, validate
something related to the Sure Thing Principle. Question: can there be competent use of
DNMs by agents for whom the Sure Thing Principle (or a related dominance principle)
fails—whose preferences cannot be accurately represented with a relation of this sort?
On the face of it, yes: in, for instance, the Allais and Ellsberg Paradoxes, the preferences
of actual agents seem to violate the Sure Thing Principle.40 Such agents are nevertheless
competent with respect to DNMs.
More generally, for just about any rational preference axiom you can state, it is pos-
sible to imagine an agent whose preferences cannot be represented with a preference
40 The Allais and Ellsberg Paradoxes are famously in tension with the Sure Thing Principle, which is why
I mention them here. Ultimately this is incidental to the point I am making: for any plausible dominance
principle, should we encode it in the semantics for DNMs, there will be many cases in which an agent who is
semantically competent with respect to DNMs competently, if irrationally, accepts claims whose negations
are nevertheless theorems from the point of view of the semantics. Gibbard (: Ch. ) was particularly
prescient on these points.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
relation satisfying that axiom.41 It is, on the face of it, simply wrong to try to account for
the impeccability of inferences () and () by claiming that these are semantic validities
in a compositional semantic system in which the Sure Thing Principle (or related
dominance principle) is a Theorem.42 How to understand the impeccability of these
inferences is another matter. (The obvious answer: these inferences are impeccable
because they are acceptance-preserving for a rational agent whose preferences respect
the relevant axioms.) But the way that they are most naturally understood—and
as Kratzer implicitly would understand them, given her apparent codification of a
dominance principle in her definition of the lifted ordering—cannot be sustained on
reflection. Again, I am not trying to suggest that we cannot criticize agents whose
preferences violate apparently obvious preference axioms on grounds of irrationality.
I am only insisting that, when we do so, we do not prevent ourselves from regarding
them as substantively mistaken about DNMs which they competently accept.
The overall effect here is to circumscribe the range of phenomena for which the
compositional semantic theory for DNMs can hope to account. This ripples out to
any kind of language functioning, even if only in part, to encode a normative view.
Should we parametrize the semantic value of some sentence φ to the probability
functions of Bayesian epistemology? We should be wary of this, unless we are prepared
to rule out the possibility of an agent who can access that semantic value via her
competence with respect to φ but whose partial beliefs are not representable with a
real-valued probability function. (How to account for the meaning of probabilistic
language while remaining sensitive to this point is a vexed question indeed.) Or
consider your favorite update semantic theory, on which semantic values are functions
from input information-states to output information-states. Such functions often
write a substantive theory of rational attitude-revision into the semantic value of any
sentence.43 For the same reasons, I think theories like this need to be reconsidered.
I do not say abandoned. But their subject matters need rethinking. Similarly,
theories which obscure Gilbert Harman’s bright line between logical validities and
rationally obligatory inferences (see, e.g., Harman )—e.g., theories that would
41 Probably even the minimal notion of a Representor (as defined in Definition ) assumes too much,
in light of the examples of the Rational Fetishist and value monist. I will not attempt to deal with this here.
42 This works as a criticism of Kratzer’s system, as pointed out in a different manner by Lassiter ().
The normative assumptions that are ironically simply written into Kratzer’s definition of the ordering ≺ have
received insufficient attention. Recall that the ordering is stipulated to be Transitive; to satisfy Independence
of Irrelevant Alternatives (see Charlow b); to forbid cardinal comparisons in value (see Carr );
and to necessarily yield incomparabilities-in-desirability between possibilities when the sets of propositions
satisfied respectively by those possibilities cannot be ordered by ⊆. Any object-language expression of these
features of the preference ordering is thus treated as a theorem within Kratzer’s semantics. This is bad: all of
these features of the preference ordering can be (and have been) disputed without impugning one’s semantic
competence; the latter two, in fact, are roundly rejected by most decision theorists.
43 I criticize Starr (forthcoming) for this in Charlow (a, §..). The theory of Veltman ()
actually is scrupulous about avoiding this. Veltman suggests that when genuine attitude revision is called
for—in cases where updating would lead to the absurd information-state—the semantics should be silent
about how best to proceed.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
appeal to something like the Sure Thing Principle to validate ()—should get a clearer
view on their subject matters. Semantic theories, when working correctly, ultimately
yield a theory of the broad cognitive directive associated with a sentence (as in
§.). Normative epistemology (along with other normative theories) furnishes a
substantive theory of synchronic and diachronic compliance for cognitive directives.
It is imperative to keep these separate, to the extent that you can.
References
Blackburn, Simon () Spreading the Word. Oxford: Clarendon Press.
Blackburn, Simon () Ruling Passions. Oxford: Oxford University Press.
Cariani, Fabrizio () ‘Ought’ and resolution semantics. Noûs. , pp. –.
Cariani, Fabrizio () Deontic modals and probabilities: One theory to rule them all? In
Charlow, N. and Chrisman, M. (eds.) Deontic Modals. Oxford: Oxford University Press.
Cariani, Fabrizio, Kaufmann, M. and Kaufmann, S. () Deliberative modality under epis-
temic uncertainty. Linguistics and Philosophy. , pp. –.
Carr, Jennifer () Deontic modals without decision theory. In Chemla, E. Homer, V. and
Winterstein, G. (eds.) Proceedings of Sinn und Bedeutung , pp. –. ENS Paris.
Carr, Jennifer () Subjective Ought. Ergo. , pp. –.
Charlow, Nate () Imperative statics and dynamics. [Online] Available from: http://
natecharlow.com/work/imperative_statics_and_dynamics.pdf. [Accessed Nov. .]
Charlow, Nate (a) Conditional preferences and practical conditionals. Linguistics and
Philosophy. , pp. –.
Charlow, Nate (b) What we know and what to do. Synthese. , pp. –.
Charlow, Nate (a) Logic and semantics for imperatives. Journal of Philosophical Logic. ,
pp. –.
Charlow, Nate (b) The problem with the Frege-Geach problem. Philosophical Studies. ,
pp. –. doi:./s---.
Charlow, Nate () Prospects for an expressivist theory of meaning. Philosophers’ Imprint.
, pp. – [Online] Available from: http://hdl.handle.net//spo....
[Accessed Nov. .]
Dowell, J. L. () A flexible contextualist account of epistemic modals. Philosopher’s Imprint.
, pp. –. [Online] Available from: http://hdl.handle.net//spo....
[Accessed Nov. .]
Dowell, J. L. () Contextualist solutions to three puzzles about practical conditionals. In
Shafer-Landau, R. (ed.) Oxford Studies in Metaethics. Volume . Oxford: Oxford University
Press.
Dreier, James () Expressivist embeddings and minimalist truth. Philosophical Studies. ,
pp. –.
Dreier, James () Relativism (and expressivism) and the problem of disagreement. Philo-
sophical Perspectives. , pp. –.
Egan, Andy () Epistemic modals, relativism, and assertion. Philosophical Studies. ,
pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
nate charlow
Egan, Andy () Relativist dispositional theories of value. The Southern Journal of Philosophy.
, pp. –.
Fintel, Kai von () The best we can (expect to) get? Challenges to the classic semantics for
deontic modals. http://mit.edu/fintel/fintel--apa-ought.pdf. Ms. MIT.
Fintel, Kai von and Gillies A. S. () An opinionated guide to epistemic modality. In Gendler,
T. and Hawthorne, J. (eds.) Oxford Studies in Epistemology. Volume . Oxford: Oxford
University Press.
Fintel, Kai von and Iatridou, Sabine () What to do if you want to go to Harlem. Manuscript
MIT.
von Fintel, Kai and Iatridou, Sabine () How to say ought in foreign. In Guèron, J. and
Lecarme, J. (eds.) Time and Modality. Dordrecht: Springer.
Gibbard, Allan () Wise Choices, Apt Feelings. Cambridge, MA: Harvard University Press.
Gibbard, Allan () Thinking How to Live. Cambridge, MA: Harvard University Press.
Gibbard, Allan and Harper, W. L. () Counterfactuals and two kinds of expected utility. In
Harper, W., Stalnaker, R. and Pearce, G. (eds.) Conditionals, Belief, Decision, Chance and
Time. pp. –. Netherlands: Springer.
Gillies, Anthony S. () Iffiness. Semantics and Pragmatics. , pp. –.
Groenendijk, Jeroen and Stokhof, Martin () Dynamic predicate logic. Linguistics and
Philosophy. , pp. –.
Harman, Gilbert () Change in View: Principles of Reasoning. Cambridge, MA: MIT Press.
Hawthorne, John and Stanley, Jason () Knowledge and action. The Journal of Philosophy.
, pp. –.
Kamp, Hans () A theory of truth and semantic representation. In Groenendijk, J., Janssen, T.
and Stokhof, M. (eds.) Truth, Interpretation, and Information: Selected Papers from the Third
Amsterdam Colloquium. Amsterdam: Matematisch Centrum.
Kolodny, Niko and MacFarlane, John () Ifs and oughts. Journal of Philosophy. , pp. –.
Kratzer, Angelika. () What ‘must’ and ‘can’ must and can mean. Linguistics and Philosophy.
, pp. –.
Kratzer, Angelika () The notional category of modality. In Eikmeyer, H. and Rieser, H. (eds.)
Words, Worlds, and Contexts. pp. –. Berlin: De Gruyter.
Kratzer, Angelika () Modality. In von Stechow, A. and Wunderlich, D. (eds.) Semantics: An
International Handbook of Contemporary Research. pp. –. Berlin: De Gruyter.
Kratzer, Angelika () Modals and Conditionals. Oxford: Oxford University Press.
Lassiter, Daniel () Measurement and modality. Ph.D. dissertation. New York University.
Lepore, Ernest () What model theoretic semantics cannot do? Synthese. , pp. –.
Lewis, David () Counterfactuals. Malden: Blackwell.
Lewis, David () Semantic analyses for dyadic deontic logic. In Stenlund, S. (ed.) Logical
Theory and Semantic Analysis. Dordrecht: D. Reidel.
Lewis, David () Probabilities of conditionals and conditional probabilities. Philosophical
Review. , pp. –.
Lewis, David (a) Attitudes de dicto and de se. The Philosophical Review. , pp. –.
Lewis, David (b) A problem about permission. In Saarinen, E., Hilpinen, R., Niiniluoto, I.
and Hintikka, M. B. (eds.) Essays in Honour of Jaakko Hintikka. pp. –. Dordrecht: D.
Reidel.
Lewis, David (c) Scorekeeping in a language game. Journal of Philosophical Logic. ,
pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Linguistic and Philosophical
Considerations on Bayesian
Semantics
Daniel Lassiter
For the past several years I have found myself advocating what might be called a
“Bayesian” semantics for the deontic interpretations of modal expressions such as
ought, should, good, bad, may, must, and their modified forms (better, etc.). On the
version of this account that I have in mind, deontic modals are not quantifiers over
possible worlds; instead, they have a lexical semantics structured around degree scales,
just like gradable adjectives such as heavy/light, tall/short, and full/empty (Kennedy
and McNally, , etc.). Deontic scales are formally identical to the expected utility
scales used in Bayesian decision theory, but the function assigning values to states
is not necessarily interpreted as representing personal utility—it could be, as in the
case of teleological modals, but it could also represent other kinds of value, such
as moral value. I have argued that this approach accounts better than its main
rivals for a number of empirical phenomena including information-sensitivity, non-
monotonicity, and grammatical gradability (Lassiter, , c, ).1
In this chapter I will summarize several of the arguments for the Bayesian semantics
and discuss challenges that have been posed in four recent papers, two of which are
in this volume: Carr (); Charlow (); and Cariani (, ). The challenges
involve both empirical and high-level philosophical objections, and they have been
1 The Bayesian semantics made its first appearance (to my knowledge) in a brief paper by Jeffrey (a),
and was fleshed out and empirically motivated by Jackson (), formalized by Goble (), and integrated
into a compositional semantics for English, with special attention to the theory of gradability, by Lassiter
(, c, ). Related ideas can be found in Cariani (); Wedgwood ().
Note that the label “Bayesian” is not much used in these works, since the connection with Bayesian
decision theory is mostly formal in nature. But the term is used in subsequent reactions (e.g., Yalcin, a;
Cariani, ), and I find that it is not inappropriate as long as it is clear that this semantic theory does not
stand or fall with the empirical adequacy of a Bayesian theory of human decision-making. At most, there is a
philosophical connection in Jackson’s (), p. observation that a moral value function can be thought
of as describing how people ought to be motivated in their subjective decision-making.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
taken to motivate weakening the semantics so that it can mimic the predictions of
the Bayesian theory, or of other existing semantic theories, as desired. For obvious
reasons, it is not possible to show on empirical grounds that a strictly weaker theory is
incorrect. However, I will argue that the objections that Carr, Charlow, and Cariani
raise are not compelling. For the most part, the problems discussed are not even
specific to the Bayesian semantics, but rather very general issues that arise for anyone
who is trying to do lexical semantics within the framework of compositional model-
theoretic semantics, and we can reasonably expect that solutions to these general
problems will explain the special cases which involve deontic modals. Given this,
general methodological considerations favor adopting a more restrictive theory over
a more expressive one.
An important exception to this characterization is Charlow’s () objection
involving the deontic analogue of an evidential versus causal decision theory: e.g.,
if a behavior and a disease have a common genetic cause, should individuals refrain
from the behavior? Charlow argues that, to the extent that there is variation among
English speakers in their judgments on this point, no single decision theory can
be implicated in the semantics of ought. However, this argument assumes that the
distinction between causal and evidential decision theory must be located in the rule
used to compute expected values. As I will discuss in section , there is a prominent
version of decision theory according to which the causal/evidential divide is located
in the structure of a probabilistic model and the way that actions/interventions are
represented. On this construal, the Bayesian semantics can generate either kind of
prediction, depending on features of the probability distribution that is given as a
semantic parameter. If this general construal of causal and evidential decision theory
is successful, then, Charlow’s argument does not motivate adopting a more expressive
semantics.
A final category of objections involves real and apparent limitations in the Bayesian
theory’s ability to model certain conceivable kinds of reasoning about obligation. The
real limitations involve MaxiMax or MaxiMin choice rules, but I will argue that the
mere conceivability of someone employing such a choice rule in deontic reasoning is
not a sufficient reason to weaken the semantic theory dramatically. For a compelling
objection, empirical evidence that actual people do so would be needed, and this
evidence is conspicuously absent. The apparent limitation involves, for example, non-
consequentialist judgments in trolley problems. While there is abundant evidence that
many people have such judgments, the Bayesian theory can handle it straightforwardly
by locating variation in the value parameter: non-consequentialist intuitions are
associated with value functions which attach positive or negative value to the fact
that a particular individual takes a particular kind of action. In other words, the
Bayesian theory is not a semanticization of consequentialist ethics. It is a logical
and grammatical theory, and it stands or falls with the descriptive adequacy of its
validities, and with its ability to mesh with good theories of natural language syntax
and pragmatics.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
daniel lassiter
It may be useful for me to make clear from the beginning what my goals and
presuppositions are. I am a linguist and a cognitive scientist, and I am primarily
interested in understanding language, concepts, and reasoning as aspects of human
cognition. Metaphysical questions are relevant to these research interests, but only to
the extent that humans have metaphysical beliefs and assumptions which influence
their cognition and behavior. The question of the ultimate truth of one or another set of
metaphysical claims simply does not arise in this enterprise. Probably in part because
of this orientation, I am unsure to what extent I disagree philosophically with critics
of the Bayesian semantics. (I don’t have any particular objection to Charlow’s expres-
sivism, for example, but I also don’t know whether the distinction between truth-
conditional and expressivist frameworks does any work in a cognitively-oriented
theory of meaning and communication.) In any case, I hope that it will at least emerge
that the criticisms considered here are not special problems for the Bayesian semantics,
and that this theory does a pretty good job of regimenting the grammatical and
inferential features of deontic modals. Indeed, I think that it does this job much better
than any other theory that has been spelled out with a comparable level of specificity
and predictive power.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
ought, should, good, bad, required, and supposed to are gradable, there is no clear
evidence for the gradability of may. It is still unclear what the situation is for must,
or have to: it’s usually thought that they are not gradable, but Portner and Rubinstein
(this volume, chapter ) argue that they are, and that their limited gradability is closely
related to the limitations on gradability of extreme adjectives like huge and gorgeous
(Morzycki, ). Regardless of how this empirical question comes out, gradability and
scalarity are not the same thing. Gradability is a grammatical property of expressions,
while scalarity is a property of the model-theoretic objects that expressions denote
(see Lassiter, c for discussion). It might turn out that deontic modals that are not
gradable nevertheless have a scalar component of their meanings, but the thresholds
that they invoke cannot be bound by other operators.
If this is correct, the next question to ask is what kinds of scales are relevant.
In the recent literature on degree expressions a great deal of attention has been
devoted to scale structure, i.e., the formal properties of degree scales (see surveys
in Lassiter, a; Morzycki, ). The bulk of attention in this literature has been
devoted to two issues. First, there is the question of how to model the difference
between one-dimensional adjectives like tall/short and higher-dimensional adjectives
like healthy/sick, where many factors go into the determination of the degree of the
property that an individual possesses (Kamp, ; Bierwisch, ; Sassoon, ).
Second, there is the question of whether a scale has or lacks lower and upper bounds,
which has been argued to influence the vagueness of adjectives and their potential
to combine with degree modifiers such as slightly and completely (Hay, Kennedy and
Levin, ; Rotstein and Winter, ; Kennedy and McNally, ; Kennedy, ).
In Lassiter (, d, ), I build on the insights of Measurement Theory
(Krantz, Luce, Suppes and Tversky, ) to identify a third parameter of scalar
variation: the logical part-whole structure of scales, i.e., systematic relationships
between the degree of a property that some object possesses and the degree to which
its parts possess the property. In particular, some scales are additive in the sense that
the degree to which an object has the property is the sum of the degrees to which the
object’s proper parts have the property. Clear examples of additive scales are weight
and size: the degree to which I am heavy is the sum of (i) the degree to which my right
arm is heavy, and (ii) the degree to which the rest of my body is heavy. In contrast,
predicates of temperature are not additive but intermediate. The degree to which an
object is hot is not the sum of the degrees of heat of the object’s parts, but rather
some value in between these degrees of heat. If my two arms, two legs, torso, and
head are all around degrees Fahrenheit, then (fortunately) my body temperature
is not around degrees Fahrenheit but around the same degrees. It is easy to
imagine scales which behave differently with respect to part-whole relationships. One
important alternative possibility is maximality. If a scale is maximal, then whenever
an object z is fully covered by non-overlapping parts x and y, z’s degree on the scale
is either the same as x’s degree or as y’s, whichever is greater. Some scales do not
seem to encode any such systematic connection, e.g., happy/sad and beautiful/ugly.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
daniel lassiter
How and whether parts and wholes are systematically related constitutes an additional
parameter of variation in degree scales.
The issue of part-whole structure is vital for a scalar treatment of deontic modals
because many interesting questions in the study of deontic modals are about
monotonicity—when and whether embedding under a deontic modal preserves or
reverses entailment relations between propositions. In possible worlds semantics
entailment is ultimately about part/whole relations, because it is modeled as a subset
(part-of) relation between propositions construed as sets of worlds.
Given this, questions about the monotonicity of deontic modals can be refined into
questions about the part-whole structure of deontic scales. Both additive and maximal
scales have the property that the degree to which an object (proposition) has some
property is at least as great as the degree to which its constituent parts (subsets) have
the property. Suppose that ought means “has moral value greater than θ ” for some
threshold θ . (Under fairly light assumptions, this subsumes the common “better than
not” gloss as a special case.) Then additivity or maximality of deontic scales would
entail that ought is upward monotonic: if φ entails ψ then ought(φ) entails ought(ψ).
This is because the entailing sentence φ denotes a subset of the denotation of the
entailed sentence ψ, and maximal and additive scales require a set to have a degree
of the relevant property at least as great as any of its subsets.
If deontic scales are either additive or maximal, then, we expect inferences such as
the following to be valid on the threshold semantics for ought just mentioned.
() a. You ought to mail this letter.
b. So, you ought to mail this letter or burn it.
Ross’s puzzle (generalized from a related issue around imperatives, Ross, ) is
precisely that these inferences are intuitively invalid. Now perhaps this intuitive
invalidity can be explained away on Gricean lines as due to the presence of disjunction;
if so, an upward monotonic semantics for ought remains plausible (Hare, ;
Wedgwood, ; von Fintel, , but see Cariani, for objections). However,
Jackson Pargetter’s () Professor Procrastinate scenario provides a structurally
identical failure of monotonicity which cannot be explained away by Gricean con-
siderations about disjunction. Here is a version which I think brings out the relevant
intuitions more clearly than the original (from Lassiter, d, inspired by a scenario in
Cariani, ).
Juliet is considering whether to feign death by taking the drug that Friar Laurence has offered
her. If she does, it will put her in a coma, and she will die unless Friar Laurence administers the
antidote exactly hours later. If she takes it and the Friar does administer the antidote, she will
succeed in convincing her family of her death and she will be able to live happily ever after with
Romeo. If she does not take the drug, she will live a long life without Romeo and will be less
happy; this is much better than being dead, though. Unfortunately, the Friar is known for being
cruel and capricious, and it is extremely likely (though not totally certain) that he will “forget”
to administer the antidote if she takes the drug.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
Many people find both of the judgments in () reasonable in this scenario.
() a. It ought to be that Juliet does not take the drug.
b. It ought to be that Juliet takes the drug and the Friar administers the
antidote.
But if ought is upward monotonic this pattern of judgments should be incoherent. The
sentence embedded under ought in (b) entails the negation of the embedded sentence
in (a), and so the truth of (b) should entail the truth of It ought to be that Juliet takes
the drug and the falsity of (a).2
This is one of several arguments that have been advanced in favor of the claim that
ought is not upward monotonic; see Lassiter (, chs –) for several more. As a
corollary, if we were correct to suppose that the meaning of ought is built around a
scale of moral/practical value, this scale cannot be either additive or maximal. (By the
way, the theories of Lewis () and Kratzer (, ) make use of qualitative scales
that are effectively maximal.)
A third option, if the value scale is neither additive nor maximal, is that it is
intermediate. If so, we predict that () is a coherent pattern of judgments: φ ∧ ψ can
have value greater than θ while φ does not, as long as the value of φ ∧ ¬ψ (Juliet takes
the drug and the Friar does not administer the antidote) is lower than θ . Then, the
value of φ—equivalently, of the disjunction (φ ∧ ψ) ∨ (φ ∧ ¬ψ)—will be intermediate
between those of φ ∧ ψ and of φ ∧ ¬ψ, and may well fall below the threshold. For
similar reasons, if deontic scales are intermediate we also do not predict the validity
of the Ross inference in ().
But it would be nice to go beyond simply rendering intuitively true judgments such
as () coherent, and rendering intuitively false sentences such as (b) not entailed
by obviously true ones. As Charlow () emphasizes, it would be much better if
we could also explain why the former are felt to be true, and the latter false, in
the relevant scenarios. To do this we need to propose a specific intermediate scale
for the relevant expressions whose structural properties are related to independently
motivated features of the context; we also need to make some assumptions about how
the lexical meanings of the various deontic expressions make use of this scale. We
can make progress on this front if we adopt the proposition that deontic scales have
the structure of expected value—including, as a special case for moral judgments,
expected moral value.3
2 That is, on the usual assumption that ought(φ) and ought(¬φ) cannot both be true. Note that this
version of the puzzle problematizes von Fintel’s () attempt to explain away Procrastinate cases within a
quantificational semantics by supposing that the domain of quantification shifts between the two sentences.
The fact that it is specifically mentioned that the Friar might administer the drug makes it look pretty ad
hoc to stipulate that worlds where he does so are not considered in forming this ought-judgment.
3 To simplify the math, I’m pretending that the set of possible worlds W is finite. I will also continue to be
sloppy about the distinction between sentences and the propositions that they denote relative to a context.
It should always be clear in what follows which is intended.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
daniel lassiter
First, we assign a value V(w) to each possible world w in the set of all worlds W
(or perhaps to coarser-grained objects such as cells in a partition of the set of possible
worlds; the difference will not be crucial here). The value of a world is a real number
representing how morally or practically desirable it would be if all of the facts of the
world were arranged as in w. The use to which we put value assignments will ensure
that they are unique up to positive affine transformation. That is, the numerical value
V(w) assigned to a world w is not meaningful per se, but it gains meaning by virtue
of the relative sizes of the gaps between V(w) and the values of other worlds V(w )
and V(w ).
Secondly, we assume that a probability measure P is provided (“by context”, however
this is spelled out). This function maps propositions A ⊆ W to real numbers in
the [, ] interval. This function obeys the usual constraints: it is additive for disjoint
propositions, P(W) = , and its domain is a (σ -)algebra, closed under (countable)
union and complement. This could be a probability measure representing the state
of knowledge of the speaker or of the holder of an obligation, but we probably need
to allow that it could belong to another relevant individual or be abstracted from the
information available to a group. It could even be the trivial probability measure of an
omniscient being, assigning probability to the unit sets containing the actual world
and to all others. (This conceit can plausibly be used to model the “objective” use of
ought: see Wedgwood, .)
Next we define the expected value of a proposition, i.e., the average value of the
worlds in the proposition, where each world’s value is weighted by the probability that
that world will be actual on the condition that the proposition is true.
EV (φ) = V(w) × P({w}|φ)
w∈φ
We can derive from this definition a formula for calculating the expected value of a
disjunction of disjoint propositions: it is a weighted sum of the expected values of the
individual disjuncts, where the weights are given by the probabilities of the disjuncts
conditional on the disjunction itself (Jeffrey, b, §).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
daniel lassiter
which Kolodny and MacFarlane focus. A more fundamental problem is how the
judgment that We ought to block neither is true can be so robust, given that it is clear
that the best outcomes are all of the following form: either we block A (and the miners
are in fact in A) and we block B (and the miners are in fact in B). In either case, all
miners survive, while doing nothing ensures that someone will die.4 On the usual
assumption that “ought” means “true in all of the best accessible worlds”, this set-up
entails that We ought to do nothing should be clearly false, and We ought to either block
A or block B should be clearly true. In the Miners’ Puzzle, the thing that we ought to
do is guaranteed suboptimal: it is something that we do in none of the best worlds.
When we focus on this issue, the puzzle is a variant of Jackson’s () Medicine
scenario, involving a doctor who must choose whether to prescribe a promising
but very risky experimental treatment or a safe but mediocre treatment. Even if
the best possible result of the experimental treatment is clearly better than the best
possible result of the safe treatment, most people have the intuition that the doctor
ought to prescribe the safe treatment, given that there is a significant chance that the
experimental treatment will (say) cause the patient to die.
In the Medicine puzzle, as Jackson (, p. ) points out, “[t]he obvious answer
is to take a leaf out of decision theory’s book”. First, weigh the values of the possible
outcomes of the available actions against the probability that they will be actual if the
action is taken. Whichever action has the greatest expected value is the one which we
ought to take. This procedure is encoded in the Bayesian semantics we have discussed,
and it combines reasoning about values with reasoning about probabilistic infor-
mation in a common-sensical way. The information-sensitivity of ought is predicted
immediately, with no need to add additional mechanisms or assumptions to the theory.
4 This analysis is challenged by von Fintel (), who claims that the worlds where we do nothing
and some miner(s) die actually are better than the worlds where we pick right and they all live. This
judgment strikes me as incredible. But the discussion around it suggests that the judgment is based not
on consideration of possible worlds qua fully specified states of affairs with no remnant uncertainty, but on
sets of worlds where we perform certain actions, where uncertainty about which world is actual remains
a relevant consideration. (Kolodny and MacFarlane (, §.) seem to slip similarly between intuitions
about worlds and propositions: “a world in which both shafts are left open may be more ideal than one in
which shaft A is closed relative to a less informed state, but less ideal relative to a more informed state”.) This
is not the question, though: we already knew what ordering on worlds a classical semantics would have to
deliver in order to generate the right predictions, but the problem was that it is implausible that a ranking
of worlds in terms of (moral) value could give us that ordering. What we need is a rule that can derive the
relevant judgments about which action is best (taking into account uncertainty about consequences) from a
plausible ranking of fully specified worlds. The Bayesian semantics delivers exactly this. However, if we want
to complicate the account by taking into account whether individuals with certain kinds of information take
certain actions, this is no barrier: see section . for discussion.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
The same treatment extends immediately to the Miners’ Puzzle, and to variants
which involve detailed manipulations of probabilities and outcomes which are prob-
lematic for non-Bayesian theories (Lassiter, d). A plausible analysis of Kolodny
and MacFarlane’s () ought-conditional interactions is also available in terms of
conditional expected value. See Lassiter for details.
5 Note—as a number of authors have recently pointed out, including Cariani () and Charlow
()—that these semantic theories do not technically force deontic judgments to emulate (respectively)
MaxiMax or MaxiMin decision procedures. But this is for a boring reason: until recently, everybody
assumed that deontic judgments were related to something that was independently motivated, such as
moral judgments about the relative values of certain fully specified outcomes, for which uncertainty is not
a relevant consideration. If we weaken our metasemantic theory so that the parameter controlling values is
not tied to anything that can be independently motivated and examined empirically, we grant ourselves the
freedom to reverse-engineer whatever value parameter (ordering source, etc.) is needed to support whatever
the observed judgments are in a given context. There does not seem to be any technical barrier to making
this move, but if it is the only way to save the formal structure of a best- or worst-worlds semantics, the
choice seems clear: a theory which delivers the right judgments on the basis of information about the values
of fully specified outcomes—without requiring extra degrees of freedom—is greatly preferable on general
methodological grounds of restrictiveness and predictive power.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
daniel lassiter
from the failure of MaxiMax and MaxiMin to the proposal that we should parametrize
the semantics by decision rules, allowing that any rational decision theory could be
supplied by “context” and used to calculate truth-conditions.6 In this way the account
effectively subsumes the Bayesian semantics as a special case, invoked whenever the
context supplies a decision rule based on expected value. Perhaps the context always
happens to be like this, in which case the data would appear to support the Bayesian
theory; but Carr’s position entails that this would be merely apparent.
It is striking that Carr moves without argument from the premise that neither
of two salient alternative semantic proposals explains information-sensitivity to the
conclusion that no proposal can solve the problem, unless we greatly increase the
expressiveness of the semantics—equivalently, unless we temper the predictive ambi-
tions of our theory. What is missing is a generic argument showing that no semantic
theory of ought that encodes substantive normative assumptions could be successful.
Without such an argument, we would be justified in insisting on the most empirically
restrictive semantics available that is compatible with the available data (which is, I
have argued, the Bayesian semantics).
A Moorean Open Question argument might do the trick here: if the meaning of
deontic modals encoded controversial normative assumptions, they would not be
controversial, since it would be apparent to all that these assumptions are incorrect.
Like many before me, I don’t find this kind of argument very convincing. It assumes
that we are able to reliably introspect meaning facts, and it relies on a sharp distinction
between meaning facts and facts about the world (compare §. below). However,
there is a hint of an empirical argument in Carr’s (, §) discussion of binding
in conditionals, which is elaborated by Charlow (, §.). Carr points out that it
makes sense in the Miners’ scenario to say such things as
() a. If MaxiMax is right, we ought to either block shaft A or block shaft B.
b. If MaxiMin is right, we ought to do nothing.
This can be explained if the meaning of ought is sensitive to a decision-rule parameter,
and the antecedent of this conditional binds this parameter temporarily for the
purpose of evaluating the consequent. What’s more, it could be taken to show that
the meaning of ought could not build in a specific decision rule: if it did, conditionals
of this form would all be trivial. That is, a conditional whose antecedent describes
the decision rule given by the correct semantics would be equivalent to (a), and any
antecedent which describes a decision rule incompatible with the correct semantics
would be equivalent to (b).
6 One could reasonably disagree with the implication that MaxiMax and MaxiMin are examples of
rational decision rules. The general point remains that there might be multiple rational decision rules which
make different recommendations in some situations. Certainly it would be interesting and relevant for
empirical semantics if we could find evidence that people vary in the procedures that they use to integrate
uncertainty and values in reasoning about obligation, regardless of whether such people would count as
“rational.”
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
daniel lassiter
b. If horses cannot be athletes, then Secretariat does not belong on the list of
greatest athletes of the th century.
Plunkett and Sundell () come to a similar conclusion, and point out that a similar
argument could be run involving midwest, war, and many other ordinary terms.
Standard theories tell us that the conditionals in () and () must receive a
trivializing interpretation, similar to those in () and (). This is not a problem for
anyone’s theory of the meaning of ought, tomato, or athlete: it is a problem for standard
theories of conditionals. What is really going on here is that the antecedents of the
troublesome conditionals somehow bind the interpretation of certain terms for the
purpose of evaluating the consequent. This process can also produce side effects by
influencing the interpretation of other terms that are semantically related to the term
in question. () and () illustrate.
() If Bill ran to the store, Mary positively sprinted. (“We both saw how fast
Bill and Mary were moving; if we agree to call Bill’s style of movement
‘running’, then—given the semantic relationship between these terms—we’re
committed to describing Mary’s faster movement as ‘sprinting’. ”)
() If Sam is small, Al is tiny. (“We both know how big they are; if we decide to call
Sam ‘small’, then—given the semantic relationship between these terms, and
the fact that Al is substantially smaller than Sam—we’re committed to calling
Al ‘tiny’. ”)
Neither of these conditionals commits the speaker to endorsing the consequent; there
may even be an implicature that the speaker thinks the consequent is not a good
description of the relevant situation.
I predict that a solution to the general problem exemplified by (), (), and
especially () and () will account for the binding relationship in () automatically.
The supposition that the antecedent MaxiMax is right is true has an effect on the
interpretation of the consequent, via the supposition that a certain semantic issue has
been resolved in a particular way. This is parallel to () and (), which can be used
even if the speaker is strongly inclined to judge the relevant consequents false. For
similar reasons, someone who is implicitly committed to the falsity of the antecedents
of () and ()—because they describe an incorrect theory of the meaning of ought—is
still able to consider the truth-values of ought-sentences on the supposition that ought
is interpreted differently.
How to make sense of the metalinguistic use of conditionals within a compositional
model-theoretic semantics is a fascinating question about which I have little to say at
the moment. Whatever the best story turns out to be, though, the problem clearly
goes beyond the narrow issue of the lexical semantics of deontic modals; trying to
resolve the problem in () by parametrizing the semantics of ought (or athlete, tomato,
or run) is missing the generality of the issue. My response here is one which will recur
throughout the rest of this essay: critics have mistaken very general semantic puzzles
for special problems of particular theories.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
gene G
milk obese
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
daniel lassiter
a. The correct semantics for deontic modals refers to a scale of expected value
calculated using conditional probabilities.
b. The correct semantics for deontic modals refers to a scale of expected value
calculated using probabilities of counterfactuals.
c. The correct semantics for deontic modals refers to a scale of expected value
calculated using unconditional probabilities.
d. The correct semantics for deontic modals is noncommittal between these
options—either by being parametrized by a decision theory (Carr) or because
deontic modals can be used to express, in a non-truth-conditional fashion, the
recommendations of these decision theories and many more (Charlow).
Charlow argues that, since native speakers show a mixture of causal-deontic and
evidential-deontic judgments, none of (a)–(c) can be right, on pain of rendering these
speakers’ judgments incorrect as a matter of meaning. He concludes that (d) must be
the correct approach.
This argument misses the fact that there is another approach to causal decision
theory, relying on causal models, which does not require us to modify Jeffrey’s
formula for calculating expected values. This approach and its implications for the
causal/evidential debate is discussed clearly by Meek and Glymour (). It is closely
related to Pearl’s (, §) formulation of causal decision theory, which relies on
the addition to the probability calculus of a do operator representing an intervention
which modifies the probabilistic dependencies among variables in a causal graph. (See
Sloman, ; Sloman and Lagnado, , and the papers in Gopnik and Schultz,
, among many others, for empirical evidence of the relevance of this approach
to causal modeling to human cognition.)
In brief, Meek and Glymour () argue that the causal/evidential divide reduces
to the choice of how actions are modeled in a given probabilistic model: expected
values can be calculated with respect to a measure conditionalized on the observation
that an action has been performed (evidential) or the existence of an intervention
designed to produce that action (causal). Expected utilities are calculated in the same
way in both cases, but when an action is treated as an intervention, the action does
not influence the probability of nodes which are not causally dependent on it. In other
words, the expected utility of an action construed as the result of an intervention will
not be influenced by the “news value” of the action about its causes and their other
effects: interventions render actions independent of their other causes, and so do not
influence their probability or those of other effects which are not also effects of the
action in question.
More concretely, in the case of the milk/obesity example: if we model drinking milk
as an ordinary event, the choice to drink milk is evidence that you have G, and having G
means that you will become obese (Figure , left). This means that the Jeffrey expected
value of drinking milk will be lowered by the fact that it raises the probability of having
G, and so of becoming obese. This gives rise to the evidential-deontic prediction you
ought not to drink milk.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
On the other hand, we could also model milk-drinking as the result of an exogenous
intervention, i.e., an uncaused variable Imilk with three possible values: T, F, and
None, where the latter represents no intervention. On Meek and Glymour’s ()
account, setting Imilk to T or F has two effects: it sets the probability of milk to or
(respectively), and it renders milk independent of G. As a result, the existence of
an intervention Imilk = T is not informative about the probability of the undesirable
outcome obese (Figure , right). (Note that this independence does not follow from
the structure of the causal model depicted, but rather from the definition of the full
joint distribution: see Meek and Glymour, , pp. –.) In this case, the expected
utility of drinking milk is no longer sensitive to the possibility of obesity. Since we
are assuming that drinking milk is intrinsically desirable, we wind up with the causal-
deontic prediction you ought to drink milk.
The difference between an observation and an intervention can be represented
in a visually more perspicuous—but equivalent—way by adopting Pearl’s () do
operator, where an intervention triggers a “surgery” which removes edges from the
causal graph. This is pictured in Figure .
Meek and Glymour’s () approach to modeling the difference between evidential
and causal judgments does not require varying the formula for calculating expected
values. Rather, the difference is that we are conditioning on different events: drinking
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
daniel lassiter
milk, or the existence of an intervention which sets P(milk) = . Since the internal
structure of the probability distribution is not something that is fixed by the Bayesian
theory—like other theories, we treat it as a contextual parameter—the Bayesian theory
makes room for both causal-deontic and evidential-deontic judgments. As a result, the
observation that there is (or at least may be) variation among speakers in intuitions
along these lines is not problematic for the Bayesian theory, and does not support
the conclusion that we should weaken the semantic theory in the ways that Carr and
Charlow suggest.
For the case of binding specifically, it is natural to analyze Charlow’s examples as
follows:
() a. If causal decision theory is right, you ought to drink milk.
b. If evidential decision theory is right, you ought not to drink milk.
While I am strongly inclined to endorse both the antecedent and the consequent
of (a), I also think that (b) is clearly true. Is this a problem? Not at all, if the
antecedent if evidential decision theory is right is interpreted as “If the right way to
model milk-drinking here is to treat it as a caused event, rather than an exogenous
intervention.” If we require the P(·) parameter to conform to that supposition, (b)
will come out true on the Bayesian semantics. For similar reasons, someone with
deontic-evidential judgments should be able to see (a) as true because he can
consider the effects of adopting a probabilistic model in which milk-drinking is treated
as an intervention.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
bayesian semantics
7 Note that the claim that it is a problem is not made explicitly by the authors that I am responding to
here. However, I have heard it from numerous philosophers in public and private communications on the
topic, and I suspect that some assumption along these lines is widespread: judging by his choice of title,
even Goble () seems to make it in “Utilitarian deontic logic,” one of the first statements of the Bayesian
semantics.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
The answer is “no,” and the reasoning is essentially the same as Lewis’s in his
response to McMichael (Lewis, ). McMichael had accused Lewis of encoding a
problematic utilitarianism into the deontic theory of Counterfactuals (Lewis, , §).
This objection would be valid if Lewis’s theory contained, in addition to a semantic
parameter representing deontic value as a world-ordering, some metasemantic restric-
tion on the kinds of information about a world that can go into determining its value.
But Lewis alleges no such restriction: in principle, any piece of information about
two worlds can play a role in determining their relative goodness. This could include
whether Bobby harmed Jimmy intentionally or accidentally, whether he harmed him
actively or simply allowed him to be harmed, and so on. These are facts about actions,
to be sure, but actions are part of the world, and a semantic theory which has access to
information about the whole world will of course have access to information about
properties of actions that take place in that world. As Lewis (, pp. –) puts
it: “The semantic analysis tells us what is true (at a world) under an ordering. It
modestly declines to choose the proper ordering. That is work for a moralist, not a
semanticist.”
Similarly, the Bayesian theory has no difficulty with the fact that people’s deontic
judgments are frequently sensitive to more fine-grained information about the world
than (for instance) the overall balance of good and evil. No metasemantic restriction
is in place to prevent the moral value function from ranking worlds in which the harm
that Bobby inflicted on Jimmy was intentional below worlds which are identical except
that this harm was accidental. The information that distinguishes these worlds involves
psychological facts about people in them, but nothing in the theory restricts the moral
value function from taking such information into account. If we wanted to encode
(say) a utilitarian ethical theory into the Bayesian semantics sketched above, we would
have to do it by adding a metasemantic restriction which constrains admissible value
functions in this way. Even radically nonconsequentalist theories—those in which the
consequences of actions are never taken into account—can be modeled within the
Bayesian semantics by placing constraints on allowable value functions; all we would
have to do is to supply a value function which ignores the consequences of actions
in choosing how to rank two worlds, and pays attention only to the universalizability
of agents’ actions, their conformance with God’s will, or whatever other feature we
choose to focus on.
There are important limitations to the Bayesian theory’s expressiveness, but these
limitations involve logical relations among sentences. The theory rules out, for exam-
ple, the possibility that φ ∨ ψ can have value greater than that of φ. If God decrees
that worlds are classified according to whether people aim to praise Him and to help
the poor, He still cannot make praying or giving to the poor better than each of
the following individually: praying, and giving to the poor. Such logical predictions
are the main selling point of the Bayesian semantics. Perhaps there is some strange
ethical theory which is not compatible with this prediction; if so, and if we could find
non-theorists whose moral judgments coincided with the predictions of this theory,
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
8 I’ve used w where Lewis and Kratzer would use a symbol closer to w . I find it more natural to use
the former for “at least as good as,” and the latter for “at most as good as”/“at least as bad as.”
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
(Again, the latter is stronger: EV (·) naturally determines an agreeing preference order
via φ s φ ⇔ EV (φ ) ≥ EV (φ ).) For the specific case of ought we can be
fairly noncommittal about how this maps into judgments, except to require that a
cooperative speaker in ideal conditions will endorse ought(φ) only if she judges that φ
is deontically better than all relevant alternatives. “Better” is spelled out in the obvious
way using s or EV (·), depending on the theory. Now we can define what it means for
a semantics for deontic modals to be “contentful”:
A contentful semantics for deontic modals is a semantics which places constraints
on the kinds of probability measures P(·) and preferences over worlds (w or V)
which can be associated with which preferences over propositions (s or EV (·)).
The Bayesian theory is as contentful as can be: its parameters P(·) and V together fully
determine EV (·). So is the Lewis/Kratzer semantics, where s is fully determined by
w , and P(·) is ignored.
A semantics which is not at all contentful would be one which begins with a P(·)
and w , or a P(·) and V, and allows them to be associated with any s or EV (·)
(as appropriate). Such a semantics would allow, for example, that ought judgments
could be associated with a s ordering generated by a MiniMin rule (you ought
to perform the action with the worst possible worst-case outcome), or a random
shuffling of propositions (RandomChoice), or that the proposition-ordering could be
intransitive or cyclic. While we are focusing on the back-and-forth between maximally
contentful theories (such as the Bayesian one) and very weak theories (such as Carr’s
and Charlow’s), there is clearly a lot of logical space between the extremes: it could be
that the semantics constrains but does not fully determine the relationship between
these components.
I don’t know of any direct empirical evidence of variation among non-theorists in
which choice rules are used in forming ought-judgments, holding information and val-
ues fixed. (As discussed above, “values” must be read here to permit value judgments to
include all of the information about a world, including issues of personal vs impersonal
harm, action vs omission, etc.) A prominent class of potential counter-examples here
involves varying attitudes toward risk in ought-judgments. Such variation is well-
motivated empirically in the case of choice behavior, and it’s reasonable to speculate
that it exists in deontic judgment as well (though direct empirical evidence involving
deontic judgment specifically is needed before we can be confident). However, it is
Note that, in Kratzer () theory w is not a contextual parameter, but is determined indirectly by a
modal base f : w → P(W), an ordering source g : w → P(W), and a rule for constructing w from f and
g: relative to an evaluation world w ,
w w w ⇔ w , w ∈ f (w ) ∧ {p ∈ g(w )|w ∈ p} ⊇ {p ∈ g(w )|w ∈ p}.
Any choice of f and g will determine a reflexive and transitive order w . In addition, Lewis () proved
that, for any reflexive and transitive order w over a subset of W, Kratzer can choose some f and g which
deliver exactly that w . The theories are thus equally expressive, and the choice to treat w as a parameter
or derive it using Kratzer’s more elaborate method makes no difference for our purposes.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
9 Rabin () is a prominent critique, but see also Chetty and Szeidl () for evidence that expected
utility models are not as badly off as Rabin’s critique might make it appear.
10 There is interesting work suggesting intra-individual variation in judgments triggered by cognitive
load manipulations (Greene, Morelli, Lowenberg, Nystrom, and Cohen, ). I assume here that we are
modeling reflective judgments formed under minimal task demands. There is also evidence of framing
effects in moral judgments, with logically equivalent scenarios being judged as worse when their negative
aspects are highlighted (Sunstein, ; Kern and Chugh, ). This effect, known as “moral loss aversion,”
is a different phenomenon from risk-aversion, and is usually thought to be a judgmental bias rather than an
effect that should be predicted by a decision theory.
11 Throughout the argument, “people,” “speakers,” etc. are implicitly restricted to non-theorists; otherwise
several of the premises would be quite implausible, for reasons discussed above.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
c. So, it is possible [in the sense relevant to (a)] for two competent speakers to
form contradictory ought judgments as a result of employing different choice
rules. [From (a) and (b)]
. Sub-argument:
a. If two competent speakers form contradictory ought judgments as a result of
employing different choice rules, this is a substantive disagreement about an
ought judgment which is generated by the use of different choice rules.
b. In the sense of possible relevant to (a), If p then q entails If it is possible that
p then it is possible that q.
c. So, if it is possible [in the sense relevant to (a)] for two competent speakers
to form contradictory ought-involving judgments as a result of employing
different choice rules, then it is possible [in the sense relevant to (a)] for
there to be a substantive disagreement about an ought judgment which is
generated by the use of different choice rules. [From (a) and (b)]
. So, it is possible [in the sense relevant to (a)] for there to be a substantive
disagreement about what ought to happen which is generated by the use of
different choice rules. [From (c) and (c)]
. Disagreements are either substantive or verbal. Substantive disagreements
require that the parties involved use all of the lexical items with the same meaning
when describing the issue about which they are disagreeing. If there is any
difference in the meanings expressed by the parties to the disagreement when
they describe the issue under contention, the disagreement is merely verbal.
. So, it is possible to vary the choice rule used in the generation of an ought
judgment without varying its meaning. [From () and ()]
The argument appears to be valid, though it falls short of establishing that ought is
not contentful, i.e., that any choice rule is compatible with the meaning of ought. For
that, we would need to assume that, for any definable combination of P(·), w , s —
or P(·), V(·), EV (·), depending on the theory we’re working with—we can conceive
of a disagreement where one party is employing this combination. This would have to
include combinations determined by crazy rules like MiniMin and RandomChoice, as
well as combinations where s is intransitive or cyclic. I find it more or less impossible
to imagine these things, though perhaps this difficulty can be attributed to other
factors (say, to constraints imposed by an intuitive theory of others’ psychology).
The real problems with this argument are broader: a number of the premises are
debatable at best. Consider premise (a), for example. The plausibility of this premise
depends on what sense of “possible” is relevant. Epistemic possibility is a non-starter—
it’s easy to conceive of things that are epistemically impossible, like the moon being
made of green cheese. Metaphysical possibility is a plausible candidate for the intended
sense. If “possible” is resolved in this way, then premise (a) amounts to the assumption
that conceivability entails metaphysical possibility—a hotly disputed point, to put it
mildly. For example, Putnam () argues that it is conceivable that water is not
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
how to classify it: perhaps one party is right and the other is wrong, or perhaps the two
parties are speaking languages which differ ever so subtly. Intuitions about this case
are not clear, and we cannot expect to rely on them in adjudicating the issue. Perhaps,
as is plausible in the tomato case, there would not even be a fact of the matter.
The “tomato” and “athlete” examples problematize premises (a) and () of the
argument about ought-involving disagreements as well. Both of these premises pre-
suppose that there is a clear distinction to be drawn between substantive and verbal
disagreements. The assumption that such boundaries exist is independently troubling
in light of the general problematization of the analytic/synthetic distinction since
Quine (). The claim that a disagreement must be either one of meaning or one
of fact presupposes that a such a distinction exists. Worse, the assumption that we
can reliably intuit which is at play in a given case presupposes that we can reliably
distinguish meaning facts from facts about the world. But even if we suppose, contra
Quine, that the analytic/synthetic distinction is real and theoretically useful, the
history of philosophical work which uses or criticizes the concept of analyticity
demonstrates clearly that our intuitions about how to classify particular cases are not
generally reliable.
In sum, the argument from the conceivability of certain kinds of disagreement is
very generic in form, and has several problematic aspects that have nothing particular
to do with the semantics of deontic modals. As a result, this argument does not cause
me to lose sleep. On the other hand, solid empirical evidence of deontic choice-rule
disagreements among non-theorists would constitute a strong argument for relaxing
the semantics to make room for whatever patterns are actually observed. Alternatively,
if there were separable populations of speakers who consistently used ought in different
ways along this parameter, we might be justified in supposing that their languages
differ subtly in the meaning of ought. In my estimation, the most fruitful direction for
research on deontic modals will be oriented toward empirical issues of this type.
This response still leaves important philosophical questions unanswered. What is
the relationship between (e.g.) the Bayesian semantics and the notion of analyticity?
Isn’t the Bayesian semantics implausible because, if correct, it would be an analytic
truth that it is correct, and so obvious and trivial? In the back-and-forth between
philosophers of language and practicing lexical semanticists, there seems to be a
fair bit of confusion on this point at a larger scale. For example, Fodor and Lepore
() criticize Cruse’s () classic, descriptively-oriented lexical semantics text-
book, which takes much time describing relationships among lexical items such as
entailment, antonymy, and synonymy. In their estimation, this work is hopelessly
confused because it confuses the meaning of an expression like hot with the inferences
that people tend to draw from it (e.g., that something that is hot is not cold). They
formulate extensive related criticisms of Pustejovsky’s () Generative Lexicon the-
ory, a richly structured theory of lexical semantics intended to account for systematic
meaning shifts which occur predictably in certain environments and across entire
classes of items. (For example, book, CD, and file can each be interpreted both as a
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
12 Interestingly, Cariani () and Charlow () both give ought more semantic structure than this,
and Cariani endorses a probabilistic semantics for epistemic likely along the lines of Yalcin (, b);
Lassiter (, ). It’s not obvious to me that either of these positions could survive the philosophical
weapon of mass destruction conjured up by the disagreement argument: it’s possible to imagine people
disagreeing about pretty much anything that you might think to call “meaning.” If we take the disagreement
argument to its logical conclusion, I suspect that we will eventually be compelled to Fodor and Lepore’s
bare-bones concept of lexical semantics for all of these expressions.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
be sharply distinguished from everything else that they believe about the world.13 As
I understand it, this is how lexical semantics is generally done. On the other hand,
this methodology does not presuppose the non-existence of analyticity either: it could
turn out that some information really is so central that it cannot be doubted and/or
could not turn out to be otherwise. It could even turn out, as Fodor and Lepore argue,
that the boundary between language and the rest of cognition is drawn so narrowly
that all of the interesting work that has been done in lexical semantics is really about
concepts and reasoning. For linguists such as Pustejovsky (, ), arguments like
this, unaccompanied by testable empirical predictions, are rightly met with a shrug.
My attitude toward the debate around deontic modals and decision theoretic
concepts—“How much is in the lexicon, as opposed to being part of a concept of
obligation or a theory of moral reasoning?”—is similar. I’m not convinced that there
are meaningful boundaries to be drawn between meaning, concepts, and reasoning.
But maybe there are; if so, then we cannot expect to locate them by examining
intuitions about the conceivability of various scenarios. What is needed is a predictive
theory of ought that is embedded in a good compositional semantics for the whole
language, and responsive to the theory of syntax, pragmatics, and other aspects of
cognition. We can then judge the result in terms of the ability of the larger whole
to predict the empirical evidence available to us, as well as its overall coherence and
simplicity.
It might well turn out that there is a determinate answer to the question of what
precisely the meaning of ought is, in the heavy-duty, analytic-truth-generating sense.
If so, perhaps ought has a skeletal meaning that is normatively noncommittal, as
Charlow and Carr argue. If so, much of the work that we do under the rubric of “lexical
semantics” is probably better categorized as an investigation into the concept ought,
or into English speakers’ ought-involving reasoning habits, or something else of this
sort. This would be an interesting discovery, though not an especially troubling one
(except, perhaps, when searching for an institutional affiliation). In any case, we will
never know until a testable empirical difference is located.
Further Objections
Cariani () gives two further arguments against a Bayesian semantics: one involv-
ing attitude ascriptions and one involving dominance reasoning and zero-probability
events. In this section I consider these in turn, arguing again that they are special cases
of very general issues rather than specific problems for the Bayesian semantics. Given
that these puzzles arise independently, and that their general solutions can be expected
13 Obviously linguists working in the Montagovian tradition would be hard pressed to follow Quine’s
verificationism, behaviorism, skepticism about modal notions, etc. What I mean is that this attitude fits
in with Quine’s general philosophy of science, and in particular his belief that philosophy’s job is to aid
scientific inquiry, not to dictate methods to it.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
to extend to the case at hand, they fail to support Cariani’s conclusion that a much more
powerful semantic theory of deontic modality is needed.
. Attitudes
Consider again the MaxiMax enthusiast in the disagreement above; let’s name him
“Carl”. If asked what the doctor in the Medicine scenario ought to do, Carl would
answer that he ought to prescribe the experimental treatment which brings along
with it a very high chance of death. This is because the best possible outcome of
this treatment is a full recovery, which is better than the best possible outcome of the
safe but mediocre treatment. We could move probabilities around as much as we like,
making it .% certain that the patient will die under the experimental treatment,
but crazy Carl won’t budge: unless the experimental treatment leads to death with
probability , his beliefs about what the doctor ought to do are sensitive only to the
most optimistic outcome, the tiny chance that the patient will have a full recovery.
Long-suffering Martha shares Carl’s information and values, but has come to
the reasonable judgment that the doctor ought to prescribe the safe but mediocre
treatment. Having endured Carl’s arguments for several pages now, she might turn
to you and say in exasperation:
() Carl thinks that the doctor ought to prescribe the risky experimental treat-
ment. (But he’s wrong.)
Cariani (, §.) poses an interesting puzzle about how to make sense of Martha’s
statement. Even allowing that Martha’s use of ought is well described by the Bayesian
semantics of §, what are the truth-conditions of () in her mouth? () is no good,
for example.
() Carl thinks that, of the options available to the doctor, the action with the
highest expected moral value is to prescribe the risky experimental treatment.
If Carl knows what expected value is, he will presumably judge The action with the
highest expected moral value is to prescribe the risky experimental treatment to be false—
after all, he and Martha share all of their values and information. So () is false, but
Martha’s statement in () is clearly true; so () is not a good paraphrase of ().
Unfortunately, as Cariani points out, a straightforward, standard operator treatment
of thinks will end up giving us () as the interpretation of ().
What does () mean? A prima facie reasonable interpretation might be ():
() Carl thinks that, of the options available to the doctor, the action whose best
possible outcome is morally best is the risky experimental treatment.
But it would be a mistake to enrich the semantics by allowing thinks to shift some
additional parameter, and then conclude that the problem has been solved. This could
generate (), but it isn’t enough: Carl and Martha would both endorse the content of
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
the embedded clause in (), too. The real problem is that Carl and Martha don’t agree
on how ought should be interpreted, and neither () nor () captures this. The full
interpretation of () has a normative, metalinguistic component:
() Carl thinks that, of the options available to the doctor, the action whose best
possible outcome is morally best is the risky experimental treatment, and
that ought should be used to describe the action whose best possible outcome
is morally best.
This kind of interpretative flexibility in attitude reports is needed on independent
grounds. Consider (yet again) the tomato controversy in Nix v. Hedden. Port Con-
troller Hedden (the defendant) could have described the position of the plaintiffs
like this:
() The Nix family think that tomatoes are not fruit. (But they’re wrong: tomatoes
are fruit, and I was correct to make them pay the import duty.)
Let’s allow that, since Hedden classifies tomatoes as fruit, the interpretation function
relevant to interpreting his utterances maps tomatoes to a subset of the things that it
maps fruit to.14 Still, the intended interpretation of () is not “the Nix family think
that everything that is a tomato in Nix’s English is not a fruit in Nix’s English.” Rather,
the issue is about how “fruit” should be interpreted. () means roughly ():
() The Nix family think that, for the purpose of interpreting the Tariff Act,
objects in the extension of “tomatoes” should be treated as being outside the
extension of “fruit”. But in fact they are wrong: for the purpose of interpreting
the Tariff Act, “fruit” should be construed so that everything in the extension
of “tomatoes” falls into its extension as well.
Accounting for this interpretation in the context of a broad compositional semantics
for attitude verbs is an interesting challenge that I won’t try to pursue here. (See
Stalnaker, ; Shan, for relevant discussion and some directions.) But the
lesson, once again, is that the problems attributed to the Bayesian semantics are
very general problems indeed. A full theory of how () receives the normative,
metalinguistic interpretation paraphrased in () will—I predict—also explain how
() is interpreted as ().
. Dominance Reasoning and Zero-probability Events
Cariani brings up another objection to the Bayesian semantics, drawn from a recent
manuscript by Hájek (ms.) on the foundations of probability. Hájek imagines himself
throwing an infinitely thin dart at the [, ] interval, with a uniform distribution over
14 If you like the alternative interpretation and are convinced that this is metaphysically impossible,
modify the example so that one of the Nixes is the speaker, describing Hedden’s position that tomatoes
are fruit. The argument then goes through as in the main text.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
the possible landing points. It must land somewhere in this interval, but—for any
particular value x in this interval—the probability that the dart will land on x is .
Indeed, the probability that the dart’s landing point will fall within X is zero for many
large, even infinite, sets X ⊂ [, ]. (For example, X might be the set of all rational
numbers in this interval.) Hájek uses this example to bring out several important
problems for probability theory, such as the ratio definition of conditional probability
and the multiplication definition of independence.
The dart example also brings up an interesting puzzle about Bayesian decision
theory. The latter encodes a certain highly intuitive form of dominance reasoning as
a theorem, but only when possible-but-zero-probability events are excluded. Imagine
that Alan Hájek is about to throw an infinitely thin dart at the [, ] interval. You have to
choose between Option and Option . If you choose Option , the world will continue
in its current lousy condition no matter where the dart lands. If you choose Option ,
there will be eternal world peace if the dart lands on a certain real value r, and nothing
will happen otherwise.
Intuitively, it is clear that you ought to choose Option . But suppose that world
peace has moral value , , and the current state has moral value −. The
expected value of option is − × + , , × = −. The expected value
of Option is, well, − × + , , × = −. The zero-probability event of
the dart’s landing on r has no effect on the calculation, and so, against intuition, it is
morally indifferent which option you choose.
Here again, the objection is not really specific to the Bayesian semantics, but rather
is a very general issue for the foundations of Bayesian decision theory, and one which
we may expect optimistically to resolve by adopting whatever patch ends up being the
right one for the theory in general. But we can also say a bit more about the attempt
to use an argument of this form to problematize a proposal about the semantics of
English.
One general objection to this style of argument is that infinitely thin darts are
extremely remote from our everyday experience. If there were an infinitely thin dart
in the room, Hájek wouldn’t be able to find it, or to hold onto it long enough to
throw it. But I personally find it flatly impossible to imagine such a thing: my mental
image is just of a really, really thin dart (which wouldn’t be enough to get the puzzle
going). These observations may sound silly, but they point to a serious issue: we
have no reason to expect that linguistic or moral intuitions that have been forged
in our ordinary lives should be a reliable guide to what would or ought to happen
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
I take this to suggest that dominance reasoning can be upheld if the language being
modeled (non-mathematical English) is restricted so that it is unable to talk about the
troublesome zero-probability sets. It seems plausible that this is the case.
Conclusion
The Bayesian semantics does a good job of regimenting grammatical and inferential
properties of deontic modals which are troublesome for many theories. In particular, it
automatically encodes many of the information-sensitivity facts which have generated
so much excitement in the recent literature, and gives a clear semantics for widespread
gradability among deontic modals. Since ought is the most widely discussed deontic
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
item in the philosophical literature, we have focused on it here as well, but there is a
great deal to be said about may, must, should, and about good and its modified forms
as good as, (much) better than, etc.: see Lassiter ().
A number of arguments have been made in recent literature to the effect that the
Bayesian semantics is not expressive enough to deal with the interaction of ought with
conditionals and attitudes, with certain kinds of hypothetical disagreements scenarios,
and with situations involving possible events that have zero probability. In each case,
the proposed solution is to weaken so that the predictions of the Bayesian semantics
can emerge as one special case among many.
While it might turn out eventually that there is a good theoretical or empirical
reason to weaken the semantics, I have argued that none of the arguments surveyed
here is compelling. We can explain the problematic data in terms of independent
factors—some formal, some philosophical, and some involving known but unsolved
linguistic problems.15 Until a compelling empirical objection is found, the Bayesian
semantics continues to be a viable theory of the semantics of deontic modals, and—
in my opinion—the best theory available in its balance of empirical coverage and
theoretical restrictiveness.
References
Arrow, Kenneth J () The theory of risk aversion. Helsinki: Yrjö Jahnssonin Säätiö.
Bastiaanse, Harald () The rationality of round interpretation. In Sauerland, U., Nouwen, R.,
van Rooij, R., and Schmitz, H.-C (eds.) Vagueness in Communication. pp. –. Heidelberg:
Springer.
Bierwisch, Manfred () The semantics of gradation. In Bierwisch, M. and Lang, E (eds.)
Dimensional Adjectives: Grammatical Structure and Conceptual Interpretation. pp. –.
Berlin: Springer-Verlag.
Cariani, Fabrizio () The semantics of ‘ought’ and the unity of modal discourse. Ph.D.
dissertation. University of California at Berkeley.
Cariani, Fabrizio () “Ought” and resolution semantics. Noûs. . , pp. –.
Cariani, Fabrizio () Attitudes, deontics and semantic neutrality. Pacific Philosophical Quar-
terly. . , pp. –.
Cariani, Fabrizio () Deontic modals and probabilities: One theory to rule them all? In
Charlow, N. and Chrisman, M (eds.) Deontic Modals. Oxford: Oxford University Press.
Cariani, F., Kaufmann, S., and Kaufmann, S () Deliberative modality under epistemic
uncertainty. Linguistics and Philosophy. , pp. –.
Carr, Jennifer () Deontic modals without decision theory. In Chemla, E., Homer, V., and
Winterstein, G (eds.) Proceedings of Sinn und Bedeutung . pp. –.
15 Note, by the way, that this conclusion should not be construed as an argument against Charlow’s
expressivism. Expressivism is a plausible thesis about deontic and epistemic modals, and probably other
aspects of language as well. I don’t think that being an expressivist requires one to reject a Bayesian semantics,
though. See Yalcin (a) for some relevant discussion.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Chalmers, David () The conscious mind: In search of a fundamental theory. Oxford: Oxford
University Press.
Charlow, Nate () What we know and what to do. Synthese. . , pp. –.
Charlow, Nate () Decision theory: Yes! Truth conditions: No! In Charlow, Nate and
Chrisman, Matthew (eds.) Deontic Modals. Oxford: Oxford University Press.
Chetty, Raj and Szeidl, Adam () Consumption commitments and risk preferences. The
Quarterly Journal of Economics. . , pp. –.
Cruse, D. Alan () Lexical semantics. Cambridge: Cambridge University Press.
Fintel, K. von () The best we can (expect to) get? Challenges to the classic semantics
for deontic modals. Paper presented at the Central meeting of the American Philosophi-
cal Association, February . Available from: http://mit.edu/fintel/fintel--apa-ought.pdf.
[Accessed Nov .]
Fodor, Jerry A. and Lepore, Ernie () The emptiness of the lexicon: Reflections on James
Pustejovsky’s The Generative Lexicon. Linguistic Inquiry. . , pp. –.
Gendler, Tamar Szabó and Hawthorne, John (eds.) () Conceivability and Possibility. Oxford:
Clarendon Press.
Gibbard, Allan and Harper William L () Counterfactuals and two kinds of expected utility.
In Harper, R., Stalnaker, R., and Pearce, G. (eds.) Ifs: Conditionals, belief, decision, chance, and
time. Dordrecht: D. Reidel.
Goble, Lou () Utilitarian deontic logic. Philosophical Studies. . , pp. –.
Gopnik, Alison and Schultz Laura (eds.) () Causal learning: Psychology, philosophy, and
computation. Oxford: Oxford University Press.
Greene, Joshua D., Morelli, Sylvia A., Lowenberg, Kelly, Nystrom, Leigh E., and Cohen,
Jonathan D () Cognitive load selectively interferes with utilitarian moral judgment.
Cognition. . , pp. –.
Hájek, Alan Staying regular. Manuscript, Australian National University.
Hare, R.M () Some alleged differences between imperatives and indicatives. Mind. . ,
p. .
Hay, Jennifer, Kennedy, Chris, and Levin, Beth () Scalar structure underlies telicity in
‘degree achievements’. In Mathhews, T. and Strolovitch, D. (eds), Semantics and Linguistic
Theory . Ithaca, NY: CLC Publications. pp. –.
Jackson, Frank () Decision-theoretic consequentialism and the nearest and dearest objec-
tion. Ethics. . , pp. –.
Jackson, Frank and Pargetter, Robert () Oughts, options, and actualism. The Philosophical
Review. . , pp. –.
Jeffrey, Richard C (a) Ethics and the logic of decision. Journal of Philosophy. . ,
pp. –.
Jeffrey, Richard C (b) The logic of decision. Chicago, IL: University of Chicago Press.
Kahneman, Daniel and Tversky, Amos () Prospect theory: An analysis of decision under
risk. Econometrica: Journal of the Econometric Society. . , pp. –.
Kamp, Hans () Two theories about adjectives. In Keenan, E. (ed.) Formal semantics of
natural language. pp. –. Cambridge: Cambridge University Press.
Kennedy, Chris () Vagueness and grammar: The semantics of relative and absolute gradable
adjectives. Linguistics and Philosophy. . , pp. –.
Kennedy, Chris and McNally, Louise () Scale structure, degree modification, and the
semantics of gradable predicates. Language. . , pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Kern, Mary C. and Chugh, Dolly () Bounded Ethicality. The Perils of Loss Framing.
Psychological Science. . , pp. –.
Kolodny, Niko and MacFarlane, John () Ifs and oughts. Journal of Philosophy. . ,
pp. –.
Krantz, David H., Luce, R. D., Suppes, Patrick, and Tversky, Amos () Foundations of
Measurement. New York, NY: Academic Press.
Kratzer, Angelika () The notional category of modality. In Eikmeyer and Rieser (eds.) Words,
Worlds, and Contexts. pp. –. Berlin: de Gruyter.
Kratzer, Angelika () Modality. In von Stechow and Wunderlich (eds.) Semantics: An inter-
national handbook of contemporary research. Berlin: de Gruyter.
Krifka, Manfred () Approximate interpretation of number words: A case for strategic
communication. In Bouma, G., Krämer, I., and Zwarts, J (eds.) Cognitive foundations of inter-
pretation. pp. –. Amsterdam: Koninklijke Nederlandse Akademie van Wetenschapen.
Kripke, Saul () Naming and necessity. Cambridge, MA: Harvard University Press.
Lassiter, Daniel () Gradable epistemic modals, probability, and scale structure. In Li Nan
and Lutz David (eds.) Semantics & Linguistic Theory (SALT). , pp. –.
Lassiter, Daniel () Measurement and Modality: The Scalar Basis of Modal Semantics. Ph.D
dissertation. New York University.
Lassiter, Daniel (a) Adjectival modification and gradation. In Lappin S. and Fox C (eds.)
Handbook of Contemporary Semantic Theory. Malden: Wiley-Blackwell. nd edition.
Lassiter, Daniel (b) Epistemic comparison, models of uncertainty, and the disjunction
puzzle. Journal of Semantics. . , pp. –.
Lassiter, Daniel (c) Graded modality. Manuscript: Stanford University.
Lassiter, Daniel (d) Modality, scale structure, and scalar reasoning. Pacific Philosophical
Quarterly. . . pp. –.
Lassiter, Daniel () Graded Modality: Qualitative and Quantitative Perspectives. Oxford:
Oxford University Press (in press).
Lewis, David () Counterfactuals. Cambridge MA: Harvard University Press.
Lewis, David () Reply to McMichael. Analysis. . , p. .
Lewis, David () Scorekeeping in a language game. Journal of Philosophical Logic. . ,
pp. –. doi: ./BF.
Lewis, David () Causal decision theory. Australasian Journal of Philosophy. . , pp. –.
Ludlow, Peter () Living words: Meaning underdetermination and the dynamic lexicon.
Oxford: Oxford University Press.
May, Joshua () Moral judgment and deontology: Empirical developments. Philosophy
Compass. . , pp. –.
Meek, Christopher and Glymour, Clark () Conditioning and intervening. The British
Journal for the Philosophy of Science. , pp. –.
Morzycki, Marcin () Adjectival extremeness: Degree modification and contextually
restricted scales. Natural Language & Linguistic Theory. . , pp. –.
Morzycki, Marcin () Modification. Cambridge: Cambridge University Press.
Pearl, Judea () Causality: Models, reasoning and inference. Cambridge: Cambridge Univer-
sity Press.
Plunkett, David and Sundell, Tim () Disagreement and the semantics of normative and
evaluative terms. Philosopher’s Imprint. , pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Portner, Paul and Rubinstein, Aynat () Extreme and non-extreme deontic modals. In
Chrisman M. and Charlow N. (eds.) Deontic Modals. Oxford: Oxford University Press.
Pratt, John W () Risk aversion in the small and in the large. Econometrica: Journal of the
Econometric Society. . /, pp. –.
Pustejovsky, James () The Generative Lexicon. Cambridge, MA: MIT Press.
Pustejovsky, James () Generativity and explanation in semantics: A reply to Fodor and
Lepore. Linguistic Inquiry. . , pp. –.
Putnam, Hilary () The meaning of “meaning”. In Mind, Language and Reality: Philosophical
papers. Volume . Cambridge: Cambridge University Press.
Quine, Willard van Orman () Two dogmas of empiricism. The Philosophical Review. . ,
pp. –.
Rabin, Matthew () Risk aversion and expected-utility theory: A calibration theorem.
Econometrica. . , pp. –.
Regan, Donald () Utilitarianism and Co-operation. Oxford: Oxford University Press.
Ross, Alf () Imperatives and logic. Philosophy of Science. . , pp. –.
Rotstein, Carmen and Winter, Yoad () Total adjectives vs. partial adjectives: Scale structure
and higher-order modifiers. Natural Language Semantics. . , pp. –.
Sassoon, Galit W () A typology of multidimensional adjectives. Journal of Semantics. . ,
pp. –.
Sauerland, Uli and Stateva, Penka () Scalar vs. epistemic vagueness: Evidence from approx-
imators. In Gibson M. and Friedman T. (eds.) Semantics and Linguistic Theory. . Ithaca, NY:
CLC Publications.
Shan, Chung-chieh () The character of quotation. Linguistics and Philosophy. . ,
pp. –.
Sloman, Steven A () Causal models: How we think about the world and its alternatives.
Oxford: Oxford University Press.
Sloman, Steven A. and Lagnado, David A. () Do we “do”? Cognitive Science. . , pp. –.
Stalnaker, Robert () On considering a possible world as actual. In Aristotelian Society
Supplementary Volume, Vol. , , pp. –.
Sunstein, Cass R () Moral heuristics. Behavioral and brain sciences. . , pp. –.
Wedgwood, Ralph () The Meaning of “Ought”. In Shafer-Landau R., (ed.) Oxford studies
in Metaethics. , pp. –. Oxford: Oxford University Press.
Wedgwood, Ralph () Objective and subjective ought. In Charlow, Nate and Chrisman,
Matthew (eds.) Deontic Modality. Oxford: Oxford University Press.
Yablo, Stephen () Is conceivability a guide to possibility? Philosophy and Phenomenological
Research. . , pp. –.
Yalcin, Seth () Epistemic modals. Mind. . , pp. –.
Yalcin, Seth () Probability Operators. Philosophy Compass. . , pp. –.
Yalcin, Seth (a) Bayesian expressivism. Proceedings of the Aristotelian Society. ,
pp. –.
Yalcin, Seth (b) Context probabilism. In Aloni, M., Kimmelman, V., Roelofsen, F., Sassoon,
G. W., Schulz, K., and Westera M. (eds.) Logic, language and meaning. Vol. , Lecture Notes
in Computer Science. pp. –. Heidelberg: Springer.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Contextualism about Deontic
Conditionals
Aaron Bronfman and J. L. Dowell
If you are a semanticist, how best to understand the formal semantics of modal
expressions is an issue that wears its interest on its sleeve. The issue, however, is of
broader interest and importance to those concerned with other debates. One main
task of metaethics, for example, is to understand ordinary moral and, more broadly,
normative and evaluative discourse. Identifying the best semantics and pragmatics of
deontic modal expressions in particular would make an important contribution to
metaethicists’ understanding of such discourse.
Recently, some philosophers of language and linguists have wondered whether there
are any expressions that require a relativist’s distinctive treatment. Contextualists about
some expression E hold that the contribution E makes to the determination of the
truth-conditions of utterances containing E varies from context of use to context
of use. Relativists about E, in contrast, hold that it makes an invariant contribution
to the determination of truth-conditions on any occasion of use. Unlike standard
semantic invariantists, however, relativists hold that the circumstances of evaluation
that determine the truth of utterances containing E are more fine-grained than the
standardly assumed possible worlds. What in addition is needed to determine a truth-
value, for the relativist, depends upon what E is. In the case of deontic modals, some
relativists argue that that addition is a body of information: Deontic modal sentences
are true or false at world, information pairs.1
Assessing the prospects for relativism about deontic modals is crucial to answering
the larger question of whether relativism is a viable research program. Central among
the cases that are thought to motivate relativism are cases involving deontic modals
whose truth requires that they are sensitive to a body of information in some way.2
Parfit’s miners scenario is such a case. A significant point of contention is whether
a contextualist can account for our judgments about deontic modals in that case.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
3 Some have argued that the best data for relativism about deontic and epistemic modals is given not by
data at issue here, but by data involving disagreement. For replies to the contention that the contextualist
cannot accommodate the disagreement data, see Dowell (, ).
4 To be clear, we are not suggesting that a Lewis-Kratzer-style formal semantics for modals can only be
given a contextualist construal. For all we say here, there is a relativist interpretation of that semantics that
does as well as the contextualist one we shall defend. Here we aim to assess the claim some contextualists
have defended that a Lewis-Kratzer-style semantics under a contextualist interpretation cannot fit with the
data we discuss here. (For discussion of one way to implement relativism in a Kratzer-style framework, see
Egan ().)
5 “Pragmatics deals with utterances, by which we will mean specific events, the intentional acts of
speakers at times and places, typically involving language. Logic and semantics traditionally deal with
properties of types of expressions, and not with properties that differ from token to token, or use to use. . . The
utterances philosophers usually take as paradigmatic are assertive uses of declarative sentences, where the
speaker says something. Near-side pragmatics is concerned with the nature of certain facts that are relevant
to determining what is said. Far-side pragmatics is focused on what happens beyond saying: what speech
acts are performed in or by saying what is said, or what implicatures. . . are generated by saying what is said.”
(Korta and Perry, : pp. –)
6 Along with Dowell (, , ) and Bronfman and Dowell (forthcoming), this discussion thus
contributes to the larger project of defending a Kratzer-style, flexible contextualist semantics for modal
expressions, supplemented with a near-side pragmatic account of how it is that contexts provide the
parameter values needed to secure appropriate readings.
7 For another such case study, see von Fintel () and Gillies () who each argue for a dynamic
semantic theory of counterfactuals on the grounds that the standard, Lewis-Stalnaker semantics is unable
to explain our judgments about the felicity of Sobel and reverse Sobel sequences. See also Moss (), who
defends the standard semantics by providing a near-side pragmatic explanation of our judgments in such
cases.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Below, we first explain the basic features of Kratzer’s semantics for modal expres-
sions, including conditionals. Then we consider and meet the challenge cases in turn.
Finally, we show how our readings are able to meet any remaining objections and pose
a few of our own to the rival theories thought to be motivated by these challenges to
Kratzer’s canonical view.
Kratzer-Style Contextualism
On Angelika Kratzer’s canonical semantics, modal expressions are semantically neu-
tral; they make a single contribution to the determination of a proposition on
every occasion of use. What modulates the type of modality expressed—teleological,
bouletic, deontic, epistemic or alethic—is the context of use.8 The plausibility of the
resulting view lies in part in its ability to provide simple and unified explanations
of a wide range of language use. Together with broad cross-linguistic support, the
simplicity of Kratzer’s semantics earns its status as the default view.9
Central to a Kratzer-style semantics is its treatment of modal expressions as
quantifiers over possibilities. Typically, those domains of quantification are restricted.
Restrictions not represented explicitly in the linguistic material are provided as a
function of the context of utterance. The contextual supplementation is twofold. First,
context determines a modal base, f , a function from a world of evaluation, w, to a set
of worlds, ∩f (w), the modal background.
Modal bases may be either epistemic or circumstantial. An epistemic modal base
is a function f that takes a world of evaluation w and returns the set of worlds
consistent with the body of information in w that has some property or properties.
Which properties are relevant is determined by which f is contextually selected; for
example, that function may take the information that has the property of being the
speaker’s at a designated time t in w as an argument and give us the set of worlds
compatible with that information. In principle, context might select any number of
different f s.
A circumstantial modal base is a value for f that takes a world of evaluation
as an argument and delivers a set of worlds circumstantially alike in particular
respects. Here, too, what makes a circumstance among the relevant ones at a world
of evaluation will depend upon which f is contextually selected; for example, a
particular value for f may make circumstances that determine causal relations
between actions and outcomes at the world of evaluation relevant. The modal
background in that case would be the set of worlds alike with respect to those
circumstances.
A second source of contextual supplementation is an ordering source, a function g
from a world of evaluation w to a ranking of worlds in the modal background. Which
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
features of a world w in ∩f (w) give it its relative ranking depends upon the value
for g. For example, g might rank w depending upon how well some salient agent
acts in accordance with the reasons she has or the obligations that apply to her in
w. Or it might rank w in terms of how well it approximates some impartial ideal. The
highest ranked or best such worlds make up the modal’s domain. “Ought,” the modal
of concern here, functions as a universal quantifier over its domain: “ought ϕ” comes
out true at a context-world pair just in case all of the best worlds as determined by that
context and world are ϕ-worlds.10
Since part of what is at issue in the puzzle cases here is the plausibility of a full
Kratzer-style account of deontic conditionals, we’ll need her account of the indicative
conditional on the table. On her semantics for that conditional, the function of the
antecedent is to restrict the domain of a modal in the consequent.11 This is, at least
typically, a covert necessity modal. So, a conditional of the form “if ϕ, then ” where
“” does not itself contain a modal, has the structure if ϕ, must . To see whether
such a conditional is true, we see whether every (relevant) ϕ-world is also a -world.
If so, then the conditional is true. Here we follow Kratzer in assuming that the covert
modal is an epistemic necessity modal: The relevant ϕ-worlds are the ϕ-worlds that
are compatible with some contextually determined body of information12
There are a few options for combining this account of the indicative conditional
with her semantics for modal expressions generally. For all the deontic conditionals
we discuss, we’ll adopt the view that a covert epistemic necessity modal takes scope
over the deontic modal.13 There are a couple of different readings14 of the whole
conditional, if ϕ, must[ought ] that may result, depending upon the context. In
all cases, we assume the antecedent retains its usual semantic function of restricting
the domain of the covert modal. A bit more formally, treating w as the world of
evaluation, the covert modal’s domain will be worlds, w , each of which is a ϕ-world.
To be true, the conditional then requires that the deontic modal is true at each of the
worlds w . To determine this, the deontic modal requires values for f (w ) and g(w ).
These, we’ll argue, are determined flexibly as a function of the context of utterance.
Together they’ll determine a set of worlds w that make up the deontic modal’s
domain.
10 Here we simplify aspects of Kratzer (a) and () to avoid introducing complexities of that
account not at issue here. In particular, we adopt the Limit Assumption, and we ignore the issue of how best
to mark the apparent distinction between “must” and “ought”. Cariani, Kaufmann, and Kaufmann ()
and Carr () also adopt these simplifications in expositing their own views.
11 In some cases not at issue here, the antecedent may restrict a quantifier elsewhere in the sentence, as
in “Always, if a man buys a horse, he pays cash for it” (Kratzer, b).
12 Kratzer, b.
13 For discussion of this type of view and some reasons for adopting it, see, for example, Carr (),
von Fintel (), von Fintel and Iatridou (), Frank (), Geurts (), and Kratzer ().
14 By a “reading,” we simply mean a way a listener might reasonably interpret what’s said in an utterance.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Miner Variations
. The MINERS objection
The famous miners scenario is one case thought to pose a challenge for a Kratzer-style
framework. Here is Niko Kolodny and John MacFarlane’s characterization (MINERS):
Ten miners are trapped either in shaft A or in shaft B, but we do not know which. Flood waters
threaten to flood the shafts. We have enough sandbags to block one shaft, but not both. If we
block one shaft, all the water will go into the other shaft, killing any miners inside it. If we block
neither shaft, both shafts will fill halfway with water, and just one miner, the lowest in the shaft,
will be killed. (, p. )
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
sense. This is compatible with the conditionals receiving objective readings. As we’ll
argue below, it’s easy to identify readings on which utterances of all three sentences
are true with the conditionals receiving objective readings. So, if to be deliberative is
to figure in practical reasoning, it will be easy to see how all three can be true under a
“deliberative reading.”
Moreover, insofar as conditionals can suggest that some action is the “thing to do”,
objective readings of the conditionals may also be deliberative in their third sense, e.g.
in contexts in which agents make it clear that what they objectively ought to do settles
the question of what is the thing to do. Since we believe that the contrast with objective
readings is what is most important for understanding the objection to Kratzer that
Cariani, Kaufmann, and Kaufmann seem to have in mind, we will try to improve on
their suggestions in a way that preserves this.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Let this be a case in which you do not and could not learn that Blue Blazer has
been drugged prior to placing your bet and so you go ahead and bet on that horse.
Have you done as you ought, in Schroeder’s deliberative sense? No doubt you are not
subject to legitimate criticism for betting as you do; in this sense, you have done as
you ought. But you have not done what it would be advisable for you to do. It is
not advisable for you to bet on a drugged horse. Here your utterance bears some of
the hallmarks of Schroeder’s “deliberative sense” of “ought” and my utterance bears
others. MacFarlane has characterized these “ought”s of advice as in between a so-
called “subjective” “ought”, which is tied to information within a deliberating agent’s
epistemic reach, and an objective “ought,” which is not information-sensitive. Such
“in-between” “ought”s are central to his case for relativism. Below we show how all
three readings can be made available within a Kratzer-style framework. Seeing how
this is so will be important for seeing how a Kratzer-style semantics can fit with the
pretheoretical judgments of ordinary speakers for the full range of MINERS cases.
. Kratzer-friendly readings for miner variations
Since our view is contextualist, which reading a deontic modal sentence receives will
be determined as a function of the context of utterance. This means that the best data
for testing theories will be speakers’ judgments about a series of variations on the
basic MINERS scenario, each of which fills out the conversational context in a slightly
different way. As we’ll see, which reading will be most natural for our sentences will
depend upon which version of the scenario is under consideration. One important
dimension along which MINERS scenarios may vary is in whether deliberating agents
expect to receive more information about the location of the miners prior to the time
at which they need to act. Call “EXPECTATION-KNOW” some scenario in which
deliberating agents know they will learn the location of the miners prior to that time.
Call “EXPECTATION-MIGHT” some scenario in which they know they might, but
also might not, learn their location (learning and not learning their location are equally
likely). Let “EXPECT-NOT” be a case in which agents know they won’t learn more.
We consider the following judgments to constitute theoretically neutral data:
NEITHER sounds bad—indeed, clearly false—in EXPECTATION-KNOW, sounds
unwarranted in EXPECTATION-MIGHT, and sounds fine—indeed, clearly true—in
EXPECT-NOT. IF-A and IF-B can each sound fine in any of these cases. We also accept
the Cariani, Kaufmann, and Kaufmann claim that NEITHER, IF-A, and IF-B may all
be true as uttered in the course of a single conversation among agents deliberating
about what to do in MINERS.
One of their central motivations for positing a more complex semantics16 for
deontic modals along with a novel semantic rule for deontic conditionals is their claim
that no Kratzer-style semantics can fit with the full range of this data. Our next task
is to show that this is not so by showing how a Kratzer-style semantics can secure
16 For the sense in which their semantics is more complex, see footnote .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
readings that fit with these judgments. We do this in several steps. First, we identify
readings and contexts that make NEITHER, IF-A, and IF-B all true. Then we show
how they can all be assertible in the course of a single piece of deliberation. Recall that
Cariani, Kaufmann, and Kaufmann claim that part of the data is that they are all true
under a “deliberative reading” (, p. , footnote ). As we saw, to test this, we
need to identify what it takes for a reading of a modal to be “deliberative.” We now
have a few overlapping, but distinct senses of “deliberative reading.” Following one of
their suggestions, we have the view that a deliberative reading is any reading of a modal
sentence such that, under that reading, it is properly assertible in the course of a single
piece of practical deliberation. We also, as suggested by Schroeder’s hallmarks, have
two more specific types of deliberative reading: a subjective reading, which is tied to an
agent’s available information, and an advisability reading, which is connected to what
it is advisable for a deliberating agent to do (perhaps tied to an advisor’s information).
Here we show how the Kratzerian can accommodate true readings under each of these
senses of “deliberative reading.”
We begin with NEITHER. Our reading for NEITHER will be the same for all
of our cases, but for the sake of concreteness, we’ll focus on the context in which
it sounds best, EXPECT-NOT. Here we think NEITHER receives what we’ll call a
“subjective” reading. Subjective readings are information-sensitive, where the relevant
information is, very roughly, the information a relevant agent has at the time of
action, t.17 Informally, NEITHER would seem to express the proposition that blocking
neither is, of the actions available to the deliberating agents, deontically ideal, in light
of the information they have at the time that action is necessary.18
More formally, context will select an f that maps a world of evaluation w onto the
set of worlds w like w with respect to the laws and circumstances up until the time of
action t, where we assume that this will hold fixed the options agents have in w at t.
Various issues arise in characterizing an agent’s options. We simply adopt one workable
model with the following features. (i) An option of an agent is represented as the
proposition that she performs some physical or mental action or intentional inaction.
(ii) For something to count as an agent’s option, the agent must know that she is able to
perform it and know how to implement it.19 Thus, for example, “S selects the winning
lottery number” can fail to be one of S’s options, even though, for each number, n,
17 More generally, subjective readings can be sensitive to bodies of information available at a world of
evaluation by some designated time relevant for the ranking of the agent’s options. The information need
not be limited to the information the agent actually has; it may also include information she could or should
have gathered, or information within her “epistemic reach.”
18 Some might see NEITHER as making a claim, not about deontic ideality, but rather about some
particular conception of deontic ideality, such as maximizing expected utility. While we do not rule this
out as a possible reading, we find sentences such as NEITHER to be most naturally understood as making
claims that can be the objects of dispute between, for example, consequentialists and non-consequentialists,
and so we see them as invoking the thinner notion of deontic ideality.
19 Or perhaps an agent’s options are fixed not by what she knows but, more broadly, by what she is in a
position to know she is able to perform by t.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
“S selects n” may be one of S’s options. (iii) Options are assumed to be as fine-grained
as possible. Thus “S walks” would not qualify as one of S’s options, but “S walks straight
ahead, slowly, while chewing gum. . . ” could. Because each option specifies the agent’s
behavior so precisely that she does not have any further flexibility in how to act, options
are mutually exclusive.20 For simplicity, we will treat blocking neither as such a fine-
grained option, although doing this is not necessary to our account here. (iv) Options
are assumed to take place over some fixed time period, which may vary with context.
To simplify, we assume the relevant options (e.g. blocking neither) can be performed
instantaneously: perhaps one must now irrevocably decide whether to block a shaft or
none. This helps to put aside complexities such as starting to block one shaft and then
switching to the other.
In the case of NEITHER, context will select a value for g that maps w onto the set of
worlds w in which agents perform that action, ϕ, of their options, O, that is deontically
ideal, given the information our agents have at t in w.21 Since in EXPECT-NOT that
action is blocking neither, NEITHER comes out true. Under this reading, NEITHER
is unwarranted as a conclusion of practical deliberation in EXPECTATION-MIGHT
and false in EXPECTATION-KNOW. This pattern fits precisely with our pretheoretical
judgments about these cases.
Having provided a plausible Kratzer-style reading for NEITHER, we turn now to
identifying plausible readings of IF-A and IF-B, showing how they are assertible in the
course of a single piece of deliberation. So, what should the Kratzerian say about IF-A
and IF-B in our MINERS cases? The issue here is a bit complex as these sentences have
available objective, subjective, and advisability readings in some of these cases.
Start with the objective readings in EXPECT-NOT. There are a couple of ways of
filling out the conversational context of EXPECT-NOT to secure felicitous objective
readings of these conditionals in the course of a single piece of deliberation. Imagine
that agents have not yet arrived at the place in their deliberations in which they realize
they will not learn the location of the miners by t. In that case, they may think that
what they objectively ought to do may settle their deliberative question. IF-A and
IF-B may then represent their thinking about what might objectively be the case.
Alternatively, IF-A and IF-B on an objective reading might each articulate part of their
understanding of the case. They might, for example, play such a role in a conversation
such as the following:
Emma: Ok. Here’s the situation: The miners are all trapped in either shaft A or shaft
B. Only if we use all of our sandbags to block the shaft they’re in, will we save all the
miners.
20 Here we may simplify by assuming options to be as fine-grained as possible because any Kratzer-style
semantics validates Inheritance, so settling which fine-grained options one ought to perform will settle
which coarse-grained options one ought to perform. (The principle of Inheritance holds that if ϕ entails ,
then ought ϕ entails ought .)
21 Recall that we’re trying to keep this reading simple and reader-friendly. For a more general formula-
tion of subjective readings, see footnote .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Lily: I see. So, if they’re all in A, we should block A and if they’re all in B, we should
block B.
Emma: Right. Unfortunately, we’re not going to be able to figure out where they are
by the time we have to decide what to do.
This might be the beginning of a conversation that eventuates in their deliberative
conclusion, expressed by NEITHER.
Finally, each conditional IF-A and IF-B may express a lament about the tragedy of
the situation, even after they have concluded that blocking neither is what they should
do. It would figure naturally, for example, in a conversation such as this one:
Emily: All things considered, we’re going to have to block neither shaft. It’s such a
tragedy that we don’t know where the miners are!
Lily: What’s so bad about that?
Emily: Because, if they’re in A, we should block A. And if they’re in B, we should
block B!
Representing our objective readings for IF-A and IF-B more formally requires
saying a bit about our treatment of deontic conditionals. Recall that we treat such
conditionals as doubly modalized: Each contains a covert, epistemic modal scoped
over the overt, deontic modal. On our objective reading, then, the antecedent of IF-A
restricts the domain of a covert, epistemic modal to the epistemically possible worlds
w in which the miners are in A. The modal background for the overt, deontic modal
will be the set of worlds alike with respect to the relevant circumstances in w , including
that the miners are in A. The value for g(w ) will be the worlds in which agents perform
that action, of those available to them, which is deontically ideal in light of the relevant
circumstances in w . These will all be worlds in which agents block A. So, IF-A comes
out true. Similar considerations will make IF-B come out true.
We have been discussing EXPECT-NOT, in which the conditionals may be felic-
itously uttered, though less practically useful than in EXPECTATION-MIGHT and
EXPECTATION-KNOW. Their practical use, under objective readings, improves in
those latter two cases. This won’t make those readings deliberative in either of the two
senses we identified from Schroeder’s hallmarks, what we’re calling the “subjective
sense” or the “sense of advisability.” But it will suffice to make them deliberative in
the Cariani, Kaufmann, and Kaufmann sense we’re focusing on, namely the sense of
playing a role in a single piece of deliberation.
So far, we’ve focused on Kratzer-friendly objective readings of IF-A and IF-
B that can play a deliberative role. In addition, there’s a second, information-
sensitive, Kratzer-friendly reading available, with which the features of EXPECT-NOT,
EXPECTATION-KNOW, and EXPECTATION-MIGHT are also compatible. As we’ll
see, this reading may be thought of as an advisability reading, one that is “in-between”
the subjective and objective readings. Crucially, we think these cases do not mandate
an advisability reading, rather than an objective reading, of these conditionals; indeed,
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
for EXPECTATION-KNOW, subjective readings are also available. Here we merely use
these conditionals to illustrate how advisability readings are available within a Kratzer-
style framework, as well as to show that these conditionals may receive deliberative
readings in one of the senses suggested by Schroeder’s hallmarks. Below we will
consider cases in which subjective and objective readings are unavailable, as evidence
that sometimes modals naturally take advisability readings.22
On an advisability reading of IF-A and IF-B, in addition to its usual semantic
function of updating the value for f for the covert modal, the antecedent serves to
pragmatically indicate a value for the deontic modal’s g parameter. In contrast to the
objective reading, here g(w ) will rank each world w in the deontic modal’s modal
background in terms of the deontic ideality, in light of some body of information,
of the action agents perform in w . Which information is relevant? We suggest it is
a hypothetical body of information consisting of the information deliberating agents
have at t in w together with information specifying where the miners are located in w .
Thus if the miners are located in A at w , the additional information specifies that they
are in A; if they are located in B at w , the additional information specifies that they are
in B. Since the semantic function of the antecedent of IF-A is to restrict the domain of
the covert modal to worlds in which the miners are in A, and these are the worlds at
which the overt deontic modal will be evaluated, the information getting added to the
agent’s information for the purposes of ranking their options is the information that
the miners are in A.
Adding this information to the relevant body, we’re suggesting, is a pragmatic
function of the antecedent. Since the objective reading is also available in our MINERS
cases, we are not suggesting that this reading is forced. Rather, the conversational
context permits the antecedent to play this pragmatic role.
How exactly might it play this role? One idea is that playing this role is suggested by
the relevance of the conditional for conversational purposes. In each of our MINERS
scenarios, agents are deliberating about what to do. Uttering the conditionals is to help
settle this practical question. Agents do not know the location of the miners in any of
these scenarios. In EXPECTATION-KNOW, they know they will come to know their
location. In that case, what would be deontically ideal in light of information agents are
in a position to have by t updated with information about where the miners are located
is highly relevant; indeed, it reflects the epistemic position they expect to be in. This
suffices to make uttering each of the conditionals, on this reading, highly relevant. In
EXPECTATION-MIGHT, agents know they may or may not learn the miners’ location
by t. In this case, too, IF-A and IF-B under the advisability reading will be relevant to
their practical deliberation.
Having discussed NEITHER, IF-A, and IF-B, we now turn to two additional
sentences from Cariani, Kaufmann, and Kaufmann:
22 Here and elsewhere, in offering these readings, we do not claim that no other readings are possible.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
23 This is not quite the sentence Cariani, Kaufmann, and Kaufmann consider. Theirs is “there is a shaft
we ought to block” (EXISTS ). We prefer EXISTS on the grounds that, at least in the case of EXPECTATION-
KNOW, it has a clearer action-guiding use than EXISTS .
24 The subjective reading of the conditionals is unwarranted in EXPECTATION-MIGHT and false in
EXPECT-NOT.
25 We seem to find “still” playing this role in other indicative conditionals containing context-sensitive
expressions, e.g.
A: The Sharks might win. A: If he wants to play basketball, he’s not tall.
B: But what if they lose their best player? B: No, even if he wants to play basketball, he’s still tall.
A: Even if they lose their best player, they
still might win.
26 Besides the use of the word “still,” there are other means by which conversational context can signal the
irrelevance of the antecedent to determining which action is best. In such cases it is possible for conditionals
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Von Fintel reports that he can hear this as true. We agree that there are ways
of filling out the conversational context that would make an utterance of MORE
INFORMATION felicitous. A best-case scenario will be one in which agents know
they might learn whether the miners are in A and know they’ll learn nothing else. Call
“EXPECTATION-WHETHER-IN-A-MIGHT” a version of such a case in which agents
believe they have a % chance of learning whether the miners are in A before the time
they need to act. The felicity of utterances such as MORE INFORMATION can seem
puzzling within a Kratzer-style framework, as their felicity can’t be explained by either
a subjective or an objective reading. On the subjective reading we’ve suggested, context
selects a value for f that takes a world of evaluation w to a set of worlds compatible
with facts about the agents’ options in w together with facts about which information
agents in w have at t. Suppose agents don’t learn by t whether the miners are in A.
In that case, the conditional will be false: Blowing A up is not the deontically ideal
action in light of the body of information agents will have at t as, in that case, that
body will leave open the possibility that the miners are in A and blowing up A will
result in all of their deaths. Suppose the miners are in B; in that case the objective
reading comes out false. So what would explain what makes MORE INFORMATION
sound true in EXPECTATION-WHETHER-IN-A-MIGHT? We suggest that MORE
INFORMATION most naturally receives an advisability reading. On such a reading,
recall, the antecedent, in addition to its usual semantic role, pragmatically indicates
an update to the body of information relevant for ranking the worlds in the deontic
modal’s modal background. In MORE INFORMATION, this update adds to the
not containing the word “still” to carry the reading we offer for IF-STILL. For example, if in EXPECT-NOT,
one conversational participant says, “We can’t decide what to do until we know where the miners are, since
if the miners are in A, we should block A,” another may reply, “No, if the miners are in A, we should block
neither shaft. And the same holds if they are in B. We have no way of knowing where the miners are, so
regardless of where they are, we should block neither shaft.” In this case, the conditional “if the miners are
in A, we should block neither shaft” is acceptable because context makes it clear that the antecedent is not
intended as a necessary condition for the assertion of the consequent.
27 Von Fintel ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
information agents have in worlds w in the domain of the covert modal information
about whether the miners are in A. The antecedent guarantees that each such w will
be a world in which the miners are not in A. So the worlds w in the domain of the
deontic modal will be ranked in accordance with whether the agent performs, in w ,
the action, of their options, which is deontically ideal in light of that updated body of
information. In all of the best such worlds, our agents are blowing up A. So, MORE
INFORMATION comes out true.
Von Fintel has offered an alternative explanation of how MORE INFORMA-
TION comes out true. He holds that MORE INFORMATION is “shorthand” or
“enthymematic” for the longer sentence:
“If we learn that they are not in A, we ought to blow A up.” ()
In EXPECTATION-WHETHER-IN-A-MIGHT, this sentence is straightforwardly true
on a subjective reading. In this case, if the agents learn the miners are not in A, they
will learn nothing else. In particular, they will not learn more specific information
pinpointing the location of the miners in B or in C. So if they learn the miners are not
in A, it will be deontically ideal given their information at t for them to blow A up.
Thus if von Fintel is right, there is no need to posit an advisability reading to explain
the truth of MORE INFORMATION.
There is, however, an additional piece of data that our account is better placed to
explain. Begin with the case EXPECTATION-WHETHER-IN-A-MIGHT, in which the
agents have a % chance of learning nothing about the location of the miners, and
a % chance of learning whether or not the miners are in A. Suppose the agents are
aware that if, when the time of action comes, they knowingly allow even one miner to
die unnecessarily, they will be put in jail. In this case, we hear the following sentence
as unwarranted:
If the miners are not in A, we’ll go to jail if we don’t blow A up.
Intuitively, the agents are not warranted in asserting this sentence since there is a %
chance they will remain in their state of complete ignorance about the location of the
miners. If so, they will not be put in jail for failing to blow A up. Similarly, we hear the
following sentence as unwarranted:
MORE INFORMATION JAIL: If the miners are not in A, then we ought to blow A
up and we’ll go to jail if we don’t.
But the proposal that MORE INFORMATION is read enthymematically would tend
to predict that MORE INFORMATION JAIL should be heard as warranted since it
would be read as enthymematic for:
If we learn that the miners are not in A, then we ought to blow A up and we’ll go to
jail if we don’t.
In contrast, our account, on which MORE INFORMATION is not enthymematic, does
not issue such a prediction. This is some evidence in favor of the advisability reading.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Self-Frustrating Decisions
Jennifer Carr has argued that Kratzer-style contextualism cannot adequately account
for cases of “self-frustrating” decisions. These arise in unusual cases where performing
an action would indicate the existence of reasons against performing that very action.
Carr uses the case DEATH IN DAMASCUS, from Gibbard and Harper (), as an
example:
If you are in the same city as Death tomorrow, then you’ll die. Death has planned to be wherever
he predicts you’ll be, and he’s very reliable in such predictions. Your options are to stay in
Damascus or to go to Aleppo. But, as you know, if you stay in Damascus, then that’s excellent
evidence that Death will already be there. Similarly for going to Aleppo.
(Carr, , p. )
This places you in an unfortunate situation: you expect with high probability that,
whichever decision you make, you will die. Assume you have not made up your mind
about where to go, and you now regard either city as equally likely. Your options
are then symmetric: they offer equally bad prospects. Given this symmetry, we will
assume, with Carr, that both options are permissible: you may go to either city.
Because you may go to Aleppo, we cannot say you should not go to Aleppo. Hence
we have:
ALEPPO: It’s not the case that you should not go to Aleppo.
Consider now the conditional:
IF-ALEPPO: If you go to Aleppo, you should not go to Aleppo.
The reasoning behind IF-ALEPPO goes roughly as follows: If you will in fact go
to Aleppo, then Death is very likely waiting for you in Aleppo, and so you should
not go to Aleppo. IF-ALEPPO uses the antecedent to generate a new set of proba-
bilities for Death’s location, and then evaluates your options in light of those new
probabilities. We will assume, with Carr, that IF-ALEPPO has a true reading along
these lines.
We now consider how Kratzer-style contextualism can account for the truth of
ALEPPO and IF-ALEPPO. Just as for the miners sentences, such a view will hold
that ALEPPO says what it does because context supplies appropriate values for the
parameters f and g. ALEPPO is most plausibly heard as a claim about how it is rational
or reasonable for the agent to act, given her information. It is motivated by the thought
that, given the agent’s information, it is equally reasonable to go to either city. It denies
the claim that not going to Aleppo is the agent’s uniquely most reasonable option. In
other words, it denies the claim that not going to Aleppo is deontically ideal in light of
the agent’s information.
We propose to capture the content of ALEPPO with similar parameter values to
those used for NEITHER. The modal background is circumstantial: f (w) maps a world
to the set of worlds in which the laws and circumstances up through the time of action
t are the same as they are in w. In all these worlds, the agent has the same options as
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
she does at w: go to Aleppo and stay in Damascus. The only feature that matters to how
a world w in the modal background is ranked is the option the agent chooses in w . In
particular, g(w) ranks worlds w on the basis of whether the option performed in w is
deontically ideal in light of the information that the agent has in w. If all options are
deontically ideal in light of the agent’s information at w, then g(w) treats all worlds in
f (w) as tied-for-best. Otherwise, g(w) divides the worlds into two groups: it ranks as
tied-for-best all w where the option the agent selects in w is deontically ideal in light
of her information at w, and it ranks all other worlds as tied-for-worst.
With these choices for f and g, ALEPPO plausibly comes out true according to
causal decision theory.28 Given that the case and her information are symmetric, she
judges her prospects if she were to go to Aleppo as equivalent to her prospects if she
were to go to Damascus. So a proponent of causal decision theory will hold that the
worlds where the agent stays in Damascus and those where she goes to Aleppo are all
deontically ideal. Hence some of the highest g(w)-ranked worlds in f (w) are worlds
where she goes to Aleppo, and so it is not the case that she should not go to Aleppo.
Just as for IF-A, IF-B, and MORE INFORMATION, we suggest an advisability
reading for IF-ALEPPO.29 Going through our account step by step, we see IF-ALEPPO
as doubly modalized.30 The antecedent you go to Aleppo restricts the higher, covert
epistemic modal, limiting us to what is true in all epistemically possible worlds in
which you go to Aleppo. In itself, this does not do much to help IF-ALEPPO come
out true. If context supplies the same parameter values f and g to the deontic modal
in IF-ALEPPO as it does for ALEPPO, then this conditional will come out false, since
those parameter values make the consequent you should not go to Aleppo false in all
epistemically possible worlds.
This is where the second, pragmatic role for the antecedent comes in. On our view,
the antecedent if you go to Aleppo can indicate to the hearer that different parameter
values are in play. Our suggestion is that the value of f remains unchanged: f (w) still
consists of worlds w that are circumstantially like w. But g is no longer a ranking
of worlds in terms of deontic ideality given the agent’s information. Rather, it is a
ranking of worlds in terms of deontic ideality given a hypothetical body of information
consisting of the agent’s information plus information specifying which city she goes
to in w. In other words, g(w) ranks worlds w on the basis of whether the option
performed in w is deontically ideal in light of the information that the agent has in
28 For the sake of concreteness, we follow Carr in focusing on how a causal decision theorist might
approach this case. Of course, the type of Kratzer-style semantics we’re defending is not committed to the
truth of causal decision theory as opposed to, for example, evidential decision theory.
29 Because the case stipulates only a high likelihood that Death has correctly predicted the agent’s
location, an assertion of IF-ALEPPO would not be warranted on an objective reading. A subjective reading
holds more promise since we might read IF-ALEPPO along the following lines: if the agent will go to Aleppo,
then she will know this at the time of decision, and given this knowledge at that time she subjectively should
not go to Aleppo. But we can put this aside by assuming the agent will make her decision without advance
notice of what she will decide.
30 As Carr () explains, the single-modal view would not predict the truth of IF-ALEPPO, for reasons
derived from the “If p, ought p” problem (Frank, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
w plus information specifying which city she goes to in w. Because the agent does not
know where she will go, two kinds of worlds w are epistemically possible for her: those
where this hypothetical body of information includes her going to Aleppo and those
where it includes her staying in Damascus. The antecedent of IF-ALEPPO, however,
restricts the modal background to those epistemically possible worlds where she goes
to Aleppo, and hence to worlds where the hypothetical body of information includes
her going to Aleppo. Given causal decision theory, staying in Damascus is deontically
ideal given such a body of information. The reason is that a body of information that
includes the fact that the agent goes to Aleppo supports with high probability the claim
that Death is in Aleppo, and so assigns the highest causal expected utility to the agent’s
going to Damascus.31
Our analysis of IF-ALEPPO is thus quite similar to our analysis of the advisability
reading of IF-A (and of IF-B and MORE INFORMATION). However, in discussing
the miners case, we noted that IF-A is most clearly relevant to the conversation when
it is possible we will learn the location of the miners before we need to act, since then
it offers advice that may be practically useful. IF-ALEPPO does not appear relevant
in the same way. While IF-A could potentially lead us to block A were we to learn
the miners are in A, IF-ALEPPO cannot lead the agent to stay in Damascus should
she learn that she will go to Aleppo: if the agent genuinely learns that she will go to
Aleppo, then it cannot be true that she will stay in Damascus.
However, we also noted that IF-A can be relevant even if we know we will not
learn the location of the miners before we need to act. It can be part of the process of
articulating our understanding of the situation, or part of a lament about the tragedy
of the situation. IF-ALEPPO appears to have a similar use. In conjunction with “If
you stay in Damascus, you shouldn’t stay in Damascus,” it can help to articulate our
understanding of the facts of the case. It can also express an aspect of the tragedy of
the situation, pointing out that knowledge of either decision would support doing the
opposite.
31 To follow Carr’s discussion and focus on the semantic issue, we put aside some controversy over
whether this is indeed the best interpretation of causal decision theory. See Joyce () for discussion.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
where you go to Aleppo and worlds where you go to Damascus. Carr writes that this
proposal “assumes in the very same breath that Death must be in the same place as
you and that he might not be” and concludes such a stipulation may not even be
“coherent” (p. ). Our proposal avoids such an apparent contradiction in a way
similar to the way in which Carr’s own proposal does. We distinguish between the
agent’s options and the agent’s information (actual or hypothetical) about which option
she will choose. The hypothetical information state including the agent’s information
plus the fact that she will go to Aleppo places both the agent and (with high probability)
Death in Aleppo tomorrow. Working with such a hypothetical information state is
consistent with maintaining that the agent has the option to choose either city: in
understanding the conditional, we assume hypothetically that the agent will, out of
her two actual options, choose Aleppo. Of course, the case has the odd feature that the
agent believes that, whichever city she chooses, she will be in the same place as Death,
but also believes that if she were to choose otherwise than she actually does, she would
avoid Death. Still, this feature appears to be coherent and, in any event, is essential to
the description of the case given by Gibbard and Harper.
In arguing against Kolodny and MacFarlane (), Cariani, Kaufmann, and Kauf-
mann point to transparency as a “main advantage” of their own account. They hold
that it is desirable to represent the agents’ priorities “transparently and independently
of the information available to them” (p. ). In MINERS, they represent the priorities
as saving more, rather than fewer, miners; the priorities are thus specified without
reference to the agents’ information. The account we have offered above lacks this
feature: the ranking of worlds given by our parameter value for g is not independent
of the agent’s information.
We do not see this as a disadvantage. Notice first that the ordering of worlds
in play in a given use of a modal expression may depend on many things. For
example, what one must do, legally speaking, may depend on the facts of the case,
the laws themselves, court opinions, conventions of interpretation, and perhaps
on the substance of morality itself. It would be premature to conclude from this
that the semantic value for the legal “must” has argument places for each of these
things.
In the case at hand, it is true that some substantive views about how one should
act are naturally represented as separating the role of information and priorities:
the MaxiMin view invoked by Cariani, Kaufmann, and Kaufmann is an example.
However, other substantive views do not naturally fit this model. For example, a non-
consequentialist view might impose an absolute prohibition on imposing a significant
risk of harm on an innocent person in any circumstance where this can be avoided.
The view allows trivial risks, but rules out risks above a threshold. Such a view, in its
most natural representation, invokes the agent’s information in specifying the relevant
priorities: what the agent should be concerned with is specified partly in terms of the
risks from her point of view. While one could implement such a view consistent with
the letter of transparency (for example, by having most or all of the work done by a
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
decision rule parameter, with little or no role for priorities), transparency in itself does
not appear desirable here.
Probably the main source of resistance to our proposal will be a broadly theoretical
consideration cited by both Carr and Cariani, Kaufmann, and Kaufmann: systematic-
ity.32 Indeed, Cariani, Kaufmann, and Kaufmann defend transparency at least partly
on the grounds that it secures systematicity: their account derives the ordering of
options via a novel semantic rule that operates on the antecedent of a conditional as
well as information, priorities, and a decision problem. Carr, too, cites systematicity
and derives the ordering of options from a novel semantic rule that operates on the
antecedent of a conditional as well as information and a function from information
to orderings.33 Our account appeals to no such semantic derivation. The parameter
values (e.g. the value for g that refers to deontic ideality in light of an information
state) are supplied by context directly to the consequent of the conditional, rather
than being derived by a semantic rule. In assigning this role to context, rather than
to a novel semantic rule, our account provides a near-side pragmatic explanation for
how the needed truth-conditions get assigned.
Given that, as we show above, a Kratzer-style semantics does make room for
readings that fit with our judgments in the puzzle cases, some more theoretical
consideration is required to decide between the two rival, semantic and pragmatic,
explanations of the cases. Appeal to systematicity would seem to be the right sort of
consideration to play that role. However, a significant worry about the systematicity
argument is whether the rival semantic proposals can, in fact, be developed in a
systematic way. For example, the derivations offered by Cariani, Kaufmann, and
Kaufmann assume the MaxiMin decision rule. While it is easy to generalize the
account to, say, the MaxiMax decision rule, it is not clear how to generalize it to
more sophisticated decision rules (such as expected utility maximization or non-
consequentialist approaches to uncertainty) while preserving the kinds of derivations
cited as evidence of the account’s systematicity. The ability to offer such derivations
only for a small subset of decision rules is not an advantage in systematicity.
But we will argue more directly that considerations of systematicity actually favor
our pragmatic approach. Notice first that, given the variety of readings in principle
available to a modal expression, hearers clearly do have a substantial ability to tell
which parameter values are intended for a given sentence. In particular, readings can
vary in what information or facts are relevant: it can be the information the agent will
in fact have at the time of action, the information she could or should have by that
time, an advisor’s information, or alternatively all the relevant facts (whether known
by anyone or not). A key part of our proposal is to say that the relevant information
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
may be the agent’s information plus some contextually relevant fact as in MORE
INFORMATION or IF-ALEPPO.
To see that such a proposal isn’t ad hoc, but enjoys independent support, consider
a case from DeRose () regarding “John, who has some symptoms indicative of
cancer, and a ‘filtering’ test which John’s doctor decides to run and which has two
possible results: If the results are ‘negative,’ then cancer is conclusively ruled out; if the
results are ‘positive,’ then John might, but also might not, have cancer: further tests
will have to be run.” (p. ) The test has been run, but the results are not known by
anyone. DeRose notes that Jane, who is familiar with the situation but does not have
the test results, could say:
“I don’t know whether it’s possible that John has cancer.” (p. )
What Jane does not know is, roughly, whether John’s having cancer is compatible with
what she knows combined with the information from the test results. This has a clear
structural similarity to our own account, where the relevant body of information is the
agent’s combined with some additional fact.34
On analogy with DeRose’s case, consider a version of von Fintel’s three-shaft version
of the miners case. The agents in this case know the following: They will not learn
anything about the location of the miners before the time they need to act. There is,
however, a test that can determine whether or not the miners are in A. If they are in
A, it says they are in A. If they are in B or C, it simply says they are not in A. The test
has been run, but the results are not known by anyone. An agent in the case could say:
I don’t know whether we ought to blow A up.
The information relevant to this sentence is not the information the agent will actually
have at the time of action, since that supports doing nothing and so definitively does
not support blowing A up. Nor is it all the possible information about the case, since
this would support blocking whichever shaft the miners are in. Rather it is the agent’s
information about the case combined with information specifying whether or not the
miners are in A.
This example is thus best interpreted with just the kind of parameter value we have
posited. Because “I don’t know whether we ought to blow A up” is not a conditional,
we cannot say here that the parameter value is somehow derived from a semantic rule
operating on an antecedent. Instead, the context, which includes the sentence itself,
supplies the parameter value directly to the modal. Thus, the very mechanism and
type of parameter value we posit for MORE INFORMATION and IF-ALEPPO quite
plausibly operates in this case. Our account of these sentences is systematic insofar as
it simply extends a mechanism we already have reason to accept.
Indeed, our account offers a particularly unified and systematic account of the
following piece of discourse (TEST):
34 See Dowell () for discussion of a version of DeRose’s case in keeping with our present proposal.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
I don’t know whether we ought to blow A up. If the miners are not in A, we ought
to blow A up. If the miners are in A, we ought not to blow A up.
On our account, context supplies the same parameter values to the modals in all of
these sentences: the ordering g ranks worlds on the basis of what is deontically ideal
in light of the agent’s information plus the fact about whether or not the miners are
in A. Below we will consider in more detail a dilemma that this case raises for our
rivals. But at this point, it is worth noting that the pattern of explanation they offer
for MINERS and DEATH IN DAMASCUS, in which a semantic rule operates on a
contextually supplied parameter value (or values) in conjunction with the antecedent
of a conditional to yield a new ordering, would be overly complex here. On that
pattern, one would end up saying that context supplies one value for g to the first
sentence, and then a different value g to the two conditionals, but that this value gets
operated on in conjunction with the antecedents to yield a new ordering, possibly the
same as the original g, that gets the truth conditions right.35 Such an account invokes
both a novel semantic rule with additional parameter(s) and an unmotivated context
shift. We posit neither of these, and thus offer a simpler, more systematic account of
this discourse.
Carr offers a final objection to the potential for a Kratzer-style account to handle the
data in DEATH IN DAMASCUS: equivocation. She writes that an adequate account
must allow an “unequivocal treatment of expressions of reasonable decision theories.”
Carr seems to grant here that a Kratzer-style view could accommodate the data with
appropriate parameter values, but maintains that ALEPPO and IF-ALEPPO “should
be compatible at a single context” on the grounds that “they are all the deliverances of a
unified and coherent body of norms: namely, causal decision theory plus the desire to
avoid death” (p. ). Similarly Cariani, Kaufmann, and Kaufmann write, “we believe
(although there is naturally room for further argument) that the data point is that
it is these very conditionals (i.e. [IF-A and IF-B]) that are true on the deliberative
interpretation of ought” (p. , footnote ). This, too, suggests the view that a single
parameter value must be in play for NEITHER, which is clearly deliberative, as well as
IF-A and IF-B.
If there is an intuition that NEITHER, IF-A, and IF-B are true on the very same
readings of the modals, we are not sure just what it is supposed to be. As a point
of comparison, consider a reading of IF-A on which it is false. Cariani, Kaufmann,
and Kaufmann call this the “non-reflecting” reading of the conditional, which they
elucidate as follows:
If the miners are in shaft A, we (still) ought to block neither shaft, for their being in shaft A
doesn’t mean that we know where they are. Indeed, no matter where the miners are, we ought
to block neither shaft. (p. )
35 On Carr’s account, this could be implemented via a context shift in the information parameter.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
It does seem plausible that this instance of “if the miners are in shaft A, we ought to
block neither shaft” involves the very same reading of the modal as does NEITHER;
indeed, Cariani, Kaufmann, and Kaufmann “take it to be obvious that the non-
reflecting interpretation is deliberative” (p. , footnote ). Yet, as these authors
acknowledge, there is not quite this feeling of sameness between IF-A when heard
as true and NEITHER (p. , footnote ; p. , footnote ; p. , footnote ).
We agree that there is some feeling of commonality between IF-A, even when heard
as true, and NEITHER. Indeed, our account explains this in line with Carr’s idea that,
in DEATH IN DAMASCUS, sentences like these are “the deliverances of a unified and
coherent body of norms.” On our view, IF-A and NEITHER both invoke an ordering
of worlds in terms of deontic ideality. These sentences do flow from the same norms,
perhaps even norms of causal decision theory. However, there is a slight difference
between them: NEITHER applies these norms to our actual information, while IF-
A (on its advisability reading) applies these norms to our actual information plus
information specifying the location of the miners. Our account therefore explains why
the feeling of commonality is weaker here than it is between IF-A on its non-reflecting
reading and NEITHER. In the latter case, the ordering is identical between the two
sentences, while in the former it is merely very similar.
One might try to press the point about equivocation against our proposal on more
theoretical grounds. Carr, as quoted above, may hold as a desideratum that it should
be possible in English to express the deliverances of causal decision theory without a
change in contextually supplied parameter values. We need not decide whether this
is a legitimate desideratum, since our view satisfies it. On our view, one may express
the deliverances of causal decision theory without a change in contextually supplied
parameter values by using the subjective “ought” in a set of conditionals of the form: “if
your credences and values are . . . , you ought to . . ..” Our view would not, of course,
meet the stronger desideratum that every discourse that expresses the deliverances
of causal decision theory must involve no change in contextually supplied parameter
values. But this stronger desideratum would be unmotivated given the general utility to
conversation of such changes for modals, quantifier domains, and demonstratives.36 , 37
Considerations of whether a set of sentences is intuitively equivocal may ultimately
support our account in a different way. Consider again the piece of discourse men-
36 For example, we easily navigate an unannounced shift from the circumstantial modality of
“hydrangeas can grow here” to the legal modality of “you can plant anything that won’t block visibility
around the corner.” It would be possible, but more cumbersome, to express these thoughts without relying
on context to supply the appropriate parameter values. Dowell (, ) defends the view that our ability
to detect speaker intentions underwrites our competence with contextually supplied parameter values.
37 Jennifer Carr (pc) asks whether our willingness to posit covert modals, combined with contextual
flexibility, might undermine the reasons for accepting the view of if -clauses as restrictors. She suggests that
reading “if p, then q” as must(p ⊃ q), where an epistemic “must” takes scope over a material conditional,
would then offer a simpler semantics. We note that such a view would need to add a condition to avoid
the consequence that “if p, then q” is true whenever must(∼p) is true. Given this, we do not regard the
alternative as simpler than the restrictor view. The viability of the alternative depends in part on whether
it can be developed to deal systematically with the full range of data that have been taken to motivate the
restrictor view (e.g. Lewis, ), a question we do not explore here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
tioned above, TEST, for von Fintel’s three-shaft version of the miners case, where we
stipulate the existence of test results that tell whether or not the miners are in A:
I don’t know whether we ought to blow A up. If the miners are not in A, we ought
to blow A up. If the miners are in A, we ought not to blow A up.
These sentences do not appear to be equivocal. This leads to a dilemma for the views
of Carr and Cariani, Kaufmann, and Kaufmann. They may choose to go contrary to
this intuition, and insist that context supplies different parameter values for the first
sentence than for the conditionals. This, we think, would further undermine their
appeal to a sense of commonality between IF-A and NEITHER to argue for sameness
of contextually supplied parameter values there. As mentioned above, it also appears
to be a needlessly complex account of this piece of discourse, one which invokes both
a novel semantic rule with additional parameter(s) and an unmotivated context shift,
in contrast to our straightforward account.
Alternatively, they could hold that context supplies the same parameter values
to all these sentences. In that case, they grant that conditionals such as MORE
INFORMATION, IF-A, and IF-ALEPPO receive, in this and similar cases, their true
readings simply because context supplies the appropriate parameter values, and not
because a novel semantic rule is in play. This puts strong pressure on their accounts
to hold that these conditionals receive their true readings in other cases, such as the
standard MINERS or DEATH IN DAMASCUS cases, for the same reason. Overall,
on this branch, all sides are committed to the core of the Kratzer semantics and to
the existence of a pragmatic mechanism that can assign the kind of parameter values
we have posited in the standard MINERS or DEATH IN DAMASCUS cases. Our
opponents add commitments to a more complicated semantics and to the claim that
the pragmatic mechanism that is common ground cannot operate in those cases. The
additional commitments of the opposing views might be justified if there were strong
independent reason to hold that those cases cannot involve a shift in contextually
supplied parameter values, but our investigation of this issue above has revealed no
such reason. Theoretical virtues would thus appear to favor our account over the
alternatives.
Conclusion
There has been much recent work in the literature in the philosophy of language,
linguistics, and metaethics over whether a Kratzer-style contextualist semantics for
modal expressions can be made to fit with the full range of data in a series of puzzle
cases. Part of what is at stake in these debates is the viability of relativism as a research
program in the philosophy of language and linguistics; much of the motivation
for relativism is the claim that no contextualist semantics for some expression E is
plausible.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Also at issue is to what extent linguists and philosophers of language should prefer
semantic over near-side pragmatic explanations of ordinary speakers’ judgments that
make up the primary data for semantic and pragmatic theories. As we’ve seen here,
some contextualists, like relativists, defend their novel semantic proposals on the
grounds that no Kratzer-style semantics can be made to fit with the full range of data
in MINERS or in DEATH IN DAMASCUS. Here we have shown how this is not so, by
identifying available Kratzer-friendly readings that fit with the agreed-upon data. This
includes showing both how all of NEITHER, IF-A, and IF-B may be true in a single
piece of practical reasoning in MINERS and how ALEPPO and IF-ALEPPO may both
be true in DEATH IN DAMASCUS, under the assumption that the norms in play are
those of causal decision theory.
The remaining objection to Kratzer-friendly readings in those cases is the claim
that it is “ad hoc” or “unsystematic” to suppose that context is able to secure the
needed parameter values, as our pragmatic explanation requires. But we have seen
how this objection is misplaced; clear features of the context in which those utterances
are felicitous are features that make the needed parameter values highly salient. We
have also, in considering DeRose’s case and a parallel three-shaft MINERS case,
identified independent evidence for the contextual availability of what we are calling
the “advisability” readings that we posit for MORE INFORMATION and for IF-
ALEPPO. This is further evidence that positing such readings is not ad hoc, but
required on grounds independent of the issues raised here.
Finally, we have seen that the charge that the readings we identify are “unsystematic”
is also misplaced. Our theory is able to identify the sense of commonality between
NEITHER and IF-A and IF-B that is pretheoretically plausible and likewise for the
piece of discourse TEST. In contrast, our opponents are faced with a dilemma. In
accounting for TEST, they must either offer a needlessly complex explanation that
undermines their general appeal to the sameness of contextually supplied parameter
values, or else endorse all the materials needed for own account, thus making their
additional commitments appear superfluous.
We conclude that a Kratzer-style, flexible contextualist semantics, supplemented
with a near-side pragmatic account of how it is that contexts supply the needed
parameter values, remains the view to beat.
Acknowledgments
We are grateful to Fabrizio Cariani, Jennifer Carr, Nate Charlow, and Matthew Chrisman for
detailed and helpful comments.
References
Bronfman, A. and Dowell, J. L. (forthcoming) The Language of Reasons and Ought. In Star, D.
(ed.) The Oxford Handbook of Reasons and Normativity. Oxford: Oxford University Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Cariani, F., Kaufmann, M., and Kaufmann, S. () Deliberative Modality under Epistemic
Uncertainty. Linguistics and Philosophy. ., pp. –.
Carr, J. () The “If p, Ought p” Problem. Pacific Philosophical Quarterly. ., pp. –.
Charlow, N. () What We Know and What to Do. Synthese. ., pp. –.
DeRose, K. () Epistemic Possibilities. Philosophical Review. ., pp. –.
Dowell, J. L. () A Flexible Contextualist Account of Epistemic Modals. Philosophers’ Imprint.
., pp. –.
Dowell, J. L. () Contextualist Solutions to Three Puzzles about Practical Conditionals.
Oxford Studies in Metaethics. , pp. –.
Dowell, J. L. () Flexible Contextualism about Deontic Modals. Inquiry. .–, pp. –.
Egan, A. () Relativism about Epistemic Modals. In Hales, S. D. (ed.) A Companion to
Relativism. pp. –. Malden, MA: Wiley-Blackwell.
Fintel, K. von () Counterfactuals in a Dynamic Context. In Kenstowicz, M. (ed.) Ken Hale:
A Life in Language. pp. –. Cambridge, MA: MIT Press.
Fintel, K. von () Conditionals. In von Heusinger, K. Maienborn, C. and Portner, P. (eds.)
Semantics: An International Handbook of Natural Language Meaning. Vol. . pp. –.
Berlin: de Gruyter.
Fintel, K. von () The Best We Can (Expect to) Get? Ms.
Fintel, K. von and Iatridou, S. () What to Do If You Want to Go to Harlem. Ms.
Frank, A. () Context Dependence in Modal Constructions. Ph.D. dissertation. University of
Stuttgart.
Geurts, B. () On an Ambiguity in Quantified Conditionals. Ms.
Gibbard, A. and Harper, W. L. () Counterfactuals and Two Kinds of Expected Utility. In
Hooker, C., Leach, J., and McClennen, E. (eds.) Foundations and Applications of Decision
Theory. pp. –. Dordrecht: Reidel.
Gillies, T. () Counterfactual Scorekeeping. Linguistics and Philosophy. ., pp. –.
Joyce, J. () Regret and Instability in Causal Decision Theory. Synthese. ., pp. –.
Kolodny, N. and MacFarlane, J. () Ifs and Oughts. The Journal of Philosophy. .,
pp. –.
Korta, K. and Perry, J. () Pragmatics. In Zalta, E. N. (ed.) The Stanford Encyclope-
dia of Philosophy. Winter. [Online] Available from: <http://plato.stanford.edu/archives/
win/entries/pragmatics/>. [Accessed Nov .]
Kratzer, A. () What “Must” and “Can” Must and Can Mean. Linguistics and Philosophy. .,
pp. –.
Kratzer, A. () The Notional Category of Modality. In Eikmeyer, H. and Rieser, H. (eds.)
Words, Worlds, and Contexts. pp. –. Berlin: de Gruyter.
Kratzer, A. (a) Modality. In von Stechow, A. and Wunderlich, D. (eds.) Semantics: An
International Handbook of Contemporary Research. pp. –. Berlin: de Gruyter.
Kratzer, A. (b) Conditionals. In von Stechow, A. and Wunderlich, D. (eds.) Semantics: An
International Handbook of Contemporary Research. pp. –. Berlin: de Gruyter.
Kratzer, A. () Modals and Conditionals. Oxford: Oxford University Press.
Lewis, D. () Adverbs of Quantification. In Keenan, E. L. (ed.) Formal Semantics of Natural
Language. pp. –. Cambridge: Cambridge University Press.
MacFarlane, J. () Assessment Sensitivity. Oxford: Oxford University Press.
Moss, S. () On the Pragmatics of Counterfactuals. Noûs. ., pp. –.
Schroeder, M. () Ought, Agents, and Actions. Philosophical Review. ., pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Objective and Subjective ‘Ought’
Ralph Wedgwood
Over the years, several philosophers have argued that deontic modals, such as ‘ought’
and ‘should’ in English, and their closest equivalents in other languages, are systemat-
ically polysemous or context-sensitive. On this view, in effect, there are many different
concepts that can be expressed by ‘ought’—as we might call them, many different
‘ “ought”-concepts’—and whenever the term is used, the particular context in which it
is used somehow determines which of these concepts it expresses on that occasion.
More specifically, one way in which these ‘ought’-concepts differ from each other
is that some of them are more ‘objective’, while others are more ‘subjective’ or
‘information-relative’. When ‘ought’ expresses one of these more objective concepts,
what an agent ‘ought’ to do at a given time may be determined by facts that neither the
agent nor any of his friends or advisers either knows or is even in a position to know;
when it expresses one of the more ‘subjective’ concepts, what an agent ‘ought’ to do is
in some way more sensitive to the informational state that the agent (or his advisers
or the like) find themselves in at the conversationally salient time.1
In this chapter, I shall first present some linguistic evidence in favour of this view of
‘ought’. Then I shall propose a precise account of the truth conditions of sentences
involving terms that express these different ‘ought’-concepts. Unfortunately, in the
available space I shall not be able to do much more than simply to propose this
semantic account of these ‘ought’-concepts. In my opinion, the linguistic evidence
makes this account more plausible than any alternative account that metaethicists or
semanticists have devised so far; but I shall only be able to gesture in the direction of
this evidence here.
The general idea of the kind of account that I shall propose is not new. It is basically
akin to the theory of ‘subjective rightness’ that was given by Frank Jackson ()—
since like Jackson’s theory, it gives a starring role to the notions of probability and
of the expected value of a proposition. Nonetheless, my account has several crucial
1 For some philosophers who have advocated distinguishing between the objective and the subjective
‘ought’, see Brandt (, pp. –), Ewing (), Parfit (, p. ), Jackson (), Jackson and Pargetter
(, p. ), and Gibbard (). In a somewhat similar way, Sidgwick (, p. ) distinguished between
objective and subjective rightness and wrongness.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
containing ‘ought’ contain hidden variables (or hidden terms of any other kind), so
that the way in which the term’s semantic value shifts between contexts results simply
from different items’ being referred to by these hidden terms, or whether some other
syntactic phenomenon underlies these shifts. I shall not even rule out the idea that the
term ‘ought’ is syntactically simple and unstructured, and simply demands different
semantic interpretations in different contexts.
Similarly, with respect to the second component (b), I shall not here defend any
particular view of what it is to understand or to be a component user of the term. In
fact, I am inclined to favour a certain sort of account of this second component of a
term’s meaning. Specifically, according to an account of this sort, we can explain what
it is to be linguistically competent with a term by appealing to the range of concepts
that the term can be used to express: to be linguistically competent with the term
requires having the ability to use the term to express concepts within that range (in
a way that enables competent hearers to interpret one’s use of the term as expressing
the concept within that range that one intends to express). Then the nature of each of
these concepts can be explained in terms of the conceptual role that the concept plays
in one’s thinking, and in terms of the way in which this conceptual role determines the
object, property, or relation that the concept stands for or refers to.2
However, even though I am attracted to this view of what linguistic competence
consists in, I shall not attempt to defend this view here. Instead, I shall simply give
an account of the range of truth conditions that sentences involving ‘ought’ can have.
To bring out the similarity between the different truth conditions in this range, I shall
put my account in the form of a schema involving three different parameters; as I shall
explain, the different truth conditions that a sentence involving ‘ought’ can have in
different contexts all correspond to different ways of setting these three parameters.
So, in effect, something in the conversational context in which the term ‘ought’ is used
must determine what these parameters are; I shall try to comment, at least in passing,
on what features of the conversational context could do this.
A Semantic Framework
The general semantic approach that I shall take here is in line with what could be
called the ‘classical’ semantics for deontic logic. According to this approach, ‘ought’
and ‘should’ and their equivalents in other languages are all broadly modal terms, just
like ‘must’, ‘may’, ‘can’, and the like. Every occurrence of ‘ought’ expresses a concept
that functions as a propositional operator—that is, as a concept that operates on a
proposition (the proposition that is expressed by the sentence that is embedded within
the scope of this occurrence of ‘ought’), to yield a further proposition (the proposition
that is expressed by the sentence in which this occurrence of ‘ought’ has largest scope).
2 I have attempted to sketch some parts of this story elsewhere; see especially Wedgwood (,
chapters –).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Thus, for example, the occurrence of ‘ought’ in the English sentence ‘This room
ought to be swept’ expresses an ‘ought’-concept that operates on the proposition that is
expressed by the embedded sentence ‘This room is swept’. So the proposition expressed
by the sentence ‘This room ought to be swept’ has the logical form ‘O (This room is
swept)’, where ‘O (. . .)’ is the relevant ‘ought’-concept. In a proposition of the form
‘O(p)’, I shall call the proposition p on which the relevant ‘ought’-concept operates the
‘embedded proposition’.
In general, the conditions under which a sentence expressing such an ‘ought’-
proposition is true at a possible world can be specified as follows. For every such
sentence, and for every possible world w, there is a function that maps possible worlds
onto domains of possible worlds, and a relevant ordering on these worlds, such that
the sentence expressing the ‘ought’-proposition ‘O(p)’ is true at w if and only if, out of
all worlds in the domain that this function assigns to w, the embedded proposition p
is true at all worlds that are not ranked any lower down in this ordering than any other
worlds in this domain.3
If—as will usually be the case—it is possible to express this ordering by means
of words like ‘better’ and ‘worse’, then we can say more simply that the sentence
expressing ‘O(p)’ is true at w if and only if the embedded proposition p is true at all
the optimal worlds in the relevant domain. So, for example, the sentence ‘This room
ought to be swept’ is true at w if and only if the proposition that this room is swept is
true at all the relevantly optimal worlds in the relevant domain.4 So long as there are
always some worlds in the relevant domain that count as optimal in the relevant way,
it turns out that all of the principles of standard deontic logic—in effect, the modal
system KD—will be valid for every ‘ought’-concept.
In this way, this classical approach to the semantics of ‘ought’ involves two param-
eters: a function that maps each possible world onto a domain of possible worlds,
and the relevant ordering on these worlds. As I shall propose in the third section of
this paper, this ordering of worlds can itself be regarded as having an expectational
structure: that is, there is some kind of value, and some probability distribution,
such that this ordering of the worlds is equivalent to an ordering in terms of the
expected value of the worlds, according to this probability distribution. However,
before developing this expectational conception of the relevant ordering, I shall survey
some of different concepts that the term ‘ought’ can express.
3 So, if the proposition ‘O(p)’ is not to be trivial, there must be some worlds that are not ranked any lower
in this ordering than any other worlds in the domain. That is, what David Lewis () called the ‘Limit
Assumption’ must hold. Some philosophers—such as Eric Swanson ()—have denied that the Limit
Assumption must hold for all ‘ought’-concepts. But in my view, there are independent reasons for thinking
that it must hold. Specifically, if ‘ought’ agglomerates over conjunction—including infinite conjunction—
and ‘ought’ implies logical possibility, then it seems that the Limit Assumption must indeed hold: that is, in
effect, there must be a possible world where everything is as it ought to be.
4 This ‘classical’ semantics for deontic operators was defended by such pioneering deontic logicians
as Åqvist () and Lewis (). My defence of this classical semantics is given in Wedgwood (,
chapter ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
5 See especially Wedgwood (, section ., and , section ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
between the more objective and the more subjective versions of that kind of ‘ought’.
Moreover, it seems that it must be broadly speaking the same kind of systematic
connection in each case. The next two sections of this chapter will focus on exploring
this connection.
In addition to giving an account of the relationship between the subjective and
objective versions of each of these kinds of ‘ought’, I shall also aim to unify my account
of these phenomena with yet another kind of ‘ought’—specifically, with the so-called
epistemic ‘ought’, as in:
() Tonight’s performance ought to be a lot of fun.
This seems just to mean, roughly, that it is highly probable given the salient body of
evidence that tonight’s performance will be a lot of fun. If this is indeed at least roughly
what the epistemic ‘ought’ means, then it is clear that the ‘salient body of evidence’ need
not include the total evidence available to the speaker at the time of utterance, since it
seems that even if one knows that the orbit of Pluto is not elliptical, it might be true
for one to say:
() The orbit of Pluto ought to be elliptical (although of course it isn’t).
I shall aim to give an account of the semantic values of a range of uses of ‘ought’ that
includes these uses of the term.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
metaphysical thesis about the nature of the orderings in question. In fact, I shall argue
in Section that there are reasons, concerning the truth conditions of conditional
sentences involving ‘ought’, for interpreting this expectational conception in the first
way, as built into the semantics of terms such as ‘ought’. For the time being, however,
we simply shall leave open the question of which interpretation of this expectational
conception is correct.
In the rest of this section, I shall explain this expectational schema in more detail,
starting with some comments on each of its three elements—the domain function f ,
the probability distribution E, and the value function V.
(i) The first element of any instance of this expectational schema is familiar: it is a
domain function f , which maps every world w on to the relevant domain of possible
worlds f (w). It is this function that identifies the worlds that are, as we might put
it, ‘up for assessment’ by the ‘ought’-concept in question, relative to w. In effect, this
function f fixes what Angelika Kratzer (, chapter ) called the ‘modal base’—
the set of propositions that are true throughout the domain of worlds that are up
for assessment by the ‘ought’-concept, relative to w. We shall explore some specific
examples of such domains of worlds in the next section.
(ii) The second element of any instance of this expectational schema is a proba-
bility distribution E. I shall assume that every probability distribution is a function
that assigns real numbers in the unit interval from to to the propositions in
a propositional algebra (that is, a set of propositions that is closed under Boolean
operations like negation, disjunction, and so on). Any function of this sort that obeys
the fundamental axioms of probability theory counts as a probability distribution. So,
in particular, the omniscient probability function—the function that assigns to every
true proposition and to every false proposition in the relevant algebra—is itself a
probability distribution.
Another way of thinking of such probability distributions is as defined over a space
of possible worlds, relative to a certain ‘field’ of subsets of this space of worlds. This
‘field’ also constitutes an algebra, in the sense that it is closed under operations such as
complementation, union, and the like; the probability function assigns real numbers
to the sets of worlds in this field.6 This probability function can be thought of as a
measure on the space of worlds: intuitively, it tells us how much of the whole space of
worlds is taken up by each set in this field. (This is why the probability measure has
to obey a basic additivity principle: the proportion of the whole space taken up by
the union of any two disjoint sets of worlds is the sum of the proportions taken up by
6 For technical reasons (see Easwaran ), if the space contains indenumerably many worlds, it may
not be possible to assign a probability to every set of worlds in the space—it may be that only certain sets of
worlds can have a probability assigned to them. This is why the probability distribution is defined over the
worlds only relative to a ‘field’ of sets of worlds—where this field contains all and only those sets of worlds
that correspond to propositions in the relevant algebra. Fortunately, this complication will not matter for
present purposes.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
7 Invoking two spaces of possible worlds—a space of epistemically possible worlds and a domain of
metaphysically possible worlds—in this way sets my account apart from most previous accounts of deontic
modals, which have typically sought to explain the semantic value of these modals purely in terms of a single
domain of possible worlds. Thus, for example, the account of Silk () resembles mine to the extent that
it allows the ordering on the worlds to vary with an ‘information state’, but for Silk this information state
is simply a kind of modal base, and so is nothing more than a ‘set of worlds’ (p. ). The main exception
is Jennifer Carr (, p. ) who proposes that the semantic value of deontic modals involves a modal
background, a probability function, and a value parameter (although confusingly she describes the modal
background and the probability function as together constituting an ‘informational parameter’). The main
difference between my account and Carr’s is that her account involves yet another parameter, a ‘decision
rule parameter’, which seems unnecessary to me; she also does not distinguish between metaphysically
and epistemically possible worlds in the way that I regard as important. (For further discussion of Carr’s
proposal, see Section below.)
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
the Russellian propositions that are true at those worlds: a metaphysically possible
world w is identical to a metaphysically possible world w if and only if exactly
the same Russellian propositions are true at w and w . The Russellian proposition
that you visit Hesperus is composed out of you, the visiting relation, and the planet
Hesperus itself. This proposition is therefore identical to the Russellian proposition
that you visit Phosphorus. Since the propositions that in this way individuate a
possible world must form a logically complete and consistent set, this explains why
there cannot be metaphysically possible worlds in which you visit Hesperus but not
Phosphorus.
By contrast, the epistemically possible worlds are individuated by the Fregean
propositions that are true at those worlds—where Fregean propositions are structured
entities that are composed, by means of operations such as predication and the like, out
of concepts, which are modes of presentation of such entities as individuals, properties,
and relations. An epistemically possible world w is identical to an epistemically
possible world w if and only if exactly the same Fregean propositions are true at
w and w . Since one and the same planet may have several different modes of
presentation—including a ‘Hesperus’ mode of presentation and a ‘Phosphorus’ mode
of presentation—this allows for the existence of an epistemically possible world in
which you visit Hesperus but not Phosphorus.
(iii) Finally, the third element of any instance of this expectational schema is a value
function of a certain kind.
In general, this value function will evaluate a certain set of alternatives—such as a
set of alternative acts, or the like. When we speak of an ‘act’ here, it seems that what we
really mean is a proposition to the effect that the relevant agent performs an act of the
relevant type at the relevant time. So a more general account would involve regarding
this value function as evaluating a certain set of alternative propositions.
To say that these propositions are ‘alternatives’ to each other is to say that they are
mutually exclusive: no more than one of these propositions is true at any world in
the relevant domain of metaphysically possible worlds. I shall also assume that these
propositions are jointly exhaustive: that is, at least one of these proposition is true at
every world in this domain. In other words, this set of propositions forms a partition
of this domain of worlds: at every possible world in this domain, exactly one of these
propositions is true.
Since no more than one of these propositions is true at every world in this domain,
and there is no metaphysically possible world where you visit Hesperus without also
visiting Phosphorus, the proposition that you visit Hesperus cannot be a distinct
member of this set of propositions from the proposition that you visit Phosphorus.
Thus, the propositions in this set must be Russellian propositions (indeed, each such
proposition might simply be identified with a subset of the domain of metaphysically
possible worlds). In effect, every such value function provides a set of Russellian
propositions {A , . . . An } that forms a partition of the relevant domain of worlds, and
assigns a value to each of these propositions.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
We may think of the value that the value function assigns to each Russellian
proposition Ai in this set as a real number V(Ai ) that represents the value of
this proposition Ai . This measure is presumably not unique: the choice of unit
will obviously be arbitrary (just as it is arbitrary whether we measure distance in
miles or kilometres), and the choice of zero point may also be arbitrary as well
(just as it is arbitrary whether we take the zero point on a thermometer to be
Fahrenheit or Celsius). But to fix ideas, let us suppose that except in these two
ways, this value function is not arbitrary. Given an arbitrary choice of a unit and
a zero point, this function gives the true measure of the relevant value. In more
technical terms, we are supposing that the value in question can be measured on an
interval scale.8
(iv) In this way, any instance of this expectational schema involves three items:
a function f from each metaphysically possible world to a relevant domain of such
worlds; a probability distribution E; and a value function V defined over a set of
propositions that constitutes a partition of the relevant domain of metaphysically
possible worlds. To represent the fact that a particular instance of the expectational
schema gives an account of the conditions under which a use of a sentence involving
‘ought’ is true, I shall explicitly index this occurrence of ‘ought’ to this trio of items:
‘Ought<f ,E,V> ’.
I have proposed that the value function V is defined over a set of Russellian
propositions that forms a partition of the domain of metaphysically possible worlds.
However, the probability distribution E can assign probabilities to hypotheses about
the value that V assigns to various propositions—where each of these hypotheses
is, in effect, a Fregean proposition. For example, such hypotheses might include:
‘The proposition that I visit Hesperus has value n’, and ‘The proposition that I visit
Phosphorus has value m’—where these two hypotheses are distinct from each other.
In this way, the hypotheses to which E assigns probability refer to Russellian
propositions by means of modes of presentation—where these modes of presentation
of Russellian propositions are, in effect, Fregean propositions. It seems that just as the
relevant set of Russellian propositions forms a partition of the domain of metaphysi-
cally possible worlds, the corresponding set of Fregean propositions forms a partition
of the space of epistemically possible worlds. Since each of these hypotheses involves
a Fregean proposition A E (as a mode of presentation of a Russellian proposition AM ),
we may think of each of these hypotheses as having the form ‘V(AE ) = n’.
We can now give a definition of the EV-expected value of a Fregean proposition
AE , in the following way. Consider a collection of hypotheses {h , . . . hk }, where each
hypothesis hi has the form ‘V(AE ) = ni ’. Suppose that this collection of hypotheses
also forms a partition, in the sense that it is epistemically certain that exactly one of
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
these hypotheses is true; and suppose that E assigns a probability to each of these
hypotheses. Then the EV-expected value of AE is the probability-weighted sum of
the values of AE according to these hypotheses, where the value of AE according to
each hypothesis is weighted by the probability of that hypothesis. Symbolically, the
EV-expected value of AE is:
i ni E(V(AE ) = ni ).
9
Since the set of Fregean propositions that feature in these hypotheses forms a parti-
tion of the epistemically possible worlds, the epistemically possible worlds themselves
can be ordered in terms of the EV-expected value of the proposition in this set that is
true at each world. Let us say that the epistemically possible worlds that are not ranked
lower down in this ordering than any other such worlds have ‘maximal EV-value’.
For each of these epistemically possible worlds, we need to find the metaphysically
possible worlds that in the relevant way ‘correspond to’ that epistemically possible
world. In the simple cases, a metaphysically possible world wM corresponds to an
epistemically possible world wE if and only if all the Fregean propositions true at
wE are true at wM (that is, these Fregean propositions are modes of presentation
of Russellian propositions that are true at wM ). In more complex cases (such as
epistemically possible worlds in which you visit Hesperus but not Phosphorus), there
are no metaphysically possible worlds where all these Fregean propositions are true. In
these cases, we would have to give a different account of what it is for a metaphysically
possible world wM to ‘correspond to’ an epistemically possible wE ; for example, we
could say that that the metaphysically possible worlds that correspond to wE are those
at which a maximal subset of the Fregean propositions that are true at wE are true.
We can now define a selection function S<f , E, V> over the metaphysically possible
worlds that will pick out the metaphysically possible worlds that correspond to
the epistemically possible worlds with maximal EV-value: for any metaphysically
possible world wM , wM belongs to the subset of f (w) picked out by this selection
function S<f , E, V> (f (w)) if and only if for some epistemically possible world wE , wM
corresponds to wE , and wE has maximal EV-value.
The truth conditions of sentences of the form ‘Ought<f ,E,V> (p)’ can now be
specified in terms of this selection function S<f , E, V> :
‘Ought<f ,E,V>(p)’ is true at w if and only if p is true at every world w ∈
S<f , E, V>(f (w)).
Let us illustrate this proposal by considering the example of Frank Jackson’s ()
‘three drug’ case. In this case, the speakers using ‘ought’ are focussing on the practical
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
situation of a certain agent x at a time t; in this situation, there are three options
available to x at t—giving the patient drug , giving the patient drug , and giving the
patient drug . The agent x knows that drug is second best. Unfortunately, although
x knows that either drug is best or drug is best, x does not know which—and x
knows that if drug is best, drug will be disastrous, while if drug is best, drug
will be disastrous. The speakers are considering what x ought to do at t given the
informational state that x is in at t. Then the three parameters f , E, and V will be
something like the following:
• f (w*) is the set of metaphysically possible worlds that are practically available to
x at t (so in these worlds, everything that x cannot change by x’s actions at t is
exactly as it is in w*).
• E is a probability distribution that in the appropriate way corresponds to x’s
informational state at t.
• V is a value function that assigns values to the three Russellian propositions, A ,
A , and A —the propositions that at t, x gives the patient drug , drug , and drug
, respectively—where these three propositions form a partition of the domain of
worlds f (w*).
E assigns probabilities to various hypotheses—including hypotheses about the value
that V assigns to A , A , and A . In referring to these Russellian propositions A , A ,
and A , these hypotheses use modes of presentation of these propositions—and we are
assuming that these modes of presentation of Russellian propositions are themselves
Fregean propositions. To keep things simple, however, let us suppose that E puts the
relevant Fregean propositions into a one-to-one correspondence with the Russellian
propositions. (That is, for each of these Russellian propositions, there is exactly one
Fregean proposition that is a mode of presentation of that Russellian proposition
such that E attaches non-zero probability to any hypotheses involving that Fregean
proposition.) Thus, there is also a corresponding set of Fregean propositions forming
a partition of the epistemically possible worlds—AE. , AE. , and AE. —corresponding
to A , A , and A .
Assume that for each of these Fregean propositions AE.i , there are two hypotheses
hi. and hi. of the form ‘V(AE.i ) = n’ to which E assigns non-zero probability:
• h. is ‘V(AE. ) = ’; and h. is ‘V(AE. ) = ’.
• h. is ‘V(AE. ) = ’; and h. is ‘V(AE. ) = ’.
• h. is ‘V(AE. ) = ’; and h. is ‘V(AE. ) = ’.
Suppose that for all i, E(hi. ) = E(hi. ) = .. Then the EV-expected value of each of
these three Fregean propositions AE. , AE. , and AE. is as follows:
• EV(AE. ) = E(h. ) × + E(h. ) × = .
• EV(AE. ) = E(h. ) × + E(h. ) × = .
• EV(AE. ) = E(h. ) × + E(h. ) × = .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Thus, the epistemically possible worlds that have maximal EV-value are all and only
the worlds at which AE. is true, and since the metaphysically possible worlds in f (w*)
corresponding to the epistemically possible worlds at which AE. is true are all and only
the worlds where A is true, S<f , E, V> f (w*) contains precisely these metaphysically
possible worlds. It follows that ‘Ought<f , E, V> (A )’ is true at w*.
As I explained above, this proposal is simply an account of conditions under which
‘ought’-sentences are true. I am not proposing that there are hidden variables referring
to these parameters f , E, and V in the actual syntax of these sentences. I am also
not claiming that linguistic competence with ‘ought’ involves some kind of implicit
knowledge or grasp of this semantic account; this semantic account does not by itself
settle the question of how best to account for our competence with ‘ought’.
However, I shall argue in Section that all of these three parameters—f , E, and
V—are part of the semantics of ‘ought’, in the sense that they must be included in
any systematic account of the truth conditions of the full range of sentences involving
‘ought’. So, in normal contexts when ‘ought’ is used, something must determine what
these three parameters are. Presumably, this will involve the speakers in the context
actually thinking of something that somehow determines these parameters. I shall not
take a definite stand on what exactly the speakers in the context must be focussing on
in this way. (No doubt, few actual speakers employ the formal mathematical concept
of probability to think of a probability distribution!) For example, the probability
distribution E might be determined by the speakers’ in some way thinking of or
imagining a certain epistemic perspective—where as a matter of fact, this perspective
can be modelled by the probability distribution E.
As I shall put it, in the context in question, each of these three parameters f , E,
and V is ‘contextually salient’ (although—as I have said—I shall remain neutral about
what exactly is involved in these parameters’ being contextually salient in this way).
In the next section, I shall show how different settings of these three parameters f , E,
and V can yield intuitively plausible truth conditions for each of the kinds of ‘ought’
considered in Section .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
It seems plausible that the semantic value of every instance of the practical ‘ought’
is focussed on the situation of a particular agent x at a particular time t. (It is this that
has tempted many philosophers—including Mark Schroeder ()—to argue that the
practical ‘ought’ actually stands for a relation between an agent and an act-type.) So it
seems that the semantic value of this use of ‘ought’ will involve a function f that maps
each world w onto the worlds that are ‘practically available’ from the situation that the
agent x is in at the time t in w—in effect, the worlds that the agent x can realize through
the acts that she performs at t in w.
This semantic value will also involve a function V that measures the value of the
various acts that the agent performs at any of these available possible worlds. For
example, more specifically, V might be a measure of the choiceworthiness of the act
that the agent performs in this situation within each of these worlds. On this view,
then, if the relevant ‘ought’ is the objective practical ‘ought’, focussed on the situation
of an agent x at a time t, then ‘Ought (p)’ is true at a world w if and only if p is true
in all the worlds that are practically available from the situation that x is in at t in w
where x does one of the maximally choiceworthy acts available at that time t.
With the more subjective forms of the practical ‘ought’, V and f are exactly as they
are with the objective practical ‘ought’, and E is some less omniscient probability
distribution—that is, it is a probability distribution that encodes a significant degree of
ignorance and uncertainty about the world. For example, in many contexts we might
use a practical ‘ought’ in such a way that its semantic value involves a probability
distribution that corresponds to the system of credences that would be ideally rational
for a thinker to have if their experiences, background beliefs, and other mental states
were exactly like those of the agent x at t.
This, however, is not the only concept that a subjective practical ‘ought’ can express.
If the speakers have pertinent information that is not yet available to the agent who
is under discussion, it will often be natural for the speakers to use an ‘ought’-concept
whose semantic value involves a probability distribution that reflects this information.
Moreover, if the agent herself also thinks that there is some available information that
she has not yet acquired, it will be very natural for the agent to use an ‘ought’-concept
that in this way involves a probability distribution that incorporates this information
that the agent hopes to acquire.10
In general, a probability distribution is in effect a way of representing a certain epis-
temic perspective; and an epistemic perspective can become conversationally salient
for many reasons. For example, as we have noted, many probability distributions
correspond to the systems of credences that an ideally rational thinker would come to
have in response to certain experiences, given a certain set of background beliefs and
other mental states. If this collection of experiences and other mental states is precisely
the collection of experiences and states that a conversationally salient agent has at
a conversationally salient time, this can explain why the corresponding epistemic
10 This is how I would aim to answer the objections of Kolodny and MacFarlane ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
perspective will be salient in the conversational context in question. There are many
factors that can explain why a certain agent and time are salient in a conversational
context. For example, in many contexts, the salient time will often be the time of action,
rather than the time of utterance; and the salient agent may be an adviser or observer
of the agent on whom this occurrence of the practical ‘ought’ is focussed, rather than
that agent herself.
This idea of relativizing ‘ought’-concepts to probability distributions is clearly akin
to the idea of Björnsson and Finlay () that occurrences of ‘ought’ are relativized
to bodies of information, conceived of simply as sets of propositions. However,
there are a number of crucial differences. First, although every probability function
determines a body of information (consisting of the propositions to which the function
assigns probability ), the converse does not hold: there are many different probability
distributions in which exactly the same propositions have probability . In this way,
probability distributions contain more structure than mere bodies of information.
Secondly, my proposal is not committed to their view that every occurrence of ‘ought’
is relativized to an ‘end’ or ‘standard’ that can be understood in wholly non-normative
terms. Finally, my proposal is easier to integrate with some of the classical theories
in this area: unlike their account, my proposal entails standard deontic logic; and it
clearly yields the right verdicts in contexts where it is assumed that the agent ought to
maximize some kind of expectation of some kind of value.
We can make sense of objective and subjective versions of many kinds of ‘ought’.
For example, this point seems to hold, not just of the practical ‘ought’, but of the
purpose-relative ‘ought’, the ‘ought’ of general desirability, and the rational ‘ought’ as
well. In each case, the objective and the subjective ‘ought’ differ only with respect to
the relevant probability distribution E: with the objective ‘ought’, E is the omniscient
probability distribution, whereas with the more subjective ‘ought’, E is a probability
distribution that corresponds to the credence function of a possible thinker who
(although perfectly rational) is significantly more ignorant and uncertain about the
world.
It would be intrinsically interesting to explore exactly how this schema can be
worked out in detail for each of these other kinds of ‘ought’; but to save space, I shall
here only explain how it would work for the purpose-relative ‘ought’. So far as I can
see, the purpose-relative ‘ought’ resembles the practical ‘ought’ in that they are both
implicitly focussed on the situation of a particular agent x at a particular time t. So the
relevant function f from worlds to domains of worlds is again the function that maps
each world w onto the worlds that are ‘practically available’ from the situation that the
agent x is in at the time t in w.
The only respect in which the purpose-relative ‘ought’ differs from the practical
‘ought’ is in involving a different value function V. For the purpose-relative ‘ought’,
there is some purpose P that is contextually salient, and the value function V ranks the
various acts that the agent performs at any of the worlds that are practically available
to the agent at the time in question, not in terms of their overall choiceworthiness, but
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
purely in terms of how good these acts are as a means to accomplishing that purpose
P. Otherwise, the two kinds of ‘ought’ work in more or less the same way.
As I remarked in Section above, it would be preferable if our account of ‘ought’
could also encompass the other kinds of ‘ought’ that I considered in that section—
including the epistemic ‘ought’ (as in ‘Tonight’s performance ought to be a lot of fun’,
which as I said seems roughly equivalent to saying that the embedded proposition that
tonight’s performance will be a lot of fun is highly probable given the salient evidence).
The schema that I proposed in the previous section may be able to capture the
epistemic ‘ought’, in something like the following way. For the epistemic ‘ought’, the
three parameters may be the following. First, f can simply be the function that maps
each world onto the set of all possible worlds that are compatible with everything
that is known for certain in the context. Secondly, E can be a probability distribution
modelling some possible epistemic perspective. (Again, this could be pretty well any
perspective; the participants in a conversation will just have to interpret the contextual
clues in order to discern which perspective is contextually salient in the relevant way.)
Finally, V could simply be a function that ranks answers to a certain question, which
we can think of as a partition of alternative answers to the question, by ranking the true
answer to the question above all the false answers—say, by assigning a value of to the
true answer and to false answers. Now, as is well known, probabilities are themselves
simply expectations of truth values. So the ranking of answers to this question in
terms of their EV-expected value is identical to the ranking in terms of these answers’
probability according to E; and this ranking determines a corresponding ordering
of worlds in accordance with the probability of each world’s answer to the question.
So, for example, if the rival answers to the question are simply p and ‘¬p’, then the
sentence ‘It ought to be that p’, involving this epistemic ‘ought’, will be true just in
case p is more than probable than ‘¬p’ (according to the probability distribution that
corresponds to E).
One might wonder whether p’s being barely more probable than ‘¬p’ is enough to
make it true to say ‘It ought to be that p’, using this epistemic ‘ought’. At least, if we
were considering a fair lottery with numbered tickets, we would not typically say
such things as ‘The winning ticket ought to be one of the tickets numbered between
and ’.
However, the reason for this may be that the question that we normally have in mind
is not simply whether or not the embedded proposition is true, but whether or not
some more general explanatory picture of the world is true. If this general explanatory
picture is more than % probable, and the proposition p follows from this explanatory
picture, then it will be true to say ‘It ought to be that p’ (since p will be true in all the
worlds within the domain where this explanatory picture is true). A proposition p that
follows from a general explanatory picture of this sort will typically be significantly
more probable than that general picture itself.
This simple account of the value function V, in terms of the truth value of answers
to a certain question, may turn out not to be completely defensible in the end; a more
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
complicated account of this value function may be required. But at all events, to
capture the range of ways in which we use the epistemic ‘ought’, we have to allow that
many different probability distributions (or spaces of epistemically possible worlds)
can be involved. In particular, when a speaker asserts a proposition involving an
epistemic ‘ought’-concept of this sort, the probability distribution E involved in this
concept’s semantic value does not have to correspond to the information that is
actually available to the speaker. It may be a different probability distribution.
For example, even if the speaker knows perfectly well that the orbit of Pluto is not
elliptical, the relevant probability distribution E does not have to assign a probability
of to the proposition that the orbit of Pluto is elliptical; it may be a probability
distribution that corresponds to the credences that it would be rational to have given
a body of information that is different from the speaker’s actual total evidence but
contextually salient for other reasons. So this approach has no difficulty handling such
puzzling instances of the epistemic ‘ought’ as ‘The orbit of Pluto ought to be elliptical
(though of course it isn’t)’.11
11 This view of the epistemic ‘ought’ also helps to explain why it has such different truth conditions from
the epistemic ‘must’—even though both modals are broadly speaking necessity operators. For ‘must’, the
ordering on the possible worlds makes no difference to the sentence’s truth conditions; and according to my
proposal, the only relevance of the probability distribution E is to generate the ordering of possible worlds
in terms of their EV-expected value. So the truth conditions of ‘Must (p)’ depend purely on whether p is
true throughout f (w), and is unaffected by what E and V are in the relevant context.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Once we have the idea of a space of possible worlds—as opposed to a mere domain
or set of worlds—it is natural to reinterpret this ‘restricting’ function of conditionals.
Instead of simply replacing the domain of possible worlds with the subset of the
original domain where the conditional’s antecedent is true, we may conceive of the
conditional as replacing the original space of possible worlds with the sub-region of
the space where the conditional’s antecedent is true.
Where the space of worlds has no more structure than a simple set of worlds,
the sub-region of the original space will simply be the subset where the antecedent
is true—just as on Kratzer’s original proposal. However, where the space of worlds
has the structure of a probability distribution, replacing the space with the sub-
region where the antecedent is true is equivalent to replacing the original probability
distribution with the result of conditionalizing it on the antecedent.
According the account that I have proposed here, the semantics of ‘ought’ involves
two spaces or domains of possible worlds—the domain of metaphysically possible
worlds that is fixed by the function f , and the space of epistemically possible worlds E.
The antecedent of the conditional will restrict one of these spaces of worlds; but it may
be up to the particular conversational context to determine which of these two spaces
is restricted in this way.
So, some conditionals will restrict the domain of metaphysically possible worlds
f (w) to the subset of that domain where the antecedent is true; but other conditionals
will restrict the space of epistemically possible worlds E to the sub-region of that
space where the antecedent is true. Just to give them labels, I shall call the first sort
of conditional ‘ought’ the ‘metaphysical conditional’, and I shall call the second sort of
conditional the ‘epistemic conditional’.
The truth conditions of these two kinds of conditionals can be specified as follows:
. Metaphysical: For any two propositions p and q: ‘[If p] q’ is true at w iff q [f / f ] is
true at w—where q [f / f ] is the result of uniformly replacing f in q with f , which
is the function from any possible world w to the subset of f (w ) where p is true.
. Epistemic: For any two propositions p and q: ‘[If p] q’ is true at w iff q [E/ E ]
is true at w—where q [E/ E ] is the result of uniformly replacing E in q with E ,
which is the sub-region of E where p is true.
It is clear that the clause for this second epistemic conditional requires that the space
of possible worlds E must itself be part of the semantics of the sentence that expresses
the proposition q. It is only if E is part of the semantics that the effect of embedding
this sentence within a conditional can be to restrict this space E to the sub-region of
the space where the antecedent proposition p is true.12
12 It is at this point that my account differs most clearly from the account of Bronfman and Dowell (,
chapter ). In their account, the ordering source g (interpreted as a function from worlds of evaluation to
orderings on the worlds in the modal base) is a single parameter, rather than a pair of parameters E and V.
(Following Dowell (), they propose that with the subjective ‘ought’, but not the objective ‘ought’, the
ordering source is sensitive to the contents of some salient body of information at the world of evaluation.)
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
The truth conditions that I have assigned here to the metaphysical conditionals
involving ‘ought’ are in effect the same as those that were assigned to the so-called
dyadic ‘ought’-operator by the classical deontic logicians such as Åqvist () and
Lewis (). On the other hand, the truth conditions that I have assigned to the
epistemic conditionals involving ‘ought’ have the effect of replacing the probability
distribution E that would be involved in the semantic value of the consequent of
the conditional if it appeared unembedded with the result of conditionalizing that
probability distribution on the antecedent.
For an example of the metaphysical conditional, consider the familiar examples of
the ‘second best “ought” ’ that have been used to illustrate the dyadic ‘ought’-operator.
Suppose that an adviser is remonstrating with a recalcitrant advisee. First, the adviser
says ‘You ought not to shoot up heroin’, and then when the advisee indicates that he
may not follow this advice, the adviser continues, ‘And if you do shoot up heroin, you
ought to shoot up with clean needles’.
If these statements involve the practical ‘ought’, focussed on the advisee’s situation
at the time of the utterance, then the adviser’s first statement is true because out of all
the worlds that are practically available to the advisee at the relevant time, the worlds
where the advisee acts in a maximally choiceworthy way are all ones where he does
not shoot up heroin. The second statement is true because out of all the worlds that
are practically available to the advisee at the relevant time and the advisee does shoot
up heroin, the worlds where the advisee acts in a maximally choiceworthy way are all
worlds where he shoots up with clean needles.
For an example of the epistemic conditional, consider the following variant of Frank
Jackson’s () three-drug case—specifically, a four-drug case. There are two drugs,
and , such that it is known for certain that one of these two drugs will completely
cure the patient while the other drug will kill him, but unfortunately it is unknown
which of the two drugs will cure the patient and which will kill him. In addition, there
are two other drugs, and , each of which will effect a partial cure, but one of which
will have an unpleasant side-effect—though it is not yet known which drug will have
that side-effect. Suppose that the patient is about to have a test: it is known that if the
test is negative, it is drug that will have the unpleasant side-effect, while if the test is
positive, drug will have the unpleasant side-effect. Then it is true to say ‘If the test
result is positive, we should give the patient drug .’
This statement is true because we give drug in all possible worlds in the relevant
domain in which we take the course of action that maximizes expected choicewor-
thiness, according to the probability distribution that results from our current system
They interpret deontic conditionals as involving two modal operators, with the conditional’s antecedent
restricting the domain of an operator that has wide scope, and the deontic modal operator having narrow
scope, embedded inside the scope of the other operator. While I find my account marginally more plausible
than theirs, I cannot compare the advantages of the two approaches here; further work is clearly needed on
this point.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
13 This interpretation of these epistemic deontic conditionals seems to me to avoid the problems for rival
accounts that are canvassed by Nate Charlow (). Those rival accounts all represent the relevant body of
information by means of the ‘modal base’—that is, the propositions that are true throughout the relevant
domain of worlds f (w); my account represents this body of information in a fundamentally different way—
by means of the probability distribution E that, together with the relevant value V, determines the ordering
of the worlds in this domain. In this way, my account agrees with Charlow’s central point, that a good
semantic account must make provision for conditionalizing, not only the modal base, but also the relevant
ordering of the worlds. It is precisely for this reason that I propose that there are two different kinds of
deontic conditionals.
14 I owe this example to Alex Silk.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
In this final section, I shall offer a brief reply to an objection that might be raised
against my suggestion that this family of truth conditions really captures the semantic
values of uses of ‘ought’. Some readers may worry that my account seems to build
in some controversial assumptions about rational choice into the very semantics of
‘ought’. In some unpublished notes quoted by Kai von Fintel (, p. ), Kratzer
objects to accounts that do this, asking rhetorically: ‘Why pack information about
rational decision making into the meaning of modals?’
Strictly speaking, however, I have not in fact said anything about rational decision-
making here. Admittedly, my account makes use of the general idea of the expected
value of a proposition, which is an idea that is also invoked in many theories of rational
choice—for example, by those theories that imply that a rational choice must maximize
expected utility. My account has in fact made absolutely no mention of utility at all.
(There may be contexts where the value function V involved in the semantic value of
an occurrence of ‘ought’ is a utility function; I take no stand on the issue.) Still, it may
seem that the mere fact that I have made use of the general idea of the expected value
of a proposition brings my account too close to ‘packing information about rational
decision making into the meaning of modals’.
There are two main problems that might be alleged to affect accounts of the
semantics of ‘ought’ that appeal to the idea of the expected value of a proposition. First,
one might think that this idea is too controversial and too technical to be implicit in
the linguistic competence of ordinary speakers. Secondly, one might think that there
are some specific cases that cannot be handled in an intuitively acceptable way by any
such account.
My account is not vulnerable to the first problem, since I have explicitly distanced
myself from any attempt to explain linguistic competence in terms of an implicit
grasp of the truth conditions that I have described. It is undeniably an important
question what linguistic competence consists in, but unfortunately I cannot address
that question here. At all events, it is far from obvious that the truth conditions
that I have described are incompatible with any plausible account of linguistic
competence.
The second problem is potentially more serious. For example, consider an agent—
call him John—who harbours grave doubts about all views according to which one
should choose options that maximize some kind of probabilistic expectation of some
kind of value. Instead, John is attracted to a rival theory of rationality, such as the
maximin theory—according to which in every choice situation, one should choose
one of the options whose worst possible outcomes are at least no worse than the worst
possible outcomes of the available alternatives. Suppose that the most plausible version
of the expected-value theory would favour John’s choosing act A, and the maximin
theory would favour his choosing act B. It would seem true to say ‘For all John knows,
he ought to choose B’. Can we really handle cases of this sort in a satisfactory manner
if the notion of maximizing expected value is built into the semantics of ‘ought’ as
I propose?
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
These cases seem hardly typical of the normal use of deontic modals, since they
concern the use of these terms by theorists or philosophers in talking about other
theorists. It is questionable whether such esoteric uses are the most reliable evidence
for a theory of the meaning of words in a natural language. Nonetheless, a number of
recent writers seem to have been moved by cases of this sort. For example, Jennifer
Carr () has proposed that there should be a separate parameter in the semantics
of deontic modals for a decision rule: in some contexts, this decision rule might be
maximizing expected utility, but in other contexts, it might be some other decision
rule, such as the maximin rule.
In my view, however, this manoeuvre greatly complicates the semantic account
of deontic modals, in a way that is far from obviously warranted by the linguistic
evidence. Statements of the form ‘For all John knows, p’ are hard to interpret. In some
contexts, it seems that it could be true to say such things as ‘For all Pythagoras knew,
there are only finitely many prime numbers’. To explain why this sentence is true, it is
surely not necessary to argue that there are some possible worlds where there are only
finitely many prime numbers! In a similar way, it should be possible to explain why
the sentence ‘For all John knows, he ought to choose B’ without supposing that there
is any context such that the notion of maximizing expected value plays no role in the
correct account of the truth conditions that an ‘ought’-sentence has in that context.
A similar objection is raised by Fabrizio Cariani (, p. ), who focusses on cases
where a sentence involving ‘ought’ is embedded inside a larger sentence, such as ‘John
believes that he ought to choose B’, which ascribes a belief to a heterodox theorist like
John. Cariani argues that an account like mine will have difficulties explaining why
this belief-ascription is true.
In fact, however, my account has no difficulty providing such an explanation. It
seems most promising to link my account of the semantics of ‘ought’ with a relational
analysis of belief-ascriptions. According to this relation analysis, the belief-ascription
is true because John stands in the belief-relation to a content of the appropriate kind
that can be expressed in this context by the embedded sentence ‘he ought to choose B’.
According to Cariani (ibid.), ‘the appeal to the relational analysis is merely evasive,
unless it is complemented by an account of what content is expressed by a deontic
sentence in a given context’. But it is surely not obvious that in order to defend my
account of the semantics of ‘ought’, I need to commit myself to a full account of
the semantics of belief-ascriptions here. It is enough if I can make it plausible that
it is possible in principle to give an illuminating analysis of belief-ascriptions that
harmonizes with my account of ‘ought’.
So, to fix ideas, I shall suggest a possible analysis of this sort. I am not firmly
committed to all the details of this suggestion; the suggestion is included here
only to respond to Cariani’s objection. According to this suggestion, in this context
the embedded sentence ‘he ought to choose B’ expresses a Fregean proposition—
presumably, a Fregean proposition that John could express in an appropriate context
by uttering the sentence ‘I ought to choose B’.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
It seems clear that this Fregean proposition is capable of being true or false; in
that sense, this proposition has truth conditions. Presumably, to make this sugges-
tion compatible with my account, this Fregean proposition must have has the same
(extensional) truth conditions that my account assigns to the ‘ought’-sentence in this
context. However, there is no reason to think that John himself must entertain this
Fregean proposition by means of explicitly thinking of these truth conditions. John
must latch onto this proposition somehow, but it is not necessary for him to latch onto
the proposition by means of an implicit grasp of the most systematic account of the
truth conditions of sentences that express this proposition. Instead, I suggest, John
latches onto this proposition by deploying some mode of presentation of the property
that, according to my account, the embedded proposition that John chooses B would
have to possess for the whole ‘ought’-proposition to be true. The content of John’s
belief is a Fregean proposition that applies this mode of presentation to this embedded
proposition. Exactly how John grasps this mode of presentation of this property is a
delicate question, but it seems possible that he could grasp this mode of presentation
without having any awareness of how the property is analysable in terms of a domain
function f , a probability distribution E, and a value function V.
A further concern that Cariani raises is whether my account will make it the case
that John’s beliefs are ‘logically inconsistent’. In principle, there are many views that
philosophers have defended that are inconsistent with the correct semantics for some
natural-language expressions. For example, some philosophers have defended the view
that there are deontic dilemmas—cases in which it is simultaneously true that you
ought to do A and also that you ought not to do A. According to almost all the accounts
of ‘ought’ that semanticists have proposed, these philosophers’ views are inconsistent
with the correct semantics for ‘ought’. In principle, I accept that cases could be devised
in which the beliefs of John the maximin theorist would be similarly ‘inconsistent’.
However, since beliefs can be in this sense ‘inconsistent’ in highly non-obvious ways,
I do not see how this counts as any sort of objection to my account.
In general, cases where an ‘ought’-sentence of this kind is embedded within a
hyperintensional context such as ‘John believes that. . . ’ or ‘For all Barbara knows. . . ’
raise so many problems of their own that they seem not to provide firm grounds for
objecting to my account. Moreover, so far as I can see, there is no clear case where
we have the intuition that a sentence that has ‘ought’ as the dominant operator—
for example, a sentence of the form ‘Barbara ought to do A’—is true, in a way that
clearly cannot be handled by the account that I have proposed.15 In short, the linguistic
evidence does not clearly undermine my account of the semantics of ‘ought’.
Even though my account is unified in that the notion of maximization features in
my account of the semantic value of every occurrence of ‘ought’, it is in other ways
15 Indeed, I suspect that in any case where we are tempted to assert a sentence of the form ‘Barbara ought
to do A’, on the grounds that A is what is recommended by Barbara’s non-maximizing theory, our assertion
is either false, or else true only when this occurrence of ‘ought’ is understood as the purpose-relative ‘ought’,
relativized to the goal of conforming to the non-maximizing theory in question.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
an immensely broad and flexible account of the term. Many other philosophers of
language and metaethicists have proposed much narrower interpretations of ‘ought’,
which dramatically under-predict many of the readings of ‘ought’ that seem genuinely
available.16 By contrast, the range of truth conditions that I have identified in Sections
– above is much wider. So my suggestion—that all the truth conditions identified
here belong to concepts that can be expressed by ‘ought’ in ordinary English—implies
that these deontic modals, such as ‘ought’ and ‘should’, are capable of expressing this
wide range of concepts, depending on the particular context in which they are used.
In this way, my suggestion clearly runs the opposite danger—that of over-predicting
the readings of ‘ought’ that are available. For example, the schema that I outlined in
Section seems to predict that there is a practical ‘ought’-concept that is indexed to the
situation that I am in right now, and to the space of epistemically possible worlds that
corresponds to Julius Caesar’s state of information on that fateful morning of March
BC. It is doubtful, to say the least, whether there is any way of using such terms as
‘ought’ in English that will express this concept.
It does not seem clear to me that this point grounds any decisive objection to my
approach. We should concede, it seems to me, that this concept really exists, but that we
have no natural way of expressing it in English (or in Latin, or in any natural human
language), largely because of the very limited interest that this concept would have
for us. Admittedly, the suggestions that I have made in this paper would need to be
supplemented in order to explain why there is no natural way of using our natural-
language terms to express many of these concepts. But I see no reason to think that
such supplementation will prove impossible.
In general, of the two dangers that face such interpretations of natural-language
expressions, the danger of over-predicting the readings that are available seems less
grave than the danger of under-predicting such readings, since it will often be possible
to supplement an account that looks likely to over-predict the available readings of a
term with some further account that explains why those readings will not in fact be
available in any real conversational context. An account that underestimates the range
of concepts that a term can express, on the other hand, seems to admit no way of being
supplemented in order to rectify this deficiency. So there are some general reasons to
be optimistic that the sort of approach that I have sketched here will help us to achieve
a better understanding of these deontic modals such as ‘ought’ and ‘should’.
Acknowledgments
This paper was originally written and posted on my website in the summer of . In the spring
and summer of , it was presented as talks at Berkeley and at Edinburgh; I am grateful to
the members of those audiences for helpful comments. Finally, in revising the paper in ,
16 For an example of an interpretation of ‘ought’ that is dramatically narrower than mine, see Judith
Thomson ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
I benefited greatly from some highly illuminating comments from Nate Charlow, Matthew
Chrisman, Alex Silk, and Malte Willer.
References
Åqvist, Lennart () Good Samaritans, Contrary-to-Duty Imperatives, and Epistemic Obli-
gations. Nous. . pp. –.
Björnsson, Gunnar and Finlay, Stephen () Metaethical Contextualism Defended. Ethics.
. . pp. –.
Brandt, R. B. () Ethical Theory. Englewood Cliffs, NJ: Prentice Hall.
Bronfman, Aaron and Dowell, J. L. () Contextualism about Deontic Conditionals. In
Charlow, Nate and Chrisman, Matthew (eds.) Deontic Modality. Oxford: Oxford University
Press.
Cariani, Fabrizio () Deontic modals and probabilities: One theory to rule them all? In
Charlow, Nate and Chrisman, Matthew (eds.) Deontic Modality. Oxford: Oxford University
Press.
Carr, Jennifer () Deontic modals without decision theory, Proceedings of Sinn und Bedeu-
tung , Paris, September –. pp. –.
Charlow, Nate () What we know and what we do. Synthese. . pp. –.
Dowell, J. L. () Flexible Contextualism about Deontic Modals: A Puzzle about Information-
sensitivity. Inquiry. . –. pp. –.
Easwaran, Kenny () Regularity and Hyperreal Credences. Philosophical Review. . .
pp. –.
Ewing, A. C. () The Definition of Good. New York, NY: MacMillan.
Fintel, Kai von () The best we can (expect to) get? Challenges to the classic semantics for
deontic modals. Presented at the Central Meeting of the American Philosophical Association.
February , . [Online] Available from: http://mit.edu/fintel/fintel--apa-ought.pdf
[Accessed November .]
Gibbard, Allan () Truth and Correct Belief. Philosophical Issues. . pp. –.
Jackson, Frank () A probabilistic approach to moral responsibility. In Barcan Marcus, R.,
Dorn, George, and Weingartner, Paul (eds.) Logic, Methodology, and Philosophy of Science
VII. pp. –. Amsterdam: North-Holland.
Jackson, Frank () Decision-Theoretic Consequentialism and the Nearest and Dearest Objec-
tion. Ethics. . . pp. –.
Jackson, Frank and Pargetter, Robert () Oughts, options, and actualism. Philosophical
Review. . pp. –.
Kolodny, Niko and MacFarlane, John () Ifs and Oughts. Journal of Philosophy. .
pp. –.
Kratzer, Angelika () Modals and Conditionals: New and Revised Perspectives. Oxford: Oxford
University Press.
Lewis, David K. () Counterfactuals. Oxford: Blackwell.
Parfit, Derek () Reasons and Persons. Oxford: Clarendon Press.
Schroeder, Mark () Ought, Agents, and Actions. Philosophical Review. . . pp. –.
Sidgwick, Henry () The Methods of Ethics th edition. London: Macmillan.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Silk, Alex () Evidence-Sensitivity in Weak Necessity Deontic Modals. Journal of Philosoph-
ical Logic . . pp. –.
Swanson, Eric () Ordering Supervaluationism, Counterpart Theory, and Ersatz Fundamen-
tality. Journal of Philosophy. . . pp. –.
Thomson, J. J. () Normativity. Chicago, IL: Open Court.
Wedgwood, Ralph () The Nature of Normativity. Oxford: Clarendon Press.
Wedgwood Ralph () The ‘Good’ and the ‘Right’ Revisited. Philosophical Perspectives. .
pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
‘Ought’: Out of Order
Stephen Finlay
The semantics of ‘ought’ and related modal verbs in natural language is not for the faint
of heart. A central desideratum is a unifying theory accommodating the many diverse
uses, both normative (moral, instrumental, rational, legal, etc.) and nonnormative
(logical, metaphysical, nomological, epistemic, dispositional, etc.) This places it at
the intersection of many issues in different subfields of philosophy, linguistics, and
logic, at least, and so presumably no single scholar could acquire all relevant expertise.
Angelika Kratzer’s (David Lewis-influenced) ordering semantics, which today is widely
regarded as orthodoxy, is therefore a remarkable and audacious achievement. Whereas
the Lewis–Kratzer semantics was developed from a primary focus on counterfactuals
and then extended to normative and other uses, this chapter offers a metaethicist’s per-
spective, taking as its primary cues the behavior of English modal verbs in normative
sentences.
When I first developed my own theory of the meaning of ‘ought’ in – (initially
published as “Oughts and Ends” in ), like other metaethicists I was ignorant
of Kratzer’s seminal work on modals in linguistics (, , ). This defect in
metaethics has since been corrected, and today philosophical work on normative
modals typically adopts the Lewis–Kratzer framework. But this new attention has
uncovered serious difficulties for its application to normative sentences, stimulating
an explosion of work at the intersection of metaethics and linguistics. Whereas these
efforts at repairing or replacing the Lewis–Kratzer framework invariably propose
additional semantic complexity, I’ll argue here that these difficulties are all avoided
more straightforwardly and naturally by the simpler semantics I’ve advanced in
“Oughts and Ends” and subsequent work (a, b, ), a version of the dyadic
semantics which the Lewis–Kratzer approach supplanted. I’ll argue that Kratzer’s
signature innovation of an ordering source parameter, though ingenious, introduced
unnecessary complexity into the semantics of modals that we’re better off without, at
least for English auxiliaries such as ‘ought’, ‘must’, and ‘may’. The simpler semantics has
the resources to address the issues motivating ordering semantics, without having its
weaknesses.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
After sketching the central features and motivations of the ordering semantics in
Section , I’ll programmatically investigate problems arising from (i) instrumental
conditionals (Section )—which motivate my rival approach to normative modality
(Section ), (ii) gradability and “weak necessity” (Section ), (iii) information-
sensitivity (Section ), and (iv) conflicts (Section ). I will argue that a simpler dyadic
approach addresses these problems at least as well as the ordering semantics, given
three moves: (a) an end-relational analysis of normative modality, (b) an analysis of
‘ought’ or “weak necessity” in terms of less-than-universal quantification (most rather
than all), and (c) appeal to the same resources of conversational pragmatics as utilized
by Kratzer. I conclude (Section ) by addressing metasemantic objections to my claims
about simplicity, with observations about what we should want from a semantics for
‘ought’.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
laws), or with salient circumstances (e.g. what one ought to do given violation of some
norm, or “contrary-to-duty obligations”). Since anything follows from an inconsistent
premise-set, a dyadic semantics predicts that relative to such backgrounds must p is
true for any arbitrary p, and, since nothing is consistent with an inconsistent set, that
may p is false for any arbitrary p. But inconsistent ideals don’t seem to render normative
claims trivially true or false.
Kratzer’s ingenious solution to both problems is to suggest that modal verbs take a
second conversational background, an ordering source. This is to move from a dyadic
to a polyadic semantics (although ‘dyadic’ is often used in the literature, confusingly,
where ‘polyadic’ is meant). This ordering source is also a variable function, g, from a
context w to a premise-set of propositions g(w), comprised of goals, ideals, laws, etc.
But rather than restricting the domain, g orders it, roughly according to how closely
the possibilities approximate the premise-set. The modal force of a verb consists in
how it quantifies over a subset of possibilities picked from the ordering by a selection
function. Simplifying slightly, must p says, relative to the f and g in w, that p is true
in all possibilities consistent with f (w) that are highest-ranked by g(w), while may p
says that p is true in some possibilities consistent with f (w) that are highest-ranked by
g(w). In this paper I use the term ordering semantics narrowly to refer only to theories
positing an ordering source parameter.
These two kinds of background are utilized in different ways to account for different
modal flavors. The modal base is stipulated always to be realistic, on Kratzer’s division
of labor. Normative or deontic flavors are generated by normative ordering sources,
whereas bare alethic (e.g. logical, metaphysical) modals have empty ordering sources,
so that all possibilities in the domain are equally ranked. Other modalities (epistemic,
counterfactual, etc.) are distinguished by particular combinations of backgrounds.
We’ll see that normative modalities pose significant difficulties for this shift from
dyadic semantics to polyadic ordering semantics.
Instrumental Conditionals
Consider instrumental conditionals,1 such as,
() If you want to go to Harlem, you must take the A train.
() If you are (going) to go to Harlem, you must take the A train.
() (In order) to go to Harlem, you must take the A train.
1 (Some of) these are commonly called ‘anankastic conditionals’ by linguists and ‘hypothetical imper-
atives’ by philosophers; both labels are misnomers. (a) ‘Anankastic’ signifies necessity/compulsion, but
necessity is neither necessary (e.g. “existential anankastics” such as ‘If you want to go to Harlem, you could
take the A train’) nor sufficient (e.g. ‘If you are in Harlem, then you must be in New York’). (b) These
sentences have declarative rather than imperative mood.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
2 First, the contrapositive of () is apparently ‘If you don’t take the A train, you can’t go to Harlem’, rather
than ‘If you don’t take the A train, you can’t want to go to Harlem’ (Sæbø , p. ).
Secondly, () seems roughly equivalent to () and (). While these other constructions can sometimes
indicate an agent’s purpose, this doesn’t seem to be their essential function here (Finlay b). Nissenbaum
() proposes the gloss: you must take the A train with the purpose of going to Harlem; von Fintel and
Iatridou () offer convincing objections.
Thirdly, these sentences exhibit anomalous detaching behavior, resisting modus ponens. One can coher-
ently assert ‘If Henry wants to be a famous mass murderer, then he has to kill a lot of people’, while refusing
to assert ‘Henry has to kill a lot of people’ upon learning that Henry has this disturbing desire.
3 Condoravdi and Lauer () offer a dissenting view, turning on a reading of ‘want’ that I would argue
is artificial.
4 von Fintel and Iatridou (), von Stechow et al. ().
5 Proposed in an early version of von Fintel and Iatridou ().
6 As von Stechow et al. () point out, it is also (almost) functionally equivalent to eliminating ordering
sources altogether, as I propose below.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
7 This is contested; see discussion of the “Chinese train” example in von Stechow et al. (). The correct
response, I believe, appeals to ellipsis in the antecedent (von Fintel and Iatridou , p. ). The existence
of inferior means justifies challenges to and retractions of ‘must’ claims; e.g. A: “No, you don’t have to take
the Chinese train to go to Vladivostok; you could take the Russian train.”; B: “Yes, but you do have to take
the Chinese train to go to Vladivostok comfortably.”
8 Cf. von Stechow et al. (, p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
semantics,9 requiring that the scope of a modal operator be neither entailed nor ruled
out by f whenever g isn’t empty. But I wish to point out how much more simply
and naturally a dyadic semantic analysis of instrumental conditionals avoids these
problems.
Notice that the challenge confronting ordering semantics here is, effectively, to
explain how a goal in an ordering source could behave as if it were in the modal
base instead. The truth conditions of instrumental conditionals, such as ()–(), seem
sensitive only to possibilities consistent with achieving the designated goal. So suppose
we simply allow the conditional to function in its ordinary way, unambiguously adding
the antecedent’s designated goal to the modal base instead.10 I have argued (especially
Finlay, b, ) that a fully unifying, compositional, and conservative analysis then
comes into view at least for sentences like () with ‘if s is (going) to. . . ’ antecedents,
which also supports promising analyses of other grammatical variants such as () and
().11 Briefly:
The prospective aspect in these antecedent clauses encourages a transparently and
compositionally temporal analysis: to represent an event e as “[going] to” happen is to
project it into the relative future. A sentence such as () therefore grammatically rep-
resents the modality in the consequent clause as being (temporally or metaphysically)
prior to the event e in the antecedent; i.e. the necessity of taking the A train is located
prior to your going to Harlem. This reverses the order we read into conditionals by
default;12 for example, the non-instrumental conditional () is naturally interpreted
as representing the necessity of taking the A train as (temporally) consequent upon
going to Harlem:
() If you go to Harlem, then you must take the A train.
Whereas Kratzer doesn’t herself analyse how modality interacts with temporality, I
maintain that by virtue of the temporal structure of ()–(), ‘must’ there expresses
the necessity of (temporal or metaphysical) pre-conditions for the antecedent’s event
obtaining, and thereby an instrumental (or, roughly, means-end) relationship.
This simple analysis avoids the ordering semantics’ problem from inconsistent goals
(since it requires consistency with the designated goal) and also the problem from
consistent goals (since e.g. kissing van Nistelrooy isn’t a necessary condition for going
to Harlem), while avoiding any ambiguity in the semantics for ‘if ’. Of course, we also
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
need an account of the contrast between ‘must’ and ‘ought’, and of inconsistencies (e.g.
to address the problem of unattainable goals); here I beg the reader’s patience until
Sections and , respectively. But if I am correct, then instrumental conditionals are
best analyzed without appeal to any ordering source.
13 Paul Portner labels these ‘priorities’; I prefer ‘ends’ both for continuity with ancient philosophical
tradition, and because ‘priority’ suggests rather the importance or weight of an end/norm (e.g. as used in
Horty ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
14 I analyze the prescriptive, “categorical” character of moral utterances as rhetorical, arising from
suppressing reference to the end made salient by the speaker’s rather than the agent’s desires; see especially
Finlay (, chapter ), and compare Harman ().
15 Why is this theory dyadic, if we can distinguish between the end e, and the “realistic” part of the modal
base, f-e? Answer: the modal force of ‘ought’ (i.e. the operation it performs on its inputs) doesn’t differentiate
between e and anything else in f . The end therefore needn’t be identified as a separate parameter; see further
discussion in Section .
16 Two further supporting considerations: () the distinctive deontic logic shared by teleological and
deontic modals reduces to ordinary modal logic given only the assumption of conditionalization on a
hypothetical outcome (the “Kanger–Anderson reduction”). () Teleological and deontic uses of modal verbs
share grammatical features of tense which distinguish them from epistemic and other uses. I argue (b,
, chapter ) that the end-relational analysis explains these compositionally, and suggest that in deontic
uses they function as grammatical indicators of suppressed conditionalization on an end, which triggers
our sense for normativity.
17 In unpublished work, Janice Dowell proposes analyzing moral ‘ought’ with ordering sources invoking
another normative concept, such as: in view of the most important standards. This solution depends on these
further concepts not being themselves relativistic, but I argue (, pp. –) for a parallel treatment. I
address the problem of disagreement in Björnsson and Finlay (), Finlay (, chapter ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
all. Despite the aspiration to neutrality, the ordering semantics may therefore be
compatible with only a narrow range of metaethical theories. Since taking sides
in these debates is unavoidable in pursuing a unified modal semantics, we should
boldly go wherever the linguistic evidence leads—which, I’ll now argue in earnest,
is consistently in the direction of a dyadic semantics and the end-relational theory.
18 Kratzer responds that this is a “language-specific fact” and that auxiliaries in some other languages
and some English modal adjectives do take such modifiers (, p. )—which I don’t think justifies
attributing a gradable semantics to English auxiliaries. Admittedly, there are some ways to qualify these
with degree modifiers (see Portner and Rubinstein ), but these seem to apply indiscriminately, even to
uncontroversially nongradable terms such as ‘dead’ and ‘guitar’ (e.g. ‘X is more a guitar than Y is.’).
19 P.c. She does offer an analysis of “weak necessity” (, p. ).
20 For problems for multiple ordering sources, see Rubinstein (). Silk faces the problem that ‘ought’
isn’t synonymous with ‘would have to’, and a nonarbitrary kind of counterfactual condition is yet to be
identified. Rubinstein’s suggestion that ‘ought’ indicates a controversial ordering source conflicts with the
Theseus/miner scenarios below, in which relevant ideals are uncontroversial, and with moral uses of ‘must’,
which are often controversial. See Section for some objections to other accounts.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
21 This is called a “common intuition” in Portner (, p. ), and the “traditional view” in Copley
(, p. ), but I’m unaware of anyone else defending it in print. Copley attributes it to Larry Horn, who
reports (p.c.) maintaining the view outside of print. Previously I emphasized probably, tentatively analyzed
in terms of most (a, p. ; , p. ); see also Wheeler (, ).
22 Some common objections: (i) Ought p but not p is acceptable, unlike probably p but not p (e.g. Copley
). Reply: unlike tenseless modal adverbs, auxiliaries such as ‘ought’ needn’t be relativized to full present
evidence (Thomson ; Finlay , p. ; Wedgwood ). (ii) Ought p seems bad when p is only
incrementally more likely than some relevant alternatives. Reply: it may just be infelicitous; compare most
likely, p.
23 Compare Kratzer (, p. ).
24 Finlay (, p. ). (Some writers, such as Cariani, Kaufmann and Kaufmann ; Dowell ;
Charlow , induce such partitions in a possible worlds framework, to solve problems for ordering
semantics; see section ). Another option is to adopt a measure function defined for infinity.
25 Formally:
[[must p]]f ,g = T iff for all u ∈ ∩f (w) there is a v ∈ ∩f (w) such that v ≤g(w) u, and for all z ∈ ∩f (w),
if z ≤g(w) v, then z → p.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
A formalization of a basic multiple ordering source semantics for ‘ought’ that avoids the Limit
Assumption:
[[ought p]]f ,g,g = T iff for all u ∈ ∩f (w) there is a v ∈ ∩f (w) such that v ≤g(w) u, and for all z ∈ ∩f (w),
if z ≤g(w) v, then there is a q ∈ ∩f (w) such that q ≤g(w) z and q ≤g(w) z, and for
all r ∈ ∩f (w), if r ≤g(w) q and r ≤g(w) q, then r → p.
Swanson () presents a scenario that requires even further complications to the ordering semantics.
26 For example, Goble (), Wedgwood (, ), Cariani ().
27 Compare Williamson (), Kolodny and MacFarlane (). Probabilistic information can be
included in modal bases where necessary.
28 For an amateur attempt, see Finlay (, pp. –). One constraint on an acceptable model is that
relevant possibilities needn’t be known.
29 This claim is defended in Section . Ordering semantics are often defended against quantitative
frameworks on similar grounds (e.g. Kratzer , p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
generally introduce third or more parameters.30 For this reason, I think it is reasonable
to continue describing this as a move within the dyadic semantics framework.) For the
practical ‘ought’ used in advice and deliberation, this background d is articulated with
a phrase like ‘rather than any other option in the agent’s power to perform at t’. My
proposal is that ‘ought’ has the modal force of comparative probability, meaning in
more possibilities (or: more likely) than any alternative, ri , in d(w).31
To see the application to normative modalities, consider a teleological ‘ought’ like
() To evade arrest, Max ought to mingle with the crowd.
Following the blueprint of the end-relational analysis of ‘must’ above, we assume a
preliminary modal base f of circumstances or evidence, which is then updated with
the end e: that Max subsequently evades arrest. As with the teleological ‘must’ and
‘may’ (and also ability modals, following Kratzer (, p. )), we exclude from f any
facts or information about the agent’s psychological dispositions to choose any one
option over any other. Suppose Max is disposed not to mingle with the crowd because
he falsely believes it wouldn’t succeed; we wouldn’t on those grounds dissent with the
assertion of an instrumental possibility sentence like ():
() To evade arrest, Max could mingle with the crowd.
Relative to a preliminary modal base of this kind, every option is equally represented
in the possibility-space; I call this symmetry of choice.32
Updating this preliminary background f , characterized by symmetry of choice, with
the end e yields the following result: the option chosen in most of the remaining
possibility-space f +e(w) is the option on the choice of which e eventuates in more
of the possibility-space than it does on the choice of any alternative, ri . In simpler
language, the option most likely chosen is that which, if chosen, would make the end
most likely. Formally, given symmetry of choice, Equivalence holds:
Equivalence: ∀ri ∈ d(w): pr(p|f +e) > pr(ri |f +e) iff pr(e|f +p) > pr(e|f +ri ).
Equivalence might seem too complex to be playing any role in ordinary normative
thought. But represented geometrically it’s highly intuitive:
30 We could dispense with this parameter by treating d(w) as fixed by p plus context, but a contrast
parameter for ‘ought’ is independently well-motivated, embraced for numerous reasons; see Sloman (),
Jackson (), Cariani (a, b), Snedegar (), Finlay and Snedegar ().
31 My simple dyadic semantics can be formally expressed as:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
r5 p
e
r4 r1
r3 r2
33 Cariani (b, p. ) offers two objections to symmetry of choice. First, “it is plausible to assume
that the contextually supplied probability function might be either some salient credence. . . or an evidential
probability function.” Reply: this is to reject without argument my claim (and Kant’s) that thought about
whether S ought to do A essentially ignores any evidence that S will do A, by virtue of its very nature
as normative. Second, “there is no guarantee that a set of alternatives equiprobable relative to an initial
background will remain equiprobable after the background is updated”, e.g. by a conditional such as if it’s
snowing outside. Reply: symmetry of choice is always (re-)applied after any circumstantial updating, for
the reason that the assumption of transcendental freedom applies precisely in the circumstance of choice.
Another objection to the most analysis, pressed by Cariani and Ralph Wedgwood (p.c.), is that it entails
failure of Agglomeration: Op & Oq → O(p & q). I believe this may rather be another virtue; see Jackson
().
34 Finlay (a, p. n; , p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
The ordering semantics is unable to yield these results so simply or naturally. Perhaps
the most promising strategy is to treat reliability as a secondary ideal for the ‘ought’
claim, utilizing multiple ordering sources (von Fintel and Iatridou ). But reliability
isn’t normatively optional in the way secondary ideals are. Even if the only thing that
matters is saving his compatriots, Theseus ought to take door A. Reliability might be
thought a secondary ideal because it can be traded off against cost or comfort, etc.,
but this is really to trade off the importance of success, i.e. of the end itself, against
conflicting ideals.
This end-relational semantics has an additional advantage. It accommodates
degrees of possibility through the (classical) probabilistic structure of possibility-
spaces, which in effect directly induces an ordering of the options in the contrast
set. Importantly, this differs from the gradability provided by ordering sources in the
Lewis-Kratzer semantics, which instead directly induce an order on possible worlds
according to their approximation to the ideal. So while the most analysis simply
identifies the “best” option as the most likely/reliable option, the ordering semantics
rather identifies an option as “best” just in case (roughly) it is true in all the best
worlds. This indirectness is problematic: what is true in the best worlds might also
be true in the very worst worlds, and it seems arbitrary to determine an option’s value
by looking exclusively at best worlds and ignoring worst worlds.35 As we’ll see below,
some recent work in ordering semantics therefore seeks to induce orderings on sets
of possible worlds, partitioned by options—by introducing a contrast-set parameter
(e.g. Cariani a), a special selection function (e.g. Charlow ), or a special kind
of ordering source (e.g. Dowell ). The most analysis delivers these desired results
for free.
Information-sensitivity
Further difficulties for the ordering semantics are observed in how normative modals
are sometimes sensitive to subjects’ information. Whereas Kratzer originally analyzes
normative modals as always taking circumstantial modal bases, picking out premise-
sets of facts objectively, some normative claims are subjective or information-relative.
A simple amendment is to allow modalities with epistemic modal bases, picking out
sets of facts that constitute what the relevant subject knows (“information-states”).36
But this may not be enough, as there are also more subjective normative uses of modal
verbs, such as the ‘ought’ of practical reason, which is sensitive to what is merely
believed rather than known.
Kratzer proposes (p.c.) to accommodate these subjective uses with a circumstantial
modal base of facts about the agent’s beliefs, and an ordering source consisting of
35 Compare Jackson (). This selection function is also what forces the Limit Assumption, which the
most analysis avoids without fuss.
36 Finlay (, p. n); Kolodny and MacFarlane (, p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
norms of rationality, which I’ll call a subjectivized end strategy. While I agree that we
sometimes make claims about what an agent ought rationally to do given his beliefs,
this fits uncomfortably with other ordinary subjective ‘ought’ claims. First, we often
refer to ordinary, objective ends when making subjective ‘ought’ claims. Suppose for
example that Theseus’s beliefs favor choosing door A, although this option is in fact
incompatible with achieving his goal. We might then say,
() Given what he believes, to save his compatriots Theseus ought to take door A.
Secondly, it makes wrong predictions about appropriate modal force. There is
plausibly no separation between what is rationally necessary and what is rationally
optimal. In order to act rationally, Theseus must take door A. It seems infelicitously
weak to say that to act rationally he ought to take door A, and simply false to say that
to act rationally he could take door B. By contrast it’s natural to say that given what he
believes, to save his compatriots Theseus ought to take door A but doesn’t have to, and
could take door B though he oughtn’t.
Once Kratzer’s realism constraint is abandoned, we can simply appeal to a modal
base consisting of the (nonrealistic) contents of the subject’s beliefs (“given what you
believe”) to account for these subjective normative claims.37 The realism constraint
seems gratuitous, in any case. While the modal base has the function of domain restric-
tor in the semantics, sometimes conversationally relevant modalities are restricted
rather than merely ordered by nonfactual information.38 Instead of appealing to norms
of rationality, the dyadic semantics allows us simply to appeal to the same objective
ends (e.g. to save his compatriots), and account for the subjective ‘must’, ‘may’, and
‘ought’ as concerning what is respectively necessary, possible, and most likely relative
to the subject’s information i and the end e. Consider a subjectivized version of
Single-Minded Theseus, where the probabilities are subjective (i.e. relative to Theseus’s
beliefs); my semantics then predicts the following, intuitive verdict, parallel to ():
() Given what he believes, to save his compatriots Theseus ought to choose A,
though he doesn’t have to, as he could choose B (albeit foolishly!), but he must
choose either A or B, and mustn’t choose C.
Unlike Kratzer’s proposed solution, this semantics generates the appropriate judg-
ments and respects the relativization to objective ends. It requires no appeal to
controversial norms of rationality (indeed, I think it might rather explain them), and
directly explains why what one ought to do given complete accurate beliefs converges
on what one ought to do given the facts: these different functions pick out identical
premise-sets.
37 Finlay (a, p. ). With inconsistent beliefs the modal base must be restricted to a consistent subset
(Section ).
38 E.g. exocentric epistemic claims. Consider a variation of von Fintel and Gillies’ mastermind scenario
where a player accepts false information which she realizes entails that there are two reds. A nondeceived
observer can correctly say, “She knows there must be two reds”; i.e. must given her (false) information.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
In a context of this information, the following sentences are all typically judged true:
(a) If the miners are all in shaft A, then we ought to block A.
(b) If the miners are all in shaft B, then we ought to block B.
(c) Either the miners are all in A or they are all in B.
(d) We ought to block neither shaft.
This presents a puzzle. The conditional clauses of (a) and (b) apparently just
strengthen the information, updating the modal base with the antecedent. In all the
best worlds where the miners are in A we block A, and in all the best worlds where
they are in B we block B. But since all accessible worlds are either in-A or in-B worlds,
it follows that all the best accessible worlds are either block-A or block-B worlds, so
none can be block-neither worlds, which falsifies (d) on the ordering semantics.40
The fundamental issue here concerns nonmonotonicity, as Nate Charlow ()
observes.41 Note first that simple necessity or ‘all’ conditionals are (left downwards)
monotonic; i.e. if p then necessarily q entails if r then necessarily q for any r that entails
p. For example, from () we can safely infer ():
() If Garfield is a cat, then he must be a mammal.
() If Garfield is a yellow cat, then he must be a mammal.
The ordering semantics does allow for nonmonotonicity, but in only the following
way: eliminating the p worlds which were initially ranked highest can result in the
highest-ranked worlds that remain being not-p worlds. However, this explanation is
unavailable for the miner puzzle. No worlds are eliminated in the state of ignorance,
and apparently all the best worlds are block-A or block-B worlds. What the ordering
semantics needs to explain is how adding information can change the ordering of
possible worlds: what Kolodny and MacFarlane label ‘serious information depen-
dence’. This is a serious challenge, because in the Lewis–Kratzer framework, adding
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
information merely eliminates possible worlds from the domain, which cannot by
itself change the relative order of any two worlds within the domain.
The various responses recently offered on behalf of ordering semantics fall into
two categories, which both encounter serious difficulties. One (conservative) strategy
appeals to an information-sensitive (or subjectivized) ordering source, such as: in view
of what maximizes expected value.42 Plausibly speakers can and do sometimes make
normative claims about what ought to be done to maximize some salient kind of
expected value, and this would indeed generate different orderings as information
strengthens. But as a general interpretation of information-sensitive claims, this
analysis is problematic. It’s natural to suppose that the salient ends or ideals for these
claims remain the objective ends (e.g. of saving miner , miner , etc.), rather than
higher-order or subjectivized ends.43
This is easier seen in simple cases involving a single end, like Theseus’s above.
(Or equivalently, suppose blocking neither shaft instead provides a % chance of
saving all ten miners.) Suppose Theseus acquires additional information reducing the
likelihood of success if he selects door A from . to .. Given this strengthening of his
information, he ought to choose door B (. probability of success) rather than door A.
But it’s natural to say,
() In order to save his compatriots, Theseus ought to choose door B
just as it’s also natural to say, relative to his earlier information, that in order to save
his compatriots, he ought to choose A. Here we have serious information dependence
relative to a constant, objective end.
Further evidence comes from data about appropriate modal force. In the
(unstrengthened) subjective Theseus scenario, the following is true:
() In order to maximize expected value, Theseus must choose door A, and can’t
choose B.
But it’s intuitive to say rather that he merely ought to choose A, and could choose B, as in
() above, indicating that these information-sensitive ‘ought’ claims aren’t relativized
to a subjectivized end like to maximize expected value.
Relatedly, Charlow (, p. ) observes that serious information-dependence
seems to arise uniquely for “weak necessity” verbs such as ‘ought’, but not for ‘must’,
discovering intuitions that the following set of sentences aren’t consistent:44
(a) If the miners are all in A, then we must block A.
(b) If the miners are all in B, then we must block B.
(c) The miners are either all in A, or they are all in B.
(d) We must block neither shaft.
42 von Fintel (); Dowell (); Silk (). 43 Cf. Charlow (), Silk ().
44 These intuitions are disputed (e.g. Carr ), I think because in the original miner puzzle it’s too easy
to accommodate context shifts in ends/ideals; see Section .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
45 A problem: not all worlds where at least nine miners are saved are block-neither worlds. Charlow
suggests one solution (, pp. –n): the set of practical ends generated by the merging also includes
the negation of any non-actionable primary ideals; i.e. not (all ten miners are saved). However, suppose that
if we block neither shaft there is a % chance the tenth miner survives, while another option guarantees
saving only nine. This analysis then predicts we ought not to block neither shaft—because we might thereby
save all ten miners.
46 Charlow suggests this is merely infelicitous, but it seems correct to say, “It’s not true that you must
block either A or B; you could and should block neither.”
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
“chooseable” options.47 This partitions the f −accessible worlds into sets defined by
chosen option. The ordering source g ranks each partition by just the ends true
throughout it, i.e. those ends guaranteed by the option. Since saving nine miners
is the only relevant end guaranteed by any option, worlds are ranked only by their
closeness to this end. Serious information-dependence is explained by the possibility
that strengthening information may lead to additional ends being guaranteed by some
option (e.g. given that the miners are in A, blocking A is guaranteed to save ten). This
avoids the problems observed for Charlow, but shares a further problem: appeal to
“guaranteed” or “actionable” outcomes is artificially restrictive, because sometimes we
ought to pursue a chance of an outcome. Even if blocking neither shaft provides only a
% chance of saving nine miners, it plausibly ought to be chosen over a % chance
of saving everyone (Carr ).48
A variation from Graham Katz, Paul Portner and Aynat Rubinstein () offers a
fix for this problem. They propose separate epistemic (gE ) and deontic (gD ) ordering
sources, and a merging operation on these. As applied to the miner puzzle, the idea is
that valued outcomes are first ordered into tiers by how likely (actionable) they are, by
gE , and then are ordered within each tier by their closeness to the deontic ideal gD . An
option ought to be chosen iff it corresponds to the deontically best of the epistemically
most actionable outcomes. In Jennifer Carr’s probabilized miner scenario, this is the
option of blocking neither shaft, since the most actionable end (at % if one blocks
neither shaft) is saving nine. But this fails to accommodate normative judgments that
trade low probability for high value. Suppose blocking neither shaft will certainly
save exactly one miner, while blocking a shaft gives a % chance of saving all ten.
Presumably we ought to block a shaft, although saving exactly one is the uniquely
most actionable outcome.
I have not shown that serious information dependence is a fatal problem for
the ordering semantics. Its champions may reply, as Charlow does (p.c.), that my
objections merely target contingent details of the way they have proposed imple-
menting these more general strategies to address particular variations of the miner
puzzle. I think this itself reflects a problem: it remains to be shown that any of these
strategies can be developed into a concrete theory that systematically generates correct
results when applied to arbitrary cases, rather than on an ad hoc basis. In one of the
most sophisticated attempts, Cariani () concludes from these problems that the
semantics for ‘ought’ must be probabilistic, which he implements by modifying his
earlier view, above, replacing the modal base with a pair consisting of an information-
state and a probability function <i, Pr>, and assigning probabilistic content to the
premises in the ordering source. I can’t do justice to this proposal here, but one
problem it faces is explaining why ‘must’ and ‘may’ aren’t similarly probabilistic, once
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
we’ve probabilized ordering sources; Cariani suggests a further semantic rule that
these terms ignore such premises.
The point I want to make is that here, also, the challenge for ordering semantics
is to show that it can generate the same results that my simpler semantics delivers
already, without further elaboration. For the end-relational semantics provides a
straightforward explanation of serious information-dependence. Nonmonotonicity is
a basic characteristic of probability and most conditionals: strengthening information
often switches most p from true to false, unlike all p. While most philosophers already
born are now dead, for example, most philosophers already born after are not
now dead. Similarly, if the information is strengthened in a way that eliminates the
possibilities where Theseus both chooses door A and saves his compatriots, then in
most of the remaining possibility-space where his compatriots are saved, he takes door
B. (Similarly for the simplified miner case where blocking neither shaft is % likely
to save all the miners.)
Unlike the kind of nonmonotonicity provided by ordering sources in the basic
Lewis–Kratzer semantics, the most analysis explains serious information-dependence.
It also avoids all the problems observed above for versions of ordering semantics.
It invokes the same (objective) ends as objective normative claims, accommodating
the use of restrictors like in order to save his compatriots, and it makes the right
predictions about appropriate modal force. It correctly predicts nonmonotonicity
uniquely for “weak necessity” verbs like ‘ought’ and ‘should’, and not for ‘must’. It
accommodates probabilities with ease. As already observed, it doesn’t imply, coun-
terintuitively, that worlds where agents act as they subjectively ought are always the
“best” worlds: the worlds where Theseus saves his compatriots are best, even if they
aren’t those in which he acts as he ought. Finally, it predicts the right verdicts in at
least these simple single-end cases, because so long as the end is held fixed, changes
in expected value are just changes in the conditional probability of the end. The
information-sensitivity of ‘ought’ therefore provides further support for abolishing
ordering sources.
However, it may fairly be objected that celebration is premature, as I haven’t yet
shown that my semantics yields the right results for the original miner puzzle. A full
solution to this requires a solution for cases involving multiple ends/ideals, to which
I now turn.
Conflicts
One argument for an ordering source parameter remains: the need to accommodate
inconsistent premise-sets, or conflicts either between incompatible ends/ideals or
between ideals and facts. Ordering sources provide a way of reaching a consistent
domain from inconsistent inputs. But despite this advertised virtue, here too the
ordering semantics encounters a problem: the Lewis–Kratzer ordering mechanism
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
seems too crude to handle all but the simplest of normative scenarios. An ordering
source g ranks a possible world w as better than a world w iff w satisfies all the
ideals in the premise set g(w) that w does, plus at least one more. This presents some
obvious difficulties when we consider ordinary use of normative modals.
First, it doesn’t allow comparisons between two worlds if each satisfies some ideal
the other doesn’t, although this is an ordinary situation where it is often perfectly clear
that one possibility is better than the other. Suppose you have two desired ends, (e )
attending your daughter’s wedding, and (e ) watching the football match live; here
() is surely true:
() You ought to attend your daughter’s wedding rather than watch football.
It is obviously because of the greater importance of attending the wedding that it ought
to be chosen over watching football. But ordering sources don’t encode the degrees
of importance (or normative “weight”) of different ideals, treating every ideal in the
premise-set g(w) on a par. A second problem arises from the interaction of importance
with uncertainty, as the ordering semantics doesn’t allow the intrinsic importance of
ends in g(w) to be balanced against the probability or expectation of achieving them,
as modeled in decision theory.
Many creative solutions have been proposed on behalf of ordering semantics. We
might try capturing differences in importance by assigning ends to higher- or lower-
ranked ordering sources. However, this can’t capture a properly decision-theoretic
framework, in which less important ends can outweigh more important ends either by
combining their normative weight or by their greater expected (probability-weighted)
value. In Section we observed proposals to “merge” separate ordering sources (either
normative and epistemic, or first- and second- order normative), and for probability-
sensitive ordering sources. But no general account is offered of how to generate
decision-theoretic verdicts systematically, balancing importance against probability
in intuitive ways. I think these strategies are unlikely to succeed, for the reason that
ordinal scales can never fully reproduce cardinality.
There is a way the ordering semantics can generate any desired ordering of worlds,
however.49 Suppose there are three possible outcomes, to be ranked in the order o , o ,
o . This result can be generated by the premise set {o , (o ∨o ), (o ∨o ∨o )}. But this
technical fix is problematic as an account of natural language semantics. In the Lewis–
Kratzer framework, the ordering ≤g(w) is generated by a semantically fixed rule from
the premise-set g(w), which is in turn generated by the conversational background g,
a function from the context determined by the speaker’s intentions. So the speaker’s
intended set of ideals is in the driver’s seat. By contrast, this solution reverse-engineers
a premise-set from the desired ordering, which can’t be generated from any intuitive
inputs such as degrees of importance and probability. The only kind of conversational
background that could plausibly generate such a premise-set is something like: given
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
the set of ideals that by the semantic rule ≤ yields the ordering corresponding to expected
value by the salient probability and value functions. But it is implausible that this is the
kind of conversational background operative in ordinary normative thought. Whereas
Lewis–Kratzer ordering sources are usually described in highly intuitive ways (e.g. in
view of what you desire), on this fix they and their derivative premise-sets bear little
resemblance to intuitive inputs for normative thought, even in mundane cases.
In view of such problems, many writers reject ordering semantics in favor of
quantitative (“fully Bayesian”) semantics that instead have parameters for at least (i)
a probability distribution over worlds, and (ii) a value function assigning normative
weights to outcomes, and give ‘ought’ the modal force of a decision rule on expected
value.50 This has the virtue of systematically generating the verdicts of decision theory,
but there are problems here too. First, whereas an ability to accommodate decision-
theoretic judgments is a desideratum, such judgments shouldn’t be guaranteed true by
semantics alone, since competent speakers can vary in their tolerance of risk.51 Carr
() therefore proposes an additional (third) parameter for a decision rule. Secondly,
it isn’t clear how to apply these semantics to ordinary epistemic or circumstantial uses
of ‘ought’,52 which don’t seem to involve value functions or decision rules at all.
I believe that the simple dyadic semantics has been abandoned too quickly in the
face of this issue, because it supports an attractive solution which makes the extra
semantic machinery of ordering and quantitative semantics gratuitous. This solution
appeals only to resources that play a central role in Kratzer’s own application of the
ordering semantics to different uses of modal verbs: i.e. to the pragmatics (or “pre-
semantics”) of how the appropriate conversational backgrounds are fixed and premise-
sets constructed:
The way we understand a particular occurrence of a modal can be at least partly explained by
an interaction of independently motivated semantic and pragmatic principles.
(Kratzer , pp. –)
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
53 In Kratzer’s (borrowed) slogan, we need commas rather than ampersands (, p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Third, in cases where multiple ends are weighed under uncertainty, an agent’s
overall motivation will balance intrinsic preference for outcomes (importance) against
their expectability (probability). The contextually preferred end therefore needn’t be
intrinsically preferred, or what the agent most desires. In the miner scenario, for
example, our intuitive judgments plausibly reflect a contextual preference for saving
at least nine (whether at % or % odds) over saving ten (at % odds).54 This
accounts for the truth of (d): if we’re going to save at least nine miners, it is most
likely that we block neither shaft (“We ought to block neither”). In support of this
analysis, observe that if asked, “Why ought we to block neither shaft?” it’s natural to
reply, “In order to save at least nine miners”, but not “In order to save all the miners”.55
On this dyadic semantics, the complex interactions between importance and prob-
ability are assigned to the psychology of normative thought rather than to semantics.
While more work is needed to establish that these pragmatic resources are sufficient to
address every issue arising from conflicts,56 this explanation has noteworthy advan-
tages over both the ordering semantics and its decision-theoretic rivals. It respects
linguists’ concerns about the artificiality of building quantitative measures into natural
language semantics (e.g. Kratzer , p. ), yet also easily accommodates decision-
theoretic verdicts without ad hoc semantic maneuvers. This is accomplished without
either building a decision rule into semantic competence (thereby respecting the
semantic desideratum of normative neutrality), or relativizing ordinary normative
speech to a decision rule as an additional conversational background.57 It therefore
employs only the same simple semantic resources needed for epistemic and circum-
stantial uses, not postulating empty ordering sources or other redundant argument-
places. Surprisingly, a dyadic semantics might even provide the best natural language
framework for handling cases of conflicts.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Metasemantic Issues
My objections to ordering semantics may be thought to stem from an idiosyncratically
narrow conception of semantics. I’ve argued that a dyadic semantics is sufficient in part
because certain peculiarities of our use of ‘ought’ can be attributed to “pragmatics”.
But whereas I’ve assumed a narrow conception of semantics as investigating the
conventional meanings of words (etc.), others understand “semantics” more broadly, as
investigating the truth-conditions of sentences as uttered in contexts, and understand
“pragmatics” more narrowly, as concerning what is merely implicated by utterances.
Kratzer might seem to endorse this broader conception of semantics by beginning her
textbook on compositional semantics by writing that “a theory of meaning . . . pairs
sentences with their truth-conditions” (Heim and Kratzer , p. ). Such a semantics
for ‘ought’ would aim not just to identify the meaning of the word ‘ought’ itself, but
rather to give a complete theory of the truth conditions of ‘ought’ sentences as uttered
in contexts.
My claim to have offered a simpler semantics may therefore be thought confused,
on the grounds that I’ve really just relabeled what is functionally still an ordering
source as part of the “pragmatics”. The ordering source’s function in the Lewis–Kratzer
semantics is nothing other than to generate consistent premise-sets from inconsistent
inputs, and it is therefore just a formal implementation of the same kinds of contextual
pressures I’ve described in Section . Then I haven’t really shown that we can do
without ordering sources. Since the ordering-source mechanism is flexible enough to
yield any ordering we might want, my objections to it as crude also fail.
A theory of the meaning of ‘ought’ should provide more than this, however. If
the task of semantics is just to give a formal statement of the truth conditions of
sentences, then semantic theories would be nonexclusive. Kratzer provides one way
of modeling the truth conditions of ‘ought’ sentences, but these could be modeled
with equal legitimacy either by (i) more fine-grained semantic theories with three or
more parameters, which separate different kinds of inputs to orderings such as utilities,
probabilities, and decision-rules (as in Carr ), or by (ii) coarser semantic theories
with just one parameter, identifying the background f with an often complex function
from context which itself eliminated inconsistencies—as in the dyadic semantics I’ve
championed.58
This fails to provide other things that semanticists, including Kratzer, clearly want
from a semantic theory. First, it has nothing to say about whether a term is lexically
ambiguous—if it doesn’t eliminate the possibility of lexical ambiguity altogether.59
Capturing truth conditions with a single complex formula provides no evidence that
a word has a single common meaning—especially once we begin positing parameters
commonly left empty. A single rule can be provided even for paradigms of ambiguity
like ‘bank’ and ‘mole’, by positing a parameter that toggles to different values. There
58 Hall (unpublished) makes this point forcefully. 59 Compare Kratzer (, p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
would be no difference between saying that ‘must’ has a unified meaning involving two
parameters including an ordering source which is empty in alethic uses, and saying
that ‘must’ has two meanings, being either a simple dyadic operator, or a polyadic
operator taking ordering sources.
Secondly, it needn’t reveal anything about the semantic competence of ordinary
speakers, and so wouldn’t realize Kratzer’s stated aim of identifying the “abilities of a
person who has a complete grasp of the modal system” of some natural language (,
p. ). Unity is a desideratum of lexical semantics mainly because of the need to explain
how speakers are able to learn the language, and construct and interpret arbitrary new
sentences. A theory of meaning for ‘ought’ therefore owes us a compositional account
of the contribution that competent speakers implicitly understand the word ‘ought’
itself as making to the truth conditions of a sentence, alongside the contributions of
other words and contextual features.
The modal base and ordering source aren’t merely inputs to truth–conditions
in the Lewis-Kratzer semantics, but conversational backgrounds: functions fixed by
what the speaker has in mind, and made explicit by expressions like ‘given what we
know’ and ‘in view of what the law prescribes’. The modal force of ‘ought’ is the
rule that competent speakers apply in identifying the truth conditions of an ‘ought’
sentence relative to these backgrounds. Different semantic theories are therefore not
just formal or notational variants, and it is a serious challenge to the feasibility of the
ordering semantics if, as I’ve argued, it requires “conversational backgrounds” that
don’t capture what ordinary speakers have in mind, or posits a semantic rule that
doesn’t reflect ordinary modal or normative thought. If I am right about how ordinary
users of normative ‘ought’ select the relevant domain—by a complex function f picking
out both circumstances/information under symmetry of choice and a contextually
preferred end—then the ordering semantics are wrong.
Recent semantic theorizing about modals has fixated on the ideal of unity, but has
largely ignored the ideal of simplicity, which is also important with respect to the
acquisition, use, and interpretation of expressions in natural language. For simple tasks
we can expect use of simple tools; to butter a slice of bread we reach for a simple butter
knife, not for a Swiss Army knife. We should therefore be skeptical of a theory—no
matter how unifying—which claims that when someone says (e.g.) “If this apple costs
less than a quarter, then it must cost less than a dollar”, by ‘it must cost less than a
dollar’ she means (without the usual simplifying gloss making the Limit Assumption)
that for every accessible world u there is an accessible world v that comes at least as close
to the ideal determined by the (empty) ordering source g, and for every accessible world
z, if z is at least as close to that ideal as v, then this apple costs less than a dollar in z
(Kratzer , p. ).
If some modal judgments require more complex semantic resources then we should
expect to find special terms to have developed for those purposes, rather than all our
modal vocabulary developing extra machinery left unused in simpler applications, like
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Conclusion
The difficulties for the Lewis–Kratzer ordering semantics from the behavior of nor-
mative modals, despite stimulating ever-more-complicated fixes in the literature, can
all be avoided in simple and intuitive ways by instead giving up on ordering sources
and returning to a simpler dyadic framework. Instrumental conditionals, the “weak
necessity” and serious information dependence of ‘ought’, the information-sensitivity
of subjective normative claims, and even judgments weighing conflicting ends under
uncertainty all seem more amenable to the simpler dyadic semantics. While more
work is needed to show that the dyadic approach is viable, these results provide
compelling reasons for believing that the move to ordering semantics was a mistake.
Of course, given the desideratum of a fully unifying theory, the data to which semantic
theorizing must answer extends far beyond the behavior of English modal auxiliaries
in normative uses that we’ve examined here. But my suspicion is that other kinds of
use (e.g. epistemic, counterfactual) raise parallel problems for ordering semantics and
will yield to parallel solutions in a dyadic framework.
60 The existence of such readings of ‘might’ is often denied (e.g. Portner , p. ), but consider the
normative flavor of suggestive advice in ‘To go to Harlem, you might take the A train’.
61 Our lack of grammatical devices for distinguishing conversational backgrounds as modal bases or
ordering sources (‘given. . . ’, ‘in view of. . . ’, ‘in light of. . . ’ etc. are all neutral) is also suggestive, as we should
expect linguistic resources to have developed to mark such differences in logical role.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Acknowledgments
I’m indebted to many people for their help, but especially Angelika Kratzer, Matthew Chrisman,
Nate Charlow, Bridget Copley, Fabrizio Cariani, Mark Schroeder, an anonymous reviewer, the
participants of my Fall graduate seminar, and the audience at the Arché Workshop on
Normative Language at St. Andrews. Remaining errors are not theirs.
References
Björnsson, Gunnar and Finlay, Stephen () Metaethical Contextualism Defended. Ethics.
.. pp. –.
Boghossian, Paul () What is Relativism? In Greenough, P. and Lynch, M. (eds.) Truth and
Relativism. pp. –. Oxford: Clarendon Press.
Cariani, Fabrizio (a) Ought and Resolution Semantics. Noûs. .. pp. –.
Cariani, Fabrizio (b) Epistemic and Deontic Should. Thought. .. pp. –.
Cariani, Fabrizio () Deontic Modals and Probabilities: One Theory to Rule Them All.
In Charlow, N. and Christman, M. (eds.) Deontic Modality. pp. –. Oxford: Oxford
University Press.
Cariani, Fabrizio, Kaufmann, Magda and Kaufmann, Stefan () Deliberative modality under
epistemic uncertainty. Linguistics and Philosophy. .. pp. –.
Carr, Jennifer () Subjective Ought. Ergo: An Open Access Journal of Philiosophy. ..
Charlow, Nate () What We Know and What To Do. Synthese. .. pp. –.
Condoravdi, Cleo () Temporal Interpretation of Modals. In Beaver, D., Kaufmann, S., Clark,
B., Casillas-Martinez, L. (eds.) The Construction of Meaning. pp. –. Stanford CA: CSLI
Publications.
Condoravdi, Cleo and Lauer, Sven () Preference-Conditioned Necessities: Detachment and
Practical Reasoning. Pacific Philosophical Quarterly. .. pp. –.
Copley, Bridget () What Should Should Mean? [Online] Available from: https://hal.
archives-ouvertes.fr/halshs-/ [Accessed Oct .]
Dowell, Janice () Flexible Contextualism about Deontic Modals. Inquiry. .–. pp. –.
Evers, Daan () Weight for Stephen Finlay. Philosophical Studies. .. pp. –.
Finlay, Stephen () Oughts and Ends. Philosophical Studies. .. pp. –.
Finlay, Stephen (a) What Ought Probably Means, and Why You Can’t Detach It. Synthese.
.. pp. –.
Finlay, Stephen (b) Normativity, Necessity, and Tense. In Shafer-Landau, R. (ed.) Oxford
Studies in Metaethics. . pp. –. New York, NY: Oxford University Press.
Finlay, Stephen () Confusion of Tongues. New York, NY: Oxford University Press.
Finlay, Stephen and Snedegar, Justin () One Ought Too Many. Philosophy and Phenomeno-
logical Research. .. pp. –.
Fintel, Kai von () The Best We Can (Expect to) Get? [Online] Available from:
http://web.mit.edu/fintel/fintel--apa-ought.pdf. [Accessed Oct .]
Fintel, Kai von and Iatridou, Sabine () What to Do If You Want to Get to Harlem. [Online]
Available from: http://mit.edu/fintel/www/harlem-rutgers.pdf. [Accessed Oct .]
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Fintel, Kai von and Iatridou, Sabine () How to Say Ought in Foreign. In Guéron J. and
Lecarme J. (eds.) Time and Modality. pp. –. Dordrecht: Springer.
Foot, Philippa () Morality as a System of Hypothetical Imperatives. Philosophical Review.
.. pp. –.
Goble, Lou () Utilitarian Deontic Logic. Philosophical Studies. .. pp. –.
Hall, Keith (unpublished) Normative Neutrality and Kratzer’s Semantics for Deontic Modals.
Manuscript.
Harman, Gilbert () Moral Relativism. In Harman, G. and Thomson, J. (eds.) Moral Rela-
tivism and Moral Objectivity. pp. –. Cambridge, MA: Blackwell.
Heim, Irene and Kratzer, Angelika () Semantics in Generative Grammar. Malden, MA:
Blackwell.
Horty, John F. () Reasons as Defaults. New York, NY: Oxford University Press.
Huitink, Janneke () Anankastic Conditionals and Salient Goals. In Maier, E., Bary, C., and
Huitink, J. (eds.) Proceedings of Sinn und Bedeutung . pp. –.
Jackson, Frank () On the Semantics and Logic of Obligation. Mind. .. pp. –.
Katz, Graham, Portner, Paul and Rubinstein, Aynat () Ordering Combination for Modal
Comparison. In Chereches. A. (ed.), Proceedings of SALT. . pp. –.
Kolodny, Niko and MacFarlane, John () Ifs and Oughts. Journal of Philosophy. ..
pp. –.
Kratzer, Angelika () What ‘Must’ and ‘Can’ Must and Can Mean. Linguistics and Philosophy.
.. pp. –.
Kratzer, Angelika () The Notional Category of Modality. In Eikmeyer, H. J. and Rieser, H.
(eds.) Words, Worlds, and Contexts. pp. –. Berlin: De Gruyter.
Kratzer, Angelika () Modality. In von Stechow, A. and Wunderlich, D. (eds.) Semantics: An
international handbook of contemporary research. pp. –. Berlin: De Gruyter.
Kratzer, Angelika () Conditionals and Modals. New York, NY: Oxford University Press.
Lassiter, Daniel () Measurement and Modality. Ph.D. Thesis NYU.
Nissenbaum, Jon () Kissing Pedro Martinez: (existential) anankastic conditionals and
rationale clauses. In Georgala, E. and Howell, J. (eds.) Proceedings of SALT. . pp. –.
Portner, Paul () Modality. New York, NY: Oxford University Press.
Portner, Paul and Rubinstein, Aynat () Extreme and Non-Extreme Deontic Models. In
Charlow, N. and Chrisman, M. Deontic Modality. Oxford: Oxford University Press.
Regan, Donald () Utilitarianism and Cooperation. New York, NY: Oxford University Press.
Rubinstein, Aynat () Figuring out what we ought to do: the challenge of delineating
priorities. U. Penn Working Papers in Linguistics. .. pp. –.
Sæbø, Kjell Johan () Necessary Conditions in a Natural Language. In Féry, C. and
Sternefeld, W. (eds.) Audiatur Vox Sapientiae: A Festschrift for Arnim von Stechow. pp. –.
Berlin: Akademie.
Silk, Alex () ‘Ought’ and ‘Must’: Some Philosophical Therapy. In Chereches, A. (ed.)
Proceedings of SALT. . pp. –.
Silk, Alex () Evidence Sensitivity in Weak Necessity Deontic Modals. Journal of Philosoph-
ical Logic. .. pp. –.
Sloman, Aaron () Ought and Better. Mind. .. pp. –.
Snedegar, Justin () Contrastive Semantics for Deontic Modals. In Blaauw, M. (ed.) Con-
trastivism in Philosophy: New Perspectives. pp. –. New York, NY: Routledge.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Stechow, Arnim von, Krasikova, Sveta and Penka, Doris () Anankastic Conditionals Again.
In Solstad, T., Gronn, A. and Haug, D. (eds.) Festschrift for Kjell Johann Sæbø. Oslo: University
of Oslo.
Swanson, Eric () Modality in Language. Philosophy Compass. .. pp. –.
Swanson, Eric () On the treatment of incomparability in ordering semantics and premise
semantics. Journal of Philosophical Logic. . pp. –.
Thomson, Judith Jarvis () Normativity. Chicago, IL: Open Court.
Wedgwood, Ralph () The Meaning of ‘Ought’. In Shafer-Landau, R. (ed.) Oxford Studies in
Metaethics. . pp. –. New York, NY: Oxford University Press.
Wedgwood, Ralph () Objective and Subjective ‘Ought’. In Charlow, N. and Chrisman, M.
(eds.) Deontic Modality. Oxford: Oxford University Press.
Wheeler, Samuel C. III () Inference and the Logical ‘Ought’. Noûs. .. pp. –.
Wheeler, Samuel C. III () Neo-Davidsonian Metaphysics. New York, NY: Routledge.
Williamson, Timothy () Knowledge and Its Limits. New York, NY: Oxford University Press.
Yalcin, Seth () Probability Operators. Philosophy Compass. .. pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
On a Shared Property of Deontic
and Epistemic Modals
Jessica Rett
Introduction
Modals such as the English must and may can receive an epistemic or deontic inter-
pretation to the exclusion of some other interpretations (e.g. an ability interpretation,
Hacquard, ). The same is true for some modals in other languages: mesti (‘must’)
in Malay (Drubig, ), Egyptian Arabic laazim (‘must’), and Tamil modal suffixes
(Palmer, ).1
Despite this lexical tendency to treat deontic and epistemic modality as a natural
class, it’s relatively hard to find ways in which these two flavors behave similarly seman-
tically to the exclusion of others. Instead, epistemic modals seem to behave differently
from all other ‘root’ modal flavors, including deontic modality. In particular, epistemic
modals tend to scope high while root modals tend to scope low, as is evident in their
interaction with other quantifiers and in their temporal anchoring (Ippolito, ;
Hacquard, , ). Given this, from a semantic point of view, it’s surprising to
find a lexical kinship between deontic and epistemic modality across languages.
Meanwhile, epistemic modals display some semantic properties that seem best
encoded in the lexical entries of modals such as must: an anaphoricity requirement
(Stone, ); an indirect evidence requirement (von Fintel and Gillies, ); and a
flexibility in terms of whose knowledge is reflected in the modal base (von Fintel and
Gillies, ). Because must can additionally receive a deontic interpretation, these
proposals inadvertently predict that deontic must displays similar behavior. At the
moment, it is unclear whether these predictions are borne out.
1 I will focus on the contrast between deontic and epistemic interpretations on the one hand and ability
and future interpretations on the other. There are a few additional modal flavors distinguished in the
descriptive literature which are available for must: teleological (e.g. In view of his goals, John must pass the
test) and bouletic (e.g. In view of his desires, John must pass the test). I will set these aside for now, as I am
unaware of any significant way in which they differ from deontic modality. I will however briefly return to
bouletic modals (and should and ought) in §.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
I will argue that we can reconcile the linguistic evidence for a kinship between
deontic and epistemic modality with at least some recognized properties of epistemic
modality if we assume that modals such as must impose what I’ll call an ‘inference
requirement’: a requirement that the prejacent be inferred from some premises, a set
of propositions reflecting contingent and logical assumptions about the context at the
time of utterance. This proposal is effectively an extension of the epistemic modal
account in Stone () to deontic modals, following suggestions in Glass (). In the
case of epistemics, the consequence is an inferential evidence requirement (similar to
the one described in von Fintel and Gillies, ). In the case of deontics, as discussed
in Lance and Little (), this same requirement is manifested as a normative claim
that admits of exceptions.2
My evidence for treating deontic and epistemic modality as a natural class in this
way comes from the unacceptability of must (and to a large extent may) in inversion
exclamatives, exemplified in ().3
() a. *(Wow,) Must/May Sue be the murderer! epistemic
b. *(Wow,) Must/May Sue complete the assignment on time! deontic
In contrast, inversion exclamatives can be headed by modals with other interpreta-
tions, as illustrated in ().
() a. (Wow,) Can Sue dance! ability
b. (Wow,) Will Sue be mad! future indicative
c. (Wow,) Would Sue like to win the race! future subjunctive
I will argue that exclamations are unacceptable in contexts in which the speaker
has inferential evidence for the prejacent; that this is what explains the contrast in
acceptability between () and (), and gives credence to an account in which deontic
and epistemic modality both indicate that the prejacent was inferred from a salient set
of premises.
In this paper, I will be focusing on the so-called ‘strong’ necessity modal must (and
its possibility counterpart may), leaving aside the so-called ‘weak’ necessity modals
ought and should. While these weak necessity modals arguably receive epistemic in
addition to deontic interpretations (e.g. It ought to be raining by now), they differ in
substantial enough ways from strong necessity modals to warrant putting them aside,
at least for now (von Fintel and Iatridou, ). It is, however, worthwhile noting that
these weak necessity modals are in general unacceptable in inversion exclamatives.
2 I am indebted to Nate Charlow for pointing out the relevance of this work.
3 This empirical claim is sensitive to certain intonational and stress patterns; for instance, the inversion
exclamative *Man, must Sue complete the assignment! has a distinct interpretation from a superficially
similar utterance with no intonation break after man: MAN must Sue complete the assignment! (McCready,
). I will address these subtleties in §.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Evidence type
Direct Indirect
Results Reasoning
Figure
has reported evidence for a proposition φ if the speaker has learned that φ from some
third party, or if φ is a matter of common cultural knowledge (folklore), etc.
Finally, there are two ways, according to this typology, in which a speaker can
have inferential evidence for a proposition φ: the speaker can infer that φ from some
physical evidence that she interprets as an indicator of φ (the ‘results’ reading); or she
can infer that φ from other things she knows (the ‘reasoning’ reading). An example of
the former is inferring that John is home from seeing his shoes and bag in the hallway;
an example of the latter is inferring that John is home from knowing what time it is
coupled with knowing John’s schedule (and that he is reliable and punctual, etc.).
These evidential distinctions are demonstrated below with data from Tsafiki, a
Barbacoan language spoken in Ecuador (Dickinson, , pp. –). (-a) demon-
strates the direct-evidence interpretation that Tsafiki sentences which lack an overt
evidential marker receive. (-b) illustrates the Tsafiki reportative evidential. Tsafiki
has two distinct inferential evidentials: the one in (-c) is used when the speaker has
inferred the prejacent from sensory evidence, while the one in (-d) is used when the
speaker has inferred the prejacent from information already in her knowledge base.
Notably, evidential languages mark the strongest evidence in a situation in which the
speaker has more than one type of evidence for a proposition.
() a. Manuel ano fi-e.
Manuel food ate-decl
‘Manuel ate.’ (The speaker saw him.)
b. Manuel ano fi-nuti-e.
Manuel ano ate-ev -decl
‘Manuel has eaten.’ (They said so.)
c. Manuel ano fi-nu-e.
Manuel food ate-ev -decl
‘Manuel ate.’ (The speaker sees the dirty dishes.)
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
4 Although see Winans (to appear) for evidence that will is even more picky than this, at least in certain
syntactic contexts.
5 In a scenario in which John hears from Bill that it is raining outside, and Sue then asks him if it’s raining
outside, John can felicitously respond, Bill said so, so it MUST be raining outside. As far as I can tell, these
utterances are different from standard uses of reportative evidentials because they generally require focal
stress on must; they require that the evidence source be made explicit; and they are more natural if the
prejacent already have been introduced in the discourse. In these cases, the utterance also seems to reflect a
certain authority on behalf of the source. In light of the unacceptability of must in more canonical reported
evidence cases, such as the radio example above, I will tentatively suggest that these acceptable cases involve
a third-person report in addition to an inference from the source’s authority, making it a more natural fit for
inferential evidentials. This is in direct contrast to the conclusions drawn in Matthewson ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
there might be a tendency for direct evidence to be reliable and for inferential evidence
to be unreliable (you might have more confidence that John is home if you see him at
home than if you infer that he’s home from knowledge of his schedule), evidence type
does not necessarily correlate with evidence strength.
Davis et al. () address this point in great detail. Two brief examples will
suffice to illustrate the point: a speaker could have direct evidence for φ but might
be hallucinating, in which case that evidence is unreliable. Or a speaker could have
inferential evidence from nondefeasible, non-contingent premises, as in the case of
a mathematical proof, the conclusions of which are notoriously quite natural with
epistemic necessity modals, as in Therefore, x must equal .
Despite this, the account in von Fintel and Gillies () is formalized in terms of a
prohibition against reliable or strong evidence rather than a requirement in terms of
the type of evidence. Informally, it requires that the speaker’s evidence for the prejacent
must ‘fail to directly settle whether’ φ (p. ). This might, as they suggest, appropri-
ately prohibit must in cases in which the evidence source is a trustworthy report, but
it does not make the right predictions in scenarios like the two discussed above when
the strength of evidence does not correlate in expected ways with the type of evidence.
More formally, von Fintel and Gillies first define what it means for something to
count as evidence for a proposition: they use the term ‘kernel’—represented as a set of
propositions K—for privileged information that counts as evidence in the context.
They then characterize must as a prohibition on evidence that ‘directly settles’ the
prejacent () in a context of evaluation c and a world of evaluation w.
() Fix a c-relevant kernel K:
i. [[must φ]]c,w is defined only if K does not directly settle [[φ]]c
ii. If defined, [[must φ]]c,w = iff BK ⊆ [[φ]]c
The first clause of () encodes their version of the evidential restriction as part of the
definedness conditions of a must(φ) statement.6
What it means for a kernel to ‘directly settle’ the prejacent φ is clearly central here;
von Fintel and Gillies say (p. ), ‘The basic intuition is that K can fail to directly
settle whether P even though K entails whether P; epistemic modals carry an evidential
signal that exploits that gap’. As discussed above, this doesn’t necessarily rule out direct
evidence because sensory evidence from a hallucinating individual doesn’t entail the
prejacent; it also doesn’t succeed in allowing all types of inferential evidence, because
inferential evidence from non-defeasible premises does entail the prejacent in the
relevant sense.
I’ll end this section by discussing the scope of the restriction presented here. I’ve
argued that epistemic must encodes an evidential restriction, and that this restriction
6 von Fintel and Gillies (, p. ) characterize the evidential restriction of must as a presupposition
by a process of elimination; they reject the idea that evidentiality is encoded in a conventional implicature
in terms of specific details of the account of conventional implicature in Potts (). But because
evidentials generally introduce new information, the evidential restriction is arguably better thought of
as non-truth-conditional, not-at-issue content (Murray, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
commitment by the speaker. For must does not have the same kind of meaning as the
adverbs certainly, definitely, etc., which are, indeed, indications simply of the speaker’s
confidence or commitment.’ His is an account of the evidential restriction based on
evidence type rather than evidence strength, and I will adopt it and extend it in what
follows.
Part of Stone’s argument against characterizing the evidential restriction of must in
terms of reliability or validity comes from the frequent use of must to mark conclusions
in mathematical proofs, ‘where every step of reasoning meets the strictest criteria
of validity’ (p. ). Informally, Stone’s analysis of must treats it as anaphoric to an
argument that justifies the conclusion of the prejacent. Doing so, he suggests, ‘serves
[speakers’] communicative intentions to make the dependence of claims on evidence
particularly clear’ (p. ). His informal description of the meaning of must(φ) is in ().
() Some particular collection of facts A, salient in the common ground, provide
(or have provided) a decisive reason to adopt the belief that φ.
Decisive reasons for the belief that φ can be logical (i.e., can entail φ) but can also be
defeasible.
To implement his account, Stone draws on the notion of (potentially defeasible)
generalizations from accounts of default reasoning in Reiter (), Harman (),
and Simari and Loui () (later developed in Horty, ). This is in part because
his account ‘depends on a model of reasoning in which defeasible conclusions may be
adopted on the basis of packages of evidence natural enough that speakers can refer
to them in conversation’ (p. ).
Stone first establishes the nature of the set of possible premises for an inference
(p. ): ‘The formalization starts from a context κ consisting of a set K of established
propositions consisting of ground formulas KC and (logical) rules KN and an addi-
tional set of defeasible rules.’ He defines what constitutes an argument for a given
proposition: effectively, a set of propositions T (a subset of established propositions K)
counts as an argument for a proposition h in a context κ–, i.e. T, hκ – if h is entailed
by any context that includes T and standard common-ground assumptions in that
context.
Stone also provides a definition of what it means for an argument T to justify a
proposition h in a context κ (notated as κ T, hκ ). Informally, in Stone’s account,
arguments are constructed step by step from subarguments, and an argument justifies
a proposition if ‘after a certain point in this induction, no further evidence against it
comes to light’ (p. ). This notion of a justified argument is invoked in the definition
of truth in a context κ, given in (). The end result is the semantics for epistemic must in
(), characterized in terms of two semantic arguments: a sentence S and a contextually
salient argument A.
() [[S]] is true in κ if and only if κ [[S]]) (i.e., if [[S]] is justified in κ).
() [[Must S(A)]] is true in κ if and only if κ A, [[S]])κ .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
According to (), must(φ) is true iff there is a salient argument A that justifies φ.
The analysis therefore invokes anaphora to a salient argument to account for the
inferential evidence restriction encoded by epistemic must. This innovation makes a
particularly useful prediction: must(φ) can be false while φ is true. Stone’s example of
such a situation is a context in which two potentially conflicting arguments are taken
for granted (i.e. are in K), namely:
() a. A : A (recently) struck match is a hot match (unless it was wet when
struck).
b. A : Something that has (recently) been boiled is hot and wet.
In a context in which it’s known that a match has been struck, but also that it’s been
boiled, the proposition ‘The match is hot’ is true. But the sentence The match must be
hot is either true or false, depending on whether A or A is a (or the most) contextually
salient argument for must. This is illustrated in the following (felicitous) exchange:
() A: The match was struck, so it must be hot.
B: Well, no. It is hot because it was boiled. It didn’t light.
This example highlights the same point made in von Fintel and Gillies (), namely,
that determining the relative strength of φ and must(φ) is significantly complicated by
the evidential restriction of must.
The account in Stone () reinforces the claim from §. that the semantics of
must (and, arguably, other epistemic modals) should require that the prejacent follow
from an argument or generalization. The above example highlights the need to ensure
that these arguments can (but need not) be defeasible. In the next section, I’ll argue
that this inferential requirement can be used to explain the apparent affinity between
deontic and epistemic modality.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
(p. ). In the case of moral reasoning, this amounts to a) knowing whether an
action is wrong in an idealized situation, and b) knowing which (if any) circumstances
could nevertheless justify that action (a context-sensitivity referred to as ‘switch
valence’). Examples include (pp. –) causing pain (acceptable when it’s ‘constitutive
of athletic challenge’); lying (acceptable when done to Nazis, or as part of a game); and
not heeding the express wishes of competent agents (acceptable ‘in the S&M room’).
Effectively, the claim is that ‘[m]oral understanding, while drenched in exception,
is understanding of a structure, not merely a series of instances’ (Lance and Little,
, p. ). This understanding of a structure is the natural result of knowledge of
these normative generalizations along with an ability to infer from them based on
the particulars of a given context. Horty () embraces this parallel, as well, in his
formal treatment of reasons as defaults (parallel to Stone’s treatment of epistemic
modals): he argues that defeasible generalizations in ethics and epistemology help
account for the generalizations, used in natural language, that are ‘useful for our
ordinary reasoning’ (p. ). However, I take my claim that deontic reasoning involves
inferential evidence to be compatible with the view that prejacents are inferred
from moral or ethical intuitions instead of (or in addition to) general principles
(e.g. Dancy, ).
I am not in a position to evaluate these ethical theories, but I find them encouraging
for a semantic theory that attempts to explain the morphological connection between
epistemic and deontic modals via an inferential evidence restriction. Important for
the present discussion is that deontic and epistemic judgments have in common an
inference from some premises, not—at least for my purposes—that these premises be
defeasible. As Stone () already pointed out, epistemic must is perfectly acceptable
in cases in which the prejacent φ is inferred from a logical truth (or a series of logical
truths), in e.g. mathematical proofs. Lance and Little () argue that, in the case of
moral reasoning, the relevant premises are by nature defeasible, but I will take a weaker
stance, namely that the relevant premises in both cases are potentially defeasible.
There is, however, one important difference between moral judgments and epis-
temic judgments: I can make an epistemic judgment (i.e. decide that φ is true)
without inferring from a set of premises, but (as Lance and Little and others have
argued) I cannot make a deontic judgment without inferring from a set of premises
or an intuition. In other words, deontic judgments depend on deontic reasoning, but
epistemic judgments may or may not depend on epistemic reasoning. In the epistemic
realm, the difference corresponds to the difference between having direct (or reported)
evidence for φ versus having inferential evidence for φ.
The consequence is that a definition of must that requires that the prejacent be
inferred from some premises prohibits the use of epistemic must in certain contexts
(ones not involving inference), but does not prohibit the use of deontic must, because
inference is always involved in cases of moral judgment.
As a result of this difference, an account of deontic and epistemic must in which
it carries an inference requirement—i.e., an extension of Stone’s () account to
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
the deontic domain—predicts that deontic modals will not display something parallel
to an inferential evidence restriction: a commitment that φ is inferred from certain
premises will rule out some uses of epistemic must, but not of deontic must. We will
thus have to look elsewhere for evidence that deontic and epistemic modals have in
common this inferential restriction. This is the topic of §.
Below is a modal-semantic proposal for a common account of deontic and epistemic
must. It represents a basic extension of the inferential analysis of epistemic must in
Stone () to deontic modals. It characterizes must in both cases as acceptable only
if the prejacent is justified according to some salient set of premises K in the context
of utterance. I will therefore refer to these modals as ‘inferential modals’.
This approach was anticipated by Glass (), who argues that deontic and epis-
temic modals have in common that they ‘invoke a body of rules of some sort: for
deontic must, these are normative rules such as “do not litter,” and for epistemic must,
descriptive rules about how things tend to unfold, such as “wet umbrellas indicate
rain” ’ (p. ). Her focus was on addressing Karttunen’s Problem, and she encodes the
inferential restriction in the ordering source of the modal.
I will maintain Stone’s use of the variable K to range over a set of established
propositions, including logical rules and contingent (defeasible) premises.
() For some accessibility relation R and ordering source g salient in c:
a. [[must φ]]c,w is defined iff there is some salient set of propositions K such
that K justifies φ in c.
b. If defined, [[must φ]]c,w is true iff ∀w [wRw → φ(w )].
Like the analysis in von Fintel and Gillies (), () encodes the inferential restriction
as a precondition on the truth or falsity of must(φ). This means that a sentence of the
form must(φ) will be undefined if there is no salient K in c that justifies φ. Unlike the
analysis in Stone (), I will leave unspecified what it means for a set of premises
to justify a proposition, although it is entirely plausible that Stone’s account or some
equivalent can be extended to handle the deontic cases as well as his original epistemic
cases.
Instead of defining the relationship between K and φ in terms of indirectness or K’s
inability to ‘directly settle’ whether φ, () requires there be a set of propositions K that
justifies φ in c. This accounts for must’s apparent reliance on inference while prohibit-
ing neither logical entailments nor defeasible generalizations. Along with others in the
evidential literature, I will assume that some pragmatic mechanism (possibly similar to
scalar implicature) explains why elements that encode a requirement for a weak type
of evidence are generally interpreted as prohibiting stronger types of evidence (e.g.
direct evidence). Alternatively, the restriction in (-a) could be strengthened from an
existential requirement to a requirement about the strongest evidence salient in the
context.
I’ll discuss an epistemic and a deontic example that are correctly predicted to be
acceptable. Imagine that A and B are in a windowless room, watching a group of
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
people enter from outside with wet umbrellas. In this scenario, A can felicitously and
truthfully utter to B, It must be raining outside. () predicts that this utterance is
felicitous because a set of propositions K—containing the proposition that people are
entering from outside with wet umbrellas—is salient in the context of utterance and
also justifies the prejacent under normal circumstances. The sentence is true because
it holds in all the nearby worlds accessible from the world of evaluation by the salient
epistemic modal base.
() makes several further predictions about the above situation. First, that the
utterance is infelicitous or undefined if the relevant premise (that people are entering
with wet umbrellas) is not salient in the context of utterance. Imagine a different
scenario in which A sees people entering in from outside with wet umbrellas, then
walks down the hall to B’s office. In this scenario, A’s utterance It must be raining outside
is infelicitous, unless it is followed by an explicit introduction of the relevant evidence
(...I just saw some people come in with wet umbrellas).
Secondly it predicts that this reference to a set of premises—as an explicit part of
the non-truth-conditional content—can be indirectly denied in discourse, like other
not-at-issue content (Potts, ). This is demonstrated in ().
() A: (Watching people enter with wet umbrellas) It must be raining outside.
B: Hey, wait a minute, they’re washing the roof right now. So you can’t
conclude that for sure.
Thirdly, ()—by virtue of its anaphora to a salient K—preserves the prediction from
Stone’s () account that φ can be true while must(φ) is false, depending on which
K is salient. Stone’s example is repeated below from ():
() A: The match was struck, so it must be hot.
B: Well, no. It is hot because it was boiled. It didn’t light.
A’s utterance is anaphoric to a particular set of premises: the inference from a match
being struck to its being lit to its being hot. B’s response challenges the validity of
this inference while endorsing the conclusion that φ. If these predictions are right,
they support the treatment of the evidential restriction of must as anaphora to a
salient generalization, as opposed to encoding it in e.g. the ordering source (as Glass
() does).
Finally, () predicts that epistemic must(φ) is undefined in a context in which the
speaker does not have inferential evidence for φ (or has better than inferential evidence
for φ). This distinction is made in evidential systems across languages, e.g. the Tsafiki
data in (). Inferential evidence is a type of evidence; it is not characterized relative
to direct and reported evidence by its weakness. In a situation in which the speaker
looks out the window and sees it raining, her conclusion that it is raining comes from
a salient perceptual event, not from inference. In a situation in which the speaker
hears from a colleague that it’s raining, her conclusion comes from a salient speech
event. In these cases, an utterance of It must be raining outside is predicted by ()
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Ability modals don’t need to involve inference, as () shows (Austin, ; Brennan,
; Hackl, ).
() John can ride horses/a horse. ability
() is acceptable in a context in which the speaker has seen John ride a horse before,
which arguably counts as direct evidence for the prejacent. It also seems acceptable in
a context in which the speaker has been informed that John rides horses. This suggests,
in contrast to deontic and epistemic must, that ability can does not encode an inference
requirement, or isn’t an inference modal.7
Future modals, as well, don’t seem to encode an inferential restriction, although this
is less clear.8 The example in () suggests that will can be interpreted with respect to
an epistemic modal base (although this use might be restricted to generic statements;
Haegeman, ).9
() As far as I know, oil will float on the water.
It is relatively hard to conceive of what would count as direct, perception-based
evidence for a claim about the future (crystal balls aside). But we can conclude that
will is not an inferential modal because it is compatible with even the strongest, most
reliable reported evidence for an eventuality. Imagine that Mary’s daughter Sue is
graduating from high school and is deciding which university to attend. On the day
she is required to make a decision, Sue announces to Mary that she will attend UCLA.
Mary can then report to her friend:
() Sue will attend UCLA in the fall.
In this scenario, Mary has reported evidence for the prejacent, resulting in a felicitous
utterance. This suggests that the future will, unlike deontic and epistemic must, does
not carry an inferential restriction.
7 It’s interesting to note that can also receives deontic or epistemic interpretation, depending on context.
((-b) is an example of the former; as an example of the latter, Hackl () gives: John can be married to
his cousin, according to law.) These interpretations, like the deontic and epistemic interpretations of must,
seem to have an inferential requirement. I’ve suggested here that () is evidence that can—in contrast to
must—doesn’t lexicalize an inferential requirement; if this is the case, it remains to be seen why the deontic
and epistemic interpretations of can are restricted to inferential evidence.
8 See Klecha () for very compelling arguments that will is a modal. His analysis, in (i), treats will as
encoding universal quantification over worlds in the modal base M as well as existential quantification over
future times j.
(i) [[will]]w,i = λp∀v ∈ M(w, i)[∃j > i [p(j)(v)]]
Klecha offers several empirical arguments in favor of a modal analysis of will and its subjunctive counterpart
would. Primary among them is the ability of will to interact with if -clauses in conditionals and to participate
in modal subordination. Winans (to appear) presents an account of will as an evidential modal that derives
its use as a future marker from its evidential restriction, which she characterizes as non abductive.
9 There is no clear consensus on this issue, however. Giannakidou and Mari () argue that the future
does not encode epistemic knowledge but rather direct or reported partial knowledge. They argue that this
difference corresponds to a difference in speaker confidence for the prejacent (high confidence in the case of
the future, relatively low in the case of epistemics). Palmer () and De Haan () claim that the future
is not an evidential, and Winans (to appear) argues that will and must are both epistemic modals but differ
in their evidential restrictions.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
In presenting this account of deontic and epistemic must, I’ve made the following
claims: Epistemic must carries an evidential restriction requiring that the speaker have
inferential evidence for the prejacent. This evidential restriction is best characterized,
as in Stone (), as anaphora to a set of premises from which the speaker is justified in
concluding the prejacent. Deontic and epistemic must have this evidential restriction
in common (Glass, ); this correctly predicts that the salient generalization behind
both the epistemic and deontic generalizations can be denied in discourse. And it has
the potential to offer insight into why deontic and epistemic modalities are often co-
lexicalized across languages.
As I’ve suggested, there is some additional linguistic evidence in favor of this analy-
sis. There is independent reason to think that exclamations, a type of expressive speech
act, are incompatible with inferential evidence. And it seems as though exclamations—
or at least exclamations in which the modal is prominent in a certain way—are
unacceptable with deontic and epistemic modals (but not with other modals).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
10 Subject-auxiliary inversion is most commonly associated with yes/no questions in English, but
there are many ways in which inversion exclamatives differ from yes/no questions (see McCawley, ),
suggesting that the inversion in exclamatives comes about for other reasons, perhaps akin to the inversion
triggered by only (Progovac, ).
(i) Only last night *(I did)/(did I) eat pizza for the first time.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
In contrast, () is infelicitous in a context in which the speaker has only inferred
the content of the exclamative. It is infelicitous in a context in which the speaker has
examined the engine under the hood (but not e.g. seen the car perform), and it is
infelicitous in a context in which the speaker knows that John only drives incredibly
fast cars, and sees a picture of John in the car in question. This is in contrast to the
sentence That car must go fast, which (as we’ve seen) is compatible with inferential
evidence.
I interpret this as showing that inversion exclamatives carry an evidential restric-
tion: they are infelicitous if the speaker’s best evidence for their content is inferential.
Inversion exclamatives thus provide a useful test for which modal auxiliaries require
inferential evidence and which do not.
Recall that inversion exclamatives can be headed by modals as well as by the
auxiliary verb do, as in () (repeated in ()). But as McCawley () observed, they
are unacceptable with deontic and epistemic modals, as in () (repeated in ()).
() a. (Wow,) Can Sue dance! ability
b. (Wow,) Will Sue be mad! future indicative
c. (Wow,) Would Sue like to win the race! future subjunctive
() a. *(Wow,) Must/May Sue be the murderer! epistemic
b. *(Wow,) Must/May Sue complete the assignment on time! deontic
The judgments in () and () reflect the utterance of these strings as exclamatives,
which (in the case of inversion exclamatives) means that they receive an intonation
pattern distinct from that of yes/no questions, in particular a high-level intonation with
emphasis, typically manifested in lengthening effects (Bartels, ). This intonation
is brought out by such particles as wow, but these particles are not obligatory.11
We can see that the contrast in () and () tracks the speaker’s type of evidence.
Imagine that Mary is a detective investigating a murder, and she has just discovered
incontrovertible proof that Sue perpetrated the crime (say, she received the results of a
critical DNA test). In this scenario, her conclusion that Sue is the murderer is based on
an inference from a set of premises, which licenses an epistemic modal in the assertion
11 Some particles, for instance man and boy, can cause an inversion sentence to receive a reading slightly
different from an exclamative interpretation. The two interpretations are exemplified in (i), and discussed
at length in McCready ().
(i) a. Man, does Robin like cake!
b. MAN does Robin like cake!
The particle in (i-a) receives what McCready refers to as ‘comma intonation’, and corresponds to the
inversion exclamatives addressed here. In contrast, (i-b) receives McCready’s ‘integrated intonation’, and
receives an interpretation similar to but arguably distinct from exclamatives. In what follows, I will make
claims only about exclamations like (i-a), and will stick to the particle wow to attempt to bring out this
intonation unambiguously.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
in (-a). But in this scenario, despite the relevance and truth of the proposition, and
despite Mary’s having recently learned it, an utterance of (-b) is unacceptable. It
cannot be felicitously used to express surprise at the proposition that Sue must be the
murderer (or even at the extent to which she must be the murderer).
() a. Sue must be the murderer.
b. *Wow, must Sue be the murderer!
The same can be done with deontic modals. Imagine that Bill is Sue’s father and
helps keep track of Sue’s schoolwork. Sue’s teacher imposes relatively modest penalties
for late assignments, but Sue still isn’t doing well in the class. Bill has just learned,
however, that the teacher will not be accepting this particular homework assignment
late; he has also just learned that Sue needs to pass the assignment in order to pass
the class. In this scenario, his utterance of (-a) is appropriate and felicitous; he has
inferred the prejacent from a set of premises (including the teacher’s new policy and
Sue’s current grade in the class). However, he cannot felicitously utter (-b); it cannot
be used to express surprise at the proposition that Sue must complete the assignment
on time (or even at the extent to which she must).
() a. Sue must complete the assignment on time.
b. *Wow, must Sue complete the assignment on time!
In contrast is the ability interpretation of e.g. can. Imagine a scenario in which Beth
is at a club, watching Sue dance the samba extremely well. In this situation, it’s felicitous
for her to utter (-a) to Joe. It is also felicitous for her to utter (-b). In this context,
Beth’s utterance of (-b) counts as an expression of surprise that Sue can dance (or,
more accurately, about the great extent to which she can; Rett, ).
() a. Sue can dance.
b. Wow, can Sue dance!
As discussed in §., this is arguably because ability modals are not inferential
modals; there is no sense in which a claim about John’s abilities requires inference
(although it is compatible with inference). In fact, Hackl () uses compatibility with
direct evidence as a test for the difference between epistemic uses of can and its uses
as a circumstantial, opportunity or ability modal (p. ).
() shows that the modals will and would pattern with can in this respect, suggest-
ing that they are not inferential modals. As I argued in §., this is plausibly because
these modals also do not encode an inferential restriction. Imagine a scenario in
which Mary hears from her meteorologist friend that it will rain a substantial amount
tomorrow. In such a context, she can felicitously utter either (-a) or (-b).
() a. It will rain tomorrow.
b. Wow, will it rain tomorrow!
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
13 While modal auxiliaries don’t seem to be gradable, this does not mean, as Lassiter () and Klecha
() have argued, that their modal bases are intrinsically non-scalar (cf. It’s more that Sam is at home
than that Bill is at home). For our purposes, the relevant conclusion is this: in looking for an explanation
of why deontic and epistemic modals are unacceptable in inversion exclamatives, we cannot appeal to their
lack of gradability, because standard tests for gradability characterize can, will, must and may as equally
non-gradable.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
and deontic interpretations of the modals must and may, which I characterized in §
as ‘inferential modals,’ lexically encoding an inferential evidence requirement.
I have restricted my claim about inferential modals to must and may, but weak
necessity modals such as should and ought are also unacceptable in inversion excla-
matives, as () shows.
() a. *(Wow,) Should he shut his mouth!
b. *(Wow,) Ought John to turn in his homework!
Despite this, I will continue to put weak necessity modals aside; they differ from
strong necessity modals in a number of ways I cannot control for here (Iatridou and
Zeijlstra, ).14 I will also remain agnostic on the status of possible bouletic or
gnomic interpretations of these modals.
. Inferential evidence and other types of exclamation
I’ve argued that inversion exclamatives are unacceptable when the speaker has only
inferential evidence for their content. There is some evidence that this prohibition is
attributable to the nature of mirativity or speaker surprise; there is other evidence that
it is not. For the purpose of this paper, I will stipulate the inferential restriction for
inversion exclamatives. It remains to be seen how general this restriction is, and why
it exists. However, it does seem that the more tolerant an exclamation construction is
of inferential evidence, the more tolerant it is of inferential modals.
Wh-exclamatives are relatively unacceptable in inferential evidence contexts. Hold-
ing fixed the relevance of the President’s tallness (and assuming he is tall in the world
of evaluation), () is acceptable in a context in which the speaker has just seen the
President (a direct evidence context), or read online that he is (a reportative
context).
() How very tall the President is!
But it seems odd in a context in which the speaker has seen evidence from which she
can (reliably) infer that the President is tall: if she sees the height of the teleprompter at
a speech or if she sees a bathtub at the White House that was custom-made for him. In
these contexts, an epistemic modal seems much more appropriate: The President must
be tall.
And, as expected, possibility inferential modals are unacceptable in
wh-exclamatives (), while non-inferential modals are acceptable ().
() a. *How very tall the President may/might be! epistemic
b. ?What a big car she may drive! deontic
() a. What a big mountain she can climb! ability
b. What a smart student she will be! future
14 As Nate Charlow (p.c.) points out, should seems to be scalar in the way that other modal auxiliaries
aren’t: it’s fine to say Sam should wash his hands after work more than Bill should.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
more prominent than in the other types of exclamation. In other words, the syntax
of the inversion construction might prohibit the hearer from ignoring the role of
inference in the speaker’s expression of surprise, something that might be easier in the
other types of exclamation. The relative acceptability of the examples in ()—which
contain but are not headed by inferential modals—speak in favor of an explanation
along these lines.
() a. Wow, does Mary have some debt she must pay back!
b. Wow, can John work out who the murderer must be!
If this is right, then it might be possible to consider the prohibition against infer-
ential modals in inversion exclamatives to be the result of a general incompatibility of
exclamation and inferential evidence; we might explain the differences within the class
of exclamation constructions with some notion of structured content that differenti-
ates between, among other things, the content encoded in an inversion exclamative
headed by must and the examples in (). This is an encouraging possibility, but I am
unable to offer such a refinement here. I’ll begin the final section by summarizing the
arguments so far and offering brief discussion of one potential area for expanding the
discussion here: non-specific indefinites.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
15 Nate Charlow (p.c.) points out that, in this context, the free relative Wow, is whoever purchased that
ticket going to win the lottery! is acceptable. This is a compelling observation, although at least one prominent
analysis of -ever free relatives (von Fintel, ) characterizes the contribution of ever in terms of uncertainty
rather than non-specificity.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
reason to obscure the identity of his intended referent (e.g., the speaker in (-a) knows
Mary baked the cake, but does not want to reveal that he knows).
This intolerance of non-specific indefinites is a property of exclamatives generally,
as the wh-exclamatives in () demonstrate (I use ‘§’ to mark ‘specific interpretation
only’). Imagine a context in which Sue overhears a concerto played in an elevator, but
does not know who wrote it. In this context, she cannot utter the sentences in () to
express surprise at how talented its composer is.
() a. §(Wow,) What a composer someone is!
b. §(Wow,) How very talented someone is!
Like the examples in (), these exclamatives are acceptable only in a situation in which
it’s clear that Sue knows who the composer or talented individual is.
A typical example of the specific/non-specific contrast is in (), from
Von Heusinger ().
() A student in Syntax cheated on the exam.
a. His name is John. specific
b. We are all trying to figure out who it was. non-specific
Assuming, following Groenendijk and Stokhof () and Jayez and Tovena (),
that non-specific indefinites lack the property of speaker identifiability (that is, that
an indefinite is non-specific iff the speaker cannot identify the individual that satisfies
the description), then non-specific indefinites are licensed only when the speaker has
inferred the existence of an individual that satisfies the description. In other words, if
speaker identifiability is the right way to characterize non-specificity, we might be able
to cash out non-specificity in terms of inferential evidence.
The non-specific interpretation of () requires a context in which the speaker
has evidence that is informative enough to tell them someone cheated, but not so
informative that they can tell who; for example, a context in which the professor’s
answer key was discovered stolen. In such a context, the speaker must infer from this
indirect evidence that someone has cheated. It is an inference based on descriptive
generalizations about other worlds in which answer keys are stolen; it comes about
neither from direct evidence nor from reported evidence.
The non-specific interpretation of the indefinite in (), repeated below, involved a
similar pattern of inference.
() §(Boy,) Will someone from New York win the lottery!
The non-specific interpretation was natural in a context in which the speaker knew
only that New York’s lottery had hit a particularly high jackpot. The leap from this
knowledge to the claim in () involves the premise that someone will buy the
winning ticket, and that that person will be from New York. This, too, represents a
(defeasible) inference based on descriptive generalizations about the world or model of
evaluation.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
I have not done this topic or these data enough justice to make a strong claim; I
only mention them in passing as another phenomenon, that is barred from inversion
exclamatives, to which considerations of inferential evidence could be brought to
bear. If non-specific indefinites denote existential individual quantifiers that carry an
indirectness requirement, they are clear counterparts to deontic and epistemic modals,
and we are closer to understanding their unacceptability in inversion exclamatives.
The plausibility of this comparison is, I believe, bolstered to some small degree
by work in Rullmann et al. (), in which St’át’imcets (Lillooet Salish) modals
are argued to be ‘akin to specific indefinites in the nominal domain’ (p. ) and
are analyzed accordingly with choice functions. If language encodes an inferential
restriction in one domain, it’s reasonable to expect that it might mark inference in
another domain.
Acknowledgments
Thanks to Nate Charlow, Matt Chrisman, Sam Cumming, Lelia Glass, and Lauren Winans for
comments and suggestions, and to my Winter semantics class for judgments.
References
Abusch, D. () The scope of indefinites. Natural Language Semantics. . pp. –.
Aikhenvald, A. () Evidentiality. Oxford: Oxford University Press.
Austin, J. () Ifs and cans. In Unnson, J. and Warnock, G. (eds.), J.L. Austin: Philosophical
Papers, Part . pp. –. Oxford: Oxford University Press.
Bartels, C. () The intonation of English statements and questions: A compositional interpre-
tation. New York, NY: Garland.
Brennan, V. () Root and epistemic modal auxiliary verbs. Ph.D. thesis. University of Mas-
sachusetts, Amherst.
Dancy, J. () Ethical particularism and morally relevant properties. Mind. . pp. –.
Davis, C., Potts, C. and Speas, M. () The pragmatic values of evidential sentences. In Gibson,
M. and Friedman, T. (eds.), SALT . pp. –. CLC Publications.
De Haan, F. () Evidentiality and epistemic modality: setting boundaries. Southwest Journal
of Linguistics. . pp. –.
Dickinson, C. () Mirativity in Tsafiki. Studies in Language. . pp. –.
Doetjes, J. () Adverbs and quantification: degrees versus frequency. Lingua. . pp. –.
Drubig, H. () On the syntactic form of epistemic modality. Manscript. University of
Tübingen.
Farkas, D. () Specificity distinctions. Journal of Semantics. . pp. –.
Fintel, K. von () Whatever. In Jackson, B. and Matthews, T. (eds.) Proceedings of Semantic
and Linguistic Theory . CLC Publications.
Fintel, K. von and Gillies, A. () Must stay strong. Natural Language Semantics. . pp. –.
Fintel, K. von and Gillies, A. () Might made right. In Egan, A. and Weatherson, B. (eds.)
Epistemic modality. Oxford: Oxford University Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Fintel, K. von and Iatridou, S. () How to say ought in foreign: the composition of weak
necessity modals. In Guéron, J. and Lecarme, J. (eds.) Time and Modality. pp. –. New
York, NY: Springer.
Fodor, J. and Sag, I. () Referential and quantificational indefinites. Linguistics and Philoso-
phy. . pp. –.
Giannakidou, A. and Mari, A. () The future of Greek and Italian: an epistemic analysis. In
Proceedings of the Chicago Linguistic Society. . pp. –.
Glass, L. () Deriving indirectness and questioning entailment for epistemic must. Annual
Meeting of the Linguistic Society of America.
Groenendijk, J. and Stokhof, M. () Modality and conversational information. Theoretical
Linguistics. . pp. –.
Groenendijk, J. and Stokhof, M. () A pragmatic analysis of specificity. In Heny, F. (ed.)
Ambiguities in intensional contexts. Boston, MA: Reidel.
Groenendijk, J. and Stokhof, M. () Dynamic predicate logic. Linguistics and Philosophy. ..
pp. –.
Hackl, M. () On the semantics of ability attributions. Manscrupt. MIT.
Hacquard, V. () On the interaction of aspect and modal auxiliaries. Linguistics and Philos-
ophy. . pp. –.
Hacquard, V. () Modality. In Maienborn, C., von Heusinger, K., and Portner, P. (eds.),
Semantics: An International Handbook of Natural Language Meaning. pp. –. Berlin:
Mouton De Gruyter.
Haegeman, L. () The semantics of will in present-day English: a unified account. Brussels:
Royal Academy of Belgium.
Harman, G. () Change in view: principles of reasoning. Cambridge, MA: MIT Press.
Harris, J. and Potts, C. () Perspective-shifting with appositives and expressives. Linguistics
and Philosophy. . pp. –.
Heim, I. () The semantics of definite and indefinite noun phrases. Ph.D. thesis. University of
Massachusetts, Amherst.
Homer, V. () Neg-raising and positive polarity: the view from modals. Semantics and
Pragmatics. .. pp. –.
Horty, J. () Reasons as defaults. Oxford: Oxford University Press.
Iatridou, S. and Zeijlstra, H. () On the scopal interaction of negation and deontic modals. In
Aloni, M. (ed.) Logic, Language and Meaning Proceedings of the th Amsterdam Colloquium.
In Croum, C., Smith-Stark, T. C., and Weiser, A. (eds.) pp. –. Berlin: Springer-Verlag.
Ippolito, M. () The time of possibilities. Ph.D. thesis. MIT.
Jayez, J. and Tovena, L. () Epistemic determiners. Journal of Semantics. . pp. –.
Kamp, H. () A theory of truth and semantic representation. In Groenendijk, J., Janssen, T.,
and Stokhof, M. (eds.), Formal methods in the study of language. pp. –. Amsterdam:
Mathematical Centre.
Kaplan, D. () The meaning of ‘ouch’ and ‘oops’. In Cornell Conference on Context Depen-
dency. Ithaca, NY: Comell University, March.
Karttunen, L. () Possible and must. In Kimball, J. (ed.) Syntax and Semantics. Volume .
pp. –. New York, NY: Academic Press.
Klecha, P. () Positive and conditional semantics for gradable modals. In Guevara,
A. A., Chernilovskaya, A., and Nouwen, R. (eds.) Proceedings of Sinn und Bedeutung ,
pp. –. MITWPL.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Von Heusinger, K. () Specificity and definiteness in discourse structure. Journal of Seman-
tics. . pp. –.
Willett, T. () A cross-linguistic survey of the grammaticalization of evidentiality. Studies in
Language. . pp. –.
Winans, L. (to appear). Predictions and Inferences. Ph.D. thesis. University of California, Los
Angeles.
Zwicky, A. and Pullum, G. () Cliticization vs. inflection: English n’t. Language. .
pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Modalities of Normality
Seth Yalcin
Introduction
The modals ought and should are widely thought to take both deontic and epistemic
readings. At some informal level, this claim is hardly questionable. Considering a
sentence like
() Noam ought to be in his office.
It is evident that there are at least two ways to take this sentence. One of these ways
has a deontic ring, and seems to concern what is normatively called for in some sense;
the other has an epistemic ring, and seems more tied to what it is reasonable to expect
to be true in some sense.
But more could be, and usually is, meant by the idea that these modals can take
both deontic and epistemic readings. On the stronger interpretation, what is meant
is that ought and should each exhibit two kinds of reading (two “flavors” of modality)
which are common to other modals, such that they stand in certain nontrivial logical
relations with those other modals (relative to a choice of flavor). If we say in this sense
that ought and should can be epistemic, what we mean is that they share a certain
epistemically-flavored reading with other modals such as must, have to, may and might,
such that holding fixed this flavor (and relevant features of context), the corresponding
modal sentences exhibit some nontrivial logical interaction or pattern of entailment—
in particular, they can be ordered by logical strength:
must φ, have to φ ought φ, should φ may φ, might φ
Indeed, ought and should are often called “weak” necessity modals because they are
thought to be logically weaker than the corresponding “strong” necessity modals such
as must and have to, when we hold fixed a flavor of these modals as (e.g.) epistemic or
deontic. When I talk about the thesis that ought and should possess both deontic and
epistemic readings, I have this stronger sense of “reading” in mind.
This chapter has two main objectives. First, I make a case against the idea that
ought and should are capable of true epistemic readings. Anticipated in certain respects
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
by Copley (, ), we will see that these modals admit of a certain reading
distinct from, but easily confused with, the epistemic flavor of modality, which I will
provisionally call the pseudo-epistemic reading. Once we recognize the existence of
the pseudo-epistemic reading, we face the question whether there is any reason to
think that, in addition to it, ought and should can also take a true epistemic reading,
and function as true epistemic modals. I ultimately find no reason to postulate a true
epistemic reading: the pseudo-epistemic reading suffices to explain the data that have
prompted theorists to suppose that there is an epistemic reading.
My second objective is to explore the nature of the pseudo-epistemic reading. I will
suggest that this reading has to do with what is normally the case (compare von
Fintel and latridou, ), or with what it would be reasonable to expect to be true.
I ultimately end up calling the pseudo-epistemic reading the normality reading of
these modals. Inspired by Veltman (), I consider some connections between these
modals and default reasoning. I close by noting some connections between the ground
covered here and the default logic-based approach to deontic modals explored by
Horty ().
To be explicit about it: this paper is not directly about deontic should and ought. It
is about their close pseudo-epistemic cousins. But part of what animates the project
is the hope to shed some indirect light on deontic should and ought by first trying to
get clearer about the most common non-deontic reading of these modals. A relatively
unified theoretical understanding of these two sorts of reading would be desirable if
we could get it. Seeking a unified understanding, we expect there to be some structural
parallels between the pseudo-epistemic modals and their deontic counterparts.
Pseudo-epistemics
Consider a case which many would, at least initially, take as drawing out the putative
epistemic reading of the English modals ought and should. Suppose Jones is in a
crowded office building when a severe earthquake hits. The building topples. By sheer
accident, nothing falls upon Jones; the building just happens to crumble in such a way
as not to touch the place where he is standing. He emerges from the rubble as the only
survivor. Talking to the media, Jones says in wonderment one or other of the following:
() I should be dead right now.
() I ought to be dead right now.
In a similar vein, Jones’s sister says things like this:
() It’s incredible! That quake was massive. He should be/ought to be dead. We’re
so lucky he survived.
Obviously Jones is not saying that in deontically or bouletically preferred situations, he
is dead; nor is his sister saying that. That is, we don’t have here any (normal) deontic
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
reading of the weak necessity modals. Instead it seems we have a paradigm case of
the putative epistemic reading. But the modals are not functioning here like epistemic
modals, for observe that no modal which is uncontroversially epistemic works in these
sentences. Notably it would be bizarre for Jones to say:
() # I am probably dead right now.
Further, observe that if () and () were true epistemics, they would on any account
obviously entail the epistemic readings of:
() # I might be dead right now.
() # I may be dead right now.
Generally, sentences that entail defective sentences are defective themselves; () and
() are not defective; so plausibly () and () do not entail ()/(); so plausibly they do
not contain true epistemic modals. To have a temporary label, let’s call the reading that
the weak necessity modals are taking in these sentences the pseudo-epistemic reading.
The first point is that it is easy to mistake the pseudo-epistemic reading for a true
epistemic reading.
Does there exist, in addition to the pseudo-epistemic reading of the weak necessity
modals, a true epistemic modal reading of these modals? We investigate this possibility
below.
An unpublished paper by Bridget Copley (Copley, ) anticipates these observa-
tions. She calls attention to the following contrast:
() # The beer must be cold by now, but it isn’t.
() # The beer may be cold by now, but it isn’t.
() The beer should be cold by now, but it isn’t.
Copley correctly notes that these data present a problem for the idea that the should
in () is epistemic. Of course, again, this does not yet show that should cannot be
epistemic. It shows that there is reading that looks very epistemic but is not.1, 2
1 On her ultimate analysis, Copley allows for an epistemic reading of should. As noted, I will push against
this view.
2 Swanson (), building on Copley, gives another example of the contrast:
(i) They left an hour ago, and there isn’t any traffic. So they should be here by now. But they’re not.
(ii) # They left an hour ago, and there isn’t any traffic. So they must be here by now. But they’re not.
In explanation of the contrast, Swanson writes that with “They must be here by now”, the speaker aims
to add to the common ground the proposition that they are here—a discourse move in tension with the
subsequent remark “But they’re not”—whereas with “They should be here by now”, the speaker does not
attempt to change the common ground in this way (Swanson, , p. ). But this difference cannot be
the full story, since (as effectively noted by Copley ()) it would not account for the contrast between (i)
and the following:
(iii) # They left an hour ago, and there isn’t any traffic. So they might be here by now. But they’re not.
(iv) # They left an hour ago, and there isn’t any traffic. So they probably are here by now. But they’re not.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Teasing Apart
Closely related data are also noted by Thomson (). Considering the idea that
there is a reading of ought tantamount to the epistemic modal probably, she raises
the following objection:
Consider Rasputin. He was hard to kill. First his assassins poisoned him, then they shot him,
then they finally drowned him. Let us imagine that we were there. Let us suppose that the
assassins fed him pastries dosed with a powerful, fast-acting poison, and then left him alone for
a while, telling him they would be back in half an hour. Half an hour later, one of the assassins
said to the others, confidently, “He ought to be dead by now.” The others agreed, and they went
to look. Rasputin opened his eyes and glared at them. “He ought to be dead by now!” they said,
astonished. It might be thought that when they first said the words, they meant that it was then
probable that he was dead. Not so when they second said the words. By the time they second
said the words, they knew perfectly well that he wasn’t dead. (pp. –)
(The assassins could not have replaced their second remark with “He is probably dead
right now!”) The first use of ought by the assassins would be reckoned by most everyone
as the putative epistemic reading. The second use corresponds to what I have called
the pseudo-epistemic reading. But is there any good reason to insist that there are two
distinct readings of the modal here, rather than a single (pseudo-epistemic) reading?
In response to her case, Thomson recommends the following analysis:
. . . what it calls for is simply that we distinguish: if I say “The car keys ought to be on the hall
table,” then I assert different propositions, according as my state of knowledge is different. If (i)
I don’t know that the car keys are, or that they aren’t, on the hall table, then if I say “They ought
to be on the hall table,” what I mean is that it is probable that they are there. If (ii) I know that
they aren’t there, then if I say “They ought to be on the hall table,” what I mean is that it was
probable that they would be there. (p. )
The thought is that on what I have called the quasi-epistemic reading, ought (and,
I take it, should), unlike probably, can optionally be evaluated relative to a past state
of information, even when superficially appearing in a present tense construction.
Roughly, the thought is that ought and should can mean probably, but they can also
mean was probable that it would be.3
Thomson’s idea is a natural one. However, it is subject to the following counter-
examples. Suppose an urn has five marbles, one black, four white. A marble is selected
at random. We observe it is black. We can say:
() It was probable that the marble selected would be white.
It is not plausible that “They might be here by now” or “They are probably here by now” serve as a means
of adding the proposition that they are here to the common ground; yet (iii) and (iv) are marked. A natural
conclusion to draw is that the should in (i) is not a true epistemic; hence it does not entail the corresponding
epistemic might claim.
3 Thomson later hints that we might need to change this to: ought and should can mean very probably,
but they can also mean was very probable that it would be.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
4 Another way to put it might be in terms of what one would be entitled to expect. If () or () were true,
one would be entitled to expect, in some sense, that the marble selected would be white. But it is plainly not
true in this case that one would be entitled to expect that. (I am indebted here to conversation with Judy
Thomson.)
5 Notice further that the judgments flip if the situation is that you bought every lottery ticket except one.
In that case,
You should/ought to win.
seems true. Observe that the probability that you win in this scenario is the same as the probability you
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Games of chance supply easy cases where ought and should pull apart from what is or
was epistemically likely or unlikely. But we can find such examples without them. Most
Bostonians are Americans. Hannah just married Henry, a Bostonian. Is that enough
to say: Henry ought to be an American? That feels comparatively worse than: Henry
is probably American.6 Or again, suppose I am struggling to summarize a colleague’s
objection. In the course of doing so, I might say:
() I am probably misunderstanding you.
() It’s very likely I am misunderstanding you.
() I must be misunderstanding you.
By contrast, it would be queer to say:
() ? I ought to be misunderstanding you.
() ? I should be misunderstanding you.
Finlay () adopts the view that what I am calling pseudo-epistemic ought is
basically akin to probably, except:
Whereas ‘probably’ is tied total present evidence, ‘ought’ like other auxiliaries can be relativized
to any background, such as evidence at some previous time t, some subset of evidence at t, or
what some agent s believes, and so on. ()
What we have been observing, however, is that this level of context-sensitivity is prof-
ligate. Such a view would suggest that ought should be fine pretty much everywhere
probably is, and in more places besides. That is not the case. A view like Finlay’s
overgenerates. Overgeneration is what usually happens when the level of contextual
flexibility postulated is out of proportion with the facts.
The earthquake and Rasputin examples were cases where modals which are uncon-
troversially epistemic are marked, but where should and ought can take a perfectly
acceptable reading. The above examples highlight cases where epistemics such as
probably, very likely, and even must are fine, but where should and ought are marked.
There seem to be two possibilities. One is that there is no epistemic reading of should
and ought; there is only what I have called the pseudo-epistemic reading, a reading
whose nature it remains to explain. A second possibility is that should and ought are
yet capable of a true epistemic reading (in addition to the pseudo-epistemic reading),
but it is not one that is even loosely equivalent to something like probably or very
probable.
Either way, these results tell against the idea, in Kratzer (), of classifying
probably/likely semantically with ought and should as a “weak necessity” modal. It
lose in the scenario where you bought only one ticket. Nevertheless, the truth values of the corresponding
should and ought sentences diverge. This highlights the way in which oughts and shoulds come apart from
what is likely.
6 I owe this example to Judy Thomson (p.c.).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
is unlikely these operators have structurally analogous semantics. While the kind of
model structure appropriate to probability operators is currently a matter of active
investigation (Yalcin, , b; Lassiter, ; Holliday and Icard, ), the exam-
ples above make it appear doubtful that ought and should possess a reading that is to
be articulated in terms of this kind of structure, as would naturally be expected if they
possessed true epistemic readings. The pseudo-epistemic reading of weak necessity
modals appears to involve some different kind of structure.
I take it that we should not posit possible readings of modals beyond necessity; and
I am unaware of data showing that, above and beyond the pseudo-epistemic reading
of weak necessity modals, we must also recognize a true epistemic reading. I do not
claim to have shown that this further reading does not exist. But the burden of proof is
on those who would wish to recognize such a reading. Meanwhile it seems worthwhile
to understand how far one can get without supposing there is such a reading.
Here I should briefly pause to defend against one kind of objection. Kratzer ()
suggested that it is possible to explicitly control the restriction of a modal with an in
view of -phrase. If one has this view, one will think it is trivial to show that there can be
an epistemic reading of the weak necessity modals. One merely has to cite an example
like this:
() In view of the evidence, Bob ought to be in his office.
But I deny that () settles anything, for I deny it is generally true that in view of -
phrases systematically semantically control the interpretation of modals. The connec-
tion between the interpretation of modals and in view of -phrases is loose and indirect
at best, as the following examples illustrate:
() In view of what we know, you can’t be parked here. (deontic reading of the
modal clearly available)
() In view of what the tribal laws are, the guy performing the ceremony must be
the chief. (epistemic reading of the modal clearly available)
() In view of what the laws say, we should protest them. (deontic reading
preferred, but not one relative to what the laws require)
Examples could be multiplied. It is not at all obvious that there is any canonical overt
realization of the restriction of a modal. So there is no real obstacle here to the view
that the modal in a construction like () is never a true epistemic.
Let me also defend against another sort of objection. Building on Horn (),
Copley () observes the following contrast:
() a. #Xander must be there, in fact, he should be.
b. Xander should be there, in fact, he must be.
One might think that this is evidence that should has a reading on which it is strictly
weaker than epistemic must. But that would be much too fast. Observe:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Regrouping
I have recommended against Kratzer’s grouping of probably with the weak necessity
modals, and against the idea of allowing that there is an epistemic reading of these
modals. Still, some might suppose that the apparatus of Kratzer (, ) yet has the
resources to classify the pseudo-epistemic reading. Let me consider one possibility in
this vein. First I briefly review the main features of Kratzer’s approach.
On Kratzer’s system, modals induce quantification over some partially ordered,
restricted class of worlds. The relevant partially ordered, restricted class of worlds is
generally fixed by two ingredients (what Kratzer calls conversational backgrounds): a
modal base and an ordering source. Formally these are both functions from worlds to
sets of propositions. The modal base fixes the restricted class of worlds quantified over,
as a function of the evaluation world: this will just be the set of worlds making all the
propositions delivered by the modal base true (the intersection of the propositions
given by the modal base). The ordering source is then used to induce an order on the
worlds in this domain.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Kratzer originally hypothesized that there were two principal kinds of modal base
(corresponding roughly to the traditional linguistic distinction between epistemic and
“root” modals): epistemic and circumstantial. Epistemic modals have epistemic modal
bases. These deliver, relative to an evaluation world, a set of propositions known. Non-
epistemic modals have circumstantial modal bases. As I discuss further below, these
are somewhat harder to give a generic characterization of; but the basic idea is that
given a world, they deliver a set of true propositions characterizing some contextually
determined set of relevant circumstances at that world.
Kratzer further hypothesized that the two kinds of modal base are apt to be ordered
by characteristically different kinds of ordering sources, and that these differences give
rise to the various possible “flavors” of modal operators we observe. Circumstantial
modal bases, when ordered at all, can combine with deontic, teleological (or goal-
oriented), or bouletic ordering sources, and perhaps others. These ordering sources
yield sets of propositions characterizing what is required, what is aimed for, what is
wished for, and so on; and they induce a ranking of the circumstantial modal base
worlds according to how well the worlds conform to what is required, aimed for,
wished for, etc. Give or take some lexical idiosyncrasies, root modals are assumed to
potentially take any of these flavors. Kratzer proposed that epistemic modal bases, by
contrast, were apt to combine with either doxastic or stereotypical ordering sources,
ranking the epistemically accessible worlds according to how well they conform
to certain (contextually fixed) beliefs or to stereotypical situations, respectively. In
particular, a stereotypical conversational background yields, relative to a world, a set
of propositions characterizing the normal course of events in that world.
I set aside the idea that there exists a specifically doxastic reading of epistemic
modals, since there appears to be no evidence for this idea.7 Stereotypical ordering
sources play the more dominant role in the discussion of epistemic modals in Kratzer
(). On a natural reading of her account, stereotypicality is what primarily orders
the epistemic domain (the worlds quantified over by epistemic modals): properties of
stereotypical ordering sources are what grounds the differences between the grades of
epistemic modality (must versus probably, and probably versus might).
Against this, I want to suggest that we work under the hypothesis that stereotypical
ordering sources never play a role in the semantics of epistemic modals. Epistemic
7 The idea that there exists a doxastic reading of epistemic modals is mentioned only once by Kratzer
(), and in passing, without evidence. Portner (), following Kratzer, more explicitly maintains that
there is such a reading of epistemic modals. He supplies a number of examples which, he suggests, illustrate
modals interpreted with an epistemic modal base and a doxastic ordering source. But, for these examples, it
appears possible to hold either that the relevant epistemic modal base is not ordered, or that it orders itself,
or that the order is a primitive feature of the relevant information state. So these examples do not motivate
the idea of a doxastic reading.
(Of course, epistemic modals under explicit belief operators arguably quantify over belief worlds
(Stephenson, , Yalcin, ); but that is orthogonal to the present issue, which is about what kinds
of information can order the worlds quantified over. Moreover, we are talking about what is possible for the
interpretation of modals in abstraction from explicit semantic shifting.)
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
modals are not sensitive in any special way to facts of normality or stereotypicality.
These modals are indeed sensitive to bodies of information, and a body of information
may well be modeled by a set of worlds together possibly with some kind of ordering
on those worlds. But the relevant ordering is not well-interpreted as anything like an
ordering by stereotypicality or normality. The relevant notion is instead likelihood,
broadly construed. I take it this suggestion is already motivated by the examples given
above. The epistemic modality primarily concerns what might actually be the case,
and what is likely or unlikely to be the case. It does not concern what is normally
the case.
Of course, truths about what is normally the case may affect what you think
might be the case. But truths about anything can affect what you think might be
the case. The point is that epistemic modality does not have a special semantic
connection to the notion of normality. Epistemic modals are not sensitive to normality
orderings.8
(To some, this will seem like hairsplitting about how to use the technical term
“epistemic modal.” It is not. Epistemic modals are semantically distinctive in ways that
set them apart from other modals in significant respects (see for instance Groenendijk
et al., ; von Fintel and Iatridou, ; Yalcin, , ; Anand and Hacquard,
; Bledin, ). The evidence strongly favors the thesis that the epistemic modals
form a natural class.9 The thesis that ought and should do not admit of epistemic
readings—readings that belong in this class—is substantive and nonterminological.)
Still, Kratzer was not wrong to suggest that some modals can be sensitive to facts
of normality or stereotypicality. This was an important insight. Evidently, the weak
necessity modals can be understood in this way: that is what I have already suggested
about the pseudo-epistemic reading. But for reasons already reviewed, if we stay within
the Kratzerian approach to modality, we do not want to interpret this reading of
the modals as involving quantification over the epistemically accessible worlds (the
worlds quantified over by epistemic modals); that is, we do not want to say that these
modals take an epistemic modal base, and thus are epistemic modals. Again, this
is because we do not want ought φ/should φ to have a reading on which it entails
(epistemic) might φ, or is entailed by (epistemic) must φ. For that runs counter to
our examples.
We have noted that Kratzer bifurcates the space of modal bases into epistemic
and circumstantial varieties. So if we want to stay within the Kratzerian paradigm,
we should say that that the pseudo-epistemic reading of weak necessity modals
corresponds to the combination of a circumstantial modal base with a stereotypical
ordering source.
8 Though there may be a connection between normality and epistemic modality at the level of default
inference. See §.
9 I favor the thesis that the puzzles discussed in Yalcin (, ) are diagnostic of epistemic modals.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Difficulties
But the “circumstantial modal base/stereotypical ordering source”-analysis faces some
difficulties. Relevant for assessing it is the question whether and what other modals
can take the pseudo-epistemic reading. Consider first the strong necessity modals.
Suppose Jones had instead said:
() ? I must be dead right now.
() ? I have to be dead right now.
The epistemic and deontic readings of these modals are false, owing to the facts of the
context; so we might naturally expect the pseudo-epistemic readings of these modals
to shine through here. Surely Jones is dead in all the relevant normal (circumstan-
tially accessible) worlds—not just, say, those that are especially normal according to
the relevant normality ordering (as Kratzerians might naturally analyze ought and
should11 ). But these constructions are plainly marked. There are two possibilities: (a)
must and have to cannot take the pseudo-epistemic reading; or (b) on the pseudo-
epistemic reading, the strong necessity modals entail their prejacents.
10 Can’t we say something like, “In light of the information in the report, Jones must be dead. But he
isn’t—he’s right there!”—thereby selectively deleting whatever items of information entailed that Jones is
dead? But I take it that an operator like In light of the information in the report is a (hyper)intensional
environment which semantically shifts the state of information relevant for the epistemic modal under it
(see, e.g., Hacquard, ; Yalcin, ; Stephenson, ; Anand and Hacquard, ). The claim made
in the main text concerns bare epistemic modals sentences where no compositionally induced semantic
shifting takes place. There is no tension here.
11 See for instance von Fintel and Iatridou (), in which weak necessity modals are construed as having
a strictly smaller domain of quantification than the corresponding strong necessity modals: “strong necessity
modals say that the prejacent is true in all of the favored worlds, while weak necessity modals say that the
prejacent is true in all of the very best (by some additional measure) among the favored worlds” (p. ).
They implement this idea in a Kratzerian framework by hypothesizing that weak necessity modals generally
involve two ordering sources rather than one (with the second serving to impose a further restriction not
present in the case of the corresponding strong necessity modals). See also Rubinstein ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Either proposal seems unexpected from the point of view of the Kratzerian analysis
just described. From a theoretical point of view, (b) is not motivated. If the set of
worlds that pseudo-epistemic have to and must quantify over is some set of worlds
that are normal relative to the evaluation world, there is no conceptual reason that the
evaluation world needs to be in this set. The world can be a very abnormal place, even
relative to its own standards of normality.
One might propose that the strong necessity modals simply universally quantify
over the circumstantial modal base worlds, and not over any further restriction of that
modal base by normality or stereotypicality. Since the actual world must be in this set
(the circumstantial worlds), this would indeed compel the strong necessity modals to
entail their prejacents. Note that this is, in effect, to accept (a): while weak necessity
modals can get a special pseudo-epistemic normality reading, strong necessity modals
would only get a pure circumstantial reading.
Option (a) is also prima facie unattractive from the point of view of the basic
Kratzerian framework. In the context of this account, there is no principled reason
why there should not be a pseudo-epistemic reading of the strong necessity modals—
in the way that the system predicts that there are, for example, deontic readings of
both weak and strong necessity modals. It is a familiar point that often the most
important evidence for linguistic theory is negative evidence—the unavailability of
certain readings or constructions. As we have been observing throughout, modals are
in fact highly restricted in the readings that they can take. The absence of a pseudo-
epistemic reading for the strong necessity modals requires explanation in the context
of Kratzer’s theory.
There are further sources of discomfort with option (a). We are considering the
hypothesis that the pseudo-epistemic reading corresponds to a circumstantial modal
base and a stereotypical ordering source. Other things being equal, then, a pure
circumstantial reading of a strong necessity modal should entail a sentence with a
pseudo-epistemic weak necessity modal. But the intuitive evidence for this entailment
is slim. Consider one of Kratzer’s examples of a case of a strong necessity modal getting
a pure circumstantial reading:
() I must sneeze. (pure circumstantial)
() I have to sneeze. (pure circumstantial)
Neither of these intuitively entails either of:
() ? I ought to sneeze. (pseudo-epistemic)
() ? I should sneeze. (pseudo-epistemic)
One might reply that the pseudo-epistemic weak necessity sentences presuppose (or
entail) that their prejacents are false in some of the relevant circumstantial worlds,
and this blocks the entailment. Whether or not this suggestion is artificial depends
on whether this kind of constraint applies also to other readings of weak necessity
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
modals. It seems not to be widely assumed in the deontic case, where strong necessity
modals are generally thought to entail the weak ones (indeed, that is where the jargon
of “strong” versus “weak” comes from).
In any case, a further problem is that it is just strange to say these in the relevant kind
of context. Why? Suppose you sense a sneeze coming on, but you are not convinced
it is inevitable. Why don’t () and () seem like natural words of warning, slightly
weaker than the warning conveyed by () and ()? (As deontic ought is thought
to be weaker than deontic must.) We have the intuition that the flavor of modality
is qualitatively different—not just weaker—when we move from the strong necessity
modals here to the weak ones. This is surprising. Even if the pseudo-epistemic readings
of () and () are marked without some additional setup in this kind of scenario, we
naively might have thought that there should be a pure circumstantial reading of ought
and should available, such that () and () can be appropriate when you feel a sneeze
approaching, in the way () and () are. But such a reading seems not to be available.
This requires explanation.12
The evidence so far reviewed suggests—tentatively—the following: strong necessity
modals do not have a pseudo-epistemic reading, and weak necessity modals do not
have a pure circumstantial reading. Further, it is not clear whether circumstantial
strong necessity modals entail the pseudo-epistemic weak necessity modals.
We should wrap up this discussion of the way that circumstantial modality might
interact with the pseudo-epistemic modality by briefly considering the situation with
possibility modals. Kratzer () suggests that can can take a pure circumstantial
reading. If the pseudo-epistemics involve a circumstantial modal base, it would
therefore be natural to expect pseudo-epistemic modal sentences to entail sentences
with pure circumstantial possibility modals. So suppose Jones says:
() ? I can be dead right now.
This sounds like an strangely worded offer, from Jones, to kill himself.
Though odd owing to the facts of the particular example, () arguably does have
a reading on which it is strictly speaking true (and on which the entailment from the
corresponding pseudo-epistemic weak necessity claim goes through). The modal here
takes an ability-like reading, which some would run together with, or closely tie to, the
circumstantial reading. Interestingly, however, this possibility modal (like may) does
not seem capable of being used to express the (true, contextually appropriate) thought
12 The pseudo-epistemic readings of () and () also do not come very naturally, but they do at least
seem clearly available. It is easier to hear if we fill in some special circumstances—for example, that the
speaker has just inhaled some sneezing powder.
Seemingly also relevant to the availability of the pseudo-epistemic reading is the aspectual structure of
these constructions. Notably, in the presence of a telic verb, the pseudo-epistemic reading is rather more
natural with a progressive infinitive, or with some explicit temporal marking:
() I ought to be sneezing.
() I should sneeze any minute now.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
that among the relevant normal worlds is a world where Jones is dead. Many speakers
can achieve something like that reading with:
() I could be dead right now. (pseudo-epistemic?)
This tends as a default to get a pure epistemic reading (on which it is marked), but
(especially with the right tone of amazement) it can receive the pseudo-epistemic
reading, or something intuitively close to it.13 This reading is heard as equivalent to
something along the lines of:
() I could have been dead right now.
() I might have been dead right now.
—where these are not expressing the philosopher’s mere metaphysical possibility,
but rather seem instead to effect existential quantification over some nearby normal
worlds.14
Where does this leave us? The idea that the pseudo-epistemic reading involves a
circumstantial modal base and a stereotypical ordering source might yet be massaged
into working. I have raised some questions about it, but I have not shown that it cannot
be done. A successful development would presumably say more about the character
of circumstantial modal bases. It is fair to say that there is not a tremendous amount
of clarity about the nature of circumstantial readings in the literature. Kratzer herself
now seems ambivalent, if not skeptical, about clearly defining this category from a
semantic point of view (see e.g. Kratzer, , p. ). Still, it may be possible to do, for
all I have said.
Nevertheless, in the following sections, I wish to explore a different line of analysis.
Normality
In light of the examples discussed above, I want to consider the idea that on the
pseudo-epistemic reading, ought φ and should φ are rather directly connected to
something about the way things normally unfold. As a first attempt, let us consider
the following idea:
It should/ought to be that φ ≈ Normally, φ
We could also put the idea in a way more aligned with the fact that these modals
generally take infinitival clauses:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
first pass
α should/ought to F ≈ It is normal for α to F.
(It is of course highly doubtful that we will be able to exactly paraphrase pseudo-
epistemic ought and should somehow in terms of the word normal. Still, exploring
how close we can get may teach us something.)
The following examples seem to illustrate a prima facie mutual entailment, and
hence motivate first pass:
() Normally, Bob is in his office right now.
It is normal for Bob to be in his office right now.
() Bob should be/ought to be in his office right now.
A difficulty for the first pass is that Rasputin-like cases does not fit this paradigm
as nicely:
() Rasputin should be/ought to be dead now.
() ? Normally, Rasputin is dead now.
() ? It is normal for Rasputin to be dead now.
Normally φ seems to suggest that φ corresponds to some kind of repeatable event, or
a situation of a repeatable type; but, at least out of context, Rasputin’s death does not
strike us as such an event or situation. Yet the pseudo-epistemics are (as already noted)
fine here, even without any special setup. Similarly for the earthquake case:
() ? It is normal for me to be dead now.
The sentences are improved if we combine a normality operator with the sort of
morphology usual to counterfactuals. We can say:
() Normally, Rasputin would be dead now.
() It would be normal for Rasputin to be dead now.
() It would be normal for me to be dead now.
We might therefore adjust the proposal: α ought to F and α should F, on their pseudo-
epistemic readings, mean something roughly equivalent to:
second pass
α should/ought to F ≈ It would be normal for α to F.
(This brings to mind von Fintel and Iatridou’s () observation that crosslinguisti-
cally, ought is often expressed by combining a strong necessity modal with counterfac-
tual morphology. It also brings to mind Swanson’s suggestion that this kind of should
concerns what “one would naturally expect” to be the case (Swanson, , p. ).)
The following seems to be a problem for this approach. On a Monday at the office,
I can say:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
() Bob normally doesn’t come into the office on Mondays; he stays home. So it
would be normal for Bob to be home now. But I see that his car is parked
outside. So he ought to be in his office.
But according to the second pass, there should be a problem here: the sentence It would
be normal for Bob to be home now and the sentence He ought to be in his office should
be in conflict. But evidently they are not. We can affirm them both.
What to say? Perhaps the following: in (), the normal is talking about: normal
for a Monday. By contrast, I tentatively conjecture that the pseudo-epistemic modality
involves a notion of normality which is in some sense “all relevant things considered”.
The ought in () concerns what is normally true on a Monday where Bob’s car
is parked outside—and perhaps factoring in further relevant contextually supplied
information, too. Although Bob is not normally in his office Mondays, it may yet be
true that Bob is normally in his office on those Mondays in which his car is parked
outside. Perhaps this latter fact, or something in the near vicinity, explains why we
accept “He ought to be in his office” in ().
This leaves us with something like:
third pass
α should/ought to F ≈ It would be normal, all relevant things considered, for α to F.
Of course, we should like to say much more about what makes for normality in this
sense. This seems to be a difficult problem, the surface of which I have barely scratched.
The only point I wish to make here is that there seems to be no reason why this kind
of reading of ought and should could not be captured in terms of simple universal
quantification over a domain of normal worlds—the worlds where, roughly, things
in fact unfold as they (all relevant things considered) normally should. Tentatively
embracing this possibility, I will now switch to calling the pseudo-epistemic reading
the normality reading. I take it this reading has a normative quality. Roughly it
expresses something about what one is, or would be, entitled to expect to be true.
Defaults
Pseudo-epistemic modality now distinguished from epistemic modality, I will try to
take some steps towards clarifying what exactly the relationship is between these
modalities. In the next two sections, I explore two possible points of contact. First,
it may be that the domain of normal worlds is fixed as a function of the set of worlds
relevant for epistemic modals. Second, there may be important relations of default
inference from the pseudo-epistemic modality to the epistemic modality.
Start with the first idea. Even if ought and should, on their normality readings, are
taken to quantify universally over a domain of normal worlds, it may yet be that
this domain of normal worlds is one that is somehow determined as a function of
the set of worlds that epistemic modals quantify over. We could probe this question
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Observe that Pφ φ, since the optimal set will be a nonempty subset of any
nonempty information state.20 Since as recently noted, Oφ φ, we know Oφ Pφ.
That is the desired result. We would not approve of Jones’s reasoning like this:
I should be dead right now.
⇒ Presumably, I am dead now.
Still, as noted above, the movement from a normality should/ought-claim to a claim
about what is presumably true plausibly has default license. The intuitive problem with
the above reasoning is that in Jones’s context, the epistemic possibility that he is dead
is eliminated; consequently the corresponding presumably-claim is not licensed. To
consider what patterns of inference are default licensed, we should define a notion of
consequence that restricts to states of information incorporating the premises and the
premises only—states that are informationally minimal. We may do this as follows:
Definition . The minimal states i incorporating a set of sentences are the states
i such that i incorporates every element of , and there is no i such that (a) i
incorporates every element of , and (b) si ⊂ si .
Definition . ψ is a default consequence of a set of sentences , D ψ, just in case
all the minimal states incorporating incorporate ψ.
Now we can observe that although Oφ Pφ, Oφ D Pφ. For example, return to:
Noam ought to be in his office.
⇒ Presumably, Noam is in his office.
The minimal states incorporating the premise are those where the information state
component is just W. These are all the states i such that the proposition that Noam is
in his office is true throughout ni . At these states ni coincides with oi , so the inference
is licensed.
Observe next that although Oφ D Pφ, {Oφ, ¬φ} D Pφ. (So D is nonmono-
tonic.) For example, the following is not default licensed:
Noam ought to be in his office.
Noam is not in his office.
⇒ Presumably, Noam is in his office.
The minimal states incorporating the premises are those where the information state
component is just the set of worlds where Noam is not in his office. This means that
at each of these states i, oi is some subset of these worlds. The conclusion drawn is not
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
incorporated by any of these states. (On the contrary, they all incorporate “Presumably,
Noam is not in his office”—intuitively a correct result.)
In this chapter so far we have seen many reasons to doubt that the following is a
valid inference:
Noam ought to be in his office.
⇒ Noam is probably in his office.
But now we can ask a different question. Is this inference default valid? That idea is
considerably more plausible. If the only relevant information one has is the premise,
the conclusion does seem like an acceptable one to (tentatively) draw. Vindicating this
pattern as a default inference would, it seems, do much to accommodate the intuition
that there is some nontrivial connection between what normally ought or should be
and what is likely. (And correspondingly, it would further subtract from whatever
motivation remains for recognizing a bona fide epistemic reading of ought and should.)
There are various ways in which one might vindicate this pattern as a default
inference. It depends on what semantics for probability operators one assumes. One
possibility is to think of information states as equipped with a probability measure
Pr conditionalized on the state (as discussed for instance in Yalcin (c)). Then one
could require that in minimal states i incorporating only information about normality,
Pri (ni ) > . (or whatever threshold is thought to be appropriate to license probably-
claims in general).21 This would be to say that in abstraction from factual information,
it is likely that the world is normal.
Information-sensitivity, Nonfactualism
About Normality
We have yet to say what is going on with:
() Noam ought to be in his office.
() If Noam was arrested on the way to work, he ought to be in jail.
Above we noted some theories on which indicative if-clauses are understood to shift
the information state relevant for evaluating the consequent of the conditional. Let
me give a basic example of this kind of semantics. I will restrict attention to indicative
conditionals with nonmodal antecedents.
Definition . For nonmodal sentences φ, the proposition expressed by φ, [φ], is {w :
[[φ]]w,i = } (for arbitrary choice of i).
Definition . For nonmodal φ, the nearest state to i = s, incorporating φ, i + φ,
is the pair sφ , , where sφ is s ∩ [φ].
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
in how one takes the world to be. (Compare (Gibbard, , chapter ), discussing
normative discourse.)
We could put it like this: one component of one’s state of mind determines an expec-
tation function f , which maps information states to expectation patterns. This reflects
one’s conception of how the information one possesses governs the expectations it
would be appropriate to have. The important feature of the expectation function is that
it is a function: given an information state, we do not take there to be multiple, equally
legitimate expectation patterns. Disagreements between agents about what (normally)
ought to or should be the case may be traceable to differences in the ways that each
agent takes the world to be (their information); but they may also be traceable to a
more fundamental difference in the expectation function each agent endorses.
Taking this approach, we would do well to model expectation-laden information
states as pairs s, f of an information state and an expectation function. The expec-
tation pattern of an expectation-laden information state i is then just f (s). This will
determine the normal set for the state, and the semantics will be as above. In principle,
this approach can capture the shiftiness apparent in (), as the expectation pattern will
now shift with the information.
It remains to explore what formal constraints (if any) govern f . The matter deserves
separate exploration. But I note that one constraint we surely do not want is the
requirement that the normal worlds determined by a given state s, f be a subset of s.
This would make it the case that Oφ φ—the incorrect result, as we have repeatedly
observed.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
rule corresponding to Birds fly default recommends the conclusion that x flies, given
the information that x is a bird.) Such rules can also be used to capture which actions
are default licensed given some information. (Rough example: the default rule corre-
sponding to Keep your promises recommends that I show up, given that I promised
to.) Horty uses the latter application to clarify deontic oughts. To say φ ought to be the
case is to say that the realization of φ is default recommended, in a sense formalized
with the notion of a default theory.22 Together with the work above, this would seem
to point in the direction of a unified treatment of deontic and pseudo-epistemic oughts
as devices for expressing defaults. I don’t suggest I have achieved that unification here,
of course; I suggest only that it emerges as a promising avenue to pursue.
While Horty’s discussion of oughts is trained chiefly on the deontic reading and
on surrounding issues about practical reasons and deliberation, it is plain he intends
his style of analysis to apply to the notion of an epistemic reason and to theoretical
deliberation, and to what we have been calling pseudo-epistemic oughts. (Not surpris-
ing, since perhaps the standard application of default logic is to theoretical reasoning.)
From a linguistic point of view, it would be worthwhile to investigate the connections
between a semantics for pseudo-epistemic oughts of the sort sketched above and one
based on default theories of Horty’s variety—to see which differences are notional and
which substantive, and where the data point in the cases where they come apart. I leave
this for future work.
Stepping back, I would be inclined to take Horty’s analysis, and the analysis of this
chapter, in what I would call an expressivistic direction, in the style of Yalcin (a). If
it is right that we need something like expectation-laden information states or default
theories to model the language of oughts and shoulds, it needn’t follow that this talk
serves to literally describe expectation-laden states of mind, or the default we embrace,
or our habits of belief or action. Unembedded, these kinds of sentences do not describe
the defaults we embrace; rather, they serve to express those defaults. Yalcin (a)
offers a recipe for making precise of this sense of “express".
Defaults, as formalized here or by Horty, might also sensibly be called norms.
Characteristic of norms, after all, is their defeasibility. Switching terminology, then,
the picture that emerges is that ought and should correspond, in a relatively technical
sense, to modalities of norms, on both their deontic and pseudo-epistemic readings.
Acknowledgments
Thanks to Nate Charlow, Matthew Chrisman and Sophie Dandelet for detailed comments
on an earlier draft. Thanks also to Fabrizio Cariani, Wes Holliday, Daniel Rothschild, Aynat
Rubinstein, Judy Thomson and to participants in my minicourse on modality and dynamics at
the Hebrew University of Jerusalem in May of for helpful discussion.
22 Actually, he formalizes several possible senses, discussing the prospects for each.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
References
Anand, Pranav and Hacquard, Valentine () Epistemics and attitudes. Semantics and Prag-
matics. . .
Bledin, Justin () Logic informed. Mind. . . pp. –.
Cariani, Fabrizio () ‘Ought’ and resolution semantics. Noûs. . . pp. –.
Charlow, Nate () What we know and what to do. Synthese. . . pp. –.
Copley, Bridget () So-called epistemic should. Snippets. . pp. –.
Copley, Bridget () What should ‘should’ mean? Unpublished manuscript. [Online]
Available from: http://halshs.archives-ouvertes.fr/docs/////PDF/copley.should.pdf.
[Accessed Nov .]
Finlay, Stephen () Confusion of Tongues: A Theory of Normative Language. New York, NY:
Oxford University Press.
Fintel, Kai von and Iatridou, Sabine () Epistemic containment. Linguistic Inquiry. . .
pp. –.
Fintel, Kai von and Iatridou, Sabine () How to say ought in Foreign: The composition
of weak necessity modals. In Guéron, Jacqueline and Lecarme, Jacqueline (ed.) Time and
Modality,. pp. –. Springer.
Gibbard, Allan () Thinking How to Live. Cambridge, MA: Harvard University Press.
Gillies, Anthony () Epistemic conditionals and conditional epistemics. Noûs. . .
pp. –.
Gillies, Anthony () Iffiness. Semantics and Pragmatics. . . pp. –.
Groenendijk, Jeroen, Stokhof, Martin, and Veltman, Frank (). Coreference and modality.
In Lappin, Shalom (ed.) Handbook of Contemporary Semantic Theory. pp. –. Malden,
MA: Blackwell.
Hacquard, Valentine () Aspects of Modality. PhD thesis. MIT.
Heim, Irene () On the projection problem for presuppositions. In Barlow, Michael,
Flickinger, Daniel P., and Wescoat, Michael T. (eds.) Proceedings of the Second West Coast
Conference on Formal Linguistics. pp. –. Stanford, CA: Stanford Linguistics Association.
Holliday, Wesley and Icard, Thomas () Measure semantics and qualitative semantics for
epistemic modals. In Snider, T.(ed.) Proceedings of Semantics and Linguistic Theory .
University of California, Santa Cruz, CA, – May. pp. –. CLC Publications.
Horn, Laurence () A Natural History of Negation. Chicago, IL: Chicago University Press.
Horty, John () Reasons as Defaults. New York, NY: Oxford University Press.
Kolodny, Niko and MacFarlane, John () Ifs and oughts. Journal of Philosophy. CVII. .
pp. –.
Kratzer, Angelika () What ‘must’ and ‘can’ must and can mean. Linguistics and Philosophy.
. pp. –.
Kratzer, Angelika () The notional category of modality. In Eikmeyer, Hans-Jürgen and
Rieser, Hannes (eds.) Words, Worlds, and Contexts, New Approaches to Word Semantics.
pp. –. Berlin: de Gruyter.
Kratzer, Angelika () Modality. In von Stechow, Arnim and Wunderlich, Dieter (eds.)
Semantics: An International Handbook of Contemporary Research. pp. –. Berlin: de
Gruyter.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Kratzer, Angelika () Modals and Conditionals: New and Revised Perspectives. Oxford: Oxford
University Press.
Lassiter, Daniel () Measurement and Modality: The Scalar Basis of Modal Semantics. Ph.D.
thesis. New York University.
Portner, Paul () Modality. New York, NY: Oxford University Press.
Rubinstein, Aynat () Roots of Modality. Ph.D. thesis. University of Massachusetts, Amherst.
Silk () Evidence sensitivity in weak necessity deontic modals. Journal of Philosophical Logic.
. . pp. –.
Stalnaker, Robert () Indicative conditionals. Philosophia. . . pp. –.
Stephenson, Tamina C. () Judge dependence, epistemic modals, and predicates of personal
taste. Linguistics and Philosophy. . . pp. –.
Swanson, Eric () Modality in language. Philosophy Compass. . . pp. –.
Thomson, Judith Jarvis () Normativity. Peru, Illinois: Open Court Publishing Company.
Veltman, Frank () Defaults in update semantics. Journal of Philosophical Logic. . .
pp. –.
Yalcin, Seth () Epistemic modals. Mind. . . pp. –.
Yalcin, Seth () Probability operators. Philosophy Compass. . . pp. –.
Yalcin, Seth (a) Bayesian expressivism. Proceedings of the Aristotelian Society, CXII. .
pp. –.
Yalcin, Seth (b) Context probabilism. In Aloni, M, Kimmelman, V., Roelofsen, F., Weidman
Sassoon, G., Schulz, K., and Westera, M. (eds.) Logic, Language, and Meaning: Proceedings of
the th Amsterdam Colloquium. University of Amsterdam, – Ddecember . pp. –.
Berlin: Springer.
Yalcin, Seth (c) A counterexample to Modus Tollens. Journal of Philosophical Logic. . .
pp. –.
Yalcin, Seth () Epistemic modality de re. Ergo. . . pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Extreme and Non-extreme
Deontic Modals
Paul Portner and Aynat Rubinstein
Introduction
When we argue that our priorities militate in favor of an action, we can do so in a way
that allows for competing alternatives, or in a way which does not. Suppose that A and
B agree that no policy can both provide insurance for uninsured citizens and allow
people to make their own choices:
() A: It is important that our uninsured citizens get insurance.
B: It’s also important that people make their own choices.
A: So how do we balance these things?
() A: It is crucial that our uninsured citizens get insurance.
B: And it’s crucial that we allow people to make their own choices.
A: So we’re stuck.
() A: Our uninsured citizens should get insurance.
B: People should also make their own choices.
A: So how do we balance these things?
() A: Our uninsured citizens must get insurance.
B: And yet people must make their own choices.
A: What can we do?!
We refer to operators such as important, crucial, should and must as deontic necessity
modals.1 Our goal in this chapter is to contribute to the understanding of the
1 This use of the term ‘necessity’ is common in linguistics but is more inclusive than is usual in
philosophy. It reflects a theoretical assumption that the operators with weaker force (should, important)
have the same quantificational core as clearer examples of necessity modals such as must. In a similar way,
linguists often assume that attitude verbs such as believe and want and upper-range probability operators
such as likely and probable have a necessity semantics (Hintikka, ; Kratzer, , ).
The modals in question are not strictly limited to deontic interpretations, but rather have the full range
of priority readings (deontic, teleological, and buletic; see Portner ). These subtypes are similar by
linguistic criteria, and the use of a particular modal in context is often indeterminate as to its subtype.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
differences between the two types of deontic necessity statements that they produce.
What is it about must and crucial that leads to an impasse, while should and important
do not?
This chapter will focus on the semantic and pragmatic analysis of the modal
elements in ()–(). We situate our work with respect to two broader issues in the
semantics of modality. The first concerns the nature of differences in strength among
necessity modals. In (), we see strong necessity modals paired with their weak
necessity counterparts:
() (a) You must call Barbara. (entails You should . . .)
(b) It is crucial to talk to Barbara. (entails It is important . . .)
(c) It is certain that Barbara will win. (entails It is likely . . .)
• Question : Why are there two strength levels of modal necessity, and how do
they differ?
The recent literature on this topic is diverse, and includes important work by, among
others, von Fintel and Iatridou (); Finlay (, ); Kolodny and MacFarlane
(); Rubinstein ().
The second broader issue is based on the observation that many modal operators
have scalar semantics, and are gradable in the same way as concrete, non-modal adjec-
tives are. The gradability properties of a number of modal operators are exemplified
in (). We see that should, important, and likely are gradable, similarly to a concrete
adjective such as big.
() a. You should call Barbara more than (you should call) Alice.2 (cf. bigger)
b. It is very important to talk to Barbara. (cf. very big)
c. It is just as likely that Barbara will win as it is that Alice will. (cf. as big)
• Question : How can the gradability properties of modal operators be composi-
tionally modeled within a general framework of gradability?
The issue of the gradability of modal expressions has received attention only fairly
recently, by Portner (); Yalcin (, ); Lassiter (, ); Klecha (,
); and Katz et al. (), among others.
In this chapter, we contribute a novel linguistic characterization of the difference
between weak and strong deontic necessity modals. Among non-modal adjectives,
the literature has identified a class known as the extreme adjectives; many relative
adjectives have extreme counterparts, for example big has the extreme counterpart
huge and smart has brilliant (Bolinger, ; Paradis, ; Rett, ; Morzycki,
2 Note that (a) has an irrelevant reading where more compares frequency. We are focusing on the
reading where it compares degrees of importance or priority. One naturally occurring example which
arguably has this reading is the following: A fillet more than a whole fish should have that beautiful
fresh smell (http://www.smh.com.au/lifestyle/homestyle/tried-and-tasted/how-to-cook-the-perfect-piece-
of-fish--mkn). Accessed from GloWbE. All corpus data cited in this chapter were produced by
the BYU corpus project (Davies, , , ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
). We will show that must, crucial, certain are also extreme elements, contrasting
with non-extreme should, important, likely. Since pairs of extreme vs. non-extreme
adjectives are so common, it is to be expected that gradable modal operators would
come in extreme and non-extreme versions as well. The existing literature on extreme
adjectives then points us to a formal analysis of their gradability properties.3
The analysis here will focus on deontic and other priority modals, giving less
attention to epistemic examples like (c). Although we discuss epistemic modals in
reviewing the relevant previous literature, our formal analysis will only apply to the
priority types. We limit our attention in this way not only because deontic modality is
the focus of the present volume, but also because priority modals show a logical profile
distinct from epistemic and ability/opportunity modals.4 , 5 We also limit our attention
to unembedded occurrences of modals. In other work, we have begun to discuss how
the same pragmatic distinction can be applied to cases in which the modal is embeded
in an attitude context (Portner and Rubinstein, ).
The Chapter is organized as follows: section reviews previous work on modal
semantics, focusing in particular on the motivation for developing a theory with links
to the scale-based semantics of adjectives, and then pushes this perspective further by
incorporating the class of extreme expressions. We begin in . by reviewing the stan-
dard premise semantics classification and theory of modality within formal semantics
due to Kratzer. We then provide in section . some background on semantic classes of
adjectives, discussing how different types of gradable adjectives can be characterized in
terms of a scale-based semantics. Based on this background, we review and assess work
which looks at modals from a similar perspective, such as Lassiter () and Klecha
(). A prominent idea in this literature is that many modal operators require a scale-
based semantics, and that subclasses of modal operators should be identified with
subclasses of gradable adjectives. In section . we use a variety of tests to establish a
parallel between strong and weak necessity modals and the extreme and non-extreme
classes of adjectives exemplified by huge vs. big. We show that all of the strong necessity
modals are in fact extreme words and point out that this perspective reconciles some
of the theoretical tension in previous work on gradable modals.
In section , we develop a semantic and pragmatic analysis of extreme and non-
extreme deontic and other priority modals. In this section, we propose that the scales
of these modals are constructed from the same types of modal parameters which
are familiar from the standard Kratzerian premise semantics for modality, that is
3 Lassiter () notes the similarity between certain strong necessity modals (in particular, deontics) and
what he calls ‘high degree adjectives’ such as huge. He does not, however, note their unique grammatical
properties or connect them to the literature on extreme adjectives.
4 For example, while priority modals show the Union Property pattern, epistemic ones do not (Halpern,
; Yalcin, ; Lassiter, ): φ ≥ ψ, φ ≥ χ entail φ ≥ (ψ ∨ χ ). The fact that epistemics don’t show
the Union Property is an important puzzle for the standard Kratzerian (, ) analysis.
5 Dynamic modals do not show the same possibility/necessity distinction as other types. This can be
seen in the fact that can is stronger than a modal logic existential (can(φ) does not entail can(φ ∧ ψ) ∨
can(φ ∧ ¬ψ), compare Kenny, , Portner, , section .) and the fact that it has no obvious necessity
counterpart.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
objects according to how they measure in length. There are competing views about
how to formally implement gradability, some of which assume degrees as part of the
semantic ontology, and others which do not. In this paper we adopt a degree-based
approach to gradability, following Kennedy and McNally (); Kennedy () and
others. We do so for concreteness, and under the assumption that this choice does
not affect our main claims about the distinction between extreme and non-extreme
predicates, and modals in particular. As an entry point to the extensive literature on
adjective meaning and gradability, including the debate about the semantic status of
scales and degrees and the cognitive basis of these notions, we refer the reader to recent
overview articles on these topics (Beck, ; Demonte, ).
Some of the most common reflexes of gradability crosslinguistically, exemplified in
(), are the ability of a predicate to occur in comparative constructions, to be modified
by degree modifiers such as almost, very or slightly, and to appear in degree how-
questions.
() a. The green vase is prettier than the brown vase.
b. The glass is almost full.
c. How long was the Beagle?
Languages may also allow modification of certain gradable predicates by measure
phrases. Long in English allows this type of modification, whether in its comparative
form (a), or in its morphologically unmarked form (b), known as the positive form.
() a. The Mayflower was (about) meters longer than the Beagle.
b. The Beagle was meters long.
Gradability has been most closely studied with respect to adjectives, and indeed
most of the modal expressions we focus on in this paper belong to this category.
It is well known, however, that other grammatical categories admit of gradability
as well (Bolinger, ; Doetjes, ; Kennedy and McNally, ; Kennedy and
Levin, ; Sassoon, , ). The verbal elements should and ought to attest
the cross-categorial nature of gradability in the modal domain (see (a) above and
data below).
It is noteworthy that not all adjectives pass the tests for gradability, or at least that
some pass them more easily than others (Sapir, ; Burnett, ). Good candidates
for non-gradability are adjectives such as true and pregnant and privative adjectives
such as fake:
() a. ?Your sister is more pregnant than this woman.
b. ?It is more true that crows are black than it is that apples are red.
While these examples are not ungrammatical, they require contextual support to
be interpreted. As is often assumed, we take it that coercion (a semantically or
pragmatically induced shift in meaning) is involved (see, for example, Kennedy ;
Burnett , ), although in this domain we do not find explicit theories of the
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
coercion process on a par with those discussed in the aspectual domain (Moens and
Steedman, ; de Swart, ).
The literature is somewhat ambivalent about the gradability status of one particular
class of adjective, particularly important here, the extreme adjectives.
() a. ?Godzilla is more gigantic than Mothra. (Morzycki, , p. )
b. ?The Tesla is more excellent than the Volt. (Compare Paradis, )
Both Morzycki and Paradis rate extreme adjectives as questionable in the comparative,
while Rett () assumes that they are grammatical. (As we’ll see below, Morzycki
discusses the fact that they are more acceptable in some contexts than others.)
Among the modal predicates we focus on, we observe that the strong necessity
modals (must, crucial, certain) are somewhat degraded in the comparative, unlike their
weak counterparts:
() a. You ought/?have to call Zoe more than (you ought/?have to call) Barbara.
You should/?must call Zoe more than (you should/?must call) Barbara.6
b. It is more important/?crucial to talk to Zoe than it is to meet her face to
face.
c. It is more likely/?certain that the sun will set tonight than it is that it will
rise tomorrow.
Quantitative data bears out our feeling that strong necessity modals are less natural
than weak ones in the comparative: important and likely occur in the comparative
much more frequently than crucial and certain, and this pattern is comparable to that
observed with non-modal extreme/non-extreme pairs.7
6 We note that some speakers find the examples with modal auxiliaries worse on the whole than those
with semi-modals. The relative pattern of acceptability judgments reported in the text is stable across these
types, however.
7 The percent of occurrence in the comparative and mutual information scores between target items
and the comparative in the British National Corpus (Davies, ) are as follows: important=.%
(PMI=.), crucial=.% (.), likely=.% (.), certain=.% (.). As examples of non-modal
adjectives: big=.% (.), gigantic=.% (.), pretty=.% (.), gorgeous=.% (.).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
() a. μlength = [λx : x has a length . the degree d such that x measures d in
length]
b. [[ long ]] = [λxλd . μlength (x) = d] (informally = λxλd . x is d-long)
Comparative morphemes and degree modifiers place restrictions on the degree
returned by the measure function, typically by relating it to another degree (Kennedy,
, p. ). These morphemes are essentially the glue that derives appropriate truth
conditions for sentences in which gradable predicates occur. In (), assuming a
simple-minded syntax, we see how the denotation for a phrasal comparative mor-
pheme -er/more works:8
() a. [[ -er/more ]]= λGλyλx.∃d∃d [d > d ∧ G(x)(d) ∧ G(y)(d )]
b. The Mayflower was longer than the Beagle.
c. [[ (b) ]] = [[ [Mayflower [[-er long] than Beagle]] ]] = True iff
∃d∃d [d > d ∧ μlength (Mayflower) = d ∧ μlength (Beagle) = d ]
In order to account for gradable adjectives in their simplest, positive form, the
grammar needs to provide semantic glue, parallel to -er/more . . . than, which can relate
the measured degree to another degree not explicitly mentioned, the standard. For
some adjectives (namely, relative adjectives), the standard is contextually determined,
while for others (absolute adjectives), the standard is lexically determined by the
properties of the scale (Unger, ; Rotstein and Winter, ; Kennedy and McNally,
). A null morpheme pos (for ‘positive’) introduces the standard and expresses that
the measured degree is at least as high (Cresswell, ; Bartsch and Vennemann, ;
Bierwisch, ; von Stechow, ; Kennedy, ).
() [[ pos ]]= λGλx.∃d[d ≥ std(G) ∧ G(x)(d)]
Example (a) could be judged true in the context of a conversation about old
explorer ships, but it would likely be judged false in a discussion about the evolution
of non-military ships in the past two centuries. Thus, long has a contextual standard
in this case; std([[ long ]] ) is given by context. In contrast, (b) has a lexical standard,
and is true if the ship was completely full; hence std([[ full ]] ) is the maximum degree
of fullness.9
() a. The Beagle was long.
b. The Beagle was full.
8 The phrasal analysis of the comparative is coupled with a straightforward semantic interpretation
(Heim, ), but it is unclear whether it is to be preferred over a more elaborate clausal analysis involving
ellipsis (Lechner, ). See Bhatt and Takahashi () and Beck () for an overview of the debate.
The particular formalizations given in (b), (a), and () are not as simple as they could be, but are
given in these ways to make them compatible with the presentation of Morzycki’s ideas below.
9 Normal issues of granularity mean that, in different contexts, there can be variation in what counts as
reaching the maximum or minimum on the scale. For example, a glass may count as full even if it could
hold slightly more liquid. These uses are known as ‘loose’ or ‘imprecise’ (Kennedy and McNally, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
10 See Rotstein and Winter (); Kennedy and McNally () for additional tests.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
certain a maximum standard. Klecha points out, however, that this analysis conflicts
with the principles relating an adjective’s standard to its scale structure. In particular,
Lassiter’s proposal for likely runs counter to the argument that adjectives with closed
scales do not take relative standards. Klecha () analyzes gradable modality in a way
which borrows from both of the above approaches. Focusing on epistemics, he denies
that possible is a scalar adjective (see also Herburger and Rubinstein, ), and argues
that likely and certain are interpreted on different scales, despite the entailment relation
between them. Because Klecha does not analyse the adjectives based on a single scale,
the entailment from certain to likely is not automatic, and must be stipulated.
The debate between Lassiter and Klecha leaves us somewhat uncertain about the
scales and standards for epistemic adjectives, as neither approach seems to give a
full explanation of their modification patterns. In the next section, we will argue that
another parameter of classification can help to elucidate the properties of gradable
modals.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
that they only occur with a subset of modifiers associated with gradability. Specifically,
they are used with what she calls ‘totality modifiers’ (absolutely, totally, utterly), but not
with approximators (almost). According to Paradis, some but not all people accept
them with the comparative and superlative.
() a. The salsa is absolutely terrible.
b. *The salsa is almost terrible.
c. ?The salsa is more terrible than the guacamole.
To explain this pattern Paradis says that extreme adjectives ‘represent the ultimate
point of a scale,’ but can be coerced into meanings which apply to elements not at the
maximum point of the relevant scale.
Paradis’ intuitions about the semantics of extreme adjectives are appealing, but it
does not seem correct to say simply that they describe their arguments as being at the
endpoint of their scale. For example, (a) does not mean that the salsa is of the highest
possible (or even the highest actually realized) degree of badness. There certainly are
more terrible things, and quite possibly more terrible condiments on the table. Later
work on extreme adjectives has therefore pursued the intuition that extreme adjectives
utilize subscales related to the more normal scales of their non-extreme counterparts.
Rett () assumes that extreme adjectives utilize totally closed scales which are
subscales of their non-extreme counterparts. Terrible would utilize a scale which
begins at some threshold t and reaches to the top of the scale. The idea that extreme
adjectives’ scales are totally closed is motivated, it seems, by two factors: first, that
there must be a lower bound defined by t, and secondly that there must be an upper
bound to explain the appearance of maximality modifiers such as totally. However,
against this vision is the fact that modifiers which normally go with lower bounded
scales, like barely, are not acceptable (*barely terrible). Moreover, as we have seen,
maximality operators are not interpreted as expected if they diagnose closed scales for
extreme adjectives. As we noted above, (a) does not mean that the salsa is maximally
terrible, as it would if absolutely placed the adjective’s argument at the top of the scale.
Rather, it means something like that it is without question in the range of terrible
condiments.11
Given these points about barely and absolutely, it seems that we do not have clear
evidence concerning the scale type for extreme adjectives. Morzycki () presents
an analysis of extreme adjectives which, though he does not emphasize the fact, allows
for extreme adjectives to be either bounded or unbounded at both top and bottom. His
approach is that scalar adjectives are interpreted with respect to a contextually salient
set of degrees DC on their scales; non-extreme adjectives relate to values within DC ,
11 Not all maximality operators have precisely the same range of meanings. As noted by Stephen Finlay
(p.c.), Janie is totally gorgeous has a reading ‘Janie is gorgeous in every respect.’ This shows that totally
gorgeous can grade an object on a scale other than the scale of gorgeous. Many adjectives are like this, and
we note that totally pretty has a similar reading.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
while extreme adjectives relate to values above all of the values in DC .12 The standards
follow suit: the standard for a non-extreme adjective is in DC , while that for an extreme
adjective is above DC . In this analysis, the extreme portion of the scale will be bounded
at the top if S is. This fact may allow us to understand what’s going on with (a).
One of Morzycki’s major contributions was a discussion of a range of properties
which differentiate extreme from non-extreme adjectives and which also can serve as
diagnostics for whether a term is an extreme expression.13 We will look at five such
properties in detail:
. Extreme adjectives occur with extreme degree expressions, or extreme modifiers,
such as flat-out, positively and downright, while non-extreme adjectives (even
those with maximum standards) are less natural.14
() a. ??flat-out good
b. flat-out excellent
c. ??flat-out straight
Though it’s not common to find a maximum standard adjective with an extreme
counterpart, in such cases, only the extreme adjective is natural with these
modifiers:
() a. ??downright full
b. downright brimming
. The near inverse of the previous property is that extreme adjectives strongly resist
modification with very.15 They contrast in this respect with relative adjectives
such as good and absolute ones such as straight.
() a. very good
b. ??very excellent
c. very straight
() a. very full
b. ??very brimming
12 ‘Salience’ could be interpreted in different ways, and Morzycki glosses it in this context as ‘degrees
. . . that we regard, for the purposes of the discussion, as reasonable candidates for values we might want to
consider’ (Morzycki, , pp. –). The issue of how to properly characterize the pragmatics of ‘salient’
vs ‘non-salient’ degrees is difficult and closely connected to the analysis of exclamatives (see for example
Zanuttini and Portner, ; Miró, ; Rett, ).
13 Morzycki makes a distinction between lexical extreme adjectives and contextual extreme adjectives.
The lexical extreme adjectives behave as extreme in any context in which they are used; we are focusing on
this class. Contextual extreme adjectives can behave as extreme with contextual support.
14 Morzycki points out that Cruse () can be seen as having first identified this property. Note that
other items with intuitively similar meanings might have different properties. Words such as categorically
and definitely are not extreme modifiers.
15 Similar adverbs such as really and so show different properties and do not serve as a test for the
extreme/non-extreme contrast.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Note that Kennedy and McNally (, p. ) argue that coercion is involved
when very occurs with a minimum or maximum standard adjective; the key point
here, though, is that these nevertheless differ from extreme adjectives.
. Extreme adjectives are often degraded in the comparative (as in (c), repeated
below as (a)), but they do occur and improve with even (b).
() a. ?The salsa is more terrible than the guacamole.
b. The salsa is even more terrible than the guacamole.
. Though they resist the comparative, extreme adjectives fully accept the equative
construction.
() The salsa is as terrible as the guacamole.
. Extreme adjectives are unacceptable when placed in comparatives with their non-
extreme counterparts.
() a. *The salsa is more terrible than the guacamole is bad.
b. *The salsa is worse than the guacamole is terrible.
() a. *The bucket is more brimming than the cup is full.
b. *The cup is fuller than the bucket is brimming.
As Morzycki notes, the data in ()–() contrast with other examples where two
adjectives on related scales are compared:
() The field is longer than it is wide.
Morzycki () lists several other properties which distinguish extreme and non-
extreme adjectives, but does not provide a formal analysis of them. Briefly, these
properties are: (i) Extreme adjectives can readily be intensified by prosodic means
(Cruse, ); (ii) in discourse, an extreme adjective is perceived as disagreeing (not
agreeing) with an understatement; and (iii) extreme adjectives are readily used in
hyperbole. We will not discuss these properties further.
Recall that within Morzycki’s analysis, non-extreme adjectives associate their argu-
ment with a contextually salient degree, while extreme adjectives associate their
argument with a degree above the contextually salient portion of the scale. He assigns
big and gigantic the following meanings, where C by default refers to the contextually
salient set of degrees (= DC above):
() a. [[ bigC ]] = λxλd . d ∈ C ∧ x is d-big
b. [[ giganticC ]] = λxλd . d > max(C) ∧ x is d-big
Morzycki states that this difference in meaning goes along with a difference in
contextual standard: the standard for big must be within C while that for gigantic must
be above C. This proposal allows Morzycki to provide explanations for the patterns
involving extreme and non-extreme adjectives observed above.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
. Extreme modifiers make reference to a set of degrees C+ which extends the salient
set C into higher degrees, and they presuppose that the contextual standard is
within the extreme portion of this extended scale (i.e., C+ − C).16
() [[ downrightC ]] = λaλx : std(a) ∈ (C+ − C) . ∃d[d ≥ std(a) ∧ aC+ (x)(d)]
Because the contextual standard for an extreme adjective is above C, it can satisfy
the presupposition of the extreme modifier, but because the standard for a non-
extreme adjective is within C, it cannot. For a non-extreme adjective to satisfy
the presupposition of an extreme modifier, it must be treated as extreme in the
context. In other words, it must be given a standard above C.
. Morzycki (, p. ) analyzes very+ADJ as asserting that the individual it takes
as argument is at the top of the contextually salient set of degrees of the scale of
ADJ:
() [[ very C ]] = λaλx . ∃d[d ≥ std(a) ∧ small(max(C) − d) ∧ aC (x)(d)]
The infelicity of ??very gigantic is thus explained on this analysis through the
contradictory requirements it places on the measure of the object described;
specifically, it asserts that the argument is big to a degree which is both below
max(C) (via very) and above max(C) (via gigantic).
. Morzycki’s explanation of why extreme adjectives often resist the comparative is
complex, but the basic idea is that it is not natural to compare degrees which are
not in the salient range. Ideally, we would have a theory of salience of degrees
to provide a more specific reason why this would be so,17 but intuitively it
makes sense that if the salsa is terrible, it is so overwhelmingly bad that it might
be difficult or pointless to decide whether it is better or worse than the (also
terrible) guacamole. After all, if it’s terrible, you know all you need to know:
you’re not going to eat it. But this difficulty or pointlessness can be overcome
pragmatically, as in the context of a Mexican cooking class where an instructor’s
detailed feedback on the student’s failed salsa and guacamole would be relevant.
Moreover, it seems that even (specifically the presupposition it triggers) supports
the kind of context in which extreme degrees can be compared; see Morzycki
(, §.) for further discussion.
. In contrast to comparatives, Morzycki argues that the semantics for equatives
does not involve comparison. For example, () states that there is a degree of
badness beyond the salient set C which is the degree of the salsa, and which is the
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
degree of the guacamole. Seen this way, the equative only asserts the existence
of a single degree, and this is why it does not show the pragmatic incongruity
sometimes seen with comparatives.
Morzycki’s explanation for the comparative and equative facts may not be
entirely convincing, for two reasons: it’s not clear why comparing extreme degrees
in the comparative should be less felicitous than referring to them in the equative,
and the formulation of the equative semantics without comparison is non-
standard, and so might be seen as ad hoc. Our intuitions about the comparatives
and equatives are slightly different from his. It seems to us that, in a simple context,
any distinctions above C along the relevant dimension are collapsed, and so there
is only a single extreme degree. In such contexts, a sentence comparing two
extreme degrees would be logically false, unless they are equated. Context could,
however, articulate the portion of the scale above C, making multiple extreme
degrees available, and in such a context, a comparative could be informative.
Further work will be needed to determine the best analysis of the behavior of
extreme adjectives in comparatives and equatives.
. In Morzycki’s analysis, comparisons involving an extreme and a non-extreme
adjective, like (), have trivial truth conditions. (a) states that salsa’s degree
of badness is beyond C, and moreover beyond the guacamole’s degree of badness,
which is within C. This must be the case. (b) states that the salsa’s degree of
badness, which is within C, is above the guacamole’s degree of badness, which is
beyond C. This cannot be the case.
18 In the corpora we have examined, extreme modifiers are extremely rare with the modal elements
on which we have focused our discussion. In the BNC, there is one occurrence of positively must and
no other combinations of interest. In COCA (Davies, ), the raw counts and percentages of flat-
out/positively/downright with the modal elements are: must= (.%), should= (.%), certain= (.%),
likely, crucial, and important=. Overall, the quantitative trends conform to our hypothesis, but the numbers
are too small to count as strong evidence by themselves.
19 Corpus patterns in the BNC support this difference between adjective classes. Percentage occurrence
with very and association measures are as follows: crucial=.% (PMI=.), important=.% (.),
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
certain=.% (-.), likely=.% (.); non-modal adjectives for comparison: gorgeous=.% (.),
pretty=.% (.).
We used very much in place of very alone with the auxiliaries (a). The intuition data in this case confirm
our predictions, but the combinations are very rare. Neither occurred in the BNC, while the raw counts
in COCA were very much must= and very much should= (<.%). The single example is interesting,
though, and fits our analysis: You should know—actually, for complete, suspenseful enjoyment of the film, you
very much should not know, but the word is out, so we’re obliged to tell you—that Heavenly Creatures is based
on a notorious murder case. (‘A Heavenly Trip Toward Hell’, Richard Corliss Time, Nov , ).
20 Corpus data related to the modals in () was provided in footnote above.
21 This sentence is somewhat degraded in comparison to the examples with crucial, certain, but
considerably better than the corresponding (a) without even.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
() a. *Susan must call her mother more than she should call her father.
*Susan should call her mother more than she must call her father.
b. *It is more crucial for Mary to call her mother than it is important for
her to call her father.
*It is more important for Mary to call her mother than it is crucial for
her to call her father.
c. *It is more certain that Mary will call her mother than it is likely that she
will call her father.
*It is more likely that Mary will call her mother than it is certain that she
will call her father.
We have shown that the strong necessity modals pattern with extreme adjectives, and
conclude that they should be included within this class. The weak necessity modals, in
contrast, should be treated as non-extreme expressions.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
your name has an extreme degree of necessity, while coloring your picture has a non-
extreme degree.
In the next section, we work towards a formal analysis of necessity modals which
incorporates the intuition that the scale of necessity is divided into a salient, or non-
extreme, part and a non-salient, or extreme, part. This approach not only provides
an explanation for the empirical differences between strong and weak modals enu-
merated in this section. It also promises a pragmatic grounding for the difference. On
our approach, must and should, and crucial and important, are all necessity modals;
they differ in that the strong ones paint an alternative as necessary and reject further
discussion, while the weak ones portray it as being necessary but still up for discussion.
In terms of issues of scales and standards, our approach can be seen as pursuing
a synthesis of the earlier ideas of Lassiter and Klecha.22 In considering how to rep-
resent the relationships between semantically related modal adjectives, these authors
represent two reasonable strategies. Lassiter follows the strategy of putting the pair
likely and certain on the same scale, and thus makes the differences between them
fairly minimal (literally, a matter of degree). However, as we have seen, the differences
between extreme and non-extreme adjectives amount to more than that. Klecha
follows a different strategy, where likely and certain are associated with different scales,
stipulating a relation between those scales. However, because the relation between the
scales is not motivated by any broader picture of adjective semantics, this leaves an
explanatory gap. Our approach puts the extreme and non-extreme pairs on subscales
of the same basic scale. Hence, as in Lassiter’s analysis, the members of the pairs differ
in degree. However, they are on pragmatically different parts of this scale, and so there
is a sense in which Klecha is right as well in saying they use different scales. Crucially,
the difference between the extreme and non-extreme parts of modal scales is based on
a motivated pragmatic distinction and is parallel to what is observed with non-modal
adjectives.
Analysis
The intuition behind theories of extreme adjectives is that they characterize an
argument as going beyond the normal or expected range of its scale. As we have
discussed, this intuition is apt for extreme modals; for example, just as downright huge
things are big to a degree beyond the expected, the positively crucial actions are so
important that we do not expect to discuss them. We aim to capture this intuition by
defining a general scale-type labeled the scale of necessity. This scale will be used as the
basis for the analysis of both weak and strong necessity modals. It may be compared
to a general scale of size, which would provide the core of the analysis of both big
and huge.
22 It should be kept in mind, though, that their debate focused on epistemic modals, and so we must
extrapolate a bit to attribute them contrasting positions concerning deontics.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Our formulation builds on existing work on strong and weak necessity within a
premise semantics framework. Specifically, von Fintel and Iatridou () proposed
an analysis of must and should that involves at least two ordering sources, rather
than the single one in the standard version of Kratzer’s framework. With a strong
necessity modal, only the primary ordering source is relevant, and it determines the
modal’s domain of quantification in the way familiar from Kratzer’s work. With a weak
necessity modal, the secondary ordering source operates on the set of worlds which
are relevant according to the modal base and primary ordering source, to further
restrict the domain of quantification. Specifically, the semantics first identifies the set
of worlds compatible with the modal base m(w) which are best-ranked according
to the primary ordering source, o (w), and then picks out the best-ranked worlds
among those according to the secondary ordering source o (w), as in (). We assume,
following von Fintel and Iatridou (), that there is always a set of ‘best’ worlds
among those ordered by an ordering source.23
() a. u ≤O v iff {p ∈ O : v ∈ O} ⊆ {p ∈ O : u ∈ O}
b. Best(M, O) = {v ∈ M : ¬∃u[u <O v]}
() Best(M, O , O ) = Best(Best(M, O ), O )
In this way, they account for the fact that strong necessity modals logically entail their
weak counterparts, and provide some degree of intuition about what the difference
between them amounts to.
Although their ideas advance our understanding, von Fintel and Iatridou do not
provide much explanation of the crucial difference between primary and secondary
ordering sources (see their footnote ). Building on their work, Rubinstein ()
proposes that this difference is to be understood in terms of pragmatics, where the
difference between the two ordering sources tracks the distinction between what is
treated as non-negotiable versus what is treated as negotiable. The primary ordering
source, in these terms, contains priorities that the speaker presents as ones she is
not willing to give up, while the secondary ordering source contains priorities that
might give way in the face of other factors.24 We can illustrate Rubinstein’s analysis by
considering the examples at the beginning of the paper:
() a. Our uninsured citizens should get insurance.
b. Our uninsured citizens must get insurance.
Sentence (a) is appropriate in a context in which the speaker is contributing to a
negotiation about health-care reform, while (b) is appropriate when the speaker
23 See Katz et al. () for an alternative way of combining multiple ordering sources. The definitions
cite the premise sets given by the modal base and ordering source(s) at the world of evaluation, e.g., M =
m(w).
24 Stephen Finlay (p.c.) makes the interesting suggestion that the priorities in o might be secondary
not because they are seen as unsettled, but rather because it is at issue whether they are achievable. This
suggestion might allow us to make a connection between this analysis and the information sensitivity of
deontic modals (Kolodny and MacFarlane, ; Charlow, ; Cariani et al., ; Cariani, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
wants to rule out further debate. Combining these ideas with the semantics from von
Fintel and Iatridou (), the prediction is that must(φ) says that φ follows from
non-negotiable requirements, while should(φ) says that φ follows taking into account
requirements about which the speaker allows negotiation.
Our analysis connects the semantic and pragmatic differences between primary and
secondary ordering sources to the conceptualization of scale structure developed in
work on gradable and extreme adjectives. Specifically, we propose that the extreme
portion of the scale (the non-salient portion) is defined only in terms of the priorities
in the primary ordering source. This makes sense of the intuition that in the extreme
portion of the scale, distinctions between degrees are not salient. They are not salient
because any such distinctions would require treating one non-negotiable requirement
as more important than another. The non-extreme (salient) portion of the scale,
in contrast, is defined by taking into account both the primary and the secondary
ordering sources, and given Rubinstein’s understanding of the secondary ordering
source, this captures the intuition that distinctions within this portion of the scale are
salient. They are salient because they are based on priorities which the speaker treats
as important, but not set in stone.
Formally speaking, the non-extreme part of the scale is derived by considering
subsets of the secondary ordering source o (w). In what follows f is a contextually
provided function from sets of premises o to sets of non-null subsets of o, where the
members of f (o) are totally ordered by ⊆:
() Sc,w is defined only if f (o (w)) is defined, for every world w. When defined,
Sc,w =
Dw , ≤: Dw = {{p : Best(m(w), o (w), o ) ⊆ p} : o ∈ f (o (w))}
≤ = {
d , d : d , d ∈ D ∧ d ⊇ d }
() S+
c,w is only defined if f (o (w)) and Sc,w are defined, for every world w. When
defined,
S+ + + +
c,w =
Dw , ≤ : Dw = Dw ∪ {{p : Best(m(w), o ) ⊆ p} : o ∈ f (o (w))}
+ +
≤ = {
d , d : d , d ∈ D ∧ d ⊇ d }
Notice that S+
c,w is defined using the two-place version of Best, since the secondary
ordering source is not relevant. The extreme part of the scale S+ + +
c,w is
(Dw − Dw ), ≤ ∩
+
(Dw − Dw ) .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
(a) defines a measure function μ which will allow us to give a scalar semantics
for necessity modals. The lexical entries in (b–c) capture the two levels of strength
observed with necessity modals: weak ones, specifically should and important, are
measures of propositions of non-extreme degree, whereas strong ones, namely must
and crucial, are restricted to higher values.
() a. Measure function: For any proposition p, world w, and necessity scale ,
μ (p, w) is the highest degree d in D+
w such that p ∈ d.
b. Weak necessity modal: [[ N ]] = [λpλdλw . d ∈ Dw ∧ d = μ (p, w)]
c
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Change the scenario so that eating ramen and seeing Casablanca are incompatible yet
equally strong goals. As a result, going to Adams Morgan (to have ramen) and going
to Silver Spring (to see Casablanca) are equally important, but we cannot do both.
In this scenario, f (o (w)) is the set of premise sets {{F}, {R, C, F}}, from which the
premise set {R, C, F} generates two disjoint maxima in the ordering, worlds where we
eat ramen (in Adams Morgan) and worlds where we see Casablanca (in Silver Spring).
(In both sets of worlds, we also go to Georgetown to see the friend.) Neither destination
is a necessity with respect to {R, C, F}, so neither has any degree of necessity. This is
incorrect.
We would like to be able to use both of the maxima defined by {R, C, F} to associate
propositions with the same degree of necessity. In other words, because eating ramen
entails going to Adams Morgan, and seeing Casablanca entails going to Silver Spring,
{R, C, F} should determine the same degree for going to Adams Morgan and going
to Silver Spring. One approach would be to have f select not just subsets of the
ordering source, but sets of consistent subsets of the ordering source, for example
f (o (w)) = {{{F}}, {{R, F}, {C, F}}}. Here, the second member encodes the fact that
{R, F} and {C, F} are equally important collections of goals. For each set of sets of
premises (e.g., {{R, F}, {C, F}}), we would then collect the necessities with respect to
each member ({R, F} and {C, F}) to define a degree of necessity. Going to Adams
Morgan and going to Silver Spring would thus have the same degree of necessity (and
this degree would be lower on the scale than the degree of going to Georgetown).
As an alternative to the above, we get a similar effect by allowing inconsistent
premise sets to be selected by f , and defining degrees by looking at all of the maxima in
the world-ordering defined by each premise set. In particular, the following definition
might work, where GP(A, B, C)(p) indicates that p is a ‘good possibility’ (in the sense
of Kratzer ) with respect to A, B, and C:
() S c,w is only defined if f (o (w)) is defined, for every world w. When defined,
Sc,w =
Dw , ≤: Dw = {{p : GP(m(w), o (w), o )(p)} : o ∈ f (o (w))}
≤ = {
d , d : d , d ∈ D ∧ d ⊇ d }
Assuming that f (o ) = {{F}, {R, C, F}} (encoding that R and C are equally important),
this definition assigns the highest degree of necessity to going to Georgetown, and
equal but lower degrees to going to Adams Morgan and Silver Spring. This definition
has the surprising property that the scale of necessity is defined not in terms of
standard necessity (‘entailed by the union of the maxima’), but rather in terms of the
weaker notion of good possibility (‘entailed by at least one maximum’).
The revised definition allows us to account for examples with incompatible goals
like the following.
() a. It is as important to preserve the wetlands as it is to build the new housing
(which would drain the wetlands).
b. We should preserve the wetlands just as much as we should build the new
housing.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
As Morzycki () pointed out, equatives are also natural with extreme expressions,
and there seems to be little problem in interpreting cases where they describe incom-
patible propositions as having the same degree:
() a. It is as crucial to preserve the wetlands as it is to build the new housing.
b. We must preserve the wetlands just as much as we must build the new
housing.
These sentences are clearly acceptable, and describe situations which are true dilem-
mas. They could be analysed using the same strategy as ().
Although extreme and non-extreme modals are both perfectly acceptable with
equatives, Cruse (); Paradis (); Bolinger (); Morzycki () note that
extreme expressions are less natural with comparatives:
() a. ?It is more crucial to preserve the wetlands than it is to build the new
shopping center.
b. ?We must preserve the wetlands more than we must build the new shop-
ping center.
As we’ve mentioned, we interpret this fact in a slightly different way than does
Morzycki. We propose that the extreme part of the scale has only a single degree
(though it can be coerced into multiple degrees). We can model this idea by assuming
that f (o (w)) as a default is a singleton; this stipulation makes sense given the
assumption that o (w) contains only non-negotiable propositions, and so in many
contexts there would be no useful way for f to distinguish among them. However,
under coercion we can make distinctions among these propositions, in which case f
can select multiple subsets of o (w), giving rise to multiple extreme degrees.
Our analysis handles positive forms as well. Both (a) and (b) are true in the
scenario envisioned.
() a. It is important to preserve the wetlands.
b. It is crucial to preserve the wetlands.
Our account predicts that (b) can be true even if a proposition inconsistent with
preserving the wetlands also has the extreme degree. This is clearly possible, for
example when it’s uttered after (). More typically, (b) would imply that we
preserve the wetlands in all accessible worlds, and so build the new housing in none.
This result follows if, in the normal case, o (w) is consistent. Note that when f (o (w)) is
a consistent singleton, our semantics is equivalent to the standard Kratzerian analysis
of must.
We have seen how our analysis accounts for some of the subtle differences between
strong and weak deontic modals in a way which parallels the treatment of other
extreme/non-extreme expressions. Specifically, we have mentioned the distinction
between very and extreme modifiers such as positively and discussed contrasts in the
acceptability and interpretation of equatives, comparatives, and the positive form. We
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
have not returned to the final three properties listed in section ., and doing so must
await another occasion.
Discussion
We have given strong evidence that strong and weak necessity modals with priority
readings should be classified as extreme and non-extreme expressions, respectively.
Moreover, we have argued that this observation, when combined with other ideas
from the scalar approach to modals, points to a better understanding of the semantic
and pragmatic differences between the two types of modal necessity. At an empirical
level, our analysis accounts for some of the subtle differences between strong and
weak deontic modals in a way that parallels the treatment of other extreme/non-
extreme expressions. Specifically, we have mentioned the distinction between very and
extreme modifiers such as positively and discussed contrasts in the acceptability and
interpretation of equatives, comparatives, and the positive form.
We have focused specifically on deontic and other priority readings of necessity
operators. In future work, we hope to extend these ideas to other judgment types, in
particular epistemics. Once an analysis of epistemics is in hand, it will be important to
compare the kind of approach we have advocated here with the probabilistic semantics
for gradable epistemic modals developed by Yalcin () and Lassiter (). We also
hope to extend our ideas to non-priority root modals such as can, able, and adjectives
formed with -able. These are not necessity operators, but are nonetheless gradable. The
distinct logical properties of these two groups of non-priority modals will allow us to
further refine the formal analysis of gradable modality.
Acknowledgments
We thank Graham Katz and Elena Herburger for discussions which have shaped our under-
standing of the topic and Jessica Rett, Nate Charlow, Matthew Chrisman, and Stephen Finlay
for comments on previous versions. Any errors are of course our responsibility, not theirs.
We gratefully acknowledge the support of National Science Foundation grant BCS- to
both authors, and support from the Mandel Scholion Interdisciplinary Research Center in the
Humanities and Jewish Studies to the second author.
References
Bartsch, R. and Vennemann, T. () Semantic structures: A study in the relation between syntax
and semantics. Frankfurt: Athenäum.
Beck, S. () Comparison constructions. In von Heusinger, K., Maienborn, C., and Portner, P.
(eds.) Semantics: An International Handbook of Natural Language Meaning. Volume .
pp. –. Berlin: de Gruyter.
Bhatt, R. and Takahashi, S. () Reduced and unreduced phrasal comparatives. Natural
Language and Linguistic Theory. . pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Bierwisch, M. () The semantics of gradation. In Bierwisch, M. and Lang, E. (eds.) Dimen-
sional Adjectives. pp. –. Berlin: Springer.
Bolinger, D. L. () Adjective comparison: a semantic scale. Journal of English Linguistics. ..
pp. –.
Bolinger, D. () Degree words. The Hague: Mouton.
Burnett, H. () The puzzle(s) of absolute adjectives: On vagueness, comparison and the
pragmatic origin of scale structure. In Paperno, D. (eds.) UCLA Working Papers in Linguistics:
Special Issue on Semantics and Mathematical Linguistics. Los Angeles: University of California.
Burnett, H. () A delineation solution to the puzzles of absolute adjectives. Linguistics and
Philosophy. .. pp. –.
Cariani, F. () Deontic modals and probabilities: One theory to rule them all? In Charlow, N.
and Chrisman, M. (eds.) Deontic Modality. Oxford: Oxford University Press.
Cariani, F., Kaufmann, M., and Kaufmann, S. () Deliberative modality under epistemic
uncertainty. Linguistics and Philosophy. . pp. –.
Charlow, N. () What we know and what to do. Synthese. . pp. –.
Cresswell, M. J. () The semantics of degree. In Partee, B. H. (ed.) Montague grammar.
pp. –. New York: Academic Press.
Cruse, D. A. () Lexical semantics. Cambridge: Cambridge University Press.
Davies, M. () BYU-BNC (based on the British National Corpus from Oxford University
Press). [Online] Available from: http://corpus.byu.edu/bnc/. [Accessed Nov. .]
Davies, M. () The Corpus of Contemporary American English: million words, –
present. [Online] Available from: http://corpus.byu.edu/coca/. [Accessed Nov. .]
Davies, M. () Corpus of Global Web-Based English: . billion words from speakers in
countries. [Online] Available from: http://corpus.byu.edu/bnc/. [Accessed Nov. .]
Demonte, V. () Adjectives. In von Heusinger, K., Maienborn, C., and Portner, P. (eds.)
Semantics: An International Handbook of Natural Language Meaning. Volume . pp. –.
Berlin: de Gruyter.
de Swart, H. () Aspect shift and coercion. Natural Language and Linguistic Theory. .
pp. –.
Doetjes, J. S. () Quantifiers and Selection: On the distribution of quantifying expressions in
French, Dutch and English. Leiden: Holland Academic Graphics.
Finlay, S. () Oughts and ends. Philosophical Studies. . pp. –.
Finlay, S. () What ought probably means, and why you can’t detach it. Synthese. .
pp. –.
Fintel, K. von () The best we can (expect to) get? Challenges to the classical semantics for
deontic modals. Manuscript. MIT Paper presented at the Central APA, February . MIT.
Fintel, K. von and Iatridou, S. () How to say ought in foreign: The composition of weak
necessity modals. In Guéron, J. and Lecarme, J. (eds.) Time and Modality. pp. –. Berlin:
Springer.
Goble, L. () Utilitarian deontic logic. Philosophical Studies. .. pp. –.
Halpern, J. () Reasoning about Uncertainty. Cambridge, Mass.: MIT Press.
Heim, I. () Notes on comparatives and related matters. Manuscript. MIT.
Herburger, E. and Rubinstein, A. () Is ‘more possible’ more possible in German? In Snider,
T., D’Antonio, S., and Weigand, M. (eds.) Proceedings of SALT . New York University,
May to June. pp. –.
Hintikka, J. () Modality and quantification. Theoria. . pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Katz, G., Portner, P., and Rubinstein, A. () Ordering combination for modal semantics. In
Chereches, A. (ed.) The Proceedings of SALT , University of Chicago, Chicago, IL, –
May. pp. –. CLC Publications.
Kennedy, C. () Projecting the Adjective: The Syntax and Semantics of Gradability and
Comparison. Ph.D. thesis. University of California Santa Cruz.
Kennedy, C. () Vagueness and grammar: the semantics of relative and absolute gradable
adjectives. Linguistics and Philosophy. . pp. –.
Kennedy, C. and Levin, B. () Measure of change: The adjectival core of degree achieve-
ments. In McNally, L. and Kennedy, C. (eds.) Adjectives and Adverbs: Syntax, Semantics and
Discourse. pp. –. Oxford: Oxford University Press.
Kennedy, C. and McNally, L. () Scale structure, degree modification, and the semantics of
gradable predicates. Language. .. pp. –.
Kenny, A. () Human abilities and dynamic modalities. In Manninen, J. and Tuomela, R.
(eds.) Essays on Explanation and Understanding: Studies in the Foundations of Humanities
and Social Sciences. pp. –. Reidel, Dordrecht.
Klecha, P. () Positive and conditional semantics for gradable modals. In Guevara, A. A.,
Chernilovskaya, A., and Nouwen, R. (eds.) Proceedings of Sinn and Bedeutung . Volume ,
Utrecht, – September . pp. –. MITWPL.
Klecha, P. () Bridging the Divide: Scalarity and Modality. Ph.D. thesis. University of Chicago.
Kolodny, N. and MacFarlane, J. () Ifs and oughts. The Journal of Philosophy. ..
pp. –.
Kratzer, A. () The notional category of modality. In Eikmeyer, H.-J. and Rieser, H. (eds.)
Words, Worlds, and Contexts. pp. –. Berlin: de Gruyter.
Kratzer, A. () Modality. In von Stechow, A. and Wunderlich, D. (eds.) Semantik/Semantics:
An International Handbook of Contemporary Research. pp. –. Berlin: de Gruyter.
Kratzer, A. () Modals and Conditionals. Oxford: Oxford University Press.
Lassiter, D. () Gradable epistemic modals, probability, and scale structure. In Li, N. and Lutz,
D. (eds.) Proceedings of SALT , University of British Columbia and Simon Waser University,
Vancouver, BC, April to May. pp. –. CLC Publications.
Lassiter, D. () Measurement and Modality: The Scalar Basis of Modal Semantics. Ph.D. thesis.
New York University.
Lechner, W. () Ellipsis in Comparatives. Berlin: Mouton de Gruyter.
Miró, E. C. () Wh-exclamatives in Catalan. Ph.D. thesis. University of Barcelona.
Moens, M. and Steedman, M. () Temporal ontology and temporal reference. Computational
Linguistics. . pp. –.
Morzycki, M. () Degree modification of extreme adjectives. In Bochnak, R., Nicola, N.,
Klecha, P., Urban, J., Lemieux, A., and Weaver, C. (eds.) Proceedings of the Chicago Linguistic
Society . – April .
Morzycki, M. () Adjectival extremeness: degree modification and contextually restricted
scales. Natural Language and Linguistic Theory. .. pp. –.
Paradis, C. () Adjectives and boundedness. Cognitive Linguistics. .. pp. –.
Portner, P. () Modality. Oxford: Oxford University Press.
Portner, P. and Rubinstein, A. () Mood and contextual commitment. In Chereches, A. (ed.)
The Proceedings of SALT . University of Chicago, Chicago, IL, – May. pp. –. CLC.
Publications.
Rett, J. () Degree Modification in Natural Language. Ph.D. thesis. Rutgers University.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Rotstein, C. and Winter, Y. () Total adjectives vs. partial adjectives: Scale structure and
higher-order modifiers. Natural Language Semantics. .. pp. –.
Rubinstein, A. () Roots of Modality. Ph.D. thesis. Amherst: University of Massachusetts.
Sapir, E. () Grading. a study in semantics. Philosophy of Science. .. pp. –.
Sassoon, G. W. () Vagueness, Gradability and Typicality: A Comprehensive Semantic Analy-
sis. Ph.D. thesis. Tel Aviv University.
Sassoon, G. W. () Vagueness, gradability and typicality: The interpretation of adjectives and
nouns. Leiden: Brill.
Stechow, A. von () Comparing semantic theories of comparison. Journal of Semantics. .
pp. –.
Swanson, E. () Interactions with context. Ph.D. thesis. MIT.
Unger, P. () Ignorance. Oxford: Clarendon Press.
Villalta, E. () Spanish subjunctive clauses require ordered alternatives. In Jackson, B. and
Matthews, T. (eds.) The Proceedings of SALT , Cornell University, – June. pp. –. CLC
Publications.
Villalta, E. () Context dependence in the interpretation of questions and subjunctives. Ph.D.
thesis. Tübingen.
Villalta, E. () Mood and gradability: an investigation of the subjunctive mood in Spanish.
Linguistics and Philosophy. .. pp. –.
Yalcin, S. () Epistemic modals. Mind. .. pp. –.
Yalcin, S. () Probability operators. Philosophy Compass. .. pp. –.
Zanuttini, R. and Portner, P. () Exclamative clauses: At the syntax-semantics interface.
Language. .. pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Rationalization and the Ross
Paradox
Benj Hellie
Introduction
I must finish this chapter—and here I am, working away. To type this sentence, as I am
doing, I must push the hyphen key three times—lo and behold, I just did it. Conversely,
although I surely should donate a lot more to causes fighting the prison-industrial
complex, and doubtless oughta do much more on behalf of ending fossil-fuel use, I
will not do either (hey, nobody’s perfect!).
The link between ‘must’ and intentional action appears tight. Whatever one thinks
one must do (whether important or trivial), one does. More cautiously—one might be
stymied, after all—one acts with the intention of doing it.
Indeed, among the English modal auxiliary verbs I understand, the connection
appears uniquely tight. What one thinks one can, could, may, might, should, or would
do (whether important or trivial), one often enough does not even try to do (Sloman,
, p. ); and much of what one will do, one won’t do intentionally.1 With the end
of understanding relations among agency, language, and modality, must stands out as
central, the remaining modals retreat to the periphery.2
Other linguistic phenomena connected to intentional action are imperative sen-
tences and the speech act of command. If Rance accepts Brent’s command ‘Pay the
rent!’ , Rance forms an intention to pay the rent, thinks he must pay the rent, and
endorses the sentence ‘I must pay the rent.’ . As for Brent, he issues the command
endorsing ‘Pay the rent!’, in the belief that Rance must pay the rent, endorsing the
sentence ‘Rance must pay the rent.’ , and in order to put on display an intention for
1 What one shall do is a matter I fear I cannot make much sense of: ‘shall’ is not in good standing in my
idiolect, and corpus uses are freighted with legalese and distorted by prescriptivism.
2 If ‘ethics’ is understood as the theory of value, ethics may be appropriately aligned with should/ought.
But if ‘ethics’ is understood as the theory of practical reason, ethics is appropriately aligned with must.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Rance to pay the rent.3 Running in the opposite direction, if Rance intends to pay the
rent, he arguably does so thinking he must pay the rent and endorsing ‘I must pay the
rent.’4 —and, with some plausibility, also endorsing the self-directed imperative ‘Pay
the rent!’.5
These various phenomena—individual and general-will intention, imperative sen-
tences, the issuance and acceptance of commands, and the associated obligative
readings of ‘must’—I group together under the rubric of obligation.6 This chapter maps
out the territory of obligation. At its center is the psychological state of intention. An
intention for Rance to pay the rent has a content R of a kind I call a procedure—
a structured, nonpropositional representation of a plan of action.7 The imperative
sentence ‘You pay the rent!’, with Rance as the addressee, shares this procedural
content R.8 Brent’s command addressed to Rance using this sentence puts on display
an intention—harbored by the ‘general will’ of a community including Rance and
Brent, voiced by Brent—with the content R. Accepting the command, Rance forms
an intention with content R.9
My core thesis is that a modal sentence given an obligative reading is a modalized
imperative (Castañeda, ; Hellie, ).10 Read obligatively (and as addressed to
3 Castañeda (, p. ) brings Kant and Hare to his side in maintaining that ‘imperatives are at the
center of the connection of ‘ought’[]-assertions with both action and rationality’. To my mind, this settles
on the wrong modal.
4 Contrast the view that the obligative meaning of must-sentences is ineliminably social (Ninan, ,
§.; Portner, , p. ).
5 Contrast the view that the obligative meaning of imperative sentences is ineliminably social (Lewis,
a; Kaufmann, ).
6 The deontic logic tradition studies an operator . That operator is given the intuitive glosses ‘ought’
and ‘it is obligatory that’. On my terminology, ‘obligation’ is expressed with must rather than ought. If
ordinary ‘obligation’ conforms to my usage, is either underspecified or semantically defective.
7 Contrast the view that intentions are a kind of belief (Setiya, , p. ).
8 A selection of opposing views of imperative contents: properties (Portner, , p. ); propositions
(Kaufmann, ); satisfaction–violation pairs (Vranas, ); ‘actions’ understood as ‘relations over worlds’
(Barker, ).
9 For a nuance, see note .
Charlow () salutarily links imperatival content to practical psychological states: for α an imperative,
[[α]] is a condition on ‘plans,’ objects with at least the structure of a set of action-kinds (); plans are (near
enough, perhaps) contents of a practical psychological state of ‘planning to do’ (Charlow, , passim). The
communicative meaning of an act of command using α is then (roughly) to constrain the content of the
audience’s ‘planning to do’-state so that it satisfies [[α]].
Charlow (, IV) advances this as improving upon a ‘Standard Account’ affirming the Display Thesis, on
which the content of σ is the content p of a mental state conventionally put on display by speech acts with
σ , and the orthodox Propositional-Attitude Thesis that all mental states are propositional attitudes. To my
mind, Charlow makes a compelling case that the Standard View founders on practical language phenomena.
On the present approach, the Display Thesis serves as a fixed point, while linguistic and psychological
content float in unison. I fear that without the Display Thesis, both linguistic and psychological theory
would be underconstrained. And (as a matter of personal taste) I would find questions about linguistic
meaning far less interesting if they did not shed light on psychology. I also wonder about the stability of
Charlow’s case against the Display Thesis: if ‘plans’ are psychological contents but are not propositional, the
Propositional-Attitude Thesis is already abandoned; but that makes room to preserve the Display Thesis.
10 In this slogan, a modal sentence is a sentence with a modal as its highest operator, while a modalized
blah is a modal sentence in which the modal taking widest scope operates on a sentence of type blah—for
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Rance), ‘You must pay the rent.’ operates syntactically on the imperative ‘You pay the
rent!’ and semantically on its content R.11 As is standard (Kaplan, ), I analyze
modals as operators on intensions, and assign intensions the job of representing entail-
ment; but I analyze entailment as the preservation not of truth, but of endorsement (a
mental attitude of ‘implicit acceptance’ toward a sentence: Hare, , p. ; contrast
Castañeda, ).12 In consequence, modals are not ‘metaphysical’ but rather ‘per-
spectival’ operators. Gathering these theses, the predicted meaning of ‘You must pay
the rent.’ is to express one’s perspective as endorsing ‘You pay the rent!’—namely, to
express one’s intention with content R.
A logical peculiarity unifies the phenomena of obligation, and contrasts them with
belief and its affiliated language of declarative sentences and epistemic/metaphysical
modality: the literature on obligative language labels this contrast the Ross Paradox
(Ross, ; compare Williams, ).13
For belief, conjunction both introduces and eliminates classically, and disjunction
introduces classically. I believe that I both posted this letter and drank up your wine
just if I believe that I posted this letter and I believe that I drank up your wine. And if
I believe that I posted this letter, I believe that I either posted this letter or drank up
your wine. Similarly for declarative sentences. ‘You posted this letter and drank up my
wine.’ is equivalent to the pair ‘You posted this letter.’ and ‘You drank up my wine.’ (the
conjunction entails each conjunct, the conjuncts together entail the conjunction). And
‘You posted this letter.’ entails ‘Either you posted this letter or you drank up my wine.’ .
Similarly for epistemic/metaphysical uses of must. ‘You must have posted this letter
and drunk up my wine.’ is equivalent to the pair ‘You must have posted this letter.’ and
‘You must have drunk up my wine.’ And ‘You must have posted this letter.’ entails ‘You
must have posted this letter or drunk up my wine.’ .
example, a modalized declarative and a modalized imperative are both modal sentences. I presume that
every modal sentence is declarative, so that a modalized imperative is a declarative modal sentence.
According to Geach (, p. ), ‘[in] a language that entirely lacked proper imperatives[,] we could
just use the plain future tense to give orders. [For example], “The patient will go down to the operating
theater.” ’. But if all directive modal sentences are modalized imperatives, this is wrong: the operand of ‘will’
is the indispensable imperative ‘The patient go down to the operating theater!’.
11 Contrast the ‘orthodox’ view that sentential-scope modals operate semantically on propositions
(Lewis, , , a, , §.; Kratzer, , ).
12 The conception of entailment as endorsement-preservation has been revived in more recent literature:
see H. Kamp, ; Veltman, ; Gillies, ; Yalcin, , inter alia.
This literature appears to presuppose endorsement-preservation to be incompatible with the intensional-
operator conception of modals—to my mind, incorrectly (Yli-Vakkuri, ; Hellie, ).
13 According to G. Kamp (, p. ), ‘Ross’s paradox [is] not only a reason to question the adequacy
of this or that deontic-logical rule or axiom, but provide[s] sufficient reason to drop the deontic-logical
approach altogether’ (quoted as translated in Hansen (, p. n. )). If so, casualties include Geach
(); Chellas (, p. ); Lewis (b); Parsons (). For a more extensive survey of the logical and
linguistical impact, respectively, see Hansen () and Starr ().
The subsequent literature on the Ross Paradox is extensive. A selection of notable recentish work:
Lascarides and Asher (); Aloni (); Vranas (); Willer (); Kaufmann (); Portner (,
§.); Starr (, §.); Charlow (, §..); Cariani ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
14 Contrast the view that the Ross Paradox is essentially linguistic (‘the idea here is to introduce
something like disjunction at the level of speech acts’: Charlow (, p. ), original emphasis).
15 I am inclined to agree with Charlow (, p. ) that the Ross Paradox for oughta is not quite as
compelling: ‘I oughta post this letter, so I oughta either post this letter or drink up all your wine.’ is better
than the must/bare imperative versions; the same goes for should. This suggests that oughta and should
cannot operate on procedures, but only on propositions.
Schroeder () and Chrisman () agree that oughta operates sometimes on propositions; Schroeder
but not Chrisman thinks oughta operates also on ‘actions’. Charlow’s observation supports Chrisman on the
question about oughta; though on the general question whether deontic modals can operate on things other
than declarative sentence semantic values, I side with Schroeder; though on the nuanced question whether
they operate on imperative semantic values, Chrisman (p. ) expresses sympathy: compare Chrisman
(, chapter ).
16 But procedure-disjunction does eliminate classically. Procedure-negation is similarly classical in
elimination but not in introduction.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Rationalization in belief (section .; Hellie, ) starts in evidence about local
physiological very recent history and flows toward belief about the more abstract and
remote, along channels of expectation. The origination in evidence makes information
stored in belief, such as that Rance posted a certain letter, transferrable: Fred preserves
information for his future reference, or transmits it to Brent. But if Fred wants the
information later or Brent wants it now, neither typically cares about why Fred wanted
initially to get it: their interest is limited to the raw propositional information. In that
sense, belief is ‘fundamentally’ propositional.
By contrast, rationalization in intention (section .) originates in abstract long-
term aims and/or open-ended projects and flows toward intention for the local
physiological very near future, along channels of instrumentality: to walk to work,
I walk halfway then the rest of the way; to walk halfway, I walk a quarter of the way
then the rest of the way; and so on (Anscombe, ; Thompson, ). Because the
rationalization of intention traces back to aims for which it is instrumental, intention
has a constrained ‘mandate.’ An intention to post a certain letter in service of paying
the rent is rationalized only by the end of paying the rent; the intention to post the letter
has no independent power to rationalize anything orthogonal to the end of paying the
rent, such as drinking up all Brent’s wine. That excludes assigning significance in the
content of intention to raw propositional information.
The closing % or so of the chapter brings obligative language into contact with
this psychological story. Assigning granulated proposition-determining procedures as
their contents, an abstract form of Ross-Paradoxicality percolates up to imperative
sentences (section ).17 The alignment between linguistic and psychological content
subserves a unification of intention with natural-language imperative endorsement
and command (section ): a ‘statics’ for imperative sentences grounding their endorse-
ment in intentions, together with an analysis of sentential intensions (and the affiliated
notions of validity and entailment) as reflecting patterns of endorsement (rather than
of truth),18 predicts natural-language Ross-Paradoxicality; a ‘dynamics’ for command
models the distribution of intentions using imperative sentences in their canonical
speech act. Obligative readings of must-sentences are brought into the fold by a
combination of means: a familiar statics for declarative sentences and dynamics for
their assertion; a grammatical analysis of obligative must-sentences as modalized
imperatives; and an analysis of modals as intensional operators (section ).19 This
package predicts that ‘You pay the rent!’ and ‘You must pay the rent.’ have the same
17 Boolean connective semantic values are classical Boolean operations, while composition adjusts:
namely, Booleanizing imperatives, the Boolean operation acts on the propositions while the partitions are
multiplied.
18 I therefore agree with Charlow (this volume, chapter ) about the desirability of a ‘reorientation
of semantic theorizing away from the aim of delivering a sentence’s truth-condition, toward the aim of
modeling the state of competently accepting a sentence’.
19 Cariani () analyses φ as true just if (i) of the best salient options partitioning modal space, all
entail [[φ]], (ii) no option entailing [[φ]] is ‘undesirable’, and (iii) the options do not cross-cut [[φ]]. Cariani
aims to solve, inter alia, the Ross Paradox and the ‘Professor Procrastinate’ puzzle (it is not true that he
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
endorsement-condition and are therefore equivalent; and that an assertion using the
latter can have the same communicative meaning as a command using the former.
The phenomenon of permission appears only in concluding remarks:20 I consider
why the dual of obligative must is in a sense ‘elusive’ in natural language; and I propose
a solution to Lewis’s (b) ‘problem about permission’.
oughta accept an assignment, because he presumably won’t carry it out, but he oughta accept the assignment
and carry it out).
Some worries. First, perhaps there is no Ross Paradox for oughta: see note . Second, Cariani complains
his opponent ‘must resort [] to completely different resistance strategies for’ the Ross Paradox and the
Professor Procrastinate puzzle. He also ‘refrain[s] from drawing any conclusions about the imperative
version of the [Ross Paradox]’ (p. , n. ). But there is no imperative version of the Professor Procrastinate
puzzle, so a unified treatment is not warranted. Third, if it is salient that I play with the dog all afternoon
and not ‘undesirable’ that I play with the dog but do not post this letter, then (post this letter) entails
(post this letter ∨ play with the dog all afternoon). Fourth, if it is salient that I drink up all your wine,
then (post this letter) entails ((post this letter ∧ drink up all my wine) ∨ (post this letter ∧ don’t drink
up all my wine)).
20 On a contrasting approach originated by Åqvist () (compare Aloni, ; Veltman, ; and
Charlow, and , §..), an imperative has both permission- and obligation-content. I am sym-
pathetic to the spirit behind this account, and I conjecture that its apparatus affords the same predictive
resources as mine. But permission is a ‘positive’ phenomenon only socially, and not psychologically:
postulating a distinctive variety of content for permission is theoretically odd, somewhat akin to doing
so for epistemic possibility.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
actual world: horses do eat hay, so the set of worlds where they do includes actuality,
so the proposition that they do is true; goats do not eat cans, so the set of worlds where
they do excludes actuality, so the proposition that they do is false.
The content of Fred’s total belief state can be identified with a certain set of
worlds, Fred’s doxastic possibilities (Hintikka, ; Stalnaker, ; Lewis, ): the
set containing w just if Fred, so to speak, takes w seriously as a candidate for actuality.
In believing that goats eat cans, Fred has ruled out of candidacy for actuality all worlds
in which goats do not eat cans; in his uncertainty whether horses eat hay, Fred takes
seriously at least some worlds in which they do and some in which they do not.
More generally, Fred believes that p just if his doxastic possibilities are a subset of
the p-worlds, disbelieves that p just if his doxastic possibilities are a subset of the not-
p-worlds, and is otherwise uncertain.
Belief is said to ‘aim at the truth’ (compare Frege, ). If belief aims at the whole
truth—aims to get doxastic possibilities down to just the actual world—belief will
never hit this target, because we never uncover the whole truth. By contrast, missing
the target is easy: in ruling out all worlds in which goats do not eat cans, Fred has ruled
out the actual world, and (unless he changes his mind) he will never get it back in his
sights. As far as the whole truth is concerned, the best we can hope for is to have not
yet missed it.
Aiming at the whole truth might seem pointless: we can never succeed. And it
might seem a fool’s errand: we can be reasonably sure of failing right out of the gate.
Why bother? Fortunately, individual truths are much bigger targets. If Fred aimed at
the truth that lambs eat ivy, he hit it: his doxastic possibilities are a subset of that
proposition, so (unless he changes his mind) Fred will never lose his grip on that truth.
Does belief aim at the whole truth, or just at truths? —Let us not yet legislate, but
instead distinguish protean belief, which aims at the whole truth, from granular belief,
which aims at particular truths.
When aiming at the truth, in which direction do we point? —If Fred knew to aim at
the truth that lambs eat ivy, he would already have it in hand as a truth, and would not
need to aim at it. That at which we aim need not be the truth down to the last detail,
but only the truth about a certain issue: of the worlds where lambs eat ivy and the set
of worlds where they do not, which of those sets includes the actual world. Aiming
at an individual truth is setting up a question to be answered; letting the arrow fly is
answering it, by settling belief into one of its answers.
Setting up a question to be answered is wondering about it.21 If one wonders, one
wonders whether goats eat cans, or who shot JR, or where Waldo went. Papering over
certain complexities, whatever one wonders sets up a range of answers on any of
which one might settle. A range of answers can be pictured as a bunch of lines of
subdivision cutting through modal space: what the wonderer aims to settle is which
of the subdivided lots contains the actual world. Or, more formally, as partition of
21 I retell an orthodox story: classic texts are Belnap () and Groenendijk and Stokhof (, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
is a subset of p, we may say that Fred has the granular belief that p. That proposition p
is not incorrect just if C@ is a subset of p. That is so just if Fred’s granular belief that p
is a success; otherwise it is a failure. If b is not a subset of p, then Fred does not believe
p: perhaps b is a subset of the complement of p, so Fred disbelieves p. Otherwise, b
leaves it open whether p, and Fred is uncertain whether p; moreover, when one is
uncertain about a proposition discriminable within the questions one has asked, say
one is indecisive about that proposition. The success/failure conditions of disbelief are
obviously the opposite of those of belief.
What of those propositions p which are not discriminable within π ? For example,
I supposed earlier that Fred had not asked whether ibexes eat amphorae. Perhaps that
proposition is independent of the totality of questions Fred has asked: any cell in π
contains worlds where ibexes eat amphorae and cells where they do not. Then Fred
can have no opinion on the matter: Fred does not believe that ibexes eat amphorae,
nor does he disbelieve it.
In that case, Fred has no opinion on the matter because nothing he cares to find out
could settle it. But other cases of propositions not discriminable within the totality of
Fred’s questions are not that way. Consider the proposition that either goats eat cans or
ibexes eat amphorae. Fred believes the stronger proposition that goats eat cans. Does
he thereby believe the weaker proposition? It is not clear whether intuition pronounces
on the matter; perhaps theoretical motivations may be found for saying one, perhaps
for saying the other; perhaps for giving one answer for some cases, the other for
others. Fortunately, we have apparatus enough to allow us to say either. Fred protean-
believes it because all doxastic worlds are members of the disjunctive proposition.
As for granular belief, we could group the threefold division of belief, disbelief, and
indecision under a category of concern, and set that supercategory against an exclusive
category of indifference: and we could say that Fred has concern whether p just if p is
discriminable within the content of his questions, and is otherwise indifferent. If we
decided to use the apparatus in that way, we would rule Fred as indifferent regarding
the disjunctive proposition.
How does the normative status of indecision (recall, uncertainty regarding a ques-
tion one has asked) compare to that of indifference? Having asked a question but
not answered it, one’s situation is clearly neither (yet) a success nor a failure by the
standards of belief. Arguably, though, indecision has a negative normative status along
a different dimension: questions are ‘to-be-answered’; having asked one, we are in a
situation that is in a sense ‘unsettled’ until we settle it. Indifference, by contrast, is
none of the above: neither successful nor failed, neither settled nor unsettled.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
So I claim. Intention is far less well-understood than belief, so I do not hope to ride
the winds of orthodoxy, but will have to argue the point. Unfortunately, establishing a
stable foundation requires drilling down far to hit bedrock: familiar formally-driven
stories lack phenomenological plausibility; familiar phenomenologically-driven sto-
ries lack formal rigor. So rather than with tools or doctrine, I begin by collecting data—
in particular, about the bearing of various worldly circumstances on the success or
failure of intention. I will then run these through the apparatus of the previous section
to generate my thesis that the content of intention is always granular.
For data-gathering, I exploit Lewis’s (b) device of a ‘questionnaire’—to my
mind, and I hope you will agree, it shows we have some ‘very firm and definite
opinions’ about the success and failure of intention.
First question. On Monday, you are strolling along on the way to work, preoccupied with
thoughts about deontic modals. Your attention is briefly drawn to a stone skittering along the
sidewalk, which you appear to have just kicked. You pay it little mind and continue on your
way. Consider the process-type kicking a stone. Does your having just undergone a process of
this type constitute your success or failure at anything?
Answer. Of course not. Whether you kick any stones is a matter of complete indifference to you.
Had you not kicked any stones at all on the way to work, that would not have constituted your
success or failure at anything either.
Second question. On Tuesday, it is as on Monday, except that you have set out with a mindfulness
project requiring not kicking any stones—but, again preoccupied, you kick a stone. Does your
having just undergone a process of the type kicking a stone constitute your success or failure at
anything this time?
Answer. Yes. You have failed at your intention to avoid kicking any stones.
Third question. On Wednesday, vexed at yesterday’s failure, you have set yourself the intention
of giving a good hard kick to some stone. Spotting one, you kick it. Does your having just
undergone a process of the type kicking a stone constitute your success or failure at anything
this time?
Answer. Yes. You have succeeded at your intention to kick a stone.
Last question. On Thursday, when you arrive at work, you set yourself the intention of either
reading the new article by N or the new article by M that day. You manage to steal enough time
to read the new article by N over the course of the day. Does your having undergone a process
of the type reading the new article by N constitute your success or failure at anything?
Answer. Yes. You have succeeded at your intention to read either the new article by N or the
new article by M. You would have equally well succeeded had you instead read the new article
by M. And you would have succeeded at that intention had you been able to read both.
If some of this does not seem obvious, please reconsider; if it all does, splendid! —Let
us use it as evidence for what follows.
It is not the answers to the questionnaire by themselves that establish the content of
intention to be never protean. For that matter, they do not even establish the content
of intention to be ever granular. The obvious proposal for protean content is this: the
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
content of intention includes, on Monday, both worlds where one kicks stones and
worlds where one does not; on Tuesday, only worlds where one does not kick stones;
on Wednesday, only worlds where one does kick stones; and on Thursday, only worlds
where one reads either the article by N or the article by M (or both).
But a case for granular content emerges in working through the following exercise:
Exercise. On Friday, you are strolling along on the way to work, self-possessed, confident in
your way of being. You have recognized that true mindfulness consists in equanimity about the
trivial. Regarding the vexatious kicking of stones, you have set yourself the intention to either
kick (one or more) stones or not kick (any) stones. As it happens, you kick a stone. Does your
having just undergone a process of the type kicking a stone constitute your success or failure at
anything this time?
Answer. Yes. You have succeeded at your intention to either kick stones or not kick stones. But
you would have equally well succeeded had you not kicked any stones.
After all, recall that on Thursday, one has resolved to do at least one of two things: read the
article by N; read the article by M. So long as one does at least one of them, one will succeed,
and will not fail. One will fail only if one does neither.
To generalize, whenever one intends to act in at least one of two ways, one fails only by doing
neither, succeeds by doing either (or both). It should not matter whether the thing in question
is ‘positive’ or ‘negative’. If one resolves either to read the article by N or drink five or more cups
of coffee, one fails only if one neither reads the article nor drinks five or more cups of coffee,
otherwise succeeds. And if one resolves either to read the article by N or not drink five or more
cups of coffee, one fails only if one does not read the article and does drink five or more cups of
coffee, otherwise succeeds.
Now, when an intention is to satisfy at least one of two properties, one of them the negation of the
other, it should still be that one fails only by doing neither, succeeds by doing either (or both).
Why should the content make a difference? Normative psychology does not ordinarily involve
that kind of exception, so best not to postulate it now, either. Now of course, when one intends
to act at least by kicking a stone or not kicking a stone, one cannot do neither and cannot do
both. Because the only remaining alternatives are do the one—kick a stone—or do the other—do
not kick a stone—one must do exactly one of those. Either way, one succeeds.
So sometimes one’s intentions are guaranteed to succeed. I am inclined to see this consequence
as an occasion for rejoicing, rather than resistance.
Intention, like belief, can involve a distinction between concern with an issue—the
prospect of success or failure, depending on how the world meshes with the content
of one’s attitude—and indifference, in which the answer to the question cannot imbue
the relevant aspect of one’s mental state with either success or failure.
Indeed, distinguishing concern and indifference is at least as warranted for intention
as for belief. Contrast Monday and Friday. On Monday, whether one kicks a stone is a
matter of no significance for the success or failure of any intention: one is indifferent.
But on Friday, there is an intention such that, if one kicks a stone, it will succeed, and
if one does not kick a stone, it will succeed: one’s attitude is equanimity.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
The distinction between indifference and equanimity, like the distinction in belief
between indifference and indecision, cannot be represented with protean content. On
the earlier attempt at a protean characterization, the Monday content contained kick a
stone-worlds and don’t kick a stone-worlds; the Tuesday content contained only don’t
kick a stone-worlds; the Wednesday content contained only kick a stone-worlds. The
Friday content should not distinguish kick a stone and don’t kick a stone-worlds; and
we surely do not want it to do so by excluding all worlds; so the Friday content is like
the Monday content in containing both kinds of world.
Saying what the content is does not yet settle the bearing of this on the
success/failure-status of one’s intention. Presumably false content makes for failure.
Perhaps true content makes for success (as with granular belief). Then one’s intention
succeeds no matter what on Friday, desirably—but then it also succeeds no matter
what on Monday, undesirably. Or perhaps true content makes for mere non-failure
(as with protean belief). Then one’s intention neither fails nor succeeds no matter what
on Monday, desirably—but then it also neither fails nor succeeds no matter what on
Friday, undesirably.
So protean intention cannot make enough distinctions. By contrast, this is no
problem for granular intention: Monday does not partition kick a stone from don’t
kick a stone-worlds, but Tuesday, Wednesday, and Friday do; Tuesday, in addition,
designates the don’t kick a stone-answer, by involving a proposition that is a subset of
the don’t kick a stone-cell; Wednesday designates the kick a stone-answer, by involving a
proposition that is a subset of the kick a stone-cell; and Friday designates both answers,
by involving a proposition overlapping both cells.23
Direction of Rationalization
Belief is arguably sensibly spoken of both as protean and as granular; intention is at
least sensibly spoken of as granular. Can intention be sensibly spoken of as protean?
No: this section and the next trace the contrast between intention and belief to their
opposing polarity in what I call direction of rationalization.
I will sidle up on this notion by reflecting on the challenges faced by the venerable if
obscure notion of direction of fit. In the tradition, belief and intention (often: ‘desire’)
are said to display oppositely polarized directions of fit with regard to the world: in
some sense duly capturing their fundamental normative contrast, belief is to fit the
world, whereas the world is to fit intention—where ‘A is to fit B’ means B somehow
exerts normative pressure on A to fit B. Alluring, to be sure. But what on earth could
it mean?
23 I explicate designation in terms of what makes for success, in contrast with in terms of what is permitted.
In that sense, my view represents disjunction as ‘alternative-presenting’ in contrast with ‘choice-offering’:
Rescher and Robison (, p. ); Åqvist ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
For ‘fit,’ I imagine truth of content is just enough. Substituting, beliefs are often true
thanks to the world’s exerting normative pressure on them, while intentions are often
true thanks to their exerting normative pressure on the world. This is not so plausible:
how is the world ‘normatively pressured’ to make intentions true (Humberstone, ;
Frost, )? If an errant bus undermines my intention to walk to work, that may
be problematic in many ways—but I doubt one of them will be the world’s regret at
violating its half of the bargain struck in my forming my intention.
A popular fix has been to exchange normative pressure for causal pressure (Smith,
; Velleman, ; Setiya, ). Beliefs are often true thanks to the world’s exerting
causal pressure on them, while intentions are often true thanks to their exerting causal
pressure on the world. Of course the world impinges on our sensory organs, and in this,
we acquire evidence, which then ramifies in further, more abstract beliefs; when these
do not go too far from evidence, they are often true. And of course, we implement our
more abstract intentions through intentional motions of our bodies, which exert gentle
and local causal pressure on the world; when we planned well and matters beyond
our locality do not impede us, our intentions often come true. But the body is where
causality ends, for belief, and starts, for intention. Characterizing the post/pre-bodily,
psychological part of the story as just more causation does not square comfortably
with ordinary understanding of psychology (is ‘phenomenologically off-key’). But the
point of discussing direction of fit is to shed light on psychology: so this fix changes
the subject unhelpfully.
This suggests that the remedy is to constrain our ambition: we need to limit the
world’s exposure to normativity. We can continue to distinguish belief and intention
in direction of normative pressure if the norm of truth is exchanged for a more
psychology-internal, structural normative constraint based in a notion of rationaliza-
tion (Hellie, , ). If psychology and the world are isolated, this will overshoot,
kicking the world out of the story. Fortunately, psychology and the world overlap at
the body: for belief, it is where causality ends, but also where rationalization begins;
for intention, it is where causality begins, but also where rationalization ends. I will
add detail to this sketch after introducing ‘rationalization’.
The root structural psychology-internal norm is intelligibility.24 For an example of
its absence, suppose Fred asserts ‘it is false that either goats eat cans or horses eat hay’
and then asserts ‘goats eat cans’. How is Fred thinking of the world? Without further
information, I am in the dark.
This case does not distinguish the internal norm of intelligibilty from the external
norm of truth: Fred appears to have contradicted himself, so his statements at face
value cannot possibly be true. But intelligibility and truth can be distinguished by
means of Moore’s Paradox. Suppose we ask Fred whether goats eat cans, and he
24 Compare ‘If we say “it does not make sense for this man to say he did this for no particular reason” we
are not “excluding a form of words from the language”; we are saying “we cannot understand such a man” ’
(Anscombe, , §).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
emphatically replies ‘yes, goats do indeed eat cans’. One of us did not catch his answer,
and follows up: ‘sorry, was it your opinion that goats eat cans?’ Fred answers ‘Now
you are asking me my opinion—I thought this was supposed to be about goats. No,
I am not of that opinion’. Here too it is hard to make sense of what is going on with
Fred. But this is not because it is impossible for goats to eat cans though Fred lacks the
opinion that they do. Instead, it is because there is no intelligible perspective which
pictures goats as eating cans without lucid recognition under at least some mode of
presentation that it does so.
Why think intelligibility is a ‘norm’? —Not because intelligibility is guaranteed
to be beneficial (contrast Parfit, , §; Harman, , § .) or is connected to
anything else of value or to any other sort of value. Rather, Fred’s intelligibility is a
boundary condition on our using psychological categories to describe or explain him.
No intelligibility, no location in the ‘space of reasons’ at all: intelligibility is a ‘necessary
precondition on the possibility of normativity’. If that is not enough for counting as a
norm, but only as a ‘norm*’, I will concede the point—but in the interest of saving ink
and trees, I leave the ‘*’ implicit.
Intelligibility provides a basis for defining a variety of ‘normative pressure’ exerted
by mental states on one another: this rationalization. To illustrate, Sam thinks her pet
rabbit, Rupert, has escaped. Why? Sam replies ‘if Rupert is in the house, I always know
which room he is in; and I don’t know which room Rupert is in’—modus tollens. The
first explanans is the state supporting the conditional: a pattern of expectation that is
notably ‘stable’ in Sam’s mental life. The second is the rejection of the consequent, a
temporary condition of uncertainty in Rupert’s whereabouts. Having accepted these
explanantia, we would find Sam unintelligible if she did not think Rupert had escaped.
Unintelligibility is a ‘boundary condition’ on psychological characterization; knowing
all this, we therefore have no choice but to accept the explanandum (that Sam
thinks Rupert has escaped). So what we learned explains the belief; the bankshot
off of intelligibility is what makes the explanation a rationalization. In general, then,
some states rationalize others when the former explain the latter by way of its being
unintelligible to have the former without the latter. Because rationalization is an
explanatory relation, it inherits all the directedness of explanation.
I propose that the opposite polarity of belief and intention pertains to the direction
of rationalization. For belief, rationalization flows, in a sense, ‘inward’; for intention,
it flows ‘outward’. But inward or outward toward what and from what?
To see the answer, suppose Brent is off to Lower Slobbovia. He thinks Sam knows
how one will eat in Lower Slobbovia, and wants to know how he will eat, so he asks
her. Sam replies ‘you will eat terribly in Lower Slobbovia’. So: should Brent believe he
will eat terribly in Lower Slobbovia? And should he intend to eat terribly in Lower
Slobbovia?
Very plausibly, learning Sam’s answer rationalizes Brent believing he will eat terribly
in Lower Slobbovia. Brent thinks Sam knows how one will eat in Lower Slobbovia, and
has learned that Sam believes he will eat terribly in Lower Slobbovia. I am inclined to
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
say that if A thinks B knows the answer to a certain question, and A thinks B’s answer
to the question is P, then A thinks P is true—on pain of unintelligibility (%‘Rance
knows whether goats eat cans and thinks they don’t—but I don’t know whether goats
eat cans’). So it is unintelligible for Brent not to believe he will eat terribly in Lower
Slobbovia.
But Sam’s answer does not rationalize Brent’s intending to eat terribly in Lower
Slobbovia. Brent might very well, knowing this, strive with great energy—albeit
pessimistically—to eat well in Lower Slobbovia. Perhaps intention in some sense
requires belief. But only for the completely acquiescent/megalomanical fatalist is belief
always accompanied by intention!
To explain the contrast, I will quickly rehearse a model of ‘rational architecture’ I
have defended elsewhere (Hellie, , § .; , §§ –). Belief begins in evidence:
evidence must be true, must be believed, consists of facts broadly about ‘perception’,
and is given under the ‘mode of presentation’ of perception (Lewis, b). So, for
example, if I see a dagger before me, my evidence is some true proposition about my
‘visual condition’—perhaps that I see a dagger before me, perhaps that my retina is
stimulated daggerishly, perhaps that my brain is agitated daggerishly. Evidence must
be true and must be believed, but there remains plenty of room for falsehood in belief.
The perceptual ‘mode of presentation’ typically must be ‘recoded’ into something less
evanescent and more ‘conceptualized’ (Hellie, , § .–); even then, my evidence
concerns just my perceived surroundings up to the present, so my full rich set of beliefs
requires extensive ‘amplification’ by my pattern of expectations.25
(If I lucidly dream of a dagger before me, my evidence is again some true
proposition about my ‘visual condition’—perhaps that I am imagining a dagger, or
that my visual cortex is agitated daggerishly; either way, it is certainly possible to
have mistaken ‘background beliefs’ about my condition and therefore ‘recode’ the
perceptual mode of presentation in a way that conflicts with my actual evidence: in
that case, I wind up with inconsistent beliefs, and am not fully intelligible: Hellie, ,
§ ; Hellie, , § ..)
Because evidence must be believed, evidence pushes belief around. Our expecta-
tions extract a great deal more from evidence than the content of evidence strictly
entails; still, this leveraging move is not without risk, in that evidence might always
surprise us and wind up contradicting what we expected: when that happens,
we lapse from full intelligibility until consistency is restored (making ‘belief willy
nilly’ generally to be avoided). In this sense, rationalization in belief flows ‘from
evidence’.
That evidence can only have true content is good fortune, given that it must be
believed. But this also gives evidence a foot on both sides of the fence: evidence is
25 A congenial theory of the mechanics of expectation exploits the sort of construction behind ‘presum-
ably’ in Veltman () and exploited by Yalcin (this volume, chapter ) in the analysis of should; I conjecture
that ‘ranking functions’ (Spohn, ; Huber, ) afford an interchangeable apparatus.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
‘in the world’, and also ‘in the mind’. In that sense, when rationalization in belief flows
‘from evidence’, it flows ‘inward’; and that is the good sense to be made of the claim
that belief ‘fits the world’.
One’s evidence at a given time pertains to the stretch of history ending at the present
moment. But one’s intentions pertain to the stretch of history beginning at the present
moment. So intention cannot radiate from evidence: evidence constrains intention at
best in partnership with expectation. So rationalization for intention does not (for that
reason anyway) flow ‘inward’.
Indeed, it flows ‘outward’. The paradigmatic rationalization in intention is from
a longer-term, more ambitious ‘governing intention’ to a structure of shorter-term,
less ambitious ‘implementing intentions’ undertaken as instrumental to executing the
governing intention (Anscombe, ; Thompson, ). Suppose I set myself to walk
to the office. I have to implement this intention somehow: I can’t just blink my eyes
and it happens. In particular, I have to walk the first half of the way, and then walk the
second half of the way. But to do all this intentionally, I need intentions to do both of
those things. And they too need to be implemented: to do the first, I have to walk the
first quarter of the way, then the second quarter of the way. And so on. At some point I
am implementing my intention to walk to the corner by walking halfway to the corner,
implementing that by taking three steps, implementing that by taking one step, . . .
implementing that by exerting all sorts of micro-forces throughout my body, . . . .
More generally, long-term ambitious intentions rationalize shorter-term less ambi-
tious intentions, and so on—eventually arriving at motor coordination in the here
and now. Now, motor coordination in the here and now is at least somewhat like
evidence: each pertains to the here and now, and to matters impinging pretty closely
on physiology. If they differ, it is in that motor coordination is about moving, while
evidence is about sensing. And maybe moving and sensing, at the level of physiology,
are not really dissociable: perhaps all there is is the ‘sensorimotor’. So either way,
rationalization of belief flows ‘inward’ from evidence about the sensory (or perhaps
the sensorimotor), while rationalization of intention flows ‘outward’ toward intention
for the motor (or perhaps the sensorimotor).
So, in sum, belief and intention are oppositely polarized in their direction of
rationalization: rationalization in belief flows inward from the physiological here and
now qua evidence; rationalization in intention flows outward toward the physiological
here and now qua near-momentary intentional action.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Why is that not a compelling reason to jettison protean belief entirely? Because
belief is inward-directed, it has a kind of ‘objectivity’: my beliefs are not just for me, nor
yours just for you; rather, mine can be for you and yours for me. The granulation Fred
imposes on his beliefs, conversely, lacks ‘objectivity’: it is an idiosyncratic matter of
how Fred packages his information for himself, indispensable to Fred but potentially
of no interest to anyone else. Fancy words? This ‘objectivity’ is easy enough to
operationalize: the most salient symptom is that accepting testimony is rationalizable.
I therefore argue as follows. First: because belief is inward-directed, accepting
testimony is rationalizable. Second: because accepting testimony is rationalizable,
belief has protean content.
I do not expect the first premiss to be controversial. If Brent thinks Fred’s beliefs
about animal husbandry have lost their moorings in evidence, it would be unintelli-
gible for Brent to accept anything Fred says on the matter. Conversely: even if Rance
thinks Sam’s beliefs about the culinary amplify her evidence considerably, that is not
in itself a reason not to accept what she says. After all, it may be that Rance’s own
beliefs amplify evidence in more or less the same pattern as Sam’s. If so, Sam’s beliefs
about the culinary can be expected to be roughly the same as Rance’s would be if he
had her evidence. So if this is Rance’s view, Rance will expect Sam’s testimony to just
put on display reasonable expectations on the basis of some (in absentia) evidence. If
so, when Sam asserts that p, and Rance is concerned whether p, for Rance to remain
indecisive whether p would be either just as hard to make sense of as ignoring evidence
on a matter of concern or just as hard to make sense of as skepticism about his own
reasonable expectations. So without inward-directedness, accepting testimony would
not be rationalizable, while with inward-directedness, what is not rationalizable is
indecisiveness regarding at least favorable cases of testimony over matters of concern.
So in that sense, it is because of the inward-directedness of belief that accepting
testimony is rationalizable.
Now to the second premiss. The idea here is that sometimes when accommodating
your issues would be a barrier to my acquiring information from you, I can still do it;
but then I can gather your information without accommodating your issues; and if so,
I think of your information as protean belief.
Suppose each of Rance and Brent was indifferent whether Amber would be at the
party when he arrived, and indifferent whether Bruno would be there. But while
individually, neither Amber nor Bruno was a matter of concern to either Rance or
Brent, Amber and Bruno collectively were a matter of concern to each—though in
different ways. Rance was worried that both Amber and Bruno might be there, while
Brent was worried that Amber might be there without Bruno. The issue of concern to
each, then, was a polar question: for Rance, ?(a ∩ b); for Brent, ?(a ∩ b). Upon arrival,
each was happy to answer his question in the negative—Rance believing a ∩ b, Brent
a ∩ b—and gave it no further thought.
Some time later, Sam, who does not know Bruno, is wondering whether Amber was
at the party: of concern to Sam is ?a, but not ?b. Knowing Rance and Brent were there,
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
she asks each whether Amber was there. Rance replies ‘I don’t know, but Amber wasn’t
there with Bruno’; Brent replies ‘I don’t know, but Amber wasn’t there without Bruno’.
Sam concludes that Amber was not at the party—believing a and thereby settling her
question in the negative.
Suppose now for reductio that belief is essentially granular. In that case, the content
of Rance’s posterior belief entails neither ?a, W
nor ?(a ∩ b), W
, the content of
Brent’s posterior belief entails neither ?a, W
nor ?(a ∩ b), W
, and the content of
Sam’s posterior belief entails neither ?(a ∩ b), W
nor ?(a ∩ b), W
.
Grant me that if an assertion communicates a content X, for the speaker’s posterior
belief-content S and the audience’s posterior belief-content A, both S X and A
X. So whatever was communicated to Sam by either Rance or Brent entails neither
?a, W
, nor ?(a∩b), W
, nor ?(a∩b), W
. But the only other granulated proposition
even conceivably in the ballpark for what was communicated is the trivial granulated
proposition := {W}, W
—which could not have settled anything for Sam, as the
communications evidently did.
So, I conclude, belief is not essentially granular. Rance’s assertion communicated his
protean belief that a ∩ b, and Brent’s communicated a ∩ b, with the consequence that
Sam’s posterior protean belief entails a—and her question ?a is settled in the negative.
. Outward-directedness and granular content
The direction of rationalization of intention is oppositely polarized, in the sense
that what rationalizes a given intention is (ordinarily,26 and in significant part) the
‘governing’ intention it implements.
For this reason, intentions are not in any reasonable sense transferrable. Suppose
Rachel intends to hang a picture Joan thinks is ugly. Joan need not intend to hang that
picture herself; or support Rachel in doing so; or even stay out of the way: Joan might
quite reasonably try, perhaps, to foil Rachel. The intention itself does not transfer. Each
of Joan and Rachel might be confident their side will prevail, with Rachel believing she
will hang the picture and Joan believing Rachel will not hang the picture. Any belief
associated with the intention of either need not transfer either. The reason for ascribing
protean content to belief does not apply to intention.
Conversely, the business of intention—implementing its governing intention—
requires its content to be granular. A given intention has a limited mandate: it is to
carry out its role in the plan for implementing the governing intention, and then
expire.
For example, Sam intends to do her job today, and to do that, she has to get to the
office. So she sets out after breakfast with an intention to walk to work at the office.
26 The hedge is to accommodate the rationalization of ‘highest’ intention: it has to top out at some stage,
so at that stage intention is either self-rationalizing or arational or rationalized by a governing non-intention.
The status of the ‘ultimate sources of motivation’ is orthogonal to the planning relations that are the focus
of my argument.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
She plans, and therefore intends, to do this by first walking halfway and then the rest
of the way; she plans to walk halfway by first walking the first quarter of the way and
then walking the second quarter of the way; and so forth. The intention to walk the
second quarter of the way is rationalized by the intention to walk halfway. Its mandate
is just to do its part in making Sam walk halfway, and then expire. While exercising that
mandate requires the creation of other intentions it in turn rationalizes, the intentions
it creates are rationalized only through the contribution they make to attaining the
goal of walking the second quarter of the way.
So in particular, the intention to walk the second quarter of the way could rationalize
an intention to walk the third eighth of the way. But it could not rationalize a
disjunctive intention to either walk the third eighth of the way or empty the bank
account and go on a drinking spree after work. That disjunctive intention succeeds
unless one neither walks the third eighth of the way nor empties the bank account and
goes on a drinking spree after work. In particular, it succeeds if one empties the bank
account and goes on a drinking spree after work. An intention rationalizes whatever
might be appropriate for realizing its success-condition; so the disjunctive intention
rationalizes, if appropriate, emptying the bank account and going on a drinking spree.
So if the intention to walk the second quarter of the way rationalizes the disjunctive
intention, it can (derivatively) rationalize emptying the bank account and going on a
drinking spree (just so long as it also rationalizes walking the third eighth of the way).
But emptying the bank account and going on a drinking spree makes no contribution
whatever to walking to work; so because the intention to walk to work is what governs
the intention to walk the second quarter of the way, the rationalization bestowed on
the latter is restricted to its contribution to the aim of walking to work. If it ‘goes rogue’,
so to speak, it does so without any rationalization, and therefore without any power to
rationalize the wacky splinter plan.
So a disjunctive intention is not in general rationalized by its disjunct-intentions.
But a conjunctive intention does seem to rationalize its conjunct-intentions: if I intend
to have breakfast and then go back to sleep, I intend to have breakfast and I intend to
go back to sleep (after eating breakfast).
Disjunction-introduction, understood as a rule of rationalization for the contents
of intentions, seems to be invalid; but conjunction-elimination is valid. That means
that rationalization by the contents of intention sometimes traces decreasing strength
of protean content, but not always. If not, the contents of intentions cannot be protean.
Whatever these contents may be—I call them procedures—it is plausible that
they at least determine a partition over something. As above, entailment over
granular propositions validates classical conjunction-introduction and -elimination,
and disjunction- and negation-elimination, but neither disjunction- nor negation-
introduction. Intentions, we have seen, validate conjunction-elimination but not
disjunction-introduction: and they seem to conform to the remaining patterns as well.
Conjunct-intentions do seem to rationalize their conjunctive intention: if I intend to
work and I intend to whistle, I intend to whistle and work (conjunction-introduction).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Imperative Semantics
The conclusion of the foregoing discussion is that belief and intention contrast in
that belief is optionally (and perhaps fundamentally) protean, while intention is
mandatorily granular. This mandates Ross-Paradoxical behavior for the content of
intention while permitting classical behavior for the (perhaps fundamental) content
of belief.
I now transition from the psychology to the language of obligation. In these
concluding sections, I will describe a series of increasingly complex languages.
Collectively, these (i) show how agents conforming to my psychological hypotheses
might perform speech acts to put (individual) beliefs and (general-will) intentions on
display, in order to collect individual beliefs and parcel out collective intentions; and
(ii) illustrate meanings within such a practice for expressions designed to resemble
ordinary-English Boolean connectives and modal operators. These languages con-
form, in various nuanced ways, to the phenomena of obligation with which I opened
the chapter: the connection between the language and psychology of obligation;
certain commonalities and distinctions between command uses of imperatives and
obligative uses of must, including the Ross Paradox; the elusiveness of their ‘duals’,
permission and permissive can/could.
The initial language L contains simple imperative sentences, with procedure-type
semantic values, and classical Boolean connectives: with its natural entailment rela-
tion, the of section , L is Ross-Paradoxical. Next, L introduces subject-predicate
structure to imperatives, allowing connectives to take scope: narrow-scope disjunction
‘softens’ Ross-Paradoxicality, as in English.
27 Three examples. Delayed triggering: wait five minutes, then stir; remove on July ; leave at sundown;
smile when they walk in; fold in the whipped cream, then put it in the oven. Conditionalization: if it is A, do
α; if it is B, do β; if you can’t tell, do γ . Delegation: the contrast between Rance’s intention to pay the rent,
which is essentially ‘de se’ or subjectless, and our intention for Rance to pay the rent, which adds a subject.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
syntax
Lexicon: (i) a stock of elemental imperative sentences A, B, etc.; (ii) Boolean
connectives ∧, ∨, ¬.
Composition: if α , . . . , αn are imperative sentences and C is an n-place Boolean
connective, C(α , . . . , αn ) is an imperative sentence.
semantics
Ontology: Let PR be the set of all procedures; and for present purposes, let a
procedure be a granulated proposition: recall, a partition paired with a proposition
it discriminates (so PR := {π , p
: π a partition of W ∧ (∃S ⊆ π )(p = S)});
for = X, Y
, let π() = X and p() = Y.
Lexicon: (i) When α is an elemental imperative, for some p ⊆ W, [[α]] = {p, p}, p
;
(ii) [[∧]] = ∩, [[∨]] = ∪, [[¬]] = λx x.29
Composition: [[C(α , . . . , αn )]] = π([[α ]]) × · · · × π([[αn ]]), [[C]](p([[α ]]), . . . ,
p([[αn ]]))
.
28 The treatment of compositionality falls short of full rigor, but without I hope undermining my central
aims.
29 I am not actually so confident that negation can take wide scope over imperatives: ‘Fred don’t open
the door!’/‘Fred open the door and Brent take out the trash!’/?‘Don’t (Fred open the door and Brent take
out the trash)!’.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
entailment
If β , . . . , βn , α are sentences of L , then β , . . . , βn α just if [[β ]], . . . , [[βn ]] [[α]].
discussion
It is easy to see that classical conjunction-introduction and -elimination, and negation-
and disjunction-elimination are valid, but negation- and disjunction-introduction are
invalid: L is, in effect, a projection into the syntactic of the procedure-contents of
intention, with an entailment relation isomorphizing content-entailment. (Example:
let [[A]] = {q, q}, q
and [[B]] = {r, r}, r
; then [[A ∨ B]] = {q, q} × {r, r}, q ∪ r
;
the partition component of [[A ∨ B]] is then {q ∩ r, q ∩ r, q ∩ r, q ∩ r} {q, q}, while
the proposition component is q ∪ r ⊇ q, so whenever q and r are orthogonal, neither
A A ∨ B nor A ∨ B A.)30
Stipulations for L
The imperatival language L extends L with subject–predicate structure and permits
Boolean connectives to take either wide (sentential) or narrow (predicate) scope.
This produces a ‘softer’ Ross-Paradoxical effect arising when disjunction takes narrow
scope.
My intent with this is to model the datum that imperative disjunction-introduction
can be coerced into sounding less objectionable. It takes some advance preparation.
First reflect on the excellence of the conjunction-elimination argument. Then high-
light that if something is a certain way, it is thereby any less specific way—so in partic-
ular, if someone undergoes a process of a certain type, they thereby undergo a process
of any more inclusive type. Then think of a disjunctive action-characterization as just
less specific than either disjunct action-characterization. And finally, coerce the dis-
junctiveness in the conclusion into the disjunctiveness of its action-characterization,
as follows: ‘Fred post this letter!—So: Fred (either-post-this-letter-or-take-out-the-
trash)!’. That sounds better than before, if not yet strikingly valid.31
The predicates of L pertain to action-kinds. Actions are sustained processes; an
action-kind groups together a bunch of processes under a certain respect of similarity.
In the modal spirit, a property of individuals is a set of ‘centered worlds’ aka world–
time–individual triples: red is the set containing w, t, j
just if, in w, at t, j is red. A
similarity-class of processes could then be thought to determine a set of ‘extended
centered worlds’ aka world–interval–individual triples: running around the block is
the set containing w, (t, t ), j
just if, in w, at t, j commences a run around the block
which concludes at t . But for the purposes of the speech act theory to be introduced
30 Starr’s otherwise congenial (if intricate) approach (Starr, , § .) unfortunately predicts that it is
always preferable to carry out both disjuncts (when compatible) to just carrying out one (Charlow, ,
pp. –).
31 Starr (, pp. –) maintains that disjunction takes scope either over or under the imperatival mood
operator !, and takes pains to secure the equivalence of the updates associated with both.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
for language L , it is only the onset of the action that will matter: the impact of our
imperative language will paper over matters of practical reasoning and the structuring
of actions for which this matters; the issue of interest will be setting each other to action
through issuing commands, for which it is only the onset that matters.32
syntax
Lexicon: (i) a stock of elemental action-predicates F, G, etc.; (ii) a stock of names a,
b, etc.; (iii) Boolean connectives ∧, ∨, ¬.
Composition: (i) if
, . . . ,
n are action-predicates and C is an n-place Boolean
connective, C(
, . . . ,
n ) is an action-predicate; (ii) if ν is a name and
is an
action-predicate, ν
is an elemental imperative sentence; (iii) if α , . . . , αn are
imperative sentences and C is an n-place Boolean connective, C(α , . . . , αn ) is an
imperative sentence.33
semantics
Ontology: Let W, T, and J be (respectively) the sets of worlds, times, and individuals.
Let A be the set of all action-kinds, where an action-kind determines a set of world–
time–individual triples, with which I will identify it: where K is an action-kind, I will
notate {w : w, t∗ , j∗
∈ K} as K(t ∗ , j∗ ).
Let a context c determine a moment of time tc .
Lexicon: (i) For an elemental action-predicate
, [[
]]c ∈ A; (ii) for a name ν, [[ν]]c ∈
J; (iii) [[∧]]c = ∩, [[∨]]c = ∪, [[¬]]c = λx x.
Composition: (i) [[C(
, . . . ,
n )]]c = [[C]]c ([[
]]c , . . . , [[
n ]]c ); (ii) [[ν
]]c =
?([[
]]c ([[ν]]c , tc )), [[
]]c ([[ν]]c , tc )
; (iii) [[C(α , . . . , αn )]]c = π([[α ]]c ) × · · · ×
π([[αn ]]c ), [[C]]c (p([[α ]]c ), . . . , p([[αn ]]c ))
.
entailment
If β , . . . , βn , α are sentences of L , then β , . . . , βn α just if, for all c, [[β ]]c , . . . ,
[[βn ]]c [[α]]c .
32 This actually raises an annoying side-issue: as the pragmatics for L is set up, commands occur in a
temporal sequence, and require immediate commencement. But they also put on display stable intentions
that are not in a position to be acted upon until speech acts occur. In natural language, that would be resolved
by tacit or explicit temporal adverbialization; or perhaps by looseness about what counts as the onset of
action (compare: ‘is there much to choose between “She is making tea” and [] “She is going to make tea”?
Obviously not’: Anscombe, , §).
33 For further details of the syntactic and semantic composition of imperatives, see Hellie (). In
eschewing an imperatival operator, I side with Portner (, , ); and I oppose Hare (, § .);
Kenny (); Stenius (); Chellas (); Lewis (b); Kaufmann (); Charlow (, ); and
Starr ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
discussion
The scope of negation does not matter: [[¬(ν
)]]c = [[ν(¬
)]]c . After all, [[¬(ν
)]]c =
?([[
]]c ([[ν]]c , tc )), [[¬]]c ([[
]]c ([[ν]]c , tc ))
= ?([[
]]c ([[ν]]c , tc )), [[
]]c ([[ν]]c , tc )
=
?([[¬]]c [[
]]c ([[ν]]c , tc )), [[¬]]c [[
]]c ([[ν]]c , tc )
= [[ν(¬
)]]c .
But the scope of disjunction does matter. Let p be the proposition that Fred opens
the door, q that he takes out the trash. The content of ‘Fred take out the trash!’, of
course, is ?q, q
. And the content of ‘Fred (open-the-door-or-take-out-the-trash)!’
is ?(p ∪ q), p ∪ q
, while that of ‘(Fred open the door) or (Fred take out the trash)!’ is
?p×?q, p ∪ q
.
The sentential-scope disjunction raises more issues than the disjunct: it offers
Fred the option of success by opening the door in addition to by taking out the
trash. Consequently, the partition component is stronger than that of the disjunct. By
contrast, the predicate-scope disjunction offers Fred only one option, that of taking
the disjunctive action: the resulting partition is neither stronger nor weaker than that
of the simple imperative.
Perhaps we reason about the relative strengths of natural-language imperatives α
and β by comparing the relative strengths of their partitions and of their propositions.
If so, we might perceive the predicate-scope Ross-inference as ‘more valid’ than the
sentential-scope: with predicate scope, the premiss proposition is stronger than the
conclusion, the partitions orthogonal; with sentential scope, the premiss proposition
is stronger, the conclusion partition is stronger. With sentential scope, respects of
strength are balanced between premiss and conclusion; with predicate scope, by either
measure, whenever their strength is comparable, the premiss is stronger than the
conclusion.
Command Pragmatics
Next, the language L assembles a command-‘language game’ around L -syntax/
semantics. Conversations in L are centered on the intentions held in a ‘general will’
of the assembled conversationalists—which I understand as to be a group mental state
in a fully robust sense (Rovane, ; Hellie, ), in many salient respects just like
individual intention.34 (Why? If canonical episodes of command require some sort of
‘group project’, it would be peculiar for non-Catholics to issue commands to the Pope
(compare Ninan, , § .); involving a collective mental state permits reusing struc-
tures for individual practical rationality. A more Lewis (b)/Portner ()-like
approach, on which commands merely constrain without rationalizing, must also find
some alternative explanation of the imperative Ross Paradox.) Conversation begins in
common ignorance of how the general will intentions obligate the conversationalists,
and moves toward total common recognition of the obligations of each; the acts of
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
command in L serve to distribute this common recognition (I set to the side the
epistemological question of how anyone knows what the general will intends; note also
that L stripped of its speech act theory can represent individual practical calculation).
In mechanical terms, L extends L with a statics and a dynamics: the statics
postulates contexts to record the general will intentions, the intentions of each con-
versationalist, and the extent of common knowledge of the distribution of general will
intentions;35 the dynamics sets the start- and end-states of the language game and the
effect of speech-acts on the state of play.36
In its postulation of psychologically-laden contexts, the statics of L permits a defi-
nition of endorsement: implicit acceptance of a sentence by a mental state (individual
or group), represented by positive evaluation at a context of the content of the sentence,
interpreted at that context.37 Armed with endorsement, entailment can be defined as
endorsement-preservation: any context endorsing all premisses endorses the conclu-
sion.38 Despite this, the entailment-relation of L does not differ extensionally from
that of L : that happens only in L , with the introduction of intensional operators to
shift the context at which a sentence is evaluated for endorsement.
The objective is to illustrate these points: how the contents of commands and of
intentions can be identified; how the meaning of a command can be obligative, in
the sense of binding its recipient to the course of action commanded; and, more
programmatically, what is required of the relations among content, mental states,
speech act meaning, and entailment.
Stipulations for L
statics
The central device of statics is the set C of all contexts. The approach uses a particular
context c ∈ C to represent a mental state relevant to language use. Those include
the mental states of individual language-users, of course, but also collective mental
states of groups of individuals gathered in conversation. A context representing an
individual I call solo; a context representing a conversational group I call social. (I
allow that contexts may represent not only ‘sincerely’ held mental states, but also
other states ‘treated as’ held—as for hypothesis, fiction, contingency planning, and
so forth: Stalnaker, , p. . I henceforth pass over this qualification without
acknowledgment.)
35 In this mentalistic conception of context, I side with Stalnaker (, ), Lewis (a), and Veltman
(, p. ), and oppose Kaplan () and Lewis (a); compare Hellie () and, to a considerable
extent, Yalcin ().
36 My story here is a spin on the familiar Stalnakean pragmatics (Stalnaker, , ).
37 In this ‘diagonal’ conception of entailment, I side with Kaplan () and against Stalnaker (),
Veltman (), and Yalcin ().
38 In thinking of entailment as endorsement-preservation, I side with Hare (), Veltman (), and
Yalcin (), and against Stalnaker () and Kaplan ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
A mental state needs to know whose it is, and when—matters of particular signifi-
cance for agency, because if I am uncertain whether I am BH in Toronto as I write this
or Julius Caesar just before his crossing the Rubicon, I will have no idea whether to
type or command my army (and I certainly have no idea how to split the difference!).39
Accordingly, I continue to let c determine its time tc . I also require c to determine a
nonempty set of individual conversationalists Vc ⊆ J: c is solo just if Vc is a singleton
set, otherwise social. For solo c, Vc is the singleton of the agent whose state of mind is
represented by c; for social c, Vc represents the members assembled in conversation as
represented by c.
In order to link imperative meaning to intentions, a context must bear a representa-
tion of the content of someone’s or some group’s intention. Accordingly, let c determine
a procedure c ∈ PR: for solo c with Vc = {j}, c is the procedural content of the
intentions of j at tc . For social c, c represents the obligations imposed on one another
by the ‘general will’ of the assembled members of the conversation (Castañeda, ,
pp. –).
I next insist that, for social c, for any j ∈ Vc , there is a solo context cj such that
Vcj = {j}. After all, j can only converse in an imperative language-game if it can place
itself under obligations (by having intentions, and thereby being representable by a
device cj representing j as itself and the content of its intentions as c j ); I impose the
requirement that cj be solo to exclude the bizarre prospect that a conversation is a
conversationalist in another conversation. And next, I require that tc = tc j : at any
time, it is the same time for everyone.
The purpose of assembling in L -conversation is to parcel out the general will. If
that is what we are doing, we need a sense for how much of it we have parceled out.
To this end, I require a social context c to determine a to-do list function tdc assigning
a procedure to each member of the conversation (tdc : Vc → PR):40 tdc represents
the common recognition among the conversationalists of how much of the general
will has been parceled out, and to whom. In order for tdc to parcel out the general
will, I require that it assign no procedure to anyone unless the general will intends that
procedure to be carried out by them (for j ∈ Vc , c tdc (j)). And in order for a to-do
list to reflect the parceling out of the general will—for it to be commonly recognized
who is under an obligation to do what—it needs to be that it accurately reflects what
the intentions of the assembled conversationalists actually are (c j tdc (j)).
entailment
Sentential-entailment in L is grounded in its statics: the relation at the definitional
basis L -entailment, (content)-endorsement, holds between a context c (a creature of
39 In requiring this ‘immunity to error through misidentification’ I side with Stalnaker () and against
Kaplan () and Lewis (a).
40 I borrow the ‘to-do list’ terminology from Portner (, , ) (my conception of these objects
adjusts Portner’s to meet with my apparatus and doctrine, as needed, and without comment). Portner
acknowledges Han () and Potts () as precursors; and of course (Charlow, , § .–) the to-do
list is a generalization of Lewis’s single-subject ‘sphere of permissability’ (Lewis, b).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
dynamics
The central device of dynamics is a set of events E, the set of all speech acts. I have
troubled to tease apart the speech acts from the contexts—the former are linguistic,
the latter mental—in order to emphasize the purification of sentential-entailment of
any linguistic componentry. In this respect, the approach contrasts with ‘dynamic
semantics’.
The dynamics pictures a temporally-ordered sequence of speech acts—a
conversation, conv = e , . . . , eω
—with each ei ∈ E. A speech act e determines a
moment of time t(e) at which it occurs: the temporal sequencing of conv means that
t(ei ) < t(ei+ ).
41 In this latter respect I side with Yalcin () and against ‘dynamists’ (Veltman, ; Starr, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
42 Speech acts therefore require contexts, and not the other way around. In this sense, contexts are
essentially psychological, and when associated with speech acts, are only accidentally so: compare Chalmers
() against ‘contextual two-dimensionalism’; contrast Stalnaker (, passim); Kaplan (); and Lewis
(a). This wiggle-room introduces the prospect that meanings of personal pronouns are grounded in
dynamic strategies for resolution of anaphora rather than static rules of interpretation.
43 Imperatives can be used for a laundry list of speech acts other than commands (Kaufmann, ;
compare Portner, , , Starr, , § ., Charlow, , § .). Because of space, I set this aside.
44 See fn. for a complication.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
45 Another use for this looseness is negotiation of the ‘general will’ (compare Lewis, c, example
on the collective negotiation of a plan): the constraint that c(e) = cc(e) requires merely that there is no
speech act essentially for adjusting the general will. That is compatible with the prospect of adjusting the
general will ‘between speech acts’.
46 Do I thereby abandon the ‘utility of the truth-conditional paradigm’ (Charlow, , p. )—
namely, the ‘standard’ notion of entailment as incompatibility of the truth-condition of the negation of the
conclusion with the truth-conditions of the premisses, and the ‘familiar’ conception of compositionality as
requiring the contribution of the truth-condition of an embedded sentence to the meaning of its embedding
sentence? I do not feel discomfitingly anxious. What is actually the ‘standard’ (Kaplan, ) notion of
entailment is truth of the conclusion at the index of the context whenever the premisses are true at the
index of the context; as noted above, endorsement does as well in this role as truth. And needless to say,
objects other than truth-conditions can embed.
47 Might truth still be lurking at the bottom, in the contents of intentions, by (as suggested in the early
sections) grounding success of intentions in truth of their contents? But with intelligibility in hand as a
norm, I suspect truth can be stripped even of that job, and content can be given a purely phenomenological
reduction. Further discussion must await another forum.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Obligative Modals
Extending L with modals requires not just new semantic clauses for modals, but an
extension of the pragmatics. Modal sentences are declarative; accordingly, sticking
with the endorsement-preservation conception of entailment requires a declarative
statics; moreover, the use of obligative ‘must’-claims to command is among the
explananda, requiring a declarative dynamics. The language L extends L with a
declarative pragmatics; L , at last, adds modals.
The move to L is easy, because the general form of a declarative pragmatics
is relatively clear: each asserts propositions they believe to enrich common belief.
Endorsement is belief; the ‘essential effect’ of assertion intersects the proposition
asserted with the ‘context set’. While the rigors of segregating statics and dynamics
call for a few subtle adjustments, the pragmatics is in the main orthodox.
The semantics for modals presented in L is less so. The founding datum of this
chapter, recall, is that one thinks one must do something just if one acts with the
intention of doing it. Or, expressed in the jargon, a solo context (sentence-)endorses
‘I must pay the rent.’ just if it (content-)endorses the procedure pay the rent.48
The orthodoxy says that modals are intensional operators, comparing their
operanda to aggregations of those things at which intensions are evaluated, what-
ever they may be. It says also that modals quantify over worlds, comparing their
operanda to aggregations of worlds. The orthodoxy can say both things because it
thinks intensions are evaluated at worlds. But I think entailment is endorsement-
preservation, intensions are evaluated at representations of mental states—what I have
been calling contexts—so I have to choose. (Revanchists attempting to avoid this choice
by retreating from endorsement-preservation must find some alternative approach to
the imperative Ross Paradox.)
One option is to break the contact between modals and intensions but preserve
contact between modals and worlds:49 on this approach, ‘I must pay the rent.’ is
sensitive to whether, in all worlds that have been somehow selected by the context,
I pay the rent. Selected how? Perhaps they are those where I do as I intend. Suppose I
intend to pay the rent; then in all worlds where I do as I intend, I pay the rent; then in
all worlds selected by the context, I pay the rent. And that is what it takes for ‘I must
pay the rent.’ to be ‘good’ (true, or acceptable, or what have you). Unfortunately, in
all worlds where I pay the rent, I either pay the rent or drink up all your wine. So if
I intend to pay the rent, ‘I must either pay the rent or drink up all your wine.’ is also
‘good’. So if I intend to pay the rent, I think I must either pay the rent or drink up all
your wine. But I don’t! The Ross Paradox works for obligative modals, too.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
The other option holds out more promise of a treatment of the Ross Paradox for
must: break the contact between modals and worlds but preserve contact between
modals and intensions. On this option, modals are intensional operators, comparing
their operanda to aggregations of contexts, looking for patterns of endorsement. What
makes a reading of ‘must’ obligative is its use to compare its procedural operand to
the intentions of the context at which it is interpreted; with the consequence that
a solo context endorses obligative ‘I must pay the rent.’ just if it endorses pay the
rent and ‘I must pay the rent or go drinking.’ just if it endorses pay the rent or go
drinking.
Stipulations for L
syntax
Lexicon: A stock of elemental declarative sentences P, Q, and so forth.
Composition: If φ , . . . , φn are declarative sentences and C is an n-place Boolean
connective, C(φ , . . . , φn ) is a declarative sentence.
semantics
Lexicon: For an elemental declarative sentence φ, [[φ]]c ⊆ W.
Composition: [[C(φ , . . . , φn )]]c = [[C]]c ([[φ ]]c , . . . , [[φn ]]c ).
statics
Let a context c ∈ C determine an information state, a nonempty set ic ⊆ W (compare
Stalnaker, , pp. –; also (inter alia) Veltman, ; Yalcin, ). For solo c,
ic represents the content of the beliefs of the subject c represents; for social c, ic
represents the common beliefs of the participants to the conversation c represents (the
strongest proposition each believes each believes . . . each believes). The nonemptiness
requirement on ic stipulates that no (nondefective) belief state can be inconsistent. A
union-set requirement on social contexts stipulates that nothing can count as common
belief unless it is severally believed: if c is social, then j∈Vc icj ⊆ ic .
entailment
To bring in declarative entailment, it suffices to add a ‘module’ for content-
endorsement of declarative content: namely, contextual endorsement of a proposition
is believing it (c p := ic p).
dynamics
As with L , a conversation in L is a temporally ordered sequence of speech acts
conv = e , . . . , eω
. Conversation progresses from complete common ignorance to
complete common belief in that which is not controversial (more formally: ic(e ) = W
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Stipulations for L
statics
If modals are intensional operators, they ‘quantify’ over whatever intensions are
evaluated at. If entailment is endorsement-preservation, intensions are evaluated at
contexts. So modals quantify over contexts. As on the orthodoxy, the quantification
is typically restricted. I represent the restriction with a set of contexts Cc determined
by c. As on the orthodoxy, restriction represents a limitation in which entities are ‘taken
seriously’ in c; those not meeting the restriction are ‘ignored’.
The orthodoxy typically imposes a requirement of realism (Kratzer, , p. ):
actuality is never ignored, always taken seriously. I impose a similar restriction: a
context never ignores itself, always takes itself seriously. Formally: c ∈ Cc .
syntax
Lexicon: Modal operators , .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
semantics
Lexicon: [[]]c = λF {w : (∀c ∈ Cc )(c F(c))}, [[]]c =
λF {w : (∀c ∈ Cc )(c F(c))}.
These clauses represent a modal as an intensional operator, comparing the Cc -
restricted set of contexts to the content of its operand: for , whether all members of
Cc endorse that content; for , whether not all members of Cc antiendorse the content. I
return to ‘antiendorsement’ in the wrap-up discussion. What it is for a modal sentence
to be ‘good’ is for it to be endorsed, as interpreted, however evaluated: as a declarative
sentence, that is for it to be noncontingent—trivial if true, absurd if false.50
Composition: [[Mσ ]]c = [[M]]c (λc [[σ ]]c ).
. Obligative necessity
Every context takes itself seriously: c ∈ Cc . But under reflexive restriction, a context
takes only itself seriously: Cc = {c}. Let and be unofficial symbols for and
under reflexive restriction: then [[]]c = λF {w : (∀c ∈ Cc = {c})(c F(c))} =
λF {w : c F(c)}; similarly, [[ ]]c = λF {w : c F(c)}.
Accordingly, [[σ ]]c = {w : c [[σ ]]c } = {w : c σ } = W just if c σ ,
otherwise ∅: from which it follows that c σ just if c σ , from which it follows
that σ σ and σ σ : any sentence is equivalent to its reflexively-restricted
necessitation (compare Yalcin, , p. ).
And accordingly, [[ σ ]]c = {w : c [[σ ]]c }. Now, as it happens, for imperative
σ = α, [[α]]c is undefined, and with it [[ α]]c : I return to this point in the concluding
remarks. But for declarative σ = φ, [[ φ]]c = {w : c ¬φ}. It follows that c ¬ φ
just if c ¬φ, from which it follows that ¬φ ¬ φ and ¬ φ ¬φ: the negation
of any declarative sentence is equivalent to the negation of its reflexively-restricted
possibilization (compare Yalcin, , pp. , ).
Because modal sentences are declaratives, they conform to disjunction-
introduction.51 In particular, α α ∨ β. But because σ and σ are equivalent,
the Ross-Paradoxical nonentailment α α ∨β results in a comparable nonentailment
α (α ∨ β). (Compare: I must post this letter, so either I must post this letter or
I must drink up all your wine—sounds fine to me.) Of course φ (φ ∨ ψ).
An assertion e of α can have a force resembling that of a command e of α.
How so? Let the imputed context mc(e) = c(e), the social context of the speech act
50 Because the completely ignorant belief state must still endorse a ‘good’ modal sentence, the content of
that sentence can be no more specific than the trivial proposition.
51 Not so disjunction-elimination. Read epistemically, gives this counterexample: φ φ, ¬φ
¬φ, φ ∨ ¬φ, but φ ∨ ¬φ.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
(while assertion is ordinarily imputed to the speaker, that is not a hard-and-fast rule:
in general, epistemic uses of ‘they might have already left’ to probe for compatibility
with common belief are imputed to the social context). Then, by stipulation, mc(e) =
c(e) α. That is so just if c(e) α. That is so just if c(e) [[α]]c .
Now, by stipulation, c(e ) σ (e ) = α as well. But command has the conventional
force of introducing [[α]]c to the to-do list of its addressee, while assertion has no such
conventional force. To bridge the gap, observe with Stalnaker (, pp. –) that
common belief includes commonly salient information, in addition to the content
so far asserted. Common extraction of the content of a speech act requires common
recognition of the act’s occurrence as an act with a certain conventional meaning. So
typically this common recognition regarding a speech act will be incorporated into its
prior context (recall that a trailing prior context need only be at least as strong as its
leading posterior context).
So if the assertion e of α has been accepted, it is commonly recognized that the
social context has accepted a speech act requiring c(e) [[α]]c and in that sense
commonly recognized that the general will endorses α. But that has the same effect
as putting this endorsement on display, the conventional force of command. The
objective of the conversation conv is to parcel out the content of (conv) among the
conversationalists by putting the general will on display; they should not care whether
that is done by a speech act with that conventional force or via this more scenic route.
on its remoteness
Suppose we let go this paperweight here. Fred says it must fall, Brent says it can/could
fail to fall. It is easy to access a reading on which they disagree, hard to access readings
on which they do not; it would be unnatural for Brent to rejoin %‘There are a lot of
things this object can/could fail to do that it must do—so what?’. Suppose we toss this
chemical into water. Fred says it can/could explode, Brent says it must fail to explode.
Again, it is easy to access a reading on which they disagree, hard to access readings on
which they do not; it would again be unnatural for Brent to rejoin %‘There are a lot
of things this chemical can/could do that it must fail to do—so what?’. Giving them
‘natural capacity’ readings, it is not easy to avoid coordinating must and can/could so
that they are dual to one another.
What to do about the rent? Brent says Rance must pay the rent, Fred says Rance
can/could fail to pay the rent. It is easy to access readings on which they do not
disagree; indeed, Brent quite sensibly rejoins ‘There are a lot of things Rance can/could
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
fail to do that he must do—so what?’. What to do about Brent’s wine? Brent says Rance
must not drink it all up, Fred says Rance can/could drink it all up. Again, it is easy
to access readings on which they do not disagree; again, Brent sensibly rejoins ‘There
are a lot of things Rance can/could do that he must not do—so what?’. Giving must its
obligative reading, it is easy to avoid coordinating can/could so that they are dual to
one another.52
Granting my core thesis, the obligative readings of can/could Fred hopes to access
would be reflexively restricted possibility-modalized imperatives: Brent says β, Fred
says ¬β; Brent says ¬α, Fred says α. In L , [[ ¬β]]c = {w : c [[¬β]]c };
[[ α]]c = {w : c [[α]]c }. What is the problem?
The problem is that, for a granulated proposition P, it is not straightforward to
define its complement P. Now, for a (protean) proposition p, its complement p is
the unique proposition for which the ⊆-maximally weak ‘trivial’ proposition is the
strongest proposition weaker than both p and p and the ⊆-maximally strong ‘absurd’
proposition ⊥ is the weakest proposition stronger than both p and p.
Is there a comparable definition of partition-complementation? In the set of par-
titions of W, the -maximally weak partition is the one-cell partition {W}, while
the -maximally strong partition is the partition into singletons {{w} : w ∈ W}.
But a partition does not generally have a unique complement—namely, given π ,
there is sometimes more than one π for which is the strongest partition weaker
than both and is the weakest partition stronger than both. Example: over the set
{A, B, C, A, B, C} let π be the ‘column’ partition {{A, A}, {B, B}, {C, C}}; let
ρ be the ‘row’ partition {{A, B, C}, {A, B, C}}; and let ρ be the ‘skew’ partition
{{A, C}, {B, C}, {A, B}}. Then each of ρ and ρ meets the conditions required of
a complement for π : for both of the pairs (π and ρ) and (π and ρ ), is the strongest
partition weaker than both and the weakest partition stronger than both.
If partitions cannot be complemented, what sense can be made of the complement
of a granulated proposition? We manage to determine [[¬α]]c 53 by complementing
the proposition-component of [[α]]c . So perhaps while [[ α]]c = {w : c [[α]]c }
is incomprehensible as {w : c π([[α]]c ), p([[α]]c )
}, we can make sense of it as
{w : c π([[α]]c ), p([[α]]c )
}. Still, the latter competes with the ‘natural capacity’
reading of ‘Rance can drink up Brent’s wine.’ as the modalized declarative (Rance
drinks up Brent’s wine). Because the proposition Rance drinks up Brent’s wine can be
complemented without digging under the hood, we ceteris paribus prefer the natural
52 Our parents would irritate us by answering ‘Can I be excused from the table?’ with ‘You can if you
may.’ . Our parents were correct that the issue was compatibility with a gradated standard of good manners.
We did not embrace those standards as our own, but instead felt them to have the force of natural bonds—
they imprisoned us at the table. Our irritation was at being not only imprisoned, but compelled to ritually
legitimize the imprisoning standards as a condition of freedom.
53 If we do: see note .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
capacity reading to the obligative reading. That is why it is easy to avoid coordinating
the obligative readings of must and can/could.
Acknowledgments
My great thanks to the editors, both for their sternness regarding matters of content and
their indulgence regarding matters of process, but most importantly for their wise, savvy, and
sympathetic guidance. Thanks also to Jessica Wilson, and to Luke Roelofs, Adam Murray,
Jonathan Payton, Sergio Tenenbaum, Andrew Sepielli, Bryan Reece, Peter Ludlow, Charlie
Cooper-Simpson, Chris Meyns, Maarten Steenhagen, and Steven Coyne.
References
Aloni, Maria () Free choice, modals, and imperatives. Natural Language Semantics. .
pp. –.
Anscombe, Gertrude Elizabeth Margaret () Intention. Cambridge, MA: Harvard University
Press.
Åqvist, Lennart () Choice-offering and alternative-presenting disjunctive commands. Anal-
ysis. . pp. –.
Austin, J. L. () How to talk: Some simple ways. Proceedings of the Aristotelian Society. .
pp. –.
Barker, Chris () Imperatives denote actions. In A. A. Guevara, A. Chernilovskaya, and
R. Nouwen (eds.) Proceedings of Sinn und Bedeutung . Volume I. pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Huber, Franz () The logic of theory assessment. Journal of Philosophical Logic. .
pp. –.
Humberstone, I. L. () Direction of fit. Mind. . pp. –.
Kamp, Georg () Logik und Deontik. Paderborn: Mentis.
Kamp, Hans () Free choice permission. Proceedings of the Aristotelian Society. .
pp. –.
Kaplan, David () Demonstratives. In Joseph Almog, John Perry, and Howard Wettstein
(eds.) Themes From Kaplan. () Oxford: Oxford University Press.
Kaufmann, Magdalena () Interpreting Imperatives. Berlin: Springer Science & Business
Media.
Kenny, A. J. () Practical inference. Analysis. . pp. –.
Kratzer, Angelika () What ‘must’ and ‘can’ must and can mean. Linguistics and Philosophy.
. pp. –.
Kratzer, Angelika () The notional category of modality. In H. J. Eikmeyer and H. Reiser
(eds.) Words, Worlds, and Contexts. Berlin and New York: de Gruyter.
Kratzer, Angelika () Modality. In Arnim von Stechow and Dieter Wunderlich (eds.) Hand-
buch Semantik/Handbook Semantics. Berlin: de Gruyter.
Lascarides, Alex and Asher, Nicholas () Imperatives in dialogue. Pragmatics and Beyond
New Series. . pp. –.
Lewis, David () General semantics. Synthese. . pp. –.
Lewis, David () Counterfactuals. Cambridge, MA: Harvard University Press.
Lewis, David () Adverbs of quantification. In Edward L. Keenan (ed.) Formal Semantics of
Natural Language. Cambridge: Cambridge University Press. Reprinted in Lewis, .
Lewis, David (a) Attitudes de dicto and de se. The Philosophical Review. . pp. –.
Reprinted in Lewis, .
Lewis, David (b) A problem about permission. In Essays in Honour of Jaakko Hintikka on the
Occasion of His Fiftieth Birthday on January , . pp. –. Dordrecht: Reidel. Reprinted
in Lewis, .
Lewis, David (c) Scorekeeping in a language game. Journal of Philosophical Logic. .
pp. –.
Lewis, David (a) Index, context, and content. In Stig Kanger and Sven Öhman (eds.)
Philosophy and Grammar. Dordrecht: Reidel. Reprinted in Lewis, .
Lewis, David (b) A subjectivist’s guide to objective chance. In Richard C. Jeffrey (ed.) Studies
in Inductive Logic and Probability. Volume II. Berkeley: University of California Press.
Lewis, David () Philosophical Papers. Volume I. Oxford: Oxford University Press.
Lewis, David () On the Plurality of Worlds. London: Blackwell.
Lewis, David () Papers in Philosophical Logic. Cambridge: Cambridge University Press.
Lewis, David (a) Papers in Metaphysics and Epistemology. Cambridge: Cambridge Univer-
sity Press.
Lewis, David (b) Why conditionalize? In Lewis, a. Handout from .
Lewis, David () Papers in Ethics and Social Philosophy. Cambridge: Cambridge University
Press.
Ninan, Dilip () Two puzzles about deontic necessity. MIT Working Papers in Linguistics. .
pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Parfit, Derek A. () Reasons and Persons. Oxford: Oxford University Press.
Parsons, Josh () Command and consequence. Philosophical Studies. . pp. –.
Portner, Paul () The semantics of imperatives within a theory of clause types. In R. B. Young
(ed.) Proceedings of SALT . pp. –.
Portner, Paul () Imperatives and modals. Natural Language Semantics. . pp. –.
Portner, Paul () Permission and choice. In G. Grewendorf and T. Zimmermann (eds.)
Discourse and Grammar. Berlin: Mouton de Gruyter.
Potts, Chris () Keeping world and will apart: A discourse-based semantics for imperatives.
Handout: Talk at NYU, October.
Rescher, Nicholas and Robison, John () Can one infer commands from commands?
Analysis. . pp. –.
Ross, Alf () Imperatives and logic. Philosophy of Science. . pp. –.
Rovane, Carol () Group agency and individualism. Erkenntnis. . pp. –.
Schroeder, Mark () Ought, agents, and actions. The Philosophical Review. . pp. –.
Searle, John R. () The intentionality of intention and action. Inquiry. . pp. –.
Setiya, Kieran () Reasons Without Rationalism. Princeton: Princeton University Press.
Sloman, Aaron () ‘Ought’ and ‘better’. Mind. . pp. –.
Smith, Michael () The Humean theory of motivation. Mind. . pp. –.
Spohn, Wolfgang () Ordinal conditional functions: A dynamic theory of epistemic states.
In William L. Harper and Bryan Skyrms (eds.) Causation in Decision, Belief Change, and
Statistics. Volume II. Dordrecht: Kluwer.
Stalnaker, Robert C. () Pragmatics. Synthese. . Reprinted in Stalnaker, .
Stalnaker, Robert C. () Indicative conditionals. Philosophia. . Reprinted in Stalnaker, .
Stalnaker, Robert C. () Assertion. Syntax and Semantics. . pp. –. Reprinted in
Stalnaker, .
Stalnaker, Robert C. () Inquiry. Cambridge, MA: MIT Press.
Stalnaker, Robert C. () On the representation of context. Journal of Logic, Language, and
Information. . Reprinted in Stalnaker, .
Stalnaker, Robert C. () Context and Content. Oxford: Oxford University Press.
Stalnaker, Robert C. () Our Knowledge of the Internal World. Oxford: Oxford University
Press.
Starr, William () A preference semantics for imperatives. Semantics and Pragmatics.
[Online] Available from: williamstarr.net/research/ a-preference_semantics-for-
imperatives.pdf. [Accessed Nov .]
Stenius, Erik () Mood and language-game. Synthese. . pp. –.
Thompson, Michael () Naive action theory. In Life and Action. Cambridge, MA: Harvard
University Press.
Velleman, David () The Possibility of Practical Reason. Oxford: Oxford University Press.
Veltman, Frank () Defaults in update semantics. Journal of Philosophical Logic. .
pp. –.
Veltman, Frank () Or else, what? Imperatives on the borderline of semantics and
pragmatics. Lego Seminar Presentation, Institute for Logic, Language, and Computation.
November. [Online] Available from: https://staff.fnwi.uva.nl/u.endriss/teaching/lolaco/
/slides/veltman.pdf [Accessed Nov .]
Vranas, Peter () New foundations for imperative logic I: Logical connectives, consistency,
and quantifiers. Noûs. . pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Wedgwood, Ralph () The aim of belief. Philosophical Perspectives. . pp. –.
Willer, Malte () Modality in Flux. Ph.D. thesis. University of Texas at Austin.
Williams, Bernard () Imperative inference. Analysis. . pp. –.
Yalcin, Seth () Epistemic modals. Mind. . pp. –.
Yli-Vakkuri, Juhani () Comments on Hellie, ‘Actuality and triviality in mindset semantics’ .
Personal communication, May.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Dynamic Foundations
for Deontic Logic
Malte Willer
The Plot
Several authors—most notably Horty (, , , , ) but also Asher and
Bonevac (, ), Belzer (), Bonevac (), McCarty (), Nute (), and
Ryu and Lee ()—have observed that the tools and techniques developed within the
field of nonmonotonic logic provide a fruitful framework for the theoretical study of
deontic discourse and reasoning.1 The classical motivation for this approach is the idea
that certain obligations are only prima facie obligations and may thus be overridden
by other considerations (see Ross, ). Here is a familiar case. Jones has promised
to meet Mary for lunch and so ought to meet her for lunch. But it may very well be
that Jones finds himself in a situation in which he would need to miss lunch to save
a drowning child, and then he is permitted, arguably even ought, to miss lunch and
instead save the drowning child. Such situations, while perfectly ordinary, pose trouble
for the standard monotonic approach to deontic reasoning.2
To see what the issue is, assume that the logical consequence relation is monotonic
in the following sense:
Monotonicity: If φ , . . . , φn ψ, then φ , . . . , φn , φn+ ψ
The initial observation is that ()–() are consistent:
() If Jones has promised to meet Mary for lunch, he ought to meet Mary for lunch.
() If a drowning child needs Jones’s help, then he ought to help the drowning
child.
1 Some classical frameworks in the nonmonotonic tradition: model-preference theories (beginning
with McCarthy’s () theory of circumscription); fixed-point theories (see, e.g., McDermott and Doyle,
and Reiter, ); logics for argument-based defeasible reasoning (see Nute, , Pollock, , and
Touretzky, for seminal discussions).
2 Classical deontic logic goes back to the foundational work in von Wright (, ). See Åqvist ()
and McNamara () for overviews.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
3 Charlow (a) argues that deontic conditionals such as “If you want to go to Harlem, you ought to
take the A-train” exhibit a resistance to antecedent strengthening that is grounded in the goal sensitivity of
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
their consequents and cannot be captured by the classical analysis of modals and conditionals. I set such
conditionals aside but everything I am about to say here is sympathetic to Charlow’s discussion.
4 An exception is the proposal by van der Torre and Tan (), which resembles mine because of its
dynamic spirit but differs substantially in motivation, execution, and scope.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Then it may very well be that Jones ought not tell his neighbors that he is coming:
leaving them hanging is already bad enough, but creating false expectations in addition
to that would make things even worse.5 Once again it looks as if strengthening a set
of premises with an extra bit of information blocks an inference that the original set
licenses.
Earlier I said that there is a striking parallel between prima facie obligations and
defeasible generalizations: the former may be overridden while the latter may be
defeated. What we have seen just now is that there is also a striking parallel between
exceptional circumstances and violations of obligations: both may transform a duty
such as keeping a promise or telling one’s neighbors that one is coming into a wrong
action. The obvious thing to say then is that reasoning with violable obligations is
just as susceptible to monotonicity failures as is reasoning with obligations that allow
for exceptions. But while the lessons from default reasoning may very well help us
understand why reasoning with prima facie obligations exhibits monotonicity failures,
they do not explain why reasoning with violable obligations should exhibit the same
phenomenon: it is, after all, a truism that violations of obligations are not exceptional
circumstances in which the obligation is no longer binding (Prakken and Sergot, ,
). In particular, a situation in which Jones does not go to the aid of his neighbors
is not an exceptional circumstance that relieves him from the duty articulated by
()—his obligation to go—and it cannot be an exception to the conditional obligation
articulated by () either, since the latter does not even pertain to situations in which
Jones does not go. Reasoning with violable obligations has a nonmonotonic flavor
alright, but insights from the logic of reasoning with exceptions do not immediately
illuminate why this should be so.
Reasoning with obligations under epistemic uncertainty arguably has a nonmono-
tonic flavor as well, though the case is slightly more involved. Consider the miners
paradox from Kolodny and MacFarlane (). Ten miners are trapped either in shaft
A or in shaft B, but we do not know which one. Water threatens to flood the shafts. We
have enough sandbags to only block one shaft but not both. If one shaft is blocked, all
of the water will go into the other shaft, killing every miner inside. If we block neither
shaft, both will be partially flooded, killing one miner.
5 The intended interpretation of deontic ought here is the deliberative one in the sense of Thomason
(): the one that figures prominently in advice and takes the facts as given. The claim that deontic ought
thus interpreted is subject to monotonicity failures—the advice we give to Jones is not preserved under
information strengthening—is compatible with the intuition that it also has interpretations less sensitive to
what is taken for granted.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
In this scenario, () seems to be the right thing to say since all we know is that () is
true. However, we are also willing to accept () and ():
() We ought to block neither shaft.
() Either the miners are in shaft A or they are in shaft B.
() If the miners are in shaft A, we ought to block shaft A.
() If the miners are in shaft B, we ought to block shaft B.
()–() seem consistent. But let be the set of premises consisting of ()–() and
observe that () intuitively entails both () and ():
() It is not the case that we ought to block shaft A.
() It is not the case that we ought to block shaft B.
() and () just negate the consequents of the conditionals in () and (), respec-
tively, and so given monotonicity and modus ponens, ∪ {inA} ⊥ and hence
¬inA by reductio. For parallel reasons, ¬inB. Hence is inconsistent after
all since it entails the negation of ().
The paradoxical argument just given appeals to a monotonic conception of logical
consequence—specifically, it assumes that if entails (), then so do ∪ {inA} and
∪ {inB}—and so one might hope for a nonmonotonic escape route. But reasoning
with obligations under epistemic uncertainty is not another instance of reasoning with
obligations that allow for exceptions. For suppose that the miners’ being in shaft A is
just a case in which the obligation to block neither shaft is no longer in force: then so
would be a case in which the miners are in shaft B, and since the miners are in one
of the two shafts we know that we are facing an exceptional situation and thus the
obligation to block neither shaft should not hold in the first place.
One reaction to the issues I have just outlined is to say that nonmonotonicity in
deontic reasoning is indeed limited to cases in which prima facie obligations are in
play: Chisholm’s scenario and the miners paradox highlight other problems with the
classical conception of deontic logic.6 But another reaction—the one I will pursue
here—is to say that nonmonotonicity in deontic reasoning is a pervasive phenomenon
that does not reduce to reasoning with rules allowing for exceptions.7 This reaction
6 Rejecting deontic detachment blocks Chisholm’s paradox but the prominent proposals by Lewis ()
and Kratzer (, ) preserve the validity of this rule (though they deny modus ponens for deontic
conditionals, which is sometimes labeled “factual detachment”). Appealing to tense is another option but
does not help with “timeless” Chisholm scenarios (see Carmo and Jones, and McNamara, and
references therein). Rejecting modus ponens is Kolodny and MacFarlane’s preferred solution to their miners
paradox, but Willer () argues that this is an overreaction and also says a bit about Bledin’s ()
and Yalcin’s () suggestion to respond to the paradox by rejecting reductio (see also § for additional
remarks). Silk () critically discusses Dowell’s () and von Fintel’s () suggestion that the deontic
modal ought receives different interpretations in () and ()/(), respectively.
7 See also the discussion by van der Torre and Tan (), who argue that the violability of obligations
should be understood as just such a kind of defeasibility, though they focus on a quite narrow conception
of defeasibility as failure of antecedent strengthening for deontic conditionals.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
is appealing for reasons other than theoretical simplicity. The key conceptual driver
behind nonmonotonic logic is the idea that the validity of an inference is not only
sensitive to the presence of information—as in classical logic—but also to the absence
of information, and it does not take much to apply this idea to the cases just dis-
cussed. Prima facie obligations are binding unless there are exceptional circumstances,
violable obligations may entail other duties unless they are violated, and obligations
under epistemic uncertainty may no longer hold in case the underlying uncertainty
is resolved by additional information. In brief, while the cases discussed are at some
level of description quite disparate, there is more than a merely initial appeal to the idea
that they also have something important in common that a nonmonotonic analysis of
deontic logic is ideally suited to capture.
What is needed, then, is a comprehensive perspective on deontic discourse and
reasoning that captures the variety of ways in which deontic inference is sensitive
to the absence of information. Such a perspective, I maintain, arises in a dynamic
semantic analysis of ifs and oughts that, while being non-classical, incorporates ideas
from the familiar linguistics literature by analysing deontic modals and conditionals
using possible worlds and in light of a context determining what is deontically ideal.8
The resulting framework is flexible enough to allow for the interpretation of deontic
ought in light of contextually provided norms that may be overridable, violable, or
information-sensitive. While the potential overridability, violability, and information-
sensitivity of norms are distinct features that are captured by different aspects of the
formal apparatus, the semantic framework also highlights a common denominator:
they all induce deontic commitments that carry in their wake distinct epistemic
commitments and since the latter are defeasible, so are the former. More precisely, in
the framework I propose the previously observed sensitivity of deontic inference to the
absence of information uniformly manifests in entailment relations between deontic
ought and epistemic might. Once we have explained why epistemic might allows for
monotonicity failures—a fact that finds a natural explanation in dynamic semantics—
we also know why deontic ought does so as well.
My strategy is straightforward. § briefly outlines a basic dynamic semantics for
epistemic modals and conditionals, highlights its nonmonotonic outlook on logical
consequence, and briefly explains how this outlook may serve as a comprehensive
foundation for a nonmonotonic perspective on deontic discourse and reasoning. §
offers a simple dynamic semantics for deontic ought that makes sense of reasoning
with prima facie obligations and with obligations under epistemic uncertainty by
highlighting its sensitivity to epistemic possibilities. § extends the simple proposal
to avoid a few shortcomings when it comes to the analysis of contrary-to-duty
8 See Lewis () for an overview of early studies of deontic logic within possible world frameworks
(including his own proposal in Lewis, ). In the linguistics literature, the work by Kratzer (see Kratzer,
and references therein) is seminal.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
obligations.9 § concludes the discussion by briefly comparing the story told here with
alternative views that treat nonmonotonic effects as a topic for belief revision theory
rather than semantics.
Basics
A basic dynamic story about conditionals and epistemic modals has been told before
(see Veltman, and Gillies, for seminal discussion), and so I will just briefly
highlight its key ideas and explain why they matter for current purposes. Semantic
values are stated in terms of context change potentials (CCPs): functions that take
an information state as input and return another state—the result of updating the
input state with the sentence under consideration—as output.10 Thinking of semantic
values in such a way owes some inspiration to the classical story about assertion
from Stalnaker (), who observes that context-content interaction is not a one-
way road: context affects what proposition an assertion expresses, but an assertion in
turn affects the context by adding the proposition expressed to the common ground.
In Stalnaker’s picture the latter effect is a pragmatic afterthought to classical truth-
conditional semantics, but here the facts about context-content interaction are integral
to how we model semantic values.
Thinking of semantic values dynamically as CCPs motivates an equally dynamic
conception of logical consequence as guaranteed preservation of rational commit-
ment. On this view, an argument is valid just in case its conclusion is accepted by
any carrier of information σ under the supposition of its premises:
φ , . . . , φn ψ just in case for all σ : σ [φ ] . . . [φn ] ψ
Here σ [φ] is the result of strengthening σ with the information carried by φ, and to
say that σ φ is to say that σ is rationally committed to φ. The question whether
logical consequence is nonmonotonic now hinges on the question whether additional
premises in discourse and reasoning are guaranteed to preserve existing rational com-
mitments and, what is more, considerations about epistemic might strongly suggest
that information agglomeration fails to be commitment-preserving in such a way: the
9 Nothing I am about to say here by itself enforces the validity of deontic detachment (which, recall,
figures prominently in Chisholm’s paradox about contrary-to-duty obligations) but the upcoming story can
be modified so that it does. Motivating the required twists and turns in detail and explaining why they do
the trick goes beyond the scope of this paper—in brief, what is needed are some plausible restrictions on the
notion of a deontic context (§.) and some minimal modifications of the notion of retraction (§)—and so I
will focus here on the (no less important) task of telling a story about Chisholm’s case that is compatible with
the assumption that deontic detachment is valid and explains the data from a nonmonotonic perspective.
See Willer () for a detailed discussion of deontic detachment that sits happily with everything said in
this paper but does not mention many of the issues that matter here.
10 Classical dynamic frameworks include Discourse Representation Theory (see Kamp et al., for an
up-to-date discussion), Dynamic Predicate Logic (Groenendijk and Stokhof, ), File Change Semantics
(Heim, ), and Update Semantics (Veltman, , ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
belief that, say, Mary might be in Chicago needs to go once one learns that Mary is not
in Chicago (see Levi, , Fuhrmann, , and Rott, for seminal discussion).
More generally:
σ Might φ −→
σ [¬φ] Might φ
A state may be committed to Might φ but fail to be thus committed if strengthened
with ¬φ. And that already leads to monotonicity failures given the following
minimal assumptions:
Acceptance: For all σ : σ φ just in case σ [φ] = σ
Epistemic Success: For all σ : σ [Might φ] Might φ
The first principle says that φ is accepted in σ just in case σ already carries the
information carried by φ. The second principle requires that the result of updating
some carrier of information with an epistemic modal claim accepts that claim. It
follows immediately that Might φ Might φ for all φ, and we also know that for
some σ and φ, σ Might φ yet σ [¬φ] Might φ and thus σ [Might φ][¬φ]
Might φ. Accordingly, Might φ Might φ yet Might φ, ¬φ Might φ and thus
the proposed notion of logical consequence is nonmonotonic.
To make all of this more precise, define the initial target of our semantic analysis as
a simple propositional language extended with the conditional connective if (⇒) and
epistemic might (e ):
Definition (Basic Language). L is the smallest set that contains a set of atomic
sentences A = {p, q, r, . . .} and is closed under negation (¬), conjunction (∧),
epistemic might (e ), and the conditional connective if (⇒). Inclusive and exclusive
˙ the material conditional (⊃), the biconditional (≡), and epistemic
disjunction (∨, ∨),
must (e ) are defined in the usual way. L is defined as the propositional fragment
of L.
Semantic values take a state as input and return one as output. Such carriers are
sets of indices consisting of a possible world coupled with a frame (more on frames
momentarily):
Definition (Information States). σ is an information state iff σ ⊆ I, where I is the set
of all indices. wi is the possible world parameter of some index i, where w is a possible
world iff w : A → {, } and W is the set of all possible worlds. is the set of all
information states. σ is the initial information state and identical with I.
Information states are sets of indices, and to avoid the need for redefinitions at a later
stage we will not simply identify an index with a plain possible world, though this
would be good enough if we were interested only in the semantics for L.
Recursively define the update rules for elements of L as follows:
Definition (Basic Update Rules). Associate with each φ ∈ L a context change rule
[φ] : → as follows:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
defeater just in case updating σ with ψ results in such a state. An argument is valid
just in case any information state that is updated with its premises is committed to its
conclusion.
It will be helpful to highlight three facts about the outlined framework, the first one
being that modus ponens is valid:
Fact φ ⇒ ψ, φ ψ
Take any σ and consider σ [φ ⇒ ψ][φ]: if σ [φ ⇒ ψ] = ∅, the conclusion is accepted
for trivial reasons. Otherwise, σ [φ ⇒ ψ] = σ and σ [φ] ψ, which just proves the
point.
Secondly, observe that the dynamic logical consequence relation is nonmonotonic:
Fact For all contingent φ ∈ L : e φ e φ and e φ, ¬φ e φ.
The reason is familiar: might-commitments are not guaranteed to be preserved as new
information is acquired, and in particular a commitment to e p fails to be preserved
by the information that p is not the case.
The third observation of relevance is that the defeasibility of epistemic might may
enforce the defeasibility of other rational commitments:
Fact Consider any σ ∈ such that σ φ and suppose that φ ψ: if χ defeats
a commitment to ψ in σ , then χ defeats a commitment to φ in σ .
This is just to say that whenever a commitment to φ brings in its wake a commitment
to ψ, then the commitment to φ is defeasible in case the commitment to ψ is. And that
is important since there is every reason to think that commitments to deontic ought
frequently bring in their wake a commitment to epistemic might: the defeasibility of
deontic commitments can be illuminated in terms of the by now established dynamic
defeasibility to certain epistemic commitments. Let us go through the details.
Simple Oughts
This section presents a simple dynamic framework that makes sense of reasoning
with prima facie obligations but is also flexible enough to make sense of some
other nonmonotonic effects in deontic reasoning. Doing so requires that we offer a
dynamic analysis of the deontic modal ought (d ) as well as of a defeasible conditional
connective (>) that will be used in stating prima facie obligations. The extended target
language is defined as follows:
Definition (Extended Language). Define a language L so that L ⊆ L and
whenever φ, ψ ∈ L , then d φ ∈ L and also φ > ψ ∈ L . L+ is the smallest
set that contains all elements of L and is closed under negation (¬), conjunction (∧),
epistemic might (e ), and the conditional connective if (⇒). Other connectives are
again defined as usual.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Then π(W) = {w , w } and π([[east]]) = {w }. Since w violates the expectations that
come with an easterly wind—w ∈ [[east]] yet w ∈ / π([[east]])—it does not count as
a normal world. In contrast, w does not violate any of our expectations—notice that
w ∈/ [[east]] and so even though it rains at w , no expectation is violated—and so
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
N π(W) = {w }: normally, it rains and there is no easterly wind. But assuming that
there is an easterly wind, we expect no rain since N π([[east]]) = {w }.
We may now make the notion of an index precise and determine the update rule for
the defeasible conditional connective:
Definition (Indices). i is an index iff i ∈ W × , that is, an index is a pair consisting
of a possible world w and a coherent frame π . πi is the frame parameter of i.
Definition (Defeasible Conditional Connectives). Extend the previous update
rules for L and deontic ought with the following entry:
() σ [φ > ψ] = {i ∈ σ : πi ([[φ]]) ⊆ [[ψ]]}
An information state keeps track of the hard information modeled by a set of possible
worlds but also of what is to be expected, which is represented by a set of coherent
frames. A formula of the form φ > ψ eliminates from an information carrier all
those indices whose frame parameters fail to treat [[ψ]] as a [[φ]]-default.
Veltman () offers a model for how to reason with defeasible generalizations in
dynamic semantics, but for our purposes there is no need to repeat his story here.
I am first and foremost interested in the role of prima facie obligations for deontic
discourse and reasoning, and for this I need to say precisely what is expected by a
carrier of information. As a preparation, observe that we can associate with each such
carrier a unique scenario and frame as follows:
Definition (Depicted Scenarios and Frames). Consider arbitrary σ ∈ :
. The scenario depicted by σ is defined as sσ = {wi : i ∈ σ }.
. The frame depicted by σ is defined so that [[φ]] is a default in πσ (s) iff for all i ∈ σ :
[[φ]] is a default in πi (s).
The scenario depicted by σ is simply the set of possible worlds compatible with the
hard information carried by σ . And σ treats [[ψ]] as a [[φ]]-default just in case σ accepts
the defeasible generalization articulated by φ > ψ.
Suppose then that one accepts a set of default rules. The simple intuition is that
a default rule should play a role in forming expectations just in case the rule is
triggered—one’s information entails the premise of the default rule—and at the same
time undefeated—one’s information does not depict an exceptional scenario in which
the rule is no longer binding. The only wrinkle we have to add to the story is that
default rules may sometimes lead to conflicting expectations, and so we must define
what it takes for a set of default rules to be unconflicted in expectation forming.11 The
11 My story about how to form expectations on the basis of accepted defaults owes inspiration to
Veltman’s () and Horty’s () accounts. But the difference between the upcoming proposal and theirs
are substantial, not least because it offers a possible-world analysis of the defeasible conditional connective
(thus differing from Horty’s) and ties the applicability of a default rule to a triggering condition (thus
differing from Veltman’s).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
key notions involved in this characterization of how expectations are formed on the
basis of accepted default rules can be made more precise as follows:
Definition (Triggers, Defeats, Conflicts). Consider arbitrary π ∈ , σ ∈ , and
say that a world w complies with a [[φ]]-default [[ψ]] iff w ∈
/ [[φ ∧ ¬ψ]]:
. A [[φ]]-default [[ψ]] is triggered in σ iff σ φ.
. A [[φ]]-default [[ψ]] is undefeated in σ iff for every s such that sσ ⊆ s: there is some
w ∈ N π(s) that complies with the [[φ]]-default [[ψ]].
. A set of defaults is unconflicted in σ iff for every s such that sσ ⊆ s: there is some
w ∈ N π(s) that complies with each of its members.
To say that a world complies with a default rule is just to say that it does not violate
the expectation articulated by that rule. For a default rule to be triggered in σ , its
condition must be entailed by the scenario depicted by σ . But the rule is defeated in
σ if another, more specific rule is triggered that conflicts with it, and a set of default
rules is conflicted in σ just in case no normal world in the scenario depicted complies
with each of its members. Note here that whenever a single default rule is defeated in
σ , no set of rules containing it can be unconflicted in σ .
To give a very simple example of the notion of defeat, go back to the case in which
we believe that it normally rains but if there is an easterly wind, the weather is normally
dry: [[Rain]] is W-default but [[¬Rain]] is an [[East]]-default. Then intuitively a
scenario in which the wind comes from the east is an exception to the rule that it
normally rains, and this is just what the framework predicts. For suppose σ accepts that
the wind comes from the east: then sσ ⊆ [[East]] and of course every normal [[East]]-
world is one at which the weather is dry. So no normal [[East]]-world complies with
the W-default [[Rain]], which is just to say that the default rule is defeated in σ .
To illustrate the possibility of conflicting expectations, and to go through an
example that will be of relevance in the upcoming discussion, start with the initial
information carrier σ and consider the state σ = σ [Promise > Meet][Need >
Help][Promise ∧ Need], with the atomic sentences being translated as follows:
promise: Jones has promised to meet Mary for lunch.
meet: Jones will meet Mary for lunch.
need: The drowning child needs Jones’s help.
help: Jones will help the drowning child.
Then clearly σ accepts that Jones will, all things being equal, meet Mary for lunch
if he promised to do so and that he will, all things being equal, help a child if
it is in need of his help. Not surprisingly, both default rules are triggered and
undefeated in the scenario we consider—that Jones has promised to meet Mary
for lunch and that there is a drowning child in need of Jones’s help. The worlds
compliant with both rules are just those at which Jones keeps his promise and saves
the drowning child.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Things become more interesting if we assume that Jones cannot do both, which I will
do here by considering τ = σ [meet ∨ ˙ help].12 Both default rules remain triggered and
undefeated—the scenario depicted by τ is not an exception to any of those rules—but
they conflict with each other: no world in the scenario under consideration complies
with both expectations. For at any such world, Jones does not meet Mary for lunch
even though he promised to do so—thus violating the expectation that he will keep
his promise—or fails to help the child even though it is drowning—thus violating the
expectation that he helps the child in need. It makes sense to expect that Jones will at
least keep his promise or save the drowning child (rather than doing neither) but there
is, without additional assumptions about Jones’s character anyway, no reason to favor
one default rule over the other in thinking about what he is going to do in the scenario
under consideration.
Now suppose we want to say that in Jones’s case, the default rules are not of
equal strength: Jones will not let the child in need drown just to keep his promise
to meet Mary for lunch. One way to go is to appeal to a priority relation between
default rules (see Horty, for recent discussion) but here I exploit the already
existing feature that more specific rules trump less specific ones in case of a conflict.
For consider χ = τ [((promise ∧ need) ∧ (meet ∨ ˙ help)) > help], that is, the
result of strengthening τ with the additional default rule that Jones will help the child
in need if faced with the unhappy choice between doing so and keeping his promise.
This update would have no interesting effect if it were epistemically possible for Jones
to both keep his appointment and help the drowning child, but it makes a big difference
in the case under consideration.
The crucial observation about χ is that it differs from τ in treating a scenario in
which Jones must choose as an exception to the rule that, all things being equal, he
will meet Mary for lunch if he has promised to do so. To see this, simply observe that
a world now counts as normal in πχ (([[promise ∧ need) ∧ (meet ∨ ˙ help)]]) only if
John helps the drowning child—and thus breaks his promise to meet Mary for lunch—
at that world. No such world can thus comply with the [[promise]]-default [[meet]],
which just means that the default rule is defeated in the scenario under consideration.
Since the other default rules remain triggered and undefeated, we expect that Jones
will help the drowning child.
Much more could be said about how we form expectations in everyday discourse
and reasoning, but the emerging picture is good enough for our purposes. What
matters here is that the role of a default rule in expectation forming is sensitive to
the presence or absence of global features of an information state that is not preserved
under the process of strengthening. Accordingly, expectations may be defeated as new
12 This is simplifying a bit: it is desirable to distinguish the claim that John makes a choice between
keeping his promise and saving the child—which is what “meet ∨ ˙ help” literally says—from the stronger
one that Jones must make this choice. Here and throughout it will do no harm to avoid additional
complications and simply assume that Jones makes the choice only if he cannot do both.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
information comes into view, and the following definition makes it easier to see why
this is so.
Definition (Expectations). Consider any σ ⊆ depicting a frame π :
. i ∈ σo iff i ∈ σ and wi complies with a maximal set of π -defaults such that (i) the
set is unconflicted in σ and (ii) each member is triggered in σ .
. i ∈ σon+ iff i ∈ σon and wi complies with a maximal set of π -defaults such that
(i) the set is unconflicted in σon and (ii) each member is triggered in σon .
. The set of optimal indices in σ is defined as σo = n≥ σon .
. σ expects φ, σ φ, iff σo φ.
To say that φ is expected in σ is to say that φ is accepted by the state got by focusing on
the optimal indices in σ . The optimal indices are defined in a stepwise fashion: first,
determine σo , that is, the indices whose world-parameter complies with any maximal
unconflicted set of triggered default rules in σ . Then determine σo , that is, the indices
in σo that come with a world-parameter complying with any maximal unconflicted
set of triggered default rules in σo , and so on. The set of optimal indices—defined as
n≥ σo —is just the state of information that the deliberating agent will arrive at after
n
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
σ : ∃i ∈ σ. wi = wi } and i <σd i iff i, i ∈ σ and (i) for all [φ] ∈ od : if i ∈ σ [φ],
then i ∈ σ [φ] and (ii) for some [φ] ∈ od : i ∈ σ [φ] and i ∈ / σ [φ]. i is minimal in σ
given d iff ¬∃i : i <σd i. σd is then just the set of indices that are minimal in σ given d.13
Whenever the ordering source exclusively consists of elements of L , we may just as
well have propositions fix what is deontically ideal. The current framework is more
flexible, however, and in particular we will put the possibility of stating ordering
sources using epistemic modals and defeasible conditionals to good use. Let me
explain.
Start with Jones, who has promised to meet Mary for lunch but also stumbles upon a
drowning child. Suppose that the deontic context fixes the following ordering source:
˙
od = {[promise > meet], [need > help], [((promise ∧ need) ∧ (meet∨
help)) > help]}
Here the first member of od articulates the rule that Jones’s promise to meet Mary for
lunch creates the prima facie obligation to meet her for lunch, while the second says
that a child’s need for help creates the prima facie obligation to help that child. The
third rule effectively resolves a potential conflict between the first and second prima
facie obligation in favor of helping the child in need.
Take any σ ∈ and let π = πσd : then π([[promise]]) = [[promise ∧ meet]],
π([[need]]) = [[need ∧ help]], π([[(promise ∧ need) ∧ (meet ∨ ˙ help)]]) =
[[(promise ∧ need) ∧ (¬meet ∧ help)]], and π(s) = s for all other scenarios. The
good news is that we already know what it takes for it to be the case that σd meet
from the earlier discussion in §.: of course, it is required that σ promise,
but for the [[promise]]-default [[meet]] to apply in σd , it must also be the case that
σ need ∧ ¬(meet ∧ help)—otherwise, the [[promise]]-default [[meet]] is defeated.
And that is just to say that a commitment to () needs to go in light of the additional
information that (i) there is a drowning child in need of Jones’s help and (ii) Jones
cannot help that child and keep his lunch appointment. It follows immediately that ()
and () entail () but no longer do so if strengthened by the information carried by
(). Here is a summary of the predictions:
Fact Consider d as fixed for Jones’s scenario:
. promise ⇒ d meet, promise d meet
. promise ⇒ d meet e (¬need ∨ (meet ∧ help))
. ˙ help) promise ⇒ d meet
promise ⇒ d meet, promise, need ∧ (meet ∨
. ˙ help) d meet
promise ⇒ d meet, promise, need ∧ (meet ∨
The first observation follows from the general validity of modus ponens while the
second is a direct consequence of the applicability conditions for defeasible general-
13 A disclaimer with a familiar ring: analysing deontic ought as a quantifier over a set of possible worlds
that are minimal in σ given d is technically convenient but requires that there is always such a set. The
classical analogue is the Limit Assumption (Lewis, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
izations together with the semantics for ifs and oughts. It follows from the defeasibility
of epistemic might and its connection with deontic ought that the information carried
by () defeats a commitment to (), and this yields the final observation together with
the semantics of the Ramsey conditional. It is easy to verify that Jones’s obligation to
help the drowning child, while induced by a norm that is in principle defeasible, is not
affected by ().14
I am about to conclude that the framework developed so far offers a straightforward
treatment of prima facie obligations: such obligations may be overridden, and this
is not surprising since they stem from deontic rules that are articulated using the
defeasible conditional connective. But one may wonder whether the story told here
really captures Ross’s () idea that prima facie obligations, even if overridden,
continue to count in favor of some action rather than lose their status as a reason
for action altogether. The answer, I think, depends on what it means to accept the
antecedent of a defeasible norm (that is, to be in a state that triggers the default norm).
We are free to suggest that doing so provides a reason for action, albeit one that may be
outweighed by other reasons. Since the state of accepting the antecedent of a defeasible
norm is guaranteed to be preserved even if new information is acquired, it follows
that the overridability of prima facie obligations is compatible with their persistence
as moral reasons for action. But this is not the only available interpretation: perhaps
it is better to say that moral valence is entirely a matter of what norms are triggered
and undefeated in the scenario under consideration, thus allowing for the possibility
that a feature counting in favor some action given one scenario may not be a reason at
all given another, or perhaps even count against choosing that action. If so, accepting
the antecedent of a defeasible norm by itself does not give one a moral reason for
acting in one way or another. Both views have some intuitive appeal, and both can be
accommodated by the story told here.15
14 One might complain that the proposal made here hardly tells us everything we want to know about
Jones’s case since it simply hardwires our intuitions about what he is normally required to do and about what
counts as an exceptional case into an ordering source. However, it is arguably not the task of a framework
for deontic reasoning to explain our normative intuitions but rather to correctly predict which inferences,
given those intuitions, are valid and invalid, and here I add the additional twist that these predictions derive
from plausible assumptions about the semantics of modals, conditionals, and defeasible generalizations.
Integrating techniques from frameworks such as Horty’s () allows us to derive the ordering source for
Jones’s scenario from its first two rules and the assumption that the requirement to save a child in need takes
priority over the requirement to keep a lunch appointment, but clearly one would then still have to rely on
normative intuitions that are not derived from more basic ethical principles.
15 The first interpretation is in line with Horty’s () conception of reasons as triggered defaults, though
he develops his framework further so that it leaves room for valence switching. The second interpretation
is in line with a contextualist perspective on reasons, which has been developed in detail by, for instance,
Lance and Little in a series of papers, starting with Lance and Little () (which they originally labeled
“particularist”). What all these views have in common is that they appeal to general, albeit defeasible,
principles in the derivation of obligations, thus being at odds with moral particularism. See Dancy ()
for a recent articulation of the case for particularism that also criticizes the appeal to defeasible principles as
guides to reasoning about obligations. Alas, discussing the merits of these criticisms goes beyond the scope
of this chapter.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
All of that preserves the advocated connection between prima facie obligations and
defeasible generalizations, and it translates the classical idea that deontic reasoning
is sensitive to the absence of information into the claim that deontic commitments
are sensitive to epistemic commitments: since the latter may be defeated by addi-
tional information—in particular, commitments involving epistemic might are not
necessarily preserved by information agglomeration—it is not surprising that deontic
commitments are no less susceptible to preservation failure. The even better news
is that the attested sensitivity of deontic ought to epistemic might highlights other
potential nonmonotonic effects in deontic discourse and reasoning that would remain
hidden if we merely focused on the parallel between prima facie obligations and
defeasible generalizations. Let me explain.
Taking a nonmonotonic perspective on discourse and reasoning, I have said,
promises an attractive response to the problem surrounding the miners paradox. But
whatever the underlying mechanism is, it cannot be that we are dealing with a prima
facie obligation to block neither shaft that is overridden whenever the miners are in
shaft A (shaft B). To see this, suppose we think of the deontic context in the miners
scenario as follows:
od = {[(inA ∨ inB) > ¬(blA ∨ blB)], [inA > blA], [inB > blB]}
Any frame that treats [[¬(blA∨blB)]] as an [[inA∨inB]]-default while treating [[blA]]
as an [[inA]]-default and [[blB]] as an [[inB]]-default is incoherent. For let s = [[inA ∨
inB]] and consider any w ∈ s : then w ∈ N π(s) only if w ∈ [[¬(blA ∨ blB)]]. But
clearly either w ∈ [[inA]] and in that case w ∈ / π([[inA]]) or w ∈ [[inB]] and in that
case w ∈ / π([[inB]]), which is just to say that N π(s) = ∅.
The last observation does not show that there is something wrong with our story
about defeasible generalizations and prima facie obligations, but just that a nonmono-
tonic escape route from the miners paradox cannot rely on what this particular story
has to say. The good news is that the dynamic framework developed so far does not
tie nonmonotonic effects in deontic discourse and reasoning to the defeasibility of
certain generalizations but rather to the defeasibility of epistemic might. The following
deontic context does justice to our intuitions about the miners scenario while allowing
for nonmonotonic effects to play a role in reasoning about that scenario:
od = {[blA ≡ e inA], [blB ≡ e inB], [¬(blA ∨ blB) ≡ (e inA ∧ e inB)]}
Let σ be the information we have about the miners’ whereabouts. Then σ e inA ∧
e inB and accordingly σ ¬e inA and σ ¬e inB. It follows that the minimal
indices in σ are those at which we block neither shaft. But consider σ = σ [inA]: then
σ e inA and accordingly, the minimal indices in σ are those at which we block
shaft A. For parallel reasons, σ [inB] d blB. Here is a summary of the output:
Fact Consider d as fixed for the miners paradox and let σ be the information
we have about the miners’ whereabouts:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
. σ d ¬(blA ∨ blB)
. σ inA ⇒ d blA
. σ inB ⇒ d blB
. σ inA ∨ inB
The framework thus accounts for our intuitions about the miners scenario and resolves
its air of paradox. For while σ d ¬(blA ∨ blB), we also know that σ [inA]
d ¬(blA ∨ blB) and so even though σ [inA] d blA, nonetheless σ [inA] ⊥
and so σ ¬inA. For similar reasons, σ ¬inB. What makes all this possible is that
adding the information that the miners are in shaft A (in shaft B) defeats one’s rational
commitment to the claim that we ought to block neither shaft, and the fact underlying
this observation is the following:
Fact Consider d as fixed for the miners paradox: then for all σ ∈ such that
σ e blA∧e blB it holds that if σ d ¬(blA∨blB), then σ e inA∧e inB
A commitment to the claim that we ought to block neither shaft (even though we might
block either of them) is dependent on one’s ignorance about the miners’ whereabouts.
Once this ignorance is removed the deontic commitment has to go as well.16
We may now highlight what Jones’s case has in common with the miners scenario
and where they differ. Both scenarios require that an action that is deontically optimal
with respect to some state σ fails to be deontically optimal with respect to a contracted
state even though that action is still choosable. The possibility of meeting Mary
for lunch, albeit deontically optimal in case Jones might do so and help the child
in need, fails to be deontically optimal once we assume that he needs to choose
between keeping his appointment and helping the child—even though he still might
do the former instead of the latter. Likewise, the possibility of blocking neither shaft,
albeit deontically optimal in case we do not know where the miners are, fails to be
deontically optimal once we assume that the miners are in shaft A—even though we
still might block neither shaft. So in both cases deontic ought exhibits a sensitivity to
epistemic might, but for different reasons. In Jones’s scenario, the underlying fact is that
information strengthening may override certain obligations since certain rules may
be defeated as additional information becomes available. This is not so in the miners
scenario since, for instance, the rule to block none of the shafts just in case the miners
might be in either of them is not defeasible in the first place. Instead, what is going on
16 Carr () and Cariani () argue that the semantics of deontic modality is sensitive to probabilistic
considerations, and it would be possible to expand the story developed here so that ordering sources can
be stated using probability operators. This would not change the fact that the framework developed here
is “quietist” in Charlow’s () sense since it does not explicitly encode decision-theoretic considerations
in the semantics: rather, such considerations play an implicit role in selecting the relevant ordering source
for the case under consideration. Charlow argues that so much quietism is not a good thing, but for our
purposes there is no need to resolve this issue here: whether or not decision-theoretic considerations are
to be explicitly represented in the semantics, the point remains that our intuitions about what ought to be
done in the miners scenario are sensitive to the presence or absence of certain epistemic possibilities in a
way that the framework developed here is well equipped to capture.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
here is that contraction may result in the rule requiring an action—blocking at least
one of the shafts—that it forbids in light of a weaker carrier of information (and simi-
larly for the other rules in the ordering source). In other words, the rules that are at play
in the miners scenario are absolute in the sense that they apply within any scenario,
but they may require different actions since they are stated using epistemic modals.
Horty () considers various obstacles to integrating the full insights from default
logic in a Kratzerian possible-worlds analysis of deontic modals. I have not demon-
strated that all these obstacles can be overcome—for instance, I have said nothing
about higher-order default norms in deontic reasoning—but the progress made here
should create some confidence that such a project is anything but futile. Let me briefly
highlight one particular issue that Horty discusses in some detail. Suppose that there is
a general prohibition against eating with your fingers ( > ¬F) with one exception: if
you are served cold asparagus, eating with your fingers is required (A > F).17 Horty’s
preferred framework for conditional oughts predicts that in light of these norms, “You
ought not eat with your fingers” and “If you are served cold asparagus, you ought to eat
with your fingers” are accepted in an out-of-the-blue context. It also predicts that the
norms under consideration do not license what he labels the “asparagus inference” to
the conclusion that you ought not be served cold asparagus. And this is a good result
since rules with exceptions should not render their exceptions as deontically sub-ideal
by design.
Horty also correctly observes that Kratzer-style possible worlds analyses of deontic
conditionals license the unfortunate asparagus inference. The inference pattern, how-
ever, is grounded in a “stability” feature of deontic contexts that is characteristic of
classical frameworks but not of the one presented here: that if an index i is optimal in
σd and included in some τ ⊆ σ , then i is guaranteed to be optimal in τd as well.18
It is a well-worn story that so much stability is problematic when it comes to the
miners paradox (see Cariani et al., , Charlow, b, and Kolodny and MacFarlane,
)—the crucial observations here are, first, that the feature is no less problematic
when it comes to prima facie obligations, and, secondly, that it is avoided by the story
about defaults told here. For consider od = {[ > ¬F], [A > F]} and let σ be the
initial information state: then clearly σd ¬F yet σd ¬A even though σ [A]d F,
and so σ d ¬F and σ A ⇒ d F yet σ d ¬A, as desired.19 It follows that
17 Following standard protocol, unconditional default rules are understood as defaults for the trivial
scenario W; is just any tautology.
18 To see the connection, suppose that σ ¬φ and thus that for some optimal i in σ , w ∈ [[φ]].
d d i
Clearly i ∈ σ [φ] and thus, assuming that d is stable, optimal in σ [φ]d . And if σ d ¬ψ as well, then
wi ∈ [[¬ψ]] and hence σ φ ⇒ d ψ. Accordingly, if σ d ¬ψ and σ φ ⇒ d ψ, then σ d ¬φ
under the assumption that d is stable.
19 The underlying fact here is that only the W-default [[¬F]] but not the [[A]]-default [[F]] is triggered in
σd . Accordingly, optimality in σd merely requires compliance with the W-default [[¬F]] and so an index
may be optimal even if its world-parameter is an A-world. When it comes to the miners paradox, suppose
again that σ is the information we have about the miners paradox: then all the indices in σd are optimal
since no default norms are involved and for some i ∈ σd , wi ∈ [[(¬blA ∧ ¬blB) ∧ inA]]. Such an index
is an element of σ [inA] but not included in σ [inA]d , which shows that the deontic context for the miners
scenario is just as unstable as the one for the asparagus case.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Loose Ends
The framework developed so far has something useful to say about contrary-to-duty
obligations, though the story does not turn out to be entirely satisfying. To see the
positive aspect of the story, fix the ordering source for Chisholm’s scenario as follows:
od = {[go], [go ⊃ tell], [¬go ⊃ ¬tell]}
Since we are concerned here only with absolute rules, considerations about frames are
irrelevant for current purposes and so may simply consider the following distribution
of truth-values across possible worlds:
go tell
w T T
w T F
w F T
w F F
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
longer license the inference of () under the additional assumption that Jones does
not go.
All of this is good news, but there remains the question why reasoning with violable
obligations is susceptible to monotonicity failures, and it is not clear that the story told
here gives quite the right answer. For notice that the current framework predicts that
d φ e φ for all φ ∈ L, and so the reason for why ¬go defeats d tell is that
¬go defeats d go. But of course we want to say that Jones ought to go regardless of
whether or not he goes, and so the treatment of the case under consideration is not
fully satisfying. Let me outline how the basic framework may be expanded so that we
can do better (see Willer, for additional discussion).
I rely on Frank’s () proposal that deontic ought requires that the input context
be “non-trivial” in the sense that its prejacent as well as its negation must be open
possibilities. This requirement was satisfied in all the cases we considered earlier but
it naturally becomes an issue when we look at cases in which obligations are violated.
Frank suggests that nontriviality violations result in the retraction of information to
arrive at an appropriate state. How to retract information from a state is a very complex
issue, and here I choose a very simple approach and assume that context associates
with each σ ∈ a system of spheres S(σ ) that is ordered by ⊆ and centered on σ . The
intuitive role of S(σ ) is to capture which commitments stand and fall together, and it is
then possible to define a downdating operation on carriers of information as follows:
Definition (Downdating). Consider arbitrary σ ∈ and φ ∈ L. S(σ ) ◦ φ = {σ ∈
S(σ ) : σ φ}. The result of downdating σ with φ, σ ↓ φ, is the minimal element of
S(σ ) ◦ φ in case S(σ ) ◦ φ = Ø, and σ otherwise.
Downdating σ with φ removes any commitment to φ by weakening σ to its minimal
revision that is no longer committed to φ. Notice that downdating idles whenever the
input state already fails to be committed to φ.
The modified proposal then is that deontic ought is a universal quantifier over the
set of possible worlds that are deontically optimal in light of a carrier of information
that leaves room for the prejacent as well as its negation to be a possibility. Precisely:
Definition (Deontic Ought with Retraction). Extend the update rules for L with
the following entry:
() σ [d φ] = {i ∈ σ : (σ ↓ φ ↓ ¬φ)d φ}
Downdating thus guarantees that we consider an appropriate carrier of information
even if nontriviality is violated by the input state (as long as the prejacent is contingent).
It is easy to see that the refined proposal avoids the unfortunate result that any
update with ¬φ defeats a commitment to d φ. For consider again the distri-
bution of truth-values across possible worlds from the beginning of this section, let
sσ = {w , w , w , w } and consider σ [¬go]: then downdating with the prejacent of
d tell or d ¬tell idles and so σ [¬go] d ¬tell. But downdating with the
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
20 See Heim () and, among others, von Fintel (), Gillies (, ), and Starr () for the
presuppositional analysis of conditionals. The basic idea goes back to Stalnaker ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
proposal, remember, was that such obligations may entail other obligations unless they
are violated, and I can now make this more precise. Say that an inference presupposes
φ just in case any state for which updating with its premises and its conclusion is
defined is committed to φ: then an inference licensed by deontic detachment—the
rule allowing the step from () and () to ()—presupposes that the satisfaction of
the unconditional obligation is compatible with the input state. It follows that even
if deontic detachment is valid, the inferences it licenses may be defeated through
information agglomeration.
On the view I have proposed here, then, an obligation [[φ]] is violated but binding
in σ just in case σ ¬φ ∧ d φ. Such cases differ sharply from those in which an
obligation [[φ]] is defeated in σ , which are such that σ ¬d φ yet for some σ so
that σ ⊆ σ , σ d φ. And intuitively, whenever [[φ]] is a prima facie obligation
conditional on [[ψ]] but defeated in σ , this is because σ depicts an exceptional
circumstance just in case σ ψ ∧ ¬d φ.
So in our Chisholm-style scenario, Jones violates his obligation to go but his obli-
gation to tell his neighbors is defeated and thus no longer binding. And once again we
have identified a nonmonotonic effect in deontic discourse and reasoning that cannot
be explained by appealing to the fact that certain obligations allow for exceptions. To
see this, suppose an alternative deontic context d providing the following ordering
source consisting of defeasible rules:
od = {[ > go], [go > tell], [¬go > ¬tell]}
Then clearly any σd treats [[go]] as a W-default, [[tell]] as a [[go]]-default, and
[[¬tell]] as a [[¬go]]-default. The simple observation then is that σ [¬go] d go,
which is just to say that σ [¬go] does not depict an exceptional circumstance in which
the default obligation to go fails to be binding—this just captures the intuition that we
should not collapse violations and exceptions. And while the prima facie obligation
[[tell]] conditional on [[go]] is defeated in σ [¬go], this is not so because σ [¬go]
depicts an exceptional circumstance either since σ [¬go] go—this just captures the
intuition that a prima facie obligation conditional on [[go]] fails to pertain to situations
in which Jones does not go, and so a scenario in which Jones does not go hardly
qualifies as an exceptional circumstance. That certain obligations allow for exceptions
does not explain why reasoning with violable obligations has a nonmonotonic flavor.
The fact that the framework developed here has no trouble accounting for the
connection between defeasibility and violability demonstrates once again that it
captures a rich variety of nonmonotonic effects in discourse and reasoning without
reducing them to exceptional circumstances. While providing a promising account
of violable obligations requires adding some non-trivial complications to the basic
dynamic account told in §, its success at fulfilling a range of key desiderata for a
nonmonotonic story about reasoning with violable obligations suggests that the story
is on the right track.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Conclusion
Deontic discourse and reasoning, so a familiar story goes, is nonmonotonic, but the
variety of ways in which it is sensitive to the absence of information has traditionally
been underappreciated. The traditional focus on prima facie obligations overlooks
that information agglomeration may trigger nonmonotonic effects without creating
contexts in which a prima facie obligation no longer applies, and I have highlighted
two reasons for why this is so: the fact that obligations may be violated and the fact
that obligations may be sensitive to epistemic uncertainty. The analysis developed
here differs from what has happened before in the literature in that it offers a
comprehensive perspective on the nonmonotonic nature of deontic discourse and
reasoning, and it does so by translating the sensitivity of deontic inferences to the
absence of information into the sensitivity of deontic ought to epistemic might. Since
information agglomeration may defeat commitments to epistemic might, it may also
defeat commitments to deontic ought: the nonmonotonicity of deontic thought and
talk can be illuminated in terms of the familiar nonmonotonicity of epistemic thought
and talk that finds a natural articulation in dynamic semantics.
I have remained silent on another popular motivation for nonmonotonic
approaches to deontic logic: the possibility of genuine moral dilemmas. This is
deliberate, for even if deontic logic must leave room for such dilemmas there is some
reason to think that their existence is compatible with a monotonic outlook (see,
e.g., van Fraassen, ; Cariani, ; and von Fintel, ). But let me briefly come
back to the question of whether attempting to arrive at a suitably nonmonotonic
logical consequence relation puts an explanatory burden on semantic theorizing that
is better reserved for some other component of a complete story about meaning,
communication, and reasoning.
The question connects with the issue whether dynamic semantics has any advan-
tages over a truth-conditional alternative coupled with an adequate pragmatic story
about conversational dynamics (see Rothschild and Yalcin (forthcoming) for discus-
sion). To get the issue into better view, suppose we recursively define truth-conditions
for our target language relative to an index i and information state σ that deliver the
following result for all φ of our target language: if i ∈ σ , then [[φ]]i,σ is true just in case
i ∈ σ [φ]. We may then define two distinct notions of logical consequence:
. Neoclassical Consequence: φ , . . . , φn ψ iff for all i and σ such that i ∈ σ :
if [[φ ]]i,σ is true and . . . and [[φn ]]i,σ is true, then [[ψ]]i,σ is true.
. Informational Consequence: φ , . . . , φn ψ iff e ψ is a neoclassical
consequence of e φ , . . . , e φn .
These semantic entailment relations have been extensively discussed in the context
of the miners paradox: neoclassical consequence avoids the problem by denying
modus ponens (see Kolodny and MacFarlane, ) while informational consequence
does so by allowing for reductio failures (see Bledin, and Yalcin, ). But
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
what about Ross- and Chisholm-style scenarios, which stand at odds with the fact
that neoclassical and informational consequence are monotonic by design? The most
promising strategy is to appeal to the already familiar story about conversational
dynamics from Stalnaker (): additional information in discourse and reasoning
affects, as a matter of pragmatics, the informational parameter in light of which
subsequent utterances are evaluated. More precisely, say that σ + φ = σ ∩ [[φ]]σ .
Then we may define a pragmatic inference relation (inspired by Stalnaker, ) that
treats the inference of ψ from φ , . . . φn as reasonable (rather than valid) just in case
for all i and σ : if i ∈ σ + φ + . . . φn , then i ∈ [[ψ]]σ +φ +...φn . Given the striking
similarity between reasonable inference and dynamic logical consequence, it should
not be surprising that the former is just as nonmonotonic as the latter is: epistemic and
deontic commitments are, as we have seen, defeasible and hence a sentence may be true
in light of some carrier of information but fail to be true with respect to some stronger
state. We may thus leave it to pragmatics to explain why additional information in
discourse and reasoning can defeat existing deontic commitments, and of course this
shift of explanatory burden would live happily with the suggestion that belief revision
theorists, rather than those concerned with semantic entailment, should worry about
nonmonotonicity. Why let semantics do all the work?
Let me make two points in response. First, many key aspects of the previous
discussion remain unaffected by the decision on how to divide the labor between
semantics and pragmatics (or between logic and belief revision theory). Specifically,
the point remains that the nonmonotonicity of deontic discourse and reasoning stands
in need of an explanation that goes beyond the one provided by theories that take
their inspiration from default logic, and that such an explanation can be provided by
a possible worlds semantics for conditionals and deontic modals that takes dynamic
effects in discourse and reasoning seriously. Furthermore, I contend that any such
explanation needs a model of how norms are sensitive to the flow of information along
the lines I have provided in this paper. These aspects of my story, as well as the key
ideas behind their formal elaboration, remain in place regardless of the semantics-
pragmatics distinction.
Playing a bit more offense, the choice of a monotonic consequence relation has
repercussions that go beyond the need to put substantial explanatory burden onto
a pragmatic story about information dynamics. Most strikingly, any monotonic con-
ception of logical consequence will inevitably be at odds with the semantic fact that
conditionals are nonmonotonic in the antecedent (resist antecedent strengthening),
and in particular we have cases in which ψ yet φ ⇒ ψ for some
φ, ψ ∈ L+ and ⊆ L+ .21 But of course on a monotonic conception of logical
consequence, , φ ψ whenever ψ, which is just to say that any such
conception must flat out reject the semantic equivalent of the deduction theorem:
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
22 For sure, one may rely on CCPs in semantic theorizing and just arrive at a framework that makes the
same predictions about consistency and entailment as its truth-conditional alternative (see von Fintel and
Gillies, ). Such proposals arguably fail to be interestingly dynamic, but clearly the one I have developed
here is not one of them.
23 The underlying observation from van Benthem () is that updating with an element of L is always
mediated by a classical proposition. This is no longer the case once we consider modal formulas and state
their semantics in terms of acceptance conditions that are sensitive to global features of the input state.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
discourse and reasoning is well-established. I submit that we have just as much reason
to take seriously a dynamic perspective on deontic discourse and reasoning.
Acknowledgments
Key ideas of this paper were presented at a workshop on nonstandard modals at the University of
Leeds (organized by Paolo Santorio). Special thanks to the audience for insightful comments and
discussion, as well as to Fabrizio Cariani, Nate Charlow, Matthew Chrisman, Frank Veltman,
and an anonymous peer commentator for detailed feedback that proved to be invaluable
in preparing the final manuscript. Support from the Franke Institute for the Humanities is
gratefully acknowledged.
References
Åqvist, Lennart () Deontic Logic. In Gabbay, Dov M. and Guenthner, Franz (eds.) Hand-
book of Philosophical Logic. nd Edition. Volume . pp. –. Dordrecht: Kluwer.
Asher, Nicholas and Bonevac, Daniel () Prima Facie Obligation. Studia Logica: An Interna-
tional Journal for Symbolic Logic. . . pp. –.
Asher, Nicholas and Bonevac, Daniel () Common Sense Obligation. In Nute, Donald E.
(ed.) Defeasible Deontic Logic. pp. –. Dordrecht: Kluwer.
Beaver, David I. () Presupposition and Assertion in Dynamic Semantics. Stanford, CA: CSLI
Publications.
Belzer, Marvin () Reasoning with Defeasible Principles. Synthese. . . pp. –.
Bledin, Justin () Logic Informed. Mind. . . pp. –.
Bonevac, Daniel () Against Conditional Obligation. Noûs. . . pp. –.
Cariani, Fabrizio () ‘Ought’ and Resolution Semantics. Noûs. . . pp. –.
Cariani, Fabrizio () Deontic Modals and Probabilities: One Theory to Rule Them All? In
Charlow, Nate and Chrisman, Matthew (eds.) Deontic Modality. Oxford: Oxford University
Press.
Cariani, Fabrizio, Kaufmann, Magdalena, and Kaufmann, Stefan () Deliberative Modality
under Epistemic Uncertainty. Linguistics and Philosophy. . . pp. –.
Carmo, José and Jones, Andrew J. I. () Deontic Logic and Contrary-to-Duties. In Gabbay,
Dov M. and Guenthner, Franz (eds.) Handbook of Philosophical Logic. nd Edition. Volume .
pp. –. Dordrecht: Kluwer.
Carr, Jennifer () Deontic Modals without Decision Theory. In Chemla, E., Homer, V., and
Winterstein, G. (eds.) Proceedings of Sinn und Bedeutung . École normale supérieure, Paris,
– September. pp. –.
Charlow, Nate (a) Conditional Preferences and Practical Conditionals. Linguistics and
Philosophy. . . pp. –.
Charlow, Nate (b) What We Know and What To Do. Synthese. .. pp. –.
Charlow, Nate () Decision Theory: Yes! Truth Conditions: No! In Charlow, Nate and
Chrisman, Matthew (eds.) Deontic Modality. Oxford: Oxford University Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Chisholm, Roderick M. () Contrary-to-Duty Imperatives and Deontic Logic. Analysis. . .
pp. –.
Dancy, Jonathan () Ethics Without Principles. New York, NY: Oxford University Press.
Dowell, Janice L. () Contextualist Solutions to Three Puzzles about Practical Conditionals.
In Shafer-Landau, Russ (ed.) Oxford Studies in Metaethics. Volume . pp. –. New York,
NY: Oxford University Press.
Fintel, Kai von () The Presupposition of Subjunctive Conditionals. In Sauerland , Uli and
Percus, Orin (eds.) The Interpretive Tract (MIT Working Papers in Linguistics ). pp. –.
Cambridge, MA: MITWPL.
Fintel, Kai von () NPI Licensing, Strawson Entailment, and Context Dependency. Journal
of Semantics. .. pp. –.
Fintel, Kai von () The Best We Can (Expect to) Get? Challenges to the Classic Semantics
for Deontic Modals. Manuscript, MIT. [Online] Available from: http://mit.edu/fintel/fintel-
-apa-ought.pdf. [Accessed Nov .]
Fintel, Kai von and Anthony S. Gillies () An Opinionated Guide to Epistemic Modality.
In Gendler, Tamar and Hawthorne, John (eds.) Oxford Studies in Epistemology. . pp. –.
New York, NY: Oxford University Press.
Frank, Anette () Context Dependence in Modal Constructions. Ph.D. thesis. University of
Stuttgart.
Fuhrmann, André () Reflective Modalities and Theory Change. Synthese. . . pp. –.
Gillies, Anthony S. () Epistemic Conditionals and Conditional Epistemics. Noûs. . .
pp. –.
Gillies, Anthony S. () On Truth-Conditions for If (but Not Quite Only If ). Philosophical
Review. . . pp. –.
Gillies, Anthony S. () Iffiness. Semantics and Pragmatics. . . pp. –.
Greenspan, Patricia S. () Conditional Oughts and Hypothetical Imperatives. The Journal of
Philosophy. . . pp. –.
Groenendijk, Jeroen and Stokhof, Martin () Dynamic Predicate Logic. Linguistics and
Philosophy. . . pp. –.
Harman, Gilbert () Change in View. Cambridge, MA: MIT Press.
Heim, Irene () The Semantics of Definite and Indefinite Noun Phrases. Ph.D. thesis. Univer-
sity of Massachusetts, Amherst.
Horty, John F. () Moral Dilemmas and Nonmonotonic Logic. Journal of Philosophical Logic.
. . pp. –.
Horty, John F. () Nonmonotonic Foundations for Deontic Logic. In Nute, Donald E. (ed.)
Defeasible Deontic Logic. pp. –. Dordrecht: Kluwer.
Horty, John F. () Reasoning with Moral Conflicts. Noûs. . . pp. –.
Horty, John F. () Reasons as Defaults. Philosophers’ Imprint. . . pp. –. [Online] Available
from: http://dx.doi.org//spo.... [Accessed Nov .]
Horty, John F. () Reasons as Defaults. New York, NY: Oxford University Press.
Horty, John F. () Deontic Modals: Why Abandon the Classical Semantics? Pacific Philosoph-
ical Quarterly. . . pp. –.
Kamp, Hans, van Genabith, Josef, and Reyle, Uwe () Discourse Representation Theory. In
Gabbay, Dov M. and Guenthner, Franz (eds.) Handbook of Philosophical Logic. nd edition.
Volume . pp. –. Dordrecht: Springer.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Kolodny, Niko and MacFarlane, John () Ifs and Oughts. Journal of Philosophy. . .
pp. –.
Kratzer, Angelika () Modality. In von Stechow, Armin and Wunderlich, Dieter (eds.)
Semantics: An International Handbook of Contemporary Research. pp. –. Berlin: de
Gruyter.
Kratzer, Angelika () Modals and Conditionals. New York, NY: Oxford University Press.
Lance, Mark N. and Little, Margaret O. () Defeasibility and the Normative Grasp of
Context. Erkenntnis. . –. pp. –.
Levi, Isaac () Iteration of Conditionals and the Ramsey Test. Synthese. . . pp. –.
Lewis, David () Counterfactuals. Cambridge, MA: Harvard University Press.
Lewis, David () Semantic Analyses for Dyadic Deontic Logic. In Stenlund, Sören (ed.)
Logical Theory and Semantic Analysis: Essays Dedicated to Stig Kanger on His Fiftieth Birthday.
pp. –. Dordrecht: Reidel.
McCarthy, John () Circumscription—A Form of Non-Monotonic Reasoning. Artificial
Intelligence. . –. pp. –.
McCarty, J. and Thorne, L. () Defeasible Deontic Reasoning. Fundamenta Informaticae. .
–. pp. –.
McDermott, Drew and Doyle, Jon () Non-Monotonic Logic I. Artificial Intelligence. . –.
pp. –.
McNamara, Paul () Deontic Logic. In Gabbay, Dov M. and Woods, John H. (eds.) The
Handbook of the History of Logic, vol. : Logic and Modalities in the Twentieth Century.
pp. –. Amsterdam: Elsevier.
Nute, Donald E. () Defeasible Reasoning and Decision Support Systems. Decision Support
Systems. . . pp. –.
Nute, Donald E. () Apparent Obligation. In Nute, Donald E. (ed.) Defeasible Deontic Logic.
pp. –. Dordrecht: Kluwer.
Pollock, John L. () Defeasible Reasoning. Cognitive Science. . . pp. –.
Prakken, Henry and Sergot, Marek () Contrary-to-duty Obligations. Studia Logica. . .
pp. –.
Prakken, Henry and Sergot, Marek () Dyadic Deontic Logic and Contrary-to-Duty Obli-
gations. In Nute, Donald E. (ed.) Defeasible Deontic Logic. pp. –. Dordrecht: Kluwer.
Ramsey, Frank P. () The Foundations of Mathematics. London: Routledge and Kegan Paul.
Reiter, Raymond () A Logic for Default Reasoning. Artificial Intelligence. . –. pp. –.
Ross, William D. () The Right and the Good. Oxford: Oxford University Press.
Rothschild, Daniel and Yalcin, Seth (forthcoming) On the Dynamics of Conversation. Noûs.
Rott, Hans () Conditionals and Theory Change: Revisions, Expansions, and Additions.
Synthese. . . pp. –.
Ryu, Young U. and Lee, Ronald M. () Deontic Logic Viewed as Defeasible Reasoning. In
Nute, Donald E. (ed.) Defeasible Deontic Logic. pp. –. Dordrecht: Kluwer.
Silk, Alex () Truth Conditions and the Meanings of Ethical Terms. In Shafer-Landau, Russ
(ed.) Oxford Studies in Metaethics. Volume . pp. –. New York, NY: Oxford University
Press.
Stalnaker, Robert C. () Indicative Conditionals. Philosophia. . . pp. –.
Stalnaker, Robert C. () Assertion. In Cole, Peter (ed.) Syntax and Semantics . pp. –.
New York, NY: New York Academic Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Starr, William B. () A Uniform Theory of Conditionals. Journal of Philosophical Logic. ..
pp. –.
Thomason, Richmond H. () Deontic Logic as Founded on Tense Logic. In Hilpinen, Risto
(ed.) New Studies in Deontic Logic. pp. –. Dordrecht: Reidel.
Touretzky, David S. () The Mathematics of Inheritance Systems. Los Altos, CA: Morgan
Kaufmann.
van Benthem, Johan () Essays in Logical Semantics. Dordrecht: Reidel.
van der Torre, Leendert W. N. and Tan, Yao-Hua () The Many Faces of Defeasibility. In
Nute, Donald E. (ed.) Defeasible Deontic Logic. pp. –. Dordrecht: Kluwer.
van der Torre, Leendert W. N. and Tan, Yao-Hua () An Update Semantics for Deontic
Reasoning. In Proceedings of DEON. Bologna, January . pp. –. Amsterdam: IOS
Press.
van Fraassen, Bas () Values and the Heart’s Command. Journal of Philosophy. . . pp. –.
Veltman, Frank () Logics for Conditionals. Ph.D. thesis. University of Amsterdam.
Veltman, Frank () Defaults in Update Semantics. Journal of Philosophical Logic. . .
pp. –.
Willer, Malte () A Remark on Iffy Oughts. Journal of Philosophy. . . pp. –.
Willer, Malte () Dynamic Thoughts on Ifs and Oughts. Philosophers’ Imprint. . .
pp.: –. [Online] Available from: http://dx.doi.org//spo.... [Accessed
Nov .]
Wright, von George H. () Deontic Logic. Mind. . . pp. –.
Wright, von George H. () A Note on Deontic Logic and Derived Obligation. Mind. . .
pp. –.
Yalcin, Seth () A Counterexample to Modus Tollens. Journal of Philosophical Logic. . .
pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Dynamic Expressivism about
Deontic Modality
William B. Starr
1 Note that even on this approximation, expressivism should not be equated with a non-propositional
semantics. A non-propositional semantics, e.g. Hamblin’s () for interrogatives, needn’t involve motiva-
tional attitudes. Further, one could offer a propositional expressivist semantics as long as one had a story
about how those contents are connected to motivational attitudes rather than representational ones. For
discussion see Chrisman (), Schroeder (), Ridge (), and Charlow ().
2 For a more detailed version of this narrative see Schroeder (a, Chapters –).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
as Geach () stressed, it is not clear how non-representational sentences could bear
relations of consequence and consistency.
One goal of this chapter is to press this dilemma against recent expressivists that use
the tools of truth-conditional semantics to model expressive meaning and communi-
cation (Gibbard, ; Dreier, ; Yalcin, , ; Silk, ) (§§–). I will argue
that these accounts either rely on representational concepts that are inappropriate
for expressive motivational discourse, or they end up blurring the distinctions that
make expressivism a distinctive and interesting thesis. In doing so, I will propose
a new way of decomposing the negation problem (§) and argue that expressivism
cannot succeed without a distinctive theory of expressive communication (§). My
other goal is more positive. I will assume for the purposes of the argument set
out that representational language inherits its distinctive semantic properties from
the function of representational attitudes: to track how the world is. Further, I will
assume that motivational attitudes have a very different function, namely to motivate
choice and action, and that motivational language inherits rather different semantic
properties from this motivational function. This is not so much because I am confident
in these assumptions, but because they provide the most interesting and challenging
setting to defend expressivism. Against this backdrop, I will use dynamic semantics, a
relatively recent semantic framework discussed in computer science and linguistics,
to offer a model of expressive and representational language where both kinds of
discourse are seamlessly integrated, and where uniform explanations of composition-
ality and logical relations are forthcoming (§§–). Dynamic semantics provides an
algebra of processes rather than contents. I will show how this embodies a more
general conception of meaning and logic that yields motivational and representational
meaning as special cases.3
Here at the outset, I should clarify that I will not be offering a positive linguistic
or philosophical argument in favor of expressivism. Yet, linguists and empirically-
minded philosophers of language may find something of interest here. The empirical
investigation of ‘expressive meaning’ is a burgeoning field, whose object of study has
remained somewhat difficult to classify and integrate in compositional systems.4 The
perspective on expressivism, communication and compositional semantics offered
here should be useful for these purposes. The semantics developed here also makes
interestingly different predictions than more orthodox approaches (e.g. Kratzer,
), thereby connecting to a broader debate over non-truth-conditional semantics
3 This emphasis on processes (acceptance, rejection) rather than content is a theme in Gibbard (),
but is not embraced in Gibbard’s formalism. Dynamic semantics provides the tools for doing so. Alwood
() pursues a similar dynamic approach and situates it in more detail with work in metaethics. I aim
to provide a more thorough technical implementation here and connect such an approach to the issue of
communication in expressivism. One virtue of this implementation is not having to distinguish, as Alwood
() must, between two kinds of negation. I will offer one semantics for negation which models both
propositional and expressive negation.
4 E.g. evidentiality (Murray, ; Faller, ), attention (Bittner, ), narrative structure (Hobbs,
), slurs (Anderson and Lepore, ) and other phenomena (Potts, ; McCready, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
(Veltman, , p. ; Portner, , p. )5 and expressivist pragmatics (Yalcin, ,
) for modality. Throughout, I will be assuming that expressivism is a semantic
thesis about the linguistic meaning of sentences which challenges the idea that all
communication proceeds by referring to things and describing them. Along the way
(§), I will present difficulties for the idea that expressivism is merely a pragmatic
thesis (Yalcin, , ). Unfortunately, I will not have space to discuss important
recent work that pitches expressivism as a metasemantic thesis (Chrisman, ;
Ridge, ) about how certain expressions get their traditional meanings, or what
it means to ascribe those traditional meanings to those expressions (Carballo, ).
It is interesting to note, however, that the linguistic meanings ascribed in my dynamic
approach are abstract objects that model the processes which Lewis () and Grice
() took to endow sentences with their contents. However, in taking these processes
themselves to be the meanings the dynamic approach arrives at additional semantic
resources for solving the Frege-Geach problem.6 Only further, more general, work
on metasemantics will reveal the comparative virtues of semantic and metasemantic
approaches.
5 In an earlier version of this work I adopted a semantics much closer to Veltman (, p. ) and
Portner (, p. ). The new semantics is now a bit of a hybrid between that semantics, one from dynamic
logic (van Benthem and Liu, ) and Starr’s () dynamic semantics for imperatives.
6 This approach has much in common with Millikan (, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
no basis for choice, but if they had no beliefs they could still (naively) choose actions.
These different theoretical roles justify rather different assumptions about probabilities
and utilities. If an agent has credence . that Zoyd shared, then that agent is assumed
to have credence . that Zoyd didn’t share. By contrast, if Zoyd’s sharing has utility
for an agent, it is not assumed that Zoyd’s sharing gets utility t − where t is the total
amount of utility actions can have for the agent. The assumption for probabilities is
reasonable since they capture how the agent represents the world: in the world either
Zoyd shared or he didn’t. This assumption is not plausible for utilities because they
don’t represent the world.
Against this backdrop, an expressivist might say that utterances of Zoyd must share
express utility functions. Perhaps functions where Zoyd’s sharing is assigned a high
utility relative to some standard. In simple cases, this might serve to bring one agent’s
values in line with another’s. By contrast, the subjectivist would say that deontic
utterances of Zoyd must share express propositions about utility functions. In simple
cases, these utterances would serve primarily to align agents beliefs (i.e. probability
spaces). Though this might result in agents’ aligning their values, that’s not the primary
function of deontic discourse according to the subjectivist. A third position, call it
simple naturalism, maintains with subjectivism that deontic utterances express beliefs
about the world. But it rejects that these beliefs are about utility-relative facts: they
are simple beliefs about the utility-independent world. Seeing how the Frege-Geach
problem arises in this setting will allow me to say what dynamic expressivism is and
how it differs from other accounts.
Frege (, ) famously proposed the standard explanation of how represen-
tational content behaves in a compositional way. But, less famously, Frege () also
contended that those explanations do not work for non-representational meaning. The
argument was simple: the meaning of negation is exhausted by its role of turning a true
content into a false one and vice versa. This explains why any sentence and its negation
are inconsistent, i.e. cannot both be true. Here’s the problem for expressivists: if (a)
does not represent the world, then how does negating it do anything meaningful, let
alone inconsistent with (a)?
() a. Zoyd must share.
b. It’s not the case that Zoyd must share.
This is the Frege-Geach Problem, whose title reflects its application by Geach ()
to non-representational theories of moral language.7 To solve this problem, it seems
that expressivists must say what non-representational attitude (a) expresses, how the
attitude expressed by (b) is generated by negating the expression of an attitude by (a)
and what it is for two non-representational attitudes to be inconsistent. Parallel issues
arise with other connectives and explaining entailment patterns.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
One can see the challenge more precisely from within the simple model of rational
agency above. There is a relationship between a rational agent’s credence in Zoyd
shared and Zoyd didn’t share: no credence should be assigned to the possibility of
both being true. This, as mentioned above, follows from background assumptions
about truth, representation and the functional role of beliefs—more on that in §.
It is less obvious that such a relationship exists between how much a rational agent
values Zoyd sharing and how much they value Zoyd not sharing. One can certainly
imagine a scenario where both outcomes have a high utility for the agent. Further, the
relationship between how an agent values Zoyd sharing and how they value Zoyd not
sharing speaks to the consistency of (a) and (b), rather than (a) and (b).8
() a. Zoyd must share.
b. Zoyd must not share.
Although (b) intuitively entails (b), the opposite is not true. Note that it is only
consistent to follow up (b) with but Zoyd may share. This leads to an important
observation: there are really two ‘negation problems’ for the expressivist. Suppose one
can successfully analyze (a) and (b) as expressing ‘motivationally conflicted’ values:
being in favor of sharing and being in favor of not sharing. This would be a solution to
the internal negation problem. Since one can endorse (b) without endorsing (b), an
analysis of (b) is still needed. This leaves open the possibility that a different account
of the conflict between (a) and (b) will be needed. Call this the external negation
problem. I believe the two problems require different solutions. While this point will
be argued for more precisely later, the basic idea can be articulated with the resources
currently on hand. It will then serve as a basis for contrasting dynamic expressivism
with others (Gibbard, ; Dreier, ; Yalcin, , ; Silk, ).
Even if (a) and (b) express motivationally conflicted states of mind, it is hard
to see how the same is true of (a) and (b). Initially, this is because it is hard to see
how a sentence with external negation expresses a particular motivational attitude at
all. If one thinks of Zoyd must share as serving to instill pro-sharing utilities in the
hearer, then It is not true that Zoyd must share rejects instilling those utilities. But
what particular utilities does this rejection promote? I believe that focusing on this
question gets to the heart of the problem for expressivists. Previous attempts to say
what attitude is expressed have failed to render (a) and (b) inconsistent in a way that
is clearly non-representational.
Gibbard (, pp. –) offers the following expressivist model:
() a. ‘You must share’ expresses: Agreeing with sharing
b. ‘You must not share’ expresses: Disagreeing with sharing
c. ‘It is not the case that you must share’ expresses: Disagreeing with agreeing
with sharing
8 Unwin () highlights the importance of this distinction in the context of expressivism.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
This model does not say anything more about what disagreeing and agreeing are which
explains why agreeing with X and disagreeing with X are in rational tension. In the
setting above, this amounts to saying that You must share and It is not the case that you
must share express conflicting utility functions without saying what utility functions
these are and how exactly those functions or their acceptance conflict. Some have
worried whether such a semantic theory is adequately explanatory (e.g. Schroeder,
a, d). § will press that concern in another way by arguing that expressivists
need to offer a positive account of how expressive communication works and how
it differs from representational communication. It is hard to see how this could work
without characterizing representational and expressive communication as importantly
different processes.
Building on Dreier (, ), Silk () develops what might be called prefer-
ence expressivism, which formulates a deontic semantics in terms of preference order-
ings. In Starr () I pursue roughly the same strategy to explain the incompatibility
of contrary imperatives such as Dance! and Don’t dance!, without assuming that imper-
atives are representational. This aims to fill the gaps in Gibbard’s approach. The basic
idea is that deontic sentences express preferences. Much like utilities, the function of
preferences is to motivate choice.9 Crucially, this means that preferences which prevent
a choice will be dysfunctional. The idea is then that two deontic sentences can be
inconsistent if they express preferences which, taken together, are dysfunctional in this
way. A theory along these lines offers an interesting expressivist solution of the internal
negation problem. But I do not think it can solve the external negation problem.
A (strict) preference for you sharing could be modeled as an ordering which ranks
any world w where you share over every world w where you don’t: w w . Now,
consider a semantics on which:
() a. ‘You must share’ expresses: w w
b. ‘You must not share’ expresses: w w
For now, set aside how the meaning for (b) is determined from the meaning of (a)
and a semantics for negation. Here is how one might explain the inconsistency of
these two sentences on such a semantics.10 The first step is to determine what it is to
take these two preference orderings ‘together’. Unioning them into a bigger preference
ordering is the most natural idea: w w , w w .11 It is a platitude about
choice that you shouldn’t choose A if you strictly prefer B to A and you can choose B
9 The earlier utility expressivist proposal was that Must(A) expresses utility functions which assign a
utility to A-worlds that is high according to some standard. By contrast, preference expressivism says: the
utility of A-worlds is greater than the utility of ¬A-worlds.
10 Neither Dreier (, ) nor Silk () explicitly formulates the decision-theoretic constraints on
preference and choice that they take to constitute preferential coherence. Following Starr (), I assume in
the main text above a non-dominance principle about choice and tie this to the need for acyclic preferences.
I do not know whether this is what Dreier (, ) and Silk () had in mind.
11 Note that is the set {w , w } and is the set {w , w }. The fact that union is the operation
appropriate to combining preferences is discussed by van Benthem and Liu (). This is fairly clear from
thinking about how to combine the orderings from Must(A) and Must(B). Intersection predicts that A ∧ B-
worlds, ¬A ∧ B-worlds and A ∧ ¬B-worlds are best, while union predicts that only A ∧ B-worlds are best.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
instead. This means that any cyclic strict preference ordering is irrational. Something is
preferred to every alternative, so no alternative can be rationally chosen. This is exactly
the situation in . Since it fails to motivate a choice, it is motivationally defective. This
seems like exactly the kind of explanation of inconsistency that expressivists seek. But,
I claim, it does not work for the external negation problem.
Following standard accounts of modality, Silk () proposes that Must(A)
requires all the most preferred worlds to be A-worlds, while ¬Must(A) requires some
of the most preferred worlds to be ¬A-worlds. The issue is that taking together—
unioning—two preference orderings that meet these respective descriptions does not
always yield a dysfunctional one. To illustrate this, consider a model in which we
are concerned only with A and B worlds. Using a capital letter to indicate truth
and a lowercase letter for falsity, Must(A) can express an ordering such as while
¬Must(A) can express an ordering such as : wAB and wAb are the most preferred
worlds according to , and waB is among the most preferred worlds according to .
As the graphs illustrate, there is nothing dysfunctional about taking these preferences
together. Together, they will motivate the agent to choose wAB or wAb . These combined
preferences are not motivationally dysfunctional.
AB Ab AB Ab AB Ab
aB ab aB ab aB ab
1 2 1∪ 2
Must(A) ¬Must(A)
From the dynamic perspective, the preference expressivist went wrong in two
assumptions: (i) accepting ¬Must(A) is the same kind of process as accepting
Must(A) and (ii) consistency is a relationship between contents rather than processes.
On the dynamic account developed in §, the meaning of an expressive sentence is
characterized in terms of how accepting it changes a preference ordering. Must(A)
introduces a strict preference for all live A-worlds over ¬A-worlds, and tests that
doing so makes A a practical necessity: all the choices it motivates result in A-worlds.
(The success of this test will depend on which preferences have been previously
introduced.) ¬Must(A), on the other hand, does not introduce a preference at all,
so its inconsistency with Must(A) is not an inconsistency of preference. ¬Must(A)
changes a preference ordering by removing any preferences that would be added by
Must(A). Must(A) and ¬Must(A) are dynamically inconsistent: since one sentence
adds what the other removes there is no ordering which contains the effects both have
on a preference ordering. But that is crucially different from saying that they lead to
orderings which, combined in the standard way, are rationally defective. Must(A) and
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
¬Must(A) are inconsistent in the same way that certain actions are. Consider a robot
that rolls around a bar to each table. It fills an empty shot glass with vodka and it
vacuums up the contents of non-empty shot glasses. Then it cruises on. The action of
pouring a shot and the action of vacuuming one are incompatible in the sense that no
state of a given glass that directly results from the robot’s actions contains the effects of
both actions. But this is not because the effects themselves are incompatible: the empty
space in the shot glass and the shot of vodka are clearly compatible in that one can fill
the other. As will be discussed, a non-dynamic account does not make room for the
idea that φ and ¬φ have radically different communicative effects. On a non-dynamic
account, negation takes one content (e.g. preference-ordering or proposition) and
yields another of the same kind. Their communicative effects are modeled by a single
pragmatic process which combines contents of that kind (the previously accepted
contents incremented with the new content).
The dynamic approach to the external negation problem combines elegantly with
the preference expressivists’ approach to the internal negation problem. In §, I define
dynamic preferential inconsistency as follows: φ and ψ are dynamically preferentially
inconsistent just in case there is no single practically rational preference ordering that
contains the effects on preferences that both would have. In the case of Must(A)
and Must(¬A), there exists a preference ordering that contains preferences both
introduce, but it is irrational because those preferences are cyclic. Must(A) and
¬Must(A) are different: there is no preference ordering which contains the effects on
preferences that each sentence produces. If an ordering has the preferences Must(A)
adds, then updating with ¬Must(A) would remove them. If an ordering lacks the
preferences ¬Must(A) would remove, then updating with Must(A) would add them.
Formalizing the dynamic account requires a logic of actions rather than contents,
one which speaks to questions of how connectives combine actions and how actions
can bear logical relations to each other. Fortunately, logics of this kind—called dynamic
logics (Harel et al., )—have been extensively developed in computer science and
have inspired new semantic approaches to natural language (Heim, ; Kamp, ;
Groenendijk et al., ). The basic idea of this approach is to formally describe how
accepting each type of sentence changes an agent’s state of mind. Surprisingly, one can
reformulate classical logic in this way and then add non-classical operators to produce
operations that do not amount to operations on content (§). Hence, this is not an
alternative approach to semantics, but a more general one. By importing the dynamic
semantics for disjunction from Starr () below, I will also be able to navigate a chal-
lenge presented by Schroeder () for previous dynamic analyses and analyses such
as Yalcin’s (, ): they incorrectly predict that believing Must(A) ∨ B entails
either believing Must(A) or believing B.12 As I will emphasize, parallel observations
hold for what disjunctions communicate. This connection between an expressivist
12 Schroeder () makes a more general observation: this prediction holds even if both disjuncts are
expressive. The semantics in § also blocks this result.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
13 The quotation from Silk (, §) nicely captures the operative expressivist thesis at play in much
recent work including Gibbard (, p. ), Schroeder (c, p. ) and Dreier (, , p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
14 Schroeder (b) also highlights this gap and explores several expressivist accounts of ‘expression’
which pair a sentence with the attitude it expresses. The expressivist accounts I’ll consider, like Yalcin (),
do not make use of such a pairing, and focus on the interpersonal relation of communication rather than
the individualistic notion of expression. So while Schroeder (b) makes important and related points,
its criticism is complementary to that presented here.
15 See also Silk () and Charlow () who discuss the explanatory thesis.
16 I will say more about what I take ‘explaining inconsistency’ to be in §.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
by replacing talk of sentences expressing mental states with updating mental states,
and focusing on explanations which appeal to the non-representational dimensions
of those states (e.g. attention, preference).
Dynamic Expressivist Explanatory Thesis Semantic properties of expressive
sentences are explained, fundamentally, in terms of the way they update non-
representational dimensions of language users’ mental states.
I’ve said a bit above about why expressivism is a thesis worthy of attention in
metaethics, but I think it is also important to say why it is worthy of attention in the
study of natural language meaning.
Expressivism may seem like an esoteric philosophical doctrine, but its communi-
cation thesis is familiar from commonsense observations. Suppose we’re looking at
the same field of wild flowers. I’ve noticed a bee fly on a nearby wild flower and I’m
attending to it. I point at it and say, that’s what a bee fly looks like. I was attending
to the bee fly and you are now attending to the bee fly. I’ve communicated my state
of attention to you. But I did not do so by referring to my state of attention and
stating a fact about it, i.e. I did not say something like My state of attention is thus-
and-so. The way I communicated my state of attention was very different than the way
I communicated the object of my attention, to which I referred with a gesture. Further,
one can imagine two agents’ having identical visual representations of a scene but their
attention to be centered on different objects. Since such a difference is likely to impact
actions where they coordinate, it seems that successful communication will involve
agents attuning their state of attention, despite the fact that they are not referring
to those states and talking about them. Indeed, given that agents seem capable of
communicating certain features of their mental states without even being aware of
them, expressivism in these domains seems inevitable. Attention, of course, is not the
only expressive dimension where this difference seems to matter. When I tell you
that bee flies lay their eggs in beetle nests and that after hatching their larvae feed
on their hosts’ eggs, you might express disgust at this parasitism with a grimace. You’ve
thereby communicated your disgust to me.17 But you did not do so by referring to your
state of disgust and stating a fact about it. You did not say anything like My state of
disgust is thus-and-so. Nothing in your facial expression seems to have brought your
affective state into the conversation in the way my pointing brought the bee fly into
the conversation, nor does anything in your facial expression describe your affective
state. These two examples illustrate the expressivist communication thesis at work in
a more general setting. In these cases it seems plausible enough. But neither example
involved essentially deontic language, or language at all: gestures were the expressive
17 Add to the story that I intended to do so if you are inclined to the Gricean orthodoxy that requires
an effect to be intended for it to be communicated. Note that the cost of such orthodoxy is positing a
higher-order mental state which involves the speaker attending to their state of attention. Expressivism
is naturally allied with the various attempts to do without the Gricean intellectualization of communication
and convention (e.g. Burge, ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
signals. Thus, an expressivist must say how this kind of communication operates in the
linguistic realm in general, and the deontic realm in particular. Note, however, that it is
not problematic for the expressivist to admit that deontic discourse does not appear to
be psychological. On an expressivist view, psychological states are behind the scene but
not on stage. In fact this is likely another advantage to be celebrated by the expressivist.
The proper characterization of the psychological states relevant to the interpretation
of modals requires theoretical sophistication beyond our everyday grasp of our own
minds, e.g. orderings or probability spaces.
I will now proceed by presenting an entirely standard truth-conditional semantics
for propositional logic and highlighting how its approach to logic, compositionality
and communication depends on reference (§). I then critically examine attempts to
use an extension of this framework to solve the negation problem and to provide an
expressivist account of communication in §. That discussion will propel us towards a
dynamics semantics. I will first present a dynamic semantics for propositional logic to
illustrate the basic ways in which it departs from classical semantics (§). I will then,
in §, provide a dynamic semantics for deontic discourse and explain how dynamic
expressivism tackles the difficulties for other approaches.
18 This would be more clear in a first-order system where the ‘referent’ of an atomic sentence is
determined by the referents of its terms and predicate. But that would introduce unnecessary complexities.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
19 This allows me to compress the distinction between worlds and models, or between ways the world
could be and ways the atomic expressions could refer.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
The standard definitions of logical relations also rely essentially on reference, though
that is less than clear on a first pass.20
Definition (Consequence).
φ , . . . , φn ψ ⇔ ([[φ ]] ∩ · · · ∩ [[φn ]]) ⊆ [[ψ]]
Definition (Consistency).
φ , . . . , φn are consistent ⇔ ([[φ ]] ∩ · · · ∩ [[φn ]]) = Ø
On the referential picture of meaning sentences point to regions of logical space
by pointing at some distinction between possible worlds and describing the points
in the space as being on a particular side of that distinction. The consequence
relation requires φ , . . . , φn to collectively point to a subregion of what [[ψ]] points
to. This means that the distinctions made by φ , . . . , φn guarantee that the distinction
made by [[ψ]] has already been made. Similarly, consequence requires that φ , . . . , φn
collectively point to a non-empty region. This means that the distinctions they describe
can be satisfied by at least one world.
As before, there is a difficulty for an expressivist who would like to assign a meaning
[[φ]] which expresses psychological states without referring to them. Because both
definitions are entirely moored in the referential framework, it is simply unclear what
they would mean when extended to a variety of non-referential meaning. Consider
consistency. If one isn’t trying to refer to distinctions in worlds is it actually problematic
if there is not a world which bears all those distinctions simultaneously? This draws
out a feature that will be important below, and is often suppressed in discussions about
what an expressivist must do to explain the inconsistency of expressive sentences. Even
in entirely standard truth-conditional semantics, there is a teleological dimension to
the explanations of logical relations.
One might think that if an expressivist could assign denotations to non-referential
expressive sentences, then Definition could be simply applied to them. However,
such a minimal answer is arguably unsatisfactory even for non-expressive sentences.
When one seeks an explanation of A and ¬A’s inconsistency, one often wants an expla-
nation of why an agent cannot believe/assert both without changing their mind and an
explanation of why two agents which believe/assert these contraries are disagreeing.
Simply saying that the denotation of the two sentences has an empty overlap does
not fully address this question. What is wrong with simultaneously asserting/believing
sentences whose denotations have an empty overlap? Answering this question requires
saying something about what denotation, assertion and belief are such that empty
overlap results in some kind of rational tension for an agent. For example, one might
maintain that the function of belief and assertion is to represent the world (Stalnaker,
, ), and the denotation of a sentence captures how it represents the world to
be. Then, semantically inconsistent sentences represent the world as being some way
20 Geach () stresses the difficulty of capturing inferences involving expressive sentences.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
no world can be, so they are dysfunctional things to simultaneously believe or assert.
This explanation combines three resources.
Explaining Inconsistency Representational explanations of inconsistency explain
the defective nature of simultaneously believing/asserting two sentences by:
. Construing the denotation relation as the representation relation
. Construing the function of asserting/believing those sentences as representing
the world
. A definition of semantic consistency as empty overlap, and an explanation of why
that tracks dysfunctional assertions/beliefs.
Agreement and disagreement can then be thought of as consistency between the
beliefs/assertions of two agents. This broader understanding of what it means to
explain inconsistency clarifies both the challenge and prospects for an expressivist.
The prospect is clear: they need an alternative conception of the denotation rela-
tion, a different construal of the function sentences serve, a formal definition of
consistency and an explanation of how that formal definition tracks dysfunctional
assertions/beliefs.21 Now for the challenge. The expressivist can semantically treat
the sensitivity to psychological states exactly as one would treat referential discourse,
but then they will face two challenges. They must have a different account of why
empty overlaps in denotation are bad, since they believe the function of expressive
discourse is non-representational. Indeed, it seems that anything in the neighborhood
of the explanatory expressivist thesis commits them to appealing here in an essential
way to the non-representational attitudes that are supposed to explain the semantic
properties of expressive sentences. Additionally, the expressivist must explain how that
meaning, which is formally modeled with means suited to representational meaning,
is used in a non-representational form of communication. In the next section I will
argue that recent expressivist analyses (Silk, ; Yalcin, , , ) appear to
explain inconsistency, but it is not actually clear that they meet the first challenge.
Further, I will argue that they do not meet the second.
21 Charlow () construes explaining consistency more narrowly: deriving a contradiction in the
meta-language from the assumption that the sentences are jointly true/satisfied. Charlow (, §.) argues
that a psychologist semantics is not entitled to assume in such a derivation that irrational psychological
states do not exist, since they are logically possible. I’m taking a broader view here, which begins with the
fact that content is functionally grounded. Setting aside which approach is better, note that an account like
the above can happily admit the existence of irrational states of mind and still explain inconsistencies. They
are essential to that explanation!
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
().22 Both approaches adopt a truth-conditional semantics, but one with a format
that is different from the above: it assigns a sentence to a truth-value relative to a world
w and, crucially, a practical parameter. For Silk (), the practical parameter is a
weak-preference (pre)ordering (reflexive, transitive) on possible worlds, i.e. w w
means w is preferred at least as much as w , and when w w also holds w and
w are equally preferred. Yalcin (, §) stays closer to Gibbard (), and uses
‘hyperplans’.23 I’ll focus on Silk’s () semantics, since it will be closer to the view
I develop later. Both analyses follow Gibbard () in proposing truth-conditions
that do not place any constraint on the world parameter, only the practical parameter.
For Silk (), this means placing a constraint only on the parameter: there is
no mention of the world of evaluation w on the right-hand side of the semantic
clauses. Deontic modals constrain by requiring something of the worlds that are
best according to , i.e. worlds that are at least as preferable as every other world.
Definition (Preference Expressivism, Silk , ).
Where Best() = {w | ∀w : w w if w w}:
. [[Must(φ)]]w, = ⇐⇒ ∀w ∈ Best(): [[φ]]w , =
• Must(φ) is true at w, iff all the worlds best according to are φ-worlds.
22 I am tempted, but hesitant, to categorize the account proposed by Charlow (, ) along with
the pragmatic expressivists. The view developed there has a more nuanced and developed metasemantics
that make this categorization unclear. How exactly the points made in this section relate to these voices will
have to be left for another day.
23 Unlike Gibbard (), Yalcin (, §) constructs each hyperplan h out of possibilia: h is a function
that maps a set of worlds s to one of its subsets, namely those worlds permissible according to the hyperplan
given that one is in circumstances s.
24 [[¬φ]]
w, = ⇐⇒ [[φ]]w, =
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
represents such a body of preferences. Preferential incoherence appears to be the right kind of
inconsistency in attitude to explain the inconsistency between normative sentences ‘φ’ and ‘¬φ’.
(Silk, , )
The key idea is that the two sentences express preferences that are incoherent. It is
crucial for the expressivist that this incoherence is non-representational. This is where
I think there is room for skepticism. The kinds of non-representational coherence
constraints placed on preferences are like the one discussed in §: no cyclic orderings.
That is because such orderings exist and would fail to motivate choice. There is no
parallel coherence constraint on preferences in decision theory that says one cannot
have all of one’s most weakly preferred alternatives be ones where A and some of
one’s most weakly preferred alternatives be ones where ¬A. It’s not that there is a
preference ordering which does have these two properties but would fail to motivate
choice. It’s that no preference ordering meets the two descriptions at all. It is no more a
practical constraint on preference orderings than the fact that they cannot both range
over ten alternatives and twelve alternatives. The nonexistence of such a preference
ordering does not follow from any background theory about how preference motivates
choice, but from the fact that they are inconsistent ways of representing preferences.
More generally, if there is a genuine appeal to preferential incoherence and not
representational inconsistency here, then one should be able to construct an ordering
which has these two properties and show how it fails to motivate choice. Since the
descriptions are logically incompatible, I do not see how this is possible.
Another way of illustrating the problem here is to return to the observations of
§. There I motivated the idea that the natural way to combine preferences is to
union them. This means that the motivational consistency of two preference orderings
amounts to whether their union can serve its function of motivating choices. Now,
consider two orderings and over one A-world w and one ¬A-world w .
Suppose then that w w and w w , while the other ordering is just the
opposite: w w and w w . Must(A) then expresses while ¬Must(A)
expresses . If the two orderings are combined into one, call it , a perfectly
rational ordering results where one is indifferent between A and ¬A: w w
and w w . Indeed, since is also expressed by Must(¬A) the same problem
arises for explaining the rational tension produced by committing oneself to both
Must(A) and Must(¬A).25 Yet, is a perfectly rational state of preference. There are
no background decision-theoretic principles the preference expressivist can appeal to
here to render this ordering incoherent. There is, however, an important lesson one can
take away from this discussion. Solving the external negation problem by appealing to
incoherent preferences may not be possible.
25 To some degree, this latter point is a bug of Silk’s formalization using weak preferences. Weak
preferences are ill-suited to the job here because it is impossible to distinguish between an irrational
symmetric strict preference and indifference when you take weak preference as basic and define strict
preference as an asymmetric preference. For this reason, I use strict preferences in §.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Let me now turn to the question of how deontic modals with the semantics in
Definition could be used to communicate preferences without referring to them
and describing them, i.e. representing them. This is where Yalcin (, §) makes a
valuable proposal.26 Beginning with the dynamics of language use, the view is that the
common ground of a conversation consists not just of mutually supposed information
s = {w , . . . , wn , . . .} but also a set of preferences P = { , . . . , n , . . .} (Yalcin, ,
p. ). While descriptive discourse constrains the world and eliminates worlds from s,
deontic language constrains preferences and eliminates preference orderings from P.
In brief: each type of sentence affects the common ground in a different way.27 Since
s is a representational state and P is a motivational state this distinction appears apt
for distinguishing representational from motivational discourse. Yalcin’s (, p. )
sketch suggests a formalization in the following territory:28
Pragmatic Expressivist Discourse Model
. A descriptive sentence φ expresses a constraint on how the world can be, i.e. the
set of left-indices that make it true: φ = {w | ∃: [[φ]]w, = }
. An expressive deontic sentence φ expresses a constraint on preferences, i.e. the
set of right-indices that make it true: φ = { | ∃w: [[φ]]w, = }
. Common ground: C = sC , PC , where sC = {w , . . . , wn }, PC = { , . . . , m }
• Background assumption: sC is the ways the world could be given what the
agents are taking for granted
• Background assumption: PC is the preferences compatible with the situations
the agents’ are in favor of
. Representational communication: C φ = sC ∩ φ , PC
. Expressive communication: C φ = sC , PC ∩ φ
How does this allow one to distinguish referring to and describing preferences
from expressing preferences? The idea seems to be that referring to and describ-
ing preferences eliminate worlds while expressing preferences eliminates orderings.
The problem is that this interpretation of the model is entirely optional. Since the
dynamics of expressive and representational content are identical, the system could
be reformulated in terms of updating a set of world-ordering pairs, or centered-
worlds, which distinguish ways the world could be from the perspective of a preference
ordering. Then one can easily describe the system as behaving as referring to and
describing preferences, just as worlds centering on individuals are said to refer to and
describe an individual (Egan, ). The only way to clearly distinguish expressive
26 See Rothschild (); Swanson (forthcoming); and Moss () for related work. Ninan ()
presents a view that is ambivalent between semantic and pragmatic expressivism. The examination of
pragmatic expressivism through examining embedded must claims is a strategy Ninan (, §.) already
highlights as an important way to probe these views. See note in § for more discussion of Ninan ().
27 The terminology of expressing constraints comes from Swanson (; forthcoming).
28 The notation of φ and φ is novel here, but iconic: [[φ]] is a set of pairs w, while φ is the
set of left members (worlds) and φ is the set of right members (orderings).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
from representational communication is to motivate and adopt a model where the two
kinds of discourse have clearly different dynamics, say one where expressive sentences
do not refer to orderings and place constraints on them. It is hard to see how one can
do this without departing from truth-conditional semantics. As discussed above, this
difference also seems necessary for a solution to the negation problem. It turns out
there is a rather similar issue with disjunction.
Mixed disjunctions provide a hard challenge for the expressivist approach above.
It is hard to see how they can be explained while maintaining that expressive and
representational communication have distinct communicative effects. The challenge is
related to Schroeder’s () concerns about mixed disjunctions in belief ascriptions.
But my focus will be on what mixed disjunctions communicate.29 Suppose you know
that exactly two things make Alex happy, namely Pat being in town or Alex receiving
permission to skateboard, and I ask Why is Alex happy? you might answer:
() Either Pat is in town or Alex may skateboard.
Consider how this sentence would update the common ground on the pragmatic
expressivist model above. The standard semantics for disjunction gives us the follow-
ing truth-conditions and corresponding constraints:
() [[P]]w, = or [[May(S)]]w, =
() P ∨ May(S) = {w | ∃: [[P]]w, = or [[May(S)]]w, = }
() P ∨ May(S) = { | ∃w: [[P]]w, = or [[May(S)]]w, = }
The problem for the world constraint is that the deontic disjunct will allow every world
to meet this condition. There is a preference ordering that makes May(S) true and if
one world-ordering pair featuring that ordering makes it true, then any other pair
with that ordering makes it true: deontics do not constrain the world parameter. The
same problem arises for the preference constraint, since there is a world that makes
P true and P does not constrain the ordering parameter. This predicts that a mixed
disjunction cannot communicate anything, contrary to our intuitions about ().30
There is an alternative analysis of disjunction that pragmatic expressivists could
adopt to capture this data. Suppose we took common grounds to be sets of the old
common grounds: C = {s , P , . . .}. For any non-disjunctive sentence, proceed as
before, only applying the update to each information state or preference set. But for
disjunctions, don’t use truth conditions, but rather perform a special update process.
Take the result of updating C with the left disjunct and union it with the result of
29 Mixed disjunctions are a focal point in the arguments I’ve offered for dynamic analyses of imperatives
(Starr, , ).
30 Silk (, §) offers a modified semantics designed to solve Schroeder’s () disjunction problem.
But this results in practical sentences that constrain the world parameter and hence update S, and
representational ones that constrain the preference parameter and hence update the preference set. This
solves the problem at the expense of having a way of neatly distinguishing expressive from representational
communication.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
updating with the right disjunct. To illustrate how that would work, consider a simple
common ground where there is one pair: C = {{w , w }, { , }}. Suppose w
is a P-world, w is a ¬P-world, makes May(S) true and makes May(S)
false. Then updating with () will produce C = {{w }, { , }, {w , w }, { }}.
We may interpret these new common grounds as ‘imprecise common grounds’, on
analogy with imprecise credences. The agents’ states of mind are not precise enough
to distinguish which state they are in.31 This revised analysis can be interpreted in
two ways. Perhaps disjunction does not have a truth-conditional semantics, but only
a distinctive pragmatic update rule that is different from that which applies to every
other kind of sentence. This assumption conflicts with the received methodology of
semantics and pragmatics. No connective has only a pragmatics. It does, however,
suggest a more radical approach: what if all sentence forms were given update rules as
their semantics? This more radical approach is simply dynamic semantics, and will be
pursued in the next section.
31 This interpretation and solution are a wink to Rothschild () who, essentially, uses lifted states
like this to address a related problem for pure expressive disjunctions. Mixed disjunctions, however, are
not modeled there. This is the only reason Rothschild () can do without a special disjunction pragmatic
update rule. For more on the motivations for imprecise credences and pointers to that literature see Halpern
(, §.).
32 Unlike Veltman (, p. ) but like Groenendijk et al. (, §) I will treat conjunction as
sequential update. Unlike Groenendijk et al. (, §) but like Veltman (, p. ) I will treat disjunction
as forming the union of parallel updates to the initial state.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
result from interpreting φ. Intuitively, this means saying how processing φ requires
the agents’ to adjust the information they are assuming in their exchange. To save a
few words I will say that s is the result of updating s with φ.
Definition (Compositional Semantics).
() s[p] = {w ∈ s | w(p) = } () s[¬φ] = s − s[φ]
() s[φ ∧ ψ] = (s[φ])[ψ] () s[φ ∨ ψ] = s[φ] ∪ s[ψ]
Atomic sentences eliminate possibilities incompatible with their truth. For complex
sentences, the equations specify how the interpretation of φ depends on interpreting
its constituents. Negation eliminates the possibilities compatible with its scope. Con-
junctions update with each of their conjuncts in sequence. Disjunctions update with
each of their disjuncts in parallel and form the union of each result. The clause for
atomic sentences directly appeals to truth/reference in specifying the interpretation
procedure for an atomic sentence. But note that unlike in the classical semantics
the connectives here are not given a representational interpretation. Intuitively, they
do not work by increasing the precision with which one can refer to distinctions
between possible worlds. They increase the space of transitions between states the
language can traverse. This different understanding of expressive power is the crucial
difference between dynamic and classical semantics. It allows one to conceive of two
different ways to make the language more expressive: add operators which discern
finer distinctions in the space of possible worlds, and add operators which trace more
transitions through the space of states. The key in understanding the relationships
between the two forms of semantics comes to this: what kinds of transitions amount
to drawing distinctions between possible worlds and what kinds of transitions amount
to something else? One way of answering this in the present context is considering
which operators exploit operating on a whole set of worlds rather than considering
each world in isolation. In systems such as that above where s[φ] ⊆ s, this formal
difference amounts to whether or not the update is distributive.33
Definition (Distributive). s[φ] is distributive just in case s[φ] = {{w}[φ] | w ∈ s}.
The semantics above is distributive, and so the underlying language does not ‘do
anything’ which does not amount to making distinctions between possible worlds.
This is what I will eventually give up. The possibility of doing so hangs crucially on
the fact that dynamic semantics furnishes logical concepts more general than those
employed in truth-conditional semantics.
A more general conception of meaning is molded by a more general conception of
logic. Instead of truth, dynamic semantics builds its logic on support.
Definition (Support, Truth in w).
() Support s φ ⇔ s[φ] = s () Truth in w w φ ⇔ {w}[φ] = {w}
33 This property is sometimes also called continuity, and has been isolated as a key feature by van Eijck
and Visser (, §.), Muskens et al. (, p. ) and van Benthem (, p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
34 This definition is mentioned, but not adopted, by Muskens et al. (, p. ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
35 Some gloss w ∼ w as indifference but this gives the misleading impression that the agent doesn’t care
about w and w at all.
36 Appropriately capturing the distinction between ¬Must(φ) and Must(¬φ) is a well-documented
difficulty for expressivism (Unwin, ; Schroeder, c; Dreier, ), and so one this analysis must
solve.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
be reducible to choice behavior, but I will assume that preferences have a function:
to motivate choice. This functional account assumes that rational preferences have
the properties they do because those are the properties they must have to achieve
their function of motivating choice. This idea about the function of preferences
will be important when I discuss consistency, and it should recall the discussion of
consistency at the end of §.
Even though I will not require preferences to be rational, their rationality will
nonetheless be relevant. It is important to be clear about how. Their first relevance
will be in the pragmatics: I will assume that a speech act which brings about irrational
preferences is unstable: the discourse participants will not be satisfied with it. Their
second relevance is in the semantics for deontic modals. For example, Must(φ) will
test whether the alternatives which may be rationally chosen are the φ-alternatives. If
the underlying preferences are not rational, this test is sure to fail since no alternatives
may be rationally chosen from irrational preferences. This means that I must say
more about how alternatives are rationally chosen, and which properties this requires
preferences to have. This is obviously an enormous and delicate issue, but all I will need
below is two plausible claims about rational choice: (i) rational agents can choose any
alternative which is not dominated by—strictly less preferred than—an alternative; (ii)
if w can be rationally chosen and w ∼ w , then w can be chosen.37 The set of rational
alternatives to choose from a space of worlds s can then be defined as follows.
Definition (Choice Possibilities).
Choice(s, ) := {w ∈ s | w ∈ Nd or ∃w ∈ Nd: w ∼ w }
• Nd(s, ) := {w ∈ s | w ∈ s: w w}
• Choice possibilities in s are either non-dominated, or equally preferable to some
w ∈ s that is.
On this conception of choice, a cyclic strict preference ordering such as w w , w
w has the function of promoting the choice of both w and w , but discounts both. It is
thereby irrational. There are many other properties rational preference must have on
this definition, but asymmetry is the only one that will be crucial below.
I seem to have arrived at a good model of states: a space of worlds and a preference
frame s, . But, for the reasons discussed at the end of §, disjunctions featuring
deontic modals challenge such a model. At least in the static setting this challenge
motivated a higher type for states, namely a set of these pairs. I will also adopt this
higher type. Here’s why. Suppose s, was the form states take. A standard idea is
that Must(φ) tests that the ‘choice’ worlds in s, are the φ-worlds.38
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
39 Veltman’s (, p. ) normally works this way, but I will pursue yet another way.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
equally good, just that she can’t say which is better. Whatever she is conveying, it
seems to leave both orderings open. A simple move can capture this idea: states are sets
of s , .40 So interpreting Alex’s disjunction results in the state {s , L , s , R }.
The idea is not that agents posses a representation of their preferences which fails to
distinguish these two options. Instead, both preferences are competing for control of
their choices. Given her epistemic state, all Alex can do is induce this competition
among Pat’s preferences.41
. The Semantics
Recall the guiding slogan for the expressivist: Must(φ) promotes certain motivational
attitudes towards φ.42 The difficult part for semantically implementing this slogan has
been ‘promotion’. The essential idea of the semantics I will offer is that this promotion
consists in adding a strict preference for φ-worlds. More precisely, Must(φ) extends
each existing strict preference ordering by ranking each live φ-world over each live
¬φ-world. This preference update is defined below as φ+(). It simply extends
with a strict preference for each φ-world over each ¬φ-world and leaves ∼ as it was.
Crucially, the semantics also tests, for every substate, that this augmented ordering
makes φ a practical necessity: the choice worlds are the φ-worlds.43 This test relates
the preference update to the information supposed in the interaction.44
Definition (Dynamic Expressivist Semantics).
⎧
⎨{sφ+() | s ∈ S} if ∀s ∈ S: Choice(sφ+() ) = s
φ
S[Must(φ)] =
⎩{∅φ+() | s ∈ S} otherwise
• s
φ := ({s }[φ])
◦ sφ is the set of φ-worlds in s.
40 This allows variation along both axes: {s , , s , }, {s , , s , }, and {s , , s , }
are all states. Variation along the informational axis is needed for mixed disjunctions such as A ∨ Must(B).
41 This idea of various states competing for control of an agent’s actions is widespread in artificial
intelligence, in particular in the use of evolutionary algorithms to model adaptive problem solving. See
Franklin (, Chapter ) for a helpful overview. This seems in the spirit of Blackburn’s () being ‘tied
to a tree.’
42 Ninan () is a clear predecessor of this view, but is ambivalent about whether the ordering-
effect is semantic. Making this effect semantic is essential for expressivist purposes: it makes Must(A) and
¬Must(A) inconsistent in a non-representational way.
43 I here define May(φ) as the dual of Must(φ), but a less conservative option is worth noting: May(φ)
creates a new substate in which φ-worlds are preferred to ¬φ-worlds, and tests that φ is consistent
with the choice worlds in each substate. Given the logical definitions to follow this predicts free-choice
permission patterns: May(A) ∨ May(B) May(A) and May(A ∨ B) May(A) while ¬May(A ∨ B)
¬May(A) ∧ ¬May(B).
44 Note that this account captures both performative and descriptive uses of deontic modals within the
same semantics: descriptive ones are those issued in a state that already contains the preference it promotes
and performative ones are those issue in a state that previously lacked that preference. Of course, pragmatic
social conditions concerning authority govern when performative uses are felicitous.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
After all, this is a model of declarative modal sentences and interacting with informa-
tion is the function of declaratives. The fact that they have some relation to information
makes them suitable to negate and embed under believe. But they hardly work by
providing information, and they certainly do not eliminate worlds by referring to a
distinction in each world. Their key contribution is the promotion of preferences and
this seems on track for expressivist aims. It is rather crucial for these aims that the
test component of the meaning is not redundant. After all, that would mean that an
update with any deontic modal claim will succeed. To see why the preference update
does not guarantee that the test will be passed, consider the fact that the incoming
preferences may be inhospitable. The test asks whether the alternatives a rational
agent could choose according to the augmented preferences are φ-alternatives. But
if the augmentation leads to an irrational body of preferences, then the choice set
will be empty. (That’s why those preferences are irrational.) To illustrate, consider
a simple scenario where the agents have narrowed in on two worlds, an A-world
w and a ¬A-world w . Further suppose the agents’ only strict preference is for w
over w and the agents only equally prefer each world to itself: = {w , w } and
∼ = {w , w , w , w }. The agents’ state is then modeled as {{w , w } }. The first
thing that an update with Must(A) will do is create a new preference ordering where
w is also strictly preferred to w : A+( ) = {w , w , w , w }, ∼ . But such a
preference ordering is irrational. Its symmetric preference guarantees that there are
+
no non-dominated worlds, thus: Choice({w , w }A ( ) ) = ∅. That means Must(A) is
testing that there are no A-worlds. Since w is an A-world, this test is failed, resulting
+
in the state ∅A ( ) . The information ∅ in this result state reflects that it is not stable
and the preferences explain why: A+( ) is irrational. The agents then face a choice of
which preference to jettison. I do not have space to elaborate how I see that pragmatic
process unfolding, but suffice it to say that it may involve debate about the particular
preferences, or less conscious deference based on social influences.
As it turns out, is the ordering that would have resulted from updating a
state with an empty strict ordering and trivial equality ordering {{w , w }∅,∼ } with
Must(¬A). This allows me to sketch how I propose to explain the inconsistency of
Must(¬A) and Must(A): there is no rational preference ordering which contains both
the preferences they promote. If the state is going to contain the preferences Must(A)
promotes, it is going to strictly prefer A-worlds to ¬A-worlds. If the state is going to
contain the preferences Must(¬A) promotes, it is going to strictly prefer ¬A-worlds to
A-worlds. But any state containing both will be irrational in the sense that it contains
dysfunctional preferences. To fill out this explanation, and treat the harder case of
Must(A) and ¬Must(A), a semantics for negation (and atomics) is needed.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Apart from negation, the semantics for the rest of the language parallels Definition .
Atomics eliminate worlds where they are false from each substate. Conjunction is
still sequential update and disjunction is again the union of parallel updates to the
initial state. This semantics for disjunction does generate a new behavior, but I will
return to that. Focus now on negation: ¬φ not only removes the information φ would
provide (as before), it removes the preferences that φ would promote.45 This is done
by subtracting any preferences that φ would add to an empty ordering.
Definition (Dynamic Expressivist Boolean Semantics).
. S[p] = {{w ∈ s | w(p) = } | s ∈ S}
−
. S[¬φ] = {sφ () − ({s }[φ]) | s ∈ S}
• φ−() := − {w, w ∈ i | {W ∅,= }[φ] = {s m
, . . . , sn } & i
m}, ∼
• φ−() removes from any pairs that updating with φ would add to an
empty ordering. For non-expressive discourse this will idle. If φ = Must(ψ)
this will extract preferences for ψ-worlds over ¬ψ-worlds.
. S[φ ∧ ψ] = S[φ][ψ]
. S[φ ∨ ψ] = S[φ] ∪ S[φ]
Since descriptive discourse never adds preferences, negations of descriptive sentences
will behave exactly as before. However, when an expressive sentence is negated both the
preferences it would promote and the information it would provide are removed. Con-
sider again the example from above where testing the state {{w , w } } with Must(A)
+
failed, returning ∅A ( ) . What if one updates that same state with ¬Must(A) instead?
The semantics predicts that this returns the initial state {{w , w } } via the following
+
process. It would first arrive at the state ∅A ( ) and then find the informational
difference {w , w } − ∅ and the preferential difference Must(A)−(A+( )). Since
Must(A)−(A+( )) = ,46 this results in the original state {{w , w } }. The fact that
negation dynamically manipulates preferences in this way guarantees that there will
be no state, not even an irrational one, which contains the preferences both Must(A)
and ¬Must(A) promote. If that state already has the preferences in it that Must(A)
would add, then ¬Must(A) would remove them. Conversely, if the state already lacks
the preferences ¬Must(A) would remove then Must(A) would add them. This is the
gist of my expressivist explanation of why Must(A) and ¬Must(A) are inconsistent.
45 This analysis of negation is inspired by the converse operator of Dynamic Logic (Harel et al., ,
p. ), which serves as a variety of program negation. While there is a growing literature on using dynamic
logic for deontic reasoning, it does not directly speak to the key issues here.
46 Recall that + − +
= {w , w }, so A ( ) = {w , w , w , w }. Then Must(A) (A ( )) =
Must(A)−({w , w , w , w }). To find Must(A)−({w , w , w , w }) one removes any strict preference
that ends up in the result state of {W ∅,= }[Must(A)] = {W {w ,w ,...},= }. So w , w is removed from
{w , w , w , w } leaving {w , w } = .
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
However, recall that {{w , w } } is a state where the test imposed by Must(¬A)
is also successful. How exactly are ¬Must(A) and Must(¬A) distinguished on this
semantics? This is where states with equally preferred worlds are required.
Consider a state where w ∼ w , and there are no strict preferences. Intuitively,
this state supports ¬Must(A), but not Must(¬A). If one prefers A and ¬A equally,
one definitely isn’t compelled to choose only ¬A-worlds or to choose only A-
worlds. The first fact means that one rejects Must(¬A), while the second means
that one accepts ¬Must(A). To see how the formalism above replicates this intu-
itive reasoning, consider a state {s } where s = {w , w }, = ∅, and ∼ =
{w , w , w , w , w , w , w , w }. Updating {s } with Must(¬A) will augment
to = {w , w } (leaving ∼ as it was), and test whether the choice worlds relative to
the augmented ordering are the ¬A-worlds: {w }. The only non-dominated alternative
in is w , but recall that w ∼ w . Since any choice world is either non-dominated,
or equally preferred to a non-dominated one, the choice worlds are {w , w } and so
the test is failed. In the same way, the test for Must(A) will fail, and result in the
state {∅ }, where = {w , w }. But this means that updating the original state
with ¬Must(A) will involve taking the informational difference {w , w } − ∅ and
removing the preference w , w . And so an update of {s } with ¬Must(A) returns
{s } while an update of {s } with Must(¬A) returns {∅ }. In other words: {s }
supports ¬Must(A) but does not support Must(¬A).
It is important to note that I do not distinguish ¬Must(A) and Must(¬A) by
appealing to a state where there are two competing substates, one where A is strictly
preferred and one where ¬A is strictly preferred, e.g. {{w , w } , {w , w } }. That
is the kind of state that results from updating with Must(A) ∨ Must(¬A).47 This
is not a state which supports ¬Must(A). Updating with ¬Must(A) after updating
with the disjunction would change the preferences and lead one to rule out the
substate produced by the left disjunct, thereby bringing one to a state where only
the substate produced by the right disjunct remains. In other words, one would
infer Must(¬A). This illustrates why it is so important to distinguish ¬Must(A)
from Must(A) ∨ Must(¬A). Conflating them straightforwardly leads to a collapse of
¬Must(A) into Must(¬A) (Unwin, ; Schroeder, c; Dreier, ). Mention
of logical matters highlights the fact that above I have been relying on the logical
notions of support and consequence roughly in the vein of §. But it is crucial to make
these notions explicit and to note that they arise in the way expressivists require. In
particular it will be important to see how the semantic properties of deontic sentences
are explained by properties of the attitudes they express.
47 This dynamic semantics of disjunction comes from Starr () where it is used to analyze disjunctions
of imperatives, declaratives (and combinations thereof). It works by dynamically generalizing the idea
that disjunctions introduce alternative propositions (Kratzer and Shimoyama, ; Simons, ; Alonso-
Ovalle, ), to the idea that disjunctions introduce alternative updates.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
• PrefS = { | ∃s = ∅: s ∈ S}
Informational support is familiar from §, but preferential support tracks the addi-
tional expressive dimension added by deontic modals. It requires that the preference
frames in S stay the same after updating with φ and that φ does not add any preference
frames.49 This furnishes two notions of consequence and of consistency:
Definition (Informational Consequence). φ , . . . , φn ψ ⇐⇒ ∀S: S[φ ] · · ·
[φn ] ψ
Definition (Preferential Consequence). φ , . . . , φn ψ ⇐⇒ ∀S: S[φ ] · · ·
[φn ] ψ
Definition (Informational Consistency).
φ , . . . , φn are informationally consistent ⇐⇒ ∃S: iS = ∅ & S φ , . . . , S φn
Definition (Preferential Consistency).
φ , . . . , φn are preferentially consistent ⇐⇒ ∃S: Ch(S) = ∅ & S φ , . . . , S φn
• Where Ch(S) = {Choice(s, ) | s ∈ S}
48 There is actually a third to consider, choice-support, which produces a more classical logic for Must:
S choice-supports φ just in case {Choice(s, ) | s ∈ S} = {Choice(s, ) | s ∈ S[φ]}. This will
predict that both Must(φ) and Must(ψ) are consequences of Must(φ ∧ ψ). However it also predicts that
Must(φ ∨ ψ) is a consequence of Must(φ). See § for discussion of these principles.
49 It also discounts changes to the preferences that concern only problematic substates whose informa-
tion is contradictory: ∅ . Intuitively, those preferences are to be ignored since they are not constraining
agents’ choices: Choice(∅, ) will always be ∅. This restriction is needed to predict that Must(B) is a
preferential consequence of Must(A) ∨ Must(B) and ¬Must(A).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
updates. Crucially, they differ not just in the contents (preferences vs propositions)
they manipulate, but in how those contents are manipulated.50 The adding and
removing of preferences is quite different from the way descriptive discourse works.
Descriptive discourse provides information by making distinctions between worlds.
As I mentioned in §, this way of providing information can be formally charac-
terized as a distributive and eliminative update: s[φ] = {{w}[φ] | w ∈ s},
and s[φ] ⊆ s. One way of putting the problem for pragmatic expressivism was
that it was indistinguishable from a view where expressive discourse works via a
distributive and eliminative update over more fine-grained worlds such as w, . It
then becomes hard to elucidate the expressivist communication thesis, since it looks as
if expressive communication works by making distinctions between centered worlds,
and one of those distinctions is made by referring to preferences. But on the dynamic
view, the fact that preferences are added, while worlds are removed, guarantees that
preference update and information update cannot be the same process. Indeed, on
the dynamic account there are four kinds of updates: adding preferences (deontic
modals), removing preferences (negation), eliminating worlds (atomics, negation) and
creating substates (disjunction). Language compositionally interleaves these distinct
processes rather than providing a single semantic object which is factored into distinct
contributions.
Ultimately, the dynamic account makes room for the expressivist communica-
tion thesis by breaking from the traditional ‘conduit’ metaphor for communication:
expressing yourself is like putting a package on a conveyor belt between minds.
That metaphor assumes there is a discrete object shared and a single process by
which you unpack the box and sort the contents. On the dynamic approach, a better
metaphor would be two minds connected by a series of vibrating cords which encode
different modalities with different frequency ranges. Language is a virtuosic tool for
weaving these frequencies and modalities together. This departure from the conduit
metaphor is far from a debt for expressivists. It is one urged by researchers not debating
expressivism and spanning the fields of animal communication (Owren et al., ;
Scott-Phillips and Kirby, ) and natural language (Murray and Starr, ).
A Wider View
I want to conclude this chapter by taking a wider view and being clear about what
has and has not been accomplished. I claim to have offered a dynamic semantics
which vindicates the expressivist communication thesis, and done so better than
competing pragmatic accounts. Dynamic meanings were essential for implementing
the expressive effect of deontic modals, and generalizing our familiar semantics for
50 Varieties of update is an allusion to Murray (), which exploits dynamic semantics to composition-
ally interleave a number of distinct update processes. It is there applied to a number of constructions across
languages which manipulate but differentiate at-issue and not-at-issue content.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
negation so as to interact with that expressive effect. This provided a new expressivist
solution to the negation problem. Must(A) and ¬Must(A) have converse dynamic
effects, which predicts that they are inconsistent because there is no state that can
contain the converse effects of their updates. Must(A) and Must(¬A) are inconsistent
because only irrational preferences support them. It was argued that these explanations
of inconsistency perfectly parallel those for descriptive discourse. There is nevertheless
much work left to do. There are still several constructions which have not been
analysed here and may turn out to be problematic. For example, the interaction
of modals with attitude verbs, modal questions, deontic predicates/adverbs such as
wrong/wrongly, more complex modals such as ought, should, could and non-deontic
uses of must and may all lie outside the scope of the analysis here.51 Further-
more, at some point one would like a positive argument in favor of an expressivist
semantics over the impressive achievements of its descriptive competitors such as
Kratzer (, ). This must wait for another day, but some suggestive remarks
on a few fronts may help chart the territory where such an argument might be
found.
The compositional virtuosity of the dynamic account was essential for analyzing
mixed disjunctions. It explained how communicating Must(A) ∨ B was different
from communicating Must(A) or communicating B. The disjunction involves a
distinctive substate-creating update, where quite different updates are unleashed on
each incoming substate to create two new substates. It is worth highlighting how this
addresses the concern raised by Schroeder () about the interaction of expressive
modals and attitude verbs. That concern is whether BelX (Must(A) ∨ B) will entail
BelX (Must(A)) ∨ BelX (B). BelX (φ) can be analyzed as a descriptive update requiring
of a world w that the state SX modeling X’s state of mind in w both informationally
and preferentially supports φ. But consider a world in which SX is a state with two
substates: SX = {sA , sB }, the first has information s and an ordering which prefers
A-worlds, and the second lacks that particular preference but has the information
carried by B. SX will preferentially and informationally support Must(A) ∨ B. Yet
SX will not preferentially support Must(A), since PrefSX [Must(A)] will lack . Also, SX
will not informationally support B, since iSX [B] will exclude the ¬B worlds in s. Thus
BelX (Must(A) ∨ B) will not informationally entail BelX (Must(A)) ∨ BelX (B). The
success of this analysis would not surprise Schroeder (). He acknowledges that if
belief ascriptions mix information and motivation, and there is a special sort of state
for believing disjunctions, then the entailment can be blocked. What he doubts is a
philosophically plausible interpretation of these constructs. Even the small amount I
have said here suggests a way of resisting this doubt. A substate is an informational
and motivational perspective, and a state containing multiple substates captures the
perspectives competing for control over the agent’s actions. There seems to be no good
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
52 Charlow () explores these issues in much more depth. He pursues an account which I learned
much from and which bears at least some affinities with the account pursued here.
53 Given this analysis of imperatives, one may draw a comparison between the semantics above and
the analysis in Ninan () where imperatives and deontic must have the same effect on context. On the
analysis above, the semantics of deontic must includes that effect, but adds the crucial element of a test. This
approach also predicts Ninan’s () key data: ¬A ∧ Must(A) will be infelicitous. Further investigation
will be needed to see whether the added test element is empirically advantageous.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Murder Paradox (Forrester, ). Willer, (this volume, Chapter ) elaborates on the
relationship between this kind of non-monotonic approach and one based on default
logic. The semantics above makes two predictions that may add momentum to this
movement. Suppose a father tells his son that he must take out the trash or do the
dishes. The son proceeds to the trashcan only to find that it has already been taken
out. The son should infer that he must wash the dishes. However, if the son goes to do
the dishes and they too have been taken care of, he may no longer sensibly think he
must wash the dishes. These are facts classical theories do not predict:
() Must(A ∨ B), ¬A Must(B)
() Must(A ∨ B), ¬A, ¬B Must(B)
Just because the best worlds are all A ∨ B worlds and we aren’t in an A-world, it doesn’t
follow that all the best worlds are B-worlds. Further, the pattern in () seems to show
exactly the kind of non-monotonicity which classical consequence relations fail to
predict. By contrast, both patterns are predicted by the analysis above.
Now return to the scenario where the son has correctly inferred that he must do the
dishes. Note that he should not infer that he must do the dishes or watch a movie. He
also shouldn’t infer that either he must do the dishes or he must watch a movie:
() Must(B) Must(B ∨ C)
() Must(B) Must(B) ∨ Must(C)
This is a modal version of Ross’s Paradox (Ross, ) concerning imperatives. Their
validity in standard approaches to modality has been widely recognized as problem-
atic. Several attempts have been made to leverage pragmatic reasoning and a non-
standard semantics for disjunction to explain them (Zimmermann, ; Simons,
; Geurts, ; Aloni, ). Yet Cariani () and Lassiter () argue that a
strictly semantic approach is justified. While their semantic approach predicts (),
it does not, to my knowledge, predict (). The analysis above predicts both.54 While
none of these prospects constitutes a full empirical argument for the kind of dynamic
analysis pursued here, they illustrate some of the territory where those arguments may
evolve. It would, of course, also be unsurprising if dynamic expressivism proved useful
in the burgeoning literature on non-descriptive meaning.
Acknowledgments
I am extremely thankful to Andrew Alwood, Nate Charlow, Matthew Chrisman, Andy Egan,
Sarah Murray, Paul Portner, Angelika Kratzer, Aynat Rubenstein, Mark Schroeder, Alex Silk,
Justin Snedegar, Malte Willer and audiences at Edinburgh, NYU, Rutgers and USC, some for
54 Like Cariani () and Lassiter () the analysis here predicts that Must(A ∧ B) Must(A), which
is one way to explain Jackson’s () ‘professor procrastinate’ puzzle.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
detailed feedback and others for brief, incisive questions. Charlow and Chrisman deserve special
thanks, not only for editing this volume, but also for guiding this project towards properly
engaging with the expressivism literature. I should also acknowledge a debt to Yuna Won. I
have learned at least as much from advising her PhD research on deontic logic as she has learned
from me.
References
Aloni, M. () ‘Free Choice, Modals and Imperatives.’ Natural Language Semantics. . .
pp. –.
Alonso-Ovalle, L. () Disjunction in Alternative Semantics. Ph.D. thesis. UMass Amherst,
Amherst, MA.
Alwood, A. () ‘Nondescriptive Negation for Normative Sentences.’ (To be published in
Philosophical Quarterly.)
Anderson, L. and Lepore, E. () ‘Slurring Words.’ Noûs. . . pp. –.
Beaney, M. (ed.) () The Frege Reader. Malden, MA: Blackwell.
Bittner, M. () ‘Topical Referents for Individuals and Possibilities.’ In Hastings, R., Jackson, B.
and Zvolenszky, Z. (eds.) Proceedings from Semantics and Linguistic Theory . pp. –.
Ithaca, NY. Cornell University.
Blackburn, S. () ‘Attitudes and Contents.’ Ethics. . . pp. –.
Burge, T. () ‘On Knowledge and Convention.’ The Philosophical Review. . pp. –.
Carballo, AP. () ‘Semantic Hermeneutics.’ In Burgess, A. and Sherman, B. (eds.) Metaseman-
tics: New Essays on the Foundations of Meaning. pp. –. New York, NY: Oxford University
Press.
Cariani, F. () ‘ “Ought” and Resolution Semantics.’ Noûs. . . pp. –.
Charlow, N. () ‘Logic and Semantics for Imperatives.’ Journal of Philosophical Logic.
pp. –.
Charlow, N. () ‘The problem with the Frege–Geach problem.’ Philosophical Studies. . .
pp. –.
Charlow, N. () ‘Prospects for an Expressivist Theory of Meaning.’ Philosophers’ Imprint.
. . pp. –. [Online] Available from: http://hdl.handle.net//spo....
[Accessed Nov .]
Chisholm, R. M. () ‘Contrary-to-Duty Imperatives and Deontic Logic.’ Analysis. . .
pp. –.
Chrisman, M. () ‘On the Meaning of “Ought”. ’ In Shafer-Landau, R. (ed.) Oxford Studies
in Metaethics. Volume . pp. –. Oxford: Oxford University Press.
Dreier, J. () ‘Negation for Expressivists: a collection of problems with a suggestion for their
solution.’ In Shafer-Landau, R. (ed.) Oxford Studies in Metaethics. Volume . pp. –. New
York, NY: Oxford University Press.
Dreier, J. () ‘Relativism (And Expressivism) And The Problem Of Disagreement.’ Philosoph-
ical Perspectives. . . pp. –.
Egan, A. () ‘Epistemic Modals, Relativism and Assertion.’ Philosophical Studies. . .
pp. –.
Faller, M. () Semantics and Pragmatics of Evidentials in Cuzco Quechua. Ph.D. thesis.
Stanford University, Palo Alto, CA.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Forrester, J. W. () ‘Gentle Murder, or the Adverbial Samaritan.’ Journal of Philosophy. .
pp. –.
Franklin, S. () Artificial Minds. Cambridge, MA: MIT Press.
Frege, G. () Begriffschrift, eine der arithmetischen nachgebildete Formalsprache des reinen
Denkens. Halle: L. Nebert. pp. –. References to M. Beaney’s translation in Beaney ()
pp. –.
Frege, G. () ‘Logische Untersuchungen.’ Beiträge zur Philosophie des deutschen Idealismus.
. pp. –. References to Frege ().
Frege, G. () ‘Compound Thoughts.’ Mind. . . pp. –. Translation of Frege ().
Geach, P. T. () ‘Assertion.’ The Philosophical Review. . . pp. –.
Geurts, B. () ‘Entertaining Alternatives: Disjunctions as Modals.’ Natural Language Seman-
tics. . . pp. –.
Gibbard, A. () ‘An Expressivistic Theory of Normative Discourse.’ Ethics. . . pp. –.
Gibbard, A. () Thinking How to Live. Cambridge, MA: Harvard University Press.
Grice, H. P. () ‘Utterer’s Meaning, Sentence-Meaning, and Word-Meaning.’ Foundations of
Language. . pp. –.
Groenendijk, J. and Stokhof, M. () ‘Dynamic Predicate Logic.’ Linguistics and Philosophy.
. . pp. –.
Groenendijk, J., Stokhof, M. and Veltman, F. () ‘Coreference and Modality.’ In Lappin. S.
(ed.) The Handbook of Contemporary Semantic Theory. pp. –. Oxford: Blackwell.
Halpern, J. Y. () Reasoning about Uncertainty. Cambridge MA: MIT Press.
Hamblin, C. L. () ‘Questions in Montague English.’ Foundations of Language. . . pp. –.
Han, C. H. () ‘Force, negation and imperatives.’ Linguistic Review. . . pp. –.
Hansson, S. O. and Grüne-Yanoff, T. () ‘Preferences.’ In Zalta (ed.), The Stanford Encyclo-
pedia of Philosophy, fall. [Online] Available from: http://plato.stanford.edu/archives/fall/
entries/preferences/. [Accessed Nov .]
Harel, D., Kozen, D. and Tiuryn, J. () Dynamic Logic. Cambridge, MA: MIT Press.
Heim, I. R. () The Semantics of Definite and Indefinite Noun Phrases. Ph.D. thesis. University
of Massachusetts, Amherst, Massachusetts.
Heim, I. () ‘File Change Semantics and the Familiarity Theory of Definiteness.’ In
Bäuerle, R., Schwarze, C. and von Stechow, A. (eds.) Meaning, Use, and Interpretation of
Language. pp. –. Berlin: Walter de Gruyter.
Hobbs, J. R. () Literature and Cognition. Stanford, CA: Center for the Study of Language
and Information.
Jackson, F. () ‘On the Semantics and Logic of Obligation.’ Mind. . . pp. –.
Kamp, H. () ‘A Theory of Truth and Semantic Representation.’ In Groenendijk, J. A.,
Janssen, T. and Stokhof, M. (eds.) Formal Methods in the Study of Language. pp. –.
Dordrecht: Foris.
Kolodny, N. and MacFarlane, J. () ‘Ifs and Oughts.’ Journal of Philosophy. . . pp. –.
Kratzer, A. () ‘Modality.’ In von Stechow, A. and Wunderlich, D. (eds.) Semantics: An
International Handbook of Contemporary Research. pp. –. Berlin: De Gruyter Mouton.
Kratzer, A. () Modals and Conditionals: New and Revised Perspectives. New York, NY: Oxford
University Press.
Kratzer, A. and Shimoyama, J. () ‘Indeterminate Pronouns: the View from Japanese.’ In
Otsu, Y. (ed.) Proceedings of the Third Tokyo Conference on Psycholinguistics. Tokyo: Hituzi
Syobo.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Lassiter, D. () Measurement and Modality: the scalar basis of modal semantics. Ph.D. thesis.
New York University.
Lewis, D. K. () Convention: A Philosophical Study. Cambridge, MA: Harvard University
Press.
Mackie, J. L. () Ethics: Inventing Right and Wrong. New York: Penguin.
McCready, E. () ‘Emotive equilibria.’ Linguistics and Philosophy. . . pp. –.
Millikan, R. G. () Language, Thought and Other Biological Categories. Cambridge, MA: MIT
Press.
Millikan, R. G. () Language: A Biological Model. New York, NY: Oxford University Press.
Moore, G. E. () Ethics. Oxford: Oxford University Press.
Moss, S. () ‘On the Semantics and Pragmatics of Epistemic Vocabulary.’ Semantics and
Pragmatics. . , pp. –. [Online] Available from: http://dx.doi.org/./sp. [Acccessed
Nov .]
Murray, S. E. () Evidentiality and the Structure of Speech Acts. Ph.D. thesis. Rutgers
University.
Murray, S. E. () ‘Varieties of Update’. Semantics and Pragmatics. . , pp. –. [Online]
Available from: http://dx.doi.org/./sp... [Accessed Nov .]
Murray, S. E. and Starr, W. B. () ‘The Structure of Communicative Acts.’ Manuscript Cornell
University.
Muskens, R. van Benthem, J. and Visser, A. () ‘Dynamics.’ In van Benthem, J. and ter
Meulen, A. (eds.) Handbook of Logic and Language. st edition. pp. –. Cambridge, MA:
MIT Press.
Ninan, D. () ‘Two Puzzles About Deontic Necessity.’ In Gajewski, J., Hacquard, V. and
Yalcin, S. (eds.) New Work on Modality, MIT Working Papers in Linguistics. Volume .
Cambridge, MA: MIT Press.
Owren, M. J., Rendall, D. and Ryan, M. J. () ‘Redefining Animal Signaling: influence versus
information in communication.’ Biology and Philosophy. . . pp. –.
Portner, P. () Modality. New York, NY: Oxford University Press.
Portner, P. () ‘Permission and Choice.’ In Grewendorf, G. and Zimmermann, T. E. (eds.) Dis-
course and Grammar: From Sentence Types to Lexical Categories. Berlin: Mouton de Gruyter.
Potts, C. () ‘The Expressive Dimension.’ Theoretical Linguistics. . . pp. –.
Ridge, M. () Impassioned Belief. New York, NY: Oxford University Press.
Ross, A. () ‘Imperatives and Logic.’ Theoria. . . pp. –. References to Ross ().
Ross, A. () ‘Imperatives and Logic.’ Philosophy of Science. . . pp. –.
Rothschild, D. () ‘Expressing Credences.’ Proceedings of the Aristotelian Society. . .
pp. –.
Schroeder, M. (a) Being For: Evaluating the Semantic Program of Expressivism. New York,
NY: Oxford University Press.
Schroeder, M. (b) ‘Expression for Expressivists.’ Philosophy and Phenomenological Research
. . pp. –.
Schroeder, M. (c) ‘How Expressivists Can and Should Solve Their Problem with Negation.’
Noûs. . . pp. –.
Schroeder, M. (d) ‘What is the Frege-Geach Problem?’ Philosophy Compass. . .
pp. –.
Schroeder, M. () ‘Two Roles for Propositions: Cause for Divorce?’ Noûs. . . pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Schroeder, M. () Attitudes and Epistemics. Expressing Our Attitudes: explanation and
expression in ethics. Volume II. New York, NY: Oxford University Press.
Scott-Phillips, T. and Kirby, S. () ‘Information, Influence and Inference in Language
Evolution.’ In Stegmann. U. (ed.) Animal Communication Theory: information and influence.
pp. –. Cambridge: Cambridge University Press.
Silk, A. () ‘Truth-Conditions and the Meanings of Ethical Terms.’ In Shafer-Landau, R. (ed.)
Oxford Studies in Metaethics. Volume . pp. –. New York, NY: Oxford University Press.
Silk, A. () ‘How to Be an Ethical Expressivist.’ Philosophy and Phenomenological Research.
. . pp. –.
Simons, M. () ‘Dividing Things Up: The Semantics of or and the Modal/or Interaction.’
Natural Language Semantics. . . pp. –.
Stalnaker, R. C. () ‘Assertion.’ In Cole, P. (ed.) Syntax and Semantics : Pragmatics. pp.
–. New York, NY: Academic Press. References to Stalnaker ().
Stalnaker, R. C. () Inquiry. Cambridge, MA: MIT Press.
Stalnaker, R. C. () Context and Content: Essays on Intentionality in Speech and Thought.
Oxford: Oxford: Oxford Univsersity Press.
Starr, W. B. () Conditionals, Meaning and Mood. Ph.D. thesis. Rutgers University.
Starr, W. B. () ‘A Preference Semantics for Imperatives.’ Ms. Cornell University.
Swanson, E. () Interactions with Context. Ph.D. thesis. MIT.
Swanson, E. (forthcoming) ‘The Application of Constraint Semantics to the Language
of Subjective Uncertainty.’ Journal of Philosophical Logic. [Online] Available from:
link.springer.com/article/.%F---. [Accessed Nov .]
Unwin, N. () ‘Norms and Negation: A Problem for Gibbard’s Logic.’ The Philosophical
Quarterly. . . pp. –.
van Benthem, J. () ‘Semantic Parallels in Natural Language and Computation.’ In
Ebbinghaus, H. D., Fernandez-Prida, J., Garrido, M., Lascar, D. and Artalejo, M. R. (eds.)
Logic Colloquium ’: Proceedings of the Colloquium held in Granada, Spain. Volume of
Studies in Logic and the Foundations of Mathematics. pp. –. Amsterdam: North-Holland.
van Benthem, J. and Liu, F. () ‘Dynamic Logic of Preference Upgrade.’ Journal of Applied
Non-Classical Logics. . . pp. –.
van Eijck, J. and Visser, A. () ‘Dynamic Semantics.’ In Zalta (ed.) The Stanford
Encyclopedia of Philosophy, winter. [Online]. Available from: http://plato.stanford.edu/
archives/win/entries/dynamic-semantics/. [Accessed Nov .]
Veltman, F. () ‘Defaults in Update Semantics.’ Journal of Philosophical Logic. . .
pp. –.
Willer, M. () ‘Dynamic Thoughts on Ifs and Oughts.’ Philosophers’ Imprint. . . pp. –.
[Online] Available from: http://hdl.handle.net//spo.... [Accessed Nov
.]
Yalcin, S. () ‘Epistemic modals.’ Mind. . . pp. –.
Yalcin, S. () ‘Nonfactualism about Epistemic Modality.’ In Egan, A. and Weatherson, B.
(eds.) Epistemic Modality. pp. –. New York, NY: Oxford University Press.
Yalcin, S. () ‘Bayesian Expressivism.’ Proceedings of the Aristotelian Society Supplementary
Volume. . . pp. –.
Zimmermann, T. E. () ‘Free Choice Disjunction and Epistemic Possibility.’ Natural
Language Semantics. . . pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Metanormative Theory and the
Meaning of Deontic Modals
Matthew Chrisman
Introduction
Metanormative theory is, in part, about the meaning of sentences such as “One ought
always act for reasons for which one could consistently allow everyone else to act as
well,” and “One should proportion degree of belief solely in accordance with evidence.”
There are other normative terms, of course, but ‘ought’ (≈ ‘should’), is clearly one of
the handful of core normative terms, and I will focus on it here.1
Given this focus, here is a way to put an idea that I think many philosophers
find attractive: ought-claims prescribe possible action, thought, and feeling, rather
than describing how things actually stand in reality. Arguably, this idea underwrites
the popularity of the is/ought divide, explains some of the attraction in Moore’s
Open Question Argument, and motivates some of the interest philosophers have had
in noncognitivist, emotivist, prescriptivist, projectivist, and expressivist accounts of
normative thought and discourse.
Here is another popular line of thought that has sometimes seemed to be in tension
with that initial idea: A central part of the best overall account of the meaning of
declarative sentences will explain how their truth conditions can be derived from the
semantic values of their basic components and their logical form. The word ‘ought’
can, of course, figure in nondeclarative sentences, but the ought-sentences primarily
at issue in metanormative theory are grammatically declarative. This implies that
our overall theory of meaning should provide a compositional assignment of truth
conditions to ought-sentences as a part of its treatment of declarative sentences more
generally. This seems necessary for accounting for the fact that it is obviously mean-
ingful to embed ought-sentences in propositional contexts, such as under the truth
1 Returning only very briefly at the end to reflect on the broader issue of how my conclusions here might
bear on similar issues regarding other normative and evaluative vocabulary.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
2 Nothing I say in this paper will engage substantively with the question of whether a truth-conditionalist
approach to compositional semantics is preferable in general to some dynamic alternative such as one
based on Discourse Representation Theory (Kamp, ), File Change Semantics (Heim, ), or Update
Semantics (Veltman, , ). Here I shall simply assume (along with most metanormative theorists) that
a truth-conditionalist approach is the default position for nonnormative declaratives, and so the interesting
question is what to make of the fact that it is also appropriate for normative declaratives.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
3 By using language of “in virtue of ” here, I mean to indicate that they are very general grounding
explanations that seek to identify the kinds of nonsemantic facts that undergird the semantic facts postulated
by empirical semantics. Compare Charlow (a, p. ) and Yalcin (, p. ). Some (e.g. Speaks, )
may prefer the term “foundational theory of meaning” or “theory of the foundations of meaning” for what I
am calling metasemantics. Yalcin () surveys arguments for a distinction between content and semantic
value, where the former is (at least) what is said by uses of a sentence and what grounds the intentionality
and causal efficacy of mental states expressed by such uses, and the latter is what is attributed to sentences to
explain various semantic facts, e.g. about productivity of language, entailment relations between sentences,
the un/acceptability or un/interpretability of particular sentences, etc. Given this distinction, there are two
domains about which we might ask second-order questions: we can ask about what it is in virtue of which
speech-acts and the mental states they express have the contents that they do, and we can ask about what
it is in virtue of which sentences have the semantic values that they do. While I am highly sympathetic to
this distinction, observing it in what follows would have unnecessarily complicated an already complicated
discussion. It may be the case that representationalism and inferentialism are best viewed as second-order
accounts of content rather than—or at least in addition to—semantic value. However, I write here under
the simplifying assumption that “metasemantics” is about what I sometimes call “semantic content,” which
is meant to cover both.
4 Perez Carballo () usefully distinguishes a “hermeneutic” question about how to interpret the
formal specifications of truth conditions in truth-conditional semantics from an “explanatory” question
about what it is that explains the (approximate) correctness of the formal specifications of truth conditions
for a sentence. I agree that these are distinct, but I also think the metasemantic views I will discuss here are
all at a high enough level of generality that they can be seen as offering package answers to these questions.
I’m mainly interested in the explanatory question, but sometimes I will ask about how we are to interpret the
results of truth-conditional semantics. By this, I don’t mean just what hermeneutic gloss of the formalism
is appropriate but also what underlying facts about language and language users explain what it is in virtue
of which sentences have the truth conditions that they have.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
5 This is a simplified version of the rule predicted by Kratzer () for necessity modals. In addition to
the fact that it suppresses world-relativity of semantic values and elides the distinction between sentences
of the object language and metalinguistic variables, it also prescinds from the interesting issue of how
to account for ‘ought’s apparent weakness compared to ‘must’. Kratzerian proposals for handling this are
explored in Fintel and Iatridou (), Swanson (), and Silk (). Wertheimer (), Finlay (),
and Portner and Rubinstein () make somewhat different proposals, which are nonetheless consistent
with the basic idea that ‘ought’ is a necessity modal and to be modeled in terms of a universal quantification
over a set of worlds. Here I will also suppress the fascinating question of how (if at all) this rule should
be modified to handle the distinction between the objective ‘ought’ of what’s actually best and the so-
called subjective ‘ought’ of what best optimizes ones choices given imperfect information and varying
standards for practical decisions. For discussion, see Kolodny and MacFarlane (), Björnsson and Finlay
(), Dowell (), Cariani (), Charlow (), Carr (), Silk (), Bronfman and Dowell (),
Wedgwood (). Furthermore, there are other metaethically important challenges to this semantics that
will remain off stage here. For instance, the standard account also has difficulty with sentences of the form
“If p, then it ought to be the case that p” (see Zvolenszky, ; Carr, ). It seems to rule out strong
ought-dilemmas by semantic fiat rather than philosophical argument. See Swanson () and Fintel ()
for discussion. I think each of these problems can be addressed within a broadly intensional semantics for
‘ought’, which is all I really need for my argumentative purposes below; but I won’t discuss this further here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
other concepts in important ways, we should recognize that the modal rule’s ability to
provide such a systematic treatment of sentences as diverse as ()–() without positing
lexical ambiguity is an impressive feat—arguably one that any semantic analysis of
‘ought’ in terms of obligations, fittingness, reasons, or what’s best will fail to do.6
Nevertheless, I still believe there is a lesson proponents of the modal rule might learn
from the urge amongst metanormative theorists to connect ‘ought’ to notions such as
obligations, fittingness, or reasons. Ought-sentences seem, when used in the sense that
is of most interest in metanormative theory, to call on agents to do something; and
arguably this is a more fine-grained notion than a proposition’s being true at possible
worlds. Here is Geach in a classical statement of the objection:
If the deontic operator applied to whole propositions, then the results of modifying the
active and the equivalent passive form with an ‘ought’ must likewise be equivalent. But such
equivalences often appear not to hold. ‘John beats up Tom’ and ‘Tom is beaten up by John’ are
equivalent; but it looks as though ‘John ought to beat up Tom’ and ‘Tom ought to be beaten up
by John’ are not necessarily equivalent. (, p. )
In light of this good point, rather than abandoning the modal rule altogether,7 I think
we should enhance it by integrating it with a semantics for imperatives. After all, saying
that S ought to φ—in the sense of most interest in metanormative theory—seems
closely related to issuing the imperative, “S φ!”
How might we enhance the modal rule in a way that respects this connection? Here’s
a start: Formally, semantic models typically assign declarative contents (propositions)
truth values relative to points of evaluation, which are conceived as (at least) possible
worlds (which in turn might be thought of in a variety of ways, e.g. as concrete possible
universes, abstract ways reality could be, sets of propositions, or something else). As
far as formal semantics goes, we might also want to assign imperative contents (what
I’ll call “prescriptions”) a kind of semantic value relative to a point of evaluation—
after all, imperatives are also logically-composed sentences of language standing in
semantic relations such as entailment and equivalence to other sentences—but the
kind of semantic value and the conception of the point of evaluation has to be
different (since imperatives do not seem to have truth values).8 One proposal that I
like assigns imperatives the semantic values legitimate/illegitimate relative to possible
norms (which in turn might be thought of in a variety of ways, e.g. as concrete laws of
6 I consider various attempts to reductively analyze ‘ought’ in these terms in chapter of Chrisman
(); the main stumbling block is plausibly capturing the meaning of epistemic ought-sentences such as
() while retaining a plausible treatment of paradigmatic normative ought-sentences such as () and ().
7 Schroeder () argues that it motivates treating ‘ought’ as ambiguous between raising-verb and
control-verb readings. I discuss this proposal and show why it isn’t linguistically motivated in Chrisman
(a). See also Finlay and Snedegar () for pragmatic explanations of many of the phenomena that
lead Schroeder to posit ambiguity.
8 A point which is argued for in greater detail in Charlow (b, section ), Starr (, section ), and
Chrisman and Hubbs (unpublished, section ).
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
possible practices, abstract ways agents might behave, sets of general prescriptions, or
something else).9
It is not important here whether this is the right semantics for imperatives, but if
we posit the imagined prescriptional contents with legitimacy values rather than truth
values, then we can embed this in our semantics for ‘ought’ by adopting something like
the following as a new general rule articulating the truth-conditional contribution of
‘ought’:
Enhanced Modal Rule: [[ought p]]fg = T iff for all <w, n> ∈ Pfg , <w, n> ∈ p
Here we no longer restrict p to propositions that might be expressed by declarative
sentences and modeled as a set of possible worlds; instead we allow that it could also
be the sorts of prescriptions that might be expressed by imperative sentences, and we
model it in general as a set of world-norm pairs (the worlds where the prescription is
legitimate relative to the norm). This rule says that an ought-sentence is true10 just in
case its prejacent is correct relative to each member of some contextually determined
set of world-norm pairs. This means, if we allow that the prejacent p could be either
propositional or prescriptional, we get two species of ought-claims out of a univocal
semantic rule. When the prejacent is propositional, the rule works similarly to the one
above: the ought-sentence is true just in case the prejacent is true relative to each of
some contextually determined set of world-norm pairs, where the norm parameter
is usually idle. By contrast, when the prejacent is practical, the rule evaluates its
legitimacy relative to each of some contextually determined set of world-norm pairs,
where the norm parameter is usually crucial.11
9 Compare Castañeda (, chapters –) for the most detailed way of working out this basic idea about
imperatives (but note that he uses the term “prescription” in a different way to how I am here). Charlow
(b) defends a related idea in an expressivistic framework which treats imperatives and declaratives as
having properties of representations of states of mind as their semantic values, where ordinary declaratives
denote properties of representations of beliefs and imperatives denote properties of representations of
preferences. Hauser (), Segerberg (), Portner (), and Hanks (, chapter ) defend competing
views about the semantic value of imperatives that can still be used to generate sentence-level contents
that are not propositions, which in turn can be thought to be embedded under ‘ought’s. In Chrisman and
Hubbs (unpublished), we show how treating such nonpropositional contents as the semantic values of
imperatives can be motivated by action-theoretic considerations ancillary to the project of compositional
semantics. As indicated above, what is crucial here is not the correctness of this approach to the semantics
of imperatives but rather that it can handle the content embedded under prescriptive-seeming ‘ought’s. A
perhaps more orthodox view is that imperatives have action-related properties as their contents and their
semantic relations to other sentences are derivative of the semantic relations between the propositions that
result from ascribing these properties to individuals. I think this view has trouble explaining the semantic
difference between necessarily coextensive but nonsynonymous imperatives, and I doubt that it gets the
logic of imperatives right. But that doesn’t matter here since I am not relying on the view of prescriptions
above as the right view of the content of imperatives.
10 Ought-sentences are clearly declaratives and so they get truth values and have truth conditions on this
view. Although this treatment of their truth conditions allows that they can embed prejacents which are not
truth-apt, this is not a version of the failed noncognitivist idea that ought-sentences are neither true nor
false.
11 This bears some similarity to the proposal developed in Gibbard (, chapter ) and Gibbard (,
chapters. –) to extend the standard possible-world semantics, adding an extra norm-index relative to
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
With a rule such as this one, I think we overcome the worry from Geach about
applying truth-conditional semantics to ‘ought.’ The prescriptions John, beat up Tom!
and Tom, be beaten up by John! are different from each other (and from the proposition
that John beats up Tom); they could easily differ in whether they are legitimate relative
to a norm. The enhanced modal rule lets us understand the truth conditions of ought-
sentences in terms of a universal quantification over a contextually determined set of
possibilia.12 Even within a univocal treatment, we can recognize an important seman-
tic difference between obviously propositional ought-sentences, such as () and (),
and obviously prescriptional ought-sentences, such as () and (). It may be less clear
where other ought-sentences fall in respect of this divide, but that is plausibly because
determining this requires more information about the context in which they are used.
In any case, the enhanced modal rule is the rule articulating ‘ought’s contribution
to truth conditions that I will use in the rest of this paper. As I indicated above,
I will not provide any more argument for this semantics of ‘ought’ than is already
in the barebones explanation of the rule, and I fully grant that further refinements
of it are needed for various purposes in semantics. My purpose in explaining it
is mainly to frame the question I want to pursue in the rest of this paper: how
should we interpret this rule? Recall at the outset I suggested that a central debate
in metanormative theory seems to be pulled in two directions based on whether one
takes more seriously the idea that ‘ought’s (at least the ones relevant to metanormative
theory) are distinctively prescriptive, or the idea that all declarative sentences should
be given a truth-conditionalist semantics. With something like the enhanced modal
rule in hand, I think we can see this to be a false choice by exploring two different
metasemantic accounts of what this rule tells us about the facts in virtue of which
ought-sentences have the contents that they do. Both of these accounts are consistent
with the idea that some ought-claims function not as descriptions of what is actually
the case but rather prescriptions for what someone is to do in various circumstances.
The interesting question is how they differ in capturing this idea.
which some sentences could be evaluated. However, Gibbard uses this extension to provide semantic values
for declarative sentences with normative elements, in essence committing to either a kind of noncognitivism
or a kind of norm-relativism about the possibility of true normative declaratives. By contrast, I’m applying
this idea only to nonpropositional contents, like those we might assign to imperatives, which I deny are
truth-apt and for which I think it is much more plausible to assign some kind of semantic value other than
truth.
12 An alternative attempt to do this posits elided “stits” (“sees-to-it-that”) in the prejacent proposition
of ‘ought’s intuitively thought to be practical. As a semantic proposal, I think this is ad hoc and non-
explanatory. But arguing for that is not necessary to the points I want to make here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
of how they represent reality as being. To see how this works, consider first a simple
sentence, such as
() Grass is green.
It is very tempting to think that sentences such as this one are true just in case reality
is the way they represent it as being. That is to say, e.g., that () represents grass as
having the property of being green. Generalizing from this example, it is tempting
to think that pursuit of a truth-conditional semantics boils down to the development
of a recursive method for spelling out the way reality would have to be in order for
each declarative sentence of the language to be true. Using the terminology I prefer,
we can call this a representationalist explanation of truth conditions. Arguably, it is a
core assumption of all traditional realist and error-theoretic views in metaethics. They
would say that
() Stealing is wrong.
is true just in case reality is the way () represents it as being. Then there is room
for debate among these philosophers about the nature of this putative piece of
reality (naturalism vs nonnaturalism) and whether it actually obtains (success-theory
vs. error-theory).
The pervasiveness of this representationalist interpretation of truth-conditional
semantics can help to make some sense of why anyone would have ever denied,
as early noncognitivists did, that normative sentences have truth conditions. For,
on the face of things, it seems obvious that some normative sentences are true and
others are false; and from the point of view of ordinary language, embedding ethical
sentences such as () under the truth predicate and other propositional contexts
such as belief/knowledge attributions and epistemic modals seems beyond reproach.
However, if your view is that normative language at its base is prescriptive rather than
descriptive, and you accept (at least tacitly) the representationalist conception of what
truth-conditional semantics is telling us, then you might be tempted to argue that
normative sentences do not have truth conditions (at least not in any theoretically
interesting sense).
Below I’ll suggest that there’s a way that even those metanormative theorists inspired
by noncognitivism can resist this temptation—as they should, since it is difficult to
see any hope of realizing the aspirations of compositional semantics if we pursue
a bifurcated approach to semantics, using truth conditions as our basic semantic
framework for nonnormative sentences such as () but something else for normative
sentences such as ().13 But before we get there, I want to explain why I think the
metanormative situation with respect to ‘ought’ complicates things in a way that allows
13 This is the main lesson of Hale (), Dreier (), Unwin (). Schroeder () shows how
difficult it is for an expressivist to avoid this even by advancing a psychologistic semantics across the board,
i.e. applied also to paradigmatically descriptive sentences.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
even the hardcore representationalist to accept, indeed to champion the idea that
normative language (at least insofar as it is “fraught with ought”) is prescriptive rather
than descriptive, undercutting some of the traditional noncognitivist motivation for
denying that normative sentences have truth conditions.
Here’s why things are more complicated. Even if you accept the representationalist
conception of truth conditions in the basic case, you will want to allow there are some
declarative sentences that do something other than describe what is actually the case.
This is because some sentences are formed with intensional operators, e.g.
() Jack and Mary might be in the house.
And intensional operators do not seem to function like normal predicates. Instead,
they are generally thought by semanticists to serve the purpose of shifting some
parameter of the circumstance relative to which we evaluate the content that they
embed. That is, for instance, they tell us to evaluate some embedded proposition not
at the actual world but at some possible world(s). For example, the idea is to think of
() as true just in case the embedded proposition that Jack and Mary are in the house
is true in some world regarded as possible given a particular body of evidence. This is
why its truth conditions are usually given with something like the following:
[[()]]c = T iff [[Jack and Mary are in the house]]c = T in some possible world
consistent with our evidence in this world
As a working hypothesis, this is all relatively familiar from textbook treatments of
intensional operators in semantics.14 It may need to be refined, but the important
question at this stage is not semantic but metasemantic: What is the representationalist
gloss of these truth conditions? One fairly natural thing for the representationalist to
say is that these truth conditions tell us how this sentence represents reality as being,
but the piece of reality it represents is more complicated than anything having to do
merely with what is actually the case. The thought is that () represents how things are
not just in the actual world but also in other possible worlds; this involves representing
some kind of relation between possible worlds. So, if we assume an ontology of
possible worlds,15 we can continue to think of all declarative sentences, even those
formed by intensional operators, as representing ways reality could be. But (here’s the
payoff) we can simultaneously deny that all declarative sentences describe how things
actually are. For on this view some sentences represent a piece of reality that involves
14 Not that it is uncontroversial. Even Kratzer () recognizes that its application to epistemic modals
faces challenges making sense of apparent disagreement, which has led some to follow MacFarlane ()
in complicating the definition of truth for such claims, relativizing to a context of assessment; but see von
Fintel and Gillies (, ) for criticism and a more conservative response.
15 What about linguistic Ersatz-theorists who reject the reality of possible worlds but contend that some
other part of our ontology can play the role of possible worlds in our interpretation of the truth conditions
of modal sentences? Whether they count as representationalists in the sense I’m trying to capture here is
going to depend on how exactly they conceive of the other part of our ontology as playing the role of possible
worlds.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
a relation between the actual world and other merely possible worlds. Hence, on the
representationalist interpretation of the standard truth-conditionalist semantics for
them, might-sentences are no mere description of how things actually are.
Application to ‘Ought’
I hope it’s already clear that this is relevant to the apparent tension between the
idea that ‘ought’s do not describe what is actually the case but prescribe possible
action, thought, and feeling, and the idea that all declarative sentences deserve the
same general semantic treatment, e.g. in terms of compositionally specified truth
conditions. Even if we just use the simple modal rule for ‘ought,’ it can be viewed like
‘might’ as a device for shifting some parameter of the circumstance relative to which
we evaluate the propositions it embeds rather than for describing some feature of
what is actually the case. Then, representationalists will interpret the truth conditions
this rule predicts for ought-sentences as specifying how the sentences represent a
complex modal feature of reality as being—not as merely describing how things
actually are.16 But more to the point, if we use the enhanced modal rule, I think the
representationalist can even capture an important sense in which at least some ‘ought’s
are prescriptive rather than descriptive. For example, we can use this rule to interpret
() Jack and Mary ought to leave.
predicting the following truth conditions:
[[()]]fg = T iff [[Jack and Mary, leave!]] = L in for all < wn >∈ Pfg
This means that () is true just in case its prejacent prescription is legitimate relative
to all of the relevant points of evaluation (in this case world-norm pairs highly
ranked by >g that are consistent with background conditions f ). Then, assuming
an ontology of possible worlds and possible norms, representationalists can interpret
these truth conditions as representing a modally complex way reality could be: a way
the actual world is related to possible worlds and possible norms. More specifically,
representationalists will think the formula above tells us that the prescription for Jack
and Mary to leave is legitimate across a contextually determined range of possible
norms in light of the circumstances common in a contextually determined range of
possible worlds. In this way, () is assigned truth conditions, but since these are truth
conditions involving what is prescribed by norms in various possible worlds, this
16 So it’s possible to reject my suggestion that ‘ought’s can embed prescriptive prejacents and still buy
the rest of what I say here as long as there’s an alternative way to capture the prescriptivity of some ‘ought’s.
Some may be inclined to do this by treating one of the ordering sources evoked by ‘ought’ (according to
the modal rule) as ordering worlds in terms of whether some set of prescriptions is satisfied. I think this is
going to be too coarse-grained to respect Geach’s point from above, but a lot will depend on how we work
out the details of prescriptive ordering sources.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
17 Are these things perhaps real but not “just as real” as the greenness of grass or the location of people
in the house, because they are abstract? I doubt that the distinction between concrete and abstract elements
of reality helps here, as many of the things we probably want to say are “more real” than possible worlds and
possible norms would, on many ways of drawing this divide, count as abstract.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
coheres with the popular stance among many philosophers that possible-worlds talk
in semantics is a convenient fiction or a mere notational device, or that possible worlds
are “pleonastic entities” rather than real parts of our ontology.
18 This is, I believe, what distinguishes the view from Wittgenstein-inspired “use theories of meaning”
based on skepticism about drawing any principled line between the uses of language that reveal semantic
competence and those that reveal merely pragmatic understanding of the way language can be used to
achieve sundry goals. Brandom has recently characterized his linguistic pragmatism as the combination
of two principles. Methodological pragmatism: “the point of associating meanings, extensions, contents,
or other semantic interpretants with linguistic expressions is to codify proprieties of use,” and semantic
pragmatism: “all there is to effect the association of meanings, contents, extensions, rules or other semantic
interpretants with linguistic expressions is the way those expressions are used by the linguistic practitioners
themselves”(, pp. –). As long as we restrict the focus to the uses of language to make assertions,
which are distinguished in the way they license inference to other assertions and require defense via other
assertions, I think we get close to what most workaday semanticists take as their principal data: ordinary
speaker intuitions about the entailment, inconsistency, equivalence properties of sentences.
19 One may not always acknowledge the inferential consequences of what one asserts. So its being part
of what one is committed to is not the same as believing that it follows. But the more someone fails to
acknowledge what we take to follow from what one asserts, the less we will think she understands what she
is saying.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
challenged, have to back it up with reasons, for instance arguing that stealing involves
a gross violation of the respect we owe to each other as human beings.20
Importantly, these upstream and downstream inferential connections are not
merely logical entailments but rather in the broader family sometimes called semantic
implications, which are usually defeasible and context-sensitive.21 For this reason,
competence with the meaning of sentences can be thought of, on the inferentialist
picture, as coming in degrees. How much one understands what one is saying depends
on how much one knows one’s way around a space of semantic implication-relations
containing the sentence and the types of things that would defeat these relations.22
Before we can bring this idea to bear on the metanormative issues introduced above,
there are two questions I want to address in order to flesh out the inferentialist idea.
First question: What does endorsing inferentialism mean for truth-conditional
semantics? Brandom sometimes writes as if inferentialism is inconsistent with appeal-
ing to truth conditions in one’s explanation of the semantic composition of declarative
sentences,23 but I doubt inferentialism and truth-conditional semantics are really
in tension, at least not when we understand “truth-conditional semantics” as the
approach in compositional semantics where one attempts to recursively derive some
specification of when a declarative sentence should be assigned the preferred semantic
value true relative to some parameters, thereby displaying the semantic function of the
sentence’s subsentential parts. Because of the pervasiveness of representationalism,
the representationalist gloss on the results of this approach is sometimes labelled
“truth-conditional semantics.” But I think it’s helpful to view representationalism
20 Often one may not actually be challenged to defend what one asserts, and, regarding certain kinds
of topics, one may be default entitled in most or all of the everyday situations where one would make an
assertion; but being the sort of linguistic move whose form makes it challengeable for reasons is essential
for being an assertion. (Brandom , pp. –)
21 We can then seek to recover specifically “logical” entailments by restricting ourselves to consideration
of the semantic implications ensured by the “logical” form and words of the sentence.
22 Sellars and Brandom tend to take a very expansive view of the implication-relations that are relevant
to the determination of meaning. However, any plausible version of their views would embrace the idea
that how central such a relation is to determining the meaning of a particular word/concept is something
that comes in degrees, and the lesson we were supposed to have learned from Quine’s attack of the
analytic/synthetic divide is that it is not possible, in general, to draw a sharp line between the implications
that hold in virtue of meaning alone and the implications that hold in virtue of something else (though if
we hold enough other stuff fixed, we might be able to draw a practically useful line in particular contexts).
23 For example, he writes, “. . . truth is not a concept that has an important explanatory role to play in
philosophy. Appearances to the contrary, are the result of misunderstanding its distinctive expressive role.
The word ‘true’ does indeed let us say things that in many cases we could not say without it. But when we
understand what it lets us say, and how it does that, we will see that the very features that make it expressively
useful make it completely unsuitable to do the sort of theoretical explanatory work for which philosophers
have typically enlisted its aid.” (, pp. –) Later, however, he strikes a slightly more conciliatory tone,
“What about the role of truth in semantic explanation, via a definition of propositional content in terms
of truth conditions? We certainly do use ‘true’ to say what the content of a claim is. . . . But it would be a
mistake to infer from this sort of appeal to truth conditions to express propositional contents that one can
explain what propositional contents are by appeal to the conditions under which sentences are true.” (ibid.,
p. )
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
24 Compare Williams (, pp. –; ), who also argues that inferentialism is compatible with a
truth-conditionalist approach to compositional semantics.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
intuitive thought that many declarative sentences describe the way things are in the
world around us? Many philosophers seem to think that inferentialism undermines
the very idea of a sentence’s describing reality. The thought is something like this: if
semantic contents are positions in something we characterize as merely a “game” of
giving and asking for reasons rather than specifications of how reality has to be in
order for the relevant sentence to be true, then surely no sentences should be viewed as
describing reality. Alternatively, inferentialism is sometimes portrayed as taking what
is usually a local denial that some area of discourse is representational (e.g. normative
or modal) and generalizing it to all areas of discourse. Here, the thought is something
like this: If everything we say is somehow about how we should infer, and you think this
“should” is prescriptive rather than descriptive, then surely that means that nothing we
say is a description of how things are in the world around us.25
While granting that some things some inferentialists have written may suggest
that this is the view,26 I think a wholesale rejection of the idea that some declarative
sentences describe the world around us is an extreme version of the view not actually
endorsed by some of its central proponents. For example, Sellars was famously an
inferentialist about content and a realist about science, writing “in the dimension of
describing and explaining the world, science is the measure of all things, of what is
that it is, and of what is not that it is not” (, §). And Brandom characterizes
his inferentialism as supported by a “sophisticated expressivism” about logical, modal,
and normative vocabulary that “. . . is essentially, and not just accidentally, a local
expressivism.” This is because, in the way he understands the expressive role of
these vocabularies, it depends on the existence of other vocabularies that play a
descriptive role: “Autonomous discursive practices must contain vocabularies playing
other expressive roles—for instance, observational vocabulary that reports features of
the non-linguistic bits of the world” (, p. ). In the context of the arguments
here, this is good because any view that implies that ought-sentences aren’t descriptive
because no declarative sentence is descriptive could achieve only a pyrrhic victory.
25 For example, Price imagines what a quasirealist expressivist like Blackburn might say if he adopted a
Brandomian inferentialist conception of assertion. Price suggests the expressivist might say, “I used to think
of my Humean expressivism as a local position, applicable to some vocabularies but not others. . . . However,
I have now come to realise that for no vocabulary at all is it theoretically interesting to say that its function
is to ‘represent’ particular kinds of states of affairs. . . .This. . . does nothing to undermine the interesting
observations that got me started, about the distinctive—and different—functions of moral and modal
vocabulary. On the contrary, it simply implies that they are exemplars of an approach to language we should
be employing everywhere. In other words, what I took to be linguistic islands are simply the most visible
extremities of an entire new continent—a universal program for theorising about language in expressivist
rather than representationalist terms . . .” (, pp. –).
26 Macarthur and Price () appear very sympathetic to this idea. But, in later work, Price (,
chapter ) has developed a more nuanced position, which trades on a distinction between thoughts that
carry information (what he terms i-representations) and a species of these which empirically track the mind-
independent world (what he terms e-representations). See also Blackburn () and Williams () for
useful discussion and repudiation of global expressivism in the sense at issue here.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
27 Williams (, pp. –) explores a more sophisticated way to cash out this distinction in terms of
various clauses in an explanation of meaning in terms of use. As Williams argues, some such explanations
may be ontologically conservative in that they do not mention referents (properties when it comes to
predicates) of the terms whose use they are appealing to in explaining meaning, but others won’t be. My
suggestion here (following Williams) is that the explanations of the meaning of color vocabulary in terms
of its use might not be ontologically conservative, while the explanation of deontic modal vocabulary is a
plausible candidate for an explanation that can be ontologically conservative.
28 Drawing on Peacocke (; ) and Block (), Wedgwood (, , chapter ) articulates a
theory of conceptual roles as determining the representational function of the words that express concepts
with these roles. On this view, each word refers to something, and what it refers to is determined in part
by the conceptual role of the concept it expresses; and it is the logical composition of these concepts that
determines how each declarative sentence represents reality as being. Because he uses a truth-conditional
semantic rule similar to the modal rule stated above, he ends up defending a conceptual-role interpretation
of this rule that bears important similarities to the one I develop below. However, because he treats all
words as referring to something in reality, his inferentialism makes no room for distinguishing between
representational and nonrepresentational vocabulary. By contrast, I see in the inferentialist metasemantics
resources for motivating a distinction at least when it comes to ‘ought.’
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
29 Kant (, A/B), for example, identified twelve “pure concepts of the understanding” interpret-
ing them in something like this way. In a similar vein, Sellars () argued that terms such as “substance”
and “quality” are metalinguistic devices that get their content from the way they can be used to endorse the
types of inferential connections between other empirical terms, connections which must be in place, at least
implicitly, for these terms to count as applying to empirically knowable pieces of reality in the first place.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Sellars and Brandom would say that this sentence does not represent a piece of reality
involving the modal relation between locations which are situated east/west of each
other. Rather it is a device for acknowledging the conceptual validity of inferences of
the form
x is east of y → y is west of x
thereby (partially) spelling out and explicitly committing to the inferential connec-
tions between more basic terms such as ‘east’ and ‘west.’30
Does this mean that, according to Sellars and Brandom, conceptual necessity
claims such as () are mere “inference tickets” in a “game” of giving and asking for
reasons, rather than sentences with truth-conditional content? I think it would be
foolish to make that claim.31 Instances of () can clearly be embedded in “It is true
that . . .” and other truth-functional contexts, and they express something that can
be doubted, believed, known, etc. So, it would be very strange to say that they do
not have truth-conditional content. Indeed, something akin to the standard possible-
worlds semantics for alethic modality provides a pretty good articulation of these truth
conditions.
Assuming this is (roughly) right, the crucial metasemantic question is what it is in
virtue of which this truth-conditional specification of the content of some instance of
() is correct (insofar as it is)? As we have already seen, the representationalist answer
is that it is correct in virtue of correctly identifying how that instance of () represents
reality as being. By contrast, an inferentialist following Sellars and Brandom would
say these truth conditions are correct in virtue of correctly modeling the position in
space of implications taken up by someone who asserts that instance of () in normal
discursive practice. One who accepts this sentence is committed to y‘s being to the
west of x, if she is committed to x’s being to the east of y. In most contexts, were it to
come up, in order to entitle oneself to an instance of (), one might need only to affirm
one’s competence with the words ‘east’ and ‘west.’32
30 Compare Thomasson (, ) who argues that this is the appropriate account of metaphysical
necessity claims. Here my suggestion on behalf of Sellars and Brandom is the weaker idea that at least one
species of necessity claim plays this inference-rule endorsing role. If Thomasson is right, however, that’s a
congenial conclusion.
31 Kant and Frege sometimes seem to be making something like this claim, e.g. “. . . the modality of
judgments is a very special function thereof, which has the distinguishing feature that it does not contribute
to the content of the judgment” (Kant, , A/B), and “By saying that a proposition is necessary I
give a hint about the grounds for my judgment. But, since this does not affect the conceptual content of the
judgment, the form of the apodictic judgment has no significance for us” (Frege, , p. ). However it is
probably anachronistic to view them as speaking of the compositional semantic content of modal sentences
rather than something like the empirical or descriptive content of the thoughts expressed by canonical uses
of these sentences.
32 What about conceptual necessity claims whose prejacents are not explicitly conditional in form? As
with all modal claims, we’d still articulate their truth conditions relative to background conditions in the
context of use, and these truth conditions could be viewed as depicting the idea that one who takes those
background conditions to hold is committed to the truth of the explicit prejacent.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
This is how I see an inferentialist metasemantics, when combined with the further
Sellarsian/Brandomian idea that some terms are metaconceptual devices, as provid-
ing resources for interpreting the truth conditions of some sentences along non-
representationalist lines. So, now we should turn back to the main topic of this paper:
‘ought.’
Above I claimed that the standard treatment of this word in compositional seman-
tics is as a necessity modal rather than an ordinary predicate-forming term. That
already suggests interpreting the truth-conditional contribution of ‘ought’ in terms
of its being a metaconceptual device. The main objection to this basic idea, I think,
is that the standard possible-worlds semantics for necessity modals faces difficulty
in making sense of the distinctive prescriptive uses of ‘ought’ common in normative
discourse. That is why I favor the enhancement of the standard account sketched in §.
This continues to treat ‘ought’s truth-conditional contribution in terms of a universal
quantification over possibilia, but it refines the kinds of prejacent content ‘ought’ is
thought to embed, and it makes correlative enhancements to the conception of the
possibilia at which these prejacents are to be evaluated.
I think this can now be leveraged into a very plausible inferentialist treatment of
‘ought’ as a metaconceptual device, which then opens up space for distinguishing all
ought-claims from descriptive claims. The core idea is to interpret ‘ought’ as getting its
content from its role in embracing commitment to inferential connections between
more basic items, rather than from its representational purport. This is analogous to
the metasemantic story about conceptual necessities provided by Sellars and Brandom.
Then we deploy the enhanced modal rule from before to generate specific explanations
of the types of more basic claims whose inferential connections are embraced via an
ought-claim.
For instance,
() We ought to do more to relieve great suffering.
would be said to have truth conditions something like the following:
[[()]]fg = T iff [[Let’s do more to relieve to relieve great suffering!]] = L in all
<w, n> ∈ Pfg
where this is conceived as a modal operation on a prescription, evaluating its legiti-
macy across a contextually determined set Pfg of world-norm pairs.
However, inferentialists won’t gloss this as an articulation of how () represents
modal reality as being (e.g. as containing various possible worlds and possible norms,
according to which the relevant prescription is legitimate). Rather they could inter-
pret the truth-conditional specification of the content of () as a depiction of a
point in a network of inferentially specifiable commitments one takes on when one
endorses (). Downstream, one is committed to prescriptions to act (in particular
contexts) in ways that comply with the prejacent prescription; and upstream one might
defend () by appealing to more general normative claims that support the norms
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Comparison to Expressivism
I want to conclude by discussing a more familiar constellation of ideas that might also
be thought to provide an alternative to representationalist treatments of normative
sentences. This is born out of the noncognitivist thesis that normative claims express
conative attitudes rather representations of reality, the sorts of attitudes that are
supposed to class with desires rather than beliefs in their motivational potentials (or
“directions of fit”) in a Humean psychology of motivation. This noncognitivist thesis
has received considerable attention and refinement over the past half century, resulting
in a number of different positions now defended under the banner of “expressivism.”
Moreover, expressivism has been applied in several domains besides normative dis-
course (modal, epistemic, aesthetic, alethic, etc.) with importantly differing theoretical
constraints. So, I find it far from clear what the core of expressivism is and how it
relates to the inferentialist ideas explored above. As we already saw, some think of
inferentialism as a kind of “global expressivism”—but that’s not the view one finds
in Sellars and Brandom. Moreover, many philosophers conceive of expressivism as
premised on rejecting truth-conditionalist semantics in favor of a “psychologistic”
alternative,33 at least in its application to normative discourse—but that’s not how
I think we should think of inferentialism.
I’m not going to be able to discuss all possible (or even many) versions of expres-
sivism here.34 But I think it will prove useful to discuss one expressivist line of thought
about ‘ought’ that I think comes closest to the inferentialist line of thought developed
above. This will put me in a position to explain how some ways of developing the view
might result in an alternative to inferentialism that has various problems while others
33 This is, for instance, the way expressivism is conceived by critics such as Rosen (, pp. –),
Wedgwood (, p. ), Kalderon (, pp. –), and Schroeder (, p. ).
34 See Chrisman () for my take on some historical and contemporary versions of expressivism.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
might result in friendly amendments to inferentialism. Hopefully, this will also clarify
the inferentialist view outlined above.
As long as we continue to observe the distinction between semantics and metase-
mantics, I think it should be clear that expressivism needn’t be understood as a view
about compositional semantics.35 We can find a version of expressivism as part of a
larger metasemantic view that has, until now in this paper, remained conspicuously off
stage. This is the view that the truth conditions assigned by compositional semantics
are articulations of what one who asserts one of these sentences ought to think. For
instance, in the most basic case, it’s plausible to say that one who asserts
() Grass is green.
ought to think (judge, affirm) that grass is green. The idea is that someone who asserts
this sentence but does not think this thought has violated the semantic rule associated
with () in virtue of its core communicative role. So, rather than starting with the
view that truth conditions articulate the way reality must be in order for a sentence
to be true, an “ideationalist” metasemantics would encourage us to start with the
view that semantic assignments to sentences tell us what idea one ought to have when
one uses the sentence canonically, in order to conform with the core communicative
norms associated with the sentence in the language.36 This is one plausible way to
flesh out the vague and popular suggestion that different sentences are canonical and
conventional means for expressing different mental states.37 Then, on this ideationalist
view, the truth-conditionalist articulation of the subsentential elements of language
could be viewed as an explanation of how an indefinite number of such expressive
commitments are generated by a finite number of terms and logical forms.
Although ideationalism starts in a different place from representationalism, it’s
wholly compatible with the representationalist view that sentences such as () rep-
resent reality. For an ideationalist can say that the truth conditions of () specify
the way this sentence represents reality as being by virtue of articulating the content
of a particular kind of thought, a representational thought about the way reality is,
35 Compare Suikkanen (), Chrisman (b), Yalcin (), Silk (), Charlow (a, p. ),
and Ridge (, chapter ).
36 Precursors and versions of this idea can be found in Grice (), Blackburn (), Davis ().
Although defending a somewhat similar idea, I think Richard () would say that it’s wrong to think of
these as truth conditions but that compositional semantic values can be interpreted as modeling a space of
commitments to various thoughts.
37 See Schroeder (, chapter ) and Ridge (, §.) for further discussion of the expression
relation appropriate for developing a form of metanormative expressivism. Both end up developing forms
of expressivism with an account of expression very similar to the account I am using here. The key
commonality is in construing the relation as a hypothetical linguistic norm, something like: if one uses
the sentence to make an assertion, one ought to think the relevant thought in order to conform with the
core communicative rules of the language. In Bar-On and Chrisman (), we argue that distinguishing
the way in which users of a sentence express mental states from the way in which sentence-types express
their semantic contents provides for a much simpler and more plausible explanation of the apparently
distinctive connection between moral claims and motivational attitudes than is on offer in standard forms
of metaethical expressivism.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
which one who asserts the sentence ought to have in order to conform with the core
communicative rules of the language.
Then, a “metasemantic” form of expressivism begins to come into view when
we observe that, although the ideationalist can agree with the representationalist
about sentences like (), she needn’t agree that all declarative sentences represent
reality. For she can argue that some sentences are conventional means for expressing
nonrepresentational thoughts, the content of which is specified by the truth conditions
the correct semantic theory assigns to the sentence.38 For example, an expressivist
might argue that one who uses a sentence such as:
() Stealing is wrong.
to make an assertion ought to think a particular thought, i.e. that stealing is wrong.
Then, however, she will argue that the functional role of this thought in the psychology
of motivation is directive of action rather than representative of reality.
This move obviously requires giving up on the idea that truth is something like
correct representation, which is why most contemporary expressivists embrace some
sort of minimalism about truth (though I think some kind of pluralism about truth
would also serve this purpose).
But that raises an important question: how could the expression of a directive
thought be something that one could doubt or know to be true, even in a defla-
tionary sense? After all, paradigmatic directive mental states are desires, preferences,
and plans, and these are not the proper objects of doubt and knowledge. In the
present context, I think expressivists might insist that the characterization of directive
thoughts as “desire-like” rather than “belief-like” is simply misleading and distracting.
The important claim is that some thoughts are thoughts about reality, they represent
reality as being a particular way; other thoughts are about what to do, they direct us to
make reality become a particular way. The standard Humean view of the psychology
motivation is that action always results from the cooperation of these two kinds of
thoughts. We should allow expressivists to call both “beliefs” in many core cases; the
important suggestion is that one of them has a directive rather than descriptive func-
tional role in the psychology of human motivation. It is because of the way in which
this role stands in contrast with the descriptive functional role of other thoughts that
this kind of expressivism promises to carry whatever attractions were in earlier views
38 For similar ideas, compare Blackburn (, pp. –), Silk (, p. ), Ridge (, chapter ), Perez
Carballo (). Charlow (forthcoming) develops a metasemantic form of expressivism in the context of a
dynamic “test” semantics for deontic and epistemic modals. This is importantly different in its rejection of
the semantic explanations of truth-conditional semantics, but it still shares in conceiving of expressivism
as a metasemantic thesis. Yalcin (, p. ) characterizes his expressivism as a pragmatic rather than a
semantic thesis. However, if that interpretation is going to carry the ontological benefits expressivism is
supposed to carry, then I think the pragmatics needs to be part of a broader metasemantic theory—perhaps
one that explains why sentences have the truth conditions they do in terms of what conversational moves
they are canonical means for making. I believe this is consonant with the characterization of metasemantics
in Yalcin ().
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
in the expressivist tradition. Moreover, if one can make sense of evaluating directive
thoughts as being more and less reasonable, justified, or warranted, a proponent of this
view would be on the way to making sense of the way that normative beliefs seem to
be the objects of epistemic evaluations.39
So, at least one way to conceive of metanormative expressivism is as the combination
of the Humean psychology of motivation with an ideationalist metasemantic interpre-
tation of the truth conditions of normative sentences. More specifically, we will need to
think of the truth conditions of normative sentences such as () as the articulation of
directive rather than descriptive thoughts. We can give them truth conditions such as
[[()]] = T iff stealing is wrong
and think of this as an articulation of what one who asserts this sentence ought
to think, in order to conform to the core communicative rules associated with this
sentence. However, in their philosophy of mind, expressivists will argue that this
thought plays a directive rather than a descriptive role in the psychology of moti-
vation. By understanding expressivism as part of an overall metasemantic package,
rather than a controversial thesis in compositional semantics, we could then grant
expressivists access to all of the successes of standard truth-conditional approaches to
compositional semantics, while leaving room in the philosophy of mind for them to
make the Humean argument that normative thoughts are nondescriptive because they
are directive.
There is a lot more we might want to know about metasemantic forms of expres-
sivism, but the crucial question for me at this stage is whether this generates a
competitor to the inferentialist metasemantic view about ‘ought’ outlined above or a
possible partner for it in a coalition against representationalism about ‘ought.’ I think
that’s going to depend on what expressivists say about several further questions.
First Question: What sort of thoughts do metasemantic expressivists say are expressed
by logically complex sentences? Those versed in debate about the Frege-Geach prob-
lem might grant that cleaving to a truth-conditionalist semantics and now attempting
to develop expressivism in one’s metasemantics provides for a better answer to this
question than old-school versions of expressivism did. For the expressivist can now
begin by saying that a sentence such as
() Either stealing is common or stealing is wrong.
has the truth conditions of a disjunction, i.e. a logical form ensuring that it is true
if either of its disjuncts is true. Still, if our metasemantic expressivist says that it
39 Though we will have to reject Cuneo’s suggestion that it is platitudinous that “. . . propositional
attitudes display one or another epistemic merit (or positive epistemic status) such as being a case of
knowledge, being warranted, being an instance of understanding, insight or wisdom and the like, only
insofar as they are representative in some respect.” (, p. ) See Chrisman (, , c) for
discussion of the prospects of nonrepresentational attitudes’ achieving positive epistemic status.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
has these truth conditions in virtue of expressing the disjunctive thought that either
stealing is common or stealing is wrong, we can ask: what kind of thought is that? In the
simple Humean psychology of motivation we were working with above, there are only
descriptive and directive thoughts. And while we may want to say that this thought has
descriptive and directive elements, the expressivist metasemantics seems to require
us to be able to say what kind of mental state the whole thought represents. After
all, as I glossed it above, the expressivist’s way of securing the claim that normative
sentences don’t describe reality is by claiming that these sentences express directive
thought rather than descriptive thought.
In response, expressivists might try to argue that logically complex sentences such
as () are directive, but in an attenuated sense. The idea would be to treat them as
the expression of conditional dispositions to infer, where inferring is conceived as a
mental action. So, in this case, we might think of () as expressing a thought about
what to do: if I come to reject one of the disjuncts I shall embrace the other.40
As someone sympathetic with inferentialism, naturally I think there’s something
importantly right about understanding the meaning of logically complex sentences in
terms of their inferential interrelations with other sentences. However, I doubt using
this attenuated sense of “directive” is going to let us draw the line between represen-
tational and nonrepresentational thoughts anywhere near where we want to draw it
for the purposes of metanormative theory. For notice that even a paradigmatically
descriptive sentence such as () might be thought to direct inference (e.g. if you accept
this, do not infer any conclusion that entails that grass is not green!). Moreover, it
would be highly odd to place
() Grass is green or roses are red.
() Grass is green and roses are red.
on the directive side of the ledger, given that we place each of the atomic parts on the
descriptive side of the ledger.41
Because of this, the answer I think expressivists should give to the first question
is that the Humean psychology of motivation is too stark; there are more than just
descriptive and directive thoughts, and the thoughts expressed by logically complex
sentences are neither purely descriptive nor purely directive. But, if expressivists gives
that answer, then they owe us a new account of what distinguishes the class of sentences
that have their truth conditions in virtue of expressing a representational thought and
40 This is one way to make sense of Blackburn’s (, pp. –; ) suggestion that logically complex
thoughts “tie one to a tree.”
41 I think a similar problem arises for Ridge’s (, , ) suggestion that logically complex
sentences express hybrid states. The idea is that sentences such as () express beliefs with a descriptive
content that is logically isomorphic to the normative sentence but they also express interlocking directive
attitude (e.g. the endorsement of an ideal adviser or a normative perspective). The problem I see for this idea
is that it ends up committing one to treating all logically complex sentences as expressive of a (partially)
desire-like state of mind, or else it involves an ad hoc treatment of the role of logical particles in various
logically complex sentences.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
function of sentences. So, this is a cost for the overall plausibility of the theory.
Moreover, as we have already observed, there are non-agentive but at least weakly
normative ought-claims, such as the teleological and evaluative ought-sentences ()–
() from above. Whatever connection there is between having the thought expressed
by these sentences and being motivated to perform some specific act is going to
be rather indirect and highly defeasible. We are right, I think, to question whether
someone understands what she is saying if she says something of the form “All things
considered, I ought to φ,” and yet has no motivation to φ. However, it is much
less plausible to think that competence with ‘ought’ more generally—even in many
paradigmatic normative uses—requires any particular motivational propensities.
This is why I think expressivists should instead answer the question by arguing that
many ought-thoughts fit into some third category of thoughts, neither purely descrip-
tive nor purely directive.42 Again, however, we should ask: what are they like and
what about them makes them nonrepresentational? And again I think the inferentialist
story sketched above might provide some helpful resources for expressivists. That is
to say that expressivists could treat some thoughts (paradigmatically ones expressed
with necessity modals) as not themselves representational but rather ways of encoding
metaconceptual manipulations on more basic pieces of semantic content, resulting in
thoughts explicating the structure of a space of inferential relations. Maybe there are
other way expressivists could answer this question,43 but obviously if they answer it
this way, the view is again a plausible partner to rather than a competitor with the
inferentialist view sketched above.
Third Question: If semantic content of sentences is to be explained in terms of the
content of the mental states (or “thoughts”) they express, as expressivists maintain,
then what explains the content of these mental states? The ideationalist conception of
truth conditions sketched above takes for granted mental content in order to explain
semantic content, which means that anyone endorsing it owes a further account of
mental content.
42 Although Blackburn (, ) is deeply influenced by Hume, he finds in Hume a much richer
diversity of types of mental states besides the descriptive and the directive, including all sorts of different
projectable stances and mental dispositions.
43 Another way, which I do not quite know how to categorize, is to follow Yalcin’s treatment of claims of
probability and epistemic modality (see especially Yalcin, , ). He replaces the traditional Humean
distinction between descriptive and directive thoughts with a distinction between mental states representing
the world as being one way or another (modelable as conditions on worlds) and second-order properties of
one’s first-order representational states (not modelable conditions on worlds but rather as a function from
sets of worlds). In a similar vein but within a dynamic framework, Charlow (forthcoming) distinguishes
between information-carrying thoughts and thoughts that are instructions to “test” one’s overall belief and
preference state for certain structural features. The more general idea behind Yalcin’s and Charlow’s versions
of expressivism is that there may be declarative sentences whose function is not to express representational
beliefs, nor to express desire-like states, but rather to coordinate on a higher-order property of one’s global
mental state, e.g. credence distributions and preference orderings. Depending on how these are related
to inferential commitments, this too might constitute a useful ally with rather than competitor with the
inferentialist view sketched above.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Early expressivists were, I think, tempted to say that the mental states expressed by
normative sentences are not thoughts with normative content; rather, all content is
representational but we can have nonrepresentational attitudes towards such content
(e.g. we can desire that stealing be less common). Then normative sentences were
treated as conventional and canonical means for expressing such attitudes. If we pro-
ceed this way, expressivists could insist on a purely representationalist conception of
mental content, but argue that it’s not the content of but the type of attitude that makes
a mental state nonrepresentational. However, I think it is wildly implausible to say that
there are no thoughts with normative content (what then is one doubting, assuming,
hoping, etc., when one doubts, assumes, hopes, etc. that stealing is wrong?).44
Another way expressivists could go at this point is to apply something akin to
the sort of inferentialist account of content I sketched above to the case of mental
content.45 If this is right, the ideationalist conception of truth conditions might be
correct as far as it goes, but the inferentialist view would be providing a deeper
explanation of what content is in virtue of which sentences have the meaning that
they do.
Conclusion
As I see things, the key issue between representationalists, inferentialists, and ideation-
alists is one about order of theoretical priority. Everyone should agree that some
sentences describe things in our environment, that most sentences have inferential
connections to other sentences, and that all sentences are vehicles by which we express
our minds. The interesting question is which of these is most theoretically basic for
understanding that in virtue of which sentences have the contents that they have.
Here I have argued that each of these views can accept the predictions that truth-
conditionalist approaches to compositional semantics give for ‘ought’; in effect that
it is a modal operator that quantifies over possibilia at which it evaluates embedded
contents. Given this, I think we can take representation as the master concept and
argue that these truth conditions reveal that ought-claims don’t describe how things
actually are but rather how things are with respect to possible worlds and possible
norms. But we might also take inference as the master concept and argue that these
truth conditions reveal that ought-claims are not ways of encoding a representation
of modal space but rather ways of explicating inferential relations standing between
other things we can say. It’s tempting to see the resulting inferentialist view in the
lineage of expressivism. And I’ve suggested that there may be sophisticated forms
44 Because of this, I think appealing to off-the-shelf Fodorian (, ) or Millikanian () or
Dretskean (, ) accounts of mental content is not going to work for expressivists, since these all
proceed in terms of representation relations between concepts and reality, whereas expressivists need
something different for normative concepts. Moreover, I suspect going in this direction opens expressivists
up to Dorr’s () wishful-thinking challenge.
45 See Blackburn () for an expressivist who ends up moving in the direction of inferentialism.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
of expressivism that are naturally allied with inferentialism about ‘ought.’ However,
it’s important to the way I am thinking of inferentialism that inferential connections
rather than conventional expressive potentials are most fundamental. This is the idea
that I want to recommend as an interesting metasemantic account of that in virtue of
which ought-sentences have the truth-conditional content that they do, which I see as
providing new foundations for a form of antidescriptivism that has often been ignored
in metanormative theory.46
Acknowledgments
For helpful feedback on earlier versions of this material, I am grateful to audiences at the
University of Edinburgh, University of North Carolina at Chapel Hill, University of Szczecin,
Robert Beddor, Nate Charlow, Graham Hubbs, David Plunkett, Michael Ridge, Alex Silk, and
two anonymous referees.
References
Bar-On, Dorit and Matthew Chrisman () Ethical Neo-Expressivism. In Oxford Studies in
Metaethics. Volume , edited by Russ Shafer-Landau. pp. –. Oxford; New York, NY:
Oxford University Press.
Björnsson, Gunnar, and Stephen Finlay () Metaethical Contextualism Defended. Ethics. .
. pp. –.
Blackburn, Simon () Spreading the Word: Groundings in the Philosophy of Language. New
York, NY: Oxford University Press.
Blackburn, Simon () Essays in Quasi-Realism. New York, NY: Oxford University Press.
Blackburn, Simon () Ruling Passions: A Theory of Practical Reasoning. New York, NY:
Oxford University Press.
46 I say “foundations” because I have not tried to motivate this idea for all claims one might want to
classify as nondescriptive—not even for all normative claims, let alone evaluative claims. I see three general
ways the foundations I have laid here with respect to ‘ought’ could be developed to address other normative
and evaluative terms. First, one might pursue a priority view, according to which ‘ought’ is the fundamental
normative/evaluative term, perhaps because when one is deciding what to think, feel, or do, all other
considerations should be thought to bear on a question most perspicuously posed with an ‘ought.’ Compare
Broome (, pp. – and chapter ). Along these lines, one might then try to analyze other terms,
insofar as they have normative uses, in terms of more fundamental normative ‘ought’s which one treats
as metaconceptual devices along the lines I have sketched above. Secondly, one might pursue a no-priority
view, according to which other terms with normative or evaluative uses deserve a different treatment, which
is not somehow derivative of the treatment of ‘ought’ presented here, but which also does not provide a
reduction going the other way for ‘ought.’ Along these lines, maybe the sort of treatment of ‘ought’ as a
metaconceptual device presented here works for what is clearly one of our core normative terms; and we
simply have to dig deeper to untangle the bilateral relationship between ought-claims and other normative
and evaluative claims. Finally, one might want to maintain that other terms with normative import (e.g.
‘right’ and ‘good’) are themselves intensional operators. Along these lines maybe they are metaconceptual
devices for endorsing different sorts of inferential commitments.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Blackburn, Simon () The Semantics of Non-Factualism. In The Blackwell Guide to the
Philosophy of Language, edited by Michael Devitt and Richard Hanley. pp. –. Malden,
MA, Oxford: Blackwell.
Blackburn, Simon () Pragmatism: All or Some? In Expressivism, Pragmatism, and Repre-
sentationalism, edited by Huw Price. Cambridge: Cambridge University Press.
Block, Ned () Functional Role and Truth Conditions. In Proceedings of the Aristoteian
Society. Supplementary Volume LXI. pp. –.
Brandom, Robert () Making It Explicit: Reasoning, Representing, and Discursive Commit-
ment. Cambridge, MA: Harvard University Press.
Brandom, Robert () Articulating Reasons: An Introduction to Inferentialism. Cambridge,
MA: Harvard University Press.
Brandom, Robert () Reason in Philosophy: Animating Ideas. Cambridge, MA: Harvard
University Press.
Brandom, Robert () Perspectives on Pragmatism. Cambridge, MA: Harvard University Press.
Brandom, Robert () Global Anti-Representationalism? In Expressivism, Pragmatism and
Representationalism, edited by Huw Price. pp. –. Cambridge: Cambridge University
Press.
Bronfman, Aaron, and Janice Dowell () Contextualism About Deontic Conditionals. In
Deontic Modality, edited by Nate Charlow and Matthew Chrisman. Oxford: Oxford Univer-
sity Press.
Broome, John () Rationality Through Reasoning. London: Wiley-Blackwell.
Cariani, Fabrizio () ‘Ought’ and Resolution Semantics. Noûs. . . pp. –.
Carr, Jennifer () Deontic Modals Without Decision Theory. Proceedings of Sinn Und
Bedeutung : Chemla, E., Homer, V., and Winterstein, G. (eds.) École Normal Supérieure,
Paris, – September. pp. –.
Carr, Jennifer () The If P, Ought P Problem. Pacific Philosophical Quarterly. . . pp. –.
Castañeda, Hector-Neri () Thinking and Doing: The Philosophical Foundations of Institu-
tions. Berlin: Springer.
Charlow, Nate () What We Know and What to Do. Synthese. . . pp. –.
Charlow, Nate (a) The Problem with the Frege-Geach Problem. Philosophical Studies. .
pp. –.
Charlow, Nate (b) Logic and Semantics for Imperatives. Journal of Philosophical Logic.
. . pp. –.
Charlow, Nate (forthcoming) Prospects for an Expressivist Theory of Meaning. Philosophers’
Imprint.
Chrisman, Matthew () Expressivism, Truth, and (Self-)Knowledge. Philosopher’s Imprint.
. . pp. –.
Chrisman, Matthew () Constructivism, Expressivism and Ethical Knowledge. International
Journal of Philosophical Studies. . pp. –.
Chrisman, Matthew () Ethical Expressivism. In The Continuum Companion to Ethics, edited
by Christian Miller. Chapter . London: Continuum.
Chrisman, Matthew (a) ‘Ought’ and Control. Australasian Journal of Philosophy. . .
pp. –.
Chrisman, Matthew (b) On the Meaning of ‘Ought’. In Oxford Studies in Metaethics. Volume
, edited by Russ Shafer-Landau. pp. –. New York, NY: Oxford University Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Kamp, Hans () A Theory of Truth and Semantic Representation. In Formal Methods in the
Study of Language, Part I, edited by Jeroen Groenendijk, Theo Janssen, and Martin Stokhof.
pp. –. Amsterdam: Mathematisch Centrum.
Kant, Immanuel () Kritik Der Reinen Vernunft. Riga: Johann Friedrich Hartknoch.
Kolodny, Niko and John MacFarlane () Ifs and Oughts. Journal of Philosophy. .
pp. –.
Kratzer, Angelika () The Notional Category of Modality. In Words, Worlds, and Contexts,
edited by Eikmeyer, H. J. and Reiser, J. Berlin: de Gruyter.
Kratzer, Angelika () Conditionals. In Handbuch Semantik/Handbook of Semantics, edited
by von Stechow Arnim and Dieter Wunderlich. Berlin, New York, NY: de Gruyter.
Macarthur, David, and Huw Price () Pragmatism, Quasi-Realism and the Global Challenge.
In New Pragmatists, edited by Cheryl Misak. . Oxford: Oxford University Press.
MacFarlane, John () Epistemic Modals Are Assessment-Sensitive. In Epistemic Modality,
edited by Andy Egan and Brian Weatherson. pp. –. Oxford: Oxford University Press.
Millikan, Ruth G. () Language, Thought and Other Biological Categories. Bradford, MA: MIT
Press.
Peacocke, Christopher () Thoughts: An Essay on Content. Oxford: Blackwell.
Peacocke, Christopher () Understanding Logical Constants: A Realist’s Account. Proceed-
ings of the British Academy. . pp. –.
Perez Carballo, Alejandro () Hermeneutic Expressivism. In New Essays in Metasemantics,
edited by Burgess, A. and Sherman B. New York, NY: Oxford University Press.
Portner, P. () The Semantics of Imperatives Within a Theory of Clause Types. In Proceedings
of SALT editor: Young, R. B. Northwestern University, – May. : pp. –.
Portner, Paul and Aynat Rubinstein () Extreme and Non-Extreme Deontic Modals. In
Deontic Modality, edited by Nate Charlow and Matthew Chrisman. Oxford: Oxford Univer-
sity Press.
Price, Huw () Expressivism for Two Voices. In Pragmatism, Science and Naturalism, edited
by Knowles, J. and Rydenfelt, H. pp. –. Frankfurt am Main: Peter Lang.
Price, Huw () Expressivism, Pragmatism and Representationalism. Cambridge: Cambridge
University Press.
Richard, Mark () When Truth Gives Out. New York, NY: Oxford University Press.
Ridge, Michael () Ecumenical Expressivism: Finessing Frege. Ethics. . . pp. –.
Ridge, Michael () Anti-Reductionism and Supervenience. Journal of Moral Philosophy. .
. pp. –.
Ridge, Michael () Impassioned Belief. Oxford: Oxford University Press.
Rosen, Gideon () Blackburn’s Essays in Quasi-Realism. Noûs. . . pp. –.
Schroeder, Mark () Being for. New York, NY: Oxford University Press.
Schroeder, Mark () Noncognitivism in Ethics. New York, NY: Routledge.
Schroeder, Mark () Oughts, Agents, and Actions. Philosophical Review. . pp. –.
Segerberg, Krister () Validity and Satisfaction in Imperative Logic. Notre Dame Journal of
Formal Logic. . pp. –.
Sellars, Wilfrid () Inference and Meaning. Mind. . . pp. –.
Sellars, Wilfrid () Empiricism and the Philosophy of Mind. In Minnesota Studies in the
Philosophy of Science. Volume , edited by Feigl, H. and Scriven, M. pp. –. Minneapolis,
MN: University of Minnesota Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Sellars, Wilfrid () Counterfactuals, Dispositions, and the Causal Modalities. In Minnesota
Studies in the Philosophy of Science. Volume II, edited by Herbert Feigl, Michael Scriven, and
Grover Maxwell. pp. –. Minneapolis, MN: University of Minnesota Press.
Sellars, Wilfrid () Meaning as Functional Classification. Synthese. . pp. –.
Silk, Alex () Modality, Weights, and Inconsistent Premise Sets. Proceedings of SALT
Chereches, Anya (ed.) University of Chicago, Chicago, IL, – May . pp. –.
Silk, Alex () Truth Conditions and the Meaning of Moral Terms. In Oxford Studies in
Metaethics. Volume , edited by Russ Shafer-Landau. pp. –. New York, NY: Oxford
University Press.
Silk, Alex () Evidence Sensitivity in Weak Necessity Deontic Modals. Journal of Philosoph-
ical Logic. . . pp. –.
Speaks, Jeff () Theories of Meaning. In Stanford Encyclopedia of Philosophy, edited by
Edward N. Zalta, Winter, edn.
Starr, William () Mood, Force and Truth. Protosociology. . pp. –.
Suikkanen, Jussi () Metaethics, Semantics, and Metasemantics. Pea Soup Blog. [Online]
Available from: http://peasoup.typepad.com/peasoup///metaethics-semantics-and-
metasemantics.html. [Accessed November .]
Swanson, Eric () On the Treatment of Incomparability in Ordering Semantics and Premise
Semantics. Journal of Philosophical Logic. . . pp. –.
Thomasson, Amie () Modal Normativism and the Methods of Metaphysics. Philosophical
Topics. . and . pp. –.
Thomasson, Amie () The Nancy D. Simco Lecture: Norms and Necessity. Southern Journal
of Philosophy. . . pp. –.
Unwin, Nicholas () Quasi-Realism, Negation and the Frege-Geach Problem. Philosophical
Quarterly. . pp. –.
Veltman, Frank () Logics for Conditionals. Ph.D. thesis. University of Amsterdam.
Veltman, Frank () Defaults in Update Semantics. Journal of Philosophical Logic. . . pp.
–.
Wedgwood, Ralph () Conceptual Role Semantics for Moral Terms. The Philosophical
Review. . . pp. –.
Wedgwood, Ralph () The Nature of Normativity. Oxford, New York, NY: Oxford University
Press.
Wedgwood, Ralph () Objective and Subjective ‘Ought’. In Deontic Modality, edited by Nate
Charlow and Matthew Chrisman. Oxford: Oxford University Press.
Wertheimer, Roger () The Significance of Sense. Ithaca, NY: Cornell University Press.
Williams, Michael () Meaning and Deflationary Truth. The Journal of Philosophy. . .
pp. –.
Williams, Michael () Pragmatism, Minimalism, Expressivism. International Journal of
Philosophical Studies. . . pp. –.
Williams, Michael () How Pragmatists Can Be Local Expressivists. In Expressivism, Prag-
matism and Representationalism, edited by Huw Price. Cambridge: Cambridge University
Press.
Yalcin, Seth () Epistemic Modals. Mind. . . pp. –.
Yalcin, Seth () Nonfactualism About Epistemic Modality. In Epistemic Modality, edited by
Andy Egan and Brian Weatherson. New York, NY: Oxford University Press.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Yalcin, Seth () Bayesian Expressivism. Aristotelian Society. . pp. –.
Yalcin, Seth () Semantics and Metasemantics in the Context of Generative Grammar. In
Metasemantics: New Essays on the Foundations of Meaning, edited by Alexis Burgess and Brett
Sherman. Oxford: Oxford University Press.
Zvolenszky, Zsófia () Is a Possible-Worlds Semantics of Modality Possible? Proceedings of
SALT Jackson, B. (ed.) University of California, San Diego, CA, – March. . pp. –.
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
Index
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
index
credence , , , n, , , , , , hyperplan n,
, n, , –, , n
Iatridou, S. –, –, –, –, , n,
deliberation , , –, –, , , , , n, –
, . See also practical ideationalism , , –, –
reason/rationality illocutionary force/function , ,
demonstratives imperatives , , –, , n, ,
direction of fit , –, , –, –, , –, –, ,
disagreement n, –, , –, n, , n, , n, n, –,
, , –, , –, , n, , –
n, , , , n categorical imperative
dominance , , , , , –, , –, hypothetical imperative , ,
n implicature n, , n, , n, n,
dynamic semantics , n, n, n,
n, , , , , –, –, inferentialism –, –, –, –
, –, –, , , n, information-sensitivity –, n, , , ,
–, n, n, n , , , –, , –, , , –,
, –, n, , , , , –,
EDT See evidential decision theory , , , –, –, n, n,
ellipsis n, n , –, –, , –
Ellsberg Paradox , h inheritance , n
evidential decision theory –, , –, , intention , , , –, n, , n,
n. See also causal decision theory , , –, –, –, –, ,
exclamitives , , –, –, n
expected utility , n, , , –, n, irrationality , , –, n, –,
, , , –, , n, , , –, , . See also rationality
–, ,
expected value –, –, –, n, , Jackson, F. n, , , , –, ,
n, n, , –, –, , , ,
, , , –, , n, –, , Kant, I. n, n, n, n, n
, – Kaufmann, M. –, –, –, –,
expressivism , –, n, , –, –, , n,
, n, , –, –, –, Kaufmann, S. –, –, –, –,
–, –, n, n, –, n,
– Kennedy, C. –,
Kolodny, N. , –, , –, , –, ,
Finlay, S. –, , , , , n, , n, , , n
n, n, n Kratzer, A. –, , –, n, n, –,
Fodor, J. –, n –, , , , n, , –, –,
Frege , , , , n , , –, , –, , , –,
Frege-Geach problem , –, –, –, , , –, , ,
, n, , , n, n
Geach, P. nn, , , , n, ,
, , n Lassiter, D. –, , n, n, n, n,
Gentle Murder Parodox – n, , –, , ,
Gibbard, A. , n, , , , , –, Lepore, E. n, , –
, –, , n Lewis, D. , , , , , –, ,
Gillies, A. n, n, n, , –, nn, , , , , , , , ,
, , –, –, –, , , nn,
gradability , , , , , , , , , , , n, , , n, n,
, –, , , lexical ambiguity n, , , , , ,
Grice, P. , , n –
lexical semantics –, , –,
Há – limit assumption n, , n, n,
Harman, G. , , –, n, , n
Harper, W. L. , , logical form , n, n, n, , ,
Horty, J. F. , –, n, ,
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
index
MacFarlane, J. , –, –, , , , n, practical reason/rationality , , –, , , ,
–, –, , , , n, n , –, , , , , , n,
McNally, L. –, –,
Medicine Puzzle , , pragmatics , , –, , , n, , , ,
metanormativity –, , –, , –, , –, , , , , , –,
n, –, , , –, , , –, , ,
metasemantics , –, n, , , n, , , –, n, , n, ,
, , , , n, , , , , , , –, , , , ,
n, , , , –, , –, n, –, –, n, ,
–, –, , n, n, n
Miner Puzzle , , –, –, –, , pragmatism
, , , –, –, –, n, preference , –, –, –, , –, ,
, –, –, –, –, , –, , –, , , , –, , –,
n, , , n, n, –, –,
modal base n, –, n, –, , n, –, n, , , , n
, , n, n, n, –, proposition , –, , , –, , , ,
–, –, , –, –, , , –, n, –, , –, , , ,
–, n, , –, –. See also –, –, –, –, –, ,
ordering source , –, , –, –, , , ,
mode of presentation –, , – , n, , –, –, –, ,
monotinicity –, n, –, –, , , , , –, n, –,
, –. See also nonmonotinicity –, –, –, , , n,
Moore’s Paradox –, –, –, , –, , ,
–, n, , , –, –,
neutrality –, , , , , , n, n, , n, –, n
–, , , Fregean proposition –, –
Newcomb Problem – Russellian proposition –
nonmonotinicity –, , , , , , propositional attitude , n, n, ,
–, , , , –, –. n
See also monotinicity Putnam, H. ,
normativity –, –, , n, , –, ,
, –, , , –, –, , , , quantifier n, , , , –, , , ,
, –, –, –, , –, , , –, , n,
–, , –, , , , , , Quine, W. V. O. –, n
–, n, , –, n, –,
–, –, , –, –, Ramsey, F. , ,
n rationality , , , , n, . See also
irrationality
Open Question Argument n, , relativism n, –, –, , –,
ordering source n, , –, , , , n, n, n
n, , n, , –, –, relevant alternative , –, , , , n,
–, –, –, –, , , –, , n,
–, , –, –, , n. representationalism –, , n, –,
See also modal base –, –, –,
restrictor n, , n, , ,
perlocutionary force/function –, Ross’s Paradox , –, , –, ,
Plunkett, D. n, , n, n, –, , ,
polysemy n, Rubinstein, A. –, , , , n
Portner, P. –, , , n, , n,
n, , , n, n scale , –, , , , , , ,
possible world , , , , , n, , –, –, –
n, , , –, n, , , , , Schroeder, M. n, n, –, –, ,
, –, , , –, –, , n, n, , n, , ,
, , , , –, , , , , n, n, n
, , –, –, n, –, , Sellars, W. , , , n, , –,
–, , –, –, –, Silk, A. n, n, n, n, –,
–, –, –, n
i i
i i
i i
OUP CORRECTED PROOF – FINAL, //, SPi
i i
index
i i
i i