Professional Documents
Culture Documents
Counterfactual Thinking, Counterfactual Writing (PDFDrive)
Counterfactual Thinking, Counterfactual Writing (PDFDrive)
Counterfactual Thinking, Counterfactual Writing (PDFDrive)
Edited by
Peter Auer · Gesa von Essen · Werner Frick
Editorial Board
Michel Espagne (Paris) · Marino Freschi (Rom)
Erika Greber (Erlangen) · Ekkehard König (Berlin)
Per Linell (Linköping) · Angelika Linke (Zürich)
Christine Maillard (Strasbourg) · Pieter Muysken (Nijmegen)
Wolfgang Raible (Freiburg)
Editorial Assistant
Aniela Knoblich
12
De Gruyter
Counterfactual Thinking -
Counterfactual Writing
Edited by Dorothee Birke, Michael Butter,
Tilmann Köppe
De Gruyter
ISBN 978-3-11-026858-4
e-ISBN 978-3-11-022904-2
ISSN 1869-7054
Contents
Tobias Klauk
Thought Experiments and Literature . . . . . . . . . . . . . . . . 30
Daniel Dohrn
Counterfactual Explanation in Literature and the Social Sciences . 45
Miko Elwenspoek
Counterfactual Thinking in Physics . . . . . . . . . . . . . . . . . 62
Patrizia Catellani
Counterfactuals in the Social Context:
The Case of Political Interviews and Their Effects . . . . . . . . . 81
Martin Hilpert
A Cognitive Linguistic Perspective on Counterfactuality . . . . . . 95
Bernhard Kleeberg
Significance and Abstraction: Scientific Uses of Counterfactual
Thought Experiments in the Early 20th Century . . . . . . . . . . 112
Birte Christ
“If I Were a Man”: Functions of the Counterfactual
in Feminist Fiction . . . . . . . . . . . . . . . . . . . . . . . . . 190
Rüdiger Heinze
Temporal Tourism: Time Travel and Counterfactuality
in Literature and Film . . . . . . . . . . . . . . . . . . . . . . . . 212
Robyn Warhol
“What Might Have Been Is Not What Is”:
Dickens’s Narrative Refusals . . . . . . . . . . . . . . . . . . . . 227
Richard Saint-Gelais
How To Do Things With Worlds:
From Counterfactuality to Counterfictionality . . . . . . . . . . . 240
On June 26, 2010, the German soccer team beat England 4–1 in the first
knockot stage of the FIFA World Cup in South Africa. Afterwards, the
international media hailed this victory as the inevitable triumph of a young,
inspired, and well-organized side over an aged and spiritless opponent.
However, there were also voices that stressed that despite the German su-
periority throughout most of the match the result could have been quite
different if England’s second goal had not been disallowed by the referee,
who did not see that Frank Lampard’s shot in the 39th minute had crossed
the line. “[I]f it was a draw that would have been very important for us”, the
enraged England manager Fabio Capello complained afterwards, reminding
journalists that the goal would have leveled the score at 2–2 and suggest-
ing that his team might then have gone on to win the match.1 Based on
the premise of “what if …”, Capello’s complaint is a wonderful example of
counterfactual thinking, which the social psychologist Keith Markman and
his colleagues define as “the imagination of alternatives to reality”.2
We have chosen this example not only because it allows for the playful
reversal of the most prominent illustration used in introductory texts on
counterfactual thinking – what if Germany had won World War II – but be-
cause it captures two important aspects of developing such scenarios. First,
it shows that mentally constructing alternatives to what really happened is
indeed a wide-spread human practice, “something familiar to everyone”,
even if they have never heard about the concept, as Neal Roese and Jim
Olson have pointed out.3 After all, it was not only Capello who harbored the
thought that things could have turned out differently but everybody who had
watched the game.
1 Capello quoted in Sachin Nakrani, “Fabio Capello Says ‘Little Thing’ behind Eng-
land Exit”, in: The Guardian, June 27, 2010, http://www.guardian.co.uk/football/
2010/jun/27/world-cup-england-germany1 (July 15, 2010).
2 Keith D. Markman/Igor Gavanski/Stephen J. Sherman/Matthew N. McCullen,
“The Mental Simulation of Better and Worse Possible Worlds”, in: Journal of Ex-
perimental Social Psychology 29/1993, pp. 87–109, p. 93.
3 Neal Roese/James M. Olson, “Preface”, in: Neal Roese/James M. Olson (eds.),
What Might Have Been: The Social Psychology of Counterfactual Thinking, Mahwah, NJ,
1995, pp. vii–xi, p. vii.
2 Dorothee Birke, Michael Butter, and Tilmann Köppe
Second, the example of the disallowed goal illustrates nicely the two major
effects that counterfactual thinking achieves according to psychologists. To
begin with, counterfactuals achieve a “contrast effect”, as they sharpen the
awareness of an actual state or outcome through the mental juxtaposition
with a possible one. As Roese and Morrison put it: “[A] factual outcome may
be judged to be worse if a more desirable alternative outcome is salient, and
that same outcome may be judged better if a less desirable alternative out-
come is salient”.4 Capello feels the sting of defeat all the more because he can
easily imagine a match in which England equalize and eventually win. He thus
constructs what psychologists call an “upward” counterfactual. For the Ger-
man side, the situation is more complicated. The players and the coach may
mentally construct a “downward” counterfactual in which they lost the game
and therefore feel especially relieved that they haven’t. But they may also con-
struct an upward counterfactual in which there is no disallowed goal so that
their otherwise convincing victory shines all the brighter. Thus, the phantom
goal may increase or decrease their joy. By contrast, what psychologists de-
scribe as the “causal inference effect” refers to the power of counterfactual
scenarios to yield insights into the causes of the actual outcome.5 If FIFA had
been using the video proof, the goal would have been allowed and all subse-
quent discussions about the effect of disallowing it mute, all sides agreed after
the game. And in fact, due to the public outcry over the referee’s mistake,
FIFA announced its intention to reconsider its stance on this issue. Counter-
factual thinking, then, can at least potentially influence future decisions.6
10 Our volume thus covers considerably more ground than the recent special issue
of Historical Social Research dedicated to “Counterfactual Thinking as a Scientific
Method”.
11 Mieke Bal, Travelling Concepts in the Humanities: A Rough Guide. Toronto, 2002, p. 25.
12 Bal, Concepts, p. 13.
Introduction 5
an example of the story-type. Among the major functions that Widmann as-
cribes to counterfactual historical novels are the foregrounding of contin-
gency in historical developments, but also the critique of dominant national
self-images, which are exposed as ideological constructs.
In the feminist fictions that Birte Christ analyzes in her article, counterfac-
tuality is also employed for social critique. Whether these works project al-
ternate societies in which the hierarchy between male and female has been
reversed or abandoned altogether, or imagine individuals who change their
gender, they use counterfactual patterns to expose traditional gender roles
as arbitrary and unjust. Christ shows how the binary logic of “fact” and
“counterfact” serves to playfully reverse and thus question the kind of think-
ing that structures and hierarchizes society according to the binarism of
“male” and “female”. Her analysis of four American feminist works from
the 1910s and the 1970s, respectively, illustrates how these achieve both
contrast effects and causal inference effects, which contribute to two over-
arching functions: “analytical”, i.e. offering an investigation and evaluation
of existing structures of thought and power in American society, and “syn-
thetic”, i.e. offering ways of rethinking or abandoning these structures.
Both Christ and Widmann emphasize the different ways in which novels
with counterfactual plots can achieve the contrast effect: They can either
explicitly compare counterfactual aspects to the actual reality of the reader,
or they can rely on the recipient’s ability to spot the discrepancies. A genre
that typically offers an explicit comparison between two levels of reality, or
worlds, is the time travel narrative that Rüdiger Heinze treats in his contribu-
tion. Counterfactuality becomes central to time travel narratives either when
a protagonist travels back in time and his behavior changes the course of his-
tory, or when she travels forward and is faced with a version of “what might
happen if ”. Heinze looks at three prominent examples – two 19th-cen-
tury novels, Mark Twain’s A Connecticut Yankee in King Arthur’s Court, and
H.G. Wells’ The Time Machine, and the film trilogy Back to the Future from the
1980s. He shows how the time travel trope is, for one thing, used for social
commentary – especially Wells’ and Twain’s works are, in this respect, com-
parable to the works looked at by Widmann and Christ. Moreover, he fo-
cuses on the different notions of temporality and causality that are implicitly
or explicitly endorsed by the texts.
While the previous three articles feature examples where counterfactual
scenarios are in some sense presented as storyworlds in which characters
interact, Robyn Warhol looks at the phenomenon from a different angle: She is
interested in the way in which texts evoke counterfactual alternatives that are
not acted out. Warhol focuses on the level of narrative mediation and traces
10 Dorothee Birke, Michael Butter, and Tilmann Köppe
the use of what she calls “narrative refusals” – references to “what might
have been and yet is not” – in the narrators’ commentaries in Charles Dick-
ens’ novels. She demonstrates how by drawing the reader’s attention to
events that are left out (“unnarration”) or that did not happen (“disnar-
ration”), the novels create counterfactual storyworlds that prompt us to
evaluate the world that is actually represented. Moreover, narrative refusals
referring to characters’ actions contribute to an impression of psychological
complexity. As in the examples analyzed by Daniel Dohrn, in these works
the “factual” to which the “counterfactual” is opposed is no state in the ac-
tual world of the reader, but a fictional reality. Like Dohrn, Warhol proposes
the term “counterfictionality” to clearly distinguish these cases from the
kind of counterfactuality in fiction as it is described by Widmann or Christ.
“Counterfictionality” is also at the center of Richard Saint-Gelais’ contribu-
tion. What distinguishes his use of the term from Warhol’s, however, is that
he uses it in a more specific way, to describe the phenomenon of one fic-
tional text changing the events told in a pre-existing one – as for example
in Jacques Cellard’s novel Emma, oh! Emma!, which retells the story of Mad-
ame Bovary with a different ending. Saint-Gelais examines a wide scope of
texts which feature this kind of counterfictionality, with a special focus on
the question of how they foreground or, conversely, conceal their status as
fiction. As he shows, counterfictionality can be employed to very different
ends. In some cases it may be employed to remind the reader of the artificial-
ity of the textual universe, which can be rewritten. The notion that counter-
fictionality is a quintessentially postmodernist strategy, however, would be
mistaken: In other cases, it serves to strengthen narrative illusion, for
example when the original fiction is “exposed” as a lie and the counterfiction
presented as “what really happened”.13
All five articles on literary studies proceed from an understanding of
counterfactuality that does not conflate it with the general notion of “fic-
tionality”. In other words: the essays do not operate with the notion that all
fictional literature should be regarded as “counterfactual” because it repre-
13 These findings parallel one of the points made by Hilary P. Dannenberg in Coinci-
dence and Counterfactuality: Plotting Time and Space in Narrative Fiction (2008): Far from
being limited to the realm of science fiction or postmodernist fiction, counterfac-
tuality has played an important role in the history of narrative fiction. It has often
contributed to a reality effect: When characters or narrators speculate about what
could have happened, this may “strengthen the impression that the narrative
world is ‘real’ by constructing a further, contrastive ‘less real’ sequence of events
that reinforces the apparent reality of the narrative world” (p. 54). Characters may
appear more complex, narrators more trustworthy.
Introduction 11
sents invented scenarios. At the same time, however, they also show how
productively literary texts employ the interplay between fictionality and
counterfactuality in order to involve the reader in their fictional world, to
make him or her think about the state of his or her actual world, or about the
way in which texts themselves shape out thinking about “reality”. As a con-
sequence, from the perspective of literary studies there are many ways of
looking at counterfactual thinking as an “imagination of alternatives to real-
ity”. Literary scholars may use counterfactual patterns to explore the ways in
which our notions of ‘the real’ are engendered – a concern they share with
historians and social scientists. Or they may, like psychologists, be interested
in the impact of counterfactuals on our emotions and evaluations. For liter-
ary studies, then, ‘counterfactuality’ should prove a particularly valuable con-
cept in the foreseeable future, one that will require and enable scholars inter-
ested in literature to enter into dialogues with colleagues from a broad variety
of disciplines.
12 Andrea Albrecht and Lutz Danneberg
Max Weber warns in Critical Studies in the Logic of the Cultural Sciences that “[t]he
attempt to construct in a positive way what ‘would’ have happened can, if it
is made, lead to monstrous results”.1 In recent years, such attempts have re-
ceived increasing attention, usually less critical than Weber’s, to the extent
that one could speak of a veritable boom of what would now be termed
“counterfactual imaginations”. There exists now a considerable number of
studies in philosophy and in the methodology and history of science on the
functions counterfactual thinking can perform in argumentative contexts,
as well as on the understanding and analysis of concrete examples from
intellectual history and the history of knowledge. Counterfactual imagin-
ations are most often considered in connection with thought experiments,2
1 Max Weber, “Critical Studies in the Logic of the Cultural Sciences”, in: Edward A.
Shils/Henry A. Finch (eds.), The Methodology of the Social Sciences, Illinois 1949,
pp. 113–187, p. 180. We have slightly modified the translation.
2 Cf. Tamara Horowitz/Gerald J. Massey (eds.), Thought Experiments in Science and
Philosophy, Pittsburgh 1991; Wulf Rehder, “Versuche zu einer Theorie von Gedan-
kenexperimenten”, in: Grazer Philosophische Studien, 11/1980, pp. 105–123; Tyler
Burge, “Two Thought Experiments Reviewed”, in: Notre Dame Journal of Symbolic
Logic, 23/1982, pp. 284–292; David Cole, “Thought and Thought Experiments”,
in: Philosophical Studies, 45/1984, pp. 432–444; C. Mason Myers, “Analytical
Thought Experiments”, in: Metaphilosophy, 17/1986, pp. 109–118; David Gooding,
Experiment and the Making of Meaning. Human Agency in Scientific Observation and
Experiment, Dordrecht 1990; Antoni Gomila, “What Is a Thought Experiment?”,
in: Metaphilosophy, 22/1991, pp. 84–92; Roy A. Sorensen, Thought Experiments,
Oxford 1992; Roy A. Sorensen, “Thought Experiments and the Epistemology of
Laws”, in: Canadian Journal of Philosophy, 22/1992, pp. 15–44; D. H. M. Brookes,
“The Method of Thought Experiment”, in: Metaphilosophy, 25/1994, pp. 71–83;
Sören Häggqvist, Thought Experiments in Philosophy, Stockholm 1996; Martin Bunzl,
“The Logic of Thought Experiment”, in: Synthese, 106/1996, pp. 227–240; James
W. McAllister, “The Evidential Significance of Thought Experiment in Science”,
in: Studies in History and Philosophy of Science, 27/1996, pp. 233–250; James W. McAl-
lister, “Thought Experiments and the Belief in Phenomena”, in: Philosophy
of Science, 71/2004, pp. 1164–1175; Verena Mayer, “Was zeigen Gedanken-
First Steps Toward an Explication of Counterfactual Imagination 13
6 For a concept of explication and the conditions of adequacy, cf. Lutz Danneberg,
“Zwischen Innovation und Tradition: Begriffsbildung und Begriffsentwicklung
als Explikation”, in: Christian Wagenknecht (ed.), Zur Terminologie der Literaturwis-
senschaft, Stuttgart 1989, pp. 50–68.
7 Cf. Lutz Danneberg, “Überlegungen zu kontrafaktischen Imaginationen in argu-
mentativen Kontexten und zu Beispielen ihrer Funktion in der Denkgeschichte”,
in: Toni Bernhart/Philipp Mehne (eds.), Imagination und Innovation, Berlin 2006,
pp. 73–100; Lutz Danneberg, “Säkularisierung, epistemische Situation und
Autorität”, in: Lutz Danneberg/Sandra Pott/Jörg Schönert/Friedrich Vollhardt
(eds.), Säkularisierung in den Wissenschaften seit der Frühen Neuzeit, vol. 2, Zwischen
christlicher Apologetik und methodologischem Atheismus, Berlin/New York 2002,
pp. 19–66.
8 Cf. Gregor Weber, “Vom Sinn kontrafaktischer Geschichte”, in: Kai Brodersen
(ed.), Virtuelle Antike. Wendepunkte der Alten Geschichte, Darmstadt 2000, pp. 11–23,
p. 14: “Dabei ist der Begriff ‘kontrafaktisch’, also gegen die historisch-wissen-
schaftlich eruierten Fakten, insofern nicht unproblematisch, als diese Fakten
selbst ja keine objektiven Gegenstände sind, sondern aus einer zufällig auf uns ge-
kommenen Überlieferung herauspräpariert und durch Interpretation in den Rang
eines Faktums erhoben wurden”.
First Steps Toward an Explication of Counterfactual Imagination 15
their adequacy. Only if their type and range of application open the possibil-
ity of a critique, counterfactual imaginations do not seem arbitrary. A condi-
tion for this seems to be a minimal understanding of their characterization
that makes the arguments based on them comprehensible. However, typical
of the discussion of the criteria that can be used to evaluate the quality of
counterfactual imaginations are statements as the following, which is taken
from a recent contribution to scholarly discourse by Richard Ned Lebow:
At the outset, I advance a novel and provocative epistemological claim: that the
difference between so-called ‘factual’ and counterfactual arguments is more one
of degree than of kind. Both rest on assumptions about the world and how it
works and connect hypothesized causes to outcomes by means of a chain of logic
consistent with available evidence.12
Unfortunately, the content of the subsequent elaborations of the subject fails
to live up to the audacity of this far-reaching claim. Lebow thus concedes:
The fundamental similarity between the structure of counterfactual and factual ar-
guments means that many of the criteria for assessing the plausibility of one kind
of argument are appropriate to the other. There are nevertheless additional crite-
ria for good counterfactual arguments, and here we must be careful to distinguish
good from valid counterfactuals. The criteria for a [sic] good counterfactuals says
a lot about their utility for purpose of analysis but nothing about their external
validity.
What and which exactly these “criteria” are, is left to the imagination of the
reader – even in the elaborate example of a counterfactual Mozart imagin-
ation. The conclusion offered is: “External validity can sometimes be tested
on the basis of evidence. Like all propositions, counterfactuals can be fals-
ified but never validated”.13 Even for non-counterfactual propositions this
seems to be a bold assertion.
Although for the multifaceted phenomenon of counterfactual imagin-
ations there is no standard function, although their arbitrariness is not easily
controlled, and although for their variety of forms and usages there can
be no simple answer to the question of what has to be considered a good
counterfactual argument,14 there is still something to be said about their
12 Richard Ned Lebow, “Counterfactuals, History and Fiction”, in: Historical Social
Research, 34/2009, pp. 57–73, p. 67.
13 Richard Ned Lebow, “Counterfactuals”, pp. 67–68.
14 Cf. Philip E. Tetlock/Aaron Belkin, “Counterfactual Thought Experiments in
World Politics: Logical, Methodological, and Psychological Perspectives”, in: Tet-
lock/Belkin (eds.), Counterfactual Thought Experiments in World Politics: Logical, Metho-
dological, and Psychological Perspectives, Princeton 1996, pp. 3–38, p. 16: “Given the
diverse goals that people have in mind when they advance counterfactual argu-
18 Andrea Albrecht and Lutz Danneberg
achievements and their comparative evaluation. Time and again, critical lit-
erature offers six criteria for the evaluation of counterfactual imaginations:
(1) clarity – mostly with regard to the specification of the antecedent and the
consequent of the imagination; (2) logical consistency or cotenability – this
refers to the assumptions that connect the antecedent and the consequent;
(3) historical consistency – this means that there is a minimal deviation of the
shared knowledge claims (“minimal-rewrite rule”); (4) theoretical consist-
ency – this means primarily that the knowledge necessary for the impli-
cations of the counterfactual imagination ought to be consistent with gen-
erally accepted theoretical knowledge claims; (5) statistical consistency –
here the same requirement as in (4) is made with respect to well-established
statistical generalizations; and finally, (6) projectability – informed by a con-
cept of Nelson Goodman, this means that the applied rules of inference are
not contingent, arbitrary generalizations, but essentially law-like operations
that can support projections to the past as well as to the future.15 While these
criteria offer an impetus for the discussion of requirements for the quality of
counterfactual imaginations, they remain geared to certain types of such im-
aginations. Once again, we are relegated to the reconstruction and analysis of
concrete cases.
To sum up: Counterfactual imaginations (1) appear in different types that
can and shall (2) perform different cognitive functions in (3) diverging argu-
mentative contexts. In addition, there are (4) the overarching assumptions of
the respective epistemic situation. The aptitude and quality of counterfactual
imaginations depend on such an epistemic context, and only in relation to
their contexts can imaginations avoid the suspicion of arbitrariness. For the
analytic reconstruction of concrete cases of counterfactual imaginations,
several levels have to be considered: the reconstruction must start with
(i) the counterfactual imagination itself – at whatever point in time –, then
(ii) continue with the assessment of the act of such an imagination, its re-
ception, but also its criticism relative to the given standards of the respective
time, and finish with (iii) the retrospective reconstruction from the present
time. Here, we must distinguish whether the reconstruction is made (iii1) with
a historically given instrument of analysis alone, or (iii2) with an application
of present standards of plausibility, adequacy, and quality for counterfactual
imaginations.
Perhaps one end of a potential scale for measuring the quality of counter-
factual imaginations is not only marked by the requirement that counterfac-
tual imaginations must make a contribution to the acquisition of new knowl-
edge, but also that they are unique in this respect, i.e. that they alone can
make this very contribution. The other end of the scale may be marked by
apologetic usages as mere quasi-argumentative trickeries to fabricate the ex-
post justification of their authors’ claims to knowledge, because a concrete
form of counterfactual imaginations often follows individual or collective
predilections, fantasies, or ideologies. In which form their functions are per-
formed depends, not least, on the shared knowledge and the epistemic situ-
ation of their usage. Hence, it is decisive how they are tied to the respective
knowledge, not only for the historical access to counterfactual imaginations
in intellectual history and the history of knowledge, but also for their present
usage, because this actually determines when a counterfactual imagination
acquires any value by performing a cognitive function. The epistemic situ-
ation ensures that not just any arbitrary counterfactual imagination is pos-
sible and that, perhaps with regard to a given cognitive function, competing
counterfactual imaginations possess different degrees of plausibility.
16 Cf. Lutz Danneberg, “Weder Tränen noch Logik: Über die Zugänglichkeit fik-
tionaler Welten”, in: Uta Klein/Katja Mellmann/Steffanie Metzger (eds.), Heuris-
tiken der Literaturwissenschaft. Einladung zu disziplinexternen Perspektiven auf Literatur,
Paderborn 2006, pp. 35–83; Lutz Danneberg/Carlos Spoerhase, “Wissen in Lite-
ratur als Herausforderung einer Pragmatik von Wissenszuschreibungen: sechs
Problemfelder, sechs Fragen und zwölf Thesen”, in: Tilmann Köppe (ed.), Lite-
ratur und Wissen. Theoretisch-methodische Zugänge, Berlin/New York 2010, pp. 29–76.
17 A recent, but unsatisfying attempt to describe counterfactuals as short stories and
to associate them with Kendall Walton’s concept of truth in fiction is offered by
Seahwa Kim and Cei Maslen: “Counterfactuals as Short Stories”, in: Philosophi-
cal Studies, 129/2006, pp. 81–117. More illuminating is David Davies, “Thought
Experiments and Fictional Narratives”, in: Croatian Journal of Philosophy, 7/2007,
pp. 29–45.
18 John Earman/John Roberts/Sheldon Smith argue against the common view that
ceteris paribus clauses play a significant role in physics and are, therefore, also jus-
tified within other contexts of less exact disciplines (cf. “Ceteris Paribus Lost”, in:
Erkenntnis, 57/2002, pp. 281–301). They also point to the fact that Hempel’s provi-
sos are not to be confounded with ceteris paribus clauses (cf. Carl Gustav Hempel,
“Provisos: A Problem Concerning the Inferential Function of Scientific The-
ories”, in: Adolf Grünbaum/Wesley Salmon [ed.], The Limitations of Deductivism,
Berkeley 1988, pp. 19–36; John Earman/John Roberts, “Ceteris Paribus, There Are
First Steps Toward an Explication of Counterfactual Imagination 21
all,19 and which types can be distinguished.20 Nonetheless, the condition for
an overlap of ceteris paribus clauses with counterfactual imaginations can be
stated independently: It is given when it can be excluded beforehand that
these clauses cannot be satisfied by already existing, shared knowledge.
The problem of the delimitation and overlap of abstractions and idealiz-
ations, on the one hand, and counterfactual imaginations, on the other, is
complicated by the fact that there is a multitude of definitions for abstrac-
tions and idealizations and no generally accepted classification. In addition,
we have to take into account historical definitions. In particular, the various
abstractiones, including the separatio, have a long conceptual history that be-
gins, at the latest, with the Early Middle Ages (admittedly being of antique
descent). Whatever the definition of abstraction turns out to be in each case,
it is always a disregard of something, with the intention not to deceive the
other (abstrahentium non est mendacium), as much as it is the case for counter-
factual imaginations, when the truth is disregarded for a non-deceptive ex-
position. But abstraction is not based on an intentionally false statement, and
this marks its difference to counterfactual imaginations.
Like counterfactual imaginations, idealizations are, in some sense, based
on a disregard for the truth. Here, though, it is for the purpose of simplifi-
cation that something viewed as irrelevant can be disregarded. Although
idealizations are, strictly speaking, neither verifiable nor falsifiable, attempts
to determine their cognitive functions are always viewed as constituents of
a complex scientific modus operandi with respect to which performed or ex-
pected cognitive functions are determined – for example, for the explanation
or prediction of empirical phenomena. At this point, a difference to counter-
factual imaginations can already be seen, for neither unverifiability nor
unfalsifiability imply obvious falseness; and in the case of counterfactual
imaginations, obvious falseness does not just include unverifiability or unfal-
sifiability. What is more, idealizations must either represent the properties of
real entities that are considered to be essential, or they must approximately
reflect real circumstances. Therefore, establishing the cognitive functions of
No Provisos”, in: Synthese, 118/1999, pp. 439–478). See also Nancy Cartwright,
“In Favor of Laws That Are Not Ceteris Paribus After All”, in: Erkenntnis,
57/2002, pp. 425–439; Clark Glymour, “A Semantics and Methodology for Ceteris
Paribus Hypotheses”, in: Erkenntnis, 57/2002, pp. 395–405.
19 Cf. Jim Woodward, “There Is No Such Thing as a Ceteris Paribus Law”, in: Erkennt-
nis, 57/2002, pp. 303–328.
20 Different types of ceteris paribus clauses are distinguished by Gerhard Schurz,
“Ceteris paribus Laws: Classification and Deconstruction”, in: Erkenntnis, 57/2002,
pp. 351–372.
22 Andrea Albrecht and Lutz Danneberg
21 The discussion of these questions started in the middle of the 1970s in the Poznan
School, first and foremost led by Leszek Nowak and Izabella Nowakowa, who sum-
marize their results in: Idealization X. The Richness of Idealization, Amsterdam/
Atlanta 2000. Since then, the number of studies has been exponentially growing;
cf., for the more recent discussion, the series Idealization: Craig Dilworth (ed.),
Idealization IV : Intelligibility in Science, Amsterdam/Atlanta 1992; Martti Kuokkanen
(ed.), Idealization VII : Structuralism, Idealization and Approximation, Amsterdam/
Atlanta 1994. Cf. also Hans Lind, “A Note on Fundamental Theory and Ideal-
izations in Economics and Physics”, in: British Journal for the Philosophy of Science,
44/1993, pp. 493–503; Michaela Haase, Galileische Idealisierung. Ein pragmatisches
Konzept, Berlin/New York 1995; Andreas Hüttemann, Idealisierungen und das Ziel der
Physik. Eine Untersuchung zum Realismus, Empirismus und Konstruktivismus in der Wis-
senschaftstheorie, Berlin/New York 1997; Chang Liu, “Approximation, Idealization,
and Laws of Nature”, in: Synthese, 118/1999, pp. 229–256; Michael J. Shaffer,
“Bayesian Confirmation of Theories That Incorporate Idealizations”, in: Philos-
ophy of Science, 68/2001, pp. 36–52.
First Steps Toward an Explication of Counterfactual Imagination 23
would be true (and no other). At the same time, the possibility of concretiz-
ation or approximation of abstractions, idealizations, and models marks
a central difference to counterfactual imaginations, especially with regard
to their amenability to criticism. Abstractions, idealizations, and models can
be criticized or evaluated by viewing them as gradual and, hence, demand-
ing concretizations. For most counterfactual imaginations such an approach
would be without merit.
24 Fritz Machlup, “The Universal Bogey: Economic Man”, in: Fritz Machlup, Method-
ology of Economics and Other Social Sciences, New York 1978, pp. 283–301; Walter
Eucken, Die Grundlagen der Nationalökonomie, Jena 1940, p. 251.
25 John Stuart Mill, “On the Definition of Political Economy; and on the Method of
Investigation Proper to It”, in: John M. Robson (ed.), Collected Works of John Stuart
Mill: Essays on Economics and Society, Toronto 1963–1991, vol. 4, pp. 309–339, p. 321.
26 Cf. Mary S. Morgan, “Economic Man as Model Man: Ideal Types, Idealization
and Caricatures”, in: Journal of the History of Economic Thought, 28/2006, pp. 1–27;
Manfred Tietzel, “Die Rationalitätsannahme in den Wirtschaftswissenschaften
oder Der homo oeconomicus und seine Verwandten”, in: Jahrbuch für Sozialwissen-
schaft, 32/1981, pp. 115–139.
27 Mill, “On the Definition”, p. 322.
First Steps Toward an Explication of Counterfactual Imagination 25
partial agreement of his abstraction with the economic reality. At the same
time, he insists on the adequacy of the psychological factors that enter in iso-
lated and abstract form into his concept, namely: “desire of wealth”, “aver-
sion to labor, and desire of the present enjoyment of costly indulgences”.28
It is exactly this characterization that stimulated Mill’s critics to denounce
his theory as counterfactual. From the ongoing controversy surrounding
the homo oeconomicus we can only highlight a few exemplary positions. Carl
Menger’s theory of marginal utility, for example, in which the homo oeconomicus
plays a decisive part, is criticized by Karl Knies on the grounds that it takes
into account only self-interest but neither “Gemeinsinn” (sense of commu-
nity) nor a “sense for justice and equitability”, even though the latter are
also “facts” rather than “fiction”.29 Menger replies to this criticism that no
national economist ever claims that “men are actually guided only by self-in-
terest”.30 The concept is just supposed to represent “one of the most import-
ant sides of human life”.31 With this argument, Menger does endorse self-in-
terest as an actual motive for economic behavior, but, at the same time,
legitimizes the concept under criticism by referring to the essentiality of the
abstracted factor. Shifting the perspective to the methodology, i.e. the choice
of the aspect under which an economic theorist considers his object, he pre-
pares for a new, formal conceptualization of the homo oeconomicus that com-
petes with Mill’s abstraction as well as with the holistic model of the historical
economists. According to the formal conceptualization, the homo oeconomicus
is, as, for example, Max Weber argues, a “constructed economic agent” that
is deliberately “in opposition to the empirical man” because he is ascribed
“feigned” qualities such as “economic omniscience”, “absolute ‘efficiency’”,
and “leisureless productivity”. This “mathematical ideal type”32 has little
in common with Mill’s “instinct-driven” economic man. Conceptualized as
homo rationalis, he acts rationally in a formal sense, that is, guided by a rational
strategy that can, in principle, be reconstructed by the economic theorist
without prior need for a closer psychological or materialist specification.33
But this conceptualization does still not meet the demands of the historical
school for a holistic, empirically saturated consideration and can, thus, again
be disqualified as a counterfactual assumption. It is the “naive doctrine that
needs”34 “the silly and objectionable fiction of the homo oeconomicus”,35
Friedrich Gottl-Ottlilienfeld quips in 1931 against Weber and his followers.
According to Gottl-Ottlilienfeld, such a conceptualization curbs “Mr. Every-
man” until “his soul withers in the pursuit of that highest profit, presented by
this principle” – a “vain act of violence against reality”.36 Or, as Götz Briefs
writes, it is a disregard of the German reality of 1915, a time of pronounced
Anglophobia, claiming that Mill’s point of view is a factually correct descrip-
tion, albeit only of English society:
The old classics believed to have solid ground under their feet, when they assumed
self-interest and the other basic premises, which in their eyes were corroborated
by experience, as foundation of their inquiries; premises, the partly subtle, partly
coarse and brutal realities of which everybody had experienced in the England of
their time.37
For the contemporary German economy, however, this description is alleged
to be false – an argument that saved the dignity of the English economic
theorists, while confirming the image of the English “folk character” (“Volks-
charakter”) and separating it from the German one. The “pallid ‘homo oe-
conomicus’”, as Gerhart von Schulze-Gaevernitz suggests one year later,
must be adjusted to the German situation, that is, the model must be plural-
ized and adapted to historical and cultural differences.38
But a further twist complicates the controversy: Beyond the criticism of
its reality and validity, the homo oeconomicus has also been criticized as a model
that was used to promote a certain ideological agenda, insofar as the counter-
factual assumptions about the (self-interested, rational) behavior of the
economic agents shape the factual (altruistic, instinctive-folkish) behavior,39
and thus advance a liberal, capitalistic economic order. This form of criti-
cism was still en vogue after World War II . Similarly, notions analogous to the
homo oeconomicus (e.g. the homo sociologicus) are often criticized or denounced
with regard to their alleged ideologically-inspired counterfactuality. Friedrich
Tenbruck, for example, complains about sociology’s “unprecedented new
form of dehumanization”40 that replaces the individual with the homo sociologi-
cus: “Person and culture are – it must be said poignantly – annihilated by the
social sciences”.41
In summary, it becomes clear that the identification of a cognitive
formation as abstraction, model, fiction, or counterfactual imagination is
highly context dependent and based, within its epistemic situation, on the-
oretical premises and argumentative strategies that must, in each case, be re-
constructed and evaluated, if the cognitive functions are to be determined
and taken seriously.
Let’s move on to the second example. In his “Letter to Mr. Werner in
Gießen, concerning the Newtonian theory of light” [“Schreiben an Herrn
Werner in Gießen, die Newtonische Theorie vom Licht betreffend”] Lich-
tenberg writes quizzically: “These are dreams, novels, that in effect are best
refuted by writing another novel […] therein they equal the Maybe of some
philosophers which by a single maybe is not immediately knocked over”.42
The conflicting counterfactual imaginations to which Lichtenberg alludes are
those in which the antecedent and the cognitive function essentially agree
but in which the consequent differs. Counterfactual imaginations can be criti-
cized and improved directly, but they can also be confronted with an oppos-
ing imagination. As it is the case with thought experiments, they can enter
into a competition against each other.
In his 1935 retrospective of “Goethe Veneration in Five Decades”
[“Goetheverehrung in fünf Jahrzehnten”] the German scholar Julius Pe-
tersen (1878–1941) maintains that “the day of Potsdam”, when Hitler sym-
bolically bowed before the Prussian tradition, meant “a rejection of the false
spirit of Weimar, but not a defection from Goethe”. Instead, Goethe stays
“in honor as purest and richest embodiment of the German character, as
greatest custodian and keeper of our primordium, as Alfred Rosenberg calls
him”.43 And yet, Petersen must also mention the irksome circumstance that
Goethe “has not led the way into battle”.44 In Petersen’s assessment “we
cannot avoid the question of what Goethe’s position would have been on
the enormous transformations his people has undergone in recent years”45 –
a welcome occasion for a counterfactual imagination. But what knowledge
of Goethe would be relevant answering such a question? To what knowl-
edge can we take recourse in order to imagine Goethe not merely as a con-
temporary of a certain time, but to deduce a specific Goethean reaction? Pe-
tersen relies on Goethe’s “patriotic feeling”, determining this “feeling” by
an anecdote that describes Goethe’s behavior in a particular situation that,
after being decontextualized, gives a hint about how he would have “felt”
after 1933:
As in the spring of 1813 at the Elbe, when he blessed the weapons of the Lützow
rangers who were to move into the battle for freedom, he would not have denied
the salute to the black-clad companions and brown-clad comrades [SS and SA ],
who were ready to sacrifice themselves 120 years later for the inner liberation of
Germany. As back then, when he was converted by the miracle of the uprising of
the people and saw in the liberation of Germany things he had not thought to be
possible but hoped for deep down in his heart, he would have stood in awe before
the awakening of the people’s force reaching a goal that lay in his farthest hopes.
He had himself not believed to ever witness the day when all Germans would feel
as one and they too would achieve what other peoples had long been granted: the
building of a nation.46
47 Ferdinand Lion, “Goethesche Politik”, in: Maß und Wert, 1/1937–38, pp. 764–782,
p. 775 (and cf. Lion, Geist und Politik in Europa. Verstreute Schriften aus den Jahren
1915–1961, Heidelberg 1980, pp. 174–192). Translations are ours.
48 Lion, “Goethesche Politik”, p. 771.
49 Lion, “Goethesche Politik”, p. 771.
50 We would like to thank Matthew Handelman for his assistance in the translation
of this article.
30 Tobias Klauk
I. Introduction
In this paper, I will be concerned with the thesis (LT ) that in writing and read-
ing literature we typically conduct thought experiments as well as with the
consequences of comparing literature and philosophical thought experi-
ments. Part II of the paper is designed to show that (LT ) is false, despite many
interesting similarities between literature and thought-experimental scen-
arios. Parts III and IV will explore two of the similarities that might prove
valuable for scholars. But first, I will start by explaining what is meant by (LT ).
Many people in literary studies believe that writing and reading literature
has something to do with conducting thought experiments. After all, what
could be more self-evident? In reading, it seems, we are imagining scenarios
that are quite different from the way things actually are. And to imagine a
counterfactual scenario is simply to conduct a thought experiment. To argue
this way is perfectly all right. However, such an argument hinges on a very
broad understanding of the term “thought experiment”. If by a thought ex-
periment we understand any cognitive action that somehow includes de-
scribing or imagining things differently than they actually are, (LT ) is surely
true. Sadly, it then turns out to be a fairly uninteresting thesis. Not only does
everyone believe it anyway, it also does not shed light on any interesting
topic. Accordingly, to sharpen the thesis, I will take “thought experiments”
to mean something more specific. Roughly, my use stems from philosophical
discourse, and I will explain it below. Again, this is not to say that it is wrong
to use the term “thought experiment” in the wider sense. But if we are will-
ing to engage in the more specific use of the term, we might actually learn
something. Unfortunately, even the sharpened thesis is, in and of itself, still
uninteresting – a characteristic it shares with most classificatory theses. The
trick is to put the thesis to good use. Therefore, parts III and IV will exploit
(LT ) – or what is still left of it by then – to transport ideas from philosophy
over to literary theory.
In order not to get confused, it will be important to distinguish between
two types of phenomena that are frequently described as counterfactual
thinking. One phenomenon is using counterfactual conditionals, the other
imagining a counterfactual scenario. Let me make the difference clear: When
Thought Experiments and Literature 31
1 For Ernst Mach’s take on experiments and thought experiments, see his chapter on
thought experiments in Erkenntnis und Irrtum, 4th ed., Leipzig 1920, pp. 183–200.
32 Tobias Klauk
find out what happens. So even if the simulation idea was fruitful for scien-
tific thought experiments, it still would not translate to literature. But if it
does not apply to thought experiments in the first place, it is of no relevance
for (LT ). Either way it does not help us in understanding literature, and we
should look for a more fruitful idea.
I also refrain from talking about experiments that are thought experi-
ments only by name. The following is a good example from a popular,
though not scientific, linguistic book: David Crystal states that speakers of
English like two- or three-syllable words with m, n, l, and r in them better
than one-syllable ones with p, k, and g in them.
You can do linguistic thought experiments to see whether these tendencies work.
Imagine you are in a space-ship approaching a new planet, Xarg. All you know
about Xarg is that it contains two races, one friendly towards Earth-people, the
other antagonistic. One race is called the Lamonians. The other is called the Ga-
taks. Which do you think is the friendly race?2
Despite Crystal’s explicit statement to the contrary, this is not a thought
experiment at all. It is not enough to ask yourself if you think whether the
Lamonians are friendlier than the Gataks. Only a “real” experiment that con-
fronts a significant number of speakers of English with a significant number
of examples can establish the thesis mentioned above. Crystal relies on the
assumption that his readers find an answer for themselves and that every
reader finds the same answer. Again, reading or writing literature obviously
does not work like conducting experiments and (LT ) would become trivially
false if we defined “thought experiments” as “real experiments that have
been labelled thought experiments”.
Philosophical thought experiments are diverse, and one might easily
despair when confronted with the sheer mass of phenomena we tend to call
“thought experiments”. But it is not impossible to find some general rules
that will allow us to find out if (LT ) is true or not. Philosophical thought ex-
periments typically have three steps:3
(a) We imagine a scenario.
(b) We judge the scenario concerning a certain question.
(c) We make use of this judgement.
steps (b) and (c), we do not have a thought experiment. This is especially
important since there is a loose way of talking about philosophical thought
experiments where a scenario alone counts as a thought experiment. The
situation is quite similar to what we encounter in the case of literature. But
although we sometimes talk that way, it is not helpful for our project at hand.
On the contrary, it obscures two important elements of thought experi-
ments, namely steps (b) and (c).
will not work unless the story […] is more fully worked out than our usual
examples”.7
Establishing a possibility clearly counts as step (c) of the thought-experi-
mental schema I have given above. In fact, some philosophical thought ex-
periments work just that way. Establishing a possibility is not as easy as re-
futing one, where we could come up with a counterexample. Instead, one
promising way to proceed is to explore the counterfactual consequences of a
scenario and show that we cannot establish contradictions where they seem
most probable. If a story was written or read with the aim of showing that
something is possible, we would have a literary thought experiment. But
again, this seems to be the exception, not the rule. Therefore, as long as we
do not want to claim that one of literature’s central aims is to provide other
disciplines with scenarios, we should not say that literature usually conducts
thought experiments. Step (c) is – most of the time – just missing. There are
certainly texts that are intended by their authors to be literary thought ex-
periments in the strict sense we are concerned with here. And it is always
possible to take a literary scenario and turn it into a thought experiment. But
the idea that in reading or writing literature we typically conduct thought ex-
periments has proven false. (LT ) is wrong.
Where does this leave us? We can already see that one should be care-
ful about linking literature to thought experiments. As soon as we define
“thought experiment” more precisely in the sense that only nonfactual scen-
arios are involved, differences between literature and thought experiments
begin to show. As I mentioned in the beginning, apart from this cautious
note, an answer to the question if (LT ) is true or not has limited cognitive
value. But we did find similarities between philosophical thought experi-
ments and literature in steps (a) and (b). And we can now start to exploit these
similarities. I will give two examples of how to make the comparison fruitful.
We know a lot more about the three steps of thought experiments than
I have revealed so far. The comparison of thought-experimental scenarios
and literary texts allows us to transport ideas from one area to the other. Let
us forget about step (c), since we generally find it missing in literature. The
matter is different with steps (a) and (b).
Take a second look at step (a). Imagining a scenario might involve feeling
how it is to be in the scenario. This is an element that is sometimes involved
in ethical thought experiments, and it is certainly present in our reading of
many literary texts. Exploring this similarity and explaining how feeling with
somebody can be an essential element in understanding a scenario seem to
be interesting projects to me, but I will not concern myself with such aspects
here.
Imagining a scenario might also involve seeing something before the
mind’s eye. Other senses might be involved, though typically to a lesser
extent, since we are rather visual beings. If I am not mistaken, there is a
further distinction between philosophical thought experiments and litera-
ture that goes back to these “pictures in the mind”. In philosophy, the inner
pictures are almost always unimportant, as a test going back to Descartes
shows: He imagines seeing a polygon with 1000 angles. Then he imagines
seeing a polygon with 1001 angles. The pictures before the mind’s eye, he
says, look exactly the same.8 What distinguishes imagining those two things
is just our understanding of the two phrases “1000 angles” and “1001
angles”. Therefore, inner pictures do not add to our understanding of the
difference, since the two pictures are the same. Philosophical thought
experiments mostly are similar to this case. Inner pictures play a minor role,
if any.
It seems to me that literature works differently. The power of literature
often consists exactly in letting us live through certain events, to conjure up
pictures, sounds, emotions. It is a major difference for most people if they
read a scientific book on some phenomenon, or read a novel in which
the phenomenon is described. Literature can give a feeling of “this could
happen to me”, which can be a joyful or even a very disturbing experience.
Again, to explain how exactly this aspect of reading literature works is
not my project. Instead, let us focus on a similarity between philosophical
thought experiments and literature. To conceive of a scenario at the least
means, as I pointed out above, to describe the scenario consistently or to
understand a consistently described scenario. No inner pictures need to be
involved in this process. Conceiving of a scenario in this sense can still fail.
I cannot imagine a scenario in which I have just one body (of roughly its ac-
tual size) and am in Göttingen and at the same time in Freiburg. Such a scen-
ario is not consistently describable. As one can see, consistency here means:
9 I leave aside the further complications that arise when we acknowledge that there
are different kinds of possibility. For example, a scenario might be logically pos-
sible, but still nomologically impossible, i.e. impossible according to the laws of
nature. The general point I am trying to make is not affected by this.
10 To avoid misunderstanding and confusion: I do believe that the possible-worlds
theory of fiction has serious flaws. The main question is if the theory can be made
fruitful at all. Here, I am only concerned with the rather technical question if the
spirit of the theory can be saved against the two main arguments.
11 For an excellent introduction into non-classical logic, see Graham Priest, An Intro-
duction to Non-Classical Logic, 2nd ed., Cambridge 2009.
Thought Experiments and Literature 39
12 Cf. Hilary Putnam, “The Meaning of ‘Meaning’”, in: Hilary Putnam, Mind, Lan-
guage and Reality, Philosophical Papers, vol. 2, Cambridge 1975, pp. 215–271.
13 Platon, Politeia, sec. 359b–360c. Socrates dissociates himself from the thought
experiment given by Glaukon in the dialogue; a complication we can safely ignore
here.
14 Cf. for example Kathleen V. Wilkes, Real People: Personal Identity without Thought
Experiments, Oxford 1988, p. 11.
40 Tobias Klauk
physically impossible, since there are enough scenarios of its type which are
not.15 We can even invent logically inconsistent scenarios. Imagine Hume
proving that (p f Gp). Would that have made him famous? How would the
scientific community have reacted? The scenario is logically impossible and
therefore inconceivable. Nobody can prove that (p f Gp). But we can take
the scenario as a stand-in for some other, consistent scenario in which
Hume made a spectacular logical or mathematical discovery, and we have a
good notion of what would have happened then. So we need to distinguish
between scenarios for which there exist similar possible or actual scenarios
and scenarios for which there do not exist such substitutes. As long as we
believe that there are consistent substitute scenarios, we might as well keep
working with the flawed scenario.
How does this apply to literature? It is quite a triviality that in literature we
are especially interested in scenarios that bear resemblance to our life in rel-
evant aspects. But that is just another way of saying that in literature there
often is a consistent substitute scenario that is in relevant aspects similar to
the scenario of the literary text. And if we allow ourselves to use the instru-
ment of possible-world analysis in philosophy when such inconsistent but
repairable scenarios are used, why not in literature? In the light of this, it
seems to me that the impossibility argument against the possible-worlds the-
ory of fiction loses much of its appeal.
Now someone could grant all that, but still insist on the impossibility ar-
gument. Saving the possible-worlds theory of fiction against this argument,
one could say, is not a matter of showing that some inconsistent scenario can
be exchanged. The original idea of the theory was that fictional scenarios are
possible worlds. And that is still wrong, no matter how much substitution we
perform.
I think this reminder is correct. Under standard logic, fictional scenarios
are not possible worlds, although there might be ways to make even this the-
sis work by using non-standard logic with impossible worlds. I will not pur-
sue that strategy here. Instead, I want to point out something else that is
strange about the possible-worlds theory of fiction as its opponents (and
possibly many of the theory’s adherents) understand it: That something is a
possible world is not much of an explanation at all, since there is a constant
philosophical discussion about what exactly possible worlds are. Such an
Sometimes we need to go to a very abstract level and say that what is shown
in a play, seemingly full of impossibilities, are really the fears and wishes of
the protagonists. So there is one level at which the text describes an impos-
sible scenario. But, on a different level, we can always find a perfectly pos-
sible scenario. As soon as we think that a text really is just inconsistent, we are
not interested in it anymore. I think this is another feather in the hat of the
possible-worlds theorist. What a text really is about, it seems, will always be
possible in some sense.
Holmes.17 This shows that in reading we are typically not interested in the
material conditional.
But the same holds for the strict conditional. “Necessarily, if Sherlock
Holmes orders Watson about, then he is silently in love with Watson” is false.
We can think up many scenarios in which Sherlock Holmes orders Watson
about and is not secretly in love with him. What we want to know is if,
given that in the stories of Conan Doyle Holmes orders Watson about, those
worlds in which he is silently in love with Watson are closer to ours than
those in which he is not. And those are precisely the truth conditions of the
counterfactual conditional.
If that is true, then a point about literature follows. In order to know if
a counterfactual is true, it is not enough to know the scenario. We also
have to know many things about our world. In this case, we need to know
something about psychology and how people who are secretly in love typi-
cally act.
The reason for our need for background knowledge is that we do not get
single counterfactual premises. Here is an example: If I was not here, lots of
other things would also change. The space where I am now would be filled
by air molecules. People looking at the space where I am now would be look-
ing at the wall behind me. Maybe somebody would miss me, wonder where
I am, or bemoan my sudden illness. In any case, people would have lots of
different beliefs.
What exactly would have to be different, if I were not here? When we
judge what would be the case, given that I am not here, we try to stay as close
as possible to our world, i.e. we try to hold as many sentences true as pos-
sible. That is what we call a conservative judgement, and it means that
in using counterfactuals we always rely on background knowledge. David
Lewis brought up the question if in evaluating fictional truths we try to stay
as close as possible to our background knowledge, to the background beliefs
of the author, or the background beliefs of his intended readers. Actually,
I believe that there is no fixed set of background beliefs that would be ap-
propriate to all interpretations, since I believe the polyvalence thesis to be
right. But no matter which kind of background beliefs you choose, the point
remains that we use counterfactual conditionals to evaluate literary scenarios
17 You might want to argue that the sentence is not false but has no truth value,
since a precondition for its being true or false is not fulfilled, namely that Sherlock
Holmes exists. Indeed this idea has some advantages. But the point stands: We do
not use material conditionals in answering questions about the text, since no ma-
terial conditionals then would have truth value.
44 Tobias Klauk
18 Cf. Hilary Putnam, “Literature, Science, and Reflection”, in: New Literary History,
7/1976, pp. 483–491, p. 488.
Counterfactual Explanation in Literature and the Social Sciences 45
I. Explanation in Fiction
When reading literary fiction, it is natural to ask such questions as “Why does
Ahab chase Moby Dick?” The right kind of answer is “because Moby has
hurt him”. In some accounts, this explanatory “because” must be sustained
by a counterfactual: “If Moby had not hurt Ahab, Ahab would not pursue
Moby”. However, this intuitively true counterfactual does not fit the stan-
dard analysis of counterfactuals:
A counterfactual of the form ‘If it were that , then it would be that ’ is non-
vacuously true if some possible world where both and are true differs less
from our actual world, on balance, than does any world where is true but is
not true.4
It is completely open whether a world in which Moby has not hurt Ahab
and Ahab does not pursue Moby differs less from the actual world than any
world in which Moby has not hurt Ahab and Ahab pursues Moby. What
seems relevant to evaluating the counterfactual is closeness not to the actual
world but to the world(s) which make(s) the fiction true.5 I propose to define
counterfictionals in the same way as counterfactuals, with the difference that
the worlds which make the fiction true play the role the actual world plays in
counterfactuals:
A counterfictional of the form ‘According to the fiction F, if it were that , then it
would be that ’ is non-vacuously true if some possible world where both and
are true differs less from the world of F (i.e. the world in which every sentence Y is
true that figures in a true sentence of the form ‘in the fiction F, Y’) than does any
world where is true but is not true.6
Counterfictionals are not confined to explanation. Normally, a subjunctive
conditional in a piece of fiction is a counterfictional. And some fiction, for
instance rewriting Moby Dick so that Ahab is saved from drowning, may be
interpreted as a piece of counterfictional reasoning with regard to the world
of a fiction, in this case Moby Dick.
However, what interesting facts can be learnt from explanation in fiction?
How do we proceed from explaining in fiction to explanation by fiction?
There must be some mechanism of transferring explanation in fiction to the
actual world. Counterfictional explanatory relationships may make us aware
of corresponding counterfactual relationships. I want to draw a parallel to
the way counterfictionals in scientific thought experiments relate to counter-
factuals which figure in explanations of actual facts. In some pieces of fic-
tion, which I would like to refer to as cognitive fictions, counterfictionals re-
late to actual-world counterfactuals in just this way.
4 David Lewis, “Truth in Fiction”, in: Philosophical Papers 1, Oxford 1983, pp. 261–280,
p. 269.
5 Cf. Lewis, “Truth”, pp. 270–273, and Daniel Dohrn, “Counterfactual Narrative
Explanation”, in: Journal of Aesthetics and Art Criticism, 67/2009, pp. 37–47,
pp. 41–42.
6 An intricate issue is whether in evaluating a counterfictional we interpret words
like “water” according to our use in non-fiction or according to how they are used
in the world of the fiction. For instance, if water in the world of the fiction is not
H2O, the first alternative in contrast to the latter yields that water stands for H2O.
Counterfactual Explanation in Literature and the Social Sciences 47
7 Noël Carroll, “The Wheel of Virtue: Art, Literature, and Moral Knowledge”, in:
Journal of Aesthetics and Art Criticism, 60/2002, pp. 3–23, p. 12.
8 Carroll, “Wheel”, p. 12.
9 Carroll, “Wheel”, p. 14.
10 Carroll, “Wheel”, p. 19.
11 Carroll, “Wheel”, p. 9.
12 Carroll, “Wheel”, p. 15.
48 Daniel Dohrn
13 Carroll, “Wheel”, p. 8.
14 Cf. Timothy Williamson, The Philosophy of Philosophy, Oxford 2007.
15 “What one gets from a text is not a concept (‘justice’) or a claim (‘justice is for the
weak’) but the experience of what it would be like to be a specific individual suf-
fering a specific injustice. This is a cognitive gain.” (Scott R. Stroud, “Simulation,
Subjective Knowledge, and the Cognitive Value of Literary Narrative”, in: Journal
of Aesthetic Education, 42/2008, pp. 19–41, p. 32).
16 James McAllister, “Thought Experiments and the Belief in Phenomena”, in: Phil-
osophy of Science, 71/2004, pp. 1164–1175, p. 1168.
Counterfactual Explanation in Literature and the Social Sciences 49
B1 and B2, with the masses m1 and m2, and m1 > m2, joint together so as
to yield one body B12 with mass m1 + m2. In the Aristotelian view, if B12 is
dropped under ideal conditions, it falls faster than B1 and B2 as its mass is
bigger, but its speed ranges in the middle between B1 and B2 because B12
combines their natural velocities. The problem this poses can be avoided by
regarding speed as independent of weight.17
Comparable with Carroll’s view that philosophy deals with conceptual
analysis, Tamar Gendler contends that scientific thought experiments are,
like philosophical thought experiments, designed to test our tacitly known
conceptual commitments in imaginary cases:18
What the Galilean does is provide the Aristotelian with conceptual space for a
new notion of the kind of thing natural speed might be: […] there was no room on
the Aristotelian picture for the thought that natural speed might be constant, not
varying […], that it might be dependent not on some specific features of the body
in question, but only on the fact that it is a body at all. After contemplation of the
case, there seems to be no conceptual space for the view that it might be variable.19
The disagreement between Galilei and the Aristotelian pertains to a fact
about speed: Does natural speed depend on the specific features of the body
in question or not? If Gendler is right, Galilei and the Aristotelian must take
this issue to be decided by our concept of speed. But Gendler gives no further
argument for her conceptual interpretation.
For this reason, I prefer McAllister’s analysis. The Aristotelians’ aim of
minutely describing how physical bodies behave in actual situations conflicts
with Galilei’s metaphysical background convictions:
The world contains causal factors of two kinds: phenomena and accidents. Phe-
nomena are universal and stable modes in which physical reality is articulated. Ac-
cidents, by contrast, are local, variable, and irreproducible. Whereas phenomena
account for the underlying uniformities and invariances of the world, accidents
are responsible for the great variability of natural occurrences […]. Mechanics, for
Galileo, aims solely to identify and describe phenomena; no scientific knowledge
of accidents is possible in his view.20
17 Cf. Tamar Szabó Gendler, “Galileo and the Indispensability of Scientific Thought
Experiment”, in: The British Journal for the Philosophy of Science, 49/1998,
pp. 397–424, p. 403.
18 For competing accounts cf. John D. Norton, “Are Thought Experiments Just
What You Thought?”, in: Canadian Journal of Philosophy, 26/1996, pp. 333–366,
pp. 341–342; James Robert Brown, The Laboratory of the Mind: Thought Experiments in
the Natural Sciences, London 1991, pp. 76–77.
19 Gendler, “Galileo”, p. 412.
20 McAllister, “Thought Experiments”, p. 1167.
50 Daniel Dohrn
The actual scientific practice of doing frequentist statistics with actual cases
inevitably contains an element of thought experimentation, which allows to
implement suitable idealization.
25 There is a parallel to Daniel Cohnitz’s “view that, in fact, the function of thought
experiments in philosophy is to prepare the common ground upon which theories
to resolve philosophical puzzles are to be constructed” (“Poor Thought Experi-
ments. A Reply to Peijnenburg and Atkinson”, in: Journal for General Philosophy
of Science, 37/2006, pp. 373–392, p. 389). Cohnitz, however, restricts his view to
coordinating philosophical concepts, whereas I also emphasize coordinating
implicit dispositional knowledge.
Counterfactual Explanation in Literature and the Social Sciences 53
among others. Again, this is best achieved in a series of slight changes from
ideal conditions to non-ideal ones. Both series must be guided by the implicit
abilities which guide idealization.
strikes, the greater the value of the explained variable human flourishing; the
more an individual inclines to one extreme, the farther the explained variable
diverges in one direction from its maximum. Note that it would be prepos-
terous to claim that such a relationship lies in our concept of virtue. As in
science, Forster starts from an empirical observation – his experience of life
and society – and smoothes and polishes away interfering conditions in
order to achieve an ideal relationship which explains human behaviour under
suitable background conditions. He invites us to adapt our capacities of
smoothing and polishing in our apprehending social situations and to re-
verse his thought experiment so as to apply to real situations.30
Philosophical thought experiments can also be interpreted along these
lines: Carroll’s virtue tableau is an ethical thought experiment. As an alter-
native, consider Gettier examples. The definition of knowledge as justified
true belief can be seen as an explanatory relationship. One knows that p in
virtue of justifiably and truly believing that p (of course, the relationship is
not a causal one and it does not compete with a causal explanation). This
relationship allegedly holds over all normal cases of true belief and justifi-
cation.31 The intervention variable is the justification.32 Gettier’s example
can be described by a counterfactual which states a case of true belief which
is clearly justified under suitable background conditions, but which does
not amount to knowledge: For instance, I know that the faculty’s dean pos-
sesses a Ford because I have seen her buying one. But consider the following
counterfactual variation: If, unbeknownst to me, she had bought the Ford
for her daughter but happened to possess another one herself, I would have a
justified belief without knowledge.
But is there a polishing procedure comparable to Galilei’s? It is easy
to add surrounding conditions which destroy the evidential achievement
The role of explanation in the social and historical sciences is beset by two
main problems:
(i) the problem of laws,
(ii) non-cognitivism about narrative form.
I argue that these problems can be assuaged by the account proposed.
I begin with outlining how interventionism allows to cope with the problem
of laws.
33 Williamson, Philosophy, p. 185. When students are asked to consider the numbers
problem (faced with the tragic choice to either save one person or three other per-
sons from death, how are we to decide?), they tend to bring in additional assump-
tions about the individual persons at stake: What if one person were Jack the
Ripper and another Florence Nightingale? It proves difficult to convince them
that that this obscures the original problem.
Counterfactual Explanation in Literature and the Social Sciences 57
Narrative (Non-)Cognitivism
According to the “naïve” notion of science, it represents things as they are,
and it provides true explanations of them. Especially in the aftermath of
Hayden White and Louis Mink’s influential metahistorical studies, the role of
narratives has served to question to the validity of this view for the historical
and social sciences. Like Velleman, White insists that the emplotment of his-
torical facts in a narrative serves to create a feeling of familiarity.37 It is com-
pletely open whether this feeling correlates with a cognitive achievement. One
problem is that White views historical narratives only as convincing if they
seem true: “The rhetorical force of historical prose usually depends upon the
single solution, the true presentation of the past”.38 When we look through
the mechanisms of historical prose and realize that it is not true, we should
come to deem it unsatisfying. For this reason, White’s view is not reflection-
proof. Once we adopt it, we should cease to accept historical narratives.
I want to use this account to counter non-cognitivism by exposing the
cognitive function of narrative. Conjoining facts and well-regimented fiction
39 Cf. Noël Carroll, Beyond Aesthetics, New York 2001, pp. 118–133.
40 Cf. Rolf Zwaan, “Situation Models: The Mental Leap Into Imagined Worlds”, in:
Current Directions in Psychological Science, 8/1999, pp. 15–18.
41 Cf. Alison Gopnik/Clark Glymour/David Sobel/Laura Schulz/Tamar Kushnir/
David Danks, “A Theory of Causal Learning in Children: Causal Maps and Bayes
Nets”, in: Psychological Review, 111/2004, pp. 3–32.
Counterfactual Explanation in Literature and the Social Sciences 59
basic narratives. We may ask why the conventions guiding thick narratives
have developed. One answer is that they try to capture what is especially rel-
evant from a human standpoint. To put it with Lewis Coser citing Henry
James: “there is no impression of life, no manner of seeing it and feeling it, to
which the plan of the novelist may not offer a place”.42
The novelist’s plan may serve to organize impressions of life, seeing, and
feeling.43 To be sure, I do not want to deny that literature often liberates
us from the bounds of cognitive tasks and rationality. Often artistic form
may be used as a tool of such liberation. And often it may be used to convey
prejudice or satisfy a simplifying sense of fit. But so far nothing rules out
that thick narratives may serve, among others, objective cognitive tasks. This
rudimentary account gives rise to a classification of narratives along the de-
marcation between fact and fiction.
ent from the Galilei case. We want to understand an individual event, but in
order to do so, the individual event must be considered as normal as possible
within the constraints imposed by known fact. Normality here is character-
ized with regard to a process of polishing those features that distinguish the
case from a paradigmatic application of folk causal reasoning. The filling
procedure allows to match actual facts on the one hand and candidates for
explanatory relationships and the ideal conditions of their obtaining on the
other hand. In filling the gaps in the situation of Louis XIV as we know it
with ideal conditions, we assess how probable it is under such circumstances
that his detrimental behaviour caused his unpopularity. Given basic nar-
ratives are supplemented so as to correspond to further, more demanding
explanatory relationships.
The historical narratives I have discussed combine fact and fiction. There
is another sort of narrative which eschews fictional parts: “purely factual
narrative”, as in a protocol. There are also “pure” thought experiments in the
social sciences, which are not strictly bound by actual facts but partially revise
them so as to yield ideal conditions, as Galilei’s thought experiments did.
To R. G. Collingwood, historiography as opposed to fiction is bound by evi-
dence and consistency.45 I suggest to somewhat blur Collingwood’s distinc-
tion. There may be fiction which revises evidence and nevertheless con-
tributes to understanding history. And as scientists may consider different
polishing procedures leading to inconsistent explanatory relationships, his-
toriographers may supplement given facts by several mutually inconsistent
pieces of fiction in order to explore by thought experiment how putative ex-
planatory relationships fare. The inconsistency arises from supplementing
the same facts by diverging ideal conditions. All these seemingly disparate as-
pects – a keen strive for accuracy, fictional elements, mutually inconsistent
fictions – can be found combined in historiography, for instance in the work
of the ancient Chinese historian Ssu-ma Ch’ien.46
A salient contemporary example is Monica Ali’s Brick Lane, which arguably
“has contributed to wider public understanding of development in ways that
no academic writing ever has”.47 Ali acknowledges her direct inspiration by
45 Cf. Robin George Collingwood, The Idea of History, Oxford 1946, p. 246.
46 Grant Hardy, “Can an Ancient Chinese Historian Contribute to Modern Western
Theory? The Multiple Narratives of Ssu-ma Ch’ien. History and Theory”, in:
Studies in the Philosophy of History, 33/1994, pp. 20–38, p. 34.
47 David Lewis/Dennis Rodgers/Michael Woolcock, “The Fiction of Development:
Literary Representation as a Source of Authoritative Knowledge”, in: Journal of
Developmental Studies, 44/2008, pp. 198–216, p. 208.
Counterfactual Explanation in Literature and the Social Sciences 61
social research about Bangladeshi women workers.48 We may see Ali’s nar-
rative as a thought-experimental supplementation of empirical developmen-
tal science. Ali describes the counterfactual ideal-typical fates of two sisters
working in Dhaka and London. Picturing two sisters makes one prone to im-
agine their fates switched. It allows to abstract from certain background and
personal differences and to focus on the general social structure.49
1 This situation also has uncomfortable aspects, in particular, not everything that
we find interesting can be cast into equations, and not everything can be quanti-
fied or measured in a sensible manner. Physics deals only with those aspects of
the world for which the quantitative method is adequate. It appears, however, that
the realm of physics is expanding, and that the method can now be applied to situ-
ations that had been thought inaccessible earlier.
Counterfactual Thinking in Physics 63
extending our ideas over the domain where observations are technically
possible with current technology, and searching for inconsistencies of vari-
ous aspects in the whole set of our ideas. To set the stage, counterfactual
thinking of several types can be defined here as pondering a problem in
physics by asking the “what if ” question in the sense that the “if ” refers to a
statement believed to be incorrect, outside the range where ideas are thought
to be applicable, or outside the range accessible for observation.
A meteorologist once told me that he would like to stop the revolution
of the earth just to see how the climate would change. This is physically
impossible, but a simulation on a computer, or the examination of the
equations that describe the climate, is possible, although quite challenging.
Of course, meteorologists work with simulations – which are counterfactual,
because the earth is spinning – since they are very curious as to whether their
theories (e.g. no cyclones on a non-spinning world) yield reasonable results,
and they want to test the predictions. In this example we see that counterfac-
tual thinking can, in principle, be used to test theories. The theory is thought
to be valid generally. For a certain situation, when a certain aspect of the situ-
ation is changed, the theory accurately predicts the results of the change:
One can test this prediction in an experiment.
In a sense, every experiment is in itself counterfactual, as in an experiment
an environment is constructed that does not actually exist. When Galileo
performed his experiments on free fall, he tried to eliminate all forces
(except the force we now call gravity) acting on a mass which was dropped
from some height. He was lucky because the drag of air becomes appreci-
able only at rather high speeds and for small masses. Galileo’s experiments
studied the free fall of heavy bodies from a small height. He inferred that all
masses at any speed are accelerated by gravity in the same way. When masses
fall from a given height, they all attain exactly the same speed, and the fall
takes exactly the same time for all masses. His claim was that this would
happen if only gravity worked on the mass in question. However, to perfectly
conduct the experiment, the air around the mass has to be removed in order
to eliminate friction. Galileo could not do this, but now we can: We can
prepare an artificial world to perform the experiment in. In this example we
also see that a rule can be discovered (all masses tested attain the same
acceleration) and extended to situations where the test cannot be performed.
Since the formulation of the rule extrapolates beyond what can be seen in
practice – the “factual” –, the rule itself in its strict formulation is literally
counterfactual.
Counterfactual Thinking in Physics 65
A famous example of the first situation in the list above is Newton’s theory
of gravitation (shown in fig. 1).2 Newton was able to reproduce Kepler’s ob-
servations about the motion of the planets. Kepler found that planets move
along ellipses around the sun, with the sun located in one of the two focuses
of the ellipse. Newton’s theory made a similar prediction: Planets do revolve
along an ellipse, but so does the sun. The figure schematically represents how
the orbits of earth (top left, dark) and sun (right, light) move along ellipses
(the large one is the earth’s, the small one is the sun’s orbit) with a common
center of mass (the black dot inside the sun). A well-defined mathematical
point, the center of mass – not the sun – is at the focus. Because of the enor-
mous difference in masses between the sun and the planets, the sun is very
close to the center of mass. It was quite difficult to confirm this particular
prediction, but we know now that it – and many other predictions which
neither Kepler nor Newton thought of – is correct. The precision of the
measurement of the position of stars is now such that the slight wobble
of those stars having planets can be detected. This way several hundred so-
called exoplanets have been found. In this example, the aspect of counter-
factual thinking lies in the extension of the theory of gravitation beyond ob-
servations technologically possible in Newton’s time.
equations James Clerk Maxwell wrote down for the first time, which is why
they are named after him.3 In fact, Maxwell added only one little piece to
equations that had been found earlier, but this piece turned out to be essen-
tial. To get the flavor of the story some explanation of these equations is
necessary – although it is not necessary to actually write them down.
All matter is the source of gravitational forces and all matter feels these
forces. Additionally, visible matter is the source of a second fundamental
force. Visible matter is composed of atoms which have a very compact nu-
cleus surrounded by electrons. Electrons repel each other as do nuclei, but
nuclei and electrons attract each other. We say, quite arbitrarily, that electrons
carry a negative, and nuclei a positive charge.4 Charges cause two types of
forces to which they also react. There is a force between two charges that
works in a straight line connecting the charges. When the charges are at rest,
this force is called the electrostatic force. A moving charge additionally causes
magnetic forces, which in turn act only on moving charges. In general, both
forces are present, and what is seen as the magnetic and what is seen as the
electric force depends on the observer. The phenomena of electric and mag-
netic forces are so interlinked that the whole subject area is called electro-
magnetism.
Mathematically, these forces are described by an abstract concept called a
field. In figure 2 two types of fields are shown: On the left is a field with field
lines starting at a source (a mass or a charge); on the right is a field with lines
without an end. In this case, a current through a straight wire coming out of
the page (in the center) is surrounded by a magnetic field similar to the one
shown.5 One can envision a field as something which changes the proper-
ties of space everywhere: If there is an electromagnetic field at some point
in space then a charge at this point would feel a force. A familiar field is the
gravitational field at the surface of the earth. I expect that a mass falls down
whenever I let it go. To visualize a field one draws lines parallel to the direc-
tion of the field. In principle, there are two kinds of such lines: those that
begin and end somewhere, and those that do not begin or end anywhere.6
The latter ones form closed loops.
Let me stress that there was no experimental evidence that forced Max-
well to add the extra term. On the contrary, the first reactions of his col-
leagues were skeptical, and, at this time, it was experimentally very difficult
to measure the effects of the extra term. It took until 1887 for Heinrich
Hertz to show that Maxwell’s theory was correct in its description of elec-
tromagnetic waves. Actually, Maxwell’s addition was not based on anything
known: it was a counterfactual theory at that time. The theory is the result of
the critical analysis of the ideas on electric and magnetic forces, which turned
out to be inconsistent.
There is an oddity in the equations in that they are asymmetric. There
is no fundamental reason known to us why ending magnetic field lines do
not exist. To put it differently, there are no magnetic charges. Magnetic poles
only exist in pairs. A piece of iron has a magnetic north pole and a magnetic
south pole. Cutting the iron into pieces does not deliver isolated poles.
Instead all newly created pieces have north and south poles. Maxwell’s
equations provide a neat explanation of this fact: The magnetic fields are the
result of current loops, analogous to a current-carrying wire in a closed loop.
The poles are located on either side of the loop. In this analogy cutting the
loop into pieces must be done along the loop (otherwise the loop would not
be closed anymore). This delivers two thinner loops with two poles. Thus,
the poles will always come in pairs.
If there were magnetic charges, the equations would look like this:
– Ending electric field lines are connected to charges.
– Not ending electric field lines are connected to moving magnetic charges and
to changing magnetic fields.
– Ending magnetic field lines are connected to magnetic charges.
– Not ending magnetic field lines are connected to moving charges and
by changing electric fields.
This is a counterfactual theory. It is deliberately counterfactual. It explores
the consequences of what would change in our world if the theory were dif-
ferent. One of the great physicists who investigated these ([wrong] counter-
factual) equations was Nobel laureate Paul Dirac. He discovered that if there
was one single magnetic charge somewhere in the universe, both electric and
magnetic charges would occur only in lumps. This means that both types of
charge would be composed of elementary charges that could not be divided.
Without magnetic charges Maxwell’s equations say nothing about lumped
charges. But charges do come in lumps. Dirac assumed something counter-
factual, and reached a factual result that had been previously unexplained.
Physicists like magnetic charges because they explain something very basic.
No one has ever found a magnetic charge. Once, in a balloon experiment,
70 Miko Elwenspoek
something was found with the right signature,8 but only once, so it is very
probable that this event was an error.9 Very recently it was found that in a
certain type of material, called spin ice, the magnetic properties are such that
it is as if there were magnetic monopoles.10 However, these are not elemen-
tary particles but the result of the collective behavior of moving electrons.
Magnetic charges play a part in the theory of elementary particle physics,11
and theories beyond the so-called standard model. These theories predict
that magnetic charges were generated in the early universe in enormous
numbers and with large masses (for elementary particles). It would have
been impossible to miss them if they were around, and yet, they do not exist
now. Allan Guth came up with a brilliant idea of a brief period of very fast
expansion in the very early universe that diluted the density of the magnetic
charges so that now there is, maybe, a single magnetic charge in the observ-
able universe. Guth’s idea has other attractive consequences for cosmology
and is, therefore, taken quite seriously. In fact, all the data we have now sup-
ports the idea of a period of extremely fast expansion. This idea is now
known by the name of cosmic inflation. This will be discussed further in the
next section. So Dirac possibly was right: Magnetic charges might exist, but
are just too few to be found.
The properties of space, time, matter, and interaction are combined in the
standard model. There are two aspects of the standard model: One is related
to the types of material particles and their interactions, the other relates to the
beginning and evolution of the universe as a whole. This aspect states that the
universe began 13.8±0.1 billion years ago with the so-called big bang.12 The
picture we have of the big bang is that the universe started with an infinitely
large temperature and density, and experienced a brief period of inflation
during which it expanded by the enormous factor of about 1050. When this
period ended, after a tiny fraction of a second, our currently observable uni-
verse had a size of 1m. The universe continued to expand at a much slower
pace, closer to the rate that is currently observed. In doing so, it cooled down,
and the density of matter decreased. Once it was cool enough, protons
(the positively charged nuclei of the lightest element, hydrogen) and neutrons
(neutral particles a little heavier than protons) formed. From these a certain
amount of helium (the second lightest element) and tiny amounts of heavier
nuclei were synthesized within the first few minutes after the big bang. About
400,000 years after the big bang, the free electrons and nuclei combined to
form atoms of hydrogen and helium. From this time on, the universe was
transparent for light. This light can still be seen as a cosmic microwave back-
ground. It comes from all directions with the same intensity and uniformly
pervades the universe. It has the signature of a body in perfect thermal equi-
librium at a temperature of 3.7°K, with tiny fluctuations in intensity of 1 in
100,000.
A common misunderstanding regarding the big bang is that it was an ex-
plosion that happened at some location and from which material is flying
away. Actually, the big bang theory describes a space filled with matter
and radiation, and it is the space itself that is expanding. A helpful analogy
is the increase of the surface of a balloon or an expanding checkerboard
(see fig. 4).13 The checkerboard’s surface increases by homogeneous and
isotropic stretching.14 The increase does not start from a spot on the surface;
instead, the surface itself becomes larger. While the balloon is inflated at a
constant rate, spots on the surface drift apart with some velocity. The greater
the separation between the spots, the larger is the velocity. Sitting at any spot,
one would see all other spots drift away and it would appear as if one was sit-
ting in the center. This is not the case of course, this is just what is observed,
and one would have this impression from every spot. All but a few spots –
notably the Andromeda nebula – move away from us with a velocity that in-
creases with distance. We are not at the center of the universe.
verse is infinitely old. The steady state idea has generally been abandoned by cos-
mologists because existing observations do not support it, but they do support the
idea of a big bang.
13 The galaxies shown do not expand since their internal parts (stars, dust) are held
together by gravitational forces.
14 “Homogeneous” means “everywhere the same”; “isotropic” means “in all direc-
tions the same”.
72 Miko Elwenspoek
Figure 5: © NASA
parison, our sun has a life span of ca. ten billion years, some thousand times
longer). At the same time, visible matter started to organize into small gal-
axies which grew, over time, into the gigantic galaxies with up to 100 billion
stars that can be seen today.
All elements heavier than hydrogen and helium were formed in stars – we
are literally made of star dust! It took approximately seven billion years
before enough matter made from heavy elements was available for the
formation of planetary systems with rocky planets that were able to sustain
life. In the centers of galaxies this happened earlier; however, galactic centers
are not friendly for living organisms because of high radiation levels and fre-
quent supernovae explosions which sterilize their neighborhood. Only at a
safe distance, such as our current location, organic life is able to develop and
thrive over extended periods of time. It is an interesting fact that life on earth
originated close to the earliest possible cosmic time. The evolution of the
universe is shown schematically in figure 5.15
15 Note the increase in the expansion rate starting about 5 billion years ago.
74 Miko Elwenspoek
V. Dark Energy
A serious hint at the existence of dark energy was found only ten years ago.16
This result was supported a few years later by the measurements of the
Wilkinson Microwave Anisotropy Probe (WMAP ). This probe measured the
tiny ripples in the cosmic background radiation mentioned above. All obser-
vational results seem to support the conclusion that today, about 70 % of all
energy in the universe is dark energy. Dark energy pushes space apart. The
expansion of the universe under the influence of dark matter is exponen-
tial,17 and there is no known mechanism that could halt it.
To fully appreciate the consequences of dark matter, an explanation of a
special type of horizon is needed. On earth, the horizon is the line beyond
which one cannot see due to the curvature of the earth. In cosmology there
is a horizon beyond which one cannot see due to a much more fundamental
reason: The velocity of galaxies seen drifting away from us increases with the
distance. There is no limit to this velocity because the galaxies are not mov-
ing in space, while space itself expands. Thus, at a certain distance, called the
Hubble distance, space and everything in it moves away from us with a vel-
ocity greater than that of light.18 The value for this distance is given by the
velocity of light divided by the expansion rate. A large expansion rate leads
to a small Hubble distance. Only events within the Hubble distance can be
causally related to each other. Beyond this distance, signals characterizing the
event never reach that far. Exponential expansion means that the expansion
rate will increase beyond all limits, therefore the Hubble distance, which is
the diameter of the causally connected space (in other words, the observable
universe) will decrease beyond all limits.
Three effects control the expansion of the universe: dark energy (which is of
no importance for the following point), the global curvature, and the energy
content. These turn out to be balanced in such a way that the curvature in
today’s universe is very small. Astonishingly, it is as if the density of our uni-
verse is fine-tuned so that the expansion is slow enough to allow sufficient
time for the formation of stars and galaxies before they drift apart. This
would not be the case if the density of the universe was a little bit smaller
than it is. On the other hand, if the density was greater, the universe would
have ceased expanding, and instead it would have collapsed to an infinite
density again a long time ago. A schematic representation of the scale of the
19 Cf. John D. Barrow/Frank J. Tippler, The Anthropic Cosmological Principle, New York
1986.
20 Cf. Martin J. Rees, Just Six Numbers: The Deep Forces that Shape the Universe, New York
2000.
76 Miko Elwenspoek
physics would be able to calculate, that Kepler’s laws would be very different.
The orbits would no longer form closed ellipses, and they would be much
more complicated; they would look rather similar to ellipses but such that
the ellipses themselves would rotate in space. In Kepler’s laws the orientation
of the ellipse is fixed in space. In four dimensions, the planets would con-
stantly sweep through each other’s orbits, giving rise to an unstable system.
Similarly, the electrons in atoms would not be in fixed stable orbits. Having
more than four dimensions does not improve this situation.
These equations have also been analyzed for the case that time was two-
dimensional instead of one-dimensional.21 Again, no stable systems would
exist in such a world. We might say: I am, therefore the world has three spa-
tial dimensions and one temporal dimension.
Gravitational forces are much weaker than the other forces. The repulsive
electromagnetic force between two electrons is much, much larger than the
attractive gravitational force.22 Yet, if gravitation was 1,000 times weaker,
21 Cf. Max Tegmark, “Parallel Universes”, in: Scientific American, May 2003, pp. 40–51.
22 The factor between these forces is 1046. No one is able to imagine such a large
number. The ratio of the circumference of the earth to my length is 2x107, 1039
78 Miko Elwenspoek
the light that is created when matter condenses in the star-forming process
would blow the matter apart. If it was stronger by a similar amount, black
holes would form instead of stars: massive objects that release no light since
gravitation is so strong that light cannot escape from the object. The uni-
verse would be dark.
Atomic nuclei are formed by protons and neutrons. There is a delicate bal-
ance of forces; there are attractive nuclear forces and repulsive electromag-
netic forces. This balance leads to the stable elements we have in our world.
One might ask what would happen if the balance of these two forces
changed. The consequences would be disastrous. If the nuclear force was a
tiny amount stronger, two protons would stick together forming the element
helium – without the two neutrons that we have in our real world. Helium is
stable because the two neutrons push the two protons a little bit farther away
from each other. Therefore, the repulsive electromagnetic force is insuffi-
cient to rip the nucleus apart; the neutrons make helium stable. A nuclear
force 1 % stronger is sufficient to stabilize a helium nucleus without neu-
trons. This (not existing) nucleus is called the diproton. Assuming a stronger
nuclear force during the period in which helium was formed, during the first
few minutes, all protons in the universe would have formed diprotons, and
some would have included neutrons. There would be no free protons left.
There would be no hydrogen, and, accordingly, no water, no organic chem-
istry, no life. Helium is an inert, chemically inactive element. On the other
hand, if the nuclear force was a little bit weaker, oxygen, carbon, and all the
heavier elements would be radioactive. There would be an abundance of hy-
drogen but, again, no water, no organic chemistry, no life.
There are many more of these remarkable facts. For example, to form
heavier elements in stars, there must be a collision between three helium
nuclei. This would be a very ineffective process, were there not a so-called
resonance, an energy state of the nucleus of carbon, which makes the triple
collision effective enough for the production of heavy elements. The reson-
ance is a consequence of the value of the forces in the nucleus. The synthesis
times smaller! The ratio of the diameter of the milky way (100,000 light-years)
to my length is 1021, unimaginably large but still vastly smaller. The visible universe
is believed to be as large as 13 billion light years, 1026 times larger than me, still not
close to 1046 – a factor of 1020 to go.
Counterfactual Thinking in Physics 79
of heavy elements would have stopped if this resonance was not at this par-
ticular energy.
Water is so important for life because it is very special compared to other
molecules. This is related to the position of the two protons that form a
triangle together with the oxygen. This special form causes an unusual in-
teraction of water molecules that makes many molecular processes possible
which are essential for life. It is also the cause of the fact that solid water
(ice) is less dense than liquid water, therefore ice drifts on the ocean’s surface.
Otherwise it would sink down to the bottom of the ocean, and most of the
water in the ocean would be frozen.
Counterfactual thinking in physics leads to an amazing conclusion: If one
mapped possible worlds in a space spanned by the values of fundamental
constants, our world would be found on an isolated, tiny island in this space
that supports complex structures. Deviate a little from the constants of
nature, and no complex structures are possible. These universes would be
mostly dark, dull places. The few where light existed would be without inter-
esting chemistry. These universes would either be too small, or they would
exist for too short a time to develop interesting structures, or matter would
be too diluted to form stars leading to an interesting chemistry.
23 Current state of research cannot strictly exclude the existence of other islands
(cf. Alejandro Jenkins/Gilad Perez, “Looking for Life in the Multiverse”, in: Scien-
tific American, January 2010, pp. 42–49).
24 Cf. Leonard Susskind, The Cosmic Landscape: String Theory and the Illusion of Intelligent
Design, New York 2006.
80 Miko Elwenspoek
XII. Conclusions
Counterfactual thinking is one of the physicist’s core tools, and, given its im-
portance in physics, it would be astonishing if this was not the case for all
other natural sciences. The “what if ” question stands at the beginning of
many scientific inquiries. Here, only some highlights in physics have been de-
scribed, but much more mundane examples would have led to the same con-
clusion. In particular, we have seen that in cosmology the “what if ” question
leads to quite remarkable consequences: The laws of nature seem to be fine-
tuned to make possible the existence of life and the existence of observers.
Slight changes in the laws and the values of the basic constants would lead to
uninhabitable universes.
Previous research has shown that counterfactual thinking serves several psy-
chological functions, which may come down to three main ones:1
Affective function. Counterfactuals influence emotional reactions. For
example, after being involved in a car accident, thinking that things might
have gone better (an upward counterfactual) is likely to trigger a negative
emotion such as discomfort or regret. On the contrary, thinking that things
could have gone worse (a downward counterfactual) is likely to trigger a
positive emotion like relief. This seems to be due to a “contrast effect”.2
An outcome, even a negative one, triggers more positive emotions when an
even less desirable outcome is made salient to one’s mind. After a negative
event, people have been shown to spontaneously generate more upward
than downward counterfactuals.3 However, people may also react to the
spontaneous upcoming of upward counterfactuals through intentionally
focusing on the generation of downward counterfactuals. Actually, the
more frequent generation of downward counterfactuals after an unsuc-
cessful outcome has been shown to distinguish optimistic from pessimistic
people.4
Preparatory function. Besides cognitively restructuring the past, counterfac-
tuals “construct” the future, that is, they can favor the preparation of future
actions. Past research has shown that the best way to plan an action consists
in mentally simulating both the process (i.e., the various steps) leading to an
expected goal and the goal itself.5 Similarly, counterfactual thinking is a form
of “post hoc” simulation, including both the process leading to an expected
outcome and the outcome itself. Thus, it may serve as “correction” of an un-
1 Cf. Neal J. Roese, “The Functional Basis of Counterfactual Thinking”, in: Journal
of Personality and Social Psychology, 66/1994, pp. 805–818.
2 Cf. Norbert Schwarz/Herbert Bless, “Constructing Reality and Its Alternatives:
Assimilation and Contrast Effects in Social Judgment”, in: Leonard L. Martin/
Abraham Tesser (eds.), The Construction of Social Judgment, Hillsdale 1992,
pp. 217–245.
3 Cf. Keith D. Markman/Igor Gavanski/Steven J. Sherman/Matthew N. Mc-
Mullen, “The Mental Simulation of Better and Worse Possible Worlds”, in: Journal
of Experimental Social Psychology, 29/1993, pp. 87–109.
4 Cf. Lawrence J. Sanna, “Defensive Pessimism, Optimism, and Simulating Alter-
natives: Some Ups and Downs of Prefactual and Counterfactual Thinking”, in:
Journal of Personality and Social Psychology, 71/1996, pp. 1020–1036.
5 Cf. Lien B. Pham/Shelley E. Taylor, “From Thought to Action: Effects of Pro-
cess- Versus Outcome-Based Mental Simulations on Performance”, in: Personality
and Social Psychology Bulletin, 25/1999, pp. 250–260.
Counterfactuals in the Social Context 83
What elements of the real event are more likely to trigger counterfactuals?
According to the so-called Norm Theory,11 elements perceived as “excep-
tional” are more likely to trigger counterfactual thinking because “normal”
alternatives are easily available to the person’s mind. But what are “normal”
alternatives or, in other words, what norms do we refer to when generating
counterfactuals?
Early research on counterfactual thinking was mainly focused on labora-
tory studies where participants were presented with events in which excep-
tionality consisted in deviation from routine. For example, a man has a car
accident after having changed his usual way back home from work. Faced
with this event, participants were likely to generate the counterfactual “If the
man had followed his usual route home, the accident would not have hap-
pened”.12 More recently, research has been extended to more ecological, that
is, less artificial, scenarios and this has led to focusing attention on how
counterfactuals may be triggered not only by deviations from routine-based
norms, but also by deviations from social-based norms, such as stereotypical
expectations regarding the behavior of people involved in a given event.
Let us consider the following event. A woman who usually goes to work
by train decides to go by car for a change. Her car has a breakdown and she
accepts a lift from a male stranger who afterwards rapes her. Faced with this
event, a juror might generate a counterfactual like: “If only she had taken
the train, things would have been different”. In this case, the reference norm
would be a routine-based or intrapersonal norm. The woman’s behavior is com-
pared with her own standard behavior, and the element showing low consist-
ency with this standard is mutated in the counterfactual.13
However, in a socially embedded context the actor of an event may be
perceived not only as an individual, but also as a member of a given social
category (for example, a woman, an old person, or a gipsy). As a conse-
quence, the actor’s behavior may be compared not only with intrapersonal
norms, but also with social norms triggered by the social category the actor
belongs to. Thus, faced with the same event, our juror might also generate
a counterfactual like: “If only she had not accepted a lift from a stranger,
things would have been different”. In this case, the reference norm would be
a stereotype-based or social norm. The woman’s behavior is compared with the
perceived standard behavior of a (non-raped) woman, which includes not ac-
cepting lifts from strangers.14
Trying to assess what kind of norm violations are more likely to trigger
counterfactuals is especially relevant if we take into account the already men-
tioned relationships between counterfactual thinking, responsibility attribu-
tion, and blame. In the above example, a juror generating the counterfactual
“If the woman had not accepted the lift …” is more likely to perceive her as
responsible for what happened, than a juror who did not generate the same
counterfactual. Some research has indeed demonstrated that reference to
social norms (such as the one of not accepting lifts from strangers) may in-
fluence jurors’ judgments even if these norms do not have any correspon-
dence in legal norms.15
So far, we have made reference to how single individuals generate
counterfactuals, but counterfactuals may also be conveyed through interper-
sonal and public communication. The dynamics underlying counterfactual
communication have not been widely investigated so far.16 Our research on
counterfactuals in political discourse is aimed at deepening our knowledge
of how counterfactuals are employed in communication and what effects
they have on receivers.
Our basic assumption is that, through the evocation of given counterfactual
scenarios a speaker is likely to enhance the salience of given reference norms
to the receiver’s mind, and thus to influence the receiver’s perception of the
real scenarios. For example, through the above-mentioned sentence “Things
would be better if the government had listened to warnings of danger”,
Antonio Di Pietro presumably made the reference norm “a government
should listen to warnings of danger” salient to the mind of the citizens.
Very likely, he also highlighted that the government did not respect a shared
norm, and thus enhanced the likelihood that the government would be held
responsible for the negative outcome of the event the counterfactual re-
ferred to. In other words, through counterfactuals speakers may communi-
cate that shared expectations or reference norms have been violated. In this
way, those norms that might otherwise have gone unnoticed are made more
salient to receivers’ minds. For example, when reconstructing past political
events, politicians may compare the actual events with a variety of possible
alternatives. Very likely, politicians will choose one or another alternative in a
way that may be functional to their discursive goals, among which are the de-
fense of a positive image of themselves and their group as well as the attack
against their adversaries.17 For example, in replying to Di Pietro’s “counter-
factual attack” regarding the government’s inadequate reaction to the danger
of an earthquake, Berlusconi might employ a “counterfactual defense” by
saying something like: “If the opposition had taken care of the interests of
the country, they would have made the government action easier”. In this
way, Berlusconi would suggest that the opposition had violated the widely
shared reference norm according to which “politicians should act in the
interests of the country”, thus shifting the responsibility for the negative
consequences of the event from himself to the opposition.
These two studies offer examples of the theoretical and methodological ap-
proach adopted by social psychology when studying counterfactual thinking
in applied domains.
In the two months preceding the 2006 Italian general elections, we recorded
and fully transcribed a number of televised pre-electoral broadcastings with
Silvio Berlusconi and Romano Prodi as main guests. Transcribed texts were
then analyzed by two independent coders, who looked for the presence of
counterfactuals, in either an explicit or an implicit form. Counterfactuals can
be expressed in discourse explicitly, through the use of hypothetical periods
of unreality. More often, however, counterfactuals are conveyed implicitly,
through linguistic indicators alluding to scenarios that have never occurred
in reality. Linguistic markers of counterfactuals include, among others,
adverbs such as even, at least, without, or besides.20 In our study, implicit counter-
factuals were turned into their explicit form (e.g., the sentence “The euro
was introduced too quickly, without taking the necessary precautions” was
turned into “If the necessary precautions had been taken, things would have
been better”).
After having identified all counterfactuals, the two coders independently
classified them according to a series of criteria, the main ones being listed
below:
a) The speaker who produces the counterfactual, either the incumbent leader
(Silvio Berlusconi) or the challenging leader (Romano Prodi).
b) The target on which the counterfactual antecedent is focused, distin-
guishing among antecedents focused on the government (e.g., “If the govern-
ment had checked more strictly the transformation of prices from lira into
euro, things would have been better”), the opposition (e.g., “If the opposition
had not thwarted the government …”), and others, including political actors
and events of the national or international political/economic context (e.g.,
“If the terrorist attacks of September 11 had not happened …”).
Starting from the results of qualitative studies such as the one presented
above, we have designed a series of experimental studies in which different
versions of fictitious political discourses and interviews are submitted to dif-
ferent groups of participants. These texts vary as to the types of counter-
factuals embedded in them in order to assess whether and how far the use of
specific counterfactuals may influence the receivers’ perception of the politi-
cian employing them. It may well be the case that some types of counter-
factuals are more effective than others. Besides, counterfactuals may have a
differential influence according to some characteristics of the citizens who
are exposed to them, such as their degree of political sophistication or their
sharing or not sharing the ideology of the speaker.
It should be mentioned that an in-depth investigation of the effects of
counterfactual communication requires a number of experimental studies,
because only a few independent variables may be taken into account in one
single study at the same time. In one of these studies, 203 university students
were presented with an excerpt of an interview of a hypothetical incumbent
politician. In the interview, a journalist told the politician that he had not
92 Patrizia Catellani
done enough to improve the bad financial conditions of the country. The in-
tervention of the journalist ended with the following sentence: “Voters are
skeptical regarding your intervention on public expenses. Many of them
think you could have done much more”. In his reply, the politician employed
some counterfactuals, such as: “Surely, the situation would be better if I had
firmly stated my position within the coalition and if I had insisted in putting
forward my ideas”. Four different versions of the politician’s reply were pre-
pared, differing according to the target (either the politician or the adversary)
and the direction (either upward or downward) of the counterfactuals em-
bedded in it. For example, the sentences quoted above are examples of politi-
cian-focused upward counterfactuals. But we also employed examples of politi-
cian-focused downward counterfactuals such as: “Sure, but the situation would
be worse if I had hesitated in stating my position within the coalition and if
I had given up my ideas”. The remaining versions included adversary-focused
upward counterfactuals and adversary-focused downward counterfactuals. The
four versions of the politician’s reply were submitted to four different sub-
groups of participants.
After reading the interview, all participants were asked to evaluate the
politician according to a number of criteria in order to assess whether the
evaluations would vary as a function of the characteristics of the counterfac-
tuals employed by the politician. First of all, participants rated the politician
with regard to a series of personality traits that past research has shown to
be crucial in voters’ judgment of political leaders. These traits can be linked
back to two larger dimensions: the leadership dimension, measured through
personality traits such as “dynamic”, “energetic”, and “decided”, and the
morality dimension, measured through personality traits such as “honest”,
“loyal”, and “sincere”. Results showed that participants attributed differ-
ent traits to the politician according to the different characteristics of the
counterfactuals he employed in his reply to the journalist. The politician
employing downward counterfactuals having himself as target (“the situation
would be worse if I …”) was perceived as more energetic than the politician
employing upward counterfactuals still having himself as target (“the situation
would be better if I …”). In both cases the politician was perceived as fairly
moral. By contrast, the politician was perceived as less moral, but still very
energetic, when he employed upward counterfactuals having the opposition
as target (“the situation would be better if they …”).
Overall, evaluations of the politician were significantly more positive
when the politician’s reply included downward counterfactuals than when
it included upward counterfactuals. These results suggest that politicians,
when required to account for a negative outcome of their performance, may
Counterfactuals in the Social Context 93
VI. Conclusion
Our studies have demonstrated that politicians make wide use of counterfac-
tuals in their discourses. When publicly accounting for political events and
their performance, politicians focus not only on what they, or other political
actors, actually did, but also on what they (or others) could/should (or could
not/should not) have done. These comparisons between reality and its pos-
sible alternatives are made in a way that is consistent with politicians’ discur-
sive goals, specifically the ones of presenting a positive image of themselves
94 Patrizia Catellani
and their party and a negative image of their adversaries. Accordingly, oppos-
ing politicians differ with regard to the characteristics of the counterfactuals
they employ in discourse, mainly in terms of their target, direction, and con-
trollability.
Our studies have also shown that counterfactual communication may in-
fluence citizens’ perception of politicians. Some types of counterfactuals
seem to be more powerful than others in influencing the citizen in favor of
the politician who is using them, while the positive influence of other types
of counterfactuals seems to vary according to some characteristics of the
citizens, first of all their degree of political sophistication. Interestingly,
exposure to counterfactual communication has been shown to influence not
only citizens’ evaluations of what the politician did in the past, but also their
expectations regarding what the politician may do in the future. This is con-
sistent with the fact that counterfactual thinking has been shown to serve
not only the psychological function of explaining the past but also that of
preparing the future.
What is left to do in order to fully investigate the effects of counterfactual
communication in the political context? Quite a lot. For example, exploring
of the effects of counterfactuals employed by journalists when they inter-
view politicians. Nowadays, interviews are the most frequent form through
which politicians communicate with citizens, and what journalists say or ask
(including their use of counterfactuals) is very likely to influence citizens’
perceptions of both the journalist and the politician.
We hope this line of research on counterfactual communication in politics
may turn out to be useful both on a scientific and on a more applied level. On
a scientific level, it might help our understanding of how counterfactuals are
conveyed in discourse, as well as of their effects on people who are exposed
to them. On a more applied level, it might help politicians, but also citizens,
to become more aware of the dynamics underlying political communication,
and thus to develop a critical consciousness about it.
A Cognitive Linguistic Perspective on Counterfactuality 95
I. Introduction
This paper aims to broaden the scope of the present volume by providing a
cognitive linguistic perspective on counterfactuality. The commonplace idea
of counterfactuality as meaning that is contrary to fact may serve as an initial
working model that captures important central aspects but ultimately falls
short of a satisfactory characterization. It will therefore be re-cast in theor-
etical concepts of cognitive linguistics, in particular the concept of concep-
tual integration.1 The main objective of this paper is to show that counter-
factuality is not an exotic or rare mode of human thought. As will be
explained and exemplified, the cognitive processes that underlie counterfac-
tual reasoning are not only at work in linguistic expressions of counterfac-
tuality, but also in a multitude of other expressions.
This paper is organized as follows: Section II provides the background
for the analysis of counterfactuality by outlining central notions and research
areas of cognitive linguistics. Section III builds on that initial excursus and
explains how cognitive linguistics approaches the issue of counterfactuality.
Section IV concludes by offering a few thoughts on the question how the
proposed approach can contribute to the study of counterfactuality in disci-
plines other than linguistics.
1 Cf. Gilles Fauconnier/Mark Turner, The Way We Think: Conceptual Blending and the
Mind’s Hidden Complexities, New York 2002.
2 Cf. George Lakoff/Mark Johnson, Metaphors We Live By, Chicago 1980.
96 Martin Hilpert
contrast, in “I taught Old English to the students”, the words “taught” and
“students” are further apart, and the sentence allows the interpretation that
little or no learning took place. Another important cognitive capacity that
permeates many linguistic structures is metaphorical reasoning.5 The general
point of conceptual metaphor theory is that abstract issues are understood
and talked about in terms of things that can be experienced directly through
the body. It is thus no coincidence that for instance technological inno-
vations are talked about in terms of simple spatial language: People download
files from a server, have trouble getting into their email accounts, leave one
webpage to go to another, and log out once they are done. The systematicity of
these expressions is pervasive; the evidence suggests that the internet is not
only talked about in terms of space, it is in fact comprehended as a spatially
organized entity. A third fundamental characteristic of human cognition
is its embodied nature – how we perceive and describe the world is greatly
influenced by the way our bodies function within their physical and social
environments.6 There is now substantial empirical evidence that the activity
of producing and understanding language draws on the same cognitive re-
sources that are responsible for perception,7 motion,8 social cognition,9 and
emotion.10 For example, when participants are asked to make a sad face dur-
ing an experiment, they will be faster to process and understand language ex-
pressing sad events.
5 Cf. George Lakoff/Mark Johnson, Philosophy In The Flesh: The Embodied Mind and
its Challenge to Western Thought, New York 1999; Lakoff/Johnson, Metaphors, p. 22,
passim.
6 Cf. Andy Clark, Supersizing the Mind, Oxford 2008.
7 Cf. Rolf A. Zwaan/Robert A. Stanfield/Richard H. Yaxley, “Language Compre-
henders Mentally Represent the Shapes of Objects”, in: Psychological Science,
13/2002, pp. 168–171.
8 Cf. Benjamin Bergen/Kathryn Wheeler, “Sentence Understanding Engages
Motor Processes”, in: Proceedings of the 27th Annual Meeting of the Cognitive Science So-
ciety, 2005, pp. 238–243.
9 Cf. Laura Staum Casasanto, “Does Social Information Influence Sentence Pro-
cessing?”, in: Proceedings of the 30th Annual Meeting of the Cognitive Science Society, 2008,
pp. 799–804.
10 Cf. David Havas/Arthur M. Glenberg/Mike Rinck, “Emotion Simulation Dur-
ing Language Comprehension”, in: Psychonomic Bulletin & Review, 14/2007,
pp. 436–441.
98 Martin Hilpert
11 Cf. Donald Davidson, “Truth and Meaning”, in: Synthese, 17/1967, pp. 304–323.
12 Cf. Charles J. Fillmore, “Frame Semantics and the Nature of Language”, in: Annals
of the New York Academy of Sciences: Conference on the Origin and Development of Language
and Speech, 280/1976, pp. 20–32; and Fillmore, “Frame Semantics”, in: Linguistics
in the Morning Calm, edited by The Linguistic Society of Korea, Seoul 1982,
pp. 111–137.
A Cognitive Linguistic Perspective on Counterfactuality 99
tences is that the auxiliary follows the subject (1a), a number of construc-
tions invert this sequence (1b–e).
Figure 1: Which of the two bottom squares matches the square at the top? © Martin Hilpert
jects had to decide which one of two bottom squares matched the square
at the top. Reaction times were measured. Interestingly, a behavioral differ-
ence emerged between speakers of English and speakers of Russian. Russian
speakers were slower to respond when the two bottom squares were from
the same linguistic category, i.e. both were either “light blue” or “dark blue”.
No such difference was found for the English speakers, who lack such a lin-
guistic distinction. This finding suggests that language actually does have an
effect on non-linguistic thought. Current work thus goes beyond the debate
whether or not the Sapir-Whorf hypothesis holds true. Instead, it investi-
gates in what areas of cognition and to what extent it holds true.
Moving on to another issue, a cognitive approach to linguistic structures
must have an account of how language is processed and understood. Draw-
ing on the theory of perceptual symbols,16 current work in cognitive linguistics
subscribes to the notion that language is understood through mental simu-
lation. A simple sentence such as “I missed the bus this morning” prompts the
hearer to mentally simulate such a scenario, envisioning a departing bus, em-
pathizing with the person left behind, considering consequences of arriving
late at work, etc. This imaginative activity is not the result of understanding;
rather it is thought to be the act of understanding itself. Again, there is recent
16 Cf. Lawrence W. Barsalou, “Perceptual Symbol Systems”, in: Behavioral and Brain
Sciences, 22/1999, pp. 577–609; Barsalou, “Simulation, Situated Conceptualization,
and Prediction”, in: Philosophical Transactions of the Royal Society of London: Biological
Sciences, 364/2009, pp. 1281–1289.
A Cognitive Linguistic Perspective on Counterfactuality 101
ment towards the body, as in the “yes is near” condition. Compatibility be-
tween the meaning of a test sentence and the required action thus had a fa-
cilitating effect. This result receives a natural explanation in a model of
language understanding that is based on the notions of mental simulation
and embodiment of meaning.
Before the next section finally gets to the topic of counterfactuality, a few
remarks are in order on the relation of language use to other human cognitive
capacities. Whereas the program of generative linguistics was founded on the
premise that language was a cognitive module that could be analyzed without
reference to other systems, cognitive linguistics seeks to understand language
as a human behavior that is rooted in a number of other capacities that
closely interact with it. An example of such a system would be gesture,18
which is ubiquitous in spoken-language interaction and which human beings
share as a communicative strategy with certain primates.19 Another crucial
ability concerns the so-called phenomenon of joint attention,20 in which two
communicative partners engage in a triadic unit with a shared object of atten-
tion. Of course, language would not be possible without the ability to use and
interpret symbols,21 which allow human beings to communicate aspects of
the world that are not present in the immediate environment. Lastly, language
affords the communication of completely new and original states of affairs. A
finite set of linguistic structures can be rearranged and combined in a multi-
tude of ways. The ability to understand familiar items in new contexts draws
on a process known as conceptual integration,22 which is of central import-
ance to the discussion of counterfactuality that follows in the next sections.
A sentence such as “Had I been at the bus stop five minutes earlier, I would
not have missed the bus” is understood by mentally simulating the scenario
of being on time and experiencing the discrepancy by overlaying that scen-
ario with the actual state of affairs. It is explained in more detail below how
exactly this overlay can be modeled. While conceptual integration is a cogni-
tive capacity that is crucial for the comprehension of counterfactual condi-
tionals, it actually runs much deeper than that. Conceptual integration is not
only at work in counterfactual thought, but it permeates many aspects of
human cognition and numerous domains of grammar. In order to give some
substance to this argument, the next paragraphs briefly outline several the-
oretical notions that are of importance in conceptual integration, specifically
input spaces and blends, vital relations and compression, selective projec-
tion, and emergent structures.
In (2a), the two frames that are overlaid are American politics and German
politics. These frames exhibit some differences, such as the respective roles of
president and chancellor, but on the whole, they afford a far-reaching analogy.
By contrast, many examples of conceptual integration involve the overlay of
highly dissimilar frames, as in (2b), which integrates the frames of American
politics and seafaring. Example (2c) is similarly creative, blending air traffic
with computer operating systems. Pre-existing similarities between frames is
thus not a precondition for conceptual integration – human beings are cre-
ative enough to see connections between the most disparate of things. On
the other hand, conceptual integration is also applied to near-identical frames
such as being at the bus stop on time and being there five minutes too late.
104 Martin Hilpert
Selective Projection
The principle of selective projection captures the insight that not all el-
ements of an input frame necessarily have counterparts in the second input
frame, and that not all elements of the input frames are projected into the
blend. To take the Clinton-Titanic example, some frame elements clearly
correspond to one another across the two input frames and also represent an
identifiable element of the blend. Bill Clinton corresponds to the Titanic;
a political career corresponds to a sea journey; a scandal corresponds to a
grave accident that may put an end to the sea journey. These elements are
mapped into a blend that unites aspects of political life and sea voyaging in
a new and creative way. An important characteristic of this creativity is the
fact that many aspects of both input frames are not mapped into the blend.
The Titanic frame centrally involves passengers, but these do not directly
correspond to any aspect of American politics, and they do not matter in
the blend. The Clinton-Titanic does not have any passengers. Conversely,
the process of impeachment is not easily mapped onto the Titanic frame.
Correspondences for these frame elements may be conjured up, but such
mappings are not strictly necessary to get the point of the blend. These
asymmetries show that conceptual integration is a creative, non-determin-
istic act. It exhibits a number of regularities and recurring tendencies, but
there can be no algorithm that would automatically derive the meaning of a
certain blend from the characteristics of its input frames.
Emergent Structure
The term “emergent structure” describes a notion that is converse to the
idea of selective projection. While not all aspects of the input frames are
mapped into the blend, it is also true that the blend contains meanings that
are not present in either of the input frames. Emergent structure in blends
thus shows that conceptual integration is not the mere addition of two input
spaces: There is a surplus of meaning. The above-mentioned example “Bob’s
girlfriends get younger and younger” illustrates this. Neither of Bob’s girl-
friends is actually getting any younger; this meaning only arises as an emerg-
ent property of the blend, which compresses girlfriends of different ages
into a single entity. Another example that carries emergent meaning is the ut-
terance “I’d really like to meet myself at age 90”. This sentence evokes the
science-fiction scenario of time travel. The conceptual integration of present
and future has the emergent property that the time traveler can have a con-
versation with his older self: The “same” person corresponds to two dif-
106 Martin Hilpert
Negation
The first grammatical category to be discussed here is negation, which is il-
lustrated by the example in (4):
“I see nobody on the road”, said Alice. “I only wish I had such eyes”, the King
remarked, in a fretful tone. “To be able to see Nobody. And at that distance, too!
Why, it’s as much as _I_ can do to see real people, by this light!”25
In this passage from Lewis Carroll’s Through the Looking Glass, the King’s re-
mark exposes the cognitive complexity that underlies a simple grammatical
category such as negation. A full understanding of Alice’s utterance involves
the conceptual overlay of two scenes that differ in the presence or absence
25 Lewis Carroll, Alice’s Adventures in Wonderland and Through the Looking Glass [Oxford
World’s Classics], Oxford 2009 [1871], pp. 198–199.
A Cognitive Linguistic Perspective on Counterfactuality 107
Modality
A further grammatical domain that requires conceptual integration is the
category of modality, which encodes meanings such as obligation, possibil-
ity, or necessity. In English, a grammatically distinct set of modal auxiliaries
expresses these meanings. Each of the examples in (5) evokes two conflict-
ing scenarios, requiring the hearer to simulate the consequences of integrat-
ing the two.
Example (5a) evokes a counterfactual scenario in which the speaker does not
have any prior commitments. The second scenario is the actual situation, in
which the addressee’s talk is taking place. In the blend, the speaker’s counter-
factual alter ego is able to attend the talk. Similarly, examples (5b) and (5c)
evoke two almost parallel scenarios. The hearer is prompted to imagine an al-
ternative scenario, in which a single event (scoring a goal, being pronounced
husband and wife) leads to a complete transformation of the situation that
holds at the moment of speech.
Causation
In many of the world’s languages, the meaning of causation is expressed by
grammatical means, for instance by a suffix that attaches to a verb. English
does not have such structures, but it has a number of constructions with
verbs such as “cause” that serve the same purpose. The examples in (6) illus-
trate some of these.
The ability to reason about cause and effect crucially draws on conceptual
integration. Understanding the cause of an event always involves the mental
108 Martin Hilpert
Clause Linkage
Complex sentences can be simply juxtaposed, but speakers can also resort
to a range of grammatical structures that indicate the semantic relation that
holds between the components of such sentences. English has a set of con-
junctions that have the grammatical function of clause linkage, and, at the
same time, the semantic function of guiding the conceptual integration of
the frames that are described in each clause. The relations between events are
manifold; the examples in (7) offer some illustrations.
(7a) Bob speaks excellent French, even though he has never lived there.
(7b) You only won because you cheated.
(7c) I will believe that when I see the evidence.
In concessive clause linkage (7a), the speaker expresses the opinion that the
co-occurrence of two frames is in some way surprising. The example thus
invites the hearer to consider the French proficiency of other people who
have not lived in France and to draw a comparison with Bob. The speaker
states that in a mental simulation of this kind, Bob compares rather well.
Example (7b) is another illustration of grammatically expressed causation,
which was discussed above. The third example prompts the conceptual inte-
gration of two scenarios that differ in time and in the presence of visible evi-
dence.
Attributive Constructions
In English, nouns can be modified through a preceding attributive adjective,
as in “a red apple”. Since adjectives encode properties, it is not surprising
that such attributive constructions are understood as assigning a property to
the referent of the head noun. This could be thought to happen in a strictly
compositional fashion: “a red apple” is simply a red thing that is an apple.
The examples in (8) show that the conceptual integration of nouns and their
attributive adjectives can be much more complex.
A Cognitive Linguistic Perspective on Counterfactuality 109
To single out just one of these examples for discussion, the expression “a
safe distance” blends a frame of danger with a second frame of physical lo-
cation, denoting the distance between a protagonist and a source of danger
at which the protagonist is no longer at risk. Understanding the example
necessitates the counterfactual simulation of what would happen to the pro-
tagonist were he any closer to the source of danger.
Compounding
Categories such as negation, causation, adjectival modification, or clause
linkage reflect relatively complex grammatical matters. It is important to
realize that conceptual integration is also at work in comparatively simpler
linguistic structures, for instance in lexical formations. The phenomenon of
noun-noun compounding serves as an example. Given a compound such as
“apple juice”, the resultant meaning is derivable from a simple algorithm
such as “A compound XY denotes a kind of Y that is made out of X”.
This nicely accounts for examples such as “leather jacket”, “chocolate bar”,
or “gold watch”. However, many compounds follow different patterns, as a
comparison of “steel knife” and “butter knife” reveals. The expectable point
made here is that the meaning of any given compound is the result of con-
ceptual integration. The examples below show three particularly ingenious
blends.
Interim Summary
The preceding sections have presented a cognitive linguistic perspective on
counterfactuality. The mainstay of such a perspective is that conceptual inte-
gration makes use of general input frames and that counterfactual scenarios
result from the fusion of several input frames into a blend. The process of
blending involves the compression of vital relations such as time, space, iden-
tity, role, and others. The elements that are projected into the blend are drawn
from the input frames in a selective, non-deterministic fashion. By the same
token, conceptual integration creates emergent structure in the blend, which
conveys meaning that is not present in either of the input spaces. The brief
survey of grammatical categories has made the case that conceptual inte-
gration and counterfactual reasoning are not limited to counterfactual condi-
tional clauses. Instead, these processes extend over a multitude of grammati-
cal domains and even permeate the meaning of lexical elements. Importantly,
Fauconnier and Turner do not see conceptual integration as a cognitive pro-
cess that is only at work in language.26 They make the much stronger claim
that conceptual integration, and hence the ability to reason counterfactually,
is a prerequisite for human culture. This claim will not be further explored
here; instead, the next section briefly explores the implications of conceptual
integration for work on counterfactuality in disciplines other than linguistics.
I. Introduction
The historian […] must always maintain towards his subject an indeterminist
point of view. He must constantly put himself at a point in the past at which the
known factors still seem to permit different outcomes. If he speaks of Salamis,
then it must be as if the Persians might still win […]. Only by continually recog-
nizing that possibilities are unlimited can the historian do justice to the fullness of
life.1
Though counterfactuals come easily to historians of science, they have only
recently started to systematically reflect on the epistemic status and analyti-
cal uses of counterfactuals.2 Thus it does not come as a surprise that the
history of systematic reflection on counterfactuals has been largely neglected
as well – although the subject has been touched on in accounts of the history
of causality and the history of experiments in particular. The attention
focused on the natural sciences and on how they modeled reality by means of
counterfactual thought experiments in order to gain knowledge about natu-
ral laws and to design real experiments, whilst the humanities seemed to be
of lesser interest since they followed the approach of verstehen and therefore
put less emphasis on causal explanations of reality.3 Yet, in historiography,
the question of reality and its representations seems to be particularly chall-
enging. Unaided by direct observation or experimental tests, historians face
a range of epistemological and methodological problems when trying to
1 Johan Huizinga, “The Idea of History”, in: Fritz Stern (ed.), The Varieties of History:
From Voltaire to the Present, New York 1973, pp. 290–303, p. 292. All translations are
mine except where indicated otherwise. I would like to thank Florian Ernst for
corrections and bibliographical assistance.
2 See the special edition of Isis (99/2008) on Counterfactuals and the Historian of Science,
ed. by Gregory Radick.
3 The German term “Wissenschaft” applies to the natural sciences and the hu-
manities. In this article, the term “science” is generally used as a translation for
“Wissenschaft”, while “natural science” is equivalent to the German “Naturwis-
senschaft”.
Scientific Uses of Counterfactual Thought Experiments 113
4 Johan Huizinga, “The Task of Cultural History”, in: Men and Ideas: Essays by Johan
Huizinga, trans. James S. Holmes/Hans van Marie, New York 1959, pp. 17–76,
p. 39.
5 For recent positions challenging this opposition, cf. Uljana Feest (ed.), Historical
Perspectives on ‘Erklären’ and ‘Verstehen’, Vienna/New York 2009.
6 Cf. Ulrich Kühne, Die Methode des Gedankenexperiments, Frankfurt a.M. 2005.
114 Bernhard Kleeberg
Thought experiments are not real experiments. They merge the mental de-
sign of experiments with the procedure of conducting them, hence their hy-
potheses cannot be validated independently – there is no external resistance
that has impact on their operational sequence and could thus open up the
7 Cf. Max Weber, “Critical Studies in the Logic of the Cultural Sciences”, in: Weber,
The Methodology of the Social Sciences, trans. and ed. Edward A. Shils/Henry A. Finch,
Chicago, IL , 1949, pp. 113–188; Ernst Mach, “Über Gedankenexperimente”, in:
Mach, Erkenntnis und Irrtum: Skizze zur Psychologie der Forschung, 2nd ed., Leipzig
1906, pp. 181–197, in part trans. W.O. Price/S. Krimsky, “On Thought Experi-
ments. A Translation and Adaptation of a Work by Ernst Mach (1897) ‘Über Ge-
dankenexperimente’”, in: Philosophical Forum, 4/1973, pp. 446–457.
Scientific Uses of Counterfactual Thought Experiments 115
8 Cf. Ian Hacking, The Social Construction of What?, Cambridge, MA , 2000, pp. 71–72.
9 Elke Brendel, “Intuition Pumps and the Proper Use of Thought Experiments”,
in: Dialectica, 58/2004, pp. 89–108; Mary S. Morgan, “Models, Stories and the
Economic World”, in: Journal of Economic Methodology, 8/2001, pp. 361–384, p. 363.
Similar to historical narratives, these thought experiments lead towards a specific
ending (cf. Hayden White, The Content of the Form: Narrative Discourse and Historical
Representation, Baltimore/London 1987).
10 Cf. Yehuda Elkana, “Das Experiment als Begriff zweiter Ordnung”, in: Rechts-
historisches Journal, 7/1988, pp. 244–271, p. 249; Niklas Luhmann, Die Wissenschaft
der Gesellschaft, Frankfurt a.M. 1998, pp. 75–87; Marcus Krause/Nicolas Pethes,
“Zwischen Erfahrung und Möglichkeit: Literarische Experimentalkulturen im
19. Jahrhundert”, in: Krause/Pethes (eds.), Literarische Experimentalkulturen: Poeto-
logien des Experiments im 19. Jahrhundert, Würzburg 2005, pp. 7–18, p. 14. According
to Max Weber, scientific objects, their observer, and the concepts permanently
change, rendering it necessary to permanently rethink them (cf. John Drysdale,
“How are Social-Scientific Concepts Formed? A Reconstruction of Max Weber’s
Theory of Concept Formation”, in: Sociological Theory, 14/1996, pp. 71–88, p. 75).
116 Bernhard Kleeberg
presented the practice of experimenting and its method of variation as basic anthro-
pological operations. Human beings could either gain experience by pas-
sively observing changes in their environment, or by actively manipulating
their environment, which helped to acquire highly reliable, though vague,
perceptions of reality.11 Finally, experiments directed by reason arranged
settings in reflection of everyday experiences: They intentionally extended
experiences, provided far more details, and thus formed the basis of science.
Still, in order to link detailed experimental knowledge to experiential cer-
tainty, a third kind of experiment was required – thought experiments.12
Thought experiments helped to evaluate expectations about the effects of
specific events or actions without actually carrying them out. Based on a
special kind of experience, “thought experience”, imagination thus could help
form assumptions of the consequences of certain conditions.
The main uses of thought experiments hence lay in “saving” experience
by varying causes and circumstances in a simulated counterfactual scenario
in order to determine significance and facilitate abstraction. Due to their
relatively low cost, as compared to physical experiments, they constituted a
necessary precondition for designing experimental settings and averting fail-
ures. Sometimes a thought experiment could even be so decisive and defini-
tive that the experimenter would not render it necessary to verify it by physi-
cal experiments,13 and even if it did not provide a decisive solution, it could
serve as a starting point and support an educated guess about factors rel-
evant for the real experiment.14 Thus, thought experiments meet Mach’s
ideal of parsimonious cognitive effort – “economy of thought” (“Denköko-
nomie”):
It is the object of science to replace, or save, experiences, by the reproduction and
anticipation of facts in thought. Memory is handier than experience, and often
answers the same purpose. This economical office of science, which fills its whole
life, is apparent at first glance; and with its full recognition all mysticism in science
disappears.15
ages of facts”;20 and the more exact our memory of the “involuntary images
of facts” is, the higher are the chances to become aware of details that es-
caped our attention during direct observation, and thus to discover some-
thing new.21
By interfering with the habitual frames of reasoning, Mach argued, invol-
untary variation of details helped create knowledge, especially if triggered
by paradoxes.22 Used in retrospect, successful counterfactual thought experi-
ments initially depended on diligent direct observation that led to an exact
image of facts, but the power of imagination was able to call attention to ne-
glected facts and circumstances, and thus allowed for shifts in perspective:
In the reproduction of facts in thought, we never reproduce the facts in full, but
only that side of them which is important to us, moved to this directly or indirectly
by a practical interest. Our reproductions are invariably abstractions. Here again is
an economical tendency.23
Although Mach did not relate thought experiments to historical investi-
gation, his ideas on significance, abstraction, and perspectivity help to shed
light on the epistemic status of counterfactuals in historiography – even
though he himself placed emphasis on the contingent nature of historical de-
velopments:
The historical investigation of the development of a science is most needful, lest
the principles treasured up in it become a system of half-understood prescripts, or
worse, a system of prejudices. Historical investigation not only promotes the under-
standing of that which now is, but also brings new possibilities before us, by show-
ing that which exists to be in great measure conventional and accidental.24
25 John Stuart Mill, A System of Logic Ratiocinative and Inductive, Being a Connected View of
the Principles of Evidence and the Methods of Scientific Investigation, in: J.M. Robson (ed.),
Collected Works of John Stuart Mill, Vol. 7, Toronto/London 1974, III , 3, qtd. in
Michael Heidelberger, “From Mill via von Kries to Max Weber: Causality, Ex-
planation, and Understanding”, in: Feest (ed.), Historical Perspectives, pp. 241–266,
p. 246. Cf. also Heidelberger, “Origins of the Logical Theory of Probability: von
Kries, Wittgenstein, Waismann”, in: International Studies in the Philosophy of Science,
15/2001, pp. 177–188; Weyma Lübbe, “Die Fäden im Gewebe der Natur:
Determinismus und Probabilismus in der Kausalitätstheorie John Stuart Mills”,
in: Zeitschrift für philosophische Forschung, 47/1993, pp. 370–387; Lübbe, “Die Theo-
rie der adäquaten Verursachung: Zum Verhältnis von philosophischem und juris-
tischem Kausalitätsbegriff ”, in: Zeitschrift für allgemeine Wissenschaftstheorie, 24/1993,
pp. 87–102, pp. 89–93.
26 Cf. John Stuart Mill, “On the Definition of Political Economy”, in: Collected Works,
Vol. 4, Toronto 1967, pp. 321–327; Heidelberger, “From Mill”, pp. 244–248; Mary
S. Morgan, “Economic Man as Model Man: Ideal Types, Idealization and Carica-
tures”, in: Journal of the History of Economic Thought, 28/2006, pp. 1–27.
120 Bernhard Kleeberg
cant historical events: Meyer complained that this judgment simply reflected
different interests.32 For Rickert, this is precisely the point: He argues that
the content of historical representations changed with the leading cultural
values the historian relates to. A change in perspective led to different judg-
ments on the historical significance of past events or actions – and accord-
ingly to different stories told. Yet, this did neither imply that history was a
teleological process, as presumed by philosophers of history, nor that his-
toriography did not deal with causality. On the contrary,
[a]n individualizing and value-relating historiography has to analyze the causal
relations between the unique and individual processes it is concerned with, insofar
as they do not coincide with general laws of nature, even though the portrayal of
individual causal relations needs general terms as integral elements of historical concepts.33
The representation of historical development depended on the selection of
significant historical facts and causes: Any incident had an effect, but only
those incidents with significant value-related effects, to which we connect
a meaning we can understand, were of historical efficacy.34 Any “real historical
account” reflected not on general causes and causal laws, but only on indi-
vidual causes for individual effects.35 Unfortunately, Rickert keeps us in the
dark about the concrete techniques for distinguishing between significant and
insignificant causes, relating them to transcendental values, as Daniel Šuber has
argued.36
It was Max Weber who provided a practical solution to this problem, one
that helped to differentiate between significant and arbitrary causal chains
by establishing characteristic phenomena by means of selection and inte-
gration.37 In order to do so, Weber drew upon Gustav Radbruch’s concept of
“objective potentiality” and Johannes von Kries’s “theory of adequate cau-
sation”, as Weyma Lübbe, Fritz Ringer, and most recently Šuber and Hei-
32 Cf. Eduard Meyer, Zur Theorie und Methodik der Geschichte: Geschichtsphilosophische
Untersuchungen, Halle 1902, qtd. in: Rickert, Kulturwissenschaft, p. 89; Rickert, Die
Grenzen der naturwissenschaftlichen Begriffsbildung, 2nd ed., Tübingen 1913, pp. 290–291,
370–371, 424–425.
33 Rickert, Kulturwissenschaft, p. 92, my italics.
34 Cf. Rickert, Kulturwissenschaft, p. 93.
35 Rickert, Grenzen, p. 377.
36 Cf. Daniel Šuber, “Social Science Between Neo-Kantianism and Philosophy of
Life: The Cases of Weber, Simmel, and Mannheim”, in: Feest (ed.), Historical Per-
spectives, pp. 267–290, p. 279.
37 Max Weber, “Roscher und Knies und die logischen Probleme der historischen
Nationalökonomie (1903–05)”, in: Johannes Winckelmann (ed.), Gesammelte Auf-
sätze zur Wissenschaftslehre, Tübingen 1922, pp. 1–145, p. 5.
122 Bernhard Kleeberg
38 Cf. Lübbe, “Theorie der adäquaten Verursachung”; Fritz Ringer, “Max Weber on
Causal Analysis, Interpretation, and Comparison”, in: History and Theory, 41/2002,
pp. 163–178, pp. 163–164; Heidelberger, “From Mill”.
39 Šuber, “Social Science”, p. 279.
40 Šuber, “Social Science”, p. 279.
41 Heidelberger, “From Mill”, p. 241; cf. also Šuber, “Social Science”, p. 2; Ringer,
“Max Weber”, p. 163.
42 Cf. Max Weber, “‘Objectivity’ in Social Science and Social Policy”, in: The Method-
ology of the Social Sciences, trans. and ed. Edward A. Shils/Henry A. Finch, New York
1949, pp. 50–112, p. 72.
43 Weber, “Logic of the Cultural Sciences”, p. 138. The term “historical reality” is, of
course, highly problematic. My argument only assumes that historiography claims
epistemic validity as a system of reference that relies on something outside histori-
ography itself. That is, historical narratives can be considered a specific – factual –
mode of narration, a récit factual (Gérard Genette). Cf. Christian Klein/Matías Mar-
tínez, “Introduction”, in: Klein/Martínez (eds.), Wirklichkeitserzählungen: Felder,
Formen und Funktionen nicht-literarischen Erzählens, Stuttgart 2009, pp. 1–13, p. 2.
Scientific Uses of Counterfactual Thought Experiments 123
64 Weber, “Logic of the Cultural Sciences”, p. 176; cf. Ringer, “Max Weber”, p. 174.
65 Weber, “Logic of the Cultural Sciences”, p. 176.
66 Weber, “Logic of the Cultural Sciences”, pp. 176–177; my italics.
67 Cf. Wolfgang Krohn, “Deliberative Constructivism”, in: STI Studies, 1/2006,
pp. 41–60, p. 46.
Scientific Uses of Counterfactual Thought Experiments 129
soning would not lead to subjective and arbitrary stories about the past if
value-related meaning and causal significance could be correlated by means
of counterfactuality. Still, it only served as a negative heuristic tool to judge
the significance of actions and events, and the validity of conceptual abstrac-
tions – the construction of alternative histories was off-limits.
To this day, most historians have embraced Weber’s warning: “The at-
tempt to hypothesize in a positive way what ‘would’ have happened can,
if it is made, lead to grotesque results”.68 It might be interesting, however, to
apply his considerations about individual counter-facts to counter-scenarios in
order to assess entire frames of historical factuality and trigger shifts in per-
spective of general theories and master narratives: While different facts can
help to change the plot of a story, imagining different scenarios can lead
to a change in genre, to new approaches established on the basis of alter-
nate actors, forces, things, or discourses. Historians of ideas might ask what
would have happened if a prominent theorist like Weber had not written
about objective possibility and adequate causation. The historians’ answer
certainly would not be that Alexander Demandt would not have worked on
“history that never happened” a hundred years later.69
I. Introduction1
1 For reasons of space, footnotes have been limited to a minimum. All notes in
square brackets within quotations are mine, and so are all translations from Ger-
man into English unless otherwise noted. With regard to the scholars mentioned
in my paper, in some cases I have provided relevant biographical information.
2 Nancy Cartwright, “Counterfactuals in Economics: A Commentary”, in: Joseph K.
Campbell/Michael O’Rourke/Harry S. Silverstein (eds.), Causation and Explanation:
Topics in Contemporary Philosophy, 4th ed., Cambridge 2007, pp. 191–216, p. 191.
3 To avoid confusion I use the term “history” with reference to “past events and
processes” and “historiography” with reference to “the results of inquiries about
history”, as suggested by Aviezer Tucker (“Introduction”, in: Tucker (ed.), A Com-
panion to the Philosophy of History and Historiography, Chichester 2009, pp. 1–6, p. 2).
4 Elazar Weinryb, “Historiographic Counterfactuals”, in: Tucker, Companion,
pp. 109–119, p. 109.
5 Cf., in particular, Niall Ferguson (ed.), Virtual History: Alternatives and Counterfac-
tuals, London 1997; Robert Cowley (ed.), What If ? The World’s Foremost Historians
Imagine What Might Have Been, New York 1999; Robert Cowley (ed.), More What If ?
Eminent Historians Imagine What Might Have Been, New York 2000; Roland Wenzl-
huemer, “Counterfactual Thinking as a Scientific Method”, in: Wenzlhuemer
(ed.), Counterfactual Thinking as a Scientific Method, in: Historische Sozialforschung 34,
2/2009, pp. 27–54; Kai Brodersen (ed.), Virtuelle Antike: Wendepunkte der Alten
Geschichte, Darmstadt 2000; Michael Salewski (ed.), Was Wäre Wenn: Alternativ- und
Parallelgeschichte: Brücken zwischen Phantasie und Wirklichkeit, Historische Mittei-
lungen, Supplement 36, Stuttgart 1999.
6 Cf. Arnd Hoffmann, Zufall und Kontingenz in der Geschichtstheorie: Mit zwei Studien zu
Theorie und Praxis der Sozialgeschichte, Studien zur Europäischen Rechtsgeschichte
184, Frankfurt a.M. 2004, pp. 141–158; Geoffrey Hawthorn, Plausible Worlds: Possi-
bility and Understanding in History and the Social Sciences, Cambridge 1995; Wolfgang
Stegmüller, Probleme und Resultate der Wissenschaftstheorie und Analytischen Philosophie,
Counterfactuality and History 131
Vol. 1, Erklärung, Begründung, Kausalität, 2nd, revised and extended ed., Berlin/Hei-
delberg/New York 1983, pp. 329–345.
7 Tom Wicker, “If Lincoln Had Not Freed the Slaves: The Inevitable Results of No
Emancipation Proclamation”, in: Cowley (ed.), More What If ?, pp. 152–164.
8 Victor D. Hanson, “Socrates Dies at Delium, 424 B.C.: The Consequences of a
Single Battle Casualty”, in: Cowley (ed.), More What If ?, pp. 1–22.
9 Michael Salewski, “Vorwort”, in: Salewski (ed.), Was Wäre Wenn, pp. 7–12, p. 9.
10 Cf. the subtitle “Eminent Historians Imagine What Might Have Been” of Cowley
(ed.), More What If ?.
11 Cf. the subtitle “The World’s Foremost Historians Imagine What Might Have
Been” of Cowley (ed.), What If ?.
12 Paul B. Miller, “Counterfactual History: Not ‘What If ?’ but ‘Why Not?’”, in:
Chronicle of Higher Education, 50/2004, pp. B10–B11, p. B10.
13 The probably “most famous” (Arthur Marwick, The New Nature of History: Knowl-
edge, Evidence, Language, Houndsmill-Basingstoke 2001, p. 128), however contested
exception is: Robert Fogel, Railroads and the American Growth: Essays in the Economet-
ric History, Baltimore 1964.
14 Cf., in particular, the numerous works by Richard N. Lebow, who has just pub-
lished a monograph entitled Forbidden Fruit: Counterfactuals and International Re-
lations, Princeton 2010.
15 Werner Suppanz, “Maria Theresia”, in: Menschen – Mythen – Zeiten, Memoria Aus-
triae, Vol. 1, Munich 2004, pp. 26–47, p. 28.
132 Georg Christoph Berger Waldenegg
In the end the question whether the doom of the Habsburg monarchy could have
been avoided by a thorough remodelling in a federal-democratic sense […] cannot
be answered and is, all things considered, an idle question.16
In other words: historians should “study what actually happened, not what
could have, may have, or would have happened”.17
Yet, despite such scepticism and although “some contemporary histori-
ans still sternly warn us to avoid ‘what-might-have-been’ questions”,18 more
scholars than ever seem to acknowledge the potential value of more or less
detailed counterfactual reasoning. One of them is the German historian
Alexander Demandt (b. 1937), who as early as 1984 published a perti-
nent study (translated into English in 1993)19 which has aptly been labelled
“groundbreaking”20 – at least as far as contributions by professional histori-
ans go.21 Demandt has insistently defended the legitimacy of such reasoning,
regardless of the criticism directed against his book22 (which often must ac-
knowledge the “right of the probable to exist”).23 For instance, he writes:
“Provided we always take into account the relevant prehistory, the actual
16 Adam Wandruszka, “In der heutigen Welt eine Anomalie”, in: Wandruszka/Peter
Urbanitsch (eds.), Die Habsburgermonarchie 1848–1918, Vol. VI , Part I, Die Habsbur-
germonarchie im System der internationalen Beziehungen, Vienna 1989, pp. xi–xvi, p. xv.
17 Aviezer Tucker, “Causation in Historiography”, in: Tucker (ed.), Companion,
pp. 98–108, p. 103.
18 Philip E. Tetlock/Aaron Belkin, “Counterfactual Thought Experiments in World
Politics: Logical, Methodological, and Psychological Perspectives”, in: Tetlock/
Belkin (eds.), Counterfactual Thought Experiments in World Politics: Logical, Methodologi-
cal, and Psychological Perspectives, Princeton 1996, pp. 1–38, p. 1. A prominent
example is: Marwick, New Nature, p. 128. The sociologist Randal Collins is also
very sceptical (“Turning Points, Bottlenecks, and the Fallacies of Counterfactual
History”, in: Sociological Forum, 3/2007, pp. 247–269).
19 Cf. Alexander Demandt, History That Never Happened: A Treatise on the Question:
What Would Have Happened If …?, trans. Colin D. Thomson, 3rd ed., Jefferson, NC,
1993.
20 Wenzlhuemer, “Counterfactual Thinking”, p. 41.
21 Cf. Alexander Demandt, Ungeschehene Geschichte: Ein Traktat über die Frage: Was wäre
geschehen, wenn …?, 4th ed., Göttingen 2005.
22 Cf., above all, Hubert Kiesewetter, Irreale oder reale Geschichte? Ein Traktat über
Methodenfragen in der Geschichtswissenschaft, Herbolzheim 2002; Gregor Weber,
“Vom Sinn kontrafaktischer Geschichte”, in: Brodersen (ed.), Virtuelle Antike,
pp. 18–22.
23 Frank Müller, “Might-have-been-history: Vom Sinn der Frage ‘Was wäre ge-
schehen, wenn …?’”, in: ungewußt: Zeitschrift für angewandtes Nichtwissen, 12/2005,
pp. 21–40, http://www.ifan-online.de/ungewu/heft12/mueller12.htm, p. 36 (Feb-
ruary 2, 2010).
Counterfactuality and History 133
[T]he [political] forces responsible for the execution of the national loan [floated
in 1854] shared some responsibility for the fact […] that the official [imperial]
motto “viribus unitis”, sounding so nice, matched reality less and less. At the latest
by 1918, even the most intransigent optimists had to apprehend this.29
I wrote these sentences a few years ago. Re-reading them today, I feel some-
what queasy. At the end of my article you will understand why this is so.
One of the most remarkable results of counterfactual research is probably
a recent book dedicated to the allegedly “most controversial what if in the
history of American foreign policy”, that is, the question “What if John F.
Kennedy had lived?”.30 The three authors, political scientists quite deeply in-
terested in history and working to some extent like skilled historians, formu-
late a very peremptory counterfactual, namely: “Yes, there would have been
no [Vietnam] war if Kennedy had lived”.31 However, not surprisingly, a his-
torian notes in a foreword to this volume that one “never can know with
certainty [what a surviving Kennedy would have done in Vietnam]”.32 One
could even ask whether historians and scholars in general – in particular
political scientists – command sufficient scholarly means to assess or estab-
lish the likelihood (plausibility, possibility)33 of counterfactuals regarding his-
torical processes, events, and motives.
This is one of the aspects I will concentrate on in the following (part III ).
In addition, I will discuss three aspects which, at least in part, have been
rather neglected so far. This holds particularly true with regard to problems
concerning the definition of counterfactual historiography (part II ) and the
different forms of such an undertaking (part IV ). Furthermore, I will reflect
briefly on the question whether historians are aware of the conditions and
implications counterfactual historiographic reasoning implies (part V). In
conclusion, I will ponder on the utility of such reasoning. Deliberately, this
will be done in a somewhat provocative manner, in the hope to ignite further
discussion.
29 Georg Christoph Berger Waldenegg, Mit vereinten Kräften! Zum Verhältnis von Herr-
schaftspraxis und Systemkonsolidierung im Neoabsolutismus am Beispiel der Nationalanleihe
von 1854, Veröffentlichungen der Kommission für Neuere Geschichte Öster-
reichs 94, Vienna 2002, p. 613.
30 Fredrik Logevall, “Foreword”, in: James G. Blight/Janet M. Lang/David A.
Welch, Vietnam If Kennedy Had Lived: Virtual JFK , Lanham 2009, pp. ix–xi, p. ix.
31 James G. Blight/Janet M. Lang/David A. Welch, “Epilogue”, in: Blight/Lang/
Welch, Vietnam, p. 251.
32 Logevall, “Foreword”, p. ix.
33 These two terms are used by Hawthorn (cf. Plausible Worlds, pp. 79, 123). One
could even speak of “probability”.
Counterfactuality and History 135
who has seized and understood them”.40 Indeed, “these facts themselves are
not objective items, but rather dissected from a contingent transmission and
elevated to the status of a fact by way of interpretation”.41 Second, and for
our purpose more substantial: Are definitions as the ones mentioned above
“reasonably precise”, as the editors of one stimulating related miscellany
claim?42 This would be a great advantage because definitions are often sub-
ject to never-ending contention, not only among historians. But Demandt,
for instance, holds that “counterfactual history describes reflections on
likely events in the past which didn’t take place”.43
This is a crucial point: Must historiographic counterfactuals be likely?
Let us take the following counterfactual: “If only the Soviet Union had had
the atomic bomb in the summer of 1941 and if Joseph Stalin had been
irrevocably determined to use it immediately after the German attack he
wouldn’t have had to decide in the autumn of that year whether to leave Mos-
cow or not”. Obviously enough, Stalin could not have had the bomb in the
summer of 1941, therefore, I have created a kind of “miracle counterfac-
tual”,44 which one might denounce as pointless or even as a kind of bogus
science approach. However, it illustrates plainly the potential decisiveness of
overwhelming military power. I assume that Levine has exactly such an
“elucidation”45 in mind when he accepts historical counterfactuals as “heu-
ristic expressions” (although it might be more appropriate to speak of a
“heuristic device”).46 And whether such insights could as “easily be accom-
plished in ways more faithful to the historical record”,47 has yet to be seen.
48 Alexander Demandt, Ungeschehene Geschichte: Ein Traktat über die Frage: Was wäre ge-
schehen, wenn …?, Göttingen 1984, p. 46. My translation, because the English ver-
sion of this passage is imprecise.
49 But obviously this is part of the problem of counterfactual reasoning, which quasi
by definition implies to reflect on future developments.
50 Demandt, History, p. 47.
51 Demandt, History, pp. 107, 204.
52 Pohanka, Kein Denkmal, p. 204.
138 Georg Christoph Berger Waldenegg
53 Holger H. Herwig, “Hitler Wins in the East but Germany Still Loses World
War II ”, in: Philip E. Tetlock/Richard N. Lebow/Geoffrey Parker (eds.), Un-
making the West: ‘What-If ?’ Scenarios that Rewrite World History, Ann Arbor 2006,
pp. 323–360, p. 331.
54 Tim de Mey, “Remodeling the Past”, in: Foundations of Science, 10/2005, pp. 47–66,
p. 59.
55 Cf. Max Weber, “Objective Possibility and Adequate Causation in Historical Ex-
planation”, in: Weber, The Methodology of the Social Sciences, trans. and ed. Edward A.
Shils, Glencoe 1949, pp. 164–188.
56 For an adequate assessment of Weber’s position, cf. Hoffmann, Zufall, p. 145.
57 Herwig, “Hitler”, p. 331.
58 Herwig, “Hitler”, p. 331.
59 Cf. Dmitri Volkogonov, Stalin: Triumph and Tragedy, London 1991, pp. 434–435.
60 According to Volkogonov, Beria was “somewhat nervous” and Stalin “angry”
(Stalin, p. 435), while according to Herwig the latter was “not amused” and the
former “nervous” (“Hitler”, p. 331).
Counterfactuality and History 139
perfect record – and what ought this to be? – there would remain a second
problem: Could Stalin really have demanded to be transported to Kuntsevo?
According to some experts a counterfactual must be one already “contem-
plated by contemporaries”.61 This claim has been criticized quite harshly, and
it has been argued, for instance, that such a restriction “would rule out all
counterfactuals that were the result of impulsive behaviour (or the lack of
it), of human accident”.62 Be that as it may, and coming back to Herwig’s
example, Stalin apparently considered the possibility of going to Kuntsevo.
Yet, maybe he had no option to act on this thought.
If everything has a specific cause (or causes), then Stalin’s actual decision
also resulted from a specific cause (or causes) which historians maybe are
not able to identify. In our concrete case, Stalin could have been compelled
(despite of his political power) to follow the advice of subordinates, in this
case of Beria. According to the political scientist Richard N. Lebow – very
much engaged in counterfactual case studies – a close counterfactual should
not “strain our understanding of the world”,63 and thus not contradict “what
was technologically, culturally, temporally, or otherwise possible”.64 And evi-
dently, Stalin could have gone to Kuntsevo. This would not have “strain[ed]
our understanding of the world”.
But what is our understanding of Stalin, his character, and his ways of mak-
ing decisions? Can we identify his motives of action? Can we ascribe to him
a kind of typical, empirically observable behaviour in given, in specific circum-
stances? Admittedly, I am not very familiar with his life. But I know that his-
torians still today quarrel over the personality of a man like Adolf Hitler. And
most probably, they will continue to do so.
To begin with, in most cases historians cannot put their patients on the
famous couch. This statement appears to be trivial, but it is important,
since even if they could do so, maybe they would not be better off. Apart
from this, there is some trouble with the historian’s set of tools, the so-called
“historical method” – or rather “methods” (because there is no single his-
Now, to call for unanimous explanations is a rather tall order, the more
so, as historians cannot conduct controlled experiments the way other dis-
ciplines are able to (although their results are often contested, too). Hence,
many colleagues are satisfied if explanations fulfil two conditions established
by the philosopher of science Aviezer Tucker (b. 1965). The first condition
concerns consensus: “Usually, when the consensus involves hundreds of
people who are geographically, institutionally, and professionally dispersed,
it is safe to assume that it is large enough […] to avoid accidental results”.72
The second premise concerns the methodological basis on which the afore-
said consensus is achieved, namely “accuracy, consistency, scope, simplicity,
and fruitfulness”.73
So far, so good! Yet, we should bear in mind three normal, but quite fun-
damental, experiences for historians: First, by and by, even dominant expla-
nations become obsolete or at least require more or less substantial modifi-
cations (this also goes for so-called classics, even though their “form” or
rather their structure might get “irrefutable”);74 second, rather often expla-
nations shared only by a minority – members of the majority sometimes
blemish them as “outsiders” (that is part of the historian’s power game) –
turn out to be considered as more appropriate; third, even alleged hard his-
torical facts in the course of time may turn out to be only interpretations, and
thus, subjective.
What is more, explanations contain certain hidden assumptions (this
might be the case in all academic disciplines) such as: the view of mankind,
ideas about whether mankind in the course of time makes progress or not,
whether human history is made primarily by men, by impersonal forces, no-
torious structures, or even by accident and chaos, and – last but not least –
whether human beings act rationally, as historians normally “assume”.75
Another hidden assumption regards the view what scholarship is and what
it can achieve – if historiography is considered a scholarly endeavour and not
something else, for instance art or “fiction”.76 Finally, there is the problem of
“free will”. Regarding our concrete example, maybe what appears to be “im-
pulsive behaviour” is in reality behaviour resulting from certain brain activ-
ities.77 Hence, though “ultimately it is people who decide and act”,78 free will
might be an illusion (Herwig himself professes to “believe in choice, […] and
not in determinism”, and thus in free will).79
Overall, then, Herwig maybe did not reflect enough on the premises of
his own counterfactual starting point. Now, it has been argued that histori-
ans “hold the keys to their own release”, as far as their “preconceptions”
are concerned.80 I fear this is an illusion. But even if this were the case,
I fear that the other epistemological problems mentioned remain on the
table.
causes”.85 Maybe one could even argue that all indirect causal statements en-
close counterfactuals. A rather typical “direct” statement is the following:
“Had the Habsburg Empire enjoyed a homogeneous population, there is no
reason to suppose that it would not have left the war intact”.86 Here is an-
other example, put even more straightforwardly:
If [the Austrian chancellor] Metternich had […] realized [“constitution” and “par-
liament”] […] already in 1848, in middle Europe too the “spring of people” [“Völ-
kerfrühling”] would have led to a democratic solution of the social and political
problems. But history took a different course.87
Now, there are good reasons to desist from claims one cannot defend.
Yet, precisely because of such caution, it seems quite impossible to test the
relevant hypothesis. Speaking in Popperian terms, it is not open to falsifica-
tion. So maybe we are caught in a kind of dilemma. In other words, cautious
counterfactuals can lead to vagueness or even to vacuity, while strong ones
might be all too easy to falsify (could one establish a kind of happy medium
between the two?), as, for example, this indirect counterfactual by Sked:
“The [Habsburg] Empire fell because it lost a major war”.92 However, such
claims might be taken as “heuristic expressions”, as starting points to analyse
the impact of certain factors, in this case the degree of the impact of the lost
war on the fate of the monarchy.
Another distinction concerns the “temporal” and “substantive range” of
counterfactual statements. I divide them into three forms: first, “temporal”
(short-/mid-/long-range); second, “substantive” (small-/mid-/large-range);
and I confine myself to one temporal long-range counterfactual: Accord-
ing to one historian, the “century-long ‘decline’ [of the Habsburg monarchy]
that ultimately led to […] defeat and dissolution” began as early as 1815.93
Claims of this kind are open to criticism, even more so as they often are lin-
ear, not to say deterministic.
Yet a further distinction concerns causality: Counterfactuals can be
“pluri-causal” or “mono-causal” like the following one:
The absolutistic Austrian political system between 1851 and 1860 failed to settle
the social question in a fair manner. Therefore, even those parts of society which
until then weren’t infected with national ideas became nationalist. In the end, this
process decided upon the future of the monarchy.94
Most historians would denounce “mono-causal” claims as too simplistic.
But “multi-causal” claims imply the problem “of weighting”: “How much
importance should be assigned to factors A, B, C, and so forth?”95
The internal and external situation of the Habsburg monarchy was, of
course, highly complex. Hence, I doubt, for example, that one could formu-
late a “well-constructed”,96 sufficiently “evidentially based”97 counterfactual
that would withstand rather easy deconstruction, such as: “Even in case of
a general European conflagration, the downfall of the Habsburg monarchy
could have been avoided if there hadn’t existed Serbian nationalism or Pan-
Slavism”. Naturally, one could claim: “If nationalism had not existed at
all, there wouldn’t have been World War I and consequently the monarchy
would have survived”. Apart from the fact that the monarchy could have
fallen for other reasons, evidently, this is a “miracle counterfactual” whose
only (and I have no problems with that) heuristic value is that it stresses the
potential impact of nationalism.
A final distinction: Counterfactuals often accentuate personal factors
(human agency) instead of, in particular, structures. According to Walter
Laqueur (b. 1921), counterfactual historiography forms a “reaction to the ex-
treme de-personalization and determinism of current historical studies, with
their emphasis on social history opposed to events and personality-driven
history”.98 Be that as it may, obviously even apparently insignificant circum-
94 Christoph Stölzl, Die Ära Bach in Böhmen: Sozialgeschichtliche Studien zum Neoabsolutis-
mus 1849–1859, Veröffentlichungen des Collegium Carolinum, Vol. 26, Munich/
Vienna 1971, p. 312.
95 Richard F. Hamilton/Holger H. Herwig, “World Wars: Definition and Causes”,
in: Hamilton/Herwig (eds.), The Origins of World War I, Cambridge 2003, p. 41.
96 Giovanni Capoccia/R. Daniel Kelemen, “The Study of Critical Junctures: Theory,
Narrative, and Counterfactuals in Historical Institutionalism”, in: World Politics,
59/2007, pp. 341–369, p. 355.
97 Martin Bunzl, “Posting”, September 8, 2004, http://www.historycooperative.org/
phorum/read.php?11,245,250,quote=1#REPLY (February 1, 2010).
98 Walter Laqueur, “Disraelia: A Counterfactual History”, 2008, http://blogs.law.
harvard.edu/mesh/files/2008/04/disraelia_laqueur.pdf (February 11, 2010).
146 Georg Christoph Berger Waldenegg
stances might help to explain historical events such as the outbreak of World
War I: At the time of the assassination of Franz Ferdinand, the Austro-Hun-
garian Chief of the General Staff was Conrad von Hötzendorf. Even before
the killing he had insisted on a pre-emptive strike against Serbia. So his per-
emptory call for war in the summer of 1914 seems only consistent. Yet,
maybe Conrad “calculated that, as a war hero, he would be free to marry his
beloved Gina von Reininghaus”. Thus, his “infatuation cannot, obviously,
explain the outbreak of the First World War. But it remains a reminder that
the most banal and maudlin emotions, as well as the most deeply felt,
interacted with the wider context”,99 that “little things”, too, “do” or at least
“matter”.100
So what about the likelihood of the following counterfactual: “If Conrad
had never met his beloved Gina, (maybe) he would not have opted for war
the way he did, therefore (maybe) the war-party in Vienna would have had
less influence, therefore, the ultimatum to Serbia (maybe) would have been
at any rate formulated less drastically, therefore, the Serbian government
(maybe) would have accepted its (crucial) terms, therefore, World War I
(maybe) would not have taken place …”?
In conclusion, I must emphasize three aspects: First of all, further distinc-
tions could be made. Second, counterfactuals often combine some of the
categories mentioned. A very striking example, being “mono-causal” as well
as “indirect” and “strong”, is the one already quoted from Sked: “The Em-
pire fell because it lost a major war”.101 Third, there are gradations in all cat-
egories of my typology; they are not binary categories, but arrayed on a con-
tinuum.
Now, are historians aware of the possible conditions and implications re-
garding counterfactual reasoning? The answer is simple and can therefore be
stated briefly: In most cases this does not seem to be the case, since histori-
ans’ counterfactuals of whatever kind are almost never accompanied by ref-
99 Hew Strachan, The Outbreak of the First World War, Oxford 2004, p. 127.
100 Dylan Kissane, “The Balkan Bullett With Butterfly Wings”, in: Central and
Eastern European Political Science Journal, 4/2006, pp. 85–106, p. 100, http://
works.bepress.com/cgi/viewcontent.cgi?article=1000&context=dylankissane
(February 15, 2010).
101 Sked, “European Empires”, p. 150.
Counterfactuality and History 147
VI. Conclusion
counterfactuals”106 nor that they should follow Levine’s advice and use
counterfactuals only as “heuristic expressions”. On the contrary. For at least
four reasons I contend that historians should grapple with the challenge of
counterfactual historiography: First – and as I already said – they willy-nilly
argue counterfactually; in fact, such reasoning “plays a pivotal role in his-
torical inquiries”.107 Second, I agree with Lebow that counterfactual histori-
ography differs only in degree, not in substance from its so-to-say factual
twin brother (or sister), and thus from the normal business of historians. In
reverse this means, then, that historians who argue explicitly counterfactually
should try to base their interpretations and/or explanations on as much
evidence as possible;108 third, both “businesses” are inextricably intercon-
nected. Fourth, and last but not least, many historians like what they are
doing; so they should try to make the best of the problems, if not to say
shortcomings, of their discipline. All told, I support a thesis by the editors of
a recent volume dedicated to what-if scenarios: “The primary value of such an
exercise […] is humility. The world we inhabit is but one of the vast array of
possible worlds that might have been brought about”.109
Let us return to the outbreak of World War I – by the way this expression
is not a fact but an interpretation: For good reasons, some historians con-
sider this war to be already the fifth or even eighth World War.110 Soon after
the assassination of Franz Ferdinand, Vienna sent an emissary to Berlin to
assure unconditional German support in case of war, which was granted.
What if this assassination had not taken place? No blank cheque and no war
then? And could Austria-Hungary really “have become a Commonwealth”,
as Demandt conjectured? In other words: Franz Ferdinand’s death mattered!
Even historians who favour structural explanations are bound to admit its
historical impact. But I fear it will remain uncertain how much it mattered.
Thus, World War I – or something comparable – might have broken out for
other reasons, if not in the summer of 1914, then rather soon.111 And in this
case the end of the Habsburg Monarchy might have occurred in quite a simi-
lar way as it actually did.
Unfortunately, I forgot to mention two things: First, the monarchy could
have become a Commonwealth even in case of defeat or even score, and sec-
ond, it could have won the war! Many experts would judge both claims as
rather unlikely; yet, if “the world we inhabit is really but one of the vast array
of possible worlds that might have been brought about”, they appear to be
only two of so-to-say infinite likely counterfactuals! I would like to conclude
with a reflection borrowed from Robert Musil, in my opinion an inexhaust-
ible source of interesting, important, and useful insights:
Whoever has the sense of possibility [Möglichkeitssinn], does not say, for in-
stance: Here this or that has happened, will happen, must happen; but he invents:
Here this or that might, could, or ought to happen. If he is told that something is
the way it is, he will think: It could probably just as well be otherwise.112
112 Robert Musil, Der Mann ohne Eigenschaften, Reinbek 1978, p. 16, my translation.
150 Richard Ned Lebow
I. Case Studies
Social scientists have long recognized the utility, even necessity, of using
counterfactuals to help establish or buttress causal claims. If we hypothes-
ize that “X” caused “Y”, we assume that in the absence of “X”, other things
being equal, there would be no “Y”. The obvious way of testing such a prop-
osition is in a large sample of comparable, independent cases where there is
adequate variation on both the independent and dependent variables. This
condition can rarely be met in international relations given the limited
number of comparable and independent cases that are generally available.
Scholars using the case study method often attempt to establish causation
Counterfactuals, Contingency, and Causation 151
through process tracing.1 They try to document the links between a stated
cause and a given outcome in lieu of establishing a statistical correlation.
This strategy works best at the individual level of analysis, and only when
there is enough evidence to document the calculations and motives of actors.
Even when such evidence is available, it is often impossible to determine the
relative weight of the hypothesized causes and which, if any of them, might
have produced the outcome in the absence of others or in combination with
other causes or background conditions not present in the case. It is often
useful to supplement process tracing with some kind of comparative analy-
sis. Counterfactual thought experiments can sometimes be utilized toward
this end.
Within a single case comparative analysis can take two forms: intra-case
comparison and counterfactual analysis. Intra-case comparison breaks down
a case into a series of similar interactions that are treated as separate and in-
dependent cases for purposes of analysis. Counterfactual analysis imagines
parallel cases in which one or more hypothesized causes or contextual fac-
tors are mutated. The two techniques are closely related because they add or
subtract what are thought to be key causes or enabling conditions while at-
tempting as far as possible to hold everything else constant. Comparative
case studies, when they are feasible, have the advantage of addressing real
world cases, and the drawback of never being able to control all relevant fac-
tors beyond the cause that is allowed to vary. Counterfactual comparisons do
just the reverse.
I have used both methods of comparison in a case study of the origins of
World War I.2 The Austrian leadership considered going to war with Serbia
or Montenegro on four occasions between 1911 and 1914. These decision
points can be broken out into individual cases and comparisons used to
identify the reasons behind the decision for war in 1914. The most important
difference was the murder of Archduke Franz Ferdinand and his wife Sop-
hie. The archduke, committed to peaceful relations with Russia, had been the
principal opponent to war. Emperor Franz Josef, also inclined toward peace,
was more willing to go to war in the aftermath of the assassinations, as was
the German Kaiser. For Franz Josef, the assassinations transformed the con-
flict between Austria and Serbia from one of interest and security into one
one possibility – can easily sever this causal chain or alter its timing. Had
the Balkan events that Austria-Hungary found so threatening occurred a few
years earlier or later the Austrian response would almost certainly have been
different. If earlier, the German Kaiser and chancellor, given the causes and
timing of their gestalt shifts, would have been much less likely to have encour-
aged Austria to draw its sword. If later – Franz Josef died in 1916 – the Arch-
duke would have been emperor and in all likelihood engaged in a constitu-
tional struggle with Hungary that would have made him and other Austrians
even more inclined toward peace. Similar counterfactual interventions can
be made in the German and Russian causal chains and the consequences of
severing them or changing their timing have equally dramatic consequences
for risk-taking in 1914. Counterfactual thought experiments suggest that
World War I was highly contingent. It was the result of an unusual con-
fluence whose timing was fortuitous.
Elsewhere, Richard Herrmann and I use intra-case comparisons and
counterfactual analysis to probe the causes and contingency of the end of the
Cold War.3 We identified a single causal chain that led to the end of the Cold
War. It began with the election of Mikhail Gorbachev as general secretary,
followed by his decision to withdraw Soviet troops from Afghanistan, arms
control agreements with the West, the political independence of Eastern Eu-
rope, and unification of Germany. We recruited authors to write case studies
of these turning points and use counterfactuals to probe their contingency
and timing. Most of the turning points proved malleable in both respects and
could readily be untracked by credible minimal rewrites of history.
The sensitivity of these turning points to a range of counterfactual inter-
ventions helps to establish not only their contingency but the conditions re-
sponsible for them. In the case of the Cold War, leadership turned out to be
the important variable in most of them. Jacques Lévesque reasons – to offer
one example – that if Gorbachev had pressed for the replacement of Erich
Honecker by Hans Modrow – the preferred candidate of German reform-
ers – and Modrow, as expected, had made overtures to the internal opposi-
tion, events in Germany and Eastern Europe in general would have unfolded
differently. The German Democratic Republic (DDR ) might have remained
independent for some time as existing opposition forces were in favor of
its continued existence; the DDR might ultimately have been unified with
the Federal Republic rather than being absorbed. The new governments in
Poland and Hungary in the summer of 1989 did not call for an end to the
3 Cf. Richard K. Herrmann/Richard Ned Lebow (eds.), Ending the Cold War, New
York 2006.
154 Richard Ned Lebow
Warsaw Pact and reserved a major and crucial role of their communist par-
ties. It was the sudden collapse of the DDR and calls for German unification
that upset the equilibria in Poland and Hungary and made dominoes fall in
Bulgaria, Czechoslovakia, and Rumania, and with them, the Warsaw Pact.4
Counterfactual analysis of turning points helps us to understand why
these events functioned as turning points: they were preconditions or cata-
lysts for other turning points and changing understandings in Moscow and
Washington about the nature and possibilities of their relationship. The se-
lection of Gorbachev as president was critical to the Intermediate Nuclear
Forces’ (INF ) accord with the United States and the general secretary’s pub-
lic disavowal of the use of force to prevent political change in Eastern Eu-
rope. These developments paved the way for the unification of Germany.
Political change in Eastern Europe arguably made German unification diffi-
cult to prevent, as Davis and Wohlforth demonstrate, but did not determine
the form it took.5
Counterfactual probing of the origins of World War I and the end of the
Cold War raises questions about the ability of our theories to explain, let
alone predict, events of this kind. Theories in international relations are sys-
temic or structural, and I use these terms interchangeably. They rely on some
condition or set of conditions (e.g., balance of power, type of regime, nature
and implications of military technology) or process (equilibria, or their ab-
sence) to explain and predict outcomes. They assume that selection effects
are strong enough to shape behavior over time or that actors are sufficiently
rational and self-interested to grasp the implications of these structures or
bargaining games and respond as theorists expect they should.
Critics identify numerous ontological and psychological reasons for the
failure of such theories. The key ontological objections I noted in the intro-
duction: the open-ended, non-linear, path-dependent, and reflexive nature
of the social world. Systemic theories and formal models invariably ignore
the cognitive limitations and emotional commitments of actors and the at-
tention, even priority, they can give to other domestic and foreign problems
they confront. Models of bargaining impose a format of move and counter-
9 Cf. John Lukacs, Five Days in London, May 1940, New Haven 1999; Holger Herwig,
“Hitler Wins in the East but Germany Still Loses World War II ”, in: Philip E. Tet-
lock/Richard Ned Lebow/Geoffrey Parker (eds.), Unmaking the West: “What-If ”
Scenarios That Rewrite World History, Ann Arbor 2006, pp. 323–362.
Counterfactuals, Contingency, and Causation 157
Cold War could have ended in different ways at different times. When and
how the Cold War was resolved was all-important for the character of post-
Cold War international relations. Even theories that might alert us to the
possibility of a European War or an end of the Cold War – valuable as they
would be – could tell us only part of what we want to know. To make more
meaningful statements about transformations – to link events to outcomes –
we must take non-systematic factors into account.10
The origins of World War I and the end of the Cold War were very sen-
sitive to timing. By 1917, Archduke Franz Ferdinand would have been Em-
peror and unlikely to travel to Sarajevo or any destination where his security
would be at risk. Austria-Hungary would also have almost certainly been in
the throes of a serious constitutional crisis, touched off by the new Emper-
or’s extension of the universal franchise to those parts of the empire con-
trolled by the Hungarian minority. Germany would have been compelled
to adopt a defensive military strategy, as even Moltke was on record to the ef-
fect that an offensive against France would not work by 1917. A defensive
strategy, perhaps devised by his successor, or one based on an initial defense
followed by counter-offensives on either or both fronts, would have re-
moved any strategic pressure on Germany to go to war. According to some
authorities, Russia might have faced a revolution, or at least a serious do-
mestic crisis, without the setbacks and suffering of World War I. If a great
power war did start in 1917 or sometime afterwards, it might have had dif-
ferent antagonists and a different outcome.
The end of the Cold War and the way it was resolved were equally de-
pendent on timing. If Andropov or Chernenko had lived a few years longer,
Gorbachev might have had George Bush instead of Ronald Reagan as his
initial American interlocutor. We know that when Bush assumed the presi-
dency in 1988 he was still not convinced that Gorbachev was sincere and was
inclined to slow down, if not halt altogether, Soviet-American rapproche-
ment. Bush’s view was shared by a significant fraction of the official national
security community. In the absence of Reagan, who was more committed to
responding positively to Gorbachev’s overtures than any of his advisors, it
could have proved difficult to wind down the Cold War while Gorbachev
was still in power. If Saddam had still invaded Kuwait in August 1990 – or
anytime in the 1990s – the Soviet Union would almost certainly not have
been as supportive of the U.S. as it was under Gorbachev. The American re-
10 Cf. George W. Breslauer/Richard Ned Lebow, “Leadership and the End of the
Cold War: A Counterfactual Thought Experiment”, in: Herrmann/Lebow, End-
ing, pp. 161–188.
158 Richard Ned Lebow
sponse might have also been different, and if not, war in the Persian Gulf
might have intensified the Cold War. If the Cold War had dragged on into the
twenty-first century, even in muted form, there is no reason to think that the
attacks of September 11 against the World Trade Center and the Pentagon
would not have occurred, as the U.S. would still have had a close relation-
ship with Saudi Arabia. How the American response would have played out
against the background of a still on-going Cold War is difficult to say. The in-
ternational environment in which an American president acted and sought
to cobble together a coalition of allies and local powers – critical for any in-
vasion of Kuwait and Iraq – would have been very different than it was in
1991. In the absence of the Gulf War, subsequent relations between the U.S.
and Iraq would also have developed differently.
Timing effects are by no means limited to these two cases. Consider the
fate of divided nations (the two Germanys, Koreas, Chinas, and Vietnams)
and partitioned countries (Ireland, Cyprus, Palestine-Israel, India-Pakistan),
the other great source of conflict in the second half of the twentieth century.
India and South Korea consistently sought to isolate and undermine Pakis-
tan and North Korea respectively. Their goals changed in the aftermath of
the unification of Germany. Observing how much money it cost the Federal
Republic to begin to integrate the people and territory of the former Ger-
man Democratic Republic, Indian and South Korean politicians and busi-
ness leaders grew fearful of the consequences of the possible collapse of
their long-standing rivals. They envisaged themselves swamped with refu-
gees and burdened with costs of occupation and reconstruction.11 Their pol-
icies have undergone an observable shift in favor of doing what they can to
keep their rivals in business. If North Korea had collapsed first with abso-
lutely chaotic consequences for the South, it is entirely conceivable that the
German government would have responded differently to the prospect of
unification.
Counterfactual thought experiments prompt a radical conclusion. Vari-
ation across time, due to changing conditions and human reflection, the
openness of social systems and the complexity of the interaction among
stipulated causes make the likelihood of predictive theory extraordinarily
low.12 Theories that rely on multiple variables run into the problem of
11 Interviews with Indian and South Korean officials: Washington, D.C.; Columbus,
Ohio; Beijing, China; January 1999, March 2001, September 2007.
12 Cf. Gabriel A. Almond/Stephen J. Genco, “Clouds, Clocks, and the Study of
Politics”, in: World Politics, 4/1977, pp. 489–522, for an early but still powerful ar-
gument to this effect.
Counterfactuals, Contingency, and Causation 159
13 Cf. Ian Lustick, “Multiple Historical Records and the Problem of Selection Bias”,
in: American Political Science Review, 90/1996, pp. 505–518; Robert Jervis, System
Effects: Complexity in Political and Social Life, Princeton 1997.
14 Cf. Milja Kurki, Causation in International Relations: Reclaiming Causal Analysis, Cam-
bridge 2008.
15 Cf. Rom Harré/Peter Secord, The Explanation of Social Behaviour, Oxford 1973; Jer-
vis, System Effects.
160 Richard Ned Lebow
tations, focus more on explanation than prediction, and recognize that any
theory in the social world is nothing more than a starting point for working
through a policy problem or making a provisional forecast.
We are predisposed to think of big events as having big causes, a bias rou-
tinely reflected in historical narratives and social science theories. Ever since
Thucydides historians and international relations scholars have almost in-
variably privileged underlying over immediate causes. In the case of World
War I, as we have seen, the conventional wisdom holds that if the assassin-
ations in Sarajevo had not taken place, some other event, or set of events,
would have triggered a European war. The end of the Cold War is also at-
tributed to deep structural causes, most notably the failure of communism as
a social-economic system, and with it, the loss of communism’s appeal and
the relative decline of the Soviet Union vis-à-vis the West.
Counterfactual priming and case studies have the potential to make us
aware of the extent to which our theories build on and reinforce our procliv-
ity to see history as a linear progression from which deviation was improb-
able. They can alert us to the shortcomings of linear models, so central to
existing approaches to the study of international relations, and more funda-
mentally, to Humean causation and its search for regularities across cases –
the “holy grail” of social science, as currently formulated. By loosening the
hold of this primitive but psychologically satisfying approach to causation,
counterfactuals can make us more receptive to complex, non-linear models
that recognize that international relations is an open system and sensitive to
the role of chance, agency, and confluence. Many simulations, including the
so-called “game of life”, point to the same conclusions.16
Theory building along these lines would start from the premise that wars
and other events (e.g., economic crises) that have the potential to transform
the international system or behavior of actors are likely to be the result of
multiple reinforcing causes. They may also require catalysts that have inde-
pendent causes. The underlying and immediate causes of such events ac-
cordingly need to be addressed separately and in sequence. The problem can
17 Cf. chs. 7 and 9 in Richard Ned Lebow, Between Peace and War: The Nature of Inter-
national Crisis, Baltimore 1981, for these conditions and relevant cases.
162 Richard Ned Lebow
tue of the difficulties the Junker class had in response to the political,
economic, and social challenges of modernity.18 These examples indicate
the extent to which critical background conditions in turn have deeper
causes that must be traced and considered as the more fundamental under-
lying causes of war.
Immediate causes of war are catalysts that trigger hostilities. Sometimes
they are unproblematic in the sense that they are almost certain to arise
when the underlying conditions for war are present. They cannot be taken
for granted, as they sometimes have independent and infrequent causes of
their own right, as with the assassinations at Sarajevo. I have tried to dem-
onstrate that they had effects on Austrian and German policymaking that
probably would not have been brought about by other kinds of provoca-
tions. In this sense they were causes of war in their own right, and call into
question the usual hard-and-fast distinction between underlying and im-
mediate causes.
It is useful for purposes of analysis to identify four classes of catalysts or
immediate causes (fig. 1). Type I catalysts are common occurrences linked to
the underlying causes of the event we are trying to explain. “Fender benders”
are minor road accidents; they are more likely in cities where there is lots
of vehicular traffic. In this instance the increase in traffic – the underlying
cause – brings with it an increase in accidents. Border incidents might also fit
in this category. They are generally outgrowths of deeper conflicts between
countries and historically have served to trigger wars. They have sometimes
been arranged with this end in mind.
Type II catalysts are events with independent causes but occur frequently and
can serve as catalysts when the appropriate conditions are present. Rainfall is
independent of traffic but a regular event in temperate climates. It will almost
certainly bring about an increase in fender benders on heavily trafficked and
slick streets, and more so still in aggressive driving cultures or in countries like
India, where one does not need a license to purchase or drive a car.
Type III catalysts are events that are not only independent of underlying
causes but infrequent. Staying with our illustration of road accidents, this
might include fatalities caused by a bridge collapse, as happened in August
2007 on Interstate 35 where it crosses the Mississippi River in Minneapolis.
The collapse occurred during the rush hour and at dusk, a time of poor vis-
ibility, which increased the number of fatalities, but the timing was uncon-
nected to the causes of the collapse. Bridge failures are relatively infrequent
and their distribution is entirely independent of the underlying causes of the
events in which we are primarily interested.19
Type IV catalysts are also independent of underlying causes. Like viruses
that require a specific surface architecture to penetrate a target cell, they
must meet a set of additional requirements to serve as catalysts. Sarajevo is
the quintessential example. The assassinations were the outgrowth of an in-
ternal struggle for power in Serbia and their timing was entirely independent
of the confluences that made leaders in Vienna, Berlin, and Petersburg more
risk prone. As I noted earlier, Sarajevo created or helped to bring about four
critical conditions without which Austria would not have declared war on
Serbia. This is one of the reasons why World War I was so contingent.
The likelihood of a catalyst appearing when underlying causes and en-
abling conditions of context are present varies as a function of type. Type I
catalysts are the most common and can more or less be assumed to occur in
these conditions. Type II through IV catalysts are increasingly problematic.
We must treat them as causes with probabilities different from and indepen-
dent of underlying causes. To the extent that they can readily be untracked
by minimal rewrites we must consider the wars which they triggered to be
highly contingent.
My approach for probing the causes of events like wars is more or less the
reverse of how scholars usually attempt to analyze such events. It is far more
common to start with a theory or propositions about causes and test or
evaluate them against an appropriate data set or in selected case studies. In-
stead of working deductively from theory to data, I have worked inductively
from data to theory – with the understanding, of course, that what we select
as relevant data is inevitably conditioned by the theoretical assumptions we
bring to the problem.
The advantage of this strategy, I believe, is that it is capable of capturing
the richness of the causal nexus responsible for transformative events like
wars. In contrast to approaches that test for the presence or absence of single
cause, or even multiple causes, and ignore other possible causes and enabling
conditions, my approach foregrounds them, making it possible to account for
the effects of open-ended systems and non-linear effects. This approach also
stands in sharp contrast to historical narratives that often propose multiple
causes for war, and may describe key enabling conditions, but do so in an a-
theoretical way that makes it difficult to see the connections among the many
hypothesized causes and between them and relevant enabling conditions.
In social science rigor needs to be balanced against richness. This is not
infrequently done by excluding so-called contextual features from consider-
ation and treating cases as fully comparable when they are not. This is bad
science, and uses the claim of rigor as a rhetorical fig leaf. My strategy leads
us to consider multiple causes and the importance of context and admittedly
can lead to a proliferation of causes that makes theorizing difficult in the ab-
sence of any way of ranking the importance of these causes and of distin-
guishing those which were critical from those which were not. In an ideal
world we would do this by constructing a data set of independent but com-
parable cases that was large enough and contained sufficient variation on its
independent and dependent variables to allow us to run appropriate statis-
tical tests. For reasons that have been made clear, this is simply not feasible
with respect to wars and many other kinds of major international events.
An alternative, admittedly less rigorous, but generally more feasible strat-
egy is counterfactual probing. We remove one-by-one hypothesized causes
of war and enabling conditions and ask ourselves if the outcome would have
been any different. Would policymakers have behaved in similar ways in the
absence of one or more of these causes and conditions? Answers to these
questions invariably involve a degree of speculation, but there is often con-
siderable evidence available that allows us to make informed and empirically
defensible inferences. Suppose, for example, the German military had not
been committed to an offensive strategy but had plans for conducting a de-
fensive strategy on both fronts, intended to draw advancing French and Rus-
sian armies into traps where they could be repulsed, perhaps even destroyed,
with relatively moderate German losses in long-planned and carefully pre-
pared counter-offensives. Would the Kaiser and chancellor have given a
“blank check” to Austria in these circumstances? Would they have felt the
need to stiffen the spine of their Austrian ally? If it can be demonstrated
from the documents – as I think it can – that the Kaiser framed the ques-
tion as one of honor, not of security, and felt compelled to act, as he put it,
as Franz Joseph’s “second” in his duel with Serbia, then he would probably
still have acted aggressively even though security concerns were not deter-
minant. This was not true for his chancellor, who had been convinced by
Counterfactuals, Contingency, and Causation 165
chief-of-staff Moltke that war had to be fought before 1917 when the of-
fensive Schlieffen Plan – really the Moltke plan – would no longer have any
chance of success.20 With a good defensive strategy, the chancellor would
have been relatively immune to Moltke’s blandishments. However, the
Kaiser might well have ignored his chancellor’s pleas for caution in this cir-
cumstance, as he did in January 1917 when he supported his admirals’ de-
mand for unrestricted submarine warfare.21 This counterfactual thought
experiment suggests that an offensive military plan, while contributing to
war, was not a decisive cause, as the Germans might have issued the blank
check to Austria in its absence. They would have been less likely to have
mobilized when they did, as they would not have been under the same stra-
tegic pressure. However, German support for Austria would still not have
prevented Russian mobilization, which, in retrospect, would have remained
the point of no return.
Surgical counterfactuals are no more feasible than surgical air strikes, so
we must also consider what else would have been different – or the same –
when we remove a cause or enabling condition. As I noted above, a German
defensive strategy would have removed the need for rapid mobilization and
an unacceptable ultimatum to France demanding that key border fortresses
be handed over to Germany. War on two fronts was inevitable once Ger-
many and Russia went to war. However, a German defensive strategy would
have preserved Belgian neutrality and in all likelihood kept Britain neutral.
Germany would have been much more likely to have emerged victorious in
these circumstances. But how plausible was a defensive strategy? It is diffi-
cult to imagine Germany adopting such a military plan as long as Moltke was
chief-of-staff. If the Kaiser had not appointed him – and military authorities
thought Moltke an unusual choice at the time – other generals would have fa-
vored an offensive strategy, although they may have been less outspoken in
demanding war in every crisis from 1905 to 1914. Even this brief excursus
indicates that offensive military doctrines, while not a decisive cause of war,
were still important and would have been difficult to remove before 1916–17
with only minimal rewrites of history.
Similar thought experiments with other causes and enabling conditions
can, within reason, allow us to make some judgments about the extent to
which they were necessary for war, how contingent they were and how
closely linked they were to other causes and conditions. If we remove a cause
or condition and other causes and conditions drop out as well by virtue of
the counterfactuals we must introduce to do this, they can be said to be
coupled. In effect, we can build up a causal map of the event in question that
can help eliminate some causes as unnecessary or redundant, identify those
that appear to be the most important and perhaps some that can be traced
to similar origins. An example of the latter was the intense desire of so many
Austrian military and political officials for war with Serbia, even though they
recognized that it put Austria’s security at risk because it almost certainly en-
tailed war with Russia in circumstances where little military help could be ex-
pected from Germany. For these officials and the Emperor, Austria’s honor
demanded a military response almost regardless of the consequences. Aus-
trian motives for war and those of the German Kaiser and his chief-of-staff
can be traced back to the same source: an aristocratic honor code, not only
kept alive by leading officials in both countries but adhered to with a ven-
geance because of the perceived threat of modernity and the concomitant
spread of bourgeois values in society.22
The hindsight bias and historical scholarship encourage us to regard
major developments as overdetermined. We know this is not the case and
that there is considerable variation in the contingency of all important social
outcomes. Found at one end of the continuum is the so-called butterfly ef-
fect, where small changes have amplifying effects, moving the course of de-
velopments far from the path they would otherwise have taken. Butterflies
of this kind might have prevented events that were otherwise probable and
have been responsible for others that appeared at the time to have a low
probability. The determinist end of the continuum might be compared to a
freight train highballing down the track. In the absence of switches it takes
enormous counterforces to alter its course. Counterfactuals can be used to
make a reasonable determination of where along this continuum a particu-
lar event lies. Causation is often framed in terms of agency and structure. My
continuum recognizes that this is not an either-or choice and that agency
is merely one of the factors we must consider at the contingent end of the
spectrum. In chapter two of Forbidden Fruit I propose a six-step process for
determining the contingency of an event, which I applied to the rise of the
West, and later, to World War I and the end of the Cold War.23 I offer a brief
restatement of these steps here:
1. What do we have to do to negate a cause or confluence? Causes or confluences
should be altered in ways that are readily credible and render them innocu-
ous much the way small mutations in the genes of pathogens can eliminate or
significantly reduce their virulence.
2. How many credible minimal rewrites can be found that might prevent, alter or stall
the turning point? As a general rule, the more different components of a turn-
ing point can be removed by minimal rewrite counterfactuals, the more con-
tingent the turning point.
3. How far back must we go to find credible minimal rewrites? The further back in
time we go to find a minimal rewrite counterfactual, the more steps there are
likely to be between antecedent and consequent and the lower its probability,
as the probability of any counterfactual is the multiple of the probabilities of
every step in the chain. Multiple-step counterfactuals also increase the possi-
bility that other developments set in motion by the initial counterfactuals
could sever the chain of causation leading to the desired alternative outcome.
4. At what level of analysis are our minimal rewrites? Minimal rewrites of history
require small, plausible changes in reality that are likely to have big conse-
quences. For this reason practitioners of counterfactual history most often
invoke changes in personnel, policy, or the fortunes of war. Changes some-
times require intervention at the level of elites, bureaucracies, or domestic
politics, and more elaborate arguments linking antecedents to consequences.
When we change ideas, state structures, and the balance of power, the latter
requiring intervention at the system level, minimal rewrites are out of the
question unless we go back to a point in time when those ideas, structures,
and balances had not jelled and might be significantly affected by small,
plausible changes at levels one and two. The further back we introduce
changes, the less plausible the consequent becomes because our antecedent
is likely to introduce other changes with unknown interaction effects and
consequences. So a turning point can be considered contingent if it can
be untracked by numerous, plausible counterfactuals involving agents, con-
fluence, or chance, and much less so if it requires intervention at the level of
elites, bureaucracies, or ideas.
5. How redundant is the turning point? Different causal paths have different
implications for contingency. A turning point described by the simple, linear
pathway of A + B + C (and only A + B + C) might be prevented by severing
any link in the three-step chain. If “A” and “B” are themselves the products
of other chains of causation, there may be many possible ways of using mini-
mal rewrite counterfactuals to prevent the turning point by preventing pre-
conditions “A” and “B”. There are turning points that are the outcomes of
simple linear chains. Others are likely to have one or more paths leading to
them. If we prevent A + B + C, the possibility of G + H + C, and perhaps
of M + N + C, remain. To prevent a turning point with a minimal rewrite
168 Richard Ned Lebow
IV. Conclusion
The methods I propose for conducting such experiments are not intended to
replace traditional approaches but to supplement them. If theories and prop-
ositions work from the general to the particular, my approach moves from
the particular to the general. It is more inductive than deductive, but com-
Counterfactuals, Contingency, and Causation 169
plementary rather than antagonistic. By tacking back and forth between de-
ductive theory and inductive counterfactual probes of empirical cases we can
develop better theories and better understandings of both their potential and
limitations.
Such a project can be made more compelling when it is a collaborative
one, bringing together scholars with different substantive expertise and even
different initial estimates of the contingency of the event in question. Argu-
ment and counter-argument over counterfactuals and their implications have
the potential to bring about a consensus or bracket disagreement when that
is not possible. Such an exercise is not limited to the past. It is equally appro-
priate to forecasting, which makes use of many of the same techniques. It es-
tablishes possible branching points in scenarios and engages in speculative
inquiry about how much (or how little) the alternative paths they create di-
verge from the main narrative.24
I. Introduction
“History”, the English writer and critic David Lodge states in his book
The Novelist at the Crossroads, “may be, in a philosophical sense, a fiction but
it does not feel like that when we miss a train or somebody starts a war”.1
Lodge’s remark refers to theories many scholars are familiar with these days
but which were only starting to infuse critical debates when Lodge’s book
was published in 1971. These theories, centered around the assumption that
the writing of history is not an objective process of reconstruction but a cre-
ative and imaginative act, set out to remodel conceptions of reality, past and
present, by stressing the impact of narrative discourse on its subjects. His-
tory was therefore to be evaluated as a construct, shaped by the needs, possi-
bilities, and limitations of language. In order to understand the discomfort
following in the wake of such claims one must bear in mind that at least since
Leopold von Ranke established historiography as an academic discipline
in the early 19th century, it was taken for granted, more or less, that lan-
guage was an adequate tool to reproduce reality. The postmodernist thinkers
Lodge hints at turned the former order around by arguing that language does
not reproduce reality but generates it.
Looking back, it seems that one of the first scholars to articulate such
views was the American philosopher Arthur C. Danto, whose Analytical Phil-
osophy of History prepared the ground for others in the 1960s. Danto argued
that narrative sentences were a key feature of historical writing and he came
to the conclusion that works of historiography were written according to the
same principles as other narratives. Danto’s theses allowed for the assump-
tion that the facts of history were not found but made. Since then, in the hu-
manities these claims and their effects have become known as the linguistic
turn. It challenged history as an academic discipline as it prompted historians
to reflect on their methods, and it seems as if it helped to shape a zeitgeist in
which the counterfactual thought experiment gained in importance.
1 David Lodge, “The Novelist at the Crossroads”, in: David Lodge, The Novelist at
the Crossroads and Other Essays on Fiction, London 1971, pp. 3–34, p. 33.
Towards a Typology of Counterfactual Historical Novels 171
The shape that these experiments most often take are case scenarios in
which a historical event is altered by changing its outcome or by substitut-
ing one or several of its constituents. Yet, obviously and maybe paradoxically,
counterfactual versions of historical events require facts. The Dutch his-
torian Chris Lorenz explains that the existence of facts is a vital necessity for
a discourse about truth when he argues that the statement “Hitler died in
1945” is true because it is a fact that Hitler died in 1945, whereas the state-
ments “Hitler died in 1944” or “Hitler died in 1946” are not true.2 Instead of
“not true” one might say “counterfactual”. But how can we decide that an
account of history is counterfactual if the facts themselves are not rooted in
solid ground?
This question concerns the triangle in which facts, language, and narrative
strategies interact. Since history is essentially concerned with facts, it is state-
ments about these facts by which truth value is assessed. Historiographers
find themselves locked in a double bind: Specific facts must be established
while the facts as such have become a bit slippery. Without pretending to
solve the problem, however, it seems that Terry Eagleton has come up with
an approach that is sensible and useful. Starting from the observation that
there are hardly “brute facts”, but that facts must always include interpre-
tations, Eagleton concludes that an “interpretation on which everyone is
likely to agree is one way of defining a fact”.3 Eagleton’s definition acknowl-
edges the rhetorical quality of historical facts without abandoning their status.
Therefore, by means of a working definition, such representations of history
can be understood as “counterfactual” that deviate from the version that is
commonly known and accepted at least in one aspect of great significance.
It is crucial that the deviation from the facts takes place deliberately.
Why is this? In order to distinguish counterfactual propositions from other
fictional images of history some preliminary assumptions can be made:
Probably every historical novel ever written includes minor deviations or is
inaccurate in certain details, sometimes on purpose and sometimes by mis-
take. Yet there are novels in which this kind of deviation is immediately vis-
ible because of its dimensions. We can assume that it is there to be recog-
nized and that it is there for a reason. Both aspects are vital: Firstly, if the
deviation from the facts occurs accidentally, it is a flaw. Secondly, if the
disparity between the facts and the counterfactual version told in the novel is
invisible, if it is successfully disguised, it is a falsification or a lie. If this turns
2 Cf. Chris Lorenz, Konstruktion der Vergangenheit: Eine Einführung in die Geschichtstheo-
rie, Cologne 1997, p. 22.
3 Terry Eagleton, Literary Theory: An Introduction, Minneapolis 2008, p. 75.
172 Andreas Martin Widmann
out to be the case, the counterfactual depiction of history serves other pur-
poses.
The present essay will examine basic narrative strategies that are em-
ployed to create counterfactual scenarios in historical novels. In the first
part, a theoretical concept for the analysis of narratives is introduced, which
shall then be applied to two examples in the second part. The works dis-
cussed are of recent dates. The first one, Helden wie wir by the German writer
Thomas Brussig, was published in 1995. It was made into a feature film
shortly after and has been elevated into the canon of contemporary German
fiction due to its unorthodox treatment of the German reunification. The
second example is a novel by the Swiss author Christian Kracht called Ich
werde hier sein im Sonnenschein und im Schatten which came out in 2008. It rewrites
twentieth-century history by turning Switzerland into a communist super-
power engaged in a seemingly endless war against fascist states. Both novels
deal with episodes from the authors’ national histories, but in a way that
deviates from the established versions.4 Both novels can be easily recognized
as counterfactual in the sense outlined above. My analysis and interpretation
of these works of fiction will focus on the way history is used as material for
the fable told in each book. Thereby, I aim to show how these novels figure
as representatives of two different types of counterfactual historical fiction.5
4 I will use translated quotations from both novels in the text. The translations are
mine; the original German quotes can be found in the footnotes.
5 The argument presented here has been developed in detail and with regard to
a broader range of literary texts from different countries in Andreas Martin
Widmann, Kontrafaktische Geschichtsdarstellung: Untersuchungen an Romanen von Günter
Grass, Thomas Pynchon, Thomas Brussig, Michael Kleeberg, Philip Roth und Christoph
Ransmayr, Heidelberg 2009.
6 Cf. Alexander Demandt, Ungeschehene Geschichte. Ein Traktat über die Frage: Was wäre
geschehen, wenn …?, Göttingen 1984, p. 9.
7 A number of edited volumes provide examples dealing with historical turning
points from ancient Greece to the twentieth century. Barring the numerous pub-
lications from the popular history section, the volumes edited by Ferguson and
Towards a Typology of Counterfactual Historical Novels 173
and from the 1950s onwards, novelists have asked it more often than ever
before. “Alternate history”, “alternative history”, “uchronia”, “uchronian
fiction” and “parahistorical novels” are the terms most often used to
name the phenomenon. Christoph Rodiek, Jörg Helbig, and Edgar Vernon
McKnight Jr., among others, have described it by using similar parameters.8
Therefore the definition McKnight Jr. offers can stand on behalf of the
others. He defines alternative history as “a fictional genre defined by specu-
lation about what the present would be like had historical events oc-
curred differently”.9 More recently, Michael Butter has further developed the
understanding of what he argues should be labeled “alternate history”. Re-
garding it as “a genre of its own, a genre that exists at the intersections of his-
torical fiction and dystopian literature”,10 Butter defines alternate histories
as “narratives in which one or more past events are changed and the sub-
sequent consequences on history imagined”.11 Within that framework, he
distinguishes “between an affirmative and a revisionist type of alternate his-
tory”.12
A sensible definition when applied to the texts Butter analyzes, and prob-
ably even beyond, it should not be stretched to cover novels in which
other strategies to construct counterfactual versions of history are pursued.
Novels like Thomas Pynchon’s Gravity’s Rainbow, Günter Grass’s Der Butt, or
Brodersen deserve to be mentioned in this context (cf. Niall Ferguson [ed.], Vir-
tual History: Alternatives and Counterfactuals, New York 1999; and Kai Brodersen
[ed.], Virtuelle Antike: Wendepunkte der Alten Geschichte, Darmstadt 2000). Lubomir
Doležel has summed up the principle according to which most case studies work:
“The focus of alternative history is simple yes/no situations: win/lose a battle,
war, election, a power struggle; a leader assassinated/not assassinated” (cf.
“Possible Worlds of Fiction and History”, in: New Literary History, 29/1998,
pp. 785–809, p. 802). The method does not stand uncontested but has given rise
to criticism. Cf., for instance, Hubert Kiesewetter, Irreale oder reale Geschichte? Ein
Traktat über Methodenfragen der Geschichtswissenschaft, Herbolzheim 2002. For a dis-
cussion of the methods and principles of employing counterfactuals in histori-
ography, see also Berger Waldenegg’s essay in this volume.
8 Cf. Christoph Rodiek, Erfundene Vergangenheit: Kontrafaktische Gechichtsdarstellung
(Uchronie) in der Literatur, Frankfurt a.M. 1997; Jörg Helbig, Der parahistorische
Roman: Ein literarhistorischer und gattungstypologischer Beitrag zur Allotopieforschung,
Frankfurt a.M. 1988; Edgar Vernon McKnight Jr., “Alternative History: The De-
velopment of a Literary Genre”, Diss. University of North California 1994.
9 McKnight, “Alternative History”, p. iii.
10 Michael Butter, The Epitome of Evil: Hitler in American Fiction 1939–2002, New York
2009, p. 53.
11 Butter, Evil, p. 9.
12 Butter, Evil, p. 13.
174 Andreas Martin Widmann
Salman Rushdie’s Midnight’s Children deal with history in a way that qualifies
as counterfactual although the authors do not employ the “what-if ” pattern.
So far, two serious attempts have been made to discuss these novels by using
parameters of counterfactuality. Referring to Rodiek, Elisabeth Wesseling
has recast the concept of uchronian fiction as “uchronian fantasy”.13 It “lo-
cates utopia in history, by imagining an apocryphal course of events, which
clearly did not take place, but which might have taken place”.14 Although she
discusses Pynchon, Rushdie, and Grass, she cannot bring their novels to fit
in with her own understanding of the type of fiction she has outlined.15 In
one of the most comprehensive and thorough studies on the contemporary
historical novel, Ansgar Nünning introduces a taxonomy that allows for the
integration of the types of novels I discuss into the category of what he calls
“revisionist historical novel”.16 Such novels, Nünning argues, push the limits
of traditional historical fiction by questioning conventional ways of making
sense of the past and by stressing the differences between past and present.
Revisionist historical novels tend to take a critical position towards offi-
cial interpretations of history and challenge them by providing counter-
narratives; in doing so, they frequently use innovative narrative strategies.17
Nünning’s definition – of which I can only present a highly condensed ver-
sion – works with elements that are related to both the content and the for-
mal structure of the novels. Convincing and insightful as it is on a broader
scale, the integration of counterfactual novels into this scheme comes at the
prize of neglecting the features that make them unique.18 For that reason it
seems advisable to reconsider the understanding of counterfactual history in
literature as described above and to look for means of examining its different
varieties.
In order to do so I intend to look at the basic narrative operations by
which counterfactual set-ups are produced. It is therefore necessary to re-
turn to the analogies between the writing of history and the writing of fiction
that have already been mentioned. The American historian and philosopher
Hayden White has prominently pointed out formal similarities between fic-
tion and non-fiction. In The Tropics of Discourse White argues:
Novelists might be dealing only with imaginative events whereas historians are
dealing with real ones, but the process of fusing events, whether imaginary or real,
into a comprehensible totality capable of serving as the object of representation, is
a poetic process. Here historians must utilize precisely the same tropological strat-
egies, the same modalities of representing relationships in words, that the poet or
the novelist uses.19
Among the methods used by novelists as well as by historians there is, for
example, the selection of material. Every selection includes interpretation
because it must leave things out and give preference to others. Another im-
portant assumption is that both writers of history and writers of fiction
adopt a specific perspective on a subject. They evaluate, they look for rea-
sons, and finally they produce texts that are shaped by narrative patterns.
It has been argued that two different layers can be found in narrative rep-
resentations of history. On the one hand, there are dates and events in their
chronological order. Whether depicted that way or not, the inherent chrono-
logy of their occurrence enables us to regard them as one level of the histori-
cal narrative. On the other hand, there are causal relations. This would be an-
other level of the narrative. Arthur C. Danto has given a precise description
of these levels by distinguishing the so-called “chronicle” from the so-called
‘The king died, and then the queen died of grief ’ is a plot,” says Forster.27
The difference is a simple one. In the first version of this very short narrative
the two events that are referred to are arranged in a merely chronological
order that is signaled by the word “then”. In the second version a cause is
added. “The time sequence is preserved, but the sense of causality over-
shadows it”, as Forster puts it.28 In order to find out whether it is a story or a
plot, Forster suggests asking questions: “Consider the death of the queen. If
it is in a story we say: ‘And then?’ If it is in a plot we ask: ‘Why?’ That is the
fundamental difference between these two aspects of the novel”.29 With re-
gard to the latter question one can think of a number of reasons without
having to change the “story” of the narrative: “The king died, and then the
queen died of joy” would be another version of the narrative which adds a
different plot to the same story. The time sequence is preserved. The events
stay the same in the sense that no death is substituted by entirely different
events: “The king died and then the queen lived happily ever after” would be
a different story altogether.
By applying these considerations to counterfactual history I will argue
that our understanding of literary representations of counterfactual history
should not be restricted to narratives that imagine or depict possible histori-
cal consequences of missing or changed events in the past. In the discussion
of the two examples below I want to employ Forster’s distinction to analyze
and classify two types of counterfactual novels.
III. History’s Missing Link: Thomas Brussig’s Helden wie wir (1995)
At the beginning of Thomas Brussig’s novel Helden wie wir the protagon-
ist and first-person narrator makes a claim of great historical significance.
“Yes”, he says, “it’s true, it was me: I toppled the Berlin Wall”.30 Exagger-
ation is not uncommon in picaresque novels but the narrator is serious here.
That he has single-handedly brought about the downfall of the Berlin Wall
is the idea upon which the counterfactual set-up is built. The narrative
frame for this confession is an interview recorded on tape. The hero, Klaus
Uhltzscht, whose name is hard to spell and almost impossible to pronounce,
tells the story of his life to a journalist of the New York Times. By carefully
watching video tapes, an American reporter, Mr. Kitzelstein, has identified
Klaus as the person whose deed changed history. Klaus promises to shed
light on the events in question, but before doing so, he exploits the oppor-
tunity and speaks about his parents, about his childhood, and his education
in the GDR . The sensational pun, although frequently announced, is only
unveiled towards the end of the novel: In 1989, when the GDR government
appears to be on the brink of a collapse and crowds of protesters assemble
in the streets, Klaus is in Berlin. He listens to a speech the famous writer
Christa Wolf is delivering to the people. Klaus, the most ill-informed person
in the world, as he labels himself, mistakes her for Jutta Müller, the former
ice skating champion and secret object of his teenage fantasies. When he
tries to move closer to her he stumbles, falls down a flight of stairs, and gets
hurt in the groin. He is taken to a hospital where the treatment he is given
causes a mysterious enlargement of his genital. Afterwards he runs away and
re-appears in the middle of another demonstration in Berlin. This is where
his intertwining with history finally takes place.
Addressing his interview partner and the reader at the same time, he says:
“I ran away on the evening of November 9, as you may have gathered”.31
Klaus joins the masses and together with the other protestors he walks on to
one of the border control points:
The so-called masses were lining up in front of it. For reasons I didn’t understand
at the time they kept hoping for the gates of heaven to be raised, for them to flow
over into the West. […] What I didn’t know was that the masses had been stirred
up by a mysterious utterance from Günter Schabowski at his press conference:
For those who want to leave, Schabowski had meant to say, it is no longer neces-
sary to take the long way via the Czech-West-German border, now they can use
the inner German border – but just like any party politician he put the simple
matter in such words that could mean almost anything, after which only minutes
later the session of the Bundestag was hastily interrupted, some members of Par-
liament stood up, sang the national anthem, and believed the borders to be open.
The Tagesschau took the same point of view, whereupon ten thousands of the Ber-
lin people got up, only to find at the border control points that they were hoping
in vain.32
31 “Sie ahnen bereits, ich floh am Abend des 9. November” (Brussig, Helden, p. 304).
32 “Davor drängelten sich sogenannte Volksmassen, die aus mir damals unverständ-
lichen Gründen darauf hofften, die Himmelspforte werde gleich geöffnet, auf das
[sic] sie in den Westen strömen dürfen. […] Die Volksmassen waren, was ich nicht
wußte, durch eine undurchsichtige Formulierung auf der Pressekonferenz von
Günter Schabowski aufgescheucht: Wer ausreisen will, wollte Schabowski sagen,
muß nicht mehr den Umweg über die tschechischwestdeutsche Grenze nehmen,
sondern könne gleich über die deutsch-deutsche Grenze ausreisen – doch wie es
Towards a Typology of Counterfactual Historical Novels 179
Seeing these crowds helplessly in limbo provokes Klaus into doing some-
thing. This is the decisive moment. Klaus, who has always been obsessed
with his genital, undoes his trousers, and at the sight of his enormous penis
the soldiers give up and open the gate:
Slowly I unbuttoned my coat, I undid my belt next, and finally my trousers and I
looked the border guards straight into their eyes. […] Never before had they seen
such a thing! This was something they never thought was possible. […] I took my
time, plenty of time, I looked them in the eye one after another and finally one of
them unlocked the door as if he had been hypnotized.33
At this point the counterfactual claim is finally fully explained. The Wall
came down, but without Klaus the protesters would not have reached their
goal.
Brussig’s novel depicts the events prior to the opening of the inner Ger-
man border most accurately. It provides the facts against which the counter-
factual version is to be read and against which Klaus Uhltzscht’s narrative
is explicitly set. In these passages the historical process is split up into two
parts: First, citizens of the GDR organize peaceful protest, and then the
Berlin Wall comes down. Usually both parts of that historical narrative are
causally linked, which means that the collapse of the GDR is presented as the
result of the protests. In Brussig’s novel, however, the cause and effect re-
lation between the two events is dissolved. If we apply Forster’s terminology,
we can say that the “story” of history has hardly been changed here. The real
historical events are taking place in Helden wie wir but the plot is not the
same. It could be summarized like this: In 1989 citizens of the GDR organ-
ize peaceful protests and then the Berlin Wall comes down because Klaus
Uhltzscht undresses in front of the soldiers. In this version the cause of the
downfall is to be found in Uhltzscht’s action. He explicitly states his claim to
fame when he says: “Anyone who doesn’t believe my story won’t understand
what’s going on in Germany! Without me it doesn’t make sense. For in Ger-
many’s recent history I am the missing link”.34
Due to its depiction of the political changes of 1989, Brussig’s novel has
immediately been recognized (or rather classified) as a so-called “Wende-
roman”.35 This term designates a literary genre that came into being only
with the historical event it is related to, the “Wende”. It was born out of
the notion that a national process as groundbreaking as the German reu-
nification demanded an aesthetic response that would reflect the caesura of
1989. Within academic criticism the counterfactual parts in Helden wie wir
were noted as a violation of expectations attached to the traditional historical
novel. Rachel Halverson states with regard to Brussig’s rearranging of the es-
tablished facts: “Helden wie wir is far removed from the genre of historical
fiction, which does provide its readers with personalized, yet believable ac-
counts of historical events”.36 Unbelievable the plot may be, but Brussig
does not make up this counterfactual version without a reason. However,
he does not fully reveal his motives in the passages discussed but at various
other points in the novel where the picaresque attitude giving shape to the
narrator’s voice is abandoned. Uhltzscht’s distorted perception is replaced by
a more sober and serious approach to the matter when he remarks:
It’s a debate that’s totally distorted but no one seems to notice! How could this so-
ciety continue to exist for decades if everyone was dissatisfied with it? Mr. Kitzel-
stein, I ask you to take my question seriously, it’s not a rhetorical question! Every-
body was against it, but they were part of it, they cooperated, timidly, blindly, or
simply stupidly. I really want to know, as I believe that all modern societies move
34 “Wer meine Geschichte nicht glaubt, wird nicht verstehen, was mit Deutsch-
land los ist! Ohne mich ergibt alles keinen Sinn! Denn ich bin das Missing Link der
jüngsten deutschen Geschichte!” (Brussig, Helden, p. 323).
35 The novel is dealt with accordingly in the following studies and articles: Stephan
Brockmann, Literature and German Reunification, Cambridge 1999; Ulrike Bremer,
Versionen der Wende: Eine textanalytische Untersuchung erzählerischer Prosa junger deutscher
Autoren zur Wiedervereinigung, Osnabrück 2002; Stefan Neuhaus, Literatur und
nationale Einheit in Deutschland, Tübingen 2002; Markus Symmank, “Mutter-
sprache: Zu Thomas Brussigs Roman Helden wie wir”, in: Matthias Harder (ed.), Be-
standsaufnahmen: Deutschsprachige Literatur der neunziger Jahre aus interkultureller Sicht,
Würzburg 2001, pp. 177–194.
36 Rachel J. Halverson, “Comedic Bestseller or Insightful Satire: Taking the Inter-
view and Autobiography to Task in Thomas Brussig’s Helden wie wir”, in: Carol
Anne Costabile-Heming/Rachel J. Halverson/Kristie A. Foell (eds.), Textual
Responses to German Unification: Processing Historical and Social Change in Literature and
Film, Berlin/New York 2001, pp. 95–105, p.103.
Towards a Typology of Counterfactual Historical Novels 181
along these lines. As long as millions of failures do not face their failings they will
remain failures.37
As this paragraph as well as numerous others reveal, the intention of the novel
is a political one. The question it poses seems to be: How could a society that
was mostly at ease with the system under which it had to live suddenly turn
into a society of would-be resistance fighters? For Klaus Uhltzscht this is im-
possible and therefore the conclusion must be that there never were any Wall-
busting people. The novel functions as an instrument of criticism directed
against the dominant discourse about the German reunification, which, ac-
cording to Brussig, is based on an inappropriate image of the historical pro-
cess. Instead of merely stating a point of view Brussig uses his counterfactual
version as a method to deliver an interpretation of the real event.
37 “Eine völlig verzerrte Diskussion, und keiner merkt es! Wie konnte diese Gesells-
chaft Jahrzehnte existieren, wenn alle unzufrieden gewesen sein wollen? Mr. Kit-
zelstein, nehmen Sie meine Frage ernst, es ist keine rhetorische Frage! Alle waren
dagegen, und trotzdem waren sie integriert, haben mitgemacht, kleinmütig, ver-
blendet oder einfach nur dumm. Ich will das genau wissen, denn ich glaube,
daß sich alle modernen Gesellschaften in diesem Dilemma bewegen. Solange sich
Millionen Versager ihrem Versagen nicht stellen, werden sie Versager bleiben”
(Brussig, Helden, p. 312).
38 Cf. Stefan Zweig, “Der versiegelte Zug”, in: Stefan Zweig, Sternstunden der Mensch-
heit, Frankfurt a. M. 1998, pp. 240–252.
39 “Und hätte damals eines der prächtigen Automobile, die in scharfem Tempo von
Botschaft zu Botschaft sausen, diesen Mann durch einen Zufall auf der Straße zu
182 Andreas Martin Widmann
Tode gestoßen, auch die Welt würde ihn weder unter dem Namen Ulianow noch
unter jenem Lenins kennen” (Zweig, “Zug”, p. 242).
40 Cf. Christian Kracht, Ich werde hier sein im Sonnenschein und im Schatten, Cologne 2008.
41 “[…] die Geschichte des großen Eidgenossen Lenin, der, anstatt in einem plom-
bierten Zug in das zerfallende, verstrahlte Russland zurückzukehren, in der
Schweiz geblieben war, um dort nach Jahrzehnten des Krieges den Sowjet zu
gründen, in Zürich, Basel und Neu-Bern” (Kracht, Sonnenschein, p. 58).
42 “[…] byzantinische Verflechtung, die fast surreale Komplexität ihrer militärischen
Allianzen” (Kracht, Sonnenschein, p. 32).
43 “Der englische König, so hörten wir, hatte sich mit den Faschisten, den Deut-
schen, gegen uns verbündet, sie planten, ein dekadentes Großreich zu schaffen, in
dem wir Afrikaner Sklaven sein würden und sie die grinsenden Herren” (Kracht,
Sonnenschein, p. 59).
Towards a Typology of Counterfactual Historical Novels 183
reader approaching the text from the outside must be, naturally – can merely
get a few fragments of the whole picture. The narrator and protagonist is a
black soldier from Africa. He has been trained and educated in military acad-
emies, and he is now engaged in the fight for the communist cause. When he
receives the order to track down and kill another officer named Brashinsky,
because he has become insane, the protagonist embarks on a man-hunt. His
journey takes him through Switzerland and to the Communist Army’s head-
quarters in the Alps, which have been turned into a giant fortress. In the deep
tunnels of this mountain fortress he meets Brashinsky, but he does not com-
plete his mission. In the end, he abandons all ideologies, even language. He
ceases to speak and returns to Africa, where the new cities have all been de-
serted. Where technology used to reign, nature is taking over, and the Afri-
can people return to their villages.44
Kracht’s novel abounds with mostly unmarked intertextual allusions to
other works of literature. It borrows elements from other counterfactual his-
torical novels such as Thomas Pynchon’s Gravity’s Rainbow, whose famous
opening line is evoked at one point.45 Another telling allusion is made to
Philip K. Dick and his prototypical counterfactual novel The Man in the High
Castle when the narrator mentions a reference work on insects called “The
Grasshopper Lies Heavy”.46 The fable of the novel itself is highly reminis-
cent of Joseph Conrad’s Heart of Darkness but it is obviously inverted in Ich
44 “Ganze Städte wurden indes über Nacht verlassen, und ihre afrikanischen Ein-
wohner kehrten, einer stillen Völkerwanderung gleich, zurück in die Dörfer”
(Kracht, Sonnenschein, p. 148).
45 The first line of Pynchon’s novel refers to the sound of German V2 rockets that
could be heard only after their detonation: “A screaming comes across the sky”
(Thomas Pynchon, Gravity’s Rainbow, London 2000, p. 3). At one point Kracht’s
narrator observes: “High up in the sky the missile from a German long range
canon, coming from the north, was heading east with a hissing sound. Sometimes
they would fall down and hit our territory. It was pure chance, although one quite
naturally heard the impact first and only then the sound of the advancing rocket”.
(“Unendlich weit oben am Himmel sirrte das Geschoss einer deutschen Lang-
streckenkanone, von Norden kommend, nach Osten. Manchmal fielen sie herab
und schlugen bei uns ein. Es war reiner Zufall, wiewohl man natürlicherweise
erst den Einschlag wahrnahm und dann das Geräusch des sich nähernden Ge-
schosses” [Kracht, Sonnenschein, p. 21].) Notably, in this paragraph a dissolution of
the cause-and-effect relation that makes the protagonist’s present a nunc stans in
which the change of seasons no longer occurs is indicated.
46 Cf. Kracht, Sonnenschein, p. 68. Dick’s novel pictures a world in which Nazi Germany
and Imperial Japan have achieved world power after a World War II that lasted until
1948. As a novel within the novel, “The Grashopper Lies Heavy” tells a story about
how things could have turned out differently if Germany had lost the war.
184 Andreas Martin Widmann
werde hier sein im Sonnenschein und im Schatten, as the protagonist is a black man.
His journey takes him from Africa to Europe and to the heart of its ideologi-
cal blood circle, from which societies all over the world shall be transformed
for the better: “Racism did not exist, it should not exist”.47 Accordingly, the
servant has become the master in the new Swiss society, as Claude D. Conter
remarks.48
Kracht has for some time been considered a representative of a new aes-
theticism associated with a specifically German Pop-Literatur that emerged in
the 1990s. It was only after his third novel was published that scholars began
to seriously analyze his works.49 One of the merits of this new interest is that
some articles link Ich werde hier sein im Sonnenschein und im Schatten to Kracht’s
other books, especially to his novels. What several of these essays bring to the
fore is that Kracht’s habit to comment on history is anything but new. Even
his inclination towards the counterfactual could be detected very early, for
the title of his first novel Faserland alludes to Robert Harris’s best-selling thril-
ler Fatherland (1992), in which Nazi Germany continues to exist deep into the
1960s.50 In Ich werde hier sein im Sonnenschein und im Schatten the counterfactual
serves as a cognitive tool for Kracht. Like Brussig, he reacts to a widely ac-
cepted version of history. That image is hidden in an instructive passage
where the narrator finds the history of Switzerland carved into stone:51
I saw reliefs along the walls which told Swiss history in socialist-realist style, from
the beginnings of the wars against the Habsburgers and Burgunders, from the
47 “Es gab keinen Rassismus, sollte keinen geben” (Kracht, Sonnenschein, p. 59).
48 Cf. Claude D. Conter, “Christian Krachts posthistorische Ästhetik”, in: Johannes
Birgfeld/Claude D. Conter (eds.), Christian Kracht: Zu Leben und Werk, Cologne
2009, pp. 24–43, p. 37.
49 Although miscellaneous articles were published before, a collection of articles
edited by Johannes Birgfeld and Claude D. Conter is the first endeavor to discuss
Kracht’s works in an academic context.
50 Oliver Jahraus offers a convincing interpretation when he suggests that Kracht
uses the title as a means to point at the fact that Germany has in some way won the
war (cf. Oliver Jahraus, “Ästhetischer Fundamentalismus: Christian Krachts radi-
kale Erzählexperimente”, in: Birgfeld/Conter, Christian Kracht, pp. 24–43, p. 16).
51 Again, there is reason to assume that Kracht is using another work of literature as
a basis upon which he crafts his own text, this time Friedrich Dürrenmatt’s Der
Winterkrieg in Tibet, as Moritz Baßler points out. Dürrenmatt depicts a similarly
desolate world, with a Swiss government that is stuck inside a massif of rocks.
Moreover, this dystopian text features a hero who finally carves his own story into
labyrinth walls of a mountain fortress. Cf. Moritz Baßler, “Utopie und Apokalypse
im deutschsprachigen Gegenwartsroman (Kracht, Dath, Uschmann)”, paper
presented at the conference Utopie und Apokalypse in der Moderne: Internationales Sym-
posium der Universität Freiburg/CH , Fribourg, October 3, 2009.
Towards a Typology of Counterfactual Historical Novels 185
peasant rising on the mythical Rütli field – easily to be recognized was the old
Swiss salute that had been adapted by the SSR , the raised arm, two fingers held up
and a thumb, the holy oath to stand brave in battle – next the short period of the
Peace of Basel brought about by cowardly force, followed by the first expansion of
Switzerland’s heartland through its mercenaries all the way to Milan.52
By turning its history into a petrified stream of battle scenes Kracht presents
Switzerland as the opposite of what it is often seen as. The novel undermines
a national myth which explains Switzerland’s independence as a consequence
of its strong urge for freedom, for here it takes only one small step to turn
the politically neutral, peaceful island into an empire torn by wars and ideo-
logy. At the same time, the novel inverts the outsider’s, and especially Ger-
man, perspective on Switzerland, such as the one that Kracht employs in
Faserland, which seeks comfort in its alleged cleanliness and neatness.
Peter von Matt once argued that historical and philosophical speculations
are rare in Switzerland because the country was bound to vanish all too soon
in the course of such speculations.53 Kracht seems to prove him right in the
sense that in his fictional history Switzerland has disappeared, paradoxically
due to its expansion. The country’s legacy is the epitome of a noble idea gone
bad. The master-servant structure has not been overcome but in a misguided
attempt to achieve racial equality the world has been transformed in a dysto-
pian fashion. Kracht’s book is a playful postmodernist text but the play is not
only set in motion for its own sake. On a broader scale the apocalyptic vi-
sion54 in which the novel culminates can be read as the literary expression of
a deeply pessimistic philosophy of history.55 Thus, Kracht uses the counter-
52 “Ich sah an der Wand sich entlangziehende Reliefarbeiten, welche im Stil des so-
zialistischen Realismus die Geschichte der Schweiz erzählten, von den Anfängen
der Kriege gegen die Habsburger und Burgunder, vom Bauernaufstand auf der
mythischen Rütliwiese – gut zu erkennen war der alte, von der SSR übernommene
Schweizergruß, der hochgestreckte Arm, die erhobenen zwei Finger und der
Daumen, und der heilige Eid, sich fortan im Kriege zu bewähren, dann die kurze
Zeit des feig erzwungenen Friedens von Basel und die glücklich darauf folgende,
erste Expansion des Schweizer Kernlandes durch sein Söldnerheer bis ins italie-
nische Mailand” (Kracht, Sonnenschein, p. 101).
53 Cf. Peter von Matt, “Bilderkult und Bildersturm: Eine Zeitreise durch die literari-
sche und politische Schweiz”, in: Peter von Matt, Die tintenblauen Eidgenossen: Über
die literarische und politische Schweiz, Munich 2001, pp. 9–78, p. 57.
54 For a detailed discussion of Kracht’s treatment of the apocalypse motif and its
tradition, see Baßler, “Utopie”.
55 Conter recognizes the same “existentiell-fatalistische Geschichtsdeutung […],
laut der sich Geschichte lediglich wiederholt” in Kracht’s 1979 (Conter, “Ästhe-
tik”, p. 31).
186 Andreas Martin Widmann
Now that the basic patterns of their individual content and their intellec-
tual purpose have been outlined, the poetics of the counterfactual can be de-
scribed in a more general and abstract way by once more returning to E.M.
Forster’s Aspects of the Novel. Forster’s distinction between story and plot can
serve as an analytical model to classify the narrative strategies used in both
texts. If history is conceived as a narrative, the two novelists are operating on
different levels of that narrative in order to create their counterfactual ver-
sions of it. If the historical events mentioned in Brussig’s novel were reduced
to a series of abstract elements along a line, we would get a series like A, B, C,
D, and so on. Every letter along that line would correspond to an event out-
side the text. This series would be the story of the German reunification. If
we ask the question “And then?”, as Forster suggests, Brussig’s version does
not differ from the one that can be found in history textbooks. However, if
we ask “Why?”, the difference becomes obvious. In Helden wie wir the em-
phasis of the counterfactual narrative obviously is on causality. The deviation
from the facts is brought about by a counterfactual plot.56
downfall of the Berlin Wall. Brussig’s narrator explicitly establishes a direct link
between the counterfactual cause and the real event, thereby claiming the major
political event to be the direct outcome of the hero’s behavior.
57 Considering that in Roth’s novel World War II still ends with Germany’s defeat,
one might argue that in this case the difference between his and Brussig’s formal
pattern giving shape to the counterfactual is one of degree rather than of sorts.
Still, Roth rewrites the biographies of major historical and political figures, at-
tributing widespread effects to these alterations, whereas Brussig only inserts a
seemingly minor moment to the recorded historical events which leaves the chro-
nology of the established events unaffected.
188 Andreas Martin Widmann
VI. Conclusion
it. The basic facts the reader is supposed to be familiar with are usually in-
tegrated into the literary texts that present a counterfactual image related to
these facts. Brussig mentions the peaceful revolution of 1989 and playfully
rejects its impact. It is not the factual process of the German reunification to
which he objects but the particular attitude which glorifies the achievement
and conceals people’s failures that enabled the GDR system to run smoothly
for so long. In Kracht’s novel, the real path of history is presented as a road
not taken. In order to paint a different picture of Switzerland’s political heri-
tage, Kracht pursues that road for a while.
The author’s decision to either approach the links between the events
or the events themselves then requires different strategies of creating the
counterfactual image. Neither Brussig nor Kracht want to falsify history
in the sense that they would want their counterfactual versions to go unrec-
ognized. By asking what each work says about national self-images substan-
tial affinities between Helden wie wir and Ich werde hier sein im Sonnenschein und
im Schatten are revealed. Notably, Brussig and Kracht are both objecting to
interpretations of historical events that are put to use in modern societies.
They create counterfactual versions of history to put the established ver-
sions into perspective. Whereas every powerful official image of history as-
cribes inevitability to a historical development, these counterfactual repre-
sentations draw attention to chance and contingency. In doing so, they
abolish every attempt to make sense of what has happened within a frame-
work of predestination. Once notions of a weltgeist, whatever philosophical
or ideological shape it might take, are done away with, there is room to
negotiate individual concerns by telling counterfactual versions of history.
Thus, after all, and keeping in mind what happens in Kracht’s novel, it is
possible to draw on the initial quotation from David Lodge for a suitable
closing remark about counterfactuals: The sense of history becoming a fic-
tion when someone misses a train and starts a war is a philosophical one.
190 Birte Christ
I. Introduction
In Charlotte Perkins Gilman’s 1914 short story “If I Were a Man”, housewife
and “true woman” Molly Matthewson so longs to be a man that, one morn-
ing, she finds herself leaving the house in the shape of her own husband,
“with only enough subconscious memory of herself remaining to make her
recognize the differences”.1 Not only does Molly feel self-possessed, free,
and powerful for the first time, but through the gender role change she
can also recognize the world as “made, lived in, and seen, by men”.2 In other
words, she can recognize men’s assumptions about gender differences, wit-
ness the rituals of masculinity that reify these differences, and thus analyze
the mechanisms which uphold men’s dominance over women. As Gilman’s
title signals rather overtly, the story is based on a simple “counterfact” in the
manner of “if x, then y”. The “counterfact” in “If I Were a Man” is what may
be called “the counterfact of gender”: If Molly were a man, she would en-
gage in entirely different daily practices, would act in the world outside of
the home, and would experience herself and her position vis-à-vis men and
women differently. By contrasting this sense of self with her sense of self as
woman, she would then gain access to the recognition of the specific social
structures that determine her life as a woman.
The story illustrates, in a nutshell, two central aspects of the use of
counterfactuality in feminist fiction – in fictional texts, that is, which ex-
pose traditional gender roles as arbitrary, man-made, and disadvantageous to
women, which offer alternative visions of gender relations, and which call
the reader, in more or less didactical fashion, to political action. First, the
binary of “fact” versus “counterfact” is cast along the lines of the binary of
“male” versus “female”. Feminist thought recognizes the male gender as the
norm and the female gender as that which diverts from the norm – as the
1 Charlotte Perkins Gilman, “If I Were a Man”, in: Barbara Solomon (ed.), Herland
and Selected Short Stories of Charlotte Perkins Gilman, New York 1992, pp. 302–308,
pp. 302–303.
2 Gilman, “If I Were a Man”, p. 306.
Functions of the Counterfactual in Feminist Fiction 191
change – can go well beyond this analytic function which fiction shares with
the social sciences.4 Because counterfactual feminist fiction’s political aim is
to unsettle and transcend the binary of male versus female and the hierarchy
that comes with it, it also frequently works towards speculatively undoing the
very binary of fact versus counterfact which its narrative structure is predi-
cated upon. Feminist counterfactual fiction, then, strives towards socio-
political analysis in the sense of a denaturalization of the gender hierarchy,
yet, at the same time, strives towards socio-political synthesis: the denaturaliz-
ation of the gender hierarchy, effected through counterfactual devices such
as role reversal and gendered separatism, does not remain its “real-world”
goal. Ultimately, counterfactual feminist fiction aims to synthetize male and
female in order to abandon the binary and recast both male and female as
non-gendered human.
I will illustrate these two central forms of the use of counterfactuality in
feminist fiction – analysis and synthesis – by looking at four texts which are
almost canonical in discussions of speculative fiction by women. Two are
from the early twentieth century: Charlotte Perkins Gilman’s utopian novel
Herland (1915) and her story “If I Were a Man” (1914), which I have already
introduced and from which I have borrowed the title for this paper. Gil-
man’s writing, which includes other utopias such as Moving the Mountain
(1911), must be seen in two interrelated contexts: first, the primary emerg-
ence of feminist engagé fiction on a broad and popular scale – more than
thirty women writers, for example, published utopian novels between 1890
and 1920;5 and second, the growing power of the women’s movement which
won its first significant national battle, women’s suffrage, in 1921. The other
two texts were published about sixty years later in the context of second-
wave feminism, which, like the women’s movement in the beginning of the
century, saw a parallel burgeoning of speculative genres in the hands of
women writers: Sally Miller Gearhart’s The Wanderground (1979) and Joanna
Russ’s The Female Man (1975).6
However, I do not want to make a historical argument here. Rather, I will
use Herland and The Wanderground to illustrate the analytic function of the
counterfactual in feminist fiction, and “If I Were a Man” and The Female Man
to make my case that feminist fiction may go even further and employ the
counterfactual as a tool of synthesis. These two different uses of the counter-
factual, which call forth the reader’s participation in the text and her partici-
pation in political activism in different ways, thus cut across historical and
literary periods. That both uses of the counterfactual in feminist fiction
are relevant strategies in the 1910s as well as in the 1970s, and that both uses
share similar advantages and problems with regard to their feminist argu-
ment, must be due to the fact, I believe, that they are written against struc-
tures of gender inequality that, sixty years apart, differ in degree, but not
in kind. Counterfactuality, in the 1910s and in the 1970s, presumably also
shares in the affective functions of feminist fiction and contributes to mov-
ing readers towards taking political action based on their cognitive insights.7
In the following, however, I will not look at these counterfactual narratives’
possible real-world activation of readers, but will be concerned with what I
consider the two most salient cognitive functions of the counterfactual em-
ployed by them.
8 Charlotte Perkins Gilman, Herland, in: Solomon, Herland and Selected Short Stories,
pp. 1–146, p. 57.
Functions of the Counterfactual in Feminist Fiction 195
“soft”, and rejects rationality and technology, which are coded as “male” or
“hard”. The women of the wanderground, moreover, reject “male” histori-
ography and written cultures of memory; they use books as material to insu-
late their homes, they communicate through forms of telepathy, and create
social bonds through oral storytelling and through communal recollection
in so-called “remember rooms”. The narrative form of The Wanderground de-
parts from Herland’s classical utopian narrative and reflects the non-hier-
archical and anti-scriptural nature of its women’s society: It consists of a
loose series of episodes, told by a hetereodiegetic narrator in associative se-
quences, involving changing protagonists, thereby sketching the commu-
nity’s egalitarian life in the wanderground. Both Herland and The Wander-
ground assume – and problematically so from a non-essentialist perspective –
that if women were left alone for some time to establish their own ways of
organizing their public and private lives, this would result in societies entirely
different from, and, above all, “better” in comparison to the actual U.S. so-
ciety of the 1910s and 1970s. Both texts invert and contradict the male-de-
fined logic that structures actual society down to the smallest detail and can
thus be said to develop entire counterfactual worlds.
The reader’s analysis of the social status quo of the “actual” in these texts
is enabled by two “effects” which are also at work in everyday retrospective
counterfactual thinking. Roese and Morrison have described these effects as
the “contrast effect” and the “causal inference effect”.9 First, I want to focus
on how the contrast effect is actualized in different ways in Herland and The
Wanderground. One aspect that the texts share is that they imagine the worlds
in which gender difference is abolished and women shape society in terms of
a pastoral ideal, and they share this desire for the pastoral with almost all uto-
pian texts published since the age of industrialization. By contrasting a uto-
pian world in which the relationship between humans and nature is sym-
biotic and caring rather than exploitative, they expose and critique real-world
social practices.
Herland makes the contrast between the counterfactual, utopian order and
the contemporary, actual order explicit through its reliance on the classical
utopian traveler figure, the narrator Jennings, who represents actual-world
assumptions in the counterfactual world and serves as the reader’s figure
of identification with whom she discovers the utopian world. Moreover,
the novel uses multiple visitor figures – three American men, who discuss
their observations of Herland with three native cicerones and compare these
observations with their knowledge of the United States in extended dia-
10 Of course, as Johnston has argued, the multiplied visitor figure also allows the text
to represent three male characters which stand for three proto- or stereotypes of
masculinity and to expose their different, but equally problematic attitudes to-
wards women (cf. Georgia Johnston, “Three Men in Herland: Why They Enter the
Text”, in: Lise Leibacher-Ouvrard/Nicholas D. Smith [eds.], Utopian Studies IV,
Lanham 1990, pp. 55–59).
11 Gilman, Herland, pp. 80–81.
12 Gilman, Herland, p. 81; my emphasis.
13 Gilman, Herland, p. 81.
Functions of the Counterfactual in Feminist Fiction 197
own and hence about the actual world’s system also prescribes her emotional
attitudes toward the actual world, which may, as I have suggested above,
function as a trigger towards political action.
The Wanderground, by contrast, is very much a text of its time in that it does
away with the classical utopian traveler-and-cicerone constellation and its co-
lonial and heteronormative implications which, though ironically inversed,
centrally structure Herland. In consequence, the actual world against which
The Wanderground places its upward counterfactual remains outside of the
text. Here, it is the reader who must cognitively substantiate the contrast be-
tween the counterfactual world and her own actual world. She is not invited
to identify with characters who experience the contrast effect themselves.
The following scene, for example, relates how the character Alaka has just
swum through a cave filled with water and shaken the water out of her hair:
“I will warm you,” she heard. Laughing, she turned to the tree. Gently she laid her-
self against the heavy bark, spreading her legs and arms about the big trunk.
“I take when you give,” said the tree.
“I know,” she said. “And I take when you give.” She inhaled slowly, pressing her
viscera against the tree. As she released her breath, the trunk pushed against her.
Slowly, with no visible motion, the two set a rhythm of pneuma exchange. Alaka’s
trousers became dry and warm. So did even her soft shirt, a chamois given her by
Olu, her long-loved antelope when Olu changed form. Her hair swung free now
and dry.14
The scene represents a moment of social interaction in a counterfactual so-
ciety of which plants and animals are an integral part and in which women,
plants, and animals support each other mutually. Alaka’s rhythmic exchange
of “pneuma” with the tree quite obviously re-conceptualizes sexuality as a
life-giving form of interaction between equals, and nurturing as an essentially
physical or embodied capacity is stressed by Alaka’s virtually inhabiting the
inside of her friend’s (antelope) skin. The actual world’s exploitative or, at
best, non-existent relationship with nature and trees in particular exists only
as a foil in the reader’s mind; the way in which Alaka’s relationship with the
tree and her antelope-friend should be evaluated is left uncommented upon,
except for the fact that the reader might identify with the character and ex-
perience her well-being within these relationships vicariously, and hence
as something positive. One might argue that when the reader – over long
stretches of the narrative – is left to experience the contrast between
counterfactual and actual practices herself, without explicit commentary on
the part of the text, the contrast might be more effective didactically. At least
this might be the case in historical contexts in which overt didacticism in lit-
erature is not well-received either by readers or critics. The recognition of
contrast becomes the reader’s own cognitive achievement rather than the
text’s, and, if the text is successful, her evaluation of the hill women’s prac-
tices as cases of upward counterfactuality triggers her independent – rather
than textually prescribed – critical perspective of her own society.
The contrast effect which these feminist counterfactual fictions achieve,
then, primarily contributes to criticizing concrete social practices. Both the
text-internal prescription of the practice of composting as better than actual-
world practices in Herland and the reader’s potential positive, text-external
evaluation of Alaka’s mutually nurturing relationship with nature in The Wan-
derground, I would argue, prepare a desire for the “causal inference effect”: a
desire to understand the causal relationship between the fact that Herland
and the hill women’s society are governed by women and the fact that they
have developed environmentally more sustainable systems of life and more
nurturing relationships than the actual world, governed by men, has.
The “causal inferences” which the counterfactual allows the reader to
make in feminist texts are insights into the naturalized social “logic” that is
the cause and basis of hierarchical gender relations. The following excerpt
from Herland is rather straightforward in its exposure of a supposedly “natu-
ral” hierarchy of the sexes and the social structures that arise from such a
hierarchy. It is a dialogue between the male visitors and their three cicerones
that is sparked by the visitors’ observation that among Herlanders, there is
no economic competition. In spite of the fact that the men experience Her-
land as a society in which every woman is well off, the men’s implied assump-
tion is, nevertheless, that without economic competition there cannot be any
material prosperity. Clinging to their own notions of how the world must
work, they try to defend their perspective on economics by arguing that
without competition, there is no stimulus to industry:
“Stimulus? To Industry? But don’t you like to work?”
“No man would work unless he had to,” Terry declared.
“Oh, no man! You mean that is one of your sex distinctions?”
“No, indeed!” he said hastily. “No one, I mean, man or woman, would work
without incentive. […]”
“[…] Do you mean, for instance, that with you no mother would work for her
children without the stimulus of competition?”
No, he admitted that he did not mean that […] but the world’s work was dif-
ferent – that had to be done by men, and required the competitive element.15
Read alongside the scene in which Alaka engages in a loving and nurturing
version of sexual intercourse with the tree, this passage reinforces, text-
internally, the contrast effect which the reader was left to experience text-
externally in the earlier passage, and hence makes unmistakably clear that ac-
tual-world social practices are a part of men’s overall domination and violent
subjugation of women. However, the passage also allows for causal inference
of why and how the women’s separatist society was established – and serves
as a call to real-world political action at the same time.
Sexual violence against women, which is equated with pornography and
sexual practices between men and women that, from a feminist perspective,
may be seen to underwrite women’s subservience not only in the sexual
act, is represented as the central outrage around which the “Revolt of the
Mother” gained momentum and led to the foundation of the hill women’s
separatist society. Here, the counterfactual scenario of The Wanderground
allows for the reader’s inference of what constitutes the foundations of male
dominance – violence that is represented as part and parcel of heteronor-
mative sexuality – and hence, for an understanding of where feminist resis-
tance must begin. The inference that can be made through the text’s projec-
tion of a counterfactual world and a counterfactual history of this world’s
origins is that programmatic lesbianism is the only measure that can get to
the roots of women’s oppression. The causal inference effect in The Wander-
ground, then, not only contributes to a cognitive denaturalization of actual-
world structures but suggests specific strategies – such as a programmatic
lesbianism – to replace these structures.
As the examples of Herland and The Wanderground have shown, the
“contrast effect” of utopian and real, of counterfactual and actual, is used
in order to critique specific social practices, while the “causal inference
effect” is used to expose mechanisms of naturalization structuring the ac-
tual world, and, as is the case in The Wanderground, can also contribute to pin-
pointing the exact strategies by which these mechanisms may potentially be
undone.
The titles of Gilman’s and Gearhart’s novels, Herland and The Wander-
ground, signal from the start that they set out to develop complete utopian,
or counterfactual, worlds. The titles “If I Were a Man” and The Female
Man, by contrast, signal that they engage in a different kind of counterfac-
tuality, which Hilary Dannenberg calls “characteriological counterfactual-
202 Birte Christ
18 Hilary Dannenberg, Coincidence and Counterfactuality: Plotting Time and Space in Nar-
rative Fiction, Lincoln 2008, pp. 120–122.
19 Gilman, “If I Were a Man”, p. 303.
20 Gilman, “If I Were a Man”, p. 306.
Functions of the Counterfactual in Feminist Fiction 203
thoughts can become central to the narrative either because she is a central
reflector figure, as in “If I Were a Man”, or because she is an autodiegetic
narrator, as in The Female Man.
What is more, “If I Were a Man” synthetizes gendered perspectives in a
way which opens up spaces for change. As the story progresses, Molly’s and
the reader’s newly acquired, counterfactual “male” view of the world be-
come integrated into both the character’s and the reader’s consciousness as
women – but as women who have become aware of their own disadvantaged
position in the world and have begun to understand the mechanisms through
which the dominance of men over women is upheld. The counterfactual
“male” perspective ceases to be counterfactual because it becomes part of
the actual perspective of the emancipated female character and reader. Molly,
and with her the female reader, take on the counterfactual male gender but
retain “enough subconscious memory of [themselves] to […] recognize the
differences”, and thus embody and render conspicuous the female conun-
drum of seeing the world from a female “emancipated” perspective. At the
same time, however, they remain aware of not being able to inhabit the male
one.24 The counterfactual and the actual that become reified simultaneously
within one character can be read as an expression of what feminist critics
have described as “double place of woman”,25 or as women’s “double-
voiced”26 or “palimpsestic”27 subjectivity, drawing on W. E. B. Du Bois’s con-
cept of the “double consciousness” of racially oppressed social groups.28
In feminist and critical race theory, this double or split identity of the op-
pressed is conceived of as being highly problematic for the individual. In Gil-
man’s story, by contrast, the convergence of the actual and the counterfactual
gendered perspective in one character not only models double conscious-
ness as the predicament of the emancipated woman, but at the same time
represents the reader with a counterfactual, productive notion of double
consciousness. Towards the end of the story, Molly’s feminist perspective on
the world – which she has gained by looking at the world through male eyes –
begins to “infect” her husband Gerald’s consciousness and, more import-
antly, his/her actions with her/his new “emancipated” double conscious-
ness. When Molly-as-Gerald looks around the streetcar at his/her fellow
passengers, s/he admires the becoming hats and suits of the gentlemen, and
mentally remarks about women’s hats and fashions: “Never in all her life had
she imagined that this idolized millinery could look, for those who paid for
it, like the decorations of an insane monkey”.29 Obviously, this judgment of
the women’s looks undercuts the dynamics of the desiring gaze of the male,
which one might expect Molly to experience in her husband’s counterfactual
skin: It represents a view on women’s fashion held by female activists for dress
reform. It is what women activists wish men – estranged from their acquired
habit of finding exactly this kind of fashion attractive – would eventually
come to think about women’s fashion and thus support their reform agenda.
Gerald’s male view of women, complemented by Molly’s new feminist
sense of the world, becomes the basis for potential social change within the
story. In a casual conversation with other men on the train, Gerald/Molly ex-
claims, in his/her two-way counterfactual character:
“Women are pretty much people, it seems to me. I know they dress like fools – but
who’s to blame for that? We invent all those idiotic hats of theirs […] and what’s
more, if a woman is courageous enough to wear common-sense clothes – and
shoes – which of us wants to dance with her?”30
Gerald/Molly lets the men on the train participate in his/her “double vi-
sion” of gender roles and thus lays open one of the many structures of
women’s oppression and men’s role in it. The story suggests that if both
sexes mentally occupied and synthetized both gendered perspectives, there
would indeed be potential for moving beyond the gender binary – individ-
ually as well as systemically.31
32 Joanna Russ, The Female Man, Boston 1986, p. 138, emphasis in the original.
33 Russ, Female Man, p. 133.
34 DuPlessis has argued that all four protagonists – Joanna, Janet, Jael, and Jeannine –
might be understood as “female men” and elucidate the contradictions of gender
in different ways (cf. Rachel Blau DuPlessis, “The Feminist Apologues of Lessing,
Piercy, and Russ”, in: Frontiers IV, 1979, pp. 1–8, pp. 6–7).
35 Besides Joanna in the fictional “actual” world of The Female Man, there are Janet in
a future utopian separatist world, Jael in a future dystopian world of a “cold war”
between the sexes, and Jeannine within an alternate history of gender relations,
in which the women’s movement has not taken place. Cf. Cortiel for a reading
of the text as “a postmodern science fiction novel that strategically interlaces four
distinct genres – Utopia, science fiction, alternative history, and ‘main-
stream’ postmodern autobiographical writing” (Jeanne Cortiel, “Joanna Russ: The
Female Man”, in: David Seed [ed.], A Companion to Science Fiction, London 2005,
pp. 500–511, p. 501).
Functions of the Counterfactual in Feminist Fiction 207
While Joanna claims to have turned into a man, and thereby takes pos-
session of her own humanness, the resultant subject position can yet be con-
sidered one that synthetizes “male” and “female”, or fact and counterfact –
as the novel’s title suggests. What is described here as “turning into a man”
must be considered as a radical reversal of the gendered cognitive frames
that govern the protagonist’s recognition of herself and of the world. That
the acceptance or shedding of gender is a dynamic of consciousness, rather
than of embodiment, is stressed in a passage in which Joanna addresses men
and affirms her manhood:
If we are all Mankind, it follows to my interested and righteous and right now very
bright and beady little eyes, that I too am a Man and not at all a Woman. […] I
think I am a Man; I think you had better call me a Man; I think you will write about
me as a Man from now on and speak of me as a Man and employ me as a Man and
recognize child-rearing as Man’s business; you will think of me as a Man and treat
me as a Man until it enters your muddled, terrified, preposterous, nine-tenths-
fake, loveless, papier-mâché-bull-moose head that I am a Man.41
At first sight, this passage may be most striking because of its parody of
men’s patronizing and belittling images of intellectually ambitious women
and its self-confident and aggressive assertion of the narrator’s new male
identity. However, it also insists on the central importance of what men and
women “think” and what is in their “heads” – in other words, on whether
they cognitively structure the world along a gender binary or not. The female
Joanna demands to be re-cognized as and re-cognizes herself as a man.
In The Female Man, “man’s” consciousness becomes embodied in a “fe-
male” one. By thus synthetizing cognitively and physically reified gender,
Joanna’s assumption of the male, counterfactual gender ultimately does away
with “male” and “female”: It is the consequence of recognizing the male as
the unmarked, as the universal – and of claiming the universal, instead of
the particular, for one’s female self. The employment of the counterfactual is
here, in its final consequence, used to deconstruct the border between
the actual and the counterfactual, the border between men and women.
A woman claims that which is counterfactual to her existence: the univer-
sal. In a symbolic feminist appropriation of male power such as in The Female
Man, then,42 the counterfactual does not remain pitted against the actual
world or actual character as an analytic instrument, but it turns into and re-
places the actual completely. Counterfactual fictional gender becomes one
with the actual fictional gender of the protagonist. By transforming the ac-
tual into the counterfactual, the text fulfils the utopia of doing away with the
gender binary and thus models ways for doing the same in our actual, non-
fictional world.
Feminist fiction, then, can be said to employ counterfactuality both as an
analytic and as a synthetic. It exploits basic benefits of counterfactual specu-
lation identified in studies of psychology, namely the contrast and causal
inference effects of counterfactual thinking, in a rather straightforward way,
as my analyses of Herland and The Wanderground have shown. The political
impetus behind the narrative is not the desire to replace the actual with the
counterfactual scenario in the real world, but to critique its structures and
institutions and to move the reader to change them. In characteriological
counterfactual gender narratives, the counterfactual and the actual, as we
have seen, can and do collapse as they denaturalize and eventually do away
with the gender binary.
IV. Afterthought:
Utopian Fiction and the Binary of the Real and Ideal
In its closing passage, The Female Man is very explicit about the effect it wants
to achieve in the actual world. The author-as-character Joanna, in this closing
passage, says “goodbye” to her book:
Go, little book, trot through Texas and Vermont and Alaska and Maryland and
Washington and Florida and Canada and England and France; bob a curtsey at the
shrines of Friedan, Millet, Greer, Firestone, and all the rest; […] do not mutter
angrily to yourself when young persons read you to hrooch and hrch and guffaw,
wondering what the dickens you were all about. Do not get glum, when you are no
longer understood, little book. […]
Rejoice, little book!
For on that day, we will be free.43
The author-as-character projects a future world in which the text itself will
be left without a purpose, as the forms of female oppression which it addresses
no longer exist. One might argue that the feminist necessity of merging the
counterfactual with the actual, of merging male and female in such ways that
they cannot be differentiated any longer, is staged on a metadiscursive level
here. The passage stresses the synthetizing power of the counterfactual and –
metafictionally spoken – of the fictional. The fictional imagination of alter-
natives and ideal states of being, it is implied, is – by virtue of its power in
the real world – capable of changing the real, of transforming it into a better
world. The text here re-casts the gender binary, and more concretely the bi-
nary between being a woman and being a man/human, as a binary between
the real and the fictional. The fictional – the counterfactual fantasy that
women may claim the male/human – is, of course, also the ideal world that
the text is promoting. In other words, the binary between counterfactual and
actual can be conceived of as the binary of the ideal and the real.
The genre that is most overtly based on the opposition between an ideal
and the real world is the literary utopia. While I have focused here on the way
in which the gender binary structures uses of the actual and counterfactual
in feminist fiction, which included feminist appropriations and post-modern
negotiations of the literary utopia, I would like to point out in closing that
the utopian genre should prove similarly fruitful for interrogations into lit-
erary uses of the counterfactual. Surprisingly, however, the classic literary
utopia in the tradition of Thomas More and, more widely framed, specu-
lative fictions with a utopian impetus have so far not been explored from
the perspective of counterfactuality.44 Hilary Dannenberg’s Coincidence and
Counterfactuality, currently the most comprehensive study on counterfactual-
ity in fiction, discusses alternate histories, historiographic metafiction, and
postmodern fiction in general as genres or modes that employ counter-
factuality.45 Utopian fiction, by contrast, is not even mentioned in this study
although it can be categorized as a counterfactual genre par excellence and, in
terms of form and function, is closely linked to the genre of alternate his-
tory.46 I can only speculate that this lack of interest in the literary utopia may
44 Studies on the literary utopia which employ the terminology of the “alternative”
may foreground the utopian opposition of mundus idem and alter mundus and thus
implicitly deal with issues similar to those a perspective on counterfactuality may
be concerned with, but this is neither done explicitly nor with a focus on the struc-
tural problematics of the binary. Cf., for example, Manfred Pfister (ed.), Alternative
Welten, Munich 1982; Derek Littlewood/Peter Stockwell (eds.), Impossibility Fiction:
Alternativity – Extrapolation – Speculation, Rodopi 1996. Philosophical studies on
utopian thinking rather than writing also tend to address structural binaries similar
to that of “fact” and “counterfact” implicitly (cf., for example, Michele Ciliberto’s
recent study on Machiavelli and Bruno: Pensare per contrari: Disincanto e utopia del
Rinascimento, Fiesole 2006).
45 Cf. Dannenberg, Coincidence and Counterfactuality, pp. 116–118.
46 In addition to Dannenberg, Geoffrey Winthrop-Young has recently discussed al-
ternate histories as counterfactual narratives in: “Fallacies and Thresholds: Notes
Functions of the Counterfactual in Feminist Fiction 211
be due to the fact that utopian writing takes the form of the apologue. As
DuPlessis has shown, “character and plot [in the apologue] function
mainly as bearers of philosophical propositions or moral arguments”,47 and
Bammer has called utopian apologues “fictions without a protagonist and
barely a plot”.48 Inquiries into the counterfactual in fiction such as Dannen-
berg’s, however, tend to center their analyses on the category of plot, which
may explain why utopian fiction has so far not fallen into their ken. Here is
an entire literary genre which may be read afresh from the perspective of
studies in counterfactuality; and here is an immense corpus of literary texts
that may elucidate functions of the counterfactual in philosophy, psychology,
political discourse, and a host of other fields.
I. Introduction
ability, and co-tenability of the counterfactual may vary greatly, but, as Ri-
chard Lebow argues, it is not always the realism which determines the use-
fulness of a counterfactual, it can also be the analytical utility.2 Miracle
counterfactuals, for example, may “help us work through moral and
scholarly problems”.3 In general, the usefulness of counterfactuals is not
necessarily dependent on their likelihood if we follow Lebow’s argument
that they can “reveal contradictions in our belief systems”, “highlight double
standards in our moral judgments”, and, most importantly for this essay, that
they “can combat the deeply rooted human propensity to see the future as
being more contingent than the past”.4 In effect, one could here replace the
word “counterfactual” with “time travel narratives”.
The question of “what would have happened if ” that is at the heart of
counterfactuals is also at the heart of time travel narratives in literary worlds.
As noted above, in a very basic sense literary worlds are always counterfac-
tual: they create their worlds through their utterances. They also generally in-
voke the conditional “what if ” by their specific complex referential relation
to the natural world and by their appeal to the reader: not being factual, they
ask of the reader to follow their thought experiment of a possible world.5
In other words, literary worlds are premised on counterfactuals and condi-
tionals both on an extradiegetic (via reference to the natural world) and in-
tradiegetic (via reference within the fictional world) level. If much of the ap-
peal of literature derives from this unique relational complexity, then the
popularity of time travel narratives in literature should not surprise. Time
travel as a theme in literary worlds offers a perfect fictional stage for explor-
ing counterfactuals. It complicates the conditionals and counterfactuals in-
herent in literary worlds by allowing for the malleability of time and history
through the miracle counterfactual of time travel,6 exploring alternative his-
tories and futures, and the attendant paradoxes. Via fictional time travel nar-
ratives, we can travel to far-away places and times, we can change things in
the past, and know more about the future. Time travel toys with contingency
and determinacy, raising the question of the malleability and inescapability of
temporality. And, by implication, time travel fiction is also about causality:
when we go to the past, at least some part of our future life is known to us.
We can then correct decisions we consider wrong in hindsight.
In what follows I will analyze time travel narratives with regard to the
counterfactuals they employ and the attendant conceptions of time, history,
and individual agency. The apparent practical impossibility of time travel
notwithstanding, I will assume that time travel narratives and the paradoxes
they invoke have an “expressive power of [their] own” and shed “light on
[crucial] aspects of human experience” that seemingly more “natural” or
“rational” narratives might not adequately grasp.7
Time travel narratives break with four intuitive and common sense
axioms that, as Marie-Laure Ryan argues, we have about time: (1) time flows
in a fixed direction with a relatively stable speed; (2) you cannot go back in
time against this flow; (3) causes precede their effects; (4) the past is un-
changeable.8 Consequently, narratives that subvert one or more of these
axioms are almost inevitably situated in the realm of the physically and logi-
cally impossible by readers. The latter may then employ a variety of reading
strategies to come to terms with these impossible scenarios, e.g. by reading
them as an indication of genre (science fiction), or as the hallucination of the
narrator. It follows that the specific evaluation of time travel narratives is sig-
nificantly shaped by our knowledge about narrative techniques, generic con-
ventions, and physics, and, closely related, by our assumptions about time.
This in turn shapes our assessment of the counterfactual thought experi-
ments we encounter in these narratives with respect to plausibility and prob-
ability. If our knowledge about the physical properties of time changes to the
degree that we realize our intuitions and experiences about time to be mis-
leading, some previously unlikely time travel scenarios might move closer to
the realm of the possible, altering the degree of departure of the possible
world from the actual one.9 This possibility is not to be underestimated if
one recalls one of Arthur C. Clarke’s famous “three laws” which states that
any technology advanced enough will look like magic to us. Especially re-
garding the four axioms listed above, modern physics and the rather bizarre
consequences of the – experimentally verified – propositions of quantum
mechanics and relativity have already given the lie to many of our intuitive as-
sumptions about time. First of all, it does not flow, nor does it have a speed,
as that would be measured by distance over time; also, the division between
past, present, and future is arbitrary, the future not being any more malleable
than the past.10 Second, under certain conditions, and with the right experi-
mental setup, it is possible to demonstrate that events in the present can de-
termine the past.11
with time. Joe Haldeman’s novel The Forever War is one of the few narratives to
make use of the only kind of real time travel possible already today: time dilation
(cf. The Forever War, New York 2009).
10 One of the consequences of Einstein’s special and general theory of relativity is
that we should actually conceive of all of time as a kind of “bread loaf ”. All of time
is continuously extant: past, present, and future. In fact, the distinction between
past, present, and future is physically untenable and seems to be something that
only exists in our minds. In this conception of time, the past and the future are al-
ways already present and unchanging. What we consider the future is already past
from another perspective. The “bread loaf ” metaphor has become standard in
most popularized discussions of the physics of time, possibly because it serves to
illustrate how one may cut different “slices” of time depending on the angle of the
cut. It is this angle that determines which events belong to the slice then called
“present”. Cf. Brian Greene, The Fabric of the Cosmos, New York 2005, pp. 127–142.
11 This has been shown by John Wheeler in his so-called delayed-choice experiment.
A mind-boggling interpretation of this is Richard Feynman’s sum over histories ap-
proach to quantum mechanics. It claims that the present is an average of all pos-
sible pasts; and thus the past is quite literally contingent on the present. Not less
astounding is the interpretation put forward in 1957 by Hugh Everett, a student of
Wheeler, that has since become known as the many worlds interpretation. For every
possible alternative path of any particle, another universe “splits off ”. This idea
was subsequently used by David Deutsch (cf. The Fabric of Reality, New York 1997;
David Deutsch/Michael Lockwood, “The Quantum Physics of Time Travel”, in:
Scientific American, 3/1994, pp. 50–56) for theorizing time travel as travel in parallel
universes. One will find this in numerous contemporary time travel fictions, a fa-
mous one being Michael Crichton’s Timeline. For a comprehensive introduction to
the physics and technology of time travel, cf. Paul Nahin, Time Machines, New York
1999.
216 Rüdiger Heinze
The idea of traveling through time one way or another of course precedes
modern physics, and time travel narratives have been extant for centuries.12
An early example is the legend of the Monk of Heisterbach, who follows a
bird in a forest and listens to it. When he returns after what seems to him
only hours, many years have passed. However, it is only since the late 19th
century, especially with the publication of H.G. Wells’s The Time Machine in
1895,13 that time travel narratives have become widely popular, predomi-
nantly in the genre of science fiction. In order to facilitate a systematic analy-
sis of time travel literature and its featured counterfactuals, we need to con-
sider three key aspects of all time travel narratives: most importantly (1) the
time and place of travel, because this determines the entire counterfactual,
i.e. the specification of and (intradiegetic) relation between what is present
and what is past, and the (extradiegetic) relation of the fictional to the actual
world; (2) the manner of travel, because it influences the reader’s assessment
of genre and the attending assessment of plausibility, as well as the relevance
of the process of the time travel itself; and (3) the conception of time under-
lying both, because this shapes the specific projection of temporal coherence
and causality and thus issues of agency and determinacy.
The destination of travel has to be examined not only in terms of time and
place but also in terms of logic, as in “counterfactual compared to what”? In
some time travel narratives, the history that is – albeit fictionally – changed
is the history of the actual world, whereas some places and times travelled to
have no referenced factual counterpart in the actual world, and thus are, in
Saint-Gelais’s term, counterfictional, for example mythical pasts, or alter-
native universes which work with differing degrees of departure from the ac-
tual world. This is important for narratives like Mark Twain’s A Connecticut
Yankee at King Arthur’s Court,14 Time Bandits,15 or the Back to the Future series.
So, while it would seem at first glance that one could travel only into the fu-
ture or the past respective to one’s own time, in effect one also has to con-
12 For a relatively comprehensive list and summary of time travel narratives, albeit
with little analytic insight, cf. Ekkehard Böhm, “Der Großvater im Wurmloch”,
in: die horen, 50/2005, pp. 85–98. Andrew Gordon provides an excellent discussion
of a number of time travel films (cf. “Back to the Future: Oedipus as Time
Traveller”, in: Sean Redmond [ed.], Liquid Metal: The Science Fiction Film Reader,
London/New York 2004, pp. 116–125).
13 Cf. Herbert George Wells, The Time Machine, New York 2003.
14 Cf. Mark Twain, A Connecticut Yankee in King Arthur’s Court, New York 1998.
15 Cf. Terry Gilliam (dir.), Time Bandits, HandMade Films 1981.
Time Travel and Counterfactuality in Literature and Film 217
sider exactly what time and kind of world one leaves and exactly what time
and kind of world one travels to.
The manner of travel can be described in terms of the control the traveler
has over it, and by its precise mechanism. It can be technological (e. g. car-like
machines, space ships, telephone booths), esoteric (e. g. by hypnosis or long
sleep), biological (e. g. by a genetic defect), or circumstantial (e. g. by falling
through a hole, or a knock on the head).
Lastly, and most complicated, we have to ask which conception of time is
evoked. With few exceptions, most time travel fictions work with the “bread
loaf ” conception of time in the direction of the past, but are often inconsist-
ent in the direction of the future. They occasionally even change their con-
ception of time during the course of the narrative. Some mix conceptions of
time, some do not seem to care at all or purposefully flout even a pretense
at logic and consistency. One can roughly distinguish four types – fictional
worlds which are:
(1) mostly coherent or homogeneous. They present one intradiegetic
world with a relatively stable degree of departure from the actual world. This
would be true of The Time Machine or The Time Traveler’s Wife.16 In the latter,
the traveler stays roughly in the same region, travels within a limited time in-
terval of about forty years, and the respective past, present, and future are
all located on one timeline with a stable degree of departure from the actual
world.
(2) mostly incoherent or heterogeneous. They intradiegetically mix dif-
ferent worlds with different degrees of departure from the actual world. This
would be true of Bill and Ted’s Excellent Adventure17 or Time Bandits. Here, the
protagonists travel not only to a historical past relatively close to the actual
world, but also to the mythical past of the Minotaur and the past of legends
and giants. Towards the end, they even travel to a place of evil beyond any
specific location and time.
(3) mostly consistent or complete. They follow one conception of tem-
porality. This applies to Twelve Monkeys18 or The Time Traveler’s Wife. In the
former, past, present, and future are located on one timeline with a stable de-
16 Cf. Audrey Niffenegger, The Time Traveler’s Wife, London 2005. And for the film
see: Robert Schwentke (dir.), The Time Traveler’s Wife, New Line Cinema 2009.
17 Cf. Stephen Herek (dir.), Bill and Ted’s Excellent Adventure, De Laurentiis 1989.
18 Cf. Terry Gilliam (dir.), Twelve Monkeys, Universal 1995. The film is based on an-
other interesting time travel scenario in the French film La Jetée by Chris Marker
(La Jetée, Argos Films 1962). Here, time travel is possible via the mind. The physi-
cal body remains in the respective present, and yet the traveler also has a body in
the past, a kind of avatar.
218 Rüdiger Heinze
gree of departure from the actual world, and the conception of time evoked
is consistent (“bread loaf ”) and thus entirely deterministic. The protagonist
temporarily fosters the illusion that he might be able to change the past in
order to prevent the near extinction of humanity; however, in the end, every-
thing happens “according to plan” – time cannot be changed. Even cinema-
tographically, the film goes to great lengths to emphasize this “bread loaf ”
conception of time: repeatedly, images from the past and future overlap.
(4) mostly inconsistent or incomplete. They mix different conceptions of
temporality. This applies to Slaughterhouse-Five (where consistency and logic
in general are hard to come by, to a wonderful satiric effect), or most notice-
ably to the Back to the Future series. The second part introduces the concept
of different timelines to explain why the protagonist cannot travel back from
a trip to the future to a certain past once events have been changed prior to
that past. Most of the action consists of the attempt to set straight this diver-
gence of time lines. However, only the timelines considered to be bad by the
hero are eliminated; the changes effected in the first and third part that are
to the advantage of the hero are never reversed, nor is there any mention of
timelines then. The films seem to mix bread loaf and divergence conceptions
of timelines, depending on which is dramaturgically needed at a given in-
stance.
On the whole, the manner of travel, the time and place of travel, and the
conception of time in time travel narratives have serious consequences for
their ideological investment and the precise nature of their counterfactual
ruminations. To exemplify this, I will now discuss three of the best-known
and most influential time travel narratives in order to present a brief diach-
ronic survey of how differently conceptions of temporality can be negoti-
ated. The texts have also been selected because they employ three very dif-
ferent counterfactual scenarios and different combinations of “what would
have happened if ”, “what would happen if ”, and “what will happen if ”.
19 Please recall that this inelegant double construction is due to the fact that the
grammatical mood of the counterfactual depends on whether the deixis reference
level is intra- or extradiegetic.
220 Rüdiger Heinze
rous knights, and satirizes the then virulent popular belief in the advance-
ment of humanity through technology, in historical progress, and the – often
eugenic – justification of colonialism, and in moral improvement through
free enterprise. All of these are lead ad absurdum and revealed as incongruent
and inconsistent by Hank Morgan when he, as the paragon of these ideas and
values, attempts to implement them in Arthurian England. The failure, how-
ever, is not primarily due to the incorrigibility of Morgan’s 6th-century con-
temporaries but to Morgan himself, as he shows the bloody, immoral, but
unavoidable underside of the values he cherishes. As with most time travel
narratives, this one too is more about the present from which it arises than
about the past it purports to present. The power of the counterfactual in this
case derives from the pervasive incongruity of the narrative, which shows
itself to be ideal for disclosing the incongruity of many of the ideas prevalent
at the time Twain wrote the novel.
due to inactivity and lacking evolutionary fitness, while the working classes
will live in subterranean caves that seem to be very similar to 19th-century
factories and feed off the “cattle”. In some ways, the intradiegetic counter-
factual actually functions predictively as to what will happen if humanity
continues on its path. As improbable and implausible as the counterfac-
tual might seem, the prediction of the narrative is lent credibility through a
number of strategies. First, this is the original instance of a time traveling ma-
chine, which is given plausibility through a careful explanation of its mechan-
ism and a consistent-sounding theory of time. In his careful and detailed ob-
servations, reasoned deductions and hypotheses, the time traveler appears as
a scientist. As Firchow points out, his descriptions scrupulously follow the
rules about “How a Scientist Thinks” set down by T. H. Huxley – a teacher of
Wells – in 1866.20 This style gives the narrative an air of scientism and ob-
jectivity and thereby “appeals to a recently created audience which […] ex-
pected its fictions to be at least as technologically sophisticated as the articles
on technology and science in its newspapers”.21
In addition, because the text employs evolutionary theory as an explana-
tory and predictive framework, the time distance of the future actually in-
creases the plausibility of the counterfactual rather than reducing it, as in
Twain’s Yankee. Thus, the narrative displays a fairly coherent possible world
with a relatively stable degree of departure. The future of The Time Machine
is in fact a blend of the (19th-century) present and the evolutionary future,
which even contains elements of the past: not only does the narrative reflect
an interest in archeology and an apocalyptic mood popular in Wells’s time
that saw humanity on the wane; much of the description and imagery is remi-
niscent of the ancient Greeks and Egyptians and refers to a mythic, Edenic
human past and innocent childhood, but also to humans as formerly primi-
tive killers and cavemen.22 The future of humanity with crabs and arthropods
reads much like the pre-historic past, the ultimate future being the end of life.
Fitting the evolutionary paradigm of the narrative, the conception of time
is of the “bread loaf ” kind, i. e. deterministic, although somewhat inconsist-
ently. For one, the narrative suggests that the time traveler may change the
course of events, or, at least, that he interferes. Even if that individual inter-
ference does not matter on the larger scale of evolution, the narrative clearly
implies that cultural evolution has supplanted biological evolution, and that
20 Cf. Peter Firchow, “H. G. Wells’s Time Machine: In Search of Time Future – and
Time Past”, in: Midwest Quarterly, 45/2004, pp. 123–136, p. 127.
21 Firchow, “In Search”, p. 131.
22 Cf. Firchow, “In Search”, p. 134.
222 Rüdiger Heinze
this cultural evolution may very well be subject to interference in the present –
for what purpose should the warning call the narrative quite loudly sounds
serve, if not to call for a reform of affairs? The counterfactual underlying
the narrative, “this is what might very likely happen if we extrapolate cur-
rent trends into the far future” can be extended to include the appeal “so do
something about it before it is too late”. However, in the long run, the sun
dies and all life ends, so does it then make sense to try to change the present
from a cosmological perspective? Ultimately, I think the narrative remains
undecided about the possibility and sensibility of individual agency. The time
traveler does appeal to a moral sense and responsibility, assuming that one’s
actions do matter; on a larger evolutionary and cosmological scale, how-
ever, all individual human action seems futile. In a way, the narrative well ex-
presses the dissonant mood of the fin de siècle.23
23 For an excellent essay on The Time Machine’s relation to visuality and early cinema,
cf. Jonathan Bignell, “Another Time, Another Space: Modernity, Subjectivity
and The Time Machine”, in: Sean Redmond (ed.), Liquid Metal: The Science Fiction Film
Reader, London/New York 2004, pp. 136–144. As he points out, the narrative is
very “cinematic”, the time traveler watches the time fly by on his cushy chair
through a window, a spectator just like in the movie theaters and the still popular
diorama and panorama (cf. Bignell, “Another Time”, pp. 136–137). He takes “a
tourist trip to alien spaces” (Bignell, “Another Time”, p. 136), so that temporal
movement equals spatial movement.
24 Cf. Robert Zemeckis (dir.), Back to the Future, Universal 1985; Zemeckis, Back to the
Future II , Universal 1989; Zemeckis, Back to the Future III , Universal 1990.
Time Travel and Counterfactuality in Literature and Film 223
into the future of 2015. The temporal details of the series are too numerous
to recount in detail here. Let it suffice to say that (1) not all time travel in the
series is on purpose because external circumstances and coincidences such
as lightning interfere with the technology; (2) and following this, time does
not cooperate: the characters never seem to be in control of their carefully
timed plans, escape is always narrow. Already at the very beginning of the first
part, a room is shown full of clocks. As it turns out, and much to the dismay
of the protagonist, all of them are 20 minutes late. Despite the fancy-looking
high-tech time machine car and the doctor’s scientific-sounding disquisitions
on time and temporality, time is definitely out of joint. (3) In the course of
the series there is a temporal doubling or even tripling of characters caused
by their traveling to the same time more than once. As a result, temporal
paradox, and consequent disaster, is always impending. (4) Themes, images,
and characters recur in all the different times. For example, the 1890s also
have a bully that happens to be the forefather of the bully of the 1950s
and 2015. Family patterns repeat themselves, as do improbable coincidences.
Characters constantly experience temporally paradoxical déjà vus, not to men-
tion the grammatical back bending and word play in order to accommodate
the temporal contradiction that results from the disjunction between story
chronology and temporal chronology.25
Despite the series’ seeming preoccupation with history, the counterfac-
tual is actually not historical but personal. Not only is the main character
always late and has substantial problems “keeping time”. Underlying the
whole series is the assumption that one single incident and decision in one’s
life determines one’s whole future life, and possibly the future of following
generations. Consequently, the series toys with the counterfactual of what
would have happened if Marty’s father, Marty, and his son, had not made that
single crucial mistake and had stood up for themselves. Through time travel,
the films dramatize what would happen if one had the chance to undo the
past and then what happens if one does change the past. As the first part
shows, once Marty’s father has knocked out the bully, his entire life changes.
When Marty returns to his present, his family is better off, sleek and success-
ful, his father has fulfilled his dream of becoming a writer, and the bully com-
plaisantly polishes their car. This pattern is repeated in the next two parts.
The trips to apparently historical stages of the U.S. thus primarily serve as the
stage for this paradoxical mix of determinism and free will. On the one hand,
25 At one point, after arriving in the 1950s, Marty notes about his high school that
“they really cleaned this place up” when of course it is cleaner because it precedes
the state of affairs in 1985.
224 Rüdiger Heinze
except for the very end of the trilogy, neither Marty nor any other male
member of his family seems to be able to change a personality pattern that
keeps getting them into trouble, as if it was genetic and as if history repeated
itself. On the other hand, Doc Brown’s emphasis that the past must not be
changed, and that in general one must not interfere with time, seems to sug-
gest that one can change time. Indeed, the whole second part is about “re-
pairing” unwanted changes to historical time and eliminating an alternative
time line. Moreover, the protagonists again and again change time and events
to suit their purposes; the main counterfactual actually propagates learning
from the past (and the future) and then seizing the opportunity to change
one’s “fate”. Of course, this actually constitutes an alternative time line – al-
though this is never explicitly stated – that should not be changed.
In general, even though it seems to present a straightforward manner of
travel, time traveled to, and conception of time, the series is full of intended
and inadvertent temporal inconsistencies. Although the manner of travel
is technological, it turns out to be quite unreliable – control is illusory. The
time traveled to is at a closer look actually not historical, but rather histori-
ographic, specifically: filmic, and in that regard similar to Twain’s mythic
past. The films show us not the 1890s and 1950s of U.S. history but of U.S.
film history. Fittingly, the travel to the “wild west” in the third part takes place
in an abandoned drive-in film theater; the “Indians” on a poster at one of the
walls then turn into “real” ones upon Marty’s arrival. The temporal concep-
tion is thus mostly coherent, although the mostly stable degree of departure
is not from the actual but from the film world, making the series nostalgic
throughout. In terms of consistency, however, the series is contradictory, re-
sulting from the specific counterfactual discussed above. Only in part two do
the changes Biff the bully makes create an alternative time line that has to
be destroyed; all other changes to the advantage of the heroes do not create
such timelines. The films hence work with an inconsistent combination of
“bread loaf ” and multiverse cosmology. As is the case with most narratives
featuring temporal paradoxes, non-contradiction is at some point surren-
dered for the sake of maintaining narrative interest.26
26 I follow Ryan here, who also points out that few narratives consistently employ all
different kinds of temporal paradoxes and argues that this is because it would vir-
tually obviate any kind of narrative interest (cf. “Temporal”, pp. 147–149).
Time Travel and Counterfactuality in Literature and Film 225
III. Conclusion
It seems that very few time travel narratives are wholly coherent and consist-
ent regarding temporality and logic. At some point, most of them sacrifice
stringency for narrative interest and/or critical investment. Kurt Vonnegut’s
Slaughterhouse-Five is apparently deterministic in that one allegedly cannot
ever change anything, yet one can also never be sure about anything since the
messed-up psyche of the protagonist – not to mention the inconsistent nar-
rative perspective of the narrator – undercuts all reliability of the narrative.27
For a portrayal of atrocious events and a traumatized protagonist, this is
quite fitting. Time Bandits displays only a feeble pretense at temporal logic.
In fact, considering the content (a loot of precious historical artifacts by
dwarves and a boy) and the genre (fantasy), this seems to be the desired ef-
fect. Logic is obviously not the primary purpose; rather, history, legend, and
myth are shown to be close together, and notions of temporal stability and
progression as well as a naturalistic world view are undermined. Even Mi-
chael Crichton’s carefully constructed Timeline is actually illogical.28 The title
is misleading since the narrative works with the alternative timelines of the
multiverse cosmology, but then the alternative lines are always only in the
past, and the present of the book suggests one timeline only. However, since
the narrative is primarily a thriller with the temporal construction as drama-
turgic conceit, this inconsistency does not seriously damage the narrative in-
terest of suspense.
Obviously, focusing solely on inconsistencies and illogicalities in time
travel narratives is beside the point. If we concede once more that the ana-
lytic utility of the counterfactual need not necessarily lie in realism, physical
laws, or rationality, then time travel counterfactuals may have important les-
sons to teach as thought experiments and as imaginative possible worlds that
allow us to vicariously put ourselves “in the skin of the characters whose life
is being invaded by the irrational”.29 Even those time travel narratives that
cloak their contradictions in the thin veneer of apparent consistency and co-
herence at closer scrutiny show us a deeply unsettling, “unreliable” universe,
where time is “out of joint”, and where notions of free will, control, and au-
tonomous agency are highly questionable or at least problematic. One expla-
nation for the recent “explosion of time travel films” and novels, then, might
be a “pervasive uneasiness about our present and uncertainty about our fu-
ture, along with a concurrent nostalgia about our past”.30 Most of the visions
of the future are bleak and apocalyptic and constitute a negative extrapo-
lation of the present. Interestingly, these negative visions of the future are
not necessarily based on a deterministic temporality, but rather base their
extrapolation on a human nature that is shown as morally flawed and imper-
fect. It is humanity’s unique capacity to recognize good and choose evil that
most often leads to disaster and has to be combated by the time traveler. No
wonder, really, that many visions are either explicitly or implicitly of the past,
which is often – though not always! – nostalgically shown as more “authen-
tic” and unspoiled by technology.31
Among the greatest strengths and appeals of fiction is that it can con-
struct, contain and project a virtually endless variation of worlds and scen-
arios, a unique testing ground for thought experiments, with tremendous
aesthetic and experiential “fringe benefits”. Another appeal is that the
readers participate in the construction of these worlds. Fictions with time
travel scenarios offer one way of compounding these pleasures by play-
ing through a variation of conditionals and counterfactuals that are not as
strictly bound by the constraints of laws of rationality, plausibility and prob-
ability as other kinds of narrative. Indeed, it seems that time travel narratives
tend to exploit the more playful potential of fiction and of creating possible
worlds; their potential for counterfictional self-reflexivity offers an addi-
tional vantage point from which to explore the functions of fiction. And yet,
playfulness aside, time travel fictions are also fundamentally about repeti-
tion, contingency, causality, and about the decisions we have to make on a
daily basis. They cater to our understandable occasional dissatisfaction – as
unreasonable as it may seem, but who ever said we have to be reasonable all
the time? – that we cannot change the past and do not have the hindsight of
the future. At least in fiction, we may get to have our cake, and eat it, too.
To speak, as Hilary Dannenberg and other critics now do, of the “counter-
factual” in the context of prose fiction is in some ways counterintuitive.1 To
be sure, the counterfactual is itself a fictitious state of affairs. The subject of
study by social and cognitive psychologists, philosophers, political scien-
tists, legal scholars, and historiographers, among others, a counterfactual is a
statement of something that did not or does not happen, an alternative scen-
ario that contrasts with a real occurrence or a genuine possibility. Counter-
factuals may take the form of “if only …” thinking, as in “If only Jane Austen
had lived into her sixties, we might have fifteen or twenty of her novels
instead of six” or “If only I lived in London, I could go to the theater
every week”. With counterfactuals, one answers questions like, “What would
have happened, if …?” or “What happens if …?” or “What happens if we
don’t …?”. In the possible answers to all such questions, counterfactuals are
phrased subjunctively in sentences having dependent clauses that posit an
antecedent contrary to fact. If Jane Austen had lived longer, if I lived in Lon-
don: these antecedents posit lives that are only imaginary. They suggest al-
ternate realities that would be improvements upon the actual facts; hence,
social and cognitive psychologists would call them “upward counterfac-
tuals”.2 Examples of “downward counterfactuals” would be “If Jane Austen
had died as a teenager, we would not have Pride and Prejudice in its current
form” or “If I lived in rural Idaho, I would have fewer opportunities to at-
tend plays than I do now”. In downward counterfactuals, the result of an
antecedent – the consequent – is worse than the factual state of things.
The assertions following upon the false antecedents are always phrased
subjunctively, but the auxiliary verbs employed carry different degrees of
force. “Might” is less certain than “could”, and “would” (or in British Eng-
1 Cf. Hilary Dannenberg, Coincidence and Counterfactuality: Plotting Time and Space in
Fiction, Lincoln 2008.
2 Cf. Keith D. Markman/Matthew N. McMullen/Donald A. Elizaga/Nobuko Mi-
zoguchi, “Counterfactual Thinking and Regulatory Fit”, in: Judgment and Decision
Making, 2/2006, pp. 98–107; and especially David R. Mandel/Denis J. Hilton/
Patrizia Catellani (eds.), The Psychology of Counterfactual Thinking, New York 2005.
228 Robyn Warhol
lish “should”, as in “I should very much like that”) implies more probability
than either of the other two modalities. “If I lived in London, I might go to
the theater every week” sounds like a lukewarm response to someone else’s
suggestion; “If I lived in London, I would go to the theater every week” car-
ries confidence in my ability, for example, to find the time and to afford the
tickets, unlikely as those eventualities may be. When in Much Ado About No-
thing, Beatrice exclaims in her outrage over Claudio’s behavior, “If I were a
man, I would eat his heart in the marketplace!”, she is recommending to Ben-
edick a more drastic course of action than if she were to say she might or
could eat Claudio’s heart. As she is not a man, her point is moot, as is the
point of all counterfactuals. Contrary to fact as they are, they are not true.
But then, neither is fiction. That is, fiction makes no claims to truth,
though a historical novel, for instance, might overlap in some respects with a
reality external to the storyworld. Realist novels, too, make use of recogniz-
able details – street names, celebrities, routes from one city to the next, de-
scriptions of the weather – as part of the apparatus of what Ian Watt called
“formal realism”, the aggregation of techniques that add up to the realist ef-
fect.3 An entire novel, however, whether a work of historical, realist, or fan-
tasy fiction, amounts to one big counterfactual, pretending to assert, “If all
the conditions set out in this text were true, this plot is what might or could
or would happen”. For this reason, I propose to be more specific than Dan-
nenberg is by calling passages in novels that spell out what might or could or
would have happened if the story were different “counterfictionals”.4
This is another way of thinking about the speech-act theory of literature,
which holds that literary writing is made up not of constative statements that
could be evaluated as true or false, but of performative statements that bring
into being the thing they say. A performative is neither true nor false; it only
happens (in felicitous conditions, as the linguists would have it) or does
not happen.5 Indeed, performative utterances make something happen; they
bring into being a promise, a threat, a christening, a blessing – or, in the case
of literary writing, the creation of a storyworld. As a performative speech act,
literary writing is under no obligation to link the diegesis to any referent in
3 Cf. Ian Watt, The Rise of the Novel, 2nd ed., Berkeley 2001.
4 For the way I am using the concept of counterfictionality I am indebted to con-
versations with Daniel Dohrn and Richard Saint-Gelais at the “Counterfactual
Thinking/Counterfactual Writing” conference at the Freiburg Institute for Ad-
vanced Studies, September 28–30, 2009. See also Elaine Scarry’s use of the term
“counterfactual” to point back to the real in the context of courts of law (The Body
in Pain: The Making and Unmaking of the World, New York/London 1987, p. 299).
5 Cf. John L. Austin, How To Do Things With Words, Oxford 1962.
Dickens's Narrative Refusals 229
the world outside the text. In this respect, the “counterfactual” in literary
writing is distinct from the object of study for historians or philosophers
who can distinguish it from “fact” or “truth”. Thus, the “actual world” of a
fiction is every bit as much the consequence of performative language as
is the “counterfactual world” the narrative might invoke as an alternative.
Although some theorists of the counterfactual in fiction speak of the “actual
fictional world” to distinguish what happens in a novel from that which does
not happen, the “storyworld” or “the diegesis” are better terms in that they
avoid the oxymoron of “actual fiction”.
I begin with an example from Charles Dickens’s Great Expectations, in
which Pip, the narrator, describes a dinner in his childhood home, the forge.
Through an elaborate series of negations quite characteristic of Dickens’s
later style, Pip explains he was uncomfortable
[n]ot because I was squeezed in at an acute angle of the table-cloth, with the table
in my chest, and the Pumblechookian elbow in my eye, nor because I was not
allowed to speak (I didn’t want to speak), nor because I was regaled with the scaly
tips of the drumsticks of the fowls, and with those obscure corners of pork of
which the pig, when living, had had the least reason to be vain.6
Pip brings up the angle, the elbow, and the scaly tips in the course of telling
what was not making him miserable (“not because I was squeezed […] nor
because I was not allowed […] nor because I was regaled” was Pip unhappy).
Embedded in this series is yet another set of negations: for instance “I was not
allowed to speak (I didn’t want to speak)” and “those obscure corners of pork
of which the pig, when living, had had the least reason to be vain”. While it
is hilarious, this passage is a classically Dickensian example of an upward
counterfictional. Behind all these negatives shimmers a potential diegetic
world where children want to speak and are allowed to speak, where pigs
have reason to be vain of the less obscure corners of their bodies, where
being squeezed at the table or forbidden to talk or parsimoniously regaled
with bad food are circumstances unusual enough to be the source of a child’s
discomfort. That is not, of course, the diegetic world Pip inhabits, nor is it a
diegetic world present in any of Dickens’s novels. “Ah, me!”, R. Wilfer says
to himself upon his first appearance in Our Mutual Friend, “what might have
been is not what is!”.7 The rendition of what might have been and yet is
not does form, however, much of the substance of Dickens’s later narrative
prose. Direct narratorial references to some of the specifics of what might have
been and yet is not are what I call “narrative refusals”.
This essay comes out of my new project in narrative theory; a contribu-
tion, I hope, to the descriptive poetics of the nineteenth-century British
novel, which examines specific narrative gestures in the context of the cul-
tural formation of subjectivity. Looking at what I call “narrative refusals”
gives us a glimpse at a previously unrecognized facet of the complexities that
form the distinctive Dickensian prose style, allowing us to see differently
what is “there” in the storyworld by turning our attention to what is marked
as explicitly “not-there”. With specific reference to Great Expectations, Our
Mutual Friend, and Dombey and Son,8 I will begin by laying out two modes of
narrative refusal in Dickens’s middle and later novels, disnarration through
negation and disnarration through the use of subjunctive constructions. I
will then turn to an earlier work, Nicholas Nickleby,9 where narrative refusals
are already incipient, though less common than later in Dickens’s career.
Dickens’s later, “greater” texts reflect a growing awareness of his audiences’
expectation of the cheery, Christmas-story side of “the Dickensian”, a wish
for happier storyworlds than the novelist was willing or able to produce. For
the later Dickens, the details of the happy storyworld are better left unsaid.
To be sure, an unimaginably large quantity of information is literally left
out of any text, considering everything that any novelist might have imagined
or observed, but not included in a particular text’s story or discourse. Nar-
rative elisions, suppressions, repressions, silences, gaps, omissions, or lacu-
nae: all of these invoke that which is “unnarratable” for any given text. Build-
ing on Gerald Prince’s classic definition of the “narratable”, I have elsewhere
identified four types of unnarratability in prose fiction and film: the subnar-
ratable (what need not be told because it is too obvious or boring); the su-
pranarratable (what cannot be told because it is ineffable or inexpressible);
the antinarratable (what should not be told because of trauma or taboo); and
the paranarratable (what would not be told because of literary convention).10
Thresholds of narratability vary from one genre to the next and, within
the same genre, from one period to the next. Even within the genre of the
Victorian realist novel, what is unnarratable for one author may fill up para-
graphs for another. For instance, Trollope does not hesitate to mention the
smallest gestures of body, face, and eye contact in scenes whose substance
would be subnarratable (too boring or obvious) even for novelists as prolix
as Thackeray or Dickens. Always the text is finite, but that which is not nar-
rated is infinite.11 So texts have limits. But there are no limits to the unnar-
ratable.
If the unnarratable usually figures in fiction as a gap, silence, or elision, it
can also motivate explicit narrative refusals of the kind that Pip makes in his
description of the dinner table scene. Pip’s childhood experience of abuse is
antinarratable, too traumatic to recount in the genre of Victorian domestic
fiction. When narrators enact this kind of narrative refusal, they point in the
text to a particular subset of that which is being left out, thus explicitly mark-
ing something that is unnarratable. I have identified two distinct but related
gestures of narrative refusal: unnarration (when the narrator indicates she
can’t or won’t tell what happened), and disnarration (when the narrator tells
something that did not happen, in place of saying what did).
Both of these narratorial modes can come into play when narrators at-
tempt to render material in all four categories of the unnarratable, and any
given novelist is likely to favor one of these gestures over the other. As we
shall see, Dickens’s narrators prefer to disnarrate instances of unnarratabil-
ity. In contrast with that which is simply left out, the unnarrated and disnar-
rated aspects of a text become a vividly present absence, existing at a narrative
level somewhere between the text and everything that is left out of it.12 Un-
narration – the kind of passage where a narrator will say “I won’t go into de-
11 Even an elaborate deconstructive reading intended to open out rather than close
off meanings eventually reaches a boundary beyond which a particular text’s
utterances won’t reach. Whatever else it may be, for example, Little Dorrit is not a
novel about colonizing new planets in outer space.
12 Instances of narrative refusal share features in common with three distinct rhe-
torical figures (as defined by Richard Lanham): occupatio (“emphasizing a point by
pointedly seeming to pass it over”), litotes (“denial of the contrary, understatement
in order to intensify”), and aporia (“true or feigned doubt or deliberation about an
issue”, Analyzing Prose, 2nd ed., London/New York 2003, pp. 240–241). In some
cases unnarration or disnarration will combine effects of two or more of these fig-
ures. They are also related, though not identical to David Herman’s “hypothetical
focalization” (in which a narrator or character produces “hypotheses […] about
what might have been seen or perceived – if only there were someone who could
have adopted the requisite perspective on the situations and events at issue”,
“Hypothetical Focalization”, in: Narrative, 2/1994, pp. 230–253, p. 231) and Brian
Richardson’s “denarration” (“in which a narrator denies significant aspects of her
narrative that had earlier been presented as given”, “De-Narration in Fiction:
Erasing the Story in Beckett and Others”, in: Narrative, 9/2001, pp. 168–175,
p. 168).
232 Robyn Warhol
moves their encounter to the ruined garden at Satis House, where Estella
says they will “remain friends apart”. In this version Pip – having taken Es-
tella’s hand and walked out of the garden with her – says he “saw the shadow
of no parting from her”.15 As if this ending too unambiguously contradicted
the first version by implying that Pip and Estella would become a permanent
couple at the novel’s end, Dickens re-revised it to complicate the disnar-
ration with a further negation: “I saw no shadow of another parting from
her”.16 Commentators may argue endlessly about whether “no shadow of
another parting” means Pip anticipated that he and Estella would stay to-
gether and never part again, or whether it means this is their last parting and
that Pip sees no reason to think they would meet again to part another time.
What makes the ambiguity irresolvable is Pip’s narratorial negation of action:
first he saw “the shadow of no parting”, then “no shadow of another part-
ing”. By rendering the prediction negatively, Pip disnarrates the ending
rather than telling us what did ultimately happen. I read the ending (as I do
the similarly ambiguous end of Charlotte Brontë’s Villette) to say that the nar-
rator-protagonist remains forever single. Such an ending is, in 1860, unnar-
ratable – specifically, it is paranarratable, because it would eventually become
not only acceptable, but conventional for realist novels to end in this way.
For a novelist as profoundly attached to his audience’s pleasure and approval
as Dickens was, the narrative refusal of closure is a startling choice.
The novel following Great Expectations, Our Mutual Friend, sustains from
beginning to end the same way of disnarrating events by negating them
rather than simply leaving them unmentioned. Like those in Great Expec-
tations, disnarrations in Our Mutual Friend work to heighten a sense of ab-
sence. Like the last page of the previous novel, the first page of Our Mutual
Friend leans heavily on negation, this time in the extradiegetic narrator’s first
description of Hexam:
He had no net, hook or line, and he could not be a fisherman; his boat had no
cushion for a sitter, no paint, no inscription, no appliance beyond a rusty boa-
thook and a coil of rope, and he could not be a waterman; his boat was too crazy
and too small to take in cargo for delivery, and he could not be a lighterman or
river-carrier; there was no clue to what he looked for, but he looked for some-
thing, with a most intent and searching gaze.17
This refusal to name Hexam’s object is part of the opening scene’s strat-
egy for building the sense of “dread or horror” the narrator attributes to
is not adequately powerful. In each of these cases but one, the negation
straightforwardly contradicts a positive possibility in order to assert a less
ideal fictional reality. In that one exceptional case, Dickens employs a double
disnarration the more strongly to invoke the course Eugene ought to be tak-
ing with Lizzie, while emphasizing how far Eugene’s actions stray from their
more desirable opposite. The “whole attitude” of Lizzie’s body begs him
not to “force her to disclose her heart”. “Disclose” is a negative locution for
“reveal”; to “force her to disclose” is to do violence to her wish to be silent.
Behind the disnarration of “not to force her to disclose” lies an image of Liz-
zie serenely unmolested, relieved by Eugene’s imagined willingness to simply
let her be quiet. He is, however, “not merciful”, and so he makes her do it.
Every disnarrated action in Our Mutual Friend has its corresponding posi-
tive action. It is as if the narrator is imagining a parallel diegetic world where
the complications driving his novel’s plot become moot, because the de-
sires those complications continually thwart are all already realized. It is a
counterfictional storyworld, an alternate narrative that is not quite not-there.
The disnarrations function here, as in Great Expectations, as a parody of the
naïve view that would imagine this counterfictional storyworld could be a
possibility. If that “shadow world” is born of a naivety the parodic narrator
scorns – for instance, the deluded hope that the world might operate accord-
ing to standard moral norms –, it is also a world that is nonnarratable, where
no conflict or complication would arise to motivate a story. In this respect,
the disnarrations are spaces in which the narrator points to the narrative en-
gines driving the text.
Dombey and Son, like Our Mutual Friend, also carries a shadow-narrative that
flickers behind its main action. Like R. Wilfer, Dombey – finally repenting of
his stony indifference to his daughter – “chiefly thought of what might have
been, and what was not”.19 Of course, Wilfer – that loving father the narrator
likes to call the “Cherub” – is aware of this discrepancy between the real and
the potentially better from his very first appearance in the novel, while Dom-
bey can see nothing beyond his own version of “what is” until he has lost
everything. Whereas the narrator of Our Mutual Friend consistently invokes
that other, more positive possible world through negations, the narrator of
Dombey and Son couches disnarration in the subjunctive mode. Though both
novels continually evoke what might have been and yet is not, Dombey and
Son’s passages of subjunctive disnarration gesture toward a better possible
world only at the novel’s beginning, then shift to reinforce the novel’s pre-
vailing mood of despair. Early in the novel, the narrator raises the subjunc-
tive possibility that Dombey might not have shut out his daughter Florence,
had he been paying sufficient attention to her after her mother had died:
And, perhaps, unlearned as she was, [Polly] could have brought a dawning knowl-
edge home to Mr. Dombey at that early day, which would not then have struck him
in the end like lightning.
Had [Dombey] looked with greater interest and with a father’s eye, he might have
read in [Florence’s] keen glance the impulses and fears that made her waver; the
passionate desire to run clinging to him, crying, as she hid her face in his embrace,
‘Oh father, try to love me! There’s no one else!’; the dread of a repulse; the fear
of being too bold, and of offending him; the pitiable need in which she stood of
some assurance and encouragement; and how her overcharged young heart was
wandering to find some natural resting-place, for its sorrow and affection.20
These subjunctive disnarrations, like the negations in Our Mutual Friend and in
Great Expectations, bring to the surface of the text the specifics of a more posi-
tive version of “what might have been, and what was not”. If he had been able
to learn from Polly’s example, or if he had been capable of reading Florence’s
fear of annoying him, Dombey would have realized the value of his daughter’s
attachment from the beginning, not just at the end. The disnarration invokes
for the reader an alternate story where father and daughter recognize and con-
sole each other for the loss of wife and mother. Still, in this novel the counter-
factual scene – the action that does not happen, but exists behind the action
that does – is not a particularly cheerful alternative. The narrator brings in
through the disnarration the spectacle of the child’s wavering, crying, and
clinging; her dread, her fear, her pitiable need, and her overcharged heart find
their way into the text without Florence’s having actually to enact them. The
shadow-narratives brought forth by this and subsequent passages of disnar-
ration come to rival the Dombey family’s actual story for perversity and misery.
Neither do the hypothetical situations the narrator proposes as “better”
than Dombey’s actual relations with his daughter present anything like posi-
tive alternatives to the diegetic action: “Oh, how much better than this that
[Dombey] had loved her as he had his boy, and lost her as he had his boy, and
laid them in their early grave together!”.21 In the parallel potential story
proposed here, Florence dies young and beloved, like her brother Paul.
The heroine’s early death would be better – the narrator exclaims – than that
Florence should waste her affection on her insensible father. Dombey’s
relations with Edith, his second wife, are identically hopeless, in both their
narrated and disnarrated versions:
[Edith] had better have been dead than laugh as she did, in reply, with her intent
look fixed upon [Mr. Dombey]. He had better have been dead, than sitting there,
in his magnificence, to hear her.22
Better for them both to die than to laugh and be laughed at with such dis-
dain and indifference: in the “better” alternate story behind the disnarration,
both Dombey and Edith succumb to the murderous enmity between them.
For that matter, Florence’s father is so indifferent to her forgiveness and
sympathy that he is not just incapable of responding to them in the diegetic
world as presented. He is equally closed off in the possible world suggested
by the hypothetical passages of disnarration:
If [Dombey] could have heard her voice in an adjoining room, he would not have
gone to her. If he could have seen her in the street, and she had done no more than
look at him as she had been used to look, he would have passed on with his old
cold unforgiving face […].23
This is equally true of Dombey’s relationship with Edith, who from the be-
ginning of their marriage defies his assumed power over her.
[Dombey] might have read in that one glance that nothing that his wealth could
do, though it were increased ten thousand fold, could win him for its own
sake, one look of softened recognition from the defiant woman, linked to
him, but arrayed with her whole soul against him. He might have read in that
one glance that even for its sordid and mercenary influence upon herself, [Edith]
spurned it […].24
Even if he had seen the loving girl on the hypothetical street, or recognized
the disdainful woman actually sitting before him, his situation would be
equally bleak. Up to the moment just before his epiphany, Dombey remains
even hypothetically incapable of connecting with anyone other than his dead
son:
Here, thrown upon the bare boards, in the dead of night, he wept, alone – a proud
man, even then; who, if a kind hand could have been stretched out, or a kind
face could have looked in, would have risen up, and turned away, and gone down
to his cell.25
In the alternate world invoked by the subjunctive disnarration, Dombey’s
pride would keep him as isolated as he is in the presented story. “What might
have been” for Dombey proposes no positive alternative. If the suddenness
of his final reconciliation with Florence seems to come without preparation,
this is at least partly attributable to the hopelessness of the parallel story the
disnarration so consistently tells. The implied reader has little reason to ex-
pect that a blissful father-daughter reunion could be on the horizon of a
storyworld so unremittingly bleak in both its diegetic and counterfictional
versions.
Great Expectations, Our Mutual Friend, and Dombey and Son are all celebrated
not just for the complexity of their plots, but also the complexity of their
psychology. Critics of the Victorian novel take it for granted that the earlier
Nicholas Nickleby is not complex in the same admirable way as the later work.
It would be easy to say that Nicholas Nickleby (either the character or the
novel) has no psychology – but then, no novel has a psychology, strictly
speaking: characters’ psychology is an effect created through the narrative
construction of interiority. Looking for narrative refusals in this earlier novel
shows that Dickens is already working on the technology of structuring a
subjectivity for the text. I will offer only one example from this earlier novel:
in an extremely rare moment of sentimental reflection, Ralph Nickleby falls
into a subjunctive reverie that briefly adds a depth-effect to his characteriz-
ation:
He thought of what his home might be if Kate were there; he placed her in the
empty chair, looked upon her, heard her speak; he felt again upon his arm the
gentle pressure of the trembling hand; he strewed his costly rooms with the
hundred silent tokens of feminine presence and occupation; he came back again
to the cold fireside and the silent dreary splendour; and in that one glimpse of a
better nature, born as it was in selfish thoughts, the rich man felt himself friend-
less, childless, and alone.26
Here the disnarrated images – the sound and sight of the young woman in
the chair, the sensation of her hand on Ralph’s arm, the “female touch” in
the room’s decoration and clutter – represent Ralph’s momentary sense of
what might have been, what the narrator calls “a better nature”. Here Ralph
is more than an early version of the “character type” Dickens will later de-
velop into Dombey. His behavior and attitudes resemble Dombey’s, to be
sure, but more striking is the parallel between the ways the narrators of the
two novels use subjunctive disnarration to structure the two characters’ psy-
chology. As the narrator remarks immediately after Ralph’s musings, “A very
slight circumstance was sufficient to banish such reflections from the mind
of such a man”.27 Such reflections are also evidently banished from this text,
as the narrator of Nicholas Nickleby refrains from disnarrating the potential
Jane Austen’s fans are well aware of the phenomenal number of adaptations,
sequels and parodies of her novels, especially of Pride and Prejudice.2 Some of
these are more surprising than the expected “what happened afterwards” or
“amorous life of a minor character” formulae. Seth Grahame-Smith’s Pride
and Prejudice and Zombies tells anew the story of Elizabeth Bennet and Fitzwil-
liam Darcy,3 inserting into the narrative – otherwise largely identical with
Austen’s – several undead creatures and ninja combat scenes, for no other
reason than the fun of creating a very unlikely cultural hybrid, in which
the two main characters fight both with their inner feelings and with the
“unmentionable beings” that infest Netherfield Park’s environs. Two years
earlier, Emma Campbell Webster’s Lost in Austen, subtitled “Create your own
Jane Austen Adventure”, offered another type of variation – or rather vari-
ations, since this is an interactive novel inviting the reader to identify with
Elizabeth Bennet and, through a sequence of choices, to try to make a profit-
able marriage, with Darcy if possible (but not necessarily).4 This is not an
easy task – most of the paths lead to utter failure – and the text becomes pro-
gressively labyrinthine, as the reader discovers that his or her choices unravel
the original narrative and bring it to unexpected directions, including some
excursions into the plots of other Austen novels.
While it may be premature to speak of a tendency, these two books
(to which we may add a handful of others, based on various classics such as
Flaubert’s Madame Bovary or Stevenson’s Dr. Jekyll and Mr. Hyde) present
a common denominator: Each of them intrudes into a previous fiction, not
in an additive mode (as do sequels), but in a substitutive way. They retell the
1 I would like to thank Dorothee Birke for her useful comments on a previous ver-
sion of this article.
2 A bibliography compiled by Rolf Breuer lists 186 “completions, sequels, adap-
tations, pastiches, and fictionalisations” published between 1850 and 1998, 71 of
which are based on Pride and Prejudice alone. Cf. http://webdoc.sub.gwdg.de/
edoc/ia/eese/breuer/biblio.html (March 28, 2010).
3 Cf. Jane Austen/Seth Grahame-Smith, Pride and Prejudice and Zombies, Philadelphia
2009.
4 Cf. Emma Campbell Webster, Lost in Austen, New York 2007.
From Counterfactuality to Counterfictionality 241
novelistic facts. Lizzie Bennet may fail (or refuse) to marry Mr Darcy, as
happens in several narrative paths of Webster’s Lost in Austen. Emma Bovary
may avoid suicide, as she does in Jacques Cellard’s Emma, oh! Emma!9 Ham-
let may decide to forget about his mother’s second marriage and elope with a
young comedian named Kate, as in Jules Laforgue’s Hamlet.10 I will not pre-
tend that this formula is frequent enough to form a recognizable literary
genre. But the questions it raises justify, I think, considering the formula
more closely, especially since, as we will see, the various ways in which it is
done might offer some insight into the nature and status of fiction.
First, some conceptual indications may be useful. The phenomenon I
want to examine is, in some respects, the intertextual counterpart of the nar-
rative device that Gerald Prince has analysed some years ago under the label
of the “disnarrated”, i. e., “all the events that do not happen but, nonetheless,
are referred to (in a negative or hypothetical mode) by the narrative text”.11
Disnarration happens in virtually every narrative. For instance, in Chandler’s
The Big Sleep, Marlowe, the narrator, says: “I was still staring at the hot black
eyes when a door opened far back under the stairs. It wasn’t the butler coming
back. It was a girl”.12 Recently Prince’s idea has been rediscovered, unknow-
ingly it seems, by Maxime Abolgassemi under the label of “contrefiction”.13
With either disnarration or contrefiction, the narrative erases from itself an
event or circumstance. But the erasure is not complete, as it would be if a
narrator simply omitted a segment of time without mentioning it. On the
contrary, the disnarrated makes the erasure visible. As a consequence, the
disnarrated suggests what it denies, for saying that something did not
happen in a story is a sure way to instil the idea that it could have. Disnar-
rated sentences, thus, create possible worlds in a fictional world. This be-
comes clear when, to quote Prince, they “pertain to a character’s unrealised
imaginings (incorrect beliefs, crushed hopes, false calculations, erroneous
suppositions)”.14
26 One of the very few examples I know of is Charles Renouvier’s Uchronie, first pub-
lished in 1857 (cf. Renouvier, Uchronie, Paris 1988, p. 110: “Avec cette lettre, prob-
ablement apocryphe, nous entrons dans le roman de l’Uchronie, pour ne plus le
quitter. L’auteur appelle à de grandes destinées cet Avidius Cassius, que l’histoire
nous apprend avoir été assassiné dans son armée.”).
27 Cf. Arthur Conan Doyle, “The Empty House”, in: Sherlock Holmes: The Complete
Novels and Stories, Vol. 1, New York 1986, pp. 663–682.
28 Cf. Arthur Conan Doyle, “The Final Problem”, in: Sherlock Holmes: The Complete
Novels and Stories, Vol. 1, New York 1986, pp. 642–659.
246 Richard Saint-Gelais
At this point, we may pause and ask if these alterations of previous nar-
ratives really function as counterfactuals do. An obvious difference is that
counterfictional texts do not follow the “if … then …” form. Novels that
alter previous novels do not begin with phrases like “Let us suppose that …”;
such a modal frame is implicit at best. Emma, oh! Emma!, for instance, works
like the development of a hypothesis contrary to what is established in Mad-
ame Bovary: if Rodolphe had helped Emma out of her financial quagmire, she
would not have committed suicide (and her fate would have been bleak any-
way). But this situation is not described as the result of a hypothesis. There is
no “if … then …” frame; the states of affairs narrated in Cellard’s novel are
given as facts, not suppositions. It is the reader who brackets them, by seeing
them as a conjectural reasoning based on Flaubert’s original novel. The same
may be said of counterfactual novels, which do not use the “if … then …”
frame either: here, too, it is supplied by the reader who understands that, for
instance, Bring the Jubilee posits as its actual narrative world one where Ameri-
can geopolitics, among other things, differ widely from what we know as
real. This suggests that the implicitness of such frames is a general tendency
of fiction, whether counterfictional or counterfactual. The reason for this si-
lence is simple: Fiction is based on make-believe. It does not, after the word
“novel” on the cover of the book, describe itself as fiction, or as the devel-
opment of a speculation. Neither does counterfiction, it seems. Even a meta-
fictional text such as The Eyre Affair pretends that the world of Thursday
Next is reality, as opposed to the avowed fictiveness of Jane Eyre’s world.
Theoretically, nothing prevents an author from playing an overt metafic-
tional game in a counterfiction; in fact, very few of them seem ready to do it,
maybe because they judge that one game at a time is enough.
If we want to find counterfictional utterances that share the explicitness
of the frames used in counterfactuals, we have to turn to the critical dis-
course, to texts that interpret other texts, an operation which naturally leads
to the recognition of the commented text’s existence. Being situated outside
fiction, a critical text treats fictional states of affairs as the fictions they are.
This normally rules out any intervention in the commented text’s diegesis:
Critical discourse is expected to offer interpretations, not rewritings. Some
commentaries, though, explore the virtual paths the plot of a novel could
have taken. A remarkable example is Valincour’s Lettres sur la Princesse de
Clèves, first published in 1678.33 This essay has recently been hailed as an
early example of the formalist study of fiction, as well as of creative criticism.
cepted money from his clients, he would still have been able to pay his part of
the rent for the flat on 221B Baker Street. Things are not so clear when
we consider unframed counterfictions produced in a fictional context. It is
counterintuitive to disqualify as “wrong” or “false” a novel in which Emma
Bovary does not commit suicide. A critic may deem it a bad aesthetical
choice, but this is a completely different matter. Novelistic counterfictions,
after all, are fictions and, as such, supposed to escape truth judgments. It
would be nice if we could stick to this simple axiom. I am afraid, though, that
the problem is a bit more complicated.
There are counterfictions such as Emma, oh! Emma! which read as fictions
upon previous fictions, as playful variations on the fate of the characters.
Other counterfictions stand at the same level as the fictions they revise.
Conan Doyle’s “The Empty House” or François Cérésa’s Cosette are not to be
read as musings about “The Final Problem” or Les Misérables, but as rectifi-
cations of those narratives’ so-called inaccuracies. Does this expose them to
truth judgments? I do not think so. Readers will form opinions about such
operations. But these opinions will probably depend more on the author’s
identity than on the content of the texts. A counterfiction written by the
original author revisiting his or her previous fiction will be received more
favorably than the same operation performed by another author. Conan
Doyle’s resurrection of his hero is seen as a legitimate prerogative of a cre-
ator, whereas François Cérésa’s resurrection of Victor Hugo’s character has
been described by some as a betrayal of the latter’s intentions. In our culture,
the original author is granted a privileged relationship with “his” or “her”
fiction that enables him or her to claim knowledge about parts of the fic-
tional world outside the scope of the narrative.37 Hence the difficulties we
face when we try to evaluate counterfictionals, since this entails institutional
factors (e.g., the status and identity of the author) as much as internal, tex-
tual, ones.
Critical discourse, exemplified by the essays of Valincour and Bayard,
presents yet another set of problems. Critical discourse, by definition, is out-
side of fiction: It makes serious statements about fictions. This is clearly
the case when Valincour suggests possible amendments of Madame de la
Fayette’s novel. But what about Bayard’s new solution to The Murder of Roger
Ackroyd? His tongue-in-cheek tone makes it difficult to decide whether this
is to be read as an effort to solve the mystery, to be taken seriously, or as a
sophisticated literary joke. Maybe this uncertainty is precisely the point of
37 But even this may be contended, as has been shown by the controversy surround-
ing J. K. Rowling’s post facto “revelation” about Dumbledore’s homosexuality.
From Counterfactuality to Counterfictionality 251
the essay: to show the indeterminacy of fiction, but also its pervasiveness, its
propensity to invade unexpected fields, including criticism. Bayard describes
his book as “a detective novel about a detective novel”. His rewriting of The
Murder of Roger Ackroyd is at the same time an essay and a fiction, the result of
a thorough investigation and an exercise in critical ingenuity – an ingenuity
bordering on imagination.
One last word: why? Why alter previous (and often prestigious) fictions?
This clearly looks like an instance of our postmodernist age’s tendency to
build works of art upon works of art, texts upon texts, fictions upon fictions.
But the phenomenon of counterfictionality is anything but new. Valincour
published his essay in 1748. In 1749 Nahum Tate published a version of King
Lear which, among other revisions, provided the characters with a happy
ending: Cordelia does not die but marries Edgar, and Lear regains his
throne.38 Specialists agree, though, that this bold move was complying with
the eighteenth-century readers’ desire to “correct” the original’s bleak dénoue-
ment. Recent counterfictions exhibit a more playful attitude to the narratives
they modify. The alterations they perform are less governed by ideological or
moral imperatives such as the desire to reconcile a narrative with a current
set of values. Rather, they aim to stress the indefinite malleability of fiction,
what we may call its “rewriteability”. The recent additions to the Jane Austen
canon I have mentioned earlier tend to confirm this. Grahame-Smith’s Pride
and Prejudice and Zombies stretches the capacity of a narrative to incorporate
elements alien to its storyworld’s tacit rules; modifying the story, here, entails
a radical transformation of the world – the set of constraints governing what
is possible or not in the fiction – in which events take place.39 Campbell
Webster’s Lost in Austen maintains the original world-rules, but moves to-
wards the exhaustion of the narrative possibilities they open. Her book does
not offer a story but an arborescence of stories, the one told in Pride and
Prejudice being just one among several – one, moreover, that the curious
reader will probably neglect in favor of more disastrous (but much more
amusing) paths.
This suggests two lines of divergence between past and recent counterfic-
tions. The first line divides normative and anti-normative contexts and prac-
tices, but it may also provide a tentative explanation of the uneven distribu-
tion of counterfictionality in history: Paradoxically, both strongly normative
(as the 17th century in France) and strongly anti-normative periods seem fa-
vorable to counterfictional rewriting, although in sharply different forms,
whereas more indifferent contexts seem to be less propitious. The second
distinction concerns the relationship between counterfactuality and the idea
of fictituous worlds. It is probably more blurred, if only because it is much
less conspicuous than normative considerations. But I would submit that
recent counterfictions show a playful attitude not only towards stories, i.e.
characters and their fates, as previous counterfictions already did, but also
towards the world the original author had outlined around them. Worlds, in
this context, are treated less as given than as objects of ludic writing prac-
tices.
But these remain hypotheses, to be verified, revised, or qualified through
a more extensive study. What remains certain, though, is that counterfic-
tionality takes advantage of what we already knew, but forgot in our fasci-
nation with the apparent reality of fictional lives and settings – that fiction,
being made of words, can always be made again and again by other words.
List of Contributors 253
List of Contributors
Michael Butter, Junior Research Fellow at the Freiburg Institute for Ad-
vanced Studies (FRIAS ). Research interests: American literature and culture,
popular culture, conspiracy theories. Recent publications: The Epitome of Evil:
Hitler in American Fiction (2009); Arnold Schwarzenegger: Interdisciplinary Perspec-
tives on Body and Image (2011; ed. with Patrick Keller and Simon Wendt); 9/11:
Kein Tag, der die Welt veränderte (2011; ed. with Birte Christ and Patrick Keller).
Martin Hilpert, Junior Research Fellow at the Freiburg Institute for Ad-
vanced Studies (FRIAS ). Research interests: cognitive linguistics, grammar,
language variation and change, linguistic methodology. Recent publication:
Germanic Future Constructions – A usage-based approach to language change (2008).
Science in Context (2007; ed. with Fernando Vidal); Theophysis. Ernst Haeckels
Philosophie des Naturganzen (2005).