Professional Documents
Culture Documents
Van Fraassen Bas C. (2008) - Scientific Representation Paradoxes of Perspective PDF
Van Fraassen Bas C. (2008) - Scientific Representation Paradoxes of Perspective PDF
Van Fraassen Bas C. (2008) - Scientific Representation Paradoxes of Perspective PDF
ISBN 978–0–19–927822–0
10 9 8 7 6 5 4 3 2 1
for
Janine Blanc Peschard
and the memory of
Dina Landman van Fraassen
This page intentionally left blank
Preface
Preface vii
List of Figures ix
Part I. Representation
APPENDICES
On this view our thoughts stand to things in the same relation as models to the
objects they represent. The essence of the process is the attachment of one concept
having a definite content to each thing, but without implying complete similarity
between thing and thought; for naturally we can know but little of the resemblance
of our thoughts to the things to which we attach them. (Boltzmann 1974, 214.)
I will have no truck with mental representation, in any sense.1 The view
Boltzmann expresses here, a view in philosophy of mind or language,
has nothing to contribute to our understanding of scientific representa-
tion—not to mention that it threw some of the discussion then back into
the Cartesian problem of the external world, to no good purpose.
Scientific representation is—as Boltzmann’s own examples in that article
amply show—by means of artifacts both concrete (graphs, scale models,
computer monitor displays, and the like) and abstract (mathematical models,
needed especially when the infinite on infinitesimal play a role). It is on
these artifacts, their use, and the characteristics that are germane to the
roles they play in this use, that we must focus. The reservations about how
the represented must be like its representation, which Boltzmann expresses
about thought, pertain equally to these artifacts, and are pertinent to any
view of how a science relates to its domain of application.2
Secondly, it is not only to our understanding of theories and their
models that representation is relevant. The achievement of theoretical
representation is mediated by measurement and experimentation, in the
course of which many forms of representation are involved as well. Scientific
representation is not exhausted by a study of the role of theory or theoretical
models. To complete our understanding of scientific representation we must
equally approach measurement, its instrumental character and its role. I
will argue that measuring, just as well as theorizing, is representing. The
representing in question also need not be, and in general is not, a case of
mimesis; rather, measuring locates the target in a theoretically constructed
logical space. In this respect I shall make common cause with views
currently found in philosophy of technology.
Thirdly, the analysis of measurement as well as of the conditions of
use for theoretical models can be completed only through a reflection on
indexicality. Since at least the time of Poincaré, Einstein, and Bohr it is
a commonplace that a measurement outcome does not display what the
measured entity is like, but what it ‘looks like’ in the measurement set-up.
3
That point does not go nearly far enough. It serves, however, to announce
the introduction of relationality, perspective, intensionality, intentionality,
and the essential indexical into the discussion of science, though it stops far
short of presenting their full role.
Debates in philosophy of science take place in the context of much wider
tensions and oppositions in epistemology and metaphysics. When, in Part
Three, I come to the theme of structuralism I will begin with protagonists
in the historical Bildtheorie debate, and show how it was already implicitly
challenged by an apparent paradox at the very heart of its conception
of science. I will then take the message about the role of indexicality in
scientific representation to the paradoxes that beset, bedeviled, or otherwise
preoccupied Bertrand Russell, Rudolf Carnap, Hilary Putnam, and David
Lewis. For my part I will propose an empiricist structuralism, in contrast
to structural realism, as a view of science that can stand up to these
challenges. Then, in the last part, I will address the troubling relations that
the appearances bear to the world’s scientific image.
This, so far, is the general outline of what I am setting out to do. But
it may help to add here something about my own starting point. I try
to be an empiricist, and as I understand that tradition (what it is, and
what it could be in days to come) it involves a common sense realism in
which reference to observable phenomena is unproblematic: rocks, seas,
stars, persons, bicycles . . . . Empiricism also involves certain philosophical
attitudes: to take the empirical sciences as a paradigm of rational inquiry,
and to resist the demands for further explanation that lead to metaphysical
extensions of the sciences. There is within these constraints a good deal of
leeway for different sorts of empiricist positions. For my part, specifically,
I add a certain view of science, that the basic aim—equivalently, the
base-line criterion of success—is empirical adequacy rather than overall
truth, and that acceptance of a scientific theory has a pragmatic dimension
(to guide action and research) but need involve no more belief than that
the theory is empirically adequate.3 While this will undoubtedly shape my
discussion, I have tried to write as much as possible of this book in a
way that does not trade on the differences between this view of science
(‘constructive empiricism’) and its contraries (‘scientific realisms’). What
scientific representation is and how it works is everyone’s concern, and
there we may find a large area where more general philosophical differences
need make no difference.4
This page intentionally left blank
PA RT I
This page intentionally left blank
Representation
Aristotle was trading on a typical ambiguity when he wrote ‘‘Tragedy
is a representation of a serious, complete action which has magnitude,
in embellished speech . . . ; represented by people acting and not by
narration. . . .’’ (Poetics 49b25). We can read his statement as describing
either the poet’s activity or the poet’s product. It is in the activity of
representation that representations are produced. This is not an accidental
equivocation. We lose our topic altogether if we attempt to ask ‘‘what
is a representation?’’ and tacitly take just one or the other aspect into
account; for in fact we cannot understand either in isolation. That applies
to scientific representation as well.
There is a vast and recently rapidly increasing literature on representation
both in general and in philosophy of science. Let me express at once my
accord with the approach advocated by Mauricio Suarez:
I propose that we adopt from the start a deflationary attitude and strategy towards
scientific representation, in analogy to deflationary or minimalist conceptions of
truth, or contextualist analyses of knowledge. [. . .] Representation is not the kind
of notion that requires a theory to elucidate it: there are no necessary and sufficient
conditions for it. We can at best aim to describe its most general features. . . .1
This does require some decisions about what we can and cannot take for
granted, to be used in this description, as already understood. While not
trying to define representation or to reduce it to something else, we will
have to place it in a context where we know our way around.
The first question to broach is this: how, or to what extent, is represent-
ation related to resemblance or likeness-making? This venerable question
occupies the first two chapters. Not all, but certainly many forms of repres-
entation do trade on likeness, likeness in some respects, selective likeness. That
is not what makes them representations; it is part of what defines them as the
sort of representation they are, and may figure in what constitutes success.
But even these tend to trade equally on unlikeness, distortion, addition. A
representation is made with a purpose or goal in mind, governed by criteria
of adequacy pertaining to that goal, which guide its means, medium, and
selectivity. Hence there is even in those cases no general valid inference
8 :
from what the representation is like to what the represented is like overall. Not
surprisingly, empiricist views of science will differ from scientific realist
views on where they locate the selective likeness and unlikeness.
The second question concerns perspective. The third chapter will exa-
mine and elaborate on the enigmatic but now oft-seen contention that
scientific representation is perspectival. In their general use, the words
‘‘perspective’’ and ‘‘perspectival’’ are largely metaphorical. The literal use
appears only when we assert, for example, that artists in the Renaissance
began to draw and paint in perspective.2 As I shall elaborate, the pertinent
point about this technique is Albrecht Dürer’s: drawing in perspective
is a measurement technique.3 The art of perspectival drawing is an art of
measuring. It is a technique for rendering a systematically selective likeness,
yielding information in desired respects, and it provides an initial paradigm
for measurement in general.
We are perennially plagued by the shifting uses and senses of even our
most common terms. In the long history of tension between physics and
astronomy before the modern period, ‘‘saving the phenomena’’ refers to
the appearances of the celestial bodies and their motions to the astronomer,
that is, in the outcomes of the astronomer’s measurements. Those celestial
bodies and their motions are one and all observable, unlike e.g. the
postulated crystalline spheres. But when Kant takes on this terminology of
‘‘appearance’’ and ‘‘phenomenon’’, he entwines their meanings so deeply
in his transcendentalist philosophy that we find ourselves as it were in a
different language. While I have not done so before, I will here make
a terminological distinction between these two words, though neither
will have Kant’s meaning. Phenomena will be observable entities (objects,
events, processes). Thus ‘‘observable phenomenon’’ is redundant in my
usage. Appearances will be the contents of observation or measurement
outcomes. The celestial motions of concern to the ancient or medieval
astronomer were all phenomena, in my sense, but Copernicus insisted that
they had confused what those phenomena are like with how they appear to
the earthly observer. Thus he distinguished, in my terms, the appearances
(in the measurements made by the astronomers) from the phenomena
(which they observed and measured).
Regimenting the terminology in this way, or in any way at all, will chafe
on some common usage. But it will align with other usage no less common.
For example, combustion, St. Elmo’s fire, lightning, and the aurora borealis
: 9
are all commonly named as phenomena; and whereas Giorgione was so called
because of his size, Mars is called the Red Planet not because of its color but
because of its reddish appearance as seen or photographed from the Earth.4
With this regimentation we will have available a distinct terminology to
honor the insight that what measurement shows is not directly what the
measured is like but how it appears in that particular measurement set-up.
There are other techniques besides perspectival drawing, such as those
of the cartographer, that can provide a paradigm for our conception of
measurement and modelling. Perspective comes in there as well, though in
a quite different way, when we examine the conditions for the possibility of use
of these representations, for prediction and manipulation in practice. The
lessons drawn from these seminal examples illuminate not only the painter’s
art but the construction and use of models in science and technology.
Much of Part I will come by way of prolegomenon to further study,
marshalling telling cases to illuminate representation—and what will be
its use? Firstly, to remove the blinders that could focus us naïvely on
the idea that what is represented is simply like what is presented in the
representation. Equally, to save us from the opposite error, of assuming a
total independence of the represented from the content of its representation.
Likeness in contextually selective fashion is important to scientific practice.
The world, the world that our science is of , is the world depicted in science,
and what is depicted there, is the content of its theoretical representations;
but there is less to this equation than meets the eye—and thereby hangs
a tale. . . .
This page intentionally left blank
1
Representation Of,
Representation As
Nelson Goodman was quite right to say so. The budget of examples
and counter-examples that prove it—we will look at some below—have
largely been with us since almost the beginning of philosophy. But there
must be a reason if the idea he disparages, that resemblance is crucial
to representation, is so persistently seductive. That perfect likeness is an
ideal pursued in visual imagery, at least, has much historical support. Pliny
described the painter Zeuxis’ grapes as so lifelike that birds tried to eat
them—while Zeuxis in turn was fooled by Parrhasios who painted a
curtain with such trompe l’oeil perfection that Zeuxis asked him to pull
the curtain aside in order to show his painting.1 We can cite the art of
our time as well: hyperrealism in recent painting, such as Donald Jacot’s or
Jacques Bodin’s, is surely admired in part for excellence of that kind. So
while representation cannot be equated with the presentation of a likeness,
and resemblance to what is represented is not crucial to representation as
such, resemblance does play a role inviting our attention.
12 :
length set on high. For Alcamenes his sweet and diligent strokes being drowned,
and Phidias his disfigured and distorted hardnesse being vanished by the height
of the place, made Alcamenes to be laughed at, and Phidias to be much more
esteemed. (Gombrich 1960: 191)
As Roger Shepard (1990) has studied and richly illustrated in our own time,
this point is general, and does not just apply to sculptors of images to be
seen from below. Even in an ordinary drawing, if you want two differently
oriented parallelograms to look congruent, you have to make one larger
than the other.
It seems then that distortion, infidelity, lack of resemblance in some
respect, may in general be crucial to the success of a representation.3 This
does not rule out that resemblance in some other respect may be required.
Yet even when that is the case—and it may be a special case—the choice
of those respects in which resemblance or a specific kind of distortion is
required, and those for which just anything at all will do, will have to be
seen as crucial as well.
There must be a cautionary tale here for how we are to understand
scientific representation. It may be natural to take a successful representation
to be a likeness of what it represents—but much hinges here on what
the criteria of success were when it was made. One sort of success is
precisely what Copernicus was taken to have as against Ptolemy: that his
theory displayed the real structure of the cosmos.4 In general though, can
we infer from success of a representation, in respects that we can directly
appreciate, to the conclusion that it bears a structural resemblance to what
is represented? The examples of how distortion may be crucial to successful
representation (in view of the purpose of the representing) should certainly
give us pause. But now we are running ahead of the story.
misrepresent her as draconian, but it certainly does represent her, and not
her sister or her pet dragon or whatever else she may have. Yet even if we
take the caricature to represent her because of some carefully introduced
resemblance there, we can declare it a misrepresentation by insisting that
it represents her as something she is not. A caricature may represent a rather
tall man as short (as a well known cartoon depicts Supreme Court Justice
Clarence Thomas as very small compared to the chair he occupies), but it
represents that man, and not someone that it resembles more as to height.
A caricature misrepresents on purpose, to convey a message that is clear
enough in context but is to be gleaned in a quite indirect fashion. My
two examples are both visual representations, but they are not visually
accurate, nor does their visual inaccuracy serve to produce a visually
accurate appearance to a properly placed eye (as in the Sophist’s sculptor’s
case), and yet, neither is their inaccuracy accidental.5
So distortion—departure from resemblance—which may be crucial to
accurate representation in certain cases, is in other cases the vehicle of
effective misrepresentation. Resemblance in some particular respect may
be the vehicle of reference: we recognize the caricature as being of Mrs.
Thatcher because of resemblance in certain respects. It may also be the
means of attribution or misattribution of some characteristic: we take the
caricature to represent her as draconian because of some likeness to a
dragon, which is actually an unlikeness to her. She is represented, and she is
represented as thus or so: the drawing is of her, and depicts her as thus or so.6
But a list of likenesses and unlikenesses does not tell us this much—why,
for example, is this not a caricature of a dragon as Thatcherian?
Let us look at another drawing, say, Spott’s drawing of Bismarck as a
peacock.7 Is this drawing a misrepresentation? There we broach a ques-
tion of truth that certainly is not settled in terms of visual accuracy or
inaccuracy—even if both reference and attribution were effected by select-
ive uses of resemblance and non-resemblance. The judgment whether
this is an illuminating caricature that conveys a truth about him, or
amounts to a falsehood, a lie, a misrepresentation, is not settled by geo-
metrical relations between the line drawing and Bismarck’s appearance to
the eye.
If we just focus on resemblance in some respect as the core notion in
representation then it is at best puzzling that distortion might be needed
for effective representation. But if the resemblances are just a means to an
, 15
end there is no puzzle. The sculptor wants the object he makes to have a
certain use, and he chooses the way in which the proportions of the object
are related to those of the original—the ways in which they are like and
the ways in which they are unlike —so as make that use possible. There have
(of course!) been efforts aimed at naturalizing representation, such as Fred
Dretske’s account of information-bearing as correlation, but these tend to
founder specifically on the issue of misrepresentation.8 Misrepresentation is
a species of representation. If the relationship ‘X represents Y’ were to lie in
a resemblance or correlation or other such structural relation between the
two, what would misrepresentation be?
Suppose I say that the caricature that depicts Mrs. T as draconian
misrepresents her. Then my assertion has as first part that it does represent
Mrs. T as draconian, but as second part not only that it is unlike her with
respect to shape, but that it depicts her as something she is not. To say
that it misrepresents her with respect to shape is to say that rather than
resembling her it depicts her as resembling (being like) something which,
according to me, she is not like. To say that it is a caricature, however, is
to say that it purveys an interpretative attribute, something that the picture
can convey only by drawing on a social context, not just on what is ‘in’
the picture taken by itself.
So what is accuracy? The evaluation of a representation as accurate or
inaccurate is highly context-dependent. A subway map, for example, is
typically not to scale, but only shows topological structure. Relative to its
typical use and our typical need, it is accurate; with a change in use or
need, it would at once have to be classified as inaccurate. Similarly, in one
political context, or relative to a certain kind of evaluation, a caricature may
rightly be judged to be accurate, in another misleading or blatantly false.
picture of by looking at how things are depicted in it! But there are limiting
cases where this can be drawn into doubt.13
Undeniably, though, Goodman has brought into the limelight the strong
analogy: a picture is a picture of something, and depicts that something as
thus or so, and so is in that respect similar to how a verbal description is a
description of something, and describes that something as being thus or so.
We need now to see how we can go beyond this analogy.
One way in which Goodman did bring in resemblance was through
the intricate notion of exemplification. If a hardware store clerk or interior
decorator shows you color swatches or fabric samples, those exemplify the
property of interest in the following sense: they both have and refer to that
property (Goodman 1976: 45–68 ; Elgin 1996: 171–83). Obviously their use
is to represent to you what your wall or floor will look like if you choose
the corresponding paint or carpet. Representation by exemplification
involves likeness, but much more than likeness. In these examples visual
resemblance plays a role: in fact, the color swatch has the same color,
and in that crucial respect resembles, the paint that it represents to the
customer. This only works if the relevant visual resemblance is highlighted
in some way, has a status unlike that of the many other resemblances—that
is the point of taking exemplification of the relevant property, rather than
mere instantiation, as the vehicle by which representation is achieved.14
The relevant visual resemblance must be highlighted in some way in that
context, so as to bestow that status—here we have strayed from semantics
into pragmatics.
But do not equate even this beautifully articulated relationship with
picturing or imaging! As Goodman later pointed out, the word ‘‘word’’
both refers to and has the property of being a word, so it exemplifies that
property, but is not a picture or image. (Goodman 1987–8: 419)
Asymmetry of representation
There is an asymmetry in representation that resemblance does not have.
This is a much repeated point, made to show that resemblance is not the
right criterion for representation. Resemblance could not be the crucial
clue to representation, it is said, for even if representation did require
resemblance to its target, the target would then resemble its representation
but not represent it. While resemblance is indeed not the right criterion,
the argument from asymmetry is not all that strong.
18 :
First of all, we do tend to use terms like ‘‘resemble’’ and ‘‘looks like’’ and
such asymmetrically in certain contexts, because the subject of a statement
tends to select the focus or contrast. Of Rosemary’s baby they said ‘‘He has
his father’s eyes’’. Hard to think of their saying about the father ‘‘He has his
baby’s eyes’’. The retort may be that literally the noticed resemblance goes
both ways; but ‘‘literally’’ may here just mean ‘‘if you ignore the context’’.
Probably we can find a context in which someone may, without oddity,
assert not that a given picture is an exact likeness of me but that I am its
exact likeness, or something to that effect. But that would not show that
resembles is symmetric in general, let alone in every context in which it
comes in.
That literally resemblance must go both ways—that literally speaking it
is both reflexive and symmetric, while representation is neither—is most
likely based on the simple idea that to resemble (in some respect) is to have
a property (of the pertinent kind) in common. But that is too simplistic
a construal anyway. The production of a photo involves a ‘collapsing’
of shades of color and of three-dimensional spatial structure into two
dimensions. If that counts as pertinent resemblance, then this is a relation
of homomorphism rather than isomorphism, yet central to modeling. That A is
a homomorphic image of B certainly does not entail that B is that of A.15
But this contrary point too is weaker than it seems. If A is a homomorphic
image of B then there is a reduction of B, modulo some equivalence
relation, to which A is isomorphic—so we might say that in this respect
B resembles A just as A resembles B. So there is no strong argument,
as far as I can see, based on any clear asymmetry to banish resemblance
from our topic, nor one to make it relevant to representation in general.
What does remain, as needs to be emphasized, is that certain modes or
forms of representation (but not all) do trade on selective (and not arbitrary)
resemblances for their effect, efficacy, and usefulness, and that this typically
goes in one direction only.
I quite agree with you [Cratylus] that words should as far as possible resemble
things, but I fear that this dragging in of resemblance . . . is a shabby thing, which
has to be supplemented by the mechanical aid of convention with a view to
correctness. (Cratylus 435c)
[ . . . ] in pictures you may either give all the appropriate colors and figures, or
you may not give them all—some may be wanting—or there may be too many
or too much—may there not?
[ . . . ] And he who gives all gives a perfect picture or figure, and he who takes
away or adds also gives a picture or figure, but not a good one. (Cratylus 431c)
[ . . . ] I should say rather that the image, if expressing in every point the entire
reality, would no longer be an image.
Let us suppose the existence of two objects. One of them shall be Cratylus, and
the other the image of Cratylus, and we will suppose, further, that some god makes
not only a representation such as a painter would make of your outward form and
color, but also creates an inward organization like yours, having the same warmth
and softness, and into this infuses motion, and soul, and mind, such as you have,
and in a word copies all your qualities, and places them by you in another form.
Would you say this was Cratylus and the image of Cratylus, or that there were
two Cratyluses?
Cratylus I should say there were two Cratyluses.
20 :
Socrates Then you see, my friend, that we must find some other principle of truth
in images. . . . Do you not perceive that images are very far from having qualities
which are the exact counterpart of the realities which they represent? (Cratylus
432a–d)
That Cratylus does not grant that the copy, made by the god to duplicate
Cratylus entirely, is an image of Cratylus shows at the very least that
resemblance is not sufficient to make for representation. But the example
shows much more—we need to explore it in detail.
What’s in a photo?
Just to assert of something that it is a representation, or that it represents, or
that it represents something, is woefully elliptic and invites obscurity and
confusion. Our full locution must in the general case be at least of form ‘‘X
represents Y as F’’, as in ‘‘the caricature represents (depicts) Mrs. Thatcher
as draconian’’.16 Here X is a representation, Y its referent, and F a predicate
that X depicts Y as instantiating. The ‘‘as . . .’’ locution is fascinating, a
difficult topic in the analysis of language, and we will need to carefully
distinguish how its intensionality is connected with the relationality and
intentionality of representation. But this form too is still too brief to allow
all needed distinctions.
Why didn’t Cratylus agree that the god would have made an image of
him? The god would, in Socrates’ example, have made a perfect replica,
more perfect than any statue made by a human sculptor. The replica would
certainly, if properly displayed, have created the appearance of Cratylus’
being there. So as far as the later classification in the Sophist goes, the
god would have acted both as likeness-maker and as appearance-creator.
Yet Cratylus demurs: the god would not have made an image of him
but would have made ‘another Cratylus’. What are we to make of this
intuition?
A more contemporary example will show the inherent ambiguity in play
here about what represents what, with what, and to whom. Imagine: I have
acquired a famous photograph of the Eiffel Tower, Au Pont de l’Alma by
Doisneau. It hangs on my wall, but I scan it and print the scanned image.
This print is an image too—what does it represent? The Eiffel Tower seen
from the Pont de l’Alma, or the famous photograph?
, 21
My point is not just that what represents what is relative to a system of represent-
ation. Rather my point is that you can’t tell for sure whether you are looking at
a representation at all just by looking. . . . One has to determine how the thing
functions. (Ibid.: 512)19
Relativity to systems, ‘languages’, recalls Goodman of course; and this
applies very well to pictures, as is well enough understood when different
styles of representation are studied.20
This notion of system, or of function if understood as a role in a
system, sounds still quite impersonal, but we must understand it in terms of
pragmatics, referring to contexts of use, broadly construed. The context-
sensitivity does not go away when (in different ways) Nelson Goodman
and Ned Block say that you need to know which system of representation the
item belongs to. For since the same item will in different contexts belong to
different such systems, we then need the relevant contextual factors which
determine that.
What is really to be emphasized here is the way in which individuals and
groups, though relying on some pre-existing communally understood form
of representation (a universal qualification of all communal activity), create
new representations and new modes of representation. When Descartes
created his method of coordinates, it is not as if he was just using an
already extant way of representing spatial shapes and motion. But it is
true that in his initiative, to use known numerical equations in this way,
he bestowed a role on already familiar equations that they had not had
before. Unlike a moment’s poetic depiction quickly lost to history, this
act engendered a mode of representation fundamental to all subsequent
science.
That said, I will just write ‘‘use’’ for use, make, or take understood in this
sense.
If that Hauptsatz is understood in this sense, then it places some immediate
limits on the range of representation. Firstly, at least if taken entirely literally,
it has no room for the notion of mental images or mental representations,
whether taken to be brain states or something more ephemeral—for no
such things, if they exist at all, are used or put to use, or taken in one way
or another.23 At least, not in the relevant sense: we can conceive of a
brain surgeon bestowing a representational role on the patients’ brain states,
but not of a person bestowing roles on his or her own brain states—or,
presumably, on whatever could count as mental states.24
Secondly, this conception leaves no room for ‘representation in nature’,
in the sense of ‘naturally produced’ representations that have nothing to
do with conscious or cognitive activity or communication. The Eleatic
Stranger gave a whole series of examples of copies, but it is not clear that
they are all images in the sense of representations:
Things in dreams, and appearances that arise by themselves during the day. They’re
shadows when darkness appears in firelight, and they’re reflections when a thing’s
own light and the light of something else come together around bright, smooth
surfaces and produce an appearance that looks the reverse of the way the thing
looks from straight ahead. [ . . . ] And what about human expertise? We say
housebuilding makes a house itself and drawing makes a different one, like a
human dream made for people who are awake. (Sophist 266c–d)
Most shadows and reflections that occur in nature are not being used or
taken, let alone made, by anyone to do anything. (The Balinese shadow
puppet theater is an exception.) So by our Hauptsatz, they are generally
not representations. Nor is the track left in sand by an ant in some desert
long, long ago in a galaxy far, far away. . . . not even if it has the shape of
our word ‘‘Coca Cola’’.25
A black mark on a rock does not refer, represent, or mean anything
unless it has a role, or has bestowed on it a role, in some practice—no
matter whether it is a simple stroke or a complex pattern. Nor is it sufficient
that it has the sort of shape, coloring, etc. that would place it in a certain
role if encountered or produced in a certain cultural context, by persons
belonging or assimilated there, if in fact it does not bear any relation to such
a context.
, 25
But a natural object can represent, just as it can play other roles, namely
if we bestow such a role on it. Imagine I am using a stone, found on the
ground, to hammer in a tent peg. I am using it as a hammer—it is my
hammer now, I have bestowed the hammer role on it. The hammers we
buy, in contrast, are manufactured precisely to play this role—they are
manufactured artifacts. The stone was not made for that, and it is not an
object that I created, constructed, or assembled. Nevertheless it is now a
physical object with a function—that is to say, an artifact. There is an
analogy here to ‘objets trouvés’, natural objects ‘made’, without physical
modification, into works of art. All of this applies mutatis mutandis when I
use the stone to represent, for example, a certain stateman’s heart: I bestow
a role on the stone for it to play, I give the stone a function for it to serve.
What is in a photo? What is in a picture? This question has the misleading
form of ‘‘What is in a box?’’ We won’t get much further by taking this
form at face value and giving an answer with the correlative form, such as
‘‘What is in the representation is its content’’. That is just a verbal answer,
conveying nothing by itself. To call an object a picture at all is to relate it to use.
As an analogous example we can think of Herbert Mead’s reflections
on the teacup (McCarthy 1984). If there were no people there would
be no teacups, even if there were teacup-shaped objects. For ‘‘there are
teacups’’ implies ‘‘there are things used to drink tea from’’ which in
turn implies ‘‘there are tea-drinkers’’. By ignoring the contextuality of
representation, the fact that we are dealing with about-ness, and that what
the representation is about is a function of its use, we could land ourselves
in useless metaphysical byways. If we were to ask ‘‘What is in a picture?’’
while taking the picture simply to be the physical object and with no
relation to anything that can bestow meaning, the answer would have to
be ‘‘Nothing!’’
The notion of use, the emphasis on the pragmatics rather than syntax
or semantics of representation in general, I will give pride of place in
the understanding of scientific representation. But does that exclude too
much? That a particular person at a specific time uses or takes or presents
something to represent something else is a very local event. Could it really
be a general condition on representation that something so specific has to
happen? In a comment on similar ‘‘intentional’’ views of what constitutes
representation, Mauricio Suarez suggests that it will hamstring the idea that
theories represent:
26 :
But the point is quite general. The spelling out can only go so far, because
the notion of representing has (suffers from?) variable polyadicity: for every
such specification we add there will be another one.
the lowest point is not always the same; when it is, so are the immediately
succeeding locations.
All these features are still in the domain of kinematics. Descartes’s
famous aim for mechanics was that it should be deterministic but only
have ‘quantities of extension’, that is, kinematic parameters. When Leibniz
and Newton each came up with counter-examples, they also introduced
new, non-kinematic (dynamic) parameters to ‘‘fill out the picture’’. The
apparently indeterministic kinematic behavior is embedded in a model
that has additional parameters—such as masses and forces—which is
deterministic after all.
These examples provide the pattern for ‘hidden variable’ interpretations
of apparent indeterminism. Reichenbach argued that such added parameters
might not correspond to anything real, and that physics could forego
satisfying the demand for determinism. But as a good empiricist, he offered
this as methodological advice: neither demand them nor ban them from
modeling. The touch stone would be the usefulness of such models for
empirical prediction.
For us, the point here is simply that, for a given purpose, the best repres-
entation might well be one that embeds its target in a larger structure. So we
can add addition of ‘surplus structure’ to distortion and the trading on selective
unlikenesses, to our catalogue of means for representation achieved by
departures from mimesis. Hence, again, there is no universally valid infer-
ence from what the best representation is like to what the represented is like.
In conclusion
A scientific, technical, or artistic representation is an artifact. As such, it is
both an object or event in nature, that we can regard purely through the
physicist’s or chemist’s or mathematician’s eyes. But it is at the same time
something constituted as a cultural object, through its role or function,
bestowed upon it in practice. Just what the representation is, or what
is represented and how, is not determined entirely—and often enough,
hardly at all—either by what is ‘in’ the natural object or by its physical or
structural relations to other things.
When resemblance is the vehicle of representation, for example, the
representation relation derives from selective resemblances and selective
, 31
Modes of representation
Everything resembles everything else in many ways, so an effective use
of resemblance must always be selective. This can only be effective if
the selection itself is understood or conveyed. That point applies more
generally: even if resemblance is not the vehicle, whatever features of the
representing entity are instrumental to the representing must be somehow
highlighted there. It is not sufficient for the representor just to have them!
Secondly, even if we recognize a role played by a resemblance, this
resemblance need not be with respect to any visually or even perceptually
detectable features. Additionally, resemblance need not consist in sameness
of properties, but can also be at higher levels. This was a theme emphasized
and elaborated especially by Wilfrid Sellars in his discussion of theories
34 :
Mathematical statuary
It will have been obvious that the criteria for distinguishing pictures from
other visual representations do not imply two-dimensionality. The word
‘‘picture’’ no doubt connotes, in common usage, representation on a
plane surface. But a statue, as we saw at once in the Sophist example, is
subject to the sort of distortion practiced in painting to produce a visually
faithful image. There is necessary occlusion and the statue is explicitly
non-committal with respect to certain features that we don’t even think
about in the case of painting. That is easily seen when we compare, say,
the Venus of Milo with the Anatomical Venus in Florence’s Museo della
Specola.
, , 41
This Proposition is called Snell’s (or Snel’s) law after its discoverer in the
seventeenth century.21
But now let us look at a range of phenomena that lies just where
reflection and refraction compete. (Today it is easy to see this illustrated
on the internet by computer simulation.22 ) Light is refracted if it strikes
the surface at a shallow enough angle; but it is reflected, if it arrives at a
sufficiently steep angle. What happens when a light source is moved so as to
change the angle of incidence? Precisely where does the one phenomenon
end, or the other begin?
In the diagram, let QP equal PR, but consider various values for the
angle QPQ . Let Q move downward toward the water surface; the angle of
refraction away from the normal becomes larger, R moves up and to the left
in our diagram. But noticing that the distance Q P is always larger than R P,
what happens when Q and Q coincide? What happens then as the source
moves still further? The only answer within this theory is that the light is at
some point (at the critical angle) no longer refracted but completely reflected,
, , 45
and that when that happens there is suddenly a big jump to a distance
below the surface—a singularity, a discontinuity in nature! Perhaps even a
contradiction in the theory given the traditional principle accepted, at the
time when Willebrord Snel formulated his law, that ‘nature makes no leaps’.
Is there really such an enormous discontinuity in nature at this point? At
the level of observation open to a swimmer or fish, this phenomenon can
be found. But extrapolation to what happens ‘in the small’ is not valid; this
is just the point where the model gives out, where the idealization reaches
its limit of admissibility. The phenomena for which geometric optics works
do include the more easily observable ones studied early in the history of
optics. But there is no infinitely thin precise demarcation between water
and air, and in any case, ignoring the wave character of light will only yield
adequate results even for the observable phenomena in a limited range.
could ask whether all functions are analytic or even continuous, and could
contemplate negative answers thereto, that controversy was long past.
To go even further, after the development of measure theory Birkhoff and
von Neumann pointed out that when classical mechanics solves problems
about systems with given precise configurations, we can construe it as using
conveniently simplified descriptions. More realistic, they suggested, would
be the description that results if we transform the precise descriptions by
identifying regions that differ only by sets of measure zero.27 Their reasons
for thinking of that as more realistic may or may not be cogent, but it
suffices here to note the conceptual possibility. That is, after the time of
Lebesgue we can look back to the older description of nature and we
have the new option for how to conceive mathematically of the shapes of
things.28
You will realize that I am simply giving examples of how, in many
ways, we must in retrospect look upon the scientific image inherited from
the older generation as open, vague, ambiguous in the light of our new
understanding (that is: in the light of alternatives not previously conceived).
What is the shape of this beer glass really? What was it in the Galilean,
Cartesian, Newtonian scientific image? In each case the presupposition that
it was one item in a certain class gives way to (i) the conditional that this
was so if that shape was correctly represented by some item in that class,
and (ii) the realization that there are other candidates.
As it is for shape, so it is for each primary quality, represented by the
mathematics of the continuum. Indeed, we need to cast our net more widely
still, if we want to find all the ways in which we could now understand
the scientific image fashioned in the seventeenth century. There is no such
thing as the classical continuum, if that is meant to be the continuum on
which the classical (= modern, 17th century) scientific image was erected
originally. Cantor, Brouwer, and Weyl had equal right to regard it as
erected on their continua, which are very different. Of course, today we
will use ‘‘the classical continuum’’ to refer to the subject of real number
theory as it now exists in main stream mathematics. That is the politics of
linguistic usage. But even in what we now call classical mathematics, recall
that we have the option of saying that quotient constructions are more
accurate (and the simple use of real numbers merely a convenient artifice),
as Birkhoff and von Neumann suggested. What would you like the shape
of the beer glass to be?29
48 :
So, what is the shape of the beer glass in the scientific image? What is
meant by the assertion that its shape is one of the surfaces in the mathematical
representation of nature?
The openness of scientific description here come to light is irremediable.
Of course, every time we outline a range of alternatives for ourselves,
we can ascend our private throne—are we not all kings and pontiffs in
realms of the mind?—and assert that one of these alternatives is the one
true story of the world. When the range of alternatives is refined by
new conceptual developments—or simply by having our attention drawn
upward by logical reflection—we can choose a new option and make yet
another declaration ex cathedra. Arbitrary perhaps, but as definite as can be,
by choice. What we cannot pretend is to be non-arbitrary, or to close
our text once and for all. Yet the form of understanding is always one of
presumed objectivity and univocity. The scientific image is as replete with
uncashed and ultimately uncashable promissory notes as the manifest image.
Any practical context brings its own standards of appropriate precision, so
it is neither proper nor practical to keep this open-endedness constantly
salient, but to acknowledge that is not to deny it.
that among women, the productivity was less for workers in ill-lit and
ill-ventilated spaces than elsewhere. She also showed that among men, the
productivity was less for workers in ill-lit and ill-ventilated spaces than
elsewhere!
So relevance of working conditions did not show up until there was
a subdivision by this third factor (gender). How is this possible? That
is precisely Simpson’s paradox: correlations can be washed out, or on
the other hand brought to light, by averaging in different ways. Here
is the solution to the puzzle: under all conditions the women were more
productive than men working under the same conditions, but the women
were predominantly working in poor conditions.30
The first statistician abstracted from gender, and by this very means
produced a ‘picture’ which was misleading, and even conveyed a falsehood.
The appearance for him, that is, the outcome in his measurement set-up,
has to be assessed as ‘‘how it looks in that set-up’’ or ‘‘relative to that
set-up’’. His abstracting from gender would have been fine if that factor
had been irrelevant to level of productivity under various conditions. On
the other hand, we say now that it was relevant only because at a lesser level
of abstraction in this respect, the correlation is different. (Mind you, there
is no guarantee that subdividing further, by other features besides gender,
won’t undo the correlation again!) In this particular case we can say that
by abstraction he produced a distorted picture of the reality, a picture in
which lighting and ventilation were irrelevant to productivity. Abstraction
is harmless only under very strict conditions of pertinence.
A presumption that this is always possible, and that the relevance will be
transparently perceivable, has had a strong grip on the physical imagination.
The idea of testing a hypothesis at a different scale tends to be immediately
convincing. Think for example of the experiment proposed by Galileo
concerning buoyancy. The conventional wisdom, which he disputed, was
that a ship will ride higher in the water in the open sea than in port, due to
the amount of water below it. This is difficult to test directly because of the
choppy waves on the high seas. So Galileo proposed to place a small vessel
in a shallow tank and load it with lead pellets until the addition of just
one more pellet would make it sink—and then to repeat this procedure
in a much larger body of (quiet) water.32 Doesn’t this proposal design a
definitive test of the hypothesis?33
To take a more critical attitude, we must recognize that for any concept
there are boundary cases where guidance by our usage so far dwindles and
eventually gives out. Scale modeling displays the characteristics of picturing,
by relying on selective resemblance to achieve its aim, but in a way that is
subject to inevitable occlusion or distortion.
Scaling as picturing
A scale model represents, and yields information about what it is a
model of, by selective resemblance. Are there such necessary limitations in
that case, analogous to occlusion, marginal distortion; is scaling explicitly
non-committal?
Consider a scale model of an airplane. It has the same shape overall, but
with the proportions reduced by a multiplicative factor, say 0.0001. Will it
fly? Not necessarily. If it is to fly, to mention just one factor, it must have
something to propel it; but its size limits necessarily what that can be. For
example, the relation to air resistance will be quite different at this scale:
the air, after all, has not been similarly scaled down in any way! There
are other reasons, as we will see below—‘the same shape’ is a deceptively
simple concept.34
Scale models can be produced for the sheer aesthetic pleasure of it, but
more typically they serve in studies meant to design the very things of
which they are meant to be the scaled down versions. This use and its
subtleties were brought out clearly in the Second Day of Galileo’s Two
New Sciences.35 His calculations involved an error, but his principles were
, , 51
transformation. If we choose the numbers 100, 1000, 1 for this role, we define
the MKS system, with the units meter, kilogram, second—thus producing
another system belonging to the same class of systems of units. The basic
invariance requirement is now that to be significant, an equation must have
the same form regardless of which member of the class of systems of units
is chosen. This requirement was obviously respected well before Fourier,
let alone before the subject of dimensionless analysis matured. Newton’s
famous F ≈ ma does not depend on its validity on a particular choice of
units, and would not be famous if it did.42
A dimensionless number —more accurately speaking, a dimensionless para-
meter —of the class is a quantity that has the same value in every system of
units in the class. That is, it is an invariant of the set of admissible trans-
formations, which are precisely the scale transformations. The Hauptsatz of
dimensionless analysis, prominent in Buckingham’s article, says in part that
it is always possible to shift to a dimensionless representation.43
immersion of the propeller stay the same. Any set of kinds of quantity that
furnish the basic units for this dynamics can be changed in any ratios what-
soever without affecting this. How does this serve to guide a practical study?
The obvious thing to do is to make the smaller propeller geometrically
similar to the original, to immerse it similarly, and to construct the propellers
so that the ‘angle of attack’ of the blades on the water is the same. Is that
enough to ensure that we can get information about the thrust of the
original large propeller from the behavior of its scale model? It suffices only
if one can completely control similarity in the effects of gravity, density,
and viscosity. It is easy to see that for extremely small propellers those
effects will be significant, and for larger but still small ones the difference is
after all only a matter of degree.
So in practice . . . one has to resort to some approximation. And here
special conditions can be experimentally investigated to see what can and
what cannot be ignored at various scales. For example, the pertinent
mechanical behavior in very turbulent motion does not vary much with
the viscosity of the fluid. And similarity in the effects of gravity will be
approached when the ratio between the two speeds of advance approximates
the ratio of the squares of the two diameters. Deep immersion will also
prevent significant effects due to disturbance of the liquid surface. (Notice
though that these are all matters of degree, and the purpose at hand
may require a specific level of accuracy.) With all of this supposed under
sufficient control, the ratio of the thrust of the small propeller to that of the
large one will be (DS/D S )2 , where the primes indicate the diameter and
speed of advance of the large propeller, and not (DS/D S ), that is, not the
ratio by which this product was altered in construction.
The first clue to the answer is of course that the two problems are
essentially the same, related by a simple rotation from horizontal to vertical.
The second clue is also geometrical: in a right triangle, the ratio of height
to base is determined by the ‘‘angle of sight’’ along the hypotenuse. At this
point today’s student reaches for trigonometry, but these practical problems
were solved long before that was available.
By the end of the Roman Empire in the fourth century much of Greek
mathematics was lost to the West, not to return there from Arabic sources
until almost 800 years later. During these centuries practical surveying, archi-
tecture, and the scholarly study of practical geometry did continue, however.
The practical techniques of the Roman surveyors survived, were preserved,
collated, and taught among both artisans and cleric-scholars.3 A representat-
ive text, the practical geometry manual of Hugh of St. Victor, in the century
just before Euclid’s Elements became available again, is divided into three
parts: altimetry, planimetry, and cosmimetry (measurement relating to the
earth, to the sun, and to other aspects of the cosmos). In retrospect we see the
methods there presented as justifiable within geometry and geometric optics,
but what is taught there is simply the practical technology of measurement.
The instruments designed for this use—cross staff, quadrant, and even
the astrolabe introduced into the West about a century before this manual’s
date—consist basically of a ruler with a sighting device (alidade) at the cen-
ter, and part of a circle on which degrees are marked. These had significant
use in navigation, but let us here concentrate on land-measurement.4 The
surveyor measuring the height of a tower, for example, adjusts the alidade
until he can see the top through the two apertures. The angle thus formed
determines the ratio of the height of the tower to the distance from the
tower. Determines how?
Though most of the mathematic theory was lacking, the astrolabe can
be manually calibrated on relatively small similar triangles. This presumes
understanding of geometric similarity; sufficiently much of Euclidean
geometry was retained to understand this.
The distance from the tower may itself not be measurable directly if it
is far away, so Hugh’s manual gives several forms of ‘two station’ methods
to use. Suppose that the astrolabe sighting is done at two points P and Q,
at an unknown distance from the tower. Measure the distance between P
and Q, and a few practical steps, starting with the two alidade readings, will
yield the height of the tower:
62 :
Alberti’s De Pictura
When Alberti wrote his monograph on the technique of perspectival
drawing in 1435 a great deal of Greek mathematics and geometric optics
had been assimilated in the years since Hugh’s manual of practical geometry.
But his way of writing was not so different from Hugh’s, because Alberti
was also a surveyor applying those practical arts as well as thinking about
the theory behind them.6 Since the practical geometry manuals focused on
geometric figures created by physical objects and lines of sight, they were
an obvious source for his study of perspective. His great innovation was to
think of the visual cone or pyramid cut by the picture plane:
[Painters] should understand that, when they draw lines around a surface, and fill
the parts they have drawn with colours, their sole object is the representation
on this one surface of many different surfaces, just as though this surface which
they colour were so transparent and like glass, that the visual pyramid passed right
through it. . . . (Alberti 1991: Book I, 48)
63
Notice how the left-hand diagram is really just the one above of the
‘‘two station’’ altimetry, flipped horizontally, but with the picture plane
and some other sightlines added. The painter’s eye corresponds in the
geometry to the top of the tower of the earlier illustration. So the point of
view is the opposite, as it were, but the geometry involved is the same.
Alberti’s concern was with technology. He began his development of
perspectival drawing techniques by making a box with a small eye-hole
in one side.7 In the box there was a checker board laid horizontally on
the bottom. Let us call what the checkerboard looked like, if viewed
through the peephole, the checkerboard appearance. (That is to say, the
box floor’s appearance in observation or measurement made through the
64 :
For an illustration, consider two cars rolling along a broad one-way road,
one on the left and one on the right. The road consists of equal squares,
and the two cars reach the end of each square simultaneously. We can
imagine a spatial frame of reference that has the left border of the road as Y
axis and the orthogonal border of the first square as X-axis. The two cars
are moving in parallel, with the same velocity, and are thus depicted in a
geometric representation of this situation.
Now a painter sets up his easel at beginning of the road, and he has his eye
precisely on the Y axis. He sees the two cars ready to start, the time is t = 0.
The left and right borders of the road are the lines X = 0 and X = 1. Of
course the painter sees these converging in the distance. To picture this, let
us modify the diagram showing the checkerboard floor drawing. The line
X = 0 is straight, orthogonal to his easel, but the line X = 1 slants in the X-
negative direction, meeting X = 0 on the horizon. The cars begin to move.
He sees them reach each horizontal line simultaneously. But the right-hand
car is moving along the hypotenuse of a perceived triangle, so covers a larger
perceived distance than the left-hand car in the same time interval.
Within the painter’s perspective, the right hand car is moving faster than
the car on the left. If he were making a motion picture, or simply taking
notes of where in his visual field the cars are at t = 1, t = 2, etc. he
would be making a measurement of the velocities, but the content of his
measurement would be what the motions look like and not what they are.
Knowing the geometry of this space and the laws of projection, he can of
course draw on registered relations between the two perceived motions to
69
obtain information about what the motions are really like. These motions
are observable processes, they have determinate speeds and directions which
are in fact the same, although the appearances (which are the contents of
the measurement outcome) are different.12
Examples . . . lead to the conjecture that . . . the same laws of electrodynamics and
optics will be valid for all coordinate systems in which the equations of mechanics
hold. . . . We will raise this conjecture (whose content will hereafter be called
‘‘the principle of relativity’’) to the status of a postulate and shall also introduce
another postulate, which is only seemingly incompatible with it, namely that light
always propagates in empty space with a definite velocity V that is independent
of the state of motion of the emitting body. These two postulates suffice for the
attainment of a simple and consistent electrodynamics of moving bodies based on
Maxwell’s theory for bodies at rest. (Einstein 1905/2005: 124)
This shift from the ‘God’s eye view’ to a frame of reference identified
with an observer equipped with clock and measuring stick continues in the
well-known thought experiment he then presents for the measurement of
a moving rigid rod. We now inquire about the length of the moving rod,
which we imagine to be ascertained by the following two operations:
(a) The observer moves together with the aforementioned measuring
rod and the rigid rod to be measured, and measures the length of the
rod by laying out the measuring rod in the same way as if the rod to
be measured, the observer, and the measuring rod were all at rest.
(b) Using clocks at rest and synchronous in the rest system . . . , the
observer determines at which points of the rest system the beginning
and end of the rod to be measured are located at some given time t.
The distance between these two points, measured with the rod used
71
before—but now at rest—is also a length that we can call the ‘‘length
of the rod.’’ (Ibid., 128)
Not long after Einstein’s creation of this new theory, Minkowski recognized
that Einstein had in effect displayed an elegant new mathematical entity,
distinct from those utilized in classical models. Light paths are to be
represented by curves in this space along which the space-time interval
equals zero . . . motions of bodies by paths on which the points have
time-like separation . . . and so forth. That is to say, according to the new
theory, we are to use this mathematical object in that way to represent the
natural phenomena in this domain.
At this point observers and frames of reference are left behind. Neither
perception nor individual cognition is a salient topic of inquiry in the
context of use of Minkowski space for the representation of rigid motion,
electric current, magnetic field, and transmission of light. Frames of ref-
erence can be thought of as attached to any material body or to none;
certainly talk of relatively moving observers is de trop.
But if we focus on the theoretical models in and by themselves, we
are ignoring the use they have. If someone is to use Einstein’s theory to
predict the behavior of electrically charged bodies in motion, bodies with
which s/he is directly concerned, choice of a coordinate system correlated
to a defined physical frame of reference is required. The user must leave
the God-like reflections on the structure of space-time behind in order to
apply the implications of those reflections to his or her actual situation. The
physical world picture in abstracto is as far removed as possible from this use,
it embodies, in Eddington’s words, ‘‘the view of no one in particular’’. But
to put this picture to use, something must be done by the user, and this is
where choice of reference frame comes in. Hence Weyl’s words are equally
apt, when he refers to coordinate systems as ‘‘the unavoidable residuum of
the ego’s annihilation’’.15
In sum then, the use of ‘‘perspective’’ and ‘‘perspectival’’ in connec-
tion with depictions of events in varying frames of reference cannot be
banished completely. There is a close connection between them and
the acts of observers locating themselves with respect to the theoretical
model—acts of self-location, expressible by the actor in such terms as
72 :
‘‘I am here, and this is how it looks from here’’. I will come back to
this after we have looked at some more general examples of use and
self-location.
distances between adjacent points are equal, so the cross ratio is 4/3—a
special case that is easily visualized and reproduced by the painter.21
Take any point P and draw lines connecting P with the four points in
question. It can be quickly established that the cross ratio is the same for all
projections of those four points onto other straight lines.22 In the diagram,
where point P now plays the role of ‘eye’, CA/CB : DA/DB = CA /Cb
: dA /db, and similarly for other sections of the pencil of projection lines.
This picture we will revert to below to show how taking a photograph can
be used to make a distance measurement.
The invariant content of a perspectival image is the structure ‘common’
to the images produced on any ‘window’ that cuts the pencil of projection
lines. That ‘common’ structure is the structure entirely captured in a
description of the cross ratios between such sequences of collinear points, in
addition to the basic relationships of incidence and order of points and lines.
It is this invariant content that carries the information in the perspectival
image that is independent of the choice of origin and orientation—except
of course that no one ‘window’ can contain an image of more than a finite
part of a selected half of the space ‘on its other side’.
I can express it by pointing to a spot on the map and saying ‘‘I am there’’—a
self-ascription of location on the map.
It is not as if there is an object or event that is indescribable, ineffable,
beyond the reach of objective or impersonal description. This act of self-
description too can be described and the information can be included on a
bigger map (with the label ‘‘location of vF’s map-reading at time t’’). But
then what I need to use this new map is still a self-ascription of location
with respect to it. The problem of practical use has not been altered.
With this new map, I can go on to self-ascribe a location by the different
words ‘‘I am vF and it is now t’’. An attempt to replace or eliminate
these self-ascriptions leads to an infinite regress, using an infinite series of
maps. But even given the accuracy of the whole infinite series of maps, the
regress does not succeed in eliminating the need for self-ascription. For I
will still be lost, unless I can locate myself with respect either to at least
one of these maps or to the series as a whole, and this I can only do by
asserting a self-ascription which is not deducible from the accuracy of those
maps.
Besides self-location in this narrow sense of ‘placing oneself ’, orientation
with respect to direction is equally crucial. An example will immediately
help to continue our discussion of measurement from magnitudes (distances)
to less metric features. Imagine being in New York with a city map. You
go on the subway; at one stop you check the station sign, and it says ‘‘18th
Street’’. This allows you to point to a spot on the map, labeled as 18th
Street station on the line. You exit and find yourself on 18th Street, at
the corner of 7th Avenue; but which way is West? Being unfamiliar with
the buildings, and too far from the Hudson to see it, you walk one block
along the street to see the name of the avenue that crosses it there. Is it 6th
or 8th? Now you are oriented with respect to direction too. This involved
two measurements: checking the station sign, or the initial street signs as you
exited, located you in a spot on the map. Walking one block to take a new
sighting was a second measurement which allowed you to orient yourself
with respect to direction.
Indeed, we can take that as our paradigm situation for how we can
draw on science in action or practice. These measurements had as func-
tion to locate and orient you with respect to a ‘public’ representation of
the object or situation of interest. That means: they were the practical,
instrumental means for arriving at those crucial indexical judgments that
80 :
you needed to make use of the map. Although there are both simpler
and more complex cases of measurement, this case is paradigmatic in
what it reveals about measurement and the use of ‘public’—generally
theoretical—representations.
It was Kant who placed the inevitable indexicality of application center stage
in thinking about how experience, understanding, and reason are related.36
His reflections on this subject came at a crucial stage in his philosophical
development, in his early essay ‘‘Concerning the ultimate ground of the
differentiation of directions in space’’:
No matter how well I may know the order of the compass points, I can only
determine directions by reference to them, if I am aware of whether this order
runs from right to left or from left to right, and the most precise map of the
heavens . . . would not enable me [without this orientation] to infer . . . on which
side of the horizon I ought to look for the sunrise.37
There is here a precise and perfect analogy between theory, model, and map. The
point about the map, made already by Kant, applies to a model of any sort.
The activity of representation is successful only if the recipients are able
to receive that information through their ‘viewing’ of the representation.
But what are the conditions of possibility for this reception? The recipient
must be in some pertinent sense able to relate him or herself, his or her
current situation, to the representation.
There is one objection to Kant’s point which cannot be simply dismissed.
Might each location in the cosmos not have a uniquely defining description in
terms that are not in any way indexical or context-dependent? And if so,
could that description not be everywhere substituted by a person at that
location for the words ‘‘I, here, now’’? Could Kant’s thesis not be evaded
in that way, at least in principle? This objection I will take up later, by
connecting it with the attempts to develop a purely structuralist view of
science by Russell and Carnap, and relate it to arguments such as ‘‘Putnam’s
Paradox’’. For now we will look into the implications of Kant’s thesis at a
less abstract level.
81
GPS, The automated ‘self-locating’ map (?) Does Kant’s analysis still apply,
now that we have Global Positioning Systems (GPS)? GPS satellites transmit
signals, which the GPS receivers receive passively; the receivers do not
transmit data at all—hence no data about their own location. Having such
a receiver we obtain talking, self-locating ‘‘maps’’ that don’t need anything
from us to help them. When installed in an automobile, the system is
designed to show an instinctively recognizable display to drivers allowing
them to see where they are in relation to surrounding topography. In
advanced versions, the interface includes a bird’s-eye view of adjoining
streets and buildings.
So how does this work? The auto receives signals from several GPS
satellites, with data about the satellite’s location and the time of emission.
These signals, sent simultaneously by the satellites, arrive at the receiver at
different times due to the difference in distance. From these differences the
receiver’s location in the coordinate system defined by the satellites can
be determined. The local map coordinates are defined in that coordinate
system, and this map also shows the roads and buildings situated in the
neighborhood of that location.
Well, is anything missing? Yes: the driver has to make the judgment
that the moving point on the map display is his own location. If s/he
does not know how the system works, that is obvious. But even if the
design is understood, a judgment tacitly intervenes; after all, only a slight
malfunction in the computerized calculations could result in a significant
displacement. The case is not so different from a person standing in front
of a map which has on it an arrow with the legend ‘‘You are here’’. That
person has to make the judgment ‘‘I am now at the location indicated by
that arrow’’. This judgment could very easily be false, for example, if the
wrong map had been affixed at that place. Suppose the GPS system makes
the sound ‘‘Right turn, 500 feet’’. I have to supply the judgment that I am
now in the right location with respect to the source of this sound—that it
is not, for example, received as a radio signal from the GPS that is currently
guiding my cousin in his new VW, sent to impress me.
Imagine the automation of ship navigation along similar lines. In the old
days the mate took his astrolabe and reported the elevation and compass
direction for certain stars; then the captain took those data and the data
from the chronometer to the maps on his table. There he plotted the data
and arrived at a location on the map; on the basis of this he issued his
82 :
orders for the course to be steered. All of this except the very last step can
be automated—the GPS can collect the data that will play the role of what
mate and astrolabe provided before. But if the captain is to issue his orders,
when he sees a cross appearing on the map depicted on his computer
monitor, he has to add ‘‘we are now there, and the ship is currently moving
in that direction’’. The situation has not changed in principle, just become
less labor intensive.
It will perhaps be objected that the captain’s role can also be automated,
assigned to an ‘automatic pilot’, integrated with the GPS. But then, who is
the user? The person who programs the automatic pilot with a course to
steer, and then turns it on—in the full confidence that the map point taken
as input is precisely where s/he is at that moment. So again, the situation
has not changed in principle for the user.
What is in a map?
The short answer is ‘‘Nothing!’’. That is, if we take the physical object
by itself, considered entirely without reference to use, to us, then there is
nothing in it to determine its semantic content. But we can expand on this
point. Even relative to the conventions in force in our community or society, for
pictorial representation of terrain, there is some information—taking this
in a broad sense—that cannot be in the map itself.
So what is in a map? The topic of self-ascription belongs to pragmatics
and not to semantics. That is a fancy way to say that what the self-ascription
does cannot be equated with, but adds a crucial step to, the content of a map
or to the bare impersonal belief that the map ‘‘fits’’ the world.
The bearing of this puzzle: we must generalize this to scientific represent-
ation per se. The body of ‘pure’, ‘fundamental’ science, in the sense of the
totality of accepted scientific information, can in principle be written in
coordinate free, context-independent form. That is possible for theoretical
science, even if it includes the history of the universe or the evolution
of biological species on earth. But to draw on this body of science in
technology—whether in practical applications or even to test it or use it to
explain something, or add to it through research—the scientist or scientific
community must supply something extra, which does not come with that
body of science, but serves to locate the user with respect to it.38
83
can easily be such a contrast: suppose an Italian speaker sees the inscription
‘‘burro’’. He may well take it as being in his own language, hence a sign for
butter; but he could take it to be Spanish, to refer to a donkey. In a more
complex case, Sherlock Holmes sees what is ostensibly a simple innocuous
message in English but declares it to be a message in code having a quite
different meaning.
At first blush, such inscriptions are far from maps, and what happens
here is far from the use of a map to locate oneself in a landscape. But the
mere understanding of the inscription as a text requires relating it to one’s
own language—either through taking it to be an expression in one’s own
language or through a translation procedure.39
Language may seem too special an example here: it is the seat of meaning
if anything is. But to call, classify, something as a map or a model is to
locate it in what Wilfrid Sellars called ‘‘the space of reasons’’—at least as
this phrase is now broadly understood. By itself this is not yet self-location.
It is just to classify the item as having semantic content, and as having a role
in reasoned discourse and in practices subject to norms of rationality. We
can to some extent separate our understanding of the item, in the sense of
grasping its semantic content, from the understanding of our own situation
that comes with locating oneself ‘in’ or ‘with respect to’ the item. But the
latter comes in train, so to speak.
This applies tellingly to any observation report made (as it usually is) in
theory-laden language. If I say ‘‘Lo, phlogiston escaping!’’ I am locating my
own situation in the logical space provided by phlogiston theory. If instead
on the same occasion I say ‘‘Rapid oxidation happening!’’ I am locating my
situation in a logical space provided by modern chemistry. If Lavoisier had
heard someone shout the former, it would have been helplessly academic
of him to reflect that he was hearing just another false statement. If you
hear me say either of the above, you are well advised to flee the room, if
you take yourself to be with me, that is, if you are inclined to echo my
self-locating act (even if you will echo it in a different-theory-laden form).
The question will come up a few times below, with either just origin and
orientation or indexicality at issue, whether to spread these terms still further
in application. For the most part I will resist that; the important relations
between representation and perspective will be obscured if we allow much
broader use. This despite the already current broad usage in everyday
language, which can’t always be resisted without irritating the ear.40
The question is least simple when it comes to verbal description. When
is a description perspectival? When it is a description from a certain point
of view—but ‘‘point of view’’ is subject to the same fluctuations in use
as ‘‘perspective’’. A hallmark we can look for in the case of language
is the occurrence of indexical terms or phrases. Among these I include
demonstratives such as ‘‘this’’, ‘‘those yonder’’, as well as the more obvious
‘‘I’’, ‘‘you’’, ‘‘here’’, ‘‘now’’. But the indexicality is sometimes hidden, not
apparent in the surface structure, so we do not have a perfect criterion
here. On the other hand I would resist calling a description perspectival on
the sole ground that some reference to the individual or communal users’
experience is indispensable to understanding its terms. That would take
in even such ostensibly paradigm ‘impersonal’ examples as ‘‘for any body
of gas, pressure will alter with temperature if volume is constant’’. In the
actual construction and use of specific models in practice, we may discern
a specific perspective. But on the other hand, general scientific theories,
in their ‘official’ formulation, are not perspectival descriptions, and their
models—if we consider the entire range of models for a given theory—are
generally not perspectival representations.41
When it comes to terminology, let’s practice tolerance. Whenever it
matters, we should refer to the specific characteristics themselves, not simply
to the perspectivity of which they can serve as hallmarks. Restraint to the
extent indicated here is especially preferable in a technical philosophical
context, while at the same time it is easy enough to understand the wider
usage so commonly found.
The words at a certain stage, perhaps puzzling in their first occurrence, refer
to the end of that long journey from the initial encounter with nature
to the achievement of an even temporarily stable representation. We will
look at this process both from within the journey itself and from above, that
is, from the point of view achieved at its end.
Perspectival drawing provided us with a paradigm example of meas-
urement. The process of drawing produces a representation of the drawn
object, which is selectively like that object; the likeness is at once at a rather
high level of abstraction and yet springing to the eye. While the information
92 : , ,
Strange words simply puzzle us; ordinary words convey what we know already;
it is from metaphor that we can get hold of something new and fresh. (Aristotle,
Rhetoric 1410b10–13)
These are valuable metaphors, each aptly guiding some of our thought
about instruments. They do not exhaust the subject: somewhere in between
the two lies instrumentation conceived of as creating copies, scale models,
likenesses, or ‘significant similarities’ of natural phenomena on the scale of
the directly visible. Peter Dear (1995: 159–60) gives as striking example
William Gilbert’s work on the magnet in 1600. His artifact is a spherical
magnet made on a lathe from a lodestone. He calls it a ‘‘little earth’’, a
terrella, and while we spontaneously read this as a metaphor, Gilbert makes
it clear that he means this very literally: the earth is a spherical magnet, and
what he has created here is the very same thing on a scale where we can
handle it.9 The instrumentation is here conceived of—in terms that echo
Dear’s own distinction—as having a mimetic function, to be contrasted
with ‘‘the semiotic, focusing on signs and representation’’. This is close to
the ‘‘windows’’ metaphor, in that the set-up is claimed to show ‘‘what
things are really like’’, but also close to the ‘‘engines’’ metaphor, because
hinging on the production of novel effects. So here the two metaphors
overlap.
We can speak aptly of representation in all three cases, and the bound-
aries between the three are blurry, in act if not in conception. In the
case of optical instruments we can readily see all three illustrated. Firstly,
the images produced by lenses are copies, literal likenesses, if the lenses
function as windows upon their domain. Secondly, whether or not that
is so, the images produced by lenses are themselves (artificially produced)
phenomena, that have seen their uses in ancient magic and ritual as well
as in modern science. Thirdly, phenomena instrumentally produced may
in some cases be scaled-down or scaled-up versions of naturally occurring
phenomena that are too small or too large to be grasped in percep-
tion, or be asserted to be ‘significantly similar’ to postulated processes in
nature.
But there is one very important disconnect between the second concep-
tion and the other two. Creation of new phenomena is not in general a case
of mimesis. It may indeed provide information about ‘natural’ phenomena,
but need not be doing so by presenting us with a likeness. Instrumentation
in science, both in experimentation and in application, has many roles.
One salient role that I want to emphasize here, that is important as different
theoretical leanings compete, is the production of new phenomena that all
theories in a given domain must account for if they are to compete successfully
98 : , ,
there at all. That role is played by all that is novel in inquiries into nature
by means of instrumentation. Paradoxically, that is precisely the one that
tends to favor the ‘window’ metaphor, although it is most clearly the role
in which the instruments are engines of creation, not channels for passive
observation!
What better way to challenge a rival theory than to produce a new
phenomenon to be accounted for? The new phenomena are not created for
nothing or with no consequences: theory must submit itself to their tribunal.
There are many famous examples of such creation of new phenomena to
play this role. Hertz himself reported on the importance of his experiments
in electromagnetism in this way:
What we here indicate as having been accomplished by the experiments is
accomplished independently of the correctness of particular theories. Nevertheless,
there is an obvious connection between the experiments and the theory in
connection with which they were undertaken. Since the year 1861 science has
been in possession of a theory which Maxwell constructed upon Faraday’s views,
and which we therefore call the Faraday–Maxwell theory. This theory affirms
the possibility of the class of phenomena here discovered just as positively as the
remaining electrical theories are compelled to deny it. (Hertz 1962: 19)
The well-known story of Poisson’s challenge to Augustin Fresnel’s prize
essay furnishes a ready example (even if not generally told with perfect
historical accuracy).10 Fresnel submitted his essay for a scientific prize
competition on the diffraction of light, basing his analysis on the wave
theory. Poisson, a member of the panel which evaluated the essay, pointed
out that Fresnel’s analysis had a strange consequence. It implied that the
center of the shadow of a circular disc would have a bright spot. This
consequence contradicted the Newtonian corpuscular theory of light, and
seemed in any case highly unlikely. But Dominique Arago, the panel’s
chairman, did the experiment and the bright spot appeared! Despite
himself Poisson had designed an experiment that created an observed
phenomenon—new in what these scientists on the panel had seen, though
not new in nature—which only the wave theory was (at that point) able
to explain, and which was entirely upsetting to its great rival.11
The ‘‘window into the invisible world’’ metaphor has dominated modern
philosophical thinking about science as much as the ‘‘mirror of nature’’
metaphor dominated modern epistemology and metaphysics. It will serve
us better to dislodge or at least weaken its grip on our philosophical
⁽ ⁾ 99
on the light affects and alters the state of this ethereal medium. This
altered state is manifest in the illumination and visibility of the objects in
this room.
From a realistic point of view you can describe Faraday’s device as
a detector for the electromagnetic field, as a window allowing us to
look upon those mysterious lines of force in the ether. In theory it is now
certainly of a piece with optical phenomena. At the phenomenological level
it was at the time of a piece with the well-known visible phenomena of
magnetic attraction. The behavior of lodestone and iron had been recreated,
artificially produced—in a like yet different form—in the laboratory. But
look, we have here also a phenomenon that had never occurred before in
history: pressing one object and thereby instantaneously altering a ‘dust’
distribution on another object into such a regular pattern as this. Faraday
had created a new phenomenon. But this event had at once great theoretical
value—for it was a controlled phenomenon so closely related to known
natural phenomena of electricity and magnetism that it was required to be
accounted for in any future theory on offer in this area of physics.
Faraday had constructed an engine to produce a fascinating variety of
new phenomena. The story continues. When representing the distribution
of electric charge as analogous to what happens when iron filings are
scattered in the vicinity of a bar magnet, Faraday and Barlow displayed
the change in the position of the filings as a continuous function of their
position relative to the poles of a magnet. The examples can be multiplied:
brought to light are salient regularities in the new observable phenomena
that theory must account for.
A catalogue of images
Nature creates these public hallucinations. Already in ancient times, concave
mirrors and lenses were used to do the same, namely to create (artificial)
public hallucinations. Even a concave mirror and a lighted candle can
104 : , ,
‘‘COPY’’- NOT
QUALIFIED ‘‘COPY’’-
QUALIFIED
〈microscope
image〉
(Graven Images) On one side are the images which are in fact things, such
as paintings and photos.
(Private Images) On the other extreme are the purely subjective ones
like after-images, dreams, and private hallucinations. These are personal,
not shared, not publicly accessible. Indeed, we are pretty clearly dealing
⁽ ⁾ 105
there with discourse that reifies certain experiences which are ‘‘as if ’’ one
is seeing or hearing.
(Public Hallucinations) In between these two are a whole gallery of images
which are not things, but are also not purely subjective, because they can be
captured on photographs: reflections in the water, mirror images, mirages,
rainbows. For those I will use the term ‘‘public hallucinations’’.
Some of these public hallucinations are actually ‘‘of ’’ real things: e.g. the
reflection of a tree in the water. When you see the reflection of a tree in
water you are not seeing a thing; a reflection is not nothing, it is something,
but it is not a thing, not a material object.19 That is clear enough because of
the way it moves when you move, quite differently from, say, a log floating
in the water. Some public hallucinations are not ‘‘of’’ real things; e.g. the
rainbow. But of those which are not, some—only some—would still lend
themselves to being conceived of or identified as pictures of real things. If
an image would so lend itself I’ll call it ‘‘ ‘copy’-qualified’’ (following the
Sophist’s distinction between making copies and creating appearances). But
of any ‘‘copy’’-qualified image we can still ask: is it really of something real,
or is it not? That is always a question of fact transcending the experience
itself.
explain to ourselves why the equations work. The soft part does change: we no
longer believe in Maxwell’s ether, and we know there is more to nature than
Newton’s particles and forces.
The changes in the soft part of scientific theories also produce changes in our
understanding of the conditions under which the hard part is a good approximation.
But after our theories reach their mature forms, their hard parts represent permanent
accomplishments.(Weinberg 1998: 50)
relations that the theory says they should. So when Perrin showed that
about a dozen experimental ways of ascertaining Avogadro’s number all
gave the same result, his work gave atomic theory a stability that it could
not have attained if the results had been different. Similarly for Thomson’s
multiple ways of measuring the electron’s mass to charge ratio.30 In these
two cases too, the experiments were approached within a developing
theoretical framework that needed consistent empirical grounding for its
theoretical parameters.
For instruments we now have a cross classification to draw on: a
diversity of roles of instrumentation in experiment, and a diversity of
roles of experiment in theorizing. In the former, measurement shows up
as a single role among others, the representative role. In experiments,
however, measurement is crucially involved in all cases. All the examples
of production of new phenomena involved set-ups in which multiple
measurements were being made as well. Although it may seem as if
electron microscopes, spectroscopes, spectrophotometers, x-ray cameras,
refractometers, polarimeters, cloud chambers, scintillators, and the like have
been relegated to a ‘lesser’ status in my account than the scientific realist
would give them, there is no implication at all that the role of measurement
has vanished from the scene. What needs still to be elaborated, however,
is an account of measurement itself, as it appears in the process of joint
experimental-theoretical development and as it appears through the eyes
of the eventually achieved theory.
This page intentionally left blank
5
The Problem of Coordination
and theories were still evolving, our answer had better be coherent with
what we now take that referent to be. So we can only say:
all along they were referring to the magnitude identifiable as measured
by the procedures that would eventually be accepted as measur-
ing it.
So yes, it makes sense for us to say that Galileo’s activity was about
measuring temperature, although we have to add immediately (speaking in
our own voice, using our own words) that his apparatus did not really do
that, that there was a mistake involved in his set-up.
Notice how this answer echoes Reichenbach’s statement that what is
measured, and thus through measurement coordinated with a theoretical
concept, is not defined independently of that coordination, but defined by
the coordination. At first blush this formulation too may sound circular,
even viciously circular, if not debilitatingly anthropocentric or egocentric.
But it is not. What would make it viciously circular is only Reichenbach’s
tacit assumption that what was being done could in principle be done
without any historically prior achievements along such lines. But to
understand this properly we must follow the process of coordinating
both in its general form and in specific examples, to show how the joint
evolution of measurement and theory can happen and can come to a stable
resting point.
What is measured?
Can we identify the parameters that are measured by means of the meas-
uring procedure itself alone? Well, what does ‘‘alone’’ mean here? Does it
mean ‘‘with no theoretical background in which to operate’’? But with no
theoretical background, very few procedures classify as measurement pro-
cedures. For example, as Mach and later Poincaré emphasized, procedures
for measuring mass presuppose something about the items to which they
are applied, e.g. that they are subject to the law that action = reaction.12
This is precisely why Mach, and later Cassirer, Schlick, and Reichenbach,
in their different ways, concerned themselves so much with the problem
of coordination.13 But it is also through measurement results that theories
gain empirical support, so there must be some trade-off, or some subtleties
to how empirical support can be gained, if we are not to find ourselves lost
in a circular argument.
124 : , ,
The idea that the parameters can be identified simply through the
measurement procedures was behind Bridgman’s operationalism, and we
often see its strong appeal still in scientific writings. But the specification
of the measured parameter even in terms of the (final, stable) measuring
procedure can never amount to a complete identification or definition. For
the procedure will classify only when it is actually applied, and even then
only establish order among the items to which it is applied, at the times
when it is applied.14 To try and stick with an ‘operationalist’ view of what
is measured would land us in the absurdity that a process or object has no
character at all while it is not being measured.
In practice a theory eventually emerges which encompasses the meas-
urement procedure itself as well as the items measured, and provides the
coordination. Thus in the case of temperature, the kinetic theory which
was developed after thermometry had been developing for two centuries,
provided the parameter which then was identified as precisely what was
measured by the thermometer. So a definite identification, a complete
definition, of the measured parameter is possible but only through, at the
hands of, and relative to the theory offered and finally accepted to account
for the stability of the measurement procedure.15 We need to see this
development in its historical perspective, and to recognize and distinguish
two facts:
(a) an empirical fact that has been discovered: namely, the very stability
in the procedures found in this historical development, and the
reliability of the predictions concerning these and their correlation
with other measurement procedures derived from the mature theory
in which they are now theoretically embedded.
(b) a historical fact: namely, that choices in the development of these
measuring procedures went hand in hand with the development
of the theory in question, so that we cannot identify an aspect of
nature that is measured if we refer only to those empirical procedures
without using concepts provided by the theory.16 What now counts
as simple passive measurement is a hard-won achievement.17
What were the choices involved? What epistemic status can we assign to
these choices? We will not gain much clarity if we continue simply in the
current abstract vein. We need to look at the actual history to get any grip
on this at all.
125
warmer or colder it gets: the rough and ready comparison by the senses
will do for that. But if the change in height of the water column, correlated
with the volume of enclosed air, can be used as indicating how much hotter
or colder the air became, then we will have a numerical scale induced
by the most basic, most venerable form of already familiar measurement:
length.
Is there an empirical question involved in this choice? What cannot be
ascertained without the use of a good thermometer is whether the change
in volume is quantitatively proportional to the change in temperature. There
will certainly be empirical questions to come, but this is not one. Prior to
the construction of a thermometer, there is no thermometer to settle that question! So
here the scientist confronts a choice, one that may be provisional to start,
with hope of vindication of some sort or other later on: to use changes in
volume as quantitative measure of how much hotter or colder one thing is
than another.
There are a number of drawbacks in this arrangement, if we try to think
about it in current terms as measuring temperature. The water and the glass
are also relevantly affected by temperature changes, even if not to the same
degree as the air. The initial temperature of the apparatus will also play a
role. But most important from that point of view is the effect on the water
level of changes in atmospheric pressure—the instrument is, we might say
now, part thermometer and part barometer. This could not even be realized
or taken into account until the notion of atmospheric pressure was in play.
The history of the barometer began during the same century, and like
that of the thermometer was initially beset by theoretical disputes about
the possibility of a vacuum. Hence the problem of effects of atmospheric
pressure on the air thermometer did not appear till it could appear. The
problem that the air thermometer was actually a mixture of thermometer
and barometer was in fact pointed out by Blaise Pascal in mid-seventeenth
century, after his barometric experiment on Puy-de-Dôme.
longer fills the whole tube—arguably, and at the time contentiously, one
now witnesses the creation of a vacuum. (Questions on that score were
crucial then; I shall ignore them here.) A common form had its length
divided into equal parts. Detailed instructions recorded at the Accademia
del Cimento in Florence, specified that in a good construction snow or ice
will not make the liquid column descend below the 20 degree mark, nor the
hottest day dilate it beyond 80 degrees.19 Since the tube is entirely closed,
this arrangement eliminates the effect of atmospheric pressure, though not
the effects of temperature changes on the liquid or the glass.20
The assumption or choice of definition that differences in temperature
are proportional to resulting differences in volume continues, though now
instantiated to liquids. Two obvious questions: do different liquids agree
with each other in this respect? and do they indeed always expand when, by
other common criteria, they are becoming warmer? These questions did not
receive very satisfactory answers. The early makers of liquid thermometers
used quite a variety of different liquids. Water was not a good candidate,
since its volume does not keep contracting as temperature drops toward
freezing.21 None of the liquids is of any use below its freezing point
anyway. But in the regions well above their freezing point it was at first
thought that they all expanded and contracted in the same proportion when
put in the same environment controlled for effects other than temperature
variation—an appearance which in turn was refuted by Dalton for the case
of high temperatures.22
The results are thus sensitive to the choice of liquid, partly because
their response to temperature changes varies at very low and very high
temperatures in a certain range—and the liquid state does not persist below
or above a certain point. Even in ‘good’ ranges, the expansion in volume
is not proportional for different liquids. Dalton wrote, a good century into
this evolution, that liquids were found to expand unequally, ‘‘but no two
of them alike. Mercury has appeared to have the least variation’’.23 But
even for the otherwise very good choice of mercury, the effects on the
glass tube are not negligible in comparison to the effects on the liquid. The
expansion of mercury is only seven times that of glass.
as the ideal gas law (Boyle’s Law, the Boyle-Mariotte Law, the Law of
Charles and Gay Lussac, Charles’s Law—a multiplicity of names partly
due to nationalist partiality), PV = rT, with r a constant characterizing
the gas. If pressure is kept constant, the volume varies proportionally with
the temperature, for all gas in the same way—conversely, if the volume
is constant, the change in pressure is proportionate to the change in
temperature. Obviously the experimental findings could at best hint at this
relationship before thermometry had been stabilized. But of gases, unlike
of liquids, it could not be said that they expand unequally ‘‘but no two of
them alike’’.
Guillaume Amontons pointed out, in effect, that if a law of this form is
correct, then the temperature scale can become a ratio scale: it will have an
‘absolute zero’, marked by the theoretical condition of zero pressure.24
Is this law correct? To begin its correctness is supported by experimental
findings, made with thermometers of the previous generation, accepted
as a rough guide while recognized as inadequate. Then, as confidence in
the kinetic theory increases, theory cuts the Gordian knot with respect to all
questions about how the scale is to be fixed. The gas law is supported
by its incorporation in the kinetic theory—and refined as well, so that
it also becomes itself qualified there as having been only an empirical
approximation to what really happens. But the scale is now anchored in
prior scales for mechanical parameters. Once T is identified with the mean
of a mechanical parameter, the postulated mechanical behavior—‘‘the type
of motion we call heat’’, in Clausius’s incomparable phrase—is now the
referent of what is measured.
We can also put it the other way around: the parameter, identified by
the eventually stabilized procedures for its measurement, is now classified
by the theory as one aspect of the logical space that the theory provides for
location of items in its domain.
The history of thermometry did not stop here, but this much will suffice
to make the important points concerning coordination.
This is a very natural way to proceed, but if it is thought that the lower
bound is a fiction, one could choose a different principle of coordination,
not bounded below any more than bounded above. Mach points out that
Dalton had in effect chosen such a scale, though of course that is not the
one that caught on.
The choice is settled later by the absorption of the entire subject of
inquiry into physical theory. In the kinetic theory the magnitude with
which temperature is identified has an absolute zero point. But it is well to
note that this was not part of the empirical facts guiding the evolution of
thermometry, though the kinetic theory was developed in a way that was
accountable to that evolution of empirical procedures.
The very simple operations we now perform with thermometers are
an obvious example of ‘passive’ measurement—in our theories we have
the interaction as well as the object and apparatus completely classified,
‘well understood’. This stage is an achievement. The story of thermometry
is a success story, and such success was by no means guaranteed a priori.
The conditions of possibility for such a successful coordination—here we
are at Reichenbach’s question—we have now seen to involve empir-
ical regularities that are contingent and choices subject to conditions of
coherence.
Coherence conditions
But it certainly is the sort of choice that is also subject to strong coherence
constraints, which are not a matter of choice. The class of processes chosen
132 : , ,
as initial standard for equal duration must be such that if two are in spatial
coincidence, they run at the same rate, they run in synchrony. The decision to
choose the pendulum, in such bare form, cannot be sustainable in practice,
precisely because it does not have this coherence. Different pendulums will
disagree due to alteration in temperature, friction, barometric pressure, etc.,
and possibly beside these due to at that time unknown or uncontrollable
factors. Comparing the behavior of different pendulums does not display
stable synchrony. In 1672 Richer took a pendulum clock from Paris,
latitude 49◦ North, to Cayenne in French Guyana, latitude 5◦ North.29
He found that he had to change the length of the pendulum to make
it agree with standard time reckoning, precisely because as Huygens and
later Newton were able to explain, the force of gravity is not the same
everywhere on earth. So the next development, according to Poincaré,
was to correct mechanical clocks such as pendulums by the sidereal clock,
which is based on the passing of a star across the meridian—the unit then
being in effect the duration of one rotation of the earth. This too is a
decision, and this time it was made explicitly.30
Yet this adoption of the sidereal clock was subject to a similar objection,
if in a somewhat more theoretical guise: the movement of the seas will
act like a brake, slowing down the rotation of the earth. Far from being
a merely academic or skeptical doubt, this point was used to explain an
apparent acceleration in the movement of the moon.
Poincaré wishes to reveal by these examples two problems that arise in
developing a measurement procedure for duration. The first is the initial
one, illustrated with the pendulum: we cannot place successive processes
side by side so as to check whether their endpoints coincide in time. So
there is no independent means for checking whether successive stages of a
single process are of equal duration: the question makes sense only after
we have accepted one such process as ‘running evenly’.31 As said above for
temperature,
duration = the magnitude identifiable as measured by the clocks that
would eventually be accepted as measuring it.
This insight requires us to move all our concern to coherence constraints that
put bounds on what we can choose to regard as ‘running evenly’.
In another context, to illustrate that definitions are not totally arbitrary
but in general have factual presuppositions, Poincaré gave the example of
133
here for a cluster of pragmatic desiderata for our theories. But we can add
that the issue became concrete later on due to a proposition advanced in
cosmology by Arthur Milne in the 1930s.34
By that time the corrected astronomical time reckoning that had become
standard had been supplemented at least at the theoretical level with atomic
clocks. It was part of the accepted theory that atoms of each chemical
element and compound absorb and emit electromagnetic radiation at
their own characteristic frequencies. As reckoned by the sidereal standard,
these resonances had been taken to be inherently stable over time and
space. In Milne’s time there was as yet no practical application—the first
atomic clock was not built till 1949. Recall now that there are coherence
constraints: the class of ‘‘standard’’ clocks must be such that whenever two
are in coincidence, they run at the same rate, they run in synchrony. In
principle there could be two such classes, disjoint from each other, and not
combinable in a way that satisfies this constraint. Milne proposed that this is
indeed the case, and that astronomical (sidereal) clocks run differently from
atomic clocks. The disparity is negligible outside cosmology. The relation
between them, he submitted, was an exponential function, so that the two
give us entirely different pictures of cosmic history: for the one, the past is
finite and for the other it is infinite.
Although a lively topic at one stage in the development of cosmology,
the idea did not have any continued impact. What it illustrates and is
important, though, is the point of principle already made by Poincaré. If
the class of measurement interactions (of one sort or another) cannot have
an ‘‘if and only if ’’ definition but only be required to satisfy certain coherence
conditions, then the possibility of such disparities (and hence leeway for
convention or choice) is entailed.
in all the examples, taken length measurement for granted. But suppose we
consider length in the same way as we considered temperature and time
measurement, where the principles of coordination presupposed length
measurement as given. In order to ensure standards of length comparison
that would remain applicable across time, another such history of coordin-
ation unfolded. One telling example occurred early on in a proposal by
Huygens. After his invention of the pendulum clock, and his appreciation
of how its period is related to latitude, he proposed that the unit of length
should be one third of the length of a pendulum with a one second period
at the latitude of Paris—not the English foot, not the pied du Roi, which
was 16/15 of the English foot, but the pes horarius. Here time measurement
is presupposed for the coordination of length.
While this proposal did not prove practical, we see a similar entanglement
of several principles of coordination at the most familiarly known stage:
the construction of the standard meter embodied in the platinum bar to be
kept in the French State Archives. Iron copies were distributed to other
countries, and needed to be sent to France every ten years to be compared
with the primus inter pares copy kept in Sèvres. So we see two decisions
installed as definitive: that such metal bars remain congruent to themselves
under transport, and so provide a standard for congruence with respect
to length, and that a particular such bar defines the unit of length. But
what precisely was the convention? It was decided at the Convention
of the Meter in 1875, and was precisely this: the meter is defined to be
the distance between the midpoints of the ends of the mètre des archives
at the temperature of melting ice. But is the temperature of melting ice
always the same? The convention presupposes temperature measurement,
just as the conventions for temperature measurement presuppose length
measurement. The circle can be broken by entering a choice at one point
or another, but there is no specially privileged point for entering a choice.
Thus Einstein separated Poincaré’s rather simplistic pointing to specific
‘conventions’ or ‘definitions’ from the real insight into the inevitable
holism in the jointly constructed theory and coordination. As Ryckman
observes, Einstein’s admission of Poincaré’s viewpoint ‘‘is not a concession
to conventionalism, that is to the freedom to choose any geometry we
like, but to the inevitable epistemological holism of a theory in principle
capable of explaining its own measurement appliances, and so its ties to
observation’’ (Ryckman 2005: 64). But this holism will isolate the theory
136 : , ,
But their simplifications, even if going too far, honor the insight that the
parameter that is measured is identified in the historical process by the
envisioned eventual stable measuring practice, while it is differently identified
in retrospect by the theory that draws on that history for its credentials.44
Within the vantage point of the accepted theory, once such stability has
been achieved, we can speak meaningfully of the accuracy with which a
given instrument gauges a given observable. At that late point, when the
parameter has found its place in the theory that has emerged now, it is of
course a characteristic which is no longer defined by measurement, but by
its role in the theory. That is right and legitimate also from an empiricist
point of view, for this discourse draws on how objects, phenomena, and
instruments are classified by (or classified relative to) the accepted theory.45
It does not presuppose an impossible god-like view in which nature and
theory and measurement practice are all accessed independently of each
other and compared to see how they are related ‘in reality’. The two ways
of looking at the matter we must combine in a synoptic vision. The first
is ‘from within’ the historical process in which measurement procedures
and theory are stabilized. The second is ‘from above’ with the theoretical
description of the domain including the measurement interactions already
in hand, a stage achieved in that historical process but no longer involving
any explicit reference to its own history.46
We are not done with measurement, even with this realization. There is
still a whole battery of questions about measurement that are not answered
by our escape from the simplistic notion of number-assigning. On the one
hand, a measurement procedure is a physical interaction. On the other
hand, the measurement outcome ‘locates’ the object or process measured
in a certain space of possible states. The physical and intentional aspects of
measurements are to be looked into separately, before we bring them back
into a synoptic view of how theory relates to experiment and observation.
This page intentionally left blank
6
Measurement as Representation:
1. The Physical Correlate
We have looked at measurement ‘from within’, and will now look ‘from
above’: that is, try for a view that we can have after the measurement
procedures, concepts, and theories have stabilized. This resting point in
the conceptual and scientific development is, to be sure, fleeting and
momentary only—yet marking a context in which some things can
be taken for granted pro tem, where it makes sense to ask about the
world-picture of currently accepted physical theory.
For the question What is measured? the most direct answer is Physical
magnitudes that characterize the objects measured. At the point where the
pertinent theory is stably established, what the relevant physical magnitudes
are and how they are measured is itself specified theoretically. We cannot
separate the questions What is measured? and What is a measurement?, and both
of them have, eventually, specific answers from within the pertinent theory,
which will classify certain interactions as measurements and their final stages
as outcomes. The interaction in question I will call the physical correlate
of the measurement. The criterion for what sorts of interactions can be
measurements will be, roughly speaking, that the outcome must represent
the target in a certain fashion—, selectively resembling it at a certain level
of abstraction, according to the theory—it is a representation criterion.
Granting that, let’s ask though how we should classify this example, while
thinking of how Dürer’s Art of Measuring depicts the perspectival painter’s
art. Perhaps the portrait resembled the brother more, because the brother
was in fact the artist’s model while painting the portrait? That would not
have been so remarkable—perhaps the artist had noticed how much the
brothers did resemble each other, and the Duke was not always available
for sitting. Not only not remarkable: in no way threatening the status of
the painting as the Duke’s portrait. To take a more extreme case, Dante
Gabriel Rossetti’s painting The Girlhood of Mary Virgin depicts the Blessed
Virgin in the house of her parents. But the model for Mary was Christina
Rossetti, while Mrs. Gabriele Rossetti was the model for St. Anne. That
does not make the painting a family portrait of the Rossettis. It is without
a single doubt a picture of the Virgin Mary’s childhood, although the
painted figures—equally undoubtedly—resembled Christina and Frances
Mary Rossetti more closely than either the Blessed Virgin or her mother.
What if the artist commissioned by the Duke of Wellington had not only
used the Duke’s brother as model, but had used one of Dürer’s machines
for drawing, so as to produce the painting? In that case we say that the
drawing is a measurement outcome, but the measurement was surely not a
measurement of the Duke! So now the ‘‘of’’ in the two contexts diverge:
we will still grant the result the status of portrait of the Duke, however it
was made, but not that the operation carried out was a measurement of the
Duke.1
While the criteria for what a portrait is a portrait of are various, cultural
context-dependent, and sensitive to social factors, that is not quite so for
measurement, however similar they may be in other respects. In the case of
measurement the physical conditions of the object-instrument interaction
have more weight, and are in fact the subject for a good deal of foundational
research in the sciences themselves.
We can put this partly in terms of language. A claim of the form ‘‘This
is an X-measurement of quantity M pertaining to S’’ makes sense only
in a context where the object measured is already classified as a system
characterized by quantity M. To so describe an object is already to classify
by theory. Therefore the claim is theoretical or at least theory-laden, and
has to be treated as such.3 It is a claim which will change in content and
in truth conditions as our accepted theories change, since our classification
of physical systems changes along with the theories. Here too we see a
difference in the views from within and from above. From the posterior point
of view the new theoretical claims are not displacements but refinements of
the old—precisely because the old claims are now seen as (usually in some
way imperfect) versions of less precise claims formulable in the new theory.4
This context-dependence of the claim may not be visible when we
discuss a measurement procedure that was stabilized so long ago that
we now take the description of the relevant objects and apparatus for
granted. No one today would think of the use of a thermometer as
being a meaningful practice only within certain theory-laden contexts. The
practical use of these instruments has attained such stability that it is for now
at least not at all sensitive to changes in the background theories. But that
context-dependence springs to the eye in cases where the relevant theory
is relatively new and has some claims to descriptive completeness—such
as relativity theory or quantum mechanics.5 In both of these, the behavior
of the measurement apparatus is indeed within the domain of the theory
itself, and the criteria for qualifying as [physical correlate of] measurement
are to be presented in the terms of the theory—with consequent questions
of coherence and consistency inevitably forced on us.6
Philosophical reactions to this point have included a good deal of
extremism. There has been outright denial, as in the operationalist and early
positivist delusion that we could have a hygienic ‘observation language’ in
which the measurement operations and their outcomes could be described
free from all theoretical content. Though we still sometimes see the logical
positivist and logical empiricist circles of Vienna and Berlin identified with
this point of view, it was in fact quickly abandoned there. The critique by
Hanson, Toulmin, and Kuhn in the fifties and sixties removed it from the
philosophical scene altogether—or should have.
The opposite extreme in denial came then, ostensibly in a famous
chapter by Kuhn: scientists in different centuries live in different worlds.
: 145
That extreme is still worth some scrutiny even now. Understood in its
most blatant literal sense, it involves a denial that there is any such thing as
the physical interaction prior to or independent of the theoretical context
in which it is classified as such or such a measurement. Just to say this shows
it up for what it is; but for that very reason, shows that this was not what
was meant. The real point is that occurrence of the physical interaction is
one thing, its playing the role of representation of the value of a certain
parameter is another; and today everyone understands that.7
To escape from both extremes, we need to separate quite clearly the his-
torical process, examined in the preceding chapter, in which measurement
and theory are jointly stabilized, from the retrospective characterization of
measurement in the eventual established physical theory.
whose final value will be the outcome. That process has to be governed
by an equation which has to guarantee something about properties A and
B and that final value—but classical intuitions beware! In addition, the
system denoted as apparatus has to be such that it will play this role quite
independently of what state the object is in—give it any object in any state
and it will play its role.10
What must the governing equation guarantee, in order that this process
really be the measurement of some property pertaining to the object? In
a classical context we might say something like: if the object is thus or so
then the end-state of the apparatus has to indicate precisely that.11 This
assumes that the parameter measured already has a definite value before the
measurement, a value that can be revealed by the measurement outcome.
But this ‘‘revelation’’ assumption is at best controversial and certainly
generally rejected in foundational studies of quantum mechanics.
It is impossible for both to be true! What went wrong? The data listed
were about measurement outcomes, while the extrapolated probabilities
were absolute and unconditional. The proper generalization must give due
attention to the fact that what was found was how things appear in the
measurement set-up:
For X, Y = A, B, C: the probability that X has value 1, conditional on
its being measured equals 0.5, as does the probability that it has value
0. But the probability that X and Y both have value 1, on the condition
that the two are measured together, equals 0 if X = Y
Now there is no contradiction, but rather the consequence that A, B, C
cannot all be measured together. So that is our solution.
How good a solution is it? We want to ask, surely, what value C has
while A and B have values 1 and 0 respectively. But how this question is to
be understood depends on what we take for granted here. Can we take for
granted that when not measured, A, B, C still can only have values 1 or 0?
Or could they have other values when not measured? Or perhaps have no
value at all? And secondly, can we take for granted that if a measurement
of A shows value x, then at that moment, A has value x? Or that it had
value x at the beginning of the measurement?
Could we perhaps reason like this? Suppose A is measured and the
outcome is 1. Now we can predict that a measurement of one of the
others, B or C, will have outcome 0, with certainty. On this basis, can we
assert that B and C already have value 0 at this moment? If we do, we will
have to add that joint measurements of B and C that are actually carried
out are systematically deceptive, for they never show them having the same
value.
Before seeing an example like this, or at least before having any inclination
to take it seriously, various assumptions involved in such reasoning would
likely have been taken for granted when thinking about measurement.
Classical intuitions (if such beings exist) suggest two postulates:
Value Definiteness: Each physical parameter always has some value,
namely one of the values which may be found
by measurement.
Veracity in Measurement: Measurement of a parameter faithfully reveals
the value it really has.13
150 : , ,
These two postulates can be consistently added to our above story, but then
they imply that either measurement is subject to or involves a systematic
alteration, or else some sort of conspiracy in nature constrains when
measurements are made: when A and B both have value 1, we are lucky or
clever enough not to measure the two of them!
Perhaps we should put it this way. The conjunction of these postulates
would be an attempt to say that the world is basically the same, whether
things are being measured or not. But given the above story, the two
postulates are both true only if things are not basically the same whether
things are being measured or not! So the attempt fails: some difference
between measured and unmeasured world will have to be admitted as a
possibility here.
Paul Feyerabend’s (1958) name for the postulate of Value Definiteness
was Classical Principle C. This principle must be rejected, he argued.
From Bohr to Feynman, physicists have expressed similar opinions: an
observable (measurable parameter) might not have a specific value outside
the context of measurement. However, the second postulate—Veracity in
Measurement —has also been much looked upon as a candidate for rejection
or revision. To keep the first postulate and reject the second, by means of
an explanation through uncontrollable disturbance by measurement, would
not be a happy option. It would imply some sort of conspiracy again: if
A and B do sometimes both have value 1, how does the ‘‘uncontrollable’’
disturbance in measurement carefully and systematically hide that fact?
Finally, rejecting Value Definiteness would by itself already imply a
weakening of Veracity in Measurement. For if, at a certain time, parameter
A has no value, and is measured, then this measurement yields a value as
outcome, but clearly does not reveal a value.
When we specify what counts as a measurement of A, we describe a
physical arrangement which must have one of two outcomes (indicator
values), in this case 0 or 1. For this to be a measurement, hence to play a
role in information gathering, it must surely do something that is revealing
about what is measured? But what sort of information it does yield, and
how much, we shall have to consider very carefully.
of the object’s initial state to the apparatus’ final state.14 Otherwise there
would be no way to use this process to gather information about the object
on which the measurement is performed. How this ‘transfer’ requirement
is to be made precise will of course be different in different theories.
For the general form we must allow for the case in which the relation
between physical state and measurement outcome is only characterizable
in terms of probabilities. A deterministic theory can be thought of as the
special case in which all probabilities are zero or one. The measurement
situation modeled as theory prescribes, when the apparatus is itself in the
theory’s domain of application, must include a specification of the following
factors:15
a family M of observables (physical magnitudes) each with a range of
possible values;
a set S of states—physical states of both the system measured and of
the measuring system;
a stochastic response function P s m for each m in M and s in S, which is a
probability measure on the range of m; with P s m to be interpreted as
the probability that a measurement of m will yield a value in E, if performed
when the state is s.
Suppose now that one sort of process represented in the theory is that
of the interaction that qualifies as measurement of an observable. The
situation depicted then involves two systems, the object measured S and
the apparatus R by which it is measured. Together S and R constitute a
larger system, a ‘two-body’ system, S + R. The family of observables must
then include some that pertain just to that object S, some that pertain just
to the apparatus R, and some that pertain to both at once, that is, to system
S + R—and similarly for the states. But there must also be a constraint on
how this situation evolves, as the two objects are coupled and interact.
Classifying R as an apparatus, for measuring observable A for example
(an observable pertaining to the object to be measured) entails that this
interaction will take a certain form, which qualifies the designation as
‘‘apparatus for measuring A’’. In fact, this interaction must be such that
something pertinent about the initial state of the object is reflected in the final state of
the apparatus. So imagine the apparatus as having a dial with a pointer—as
a system it is characterized in part by an observable B, the pointer observable,
152 : , ,
whose possible values are precisely the numbers which the pointer can
indicate on that dial. Now the criterial condition, in its strictest form, must
be this:
Criterion for the Physical Correlate of Measurement: PB fin (E) =
PA init (E)
where fin is the final state of the apparatus, and init is the initial state of
the object on which A was being measured. That is, the final state of the
apparatus must reflect, in its probabilities pertaining to pointer observable
B, the probabilities pertaining to measured observable A in the initial state
of the measured object.16
This includes as a special case that the pointer observable B on the
apparatus would most certainly show e.g. value 17 if inspected at the end,
on the supposition that the measured object was initially in a state in which
observable A ‘most definitely’ had value 17. To this extent then the old
criterion of Veracity (or revelation) is being honored still.
That is, suppose that we had two further apparatus, R1 and R2, the
former one also used to measure A on system S and the latter to measure
B on apparatus R. Then the Criterion’s being satisfied mutatis mutandis for
all measurement interactions will guarantee the coherence of the ‘exterior’
measurements with the ‘interior measurements’.
For a comparison of the probabilities pertaining to R1 and R2 should
show an accord that ‘reveals’ that the accord between S and R demanded
by our Criterion was satisfied. The theory must predict that if a second
independent measurement was made to secure the premise that the object
had been prepared in a certain initial state, and a third measurement were
made of observable B on the apparatus end-state, then the demanded
accord would be found with the requisite probabilities.
This is therefore a coherence constraint on the theory: it must first of all
have this internal harmony in what it predicts, but secondly, comparisons
of results in the two kinds of set-ups must be empirically vindicated. So
here we have a coherence condition that is partly internal consistency and
partly empirical. The theory is to satisfy that coherence constraint—and
that is the most we can ask of it. Put in terms of the sort of description
the theory provides for physical systems in general, this ‘physical theory of
measurement’ is not plagued by any sort of circularity.17 At the same time,
of course, it does not pretend to aim at a definition of measurement in
terms that have nothing to do with measurement!
In practice, the level at which a theory confronts experience is not that
of raw data taken from individual measurement outcomes, but of the ‘data
154 : , ,
models’ constructed on their basis, and the further smoothing of the data
models in which for example sequences or discrete variables are replaced
by continuous functions.18 But the conceptual problems—such as the
‘measurement problem’ of quantum mechanics—refers to the individual
outcomes of measurement interactions, modeled in the above fashion. I will
return to this distinction between data and data models below. The point
is that, however we conceive of this, the above coherence constraint will
have to apply to how the data from measurement are to be accommodated
by the theory’s theoretical models.
Veracity reconsidered
The criterion imposed on the physical correlate of measurement is as
strong as possible, given the general form of a physical theory that was
under consideration. That form, in turn, was kept very permissive, so as to
allow for the form that quantum mechanics took when it was formulated
definitively c. 1925. As a result, the conditions on measurement had to
allow violations of the two ‘classical’ principles that we had noted: Value
Definiteness and Veracity in Measurement.
As to Value Definiteness, nothing in the Criterion for the physical
correlate of measurement precludes the observable A to have no value (or
only an ‘unsharp’ or ‘fuzzy’ value) at the outset. In fact it is not even
implied that the pointer observable B will have a specific one of its possible
values in the final apparatus state. All that is implied is that if measurements
be made, to measure those observables, the possible values will appear as
outcomes with certain probabilities. Value definiteness is not implied in any
sense, way, or form.19 Veracity is implied only in the very much weakened
form of accord among conditional probabilities. But if the theory specifies
nothing beyond those conditional probabilities then no stronger criterion
can even be formulated for the physical correlate of measurement, to the
extent that the theory can cover that.
If we now return to the empirical assertions, we are not bound to stay
within the theoretical description, and we can refer freely to the actual
outcomes of measurements, as typically summarized in data models. These
data models are constructed from the raw data that are actually gathered,
so we are dealing here with actual frequencies, and probabilities that have
a good fit to those frequencies. Suppose now that such a ‘summary’ is
pretty well a picture of a state in a theoretical model, to the extent of
: 155
displaying probabilities derivable from that state. Then the theory may, and
generally does, have implications for how that state evolves in time.20 Thus
predictions can be made about what will be found if new measurements
are made in that situation at a later time.
As an example, we can take the Stern–Gerlach apparatus, named after
Otto Stern and Walther Gerlach’s famous 1920 experiment. There is a
classical picture behind the idea of the experiment: imagine a ‘beam’ of
particles being emitted, with a particle being like a classical dipole with
two halves of charge spinning quickly. In a magnetic field, such a particle
will begin to precess. This way the particle’s position becomes perfectly
correlated with its spin value. That is also the result, mutatis mutandis, with
the situation described in quantum mechanics. When the spin in a given
direction can have only two values, the beam is split into two separate
beams, ‘upper’ and ‘lower’. The early attempts to realize the experiment
encountered many difficulties (cf. Bretislav and Herschbach 2003). In the
first realization a beam of silver atoms (produced in an oven, at temperature
1000 ◦ C) was collimated by two narrow slits (0.03 mm wide) and traversed
a deflecting magnet 3.5 cm long with field strength about 0.1 tesla and
gradient 10 tesla/cm. The splitting achieved was only 0.2 mm, and there
were doubts as to the data obtained. (Of course the set-up is now described
in rather more ideal terms when it is used to illustrate quantum properties.)
The apparatus can be rotated, so as to measure spin in any direction.
Thus data on different spin observables can be collected on some samples
produced in the oven, and on the basis of the frequencies in those samples,
it is possible to infer—via the theoretical description and classification
of this process—just what state is prepared by that source.21 Then the
proportions of the output in the two channels in later measurements can
be predicted on that basis.
It would be illegitimate to conclude that the silver atoms exiting in
the upper channel were prepared in the oven in a state of spin-up.
Rather, the oven prepares a beam of atoms in a state which is such that
the probabilities of a Stern–Gerlach measurement having outcome up or
outcome down are definite, with the result that the relative proportions
in the two channels are definite. But still we can see now that Veracity is
honored at some appropriate level. The outcome does not reveal a prior
state for an individual silver atom, but the frequencies in the outcome do
give information about the prior state in which the source prepares what it
156 : , ,
sends out. If it were not so, the role of measurement would not be played
at all, since the outcomes would not serve to provide information about
what the measurement is performed on. In practical terms it is precisely the
source on which the measurement, taken as a whole, is performed.
What is measurement—number-assigning ?
We do find simplistic answers sometimes even in places where it matters.3
Lord Kelvin, with a well-deserved reputation as scourge of purported
sciences outside the ken of physics, wrote:
I often say that when you can measure what you are speaking about and can
express it in numbers you know something about it; but when you cannot
measure it, when you cannot express it in numbers, your knowledge is of a
meager and unsatisfactory kind [and] you have scarcely . . . advanced to the stage
of science . . . . (Thomson, Lord Kelvin, 1891: 80-1)
Kinds of scales
The guiding idea for the study of measurement scales is that the grading
must be thought of as reflecting characteristics of operations on something
physical—operations that can plausibly be called measurements in some
‘ordinary’ sense. Presumably Stevens was referring to this with ‘‘according
to rules’’, though what sort of rules for operating on real physical things
count as measurement recipes was left aside. The constraints on the possible
outcomes that he describes are designed so as to guarantee that what can be
measured is precisely what can be represented as graded on a real-number
scale.
Mathematically speaking there are many structures other than the real
number continuum. Even in the field called ‘‘foundations of measurement
theory’’, the conception is already so generalized that the standard numerical
version can only be seen as a particular case. The authors in that field
are aware of the limits of numerical representation, and Stevens himself
introduced the now commonly found distinctions:
nominal measurement is the assignment of (numerical) labels without
implying any algebraic structure;
ordinal measurement assigns a rank ordering;
interval measurement is ranking on a scale where only the intervals
between elements are numerically comparable;
ratio measurement is ranking on such a scale where there is also a
minimum, and the ranking can be represented by non-negative
numbers with the ratios between these numbers reflecting a
physical relationship as well.5
160 : , ,
Each of these categories has its examples. The Mohs hardness scale for
minerals is the typical example of an ordinal scale.6 If we ignore abso-
lute temperature then our thermometers, whether Fahrenheit or Celsius,
provide an interval scale—the Kelvin scale for temperature with its absolute
zero is a ratio scale.7
But Stevens’s taxonomy is not exactly a table of categories supported
by a transcendental deduction! It looks nicely hierarchical: we can suppose
that ordinal measurement must be a special case of nominal measurement
with the labels reflecting the ordering. Then each category besides the
nominal presumes an ordering which is linear: two assigned labels x and y
must either be the same or have one greater than the other.
As was much emphasized in nineteenth-century discussions, notably by
Mach and Poincaré even this ordering requires a contingent empirical
regularity for its coherence.8 To be able to order at all, even in its most
minimal logical sense, one needs at least a criterion of equality. Suppose
this be specified, and suppose that by this criterion A and B are both equal
to C. Does it follow that by the same criterion A will be equal to B? No,
it does not, at least not logically, if the criterion refers to a performable
physical test. But the relationship of equality, that is of having the same
position in the ordering, must be transitive or the ordering falls apart.
(Similarly of course for the relationship of greater than or after if applicable).
What criterion is proposed is up to the proposer, and if there is a a variety
of plausible candidates for the criterion then this may be a matter of choice
or even convention. But whether a proposed criterion can be adopted will
depend in part on contingent empirical regularities, on pain of incoherence
in practice.
Returning now to ordinal measurement: the numerical relationships
between the assigned numbers are not all of them significant. If we rank-
order some items by assigning the numbers 1 to 10, merely on the basis
of some ‘greater than’ relationship, there is no significance implied by the
fact that 7 is as much greater than 5 as 5 is than 3, or that 6 is 3 times as
great as 2.
What is significant is what is invariant under ‘re-scaling’. This is not an
explanation; it is a remark that connects various notions connected with
each other. Take the example of a nominal scale: the members of a soccer
team are numbered 1 through 11. It is important only that different players
receive different numbers, the ordering does not matter. That is equivalent
: 161
numerical statements are those that remain invariant under permissible changes in
the representation. That, however, is very similar to saying, in the more complex
case of physical laws, that the representation must be dimensionally invariant. So,
on the face of it, there appears to be a close relationship between meaningful
statements in a single attribute and dimensionally invariant laws stated in terms of
several attributes. (Luce 1978: p. 3)
Approximative measurement
We need not go far afield to find that linear ordering in a scale is too
restrictive, in general. After all, a measurement outcome is not infinitely
precise: the length of the table is registered e.g. as 100 plus or minus 1 cm.
So here the real outcome is not a number but an interval. These intervals are
ordered by inclusion, which is a partial rather than linear ordering. We can
introduce a notion of ‘strictly greater’, e.g. by the definition that one interval
is greater than another if all its elements are greater than all of the other’s
elements. But of course it does not follow then that the ranks are the same
if neither is strictly greater than the other—so the ordering is not linear.
If this practical point is granted, we have already left behind the idea
that measurement outcomes can always be represented as points on a
linear scale. For the class of regions—however delimited—that can be
indicated as found locations in this way is not a linearly ordered class. It
is a class partially ordered by set inclusion, or by set inclusion modulo
differences of measure zero, or some such relation. We should not call this
‘‘locating on an interval scale’’—that term already has an established use,
as we saw above. Rather, in this case, the object is located in the space
of intervals, or in the larger space of ‘Borel sets’ generated by countable
meet and join operations. This is the range most typically encountered
where measurement results are not assumed to be ‘point-like sharp’. There
an elementary form of statement, which can be either that of a theoretical
assertion or of a measurement outcome, relates a physical parameter to a
set of its possible values, which is itself linked to a defined region in a much
larger state space specified by a theory.11
164 : , ,
The HSB color space, with dimensions hue, brightness, and saturation is
a good example of a logical space, but so is the PVT space in elementary
gas theory, phase space in classical mechanics, Hilbert space in quantum
mechanics; space and time themselves also serve as examples.12 I sub-
mit the following generalization as the proper concept of a measurement
operation:
measurement is an operation that locates an item (already classified as
in the domain of a given theory) in a logical space (provided by the
theory to represent a range of possible states or characteristics of such
items).
: 165
As a special case, the logical space can be a scale, which may indicate
the location as being in a certain region of a larger space. Thus a pressure
measurement locates a gas in a region of the larger PVT space, a momentum
measurement locates a body in a region of its phase space. So the locating
is typically not in an exact point, but in a region. We already saw this
in connection with imprecise measurement, which assigns not numbers
but intervals, hence takes its assignments from a partial rather than linear
ordering.13 But this holds more generally, since what is measured is usually
only some aspect of that ‘field of possibilities’.
Thus measurement is an act of locating an item in a logical space. The
converse does not hold: you can locate me in the logical space of astrology
simply by asserting that my Sun sign is Aries. Above I added ‘‘performed in
accordance with certain operational rules’’, but by itself that only points to
another question. The astrologer’s or soothsayers’s or visionary’s operational
rules may not count as yielding genuine measurement. What precisely is
needed? That is precisely the heart of the ‘problem of coordination’, hence
requires looking into the joint evolution of theory and measurement. But
166 : , ,
we can add that once a stable theory has been achieved, the distinction
between what is and is not genuine measurement will be answered relative
to that theory.
Here is a good example: as Henry Margenau and Adolf Grunbaum
discussed, there are certainly procedures that look like simultaneous
position-velocity measurements of particles at any scale. But quantum
theory classifies them as having no such significance—for no operational
outcome can reveal characteristics that, according to the theory, the system
cannot have (cf. Grunbaum 1957: 713–15).
That this is how the results of measurement, and the complexity of their
relation to theory, must be conceived was an early and continuing theme
in Patrick Suppes’ work:
exact analysis of the relation between empirical theories and relevant data calls
for a hierarchy of models of different logical type. Generally speaking, in pure
mathematics the comparison of models involves comparison of two models of
the same logical type, as in the assertion of representation theorems. A radically
different situation often obtains in the comparison of theory and experiment.
Theoretical notions are used in the theory which have no direct observable
analogue in the experimental data. (Suppes 1962: 253)
The term ‘‘data model’’ is often used in the more general sense that does not
distinguish summarized relative frequencies from probability measures, and
168 : , ,
Recall that just now we are looking at measurement ‘from above’, that is,
at this construction guided by accepted theory, rather than ‘from within’.
Since the advent of quantum theory much thought has been given to the
form that any possible surface model must take. Consider an experimental
situation of a quite simple structure, involving several alternative measuring
arrangements, a classification of possible outcomes, and some probabilities
extrapolated from (imagined) observed frequencies. Then a surface model
can be thought of as specified by three factors:
(i) two sets of observable conditions:
(a) a set of realizable measurement choices—call it PRC, and
(b) a set of possible outcomes—call it PRS;
(ii) the surface state P; this is a function which assigns probabilities of
outcomes in PRS, conditional on measurements in PRC.
So P is defined on at least part of PRC × PRS and its values are real
numbers in the interval [0, 1]. This structure is subject to certain minimal
conditions which must guarantee that P is mathematically extendible to a
classical probability function.20 The numbers assigned by the surface state
we can call surface probabilities.
What about theoretical models? We already looked at this briefly above.
These need to be conceived without prejudice in favor of determinism or
causal modeling. The theoretical model could specify, in general,
a family M of observables (physical magnitudes) each with a range
of possible values;
a set S of states;
and a stochastic response function P s m for each m in M and s in S, which
is a probability measure on the range of m.
The number P s m is to be interpreted as the model’s specification of the
probability that a measurement of m will yield a value in E, if performed
when the state is s. From this we can at once see more or less what it shall
mean for such a theoretical model to fit a surface model. But not quite yet:
it only tells us the probabilities of surface phenomena, on the supposition
of a measurement and of a state. The latter is again something theoret-
ical, behind the phenomena. A stringent notion of ‘fitting’ could go as
follows:
A theoretical model MT fits an experimental model ME just in case
MT has some state s such that the function P s m contains the surface
170 : , ,
Lxa: the proposition that L has the xth setting and outcome a (a = 1, 0).
another. The two policemen we say, surely, have before them the contents
of two perspectives within the same space—literally in this case.
At the same time we should recognize that relative to the pertinent theory
there is no real significance to a choice between a family of spaces related
by a group of transformations and a family of coordinatizations of a single
space. Little but bookkeeping ease is involved in the choice between
speaking of different spaces, transformable one into another, and different
coordinate systems imposed on a single space. Yet if we think in the former
terms it seems less apt to speak of differences of perspective. Perhaps the
resistance is the more reasonable, the more we recognize that there is high
theory in play; and the one important thing is to pay attention to what
theories are playing the background role.
For measurement the distinction is essential between the ‘giving’ of an object through
individual exhibition on the one side, in conceptual ways on the other. The latter is only
possible relative to objects that must be immediately exhibited. That is why a theory
of relativity is perforce always involved in measurement.25
all only a representation of the target, and in general does not show what
that is like but only what it ‘looks like’ in that measurement set-up?
Let us honor these two views of what measurement does with the names
Measuring is Locating and Measurement is Perspectival. Are they in tension
with each other at all? One small point may help: what is perspectival
is not the action of measuring but the contents of the measurement
outcome, and locating is an action, not a content. Action and outcome
are two different kinds of things. But this distinction does not go all
that far: the measurement outcome does after all represent the target as
located in a certain logical space. If we understand Measuring is Locating as
meaning just that, we are back with two takes on the same thing, on the
outcome.
If we are to call measurement perspectival, we need to qualify and
elaborate if we are to arrive at an accurately made point. Let’s begin with
a classic passage in which Poincaré insists on the outcome’s relativity to
set-up:
Lorentz could have accounted for the facts by supposing that the velocity of light is
greater in the direction of the earth’s motion than in the perpendicular direction.
He preferred to admit that the velocity is the same in the two directions, but that
bodies are smaller in the former than in the latter. If the surfaces of the waves of
light had undergone the same deformations as material bodies, we should never
have perceived the Lorentz–Fitzgerald deformation. In the one case as in the other,
there can be no question of absolute magnitude, but of the measurement of that
magnitude by means of some instrument. This instrument may be a yard-measure
or the path traversed by light. It is only the relation of the magnitude to the
instrument that we measure, and if this relation is altered, we have no means of
knowing whether it is the magnitude or the instrument that has changed.26
ones definable from the results of local distance and time measurement
outcomes, that have the same value in different frames.27
Note well though that the invariance we are now discussing is not the
invariance that was cited as required for significance above. There we were
concerned with the transformations that connect all the members of an
admissible family of scales. Now we are discussing parameters for which the
value registered in a measurement outcome is the same under admissible
variations in the measurement set-up. As an example imagine again the
speed of a car measured by radar from a moving police car. This speed can
be registered on a scale of miles per hour or kilometers per minute, and so
forth. Leaving aside data about the police car’s own speed relative to the
road surface, what is registered is the speed of the target relative to the radar
source. If the policeman drives at a different speed himself, that relative
speed of the target will be different: it is not an invariant. If on the other
hand the radar is used to measure the target’s acceleration, the result will
be the same for any speed the police car may have. More precisely: where
Newtonian mechanics applies, the acceleration is the same in all inertial
frames, for all inertially moving measurement set-ups, while the speed is not.
There are therefore measurement outcomes that have no relativity left.
Generally, these are after instrumental outputs have been processed with
paper and pencil operations, with final outcome deduced relative to a
theory. This is an important point: a measurement and its outcome can
be complex, and include calculations and input from a model or theory.
Such a procedure still fits the general idea of an operation performed so
as to create a representation of the object; one that locates it in a certain
logical space, with a location that it does not have a priori. To see how the
activity signified by the slogan Measuring as Locating intertwines with the
fact that Measurement is Perspectival, let us take a look at some examples of
how ‘simple’ measurement outcomes are combined in such way to yield
an outcome of a thus created ‘complex’ measurement.
Think once more of celestial navigation in the days of sailing ships. Nav-
igating consists in locating oneself and guiding oneself from one location
to another. The first part is the measurement whose outcome will govern
that self-guidance.
What is one locating oneself in, in this case? In the grid, apparently
first proposed by Hipparcus: basically our system of latitude and longitude
which was also Ptolemy’s. In Ptolemy’s coordinate system as in ours, the
178 : , ,
We can see this as soon as the phrase ‘‘carries the information that . . .’’
is employed. To adapt an old example from Frege: to carry the information
that the Evening Star is within 15 degrees of the sun is not to carry
the information that the Morning Star is thus, although these are the
same object, and although therefore correlation with the position of the
Evening Star is automatically correlation with the position of the Morning
Star.
The intensionality of the concept of a measurement outcome consists in
the fact that it is something that has meaning. In reporting the outcome
one says, for example, that the pressure was [found to be] 17 psi; that
report is a sentence expressing a proposition. Even if, given the background
knowledge or opinion about the whole set-up, the pressure was necessarily
17 psi if and only if 17 = rT/V, the outcome was not that the value of rT/V
was 17—not even in the context defined by that background knowledge
or opinion.30
That the concept is intensional is not to be confused with its being
intentional. Literally, ‘‘intentional’’ refers to intention, but we take it
broadly to include purpose, goal, role, and function. To classify something
as a measurement outcome is to classify it as playing a certain role, namely
as the outcome of a process with a definite function. This is entirely in line
with the reflection that the activity of measurement belongs to technology,
and technological concepts have this dual character, of referring to physical
entities but partly (and essentially so) in terms of function.31
Fourthly, we must insist on the indexicality of the measurement outcome
content in general. That is easy enough to spot when the measurement
outcome is the indexical proposition that the iceberg is located 17 leagues to
the North-East. But it is an especially significant feature when the context
is more, rather than less, theory-laden.
Suppose for a moment that I take a pressure reading on the tire of my
car. The outcome can be reported simply and precisely as attributing a
feature to that tire. Where is the indexicality in that? But think of how
different a role this report plays from the assertion, written in the very same
words, in a historical account of what someone or other did somewhere.
For this outcome to be something useful for me, I must appreciate that
by means of this measurement operation I have also located my own
situation—which involves this tire—as having a place in the theoretical
region of pressure-graded objects.
182 : , ,
The most easily recognized cases of picturing are ones in which the resemb-
lance is not at a very high or abstract level: it is just a sharing of selected
properties. The subway map shares the topological structure of the subway
system, say; it is a picture of that system. But resemblance, we recall, can
be higher order: the spatial structure of a set of letters on a page may be
the same as the temporal structure of a set of events named by those letters.
The use of visual or kinematic imagery to depict things that are not visual
or kinematic is rife, and not excluded by our notion of imagery. So a meas-
urement outcome may well purport to give us information by means of a
selective resemblance to what is measured, although the pointer-observable
may be of a very different character than the measured observable.
Indeed, the Criterion for the Physical Correlate of Measurement entails
that resemblance at such a structural level is required to be implied by the
: 183
that the practice, both experimental and theoretical, stabilizes, and that
‘nature cooperates’ to the extent that, perhaps temporarily, no or less
resistance is experienced in this practice.35 For the time being, at least,
the expectations engendered by empirical predictions are satisfied, the
retrospective evaluation says that thus guided empirical judgment has been
well calibrated.
This page intentionally left blank
PA RT I I I
This page intentionally left blank
Structure and Perspective
In the preceding parts we have uncovered some conditions for the possibility
of scientific representation. Viewed in one particular way, all of these
conditions can be brought under one heading: the crucial role of use
and practice. Although description in language is at best one mode of
representation, this crucial link to use and praxis points us toward the study
of pragmatics: the study of language in which word-thing relations are seen
as abstractions from word-thing-user relationships.1
The asymmetry of representation and the possibility of misrepresentation,
for example, we saw to derive from use rather than from independently
specifiable relations between representor and represented. Nothing is a
representation unless it has a certain kind of role in use and practice. In
addition, besides their status as representation deriving from use, some
representors have a use, which they can have only in a context in which
indexicals and self-reference are available. While I gave maps as paradigmatic
example, this use is central to all the practical sciences, where scientific
representations are drawn on so as to apply scientific knowledge in practice.
This is well illustrated also by the problem of coordination, which was
seen to be unsolvable except in a context where some coordinations are
already achieved and present. Coordination, which assimilates theoretical
terms to the language in use, is not to be understood as a completely
explicit or conscious historical process. We cannot think of theoretical or
other newly introduced terms as made subject to principles of coordination,
except in a context where it is already possible to rely on other terms,
‘old’ terms, as ‘already coordinated’, as meaningful.2 Meaning and use must
indeed be bestowed on newly introduced terms, but this makes sense only
if we think of them as introduced into an already extant language, into our
own language in use.3
The distinction between what is newly introduced theory, and what is
language in which the instruments and measurements are already described,
is historically conditioned. The phrase ‘‘we already knew how to describe’’
signals reliance on our own language at that historical moment. How could
it be otherwise? There is no moment outside history, and at each moment
190 :
in history we not only can but must rely on the language in which we
conduct our business, the language we live in. This is Neurath’s insight
about mariners at sea, but extended to the conditions of possibility of
scientific representation in every respect.
The epistemic situation as here described has seen responses not just in
philosophy of science but in metaphysics, and not just in past centuries
but in our own time. We will follow the resulting problématique as it
developed through the twentieth century and into our own, starting with
the Bildtheorie of Hertz and Boltzmann as precursor to structuralism in the
philosophy of science. Structuralism about science is, roughly speaking, the
contention that scientific representation is of structure only. The obvious
question, what is structure and what is not?, is the first that any advocate of
structuralism must answer. The answers have tended to dissolve into vacuity
or inconsistency when pressed to precision—as we shall see, going through
Russell’s, Carnap’s, and Putnam’s arguments as structuralist conceptions
emerged, faltered, and took hold.
Structuralism unfortunately involved, during most of its problem-beset
history, another attempt to achieve the simultaneous interpretation of all
language and theory without relying on our prior language-in-use. Seen
in this way, reminiscent of Reichenbach’s attempt to conceive of and
solve the problem of coordination in an extreme form, structuralism too
pursued an impossible ideal.4 But just as we can see a real and viable process
of coordination behind Reichenbach’s reach for the ‘unconditioned’, so it
seems to me that we can see a genuine and viable sense in which structuralist
views of science are right, at heart.5
I shall advocate a version, an empiricist version, of structuralism. Once
again, the redeeming clues are to be found in pragmatics. The empiricist
view I propose will, I hope, do justice to the strong structuralist trend
found in philosophy of science without subordinating it to any form of
metaphysical realism, and without giving in to the attendant illusions of
reason.
8
From the Bildtheorie of Science
to Paradox
the old system of physics [sometime before 1900] was not like a single picture, but
more like a whole picture gallery; since every class of natural phenomena had its
own picture. And these different pictures did not all hang together; one could take
any one of them away, without affecting the others. In the future world-picture,
this will not be possible. Each one is an indispensable component of the whole
and, as such, has a specific meaning for observed nature; while, conversely, every
observable physical phenomenon must find its precisely appropriate place in the
picture. (Ibid.: 21–2)
The crucial point follows this immediately: ‘‘In this respect, it differs
essentially from ordinary pictures, which certainly need to correspond to
the original in some particulars, but not in all—a distinction to which, in
my opinion, physicists have not hitherto paid enough attention’’ (ibid.: 22).
194 :
On the contrary, that is precisely what the rivals in this view of science
explicitly contradict, when they speak in terms of ‘‘pictures’’ along the
lines made prevalent by e.g. Hertz and Boltzmann. Indeed, Planck is here
rejecting, in effect, the very core of the Bildtheorie while keeping its picture
terminology. For there is no point in emphasizing that science presents us
with representations of natural phenomena, if not to convey that success
in science will consist in constructing an image of nature that is adequate
in certain respects and trades on resemblance at best in part, as opposed to
constructing a true and accurate copy.
That this passage is not meant simply as a bit of futurology, a vision of
the best conceivable future, but an expression of what Planck takes to be
the defining aim of science, is then made clear:
A constant, unified world-picture is, as I have tried to show, the fixed goal which
true natural science, in all its forms, is perpetually approaching; and in physics we
may justly claim that our present world-picture, although it shimmers with the most
varied colors imparted by the individuality of the researcher, nevertheless contains
certain features which can never be effaced by any revolution, either in nature
or in the human mind. This constant element, independent of every human (and
indeed of every intellectual) individuality, is what we call ‘‘the Real’’. (Ibid.: 25)
The passage I quoted at the outset follows now, and just after that there
comes, at least to our eyes, a curious ending to his polemics. He wishes
simultaneously to withdraw from the ‘representation’ or ‘picture’ view and
to embrace it on a higher (deeper?) level:
Those great men did not speak about their ‘‘world-picture’’; they spoke about
‘‘the world’’ or about ‘‘Nature’’ itself. Now, is there any recognizable difference
between their ‘‘world’’ and our ‘‘world-picture of the future’’? Surely not! For the
fact that no method exists for proving such a difference was made the common
property of all thinkers by Immanuel Kant. (Ibid.)
not in all’’. But it is in Boltzmann’s writings that we see the contrary view
most clearly.6
Boltzmann presents his own point of view as deriving mainly from
Maxwell and Hertz, two of the heroes of the then recent achievements
in electromagnetism.7 Maxwell’s writings are not exactly unambiguous. In
fact he is often taken as postulating the reality of the ether and of the elec-
tromagnetic waves in the ether, while sometimes despairing of any purely
mechanical theory of their character. However, as Boltzmann emphasizes,
Maxwell speaks of the envisaged mechanisms as merely analogies, partial
analogies, that allow us to get an imaginative grasp on the equations. The
equations must on the one hand fit the observed magnetic, electrical, and
optical phenomena, and on the other hand allow of some understanding
of the theory as a description of a physical process. But as far as description
goes, we receive mainly analogies with other forms of material propagation,
diffusion, and interaction—with gases, fluids, and heat. Maxwell himself
cautions us against thinking of this as a true description of reality behind
the phenomena:
By a judicious use of this analogy [between Fourier’s equations of heat conduction
and the equations of the electrostatic field] . . . the progress of physics has been
greatly assisted. In order to avoid the dangers of crude hypotheses we must study
the true nature of analogies of this kind. We must not conclude from the partial
similarity of some of the relations of the phenomena of heat and electricity that
there is any real similarity between the causes of these phenomena. The similarity
is a similarity between relations, not a similarity between things related. (Maxwell
1881: 51–2)
the electric current, etc. But scientific accuracy requires of us that we should in no
wise confuse the simple and homely figure, as it is presented to us by nature, with
the gay garment which we use to clothe it. Of our own free will we can make no
change whatever in the form of the one, but the cut and color of the other we can
choose as we please. (Hertz 1962: 28)
In the first part of this passage we see the relationship pattern characteristic
of how a symmetry requirement is to be satisfied:10
picture 1 picture 2
event 1 event 2
Figure 8.1. Adequacy as Symmetry
evolution. This calculation provides models of the region for the next five
or so days, and Hertz’s constraint requires (naturally!) that those accurately
represent the conditions on those days—with the models’ success gauged
by some measure of accuracy. In this sort of example, there are few if
any hidden parameters characterizing unobservable entities, but the pattern
Hertz displays is general. It can be cited equally well as a constraint on
hortatory astrology (in which natal charts are progressed to forecast the
native’s life history) as on quantum electrodynamics, both of which are
replete with parameters characterizing unseen influences. Not to say, of
course, that this properly imposed constraint is satisfied in all cases.
Hertz’s constraint is a crucial condition for the objectivity of scientific
representation. Far from mere exercises of the imagination, merely adding
levels of fantasy to the known empirical realities, scientific representations
must allow us to go reliably from what we know to what we will or
can encounter further on. This is the empirical constraint on scientific
theorizing, here phrased in a form easily seen to be appropriate to a
structuralist (as opposed to a naïve realist) conception of ‘picturing’ by
means of models.
Alongside of this positive contribution to an understanding of how an
abstract theoretical science can provide representations of the empirical
phenomena, there was a good deal of polemics in the air. Boltzmann,
though always on the side of those advocating the kinetic and atomic
theories nevertheless, lecturing in 1899, expressed the heretics’ philosophical
point of view most trenchantly:
We know how . . . to obtain a useful picture of the world of appearance. What
the real cause for the fact that the world of appearance runs its course in just this
way may be; what may be hidden behind the world of appearance, propelling it,
as it were—such investigations we do not consider to be of the task of natural
science. (Boltzmann 1905a: 252)
So each side depicts the other as having strayed from the true concerns of
natural science into mistaken philosophical conceptions of their common
enterprise.
The ‘‘picture’’ and ‘‘image’’ imagery became pervasive in philosophically
reflective writing on physics by physicists, increasingly so during the
controversies over the interpretation of quantum mechanics in the late
1920s and 1930s. Bohr’s insistence on the use of the complementary
wave and particle pictures—neither of which can be regarded as faithfully
mimetic representations, precisely because they are mutually exclusive—is
too well known to bear repetition. But Erwin Schrödinger, who rejected
wave-particle duality, wrote in the same terms. Heisenberg’s uncertainty
relations, he wrote, ‘‘changed our conception [of . . . and] even what is
to be understood by a physical world image.’’11 This usage continues
throughout his writings:
we do give a complete description, continuous in space and time without leaving
any gaps, conforming to the classical ideal—a description of something. But we
do not claim that this ‘something’ is the observed or observable facts; and still less
do we claim that we thus describe what nature (matter, radiation, etc.) really is.
In fact we use this picture (the so-called wave picture) in full knowledge that it is
neither.’’12
We can certainly see major strands of anti-realism in all these writings, but
notice that it is not simplistic anti-realism—in fact, most of these writers
were actively involved in developing the new physics, including atomic
theory and quantum theory.
The Bildtheorie view of science takes a general form that is compatible, for
example, with what we know later under the name of structural realism. If
what science gives us by way of theories and models is to be conceived of as
pictures, as representations, then the question is opened as to just what the
relevant and appropriate criteria of adequacy are for them. An extreme view
would be that a good representation of that sort is one that corresponds
in every respect to what it is representing. That is not what even a quite
extreme scientific realist would say, since there is always a good deal of
mathematical artifice present. But it is approached, later in the twentieth
BILDTHEORIE 199
This is his reply to the objection that his theories may fit the phenomena
without being true, something he concedes quite readily:
. . . just as the same artisan can make two clocks, which, though they both
equally well indicate the time, and are not different in outward appearance, have
nevertheless nothing resembling in the composition of their wheels; so doubtless
the Supreme Maker of things has an infinity of diverse means at his disposal, by
each of which he could have made all the things of this world to appear as we see
them, without it being possible for the human mind to know which of all these
means he chose to employ. (Ibid.)
We can at the same time see that the entanglement with the question of
the scientific acceptability of atomic theory is only contingently involved
with this, as a currently salient example of scientific representation. Not
only was Hertz’s work crucial to the establishment of that theory, but his
sentiment concerning the theory’s representations is echoed almost literally
half a century or so later by Heisenberg:
The atom of modern physics can be symbolized only through a partial differential
equation in an abstract space of many dimensions . . . . All its qualities are inferen-
tial; no material properties can be directly attributed to it. That is to say, any picture
of the atom that our imagination is able to invent is for that reason defective.
An understanding of the atomic world in that primary sensuous fashion . . . is
impossible.17
What could be the use of theory thus conceived? Success with respect to
simplicity, completeness, and exactitude in a representation would speak for
itself, one might say. But something more is needed to show how we can
conceive of familiar ways of drawing on theory in prediction, application,
and practice.
Duhem responds by elaborating on this conception as allowing us to see
theory as providing a taxonomy in which to locate the regularities, objects,
and processes that are of practical concern. To begin with the regularities:
‘‘Theory is not solely an economical representation of experimental laws;
it is also a classification of these laws’’ (ibid., 23), thus bringing order and
internal connections to what would otherwise be a jumble of empirical
generalizations. Secondly, as illustrated by his discussion of taxonomy
in zoology, the theoretical descriptions of the materials at hand are to
be understood as classification. This is a radio: anyone can see that. If
204 :
Here Poincaré’s view is quite in harmony with Duhem’s, that the aim of
physical theory is to systematize experimental laws, by whatever theoretical
means lend themselves to that, and without any implication of reality for
the theoretical parameters.
is like. Mustn’t the theory also say what that something is? Here is the
dilemma:
• if Maxwell’s equations are statements, what do they say?
• if they are not statements, how can they amount to a theory at all?
If we leave aside the more instrumentalist (non-statement) options, we can
discern here two not very well-distinguished alternatives. The first is that
no, there is no ether, no mechanical medium subject to wave disturbances,
but yes, there is something, it is the electromagnetic field itself, which
is a thing, and it is not the shape or form of something else. Today
that is an often expressed view, perhaps not always clearly distinguished
from simple rejection of the classical ether: Fields in empty space have
physical reality; the medium that supports them does not (Mermin 1998:
753).
On this option, there is no puzzle, just a new ontology, some new and
previously inconceivable furniture for the world. The second alternative
is more agnostic, and so presumably closer to how Poincaré shies away
from claims to knowledge of the unobservable. It could be expressed like
this:24
• The equations only describe a form or structure—if that is the form
or structure of something, then that something is an unknown entity.
• The field is first of all an abstract entity (mathematical: e.g. a function
assigning values to points in space), though we can of course also
give the name ‘‘field’’ to whatever it is—if anything—that bears this
structure.
• That unknown bearer might well have other properties, just as
ordinary things have properties beside their shape. But the theory
does not describe those.
• Science abstracts, it presents us with the structural skeleton of nature
only.
To begin even this sounds rather reactionary, just when we have discarded
the ether and its frustratingly elusive qualities, there is reference again to
something, whatever it is, that ‘‘bears’’ the field after all. But in retrospect
we can understand it as the beginning of sustained attempts to develop a
structuralist view of science, and we shall follow this attempt through several
successive stages.25
206 :
and it is in some ways amazing that what Duhem taught there needed to
be understood all over again a half century later.
We are some distance here from a set of pure mathematical propositions,
and indeed, the sort of taxonomy that a scientific theory provides for
its domain does not seem to be something that could be presented in
the language of pure mathematics. So Duhem attempts an account of
the scientist’s theoretical language. It is distinguished by its special jargon,
foreign to the discourse of every day, and so we may be tempted to
assimilate it to the technical language found in a practical art or craft. ‘‘That
would be a mistake,’’ he writes, and continues:
BILDTHEORIE 207
I am on a sailing ship. I hear the officer on watch shout out the order: ‘‘All hands,
tackle the halyard and bowlines everywhere!’’ A stranger to things of the sea, I do
not understand these words, but I see the men on ship run to posts assigned in
advance, grab hold of specific ropes, and pull on them in regular order. The words
uttered by the officer indicate to them very specific and concrete objects, arousing
in their mind the idea of a known manipulation to be performed. Such, for the
initiated, is the effect of technical language.
Quite different is the language of the physicist. Suppose the following sentence
is pronounced to a physicist: ‘‘If we increase the pressure by so many atmospheres,
we increase the electromotive force of a battery by so many volts.’’ It is indeed
true that the initiated person who knows the theories of physics can translate this
statement into facts and can do the experiment whose result is thus expressed, but
the noteworthy point is that he can do it in an infinity of different ways. ( Duhem
1962: 148–9)
Weyl on isomorphism
So we have seen that while Duhem starts by regarding a physical theory
as a set of mathematical propositions, this quickly turns out to be not a set
of propositions in pure mathematics but in a scientific language involving
208 :
Having all the knowledge about color that theoretical physics and
physiology can provide, will Mary be able to classify the rose as red—and
if so, how?
Weyl would point out that the projective plane has many symmetries,
so that there are in effect many isomorphisms between the color space
and that mathematical structure. Transform an isomorphism appropriately
and you generate another isomorphism. Like Locke’s ‘‘inverted spectrum’’
problem, only worse!
So if what Mary knows is just this structure, that is certainly not enough
to identify which color is which. If her knowledge includes differentiating
connections of the colors with things outside the color space—e.g. of
what is filtered by a ruby—then the same problem reappears in principle.
If all she knows is the structure of the entire domain, colors and rubies
and whatever else may be included, that will not be enough to single out
specific features. Even if they happen e.g. to have a unique location in the
spatial configuration, that may not be one identifiable by Mary in her own
(though only partially accessible) frame of reference.
Can’t Mary do better than a disembodied mind endowed with a complete
mathematical representation of nature? She can indeed. Somewhat later in
the novel we read an imagined sequel, about the day when Mary will be
allowed to come out:
She glanced down at her own hands . . . sheathed in serviceable pigskin which
she was permitted to remove only at night, in total darkness, with the assistance
of the blind maidservant Lucy, thus preventing any inadvertent glimpse of the
pearly pinkness that—so she understood—tinted the translucent plates covering
the dorsal surfaces of her finger-ends.
Well, she would soon be able to see her fingernails along with many other
things . . . . (Ibid.: 154–5)
When the gloves are removed in broad daylight, she will see her fingernails,
and know where they are located in the color space, because she knew
already how to identify her nails and had already the knowledge that they
are pink.30 Suddenly that theoretical object, the color space, will be subject
to coordination (to that extent), she will be able to say ‘‘this is pink’’, and if
she can gather some more such clues, soon know her practical way around
the colored world. The change is that, at the point of success, she will
be able to locate any pink object she sees in the same region of the color
212 :
space—be able to say ‘‘this, in front of me, is there in the color space’’, and
the many symmetries of the projective plane will no longer respect what
she can discriminate.
This sounds like a way to success, but note how crucially it relies on
the indexical: what Mary must be able to say at this point is ‘‘these are my
fingernails; my nails are pink’’. So let us repeat the question that we posed
for the conclusion of Weyl’s reasoning, which was:
Is there something that I could know to be the case, and which is not expressed
by a proposition that could be part of some scientific theory?
There is some sense, undoubtedly, in which Mary can answer this question
affirmatively for herself—but in what sense?31 In the solution here ima-
gined, the crux is the self-attribution of a location in color space. Is that
the only way? This question we will pursue throughout Part III, and we
will return to it explicitly at the end.
9
The Longest Journey: Bertrand
Russell
with the area. But the extent of this defect depends on the curvature of
the surface, which may be very small in which case the differences from
a surface of zero curvature (Euclidean plane) would show up significantly
only in very large areas.
So Lobachevsky looked into astronomical data. He suggested that one
might ‘‘investigate a stellar triangle for an experimental resolution of the
question.’’ The ‘‘stellar triangle’’ he proposed was the star Sirius and two
different positions of the Earth at six-month intervals. But if there was such
a defect in the sum of the interior angles, it was still within the limits of
measurement error.
Indeed, for any defect you can name and any size, there would be
a curvature constant small enough to guarantee that the defect would
still be within those limits. So, if there are also limits on the size of the
regions we can inspect in this way, in the course of human history, the
difference between Euclidean and hyperbolic space may be beyond what
such measurements can reveal.
But there is a deeper problem. The measurement results will also depend
on what we take as measurement standard, and what presuppositions are
in place with that choice. This point can be graphically (if imperfectly)
illustrated by von Helmholtz’s examples of mirror-universes. Helmholtz
imagined that we are making measurements to determine whether our
space is Euclidean. Do the interior angles of a triangle add up to 180
degrees? Suppose they do. At the same time he imagines that we are
reflected in a huge concave mirror.
In that mirror we see little people moving around with rulers and ‘doing
the same thing’ as we do. Of course, they get the same results, and announce
that they live in a Euclidean space. We want to disagree; they are moving
around on a concave surface, and their measuring rods change length as
they work.
Mathematicians’ reaction
One response to the birth of non-Euclidean geometry had thus been that
the question might be subject to empirical test. This had not fared well,
in view of this relativity to the measurement standard, and was by no
means the only or even the most acceptable idea about this new subject
at the time. Another main reaction was that to understand non-Euclidean
geometries they had to be interpreted within Euclidean geometry.1 To
be intelligible they should be readable as strangely worded descriptions of
parts or aspects of Euclidean space. That this is possible was shown initially
for the hyperbolic plane by Beltrami in his ‘‘Saggio di Interpretatione della
Geometria Non-Euclidia’’ (1868).2 Beltrami himself expressed his goal as
maintaining Euclidean geometry as the one true theory of space.
Felix Klein perceived the limits of this effort and first attempted to
interpret both Euclidean and non-Euclidean geometries within projective
geometry. But very soon through Klein’s work, and even more radically
through Riemann’s, there came into being such a cornucopia of geometries
that any true theory of space underlying all of them could be very little
more than pure logic.
The mathematicians’ response was that none were privileged, all geo-
metries were on a par and certain to remain parts of mathematics. But how
do they relate to the physical world which we investigate empirically? With
the focus shifted to interpretation, through such puzzles as Helmholtz’s
mirror worlds, that problem has now taken on a very different shape. It is
no longer a straightforward question that could be settled by measurement.
Klein’s idea that a single projective space could be the domain of either
a Euclidean or non-Euclidean geometry introduced a new way of viewing
the matter. Thus with Klein, and also Lie, we get a different feeling:
they display the Euclidean and non-Euclidean geometries as pertaining
216 :
The view that Russell was resisting may aptly be called an early form
of structuralism, though it is anachronistic to name it thus.6 Russell insisted
to the contrary that there was something more to be known, and in
fact captured in scientific theories representing nature, than mathemat-
ics affords—something more than mathematical structure. (At times he
took this to an almost absurd extreme, asking even ‘‘wherein . . . lies the
plausibility of the notion that all points are alike?’’7 ) Certainly, for the
mathematician, all congruence relations, and all possible denotata of their
primitive terms, have equal status—but for the empirical sciences there
must be more, there must be a fact of the matter that goes beyond what
mathematicians can describe.
the same properties and stand in the same relations, as enter our direct
experience. Beginning with the theory of space, he appears to say precisely
that at first:
If, as science and common sense assume, there is one public all-embracing physical
space in which physical objects are, the relative positions of physical objects in
physical space must more or less correspond to the relative positions of sense-date
in our private spaces. There is no difficulty in supposing this to be the case. [. . .]
thus we may assume that there is a physical space in which physical objects have
spatial relations corresponding to those which the corresponding sense-data have
in our private spaces. It is this physical space which is dealt with in geometry and
assumed in physics and astronomy. (Russell 1912/1997: 30–1)
The reason he has given for this is that the objects are postulated in the first
place as causes to explain our sensations, and causation presupposes spatial
contiguity. But in that motivating discussion he was moving back and
forth between his rather strange ontology and the common sense picture.
Attempting to be more conscientious we see the gloss fading rapidly:
Assuming that there is physical space, and that it does thus correspond to private
spaces, what can we know about it? We can know only what is required in order to
secure the correspondence. That is to say, we can know nothing of what it is like
in itself, but we can know the sort of arrangement of physical objects which results
from their spatial relations. We can know, for example that the earth and moon
and sun are in one straight line during an eclipse, though we cannot know what a
physical straight line is in itself. . . . Thus we come to know much more about the
relations of distances in physical space than about the distances themselves. . . . We
can know the properties of the relations required to preserve the correspondence
with sense-data. . . . (Russell 1912/1997: 31–2)
Note well that now we are asserted to know, not the distances or other
spatial relations between bodies, but only the properties of those relations. That
is, we know what the abstract structure is. If we ‘‘cannot know what a
physical straight line is’’ we certainly cannot know such other relations in
physical space as congruence. All we know about that, presumably, is what
the axioms of a geometry can say about congruence.
Has the ‘‘real property’’ realism adopted in response to von Helmholtz,
Klein, and Poincaré been given up? That is not clear. What has been
given up certainly is any pretence to knowledge of those real properties
themselves.
: 219
Every analogy with familiar things, like waves in water, planets, and
billiard balls was already heavy with disanalogies. So Russell says: we
can only infer the properties of the properties, and the properties of the
relations—the type of structure:
We can know the properties of the relations required to preserve the correspond-
ence with sense data, but we cannot know the nature of the terms between which
the relations hold . . . . [A]lthough the relations of physical objects have all sorts
of knowable properties, . . . the physical objects themselves remain unknown in
their intrinsic nature . . . .’’ (Russell 1912: 32, 34)
The Analysis of Matter (1927) makes this precise: this structure is exactly, no
more and no less, what can be described in terms of mathematical logic.
The logic in question is strong, and today we would see it as higher order
logic or set theory. But still, how little this is! Science is now interpreted
as saying that the entities stand in relations which have such properties as
transitivity, reflexivity, etc. but as giving no further clue as to what those
relations are.
. . . whenever we infer from perceptions it is only structure that we can validly infer;
and structure is what can be expressed by mathematical logic. The only legitimate
attitude about the physical world seems to be one of complete agnosticism as
regards all but its mathematical properties. (Russell 1927: 254, 270)
directly under him two colonels, and directly under each colonel there are
two majors. How can I say that my neighbors instantiate this hierarchical
structure? To justify that I have to define a relation which has the same
properties as the relation directly under has in my model. So I arbitrarily label
my seven neighbors as follows: 1, 10, 11, 100, 101, 110 111. Then I define
the relation as follows: neighbor 1 does not bear to anything; any
neighbor with a label of form Y1 or Y0 bears to the neighbor labeled
Y; and that is all.
Of course there is no sense to the idea that I have discovered a
hierarchical structure in my neighborhood by carrying out this ‘‘pencil
and paper’’ operation. But if I were to say that ‘‘my set of neigh-
bors is hierarchically structured in the fashion of model M’’ means only
that there is a relation on that set which has the same properties
as the directly under relation in M, then the trivializing result follows.
For there is indeed such a relation on that set of neighbors provided
only there are seven of them. In general, equality of size between
two sets means just that there is a one-to-one correspondence between
them, and that correspondence can be used to single out a copy in
the one set of any relational structure there may be displayed in the
other.10
You can easily see that this is von Helmholtz’s move with concave
mirrors, repeated at a more abstract level. The very reasons that drove
Russell originally to ‘‘real property’’ realism, and then later drove him
away from that epistemologically uncomfortable position, have returned to
plague him again.
Russell capitulates
Russell capitulated. In a letter to Newman, he reverted to the ‘‘real
property’’ realism of his early days. The only difference is an update from
real spatial relations to real spatio-temporal relations:
Dear Newman, [. . .] It was quite clear to me, as I read your article, that I had
not really intended to say what in fact I did say, that nothing is known about
the physical world except its structure. I had always assumed spatio-temporal
continuity with the world of percepts, that is to say, I had assumed that there might
be co-punctuality between percepts and non-percepts. . . . And co-punctuality I
regarded as a relation which might exist among percepts and is itself perceptible.
(Russell 1968: 176)
: 221
and unimportant, we have no grip at all on what the assertion says about
the world.11 Newman in fact derided his own suggestion as a counsel of
despair.
One way to take Russell is that he accepted Newman’s suggested way
out but added something so as to regain informativeness. He added that
the ‘important’ relations are exactly those with which we have direct
acquaintance. This ties in with the epistemology in his earlier Problems of
Philosophy: to understand a proposition requires acquaintance with each of
its constituents. Since we do understand some general propositions it follows
from this that we are directly acquainted with certain Universals—all the
ones that appear as constituents in the propositions we understand. But are
we acquainted with all the arbitrary relations Newman recognizes? That
does not follow. For example, we understand the proposition that every
set has a well-ordering. That entails acquaintance with the (higher order)
Universal being a well-ordering, but not acquaintance with e.g. a particular
well-ordering of the real numbers. So within Russell’s epistemology there
was indeed a natural division into important and unimportant relations to
which he could appeal.
To complete the repair then, Russell has to say that a scientific theory
tells us something about the structure of certain relations with which
we are acquainted, instantiated by those postulated entities in nature with
which we are not acquainted, and about whose qualitative properties we have no
idea.12
But now we can also see that the repair is still not finished. There is
the danger that it does not go far enough, and may still leave us with
very uninformative scientific theories, so two tasks remain. Task One:
those ‘‘certain relations with which we are acquainted’’ must be specified
to some extent. Russell clearly saw this, for here we arrive at Russell’s
specific addition to Newman’s way out: ‘‘ spatio-temporal continuity [of
the physical world] with the world of percepts’’. At the very least the
terms ostensibly spatio-temporal in the theory denote the very spatio-
temporal relations with which we are directly acquainted, and they relate
the postulated entities both to each other and to the observed entities of
our direct acquaintance. But this is a postulate: Russell is telling us that
the ostensibly spatio-temporal terms are univocal, that they denote the
same relations when they appear in physical theory as when we use them
ordinarily.
: 223
Task Two: to show that this sort of construal of theories, with that
elaboration added, now has the resources to be sufficiently informative.
Suppose that the space-time geometry of the physical world is indeed
as science says it is, but when it comes to any other relations and prop-
erties attributed to those physical things out there, science can only
specify the formal structure. Then won’t Newman’s argument apply
again?13
The assertion that there are certain relations with a given formal structure
on those things will be automatically true, provided only it is logically com-
patible with the space-time geometry in question, plus some assumptions
about cardinalities. To put it the other way around, any additional physics
beyond the space-time geometry might then give information only about
cardinalities. That is equally disastrous unless the entirety of physics reduces
to providing space-time models. Such was the dream of W. K. Clifton’s
‘‘space theory of matter’’, revived eventually in ‘‘geometrodynamics’’, and
may still be alive in some programmes in the foundations of physics.
But it is certainly not something that we would be ready to accept as a
priori certain, I would think, nor something to which we would want to
indissolubly connect a philosophical position.
Conclusion
So Russell’s extreme structuralism collapsed and his repair, through a
reversion to ‘‘real property’’ realism, does not obviously save him. In
effect, it takes him back to the point where he had hoped that some
admixture of structuralism with realism might make his peace with von
Helmholtz’s lights and mirrors. But there is something striking in his repair,
that alters the view he presents of science quite radically. In The Analysis
of Matter scientific theories are presented as being completely formulable,
without loss, in the language of pure mathematics. Such a formulation
would involve no direct reference to what we encounter in experience,
let alone indexicality, self-reference, or self-location. Reference to actual
individuals would be achieved entirely by objective description. This aspect
of his structuralist view of science is lost in the repair, when our direct
acquaintance with certain entities separates what science is about from what
logical gerrymandering concocts.
224 :
This story has its sequel in writings by Rudolf Carnap, Hilary Putnam,
and David Lewis, where we will recognize the themes from Helmholtz,
Russell, and Newman in a new setting. The problems are transposed
there to a context governed by concerns in philosophy of language and
analytic metaphysics. They become clearer there and so, if anything, more
devastating. But when the problems become clearer, so do the possibilities
of solution. Is there after all a viable form of structuralism about science?
I shall argue for a specifically empiricist structuralism, which escapes
trivialization by recourse to resources that we have been encountering all
along the way: the role of indexical and ostensive reference.
10
Carnap’s Lost World and
Putnam’s Paradox
There is a certain type of relation description which we shall call structure description.
Unlike relation descriptions, these not only leave the properties of the individual
elements of the range unmentioned, they do not even specify the relations
themselves which hold between these elements. In a structure description, only
the structure of the relation is indicated, i.e. the totality of its formal properties.
(section 11, p. 21)
How does Carnap deal with this problem? He admits that it looks as if the
use of definite descriptions will be successful only if eventually it relies
on some ostensive description—that is, some recourse to pointing or other
indexical or demonstrative form of reference. He takes that look to be
deceptive:
However, we shall presently see that, within any object domain, a unique system
of definite descriptions is in principle possible, even without the aid of ostensive
description. (pp. 24–5)
but mathematical models will capture only the structure that is common
to them. Thus to have specific knowledge of one of such a pair, we
must know more than that it is adequately represented by some such
mathematical structure. Secondly, as Klein and Helmholtz had pointed out,
given any significant limitation on what is observable or detectable by us,
there will be many non-isomorphic structures that fit what we do observe
or detect. I said above that these are two sides of one coin. In what sense is
that so?
Russell reacted to the latter problem by insisting that we have know-
ledge that goes beyond what can be mathematically represented: we are
acquainted with the physical congruence relation, whose mathematical rep-
resentation is just one of many congruence relations that can be imposed
on a projective space. One of these relations has a privileged epistemic
status, it is the one that ‘enters’ our experience.
Now we see Carnap opting for a precisely similar solution to the
former problem. He assigns a privileged epistemic status to certain relations,
and connects that status with what can be experienced. We hear an
echo here of the ‘problem of coordination’ which due to Schlick’s and
Reichenbach’s discussions must have been salient in Carnap’s world. The
problem Carnap has encountered pertains to the connection between
any structural representation and what it represents; preoccupied with
coordination, it is seen as relating specifically to the theory-experience
relation. We see the essential problem in much more general form when
Hilary Putnam takes it up some four decades later.
Putnam’s Paradox
At the American Philosophical Association in 1976, Putnam produced his
most famous argument against metaphysical realism. He called it his model-
theoretic argument. David Lewis called it Putnam’s Paradox, because, he
claimed, this argument would show that almost any theory at all is true.
For Lewis, therefore, the argument can be cited in support of a stronger
realism than the realism that Putnam targeted, namely a stronger realism
which can resist the argument. I will begin with what I shall call the core
of the argument.5
230 :
The argument
Suppositions:
1. has infinitely many pieces
2. Theory T is consistent
3. Theory T says that there are infinitely many things
Exactly one meta-mathematical result will be required in the proof; after
that it will proceed precisely like Newman’s argument against Russell. That
result is the Loewenheim–Skolem–Tarski–Vaught theorem, which shows
that if a theory has an infinite model then it has models of every infinite
cardinality.6
1. Since T is consistent and has (only) infinite models, it has a model of
the same cardinality as .
2. Let be a one-to-one correspondence between the elements of that
model and the pieces of the world. (This correspondence exists by
definition, given that the two sets have the same cardinality.)
3. Each term of the theory will have an extension in the model; DO:
assign to each term precisely the image under of its extension in
the model. Take that image to be its extension in .
4. Since all the theorems of T are true in the model (i.e. given the terms’
extensions in the model) all of them will also be true in
when given those images as extensions in .
5. So T is a true theory about .
theorem. Yet you balk . . . You have a good point; you would not call this
sheet a map because in fact, although the function exists, it is not true that
you can regard this sheet as a map of Paris.
To illustrate the same problem and at the same time the ordinary,
acceptable use in which the slippage from ‘‘there exists’’ to ‘‘we have’’
causes no problems, take this simple problem in geometry:
The Euclidean sphere of radius 1 can be coordinatized as the set of
points (x,y,z) in R3 satisfying x2 + y2 + z2 = 1. Determine the distance
along great circles between two arbitrary points on the sphere as a
function of the coordinates.
What is asserted in the opening sentence? That there exists a suitable
mapping of the sphere into the set of triples of real numbers. The ‘‘can’’
in ‘‘can be coordinatized’’ has no literal significance, for how could we do
it? How could we select a point to be assigned (0,0,1) for example? In fact,
the sphere as a mathematical entity has perfect rotational symmetry, so any
description of any point on it applies equally to all points—unless it is a
description that relates some points to things outside the sphere, in which
case we are not considering the sphere by itself.9
Going back now to the context of Putnam’s argument we must similarly
conclude the following. As long as we are not given an independent
description of both the domain and range of an interpretation, we do
not have any such interpretation, nor any way to identify one. Given an
independent description of the interpretation’s domain and range, however,
whether the theory is true under that interpretation depends entirely on
how the mapping is defined, using those descriptions. Remember here
the Water Theory I gave above: if we do not have our own resources to
describe both this theory (its language and axioms) and the world, then we
can’t have an interpretation of it. But if we have those resources, then we
can insist that all admissible interpretations must assign water as referent
to the word ‘‘water’’—and upon that interpretation, the theory’s truth or
falsity is a contingent matter.
On my reading, Putnam spells this out quite clearly. In ‘‘Realism and
Reason’’, where the argument was first introduced, he defuses one point
with the laconic ‘‘because the world is not describable independently of
our description’’ (p. 496). And in ‘‘Two philosophical perspectives’’ he is
very explicit: ‘‘This simply states in mathematical language the intuitive
’ ’ 235
fact that to single out a correspondence between two domains one needs
some independent access to both domains’’ (Putnam 1981: 74).
It is of no earthly relevance to know that there exists some such function
such that it, if only it could be described, would furnish an interpretation
under which the theory is true. The only question of interest for us, if we
do really have such an interpretation, is whether the theory is true under
that interpretation.
What happens if we want to apply Putnam’s argument to our own
theories formulated in our own language? We shall be able to grasp such
a theory if we can grasp an interpretation of the language in which
it is formulated (i.e. our language!). Well, as noted, we can grasp an
interpretation—i.e. function linking words to parts of —only
if we can identify and describe that function. But we cannot do that unless
we can independently describe . So Putnam’s model theoretic
argument, if applied to our own language, meets up with a dilemma:
(A) if we cannot describe the relevant elements of , neither
can we describe/define/identify any function that assigns extensions
to our predicates in ;
(B) if we can describe those elements of then we can also
distinguish between right and wrong assignments of extensions to
our predicates in .
For example, on alternative (B), we would insist that a right assignment of
an extension to ‘‘water’’ is water, and that all other assignments are wrong.
That the theory is true on some other such assignment is simply irrelevant
to whether it is true.
I take it that this is also how Putnam meant the apparent paradox to
be resolved: by noting the dilemma posed by (A) and (B). The response
is Wittgensteinian, in that it focuses on us, on our use of theories and
representations, and brings to light the impasses we reach when we abstract
obliviously from use to use-independent concepts. In terms of our prior
discussion, we must emphasize the crucial role of the indexical here. A
theory says nothing to us unless we can locate ourselves, in our own
language, with respect to its content.
This page intentionally left blank
11
An Empiricist Structuralism
if we are to feel its proper impact, but we can bring it out properly
in successive stages. To do so, I will begin by revisiting Reichenbach’s
problem of coordination in a more abstract form.
(*) the function relates two structured sets, call them S(A) and B:
S(A) = SA1, SA2. where:
SA1 = the set of parts of A,
SA2 = the family of sets which are extensions of relations
on these parts
B is a mathematical object, representable correspondingly in the same
general form Set, Relations.
And just what is this dividing up? It is nothing more nor less than the act
of representing A as having SA1 as its set of parts—i.e. of A as consisting of the
members of SA1.
Would all be well for the realist response (which thinks of the coordin-
ation as a simple two-place relation between the two items) if there
were only a single, unique way of representing A in this manner? The
question is moot. For, as a matter of fact, and as we have seen before,
as long as A can be represented as having a set of parts of the same
size as that of B, some such relationship as (*) above will certainly exist
between A and B! That is precisely the point that (by now, famously)
trivialized Russell’s structuralism and drives Putnam’s model theoretic
argument.
The realist has one final gambit, namely the one that we already saw
in David Lewis’s introduction of natural classes. S/he insists that there is
an essentially unique privileged way of representing: ‘‘carving nature at the
joints’’. There is an objective distinction ‘in nature’ between on the one
hand arbitrary or gerrymandered and on the other hand natural divisions of
A into a set of parts, and similarly between arbitrary and natural relations
between those parts. So A is already a highly structured entity, and the
abstract relational system we need is a precise copy thereof. We could now
ask about this ‘copy’ relation; but there seems no need to press that. For
what precisely is this gambit, this insistent assertion?
It is a postulate (if meaningful at all). What can it possibly mean? The
word ‘‘natural’’, its crucial term, derives its meaning solely from the role
this postulate plays in completing the realist response. Honi soit qui mal
y pense they’ll tell us.12 Unless we subscribe to this sort of metaphysics,
and something like a substantial correspondence theory of truth and
representation, the realist maintains, we cannot make sense of our own
practice.13 But their point will be moot, and the postulate superfluous, if
the problem dissolves upon scrutiny, as I shall now argue.
population density, and the second cannot. So for the first we face the
question of what the adequacy of a model—the conditions under which
it accurately represents its domain—amounts to in a concrete practical
setting. For the second scientist we have to understand how a phenomenon
somewhere and somewhen, which is not encountered in human experience
or targeted in actual measurements or observations, can be said to ‘‘fit’’ a
theoretical model. There are of course many examples of the latter sort,
some much more recondite or abstruse than dinosaurs—think for example
of extra-galactic phenomena, even ones that a theory classifies as beyond
our ‘event-horizon’ so that we are precluded from ever gathering even
very indirect evidence about them.
Metaphysical postulates of the sort we just discussed above, concerning
‘structure in nature’, seem to me to be specially inspired by the latter case,
of the phenomena that are not encountered in our practice. On the one
hand, theories have many models that are never actually constructed by
any one—in the sense that a mathematician can say that a certain equation
has many solutions that are never actually written down by anyone. On
the other hand, those phenomena are never actually represented, in any
concrete way by anyone. With human participation entirely foreign to the
context, what can constitute adequacy or truth except a direct theory-model
to nature relation? How can we assert that those unknown phenomena
fit models of our theories, except in a sense that implies the metaphysical
realist’s postulation of ‘structure in nature’, consisting of universals or the
like? That rhetorical question present the first of the main dangers for the
empiricist that I want to take up here.
The second danger to the empiricist, implicit in discussion of how theor-
ies or theoretical models can represent phenomena when those phenomena
have actually been subject to measurement, is subtler. There the pertinent
‘matching’ seems so obvious and clear: the theoretical model and the data
model are both abstract structures, and it is not difficult to understand how
the latter may be embeddable in the former. But a theoretician at a desk,
matching his theories’ equations to graphs or density functions or the like,
is not someone directly in touch with nature. So doesn’t a reflection that
focuses on the data models for assessing empirical adequacy, lose contact
with reality altogether?
247
model was adequate with respect to its purpose, for example, whether
the representation was accurate with respect to the rate of growth in
volume or in numbers, as measured by this sort of clock or that, . . . . By
hypothesis, no model for this particular phenomenon was ever offered, so
the point is moot. Yet we would like to say that if the equation does have
such a solution—equivalently, if the theory has such a model—then that
(equation, theory) does correctly represent that phenomenon.
It seems to me that the only points to be made in response to this are
verbal. If the theory is offered, that amounts to the offer of a range of
structures—the structures we call models of the theory—as candidates for
the representation of the phenomena in its domain. If this range contains a
candidate that would satisfy the structural constraint—if the phenomenon
is embeddable in it, understood in the innocuous sense explained by the
above examples—then the theory is empirically adequate (and indeed true,
if the domain contains only observable phenomena).
Perhaps it would have been better if the word ‘‘model’’ had not been
adopted by logicians to apply to structures never offered in practice. For
undoubtedly, in many contexts, something is called a model only if it
is a representation, and the sense in which any solution of an equation
is a model of the theory expressed by that equation certainly does not
have that meaning. But it is too late to regiment our language so as to
correct that, and we will just need to be sensitive to usage in different
contexts.
That is, the phenomenon, what it is like taken by itself, does not determine
which structures are data models for it—that depends on our selective
attention to the phenomenon, and our decisions in attending to certain
aspects, to represent them in certain ways and to a certain extent.20,21
Have we now lost or sidestepped Reichenbach’s problem, or have we
rather landed in it ourselves? Have we in effect succumbed to a post modern
il n’y a pas de ‘hors-texte’? No, but the way to see that we have not ‘lost the
world’ will explain why the sea-change I propose is of the sort that tends
to be described as a Wittgensteinian move.
Well, this is Princeton. Still, to begin at least I do not realize that the
questioner is a metaphysician. So to begin I take this as a legitimate
request for more information about how I arrived at the graph. I describe
the measurements that sampled values of various parameters over time,
how we had set up stations for deer observation, the procedures fol-
lowed and the precautions taken against sampling bias. The challenger
responds:
Yes, I understand, I can see that you carried out those procedures
diligently and responsibly, and that the outcomes are summarized
properly in your graph. But the question remains: theory T fits the
summarized outcomes of your measurement procedures, but does T
fit the actual deer population growth in Princeton?
Now I am beginning to see that the question relates to a rather more
foundational worry, and I think that perhaps I can guess what it is. So
I explain that I did take guidance from the theory, that the parameters
measured were precisely those that T treats as relevant to population growth
in this sort of case. I add that although the measurement procedures were
mostly standard in this scientific area of inquiry, it was also in light of
T that I took for granted in some cases that values obtained for some
parameters gave information about the values of certain parameters less
accessible to direct measurement. But I add that none of this biased the
inquiry so as to guarantee that the results would be in accordance with
T. The measurement outcomes could have been quite different, while the
actual outcomes were, as I just showed, in accordance with T.
To the challenger this signals a complete misunderstanding of his concern.
I took his worry to be about a problem in epistemology! But his interest
is in what is really the case, and how we can understand how things really
are, independently of our knowledge and of any procedures followed to
gain knowledge. So he tries to convey that to me:
I understand that in this case your claim to knowledge about the
deer population growth in Princeton is warranted. But there is still
the real deer population growth, which is something in the world,
distinct from anything in your graph, distinct from anything in the
content of your warranted knowledge claim, distinct from the object
of knowledge that you have constituted in your practice (put it how
256 :
use and locate ourselves in to represent the pertinent features of that region.
This point transposes to the use of any model, and pertains specifically to
the assessment of whether a theory succeeds in correctly representing what
it is meant to represent. Take any such assertion as
The exponential function represents in smoothed summary form the growth of
the investigated bacterial colony.
Despite appearances, that is an indexical statement: the phrase ‘‘the invest-
igated bacterial colony’’ receives its reference from the context in which
it is used. The assertion is not made true by anything that can be formally
described within semantics, understood as limited to word-thing relations,
ignoring the role of the user and context of use.23
Nor does the phenomenon, what it is like, taken by itself, determine
which structures are data models for it. That depends on our selective
attention to the phenomenon, and our decisions in attending to certain
aspects, to represent them in certain ways and to a certain extent.24 So once
again we arrive at the conclusion: there is nothing useful to be found in
2-place structure-phenomenon relations alone when we try to understand
representation. Anything we see by way of such relations is something
abstracted from the 3-place relation of use of something by someone to represent
something as thus or so.
The empiricist reply must be, in effect, the step that leaves the entire
game of metaphysics behind, and frees us forever from its illusionary charm
and glamour. But just because it is the step out of that so insidiously
enchanted forest into realistic common sense, it will have to be a very
simple one. On the one hand, we must immediately admit that:
the claim that the theory is adequate to the phenomena is not the same
as the claim that it is adequate to the phenomena as represented by
someone (nor as represented by everyone, or anyone).
After all the phenomena are actual objects, events, and processes, while
the representations that we or the scientific community construct of them
are the products of our independent intellectual activity. Yet on the other
hand it is clear that
if we try to check a claim of adequacy, then we will compare one
representation or description with another —namely, the theoretical model
and the data model.
What are we to make of these two points, taken together?
For us the claims
(A) that the theory is adequate to the phenomena and the claim
(B) that it is adequate to the phenomena as represented, i.e. as
represented by us,
are indeed the same!
That (A) and (B) are the same for us is a pragmatic tautology. That it holds
depends crucially on who the indexical word ‘‘us’’ functions to denote
in an assertion. Appreciating that the equivalence for us is a pragmatic
tautology removes the basis for the Loss of Reality objection.26
A pragmatic tautology is a statement which is logically contingent, but
undeniable nevertheless. Similarly, a pragmatic contradiction is a state-
ment that is logically contingent, but cannot be asserted. That is possible,
because the logical contingency pertains to its content, while deniability
or assertability is a concept pertaining to use. Assertion, denial, calling
into question, and the like are actions by a language user. The semant-
ic status (e.g. expresses something contingent) and the pragmatic status
260 :
(e.g. is deniable) diverge quite obviously when the statement has some
indexical or context-dependent element or feature.
The classic example is Moore’s Paradox, which I already cited above.
But in the cases we are concerned with, there is often no explicitly indexical
word or phrase to be seen. That does not settle the matter. Consider for
example ‘‘There are no statements’’. This is logically contingent since there
are statements in the actual world but there are also logically possible worlds
where there are none. Yet it is not assertable, since its assertion would
be a statement—someone asserting it would stand convicted of falsity by
that very act. In any context of use, the statement is false in the world of
which that context is a part. Yet the statement does not have in it such an
indexical as ‘‘I’’ which plays the crucial role in the usual examples, as in
Moore’s Paradox.27
Similar remarks apply to Tarski’s famous schema ‘‘ The sentence ‘—’
is true if and only if—’’. Suppose we fill it with ‘‘Grows is’’ or with
‘‘Grnunkj’’. Then we do not obtain a truth, in the first case because we
do not have ‘‘Grows is’’ as a sentence of our own language, and in the
second because we do not acknowledge the word ‘‘Grnunkj’’ as a word of
our own language at all. The distinction expressed with the indexical ‘‘our
own’’ is crucial, but is not shown by the words in the sentences under
discussion. We can make the distinction in a general way by asking about
sentences both whether they could be true and whether they can be asserted
by us. If the world had been such that our language had developed so that
the word ‘‘snow’’ had been a word for grass, then ‘‘Snow is white’’ would
not have been a true sentence, though snow would still have been white.
So the Tarskian equivalence is not a necessarily true statement, but its
contraries are not assertable by us for sentences in our own actual language.
How does this apply here? We can sum up the relevant point quite
simply: in a context in which a given model is someone’s representation of
a phenomenon, there is for that person no difference between the question
whether a theory fits that representation and the question whether that theory fits
the phenomenon.28
Think of what would happen to Moore’s Paradox if we treated pragmatic
inconsistencies like logical ones. We would appeal to the logical principle
that if (A & B) is inconsistent then A implies the denial of B and
conversely. So from the incoherence of ‘‘I believe that it is raining in
Peking, but it isn’t’’ we would infer ‘‘If I believe that it is raining in Peking
261
in modern times, still sings its siren song to today’s philosopher; I shall call
it the Appearance from Reality Criterion. But I shall argue that it must follow
the others into the dust, if we are to appreciate the new key in which
the sciences are now composed. Once rejected (as, I shall argue, it has
effectively been rejected already in scientific practice) we have the freedom
to follow the contemporary abstract structural forms now prevalent in the
advanced sciences without the unbearable constraint to satisfy a ‘‘realist’’
imagination.
This page intentionally left blank
12
Appearance vs. Reality
in the Sciences
Galileo’s Assayer
Galileo famously promised that the colors, smells, and sounds in the
experienced world would be fully explained by a physics among whose
272 :
As belonging to the class of things that are clearly apprehended, I recognize the
following, viz, magnitude or extension in length, breadth, and depth; figure, which
results from the termination of extension; situation, which bodies of diverse figures
preserve with reference to each other; and motion or the change of situation; to
which may be added substance, duration, and number. But with regard to light,
colors, sounds, odors, tastes, heat, cold, and the other tactile qualities, they are
thought with so much obscurity and confusion, that I cannot determine even
whether they are true or false; in other words, whether or not the ideas I have of
these qualities are in truth the ideas of real objects. For although I before remarked
that it is only in judgments that formal falsity, or falsity properly so called, can
. 275
be met with, there may nevertheless be found in ideas a certain material falsity,
which arises when they represent what is nothing as if it were something. Thus,
for example, the ideas I have of cold and heat are so far from being clear and
distinct, that I am unable from them to discover whether cold is only the privation
of heat, or heat the privation of cold; or whether they are or are not real qualities:
and since, ideas being as it were images there can be none that does not seem
to us to represent some object, the idea which represents cold as something real
and positive will not improperly be called false, if it be correct to say that cold is
nothing but a privation of heat; and so in other cases.7
choice and evaluation. Such criteria have often been explicitly formu-
lated. Often enough they were not just debated among philosophers, but
loomed centrally in famous episodes in the sciences themselves. Most
striking, though, is how time and again science has refused submission
to ideals imposed from outside, or even from its own past. Scientific
progress may and does involve rejection of previously proclaimed criter-
ia. We shall examine some here, related to necessity, determinism, and
causality.
saw it, the very coherence of experience requires that it takes a form of
experiencing ourselves as living in a spatio-temporally definite causal order.
The context in which physics was changing around 1900 included thus a
strong conviction, inherited from classical physics and modern philosophy,
that all phenomena in nature derive from an underlying deterministic phys-
ics. Determinism had become a criterion of completeness: any apparent
gap in determinism so far is filled by statistical laws, but the statistical
probabilities can only be a measure of ignorance. Poincaré begins one
chapter in his Science and Hypothesis of 1905 with ‘‘No doubt the reader
will be astonished to find reflections on the calculus of probabilities in such
a volume as this. What has that calculus to do with physical science?’’
But the question of the possibility of indeterminism was already salient
and in the air before it was related in any way to the quantum, due to
different views of statistical thermodynamics (cf. Stöltzner 1999).13 Max
Planck discusses that too in his famous 1908 Leiden lecture. There he
praised Boltzmann for ‘‘the emancipation of the concept of entropy from
the human art of experimentation’’ (p. 14), but notes that the price seemed
to be to relegate the second law of thermodynamics to a merely statistical
regularity that admits exceptions.
Boltzmann has drawn therefrom the conclusion that such strange events contra-
dicting the second law of thermodynamics could well occur in nature, and he
accordingly left some room for them in his physical world view. To my mind, this
is, however, a matter in which one does not have to comply with him. For, a nature
in which such events happen . . . would no longer be our nature. . . . (Ibid.: 15)
This is a strong profession of faith in determinism, but the mere fact that
it seemed appropriate to express this shows how the question could not
be ignored. Planck had an explicit antagonist in Franz Serafin Exner, who
defended an empiricism along Mach’s lines, rejecting as meaningless any
speculation as to whether there exist some unobservable deterministic laws
behind the phenomena (cf. Stöltzner 2002). So Exner emphasized that if
some domain appears to be subject to deterministic laws, these could be
the macroscopic limit of indeterministic basic laws governing nature at a
microscopic level. This theme was later taken up by Hans Reichenbach,
who coupled it with his view of how an indeterministic world could still
be ‘lawlike’ through his principle of the common cause.
280 :
The challenge to determinism was the first, foremost, and most vis-
ible philosophical confrontation to arrive for quantum mechanics. The
probabilistic resources of classical statistical mechanics had been newly adap-
ted in such a way that, as it seemed then, no grounding in an underlying
deterministic mechanics was possible. A vocal part of the physics community
was averse to seeking out the logical possibilities here—some rejected expli-
citly the task of finding or displaying such hidden mechanisms.14 Nature
is indeterministic, or at least it can be or may be—and if that is so,
determinism is a mistaken completeness criterion for theory.
Now Reichenbach, who did much to provide a rationale for this
rejection of determinism, introduced an apparently weaker but still sub-
stantive new completeness criterion: the Common Cause Principle.15 This
third principle is satisfied by the causal models of general use in the social
sciences, and for many purposes in the natural sciences as well. They are
models in which all pervasive correlations derive from common causes
(in a technical, probabilistically definable sense). But the demonstration
in the 1960s and later that quantum mechanics violates Bell’s inequalities
shows that the new physics was riding rough-shod even over this third
criterion.16,17
However that may be, I’ll now turn to a fourth completeness criterion
that seems compatible with these rejections, and appears to be quite gen-
erally accepted at least among philosophers and by the general public.
But it too was one clearly, emphatically, and explicitly rejected by the
Copenhagen physicists.18
to many of his contemporaries; they certainly did not return closer to the
familiar world of experientia.
But at the same time, as we saw illustrated above, Galileo accepted a
commensurate criterion of completeness, that physics must explain how those
appearances are produced in reality. This amounts to a stringent demand: that
this noisy, colorful, smelly but tasty world of appearance be fully explained
in terms of the attributes that science explicitly counts among its significant
parameters.
Science is understood to be incomplete until and unless it meets that
demand: I call this the Appearance from Reality Criterion.
Does this Criterion govern and guide the scientific enterprise as a
whole? Examples abound to make us immediately sympathetic to that
idea. We credit science with adequate and satisfactory explanations of how
many familiar phenomena are produced: how ash is produced when we
burn a cigarette or some logs, how methane is naturally produced in a
swamp. In the example of how a flame is turned yellow when a sodi-
um sample is inserted, an aspect of visual appearance is explained. But
we must look carefully into just what is demanded, how far it can be
pushed, and what resistances it may have encountered since those heady
early days.
A theory is not simply an empirical law or generalization to the effect that certain
observable phenomena occur, but an explanation of their occurrence that provides
some mechanism to produce them, or some deeper principles to which their
production is reducible. (Leplin 1997: 15)19
282 :
to Aristotle science must show why the phenomena and their regularities
must be the way they are, deriving from principles that are universal and
necessary. The seventeenth-century conception of laws of nature, only just
emerging from their theological gestation, delineate what is necessity in
nature. The Appearance from Reality Criterion is similar in this respect:
it too is a demand for explanation which is satisfiable only by connections
deeper than brute or factual regularity.
Therefore, when it is claimed that to be complete physical science needs
to derive the appearances from that reality, the term ‘‘derive’’ cannot here
just mean ‘‘deduce’’ or ‘‘predict’’. Required is a connection of the order of
explanation through necessity and/or causal mechanisms to be displayed,
which produce the appearances. When is this demand not met? It is not
met if science should simply issue successful predictions of measurement
outcomes. Prediction does not suffice by itself even if the prediction is by
means of systematic rules of calculation, from the state of nature theoret-
ically described. For calculational and predictive success does not ipso facto
imply an explanation of why and how the appearances are produced.
Both the way in which each of these notions is understood here, and the
way the distinction is marked, have to do with perception and perceptual
illusions or mere impressions. Nothing in my usage of these two terms
refers to these factors, unless very indirectly. How an observable object
or process (phenomenon) appears in the outcomes of the measurements is
itself an objective fact, a public, intersubjectively accessible fact. But there
is a similarity nevertheless to Kant’s insistence on—to coin a phrase—the
greater objectivity of the phenomenon. That is, we do have to insist on
the distinction: the appearance is determined jointly by the measurement
set-up (involving both apparatus and the system to which it is applied), the
experimental practice, and the theoretical conceptual framework in which
the target and measurement procedure are classified, characterized, and
understood.
While usage is diverse, I do not think that my usage is egregious
with respect to the historical scientific literature. For example Hertz,
commenting on his research on the effect of ultra-violet light, says in the
Introduction to his Electrical Waves ‘‘Inasmuch as a certain acquaintance
with the phenomenon is required in investigating the oscillations, I have
reprinted the communication relating to it [here]’’. The reprinted paper
to which he refers begins as follows (and there we can see what the
phenomenon in question is):
space and its consonant kinematics. This technique was mastered in practice
by the time of Copernicus, though formalized only by the end of the
seventeenth century. In contrast, the study and perfection of perspectival
drawing gave rise to the very different subject of projective geometry. That
too saw its first rigorous development in the seventeenth century, but was
then neglected, until coming into its own (with a unified treatment of
Euclidean and non-Euclidean geometries) in the nineteenth century.
This is not an incidental historical point. Both forms of representation
tend to be called ‘‘perspectival’’ and both tend to be thought of as
depicting the appearances. But they accomplish very different tasks. I
honor the difference between them with distinct terminologies. The
geometric representation of e.g. planets and planetary motion depicts the
phenomena. The content of a perspectival drawing of the same events is their
appearance, equivalently, the drawing depicts how they appear from certain
vantage point.29
At the risk of annoying by repetition: the phenomena, in the sense of the
observable parts of the world, whether objects, events, or processes, the
sciences must save (in the ancient phrase). These admit also of objective
and indeed purely theoretical description, which does not link their reality
to contexts of observation or to acts of measurement. What the latter, the
measurements, provide are how those observable parts of the world appear
to us in the corresponding measurement set-ups. That, the content of a
measurement outcome, is an appearance, and the sciences are accountable
to that as well.
In the first book I set forth the entire distribution of the spheres together with
the motions which I attribute to the earth, so that this book contains, as it were,
the general structure of the universe. Then in the remaining books I correlate the
motions of the other planets and of all the spheres with the movement of the earth
so that I may thereby determine to what extent the motions and appearances of
the other planets and spheres can be saved if they are correlated with the earth’s
motions. (Copernicus, De Revolutionibus, Preface.)
The above diagram (from Cohen 1960: 39, figure 10) graphically presents
the Copernican explanation of the apparent retrograde motion of an inferior
planet such as Mercury or Venus, by depicting how its motion would look
from a slower moving Earth also orbiting the sun. Copernicus’s model
represents the observable phenomena, that is, certain processes in space and
time. What the Copernican does in order to credential his representation is,
in effect, to explain by means of geometric optics and projective geometry
how the visual appearances (content of outcomes of measurements made by
astronomers) are produced from reality. Copernicus can demonstrate that
his theory ‘saves’ certain phenomena by showing how the visual appearances
derive (via kinematics and optics) from what he postulates concerning them.
Three-faceted representation
There was another discipline in Copernicus’s time, located between astro-
nomy and physics. It carried the melodious name of Theorica.31 The aim of
. 289
What should be our take on the Appearance from Reality Criterion today?
We need first of all to explore the conceptual possibilities, the ways in
which a science could feasibly come to violate the Criterion, and then ask
how these relate to practice in recent science.
Although my main concern is with the natural sciences, it is instructive
to see how the issue is broached in analytic philosophy of mind and
its take on cognitive science. The question of how the psychological
phenomena relate to the physical processes in brain, body, or body plus
environment—our new version of the mind–body problem—has seen
a series of answers more or less culminating in the supervenience thesis.
This move points us readily to logical and philosophical moves available
when outright reduction seems out of reach. In fact, that thesis echoes a
move familiar from another historical episode: Leibniz’s reconciliation of
contingency with the principle of sufficient reason, and both episodes can
serve us as guide to other such perplexities.
For recent scientific practice I shall argue that the Copenhagen develop-
ment of quantum theory exemplifies a clear rejection of the Criterion. The
famous Measurement Problem in the philosophy of quantum mechanics is
not a problem from an empiricist point of view.1 What it marks, I shall
argue, is the methodological rejection of the Appearance from Reality
Criterion in this new science.2 The rejection may not be unique in the
history of science, but is brought home to us inescapably by the advent of
the new quantum theory.
Even if that theory is superseded (or if fundamental physics develops
in a accordance with a new interpretation under which the Criterion can
be satisfied) our view of science must be forever modified in the light
of this historical episode. The Appearance from Reality Criterion was
292 :
(of the organism, or possibly the organism plus its natural environment, or
possibly of the entire universe).
Leaving aside the more metaphysical notions that are often involved in
the idea of supervenience, we can draw one clear consequence from it
that will reinforce this reading (though in slightly more technical terms).
Suppose that P and M are two languages, one truly physicalist and the
other mentalist—whatever that means, whatever might suffice for the
one to describe everything in terms aptly used in physics and for the other
to describe psychological phenomena (as well) in terms of psychology or
cognitive science. What is implied here by the thesis that the psychological
supervenes on the physical?
(a) To say that we have no possible reduction but do have super-
venience implies that M cannot be translated into P sentence by
sentence, nor paragraph by paragraph, nor even that the definable sets
of sentences of M are translatable into definable sets of sentences of P.
That is the ‘‘no reduction’’ claim.
I will explain the restriction to what is definable below. Taking that
for granted for now, what does the claim to supervenience entail? To
understand that, let’s think of all the possible situations that can be described
(however partially) in both languages. We can depict them this way if we
assume no special relation between the two languages:
Here the vertical bars depict the situations that satisfy a given maximally
complete description P(1), P(2), . . . in the physicalist language P. The
slanted bars depict those that satisfy given maximally complete descriptions
M(1), M(2), . . . available in the language M.
The letters ‘‘X’’ and ‘‘Y’’ name two specific situations: Both X and Y
satisfy P(1), while X satisfies M(1) but Y satisfies M(2). Now you can see
immediately that if there are such situations, then M does not supervene
on P! For suppose we are in situation X. Then if we would say ‘‘if the
mental phenomena were different then something physical would also be
different’’, the possibility of situation Y would contradict our claim.
So if we want to redraw the picture, so as to present the case in which
the mental is supervenient on the physical, then the lines that divide
complete descriptions in M would have to coincide with certain lines that
divide complete descriptions in P. In other words, roughly put: a complete
description in M has to be in effect a disjunction of complete descriptions
in P.
Note well: if we rule out reducibility, then those disjunctions cannot
be finite or recursively specifiable etc. That is why I wrote ‘‘roughly
put’’—I’ve stretched the word ‘‘disjunction’’ beyond its normal use. The
point is clear though: at the level of indefinable sets of possibilities, a
possibility in principle describable in M can be equivalent to a family of
possibilities in principle describable in P. But in principle describable does not, in
that case, refer to what is humanly or mechanically definable or computable
or recursively specifiable, and so does not imply reducibility. We can sum
up this conclusion, taking a little liberty with the notion of disjunction, in
this way:
(b) The syntactically complete descriptions of everything (to the extent
possible in M) correspond disjunctively to sets of sentences in P, as far
as truth conditions are concerned.
Putting the first and second point together shows perspicuously that the
correspondence just claimed must be between sets of sentences that are not
definable.6 Obviously no deduction could mirror this correspondence.
It would be beside the point to stop here to inquire into the plausibility
of this claim. What is important for us, once again, is a meta-issue. On this
view, that the mental or psychological only supervenes on the physical, what
becomes of the science of cognitive psychology? By classifying psychological
295
We are almost home; there is a relevant theorem that derives states for parts
from states of the whole. At this point one can deduce the character of the
Apparatus’s final state fin:
fin is a mixture of {|B, r(l), . . . , |B, r(N)}
This may look very nice! Couldn’t we say that this means that fin really
is one of the pure states {|B, r(1), . . ., |B, r(N)}? Couldn’t we add that
the measurement outcome is that A was measured to have value r(J) just
precisely if fin was |B, r(J)?
The adder in the grass appears precisely here. The answer is NO, we
cannot say that. The mixed state in which the Apparatus ends up is not
identical with any pure state.18 If we knew that an object was really in
some pure state, but did not know which, then it would be fine to attribute
a mixed state to that object, and use that as a basis for prediction. But the
converse is not the case; that a system ends up in a mixed state cannot be
equated in general with its really ending up in some pure state. Mixed
states are sui generis. And what is more, it is precisely states produced by
interactions such as happen in measurement that are demonstrably not
‘‘ignorance cases’’. This point can be richly illustrated with recombination
experiments.19 The idea for such experiments was originally suggested by
Eugene Wigner.20 In such an experiment, the reconstituted beam could be
once more in a coherent state. So a physical measurement interaction need
not, in itself, be such that the coherence is destroyed by the separation
and recombination. Therefore no information of entanglement was lost, as
would happen in a collapse. So now the perplexity can be stated as follows:
Supervenience?
Could the distinction between supervenience and reduction introduce
some leeway here? Suppose that the quantum mechanical representation
of nature, including dynamics, is complete, and that our statement ‘‘A is
measured on X during interval ’’ is true if and only if the pertinent
quantum states and their evolution during belong to some set depending
on A, X and ; call it S(A, X, ). So far that supposition is just a
stipulation, harmless under this supposition. But one could add: we humans
are quite good at recognizing whether or not the real situation is thus, but
S(A, X, ) is not definable within the language of quantum mechanics.
There are in general many such indescribable, undefinable sets for any
theoretical language no matter how rich we make it. Can we tenably add
that at the same time the relevant sets are describable perfectly well in the
ordinary language used also by laboratory assistants fairly ignorant of the
theory? Yes, we can, at least logically speaking, for there is the option that
the discourse to which ‘‘A is measured on X during interval ’’ belongs
may be precisely supervenient on, but irreducible to, that theoretical language.
But in fact, this simple move does not by itself bring us out of the
woods. While the logician may explore this loophole in abstract terms,
the physicist has a serious obstacle in store. For it certainly seems, both in
practice and in theory, that the physical conditions of measurement and
the physical correlate of the measurement outcome (the final state of the
apparatus) are perfectly well formulable in quantum mechanical terms. So
the obvious question is whether that physical correlate could be the same
even if the measurement outcome contents (the appearance) were different.
And that, unfortunately, seems to be obviously so: all the cases in which a
measurement of A has outcome value j, for j = 1, 2, 3, . . ., on a particular
occasion when none of them were certain, are cases in which the final state
of the apparatus is the same mixture of its possible pointer states. So the
appearances do not supervene on the physical state.21
An empiricist view
The view that I will now advocate will not be surprising given the way
I stated the perplexity in the preceding paragraph. But it does require
305
appreciating an attitude toward theory that has been mainly absent from
foundational studies of quantum mechanics.
For unqualified adequacy of the theory, what is required is that the
surface models of phenomena fit properly with or into the theoretical
models. The surface models will provide probability functions for events
that are classified as outcomes in situations classified as measurements
of given observables. Those probability functions need to be parts of
the theoretically specified Born probabilities for the same situation as
theoretically represented in terms of possible states and evolutions.22 The
matching required is between two families of probability functions, not
between the individual events summarized in the surface model and the
states represented in the theoretical model.
But is the identification of actual events in measurement situations—so
classified in practice—and states in the theoretical models for those situ-
ations not presupposed by the question of whether the surface models fit
with the theoretical models?
There is a good question hidden in this challenge, but also a presup-
position of its own. We can view the situation ‘‘from within’’ and point
out that in the laboratory practice, as it has evolved in fact, a given process
is classified as a measurement of a certain sort. Then, viewing the matter
‘‘from above’’, the theory must be able to classify it as a physical correlate of
that sort of measurement. That will restrict the class of pertinent theoretical
models. The observable said to be measured is to be represented by an
operator of a particular kind—Hermitean, or at least bounded, etc.—on a
given state space, and the apparatus as capable of a certain range of states
represented by elements of such a space, and so forth. But none of this entails
that what happens in the actual situation must be displayed as entirely identifiable
in the theoretical model.23 The most stringent demand that can be made here
is that the relative frequencies of certain events in this sort of situation
must have a good fit to probability functions, extrapolated from them in
surface models, which are identifiable as parts of corresponding probability
functions in the theoretical models.
When this demand is met—whether strictly or to some approxima-
tion—the theory is borne out by the experimental results, and can be
used to make predictions. Take for example the Stern–Gerlach experiment
that we briefly looked at above. Suppose that with a given source, the
frequency counts in the upper and lower channel show a proportion 2:1.
306 :
Starting with the assumption that the usual sort of theoretical model fits this
situation, the range of states possibly prepared by the sources is restricted
by the condition
PAs ({‘up’}) : PAs ({‘down’}) = 2:1
Anti-empiricist confusion
There is a charge sometimes made against quantum theory itself, as typically
presented in physics textbooks, but also against any empiricist view of
physics. That is that the theory is made out to be about measurements
rather than about nature, about what people [will] actually find rather than
about what happens. But that was not what physics was meant to be! After
all, it is said, there are events in the stratosphere, where no measuring
instruments are present, and physics applies to those events as well.
Certainly it does! But the objection trades on a stunted impression of
what is meant, either by such a textbook presentation of the theory, or by
an empiricist reading of it. First of all, the observable phenomena do not
include just actual measurements and their outcomes. They include all the
observable entities—objects, events, processes—that there are, have been, or
will be, whether observed or measured or not. Secondly, the theory provides a
representation of the whole of nature, which includes both observable and
unobservable parts. An empiricist will emphasize the term ‘‘representation’’
here, and recall all the ways in which a representation may only partially,
and perhaps distortedly, mirror anything real.
So the empiricist construal does not contradict the assertion that the
theory is about all that can happen, and not just about measurements.
Thirdly, the predictions to whose accuracy the theory is directly accountable
are predictions of what the appearances will be when something appears, i.e.
is measured. But it would be an oxymoron (or anthropocentric metaphor)
to speak of accountability to what cannot be checked in practice, to what
cannot be accounted.
In the case of quantum theory it is sometimes objected that we can check,
by actual measurement as well as with our own eyes, that the pointer on
a given dial coincides with a particular number, whereas any state we can
ascribe to that object will not be localized in a finite region of space for more
than an instant. The conclusion drawn is that observable things are after all
not accommodated in the theory: the correct representation of the pointer
is as an object wholly in a small spatial region for an extended period of
time and this we find nowhere in the quantum theoretical representation.
But this objection confuses the three levels: Theorica, phenomena, and
appearances. What is subjected to a position measurement, may well be
an observable object, yes. But the way it appears when it is photographed
308 :
it must be the sort of thing that can be believed, disbelieved, doubted, and so forth.
Thompson 2007 is the latest comprehensive account of the development of the
semantic approach, with special emphasis on its uses (in Thompson 1989 and Lloyd
1984, 1986, 1987, 1994) to theories in biology, and contrasts various conceptions of
theories and models. Lately Margaret Morrison (2007) has emphasized that despite
the centrality of models for the understanding of science, important roles remain
for theories, though also without offering an ‘‘official’’ codification of just what a
theory is.
Nancy Cartwright, Towfic Shomar, and Mauricio Suarez (1995) developed a
more instrumental view of theories, as providing tools for the construction of
models. At first blush it seems that our conceptions of theory and model must be
quite different, if on their view a model is something constructed and a theory a
tool. But in fact I think that a closer look reveals that our conceptions are entirely
compatible; what we concentrate on signals differences in interest and focus.
The sense in which a theory offers or presents us with a family of models—the
theoretical models—is just the sense in which a set of equations presents us with
the set of its own solutions. In many cases, no solutions to a given equation are
historically found or constructed for a very long time . . . though mathematically
speaking, they exist all along. When the equations formulate a scientific theory,
their solutions are the models of that theory. In this sense, Newtonian mechanics
had in its range of models already solutions to the three-body problem, though the
scientists following Newton, not being logically omniscient, could not see what
it was.3
As Suarez and Cartwright (forthcoming) discuss, the process of actually and
historically arriving at a working model can take several forms. Only rarely is
it a matter of simply instantiating the theory to some specific values of some
parameters. In some cases, emphasized especially in Ronald Giere’s elaboration of
the semantic view, the process is one of ‘‘de-idealization’’. That is, the scientist
wishing to construct a model of an actual situation or phenomenon begins with a
very idealized model, and then adds more factors characterizing the real situation
until a good enough representation of the phenomena is achieved. An example
would be to model a real pendulum starting with a model of one in which friction
is ignored, thus an ideal case, and then add force terms that correspond to the
friction in the actual situation.
But there are also processes of model construction in which we see imaginative
leaps that are not like that—de-idealization is not at all the only tactic to be found
in model construction. For example, the construction of a model of superconduct-
ivity by the London brothers—a much discussed example, and central to those
critical discussions of the semantic view—involved a change from an analogy with
ferromagnets to an analogy with diamagnets.
Nevertheless, what is arrived at is a model of the background theory. If the ideal
pendulum is a Newtonian model, and it is de-idealized, then the result is still a
Newtonian model. For if it is not, then it is a counter-example to Newton’s theory,
311
and hence not a de-idealization. Historically this could happen as well—with the
resulting model not being a de-idealization and not a solution to the problem
as originally defined—and sometimes famously does. For example, there seemed
after all to be no way to arrive at such a de-idealization that matched the actual
advance in the perihelion of Mercury. But the arrival of such an anomaly signals
revolutionary change.
On the other hand, if the imaginative leap succeeds in solving the originally posed
problem, where de-idealization guided by an earlier analogy did not work, then
again the success consists in arriving at a model of the situation that is not a counter-
example to the background theory. Thus, as Suarez and Cartwright also emphasize,
the London brothers’ model of superconductivity is still a model of Maxwell’s
electrodynamics, the background theory within which they were working.
So what accounts for the apparent tension in our two very different discussions
of modeling and representing? We find the answer in the precise complaint that
Suarez and Cartwright voice. To display the differences, they give their own
‘‘structural account’’ of the superconductivity story, which is quite like that in
various versions of the semantic approach and then write:
this framework can account for the piecemeal borrowing characteristic of the practice of
modeling precisely because it leaves out of the description everything that is of interest for
our thesis in ‘‘The Toolbox’’, namely the actual reasons scientists advance for building a
new model. The framework leaves out precisely the Londons’ reasoning in deriving the
London model out of the acceleration equation theory—which is what we have been
claiming is crucial to assess the theory-driven view. Hence we see that an appropriate
structural characterisation of this practice can at best describe correctly the products of the
modeling practice (i.e. the models themselves); while necessarily leaving out the intellectual
processes that lead to those models. (loc. cit.)
Quite right! Structural relations among models form a subject far removed from
the intellectual processes that lead to those models in the actual course of scientific
practice. So what is important depends on what is of one’s interest. Accordingly,
if one’s interest is not in those structural relations but in the intellectual processes
that lead to those models, then the semantic view of theories is nowhere near
enough to pursue one’s interests.
Both interests are important, it seems to me; leaving out either one we would
leave much of science un-understood. But the basic concepts of theory and
model—as opposed to the historical and intellectual dealing with formulations of
theories, or on the contrary, construction of models—do not seem to me very
different in the two approaches to understanding science.
In Part II I take up this divergence in interests with respect to measurement,
which is also studied in two very different—not incompatible but complement-
ary—ways: ‘from above’ and ‘from within’. That is a specific, and in some ways
simpler, instance of the same issue, since measurement too is representation.
APPENDIX TO CHAPTER 6
discrete but continuous, they have no eigenstates. Still the somewhat weaker
requirement:
If E is an interval of possible values of sharp observable A then there is for
each positive number δ < 1 a state s such that the conditional probability
PAs (E) > δ
holds for them as well. Their incompatibility means now of course that this will
not be the case for both in the same states for the same probabilities—intuitively,
δ will have to decrease for one if it increases for the other.
Here we recall the notions of occlusion and ‘‘explicitly non-committal’’ pic-
turing. The extent to which measurement of one observable can be revealing,
however indirectly, will limit the extent to which it can be revealing about certain
other observables. (The analogy is fragile, however: if the unmeasured observable
has no value, then its value is not hidden from sight!6 ) That applies to both discrete
and continuous sharp observables.
But what is then actually, really measured in a measurement of position or
momentum? It is entirely unrealistic to think that the outcome could be a real
number clearly distinct from any nearby numbers, or that in a series of measure-
ments there could be a clear distinction between a frequency of 1 and of δ for any
and all δ < 1. So either the outcomes of real position or momentum measurements
should be reconceived, or we should reconceive just what observables are being
measured. It is the latter option that arrives with the generalized observables
allowing for ‘unsharpness’.
I will explain the basic idea here, but for the simple case of an observ-
able whose possible values form a finite set RA = {v1 , . . . , vN }. The func-
tion PAs assigns a probability to each of these values (the probability that it
would be found if A were measured on an object in state s). But now sup-
pose that these values as not being sharply distinguishable in measurement, so
that if a measurement outcome indicates value v3 , for example, we should
conclude that we have ‘really’ found values v2 , v3 , or v4 , with probabilit-
ies p2 , p3 , or p4 . Or, to be more operational in the gloss we give it: if
the outcome indicates value v3 , for example, we expect that immediately
repeated measurements would find v2 , v3 , or v4 , with probabilities p2 , p3 , or
p4 .
Let me put this a bit more generally. The function p, which is thus associated
with value v3 , has brothers, so to speak, similarly associated with each of the other
values in RA. Let’s therefore write, not simply p, but p[v3 ]. This association is a
confidence mapping. Then we can think of identifying another observable B such
that PBs assigns to v3 the probability PAs (v2 ) + PAs (v3 ) + PAs (v4 ), and similarly for the
other values and associated confidence mappings. Then B is called a fuzzy version
of A.7
For B to be a fuzzy version of A, the indicated relationship has to hold of
course for each of the values in RA and their associated smearings out by a particular
314
confidence mapping; and this has to hold for all states s. This may not make sense,
intuitively. One way to think of it is to regard B as having values that are not
numbers but probability distributions over numbers (possible values of A)—such
values are sometimes called ‘‘fuzzy sets’’.8 The fuzzy versions of sharp observables
are in general not sharp, and in quantum theory are represented by a different sort
of operator.
A theorem on this subject establishes that in any standard measurement of an
observable, in the way that was described above, with the Criterion for a Physical
Correlate of Measurement being satisfied, what is really measured is always a fuzzy
version of an observable.9 To see this, the Criterion for the Physical Correlate
of Measurement is applied in reverse, as it were. Imagine that the specifications
of a measurement set-up, including the pointer observable B and the unitary
operator that governs the measurement interaction, are given. Now ask: what is
the observable A such that the equation
B
Pfin (E) = Pinit
A
(E)
is satisfied, for all initial states init? The found answer is that observable A,
so identified, is then a fuzzy version of some sharp observable, and the two
are the same only if A is discrete. In the case of continuous observables
such as position and momentum, that fuzzy version can not be the same as
the ‘original’, that is, as the observable that is putatively the target of the
measurement.10
APPENDIX TO CHAPTER 7
One way to report the findings in the experiments above is to note that in
no surface model so obtained do Lxa and Rxa both receive the score T when
they can receive an informative score at all (i.e., when the preconditions Lx
and Rx obtain). We then call Lxa and Rxa orthogonal.
If we keep fixed this generalization about all obtainable surface models, then
Rxl must receive score T when Lx0 does (modulo probability zero), again
when the informative scoring conditions obtain; and we call that implication.
The latter is a partial ordering, naturally symbolized as ‘‘≤’’. So what we
found above, the family of propositions that register the possible experimental
outcomes, is a partially ordered set with an orthogonality relation.
Reflection on this form of representation leads to assertions of the form: ‘‘all
data models can take the form . . . ’’ and there are various proposals for what
this mandatory general form must be. It is important to note that structures
taking this form may not in general be ‘classical’. How can we view the measuring
procedure, thus conceived, as locating the system investigated in a logical space?
.
A. R. Marlow (1978, 1980) used the concept of dual poset: a partially ordered
set with zero element and a relation of orthogonality, plus a single operation,
duality, symbolized as *. This operation has the properties that x** = x*,
316
Science aims to give us theories which are empirically adequate; and acceptance of a
theory involves as belief only that it is empirically adequate. This is the statement of
the anti-realist position I advocate; I shall call it constructive empiricism. (1980: 10)
What was meant by ‘‘empirically adequate’’ was explained first of all for what
had been the mainstay of modern physics until, arguably, the twentieth century:
theories in which no probabilities occur. As main example I took Newton’s
mechanics and theory of gravity, as depicted in the semantic approach. I will here
quote the representative passage, but let’s remark at once that I did not at the time
distinguish clearly the observable phenomena from the appearances:
The ‘apparent motions’ form relational structures defined by measuring relative distances,
time intervals, and angles of separation. For brevity, let us call these relational structures
appearances. In the mathematical model provided by Newton’s theory, bodies are located in
Absolute Space, in which they have real or absolute motions. But within these models we
can define structures that are meant to be exact reflections of those appearances, and are, as
Newton says, identifiable as differences between true motions. These structures, defined in
terms of the relevant relations between absolute locations and absolute times, which are the
appropriate parts of Newton’s models, I shall call motions, borrowing Simon’s term. (Later I
shall use the more general term empirical substructures.)
When Newton claims empirical adequacy for his theory, he is claiming that his theory
has some model such that all actual appearances are identifiable with (isomorphic to) motions in
that model. (This refers of course to all actual appearances throughout the history of the
universe, and whether in fact observed or not.) (Ibid.: 45)
318
Indeed, Newton’s ‘apparent motions’ are relative motions, and they are precisely
as they can be determined to be by measuring relative distances, time intervals,
and angles of separation. So although we can draw the distinction between
phenomenon and appearance at the conceptual level, that distinction does not in
this case make a difference in practice.
What then of the case where the theory involves probabilities? Thinking of the
chi-squared tables and the like whereby statisticians assess the ‘fit’ of probabilistic
hypotheses to actual frequencies, I thought that we had to deal there with only
a small and manageable extension of the same relationship. This ‘fit’ extended
‘embed’, I thought. I proceeded to outline a ‘modal frequency interpretation’
of probability which related theoretical models to families of frequencies, so as
to show how and where the ‘fitting’ might be located as a relation between
phenomena and theoretical model. The failure of this approach was brought
home to me by the writings of Joseph Hanna (1983, 1984) and correspondence
with Zeno Swijtink. During the next decade in Princeton I came to abandon
any objective notion of probability, and to move toward something like Richard
Jeffrey’s radical probabilism in epistemology. ‘‘Belief ’’ was no longer the operative
term, but ‘‘opinion’’, the latter conceived of largely (though not entirely) in terms
of subjective probability. But now, how to understand probability in physics?
My understanding of the formal character of physical theory did not change:
a physical theory may have in it irreducible probability, which is truly the
new modality of science. What had to change rather was now to conceive of
the epistemic attitude of acceptance of a theory as empirically adequate. The new
understanding I proposed of how we relate epistemically to the probabilities
offered to us by a physical theory appeared then in Laws and Symmetry.
Belief must be replaced by the nuances of gradated opinion, modeled as personal probability.
The question becomes then: if we accept a theory, how do the probabilities it offers
guide our personal expectation? The answer I shall now begin to elaborate is: in the form of
Miller’s Principle. That is what constitutes acceptance. For someone who totally believes the
theory, that guidance will involve all theoretical probabilities. For the scientist qua scientist
(as described by empiricism) only the theory’s probabilities for observable phenomena will
play this guiding role. ‘Objective chance’ is in either case the honorary epithet we give to
the probabilities in theories we accept. (1989: 194)
To formalize this, I took my cue from Chaim Gaifman’s introduction of the idea
of expertise as guiding clue:
If P is my personal probability function, then q is an expert function for me concerning family F
of propositions exactly if P(A | q(A) = x) = x for all propositions A in family F. (Ibid: 198)
(This relationship is generalized in various ways for less simple cases.) The point is
that an expert is someone or something that guides and constrains one’s opinion
in a certain range. Thus, for example, to say that one accepts contemporary physics
without qualification, can now mean that one takes this physics as expert on
319
which correspond to projection valued measures, while the latter are represented
by a larger class of operators corresponding to a normalized positive operator
measure. For details see e.g. Busch and Lahti 1996; Bush, Lahti, and Mittelstaedt
1966; and more recently Dalla Chiara, Giuntini, and Greechie 2004.
5 We must be careful in how we understand this. Neither operational incom-
insightful [1984], which develops a related way to think about the quantum theory.
Bibliography
Addison, J. W., L. Henkin, and A. Tarski (eds.) [1965] The Theory of Models.
Amsterdam: North-Holland.
Aerts, D. (ed.) [1999] Einstein Meets Magritte: The White Book—An Interdisciplinary
Reflection. Dordrecht: Kluwer Academic Publishers.
Alberti, Leon Battista [1991] On Painting. Tr. C. Grayson, with an Introduction
and Notes by Martin Kemp. London: Penguin Books.
Alpers, Svetlana [1983] The Art of Describing: Dutch Art in the Seventeenth Century.
Chicago: University of Chicago Press.
Anderson, Ronald [1993] ‘‘The Referees’ Assessment of Faraday’s Electromagnetic
Induction Paper of 1831’’. Notes and Records of the Royal Society of London 47:
243–56.
Armstrong, David M. [1993] A Materialist Theory of Mind. London: Routledge &
Kegan Paul, 1968; revised edition with new preface, New York: Routledge.
and Norman Malcolm [1984] Consciousness and Causality. Oxford: Blackwell.
Asquith, Peter D. and Philip Kitcher (eds.) [1984] PSA 1984., vol. i. East Lansing,
MI: Philosophy of Science Association.
Bacon, Francis [1994] Novum Organum, With Other Parts of the Great Instauration.
Tr. and ed. P. Urbach and J. Gibson. Chicago: Open Court, 1994.
Baigrie, B. [1996] Picturing Knowledge: Historical and Philosophical Problems Concerning
the Use of Art in Science. Toronto: University of Toronto Press.
Baird, Davis [2003] ‘‘Thing knowledge: Outline of a materialist theory of know-
ledge’’, pp. 39–67 in Radder 2003.
[2004] Thing Knowledge: A Philosophy of Scientific Instruments. Berkeley:
University of California Press.
Barbour, Julian B. [2001] The Discovery of Dynamics: A Study from a Machian Point of
View of the Discovery and the Structure of Dynamical Theories. New York: Oxford
University Press.
323
Dijksterhuis, Fokko Jan [2004] Lenses and Waves: Christiaan Huygens and the
Mathematical Science of Optics in the Seventeenth Century. Dordrecht: Springer.
Drake, Stillman [1957] Discoveries and Opinions of Galileo. New York: Doubleday
& Co.
Dretske, F. [1981] Knowledge and the Flow of Information. Cambridge, Mass.: MIT
Press.
[1991] Naturalizing the Mind . Cambridge, Mass.: MIT Press.
Ducheyne, Steffen [2005] ‘‘Lessons from Galileo: the Pragmatic Model of Shared
Characteristics of Scientific Representation’’, Philosophia Naturalis 42: 213–34.
Duhem, Pierre [1962] The Aim and Structure of Physical Theory. Tr. Philip P. Wiener
of La Théorie Physique, son Objet, sa Structure. Paris: Marcel Rivière & Cie, 1914.
New York: Atheneum.
Eco, Umberto et al. [1992] Interpretation and Overinterpretation. Cambridge: Cam-
bridge University Press.
Edgerton, Samuel Y. [1975] Renaissance Rediscovery of Linear Perspective. New York:
Basic Books.
[1991] The Heritage of Giotto’s Geometry—Art and Science on the Eve of the
Scientific Revolution. Ithaca: Cornell University Press.
Einstein, Albert [1905/2005] ‘‘On the electrodynamics of moving bodies’’, pp.
123–60 in J. Stachel 2005.
[1988] ‘‘Remarks to the Essays Appearing in this Collective Volume’’, pp.
663–88 in Schilpp 1988.
[2004a] ‘‘Geometry and Experience’’, pp. 12–48 in Einstein 2004.
[2004b] Sidelights on Relativity. Whitefish, Mont.: Kessinger Publishing.
and L. Infeld [1938] The Evolution of Physics. New York: Simon and Schuster.
Elgin, Catherine Z [1996] Considered Judgment. Princeton, N.J.: Princeton Univer-
sity Press.
[2006] ‘‘Exemplification, Idealization, and Understanding’’, ms.
Ellis, Brian [1966] Basic Concepts of Measurement. Cambridge: Cambridge University
Press.
Falmagne, J.-Cl. and L. Narens, [1983] ‘‘Scales and Meaningfulness of Quantitative
Laws’’, Synthese, 54: 287–325.
Feyerabend, Paul [1958] ‘‘Reichenbach’s Interpretation of Quantum Mechanics’’,
Philosophical Studies 9: 49–59; reprinted pp. 109-22 in Hooker 1975.
[2001] Conquest of Abundance: A Tale of Abstraction versus the Richness of Being.
Rep. edn. Chicago: University Of Chicago Press.
Files, Craig [1996] ‘‘Goodman’s rejection of resemblance’’, British Journal of
Aesthetics 36: 398–412.
328
Fine, Arthur [1986] The Shaky Game : Einstein, Realism and the Quantum Theory.
Chicago: University of Chicago Press.
[1998] ‘‘The Viewpoint of No-One in Particular’’, Proceedings of the American
Philosophical Association 72: 9-20.
Fourier, J. B. J. [1952] Analytical Theory of Heat, in Great Books of the Western
World., 45: 161–251, Chicago: Encyclopaedia Britannica.
Freeland, Cynthia [2007] ‘‘Portraits in painting and photography’’, Philosophical
Studies 135: 95–109.
Freeland, G. and A. Corones, (eds.) [1999] 1543 and All That: Image and Word,
Change and Continuity in the Proto-Scientific Revolution. Dordrecht: Springer.
Freistadt, H.[1957] ‘‘The Causal Formulation of Quantum Mechanics of Particles,’’
Nuovo Cimento, Suppl. to vol. 5: 1–70.
French, P. et al. (eds.) [1979] Midwest Studies in Philosophy, vol. iv, Metaphysics,
Minneapolis: University of Minnesota Press.
French, Steven [2003] ‘‘A Model-Theoretic Account of Representation (Or, I
Don’t Know Much About Art . . . But I Know It Involves Isomorphism)’’,
Philosophy of Science 70: 1472–83.
Friedman, Michael [1987] ‘‘Carnap’s Aufbau Reconsidered’’, Noûs 21: 521–45,
repr. as ch. 5 of his 1999.
[2000] ‘‘Transcendental philosophy and a priori knowledge: a neo-Kantian
perspective’’, pp. 367–83 in Boghossian and Peacocke 2000.
[2002] ‘‘Physics, Philosophy, and the Foundations of Geometry’’, Diálogos
79: 121–42.
Frigg, Roman [2006] ‘‘Scientific Representation and the Semantic View of
Theories’’, Theoria 21: 49–65.
Fuchs, Christopher A. [1997] ‘‘Information gain vs. state disturbance in quantum
theory’’, Fortschr. Phys, 46: 535–65. Repr. pp. 229–59 in Braunstein 1998.
[2002] ‘‘Quantum Mechanics as Quantum Information (and only a little
more)’’, arXiv:quant-ph/0205039 v1 8 May.
Galilei, Galileo [1957] The Assayer. Eng. tr. in Drake 1957.
[1964–1966] Le Opere. Firenze: G. Barberi.
[1974] Two New Sciences. Tr. Stillman Drake. Milwaukee: University of
Wisconsin Press.
[1994] Lettera a Fortunio Liceti, gennaio 1641, Opere, XVIII; Engl. tr. in
Crombie 1994, vol. i: 585.
[2005] Discourse on Bodies in Water. Tr. Thomas Salusbury; with an introduc-
tion by Stilman Drake. New York: Dover Publications.
Galison, P. and A. Assmus [1989] ‘‘Artificial clouds, real particles’’, pp. 224–74 in
Gooding, Pinch, and Schaffer 1989.
329
Georgalis, Nicholas [2005] The Primacy of the Subjective: Foundations for a Unified
Theory of Mind and Language. Cambridge, Mass.: MIT Press.
George, A. (ed.) [1953] Louis de Brolie physician et penseur. Paris: Albin Michel.
Ghins, Michel [1992] ‘‘Scientific realism and invariance’’, Philosophical Issues, 2:
249–62.
[1998] ‘‘Van Fraassen’s constructive empiricism, symmetry requirements and
scientific realism’’, Logique et analyse 164: 327–42.
Giere, R. [1985] ‘‘Constructive realism’’, pp. 75–98 in Churchland and Hooker
1985.
[1988] Explaining Science. Chicago: University of Chicago Press.
[1988] Explaining Science. Chicago: University of Chicago Press.
[2006] Scientific Perspectivism. Chicago: University of Chicago Press.
[1996] ‘‘Visual models and scientific judgement’’, pp. 269–302 in Baigrie
1996.
[1999] Science Without Laws. Chicago: University of Chicago Press.
Glymour, Clark [1999] ‘‘A mind is a terrible thing to waste’’, review of Kim 1998.
Philosophy of Science 66: 455–71.
Gombrich, Ernst [1960] Art and illusion; a study in the psychology of pictorial
representation. New York: Pantheon Books.
Good, Irving John, Alan James Mayne, and John Maynard Smith (eds.) [1961] The
Scientist Speculates. London: Heinemann.
Gooding, D., T. Pinch, and S. Schaffer [1989] The Uses of Experiment: Studies in
the Natural Sciences. Cambridge: Cambridge University Press.
Goodman, Nelson [1968] Languages of Art: An Approach to a Theory of Symbols. 2nd
edn. Indianapolis: Hackett Publishing Company, 1976.
[1987–8] ‘‘On what should not be said about representation’’, Journal of
Aesthetics and Art Criticism 46: 419.
and Catherine Z. Elgin [1988] Reconceptions in Philosophy and Other Arts and
Sciences. London: Routledge.
Gopnik, Blake [1995] Pictorial Mimesis in Cinquecento Italy, 1500–1568: Texts, Visual
Rhetorics, and a Roman Test-Case. A doctoral thesis for the University of Oxford,
Wolfson College.
Graßhoff, Gerd, Samuel Portmann, and Adrian Wüthrich [2005] ‘‘Minimal
Assumption Derivation of a Bell-type Inequality’’, British Journal for the Philosophy
of Science 56: 663–80.
Greenstein, George and Arthur G. Zajonc [1997] The Quantum Challenge: Modern
Research on the Foundations of Quantum. Sudbury, Mass.: Jones & Bartlett
Publishers.
330
Gregory, R. L. (ed.) [1987] The Oxford Companion to the Mind. Oxford: Oxford
University Press.
Grünbaum, Adolf [1952] ‘‘Some Highlights of Modern Cosmology and Cos-
mogony’’, The Review of Metaphysics 3: 489–91.
[1954] ‘‘E. A. Milne’s Scales of Time’’, The British Journal for the Philosophy of
Science 4: 329–31.
[1957] ‘‘Complementarity in Quantum Physics and its Philosophical Gener-
alization’’, Journal of Philosophy 54: 713–27.
[1968] Geometry and Chronometry in Philosophical Perspective. Minneapolis:
University of Minnesota Press.
Hacking, Ian [1981] ‘‘Do we see through a microscope?’’, Pacific Philosophical
Quarterly 62, pp. 305–22; repr. pp. 132–52 in Churchland and Hooker 1985.
[1983] Representing and Intervening: Introductory Topics in the Philosophy of
Natural Science. Cambridge: Cambridge University Press.
[2006] Another New World is Being Constructed Right Now: the Ultracold.
Max-Planck-Institut für Wissenschaftsgeschichte, Preprint 316, 2006.
Hagen, Margaret A. [1986] Varieties of Realism: Geometries of Representational Art.
Cambridge: Cambridge University Press.
Hanna, J. F. [1983] ‘‘Empirical adequacy’’, Philosophy of Science 50: 1–34.
Hanna, J. F. [1984]; ‘‘On the empirical adequacy of composite statistical hypo-
theses’’, pp. 73–80 in Asquith and Kitcher 1984.
Harleman, Donald R. F. [1982] ‘‘Hydrothermal Analysis of Lakes and Reservoirs’’,
Journal of the Hydraulics Division 108: 301–25.
Harms, William F. [1998] ‘‘The Use of Information Theory in Epistemology’’,
Philosophy of Science 65: 472–501.
Harré, Rom ‘‘The materiality of instruments in a metaphysics for experiments’’,
pp. 19–38 in Radder 2003.
Heidelberger, Michael [2003] ‘‘Theory-Ladenness and Scientific Instruments in
Experimentation’’, pp. 138–151 in Radder 2003.
Heinonen, Teikko, Pekka Lahti, and Kari Ylinen [2004] ‘‘Covariant fuzzy observ-
ables and coarse-graining’’, Reports on Mathematical Physics 53: 425–41.
Heinonen, Teiko [2005] Imprecise Measurements in Quantum Mechanics. Turkey:
Annales Universitatis Turkuensis.
Heisenberg, Werner [1945] ‘‘Zur Geschichte der physikalischen Naturerklärung’’,
Wandlungen in den Grundlagen der Naturwissenschaft. 6. Aufl. Leibzig: S. Hirzel.
Helmholtz, Hermann von [1956] ‘‘Ueber den Ursprung und die Bedeutung der
geometrischen Axiome’’. Tr. J. R. Newman, ‘‘On the origin and significance
of geometrical axioms’’, pp. 647–68 in Newman, James 1956.
331
Jourdain, Philip [1918] The Philosophy of Mr. B*rtr*nd R*ss*ll. London: George
Allen & Unwin.
Kant, Immanuel [1848] Critique of Pure Reason. Tr. Francis Haywood. London :
William Pickesing.
[1850] Critique of Pure Reason. Tr. J. M. D. Meiklejohn. London: Henry
G. Bohn.
[1983a] ‘‘Von dem ersten Grunde des Unterschiedes der Gegenden im
Raume’’, pp. 991–1000 in Kant 1983b.
[1983b] Werke in Zehn Baenden, (her. W. Weischedel), vol. 2: Vorkritische
Schriften bis 1768, Zweiter Teil. Darmstadt: Wissenschaftliche Buchgesellschaft.
[1992] ‘‘Concerning the ultimate ground of the differentiation of directions
in space’’, tr. of Kant 1983a in Walford 1992.
Kelly, Sean Dorrance [1998] ‘‘What Makes Perceptual Content Non-
Conceptual?’’, Electronic Journal of Analytic Philosophy, Issue 6.
Kemp, Martin [1990] The Science of Art: Optical themes in Western art from Brunelleschi
to Seurat. New Haven: Yale University Press.
Kim, Jaegwon [1998] Mind in a Physical World. Cambridge, Mass.: MIT Press.
Klein, Carsten [2003] ‘‘Coordination And Convention In Hans Reichenbach’s
Philosophy Of Space’’, pp. 109–20 in Stadler 2003.
Körner, S. (ed.) [1962] Observation and Interpretation in the Philosophy of Physics: a
symposium of philosophers and physicists. New York: Academic Press, 1957. Repr.
New York: Dover.
Koslow, A. [1968] ‘‘Mach’s Concept Of Mass: Program And Definition’’, Synthese
18: 216–33.
Koyré, Alexandre [1965a] ‘‘Huygens and Leibniz on Universal Attraction’’, pp.
115–38 in his 1965b.
[1965b] Newtonian Studies. Chicago: University of Chicago Press.
Kroes, Peter [2000] ‘‘Engineering Design and the Empirical Turn in the Philosophy
of Technology’’, pp. 19–43 in Kroes and A. Meijers 2000.
[2003] ‘‘Physics, Experiments, and the Concept of Nature’’, pp. 68–86 in
Radder 2003.
and A. Meijers (ed.) [2000] The Empirical Turn in the Philosophy of Technology.
Amsterdam: Elsevier Science Ltd.
Kubovy, Michael [1986] The Psychology of Perspective and Renaissance Art. Cambridge:
Cambridge University Press.
Kulvicki, John ‘‘Any Way You Slice It: The Viewpoint Independence of Pictorial
Content’’, with comments by Dilworth, on http://www.interdisciplines.org/
artcognition/papers/9/version/original
333
Ladyman, James [1998a] Structural Realism and the Model-Theoretic Approach to Physical
Theories. Diss., University of Leeds.
[1998b] ‘‘Structural realism: epistemology or metaphysics?’’, Studies in the
History and Philosophy of Science, 29A: 409–24.
Lange, Marc [forthcoming] ‘‘Dimensional Explanation’’.
Latour, Bruno [1999a] ‘‘Circulating Reference: Sampling the Soil in the Amazon
Forest’’, pp. 24–79 in his 1999b.
[1999b] Pandora’s Hope: Essays on the Reality of Science Studies. Cambridge,
Mass.: Harvard University Press.
Laudisa, Federico and Carlo Rovelli, ‘‘Relational Quantum Mechanics’’, in
the Stanford Encyclopedia of Philosophy, http://plato.stanford.edu/entries/qm-
relational/.
Leeds, Stephen [1978] ‘‘Theories of Reference and of Truth,’’ Erkenntnis 13:
111–29.
[1984] ‘‘Chance, Realism, Quantum Mechanics’’, Journal of Philosophy 81:
567–78.
[1994] ‘‘Constructive Empiricism,’’ Synthese 101: 187–221.
[1995] ‘‘Truth, Correspondence, and Success,’’ Philosophical Studies 79: 1–36.
[forthcoming] ‘‘Correspondence Truth and Scientific Realism’’ (forthcoming
in Synthese).
and R. Healey [1996] ‘‘A Note on van Fraassen’s Modal Interpretation of
Quantum Mechanics’’, Philosophy of Science 63: 91–104.
Leibniz, G. W. [1989] Philosophical Essays. Tr. Roger Ariew and Daniel Garber.
Indianapolis: Hacket Publishing Co.
Leplin, Jared [1997] A Novel Defense of Scientific Realism. New York: Oxford
University Press.
Leroux, Jean [2001] ‘‘‘Picture theories’ as forerunners of the semantic approach
to scientific theories,’’ International Studies in the Philosophy of Science, 15:
189–97.
Lewis, David K. [1970] ‘‘How to define theoretical terms’’, Journal of Philosophy
67: 427–46; repr. as chapter six of his Philosophical Papers (Oxford, 1986).
[1979] ‘‘Attitudes de dicto and de se’’, Philosophical Review 88: 513–43.
[1984] ‘‘Putnam’s Paradox’’, Australasian Journal of Philosophy 62: 221–36.
[2001] ‘‘Forget about the ‘correspondence theory of truth’’, Analysis 61:
275–80.
Lipton, Peter [2004] Inference to the Best Explanation. 2nd edn. London: Routledge.
Lloyd, Elizabeth [1986] ‘‘Thinking about Models in Evolutionary Theory’’, Philo-
sophica 37:87–100.
[1987] ‘‘Confirmation of Ecological and Evolutionary Models’’, Biology and
Philosophy 2: 277–93.
334
Reichenbach, Hans [1965] The Theory of Relativity and A Priori Knowledge. Tr.
Maria Reichenbach. Berkeley: University of California Press.
[1979] Gesammelte Werke. A. Kamlah and M. Reichenbach (eds.), Braunsch-
weig: Vieweg.
[1991] The Direction of Time. Berkeley: University of California Press, 1956;
new edn. with foreword by Hilary Putnam, 1991.
Rosen, Gideon [1994] ‘‘Objectivity and Modern Idealism: What is the Question?’’,
pp. 277–319 in Michael and O’Leary-Hawthorne 1994.
Rothbart, Daniel [2003] ‘‘Designing instruments and the design of nature’’, pp.
236–54 in Radder 2003.
Rothschild, Daniel ‘‘Structuralism and reference’’, http://www.columbia.edu/
∼dhr2107/structureshort.pdf.
Rovelli, Carlo [1996] ‘‘Relational Quantum Mechanics’’, International Journal of
Theoretical Physics, 35: 1637–78.
Roxburgh, Ian W. [1977] ‘‘The Conventionality of Uniform Time’’, The British
Journal for the Philosophy of Science 28: 172–7.
Russell, Bertrand [1899] Sur les axiomes de la géométrie , Revue de la
métaphysique et de morale, 7: 684–707.
[1901] ‘‘Is position in time and space absolute or relative?’’, Mind 10:
293–317.
[1927]: The Analysis of Matter, London: Allen & Unwin.
[1959]: Problems of Philosophy. Oxford: Oxford University Press.
[1993] Our Knowledge of the External World: As a Field for Scientific Method in
Philosophy. London: Routledge.
[1996] An Essay on the Geometry. Foundations of London: Routledge.
Ryckman, Thomas [2005] The Reign of Relativity. Oxford: Oxford University
Press.
Schilpp, Paul Arthur [1988] Albert Einstein: Philosopher-Scientist. 3rd rev. edn.,
Chicago, IL.: Open Court Publishing Company.
Schrödinger, Erwin [1928] ‘‘Neue Wege in der Physik’’, Elektrische Nachrichtentech-
nik, 5: 485–8.
[1929] ‘‘Neue Wege in der Physik’’, Elektrotechnische Zeitschrift 50: 15–16.
[1951] Science and Humanism: Physics in Our Time. Cambridge: Cambridge
University Press.
[1953] ‘‘The meaning of wave mechanics’’, pp. 16–30 in André George (ed.)
Louis de Brolie physician et penseur. Paris : Albin Michel.
Schwartz, Robert [1980] ‘‘Imagery—There’s More to It than Meets the Eye’’,
PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association, vol.
2: 285–301.
339
Introduction
1 As I shall characterize representation below, ‘‘mental representation’’ is an
oxymoron.
2 Boltzmann’s way of thinking about this, and especially the various pragmatic
views here, I want to thank the editors of two recent volumes to allow me my say:
Andreas Berg-Hildebrand and Christian Suhm 2006, and Bradley Monton 2007.
Part I: Representation
1 Suarez 2004, 770. This appears to be a change of mind from his earlier ‘‘I
take it that a substantive theory of scientific representation ought to provide us
with necessary and sufficient conditions for a source to represent a target’’ (Suarez
2003: 226). While Suarez 2004 contrasts his view to such approaches as R. I. G.
Hughes’s influential (1997), his own ‘‘inferential conception’’ still came very close
to providing just the sort of theory that he dismisses (in the passage I quoted) as
not to the point. See further Ducheyne (2005) which emphasizes that a model
represents for a person if that person accepts certain things. While not wishing
either to propose rivals to their accounts or to adjudicate between them, I have
learned much from these, as well as from such other recent writings as French
2003.
2 For the etymology, and for what became the stable concept in art history, see
4 See for instance the photo of Mars made through the Hubble tele-
scope, at http://www.jpl.nasa.gov/news/features.cfm?feature=533. Ian Hacking
has recently studied the use of such terms in the history of science, and has emphas-
ized that the term ‘‘phenomenon’’ typically denotes things classed as remarkable,
unusual, or amazing. What amazes are often individual occurrences but also often
processes with many instances. Some of this appears in Hacking 2006 (see p. 32,
section C1 ‘‘The creation of phenomena’’) issued in connection with his Carl
Friedrich von Weizsäcker-Vorlesungen, Hamburg 2007.
sional discourse.
11 Goodman 1976: 26. This phrasing derives from his constant attempt to
later we may have to look seriously at such examples as ‘‘X represents a man
holding a candle/ Santa Claus delivering presents/phlogiston as escaping rapidly’’
and the like.
17 ‘‘Once we take on board the distinction between bare bones and fleshed
such as the purpose for which the representation is made or which it is made
to serve. That is especially relevant for scientific representation; see e.g. Giere
2006: 60.
348 :
19 See further sections 3 and 4 of Schwartz 1980. That what makes something
a representation is the fact that it is pressed into representational service by
representation users is also a theme emphasized in Paul Teller’s 2001a.
20 The standard reference here is Hagen 1986.
21 Compare Georgalis 2005: 128–9: ‘‘For any item r to represent a particular
item t, there must be a conscious agent s to whom r represents t. I call this the
fundamental fact of representation.’’ My statement goes beyond this, since I imply
that there is also a representing by (and not just to) an agent. However, there is a
limiting case, in which one and the same agent plays both roles, for example when
someone spontaneously takes an encountered natural object or event to represent
something.
22 cf. Georgalis 2005: 122: ‘‘Since there is no necessary connection between
such thing as essentially private representation any more than private language,
except in the sense in which private uses can exist as derived from or parasitic on
communal practices.
25 The example is essentially Putnam’s. As to the main point, the Eleatic
Stranger has a line on this: he has prefaced the list by saying that he regards the
natural ones as divine workmanship—so perhaps he thought of shadows as really
made in order to represent the objects casting them. What about things in dreams?
It’s a nice conceit, that paintings are dreams made for people who are awake—a
conceit echoed by the idea of films as dreams that money can buy, and indeed by
the film ‘‘Dreams That Money Can Buy’’. But as an example to show that there
are ‘representations in nature’ it begs the question of what dreams are.
26 This is part of an objection to the semantic approach. Suarez takes that
representations that do not represent real things, hence have nothing to denote.
His gloss on this will do for our purposes.
28
The term ‘‘relation’’ may in fact not be the most apt to explain intentionality.
As Benoist 2006 emphasizes, Brentano who defines the intentionality of mental
activity by the statement that it is characterized as such by ‘‘the relation to
something as object’’ drew a distinction between this case and a genuine relation
(which must be between real things), and says eventually that intentionality is not
a relation but something relation-like: kein Relatives, aber ein Relativliches.
29
In more linguistic terms, the context created by ‘‘represents’’ is ‘referentially
opaque’, for the premise that S and T are the same or apply to the same things
does not license the inference from ‘‘X represents Y as being S’’ to ‘‘X represents
Y as being T’’.
30 I use ‘‘intensional’’ for terms, expressions, forms of discourse, the criteria
being essentially those discussed under this heading by Quine: opacity to reference,
resistance to substitutivity of identicals, and so forth. See for instance The University
of Alberta Dictionary of Cognitive Science, ‘‘ ‘Intentional’ is not to be confused with
‘intensional’ spelled with an ‘s’, the latter of which refers to the meaning of a
term, (along with ‘extensional’)’’ (though that dictionary entry gives too narrow a
meaning to ‘‘intentional’’). For ‘‘intention’’ see Dennett and Haugeland 1987. For
a metaphysical approach, of the sort that I contest here, see Zalta 1988.
31 It is best and clearest to think of this as a point about language, displayed
on the word ‘‘of’’ alone. ‘‘His speech included a description of Mrs. Thatcher as
draconian’’ is not covered by the convention, it is still ambiguous in the usual way,
and the substitution of ‘‘the then Prime Minister’’ might or might not change the
truth-value of this sentence.
33
In aesthetics, discernment of that level of meaning is the subject of icono-
graphy—see Panovsky, 1955: 26–54.
3 For a detailed discussion of the use of visual imagery in the sciences, see
Wimsatt 1990.
4
This does come up in debates in metaphysics: must there be a basis in nature
for objective judgments of similarity, or does the similarity we privilege in our
judgments relate to our concerns, values, and practice? That will be a topic for
another time.
5 There are many paintings to which my description applies, but of course I
mean Manet’s. Thus Zola 1867: ‘‘Les peintres, surtout Edouard Manet, qui est
un peintre analyste, n’ont pas cette préoccupation du sujet qui tourmente la foule
avant tout; le sujet pour eux est un prétexte à peindre tandis que pour la foule le
sujet seul existe. Ainsi, assurément, la femme nue du Déjeuner sur l’herbe n’est là
que pour fournir à l’artiste l’occasion de peindre un peu de chair. Ce qu’il faut voir
dans le tableau, ce n’est pas un déjeuner sur l’herbe, c’est le paysage entier, avec ses
vigueurs et ses finesses, avec ses premiers plans si larges, si solides, et ses fonds d’une
délicatesse si légère; c’est cette chair ferme modelée à grands pans de lumière, ces
étoffes souples et fortes, et surtout cette délicieuse silhouette de femme en chemise
qui fait dans le fond, une adorable tache blanche au milieu des feuilles vertes, c’est
enfin cet ensemble vaste, plein d’air, ce coin de la nature rendu avec une simplicité
si juste, toute cette page admirable dans laquelle un artiste a mis tous les éléments
particuliers et rares qui étaient en lui.’’
6 Recall the earlier reference to Nelson Goodman’s discussion of this, his 1976.
7 Section 6.1 (pp. 112–17) of Lopes 1996.
8
Hyman 2000 and 1992; see further Derksen 2004.
9
Chapter 3 of Giere 2006; see specifically pp. 48–9.
10 Wittgenstein proposed the idea of cluster concepts to highlight classification
would be represented if each lighted square is n squares distant from the first lighted square
(counting by row, as on a calendar). The descriptionalist sees the line representation as a set
of sentences. If n = 7, the set of sentences could be: ‘1 is dark’, ‘2 is light’, ‘3 is dark’, . . . . ‘9
is light’, ‘10 is dark’, and so on . . . .
The pictorialist, on the other hand, sees the representation of a line . . . as like the matrix
itself, not the corresponding set of sentences.
But what does the distinction come to between the set of sentences and the matrix itself?
Aren’t they just different ways of inscribing the same representation with the same semantic
properties?
I say ‘‘No.’’ The key is the way the representations function. Consider how the descrip-
tionalist would rotate his line. A small counterclockwise rotation could be accomplished if
the first lighted square stayed lit, the next number of a lighted square was increased by 1,
the next by 2, the next by 3, and so on. In terms of the example, 2 would stay lit, 10 would
be lit instead of 9, 18 instead of 16, 26 instead of 23, etc. So the descriptionalist’s new set of
sentences would be ‘1 is dark’, ‘2 is light’, . . . ‘10 is light’, . . . ‘18 is light’, . . . ‘26 is light’,
and so on. The important point is that the computer’s ‘‘rotation’’ calculation just involves
the numbers, not the arrangement of the numbered squares. The matrix display is for us
and plays no role in what the computer does. From the point of view of the computer’s
calculations, the squares could as well be arranged in a line or a circle rather than in a matrix.
Real live computer graphics works this way. The machine manipulates numbers. The
programmers think of the numbers as numbers of cells in matrices, and they put in numbers
that correspond to matrices of visual interest. They program the computer to operate
on the numbers in ways that correspond to visually interesting or useful matrix changes.
These correspondences are what makes the computer’s number crunching graphical, but
the correspondences play no role at all in the number crunching itself. (1981: 53)
This is illuminating, though I don’t think it can stand without qualification either.
Note that the functioning in question is not what is meant when we say that a
certain array of numbers or dots functions as a representation. The functioning on
which Block focuses is relevant here, but must be distinguished: it concerns the
transformation of one symbol in a system of symbols into another symbol.
13
Thanks to Isabelle Peschard for this criticism.
14
This insight may be best honored by attention to global differences between
pictorial and other systems of representation. That is, in fact, how Goodman
approaches the matter in his Languages of Art. According to Goodman, as we have
seen, pictures denote and predicate. But what distinguishes pictorial systems from
other denotational systems (such as systems of verbal description) are features such
as denseness that make them more like analog systems, like diagrams and maps. Cf.
Goodman 1976: 194–8; Goodman and Elgin, 1988 ch 7. Pictorial symbol systems
are syntactically and semantically dense. That is, any two pictures, no matter how
similar, could be depicting different things or depicting the same things in different
ways. Cf. Goodman 1976: 226–7. That is not at all true of verbal description,
where small differences typically make no difference at all. An analog measuring
device is similar in that respect: two different pointer positions, no matter how
352 :
close together, indicate different values. In diagrams or maps too a small difference
in lines or positions can be significant. For example in maps made to scale the
merest movement of a border or contour line depicts a different possible situation
in the mapped region. Nevertheless, there is a distinction also between pictures
and diagrams or maps. Goodman takes that difference to be syntactic, that is, it
does not have to do with reference or attribution but solely with the composition.
The difference is only a matter of degree, however: pictorial symbol systems
are more replete. In a painting a much larger set of features—color, thickness,
intensity, contrast, etc.—is relevant to what it denotes and what it predicates than
in diagrams or even the most detailed topographical maps. This is a difference of
degree that I will not build into my terminology here, for it will not be pertinent
to distinctions I will need for different modes of scientific representation. So I will
describe diagrams, maps, etchings, and paintings all as pictures, and their use as
picturing.
15 Adam Elga has suggested an appropriate form of presentation: we can here
are as sharply defined as e.g. classical atoms. What is the shape of this beer glass?
Descartes’s answer (translated into our current terms) would have been ‘‘an analytic
function’’. Why do scientists and philosophers tend to be impatient with the point
that such mathematical modeling is unrealistic? The answer is presumably that
they have long since appreciated it, and prefer to focus on the practical usefulness
of the models—a reasonable attitude, even if we can’t afford it when we study
scientific representation as a subject in its own right. Below I will give an example
from classical optics to show how this sort of idealization can lead to paradox, and
thence to changes in theory.
17 See further my 1999.
18
I take this simple example from the history of geometric optics, on its way
to becoming physical optics in the modern sense. For a fascinating study of this
historical stage see Fokko Jan Dijksterhuis 2004.
19 Hero’s fundamental assumption about nature was not precisely true. It is true
under rather limited conditions; correct in the very simple case of reflection by a
plane mirror set in a homogeneous medium like ordinary room temperature air.
Later it was modified: first by suggesting that light takes the path that takes least
time, and later still further.
20
This simple result has far reaching consequences too. Consider the following
problem set by Aristotle: if a circular light set in the ceiling shines through a square
, , 353
hole in the floor, what is the shape of the light spot on the basement floor? (This
question is, as it were, a precursor to the discussion of the camera obscura constructed
by the Arab mathematician Alhazen in the tenth century.)
21 Willebrord Snel van Royen (Netherlands, early seventeenth century). It is
also called the Law of Sines because, in modern terminology, the ratio of the sines
of the angles of incidence and refraction equals the ratio of velocities in the two
media.
22 For animated demo see http://micro.magnet.fsu.edu/primer/java/refraction/
criticalangle/index.html
23
Teller 2001a; see further his 2008.
24 I am staying here with classical physics, which suffices to make the current
with a ruler and compass; only when such a construction can be achieved with
a continuous movement can we have a clear and distinct idea of the geometrical
solution. The exclusion of the infinite is a constant theme in Descartes; cf. Principles
I, Proposition 26. In attributing a contrary opinion to Pascal we don’t mean of
course that he had the conception developed in point set theory after 1900; though
he went very far in his rejection of the apparent paradoxes of the completed
infinite. Cf. Chevalley 1995.
27
Birkhoff and von Neumann 1936 point out that areas that differ by measure
zero are equivalent as far as anything instrumentally measurable is concerned. A
quotient construction is a reduction modulo some equivalence relation, that is,
replacement of elements by their equivalence classes. In the case of intervals, for
example, (0,1) and [0,1], (0,1], [0,1) all differ just by measure zero, and would be
treated as the same.
28 At that point we can either say or deny Birkhoff and von Neumann’s
contention that the calculations can go on as usual, but the shape is correctly
represented not by one region in geometric 3-space, but by an object in the
quotient construction that identifies regions modulo differences of measure zero. If
we go on to meta-mathematics we can extend this critique much further: anything
beyond finite, combinatorial mathematics is susceptible to multiple ‘‘unintended’’
and ‘‘non-standard’’ interpretations, introduced by pointing to very different sorts
of mathematical structures.
29
We may be reminded here of Solovay’s 1965 ‘‘2ˆaleph0 can be anything
it ought to be’’. The foundations of mathematics galloped away with its own
subject, shanghaied it, abducted it (to hear some mathematicians’ complaints about
354 :
well how easily and naturally he proposes experiments to be done on a small scale
so as to test hypotheses that pertain as well to much larger bodies: ‘‘The third
difficulty in the doctrine of Archimedes was, that he could not render a reason
whence it arose that a piece of Wood, and a Vessel of Wood, which otherwise
floats, goes to the bottom, if filled with Water. Signor Buonamico hath supposed
that a Vessel of Wood, and of Wood that by nature swims, . . . goes to the bottom
if it be filled with water . . . but I . . . dare in defense of Archimedes deny this
experiment, being certain that piece of Wood which by its nature sinks not in
Water, shall not sinke though it be turned and converted into the forme of any
Vessel whatsoever, and then filled with Water: and he that would readily see the
Experiment in some other tractable Matter, and that is easily reduced into several
Figures, may take pure Wax, and making it first into a Ball or other solid Figure,
let him adde to it so much Lead as shall just carry it to the bottome, so that being
a graine less it could not be able to sinke it, and making it afterwards into the
form of a Dish, and filling it with Water, he shall finde that without the said Lead
it shall not sinke, and that with the Lead, it shall descend with much slowness;
& in short he shall satisfie himself, that the Water included makes no alteration.’’
(Galileo 2005: 21)
, , 355
of transformation groups see Barenblatt 2003: 94–6. The seminal paper relating
dimensional invariance to more general notions concerning symmetry, with special
reference to measurement scales, is Luce 1978.
42
The law is that force is proportional to mass times acceleration; the more usual
specific form F=ma does depend on choosing the units such that one unit of force
gives a unit mass unit acceleration.
43 For a precise exposition, see Barenblatt 2003: section 1.2.2. This requirement
the end of the eighteenth century, see the introduction in Homann 1991, which
is a translation, with an introduction and notes, of Hugh of St. Victor’s Practica
Geometrica, dated circa 1120 .
4
The history of astronomical measurement as aid in navigation is part of the
history of Perspectiva. Ptolemy’s planisphere is a geometric construction in which
the eye is located at the South Pole, and instead of an intersecting plane just the
Equator—an intersecting line—is used in the projection. The astrolabe improved
on this by adding lines of latitude and longitude drawn from a second point
of view, for an observer located at a certain latitude. This involved therefore a
projection of the celestial sphere on the Equatorial plane. Hence astrolabes could
be used to show how the sky looks at a specific place at a given time. Typical uses
of the astrolabe include finding the time during the day or night, finding the time
of a celestial event such as sunrise or sunset and as a handy reference of celestial
positions. Cf. Veltman and Keele 1986: 42–4.
5 Since PT = h/A and QT = h/B, it follows that the measured distance PQ,
which is just the difference between them, must be h times the difference between
1/A and 1/B. But both PQ and that difference are known from the measurement
results, so h can be calculated directly from the known numbers. For example if
PQ is 100 ft, A = 1 and B = 2 then h = 200 feet. As a historical note, Thales
was credited with calculating the height of the pyramids from the lengths of their
shadows, in the fifth century . One suggestion, however, is that Thales probably
did not prove any general theorem on similar triangles, but noticed (empirically)
that at a given time of day, for every object, the ratio of height to shadow was
the same. So he could measure a nearby stick and its shadow directly, and also the
length of the pyramid shadow—and then the pyramid height would be the only
unknown factor, so could be calculated.
6 In the case of Alberti too we see at least in practice a close connection
between his studies of measurement and of perspective. His Ludi matematici applies
mathematics to the measurement of distances, dimensions, and weights. The
357
rather than as a mathematician, his monograph begins in Book I with a good bit
of geometry and treats his subject in a mathematically sophisticated way.
8
Reputedly, Alberti went on to considerably more sophisticated examples. As
was reported at the time, ‘‘the pictures, which were contained in a very small box,
were seen through a tiny aperture. There you were able to see very high mountains
and broad landscapes around a wide bay of sea, and, furthermore, regions removed
very daintily from sight, so remote as not to be clearly seen by the viewer. He
called these things ‘demonstrations’ ’’ (from the anonymous Vita in Opere vulgari,
ed. Bounucci, pp. cii-ciii; cited in Alberti, p. 98, Explanatory Note 15).
9
In this passage and below—see also the passage quoted below from Leibniz
for a comparison—I am using the term ‘‘appearance’’ judiciously, in a way that
does not equate it with ‘‘phenomenon’’ but allows a distinction between an
observable phenomenon and its appearance (the way it ‘looks’ in the content of a
measurement outcome). Recall the remarks on this distinction in the Introduction
to Part I.
10
See Panovsky 1956. While these mechanical aides and machines may or may
not have been actually made or used, their design brings out clearly how the
perspectival drawing technique is actually the recording, in a sophisticated visual
display, of the outcome of a complex spatial measurement.
11 From Part I of Thomas 2006.
12 A crucial complication comes in when it is realized by the end of the
1989, cited in Jauernig 2004, chapter 1. The passage begins with ‘‘If bodies are
phenomena and judged in accordance with how they appear to us, they will not
be real since they will appear differently to different people. And so the reality of
bodies, of space, of motion, and of time seems to consist in the fact that they are
phenomena of God, that is, the object of his knowledge by intuition.’’
15 See Ryckman 2005: 128–35 on Weyl (especially the quoted passages on
as long as the other, lie on a line parallel to the painterly window then their
projections on that window will also be in proportion k:. If two line segments
parallel to the window are equal in size, with one edge on the central line of
sight, but the one is at a distance from the eye m times that of the other, then its
projection on the window will be /m as large as that of the other.
19 We focus here on the cross ratio of collinear points. There is also a dual, the
then CA/CB = 2/1 and DA/DB = 3/2, so their ratio is 4/3. If the painter placed
the points meant to correspond to these with e.g. distances AB = 9, BC = 3,
359
CD = 1, the cross ratio on his picture plane would be 12/3 divided by 13/4,
which is pretty close to 4/3, perhaps close enough for the purpose.
22
Because the cross ratio is equal to the same magnitude with the sines of the
corresponding angles replacing the line segments:
CA/CB : DA/DB = sin(CPA)/sin(CPB) : sin(DPA)/sin(DPB)
But those angles, and hence the sines, are the same if we look instead at the
projected points A , b, C, d for example. So that must also equal CA /Cb : dA /db.
The proof is elementary, using the fact that the area of any triangle equals on the
one hand half of the height times the base, and on the other hand the product of
two sides times the sine of the angle between them. All the relevant triangles in
the diagram with vertex P have the same height.
23 A demonstration experiment, to be distinguished from experiments that test
hypotheses on the one hand, and from experimental exploration on the other.
For Brunelleschi’s ‘experiment’ see for example Zajonc 1995. Whatever process
Brunelleschi actually used, it produced a painting that provides a view of the
Baptistery from a certain vantage point, and is thus properly conceived of as a
measurement of the Baptistery. See also Feyerabend 2001: 89–115 in his chapter
‘‘Brunelleschi and the invention of perspective’’, and my review thereof 2000a.
24 In the case of Alberti too we see at least in practice a close connection
between his studies of measurement and of perspective. His Ludi matematici applies
mathematics to the measurement of distances, dimensions, and weights. The
Elementa picturae describes some geometric figures and projections. In Descriptio
urbis Romae he details how he has used a surveying disk similar to an astrolabe; De
Statua deals with proportions in the human body and how to replicate them in
sculpture (see Martin Kemp’s Introduction to Alberti 1991).
25 For this example see Geometry Forum Articles at http://www.geom.uiuc.
30 http://www.hewett.norfolk.sch.uk/CURRIC/soc/theory.htm
31
http://www.indiana.edu/∼intell/map.shtml
32
‘‘Mediology: a metatheory for analyzing media and institutions’’; see
http://www.georgetown.edu/faculty/irvinem/CCTP748/mediology-map.html
33 We can relate this to Lewis 1979, where he construes all assertion as self-
full text is this: ‘‘Wenn ich auch noch so gut die Ordnung der Abteilungen
des Horizonts weiss, so kann ich doch die Gegenden darnach nur bestimmen,
indem ich mir bewusst bin, nach welcher Hand diese Ordnung fortlaufe, und die
allergenaueste Himmelskarte, wenn ausser der Lage der Sterne untereinander nicht
noch durch die Stellung des Abrisses gegen meine Hände die Gegend determiniert
würde, so genau wie ich sie auch in Gedanken hätte, würde mich doch nicht
in den Stand setzen, aus einer bekannten Gegend, z. E. Norden, zu wissen, auf
welcher Seite des Horizonts ich den Sonnenaufgang zu suchen hätte.’’ (Kant 1983:
995/6)
38
I hesitate to use such terms as ‘‘applied science’’ and was hesitant to insert the
word ‘‘pure’’ into this discussion: these points about indexicality pertain equally
well to experimental research in pursuit of theoretical goals.
39 We can think of Nelson Goodman as making this into a general point for
the viewing of art works as well, with his introduction of the concept of languages
of art.
40
To give but a few examples from the Oxford English Dictionary: ‘‘ F.
MYERS Catholic Thoughts IV. xxxv. 359 Clearly no method can be satisfactory
but that which preserves the perspective of history true. H. DRUMMOND
⁽⁾ 361
Lowell Lect. Ascent Man 11 Evolution . . . has thrown the universe into a fresh
perspective. J. GOULD & W. L. KOLB Dict. Social Sci. 262/1 There has
been much discussion from many perspectives as to the origins and ‘‘causes’’ of
fascism. Review No. 53. 21/1 Aiming for a 100 per cent safety record is the
right thing to do, not just from an ethical standpoint but also from a hard-nosed
business perspective. W. JAMES Let. 12 Dec. in R. B. Perry Thought &
Char. W. James (1935) I. 727 Metaphors and epigrams which, witty and striking
and perspective-suggesting as they often are, . . . may be in danger of having the
changes rung on them too long.’’
41
But note my qualifiers ‘‘in general’’ and ‘‘in their official formulation’’—I
am not ruling out examples that bespeak the contrary of the general claim.
42 To give one example from epistemology: as De Finetti emphasized, subjective
that Constantijn Huygens (not to be confused with his more famous son Christiaan
Huygens) conceives of the experience with the microscope as a case of seeing.
3
Heidelberger 2003. For an illuminating discussion that relates Heidelberger’s
conceptions to those of Baird and Harré, see Boon 2004.
4 This is precisely what Pitt 2005 discusses critically, addressing how the results
are presented in nano-technology literature. Contrast his discussion with the view
argued in Hacking 1985 and my reply to that paper in the same volume.
362 :
5 See Boon, op. cit. Nancy Cartwright’s [1999] concept of a nomological engine
was certainly also one of the factors inspiring me to see putative ‘observation by
instruments’ as involving engines of creation, although her concept is at the same
time different from, as well as related to, that of instruments as providing engines
to produce phenomena. Specifically, her notion of a nomological engine cannot
be understood apart from some grasp of her notions of capacity and necessity. But
the examples are of the creation of processes that instantiate certain patterns with a
faithfulness achievable in the laboratory and not found in an uncontrolled natural
setting.
6
See further Anderson 1993.
7 In summary, Mieke Boon offers the following classification that covers
although his illustrations of this idea are grist to my mill, it is clear also that there
are real differences in what we mean by this phrase. As I use the words, ‘‘observable
phenomena’’ is just an emphatic way of saying ‘‘phenomena’’, for by that word I
refer to all and only observable, i.e. perceptible, objects, events, and processes. See
further the discussion of phenomena versus appearances in Part IV.
9 Gilbert is said to have demonstrated to Queen Elizabeth I his theory of the
Earth’s magnetism: placing a small compass at various places around the terrella,
Gilbert showed that it always pointed north–south—offering this as a model to
explain why on Earth a compass points north–south. (Actually, even if the Earth’s
core is iron, it is too hot to be a magnet.)
10 For discussion both of the experiment and complaints about its poor use in
almost a century earlier. Its use to illustrate how new phenomena are created by
experiment does not depend on Poisson being the first one to create it, of course.
Even less does its being new have anything to do with the methodological value or
lack of value of novelty in predictions, if only because being new does not imply
that it should have been surprising.
⁽⁾ 363
World, I would like to thank the University of Oklahoma, and especially Dr. Kerry
V. Magruder, for access to the History of Science Collections to obtain material
from Hooke 1665 and Power 1664.
15
Although the distinctions drawn to classify instruments in science seem
usually to go against this assimilation, I find this suggestion for the microscope
also in Michael Heidelberger’s ‘‘Roentgen’s apparatus was, as we might say,
unconditionally productive, but there are other productive instruments that produce
known phenomena—although in circumstances where they have not appeared
before. I am thinking of instruments, like microscopes or telescopes . . . .’’ (op. cit:
146/147).
16
The rainbow is unlike reflections in the water because it is not the image
of some real arch. That is important to illuminate the point below. But the more
important feature is the status they both share with mirages (and share, I will argue,
with microscope images), which makes them ‘‘public hallucinations’’.
17 The qualities and structure of the rainbow do reveal something about the
light source; yet the rainbow is not a copy or picture of that source.
18
Hence the anecdotal evidence, in ancient sources, of the appearances of
gods and spirits, which could have been produced by projecting images made by
concave mirrors into smoke in a darkened room.
19 In such ‘quantifier’ locutions as ‘‘something’’ or ‘‘there is such a thing as’’
or ‘‘everything’’, the word ‘‘thing’’ does not occur with any substantive meaning,
but is a sort of pronomial device. In elementary logic we paraphrase ‘‘Something
is . . . ’’ as ‘‘There is x such that x . . .’’ and ‘‘Everything . . . is—’’ we render as
‘‘(All x)(if x is . . .then x is —)’’. The word ‘‘thing’’ has disappeared. Two of
the three occurrences of ‘‘x’’ correspond there to the relative pronoun ‘‘it’’. But
the first occurrence of ‘‘x’’, corresponding to the ‘‘thing’’ part of ‘‘Everything’’,
364 :
does not play a different role from the others. (This is clearer in combinatory
logic: a universally quantified sentence says that a certain predicate has universal
application.) In venerable terminology, use of ‘‘thing’’ in ‘‘something’’ is not
categorematic but syncategorematic.
20 Let me explain how I understand this. My experiences are the events that
happen to me of which I am aware. Such events have two sides, so to say: what
really happens to me and the spontaneous judgment I make in response, which
classifies that event in some way. In good cases the two coincide, but often they
do not. For example, I trip over a marmot but take it to be a cat. What happened
to me was that I tripped over a marmot, but I ‘experienced it as’ tripping over a
cat. See further my 2002: 134–6.
21 The most sustained objection along this line is that of Hacking 1981, which
actually also details tellingly the differences between ‘‘seeing through’’ an optical
microscope and a magnifying glass, but still argues for the realist conclusion. See
further my reply ‘‘Ad Ian Hacking’’ in the same volume.
22
Besides the above reply to Hacking 1985, see e.g. my 1982a, and the critique
of Inference to the Best Explanation in my 1989, chapter 6, and in my review of
Lipton 2005.
23 While I cannot go further with this here, I mean to echo at least some aspect
in water, certainly introduces a sort of observation report that I did not have in
mind while writing The Scientific Image. Then I was thinking quite simply in terms
of a classification of objects, events, and processes as observable and unobservable.
25 John Bell 1990 said famously that ‘‘ordinary quantum mechanics is just
fine FAPP’’ (introducing this abbreviation for ‘‘for all practical purposes’’), while
criticizing interpretations of the theory as unsatisfactory if they remain at that level.
26 To continue the preceding footnote about experience: to have the experience
of seeing a paramecium means here to have the spontaneous judgment that one
is seeing a paramecium, in response to what is happening to one, namely to have
one’s eye pressed to the eye-piece of a microscope.
27 His footnote at that point indicates, however, that he wrote this passage
stances, and the lines drawn for any useful distinctions are sensitive to value,
365
purpose, use, and other contextual factors. With respect to science, as to other
such important, pervasive aspects of our culture and civilization, the values an
empiricist pursues require rendering it intelligible without the need for metaphysical
underpinnings.
30 For a contrasting, realist way of appreciating this harmony, see Norton 2000
and 2003.
laws of motion, and his law of universal gravitation (which can only be formulated
in the context where the former are given) is an axiom of connection.
10 It is instructive here to see how, almost just in passing, Reichenbach touches
on ways to back this up: ‘‘Not only the totality of real things is coordinated to
the total system of equations, but individual things are coordinated to individual
equations. The real must always be regarded as given by some perception. By
calling the earth a sphere, we are coordinating the mathematical figure of a sphere
to certain visual and tactile perceptions that we call ‘perceptual images of the earth,’
according to a coordination on a more primitive level. If we speak of Boyle’s gas
law, we coordinate the formula p.V=R.T to certain perceptions, some of which
we call direct perceptions of gases (such as the feeling of air on the skin) and some
of which we call indirect perceptions (such as the position of the pointer of a
manometer). The fact that our sense organs mediate between concepts and reality
is inherent in human nature and cannot be refuted by any metaphysical doctrine.’’
(op. cit. page 37). But under what conditions do we have a right to call those visual
and tactile perceptions ‘‘perceptual images of the earth’’? It seems to me we cannot
classify them as such unless we are already able to describe perceptual images by
themselves on the one hand, and the earth on the other. If that condition is not
met, coordinatization seems to amount to no more than equating one otherwise
meaningless noise to another.
11 There is a good sidelight on the problem in Campbell 1943 where he discusses
‘‘on what experiments the Newtonian theory of dynamics is most suitably based,
and in particular whether quantity of matter, mass, and force can be measured
independently of that theory.’’ That the independent measurement procedures
he discusses involve reliance on previous coordination of some parameters with
measurement procedures is all too clear. Nor do his arguments meet the points
made by Poincaré and Sneed (see note below).
367
12 Poincaré pointed out that to measure the mass of an object, in the sense of
Newtonian physics, we must presuppose the object to be a Newtonian mechanical
system, to which Newton’s laws of motion apply. Joseph Sneed later made this
insight the basis of his theoretical/non-theoretical distinction.
13 Mach provided what he called a definition of mass, for example, but to say
that is to use his own words; by our present understanding of ‘‘definition’’ he did
not do that—we should rather read it as another example of attempts to spell out
a coordination. Patrick Suppes’ comment (1957: 298) is strictly speaking correct,
but for a more balanced exposition of what Mach was up to, see Koslow 1968.
14
What about modality: the object has such and such a feature if a measurement
would have outcome so and so, if the measurement were made? This pushes the
problem back a step: how do such modal assertions receive empirical content?
15 Once that theory is accepted, it begins to infect the language in use; the
which enlarges on and corrects the history that Mach himself provides in Mach
1986. There are now more up-to-date treatments (see for instance Chang 2004)
but the main philosophical points I wish to take up are all in Mach’s treatise. As
I will say also of Poincaré, Mach is presenting us with a ‘‘just so’’ re-creation of
the history, to make the philosophical points, even though as conscientious as he
could be with respect to the historical details.
19 The entire passage from the records of the Accademia is quoted in Middleton,
pp. 33–4.
20 That what is measured is still, in our terms, a combination of mutually isolable
factors, remains true though. Middleton writes about Hooke’s liquid thermometers:
‘‘While it was an excellent attempt, it suffered from several disabilities: spirit of
wine is not a well-defined substance, its properties varying rapidly with the
368 :
amount of water in it; Hooke’s choice of the freezing point of water, rather than
the melting point of ice, was unfortunate; and he did not make any allowance for
the difference in expansion of brass—or silver—and glass. It may be noted that
Réaumur inherited the first two of these sources of error.’’ (op. cit. page 46)
21 Liquid water has its maximum density at 3.98 ◦ C, and expands both if the
in gases near the end of the seventeenth century, and his results led him to
the speculation that a sufficient reduction in temperature would lead to the
disappearance of pressure. As Hasok Chang points out, this is a different notion
of ‘‘absolute temperature’’ from that initially introduced by Kelvin, though later
the two were assimilated. Kelvin was originally concerned to give an independent
standard for equality of temperature intervals, not tied to the choice of one
‘‘standard’’ thermometric substance. For this purpose he drew on Carnot’s work,
using the measure of work equated with the descent from a given higher
temperature to a lower one: a specific amount of work could mark a difference of
one degree, so that ‘‘all degrees have the same value’’. See Chang 2004: 173–86.
25 Poincaré, H. The Value of Science, ch. II. ‘‘The measure of time’’. The page
numbers cited below are for the French edition, Flammarion 1970 (original 1905).
26
This begins section III; I am ignoring the passages in which Poincaré
concentrates on psychology.
27 Cf. my 1970: ch. III-2-c, pages 78–81 concerning Bosanquet, Russell, and
Russell’s debate with Poincaré on this subject. In 1897 Russell still states at least
that ‘‘No day can be brought into temporal coincidence with any other day . . . .;
we are therefore reduced to the arbitrary assumption that some motion or set of
motions, given us in experience, is uniform’’ (Russell 1996: section 151, 155), but
his views expressed in that debate shortly afterward were already quite different.
See further Grünbaum 1968: 44f.
28 This is analogous to the question whether the spatial congruence relation is
keeping with the ancient form of time reckoning that specified units of time
in astronomical terms. Note that the 24-hour period thus defined, in terms of
passage of a fixed star across the meridian is not equal to the older one defined
in terms of the sun’s passage across that meridian, with an average difference of
almost 4 minutes. The latter, however, are not equal to each other if reckoned
in sidereal time; ‘sun days’ are not the same during the different seasons of the
year.
31 In section IV he considers the suggestion that the case of the pendulum
façon que la loi de Newton et celle des forces vives soient vérifiées.’’ (Ibid. 46)
This could perhaps be said more appropriately about Euler; see my 1985, section
III-2-a. 73–4.
33 ‘‘Le temps doit être défini de telle façon que les équations de la mécanique
1961: 46, 247; Roxburgh 1977. See also the discussion in Grünbaum 1952 followed
by Grünbaum 1954 and Whitrow 1954: 151; Grünbaum 1968: 18–19.
35
As Fine 1986 showed clearly, Einstein’s views are not univocal; in this
connection I would cite specifically his 2004a and the debate he imagined between
Reichenbach and Poincaré (Einstein 1988: 676–79).
36 See Ryckman 2005 and my 2007.
37 From the Latin ‘‘scrupulum’’ meaning a small sharp, or pointed, stone: an
here, and I take the response that we are presently exploring to be precisely of the
sort that Putnam called metaphysical realism. What I call Reichenbach’s problem
is certainly not unrelated to the problems raised by Putnam in his celebrated
‘‘model-theoretic argument’’, which we shall have occasion to examine as well.
41
Not to mention the required previous coordination for the classifications of
height and mercury, for example.
370 :
stood if taken out of context. The assertion that temperature, as we now understand
it, is constituted in the historical development starting in the time of Galileo does
not imply, for example, that it is not now correct to say that there were days
before his time when the temperature in someone’s cellar dropped to 4 ◦ Celsius.
Compare the carefully evenhanded treatment in Bitbol [forthcoming a] of Latour’s
discussion of whether Ramses II died of tuberculosis.
44 This is not to deny that the theoretical representation goes beyond what can
in this historical process, I have also simplified the matter. There is typically a broad
range of theories involved; when eventually the community achieves stability in
the measurement of a theoretically identified parameter, that will generally involve
concordance between a variety of different procedures, for which the harmony in
their results is explained theoretically by their counting as measurements of the
same quantity. The main points, about how theory and practice are entangled in
this process, and how that involves elements of choice as well as appreciation of
empirical regularities, remain the same.
6. Measurement as Representation: 1
1 Perhaps we’ll be less happy, however, to call it a painting of the Duke than a
portrait of the Duke. On this distinction see Freeland 2007.
: 371
theory, only that theoretical terms are used in its formulation. For example, ‘‘this
powder is classified in chemistry as sodium nitrate’’ is theory-laden but does not
imply anything about whether the chemical theory is true or false.
4 Cf. Weinberg’s 1998 discussion of mass, and my 2002: 115–16.
5
Even if not equally salient, the situation is not in principle different in
earlier physics. The thermometer, for example, is itself an object in the domain
of thermodynamics, and the theory is required, for its coherence, to admit a
description of the thermometer-object interaction that satisfies the relevant criteria
for measurement.
6 Those questions can appear also even if the theory is well established and
accepted; however, they will play a visible role mainly if the measurements are
being made at the theoretical threshold of accuracy and precision. Examples include
the gravitational disturbance by ‘test particles’ in general relativity, which can be
lessened by reducing size, while in quantum mechanics reducing size increases
uncertainty.
7 Perhaps I am being too charitable. If this extreme reaction ever had any
plausibility, that may have been partly because it is generally only in a new
theoretical context that the relevant interactions first occur. But if the point needs
to be hammered home, take a simple example: you and I inspect a paper with
black marks on it, I recognize it as a Greek word and can read it while you have
no idea what it is. Are we seeing the same thing? Undoubtedly. If there is any
doubt, I will ask you to take a pin and trace the black marks. You may not do it
in the order in which I would, but you certainly trace the same marks—so you
are seeing those very same black marks. If it is objected that here I chose a context
in which you and I do share pertinent concepts and apply them in the same way
to the object before us, I answer that this is inevitable. No matter how far back
we go, with examples of comparative ignorance or lack of similar education, there
is a common background sufficient to ensure communication. Or, if you like, if
there is not, as with perhaps an alien intelligence, then the question of whether
we are seeing the same thing doesn’t arise at all. There is no intelligible Robinson
Crusoe state without prior knowledge or opinion, a tabula rasa waiting to be
inscribed by bare experience. But that we see the same thing does not mean that
you can see it as writing, or as Greek, let alone read it. So for me this object
is classified, is assigned a location in a logical space, which may not be available
to you.
372 :
already tell you that things aren’t so simple. For do we really want something like
this: if the object has the property that the color objectivist account mentions then the apparatus
will end up with the property that the color subjectivist account mentions? One of the
objections to the objectivist account is that there is no physical property, describable
without reference to observers, which equates in this way to the subjectivist’s
observer’s property. There is no physical property that the object has if and only if it
looks blue to the observer. The most troubling phenomena for the color objectivist
are the contextuality and constancy of color. Observers will report different colors
when the object is placed against relevantly different backgrounds; on the other
hand they will continue to report the same color while the object is subjected to
different lighting conditions. There is a further difficulty in the common inability
to identify colors independently of certain objects or kinds of objects that have
them. Seeing a blue steel ball and a blue woolly sweater, we can always continue
to doubt whether they really have the same color (shade of blue). Sean Kelly 1998:
The second kind of dependency—the dependency of a perceived property on the object
it’s perceived to be a property of—is shown by Peacocke’s example of the height of the
window and the height of the arch . . . and also by Merleau-Ponty’s equivalent claim that
‘the blue of the carpet would not be the same blue were it not a woolly blue’. The basic idea
is that when I perceive a property like height or color, what I see is not some independently
determinable property that any other object could share; rather what I see is a dependent
aspect of the object I’m seeing now. (paragraph 26).
12 I first heard this sort of example from Simon Kochen in a seminar in
Princeton.
13 Also referred to in the literature as ‘‘faithful measurement’’.
14
The first sentence of this section was phrased more carefully than the second
(which is a standard formulation), because of differences in interpretation with
respect to the notion of state. (See for example my [forthcoming] on Rovelli’s
Relational Quantum Mechanics.) It is more interpretation-neutral to discuss the
issue in terms of physical quantities (observables) and their values only, than in
terms of states. But since most discussions are in the standard formulation which
: 373
takes the notion of physical state for granted, I will ignore this complication except
when very pertinent. This remark applies also to subsequent chapters.
15
These factors can be identified in many alternative ways, when the math-
ematical foundations of a theory are presented. The state is typically something
quite theoretical. In some presentations, however, the state s is simply identified
by (or with, or through) the set {Psm : m is an observable}. On the other hand,
sometimes the observables are simply identified by (or with, or through) the way
in which states assign those probabilities. Also, a simplification is typically achieved
by thinking of all observables as ‘‘made up’’ of simple ones, whose possible values
are real numbers, so that the variable E in Psm (E) can just range over the Borel sets
of real numbers (that is, the sets formed from intervals by infinitary intersections
and unions). We can proceed on a level that abstracts from these different forms of
representation of basic theory structure.
16 The theory must imply more than that the criterion is satisfied in a particular
case; it must imply that this is always so for such cases. In quantum theory this
means that the Hamiltonian which governs this sort of interaction is such as to
guarantee that relation between initial and final states. This point is crucial to
eliminate limiting cases as counterexamples; see the discussion of an objection by
Jon Dorling in my 1991: 221.
17 What I have here related in very general terms is provided in detail for
quantum mechanics in the references above. The reader may naturally wonder
whether our account of measurement so far helps to solve or dissolve the famous
‘‘measurement problem’’ of quantum mechanics. The answer is Yes and No.
On the one hand, the account which takes for granted that surface models
are produced from realizable experimental conditions, and views the empirical
content of a theory as consisting in claims as to how and to what extent these
surface models can be accommodated by the theoretical models, does not run
into conceptual difficulties. On the other hand, in quantum mechanics there is a
core problem—not touched here at all—that can be formulated for the physical
correlate of measurement taken in and by itself (without regard to the ‘‘information
gathering’’ connotation of the word measurement). This core problem is one to
which we will return under the heading of ‘appearance and reality’ in theoretical
description.
18 I call the replacements ‘‘surface models’’ in contrast to the data models; I will
satisfied at least to the extent that any parameter has a definite value when it is
measured, and that means always, if any occasion at all would in principle allow it
to be measured. But the Criterion for the Physical Correlate of Measurement does
not have any such implications by itself.
374 :
20 Not without premises going beyond the one that the surface model is thus
related to that state—what are needed are also premises about the sorts of systems
involved, and their relations (general conditions formulated in terms of e.g. the
Hamiltonians). On the assumption that the theory applies, however, these too
are extrapolated from, or made with support of, previous measurement results in
experimental situations.
21 By sequencing such Stern–Gerlach apparatus it is easily shown that the
process does not leave the state unaffected; hence data have to be collected on
different samples of the original beam.
7. Measurement as Representation: 2
1 Francis Bacon insisted, in his clarion call to the founders of the new sciences,
that experience must become literate (Bacon 1994: Novum Organum I:101). By this
he emphatically did not mean to include the sort of bias that practically every
form of literacy has riding along with it. Yet bias is inevitable, both for good and
for bad, and it is not incidental. If we can read at all, our responses are shaped
by presuppositions and assumptions, prior opinion, conditioning, learning, not to
mention strong intellectual commitments and norms that govern our selectivity.
We read measurement outcomes through theoretically-schooled eyes; and this is appropriate
as well as inevitable, while also inevitably hostage to the fortunes of later learning
that shows those spectacles’ distortions.
2 For the purpose at hand we can ignore some complexities, but they will
be remembered from the first chapter. How the object is represented by the
measurement outcome depends on the theoretical context in which it is read:
meaning is not independent of reading. Recall the example of ‘‘burro’’, read
as a word in Italian or as a Spanish word—a symbol has its meaning not
absolutely but due to its role in a (contextually) given language, symbol system, or
representational framework, which in scientific contexts is determined by a theory
governing discourse and practice.
3 Both here and elsewhere I am using illustrations from the history of meas-
5 At first blush at least there are numerical scales which do not fit in any of
these four categories. For example, an earthquake ranked 6 in the Richter scale is
10 times stronger than one ranked 5, which in turn is 10 times stronger than one
ranked 4. But this is a matter of presentation: the magnitude on the Richer scale is
the logarithm (to base 10) of a magnitude on an underlying ratio scale, namely the
logarithm of the combined horizontal amplitude of the largest displacement from
zero on a seismometer output. Thanks to Richard Otte for raising this point.
6 The Mohs scale of hardness had its difficulties. Mohs, in developing his scale
for the hardness of minerals, ranked minerals relative to each other by the relation
‘‘scratches’’. He selected ten minerals to represent particular points on the scale
and assigned them numerals from 1 to 10. The operation of scratching does not
give more than an ordinal significance to these numbers. Mohs had assumed that
his relation of ‘‘scratches’’ was transitive and asymmetrical and that the equivalence
indicated by not being able to scratch one another was transitive and symmetric.
This assumption ran into a problem when it was found that some minerals could
not scratch each other, yet differed with respect to scratching a third mineral.
But this does not refute that an ordinal scale could be defined in terms of the
classes of minerals scratchable and minerals not scratchable. That is, minerals could
be classified as equally hard if they can scratch all the same minerals, and can be
scratched by all the same minerals. Yet here too there is an empirical assumption,
if it is asserted that this leaves no ambiguity in the ordering. Later attempts to
arrive at a measurement scale for hardness by other operations such as microscopic
measurement of the depth of a scratch made by a diamond under constant pressure
or the amount of work done in grinding away a certain weight or volume of
material, allowed for more quantitative comparisons. Note well: the real issue here
was to find ways to arrive at a stable ordering induced by outcomes of physical
procedures applied to the minerals.
7 Suppose that x, y, z are three Fahrenheit temperatures, and x , y , z the
systems with different numbers of degrees of freedom, and the relevant phase
spaces are thus not common to all models. Models can be grouped by common
state spaces (see Lloyd 1994: 19–20 and 35). Secondly, the conception of time,
space, or space-time as a logical space for which I argued in my 1970 is of course
controversial, being at odds with substantivalist theories in this area.
13 This becomes especially pertinent when the ascription of an ‘unsharp’ value
in quantum theory does not arise simply because of ignorance of the ‘real’ sharp
value (see Appendix pp. 312–314).
14
A well-known paper by James Bogen and James Woodward (1988)—but in
a way that results in a terminology more confusing than the simplifications they
criticize. It is not grammatical, it seems to me, to say that data are observed—a
datum is the content of the outcome of a measurement. Nor does it seem to me to
properly respect the history of the word ‘‘phenomenon’’ to say that phenomena are
typically not observable. Admittedly, regimenting language always has its leeway;
here I will concentrate on the word ‘‘data’’, and in a later chapter address usage of
‘‘phenomena’’ and ‘‘appearances’’.
15
The analysis may be made by the experimenter, or already automatically by
the instrument which may issue e.g. a graph rather than a set of points. Thanks to
Todd Harris and Paul Teller for pointing this out.
16 I am taking for granted here agreement that a frequency interpretation
up the soil samples to disseminating the smooth graphs and annotated maps, in a
study of the changing boundaries between savannah and forest.
18 The semantic view of theories to which this statement refers is easily
exactness. For detailed considerations of how this can miscarry, and how we are
to relate to the inexact representations we can have in practice, see Teller 2008, as
well as his earlier 2001a.
20
This does not ignore the possibility that some measurement choices in
PRC are mutually incompatible; in fact, that would make it easy for the family
: 377
and 1982b.
23
An algebra is a mathematical structure consisting of a set of elements and a
collection of operators on those elements—though the term is variously defined
in different contexts in mathematics, so as to narrow the meaning (e.g. an algebra
is a vector space with a bilinear multiplication operation).
24 As for representation in general, so for measurement: it would be useless,
perhaps fatuous, to try for a definition. But in this case too, we can come to an
understanding of the subject by eliciting its general features and placing them in
context.
25
Weyl 1953: 8–9; quoted and discussed in Ryckman 2005: 134.
26 From section I of ‘‘The Relativity of Space’’ in Poincaré 1897.
27 Important to keep in mind that ‘‘invariant’’ is not an absolute term, that it is
previous stage of our history, it does not follow that there was or could have been
a moment when meaningful language was created from nothing. With scientific
terms being our concern alone, we do not need to speculate on how language came
into the world; we must only become clear on how new terms can find their use
when explicitly introduced for practical and theoretical reasons. For the way David
Lewis responded quite differently to the basic problems in philosophy of science
(though mainly as posed by Carnap and Putnam) see my detailed examination of
his take on this issue in my 1997b.
4 Much of the problématique encountered here is found, mutatis mutandis, in
the ‘Ramsey sentence’ literature. I will not take that up, but see my 1997b, where
I examine David Lewis’ attempt to define theoretical terms by adapting Ramsey’s
move.
5 My phrase ‘‘reach for the unconditioned’’ is of course meant to echo Kant’s
diagnosis of what he calls the transcendental illusions, the illusions of reason: such
illusions are made inevitable by reason’s tendency to seek the unconditioned, that
is, to carry a series of ideas or questions or arguments to their ‘logical conclusion’
even when their completion would lie clearly beyond the bounds of sense.
of Cassirer 1950. Today the term seems to be used mainly for the study of visual
media, both in aesthetics and in communication theory, but in philosophy also for
Wittgenstein’s ‘‘picture theory’’ of meaning in the Tractatus.
2 Nyhof 1988 argues convincingly against the claim that Boltzmann later
1970.
4 This sort of conflation of the philosophical issue of scientific realism with
11 Schrödinger 1929: 16; cited D’Agostino 2004: 381; see also Schrödinger
1928.
12
Schrödinger 1951: 40; cited Bitbol 1996: 29; and see further Bitbol 1996,
passim, and Schrödinger 1953. The first part of this passage does not attribute
continuous spatio-temporal trajectories for particles; it refers solely to the fact that
the wave function of a system is defined with spatial coordinates (though it is not
a wave in 3-space).
13 The reference is to Aristotle, Meteorology I, 7. In the next section (proposition
205), Descartes submits that under these conditions we can have ‘‘moral certainty’’
(‘‘a certainty sufficient for the conduct of life’’). This could certainly be cited
as presaging (an admirably modest version of) the Rule of Inference to the Best
Explanation.
14 In medieval paintings of the Annunciation we sometimes see an attempt to
assimilate the action of the Holy Spirit to optical or mechanical interaction; see for
example Steinberg and Edgerton 1987 on Filippo Lippi’s Annunciation in London.
15
For textual support see Blackmore 1999. Blackmore, a prominent Boltzmann
scholar, is very unsympathetic to the anti-realist leanings and makes a point of
showing that Boltzmann is careful to hedge his bets on this account.
16 Here I am much indebted to Hyder and Lübbig 2000 and Hyder 2003.
17 Heisenberg 1945, p. 36; cited in translation in Cassirer, op. cit., p. 117.
There is a clear echo here of Bohr’s repeated insistence that the wave function is
no more than a summary of what will be observed in measurement arrangements
which themselves are described in our common language in use before the advent
of atomic physics.
18 Hertz 1962: 20–1.
19 When Poincaré later says that Maxwell had shown that there must exist
‘‘[C]omment nous assurer que les principes que nous imaginerions, sont ceux-
mêmes de la nature? Et sur quel fondement voudrions–nous qu’elle ne sache faire
les choses qu’elle nous cache, que de la manière qu’elle fait celles qu’elle nous
découvre ? Il n’y a point d’analogie qui puisse nous faire deviner ses secrets ; et,
vraisemblablement, si elle nous les révéloit elle-même, nous verrions un monde
tout différent de ce que nous voyons. [. . .] C’est que l’imagination voit tout ce qu’il
lui plaît, et rien de plus’’ (Condillac 1749/1798/1949: 197–8; cited and discussed in
Vuillemin 2005).
23
My translation; ‘‘Nulle théorie ne semblait plus solide que celle de Fres-
nel . . . Cependant, on lui préfère maintenant celle de Maxwell. Cela veut-il dire
que l’œuvre de Fresnel a été vaine? Non, car le but de Fresnel n’était pas de
savoir s’il y a réellement un éther, s’il est ou non formé d’atomes, si ces atomes
se meuvent réellement dans tel ou tel sens ; c’était de prévoir les phénomènes
optiques’’ Poincaré 1968: 173.
24 This view, expressed by John Worrall, was named ‘‘epistemic structuralism’’
by James Ladyman.
25
There is an often mentioned bit of support for such views, already alluded
to by Maxwell’s remarks about fruitful analogies across different areas of physics.
Important equations tend to recur in many places. They tend to identify recurrent
patterns in nature, found not once but many times. Often a new process is first
described in analogy to an old one, with the equations transposed or reinterpreted.
Heat diffusion and gas diffusion are analogous, the harmonic oscillator crops up
everywhere. . . . So the equations omit the distinguishing characteristics. As a
reason for structuralism, this observation does not show much at all. For whenever
we see the same equations describing two scientific subjects, we also see science
describing the differentiating characteristics. If we didn’t, we wouldn’t have an
example to give! The point that such equations describe at once many different
processes needs serious reflection, but it is not much of an argument for any
general view.
26
It is exactly the difference he outlines between the technical language of
a practical profession or craft, and scientific language: only the former can we
interpret by identifying the technical terms’ referents (Duhem 1962: 147–53).
27 As I see it, this applies to all the sciences, not just physics: biology and
the social sciences too involve construction of models that are essentially abstract,
hence mathematical, structures. I realize that this view rests on a view of what
mathematics is, how it encompasses all abstract representation used in science.
28
This insight, and the problem it raised, is found quite explicitly in Carnap’s
Aufbau (some five years before Weyl’s lecture) but with enormous resistance to
its import (as we’ll see below). The famous objection by Newman to Russell’s
382 :
structuralism, at about the same time, trades on a specific instance of what Weyl
states here with complete generality.
29
We will need to re-examine this point carefully below, since assertions of
isomorphism are context-sensitive—a group isomorphism between two mathem-
atical objects certainly does not imply that they are identical, for example, for
although they are both groups, they may have other characteristics as well. Weyl’s
point is telling only when there are no mathematically describable features left out
of account in the isomorphic mapping.
30
In the novel it seems she has simply been told that her fingernails are
pink. She could have also done an experiment to see what a spectrometer would
show about the light reflected from her fingernails, so that she would know the
wavelength of the reflected light. Such variations do not affect the problem.
31 The crucial clue here, as in our discussion of the use of maps and models in
Part I, I see in the word ‘‘this’’ in Mary’s ‘‘this is pink’’. For a similar line, with
respect to the philosophy of mind issues for which the Mary example was devised,
see Ismael 1999.
next note). Russell’s reply (similar to Frege’s point of view, see below), is in effect
that the axioms of geometry are statements capable of being true or false, which
presupposes that their terms designate; what those designata are like determines
the truth value.
5 ‘‘La question de M. Poincaré me place dans la situation désavantageuse d’un
cette lettre dans sa réponse. Si cet écolier était mathématicien, il répondrait tout
bonnement : A est la lettre qui précède B; et si on lui demandait d’épeler B, il
dirait que c’est la lettre qui suit A. Mais s’il sait vraiment ce que c’est qu’épeler, il
renoncera à la tache, de désespoir.’’ (Russell 1899: 701)
6 Frege elaborated his similar response in a controversy with Hilbert during the
years 1899–1906. (See Coffa 1986; Torretti 1984: ch. 3, section 2.10.) The theory
of space must be non-vacuous and true. But the question of truth can’t arise unless
the primitive terms have an independent meaning which fixes their reference.
They certainly cannot get that meaning from the axioms or theorems, as Hilbert
asserted. For those axioms and theorems are incapable of having a truth value
unless their terms have referents. Pressed to explain how we identify those real
spatial relations, Frege also retreated to intuition or direct acquaintance: ‘‘I give the
name of axioms to propositions which are true, but which are not demonstrated,
because their knowledge proceeds from a source which is not logical, which we
may call space intuition’’ (in correspondence with Hilbert; quoted Torretti 1984:
235). This was of course just the sort of thing which Poincaré and Hilbert explicitly
professed to not understand.
7 Russell 1901; see specifically pp. 313–14; this passage was spoofed in Philip
only sense data (and possibly the self ), but that point does not play much of a role
here. The objects postulated by physics are beyond the reach of experience; that is
the only relevant point here.
9
The importance of this article, its devastating import, and its relation to more
recent discussions of realism, was pointed out by Demopoulos and Friedman.
10 We can also easily put the point in terms of models and equations. Suppose
some equations have a model in which there are N distinct entities (where N
may of course be uncountably infinite). Choose a set of the same cardinality in
the world. Because same cardinality implies the existence of a correspondence, we
have an implicit transfer of the relations in the model to that chosen set. Therefore
the world satisfies those equations! Provided only the world’s size is large enough,
experimentation is superfluous. As David Lewis realized (and as we shall take up
below), Newman’s point becomes simply Putnam’s model-theoretic argument,
once we phrase it in terms of models and equations.
11 Or so it seems. In the next chapter we will see how David Lewis denied this
very assertion.
12
Since we are here speaking of a repair being carried out within the context
of Russell’s own epistemology, note that the only things with which we are
acquainted there are our sense data and those Universals (and possibly our selves).
384 :
But we can presumably adapt the same moves to a more liberal view according to
which we are acquainted with all the observed phenomena as usually understood.
13
In the discussion of Putnam’s paradox, we will see this question returning in
a more general way. The answer is yes, the same problem does return even if one
retreats from Russell’s extremism with respect to non-logical terms.
‘‘extraordinary suggestion’’ (1999: 103), and asks ‘‘But what can the ‘experi-
enceable, ‘natural’ relations’ be except precisely those relations somehow available
for ostension?’’ (ibid.).
5
I’ll quote here the argument as it was originally presented to the APA in 1976
(in which that core is not isolated): ‘‘So let T1 be an ideal theory, by our lights.
Lifting restrictions to our actual all-too-finite powers, we can imagine T1 to have
every property except objective truth—which is left open—that we like. E. g. T1 can
be imagined complete, consistent, to predict correctly all observation sentences (as
far as we can tell), to meet whatever ‘operational constraints’ there are . . . to be
‘beautiful’, ‘simple’, ‘plausible’, etc. . . .
I imagine that has (or can be broken into) infinitely many pieces.
I also assume T1 says there are infinitely many things (so in this respect T1 is
‘objectively right’ about ). Now T1 is consistent . . . and has (only)
infinite models. So by the completeness theorem . . ., T1 has a model of every
infinite cardinality. Pick a model M of the same cardinality as THE WORLD.
385
Map the individuals of M one-to-one into the pieces of , and use the
mapping to define relations of M directly in . The result is a satisfaction
relation SAT—a ‘correspondence’ between the terms of [the language] L and
sets of pieces of —such that theory T1 comes out true—true of
—provided we just interpret ‘true’ as TRUE(SAT). So whatever becomes
of the claim that even the ideal theory T1 might really be false?’’ (Putnam 1976:
485; Putnam 1978: 125–6)
6 This theorem was initially debated in the context of philosophy of math-
ematics, in connection with Skolem’s relativism. One spoke there also of the
Loewenheim–Skolem paradox.
7 Thanks to Jenann Ismael for striking examples of this sort.
8 It might be objected that the map, like Paris itself, has much additional
structure, while a model has only the structure it displays. That is not a good
disanalogy, for the model also displays a selection of the structure it has. Suppose
the model is D, F with D its domain and F a family of sets and relations on D.
Then F is of course a selection from the set of sets of n-tuples, for various numbers
n, of members of D. All those other sets of sets there are selectively excluded
from F.
9 This is just the sort of example that must have led Leibniz to agree that
context of the semantic view of theories, shares this vulnerability (cf. Demopoulos
2003). I began to face these difficulties in my 1997a, 1997b, 2001, and mean to
complete their dissolution here.
7 Easy to see, I think, the echo of this passage in Reichenbach’s student
Putnam’s ‘‘This simply states in mathematical language the intuitive fact that to
single out a correspondence between two domains one needs some independent
access to both domains’’ which I quoted above.
8 Mea culpa: in The Scientific Image constructive empiricism was presented in
the framework of the semantic view of theories, but seemingly in the shape
of the above ‘‘offhand’’ realist response. See for instance ch. 3 section 9, p. 64
where I define empirical adequacy using unquestioningly the idea that concrete
observable entities (the appearances or phenomena) can be isomorphic to abstract
ones (substructures of models). Demopoulos 2003 comments on this passage. That
this ‘‘offhand’’ way of talking is most readily interpreted in metaphysical terms is
clear also in comments by such acute and careful readers as Stathis Psillos (2006,
remarking on correspondence and structures in re) and Michel Ghins (Ghins 1998:
328). Rather than try to excuse or explain my obliviousness to these issues at that
time, I tried to do better in my 1997a, 1997b, 2001, and I will try to do better here.
9 Admittedly, this conception is quite a natural one to find in the foundations
of physics, for physical systems in the intended domain of a theory are conceived of
as highly structured, and the entire discussion is targeted on mathematical models.
10 In his preface Whitehead says that he took the term from an earlier use:
‘‘The general name to be given to the subject has caused me much thought: that
finally adopted, Universal Algebra, has been used somewhat in this signification by
Sylvester in a paper, Lectures on the Principles of Universal Algebra, published in the
American Journal of Mathematics, vol. vi., 1884. This paper however, apart from the
suggestiveness of its title, deals explicitly only with matrices.’’ (Whitehead 1898)
11 This point is made and explored by Psillos 2006.
12 The motto of the British Order of the Garter. When Edward III danced
with the Countess of Salisbury her garter fell off. Edward said ‘‘Honi soit qui mal
y pense,’’ (‘‘shame on whoever thinks ill of it’’) and tied the garter around his own
leg.
13 It seems to me, though, that this line of thought is disappearing from the
scene: Stephen Leeds (1978, 1994, 1995, forthcoming ) shows that the scientific
realist does not need to rely on a correspondence theory of truth; lest we think
that metaphysical realists must all need it, there is David Lewis 2001.
14
I won’t do much to argue that here; if a reference is needed beyond the
above discussions of the correspondence theory of truth, my own favourite is
Quine’s ‘‘On what there is’’.
387
15 See e.g. the papers by Leeds and Lewis cited above, or the enormous further
literature on the subject.
16
This is essentially the point raised by Mauricio Suarez which I took up in
Chapter I-1.
17 Giere 1988: chs. 3, 7, 8; Giere 1999: chs. 6, 7; Teller 2001, discussed in my
2001.
18 There is an echo here of the main point of virtue epistemology: for some
information held to count as knowledge, not only its content but the history of
its acquisition is crucial. The point is different but the re-direction of attention, to
how the item is arrived at, is the same.
19 It may be objected that a given model may not have the requisite complexity
(i.e., the phenomenon that we are describing) is first defined by the coordination.
But it could only be ‘something like this’, since this way of putting it seems to
imply that what the models represent is something like the physical objects as described
rather than the physical objects. The ‘‘as’’ in that phrase is the traditional ‘‘qua’’,
and one is meant to both identify and distinguish the referents of ‘‘the X, which
is F’’ and ‘‘the X qua F’’—not a resource the later empiricism can draw on, to
put it mildly. I want to thank Anja Jauernig for help in my effort to understand
Reichenbach’s 1920 attempt to forge a modified neo-Kantian view of the matter.
22 The simplifying assumptions are strong of course: consider just two bodies,
the mass of one of them much less than the other’s, and with the distance between
them remaining limited. Newton’s laws admit other solutions even for the two-
body system and also quickly allowed corrections to Kepler’s laws, taking into
account that the smaller mass is not negligible even for a planet in relation to the
sun.
23 That is not to deny that there is a pertinent 3-place relation that can be
described, assigned a set as its extension, and so forth: namely a relation between
the scientist, the bacteria colony, and the data model this scientist constructed.
The important point is that an indexical sentence is not meaning-equivalent to an
388 :
non-indexical one, except within or relative to a context of use, and it can have
uses which cannot be served by any non-indexical sentence.
24
As Psillos 2006 also emphasized.
25 A response entirely at odds with empiricist scruples would be to escape
‘Snow is white’ true?’’ They are not the same question. After all ‘‘Snow’’ could
have been our word for grass, water, or wine. But given that this sentence is our
sentence, as things actually are, the questions amount to the same thing for us. That
is why for us, who speak this language, ‘‘The sentence ‘Snow is white’ is true if
and only if snow is white’’ is a pragmatic tautology.
27 It may be easy to see the similarity to Descartes’s ‘‘I think’’, for which he
demonstrates not the necessity or truth but the indubitability for the speaker of
the ‘‘I’’, so that the sentence fits well into the category of pragmatic tautologies. It
seems to me that the same holds for Putnam’s ‘‘I am not a brain in a vat’’ in view
of Putnam’s own form of refutation of this sort of scepticism.
28 Now we can see why the offhand realist responds sounds plausible at
first, because it does get something right—namely that in the end there is no
problem, precisely because we can (a) correctly describe relevant parts of nature
and mathematical objects, and (b) say how they are related to each other. But this
plausibility hides the mistake of replacing ‘‘we can’’ with a relation independent of
the user (the ‘‘we’’) and ignores the selectivity exercised by the user for the user’s
specific purpose.
me to this passage.
4 I do not mean to underrate the way in which mathematical modeling too
trades on resemblance. But the extent to which mimesis is involved at all diminishes
continuously as we move our view successively from table-top models to textbook
illustrations, from there to mathematical modeling, and eventually to the geometric
spaces that appear in relativistic cosmology and quantum field theory.
. 389
grand book, the universe, which stands continually open to our gaze. But the book
cannot be understood unless one first learns to comprehend the language and read
the letters in which it is composed. It is written in the language of mathematics,
and its characters are triangles, circles and others geometric figures without which
it is humanly impossible to understand a single word of it; without these, one
wanders about in a dark labyrinth.’’ Il Saggiatore (1623), in Drake 1957: 237–8.
Similarly, ‘‘The book of philosophy is that which stands perpetually open before
our eyes, but because it is written in characters different from those of our alphabet
it cannot be read by every body; and the characters of this book are triangles,
squares, circles, spheres, cones, pyramids and other mathematical figures fittest for
this sort of reading.’’ Lettera a Fortunio Liceti, gennaio 1641 in Crombie 1994: vol.
i, 585.
6 This quaint conviction beset much of early empiricism as well. Locke writes
that the mind ‘‘hath no other immediate object but its own ideas’’ (An Essay
Concerning Human Understanding, Bk IV, i, I); Berkeley that ‘‘the objects of human
knowledge are either ideas actually imprinted on the senses, or else such as are
perceived by attending to the passions and operations of the mind’’ (Principles of
Human Knowledge, Part I, I); Hume that ‘‘[a]ll the perceptions of the human mind
resolve themselves into impressions and ideas’’ (Treatise of Human Nature, I, i, I).
7 Meditation III, section. 19, tr. John Veitch (Descartes 1959).
8 See for example ‘‘Huygens and Leibniz on Universal Attraction’’, pp. 115–38
in Koyré 1965.
9 For the former cf. ch. 2 of Vargish and Mook 1999; for the latter, Cushing
1994.
10
That logically necessary connection may not be finitary, may in fact be
inaccessible to any finite mind—as Leibniz made explicit (therefore not within
even the potential reach of the physical sciences)—but be graspable only by
the divine mind. There are many ambiguities in these developments. Sometimes
Descartes and Leibniz do sound as if there can be only one world, of logical
necessity. The popular version would go like this: from the concept of God it
390 :
follows that he would not create a world at all, if among all the conceivable ones
there was not a best one (Leibniz’s Theodicy) or one uniquely transparent to the
human mind (Descartes’s posthumous The World). But at other points the claim is
that although the regularities derive with logical necessity from the laws of nature,
those laws characterize a selection from the realm of conceivable possible worlds
which has no further rationale, at least within the context of even these ‘‘theories
of everything’’. Notice that we have here, in effect, the first ‘‘supervenience
without reduction’’ claim, for reduction would require finitary reasoning but the
demonstrative link is claimed to be non-finitary. I will return to questions of
supervenience below.
11 ‘‘Some seekers after the theory of Everything would seem to be hoping
tendentious and controversial. I do not want to enter into that debate just now.
The controversies illustrate, in any case, the denouement of the historical pattern
that I discern. The new and revolutionary success in science came with a ostensible
rejection of a prevalent completeness criterion for science, but in the aftermath
. 391
many strove (and strive!) valiantly to reinstate the criterion and to show that its
rejection was not logically required.
18
There is some disparity between what Bohr and his colleagues, students, and
followers originally professed and what almost the entire physics community came
to subscribe to under the name of ‘‘Copenhagen interpretation’’; see Howard 2004.
We could say the same for any revolutionary stance, whether social, religious,
or intellectual. What I will insist on is that with all its ambiguity intact, the
Copenhagen view was intellectually and scientifically revolutionary.
19
See further e.g. p. 80. After maintaining this view strongly throughout most
of the book, Leplin includes a section in which he raises the possibility that recent
and current theories in physics will not be in accord.
20 There is now a quite extensive literature on how this point about wavelength
of reflected light is a far cry from the nuanced and delicate account of human
visual experience. The critique I am in the course of offering here is independent
of that; it targets not the degree of success in such explanations of specific
appearances, but the conception of methodology that involves those putative
criteria of completeness.
21 Attempts to provide such accounts which could perhaps provide such
explanations even today include work by Stephen L. Adler and his colleagues on
Generalized Quantum Dynamics, as well as earlier work by e.g. Nelson 1967.
Such accounts, if successful, can satisfy the Appearance from Reality Criterion no
less than deterministic theories.
22
This is too simple an idea of course, but is not in essence too far from the
work by Adler and Nelson mentioned above. Besides these I am thinking here of
the Ghirardi, Rimini, and Weber (GRW) version of quantum mechanics, which
introduces a ‘Lucretian swerve’ into Schrödinger’s deterministic equation.
23 Wigner’s 1961 appeal to consciousness in quantum mechanics, to ‘explain’
collapse of the wave-packet falls under this heading, as do the idealist and
(quasi-)instrumentalist accounts which Grünbaum depicted as prevalent forms of
easy anti-realism among scientists (Grünbaum 1957: 717–19).
24
I have only slowly come to see the importance of marking such a distinction.
In The Scientific Image I did not make this distinction either carefully or clearly.
The chapter on saving the phenomena introduces ‘‘appearances’’ to denote what
Newton called ‘‘apparent motions’’, identifying them as ‘‘relational structures
defined by measuring relative distances, time intervals, and angles of separation’’(p.
45). I would now refer to those relational structures as data models. Data models
are the summarizing refinement of the contents of a battery of measurements,
typically, so this is not far from my present usage. But in the passages that follow
there, the reference seems from time to time to be just to observable entities, i.e.
392 :
phenomena rather than appearances in my current stricter usage. Thus Paul Teller
rightly writes ‘‘First is the idea of. . .phenomena (which means, the observable
process and structures). . .’’ (SI 3). ‘‘I take van Fraassen to use ‘phenomena’
and ‘appearances’ interchangeably. (SI 45, 64) I will understand phenomena and
appearances as that which we can observe, or that of which we can become
perceptually aware, without the use of instruments’’ (Teller 2001: 125–6).
25 Kant, Critique of Pure Reason, first para. of Transcendental Dialectic, Introduction
section I. The quote is from Meiklejohn’s translation (Kant 1850: 209), which had
many new editions well into The twentieth century. The translation by Francis
Haywood (Kant 1848: 234) is practically the same. The German original is ‘‘Noch
Weniger dürlen Erscheinung and Schein für einerlei gehalten Werden’’.
26 There is lots more to be said about the terms of course. The term
modern science that follows here could be more informatively developed around
e.g. Newton’s system of the world—but I think we would lose the forest for the
trees.
28 Amazingly, this common sense realism is often lacking among the most
see it presaged in earlier ones. Some of what I will argue is certainly along the
line of how Michel Bitbol depicts Schrödinger’s responses to the measurement
problem: ‘‘But what was really needed was a full acceptance of the parallelism
between the time-development of the holistic wave-function (object + apparatus)
and the sequence of macroscopic events, rather than an new blend of the old idea
of a causal interaction which takes place between objects and apparatuses in order
to produce the events.’’ (Bitbol 1996: 123).
3
I emphasize this because arguably, the Appearance from Reality Criterion is
not violated in Bohmian mechanics, where position is the only genuine observable,
evolution of states is deterministic, and all measurements, of any sort, are recon-
structed as in the end just position measurements. Even if such a pattern should
overtake all of physics in the coming century (however unlikely that may be),
the methodological point would stand: the Criterion cannot be said to have been
394 :
philosophy of mind shenanigans; see for instance Glymour’s gleefully critical 1999
review of Kim 1998.
8
As a metaphysical postulate this supervenience claim presumably gives some
emotional comfort to the materialist/physicalist. While science is here admitted to
be incapable of showing this, the world is still conceived as how the physicalist
would like it to be.
9 Quoted by Mates 1986: 108–9. For this part, and for the following references,
and postulates. There are, finally, simple ideas which cannot be defined, and there are
also axioms and postulates, or, in brief, primitive principles, which cannot be proved and
need no proof. And these are identical propositions whose opposites contain an explicit
contradiction. (Loemker 1975: 1050).
the mathematics within which quantum theory is formulated does not allow of a
consistency proof.
13
Evolution of an isolated system obeys Schrödinger’s equation: there is a
group of unitary operators { Ud : d in R } such that the pure states ψ(t) evolve
under the action of these operators: ψ(t+d) = Ud ψ(t).
14 We must distinguish here the actual historical development of the quantum
theory from interpretative additions and extrapolations of recent years. The ‘‘bare’’
theory is empirically empty without the Born Rule, but there have been attempts
to deduce the Born Rule from the basic theory supplemented with assumptions
involved in certain interpretations of the theory, such as the ‘‘many worlds’’
interpretation, GRW, or Bohmian mechanics. These sorts of assumptions were
either entirely absent in the development of quantum theory or roundly rejected
by the main physicists involved, so this does not affect the claim that this historical
episode in physics involved a rejection of the Appearance from Reality Criterion.
15 We may note here that there are certainly set-ups that are not neatly
dissectible into object measured and measuring apparatus. Rom Harré 2003 aptly
coined the terms ‘‘Bohrian artifacts’’ and ‘‘apparatus/world complexes’’ to designate
the peculiarities of such set-ups: ‘‘Let us call the apparatus/world complexes that
scientists, engineers, gardeners, and cooks bring into being Bohrian artifacts.
Properly manipulated they bring into existence phenomena that do not exist as
such in the wild [. . . .] In the famous Bohr–Einstein debate around the EPR
paradox, it is possible to see the outlines of Bohr’s account of experimental physics.
396 :
While Einstein is insisting that for every distinct symbol in a theoretical discourse
there must be a corresponding state in the world . . . Bohr . . . is concerned with
the concrete apparatus and its relation to the world as part of the world. An
apparatus is not something transcendent to the world . . . . The apparatus and the
neighboring part of the world in which it is embedded constitute one thing.’’
(pp. 28, 29.) Despite this indissoluble entanglement, when the set-up is classified
as a measurement, then its outcome is classified as representing something in that
apparatus/world complex.
16
That is (another way to state) the Measurement Problem! This has seen many
offered ‘solutions’ and ‘dissolutions’ and much debate between their advocates, in
its now almost century long history. The literature on this subject is enormous.
For an older detailed treatment see my 1991; for a perspicuous recent discussion
that highlights the points that I will take up here, but in a general probabilistic
setting, see Wilce, forthcoming.
17 See the caution in a previous note about the reliance here on a standard
formulation of the subject matter. I meant to introduce all the basic notions needed
to understand the Measurement Problem, but at this point (and some others) I
included assertions that I am not justifying here. They are not egregious however;
the justification can be found at many places in the philosophical literature on
quantum mechanics, at almost any level of (relative) (non-) technicality.
18 The quick argument sketch is as follows: if the apparatus ends up in one of
the pure states |B,r> then it is not true that the Apparatus + Object ends up in a
superposition involving more than one such eigenstate of B–which contradicts the
conclusion about how the composite system has evolved. See for example Eugene
Wigner’s seminal 1963, specifically pp. 11–12.
19 See for instance Greenstein and Zajonc 1997.
20 Wigner, op. cit.; see specifically pp. 10–12, where Wigner discusses the
interpretations that almost all logically available niches are occupied and almost
every point will fail on some offered interpretation of quantum mechanics. But
there certainly are salient interpretations on which supervenience fails, and that
without locating the measurement outcomes in consciousness or other non-physical
realm. I am thinking here especially of the entire range of modal interpretations
of quantum theory. See my 2005a. What is not satisfactory, it seems to me, is to
appeal to decoherence, for that does not remove the problem in principle.
397
Dalton, John 117, 127, 130 Euclidean geometry 61, 66–7, 213–14,
data models 166, 167–8, 172, 251–2, 215–16, 234, 285–6, 309
391 n. 24 exemplification 17
and phenomena 252–9 Exner, Franz Serafin 279
David, Jacques-Louis 29 experimentation: roles of 111–13
de Beauvoir, Simone, see Beauvoir, explicitly non-committal
Simone de representations 38–9, 40, 50, 313
Dear, Peter 97
deducibility 296–7 Faraday, Michael 95, 101
Dennett, Daniel 38 Feyerabend, Paul 73, 150, 359 n. 23
denotation 16, 348 n. 27 Feynman, Richard 150
depiction 16 De Finetti, Bruno 361 n. 42
Desargues, Gérard 72 Fine, Arthur 369 n. 35
Descartes, René 22, 30, 199–200, 269, Fourier, J. B. J. 53
271, 278, 280 frames of reference 66–70
analytic geometry 41, 46, 66–7 Frege, Gottlob 383 n. 6
frames of reference 66–70 French, Steven 345 n. 1, 349 n. 1, 368 n.
logical necessity 389 n. 10 25, 385 n. 4
primary/secondary qualities 273–5 Fresnel, Augustin 98
determinism 169, 292 Friedman, Michael 365 n. 3, 384 n. 1 and 4
and appearances/reality 278–80 Frigg, Roman 309
in mechanics 29–30 Fuchs, Christopher 378 n. 32
dilation 161–2 function 79, 181
dimensional analysis 53–5, 355 n. 40–3 of experimentation 111
distortion 12–15, 36–7, 38, 40, 183 indexicality and 182
Doisneau, Robert 20–21 instrumentation and 94–100, 157
dreams 24 mimetic 97
Dretske, Fred 15 use and 21–2, 23, 25, 30
Duhem, Pierre 203, 206–7 fuzzy observables 184, 312, 313–14,
Dürer, Albrecht 8, 65–6, 142 321 n. 7
fuzzy values 154, 376 n. 13
Eco, Umberto 116
Eddington, Arthur 71, 136
Edgerton, S. Y. 63, 356 n. 2, 380 n. 14, Galileo Galilei 41, 67, 278, 280, 281
351 n. 14 buoyancy experiment 50–1
Einstein, Albert 118, 134–5, 183, 395 n. 15 primary/secondary qualities 34, 271–3
principle of relativity 69–71 scaling 50–1
Einstein–Podolski–Rosen (EPR) thermometry 117, 123, 125
experiment 170–1, 299 n. 15, 315–16 Galison, P. 94–5
Elga, Adam 352 n. 15 gas law 127–8, 129
Elgin, Catherine Z. 17, 346 n. 9, 347 n. 14 Gassendi, Pierre 278, 280
embedding 29–30, 87, 168–72, 240, 247, geometric optics 42–5
252, 316 geometry
empirical adequacy 3, 136, 199, 246, 249, analytic 41, 46, 66–7, 353 n. 29
258, 317 Euclidean 61, 66–7, 213–14, 215–16,
empiricism 3, 304–6 234, 285–6, 309
empiricist structuralism 237–9 hyperbolic 213–14, 215
epistemology 222 non-Euclidean 213, 215–16
Escher, M. C. 39 projective 66, 72–3, 74–5,
essential indexical 3, 83, 88 215–16, 286
ether theory 100–1 Georgalis, Nicholas 348 n. 21, 348 n. 23
402
Peschard, Isabelle 351 n. 13, 360 n. 36, 367 Putnam’s Paradox (model-theoretic
n. 17, 377 n.21, 377 n.31, 378 n. 35 argument) 229–35
phenomena 8–9 dissolution of 232–5
and abstract structures 245–6, 249–50
and appearances 283–8, 317 quadrants 61
and data models 252–9 quantum theory 144, 155, 164, 183, 197,
outside experience 247–50 198, 199, 277
and theories 250–2, 259–61 and Appearance from Reality
philosophy of mind: and Appearance from Criterion 291, 297–300, 308
Reality Criterion 292–5 and data models 172,
photographs 18, 20–2, 178–9 and determinism 279–80
physical correlates 118–19, 121, 136, 179, and empiricism 306–7, 308
298, 304, 305 and fuzzy observables 184, 313–14
Criterion for Physical Correlate of and measurement 147–8, 154, 166, 184,
Measurement 182–3, 302, 312, 314 291, 300–3, 319
physical conditions for measurement and sharp observables 184, 312–13
141–6 and surface models 169, 170, 315, 316
theory of measurement 147–56 and supervenience 304
picture plane 62–4, 65, 358 n. 21 and theoretical models 252
picture theory of science, see Bildtheorie Quine, W. V. O. 392 n. 28
pictures 35–9
picturing 34, 182, 313
Pino, Paolo 356 n. 1 rainbows 102–3, 110
Place, U. T. 292 Raphael Sanzio 64
Planck, Max 192–5, 196, 279 ratio measurement 159, 160
planetary motion 8, 271, 286–8 ratio scales 128, 160
planisphere 356 n. 4 re-scaling 160–1
Plato 12, 24, 101–2, 231 realism 198–204, 229, 241–4
Pliny the Elder 11, 346 n. 1 real property 220–1
Poincaré, Henri 204, 279, 367 n. 12, scientific 198–9
380 n. 19 structural 198
and coordination 118, 208 reality 270–6
and measurement 125, 130–6, 138, Appearance from Reality
176, 183 Criterion 281–3, 291–300, 308
Poisson, Siméon 98 completeness criteria 276–83
Power, Henry 99 determinism 278–80
pragmatic contradictions 259 necessity and 277–8
pragmatic tautologies 259 primary/secondary qualities 271–6
pragmatics 3, 17, 21–2, 25, 82, 189, 190, rectilinear propagation 42
232–3, 259–60 reducibility 292–5
predication 16 reflection 42, 43–5
prediction 283 in water 101–2, 103, 105
primary/secondary qualities 34, 271–6 refraction 43, 44–5
Principle of Approximation 52–3 Reichenbach, Hans 30, 52–3,
Principle of Similitude 51–2 240–1, 279
probability 305, 317–19 common cause principle 280
projective geometry 215–16, 286 and coordination 118–21, 123, 136–7,
cross ratios 66, 72–3, 74–5 387 n. 21
Psillos, Stathis 253–4 relationality 26, 225–9, 231–2,
Ptolemy 43, 356 n. 4 242–3
Putnam, Hilary 229–35, 386 n. 7 relativity, principle of 69–71
406
Weyl, Hermann 71, 136, 175, 228–9 Worrall, John 239, 362 n. 10, 378 n. 34,
and isomorphism 208–10, 211 381 n. 24, 385 n. 2
Whitehead, A. N. 243
Wigner, Eugene 303 Young, Thomas 251
Williams, Bernard 358 n. 13
Wilson, Catherine 94–5, 99
Wilson, Margaret 389 n. 1 Zajonc, Arthur 359 n. 23, 396 n. 19
Wittgenstein, Ludwig 38, 164 zero-point perspective 69
Woodward, James 376 n. 14 Zeuxis 11
world-picture 141, 192–5, 237, 274, 275 Zola, Émile 350 n. 5