Model-Theory A4

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 14

pdf version of the entry

Model Theory
http://plato.stanford.edu/archives/fall2013/entries/model-theory/ Model Theory
from the Fall 2013 Edition of the First published Sat Nov 10, 2001; substantive revision Wed Jul 17, 2013

Stanford Encyclopedia Model theory began with the study of formal languages and their
interpretations, and of the kinds of classification that a particular formal
of Philosophy language can make. Mainstream model theory is now a sophisticated
branch of mathematics (see the entry on first-order model theory). But in a
broader sense, model theory is the study of the interpretation of any
language, formal or natural, by means of set-theoretic structures, with
Alfred Tarski's truth definition as a paradigm. In this broader sense, model
theory meets philosophy at several points, for example in the theory of
Edward N. Zalta Uri Nodelman Colin Allen John Perry
logical consequence and in the semantics of natural languages.
Principal Editor Senior Editor Associate Editor Faculty Sponsor
Editorial Board
1. Basic notions of model theory
http://plato.stanford.edu/board.html
2. Model-theoretic definition
Library of Congress Catalog Data 3. Model-theoretic consequence
ISSN: 1095-5054
4. Expressive strength
Notice: This PDF version was distributed by request to mem- 5. Models and modelling
bers of the Friends of the SEP Society and by courtesy to SEP Bibliography
content contributors. It is solely for their fair use. Unauthorized Academic Tools
distribution is prohibited. To learn how to join the Friends of the Other Internet Resources
SEP Society and obtain authorized PDF versions of SEP entries, Related Entries
please visit https://leibniz.stanford.edu/friends/ .

Stanford Encyclopedia of Philosophy


Copyright c 2011 by the publisher
1. Basic notions of model theory
The Metaphysics Research Lab
Center for the Study of Language and Information Sometimes we write or speak a sentence S that expresses nothing either
Stanford University, Stanford, CA 94305 true or false, because some crucial information is missing about what the
Model Theory words mean. If we go on to add this information, so that S comes to
Copyright c 2013 by the author
Wilfrid Hodges
express a true or false statement, we are said to interpret S, and the added
information is called an interpretation of S. If the interpretation I happens
All rights reserved.
Copyright policy: https://leibniz.stanford.edu/friends/info/copyright/

1
Model Theory Wilfrid Hodges

to make S state something true, we say that I is a model of S, or that I the notion of model and write ‘I ⊨ tS’ to mean that I is a model of S at
satisfies S, in symbols ‘I ⊨ S’. Another way of saying that I is a model of time t. The same applies to places, or to anything else that might be picked
S is to say that S is true in I, and so we have the notion of model-theoretic up by other implicit indexical features in the sentence. For example if you
truth, which is truth in a particular interpretation. But one should believe in possible worlds, you can index ⊨ by the possible world where
remember that the statement ‘S is true in I’ is just a paraphrase of ‘S, when the sentence is to be evaluated. Apart from using set theory, model theory
interpreted as in I, is true’; so model-theoretic truth is parasitic on plain is completely agnostic about what kinds of thing exist.
ordinary truth, and we can always paraphrase it away.
Note that the objects and classes in a structure carry labels that steer them
For example I might say to the right expressions in the sentence. These labels are an essential part
of the structure.
He is killing all of them,
If the same class is used to interpret all quantifiers, the class is called the
and offer the interpretation that ‘he’ is Alfonso Arblaster of 35 The domain or universe of the structure. But sometimes there are quantifiers
Crescent, Beetleford, and that ‘them’ are the pigeons in his loft. This ranging over different classes. For example if I say
interpretation explains (a) what objects some expressions refer to, and (b)
what classes some quantifiers range over. (In this example there is one One of those thingummy diseases is killing all the birds.
quantifier: ‘all of them’). Interpretations that consist of items (a) and (b)
appear very often in model theory, and they are known as structures. you will look for an interpretation that assigns a class of diseases to ‘those
Particular kinds of model theory use particular kinds of structure; for thingummy diseases’ and a class of birds to ‘the birds’. Interpretations that
example mathematical model theory tends to use so-called first-order give two or more classes for different quantifiers to range over are said to
structures, model theory of modal logics uses Kripke structures, and so be many-sorted, and the classes are sometimes called the sorts.
on.
The ideas above can still be useful if we start with a sentence S that does
The structure I in the previous paragraph involves one fixed object and say something either true or false without needing further interpretation.
one fixed class. Since we described the structure today, the class is the (Model theorists say that such a sentence is fully interpreted.) For example
class of pigeons in Alfonso's loft today, not those that will come tomorrow we can consider misinterpretations I of a fully interpreted sentence S. A
to replace them. If Alfonso Arblaster kills all the pigeons in his loft today, misinterpretation of S that makes it true is known as a nonstandard or
then I satisfies the quoted sentence today but won't satisfy it tomorrow, unintended model of S. The branch of mathematics called nonstandard
because Alfonso can't kill the same pigeons twice over. Depending on analysis is based on nonstandard models of mathematical statements about
what you want to use model theory for, you may be happy to evaluate the real or complex number systems; see Section 4 below.
sentences today (the default time), or you may want to record how they are
One also talks of model-theoretic semantics of natural languages, which is
satisfied at one time and not at another. In the latter case you can relativise
a way of describing the meanings of natural language sentences, not a way

2 Stanford Encyclopedia of Philosophy Fall 2013 Edition 3


Model Theory Wilfrid Hodges

of giving them meanings. The connection between this semantics and ∀x∀y∀z (x + (y + z) = (x + y) + z).
model theory is a little indirect. It lies in Tarski's truth definition of 1933. ∀x (x + 0 = x).
See the entry on Tarski's truth definitions for more details. ∀x (x + (−x) = 0).
∀x∀y (x + y = y + x).
2. Model-theoretic definition
Here the labels are the addition symbol ‘+’, the minus symbol ‘−’ and the
A sentence S divides all its possible interpretations into two classes, those constant symbol ‘0’. An interpretation also needs to specify a domain for
that are models of it and those that are not. In this way it defines a class, the quantifiers. With one proviso, the models of this set of sentences are
namely the class of all its models, written Mod(S). To take a legal precisely the structures that mathematicians know as abelian groups. The
example, the sentence proviso is that in an abelian group A, the domain should contain the
interpretation of the symbol 0, and it should be closed under the
The first person has transferred the property to the second person, interpretations of the symbols + and −. In mathematical model theory one
who thereby holds the property for the benefit of the third person. builds this condition (or the corresponding conditions for other function
and constant symbols) into the definition of a structure.
defines a class of structures which take the form of labelled 4-tuples, as for
example (writing the label on the left): Each mathematical structure is tied to a particular first-order language. A
structure contains interpretations of certain predicate, function and
the first person = Alfonso Arblaster;
constant symbols; each predicate or function symbol has a fixed arity. The
the property = the derelict land behind Alfonso's house;
collection K of these symbols is called the signature of the structure.
the second person = John Doe;
Symbols in the signature are often called nonlogical constants, and an
the third person = Richard Roe.
older name for them is primitives. The first-order language of signature K
This is a typical model-theoretic definition, defining a class of structures is the first-order language built up using the symbols in K, together with
(in this case, the class known to the lawyers as trusts). the equality sign =, to build up its atomic formulas. (See the entry on
classical logic.) If K is a signature, S is a sentence of the language of
We can extend the idea of model-theoretic definition from a single signature K and A is a structure whose signature is K, then because the
sentence S to a set T of sentences; Mod(T) is the class of all interpretations symbols match up, we know that A makes S either true or false. So one
that are simultaneously models of all the sentences in T. When a set T of defines the class of abelian groups to be the class of all those structures of
sentences is used to define a class in this way, mathematicians say that T signature +, −, 0 which are models of the sentences above. Apart from the
is a theory or a set of axioms, and that T axiomatises the class Mod(T). fact that it uses a formal first-order language, this is exactly the algebraists'
usual definition of the class of abelian groups; model theory formalises a
Take for example the following set of first-order sentences: kind of definition that is extremely common in mathematics.

4 Stanford Encyclopedia of Philosophy Fall 2013 Edition 5


Model Theory Wilfrid Hodges

Now the defining axioms for abelian groups have three kinds of symbol may recognise the relation in the field of real numbers defined by the
(apart from punctuation). First there is the logical symbol = with a fixed formula
meaning. Second there are the nonlogical constants, which get their
interpretation by being applied to a particular structure; one should group v12 + v22 = 1.
the quantifier symbols with them, because the structure also determines
It's the circle of radius 1 around the origin in the real plane. Algebraic
the domain over which the quantifiers range. And third there are the
geometry is full of definitions of this kind.
variables x, y etc. This three-level pattern of symbols allows us to define
classes in a second way. Instead of looking for the interpretations of the During the 1940s it occurred to several people (chiefly Anatolii Mal'tsev
nonlogical constants that will make a sentence true, we fix the in Russia, Alfred Tarski in the USA and Abraham Robinson in Britain)
interpretations of the nonlogical constants by choosing a particular that the metatheorems of classical logic could be used to prove
structure A, and we look for assignments of elements of A to variables mathematical theorems about classes defined in the two ways we have just
which will make a given formula true in A. described. In 1950 both Robinson and Tarski were invited to address the
International Congress of Mathematicians at Cambridge Mass. on this new
For example let Z be the additive group of integers. Its elements are the
discipline (which as yet had no name — Tarski proposed the name ‘theory
integers (positive, negative and 0), and the symbols +, −, 0 have their
of models’ in 1954). The conclusion of Robinson's address to that
usual meanings. Consider the formula
Congress is worth quoting:
v1 + v1 = v2.
[The] concrete examples produced in the present paper will have
If we assign the number −3 to v1 and the number −6 to v2, the formula shown that contemporary symbolic logic can produce useful tools
works out as true in Z. We express this by saying that the pair (−3,−6) — though by no means omnipotent ones — for the development of
satisfies this formula in Z. Likewise (15,30) and (0,0) satisfy it, but (2,−4) actual mathematics, more particularly for the development of
and (3,3) don't. Thus the formula defines a binary relation on the integers, algebra and, it would appear, of algebraic geometry. This is the
namely the set of pairs of integers that satisfy it. A relation defined in this realisation of an ambition which was expressed by Leibniz in a
way in a structure A is called a first-order definable relation in A. A useful letter to Huyghens as long ago as 1679.
generalisation is to allow the defining formula to use added names for
In fact Mal'tsev had already made quite deep applications of model theory
some specific elements of A; these elements are called parameters and the
in group theory several years earlier, but under the political conditions of
relation is then definable with parameters.
the time his work in Russia was not yet known in the West. By the end of
This second type of definition, defining relations inside a structure rather the twentieth century, Robinson's hopes had been amply fulfilled; see the
than classes of structure, also formalises a common mathematical practice. entry on first-order model theory.
But this time the practice belongs to geometry rather than to algebra. You
There are at least two other kinds of definition in model theory besides

6 Stanford Encyclopedia of Philosophy Fall 2013 Edition 7


Model Theory Wilfrid Hodges

these two above. The third is known as interpretation (a special case of the sense to the statement that the theory Tʹ′ is interpreted in the theory T.
interpretations that we began with). Here we start with a structure A, and Philosophers of science have sometimes experimented with this notion of
we build another structure B whose signature need not be related to that of interpretation as a way of making precise what it means for one theory to
A, by defining the domain X of B and all the labelled relations and be reducible to another. But realistic examples of reductions between
functions of B to be the relations definable in A by certain formulas with scientific theories seem generally to be much subtler than this simple-
parameters. A further refinement is to find a definable equivalence relation minded model-theoretic idea will allow. See the entry on intertheory
on X and take the domain of B to be not X itself but the set of equivalence relations in physics.
classes of this relation. The structure B built in this way is said to be
interpreted in the structure A. The fourth kind of definability is a pair of notions, implicit definability
and explicit definability of a particular relation in a theory. See section 3.3
A simple example, again from standard mathematics, is the interpretation of the entry on first-order model theory.
of the group Z of integers in the structure N consisting of the natural
numbers 0, 1, 2 etc. with labels for 0, 1 and +. To construct the domain of Unfortunately there used to be a very confused theory about model-
Z we first take the set X of all ordered pairs of natural numbers (clearly a theoretic axioms, that also went under the name of implicit definition. By
definable relation in N), and on this set X we define the equivalence the end of the nineteenth century, mathematical geometry had generally
relation ∼ by ceased to be a study of space, and it had become the study of classes of
structures which satisfy certain ‘geometric’ axioms. Geometric terms like
(a,b) ∼ (c,d) if and only if a + d = b + c ‘point’, ‘line’ and ‘between’ survived, but only as the primitive symbols in
axioms; they no longer had any meaning associated with them. So the old
(again definable). The domain of Z consists of the equivalence classes of question, whether Euclid's parallel postulate (as a statement about space)
this relation. We define addition on Z by was deducible from Euclid's other assumptions about space, was no longer
interesting to geometers. Instead, geometers showed that if one wrote
(a,b) + (c,d) = (e,f) if and only if a + c + f = b + d + e.
down an up-to-date version of Euclid's other assumptions, in the form of a
The equivalence class of (a,b) becomes the integer a − b. theory T, then it was possible to find models of T which fail to satisfy the
parallel postulate. (See the entry on geometry in the 19th century for the
When a structure B is interpreted in a structure A, every first-order contributions of Lobachevski and Klein to this achievement.) In 1899
statement about B can be translated back into a first-order statement about David Hilbert published a book in which he constructed such models,
A, and in this way we can read off the complete theory of B from that of A. using exactly the method of interpretation that we have just described.
In fact if we carry out this construction not just for a single structure A but
for a family of models of a theory T, always using the same defining Problems arose because of the way that Hilbert and others described what
formulas, then the resulting structures will all be models of a theory Tʹ′ that they were doing. The history is complicated, but roughly the following
can be read off from T and the defining formulas. This gives a precise happened. Around the middle of the nineteenth century people noticed, for

8 Stanford Encyclopedia of Philosophy Fall 2013 Edition 9


Model Theory Wilfrid Hodges

example, that in an abelian group the minus function is definable in terms mathematically adequate description’ of the relations ‘lie’, ‘between’ and
of 0 and + (namely: −a is the element b such that a + b = 0). Since this ‘congruent’. Fortunately Hilbert's mathematics speaks for itself, and one
description of minus is in fact one of the axioms defining abelian groups, can simply bypass these philosophical faux pas. The model-theoretic
we can say (using a term taken from J. D. Gergonne, who should not be account that we now take as a correct description of this line of work
held responsible for the later use made of it) that the axioms for abelian seems to have surfaced first in the group around Giuseppe Peano in the
groups implicitly define minus. In the jargon of the time, one said not that 1890s, and it reached the English-speaking world through Bertrand
the axioms define the function minus, but that they define the concept Russell's Principles of Mathematics in 1903.
minus. Now suppose we switch around and try to define plus in terms of
minus and 0. This way round it can't be done, since one can have two 3. Model-theoretic consequence
abelian groups with the same 0 and minus but different plus functions.
Rather than say this, the nineteenth century mathematicians concluded that Suppose L is a language of signature K, T is a set of sentences of L and φ
the axioms only partially define plus in terms of minus and 0. Having is a sentence of L. Then the relation
swallowed that much, they went on to say that the axioms together form
Mod(T) ⊆ Mod(φ)
an implicit definition of the concepts plus, minus and 0 together, and that
this implicit definition is only partial but it says about these concepts expresses that every structure of signature K which is a model of T is also
precisely as much as we need to know. a model of φ. This is known as the model-theoretic consequence relation,
and it is written for short as
One wonders how it could happen that for fifty years nobody challenged
this nonsense. In fact some people did challenge it, notably the geometer T⊨φ
Moritz Pasch who in section 12 of his Vorlesungen über Neuere
Geometrie (1882) insisted that geometric axioms tell us nothing whatever The double use of ⊨ is a misfortune. But in the particular case where L is
about the meanings of ‘point’, ‘line’ etc. Instead, he said, the axioms give first-order, the completeness theorem (see the entry on classical logic) tells
us relations between the concepts. If one thinks of a structure as a kind of us that ‘T ⊨ φ’ holds if and only if there is a proof of φ from T, a relation
ordered n-tuple of sets etc., then a class Mod(T) becomes an n-ary commonly written
relation, and Pasch's account agrees with ours. But he was unable to spell
out the details, and there is some evidence that his contemporaries (and T⊢φ
some more recent commentators) thought he was saying that the axioms
Since ⊨ and ⊢ express exactly the same relation in this case, model
may not determine the meanings of ‘point’ and ‘line’, but they do
theorists often avoid the double use of ⊨ by using ⊢ for model-theoretic
determine those of relational terms such as ‘between’ and ‘incident with’!
consequence. But since what follows is not confined to first-order
Frege's demolition of the implicit definition doctrine was masterly, but it
languages, safety suggests we stick with ⊨ here.
came too late to save Hilbert from saying, at the beginning of his
Grundlagen der Geometrie, that his axioms give ‘the exact and

10 Stanford Encyclopedia of Philosophy Fall 2013 Edition 11


Model Theory Wilfrid Hodges

Before the middle of the nineteenth century, textbooks of logic commonly English arguments are valid by reducing them to model-theoretic
taught the student how to check the validity of an argument (say in consequences. Since the class of model-theoretic consequences, at least in
English) by showing that it has one of a number of standard forms, or by first-order logic, has none of the vaguenesses of the old argument forms,
paraphrasing it into such a form. The standard forms were syntactic and/or textbooks of logic in this style have long since ceased to have a chapter on
semantic forms of argument in English. The process was hazardous: fallacies.
semantic forms are almost by definition not visible on the surface, and
there is no purely syntactic form that guarantees validity of an argument. But there is one warning that survives from the old textbooks: If you
For this reason most of the old textbooks had a long section on ‘fallacies’ formalise your argument in a way that is not a model-theoretic
— ways in which an invalid argument may seem to be valid. consequence, it doesn't mean the argument is not valid. It may only mean
that you failed to analyse the concepts in the argument deeply enough
In 1847 George Boole changed this arrangement. For example, to validate before you formalised. The old textbooks used to discuss this in a ragbag
the argument section called ‘topics’ (i.e. hints for finding arguments that you might have
missed). Here is an example from Peter of Spain's 13th century Summulae
All monarchs are human beings. No human beings are infallible. Logicales:
Therefore no infallible beings are monarchs.
’There is a father. Therefore there is a child.’ … Where does the
Boole would interpret the symbols P, Q, R as names of classes: validity of this argument come from? From the relation. The
maxim is: When one of a correlated pair is posited, then so is the
P is the class of all monarchs.
other.
Q is the class of all human beings.
R is the class of all infallible beings. Hilbert and Ackermann, possibly the textbook that did most to establish
the modern style, discuss in their section III.3 a very similar example: ‘If
Then he would point out that the original argument paraphrases into a set-
there is a son, then there is a father’. They point out that any attempt to
theoretic consequence:
justify this by using the symbolism
(P ⊆ Q), (Q ∩ R = 0) ⊨ (R ∩ P = 0)
∃xSx → ∃xFx
(This example is from Stanley Jevons, 1869. Boole's own account is
is doomed to failure. “A proof of this statement is possible only if we
idiosyncratic, but I believe Jevons' example represents Boole's intentions
analyze conceptually the meanings of the two predicates which occur”, as
accurately.) Today we would write ∀x(Px → Qx) rather than P ⊆ Q, but
they go on to illustrate. And of course the analysis finds precisely the
this is essentially the standard definition of P ⊆ Q, so the difference
relation that Peter of Spain referred to.
between us and Boole is slight.
On the other hand if your English argument translates into an invalid
Insofar as they follow Boole, modern textbooks of logic establish that

12 Stanford Encyclopedia of Philosophy Fall 2013 Edition 13


Model Theory Wilfrid Hodges

model-theoretic consequence, a counterexample to the consequence may the necessity of eliminating any defined signs which may possibly
well give clues about how you can describe a situation that would make occur in the sentences concerned, i.e. of replacing them by
the premises of your argument true and the conclusion false. But this is not primitive signs.
guaranteed.
This suggests to me that he wants his primitive signs to be by stipulation
One can raise a number of questions about whether the modern textbook unanalysable. But then by stipulation it will be purely accidental if his
procedure does really capture a sensible notion of logical consequence. notion of logical consequence captures everything one would normally
For example in Boole's case the set-theoretic consequences that he relies count as a logical consequence.
on are all easily provable by formal proofs in first-order logic, not even
using any set-theoretic axioms; and by the completeness theorem (see the Historians note a resemblance between Tarski's proposal and one in
entry on classical logic) the same is true for first-order logic. But for some section 147 of Bernard Bolzano's Wissenschaftslehre of 1837. Like Tarski,
other logics it is certainly not true. For instance the model-theoretic Bolzano defines the validity of a proposition in terms of the truth of a
consequence relation for some logics of time presupposes some facts family of related propositions. Unlike Tarski, Bolzano makes his proposal
about the physical structure of time. Also, as Boole himself pointed out, for propositions in the vernacular, not for sentences of a formal language
his translation from an English argument to its set-theoretic form requires with a precisely defined semantics.
us to believe that for every property used in the argument, there is a
On all of this section, see also the entry on logical consequence.
corresponding class of all the things that have the property. This comes
dangerously close to Frege's inconsistent comprehension axiom!
4. Expressive strength
In 1936 Alfred Tarski proposed a definition of logical consequence for
A sentence S defines its class Mod(S) of models. Given two languages L
arguments in a fully interpreted formal language. His proposal was that an
and Lʹ′, we can compare them by asking whether every class Mod(S), with
argument is valid if and only if: under any allowed reinterpretation of its
S a sentence of L, is also a class of the form Mod(Sʹ′) where Sʹ′ is a
nonlogical symbols, if the premises are true then so is the conclusion.
sentence of Lʹ′. If the answer is Yes, we say that L is reducible to Lʹ′, or
Tarski assumed that the class of allowed reinterpretations could be read
that Lʹ′ is at least as expressive as L.
off from the semantics of the language, as set out in his truth definition.
He left it undetermined what symbols count as nonlogical; in fact he For example if L is a first-order language with identity, whose signature
hoped that this freedom would allow one to define different kinds of consists of 1-ary predicate symbols, and Lʹ′ is the language whose
necessity, perhaps separating ‘logical’ from ‘analytic’. One thing that sentences consist of the four syllogistic forms (All A are B, Some A are B,
makes Tarski's proposal difficult to evaluate is that he completely ignores No A are B, Some A are not B) using the same predicate symbols, then Lʹ′
the question we discussed above, of analysing the concepts to reach all the is reducible to L, because the syllogistic forms are expressible in first-
logical connections between them. The only plausible explanation I can order logic. (There are some quarrels about which is the right way to
see for this lies in his parenthetical remark about express them; see the entry on the traditional square of opposition.) But

14 Stanford Encyclopedia of Philosophy Fall 2013 Edition 15


Model Theory Wilfrid Hodges

the first-order language L is certainly not reducible to the language Lʹ′ of These back-and-forth games are immensely flexible. For a start, they make
syllogisms, since in L we can write down a sentence saying that exactly just as much sense on finite structures as they do on infinite; many other
three elements satisfy Px, and there is no way of saying this using just the techniques of classical model theory assume that the structures are infinite.
syllogistic forms. Or moving the other way, if we form a third language Lʺ″ They can also be adapted smoothly to many non-first-order languages.
by adding to L the quantifier Qx with the meaning “There are uncountably
many elements x such that …”, then trivially L is reducible to Lʺ″, but the In 1969 Per Lindström used back-and-forth games to give some abstract
downward Loewenheim-Skolem theorem shows at once that Lʺ″ is not characterisations of first-order logic in terms of its expressive power. One
reducible to L. of his theorems says that if L is a language with a signature K, L is closed
under all the first-order syntactic operations, and L obeys the downward
These notions are useful for analysing the strength of database query Loewenheim-Skolem theorem for single sentences, and the compactness
languages. We can think of the possible states of a database as structures, theorem, then L is reducible to the first-order language of signature K.
and a simple Yes/No query becomes a sentence that elicits the answer Yes These theorems are very attractive; see Chapter XII of Ebbinghaus, Flum
if the database is a model of it and No otherwise. If one database query and Thomas for a good account. But they have never quite lived up to
language is not reducible to another, then the first can express some query their promise. It has been hard to find any similar characterisations of
that can't be expressed in the second. other logics. Even for first-order logic it is a little hard to see exactly what
the characterisations tell us. But very roughly speaking, they tell us that
So we need techniques for comparing the expressive strengths of first-order logic is the unique logic with two properties: (1) we can use it
languages. One of the most powerful techniques available consists of the to express arbitrarily complicated things about finite patterns, and (2) it is
back-and-forth games of Ehrenfeucht and Fraïssé between the two players hopeless for discriminating between one infinite cardinal and another.
Spoiler and Duplicator; see the entry on logic and games for details.
Imagine for example that we play the usual first-order back-and-forth These two properties (1) and (2) are just the properties of first-order logic
game G between two structures A and B. The theory of these games that allowed Abraham Robinson to build his nonstandard analysis. The
establishes that if some first-order sentence φ is true in exactly one of A background is that Leibniz, when he invented differential and integral
and B, then there is a number n, calculable from φ, with the property that calculus, used infinitesimals, i.e. numbers that are greater than 0 and
Spoiler has a strategy for G that will guarantee that he wins in at most n smaller than all of 1/2, 1/3, 1/4 etc. Unfortunately there are no such real
steps. So conversely, to show that first-order logic can't distinguish numbers. During the nineteenth century all definitions and proofs in the
between A and B, it suffices to show that for every finite n, Duplicator has Leibniz style were rewritten to talk of limits instead of infinitesimals. Now
a strategy that will guarantee she doesn't lose G in the first n steps. If we let R be the structure consisting of the field of real numbers together with
succeed in showing this, it follows that any language which does any structural features we care to give names to: certainly plus and times,
distinguish between A and B is not reducible to the first-order language of maybe the ordering, the set of integers, the functions sin and log, etc. Let
the structures A and B. L be the first-order language whose signature is that of R. Because of the
expressive strength of L, we can write down any number of theorems of

16 Stanford Encyclopedia of Philosophy Fall 2013 Edition 17


Model Theory Wilfrid Hodges

calculus as sentences of L. Because of the expressive weakness of L, there the substance. We see this with a cheese mould, and also with the metal
is no way that we can express in L that R has no infinitesimals. In fact letters (called ‘moduli’ in the early 17th century) that carry ink to paper in
Robinson used the compactness theorem to build a structure Rʹ′ that is a printing. So ‘model’ comes to mean an object in hand that expresses the
model of exactly the same sentences of L as R, but which has design of some other objects in the world: the artist's model carries the
infinitesimals. As Robinson showed, we can copy Leibniz's arguments form that the artist depicts, and Christopher Wren's ‘module’ of St Paul's
using the infinitesimals in Rʹ′, and so prove that various theorems of Cathedral serves to guide the builders.
calculus are true in Rʹ′. But these theorems are expressible in L, so they
must also be true in R. Already by the late 17th century the word ‘model’ could mean an object
that shows the form, not of real-world objects, but of mathematical
Since arguments using infinitesimals are usually easier to visualise than constructs. Leibniz boasted that he didn't need models in order to do
arguments using limits, nonstandard analysis is a helpful tool for mathematics. Other mathematicians were happy to use plaster or metal
mathematical analysts. Jacques Fleuriot in his Ph.D. thesis (2001) models of interesting surfaces. The models of model theory first appeared
automated the proof theory of nonstandard analysis and used it to as abstract versions of this kind of model, with theories in place of the
mechanise some of the proofs in Newton's Principia. defining equation of a surface. On the other hand one could stay with real-
world objects but show their form through a theory rather than a physical
5. Models and modelling copy in hand; ‘modelling’ is building such a theory.

To model a phenomenon is to construct a formal theory that describes and We have a confusing halfway situation when a scientist describes a
explains it. In a closely related sense, you model a system or structure that phenomenon in the world by an equation, for example a differential
you plan to build, by writing a description of it. These are very different equation with exponential functions as solutions. Is the model the theory
senses of ‘model’ from that in model theory: the ‘model’ of the consisting of the equation, or are these exponential functions themselves
phenomenon or the system is not a structure but a theory, often in a formal models of the phenomenon? Examples of this kind, where theory and
language. The Universal Modeling Language, UML for short, is a formal structures give essentially the same information, provide some support for
language designed for just this purpose. It's reported that the Australian Patrick Suppes' claim that “the meaning of the concept of model is the
Navy once hired a model theorist for a job ‘modelling hydrodynamic same in mathematics and the empirical sciences” (page 12 of his 1969
phenomena’. (Please don't enlighten them!) book cited below). Several philosophers of science have pursued the idea
of using an informal version of model-theoretic models for scientific
A little history will show how the word ‘model’ came to have these two modelling. Sometimes the models are described as non-linguistic — this
different uses. In late Latin a ‘modellus' was a measuring device, for might be hard to reconcile with our definition of models in section 1
example to measure water or milk. By the vagaries of language, the word above.
generated three different words in English: mould, module, model. Often a
device that measures out a quantity of a substance also imposes a form on Cognitive science is one area where the difference between models and

18 Stanford Encyclopedia of Philosophy Fall 2013 Edition 19


Model Theory Wilfrid Hodges

modelling tends to become blurred. A central question of cognitive Other Internet Resources below),
science is how we represent facts or possibilities in our minds. If one
formalises these mental representations, they become something like any algorithm can be modeled at its natural abstraction level by an
‘models of phenomena’. But it is a serious hypothesis that in fact our appropriate ASM. … ASMs use classical mathematical structures
mental representations have a good deal in common with simple set- to describe states of a computation; structures are well-understood,
theoretic structures, so that they are ‘models’ in the model-theoretic sense precise models.
too. In 1983 two influential works of cognitive science were published,
The book of Börger and Stärk cited below is an authoritative account of
both under the title Mental Models. The first, edited by Dedre Gentner and
ASMs and their uses.
Albert Stevens, was about people's ‘conceptualizations’ of the elementary
facts of physics; it belongs squarely in the world of ‘modelling of Today you can make your name and fortune by finding a good
phenomena’. The second, by Philip Johnson-Laird, is largely about representation system. There is no reason to expect that every such system
reasoning, and makes several appeals to ‘model-theoretic semantics’ in will fit neatly into the syntax/semantics framework of model theory, but it
our sense. Researchers in the Johnson-Laird tradition tend to refer to their will be surprising if model-theoretic ideas don't continue to make a major
approach as ‘model theory’, and to see it as allied in some sense to what contribution in this area.
we have called model theory.
Bibliography
Pictures and diagrams seem at first to hover in the middle ground between
theories and models. In practice model theorists often draw themselves Introductory texts
pictures of structures, and use the pictures to think about the structures. On
the other hand pictures don't generally carry the labelling that is an Doets, K., 1996, Basic Model Theory, Stanford: CSLI Publications.
essential feature of model-theoretic structures. There is a fast growing Hodges, W., 1997, A Shorter Model Theory, Cambridge: Cambridge
body of work on reasoning with diagrams, and the overwhelming University Press.
tendency of this work is to see pictures and diagrams as a form of Manzano, M., 1999, Model Theory, Oxford: Oxford University Press.
language rather than as a form of structure. For example Eric Hammer and Rothmaler, P., 2000, Introduction to Model Theory, Amsterdam: Gordon
Norman Danner (in the book edited by Allwein and Barwise, see the and Breach.
Bibliography) describe a ‘model theory of Venn diagrams’; the Venn
diagrams themselves are the syntax, and the model theory is a set- Model-theoretic definition
theoretical explanation of their meaning.
Frege, G., 1906, “Grundlagen der Geometrie”, Jahresbericht der
The model theorist Yuri Gurevich introduced abstract state machines deutschen Mathematikervereinigung, 15: 293–309, 377–403, 423–
(ASMs) as a way of using model-theoretic ideas for specification in 430.
computer science. According to the Abstract State Machine website (see Gergonne, J., 1818, “Essai sur la théorie de la définition”, Annales de

20 Stanford Encyclopedia of Philosophy Fall 2013 Edition 21


Model Theory Wilfrid Hodges

Mathématiques Pures et Appliquées, 9: 1–35. MA: Harvard University Press.


Hilbert, D., 1899, Grundlagen der Geometrie, Leipzig: Teubner. Frege, G., 1971, On the Foundations of Geometry, and Formal Theories of
Hodges, W., 2008, “Tarski's theory of definition”, in Patterson, D. New Arithmetic, E. Kluge (trans.), New Haven: Yale University Press.
Essays on Tarski and Philosophy, Oxford: Oxford University Press, Gómez-Torrente, M., 1996, “Tarski on logical consequence”, Notre Dame
pp. 94–132. Journal of Formal Logic, 37: 125–151.
Lascar, D., 1998, “Perspective historique sur les rapports entre la théorie Hodges, W. 2004, “The importance and neglect of conceptual analysis:
des modèles et l'algèbre”, Revue d'histoire des mathématiques, 4: Hilbert-Ackermann iii.3”, in V. Hendricks et al. (eds.), First-Order
237–260. Logic Revisited, Berlin: Logos, pp. 129–153.
Mancosu, P., Zach, R. and Badesa, C., 2009, “The development of Kreisel, G., 1969, “Informal rigour and completeness proofs”, in J.
mathematical logic from Russell to Tarski”, in L. Haaparanta (ed.), Hintikka (ed.), The Philosophy of Mathematics, London: Oxford
The Development of Modern Logic, Oxford: Oxford University Press, University Press, pp. 78–94.
pp. 318–470. Tarski, A., 1983, “On the concept of logical consequence”, translated in
Pasch, M., 1882, Vorlesungen über Neuere Geometrie, Berlin: Springer- A. Tarski, Logic, Semantics, Metamathematics, J. Corcoran (ed.),
Verlag. Indianapolis: Hackett, pp. 409–420.
Robinson, A., 1952, “On the application of symbolic logic to algebra”, Van Benthem, J., 1991 [1983], The Logic of Time: A Model-Theoretic
Proceedings of the International Congress of Mathematicians Investigation into the Varieties of Temporal Ontology and Temporal
(Cambridge, MA, 1950, Volume 1), Providence, RI: American Discourse, Dordrecht: Reidel, 1983; second edition, Springer, 1991.
Mathematical Society, pp. 686–694.
Suppes, P., 1957, “Theory of definition” in Introduction to Logic (Chapter Expressive strength
8), Princeton, NJ: Van Nostrand.
Cutland, N., 2009, Nonstandard Analysis and its Applications, Cambridge:
Tarski, A., 1954, “Contributions to the theory of models, I”, Indagationes
Cambridge University Press.
Mathematicae, 16: 572–581.
Ebbinghaus, H.-D., and Flum, J., 1999, Finite Model Theory, Berlin:
Model-theoretic consequence Springer-Verlag.
Ebbinghaus, H.-D., Flum, J. and Thomas, W., 1984, Mathematical Logic,
Blanchette, P., 1996, “Frege and Hilbert on consistency”, The Journal of New York: Springer-Verlag.
Philosophy, 93: 317–336. Fleuriot, J., 2001, A Combination of Geometry Theorem Proving and
Blanchette, P., 2012, Frege's Conception of Logic, New York: Oxford Nonstandard Analysis, with Application to Newton's Principia, New
University Press. York: Springer-Verlag.
Boole, G., 1847, The Mathematical Analysis of Logic, Cambridge: Immerman, N., 1999, Descriptive Complexity, New York: Springer-
Macmillan, Barclay and Macmillan. Verlag.
Etchemendy, J., 1990, The Concept of Logical Consequence, Cambridge, Libkin, L., 2004, Elements of Finite Model Theory, Berlin: Springer-

22 Stanford Encyclopedia of Philosophy Fall 2013 Edition 23


Model Theory Wilfrid Hodges

Verlag. in Logical Aspects of Computational Linguistics (Lecture Notes in


Loeb, P. and Wolff, M. (eds.), 2000, Nonstandard Analysis for the Computer Science: Volume 2099), P. De Groote et al. (eds.), Berlin:
Working Mathematician, Dordrecht: Kluwer. Springer-Verlag, pp. 17-43.
Robinson, A., 1967, “The metaphysics of the calculus”, in Problems in the Stenning, K., 2002, Seeing Reason, Oxford: Oxford University Press.
Philosophy of Mathematics, I. Lakatos (ed.), Amsterdam : North- Suppes, P., 1969, Studies in the Methodology and Foundations of Science,
Holland, pp. 28-40. Dordrecht: Reidel.

Models and modelling Academic Tools


Allwein, G. and Barwise, J. (eds.), 1996, Logical Reasoning with
How to cite this entry.
Diagrams, New York: Oxford University Press.
Preview the PDF version of this entry at the Friends of the SEP
Börger, E. and Stärk, R., 2003, Abstract State Machines: A Method for
Society.
High-Level System Design and Analysis, Berlin: Springer-Verlag.
Fowler, M., 2000, UML Distilled, Boston: Addison-Wesley. Look up this entry topic at the Indiana Philosophy Ontology
Garnham, A., 2001, Mental Models and the Interpretation of Anaphora, Project (InPhO).
Philadelphia: Taylor and Francis. Enhanced bibliography for this entry at PhilPapers, with links
Gentner, D. and Stevens, A. (eds.), 1983, Mental Models, Hillsdale, NJ: to its database.
Lawrence Erlbaum.
Johnson-Laird, P., 1983, Mental Models: Towards a cognitive science of Other Internet Resources
language, inference, and consciousness, Cambridge: Cambridge
University Press. mentalmodelsblog: Mental Models in Human Thinking and
Meijers, A. (ed.), 2009, Philosophy of Technology and Engineering Reasoning, by Ruth Byrne.
Sciences, Amsterdam: Elsevier; see chapters W. Hodges, “Functional Algorithmic Model Theory, by E. Graedel, D. Berwanger and F. Abu
modelling and mathematical models”; R. Müller, “The notion of a Zaid (Mathematische Grundlagen der Informatik, RWTH Aachen)
model, theories of models and history”; and N. Nersessian, “Model Abstract State Machines, by Jim Huggins
based reasoning in interdisciplinary engineering”.
Moktefi, A. and Shin, S.-J. (eds.), 2013, Visual Reasoning with Diagrams, Related Entries
Basel: Birkhäuser.
diagrams | geometry: in the 19th century | logic: classical | logical
Morgan, M. S. and Morrison, M. (eds.), 1999, Models as Mediators,
consequence | logical truth | models in science | physics: intertheory
Cambridge: Cambridge University Press.
relations in | physics: structuralism in | square of opposition | Tarski,
Pullum, G. K. and Scholz, B. C., 2001, “On the distinction between
Alfred | Tarski, Alfred: truth definitions
model-theoretic and generative-enumerative syntactic frameworks”,

24 Stanford Encyclopedia of Philosophy Fall 2013 Edition 25


Model Theory

Copyright © 2013 by the author


Wilfrid Hodges

26 Stanford Encyclopedia of Philosophy

You might also like