Professional Documents
Culture Documents
Markov Partitions and Symbolic Dynamics
Markov Partitions and Symbolic Dynamics
x and y
y, then
x
+ y
N
= /
Z
= s = (s
n
)
nZ
[s
n
/ (2.3.1)
1
Right multiplication on row vectors turns out to be a little more convenient than left multi-
plication on column vectors.
6 ROY L. ADLER
1
0
1
Edge labelled
Vertex labelled
0
Figure 1
of all bi-innite sequences of elements from a set of N symbols. One can think of
an element of this space as a bi-innite walk on the complete directed graph of
N vertices which are distinctly labelled. Sometimes it is more convenient to label
edges, in which case the picture is a single node with N oriented distinctly labelled
loops over which to walk. In Figure 1 we have depicted the graph for
2
by both
methods of labelling.
The shift transformation is dened by shifting each bi-innite sequence one
step to the left. This is expressed by
(s)
n
= s
n+1
.
We dene the distance d(s, t) between two distinct sequences s and t as 1/([n[ +1)
where n is the coordinate of smallest absolute value where they dier. Thus if
d(s, t) < 1/n for n > 0, then s
k
= t
k
for n < k < n. This metric makes the space
N
one of the important compact onesnamely, the Cantor discontinuum and the
shift a homeomorphism. The symbolic system (
N
, ) is called the full N-shift.
Restricting the shift transformation of a full shift
N
to a closed shift-invariant
subspace , we get a very general kind of dynamical system (, ) called a subshift.
Given a symbolic sequence s = (s
n
)
nZ
and integers m < n, we shall use the
notation s
[m,n]
to stand for the mn+1-tuple (s
m
, s
m+1
, . . . , s
n
). Given a symbolic
phase space , we call a k-tuple an allowable k-block if it equals s
[m,m+k1]
for some
s .
Returning to the realm of the more specic from our momentary excursion into
the less knowable, we dene shift of nite type, also called topological Markov shift,
as the subshift of a full shift restricted to the set
G
of bi-innite paths in a nite
directed graph G derived from a complete one by possibly removing some edges.
Usually we denote the space
G
by
A
where A is a matix of non-negative
integers a
ij
denoting the number of edges leading from the i-th node to the j-th.
The term Markov is derived from the resemblance to Markov chains for which
the a
ij
are probabilities instead of integers. One thing to note is that the ij entry
of A
n
is the number of paths of length n beginning at i-th node and ending at
the j-th. Often however A is an N N matrix of zeroes and ones specifying a
directed graph of N nodes (edges) according to the following: the i-th node (edge)
is connected to the j-th, i j, if and only if a
ij
= 1. Whether dealing with nodes
or edges, we call A a transition matrix and restate for zero-one matrices the above
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 7
0 1
Figure 2. Fibonacci shift
denition by
G
=
A
s = (. . . , s
n
, . . . )[a
sn,sn+1
= 1, s
n
/, n Z. (2.3.2)
Remark. Let (
G
, ) be a topological shift given by a node-labelled directed graph
G. Nodes from which there is no return are called transient, the rest recurrent.
A node is transient if and only if either it has no predecessor nodes or all its
predecessors are transient. This statement is not as circular as it seems: for the set
of predecessors of any set of transient nodes, if non-empty, is a strictly smaller set
of transient nodes. The only symbols which appear in bi-innite sequences of
G
are labels of recurrent nodes.
Figure 2 describes the Fibonacci or golden ratio shift, so-called because the
number of admissible n-blocks (paths of length n) are the Fibonacci numbers
namely, there are two 1-blocks, three 2-blocks, ve 3-blocks, . . . .
Here the space
A
is given by the matrix
A =
_
1 1
1 0
_
.
If we label the rst node by 0 and and the second by 1, then only sequences of
0s and 1s with 1s separated by 0s are admissible. While other shifts of nite
type can be specied by graphs with either nodes or edges labelled, there is no
edge-labelled graph for the Fibonacci shift.
Given a node-labelled graph G, we dene the edge graph G
(2)
by labelling the
edges. For labels we can use the allowable 2-blocks. In general we dene the higher
edge graphs G
(n)
as follows. The alphabet consists of all allowable blocks [a
1
, . . . , a
n
]
gotten from paths of length n on G. The transitions are dened by
[a
1
, . . . , a
n
] [b
1
, . . . , b
n
]
if and only if b
1
= a
2
, . . . , b
n1
= a
n
.
There is a one-side version of the full N-shift: namely,
+
N
= s = (s
0
, s
1
, . . . )[s
n
/, n = 0, 1, 2, . . . . (2.3.3)
On this space the shift transformation is similarly dened: namely, (s)
n
= s
n+1
but only for non-negative n. It acts by shifting sequences one step to the left and
dropping the rst symbol. The metric on this phase space is dened the same as
before but absolute value signs are not needed. We also have one-side versions of
shifts of nite type. In one-side symbolic systems, the shift transformation, like a
multiplication map, is continuous but not invertible.
2.4 Horseshoe map. We present here a brief informal description of Smales
horseshoe map. For more details consult [Sm], [S, page 23], [HK, page 273].
8 ROY L. ADLER
p f
f p
p
o
2
Figure 3. Hyperbolic xed point with homoclinic point
R
R
Figure 4. Horseshoe map
Let f be a dieomorphism of the plane with a hyperbolic xed point at the
origin. This means that there are two invariant curves through the origin such that
iterates under f of points on one and iterates under f
1
of points on the other
converge exponentially to the origin. The rst is called the stable manifold and
the second the unstable manifold. Let p be a transverse homoclinic pointi.e. a
point where the unstable manifold crosses the stable manifold. Then f
n
p will be a
sequence of homoclinic points converging to the origin. See Figure 3. Somewhere
near the origin there will be a rectangular neighborhood R such that R, where
= f
n
for some iterate of f will intersect R as in Figure 4.
In Smales work [Sm] more complicated dieomorphisms than the simple horse-
shoe map are considered. These involve many xed points where a stable manifold
of one might cross unstable manifolds of others.
Exercises.
2.1 Prove that the canonical map : G
(n)
G dened by [s
1
, . . . , s
n
] = s
1
gives topological conjugacy of (
G
(n) , ) and (
G
, ).
3. Symbolic representations
Shifts of nite type contain a great deal of complexity, yet are the best under-
stood dynamical systems. Such symbolic dynamical systems can be used to analyze
general discrete time ones. For example, a good symbolic representation will show
how to identify periodic orbits, almost periodic ones, dense ones, etc.
Representing a general dynamical system by a symbolic one involves a funda-
mental complication. We have two desires: we would like a continuous one-to-one
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 9
1
2
R R
N
1
x
x
R
x
. . .
2
x
Figure 5. Partitioning a dynamical system
correspondence between orbits
n
x of the rst and orbits
n
s of the second, and
we want the shift system to be one of nite type. Unfortunately, these two desires
are usually in conict: constraints placed by topology must be observed. On one
hand a continuous one-to-one correspondence makes X homeomorphic to a shift
system. On the other hand a shift system is totally disconnected while X is often a
smooth manifold. Thus for the most part we must abandon the quest of nding a
topological conjugacy between a given dynamical system and a shift of nite type.
However, we shall see that by sacricing one-to-one correspondence we can still
salvage a satisfactory symbolization of orbits. We are reminded of arithmetic in
which we represent real numbers symbolically by decimal expansions, unique for
the most part, but must allow two expansions for certain rationals. To do otherwise
would just make the instructions for arithmetical operations unnecessarily compli-
cated. The most natural way to associate a symbolic sequence with a point in a
dynamical system is to track its history as illustrated in Figure 5 through a family
of sets indexed by an alphabet of symbols.
This is easy, but what is more dicult is to get a family for which each history
represents just one point. It is no achievement to specify a family for which each
history might represent more than one point. However, we must live with the
inevitability that each point might have more than one associated history. Having
found a family of sets, the orbits through which determine a unique point, we want
still more: namely, we would like the totality of sequences which arise to comprise
a subshift of nite type. In order to do this, we must nd a family with special
properties. We shall look at some examples for guidance as to what these properties
ought to be, and families of sets possessing them will be called Markov partitions.
The rst example is the trivial case of a dynamical system which is identical
with its symbolic representation: namely, a topological Markov shift.
3.1 Cylinder set partition for symbolic sequences. Let (
A
, ) be a topo-
logical Markov shift, vertex labelled by an alphabet /. We form the partition
( = C
a
: a / of elementary cylinder sets determined by xing the 0-th co-
ordinate: i.e., C
a
= s
A
: s
0
= a. Tracking the history of an orbit of an
element s
A
through this partition means getting a sequence (s
n
)
nZ
such that
n
s C
sn
. But this sequence is s itself. Let us point out the salient features of
this partition.
10 ROY L. ADLER
First
2
,
n=0
n
k
C
sk
= s.
Second, if s C
a
1
C
b
,= , then s C
a
and s C
b
: i.e., s
0
= a, s
1
= b.
In terms of the graph, this means there is an edge from a to b. An absolutely
obvious property of directed graphs is the following. If there is an edge from a to
b, and an edge from b to c, then there is a path from a to c via b. This property
can be reformulated as follows. If C
a
1
C
b
,= and C
b
1
C
c
,= , then
C
a
1
C
b
2
C
c
,= . This property has a length n version for arbitrary n:
namely, n abutting edges form a path of length n+1, and this can be reformulated
to read that n pair-wise non-empty intersections lead to an (n+1)fold non-empty
intersection. We shall call such a countable set of conditions for n = 2, 3, ... the
Markov property: it turns out to be a key requirement in getting the desired
symbolic representation from a partition.
Finally, there is another important feature of the partition ( = C
a
: a / :
namely, the sets of this partition have a product structure respected by the shift
which is described as follows. Let s C
a
in other words, s
0
= a and dene two
sets
v
a
(s)
k
C
sk
which we shall call the vertical through s and
h
a
(s)
0
k
C
sk
which we shall call the horizontal. A sequence s C
a
is the sole member of the
intersection of its vertical and horizontal i.e. s = v
a
(s) h
a
(s). Furthermore,
for s, t C
a
there is a unique sequence in the intersection of the horizontal through
s and the vertical through t : namely, (. . . s
2
, s
1
, s
0
= t
0
, t
1
, t
2
, . . . ) = v
a
(s)
h
a
(t). We dene a map of C
a
C
a
onto C
a
by (s, t) h
a
(s) v
a
(t), or rather the
sole element of this intersection. It is easily veried that this map is continuous and
its restriction to h
a
(s) v
a
(t) for any s, t C
a
is a homeomorphism of h
a
(s) v
a
(t)
onto C
a
. Finally respects this product structure in the sense that if s C
a
1
C
b
, then:
v
a
(s) v
b
(s),
h
a
(s) h
b
(s).
This last property is closely connected with the Markov one.
The next example is based on the binary expansions of real numbers and illus-
trates what one should expect of a good symbolic representation of a dynamical
system.
2
From now on we shall commit a convenient semantic error of confusing a set consisting of a
single point with the point itself and so dispense with the surrounding braces.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 11
1
R R
0
Figure 6. Graph of multiplication by 2 (mod 1)
3.2 Symbolic representation for multiplication by two. Let (X, f) be the
multiplication system where and f : x 2x. See Figure 6 for the graph of f.
Recall that the domain
+
2
of the one-sided full 2-shift dynamical system (
+
[2]
,
+
)
is the set of one-sided innite walks on the edge-labelled graph in Figure 1. We can
equate a sequence s = (s
n
)
nZ
with the binary expansion .s
1
s
2
s
3
. . . . Consider the
map from
+
[2]
to X dened by (s
1
, s
2
, . . . ) = s
1
/2 +s
2
/4 +. . . . It is readily
veried that
(i) f =
+
,
(ii) is continuous,
(iii) is onto,
(iv) there is a bound on the number of pre-images (in this case two),
and
(v) there is a unique pre-image of most numbers (here those with binary ex-
pansions not ending in an innite run of all zeros or all ones).
The map is not a homeomorphism, but we do have a satisfactory representation of
the dynamical system by a one-sided 2-shift in the sense that: orbits are preserved,
every point has at least one symbolic representative, there is a nite upper limit to
the number of representatives of any point, and every symbolic sequence represents
some point. This is a example of what is known as a factor map, which we shall
formalize in Section 4.
As we have led the reader to expect, there is an alternate denition of in terms
of a partition. Consider = R
0
= (0, 1/2), R
1
= (1/2, 1). The elements of this
family are disjoint open intervals whose closure covers the unit interval. The map
which associates sequences with points has an alternate expression in terms of
this family: namely,
(s
1
, s
2
, . . . ) =
n=0
R
s1
f
1
(R
s2
) f
n
(R
sn+1
).
Remark. The reader might wonder about dening by the simpler expression
(s
1
, s
2
, . . . ) =
n=0
f
n
(R
sn+1
).
There are cases where this would suce, but a diculty can arise and does here.
In X the point 0 which is identied with 1 is a xed point of f which implies that
12 ROY L. ADLER
+
+
v
v
n=0
R
s1
f
1
(R
s2
) f
n
(R
sn+1
)
n=0
f
n
(R
sn+1
).
However, for the so-called expansive dynamical systems, when the size of partition
elements is uniformly small enough, equality holds, in which case would be well-
dened (see Proposition 5.8).
Next we consider hyperbolic automorphisms of the 2-torus. This was the rst
smooth class of invertible dynamical systems found to have Markov partitions. This
discovery was made by K. Berg [Be] in 1966 in his doctoral research. A short time
later R. Adler and B. Weiss [AW1] constructed some special Markov partitions in
order to prove that two such systems are conjugate in the measure theoretic sense
if they have the same entropy. For these systems topological conjugacy implies
measure conjugacy, but not conversely. We shall give a formal development for the
general two-dimensional case in a later chapter. Before making that plunge, we
shall wet our toes with an informal discussion of one specic illustrative case. A
rigorous proof of what we are about to describe will be achieved by Theorem 7.13.
3.3 Partition for a toral automorphism. Take the matrix
A =
_
1 1
1 0
_
which we have met before in quite a dierent context. Let the phase space X be
the two-torus and be given by A : that is,
(x, y) = (x +y, x).
The matrix A has two eigenvalues: = (1+
5 )/2 and = (1
5 )/2. Observe
that > 1 and 1 < < 0. Associated with these eigenvalues are the eigenvectors
v
-component
by . Note is negative, which causes a direction reversal besides a contraction in
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 13
+
+
Figure 8. Another fundamental region of 2-torus
1
R
2
R
3 d
d
o
o
b
c
a
a
l
l
R
o
+
+ +
+
+ + +
+
+
a
b
c
Figure 9. Partition of 2-torus
the v
and v
-direction is
called the expanding boundary of the partition and those in the v
-direction, the
contracting boundary. By lattice translations of the various bounding segments, we
can reassemble their union into two intersecting line segments through the origin,
o
b and a
in the v
j
R
Impossible
R R
R
j
i i
Possible
Figure 12. Intersections
points which are (1, 0) and (1, 1) respectively. Thus the image of
d is a, and the
image of a is c. These facts about the expanding and contracting boundaries imply
that renements of the original partition under positive iterates of do not have
any new boundary segments in the v
-direction not
already in o
0
k=n
k
R
sk
,= , then this intersection is a single rectangle stretching all the
way across R
s0
in the expanding direction. Similarly, if
n
k=0
k
R
sk
,= , then
this set is a single rectangle stretching all the way across R
s0
in the contracting
direction.
Combining these two results we have that a non-empty closed set of the form
n
k=n
k
R
sk
is a closed rectangle. The diameter of these sets is uniformly
bounded by constant [[
n
. Thus as n , a sequence of such sets decreases
to a point in X. Consequently, such a point can be represented by a sequence
s = (s
n
)
nZ
. If fact, all points of the torus can be so represented.
If
n
k=n
k
R
sk
,= , then it is clear that R
si
1
R
si+1
,= for n i n1.
The converse which is the Markov property is really the main one we are extracting
from the geometry of this example. As we have seen R
si
1
R
si+1
,= if and only
if edge s
i+1
follows edge s
i
according to the the graph of Figure 11. This means
that the sequences s = (s
n
)
nZ
are elements is a topological Markov shift.
Once again sets R
i
have an obvious product structure. For p R
i
we call
the segment h
i
(p) specied by intersection of R
i
and the line through p in the
expanding direction the horizontal through p. Similarly, we refer to v
i
(p) given by
the intersection of R
i
and the line through p in the contraction direction as the
vertical. Each rectangle is homeomorphic to the Cartesian product of any one of
its horizontals with any one of its verticals. Just as for topological Markov shifts,
the toral automorphism respects this structure: namely, for p R
i
1
R
j
the
16 ROY L. ADLER
1
R
2
R R
1
R
R R
R
R
1
R
R
1
R
R
-1
R
2
R
2
-1
2
Figure 13. Sets containing the invariant set of the horseshoe map
following holds:
v
i
(p) v
j
(p),
h
i
(p) h
j
(p).
We shall incorporate what we have just described in a comprehensive theory.
3.4 Symbolism for the horseshoe map. The map indicated in Figure 4 has
an invariant set X =
n=
n
R.
As shown in Figure 13, let R
1
, R
2
be the disconnected components of R R.
It can be proved that X is homeomorphic to
2
from which follows that (X, )
and (
2
, ) are topologically conjugate.
For versions of horseshoe-like maps arising from cases where there are many ho-
moclinic points linking various xed points, the invariant sets can be specied by
shifts of nite type. For the dynamical systems where the map is restricted to an
invariant subset of its domain we have the exceptional situation where there is ac-
tually a one-to-one correspondence between their orbits and the symbolic sequences
of a shift of nite type. Here the price paid is that not all of the domain of the map
is represented by symbolic sequences but rather only an invariant subset.
4. More on abstract dynamical systems
Denition 4.1. A dynamical system (X, ) is said to be irreducible if for every
pair of open sets U, V there exists n 0 such that
n
U V ,= .
Another concept we need is the following.
Denition 4.2. A point p is said to be bilaterally transitive if the forward orbit
n
p[ n 0 and the backward orbit
n
p[ n < 0 are both dense in X.
Remark. A symbolic sequence in a topological Markov shift is bilaterally transitive
if every admissible block appears in both directions and innitely often.
We use the notation BLT(A) to denote the subset of bilaterally transitive points
in A X.
In an irreducible system the bilaterally transitive points turn out to be every-
where dense. To prove this, we recall the following theorem of point set topology.
The theorem is more general, but can be slightly simplied in the case where the
space X is a compact metric space.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 17
Baire Category Theorem 4.3. Let U
n
be a countable collection of open dense
subsets of X. Then
U
n
is non-empty. In fact
U
n
is dense in X. Equivalently,
a compact metric space in not the union of a countable collection of nowhere dense
sets.
Proof. Choose inductively balls B
n
such that B
n
B
n
U
n
, and B
n
B
n1
. The
rst property is easily achieved in a metric space; the second because U
n
is dense,
which implies that B
n1
U
n
is a non-empty open set. The sequence (B
n
)
nN
has the nite intersection property: so by compactness
B
n
is non-empty. But
B
n
B
n
U
n
. Thus the intersection
U
n
is not empty. It is also dense,
which is a consequence of replacing U
n
in the above argument by U
n
B and X by
B where B is any ball.
Proposition 4.4. If (X, ) is irreducible, then the set of bilaterally transitive points
is dense in X.
Proof. Let U
n
be a countable basis for X. Since X is irreducible,
k<0
k
U
n
,
as well as
k0
k
U
n
, is open dense in X for each integer n. The set BLT(X) of
bilaterally transitive points can be expressed as
BLT(X) =
n
(
_
k<0
k
U
n
_
k0
k
U
n
).
We apply the Baire Category Theorem to get BLT(X) ,= . But BLT(X) contains
the whole orbit of any of its points, and the orbit of any of its points is dense in
X.
The three examples we have discussed exhibit a certain property which is easily
veried for them. It concerns the divergence of orbits and is dened as follows.
Denition 4.5. A homeomorphism is said to be expansive if there exists a real
number c > 0 such that if d(
n
p,
n
q) < c for all n Z, then p = q.
In the theory of Markov partitions this property plays a key role. Representation
of dynamical systems by shifts of nite type is possible for certain non-expansive
systems, but really seems natural only for expansive ones.
Next we formalize a property of mappings between dynamical systems previously
alluded to in connection with binary expansions.
Denition 4.6. For two general dynamical systems (X, ) and (Y, ) we call the
second a factor of the rst and the rst an extension of the second if there exists a
map of X into Y , which we call a factor map, such that
(i) = (see Figure 14),
(ii) is continuous,
(iii) is onto.
Furthermore, we say is a nite factor map or that it is bounded-to-one if
(iv) there is a bound on the number of pre-images;
and essentially
3
one-to-one if
(v) every doubly transitive point has a unique pre-image.
3
This term is used because in irreducible systems the non-doubly transitive points are negligible
in both the sense of category and measure.
18 ROY L. ADLER
X X
Y Y
Figure 14. Commutative diagram illustrating a factor map
We remark that a topological conjugacy
4
is a nite factor map where the bound
on the number of pre-images is one in condition 1.2(iv). As we shall see, the
seemingly slight weakening of the chains of topological conjugacy, which is what
the denition of an essentially one-to-one nite factor map is meant to do, allows the
necessary freedom to get symbolic representations for smooth dynamical systems.
Proposition 4.7. Let be a factor map of (X, ) and (Y, ) : i.e. satises
properties (i), (ii), and (iii) of Denition 4.6. If (X, ) is irreducible, then so is
(Y, ); and Y = BLT(Y ).
Proof. Let U, V be non-empty open subsets of Y. By properties (ii) and (iii) of
factor maps,
1
U,
1
V are also non-empty and open. Since (X, ) is irreducible,
there exists an integer n > 0 such that
n
(
1
U)
1
V ,= . By 4.6(i),
,= [
n
(
1
U)
1
V ] = [
1
(
n
U V )] =
n
U V.
Thus (Y, ) is irreducible, and from Proposition 4.4 it follows that Y = BLT(Y ).
Proposition 4.8. Let (X, ) be irreducible and an essentially one-to-one factor
map of (X, ) onto (Y, ): i.e. satises (i),(ii),(iii), and (v) of Denition 4.6.
Then maps BLT(U) homeomorphically onto BLT((U)) for any open subset U
of X.
Proof. From the properties of , if the forward orbit of x hits every non-empty
open subset of X, then the forward orbit of (x) hits every non-empty open subset
of Y. Thus BLT(U) BLT((U)).
We have that is a continuous one-to-one map of BLT(U) into BLT((U)). We
prove next that its inverse is continuous also. The proof is a standard compactness
argument which goes as follows. Suppose y
n
y, where y
n
, y BLT((U)). We
shall prove that
1
y
n
1
y. By compactness the sequence (
1
y
n
)
nN
has limit
points in X. Let x be any one of these limit points. By continuity x = y. But the
pre-image of y is unique: so the sequence (
1
y
n
)
nN
, having only one limit point,
has a limit which is
1
y.
Now let x
1
BLT((U)) U, and let V be any non-empty open subset of X.
Choose v BLT(V ). Then by what we have already shown, (v) BLT((V )).
4
The term derives from the group theory notion of conjugate elements and its usage is standard
in the subject. In the sense we are using it, better terms would have been homomorphism for
factor map and isomorphism for topological conjugacy. These are the terms which denote the
property of preserving structure.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 19
Then there exists a sequence of positive integers k
n
such that
kn
(x) (v).
Thus
kn
(x) =
1
kn
(x)
1
(v) = v.
Thus x BLT(U). We have therefore established
1
BLT((U)) BLT(U) : in
other words, BLT((U)) BLT(U).
Proposition 4.9. Under the hypothesis of Proposition 4.8, if U is an open subset
of X, then BLT(U) = BLT[U]
o
and U = [U]
o
.
Proof. Let y BLT(U). Then the unique pre-image of y lies in BLT(U), and y
is not therefore in the closed set (X U). Hence, y Y (X U) [U]
o
.
Therefore, BLT(U) = BLT[U]
o
.
From the continuity properties of , Proposition 4.8, and what was just proven,
we get the following string of equalities: (U) = (BLT(U)) = BLT(U) =
BLT[U]
o
= [U]
o
.
Exercises.
4.1 We call a directed graph G irreducible if given any pair of nodes i, j there
is a directed path from i to j. Show that if G is irreducible as a graph,
then the dynamical system (
G
, ) is irreducible. Conversely, show that if
the dynamical system (
G
, ) is irreducible, then there is an irreducible sub-
graph G
such that
G
=
G
.
4.2 Given any topological Markov shift system (
G
), there exist irreducible sub-
graphs G
1
, . . . , G
M
such that
G
=
G1
GM
(disjoint).
5. Topological partitions
Denition 5.1. We call a nite family of sets = R
0
, R
1
, . . . , R
N1
a topolog-
ical partition for a compact metric space X if:
(1) each R
i
is open
5
;
(2) R
i
R
i
= , i ,= j;
(3) X = R
0
R
1
R
N1
.
Remark. For open sets U, V, U V ,= U V ,= . So for members of a
topological partition we get the following string of implications: R
i
R
j
=
R
i
R
j
= R
i
o
R
j
= R
i
o
R
j
= R
i
o
R
j
o
= . Thus R
i
o
R
j
o
=
for i ,= j.
Denition 5.2. Given two topological partitions = R
0
, R
1
, . . . , R
N1
and
o = S
0
, S
1
, . . . , S
M1
, we dene their common topological renement o as
o = R
i
S
j
: R
i
, S
j
o.
Proposition 5.3. The common topological renement of two topological partitions
is a topological partition.
5
Previous authors have taken these sets to be closed sets with the property that each is the
closure of its interior. The present variation is slightly more general, just enough to make some
notation and certain arguments simpler. In fact, an important example is presented in Section 9
of a partition whose elements are not the interiors of their closures.
20 ROY L. ADLER
Proof. Let and o be the two partitions in question. First of all, it is clear that
the elements of o are disjoint. We show that the closure of elements of o
cover X. Let p X. We have that p R
i
for some i. Thus there exists a sequence
of points p
n
R
i
such that d(p
n
, p) < 1/n. Since o is a topological partition, for
each n there exists S
jn
o such that p
n
S
jn
. Since o is nite, there exists an
index j such that j
n
= j for an innite number of n so that we can assume that
the p
n
were chosen in the rst place such that each j
n
= j. Since p
n
R
i
S
j
we
can choose a sequence of points q
m,n
R
i
S
j
such that d(q
m,n
, p
n
) < 1/m. Thus
d(q
n,n
, p) < 2/n; whence q
n,n
p as n . Therefore p R
i
S
j
.
Proposition 5.4. For dynamical system (X, ) with topological partition of X,
the set
n
dened by
n
=
n
R
1
, . . . ,
n
R
N1
is again a topological partition.
Proof. This is an immediate consequence of the following: (1) the image of a union
is the union of images for any map, (2) a homeomorphism commutes with the
operation of taking closures, (3) the image of an intersection is the intersection of
images for a one-one map.
From Proposition 5.3 and 5.4 we have that for m n,
_
n
m
k
=
m
m1
n
is again a topological partition. We shall use the notation
(n)
n1
k=0
k
.
Thus
(2)
=
1
= R
i
1
R
j
: R
i
, R
j
. Observe that (
(2)
)
(2)
=
(3)
,
or more generally (
(n)
)
(m)
=
(n+m1)
.
The collection
n
: n Z is a collection of open dense sets to which we can
apply the Baire theorem, but due to its special nature we can achieve a slightly
stronger result with the same sort of proof.
Proposition 5.5. Let be a topological partition for dynamical system (X, ).
For every p X there exists a sequence (R
sk
)
kZ
of sets in such that p
n=0
n
n
k
R
sk
.
Proof. Since
_
n
m
k
, m n, is a topological partition, there is a set in it whose
closure contains p, say
n
m
k
R
sk
. We next show that in the renement
_
n+1
m1
n+1
m1
k
R
tk
where t
k
= s
k
for m k n comprise
a subfamily which is a topological partition of
n
m
k
R
sk
. Because
_
n+1
m1
satises 5.1(1) and (2), so does any subfamily. Condition 5.1(3) is a consequence of
_
0tm1N1
0tn+1N1
tk=sk, mkn
n+1
m1
k
R
tk
=
n
k
R
sk
and the fact that the closure of a union is the union of closures. Thus we can choose
by induction the sets R
sk
as follows. Once having specied sets R
sn
, . . . , R
sn
such that p
n
n
k
R
sk
, we can nd sets R
sn1
and R
sn+1
such that p
n+1
n1
k
R
sk
. Hence there exists a sequence (R
sk
)
kZ
of sets in such that
p
n=0
n
n
k
R
sk
.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 21
Remark. With a slight modication of this proof somewhat more can be estab-
lished: namely, a nite sequence of sets R
sm
, R
sm+1
, . . . , R
sn
can be extended
to a bi-innite sequence (R
sn
)
nZ
such that if p
n
m
k
R
sk
, m n, then
p
n=0
n
n
k
R
sk
. We can even go further and make the same claim about
extending a one-sided innite sequence R
sm
, R
sm+1
, . . . to a bi-innite one.
Denition 5.6. We dene the diameter d() of a partition by
d() = max
RiR
d(R
i
)
where d(R
i
) sup
x,yRi
d(x, y).
Denition 5.7. We call a topological partition a generator for a dynamical system
(X, ) if lim
n
d
__
n
n
_
= 0.
If is a generator, then clearly lim
n
d
_
n
n
k
R
sk
_
= 0 for any sequence
of symbols (s
i
)
iZ
0, . . . , N1
Z
. The converse is also true (see Exercise 5.1). In
addition d
_
n=0
n
n
k
R
sk
_
= 0. Hence in Proposition 5.5, if is a generator
and p
n=0
n
n
k
R
sk
, then p =
n=0
n
n
k
R
sk
.
The following proposition gives sucient conditions on a topological partition in
terms of its diameter for it to be a generator.
Proposition 5.8. Let (X, ) be expansive and be a topological partition such
that d() < c where c is the expansive constant. Then is a generator.
Proof. The set
k
R
sk
contains at most one point and thus has zero di-
ameter: for if there exists p, q
k
R
sk
, then d(
n
p,
n
q) < c for n Z
implying p = q. Since
n
n
k
R
sk
n
n
k
R
sk
, d
_
lim
n
n
n
k
R
sk
_
=
d
_
k
R
sk
_
= 0. From Exercise 5.1 we get that is a generator.
Remarks. Generally we merely have the inclusion relation
n=0
n
k
R
sk
k
R
sk
(5.9)
but not equality. However, when the sets of the partition are small enough namely,
when the hypothesis of Proposition 5.8 is satised we do have equality: that is, if
n=0
n
n
k
R
sk
,= , then
n=0
n
n
k
R
sk
=
k
R
sk
.
From the inclusion relation (5.9) we see that if (s) = p, then p R
s0
. Thus if
x belongs only to R
i
, then s
0
= i. In particular, by the remark following Denition
5.1, if p R
i
or p R
i
o
, then s
0
= i. In addition, if there exist sequences s, t such
that (s) = (t) = p and s
0
= i ,= j = t
0
, then p R
i
R
j
, i ,= j, and conversely.
In which case p (R
i
R
i
o
) (R
j
R
j
o
) = R
i
R
j
: i.e. p belongs to the
boundary of partition elements.
Let = R
1
, . . . , R
N
be a generator for a dynamical system (X, ). Let be
the subset of the full N-shift dened by
s = (. . . , s
n
, . . . ) :
n=0
n
k
R
sk
,= . (5.10)
22 ROY L. ADLER
X X
Figure 15. Commutative diagram for symbolic representation
Because the topological partition is a generator, the non-empty innite inter-
section
n=0
n
k=n
k
R
sk
consists of a single point. Therefore, we can dene a
map : X by
(s) =
n=0
n
R
sn
n1
R
sn+1
n
R
sn
. (5.11)
Figure 15 helps us to keep in mind what is being mapped to what.
Proposition 5.12. Let the dynamical system (X, ) have a topological partition
which is a generator. Then as dened by (5.10) is a closed shift-invariant subset
of
N
and the map given by (5.11) is a factor map of the dynamical system (, )
onto (X, ) i.e., satises the following items of Denition 4.6:
(i) = ,
(ii) is continuous,
(iii) is onto.
Proof. To prove is closed we must show that if s = (. . . , s
k
, . . . ) , then
n=0
n
k
R
sk
,= , (5.13)
which then implies that s . For each n > 1 there is a sequence t = (. . . , t
k
, . . . )
such that d(t, s) < 1/n. This means that t
k
= s
k
, n k n so that
n
k
R
tk
=
n
k
R
sk
.
Because t , this set is non-empty. Since this is so for arbitrary n and these sets
form a decreasing sequence of non-empty closed sets, applying compactness we get
5.13.
To prove is invariant we must show that if s , then s : in other
words, for n 0, if
n
n
k
R
sk
,= , then
n
n
k
R
sk+1
,= . This follows from
using the distributive property of
1
with respect to intersections and reindexing:
i.e.
1
_
n
k
R
sk+1
_
=
n+1
n+1
k
R
sk
n1
n+1
k
R
sk
.
We now turn our attention to the properties of .
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 23
(i) satises = . This follows from reindexing after applying the prop-
erty that a homeomorphism commutes with the closure operation and preserves
intersections: to wit,
(s) =
n=0
n
k
R
sk
=
n=0
n
k+1
R
sk
=
n=0
n1
n1
k
R
sk+1
=
n=0
n1
n+1
k
R
sk+1
= (s).
(ii) is continuous. From the generating property of , given > 0 there is
a positive integer n such that d
_
n
n
k
R
sk
_
< . Thus, for s, t
G
, there is a
> 0, namely = 1/(n+1), such that if d(s, t) < , then (s), (t)
n
n
k
R
sk
.
(iii) is onto. This follows immediately from Proposition 5.5.
Exercises.
5.1 Let be a topological partition for a dynamical system (X, ). Prove that
if lim
n
d
_
n
n
k
R
sk
_
= 0 for any sequence of symbols (s
i
)
iZ
0, . . . ,
N 1
Z
, then lim
n
d
__
n
n
_
= 0.
6. Markov partitions and symbolic extensions
Denition 6.1. We say that a topological partition for a dynamical system
(X, ) satises the n-fold intersection property for a positive integer n 3 if
R
sk
1
R
sk+1
,= , 1 k n 1
n
k=1
k
R
sk
,= .
Furthermore, we call a topological partition Markov if it satises the n-fold inter-
section property for all n 3.
Remark. In Section 3.1 and before the term Markov topological generator was
dened, we considered the partition ( = C
a
: a A consisting of the elementary
cylinder sets C
a
= s
G
: s
0
= a for a dynamical system (
G
, ) where
G
is a
shift of nite type base on an alphabet A. As one might have guessed this partition
is the prototype of a topological Markov generator.
Proposition 6.2. If is a Markov partition, then so is
_
n
m
k
for any m n.
Proof. We leave the proof as an exercise.
If a topological partition satises the n-fold intersection property, then it
satises k-fold ones for all smaller k. To increase the order we shall utilize the
following.
Bootstrap Lemma 6.3. If satises the 3-fold and
(2)
satises the n-fold in-
tersection properties, n 3, then satises the (n + 1)-fold intersection property.
24 ROY L. ADLER
Proof. Suppose R
ik
1
R
ik+1
,= , 1 k n. Because satises the 3-fold
intersection property, we have
R
ik
1
R
ik+1
2
R
ik+2
,= , 1 k n 1.
In other words,
(R
ik
1
R
ik+1
)
1
(R
ik+1
1
R
ik+2
) ,= , 1 k n 1.
Because
(2)
satises the n-fold intersection property, we obtain
n+1
k
R
ik
=
n
k
(R
ik
1
R
ik+1
) ,= .
Suppose a dynamical system (X, ) has a Markov generator =R
0
, . . . , R
N1
.
We dene an associated topological Markov shift given by the directed graph G
whose vertices are labelled by / = 0, 1, . . . , N 1 and in which the i-th vertex
is connected to the j-th, i j, i R
i
1
R
j
,= . So by denition of the Markov
shift associated with a transition matrix of a directed graph,
G
= s = (s
n
)
nZ
: R
sn1
1
R
sn
,= , s
n
/, n Z. (6.4)
This set coincides with the subsystem dened by 5.10, which is easily seen as follows.
On one hand, for s
G
, each of the closed sets
k=n
k
R
sk
[ n = 1, 2 . . .
for any n 0 is non-empty since the nite intersection under the closure sign is
non-empty due to the Markov property. For increasing n these closed intersec-
tions form a decreasing sequence of non-empty sets, and therefore by compactness
n=0
n
k=n
k
R
sk
,= . On the other hand, if
n=0
n
k=n
k
R
sk
,= , then
each nite intersection under the closure sign is non-empty, which in turn implies
that each pair of intersections R
sk
1
R
sk
,= , for arbitrary k Z.
Main Theorem.
Theorem 6.5. Suppose the dynamical system (X, ) is expansive and has a Markov
generator = R
0
, . . . , R
N1
. Then the map , as dened by (5.11), is an es-
sentially one-to-one nite factor map of the shift of nite type
G
, as dened by
(6.1), onto X. Furthermore, if (X, ) is irreducible, then so is (
G
, ).
Proof. We must establish (i) - (v) in Denition 4.6. That is a factor mapnamely,
it satises items (i), (ii), and (iii)is the content of Theorem 5.12.
In order to establish (iv)namely, a bound on the number of pre-images under we
introduce the following concept.
Denition 6.6. A map from
G
to X is said to have a diamond if there are
two sequences s, t
G
for which (s) = (t) and for which there exist indices
k < l < m such that s
k
= t
k
, s
l
,= t
l
, s
m
= t
m
. See Figure 16.
Lemma 6.7. If the number of pre-images of a point is more than N
2
, then has
a diamond.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 25
. . .
l
t
,
s
l ,
s
m-1 ,
t
m-1
s
k m
= t
,
s
t
k+1
k+1
,
. . .
,
. . .
. . .
t =
k m
s
Figure 16. A diamond
Proof. We apply the familiar pigeon hole argument. Let s
(1)
, . . . s
(N
2
+1)
be N
2
+1
dierent sequences which map to the same point. Since the sequences are distinct,
there are a pair of indices k, m such that the allowable blocks s
(1)
[k,m]
, . . . s
(N
2
+1)
[k,m]
are distinct. There are N
2
distinct choices of pairs of symbols (s
(i)
k
, s
(i)
m
) : so by
the pigeon hole principle there must be two allowable blocks s
(i)
[k,m]
, s
(j)
[k,m]
, such
that (s
(i)
k
, s
(i)
m
) = (s
(j)
k
, s
(j)
m
). But, since the blocks are dierent, there is an index
l such that s
(i)
l
,= s
(j)
l
. Thus the two sequences s
(i)
, s
(j)
map to the same point,
agree at indices k, m, but dier at l which is between k, m, which means there is a
diamond.
Lemma 6.8. If there exists a bilaterally transitive point with two pre-images, then
has a diamond.
Proof. Let a BLT point p have two pre-images. As we have indicated in the remark
following 5.8, there are two sets R
a
, R
b
, a ,= b, such that p R
a
R
b
. For each
n > 0, the family of sets
n
R
sn
R
s0
. . .
n
R
sn
: s
G
where s
0
= a
covers R
a
, and the family
n
R
tn
R
t0
. . .
n
R
tn
: t
G
where t
0
= b
covers R
b
. Thus, by compactness, there exists s, t
G
with s
0
= a, t
0
= b such
that
p
n=0
n
k
R
sk
and
p
n=0
n
k
R
tk
.
Since p is bilaterally transitive and R
0
is open,
n
p R
0
for some positive n and
m
p R
0
for some negative m. Thus by the remark following (5.8), s
m
= t
m
=
0, s
0
= a ,= b = t
0
, s
n
= t
n
= 0, which is a diamond for .
Lemma 6.9. If d() < c/2, then has no diamonds.
Proof. Since = , we can assume without loss of generality that k = 1 in the
denition of a diamond. Assume that p = (s) = (t) where
s = (. . . , s
2
, a, b
0
, b
1
, . . . , b
m1
, d, s
m+1
, . . . ),
26 ROY L. ADLER
t = (. . . , t
2
, a, c
0
, c
1
, . . . , c
m1
, d, t
m+1
, . . . ).
We must show that b
l
= c
l
for 0 l m1. Because [a, b
0
, b
1
, . . . , b
m1
, d] is an
allowable block in
G
,
R
a
R
b0
1
R
b1
m+1
R
m1
m
R
d
,= .
Choose a point q in this open set. Because is onto, there is a sequence
u = (. . . , u
2
, a, b
0
, b
1
, . . . , b
m1
, d, u
m+1
, . . . )
G
such that (u) = q. Also since [a, c
0
, c
1
, . . . , c
m1
, d] is an allowable block and
G
is a shift of nite type, there is a sequence v
G
such that
v = (. . . , u
2
, a, c
0
, c
1
, . . . , c
m1
, d, u
m+1
, . . . ).
Thus
r (v) R
a
R
c0
1
R
c1
m+1
R
m1
m
R
d
.
From d(R
i
) < c/2 and
l
(x) R
bl
R
cl
for 0 l m 1, we conclude by the
triangle inequality that d(
l
q,
l
r) < c. Furthermore, d(
n
q,
n
r) < c/2 for n < 0
and n > m1. The expansive property then implies that q = r. Thus R
bl
R
cl
,=
which implies that R
bl
R
cl
,= . However, elements of are pairwise disjoint: so
b
l
= c
l
.
(iv) There is a bound on the number of pre-images of .
(v) A BLT point has a unique pre-image.
Because is a generator, n can be chosen so that d
__
n
n
_
< c/2. By
Proposition 6.2,
_
n
n
k
is again a topological Markov partition. For this partition
the associated shift of nite type of (6.4) is given by the higher edge graph G
(2n+1)
.
Let
(2n+1)
be the map of
G
(2n+1) onto X according to (5.11). It has no diamonds:
so by Lemma 6.7 a point has at most N
2(2n+1)
pre-images, and by Lemma 6.8 a
BLT point has only one. The original satises =
(2n+1)
n
where is a
conjugacy of
G
onto
G
(2n+1) . Thus we have that under a point has at most
N
2(2n+1)
pre-images, and a BLT point has a unique pre-image.
We defer the proof of irreducibility to Exercise 6.2.
Converse to the Main Theorem. Recall that we introduced in 3.1 the partition
( = C
i
: i = 0, . . . , N 1 consisting of the elementary cylinder sets C
i
= s
G
: s
0
= i for a dynamical system (
G
, ) where
G
is a shift of nite type
based on an alphabet A = 0, 1, . . . , N 1. This partition is a topological Markov
generator.
Theorem 6.10. Let (X, ) be a dynamical system, (
G
, ) an irreducible shift of
nite type based on N symbols, and suppose there exists an essentially one-to-one
factor map from
G
to X. Then the partition dened by = R
i
= (C
i
)
o
:
i = 0, . . . , N 1 is a topological Markov generator.
Remark. Note we assume is a factor map which has a unique inverse for each
bilaterally transitive point, but no bound is assumed on the number of pre-images
of arbitrary points: i.e., satises (i), (ii), (iii), and (v) of Denition 4.6 but not
(iv). However, in Corollary 6.12 we shall show that (iv) follows from the others
under the hypothesis of expansivity. However, as Exercise 6.3 shows property (v)
is essential: we cannot obtain it from expansivity and (i) through (iv).
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 27
Proof. We must prove the following items:
1. Elements of are disjoint.
2. The closure of elements of cover X.
3. is a generator.
4. satises the Markov property.
(1) Elements of are disjoint: i.e., R
i
R
j
= , i ,= j.
The idea of the proof is to use bilaterally transitive points to overcome a diculty:
namely, maps in general do not enjoy the property that the image of an intersection
is equal to the intersection of images, but one-to-one maps do. Suppose R
i
R
j
,=
for i ,= j. Then, by Proposition 4.7 BLT(R
i
R
j
) ,= . By Propositions 4.8
and 4.9,
1
maps BLT(R
i
) and BLT(R
j
) homeomorphically onto BLT(C
i
) and
BLT(C
j
) respectively. Therefore
1
maps BLT(R
i
R
j
) = BLT(R
i
) BLT(R
j
)
homeomorphically onto BLT(C
i
C
j
) = BLT(C
i
) BLT(C
j
), which implies that
,= BLT(C
i
C
j
) C
i
C
j
, a contradiction.
(2) X =
N1
i=0
R
i
.
X = (
G
) =
N1
i=0
C
i
=
N1
i=0
(C
i
) =
N1
i=0
R
i
, the last equality following
from Proposition 4.9.
For the next two items we need a lemma.
Lemma 6.11. Under the hypothesis of 6.10, (
n
m
k
C
sk
) =
n
m
k
R
sk
for
m < n.
Proof. Once again we use the bilaterally transitive points to deal with images of
intersections. We have the following string of equalities.
(
n
k
C
sk
) =
_
BLT(
n
k
C
sk
)
_
= (
n
m
BLT(
k
C
sk
))
which by injectivity of and shift-invariance of bilateral transitive points
=
n
m
(BLT(
k
C
sk
)) =
n
k
BLT((C
sk
))
which by commutativity of and Proposition 4.8
=
n
k
(BLT(C
sk
)) =
n
k
(BLT(C
sk
))
which by Proposition 4.9
=
n
k
(BLT(R
sk
)) = BLT(
n
k
(R
sk
))
=
n
k
(R
sk
).
(3) is a generator.
28 ROY L. ADLER
Because ( is a generator, d
_
n
n
k
C
sk
_
0. By Lemma 6.11 , (
n
n
k
C
sk
)
=
n
n
k
R
sk
. So, by continuity of , we get
d
_
n
k
R
sk
_
= d
_
n
k
R
sk
_
0.
(4) satises the Markov property.
Suppose R
si
1
R
si+1
,= , 1 k n 1. By Lemma 6.11 we have [C
si
1
C
si+1
] = R
si
1
R
si+1
,= , 1 k n 1. Thus C
si
1
C
si+1
,= , 1
k n1. Since ( satises the Markov property,
n
k=1
k
C
sk
,= for all n > 1. So
(
n
k=1
k
C
sk
) =
n
k=1
k
R
sk
,= for all n > 1. Therefore,
n
k=1
k
R
sk
,=
for all n > 1.
Corollary 6.12. If in addition to the hypotheses of Theorem 4.18 the dynamical
system (X, ) is expansive, then is nite.
Proof. We derive 4.6(iv) from the assumption that the domain of is irreducible,
satises 4.6(i), (ii), (iii), and (v), and is expansive. This is an immediate
consequence of Theorems 6.10 and 6.5.
Remark. We remark that a dynamical system (X, ) which is a factor of a subshift
of nite type via an essentially one-to-one nite factor map, i.e. one that satises
properties (i)-(v) of Denition 4.6, need not be expansive (see Exercise 8.4). On
the other hand D. Fried has proven [F].
Theorem 6.13. An expansive dynamical system (X, ) is a factor of a subshift of
nite type (
A
, ) if and only if has a Markov partition.
Here the factor map need not be nite. To square this with what we have
proved, we note that in the only if part of the theorem the subshift and factor
map associated with the Markov partition will not be the ones of the assumption.
We further remark that here expansivity has replaced the one-to-one condition of
Theorem 6.10. The proof of Frieds theorem relies on a technique of Bowen which
will be mentioned in the epilogue.
Exercises.
6.1 A topological partition for a dynamical system (X, ) is Markov if and only
if
n
k=0
k
R
sk
,= ,
0
k=n
k
R
sk
,=
n
k=n
k
R
sk
,= ,
for n 0.
6.2 Under the hypothesis of Theorem 6.5 show that if (X, ) is irreducible, then
so is (
G
, ).
6.3 Show that in Theorem 6.10, condition (v) cannot be replaced by condition
(iv). Hint: Consider the following example. Let
A =
_
_
1 1 0
1 0 1
1 1 0
_
_
,
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 29
1
1
2
3
0
0
0
1
1
Figure 17. indicated by edge-labels
and : X
A
X
2
be dened, as depicted in Figure 17, by
[1, 1] = [2, 3] = [3, 2] = 0
[1, 2] = [2, 1] = [3, 1] = 1.
Show disjointness is violated in the image partition of the elementary cylin-
der sets.
6.4 Let be a factor map from a topological Markov shift (
A
, ) to an expansive
dynamical system (X, ) with expansive constant c, and let the partition
dened by = R
i
= (C
i
)
o
: i = 0, . . . , N 1 satisfy d() < c/2. Show
that if has a diamond, then there exists a point p X with a continuum
of pre-images.
7. Product structure
The Markov property for a topological partition is an innite set of conditions.
It is the crucial one for obtaining a topological Markov shift representation of a
dynamical system, but it could be dicult to verify. However, there is another
more useful criterion for getting it to which we now turn our attention. It involves
exchanging one innite set of conditions for another of a dierent sort that is more
readily checkable. Once more we look to our concrete systems as a guide. The sets
of partitions in Examples 3.1 and 3.3 have a product structure whose behavior with
respect to the action of a mapping is intimately tied up with the Markov property.
A general notion of partition without regard to any other consideration is the
following.
Denition 7.1. A partition of a set R is dened to be a family H = h(p) : p R
of subsets of R such that for p, q R
1. p h(p),
2. h(p) h(q) ,= h(p) = h(q).
Denition 7.2. We call two partitions
H = h(p) : p R,
1 = v(p) : p R
of R transverse if h(p) v(q) ,= for every p, q R.
30 ROY L. ADLER
R
p
i
R
p
i
j
R
Figure 18. Property M
A set R with two transverse partitions H, 1 can be viewed as having a product
structure something like that of a rectangle, which suggests the following graphic
terminology: we shall refer to the elements H as horizontals and those of 1 as
verticals. When we are dealing with elements R
i
of a topological partition =
R
i
: i = 1, . . . , N, each having a pair of transverse partitions H
i
, 1
i
, we refer to
h
i
(p) as the horizontal through p in R
i
and to v
i
(p) as the vertical .
Next we introduce notions concerned with the behavior of horizontals and ver-
ticals under the map associated with a dynamical system. We shall stick to the
convention that under the action of a map verticals seem to contract and hori-
zontals seem to expand. While we do not insist that the diameters of the images
of these sets actually increase or decrease, this will generally be the case. In fact
there usually is uniform geometric expansion and contraction. In the literature one
encounters the term stable set for what we call a vertical and unstable set for a
horizontal.
Denition 7.3. Suppose a dynamical system (X, ) has a topological partition
= R
i
, each member of which has a pair of transverse partitions. We say alignment
of verticals and horizontals are respectively maintained by and
1
if for all i, j
1. p R
i
1
R
j
R
j
v
i
(p) v
j
(p),
2. p R
i
R
j
R
j
1
h
i
(p) h
j
(
1
p).
We actually require something stronger.
Denition 7.4. In a dynamical system (X, ) we say a topological partition
= R
i
has property M if each set R
i
has a pair of transverse partitions such
that alignments of horizontals and verticals are maintained by and its inverse
respectively in such a manner that the image of any vertical and the pre-image of
any horizontal are contained in a unique element of . In other words, 7.3 (1) and
(2) are replaced by:
1. p R
i
1
R
j
v
i
(p) v
j
(p),
2. p R
i
R
j
1
h
i
(p) h
j
(
1
p).
See Figure 18. Also see Figure 19 for violations of alignment and property M.
We remark that with respect to horizontals 7.6 (2) can be expressed alternatively
as follows:
p R
i
1
R
j
h
i
(p) h
j
(p).
Proposition 7.5. If has property M, then so does
(2)
. (See Figure 20.)
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 31
j
p R
i
R
J
R
R
i
property M
p
alignment
Figure 19. Violations
R
j
p
R
i
-1
q
Figure 20. Property M on
(2)
Proof. Let R
i
1
R
j
,= be a member of
(2)
. Since a partition of a set induces
a partition of a subset, the horizontals and verticals of R
i
induce corresponding
partitions of R
i
1
R
j
: namely,
1. H(R
i
1
R
j
) = h
ij
(p) = h
i
(p)
1
R
j
: p R
i
1
R
j
,
2. 1(R
i
1
R
j
) = v
ij
(p) = v
i
(p) : p R
i
1
R
j
.
First, to verify that this pair of partitions is transverse, we observe that if p, q
R
i
1
R
j
, then by 7.4 (1)
v
i
(q)
1
v
j
(q)
1
R
j
.
From denition (1) we have
h
ij
(p) v
ij
(q) = h
i
(p)
1
R
j
v
i
(q) = h
i
(p) v
i
(q) ,= .
Second, we show that and its inverse map verticals and horizontals so as to
satisfy property M. Let p R
i
1
R
j
1
(R
j
1
R
k
). On one hand, it is
immediate from the denition that
v
ij
(p) = v
i
(p) v
j
(p) = v
ij
(p).
32 ROY L. ADLER
i
R
j
R
R
q
-1
k
p
Figure 21. 3-fold intersection property
On the other,
h
ij
(q) = (h
i
(q)
1
R
j
) h
j
(q) h
j
(q)
1
R
k
= h
jk
(q).
Corollary 7.6. If has property M, then so does
(n)
for n = 1, 2, . . . .
Proof. Repeated use of Proposition 7.5 using the identity (
(n)
)
(2)
=
(n+1)
.
Proposition 7.7. For a dynamical system (X, ) if a topological partition has
property M, then satises the 3-fold intersection property. (See Figure 21.)
Proof. Let p R
i
R
j
,= and q R
j
1
R
k
,= . Then by transversality
v
j
(q) h
j
(p) ,= . Futhermore, v
j
(q) h
j
(p) R
j
and v
j
(q) h
j
(p)
1
v
k
(q)
h
i
(
1
p)
1
R
k
R
i
: so R
i
R
j
1
R
k
,= . Thus we have
R
i
1
R
j
,= , R
j
1
R
k
,= R
i
1
R
j
2
R
k
.
Corollary 7.8. Given a dynamical system (X, ), if a topological partition has
property M, then
(n)
satises the 3-fold intersection property for n = 1, 2, . . . .
Proof. Follows from Corollary 7.6 and Proposition 7.7.
Theorem 7.9. Given a dynamical system (X, ), if a topological partition has
property M, then is Markov.
Proof. Follows from Corollaries 7.6, 7.8, and the Bootstrap Lemma 6.3. For in-
stance,
(n1)
and
(n)
satisfy the 3-fold intersection property. So
(n1)
satises
the 4-fold one. Working our way back, we get
(n2)
satises the 5-fold one, etc.
Finally, we get that satises the (n + 2)fold intersection property, but this is
true for any n.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 33
V
H
(q)
(p)
p
1
p
2
p
q
i
R
i
. Suppose the boundary R
i
of each
element of is the union of two subsets: one,
V
R
i
, which we shall call the vertical
boundary of R
i
, the other,
H
R
i
, the horizontal boundary of R
i
. We denote the
union of all vertical boundaries of elements of by
V
V
R
i
, and the union
of all horizontal ones by
H
H
R
i
.
Denition 7.10. We say that a topological partition has boundaries satisfying
property M if the following hold for each i :
1. R
i
=
V
R
i
H
R
i
,
2. v
i
(p) R
i
H
R
i
,
3. h
i
(p) R
i
V
R
i
,
4.
V
,
5.
1
H
.
We introduce the additional topological structure needed for the next theorem.
Denition 7.11. We call a metric space R an abstract rectangle if it is homeo-
morphic to the Cartesian product of two metric spaces i.e. there exist two metric
spaces H, V and a homeomorphism of the Cartesian product HV onto R. (See
Figure 22.)
Sets with a pair of transverse partitions usually arise in this way. Let (p
1
, p
2
) =
p, where p R and (p
1
, p
2
) H V. Dene the following horizontal and vertical
sets of R :
h(p) (x, p
2
) : x H,
34 ROY L. ADLER
(r)
R
i
R
j
R
k
(q)
-1
R
(p)
i
q
p
r
Figure 23. Impossible boundary picture
v(p) (p
1
, y) : y V .
Naturally the two partitions
H = h(p) : p R,
1 = v(p) : p R
of R are transverse since
v(p) h(q) = (p
1
, q
2
) ,= .
In addition, for each pair of points p, q R the map (p, q) v(p) h(q) is
continuous, onto, and maps h(p) v(q) homeomorphically onto R
i
. Thus we could
have assumed that H, V were subsets of R in the rst place. We designate these
subsets with letters meant to suggest horizontal and vertical lines.
Theorem 7.12. In a dynamical system (X, ), if each element of a topological par-
tition is a connected abstract rectangle, the alignments of which are maintained
by and its inverse respectively, and if has boundaries with property M, then
itself has property M i.e. is a Markov partition.
Proof. We give the proof only for verticals, which consists of proving
p R
i
1
R
j
v
i
(p) R
j
.
See Figure 23. Our proof involves one proof by contradiction established by means
of a second. The main one is a contradiction to the assumption that v
i
(p) R
j
.
The other one contradicts the connectivity of v
i
(p), which is a consequence of the
following.
Since R
i
is homeomorphic to h
i
(p) v
i
(p), the vertical v
i
(p) is connected: for
otherwise R
i
would not be. Therefore, the homeomorphic image v
i
(p) is connected
as well. Thus, if v
i
(p) R
j
, then we would have that v
i
(p) R
j
,= . Hence
there would exist a point q v
i
(p) R
j
which is also in v
i
(p) R
j
: for if not,
then there would be an open set U v
i
(p) R
j
such that U v
i
(p) R
j
= ,
and the open sets R
j
and U (X R
j
) would disconnect v
i
(p).
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 35
By preservation of alignments, we have v
i
(p) R
j
v
j
(p) so that q v
j
(p).
Thus q
H
, from which follows by property M for boundaries that
1
(q) .
However,
1
(q) v
i
(p) R
i
, which contradicts R
i
R
i
= .
We now turn our attention to 2-dimensional toral automorphisms in generality.
While there exist non-measurable automorphisms, for us toral automorphisms will
mean continuous ones.
8. Markov partitions for automorphisms of the 2-torus
Let X = R
n
/Z
n
be the n-dimensional torus and A a n n matrix with integer
entries and determinant 1. Such a matrix denes an automorphism of the n-torus
in the manner described in Section 2.3. The set of such matrices forms a group
called the general linear group GL(n, Z). Both a matrix A and the automorphism
it denes are called hyperbolic if A has no eigenvalue of modulus one.
We shall devote the rest of this section to the two dimensional case; i.e. n = 2.
Let
A =
_
a b
c d
_
GL(2, Z).
Eigenvalues of A are the solutions of the quadratic equation
x
2
(traceA)x + detA = 0.
Here hyperbolicity means that A has two distinct eigenvalues, say and , which
are irrational numbers. Since = detA = 1, we can assume that [[ > 1 and
[[ < 1. An easy calculation shows that the row vectors
v
= (c, a)
v
= (c, a)
are eigenvectors associated with and with respectively. The action of A on a
vector v is to contract its v
-component by .
Directions may or may not be reversed depending on the signs of the eigenvalues.
We refer to the direction of v
as the
contracting one. Finally, let
the one in the contracting direction. See Figure 24. We call these
lines, which are invariant under the action of A on the plane, the expanding and
contracting eigen-line respectively. The slopes of these lines are m
= ( a)/c
and m
= ( a)/c. From these formulae one sees that these lines pass through
no lattice points other than the origin: for if they did then the slopes m
and m
+
+
Figure 24. A fundamental region and eigen-directions of A
with the appropriate boundary identications. In this region the metric on the
torus coincides with the Euclidean one: namely, d(p, q) = [[pq[[ where [[(x, y)[[ =
_
x
2
+y
2
. Let r = p q ,= (0, 0). Let r
and r
be the v
[ [
n
r
[ [[
n
r[[ [
n
r
[ +[
n
r
[.
One of the components r
, r
,= 0 : otherwise
replace by
1
in the argument. We can also assume that [[ <
1
2
: for, if not,
replace by
k
for large enough k. If [[r[[ >
||
4
, then [[
n
r[[ > c for n = 0. If
[[r[[
||
4
, choose n 1 such that
[[
n+1
4
[r
[
[[
n
4
.
Then the following inequalities show that
n
r is in the fundamental region and that
[[
n
r[[ > c.
[
n
r
[ +[
n
r
[
1
4
+
[[
n+1
4
<
1
2
,
[[
8
[[
4
[[
n+1
4
[
n
r
[ [
n
r
[.
Remark. We now have all the ingredients for a formal proof that the toral auto-
morphism of Example 3.3 enjoys the conclusions of Theorem 6.5 about representing
dynamical systems by topological Markov shifts. In this case the automorphisms
can be given by the matrix
A =
_
1 1
1 0
_
and the topological Markov shift (
G
, ) by the edge-graph G of Figure 2. Theorem
8.1 shows that is expansive, a necessary item in the hypothesis of the main the-
orem, Theorem 6.5. Theorem 7.12 can be applied to show that the partition in
Example 3.3 is Markov, another necessary item. Finally in Section 3.3 we have
already shown that the partition in question is a generator, the remaining require-
ment of the main theorem.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 37
Returning our attention to the general case, we shall exploit one of the properties
of a hyperbolic automorphism: namely, a matrix in the group GL(2, Z) specifying
a hyperbolic automorphism is conjugate to another one, all of whose entries bear
the same sign. More specically we have the following theorem of Williams [W],
proved by entirely elementary methods of plane geometry.
Theorem 8.2. Let A GL(2, Z) be hyperbolic. Then there exists C, P GL(2, Z)
such that CAC
1
= P where = 1, the choice of sign being the same as that of
, and the entries of the matrix
P =
_
p q
r s
_
are non-negative.
Proof. Choose a pair of lattice points (, ), (, ) such that
(i) the angle (, )(0, 0)(, ) is acute and
does not;
(ii) the closed parallelogram with vertices at the origin, the two lattice points, and
the lattice point ( + , + ) contains no other lattice point. (See Figure
25.)
This can always be done. One way is to use continued fractions to approximate
slopes of lines, a discussion of which shall be deferred to a remark. An even more
elementary way is the following.
First, choose initial lattice points (, ) and (, ) so close to
that (i) is
satised and no other lattice points lie between them and the origin on a direct
line. Then if the parallelogram in (ii) contains another lattice point in its interior,
connect it to the origin with a line segment. Form a new pair of lattice points by
taking the closest lattice point to the origin on this segment and selecting the one
(+,+ )
l
+
+
+
+
+
+
+
(,)
(,)
(0,0)
+ +
Figure 25. Parallelogram and expanding direction
38 ROY L. ADLER
from the previous pair for which (i) holds. Continue this process until condition
(ii) is satised.
Condition (ii) is equivalent to the area of the parallelogram equalling 1.
Consider the linear map given by the matrix
C =
_
_
.
Since
(0, 1)C = (, ),
(1, 0)C = (, ),
C maps the principal fundamental regionnamely, the closed unit squareonto the
parallelogram. We shall show that C provides the sought-after conjugating trans-
formation.
The two lines
the second.
To prove that P is non-negative, or equivalently, that P maps the rst quadrant
into itself, we must just show that (0, 1)P and (1, 0)P lie in the rst quadrant. We
shall give the proof only for (0, 1)P : the arguments which follow work equally well
for the other lattice point (1, 0)P.
There are two cases depending on whether the contracting eigenvalue of P is
positive or negative. In the second case ( < 0), the linear map P reects the rst
quadrant about the eigen-line
and b
its
image under P. See Figure 26. The point b
on the line
.
We deal rst with the case without reection. Let
respectively. The notation [pq[ stands for the length of the line
segment with end-points p, q. On one hand, since [a
[ = [[
1
[ab[ < [ab[, the
point a
and the
vertical through a
[ > 1. However,
because triangle c
is similar to cab, [a
< 1,
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 39
l
+
a
b
a
b
Figure 26. Geometrical gure for proof without reection
l
a
b
c
l
c
a
+
+
b
Figure 27. Geometrical gure for proof with reection
2. 0 < m
< 1,
where m
and m
for P.
The rst inequality indicates that the contracting eigen-line for P passes through
the second quadrant between the lattice points (0, 1) and (1, 1), and the second
that the expanding one passes through the rst quadrant under the lattice point
(1, 1). For Theorem 8.4, the main one of this section, one does not need more than
what is provided by Theorem 8.2. These extra properties make life a little less
dicult. The rst one makes a key gure, Figure 29 on page 42, easier to draw.
The second one obviates repeating proofs covering slightly dierent geometrical
gures. Not taking advantage of it multiplies the number of cases in the proof, and
we shall have enough of them as it is. Since we shall not be using the full strength
of this remark, one can skip the remainder of it and proceed directly to Theorem
8.4. We shall be using the second property, which is very easy to achieve by itself.
40 ROY L. ADLER
From the theory of continued fractions, we know that every irrational number
can be written uniquely as an innite continued fraction [a
0
, a
1
, . . . ] = a
0
+1/(a
1
+
1/ . . . ) where a
n
Z for all n and a
n
> 0 for n > 0. In addition, the continued
fraction of a quadratic surd has a periodic tail: namely, the tail can be written as
[b
1
, . . . , b
m
] > 1, where the overbar means innite repetition of b
1
through b
m
. In
[ATW] the following was proved.
Theorem 8.3. Let A GL(2, Z) be hyperbolic. The slope m
, being a quadratic
surd, can be written m
= [a
0
, a
1
, . . . , a
n
, b
1
, . . . , b
m
], where m is as small as possi-
ble. If
C =
_
0 1
1 a
0
__
0 1
1 a
1
_
. . .
_
0 1
1 a
n
_
,
then CAC
1
= P where
P =
__
0 1
1 b
1
_
. . .
_
0 1
1 b
m
__
N
,
for some positive integer N and is the same as in Theorem 8.2. Furthermore,
the slopes of the eigen-lines of P satisfy m
= [b
1
, . . . , b
m
] > 1 and 1/m
=
[b
m
, . . . , b
1
] > 1.
The matrix
_
0 1
1 0
_
P
_
0 1
1 0
_
achieves the above two conditions in the remark.
Theorem 8.4. Let be a toral automorphism whose dening matrix is either P
or P where
P =
_
p q
r s
_
is a hyperbolic matrix in GL(2, Z) with non-negative entries. Then there exists a
Markov generator
, v
the corresponding eigen-lines through the origin. We denote lines parallel to these
through a point p by assigning p as a superscript. For example,
(0,1)
denotes the
line through (0, 1) parallel to
, etc.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 41
. . .
p paths
. . .
. . .
. . .
q paths
s paths
r paths
I
II
Figure 28. Edge graph dened by P
We dene the following points as depicted in Figure 29:
o (0, 0)
o
(1, 0)
o
(1, 1)
o
(0, 1)
a
(1,0)
(1,0)
(1,1)
(1,1)
(0,1)
(0,1)
(1,0)
(1,0)
= b + (1, 0)
b
(1,1)
(0,1)
= b + (1, 1)
c
(0,1)
(1,0)
(1,1)
c
(0,1)
(1,1)
(1,0)
(0,1)
= d + (1, 0)
d
(0, 1).
(8.5)
We have drawn Figure 29 as if the rst statement in the remark following The-
orem 8.2 holds. This places the point c
and R
II
the interior of parallelo-
gram c
.
42 ROY L. ADLER
R
II
R
d
c
b
c
a o =
l
(0,0)
l
(0,0)
b
a o =
l
(1,1)
l
(1,1)
a o =
l
(0,-1)
l
I
-
(0,1)
l
(-1,0)
l
(-1,1)
l
(1,0)
d*
+
b
c
= a o
d
+
+
Figure 29. A remarkable fundamental region
The closed set R
I
R
II
, as we shall show, is a fundamental region, which we
shall call the principal Markov one. As drawn in Figure 29, this set is equivalent
modulo Z
2
to the unit square by sliding aa
one unit down. But in general we need a slightly more elaborate proof.
First, no two points in the interior of R
I
R
II
are equivalent because R
I
R
II
is
disjoint from its four neighboring unit translates which totally bound it. Second, the
set of all Z
2
translates of R
I
R
II
covers the plane because all integral horizontal
translates of the union of this set with its unit downward vertical translate covers the
innite strip between the lines y = 0 and y = 1 and all integral vertical translates
of the strip cover the plane.
Thus we can view the torus to be the set X = R
I
R
II
with points on the
boundary identied by lattice translations.
The proof of the theorem involves four cases: two subcases arise for each matrix
P depending on whether is positive or negative. These amount to the four
possible combinations,
(, ) = ([[, [[),
of signs for the eigenvalues of the matrix representing . The simplest case is when
both and are positive. Things are more dicult when either is negative, es-
pecially . So that the proof appears less tedious, we divide each case into ve
steps.
Case I: > 0, > 0.
Step 1. The family = R
I
, R
II
is a Markov partition.
Our aim is to show that satises the hypothesis of Theorem 7.12.
By virtue of their construction as parallelograms the members R
I
, R
II
of are
connected. These open parallelograms are obviously abstract rectangles, each being
homeomorphic to the Cartesian product of two open intervals. The two in question
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 43
are a pair of intersecting sides of a parallelogram minus endpoints. Horizontals and
verticals of R
i
, i = I, II, are given by
v
i
(p)
p
R
i
,
h
i
(p)
p
R
i
.
Since P denes a linear transformation of the plane, the image of a line parallel
to an eigen-line is another line parallel to the same eigen-line: so alignment of
verticals and horizontals is maintained by and
1
: namely,
p R
i
1
R
j
R
j
v
i
(p) v
j
(p), (1)
p R
i
R
j
R
j
1
h
i
(p) h
j
(
1
p) (2)
for i, j = I, II.
Next we verify that the boundaries of these members satisfy property M of
Denition 7.10 which entails ve items. The rst of these concerns dividing R
I
and R
II
into vertical and horizontal pieces. As shown in Figure 29 we have
R
I
=
V
R
I
H
R
I
,
R
II
=
V
R
II
H
R
II
,
where
V
R
I
ac b
H
R
I
ab
cd
V
R
II
c
H
R
II
c
,
each being the union of two line segments. Utilizing the boundary identications,
we have that consists of two transverse line segments intersecting at the origin:
namely,
H
= cb
V
= ad
.
(8.6)
It is clear from Figure 29 that (2) and (3) of 7.10 are satised: namely,
v
i
(p) R
i
H
R
i
,
h
i
(p) R
i
V
R
i
.
From (8.6) and the property that the restrictions of P and P
1
respectively to the
lines
and
(cb
)P
(ad)P ad.
(8.7)
We can restate (8.7) as items (4) and (5) of 7.10: namely,
V
V
,
H
H
.
Thus we have established that satises the hypothesis of Theorem 7.12: so it
is a Markov partition.
44 ROY L. ADLER
Remark. At this point it may be instructive to remark on our denition of topolog-
ical partition. This example illustrates the advantage of using open sets over their
closures as members of such a partition. For one thing, R
I
,= R
I
o
and R
II
,= R
II
o
.
For another, while R
I
and R
II
are abstract rectangles, their closures are not. For
instance, R
I
is homeomorphic to a Cartesian product: namely, the product of the
line segment ab
H
are connected line segments which get mapped into themselves under and
1
respectively. This would not be the case if = R
I
, R
II
. If ones denition
of topological partition involves closures of open sets as members rather than open
sets, then one is forced into somewhat greater contortions in order to achieve the
same results.
The Markov partition is not necessarily a generator. The trouble is that, while
the members of are connected, those of
(2)
=
1
may not be, in which
case non-empty sets of the form
n=0
n
n
k
R
sk
may consist of more than one
point. To overcome this let us examine the sets R
i
1
R
j
1
. As we
shall see each consists of a union of disjoint open parallelograms, the number of
which is given by the matrix P. A remedy is immediately suggested.
Step 2. The family
= R
k
: 1 k N
-
direction. This parallelogram passes through various Markov fundamental regions,
and in each one when it intersects a parallelogram equivalent to R
j
, the intersection
is a parallelogram. No two of these intersections share an equivalent point: for
otherwise a violation of 2.1.1(3) with respect to the fundamental region (R
I
R
II
)P
would be committed. While the set h
i
(p) R
j
may consist of several horizontals,
the set h
i
(p) R
k
is either empty or a single one. Furthermore, if h
i
(p) R
k
,=
for one horizontal h
i
(p) of R
i
, then h
i
(p
) R
k
,= for any other horizontal h
i
(p
)
in R
i
. Back on the torus the various non-equivalent parallelograms represent disjoint
connected sets which we have labelled R
k
, 1 k N
is maintained by
and
1
for the same reason it is for members of .
Regarding boundaries we have that
(2)
=
H
= (cb
)P,
(2)
=
V
= ad.
(8.8)
Thus, just as for , we have
= (ad)P ad =
= cb
(cb
)P =
H
.
The partition
k
)P passes through various
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 45
Figure 30. A non-empty parallelogram of the form
n
n
k
R
sk
fundamental regions; and in each one when it intersects a parallelogram equivalent
to R
l
, the intersection is a parallelogram, and no two of these intersections share an
equivalent point. Hence, a non-empty set of the form R
i
R
j
is a single connected
parallelogram.
Step 3. If R
k
is one of the parallelograms in R
i
R
j
, then the length of its v
-
dimension is the same as that of R
j
, while the length of its v
-dimension is [[ times
that of R
i
. Similarly, a non-empty set of the form R
i
R
j
is a single connected
open parallelogram, the length of its v
j
and
the length of its v
i
.
Likewise, a non-empty set of the form
n
n
k
R
sk
is a single connected open
parallelogram (as depicted by the shaded area of Figure 30), the length of its v
-
dimension being [[
n
times that of R
sn
and the length of its v
-dimension [[
n
times that of R
sn
. From this we have
d
_
n
_
= d()/[[
n+1
0, as n .
In other words,
is a generator.
Step 4. Let #(R
i
R
j
) denote the number of disjoint parallelograms in the in-
tersection R
i
R
j
.
#(R
i
R
I
) number of lines x = 0, x = 1, . . . , traversed by (R
i
)P;
#(R
i
R
II
) number of lines y = 1, y = 2, . . . , traversed by (R
i
)P,
(8.9)
which leads to
#(R
I
R
I
) = x-coordinate of (0, 1)P = p;
#(R
II
R
I
) = x-coordinate of (1, 1)P (1, 0)P = q;
#(R
I
R
II
) = y-coordinate of (0, 1)P = r;
#(R
II
R
II
) = y-coordinate of (1, 1)P (1, 0)P = s.
(8.10)
To establish Step 4 it is important to understand how (R
I
)P and (R
II
)P in-
tersect various Markov fundamental regions, particularly how they begin and end.
We must show that Figure 31 truly represents the situation: namely, the segment
(b
b. First, if a parallelogram (R
i
)P intersects a lattice
translate of R
j
, then it stripes all the way across. As it does do, it passes strictly
46 ROY L. ADLER
b
b
a
c
a
c
d
I
R
~
c
b
~
a
(p,q)
I
R + (p,q)
a
~
b
~
~
~
~
a
(p+r,q+s)
=
I
R +(p+r-1,q+s-1)
I
R P
b
Figure 31. How (R
I
)P and (R
II
)P intersect various fundamental regions
through the lattice translate without ever straddling any part of a horizontal bound-
ary: for otherwise there would be a violation of the property that (R
I
R
II
)P is a
fundamental region because
H
(P)
H
.
Since P is a contraction on the line
)P and
b (p, q) + b. The same
is true for (d
)P : for otherwise (R
I
)P would straddle part of its own boundary on
the line
(p1,q)
. Thus
(ac)P (ad)P ab,
and
(b
)P a
b.
So the parallelogram (R
I
)P begins (as shown in the leftmost gure of Figure
31) with its left vertical boundary contained in the segment ab on the line
and
ends (as shown in the middle gure of Figure 31) with its right vertical boundary
contained in the segment a
b on the line
(p,q)
)P
(r,s)
. The parallelogram (R
I
)P ends by striping through R
II
+ (p 1, q 1).
The other parallelogram (R
II
)P begins where (R
I
)P leaves o. Its left vertical
boundary is contained in a
b where
a (p + r, q + s) and
b b + (p, q), as shown in the middle gure of Figure 31. The rst set that
(R
II
)P stripes through is R
I
+ (p, q) and the last R
II
+ (p + r 1, q + s 1). It
stripes through R
I
+(p, q) on top of R
I
(R
I
)P+(p, q) and ends at the bottom of
R
II
+ (p +r 1, q +s 1), as shown in Figure 31.
As drawn in Figure 29, the line segment aa
lies totally in R
I
connecting its
bottom horizontal boundary with its top: so (R
i
)P has the property that it stripes
across R
I
+ (m, n) if and only if it passes through the interior of the line segment
aa
+ (m, n). But one must bear in mind that, unless we have arranged for c
to
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 47
be located in the principal unit square, the line segment aa
equals the
number of dierent lines x = integer crossed by (R
i
)P, which equals p for (R
I
)P
and r for (R
II
)P.
Similarly, a parallelogram (R
i
)P intersects lattice translate of R
II
if and only
if (R
i
)P crosses the same lattice translate of a
= p +q +r +s parallelograms R
i
in
accordingly:
R
1
R
p
= R
I
R
I
,
R
p+1
R
p+q
= R
II
R
I
,
R
p+q+1
R
p+q+r
= R
I
R
II
,
R
p+q+r+1
R
p+q+r+s
= R
II
R
II
.
As we have said before: if h
i
(p) R
k
,= for one horizontal h
i
(p) of R
i
, then
h
i
(q) R
k
,= for any other horizontal h
i
(q) of R
i
. Therefore, the image of each
parallelogram R
k
contained in an R
i
intersects the same elements of
. From this
we get that R
k
R
l
,= , equivalently R
k
1
R
l
,= , whenever either
k (1) (3) and l (1) (2),
or
k (2) (4) and l (3) (4).
Case II: > 0, < 0.
48 ROY L. ADLER
R
d
c
c
b
l
(1,1)
a
b
II
R
l
(0,1) v +
a
a
l
(1,0)
o
o
l
(0,-1)
l
(0,0)
v
+ (-1,1)
l
v
I
d*
(-1.0) +
l
(0,0) v +
l
+ (1.0) v
b
c
d
-
a
+
o
o
Figure 32(A). New principal Markov fundamental region, a
translation of the previous one with o between a and b
Except for the lattice points of the unit square labelled by os we redene the
other ones in Figure 29 by translating them by v
d)P = a.
For example, a
new
= a
old
+ v
. The dierence
in the two gures is the location of o with respect to b. In a moment we shall argue
that Figure 32(B) cannot occur.
Step 1. The new partition = R
I
, R
II
is Markov.
The proof of this is the same as in Step 1 of Case I except the verication of
V
V
,
H
H
is more involved and is as follows.
For this new partition, (8.6) remains valid: namely,
H
= cb
V
= ad
.
Before there was a common xed point on
H
and
V
: namely, origin =
H
V
, but now the intersection
H
V
is no longer a xed point. The
line segment ad still contains the origin as a xed point. The line segment cb also
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 49
R
c
c
l
(1,1)
a
b
II
R
l
(0,1) v +
a
a
l
(1,0)
l
+ (1.0) v
d
o
o
o
b
l
(0,0) v +
I
l
o
(0,-1)
l
(0,0)
v
+ (-1,1)
l
(-1.0) +
b
c
d
-
a
+
Figure 32(B). Hypothetical new principal Markov fundamental
region, with o above b
contains a xed point, but this is not so obvious. This will be a consequence of the
following version of (8.7):
(cb
)P lattice translate of cb
(ad)P ad.
(8.11)
Verication of the rst relation is more dicult than before. That the second
relation holds is immediate because the translation was chosen to make it so. The
rst relation rests on the property of the partition that a, b, c, d are the only points
on the contracting boundary which have equivalent ones on the expanding. There
are no others!
Recall d
= d
)P is a lattice
translate of a and the line
a
, and d
)P contains c
)P lies
on the line
a
)P lies inside,
the segment (cb
)P overlaps c
b.
Now we are in a position to prove that Figure 32(B) cannot occur. Let
a =
(p, q) + v
. Since (
P. Let
l
l
(0,0)
^
c
a
^
^
d
b
^ a
^
( )P d* =
b
v +
Figure 33. Image of expanding boundary of new principal
Markov fundamental region
to b
on the line
(p,q)
. The segment
c
b.
Since d
P = a lies below
b, the parallelogram R
I
P would straddle part of its own
horizontal boundary: namely, the segment
c
= R
k
: 1 k N
is a generator.
Proof, same as Case I, Step 3.
Step 4. Let #(R
i
R
j
) denote the number of disjoint parallelograms in the in-
tersection R
i
R
j
.
#(R
i
R
I
) number of lines x = 0, x = 1, . . . , traversed by (R
i
)P;
#(R
i
R
II
) number of lines y = 1, y = 2, . . . , traversed by (R
i
)P,
which leads to
#(R
I
R
I
) = x-coordinate of (0, 1)P = p;
#(R
II
R
I
) = x-coordinate of (1, 1)P (1, 0)P = q;
#(R
I
R
II
) = y-coordinate of (0, 1)P = r;
#(R
II
R
II
) = y-coordinate of (1, 1)P (1, 0)P = s.
Argument similar to Case I, Step 4, but based on Figure 34. There is a slight
dierence here. The vertical line segment aa
= (, ), then
#(R
i
R
I
) number of lines x = 0, x = 1, . . . , traversed by (R
i
)P;
#(R
i
R
II
) number of lines y = 1, y = 2, . . . , traversed by (R
i
)P.
To get the desired result we observe that the parallelogram (R
i
)P traverses the line
x = m if and only if it traverses the line x = m. Similarly the parallelogram
traverses y = m if and only if it traverses y = m.
SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 51
b
~
~
c
v
+
~
~
a
b
~
~
v
+
(p+r,q+s)
I
R P
b
b
a
a
c
d
~
a
(p,q)
=
v
++
=
=
c
a
b
(0,0)
Figure 34. How the new (R
I
)P and (R
II
)P intersect various fun-
damental regions
Step 5. The transition matrix associated with mapping of the Markov generator
(cb
)P
(ad)P ad.
The argument for the rst relation is similar to that of Case II, Step 1. The
argument for the second is the same as (8.7) of Case I.
Step 2. The family
= R
k
: 1 k N
is a generator.
Proof, same as Case I, Step 3.
Step 4. Let #(R
i
R
j
) denote the number of disjoint parallelograms in the in-
tersection R
i
R
j
.
#(R
i
R
I
) number of lines x = 0, x = 1, . . . , traversed by (R
i
)P;
#(R
i
R
II
) number of lines y = 1, y = 2, . . . , traversed by (R
i
)P,
which leads to
#(R
I
R
I
) = x-coordinate of (0, 1)P = p;
#(R
II
R
I
) = x-coordinate of (1, 1)P (1, 0)P = q;
#(R
I
R
II
) = y-coordinate of (0, 1)P = r;
#(R
II
R
II
) = y-coordinate of (1, 1)P (1, 0)P = s.
52 ROY L. ADLER
R P
b
~
~
~
~
b
~
~
a
~
c
I
R
P
b
=
II
(0,0)
(-p,-q)
a
a = (-p-r,-q-s)
=
Figure 35. How (R
I
)P and (R
II
)P intersect various fundamental
regions for Case III
Argument the same as Case I, Step 4, but based on Figure 35.
Step 5. The transition matrix associated with mapping of the Markov generator
= R
k
: 1 k N
is a generator.
Proof same as Case I, Step 3.
Step 4. Let #(R
i
R
j
) denote the number of disjoint parallelograms in the in-
tersection R
i
R
j
.
#(R
i
R
I
) number of lines x = 0, x = 1, . . . , traversed by (R
i
)P;
#(R
i
R
II
) number of lines y = 1, y = 2, . . . , traversed by (R
i
)P,
which leads to
#(R
I
R
I
) = x-coordinate of (0, 1)P = p;
#(R
II
R
I
) = x-coordinate of (1, 1)P (1, 0)P = q;
#(R
I
R
II
) = y-coordinate of (0, 1)P = r;
#(R
II
R
II
) = y-coordinate of (1, 1)P (1, 0)P = s.
Argument the same as Case II, Step 4, but based on Figure 36.
Step 5. The transition matrix associated with mapping of the Markov generator