Professional Documents
Culture Documents
Codes On Algebraic Curves 9781461371670 9781461547853 1461371678 - Compress
Codes On Algebraic Curves 9781461371670 9781461547853 1461371678 - Compress
Serguei A. Stepanov
Bilkent University
Ankara, Turkey
and Steklov Mathematical Institute
Moscow, Russia
ISBN 978-1-4613-7167-0
© 1999 Springer Science+Business Media New York
Originally published by Kluwer Academic I Plenum Publishers in 1999
Softcover reprint of the hardcover 1si edilion 1999
1098765432 1
A C.I.P. record for this book is available from the Library ofCongress.
AII rights reserved
No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form
or by any means, electronic, mechanical, photocopying, microfilming, recording, or otherwise,
without written permission from the Publisher
Preface
v
vi Preface
I. Error-Correcting Codes
Chapter 1
Codes and Their Parameters 3
1.1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2. Finite Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3. Linear Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.4. Spectrum and Duality . . . . . . . . . . . . . . . . . . . . . . . . . 15
Exercises 20
Chapter 2
Bounds on Codes 25
2.1. Upper Bounds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
2.2. The Linear Programming Bound . . . . . . . . . . . . . . . . . . 32
2.3. Lower Bounds . . . . . . . . . . . . . . . . . . . . . . . . ' . . . . . 35
Exercises 37
Chapter 3
Examples and Constructions 41
3.1. Codes of Genus Zero . . . . . . . . . . . . . . . . . . . . . . . . . 41
3.2. Some Families of Codes . . . . . . . . . . . . . . . . . . . . . . . 46
3.3. Constructing Codes from other Codes . . . . . . . . . . . . . . . 60
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
ix
x Contents
Chapter 4
Algebraic Curves 71
4.1. Algebraic Varieties . . . . . . . . . . . . . . . . . . . . . . . . . . 71
4.2. Non-Singular Curves . . . . . . . . . . . . . . . . . . . . . . . . . 77
4.3. Divisors on Algebraic Curves . . . . . . . . . . . . . . . . . . . . 80
4.4. The Riemann-Roch Theorem . . . . . . . . . . . . . . . . . . . . 85
4.5. Hurwitz and Plucker Genus Formulas . . . . . . . . . . . . . . .. 93
4.6. Special Divisors . . . . . . . . . . . . . . . . . . . . . . . . . . .. 96
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Chapter 5
Curves over a Finite Field 103
5.1. Rational Points and Divisors . . . . . . . . . . . . . . . . . . . .. 105
5.2. The Zeta-Function of a Curve . . . . . . . . . . . . . . . . . . . . 111
5.3. L-Functions of Artin . . . . . . . . . . . . . . . . . . . . . . . . . 120
5.4. Algebraic Function Fields . . . . . . . . . . . . . . . . . . . . . . 130
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 139
Chapter 6
Counting Points on Curves over Finite Fields 143
6.1. The Number of Rational Points on a Curve . . . . . . . . . . . .. 143
6.2. Character Sums . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
6.3. Asymptotics .. . . . . . . . . . . . . . . . . . . . . . . . . . . .. 157
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170
Chapter 7
Elliptic Curves 175
7.1. The Group Law . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
7.2. Thej-Invariant . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
7.3. Isogenies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 180
7.4. Elliptic Curves over Finite Fields . . . . . . . . . . . . . . . . . . 184
7.5. Elliptic Functions ... . . . . . . . . . . . . . . . . . . . . . . .. 186
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
Chapter 8
Classical Modular Curves 193
8.1. Congruence Subgroups . . . . . . . . . . . . . . . . . . . . . . .. 193
8.2. The Curves X(N), Xo(N), and Xl (N) . . . . . . . . . . . . . . .. 195
Contents xi
Chapter 9
Reductions of Modular Curves 219
9.1. Reductions and Moduli Spaces . . . . . . . . . 219
9.2. The Igusa Theorem . . . . . . . . . . . . . . . . 224
9.3. The Eichler-Shimura Congruence Relation . . . . . . 231
9.4. The Eichler-Selberg Trace Formula ...... . 236
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239
Chapter 10
Constructions and Properties 243
10.1. L -Construction 243
10.2. O-Construction ... 245
10.3. Parameters . . . . . . . . 248
10.4. Duality and Spectra 251
Exercises 254
Chapter 11
Examples 257
11.1. Codes of Small Genera 257
11.2. Elliptic and Hermitian Codes 261
11.3. Codes on Fiber Products . . . . . . . 267
11.4. Codes on Classical Modular Curves 274
11.5. Codes on Artin-Schreier Coverings . 276
11.6. Codes on Trace-Norm Curves 284
Exercises 287
Chapter 12
Decoding Geometric Goppa Codes 289
12.1. The Decoding Problem . . . . . . . . . . . . . . . . . . . 289
12.2. The Basic and Modified Algorithms . . . . . . . . . . . 292
12.3. An Improvement of the Modified Algorithm . 301
12.4. Majority Voting for Unknown Syndromes 306
12.5. Faster Decoding . . . . . . . . . . . . 309
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 310
xii Contents
Chapter 13
Bounds 315
13.1. Asymptotic Bounds . . . . . . . . . . . . . . . . . . . . . . . . .. 315
13.2. Constructive Bounds . . . . . . . . . . . . . . . . . . . . . . . . . 316
13.3. Other Bounds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 319
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 321
Bibliography 323
Index 343
Codes on Algebraic Curves
Part I
Error-Correcting Codes
In this chapter the basic notions of the theory of error-correcting codes are in-
troduced: the Hamming distance, parameters of codes, linear codes, encoding
and decoding procedures, spectrum and duality, the Mac Williams identity and
Krawtchouk polynomials.
1.1. INTRODUCTION
Each non-empty subset C <;;:; F n is called a q-ary block code of length n. The
non-negative integer M = Iq is called the cardinality ofthe code C and the non-
negative real number k = logq IC I is called the log-cardinality of C. The minimum
distance of the code C is defined as
3
4 Chapter 1
Define the information rate R and relative minimum distance {) of an tn, k, d)q-
code C as R = k / nand {) = d / n. It is clear that 0 :::; R :::; 1, 0 :::; {) :::; 1.
Let us briefly explain why the codes are called error-correcting. The following
question is essentially one of the central problems of information theory. We con-
sider information presented as a very long sequence of symbols from an alphabet
F. In the sequence each symbol occurs with equal probability. This information
is sent to a receiver over a so-called noisy channel. In the model that we consider
there is a small fixed probability p that a symbol, which is sent over the channel, is
changed to one of the other symbols. Such an event is called a symbol-error and p
is the symbol-error probability. As a result a fractionp of the transmitted symbols
arrives incorrectly at the receiver at the end of the communication channel. The
aim of the coding theory is to lower the probability of error (considerably) at the
expense of spending some of the transmission time or energy on redundant sym-
bols. The basic idea of the theory can be explained in the sentence that follows.
When we read printed text we recognize a printing error in a word because in
our vocabulary there is only one word that resembles (is sufficiently close to) the
printed word.
In block coding the message is split into parts of k symbols. The encoding
is an injective map from Fk to F n (where n ?: k). In other words we take some
[n,k,d)q-code C with integer k and fix an embedding
y : Fk ~ C c;;, Fn.
The transmission is now R = k / n time slower, which justifies the term "information
rate" for R. Instead of the part Z E Fk of the message, we transmit the corresponding
word x = y(z) E C of the length n. On the end of the channel we obtain a distorted
wordx' E F n and we transform it into the nearest word x" E C (i.e., we decode the
message on the maximum likelihood basis). This transformation can be defined
by some decoding map C: F n -+ C. If the number of distorted symbols is at
most l d 21J, then x" = x, i.e., the decoding is correct. The maximum likelihood
decoding is an ideal that is almost unattainable. Usually we just give a map
C: U -+ C, where U is the union of all balls
Bt(x) = {y E F n Id(x,y) :::; t}
X=XIWI +, .. +xvwv
Proposition 1.1. Let F be a finite field. Then F consists ofpV elements, where p
is the characteristic ofF.
6 Chapter 1
Theorem 1.3. For any positive integer v there exists at least one monic irreducible
polynomialf E Fp[u] ofdegree v.
Codes and Their Parameters 7
~(s) = TI(1-N(J)-s)-I,
f
where s = a + it is a complex variable with a = Res> 1 and the product is taken
over all monic irreducible polynomials f E Fp [u]. In view of the uniqueness of
factorization into irreducibles, we have
TI(1 + L
00
where the sum on the right-hand side is over all monic polynomials g in Fp [u] of
positive degree, and then
'"
~(s) = 1 + L ( L N(g)-S).
n=1 degg=n
Since there are exactly pn monic polynomials g E Fp[u] of degree n, the last
relation gives
~(s)
'"
= 1 + LP(1-s)n = (1- pl-S)-I.
n=1
Let I ( v) denote the number of monic irreducible polynomials f E Fp [u] of degree
v. From the definition of the function ~(s) we have
~(s)=
'"
TI(I_p-VS)-I(v)
v=1
and hence
'"
TI(1_p-vS)-I(v) = (l_pl-S)-I.
v=1
Taking logarithms, we obtain the equality
'"
L I(v) log(l - p-VS) = log(l _ pl-S),
v=1
Comparing the coefficients ofp-ns on both sides of this equality, we find that
LvI(v) =pn,
vln
whence, from the Mobius inversion formula, we obtain
I(v) = 2. LJL(m)pv/m,
v mlv
I ifm = 1
,,(m) = { (-It if m = PI ... Pr is the product of distinct primes
o if m is divisible by the square of a prime
The sum
LJL(m)pv/m
mlv
is positive for any p and v ::::: 1, hence I ( v) ::::: 1 for any positive integer v. •
Corollary 1.4. For any prime p and any positive integer v there exists a finite field
Fq with q = pV elements.
Algebraic Structure
Now we study the algebraic structure of finite fields.
Theorem 1.5. If the finite field Fq has q = pV elements then every x E Fq satisfies
the equation x q - x = o.
Corollary 1.7. Any two finite fields having the same number of elements are
isomorphic.
The order of a non-zero element x E Fq is the least positive integer n such that
xn = 1.We note that if x is an element of order n, then the equality xl = xm is
equivalent to I == m mod (n ). In particular, we see that the order of every non-zero
element of the field Fq is a divisor of q - 1. Let us show that every finite field
Fq contains at least one element TJ of order q - 1. Such an element is called a
primitive element of Fq , and the existence of a primitive element implies that the
multiplicative group F; = Fq \ {O} ofthe field Fq is a cyclic group of order q - 1.
Theorem 1.8 (the Gauss theorem). The finite field Fq with q = pV elements
contains cp( q - 1) primitive elements, where cp is the Euler phi-function.
Proof: Let n be a divisor of q -1, and tfJ(n) the number of elements inFq of order
n. Let us assume that tfJ(n) > 0, i.e., that there exists at least one element x E Fq
of order n. The powers 1,x, ... ,xn- 1 of the element x are different and satisfy
the polynomial equation un - 1 = O. Since the number of roots of a non-zero
polynomial does not exceed its degree, these powers exhaust all the roots of the
polynomial un - 1. Hence, every element of order n has the form xm for some
m=O,l, ... ,n-1.
If (m, n) = d > 1, then the element xm is of the order n / d, which is strictly
smaller than n. Now if (m,n) = 1 and if xlm = 1 for a certain positive integer
1< n, then we have 1m == 0 mod (n), which is impossible. Thus, the elementx m
has order n if and only if (m,n) = 1, and therefore tfJ(n) = cp(n).
Now we use the obvious equality
L tfJ(n) = q - 1
nlq-l
L cp(n)=q-1
nlq-l
L (cp(n)-tfJ(n)) =0
nlq-l
Automorphisms
Letf be a monic irreducible polynomial of degree v ~ 1 in Fp[u]. Consider the
finite field Fq = Fp[u]/(f(u)). Denote by 0 the residue class in Fp[u] containing
the polynomial u, and observe thatf( 0) = O. Raising both sides of this equality to
the power p, and using Proposition 1.2 and Theorem 1.5, we find thatf( oP) = O.
Repeating this process several times, we see that the elements oP, ... , oP v - 1 are also
the roots of the polynomialf(u). Let us show that 0,oP, ... ,oP V - 1 are distinct.
Suppose that 1} is a primitive element of the field Fq and 1} = Xo + XI 0 + ... +
Xv_Io v- 1 with~i E~p"'. Ifw~ ass~e.that.oPm ~ oP~ forO::; m < n::; v-I, then
we get the equahty ~ = ~ ,WhICh lmphes ~ -p = 1. We have I ::; pn _pm <
q - 1, and arrive at a contradiction with the definition of the element 1}. Thus, the
irreducible polynomial f E Fp [u] has in the field Fq, with q = P v , the factorization
v-I
f(u) = II (u - oP\
i=1
by the rule
a(x) =xo+xllJP+···+Xv_IO(v-l)p =xP
and leaving the field Fp fixed. The automorphism a is known as the Frobenius
automorphism of the field Fq.
Theorem 1.9. The Galois groupofafinitefieldFq with q = pV elements is a cyclic
group oforder v.
ai(x) =x/.
Since a i ( 0) =f: a k ( 0) for 0 ::; i < k ::; v-I, these automorphisms are distinct and
exhaust all possible automorphisms of Fq (which cannot be greater in number than
v). ..
Codes and Their Parameters 11
Let Gp, be the Galois group of the finite field FpJJ. and Gil be the Galois group
of the field Fpv. The field FpJJ. is a subfield of Fpv if and only if Gp, is a subgroup
of Gil' Taking into account the cyclicity of the groups Gp, and Gil we get the
following result:
Corollary 1.10. The field FpJJ. is a subfield ofthe field Fpv if and only if IL divides
v.
If u is the Frobenius automorphism of the field Fq with q = pll elements, we
define the norm of an element x E Fq as
11-1 11-1
normll(x) = Il ui(x) = Il Xpi.
i=O i=O
In a similar way we define the trace of x as
11-1 11-1
trll(X) = Lui(x)= Lx/.
i=O i=O
The norm and trace are homomorphisms of multiplicative and additive groups of
the field Fq to multiplicative and additive groups of the field Fp, respectively.
The following result is a special case of the Hilbert theorem 90:
Theorem 1.11. Let Fq be afinitefieldwith q = pll elements. Then
(i) the norm ofx E Fq is equal to 1 if and only if there exists a non-zero element
y E Fq such that x = y /yP;
(ii) the trace ofx E Fq is equal to 0 if and only if there exists an element z E Fq
such that x = z -zp.
Proof:
(i) Let e be a generator of Fq over Fp, so that Fq = Fp( e). For every i =
0, I, ... , v-I and every non-zero x E Fq consider the Lagrange-Hilbert
resolvent
R(x, e i ) = e i +xe ip + ... +xl+p+·+pV - 2 eipV - l •
is not zero, at least one of the elements R(x, ei ), 0 ::; i ::; v-I, differs from
zero. Suppose it is the element
(ii) Since
det ( eipk ) i= 0,
at least one of elements tr v(1 ), tr v( e), ... ,tr v( ev- I) differs from zero. Let
trv(f3) i= 0 for f3 = (Ji, and set
(ii) Let g(u) be a monic polynomial in Fqn[u], and assume that g(f3q) = Ofor
every root f3 ofg(u). Then g(u) E Fq[u].
Proof:
(i) Iff(u) = L.aiui, it follows from Proposition 1.2 and Theorem 1.5 that
Algebraic Closure
Let us consider the sequence of finite fields
and set
co
Fp= UFpn!.
n=!
The set Fp is a field. Indeed, for any x,y E Fp there exists an integer n such that
x ,y E Fpn! , hence we can determine the sum x +y and the product xy of elements
x and y. Next, every polynomial g E Fp [u 1has coefficients in some field Fpm, and
iff is its irreducible factor in Fprn [u l, say of degree v, then all roots off lie in the
field Fpmv which is a subfield of Fpn! for a sufficiently large n. Therefore, the roots
of the polynomialf lie in Fp, and hence Fp is an algebraically closed field. The
field F p is called an algebraic closure of the prime finite field Fp.
Now we sum up our discussion of finite fields as follows:
Theorem 1.13. For each prime p and each integer v ~ 1 there exists a finite field
Fq with q = pV elements, uniquely determined as a subfield ofan algebraic closure
Fp. Thefield Fq is the splittingfield of the polynomial
and its elements are the roots of uq - u. Every finite field is isomorphic to exactly
onefield F q. The group of autom orph isms ofthefield Fq with q = pV elements is
cyclic of order v, and the multiplicative group F; of the field Fq is a cyclic group
of order q - 1.
Both for the construction of good codes and for the design of algorithms realizing
coding and decoding procedures, the notion of a code over an arbitrary alphabet
is very poor in algebraic and arithmetical structure. It is possible to enrich this
structure by introducing the notion of a linear code.
Now let F = Fq be a finite field of characteristic p with q = pV elements. In
this case F; forms a linear metric space over Fq called a Hamming space. A q-ary
linear code C of length n is a linear subspace of the linear metric space F;. The
integer
Ilxll = #{ill :::; i:::; n,xi7i O}
14 Chapter 1
is called the weight of the element x = (Xl, ... ,Xn ) E F;. Forlinear [n,k,d]q-code
C we have k = dimC and
y: F; '-+F;.
The matrix G of this map is called a generator matrix of the code C. The map )'
is included into a short exact sequence
(i.e., )' is an embedding, T/ is a sUIjection, and KerT/ = 1m )'). The matrix G has
as its rows k basis vectors of C, hence
acts on the set of such bases, and two matrices G and G' define the same code C
if and only ifG' =A· G for some A E GLk(Fq).
At last we mention a decoding method that is sometimes used in practice. For
high rate codes it is fairly effective. The method is known as syndrome decoding.
Codes and Their Parameters 15
For any x E F; the syndrome is defined as H . x'T. For a code-word the syndrome
is O. A received vector x' with errors in it can be written as x' = x + e, where x
is the transmitted word and e is known as the error-vector. If we pick a certain
error-vector e and add it to all the code-words, the result is a coset of C in F;
and all words in this coset have the same syndrome, namely H . e'T. This means
that any vector in a coset is a candidate for the error-vector of a word in the same
coset. By maximum likelihood decoding we should choo~ this vector so that it
has minimum weight. Decoding now goes as follows. For each coset of C we
pick a member of minimal weight (often this member is unique). This is called the
coset leader. We make a list ofthese coset leaders and their syndromes. When x' is
received, the syndrome is calculated, the leader is found by the table examination
and x' is decoded by subtracting the leader from x'.
From now on we shall focus primarily on linear codes.
'L un-lIxllvllxll.
n
Wc(u: v) = 'LAiUn-ivi =
i=O xEC
i=O
Since in many cases we do not know the precise value of d but have only some
lower bound for it, the following form is rather convenient. Let s be some integer
16 Chapter 1
and that the group of additive characters of Fq is isomorphic to Fq. The character
t/Io (x) == 1 is called trivial one.
For x,y E F; and for a non-trivial additive character 1/1 of Fq define an additive
character
j(x) = L I/Ix(y)f(y).
YEF~
Codes and Their Parameters 17
I ~
L iCY) = -lei Li(x)
yEC.l xEC
Proof: We have
The left hand side of the identity of Lemma 1.14 equals WC.l (u : v). Let us
calculate j(x). If x = (XI, ... ,xn), Y = CYI, ... ,Yn) then taking into account the
equality y;-I = I for Yi i= 0 and relation (1.1) we get
= fI
;=\
(u - vyi- 1 (U + (q - 1)v) \-xi- 1 = (u + (q - 1)v)n-11x l (u - v)lIxll.
It is clear that in this case the dual code C.L is also a code of genus at most g.
It is not difficult to prove (see Exercise 1.1) that the parameters of each linear
[n, k, d] q -code satisfy the relation
k+d~n+1
in the form
s
Wc(u) = un + LBi(U - I);,
;=0
where
Ai
;=n-l
±
= . .(-It+i+j ( n ~ I.)Bj .
The following theorem gives us some information on the value of the coefficients
Bi [90]:
Codes and Their Parameters 19
=q-k ((U+(q-l)V)n+itBi(qV)i(U-vt-i).
we obtain
whence
for 0 ~ i ~ s - 2g + 1,
for s - 2g + 2 ~ i ~ s
20 Chapter 1
Theorem 1.18. For every formally self-dual code C there exists a homogeneous
polynomial P( u : v) such that
Theorem 1.20. Suppose that all the weights of a formally self-dual q-ary code C
are divisible by an integer t > l. Then either C is a trivial [n, n/2,2]q-code and
or (q,t) = (2,2),(2,4),(3,3),(4,2).
EXERCISES
1.1. Use induction on n and v to prove that in a field of prime characteristic p one holds:
(
n
La;
)P = La;".
V
n
i=1 i=1
Codes and Their Parameters 21
l.2. Let Fq be a finite field with q elements andf E Fq[u] an irreducible polynomial of
degreem 2': l. Provethatf(u) divides
uq" - u = TITIfm(u),
minIm
where the inner product is taken over all irreducible monic polynomials of degree m.
If I(m) is the number of irreducible monic polynomials in Fq[u] of degree m 2': I,
deduce that
I(m) = ~ L p,(d)qm/d.
mdlm
104. Let p be a prime number andf E Fp[u] an irreducible polynomial of degree m 2': l.
In the ring Fq[u], where q = pV, prove thatf(u) splits into d = (m, v) irreducible
factors each of which has degree mid.
l.5. LetH be a parity-check matrix ofa linear [n,k,d]q-code C. Show that any (d -I)
columns of H are linearly independent (as vectors in F;-k) and there exists d
linearly dependent columns. Deduce from this fact the validity of the inequality
d:Sn-k+l.
l.6. Let V, W be linear spaces over a finite field Fq. An [n,k,d]q-system is an or-
dered finite family P = {PI, ... ,Pn } of points Pi E V such that P does not lie in a
hyperplane. The parameters of the system P are defined as
(c) If C is a linear [n, k, 2: d]q-code and the minimum distance of the dual code
C-L is at least d-L, then
n-d
Wc(u)=u n + LBi(u-l/,
i=O
Then let
Bi = L (q/(1i. l -I).
1i.c1',I1i.I=i
and make use of this interpretation.)
1. 7. Let IP' = IP'( V) be a projective space (i.e., the space oflines in a linear space V) over
Fq. Aprojective [n,k,d]q-system is a finite unordered family P of points ofIP' which
does not lie in a projective hyperplane. The parameters n, k, d are defined as
Just as for [n,k,d]q-systems, P <;;; IP' and pI <;;; JPf are called equivalent if and only
if there is a projective isomorphism IP' ':::' JPf mapping P onto P'. A linear code
F;
C <;;; is called degenerate if and only if C <;;; C F;-I F;,
where F;-I
is the
subspace of vectors having 0 in some fixed coordinate. Prove that for k > 1, d 2: 1
there is a one-to-one correspondence between the set of equivalence classes of non-
degenerate linear [n,k,d]q-codes and the set of equivalence classes of projective
[n, k, d] q -systems.
1.8. Show that if parameters of an [n,k,d]q-code C satisfy the inequality k+d 2: n+ I
then k+d = n + I and (n - k) +d-L = n + 1, i.e., Cis MDS-code (or the code of
genus 0).
1.9. Let C be a binary linear [n,k,d]-code with dual [n,n - k,d-L]-code C-L and let
. z
Pi(z)=;;;'o(-IY.
i C) ( ) n-z .
i-j ,/=0,1,2, ... ,
Codes and Their Parameters 23
±Afun-ivi = ;k ±Ai(U+V)n-i(u-v)i
i=O i=O
show that
(a) Af = fr r.j=OAjPi(j), 0 ~ i ~ n;
Bounds on Codes
We have already explained that a good code should have large din and kin in the
unit interval [0, IJ for a given n. From Shannon's theorem we know also that we
should study long codes. However, if the channel has symbol-error probability
p, then we should expect an average of pn errors per received word of length n.
To correct these we need to have a minimum distance more than 2pn. So, if we
increase n, then d should increase proportionally.
For the set of all [n,k,dJq-codes over the field Fq we define the set Vq by
Vq =
I
{( 8,R) E [0, If there exists an [n,k,d]q-code with din = 8 and kin = R}
and denote by Uq the set oflimit points of Vq. It is clear that (8,R) E Uq if and
only ifthere exists an infinite sequence of distinct [ni, k i , d;J-codes Ci with different
8i = ddni andR i = kdn; such that
° °
If 8 > and R > such a sequence of codes Ci is called asymptotically good (or
simply good).
A description of the set Uq is provided by the following theorem of Manin [119]
(see also [1] and [208, p. 68] and Exercises 2.8-2.10): there exists a continuous
function CXq ( 8) such that
25
26 Chapter 2
and tells us something about the information rate of long [n,k,d]q-codes with
din = 8.
Ifwe restrict ourselves to consideration of only linear [n,k,d]q-codes then we
can define in the same way the sets VJin and u!in (taking into account only points
(8,R) associated to linear q-ary codes). In this case there exists a continuous
function cy~n(lj) such that
The study of functions CYq ( 8) and cy~n (8) is one of the central problems of
the coding theory. At the present time we know only a few of the simplest facts
concerning the structure of these functions. So, we are unable to solve even the
following problems:
Problem l Are the functions CYq(8) and cy~n (8) differentiable in the interval
(0, (q -l)/q)?
Problem Il Are these functions convex?
Problem III Is it true that cy~n (8) = aq (8), or not?
Therefore we are constrained to search some upper and lower bounds for the
functions CYq ( 8) and cy~n(8) as close to each other as possible.
For the set of all [n,k,d]q-codes overFq we define the functionAq(n,d) by
k:::;n-d+l.
Bounds on Codes 27
a q (8) ::; 1- 8.
n>2: r--:-d 1
k-J
- ;=0 ql
and hence d' ~ I~l Iterating this operation k times we get an [n(k) ,O,d(k)]q-
system with
n(k) = n -d -d' - ... ::; n - 2: rid 1.
k-J
i=O q
The condition n(k) ~ 0 proves the theorem.
•
Corollary 2.4 (the asymptotic Griesmer bound).
Set mij = I{x = (XI,'" ,xn ) E ClXi =j}1 and note that
L mij =M
jEFq
for every i = 1,2, ... ,no Let SjI be the Kronecker symbol. Then we find (using
Cauchy--Schwartz inequality)
n
M(M -1)d ~ L d(x,y) = L L (1- SxiyJ
XJ'EC i=lxJ'EC
=n q - 1M2.
q
This completes the proof.
•
Corollary 2.6 (the asymptotic Plotkin bound). We have
and
q-l
CXq(S)~Rp(8)=1-~18 for 0<8<--.
- q
q-
k d
M=q <
- d -n'L2.
I
q
This estimate does not look very useful because we do not expect d to be so large.
However we already have a result for cxq ( 8) from this inequality, namely
q-l
--<8<1.
q - -
Bounds on Codes 29
To make use of the inequality for smaller values of 8 we define the length n' :;=
l (dq~'lq Jand note that n' < n. We consider the last n - n' symbols of all the code-
words. There is a subset of M' code-words ending in the same n - n' symbols,
where
M' ;::: qn I -n M = qn I -n+ k .
For this subset the inequality derived above also holds, i.e.,
d
< M' < < d.
I
q n -n M
- - d-n,0-
q
Hq(O) =0,
q-l
Hq(z) =zlogq(q -I) -zlogqz - (l-z) logil-z), 0<z:::;--.(2.1)
q
The following lemma can easily be proved using Stirling's formula (see [115,
§5.1)):
( d) qn Aq(n,d, w)
Aq n, :::; ( ) .
n (q -1)w
w
I ifxECandd(x,y)=w
X(x,y) = { 0 otherwise (2.2)
L X(x,y):::; LAq(n,d,w)=qnAq(n,d,w)
x,yEF3 YEFQ
l
Lemma 2.12 (S. M. Johnson). We have
Aq(n,d,w)::::: d
d-2w+ (q-I)n
~j.
Proof: Let C be an [n, k, dq l-constant-weight code of cardinality M = qk which
attains the bound Aq(n,d, w); thus M = Aq(n,d, w). The minimum distance d
cannot exceed the average pairwise distance between the elements of C and hence
M(M -l)d::::: L d(x,y).
x,yEC
Set again mij = I{x = (XI,··· ,xn ) E C IXi = j}1 and note that
n n
LmiO = (n -w)M, L L mij=wM.
i=1 i=IJEF~
l ~ Aq(n,d) ~ (n)
(q - I)wr
w
and hence
a (8)~RBd8)=I-H q-I-q-IF1J;8)
(- - 1-- .
q q q q q-I
Proof: For din ---+ 8, 10gqAq(n,d)ln = kin ---+ R and win ---+ w. Theorem 2.13
yields
2 q-l q-l
w -2--w+--8>e>O.
q q-
Tending e ---+ 0 and choosing the largest w ~ 1 with this property, i.e.,
Many ofthe best upper bounds for Aq (n, d) known at present are based on a method
which was developed by Delsarte in 1973. The idea is to derive inequalities
that have close connections to the MacWilliams identity and then to use linear
programming techniques to analyze these inequalities.
Bounds on Codes 33
Pi (u) = ~) -1 Y(q -
j=O
l)i-J (~)
}
(~I =}~)
(see Exercise 1.9). Note that
(1 + (q - l)zY-U(I-zY = LPi(U)zi.
i=O
Since Af are the coefficients of Wc~ (u : v) they are non-negative integers, i.e., for
anyj= 1,2, ... ,n
n
LAiPj(i) ~ o.
i=O
We want to give an upper bound for
n
l= 1+ LA i ,
i=d
Solving the dual problem and using simplest properties of the polynomials P; (x)
(see Exercise 2.11) we obtain the following statement (see [115, §5.3) and [118,
Ch. 17, §4]):
Theorem 2.15 (the linear programming bound). For a given set ofnon-negative
real numbers al,··· ,an such that
n
1+ L a;P;(j) ~ 0, d ~j ~ n,
;=1
In other words, if
n
f(x) = 1 + L a;P;(x)
;=1
f(j) ~ 0,
then
Aq(n,d) ~f(O).
The advantage of Theorem 2.15 is that any polynomial f(x) satisfying the
conditions of the theorem yields a bound for A(n,d) whereas in the above men-
tioned inequality one has to find the optimal solution ofthe corresponding system.
Note that this result can be sharpened if we apply the linear programming method
to the constant-weight (spherical) codes and then use Lemma 2.11. The linear
programming bound can also be used to get asymptotic upper bounds, but one has
to apply a rather subtle technique which does not fit into the frames of this book.
We restrict ourselves to the formulation of corresponding results (see [118, Ch. 17,
§7) and also [112, 1l3]):
(q-l)-8(q-2)-2 J (q-l)8(1-8))
a q(8) ~ R/p(8) = Hq ( .
q
where
Here we stop to discuss upper (i.e., possibility) bounds and pass to existence
bounds.
Suppose C is a code of length .1'/ over Fq with minimum distance d and suppose
that it is not possible to find a vector not in C that has distance at least d to all
code-vectors in C. Then clearly
Suppose now that we consider only linear codes in F;. We claim that we find
a result as good as Theorem 2.18:
qn-k+l> L
d-l ( ~ ) (q _1)i,
i=O 1
Proof: For k = 0 the assertion is trivial. Suppose the inequality holds for k - 1
and that we have a linear [n, k - 1, d]q-code C. By the proof of Theorem 2.18 there
36 Chapter 2
R
/ ~ - - - - -R p
R
A~
/ - - - - - -RBE
I
I
R- -
GV
q-l
2q
Figure 2.1.
is a word x' E F; that has a distance at least d to all the words of C. If x E C and
a E F;,then
(i) The parameters of almost all linear codes lie on the curve Rov( 8).
EXERCISES
lim .!.lolL
n-+oo n ""I
(f (~)(q_1)i)
i=O I
=Hq(c5).
L iXi == m
n-I
mod(n)
}
.
i=1
Show that
nw - 1
A2(n,4,w)~-,- as n-+ oo .
w.
2.3. Show that
(~)A2(n,21) ~ 2nA2(n,21, w).
2.4. Check the following facts:
(a) On the segment [0, (q -l)/q] the curveRGv(c5) is differentiable (of class COO)
and convex;
(b) RGv(O) = 1, RGv«q - l)/q) = 0 and for c5 -+ 0 there is the asymptotical
equality
RGv(c5) = 1 +c51o~ c5+o(c51ogq c5).
In particular, the tangent at c5 = 0 is vertical;
(c) For c5 -+ (q - 1)/q there is the asymptotical equality
q
RGV ( -q--W
-1) = 2(q-1) logq W +o(w).
q2 2 2
2.8. Suppose that there exists a linear [n,k,d]q-code C. Prove that it is possible to
construct a linear code with parameters [n + I, k, d]q, and if k ~ 1, n > d ~ 2 then
also linear codes with parameters [n -I,k-l,d]q, [n -I,k,d -1]q, [n,k-l,d]q
and [n,k,d -I]q.
2.9. State and prove the similar result for non-linear codes. (Hint: In this case (k - 1) is
changed by logq(lqk-I J).)
2.10. Prove that the curve R = aq(a) is continuous on the segment [0,1]. Show that it
satisfies the conditions Clq(O) = l,aq(a) = 0 for (q -1)jq ~ a ~ I and decreases
on the segment [0, (q -1)jq]. (Hint: Use Exercises 2.8,2.9 and Corollary 2.6.)
2.11. Prove that Krawtchouk polynomials
(a) Pi(U)=L]=O(-qy(q-l)i-J(~=j)0);
(b) Pi(U) =L]=o(-I Yi/-J (n-;+j) (~=;);
(c) Pi(U) is polynomial of degree i in u, with leading coefficient (-q)iji! and
constant tenn (~) (q - l)i;
Bounds on Codes 39
then
n
ai = q-n If(j)Pj(i).
j=O
Chapter 3
Recall that a code ofgenus zero (or anMDS-code) is an [n,k,d]q-code C such that
k+d=n+l.
Trivial Codes
For every n there are three simplest q-ary codes of genus zero which are naturally
called trivial. These are:
(ii) [n, n - 1, 2]q-code CI = { (XI, ..• ,xn ) E F~ I 2.7= I Xi = O}, called the parity-
check code;
(iii) tn, l,n]q-code C2 = {x = (XI, .•. ,xI) E F~}, called the repetition code.
41
42 Chapter 3
Reed-Solomon Codes
Let :P = {al, ... , an} ~ Fq be a subset of cardinality n. Consider a linear space
L (m) of all polynomials in one variable of degree at most m with coefficients in
Fq; its dimension over Fq is dimL(m) = m + l. For n > m a non-zero polynomial
f(u) E L(m) cannot vanish at all points of:P. Moreover, it has at least (n - m)
non-zero values at points of the set:P. Hence ifn > m, the evaluation map
and denote by fl( m) the linear vector space spanned by the rational functions
g,(u), 0 ::; I ::; n - m - 2. To construct e..L it is convenient to use the so-called
residue map from fl( m) to F q .
Consider a rational function F(u) = f(u)go(u ),J(u) being a polynomial, and
recall the definition of the residue of F at ai:
Res",J (u) = f (ai) TI (ai - aj )-I .
fl.i
Proposition 3.1 (the residue formula). If degf ::; n - 2 then
n
I Res",J(u) = o.
i=1
n n
= (_1)n TI (Uk-U/)Lf(Ui)TI(Ui-Uj)-1 (3.1)
19<k~n i=l j=l
I#
and then
n n
L!(Ui)TI(Ui-Uj)-1 =0,
i=l j=l
Hi
as required.
•
Proposition 3.2. The dual code Cl. for the Reed-Solomon code C ofdegree m is
the image offi(m) under the residue map
Resa;g(u) =Yih(Ui),
where
n
Yi = TICUi - Uj)-l,
j=l
Hi
i.e., Cl. can be obtained from a Reed--Solomon code C' = Ev(L(ml.)), ml. =
n - m - 2, by multiplying the ith coordinate of all its vectors by Yi E F; . We
write Cl. = Y X C', Y = (YI, ... ,Yn) E (17;)n, and call such a code generalized
Reed-Solomon code. It is equivalent to a Reed--Solomon code in the sense of the
definition in Section 1.2. •
44 Chapter 3
Ai= (~X~(-IYG)(qi-d-j+I-I)
= (~)(q-I)~~(-IYCj l)qi-d-j .
Since for any x = (XI, ... ,xn ) E C the corresponding sum is equal to zero (the
matrix (~) is a parity-check matrix of the code C) we have
Sj = L.ei~'
iEI
which is called the error-locator polynomial. We explain that the coefficients are
uniquely determined as a solution of the following system of linear equations
t
LZISj+1 = 0,
/=0
Examples and Constructions 45
with respect to indeterminates ZI. Indeed, since 0'( ai) = 0 for any i E 1, and
we see that (zo, ... ,Zt) is a solution of the above system. Now, if (zo,'" ,z~) is
some other solution and
t
O"(u) = L.Z[u l ,
1=0
then, setting
t-l
OJ(u) = I1(U-ai) = L.ZkjUk
iEI k=O
i#
for any j E 1, we obtain
ejOj(aj)O"(aj) = L.eiOj(ai)O"(ai)
iEI
t-l t-l t
= L. L. eiZkjarO"(ai) = L. L.zkAL.eiaf+ 1
iEI k=O k=O/=O iEI
t-l ( t )
= L. Zkj L.Z[Sk+1 = O.
k=O 1=0
Thus O"(aj) = 0 for eachj E 1, i.e., (zo, ... ,Zt) is the only solution (up to a
multiplicative constant).
Decomposing O'(u) into linear factors we find now the set of error locators
1. Finally, to find error-vector e = (el, .. . , en) itself it is sufficient to solve the
following system of linear equations
0::;j::;t-1,
with respect to ei. We note again that the elements ei we are looking for are
uniquely determined by this system. Indeed, if {en is another solution, then
L.(ei -eDa{ = 0,
iEI
and hence e - e' E C. But the weight of the vector e - e' is at most 2t ::; d - 1.
The contradiction we obtain shows us that e is determined uniquely.
46 Chapter 3
°
° °
A non-zero fonnj E L~ has at most qm zeros in F:;+l (recall that 1$ :P and if
j(Yi) = for Yi E P then alsoj(aYi) = for all a i= 0, 1). Therefore the number
of zeros of j in :P is at most (qm - 1) j (q - 1). The maximum cardinality of P
is (qm+l - l)j(q - 1) (take for example all non-zero elements of F:;+l such that
their first non-zero coordinate is 1). We obtain an [n,m + 1,n - q;_ll]q-code for
n S; (qm + 1 - 1) j (q - 1), in particular a code C with parameters
qm+l -1 ]
[ q _I,m + l,qm q
This is a very good code which lies on the Plotkin bound. It is not difficult to
check (see Exercise 3.2) that the weight enumerator of the code C has the fonn
Examples and Constructions 47
Hamming Codes
The Hamming single-error-correcting codes form an important family of linear
codes which are easy to encode and decode. The Hamming code CH can be
defined as dual to the Ree~Muller code C: CH = Cl.. Theorem 1.17 makes it
possible to find out the spectrum of CH (see Exercise 3.2). The spectrum shows
that d 2: 3. This can be seen also without knowing the weight enumerator. In fact,
the parity-check matrix of CH has no proportional columns (if for Yl,y2 E F:;+l
all linear forms are proportional andf(yJ) = Otf(Y2), thenYI = OtY2). Hence any
two columns are linearly independent and d 2: 3 (see Exercise 1.5).
So we have constructed a family of codes with parameters
[n,n-m-I,2: 3], n::; (qm+l_l)/(q_l),
and for n = (qm+ I - I) / (q - I) we know spectra of such codes. These codes are
good enough if we are interested in codes with d = 3. For n = (qm+l - I) / (q - I)
they lie on the Hamming bound.
Ev: L~(r) -+ F;
is an embedding and the code C = Ev(L~ (r)) has parameters n = qm,
k = dimL~(r) = L~J () ( m -
Lr L(-l)i"! ='-fJj
I· .) ,
j=Oj=O ] m I
and
d = (q - T)qm-u-l.
Cyclic Codes
A linear code C ~ F n is called cyclic if it is invariant with respect to the cyclic shift
of coordinates, i.e., (Xl, ... ,Xn) E C yields (X2,'" ,Xn ,Xl) E C. Note that cyclicity
is not an invariant of the equivalence class of codes.
From now on we make the convention (n,q) = 1. To describe cyclic codes
algebraically we observe that F; as vector space is isomorphic to the ring Rn =
Fq[u]/(u n -1), if only we ignore the multiplication in this ring. We now identify
the vector (Xl, ... ,Xn) E F; with the corresponding polynomial Xl + X2U + ... +
xnu n- l . Observe that multiplication by u now is nothing but a cyclic shift of the
vector. Since a cyclic code is linear by definition, we have:
Proposition 3.6. ifF is a field then the residue class ring F[u]/(u n - 1) is a
principal ideal ring and every ideal is generated by a divisor of un - 1.
un -1 =fi(u) "'Is(u)
is a generator matrix for the code C with generator polynomial g( u ) and one easily
checks that
o hk ... hI hO)
...... ho 0
ho o 0 0
is a parity-check matrix for C. We call h (u ) the parity-check polynomial. Observe
that the code with h (u) as generator polynomial is equivalent to C~ (obtained by
reversing the order of the n symbols). So C~ has generator polynomial ukh(u- I ).
Let C be a cyclic code with generator g( u) =fi (u) .. ·ft (u) and let ai be a root
of.li (u), 1 ::; i ::; t, in algebraic closure Fq of the field F q . Denote by Fqm the
splitting field of the polynomial un - 1, i.e., the smallest extension of F q , where
un - 1 decomposes into a product of linear factors (see Exercise 3.4).
If degfi (u) = mi then ai E Fqm; C Fqm and
Besides, each ai can be interpreted as column-vector (ail ... , aim r of its coordi-
nates in a basis of Fqm over F q .
Now we consider the t x n matrix over Fqm:
a 2I al
H~(i
al ,-1 )
a2 a 22 a 2n-I
a t2 ... n-I
at at
BCH-Codes
We now come to a generalization of Hamming codes, the so called BCH-codes
(discovered by Bose, Ray-Chaudhuri, and Hocquenghem). Let a be a primitive
nth root of I in an extension of F q . Let g( u) be the least common multiple of the
minimal polynomials of ai, a l + I, ... , a l +t - 2. The cyclic code of length n over Fq
with generator g( u) is called a BCH-code with designed distance t.
From now on we restrict ourselves to the case I = 1 (narrow-sense BCH-codes).
If n = qm - 1, i.e., a is a primitive element of Fqm, the code is called a primitive
BCH-code.
Theorem 3.7. The minimum distance d ofa BCH-code C with designed distance
t is at least t (this is called the BCH-bound).
Proof: As we saw earlier, a vector x = (XI, ... ,xn ) is the code-vector if and only
if
a2
~
a
a2 a4
Hx' ( i
at-I a 2(t-l) a(n-I)(t-I)
The point of the algorithm is that the first 2t coefficients on the right-hand side
are known, because e(a i ) = y(a i ) for 1 :S;j :s; 2t by definition of the code. So, if
we write
21
S(t) = Ly(ci)zi- 1 ,
j=l
we now have to find the unknown polynomials a(z) and w(z) about which we
know that
w(z) == a(z)S(z) mod (z21).
We now perform Euclid's algorithm to calculate the g.c.d. of S(z) andz 2t . The
algorithm starts with
S(z) = 0·z2t + l·S(z)
and produces a sequence of equations
where the degree of rn(z) decreases until the g.c.d. is reached. Clearly the pair
rn(z), vn(z) satisfies the congruence
When for the first time rn (z) has degree less than t we have found the required
pair up to a constant factor (which is determined by the fact that a(O) = I).
52 Chapter 3
Quadratic-Residue Codes
The quadratic-residue codes (QR-codes) CR, C~, CN, C~ are cyclic codes of prime
length lover a field Fp , where p is an another prime which is a quadratic residue
modi. The codes CR, CN are equivalent ones with parameters [I, I~I,~ 11/2]p,
while C~ and C~ are equivalent codes with parameters [I, /;1, ~ 11/2]p. These
codes have an information rate close to 1/2, have large automorphism groups and
tend to have high minimum distance (at least ifp is not too large).
Let R denote the set of quadratic residues mod I, and N the set of non-residues.
Since pER, the sets Rand N are closed under multiplication by p. Hence, if a is
a primitive Ith root of I in some extension of Fp , the polynomials
R(u) = TI (u - a r ) andN(u) = TI (u - an)
rER nEN
have coefficients in Fp , and also
ul -I = (u -1)R(u)N(u).
The quadratic-residue codes CR, C~, CN, C~ are cyclic codes (or ideals of the
ring Fp [u] / (u l - 1» with generator polynomials
R(u),(u -1)R(u),N(u), (u-I)N(u),
respectively (see also Exercises 3.6-3.9). The permutation of coordinates in
Fp[u]/(u l -, 1) induced by u H un for a fixed non-residue n interchanges CR
and CN, and also q and C~, so that these codes are equivalent. Clearly CR :J C~
and CN :J C~. Besides, we have
cf = C~, ck = C~ if I = 4k - 1,
cf=c~, ck=c~ if 1=4k+1.
Theorem 3.8. Ifd is the minimum distance olCR or CN, then d 2 ~ I. Furthermore,
ifl=4m+3, thend 2 -d+1 ~/.
Proof: Let x (u) be a code-vector of minimum non-zero weight d in CR. If n
is non-residue, x'(u) =x(u n) is a code-vector of minimum weight in CN. Then
x(u)x'(u) must be in CR n CN, i.e., must be a non-zero multiple of
I-I I-I
TI(u _a r ) TI (u - an) = TI(u-a i ) = Lui,
rER nEN i=1 i=O
and hence x(u)x'(u) = y(I.::J for some y E F;. Thus x(u)x'(u) has weight
ui )
I. Since x (u) has weight d, the maximum number of non-zero coefficients in
x(u)x'(u) is d 2, so that d 2 ~ I. If 1= 4m + 3, we can take n = -1. Now in
the product x(u)x(u- I ) there are at most d 2 - d + 1 non-zero coefficients, so that
I ::; d 2 - d + 1. •
Examples and Constructions 53
Alternant Codes
Altemant codes are a large family of codes obtained by a small modification of
BCH-codes (see [118, Ch. 12]). They are also closely related to the generalized
Reed-Solomon codes. Let C be a generalized Reed-Solomon [n,k,d]qm- code
over Fqm. The code C' = Cl. n F; is called an alternant code. The parameters
[n,k',d'] of the code C' satisfy
k'~n-m(d-l), d' ~ d.
Altemant codes form a very large class of codes, and the extra freedom in the
definition is enough to ensure that some long altemant codes meet the Gilbert-
Varshamov bound, in contrast to the situation for BCH-codes.
n-l n-l
= Lzi LXi+l(ai+l)i =zd-lf(z)
j=O i=O
i.e.,
~ ~ = zd-lf(z) (3.2)
i=O z - a-I zn - 1
for some polynomialf(z) and vice versa, i.e.,x = (Xl, ... ,Xn) is in the code ifand
only if the left-hand side of(3.1) written as a rational function a(z)/b(z) has a
numerator divisible by zd-l. We now generalize this as follows.
Letg(z) be a monic polynomial of degree t overFqm andletP = {al, ... ,an} C
Fqm. We require that g( ai) =1= 0, 1 ~ i ~ n. The rational Goppa code C with
Goppa polynomial g(z) is defined as
(3.3)
where the right-hand side is the unique polynomial r(z)modg(z) such that (z-
a)r(z) == I mod (g(z)). From (3.1) we see that if we take g(z) = zd-I and
!P = {a- i I0 ~ i ~ n - I}, where a is a primitive nth root of unity, then the
rational Goppa code C is the narrow-sense BCH-code of designed distance d. We
remark that not all BCH-codes are also rational Goppa codes. .
We can also interpret (3.2) as follows. Consider the vector space of rational
functionsJ(z) with the following properties:
(i) J(z) has zeros in all the points where g(z) has zeros, with at least the same
multiplicity;
(ii) J(z) has no poles, except possibly in the points al, ... ,all and then of order
1.
Consider the code over Fqm consisting of all the vectors (ResaIJ, ... ,ResaJ).
The rational Goppa code C is the "subfield subcode" consisting of all vectors in
the code with coordinates in F q .
We shall find now a parity-check matrix for C. Let
t
g(z) = LgkZk.
k=O
Then
g::::..(,--,z)'-----==g:..:. (u-,-) =
z-u
so we have an easy expression for the polynomials on the right-hand side of (3.3).
By (3.2) we must have, with hi = g-I (ai), 1 ~ i ~ n,
II
LXihi L gk+j+l(ai)izk = 0,
i=1 OSk+j~t-1
i.e., the coefficient of zk is zero for 0 ~ k ~ t - 1. We see that x = (XI, ... ,XII)
must have zero inner product with the rows of the matrix
hlgt h2gt
hI (gt-I + gtaJ) h2(gt-1 +gta2) ... hllgt )
( hn(gt-I:+gtan) .
h2 a 2t-I
Examples and Constructions 55
±~
i=1 Z-Cii
±~
z Ci
i=l - ;
~±(n)(q_1Y
t s=o s
~
t
± (n)(q_1 Y
s=o s
is less than the total number of monic irreducible polynomials of degree t over
Fqm, which is known to be
1
_qmt(1 +0(1)).
t
So we find a sufficient condition (after taking d = l8n J and n -+ 00)
mt
Hq(8) < - +0(1), m -+ 00.
n
From Theorem 3.9 we know that the codes we are considering have information
rate R 2: 1 - mt In. So we have proved the following result:
Theorem 3.11. There exists a sequence ofrational Goppa codes over Fq that have
information rate tending to 1 - Hq (8) (i.e.• the rate tends to the Gilbert-Varshamov
bound).
We note that the decoding method that we discussed for BCH codes can
be generalized also to Goppa codes. Consider the received word y = x + e,
x E C,e = (e\, ... ,en), and define
S(z) = f~
i=\ Z - Clti
(using the convention (3.3)). By (3.2) we can calculate S(z) fromy = (y\, ... ,Yn).
Now we again define error-locator and error-evaluator polynomials by
Justesen Codes
Consider a qm_ ary Reed-Solomon [n,k,n + 1- kJqm-code C, and let
2 ... ,a n-I) ,
y= ( 1,a,a,
i.e.,
and consider C' as aq-ary code. One can prove (see Exercise 3.5) that the Justesen
code C' has parameters
[2mn,mk, ±(2~)
i=1
i
1
(q - I/]q,
LI (2m)
. (q-I)i::;n-k+l.
i=1 I
The Justesen codes form a class of asymptotically good linear codes (see van Lint
[115, §9.2]).
Golay Codes
The Golay codes, of all codes, provide probably the most interesting examples for
both practical and theoretical reasons. These codes can be described in several
different ways.
Consider the field F 2 11. We have 211 - 1 = 23 . 89 and hence the roots of unity
of degree 23 lie in this field. Let a E F211 be such a primitive root and let
U23 +1= (u + 1)
x(u ll +ulO + u6 +u 5 +u4 + u2 + I)(u ll + u9 +u 7 +u 6 +u 5 +u + 1),
58 Chapter 3
(it depends on the choice of a). The minimum distance of C23 is much larger than
might be expected.
The full automorphism group of the Golay code C23 is the 4-fold transitive
Mathieu group M23 of the order 23 ·22·21 ·20 . 48 = 10200960. The spectrum of
C23 has the form
A similar situation takes place over F3. Consider the field F35 and observe that
35 - 1 = 11 . 22. Let a E F35 be a primitive root of degree 11 and let
We note also that the extended Golay codes C24 and Cl2 are self-dual.
Examples and Constructions 59
Perfect Codes
A code C is called perfect if d = 2t + 1 and F; is the union of balls of radius
t centered at code-vectors. It is clear that this property depends only on the
parameters [n,k,djq. The set of all perfect codes is characterized by the following
theorem (see [14, p. 252]):
Group Codes
The notion of a cyclic code can be extended to the case of an arbitrary finite group
G.
Recall that a group algebra of G is an algebra of functions
(ft!2)(g) = L./i(h)h(h-Ig).
hEG
The group G operates on Fq[Gj from the right: (fg)(h) = f(hg). For a subgroup
H <;;; G, the invariant space is defined as
Every G-space Fq[G/H] has a natural basis./i, ...j/, where I = [G: Hj, and the
functions fi have the property
ifi =j
if i 1= j ,
Many new codes can be obtained by combining and modifying previously con-
structed codes. In this section we shall describe several methods, starting with
those that do not change q.
Direct Sum
Let CI ~ F;I and C2 ~ F;2 be [nl,kl,d!]q and [n2,k2,d2]q-linear codes, respec-
tively. Their direct sum
C = C I EB C2 ~ F;I +n2
is the set of vectors x = (Xt,X2), where XI E CI andx2 E C2. Clearly C is a linear
[nl +n2,kl +k2,d]q-code with d = min(dl,d2). We can also consider direct sums
of any finite number of codes. If all these codes are equal, we obtain the power c m
of the original code C. IfC has parameters [n,k,d]q we get an [mn,mk,d]q-code.
Tensor Product
The tensor (or Kronecker) product
oflinear [nl ,kl ,dt]q and [n2,k2,d2]q-codes CI ~ F;I and C2 ~ F;2 respectively is
defined to be the [nln2,klk2,dld2]q-code whose code-words consist of all nl x n2
matrices in which the rows belong to C 1 and columns belong to C2 (the set of such
matrices can be identified with a linear subspace of F;ln2).
Examples and Constructions 61
Unfortunately tensor product codes usually have poor minimum distance (but
are easy to decode).
Pasting
Let CI and C2 be [nl,k,ddq and [n2,k,d2]q-linear codes, respectively, given by
encoding maps
4>1 : F; -+ F;l and
It is natural to consider the diagonal map
(x,x +y)-Construction
Let linear [n,kl,dd q and [n,k2,d2]q-codes Cl ~ F; and C2 ~ F; lie in the same
vector space F;. Define
C={(X,X+y)IXECl, yECz}.
Overall Parity-Check
Let C ~ Ff be a linear [n,k,dlz-code with odd d. We form a new linear code
- z+ as
C' CF n 1
Since every code-vector now has an even weight, we see that C' is an [n + I, k, d +
l]q-code.
Now we pass to the other type of constructions, where we change q. We start
with the case oflinear codes.
Examples and Constructions 63
Subfield Restriction
Let Fq, be a subfield of Fq,q = q'm, and C be a linear [n,k,d]q-code. Set
C'=CnFq'n CF
-
n
q'
and note that C' ~ F;, is a linear Fq,-subspace. Hence C' is a linear [n,k',d']q,-code
over F q, and clearly
k' ?:. n - m (n - k), d' ?:. d.
This construction makes sense when applied to linear codes with an information
rate R = k / n close to 1.
Concatenation
Let Co ~ F;P be a linear [no, m, do ]q,-code given by an encoding map ~ : F:;: -+ F;P
and C ~ F; , q = q'm, a linear [n, k, d]q -code given by an encoding map cP : F; -+ F;
(we call Co the inner code, and C the outer code). Define a new code C' ~ Fn,on
with the help of the composition of maps q
mk ~ Fk cf> Fn ~ (Fm)n
Fq'-+qY (</>0, ... ,</>0) Fnon
q -+ q' Y q"
and call it the concatenation ofC and Co. One can show (see Exercise 3.11) that
C' has parameters [non,mk,dod]q"
When Co is an [m,m, 1]q,-code, this construction is called the field descent,
and the parameters are [mn,mk,d]q"
Now we are going to present two essentially non-linear constructions. Let C
be a [n,k,d]q-code, with k E IR, q E Z, q ?:. 2.
Alphabet Extension
Let q ::; q', and C be an [n,k,d]q-code. Let us embed an alphabetF of cardinality
q into an alphabet F' of cardinality q'. Now if we consider an embedding C Y
F n YF'n, we get an [n,klogq,q,d]q,-code.
Alphabet Restriction
Let, vice versa, q' ::; q. We embed F' into F and make F an abelian group (setting,
for example, F = Z/ qZ). Consider all qn shifts Cy of the code C by vectors
y E Fn, Cy = {y + x Ix E C}. In the totality of sets Cy each vector z E F n appears
exactly M = Ie! = qk times. Consider all intersections F'n n Cy . There are qn of
them and their total cardinality is M q'n. Hence there exists Cy such that
I
IF'n n Cy ?:. M ( ~) n
64 Chapter 3
Since the shift does not change the minimum distance, we have obtained an
[n,2: n - (n - k) logql q, 2: d]ql-code.
Decoding
In the process of combining or modifying previously constructed codes one must
also provide an easy decoding procedure for the resulting code, supposing we know
decoding algorithms for the codes we start with. In general the constructions
presented above pose no problems in finding a fast decoding algorithm. For
example, if there is a fast decoding algorithm of a q-ary code C ~ F;, the same
algorithm decodes the field restriction C ' = C n F;, F;,. ~
A more serious difficulty arises in the case of concatenation. To clarify the
situation we need the following notion: an erasure is an error whose position we
know. Suppose we have transmitted x E C and received y E F; , Y = x + e + e',
1= {i 11 :<:; i :<:; n,e; =j:. O}, I' = {i 11 :<:; i :<:; n,e; =j:. O}, I being Unknown to us (the
set of error-locators) and I' being known (the set of erasure-locators). The vector
e is called the error-vector, and e' the erasure-vector. If there is an algorithm
which finds out the nearest code vector x for every e and e' such that III :<:; t and
II'I :<:; t', we say that the code corrects t errors and t ' erasures.
Now it is not difficult to prove (see Exercise 3.12) the following result:
EXERCISES
q-l ,m + l,qm}q-code C
3.2. Show that all non-zero code-vectors ofa Reed-Muller [~
of order I are of the same weight qm , i.e.,
Calculate the spectrum of the Hamming code CH = CJ. dual to the code C. (Hint:
Use Theorem 1.17.)
Examples and Constructions 65
3.3. Let n ==qm, r == a(q-I)+T 5 m(q-I), 1ST 5 q-I andL~(r) be the linear
space over Fq spanned by monomials urI .. ·u;:,m ,05 ai 5 q - I,Lai == r. Prove
that if P = F;then the Reed-Muller code C = Ev(L~ (r)) defined by
d = (q - T)qm-lT-l.
(Hint: To compute k one can calculate the number of ways to place m objects in i cells
such that no cell contains more thanj objects and then apply an exclusion-inclusion
argument. To compute done can just use induction on m.)
3.4. Let Fqm be the splitting field of the polynomial un - 1. The group F;m is cyclic and
its subgroup Un of nth roots of 1 is also cyclic. Hence
n-I
un - I = n(u-a i ),
i=O
g(u) = n(u-a i )
iEi
for a subset I ~ {O, I, ... , n - I}. Check that m is the smallest positive integer such
that n I (qm - I) and the coefficients of g(u) belong to Fq if and only if qI == I
mod (q).
3.5. Prove that Justesen code C constructed from a Reed-Solomon [n, k, n - k + I]qm -code
C is an
i=1
. (q-I/5 n -
L (2m)
I
1
k +1.
(Hint: The hyperplane (XI ,XI ;X2, (XX2; ... ,xn , an-IXn) contains at least n - k + 1
different q-ary vectors (Xi, ai-I Xi) of length 2m. Estimate the total weight.)
=
3.6. Let Rn Fq[u]/(u n - 1) be the ring of all polynomials of degree at most n -1 with
coefficients in Fq and let a E Fqm be a primitive nth root of unity (we suppose that
(n,q) = I). Apolynomial1)(u) ERn is called idempotent if
1)(u) = 1)2(u).
66 Chapter 3
A minimal ideal in Rn is one which does not contain any smaller non-zero ideal.
The corresponding cyclic code C is called a minimal or irreducible code, and the
idempotent of this ideal is called a primitive idempotent.
The cyclotomic coset modn over Fq which contains an integer s is
=
where ms is the smallest positive integer such that sqm, s mod (n). The non-zero
elements of a minimal ideal must be {ci liE Cs } for some cyclotomic coset Cs .
We denote this minimal ideal by Ms, and the corresponding primitive idempotent by
Bs(u), so thatMs = (Bs(u)). Prove the valj4ity of the following statements (q = 2):
(a) A cyclic [n,k,d]q-code or ideal C = (g(u)) inRn contains a unique idempotent
Tj(u) such that C = (Tj(u)). Moreover Tj(u) = a(u)g(u) for some a(u) ERn;
Tj(a i ) = 0 ifand only ifg(ai ) = 0 andx(u) E C ifand onlyifx(u)Tj(u) =x(u);
(b) Tj(u) is an idempotent ifandonly ifTj(ai ) = 0 or Tj( a i ) = I for i = 0, I, ... ,n-
l. (Hint: Use the following inversion formula: If
n-l
Tj(u) = L Tji Ui ,
i=O
then
1 n-l . .. 1 " ..
Tji = - L Tj(al)a-I) = - L £." a-I),
n j=o n s jEC,
where s runs through a subset of the cyclotomic cosets);
(c) Bs (aI)
. = {I0 E
if} Cs,
otherwise;
a ( ) - ~n-la(s) i
(d) Us h
u - L...i=O Ui U, were a(s) -- 11I ~
Ui L...jEC, a -ij , 0<
_ I. <
_ n-
l',
(e) The primitive idempotents Os (u) have the following properties:
(i) L Bs(u) = 1,
(ii) Bs(u)O/(u) = 0 if s # t,
(iii) The ring Rn is the direct sum of the minimal ideals generated by the
Os(u ),
(iv) The minimal ideal Ms = (Os(u)) of dimension ms =n- degBs(u) is
isomorphic to the field Fqm"
(v) Any idempotent Tj (u) can be written in the form
Tj(u) = LasOs(u),
s
3.7. Show that for p = 2, I = 4k + 3 the primitive Ith root of I can be chosen so that the
idempotents of quadratic-residue codes CR, Ck, CN, C~ are
Examples and Constructions 67
1'J~(u)=I+Lun,
nEN
1'JN(U) = L un,
nEN
1'JN(u) = 1 + L ur
rER
and for p = 2, I = 4k + 1 the idempotents of CR, Ch, CN, CN may be taken to be
1 + L ur , L un, 1 + L un, L ur ,
rER nEN nEN rER
respectively.
3.8. For a prime I > 2 define the Gaussian sum 7" as follows
I-I (.)
7"=L 7 (i,
1=0
where (f) is the Legendre symbol and ex E Fpm is a primitive lth root of unity. Since
7"P = 7" we have 7" E Fp. Prove that
I if / =4k+ 1
-/ if I =4k+3
3.9. Check that for p > 2 and I = 4k ± 1 the idempotents of quadratic residue codes C R,
Ch, CN, Cfy are
1'JR(U)=-1(1+-1)I +-1(1---
2 I
1) rERLU +-1(1-+-
2 I
7"
1) nEN
LU' r
2 7"
n
1'JN(u) = ~2 (I-~)
I
- ~2 (~-~)
I 7"
L ur - ~ (~+~) L un,
rER 2 I 7" nEN
respectively. (Hint: Use Exercises 3.6 and 3.8.)
= =
3.10. Let A (aij) be an m x m matrix and B (bij) be an n x n matrix over any field.
The Kronecker product A 0 B of A and B is the mn x mn matrix ob~ained from A
by replacing every entry aij byaijE. Now let GI and G2 be generatdr matrices for
[nl ,kl ,dil2 and [n2,k2,d2b-codes CI and C2, respectively. Prove thatthe Kronecker
product GI 0 G2 is a generator matrix for C = CI 0 C2.
3.11. Prove that the minimum distance of the concatenation C' of a linear [no, m, do ]ql-code
Co and a linear [n,k,d]q-code C is at least dod.
3.12. Prove Proposition 3.14. (Hint: Decode the given vector first with the decoding
algorithm for Co, then decode the obtained vector and all vectors obtained from it
by erasing one symbol with the decoding algorithm for C.)
Part II
This part introduces the basic notions of algebraic geometry and provides a number
of fundamental facts we shall apply later to the theory of error-correcting codes.
Almost all of the constructions coming from algebraic geometry and being used
in the coding theory are based on consideration of algebraic curves. That is the
reason why we concentrate mostly on the theory of curves. Multi-dimensional
algebraic geometry appears here only as an instrument for the study of algebraic
curves. For a more extensive exposition of concepts and methods of algebraic
geometry we refer the reader to Fulton [47], Griffiths and Harris [67], Hartshorne
[73], and Shafarevich [172].
69
Chapter 4
Algebraic Curves
This chapter contains the basic definitions and results of the theory of algebraic
curves: valuations, divisors, the genus of a curve, finite morphisms, linear sys-
tems, Jacobians, differential forms and their residues, the Riemann-Roch theorem,
Hurwitz and Plucker genus formulas, special divisors and Weierstrass points. We
do not consider here the arithmetical properties of curves and for that reason the
ground field k is assumed to be algebraically closed.
Affine Varieties
Let k be an algebraically closed field and n ;::: 1 be an integer. Define n-dimensional
affine space over k, denoted A'k (or simply An), to be the set of n-tuples (Xl, ... ,xn)
with components in k. An element X E An will be called a point, and if X =
(Xl, ... ,xn), Xi E k, then the Xi will be called the coordinates ofx.
Let k[T] = k[Tl' ... ' Tn] be the ring of polynomials in n variables over k. We
will interpret the elements of k[T] as functions from the affine n-space An to k, by
defining F(x) =F(Xl, ... ,xn), where FE k[T] and X E An. A zero of FE k[T] is
a point X = (Xl, ... ,Xn) E An such that F(x) = O. If S is any subset of k[T], we
define the zero set V (S) of S to be the set of common zeros of all the polynomials
FES:
V(S) = {x E An IF(x) = 0 for all F E S}.
A subset X of An is an algebraic set if there exists a subset S ~ k[T] such that
X = V(S). If a is the ideal of k[T] generated by S then X can be considered
71
72 Chapter 4
Morphisms
wherel E k[X]. Then U(j) is again an affine variety (it can be given in A n by the +'
equations of X ~ An and the equation Tn+if(T" ... , Tn) = I) and the coordinate
ring of such an open set is isomorphic to k [X] [1/1].
Non-Singular Varieties
If X is a topological space, we define the dimension of X (denoted dimX) to
be the supremum of all non-negative integers n such that there exists a chain
Xo C XI C ... C Xn = X of distinct irreducible closed subsets of X. We define the
dimension of an affine, projective or quasi-projective variety to be its dimension
as a topological space (with the topology of Zariski).
Theorem 4.5. The dimension ofan affine variety X is equal to the transcendence
degree ofthe field k(X) over k.
LetX ~ AP be an affine variety and let FI, ... ,Fr E k[TI, ... , Tn] be generators
for the ideal a(X) . The variety X is non-singular at a point x E X, if the rank of
the matrix
( aFj (X))
alj
is n - d, where d is the dimension of X. The variety X is non-singular (or
smooth) if it is non-singular at every point. Note that this definition depends on
the embedding of X in an affine space. To extend the concept of non-singularity
to the case of quasi-projective varieties it is convenient to describe this concept in
terms of local rings.
The height of a prime ideal P in an arbitrary commutative ring R is the supre-
mum of all non-negative integers n such that there exists a chain Po C PI C ... C
Pn = P of distinct prime ideals. We define the dimension of R to be the supremum
of the heights of all prime ideals.
(i) every proper non-zero ideal in C9x is of the form a = m~ for some integer
n 2: 1;
Valuations
Let L :::> k be an extension of the ground field k. A map v : L * --+ Z of the
multiplicative group L * of L onto Z such that
(i) v(k*) = 0,
(ii) v(xy) = vex) +v(y), and
(iii) v(x+y) 2:min(v(x),v(y)),
is called a discrete valuation of the field L. It is convenient to extend the map 'Ii
to the whole L by setting v(O) = 00. Such a discrete valuation defines a discrete
valuation ring C9 v = {x E L * Iv(x) 2: O}.
The quotient field of C9 v is L. An integral domain R is called a discrete
valuation ring if there exists a discrete valuation v on its quotient field K such
that R = {x E K* Iv(x) 2: O}. Every discrete valuation ring R is a principal ideal
domain and every non-trivial ideal a c R is of the form a = mn for some integer
n 2: 1, where
m = {x E R Iv(x) > O}
is the unique maximal ideal in R.
Theorem 4.9. Let X be an algebraic curve and x a point of X. Then x is non-
singular if and only if the local ring C9 x is a discrete valuation ring.
Algebraic Curves 79
Proof: Let x E X and let us cp be represented locally around x by y I-t (fo (y) :
Ji (y) : ... :.fn(y)).
Multiply allJi, 0::::: i ::::: n by the same power tV of t such that
v(tvJi) ~ 0 and
min(v(tVfo) , ... ,v(tVfn)) = 1.
Then cp can be represented as y I-t W.fo (y) : tVJi (y) : . .. : t".fn (y)) which shows
that cp is well-defined on X. •
Corollary 4.11. Non-singular projective curves are classified by their function
fields.
Non-Singular Points
The set of singular points of a projective curve is closed in the Zariski topology.
This fact is local in nature, so that we can restrict ourselves to consideration of
affine curves, which provide an open covering of the considered curve. In turn,
to check whether a point on an affine curve is non-singular, we can use the notion
of tangent space. For simplicity, we are restricted to consideration of the case of
plane affine curves (in the general case, the situation is entirely similar).
Let a plane affine curve X ~ A,2 be defined by a polynomial F (Tl , T2)' The
linear form
dxF = aF(x) (Tl -xJ) + aF(x) (T2 -X2)
aXI aX2
dxF = °
is called the differential of the polynomialF atthe point x = (Xl ,X2). The equation
defines a linear subspace ex ~ A,2 which is called the tangent space
of X at the point X E X. Then there is an isomorphism of the vector spaces
e; ~ mx/m; with mx the maximal ideal of c:Jx and e; the dual vector space of
ex. Indeed, by associating to G E k[Tl, T2J its differential dxG, one gets a map
dx : k[XJ -+ Homk(ex,k). We extend this by the formula
where the sum is over all points of X, and the coefficients are integers and are
zeros for all but a finite number of x. The set {x E X Iax i- o} is called the support
Algebraic Curves 81
The degree map deg: Div(X) -+ Z, D r-+ degD, is surjective; its kernel is denoted
by Divo(X). Ifa x ~ 0 for every x E X then we call D = Lax·x an effective divisor
and write D ~ O. If, moreover, D¥-O we call it positive and write D > O. This
definition induces a partial order on Div(X) : D ~ D' if and only if D - D' ~ o.
Note that each divisor is a difference of two effective divisors.
Divisors
and
{D,D'} = Lmax(ax,a~).x
we call respectively the greatest common divisor and the least common multiple
of divisors D = Lax· x and D' = La~ . x.
Let Vx be the discrete valuation of L = k(X) associated to x and let f be a
non-zero function in L. Iff E mx (resp.j-I E mx) then x is called a zero (resp. a
pole) off and vx(f) (resp. vx(f-I» is called the order of zero (resp. the order of
pole) off atx. Set
and call (f) the divisor off. Note that (f) is indeed a divisor since eachf E L*
has only a finite number of zeros and poles and thus Vx if) ¥- 0 only for a finite
number of points x EX. Note also that (f) = (f)o - (f)"" where
(f)o = L vxif)·x
vx(f»o
and
if)", = L vx(f-I)·x
vx(f)<o
are effective divisors. The divisor (f)o is called divisor ofzeros, and (f)", is called
the divisor ofpoles off.
Divisors of the form (f) are called principal or linearly equivalent to zero.
Principal divisors form a subgroup P(X) in Div(X). There is an intrinsic iso-
morphism P(X) ~ L* jk*. If D -D' E P(X), then divisors D and D' are called
linearly equivalent (or simply equivalent). In this case we write D '" D', and call
{D' E Div(X) ID' '" D} the divisor class of D.
82 Chapter 4
Finite Morphisms
Let cp : X -7 Y be a non-constant morphism of curves X and Y. It is called the
finite morphism and X is called the covering of Y. In this connection we get an
embedding cpo : k(Y) Y k(X). The degree of k(X) over k(Y) is called the degree
of cp and is denoted by deg cpo If k(X) is a separable extension of k( Y) then the
morphism cp : X -7 Y and the covering X is also called separable.
Let Vx be the discrete valuation associated to x E X and let t be a local parameter
aty = cp(x) E Y. The morphism cp induces also a homomorphism cpo : C9y -7 C9 x of
valuation rings C9y and C9 x, and the number ex = Vx (cp* (t» is called the ramification
index of the finite morphism cp at x. If ex > 1 we say cp is ramified at x, and that
y = cp(x) is a branch point and x is a ramification point of cpo If ex = 1, we say cp
is un ramified at x (see Fig. 4.1).
A finite morphism cp : X -7 Y induces a homomorphism cpo : Div( Y) -7 Div(X)
which is defined as follows. Set
cp*(y) = L ex·x
xEX
cp(x)=y
and observe that the sum is finite. Then we extend the definition of cpo (y) by
linearity to all divisors of Y. The image cp*(D) ofa divisor DE Div(Y) is called
the pullback of D. It is clear that cpo preserves the linear equivalence and hence
induces a homomorphism Pic( Y) -7 Pic (X) .
We can now deduce the following fact:
Theorem 4.15. Let cp : X -7 Y be a finite morphism of curves X and Y. Then for
any point y E Y
deg cpo (y) = deg cp,
and for any D E Div( Y) we have
degcp*(D) = (degD)· (degcp).
Algebraic Curves 83
y ==<p(x)
x
y
Figure 4.1.
Proof: Note thatf defines a morphism cp : X ---+ ]pl. We have cpo (0) = (f)o and
cpo (00) = (f)oo, hence
Proof: First we observe that if D,....., D' then L(D) ~ L(D') by f 1-+ fg if (g) =
D - D'. If degD < 0 then deg( if) + D) = degD < 0 for any function f E L *,
so L(D) = {O}. If degD ::::: 0 then either L(D) = {O} or there exists a divisor
D' ::::: 0 such that D' ,. . ., D (namely D' = D + if) for a functionf -I- 0 in L (D); see
Theorem 4.12). Since L(D) ~ L(D') we can replace D by D'. Therefore we may
assume that D = Ir= I mi • Xi with mi ::::: O. An element of L (D) determines for each
i = 1,2, ... ,r· an element of t;;-mi(lx)(lxi where tXi is a local parameter at Xi. This
gives a linear map
A :L(D) -+ EBr=1 t;;-mi (lX)(lxi.
The kernel Ker A of this map is k. Indeed, if Aif) = 0 then f E (lxi for all
i = 1,2 ... ,r and f is regular everywhere. But then f is constant, so
r r
I(D)::; 1 + Ldimkt;;-mi(lx)(lxi ::; 1 + Lmi = 1 +degD.
~l ~l
we get a bijection
Jacobian of a Curve
Let G be a quasi-projective variety which at the same time is a group. Then G is
called an algebraic group if and only if the maps
and
.p:G-+G,
Algebraic Curves 85
are morphisms. It is easy to see that every algebraic group is a smooth algebraic
variety.
If G is an algebraic group and a projective variety then G is called an abelian
variety. An abelian variety is a commutative group. IfX and Yare abelian varieties
which are isomorphic as varieties then they are also isomorphic as algebraic groups.
When X is a plane cubic curve, the factor group Pico(X) = Divo(X)jP(X) is
a one-dimensional abelian variety. The group law on X can be defined starting out
from the study ofPico(X) (see Section 4.4 below and Section 7.1). This example
is typical of a much more general situation. Starting from an arbitrary smooth
projective curve X, we can construct an abelian variety whose group of points is
isomorphic to Pico(X).
Theorem 4.18. For any smooth projective curve X there exists a unique abelian
variety J x such that
(iii) for any morphism 'P : X --+ Y from X to an abelian variety Y such that 'P(xo)
is the neutral element of Y there exists a morphism of abelian varieties
'" : Jx --+ Y, with 'P =" '" )0'
The abelian variety J x is called the Jacobian of X. The dimension of the
Jacobian is called the genus of X and is denoted g(X). One can show that this
definition coincides with the definition of genus in terms of differential forms
given in the next section. For k = C it also coincides with topological definition
of genus.
It should be noted that any curve which is not isomorphic to JlDI can be embedded
into its Jacobian.
Theorem 4.19. Ijjo: X --+ 1x is not bijective then X c:::' JlDI •
Proof: Ifjo is not injective then there exist x,y E X, x i- y, such that x = y + if)
for somef E k(X),j if- k. Considerf as a morphismf : X --+ JlDI • Since if) = x - y
we have if)o = x and by Theorem 4.15 degf = 1. Hence f is an isomorphism. •
determination of the dimension I(D) of the vector space L(D) is known as the
Riemann-Roch problem. The famous Riemann-Roch theorem gives a partial
answer to this problem.
Differential Forms
Let X be an affine variety over k. We define a k [X] -module n[X] as follows: it is
generated by elements df,f E k[X] satisfying the relations: dif + g) = df + dg,
°
difg) = df ·g+f ·dg andda = for all a E k. The elements ofn[X] are called
regular differential forms.
Now let X be a quasi-projective variety. Consider a family {Ui, Wi} of pairs
( Uj , Wi)' where Ui are affine open sets ensuring an open covering of X and Wi E
n[ Ui] are regular differential forms such that Wi = Wj on the intersections Ui n ~.
Define an equivalence relation on the set of all {Ui , Wi} by
Proof: Note at first that for every F E k [TI , ... , Tn] and any fi , ... ,In E k [U] one
has
n aF
d(F(ft, ... ,In)) = L
-(ft, ... ,/n)dji.
i=1 aTi
Now let V C An be an open neighborhood of x which is an affine curve and let
FI, ... ,Fm E k[TI, ... , Tn] be a basis of the ideal a(V). Since Fi vanishes on V for
each i = 1,2, ... ,m we have
n aFi
L-·dfj=O, 1 5, i 5, m,
j=1 a1j
where fj == 1j mod (a( V)) and fj E k[V]. Whenever x is a non-singular point, the
rank ofthe matrix
( aFi (x))
a1j
Algebraic Curves 87
equals n - 1. Without loss ofgenerality we can assume that t = tl. One can express
d t.i ' 2 ~j ~ n, using the above system, d t.i =lid t ,Ii being rational functions regular
atx. Let Ube an open subset in V such thatli E k[U] for allj = 1,2, ... ,m. Since
one can express each wE O[UJ in dtI, ... ,dtn and hence in dt, we find that
n[U] =kU·dt. •
dimL O(X) = 1.
Canonical Class
Let w be a rational differential form on a smooth curve X. We can write near a
point x E X the form w as w = fdt withf =Ix a rational function and t = tx a
local parameter. We can now define the divisor of the differential form w by
Residues
Let w E O(X), t be a local parameter at x E X, and w = f· dt forf E L = k(X).
Expandingf into Laurent power series in t we obtain
00
f= I a/.
i=-N
We call the coefficient a_I the residue of w at x and denote it by Resx ( w ). The
basic properties of residues can be summed up as follows:
(i) Resx (w) does not depend on the choice of local parameter t = tx ;
(ii) Resx(w) is k-linear functional on O(X);
Proof: Properties (ii), (iii) and (iv) are clear. To prove (v) we writef = tng with
vx(g) = O. Then df If = ndtlt +dglg, where dglg is regular at x, and hence
Resx(df If) = n.
We shall prove the property (i) only in the case when the characteristic of k is
zero (the case of a positive characteristic requires some additional considerations).
Let us write
w= Iavdtltv+w',
v 2': I
Res~(w) =I avRes~(dtltV).
v 2': I
Algebraic Curves 89
Proof: Note that the last formula makes sense since Resx ( w) = 0 for all but a
finite number of points x E X. The idea of the proof is to check the above formula
on pI and then represent a curve as a branched covering of pI and to check what
happens under a morphism (for details see Serre [165, Ch. 2], and also Lang [109,
Ch. 1], and Stepanov [187, p. 202]). •
i(D) = dim,.:l(D).
For every DE Div(X) we define the space n(D) as
From this definition it is clear that n(D) ~ L(K +D). In particular, n(D) is
finite-dimensional for each D E Div(X).
Now we define the pairing (w,r) as follows:
Proof: For D = 0 the statement follows from the previous theorem. Now it
is sufficient to prove that the theorem is valid for D if and only if it is valid for
D' = D + x, x being an arbitrary point of X. Indeed, every divisor can be obtained
from the trivial one by addition and subtraction of points. We have
degD' - g+ 1 = (degD - g+ I) + 1.
Therefore one has to show that
taking into account that L(D) = R(D) nL for every DE Div(X) we get the exact
sequences
Since the functions degD and I(D) are functions on divisor class group Pic (X)
we can give also the following version of this theorem:
(i) degK = 2g - 2;
Let Xo be a point of an elliptic curve X. Then the map x f-+ cx, Cx being the
divisor class of degree zero containing x - xo, gives a one-to-one correspondence
between the set of points of X and the elements of the group Pico(X). Thus we
get a group structure (with Xo as identity) on the set of points of X.
To see this it will be enough to show that if D is any divisor of degree 0, then
there exists a unique point x E X such that D '" x - Xo. We apply Riemann-Roch
to D + Xo and obtain
with the.fi a basis of L(K). In fact, by the Riemann-Roch theorem we can check
that always one of.fi(x) is non-zero: I(K -x) -I(x) = g - 2 and I(x) = 1 as was
noted above. So I (K - x) = g - 1. We find a well-defined morphism. Of course, it
depends upon the choice of a basis. Iff/ is another basis and if cp' is the associated
morphism then cp' = cp. a with a an automorphism ofJPlg-I. Despite this element
of choice, the morphism cp is called the canonical map. Suppose that cp identifies
two different points, say x' and x". Then I(K - x') = I(K - x' - x") = g - 1. By
the Riemann-Roch theorem
fact, by also applying the argument above for x' = x" we find that if X is not
hyperelliptic then cp is an embedding.
As an example, take a curve of genus 3 which is not hyperelliptic. Then its
image under the canonical map is a non-singular curve of degree 4 in ]p2 .
This construction can be extended as follows: consider a morphism
I(D-x-y)=I(D)-2.
Proof: For x =I y the above equality implies cp(x) =I cp(y), i.e., cp is injective.
For x = y it implies that the induced homomorphism CPo : (lx! -+ (lx defines an
isomorphism ofmx!lm;! onto mxlm~, where x' = cp(x). Thus cp is an injection
which induces isomorphism of all tangent spaces. Every such injection is an
embedding. Indeed, the injectivity of cp implies that deg cp = 1. Hence there exists
an inverse rational map cp-I : Y = cp(X) -+ X. Since the map
where cp' is purely inseparable and cp" is separable. Let us showthatg(X) = g(X).
We need the following definition. Let X be a smooth projective curve over a field
k of positive characteristic p > O. Denote by L the algebraic closure of L and by
Lp = L lip the field
Lp = {f E L IfFE L} .
There exists a unique smooth projective curveXp withLp = k(Xp) and the inclusion
L C Lp defines a morphism cpp : xp -+ X which is called the Frobenius morphism.
Since the transcendence degree of L equals 1, then degcpp = p.
Proposition 4.32. Let cp : X -+ Y be a non-constant purely inseparable morphism.
Then g(X) = g(Y).
Proof: Let deg cp = pm. By induction one can assume that m = 1. In this case
k(X) ~ cp*(k(y)llp) and since [k(X) : cp*(k(Y))] = p, the morphism cp coincides
with cpp and X = Yp, Let us show that the dimension of O[Y] over k is equal to the
dimension of O[Yp]. Indeed, let w =f dg E O[Y] and consider w' = f' dg', where
f' and g' are obtained fromf and g by raising all their coefficients to the pth power.
Then w' =f'dg' E O[Yp]. Conversely, for w' =f'dg' E O[Yp], the form w =fdg
lies in O[Y] (here f and g are obtained from f' and g' by extracting the pth power
root from all their coefficients). •
By this proposition, computation of g(X) is reduced to the case when cp is
separable. Let cp : X -+ Y be a finite separable morphism of smooth projective
curves, let x EX, cp(x) = y, and let tx and ty be local parameters at x and y,
respectively. Then cp*(dty) =gdtx for someg E C>X. Letvx(g) = ax and let
The divisor is called the ramification divisor of cp (note that ax =1= 0 only for a finite
number of ramification points x of cp).
Theorem 4.33. Let cp : X -+ Y be a separable finite morphism ofdegree n. Then
2g(X) -2 = n(2g(Y) -2) +degD",.
provided cp is tamely ramified. Indeed, if cpo (ty) = gt;X with Vx (g) = 0 then
Example 4.2. Let k = F2 be an algebraic closure of F2, Y = !PI and X the non-
singular projective curve defined by the function field k( u, z) with z2 + z = u5 + 1.
We have an involution z I--t z + I on k(X) with fixed field k(u) = k(IPI). The
field extension k(u,z)/k(u) corresponds to a morphism cp: X -+ Y of degree 2.
Restricting X to the affine part A I with u as coordinate we can give X by the
affine equation z2 +z = u5 + 1 and cp by (u,z) I--t U. No point of Al is a branch
point. The inverse image ofyeo = (1 : 0) E!PI is the point Xeo which corresponds to
the discrete valuation vx"" with vx",,(u) = -2 and vx",,(z) = -5. A local parameter
t at yeo is t = l/u; at Xeo it is s = u2/z. We have d(z2 +z) = d(u 5 + 1), which
gives dz = u4du. Since vxoo (u 2/z) = 1, vx",,(d(u 2/z)) = 0 we find vx",,(dz) = -6,
vx",,(du) = 2. Therefore
g(X) = 2( -2) +6 = 2.
A basis for the two-dimensional space of regular differential forms is du, udu.
96 Chapter 4
w =Hdu/G~ = -Hdv/G~,
(since dimk .o.[X] = g(X) = g). The integer g is a birational invariant, hence
the Plucker genus fonnula shows us that non-singular plane projective curves of
different degrees m,m' ~ 3 are not birational to each other.
We call a divisor DE Div{X) special if and only if I{K - D) > O. It is easy to see
that D is special if and only if D - K '" D' for an effective D'. Moreover, every
divisor of degree at least 2g - I is non-special and every divisor of degree at most
g - 2 is special. One can show that for any divisor D and D'
dimlDI ~ ~degD.
Algebraic Curves 97
Let Gx = {al, ... ,ag }, 1 :::; al < ... < ag :::; 2g-l, be agap-sequenceatx EX.
The non-negative integer
g
w(x) = L(ai-i)
i=1
98 Chapter 4
is called the Weierstrass weight of x. Note that w(x) > 0 if and only if x is a
Weierstrass point. One can easily show that
L (l(a ·x) -
00
We have
L w(x) = g(i - 1) + a(X),
xEX
where a(X) ~ O. In addition, a(X) = 0 ifp = chark = 0, and for p > 0 the value
a(X) vanishes for all but a finite number of "exceptional" curves. Therefore the
number of Weierstrass points on X is finite and does not exceed g(g2 - 1) + a (X) .
One can show that w(x) ~ g(g - 1) /2 and w(x) = g(g - 1) /2 if and only if 2 is a
non-gap at x.
A point x on X with w(x) = g(g - 1)/2 is called a hyperelliptic point. For
p = char k 'I 2 any hyperelliptic curve has 2(g + 1) Weierstrass points, all of them
being hyperelliptic. In general, a curve X has hyperelliptic points if and only if X
is a hyperelliptic curve.
To construct Weierstrass points one can use the following fact:
EXERCISES
4.1. If a, b are ideals of a commutative ring R with identity element I, their sum a + b is
the set of all x + Y with x E a and Y E b. It is the smallest ideal containing a and b.
The product of two ideals a, b of R is the ideal ab consisting of all finite sums IXiYi
withxi E a andYi E b. Check the following properties of the radical r(a) of an ideal
a inR:
(a) a ~ r(a);
Algebraic Curves 99
4.2. LetR be a commutative ring with identity element 1 and let X be the set of all prime
ideals of R. For each subset E of R, let V(E) denotes the set of all prime ideals of R
which contain E. Prove that:
(a) x is a closed point (x = x) inX = SpecR if and only ifthe ideal Px is maximal;
(b) x=V(Px);
(c) y Ex if and only ifpx ~ Py.
4.5. A topological space X is called Noetherian if it satisfies the descending chain con-
dition for closed subsets: for any sequence Y\ ~ Y2 ~ ... of closed subsets, there is
an integer m 2: 1 such that Ym = Ym +\ = .... Show that:
(a) An and JP>" are Noetherian topological spaces (in the Zariski topology);
(b) in a Noetherian topological space X, every non-empty closed subset Y can be
uniquely expressed as a finite union Y = Y\ U ... U Ys of irreducible subsets Yj
such that Yj g; lj for i =I- j (they are called the irreducible components of Y);
(c) a Noetherian topological space is quasi-compact (i.e., every open cover has a
finite subcover);
(d) a Noetherian space which is also Hausdorff must be a finite set with discrete
topology.
4.6. Prove the homogeneous version of Hilbert Nullstellensatz: if a ~ k[To, T\, ... , Tn]
is a homogeneous ideal, and ifF E k[To, T\, ... , Tn] is a homogeneous polynomial
with degF 2: 1, such that F(x) = ofor all x E V(a) in JP>", then F m E afor some
integer m 2: 1.
4.7. An open subset of an affine variety is called quasi-affine variety. Prove that:
100 Chapter 4
4.11. Let X = pi and D = n ·Xoo. Show that L(D) is the space of polynomials F(T) of
degree at most n, and hence /(D) = n + 1.
4.12. Let X be a smooth projective curve defined over an algebraically closed field k.
Prove that the sequence
is exact.
4.l3. Show that a curve X is birational to pi if and only if g(X) = O. (Hint: Use the
Riemann--Roch theorem.)
4.14. Show that a projective curve in jp>2 given over a field of characteristic p #- 2, 3 by the
equation
zf
Z~Zo = +azlz5 +bzS, 4a 3 +27b 2 #- 0,
is a curve of genus 1. Prove that any curve X of genus lover a field of characteristic
p #- 2,3 is birationally equivalent to this curve. (Hint: Use the Riemann--Roch
theorem.)
4.15. Let X be a curve and let XI, ... ,Xs E X be some points of X. Show that there exists
a rational function! E k(X) having poles at points Xi and being regular elsewhere.
4.16. Let K be a subfield of a pure transcendental extension L = k(t) of k, strictly con-
taining k. Prove that K is also pure transcendental. (Hint: Use the Hurwitz genus
formula and Exercise 4.l3.)
Chapter 5
Closed Points
If the field k is no longer algebraically closed, the Hilbert Nullstellensatz no
longer holds in general since the corresponding points are lacking. For example,
the equation x 2 +y2 = -1 has no solution over k = lR and defines at the same time
the imaginary circumference over k' = C. Therefore, if we consider the case of
an affine variety, it seems better to start with the coordinate ring instead of with a
set of zeros. So let k' be an algebraically closed field containing k as a subfield
and let p be a prime ideal of k[T] = k[T" ... , Tn] which generates a prime ideal
p' in k'[T]. Then p' defines an affine variety X defined over k; to emphasize this
fact we say that X is absolutely irreducible (or absolute). A morphism of affine
varieties over k is a morphism of the associated varieties over k' which is given
by a homomorphism of k-algebras. Similarly a projective variety defined over k
is given by a homogeneous prime ideal of k[To, T" ... ,Tn] which remains prime
being extended to k'[To, T" ... , Tn]. To this ideal we can associate a function
field k(X) by restricting our earlier definition to those pairs (U,J) where f can be
defined by polynomials with coefficients in k. For a curve X the field k(X) can
be described as follows: it contains k as a subfield and possesses an element x
103
104 ChapterS
which is transcendental over k and such that k(X) is a finite algebraic extension of
k(x). A field with these properties is called an algebraic function field over k. By
extending the constants to k' we obtain the function field k'(X).
We calIX smooth (or non-singular) if, after extension of k to an algebraically
closed field k', the curve is a smooth curve.
We can view a curve X over k as a curve over k' of which we can see only
a fraction of all points. Over the field k', we had a one-to-one correspondence
between the points ofX and the discrete valuation rings of the function field. Since
we cannot see all points, this no longer holds, but nevertheless we can look at all
discrete valuation rings contained in k(X) such that the discrete valuation is trivial
on k. If v is a discrete valuation of k(X) and C>V is its valuation ring with maximal
ideal m, then lev = C>v/mv is called the residue field. This is a finite extension of
k. We call the pair (C>v,mv) a closed point of X and v = [lev : k] the degree of the
point. If k is an algebraically closed field then of course v = 1 for every point.
To a closed point of degree v over k we can associate a set of v points of degree
one overk' which are conjugates of each other under Gal(k' /k). This extends the
discrete valuation v of k(X) to a discrete valuation v' of k' (X) (which corresponds
to a point x EX). These v points are all distinct if k' / k is separable.
Let, for example, X = plover k = F q • Every irreducible polynomial F E k[T]
in one variable T of degree v 2: 1 defines a closed point of degree v on pl. After
extending Fq to Fqv we can see the v points of degree one over Fqv which form
this point of degree v over k.
Over F2 the curve X has three points of degree one: (0: 0: 1), (0: 1 : 0) and
(1 : 0: 0). Its points of degree 2 become visible over F4 = F2(a), where a is a
root of the polynomial F(z) = z2 + z + 1. If we consider our curve as a curve
over F4, then the curve has two more points of degree one: (1: a : 1 + a) and
its conjugate (1 : 1 + a : a). This pair of points defines a closed point of degree
2 on X over F2. Over Fs we find many more points. To describe these we first
introduce an automorphism group of X over Fs = F2(J~), where f3 is a root of the
polynomial G(z) = z3 + z + 1. An automorphism of order 7 over Fs is given by
(T : (u : v : w) 1-+ (u : f3v : f35 w ). Besides this we have an automorphism of order
three: (u: v : w) 1-+ (w : u : v). In fact, one can show that the curve X admits GI6S,
the simple group with 168 elements, as an automorphism group. Suppose that
(u : v: w) is a point of X over Fs. If u :f:. 0 and the point is not (1 : 0 : 0) then by
applying (T we can assume that both u and v are equal to 1. The condition is then:
Curves over a Finite Field 105
, ifv:jEOmod(3)
if v == 0 mod(3) ,
Now we present the situation described above in a slightly different form using
the notion of k-rational points on a curve. Let k be a subfield of an algebraically
closed field k'. We say that a point X = (XI, ... ,xn) E A'ic, is k-rational if Xi E k for
all i = 0,1, ... , n. A point Xi E IPkI is called k-rational if Xi "# 0 implies Xj (Xi E k
for all j = 0, 1, ... , n. A variety X ~ A'ic, is called defined over k if its prime ideal
p = p(X) has a basis {FI, ... ,Fr } consisting of polynomials with coefficients in
k. The subset of k-rational points of X is denoted by X(k).
Let p(X) = k[T]FI + ... +k[T]Fr be a prime ideal ink[T] which generates the
prime ideal p'(X) = k'[T]FI + ... +k'[T]Fr C k'[T] of the variety X ~ A'ic, defined
over k. Then we have
p(X) = p'(X) nk[T].
Hence ifG = Gal(k' (k) is the Galois group of k' over k and if X EX then u(x) EX
for every u E G.
Example 5.1. Let k = Fq be a finite field with q elements and k' = Fq be its alge-
braic closure. If X ~ A'ic, is an affine variety defined over k, then the automorphism
u : u I--t uq of the field k defines the Frobenius automorphism
to k"-rational points X,<T(x), ... ,<Tv-1(x), and <Tv leaves fixed all k"-rational
points ofX.
Similarly, we can consider the Frobenius automorphisms <Ti of a projective
variety X ~ 1Pk' defined over k = F q •
with CYi E k' and!; E k(X). Thus k' (X) is the tensor product
Proposition 5.1. Let x E X and k(x) = Fqv. Then the equivalence class 01x is
Proof: At first we observe that u i (x) "" ui (x) for all i,j = 0, 1, ... , v-I. Let
°: ;
us show that all points u i (x),
generality that x = (1 : Xl : ... : xn ). If u i (x) = ui (x) for°: ;
i ::; v-I, are distinct. Assume without loss of
i <j ::; v-I, then
ui - i (xs ) = Xs for all s = 1,2, ... ,n, and hence Xs E Flj-i. This implies Fqv ~ Flj-i
which is impossible.
Now we show that all points y E X equivalent to x become exhausted by the
points u i (x), 0::; i::; v -1. Indeed, lety beequivalenttox. Theny = T(X) for some
T E Gal(k'lk). Consider a restriction ofT to the field k(x) =Fqv. The restricted
T gives an automorphism of Fqv and hence T = u i for some i = 0, 1, ... , v-I . •
where v = [k(x) : k] is the degree of k(x) over k. The points u i - l (x), 1 ::; i ::; v,
are called components of P. Note that P is a k-rational divisor of degree v and
thatPx = Py if and only if x "" y.
It is clear that a closed point of degree v over k corresponds uniquely to an
equivalence class { u i (x) 10 ::;
i ::; v-I} of x E X and hence to a prime k-rational
divisor P on X of degree v. This gives us the following result:
Now we are are able to give a criterion for a divisor D onX to be a k-rational
one.
Proposition 5.3. A divisor DEDiv' (X) is k-rational if and only if it can written
as
D=Lap.P, ap Ell,
where ap = °for all but a finite number ofP.
Proof: If D = Lax· x is a k-rational divisor, then for some v 2: 1 such that
U V (x) = x for all x which occur in D with non-zero coefficients ax we have
It follows from this that all points from the same equivalence class occur in D with
the same coefficients. Hence D = Lap ·P. The inverse is obvious. •
Let x be a component of a prime k-rational divisor P. The point x defines a
discrete valuation v~ of the field k'(X). Let CJx and CJy be local rings in k(X) at
points x and y, respectively. It is clear that CJx coincides with CJy if and only if
x "" y. Hence any two equivalent points of the curve X define the same valuation
108 Chapter 5
of k(X) and the map P f-t Vp = Vx gives a bijection with the set of prime k-rational
divisors on X and the set of discrete valuations of k(X). Note that if r:Jp is a
valuation ring of vp and mp is its maximal ideal, then
r:Jpjmp ~ k(x).
Proposition 5.4. A principal divisor if) is k-rational if and only iff E k(X).
Proof: Let if) = LVxif)·x be a k-rational divisor and letf = F jG, G (j. p'(X),
in an open neighborhood U of a point x E X. Since if) is k-rational it follows
that along with x each conjugate (Ii (x) is a zero of the polynomial F (resp. G)
and hence F, G E k[T). Thereforef E k(X). The inverse easily follows from the
previous proposition. •
Let Pic' (X) be the divisor class group on the curve X over k'. A divisor class
C E Pic' (X) is called k-rational if it contains at least one k-rational divisor. The set
of all k-rational divisor classes form a subgroup of Pic' (X). Denote this subgroup
by Pic(X). The embedding
induces an epimorphism
Div(X) ----t Pic(X).
The kernel of this epimorphism is the group P(X) of principal k-rational divisors
and hence
Pic (X) ~ Div(X)jP(X).
Let D be a k-rational divisor on X. Set L' = k'(X), L = k(X) and consider
k' -vector space
and hence
dimL .o(X) = dimL'.o' (X) = 1.
Therefore the canonical divisor class W E Pic' (X) contains at least one k-rational
divisor K and we deduce the following result:
Now we are able to give the following version of the Riemann-Roch theorem
over an arbitrary perfect field k (see also Chevalley [20, Ch. 2], Deuring [24,
Ch. 2], Stepanov [187, Ch. 4] and Stichtenoth [197, 1.5]).
Let Divo(X) C Div(X) be the group of k-rational divisors of degree zero and
let
Pico(X) ~ Divo(X)jP(X)
be the class group of k-rational divisors of degree zero.
Theorem 5.7. Let X be a smooth projective curve defined over a finite field k = F q .
Then the group Pico(X) isfinite.
Proof: First of all we show that for a given non-negative integer v there is
only a finite number of effective k-rational divisors of degree v and we begin by
proving this assertion for prime k-rational divisors. Let x be a component of a
prime k-rational divisor P of degree v and let k(x) = Fqv. To the divisor P we can
associate a set of v such points. Hence it is sufficient to show that X contains only
a finite number of Fqv-rational points. One can assume without loss of generality
that x has the form x = (1 : Xl : •.. : xn ) with Xi E Fqv. The number of such points
is at most qvn. Hence the total number of Fqv-rational points X E X is at most
(n + I )qvn. Thus the number of the prime k-rational divisors P of degree v is
finite.
Now let D = Lax·x be an arbitrary effective k-rational divisor of degree v.
We have
degD = Lap. degP =v
and hence the number of k-rational divisors of degree v does not exceed the number
of the solutions in non-negative integers ap, degP of the equation
Lap ·degP = v.
110 Chapter 5
Let {PI, ... ,Ps } be the set of all prime k-rational divisors of degree at most v and
set degPi = Vi, aPj = ai. Then it is sufficient to estimate the number of solutions
in non-negative integers ai of the equation
s
Laivi = v.
i=1
This number does not exceed (v + I Yand hence the number of effective k-rational
divisors of degree v is finite.
Finally, let us establish the finiteness of the group Pico(X). Consider a non-
constant rational function f E k(X) and set g = F, where r is a sufficiently
large positive integer. Since f is not a constant then degif)o > 0 and hence
deg(g)o = rdeg(f)o = v 2: 2g. Let us fix g and v and prove that any k-rational
divisor of degree zero is linearly equivalent to a difference of two effective k-
rational divisors of the same degree v. Because there is only a finite number of
such divisors we can see that the number of k-rational divisors of degree zero is
also finite.
Let DE Divo(X). By the Riemann-Roch theorem we have
and hence there exists a non-zero rational function hE k(X) such that (h) + (g)o-
D 2: O. SetD' = (g)o andD" = (h) + (g)o -D. ThedivisorsD' andD" are effective
k-rational divisors of the same degree v, and therelationD =D' - D" + (h) implies
D rv D' -D". •
Let h = h(X) denote the cardinality of the group Pico(X).
Proposition 5.9. Let CI, ... , Ch be all the k-rational divisor classes ofdegree zero
and let Co be a fixed k-rational divisor class of degree e. Then any k-rational
divisor class C of degree v . e can be written uniquely in the form C = vCo + Ci
for some i = 1,2 ... , h. In particular, for any v 2: 0, there are exactly h 2: 1 of the
k-rational divisor classes of degree v . e.
Curves over a Finite Field 111
Proof: Since the degree of any k-rational divisor is divisible by e then the degree
of every k-rational divisor class is of the form v . e for some integer v. Let C be
a k-rational divisor class of degree v . e. The class vCo is also of degree v· e and
hence the difference C - vCo is a k-rational divisor class of degree zero. In that
case C = vCo + Ci for some i = 1,2, ... ,h. •
Theorem 5.10. Let k = Fq be a finite field with q elements. The number n (C) of
distinct effective k-rational divisors contained in a k-rational divisor class C is
expressed by the formula
ql(C) _ 1
n(C) = .
q-l
Proof: Let Do E C be a k-rational divisor. Consider the space L(Do) and to each
non-zero rational function f E L(Do) associate the k-rational effective divisor
D = Do + (f) in C. Conversely, let DEC be an effective k-rational divisor. Then
there exists a non-zero rational functionf E L(Do) such that D = Do + (f).
Thus the associationf t-+ D = Do + (f) defines a map of the set of all non-zero
rational functions of L (Do) onto the set of all effective k-rational divisors of C. If
D = Do + (f) = Do + (g) then if) = (g) and hence f = exg for some ex E k*. So
if IL(Do)1 is the cardinality of L(Do) then
(C) = IL(Do)l- 1
n q- 1 '
ql(C) _ 1
n(C) = q-l
.
where D runs over all the effective k-rational divisors of X and N(D) = qdegD
denotes the norm of D.
112 Chapter 5
~(X,s) = _1_ L
ql(C)-sdegC - h -es.
q-1degQO (q-1)(1-q)
~(X,S) = _1_ i:
q-l v=OdegC=v.e
L qdegC+I-sdegC _
(q-l)(I-q es)
1_
__ q_ ~ e(l-s)v _ 1
- q- 1 q :=0 (q - 1) (1 - qes)
q 1
(q - 1)(1- qe(l-s)) (q - 1)(1- q-es)"
Now, let g ~ 1. Then
where the product is over all prime k-rational divisors P on X. The product is
absolutely convergent, and hence does not depend on the order of the factors.
IT
N(P)~N
(1- (N(P))-s)-l = IT
N(P)~N
(fn=O
(N(p))-ns)
and since there are only a finite number of factors in the product, each factor being
an absolutely convergent series, we may multiply these to obtain
where the first summation is over all effective k-rational divisors with N(D) ~ N,
and the second is over all effective k-rational divisors D which contain no prime
divisor with N(P) > N and satisfy the inequality N(D) > N. Hence,
L ' (N(D))-Res
N(D»N
being remainder term of the convergent series for ,(X,s), tends to zero as N --7 00.
Proof: Let e be the smallest value of the degrees of all positive k-rational divisors
on X. It is sufficient to prove that e 1.=
Let us consider an extension k" = Fqe of degree e of the field k = Fq. If Pis
a prime k-rational divisor of degree v then it has the form
)J
P = Lui-I (x),
1=1
where x is a Fqv-rational point of X. Since e divides v the equivalence class over
k
{Ui-1(X) I1 ~i~ v}
splits into e = (e, v) equivalence classes over k"
{ue(i-l)+j-I(x) 11 ~ i ~ vie}, 1 ~j ~ e.
Hence the prime k-rational divisor P splits into e corresponding prime k"-rational
divisors P~' , ... ,P~ .
Let
pI!
be the zeta-function of X defined over k". We have
C"(X,s) = TI(1_q-sedegpl!)-1 =
pI!
Ii: (TI(1_q:....sede
i=1 Pf'
gp f')-I) ,
~(X,s) = _1_ L
ql(C)-sdegC + hql-s I-s h
q-ldegC=O (q-l)(l-q ) (q-l)(l-q-S)
h -1 q hql-s h
= q-l + q-l + (q-l)(l-ql-s) (q-l)(l-q-S)
1 + (h -q _l)q-S +q oq-2s
= -~(l:-_--'q'--;-I--s7-)(7-:-1-_-q--=-s-:-=)-
Finally if g ~ 2, then
= q_ 1 O!jq JS + q_ 1
;=1
hqgq-(2g-I)s h
+(q-l)(l-ql-S) (q-l)(l-q-S)
_ 1 +O"Iq-s +ooo+<T2g_lq-(2g-I)S +qgq-2gs
- (l-q-s)(1-ql-S)
Finally, letg ~ 2. To prove the theorem in this case we write down ~(X,s) as
+__
h ( 1 +q-(g-I)(2s-1) + qg q -(2g-1)s _ __ .
q-l l_ q l-s l-q-S
We have
~2(X, l-s) = 1+q(g-I)(2s-1)
_h_ ( (g-I)(2s-1) qgq(2g-1)(s-l) _ 1 )
+ q- 1 1+ q + 1 - sq - 1qs-l
= q(g-I)(2s-1) ~2(X,S).
Now we observe that if C runs through the set of divisor classes with the condition
and hence
y
~l
(X , 1 -s ) = -11 "qP(C)-(1/2-s)de g C
£...
q- l:<;;degC<2g-2
= q(g-I)(2s-1) (_1_ L
q - 1 l:<;;degC<2g-2
qP(C)-(S-I/2)(2g -2-degC))
= q(g-I)(2s-1)~1 (X,s)
and hence
tmdegP
= - ~)og(l-tdegp) = L L --
00
logZ(X,t)
P P m=l m
= L 00 (
L -1 ) tV =L 00 (
L degP ) tV
-.
v=l mdegP=v m v=l degPlv v
Setting
N: = L degP
degPlv
we obtain
Z(X,t) = exp ( v~
00
-;-t
N* V
)
,
N: = L degP =Nqv.
degPlv
Let us consider the setX(Fqv) of all Fqv-rational points of the curve X defined
over k = Fq. The set X(Fqv) is divided into equivalence classes which fonn the
k-rational divisors P. Thus
Nqv = L'degP.
P
where the sum is taken over all the mentioned equivalence classes of the setX(Fqv ).
Now we show that a prime k-rational divisor P corresponds to one of the
considered equivalence classes if and only if degP Iv. Indeed, let P correspond to
such an equivalence class and letx be a component ofP. We have degP = [k(x) : k]
and since x is an Fqv -rational point then k(x) ~ Fqv. In that case
k ~ k(x) ~ Fqv
and hence degP I v. Conversely, if degP I v and if x is a component of P, then
the condition degP I v means that [k(x) : k] divides [Fqv : k]. Therefore we have
k ~ k(x) ~Fqv
and hence the point x is Fqv-rational. Thus the divisor P corresponds exactly
to one of the equivalence classes of the set X(Fqv). It follows from the above
arguments that N; = Nqv. •
According to Theorem 5.14 the zeta-function Z(X,t) has the fonn
P(t)
Z(X,t) = (l-t)(I-qt)'
120 Chapter 5
where
2g-1
P(t) = 1 + L, 0'/ +qg t 2 , O'i E Q.
i=1
Let
2g
P(t) = TI(1- wit)
i=1
be a decomposition of P(t) into linear factors in some finite extension of the field
of rational numbers Q.
Theorem 5.17. Let N qV be the number of Fqv -rational points of a smooth projec-
tive curve X of genus g defined over Fq. Then
2g
Nqv=qv+I-L,wj.
i=1
Proof: We have
()
Z X,t =exp
~Nqv v)
(,tt--;-t =
re!I(I- wit )
(l-t)(I-qt)
and hence
co N 2g
L, 1 t v = L,log(l- wi t )-log(l-t)-log(l-qt)
v=1 v i=1
= L,
co -
1 ~)
( qV + 1- L, w[ tV.
v=1 v i=1
Comparing the coefficients under the same powers of t we obtain the required
result. •
Proposition 5.1S. Let G be the cyclic group of order n, and let 'TJ be a generator
of G. Then every character X on G has the form
G=GI x···xGr
of cyclic groups GI, ... , Gr of orders nl = p~l , ... ,nr = rrr, respectively. Let
'TJI, ... ,'TJr be' generators of the groups GI, ... , Gr and X be a character on G. We
have X ( 1!i) = exp( 21Ti ~) for some €Xj = 0, 1, ... ,nj - 1, and since every element
1
x EGis written uniquely in the form
then
Proposition 5.20. Let G be afinite abelian group of the order n = p~l ... p:r and
G = Gl X .•• x Gr be its decomposition into a direct product of the cyclic groups
G 1 , ... ,Gr of orders nl = p~l , ... ,nr = p~r, respectively. Then every character X
on G has the form
~X(x)={ ~
xEG
ifx = xo,
ifx =1= XO
and
ifx = 1,
~x(x)={~ ifx =1= 1.
XEG
Proof: For X = XO the assertion is obvious. Let X =1= xo. In this case there exists
an element Xo E G such that X(xo) =1= 1. We have
and hence S = o.
The second statement follows because the group G is isomorphic to a group
which is dual to G. •
non-zero sth powers. Conversely, if XCY) = I for every y E (F;Y, then XS = xo.
Therefore, if X is a character of exponent s, then X (x) depends only on the coset of
x modulo the subgroup (F; y. Thus, a character of exponent s may be interpreted
as a character of the group F; / (F;
y. There are exactly s characters of exponent
s.
It will be convenient for us to extend the definition of characters X on F; by
putting
X(O) = { ~ifX=xo,
if X i= xo·
Such a character X we call the multiplicative character of the field F q.
ifx E (F;Y,
ifx rf. (F;Y and x i= 0,
ifx = 0..
To prove the second part of the statement we note that X is a non-trivial character
of F; /
(F;)S and {I, 1), ... , 1)s-l} is the set of all representatives of the residue
classes modulo (F; y. The statement follows again from Proposition 5.22. •
We now tum to additive characters of Fq. An additive character of Fq is simply
a character of the additive group of Fq . If q = pm, where p is the characteristic of
F q , then the additive group is the direct sum of m copies of the additive group of
= x + x p + ... + x P
m-l
Fp. Denote by tr(x) the trace of the element x E Fq over
Fp.
Proposition 5.24. Every additive character", ofFq is of the form
Proof: We have
0/(3 (x +y) = 0/(3 (x) 0/(3 (y).
Thus 0/(3 (x) is Iln additive character of F q . All these characters are distinct and
their number equals q. Hence 0/(3 exhausts all additive characters of Fq . •
of Fqv onto Fq is called the relative trace of x E Fqv, and the map
and show that they depend in a regular way on the integral parameter v. To find
the dependence Tv of v let us introduce for consideration the L-function ofArtin
L (z) of complex variable z which we define by the series
(5.3)
where the sum is over all non-negative integers il, ... , iv with the condition il +
2h + ... + viv = v, and note that all f3v lie in the field Q( e 271"i/p ).
Proposition 5.25. Let s be a positive divisor ofq - 1 and deg(ji .. ·fr) = m. Sup-
pose that at least one of the following conditions holds:
(i) X is a non-trivial multiplicative character of Fq of exponent sand
(s,S], ... ,sr) = 1;
(ii) tfJ is a non-trivial additive character ofFq and bo i= 0, (n, q) = 1.
Then f3v = °
for all v ~ m + n - 1.
Proof: Let i], ... , iv be non-negative integers satisfYing the condition il + 2i2 +
... + viv = v. If for each 'T = 1,2, ... , v the polynomial
a(x) =xv +U]X v- 1 +···+u v EFq[x]
has exactly iT irreducible divisors of degree 'T in Fq [x] then the v-tuple (i], ... , iv)
is called the decomposition type of a(x). Let
V iT T-]
a(x) = IlIlIl(x+(T~(xt))),
T=lj=lk=O
where
(I) (I) • . (v) (v)
xl , ... ,Xi1 EFq'···'XI , ... ,Xiv EFqv.
It is easy to see that if the non-negative integers il, ... ,iv run through all solutions of
the equationil +2i2 + ... + viv = v andxi T ), ••• ,xt) independently run through all
elements of the fields FqT, 1 ::; i ::; v, then the elementary symmetric polynomials
UI, ... ,U v of these elements independently run through all elements of the field
F q . In addition, the polynomials UI, ... ,U v are invariant under all pennutations
of elements xiT), ... ,x,t) E FqT, 1 ::; 'T ::; v, and also under all replacements of
these elements by their conjugates over Fq . Thus, if we set (according to the
fundamental theorem on symmetric polynomials)
126 Chapter 5
be a decomposition ofJ;(x) into linear factors in the ring FqVi [xl. Since
we have
r
IT IT (lXi/Lv + UI lXi/Lv-I + ... + Uv )Si .
Vi
f * (UI, ... , Uv ) --
i=I/L=1
We set
(5.5)
v v-I J;
lXrl +UIlXrl +···+Uv =y.
If UI , ... , Uv - m are fixed and Ui-m+ I, ... , Uv independently run through all ele-
ments of F q , then gl, ... , gr independently run through all elements of the fields
FqVI , ... , Fqvr , respectively. In addition,
r r
f*(UI, ••. ,U v ) = IT (normVi (gi))Si = IT~:i (5.6)
i=1 i=1
where
Curves over a Finite Field 127
and hence
n
g*(Ul"" ,Uv) = L bn-tJ- L afl, ... ,AvU~1 ... u~v + vbn
tJ-=l Al +2A2+··+vAv=tJ-
f3v =N
Ul "",U v- m EFq
where
r
N = I1(qVi - 1)/(q - 1).
i=l
i:1,···,i:rEFq
q-l
= L X(1]k l s l +"+kr Sr ),
kl,···,kr=l
where 1]ki = {j fori = 1,2, ... ,r. We have (S,Sl,'" ,sr) = 1 and, hence, ifkl, ... ,kr
independently run through all elements of the set {I, 2, ... ,q - I} then klSl + ... +
krsr runs, with the same multiplicity, through all the elements 0, 1, ... ,S - 1 of the
complete residue system modulo s. In that case the product runs several ,:1 ., .,;r
times through all elements of the factor group F; / (F;)s. Since X is a non-trivial
character of this factor group then according to Proposition 5.22 we have
°
Thus, if X =1= Xo, then f3v = for all v > m + n - 1.
Now let I/J be a nontrivial additive character of F q • It follows from (5.5) that
Uv-m+l,··· ,Uv are uniquely determined by gl,"" gr, Ul,'" ,Uv- m, and then
We have
Ul,···,uv-mEFq
Ul,···,uv-mEFq
and since bo =I- 0, (n, q) = 1, it follows that u = ( _1)n-1 bonu n, together with Un,
runs through all elements of F q • In that case, by Proposition 5.22,
and denote by K the minimal extension of the field Q(e 21Ti / P ) in which
m+n-\
P(z) = I1 (1- Wi Z ). (5.8)
i=\
Theorem 5.26. Letf(x) = !iSl (x)·· f;r(x), g(x) = bOX n +b\xn-\ + ... + b n be
non-constant polynomials in Fq[x] of degrees I and n, respectively, let s be a
positive divisor of q - 1, and let deg(fi ···fr) = m. If at least one of the following
two conditions holds:
L(z) =P(z).
Furthermore, if the algebraic numbers WI, ... ,Wm+n-\ are defined by (5.8) then
the sum (5.1) can be written in the form
m+n-\
Tv = - L wj, v = 1,2 ....
j=\
Curves over a Finite Field 129
L(z) = n
m+n-I
j=1
(1- w;z).
Next,
T m+n-I (m+n-I ) v
L: ~zv = L: L: L: wJ :.-
00 00
log(1 - Wjz) =-
v=1 v j=1 v=1 j=1 V
and hence
m+n-I
Tv = - L: wJ
j=1
Proposition 5.28. Let WI , ... ,Wr be complex numbers and c, R be positive num-
bers. If
IwI+ ... +w:1 ~cRV (5.9)
for all v = 1,2, ... , then IWjl ~ Rfor j = 1,2, ... ,r.
log(l- wt) =-
v=1
i: Wv zV,
V
and then
n(I - wjZ) = -
r v
L: (WI + ... + wn :.-.
00
log
j=1 v=1 v
130 Chapter 5
In view of(5.9) the series on the right converges for Izl < R- i , and hence the
function
r
10gTI(1- Wjz)
j=i
is regular in the disk Izl < R- i . In that case 1 - WjZ =1= 0 for Izl < R- i ; hence,
IWjl :::; R for allj = 1,2, ... ,r. •
To provide an easy way to construct smooth projective curves over a finite field
Fq with a lot Fq-rational points, we shall need some facts from the ramification
theory of Artin-Schreier extensions (for details, see Artin [4], Deuring [24], Serre
[167] and Stichtenoth [197]).
Field Extensions
At first we recall some well-known facts about finite field extensions. LetL = k(X)
be the function field on a smooth projective curve X over a field k. Let P be a
prime k-rational divisor (a closed point) onX and let vp be the normalized discrete
valuation of L associated with P (vp(L *) = Z). The field L can be regarded as a
finite field extension of the rational function field k(x).
Let L I k denote an algebraic function field of one variable with the full constant
field k. The field k is assumed to be perfect (i.e., all its algebraic extensions k' I k
are separable). For convenience, we fix some algebraically closed field L 2 Land
consider only extensions L' of L which lie in L. An algebraic function field L' I k'
is called an algebraic extension of Llk if L' 2 L is an algebraic extension of L
and k' 2 k. The algebraic extension L'lk' is called a constantfield extension if
L' = Lk', the composite field of Land k'. The algebraic extension L' I k' of L I k is
called afinite extension if degree [L' : L] of L' over L is finite. It is easy to check
thatL'lk' is a finite extension of Llk if and only if[k': k] < 00.
Let Llk = k(X) be the function field on a smooth projective curve X over k
and letP be a prime k-rational divisor on X. Let fJp be the local ring ofthe divisor
P and mp the unique maximal ideal offJp. There is a one-to-one correspondence
between the prime k-rational divisors P E Div(X) and the maximal ideals mp.
This correspondence allows us to identifY P with the corresponding equivalence
class of valuations of Llk, which is called a prime divisor of the field Llk. The
genus g = g(X) of X is also called the genus of Llk, and Div(X), Pic(X) are
denoted by Div(L), Pic(L).
Let L' I k' be a finite extension of L I k. Every prime divisor P' of L' I k' induces
the prime divisor P = P' nL E Div(L) of the field Llk. In this case we say that
Curves over a Finite Field 131
P' lies over P and write p' I P. If Vp is the nonnalized valuation of L I k associated
withP and Vpl is the valuation of L' Ik' associated withP' over P, the value group
vpl(L'*) = fpl ofvpl is a subgroup of the value group vp(L*) = Z ofvp, and the
index e(P' I P) = (Z : f pI) is called a ramification index of p' over P (in other
words, Vpl if) = e(P' I P)vpif) for any f E L *). We say that p' I P is ramified if
e(P' I P) > 1, and p' I P is unramified if e(P' I P) = 1. Let (9 p, mp and (9' Pi, m' pi
be local rings and maximal ideals of valuations Vp and Vpl, respectively. The
fields Lp = (9plmp and L' = (9 'PI Im'pl are called residue fields of P andP'. The
field Lp is canonically embedded into L~" andf(P'IP) = [L~, : Lp] is called the
relative degree of p' over P. In particular, degP = [Lp : k] and degP' = [L~, : k']
are degrees of prime divisors P and P'. It is easy to see that
For every prime divisor P of Llk, there is at least one prime divisor p' of L' Ik'
lying over P, and the number of such prime divisors p' is finite. Moreover, we
have
I
e(P' IP)f(P' IP) = [L': L].
P'IP
Let L' be a finite extension of L, and L; be the subfield of L' consisting of
all separable elements of L' over L (whose minimal polynomials over L have no
multiple roots). The degree [L; : L] is known as a separable degree of L' over
L, and is denoted by [L' : L ]s. The field L' is a purely inseparable extension of
L;, and the degree [L' : L;] is called an inseparable degree of L' over L (notation:
[L' : L]i). Suppose now that L'lk' is a finite extension of an algebraic function
field Llk, and that P' E Div(L') is a prime divisor lying over a prime divisor
P E Div(L). Then!s(P'IP) = [L~, : Lp]s is called a separable degree of p' over
P, andji(P'IP) = [L~, : Lpl]i is called an inseparable degree of p' over P. The
prime divisor p' is separable, inseparable, or purely inseparable according to the
cases whenji(P' IP) = l,ji(P'IP) > 1, orji(P' IP) = f(P' IP).
Suppose that L' I k' and L" I kIf are two finite extensions of an algebraic function
field L I k, and u : L' -+ Lff is an isomorphism of fields L' and L" which maps k'
onto kIf and leaves L fixed. For every prime divisor p' of L' I k' we define the prime
divisor uP' of Lff I kIf by setting
for allf E L" Ik" (we assume Vpl(O) = 00). The mapP' -+ uP' gives a one-to-one
correspondence between the prime divisors p' of L' I k' and the prime divisors p"
of L" Ik". If Llk,L'lk' andL" Ikff fonn a tower
Galois Extensions
Let L' I L be a nonnal extension of an algebraic function field L I k with the Galois
group G = Gal(L'IL). Let P be a prime divisor of Llk and let p' be a prime
divisor of L' I k' lying over P. Then every prime divisor of L' I k', lying over P,
has the fonn aP' for some u E G. The subgroup D(P'IP) of G, consisting of
elements u E G such that uP' = P', is called the decomposition group of p' over
P. The field L~, is a nonnal extension of Lp, and every element (j of the Galois
group of this extension is induced by some element u E D(P'IP). The subgroup
I(P'IP) of elements u E D(P' I P), such that (j E G(P' I P) is trivial on L~" is
called the inertia group of p' over P. Let r denote the number of prime divisors
p' of L' Ik' lying over P. Thus the following relations holds:
D= L ap·P
PEDiv(L)
be a divisor of the field Llk, letP' be a prime divisor of L' Ik' lying over P and let
e(P' I P) be the ramification index of P' over P. The divisor
of the field L' I k' is called a conorm of the divisor D. The map
is an embedding of the group Div(L) into the group Div(L'), which induces a
homomorphism con: Pic(L) -+ Pic(L'). For any D E Div(L) we have
[L" L]
deg(conLI/L(D)) = [k'; k] degD.
Suppose that L" Ik" is the smallest normal extension of Llk containing L', and k"
is the algebraic closure of k' in L". Then L" I k" is an algebraic function field with
the full constant field k". Let G = Gal(L" I L) and H be the subgroup of G which
leaves L' fixed. Let G I H be the set ofleft cosets of G with respect to H. If
D'= ~
£... 'P'
apl'
PIEDiv(L')
of the field Llk is called a norm of D'. The map D' 1--7 normL'/L(D') defines a
homomorphism
norm: Div(L') -+ Div(L).
If P' is a prime divisor of L' I k' lying over P E Div(L), then
degL' D = degD;
134 Chapter 5
(vii) for any prime divisor P' E Div(L') lying over P = P' nL,
normL'lL (J) = n
i=1
n
O'i(J).
field Lp((t)) offonnal power series in t over Lp. Every element of this field has
the fonn 00
Ol = L ai ti ,
i=m
where m is an integer, ai E Lp and am ::f. O.
Suppose now that L' I k' is a finite extension of L I k. Let P be a prime divisor
ofLlk andP' a prime divisorofL' Ik'lying over P. Thenlp can be embedded in
Dpl. Since the composite field L'lp is complete, contains L' and is contained in
DpI, we conclude that L'lp = Dpl. Denote the valuation rings of DpI and lp by
(9' pI and Op, and let
(9'pl)* = rd{PI/P)(9'PI ,
Diff(L'IL)= L Ld(P'IP),P'
PEDiv{L) pi / p
(i) A prime divisor p' E Div(L') lying over P is said to be tamely (resp. wildly)
ramified if e(P' I P) > 1 and p = chark does not divide e(P" I P) (resp. p
does divide e(P' I P)).
(ii) We say that P E Div(L) is ramified (resp. unramified) in L' I L if there is at
least one p' E Div(L') over P such that p' is ramified (resp. if all p' lying
over P are unramified). The prime divisor P is tamely ramified in L' I L if
it is ramified in L'IL and no prime divisor p' over P is wildly ramified. If
there is at least one wildly ramified prime divisor p' over P we say that P is
wildly ramified in L' I L.
(iii) P is totally ramified in L' I L ifthere is only one P' over P, and the ramification
index is e(P'1P) = [L' : L].
(iv) L'IL is said to be ramified (resp. unramified) if at least one prime divisor
P E Div(L) is ramified in L'IL (resp. all Pare unramified in L'IL).
136 Chapter 5
There is a close connection between e(P'1P) and d(P" I P) (see, for example,
Serre [167, Ch. III] and Stichtenoth [197, III.5]):
Proposition 5.30. Let L' I k' be a finite separable extension ofan algebraic func-
tion field L I k. Then we have:
(i) if P is a prime divisor ofL I k and p' is a prime divisor ofL' I k' lying over
P, thenP' is ramified ifand only ifP' E Supp(Diff(L'IL)). IfP'isramified,
then
(iii) (the Hurwitz genus formula) ifk' = k and g (resp. g') denotes the genus of
L I k (resp. L' I k), then
-()p= n'()p"
P'/P
and there exists a basis {Ii, ... ,In} of L' I L such that
n
fjp = L ()p 'j;, n=[L':Lj.
i=l
Any such basis {Ii , ... ,fn} is called an integral basis of fjP over () p.
Proposition 5.31. Suppose L' = L (z) is a finite separable extension ofa function
field Llk of degree [L' : Lj = n. Let P E Div(L) be a prime divisor such that the
minimal polynomial F(T) ofz over Llk has coefficients in ()p (i.e., z is integral
over ()p), and let P;, ... ,P; E Div(L') be all prime divisors lying over P. Let fjp
be the integral closure of()p in L'. Then the following holds:
Curves over a Finite Field 137
(iv) if at least one prime divisor P E Div(L) satisfies mp > 0, then k is alge-
braically closed in L', and
f = g+E(h) at P,
(ii) The prime divisor P is totally ramified in L', and the ramification index ofP'
over Pis e(P' I P) = q. Moreover, degP' = degP, and the different exponent
of p' I P in the extension L' I L is given by
(iv) Suppose that the prime divisor Q E Div(L) is a zero off - '}' with'}' E Fq.
The equation a q + a = '}' has q distinct roots a E k, and for any such a
there exists a unique prime divisor Q~ E Div(L') such that Q~ lies over Q,
and Q~ is a zero of z - a; in particular, the divisor Q splits completely in
L' (i.e., e(QaIQ) = f(QaIQ) = I for all Qa lying over Q).
Curves over a Finite Field 139
EXERCISES
5.1. Let Fq be a finite field of characteristic p #- 2,f(x) a polynomial in Fq [xl and (!!)
the generalized Legendre symbol (the multiplicative character of order 2) defined by
if a = 0,
if a#-O and a is a square in Fq ,
if a#-O and a is a non-square in Fq.
Show that:
be a quadratic form over Fq with non-zero discriminant d = al ... an. Show that:
(a) ifn = 2m then the number Nq of solutions ofthe equationJ(xl, ... ,xn ) = e in
elements Xl, ..• ,xn E Fq is expressed by
ife = 0
if e #- O.
(Hint: Use induction on m.)
140 Chapter 5
5.3. LetFq be a finite field and X (resp. t/J) be a multiplicative (resp. an additive) character
of Fq . Prove that for the Gaussian sum
T(X,t/J) = L X(x)t/J(x)
XEFq
the following relations hold:
(a) T(Xo, t/Io) = q;
(b) if X # XO, then T(X, t/Io) = 0;
(c) ift/J# t/Io,then T(xo,t/J) =0;
(d) if X # xo, t/J # t/Io, then IT(x, t/J)I = ql/2.
5.4. Let t/J be a non-trivial additive character of a finite field F q , let s be a positive divisor
of q - 1 and let a be a non-zero element of F q . In the notations of the previous
exercise prove that:
L I/J(f(X))!=ql/2.
!XEFq
5.5. Letf(xl, ... ,xn) be a polynomial in Fq[XI, ... ,xnJ and I/J a non-trivial additive char-
acter of F q • Prove that the number N q of solutions ofthe equationf(xl,'" ,xn) = 0
in elements Xl, . .. ,Xn E Fq is expressed by
5.6. Let Sl, .. . ,Sn be positive integers, So be their least common multiple and di =
(Si,q -I), 0::; i::; n. Show that:
(a) if Nq is the number of solutions in elements Xl, .. . ,Xn E Fq of the equation
then
Curves over a Finite Field 141
then
5.7. Let X (resp. l{!) be a non-trivial multiplicative (resp. a non-trivial additive) character
of Fq and let
Tv = Tv(X,l{!) = L Xv (x) l{!v (x).
XEFqv
Tv = (-l)v+ITr.
5.8. Let l{! be anon-trivial additive characterofFq , andletg(x) = boxn +bIXn- 1 + .. ·+bo
be a polynomial in Fq [xl of degree n with (n, q) = 1. Then let
Tv = Tv(g) = L l{!v(g(x))
xEFqv
and
L(g,z) = exp (i ~
v=1
zv) .
Show that:
(a) the L-function of ArtinL(g,z) has the form
0-1
(c) lf3n-ti = qT.
5.9. Let a, b be non-zero elements of a finite field Fq and let l{! be a non-trivial additive
character of F q . Then let
(c) 1f:l21 = q.
5.10. Letf(x) = liS! (x) .. f;' (x) be the decomposition of the polynomialf E Fq[x] into
irreducible factors in Fq [x], let s be a positive divisor of q - 1 and deg(fi .. fr) =
m. Then let X be a non-trivial multiplicative character of Fq of exponent s, let
(S,SI,'" ,Sr) = 1 and let
Sv = Sv(f) = L Xv(f(x)).
XEFqv
Show that:
(a) the L-function of Artin
L(f,z) = exp (f S:
v~l
zv)
m-!
(c) If:lm-Ii=q"''2.
Chapter 6
In this chapter we apply the technique we have worked out earlier to prove the
Riemann hypothesis for the zeta-function C(X,s) of a curve X defined over a finite
field F q . This result was proved for the first time by Hasse (in the case of elliptic
curves) and Weil (in the general case) using the correspondence theory on X. Here
we give an elementary proof based essentially on using only the Riemann-Roch
theorem (see Stepanov [184,185,187], Bombieri [17], Schmidt [159] and Stohr
and Voloch [200]).
(6.1)
(6.2)
with some positive constant c = c (X), and then we deduce (6.1) using the properties
of the zeta-function Z(X,s), established earlier in Section 5.2.
143
144 Chapter 6
Preliminary Bound
Let k' = Fq be an algebraic closure of the field k = F q . The method which we
shall use consists in the construction of a non-zero rational function f E k' (X) that
has zeroes of a sufficiently high order at every Fqv-rational point x E X (with the
possible exception of only one such point) and that does not have too many poles.
The inequality (6.2) is obtained now if we compare the number of zeros (taken
with their multiplicities) with the number of poles of the function f. To detennine
the functionf we use the Riemann-Roch theorem.
Whenever every extension Fqv of the field k = Fq is again the definition field
of X, we can assume without loss of generality that q II = p2r, where p is the
characteristic of F q .
Proof: We may assume that X has a point y with coordinates in Fqv, since
otherwise Nqv = O. Let Rm denote the k' -linear space of functions f E k' (X),
which are regular outside y and which have aty a pole of the order at most m, that
is vi!) ~ -m. The following facts can be easily proved:
(iv) there is a basis {ji, ... ,/s} of Rm such that vy(fi) < vy(fi+d for i =
1,2, ... ,s-1.
Indeed, we have
so that
Rm = tIJf=oRdRi- I .
By property (i), we have dimRd Ri-I :::; 1, and the result follows, since a basis can
be obtained by picking up for each i, when possible, one element of Ri not in Ri- I.
Let n, T be non-negative integers and UI, ••• , Us be elements of Rn. Consider
the auxiliary function
We have:
Counting Points on Curves over Finite Fields 145
(v) /fnpT < qV, thenf(x) is identically zero in k'(X) if and only if all the Ui(X)
are identically zero.
In fact, suppose thatf(x) is identically zero and that Uj(x) is the first Ui (x) which
is not identically zero. Taking the order aty of both sides of the identity
Therefore
pTvy(Uj) ~ _npT +qV(vy(!f+l) -vy(!f)) ~ _npT +qV > O.
This means that Uj(x) vanishes at y, and thus is a function with no poles and at
least one zero; hence Uj (x) is identically zero, contradicting our initial assumption.
(vi) /fm,n > 2g-2 and if(n -g+ l)(m -g+ 1) > npT +m -g+ 1 then we
can choose the Ui (x) not all identically zero, such that
WechoosepT = qv/2, n = qv/2 -1, m = qv/2 +2g. The conditions are satisfied
if qV > (g+ 1)4, which we have supposed, and we obtain at once the conclusion
ofthe lemma. •
Now we tum to the proof of the inequality (6.2).
146 Chapter 6
Proof: The function field k' (X) contains a purely transcendental subfield k' (u)
such that k' (X) is a separable extension of k' (u). Hence there is a nonnal extension
of k' (u) which is also nonnal over k' (X); geometrically, we have a situation
X' -t X -t pI ,
where X' -t pI and X' -t X are Galois coverings, with Galois groups G and
H respectively, H being a subgroup of G. Although this situation need not be
realized over the field Fqv, it will always be realized over a finite extension of it
and therefore for our purpose we may as well assume it is in fact realized over Fqv.
Ifx is a point of pI over Fqv and unramified inX -t pI, and ifx' is a point of
X'lying over x, we have
O"(x') = x,qV
for some 0" E G, called the Frobenius substitution of G at the point x'. Let
Nqv (X', 0") be the number of such points of X' with Frobenius substitution 0". The
same argument used in the proof of Lemma 6.2 gives
where g' is genus of X'; alternatively, one may note that Nqv (X', 0") = Nqv (X~),
where X~ is a curve over Fqv isomorphic to X' over an extension Fqmv, where m
is the order of element 0" E G (X~ is thus a certain twisting of X' by means of 0"),
and the lemma can be applied directly. We have
(the O( 1) takes care of the branch points of the covering) and, since Nqv (pI) =
qV + 1, the upper bound for Nqv(X',O") implies
We have also
L Nqv (X', 0") = IHI·Nqv(X)+O(l),
CIEH
whence
Nqv = Nqv (X) = qV + 0(qv/2),
and this completes the proof of the theorem.
•
Counting Points on Curves over Finite Fields 147
then
2g
Nqv = q" + 1 - L wj. (6.3)
i=1
Now we show that all zeros wi- I of the function Z (X, t) lie on the circle
It I = q-I/2 in the complex plane Co This is equivalent to the condition that all
zeros of ,(X,s) = Z(X,t) lie on the line Res = 1/2. Indeed, it follows from
Theorem 6.3 that the series
converges absolutely in the disk It I < q-I/2. Hence the function Z(X,t) has no
zero for It I < q-I/2. Moreover, by virtue of functional equation it also has no zero
for It I > q-I/2. In that case all the zeros ofZ(X,t) lie on the circle It I = q-I/2, so
that Iwd = ql/2 for i = 1,2, ... ,2g.
From (6.3) we obtain the following inequality
2g
INqv -q" -11 ~ L Iwd" = 2gq,,/2,
i=1
The question concerning upper bounds for absolute values of character sums
with multiplicative and additive characters X and l/J of the field Fq is closely
connected with the question of the number of Fqv -rational points of superelliptic
and Artin-Schreier curves defined over Fq by equations
respectively.
Superelliptic Curves
Let s' = (s,q -1) and s = s'r. Since (r,q -1) = 1 then z = yr runs through all
elements of Fq as y does. Hence the number of solutions x,y E Fq
(6.4)
is the same as the number of solutions x,y E Fq of the equation yS' = f(x).
Therefore, we can assume without loss of generality that s is a divisor of q - 1.
Proposition 6.4. The number Nqv ofsolutions ofequation (6.4) in elements x,y E
Fqv is given by
where the external sum is over all multiplicative characters X of the field Fq of
exponent s.
Proposition 6.5. Let f(x) be a polynomialin Fq [x]. The following conditions are
equivalent:
(i) yS - f(x) is absolutely irreducible;
(ii) iff = !is) .. -//r is the decomposition off into distinct irreducible factors in
Fq[x] then (s,S), ... ,sr) = 1.
Theorem 6.7. Letf(x) be a polynomial in F[x). letf = /{l .. -fir be its decom-
position into distinct irreducible factors /; E Fq [x). let m = deg(ti .. ·fr) and let
X be a non-trivial multiplicative character of Fq of exponent s. Suppose that
(S,Si, ... ,Sr) = 1. Then
L Xv(f(x)) ~ (m _1)qv/2
XEFqv
for all v ~ 1.
and hence
and therefore
m-i
L L wj'(X) ~ cqv/2
ind.¥=sj=i
X;eXo
for all v ~ 1. Applying Proposition 5.28 we obtain the required result. •
Corollary 6.S. Let Nqv be the number of Fqv-rational points on absolute curve
X defined over Fq by equation (6.4). and m the number of distinct roots of the
polynomialf E Fq[x) in algebraic closure Fq ofthefield Fq. Then
(6.5)
Proposition 6.9. Let Fq be a finite field and v > 1 an integer. Then the number
Nqv ofsolutions x,y E Fqv of equation (6.5) is given by
where the external sum is over all additive characters 1/1 of the field Fq.
v"'if) degfj
=max--
199 i
and suppose that 8if) = lis with (l,s) = 1. Thenf(x,y) is absolutely irreducible.
Proof: The proof of the theorem is completely similar to the proof of Theorem
6.7. •
Corollary 6.13. Let g(x) be a polynomial in Fq [x] of degree n ~ 1 and Nqv the
number ofFq v-rational points ofthe affine curve X defined over Fq by the equation
(6.5). If(n,q) = 1 and v> 1 then
INqv-qVI:::; (n_1)(q_1)qv/2.
Counting Points on Curves over Finite Fields 151
Lower Bounds
The results of Theorems 6.7 and 6.12 can not be improved in general (Stepanov
[186, 189]):
if v = 2n
if v = 2n+ I
such that
( v/2 I) v/2 if v = 2n
L Xv(f(x)) = { ~ _ 1- q
if v=2n+1
XEFqV q
Proof: Let v > I be an even number. Since xqV = x for any x E Fqv, we have
v(
- IT v
norm vf( x ) - x+x qv/2 )qi-I -_ IT (qi-I
x +x qV/2+i-l)
i=1 i=1
v/2. . v/2 . .
= IT(X ql - 1+xqV/2+I-I)IT(xqV/2+J-1 +xql-I)
i-I j=1
- IT(
-
v/2.
ql-I
.
+xqV/2+'-1)2 ,
X
i=1
and therefore
-
norm v f( x ) -
IT
(v-I)/2 .
(ql-I
x +x
q(V+I)/2+,-1)2
.
IT (.
(v+I)/2
Xql-I +x
.
q(V-I)/2+ -I)2
J
i=1 j=1
152 Chapter 6
and therefore
L x,,(f(x)) = qV - I,
XEFqv
as required. •
Since a E Fqv is a square if and only if norm" (a) is a square in Fq we obtain
the following result.
Corollary 6.15. Let Nqv be the number of solutions in x,y E Fqv of the equation
y2 = f(x), where f(x) is the polynomial from Theorem 6.14 with
qv/2 if v = 2n
{
degf= q(,,-I)/2(q+l) if v=2n+1
Then
_ { q" + (q,,/2 _ l)qv/2 if v = 2n
Nqv- 2qV-I if v=2n+1
It is possible to extend the result of Theorem 6.14 to the case of arbitrary
non-trivial multiplicative character X of exponent s ~ 2, where s is a divisor of
q - 1, and construct the corresponding affine curve
X :yS = f(x)
with a lot of Fqv-rational points (see Gluhov [62]).
U
2 +2~(v-2)/2 ql+1
L.,[=I U
+ Uqv/2+1 if v = 2n
g(u) = { 2 +2~(v-I)/2 q'+1
u L.,[=I u if v=2n+l
such that
L I/Iv(g(u)) = qv-I/2.
UEFqv
Proof: Let {WI, ... , wv } be a basis of the field Fqv over Fq. Every element
x E Fqv can be uniquely written as a linear combination
X =XIWI +···+xvwv
of the basis elements WI, ... , Wv with coefficients Xj E Fq. Next, if u(x) = xq is
the Frobenius automorphism of Fqv, we have
Let us consider the above linear combinations as a system oflinear equations with
respect to unknowns Xj. The determinant
of the system differs from zero and hence (by Cramer's rule)
where
A .• _ (
alJ - -
l)i+j dt(
e u I-I( Wk )) I <O,1,k<O,v,l-cli,k-h .
Clearly
u(~) = (_1)v-I~,
U(~i-IJ) = (_1)V-2~ij
and therefore
L uqi-Iul-I
19<k<O,v
in the form
v-I v-i
L uqi-l Uqk - I = L L (u ql +1)qi-l
19<k<O,v i=I/=1
Counting Points on Curves over Finite Fields 155
(1'-1)/2(1'-1)/2 .
= L L (U q1 +1 )ql-l
i=1 1=1
v-I v-i .
+ L L(Uq1+I)ql-l
i=(v+I)/2/=1
(v-I) v-i .
+ L L (U q1 +1)ql-l .
i=1 1=(v+1)/2
and therefore
. (1'-1)/2(1'-1)/2 .
L Uq1 - 1Uqk - 1 = L L (u q1 +1 )ql-l
I::;i<k::;v i=1 1=1
v-I v-i .
+ L L(uq1+I)ql-l
i=(v+I)/21=1
v (1'-1)/2 .
+ L L (u q1 +1 )ql-l
i=( v+3)/21=v-i+ I
Now define
1=1
For every u E Fqv we have
xj = trv(g(u))
156 Chapter 6
and hence
we obtain
L I/I,,(g(u)) = q,,-1/2,
UEFqv
and setting
we obtain again
L I/I,,(g(u)) = q,,-1/2.
UEFqv
6.3. ASYMPTOTICS
There are two important cases when we are able to determine the number of Fq-
rational points on a smooth projective curve X of genus g(X) defined over Fq.
In the first case, the genus g(X) is fixed and q -+ 00. This situation is typical for
various number-theoretic problems. In the second case, which is closely related
to coding theory, the field Fq is fixed and g(X) -+ 00. At first we consider the
number-theoretic aspect of the problem.
INq(g) - q - 11 :S L2gyqJ.
Theorem 6.19 (the Serre bound). Let Nq = Nq(X) be the number of Fq-rational
points on a smooth projective curve X over Fq. Then
Z(X t) _ rri!1
(1- wit)
, - (l-t)(l-qt)
be the zeta-function of X. The complex numbers WI , ... ,W2g are algebraic integers
with IWil = ql/2. They can be ordered in such way that WiWg+i = q, hence
Set
and note that (Xi, (3i are positive real algebraic integers.
Galois group Ga1(K/Q) of the fieldK = Q(WI, ... , W2g) overQ permutes WI,
... , W2g. Moreover, if U E Gal(K /Q) is such that U( Wi) = Wj then
are invariant under action of the group Gal(K IQ). This implies that a,13 E Q
(see [107, p. 301]), and since a,13 are algebraic integers then a,13 E Z (see [107,
p. 359]). The numbers a, 13 are positive, so we have
g g
TIa; ~ I and TII3; ~ l.
;=1 ;=1
Now the well-known inequality between the arithmetic and geometric mean yields
-1 La;
g ( g
~ TIa;
) l/g
,
g ;=1 ;=1
and hence
and hence
N q <5": q + 1 + gl2q1/2 J.
In the same way, the inequality
implies that
N q ~ q+ l-gl2q 1/2J.
This proves the theorem. •
By a variety of methods N q (g) can be determined for low genera and for
various q (see Serre [168]):
Counting Points on Curves over Finite Fields 159
Nq(2) = q + 1 +2r,
Nq(2)=q+2r or Nq=q+2r-l,
depending upon whether {2vq - r} > (vq - 1) /2 or not, where {lX} de-
notes the fractional part of lX.
A table for small q (see also [59, 60, 61]) is given below:
q 2 3 4 5 7 8 9 11 13 16 17 19 23 25 27
Nq(l) 5 7 9 10 13 14 16 18 21 25 26 28 33 36 38
Nq(2) 6 8 10 12 16 18 20 24 26 33 32 36 42 46 48
Nq(3) 7 10 14 16 20 24 28 28 32 38 40 44 ? 56 ?
N'l(4) 8 12 15 18 ? ? ? ? ? ? ? ? ? 66 ?
Of course one can also keep q fixed and vary g. For q = 2 a table of results is:
g 0 2 3 4 5 6 7 8 9 15 19 21 39 50
N2(g) 3 5 6 7 8 9 10 10 11 12 17 20 21 33 40
We see in particular that the Klein quartic (see Chapter 5) over Fs has the
maximum number of rational points (= 24).
160 Chapter 6
Maximal Curves
It is clear that a curve X over Fq of genus g = g(X) is maximal only when q is a
square, say q = q,2. Next, if
P(t)
Z(X,t) = (I-t)(l-qt)
Proof: Let WI, ... , W2g be the reciprocals of the roots of P(t). Since
2g
Nq =q+ 1- LW; and
;=1
Counting Points on Curves over Finite Fields 161
ro,-
o _ _ ql/2 , 1~ i ~ 2go
Now we consider the number Nq2 of Fq2-rational points on X. We have Nq2 2: Nq
and
2g
Nq 2 = q2 + 1 - L rol = q2 + 1 - 2gq.
j=1
Thus,
and hence
q _ql/2
g~ 2
This completes the proof.
There are further restrictions on the genus of maximal curves. For example,
•
any smooth proj ective curve X over F q (q = q'2) of genus g, where g satisfies
Asymptotic Bounds
One strategy for producing good codes with the help of algebraic curves is by
taking a curve of genus g over a fixed finite field Fq with a lot of Fq-rational
points, say XI, ... ,Xn . This allows us to construct codes (see Chapter 10) with
k2:n-d-g+l,
i.e., with
R>I_8_ g - 1 .
- n
Let Nq = Nq(X) be the number of Fq-rational points of a curve X of genus
g = g(X) defined over Fq. In order to maximize the last inequality asymptotically,
one looks for a family of curves X with Nq (X) as large as possible. We define
. Nq(X)
A(q) = hm sup (X)'
g(X)--t oo g
where X runs over all smooth projective curves over Fq (up to isomorphism over
Fq). From the Serre bound we deduce immediately
162 Chapter 6
In the case when g(X) is large with respect to q, Ihara [85] improved this bound
as follows:
Theorem 6.22 (the Ihara theorem). We have
() y'8q+ 1-1
A q ~ 2 .
The idea of the proof of this theorem is very simple. If the Wi (in the notation
of Theorem 5.17) have arguments near 7T' then Nq is big; but then the squares of
Wi have arguments near 27T' and hence N q2 is small. However, we have Nq ~ N q2.
This bound was improved by Vladut and Drinfeld [215] (refining Ihara's proof):
Theorem 6.23 (the Drinfeld-Vladut theorem). One has
A(q)~vq-l.
rr;!1
(1- wit)
Z(X,t) = (l-t)(I-qt)
hence
2g
L a[ ~ qv/2 +q-v/2 -Nq . q-v/2. (6.6)
i=1
On the other hand for every ai E C and for any positive integer n we have
o~lia[12=
v=1
i
V,T=O
a[-T
n
= n+ 1 + L (n+ 1- v)(a[ +aiV). (6.7)
v=1
Hence
Nq ~ n + 1 - v -v/2 < 1
L... +1 q - +
!"
L...
n + 1- v (v/2
1 q +q
-v/2).
g v=1 n g n+
If g and n tend to infinity in such a way that n /logq g -+ 0 then it follows that for
anye > 0
Nq 1
-·--<1+13.
g y'q-1-
This proves the theorem.
Later we shall demonstrate in several ways that the above upper bound is exact
•
forq =p2v.
For an arbitrary q, it was proved by Serre that A (q) > O. In fact, he proved that
there exists a constant c > 0 such that
L = L1 ~ L2 ~ L3 ~ ....
Since all the above extensions are unramified, the genus g(Li) can be calculated
by the Hurwitz genus fonnula:
A(q)? ~.
g-I
By an optimal choice of Land S we obtain the Serre lower bound (6.8). This
gives also a possibility to find lower bounds for A (q) in many concrete cases, for
example,A(2) ? 2/9 (see [I63]),A(3) ? 1/3, andA(5) ? 1/2 (see [233]).
The Serre lower bound can be essentially improved for many q of a special
form (see Perret [144], Schoof [163], Xing [233], Zink [237]). For example,
using deep results from the theory of Shimura varieties, Zink [237] obtained the
following lower bound
Tame Extensions
Let {Xi} be a sequence of smooth projective curves over F q , and {Li} the cor-
responding sequence of algebraic function fields. Denote by g(Li) the genus of
Li and by Nq (Li) the number of prime divisors of Li of degree 1. A tower of
function fields over Fq is a sequence £.- = {Li} of function fields Li / Fq satisfYing
the following conditions:
(ii) for every i ? 1, the extension LH IiLi is separable of degree [LH I : Ld > 1;
Since 0::; A(q ::; A(q), any tower of function fields over Fq provides a lower
bound for A(q). We call £.- asymptotically good (resp. asymptotically optimal) if
A(q > 0 (resp. A(q = A(q)). The notion of asymptotically good sequences of
function fields is closely related to the notion of asymptotically good sequences
of codes. For example, let {Li} be a tower of function fields, and {Cd the
corresponding sequence of geometric Goppa codes, coming from the curves X;
(see Chapter 10). If
· Nq(X;)
1I 1
m-->
Hoo g(X;) ,
Counting Points on Curves over Finite Fields 165
the sequence {Cj} of linear codes Cj is asymptotically good. This reduces the
problem of constructing asymptotically good codes to the problem of constructing
asymptotically good sequences of function fields over a given finite field F q .
Now we consider several examples of asymptotically good towers of function
fields over Fq for an arbitrary q, proposed recently by Garcia and Stichtenoth
[55]. Let P(L) be the set of all prime divisors of a function field L / Fq . Given a
finite extension L' / L and a prime divisor P E P(L), there are finitely many prime
divisors P' E P(L') lying over P. We recall that the extension L' / L is tame if the
ramification index e(P' / P) is relatively prime to the characteristic of Fq , for all
P E P(L) andallP'/P.
Theorem 6.24. Let f., = {Lj} be a tower offunction fields over Fq satisfying the
follOWing conditions:
(i) all extensions Li+I/Lj are tame;
(ii) the set
isfinite;
(iii) the set
is non-empty.
Then the tower f., is asymptotically good, and
21TI
A(J:.,) ~ 2g(LI)-2+s'
where
s = L degP.
PES
Proof: Since L;JL I is tame, the degree of the different Diff(L;JL d is given by
degDiff(L;JLd= L L(e(P'/P)-l).degP'.
PESP'/P
we obtain
degDiff(L;/LI) :::; [L; : Ld L degP = [L; : Ld ·s.
PES
Now the Hurwitz genus fonnula implies
Observe that this inequality implies that 2g(L;) - 2 + s > 0, since g(L;) -+ 00 as
i -+ 00.
On the other hand, we have Nq(L;) 2: ITI· [L; : Ld by condition (iii), and
therefore
Nq(Li) > 21TI
g(L;) - 2g(L;) -2+s +2/[L; : Ld
for all i 2: 2. This shows that
Proposition 6.25. Let m > I be an integer with q == I mod (m), and let So ~ Fq
be a subset of Fq with 0 E So. Suppose thatf(u) E Fq[uJ is a polynomial whose
leading coefficient is an mth power in Fq satisfying the following conditions:
(c) for each a E So, all roots of the equationf(u) = am lie in So.
Proof: First we consider the extension L2/ LI, where L2 = LI (X2) and
Let PIE P (L I) be the zero of XI in L I and let P2 be a prime divisorlying over PI.
Ifv2 denotes the corresponding discrete valuation of L2, we have from (6.13)
= xl .Ji (Xi).
X~I
Dividing by x't and setting y = XH 1/Xi, we obtain
(6.14)
where (3 is the leading coefficient of f( u) and the function z has a zero at the
prime divisor Q. The reduction of the equation (6.14) modulo Q gives ym == (3
mod(Q), and since the equation um = {3 has m distinct roots onFq , it follows from
the well-known Kummer theorem (see Stichtenoth [197, 111.3.7]) that the prime
divisor Q splits completely in Li+I/Li. As a consequence, we have Nq(Li) 2: m i- I
and therefore g(Li) --+ 00 as i --+ 00.
We have proved that L = {Li} is a tower of function fields over Fq with the
properties (i) and (iii). Now we prove the property (ii). Suppose that P E P(LI) is
ramified inLdLI. Choose Q E P(L i ) with e(Q/P) > 1 and letPj = QnLj be the
restriction of Q to Lj . Since Q/ P is ramified, then Pj+1/Pj is ramified for some
j 2: 1. From the equation
(6.15)
and from the ramification theory of Kummer extensions (see [197, III.7.3]), it
follows that PHI is a zero of xHI. Denoting by x(Q) the residue class of an
element X ELi modulo Q, we obtain from (6.15) that
The condition (c) of the theorem implies thatxj(Q) E So. Repeating this process,
we find that Xj_1 (Q), ... ,X2(Q),XI (Q) E So. Hence the property (ii) holds.
168 Chapter 6
A(,c) ~ ~2.
q-
Proof: Let So = Fq andf(u) = 1- (u + l)m. Conditions (a) and (b) of Proposition
6.25 hold obviously. In order to verifY the condition (c), let a E Fq and
I - ( 1 + 1)m = am.
If am = 1 then 1 = -1 E F q . If am i- 1 then 1 - am E F; (observe that a H am
is the norm map from Fq onto Fp). Hence
A(L)~~2·
q-
Counting Points on Curves over Finite Fields 169
In particular,
(ql/2 _I)2g::::; hq(X)::::; (qi/2+ I)2g.
One can improve upon this fact in the following situation. Let X be a curve from
a family of curves of growing genus g such that
. . logq hq (X) q
hm lOf (X) ~ I+clogq(--I)·
g(X)-t oo g q-
EXERCISES
6.1. Let Fqv be a finite extension of Fq. Given positive integer s I (q - 1), s > 1 and
z E Fqv show that the number of elements y E Fqv with y' = z is equal to
then
IT,,(j,g) I ::; (m +n _1)q"j2.
6.6. Let Fq be a finite field of characteristic p > 2, and f(x) a non-zero polynomial in
Fq[xJ. Prove that all solutions x E Fq of the equation
!L.!
I ±f 2 (x) = 0
are at least double roots of the polynomial
then compare the number of roots ofthe polynomial R(x) with its degree.)
6.7. Let Fq be a finite field of characteristic p > 2, andf(x) = ax 2 + bx + c E Fq [xl a
polynomial of degree 2 with the non-zero discriminant D(J) = b2 - 4ac. Prove that
the number Nq of Fq-rational points on the curve y2 = f(x) is
N q =q- (~).
6.8. Let X be the Klein quartic defined over F2 by
x 3y+y3 z + z 3x = O.
1 +5t+8t6
Z(X,t) = (l-t)(1-2t)
and then deduce that the number N2v of Fq-rational points of X is given by
ifv~O mod(3)
ifv:=O mod(3) ,
and
h2v=(h2v/3)3 ifv:=O mod(3),
where S3v are integers defined by the recurrence relation:
i+y=x 5 + 1.
Show that:
(a) X has genus 2;
(b) the zeta-function Z(X,t) of X has the fonn
Z(X )= (1+2t-2t2)(1+2t+2t2).
,t (l-t)(1-2t) '
if v #4n
ifv=4n
6.12. Let p > 2 be a prime and I t= 0 mod (P) be an integer. For p = 4k + 3, prove that the
Jakobsthal sum
S(/)=
x=!
f:(x3+ 1X )
p
is zero. For p = 4k + 1, prove that:
(a) S(l) is an even number;
The aim of this part is to give an introduction to the theory of modular curves
insofar as they apply to the construction of geometric Goppa codes on modular
curves. For reasons of space our treatment will be rather brief. In fact we shall
try to emphasize those aspects of the theory which are of a classical nature and
are easy to comprehend with a minimum knowledge of algebraic geometry. As
our program is to make available a formula for counting the number of rational
points on a modular curve over a finite field, we shall first develop some notions
leading up to Igusa's description of a model of modular curve Xo(N) defined over
the integers Z with a good reduction modulo every prime p which does not divide
the level N. Then we recall the basic results of Eichler and Shimura relating the
trace ofthe Hecke operators to the trace ofFrobenius acting on the I-primary part
of the torsion points of the Jacobian variety of Xo(N). Finally we will give the
formula of Eichler and Selberg for the trace of the Hecke operators.
173
Chapter 7
Elliptic Curves
The theory of elliptic curves (curves of genus I having a specified basepoint xo)
is varied and rich, and provides a good example of the profound connections
between abstract algebraic geometry, complex analysis, and number theory. The
most important property is that any elliptic curve is an abelian variety.
A moduli space is, roughly speaking, a variety whose points classify the
isomorphism classes of some kind of object, e.g., algebraic curves of a certain
type. The modular curves we shall be concerned with are moduli spaces of elliptic
curves.
In Section 4.4 we have stated that there is a bijection between an elliptic curve E
over an algebraically closed field k and Pico(E) given by x f-7 (x -xo) for some
Xo E E. Since Pico(E) is a group, E is also a group whose zero element is Xo (see
also Proposition 9.1 below).
Let us describe the group law by geometric means. To do this we consider the
map <I> : E f-7 jp>2 which is defined by the complete linear system 13 ,xol (note that
1(3 . xo) = 3 by the Riemann-Roch theorem). Then we obtain the following result:
175
176 Chapter 7
Then the functions I, u, v, u2, uv, v 2 and u3 lie in L (6 . xo). It follows from the
Riemann-Roch theorem that l(n ·xo) = n for any n 2: I, hence these functions are
linearly dependent over k, so
Since only v 2 and u 3 have a pole of order 6 at xo, their coefficients in this linear
relation do not vanish. Multiplying u and v by appropriate non-zero elements of
k we can assume that the relation has the form
(7.1)
Figure 7.1.
for coordinates of the point X3. If Xl = (UI : vI : 1) and X2 = (U2 : V2 : 1), then
X3 = XI + X2 can be obtained as follows: let x~ = (u~ : v~ : 1) be the third point of
intersection of the line through XI and X2 with E, then X3 = Xl +X2 is the reflection
of x~ (see Fig. 7.1). It is easy to see that
(7.3)
Theorem 7.2. The group E(k) is divisible, i.e.Jor any positive integer N and any
XE E(k) there exists Xl E E(k) such that N 'XI = X in the group E(k).
and v'. We obtain two equations FN{U', v') = 0 and GN{U',V') = 0, whereFN and
GN are polynomials whose coefficients depend on u, v and A. One can show that
since k is algebraically closed this system has a solution. •
Automorphisms
An isomorphism cp : X -+ X of a curve X onto itself is called an automorphism.
The group of automorphisms of X is denoted Aut{X) or Autk (X). If X = pi then
Aut{X) = PGL2{k) = GL2{k)/1*, 1* being the center of GL2{k) consisting of
matrices of the form
(~ ~), a E k*.
Since E is an abelian variety we have:
Theorem 7.3. For each fixed x' E E the map x I-t x + x' is an automorphism ofE
(as of an algebraic variety).
Corollary 7.4. The group Aut{E) operates on E transitively.
We see that Aut{E) contains E as subgroup. This subgroup is normal in
Aut{E) and for p = chark =1= 2,3 the factor group G = Aut{E)/E is a finite group
of order 2,4 or 6. For p = 3 the order of G is a divisor of 12 and for p = 2 it is a
divisor of 24. Therefore for g{X) = 1, the group ofautomorphisms Aut{X) ofa
curve X defined over an algebraically closed field k is infinite. On the other hand
for g = g{X) ~ 2 we have:
Theorem 7.5. If g ~ 2 then Aut{X) is finite. Moreover, if chark = 0 then
JAut{X)J ::; 84{g-I).
Our first topic is to define thej-invariant of an elliptic curve, and to show that it
classifies curves up to isomorphism. Since j can be any element of the ground field
k, this will show that the affine line Al is a variety of moduli for elliptic curves
over k.
Let us assume that chark =1= 2. Note that the value Afrom (7.2) can be different
for isomorphic elliptic curves. In particular, the equations of the form (7.2) with
A and A' = 1/A define isomorphic curves. Let us set
.. 8{.\2-'\+1)3
)=)(E)=2 .\2(,\-1)2 (7.4)
Elliptic Curves 179
This value is called thej-invariant (or the absolute invariant) of E. Note that the
coefficient 28 is introduced to make sure that thej-invariant has integer coefficients
being expanded into a power series in some natural variable t (see Section 8.2
below). Our main result then is the following:
(i) the valuej =j(A) =j(E) depends only on the isomorphism class ofE;
(ii) two elliptic curves E and E' are isomorphic if and only ifj(E) = j(E');
(iii) every element ofk occurs as thej-invariant of some elliptic curve E over k.
Proof:
(i) Note that j(A) = j(A') for every A' E A = {A, l/A, 1 - A, 1 - 1/(1-
A),A/(A-l),(A-l)/A}, which can be checked directly. If we write
an equation of E in the form (7.2) then the projectionf(u, v) = u defines a
morphismf : E ---+ pI of degree 2 with four ramification points 0, 1, A and
00. Let
(ii) Let E, E' be elliptic curves, and A, A' their Legendre moduli. Letj(A) =
j(A'). Considering A' as a variable and A as a parameter, we obtain an
equation of degree 6 in A', vanishing on A. Therefore it has no other roots
and hence E and E' are isomorphic.
For chark i= 2,3 another form of the equation of an elliptic curve E is quite
useful. Making the substitution U H U - (A + 1)/3 in (7.2) we get an equation of
the form
(7.5)
Usually one makes the substitution v H 4v, U H 4u in (7.5) and writes the equation
in the form
v 2 =4u 3 -g2 u -g3, (7.6)
which is called Weierstrass normal form of E. It is easy to check that
3
j(E) = 1728 3 g2 ~.
g2 -27 3
Theorem 7.6 also remains valid for chark = 2. We give no proof in this case
and only define the absolute invariantj(E). Making the substitution U H U + a in
(7.1) we get
V 2 +CIUV +C3V = u 3 +C4U +C6. (7.7)
Thenj(E) = cF /a, where
7.3. ISOGENIES
In the case of complex elliptic curves this result will be proved in Section 7.5
(for the general case, see Silverman [177, III, §6]).
Now we describe Epv for p = char k and v 2: 1. To begin with, we assume that
v = 1. Let E(P) be the elliptic curve obtained from E by raising the coefficients
to the pth power, let f = /P : E -t E(P) be the Frobenius morphism given by
(u,v) t-+ (uP,vP), andf* : E(P) -t E be its dual morphism. Then by Proposition
7.7 we havef f* = f* f = PE and sincef is a purely inseparable morphism,PE
is not separable. Iff* is a separable morphism then E is ordinary, and ifj* is not
separable then E is a supersingular elliptic curve. Note that for supersingular E
we havej* = f, sincef is the only purely inseparable morphism of degree p.
Proof: From the above argument it follows that the proposition holds for v = 1.
For any v > 1 it can be easily deduced by induction on v. •
Theorem 7.10. Let p = chark > 2 and let E be the curve defined by v 2 = u(u-
1) (u - ,.\). Then E is supersingular if and only if
±(~)2,.\;
;=\ I
=0, (7.8)
182 Chapter 7
where s = (p - 1)/2. Infact, there are exactly lP/12 J+ l>p supersingular elliptic
curves E (up to isomorphism) over k, where l>3 = 1, and for p :::=: 5,
Proof: See Hartshorne [73, p. 333], Husemoller [81, Ch. 13, §4], or Silverman
[177, V, §4] •
Therefore all supersingular values of the modulus A and ofthej-invariant lie
in a finite field. Moreover, one has the following fact:
Proposition 7.11. Let p = chark > 2, and letj = j(E) be a supersingular value
ofj-invariants. Thenj E Fp 2.
Proof: We see from Theorem 7.10 thatj E Fp , hence it is sufficient to show that
jP2 = j. For supersingular curve E we have f =f* and since PE : E 4 E (P) C E
is purely inseparable isogeny of degree p2, we conclude that PE = fi,
where fi :
E -+ E(p2) is the Frobenius morphism. Hence E ~ E(P2), andj(E) = jp2 (E). •
Note that for p = 2 there exits only one supersingular curve E (withj(E) = 0)
which can be given by v 2 + V = u3 .
Homomorphisms
Let E and E' be elliptic curves. The set of algebraic group morphisms f : E -+ E'
(i.e., ofmorphisms which are group homomorphisms) is denoted by Hom(E,E').
If E = E' it is denoted by End(E). Note that Hom(E,E') is an abelian group since
we can add its elements: if + g)(x) =f(x) + g(x). Moreover, End(E) is a ring:
multiplication is the composition of morphisms. It is clear that Hom(E, E') has
no torsion since the condition N f = 0 implies thatf(E) is contained in the finite
set EN and hence is trivial.
Studying the behavior of morphism at torsion points of E one can prove the
following proposition:
Theorem 7.13. These are the following possibilities for the division algebra
Endo(E):
(i) Endo(E) = Q;
Proof: See Lang [108, Ch. 13, §I and §2], or Husemoller [81, Ch. 12, §4] and
[81, Ch. 13, §6]. •
Therefore End(E) is a free Z-module generating Endo(E) over Q. In other
words End(E) is an order in the division algebra Endo(E).
Theorem 7.14. These are the following possibilities for the order End(E):
(i) End(E) = Z;
(ii) End(E) = Z+m(9k, where mE Z, m:j. 0 mod(p), (9k being the maximal
order in the imaginary quadratic field k = Endo (E) (in this case m is called
the conductor ofEnd(E));
Proof: See Lang [108, Ch. 13, §I and §2], or Silverman [177, III, §9]. •
Automorphisms
Theorem 7.14 makes it possible to determine the group Auto (E) of automorphisms
of an elliptic curve E as an algebraic group (i.e., of those preserving the initial
point xo) which is isomorphic to the group End' (E) of units ofEnd(E).
Proof: See Hartshorne [73, p. 321], Lang [l08, Appendix 1] or Exercise 7.1. •
Note that Theorem 7.15 also gives a description of Aut(E) since the group
Aut(E) of automorphisms of the curve E is a semi-direct product of E(k) by
Auto (E). One has the following "mass-formula" of Eichler and Deuring (see
Husemoller [81, Ch. 13, §4)):
-I p-l
LIAut(E)1 = 24'
where the sum is taken over the set of isomorphism classes of supersingular curves
in characteristic p > o.
The theory of elliptic curves outlined above concerns the case of algebraically
closed ground field, while we are mainly interested in elliptic curves over a finite
field k = Fq with q = pI! elements. To study this case one should make some
changes in the theory. The definition of an elliptic curve over a finite field is the
same as in the case of an algebraically closed field, except that we need to check
that an elliptic curve has at least one Fq-rational point. This follows from Theorem
6.1, since
IE(Fq)l2: q + 1-2y'q = (y'q _1)2> 0
and hence IE(Fq)l2: 1. Letxo E E(Fq). Ifwe consider Xo as the zero element, we
obtain a group structure on the finite set E(Fq). Moreover, using the Riemann-
Roch theorem which is valid over an arbitrary ground field, we can write down an
equation of E in the form
We have seen above that for an algebraically closed ground field, the j-invariant
classifies isomorphism classes of elliptic curves. This is not the case over a finite
field (nor for the most part over non-closed fields). From Theorem 7.15 one can
deduce:
Proposition 7.16. Letj(E) = j(E'). Then E and E' are isomorphic over afinite
extension K o/the groundfield k such that [K : k] divides 24. More preCisely, we
have:
(i) Ifp = char k i= 2,3 then [K : k] divides 4 or 6.
(ii) Ifj(E) i= 0 or 1728 then [K : k] = 1 or 2.
Note also that there exist elliptic curves E and E' over Fq such thatj(E) =
j(E') E Fq and HomFq (E ,E') = 0, i.e., E and E' are not isogenous over Fq.
Elliptic Curves 185
Theorem 7.17. Let E and E' be elliptic curves over a finite field F q. Then E is
isogenous to E' iJand only iJIE(Fq) I = IE'(Fq)l.
Endomorphisms
Elliptic curves over finite fields have an abundant set of endomorphisms. To
be more precise, let EndFq (E) be the subring of End(E) which is formed by
morphisms defined over F q .
Proof: We prove this proposition for ordinary curves and supersingular curves
E withj(E) E Fp. Indeed, let q = pV and letfV : E --+ Eq be the v-power of
the Frobenius morphismf = /po Since E is defined over F q , Eq = E and hence
JV E EndFq (E). If E is an ordinary curve thenJV .;. Z, since no NEZ is purely
inseparable. If E is defined over Fp then v = 1 andf .;. Z since its degree equals p
(the degree of NEZ equals N 2 ). For supersingular curves E withj(E) .;. Fp one
needs a slightly more elaborate argument. •
Therefore EndFq (E) contains an order in an imaginary quadratic field.
(i) (q,u) = 1;
(ii) q is a square and u = ±2.;q;
(iii) q is a square, p ct 1 mod (3), and u = ±.;q;
(iv) q is not a square, p = 2 or 3, and u = ±Vfiij;
(v) q is not a square and u = 0;
186 Chapter 7
(i) (q, u) = 1, lui '5:. 2,jq and GN:::::' 'lLII'lL x 'lLlm'lL, where mil andm I (u - 2);
(ii) q is a square, u = ±2,jq, and GN:::::' 'lLII'lL x 'lLII'lL, where 1= ,jq ± 1;
Z
I
p(z) = 2" + L
wEA\{O}
(I
(z-w )2 - I )
2
W
.
One shows that this series converges at all Z (j. A, thus giving a meromorphic
function having a double pole at the points of A, and which is elliptic. Its derivative
p/(Z) = -2 L (z-1)3
wEA W
is another elliptic function. If one adds, subtracts, multiplies, or divides two elliptic
functions with period lattice A, one gets another such function. Hence the elliptic
functions for a given A form a field.
Theorem 7.21. Thefield ofelliptic functions for given lattice A is generated over
C by the Weierstrass p-function and its derivative p'. They satisfy the algebraic
relation
where
Proof: See Lang [108, pp. 8-11], Husemoller [81, Ch. 9, §4], or Exercise 7.11..
Thus if we define a map cp : C -t JPl2 (q by sending Z H (p (z) , p' (z )) in affine
coordinates, we obtain a holomorphic map whose image lies inside the curve E
with the equation
v 2 = 4u 3 - g2 U - g3·
In fact, cp induces a bijection between Cj A and E, and E is non-singular, and hence
is an elliptic curve. Under this map the field of elliptic functions is identified with
the function field on the curve E. Thus for any elliptic function, we can speak of
its divisor L aj . Zj with Zj E Cj A.
Theorem 7.22. Given distinct points Z\, ... ,Zm E Cj A, and given integers
a\ , ... ,am, a necessary and sufficient condition that there exists an elliptic function
f with divisor (f) = L aj . Zj is that L aj = 0 and L aj . Zj = 0 in the group Cj A
and give E the group structure with origin xo, this says that rp(Z\) + rp(Z2) = rp(Z3)
in the group structure on E. In other words, rp gives a group isomorphism between
C/ A under addition, and E with the above-mentioned group law.
Theorem 7.24. Let T, T' be two complex numbers. Then J( T) = J( T') if and only
if there are integers a,b,c,d E Z with ad - bc = ±1 and
, aT+b
T=--.
cT+d
Furthermore, for any given T', there is a unique T with J ( T) = J (T') such that T
lies in the region F (fundamental domain) defined by
if ReT :::; 0
if ReT > 0
Proof: Since it is separable and a group homomorphism, its degree is the order
of the kernel, which is N2. •
Now we investigate the ring of endomorphisms End{E) of the elliptic curve E
determined by the elliptic functions with periods 1 and T.
Proposition 7.27. There is a one-to-one correspondence between endomorphisms
f E End(E) and complex numbers a E C such that aA ~ A This correspondence
gives an injective ring homomorphism ofEnd{E) to C
Proof: For given T we can determine End(E) as the set of all a E C such that
aA ~ A. A necessary and sufficient condition for aA ~ A is that there exist
integers a,b,l,m such that
If a E lR, then a E Z, and we see that End(E) nlR = Z. On the other hand, if E has
complex multiplication, then there is an a f/. Z, and in this case, b i- O. Eliminating
a from these equations, we find that
bT 2 +(a-m)T-I=0,
which shows that T lies in a quadratic extension of Q. Since T f/. lR,. it must
be an imaginary extension, so T E Q( vi -d) for some square-free d E Z, d 2: 1.
Eliminating T from the same equations, we find that
Corollary 7.29. There are only countably many values ofj E C for which the
corresponding elliptic curve E has complex multiplication.
Proof: Indeed, there are only countably many elements of all quadratic extensions
~Q •
For a more detailed treatment of the deep theory of elliptic curves we refer the
reader to Silverman and Tate [178], Husem611er [81], Koblitz [96], Lang [108] and
Silverman [177].
EXERCISES
7.l. Let the elliptic curve E be embedded in nn2 so as to have the equation v 2 = u(u-
1) (u - A). Show that any automorphism of E leaving Xo = (0, 1, 0) fixed is induced
by an automorphism ofjpZ coming from the automorphism of the affine (u, v)-plane
given by
ul =au+b, VI =CV.
Describe these automorphisms of jpZ explicitly and prove Theorem 7.15 for p =
chark =I- 2.
Elliptic Curves 191
Show that the j-invariant is a rational function of the ai with coefficients in IQ. In
particular, if the ai are all in some field ko C k, thenj E ko also. Furthermore, for
every Dl E ko there exists an elliptic curve defined over ko, with thej-invariant equal
to Dl.
7.3. Letf: E -+ E' be an isogeny of elliptic curves E and E' defined over an algebraically
closed field k. Show that:
7.4. LetE be an elliptic curve over a field k ofcharacteristicp > 0 andletp YN. Show that
EN ~ 'l./N'l. x 'l./N'l.. (Hint: Study the case of a prime N and then use induction
on the number of divisors of N.)
7.5. Letf be an isogeny of elliptic curves E and E' of degree N = N'N", where N' and
Nil are coprime. Show that there exist isogenies f' and f" such that f =f' .f" ,
degf' = N' and degf" = Nil.
7.6. Let E : v 2 + v = u3 + u and E' : v 2 + v = u3 be elliptic curves over F2. Show that:
(a) j(E) = j(E') = 0;
(b) E and E' are not isomorphic over F2 and F22;
(c) E and E' are not isomorphic over F24, but they are isomorphic over F28.
p(z)=2"+
z
1wEA\{O}
L (1(z-w)
3-2"
W
1)
converges absolutely and uniformly on any compact C such that en A = 0.
192 Chapter 7
7.10. Show that the Weierstrass p-function is an elliptic function with the period lattice
A which has a double pole at any w E A and no other poles. Show also that its
derivative
, ,,1
p (z) = -2 £.., ( )2
wEA z-w
is an odd elliptic function with the period lattice A which has a pole of order 3 at any
w E A and no other poles.
7.11. Prove Theorem 7.21. (Hint: Consider expansions p(z) and p'(z) into Laurent series
at the origin.)
7.12. LetE be an elliptic curve over 1(:, defined by the elliptic functions with periods 1 and
T. Let End(E) be the ring of endomorphisms of E. Show that:
_az+b
() -
yz --d.
cz+
193
194 Chapter 8
The group f(N) is called the principal congruence subgroup of level N. Clearly
feN) C f,(N) C fo(N) C f(I) for N > 1.
f(I)/f(N)":; SL2(Z/NZ)
and
fo(N)/f(N)":; { (~ a~') E f(I)}.
In particular, f(N) is normal in f(I) and f1 (N) is a normal subgroup offo(N).
Riemann Surface
Suppose that f is equipped with a discrete topology while H has the usual complex
topology. The space f\H is canonically equipped with a (non-compact) Riemann
Classical Modular Curves 195
Compactification
An essential deficiency of the Riemann surface r\H is that it is not compact.
A canonical compactification r\H* of the surface r\H is provided as follows.
Consider the set WI (Q) = Qu {oo} consisting of rational numbers and the symbol
00 (or i . 00 in other notations). Each element r E WI (Q) can be written in the form
r = lin, with l,n E Z, and for n = 0 we put r = 00.
The group f(l) acts naturally on WI (Q): ih=( ~ ~ ).then')l(r)=~;!~,
where ')1(00) = alc and ')I(r) = 00 if cr +d = O.
Let r\H* = (r\H) u (r\WI (Q)). From Proposition 8.1 it follows that the
factor set r\WI (Q) is finite. This set is called the set of cusps of the Riemann
surface r\H* (or of the group f).
Let us define a complex structure on r\H* such that its restriction to the open
subset f\H coincides with the one defined above, complex-analytic neighborhoods
of r E Q being open discs tangent to the line Irnz = 0 at r and neighborhoods of
00 being open half-planes of the form Imz > M.
Theorem 8.3. The set r\H* with the above complex structure is a connected
compact Riemann surface.
r
If is an arbitrary congruence subgroup of f( 1), then there exists a unique (up to
isomorphism) smooth projective curve Xr over C (see Springer [183]) such that
Xr considered as a Riemann surface is isomorphic to r\H*, and the Riemann
surface r\H is naturally isomorphic to the smooth affine curve Yr over Co We
call Xr and Yr modular curves. If r is a congruence subgroup of level N, then the
curvesXr and Yr are called modular curves oflevel N. We are mostly interested in
curves corresponding to the groups r = f(N), r = ro(N) and r = r l (N), which
are denoted by X(N), Y(N), Xo(N), Yo(N) and XI (N), YI (N), respectively.
196 Chapter 8
az+b )
'Y : z -+ ( cz + d .
Tm
1 (0 1)
= y'm -m 0 E SL2(lR.)
fo(N)\H* -+ f(I)\H*
g(N) = 1+ (N - 6)N2
24
II (1 _~) ;
piN P
if41N
otherwise.
Classical Modular Curves 197
if91N
otherwise
(here (-;1) = (-;3) = 0 and (~) for p 2: 3 denotes the quadratic residue
symbol);
Automorphic Functions
The elements of the function field onXr, or, in other terms, meromorphic functions
on the Riemann surface r\H*, can be considered as functions on H invariant
198 Chapter 8
under r such that their only singularities are poles. These functions are called
automorphic under r.
If r = r(1), so that r\H* ~ pI (C), then the field of automorphic functions
coincides with C(j), wherej = j(z) is thej-invariant of an elliptic curve Ez(C) =
C/ Az associated with the lattice Az = Z'z+Z (see Lang [108, p. 63]). Every such
function has a canonical Laurent series expansion in the neighborhood of infinity
which is called a t-expansion. Since T = (~ ~) E r( 1), for any J E C(j) and
z E HwehaveJ(T(z)) =J(z+ 1) =J(z). Ifnowt(z) = e 27riz , thent(T(z)) = t(z),
and we can take t = t(z) as a local parameter in a neighborhood of infinity.
A local expansion with respect to t of the function J on r\H* is called the
t-expansion ofJ:
n=-m
The theory of elliptic function yields the following result (see Lang [108, p. 45]):
Proposition 8.8. For the t-expansion oJj = j(z) we have
j(z) = t- I (I + n~ c(n)tn+I) ,
TN = (~ ~) E r(N),
so that for any automorphic function J with respect to the subgroup r we have
J(T N (z)) = J(z+N) = J(z). Therefore each suc4 function in a neighborhood of
infinity has at-expansion of the form
J(z) = I a(n)tn/N.
n=-m
Note that T E ro(N), and therefore the functions automorphic under r = ro(N)
have expansions in integral powers of t. The field oj automorphic Junctions under
To(N) has the following explicit description:
Theorem 8.9. Thefield oJJunctions automorphic under ro(N) coincides with the
field C(j(z),j(Nz)).
In particular the functionjN(z) = j(Nz) is invariant under ro(N). There exists
the canonical involution of the field C(j,jN) which corresponds to to the element
Tm E SL2(JR). This involutionpermutesj andjN.
Classical Modular Curves 199
Then
(iii) the polynomial <PN (j, u) is absolutely irreducible and <PN (j,jN) = 0;
Thus, the modular equation <PN (j, u) = 0 provides an affine model for the curve
Xo(N). Moreover, since <PN has integer coefficients, the curve Xo(N) is defined
over Q. Unfortunately, this model is highly singular.
Let'Y= (~ ~) Ef(I),letJ(z)beafunctiononH*=HUQUoowithvalues
in CU {oo}, and let k be an integer. We define the operator ['YJk by
Modular Functions
Letf(z) be a meromorphic function on H and let f c f(l) be a congruence
subgroup of level N, i.e., f:> f(N). We callf(z) a modular function of weight k
for f if
fo[ylk=f
for all y E f and if, for any a E f(l), the functionf(z) 0 [alk has a Fourier series
expansion at infinity of the fOnD
00
Proposition 8.11.
(i) Let f be a congruence subgroup of [(1), let a E GL2(Q), deta > 0, and
f' = (a-' fa) n f. Then f' is a congruence subgroup of [( 1), and the
map f t-+ f 0 [a]k = f(a(z))(cz + d)-k(deta)k/2 takes Mk(f) to Mk(f'),
and takes Sk(f) to Sk(f'). Inparticular, iff E Mk(f(l)) and g(z) = f(Nz),
then g E Mk(fo(N)) and one has g(oo) = f(oo), g(O) = N-kf(O).
(iv) for f = f(N): a pair (A, {T', T2}), where T" 1"2 E Cj A have the property
that every T E kAj A is of the form T = mT, +nT2, i.e., T" T2 form a basis
for the points of order N.
Given a lattice A, in general there will be several modular points of the form
(A, T), (A, G) or (A, {T, , 1"2} ). However, when N = 1, there is only one modular
point corresponding to each A, and we identify it with the modular point A for
f(1).
In each case (i) to (iv), we consider complex-valued functions F on the set of
modular points which are of weight k in the following sense. Ifwe scale a modular
202 ChapterS
(ii) F(AA, AT) = A-kF(A, T) for all modular points (A, T);
(iv) F(AA, {AT!, AT2}) = A-k F(A, {T!, Tz}) for all modular points (A, {T!, T2}).
For'YE (~ ~) Ef(l)wedefinetheactionof'YonfunctionsJ(z)bythe
Proof: We shall treat case (ii) and leave the other cases as exercises. Suppose
'Y E f! (N) and F is a weight k function on modular points (A, T). Let w = ( ~ )
az+b )
and 'YW = ( cz+d . We have
CZ+d) ,
J('Y(z))=F(Ay(z),I/N) = (cz+d) k F ( Ayw,---y;r-
because F has weight k. But A,\w = A z, (cz+d)/N =- I/N (mod Az) and hence
J('Y(z)) = (cz+dlF(Az, I/N) = (cz+d)kJ(z).
Classical Modular Curves 203
Heeke Operators
We now discuss the Hecke operators acting on modular forms of weight k for
fl(N). We could define them directly onf(z) in Mk(fl(N)). However, the
definition appears more natural when given in terms of the corresponding functions
F on modular points.
Let L (Q) denote the Q-vector space of formal finite linear combinations of
modular points, i.e., L(Q) = EBQLA, .. is the direct sum of infinitely many one-
dimensional spaces, one for each pair (A, T), where A is any lattice in C and
T E C/ A is any point of exact order N. A linear map T: L(Q) -+ L(Q) can be
given by describing the image
(8.2)
where the summation is over all lattices A' containing the A with index n such
that (A' , T) is a modular point (here for T E C/ A we still use the letter T to denote
the image of T modulo the larger lattice A'). In other words, A' / A c C/ A is a
subgroup of order n, and T must have exact order N modulo the larger lattice A'
as well as modulo A. The latter condition means that the only mUltiples ZT which
are in A' / A are the multiples m T which are in A. In the case N = 1 this condition
disappears, and we sum over all lattices A' with [A' : A] = n. The condition on T
204 Chapter 8
is also empty if (n,N) = 1. To see this, suppose that N'7 E N. Then the order
of N' 7 in N I A divides N (because N'N 7 E A) and divides n (because n is the
cardinality of N I A), and so divides (n, N) = 1. Thus N' 7 E A. Note that the sum
in (8.2) is finite and T\ is the identity map.
Next, for any positive integer n prime to N we define another linear map
Tn,n : L(Q) -+ L(Q) by
(8.3)
Since (n,N) = 1 then 7 has exact order N modulo ~A. Again we are using the
same letter 7 to denote an element in CI A and the corresponding element in CI ~ A.
It is easy to check the commutativity of the operators:
(8.4)
Proposition 8.14.
(i) If(m,n) = 1, then Tmn = TmTn; inparticuiar TmTn = TnTm.
(8.5)
Proof:
(i) In the sum (8.2) for Tn, the N correspond to certain subgroups G' of order mn
in ~n AI A, namely, those which have trivial intersection with the subgroup
1:7 c CIA. Since (m, n) = 1, it follows that any such G' has a unique
subgroup Gil of order n; if A" ::> A is the lattice corresponding to Gil, then
G'I Gil gives a subgroup of order m in ~ A" IA". Both Gil and G' IGil have
non-trivial intersection with 1:7. Conversely, given Gil = A" IA c ~ AI A
of order n and a subgroup G' = NIA" c ~ AI A of order m, where both
subgroups have trivial intersection with 1:7, we have a unique subgroup
NIA c ~n AI A of order mn with non-trivial intersection with 1:7. This
shows that the modular points that occur in
and in
(ii) By induction, it suffices to show that Tpa-l Tp = T; for a ~ 2. Let -r' = ~-r.
Then
Tpa-l (LA,"/") = p-a'L.LA',"J")
where the summation is over all A! :J A such that A! I A c p-a AI A has
order pa and does not contain -r'. Notice that A! I A must be cyclic, since
otherwise it would contain a (P,p)-subgroup ofp-aAI A. There is only one
such (P,p)-subgroup, namely j;A/A, and T' E j;A/A, sincepT' =NT E A.
Once we know that A! I A must be cyclic, we can use the same argument as
in part (i). Namely, for each A! that occurs in the sum for Tpa (LA,"/") there
is a unique cyclic subgroup of order p in A! I A; the corresponding lattice
A" occurs in the sum for Tp (LA,"/") and A' is one of the lattices that occur in
Tpa-l (L A","/"). This shows the equality in part (ii).
(iii) Since (P,N) = 1, the condition about the order of T in CIA is always
fulfilled. We have
where the first summation is over all lattices A" such that Gil = A" I A has
order p, and the second summation is over all A! such that G' = A! I A" has
order pa-l. On the other hand,
Tpa(LA,"/") = p-a'L.LA',"J")
A'
where the summation is over all A! such that A! I A has order pa. Clearly,
every A' in the inner sum for Tpa-l Tp is an A! of the form in the sum for
Tpa, and every A! in the latter sum is an A! of the form in the former sum.
But we must count how many different pairs A", A' in the double sum lead
to the same A!. First, if A! I A is cyclic, then there is only one possible A'.
But if A' I A is not cyclic, i.e., if A! I A:J j;AI A, then A" can be an arbitrary
lattice such that A" I A has order p. Since there are p + I such lattices, it
follows that there are p extra times that LA',"/" occurs in the double sum for
Tpa-l Tp. Thus,
A'-:J(I/p)A
[A':(I/p)A]=pa-2
But
Parts (ii) to (iii) of the proposition show that each Tpa is a polynomial in Tp and
Tp,p. From this and (8.4) we see that all of the Tn's commute with each other.
Thus, the operators Tm,m (m prime to N) and Tn generate a commutative algebra
A of linear maps from L ('01) to L ('01); actually, A is generated by the Tp,p (p )' N a
prime) and Tp (p any prime).
There is an elegant way to summarize the relations in Proposition 8.14 as formal
power series identities, where the coefficients of the power series are elements in
A. First, for piN, we have from (ii)
piN. (8.6)
p)'N. (8.7)
then we have a corresponding linear map (which we also denote T) on the vector
space of complex-valued functions on modular points: ToF(A, T) = La{n)F{xn).
For example:
where the last summation is over all modular points (A', T) such that [A' : A] = n,
as in (8.2).
This correspondence provides the following properties of operators [d], Tm,m,
and Tn (see Koblitz [96, p. 159]):
(denoted [d)f, Tm,mf, and T,J) in Mk{f l (N)). Iff is a cusp-form, then so are
[d)f, Tm,mf and T,J. Thus [d], Tm,m and Tn may be regarded as linear maps
on Mk (f I (N)) or on Sk (f I (N)). In this situation, let X be a Dirichlet character
modulo N. Thenf E Mk{N,X) ifand only if[d]F = X{d)F, i.e., ifand only if
We saw before (Proposition 8.12) that a functionf E Mk{f l (N)) can be written
as a sum of functions in Mk{N,X) for different Dirichlet character x. Thus, using
the one-to-one correspondence in Proposition 8.13 we can write a modular form
F(A, T) as a direct sum of F's which satisfy (8.9) for various X.
Proposition 8.16. The operators Tm,m and Tn commute with [d], and preserve the
space ofF{A,T) of weight k which satisfies (8.9). IfF{A,T) has weight k and
satisfies (8.9) then Tm,mF = mk- 2x(m)F.
Proof: That the operators commute follows directly from the definitions. Next,
if [d]F = X{d)F, it follows that [d]TnF = Tn [d]F = X(d)TnF and [d]Tm,mF =
Tm,m[d]F = X(d)Tm,mF. It is a fact of linear algebra that the eigenspace for
an operator [d] with a given eigenvalue is preserved under any operator which
commutes with [d]. Finally, if F{A,T) satisfies (8.9), then
Corollary 8.18. Tm,m and Tn preserve Mk(fo(N)), and also Sk(fo(N)). For
J E Mk (fo (N)) the action oJTm,m is given by Tm,,J = mk- 2f.
Now we examine the effect of the Hecke operators Tm on the t-expansion at
00of a modular fonnJ(z) E Mk(N,X), that is, if we writeJ(z) = I:a(n)tn and
T,J(z) = I:b(n)t n, t = e27riz , we want to express b(n) in tenns of the a(n).
IfJ E q[t]],J = I:a(n)tn, we define
Proof: We have
Tp.[(z) = ~ t;F (AI, ~),
where F is the function on modular points which corresponds to J and the sum is
over all lattices A' containing Az with index p such that 1/N has order N modulo
A'. Such A' are contained in the lattice ~ Az generated by ~ and ~, and the lattices
of index p are in one-to-one correspondence with the projective line plover the
finite field Fp = Z/pZ withp elements. Namely, the point in pI with homogeneous
coordinates (a, b) corresponds to the lattice generated by Az and (az + b) / p. Thus,
there are p + 1 possible A' corresponding to (1 J) forj = 0, 1, ... ,p - 1 and (0, 1).
Ifp 'IN, then allp+ 1 of these lattices A' are included; ifp IN, then the last lattice
must be omitted, since -k has order ~ in that case. Note that the lattice generated
by Az and (z+j)/p is A(z+j)/p. Thus, ifp I N we have
TP/(z) = -I ~
-
P I
F ( A(z+j)fp, N = -I1) p
-
~J
I
(z+j)
- = Up.[(z).
p j=o P j=O P
Classical Modular Curves 209
Ifp YN, then we have the same sum plus one additional tenn corresponding to the
lattice generated by Az and ~; this lattice is ~ Apz. Thus, in that case
(the proof is exactly like the proof of the Euler product fonnula for the Riemann
zeta-function). From Proposition 8.17 we see that
(8.10)
n=1 p
therefore the operators Tn on Mk(N,X) satisfY the following fonnal power series
identity
n=l p
210 Chapter 8
or, equivalently
Tn = LX(d)dk- 1 Vd 0 Un/d. (8.11)
din
T"J(z) = L b(n)tn,
n=O
then
ben) = L x(d)dk- 1a(mn/d 2 ). (8.12)
dl(m,n)
If we set r = d 2 n/m, the inner sum becomes Ia(rm/d 2 )tr with the sum taken
over all r divisible by d. Replacing r by n and gathering together coefficients of
tn, we obtain the expression (8.12) for nth coefficient. •
Most of the important examples of modular forms turn out to be eigenforms for
the action of all of the Tm on the given space of modular forms. Iff E Mk(N, X) is
such an eigenform. then we can conclude a lot of information about its t -expansion
coefficients.
Theorem 8.22. Letf(z) E Mk(N,X). Assume thatf is a non-zero eigenformfor
all of the operators Tm, m = 1,2 .... Let
T"J = A"J,
and 00
fez) = L a(m)tm.
m=O
Then
(i) a(m) = Ama(l);
(ii) ifk i- 0 andf is not a constantfunction, then a(l) i- 0;
(iii) if a(O) i- 0, then
Am = LX(d)dk- l •
dim
Classical Modular Curves 211
Proof: Using (8.12) with n = 1, we find that the coefficient of the first power. of
t in T"J' is a (m ). If T"J' = A"J', then this coefficient is also equal to Am a( 1). This
proves the first assertion. Ifwehad a(l) = 0, then it would followthatalla(m) = 0
andJ would be a constant. Finally, suppose that a(O) =I- O. If we compare the
constant terms in T"J' = A"J' and use (8.12) with n = 0 we obtain
Ama(O) = b(O) =L X(d)dk-1a(O).
diM
Differential Forms
Let X be a compact Riemann surface. We let O[X] be the complex space of regular
differential forms on X. It has dimension g, where g is the genus of X. Suppose
that X = Xr for some congruence subgroup f ~ f( I), and let w E O[X]. Under
the map
Tr:H-tf\HCX
we can take the pullback w* = Tr* (w). Then w* is a holomorphic differential form
on H, which can be written in the form
w* =J(z)dz
with some holomorphic function J on H. For'Y E f we have (by abuse of notation,
we write w instead of w*)
w 0 'Y = J( 'Y(z))(cz+d)-2dz,
and hence
JO['Y12=J·
Furthermore, le~ TN = (~ ~) where N is the ramification index of f at
infinity (see Exercise 8.7). Since w is invariant under TN we can write
J(z) = La(n)tn/N = La (n)e21Tinz/N.
212 Chapter 8
N d(t I/N )
f(z)dz = f(z) 2'Tf'i tl/ N
Let r be a cusp for f, and let a E f(l) be such that a(r) = 00. Then
woa-I(z) =g(z)dz
for some holomorphic function g(z) on H. The same analysis as above shows that
co
g(z) = L b(n)tn/N.
n=1
Therefore we have:
Theorem 8.23. The map to which each regular differential form w = f(z)dz on
Xr associates the function f, is an isomorphism between n[xrJ and the space
S2 (f) ofcusp-forms of weight 2 with respect to f.
Letf(z) and g(z) be two functions in Mk(f) and let GLi(Q) be the subgroup of
GL2 (Q) consisting of matrices with positive determinants. If f is a congruence
r r
subgroup of f( 1) we denote by its projectivization, i.e., = f· {± 1} / {± 1}.
We consider the functionf(z)g(z)yk, where the bar denotes complex conju-
gation andy = Imz. If we replace the variable z by a(z) for a E GLi(Q), we
obtain
It follows immediately from this definition that /f,g) is linear inf and anti-
linear in g (i.e., /f,cg) = c/f,g)), it is antisymmetric (i.e., (g,f) = /f,g)), and
/f,f) > 0 forf =I O.
Ifz = x + iy, we can write
dxl\dy i -2 _
y2 = "2(lmz) dz I\dz,
and then
- kdxdy - k 1 i
f(z)g(z)y - 2 = f(z)g(z) (Imz) - ( )2 -dz I\dz.
y Imz 2
In particular, for k = 2, when we can identify a cusp-fonn of weight 2 and a
regular differential on Xr, if we put
w = f(z )dz and TJ = g(z )dz,
then the inner product can be written in the fonn
1 (i
/f,g) = (w, TJ) = [[(1) : fl lXr "2w 1\ Tj.
Of course, we must verify that the Peters son inner product makes sense. In
other words, we shall verify that
(i) the integral is independent of the choice of fundamental domain;
(ii) it converges absolutely;
(iii) it is independent of the choice off.
Proposition 8.24. The integral in (8.13) is absolutely convergent, and does not
depend on the choice of F. If f' is another congruence subgroup such that
f, g E Mk(f'), then the definition of /f,g) is independent of whether f, g are
considered in Mk(f) or in Mk(f').
Proof: See Koblitz [96, p. 170] or Lang [110, p. 38]. •
Now we note that if a E GL{ (Ql) then a can be multiplied by a positive scalar
without affecting [ak So without loss of generality we shall assume in what
follows that a = (~ ~) has integer entries. Let D = ad - bc = det a, and set
Proposition 8.25. Letj, g E Mk(f) withj orga cusp-form, and leta E GLi(Q).
Then
(iii) the inner product if 0 [a]k,g) depends only on the double coset faf oj a
modulo f.
ifo[a]k,go[alk) = 1 I"
[[(1) : (a- I a)]
1a-IF'
j(z) 0 [a]kg(z) 0 [a]kY kdxdy
-2
y
1 r - kdxdy
= [[(1) : (a-II" a)] JF'j(z)g(z)y y'
The first part of the proposition now follows from the fact that [f( 1) : (a- I I" a)] =
[f(I) : r].
The second part follows from the first, and the fact that go [a']k = go [a-Ilk-
The third part is obvious from (ii), since
for'Y E f. -
A Basis of Eigenforms
From the Riemann-Roch theorem it follows that for any congruence subgroup f
and any integer k, the <C-vector space Mk(f) is finite dimensional (see Shimura
[176, §2.6]). The Petersson inner product gives a Hermitian inner product on
the finite dimensional <C-vector space Sk(f). Thus if,g) is linear in j (i.e.,
(cj,g) = cif,g)) and anti-linear in g (i.e., if, cg) = cif,g)). It is anti-symmetric
(i.e., (g,j) = if,g)), and if,j) > 0 forj =1= o.
Proof:
(cnT,/,g) =cn(T'/,g)
= cnX(n)Ij,Tng)
= CnC~ if, Tng)
=cnif,Tng)
= lj,cnTng),
as claimed. •
We saw before that the eigenfonns for the Tn have nice properties: the coeffi-
cients can be expressed in tenns ofthe eigenvalues for the Tn, and the corresponding
Dirichlet series have Euler products. Because of Proposition 8.27, it is possible to
find a basis of such fonns.
Theorem S.2S. There exists a basis ofe-vector space Sk(N,X) whose elements
are eigenformsfor all of the Tn for which (n,N) = 1.
Proof: For any fixed Tn with (n,N) = I and any subspace S c Sk(N,X) which is
preserved by Tn, there exists a basis of S consisting of eigenfonns of Tn. To see
this we consider the Hennitian operator CnTn and apply to it the following general
fact: given any Hennitian operator T on a finite dimensional vector space, there
exists a basis of the space consisting of eigenvectors for T. We further note that
any eigenspace for Tn is preserved by all Tn" which follows from the fact that Tn
and Tn' commute: if Tn = A,/, then Tn (Tn,f) = Tn' (T,/) = An Tnt/· Thus, to prove
the theorem we list the Tn for n prime to N. We write Sk(N,X) as a direct sum of
eigenspaces S for the first Tn in the list. Then we write each S as a direct sum of
eigenspaces for the next Tn; then we write each one of those spaces as a sum of
eigenspaces for the third Tn; and so on. Because Sk(N,X) is finite dimensional,
after finitely many steps this process must stop giving us any new smaller spaces.
At that point Sk (N, X) is expressed as a direct sum of subspaces on each of which
the Tn for n prime to N acts as a scalar. Any basis consisting of fonns in these
subspaces will satisfY the requirements of the theorem. •
EXERCISES
8.1. Prove that for any integer N 2:: 1 the map SL2 (Z) -+ SL2 (Zj NZ) obtained by reduc-
ing the matrix entries modulo N is a surjective group homomorphism.
8.2. Let N = pf' ... pf; be the prime factorization of the positive integer N. Show that
the reductions modulo P( , 1 ~ i ~ s, give isomorphisms
GL2(ZjNZ) ~ IT GL2(Zjpf'Z)
i
216 Chapter 8
and
SLz(Z/NZ) ~ nSLz(Z/pfiZ).
i
8.3. Find the indices WI (N) : r(N)], [ro(N) : r l (N)] and [ro(N) : r(N)].
8.4. Let r(1) = u7=1 air be a disjoint union of n cosets air, where r is a subgroup
of index n in r(1), and let F(1) be a fundamental domain for r(1). Show that
F = U7=1 a i- I F(I) is ajundamental domain for r.
8.5. Let {ai} be a complete set of coset representatives for r in r( 1), where r is a
subgroup of finite index in r(1). Show that the cusps of r are among the set
{aj- I (oo)}, but that at I (00) and aj - I (00) are r -equivalent if and only if there exists
n E Z such that
aj -IT naj E -r.
8.6. Letp be a prime number. Prove that ro(p) has two cusps 00 and 0; and that ro(pZ)
hasp + 1 cusps: 00,0, and -I/mp for m = 1,2, .. . ,p - 1.
8.7. Let r be a congruence subgroup of r(1) of level N, and denote rs =
{y E fI y(s) = s} for s E QW {oo}. Let s = a-I (00), a E r(I). Prove that:
(a) arsa- I = (ara- I )00;
(b) there exists a unique positive integer I (called the ramification index of rat s)
such that
(i) in the case -1 E r
rs = ±a-I{T1m}mEza,
(ii) in the case -1 rf r either
rs=a-I{T1m}mEZa, or rs=a-I{(-TI)m}mEZa;
8.12. Let f C f(l) be a congruence subgroup andF be a fundamental domain for f. Show
that
(a) the integral
lL(f) = r dx~y
iF y
converges and is independent of the choice of F;
(b) [1'(1): f] = lL(f)/IL(f(I));
(c) if a E GLi(Q) and (a-Ifa) c f, then [f(I): f] = [f(l): (a-Ira)].
Chapter 9
The existence of good codes coming from classical modular curves is substantiated
by the following three phenomena:
(i) the existence of modular curves, i.e., curves whose points have an interpre-
tation as modular points;
(iii) the Eichler-Selberg trace formula, which computes the trace of a Hecke
operator on the space of modular forms.
We give a sketch of these ideas but must refer to the literature for more
detailed treatment of deep arithmetical theory of modular curves (see Gunning
[69], Koblitz [96], Lang [108, 110], Miyake [127], and especially Eichler [38] and
Shimura [176]).
The most important property of the curve Xo (N) is the fact, first proved by Igusa,
that Xo(N) has a non-singular projective model which is defined by an equation
over Q whose reductions modulo primes p, (P,N) = 1, are also non-singular.
219
220 Chapter 9
Reduction of a Curve
Let An (k) be n-dimensional affine space over a field k, and C) a valuation ring in
k with the unique maximal ideal m. Let us assume that the quotient field of C)
coincides with k. Consider the residue field k = C) 1m and denote by x the image
of x E C) under the canonical map C) -+ k. Now we define the reduction map
C)n -+ An (k) by (X\, ... ,xn) H (X\, ... ,xn), and extend it to the whole space An (k)
as follows:
Let X C An be a smooth affine curve over a field k. Denote by a(X) the ideal
of X in the ring k[T) = k[T\, ... , Tn) and assume that F\, ... ,Fr form a basis of
a(X) in k[T), so that a(X) = (F\, ... ,Fr). Since the quotient field of C) is k, we
can assume that all coefficients of the polynomials Fi lie in C) and each Fi has at
least one coefficient which does not lie in m. Thus for every i = 1,2 ... , r, the
polynomial Fi E k[T) obtained fromFi by reduction modulo m of its coefficients,
does not vanish in the ring k[T\, ... , Tn). Let us consider a closed subset X in
An (k) which is defined by the system of equations F\ (T) = ... = Fr(T) = O.
One can show that there exists a basis F\, ... ,Fr of a(X) such that dimX = l.
The set X is called a reduction of X modulo m. It should be pointed out that X
depends on the choice of basis in a(X); in general, another choice of basis can give
another reduction X', which is not isomorphic to X; moreover X' can be reducible
while X is irreducible. If X is an irreducible smooth curve, we say that X has a
good reduction modulo m. An affine curve X may possess non-isomorphic good
reductions X and X'.
Now if X is a smooth projective curve we can define its reduction choosing
a basis (F\, ... ,Fs) in the ideal a (X) of homogeneous polynomials vanishing on
X. For a smooth projective curve one can prove that its good reduction is unique
(up to an isomorphism). Thus good reduction of a smooth projective curve is
well-defined.
Let X and Y be smooth projective curves having good reductions modulo m,
and let cp : X -+ Y be a morphism. We can define its reduction ip : X -+ Y as
follows. Let r be the graph of cp, i.e., the set
{FI, ... ,Fr } in a(X) ensuring the existence of good reduction. Next, if X has no
good reduction it is not clear how to define "correct" reduction modulo m. Note
that the above definition of reduction of a morphism is also not satisfactory for its
use in practice.
The theory of schemes gives an adequate technique to study the above ques-
tions, but this assumes a fairly thorough acquaintance with algebraic varieties over
arbitrary commutative rings. We describe some aspects of the reduction process
in connection with the special case of modular curves without any reference to the
theory of schemes. The fundamental fact which will be used in the future is that
modular curves possess an interpretation as moduli varieties of elliptic curves with
additional structures.
Moduli Spaces
To get more moduli spaces of elliptic curves over a field k which is not algebraically
closed, we consider pairs (E, GN), where E is an elliptic curve and GN a cyclic
subgroup of order N in E.
Under the action
the orbits of the group f( 1) can be identified with the points in the fundamental
domain F = f(I)\H (see Theorem 7.24). This domain can be made into a
compact Riemann surface F* of genus g = 0 by adding the point 00 and prescribing
that any subset FM of F of the form FM = {z E F I Imz > M} shall be an open
neighborhood of 00. This space is an algebraic curve over C whose field of
meromorphic functions is isomorphic to that of the projective line pi (q. This
correspondence gives the equality
and also permits us to think of pi (q as the parameter space for the family of
isomorphism classes of elliptic curves over C, with the understanding that the
point 00 corresponds to the singular cubic curve v 2 = 4u 3 . Let us try to clarify
the situation. For our purpose it is convenient to think of an elliptic curve over
C as a one-dimensional torus, i.e., quotient space of C by a lattice Az = 'k + Z:
Ez = C/ Az , where z E H. The Weierstrass function p(Z') provides an analytic
isomorphism C/ Az .:t Ez between Ez and the Weierstrass model
Ez.. V2 = 4 u2 -g2u-g3.
is the fact that two elliptic curves E and E' withj(E) = j(E') are isomorphic. The
isomorphism need not be defined over the field which contains the coefficients
g2,g3. Each point z E F then defines an elliptic curve Ez with finitej-invariant
j(z). To make the correspondence complete we observe that if a value ofj = j(z) is
given it is possible to write an equation for a representative curve in the isomorphic
class of elliptic curves with the givenj-invariant, namely:
As the pointz E H approaches the point 00, the value ofj(z) becomes unbounded,
thus reflecting the fact that Ez degenerates into a plane cubic curve with singularity
for which a model can be chosen to be v 2 = 4u 3 .
To recapitulate what we have just said, we observe that two points z and z' in
H related by a transformation z' = y(z), Y E f( 1), give rise to two elliptic curves
Ez and EZI which are birationally isomorphic, i.e., they have isomorphic function
fields. On the other hand, the functionj(z) on H is invariant under f(I) and its
value characterizes the isomorphism class of elliptic curve Ez . The projective line
]p'1 (C) with the field of rational functions <C(j) as its function field, can now be
considered as a parameter space for the isomorphism class of elliptic curves over
<C; in this sense we shall refer to ]p'1 (C) = Xo( 1) simply as thej-line.
Let us now describe the moduli spaces corresponding to the congruence sub-
groups of f(l); this will correspond to a finite algebraic extension of the function
field <C(j) on the j-line which possesses many pleasant properties. Let N be a pos-
itive integer and E = <C/ Az be an elliptic curve with period lattice Az = Zz + Z,
z E H. Let GN be a cyclic subgroup of E of order N, which without loss of
generality we may identify with integral multiples of 1/N modulo Az . Under
the action of y = (: ~) E fo(N) the lattice Az is mapped to the lattice
Az, = (az + b )Z+ (cz + d)Z; the generator of GN is mapped to
A useful set of representatives for the orbits f 0(N) \IP'I (Q) is described as
follows. Consider the set of all pairs {c, d} of positive integers satisfying
(c, d) = 1, diN, 0<c < N I d.
If fs is the stabilizer of a cusp s in fo(N)\1P'1 (Q), then the cardinality of
fo(N)\Jll>1 (Q) is the same as the cardinality of the double cosets fs \f(I)/fo(N).
Ifwe take for s the cusp 0, this cardinality is simply the number of the pairs {c,d}
modulotheequivalence: {c,d}rv{c',d'}whentherearematricesy= ( : ~)
and y' = (:; ~:) in f( 1) satisfying y' = y (~ ~) for some m E Z. Thus
for a fixed d the number of inequivalent cusps is cp((d,N Id)), and hence the num-
ber of cusps for f 0 (N) is
Voo = L cp((d,N Id)).
diN
Now the point x is left fixed by y if and only if c == 0 mod (N) and d == 1 mod (N).
This is precisely the condition that y be an element of the congruence subgroup
f I (N). The orbits of points in H under f I (N) correspond to isomorphism classes
of pairs (E ,x) consisting of an elliptic curveE together with a point x E E of order
N. The quotient YI (N) = fl (N)\H can also be compactified to obtain a compact
Riemann surface
As we have seen, the curve Yo(N) is in a natural bijection with the set of is omor-
phism classes of pairs (E, GN) consisting of a complex elliptic curve E = CI Az
together with a fixed subgroup GN of E of order N (clearly GN ~ EN, where
EN = -it Az I Az ~ 'lL1 N'lL x 'lL1 N'lL is the group of points of order N on E).
To state a similar result for Y(N) ~ f(N) \H we need some technical construc-
tions with the group EN, one of which is the Weil pairing eN(x,y). Let UN be the
cyclic group of complex Nth roots of 1, and x ,y be points of EN, corresponding
to pairs (s,t), (u, v) E 'lLIN'lL x 'lLIN'lL. The skew-symmetric Wei! pairing
is defined by
eN(X,y) = exp(2'7Ti(sv - tu)IN).
A level N -structure on E is simply an isomorphism
with det aN = 1. If such a level N -structure aN is given, let '(aN) denote the
primitive Nth root of 1:
The curve Y (N) is in a natural bijection with isomorphism classes of elliptic curves
over C endowed with a level N -structure aN satisfying '( aN) = e2wi / N. In other
words, the modular curve Y(N) can be interpreted as a moduli variety over C.
The Igusa theorem (Igusa [82]) provides a sort of universal model for the
parameter space of the families of elliptic curves with a level N -structure. As
a preliminary to Igusa's main result on the modular curves Xo(N) and X(N) we
need a thorough study of the Galois theory of the corresponding fields of elliptic
functions over a field of characteristic p 2:: 0 (see Lang [108] for more detailed
treatment). This requires some extension of the Weil pairing eN(x,y) considered
above.
Weil Pairing
Let E be an elliptic curve defined over a field of characteristic p 2:: O. Let Xo be the
origin for the group law on E and N be a positive integer relatively prime to char k.
To construct the Weil pairing in this general situation one needs the following
version of Theorem 7.22.
Proposition 9.1. Let Xl, ..• ,Xs be points ofE and ai, ... ,as be integers satisfying
Reductions of Modular Curves 225
(i) L7=1 ai = 0;
(ii) L7=1 aiXi = xo, with addition in group law on E.
Then
s
D = Lai· (Xi)
i=1
is a principal divisor on E.
Proof: Since the assertion involves only a finite number of points we may assume
without loss of generality that the ground field k is algebraically closed. We have
seen, as a consequence of the Riemann-Roch theorem that the map X I-t L (x - xo)
induces a bijection between the set of points of E and the group Pico(E). Thus the
set of points on E forms a group, with Xo as the zero element. Moreover, x +y = z
if and only if the corresponding divisors, on E are equivalent: (x + y) rv (z + xo).
Since D = Lax· (Xi) is the divisor of degree zero, the proposition follows now
from this group structure on E. •
Let EN be the group of points of order N on E and UN be the cyclic group
of roots of 1 in the algebraic closure of k. If we use the above proposition, the
skew-symmetric Wei! pairing eN (x ,y):
(x,y) -+ eN(x,y),
is constructed as follows. Fix a point y E EN. Since the divisor N . (y) - N . (xo)
of degree zero satisfies the hypothesis of the proposition, it is principal, say
Clearly we have
(?)= LN.(x+z)- LN.(x).
XEEN xEEN
Since x +z is a zero ofj(Nt) and x is a pole ofj(Nt) we see that
L N·(x+z)- L N·(x)=(f(Nt))
xEE(N) XEE(N)
and thus, up to a constant factor, we have ~ (t) = j(Nt) for any tEE. Since
x E EN then ~ (x + t) = ~ (t) and hence
g(x+t) = eN (x,y)g(t)
226 Chapter 9
These functions are defined over k and have the property that if a : E -+ E' is an
isomorphism then h~ = h~ 0 a for i = 1,2,3. In particular two points x and y on
E satisfy
hE(X) = hE(y)
ifand only ifx = a(y) for some automorphism a E Aut(E) (see Lang [lOS, p. 20]).
In describing the Galois theory of elliptic function fields, we consider a ground
field k and variable e1ementj over k. Representative elements of the isomorphism
classes of elliptic curves defined over k(j) withj as their j-invariant are given as
follows: in chark =I- 2,3 we take the projective model
E: v 2w = 4u 3 -27j(j -172S)-I(u+w)w2,
These curves are non-singular and the group law can be defined with reference
to the point Xo = (0 : 1 : 0) as origin. Ifj is allowed to take special values then
Reductions of Modular Curves 227
Gal(k(j,EN)/kN(j)) ~ SL2(Z/NZ),
Gal(k(j,EN)/k(j)) ~ GL2(Z/NZ)
and
j-invariant of the elliptic curve E / (y), where (y) is the cyclic group generated by
y. The numberjN is algebraic over Q(j) and the field
L = Q(j,jN ,h(x))
Q(j,h(EN ))
Q(j)
(Jp = {f(j) I
E Q(j) vp (f(j)) ~ O} .
Let j be transcendental over the finite field Fp. Then (Jp is the localization of
the ring Z[;] with respect to the prime ideal which defines the reduction map
Z[;] --+ Fp[;].
The following two propositions, which we state without proof, are essential
ingredients in the Igusa construction of the smooth model for the field L. First
recall that the integral closure of a ring A contained in a field k is the ring of all
elements in k which are roots of monic polynomials inA[t].
Proposition 9.3. Ifp does not divide the level N, the integral closure (J oJ(Jp in
L is an unramified valuation ring.
To state the second result we need some notation. Let R be the integral closure
ofZ[;] inL. By picking a basis WI,.'" Ws of Lover Q(j), with discriminant d and
observing that as Z[;]-module R is contained in Z[;][WI /d, ... ,ws/dj, we obtain
from the Noetherian property of Z[;] that R is finitely generated over Z[;]. Also
Reductions of Modular Curves 229
let R* be the integral closure ofZ[l/i]; for reasons similar to those above, R* is
a finitely generated module overZ[l/i]. As Z~] and Z[l/j] are contained in the
valuation ring <9p, both Rand R* are contained in the integral closure <9 of <9p in
L. If P f N, Proposition 9.3 guarantees that <9 is a local ring with maximal ideal
m = p<9. In fact, it is easily verified using a theorem of Krull that m nR = pRo
Similarly we get m n R* = pR*; thus pR and pR* are prime ideals respectively
of Rand R*. Let k be the residue class field of <9, i.e., k = <9/m. Let Rand
R* be the images of Rand R* under the residue homomorphism <9 -+ k = <9/m,
i.e., R = RIpR and R* = R* IpR*. The residue class field k is exactly the field of
quotients of R and R*; the integral domain Rand R* are integrally closed in k.
Proposition 9.4. The residue class field k = <9/m is a regular extension of Fp,
that is Fp is algebraically closed in k and k is a separable extension. Furthermore
the compositum of k and the algebraic closure Fp of Fp is the field of modular
functions of level N in characteristic p.
The so-called Kronecker model of the field of modular functions of level N is
the union V of all discrete valuation rings of Rand R*. The union U of all discrete
valuation rings ofZ~] and Z[I/}] is the universal projective line and V is in some
sense "the derived normal model" of U in L. If the discrete valuation rings of
V are classified according to the characteristic of the corresponding residue class
fields, the model V is the disjoint union of local models, each parameterized by
primes ofZ:
V= Vo+ }2Vp,
p
The ring theoretic properties of Rand R* can now be translated into the
geometric properties of X. First of all X is non-singular. In fact the projection
of X on the hyperplane at infinity in F' corresponding to the first coordinate is
an affine curve with coordinate ring R ®z Q which is the integral closure of Q~]
in L. This affine curve is normal over Q and hence non-singular. Similarly the
projection ofX on the hyperplane in infinity inF' corresponding to the (m + 1)-st
coordinate is an affine curve with coordinate ring R* ®z Q which is the integral
closure of Q[ I /j] in L. The resulting affine curve is also normal over Q and hence
non-singular. These two affine curves, which provide a complete open cover for
X, are fixed in the discussion which follows. The crucial property ofX is the claim
that for a prime p which does not divide N, the reduction Xp of X with respect to
p remains non-singular. As indicated in Proposition 9.4, pR is a prime ideal of R
and the field of quotients k of R = R / pR is regular over Fp. This implies that the
reduction of one open set of X with respect to p is an irreducible affine curve over
Fp. Now, since Fp[Rm] = Rand R is integrally closed in k, this curve is normal
over Fp and hence non-singular. Similarly the reduction of the other open set of
X with respect to p is non-singular. These two affine curves determine a complete
open cover of Xp, and hence the curve Xp is non-singular. The following is the
fundamental theorem in the theory of elliptic modular functions.
Theorem 9.5 (the Igusa theorem). There exists a non-singular projective model
X of the field Lover Q such that the local model Vp is the union of the discrete
valuation rings corresponding to the closed points ofits reduction Xp with respect
to p. If p f N then Xp is a non-singular projective model of the field k = 1:)/pI:)
over Fp.
Using some simple properties of the binomial coefficients, one can prove the
following property of polynomial <PN (;, n) defined earlier in Section 8.2.
Reductions of Modular Curves 231
supersingular point
,-
P(l;: )
,-
P(~)
Figure 9.1.
i.e., the union of two projective lines (see Fig. 9.1). The singularities are ordinary
double points, all defined over Fp 2. A more refined analysis using the moduli
interpretation of Xo (P) reveals that the singularities actually occur at the points
corresponding to supersingular elliptic curves.
On the curve Xo(N) over Q the space !lo(N) of regular differential forms is
isomorphic to the C-vector space S2(ro(N)) of holomorphic cusp-forms on H
of weight 2 for the group ro(N). Hence S2(ro(N)) is g-dimensional space over
C, where g = go(N) is genus of Xo(N). Such forms are defined by functions
f : H -t C which are holomorphic, vanish about all cusps, and under the action
'Y E (~ ~) E ro(N) satisfy
aZ+b) 2
f ( cz+d = (cz+d) fez).
The isomorphism is given byf(z) t--+ w =fez )dz. Lett = e2'friz be a local parameter
about the cusp at infinity and let
00
fez) = L a(n)tn
n=!
232 Chapter 9
The operator Tp leaves invariant the space S2(fo(N)) and hence Oo(N).
The 19usa theorem for Xo(N) shows that for a prime p,p %N, the space of
regular differential fonns on the reduction Xo(N) of Xo(N) modulo p is obtained
by reducing modulo p the corresponding space for Xo(N). This theorem in fact
provides the existence of a basis for Oo(N) whose t-expansions have rational
coefficients in Z[I/N]; the reduction of these fonns provide a basis for the regular
differential fonns on Xo(N). Using this procedure it makes sense to reduce the
Hecke operator Tp modulo p. The resulting operator Tp turns out to be the
Frobenius operator on Xo(N) acting on the space of regular differential fonns.
This is basically the content of the congruence relation; a proof of this result,
which is originally due to Eichler, can be based on the congruence relation
where tr Tp is the trace ofthe Hecke operator Tp acting on the space ofcusp-forms
of weight 2for fo(N).
We would like to add a few words about the proof of this important theorem.
Our brief discussion is intended to suggest only how the main line of the argument
runs and we invite the interested reader to consult the Eichler paper [37] and
Shimura's book [176, Ch. 7].
To begin with we start with a prime p which does not divide the level N. As
implied by Igusa theorem, the physical appearance of the curve Xo (N) is the same
over the complex numbers as over the algebraic closure Fp of Fp.
We can interpret the Hecke operator Tp in the following way. Consider the
product Xo(N) x Xo(N). Its points correspond to pairs (E, GN), where E is an
elliptic curve and GN is a cyclic subgroup of order N. We consider in this product
the correspondence (algebraic curve) ~p consisting of all pairs ((E, GN), (E', Gfv ))
with E' = E / G, Gfv being the image of GN in E / G, and G being a subgroup of E
Reductions of Modular Curves 233
The Frobenius correspondence J takes x into the point y associated to the elliptic
curve E(P) obtained from E by raising the coefficients to the pth power; that is to
say, E(P) is an elliptic curve in the isomorphism class detennined by the j -invariant
jP; thus
J(x) = y +-+ E(P) +-+ jP.
We also needJ', which is the dual correspondence. This is defined as
Thus we have
x ++ E(z) = C/(Zz +Z) ++ j =j(z).
Since c(n) are integers, a simple extension of the Fennat little theorem to the ring
of formal power series in t over Z gives the congruence
with the invariant j (pz) associated to the isomorphism class of the elliptic curve
E(pz) = C/(Zpz +Z). This then suggests the congruence
where the bar means that the equation for E(pz) has been reduced modulo p.
This is not always possible and requires thatj-invariant of E does not lie in Fp2
(in the case of supersingular values j E Fp2 the reducing is carried out modulo a
prime ideal dividingp in the unique quadratic subfield KN ofQ(e21Ti / N ».Again
a straightforward application of the Fennat little theorem gives the congruence
The map
quite plausible. To obtain the actual equality claimed in the theorem it is necessary
first to represent correspondencesf,f* and Tp on the vector space of characteristic
zero and then to take traces, using the fact that the fixed points of the Frobenius
morphism acting on Xo(N)(Fp) correspond to those which have coordinates in
Fp. IfJi , ... ,/g is a basis of common normalized eigenforms in S2 (f0 (N)) for all
Hecke operators Tn with (n,N) = 1, and if
00
we have Tp./i = aj (P)Ji, and a representation of the Frobenius and Hecke corre-
spondence on the same vector space over a field of characteristic zero leads, via the
congruence relation, to the following identity for the zeta-function Z (Xo (N), u) of
the curveXo(N) defined over Fp
where
Therefore, if
then
Setting now
TI =2id,
T;v = Tpv - pTpv-2,
236 Chapter 9
we obtain
Corollary 9.9. With notation as above, and p a prime which does not divide N
we have
Now we give an explicit formula for tr T;v. This will be based on the following
special case of the Hijikata version [76] (see also Miyake [127, §6.8]) of the
Eichler-Selberg trace formula for the Hecke operators acting on S2 (f0 (N) ). Let
o (a)
{ (b)
=
t2
d-
- t 2m < Oandm 1 mod(4) (c)
4t2m < Oandm =2,3 mod(4). (c')
For each ofthese we have
Reductions of Modular Curves 237
(a)
(b)
(c,c')
(a)
f = { !ll positive divisors oft (b,c,c')
and let
(a)
b(s,J) ={ !c,o(t If) (b)
h(d)/w(d), (c, c')
where cp is the Euler phi-function, h (d) denotes the class number of primitive
ideals of the order Od ofQ( Jd) with discriminant d, and wed) is the order of the
group of units OJ. For fixed s andf and a prime II N, let AI(S,J) (resp. BI(S,!))
be a complete system of representatives modulo lv/(N)+v/(J) for the set ofintegers
x E Z satisfYing
x 2 - sx + n == 0 mod (IV,(N)+2v/(J)) ,
2x == s mod (F,(J))
x 2 - sx + n == 0 mod (I V/(N)+2V/(J)+I)).
Put
CI(S,!) ={ IAI(S,!) I
if St
4n 1=- 0 mod (I)
IAI(S,!) I+ IBI(S,J)I ifs p4n == 0 model)
and let
c(s,!) = I1 CI(S,!).
liN
where the limsup is taken over all absolutely irreducible non-singular projective
curves X defined over F q .
A fairly elementary proof of this result based on construction of an explicit
tower of Artin--Schreier coverings was recently proposed by Garcia and Stichtenoth
[53, 54, 56] (see Sections 11.5 and 11.6).
Reductions of Modular Curves 239
EXERCISES
9.1. Let p be a prime number, and X ~ JPl2 the conic defined by uw - p3 v 2 = O. Show
that X is non-singular over Q, and its reduction X modulo p has a singular point at
(uo : vo : wo) = (0 : 1 : 0).
9.2. Let E be the elliptic curve defined over Q by v 2 = j(u), wherej E l2:[u] is a cubic
polynomial. Prove that E has bad reduction modulo p = 2 and modulo every prime
p which divides the discriminant Dif) ofthe polynomialj(u).
9.3. Prove that the elliptic curve E : v 2 + v = u3 - u2 has bad reduction both modulo
p = 2 and modulo p = 11.
9.4. Show that the curve E : v 2 = u3 - 2u 2 - u has good reduction both modulo p = 3
and modulo p = 5. Find all torsion points on E. (Hint: Use the following fact:
if (u, v) E Q2 is a torsion point of an elliptic curve defined over Q by v2 = j (u ),
where j E l2:[u], then (u, v) E 12:2 and v divides the discriminant Dif) of the cubic
polynomialj(u), or v = 0.)
9.5. Show that in characteristic p = 2 there exists only one supersingular curve E, which
can be given by v 2 + v = u 3 withj(E) = O.
9.6. Prove that for every prime p 2: 3, the number of Fp-rational points of the elliptic
curve E : v 2 = u 3 + u satisfies Np == 0 mod(4).
9.7. Prove that for non-zero a E Q the equation v 2 = u3 -a determines an elliptic curve
E over any field k whose characteristic p does not divide 6 or the numerator or
denominator of a. Show that:
(a) for q == 2 mod (3) the number N q of Fq-rational points of E is Nq = q + 1;
(b) if q == 2 mod (3), then E has exactly three Fq-rational points of order 3;
(c) if q == 2 mod(3) and 6 XN, then there are at most N Fq-rational points of
order NonE.
9.8. LetE be an elliptic curve defined over F q , and let! be a prime not equal to p = char F q .
Suppose that Fq contains alllth roots of 1, i.e., q = plL == 1 mod(l), and that there
are z2 Fq-rational points of order! on E. Show that:
(a) the splitting field ofu l - a, where a E Fq, has degree either lor lover Fq;
(b) ify E E(Fq) and if Fqv is the extension of Fq generated by the coordinates of
a solution x to the equation Ix = y (i.e., Fqv is the smallest extension of Fq
containing such an x), then there are 12 Fqv-rational points Xi such that IXi = y;
(c) given a point x E E(Fqv) such that Ix = y, the map u -+ u(x) - x gives an
embedding of Gal (Fqv /Fq) into the group of points of order Ion E;
(d) v = lorl.
9.10. Let! E Mk(fo(P» (p a prime) and let tr(f) = r!!=o!o ['Y;], where 'Yo = 1 and
(a) 'Yo and 'Y;, 0 < i :<:;p, are right coset representatives for f(l) modulo fo(P);
(b) tr(f) EMk(f(I»;
Geometric Goppa codes were introduced by V. D. Goppa [64] in 1981. This class
of codes is a very natural generalization ofReed-Solomon codes, which have been
well-known in coding theory for a long time, and rational Goppa codes, described
in Chapter 3. At present there is a variety of examples of algebraic curves over
a finite field Fq (with a lot of Fq-rational points), which can be used for explicit
constructions of good geometric Goppa codes (see Chapter 11). Furthermore,
there are several ways to construct asymptotically good geometric Goppa codes
on algebraic curves defined over a finite field. One is based on the use of classical
modular curves and makes it possible to produce a family of linear [n,k,dlq-codes
over a finite field F q , q 2: 49 being a square of a prime p, for which R + [) comes
above the Gi1bert-Varshamov bound. A second (and much easier) is based on
construction of a special sequence of Artin-Schreier coverings of the projective
line pI (Fq) and provides the existence of a similar family of [n,k,dlq-codes in
more general situations, when q 2: 49 is an even power of a prime number p. This
result can also be obtained through the use of Drinfeld modular curves. However,
the third approach allows us to construct only extremely long codes, which are
not suitable for practical applications. On the other hand, the first and second
ways are completely constructive and allow one to rather easily produce linear
[n,k,dlq-codes of acceptable length n. For this reason we restrict ourselves to
consider only asymptotically good geometric Goppa codes arising from classical
modular curves and Artin-Schreier coverings. The reader can find an extensive
presentation of the third approach in the book by Tsfasman and Vladut [208].
One can note also that the use of some Shimura modular curves (which generalize
classical modular curves) gives one more way to construct asymptotically good
[n,k,d]q-codes for any q = p2v.
241
Chapter 10
10.1. L-CONSTRUCTION
Let X be a smooth projective curve defined over a finite field F q . Goppa introduced
the remarkable idea of associating to a set of distinct Fq-rational points Xl , •.. ,Xn E
X a linear code by evaluating a set of rational functions on X at the points Xl, •.. ,Xn •
To be precise, let Xl, •.. , Xn be Fq-rational points of X and
Let D be a Fq-rational divisor on the curve X with the condition degD ;::: O. We
assume that D has support disjoint from the divisor Do, i.e., SuppDo n SuppD = 0.
The linear q-ary code C = C(Do, D) of length n associated to the pair (Do,D)
is the image of the Fq-linear evaluation map
Note that this definition makes sense: for f E L(D), we have VXj if) ;::: 0, 1 :::; i :::; n,
because SuppDo n SuppD = 0. Such a code C is called a geometric Goppa code.
The analogy with the definition of Reed-Solomon codes is obvious. In fact,
243
244 Chapter 10
and
d?n-degD.
and hence
C(Do,D) ~L(D)/L(D-Do).
Therefore
T-I
is an effective divisor. By taking degrees we find
degD -n +d? O.
Ev: L(D) -+ F;
is an embedding, and we have:
(i) C(Do,D) is an [n,k,dlq-code with
(iii) if {Ii , ... Jk} is a basis 0/L (D), then the matrix
fi (~I) fi (~2)
(
!k(XI) !k(X2)
is a generator matrix/or C(Do,D).
10.2. a-CONSTRUCTION
There is another geometric Goppa code that we can associate to the pair (Do,D),
where degD ::; n + 2g - 2. Consider the linear space of rational differential forms
Now, if (Res xI if ~), ... ,Resxn if ~)) has Hamming weight d*, i.e.,f ~ is
regular at n - d* points ofSuppDo, say Xii' ... ,Xin _ d*, thenf E L(K +Do -XiI -
... - Xin _ d * - D). By taking degrees we arrive at the inequality 2g - 2 + n - (n -
dO) - degD ~ 0, which completes the proof. •
Corollary 10.4. If degD > 2g - 2 then the map Res is an embedding, and we
have
k* ~ n - degD + g - 1.
Moreover, if2g - 2 < degD < n then
k* = n - degD + g - 1.
The following result gives an exact connection between codes C*(Do,D) and
C(Do,K +Do -D) (which shows that the O-construction leads us to the same
class of codes as the L-construction; in particular, all results that we derive for the
codes C*(Do,D) can be carried over to the codes C(Do,D)).
Proposition 10.5. There exists a rational differential form ~ on X having simple
poles with residue 1 at all points Xi E SuppDo such that C*(Do,D) = C(Do,K +
Do - D) with K the divisor of ~.
O(Do-D) ~ L(K+Do-D)
Res..l- ..l-Reso
F; F;
Constructions and Properties 247
where ex. is an isomorphism sending w to w / wo, and from the fact that Resx; (f wo) =
f(Xi) for 1 ~ i ~ n. •
The parameters kc* = n - degD + g - 1 and dc* = degD - 2g + 2 are called
designed parameters of the code C* = C*(Do,D). The designed parameters of
both L- and O-constructions satisfy
kc + dc = n - g + 1.
Remark
If degD < n, the lower bound k ~ kc is non-trivial only for degD > g - 1,
and if degD > 2g - 2 then the lower bound k* ~ kc* is non-trivial only for
degD ~ n + g - 1. In addition, k ~ n - g for degD ~ n, since /(D - Do) ~
deg(D-Do) + 1 = degD-n+ 1, andk* ~ n -g for degD ~ 2g-2, since /(K-
D) ~ deg(K - D) + 1 = 2g - 1 - degD. Similarly, the lower bound d ~ dc is
non-trivial only for degD < n, and the lower bound d* ~ dc* is non-trivial only
for degD > 2g - 2.
P-Construction
Recall (see Exercise 1.7) that the equivalence class of a non-degenerate linear
[n,k,d]q-code C is uniquely determined by a projective [n,k,d]q-system:P.
248 Chapter 10
LetX c F be a smooth projective curve over Fq andP = {Xl, ... ,Xn } S;;;X(Fq)
be a subset such that IPI > m and P does not lie in a hyperplane. Any choice of
P yields a projective [n,k,d]q -system with n = IPI, k = m + I and d = n-
maxH{IHnpl}, the maximum being taken over all Fq-hyperplanes He F.
For a curve Xc F over Fq its degree degX is defined as the number of points
X E X over Fq (counted with proper multiplicities) in its intersection with a general
10.3. PARAMETERS
Weierstrass point on a curve X and {a I , ... , ag_l} be the sequence of its non-
gaps, 2 ~ al < ... < ag_1 ~ 2g-I, i.e., /(ai ·x) > /((ai -1) ·x) for each i =
I,2, ... ,g - 1. Then /(ai ·x) = i + 1 and choosing D = aj·x we get a code
C = C(Do,D) with k = /(D) = i + 1, de = n - ai. The obtained value of k is
better than ke = degD - g + 1 for every i satisfying i + 1 > ai - g + 1. Since x is
a Weierstrass point this inequality holds at least for one i.
Therefore we have proved the following result:
Proposition 10.7. Let X be a smooth projective curve over Fq of genus g ~ 2
and XI, ... ,Xn be Fq-rational points of X. If n > 2g - 1 then there exists an
[n,k,d]q-code C = C(Do,D) with k > ke and d = de.
Weierstrass points and some more subtle techniques using Cartier operators
(see Lang [108, p. 311] and Tsfasman and Vladut [208, p. 166]) sometimes lead
also to better lower bounds for the parameters of geometric Goppa codes obtained
by the O-construction. Let C* = C*(Do,D) and D = 'Lf=1 mi ·pi , where Pi is a
prime Fq-rational divisor. Let mi = ni . q + ri, 0 ::; rj ~ q - I, and ail < ... < aig
be the Weierstrass gap-sequence at the closed point Pi. Let
and
s
d ~ degD+ LeljdegPi -2g+2.
i=1
Theorem 10.9. If D = qD' and D' = Iai . Yi, O:S ai :S q - 1, then the field
restriction gives a code c· = Co
n F; with parameters
[n,2 n - vdegD(q -l)/q -1,2 degD -2g+2]q.
In view ofthe condition ai :S q - 1, the left-hand side has a pole ofthe order at most
q - I. The order of any pole of the right-hand side is divisible by q. Hence if we
assume that the function I f3J( has at least one pole, we arrive at a contradiction. If
it has no pole then I f3J( = f30 and hence (I yJi)q = Y6 with y? = f3i. Therefore
I yJi = Yo, contradicting the linear independence of 1,./i , ... ,1m-g.
Extend the set F = {1,./i, ... '!m-g,j(, ... ,Pm-g} to a basis of the space L(D).
Since C' = ct F;,
n F;
an element v = (VI, .. . , vn ) E lies in C' if and only if
n
v-/= Lv/(xi) =0
i=1
for any f in this basis. Iff #- 1, in general we have v -/ E Fqv, and hence each
equation v -/ = 0 generates v linear equations over Fq (iff = 1 we obtain just
one linear equation). Observe also that the equation v .f; = 0 is equivalent to the
equation v·j( = O. Thus we see that the number of the linear equations over Fq
is at least by
v(m -g) + v-I
less than the a priori bound vl(D) = v(degD - g+ 1). This yields
In the case m :S g we set F = {I} and proceed as before. The a priori bound
v (degD - g + I) is sharpened by (v - 1), and we arrive at the inequality
dimC· 2 n - v(degD - g) -1,
Corollary 10.10. Let gq :S degD < n. If D = qD' and D' = Iai . Yi,
•
0 :S ai :S
q -1, then C' = C'(Do,D} has the parameters
[n,2 n -degD(q-1}/q -1,2 degD-2g+2]q.
Constructions and Properties 251
Automorphlsms
Let G ~ AutFq (X) be a subgroup of the group of all Fq-automorphisms of X.
Suppose that SuppDo and the divisor D are G-invariant. Then G operates on C =
C(Do,D). Indeed, in this case for any g E G andanyf EL(D) the function g* (j),
whereg*(j) (x) =f(g(x)), lies inL(D). Therefore a code-vector (j (x I) , ... ,j(xn))
is mapped to (g* (j)(xI) , ... ,g*(j)(xn)) = (j(g(XI)), ... ,j(g(Xn))), which is also
a code-vector. Thus G is naturally mapped to Aut(C) nSn and, according to the
properties of group codes, we have
where Hi is the stabilizer of Yi E SuppDo, {YI, ... ,Ym} being the set of orbit
representatives ofthe action ofG on SuppDo.
Proof: Consider first the case 2g - 2 < degD < n when both maps Ev and Res
are embeddings. Forf E L(D) and wE O(Do -D) the residue formula yields
n
(Ev(j),Res(w)) = LResxj(jw) = L Resx(jw) = O.
i=1 xEX
A code C is quasi-self-dual if there exists such a and is self-dual if a = (1, ... , 1).
Theorem 10.13. Letn > 2g-2 be even, anddegD = nI2+g-1. IfK +Do rv 2D
then an [n,nI2, 2: nl2 - g+ Ijq-code C = C(Do,D) is quasi-self-dual. Moreover,
there exists a unique (up to a multiplicative constant) differentialform WO E O(Do -
2D) such that the code C is quasi-self-dual with respect to a = (al' ... ' an), where
ai = ResXi (wo) =J O. In particular, ifResx1 (WO) = ... = Resxn (wo) then C is self-
dual.
an extension of the ground field). By the supposition of the theorem there exists a
:s :s
divisor iJ = a ·y+iJ', 0 iJ D such that deg(D - iJ) = 2g - 1 and degiJ ~ 2g.
The Riemann--Roch theorem yields L (iJ) =I L (iJ - y ). Letf E L (iJ) \ L (iJ - y ). It
follows from self-duality that there exists a differential form wE O(Do - D) with
Resx;(w) =f(Xi). We havefwo E O(Do - (D -iJ)) and Resxjifw) = Resx;(w),
i.e., w - fwo E O(iJ -D). On the other hand deg(D -iJ) = 2g - 1, and since
a non-zero regular differential form can not have (2g - 1) zeros then w = f WOo
Hence the order of zero of WO at Y satisfies
This is valid for eachy E SuppD; therefore (wo) ~ 2D, that is wo E O(Do - 2D) .
• Self-dual geometric Goppa codes have been studied by Driencourt and
Stichtenoth [29], Scharlau [158] and Stichtenoth [193].
Spectra
Determination of the weight distribution of a geometric Goppa code leads to very
subtle questions on the geometry of corresponding projective curve X. We are
restricted to description of some general facts.
"x·
d*
D-K", £..J 1T -D" ,
T=l
where D" is an effective divisor on X with the support SuppD" disjoint from {XiI'
... , Xid *}·
Proof: Let f E L (K + Do - D) and suppose that there are exactly s points, say
,xs , wheref wo has a pole. Then we have
Xl, .•.
s
if wo) = - L Xj + D + D"
j=l
EXERCISES
10.1. Let X = IP'I (Fq), and Xl , •.• ,Xn be distinct Fq-rational points onX. Let
n
Do= LXi
i=l
and D be a Fq-rational divisor onX with the condition SuppDo n SuppD = 0. Prove
that the geometric Goppa [n, k, d]q-code C = C(Do, D) associated to the pair (Do, D)
has the following properties:
(a) n :::;q+l;
(b) k = n if and only if degD > n - 2;
(c) for 0:::; degD :::; n - 2,
k = I + degD and d = n - degD;
°
10.5. The Fermat elliptic curve u3 + v 3 + w3 = is birationally isomorphic to the curve
X defined over F2 by the equation v 2w + vw 2 = u3 + w 3. Verify that the rational
points of X over F4 = {O, l,a,a} are the point at infinity xoo = (0,1,0) and eight
points
xl=(O,a,I), x2=(1,I,I), x3=(a,I,I), x4=(a,I,I),
xs=(O,a,I), x6=(1,0,1), x7=(a,0,1), xs=(a,O,I),
where a, a are roots ofthepolynomiaIF(t) = t 2 +t+ I. Let Do =Xl + .. ·+xs +Xoo
and letD = x + o-(x) + 0- 2(X) be a primeF2-rational divisor, where X is a Fs-rational
point of X such that x f/. SuppDo. Using the map
Res: O(Do-D) ~F!
construct an [9,6,2': 3]4-code C*(Do,D).
10.6. Let X be the non-singular projective curve of genus 1 given over F4 = {O, 1, a, a}
by the equation
u 2 v + av 2 w + aw2 u = 0.
This curve has nine F 4-rational points:
xl=(I,O,O}, x2=(0,1,0), x3=(0,0,1), x4=(I,a,a),
xs=(I,a,a), x6=(I,I,I), x7=(a,I,I), xg=(1,a,I), x9=(I,I,a).
Let Do =Xl + .. ,+x6 andD = 2x7 +xg. Using the map
Ev:L(D)~F2
construct an [6, 3,4]4-code C(Do,D).
10.7. Let C ~ (Fqv)n be a linear code over Fqv and
trv :Fqv ~Fq
is Fq-linear;
(b) (C')-L = trv(C-L);
(c) if C, C' and c" = trv(C) have parameters [n,k,d]qv, [n',k',d']q and
[n",k",d"]q, respectively, then
ks,k"s,v·k
and
k- (v-l)(n-k) S, k' S, k.
Chapter 11
Examples
First we consider the simplest case, that of linear codes coming from curves of
genus zero.
or
l-g
R+8>
- 1+--.
n
257
258 Chapter 11
In particular, for the codes coming from curves of genus zero, one has
k+d=n+l,
and we have
k+d2::n-1.
Codes on curves of genus g = 3 satisfy
k+d2::n-2
4 ~ n ~ Nq (3) and 1~ k ~ n - 3.
The information concerning all possible values for Nq (2) is given by Theorem
6.22 (see also the corresponding table of values of Nq (3) for small q placed in the
same section).
Examples 259
n = 24, k* = 26 - 2m, 2m - 4 :S d* :S 2m - 1.
Now we apply arguments presented in the previous chapter. If we take m = 3,
then we obtain d* ~ 2. We have d* = 2 if and only if L(D' - P) i= {O} with
o:S D' :S Do a divisor of degree 2 (note that K rv 2 . P). Suppose L (D' - P) i= {O}.
Then the two points over Fg are flex points, so the tangent there is not a bitangent
(aflex point is a point where the tangent has at least a 3-fold intersection withX).
So this is not possible and d* ~ 3. Again, d* = 3 if and only if there exists a
divisor 0 :S D' :S Do of degree 3 with L(D' - P) i= {O}, i.e., there exists a point
x E X of degree one over Fg such that x + P rv D'. But then they belong to a linear
system JP'(L(D")), with I(D") = 2, degD" = 3, and we can find a pointy over Fg
such that
x+P+y rvD' +y rv K.
In fact, by the Riemann-Roch theorem one has I(D') -1(K -D') = I, and we know
I(D') ~ 2, therefore I(K - D') ~ 1. This means that x + y rv P, i.e., 2·x + 2· y rv
2· P rv K, i.e., x ,yare two intersection points of a bitangent. But we do not have
bitangents which are tangent in Fg-rational points. Therefore d* ~ 4. By the
Hamming bound d* can not be 5. We have proved the following result:
The fact that d* i= 5 implies that there exists a divisor 0 :S D' :S Do of degree
4 such that L(D' - P) i= {O}, i.e., there exists an effective divisor P' of degree 2
over Fs such that D' rv P + P'. One can take P' = P.
We can try to get good codes from this curve with D = m . P and m ~ 5.
However, here one can not improve above the minimum value for d*: d* ~ 2m - 4.
In fact, let us show this for m = 5 and for m = 11. If m = 5 then 6 :S d* :S 9. We have
d* = 6 ifandonly if there exists a divisor 0 :S D' :S Do withL(K +D' - 5 ·P) i= {O},
i.e., D' '" 3 . P. This happens if and only if D' + P '" 2K, i.e., there exists a conic
which passes through P and the six points of D'. For suitable D' such a conic
260 Chapter 11
Hyperelliptic Codes
Let X be the hyperelliptic curve of genus g = 2 defined over F2 by the equation
v 2 +v = u5 + 1.
It has a 2-fold covering of pI ramified over the point at infinity and the genus of
X can be computed by the Hurwitz formula (see Section 4.5). We denote by r the
hyperelliptic involution. Let Xoo be the point of X lying over the point at infinity.
The number N2v of F2v-rational points of X is given by
if v =1= 41
if v = 41
So over F24 we find 33 points: Xoo plus two over each of the 16 points of the affine
line AI, namely points x and r(x). Now let Do be the sum of all 32 F 24-rational
points differentfromxoo andD = m ·Xoo with m 2: 3. For the code C' = C*(Do,D)
we find n = 32, k = 33 - m and m - 2:'S d* :'S m.
Suppose d* = m - 2. In that case there exists a divisor 0 :'S D' :'S Do of degree
m - 2 such thatD' ,...., (m - 2) ·Xoo. This is possible for m even by taking (m - 2)/2
pairs of conjugate points x and rex). We now assume that m is odd. Then
D' ,...., (m - 2) . Xoo is impossible if m = 3, since X is not rational. If m = 5, there
exists a divisor 0 :'S D' :'S Do with D' ,...., 3 . Xoo. Then 2· D' ,...., 2· K. The hyperelliptic
involution r acts as the identity on 12 . K 1since L (2 . K) is generated by products
of elements of L(K), hence 2· D' is a 2-canonical divisor invariant under the
involution. But then also D' is invariant. This contradicts the fact that degD' = 3.
Next, let m 2: 7. Then we can find a divisor D' with D' ,...., (m - 2) . Xoo. Indeed, the
points lying over the 5th roots of 1 on the affine line Al with v = 0 form a divisor
linearly equivalent with 5 . Xoo. By adding suitable pairs of conjugate points x and
r(x) one gets a D' of required form. We have proved the following result:
Proposition 11.3. Let X be the hyperelliptic curve defined over F2 by v 2 + V =
u 5 + 1 and let D = m . xoo with 3 :'S m :'S 31. Let Do be the sum of all F24 -rational
Examples 261
points ofX minus Xoo. Then the code C* = C*(Do,D) is a linear [32,33 -m,d*b-
code with d* = m - 1for m = 3,5 and d* = m - 2 otherwise.
if v = 21 + 1
if v = 21
The nine points over F4 are flex points and they give the points of order 3 in the
group law. Choose D as a prime F2-rational divisor which corresponds to a close
point of degree 3 over F2 (consisting of 3 new points Xi, 1 :5 i :5 3, of degree
one over F8), and choose Do as the sum of nine F4-rational points. The space
L(K +Do -D) has dimension 6. We get a linear [9,6, 2: 3kcode C* = C*(Do,D).
The curve E is an elliptic curve with origin (1 : 1 : 0). We have d* = 3 if and only
if Xl + X2 + X3 is a point of order 3 in the group. This is the case because this sum is
an F2-rational point and all points over F2 are points of order 3. The configuration
of the nine points of order 3 has a large automorphism group which is the group
of affine transformations of p} over F3.
v 2 +v =g(u),
where
/'(a) = TI (a + a').
a'EA
a'i-a
With n = 2a and q = 2", we define the residue map
where
w ~ Res(w) = (ResX1 (w), ... ,Resxa (w ),ResT(xl)( w), ... ,Res,,(xa)(w )).
Proposition 11.4. If q + 1 < n < N q (I), the elliptiC [n,k,d]q-codes are optimal.
q(q - 1)
g= .:;...c..::-2--'-·
The curve X has (q + 1) Fq2 -rational points at infinity (ofthe fonnxco = (1, ~,o),
where ~ is a (q + 1)th root of -1). Next, there are (q + 1) F q2 -rational points
of the fonn x = (u, v, 1), where u satisfies 1 - uq+l = 0, and (q2 - q - 1)(q + 1)
Fq2 -rational points of the fonn x = (u, v, 1) for each u such that 1 + uq+I :I: 0 and
for each v satisfYing v q + I = -1 - uq+ 1 • Thus the curve X has exactly
N q 2 = 2( q + 1) + (q + 1)(q2 - q - 1) = q3 + 1
F q 2 -rational points. Because N q2 = q2 + 1 + 2gq, the Hennitian curve X is maxi-
mal.
Choosing a, /3 E Fq such that a q + a = /3q+ I = -1, and setting
/3_
y __ /3(I+a)u-av
z=uy-a=
- v- /3u' v-/3u
we can transfonn
to the equation
264 Chapter 11
zq+z=yq+l.
The curve X given by the above equation has one F q2 -rational point Xao = (0,1,0)
at infinity and q3 F q 2-rational points of the form X = (y,z), where zq +z = yq+l.
Indeed, the non-zero elements u E Fq2 with u q+l E Fq form a subgroup of F;2
of the order (q - I)(q + 1) = q2 - 1, and the equation zq + z = t has exactly q
solutions in z E F q2 for each t E F q .
For n = q3 let Xl, ... ,Xn be all F q 2-rational points of X different from Xao. Set
and
d?:.n-m.
Note that if m > n + 2g - 2, the Riemamr-Roch theorem yields
k=m-g+ 1- (m-n -g+ 1) =n.
if 0:::;m<q3
if q3:::; m :::; q3 + 2g - 2
if 2g - 2 < m < q3
and the minimum distance d satisfies
d? q3 -m;
(ii) if 0 < m < q3 and both numbers m and q3 - m are non-gaps at xoo then
d=q3- m.
Proof:
(i) For 0:::; m < q3 we have k = l{m . x",,}. For q3 :::; m:::; q3 +2g-2 and
p, = q3 + 2g - 2 - m we find 0 :::; p, :::; q3, and then
has iq distinct zeros x = (a, 13) E SuppDo. Next, we choose j distinct elements
131, ... ,J3j E F q2 with J3~ + J3T = 'Y and set
j
h= TI(Z-J3T).
T=I
The rational function h hasj(q + I) zeros x = (a,J3) E SuppDo, and all of them
are distinct from the zeros of/because J3~ + J3T = 'Y i= aZ+ 1 for I ::; p ::; i and
I ::; T ::;j. Hence the function
(the existence of the automorphism a for above mentioned parameters A, '1/ and
e follows from the fact that a(y) and a(z) satisfy the equation a(z)q + a(z) =
Examples 267
Now we consider a class of rather long linear [n,k,d]q-codes over a finite field Fq
coming from fiber products of hyperelliptic curves with a lot of Fq-rational points
(Stepanov [188, 189]). Our approach allows us to determine explicitly a basis of
the space O(Do - D), and this provides an easy way to write out the generator
matrices for codes in the class and to find a fast decoding algorithm.
Let p be a prime number, v ~ 1 an integer and Fq a finite field with q = pV
elements. The field Fq is a Galois extension of Fp degree v with a cyclic Galois
group of order v. The action of a generator e of this group on an element v E Fq
is given by the rule e( v) = vI', and the map
1 v-I
normv(v) = V· e(v)··· e v- (v) = v·vP···vP
of Fq onto Fp is known as the norm of v. Let X be a non-trivial multiplicative
character of Fq and van element of Fq. Set
Z2 = f(u),
and observe that the genus of Y is g = l m;-l J and the number N q of Fq-rational
points of Y is
Nq = l:(1 + Xv(f(u))).
UEFq
and therefore
Nq = 2q _ql/2.
Thus the curve Y given by the equation
z2 = U+Uql / 2
is maximal over the field F q •
Denote by X a smooth projective model of Y. Let I, n ::; Nq be positive integers
and Xl , ••• ,Xn be Fq -rational points of X different from the point xoo at infinity. Set
DO=XI+···+Xn , D=I·xoo
and consider the geometric Goppa [n,k,dlq-code C = C(Do,D) associated to the
pair (Do,D).
Theorem 11.8. Let v > 1 be even and Fq a finite field of characteristic p > 2
with q = pV elements. For any integer I, t.
q 1/2 < 1 < n, there exists a geometric
Goppa [n,k,dlq-code C = C(Do,D) with
n ::;2q_ q l/2,
k > I - ! .ql/2
- 2 '
d?:.n-I.
Corollary 11.9. The relative parameters R = k / nand 8 = d / n of the code C =
C(Do,D) satisfy
ql/2
R?:.I-8- 2n .
In particular, for n = 2q - ql/2 we have
Now we extend the above approach to the case of the curve Y defined over
Fq as a fiber product in the corresponding affine space. Letfi, ... ,Is be pairwise
coprime square-free polynomials in Fq [ul of the same degree m?:. 1. Consider the
fiber product given over Fq by equations
Then
(i) the set A forms a subgroup of the additive group F: of the field Fq;
(ii) if {AI = A,A2, ... ,Ar} is the set of all cosets in F: jA and {aI, a2,.··, a r }
are distinct representatives of the cosets, then the polynomials
and therefore
This yields
vl2
ai -aj+(ai -aj)p = 0,
and we find that ai - aj is a root off(u), hence ai - aj EA. But ai - aj (j. A
according to the choice of aI, ... ,ar , and we arrive at a contradiction.
Finally, since IAI = pv/2 we find that
zf=fi(x), l:Si:Ss,
with polynomials
v/2
fi(x)=(x+ai)+(x+a;)P,I:Si:Ss,
Proof: We have
Nq = L (I+Xv(fi(u)))···(I+Xv(fs(u)))
UEFq
and hence
s
N q =pv+ L
It follows from Proposition 11.10 and Theorem 6.14 that
Xv(f;(u)) = {
o if u E Ai
1 if u E Fq \Ai '
and since any two distinct sets Ai and Aj have no common elements we obtain
Wil,···,i u z···
1I
·z·lu '
where il,"" ia are integers satisfying the condition 1 :S i I < ... < ia :S sand
are polynomials in k'[u]. Indeed, the differential form
Pi] , ... ,io-
, du
Wij, ... ,icr = z· .. ·z·
It lu
is regular at any point Uo E Al with the condition Zi(UO) =f. 0 for i E {il,"" ia}.
Now if Zi(UO) = 0 for a unique i E {il,"" ia }, then Zi is a local parameter at x? =
(UO,±ZI, ... ,±Zi-\,O,±Xi+I , ... ,±zs), so that Vxll(Zi) = 1 and vxn(u - uo) = 2.
Therefore, vxi,(du) = 1 and again wI j , ••• ,;" is re~lar at Uo. The f~rm Wo = du is
272 Chapter 11
also regular at any point Uo E AI. Thus, the differential forms w~ = du and w;I, ... ,iu
form a basis of the k'[u]-module n[Y].
It remains to clarify which of the forms wo and Wil, ... ,iu are regular at points
xii), ... ,xt). Let x", be one of these points. If I is a local parameter at x"" then
u = 1-2 U' , Zi = rmz;, where u' and z; are units in the local ring Ox",. Therefore
Wil, ... ,iu = I
I mu-3 T/il, ... ,i d·th
u I, WI
. . In hence (wiJ,
T/iI, ... ,iu a unIt m vx""
' ... ,i ) -- (mu-
u
3) ·X",. Thus, the differential form
. (u)du
p.'1,···,10'
Wil,···,iu = z···
II
·z·Is
is regular at x", if and only if
=~
2 u=1
±u(s) _ ± (;) U u=1
u=O mod(2)
= ~ (ms2 S- 1- 2s - 2s - 1 + 2)
2
Examples 273
and hence
g = g(X) = diIDk' fi[XJ = (ms - 3)28 - 2 + 1.
This completes the proof.
Now we use the curve Y ~ A8+ I defined over Fq by Equations (1Ll), where
•
Ji, ... ,Is E Fq [u] are pairwise coprime square-free polynomials of the same degree
ql/2 given by (11.2), to construct rather good linear codes over Fq. Let X bea
smooth projective model of Y over k' = Fq and letMq be the number ofFq-rational
points of X. From Proposition ILl 1 and Proposition lLl2 we have
and
g = g(X) = (sql/2 - 3)28 - 2 + 1.
Let 1 and n ::::; Nq be positive integers, XI, ••• ,Xn be Fq-rational points of X
different from the point Xoo at infinity, and
Do = XI + ... + X n , D=[·xoo.
Applying to X the L-construction for (sql/2 - 3)28 - 2 < [ < n we obtain the fol-
lowing result:
k ~ 1- (sql/2 - 3)28 - 2,
d ~ n -I.
Corollary 11.14. The relative parameters R = kin and S = din of the code
C = C(Do,D) satisfy
(sql/2 _ 3)2S - 2
R>l-S- .
- n
sql/2_3
R>l-S- .
- 2(2q l/2 -s)ql/2
274 Chapter 11
By a suitable concatenation one gets reasonably good codes over Fp. Indeed,
letko> I be an even number. Applying a linear [no,ko,dojp-code Co to an [n,k, djq-
code C = C(Do,D) over Fq, where q = pko, we obtain an [n',k',d'jp-code C' with
parameters
n' = non, k' = kok, d' = dod.
Let us denote by Ro = ko I no and 80 = do I no the relative parameters of the code
Co.
Corollary 11.15. For any positive integers no > 1, s ~ ql/2 and I, (sql/2_
3)2 S - 2 < I < n, there exists a linear [n',k',d'jp-code C' with
Relative parameters R' = k'ln' and S' = d'lnl of the code C' satisfY
zi = fi(u), I ~ i ~ s,
Proof: The proposition follows immediately from Corollary 8.2, Proposition 8.5,
Corollary 9.9 and Theorem 9.10. •
Note that the assertion of Theorem 9.10 remains correct for all positive integers
N relatively prime to p (see Tsfasman and Vladut [208, p. 426]), and we arrive at
the following result:
where
n'~ N(P-1)n(1+I-I)
12 liN
n = rN (P-1)
12
n(1 +r )1·
liN
l
(in this case the effect obtained by using Weierstrass points is maximal). Then we
have
where
ifm =2v
ifm = 2v+ 1
where n' ~ 2m - 3 (p -1) and k(l,m) is defined asfollows: k(l,m) = r ifand only
if
Let Fq be a finite field with q = pI) elements and Fq2 a quadratic extension of Fq.
Let Nq2 (X) be the number of Fq 2 -rational points of a smooth projective curve X
of genus g(X) defined over Fq 2 and
2. Nq2(X)
A(q ) = lIm sup -(X) ,
g(X)-t oo g
the supremum being taken over all smooth projective curves X of genus g = g(X)
(up to isomorphism over Fq2). As was shown before, there exists a family of
modular curves Xi such that
A( 2) _ r N 2 (Xi)
q - _1
q - j!! g(Xi) -q . (11.3)
much easier proof of (11.3), proposed recently by Garcia and Stichtenoth [52] and
based on construction of a sequence of (modified) Artin-Schreier coverings
···~X2~XI~XO
such that the ratio N q 2 (Xi) / g(Xi) tends to the Drinfeld-Vladut bound q - 1 as
g(Xi) ~ 00.
Let k = Fq2 and Lo = k(x) be the rational function field over k. We define the
sequence of smooth projective curves Xi over k recursively by
(11.4)
(11.5)
and
v
A;+l' {Z~+l
I
+ZHI =j(+l
I (11.6)
I' Ji = zdJi-1
for i 2: 1. Consider the corresponding tower of function fields
L j = Li-l (Zi).
Our purpose is to calculate the genus of the curve Xi for each i 2: 0 and determine
the number of k-rational points of Xi. To do this we use the ramification theory of
Artin-Schreier extensions described in Section 5.4.
The Genus of L;
From now on, we consider the tower (11.7) of algebraic function fields Li = k(Xi),
where the smooth projective curvesXo,XI ,X2, ... are defined by Equations (11.4)
to (11.6).
Lemma 11.19. Suppose that a prime divisor P E Div(Li) is a simple pole of
Ji = zdJi-1 E L j • Then the extension LHI/L j has degree [LHI : L;J = q, and Pis
totally ramified in LHI/Li. The prime divisor pI E Div(LHJ) lying over P is a
simple pole ofJi+I.
i.e., S(l) contains all prime divisors of LI which are either a pole of x or a
common zero of x and ZI - a, for some a E k* satisfying a q + a = O.
p(h) = h q +h.
To proceed to the critical step of the calculation of the genus gi = g(Li) we need
two lemmas.
Lemma 11.22. Let 1 :::; J.L :::; i, and let P E Div(Li) be a prime divisor ofLi lying
over Q/L (i.e., P is a common zero ofx ,ZI, ... ,z/L). Then (in the notation ofSection
5.4), we have at P:
and
yields
Asf/L = z/Llf/L-I, this gives the first assertion .. Now we setg = f:~: and obtain
hence
Proof: (By induction on T.) Suppose first that T = O. A prime divisor P E S6i )
is the common zero OfX,ZI, ... ,Zi-l andzi - a, with an element a E k* satisfying
a q + a = o. We have (Zi - a)q + (Zi - a) = J;~+;l, hence
(-(q+l)
Ji-2r
= .~Vi-2r-l
Mtr-(q+I)) +E(I)
'
so that
Zi
r(q+l) +E(I) .
= 'YJi-2r-1 (11.9)
Since.li =Zd.li-I' then
B -1 -- 'Y I + EU·q+l)
Vi-2T -
_
'Y
-I
+ EfI·q+1 )
Vi-2T-I· (11.11)
Using (11.9) and Lemma 11.22 with J.L =i- 27, we find
A =ZJi-2T
= ('YJ;:::~~~?+E(1))J;~2T_I (1+EW~2~~I))
= 'YJ;=~T-I +E(f;~2T-I)·
It follows that
Proof: We consider a prime divisor P E S~i) , where 0 :::; 7 :::; l i-:/ J. From Lemma
11.23, we have at P
,rl]+1 = ",~-(q+l) +E(l)
Ji lli-2T-I '
with 0 # 'Y E Fq and i - 27 -1 ~ 1. By Lemma 11.22,
C(q+l)
Ji-2T-I
= PVi-2T-2
(C(q+I)) +E(l)
'
hence
,rl]+1
Ji
= ,vVi-2T-2
Vl(~-(q+I)) +E(l). (11.13)
Since Li+1 = Li(ZHJ) andzi+1 +Zi+1 = J;q+l, it follows from Proposition 5.33(iii)
that the divisor P is unramified in the field L i + I. •
Proof:
(i) The assertion follows from Lemma 11.23, since VP(j;-2T-d = qi-2T-l (see
also Lemma 11.21) and Proposition 11.24.
(ii) (By induction on i.) The assertion is obvious for i ::; 1. Suppose now that
i ~ 2. If i == 0 mod(2), then pnLi-l E S(i-l) and, from the induction
assumption, the prime divisor P nLi-l is a simple pole of.Ii-I. By Lemma
11.19, the divisor P is then a simple pole of.Ii.
(i-l)
Po = ( pnLi-l ) E S j-3 •
""2
From (i) we know that vpo (j;-l) = -q. Since zi + Zi = I!-~! and P / Po is unrami-
fied, by Proposition 11.24, we conclude that
Lemma 11.26.
lYJ .
(j;) (i) = qi Qi - L qi-2T-l D~') - D(i) ,
T=O
where
D¥)= L P and D(i) = L P.
PES~i) PES(i)
The degree ofD~i) (resp. D(i)) is degD~i) = qi+l (q - 1) (resp. degD(i) = ql ¥ J).
Examples 283
Proof: This is straightforward induction based on the use of Lemma 11.25 and
the equations zf+1 +ZHI =1(+1 and.fi+1 = zHdk •
By Lemma 11.26, Proposition 5.33 and Proposition 11.24, the prime divisors
of Li ramified in L i+II Li are exactly the divisors P E S(i), and they are totally
ramified. The different exponent of a prime divisor pi E Div(LHd lying over
P E S(i) is d(PI/P) = (q - 1)(q + 2) (see Proposition 5.33(ii)), and the degree
of the divisor n(i) is equal to ql!¥ J. Hence, the Hurwitz genus formula (see
Proposition 5.30) gives the following recursion for the genus gi = g(Li ):
(i) Let P E Div(Lo) be the zero of x - a with a E k*. Then, the prime divisor
p splits completely in Li / Lo, i.e., there are exactly qi prime divisors pi over
Pin Div(Li ), all of them having degree one (this follows from Proposition
5.33(iv)).
(ii) The prime divisors P E S(I) have degree one, and they are totally ramified
in Li / L I. Hence, over each of these prime divisors there is a unique prime
divisor pi of L i , and this divisor has degree one.
There are (q2 - 1)qi prime divisors of type (i), q prime divisors of type (ii),
and q divisors of type (iii). So, we have:
Since N q 2(Li) = N q2(X;) andgi =g(Li) =g(X;), from Theorems 11.27 and
11.28 we deduce the following result.
Theorem 11.29. We have
. Nq2(X;)
~lm
1--)00 g
(X)
i
= q - 1,
hence
and
A*=A\{O}={aEkl aq-I=-l}.
Examples 285
(i) [L:k(y)]=q;
(ii) the functiony has a unique pole Poo in L; and the prime divisor Poo is totally
ramified inL/k(y);
(iii) for any a E A, the function y - a has a unique zero POI in L, and the prime
divisor POI is totally ramified in L / k (y);
(iv) for any l' E A, there is a unique common zero Qy ofy andz -1' in L;
(v) the principal divisor in L of the functions y - a and z - l' are as follows:
(y) = L Qy - qPoo,
yEA
(y - a) = qPOl
qPoo, - fora E A*,
(z-1')=qQy-Poo- L POI' for l' E A;
OlEcalA
(vi) the prime divisors of L that are ramified over k(y) are exactly the prime
divisors P00 and POI' with a E A *; their different exponents with respect to
the extensionL/k(y) are
(vii) the prime divisors of L that are ramified over k(z) are exactly the prime
divisors Qy, with l' EA.
(iii) if P E ':P(Li) isa prime divisor which is neither the pole of XI nor a zero of
XI - a, for all a E A, then P is unramified in Li / LI.
Our aim is to calculate the degree of the different Diff(Ld Li-d, for all i 2: 2.
By the previous properties, it remains to. investigate the behavior of the zeros Q of
XI in Li / Li- I . From the properties of the function field L = k(X), where
one has the following possibilities for such prime divisors Q E ':P:
(a) the prime divisor Q is a common zero of the functions XI ,X2, ... ,Xi;
In case (a), the prime divisors below Q are unramified in k(xs,xs+I )/k(xs ), for
s = I, 2, ... ,i - 1. This implies that Q is unramified in LdLi -I .
In case (b), the ramification indices of the prime divisors Q are determined as
follows:
(ii) for 2r + 1 < i, the prime divisor Q is totally ramified in Li / L2r+ I, and for
2r ~ s ~ i, the restriction of Q to Ls is unramified in Ls / k(xs);
(iii) if 2r + 1 < i, the different exponent d(Q) of Q in L;/L;_I is given by
d(Q)=2(q-l).
Then we have
and
degDiff(Li/L;-d = 2(q - l)qln/2J.
Examples 287
Moreover, any prime divisor Pa E !J'(LJ), which is the zero of XI - a, for a fj. A,
splits completely in all extensions Li / LI.
Summing up the above properties of the tower J:., = {Li} and applying the
Hurwitz genus formula we find that
Since
Nq2(Li) ~ (q _l)qi+l
we obtain the following result:
Theorem 11.30. The tower J:., = {Li} attains the Drinfeld-Vladut bound over
k= Fq2, so that
EXERCISES
11.1. Let X be a smooth projective curve defined over Fq . Write out the relation between
n = IX(Fq)l, degD and /(D) corresponding to upper bounds for codes given in
Chapter 2 and applied to geometric Goppa codes on X.
11.2. Let n = 2m. Show that a geometric Goppa [n, n /2, n /2 + l]q-code C of genus zero is
always quasi-self-dual, and that for an even q there exist self-dual geometric Goppa
codes with these parameters. (Hint: If g = 0 then any divisor of even degree is
divisible by 2 and all the divisors of a given degree are equivalent.)
11.3. Check that any elliptic code with n = 2m is formally self-dual.
11.4. Let N = Nq be the number of Fq-rational points of an elliptic curve E. Prove that:
(a) if N is odd, there exists a quasi-self-dual [N -1, (N -1)/2, (N -1)/2]q code
onE;
(b) if N is even, there exists either a quasi-self-dual [N,N /2,N /2]q-code or a
quasi-self-dual [N - 2, (N - 2)/2, (N - 2)j2]q-code;
(c) if q is even, there exist com;:sponding self-dual codes.
11.5. Let E be the plane curve over F2 given by the affine equation v 2 + v = u3 + u + 1.
Show that:
(a) the curve E c Jp>2 is absolutely irreducible and non-singular, i.e., E is an elliptic
curve;
(b) IE(F2 )1 = 1, IE (F4) I = 5, and IE(F16) I = 25;
(c) E is maximal curve over F24;
288 Chapter 11
(d) the spaceL(m ·xoo), wherexoo = (0: 1 : 0), has a basis {u i viI2i+3j:::; m}.
Write out the generator matrices of codes C(Do,m .xoo) and C(Do,m ·xoo) over F4
andF16, respectively, where SuppDo = E(F4) \ {xoo}, and SuppDo = E(F16) \ {xoo}
for m = 2, S, 12, 16. Calculate their spectra.
11.6. Let E be the curve over F52 given by v 2 + v = u 3 . Show that E c]p>2 is a maximal
over F52 elliptic curve. Write out generator matrices of codes C(Do,m ·xoo) for
m = 2,S, 16, where SuppDo = E(F52) \ {xoo}.
11.7. Let E : v 2 + v = u 3 + bu + c be an elliptic curve defined over F 2. Prove that
for v 2 + v = u3 + u + 1
for v 2 + v = u3 or u3 + 1
for v 2 + v = u3 + u.
1 +2t+2t 2
Z(E,t) = (l-t)(1-2t);
2v + I ifv==2,6 mod(S)
2 v + 1 +2 v/ 2+1 if v == 4 mod(S)
{
N2"= 2v+I_2v/2+1 if v == 0 mod(S)
2 v + 1 +2(v+l)/2 if v == 1,7 mod(S)
2v + 1 _ 2(v+l)/2 if v == 3,5 mod(S).
Chapter 12
This chapter concerns the decoding problem for geometric Goppa codes. We
consider various aspects of the problem beginning with results on the existence
of decoding algorithms and ending with ones on the construction of efficient
algorithms which can easily be used in practice. For a detailed treatment of the
complexity of algorithms we refer the reader to Aho, Hopcroft and Ulman [2].
~:
such that ~(u) = u, for all U E C, is called a decoder or a decoding map for the
code C. We allow the decoder to give as outcome "?" when it fails to find a
code-vector.
We recall that a minimum distance decoding for a code C is a decoder ~ such
that ~(v) E C' is a nearest code-vector to v, for all v E F;. A decoding error of a
decoder occurs when the decoded vector is different from the transmitted vector.
A maximum likelihood decoding minimizes the probability of a decoding error.
Minimum distance decoding is equivalent to maximum likelihood decoding for
a q-ary symmetric channel (in which the probability that a symbol is changed to
another one is the same for all symbols in the alphabet and does not depend on the
position in the transmitting vector).
289
290 Chapter 12
C = {u E F; IH . U T = o} .
For a received vector v = (VI"", v n ) E F; and the parity-check matrix H for C,
the syndromes are defined as
n
Si(V) = ~>ijVj, I ~ i ~ n.
j=1
We can extend the matrix H to an n x n matrix H' such that the rows
I ~ i ~ n,
of H' form a basis of the space F; and the first n - k rows are from H. Let
e = v - {(v) = (el, ... , en) E F; be the error-vector. The n syndromes
n
si(e) = L aijej, I ~ i ~ n,
j=1
determine the error-vector uniquely, but only the first n - k syndromes are known,
since s i ( e) = S i ( V ), for i = I, 2, ... , n - k. The remaining syndromes are called
unknown syndromes. Later we will show that the unknown syndromes can be
obtained recursively from known syndromes s i ( e) = S i ( V ), I ~ i ~ n - k, by a
majority vote.
The set of all vectors with the same syndrome as v = (VI, . .. , v n ) is the coset
v + C. If v'is a coset leader of v + C (an element of v + C of minimal weight),
a simple minimum distance decoding consists of an exhaustive search for a coset
leader. Alternatively, we can produce a list of all coset leaders. It is clear that
both these decoding procedures have exponential complexity as a function of n,
since either one has to search among qRn elements of the coset v + C to find one
of minimal weight, or one has to store q(I-R)n coset leaders.
Now we briefly describe the decoding problem for linear codes. Let A be an
algorithm which has as input a pair (C,v), where C is a linear [n,k,d]q-code and
v is a vector of the same length n. Then Ac is the restriction of the algorithm A
to C, if Ac has as input a vector v = (v I , ... , v n ) and as output A ( C , v) computed
by A. Consider the following problem: Find an algorithm A which has as input
(C,v), where C is a linear [n,k,djq-code and v = (VI, ... ,vn ) a received vector,
and as output a vector A( C, v) in C ' such that Ac is a minimum distance decoder
Decoding Geometric Goppa Codes 291
for C. This problem is NP-hard, and it can be divided into two parts. First, for
an appropriate code C the preprocessing part provides a decoder Ac. Second,
the algorithm Ac should work very fast. Thus, the decoding problem can be
formulated as the problem of minimum distance decoding with preprocessing.
All the known decoding algorithms which have polynomial complexity decode
only up to some bound depending on the code (for example, up to halfthe designed
minimum distance). We say that a decoder
°
Proof: Clearly, the error-vector is a solution. Now, if Z = e' is another solution,
then H . (e' - e) = and hence e' - e E C. On the other hand, we have II e' - e II ~
d - 1. This implies e' - e = 0, so e' = e. •
Thus we have shown that we can reduce error-correcting to the problem of
finding the error-positions. To decode all received vectors with t errors, we have to
consider ( ~) possible t -sets for error positions. This number grows exponentially
with n when tin tends to a positive real number. Proposition 12.1 shows us that
292 Chapter 12
For a code to have practical use, it is essential that it possess an effective de-
coding algorithm. We present a generalization of the decoding algorithm for
Reed--Solomon codes in the case of an arbitrary geometric Goppa code (see Juste-
sen, Larsen, Jensen, Havemose, Hoholdt [87], Skorobogatov, Vladut [180] and
Tsfasman, Vladut [208, 209]).
the parity-check matrix of the code C* is given by (!J. (Xi)). Define elements s/LP
as
s/LP = s/Lp(v) = s(v,g/Lhp), 1 ~ J.L ~ m, 1 ~ p ~ r,
and note that the following system of linear equations plays the crucial role for
decoding of the code C*:
1 ~ p ~ r. (12.1)
Proposition 12.2. IfI(D') > t then the system (12.1) has a non-trivial solution in
elements Zi E Fq . Moreover, if
then for any solution Z = (ZI, ... ,Zm) E F:;' o/the system, the/unction
On the other hand, since u E C* and gh~ E L(D) then s(u,g'hp ) = 0, and since
ei =0 for i f/. I and g' (Xi) = 0 for i E I (because g' E L (D' - LiEI Xi) then
Now we take an arbitrary solutionz = (z" ... ,zm) of the system (12.1) and set
m
gz = L z/Lgw
/L=l
and hence
m r
s(v,gzh) = L z/L LYp(v,g/Lhp)
/L=l p=l
r m
= LYp L s/Lpz/L = 0,
p=l /L=l
Proposition 12.3. If
degD > degD' + 2g - 2
then the system (12.3) has at most one solution.
Proof: Suppose that wand w' are two different solutions of (12.3). Then w - w'
is a solution of
L f>..(Xi)Ui = 0, I "'5: A "'5:1,
iEI(gz)
i.e., the vector U = (UI,'" ,un) with Ui = Wi - wI for i E I(gz) and Ui = 0 for
i (j. I(gz) is a non-zero code-vector. Since gz E L(D') the weight of U can be
estimated as follows:
But the weight of a non-zero code-vector cannot be less than the minimum distance
d* , and we arrive at a contradiction. •
(I) Find a basis {Ii, ... ,it} of L(D), a basis {gl,'" ,gm} of L(D') and a basis
{hi, ... ,hr} of L(D -D').
I(D') > t,
degD > degD' +2g-2+t,
then
Oi) one can choose the divisor D' in such a way that the algorithm corrects all
errors of weight
Ilell::; (dc* -g-I)/2.
Proof: The assertion (i) is obvious from Proposition 12.2 and Proposition 12.3.
To prove (ii) we assume that t ::; (dc* - g - 1) /2, and choose a divisor D' such
that degD' = g + t and SuppDo n SuppD' = 0. By the Riemann-Roch theorem,
and hence
degD> degD' +2g-2+t.
This completes the proof.
To use the basic algorithm we must know D' explicitly. The degree of D' has
•
to satisfy the following two inequalities:
which contradict each other when t is large enough. The largest possible value for
t which satisfies both the inequalities is
Lett = L(dc* - g-I)/2 J, and take a divisor D' ofdegreeg+t with support disjoint
from Do. One can easily show that such a divisor always exists. If we insert the
construction of such a divisor D' and bases for L(D), L(D') and L (D - D') in the
preprocessing part, the complexity of the basic algorithm is at most O(n 3 ).
Decoding Geometric Goppa Codes 297
and let L(D' - E) be the space of error-locator functions in L(D'), that is, the
space of rational functions g' E L(D') which vanish at all points Xi E SuppE.
Since L( v,D') contains all the error-locator functions of L(D'), we have
The linear space L (V, D') can be determined as soon as we know the received vector
v. Moreover, if L(D' - E) = L( v,D'), we can find a non-zero rational function
g' which vanishes at all error-positions. In that case, Proposition 12.1 allows one
now to find the corresponding error-vector e. In this way one can easily remove
the condition SuppDo n SuppD' = 0, and we obtain the following version of the
basic algorithm (see Duursma [31, 33] and Ehrhard [35]):
(iO If fl(D' - °
D + E) = 0, L(D' - E) i= and g' is a non-zero element of
L(v,D') with the set of zero -positions J = {j 19'(Xj) = OJ, then the system
of equations
One deficiency of the basic algorithm is that it corrects only errors of weight
S (dc' - g - I) /2, but not all errors of weight S (dc' - 1)/2. The following
heuristic argument shows that the basic algorithm corrects L(dc' - 1) /2J errors
most ofthe time. Indeed, let E be the divisor of error-positions, and let L(D' - E)
be the space of error-locator functions. If t = L(dc' - 1) /2 J, and degD' ? g + t,
°
then L(D' - E) i= for all divisors of t error-positions. The set of divisors E
298 Chapter 12
Then we have the following result (see Skorobogatov, Vladut [180] and Tsfasman,
Vladut [208, 209]).
lDe;-l_ S (A)J
It follows from the Clifford theorem that O"(A) ~ 0. Suppose that the designed
minimum distance of C* = C*(Do,D) is odd. IfQ( = {Ao,AI, ... ,Ag-I} is a set
of divisors such that degAi = 2g - 2 - 2i, then define O"o(Q() to be the maximum
over all 0" (Ai ), for i = 0, 1, ... ,g - 1. Let Db,D~, ... ,D~ be a sequence of divisors,
with supports disjoint from the support of Do, defined recursively by letting Db
be a divisor of degree (g - 1) /2 and D; be a divisor which is equivalent to D -
D;_I -Ai-I. Let
de* -1
t = - 2 - - O"o(Q().
Then O(Db - D +E) = 0, L (D~ - E) =1= 0, and for any divisor E of terror-positions
dc* -1
-2--O"i(~)
errors, where i = 0 when the designed minimum distance is odd and i = 1, other-
wise. The complexity of the algorithm is at most O(n 3 ).
The Clifford defect is computed for several curves (see [31]) and is aboutg/4
for plane curves.
It is possible to give an upper bound for the number s of divisors D;, ... ,D~
which are needed in order that for each received vector v :;:: (VI, ... , v n ), with at
most l(dc* -1)/2J errors, at least one of the basic algorithms A(DD will correct
v (see Pellikaan [139] and Vladut [214]).
Again we consider the decoding of codes C* = C*(Do,D). Assume that
degD > 2g - 2. Let t = l (Dc* - 1) /2 J, and suppose for simplicity that dc* > 1 is
an odd number. Let D; , ... ,D~ be a collection of effective divisors of degree g + t.
Then I(DD > t, so L(D; - E) is not zero for all i = 1,2, ... ,s and all divisors E
of t error-positions. Let v be a received vector with divisor of error-positions E.
If there exists an index i such that fl(D; - D + E) = 0, then A(DD will decode v
by Proposition 12.5. So suppose on the contrary that fl(D; - D + E) =I=- 0 for all
i = 1,2, ... , s. Then there exist a differential form Wi such that ( Wi) ;:::: D - D: - E;
that is, there exists an effective divisor Ai such that
(Wi) = D-D; -E +A i .
Taking degrees we obtain degAi = g - 1 for all i. Denote the class of a divisor
B E Div(X) by [B]. We have [Ai - D:] = W - [D - E], hence [Ai - Dn
does not
dependoni. In that case [Ai -Aj ] = [D: -Dj] for all i,j = 1,2, ... ,s, SOAi -Aj and
D; - Dj represent the same divisor class in Pico (X) for all i,j. Let ~ m denote the
set of all effective divisors in Div(X) of degree m, so Ai E ~g_1 and D: E ~g+l.
Now we consider the map
defined by
I/I:"(BI, ... ,Bs ) = ([BI- B2], ... ,[Bs -I-Bs ]).
It is an obvious consequence of the Riemann-Roch theorem that the map 1/1:,. is
sUljective for all m ;:::: g and s. Suppose now that 1/1:-1 is not surjective. Then
we could have started with an s-tuple (D;, ... ,D~) such that 1/1:+1 (D; , ... ,D~) rf.
Im( 1/1:-1). In this way we would arrive at a contradiction with the assumption that
fl(D: - D + E) =I=- 0 for all i. Thus there exists an i such that fl(D: - D + E) = 0,
and therefore the basic algorithm A(D;) corrects the errors of the received vector
v. We have sketched the proof of the following result (see [139]):
Decoding Geometric Goppa Codes 301
Proposition 12.9. Let (D~, ... ,D~) be an s-tuple of effective divisors in Div(X)
of degree g+ t such that I/I:+t(D~, ... ,D~) is not in the image of 1/1;_1· Thenfor
every received vector v with at most L(de' - 1) /2 J errors there is at least one
i = 1,2, ... , s such that the basic algorithm A(DD corrects v.
Now let
P(u)
Z(X,u) = (l-u)(l-qu)
is a decomposition of P(u) into linear factors in some finite extension ofQ, then
lail = ql/2, for all i = 1,2, ... ,2g. The number h = hq(X) of elements ofPico(X)
is equal to P( 1), so
This allows one to prove that the map I/I~~ I is not smjective for all maximal curves
(see [139]), for all curves when q ?: 37, and for all curves ofgenusg?: go(q) when
q ?: 16 (see [214]). As a result we obtain the following theorem:
Theorem 12.10. There exist s divisors D~, ... ,D~ such that for every received
vector v = (VI' ... ' v n ) with at most L(de. - 1) /2 J errors, at least one of the
basic algorithms A(DD, ... ,A(D~) corrects the errors with respect to the code
C* = C*(Do,D) over Fq,forall q ?: 37 andalldivisorsDsuchthatdegD > 2g-2.
Furthermore, s = O( n) and the complexity of the algorithm is O( n 4 ) for n -+ 00.
Unfortunately, this result is not not effective, since it does not provide any
construction of the divisors D~, ... ,D~. On the other hand, if the curve X has
gonality m and at least two Fq-rational points, then the map tfJ:::- I is not smjective
and there exist 2m divisors D~, ... ,D~m' which can be constructed explicitly, such
that the corresponding basic algorithms, run in parallel, correct L(de' - g - 1 +
m)/2J errors.
One can regard the ring of polynomials in one variable as the ring of rational
functions on the projective line pi with poles only at the point xoo at infinity.
The ring of polynomials in one variable is replaced by the ring Roo (x) of rational
functions on the curve X with poles only at a fixed Fq-rational point x E X, where
x differs from the points XI, ... ,Xn used to construct the geometric Goppa code
C* = C*(Do,D). The weight ofa rational functionf E Roo(x) is defined as the
order of the pole off at x and is denoted by w(f). The ring Roo (x) with the weight
function w(f) is not an Euclidean domain unless the genus of X is zero, but it still
has very similar properties. For allf, h E Roo (x ) we have:
(iii) if w(f) = w(h), then there exists an element ,.\ E F; such that w(f -"\h) <
w(f).
If the number of rational points on the curve X is greater than n, the divisor D
in the definition of the code C* = C*(Do,D) can be taken in the form J.L·X -A,
where A is an effective divisor and J.L is a positive integer. Next, we can always
find n linearly independent differential forms WI , ..• , Wn E n( Do + J.L . x) such that
ifi =j
otherwise
n
w(v) = L ViWi,
i=1
then (Resxj (w( v)), ... , Resxn (w( v))) = v. Therefore, the map
w: F; -tn(Do-A+J.L·x)
is the right inverse of the map
if and only if
v E C*(Do, f.L·x - A).
Suppose that A is the divisor of a function h E Roo (x) which does not vanish
at all points XI, ... ,Xn • We want to define the syndrome s( v) of a received vector
v. In order to present the syndrome as a rational function, one first proves the
existence of a particular differential form w'. The syndrome s (v) is now defined
as follows:
, n h(xi)-h
s(v)w = LVi h( .) Wi·
i=1 X,
Let us assume for simplicity that w' is a differential form such that (w') =
(2g - 2)·x (this assumption is satisfied, for example, for Hermitian curves). Now
one searches for solutions of the key equation (12.4), that is for pairs if, cp) with
f, cp E Roo (x) such that there exists an r E Roo (x) with the property
fs(v) = cp+rh.
A valid solution if, cp) is called minimal if w if) is minimal among all the weights
off', such that if', cp') is a valid solution. In this way we get the following result
(see Ehrhard [34, 35] and Porter, Shen, Pellikaan [147]):
(i) There exists a valid solution if, cp) afthe key equation (12.4) such that
cpw'lf E n(Do + f.L' x) and (ResX1 (cpw' If),··· ,Resxn (cpw' If)) = e.
(ii) Let t = (de -1 )/2 - (j, where (j is the Clifford defectofx. Suppose Ilell ::; t.
If if, cp) is a minimal solution of the key equation (12.4), then
cpw'lf E n(Do + f.L' x) and (ResX1 (cpw' If), .. ·, Resxn (cpw' If)) = e.
304 Chapter 12
The explicit computation of the differential forms WI, ... ,Wn and finding for-
mulas for the syndromes is, in general, quite elaborate, but we consider this as
part of the preprocessing of the algorithm. The above algorithm is worked out in
details for Hermitian codes (see Porter, Shen, Pellikaan [147]).
Now we compare Porter's and the modified algorithm, and describe Ehrhard's
algorithm [36] which gives an effective solution of the decoding of geometric
Goppa codes.
The valuef(x) ofa rational functionf at a point x is defined only iff does not
have a pole at x. As we have seen, one can take the space L (V, D') in such a way
that SuppD' n SuppDo =1= 0. The code C* = C*(Do,D) is the image of the residue
map
Res: O(Do -D) -+ F;
and this map is injective if we assume that degD > 2g - 2. There exists a divisor
D" :::; D such that the map
Res: O(Do - D") -+ F;
is sUljective. Moreover, there exists a linear map
One can show that L'( v,D') = L(v,D') when D' has support disjoint from the
support of Do and
degD' > max (degD",deg(D -Do)).
The following result is similar to Theorem 12.11 (ii) (see Ehrhard [34, 35]):
Proposition 12.12. Suppose L (D' - E) ::f. 0 and O(D' - D + E) = O. Iff is a
non-zero element ofL(v,D'), then
is the error-vector of v.
Decoding Geometric Goppa Codes 305
We can easily compare the modified algorithm and Porter's algorithm in the
special case when D = m . x and there exists a differential form w' with divisor
(w') = (2g - 2)x. Iff is a non-zero element of L(v,i ·x) for the smallest i E
{I, 2, ... , n} such that L (v, i . x) =f. 0, then there exists an cp E Reo (x) such that
if, cp) is a valid solution ofthe key equation (12.4). Conversely, if if, cp) is a valid
solution of(12.4) and i = wif), thenf is anon-zero element ofL(v,i ·x), and i is
the smallest integer such that L (v, i . x) =f. 0.
Now we describe Ehrhard s algorithm which produces a sequence of divisors
{Di ,... ,D; }. It depends on the received vector v and has the property that the
basic algorithm A(D;) decodes v when there are at most t = L(dc' - I) /2 J errors.
In this way, the elaborate problem of constructing the sequence of divisors is
circumvented, although this algorithm still has the complexity of solving a system
of linear equations.
(3) Look for an index j E {1,2, ... ,n} such that dimL'(v,D; -Xj) :S
dimL'(v,DD - 2. If there is such aj, then: setD;+1 = D; -Xj, incrementi
and continue at step (3), else
(4) If dimL (v, DD = 0, then continue at step (5), else continue at step (6).
(5) Output?
(6) Compute e = (ResX1 (7Tifw,;)/f), ... , Resxn (7Tifw,;)/f)) for some non-zero
f EL'(Di ).
(7) Output v-e.
If we apply both algorithms 'B(D') and 'B(D - D') for a divisor D' such that
degD' = g + t, then it is enough to assume that de* 2: 4g - 2m, where m is the
gonality of the curve X (see [33]). Moreover, it is shown in an example that this
cannot be improved.
Now we restrict our attention to one-point codes, that is geometric Goppa codes
of the form C = C(Do,m ·x) or C* = C*(Do,m ·x), where m is an integer and
x E X is an Fq -rational point which is distinct from the points Xl, . .. , Xn. We
shall show how for one-point codes one can extend the parity-check matrix H
with rows ai = (ail, ... ,ain), 1:::; i:::; n -k, to an n x n matrix H' with rows
ai = (ail, ... , ain), I :::; i :::; n. This will be done in such a way that the unknown
syndromes Si (e) = ai . eT , i > n - k can be obtained recursively from known
syndromes Si = SiC V), 1 :::; i :::; n - k, by a majority vote (see Feng and Rao [39],
Duursma [32, 33], Kirfel and Pellikaan [95] and Pellikaan [141]).
Let Nx = {O = ml < m2 < m3 < ... } be the non-gap sequence of x. The
non-gaps form a semi-group in the set of non-negative integers which is generated
by m2,m3, ... ,mg +2. Let gi be a rational function on X which has a pole of
order mi at X and no other poles. Then {gl, ... , gr } is a basis for L (m r . x). Let
ai = (gi(XJ), ... ,gi(Xn)), and Hr be the r x n matrix with ai, 1 :::; i :::; r, as rows.
Then Hr is a parity-check matrix of the code C' = C*(Do,m r ·x). We note that
the rows of Hr need not be linearly independent. Define a matrix of syndromes
Let nr = INr I, and define the Fen~Rao minimum distance dFR (r) of the code
C* = C*(Do,mr ·x) by
Note that the definition of dFR (r) depends only on the semi-group of non-gaps
of x. One can check that dFR (r) 2: de' and equality holds if r > 3g - 2. In many
examples dFR(r) is greater than de' strictly for small r (see Kirfel and Pellikaan
[95]).
The entries ofthe matrix of syndromes with (i ,j) E Nr are the first unknown
syndromes we encounter with respect to C* (Do, mr . x). As soon as we know one
sij(e) with (i,j) E N r , we know all the others Sen with ((I", T) E N r , since each one
of the functions gig}, gUgT' or gr+1 is a generator of the one-dimensional vector
space L(mr+1 ·x) modulo L(mr ·x). In other words, there exist aij,aijl E Fq such
°
that aij =I- and
gig} = aijgr+ 1 + L aijlgl
ISr
and this relation is the same for all error-vectors. Consider the matrix
If mi + m} = mr+l, then all entries of this matrix, except sij(e), are known. Next,
if mi + m} = mr , then S (i ,j) is a matrix of the linear map from L (m) . x) to
L(mi ·x) which is used to compute the space L(v,m} ·x) in the basic algorithm
A(m} ·x) for the code C* = C*(Do,mr ·x). The rectangular sub-matrices S(i,j)
with mi + m} = mr , is the collection of matrices which one encounters in the
modified algorithm for C*(Do,mr ·x). If g' E L(m} ·x) is a non-zero error-locator
function and
}
g' =L bTgT)
T=I
then the columns of the matrix S(i,j) are linearly dependent:
}
L bTsU,T(e) = 0, for all 1::; (I" ::; i.
T=I
If (i,j) E N r and the three matrices S(i -l,j - 1), S(i -l,j), and S(i,j -1)
have equal rank, then (i,j) is called a candidate with respect to C*(Do,mr ·x). If
(i,j) is a candidate, then there is a unique value sij(e) to assign to the unknown
entry S ij (e) such that the matrices S (i ,j) and S (i - I ,j - I) have equal rank. The
element sij (e) is called the candidate value of the unknown syndrome sij (e). A
candidate is called correct when sij = sij and incorrect otherwise. Denote the
number of correct candidates by M and the number of incorrect candidates by N.
308 Chapter 12
An entry (i,j) is called discrepancy if the three matrices S(iJ), S(i -I,j), and
S(i,j - I) have equal rank and the matrices S(iJ) and S(i -I,j -I) do not have
equal rank. The total number P of discrepancies is equal to the rank of the matrix
of syndromes, soP:::; Ileli.
Let v be a received vector with error-vector e which has at most (nr - I) /2
errors with respect to C*(Do,m r ·x). Then all syndromes sij (e) such thatmi +mj :::;
mr are known, and the remaining syndromes are unknown. Denote the number of
known discrepancies by Q. A candidate is correct if and only if it is a discrepancy,
so
N + Q:::; P :::; lIell.
If (i ,j) is a known discrepancy, then all entries (i, T) in the ith row with T > i, and
all entries ((T ,j) in the jth column with (T > i are not candidates. If (i ,j) E 'Nr
is not a candidate, then there is at least one known discrepancy in the same row
or column. Thus the number of pairs (i,j) E 'Nr which are not candidates is at
most 2Q. The number of pairs (i,j) E 'Nr which are candidates is equal to M + N.
Therefore,
nr :::;M+N+2Q.
Furthermore, we assume that
N:::;M-l.
Problem II. What is the relation between Ehrhard's decoding algorithm and ma-
jority voting?
Problem III. Does majority voting correct more than l (dFR - 1) /2 J errors?
The basic and the modified algorithms as well as the majority scheme have the
complexity of solving systems of linear equations, for finding both the error-
locations and the error-values. If one uses the special structure of the syndrome
matrix, the complexity of the majority-voting scheme can be reduced from O(n 3 )
to O(n 7 / 3 ). This is done by Feng, Wei, Rao and Tzeng, using the block-Hankel
structure of codes on plane curves. The Berlekamp-Massey-Sakata algorithm,
which is a generalization of well-known Berlekamp-Massey algorithm on linear
recurring relations in one variable to the case of several variables, allows one to
get fast implementations of the modified algorithm, of Porter's algorithm and of
the majority-voting scheme (see [87, 156, 157]). In this section we show how the
Berlekamp-Massey--Sakata algorithm is used for decoding one-point codes up to
half of the Feng-Rao distance.
Consider codes of the fonn
For the fixed rational function g" we associate with each vector v =
(VI, ... , vn ) E F; a syndrome s" (v) by
n
s,,(v) = L Vig" (Xi)'
i=1
Then we find that U = (UI, ... , un) E C* if and only if s,,(u) = 0 for all a with
w(a) ::; m. In the decoding situation, v = U+e is received and s,,(u + e) = s,,(e)
310 Chapter 12
ifw(a) ::::; m. These can be easily calculated when all syndromes are known. The
following version of the discrete Fourier transformation method gives an explicit
formula.
Proposition 12.16. Assume that all coordinates of the points Xi are non-zero. If
all syndromes s,,(e), 0::::; aj ::::; q - 2, 1 ::::;j::::; s, are known, then
q-2
ei=(-I)i L s"g-"(Xi)'
"1,···,"5=1
Theorem 12.17. The algorithm e(m) corrects t::::; l(dFR - 1)/2J errors. The
complexity of the algorithm is O(n 7/3).
The general problem of solving linear equations can be done faster than Gaus-
sian elimination. Its complexity can be reduced from O(n 3 ) to O(n 2 .38), where n
is the number of variables.
Problem IV. Is there a decoding algorithm which decodes all geometric Goppa
codes up to half the designed minimum distance with complexity O( n 2 ) for n -+ oo?
EXERCISES
uq+ l +v q+ l +wq+ l = o.
The curve X is isomorphic to the curve Y with affine equation
which has exactly one point xoo at infinity and n = q3 points Xl, ... ,Xn in the affine
plane. Show that:
(a) the semi-group of non-gaps at xoo is generated by q and q + 1;
(b) the ring Roo(xoo) of rational functions on the curve Y with only poles at xoo is
generated over Fq2 by rational functions gl, g2 such that VXoo (gl) = -q and
V xoo (g2) = -(q + 1), that is
u 3v+v 3 +u = O.
It has genus g = 3 and three rational points Zl = (1 : 0 : 0), Z2 = (0 : 1 : 0), and
Z3 = (0: 0: 1) over F 2, and 21 points Xl, ... ,X21 which are rational over Fs, but
not over F2. Let us consider the code C· = C*(Do,D), coming from X, where
Do =Xl + .. ,+x21 +Zl +z3 andD =m ·Z2. It has parameters [23,25-m, ~m-4ls,
for 4 < m < 23. The homogeneous equation of the Klein quartic is
and from this we readily see that the intersection divisor of the curve with the line
u = 0 is 3Z3 +z2, with the line v = 0 is 3Z1 +z3, and with the line w = 0 is 2Z2 +zl.
Letf = u/w and h = v /w. Prove that:
and
(d)
where
a = (uw+v2,u 4 +v+vw,u 3v+w+w3 );
(e) the effective divisors 3z 1 +z3, 3Z2 +ZI and 3Z3 +Z2 are canonical divisors on
X;
(f) (df) = 2z3 +4Z2 - 2z1 and therefore
and
h = t3 +alt2 +aotl +al + ....
Next, show that t = h 2(1 + ht 3 ) and deduce from this that
df = (t 4 + higher order terms )dt,
so (dx) = 2Z3 + 4Z2 - 2z1 + B with an effective divisor B. Finally, using the
equality deg(df) = 2g - 2 = 4, find that B = 0.);
(g) when the basic algorithm A(3 ·Z2) is applied to the code C*(Do, II ·Z2) it
corrects single errors (as well as three errors when the error-positions lie on
the line v + cw = 0). (Hint: Take D' = 4· Z2, so D - D' = 7· Z2, and show
that rational functions I, h form a basis for L (4 . Z2) = L (3 . Z2) and the rational
functions l,h,fh,h 2,f2h form a basis for L(7 ·Z2).)
(h) the number of decoding failures of the basic algorithm A(5· Z2) to decode
two errors is equal to 7 out of Ci) . 72, the number of all possible error-
vectors with two errors. (Hint: Let (YI ,Y2) be a couple of distinct points of
the points XI, ... ,X2I ,ZI,Z3 and Y = YI + Y2. Then L(5 ·Z2 - y) oF 0, and if
n(y - 6 .Z2) oF 0, then 6 ·Z2 '" 3 ·Z2 +zl and hence Y +ZI '" 3· Z2. Thus there
exists anon-zero rational functiong' E L(3 ·Z2) which is zero atzi. So g' = ch,
c E F g, and furthermore YI = Z3 and Y2 = ZI. Now the code
C(y,5 .Z2) = C(y,6· Z2) = C* (y, 6 .Z2)
is generated by (I, I), and therefore
L(v,5 ·Z2) = L(5 ·Z2 - y)
for a received vector v with Y as error-positions and error-vector e if and only
if
ex C(y,5 .Z2) n C*(y,6 ·Z2) = o.
By Proposition 12.6 this is equivalent to el oF e2.)
12.3. The code C*(Do,m ·Z2) coming from the Klein quartic X over Fs has designed
distance dc' = m - 4, and is therefore t = l (m - 5) /2 J-error-correcting, but since
(de- -g-I)/2 = (m -8)/2, the basic algorithmA(D') corrects 1-2 errors when
m == 1 mod(2) and t - 1 errors when m == 0 mod (2). The modified algorithm
corrects t - 1 errors by Theorem 12.7, since S(Z2) = 1. Show that the extended
modified algorithm corrects 1 - I errors when m == I mod(2), and t errors when
m == 0 mod(2). (Hint: Ifm == 1 mod (2), takeAo = 4 ·Z2, Al = 2 ·Z2, A2 = 0, and
let Ql = {Ao,AI,A3}, so <To(Ql) = 1. The correspondingdivisorsD: are Db = t·Z2,
D; = (t + 1) ·Z2, D~ = (I +2) ·Z2, and D~ = (/+3) ·Z2. If m == 0 mod(2), take
Al = z2,A2 = 3 ·Z2, and Ql = {AI ,A2}, so <TI = 1/2. The corresponding divisorsD:
are D; = (t + 1) ·Z2, D~ = (t + 2) . Z2, and D~ = (t + 3) ·Z2. Then use the result of
Theorem 12.8.)
Decoding Geometric Goppa Codes 313
12.4. Consider the code C*(Do,23 'Z2) coming from the Klein quartic X over F8. It has
dimension k* = 2 and designed minimum distance d* = 19, and therefore is 9-error-
correcting. Show that it is possible to choose an error-pattern of weight 9 where the
modified algorithm fails. (Hint: LetYJ,Y2, andY3 be the affine points on the quartic
X and the line u + 1 = 0, that iSYI = (1, nY2 = (1, ,2), andY3 = (1,,4), where ns
a primitive element ofFq such that ,4+,+ 1=0. LetY4 = (,6,,3),Y5 = (,6,,4),
Y6 = (,4, ,3), Y7 = (" ,6), Y8 = (,3, ,3), and Y9 = (,2, I). The latter six points
together with the points ZI and Z3 lie on the intersection of X and the quadric
u2 + ,5 uv + ,3vw + ,yv = O.
Put
9
y=LYi.
i=1
The rational function
Now there are nine known discrepancies and they are located at the entries (1, 11),
(2,8), (3,6), (4,5), (5,4), (6,3), (7,7), (8,2), and (11, 1). Thus there is exactly one
candidate at the entry (10,10), which gives as outcome that SI = 0.)
12.6. The Suzuki curve X is defined over Fs by the equation
v s +v = u2 (u S +u).
It has 64 rational pointsxl, ... ,X64 in the affine plane, one rational point x'" at infinity,
and genus g = 14. The homogeneous equation of the curve X is
Bounds
We have seen in Chapter 10 that the Goppa construction gives the following
inequality for the parameters of an [n, k, d]q -code C:
k'2n-d-(g-I).
Ifwe now apply this construction to modular curve Xo (N) over Fp2, from Theorem
9.11 we obtain:
Theorem 13.1 (the Tsfasman-Vladut-Zink theorem). There exists afamily of
geometric Gappa codes over Fq , q = p2, such that
It follows from Theorem 6.23 that the above result can not be improved in this
way.
Corollary 13.2. If q = p2 then
0'~n(8) '2 RAa(8) = 1- 8 - (y'q _1)-1.
Theorems 9.12,11.29, and 11.30 show that this result is also true for any q
that is an even power of a prime p.
315
316 Chapter 13
Theorem 13.3. The AG-bound RAG lies completely below the Gilbert-Varshamov
bound for q = p2v < 49. For q = p2v 2: 49 these bounds intersect, and RAG lies
above RGV on the interval (8 1 , 8z), where 81 and 82 are zeros of the equation
If it is above the AG-bound then the bounds do not intersect; otherwise they do
intersect, and in this case first coordinates of the intersection points are defined as
zeros of the equation
Hq (8)-8= ("fij_1)-I.
The tangent line is given by
and the equation Hq - 8 = (yiq - I) -1 has two zeros if and only if ( yiq - 1) -I <
logq (2q - I) - 1. It is easy to see that for q = p2v the last inequality holds if and
only if q 2: 49. •
Recall that the AG-bound does not depend on the construction of the geometric
Goppa codes we use. Using the L-constriction (or the O-construction) we can put
D = m ·Xo, where Xo E X(Fq), and construct codes of length n = IX(Fq)l- I
evaluating at the other Fq-rational points of X.
Note also (see Tsfasman and Vladut [208, Ch. 3.4] that the inequality
associated with a code (or a class of codes): construction, encoding and decoding.
Therefore we arrive at three corresponding questions concerning the complexity of
construction, encoding, and decoding of some classes of codes (see Aho, Hopcroft
and Ulman [2] for a more detailed treatment of the complexity theory). Let us
note at once that for linear codes their encoding procedure is trivial.
Let {C;} be a family oflinear [n;,k;,d;Jq-codes over a finite field Fq of steadily
growing length ni, and let G; be a generator matrix of C;. The family {C;} is
called polynomial (or having a polynomial structure complexity), if and only if
there exists an algorithm to construct matrices G; whose complexity is bounded by
a polynomial in n;. In general, a family { C;} of [n;, k;, d;]q-codes (not necessarily
linear) is said to be polynomial if and only if both the construction and encoding
algorithms for each C; are polynomial in n;.
We define families of [n;,k;,d;]q-codes having a polynomial decoding proce-
dure in a similar way. Let {Ci } be a family of codes equipped with algorithms
{Ai} of decoding up to l (d; - I) /2 J. Then we say {C;, A;} is a family having a
polynomial decoding complexity if and only if there exists a universal algorithm A
generating all the A;, where Ai is polynomial in ni, and the number of operations
needed to apply each Ai to a received vector is also polynomial as a function of ni.
Define U¥ol,lin as the set of those limit points (8,R) E v~in for which there
exists a polynomial in ni families of linear [ni, k i , d;]q -codes with d;/ ni --+ 8 and
k;/ni --+ R.
Theorem 13.4. There exists a continuous function ago1,lin (8) on the interval [0, I]
such that
U¥OI,lin( 8) = {( 8,R) 10 ::;
R ::; ago1,lin( 8)} .
It is fairly obvious that ago1,lin (8) ::; a~n (8) ::; a q (8). Unfortunately, we do
not know a specific polynomial upper bound. All the known upper bounds are
those for a q ( 8). Codes on the Gilbert-Varshamov bound are constructed by an
essentially non-polynomial method. On the other hand, concatenation gives the
following result.
Theorem 13.5 (the Zyablov bound).
ago1,lin(8) ~ Rz(8) = max {(I- 8/8')(I-Hq(8'))}.
5~JY$(q-I)/q
If now we use concatenation with outer codes over a fixed finite extension Fqk
we get (see [216]):
In particular,
a P01 ,lin(8) > max {~ . apo1,lin (!.!.. .8)}
q - C n qk d '
where the maximum is taken over all linear [n, k, dl q-codes C.
Geometric Goppa codes make it possible to sharpen the above lower bound
for ago1,lin(8) (see Manin and Vladut [120]).
Theorem 13.8 (the Vladut theorem). If q is an even power ofa prime p then
the maximum being taken overall linear [n,k,dlq-codes C such thatqk is a square.
Then
ago1,lin (8) :::: R8 n (8).
R8
Unfortunately, we do not know the precise value of n ( 8). The reason is, of
course, that we do not know the parameters of linear q-ary codes. However, each
code from the set described in the theorem gives a lower bound for ago1,lin (8) (see
[216]).
Bounds 319
Rlin(8)=max{l-v(q-I).8- 2v(q-I)},
v q q(qv/2 -I)
the maximum being taken over all integers v ~ I such that qV is a square. Then
Now we consider the question concerning asymptotic bounds for non-linear codes
having a polynomial decoding complexity. For the set of all polynomial families
of [n, k, d]q-codes (linear, or non-linear), let us introduce into consideration the
function a Pol (8) (its definition is quite similar to the definition of the function
agol,lin( 8)).
Ifnowwe apply alphabet extension to the boundRo( 8), we obtain the following
result for ago l ( 8) :
where the maximum is taken over all [n,k,d]p2v-codes C, and where the prime P
and an integer v 2: 1 are such that M = qk 2:p2v. Then
The family Cj corresponds to the point (2T,R) in the unit square [0, IF of the
(8,R)-plane. Then we can define
agol.dec,linu» 2: 1 - 8 - 2( vq _1)-1.
Moreover, we have the following results (see Skorobogatov and Vladut [180]):
the maximum being taken over all linear [n,k,d]q-codes C and over all (not only
linear) codes C', respectively, such that l is a square. Then
agol. dec ,lin(8) ;::: Rolin(8)
and
the maximum being taken over all integers v ;::: 1 such that q v is a square. Then
agol. dec ,lin(8);::: Rjlin(8).
EXERCISES
Show that:
(a) I-RBz(8)~~L8.lo~8 for 8~00;
R ~ I + 28 logq 8;
if q =p2v
if q =p2v+!
322 Chapter 13
13.7. Prove Theorem 13.13. (Hint: Show that agol(8) ;::: maxq':<:;q( agO\
8) logq q') under
the alphabet extension, and then apply the obtained result to the bound Ro( 8).)
13.8. Prove that:
(a)
agoJ.dec,lio(8);::: Rz(8) = max {(1- 8j8')(1-Hq (8'))};
1l:<:;Il':<:;(q- 1)/q
(b)
poJ.dec,Jio(8) > R (8) =R (8) _ 8 rRGv(IJ) dR
Cl'.q - HZ GV io 8(R) .
13.9. Prove Theorems 13.14 to 13.16.
Bibliography
[I] Aaltonen M. J., Notes on the asymptotic behavior of the infonnation rate of block codes, IEEE
Trans. Info. Theory, 1984, IT-30, p. 84-85.
[2] Aho A., Hopcroft J., Ulman 1., The Design and Analysis o/Computing Algorithms, Addison-
Wesley, Reading, MA, 1974.
[3] Artin E., Quadratische Korper im Gebiete der hohern Kongruenzen I, 2, Math. Zeitschr., 1924,
19, p. 153-246.
[4] Artin E., Algebraic Numbers and Algebraic Functions, Gordon and Breach, New York, 1967.
[6] Atkin A. O. L., Weierstrass points of cusps offo(N), Ann. Math., 1967,85, no. 1, p. 42-45.
[7] Aubry Y., Perret M., A Weil theorem for singular curves, Proc. 0/ Arithmetic, Geometry and
Coding Theory IV, De Gmyter, 1995.
[8] Aubry Y., Perret M., Coverings of singular curves over finite fields, Manuscripta Math., 1995,
88, p. 467-478.
[9] Barg A. M., Exponential sums and constrained error-correcting codes, Lect. Notes in Compo
Science, 573, Springer-Verlag, Berlin, 1991, p. 16-22.
[10] Barg A. M., Some new NP-complete coding problems, Probl. In/o. Trans., 1994,30, no. 2, p.
23-28.
[II] Barg A. M., Katsman G. L., Tsfasman M. A., Algebraic-geometric codes on curves of small
genus, Probl.lnfo. Trans., 1987,23, p. 34-38.
[12] Bassalygo L. A., Zinoviev V. A., Litsyn S. N., A lower estimate of complete trigonometrical
sums in tenns of multiple sums, Soviet Math. Doki., 1988,37, p. 756-759.
323
324 Bibliography
[13] Berlekamp E. R., Algebraic Coding Theory, McGraw-Hill, New York, 1968.
[14] Berlekamp E. R., (Editor) Key Papers in the Development ofCoding Theory, IEEE Press, New
York, 1974.
[15] Beth T., Some aspects of coding theory between probability, algebra, combinatorics and
complexity theory, Combinatorial Theory, Lecture Notes in Math., 969, Springer-Verlag, New
York, 1982, p.12-29.
[16] Blahut R. E., Theory and Practice of Error Control Codes, Addison-Wesley, Reading, MA,
1983.
[17] Bombieri E., Counting points on curves over finite fields [d'apres S. A. Stepanov], 8em.
Bourbaki, 25 erne annee, 1972/73,430, p. 1-8; Lecture Notes in Math., 383, Springer-Verlag,
New York, 1974, p. 234-241.
[18] Carlitz L., Uchiyama S., Bounds for exponential sums, Duke Math. J., 1957,24, p. 179-193.
[19] Chebotarev N. G., The Theory ofAlgebraic Functions, M.-L., 1948 (in Russian).
[20] Chevalley C., Introduction to the Theory ofAlgebraic Functions ofOne Variable, Math. Surv.,
6, AMS, New York, 1951.
[21] Conway 1. H., Sloane N. J. A., Sphere Packings, Lattices and Groups, Springer-Verlag, New
York,1988.
[22] Deligne P., Cohomologie Etale, Lecture Notes in Math., 569, Springer-Verlag, New York,
1977.
[23] Deligne P., Husemoller D., Drinfeld modular curves, Contemp. Math., 1987,67, p. 25-91.
[24] Deuring M., Lectures on the Theory ofAlgebraic Functions ofOne Variable, Lecture Notes in
Math., 314, Springer-Verlag, New York, 1973.
[25] Driencourt Y., Some Properties ofElliptic Codes over a Field of Characteristic 2, Lec!. Notes
in Compo Science, 229, 1985.
[26] Driencourt Y., Michon J. E, Remarques sur les codes geomeetriques, c. R. Acad. Sci. Paris,
Ser. 1,1985,301, p. 15-17.
[27] Driencourt Y., Michon J. E, Rapport sur les codes geometriques, Preprint, 1986.
[28] Driencourt Y., Michon 1. E, Elliptic curves over field of characteristic 2, 3, Pure Appl. Algebra,
1987,45, p. 15-39.
[29] Driencourt Y., Stichtenoth H., A criterion for self-duality of codes, Commun. Algebra, 1989,
17, no. 4, p. 885-898.
[30] Drinfeld V. G., Elliptic modules 1,2, Math. USSR Sbornik, 1974, 23, no. 4, p. 561-592; 1977,
31,no.2, p. 159-170.
[31] Duursma I. M., Algebraic decoding using special divisors, IEEE Trans. Info. Theory. 1993,
1T-39, p. 694-698.
[32] Duursma I. M., Majority coset decoding, IEEE Trans. Info. Theory, 1993, 1T-39, p. 1067-1070.
Bibliography 325
[33] Duursma I. M., Decoding codes from curves and cyclic codes, Ph. D. Dissertation, Eindhoven
Univ. Techn., The Netherlands, 1993.
[34] Ehrhard D., Uber das Dekodieren algebraisch-geometrischer codes, Ph. D. Dissertation, Uni-
versit at Dusseldorf, Germany, 1991.
[35] Ehrhard D., Decoding algebraic-geometric codes by solving a key equation, Proc. AGCT-3,
Luminy 1991, Lect. Notes in Math., 1518, Springer-Verlag, New York, 1992, p. 18-25.
[36] Ehrhard D., Achieving the designed error capacity in decoding algebraic-geometric codes,
IEEE Trans. Info. Theory, 1993, 1T-39. p. 743-751.
[37] Eichler M., Quaternare quadratische Formen und die Riemannsche Vermutung fUr die Kon-
gruenzzetafunktion, Arch. Math., 1954,5, p. 355-366.
[38] Eichler M., Introduction to the Theory ofAlgebraic Numbers and Functions, Academic Press,
New York, 1966.
[39] Feng G. 1., Rao T. R. N., Decoding algebraic-geometric codes up to the designed minimum
distance, IEEE Trans. Info. Theory, 1993, IT-39, p. 37-45.
[40] Feng G. 1., Rao T. R. N., A simple approach for construction of algebraic-geometric codes
from affine plane curves, IEEE Trans. Info. Theory, 1994, IT-40, p. 1003-1012.
[41] Feng G. 1., Rao T. R. N., Improved geometric Goppa code-Part I: Basic Theory, IEEE Trans.
Info. Theory, 1995,41,no.6,p. 1678-1693.
[42] Frey G., Perret M., Stichtenoth H., On the different of abelian extensions of global fields,
Coding Theory and Algebraic Geometry, Lect. Notes in Math., 1518, Springer-Verlag, Berlin,
1992, p. 2~32.
[43] Fried M. D., Jarden M., Field Arithmetic, Springer-Verlag, New York, 1986.
[44] Forney G. D. Jr., Concatenated Codes, MIT Press, Cambridge, MA, 1966.
[45] Fuhrmann R., Torres F., The genus of curves over finite fields with many rational points,
Manuscripta Math., 1996,89, p. 103-106.
[46] Fuhrmann R., Torres F., On curves over finite fields with many rational points, Preprint, 1996.
[47] Fulton w., Plane Algebraic Curves, W. A. Benjamin, New York, 1969.
[48] Garcia A., The curveyn =f(x) over finite fields, Arch. Math., 1990,54, p. 3~.
[49] Garcia A., On Goppa codes and Artin--Schreier extensions, Preprint, 1991.
[50] Garcia A., Kim S. J., Lax R. F., Consecutive Weierstrass gaps and minimum distance ofGoppa
codes, J. Pure and Appl. Algebra, to appear.
[51] Garcia A., Lax R. F., Goppa codes and Weierstrass points, Coding Theory and Algebraic
Geometry, Lect. Notes in Math., 1518, Springer-Verlag, Berlin, 1992, p. 33-42.
[52] Garcia A., Stichtenoth H., Elementary abelian p-extensions of algebraic function fields,
Manuscripta Math., 1991,72, p. 67-79.
[53] Garcia A., Stichtenoth H., A tower of Artin--Schreier extensions of function fields attained the
Drinfe1d-Vladut bound, Invent. Math., 1995, 121, p. 211-222.
326 Bibliography
[54] Garcia A., Stichtenoth H., Algebraic function fields over finite fields with many rational places,
IEEE Trans. Info. Theory, 1995, IT-41, no. 6, p. 1548-1562.
[55] Garcia A., Stichtenoth H., On towers and composita of towers of function fields over finite
fields, Finite Fields and their Appl., to appear.
[56] Garcia A., Stichtenoth H., On the asymptotic behavior of some towers of function fields over
finite fields, J. Number Theory, to appear.
[57] Garcia A., Voloch J. E, Fermat curves over finite fields, J. Number Theory, 1988, 30, p.
345-356.
[58] van der Geer G., van der Vlugt M., Artin-Schreier curves and codes, J. Algebra, 1991,139, p.
256-272.
[59] van der Geer G., van der Vlugt M., Curves over finite fields of characteristic 2 with many
rational points, C. R. Acad. Sci. Paris, Ser. I, 1993,317, p. 593-597.
[60] van der Geer G., van der Vlugt M., Fibre products of Artin-Schreier curves and generalized
Hamming weights of codes, J. Comb. Theory A, 1995, 70, no. 2, p. 337-348.
[61] van der Geer G., van der Vlugt M., How to construct curves over finite fields with many points,
Preprint, 1996.
[62] Gluhov M. M., On lower bounds for character sums over finite fields, Diskret. Mat., 1994,6,
no. 3, p. 136-142.
[63] Gluhov M. M., Ozbudak E, Codes on superelJiptic curves, Turkish Journ. ofMath., to appear.
[64] Goppa V. G., Codes on algebraic curves, Soviet Math. Dokl., 1981,24, p. 170-172.
[65] Goppa V. G., Codes and information, Russ. Math. Surveys, 1984,39, no. 1, p. 87-141.
[66] Goppa V. G., Geometry and Codes, Kluwer Acad. Pub!., Dordrecht, 1988.
[67] Griffiths P. A., Harris J., Principles ofAlgebraic Geometry, Wiley, New York, 1978.
[68] Grothendieck A. (with Dieudonne J.), Elements de Geometrie Algebrique, Pub!. Math. IHES,
4(1960),8,11(1961), 17(1963),20(1964),24(1965),28(1966),32(1967).
[69] Gunning R. C., Lectures on Modular Forms, Ann. Math. Studies, 48, Princeton Univ. Press,
Princeton, NJ, 1962.
[70] Hansen J. P., Codes on the Klein quartic, ideals and decoding, IEEE Trans. lrifo. Theory, 1987,
IT-33, p. 923--925.
[71] Hansen J. P., Group Codes on Algebraic Curves, Mathematica Gottingensis, Heft 9, 1987.
[72] Hansen J. P., Delign~Lusztig varieties and group codes, Coding Theory and Algebraic Ge-
ometry. Lect. Notes in Math., 1518, Springer-Verlag, Berlin, 1992, p. 63-81.
[74] Hasse H., Zur Theorie der abstracten elJiptischen Funktionenkorper I - -3, J. reine angew.
Math. 1936,177, p. 55-62; 69--88; 193-208.
Bibliography 327
[75] Helleseth T., On the covering radius of cyclic linear codes and arithmetic codes, Discr. Appl.
Math., 1985,11, p. 157-173.
[76] Hijikata H., Explicit formula for the traces of the Hecke operators for fo(N), J. Math. Soc.
Japan, 1974,26,p.56-80.
[77] Hirschfeld J. W. P., Projective Geometries over Finite Fields, Oxford Univ. Press, Oxford, U.
K.,1979.
[78] Hirschfeld J. W. P., Linear codes and algebraic curves, Geometric Combinatorics, Pitman,
Boston, 1984, p. 35-53.
[79] Hirschfeld J. W. P., Tsfasman M. A., Vladut S. G., The weight hierarchy of higher-dimensional
Hermitian codes, IEEE Trans. Info. Theory, 1994, 1T-40, p. 275-278.
[80] Hoholdt T., and Pellikaan R., On the Decoding of Algebraic-Geometric Codes, IEEE Trans.
Info. Theory, 1995,41, no. 6, p. 1589--1614.
[82] 19usa J., Kroneckerian model of fields of elliptic modular functions, Amer. J. Math., 1959,81,
p.561-577.
[84] Ihara Y., Hecke polynomials as congruence' functions in elliptic modular case, Ann. Math.,
1967,85,p.267-295.
[85] Ihara Y., Some remarks on the number of rational points of algebraic curves over finite fields,
J. Fac. Sci. Tokyo, Ser. 1 A, 1981,28, p. 721-724.
[86] Janwa H., Some optimal codes from algebraic geometry and their covering radii, Europe. J.
Combinatorics, 1990, 11, p. 249--266.
[87] Justesen J., Larsen K. J., Jensen H. E., Havemose A., Hoholdt T., Construction and decoding
ofa class of algebraic geometry codes, IEEE Trans. Info. Theory, 1989, IT-35, p. 811-821.
[88] Kabatyanski G. A., Levenshtein V. I., Bounds for packing on a sphere and in a space, Probl.
Info. Trans., 1978, 14, no. 1, p. 1-17.
[89] Kamiya N., Miura S., On a fast decoding algorithm for geometric Goppa codes defined on
certain algebraic curves with at most one higher cusp, IEEE Intern. Symp. on Info. Theory,
San Antonio, TX, 1993.
[90] Katsman G. L., Tsfasman M. A., Spectra of algebraic-geometric codes, Probl. Info. Trans.,
1987,23,p.262-275.
[91] Katsman G. L., Tsfasman M. A., A remark on algebraic-geometric codes, Contemp. Math.,
1989,93, p. 197-199.
[92] Katz N., Sommes Exponentielles, Asterisque 79, Soc. Math. de France, Paris, 1980.
[93] Katz N., Gauss Sums, Kloosterman Sums, and Monodromy Groups, Princeton Univ. Press, NJ,
1988.
[94] Katz N., Mazur B., Arithmetic moduli of elliptic curves, Ann. Math. Studies, Princeton, 1985.
328 Bibliography
[95] Kirfel C., Pellikaan R.; The minimum distance of codes in array coming from telescopic semi-
groups, The fourth Workshop on Arithmetic Geometry and Coding Theory, Luminy, France,
1993; IEEE Trans. Info. Theory, 1995,41, no. 6, p. 1720-1732.
[96] Koblitz N., Introduction to Elliptic Curves and Modular Forms, Springer-Verlag, New York,
1984.
[97] Kodama T., Washio T., A family of hyperelliptic function fields with Hasse-Witt invariant
zero, J. Number Theory, 1990, 36, no. 2, p. 187-200.
[98] Kumar P. v., Yang K., On the true minimum distance of Hermitian codes, Coding Theory and
Algebraic Geometry, Lect. Notes in Math., 1518, Springer-Verlag, Berlin, 1992, p. 99-107.
[99] Lachaud G., Les codes geometriques de Goppa, Sem. Bourbaki, 1985, no. 641, p. 1-19;
Asterisque, 1986, 133-134, p. I 89-207.
[100] Lachaud G., Sommes d'Eisenstein et nombre de points de certaines courbes algebriques sur
les corps fini, C. R. Acad. Sci. Paris, Ser. I, 1987,305, p. 729-732.
[101] Lachaud G., Exponential sums and the Carlitz-Uchiyama bound, Lecture Notes in Camp.
Science, 1989,338, p. 63-75.
[102] Lachaud G., The parameters of projective Reed-Muller codes, Discr. Math., 1990, 81, p.
217-220.
[103] Lachaud G., Artin-Schreier curves, exponential sums, and the Carlitz-Uchiyama bound for
geometric codes, J. Number Theory, 1991, 39, p. 18-40.
[104] Lachaud G., Number of points of plane sections and linear codes defined on algebraic varieties,
Preprint, 1994.
[105] Lachaud G., Wolfmann J., Sommes de Kloosterman, courbes elliptiques et codes cycliques en
caracteristique 2, C. R. Acad. Sci. Paris, Ser. I, 1987,305, p. 881--883.
[106] Lachaud G., Wolfmann J., The weights of the orthogonals of the extended quadratic binary
Goppa codes, IEEE Trans. Info. Theory, 1990, IT-36, no. 3, p. 686-692.
[107] Lang S., Introduction to Modular Forms, Springer-Verlag, New York, 1976.
[108] Lang S., Introduction to Algebraic and Abelian Functions, Graduate Text in Math., 89,
Springer-Verlag, New York, 1982.
[110] Lang S., Algebra, 3-rd edition, Addison-Wesley, Reading, MA, 1993.
[III] Lang S., Weil A., Number of points of varieties in finite fields, Amer. J. Math., 1954, 4, p.
819-827.
[112] Levenshtein V. r., Bounds for packing in metric spaces and certain applications, Probl. Kiber-
netiki, 1983,40, p. 40-1 10.
[113] Levenshtein V. 'r., Krawtchouk polynomials and universal bounds for codes and designs in
Hanuning spaces, IEEE Trans. Info. Theory, 1995,41, no. 5, p. 1303-1321.
[114] Lidl R., Niederreiter H., Finite Fields, Addison-Wesley, Reading, MA, 1993.
Bibliography 329
[115] van Lint 1. H., Introduction to Coding Theory, Grad. Text in Math., 86, Springer-Verlag, New
York,1982.
[116] van Lint J. H., van der Geer G., Introduction to Coding Theory and Algebraic Geometry,
Birkhiiuser, Basel, 1988.
[117] van Lint 1. H., Springer T. A., Generalized Reed-Solomon codes from algebraic geometry,
IEEE Trans. Info. Theory, 1987, IT-33, p. 30>-309.
[118] MacWilliams F. J., Sloane N. J. A., The Theory of Error-Correcting Codes, North-Holland,
Amsterdam, 1977.
[119] Manin Yu. I., What is the maximum number of points on a curve over F2?, J. Fac. Sci. Tokyo,
Ser. lA, 1982,28, no. 3, p. 71>-720.
[120] Manin Y. I., Vladut S. G., Linear codes and modular curves, J. Soviet. Math., 1985, 30, p.
2611-2643.
[122] McElice R. J., The Theory of Information and Coding, Encyclopedia of Math. and its Appl.,
v. 3, Addison-Wesley, Reading, MA, 1977.
[123] McElice R. J., Finite Fieldsfor Computer Scientists and Engineers, Kluwer, Boston, 1987.
[124] Michon 1. F., Codes de Goppa, Sem. Theorie Nombres, Bordeaux, 1983/84,7, p.l-17.
[125] Michon 1. F., Les Codes BCH comme codes geometriques, Preprint, 1985.
[126] Michon 1. F., Amelioration des parameters des codes de Goppa, Preprint, 1986.
[128] Moreno C. J., Goppa codes and modular curves, Preprint, 1985.
[129] Moreno C. J., Algebraic Curves over Finite Field, Cambridge Univ. Press, 1991
[130] Moreno C. J, Moreno 0., Exponential sums and Goppa codes 1,2,3,4, Preprints, 1988-89.
[131] Moreno C. J, Moreno 0., Exponential sums and Goppacodes I, Proc. Amer. Math. Soc., 1991,
111, p. 523-531; 2, IEEE Trans. Info. Theory, 1992, IT-38, p. 1222-1229.
[132] Moreno C. J, Moreno 0., An improved Bombieri-Weil bound in characteristic two and appli-
cations to coding theory, J. Number Theory, 1992, 42, p. 32-46.
[133] Moreno 0., Counting traces of powers over GF(2m), Congr. Numer., 1980,29, p. 673--680.
[134] Moreno 0., Kumar P. V., Minimum distance bounds for cyclic codes and Deligne's theorem,
IEEE Trans. Info. Theory, to appear.
[135] Ogg A., Hyperelliptic modular curves, Bull. Soc. Math. France, 1974, 102, p. 449-462.
[136] Ozbudak F., On lower bounds for incomplete character sums over finite fields, Finite Fields
and their Appl., 1996,2, p. 173-191.
[137] Ozbudak F., Codes on fibre products of some Kummer coverings, Preprint, 1996.
330 Bibliography
[139] Pellikaan R, On a decoding algorithm for codes on maximal curves, IEEE Trans. Info. Theory,
1989,IT-35,p.1228-1232.
[140] Pellikaan R, On the gonality of curves, abundant codes and decoding, Coding Theory and
Algebraic Geometry, Lect. Notes in Math., 1518, Springer-Verlag, Berlin, 1992, p. 132-144.
[141] Pellikaan R., On the efficient decoding of algebraic-geometric codes, Proc. Eurocode 92,
CISM Courses and Lectures, 339, Springer-Verlag, New York, 1993, p. 231-253.
[142] Perret M., Sur Ie nombre de points d'une courbe sur un corps fini; application aux codes
correcteurd'erreurs, C. R. Acad. Sci. Paris, Ser. 1,1989,309, p. 177-182.
II43] Perret M., Multiplicative character sums and nonlinear geometric codes, Lect. Notes in Camp.
Science, 514, Springer-Verlag, Berlin, 1991, p. 158-165.
[144] Perret M., Tours ramifiees de corps de classes, J. Number Theory, 1991,38, p. 300-322.
[145] Peterson W. w., Weldon E. J., Error-Correcting Codes, MIT Press, Cambridge, MA, 1972.
[146] Porter S. C., Ph. D. Dissertation, Yale University, New Haven, CT, 1988.
[147] Porter S. C., Shen B.-Z., Pellikaan R., Decoding geometric Goppa codes using extra place,
IEEE Trans. Info. Theory, 1992, IT-38, p. 1663--1676.
[148] Quebbemann H. G., Cyclotomic Goppa codes, IEEE Trans. Info. Theory, 1988, IT-34, no. 5,
p. 1317-1320.
[150] Rodier E, Minoration de certaines sommes exponentialles binaries, Lect. Notes in Math.,
1518, Springer-Verlag, Berlin, 1992, p. 199--210.
[152] Roland R, On hypersurfaces over a finite field and the parameters of the projective Reed-
Muller codes, Preprint, 1990.
[153] Roquette P., Abschiitzung der Automorphismezahl von Funktionenkorpem bei primzahl Char-
acteristic, Math. Zeitschr., 1970, 117, p. 157-163.
[154] Riick H. G., On Goppa codes defined by Kummer and Artin--Schreier extensions, J. Pure and
Appl. Algebra, 1990,64, p. 163--169.
[155] Riick H. G., Stichtenoth H., A characterization of Hermitian function fields over finite fields,
J. reine angew. Math., to appear.
[156] Sakata S., Jensen H. E., Hoholdt T., Generalized Berlekamp-Massey decoding of algebraic-
geometric codes up to half the Feng-Rao bound, Trans. Info. Theory, 1995, 41, no. 6, p.
1762-1768.
[157] Sakata S., Justesen J., Madelung Y., Jensen H. E., and Hoholdt T., A fast decoding method of
AG codes from Miura-Kamiya curves Cab Up to half the Feng-Rao bound, Finite Fields and
Their Appl., 1995,1, p. 83--101.
Bibliography 331
[158] Scharlau W., Selbstduale Goppa Codes, Math. Nachr., 1989,143, p. 119-122.
[159] Schmidt W. M., Equations over Finite Fields, Lecture Notes in Math., 536, Springer-Verlag,
New York, 1976.
[160] Schoeneberg B., Uber die Weierstrasspunkte in der Korpern der elliptischen Modulfunktionen,
Abh. Math. Sem. Univ. Hamburg, 1951,17, p. 104-111.
[161] SchoofR, Nonsingular curves over finite fields, J. Combin. Theory, Ser. A, 1987,46, no. 2, p.
183-211.
[162] SchoofR, Algebraic curves and coding theory, UTM, 336, Univ. ofTrento, 1990.
[163] SchoofR, Algebraic curves over F2 with many rational points, J. Number Theory, 1992,41,
p.6-14.
[164] SchoofR, van der Vlugt M., Hecke operators and the weight distribution of certain codes, J.
Combin. Theory, Ser. A, 1991,57, p. 163-186.
[165] Serre J. P., Groups Algebriques et Corps de Classes, Hermann, Paris, 1959.
[166] Serre J. P., Majoration de sommes exponentielles, Astmsque 41-42,Soc. Math. France, Paris,
1977.
[168] Serre J. P., Nombre de points de courbes algebriques sur Fq , 8em. Theorie Nombres Bordeux,
1982-83,22,p.I-8.
[169] Serre J. P., Sur Ie nombre des points rationnels d'une courbe algebrique sur un corps fini, C.
R. Acad. Sci. Paris, Ser. I, 1983, 296, p. 397-402.
[\70] SerreJ. P., Resume des cours de 1983-1984, Annuaire du College de France, 1984, p. 79-83.
[171] Serre J. P., Rational points on curves over finite fields, Lectures given at Harvard University,
Sept.-Dec. 1985.
[173] Shen B.-Z., Tzeng K. K., Decoding geometric Goppa codes up to designed minimum distance
by solving a key equation in a ring,lEEE Trans. Info. Theory, 1995,41, no. 6, p. 1709-1719.
[174] Shen B.-Z., Tzeng K. K., Generation of matrices for determining minimum distance and
decoding of algebraic-geometric codes, IEEE Trans. Info. Theory, 1995,41, no. 6, p. 1703-
1708.
[175] Shimura G., Correspondances modulaires et les fonctions , de courbes algebriques, J. Math.
Soc. Japan, 1958, 10, p. 1-28.
[176] Shimura G., Introduction to the Arithmetic Theory of Automorphic Functions, Pub\. Math.
Soc. Japan, Princeton Univ. Press, 1971.
[177] Silverman J. H., The Arithmetic of Elliptic Curves, Graduate Texts in Math., 106, Springer-
Verlag, 1986.
[178] Silverman J. H., Tate J., Rational Points on Elliptic Curves, Springer-Verlag, New York, 1992.
332 Bibliography
[179] Skorobogatov A. N., The parameters of subfield subcodes of algebraic-geometric codes, Discr.
Appl. Math., 1991,33, p. 205-214.
[180] Skorobogatov A. N., Vladut S. G., On the decoding of algebraic-geometric codes, IEEE Trans.
Info. Theory, 1990, IT-36, no. 5, p. 1051-1060.
[l81] Sloane N. J. A., Sphere packing constructed from BCH and Justesen codes, Mathematika,
1972,19, p. 183-190.
[182] Sorensen A. B., Projective Ree~Muller codes, IEEE Trans. Info. Theory, 1991, IT-37, p.
1567-1576.
[183] Springer G., Introduction to the Theory ofRiemann Surfaces, Addison-Wesley, Reading, MA,
1957.
[184] Stepanov S. A., The number of points ofa hyperelliptic curve over a finite prime field, Math.
USSR Izv., 1969,3, p. 1103-1119.
[185] Stepanov S. A., Congruences in two unknowns, Math. USSR Izv., 1972,6, no. 1, p. 677-709.
[186] Stepanov S. A., On lower bounds of character sums over finite fields, Discr. Math., 1991,3,
no. 2,p. 77---S6 (in Russian); Discr. Math. Appl., 1992,2, no. 5, p. 523-532.
[187] Stepanov S. A., Arithmetic of Algebraic Curves, Plenum, New York, 1994.
[188] Stepanov S. A., Character sums and coding theory, Finite Fields and Applications. London
Math. Soc. Lect. Note Series, 233, Cambro Vniv. Press, Cambridge, 1996, p. 355-376.
[189] Stepanov S. A., Codes on fibre products of hyperelliptic curves, Diskret. Mat., 1997,9, no. 1,
p. 83-94; Discr. Math. Appl., 1997,7, no. 1, p. 77---S8.
[190] Stepanov S. A., Character sums, algebraic curves and Goppa codes, Algebraic Geometry, Lect.
Notes in Pure and Appl. Math., Ser. 193, Marcel Dekker, New York, 1997, p. 313- 345.
[191] Stepanov S. A., Ozbudak F., Fibre products of hyperelliptic curves and geometric Goppa codes,
Discr. Math., 1997,7, no. 3, p. 223-229.
[192] Stepanov S. A., Ozbudak F., Fibre products of superelliptic curves and codes therefrom, Proc.
1997 IEEE Intern. Symp. on Info. Theory, Vim, Germany, 1997, p. 413.
[193] Stichtenoth H., Self-dual Goppa codes, J. Pure Appl. Algebra, 1988,55, p. 199-211.
[194] Stichtenoth H., A note on Hermitian codes over GF(q2), IEEE Trans. Info. Theory, 1988,
IT-34, no. 5, p. 1345-1348.
[195] Stichtenoth H., On automorphism of geometric Goppa codes, J. Algebra, 1990,130, no. 1, p.
113-l2l.
[196] Stichtenoth H., Algebraic-geometric codes associated to Artin- Schreier extensions of Fq[z],
Proc. of the second Intern. Workshop on Algebraic Geometry and Combinatorial Coding
Theory, Leningrad, 1990, p. 203-206.
[197] Stichtenoth H., Algebraic Function Fields and Codes, Springer-Verlag, Berlin, 1993.
[198] Stichtenoth H., Algebraic geometric codes, Proceedings ofSymposia in Applied Mathematics,
50, AMS, New York, 1995, p.139-152.
Bibliography 333
[199] Stichtenoth H., Xing C. P., On the structure of the divisor class group of a class of curves over
finite fields, Arch. Math., 1995, 65, p. 141-150.
[200] Stohr K. 0., Voloch J. F., Weierstrass points and curves over finite fields, Proc. London Math.
Soc. (3), 1986, 52, p. 1-19.
[201] Tate J., The arithmetic of elliptic curves, Invent. Math., 1974,23, p. 179--206.
[202] Tate J., Endomorphisms of abelian varieties over finite fields, Invent. Math., 1996, 2, p.
134-144.
[203] Tiersma H. J., Remarks on codes from Hermitian curves, IEEE Trans. Info. Theory, 1987,
IT-33,p.605-609.
[204] Tsfasman M. A., On Goppa codes which are better than the Varshamov-Gi.!bert bound, Probl.
Info. Trans., 1982, 18, p. 163--166.
[205] Tsfasman M. A., Group of points of an elliptic curve over a finite field, Preprint, 1985.
[206] Tsfasman M. A., Algebraic-geometric codes and asymptotic problems, Discr. Appl. Math.,
1991,33, p. 241-256.
[207] Tsfasman M. A., Global fields, codes and sphere packings, Asterisques, 1991, 198-200, p.
373--396.
[208] Tsfasman M. A., Vladut S. G., Algebraic-Geometric Codes, Kluwer Acad. Pub!., Dordrecht,
1991.
[209] Tsfasman M. A., Vladut S. G., Geometric approach to higher weights, IEEE Trans. Info.
Theory, 1995,4I,no.6,p. 1565-1588.
[210] Tsfasman M. A., Vladut S. G., Zink T., Modular curves, Shimura curves and Goppa codes,
better than the Varshamov-Gilbert bound, Math. Nachr., 1982, 109, p. 21-28.
[211] Vladut S. G., On the polynomiality of codes on classical modular curves, Preprint, 1983.
[212] Vladut S. G., An exhaustion bound for algebraic-geometric "modular" curves, Probl. Info.
Trans., 1987,23, p. 23--43.
[213] Vladut S. G., Algebraic-geometric "modular" codes as group codes, Preprint, 1989.
[214] Vladut S. G., On the decoding of algebraic-geometric codes over Fq for q ~ 16, 1990, IEEE
Trans. Info. Theory, IT-36, no. 6, p. 1461-1463.
[215] Vladut S. G., Drinfeld V. G., Number of points of algebraic curves, Func. Anal., 1983,17, no.
1, p. 68-69.
[216] Vladut S. G., Katsman G. 1., Tsfasman M. A., Modular curves and codes with polynomial
construction complexity, Probl. Info. Trans., 1984,20, p. 35-42.
[218] Voloch J. F., A note on elliptic curves over finite fields, Bull. Soc. Math. France, 1988, 116, p.
455-458.
[219] Voss C., On the weights of trace codes, Coding Theory and Algebraic Geometry, Lect. Notes
in Math., 1518, Springer-Verlag, Berlin, 1992, p. 193--198.
334 Bibliography
[220] Voss C., Hoholdt T., A family ofKummer extensions ofthe Hennitian function fields, Commun.
Algebru, 1995,23,no.4,p. 1551-1566.
[221] Voss C., Stichtenoth H., Asymptotically good families of subfield subcodes of geometric
Goppa codes, Geometriae Dedicata, 1990,33, p. 111-116.
[222] Waterhouse W. C., Abelian varieties over finite fields, Ann. Sci. E. N. S. (4), 1969, 2, p.
521-560.
[223] Weil A., On some exponential sums, Proc. Nat. Acad. Sci. USA, 1948,34, p. 204-207.
[224] Weil A., Sur les Courbes Algebriques et les Varietes qui s •en Dtiduisent, Hennann, Paris, 1948.
[225] Weil A., Varietes Abeliennes et Courbes Aigebriques, Hermann, Paris, 1948.
[226] Weil A., Number of solutions of equations in finite fields, Bull. Amer. Math. Soc., 1949,55, p.
497-508.
[227] Wirtz M., On the parameters of Goppa codes, IEEE Trans. Info. Theory, 1988, IT-34, no. 5, p.
1341-1343.
[228] Wolfinann J., Nombre de points rationnels de courbes algebriques sur des corps finis associees
Ii des codes cycliques, C. R. Acad. Sci. Paris, Ser. 1,1987,305, p. 345-348.
[229] Wolfinann J., The weights of the dual code to the Melas code over GF(3), Discr. Math., 1989,
74, p. 327-329.
[230] Wolfinann J., New bounds on cyclic codes from algebraic curves, Lecture Notes in Compo
Science, 388, Springer-Verlag, Berlin, 1989, p. 47-62.
[231] Wolfinann J., The number of points of certain algebraic curves over finite fields, Commun.
Algebra, 1989,17, p. 2055-2066.
[232] Wolfinann J., The number of solutions of certain diagonal equations over finite fields, J.
Number Theory, 1992,42, p. 247-257.
[233] Xing C. P., Multiple Kummer extensions and the number of prime divisors of degree one in
function fields, J. Pure and Appl. Algebra, 1993, 84, p. 85-93.
[234] Xing C. P., On automorphisms groups of the Hermitian codes, IEEE Trans. Info. Theory, 1995,
IT-41,no.6,p.1629-1635.
[235] Xing C. P., Stichtenoth H., The genus of maximal function fields over finite fields, Manuscripta
Math., 1995, 86, p. 217-224.
[236] Yang K., Kumar P. V., Stichtenoth H., On the weight hierarchy of geometric Goppa codes,
IEEE Trans. Info. Theory, 1994, IT-40, p. 913--920.
[237] Zink T., Degeneration of ShimuTa surfaces and a problem in coding theory, Lecture Notes in
Compo Science, 199, Springer-Verlag, Berlin, 1996, p. 503-511
[238] Zinoviev V. A., Ericson T., On concatenated constant weight codes ameliorating the
Varshamov--Gilbert bound, Prahl. Info. Truns, 1987,23, no. I, p. 110-111.
[239] Zinoviev V. A., Litsyn S. N., Codes that exceed the Gilbert bound, Prabl. Info. Trans., 1985,
21,no. l,p. 105-108.
List of Notations
General Notations
AcB proper subset {A =1= B}
AyB injective map
o empty set
IMI cardinality of a set
f·g composition of maps
Imcp image of a map
Kercp kernel of a map
Z ring of integers
N set of positive integers
Q field of rational numbers
~ field of real numbers
c field of complex numbers
min m divides n in the ring Z
min m does not divide n in the ring Z
Fq finite field with q elements
Fq[u] ring of polynomials in U over Fq
I(v) number of monic irreducible polynomials in
Fq[u] of degree v
{(s) zeta-function of Fq[u]
J.L(n) Mobius function
cp(n) Euler phi-function
R(x,e i ) Lagrange-Hilbert resolvent
ZjnZ residue ring modulo n
335
336 List of Notations
Coding Theory
C code
M=ICI cardinality of a code
k = logq ICI log-cardinality of a code
[n,k,d)q parameters of a code
d(x,y) Hamming distance
d minimum distance of a code
d.l minimum distance of a dual code
R=k/n information rate
8=d/n relative minimum distance
Ilxll Hamming weight of a vector
Bt(x) ball of radius t centered at x
G generator matrix
H parity-check matrix
C.l dual code
Wc(u: v) weight enumerator
Wc(u), Wc(v) non-homogeneous weight enumerators
j(z) Hadamard transform
Pi(U) Krawtchouk polynomial
MDS maximum distance separable code
Aq(n,d) = max {qk Ithere exists an [n, k, d) q-code over
Fq}
A~n(n,d) = max{qk Ithere exists a linear [n,k,d)q-code
over Fq}
Hq(8) q-ary entropy function
RH(8) asymptotic Hamming bound
Rp(8) asymptotic Plotkin bound
RBE(8) asymptotic Bassalygo-Elias bound
R/p(8) asymptotic linear programming bound
RG(8) asymptotic Gilbert bound
RGv(8) asymptotic Gilbert-Varshamov bound
RAG(8) asymptotic algebraic-geometric bound
RBZ(8) asymptotic Blokh-Zyablov bound
Rz(8) asymptotic Zyablov bound
RS Reed-Solomon code
CH Hamming code
CR,CN quadratic-residue codes
q,c~ extended quadratic-residue codes
BCH Bose--Chaudhuri-Hocquenghem code
Cll,C23 Golay codes
338 List of Notations
Algebraic Geometry
An affine n-dimensional space
lP" projective n-dimensional space
YeS) zero set
a(X) = {F E k[TlIF(x) = 0 for all x EX}
m maximal ideal
p prime ideal
rea) mdical of an ideal
X algebmic variety, smooth projective curve
List of Notations 339
343
344 Index