Professional Documents
Culture Documents
Analytic Number Theory 1
Analytic Number Theory 1
Valentin Blomer
22. November 2022
1
Inhaltsverzeichnis
0 Introduction 3
2
0 Introduction
Recommended literature:
• J. Brüdern, Einführung in die analytische Zahlentheorie
• How many primes are there in short interval (x, x + y] with y much less
than x?1
• in how many ways can one write a number as a sum of 2 (3, 4, . . .) primes?
• the same questions for squarefree numbers, sums of two squares, etc.
q (a prime, say), if they are q-adically close, and they are in a short interval, if they are
archimedically close
3
Example 1
We use the notation e(x) = e2πix . For n ∈ Z we have
Z 1
e(αn)dα = δn=0 .
0
Hence the number of integral solutions to n = x21 + x22 + x23 + x24 (sums of four
squares) is
X Z 1 Z 1 X 4
e(α(x21 + . . . + x24 − n))dα = e(−αn) e(αx2 ) dα.
0 0 √
x1 ,...,x4 ∈Z |x|≤ n
Example 2
For natural numbers n we have
X
µ(d) = δn=1
d|n
4
√
One can show that this is asymptotic to e−γ x/ log x, where γ = 0.577 . . .
is the Euler constant. Hence our heuristic argument gives the correct order of
magnitude, but the wrong constant. In particular, it is not allowed to ignore error
terms. Nevertheless, this method can be refined to prove interesting results (not
for counting primes, though), and leads to sieve theory.
Example 3
For c ∈ R let (c) be the vertical path c + it, −∞ < t < ∞. Then for c > 0 one
has Z (
1 −s ds 1, 0 < y < 1,
y =
2πi (c) s 0, y > 1.
(This is a non-trivial formula that we will prove later in (4.6)). The complex
contour integral is only conditionally convergent, but we ignore this for the
moment. Hence if an is any sequence of complex numbers, we have (ignoring
convergence)
Z −s Z X
X X X 1 n ds 1 an s ds
an = an δn/x≤1 = an = s
x .
n n
2πi (c) x s 2πi (c) n n s
n≤x
There are serious convergence issues, but they can be overcome in many appli-
cations. We will use this idea, for instance, to count primes up to x and prove
the prime number theorem.
5
1 Arithmetic functions and Dirichlet series
(1.1) Definition
An arithmetic function is a map f : N → C. An arithmetic function f 6= 0 is
multiplicative resp. completely multiplicative if f (nm) = f (n)f (m) for all
(m, n) ∈ N2 with (m, n) = 1 resp. for all (m, n) ∈ N2 . The set A of arithmetic
functions is a ring (an integral domain) with respect to pointwise addition and
Dirichlet convolution
X
(f ∗ g)(n) := f (d)g(n/d).
d|n
The identity element with respect to multiplication is the function η with η(1) =
1, η(n) = 0 for n > 1. (To show that there are no zero divisors let r, s minimal
with f (r) 6= 0 6= g(s) for 0 6= f, g ∈ A. Then (f ∗ g)(rs) = f (r)g(s) 6= 0.)
(1.2) Remarks
a) Since f (n) = f (1 · n) = f (1)f (n) for a multiplicative function f , we have
automatically f (1) = 1.
b) Multiplicative function are determined by their values on prime powers, com-
pletely multiplicative functions by their values on primes.
c) The units of A are precisely the function f with f (1) 6= 0: if f ∗ g = η, then
1 = η(1) = f (1)g(1). Conversely, if f (1) 6= 0, define its inverse g recursively by
g(1) = 1/f (1) and
1 X
g(n) = − f (d)g(n/d).
f (1)
d|n
d>1
(1.3) Lemma
The set of mutliplicative functions forms a subgroup of A∗ .
Proof. For (n, m) = 1 we have a bijection {d | nm} ↔ {d1 | n} × {d2 | m}. Let
f, g be multiplicative (hence invertible). Then
nm X X
X nm
(f ∗ g)(nm) = f (d)g = f (d1 d2 )g
d d1 d2
d|nm d1 |n d2 |m
X
X n n
= f (d1 )g f (d2 )g = (f ∗ g)(n) · (f ∗ g)(m)
d1 d2
d1 |n d2 |m
6
Then h is multiplicative and h = f −1 on prime powers, so f ∗ h = η on prime
powers, and hence by multiplicativity everywhere. We conclude that f −1 =
f −1 ∗ η = f −1 ∗ f ∗ h = h is multiplicative.
(1.4) Examples
• the function 1(n) = 1 for all n ∈ N is completely multiplicative;
P
• the divisor function τ (n) = d|n 1 = (1 ∗ 1)(n) is not completely mul-
tiplicative. We have τ (pe ) = e + 1.
P
• the iterated divisor function τk (n) = (1 ∗ . . . ∗ 1)(n) = a1 ···ak =n 1 with
τk (pe ) = k+e−1
e ;
• Euler φ-function φ(n). We have (φ ∗ 1)(n) = n (check on prime powers);
P
• the functions ω(n) = #{p | n : p prim} and Ω(n) = pk kn k are not
multiplicative;
(
log p, n = pk
• the function Λ(n) = (von Mangoldt function) sa-
0, otherwise
Q ep
tisfies (Λ ∗ 1)(n) = log n: for n = p we have
X X X
Λ(d) = ep log p = log pep = log n;
d|n p p
Proof. We have
X X
A(x)g(x) − an g(n) = an (g(x) − g(n))
y≤n≤x y≤n≤x
X Z x Z x X
= an g 0 (ξ)dξ = g 0 (ξ) an dξ.
y≤n≤x n y y≤n≤ξ
7
(1.6) Examples
a) The limit
X 1
γ := lim − log M = 0.577 . . .
M →∞ m
m≤M
b) We have X
τ (n) = x log x + (2γ − 1)x + O(x1/2 ),
n≤x
Proof: We have
X X X X X X √
τ (n) = 1= 1+ 1+ 1=2 1 + [ x]2 .
n≤x ab≤x ab≤x
√ ab≤x
√ ab≤x
√ ab≤x
√
a> x b> x a,b≤ x a> x
√ √
We have [ x]2 = x + O( x) and
X X X X x √
1= 1= − x + O(1)
√ √ √ b
ab≤x
√ b≤ x x<a≤x/b b≤ x
a> x
√ √ √ √ √
1 1
= x log x + γ + O √ − x[ x] + O( x) = x log x + γ − 1 + O( x).
x 2
(1.7) Lemma
For k ∈ N, ε > 0, there exists a constant C(k, ε) > 0 such that τk (n) ≤ C(k, ε)nε .
If p > C0 (k, ε) for some sufficiently large C0 (k, ε), then pαε > (α + k)k . We can
drop these factors and obtain
k+α−1
k+α−1
!C0 (k,ε)
τk (n) Y
α α
≤ ≤ max =: C(k, ε).
nε α
pαε α≥1 2αε
p kn
p≤C0 (k,ε)
8
(1.8) Definition and Lemma
Let f be an arithmetic
P∞ function. The Dirichlet series attached to f is the
−s
formal series n=1 f (n)n . If f, g are two functions whose Dirichlet series
converge absolutely for some s ∈ C, then
a)
∞ ∞ ∞
X (f ∗ g)(n) X f (n) X g(n)
= ;
n=1
ns n=1
ns n=1 ns
b)
∞ ∞
X f (n) Y X f (pk )
=
n=1
ns p
pks
k=0
Proof. a) We have
∞ ∞ ∞ ∞
X (f ∗ g)(n) X 1 X X f (a) X g(b)
= f (a)g(b) = .
n=1
ns n=1
ns a=1
as bs
ab=n b=1
b) Multiply the right hand side and use uniqueness of prime factorization.
(1.9) Examples
a) For <s > 1 we have
∞ Y −1
X 1 1
ζ(s) := = 1− s 6= 0.
n=1
ns p
p
The Riemann zeta function has meromorphic continuation to <s > 0 with only
a simple pole at s = 1: for N ∈ N we have by partial summation we have
X Z N
n−s = [N ]N −s + s [x]x−s−1 dx.
n≤N 1
The right hand side exists for <s > 0 (except at s = 1) and yields the desired
continuation.
b) By (1.4) we have
∞ ∞
X τ (n) X φ(n) ζ(s − 1)
s
= ζ(s)2 , <s > 1, s
= , <s > 2,
n=1
n n=1
n ζ(s)
∞ ∞
ζ 0 (s) X Λ(n) X µ(n) 1
− = , <s > 1, = , <s > 1.
ζ(s) n=1
ns n=1
n s ζ(s)
9
(1.10) Theorem
P∞ −s
If F (s) = n=1 an n is convergent for some s0 ∈ C, then it converges for
all s ∈ C with <s > <s0 , uniformly on compacta, and defines a holomorphic
function in this region.
For any ε > 0 there is M = M (ε) such that | M ≤n≤X an n−s0 | ≤ ε for all
P
X > M . Since <s ≥ <s0 we have
N Z N !
X a
n <(s0 −s)−1 |s − s0 | 1
≤ ε 1 + |s − s0 | t dt ≤ ε 1 + ≤ε 1+ .
ns <(s − s0 ) sin η
n=M M
(1.11) Theorem
There is a number σ0 ∈ R∪{±∞} (abscissa of convergence) such that a Dirichlet
P for all s with <s > σ0 and diverges for all s with <s < σ0 . If
series converges
σ0 > 0, i.e. if n an diverges, it is given by
PN ( N
)
log | n=1 an | X
α
σ0 = lim sup = inf α | an = O(N ) .
N →∞ log N n=1
Proof. The existence of σ0 follows from (1.10). The second equality is obvious.
P call−σthe right hand side γ. It remains to show γ = σ0 . Let σ > σ0 . Then
We
an n is convergent, hence by partial summation
N N Z NX
X X an σ an σ−1
an = σ
N − σ σ
t dt = O(N σ ),
n=1 n=1
n 1 n
n≤t
10
(1.12) Corollary
of convergence of n an n−s and σ1 the abscissa of con-
P
Let σ0 be the
Pabscissa
vergence of n |an |n−s . Then σ0 ≤ σ1 ≤ σ0 + 1.
Proof. Exercise
Proof. If not, then let m be the smallest index with am 6= bm . For σ > c we
have
∞ ∞ ∞ ∞
!
X an X bn X m σ X m σ
σ
0=m − = am − bm + a n − bn
n=1
nσ n=1 nσ n=m+1
n n=m+1
n
11
(1.15) Example
What is the “probability” that two numbers are coprime? By (1.4) we have
1 X 1 X X 1 X X X
1= µ(d) = µ(d) 1
x2 x2 x 2
n,m≤x n,m≤x d|(n,m) d≤x n≤x m≤x
(n,m)=1 d|n d|m
1 X x 2 X µ(d) 1 X1 1 X
= 2 µ(d) + O(1) = +O + 1
x d d2 x d x2
d≤x d≤x d≤x d≤x
∞
!
X µ(d) X 1 log x 1 1 log x 6
= 2
+O 2
+ + = +O → 2.
d d x x ζ(2) x π
d=1 d>x
Appendix: characters
(1.17) Definition
Let q ∈ N. A homomorphism χ : (Z/qZ)∗ → S 1 is called a Dirichlet character.
We extend χ to a completely multiplicative q-periodic function Z → S 1 ∪ {0}
by putting χ(n) = 0 for (n, q) 6= 1 and call this again a Dirichlet character. The
function
X∞ Y
L(s, χ) := χ(n)n−s = (1 − χ(p)p−s )−1
n=1 p
2 this is easy to see for cyclic groups and then follows in general
12
To show for instance the second formula for χ 6= χ0 , pick m with (m, q) = 1 and
χ(m) 6= 1, then
X X X
χ(n) = χ(nm) = χ(m) χ(n).
n (mod q) n (mod q) n (mod q)
(1.19) Definition
a) Let χ modulo q be a Dirichlet character. We say that χ has quasiperiod d
if χ(n) = χ(m) whenever n ≡ m (mod d) and (nm, q) = 1. The smallest quasi-
period is called conductor of χ.
b) If q ∗ | q, then χ mod q is induced by χ∗ mod q ∗ if χ(n) = χ∗ (n) for all
(n, q) = 1. A character χ mod q is called primitive if it is not induced by a
character χ∗ mod q ∗ with q ∗ < q.
(1.20) Lemma
a) The conductor of a character χ mod q is a divisor of q.
b) Every character χ mod q is induced by a unique character χ∗ mod q ∗ where
q ∗ is the conductor of χ. This character χ∗ is primitive.
13
any k ∈ Z such that (n + kq ∗ , q) = 1, and χ∗ (n) = 0 for (n, q ∗ ) > 1. Such a k
exists, for instance Y
k= p
p|q
p-q ∗ n
(1.21) Lemma
If a, b ∈ Z and χ is a primitive character modulo q, then
1 X
χ(ac + b) = χ(b)δq|a .
q
c (mod q)
Proof. First we observe that we can replace a by (a, q) without changing the
sum: indeed, if we write d = (a, q), a0 = a/d, q 0 = q/d with (a0 , q 0 ) = 1, then
X X X
χ(ac + b) = χ(a0 dc + b) = d χ(d(a0 c) + b)
c (mod q) c (mod q) c (mod q 0 )
X X
=d χ(dc + b) = χ(dc + b).
c (mod q 0 ) c (mod q)
Assume that χ(1 + ax) = 1 for all x ∈ Z with (1 + ax, q) = 1. Let (n, q) = 1 and
fix an integer n̄ such that n̄n ≡ 1 (mod q). Then χ(n + ay) = χ(n + nn̄ay) =
χ(n)χ(1 + an̄y) = χ(n) for all y ∈ Z with (1 + an̄y, q) = (n + ay, q) = 1, so that
by definition χ has quasiperiod a < q, contradiction. Hence there exists x with
χ(1 + ax) 6= 1, and so S = 0.
3 To check this, show that every p | q divides exactly one of n and kq ∗ , therefore it does
not divide n + kq ∗ .
14
(1.22) Definition
For a character χ modulo q we define the Gauß sum
X
τ (χ) = χ(h)e(h/q)
h (mod q)
(1.23) Theorem
Let χ be a character modulo q, a ∈ Z.
a) If (a, q) = 1, then4
X
χ(a)τ (χ̄) = χ̄(h)e(ha/q).
h (mod q)
Note in the first step that τ (χ) = χ(−1)τ (χ̄), so that |τ (χ)| = |τ (χ̄)|.
(1.24) Definition
A Dirichlet character is called even if χ(−1) = 1 and odd if χ(−1) = −1.
15
2 Fourier analysis and Poisson summation
The main result of this section is the Poisson summation formula. We give two
arithmetic applications. More will follow later.
(2.1) Definition
A Schwartz class function on R is a C ∞ -function f : R → C such that
f (n) (x) n,A (1 + |x|)−A for all n ∈ N0 and all A > 0. We denote the vector
space of such functions by S(R).
(2.2) Definition
For f ∈ L1 (R) we define the Fourier transform fb ∈ L∞ (R) by
Z
F(f )(y) = f (y) :=
b f (x)e(−xy)dx.
R
(2.3) Lemma
Let f, g ∈ L1 (R), c ∈ R.
a) The Fourier transform is a linear operator.
b) If h(x) = f (x − c), then bh(y) = e(−cy)fb(y).
h(y) = |c|−1 fb(y/c).
c) If h(x) = f (cx) with c 6= 0, then b
d) If h(x) = (f ∗ g)(x) = R f (t)g(x − t)dt, then h ∈ L1 (R) and b
R
h(y) = fb(y)b
g (y).
R R
e) We have R f (x)b g (x)dx = R fb(x)g(x)dx.
2
f) The function f (x) = e−πx is self-dual with respect to the Fourier transform.
(2.4) Theorem
a) If f ∈ S(R), then fb ∈ S(R). More precisely, if f (r) (x) (1 + |x|)−n−1−δ for
all r ≤ k and some δ > 0, then fb(n) (y) (1 + |y|)−k .
b) If f ∈ C 1 (R) ∩ L1 (R) and fb ∈ L1 (R), in particular if f, f 0 , f 00 (1 + |x|)−2
(by part a) with n = 0, k = 2, δ = 1), the Fourier inversion formula holds:
Z
f (x) = fb(y)e(xy)dy = fb(−x).
b
R
16
c) If f ∈ S(R), Parseval’s identity holds:
Z Z
2
|f (x)| dx = |fb(y)|2 dy.
R R
Proof. a) differentiation under the integral sign and integration by parts:
Z
1 1
fb(n) (y) r |(xn f (x))(r) |dx r
|y| R |y|
for all r ≤ k.
b) By Lebesgue’s dominated convergence theorem we have
Z Z Z
2 2
fb(y)e(−xy)dy = lim e−πε y fb(y)e(−xy)dy = lim fb(y)gε (y)dy
R ε→0 R ε→0 R
−πε2 y 2
where gε (y) = gε (y; x) = e e(−xy). By (2.3b, c, f) we compute
1 − π(x−t)2
gbε (t) = e ε2
ε
so that by (2.3e)
Z Z Z
1 π(x−y)2 f ∈C 1 (R) 1 π(x−y)2
fb(y)e(−xy)dy = lim f (y) e− ε2 dy = lim (f (x) + O(|y − x|)) e− ε2 dy
R ε→0 R ε ε→0 R ε
= lim (f (x) + O(ε)) = f (x).
ε→0
(2.5) Remarks
a) Theorem 2.4b, c holds with less restrictive assumptions on f . A standard
condition in b) is f ∈ L1 (R) ∩ C(R) and fb ∈ L1 (R), and f ∈ L1 (R) ∩ L2 (R) in
c).
b) The results in (2.3), (2.4) generalize in an obvious way to functions on Rn .
c) The Fourier transform translates smoothness in decay conditions (and vice
versa), and it translates small support into large support. In particular, there is
an uncertainty principle: the support of f and fb cannot be small simultaneously.
P
Proof. Let F (x) = P n∈Z f (x + n) and expand this 1-periodic function into a
Fourier series: F (x) = m∈Z a(m)e(mx) where
Z 1 Z 1X Z ∞
a(m) = F (x)e(−mx)dx = f (x+n)e(−mx)dx = f (x)e(−mx)dx = fb(m).
0 0 n∈Z −∞
Now put x = 0.
17
(2.7) Corollary
Let f ∈ S(R), a, q ∈ N, χ a primitive Dirichlet character modulo q.
a) We have
X 1 X b m am
f (m) = f e .
q q q
m∈Z m∈Z
m≡a (mod q)
b) We have
X τ (χ) X b m
f (m)χ(m) = f χ̄(m).
q q
m∈Z m∈Z
Proof. Exercise
(2.8) Corollary
2
e−πn x
P
√ Theta function Θ(x) =
The n∈Z satisfies the functional equation Θ(1/x) =
xΘ(x).
(2.9) Lemma
Let X and 0 < Z < Y be three real numbers. There exists a smooth, non-
negative function f satisfying the following properties:
a) the support of f is contained in [X − Z, X + Y + Z]
b) f = 1 on [X, X + Y ];
c) kf (j) k1 Z 1−j for all j ∈ N.
Proof. Exercise.
Proof. Let us first assume that χ is primitive (and q ≥ 2). We would like to
apply Poisson summation. To this end we need to smooth out the characteristic
function on the interval (M, M + N ]. Let w be a smooth bounded function with
support on [M −1, M +N +1] such that w = 1 on (M, M +N ] and kw(j) k1 j 1
for all j ∈ N. Then we have by (2.7b) that
Z
X X τ (χ) X nx
χ(n) = χ(n)w(n)+O(1) = χ̄(n) w(x)e − dx+O(1).
q R q
M <n≤M +N n∈Z n∈Z
18
for all j ∈ N. Hence
X 1 X q X q2
χ(n) 1 + √ + q 1/2 log q.
q n n>q n2
M <n≤M +N n≤q
Application: let q be a prime. Then any interval of length ≥ cq 1/2 log q with c
sufficiently large contains a quadratic residue and a quadratic non-residue.
(2.11) Lemma
For k ∈ Z, x ∈ R define the Bessel function
Z π
1
Jk (x) = e−ikφ+ix sin φ dφ.
2π −π
for all j ∈ N0 .
e) We have Jk (x) k min(1, |x|−1/2 ).
19
Proof. a) Follows directly from the theory of Fourier series.
b) Differentiate the generating series with respect to x:
and θ:
eiθ + e−iθ ix sin θ X
x e = kJk (x)eikθ
2
k∈Z
for any 0 < α < 1/10 such that |x|α > 100k. We can p estimate the other regions
of the integral in the same way. Choosing α 1/ |x| and assuming wlog that
x is sufficiently large (i.e. x k 2 ), we obtain the desired bound.
Remark: This improves what one gets from the elementary Lipschitz principle:
πR + O(R1/2 ).
20
Proof. Let 1 ≤ T ≤ R1/2 be another parameter and let w be a smooth bounded
function with support on [0, R + T ] such that w = 1 on [0, R] and kw(j) k1 j
T 1−j for all j ∈ N. Let r(n) be as in (1.18d). We have
X
r(n) = 4 χ−4 (d) τ (n) nε .
d|n
Hence
X X X
r(n) = r(n)w(n) + O(T Rε ) = w(x2 + y 2 ) + O(T Rε )
n≤R n≤R x,y∈Z
X Z
= w(x2 + y 2 )e(−nx − my)dx dy + O(T Rε ).
n,m∈Z R2
X Z ∞ √
=π r(`) w(r)J0 (2π r`)dr.
`≥1 0
1/3
Choosing T = R gives the result.
with ∞ √
Z
w̌(n) = π w(x)J0 (2π xn)dx.
0
21
3 The functional equation
In this section we use Poisson summation to derive an integral representation
of the Riemann zeta-function that (a) continues the zeta-function to the entire
plane and (b) proves the functional equation. We introduce a general class of L-
functions sharing similar properties with the Riemann zeta-functions. For such
L-functions we show the “approximate functional equation” as a tool for un-
derstanding L-functions in the critical strip. Applications include the convexity
bound and bounds for moments.
(3.1) Definition
For <z > 0 we define the Gamma function by
Z ∞
dt
Γ(z) = e−t tz .
0 t
(3.2) Theorem
The Gamma function can be continued meromorphically to C with simple poles
at −n, n ∈ N0 , with residue (−1)n /n!. It satisfies zΓ(z) = Γ(z + 1) and in
particular Γ(n) = (n − 1)! for n ∈ N. It is rapidly decaying on vertical lines, i.e.
Γ(x + iy) x,A (1 + |y|)−A away from poles, i.e. if minn∈N0 (x + iy + n) ≥ 1/100.
Proof. The formula zΓ(z) = Γ(z + 1) follows from partial integration in <z >
0. This gives inductively meromorphic continuation (and hence the recurrence
relation everywhere) and the location of poles. The decay property follows from
repeated integration by parts.
(3.3) Theorem
The Gamma function satisfies the following properties:
a) Γ is zero-free.
√
b) Γ(1/2) = π.
c) Γ(z)Γ(1 − z) = π/√ sin(πz).
d) Γ(z)Γ(z + 1/2) = π21−2z Γ(2z).
e) Γ(z) = Γ(z̄).
f) Stirling’s formula:
1 1
log Γ(z) = z − log z − z + log(2π) + Oδ (|z|−1 ),
2 2
or 1/2
2π z z 1
Γ(z) = 1 + Oδ
z e |z|
or
Γ0
(z) = log z + Oδ (|z|−1 )
Γ
22
whenever | arg(z)| < π − δ for some δ > 0.
Proof. Known from complex analysis. Part e) is obvious, part b) follows from
c).
(3.4) Corollary
a) For x, y ∈ R we have
π 1
|Γ(x + iy)| x,δ e− 2 |y| (1 + |y|)x− 2 , |x + iy + n| ≥ δ for all n ∈ N0 .
b) Suppose that <z ≥ −A, <(z + w) ≥ δ, |<w| ≤ A for some A, δ > 0. Then
there exists a constant c > 0 (depending on A and δ) such that
Γ(z + w)
A,δ (1 + |z|)<(w) exp(c|w|).
Γ(z)
Proof. a) It suffices to assume that |y| > 1. Then by Stirling’s formula we have
1 p
|Γ(x + iy)| x 1/2
exp < (x + iy)(log x2 + y 2 + i arg(x + iy))
|y|
1 2 π
x 1/2
ex(log |y|+Ox (1/y ))−|y|( 2 +Ox (1/|y|)) .
|y|
b) Let us first assume that <z ≥ 1/2. Then we can apply Stirling’s formula to
Γ(z + w) and Γ(z). We have
Z z+w 1
1 1
| log(z + w) − log z| = dt ≤ |w| + .
z t <(z + w) <z
Therefore
Γ(z + w)
= exp{<(log Γ(z + w) − log Γ(z))}
Γ(z)
1 1
δ exp < (z + w − 1/2) log(z) + O |w| + − w − (z − 1/2) log z
<(z + w) <z
exp{<(w log z) + OA,δ (|w|)} = exp{<(w) log |z| + OA,δ (|w|)} = |z|<w exp{OA,δ (|w|)}.
If <z > −A, |z| ≥ δ, we conclude from what we have already shown that
Γ(z + w) z(z + 1) · · · (z + [A] + 1)Γ(z + w)
= δ,A |z|[A]+2 ·|z|<w−[A]−2 exp(OA,δ (|w|)).
Γ(z) Γ(z + [A] + 2)
Finally if |z| ≤ δ, there is nothing to show.
(3.5) Definition
An entire function f : C → C is called of order β if f (z) exp(|z|β+ε ) for
all ε > 0. By slight abuse of notation we call aQmeromorphic function f with
finitely many poles at z = zj of order β if f (z) j (z − zj ) is of order β.
23
(3.6) Main Theorem (Riemann)
Let Z(s) := Γ(s/2)π −s/2 ζ(s) be the completed zeta function. It can be con-
tinued to a meromorphic function of order 1 with simple poles at s = 0 and
s = 1 and functional equation Z(s) = Z(1 − s). Its zeros are contained in the
region 0 ≤ <s ≤ 1.
The zeta function ζ(s) can be continued meromorphically to all of C; it is holo-
morphic except for a simple pole at s = 1 with residue 1. In <s < 0 it has zeros
precisely at −2, −4, −6, . . . which are all simple.
Remarks. The factor Γ(s/2)π −s/2 is sometimes called the local Euler factor at
∞. In <s > 1 we understand the zeta function from its series/product repre-
sentation. This is translated into the region <s < 0 by the functional equation.
The hard part will be to understand the zeta function in the “critical strip”
0 < <s < 1. The Riemann hypothesis states that all non-trivial zeros of the
Riemann zeta function satisfy <ρ = 1/2.
Proof. We have
∞ Z ∞
X 1 2 s
Z(s) = Γ(s/2)π −s/2 ζ(s) = π −s/2 s
e−πn y (πn2 y) 2 −1 πn2 dy
n=1
n 0
We obtain
Z ∞ Z ∞ Z ∞
dx s s dx 1 s dx
Z(s) = ω(x)x = 2 ω(x)x 2 + ω x− 2
0 x x x x
Z ∞1 1
1 1 s 1−s dx
=− + + ω(x)(x 2 + x 2 ) .
s s−1 1 x
24
The statement about the zeros of ζ is <s < 0 follows from
ζ(1 − s)Γ((1 − s)/2)π −1/2+s
ζ(s) =
Γ(s/2)
and the fact that Γ and ζ have no zeros in <s > 1.
(3.7) Theorem
Let χ be a primitive character modulo q > 1. Let κ = 0 if χ is even and κ = 1
if χ is odd. Then the completed L-function
q s/2 s + κ
Λ(s, χ) = Γ L(s, χ)
π 2
is entire of order 1 and satisfies the functional equation Λ(s, χ) = (χ)Λ(1−s, χ̄)
where (χ) = i−κ τ (χ)q −1/2 . In particular, in <s < 0 we have L(s, χ) = 0 if and
only if s ∈ {−2, −4, . . .} (χ even) or s ∈ {−1, −3, . . .} (χ odd).
Proof. We copy the proof of (3.6) with some modifications. In <s > 1 we have
q s/2 s + κ
Λ(s, χ) = Γ L(s, χ)
π 2
∞
χ(n) ∞ −πn2 y/q
q s/2 X Z
s+κ
= s
e (πn2 y/q) 2 −1 πn2 /q dy
π n=1
n 0
κ/2 Z ∞ X ∞
π s+κ 2
= χ(n)nκ y 2 −1 e−πn y/q dy.
q 0 n=1
We define X 2
Θ(x, χ) := χ(n)nκ e−πxn /q
.
n∈Z
2
If f (t) = tκ e−πt x/q
, then one checks that
κ
t −κ t q 1/2 −πt2 /xq
fb =i e
q x x
for κ ∈ {0, 1}. Hence by (2.7b) we have the functional equation
i−κ τ (χ)
Θ(x, χ) = √ Θ(1/x, χ̄).
xκ+1/2 q
Since χ(n)nκ is even and χ(0) = 0, we obtain
κ/2 Z ∞
π s+κ dy
2Λ(s, χ) = Θ(y, χ)y 2
q 0 y
κ/2 Z ∞ Z ∞
π s+κ dy
− s+κ dy
= Θ(y, χ)y 2 + Θ(1/y, χ)y 2
q 1 y 1 y
κ/2 Z ∞
i−κ τ (χ) ∞
Z
π s+κ dy 1−s+κ dy
= Θ(y, χ)y 2 + √ Θ(y, χ̄)y 2 .
q 1 y q 1 y
25
The integrals on the right hand side define entire functions. Replacing s by 1 − s
and χ by χ̄ shows the functional equation. The statements about the order of
Λ(s, χ) and the zeros of L(s, χ) follow as in (3.6).
(3.8) Definition
P L-function
An of degree d ∈ N in the class S is a Dirichlet series L(s) =
an n−s with the following properties:
a) it is absolutely convergent in <s > 1 and can be expanded in an Euler
product
d −1
YY αj (p)
L(s) = 1−
p j=1
ps
We write
d
Y s + κj
L∞ (s) = π −s/2 Γ
j=1
2
and
d X
Y
−s/2 s + κj
Λ̄(s) := Λ(s̄) = N s/2
π Γ ān n−s .
j=1
2 n
We write
d
Y d
Y
C0 (s) := (|s + κj | + 2), C(s) := N (|s + κj | + 2),
j=1 j=1
and call C(s) the analytic conductor of the L-function. It is a measure of its
complexity at the point s.
26
analytic conductor is q(|s| + 1).
By condition a) the coefficients an are multiplicative. One can show that the
sequence apk satisfies a d-term recurrence (exercise).
Since the L-function is absolutely convergent in <s > 1, we have by (1.11) that
1+ε
P
n≤x |an | x .
The product of two L-functions of degrees d1 and d2 is an L-function of d1 + d2 .
An L-function is called primitive if it doesn’t factor into L-functions of lower
degree. Currently we only know primitive L-functions of degree 1. (One can
show that Dirichlet L-functions and the Riemann zeta-function are the only
L-functions of degree 1). We will get to know primitive L-functions of degree 2
much later.
All our subsequent estimates will be uniform in N , αj (p) and κj .
Notation: We write s = σ + it.
(3.9) Lemma
We have
L∞ (1 − s)
N 1/2−s A,δ C(s)1/2−σ , |σ| ≤ A, <(1 − s + κj ) ≥ δ
L∞ (s)
and
L∞ (s + u) c
A,δ e 2 d|u| C0 (s)<u/2 , <s ≥ −A, |<u| ≤ A, <(s + u + κj ) ≥ δ
L∞ (s)
Proof. To prove the first statement, apply (3.3e) and (3.4b) with
1 − s + κj s + κj 1
z+w = , z= , w= − <s
2 2 2
and recall that the κj are either real or come in complex conjugate pairs. The
second part follows similarly.
We would like to investigate L-functions in the “critical strip” 0 < <s < 1. We
cannot use the series representation, but a certain smoothly truncated version
does the job.
27
where5
Z
1 L∞ (s + u) du
Vs (y) = y −u G(u) ,
2πi (3) L∞ (s) u
L∞ (1 − s)
(s) = ηN 1/2−s ,
L∞ (s)
Λ(s + u) G(u) u
R= res + res X .
u=1−s u=−s N s/2 L∞ (s) u
Proof. Let Z
1 du
I(s) = X u Λ(s + u)G(u) .
2πi (3) u
The integral is absolutely convergent by the rapid decay of L∞ . We move the
line of integration to <u = −3, picking up possible poles of Λ at u ∈ {1 − s, −s}
and a pole at u = 0. This gives
Z
1 du
I(s) = X u Λ(s + u)G(u) + Λ(s) + RN s/2 L∞ (s).
2πi (−3) u
In the first integral we apply the functional equation and change variables u 7→
−u getting
Z
η du
I(s) = − X −u Λ̄(1 − s + u)G(u) + Λ(s) + RN s/2 L∞ (s).
2πi (3) u
and
Z
η −u du (1−s)/2
X ān nX
X Λ̄(1 − s + u)G(u) = ηN L∞ (1 − s) V1−s √
2πi (3) u n
n1−s N
X ān
nX
= (s)N s/2 L∞ (s) V
1−s 1−s
√ ,
n
n N
28
(3.11) Lemma
Let 0 ≤ σ ≤ 1 and suppose that σ + <κj > 0 for all j. There is a choice of G
such that −A
y
y a Vs(a) (y) a,A 1 +
C0 (s)1/2
for all a ∈ N0 , A ≥ 1.
Remark: In other words, Vs is bounded, and all its derivatives are rapidly
decaying once y is bigger than C0 (s)1/2 . Hence the “effective length” of the ap-
proximate functional equation is C(s)1/2 .
2
Proof. We choose G(u) = eu Then G is even, holomorphic and satisfies
2
G(u) <u e−|u| . Now
Z
1 L∞ (s + u) du
a (a)
y Vs (y) = y −u G(u)(−u) · · · (−u − a + 1) .
2πi (3) L∞ (s) u
Then
1−`(σ)
|f (σ + it)| ≤ Ma`(σ) Mb (1 + |t|)α`(σ)+β(1−`(σ))
σ−b
where `(σ) = a−b is the linear function with `(a) = 1, `(b) = 0.
29
It follows from the assumption that L(s) 1 in <s ≥ 1 + ε, and from the
functional equation along with (3.9a) we obtain L(s) C(s)1/2−σ+ε in <s ≤ −ε.
The Phragmén-Lindelöf principle now gives the claim. In the case of the Rie-
mann zeta-function we consider (s − 1)ζ(s) to have a holomorphic function.
Note that C(s) C(1 − s) for bounded <s P since the κj are either real or co-
me in pairs of complex conjugates. Since n≤x |an | x1+ε , we see by partial
summation after splitting the sum at n = C(s)1/2 that the first two sums are
C(s)(1−σ)/2+ε , as desired. In the case of the Riemann zeta-function we esti-
mate R 1 and the claim follows.
Some details: First we explain why C(s) C(1 − s). Since |x + iy| |x| + |y|, we have for
s = σ + it that
d
Y d
Y
C(s) = N (|s + κj | + 2) N (1 + |σ + <κj | + |t + =κj |)
j=1 j=1
d
Y
N (1 + |<κj | + |t + =κj |)
j=1
and the two products coincide, because either =κj = 0, or there exists a pair of κ’s with
opposite signs.
P Summation1+ε by parts: For notational simplicity let us write C := C(s)1/2 and A(t) :=
|a
n≤t n | t . First we estimate
X |an | X |an | Z C
n −10 A(C) A(t)
σ
1+ ≤ σ
= σ
+σ σ+1
dt
n≤C
n C n≤C
n C 1 t
Z C
C 1+ε−σ + t−σ+ε dt C 1+ε−σ .
1
30
Next we estimate in the same way
X |an | X |an | C 10 Z ∞
n −10 10 A(t)
σ
1 + ≤ σ n10
≤ C (σ + 10) σ+10+1
dt C 1+ε−σ .
n≥C
n C n≥C
n C t
(3.13) Remark
The Generalized Lindelöf Hypothesis states that L(s) C(s)max(1/2−σ,0)+ε eve-
rywhere (away from poles), but this is currently not known. For L-functions of
degree 1 the best known bound is
L(s) C(s)1/6+ε , <s = 1/2,
whereas (3.12) gives the exponent 1/4. This uses deep results from algebraic
geometry and the theory of automorphic forms.
(3.14) Corollary
We have X
|L(1/2, χ)|2 q 1+ε .
χ (mod q)
χ primitive
Proof. If one sums the approximate functional equation over a family, one has
to be very careful which parameters depend on the family. In our case, the root
number (s) depends on χ and the weight function V depends on the parity of
χ. Therefore we sum over even and odd characters separately and eliminate the
root number by a baby form of Cauchy-Schwarz (|a + b|2 ≤ 2(|a|2 + |b|2 )):
X χ(n) n 2
X X
|L(1/2, χ)|2 ≤ 4 V √
n1/2 q
χ (mod q) χ (mod q) n
χ primitive, even χ primitive, even
31
By symmetry we can assume m ≥ n, so that we are left with bounding
X 1 n −1
X 1 m
q V √ √ 1 + √
n
n1/2 q m
m≥n
q
m≡n (mod q)
X 1 n 1 X q 1/2 X 1 n −1 1 1
q V √ √ + q 1 + √ √ + .
n
n1/2 q n
k≥1
(kq)3/2 n
n1/2 q n q
(exercise). Here one can drop all but one term and recover the convexity bound
L(1/2, χ) q 1/4+ε .
(3.15) Convention
From now on, the symbol ε means a sufficiently small number, but not necessa-
rily the same at each occurrence. For instance, we may write xε log x xε
etc.
(3.16) Theorem
For T > 1, 1/2 ≤ σ < 1 one has
Z T
|ζ(σ + it)|2 dt T 1+ε .
0
We have −1
|G(1 − σ − it)| |G(−σ − it)| Γ σ + it e−|t| ,
R +
|1 − σ − it| | − σ − it| 2
32
so I3 1.
Next we investigate I1 . We always assume 0 ≤ t ≤ T and 1/2 ≤ σ < 1. By (3.11) we have
−A
X 1 X 1 n A 1
Vσ+it (n) T 2 −(σ+A)( 2 +ε) T −εA/2 .
1/2+ε
nσ+it 1/2+ε
nσ T 1/2
n>T n>T
33
Remark: As in (3.14), the proof shows really a fourth moment estimate
Z T
|ζ(σ + it)|4 dt T 1+ε .
0
34
4 The Mellin transform
In this section we get to know a new kind of integral transform that is particu-
larly useful for counting purposes.
(4.1) Definition
Let f : (0, ∞) → C be a piecewise continuous function satisfying f (x) x−a
for x → 0 and f (x) x−b for x → ∞ for some real numbers a < b (possibly
±∞). The Mellin transform of f is the function
Z ∞
dx
M(f )(s) = f (s) :=
b f (x)xs .
0 x
(4.2) Remarks
We have F(f )(y) = M(f ◦ log)(−2πiy) and M(f )(s) = F(f ◦ exp)(−s/(2πi)).
Hence the Mellin transform is, up to scaling by −2πi, the image of the Fourier
transform under the homomorphism exp : (R, +) 7→ (R> , ·). The maps x 7→ xs
are quasi-characters of the multiplicative group.
We have M(e−x )(s) = Γ(s) with a = 0, b = ∞.
(4.3) Lemma
Let f : (0, ∞) → C be a piecewise continuous function satisfying f (x) x−a
for x → 0 and f (x) x−b for x → ∞ for some real numbers a < b. Assume
that for x → 0 we have
∞ NX
X (m)
f (x) ∼ cnm (log x)n xrm
m=0 n=0
N (m)
X X
f (x) = cnm (log x)n xrm + O(xM ) (x → 0)
<(rm )<M n=0
for all M ∈ R. Then the Mellin transform fb(s) can be continued meromorphi-
cally to <s < b with poles at s = −rm of order N (m) + 1.
Remark: This is consistent with what we know about the Gamma function
(3.2). A similar result holds if f has an asymptotic expansion at ∞.
35
Proof. For M ∈ R let f = gM + hM with
0, x ≥ 1,
gM (x) = X NX (m)
cnm (log x)n xrm , x<1
<(rm )≤M n=0
and hM = f − gM . Then
Z ∞ Z 1
dx dx
hM (s) =
b f (x)xs + (f − gM )(x)xs
1 x 0 x
is holomorphic in −M < <s < b. On the other hand, for <s sufficiently large
one has
X NX (m)
cnm (−1)n n!
gbM (s) =
n=0
(s + rm )n+1
<(rm )≤M
which can be continued meromorphically to the entire plane. Since M was ar-
bitrary, the claim follows.
(4.4) Lemma
Let a0 ≤ a < b ≤ b0 . Let f, h : (0, ∞) → C be piecewise continuous functions
satisfying f (x), h(x) x−a for x → 0 and f (x), h(x) x−b for x → ∞, and
assume that fb can be continued to a0 < <s < b0 .
a) For u ∈ R, v > 0, the Mellin transform of g(x) = xu f (xv ) can be continued
to −u + va0 < <s < −u + vb0 and satisfies gb(s) = v −1 fb((s + u)/v).
b) If f is continuously differentiable, then the Mellin transform of g = f 0 can
be continued to a0 + 1 < <s < b0 + 1 and satisfies gb(s) = (1 − s)fb(s − 1).
c) If f ∈ CcN , then fb(s) = O<s,N ((1 + |=s|)−N ) for s ∈ C.
R∞
d) If (f ∗ h)(x) := 0 f (t)h(x/t)dt/t, then f[ ∗ h = fbbh in a < <s < b.
Proof. Exercise
36
Examples and remarks: We have e−x = 2πi 1
Γ(s)x−s ds.
R
(1)
In (3.10) the weight function is given by an inverse Mellin transform. One ty-
pically estimates Mellin integrals by partial integration and inverse Mellin inte-
grals by shifting the contour.
Proof. Replacing f (x) by xN f (x), we can assume that a < 0, so that we can
choose c = 0. Now the statement follows simply from the corresponding inversion
formula for the Fourier transform (2.4):
Z Z
1 −s 1 s
M(f )(s)x ds = F(f ◦ exp) − x−s ds
2πi (0) 2πi (0) 2πi
Z ∞
= F(f ◦ exp)(s) exp(2πis log x)ds = f (exp(log x)) = f (x).
−∞
1 x−s
R
Then χ
b(s) = 1/s and χ(x) = 2πi (c) s
ds for c > 0. More precisely:
Z c+iT −s (
1 x O(cT −1 ), x = 1,
ds = χ(x) + −c −1
2πi c−iT s O(x min(1, (T | log x|) )), x 6= 1.
37
√
Alternatively, let C be the circle about 0 with radius R = c2 + T 2 . Let C0 be
the portion to the right of the segment [c − iT, c + iT ]. Then
1 Z c+iT x−s 1 Z x−s x−c
ds = ds ≤ R = x−c .
2πi c−iT s 2πi C0 s R
(4.7) Theorem
Let x ≥ 2. Let 0 < c 1, T ≥ 2 + c, possibly depending on x. Let an
−s
P
be a sequence of complex numbers and assume that n an n is absolutely
convergent at s = c. Then
c+iT !
xc X |an |
Z
X 1 X an
s ds x log x
an = x +O + max 5 |an | 1 + .
2πi c−iT n
ns s T n nc 3
4 x≤n≤ 4 x
T
n≤x
where
X |an |
E Ax + x c min 1, (T | log(n/x)|)−1
n
nc
xc X |an | 1 X
Ax + c
+ Ax | log(x/n)|−1
T n n T
2≤|n−x|≤ 14 x
c −1
x X |an | Ax X x
Ax + + − 1
T n nc T n
2≤|n−x|≤ 14 x
xc X |an | Ax X x
Ax + +
T n nc T |x − n|
2≤|n−x|≤ 14 x
Remark: The truncation is not necessary if one allows a smooth weight func-
tion, e.g. Z X
X 1 an
an e−n/x = s
Γ(s)xs ds.
n
2πi (c) n
n
38
(4.8) Example
1
We have n≤x µ2 (n) = + O(x1/2+ε ).
P
ζ(2) x
µ(n)n−2s is
P
We shift the contour to <s = 1/2 + ε. In this region 1/ζ(2s) =
holomorphic and bounded (by absolute convergence). Hence
Z 1/2+ε+iT Z c±iT !
X
2 x x log x
µ (n) = + (. . .) + O (. . .) + O .
ζ(2) 1/2+ε−iT 1/2+ε±iT T
n≤x
Remark: We see that the error term is intimately linked to the location of zeros
of the Riemann zeta function, because the factor 1/ζ(2s) limits a contour shift
further to the left. Any improvement in the error term (beyond removing the ε
which is not hard) is equivalent to a quasi-Riemann hypothesis.
6c = 1 + ε would do the job equally well.
39
(4.9) Example
Let k ∈ N, k ≥ 2. There is a polynomial Pk of degree k − 1 and a constant
δ = δk > 0, such that
X
dk (n) = xPk (log x) + O(x1−δ ).
n≤x
and
c±iT c
x1+ε
Z Z
1
(. . .) xσ T k/2(1−σ)+ε dσ .
1/2±iT T 1/2 T
We choose −1
1 k
T = x 2 ( 4 +1) = x2/(k+1) ,
so that
X ζ k (s)xs
dk (n) = res + O(x1−δ+ε )
s=1 s
n≤x
Remark: Assuming the Lindelöf hypothesis (cf. (3.13)), we could get the error
term O(x1/2+ε ) for all k.
40
5 Zeros of L-functions and prime number theo-
rems
In this section we study the location and density of zeros of L-functions and pro-
ve the prime number theorem (5.6) and the prime number theorem in arithmetic
progressions (5.14).
where the sum is over all zeros ρ of Λ (with multiplicity), and b is some complex
number (depending on the L-function).
Proof. Since (s(1 − s))r Λ(s) is an entire function of order 1, this follows from
(5.1b) with g(z) = a + bz after taking the logarithmic derivative and noting that
d
ds (s(1− s))r Λ(s) L0 L0∞ 1 r r
r
= (s) + (s) + log N + +
(s(1 − s)) Λ(s) L L∞ 2 s s−1
and
d
ds (1 − s/ρ)es/ρ 1 1
s/ρ
= + .
(1 − s/ρ)e s−ρ ρ
41
(5.3) Important technical lemma
Keep the above notation, in particular let L be a general L-function, let ρ 6= 0, 1
be a zero of Λ,
Pand let b be as in (5.2). Then
a) <(b) = − ρ <ρ−1 . (Note that this sum is not absolutely convergent, but
limx→∞ |ρ|≤x <ρ−1 exists.)
P
Note that both sums on the right hand side are absolutely convergent since
1 1 1 − 2<ρ 1
+ = s
s − ρ 1 − s − ρ̄ (s − ρ)(1 − s − ρ̄) |ρ|2
and 1/ρ + 1/ρ̄ 1/|ρ|2 . Since both ρ and 1 − ρ̄ are zeros, the first sum vanishes.
b) Let s = 3 + iT. We have the crude bound
d
L0 X X αj (p) log p X p
(s) = − d d log C(iT ).
L p j=1
ps p
p3
By Stirling’s formula,
1 L0
log N + ∞ (s) log C(iT ).
2 L∞
Finally, r/s + r/(1 − s) r 1. Taking the real part in (5.2) and using part
a), we see
X X 1 1
−1
<ρ − < + log C(iT )
ρ ρ
s−ρ ρ
for s = 3 + iT . Write ρ = β + iγ. Since
1 1 1
< =< ,
s−ρ (3 − β) + i(T − γ) 1 + (T − γ)2
42
we can rearrange the absolutely convergent sum to get
X 1
log C(iT )
ρ
1 + (T − γ)2
L0
and by Stirling’s formula we have L∞
∞
(s) log C(it) away from poles, in other
words
L0∞ X 1
(s) = + O(log C(it)).
L∞ s + κj
|s+κj |<1
where ρ runs over all nontrivial zeros (with multiplicity) of the Riemann zeta
function and w
b is the Mellin transform of w.
Remark: This shows a close connection between primes and zeros of the Rie-
mann zeta function.
43
Sketch of Proof. By Mellin inversion we have
ζ0
Z
X 1
Λ(n)w(n) = − (s)w(s)ds.
b
n
2πi (2) ζ
We shift the contour to the far left. We pick up contributions atR the non-trivial
∞
zeros of the Riemann zeta function, at s = 1 we obtain w(s)
b = 0 w(x)dx, and
the trivial zeros contribute
XZ ∞ Z ∞
−2n dx 1 dx
X
− w(−2n) =− w(x)x =− w(x) 2 .
x −1 x
b
0 x 0
n≥1 n≥1
Exercise: make the contour shift rigorous using the rapid decay of w
b on vertical
lines and (5.3).
44
Now fix a zero ρ with t = =ρ ≥ C0 /2. Then
ζ0 ζ0 ζ0
0 ≤ < −3 (σ) − 4 (σ + it) − (σ + 2it)
ζ ζ ζ
3 4
≤ + O(1) − < + O(log(2 + t))
σ−1 σ + it − <ρ − it
3 4
≤ − + C1 log(2 + t).
σ − 1 σ − <ρ
for some constant C1 > 0. Now we choose σ = 1 + δ/ log(2 + t) for some δ > 0.
This gives
δ 4δ 1
<ρ < 1 + − =1−
log(2 + t) (3 + C1 δ) log(t + 2) 15C1 log(2 + t)
Proof. We apply Perron’s formula (4.7) with c = 1 + 1/ log x and some 10 <
T ≤ x1/2 to be chosen later. This gives
Z c+iT 0 !
X 1 ζ xs x X log n x log x
Λ(n) = − (s) ds + O + log x .
2πi c−iT ζ s T n n1+1/ log x T
n≤x
The error term is O(x(log x)2 /T ). We integrate over the rectangle with vertices
c ± iT , c0 ± iT where
C
c0 = 1 −
2 log(2 + T )
with C as in (5.5). Then (5.3) implies that
ζ0
(s) (log T )2
ζ
45
on the edges of the rectangle. Hence the integral over the three segments [c −
iT, c0 − iT ], [c0 − iT, c0 + iT ], [c0 + iT, c + iT ] is
x 0
(log T )2 + (log T )xc ,
T
and we obtain
x(log x)2
X 0
Λ(n) = x + O + (log T )3 x1−C / log T .
T
n≤x
√
The proof is completed by choosing T = exp( log x).
(5.7) Theorem
The Riemann hypothesis is true if and only if ψ(x) = x+O(x1/2+ε ) for all ε > 0.
The integral on the right hand side is holomorphic in <s > 1/2, hence ζ is
zero-free in this region.
(5.8) Corollary
We have
X log p p X1 p
= log x+c1 +O(exp(−c0 log x)), = log log x+c2 +O(exp(−c0 log x))
p p
p≤x p≤x
Remark: With weaker error terms O(1/ log x), this can be proved without the
prime number theorem in an elementary way.
46
(5.9) Corollary
a) We have
Y 1
C p
1− = 1 + O(exp(−c0 log x))
p log x
p≤x
Proof. We have
Y 1
X
1 X 1 XX 1
1− = exp log 1 − = exp − −
p p p kpk
p≤x p≤x p≤x p≤x k≥2
!
p XX 1 X X 1
= exp − log log x − c1 + O(exp(−c0 log x)) − + O
p
kpk p>x
pk
k≥2 k≥2
| {z }
=(p(p−1))−1
C p
= exp O(exp(−c0 log x)) ,
log x
and the result follows.
b) Exercise.
c) We have
XX X x
1= + O(1) = x log log x + c2 x + O(x/ log x).
p
n≤x p|n p≤x
47