Professional Documents
Culture Documents
Mathmatical Thought From Ancient to Modern Times
Mathmatical Thought From Ancient to Modern Times
MORRIS KLINE
Professor of Mathematics
Courant Institute of Mathematical Sciences
New York University
/
New York OXFORD UNIVERSITY PRESS rg72
Copfight @ 1972 by Morris Klinc
Library of Congress Catalogue Card Numbr 77-170263
Printed in thc Unitcd Statca of Amcri.a
To my wife, Helen Mann Kline
Preface
repldce or prove the Euclidean parallel axiom date from Euclid's time
onward. Of course, many topics recur at various periods.
To keep the material within bounds I have ignored several civilizations
such as the Chinese,l Japanese, and Mayan because their work had no
material impact on the main line of mathematical thought. Also some
developments in mathematics, such as the theory of probability and the
calculus of finite differences, which are important today, did not play major
roles during the period covered and have accordingly received very little
attention. The vast expansion of the last few decades has obliged me to
include only those creations of the twentieth century that became significant
in that period. To continue into the twentieth century the extensions ofsuch
subjects as ordinary differential equations or the calculus ofvariations would
call for highly specialized material of interest only to research men in those
fields and would have added inordinately to the size ofthe work. Beyond these
considerations, the importance of many of the more recent developments
cannot be evaluated objectively at this time. The history of mathematics
teaches us that many subjects which aroused tremendous enthusiasm and
engaged the attention of the best mathematicians ultimately faded into
oblivion. One has but to recall Cayley's dictum that projective geometry is
all geometry, and Sylvester's assertion that the theory of algebraic invariants
summed up all that is valuable in mathematics. Indeed one of the interesting
questions that the history answers is what survivcs in mathematics. History
makes its own and sounder evaluations.
Readers of even a basic account of the dozens of major developments
cannot be expected to know the substance of all these developments. Hence
except for some very elementary areas the contents of the subjects whose his-
tory is being treated are also described, thus fusing exposition with history.
These explanations of the various creations may not clarify them completely
but should give some idea of their nature. Consequently this book may
serve to some extent as a historical introduction to mathematics, This
approach is certainly one of the best ways to acquire understanding and
appreciation.
I hope that this work will be helpful to professional and prospective
mathematicians. The professional man is obliged today to devote so much of
his time and energ'y to his specialty that he has little opportunity to familiar-
ize himself with the history of his subject. Yet this background is important.
The roots ofthe present lie deep in the past and almost nothing in that past is
irrelevant to the man who seeks to understand how the present came to be
what it is. Moreover, mathematics, despite the proliferation into hundreds of
branches, is a unity and has its major problems and goals. Unless the various
specialties contribute to the heart of mathematics they are likely to be
L A fine account ofthe history of Chinese mathematics is available in Joseph Needham's
Science and Ciailization in China, Cambridge University Press, 1959, Vol. 3, pp. l-168.
PREFACE lX
sterile. Perhaps the surest way to combat the dangers which beset our
fragmented subject is to acquire some knowledge of the past achievements,
traditions, and objectives of mathcmatics so that one can direct his research
into fruitful channels. As Hilbert put it, " Mathematics is an organism for
whose vital strength the indissoluble union of the parts is a necessary
condition."
For students of mathematics this work may have other values. The
usual courses Present segments of mathematics that seem to have little re-
lationship to each other. The history may give perspective on the. entire
subject and relate the subject matter ofthe courses not only to each other but
also to the main body of mathematical thought'
The usual courses in mathematics are also deceptive in a basic resPect'
They give an organized logical presentation which leaves the impression that
mathematicians go from theorem to theorem almost naturally, that matle-
maticians can master any difficulty, and that the subjects are completely
thrashed out and settled. The succession of theorems overwhelms thc student,
especially if he isjust learning the subject.
The history, by contrast, teaches us that the development of a subject
is made bit by bit with results coming from various directions. We learn, too,
that often decades and even hundreds ofyears of effort were required bcfore
significant steps could be made. In place of the irnpression that the subjects
are completely thrashed out one finds that what is attained is often but a
start, that many gaps have to be filled, or that the really imPortant extensions
remain to be created.
'ihe polished presentations in the courses fail to show the struggles
ofthe creative process, the frustrations, and the long arduous road mathema-
ticians must travel to attain a sizable structure. Once aware of this, the
student will not only gain insight but derive courage to Pursue tenaciously
his own problems and not be dismayed by the incompleteness or deficiencics
in his own work. fndeed the account of how mathematicians stumblcd,
groped their way through obscurities, and arrived piecemeal at their rcsults
should give heart to any tlro in research.
To cover the large area which this work comprises I have tried to
select the most reliable sources. In the pre-calculus period these sources,
such as T. L. lJeattr's A Historg of Gruk Mathcmatbs, are admittcdly secondary,
though I have not relied on just one such source. For the subsequent dc-
velopment it has usually been possible to go directly to the original papers,
which fortunately can be found in thejournals or in the collected works of the
prominent mathematicians. I have also been aided by numerous accounts and
i.-,.'rr"y. of research, some in fact to be found in the collected works' I have
tried to give references for all of the major results; but to do so for all asser-
tions would have meant a mass of references and the consumption of space
that is better devoted to the account itself.
PREFACE
New York M. K.
May 1972
Contents
1. Mathematics in Mesopotamia, 3
2, Eg'yptian Mathematics, 15
l. Background, 15 2. The Arithmetic, 16 3. Algebra and Geomery, l8 4' Egyptian
Uses of Mathematics, 21 5. Summary, 22
l. Introduction, 1052 2. The Beginning ofa Gcneral Theory, 1056 3. The Work of
Hilbert, 1060 4. The Immediate Successors of Hilbert, 1070 5. Extensions of the
Theory, 1073
l. The Nine teenth-Century Background, ll36 2' Abstrrict Group Theory, ll37 3' The
AbstractTheoryofFields,l1464.Rings,ll505.Non-AssociativeAlgebras,ll53
6. The Range of Abstract Algebra, I 156
3
MATIIEMATICS IN MESOPOTAMIA
n
2
Rf
34
Y-W w
5
ffi
6
v-m
w w <Y
ll
<It
20 40
4
.10
I l0 t2 30
Y K Y< YY YY<
60 70 80 120 r30
The striking features of the Babylonian number system are the base 60 and
positional notation.
At first the Babylonians had no symbol to indicate the absence of a
number in any one position, and consequently their numbers were am-
biguous. Thus (could mean B0 or 3620, depending upon whether the first
symbol meant 60 or 3600. Spacing was often used to indicate no quantity
in a given position, but of course this might be misinterpreted' In the
6 plTHEMAflcs rN MEsoporAMrA
Now suppose there is another system with a different unit but the same
ratios, and political or social forces compel the fusing of the two systems. (We
have meters and yards, for example.) I[ the larger unit were 60 times the
smaller, then 112, 113, and 213 of the larger unit would be integral multiples
of the smaller one. Thus the larger unit might have been adopted because it
was so convenient.
As to the origin ofpositional notation, there are two likely explanations.
In an older scheme of writing numbers I multiplied by 60 was written with a
larger I than the same symbol for l. When the writing was simplified the
larger I was reduced in size but kept in the usual place for 60. Hence position
ARITHMETIC OPERATION8 7
4. Arithmetic OPerations
In the Babylonian system the symbols for I and l0 were basic. Numbers from
I to 59 were formed by combining fewer or more of these symbols' Hence the
processes of addition and subtraction were merely a matter of adding or
iaking away symbols. To indicate addition the Babylonians joined numbers
i" (fff, which indicates 16. Subtraction was often indicated by
together u,
the symbol
|> . fn"t { Ifr
ti
f - ,. In astronomical texts of a later
signi$ing addition.
period the word tab appears,
Multiplication of integers was also performed. To multiply by 37, say,
would mein multiplying by 30, then by 7, and adding the results' The
symbol for multiplication *ut I p[, pronounced a-rd' lt r'r.rearrt " to go'"
TheBabyloniansdividedonewholenumberbyanother.Sincetodivide
by an integer a is to multiply by the reciprocal I la, to this extent fractions
*".. irr'rroli.d. The Babylonians converted the reciprocals to sexagesimal
" decimals " and, except for the few mentioned above, did not use special
symbols for fractions. They had tables showing how numbers of the form l/a,
where a : 2s385t, could be written as terminating sexagesimal numbers'
Some tables gave approximate values for 1/7, l/l l, l/13, etc', because these
fractions led to infinite repeating sexagesimals. where fractions involving
denominators other than 2, 3, or 5 occurred in the older problems, the same
troublesome factors occurred in the numerator and were cancelled'
The Babylonians relied entirely uPon the tables of reciprocals' Their
tables show, for example,
obviously meaning ll2 :30160, ll3 :20160, etc. The precise meanhgs of
igi and, gil-bi are not known. Sexagesimal fractions, that is, numbers less than
1 in inverse powers of 60, 602, etc., but with denominators merely
"*p.ot"d continued to be used by the Greeks Hipparchus and Ptolemy
understood,
and in Renaissance Europe up to the sixteenth century, when they were
replaced by decimals in base 10.
I MATHEMATTcS rN MEsoporAMrA
The Babylonians also had tables expressing squares, square roots, cubes,
and cube roots. When the root was a whole number it was given exactly. For
other roots, the corresponding sexagesimal numbers were only approximate.
Of course, irrationals cannot be expressed in a finite number of decimals or
sexagesimals. flowever, there is no evidence that the Babylonians were
aware of this. They might well have believed that the irrationals could be
converted exactly to sexagesimal numbers if more places were used. An
excellent Babylonian approximation to {2 gives {1, : 1.41+213 . . .
instead of 1.414214 . . ..
Roots occur in their calculations of the diagonal d of a rectangle of
height i and width zo. One problem asks for the diagonal of a rectangular
gate with given height and width. The answer is given without explanation
and amounts to using the approximate formula for the diagonal d namely,
dxh J-'2h
ro2
-.
The formula is a good approximation to d for i > zz. Thus for lz > rl,,, as is
the case in one problem, we can see that the answer is reasonable by noting
that
d : \/F * -a :
^l
,;# : n(r + {)'''
Ifwe now expand the binomial and retain only the first two terms we get the
formula. Other approximate answers to square-root problems were given,
obtained presumably by using numbers in Babylonian tables.
5. Babylonian Algebra
Distinct from the table texts, which yield much information on the Baby-
lonian number system and operations with numbers, are texts that deal with
algebraic and geometric problems. A fundamental problem of the older
Babylonian algebra asks for a number which, added to its reciprocal, yields
a given number. In modern notation, the Babylonians sought r and i such
that
xi : l, t * E : b.
These two equations equation in x, namely, x2 btt +
-
I: 0. They formed
JT='and then
b .b
and.
2+ ,-
which yield the answers. In effect, the Babylonians had the quadratic for-
mula. Other problems, such as finding two numbers whose sum and product
BABYLOMAN ALGEBRA 9
are given, were reduced to the above problem. Since the Babylonians had no
,r.giai,r. numbers, negative roots ol quadratic equations were neglected'
Though only concrete examples were given, many were intended to illustrate
a ge.r"ral pro"edrrre for quadratics. More complicated algebraic problems
were reduced by transformations to simpler ones'
The Babylonians were able to solve special problems involving five
unknowns in hve equations. One problem, which arose in connection with
the adjustment of astronomical observations, involved ten equations and ten
,rkro-*rs, mostly linear. The solution used a special method of combining
the equations until.the unknowns were evaluated'
ihe algebraic problems were stated and solved verbally' The words as
(length), sag (breadth), and aia (area) were often used for the unknowns' not
because the unknowns necessarily represented these geometric quantities'
but probably because many algebraic problems came from geometric
situations and the geometric terminology became standard' An illustration
of the way in which these terms were employed for the unknowns and of the
*ay ir, *hi"h problems were stated may be gathered from the following:
..I have multipiied Length and Breadth and the Area is 10. I have multiplied
the Length by itself and have obtained an Area. The excess- of Length over
Breadth I have multiplied by itself and this result by 9. And this Arca is Area
obtained by multiplying the Length by itself. What are the Length and
Breadth?" It is obvious here that the words Length, Breadth, arrd Atea ate1
merely convenient terms for two unknowns and their product, respectively'
Today we would write this problem as
xY:10
9(x-y)2:f.
The solution, incidentally, leads to a fourth-degree equation in *, witfr th;
x arrd x3 terms missing so that it can be and was solved as a quadratic in x3.
Problemsleadingtoacuberootalsooccurred.Themodernformulation
of one such problem would be
l2x : z, !: x, )c! z : V,
where Zis some known volume. To find x here we must extract a cube root'
The Babylonians calculated this root from the cube-root tables we mentioned
the
earlier. They also did compound-interest Problems that called for finding
value of an unknown exPonent.
The Babylonians sometimes employed symbols for unknowns, but the
6. Babylonian Geornetry
The role of geometry in Babylonia was insignificant. Geometry was not a
separate mathematical discipline. Problems involving division of a field or
the sizes of bricks needed for some construction were readily converted into
algebraic problems. Some computations of area and volume were given in
accordance with rules or formulas. However, the figures illustrating geo-
metric problems were roughly drawn and the formulas may have been in-
correct, In the Babylonians' calculations of areas, for example, we cannot
tell whether their triangles were right triangles or whethgr their quadri-
laterals were squares and, hence, whether their formulas were the right ones
for the figures concerned. However, the Pythagorean relationship, the simi-
larity of triangles, and the proportionality of corresponding sides in similar
triangles were known. The area of a circle was obtained, apparently by
tousing3forz.However,anotheroftheirresultsgivingtherelationbetween
the cirJumference of a regular hexagon and its circumscribed circle implies
a value ol 3 l/B for zr. A few volumes were comPuted, some correctly and
some incorrectly, in the course of solving particular physical
problems'
Apart from a few special facts, such as the calculation of the radius of a
a inown isosceles triangle, the substance of Babylonian
circle circumscribing
geometry was a colLction of rules for the areas of simple plane figures' in-
It,rdirrg ."g,rlu. polygons, and the volumes of simple solids' The geometry
*u, ,rJ, ,ai ai"d-i.r'ut d ficr itself but always in connection with practical
problems.
kept the lunar calendar in time with the solar year. Thus 235 lunar months
were equal to 19 solar years. The summer solstice was systematically com-
puted, and the winter solstice and the equinoxes were placed at equal
intervals. This calendar was used by theJews and Greeks and by the Romans
up to 45 B.c., the year the Julian calendar was adopted.
The division of the circle into 360 units originated in the Babylonian
astronomy ofthe last centuries before the Christian era. It had nothing to do
with the earlier use of base 60; however, base 60 was used to divide the
degree and the minute into 60 parts. The astronomer Ptolemy (2nd cent.
a.o.) followed the Babylonians in this practice.
Closely connected with astronomy was astrology' In Babylonia, as in
many ancient civilizations, the heavenly bodies were thought to be gods and
so were presumed to have influence and even control over the affairs of man.
When one takes into account the imPortance of the sun for light, heat, and
the growth of plants, the dread inspired by its eclipses, and such seasonal
phenomena as the mating of animals, one can well understand the belief that
the heavenly bodies do affect even the daily events in man's life.
Pseudoscientific schemes of prediction in ancient civilizations did not
always involve astronomy. Numbers themselves had mystic properties and
could be used to make predictions. One finds some Babylonian usages in the
Book of Daniel and in the writings of the Old and New Testament prophets.
The Hebrew "science" of gematria (a form of cabbalistic mysticism) was
based on the fact that each letter ofthe alphabet had a number value because
the Hebrews used letters to represent numbers. If the sum of the numerical
values of the letters in two words was the same' an imPortant connection
between the two ideas or people or events rePresented by the words was
inferred. In the prophecy of Isaiah (21:B), the Iion proclaims the fall of
Babylon because the letters in the Hebrew word for lion and those in the
word for Babylon add up to the same sum.
Bibliography
Bell, E. T.: TIU Dculopmcnt of Matlumatics,2nd ed., McGraw-Hill, Chaps. l-2.
Boyer, Carl B.z A History oJ Matlumatics, John Wiley and Sons, 1968, Chap. 3.
Cantor, Moritz: Vorlcs'ungcn ilber Gcschichtc der Matlumatik, 2nd ed., B. G. Teubner,
1894, Vol. I, Chap. l.
Chiera, E.: Tluy Wok on Clay, Chicago University Press, 1938.
Childe, V. Gordon: Man Makes Himscfi, New American Library, 1951, Chaps. 6-8.
Dantzig, Tobiasz Numbcr: Thc Languagc oJ Sciercc, 4th ed., Macmillan, 1954,
Chaps. l-2.
Karpinski, Louis C.: Tlu Histoty of Arithmctic, Rand McNally, 1925.
Menninger, K,t Numbcr Words and Numbcr Sgmbols: A Cultural History of Numbcrs,
Massachusetts Institute of Technology Press, 1969.
Neugebauer, Ottol Tlu Etact Scitnces in Antiquitg, Princeton University Press, 1952,
Chaps. l-3 and 5.
Vorgri.echischc Matlumatik, Julius Springer, 1934, Chaps. l-3 and 5.
Sarton, George: A History of Scicnce, Harvard University Press, 1952, Vol. l, Chap,3.
The Studg of tlu History of Matlumatirs and tlu History of Scictuc, Dover
(reprint), 1954.
Smith, David Eugene: History oJ Matlumatics, Dover (repriat), 1958, Vol. l,
Chap. l.
Struik, Dirk J.: A Concise Historg of Mathzmatics, 3rd ed., Dover, 1967, Chaps. l-2.
van der Waerden, B. L.: Sciztuc Awakcning, P. Noordhoff, 1954, Chaps, 2-3.
2
Egyptian Mathematics
l. Background
While Mesopotamia experienced many changes in its ruling peoples, with
resultin! new cultural influences, the Egyptian civilization developed un-
affected-5y foreign influences. The origins of the civilization are unknown but
it surely existed even before 4000 r.c. Egypt, as the Greek historian Hero-
dotus says, is a gift of the Nile. Once a year this river, drawing its water from
the soutir, floods the territory aII along its banks and leaves behind rich soil.
Most of the people did and still do make their living by tilling this soil. The
rest of the country is desert.
There were two kiagdomq, one in the north and one in t]re south of what
is present-day Egypt. Some time between 3500 and 3000,p'c', -the..- ryler,
Mirra, or Menq, unifiod'trpper and lower Egygt' From this time on the
major 'ireriods oi Egyptian history are referred to in terms of the nrling
dyiasties, Menes having been the founder of the first dyt'ttty' The height of
Egyptian culture occurred during the third dynasty (about 2500 r'c'),
arlring which period the rulers built the pyramids. The civilization went its
o*r, iuy until Alexander the Great conquered it in 332 r'c' Thereafter,
until about e.o. 600, its history and mathematics belong to the Greek
civilization. Thus, apart from one minor invasion by the Hyksoc (1700-
1600 r.c.) and slight contact with the Babylonian civilization (inferred from
the discovery in the Nile valley of the cuneiform Tell al-Amarna tablets of
about 1500 a.c.), Egyptian civilization was the product of its native people'
The ancient egyPtians developed their own systems of writing' One
system, hieroglyphics, was pictorial, that is, each symbol was a picture of
some olject. Hi.roglyphicr were used on monuments until about the time of
christ. From about 2500 B.c. the Egyptians used for daily purposes what is
called hieratic writing. This system employed conventional symbols, which
at fi.rst were merely simplifications of the hieroglyphics. Hieratic writing is
I5
\
I6 ,t\
EGYPTIAN MATHEMATICS
<alnd
syllabic; each syllable is represented by an ideogram and an entire word is
a collection ofideograms. The meaning of the word is not tied to the separate
ideograms.
The writing was done with ink on papyrus, sheets made by pressing the
pith of a plant and then slicing it. Since papyrus dries up and crtLmbles,
very few documents of ancient Egypt have syrvived, apart from the hiero-
glyphic inscriptions on stone.
' ."The main surviving mathematical documents are two sizable papyri:
the Moscow papyrus, which is in Moscow, and the Rhind papyrus, dis-
covered in I858 by a Britisher, Henry Rhind, and now in the British Museum.
The Rhind papyrus is also known as the Ahmes papyrus after its author, who
opens with the words " Directions for Obtaining the Knowledge olAll Dark
Things." Both papyri date from about 1700 e.c. There are also fragments of
other papyri written at this time and later. The mathematical papyri were
written by scribes who were workers in the Egyptian state and church
administrations.
The papyri contain problems and their solutions-85 in the Rhind
papyrus and 25 in the Moscow papyrus. Presumably such problems occurred
in the work ofthe scribes and they were expected to know how to solve them.
It is most likely that the problems in the two major papyri were intended
as examples of typical problems and solutions. Though these papyri date
from about l-700 r.q., the mathematics in them wai k.,o*n to the Egyptians
as far back as 3500 n.c., and little was added from that time to the Greek
conquest.
2. The Arithmetic
The hieroglyphic number symbols used by the Egyptians were l for .l;
f \AJ
fl for 10, Q and 9 for 100, I for 1000, dl fo. to,OoO,andother
symbols for larger units. These symbols were combined to form intermediate
numbers. The direction of the writing was from right to left, so that I I I lnn
represented 24. This system of writing numbers uses the base l0 but is not
positional.
Egyptian hieratic whole numbers are illustrated by the following
_"l
symbols:
The arithmetic was essentially additive. For ordinary additions and suS-
traclions they could simply combine or take away symbols to reach the proper
C r7
THE AR-ITHMETI
for 100.
Division of one whole number by another as carried out by the Egyptians
is equally interesting. For example, 19 was divided by B as follows:
I
2. ,, 16'
ttz: 4
Lr4. - 2
1/8 I
the
Therefore, the answer was 2 * ll4 + ll8' The idea was simply to take
number of eights and parts of eight that totaled 19'
The den"otation oi fru,.tiorrrln the Egyptian number system was much
more complicated than our own. The symbol .-' pronounced ro' which
originally indicated ll32O of a bushel, came to denote a fraction' In hieratic
-numberby
wri"ting ih" o't al was replaced a dot' The 3 or dot was generally
plu""i ubou. the whole to indicate the fraction' Thus' in hiero-
glyphic writing,
ol o- l. .-lll l.
lll : E'
ilJ n:m' n ll --15
A few fractions were denoted by special s1'rnbols' Thus the hieroglyph '-
denoted l12; 7,213; and' x,ll4'
Aside from a few special ones, all fractions were decomposed
into what
.are called unit fractions. Thus Ahmes writes 2/5 as l/3 + 1/15. The plus
a
rig" aia not appear but was understood' The Rhind paPyrus contains from
i"ii" f". "*pr"rrirrg fractions with numerator 2 and odd denominators
5 to l0l ur r',r-, oifractions with numerator l'
By means of this table a frac-
integer
tio, ,,r"h as otr 7l29,which to Ahmes is the integer 7 divided by the :
Inasmuch as7 2I
29, could also be expressed as a sum of unit fractions'
to a sum of fractions with
2 + 2 + l, he proceeds by converting each 2129
,rr.*.r.to.'1. By combining these results and by further conversion he ends
I8 EGYPTIAN MATHEMATICS
up with a sum of unit fractions, each with a diflerent denominator. The final
expression for 7 129 is
tll -r
tl
-_r--J--
6' 24' 58 - 'a7'232
-,1-
-.
It so happens that 7129 can also be expressed as l/5 + l/29 + l/145, but
because Ahmes' 2ln table leads to the former expression, this is the one used.
The expression of ow alb as a sum of unit fractions was done systematically
according to age-old procedures. Using unit fractions, the Egyptians could
carry out the four arithmetic operations with fractions. The extensive and
complicated computations with fractions were one reason the Egyptians
never developed arithmetic or algebra to an advanced state.
The nature of irrational numbers was not recognized in Egyptian
arithmetic any more than it was in the Babylonian. The simple square roots
that occurred in algebraic problems could be and were expressed in terms of
whole numbers and fractions,
ference, to be 5 t/2 times the smallest number' He then picks I as the smallest
.rrd g"i, the progression : 1,6 ll2, 12, 17 112,23' But these numbers add up
to 6dwhereas th-ey should add up to 100. He then multiplies each term by
513.
Only the simplest types of quadratic equations, such as atf
: bt arc
considered. Even when two unknowns occur, the type is
The Egyptians also had rules for the volume of a cube, box, cylinder,
and other figures. Some ol the rules were correct and others only approxi-
mations. The papyri give as the volume of a truncated conical clepsydra
(water clock), in our notation,
,: #G@ + d)) \2
n:LuA, + ab + br),
where I is the height and a and D are sides of top and bottom. The formula is
surprising because it is correct and because it is symmetrically expressed (but
of course not in our notation). It is given only for concrete numbers. How-
ever, we do not know whether the pyramid was square-based or not because
the figure in the papyrus is not carefully drawn.
Neither do we know whether the Egyptians recognized the Pythagorean
theorem. We know there were rope-stretchers, that is, suryeyors, but the
story that they used a rope knotted at points to divide the total length into
parts ofratios 3 to 4 to 5, which could then be used to form a right triangle,
is not confirmed in any document.
The rules were not expressed in symbols. The Egyptians stated the
problems verbally; and their procedure in solving them was essentially what
we do .when we calculate according to a formula. Thus an almost literal
translation of the geometrical problem of finding the volume of the frustum
of a pyramid reads: "If you are told: a truncated pyramid of 6 for the verti-
cal height by 4 on the base, by 2 on the top. You are to square this 4, result
16, You are to double, result B. You are to square 2, result 4. You are to add
the 16, and 8, and the 4, result 28. You are to take one-third of 6, result 2.
You are to take 28 twice, result 56. See, it is 56. You will find it right."
Did the Egyptians know proofs or justifications of their procedures and
formulas ? One belief is that the Ahmes papyrus was written in the style of a
textbook for students of that day and hence, even though no general rules or
principles for solving types of equations were formulated by Ahmes, it is
very Iikely that he knew them but wanted the student to formulate them
himself or have a teacher formulate them for him. Under this view the
Ahmes papyrus is a rather advanced arithmetic text. Others say it is the
notebook of a pupil. In either case, the papyri almost surely recorded
the types of problems that had to be solved by business and administrative
clerks, and the methods of solution were just practical rules known by ex-
perience to work. No one believes that the Egyptians had a deductive
(+$\\\.,
EGYPTIAN USES OF MATHEMATICS 2l
knew ofthe Sothic cycle is oPen to question. Their calendar was adopted by
Julius Caesar in 45 r.c., but changed to a 365 I l4-d'ay year on the advice of
the Alexandrian Greek Sosigenes. Though the Egyptian determination ofthe
year and the calendar were valuable contributions, they did not result from
a well-developed astronomy, which in fact was crude and far inferior to
Babylonian astronomy.
The Egyptians combined their knowledge of astronomy and geometry
to construct their temples in such a manner that on certain days of the year
the sun would strike them in a particular way' Thus some were built so that
on the longest day of the year the sun would shine directly into the templeand
illuminate the god at the altar. This orientation of temples is also found to
some extent among the Babylonians and Greeks. The pyramids too were
oriented to special directions of the heavens, and the Sphinx faces east.
While the details of the construction of these works are unimportant for us,
it is worth noting that the pyramids rePresent another application of Egyp-
tian geometry. They are the tombs of kings; and because the Egyptians
believed in immortality they believed that the proper construction of a tomb
was material licr the dead person's afterlife. In fact, an entire aPartment for
the future residence of king and queen was installed in each pyramid. They
took great care to make the bases of the pyramids of the correct shape; the
relative dimensions of base and height were also highly significant. However,
one should not overemphasize the comple>rity or depth ofthe ideas involved.
Egyptian mathematics was simple and crude and no deep principles were
ivolved, contrary to what is often asserted.
5, Summary
Let us review the status of mathematics before the Greeks enter the picture.
We find in the Babylonian and Egyptian civilizations an arithmetic of
integers and fractions, including positional notation, the beginnings of
algebra, and some empirical fiormulas in geometry. There was almost no
symbolism, hardly any conscious thought about abstractions, no formulation
of general methodology, and no concept of proof or even of plausible argu-
ments that might convince one of the correctness of a procedure or formula.
There was, in fact, no conception of any kind of theoretical science.
Apart from a few incidental results in Babylonia, mathematics in the
two civilizations was not a distinct discipline, nor was it pursued for its own
sake. It was a tool in the form of disconnected, simple rules which answered
questions arising in the daily life of the peoPle. Certainly nothing was done
in mathematics that altered or affected the way of life. Although Babylonian
mathematics was more advanced than the Egyptian, about the best one can
say for both is that they showed some vigor, if not rigor, and more per-
severance than brilliance'
BIBLIOGRAPITY 23
Bibliography
Boyer, Carl B.: A History of Matlumatits, John Wiley and Sons, 1968, Chap' 2'
Cantor, Moritz: Voilcsungen ltber Gcschichtc dcr Matlumatik, 2nd ed', B' G' Teubner'
1894, Vol' l, ChaP. 3.
-i,,
Chace, A. ,r ol., eds.t Ttu Mind Mathcmatical Papras,2 vols', Mathematical
Association of America, 1927-29 '
Childe, V. Gordon: Man Malees Himself, New American Library, l95l'
Karpinski, Louis C': The History of Arithmatb, Rand McNally, 1925'
Neu-gebauer, O,z Tlu Exact Sci.cntcs in Antiquity, Princeton Univenity Press, 1952'
ChaP. 4.
Vorgicchischc Matlumatik, Julius Springer, 1934'
Sarton, George : A History of Seitncc, Harvard University Prcss, 1952, VoI' l'
Chap. 2.
Smith, David Eugene: History of Matlumatbs, Dover (reprint), 1958, Vol' l'
Chap. 2; Vol. 2, ChaPs. 2 all'd' 4,
van der Waerden, B. L.: Scbuc Awakning, P' Noordhoff, 1954, Chap'
l'
3
The Creation of Classical
Greek Mathematics
l. Background
In the history of civilization the Greeks are preeminerrt, and in the history
of mathematics the Greeks are the supreme event. Though they did borrow
from the surrounding civilizations, the Greeks built a civilization and culture
of their own which is the most impressive of all civilizations, the most in-
fluential in the development of modern Western culture, and decisive in
founding mathematics as we understand the subject today. One of the great
problems ofthe history ofcivilization is how to account for the brilliance and
creativity of the ancient Greeks.
Though our knowledge oftheir early history is subject to correction and
amplification as more archeological research is carried on, we now have
reason to believe, on the basis of the lliad and the Odgssey of Homer, the
decipherment of ancient languages and scripts, and archeological investiga-
tions, that the Greek civilization dates back to 2800 a.c. The Greeks settled
in Asia Minor, which may have been their original home, on the mainland
ofEurope in the area of modern Greece, and in southern Italy, Sicily, Crete,
Rhodes, Delos, and North Africa. About 775 B.c. the Greeks replaced various
hieroglyphic systems of writing with the Phoenician alphabet (which was
also used by the Hebrews). With the adoption of an alphabet the Greeks
became more literate, more capable of recording their history and ideas.
As the Greeks became established they visited and traded with the
Egyptians and Babylonians. There are many references in classical Greek
writings to the knowledge of the Egyptians, whom some Greeks erroneously
considered the founders of science, particulariy surveying, astronomy, and
24
TIIE GENERAL SOURCES 25
arithmetic. Many Greeks went to Egypt to travel and study. Others visited
Babylonia and learned mathematics and science there.
The influence of the Egyptians and Babylonians was almost surely felt
in Miletus, a city of Ionia in Asia Minor and the birthplace of Greek phil-
osophy, mathematics, and science. Miletus was a great and wealthy trading
city on the Mediterranean. Ships from the Greek mainland, Phoenicia, and
Egypt came to its harbors; Babylonia was connected by caravan routes
leading eastward. Ionia fell to Persia about 540 B.c', though Miletus was
allowed some independence. After an Ionian revolt against Persia in 494 s'c'
was crushed, Ionia declined in importance. It became Greek again in 479 r'c'
when Greece defeated Persia, but by then cultural activity had shifted to the
mainland of Greece with Athens as its center.
Though the ancient Greek civilization lasted until about a'o' 600, from
the standpoint of the history of mathematics it is desirable to distinguish two
periods, the classical, which lasted from 600 to 300 r.c., and the Alexandrian
or Hellenistic, from 300 B.c. to A.D. 600. The adoption of the alphabet,
akeady mentioned, and the fact that PaPyrus became available in Greece
during the seventh century B.c. may account for the blossoming of cultural
activity about 600 B.c. The availability of this writing paper undoubtedly
helped the spread of ideas.
The Greek and Arabic versions we have may come from such versions of the
originals. Ffowever, in one or another of these forms we do have the works
of Euclid, Apollonius, Archimedes, Ptolemy, Diophantus, and other Greek
authors. Many Greek texts written during the classical and Alexandrian
periods did not come down to us because even in Greek times they were
superseded by the writings of these men.
The Greeks wrote some histories of mathematics and science. Eudemus
(4th cent. r.c.), a member of Aristotle's school, wrote a history of arithmetic,
a history of geometry, and a history of astronomy. Except for fragments
quoted by later writers, these histories are lost. The history ofgeometry dealt
with the period preceding Euclid's and would be invaluable were it available.
Theophrastus (c. 372-+. 287 r.c.), another disciple of Aristotle, wrote a
history of physics, and this, too, except for a few fragments, is lost.
In addition to the above, we have two important commentaries.
Pappus (end of 3rd cent. a.o.) wrote the Sgnagoge or Matlumatical Collecti.on;
almost the whole of it is extant in a twelfth-century copy. This is an account
of much of the work of the classical and Alexandrian Greeks from Euclid to
Ptolemy, supplemented by a number of lemmas and theorems that Pappus
added as an aid to understanding. Pappus had also written the Treasury oif
Analysis, a collection of the Greek works themselves. This book is lost, but in
Book VII of his Matlumatical Collecti.on he tells us what his Treasurg contained.
The second important commentator is Proclus (e.o. 410-485), a prolific
writer. Proclus drew material from the texts of the Greek mathematicians
and from prior commentaries, Of his surviving works, the Commentarg, which
treats Book I of Euclid's Elements, is the most valuable. Proclus apparently
intended to discuss more of the Elements, but there is no evidence that he ever
did so. The Commcntarg contains one of the three quotations traditionally
credited to Eudemus'history of geometry (see sec. l0) but probably taken
from a later modification. This particular extract, the longest of the three, is
referred to as the Eudemian summary. Proclus also tells us something about
Pappus' work. Thus, besides the later editions and versions of some of the
Greek classics themselves, Pappus' Matlumatical Collection and Proclus' Coz-
fluntarg are the two main sources of the history of Greek mathematics.
Of original wordings (though not the manuscripts) we have only a
fragment concerning the lunes of Hippocrates, quoted by Simplicius (first
half of 6th cent. e.o.) and taken from Eudemus' lost Histnry of Geometrg, arld,
a fragment of Archytas on the duplication of the cube. And of original manu-
scripts we have some papyri written in Alexandrian Greek times. Related
sources on Greek mathematics are also immensely valuable. For example,
the Greek philosophers, especially Plato and Aristotle, had much to say about
mathematics and their writings have survived somewhat in the same way as
have the mathematical works.
The reconstruction ofthe history ofGreek mathematics, based on sources
TUE MAJOR SCHOOT,S OF Tr{E CLASSTCAL PERTOD 27
discuss the work ofa man in connection with a particular school even though
his association with it was not close.
5. The Pythagoreans
The torch was picked up by Pythagoras who, supposedly having learned
from Thales, founded his own school in Croton, a Greek settlement in
southern Italy. There are no written works by the Pythagoreans I we know
about them through the writings of others, including Plato and Herodotus.
In particular we ate hazy about the personal life of Pythagoras and his
followers; nor can we be sure of what is to be credited to him personally or
to his followers. Hence when one speaks ofthe work of Pythagoras one really
refers to the work done by the group between 585 r.c., the rePuted date of
his birth, and roughly 400 g.c. Philolaus (5th cent. s.c.) and Archytas
(428-347 a.c.) were prominent members of this school.
Pythagoras was born on the island of Samos, just off the coast of Asia
Minor. After spending some time with Thales in Miletus, he traveled to other
places, including Egypt and Babylon, where he may have picked up some
mathematics and mystical doctrines. He then settled in Croton. There he
29
THE PYTHAGOREANS
founded a religious, scientific, and philosophical brotherhood' It was a
formal school, in that membership was limited and members learned from
leaders. The ieachings ofthe group were kept secret by the members, though
the secrecy as to mathematics and physics is denied by some historians. The
Pythagoreans were supposed to have mixed in politics ; they allied themselves
w'ith tte aristocratic faction and were driven out by the popular or democratic
party. Pythagoras fled to nearby Metapontum and was murdered there
his
aborrt +Si r.". Hi, followers spread to other Greek centers and continued
teachings.
On1 of the great Greek contributions to the very concePt of mathe-
matics was the conscious recognition and emphasis of the fact that mathe-
matical entities, numbers, and geometrical figures are abstractions' ideas
entertained by the mind and sharply distinguished from physical objects
or
pictures, It is true that even some primitive civilizations and certainly the
'Egyptians
and Babylonians had Iearned to think about numbers as divorced
frlm physical objects. Yet there is some question as to how much they were
geo-
corscLrrsly aware of the abstract nature of such thinking' Moreover'
metricalthinkinginallpre-Greekcivilizationswasdefinitelytiedtomatter.
To the Egyptiu*, fo. e*ample, a line was no more than either a stretched
rope or tfr'" .ag" of a field and a rectangle was the boundary of a field'
The recotnition that mathematics deals with abstractions may with
some confidenie be attributed to the Pythagoreans. However,
this may not
the Pythag-
have been true at the outset of their work. Aristotle declared that
oreans regarded numbers as the ultimate components of real'
material
objects.l lir-b.r, did not have a detached existence apart from objects
of
seise. When the early Pythagoreans said that all objects were composed
of
(whole) numbers or that ,rr-b".t were the essence of the universe' they
meant it literally, because numbers to them were Iike atoms are
to us' It is
also believed that the sixth- and fifth-century Pythagoreans
did not really
then' a number
distinguish numbers from geometrical dots' Geometrically'
Eudemus'
*", u]r, extended poirrt oi a very small sphere' However' (than had as
Figure 3,I
Figurc 3.2
each side, 4 being another favorite number. They realized that the sums
l, | + 2, I + 2 + 3, and so forth gave the triangular numbers and that
I + 2 + ... I n : (nl2)(n + t).
The numbers l, 4, 9, 16,.. . were called square numbers because as
dots they could be arranged as squares (Fig. 3.2). Composite (nonprime)
numbers which were not perfect squares were called oblong.
From the geometrical arrangements certain properties of the whole
numbers became evident. Introducing the slash, as in the third illustration
of Figure 3.2, shows that the sum of two consecutive triangular numbers is a
square number. This is true generally, for as we can see, in modern notation,
i@ + r) *+ @ + 2): (z + 1)'.
Figure 3.3
THE PYIIIAGOREANS 3r
add the gnomon 3 and then the gnomon 5, and so forth, what we have in our
symbolism is
l+3+5+"'+(2n-l):z''
As to the word "gnomon," originally in Babylonia it probably meant
an upright stick whose shadow was used to tell time. In Pythagoras' time it
is the shape of the above grromon' It
-.uot " carpenter's square, and thisa square when a smaller square was cut
also meant what was left over from
out of one corner. Later, with Euclid, it meant what was left from a paral-
lelogram when a smaller one was cut out of one corner provided that the
p".ull"log.urn in the lower right-hand corner was similar to the one cut out
(Fig.
' 3.a).
-The with polygonal numbers such as pen-
Pythagoreans also worked
tagonal, hexagonal, and higher ones. As we can see from Figure 3'5, where
.ulh dot .ap."r"rrtt a unit, the first pentagonal number is l, the second,
whose dots form the vertices ofa pentagon, is 5; the third is I * 4 * 7, or
12, and so forth. The zth pentagonal number, in our notation,is (3nz - n)12'
Liiewise the hexagonal numbers (Fig. 3.6) are l, 6, 15, 28, ' ' ' and generally
2n2 - n,
A number that equaled the sum of its divisors including I but not the
number itself was callet perfect; for example,6,28, and 496' Those e:rceed-
ing the sum of the divisors were called excessive and those which were less
wJre called defective. Two numbers were called amicable if each was the
sum of the divisors of the other, for example, 284 arad 22O'
The Pythagoreans devised a rule for finding triples of integers which
could be the sides of a right triangle. This rule implies knowledge of tlre Py-
thagorean theorem, about which we shall say more later' They found that
whJn n is odd, thert m, (m2 - l) 12, and, (mz + l) 12 arc such a triple' However,
this rule gives only some sets of such triples. Any set of three integers which
can be the sides ofa right riangle is now called a Pythagorean triple.
The Pythagoreans studied prime numbers, progressions, and those
ratios and proportions they regarded as beautiful. Thus ifp and q are two
numbers, the arithmetic rnean Ais (p + q)12, the geometric mean Gis \/-pq,
and the harmonic mean 11, which is the reciprocal of the arithmetic mean of
llp and, llq, is 2pql(p * q). Now G is seen to be the geometric mean of
A and, H. The proportion AIG : GIH was called the perfect proportion
and the proportion pt(p + dl2 :z?qlQ I q):q was called the musical
ProPortion.
Numbers to the Pythagoreans meant whole numbers only. A ratio of
two whole numbers was not a fraction and therefore another kind of number,
as it is in modern times. Actual fractions, expressing parts of a monetary unit
or a measure, were employed in commerce, but such commercial uses of
arithmetic were outside the pale of Greek mathematics proper. Hence the
Pythagoreans were startled and disturbed by the discovery that some ratios-
for example, the ratio of the hypotenuse of an isosceles right triangle to an
arm or the ratio of a diagonal to a side of a square-cannot be expressed by
whole numbers. Since the Pythagoreans had concerned themselves with
whole-number triples that could be the sides of a right triangle, it is most
likely that they discovered these new ratios in this work. They called ratios
expressed by whole numbers commensurable ratios, which means that the
two quantities are measured by a common unit, and they called ratios not so
expressible, incommensurable ratios. Thus what we express u" l/i.12 is an
incommensurable ratio. The ratio of incommensurable magnitudes was
called cloyos (alogos, inexpressible). The terr:l, d,ppytos (arratos, not having
a ratio) was also used. The discovery of iricommensurable ratios is attributed
to Hippasus of Metapontum (5th cent. a.c.). The Pythagoreans were suP-
posed to have been at sea at the time and to have thrown Hippasus over-
board for having produced an element in the universe which denied the
Pythagorean doctrine that all phenomena in the universe can be reduced to
whole numbers or their ratios.
Tr{E PYTHACOREANS 33
The proof that y'Z it i.rco-rrr.nsurable with I was given by the Pythag-
orearrr. According to Aristotle, their method was a reductio ad absutdum-
that is, the indirect method' The proof showed that if the hypotenuse were
commensurable with an arm then the same number would be both odd and
even. It runs as follows: Let the ratio of hypotenuse to arm ofan isosceles
right triangle be o:p and let this ratio be expressed in the smallest numbers.
Then o' : Zp' Ay the Pythagorean theorem. Since a2 is even, d must be
even, for the square of any odd number is odd.2 Now the ratio a:p is in its
lowest terms. Hence B must be odd. Since cr is even, let a:2y' Then
o2 : 4y2 : 2F2. lts'ence F' : 2y'and so p2 is even' Then B is even' But is
p
also odd and so there is a contradiction.
This proot which is of course the same as the modern one that /t is
irrational, was included in older editions of Euclid's Elements as Proposition
I l7 of Book X. Elowever, it was most likely not in Euclid's original text and
so is omitted in modern editions'
Incommensurable ratios are expressed in modern mathematics by
irrational numbers. But the Pythagoreans would not accePt such numbers.
The Babylonians did work with such numbers by approximating them,
though tirey probably did not know that their sexagesimal fractional
.pp"o*i-ution. could never be made exact. Nor did the Egyptians recognize
the distinctive nature of irrationals. The Pythagoreans did at least recognize
that incommensurable ratios are entirely different in character from com-
mensurable ones.
This discovery posed a problem that was central in Greek mathematics'
The Pythagoreans had, uP to this point, identified number with geometry'
But ttre existence of incommensurable ratios shattered this identification.
They did not cease to consider all kinds of lengths, areas, and ratios in
geometry, but they restricted the consideration of numerical ratios to com-
t.rrr.utl" ones. The theory of proportions for incommensurable ratios and
all kinds of magnitudes was provided by Eudoxus, whose work we shall
consider shortly.
Some geometrical results are also credited to the Pythagoreans' The
most famous is the Pythagorean theorem itselt a key theorem of Euclidean
geometry. The Pythagoreans are also supposed to have discovered what we
Lu.n theorems about triangles, parallel lines, polygons, circles, spheres,
". regular polyhedra. They knew in particular that the sum of the
and the
angles of alriangle is 180". A limited theory of similar figures and the fact
that a plane can te filled out with equilateral triangles, squares, and regular
hexagons are included among their results'
The Pythagoreans started work on a class of problems known as
2. Any odd whole number can be expressed as 2n * | for some z' Then (2n * l)t :
4n2 + 4n * l, and this is necessarily odd.
34 THE CREATTON OF CLASSICAL GREEK MATTTEMATICS
llll
Figure 3,7 A D C B
.A
B. .B
Figure 3.8 c. .c
Aristotle: "It says that the slowest moving object cannot be overtaken by
the fastest since the pursuer must first arrive at the point from which the
pursued started so that necessarily the slower one is always ahead. The argu-
ment is similar to that of the Dichotomy, but the diflerence is that we are not
dividing in halves the distances which have to be passed over." Aristotle
then says that if the slowly moving object covers a finite distance, it can be
overtaken for the same reason he gives in answering the first paradox.
The next two paradoxes are directed against " cinematographic "
motion. The third paradox, called the Arrow, is given by Aristotle as fol-
lows: "The third paradox he [Zeno] spoke about, is that a moving arrow is
at a standstill. This he concludes from the assumption that time is made up
of instants. If it would not be for this supposition, there would be no such
conclusion." According to Aristotle, Zeno means that at any instant during
its motion the arrow occupies a definite position and so is at rest. Hence it
cannot be in motion. Aristotle says that this paradox fails if we do not grant
indivisible units of time.
The fourth paradox, called the Stadium or the Moving Rows, is put by
Aristotle in these words: "The fourth is the argument about a set of bodies
moving on a race-course and passing another set of bodies equal in number
and moving in the oppoSite direction, the one starting from the end, the other
from the middle and both moving at equal speed; he [Zeno] concluded that
it follows that half the time is equal to double the time. The mistake is to
assume that two bodies moving at equal speeds take equal times in passing,
the one a body which is in motion, and the other a body of equal size which
is at rest, an assumption which is false."
The probable point of Zeno's fourth paradox can be stated as follows:
Suppose that there are three rows ofsoldiers, A, B, ar,.d, C (Fig. 3.8), and that
in the smallest unit of time -B moves one position to the left, while in that
time C moves one position to the right. Then relative to B, C has moved two
positions. Hence there must have been a smaller unit of time in which C was
one position to the right of .B or else half the unit of time equals the unit of
time.
It is possible that Zeno merely intended to point out that speed is
relative. C's speed relative to.B is not C's speed relative to l. Or he may have
meant there is no absolute space to which to refer speeds. Aristotle says that
Zeno's lallacy consists in supposing that things that move with the same
speed past a moving object and past a fixed object take the same time.
Neither Zeno's argument nor Aristotle's answer is clear. But if we think of
TIIE SOPHIST SCIIOOL 3t
Figure 3,9
Figure 3.10
6 E',H
;ir: fr'
If we denote E'H by y and. BA by a, then
6 !.
(r) 4
"12
3. The point G cannot be obtained dircctly from ttrc dcfinition of thc curvc bccausc '{8
rcaches lD at the samc instant as 8C reaches ,{D and so thcre is no point of intcEcction
of the rotating line and the horizontal line. G can bc obtaincd only as the limit ofprcccding
points of thc quadratrix. By using the calculus wc can show that ,{G : 2olt whcre a = AB'
40 THE CREATION OT CI,ASSICAL GREEK MATIIEMATICS
or
.2
:4'q';'
c
B:ut tf AH : tr, ther
6:
,x arc tan!.
Hence
u:
2au
atc tarL--
u,ttu -
or
y: xtanfr.
,,f,1
The curve, if constructible, could be used to trisect any acute angle. Let
{ be such an angle. Then trisect y so that E'H' : 2H'H. Dtaw B"C'
through fI' and let it cut the quadratrix in L.Draw AL. Then iLAD : $13,
for, by the argument which led to (l),
4LAD H'H
-E-
*LAD _s-l!.
ol2 o
But by (l)
6 v
o12 a
Hence
4LAD:t
Another famous discovery that resulted from the work on the construc-
tion problems was made by Hippocrates of Chios (5th cent. B.c.), the most
famous mathematician of his century, who is to be distinguished from his
contemporary Hippocrates of Cos, the father of Greek medicine. The mathe-
matician Hippocrates flourished in Athens during the second half of the
century; he was not a Sophist, but most likely a Pythagorean. He is credited
with the idea of arranging theorems so that later ones can be proven on the
THE SOPHIST SCHOOL 4t
Figure 3.1 I
basis of earlier ones, in the manner familiar to us from the study of Euclid.
He is also credited with introducing the indirect method of proof into mathe-.
matics. His text on geometry, called the Elements, is lost.
Hippocrates did not, of course, solve the problem ofsquaring the circle,
but he did solve a relatbd one. Let ABC be an isosceles right triangle (Fig.
3.1l) and let it be inscribed in the semicircle with center at O. Let AEB be
the semicircle with AB as diamter. Then
2:1
xg
c
2a
Then
xz : a! and, y2 - 2ax.
42 TrrE CREATTON OF CLASSTCAL CREEK MATHEMATICS
aides. During the classical period the study of mathematics and philosophy
was favored there. Though the main center for mathematics shifted to
Alexandria about 300 r.c., the Academy remained preeminent in philosophy
throughout the Alexandrian period. It lasted nine hundred years until it
was ctsed by the Christian emperor Justinian in l.o. 529 because it taught
" pagan and perverse learning."
Plato, one of the most informed men of his day, was not a mathema-
tician; but his enthusiasm for the subject and his belief in its importance for
philosophy and for the understanding of the universe encouraged mathe-
maticians to pursue it. It is noteworthy that almost all of the imPortant
mathematical work of the fourth century was done by the friends and pupils
of Plato. Plato himself seems to have been more concerned to improve and
perfect what was known.
Though we may not be sure to what extent the concepts of mathematics
were treated as abstractions prior to Plato's time, there is no question that
Plato and his successors did so regard them. Plato says that numbers and
geometrical concepts have nothing material in them and are distinct from
physical things. The concepts of mathematics are independent of experience
and have a reality of their own. They are discovered, not invented or
fashioned. This distinction between abstractions and material objects may
have come from Socrates.
A quotation from Plato'sRepublic may serve to illustrate the contem-
porary view of the mathematical concePts. Socrates addresses Glaucon:
4. Book VII, sec. 525; in B. Jowctt, Tlu Dialogucs of Pla!0, Clarcndon Prcss, 1953, Vol' 2'
44 THE CREATION OF CLASSTCAL GnEEK MATHEMATTCS
the necessity for the subsequent synthesis. The indirect method, as already
noted, is also attributed to Hippocrates.
The status of deductive structure with Plato is best indicated by a
passage in The RePublic.6 He saYs
You are aware that students of geometry, arithmetic and thc kindred
sciencesassumetheoddandtheevenandthefiguresandthreekindsof
anglesandthelikeintheirseveralbranchesofsciencelthesearetheir
hypotheses, which they and everybody are supposed to know, and there-
fore they do not deign to give any account of them either to themselves
or others; but they begin with them, and go on until they arrive at last'
and in a consistent manner, at their conclusion'
If this passage is indeed descriptive of the mathematics of the time, then
proof, *.r. c-ertainly made, but the axiomatic basis was implicit or may have
varied somewhat from one mathematician to another'
Platodidaffirmthedesirabilityofadeductiveorganizationofknowl-
edge.Thetaskofsciencewastodiscoverthestructureof(ideal)natureand
to lire it an articulation in a deductive system' Plato was the first to systema-
tizf the rules of rigorous demonstration, and his followers are supposed to
have arranged thirems in logical order' Also, we know that in Plato's
Academy tlie question arose whether a given problem can be solved at all on
the basis of the known truths and the hypotheses given in the problem'
Whether or not mathematics was actually deductively organized on the
basis of explicit axioms by the Platonists, there is no question that deductive
proof fro; some accepted principles was required from at least Plato's time
or*.rd. By insisting on this form of proof the Greeks were discarding all
,ules, procedrr"r, urrd fu"t, that had been accepted in the body of mathe-
matics for thousands of years preceding the Greek period'
Why did the Greeks insist on deductive proof? Since induction' obser-
vation, and experimentation were and still are vital sources of knowledge
heavily and advantageously employed by the sciences, why did the Greeks
prefer in mathematics deductive reasoning to the exclusion of all other
methods ? We know that the Greeks, the philosophical geometers
as they
were called, liked reasoning and speculation, as evidenced by their great
contributions to philosophy, logic, and theoretical science' Moreover'
philosophers are interested in obtaining truths' Whereas induction' experi-
rn"r,u,iorr, and generalizations based on experience yield only probable
knowledge, deduction gives absolutely certain results if the premises are
correct. ivlathematics in the classical Greek world was Part of the body
of
truths philosophers sought and accordingly had to be deductive'
Still another reason for the Greek preference for deduction may be
foundinthecontemPtshownbytheeducatedclassoftheclassicalGreek
6. Book VI, sec. 510.
4S rr{E CREATToN oF cLAssrcAL GREEK MATHEMATTCS
Eudoxus and Archytas had been the first originators of this far-
famed and highly-prized art of mechanics, which they employed as an
elegant illustration of geometrical truths, and as means of sustaining
experimentally, to the satisfaction of the senses, conclusions too intricate
for proof by words and diagrams. As, for example, to solve the problem,
so often required in constructing geometrical figures, given the two ex-
trcmcs, to find the two mean lines of a proportion, both of these mathe-
maticians had recourse to the aid of instrumene, adapting to their purpose
certain curves and sections oflines. But what \ /ith Plato's indignation at it,
and his invectives against it as the mere corruption and annihilation ofthe
one good of geometry, which was thus shamefully turning its back upon
thc unembodied objects of pure intelligence to recur to sensation, and to
ask help (not to be obtained without base supervisions and deprivation)
from matter, so it was that mechanics came to be separated from geom-
etry, and, repudiated and neglected by philosophers, took its place as a
military art.
This accounts for the poor development of experimental science and the
science of mechanics in the classical Greek period.
Whether or not historical research has isolated the relevant factors to
explain the Greek preference for deductive reasoning, we do know that they
were the first to insist on deductive reasoning as the sole method of proof in
mathematics. This requirement has been characteristic of mathematics ever
since and has distinguished mathematics from all other fields of knowledge
or investigation. However, we have yet to see to what extent later mathema-
ticians remained true to this principle.
TIIE PLATOMC SCHOOL
Figure 3,12
Hence we can see through coordinate geometry that x and y are the coordi-
nates of the point of intersection of two parabolas or a parabola and a
hyperbola. I\ier"e"h-rrs worked on the problem and saw both wap of
,oi.,oi"g it through pure geometry. According to the matlematical historian
Otto freugebaue. if eSS- ), the conic sections might have originated in work
on the construction of sundials.
Menaechmus introduced the conic sections by using three types of
cones (Fig. 3.I2), right-angled, acute-angled, and obtuse-angled, and cutting
48 TTTE CREATION OF CI.AS{iICAL GREEK MATTIEMATICS
geometry without concerning himself in the Ieast with the actual dimensions
of even one rectangle. Mathematical thought was thus separated from prac-
tical needs, and there was no compulsion for the mathematicians to improve
arithmetical and algebraic techniques. When the barrier between the cul-
tured and slave classes was breached in the Alexandrian period (300 a.c. to
about A.D. 600) and educated men interested themselves in practical affairs,
the emphasis shifted to quantitative knowledge and the development of
arithmetic and algebra.
To return to the contributions of Eudoxus, the powerful Greek method
of establishing the areas and volumes of curved figures, now called the
method of exhaustion, is also due to him. We shall examine the method and
its use, as given by Euclid, later. It is really the first step in the calculus but
does not use an explicit theory of limits. With it Eudoxus proved, for example,
that the areas of two circles are to each other as the squares of their radii, the
volumes of two spheres are to each other as the cubes of their radii, the
volume of a pyramid is one-third the volume of a prism of the same base and
altitude, and the volume ofa cone is one-third the volume ofthe correspond-
ing cylinder.
Some authority can be found to credit every school from Thales'
onward with having introduced the deductive organization of mathematics.
There is no question, however, that the work of Eudoxus established the
deductive organization ot tlu basis of cxplicit axioms. The necessity for under-
standing and operating with incommensurable ratios is undoubtedly the
reason for this step. Since Eudoxus undertook to provide the precise logical
basis for these ratios, he most likely saw the need to formulate aioms and
deduce consequences one by one so that no mistakes would be made with
these unfamiliar and troublesome magnitudes. This need to work with in-
commensurable ratios also undoubtedly reinforced the earlier decision to
rely only on deductive reasoning for proof.
Because the Greeks sought truths and had decided on deductive proot
they had to obtain axioms that were themselves truths. They did find itate-
ments whose ruth was self-evident to them, though the justifications given
for accepting the axioms as indisputable truths varied. Almost all Gieeks
believed that the mind was capable of recognizing truths. plato applied his
theory of anamnesis, that we have had direct experience of truths in a
period of existence as souls in another world before coming to earth, and we
have but to recall this experience to know that these truths included the
axioms of geometry. No experience on earth is necessary. Some historians
read into statements by Plato and proclus the idea that there can be some
arbitrariness in the axioms, provided only tlat they are clear and true in the
mind of the individual. The important thing is to reason deductively on the
basis of the ones chosen. Aristotle had a good deal to say about axioms and
we shall note his views in a moment.
ARIII{)TLE AND HIS SCEOOL 5I
defined things has to be proved except in the case of a few primary things
such as point and line, whose existence is assumed along with the first
principles or axioms. Thus one can define a square, but such a figure may
not exist; that is, the properties demanded in the definition may be incom-
patible. Leibniz gave the example ofa regular polyhedron with ten faces;
one can define such a figure but it does not exist. If one did not realize that
this figure did not exist, and proceeded to prove theorems about it, his results
would be nonsensical. The method of proving existence that Aristotle and
Euclid adopted was construction. The first three axioms in Euclid's Elements
grant the construction of straight Iines and circles; all other mathematical
concepts must be constructed to establish their existence. Thus angle tri-
sectors, though definable, are not constructible with straight lines and circles
and so could not be considered in Greek geometry.
Aristotle also treats the basic principles of mathematics. He distin-
guishes between the axioms or common notions, which are truths common to
all sciences, and the postulates, which are acceptable first principles for any
one science. Among axioms he includes logical principles, such as the law of
contradiction, the law ofexcluded middle, the axiom that if equals are added
or subtracted from equals the results are equal, and other such principles.
The postulates need not be self-evident but their truth must then be attested
to by the consequences derived from them. The collection of axioms and
postulates should be of the fewest possible, provided they enable all the
results to be proved. Though, as we shall see, Euclid uses Aristotle's distinc-
tion between common notions and postulates, all mathematicians up to the
late nineteenth century overlooked this distinction and treated axioms and
postulates as equally self-evident. According to Aristotle, the axioms are
obtained from the observation of physical objects. They are immediately
apprehended generalizations. He and his followers gave many definitions
and axioms or improved earlier ones. Some of the Aristotelian versions were
taLen up by Euclid.
Aristotle discusses the fundamental problem of how points and lines can
be related. A point, he says, is indivisible and has position. But then no
accumulation of points, however far it may be carried, can give us anything
divisible, whereas of course a line is a divisible magnitude. Hence points
cannot make up anything continuous like a line, for point cannot be con-
tinuous with point. A point, he says, is like the now in timel now is indi-
visible and not a part of time. A point may be an extremity, beginning, or
divider of a line but is not a part of it or of magnitude. It is only by rzo tion that
a point can generate a line and thus be the origin of magnitude. He also
argues that a point has no length and so ifa line were composed ofpoints, it
would have no length. Similarly, if time were composed of instants there
would be no interval of time. His definition of continuity, which a line pos-
sesses, is : A thing is continuous when the limits at which any two successive
ARISTOTLE AND HIS SCHOOL 53
parts touch are one and the same and are, as the word continuous imPlies'
ireld together. Actually Aristotle makes many statements about continuous
mug.ritide, which are not in agreement with each other' The substance of
hisioctrine, nevertheless, is that points and numbers are discrete quantities
and must be distinguished from the continuous magnitudes of geometry'
There is no continuum in arithmetic. As to the relation of the two
fields, he
is, the theory of numbers-more accurate' be'
considers arithmetic-that
cause numbers lend themselves to abstraction more readily than
the geo-
metric concepts. He also considers arithmetic to be prior to geometry because
the number three is needed to consider a triangle'
In discussing infinity he makes a distinction, which is imPortant today'
age of the
between the pof,ntially infinite and the actually infinite' The
infinite but is never at any
earth, if it hai a sudden beginning, is potentially
time actually infinite. According to Aristotle, only the potentially infinite
exists. The positive integers, he grants, are potentially
infinite in that we can
at*ay, aaa't to any nulber ani get a new one,- but the infinite set as such
does not exist. Further, most magnitudes cannot be even potentially infinite'
because if they were continually added to they could exceed the bounds of
the universe. Sp."e, ho*el,er, is potentially infinite, in that it can be re-
peatedly subdivided, and time is potentially infinite in both ways'
' A major achievement of Aristotle was the founding of the science of
had
logic. In producing correct laws of mathematical reasoning the Greeks
l"id th. giorrnd*ork for logic, but it took Aristotle-to codify and systematize
these laws into a separate Jiscipline' Aristotle's writings
make it abundantly
from mathematics' His basic principles of logic-
clear that he derived logic
be both true
the law of contradiction, which asserts that a proposition cannot
arrd false, and the law of excluded middle, which maintains that a proposi-
tionmustbeeithertrueorfalse-aretheheartoftheindirectmethodof
mathematical proof. Further, Aristotle used mathematical examples taken
from contemporary texts to illustrate his principles of reasoning'
Aristotelian
logic remainid unchallenged until the nineteenth century'
Though the science of logic was derived from mathematics' logic even-
tually came to be considered independent of and prior to mathematics and
logic
applicable to all reasoning. Even Aristotle, as already noted' regarded
a, p.elimirrary to sciencJand philosophy' In mathematics he emphasized
d.iuctir. proof as the sole basis for establishing facts' For Plato' who be-
lieved thai mathematical truths preexist or exist in a world independent
of
of theorems; the
man, reasoning was not the guarantee of the correctness
iog*r po*".. ilayed only a s""ot'daty role' They made explicit' so to speak'
what was alreadY known to be true'
One member of Aristotle's school especially worthy of note is Eudemus
g'c' and was the
of Rhodes, who lived in the last Part of the fourth century
by Proclus and Simplicius' As we
author of the Eudemian summary quoted
54 THE CREATTON OF CLASSICAL GREEK MATTiEMATTCS
Bibliograplry
Apostle, H, G.: Aristoth's Phihsoplry of Matlumatics, University of Chicago press,
1952.
BaII, W. W. Rouse: A Short Account of thc History of Matlumatits, Dover (reprint),
1960, Chaps. 2-3.
Boyer, Carl B.: A Histary of Mathcmatics, John Wiley and Sons, 1968, Chaps. 4-6.
Brumbaugh, Robert S.: Plato's Matlumatbal Imagination, Indiana University press,
1954.
Gomperz, Theodor: Greck Thin*as, 4 vols., John Murray, 1920.
Guthrie, W. K. C. : A History oJ Gruk Philosoplry, Cambridge University press, 1962
and 1965, Vols. I and 2.
Hamilton, Edith : Thc Greck Way to Westcrn Ciailizatiez, New American Library, 194g.
Heath, Thomas L.: A History of Grcek Mathemalics, Oxford University press, 1921,
Vol. 1
Chap. 3.
van der Waerden, B. L.: Scilncc Au&ning, P' Noordhoff, 1954, Chaps' 44'
Wedberg, Anders: Ptato's Phihsoptry of Matlumaties, Almqvist and Wiksell'
1955'
+
Euclid and Apollonius
l. Introdtution
The cream of the mathematical work created by the men of the classical
period has fortunately come down to us in the writings of two men, Euclid
and Apollonius. Chronologically, both belong to the second great period of
Greek history, the Hellenistic or Alexandrian. It is quite certain that Euclid
lived in Alexandria about 300 s.c. and trained students there, though his
own education was probably acquired in Plato's Academy. This informa-
tion, incidentally, is about all we have on Euclid's personal life and even this
'
iomes from a one-paragraph passage in Proclus' Commentarg. Apollonius died
in 190 r.c., so his life too falls within the Alexandrian period. It is customary,
however, to identify Euclid's work with the classical period, because his
bools are accounts of what was developed in that age. Euclid's work is
actually an organization of the separate discoveries of the classical Greeks;
this is clear from a comparison of its contents with what is known of the
earlier work. The Elements in particular is as much a mathematical history
ofthe agejust brought to a close as it is the logical development ofa subject.
Apollonius'work is generally classed with that of the Alexandrian period but
the content and spirit of his major work, Conic Sections, is of the classical
period. In fact Apollonius acknowledged that the first four of the work's
eight books were a revision of Euclid's lost work on the same subject. Pappus
mentions that Apollonius spent a long time with the pupils of Euclid at
Alexandria, which readily explains the kinship with Euclid. The justification
for identifying Apollonius with the work of the classical period will be more
apparent when we have learned the characteristics of the work of the
Alexandrian period.
56
ttELEMENrs" 5t
TIIE BAcKcRouND oF EUoLID'S
text for siudents. Proclus gives some weight to the latter belief'
In view of the length and incomparable historical importance of this
58
work, we shall devote several sections of this chapter to a review of and
comment on the contents. Since we still learn Euclidean geometry, we may
be somewhat surprised by the contents of the Elemmts. The high school
versions most widely used during our century are patterned on Legendre's
modification of Euclid's work. Some algebra used by Legendre is not in the
Elcmmts, though, as we shall see, the equivalent geometrical material is.
l. T. L. Heath: Tlu Thirteen Books of Erclid's E)ements, Dover (reprint), 1956,3 vols.
THE DEFINITIONS AND AXIOMS OF THE..ELEMBNIS,, 59
23. Parallel straight lines are straight lines which, being in the same plane
and being p.odrrced indefinitely in both directions, do not meet one another
in either direction.
Theopeningdefinitionsareframedintermsofconceptsthatarenot
defined, urrd h.rr.. serve no logical purpose. Euclid might not have realized
that the initial concepts must be undefined and was naively explaining their
meaning in terms ofphysical concepts. Some commentators say he appre-
ciated that the definitions were not logically helpful but wanted to explain
what his terms rePresented intuitively so that his readers would be convinced
that the axioms and Postulates were applicable to these concePts'
Euclid next lays down five postulates and five common notions (Proclus
uses the term "axiom" for " common notion")' He adopts
the distinction
already made by Aristotle, namely, that the common notions are truths
applicable to all sciences whereas the Postulates apply only to geometry' As
*" h.r" noted, Aristotle said that the postulates need not be known to be
true but that their truth would be tested by whether the results deduced
from them agreed with reality. Proclus even speaks of all of mathematics
as hypothetiJal ; that is, it merely deduces what must follow from the
urr,r-p,iorrr, whether or not the latier are true' Presumably Euclid accepted
Aristotle's views concerning the truth of the postulates' However' in the
subsequent history of mathematics, both the Postulates and the cotnmon
notions were accepted as unquestionable truths, at least until the advent
of
non-Euclidean geometry.
Euclid postulates the following:
POSTULATES
l. [It is possible] to draw a straight line from any point to any point'
COMMON NOTIONS
l. Things which are equal to the same thing are also equal to one another'
2. If equals be added to equals, the wholes are equal'
3. If equals be subtracted from equals, the remainders are equal'
60
4. Things which coincide with one another are equal to one another.
5. The whole is greater than the part.
Euclid does not naively assume that the defined concepts exist or are
consistentI as Aristotle had pointed out, one might define something that
had incompatible properties. The first three postulates, since they declare
the possibility of constructing lines and circles, are existence assertions for
these two entities. In the development ofBook I, Euclid proves the existence
of the other entities by constructing them. An exception is the plane.
Euclid presupposes that the line in Postulate I is unique; this assump-
tion is implicit in Book I, Proposition 4. It would have been better, however,
to make it explicit. Likewise, in Postulate 2 Euclid assumes the extension is
unique. fle uses the uniqueness explicitly in Book XI, Proposition 1, but has
actually already used it unconsciously at the very beginning of Book I.
Postulate 5 is Euclid's own; it is a mark of his genius that he recognized
its necessity, Many Greeks objected to this postulate because it was not
clearly self-evident and hence lacked the appeal of the others. The attempts
to prove it from the other axioms and postulates-which, according to
Proclus, commenced even in Euclid's own time-all failed. The full
history of these efforts will be related in the discussion of non-Euclidean
geometry.
As to the common notions, there are differences of opinion over which
ones were in Euclid's original version. Common Notion 4, which is the basis
for proof by superposition, is geometrical in character and should be a
postulate. Euclid uses superposition in Book I, Propositions 4 and B, though
apparently he was unhappy about the method; he could have used it to
prove Proposition 26 (a.s.a. : a.s.a. and. s.a.a. : s.a.a.), but instead uses a
longer proof. He probably found the method in works of older geometers and
did not know how to avoid it. More axioms were added to Euclid's by
Pappus and others who found Euclid's set inadequate.
Figure 4.1
The proof (Fig. a.3) requires an infinitely extensible straight line, be-
cause in it AE 'ts extended its own length to fl and it must be possible to do
this.
Proposition 20. The sum of any two sides of a triangle is greater than the
third side.
This theorem is as close as one comes in Euclidean geometry to the fact
that the straight line is the shortest distance between two points.
Proposition 27 . If a straight line falling on two straight lines makes the
alternate interior angles equal to one another, the straight lines will be
parallel to one angther.
The proofis made by contradiction using the proposition on the exterior
angle of a triangle. The theorem establishes the existence of at least one
parallel line to a given line and through a given point.
Proposition 29. A straight line falling on parallel straight lines makes the
alternate interior angles equal to one another, the exterior angle equal to
the interior and opposite angle [corresponding angles are equal], and the
interior angles on the same side equal to two right angles.
The proof (Fig. a.a) supposes 4l + 42. If {2 is greater, add {4 to
each. Then 42 + 44 > +l + {4. This implies that {l * {4 is less
than two right angles. By the parallel postulate, which is used for the first
time, the two given lines z48 and CD would have to meet whereas they are
parallel by hypothesis.
Proposition 44. To a given straight line to apply, in a given rectilinear
angle, a parallelogram equal to a given triangle.
This proposition (Fig. a.5) says that we are given a triangle C, an angle
D, and a line segment lB. We are to construct on AB as one side a parallelo-
gram equal in area to C and containing angle D as one angle. We shall not
give Euclid's proof, which depends upon preceding propositions. The major
point ofnote is that this is the first ofthe problems included under the theory
ofapplication ofareas, the theory ascribed to the Pythagoreans by Eudemus
(as reported by Proclus). In this case we apply (exactly) an atea to AB.
Booxs r ro rv oF ttG "Er,EIlEttts" 63
Figure 4.5
Figure 4.6
64
Figure 4.7
The theorem also shows how to obtain a given square whose area is a
sum of two given squares, that is, how to find r such that x2 : a2 * b2.
Hence it is another example of geometrical algebra.
AB2+AC2:BC2
and we have from the right triangle ADC that
ADz+ACz:DCg.
Since,4B : AD then BCz : DCz and so DC : .BC. Hence the two triangles
are congruent by.r.s.s., so that angle CAB mtxt be a right angle.
b ab b2
l, a2 ab
a b
Figure 4.9
area to the given product. The other side of the rectangle is, of course, the
quotient. The construction uses the theory of application of areas already
touched upon in Proposition 44 of Book I. The addition and subtraction of
products are the addition and subtraction of rectangles. The sum or differ-
ence is transformed into a single rectangle by means of the method of appli-
cation ofareas. The extraction ofa square root is, in this geometrical algebra,
the finding ofa square equal in area to a rectangle whose area is the given
quantity; this is done in Proposition 14 (see below) '
.ThefirsttenpropositionsofBooklldealgeometricallywiththefollow.
ing equivalent algebraic propositions. stated in our notation some ofthese are:
l. + c + d + "') : ab + ac +
a(b ad + "' ;
2. (a + b)a + (a + b)b : (a + b)2;
3. (a + b)a: ab * a2;
4.(a+b)2:a2+2ab+b2;
'\2
5. ab *{i. . b) - b\ : {Lr'* b)l ;
Figure 4.1 I
Proposition ll. To cut a given straight line so that the rectangle contained
by the whole and one of the segments is equal to the square on the remaining
segment.
This requires that we divide lB (Fig. a.10) at some point I/ so that
AB.BH : AH.AH. Euclid's construction is as follows: Let AB be the given
line, Construct the square ABDC. Let E be the midpoint of AC. Draw BE.
Let F on C,tl produced be such that EF : EB. Construct the square lFGIl.
Then rlI is the desired point on lB, that is,
AB.BH: AH.AH.
The proof is made by means of areas, using preceding theorems, including
the Pythagorean theorem-the crucial theorem being Proposition 6.
The importance of the theorem is that r4.B of length a is divided into
two s€gments of lengths x and a - r so that
(a-r)a:x2
or
x2 + ax: a2,
Figure 4.1
2. By the usual current definition of an angle between two curves, the horn angle is of
zeto size.
BooK v: TIIE THEoRY oF PRoPoRTIoN 69
Definition 4. Magnitudes are said to have a ratio to one another which are
capable, when multiplied, of exceeding one another.
The meaning of this definition is that the magnitudes a an.d b have a
ratio if some integral multiple z (including I ) of a exceeds D and some integral
multiple (including 1) of D exceeds a. The definition excludes the concept
that appeared later, namely, the infinitely small quantity which is not 0,
called the infinitesimal. Euclid's definition does not allow a ratio between
two magnitudes if one is so small that some finite multiple of it does not
exceed the other. The definition also excludes infinitely large magnitudes
because then no finite multiple of the smaller one will exceed the larger. The
next definition is the key one.
Definition 5. Magnitudes are said to be in the same ratio, the first to the
second and the third to the fourth, when, if any equimultiples whatever be
taken of the first and third, and any equimultiples whatever ofthe second and
fourth, the former equimultiples alike exceed, are alike equal to, or alike fall
short of the latter equimultiples respectively taken in corresponding order'
The definition says that
ac
bd
if when we multiply a and cby ang whole number m, say, and b and dby ang
whole number z, then for all such choices of rn and z,
Just to see the meaning of this definition, let us use modern numbers'
To test whether
\/, ^/6
I \/3
7o EUCLTD AND APOLLOMUS
rwiew of Books I to IV. However, this use of geometry did not take care of
ratios and proportions of incommensurable magnitudes of all sorts, and this
lack was filled by Book V, which started anew with a general theory of mag-
nitude. It thereby placed all of Greek geometry that dealt with magnitudes
on a sound basis. The critical question, however, has been whether the theory
of magnitudes provided a logical basis for a theory ofreal numbers, including,
of course, the irrational numbers.
There is no question about how succeeding generations of mathema-
ticians interpreted Euclid's theory of magnitudes. They regarded it as
applicable only to geometry and therefore took the attitude that only
geometry was rigorous. Hence, when in the Renaissance and in the following
centuries irrational numbers were reintroduced and used, many mathema-
ticians objected because these numbers had no logical foundation.
A critical examination of Book V seems to establish that they were
right. It is true that the definitions and proofs as presented by Euclid in
Book V make no use of geometry. As already noted his use of line segments
in his presentation of the propositions and proofs is pedagogical only. How-
ever, if Euclid had really oflered a theory of irrationals in his theory of mag-
nitudes, it would have had to come from either one of two possible interpreta-
tions. The first is that magnitudes themselves could be taken to be the
irrational numbers, and the second is that the ratios of two magnitudes
could be the irrational numbers.
Let us suppose the magnitudes themselves could be the irrational
numbers. Then, even if we leave aside any criticisms of Euclid's rigor when
judged by modern standards, the following difficulties enter. Euclid never
defines what he means by a magnitude, or the equality or equivalence of
magnitudes. Moreover, Euclid works not with the magnitudes themselves
but with proportions. A product of two magnitudes a and 6 occurs only when
a and. b are lengths, thereby enabling Euclid to consider the product as an
area. The product aD could not then be a number because the product has
no general meaning in Euclid. Further, Euclid proves in Book V a number of
theorems on proportion which in themselves can readily be restated, as in
fact we did above, as algebraic theorems. Flowever, to prove Proposition 18
of Book V he needs to establish the fourth proportional to three given magni
tudes, which he is able to do only for magnitudes that are line segments
(Book VI, Proposition l2). Hence, not only is his theory of general magni
tudes incomplete (even for proofs he himself makes in Book XII), but what
he does establish for lengths is dependent on geometry. Moreover, Euclid
insists in Definition 3 that a ratio can be formed only of magnitudes of the
same kind. Clearly if magnitudes were numbers this limitation would be
meaningless. His concept of magnitude as used later adheres to the definition
and so is tied to geometry. Another difficulty is that there is no system of
rational numbers to which a theory of irrationals could be added. Ratios of
BOOK \Ir: STMTLAR FTGURES 73
Definition 3. A straight line is cut in extreme and mean ratio when the whole
line is to the greater segment as the greater to the less'
Definition 4. The height of any figure is the perpendicular drawn from the
vertex to the base.
This ilefinition is certainly vague but Euclid does employ it.
In the proofs of the theorems in this book, Euclid, using his theory of
proportion, does not have to treat separately the commensurable and incom-
mensurable cases, a separation introduced by Legendre, who used an alge-
braic definition of proportion limited to commensurable quantities and so
had to treat the incommensurable cases by another argument such as a
reduetio ad absurdum.
We shall note only some of the thirty-three theorems. Again we shall
find some basic results of modern algebra treated geometrically.
Proposition l. Triangles and parallelograms [i.e. the areas] which are under
the same height [have the same altitudes] are to one another as their bases'
74 EUCLID AND APOLLONTUS
Figure 4.I3
Here Euclid uses a proportion among four magnitudes two of which are
areat.
Proposition 4. In equiangular triangles the sides about the equal angles are
proportional, and those are the corresponding sides which subtend the equal
angles.
Figure 4.1,1
- azc
r<zr'
On the other hand the condition that equation (l) regarded as a quadratic
in * have a real root is that its discriminant be Sreater than or equd to 0'
That is,
a,-q!s>oc
a2c
,s<
46'
D.-------r------- E
-'l
I
I
I
I
F
Figure 4.15
EUCLID AND APOLLOMUS
76
s cD
b
s l, D'
Figurc 4.16 A
Thus the proposition tells us not only what the largest possible value of S is
but that for all possible values there is an r which satisfies (I) and which
geometrically furnishes one side, Kd of the tectangle AF. This result will be
used in the next proposition.
Before considering it let us note an interesting special case ofProposition
27. Suppose the given parallelogram AD (Fig.4.l5) is a square. Then of all
rectangles on lB deficient by a square similar to AD, the square on lC is the
greatest. But the rectangle lF on (part of) AB has arca AK'KF and since
KF : KB, the perimeter of this rectangle is the same as that of square DB or
square lD. Bwt AD has Iarger area than lF. Thus of all rectangles with the
satne perimeter the square has the greatest area.
Proposition 28. To a given straight line to apply [on part of the line as side]
a parallelogram equal to a given rectilinear figure [S] and deficient firom the
parallelogram on the entire line] by a parallelogramic figure similar to a
given one [D]. Thus [by Proposition 27] the given rectilinear figure [S] must
not be greater than the parallelogram described on half of the sraight line
and similar to the defect.
This theorem is the geometrical equivalent of the solution of the quad-
ratic equation ax - (blc)x2 : S, where S is the area of the given rectilinear
figure and is subject to the condition for a real solution, namely, S is not
greater than alcl4b.To see this, suPpose (for convenience) that the parallelo-
grams are rectangles (fig. a.16), ,S is the given rectilinear figure, D is the
other given rectangular figure with sides c and b, ais AB, and r is one side of
the desired rectangle. What Euclid constructs is the rectangle./KFG of area S
and such that the defect D' is similar to D. B:ut AKFG : ABHG - D'. Since
D' is similar to D its area is bxz lc. H.ence
(2) S=ax-!xz.
c
Thus to construct IKFG is to trnd 4( and x such that .r satisfies equation (2).
IX: THE THEORY OF NUMBERS 77
BOOKS \1I, V[I, AND
a parallelogram equal to a
Proposition 29. To a given straight line to apply
;;;;;;;tlt;.." fig,r.J [S] and eiceeding bv a parallelogramic figure similar
to a given one [D].
In afebraic terms this theorem solves
* 1!*': S
Definition 17. And when three numbers having multiplied one another
make some number, the number so produced is solid, and its sides are the
numbers which have multiplied one another.
Definition 20. Numbers are proportional when the first is the same multiple,
or the same part, or the same parts of the second that the third is of the fourth.
Definition 22. A perfect number is that which is equal to [the sum of] its
own Parts.
Propositions I and 2 give the process of finding the greatest common
measure (divisor) of two numbers. Euclid describes the process by saying
BOOXS vII, \IIIr, AND D<: TIIE TIrEORY OF NUMBERS 79
thzr Aand B are the numbers and B < l, then subtract B fron A enough
if
times until a number C less than -B is ldt. Then subtract C from B enough
times until a number less than c is left. And so on. If A and, B are relatively
prime we arrive at I as the last remainder. Then I is the greatest common
diriror. If I and I are not relatively prime we arrive at some stage where the
last number measures the one before it. This last number is the greatest
common divisor of z4 and 8. This process is referred to today as the Euclidean
algorithm.
Simple theorems about numbers follow. For example, if a: bln and
c : dln,ih", o t c : (b t /)/n. Some are just the theorems on ProPortion
previously proved for magnitudes and now proved all over again for num-
Lers' Thuslf alb: cld, then (a - c)l@ - d) : alb' Also' in Definition 15
a.6 is defined as D added to itself 4 times. Hence Euclid proves that ab
: ba'
Proposition 30. If
two numbers by multiplying one another make some
,,r.ib"r, and any prime number measures the product, it will also measure
one of the original numbers.
This result is fundamental in the modern theory of numbers' We say
that if a prime p divides a product of two whole numbers it must divide at
least one of the liactors'
Book VIII continues with the theory of numberr; no new definitions are
needed. In essence the book treats geometrical progressions, A geometrical
progression is to Euclid a set of numbers in continued proportion, that is'
i1A": 4t : cld : d.le : '". This continued proportion satisfies our defini-
tion of geometric progression, for if a, b, c, d, c, ' ' ' are in geometrical Prc'
gression the ratio of any term to the next one is a constant'
Book IX concludes the work on the theory of numbcrs' There are
theorems on square and cube numbers, plane and solid numbers, and more
theorems on continued ProPortions. Of note are the following:
At the conclusion of the prool Euclid says the theorem can be proven if
the parts subtracted be halves. One steP in the proof utilizes an axiom' not
recognized as such by Euclid, to the effect that of two unequal
"orrr"iorrly
magnitudes the smaller can be added to itself a finite number of times, so
,. f h.rr" the sum exceed the larger. Euclid bases the questionable step on
the definition of a ratio between two magnitudes (Definition 4 of Book V)'
But this definition does notjustily the step, It says that two magnitudes
have
a ratio when either can be added to itself enough times to have the sum
exceed the other; hence Euclid should prove that this can be done for
the
magnitudes he deals with. Instead he assumes that his magnitudes have a
.uti-o.rd uses the fact that the smaller can be added to itself enough times to
exceed the greater. According to Archimedes' the axiom in question
(strictly
an equivalJrt statement) was used by Eudoxus, who had established it as a
l"--.. Archimedes uses this lemma without proof and so, in effect, he too
uses it as an axiom. It is called today the axiom of Archimedes-Eudoxus'
There are I 15 propositions in Book X, though Propositions 116 and ll7
are found in some iditio.rt of Euclid. The latter gives the proof already de-
scribed in Chapter 3 of the irrationality of Vf '
9. Books XI, Xil, and XIII: Solid Geometry and the Method of
Exhaustion
Book begins the treatment of solid geometry, though some important
XI
theorems on plurr" geometry are yet to come' The book opens with defini-
tions.
Figurc 4.17
regular icosahedron, and other figures. The sphere is defined as the figure
comprehended by a semicircle rotated around a diameter, The cone is
defined as the figure comprehended by rotating a right-angled triangle about
one of the arms, Then if the fixed arm or axis is less, equal to, or greater than
the other arm, the cone is obtuse-angled, right-angled, or acute-angled,
respectively. The cylinder is the figure comprehended by rotating a rectangle
around one side. The significance of these last three definitions is that solid
figures, except for the regular polyhedra, arise from rotating plane figures
about an axis.
The definitions are loose, unclear, and often assume theorems. For
erample, in Definition 6 it is assumed that the acute angle is the same no
matter where it is formed on the common section of the planes. Also Euclid
intends to consider convex solids only, but does not specify this in his defini-
tions of the regular polyhedra.
The book considers only figures formed by plane elements. The first 19
of the 39 theorems of this book treat properties of lines and planes, for
example, theorems on lines perpendicular and parallel to planes. The proofs
of the early theorems in this book are not adequate, and many general
theorems about polyhedra are proved only for special cases.
Proposition 2O. lfa solid angle be contained by three plane angles, any two,
taken together in any manner, are greater than the remaining one.
That is, of the three plane angles (Fig. a.l7) CAB, CAD, and BAD, the
sum ofany two is greater than the third one.
Proposition 21. Any solid angle is contained by plane angles [the sum of
which is] less than four right angles.
Proposition 31. Parallelepipedal solids which are on equal bases and ofthe
same height are equal [equivalent] to one another.
Proposition 32. Parallelepipedal solids which are of the same height are to
one another as their bases.
solrD GEouETRY AND THE METHoD oF ExHAUsrroN 83
Figure 4.
TIIE METIIOD OF EXHAUSTION B5
SOLID GEOMETRY AND
S' until we arrive at P' say, which is such that its area differs from S' by
one,
less than S' - S'. This polygon can be constructed because we proved
above that the difference between the circle s' and inscribed regular poly-
gons can be made less than any given magnitude and so less than S' - S"'
Then
P;$: P':S'.
However,sinceP<Sthen
P', < s'.
But by (5) this is a contradiction'
Similarly, one can show that S' cannot be greater than S" Hence
:
S" S', and in view of (4), the Proportion (3) is established.
This method is used to prove such critical and difficult theorems as:
Proposition 5. Pyramids which are of the same height and have triangular
bases are to one another as the bases.
Proposition 10. Any cone is a third part of the cylinder which has the same
base with it and equal height.
Proposition I 1. Cones and cylinders which are of the same height are to one
another as their bases.
proposition 12, Similar cones and cylinders are to one another in the tripli-
cate ratio [ratio of the cubes] of the diameters in their bases'
proposition lB. spheres are to one other in the triplicate ratio of their
respective diameters.
The proof that no more than five regular solids can exist depends upon
an earlier theorem, Book XI, Proposition 21, that the faces of a solid angle
must contain less than 360'. Hence if we put together equilateral triangles
we can have three meet at each vertex of the regular solid to form a tetra-
hedron; we can use four at a time to form an octahedronl and we can use
five to form the icosahedron. Six equilateral triangles at one vertex would
add up to 360'and so cannot be used. We can use three squares at any one
vertex and thus form the cube, Then we can use three regular pentagons
at each vertex to form the dodecahedron. No other regular polygons can be
used, for even three coming together at one point will form.an angle of 360"
or more. Note that Euclid assumes convex regular solids. There are other,
nonconvex regular solids.
The thirteen books of the Elemcnts contain 467 propositions. In some of
the old editions there are two more books, both of which contain more
results on regular solids, though Book XV is unclear and inaccurate. IIow-
ever, both postdate Euclid. Book XIV is due to Hypsicles (c. 150 r.c.), and
parts of Book XV were probably written as late as the sixth century a.o.
edge and compass constructions. It is because Euclid could not establish the
existence ofangle trisectors that he proved no theorems involving them.
Despite some omissions and errors of proof that we shall point out
shortly, Euclid's choice of axioms is remarkable. From a small set he was
able to prove hundreds of theorems, many ofthem deep ones. Moreover, his
choice was sophisticated. His handling of the parallel axiom is especially
clever. Euclid undoubtedly knew that any such axiom states exPlicitly or
implicitly what must happen in the infinite reaches of space and that any
pronouncement on what must be true of infinite space is physically dubious
because man's experiences are limited. Nevertheless, he also realized that
some such axiom is indispensable. He therefore chose a version that states
conditions under which two lines will meet at a finitely distant point' More-
over, he proved all the theorems he could before calling upon this axiom.
Though Euclid used superposition of figures to establish congruence' a
method which rests on Common Notion 4, he was evidently concerned about
the soundness of the method, There are two objections to it: First, the con-
cept of motion is utilized and there is no logical basis for this concept; and
second, the method of superposition assumes that a figure retains all its
properties when moved from one position to another. The displaced figure
may indeed be proved congruent to a second one but the first figure in its
original position may not be congruent to the second one. To assume that
moving a figure does not change its properties is to make a strong assumption
about physical space. Indeed the very PurPose of Euclidean geometry is to
compare figures in different positions. The evidence for Euclid's concern
about the method's soundness is that he did not use it for proofs that he could
make by other means, even though superposition would have permitted a
simpler proof.
Though mathematicians generally did regard Euclid's work as a model
of rigor until well into the nineteenth century, there are serious defects that
a few mathematicians recognized and struggled with. The first is the use of
superposition. The second is the vagueness of some of his definitions and the
pointlessness of others. The initial definitions of point, line, and surlirce have
no precise mathematical meanings and, as we now recognize, could not havc
been given any because any independent mathematical development must
have undefined terms (see sec. 3). As to the vagueness of many definitions,
we have but to refer back to those in Book V as an example. An additional
objection to the definitions is that several, such as Definition 17 of Book I,
presuppose an axiom.
A critical study of Euclid, with, of course, the advantage of present
insights, shows that he uses dozens of assumptions that he never states amd
undoubtedly did not recognize. A few have been mentioned in our survey.
What Euclid and hundreds of the best mathematicians of later generations
did was to use facts either evident from the figures or intuitively so evident
88
that they did not realize they were using them. In a few instances the un-
conscious assumptions could be obviated by proofs based on the explicit
assumptions, but this is not true generally.
Among the assumptions made unconsciously are those concerning the
continuity of lines and circles. The proof of Proposition I of Book I assumes
that the two circles have a point in common. Each circle is a collection of
points, and it could be that though the circles cross each other there is no
common point on the two circles at the supposed point or points of inter-
section. The same criticism applies to the straight line. Two lines may cross
each other and yet not have a common point as far as the logical basis in the
Ehmmk is concerned.
There are also defects in the proofs actually given. Some of these are
mistakes made by Euclid that can be remedied, though new proofs would be
needed in a few instances. Another kind of defect that runs throughout the
Elemmk is the statement of a general theorem that is proved only for special
cases or for special positions of the given data.
Though we have praised Euclid for the overall organization of the
contents of the Elements, the thirteen books are not a unity, but are to an
extent compilations of previous works. For example, we have already noted
that Bools VII, VIII, and IX repeat for whole numbers many results given
for magnitudes. The first part of Book XIII repeats results of Books II and
IV. Books X and XIII probably were a unit before Euclid and were due to
Theaetetus.
Despite these defects, many of which were pointed out by later com-
mentators (Chap. 42, sec. l) and very likely also by immediate successors of
Euclid, the Elements was so successful that it displaced all previous texts on
geometry. In the third century n.c., when others were still extant, even
Apollonius and Archimedes referred to the Elements for prior results.
of this lost work of four books became substantially the first three books of
Apollonius' Conic Secticns. Euclid treated the conics as sections olthe three
different types of cones (right-angled, acute-angled, and obtuse-angled)'
The ellipse was also obtai.rid as a section of any cone and of a circular
cylinder. As we shall see, Apollonius changed the approach to
the conic
sections.
The Pseudaria of Euclid contained correct and false geometric proofs
and was intended for the training of students' The work is lost'
On Diaisions [of figures], mentioned by Proclus, treats the subdivision
of
a given figure into othe, figr.e., as a triangle into smaller triangles or a
t.iirgl" inio triangles and quadrilaterals' A Latin translation' probably due
to Gfrard of Cremona (l li4-87), of an incorrect and incomplete Arabic
versionexists.Inl85lFranzWoepckefoundandtranslatedanotherArabic
version that seems to be correct. There is an English translation by
R' C'
Archibald.
AnotherlostworkisthePorisms.Thecontentsandeventhenatureofthe
work are largely unknown. Pappus it his Mathematieal Collection says that the
prresrzs consited of three books. It is believed from the remarks of Pappus
and Proclus that the Porisms d,ealt essentially with constructions of geometric
objects whose existence was already assured' Thus these problems
were inter-
pure theorems and constructions establishing existence- To
-Jdi.," between typical of
find the center of a circle under some given conditions would be
the problems in the Porisms.
The work Surface-I'oci, composed of two books, is mentioned by Pappus
in his collection This work, which is not extant, probably dealt with loci that
are surfaces.
Euclid's Phaenomena, though a text on astronomy, contains lB proposi-
tions on spherical Seometry ard othe"t on uniformly rotating spheres' The
earth is treated as a sphere. Some versions are extant'
Figure 4.20
the axis of the cone lies; IBC is called an axial triangle. Let this triangle cut
the conic in PP' . (PP' need not be an axis of the conic section') PP' M is tbe
Iine determined iy the intersection of the cutting plane and the aliil
triangle.s In the conic section let Q'Q be any chord which is parallel to D.E
ff."i 8Q'need not be perpendicular to PP'. Apollonius then proves that
Q'Q is bisected by PP' so that VQ is half of Q'Q.
Now draw AF parallel to PM to rieet BM in tr' say' Next draw Pl
perpendicular to PM arrd in the plane of the section' For the ellipse and
hyperbola .L is chosen to satisfr the condition
PL BF.FC
fl -vlz-'
PL BCA
T.e:
-n,s,Ac'
3. Apollonius points out that if the cone is scalene then P.tlf ir not nccecsarily pcrpcndicular
is
to O'8. f..p.rrdi"ularity holds only for right circular concs or whcn thc planc of zlBC
lrcrpendicular to thc basc of a scalene conc.
92
In tlre cases of ellipse and hyperbola we now draw P'L. From V draw VR
parallel to PL to rr,cet P'L in R. (In the case of the hyperbola the location
ofP'is on the other branch and P'Lhas to be extended to locate rt.)
After some subordinate constructions which we do not give Apollonius
proves that for the ellipse and hyperbola
(6) QVz : PV.VR.
Now Apollonius refers to QV as an ordinate and the result (6) says that the
square of the ordinate equals a rectangle applied to PZ, namely PV.VR.
Moreover, he proves that in the case of the ellipse this rectangle falls short
of the entire rectangle PV.PL by the rectangle IR which is similar to the
entire rectangle formed by PL and PP'. Hence the term "ellipse" (sec. 6).
In the case of the hyperbola, (6) still holds but the construction would
show that Zrt is longer than PL so that the rectangle PV.VR exceeds the
rectangle appli ed to PL, that is, PL.PV, by the rectangle ZR which is similar
to the rectangle formed by PL and PP'. Hence the term hyperbola. In the
case of the parabola Apollonius shows that in place of (6),
Y2 = 2!x.
For the ellipse, we note that we first get from the defining equation (6) that
YQ : PV'VR.
TIIE MATHEMATICAL WORK OF APOLLOMUS
Bwt PV.VR :
x(2p - ZS). AIso, because the rectangle Z-R is similar to the
rectangle determined by PL and PP' ,
ISx
PLd
Hence 1-S : 2pxld. Then
Y2 :2Px *ry'
In the Apollonian construction, / is infinite for the parabola and so we see
how the parabola appears as a limiting case of ellipse or hyperbola.
To pursue further Apollonius' treatment of the conics we need some
definitions of concepts that are still important in modern geometry. Consider
a set ofparallel chords in an ellipse, say the set parallel to PQ in Figure 4.21'
Apollonius proves that the centers of these chords lie on one line lB, which
is called a diameter of the conic. (The line PP' in the basic Figure 4.20 is a
diameter.) He then proves that if through C, the midpoint of AB, a line DE
be drawn parallel to the original family of chords, this line will bisect all the
chords parallel to AB. The Iine DE is called the conjugate diameter to lB'
In the case of the hyperbola (Fig' 4.22), the chords may be inside the
branches, e.C. PQ, and the length of the diameter is that cut off (if it is cut
off) between the two branches, AB in the figure. The chords parallel to r{8,
for example R.I{, then lie between the branches. The conjugate diameter to
AB, namely D.E, which is defined to be the mean proportional between lB
and the latus rectum ofthe hyperbola, does not cut the curve' In the parabola
any diameter, that, is a line passing through the midpoints of a family of
parallel chords, is always parallel to the axis of symmetry, but there is no
diameter conjugate to a given diameter because each chord of the liamily of
chords parallel to the given diameter is infinite in length. The axes of an
94
Figure ,L23
ellipse or a hyperbola are two diameters that are perpendicular to each other.
For the parabola (Fig. 4.23), the axis is the diameter whose corresponding
chords are perpendicular to the diameter.
After introducing the basic properties of the conic sections, Apollonius
provcs simple Iiacts about conjugate diameters. Book I also treats tangents
to conics. Apollonius conceives ofa tangent as a line which hasjust one point
in common with a conic section but which everywhere else lies outside of it.
He then shows that a straight line drawn through an extremity of a diameter
(point P in the basic Figure 4.20) and parallel to the corresponding chords
of that diameter (parallel to QQ' in that figure) will fall outside the conic
and no other straight line can fall between the said straight line and the
conic (see.Elazeants, Book III, Proposition 16). Therefore the said straight Iine
touches the conic, that is, is the tangent at P.
Another theorem on tangents asserts the following: Suppose PP'
Fig. a.2q is a diameter of a parabola and QZ is one of the chords corre-
sponding to that diameter. Then if a point I be taken on the diameter but
outside the curve and such that TP : PV,where Zis the foot of the ordinate
(chord) from Q to the diameter PP', then the line 7Q will touch the parabola
at Q. There are analogous theorems for the ellipse and hyperbola.
Apollonius proves next that if any diameter of the conic other than PP'
in the basic figure (4.20) be taken, the definitive property of the conic,
equations (6) and (7), remains the same; of course QV then refers to the
Figure 4.24
THE MATHEMATICAL WORK OF APOLLOMUS 95
Figure
Figure 4.28
That is, 7 divides ,lLS externally in the same r;rtio as .I divides it internally.
The line PQ is called the polar of the point T, and T, R, I, and S are said to
form a harmonic set of points. Also if any line through Z (Fig. 4.28), the
midpoint of PQ, meets the conic in R and S and meets the parallel to PQ
throu{,h Z in O, then
OR VR
AT:B'
The line through 7is the polar of V and O, R, V, and S are a harmonic set
of poins.
The book continues with the subject of the focal properties of central
conics; the focus ofa parabola is not mentioned here. The foci (the word is
not used by Apollonius) are defined for the ellipse and the hyperbola (Fig.
4.29) as the points F and F'on the (major) axis AA' such that AF.FA' :
AF'.F'A' : 2p.AA'14. Apollonius proves for the ellipse and the hyperbola
that the lines PF and PF' from a point P on the conic make equal angles with
the tangent at P and that the sum (for the ellipse) of the focal distances PF
znd, PF' equals AA' and the diflerence of the focal distances (for the hy-
perbola) eqrals AA'.
No concept of directrix appears in this work, but the fact that a conic
is a locus ofpoints the ratio ofwhose distances from a fixed point (focus) and
a fixed line (directrix) is constant was known to Euclid and is stated and
proved by Pappus (Chap. 5, sec. 7).
THE MATHEMATICAL WORK OF APOLLONIUS 97
Figure 4.32
Bibliography
Ball, W. W. Rouse: A Short Account oJ ttu HisW of Matlumatics, Dovcr (reprint)'
1960, Chaps. 2-3.
Boyer, Carl B.: A History of Mathenalrrs, John Wiley and Sorx, 1968, Chaps. 7 and 9.
Coolidge, Julian L.: A History of Geomctrical Mcthods, Dover (reprint), 1963' Book l,
Chaps. 2-3.
Heath, Thomas L.: A Manual of Greck Malhcmalrcs, Dover (reprint), 1963, Chaps.
3-9 and 12.
A of Grcck Mathcmatits, Odord University Press, 1921, Vol. l,
History
Chaps.3-ll; Vol' 2, ChaP. 14.
T'lu Thirtccn Books of Euclid's Elements, 3 vols., Dover (reprint), 1956.
--i
IOO EUCLID AND APOLLONIUS
IOI
IO2 ALEXANDRIAN CEOMETRY AND TRIGONOMETRY
while still engaged in conquests. After his death, his generals fought each
other for power. Following several decades ofpolitical instability, the empire
was split into three independent parts. The European portion became the
Antigonid empire (from the Greek general Antigonus) ; the Asian part, the
Seleucid empire (after the Greek general Seleucus); and Egypt, ruled by
the Greek Ptolemy dynasty, became the third empire. Antigonid Greece and
Macedonia gradually fell under Roman domination and became unimpor-
tant as liar as the development of mathematics is concerned. The mathematics
generated in the Seleucid empire was largely a continuation of Babylonian
mathematics, though influenced by the developments we are about to
consider. The major creations following the classical Greek period were made
in the Ptolemaic empire, primarily in Alexandria.
That the Ptolemaic empire became the mathematical heir of classical
Greece was not accidental. The kings of the empire were wise Greeks and
pursued Alexander's plan to build a cultural center at Alexandria. Ptolemy
Soter, who ruled from 323 to 285 a.c., his immediate successors, Ptolemy II,
called Philadelphus, who ruled from 285 to 247 o.c., and Ptolemy Euergetes,
who reigned from 247 to 222 r.c., were well aware of the cultural importanci
of the great Greek schools such as those of Pythagoras, Plato, and Aristotle.
These rulers therefore brought to Alexandria scholars from all the existing
centers of civilization and supported them with state funds. About 290 s.c
Ptolemy Soter built a center in which the scholars could study and teach.
This building, dedicated to the muses, became known as the Museum, and it
housed poets, philosophers, philologists, astronomers, geographers, physi-
cians, historians, artists, and most of the famous mathematicians of the
Alexandrian Greek civilization.
Adjacent to the Museum, Ptolemy built a library, not only for the
prcservation of important documents but for the use ol the general public.
This famous library was said at one time to contain 750,000 volumes, in-
cluding the personal library of Aristotle and his successor Theophrastus.
Books, incidentally, were more readily available in Alexandria than in
classical Greece because Egyptian papyrus was at hand. In fact, Alexandria
became the center of the book-copying trade of the ancient world.
The Ptolemies also pursued Alexander's plan of encouraging a mixture of
peoples, so that Greeks, Persians, Jews, Ethiopians, Arabs, Romans, Indians,
and Negroes came unhindered to Alexandria and mingled freely in the city.
Aristocrat, citizen, and slave jostled each other and, in fact, the class dis-
tinctions of the older Greek civilization broke down. The civilization in
Egypt was influenced further by the knowledge brought in by traders and by
the special expeditions organized by the scholars to learn more about other
parts of the world. Consequently, intellectual horizons were broadened. The
long sea voyages of the Alexandrians called for far better knowledge of
geography, methods of telling time, and navigational techniques, while
THE CHARACTER OF ALEXANDRIAN GREEK MATHEMATICS IO3
already noted, Euclid organized the work of the classical period, and
Apollonius is exceptional in that he too organized and extended classical
Greek mathematics-though in his astronomy and his work on irrational
numbers (both of which will be presented in later chaPters), he was some-
what affected by the Alexandrian culture. To be sure, the other great
Ale:<andrian mathematicians, Archimedes, Eratosthenes, Hipparchus, Nico-
medes, IIeron, Menelaus, Ptolemy, Diophantus, and Pappus, continued to
display the Greek genius for theoretical and abstract mathematics, but with
striking differences. Alexandrian geometry was devoted in the main to
results useful in the calculation of length, area, and volume' It is true that
some such theorems are also in Euclid's Elements. For example, Proposition l0
of Book XII asserts that any cone is a third part of the cylinder which has
the same base and height. Hence if one knows the volume of a cylinder he
can compute the volume of a cone. However, such theorems are relatively
scarce in Euclid, whereas they were the major concern ofthe Alexandrian
geometers. Thus, while Euclid was content to prove that the areas of two
circles are to each other as the squares on their diameters-which leaves us
with the knowledge that the area A : kd2 b:ut without a value for /r-
Archimedes obtained a close approximation to zl so that circular areas
could be computed.
Further, the classical Greeks, because they would not entertain irra-
tionals as numbers, had produced a purely qualitative geometry. The
Alexandrians, following the practice of the Babylonians, did not hesitate
to use irrationals and in fact applied numbers freely to lengths, areas, and
volumes. The climax of this work was the development of trigonometry.
Even more significant is the fact that the Alexandrians revived and
extended arithmetic and algebra, which became subjects in their own right.
This development of the science of number was, of course, necessary if
quantitative knowledge was to be obtained either from geometrical results
or from the direct use of algebra.
The Alexandrian mathematicians took an active hand in the work on
mechanics. They calculated centers of gravity of bodies of various shapes;
they dealt with forces, inclined planes, pulleys, and gears; and they were
often inventors. They were also the chief contributors of their time to the
work on light, mathematical geography, and astronomy.
In the classical period mathematics had embraced arithmetic (of the
whole numbers only), geometry, music, and astronomy. The scope of mathe-
matics was broadened immeasurably in the Alexandrian period' Proclus,
who drew material from Geminus of Rhodes (lst cent. B.c.), cites the latter
on the divisions of mathematics (presumably in Geminus' time) : arithmetic
(our theory of numbers), geometry, mechanics, astronomy, optics, geodesy,
canonic (science of musical harmony), and logistic (applied arithmetic).
According to Proclus, Geminus says : " The entire mathematics was separated
AREAS AND VOLUMES rN TlrE WORK OF ARCHTMEDES rO5
into two main divisions with the following distinction: one part concerned
itself with the intellectual concepts and the other with material concepts."
Arithmetic and geometry were intellectual. The other division was material.
However, this distinction was gradually lost sight of, if it was still significant
as late as the first century s.c. One can say, as a broad generalization, that
the mathematicians of the Alexandrian period severed their relation with
philosophy and allied themselves with engineering.
We shall treat first the Alexandrian work in geometry and trigonometry.
In the next chapter we shall discuss the arithmetic and algebra.
Syracuse had ordered the crown. When it was delivered, he suspected that it
was filled with baser metals and sent it to Archimedes to devise some method
of testing the contents without, of course, destroying the workmanship.
Archimedes pondered thc problem; one day while bathing he observed that
his body was partly buoyed up by the water and suddenly grasped the
principle that enabled him to solve the problem. He was so excited by this
discovery that he ran out into the street naked shouting " Eureka ! " (" I have
found it!") He had discovered that a body immersed in water is buoyed up
by a force equal to the weight of the water displaced, and by means of this
principle was able to determine the contents of the crown (see Chap' 7,
sec. 6).
Though Archimedes was a remarkably ingenious and successful in-
ventor, Plutarch says that these inventions were merely "the diversions of
geometry at play." According to Plutarch, Archimedes " possessed so high a
spirit, so profound a soul, and such treasures of scientific knowledge that,
though these inventions had obtained for him the renown of more than
human sagacity, he yet would not deign to leave behind him any written
work on such subjects, but, regarding as ignoble and sordid the business of
mcchanics and every sort ofart which is directed to use and profit, he placed
his whole ambition in those speculations in whose beauty and subtlety there
is no admixture of the common needs for life." However Plutarch's status as
a storyteller is far higher than his status as a historian. Archimedes did write
books on mechanics and we have one entitled On Floating Bodies and another,
On tlu Equilibium of Plancs; two others, On l*aers and On Centers of Graaity,
are lost. He also wrote a work on optics that is lost, and did deign to write
about his inventions I though the work has vanished, we know definitely that
he wrote On Splure-making, which describes an invention displaying the
motions of the sun, the moon, and the five planets about the (fixed) earth.
The death of Archimedes portended what was to happen to the entire
Greek world. In 216 s.c. Syracuse allied itself with Carthage in the second
Punic war between that city and Rome. The Romans attacked Syracuse in
2L2 s.c. While drawing mathematical figures in the sand, Archimedes was
challenged by one of the Roman soldiers who had just taken the city' Story
has it that Archimedes was so lost in thought that he did not hear the
challenge of the Roman soldier. The soldier thereupon killed him, despite
the order of the Roman commander, Marcellus, that Archimedes be un-
harmed. Archimedes was then seventy-five and still in full possession of his
powen. By way of " compensation," the Romans built an elaborate tomb
upon which they inscribed a famous Archimedean theorem.
Archimedes' writings took the form of small tracts rather than large
books. Our knowledge of these works comes from extant Greek manuscripts
and from Latin manuscripts translated from the Greek from the thirteenth
century onward. Some of the Latin versions were made from Greek
AREAS AND VOLUMES IN TIIE WORK OF ARCIIIMEDES r07
Figure 5.1
Proposition 33. The surface of any sphere is four times the [area of the]
greatest circle on it.
Corollary to Prop. 34. Every cylinder whose base is the greatest circle in a
sphere and whose height is equal to the diameter of the sphere is 3/2 of [the
volume of] the sphere, and its surface together with its bases is 3/2 of the
surface of the sphere.
That is, he compares the surface area and volume of a sphere and a
cylinder circumscribed about the sphere. This is the famous theorem which
in accordance with Archimedes' wishes, was inscribed on his tombstone.
He then proves in Propositions 42 arl.d,43 that the surface of the segment
ALMNP of a sphere is the area of a circle whose radius is lZ (Fig. 5.2).
The segment can be less or more than a hemisphere.
IOB ALEXANDRIAN GEOME'TRY AND TRIGONOMETRY
Figure 5,2
The theorems on surlbce area and volume are proved by the method of
exhaustion. Archimedes uses inscribed and circumscribed rectilinear figures
to " exhaust" the area or volume, and then, like Euclid, uses the indirect
method of proof to complete the argument.
Some theorems in the second book of Oz thc Sphere and Cylinder, which is
concerned largely with segments of a sphere, are significant because they
contain new geometrical algebra. For example, he gives:
Proposition 4. To cut a given sphere by a plane so that the volumes of the
segments are to one another in a given ratio.
This problem amounts algebraically to the solution of the cubic
equation
(a-x):c:b2zx2
aud Archimedes solves it geometrically by finding the intersection of a
parabola and a rectangular hyperbola.
The work on conoids and spheroids treats properties of figures ofrevolution
generated by conics. Archimedes' right-angled conoid is a paraboloid ol
ievolution. (In Archimedes' time the parabola was still regarded as a section
ofa right-angled cone.) The obtuse-angled conoid is one branch ofa hyper-
boloid of revolution. Archimedes' spheroids are what we call oblate and
prolate spheroids, which are figures of revolution generated by ellipses' The
main object olthe work is the determination of volumes of segments cut off
from the three solids by planes. The book also contains some of Archimedes'
work on the conic sections already alluded to in the discussion olApollonius.
As in other works, he presupposes theorems that he deems easily proved or
that can be proved by methods he has previously used' Many of the proofs
use the method of exhaustion. some examples of the contents are furnished
by the following ProPositions:
Proposition 5. lf AA' and BB' be the major and minor axes of an ellipse and
ifd be the diameter olany circle, then the area ofthe ellipse is to the area of
the circle as AA' 'BB' is to de.
AREAS AND VOLUMES IN THE WORK OF ARCHIMEDES r09
I
I
Figure 5.3
The theorem says that if 2a is the major axis and 2D is the minor axis
and s and s' are the areas of the ellipse and circle, then sls' : 4abld2' Sirrce
s':(il4)d2,s:tab.
Proposition 7. Given an ellipse with center C and a line CO PerPendicular
to tire plane of the ellipse, it is possible to find a circular cone with vertex O
and such that the ellipse is a section of it.
Clearly Archimedes realized that some, at least, of the several conic
sections can be obtained from the same cone, a fact that Apollonius utilized.
-\ I
I I
I I
I lJv I
,l I
I
Figurc 5.4
(r) EB : BD,
though Euclid's proof of this fact is not known. Now draw AF parallel to ED
and let CB cut r4.F in I(. Then, by (l) and the use of similar triangles, one
proves that FK : KA. Produce CK to H so that Cr( : .I(1L Further, let
MNPO be any diameter of the parabola. Then MN : -MO, because of (l)
and the use of similar triangles.
Now Archimedes compares the area of the segment and the area of
triangle CFA. He regards the first area as the sum of line segments such as
PO and the area of the triangle as the sum of line segments such as MO.
He then proves that
HK.OP: KN.MO.
AREAS AND VOLI'MES IN THE ll,ORK OF ARCHIMEDES
Physically this means that if we regard .r(II and .t(.tY as arms of a lever with a
fulcrum at r, then oP regarded as a weight placed at Ir would balance the
weight of MO placed at lY. Consequently the sum of all the line segments
suci as pO placed at I/ will balance the sum of all the line segments such as
MO each concentrated at its midpoint, which is the center of gravity of a
line segment. But the collection ofsegments MO each placed at its center of
g..riti is .. equivalent" to the triangle CAF placed at its center of gravity.
in his book On thc Equilibrium of Plarus, Archimedes shows that tJris center
is X on CI( where X* : 1t1l1CX. By the law of the lever, r(X'the area of
triangle CFA : HK'area of parabolic segment or
ACFA HK
_-------:- 3
(2)
segrnentcBA KX I
(a)
Figure 5,6, a and b
This is readily proven in many ways; we can do it by our formula for -the
sum of z terms of a geometrical progression' In the application of (4)' ,t4t is
triangle PQq.
ifren e."ni-edes shows that the atea A of the parabolic segment cannot
be greater or less than (4/3)24r. His proof is simply that if th e area
A exceeded
tten he could g"i 1n"lt"; set of, triangles whose sum S would
91614 " given magnitude' and
aii#, f.orn the area of the segment by less than any
hence the sum S would exceed (4/3)/t' That is,
A>s>(413)Ar
But bY (4) ir's contains'j"*;r';.nl
*,u,r,
or
s< (+13)41.
(5) (413)AL-A>A^
II4 ALEXANDRIAN GEOMETRY AND TRIGONOMETRY
where A^ is the zth term of the sequence and geometrically represents the
sum of2'-1 triangles. But since by (4)
Proposition 24. The area bounded by the first turn of the spiral and the
initial line [the shaded area in the figure] is equal to one third of the first
circle.
The first circle is the circle with radius Ol, which equals 2zra, and so
the shaded area is t(2ra)213.
The proof is by the method of exhaustion. In preceding theorems, which
prepare the ground, the area of a region bounded by an arc of a spiral, the
arc BPQRC in Figure 5.8, and by two radii vectors OB and, OC, is enclosed
between two sets of circular sectors. Thus B!',Pq',Qr'r... arc arcs of
circles with center at O and likewise Pb, Qp, Rq,. . . are arcs of circles. The
circular sectors of the inscribed set are OBp', OPq', OQr',. . . and the circular
sectors of the circumscribed set are OPb, OQp, ORq,. . . . Thus circular
AREAS AND VOLUMBS IN THE WORI( OF ARCHIMEDES
I15
Figure 5.7
Figure 5.8
116 ALEXANDRIAN GEoMETRY AND TRIGoNoMETRY
wherein a, b, and c are the sides and s is half the perimeter, illustrates the
and the formula
last-mentioned idea. This formula appears inhis Geoduy,
and a proof are in both the Dioptra and the Mctrica' ln the Dioptra he shows
how to dig a straight tunnel under a mountain by working simultaneously
from both ends.
Though many of the formulas are proven, Heron gives many without
proof and-also gives many approximate ones' -Thus he gives an inexact
i'o.-rlu for the area of a triangle along with the above correct one' One
reason that Heron gave many Egyptian formulas may be that the exact
ones involved square roots or cube roots and the surveyors could
not execute
these operations. In flact, there was a distinction between Pure geometry
.rd g.od".y or metrics. The calculation of areas and volumes belonged to
geode.y urril *u, not part of a liberal education' It was taught to surveyors'
irurorrr, carpenters, and other technicians' There is no doubt that Heron
continued and enriched the Egyptian science of field measurements; his
writings on geodesy were used for hundreds of years'
Hlron applied many of his theorems and rules to the design of theaters'
banquet halls, and baths. His applied works include Mechanics, Tlu Con'
struction of Catapulx, Measurements, The Design of Guns, Pneumatica
(the theory
and use Lf ,i. p...rr..), and On the Art of Construction of Automala' He gives
designs for water clocks, measuring instruments, automatic machines'
weight-lifting machines, and war engines.
,'
Figurc 5.9
ofP we get three other curves according as a > b, a : b, and a < D. Hence
there are four types of conchoids, all due to Nicomedes. The modern polar
equation is r : a * b *c 0. Nicomedes used the curve to trisect an angle
and double the cube.l
Nicomedes is supposed to have invented a mechanism to construct the
conchoids. The nature of the mechanism is of far less interest than the fact
that mathematicians of the period were interested in devising them. The
conchoids of Nicomedes, together with the line and circle, are the oldest
mechanically constructible curves about which we possess satisfactory
information.
Diocles (end of2nd cent. a.c.), in his book On Burning-glasses, solved the
problem of doubling the cube by introducing the curve called the cissoid.
The curve is defined as follows: AB and CD are perpendicular diameters of a
circle (Fig. 5.10) and EB and, BZ are eqwaL arcs. Draw Zlf perpendicular to
CD and then draw ED. The intersection of ZH and, ED gives a point P on
the cissoid. For Diocles the cissoid is the locus of all points P determined by
all positions of E on arc BC and, Z on arc BD wfih arc BE : arc BZ. Ore
\,
H\
o
,Z
F\- /
B
I
Figure 5.10 I
ThusHZandHDaretwomeanProportionalsbetweenCHand'IIP.This
solves the Delian problem. The equation of the cissoid in rectangular co-
ordinates is y'(a i x) : (a - x)3, where O is the origin, a the radius of
the
This equation includes the
circle, and bb OA the coordinate axes.
""a'
brokenJine portions of the curve shown in the figure, which were not
considered by Diocles.
Figure 5.1
Christ, and a diameter is divided into 120 parts. Each part of the circum-
ference and diameter is further divided into 60 parts and each of these into
60 more, and so on according to the Babylonian system of sexagesimal
fractions. Then for a given arc l-B of some number of degrees, Hipparchus-
in a book, now lost, on chords in a circle-gives the number of units in the
corresponding chord AB. Just how he calculated these will be described in
the discussion of Ptolemy's work, which presents their combined thoughts
and results.
The number of units in the chord corresponding to an arc of a given
number of degrees is equivalent to the modern sine function. If 2cr is the
central angle of arc AB (Fig. 5.1l), then for us sin cr : ACIOA, whereas,
instead of sin a, Hipparchus gives the number of units in 2.AC when the
radius Oz4 contains 60 units. For example, if the chord of2a is 40 units, then
for us sin a : 20160, or, more generally,
Greek trigonometry reached a high point with Menelaus (r. e.o. 9B).
I{is work, but apparently he also wrote Chords in a Circle
Sphazrica is his chief
in six books, and a treatise on the setting (or rising) of arcs of the zodiac.
The Arabs attribute additional works to him.
The Sphacrica, extant in an Arab version, is in three books. In the first
book, on spherical geometry, we find the concept of a spherical triangle,
that is, the figure formed by three arcs ofgreat circles on a sphere, each arc
being less than a semicircle. The object of the book is to prove theorems for
spherical triangles analogous to what Euclid proved for plane triangles.
Thus the sum of two sides ofa spherical triangle is greater than the third side
and the sum of the angles of a triangle is greater than two right angles.
Equal sides subtend equal angles. Then Menelaus proves the theorem that
has no analogue in plane triangles, namely, that ifthe angles ofone spherical
triangle equal respectively the angles ofanother, the triangles are congruent.
He also has other congruence theorems and theorems about isosceles triangles.
THE CREATION OF TRIGONOMETRY
The second book of Menel aus' Sphaerica is chiefly about astronomy and
only indirectly concerned with spherical geometry' The third book contains
some sphericai trigonometry and bases the development on the first
theorem
of the iook, which supposes that we have a spherical triangle ABC (Fig' 5 '12)
and any great circle cutting the sides of the triangle (produced where
,"""rr.ry;l To state the theorem we shall use our modern sine notion, but
fo. Merr.iars the sine of an arc such as lB (or the sine of the corresponding
central angle at the center of the sphere) is replaced by the chord ofdouble
the arc AB,In terms of our sines, then, Menelaus' theorem says
The proof of this theorem rests upon the corresponding theorem for plane
trianlles, which we still call Menelaus' theorem' For plane triangles the
theorem states (Fig. 5.13) :
PLA' PzB' PBC : PtC' PzA' PsB'
Menelaus does not prove the plane theorem' One may conclude that it was
already known or perhaps proved by Menelaus in an earlier writing'
The second theorem oiBook III, in the notation that arc a lies opposite
angleAintriangleABC,statesthatifABCandA'B'C'aretwospherical
and, if i:
triingles A' andC: C', or C is suPPlementary to C', then
sin r : sin c'
ii"-, ii-"a'
Theorem 5 of Book III uses a Property of arcs that was presumably
known by Menelaus' time, namely (Fig' 5'14), if four great circular atcs
emanatefromapointOand'ABCDandA'B'C'D'aregreatcirclescutting
the four, then
sir, AD sir. BC sirt A'D' sin B'C'
,LB: og''':j-1"l'Y'
iJ[-J-lr'
-sin.DC'-sin
ALEXANDRIAN CEOMETRY AND TRIGONOMETRY
Figurc 5.14
decagon arrd BF, a side of the regular inscribed Pentagon' But 'ED contains-
SO u-nits and 8D,60 units. Since EBz : EDz + BDz, EBz: 4500 and
BP :67 4'55" (which means 67 + 4/60 + 55/603 units)' 11o" f,f : EB
and so he knows .OF. Then FD : EF - DE : 67 4'55' - 30 : 37 4'55''
Since FD equals the side ofthe decagon, it is the chord ofthe 36' arc' Hence
he knows the chord of this arc. By using FD and the right triangle trDB'
he can calculate BF. Itis 7032'3'. But BFis the side of the pentagon' Hence
he has the chord of the 72o arc.
Of course for the side of a regular hexagon, since it equals the radius'
he has at once that the chord of length 60 belongs to the arc of length 60'
Also, since the side of the inscribed square is immediately calculable in tcrsrs
of the radius, he has the chord of 90'' It is 84 51'10'' Further, since the
side of the inscribed equilateral triangle is also immediately calculable
in
terms of the radius, he gets that the chord of 120' is 103 55'23''
By using the right triangle ABC (Fig.5.16) on the diameter 'tlC one can
im-eji.t.ly-get the chord of the s..ppletentary arc AB rf orre knows the
chord of thi irc BC. Thus, since Ptolemy knows the chord of 36o he can find
tlre chord of 144', which turns out to be ll4 7'37''
The relationship established here is equivalent to sin3 A + corz A
: I
where z4 is ary acute angle. This can be seen as follows' Ptolemy has proved
that ifS is any arc less than 180" then
(120)2 sin2
f * ltzo;',i"' (199/)
: rzo'
t24 ALEXANDRIAN GEOMETRY AND TRIGONOMETRY
ligurc 5,
sin'f +
"o.'f: t.
Now Ptolemy proves what he calls a lemma but which is known today
as Ptolemy's theorem. Given any quadrilateral inscribed in a circle (Fig.
5.17), he proves that AC.BD: AB.DC + AD.BC. The proof is straight-
forward. He then takes the special quadrilateral ABCD in which AD is a
diameter (Fig.5.lB). Suppose we know lB and AC. Ptolemy now shows how
to find BC. BD is the chord of the supplement of arc AB and CD is the chord
of tlre supplement of arc AC.lf one now applies the lemma, he sees that five
of the six lengths involved in it are known and so the sixth length, which in
this case is BC, can be calculated. But arc BC : arc AC - arc AB. Hence we
can calculate the chord ofthe difference of two arcs if their chords are known.
In modern terms this means that if we know sin z4 and sin B we can calculate
sin (l - B). Ptolemy points out that, since he knows the chords of 72" and
60o, he can calculate the chord of l2'.
Ife shows next how, given any chord in a circle, he can find the chord of
half the arc of the given chord. In modern terms this means finding sin l/2
from sin 24. This result is powerful, as Ptolemy points out, because we can
start with an arc whose chord is known and find the chords of successive
halves ofthis arc. He also shows that if one knows the chords ofarcs AB and
BC, then one can find the chord of arc lC. In modern terms, this result is the
formula for sin (A + B) , As a special case, he points out that we can calculate,
in modern terms, sin 2.,4 from sin ,4.
Since Ptolemy can get the chord of3/4" from the chord of 12' by halving,
he can add this arc of3/4" or subtract it from any arc whose chord is known;
r25
and by using the above theorems, he can calculate the chord of the sum or
difference oithe two arcs. Thus he is in a position to calculate the chords of
all arcs in steps of 314'. However, he wants to obtain the chords of arcs in
steps of I /2.. frere he is stuck and resorts to clever reasoning with
inequalities.
The approximate result is that the chord of ll2" : 0 31'25"
ti" ir rro* in a position to make up a table of the chords ofarcs for arcs
diflering by 112", from 0'to 180'. This is the first trigonometric table'
Tt"err Ptole-y proceeds (Chapter XI of Book I) to solve astronomical
problems that call for finding arcs ofgreat circles on asphere' These arcs are
,id", of spherical triangles, some of whose Parts are known through either
observation or prior cJculation. To determine the unknown arcs Ptolemy
proves relationships that are theorems of spherical trigonometry some of
*hi"h fr"d already been proved in Book III of Menelaus' Sphaerica' Ptolemy's
basic method is to use Menelaus' theorem for spherical triangles. Thus he
proves, in our notation, that in the spherical triangle with right angle at C
lfig. S.tS; and with arc a denoting the side opposite angle A,
sina: sin c sin ..4
tar, d -- sin D tan z4
,::'; H::::x
=
of course for Ptolemy the various trigonometric functions are chords of arcs.
To treat oblique spherical uiangles, he breaks them uP into right-angled
spherical triat gles. There is no systematic Presentation of spherical trigo-
,ro-"t.y; h. f.o',res just those theorems that he needs to solve specific
astronomical Problems.
The Atmigest put trigonometry into the definitive form it retained for
over a thousanl y.u.r. Wi generally speak of this trigonomery as spherical'
butthedistinctionbetweenplaneandsphericaltrigonometryreallyhaslittle
point in assessing Ptolemy's material. Ptolemy certainly works with spherical
iriangles, but inlaving calculated the chords ofarcs, he has really given the
theorletical basis for plane trigonometry. For by knowing sin 24, and, in effect'
cos r4 for arry A ftom 0" to 90o, one can solve plane triangles'
We should note that trigonometry was created for use in astronomy;
and, because spherical trigonometry was for this purpose the more useful
126 ALEXANDRIAN GEOMETRY AND TRIGONOMETRY
3. The circle has greater area than a regular polygon of the same perimeter.
4. Of all solids with the same surface the sphere has the greatest volume.
The subject of the theorems, which today we would describe as maxima
and minima problems, was novel in Greek mathematics.
Figure 5.20
Figure 5.21
r28 ALEXANDRIAN GEOMETRY AND TRIGONOME TRY
Bibliography
Aaboe, Asger : Episodcs Jrom thc Early History of Matlumatics , Random Houre,
1964, Chaps. 3-4.
Bdl, W. W. R.: ..{ Shoil Accowrt ol tlu Histary of Matlumati.cs, Dover (reprint),
1960, Chaps.,s.{.
Cajori, Florian: A H;sW of Mathematics, Macmillan, 1919, pp. 29-52.
Dijlsterhuis, E. J.: Archbncdes (English trans.), Ejnar Munksgaard, 1956,
Heath, Thomas L.: A Hisbry of Grcck Mathtmatics, Oxlord University Press, 1921,
VoI.2, Chaps. 13, 15, 17-19,21.
Tlu Worles of Archimcdcs, Dover (reprint), 1953.
Pappns d'Alexatdie: La Collcction mathimati.que , ed. Paul Ver Eecke, 2 vols., Albert
Blanchard, 1933.
Parsons, Edward Alexander: Thc Ahxandrian Library, The Elsevier Press, 1952.
Sarton, George z A Histuy of Scitncc, Harvard University Press, 1959, Vol. 2,
Chaps. l-3, 5, 18.
Scott, J. F,z A History of Matlumatics, Taylor and Francis, 1958, Chaps. 3-4.
Smith, David Eugene: History of Mathematbs, Dover (reprint), 1958, Vol, l,
Chap.4; Vol.2, Chaps. S and 10.
vau der Waerden, B. L.: Scbnce Awakcning, P. Noordhoff, 1954, Chaps. 7-8.
6
The Alexandrian Period:
The Reemergence of
Arithmetic and Algebra
Wherever there is number, there is beauty.
PR()cLUS
I3I
I32 ALEXANDRIAN ARITHMETIC AND ALGEBRA
meaning of " abacus " was " sand," which suggests that before the intro-
duction of the abacus, and probably afterward, they drew numbers as dots
in sand. In the three hundred years from Thales to Euclid the mathe-
maticians paid no attention to computation, and this art made no progress.
It is significant that the books do not tell us about the practice of arithmetic.
For some unknown reason the classical Greeks changed their way of
writing numbers to the Ionic or Alexandrian system, which uses letters of
the alphabet. This alphabetic system was the most common one in Alex-
andrian Greek mathematics and is found, in particular, in Ptolemy's
Almagest. It was used also in ancient Syria and Israel.
The details of the Greek system are as follows :
ap y6€r( q0
t2 34567 8.9
LKI pvg ona
l0 20 30 40 50 60 70 B0 90
porttdx ,ltoT
100 200 300 400 500 600 700 800 900
Various Greek writers used minor variations of the above scheme and of
schemes given below.
Greek papyri of the first part of the Alexandrial p4"d (first three
centuries B.c.) contain symbols for zero such as 0-10, 0,- 0-, and
-0-. .Iih.
zero ofthe Greek Alexandrian period was used, as was the zero ofthe Seleucid
Babylonian period, to indicate missing numbers. According to Byzantine
manuscripts, which are all we have of Ptolemyts work, he used 0 for zero
both in the middle and at the end of a number.
Archimedes' Sand-Reckoner presented a scheme for writing very large
numbers. He sought to show that he could write a number as large as the
THE SYMBOLS AND OPERATIONS OF CREEK ARITHMETIC I33
number of grains of sand in the universe. He takes the largest number then
expressed in Greek numerals, that is 108, a myriad myriads, and uses it as
the starting point of a new series of numbers that goes up to 108 x 108 or
1016. Then he uses 1016 as a new starting point for a series of numbers that
goes from 1016 to 1024, and so forth. He next estimates the number of grains
of sand in the universe and shows it is less than the largest number he can
express. What is important in this work of Archimedes is not a practical
scheme for actually writing any large number, but the thought that one can
construct indefinitely large numbers' Apollonius had a similar scheme'
The arithmetic operations, with the whole numbers written as described
above, were like ours. Thus in adding, the Greeks wrote the numbers one
below the other to form a unit column, tens column, and so forth, added the
numbers in each column, and carried from one column to the other' These
methods are a great step forward, compared with the Egyptian ones' The
latter, however, were also taught by the Alexandrian Greeks.
As for fractions, there was the special symbol , L' for l/2. Thus (some-
times with one accent), aL' ara vL' :3|' S-til
: ll, Ur' :Z),
fractions were denoted by writing the numerator marked with an accent,
then the denominator written once or twice, each time with two accents'
Thus ry'xT"xT' : 13129. Diophantus often wrote the denominator above
the numerator.
The Egyptian scheme of writing fractions whose numerators are
larger than I as a sum of unit fractions is also found. Thus H.,ot' *ito ffi
l35l 265
76>v5>69'
but does not explain how he got this result. Among the many conjectures in
the historical literature concerning its derivation the following is very
plausible. Given a number l, if one writes it as a2 1 D where a2 is the rational
square nearest to l, larger or smaller, and D is the remainder, then
b_h
ot;>{a2+b>otuL+1.
Several applications of this procedure do produce Archimedes' result. To
obtain thb approximation to zr, Archimedes first proves, in Proposition l,
that the area of a circle is equal to the area ofa right triangle whose base is
as long as the circumference of the circle and whose altitude equals the
radius. He must now find the circumference. This he approximates more
and more closely by using inscribed and circumscribed regular polygons
and calculates the perimeters of these polygons. His result for zr is
sfr.. . s|.
ARITTIMETIC AND ALGEBRA AS AN INDEPENDBNT DEVELOPMENT I35
\/V:\/7-rb-at!,
-2a
where a and D have the meaning explained above. He gets this approximation
/-.\ 55 /r\ 23
lo3
60\,i602\'i604
for {5, which amounts to 1.7320509 and is correct to six decimal places.
not solely because we said that it existed before all the others in the mind
ofthe creating God like some universal and exemplary plan, relying upon
which as a design and archetypal example the creator ofthe universe sets
in order his material creations and makes them attain to their proPer ends;
but also because it is naturally prior in birth. . ..
++n,:;(32-r).
Also the zth triangular number, the zth square number, the nth Pentagonal
number, and so on form an arithmetic progression with the (z - I)st
triangular number as the common difference.
He discovered the following proposition: If one writes down the odd
numbers
1,3,5,7,9, 11, 13, 15, 17,...
then the first is the cube of 1, the sum of the next two is the cube of 2, the
sum ofthe next three is the cube of3, and so on. There are other propositions
on progressions.
Nichomachus gives four perfect numbers, 6, 28, 496, and Bl2B, and
repeats Euclid,s formula for perfect numbers. He classifies all sorts of ratios,
including m * l:m,2m + nim I n, and mn * l:n, and gives them names'
These were important for music'
He also studies proportion, which, he says, is very necessary for " natural
science, music, spherical trigonometry and planimetry and particularly for
I38 ALEXANDRIAN ARTTHMETIC AND ALGEBRA
a+b
ot--T- 2ab
o+0
The Introductio also gives the sieve of Eratosthenes (about whom we
shall say more in Chapter 7); it is a method of obtaining prime numbers
quickly. One writes down all the odd numbers from 3 on as far as one
wishes, then crosses out all multiples of 3, that is to say every third number
after 3. Next one crosses out all multiples of 5, or every fifth number after 5,
counting any that may have been crossed out before. Then every seventh
number alter 7, and so forth. No crossed-out number can be the starting
point of a new crossing-out process. The number 2 must now be included
with those that are not crossed out. These numbers are the primes.
Nichomachus always uses specific numbers to discuss various categories
and proportions. The examples illustrate and explain what he asserts but
there is no support for any general assertions beyond the examples. He does
not prove deductively.
The Introductio had value because it was a systematic, orderly, clear, and
comprehensive presentation ofthe arithmetic ofintegers and ratios ofintegers
freed of geometry, It was not original as far as ideas were concerned, but was
a very useful compilation. It also incorporated speculative, aesthetic, mystical,
and moral properties of numbers, but no practical applications . The Intro-
duttio wx the standard text in arithmetic for a thousand years. At Alexandria,
from the time of Nichomachus, arithmetic rather than geometry became the
favorite study.
At this time, too, algebra came to the fore. Books of problems appeared
that were solved by algebraic techniques. Some of these problems were
exactly those that appeared in Babylonian texts of2000 r.c. or in the Rhind
papyrus. This Greek algebraic work was written in verbal form; no symbolism
was used. Also, no proofs of the procedures were given. From the time of
Nichomachus onward, problems leading to equations were a common form
of puzzle. Between fifty and sixty of these are preserved in the Palatine
Codex of Greek Epigrams (l0th cent. .1,.o.). At least thirty of them are
attributed to Metrodorus (c. e..o. 500), but are surely older. One is the
Archimedes cattle problem, which calls for finding the number of bulls and
cows of different colors subject to given information. Another is due to
Euclid and involves a mule and a donkey carrying corn. Another calls for
the time required for pipes to fill a cistern. There were also age problems
such as appear in our algebra texts.
The highest point of Alexandrian Greek algebra is reached with Dio-
phantus. We know almost nothing about his origins or life; he was probably
ARITIIMETIC AND ALCEBRA AS AN INDEPENDENT DEVELOPMENI I39
Greek. One of the algebraic problems found in a Greek collection gives the
following facts about his life: His boyhood lasted 1/6 of his life; his beard
grew after l/12 more; he married after ll7 more I and his son was born 5
years later. The son lived to half of his father's age and the father died 4
years after the son. The problem is to find how long Diophantus lived' The
answer is easily found to be 84. His work towers above that of his con-
temporaries; unfortunately, it came too late to be highly influential in his
time because a destructive tide was already engulfing the civilization.
Diophantus wrote several bools that are lost in their entirety. Part ofa
tract On Polygonal Numbers is known, in which he states and proves theorems
in the deductive manner of Books VII, VIII, and IX of t};.e Elemmts; how-
ever, the theorems are not striking. His great work is the Atithmetica which,
Diophantus says, comprises thirteen books. We have six, which come from
a thirteenth-century manuscript that is a Greek copy of an older one and
from later versions.
The Arithmetica, Iike the Rhind papyrus, is a collection of separate
problems. The dedication says it was written as a series of exercises to help
one of his students learn the subject. One of Diophantus' major steps is the
introduction of symbolism in algebra. Since we do not have the manuscript
written by him but only much later ones, we do not know the precise
symbols. It is believed that the symbol he used for the unknown was s,
which served as our .r. This s may have been the same letter as the Greek o
when used at the end of a word, as in dtpfilis (arithmos), and may have
been chosen because it was not a number in the Greek system of using
letters for numbers. Diophantus called the unknown "the number of &e
problem." For our x2 Diophantus used AY, the A being the first letter of
itivay.rs (dghamrs, "power"). xs is KY; the K is from niBos (c:abos). z' is
AYA; *5 is AKY; .rB is KYK. In this system KY is not clearly the cube of s as
our *3 is of x. For Diophantus sr : llx. He also uses namel for these various
powers, e.g. number for r, square for *2, cube for .r3, square-square
(dynamodynamis) for *a, square-cube for *5, and cubocube for x6.'
The appearance of such symbolism is of course remarkable but the use
of powers higher than three is even more extraordinary. The classical Greels
could not and would not consider a product of more than three factors
because such a product had no geometrical significance. On a purely
arithmetic basis, however, such products do have a meaning; and this is
precisely the basis Diophantus adopts.
Addition is indicated in Diophantus by putting terms alongside one
another. Thus
The Ift is a symbol for unity and indicates that a pure number not involving
the unknown follows. Again
(*)'-(,-l1',:oo
to expressions such as x * 2 for p and r * 3 for q' That is, he makes steps
that use the identities but the identities themselves do not appear.
The first book of the Arithmetie a consists mainly of problems that lead to
determinate equations of the first degree in one or more unknowns. The
remaining five books treat mainly indeterminate equations ofsecond degree.
But this segregation is not sharply adhered to. In the case ofthe determinate
equations (i.e. equations leading to unique solutions) with more than one
unknown involved, he uses given information to eliminate all but one
unknown and, at worst, ends with quadratics of the form ax2 : D. Thus
Problem 27 of Book I states: Find two numbers such that their sum is 20
and product is 96. Diophantus proceeds thus: Given sum 20, given product
96, 2r the difference of the required numbers. Therefore the numbers are
l0 + *, l0 - x. Hence 100 - x2 :96. Then r : 2 and the required
numbers are 12, B.
The most striking feature of Diophantus' algebra is his solution of
indeterminate equations. Such equations had been considered before, as for
example in the Pythagorean work on solutions of x2 + g2 : 22, in the
Archimedean cattle problem, which leads to seven equations in eight
unknowns (plus two supplementary conditions), and in other odd writings.
Diophantus, however, pursues indeterminate equations extensively and is
ARITHMETIC AND ALGEBRA AS AN INDEPENDENT DEVELOPMENT I4I
(1) !2:A*z*BxiC.
Diophantus does not wtite 92 but says that the quadratic expression must
. square number (square ofa rational number). He considers (l) for
"qril
spicial values of A, B, and C and treats these types in separate cases' For
.*u-pI", when C is absent he lets y : mxln, where m and n are specffic
whole numbers, obtains
t--2 -tt D- fr'
rilL -,
')* --7^,
and then cancels r and solves. When ..4 and C do not vanish but A : az,he
assumes y : ax - m. lf C: c3 he assumes y : (m" - c). In all cases n isa
specific number.
He also treats the case of simultaneous quadratics, namely,
(2) lz:Ax2+BxlC
(3) z2:Dxz*ExlF-
Here, too, he undertakes only particular cases, that is, where .,{, B, ' . ', F are
special numbers or satisfy special conditions, and his method is to assume
expressions lor y and z in terms ofx and then solve for x.
In effect he is solving determinate equations in one unknown' He
realizes, however, that in choosing expressions fory and zin (2) and' (3) and
fory in (l), he is giving merely typical solutions and that the values assigned
to y and z are somewhat arbitrarY'
He also has problems in which cubic and higher degree expressions in *
must equal a square number, e.g.
Axs + Bxz *Cx I d2:!2'
Here he lets y :mx * d and fixes m so that the coefficient of r vanishes'
Since the d2 terms cancel and he can then divide through by x2, he obtains
I42 ALEXANDRIAN ARITHMETIC AND ALGEBRA
numbers, as, for example, coemcients in equations. Aristotle did use letters of
the Greek alphabet to indicate an arbitrary time or an arbitrary distance
and in discussions of motion employed such phrases as " the half of 8."
Euclid, too, used letters for classes of numbers in Books VII to IX of the
Eletnents, a practice that Pappus followed. However, there was no recognition
of the enormous contribution that letters could make in increasing the
eflectiveness and generality of algebraic methodology.
Another feature of Alexandrian algebra is the absence of any explicit
deductive structure. The various types of numbers-whole numbers, frac-
tions, and irrationals-were certainly not defined. Nor was there any axio-
matic basis on which a deductive structure could be erected. The work of
Heron, Nichomachus, and Diophantus, and of Archimedes as far as his
arithmetic is concerned, reads like the procedural texts of the Egyptians and
Babylonians, which tell us how to do things. The deductive, orderly proofof
Euclid and Apollonius, and of Archimedes' geometry is gone. The problems
are inductive in spirit, in that they show methods for concrete problems that
presumably apply to general classes whose extent is not specified. In view of
the fact that as a consequence of the work of the classical Greeks mathe-
matical results were supposed to be derived deductively from an explicit
axiomatic basis, the emergence of an independent arithmetic and algebra
with no logical structure of its own raised what became one of the great
problems of the history of mathematics. This approach to arithmetic and
algebra is the clearest indication of the Egyptian and Babylonian influences
in the Alexandrian world. Though the Alexandrian Greek algebraists did
not seem to be concerned about this deficiency, we shall find that it did
trouble deeply the European mathematicians.
Bibliography
Ball, W. W. R,: ,t{ Short Account oJ thc History of Mathematics, Dover (reprint), 1960,
Chaps. 5 and 7.
Cajori, Florian: A History of Mathemalics, Macmillan, 1919, pp. 52-62.
Heath, Thomas L.: Diaphanlus of Alexandria, Dover (reprint), 1964,
Tlu Works of Archinudes, Dover (reprint), 1953, Chaps. 4 and 6 of the
fnroduction, pp. 9l-98, 319-326.
A History of Greek Mathematits, Oxford University Press, 1921, Vol. l,
Chaps. 1-3; Vol. 2, Chap.20.
A Manual of Grcek Mathematics, Dover (reprint), 1963, Chaps. 2-3, and 17.
D'Ooge, Martin Luther: Nichomachus of Gaasa, University of Michigan Press, 1938.
van der Waerden, B. L.: Sciencc Awakening, P. Noordhofl 1954, pp. 278-86.
7
The Greek Rationalization
of Nature
Mathematics is the gate and key of the sciences.
ROGER BACON
detail how men such as Euler and Gauss worked, we may be fairly certain
that the investigations in astronomy, optics, and music must have suggested
mathematical problems, and it is most likely that the motivation for the
mathematics was its application to these other areas. It is also relevant that
the geometry of the sphere, known in Greek times as "sphaeric," was studied
just as soon as astronomy became mathematical, which happened even
before Eudoxus' time. The word " sphaeric " meant " astronomy " to the
Pythagoreans.
Fortunately the inferences we may draw from the works of the mathe-
maticians, though reasonable enough, are established beyond doubt by the
overwhelming evidence in the writings of the Greek philosophers, many of
whom were also prominent mathematicians, and of the Greek scientists. The
bounds of mathematics were not mathematics proper. In the classical period
mathematics comprised arithmetic, ggometry, astronomy, and music; and
in the Alexandrian period, as we have already noted in Chapter 5, the
divisions of the mathematical sciences were arithmetic (theory of numbers),
geometry, mechanics, astronomy, optics, geodesy, canonic (musical har-
mony), and logistics (applied arithmetic).
shall not describe the qualitative theories of Thales, Anaxagoras, and their
colleagues, each of whom fixed on a single substance persisting through aII
uppu..rrt change. The underlying identity of this prime substance is con-
served but all forms of matter can be explained in terms of it, This natural
philosophy of the lonians was a series of bold speculations, shrewd guesses'
and brilliant intuitions rather than the outcome of extensive and careful
scientific investigations. They were perhaps a little too ea8er to see the whole
picture and so naively jumped to broad conclusions. But they did substitute
material and objective explanations ofthe structure and design ofthe universe
for the older mythical stories. They offered a reasoned approach in place of
the fanciful and uncritical accounts of the poets and they defended their
contentions by reason. At least these men dared to tackle the universe with
their minds and refused to rely upon gods, spirits, ghosts, devils, angels, and
other mythical agents.
of demons and gods but in all the acts and the thoughts of men, in all
handicrafts and music."
The reduction of music, for example, to simple relationships among
numbers became possible for the Pythagoreans when they discovered two
facts: first, that the sound caused by a plucked string depends upon the
Iength of the string; and second, that harmonious sounds are given off by
equally taut strings whose lengths are to each other as the ratios of whole
numbers. For example, a harmonious sound is produced by plucking two
equally taut strings, one twice as long as the other. In our language, the
interval between the two notes is an octave. Another harmonious combina-
tion is formed by two strings whose lengths are in the ratio 3 to 2; in this case
the shorter one gives forth a note called the fifth above that given off by the
first string. In fact, the relative lengths in every harmonious combination of
plucked strings can be expressed as ratios ofwhole numbers. The Pythagoreans
also developed a famous Greek musical scale, Though we shall not devote
space to the music of the Greek period, we note that many Greek mathe-
maticians, including Euclid and Ptolemy, wrote on the subject, especially on
harmonious combinations of sounds and the construction of scales.
The Pythagoreans reduced the motions of the planets to number
.relations. They
believed that bodies moving in space produce sounds; per-
haps this was suggested by the swishing ofan object whirled on the end ofa
string. They believed, further, that a rapidly moving body gives forth a
higher note than one that moves slowly. Now according to their astronomy
the greater the distance of a planet from the earth the more rapidly it
moved. Hence the sounds produced by the planets, which we do not hear
because we are accustomed to them from birth, varied with their distances
from the earth and all harmonized. But since this "music of the spheres,,,
like all harmony, reduced to no more than number relationships, so did the
motions of the planets.
The Pythagoreans and probably Pythagoras himself wanted notjust to
observe and describe the heavenly motions but to find regularity in them.
The idea of uniform circular motion, seemingly obvious in the case of the
moon and sun, suggested that all the planetary motions were explainable in
terms of uniform circular motions. The later Pythagoreans made a more
striking break with tradition; they were the first to believe that the earth was
spherical. Moreover, because l0 was their ideal number, they decided that
the moving bodies in'the heavens must be l0 in number. First, there was a
central fire around which the heavenly bodies, including the earth, moved.
They knew five planets in addition to the earth. These six bodies, the sun,
the moon, and the sphere to which the stars were attached made only 9
moving bodies. Hence they asserted the existence ofa tenth one, called the
counter-earth, which also revolved around the central fire. We cannot see
this tenth one because it moves at exactly the same speed as the earth on the
THE BELIEF IN MATHEMATICAL DESIGN 149
opposite side of the central fire and also because the inhabited part of earth
faces away from the central fire. Here we have the first theory to put the earth
in motion. However, the Pythagoreans did not assert the rotation of the
earth; rather, the sphere of fixed stars revolves about the center ofthe
universe.
The belief that the celestial bodies are eternal, divine, perfect, and
unchangeable and that the sublunar bodies, that is the earth and (according
to the Greeks) the comets, are subject to change, decomposition, decay, and
death may also have come from the Pythagoreans. The docrine of uniform
circular motion and the distinction between celestial and sublunar bodies
became embedded in Greek thought'
Other features of nature also "reduced" to number. The numbers l, 2,
3, and 4, the tetractys, were especially valued because they added up to 10'
In fact the Pythagorean oath is reported to have been: " I swear in the name
ofthe Tetractys which has been bestowed on our soul. The source and roots
of the everflowing nature are contained in it." The Pythagoreans asserted
that nature was composed of fournesses; for example, point, line, surface,
and solid, and the four elements, earth, air, fire, and water' The four
elements were also central in Plato's natural philosophy' Because 10 was
ideal, l0 represented the universe. The ideality of l0 required that the
whole universe be describable in terms of 10 categories ofopposites: odd and
even, bounded and unbounded, good and evil, right and left, one and many,
male and female, straight and curved, square and oblong, Iight and darkness,
and rest and motion.
Clearly, Pythagorean philosophy mingled serious thoughts with what
we would consider fanciful, useless, and unscientific doctrines' Their obses-
sion with the importance of numbers resulted in a natural philosophy that
certainly had little correspondence with nature. But they did stress the under-
standing of nature, not, like the Ionians, through a single substance, but
through the formal structure of number relationships. Moreover they and the
Ionians both saw that underlying mere sense data there must be a harmoni-
ous account of nature.
We can now see why the discovery of incommensurable lengths was so
disastrous to Pythagorean philosophy: a ratio of incommensurable lengths
could not be expressed as a ratio ol whole numbers. In addition, they had
believed that a line is made up of a finite number of points (which they
identified with physical particles) ; but this could not be the case for a length
such as l/2. Their philosophy, based on the primariness of the whole
numbers, would have been shattered if they had accepted irrationals as
numbers.
Because the Pythagoreans "reduced" astronomy and music to number,
these subjects came to be linked to arithmetic and geometry; these four were
regarded as the mathematical subjects. They became and remained part ol
I5O THE GREEK RATIONALIZATION OF NATURE
the school curriculum even into medieval times, when they were called,
collectively, "the quadrivium." As we have noted, the Pythagorean interest
in arithmetic (i.e. the theory of numbers) was due not to the purely aesthetic
value ofthat subject but to a search for the meaning ofnatural phenomena in
numerical terms ; and this value caused the emphasis on special proportions
and on triangular, square, pentagonal, and higher forms into which numbers
could be arranged. Further, it was the Pythagorean natural philosophy
centering about number that gave the subject importance with such men as
Nichomachus. In fact, modern science adheres to the Pythagorean emphasis
on number-though, as we shall see, in a much more sophisticated form-
while the purely aesthetic modern theory of numbers derives from Pythag-
orean arithmetic per se.
The philosophers who came chronologically between the Pythagoreans
and Plato were equally concerned with the nature of reality but did not
involve mathematics directly. The arguments and views of men such as
Parmenides (5th cent. a.c.), Zeno (5th cent. a.c.), Empedocles (c.484-c.424
a.c.), Leucippus (c. 44O r.c.), and Democritus (c. 460-c.370 r.c.) were, like
tlose of their Ionian predecessors, qualitative. They made broad assertions
about reality that were, at best, barely suggested by observation. Neverthe-
less, each affirmed that nature is intelligible and that reality can be grasped
by thought. Each was a link in the chain that led to the mathematical
investigation of nature. Leucippus and Democritus are notable because they
were the most explicit in affirming the doctrine of atomism. Their common
philosophy was that the world is composed of an infinite number of simple,
eternal atoms. These differ in shape, size, order, and position, but every
object is some combination of these atoms. Though geometrical magnitudes
are infinitely divisible, the atoms are ultimate indivisible particles. (The
word atom in Greek means indivisible.) Hardness, shape, and size are physi-
cally real properties of the atoms. All other properties, such as taste, heat, and
color are not in the atoms but in the perceiver I thus sensuous knowledge is
unreliable because it varies with the perceiver. Like the Pythagoreans, the
atomists asserted that the reality underlying the constantly changing diversity
of the physical world was expressible in terms of mathematics and, moreover,
that the happenings in this world were shictly determined by mathematical
laws.
Plato, the foremost Pythagorean next to Pythagoras, was the most
influential propagator of the doctrine that the reality and intelligibility of
the physical world can be comprehended only through mathematics. For
him there was no question that the world was mathematically designed, for
" God eternally geometrizes." The world perceived by the senses is confused
and deceptive and in any case imperfect and impermanent. Physical knowl-
edge is unimportant, because material objects change and decay; thus the
direct study of nature and purely physical investigations are worthless. The
THE BELIEF IN MATHEMATICAL DESIGN I5I
physical world is but an imperfect copy of the ideal world, the one that
Lathematicians and philosophers should study. Mathematical laws, eternal
and unchanging, are the essence of reality.
Plato went further than the Pythagoreans in wishing not merely to
understand nature through mathematics but to substitute mathematics for
nature itself. He believed that a few penetrating glances at the physical world
would supply some basic truths with which reason could then carry on
unaided. From that point there would be no nature, just mathematics, which
would substitute for physical investigations as it does in geometry'
Plato's attitude toward astronomy illustrates his position on the knowl-
edge to be sought. This science is not concerned with the movements of the
visible heavenly bodies. The arrangement ofthe stars in the heavens and their
apparent movements are indeed wonderful and beautiful to behold, but mere
observation and explanation of the motions fall far short of true astronomy'
Before we can attain to the latter we " must leave the heavens alone, " for
true astronomy deals with the laws of motion of true stars in a mathematical
heaven of which the visible heaven is but an imperfect expression' Plato
encourages devotion to a theoretical astronomy, whose problems please the
mind, not the eye, and whose objects are apprehended by the mind, not
visually. The varied figures that the sky presents to the eye are to be us€d
only as diagrams to assist the search for the higher truths. The uscs of
astionomy in navigation, calendar-reckoning, and the measurement of time
were alien to Plato.
Plato's views on the role of mathematics in astronomy are an intcgral
part ofhis philosophy, which held that there is an objective, universally valid
ieality consisting of forms or ideas. These realities were independent ofhuman
beings and were immutable, eternal, and timeless. We become aware of
these ideas through recollection or anamnesis I although they are pr€sent in
the soul, it must be stimulated to recall them or fetch them up from its
depths. These ideas are the only reality. Included among them but occuPytng
Lr."" rank are mathematical ideas, which are regarded as interrnediate
"between the sensible world and such higher ideas as goodness, truth, justice,
and beauty. In this comprehensive philosophy, mathematical ideas played a
double role; not only were they part of reality themselves but, as we have
already pointed out in Chapter 3, they helped train the mind to view eternal
ideas. As Plato put it in Book YII of Tlu Rtpublir, the study of geomery
made easier the vision of the idea of goodness: " Geometry will draw the
soul toward truth, and create the spirit of philosoPhy. . . ."
Aristotle, while deriving many ideas from his teacher Plato, had a quite
diflerent concept of the study of the real world and ofthe relation of mathe-
matics to reality. He criticized Plato's otherworldliness and his reduction of
science to mathematics. Aristotle was a physicist; he believed in material
things as the primary substance and source of reality. Physics and science
I52 TIIE GREEK RATIONALIZATION OF NATURE
generally must study the physical world to obtain truths; genuine knowledge
is obtained from sense experience by intuition and abstraction. Then reason
can be applied to the knowledge so obtained.
Matter alone is not significant. As such it is indeterminate, simply the
potentiality of form; matter becomes significant when it is organized into
various forms. Form and the changes in matter that give rise to new forms
are the interesting features of reality and the real concern of science.
According to Aristotle matter is not, as some earlier Greeks believed,
composed of one primitive substance. The matter r,ve see and touch is com-
posed of four basic elements: earth, \ /ater, fire, and air. Also, each element
has its own characteristic qualities. Earth is cold and dry; water is cold and
moist; air is hot and moist; and fire is hot and dry. Hence the qualities of
any given object depend upon the proportions ofthe elements that enter into
it; and thereby solidity, hardness, coarseness, and other qualities are
determined,
The four elements have other qualities. Earth and water have gravity;
air and fire have levity. Gravity causes an element to seek to be at rest at the
center ofthe earth; levity causes it to seek the heavens. Thus by knowing the
proportions of the elements that enter into a given object, one can also
determine its motion,
Aristotle regarded solids, fluids, and gases as three different types of
matter, distinguished by the possession ofdifferent substantial qualities. The
transition from solid to fluid, for example, meant the loss of one quality and
the substitution ol another. Thus changing mercury into rigid gold involved
taking from mercury the substance that possessed fluidity and substituting
some other substance.
Science also had to consider the causes of change. For Aristotle there
were four types of causes. The first was the material or immanent cause; for a
statue made of bronze, bronze is the immanent cause. The second was the
formal cause; for the statue it is the design or shape. The formal cause of
harmony is the pattern of 2 to I in the octave. The third cause was the
eflective cause, the agent or doer; the artist and his chisel are the effective
cause of the statue. The fourth was the final cause, or the purpose that the
phenomenon served; the statue serves to please people, to offer beauty.
Final cause was the most important of the four because it gave the ultimate
reasons for events or phenomena. Everything had a final cause.
Where was mathematics in this scheme of things ? The physical sciences
were fundamental to the study of nature, and mathematics helped by
describing formal properties such as shape and quantity. It also provided
explanations of facts observed in material phenomena. Thus geometry
provided the reasons for facts provided by optics and astronomy, and arith-
metical proportions could give the reasons for harmony. But mathematics
was definitely an abstraction from the real world, since mathematical objects
TIIE BELIEF IN MA.I'I{EMATICAL DESIGN I53
insofar as they helped to understand the observable world, said that we must
start from principles that are known and manifest to the mind and then
proceed to analyze things found in nature. We proceed, he said, from
universals to particulars, from man to men, iust as children call all men
father and then learn to distinguish. Thus even the abstractions made from
concrete objects presuppose some general principles emanating from the
mind. This doctrine, the power of the mind to yield first prin-ciples, was
overthrown in the seventeenth century,
moved along what can be called the equator of that sphere; that is' the
axis
of rotation ias perpendicular to the circular Path of the body' However'
while rotating on itt u*it, this sphere was being carried along by the rotation
of the next of th" sphe.es by the following device' Imagine that
"orr"e.rtric the
the axis of rotation of the firsi sphere is extended at each end to reach
second sphere and that its endpoints are fixed on that second sphere' If'
now, the second sphere rotates about an axis of its own, then this sphere
willcarrytheaxisofthefirstspherearoundwhilethelatterrotatesaboutthat
axis. The axis of the second sphere is in turn carried around by the
rotation
of a third sphere on its own axis. Eudoxus found that, for the sun and moon'
a combina'tion of three spheres sufficed to reproduce the actual motions as
planet a fourth sphere was required' to
viewed from the earth. For each
which the third was related in the manner just described' The outermost
sphereofeachcombinationrotatedonanaxisthroughthecelestialpoles
once in.ach 24 hours. In all, Eudoxus used 27 spheres' Their axes
ofrotation'
speeds of rotation, and radii were chosen to make the theory fit as well as
Figure 7. I
that of the moon. He was far from the truth here; the correct number is
2,296,000. He also found the ratio of the diameter of the sun to the diameter
of the earth to be between l9/3 and 43/6; but the correct ratio is about
107.
Aristarchus is equally famous for having been the first to propose the
heliocentric hypothesis-that the earth and the planets all revolve in circles
around the fixed sun. The stars too are fixed and their apparent motion is
due to the rotation of the earth on its axis. The moon revolves about the
earth. Though, as we know today, Aristarchus had the right idea, it was not
accepted for many reasons. For one thing, Greek mechanics (see below),
already well developed by Aristotle, could not account for objects staying
on a moving earth. According to Aristotle, heavy objects seek the center of
the universe. This principle accounted for the fall of objects to earth as long
as it was the center of the universe; but if it moved, the objects would stay
behind. That argument was used by Ptolemy against Aristarchus and, in
fact, was used later against Copernicus because the prevailing system of
mechanics was still Aristotle's. Ptolemy also said that the clouds would lag
behind a moving earth. Further, Aristotle's mechanics required a force to
keep earthly objects in motion and there was no apParent force, We do not
know how Aristarchus answered these arguments.
Another argument advanced against Aristarchus was that if the earth
were in motion its distance from the fixed stars would vary, but it apparently
does not. To this Aristarchus gave the correct rebuttal; he said that the
radius ofthe sphere of fixed stars is so large that the earth's orbit is too small
to matter. Aristarchus' heliocentric idea was rejected by many because it
was impious to identify the corruptible matter of the earth with the incor-
ruptible matter of the heavenly bodies. The hypothesis that the planets move
about the sun in circles is, of course, unsatisfactory, because the motion is
actually more complicated. But the heliocentric idea could have been refined,
as Copernicus did later. It was, however, too radical for Greek thought.
The founder of quantitative mathematical astronomy is Apollonius.
He was called Epsilon because the symbol € was used to denote the moon,
and much of his astronomy was devoted to the motion of that body. Before
considering his work and that of Hipparchus and Ptolemy, to which it is
closely related, we shall examine the basic scheme that had entered Greek
astronomy between the times of Eudoxus and Apollonius, the scheme of
epicycle and deferent. In this schemc a planet P moves at a constant speed
on a circle (Fig. 7.2) with center S, while S itself moves with constant speed
on a circle with center at the earth E. The circle on which S moves is called
the deferent; the circle on which P moves is called the epicycle. The point S
for some planets is the sun but in other cases it is just a mathematical Point.
The direction of the motion of P may agree with or be opposite to the direc-
tion of motion of S. The latter is the case for the sun and moon.
r58 THE GREEK RATIONALIZATION OF NATURE
Figure 7.2
Figure 7.3
6 l/2 minutes too long), and a catalogue ofabout a thousand stars. He found
that the ratio of the distance to the moon to the radius of the earth was67.74;
the modern figure is 60.4. He calculated the moon's radius to be 1/3 the
earth's radius; the present figure is 27/100.
Ptolemy extended the work of Hipparchus by further improving the
mathematical descriptions of the motions of all the heavenly bodies. The
extended theory, presented in the Almagest, offers a complete exposition of
the geocentric theory of deferent and epicycle, which has come to be known
as the Ptolemaic theory.
To make the geometrical account fit observations, Ptolemy also intro-
duced a variation on epicyclic motion known as uniform equant motion. In
this scheme (fig. 7.3) the planet moves on an epicycle with center Q while
Q moves on a circle with center C which is not the earth but somewhat offset.
He fixed the speed ofQ by introducing the point rR such that EC : CR ar:d
required that i.QRT increase uniformly. Thus Q moves with uniform
angular velocity, but not with uniform linear velocity.
The approach and understanding that the Greek astronomers attained
are thoroughly modern. The Alexandrian Greek astronomers' notably
Hipparchus and Ptolemy, made observations of their ownl in fact, Hip-
parchus did not trust many of the older Egyptian and Chaldean observations
and repeated them. The classical and Alexandrian astronomers not only
constructed theories but fully realized that these theories were not the true
design but just descriptions that fit the observations. Ptolemy says in tlre
AlmagestL that in astronomy one ought to seek as simple as possible a mathe-
matical model. These men, unlike other Greeks, did not look for some physical
explanation of the motions. On this point Ptolemy says,2 "After all, generally
speaking, the cause ofthe first principles is either nothing or hard to interPret
in its nature." But his own mathematical model was later taken as the
literal truth by the Christian world.
5. Geography
Another science that received its foundation in Greek times is geography.
Though a few classical Greeks such as Anaximander and Hecataeus of
Miletus (died a. 475 a.c.) made maps of the earth as it was known at that
time, it was the Alexandrians who made the great strides in geography,
They measured or calculated distances along the earth, the heights of
mountains, the depths ofvalleys, and the extent ofthe seas. The Alexandrians
were especially stimulated to study geography because the Greek world had
widened.
The first great Alexandrian geographer was Eratosthenes of Cyrene
library at Alexandria, a mathematician,
(c. 284-c. 192 n.c.), director of the
poet, philosopher, historian, philologist, chronologist, and by reputation one
of the most learned men of antiquity. He studied in Plato's school in Athens
and was invited to Alexandria by Ptolemy Euergetes. Eratosthenes worked
at Alexandria until blindness overtook him in his old age; because of this
afliction he starved himself to death.
Eratosthenes collected the available geographical knowledge and made
numerous calculations of distances on the earth between significant places
(such as cities). His most famous calculation is the length of the circumference
of the earth. At noon on the summer solstice, the sun was observed to be
practically overhead at Syene, the city that today is called Aswan (Fig. 7.4).
(This was confirmed by observing that the sun shone directly down a well
there.) At the same time, in Alexandria, which is (within t') on the same
meridian as Syene but north of it, the angle between the overhead direction
for that location (O.B in the figure) and the direction of the sun (lD in the
figure) was observed to be 1/50 of 360'. The sun is so far from the earth that
SE and AD may be considered parallel. Hence {SOl is l/50 x 360'. This
means that arc Sl is 1/50 of the circumference of the earth. Eratosthenes
estimated the distance from Alexandria to Syene by using the fact that
GEOGRAPHY I6I
camel trains, which usually traveled 100 stadia a day, took 50 days to reach
Syene. llence this distance is 5000 stadia and the circumference of the
earth is 250,000 stadia. It is believed that a stadium was 157 meters so that
Eratosthenes' result is 24,662 miles. This result was far more accurate than
all previous estimates.
Eratosthenes wrote the Geography, in which he incorporated the methods
and results of his measurements and calculations. It includes explanations of
the nature and causes of the changes that had taken place on the earth's
surface. He also made a map of the world.
Scientific map-making became part of the work of geography. Hip-
parchus is generally credited with introducing latitude and longitude, though
the scheme was known earlier. The use of latitude and longitude, ofcourse,
permitted accurate description of locations on the earth. Hipparchus did
invent orthographic projection, in which "light rays" from infinity project
the earth on a plane (Fig. 7.5). Our view of the moon, for example, is
practically orthographic. This method enabled him to map a portion of the
earth onto a flat surface.
Ptolemy in his Planisphaerium, and probably Hipparchus before him,
used the method of stereographic projection' A line from O (Fig' 7'6)
through P on the earth's surface is continued until it hits the equatorial
plane or a tanBent plane at the opposite pole' Hipparchus supposedy used
Figure 7.6
r62 THE GREEK RAflONALIZATION OF NATURE
,..-._--.\
Figure
the latter and Ptolemy the equatorial plane. Thus points on the sphere are
transferred to a plane. In this scheme all points on the map have true
direction from the center of the map. Also angleS are preserved locally
(conformal mapping), though Ptolemy does not mention this. The meridians
and the parallels of latitude are therefore at right angles. Circles on the
sphere become circles on the plane, but area is not preserved. Ptolemy
himself invented conical projection, that is, the projection ofa region on the
earth from the center of the sphere onto a tangent cone (Fig. 7.7).
In his Geographia, a work in eight books, Ptolemy teaches methods of
map-making. Chapter 24 of Book I is the oldest work extant that is devoted
by title and contents to mapping a sphere onto a plane. The entire Geographia
is the first atlas and gazeteer. It gives the latitude and longitude of 8000
places on the earth and was the standard reference for hundreds of years.
6. Mechan'ics
The Greeks initiated the science of mechanics. ln his Physirs, Aristotle put
together a theory of motion that is the high point of Greek mechanics. Like
all of his physics, his mechanics is based on rational, seemingly self-evident
principles only slightly checked by or drawn from observation and experiment.
There are, according to Aristotle, two kinds of motion, natural and
violent or man-made. The heavenly spheres possess only natural motion,
which is circular. As for earthly objects, he taught that a natural motion (as
opposed to violent motions caused by throwing or pulling a body from one
place to another) arises from the fact that each body has a natural place in
the universe where it is in equilibrium or at rest. Heavy bodies have their
natural place at the center of the universe, which is the center of the earth.
Light bodies, such as gases, have their natural place in the sky. Natural
motion results when a body seeks its natural place. In natural motions earthly
objects move up or down in straight lines. Ifan earthly object were not in its
natural place it would seek that place as soon as possible. Violent motions,
that is, man-made ones, are composed of circular and rectilinear parts.
Thus a stone thrown out and up follows a straight line path up and a straight
Iine path down.
MEcHANrcs 163
5. In figures which are unequal but similar the centers of gravity will be
similarly situated.. . .
7. In any figure whose perimeter is concave in the same direction the center
of gravity must be within the figure.
Proposition 4. If two equal weights have not the same center of gravity, the
center of gravity of both taken together is at the middle point of the lines
joining their centers of gravity.
Propositions 6 and 7. Two magnitudes whether commensurable or incom-
mensurable balance at distances reciprocally proportional to the magnitudes.
#" *#"'
Let F be the actual weight of the water displaced by the crown. Then
#" *#": o
or
',to2:lz-
F-F,
F.
Thus Archimedes was able to determine the ratio of the gold to the silver in
the crown without destroying the crown. Vitruvius' account of this story is
that Archimede used. volumes of displaced water instead of weights. In this
case, the fl Fr, and F, above are, respectively, the volumes of water dis-
placed by the crown, a weight llz of pure gold, and a weight I,/ of pure silver.
The same algebra follows but Proposition 7 is not involved. Archimedes did
find that the gold had been debased with silver.
To gain some appreciation of the mathematical and physical complexity
of the problems treated by Archimedes in this work, we shall cite one of the
simple propositions of Book II.
Proposition 2. Ifa right segment ofa paraboloid of revolution whose axis is
not greater tha,l 3pl4 [p is the principal parameter or latus rectum of the
generating parabola], and whose specific gravity is less than that ofa fluid,
be placed in the fluid with its axis inclined to the vertical at any anBle, but
so that the base of the segment does not touch the surface of the fluid (Fig,
7.9), the segment of the paraboloid will not remain in that position but will
return to the position in which its axis is vertical.
The subject Archimedes is treating is the stability of bodies placed in
water. IIe shows under what conditions a body will, when placed in water,
either turn to or remain in a position of equilibrium. The problems are
clearly idealizations of how ships would behave when obliged to assume
different tilts in water.
7. Optics
Nort to astronomy, optics has been the most constantly pursued and most
successful of the mathematical sciences. It was founded by the Greeks.
Almost all of the Greek philosophers, beginning with the Pythagoreans,
speculated on the nature oflight, vision, and color. Our concern, however, is
with tfie mathematical accomplishments. The first of these is the assertion on
a priori grounds by Empedocles of Agrigentum in Sicily (c. 490 n.c.) that
light travels with finite velocity.
The first systematic treatments that we have are Euclid's Optits and.
Cataptrica.The Opticsis concerned with the problem ofvision and with the use
of vision to determine sizes of objects. Euclid begins with definitions (which
are really postulates), the first of which states (as had Plato) that vision is
possible because rays of light emitted by the eye travel along straight lines
and impinge on the object seen, Definition 2 states that the figure formed by
t67
Figure 7.10
the visual rays is a cone that has its vertex in the eye and its base at the
extremities of the object seen. Definition 4 says that of two objects the one
that determines a greater vertex angle of the cone appears greater. Then in
Proposition B Euclid proves that the apparent sizes of two equal and parallel
objects (,4.8 and, CD in Fig. 7.10) are not proportional to their distances from
the eye. Propositions 23 to 27 prove that the eye looking at a sphere really
sees less than half of it, and that the contour of what is seen is a circle.
Propositions 32 to 37 point out that the eye looking at a circle will see a circle
only if the eye is located on the perpendicular to the plane of the circle and
the perpendicular strikes the center. Euclid also shows how to calculate the
sizes of objects that are seen as images in a plane mirror. There are 58
propositions in the book.
The Catoptri.ca (theory of mirrors) describes the behavior of light rays
reflected from plane, concave, and convex mirrors and the effect of this
behavior on what we see. Like th,e Optics, it starts with definitions that are
really postulates. Theorem l, the law of reflection, is now fundamental in
what is called geometrical optics. It says that the angle A, which the incident
ray makes with the mirror (Fig. 7. t 1), equals the angle B, which the reflected
ray makes with the mirror. It is more customary today to say that {C : {D
and to speak of {Cas the angle of incidence and {D as the angle of reflec-
tion. Euclid also proves the law for a ray striking a convex or a concave
mirror by substituting a tangent for the mirror at the point where the ray
strikes.
Figure 7,1 I
168 TIIE GREEK RATIoNALIZATIoN oF NATURE
8. Astrology
Though astrology is not accepted as a science today, in earlier civilizations
it did have that standing. The astrology developed by the Alexandrian Greeks
ofabout the second century B.c. was different from the Babylonian astrology
of the Assyrian period. The latter merely drew from observations of the
BrBLrocRAPr{Y 169
positions of the planets some conclusions about the king and affairs of state.
There were no computations, and the appearance ol the sky at the moment
of birth played no role. However, Hellenistic or Alexandrian astrology was
personal; it predicted the future and fate of specific individuals on the basis
ofthe computed positions ofthe sun, moon, and the five planets in the zodiac
at the moment of birth. To evaluate these data an enormous body of doctrines
was built up.
Certainly the science was taken seriously by the Alexandrian Greels.
Ptolemy wrote a well-known work on the subject, the Quadrilartite ot Tetra-
biblos, or Four Books Concerning the hfruence o1f tlu Stars, in which he gave rules
for astrological predictions that \,vere used for a thousand years.
The importance of astrology in the history of science lies in the motiva-
tion it supplied for the study of astronomy, not only in Greece but in India,
Arabia, and medieval Europe. Astrology nourished astronomy much as
alchemy nourished chemistry. Curiously, errors in astrological predictions
were ascribed to errors in astronomy, not to the unreliability ofthe astrological
doctrines.
Alexandrian Greece witnessed the beginnings of the application of
mathematics to medicine, peculiarly enough through the medium of
astrology. Doctors, called iatromathematicians, employed astrological signs
to decide courses of treatment. Galen, the great physician of Greek times, was
a firm believer in astrology, perhaps excusably so since Ptolemy, the most
renowned astronomer, was also. This connection between mathematics and
medicine became stronger in the Middle Ages.
Our account of Greek science, in view of our concern with mathematics,
has dealt with the mathematical sciences. The Greeks carried on other
'investigations, in areas where mathematics, at least at that time, played no
role. Moreover, they performed experiments and made observations, the
latter particularly in astronomy. Nevertheless their vital achievement is that
they established the value of mathematics in the scientific enterprise. The
Platonic dialogue Philebus first gave expression to the thought that each
science is a science only so far as it contains mathematics; this doctrine gained
immense support from the Greek accomplishments. Moreover, the Greeks
produced ample evidence that nature is mathematically designed. It was
their vision of nature and their initiation of the mathematical investigation
of nature that inspired the creation of mathematics, in Greek times and in all
succeeding centuries.
Bibliography
Apostle, H. G.: Aristotle's Philosoplry of Matlumatics, University of Chicago Press,
1952.
Berry, Arthur: A Short Hi.story of Asbonong, Dover (reprint), 1961, Chaps. l-2.
r7o TrrE GREEK RATTONALTZATTON OF NATI,RE
Tlu Workr of Archinudcs, Dover (reprint), 1953, pp. 189-220 and 253-300.
Jaeger, Werner: Paidcia,3 vols., Oxford University Press, 1939-44.
Pannekoek,
-: A.: A History of Astrononry, John Wiley and Sons, 1961.
Sambunky, S.'. Tlu Plrysital World of tlu Gracfu, Routledge and Kegan Paul, 1956.
Santillana, G. del Tlu Origins of Scicntfic Thought from Anatimandcr to Proclus,
600 B.C. to 300 A.D., University of Chicago Press, 1961.
Sarton, George: A Hirtory of Scimce, Harvard University Press, 1952 and 1959,
Vols. I and 2.
Vcr Eecle, Pariz Etulide, L'Optiqua et la catofitriquc, Albert Blanchard, 1959,
Wcdberg, Anders: Plalo's Phihsoplry oJ Matlumatirs, Almqvist and Wiksell, 1955.
B
The Demise
of the Greek World
He who understands Archimedes and Apollonius will admire
less the achievements of the foremost men of later times'
G. W. LEIBNIZ
r7r
172 THE DEMTSE OF THE GREEK WORLD
secure truths they had to start from truths, and be sure not to assume any
unwarranted facts. Hence they stated all their axioms explicitly and in
addition adopted the practice ofstating them at the very outset oftheir work
so that they could be examined critically at once.
Beyond conceiving this highly remarkable plan to establish secure
knowledge, the Greeks showed a sophistication one could hardly expect
from innovators. Their recognition that the concepts must be free of con-
tradiction and that one cannot build a consistent structure by working with
nonexistent figures (such as a ten-faced regular polyhedron) shows an almost
superhuman and certainly unprecedented keenness of thought. As we now
know, their method of establishing the existence of concepts, so they could
deal with them, was to demonstrate their constructibility with straight-
edge and compass.
The power of the Greeks to divine theorems and proofs is attested to by
the fact that Euclid's E/ements contains 467 propositions and Apollonius' Canic
Sections contains 487, all derived from the I0 axioms in the Elements. No doubt
secondary in importance, and perhaps secondary in intent, is the coherence
that the deductive structures provided. Were the same results obtainable
from numerous different-though equally reliable-sets of axioms, they
might have presented far less manageable and assimilable knowledge.
The Greek contribution to the content of mathematics-plane and solid
geomery, plane and spherical trigonometry, the beginnings of the theory of
numbers, the extension of Babylonian and Egyptian arithmetic and algebra
enormous, especially in view of the small number of people involved and
-is
the few centuries of extensive activity. To these contributions we must add
the geomerical algebra, which awaited only the recognition of irrational
numbers and translation into symbolic language to become the basis of
considerable elementary algebra. The treatment ofcurvilinear figures by the
method of exhaustion, though part of their geometry, also warrants special
mention because it is the beginning of the calculus.
An equally vital contribution and inspiration to later generations was
the Greek conception of nature. The Greeks identified mathematics with the
reality of the physical world and saw in mathematics the ultimate truth about
the structure and design ofthe universe. They founded the alliance between
mathematics and the disinterested study of nature , which has since become
the very basis of modern science. Moreover, they went far enough in rational-
izing nature to establish firmly the conviction that the universe is indeed
mathematically designed, that it is controlled, lawful, and intelligible to man.
The aesthetic appeal of mathematics was by no means overlooked. In
Greek times the subject was also valued as an art; beauty, harmony, simplic-
ity, clarity, and order were recognized in it. Arithmetic, geometry, and
astronomy were considered the art of the mind and music for the soul. Plato
delighted in geometry; Aristotle would not divorce mathematics from
TTIE LIMITATIONS OF GREEK MATHEMATICS I73
that we have analytic geometry and the calculus. When one considers how
hard Archimedes worked to find the area ofa parabolic segment or the area
under an arc of his spiral and compares this with the modern calculus
treatment, one appreciates the effectiveness of the calculus.
Not only did the Greels restrict mathematics largely to geometry; they
even Iimited that subject to figures that could be obtained from the line and
circle. Accordingly, the only surfaces admitted were those obtained by
revolving lines and circles about an axis, for example the cylinder, cone, and
sphere, formed by the revolution of a rectangle, triangle, and circle, respec-
tively, about a line. A few exceptions were made: the plane, which is the
analogue of a line; the prism, which is a special cylinder; and the pyramid,
which results from decomposition of a prism. The conic sections were intro-
duced by cutting cones by a plane. Curves such as the quadratrix of Hippias,
the conchoid of Nicomedes, and the cissoid of Diocles were kept on the fringe
of geometry; they were called mechanical, rather than geometrical.
Pappus' classification of curves shows an attempt to keep within fixed
bounds. The Greeks, according to Pappus, distinguished curves as follows :
Plane loci or plane curves \ivere those constructed from straight lines and
circles; the conics were called solid loci because they originated from the
cone; the linear curves, such as quadratrices, conchoids, cissoids, and spirals
constituted the third class. Similarly, they distinguished among plane, solid,
and linear problems. Plane problems were solved by means of straight lines
and circles; solid problems were solved by one or more of the conic sections.
Problems that could not be solved by means of straight lines, circles, or conics
were called linear, because they used lines (curves) having a more compli-
cated or less natural origin than the others, Pappus stressed the importance
of solving problems by means of plane or solid loci because then the criterion
for the possibility ofa real solution can be given.
Why did the Greels limit their geometry to line and circle and figures
readily derived from them? One reason was that this solved the problem of
establishing the existence of geometrical figures. As we saw, Aristotle, in
particular, pointed out that we must be sure that the concepts introduced are
not self-contradictory; that is, they must be shown to exist. To settle this
point, the Greeks, in principle at least, admitted only those concepts that
were constructible. Line and circle were accepted as constructible in the
postulates but all other figures had to be constructed with line and circle.
However, the use of constructions to establish existence was not carried
over to figures in three dimensions, Here the Greeks apparently accepted
what was intuitively clear, for example the existence of figures of revolution
such as sphere, cylinder, and cone. Sections of these figures by planes
produced curves such as the conic sectionsl thus even figures in the plane
whose existence was not established were accepted-but reluctantly.
Descartes notes this point near the beginning of Book lI of La Gtomttrie:
THE LIUTTATIONS OF GREEK MATIIEMAIICS I75
.,It is true that the conic sections were never freely received into ancient
geometry...."
- Another reason for the limitation to line, circle, and figures derived
from them stems from Plato, according to whom ideas had to be clear to be
acceptable. While the whole number seemed to be acceptable as a clear idea
in itselfi even though never explicitly defined by the Greeks, the geometric
figures irad to be made precise. Lines and circles and figures derived from
thlm were clear, whereas curves introduced by mechanical instruments
(other than straightedge and compass) were not, and- so were not admissible'
ihe restriction 1o figures produced a simple, well-arranged, har-
"liu"
monious, and beautiful geometry.
By insisting on a unity, a comPleteness, and a simplicity for their
g"o-"try, and by separating speculative thought from utility, cla$ical Gre€k
!"o*"try-becuml . ii-ited accomplishment. It narrowed people's
vision and^
closed their minds to new thoughts and methods. It carried ldithin itself
the seeds ofits own death. The narrowness of its field of action, the exclusive-
ness ofits point of view, and the aesthetic demands on it might have arrcsted
its development, had not the influences of the Alexandrian civilizztion
broadened the outlook of Greek mathematicians.
The philosophical doctrines of the Greeks limited mathematics in
another way. Thioughout the classical period they believed that man does
not create tjre mathematical {iacts: they preexist. He is limited to ascef,taining
and recording them. Inthe Tluactctus Plato compares the search for knowlcdge
to a bird-hunl pursued in an aviary. The birds, already captive, need only to
be seized. This belief about the nature of mathematics did not prevail'
The Greeks failed to comprehend the infinitely large, the infinitdy
small, and infinite processes. They " shrank before the silence of infnite
,pu""r." The Pythagoreans associated good and evil with the limited and
unlimited, .espi"tit Aristotle says the infinite is imperfect, unfinished,
"ly.
and therefore unthinkable; it is formless and codused' Only as objects are
delimited and distinct do they have a nature.
To avoid any assertion about the infinitude of the straight line, Euclid
says a line segment (he uses the word "line" in this sense) can be extended
as far as t iJnwillingness to involve the infinitely large is seen also
in Euclid's"""irr.ry.
statement of the parallel axiom. Instead of considering two lines
that extend to infinity and giving a direct condition or assumption under
which parallel lines might exist, his parallel axiom gives a condition under
which two lines will meet at some finite point.
The concept of the infinitely small is involved in the relation of points
to a line or the relation of the discrete to the continuous, and Zeno's pata-
doxes may have caused the Greeks to shy away from this subject. The relation-
ship of point to line bothered the Greeks and led Aristotle to separate the two.
fh""gh he admits points are on lines, he says that a line is not madc up of
,76 .r'HD DEMISE oF THE GREEK woRLD
points and that the continuous cannot be made up ofthe discrete (Chap. 3,
sec. I0). This distinction contributed also to the presumed need for separating
number from geometry, since to the Greeks numbers were discrete and
geometry dealt with continuous magnitudes.
Because they feared infinite processes they missed the limit process. In
approximating a circle by a polygon they were content to make the difference
smaller than any given quantity, but something positive was always left over.
Thus the process remained clear to the intuition; the limit process, on the
other hand, would have involved the infinitely small.
contributions in the new era were those of Ptolemy and Diophantus. The
gr,eat commentators Pappus and Proclus warrant attention, but they merely
close the record. The decline of this civilization, which for five or six cen-
turies made contributions far surpassing in er(tent and brilliance those of any
other, calls for orplanation.
Unfortunately, mathematicians are as subject to the forces of history as
the lowliest peasant. One has only to familiarize himself with the most super-
ficial facts about the political history of the Alexandrian Greeks of the
Christian era to see that not only mathematics but every cultrrral activity
was detined to suffer. As long as the Alo<andrian Greek civilization was
ruled by thc dynasty of the Ptolemies, it flourished. The first disaster was the
advent of the Romans, whose entire role in the history of mathematics was
that of an agent of destruction.
Before discussing their impact on the Alo<andrian Greek civilization,
let us note a few facts about Roman mathematics and the nature of the
Roman civilization. Roman mathematics hardly warrants mention. The
period during which the Romans figured in history extends from about
750 s.c. to l.o. 476, roughly the same period during which the Greek
civilization flourished. Moreover, as we shall see, from at least 200 s.c.
onward, tlre Romans were in close contact with the Greels. Yet in all of the
eleven hundred years there was not one Roman mathematician; apart from
a few details this fact in itself tells virtually the whole story of Roman
mathematics.
The Romars did have a crude arithmetic and some approximate
geometric formulas t"lr.at were later supplemented by borrowings from the
Alorandrian Greeks. Their symbols for the whole numbers are familiar to
us. To calculate with whole numbers, they used various forms of the abacus.
Calculations were also done with the fingers and with the aid of specially
prepared tables.
Roman fractions were in base 12. Special symbols and words were used
fot lll2,2ll2,.. .,llll2, 1124, 1136, ll48,l/96,. . .. The origin of the base
12 may be tJre relation of the lunar month to the year. The unit of weight,
incidcntalln was the a,r; one twelfth of this was the uncia, from which we get
our ounce and inch.
The principal use of arithmetic and geometry by the Romans was in
surveying, to fix city boundaries and plots of land for homes and temples.
Surveyors could calculate most of the quantities they sought using only
simple instruments and congruent triangles.
We do owe to the Romans an improvement in the calender. Up to the
time ofJulius Caesar ( 100-44 B.c.), the basic Roman year had 12 months,
totalling 355 days. An intercalary month of 22 or 23 days was used every
other year so that tlre average year consisted of 366 l/4 days. To improve
this calendar, Caesar called in Sosigenes, an Ale:<andrian, who advised a
THE DEMISE OF TIIE GREEK CTVILIZATION I79
year of 365 days and a leap year every 4 years. The Julian calendar was
adopted in 45 s.c.
From about 50 r.c. on, the Romans wrote their own technical books;
all of the basic material, however, was taken from Greek sources' The most
famous of these technical works isvitruvius' ten books on architecture, which
date from 14 a.c. Here, too, the material is Greek. Peculiarly, Vitruvius says
that the three greatest mathematical discoveries are the 3, 4, 5 right triangle,
the irrationaliiy of the diagonal of the unit square, and the Archimedean
solution of the crown problem. He does give other liacts involving mathe-
matics, such as the proportions of the parts of the ideal human body, some
harmonious arithmetical relationships, and arithmetic facts about the capac-
ities of catapults.
Amo"j the Romans the term "mathematics" came into disrepute
because thJastrologers were c alled matlumaticii, and astrology was condemned
by the Roman emperors. The emperor Diocletian ('l"o' 24$-316), distin-
guished between geometry and mathematics' The former was to be learned
Ind applied in the public service I but the "art of mathematics "-that is,
ast.oloja-was damnable and forbidden in its entirety. The " code of mathe-
matics-and evil deeds," the Roman law forbidding astrology, was also applied
in Europe during the Middle Ages. The Roman and christian emPerors
nonetheiess employed astrologers at their courts on the chance that thcre
might be some truth in their prophecies. The distinction between the terms
..geometer" lasted until well past the Renaissance.
..
rn'athematician " and
Even in the seventeenth and eighteenth centuries, "geometer" meant what
we mean by " mathematician."
The Romans were Practical People and they boasted of their practicdity'
They undertook and completed vast engineering Projects-viaducts, magni-
ficent roads that survive even today, bridges, public buildings, and land
surveys-but they refused to consider any ideas beyond the particular
applications they were making at the moment' The Roman attitude
"o.r"r"t"
toward maihematics is stated by Cicero: "The Greeks held the geometer in
the highest honor; accordingly, nothing made more brilliant Progres among
them ihan mathematics. But we have established as the limit of this art its
usefulness in measuring and counting."
The Roman emPerors did not suPPort mathematics as did the Ptolemys
of Egypt. Nor did the Romans understand pure science' Their failure to
a.""t"i mathematics is striking, because they ruled a worldwidc empire and
b"".r,,r" they did seek to solve Practical problems' The lesson one can learn
from the hisiory of the Romans is that people who scorn the highly theoretical
work of mathematicians and scientists and decry its usefulness are ignorant
of the manner in which important practical developments have arisen'
Let us turn to the role the Romans played in the political and military
history of Greece. After having secured control of central and northern Italy,
r80 TIIE DEMISE OF TIIE GREEK WORLD
they conquered the Greek cities in southern Italy and Sicily. (Archimedes,
we recall, contributed to the defense of Syracuse when the Romans attacked
the city and was killed by a Roman soldier.) The Romans conquered Greece
proper in 146 r.c., and Mesopotamia in 64 B.c. By intervening in the internal
strife in Egypt between Cleopatra, the last of the Ptolemy dynasty, and her .
BibliograplV
Cajori, Florian: A Histoty oJ Matlanati*,IVIacmillan, 1919, pp.63-68.
Gibbon, Edward: Tlu Dulitu and Fall of tlu Rnmaa Emlhc (maay editiors), Chaps.
20"21,29,29,92,9+.
Parsons, Edward Alexandcr: Ihr Ahxat drias Librury, Thrc Elsevici Pfess, 1952.
9
The Mathematics
of the Hindus and Arabs
As the sun eclipses the stars by its brilliancy, so the man of
knowledge will eclipae the fame of others in assemblies of the
people if he propotes algebraic problems, and still more if he
SOlveS tJrem, BRAHIIrIGUPTA
r83
r84 THE MATTIEMATICS OF THE HINDUS AND ARABS
We should note the phrase " reckoned like integers " in the above quotation.
Irrationals were treated as though they possessed the same properties as
integers. Thus if we had ihtegers c and d we would certainly write
(2) o*d:{GT7-Y:\/7-+-A4EA.
Now ilc : {a and d : {6, then (2) is exactly (l).
BhisLara also gives the following rule for the sum of two irrationals :
"The root of the quotient of the greater irrational divided by the lesser one
being increased by one; the sum being squared and multiplied by t}le
smallcr irational quantity is the sum of the two surd roots." This means,
for o<ample.
{3+,/r2:lU++r)'.s,
which yields 3fr. tte also gives rules for the multiplication, division, and
square rq)t of irrational expressions.
The Hindus were less sophisticated than the Greels in that they failed
to see the logical difficulties involved in the concept of irrational numbers.
Their interest in calculation caused them to overlook philosophic distinctions,
or distinctions based on principles that in Greek thought were fundamental.
But in blithely applying to irrationals procedures like those used for rationals,
they hclped mathematics progress. Moreover, their entire arithmetic was
completely independent of geometry.
The Hindus also made some progress in algebra. They used abbrevia-
tions of words and a few symbols to describe operations. As in Diophantus,
there was no symbol for addition; a dot over the subtrahend indicated sub-
traction; other operations were called for by key words or abbreviations;
thus rta from the word kdrana called for the square root of what followed.
For the unknowns, when more than one was involved, they had words that
denoted colors. The first one was called the unknown and the remaining
ones black, blue, yellow, and so forth. The initial letter of each word was
also used as a symbol. This symbolism, though not extensive, was enough to
classifr Hindu algebra as almost symbolic and certainly more so than
Diophantusl syncopated algebra. Problems and solutions were written in this
quasi-symbolic style. Only the steps were given; no reasons or proofs accom-
panied them.
The Hindus recognized that quadratic equations have two roots and
included negative roots as well as irrational roots. The three types of quad-
ratirs axz * bx : c, ax2 : bx + c, axz ! c : bx, a, D, c positive, separately
treated by Diophantus, were treated as one case pr2 + q, + r : 0, because
the Hindus allowed some coefficients to be negative. They used the method
of completing the square, which of course was not new with them. Since they
did not recognize square roots for negative numbers, they could not solve all
ARITHMETIC AND ALOEBRA, A.D. 2OO-I2OO I87
(4) !o: o, *
-++ Vii
az
Let us consider the positive value. We can now return to our original indeter-
minate equation, and since aq - bp : I we can write
ax+bg:c(aq-b?)
and by rearranging terms obtain
cq-x:y+cp.
ba
If we let , represent each of these fractions we have
(s) tc:cq-bt and, g:at-cp.
We may now assign integral values to l, and since all the other quantities are
integers, we thereby obtain integral values of x and y. The minor modifica-
tions to take care of the cases where aq - bp: -1, or when the original
equation is ax - b! : c, a:re readily devised. Brahmagupta gave solution (5),
though not of course in terms ofgeneral letters a, b, p, and q.
The Hindus also worked with indeterminate quadratic equations. They
solved the type
perimeter arrd a, b, c, and d are the sides' a formula which is correct only
for quadrilaterals inscribed in circles. They offered no geometric proofs l on
the whole they cared little for geometry.
In trigonometry the Hindus made a few minor advances. Ptolemy had
used the chords of arcs, calculated on the basis of 120 units in the diameter
of a circle. Varihamihira used 120 units for the radius. Hence Ptolemy's
table olchords became for him a table ofhalfchords, but still associated with
the full arc. Aryabhata then made two changes. First he associated the half
chord with half of the arc of the full chord ; this Hindu notion of sine was
used by all later Hindu mathematicians. Secondly, he introduced a radius of
3438 units. This number comes from assigning 360'60 units (the number of
minutes) to the circumference of a circle and using C : 2rr, with z approxi-
mated by 3.14. Thus in Aryabhata's scheme the sine of an arc of 30', that is,
the length of the half chord corresponding to an arc of 30o, was 1719. While
the Hindus used the equivalent of our cosine, they more often used the sine
of the complementary arc. They also used the notion of versed sine, or
I- cosine.
Since the radius of a circle now contained 3438 units, Ptolemy's values
for the chords were no longer suitable, and the Hindus recomputed a table of
half chords, starting from the fact that the half chord corresponding to an
arc of90" is 3438 and the hall chord corresponding to an arc of30'is 1719.
Then, by using rigonometric identities such as Ptolemy had established,
they were able to calculate the half chords of arcs at intervals of 3"45'. This
angle resulted from dividing each quadrant of 90' into 24 patts. It is note-
worthy that they used the identities in algebraic form, unlike the geometrical
arguments of Ptolemy, and made arithmetic calculations on the basis of the
algebraic relations. Their practice was, in principle, like ours.
The motivation for the trigonometry was astronomy, of which prac-
tically all of Hindu trigonometry was a by-product. Standard astronomical
works included the Sfirga Siddhlnta (System of the Sun, 4th cent') and the
Argabhatiya of Aryabhata (6th cent.). The major work was the Siddhantu
Siromani (Diadem of an Astronomical System) written by BhEskara in 1150.
Two chapters of this work are titled Lilaodtt (The Beautiful) and Vija'ganita
(Root Extraction) ; these are devoted to arithmetic and algebra.
Though astronomy was a primary interest in the period after e.o. 200,
the Hindus made no significant progress. They took over a minor Hellenistic
activity in arithmetical astronomy (of Babylonian origin), which predicts
planetary and lunar positions by extrapolation from observational data.
Even the Hindu words for center, minute, and other terms were just the
Greek words transliterated. The Hindus were only slightly concerned with
the geometrical theory of deferent and epicycle, though they did teach the
sphericity of the earth.
About the year 1200 scientific activity in India declined and progress
I9O TIIE MATHEMATICS OF THE HINDUS AND ARABS
4. The Arabs
Thus far the Arab role in the history of mathematics has been to deliver the
final blow to the Alexandrian civilization. Before starting their conquests,
they had been nomads occupying the region of modern Arabia. They were
stirred to activity and unity by Mohammed, and less than a century after his
death in 632, had conquered lands from India to Spain, including North
Africa and southern Italy. In 755 the Arab empire split into two independent
kingdoms, the eastern part having its capital at Bagdad and the western one
at Cordova in Spain.
Their conquests completed, the former nomads settled down to build a
civilization and a culture. Rather quickly the Arabs became interested in the
arts and sciences. Both centers attracted scientists and supported tleir work,
though Bagdad proved to be the greater; an academy, a library, and an
astronomical observatory were established there.
The cultural resources available to the Arabs were considerable. They
invited Hindu scientists to settle in Bagdad. When Justinian closed plato,s
Academy in e.o. 529, many of its Greek scholars went to penia, and the
Greek learning that flourished there became, a century later, part of
ARABIC ARTTHUETIC AND ALGEBRA I9I
the Arab world. The Arabs also established contacts with the Greels of
the independent Byzantine Empire; in fact the Arab caliphs bought Greek
manuscripts from the Byzantines. Egypt, the center of Greek learning in the
Alexandrian period, had been conquered by the Arabs, so the learning that
survived there contributed to the activity in the Arab empire. The Syrian
schools of Antioch, Emesa, and Damascus and the school of Nestorian
Christians at Edessa, which had become the chief repositories in the Near
East of Greek works after the destruction of Alexandria in 640, and christian
monasteries in the Near East, which also possessed these works, all came under
Arab rule. Thus the Arabs had control over, or access to, the men and culture
of the Byzantine Empire, Egypt, Syria, Persia, and the lands liarther east,
including India.
One speaks of Arabic mathematics, but it was Arabic in language
primarily. Most of the scholars were Greets, Christians, Persians, and Jews'
Ho*.r"., it is to the credit of the Arabs that after the period of conquest,
which was marked by religious fanaticism, they were liberal to other peoples
and sects and the infidels were able to function freely'
Fundamentally, what the Arabs possessed was Greek knowledge ob-
tained directly from Greek manuscripts or from Syrian and Hebrew versions'
All the major works became accessible to them. They obtained a copy of
Fiuclid,s Ehmcnlr from the Byzantines about 800 and translated it into Arabic.
Ptolemy's Matlumati.cal Syntaxis was translated into Arabic in 827 and to the
Arabs became a preeminent, almost divine book; it became known as the
Almagest, meaning the greatest work. They also translated Ptolemy's Tcfia-
bibtos and,this work on astrology was popular among them. In the course of
time the works of Aristotle, Apollonius, Archimedes, Heron, Diophantus, and
of the Hindus became accessible in Arabic. The Arabs then improved the
translations and made commentaries. It is these translations, some still
extant, that became available to Europe later, the Greek originals having
been lost. Until 1300 the Arab civilization was dynamic and its learning
became widespread.
HAD
II I
IV III
Figure 9.I
figurc 9.2
(8)
* PS
PS c:x
From (7) and (8) we have
b PS
(e)
x c-x
But from (7)
PS:;.-2
If we substitute this value of PS in (9) we see that x satisfies the equation
xa + bzx : bzc.
Khayyam also solved the type xs + a*2 = 6s, whose roots are deter-
ARABTC OEOMETRY AND TRTOONOMETRY I95
tangents for every 10 minutes of angles. Al-Birtrni gave the law of sines
for plane triangles and a proof.
The systematization of plane and spherical trigonometry in a work
independent of astronomy was achieved by Nasir-Eddin in his Treatise on tlu
Quadrilabral. This work contains six fundamental formulas for the solution
ofright spherical triangles and shows how to solve more general triangles by
what we now call the polar triangle. Unfortunately the Europeans did not
know Nasir's work until about 1450; until then trigonometry remained, as it
had been from its inception, an appendage of astronomy, in the texts as well
as in application.
The Arab scientific effort, though not original, was extensive; however,
we cannot do more here than note the continuation of lines inaugurated by
the Greels. Unlike the Hindus, the Arabs did take over Ptolemy's astronomy.
Astronomy was emphasized so that the hours for prayers could be accurately
known and so that Arabs throughout the vast empire could face Mecca
during prayers. Istronomical tables were enlarged i instruments were im-
proved; and observatories were built and used. As in India, practically all
mathematicians were primarily astronomers. Astrology also played a big role
in stimulating work in astronomy and, therefore, mathematics,
Another science pursued by the Arabs was optics. Alhazen, who was a
physicist as well as a mathematician, wrote the great treatise Kitab al-manazer
or Treailrg of Optics, which exercised an enormous influence. In it he stated
the full law ofreflection, including the fact that the incident ray, the reflected
ray, and the normal to the reflecting surface all lie in one plane. But, like
Ptolemy, he did not succeed in finding the law for the angle of refraction,
despite much effort and experimentation. He discussed spherical and
parabolic mirrors, lenses, the camera obscura, and vision. Optics was a
favorite subject with the Arabs because it lent itself to occult and mystic
thoughts. However, they made no important original contributions.
The Arab uses of mathematics were of the sort we have encountered
before. Astronomy, astrology, optics, and medicine (through astrology)
required it, though some of the algebra, as one of the Arab mathematicians
put it, was "most needed in problems of distribution, inheritance, partner-
ship, land measurement. . . ." Mathematics was studied by the Arabs mainly
to further the few sciences they pursued, not for its own sake. Nor did they
study science for its own sake. They were not interested in the Greek goal of
understanding the mathematical design of nature or in comprehending
God's ways, as were the medieval Europeans. The Arab objective, new in the
history of science, was power over nature. They thought they could achieve
that power through alchemy, magic, and astrology, which were an integral
part of their scientific eflort. This objective was taken up later by more
critical minds, who could distinguish science from pseudoscience and were
more profound in their approach.
r 3oo r97
arithmetic and algebra but had the audacity, reinforced by practical needs,
to develop these branches. Though they undoubtedly did not appreciate
what they were accomplishing, they took the only course mathematical
innovation can purrue. New ideas can come about only by the free and bold
pursuit ofheuristic and intuitive insights. The logical justification and correc-
tilr" -.ur.r..r, should the latter be needed, can be brought into play only
when there is something to logicize. Hindu and Arab venturesomeness
brought arithmetic and algebra to the fore once again and placed it almost
on a par with geometry.
Two independent traditions or concePts of mathematics had now be-
come established: on the one hand, the logical deductive body of knowledge
that the Greeks established, which served the larger purpose of under-
standing nature; and on the other, the empirically grounded, practically
oriented mathematics founded by the Egyptians and Babylonians, resus-
citated by some of the Alexandrian Greeks, and extended by the Hindus and
Arabs. The one favored geometry and the other arithmetic and algebra' Both
traditions and both goals were to continue to oPerate.
Bibliographg
Ball, W. W. R, : .d S/rlrt Account of ttu Hbtory of Matlumaties, Dover (reprint)r 1960,
Chap. 9.
'Berry, Arthur: A Short Histnry of Astrorumtr, Dover (reprint), 1961, pp' 7683'
Boyer, Carl B.: A History of Mathcmatits, Jobn Wiley and Sons, 1968, Chapo' l2-I3'
Cajori, Florian: A History of Matlumatics, Macmillan, l9l9' pp. 83-l 12'
Cantor, Moritz: Matlurnati*,2nd ed., B. G' Tcubner,
Vorlesungen ilbet Gcschichk der
1894, Johnson Reprint Corp', 1965, Vol' l, Chaps. 28-30, 32-37.
Coolidge, Julian L.: Thc Matlumatics of Grcat Amakws, Dover (reprint), 1963,
Chap. 2.
Datta, 8., and A. N. Sirrgh: History of Hindu Matlumatics,2 vols., Asia Publishing
House (reprint), 1962.
Dreyer, J. L.E.:AHistoryof Aslrorcng from Thalcs to Kcphr, flover (reprint), 1953,
Chap. I l.
Karpinski, L, C.: Robat of Ctustcr's Latin Translatian of tlu Algebra of al-Kluwoizmi,
Macmillan, l9l5' English version also.
Kasir, D. S.: "The Algebra of Omar Khayyam," Columbia University Teachers
College thesis, 1931.
O'Leary, De Lacy: How Grcck Scicuc Passcd to thc Arabs, Roudedge and Kcgan
Paul, 1949.
Pannekoel, A.: A History of Astronong, John Wiley and Sons, 1961, Chap. 15'
Scott,J, F.: A Hktory of Matlumatics, Taylor and Francis, 1958, Chap. 5.
Smith, David Eugene: History of Matlumatits, Dover (reprint)' 1958, Vol' 1,
pp. 13&-47, 152-92, 283-90'
Struik, D. J.: " Omar Khayyam, Mathematician," Tlu Matlumatits Teuha, 51,
1958,28H5.
IO
The Medieval Period in Europe
200
THE MATERIALS AVAILABLE FOR LEARNING 2OI
In the latter half of the eighth century some secular rulers founded
additionhl schools. In Charlemagne's empire schools were organized by
Alcuin of York (730-804), an Englishman who came to Europe at the
invitation of Charlemagne himself. These schools too were attached to
cathedrals or monasteries and emphasized Christian theology and music.
Ultimately the universities of Europe grew out of the church schools, with
teachers supplied by church orders such as the Franciscans and Dominicans.
Bologna, the first university, was founded in l0BB. The universities of Paris,
Salerno, Oxford, and Cambridge were established about the year 1200. Of
course at the outset these were hardly universities in the modern sense. Also,
though formally independent, they were essentially devoted to the interests
of the Church.
there any serious attempts to build mathematics. The reasons are of interest
to those who seek to understand under what circumstances mathematics can
flourish.
The primary reason for the low level of mathematics was lack of interest
in the physical world. Christianity, which dominated Europe, prescribed its
own goals, values, and way of life. The important concerns were spiritual,
so much so that inquiries into nature stimulated by curiosity or practical
ends were regarded as frivolous or unworthy. Christianity and even the later
Greek philosophers, the Stoics, Epicureans, and neo-Platonists, emphasized
lifting mind above flesh and matter and preparing the soul for an after-
Iife in heaven. The ultimate reality was the everlasting life of the soul; and
the soul's health was strengthened by learning moral and spiritual truths.
The doctrines of sin, fear of hell, salvation, and aspiration to heaven were
dominant. Since the study of nature did not help achieve such goals or
prepare for the afterlife, it was opposed as worthless and even heretical.
Where, then, did the Europeans secure any knowledge about the nature
and design of the universe and of man? The answer is that all knowledge was
derived from the study of the Scriptures. The creeds and dogmas of the
Church Fathers, which were amplifications and interpretations of the
Scriptures, were taken as the supreme authority. St. Augustine (35+430), a
very learned man and most influential in spreading neo-Platonism, said,
" Whatever knowledge man has acquired outside of Holy Writ, if it be
harmful it is there condemned; if it be wholesome it is there contained."
This quotation, though not representative of Augustine, is representative of
the early medieval attitude toward nature.
This brief sketch of the early medieval civilization, rather one-sided as
it is, because we have been concerned largely with its relation to mathe-
matics, may nevertheless give some idea of what was indigenous to Europe
and what Europe, building on a meager legacy from Rome, produced under
the leadership ofthe Church. Until 1100 the medieval period did not produce
any great culture in intellectual spheres. The characteristics ofits intellectual
state were an indistinctness of ideas, dogmatism, mysticism, and a reliance
upon authorities, who were constantly consulted, analyzed, and com-
mented on. The mystical leanings caused people to elevate vague ideas into
realities and even accept them as religious truths. What little theoretical
science existed was static. Theology embraced all knowledge and the Fathers
of the Church authored systems of universal knowledge. But they did not
conceive or search for principles other than those contained in Christian
doctrines.
The Roman civilization was unproductive in mathematics because it
was too much concerned with practical and immediately applicable results.
The civilization of medieval Europe was unproductive in mathematics for
exactly the opposite reason. It was not at all concerned with the physical
WORKS 2O5
THE FIRST REVIVAL OF TI{E GREEK
in North Africa. The republics of northern Italy and the papacy sent missions
and ambassadors to the Byzantine Empire and to Sicily, which was the
original home of famous Greek centers and which up to B7B had been under
Byzantine rule. In l0B5 Toledo was captured by the Christians and thus a
major center for the Arabic works was opened up to European scholars.
Sicily was taken from the Arabs by the Christians in l09I and the works
there became freely accessible. A search in Rome, which possessed Greek
works from the days of the Empire, unearthed more manuscripts.
As they secured these works the Europeans undertook more and more
to translate them into Latin. The twelfth-century translations from Greek
were, on the whole, not good because Greek was not well known. They were
dt aerbo ad aerbuml but they were better than the translations of Greek
works that had passed through Arabic, a language quite dissimilar to Greek.
Hence until well into the seventeenth century there was a steady output
of new, improved translations.
Thus Europe got to know the works of Euclid and Ptolemy, al-Khow-
6rizmi's Aithmeth and. Algebra, the Sphauica of Theodosius, many works of
Aristotle and Heron, and a couple of Archimedes' works, particularly his
Mcasarernent o;f aCircle. (The rest of his work was translated into Latin in 1544
by Hervagius of Basle.) Neither Apollonius nor Diophantus was translated
during the twelfth and thirteenth centuries. Works on philosophy, medicine,
science, theology, and astrology were also translated. Since the Arabs did
have almost all the Greek works, the Europeans acquired a tremendous
Iiterature. They admired these works so much and were so fascinated by the
novel ideas that they became disciples of Greek thought. They valued these
works far more than their own creations.
even began to pit their own reason against the authority of the Church'
Thus AJelard oi B.th said he would not listen to those who are "led in a
halter;.. . Wherefiore if you want to hear anything from me, give and take
reason."
Peculiarly enough, the introduction of some ofthe Greek works retarded
the awakening ofEurope for a couple ofcenturies' By 1200 or so the extensive
writings of Aistotle beca-e reasonably well known' The European intellec-
tuals iere pleased and impressed by his vast store of facts, his acute distinc-
tions, his cogent arguments, and his logical arrangement of knowledge'
The
was that he accepted those that appealed to the
defect in Aristotle's doctrines
almost without regard for their correspondence with experience' He
mind
offered concepts, theoriJs, and explanations, such as the doctrine of
basic
substances, the distinction between earthly and heavenly bodies (Chap' 7'
sec.3), and the emphasis on final cause, which had little basis in reality' or
new
were not fruitful. Since all of these doctrines were accePted uncritically,
ideas were either not entertained or failed to gain a hearing, and progress
a minor
was delayed. It was perhaps also a hindrance that Aristotle assigned
role to mathematics, certainly a role subordinate to physical explanation'
which, for Aristotle, was qualitative.
Thescientificworkoftheperiodfromaboutlt00to1450wasdoneby
the Scholastics, who espoused doctrines based on the authority of the
Christian Fathers and of Aristotle; the work suffered accordingly' Some of
the Scholastics revolted against the prevailing dogmatism and against the
insistence on the absolute correctness of Aristotle's science. one who felt
the
need for obtaining general principles from experimentation and for deduc-
tions in which mathematics would play a part and which could then be
testedagainstfactswasthenaturalphilosopherRobertGrosseteste(c.l168-
1253), Bishop of Lincoln.
th. -ort "loqrent Protester against authority, and one who had genuine
ideas to offer, was Roge. Bu"ot (1214-94), the Doctor Mirabilis'
He declared'
"If I had the power over the works of Aristotle, I would have them all
burned; for it is only a loss of time to study in them and a cause of error' and
a multiplicatio., of igrro.arrce beyond expression'" Bacon's enormous knowl-
idge covered the sciences of his time and many languages, including Arabic'
Lo"ng before they became widely known he was informed on the
latest
inveitions and scientific advances: gunpowder, the action oflenses' mechan-
ical clocks, the construction of the calendar, and the formation of the
rainbow.Heevendiscussedideasonsubmarines,airplanes,andautomobiles'
His writings on mathematics, mechanics, oPtics, vision, astronomy' geog-
raphy, chrJnology, chemistry, perspective, music, medicine, grammar' logic'
metaphysics, ethics, and theology were sound'
Wttut especially striking about Bacon is that he understood how
it
reliable knowled-ge is obtained. He inquired into the causes that produce
or
2oB THE MEDIEVAL PERIoD IN EURoPE
prevent the advance of science and speculated on the reform of the methods
of inquiry. Though he did recommend study of the Scriptures, he emphasized
mathematics and experimentation and foresaw great prospects that could be
realized through science.
Mathematical ideas, he affirms, are innate in us and identical with things
as they are in nature, for nature is written in the language ofgeometry. Hence
mathematics offers truth. It is prior to the other sciences because it takes
cognizance of quantity, which is apprehended by the intuition. He " proves "
in one chapter of his Oprs Majus that all science requires mathematics, and
his arguments show a just appreciation of the office of mathematics in
science. Though he stresses mathematics, he also has a full appreciation of the
role and importance of experimentation as a means of discovery and as a
test of results obtained theoretically or in any other way. "Argument con-
cludes a question; but it does not make us feel certain, or acquiesce in the
contemplation oftruth, except the truth also be found to be so by experience."
Bacon's OPus Majus has much on the usefulness of mathematics for
geography, chronology, music, the explanation of the rainbow, calen-
dar-reckoning, and the certification of faith. He also treats the role of
mathematics in state administration, meteorology, hydrography, astrology,
perspective, optics, and vision.
However, even Bacon was a product of his times. He believed in magic
and astrology and maintained that theology is the goal of all learning. He
was also a victim of his times : he ended up in prison, as did many other
intellectual leaders who had begun to assert the priority and independence
of human reason and the importance of observation and experimentation.
His influence on his age was not great.
William of Ockham (r. 1300-49) continued the weighty attacks on
Aristotle, criticizing Aristotle's views on causation. Final cause, he said, is
pure metaphor. All causes are immediate, and the total cause is the aggregate
of all the antecedents that suffice to bring about an event. This knowledge of
connections has a universal validity because of the uniformity ofnature. The
primary function of science is to establish sequences of observations. As for
substance, Ockham said, we know only properties, not a fundamental
substantial form.
He also attacked contemporary physics and metaphysics, saying that
knowledge gained from experience is real, whereas rational constructs are
not; they are merely invented to explain the observed facts. His famous
principle is " Ockham's razor " (already stated by Grosseteste and Duns
Scotus [266-1308]) : It is futile to work with more entities when fewer
suffice. He divorced theology from natural philosophy (science) on the
ground that theology derives knowledge from revelations whereas natural
philosophy should derive it from experience.
These dissenters did not suggest new scientific ideas. But they did press
PROPER 2O9
PROORESS IN MATHEMATICS
for freedom of speculation, thought, and inquiry and urged experience as the
source of scientific knowledge.
Figure 10.1
to that \ /hich the object already had. In rising bodies-for example, pro-
jectiles-the impetus given to the object was gradually decreased by the air
resistance and natural gravity. Impetus was what God gave to the celestial
spheres and these needed no heavenly agents to keep them running. Buridan
defined impetus as the quantity of matter multiplied by the velocityl hence,
in modern terms, it is momentum.
This new theory was remarkable for several reasons. By applying it to
heavenly and earthly motions, Buridan linked the two in one theory.
Furthermore, the theory implied that, contrary to Aristotle's law, a force
would alter motion and not just sustain it. Third, the concept of impetus
itself was a great advance; it transferred the motive power from the medium
to the moving object and also made possible the consideration of a void.
Buridan is one of the founders of modern dynamics. His theory gained wide
acceptance in his own century and for two centuries thereafter.
Perhaps projectile motion received this attention because by the thir-
teenth century improved weaponry, such as catapults, crossbows, and long-
bows, could throw projectiles over long and highly arched paths; a century
later cannonballs were in use. Aristotle had said that a body can move only
under one type of force at a time ; if two were acting, one would destroy the
other. Elence a body thrown up and out would move along a straight line
until the "violent" motion was spent and then the body would drop straight
to earth under a natural motion. Of the various revisions of this theory, the
idea ofJordanus Nemorarius proved most helpful; he showed that the force
under which a body thrown straight out moved at any instant could be
resolved into two components, natural gravity acting downward and a
"violent" horizontal force ofprojection. This idea was taken up by da Vinci,
Stevin, Galileo, and Descartes.
The Parisian school of Buridan and Oresme went on to consider not
only uniform but difform and uniformly difform motion and proved to its
own satisfaction that the effective velocity in uniformly difform motion was
the average of the initial and final velocities. Perhaps most significant about
the efforts in thirteenth- and fourteenth-century mechanics was the attempt
to introduce quantitative considerations and to replace qualitative by
quantitative arguments.
The major interest of the medieval scientists lay in the field of optics.
One reason for this is that the Greeks had laid a firmer foundation in what
we now call geometrical optics than in other physical fields; and by the late
medieval period their numerous works in optics were known in Europe. In
addition, the Arabs had made some advances over the Greeks. By 1200 some
of the basic laws of light were well known, including the straight-line motion
of light in a tiniform medium; the law of reflection; and Ptolemy's incorrect
law of refraction (he believed the angle of refraction was proportional to the
angle of incidence). Also the knowledge of spherical and parabolic mirrors,
STMMARY 2I3
spherical aberration, the pinhole camera, the uses of lenses, the functioning
of th" atmospheric refraction, and the possibility of magnification had
"y.,
passed on to Europe from the Greeks and the Arabs'
The scientists Grosseteste, Roger Bacon, Vitello (l3th cent'), John
Peckham (d. 1292), and Theodoric of Freiberg (d. c. l3l l) made advances
in light. Having learned of the relraction of light by lenses, they determined
the focal lengths of some lenses, studied combinations of lenses, suggested
magnification by combinations of lenses, and made improvements in the
theory of the rainbow. Glass mirrors were perfected during the thirteenth
century; spectacles date from 1299. Vitello observed the dispersion of light
under iefraction; that is, he produced colors from white light passed through
a hexagonal crystal. He also directed light through a bowl of water to study
the rainbow, since he had previously noted that when sunlight passes
through a bowl of water the colors of the rainbow appear in the emerging
light. bptics continued to be a major sciencel we shall find Kepler, Galileo,
Descartes, Fermat, Huygens, and Newton active in it.
9. SummarE
In science, as in other fields, the Middle Ages devoted itself to time-tested
and authoritative works. The schools produced diligent excerpting from older
manuscripts, summarizing, and commentaries' The spirit of the times forced
minds to iollow trusted, prescribed, and rigid ways. The search for a universal
philosophy covering all phenomena ofman, nature, and God is characteristic
of th" lut. medieval period. But the contributions suffered from an indistinct-
nessof ideas, mysticism, dogmatism, and a commentatorial spirit directed
toward the analysis of authorities.
Nevertheless, as the world gradually changed, a$'areness of discrepancy
and conflict between beliefs and overt facts grew stronger, and the need for a
revision of learning and beliefs grew clearer, Before Galileo demonstrated
the value of experiince, before Descartes taught people to look within them-
selves, and before Pascal formulated the idea of progress, it was the uncon-
ventional thinkers, largely the dissident Scholastics, who attemPted to
advance along new lines, to challenge established outlools, and to place
stronger reliance upon observation of nature than had the Greeks'
Experimentation, motivated in part by the search for magical Poweni,
and the use ofinduction to obtain general principles or scientific laws began
to be important sources of knowledge, despite the fact that the major
scientific method of the late Middle Ages was rational explanation, presented
in a formal or geometrical demonstration based on a priori principles'
The value of mathematics in the investigation of nature also received
some recognition. Though on the whole the medieval scientists followed
2t4 TIIE MEDIEVAL PERIOD IN EUROPE
Bibliograplry
Short Account of the History of Matlumatits, Dover (reprint), 1960,
Ball, W. W. R. : ,ll
Chaps. 8, 10, I l.
Boyer, Carl B.: A History of Matlumatics, lohn Wiley and Sorx, 1968, Chap. 14.
Cajori, Florian: A History of Mathematics, Macmillan, 1919, pp. I 13-29.
Clagett, Marshall : Tlu Scizme of Mechani.cs in the Middle lgcs, University of Wis-
consin Press, 1959.
Nirole Oresme and the Geomctry of Qualities and Motions, University of Wis-
consin Press, 1968.
-'t
Crombie, A. C.: Augustine to Galileo, Falcon Press, 1952, Chaps. l-5.
Robert Grosseleste and ilu Oigitu of Expuimcntal Scizrce, Oxford University
Press, 1953.
-i
BIBLIOGRAPITT 2r5
EastoD, stewart z Rogcr Buon awl His searchfor a uni:uersal science, columbia univer-
sity Press, 1952'
Hofmann, J. D.t Ttu History oJ Mattunatics, Philosophical Library, 1957' Chaps'
s4.
Smith, David Eugene: Hi;tory oJ Matlumalits, Dover (reprint), 1958, Vol' I'
pp. 177-265.
II
The Renaissance
z16
REVOLUTTONARY INFLUENCES IN EUROPE 217
repeated; they were but the reciters and trumpeters of other people's
learning. He also criticized the concepts, methods, and goals of the bookish
scholars because they did not deal with the real world. He almost boasted
that he was not a man of letters and could do bigger and better things by
learning from experience. And indeed he learned for himself many facts of
mathematics, some principles of mechanics, and the laws of equilibrium of
the lever. He made remarkable observations on the flight of birds, the flow
of water, the structure of rocks, and the structure of the human body. He
studied light, color, plants, and animals. Famous are his words, " If you do
not rest on the good foundations of nature you will labor with little honor
and less profit." Experience, he says, is never deceptive though ourjudgment
may be. " In the study of the sciences which depend upon mathematics,
those who do not consult nature but authors, are not the children of nature
but only her grandchildren. "
Leonardo did believe in the combination of theory and practice. He
says, " He who loves practice without theory is like the sailor who boards
ship without a rudder and compass and never knows where he may be cast."
On the other hand, he said, theory without practice cannot survive and dies
as quickly as it lives. " Theory is the general; experiments are the soldiers,"
He wished to use theory to direct experiments.
Nevertheless, Leonardo did not fully grasp the true method of science.
In fact, he had no methodology, nor any underlying philosophy. His work
was that of a practical investigator of nature, motivated by aesthetic drives
but otherwise undirected. He was interested in and sought quantitative
relationships and in this respect was a forerunner ofmodern science. Ilowever,
he was not as consciously quantitative as Galileo. While his writings on
mechanics and science were used by such sixteenth-century men as Cardan,
Baldi, Tartaglia, and Benedetti, they were not the stimulus for Galileo,
Descartes, Stevin, and Roberval.
Leonardo's views on mathematics and his working knowledge and use
of it were peculiar to his time and illustrate its spirit and approach. Reading
Leonardo one finds many statements suggesting that he was a learned mathe-
matician and profound philosopher who worked on the level of the profes-
sional mathematician. He says, for example, "The man who discredits the
supreme certainty of mathematics is feeding on confusion, and can never
silence the contradictions of sophistical sciences, which lead to eternal
quackery . . . for no human inquiry can be called science unless it pursues its
path through mathematical exposition and demonstration." To pass beyond
observation and experience there was for him only one trustworthy road
through deceptions and mirages-mathematics. Only by holding fast to
mathematics could the mind salely penetrate thc labyrinth of intangible and
insubstantial thought. Nature works through mathematical laws and
nature's forces and operations must be studied through quantitative investi-
THE CLAMOR FOR THE REFORM OF SCIENCE 225
themselves, while men on their side must force themselves for a while to lay
their notions by and begin to familiarize themselves with the facts."
Bacon did not realize that science must measure so as to obtain quanti-
tative laws. He did not see, in other words, what kinds of gradual inquiries
were needed and the order in which they must be taken. Nor did he appreci-
ate the inventive genius that all discovery requires. In fact, he says that
" there is not much left to acuteness and strength of genius, but all degrees
of genius and intellect are brought to the same level."
Though he did not create it, Bacon issued the manifesto for the experi-
mental method. He attacked preconceived philosophical systems, brain
creations, and idle displays of learning. Scientific work, he said, should not
be entangled in a search for final causes, which belongs to philosophy. Logic
and rhetoric are useful only in organizing what we already know. Let us
close in on nature and come to grips with her. Let us not have desultory,
haphazard experimentation; Iet it be systematic, thorough, and directed.
Mathematics is to be a handmaiden to physics. In aII, Bacon offered a
fascinating program for future generations.
Another doctrine and program is associated with Francis Bacon,
though it antedates him. The Greeks had, on the whole, been content to
derive from their mathematics and science an understanding of nature's
ways. The few early medieval scientists and the Scholastics studied nature
largely to determine the final cause or purpose served by phenomena.
However, the Arabs, a more practical people, studied nature to acquire
power over it. Their astrologers, seers, and alchemists sought the elixir of life,
the philosopher's stone, methods of converting less useful to more useful
metals, and magic properties of plants and animals, in order to prolong man's
life, heal his sicknesses, and enrich him materially. While these pseudo-
sciences continued to flourish in medieval times, some of the more rational
Scholastics-for example, Robert Grosseteste and Roger Bacon-began to
envision the same goal, but by more proper scientific investigations. Thanks
to Francis Bacon's exhortations, the mastery of nature became a positive
doctrine and a pervasive motivation.
Bacon wished to put knowledge to use. He wanted to command nature
for the service and welfare of man, not to please and delight scholars. As he
put it, science was to ascend to axioms and then descend to works. In Tiz
Neu Atlantis Bacon describes a society of scholars provided with space and
equipment for the acquisition of useful knowledge. He foresaw that science
could provide man with "infinite commodities," " endow human life with
inventions and riches, and minister to the conveniences and comforts of
man." These, he says, are the true and lawful goals of science.
Descartes, in his Discourse on Method, echoed this thought:
It is possible to attain knowledge which is very useful in life, and irxtead
of that Speculative Philosophy which is taught in the Schools, we may
THE RIIIE OF EMPIRICISM 227
find a practical philosophy by means of which, knowing the force and the
action of fire, water, air, the stars' the heavens, and all other bodies that
environ us, as distinctly as we know the different crafts ofour artisans, we
can in the same way employ them in all those uses to which they are
adapted, and thus render ourselves the masters and possessors of nature'
The chemist Robert Boyle said, "The good of mankind may be much
increased by the naturalist's insight into the trades."
The challenge thrown out by Bacon and Descartes was quickly accepted,
and scientists plunged optimistically into the task of mastering as well as
understanding nature. These two motivations are still the major driving
forces, and indeed the interconnections between science and engineering
grew rapidly from the seventeenth century onward.
- This program was taken uP most seriously even by governments' The
French Academy of Sciences, founded by Colbert in 1666, and the Royal
Society of London, founded in 1662, were dedicated to the cultivation of
"such knowledge as has a tendency to use" and to making science " useful
as well as attractive,"
\ilorked rather than speculated, dealt with particulars rather than generalities,
and added to science instead of defining it or proposing how to obtain it. In
physics, the plastic arts, and technical fields generally, experience rather
than theory and speculation became the new source of knowledge.
Coupled with the pure empiricism of the artisans and, indeed, in part
suggested by the problems they presented, was the gradual rise of systematic
observation and experimentation, carried out largely by a more learned
group. Greeks such as Aristotle and Galen had observed a great deal and had
discussed the inductions that might be made on the basis of observations; but
one cannot say that the Greels ever possessed an experimental science. The
Renaissance activity, very modest in extent, marks the beginning of the now
vast scientific enterprise. The most significant groups of Renaissance experi-
mentalists were the physiologists led by Andreas Vesalius (1514-64), the
zoologists led by Ulysses Aldrovandi (1522-1605), and the botanists led by
Andrea Cesalpino (1519-1603).
fn the area of tJre physical sciences, the experimental work of William
Gilbert (1540-1603) on magnetism was by far the most outstanding. In his
famous De Magnetc ( 1600), he stated explicitly that we must start from
e:rperiments. Though he respected the ancients because a stream of wisdom
came from them, he scorned those who quoted others as authorities and did
not experiment or verifr what they were told. His series of cardully con-
ducted, detailed, and simple experiments is a classic in the experimental
method. Ife notes, incidentally, that Cardan, in De Rerum Varietatz, had,
described a perpetual-motion machine and comments, " May the gods
damn all such sham, pilfered, distorted worls, which do but muddle the
minds of tlre students."
We have been pointing out the variegated practical interests that led
to a vast expansion of the study of nature and the consequent impulse to
systematic experimentation. Side by side with this practical work, largely
independent but not oblivious of it, some men pursued the larger goal of
science-the understanding of nature. The work of the later Scholastics on
Iialling bodies, described in the preceding chapter, was continued in the
sixteenth century. Their predominant goal was to secure the basic laws of
motion. The work on projectile motion, often described as a response to
practical needs, was motivated far more by broad scientific interests in '
mechanics. The work of Copernicus and Kepler in astronomy (Chap. 12,
sec. 5) was certainly motivated by the desire to improve astronomical theory.
Even t}re artists of the Renaissance sought to penetrate to the essence of
reality.
Fortunately technicians and scientists began to recognize common
interests and to appreciate the assistance each could derive from the other.
The technicians of the fifteenth and sixteenth centuries, the early engineers
who had relied on manual dexterity, mechanical ingenuitn and sheer
THE RISE OE EMPIRICISM 229
inventiveness and cared little for principles, became aware of the aid to
practice they could secure from theory. on their side scientists became aware
thrt the artisans were obtaining facts of nature that correct theory had to
comprehend, and that they could secure pregnant suggestions for investiga-
tion from the work.of the artisans. In the opening paragraph of his Dialogrus
conceming Two New scfunces (the sciences are strength of materials and the
theory oi motion), Galileo acknowledges this inspiration for his investiga-
tions. "The constant activity which you Venetians display in your famous
arsenal suggests to the studious mind a large field for investigation, especially
that part oi the work which involves mechanics; for in this department all
types of instruments and machines are constantly being constructed by many
#irurrr, among whom there must be some who, partly by inherited experi-
ences and partiy by their own observations, have become highly expert and
clever in explanation."
The practical and purely scientific interests were fused in the swen-
teenth century. When the larger principles and problems had emerged from
the empirical needs, and the mathematical knowledge of the Greeks had
b."o-. frUy available to the scientists, the latter were able to proceed more
effectively with pure science. Without losing sight of the goal of under-
standing ihe design ofthe universe, they also willingly sought to aid practice.
The outcome was an expansion in scientific activity on an unprecedented
scale, plus far-reaching and weighty technical improvements that culrninated
in the Industrial Revolution'
The great importance of the beginnings of modern science for us is, of
course, th;t it paved the way for the major developments in mathematics'
Its immediate *ur involvement with concrete problems. since the
"ffa"t
Renaissance mathematicians worked for the rePublics and Princes and
collaborated with architects and handworkers-Maurolycus was an engineer
for the city of Messina, Baldi was a mathematician for the Duke of Urbino,
Benedetti was the chiefengineer for the Duke of savoy, and Galileo was court
mathematician for the Grand Duke of Tuscany-they took up the observa-
tions and experiences of the practical people. Up to the time of Galileo, the
impact of the technicians and architects can be seen largely in the work of
Nicold Tartaglia (1506-57), a genius who was self-taught in the science
of his time. T.rtaglia made the transition from the practical to the learned
mathematician, singting out with discernment useful problems and obser-
vations from empirical knowledge. His uniqueness lies in this achievement
and in his complete independence from the magical influences that char-
acterizethe work of his rival Cardan. Tartaglia's position is midway between
Leonardo and Galileo-not merely chronologically, but because his work on
the mathematics of dynamical problems raised that subject to a new science
and influenced the forerunners of Galileo.
The long-range effect was that modern mathematics, guided by the
230
Platonic doctrine that it is the essence of reality, grew almost entirely out of
the problems of science. Under the new directive to study nature and to
obtain laws embracing observations and experimental results, mathematics
broke away from philosophy and became tied to physical science. The
consequence for mathematics was a burst of activity and original creation
that was the most prolific in its history.
Bibliograplry
Ball, W. W. R.: I Short Account of the History of Matlumati.cs, Dover (reprint), 1960,
Chaps. 12-13.
Burtt, E. A.z The Mclaphgsiral Foundations of Modtrn Phgsical Scicnce, Routledge and
Kegan Paul, 1932.
Butterfeld, Herbert: Tlu Origins oJ Modem Scierce, Macmillan, 1951, pp. l-47,
Cajori, Florian: A Histnry of Matlumatics,2nd ed., Macmillan, 1919, pp. 128-45.
Cardano, Gerolamo: Opera Omnia, Johnson Reprint Corp., 1964.
Tlu Book of My L;,fe, Dover (reprint), 1962.
Tlu Book on Gamcs of Chanre, Holt, Rinehart and Winston, 1961.
Clagett, Marshall, ed.: Critiral Problens in thc History of Sci.erce , University of Wis-
-:
-: consin Press, 1959, pp. 3-196.
Crombie, A. C.t Augustiru ta Galileo, Falcon Press, 1952, Chaps. 5-6.
Robeil Grosseteste and tlu Origins of Expai.mcntal Scbntc, Oxford University
Press, 1953, Chap. I l.
Dampier-Whetham,
-: W. C. D.: A History oJ Scicnre, Cambridge University Press,
1929, Chap. 3.
Farrington, B.: Francis Bacon, }ler:ry Schuman, 1949.
Mason, S. F.z A Histary of tlu Scieues, Routledge and Kegan Paul, 1953, Chaps.
13, 16, 19, and 20.
Ore, O.: Cardano : Tlu Ganbling Scholar, Princeton University Press, 1953.
Randall, John H., Jr.: Tlu Making of ilu Moden Mind, Houghton Miffiin, 1940,
Chaps. G-9.
Russell, Bertrand: A Histnry of Wcstern Philosophy, Simon and Schuster, 1945,
pp. 491-557.
Smith, David Eugene: History of Mathcmatics, Dover (reprint), 1958, Vol. l,
pp.24245, and Chap. 8.
Smith, Preserved: A History oJ Moden Culture, Holt, Rinehart and Winston, 1940,
Vol. l, Chaps. 5-6.
Strong, Edward '\N.: Procedurcs and Metaplrysics, University of California Press,
1936; reprinted by Georg Olms, 1966, pp. l-134.
Taton, Ren6, ed,z The Beginnings of Modern Science, Basic Books, l9&1, pp. 3-51,
pp. 82-177.
Vallentin, Antonina: Itonardo da Vinci, Viking Press, 1938.
White, Andrew D . z A History of tlu Warfare of Science with Theology , George Braziller
(reprint), 1955.
t2
Mathematical Contributions
in the Renaissance
The chiefaim ofall investigations of the external world should
be to discover the rational order and harmony which has
been imposed on it by God and which He revealed to us in
the language of mathematics. JoHANNES xEPLER
l. Paspectiae
Though the Renaissance men grasped only dimly the outlooks, values, and
goals of the Greek works, they did take some original stePs in mathematics;
and they made advances in other fields that paved the way for the tremen-
dous seventeenth-century uPsurge in our subject.
The artists were the first to manifest the renewal of interest in nature
and to apply seriously the Greek doctrine that mathematics is the essence
of nature's reality. The artists were selfl-taught and learned through practice.
Fragments of Greek knowledge filtered down to them, but on the whole
they sensed rather than grasped the Greek ideas and intellectual outlook'
To an extent this was an advantage because, lacking formal schooling, they
were free ofindoctrination. Also, they enjoyed freedom of expression because
their work was deemed " harmless."
The Renaissance artists were by profession universal men-that is,
they were hired by princes to perform all sorts of tasks from the creation of
great paintings to the design of fortifications, canals, bridges, war machines,
palaces, public buildings, and churches. Hence they were obliged to learn
mathematics, physics, architecture, engineering, stonecutting, metalworking,
anatomy, woodworking, optics, statics, and hydraulics. They performed
manual work and yet tackled the most abstract problems. In the fifteenth
century, at least, they were the best mathematical physicists'
To appreciate their contribution to geometry we must note their new
goals in painting. In the medieval period the glorification of God and the
illustration of biblical themes were the purposes of painting. Gilt back-
grounds suggested that the people and objects portrayed existed in some
23r
232 MATHEMATICAL CONTRIBUTIONS IN THE RENAISSANCE
heavenly region. Also the figures were intended to be symbolic rather than
realistic. The painters produced forms that were flat and unnatural and did
not deviate from the pattern. In the Renaissance the depiction of the real
world became the goal. Hence the artists undertook to study nature in order
to reproduce it faithfully on their canvases and were confronted with the
mathematical problem of representing the three-dimensional real world on
a two-dimensional canvas,
Filippo Brunelleschi (1377-1446) was the first artist to study and em-
ploy mathematics intensively. Giorgio Vasari ( l5I l-74), the Italian artist
and biographer, says that Brunelleschi's interest in mathematics led him to
study perspective and that he undertook painting just to apply geometry.
He read Euclid, Hipparchus, and Vitello on mathematics and optics and
learned mathematics from the Florentine mathematician Paolo del Pozzo
Toscanelli (1397-1482). The painters Paolo Uccello (1397-1475) and
Masaccio ( l40l-28) also sought mathematical principles for a system of
realistic perspective.
The theoretical genius in mathematical perspective was Leone Battista
Alberti (1404-72), who presented his ideas in Della pittura (1435), printed
in l5l l. This book, thoroughly mathematical in character, also includes
some work on optics. His other important mathematical work is Ludi
matlumatici (1450), which contains applications to mechanics, surveying,
time-reckoning, and artillery fire. Alberti conceived the principle that
became the basis for the mathematical system of perspective adopted and
perfected by his artist successors. He proposed to paint what one eye sees,
though he was well aware that in normal vision both eyes see the same
scene from slightly different positions and that only through the brain's re-
conciliation of the two images is depth perceived. His plan was to further
the illusion of depth by such devices as light and shade and the diminution
of color with distance. His basic principle can be explained in the follow-
ing terms. Between the eye and the scene he interposed a glass screen
standing upright. He then imagined lines of light running from the eye or
station point to each point in the scene itself. These lines he called a pyramid
of rays or a projection. Where these rays pierced the glass screen (the picture
plane), he imagined points marked out; the collection of points he called a
section. The significant fact about it is that it creates the same impression
on the eye as the scene itselt because the same lines of light come from the
section as from the original scene. Hence the problem ofpainting realistically
is to get a true section onto the glass screen or, in practice, on a canvas. Of
course the section depends upon the position of the eye and the position of
the screen. This means merely that different paintings of the same scene
can be made.
Since the painter does not look through his canvas to determine the
section, he must have rules based on mathematical theorems, which tell
233
him how to draw it. Alberti furnished some correct rulesl in his Della
littura, b:ut did not give all the details. He intended his book as a summary
to be supplemented by discussions with his fellow painters and, in fact,
apologizes for his brevity. He tried to make his material concrete, rather
than formal and rigorous, and so.gave theorems and constructions without
proof.
Beyond introducing the concepts of projection and section, Alberti
raised a very significant question. If two glass screens are interposed between
the eye and the scene itself, the sections on them will be different. Further,
if the eye looks at the same scene from two different positions and in each
case a glass screen is interposed between the eye and the scene, again the
sections will be different. Yet all these sections convey the original figure.
Hence they must have some properties in common. The question is, What
is the mathematical relation between any two of these sections, or what
mathematical properties do they have in common ? This question became
the starting point of the development of projective geometry.
Though a number of artists wrote books on mathematical perspective
and shared Alberti's philosophy of art, we can mention here only one or
two leaders. Leonardo believed that painting must be an exact reproduction
of reality and that mathematical perspective would permit this. It was "the
rudder and guide rope of painting" and amounted to applied optics and
geometry. Painting for him was a science because it reveals the reality in
nature; for this reason it is superior to Poetry, music, and architecture'
Leonardo's writings on perspective are contained in his Ttattato della pittura
(1651), compiled by some unknown author who used the most valuable of
Leonardo's notes on the subject.
The painter who set forth the mathematical principles of perspective
in fairly complete form is Piero della Francesca (c. 1410-92)' He, too, re-
garded perspective as the science of painting and sought to correct and
extend empirical knowledge through mathematics. His main work, De
prospettila pingendi (1482-87), made advances on Alberti's idea of projection
and section. In general, he gives procedures useful to artists and his direc-
tions employ strips of paper, wood, and the like. To help the artist he, like
Alberti, gives intuitively understandable definitions' He then offers theorems
which he " demonstrates " by constructions or by an arithmetical calcula-
tion of ratios. FIe was the painter-mathematician and the scientific artist
par excellence, and his contemporaries so regarded him. He was also the
best geometer of his time.
However, of all the Renaissance artists, the best mathematician was the
German Albrecht Diirer (1471-1528). His [Jnderweysang der Mesrung mid d'em
l. For some of these rules, and paintings constructed in accordance with them, see the
awthor's Mathematits in Western Culturz, Oxford University Press, 1953.
234 MATTTEMATTCAL CONTRTBUTIONS rN THE RENAIIIIiANCE
2. Geometry Proper
The developments in geometry aPart from perspective during the fifteenth
and sixteenth centuries were not impressive. One of the geometric toPics
discussed by Diirer, Leonardo, and Luca Pacioli (c. 1445-c. l5l4), an
Italian monk who was a pupil of Piero della Francesca and a friend and
teacher of Leonardo, was the inscription of regular polygons in circles'
These men attempted such constructions with a straightedge and a compass
of fixed opening, a limitation already considered by the Arab Abir'l-WefA,
but they gave only approximate methods.
The construction of the regular Pentagon was a problem of great in-
terest because it arose in the design of fortifications. In the Elerncnts, Book
IV, Proposition I l, Euclid had given a construction not limited by a com-
pass of fixed opening. The problem of giving an exact construction with this
limitation was tackled by Tartaglia, Ferrari, Cardan, del Monte, Bene-
detti, and many other sixteenth-century mathematicians. Benedetti then
broadened the problem and sought to solve all Euclidean constructions with
a straightedge and a compass of fixed opening. The general problem was
solved by the Dane George Mohr (1640-97) in his Compendium Erclidis
Curiosi (1673).
Mohr also showed, in his Euclides Danicus (1672), that the constructions
that can be performed with straightedge and compass can be performed
with only a compass. Of course without a straightedge, one cannot draw
the straight line joining two points; but given the two points, one can con-
struct the points ofintersection ofthe line and the circle, and given two pairs
of points, one can construct the point of intersection ol the two lines deter-
mined by the two pairs. The fact that a comPass alone suffices to perform
GEOMETR.Y PROPER 235
For this true ratio to hold on Mercator's straightJine map, where longitude
lines are equally spaced and each minute of change equals 6087 feet, he
increases the spaces between latitude Iines by the factor l/cos Z as latitude
Z increases. At 20' latitude on his rnap a l' change in latitude equals a
distance of 6087 (l/cos 20'), or 6450 feet. Thus at latitude 20'
l' change in longitude 6087
-T-EandE-Etitra; - 6m'
and this ratio equals the true rutio of 572216087.
The Mercator map has several advantages. Only on this projection are
two points on the map at correct compass course from one another. Then a
course of constant compass bearing on the sphere, that is, a curve called a
loxodrome or rhumb line, which cuts all meridians at the same angle,
becomes a straight line on the map. Distance and area are not preserved;
in fact, the map distorts badly around the poles. However, since direction
is preserved, so is the angle between two directions at a point, and the map
is said to be conformal.
Though no large new mathematical ideas emerged from the sixteenth-
century work on map-making, the problem was taken over later by mathe-
maticians and led to work in differential geometry.
3.Algebra
Up to the appearance of Cardan's Ars Magna (1545)' which we shall treat
in the next chapter, there were no Renaissance developments of any conse-
quence in algebra. However, the work of Pacioli is worth noting. Like most
others of his century he believed that mathematics is the broadest systematic
learning and that it applies to the practical and spiritual life of all people.
He also realized the advantages of theoretical knowledge for practical work.
Theory must be master and guide, he tells the mathematicians and tech-
nicians. Like Cardan he belonged to the humanist circle. Pacioli's major
publication is the Summa de Arithmetica, Geometria, Proportione et Prolortionalita
(1a94). The Summa was a compendium of the available knowledge and was
representative of the times because it linked mathematics with a great
variety of practical applications.
The contents covered the Hindu-Arabic number symbols, which were
already in use in Europe, business arithmetic, including bookkeeping, the
algebra thus far created, a Poor summary of Euclid's Elements' and some
TR.IGONOMETRY
237
4. Trigonometry
Until 1450, trigonometry was largely spherical trigonometry; surveying
continued to use the geometric methods of the Romans' About that date
plane
-pisa
trigonometry became important in surveying, though Leonardo of
in hii practica Geometriae (1220) had already initiated the method.
New work in trigonometry was done by Germans of the late fifteenth
and early sixteenth centuries, who usually studied in Italy and then returned-
to their native cities. At the time Germany had become ProsPerous, some of
the wealth having been acquired by the Hanseatic League of North Ger-
many, which conlrolled much trade; hence merchant Patrons were able
to srrpport the work of many of the men we shall mention' The trigonometric
work- was motivated by navigation, calendar-reckoning, and astronomy'
238 MATTTEMATTCAL coNTRrBUTroNs rN THE R!,NArssANcE
I
I
l/
Figure l2.l C
The De Triangulis was not published until 1533; in the meantime' Johann
Werner(1468-1528)improvedandpublishedRegiomontanus'sideasin
De Tiangulis SPhaeticis (1514).
Fo, L.rry y"urs .fie. the work of Regiomontanus spherical
trigonom-
etry continuei io be troubled by the need for multitude of formulas'
-a Copernicus a
partly becarrse Regiomontanus in his De Triangulis, and even
^"..rrr.y later, useJ only the sine and cosine functions' Also the negative
lrul.r", of the cosine and tangent functions for obtuse angles were not recog-
nized as numbers.
Rhaeticus, who was a pupil of Copernicus, changed the meaning of
sine. Instead oi rp"ukirrg of rif lfig. 12. 1) as the sine of lD,
he spoke of'tlB
expressed in
as the sine of arril. AO-8. How.*ei, the length of AB was still
a number of unil dependent on the number of units chosen for the length
of the radius. As a consequence of Rhaeticus's change, the triangle- OIB
becamethebasicstructure.andthecirclewithradiusoz{incidental.Rhaeti-
cus used all six functions.
Plane and spherical trigonometry were further systematized and- ex-
tended slightly by Frarrqois Vieta, a lawyer by profession but recognized
far more as the foremost mathematician of the sixteenth century. His
Canon Mathematicus (1579) was the first of his many works
on trigonometry'
Here he gathered together the formulas for the solution of right and oblique
plane triingles, including his own contribution, the law of tangents:
a-b -"(+1
;T i,
-"(H
to
For spherical right triangles he gave the complete set of formulas needed
.rry oi. part in termslf two other known parts, and the rule for
"alc.r'Iut"
240 MATHEMATICAL CONTRIBUTIONS IN THE RENAISSANCE
sinl - sin B : z
B
"o"Lj! "inA,
and identities for sin zd and cos z0 in terms of sin d and cos d. The latter
identities are contained in his Sectiones Angulares, published posthumously in
1615.2 Vieta expressed and worked with the identities in algebraic form,
though the notation was by no means modern.
He used the formula for sin z0 to solve the problem posed by the
Belgian mathematician Adrianus Romanus (1561-1615) in his book ldeae
Mathematicae (1593) as a challenge to all Frenchmen. The problem was to
solve an equation of the forty-fifth degree in .r. Henry IV of France sent
for Vieta, who recognized that the problem amounted to this : Given the
chord of an arc,.to find the chord of the forty-fifth part of that arc. This is
equivalent to expressing sin 451 in terms of sin I and finding sin l. If
# : sin I then the algebraic equation is of the forty-fifth degree in .t. Vieta
knew that problem could be solved by breaking this equation up into a
fifth-degree equation and two third-degree equations; these he solved
quickly. IIe gave the 23 positive roots but ignored the negative ones. fn
his Rzsponsum (1595)3 he explained his method of solution.
In the sixteenth century, trigonometry began to break away from
astronomy and acquire status as a branch of mathematics. The application
to astronomy continued to be extensive, but other applications-as, for
example, surveying-warranted the study of the subject from a more de-
tached viewpoint.
though there were some exceptions. The rise of algebra was, at least at the
outsJ, a continuation of Arab lines of activity; and some new work in
geometry was suggested by problems raised by artists'
By iut tt ,"*t significant Renaissance development in motivating the
"
mathematics of the next two centuries was the revolution in astronomy
led
the Greek works became available' after
by Copernicus and Kepler. When
ubort i200, both Aristotle's astronomical theory (a modification of Eudoxus')
and Ptolemy's theory became widely known and were pitted against each
other. Stric;ly speaking, various additions to both schemes had been intro-
duced by the Arabs and the Iate medieval astronomers to improve the
accuracyofbothsystemsortoaccommodateAristotle'sschemetoChristian
theology. Ptolemy;s scheme, reasonably accurate for the time, was
purely
mathematical and hence regarded only as a hypothesis, not a description
of real structures. Aristotle's iheory was the one most men accepted,
though
Ptolemy's was more useful for astronomical predictions, navigation' and
calendar-reckoning.
A few Arab, late medieval, and Renaissance figures, including-al-
Birtrni (973-1048), Oresme, and Cardinal Nicholas of Cusa (1401-64)'
perhaps responding to Greek ideas, seriously considered that the earth
mighi be .oiatirg and that it might be equally possible to build an astro-
,roili"ul theory on the basis of the earth moving around the sun' but none
worked out a new theory.
Among astronomers, Nicholas Copernicus suddenly appeared as a
colossus. Bo.n i., Thorn, Poland , in 1473' Copernicus studied mathematics
and science at the University of Cracow. At the age of twenty-three he went
to Bologna to further his studies and there became familiar with Pythagorean
and oti'er Greek doctrines, including astronomical theory' He also studied
medicine and church law. In t5l2 he returned to Poland to become canon
(an administrator) at the Cathedral of Frauenberg where he remained
untilhisdeathin1543.Whileperforminghisdutieshedevotedhimselfto
intensive studies and observations that culminated in a revolutionary astro-
nomical theory. This achievement in the domain of thought outclasses in
significance, boldness, and grandeur the conquest of the seas'
It is difficult to determine what caused Copernicus to overthrow the
fourteen-hundred-year-o1d Ptolemaic theory' The indications
in the preface
ofhisclassicwork,DaReuolutionibusorbiumCoelcstiulz(ontheRevolutions
of the Heavenly Spheres, 1543) are incomplete and somewhat enigmatic'
Copernicus states tiat he was aroused by divergent views on the
accuracy
that the Ptolemaic theory was just a
of ihe Ptole*aic system, by the view
conflict between adherents of the Aristo-
convenient hypothesis, and by the
telian and Ptolemaic theories'
Copernicus retained some principles of Ptolemaic astronomy' He
used
the circie as the basic curve on which his explanation of the motions of the
242 MATIIEMATICAL CONTRTBUTTONS rN THE RENATSSANCE
largest radius was that of the orbit of Saturn. In a sphere of this radius he
supposed a cube to be inscribed. In this cube a sphere was inscribed whose
radius should be that of the orbit ofJupiter. In this sphere he supposed a
tetrahedron to be inscribed and in this, in turn, another sphere, whose
radius was to be that ofthe orbit of Mars, and so on through the five regular
solids. This allowed for six spheres, just enough for the number of planets
then known. Flowever, the deductions from this hypothesis were not in
accord with observations and he abandoned the idea, but not before he
had made extraordinary efforts to apply it in modified form.
Although the attempt to use the five regular solids to ferret out nature's
secrets did not succeed, Kepler was eminently successful in later efforts to
find harmonious mathematical relations. His most famous and important
results are known today as Kepler's three laws of planetary motion. The
first two were published in a book of 1609 bearing a long title, which is
sometimes shortened to Astronomia Nooa and sometimes to Commentaries on the
Motions oJ Mars.
The first law states that the path of each planet is not the resultant of
a combination of moving circles but is an ellipse with the sun at one focus
(Fig. 12.2). Kepler's second law is best understood in terms of a diagram
(Fig. 12.3). The Greeks, we saw, believed that the motion of a planet must
be explained in terms of constant linear speeds. Kepler, like Copernicus, at
first held firmly to the doctrine of constant speeds. But his observations
compelled him to abandon this cherished belief too. His joy was great when
he was able to replace it by something equally attractive, for his conviction
that nature follows mathematical laws was reaffirmed . If MM' and imf'
are distances traversed by a planet in equal intervals of time, then, according
to the principle of constant speed,, MM'and .lmf'would have to be equal
distances. However, according to Kepler's second law, MM' and NN' are
generally not equal but the areas SMM'and Sr'[/['are equal. Thus Kepler
replaced equal distances by equal areas. To wrest such a secret from the
planets was indeed a triumph, for the relationship described is by no means
as easily discernible as it may appear to be on paper.
Kepler made even more extraordinary eflorts to obtain the third law of
motion. This law says the square of the period of revolution of any planet is
equal to the cube of its average distance from the sun, provided the period
of the earth's revolution and its distance from the sun are the units of time
MAJOR SCTENTTFTC PROGRESS rN THE RENATSSANCE 245
and distance.4 Kepler published this result in The Hannony of the world
(161e).
' ("pl"r's work is far more revolutionary than Copernicus's; equally
daring in adopting heliocentrism, Kepler broke radically from authority
urrd by utltiring the ellipse (as opposed to a composition ofcircular
tlditio,
motions) and nonuniform velocities. He hewed firmly to the position that
scientific investigations are independent of all philosophical and theo-
logical doctrines, that mathematical considerations alone should determine
th"e wisdom of any hypothesis, and that the hypotheses and deductions
from them must stand the test of empirical confirmation'
The work of Copernicus and Kepler is remarkable on many accounts'
but we must confine ourselves to its relevance to the history of mathematics'
In view of the many serious counter-arguments against a heliocentric theory,
their work de-orrtrates how strongly the Greek view that the truths of
nature lie in mathematical laws had already taken hold in Europe'
There were weighty scientific objections, many of which had already
been advanced by Ptolemy against Aristarchus' suggestion' How could
such
a heavy body as ihe be set into and kept in motion ? The other planets
"r.th
were in motion, even according to Ptolemaic theory, but the Greeks and
medieval thinkers had maintained that these were composed of some special
light substance. There were other objections' Why, if the earth rotates from
w"est to east, does an object thrown up into the air not fall back
to the west
of its original position i Why doesn't the earth fly apart in its rotation_?
Coperrici,s,s very weak answer to the latter question was that the sphere is
a iatural shape and moves naturally, and hence the earth would not
destroy itself. iurther, it was asked, why do objects on the earth and the air
itself stay with an earth rotating at about 3/10 of a mile per second and
revolving around the sun at the rate of about l8 miles Per second ? If, as
Ptotem/and Copernicus believed, the speed of a body in natural motion-
is proportional to its weight' the earth should leave behind it objects of
lesre. w"ight. copernicus replied that the air possessed " earthiness " and
would therefore stay with the earth.
Therewereadditionalscientificobjectionsfromastronomers.Ifthe
earth moved, why did the direction of the " fixed " stars not change ? An
angle of parall ax'of 2, required that the distance of the stars be at least
four
*Jlio' ti*"s the radius of the earth ; such a distance was inconceivable at
that time. Not detecting any parallax of the stars (which implied that they
hadtobeevenfartheraway),Copernicusdeclaredthat..theheavensare
immense by comparison with the Earth and aPPear to be of infinite size' ' ' '
The bounds of thi universe are unknown and unknowable." Then, realizing
the inadequacy of this answer, he assigned the problem to the philosophers
4. Though Kepler stated it this way, the correct statement calls for replacing average
distance by the semimajor axis.
246 MATHEMATTCAL coNTRrBUTroNs rN THE RENATSSANCE
and thereby evaded it. Not until IBSB did the mathematician Bessel measure
the parallax of one of the nearest stars and find it to be 0.31'.
If Copernicus and Kepler had been "sensible" men, they would never
have defied their senses. We do not feel either the rotation or the revolution
of the earth despite the high speeds involved. On the other hand, we do see
the motion of the sun-
Copernicus and Kepler were highly religious; yet both denied one of
the central doctrines of Christianity, that man, the chief concern of God,
was at the center of the universe, and everything in the universe revolved
around him, In contrast, the heliocentric theory, by putting the sun at the
center of the universe, undermined this comforting dogma of the Church,
because it made man appear to be just one of a possible host of wanderers
drifting through a cold sky. It seemed less likely that he was born to live
gloriously and to attain paradise upon his death. Less likely, too, was it that
he was the object of God's ministrations. Thus, by displacing the earth,
Copernicus and Kepler removed a cornerstone of Catholic theology and
imperiled its structure. Copernicus pointed out that the universe is so im-
mense, compared to the earth, that to speak of a center is meaningless,
However, this argument put him all the more into opposition with religion.
Against all the objections Copernicus and Kepler had only one retort,
but a weighty one. Each had achieved mathematical simplification and,
indeed, overwhelmingly harmonious and aesthetically superior theory. If
mathematical relationships were to be the goal of scientific work, and if a
better mathematical account could be given, then this fact, reinforced by
the belief that God had designed the world and would clearly have used
the superior theory, was sufrcient to outweigh all objections. Each felt, and
clearly stated, that his work revealed the harmony, symmetry, and design
of the divine workshop and overpowering evidence of God's presence.
Copernicus could not restrain his gratification: "We find, therefore, under
this orderly arrangement, a wonderful symmetry in the universe, and a
definite relation of harmony in the motion and magnitude of the orbs, of a
kind that it is not possible to obtain in any other way." The very title of
Kepler's work of 1619, Thc Hannony of the World, and endless paeans to God,
expressing satisfaction with the grandeur of God's mathematical design,
attest to his beliefs.
It is not surprising that at first only mathematicians supported the
heliocentric theory. Only a mathematician, and one convinced that the
universe was mathematically designed, would have had the mental fortitude
to disregard the prevailing philosophical, religious, and physical beliefs.
Not until Galileo focused his telescope on the heavens did astronomical
evidence favor the mathematical argument. Galileo's observations, made in
the early 1600s, revealed four moons circling around Jupiter, showing that
planets could have moons. It followed that the earth, too, need not be more
2+7
BIBLIOGRAPITY
than a planet just because it had a moon' Galileo also observed that the
*oon h.d a rough surface and mountains and valleys like the earth' Hence
the earth was also likely to be just one more heavenly body and not neces-
sarily the center of the universe.
th. heliocentric theory finally won acceptance because it was simpler for
calculations, because of its superior mathematics, and because observations
supported ii. fni, meant thaa the science of motion had to be recast in the
tight ofa rotating and revolving earth. In brief, a new science of mechanics
was needed,
Investigations into light and optics continued in an unbroken line from
what we haie already noted in the medieval period. In the sixteenth century
the astronomers became more interested in these subjects because the re-
fractive effect of air on light changes the direction of light rays as they come
from the planets and stars and so gives misleading information as to the direc-
tions of these bodies. Toward the end of the sixteenth century, the tele-
scopeandmicroscopewereinvented.Thescientificusesoftheseinstruments
ex-
are obvious; they opened uP new worlds, and interest in optics' already
tensive, surged stili higher. Almost all of the seventeenth-century mathe-
maticians worked on light and lenses.
Bibliograplty
Armitage, Angts Copcmicus, W. W' Norton, 1938'
John KePter, Faber and Faber, 1966'
Sun, Stind Then Stilt, Henry Schuman, 1947; in paperback as The World
oJ Copernicus, New American Library, l95l'
248 MATHEMATICAL CoNTRTBUTTONS rN THE RENATSSANCE
Koyr6, Alexandre: From thc Ctoscd Woild to th, Irfinite [Jnfuerse, Jobr:s Hopkins Press,
1957.
La Rioolution astronomique, Hermann, 1961.
Kuhn, Thomas S.: Tlu Copuni.can Rcaolution, Harvard University Press, 1957'
MacCurdy, Edward: The Noteboolcs of Lconartlo da virui, Georye Braziller (reprint),
1954.
Pannekoek, A.: A Histary oJ Astronomy, iohn Wiley and Sons, 1961, Chaps' 16-25'
Panofsky, F.;rwin: " Dilrer as a Matlumatician," in James R' Newman, Tlu World of
Mathematics, Simon and Schuster, 1956, pp' 603-21'
Santillana, G. d'e: Thc Crime of Galileo, University of Chicago Press, 1955'
sarton, George: Ttu Appreciation of Anci.ent and Medieoal scitnce During the Rznaissarce,
University of Pennsylvania Press, 1955.
Six Wings: Men oJ Scierce in the Rznaissance, Indiana University Press,
1957.
Smith, David Eugene: History of Mattumatics, Dover (reprint), 1958, Vol' l, Chap'
-t
8; Vol. 2, Chap. 8.
Smith, Preserve d: A History of Modern Culture, Holt, Rinehart and Winston, 1930,
Vol. l, Chaps. 2-3.
Taylor, Henry Osborn: Thought antl Exprcssion it the Sixteenth Ccntury, Ctowell-
Collier (reprint), 1962, Part V.
Taylor, R. Emmet: No Royal Road: Lua Pacioli and His Tdmcs, University of North
Carolina Press, 1942.
Tropfke, Johannes: Geschichtc tler Elcmentarmatlumatik, 7 vols', 2nd' ed', W' De
Gruyter, l92l-24.
vasari, Giorgio : Liaes of the Most Eminent Painters, sculptors, and Architects (marry
editions).
Wolf, Abraham: A History of in tlu Sixtccnth and
Sciznce, Tcchnology and Phihsophy
Allen and IJnwin, 1950, Chaps' l-6'
Seaenteedh Centuries, George
Zeller, Sister Mary Claudia: "The Development of Trigonometry from Regio-
montanus to Pitiscus ", Ph.D. dissertation, University of Michigan, 19'14;
Edwards Brothers, 1946.
r3
Arithmetic and Algebra in the
Sixteenth and Seventeenth Centuries
l. Introduction
The first major new European mathematical developments took place in
arithmetic and algebra. The Hindu and Arabic work had put practical
arithmetical calculations in the forefront of mathematics and had placed
algebra on an arithmetic instead of a geometric basis. This work also
attracted attention to the problem of solving equations.
In the first half of the sixteenth century there was hardly any change
from the Arab attitude or spirit, but merely an increase in the kind of
activity the Europeans had learned about from the Arab worlc. By the middle
of the century the practical and scientific needs of European civilization
were prompting further steps in arithmetic and algebra. Technological
applications of scientific work and practical needs require, as we have
pointed out, quantitative results. For example, the far-ranging geographical
explorations required more accurate astronomical knowledge. At the same
time the interest in correlating the new astronomical theory with increas-
ingly accurate observations demanded better astronomical tables, which in
turn meant more accurate trigonometric tables. In fact, a good deal of the
sixteenth-century interest in algebra was motivated by the need to solve
equations and work with identities in making trigonometric tables. Develop-
ing banking and commercial activities called for an improved arithmetic.
The response to these interests is evident in the writings of Pacioli, Tartaglia,
and Stevin, among oihers. Pacioli's Summa and Tartaglia,s General trittaro
de' numeri c misura (1556) contain an immense number of problems in mercan-
tile arithmetic. Finally, the technical work of the artisans, especially in
architecture, cannon-making and projectile motion, called for quantitative
thinking. Beyond these applications, a totally new use for algebra-the
250
STATUS OF THE NUMBER SYSTEM AND ARTTHMETTC 25r
He then argues that real numbers are either whole numbers or frac-
tions; obviously-irrationals are neither, and so are not real numbers' A
l. The symbol zr was first used by William Jones ( 1706)
'
252 ARTTHMETTC AND ALGEBRA r5OO-r7oO
century later, Pascal and Barrow said that a number such as \/5 can be
understood only as a geometric magnitude; irrational numbers are mere
symbols that have no existence independent of continuous geometrical mag-
nitude, and the logic of operations with irrationals must be justified by the
Eudoxian theory of magnitudes. This was also the view of Newton in his
Arithmetiea Unfuersalis (published in 1707 but based on lectures of thirty
years earlier).
Others made positive assertions that the irrational numbers were inde-
pendent entities. Stevin recognized irrationals as numbers and approxi-
mated them more and more closely by rationals; John Wallis in Algebra
(1685), also accepted irrationals as numbers in the full sense. He regarded
the frfth book of Euclid's Elements as essentially arithmetical in nature. Des-
cartes, too, in Rules for the Direction of the Mind (c. 1628), admitted irrationals
as abstract numbers that can represent continuous magnitudes.
As for negative numbers, though they had become known in Europe
through the Arab texts, most mathematicians of the sixteenth and seven-
teenth centuries did not accept them as numbers, or if they did, would not
accept them as roots of equations. In the fifteenth century Nicolas Chuquet
(1445?-1500?) and, in the sixteenth, Stifel (1553) both spoke of negative
numbers as absurd numbers. Cardan gave negative numbers as roots of
equations but considered them impossible solutions, mere symbols; he
called them fictitious, whereas he called positive roots real. Vieta discarded
negative numbers entirely. Descartes accepted them, in part. He called
negative roots of equations false, on the ground that they claim to represent
numbers less than nothing. However, he had shown (see sec. 5) that, given
an equation, one can obtain another whose roots are larger than the original
one by any given quantity. Thus an equation with negative roots could be
transformed into one with positive roots. Since we can turn false roots into
real roots, Descartes was willing to accept negative numbers. Pascal regarded
the subtraction of 4 from 0 as utter nonsense.
An interesting argument against negative numbers was given by
Antoine Arnauld (1612-94), a theologian and mathematician who was a
close friend of Pascal. Arnauld questioned that - l: I : l:- I because, he
said, - I is less than * 1; hence, How could a smaller be to a greater as a
greater is to a smaller ? The problem was discussed by many men. In 1712
Leibniz agreed 2 that there was a valid objection but argued that one can
calculate with such proportions because their form is correct, just as one
calculates with imaginary quantities.
One of the first algebraists to accept negative numbers was Thomas
Harriot (1560-1621), who occasionally placed a negative number by itself
on one side of an equation. But he did not accept negative roots. Raphael
Bombelli ( l6th cent.) gave clear definitions for negative numbers' Stevin
used positive and negative coefficients in equations and also accepted nega-
tive roots. ln his L'Inoention nowelle en l'alg]bre (1629), Albert Girard (1595-
1632) placed negative numbers on a Par with positive numbers and gave
both roots of a quadratic equation, even when both were negative' Both
Girard and Harriot used the minus sign for the operation of subtraction and
for negative numbers.
On the whole not many sixteenth- and seventeenth-century mathema-
ticians felt at ease with or accepted negative numbers as such, let alone
recognizing them as true roots of equations. There were some curious beliefs
about them. Though Wallis was advanced for his times and accepted nega'
tive numbers, he thought they were larger than infinity but not less than
zero. Irr his Arithmetica bfninrum (1655), he argued that since the ntio al0,
when a is positive, is infinite, then, when the denominator is changed to a
negative nu-ber, as h alb with D negative, the ratio must be greater than
infinity.
Without having fully overcome their difficulties with irrational and
negative numbers the Europeans added to their problems by blundering
into what we now call complex numbers. They obtained these new numbers
by extending the arithmetic operation of square root to whatever numbers
appeared in solving quadratic equations by the usual method of completing
the square. Thus, Cardan, in Chapter 37 of Ars Magna (1545), sets up and
solves the problem of dividing 10 into two parts whose Product is 40'{he
equationisx(I0 - r) : 40. He obtains the roots 5 + y'-15-a"d5 - Vll5-
urrd th.r, says, " Putting aside the mental tortures involved," multiply
5 + t/=tS and 5 - {=lS; the product is 25 - (-15) or 40. He then
states, "So progresses arithmetic subtlety the end of which, as is said, is as
refined as it ls useless." As we shall soon see, Cardan became further involved
with complex numbers in his solution of cubic equations (sec' 4)' Bombelli
too considered complex numbers in the solution of cubic equations and
formulated in practically modern form the four operations with complex
numbers; but he still regarded them as useless and " sophistic'" Albert
Girard did recognize complex numbers as at least formal solutions of
equations. ln L'Inoention nouaelle en I'algibre he says, "One could say: Of what
,rr" u." these impossible solutions [complex roots] ? I answer: For three
things-for the certitude of the general rules, for their utility, and because
there are no other solutions." However, Girard's advanced views were not
influential.
Descartes also rejected complex roots and coined the term "imaginary'"
in La ,.Neither the true nor the false [negative] roots are
He says Giomltrie,
always real; sometimes they are imaginary." He argued that' whereas
,r.guiirr. roots can at least be made "real" by transforming the equation in
which they occur into another equation whose roots are positive, this cannot
254 ARTTHMETTC AND ALCEBRA r5OC-r7OO
be done for complex roots. These, therefore, are not real but imaginary;
they are not numbers. Descartes did make a clearer distinction than his
predecessors between real and imaginary roots of equations,
Even Newton did not regard complex roots as significant, most likely
because in his day they lacked physical meaning. In fact he says, in Uninrsal
Arithmetic,s " But it is just that the Roots of Equations should be often
impossible [complex], lest they should exhibit the cases of Problems that are
impossible as if they were possible." That is, problems that do not have a
physically or geometrically real solution should have complex roots.
The lack of clarity about complex numbers is illustrated by the oft-
quoted statement by Leibniz, "The Divine Spirit found a sublime outlet in
that wonder of analysis, that portent of the ideal world, that amphibian
between being and not-being, which we call the imaginary root of negative
unity." I Though Leibniz worked formally with complex numbers, he had
no understanding of their nature.
During the sixteenth and seventeenth centuries, the operational pro-
cedures with real numbers were improved and extended. In Belgium (then
part of the Netherlands), we find Stevin advocating in La Disme (Decimal
Arithmetic, l5B5) the use of decimals, as opposed to the sexagesimal system,
for writing and operating with fractions. Others-ChristoffRudolff (c. 1500-
c. I 545) , Vieta, and the Arab al-Kashi (d . c. I 436) used them previously.
Stevin advocated a decimal system of weights and -hadmeasures he was con-
I
cerned to save the time and labor of bookkeepers (he himself had started his
career as a clerk). He writes 5.912 as 5 @ 9 O I @ 2 @, or as 5, 9' l'2'.
Vieta improved and extended the methods of performing square and cube
roots.
The use of continued fractions in arithmetic is another development of
the period. We may recall that the Hindus-Aryabhata in particular-had
used continued fractions to solve linear indeterminate equations. Bombelli,
in his Algcbra (1572), was the first to use them in approximating square roots.
To approximate y'2, he writes
(r) l2: r
t
+-.
s
From this one finds
(2) y:I+\/2.
By adding I to both sides of (l) and using (2), it follows that
(3) u:2+!.
-!
3. Second ed., 1728, p. 193.
4. Aeta 0rud., 1702 : Math. Schifun, 5,350-$1.
STATUS OF THE NUMBER SYSTEM AND ARITHMEfiC 255
t/2:t+ l,'
2+!v
And' since Y is given rv al;:
l + --f---.
Z+-\
z+!v
By repeated substitution of the value ofjt, Bombelli obtained
|
^/o
va:-,-r------------l-
- q -r- -------:-
qt I
I
2 +;+ ....
The right-hand side is also written as
{2: t.*++
This continued fraction is simple because the numerators are all I ; it is
periodic because the denominators rePeat. Bombelli gave other examples of
how to obtain continued fractions' He did not, however, consider the ques-
tion of whether the expansion converged to the number it was supposed to
represent.
The English mathematician John Wallis, in his Aithrwtita hfinitarum
(1655), represented 1as the infinite product In this book
i+X*++:.
he also stated that Lord William Brouncker (1620-84), the first prcsident of
the Royal Society, had transformed this product into the continued fraction
!,:r*+*##
Brouncker made no further use of this form' Wallis, however, took up the
work. In his Opera Mathematica, I (1695), wherein he introduced the term
" continued fraction," he gave the general rule for calculating the convergents
of a continued fraction. That is, if p,lqo is the zth convergent of the con-
tinued fraction
No definitive result on the convergence of pnlqo to the number that the con-
tinued fraction represents was obtained at this time.
The biggest improvement in arithmetic during the sixteenth and seven-
teenth centuries was the invention of logarithms. The basic idea was noted
by Stifel. ln Arithmetica Integra he observed that the terms of the geometric
progression
lrrrr2rr3r...
correspond to the terms in the arithmetic progression formed by the
exponents
o, 1r2,3,....
I\{ultiplication of two terms in the geometric progression yields a term whose
exponent is the sum ofthe corresponding terms in the arithmetic progression.
Division of two terms in the geometric progression yields a term whose
exponent is the diflerence of the corresponding terms in the arithmetic
progression. This observation had also been made by Chuquet in Le Triparty
(1484). Stifel extended this connection between the two
cn la sciente dts nombres
progressions to negative and fractional exponents. Thus the division of 12
by r3 yields r-1, which corresponds to the term - I in the extended arith-
metic progression. Stifel, however, did not make use of this connection
between the two progressions to introduce logarithms.
John Napier (1550-1617), the Scotsman who did develop logarithms
about 1594, was guided by this correspondence between the terms of a
geometric progression and those ofthe corresponding arithmetic progression.
Napier was interested in facilitating calculations in spherical trigonometry
that were being made on behalf of astronomical problems. In fact, he sent
his preliminary results to Tycho Brahe for approval.
Napier explained his ideas in Mirfui Logarithmorum Canonis Descriptio
(1614) and in the posthumously published Mirifui Logarithmarum Canonis
Constrwtio (1619). Since he was concerned with spherical trigonometry, he
dealt with the logarithms of sines. Following Regiomontanus, who used
half chords and a circle whose radius contained l0? units, Napier started
with 10? as the largest number to be considered. He let the sine values from
107 to 0 be represented on the line AZ (Fig. l3.l) and supposed that ..4
moves toward Z with a velocity proportional to its distance from Z.
Strictly speaking, the velocity of the moving point varies continually
with the distance from A and the proper expression of it calls for the
STATUS OF THE NUMBER SYSTEM AND ARITHMETIC 257
calculus. However, if we consider any small interval of time t and let the
lengths lB, BC, CD,. . . be traversed in this interval, and if we assume that
the velocity during the interval I is constant and the one Possessed by the
moving point at the beginning of the interval, then the lengths AZ, BZ,CZ, . . .
are in geometric progression. For, consider DZ, and let ,t be the propor-
tionality constant relating the velocity and the distance of the moving point
from Z. Now
DZ:CZ-CD.
Moreover, the velocity of the moving point at C is k(CZ). Then the distance
CD : k(CZ)t. Hence
DZ : CZ - k(Cz)t : Cz(t - kt).
Thus any length in the sequence AZ, BZ, CZ, .. . is I - ftl times the pre-
ceding length.
Next Napier supposed that another point starting at the same time as ''4
moves at a constant velocity on the line A'L (Fig. l3.l), which extends in-
definitely to the right, so that this point reaches B', C', D',. . . as the first
point reaches 8,C,D,..., respectively. The distanbes A'B', A'C', A'D',..'
are obviously in arithmetic progression. These distances A'B', A'C', A'D', . ' '
were taken by Napier to be the logarithms of BZ,CZ, DZ,. . ., respectively'
The logarithm of AZ or 107 was taken to be 0. Thus the logarithms increase
in arithmetic progression while the numbers (sine values) decrease in geo-
metric progression. The quantity here denoted by I - fr is f - in
S
Napier's original work. Ilowever, he changed this ratio as he proceeded in
his calculations of logarithms.
Note that the smaller we take t the less the decrease in the sine values
frorn AZ to BZ to CZ, etc. Hence the numbers of the logarithm table are
closer together.
Napier took the distances,4' B', A' C',. . . to be l, 2, 3, .. ., though there
was no need to do that. They could have been ll2, L, I 112,2, . .. and the
scheme would have worked just as well. Moreover, the original numbers
were meaningful just as quantities, independent of the fact that they were
sine values I so Napier's scheme really gave the logarithms of numbers'
Napier himself applied logarithms to computations of spherical trigonometry'
The word "logarithm," coined by Napier, means "number of the
ratio," "Ratio" refers to the common ratio of the sequence of numbers
AZ, BZ, CZ, . . . .He also referred to logarithms as " artificial numbers'"
Henry Briggs (1561-1631), a prolessor of mathematics and astronomy,
suggested to Napier in 1615 that l0 be used as a base and that the logarithm
ofa number be the exponent in the power of 10 equal to that number' Here,
unlike Napier's scheme, a base is chosen first. Briggs calculated his logarithms
258 ARrrHMErrc AND ALGEBRA r5oo-r 7oo
by taking successive square roots of 10, that i., y'T6, l-rt-tO,. . ., until he
reached, after 54 such root extractions, a number slightly greater than l.
That is, he obtained a number A : l}totD64l. Then he took logro z4 to be
(l/2)5a. By using the fact that the logarithm ofa product of two numbers is
the sum of their logarithms, he built up a table of logarithms for numbers
that were closely spaced. The tables of common logarithms in current use
are derived from those of Briggs.
Joost Brlrgi (1552-1632), a Swiss watch and instrument maker and an
assistant to Kepler in Prague, was also interested in facilitating astronomical
calculations; he invented logarithms independently of Napier about 1600
but did not publish his work, Progress Tabuhn, until 1620. Brirgi too was
stimulated by Stifel's remarks that multiplication and division of terms in a
geometric progression can be performed by adding and subtracting the
exponents. His arithmetical work was similar to Napier's.
Variations of Napier's idea were gradually introduced' Also, many dif-
ferent tables of logarithms were calculated by algebraic means. The calcu-
lation of logarithms by the use of infinite series was made later by James
Gregory, Lord Brouncker, Nicholas Mercator (born Kaufman, 1620-87),
Wallis, and Edmond Halley (see Chap. 20, sec. 2).
Though the definition of logarithms as exponents of the powers that
represent the numbers in a fixed base, as in Briggs's scheme' became the
common approach, they were not defined as cxPonents in the early seven-
teenth century because fractional and irrational exponents were not in use.
By the end of the century a number of men recognized that logarithms could
be so defined, but the first systematic exposition of this approach was not
made until 1742, wlr..en William Jones (1675-17 49) gave such a Presentation
in the introduction to William Gardiner's Tablc of lagaritirzs. Euler had
already defined logarithms as exponents and in 1728, in an unpublished
manuscript (Opera Posthuma, II, 800-804), introduced e for the base of the
natural logarithms.
The nexi development in arithmetic (whose further realization has
proved momentous in recent times) was the invention of mechanical devices
and machines to speed up the execution of arithmetic Processes. The slide
rule comes from the work of Edmund Gunter (158l-1626), who utilized
Napier's logaritlms. William Oughtred (1574-1660) introduced circular
slide rules.
In 1642 Pascal invented a computing machine that handled addition by
carrying from units to tens, tens to hundreds, etc., automatically. Leibniz
saw it in Paris and then invented a machine which could perform multipli
cation. He showed the Royal Society of London his idea in 1677; a descrip-
tion was published by the Berlin Academy in 1710. In the late seventeenth
century Samuel Morland (1625-95), independently invented one machine
for addition and subtraction and another for multiplication.
SYMBOLISM 259
3. Symbolism
The advance in algebra that proved far more significant for its development
and for analysis than the technical progress of the sixteenth century was the
introduction of better symbolism. Indeed, this step made possible a science
ofalgebra. Prior to the sixteenth century, the only man who had consciously
introduced symbolism to make algebraic thinking and writing more comPact
and more effective was Diophantus. All other changes in notation were
essentially abbreviations of normal words, rather casually introduced. In the
Renaissance the common style was still rhetorical, that is, the use of special
words, abbreviations, and of course number symbols'
The pressure in the sixteenth century to introduce symbolism un-
doubtedly came from the rapidly expanding scientific demands on mathe-
maticians, just as the improvements in methods of calculation were a response
to the increasing uses of that art. However, improvement was intermittent.
Many changes were made by accident; and it is clear that the men of the
sixteenth century certainly did not have an appreciation ofwhat symbolism
could do for algebra. Even after a decided advance in symbolism was made,
it was not taken up at once by mathematicians.
Perhaps the first abbreviations, used from the flfteenth century on, were
p for plus and rn for minus. However, in the Renaissance and especially
during the sixteenth and seventeenth centuries, special symbols were intro-
duced. The symbols * and - were introduced by Germans of the fifteenth
century to denote excess and defective weights of chests and were taken over
by mathematicians; they appear in manuscripts after 1481. The symbol x
for "times" is due to Oughtred; but Leibniz rightly objected to it because
it can be confused with the letter x.
The sign : was introduced in 1557 by Robert Recorde (1510-58) of
Cambridge, who wrote the first English treatise on algebra, Tlu Wlutstoru
oJ Wilte (1557). He said he knew no two things more nearly alike than
260 ARrIHMETTC AND ALGEBRA r5oo-I7oo
parallel lines and so two such lines should denote equality' Vieta, who at
irrst wrote out " aequalis," later used - for equality' Descartes used oc' The
symbols > and < are due to Thomas Harriot. Parentheses appear in 1544'
square brackets and braceq introduced by vieta, date from about 1593.
Th" ,qru." root symbol, {- , *u" used by Descartes; but for cube root he
wtote {71-.
As examples of some of the writing, we might note the following' Using
B for square root, p for plus, and m for minus, Cardan wrote
(5 + y'=15).(5 - /=15) : 25 - (-15) : 40
,ul
5p: &m:15
5m: &m:15
25m:m:15 qd. est 40.
Hc also wrote
\/i +Tr4 as &.v. 7.p:&r4.
The V indicated that all that followed was under the radical sign'
The use of symbols for the unknown and powers of the unknown had a
rather surprisingiy slow rise, in view of the simplicity and yet extraordinary
value of the practice. (Of course Diophantus had used such symbols') Early
sixteenth-ceniury authors, like Pacioli, referred to the unknown as radix
(Latin for "roor") or res (Latin for " thing"), cosa (" thing" in Italian), and
rass ("thing" in German), for which reason algebra became known as the
" cossic " ait. In his Ars Magna Cardan referred to the unknown as rem
ignotam. He wrote x2 : 4x * 32 as qtlratu aeqtur 4 rebus p:32'5 The constant
tlerm, 32, was called the numero. The terms and notation varied a great deal;
many symbols were derived from abbreviations' For example, one symbol
for tire unknown was the letter R, an abbreviation of res' The second power'
represented by Z (ftom zensus), was called the quadratwn ot censo ' C, taken
from cubus, denoted .r3.
Gradually exponents were introduced to denote the powers ofr' Expo-
nents attached to numbers, we may recall, were used by Oresme in the
fourteenth century. h 1484 Chuquet in Triparty wrote 123, 105, and 1208
for 12x3, 1015, ani 120*8. He also wrote I2o for l2ro and 71'for 7x-1' Thus
8", 7,^ equals 562 stood for 8x3'7x-L :5612.
Irnhii AtgebraBombelli used the word tanto, instead of cora' For *, 12, and
*t,he wrote! J" and$. Thus 1 + 3x + 6tc2 + x3 is 1 p:*P'&p' lc' l\
l5B5 Stevin r.irol for this expression l@ + 30 + 62 +@. Stevin also used
fractional exponents,(
/'r\ - /T\"
square root,( )for cube root, and so on'
; )for i
5. Rcm and' rebus ate forms in the declension of res'
SYMBOLISM 26I
Occasionally he and others used x2 also. For higher powers he used ra,
.r5,. . . but not ,tr. Newton used positive, negative, integral' and fractional
exponents, as in *5/3 and x-3. When, in l80l, Gauss adopted *2 for .r*, the
former became standard.
The most significant change in the character of algebra was introduced
in connection with symbolism by Frangois Vieta. Trained as a lawyer, he
served in that capacity for the parliament in Brittany; he was later privy
councillor to Henry of Navarre. When he was out of office from l5B4 to 1589,
as a result of political opposition, he devoted himself entirely to mathe-
matics. Generally, he pursued mathematics as a hobby and printed and
circulated his work at his own expense-a guarantee, as one writer put it,
of oblivion.
Vieta was a humanist in spirit and intention; he wished to be the pre-
server, rediscoverer, and continuator of ancient mathematics. Innovation
was for him renovation. He describes his In A*m Analytitam Isagogco as the
"work of the restored mathematical analysis." For this book he drew on
the seventh book of Pappus' Matlumatical Collection and on Diophantus'
Arithmetica. He believed that the ancients had used a general algebraic type
of calculation, which he reintroduced in his algebra, thus merely reactivating
an art known and approved of in antiquity.
During the hiatus in his political career Vieta studied the works of
Cardan, Tartaglia, Bombelli, Stevin, and Diophantus. From them and
particularly from Diophantus he got the idea of using letters. Though a
number of men, including Euclid and Aristotle, had used letters in place of
specific numbers, these uses were infrequent, sporadic, and incidental. Vieta
was the first to use letters purPosefully and systematically, not just to repre-
sent an unknown or Powers of the unknown but as general coefficients'
Usually he used consonants for the known quantities and vowels for the
unknown quantities. He called his symbolic algebra logistica speciosa, as
opposed to logistica numerosa. Vieta was fully aware that when he studied the
general quadratic equation ax2 + bx + , : 0 (in our notation), he was
.trdying an entire class of expressions. In making the distinction between
logistica numerosa and, logistiea speciosa in his Isagoge, Vieta drew the line be-
tween arithmetic and algebra. Algebra, the logistiea speciosa, he said, was a
method of operating on species or forms of things. Arithmetic, the numnosa,
6. Introduction to the Analytic Art, l59l : Opera, l-12.
262 ARTTHMETTo AND ALGEBRA r5oo-r7oo
dealt with numbers. Thus in this one step, algebra became a study of general
types of forms and equations, since what is done for the general case covers
an infinity of special cases. Vieta used literal coefficients for positive numbers
only.
Vieta sought to establish the algebraic identities concealed in geometrical
form in the old Greek works, but to his mind clearly recognizable in Dio-
phantus. Indeed, as we noted in Chapter 6, the latter had made many
transformations of algebraic expressions by using identities that he did not
cite explicitly. Tn his Zetetborum Libri QuinquzT Vieta sought to recapture these
identities. He completed the square of a general quadratic expression and
expressed general identities such as
though he wrote
manner by men who did not aPpreciate the importance of this device became
standard.In noting this, the historian Florian Cajori was impelled to say
that " our symbols today are a mosaic of individual signs of rejected
systems."
assistance, I sought out its demonstration in [various] forms. This was very
difficult. My version of it follows."
Tartaglia protested the breach ol promise, and' in Quesiti ed inoenzioni
diterse (1546) presented his own case. However' neither in this book nor in his
General trattato di numeri e misure (1556), which is a good presentation of the
arithmetical and geometrical knowledge of the times, did he give more on the
cubic equation itself. The dispute as to who first solved the cubic led to an
open conflict between Tartaglia and Ferrari that exhausted itself in wild
quarreling in which Cardan took no part. Tartaglia himself was not above
reproach; he published a translation of some of Archimedes' work which he
actually took over from William of Moerbecke (d. c. 12Bl), and he claimed
to have discovered the law of motion of an object on an inclined plane-
which really came from Jordanus Nemorarius.
The method Cardan published he first illustrates with the equation
*3 + 6* : 20. However, to see the generality of the method we shall
consider
(4) xa+mx:n
with z and z positive. Cardan introduces two quantities I and u and lets
(s) t-il:n
and
(B) t: +2' u:
Here we have taken the positive radical as Cardan does. Having obtained ,
and z Cardan takes the positive cube root of each and by (7) obtains one
value of r. Presumably this is the same root that Tartaglia got.
The above is Cardan's method. However, he had to prove that (7) gives
a correct value for x. His proof is geometrical; for Cardan, I and z were
volumes of cubes whose sides *.r" {t and {i, and the product Xi ' {i *ut
a rectangle formed by the two sides whose area was z/3. Also, where we
said t - u : nt Cardan says the differenqe in the volumes is a. Then he says
that the solution x is the difference in the edges of the two cubes, i.e. x :
lft f/i. To prove that this value of * is correct, he states and Proves a
-
soLUTroN oF THrRD AND FOURTH DEGREE EqUATIONS 265
Figure 13.2
d-$ la
geometrical lemma to the effect that, if from a line segment AC (Fig. 13.2) a
segment BC is cut off, then the cube on lB will equal the cube on ,4C minus
the cube on BC minus three times the right parallelepiped whose edges are
AC, AB, and, BC. This geometrical lemma is of course no more than that
(e) ({t- t4":t-u-3({t- {-,4{tt".
Granted this lemma (which, by using the binomial theorem, we see must be
correct, but which Cardan establishes by citing theorems of Euclid), Cardan
has buttoobservethatifheletsr : 1/1 - {-u,t - u: n,and{r{n:i,
then the lemma says that xs : n - mtc. Hence if he chooses I and z to satisfr
the conditions (5) and (6), the value of x given by (7) in terms of , and z will
satisfy the cubic. He then gives ? purety verbal arithmetical rule for the
method, which tells us to form \ft - Yi where I arrd, u are given by (B) in
terms of z and a.
Cardan also solves (as did Tartaglia) particular equations of the types
tc':m* +n, xa+mxln:0, x3 an:mt.
He has to treat each of these cases separately and all three separately from
equation (4) because, first, up to this time the Europeans wrote equations so
that only terms with positive numbers appeared in them, and second,
because he had to give a separate geometrical justification for the rule in
each case.
Cardan also shows how to solve equations such as *3 * 6x2 : 100' He
knew how to eliminate the x2 term; that is, since the coefficient was 6, he
replaced x by g - 2 and obtained !3 : l2y * 84. He also recognized that
one can treat an equation such as *a 4 6xa: 100 as a cubic by letting
yz : y. Throttghout the book he gives positive and negative roots, desPite
the fact that he calls negative numbers fictitious. However he ignored com'
plex roots. In fact, in Chapter 37 he calls problems leading to neither true
nor lalse (positive or negative) roots false problems. The book is detailed-
even boring, to a modern reader-because Cardan treats separately the
many cases not only ofthe cubic equation but also ofthe auxiliary quadratic
equations he must solve to find t and z. In each case, he writes the equation
so that the coefficients of the terms are positive.
There is a difficulty with Cardan's solution of the cubic, which he
observed but did not resolve. When the roots of the cubic are all real and
distinct, it can be shown that tr and u will be complex because the radicand
in (B) is negative; yet we need {t and, t7z- to obtain x. This means that real
266 ARITHMETIC AND ALGEBRA I5OO-I7OO
(13)
""+42+1:o.
n' n"
(14) n: {=qE.
With this value of z chosen, we select a value ..{ so that
(15)
f;: -f,.osze
or so that
this fixes 31. But for any given z{ that satisfies (16), so do A * 120" and
A + 240'. Since z : cos l, there are, then, three values satis$ing (13):
x2*2bx:c
he lets
x * b:!.
Then
!2:12+2bx* b2.
Then
x:!-b:\/c+F-b.
In the case of the third degree equation
xs+bxz*cx*d:0,
Vieta starts by letting r : g- bl\. This substitution gives the reduced cubic
(23) s3+Pg+q:0.
Next he introduces a further transformation, indeed the one we learn today.
He lets
(24) y:"-*"
and obtains
""-rL7""+q:0.
He then solves the quadratic in z3 and obtains
,a:-!+2 = V.R
- where /,'\8 lc\',-.
_ (t) * (i,
":
Here, as in Cardan's method, there are two values for 20. Though Victa used
only the positive cube root of 23, one can use all six (complex) roots. The use
of (24) would show that only three distinct values ofy result from tlre six
values of z.
To solve the general fourth degree equation
*tlbxs+cxz+dx*e:O,
Vieta lets x : ! - bl4 and. reduces the equation to
x4+Pxz+qt +r:0,
He then transposes the last three terms and adds 2x2g2 + ya to both sides.
This makes the left side a perfect square and, as in Ferrari's method, by
properly choosing y he makes the right side a perfect square of the form
(Ax + B)2. To choose y properly he applies the discriminant condition for
a quadratic equation and is led to a sixth degree equation in y, which for-
tunately is a cubic in y2. This step and the rest of the work is precisely the
same as in Ferrari's method.
Another general method explored by Vieta was to factor the poly-
nomial into flrst degree liactors just as we might factor x2 * 5x * 6 into
(x + 2) (x + 3). He was unsuccessful partly because he rejected all but
positive roots and partly because he did not have enough theory, such as the
270 ARITHMETIC AND ALGEBRA r soe-r 7OO
ln his Arithmctica [Jnbersalis Newton described but did not prove another
method for determining the maximum number of positive and negative real
roots and hence the least possible number of complex roots. This method is
more complicated to apply but gives better results than Descartes's rule of
signs. It was finally proved as a special case of a more general theorem by
Sylvester.lo Somewhat earlier Gauss showed that if the number of positive
roots falls short of the number of variations of sign, it falls short by an even
number.
Another class ofresults concerns the relationships between the roots and
coefficients ofan equation. Cardan discovered that the sum ofthe roots is the
negative of the coefficient of xo-I, that the sum of the products two at a time
is the coefficient of r'- 2, and so forth. Both Cardan and Vieta (in Da
Aequationum Rccognitione et Emendatione) used the first relationship between
roots and coefficients of low degree equations to eliminate the.ro-1 term in
polynomial equations in manners we have described earlier. Newton stated
ihe relationship between roots and coefficients in hlis Arithmctica Unioasalis;
so did James Gregory in a letter to John Collins (1625-83), the secretary of
the Royal Society. However, no proofs were given by any of these men'
Vieta and Descartes constructed equations whose roots were more or
less than the roots ofa given equation. The process is merely to replace x by
g * m. Both men also used the transformation ! : mx to obtain an equation
whose roots are z times the roots of a given equation. For Descartes the
former propess had the significance we mentioned earlier, namely, that false
(negative) roots can be made true (positive) roots, and conversely'
Descartes also proved that if a cubic equation with rational coefficients
has a rationai root, then the polynomial can be expressed as the product of
factors with rational coefficients.
Another major result is now known as the factor theorem. In the third
book of La GComCtrie Descartes asserted that /(x) is divisible by x - a, a
positive, if and only if a is a root off(x) :0, and byr * a ifa is a lblse root'
With this fact and others he had asserted, Descartes established the modern
method of finding the rational roots of a polynomial equation. After making
the highest coefficient l, he made all the coefrcients integral by multiply-
ing the roots of the given equation by the necessary factor. This is done by
using the rule he had given ofreplacing * byy/zz in the equation. The rational
roots of the original equation must now be integral liactors of the coostant
term in the new equation. If by trial one finds that a, say, is a root, then by
the factor theorem, y - a is a factor of the new polynomial in y. Descartes
points out that by eliminating this factor one reduces the degree of the equa-
tion and can then work with the reduced equation.
Newton in Arithmctica [Jniaersalis and others earlier gave t]reorems on the
upper bound for the roots of equations. One of these theorems involves
10. Proc. Lorulon Mart. Soc,, l, 1865, l-16 = Math. Papcrs,2' 498-513'
272 ARTTHMETIC AND ALGEBRA I5OO-r 7OO
the calculus and will be stated in Chapter t7 (sec. 7). Newton discovered
the relation between the roots and discriminant of an equation, namely that,
for example, ax2 + bx + , : 0 has equal, real, or nonreal roots according
as b2 - 4ac equals 0, is greater than 0 or is less than 0.
In La Giomitrie Descartes introduced the principle of undetermined co-
efficients. The principle can be illustrated thus: To factor *2
- I into two
linear factors, one supposes that
xz_t:(x+b)(x+d).
By multiplying out the right-hand side and equating coefficients of like
powers of *, one finds that
b+d:0
bd : -1.
One can now solve for D and /. Descartes stressed the usefulness of this
method.
One other method, the method of mathematical induction, entered
algebra explicitly in the late sixteenth century. Of course the method is
implicit even in Euclid's proof of the infinitude of the number of primes.
As he proves the theorem, he shows that if there are a primes, there must
be z + I primes; and since there is a first prime, the number of primes
must be infinite. The method was recognized explicitly by Maurolycus in
his Arithmetita of 1575 and was used by him to prove, for example, that
1+3 +5 +..'+ (2n - l): (na l)2. Pascal in one of his letters ac-
knowledged Maurolycus's introduction of the method and used it himsellin
bts Traiti du triangle arithmitique (1665), wherein he presents what we now call
the Pascal triangle (sec. 6).
in which each number is the sum of the two immediately above it, already
TIIE BINOMIAL THEOREM AND ALLIED TOPICS .tJ
known to Tartaglia, Stifel, and Stevin, was used by Pascal (1654) to obtain
the coefficients of the binomial expansion. Thus the numbers in the fourth
row are the coefficients in the expansion of (a * 6)3. Despite the flact that
this arrangement was known to many predecessors, it has been called
Pascal's triangle.
Newton in 1665 showed that we may comPute (l + 4)'directly without
reference to (1 + a)n-'. Then he became convinced that the expansion held
for fractional and negative n (it is an infinite series in this case) and so stated,
but never proved, this generalization. He did verify that the series of ( I '1 a|rz
times itself gave I + x, but neither he nor James Gregory (who arrived at
the theorem independently) thought a proof necessary. In two letters, ol
secretary of
June 6 and October 4, 1676, to Henry Oldenburg (c. 1615-77),
the Royal Society, Newton stated the more general result, which he knew
before 1669, namely, the expansion of (P + PQ)^t". H. thought of it as a
useful method of extracting roots, because if Q is less than I (the P being
factored out), the successive terms, being powers of Q, are smaller and
smaller in value.
Independently of the work on the binomial theorem, the formulas for
the number of Permutations and the number of combinations of z things
taken r at a time appeared in the works of a number of mathematicians, for
example, Bhdskara and the Frenchman Levi ben Gerson (1321)' Pascal
observed that the formula for combinations, often denoted O, "", "t (l),
also gives the binomial coefficients. That is, for z fixed and r running from
0 to z, the formula yields the successive coefficients. James Bernoulli, in
his Ars Conjectandi (1713), extended the theory of combinations and then
proved the binomial theorem for the case of z positive integral by using the
formula for combinations.
The work on Permutations and combinations is connected with another
development, the theory of probability, which was to assume major im-
portance in the late nineteenth century but which barely warrants mention
in the sixteenth and seventeenth centuries. The problem of the probability
of throwing a particular number on a throw of two dice had been raised
even in medieval times. Another problem, how to divide the stake between
two players when the stake is to go to the player who first wins z points, but
the ptay is interrupted after the first player has madep points and the second
4 poi.r,., appears in Pacioli's Summa
and in books by Cardan, Tartaglia, and
otir"... 1'hit ptoblem acquired some imPortance when, after it was proposed
to Pascal by Antoine Gombaud, Chevalier de Mdr6 (1610-85), Pascal and
Fermat corresponded about it. The problem and their solutions are unim-
portant, but iheir work on it does mark the beginning of the theory of
probability. Both applied the theory of combinations.
The 6rst significant book on probability was Bernoulli's Ars Conjectandi.
The most important new result, still called Bernoulli's theorem, states that
274 ARTTHMETTC AND ALGEBRA r 5OO-r 7OO
ifp is the probability ofa single event happening and 4 the probability ofits
failing to happen, then the probability of the event happening at least z
times in z trials is the sum of the terms in the expansion of (p + q)" ftom p"
to the term involving pa4n-n.
case did he give a proof, and this was a sketch. The best mathematicians of
the eighteenth century worked hard to prove his results, (Chap' 25, sec' 4)'
These all turned out to be correct except for one error (which we shall note
later) and one still-unproved famous " theorem," for which the indications
are all favorable. There is no doubt that he had great intuition, but it is
unlikely that he had proofs for all of his affirmations'
A document discovered in lB79 among the manuscripts of Huygens
gives a famous metltod, called the method of infinite descent, which was
introduced and used by Fermat. To understand the method let us consider
the theorem asserted by Fermat in a letter to Marin Mersenne (1588-Ie+8)
of December 25, 1640, which states that a prime of the form 4a + I can
be expressed in one and only one way as the sum of two squares' Thus
17 :-16 * I and 29:25 + 4. The method proceeds by showing that
if there is one prime of the form 4n + | that does zol possess the required
property,. then ihere will be a smaller prime of the form 4n + I not possessing
It. ih.rr, since z is arbitrary, there must be a still smaller one' By descending
through the positive integral values of z one must rcac}l, n : I and thus the
prime"4.t i t or 5. Then 5 cannot possess the required property' But,
since 5 is expressible as a sum of two squares and in only one way' so is every
prime of the form 4n * l. This sketch of his method Fermat sent to his
iriend pierre de Carcavi (d. 1684) in 1659. Fermat said that he used the
method to prove the theorem just described, but his proof was nwer found'
He also said that he proved other theorems by this method'
The method of infinite descent differs from mathematical induction'
First of all, the method does not require that one exhibit even one case in
which the proposed theorem is satisfied because one can conclude the argu-
ment by the fact that the case n : I merely leads to a contradiction of some
other known fact. Moreover, after the appropriate hypothesis is made for
one value of n, the method shows that there is a smaller, but not necessarily
the next, value of a for which the hypothesis is true. Finally the method dis-
proves certain assertions and is in fact more useful for this purpose'
Fermat also stated that no prime of the form 4z + 3 can be expressed
as a sum of two squares' In a note in his copy of Diophantus and in the letter
to Mersenne, Fermat generalized on the well-known 3, 4, 5 right-triangle
relationship by asserting the following theorems : A prime of the form
4z * I is ihe hypotenuse of one and only one right triangle with integral
arms. The tqt u"i of (4n + l) is the hypotenuse of two and only two such
right fiiangles; its cube, ofthree; its biquadrate, oi4; and so on, ad infinitum'
.du, e*u-pl", consider the case ofz : l. Then4n + I : 5 and 3, 4, 5 are
the sides ofih. one and only right triangle with 5 as hyPotenuse' Ilowever,
52 is the hypotenuse of the two, and only two, right triangles 15, 20, 25 and
7,24,25. Also 53 is the hypotenuse of the three, and only three, right
triangles 75, lO0, 125;35, 120, 125;and,44, 117, I25'
276 ARTTHMETTC AND ALGEBRA r sOG-r 7OO
In the letter to Mersenne, Fermat declared that the same prime number
4n * I and, its square are each the sum of two squares in one way only; its
cube and biquadrate, each in two ways; its fifth and sixth powers, each in
three ways, and so on ad infinitum. Thus for n : 1,5 : 4 + | and 52 :
9 + 16; 53 : 4 + l2l :25 * 100; and so forth. The letter continues: If
a prime number that is a sum of two squares be multiplied into another
prime that is also the sum of two squares, the product will be the sum of two
squares in two ways. If the first prime be multiplied into the square of the
second one, the product will be the sum of two squares in three ways; if
multiplied into the cube of the second one, then the product will be the sum
of two squares in four ways I and so on ad infinitum.
Fermat stated many theorems on the representation of Prime numbers
in the form x2 + 2y2, x2 + 3y', x' + 5y', *' - 2y', and other such forms,
which are extensions of the representation as a sum of squares. Thus every
prime ofthe form 6z * I can be represented as x2 + 3y2; every prime ofthe
form Bz + I and Bn + 3 can be represented as x2 + 2y2. An odd prime
number (every prime but 2) can be expressed as the diflerence of two
squares in one and only one way,
Two theorems asserted by Fermat have since been referred to as the
minor and major theorems, the latter also known as the last theorem. The
minor one, communicated by Fermat in a letter of October 18, 1640, to
his friend Bernard Fr6nicle de Bessy (1605-75), states that ifp is a prime
and if a is prime to ,, then ap - a is divisible by p.
The major Fermat " theorem," which he believed he had proved,
states that for n > 2 no integral solutions of x" ! y" : zo are possible. This
theorem was stated by Fermat in a marginal note in his copy of Diophantus
alongside Diophantus' problem: To divide a given square number into (a
sum of) two squares. Fermat added, "On the other hand it is impossible to
separate a cube into two cubes, or a biquadrate into two biquadrates, or
generally any power except a square into two powers with the same expo-
nent. I have discovered a truly marvelous proof of this, which however the
margin is not large enough to contain." IJnfortunately, Fermat's proof, il
he had one, was never found and hundreds of the best mathematicians
have not been able to prove it. Fermat stated in a letter to Carcavi that
he had used the method of infinite descent to Prove the case of n : 4 but
did not give full details. Fr6nicle, using Fermat's few indications, did give
a proof for that case in 1676, in his posthumously published Traiti des
triangles rectangles en nombres (Treatise on Numerical Properties of Right
Triangles).11
If we may anticipate, Euler proved the theorem for n:3 (Chap. 25,
sec, 4). Since the theorem is true for n : 3, it is true for any multiple o[ 3;
for if it were not true for n : 6, say, then there would be integers x,y, and z
such that
x6+ya:26.
But then
(r')'+(y')":(z')s,
and the theorem would be false for z : 3. Hence we know that Fermat's
theorem is true for an infinite number ofvalues ofn, but we still do not know
that it is true for all values ofn. It is actually necessary to Prove the theorem
only for n : 4 and. for n an odd prime. For suppose first that z is not divisible
by an odd prime; it must then be a power of 2, and since it is larger than 2
it must be 4 or divisibte by 4. Let n : 4tn. Then the equation + ga : z'L
'cn
becomes
(*^)n+(g^)n:(.^)n.
If the theorem were not true for z, it would therefore not be true for n : 4'
Hence if it is true for n : 4,it is true for all z not divisible by an odd prime'
lf n : ?n where y' is an odd prime, then if the theorem were not true for z
it would not be tiue for the exponent p. Hence if true for n : P it is true for
any z divisible by an odd Prime'
Fermat did make some mistakes. He believed that he had found a
solution to the long-standing problem of producing a formula that would
yield primes for values of the variable z. Now it is not hard to show that
i' + I "u.rt ot be a prime unless rz is a power of 2' In many letters dating
from 1640 on 12 Fermat asserted the converse-namely, that (2)3" + I
represents a series of primes-though he admitted that he could not Prove
thls assertion. Later he doubted its correctness' Thus far only the five
primes 3, 5, 17,257, and 65,537 yielded by the formula are known' (See
Chap. 25, sec. 4.)
Fermatstatedandsketchedlstheproofbyinfinitedescentofthetheo-
rem: The area of a right-angled triangle the sides of which are rational
numbers cannot be a square number. This sketch is the only detailed one
ever given by him, and it follows as a corollary that the solution of *' +
la : za in integers is imPossible.
On polygonal ,rr-L".. Fermat stated in his copy of Diophantus the
importani theorem that every positive integer is itself triangular or the sum
of i or 3 triangular numbers I every positive integer is itself square or a sum
of 2, 3, or 4 squares; every positive integer is either pentagonal or a sum of
2, 3, 4, or 5 pintagonal numbers; and so on for higher polygonal numbers-
Mr"h *o.t *as required to Prove these results, which are correct only if 0
12. (Euares' 2' 206.
13. (Euarcs, l, 340i 3,271.
278 ARrrrrMETrc AND ALGEBRA r5oo-r7oo
x:-----Z-
-bq1/6--z-4*
and one can construct x by performin g on a, b, and c the geometrical con-
structions called for by the algebraic expression on the right-
Algebra for Vieta meant a special procedure ficr discovery; it was
analysis in the sense of Plato, who opposed it to synthesis. Theon of Ale:<-
andria, who introduced the term " analysis," defined it as the process that
begins with the assumption of what is sought and by deduction arrives at a
known truth. This is why Vieta called his algebra the analytic art. It Per-
formed the process of analysis, particularly for geometric problems' In fact
this was the starting point of Descartes's thinking on coordinate geometry
28o ARTTIIMETTC AND ALGEBRA r 5OO-r 7OO
and his work on the theory of equations was motivated by the desire to
further their use in solving geometric constructions.
The interdependence of algebra and geometry can be seen also in the
work of Marino Ghetaldi (1566-1627), a pupil of Vieta. He made a system-
atic study of the algebraic solution oldeterminate geometric problems in one
book of his Apollonius Rediaiaus (Apollonius Modernized, 1607). Conversely,
he gave geometric proofs ofalgebraic rules. He also constructed geometrically
the roots of algebraic equations. A full work on this subject is his De resolu-
tione et compositione mathematica (1630), published posthumously.
We also find in the sixteenth and seventeenth centuries the recognition
that algebra had to be developed to replace the geometrical methods intro-
duced by the Greeks. Vieta saw the possibility of using algebra to deal with
equality and proportion of magnitudes, no matter whether these magnitudes
arose in geometrical, physical, or commercial problems, Hence he did not
hesitate to consider higher-degree equations and algebraic methodology; he
envisioned a deductive science of magnitudes employing symbolism. While
algebra was to Vieta largely a royal road to geometry, his vision was great
enough to see that algebra had a life and meaning of its own. Bombelli gave
algebraic proofs acceptable for his time, without the use of geometry.
Stevin asserted that what could be done in geometry could be done in
arithmetic and algebra. Harriot's book Artis Analyticae Praxrs (1631) ex-
tended, systematized, and brought out some of the implications of Vieta,s
work. The book is much like a modern text on algebra; it is more analytical
than any algebra preceding it and presents a great advance in symbolism.
It was widely used.
Descartes too began to see great potentialities in algebra. He says he
begins where Vieta left off. He does not regard algebra as a science in the
sense of giving knowledge of the physical world. Indeed such knowledge, he
says, consists of geometry and mechanics; he sees in algebra a powerful
method wherewith to carry on reasoning, particularly about abstract and
unknown quantities. In his view algebra mechanizes mathematics so that
thinking and processes become simple and do not require a great effort of
the mind. Mathematical creation might become almost automatic.
Algebra, for Descartes, precedes the other branches of mathematics. It
is an extension of logic useful for handling quantity, and in this sense more
fundamental even than geometry; that is, it is logically prior to geometry.
He therefore sought an independent and systematic algebra instead of an
unplanned and unfounded collection of symbols and procedures tied to
geometry. There is a sketch of a treatise on algebra, known as Lc Calcul
(1638), written either by Descartes himself or under his direction, that treats
algebra as a distinct science. His algebra is devoid of meaning. It is a tech-
nique of calculation, or a method, and is part of his general search for
method.
THE RELATIONSHIP OF ALGEBRA TO CEOMETRY 28I
Bibliographg
Ball, W. W. R.: I Short Acoount oJ tlu Histoty of Matlumatics, Dover (reprint), 1960,
Chap. 12.
Boyer, CarI B.: History of Analgtie Gcotrutry, Scripta Mathematica, 1956, Chap. 4.
283
BTBIJOGRAPITY
A History of Mattumatits,
-Oightnd, John Wiley and Sons, 1968' Chaps' 15-16'
Cajori, Florian: A Grcat Seaentcenth Ccntury Teaehcr of Mathcmatbs' Open
-2 Court, 1916.
A HisW of Mattumatics,2nd ed', Macmillan, 1919' pp' 130-59'
A History if Mathcnatital Notati'ons, Open Court, 1928' Vol' l'
Cantor, Moritz: Vorhsungen iibet Gesohichtc dcr Mathcmatik, 2nd
ed" B' G' Teubner'
ig00,;ottton Reprint Corp., 1965, Vol' 2, pp' 369-806'
Cardan, C.rbprro Onnia, lO vols', 1663, Johnson Reprint Corp''
1964'
Art' trans' T' R' Witmer' Massachlrsetts Institute
Cura.rro, Giroiamo: Tlu Great
of TechnologY Press, 1968.
Coolidge, Julian i.: Thc Matlumatics oJ Grcat Amatcurs' Dover (reprint)' 1963'
Chaps. G-7.
'N.;
David, F. Go^"r, Gods awl Gambling: TIu Origins and History of Probabilitg'
Hafner, 1962.
Descartes, Ren6: I[a Gcomctry, Dover (reprint), 1954, Book 3'
Dickson, Leonard E.: History of ttu Ttuory oJ Numbus, Carnegie
Institution' l9l9-
23, Chelsea (reprint), 1951, Vol' 2, Chap' 26'
189l-1912'
Fermat, Pierre de: @u*cs,4 vols. and Supplement, Gauthier-Villars'
1922.
Heath, Sir Thomas Ditphantus of Alexandria,2nd ed', Cambridge Univenity
L.:
Press, 1910; Dover (reprint), 1964, Supplement, Secs' l-5'
Hobson, B. W.': John Napbr awl tlu Inaention of Logaithms' Cambridge
Univenity
Pres, 1914.
Klein,- Jacob: Greek Mathematical Thought and ilu Ortgin of Algcbra' Massachusctts
irrrtit..,t. ofT.chnology Press, 1968' Contains a translation of Vieta's Ingoge'
Knott, C. G.: Napier Tcrccntenary Mcmorial Volume, Lotgmans Green' l9I5'
Montucla, 7. f.: ru*u;re des mathimatiqus, Albert Blanchard (reprint)'
196O'
1921.
Morley, Henry t Lifc oJCanlan,2 vols', Chapman and Hall, 1854'
Ne"utorr, Sir i.oact Matfumaticat Works, YoL 2, ed' D' T' Whiteside' Johnrcn
Reprint Corp., lg6T. This volume contains a translation of Aith"uticd
Univcrsalis .
285
286 THE BEorNMNcs oF pRoJEcrrvE GEoMETRv
ABCD. T]ne lines from o to the points on the foursides of this rectangle
constitute a projection of which OA, OB, OC, arLd OD are typical linee' If a
plane is ,ror"- inierpos"d between the eye and the rectangle, the lines of thc
irojectio, will cui through the plane and mark out on it ttr9 quadratgle
A'B'C'O'. Since.the sectio t, narrrlely A'B'C'D', ctates tble same imprereiau on
the eye as does the original rectangle, it is reasonable to ask, as Alb€rti did,
what geometrical propirties do the section and the original rectmgle have in
common ? It is intuitively appafent that the original figure and ttre section
will be neither congruent nor similar; nor will they contain the same area.
In fact the section need not be a rectangle.
There is an e:rtension of this problem: Suppose two different sections of
this same projection are made by two diflerent planes tlat cut the projection
at any angle. What properties would the two sections have in common?
fhe froble- *ty b" further entended. Suppose a rectangle ABCD is
viewed frim two different locations O' and Oo (Fig' la'2)' Then tlere are
two projections, one determined by O' and the rectangle an{ the -seclnd
deterrnined by o, and the rectangle. If a section is made of each projeixlion,
then, in view of the fact that each section should have some
properties in common with the rectangle, the two sections should have some
lorn*on geometrical proPerties.
some of the seventeenth-century geometers undertook to a.oswer these
questions. They viewed the methods and results they obtained T PT "l
liuclidean geometry. Ilowever, these methods and results, while indeed
contributing much to that subject, proved to be the beginaing of a new
288 TrrE BEGTNNTNGS OF PROJECfi\/E GEOMETRY
Glass screen
Figure 14.2
l, (Euures, 1, 103-230.
THE'woRK oF DESARGUEs 289
Figure 14.3
Having introduced his points and lines at inflnity, Desargues was able
to state a basic theorem, still called Desargues's theorem, Consider ttre point
O (Fig. ru.q and triargle ABC. The lines from O to the various points oa the
sides of the triangle constitute, as we know, a projection. A section of this
projection will then contain a triangle A'B'C', where r4' corresponds to A, B'
to B, and C' to C. The two triangles ABC and A'B'C' arc said to be per-
spective from the point O. Desargues's theorem then states: The pairs of
corresponding sides l.B and, A'B', BC and B'C', arr.d AC and A'C' (or their
prolongations) of two triangles perspective from a point meet in three points
which lie on one straight line. Conversely, if the three pairs of corresponding
sides of the two triangles meet in three points which lie on one straight line,
then the lines joining corresponding vertices meet in one point. With specific
reference to the figure, the theorem proPer says that because AA', BB', arLd
CC' meet in a point O, the sides AC and, A'C' meet in a point P; AB andA'B'
meet in a point Q; and BC and B'C'rrrcet in a point R;andP, Q, and Elie
on a straight line.
Though the tieorem is true whether the triangles ABC arrd A'B'C'te
in the same or different planes, its proof is simple only in the latter case.
Desargues proved the theorem and its converse for both the two- and three-
dimensional cases,
In the appendix to Bosse's 1648 work, there appears another funda-
mental result due to Desargues, the invariance ofcross ratio under projection.
The cross ratio of the line segments formed by four points A, BrC, and D on
one line (Fig. 1a.5) is by definition
y*lA* Pappus had already introduced
this ratio (Chap. 5, sec. 7) and proved that it is the same on the two lines lD
and. A' D' . Menelaus also had a similar theorem about arcs of great circles on
THE EEGINMNGS OF PROJECTIVE GEOMETRY
Figure 14.5
a sphere (Chap. 5, sec. 6). But neither of these men thought in terms of
projection and section. Desargues did, and proved that the cross ratio is the
same for every section of the projection,
In his major work (1639), he treated the concept of involution, which
Pappus had also introduced but which Desargues named. Two pairs of
points l, B and A',8'are said to be an involution if there is a special point
O, called the center of the involution, on the line containing the four such
that OA.OB : OA'.OB' (Fig. la.6). Likewise, three pairs of points, l, B,
A', B', and A', B" are said to be an involution if OA.OB : OA'.OB' :
OA'.OB'. The points A and B, A' and. B', and. A' and Bo are said to be
conjugate points. If there is a point .E such that OA.OB :OE', then E is
called a double point. In this case there is a second double point d and O is
the midpoint of EF. The conjugate of O is the point at infinity. Desargues
used Menelaus' theorem on a line cutting the sides of a triangle to prove that
if the four points l, 8,, A', and B' are an involution (Fig. 1a.7), and if they
are projected from P onto the points AL, BL, A1, and Bi ofanother line, then
the second set of four points is also an involution.
In the subject of involutions, Desargues proved a major theorem. To
get at it, let us first consider the concept ofa complete quadrilateral, a notion
already partly treated by Pappus. Let B, C, D, and E be any four points in a
plane (Fig. l4.B), no three collinear. Then they determine six lines that are
the sides of the complete quadrilateral. Opposite sides are two sides that do
not have one of the four points in common. Thus .BC and DE are opposite,
Figure I4.7
293
Figure 14.8
as are CD and BE and BD ar'id' CE. The intersections O, fl and ''4 of the
three pairs of opposite sides are the diagonal points of the quadrilateral. Now
take t-h" forr, ,reiti.es B, C, D, -E on a circle. Suppose a line PM meets the
pairs ofsides inP, Q;d 1(; and G,H;and the circle in the pair L, M'Thae
four pairs of points are four pairs of an involution.
iurther, ,rppor. the entire figure is projected from some point outside
the plane of the figure and a section is made of this projection' The circle
williive rise to a conic in the section, and each line in the original figure will
girre ri.e to some line in the section. In particular, the quadrilateral in the
circle will give rise to a quadrilateral in the conic. Since an involution
projects into an involution, there follows the important and general result:
if a-quadrilateral is inscribed in a conic, any straight line not passing through
,r".t.* intersects the conic and the pairs of opposite sides of the complete
"quadrilateral in four pairs of points of an involution.
Desargues next introduces the notion of a harmonic set of points' The
points 24, B, E, arld F fotm a harmonic set if ,4 and B ate a pair of conjugate
points with respect to the double points E and F' of an involution' (The
definition that the cross ratio of a harmonic set is - I is a later
"r.r.rrt
approach.)z Since an involution projects into an involution, so does a
har-onic set project into a harmonic set. Then Desargues shows that if one
member of a harmonic set of points is the point at infinity (on the line of the
four), the other point of that pair bisects the line segment joining the other
two foints of the set. Further if A, B, A' , and' B' ate aharmonic set (Fig' 1a'9),
a.rd lf th" projection from O is formed, then if Ol' is perpendiculat to OB' ,
Ou4' bisecs arrgle AOB, and OB' bisects the suPplementary angle'
With the notion of harmonic set available, Desargues proceeds to pole
2. It is due to Miibius: Bdrgcent*clu Calcul (1827), p. 269.
294 THE BEGINNINGS OF PROJECTIVE GEOMETRY
Figure I4.9
Figurc 14.10
Figure l4.l I
monic conjugate with resPect to A' and B' of the Point at infinity on A'B' is
the midpoint B of the chord r4'B'. Likewise 81, the harmonic conjugate of the
point at infinity with respect to .r4i and B'r, is the midpoint of the chord
liBi. These midpoints of a family of parallel chords lie on one straight line,
which is a diameter in Apollonius' definition also. Desargues then proves a
number of f,acts about diameters, conjugate diameters, and asymPtotes to
hyperbolas.
As we can now see, Desargues not only introduced new concePts,
notably the elements at infinity, and many new theorems, but above all he
introduced projection and section as a new method of proof and unified the
approach to the several types of conics through projection and section,
whereas Apollonius had treated each type ofconic separately. Desargues was
one of the most original mathematicians in a century rich in genius.
vacuums, the fact that the weight ofthe air decreases as the altitude increases,
and a clarification ofthe concept ofpressure in liquids. The originality ofthe
work in physics has been questioned ; in fact, some historians of science have
described it as popularization, or plagiarism, depending upon whether or not
they wished to be charitable.
Pascal was great in many other fields. He became a master of French
prose; his PensCes and Lettres proainciales are literary classics. He also became
famous as a polemicist in theology. From childhood on, he attempted to
reconcile religious faith with the rationalism of mathematics and science, and
throughout his life the two interests competed for his energy and time.
Pascal, like Descartes, believed that truths of science must either appeal
clearly and distinctly to the senses or the reason or be logical consequences
of such truths. He saw no room for mystery-mongering in matters of science
and mathematics. "Nothing that has to do with faith can be the concern of
reason." In matters of science, in which only our natural thinking is in-
volved, authority is useless; reason alone has grounds for such knowledge.
However, the mysteries of faith are hidden from sense and reason and must
be accepted on the authority of the Bible. He condemns those who use
authority in science or reason in theology. However, the level of faith was
above the level of reason.
Religion dominated his thoughts after the age of twenty-four though
he continued to do mathematical and scientific work. He believed that the
pursuit of science for mere enjoyment was wrong. To make enjoyment the
chief end of research was to corrupt the research, for then one acquired " a
greed or lust for learning, a profligate appetite for knowledge.. . . Such a
study of science sprang from a prior concern for self as the center of things
rather than a concern for seeking out, amid all surrounding natural
phenomena, the presence of God and His glory."
In mathematical work he was largely intuitive; he anticipated great
results, made superb guesses, and saw shortcuts. Later in life, he favored
intuition as a source of all truths. Several of his declarations bearing on this
have become famous, "The heart has its own reasons, which reason does not
know." " Reason is the slow and tortuous method by which those who do not
know the truth discover it." " Humble thyself, impotent reason."
If one may judge from a letter Pascal wrote to Fermat on August 10,
1660, toward the end of his life, Pascal seems to have turned somewhat
against mathematics. He wrote: "To speak freely of mathematics, I find it
the highest exercise of the spirit; but at the same time I know that it is so
useless that I make little distinction between a man who is only a mathe-
matician and a common artisan. Also, I call it the most beautiful occupation
in the world; but it is only an occupation; and I have often said that it is
good to make the attempt [to study mathematics], but not to use our forces:
so that I would not take two steps for mathematics, and I am confident that
THE WORK OF PASCAL AND I.A HIRE 297
Pq
Figure 14.12 B
you are strongly of my opinion." Pascal was a man of manifold but con-
tradictory qualities.
Desargues urged Pascal to work on the method ofprojection and section
and suggested in particular the goal of reducing the many properties of the
conic sictions to a small number of basic propositions. Pascal took up these
recommendations. In 1639, at the age of sixteen, he wrote a work on conics
that used projective methods, that is projection and section. This work is now
lost, but Leibniz did see it in Paris in 1676 and described it to Pascal's
nephew. An eight-page Essag on Conics (1640), known to a few of Pascal's
contemporaries, was also lost until 1779, when it was recovered'8 Descartes,
who saw the 1640 essay, regarded it as so brilliant that he could not believe
it was written by so young a man.
Pascal's most famous result in projective geometry, which appeared in
both of the works just mentioned, is a theorem now named after him' The
theorem in modern language asserts the following: Ifa hexagon is inscribed
in a conic, the three points of intersection of the pairs of oPPosite sides lie
on one line. Thus (Fig. 14.12) P, Q, and ft lie on one straight line' If the
opposite sides of the hexagon are parallel, P, Q, and',R will lie on the line at
infinity.
We have only indications of how Pascal proved this theorem' He says
that since it is true of the circle, it must by projection and section be true of
all conics. And it is clear that if one forms a projection of the above figure
from a point outside the plane and then a section of this projection' the
section will contain a conic and a hexagon inscribed in it. Moreover, the
opposite sides of this hexagon will meet in three points of a straight line,
the points and straight line that correspond to P, Q, -R, and the line PQit of
the triginal figure. Incidentally Pappus' theorem (Chap. 5, sec' 7), which
refers to three points on each of two lines, is a special case of the above
Figurc 14.13
theorem. When the conic degenerates into two straight lines, as when a
hyperbola degenerates into its asymptotes, then the situation described by
Pappus results.
The converse of Pascal's theorem, namely, if a hexagon is such that the
points of intersection of its three pairs of opposite sides lie on one straight
linc, thcn the vertices of t}re hexagon lie on a conic, is also correct but was not
considered by Pascal. There are other results in Pascal's 1640 work but they
do not warrant attention here.
The method of projection and section was taken up by Philippe de La
Hirc (l&0-1718), who was a painter in his youth and then turned to
matherbatics and astronomy. Like Pascal, La Hire was influenced by
Desargues and did a considerable amount of work on the conic sections.
Some of it, in publications of 1673 and 1679, employed the synthetic manner
of the Greeks but with new approaches, such as the focus-distance definition
of thc ellipse and hyperbola, and some of it used the analytic geometry of
Descartes and Fermat. His greatest work, however, is the Sectioncs Conicaz
(1685) and this is devoted to projective geometry.
Lite Desargues and Pascal, La Hire first proved properties ofthe circle,
chiefly involving harmonic sets, and then carried these properties over to the
other conic sections by projection and section. Thus he could carry the
properties of the circle over to any type of conic section in one method of
proof. Though there were a few omissions, such as Desargues's involution
theorem and Pascal's theorem, in this 1685 work of La Hire we find practi-
cally all the now familiar properties of conic sections synthetically proved
and systematically established. In fact, La Hire proves almost all of Apol-
lonius' 364 theorems on the conics. FIe also has the harmonic properties of
quadrilaterals. In all, La Hire proved about 300 theorems. He tried to show
that projective methods were superior to those of Apollonius and to the new
analytic methods of Descartes and Fermat (Chap. 15) which had already
bcen crcated.
On the whole, La Hire's results do not go beyond Desargues's and
Pascal's. However, in pole and polar theory he has one major new result.
TIIE EMERGENCE OF NEW PRINCIPLES 299
He proves that if a Point traces a straight line, then the polar of the point
will rotate around the pole of that straight line. Thus if Q (Fig. 14'13) moves
along the line p, then the polar of I rotates around the pole P of the line p'
BibliographE
Chasles, Michcl: Apergu hbtoiquc des nithodts n giomitric,3rd ed., Gauthier-Villars
et Fils, 1889, pp. 68-95, l18-37' (Same as first edition of 1837.)
Cootidge, Julian L.: A History of Geonutriral Methods, Dover (reprint)' 1963, pp.
88-92.
A History of the Conir Sectians and Quadric Sufaccs, Dover (reprint), 1968'
Chap. 3.
"The Rise and Fall of Projective Geometry," Atnet Math. Monthlg,41,
t93+,217-28.
Desargues, Girardt (Ewres,2 vols., Leiber, 1864'
-:
Ivins, W. M., Jr.: "A Note on Girard Desargues," Scripta Math., 9, l9+3, 33--48.
"A Note on Desargues's Theorem," S*ipta Math., 13, 1947,205-10.
Mortimer, Erncst: Blaisc Pascal: Tlu Life antl Work of a Realist, Harper and Bros.,
1959.
Pascal, Blaise: (Euores, Hachette, l9l'$-21.
Smith, David Eugene: A Sourec Book in Mathcmaiics, Dover (reprint), 1959' Vol. 2,
pp. 307-14, 326-30.
Struik, D. J.: A Source Book in Matlumatics, 1200-1800, Ilarvard lJniversity Press,
1969, pp. 157-68.
Taton, Ren6: L' CEuure mathimati.qw de G. Dcsargues, Presses Universitaires de
France, 1951.
r5
Coordinate Geometry
l. Fcrmat usca these terms in the serxe orplained by Pappx. See Chap. 8, sec.2.
2. Guorus, 1, 9l-L03,
302
303
Figure l5.l
the first degree in A and. E has a straight-line Iocus and all second degree
equations in I and .E have conics as their loci. In his Methodus ad Disquirendam
Maximam et Minimam (Method of Finding Maxima and Minima, 1637),3 he
introduced the curves ofy : va and y : x-",
3. Reni Descartes
Descartes was the first great modern philosopher, a founder of modern
biology, a first-rate physicist, and only incidentally a mathematician.
However, when a man of his power of intellect devotes even part of his time
to a subject, his work cannot but be significant.
He was born in La Haye in Touraine on March 31, 1596. His father,
a moderately wealthy lawyer, sent him at the age of eight to the Jesuit
school of La Fldche in Anjou. Because he was of delicate health, he was
allowed to spend the mornings in bed, during which time he worked. He
followed this custom throughout his life. At sixteen he left La Fldche and at
twenty he was graduated from the University of Poitiers as a lawyer and went
to Paris. There he met Mydorge and Father Marin Mersenne and spent a
year with them in the study of mathematics. However, Descartes became
restless and entered the army of Prince Maurice of Orange in 1617' During
the next nine years he alternated between service in several armies and
carousing in Paris, but throughout this period continued to study mathe-
matics. His ability to solve a problem that had been posted on a billboard in
Breda in the Netherlands as a challenge convinced him that he had
mathematical ability and he began to think seriously in this subject. He
returned to Paris and, having become excited by the power of the telescope,
secluded himself to study the theory and construction ofoptical instruments.
In t62B he moved to Holland to secure a quieter and freer intellectual
atmosphere. There he lived for twenty years and wrote his famous works.
In 1649 he was invited to instruct Queen Christina of Sweden. Tempted by
the honor and the glamor of royalty, he accepted. He died there of
pneumonia in 1650.
His first work, Regulae ad Directionem Ingenii (Rules for the Direction of
the Mind),a was written in 1628 but published posthumously. His next
major work was Le Mondz (System of the World, 1634), which contains a
cosmological theory ofvortices to explain how the planets are kept in motion
and in their paths around the sun. However, he did not publish it for fear of
persecution by the Church. In 1637 he published his Discours d.c la mithodc
lour bien conduirc sa raison, et chercher la ahitC dans les scienrcs.s This book, a
.,. All the sciences which have for their end investigations concerning
order and measure are related to mathematics, it being of small im-
portance whether this measure be sought in numbers, forms, stars,
soun&, or any other object; that accordingly, there ought to exist a
gcneral sciencc which should explain all that can be Lnown about order
and measure, considered independently of any application to a Particular
subject, and that, indeed, this science has its own ProPer name, con-
secrated by long raage, to wit, mathematics. And a proof that it far
surpa$es in facility and importance the sciences which depend upon it is
that it embraces at once all the objects to which these are devoted and a
great maDy others beides. . , .
And so he concluded that "The long chains of simple and easy reasonings
by means ofwhich geometers are accustomed to reach the conclusions oftheir
most difficult demonstrations had led me to imagine that all things to the
knowledge of which man is competent are mutually connected in the same
way."
. From his study of mathematical method he isolated in his Rulcs Jot tlu
Direction of tlu Mind the following principles for securing exact knowledge in
any field. He would accePt nothing as true that was not so clear and distinct
in his own mind as to exclude all doubt; he would divide difficulties into
smaller ones; he would proceed from the simple to the complex; and,
lastly, he would enumerate and review the steps of his reasoning so
thoroughly that nothing could be omitted.
With these essentials of method, which he distilled from the practice of
mathematicians, Descartes hoped to solve problems in philosophy, physics,
anatomy, astronomy, mathematics, and other fields. Although he did not
succeed in this ambitious Program, he did make remarkable contributions
to philosophy, science, and mathematics' The mind's immediate appre'
hension of basic, clear, and distinct truths, this intuitive power, and the
RENE DESCARTES 3O7
mathematics for its own sake are idle searchers given to a vain play of the
spirit.
Figure 15.2
(Descartes ignored the second root, which is negative.) Descartes now gives a
for x. He constructs the right triangle NLM (Fig' l5'2) with
"orrrt.r"tio.,
LM : b and NL : al2, and prolongs MN to O so that 'l[O : NL : al2'
Then the solution x is the lengtb oM. The proof that oM is the correct
length is not given by Descartes but it is immediately apparent for
OM: ON + Uw:i+
3IO cooRDTNATE GEOMETRY
Thus the orpression (I) for *, which was obtained by solving an algebraic
equation, indicates the proper construction for *.
In the first half of Book I, Descartes solves only classical geometric
construction problems with the aid of algebra. This is an application of
algebra to geometry, but not analytic geometry in our present sense. The
problems thus far are what one might call determinate construction prob-
lems because they lead to a unique length. He considers next indeterminate
construction problems, that is, problems in which there are many possible
lengths that serye zui zulswers. The endpoints of the many lengths fill out a
curve I and here Descartes says, "It is also required to discover and trace the
curve containing all such points." This curve is described by the final
indeterminate equation expressing the unknown lengths y in terms of the
arbitrary lengths *. Moreover, Descartes stresses that for each r, y satisfies a
determinate equation and so can be constructed. If the equation is of the
first or second degree, y can be constructed by the methods of Book I, using
only lines and circles. For higher-degree equations, he says he will show in
Book III how y can be constructed.
Descartes uses the problem of Pappus (Chap. 5, sec. 7) to illustrate what
happens when a problem leads to one equation in two unknowns. This
problem, which had not been solved in full generality, is as follows: Given
the poeition of three lines in a plane, find the position ofall points (the locus)
from which we can construct lines, one to each ofthe given lines and making
a known angle with each of these given lines (the angle may be different
from line to line), such that the rectangle contained by two ofthe constructed
lines has a given ratio to the square on the third constructed line; ifthere are
four given lines, then the constructed lines, making given angles with the
given lines, must be such that the rectangle contained by two must have
a given ratio to the rectangle contained by the other two; if there are
five given lines, then the five constructed lines, each making a given angle
with one of the given lines, must be such that the product of three of them
has a given ratio to the product of the remaining two. The condition on the
locus when there are more than five given lines is an obvious extension ofthe
above.
Pappus had declared that when three or four lines are given, the locus
is a conic section. In Book II Descartes treats the Pappus problem for the
case of four lines. The given lines (Fig. 15.3) are AG, GH, EF, and AD.
Consider a point C and the four lines from C to each of the four given lines
and making a specified angle with each of the four given lines. The angle can
be different from one line to another. Let us denote the four lines by CP,CQ,
CR, and CS. It is required to find the locus of C satisSing the condition
CP.CR = CS.CQ.
Descartes denotes AP by x and PC by y. By simple geometric con-
siderations, he obtains the values of CR, CQ, and C.S in terms of known
DESCARTES,S vvORK IN COORDINATE GEOMETRY 3tr
Figure 15.3
construct them. But, Descartes says, even the straight line and circle require
some instrument. Nor can the accuracy of the mechanical construction
matter, because in mathematics only the reasoning counts. Possibly, he
continues, the ancients objected to linear curves because they were in-
securely defined. On these grounds, Descartes rejects the idea that only the
curves constructible with straightedge and compass6 are legitimate and even
proposes some new curves generated by mechanical constructions. He con-
cludes with the highly significant statement that geometric curves ere those
that can be expressed by a unique algebraic equation (of finite degree) in *
and y. Thus Descartes accepts the conchoid and cissoid. All other curves,
such as the spiral and the quadratrix, he calls mechanical.
Descartes's insistence that an acceptable curve is one that has an
algebraic equation is the beginning of the elimination of constructibility as a
criterion of existence. Leibniz went farther than Descartes. Using the words
" algebraic " and " transcendental " for Descartes's terms " geometrical " and
" mechanical," he protested the requirement that a curve must have an
algebraic equation.? Actually Descartes and his contemporaries ignored the
requirement and worked just as enthusiastically with the cycloid, the
logarithmic curve, the logarithmic spiral (1og p : a0) , and other non-
algebraic curves.
In broadening the concept of admissible curves, Descartes made a
major step. He not only admitted curve$ formerly rejected but opened up
the whole field of curves, because, given any algebraic equation in x arrd. g,
one can find its curve and so obtain totally new curves. In Arithmetica
Uniocrsalis Newton says ( I 707), " But the Moderns advancing yet much
further [than the plane, solid and linear loci ofthe Greeks] have received into
Geometry all Lines that can be expressed by Equations."
Descartes next considers the classes of geometric curves. Curves of the
first and second degree in * andy are in the first and simplest class. Descartes
says, in this connection, that the equations of the conic sections are of the
second degree, but does not prove this. Curves whose equations are of the
third and fourth degree constitute the second class. Curves whose equations
are ofthe fifth and sixth degree are ofthe third class and so on. His reason for
grouping third and fourth, as well as fifth and sixth degree curves, is that he
believed the higher one in each class could be reduced to the lower, as the
solution of quartic equations could be effected by the solution of cubics.
This belief was of course incorrect.
The third book of La Giomitrie returns to the theme of Book I. Its
objective is the solution of geometric construction problems, which, when
formulated algebraically, lead to determinate equations of third and higher
degree and which, in accordance with the algebra, call for the conic sections
6. Compa.re the discussion in Chap.8, sec. 2.
Acta Erud., 1684, pp. 470, 587 ; 1686, p. 292 : Math. Schriftcn, 5, 127 , 223,226.
DESCARTES'S I^/ORK IN COORDINATE GEOMETRY 3r3
a z zzla
2: -zzla:
ZlA'
Then, if we take zsf a2 to be g, we have the equation z must satisfy. Hence,
given q and 4, we must find z such that
(3) z3 : azq,
or, we must solve a cubic equation. Descartes now shows that such quantities
z and z2la can be obtained by a geometrical construction that utilizes a
parabola and a circle.
As the construction is described by Descartes, seemingly no coordinate
geometry is involved. However, the parabola is not constructible with
itraightedge and compass, excePt point by point, and so one must use the
equation to plot the curve accurately.
Descartes does zol obtain z by writing the equations in * andy of circle
and parabola and finding the coordinates of the point of intersection by
solving equations simultaneously. In other words, he is not solving equations
graphically in our sense. Rather he uses purely geometric constructions
(except for supposing that a parabola can be drawn), the knowledge of the
f;act that z satisfies an equation, and the geometric proPerties of the circle
and parabola (which can be more readily seen through their equations) '
Descartes does herejust what he did in Book I, except that he is now solving
geometric construction problems in which the unknown length satisfies a
third or higher-degree equation instead of a first or second degree equation'
His solution ofthe purely algebraic aspect ofthe problem and the subsequent
construction is practically the same one the Arabs gave, excePt that he was
able to use the equations of the conic sections to deduce facts about the
curves and to draw them.
Descartes not only wished to show how some solid problems could be
solved with the aid of algebra and the conic sections but was interested in
classifying problems so that one would know what they involved and how to
go aboul solving them. His classification is based on the degree of the
algebraic equation to which one is led when the construction problem is
formulated algebraically. If that degree is one or two, then the construction
can be performed with straight line and circle. If the degree is three or four,
COOR,DINATE GEOMETRY
the conic sections must be employed. He does affirm, incidentally, that all
cubic problerns can be reduced to trisecting the angle and doubling the cube
and that no cubic problems can be solved without the use of a curve more
complor than the circle. If the degree of the equation is higher than four,
curvcs more complicated than the conic sections may be required to perform
the corutruction.
Descartes also emphasized the degree of the equation of a curve as the
mcasure of its simplicity. One should use the simplest curve, t}lat is, the
lowcst degree possible, to solve a construction problem. The emphasis on
ttre degree of a curve became so strong that a complicated curve such as
thc folium of Descartes (Fig. 15.4), whose equation is x3 a yo - Saxg : Q,
was considered simpler thar g : st,
What is far more significant than Descartes's insight into construction
problems and their classification is the importance he assigned to algebra.
This key makes it possible to recognize the typical problems of geometry and
to bring together problems that in geometrical form would not appear to be
related at all. Algebra brings to geometry the most natural principles of
classification and the most natural hierarchy of method. Not only can ques-
tions of sotvability and geometrical consructibility be decided elegantly,
quickly, and fully from the parallel algebra, but without it they cannot be
decidcd at all. Thus, system and structure were transferred from geometry
to algebra.
Part of Book lI of La Giometric as well as La Diaptriqw Dex,czrte devoted
to optics, using coordinate geometry as an aid. He was very much concerned
with the design of lenses for the telescope, microscope, and other optical
instruments because he appreciated the importance of tlese instruments for
astronomy and biology. }Jis Dioptrique takes up the phenomenon of re-
fraction. Kepler and Alhazen before him had noted that the belief that the
angle of refraction is proportional to the angle of incidence, the propor-
tionality constant being dependent on the medium doing the refracting, was
DESCARTTS'S WORX, IN COORDINATE GEOMETRY 3r5
Figure 15.5
incorrect for large angles, but they did not discover the true law. Before 1626
willebrord snell discovered but did not publish the correct relationship,
sin i u1
il"r:['
where a, is the velocity of light in the first medium (Fig. l5'5) and u, the
velocity in the medium into which the light passes. Descartes gave this same
law in 1637 in the Dioptique' There is some question as to whether he dis-
covered it independently. His argument was wrong, and Fermat im-
mediately attacked both the law and the proof. A controversy arose between
them which lasted ten years. Fermat was not satisfied that the law was
correct, until he derived it from his Principle ofLeast Time (Chap' 24, sec' 3)'
In La Dioptriqtu, aftet describing the operation of the eye, Descartes
considers the problem of designing properly focusing lenses for telcscopes,
microscopes, and spectacles. It was well known even in antiquity that a
sphericailens will not cause parallel rays or rays diverging from a source, 'S
to focus on one point. Hence the question was oPen as to what shapc would
so focus the incoming rays' Kepler had suggested that some conic section
would serve. Descartes sought to design a lens that would focus the rays
perfectly.
'
He proceeded to solve the general problem of what surface should
separate iwo media such that light rays starting from one point in the first
into the second medium, and there
-.dir- would strike the surface, refract
converge to one point. He discovered that the curve generating the desired
surface of revolution is an oval, now known as the oval of Descartes; This
curve and its refracting properties are discussed h La Dinptriqw, and the
discussion is supplemented in Book ll of La Glomltrie'
The modein definition is that the curve is the locus of points M satisfring
the condition
FM ! nF'M :2a
where F and F' are fixed points, 2a is any real number larger than FF', and
z is any real number. lf n : I the curve becomes an ellipse' In the general
case, the equation ofthe oval is ofthe fourth degree it x andy, and the curve
316
Figure 15.6
consists of two closed, distinct portions (Fig. 15.6) without common point and
one inside the other. The inner curve is convex like an ellipse and the outer
one can be convex or may have points ofinflection, as in the figure.
As we can now see, Descartes's approach to coordinate geometry differs
profoundly from Fermat's. Descartes criticized and proposed to break with
the Greek tradition, whereas Fermat believed in continuity with Greek
thought and regarded his work in coordinate geometry only as a reformula-
tion of the work of Apollonius. The real discovery-the power of algebraic
methods-is Descartes's; and he realized he was supplanting the ancient
methods. Though the idea of equations for curves is clearer with Fermat than
with Descartes, Fermat's work is primarily a technical achievement that
completes the work of Apollonius and uses Vieta's idea of letters to represent
classes of numbers. Descartes's methodology is universally applicable and
potentially applies to the transcendental curves, too.
Despite these significant differences in approach to coordinate geometry
and in goals, Descartes and Fermat became embroiled in controversy as to
priority of discovery. Fermat's work was not published until 1679; however,
his discovery of the basic ideas of coordinate geometry in 1629 predates
Descartes's publication of La GiomCtrie in 1637. Descartes was by this time
fully aware of many of Fermat's discoveries, but he denied having learned
his ideas from Fermat, Descartes's ideas on coordinate geometry, according
to the Dutch mathematician Isaac Beeckman (1588-1637), went back to
1619; and furthermore, there is no question about the originality of many of
his basic ideas in coordinate geometry.
When Ia Giomitrit was published, Fermat criticized it because it omitted
ideas such as maxima and minima, tangents to curves, and the construction
of solid Ioci, which, he had decided, merited the attention of all geometers.
Descartes in turn said Fermat had done little, in fact no more than could be
easily arrived at without industry or previous knowledge, whereas he himself
had used a full knowledge of the nature of equations, which he had ex-
pounded in the third book of Za Giomitrie. Descartes referred sarcastically to
Fermat as aostre Conseiller De Maximis et Minimis and said Fermat was indebted
EXTENSIONS OF COORDINATE GEOMETRY 3I7
to him. Roberval, Pascal, and others sided with Fermat, and Mydorge and
Desargues sided with Descartes. Fermat's friends wrote bitter letters against
Descaites. Later the attitudes of the two men toward each other softened,
and in a work of 1660, Fermat, while calling attention to an error in Za
Giomitrie, declared that he admired that genius so much that even when he
made mistakes Descartes,s work was worth more than that of others who did
correct things. Descartes had not been so generous.
The emphasis placed by posterity on La Giomitrie was not what Descartes
had intended. while the salient idea for the future of mathematics was the
association of equation and curve, for Descartes this idea was just a means to
an end-the solution of geometric construction problems' Fermat's em-
phasis on the equations of loci is, from the modern standpoint, more to the
point. The geometric construction problems that Descartes stressed in Books
i and III have dwindled in importance, largely because construction is no
longer used, as it was by the Greeks, to establish existence'
One portion of Book III has also found a permanent place in mathe-
matics. Since Descartes solved geometric construction problems by first
formulating them algebraically, solving the algebraic equations' and then
constructing what the solutions called for, he gathered together work of his
own and of others on the theory of equations that might exPedite their
solution. Because algebraic equations continued to arise in hundreds of
diflerent contexts having nothing to do with geometrical construction
problems, this theory of equations has become a basic part of elementary
algebra.
the attention given to the construction problems. Another reason for the
slow spread of analytic geometry was Descartes's insistence on making his
presentation difficult to follow.
In addition many mathematicians objected to confounding algebra and
geometry, or arithmetic and geometry. This objection had been voiced even
in the sixteenth century, when algebra was on the rise. For example, Tar-
taglia insisted on the distinction between the Greek operations with geo-
metrical objects and operations with numbers. He reproached a translator
of Euclid for using interchangeably multiplhare and durere. The first belongs to
numbers, he says, and the second to magnitude. Vieta, too, considered the
sciences of number and of geometric magnitudes as parallel but distinct.
Even Newton, in his Arithmttica Uniaersalis, objected to confounding algebra
and geometry, though he contributed to coordinate geometry and used it in
the calculus. He says,8
Equations are expressions of arithmetical computation and properly have
no place in geometry except insofar as truly geometrical quantities (that
is, lines, surfaces, solids and proportions) are thereby shown equal, some
to others. Multiplications, divisions and computations of that kind have
been recently introduced into geometry, unadvisedly and against the first
principles of this science. . . . Therefore these two sciences ought not to be
confounded, and recent generations by confounding them have lost that
simplicity in which all geometrical elegance consists.
A reasonable interpretation of Newton's position is that he wanted to keep
algebra out of elementary geometry but did find it useful to treat the conics
and higher-degree curves.
Still another reason for the slowness with which coordinate geometry was
accepted was the objection to the lack of rigor in algebra. We have already
mentioned Barrow's unwillingness to accept irrational numbers as more than
symbols for continuous geometrical magnitudes (Chap. 13, sec. 2). Arithmetic
and algebra found their Iogical justification in geometry; hence algebra
could not replace geometry or exist as its equal. The philosopher Thomas
Hobbes (1588-1679), though only a minor figure in mathematics, neverthe-
less spoke for many mathematicians when he objected to the "whole herd of
them who apply their algebra to geometry." Hobbes said that these alge-
braists mistook the symbols for geometry and characterized John Wallis's
book on the conics as scurvy and as a "scab of symbols."
Despite the hindrances to appreciation of what Descartes and Fermat
had contributed, a number of men gradually took uP and expanded co-
ordinate geometry. The first task was to explain Descartes's idea. A Latin
translation of La Giomitrie by Frans van Schooten (1615-60), first published
in 1649 and republished several times, not only made the book available in
8. Aithmctica Uniaersalis, 1707, p, 282.
EXTENSIONS OF COORDINATE GEOMETR'Y 3I9
the language all scholars could read but contained a commentary which
e*panded iescartes's compact presentation. In the edition of 1659-61, van
Sclhooten actually grrr" ih" algebraic form of a transformation of co-
ordinates from one base line (r-axis) to another' He was so impressed with
the power of Descartes's method that he claimed the Greek geometers had
,r.J it to derive their results. Having the algebraic work, the Greeks,
according to van Schooten, saw how to obtain the results synthetically-he
showed how this could be done-and then published their synthetic methods,
which are less perspicuous than the algebraic, to an:;laze the world' Van
Schooten may harr" t""r, misled by the word " analysis," which to the Greeks
,n urrt urrulyring a problem, and the term "analytic geometry," which
specifically described Descartes's use of algebra as a method'
derived the equations
John Wallis, in De Scctionibus Conicis (1655), first
of th*e conics by translating Apollonius' geometric conditions into algebraic
form (much as we did in Ctrap. 4, sec. 12) in order to elucidate Apollonius'
results. He then defined the conics as curves corresponding to second degree
equations in * and y and proved that these curves- were indeed the conic
.e"tiors as known giomeEically. He was probably the first to use equations
to prove properties of the conics. His book helped immensely to spread the
idea of coofdinate geometry and to popularize treatment of the conics as
curves in the plane instead of as sections of a cone, though the latter aP-
proach persistJd. Moreover, Wallis emphasized the validiry of the algebraic
..urorri.rg whereas Descartes, at least in lis GtomCtric, really rested on the
g"o-"t.!, regarding algebra as just a tool' Wallis was also the first to con-
Iciourly irrtroirr". ,r.gulir" abscis.as and ordinates' Newton, who did this
later, may have gotten the idea from Wallis' We can contrast van Schooten's
remark on method with one by Wallis, who said that Archimedes and
nearly all the ancients so hid from posterity their method of discovery and
urruly.i, that the moderns found it easier to invent a new analysis than to scek
out the old.
Newton,sTheMcthodofFluxiansandltfinitc.Sarias,writtenabout16Tl
butfirstpublishedinanEnglishtranslationbyJohnColson(d.1760)under
the above title in I 736, coniains many uses of coordinate geometry, such as
sketching curves from equations. One of the original ideas it offers is the use
ofnew cJordinate systerns. The seventeenth- and even many ofthe eighteenth-
century men generally used one axis, with the y-values drawn at an oblique
or right anglJto that axis. Among the new coordinate systems introduced-by
N"*Io' is ihe location ofpoints by reference to a fixed point and a fixed line
through that point. The scheme is essentially our polar coordinate system'
The book many variations on the polar coordinate idea' Newton
"ort.irr
alsointroducedbipolarcoordinates.Inthisschemeapointislocatedby-its
distance from two fixed points (Fig. 15.7)' Because this work of Newton
did
not become known untii 1736, credit for the discovery of polar coordinates
320
Figure 15.7
PG, t, z)
o
Figure 15.9 Figure 15.10
intersecting at right angles (Fig. 15.9). The ends of these perpendiculars will
each describe a curve in the respective plane. These plane curves can then
be treated by the method already given' Earlier in Book II Descartes
observes that one equation in three unknowns for the determination of the
typical point C of a locus represents a plane, a sphere, or a more complex
surface. Clearly he appreciated thas his method could be extended to curves
and surfaces in three-dimensional space, but he did not himself go further
with the extension.
Fermat, in a letter of 1643, gave a brief sketch of his ideas on analytic
geometry of three dimensions' He speaks of cyclindrical surfaces, elliptic
paraboloids, hyperboloids of two sheets, and ellipsoids. He then says that, to
crown the introduction ofplane curves, one should study curves on surfaces'
"This theory is susceptible of being treated by a general method which if I
have leisure I will explain." In a work of half a page, Nouus Secundarum,tr he
says that an equation in three unknowns gives a surface.
La Hire, in his Nouueaux ilimens des sections coniques (1679), was a little
more specifi.c about three-dimensional coordinate geometry. To represent a
surface, he first represented a point P in space by the three coordinates
indicated in Figure 15.10 and actually wrote the equation of a surface'
However, the development of three-dimensional coordinate geometry is the
work of the eighteenth century and will be discussed later.
and even the design of lenses, which Descartes himself undertook, call for
quantitative knowledge. coordinate geometry made possible the expression
of shrpo and paths in allebraic form, from which quantitative knowledge
could be derived.
Thus algebra, which Descartes had thought wasjust a tool, an octension
of logic rather than part of mathematics proPer, became more vital than
g.o-1t y. In fact, coordinate geometry paved the way for a complete
ieversal of the roles of algebra and geometry. Whereas from Greek times until
about 1600 geometry dominated mathematics and algebra was subordinate,
after 1600 algeb." became the basic mathematical subject; in this trans-
position of roles the calculus was to be the decisive factor. The ascendancy of
algeb.a aggravated the difficulty to which we have already called attention,
,ra=m"ly, ihut thet" was no logical foundation for arithmetic and algebra;
but nothing was done about it until the late nineteenth century'
The fact that algebra was built up on an empirical basis has led to
confusion in mathemalical terminology. The subject created by Fermat and
Descartes is usually referred to as analytic geometry' The word " analytic "
is inappropriate; coordinate geometry or algebraic geometry (which now has
u.rothet rneaning) would be more suitable. The word "analysis" had been
used since Plato's time to mean the process of analyzing by working backward
from what is to be proved until one arrives at something known. In this sense
it was opposed to " synthesis," which describes the deductive presentation'
abo,.rt 1-SgO Vieta rejected the word " algebra" as having no meaning in the
European language and proposed the term "analysis" (Chap' 13, sec' 8); the
suggestio., *u. ,rot adopted. However, for him and for Descartes, the word
" a'r'alysis " was still somewhat appropriate to describe the application of
algebra to geometry because the algebra served to analyze the geometric
construction problem. One assumed the desired geometric length was
known, founJ an equation that this length satisfied, manipulated the
equation,' and then saw how to construct the required length' Thus Jacques
O'run^ (1640-1717) said in his Dictionary (1690) that moderns did
their analysis by algebra. In the famous eighteenth-centwy huyclopldit,
d'Alemberi used " algebra " and "analysis" as synonyms' Gradually,
"analysis" came to mean the algebraic method, though the new coordinate
geometry, up to about the end of the eighteenth century, was most often
iormally described as the application of algebra to geometry' By the end of
the ceniury the term " analytic geometry" became standard and was fre-
quently used in titles of books.
-
However, as algebra became the dominant subject, mathematicians
came to regard it as having a much greater function than the analysis of a
problem in the Greek sense. In the eighteenth century the view that algebra
as applied to geometry was more than a tool-that algebra itself was a basic
meti;d of introducing and studying curves and surlirces (the supposed view
324 COORDINATE GEOMETRY
Bi.bliography
Boyer, Carl B,: History of Analytic Geometry, Scripta lVlathematica, 1956.
Cantor, Moritz: Vorlesungen ilber Geschichte der Mathematik,2nd ed., B. G. Teubner,
1900, Johnson Reprint Corp., 1965, Vol. 2, pp. 806-76.
Chasles, ll;[.ichel: Apngu historique sur I'origine et le diteloppement des mithodes cn gilomitrie,
3rd ed., Gauthier-Villars et Fils, 1889, Chaps. 2-3 and relevant notes.
Coolidge, Julian L. : A History of Geometrical Methods, Dover (reprint), 1963, pp.
I l7-31.
Descartes, Ren6: Za G6omitrie (French and English), Dover (reprint), 1954.
(Eulres, 12 vols., Cerll I897-1913.
Fermat, Pierre de:. (Eutres, 4 vols. and Supplement, Gauthier-Villars, 189l-1912;
Supplement, 1922.
Montucla, J. F.: Histoire des mathimatiques, Albert Blanchard (reprint), 1960, Vol.
2, pp. 102-77.
Scott, J. F.z The Scientific Work of Reni Descartes, Taylor and Francis, 1952.
Smith, David E,: A Source Book in Mathemallcs, Dover (reprint), 1959, Vol. 2, pp.
389-402.
Struik, D. J.: A Source Book in Mathematics, 1200-1800, Harvard IJniversity Press,
1969, pp. 87-93, 143-57.
Wallis, John: Opera,3 vols., (1693-99), Georg Olms (reprint), 1968.
Vrooman, Jack R,: Reni Descailes: A Biography, G. P. Putnam's Sons, 1970.
Vuillemin, Jtles: Mathimatiques et mitaphysique chez Descartes, Presses Universitaires
de France, 1960.
I6
The Mathem atization of Science
So that we may say the door is now opened, for the first
time, to a new method fraught with numerous and wonderful
results which in future years will command the attention of
other minds. GALTLEo GALTLET
l. Introduction
By 1600 the European scientists were unquestionably impressed with the
importance of malhematics for the study of nature' The strongest evidence
of ihis conviction was the willingness of Copernicus and Kepler to overturn
the accepted laws of astronomy and mechanics and religious doctrines for
the sake of a theory which in their time had only mathematical advantages.
However, the astonishing successes of modern science and the cnormous
impetus io creative work ihat mathematics derived from that source probably
*ould not have come about if science had continued in the footsteps of the
past. But in the seventeenth century two men, Descartes and Galileo'
ievolutionized the very nature of scientific activity' They selected the con-
cepts science should ernploy, redefined the goals of scientific activity, and
aliered the methodology of science. Their reformulation not only imparted
unexpected arrd urrprecldented Power to science but bound it indissolubly
to mathematics. In fact, their plan practically reduced theoretical science to
mathematics. To understand the spirit that animated mathematics from
the seventeenth through the nineteenth centuries, we must first examine
the ideas of Descartes and Galileo.
325
326 THE MATTTEMATTZATToN or soIENcE
write up his years of thought and work on the phenomena ofmotion and on
the strength of materials. The manuscript, entitled Discorsi e dimostrazioni
matemaliche intorno d due nuoae scienze (Discortrses and Mathematical Demon-
strations Concerning Two New Sciences, also referred to as Dialogues
Concerning Two New Sciences), was secretly transported to Holland and
published there in 1638. This is the classic in which Galileo presented his new
scientific method. He defended his actions with the words that he had never
"declined in piety and reverence for the Church and my own conscience.,,
Galileo was an extraordinary man in many fields. He was a keen
astronomical observer. He is often called the father of modern invention;
though he did not invent the telescope or " perplexive glasses,,, as Ben
Jonson called them, he was immediately able to construct one when he
heard of the idea. He was an independent inventor of the microscope, and
he designed the first pendulum clock. He also designed and made a compass
with scales that automatically yielded the results of numerical computations
so the user could read the scales and avoid having to do the calculations.
This device was so much in demand that he produced many for sale.
Galileo was the first important modern student of sound. He suggested
a wave theory of sound and began work on pitch, harmonics, and the
vibrations of strings. This work was continued by Mersenne and Newton
and became a major inspiration for mathematical work in the eighteenth
century.
Galileo's major writings, though concerned with scientific subjects, are
still regarded as literary masterpieces. His Sidereus .ly'zzcizs (Sidereal Messen-
ger) of 1610, in which he announced his astronomical observations and
declared himself in support of Copernican theory, was an immediate success,
and he was elected to the prestigious Academy of the LynxJike in Rome.
His two greatest classics, the Dialogue on the Gredt World Sgstems and Dialoguzs
Corceming Two New Sciences, are clear, direct, witty, yet profound. In both,
Galileo has one character present the current views, against which another
argues cleverly and tenaciously to show the fallacies and weaknesses of these
views and the strengths of the new ones.
In his philosophy of science Galileo broke sharply from the speculative
and mystical in favor of a mechanical and mathematical view of nature. He
also believed that scientific problems should not become enmeshed in and
beclouded by theological arguments. Indeed, one of his achievements in
science, though somewhat apart from the method we are about to examine,
is that he recognized clearly the domain of science and severed it sharply
from religious doctrines.,
Galileo, like Descartes, was certain that nature is mathematically de-
signed. His statement of 1610 is famous:
Philosophy [nature] is written in that great book which ever lies befdrc
our eyes-I mean the irniverse-but we cannot understand it if we do
GALILEO'S APPROACH I'O SCIENCE 329
not first learn the language and grasp the symbols in which it is written'
The book is written in the mathematical language, and the symbols are
triangles, circles and other geometrical figures, without whose help it is
impossible to comprehend a single word of it; without which one wanders
in vain through a dark labYrinth.l
Nature is simple and orderly and its behavior is regular and necessary' It
acts in accordance with perfect and immutable mathematical laws. Divine
reason is the source of the rational in naturel God put into the world that
rigorous mathematical necessity that men reach only laboriously' Mathe-
matical knowledge is therefore not only absolute truth, but as sacrosanct as
any line of Scripture. In fact it is superior, for there is much disagreement
about the Scripiures, but there can be none about mathematical truths'
Another Joctrine, the atomism of the Greek Democritus, is clearer in
Galileo than in Descartes. Atomism presupposed empty space (which Des-
cartes did not accept) and individual, indestructible atoms' Change con-
sisted in the combination and separation of atoms. All qualitative varieties
in bodies were due to quantitative variety in number, size, shape, and
spatial arrangement of the atoms. The atom's chief properties were impene-
trability and indestructibility; these properties served to explain chemical
and physical phenomena. Galileo's espousal of atomism placed it in the
forefront of scientific doctrines.
Atomism led Galileo to the doctrine of primary and secondary qualities'
He says, "If ears, tongues, and noses were removed, I am of the opinion
that shape, quantity [size] and motion would remain, but there would be
an end of smells, tastes, and sounds, which abstracted from the living
creature, I take to be mere words." Thus in one swoop Galileo, like Des-
cartes, stripped away a thousand phenomena and qualities to concentrate
on matter and motion, properties that are mathematically describable' It is
perhaps not too surprising that in the century in which problems of motion
,r.." th" most prominent and serious, scientists should find motion to be a
fundamental physical phenomenon.
The concentration on matter and motion was only the first step in
Galileo's new approach to nature. His next thought, also voiced by Des-
cartes, was that any branch of science should be patterned on the model of
mathematics. This implies two essential steps. Mathematics starts udth
axioms-clear, self-evident truths-and from these proceeds by deductive
reasoning to establish new truths. Any branch of science, then, should start
with axioms or principles and then proceed deductively' Moreover, one
should extract from the axioms as many consequences as possible' This
thought, of course, goes back to Aristotle, who also aimed at deductive
structure in science with the mathematical model in mind'
However, Galileo departed radically from the Greeks, the medieval
l. OPcre,4, l7l.
33O TrrE MATHEMATIZATION OF SCIENCE
scientists, and even Descartes in his method of obtaining first principles. The
pre-Galileans and Descartes had believed that the mind supplied the basic
principles; it had but to think about any class of phenomena and it would
immediately recognize fundamental truths. This power of the mind was
clearly evidenced in mathematics. Axioms such as "equals added to equals
give equals " and " two points determine a line" suggested themselves im-
mediately in thinking about number or geometrical figures, and were in-
dubitable truths. So too had the Greeks found some physical principles
equally appealing. That all objects in the universe should have a natural
place was no more than fitting. The state of rest seemed clearly more natural
than the state of motion. It seemed indubitable, too, that force must be
applied to put and keep bodies in motion. To believe that the mind supplies
fundamental principles did not deny that observations might play a role in
obtaining these principles. But the observations merely evoked the correct
principlcs, just as the sight of a familiar face might call to mind facts about
that person.
The Greek and medieval scientists were so convinced that there were
a priori fundamental principles that when occasional observations did not
fit they invented special explanations to preserve the principles but still
account for the anomalies. These men, as Galileo put it, first decided how
the world should function and then fitted what they saw into their precon-
ceived principles.
Galileo decided that in physics, as opposed to mathematics, first
principlcs must come from experience and experimentation. The way to
obtain correct and basic principles is to pay attention to what nature says
rathcr than what the mind prefers. Nature, he argued, did not first make
men's brains and then arrange the world to be acceptable to human in-
tellects. To the mediwal thinkers who kept repeating Aristotle and debating
what he meant, Galileo addressed the criticism that knowledge comes from
observation and not from books, and that it was useless to debate about
Aristotle. He says, "When we have the decrees of nature, authority goes for
nothing.. . ." Of course some Renaissance thinkers and Galileo's contem-
porary Francis Bacon had also arrived at the conclusion that experimentation
wrx necessary; in this particular aspect of his new method, Galileo was not
ahead of all others. Yet the modernist Descartes did not grant the wisdom
of Galileo's reliance upon experimentation. The facts of the senses, Des-
cartes said, can only lead to delusion, but reason penetrates such delusions.
From the innate general principles supplied by the mind, we can deduce
particular phenomena of nature and understand them. Galileo did appre-
ciate that one may glean an incorrect principle from experimentation and
that as a consequence the deductions from it could be incorrect. Hence he
proposed the use of experiments to check the conclusions of his reasonings
as well as to acquire basic principles.
GALII.EO,S APPROACH TO SoIENCE 33I
Galileowasactuallyatransitionalfigureasfarasexperimentationis
concerned. He, and Isaac Newton fifty years later, believed that a
few key
or critical experiments would yield correct fundamental principles' More-
over, many of Gulil.o" so-called experiments were really thought-e:<peri-
ments; that is, he relied upon common experience to imagine what would
h.pp"r, if an experiment were performed' He then drew a conclusion as
as if he had actually Performed the ocperiment' When in the
"orriid"rrtly
i;rtogu, on the Great World systrms he describes the motion of a ball dropped
frorn ih" mast of a moving ship, he is asked by Simplicio, one of the charac-
ters, whether he had male an e:rperiment' Galileo replies, "No, and
I do
not need it, as without any experience I can confirm that it is so, because it
cannot be otherwise." He says in fact that he experimented rarely, and then
primarily to refute tlose who did not follow the mathematics' Though
'p"rformed
i\"*tor, some famous and ingenioqs experiments, he too says
that he used experiments to make his rcs,,lts physically intelligible and to
convince the common PeoPle.
The truth of the matter is that Galileo had some preconceptions about
nature, which made him confident that a few experiments would suffice'
He believed, for e>rample, that nature was simple' Hence when he considered
freely falling bodies, which fall with increasing velocity, he supposed that
the increase" in velocity is the same for each second of fall' This was the
simplest "truth." He believed also that nature is mathematicaUy designed'
h.rr". any mathematical law that seemed to fit even on the basis of
".ri
rather limited experimentation appeared to him to be correct'
For Galileo, as well as for Huygens and Newton, the deductive' mathe-
matical part of ihe scientific enterprise played a greater Part than the ex-
perimenial. Galileo was no less proud of the abundance of theorems that
ilo* f.o- a single principle than of the discovery of the principle itself. The
men who fashiot .a *od"t' science-Descartes, Galileo, Huygens' and
Newton (we can also include Copernicus and Kepler)-approached the study
of nature as mathematicians, in their general method and in their concrete
to
investigations. They were primarily speculative thinkers who expected
appreh"end broad, deep (but also simple), clear, and immutable mathe-
*uti"ut principles eithir ihrough intuition or through crucial observations
urrd and then to deduce new laws from these fundamental
"*p..i*.ots,
truths, entirely in the manner in which mathematics proper had constructed
its g"o-.try. The bulk of the activity was to be the deductive portion;
whole systems of thought were to be so derived'
What the great ihinkers of the seventeenth century envisaged as the
prop.. p.o""drrie for science did indeed prove to be the profitable course'
th" .utiorr.l search for laws of nature produced, by Newton's time' ex-
tremely valuable results on the basis of the slimmest observational and
ex-
perimental knowledge' The great scientific advances of the sixteenth and
332 TrrE MATHEMATTZATTON OF SCTENCE
but in a special way. Unlike the Aristotelians and the late medieval scientists,
who had fastened upon qualities they regarded as fundamental and studied
the acquisition and loss of qualities or debated the meaning of the qualities,
Galileo proposed to seek quantitatiae axioms. This change is most important ;
we shall see the full significance of it later, but an elementary example may
be useful now. The Aristotelians said that a ball falls because it has weight,
and that it falls to the earth because every object seeks its natural place and
the natural place of heavy bodies is the center ofthe earth. These principles
are qualitative. Even Kepler's first law of motion, that the path of each
planet is an ellipse, is a qualitative statement. By contrast, let us consider
the statement that the speed (in feet per second) with which a ball falls is
32 times the number of seconds it has been falling, or in symbols, tt : 32t.
This is a quantitative statement about how a ball falls. Galileo intended to
seek such quantitative statements as his axioms, and he expected to deduce
new ones by mathematical means. These deductions would also give quanti-
tative knowledge. Moreover, as we have seen, mathematics was to be his
essential medium,
The decision to seek quantitative knowledge expressed in formulas
carried with it another radical decision, though first contact with it hardly
reveals its full significance. The Aristotelians believed that one of the tasks
ofscience was to explain why things happened; explanation meant unearth-
ing the causes of a phenomenon. The statement that a body falls because it
has weight gives the effective cause of the fall and the statement that it
seeks its natural place gives the final cause. But the quantitative statement
a : 32t, for whatever it may be worth, gives no explanation of why a ball
falls; it tells only how the speed changes with the time. In other words,
formulas do not explain ; they describe. The knowledge of nature Galileo
sought was descriptive. He says in Tuo Neu Sciences, "The cause of the ac-
celeration of the motion of falling bodies is not a necessary part of the in-
vestigation." More generally, he points out that he will investigate and
demonstrate some of the properties of motion without regard to what the
causes might be. Positive scientific inquiries were to be separated from
questions of ultimate causation, and speculation as to physical causes was to
be abandoned.
First reactions to this principle of Galileo are likely to be negative'
Description of phenomena in terms of formulas hardly seems to be more
than a first step. It would seem that the true function of science had really
been grasped by the Aristotelians, namely, to explain why phenomena
happened. Even Descartes protested Galileo's decision to seek descriptive
formulas. He said, " Everything that Galileo says about bodies falling in
empty space is built without foundation: he ought first to have determined
the nature of weight." Further, said Descartes, Galileo should reflect on
334 THE MATHEMATTZATTON OF SCTENCE
ultimate reasons. But we shall see clearly after a few chapters that Galileo's
decision to aim for description was the deepest and most fruitful idea that
anyone has had about scientific methodology.
Whereas the Aristotelians had talked in terms of qualities such as
fluidity, rigidity, essences, natural places, natural and violent motion, and
potentiality, Galileo chose an entirely new set of concepts, which, moreover,
were measurable, so that their measures could be related by formulas.
Some of them are : distance, time, speed, acceleration, force, mass, and
weight. These concepts are too familiar to surprise us. But in Galileo's time
they were radical choices, at least as fundamental conceptsl and these are
the ones that proved most instrumental in the task of understanding and
mastering nature.
We have described the essential features of Galileo's program. Some of
the ideas in it had been espoused by others; some were entirely original with
him. But what establishes Galileo's greatness is that he saw so clearly what
was u,rong or deficient in the current scientific eflorts, shed completely the
older ways, and formulated the new procedures so clearly. Moreover, in
applying them to problems of motion he not only exemplified the method
but succeeded in obtaining brilliant results-in other words, he showed that
it worked. The unity of his work, the clarity of his thoughts and expressions,
and the force ofhis argumentation influenced almost all ofhis contemporaries
and successors. More than any other man, Galileo is the founder of the
methodology of modern science, He was fully conscious of what he had
accomplished (see the chapter legend); so were others. The philosopher
Hobbes said of Galileo, " He has been tlte first to open to us the door to the
whole realm of physics."
We cannot pursue the history of the metJrodology of science. However,
since mathematics became so important in this methodology and profited
so much from its adoption, we should note how completely Galileo's pro-
gram was accepted by giants such as Newton. He asserts that experiments
are needed to furnish basic laws. Newton is also clear that the function of
science, after having obtained some basic principles, is to deduce new facts
from these principles. In the preface to his Pfiuipia, he says:
Since the ancients (as we are told by Pappus) esteemed the science of
mechanics of greatest importance in the investigation of natural things,
and the moderns, rejecting substantial forms and occult qualities, have
endeavored to subject the phenomena of nature to the laws of mathe-
matics, I have in this treatise cultivated mathematics as far as it relates
to philosophy [science] . . . and therefore I offer this work as the mathe-
matical principles of philosophy, for the whole burden in philosophy
sccrns to consist in this-frorn the phcnornena of motions to investigate
the forces of nature, and then from these forces to demonstrate the other
phenomena.. , .
THE FUNCTION CONCEPT 335
on, Newton used the term "fluent" to represent any relationship between
variables. In a manuscript of 1673 Leibniz used the word "function" to
mean any quantity varying from point to point of a curve-for example,
the length of the tangent, the normal, the subtangent, and the ordinate.
The curve itself was said to be given by an equation. Leibniz also introduced
the words "constant," "variable," and "parameter," the latter used in
connection with a family of curves.s In working with functionsJohn Bernoulli
spoke from 1697 on of a quantity formed, in any manner whatever, of
variables and of constants;a by " any manner" he meant to cover algebraic
and transcendental expressions. He adopted Leibniz's phrase "function of
x" for this quantity in 1698. In his Historia (1714), Leibniz used the word
"function" to mean quantities that depend on a variable.
As to notation, John Bernoulli wrote X or f for a general function of
x, though in 17lB he changed to /r. Leibniz approved of this, but proposed
also xr and x2 for functions of r, the superscript to be used when several
functions were involved. The notation f(x) was introduced by Euler in
L734.6 The function concept immediately became central in the work on
the calculus. We shall see later how the concept was extended.
Bi.bliograplry
Bell, A. E.: Christian Huygens and the Deuelopmznt of Science in the Seuenteenth Century,
Edward Arnold, 1947.
Burtt, A. E.: Thc Metaphysical Foundations of Modern Phgsital Science,2nd ed., Rout-
ledge and Kegan Paul, 1932, Chaps. l-7.
Butterfield, Herbert: The Origins of Modtrn Sci.ence, Macmillan, 1951, Chaps. 4-7.
Cohen, I. Bernard: The Birth of a New Prtysics, Doubleday, 1960.
Coolidge, J. L.: The Matlumatics of Great Amateurs, Dover (reprint), 1963, pp.
tt9-27.
Crombie, A. C.: Augusti.ne to Galileo, Falcon Press, 1952, Chap. 6'
Dampier-Whetham, W. C. D.: A History of Science and lts Relations with PhilosoQhy
and Religi.on, Cambridge University Press, 1929, Chap. 3.
Dijksterhuis, E, J.: The Mechani.zation of the World Pi.cture, Oxford University Press,
1961.
Drabkin, I. E., and Stillman Drake: Gali.leo Galilei: On Motion and, Meehanics,
University of Wisconsin Press, 1960.
Drake, Stillman: Discoaeries and Opinions of Gali.leo, Doubleday, 1957.
Galilei, Galileo: Opere,20 vols., 1890-1909, reprinted by G. Barbera, 1964-66.
Dialogues Concerning Two New Sciences, Dovet (reprint), 1952.
Hall, A. R.: The Scientfic Reaoluti.on, Longmans Green, 1954, Chaps. l-8.
From Galileo to Newton, Collins, 1963, Chaps. 1-5.
3.
-: Math. Schriften, 5,266-69.
4. Mim de l'Acad des Sri., Paris, 1718, 100 ff. = Olera,2,235-65, p. 241 in particular.
5. Comm. Acad. Sci. Petro!.,7, 173+135, 184-200, pub. 1740 = Ofuera, (l),22,57-75.
34t
Huygens, C.: (Euares comphtes,22 vols., M. Nyhofl 1888-1950.
Newton, I.: Mathematical hinciphs of Natural Philosoplry, University of California
Press, 1946.
Randall, John H., jr,z Making of tlu Modnn Mind, rev. ed., Houghton Miffiin,
1940, Chap. 10.
Scott, J. E.: The Scientific Work of Reni Descartes' Taylor and Francis, 1952, Chaps.
l0-12.
Smith, Preservedz A Hi.story of Modzrn Culture, Henry Holt, 1930, Vol. I' Chaps. 3,
6, and 7.
Strong, Edward Yl.t Procedures and Metaphysi.cs, University of California Press, 1936,
Chaps. 5-8.
Whitehead, Alfred North: Science and. the Modern World, Cambridge University
Press, 1926, Chap. 3.
Wolt Abraham: A History of Science, Technology and Philosophg in tlu 16th and 17th
Ccnturies,2nd ed., George Allen and lJnwin, 1950, Chap' 3.
r7
The Creation of the Calculus
Who, by a vigor of mind almost divine, the motions and
figures of the planets, the paths of comets, and the tides of
thc seas first demonstrated. xrwrox's EPrrAPr{
342
TT{E MOTTVATION TOR TIIE CALCULUS 343
Figure 17,I
must knou/ the angle at which the ray strikes the lens in order to apply the
law ofrefraction. The significant angle is that between the ray and the normal
to the curve (Fig. 17.1), the normal being the perpendicular to the tangent.
Hence the problem was to find either the normal or the tangent. Another
scientific problem involving the tangent to a curve arose in the study of
motion. The direction of motion of a moving body at any point of its path is
the direction of the tangent to the path.
Actually, even the very meaning of " tangent " was open. For the conic
sections the definition of a tangent as a line touching a curve at only one
point and lying on one side of the curve sufficed; this definition was used
by the Greeks. But it was inadequate for the more complicated curves already
in use in the seventeenth century.
The third problem was that of finding the maximum or minimum value
of a function. When a cannonball is shot from a cannon, the distance it will
travel horizontally-the range-depends on the angle at which the cannon
is inclined to the ground. One " practical " problem was to find the angle
that would maximize the range. Early in the seventeenth century, Galileo
determined that (in a vacuum) the maximum range is obtained for an angle
offire of45'; he also obtained the maximum heights reached by projectiles
fired at various angles to the ground. The study of the motion of the planets
also involved maxima and minima problems, such as finding the greatest
and least distances of a planet from the sun,
The fourth problem was finding the lengths of curves, for example, the
distance covered by a planet in a given period of time; the areas bounded by
curves; volumes bounded by surfaces; centers of gravity of bodies ; and the
gravitational attraction that an extendtd body, a planet for example, exerts
on another body. The Greeks had used the method ofexhaustion to find some
areas and volumes. Despite the fact that they used it for relatively simple
areas and volumes, they had to apply much ingenuity, because the method
lacked generality. Nor did they often come up with numerical answers'
Interest in finding lengths, areas, volumes, and centers ofgravity was revived
when the work of Archimedes became known in Europe. The method of
exhaustion was first modified gradually, and then radically by the invention
of the calculus.
344 THE CREATION OF THE CALCULUS
called the subtangent. Fermat's plan is to find the length of 7Q, from which
one knows the position of Tand can then draw TP.
Figure 17.2
tu: fi-aE,J@)
n1 _f@)'
For the;t(x) Fermat treated, it was immediately possible to divide numerator
and denominator of the above fraction by E. He then set E : 0 (he says,
remove the E term) and so obtained ZQ.
Fermat applied his method of tangents to many difficult problems'
The method has ttre Jorm of the now-standard method of the differential
calculus, though it begs entirely the difficult theory of limits.
To Descartes the problem of finding a tangent to a curve was important
because it enables one to obtain properties of curves-for example, the angle
of intersection of two curves. FIe says, "This is the most useful, and the most
general problem, not only that I know, but even that I have any desire to
know in geometry." He gave his method in the second book of Zo Giomitrie'
It was purely algebraic and did not involve any concePt of limit, whereas
Fermat's did, if rigorously formulated. However, Descartes's method was
useful only for curves whose equations were of the form y : f(x), where/('r)
was a simple polynomial. Though Fermat's method was general, Descartes
346 THE CREATION OF THE CALCULUS
thought his own method was better; he criticized Fermat's, which admittedly
was not clear as presented then, and tried to interpret it in terms of his own
ideas. Fermat in turn claimed his method was superior and saw advantages
in his use of the little increments E.
Isaac Barrow (1630-77) also gave a method of finding tangents to
curves. Barrow was a professor of mathematics at Cambridge University.
Well versed in Greek and Arabic, he was able to translate some of Euclid's
works and to improve a number of other translations of the writings of Euclid,
Apollonius, Archimedes, and Theodosius. His chief work, the Lectiones
Geometritac (1669), is one of the great contributions to the calculus. In it he
used geometrical methods, " freed," as he put it, " from the loathsome burdens
of calculation." In 1669 Barrow resigned his professorship in favor of Newton
and turned to theological studies.
Barrow's geometrical method is quite involved and makes use of
auxiliary curves. However, one feature is worth noting because it illustrates
the thinking of the time; it is the use of what is called the differential, or
characteristic, triangle. He starts with the triangle P,RQ (Fig. 17.4), which
results from the increment Pft, and uses the fact that this triangle is similar
to triangle PMN to assert that the slope QRIPR of the tangent is equal to
PMIMN. However, Barrow says, when the arc PP' is sufficiently small we
may safely identi$ it with the segment PQ of the tangent at P. The triangle
PRP' (Fig. 17.5), in which PP'is regarded both as an arc of the curve and as
part of the tangent, is the characteristic triangle. It had been used much
earlier by Pascal, in connection with finding areas, and by others before him.
In Lecture I0 of the Lectiones, Barrow does resort to calculation to find
the tangent to a curve. Here the method is essentially the same as Fermat's.
He uses the equation of the curve, say yz : px, arrd replaces x by x ! e
and,ybyy*a.Then
92+2ag1a2:px*pe.
EARLY SEVENTEENTH-CENTURY WORK ON THE CALCULUS 347
ffiI :p.
PM
2u
By subtracting common terms from the two sides and dividing by E, he gets
B:24+E.
He then sets E : 0 (he says, discard the E term) and gets B : 2A' Thus
the rectangle is a square.
The method, Fermat says, is quite general; he describes it thus: If ,4
is the independent variable, and, rf A is increased to A + E, then when E
becomes indefinitely small and when the function is passing through a
maximum or minimum, the two values of the function will be equal. These
348 THE CREATTON OF THE CALCULUS
two values are equated; the equation is divided by.E; and .O is now made to
vanish, so that from the resulting equation the value of A that makes the
function a maximum or minimum can be determined, The method is
essentially the one he used to find the tangent to a curve. However, the basic
fact there is a similarity of two uiangles; here it is the equality of two
function values. Fermat did not see the need tojustify introducing a non-zero
E and then, after dividing by .8, setting E : 0.2
The seventeenth-century work on finding areas, volumes, centers of
gravity, and lengths of curves begins with Kepler, who is said to have been
attracted to the volume problem because he noted the inaccuracy of methods
used by wine dealers to find the volumes of kegs. This work (in Stereometria
Doliorum) is crude by modern standards. For example, the area of a circle is
to him the area of an infinite number of triangles, each with a vertex at the
center and a base on the circumference. Then from the formula for the area
of a regular inscribed polygon, 1/2 the perimeter times the apothem, he
obtained the area of the circle. In an analogous manner he regarded the
volume of a sphere as the sum of the volumes of small cones with vertices at
the center of the sphere and bases on its surface. He then proceeded to show
that the volume of the sphere is l/3 the radius times the surface. The cone
he regarded as a sum of very thin circular discs and was able thereby to
compute its volume. Stimulated by Archimedes' Spheroids and Conoids, he
generated new figures by rotation of areas and calculated the volumes. Thus
he rotated the segment of a circle cut out by a chord around the chord and
found the volume.
The identification of curvilinear areas and volumes with the sum of an
infinite number of infinitesimal elements of the same dimension is the essence
of Kepler's method. That the circle could be regarded as the sum of an
infinite number of triangles was in his mind justified by the principle of
continuity (Chap. 14, sec. 5). He saw no difference in kind between the two
figures. For the same reason a line and an infinitesimal area were really the
samel and he did, in some problems, regard an area as a sum of lines.
ln Tu:o Neu.t Sciences Galileo conceives of areas in a manner similar to
Kepler's; in treating the problem of uniformly accelerated motion, he gives
an argument to show that the area under the time-velocity curve is the
distance. Suppose an object moves with varying velocity a : 32t, represented
by the straight line in Figure 17.6; then the distance covered in time Ol is
the area OAB. Galileo arrived at this conclusion by regarding A'B' , say, as
a typical velocity at some instant and also as the infinitesimal distance
covered (as it would be if multiplied by a very small element of time), then
arguing that the area OAB, which is made up of lines l'8', must therefore be
2. For the equations that precede his setting E : 0, Fermat used the term adacqualitas,
which Carl B. Boyer in Thc Concepts of the Calculus, p. 156, has aPtly translated as " pseudo-
equality."
EARLY SEVENTEENTH.CENTURY WORK ON THE CALCULUS 349
Figure 17.7
the total distance. Since lB is 32I and OAis t, the area of OAB is 16'2' The
reasoning is of course unclear. It was supported in Galileo's mind by
philosoptical considerations that amount to regarding the area OAB as rnade
r-,p of un infinite number of indivisible units such as A'B'' He spent
much
tiie on the problem of the structure of continuous magnitudes such as line
segments and areas but did not resolve it.
Bonaventura Cavalieri (1598-1647), a pupil of Galileo and professor in
a lyceum in Bologna, was influenced by Kepler and Galileo and urged by
the latter to look into problems of the calculus' Cavalieri developed the
thoughts of Galileo and others on indivisibles into a geometrical method and
p.,blistred a work on the subject, Geometria Indiuisibitibus Continuorum Notta
quadam Ratione Promota (Geometty Advanced by a thus far Unknown
Method'
indivisibles of Continua, 1635). He regards an area as made up of an in-
definite number of equidistant parallel line segments and a volume as com-
posed of an indefinite-number oi parallel plane areas; these elements he calls
ih" i.rdi.riribt.s of area and volume, respectively' Cavalieri recognizes that
the number of indivisibles making up an atea or volume must be indefinitely
large but do., ,roi try to elaborate on this' Roughly speaking' the indivisi-
biltists held, as Cavalieri put it in his Exercitationes Geometricae Sex (1647),
that a line is made up of points as a string is of beads; a plane is made up of
lines as a cloth is of threads; and a solid is made up of plane areas as a book
is made up of pages. However, they allowed for an infinite number of the
constituent elements.
Cavalieri's method or principle is illustrated by the following proposition'
which ofcourse can be proved in other ways' To show that the parallelogram
ABCD (Fig. 17.7) has twice the area of either triangle ABD or B9?'h:
..gr"d ihit when GD : BE, then GH : FE' Hence triangles ABD ar:.d
BdD are made up of an equal number of equal lines, such as GH and EF'
and therefore must have equal areas.
The same principle is incorporated in the proposition now taught in
solid geometry books ard known as Cavalieri's Theorem' The
principle says
that iitwo solids have equal altitudes and if sections made by planes parallel
350
Figurc 17.8
to the bases and at equal distances from them always have a given ratio, the
volumes of the two solids have this given ratio to each other. Using essentially
this principle, Cavalieri proved that the volume ofa cone is l/3 that ofthe
circumscribed cylinder. Likewise he treated the area under two curves, say
y : l@) and y : g(.r) in our notation, and over the same range of .r-values;
considering the areas as the sums ofordinates, if the ordinates ofone are in a
constant ratio to those of the other, then, says Cavalieri, the areas are in the
same ratio. He showed by his methods in Centuria di aarii problemi (1639) that,
in our notation,
fd -n+ 1,
Jo*"tu:;71
for positive integral values of z up to 9. However, his method was entirely
geometrical. He was successful in obtaining correct results because he
applied his principle to calculate ratios of areas and volumes where the
ratio of the indivisibles making up the respective areas and volumes was
constant.
Cavalieri's indivisibles were criticized by contemporaries, and Cavalieri
attempted to answer them; but he had no rigorous justification. At times he
claimed his method was just a pragmatic device to avoid the method of
exhaustion. Despite criticism of the method, it was intensively employed by
many mathematicians. Others, such as Fermat, Pascal, and Roberval, used
the method and even the language, sum of ordinates, but thought of area
as a sum of infinitely small rectangles rather than as a sum of lines.
In 1634 Roberval, who says he studied the " divine Archimedes," used
essentially the method of indivisibles to find the area under one arch of the
cycloid, a problem Mersenne called to his attention in 1629. Roberval is
sometimes credited with independent discovery of the method ofindivisibles,
but actually he believed in the infinite divisibility of lines, surfaces, and
volumes, so that there are no ultimate parts. He called his method the
" method of infinities," though he used as the title of his work Traiti du
indiaisibles .
EARLY SEVEI{TEENTH-CENTURY WORK ON TIIE CALCULUS 35r
(rd)'
o
Figure 17.9
or
dg(ll-2z+3e+...*n).
Now the sum of the mth powers of the first n natural numbers had been
obtained by Pascal and Fermat for use in just such problems; so the mathe-
maticians could readily replace the last expression by
(3) on"Q +
[ .#)
Now if one argues, as these men did, that the last two terms can be neglected
when z is infinite, the correct result is obtained. The limit process had not
yet been introduced-or was only crudely perceived-and so the neglect of
terms such as the last two was not justified.
We see that the method calls for approximating the curvilinear figure by
rectilinear ones, as in the method of exhaustion. However, there is a vital
shift in the final step: in place ofthe indirect proofused in the older method,
here the number of rectangles becomes infinite and one takes the limit of (3j
as z becomes infinite-though the thinking in terms of limit was at this stage
by no means explicit. This new approach, used as early as t5B6 by Ste;
in his Statics, was pursued by many men, including Fermat.a
If the curve involved was not the parabola, then one had to replace the
characteristic property of the parabola by that of the curve in question and
so obtain some other series in place of (l) above. Summing the analogue of
(l) to obtain the analogue of (2) did call for ingenuity. Hence the results on
areas, volumes, and centers of gravity were limited. Of course the powerful
method of evaluating the limit of such sums by reversing differentiation
was not yet envisaged.
Using essentially the kind of summation technique we have just illus_
trated, Fermat knew before 1636 that (in our notation)
f"*"d*:!:
Jo n* I
for all rational n except I .a rhis result was also obtained independently by
-
Roberval, Torricelli, and Cavalieri, though in some cases only in geometrical
form and for more limited z.
Among those who used summation in geometrical form was pascal.
In 1658 he took up problems of the cycloid.6 He calculated the area of any
Figure 17.10
segment of the curve cut off by a line parallel to the base, the centroid
of
th"e segment, and the volumes of solids generated by such segments when
revolv;d around their bases (YZ in Fig. 17'10) or a vertical line (the axis of
symmetry). In this work, as well as in earlier work on areas under the curves
of ttr. fu-ily y : tctu, he summed small rectangles in the manner described
in connectioriwith (l) above, though his work and results were stated geo-
metrically. Under the pseudonym of Dettonville, he proposed the problems
he had solved as a challenge io other mathematicians, then published his
own superior solutions (Letttes de Dettonaille, 1659) '
Beiore Newton and Leibniz, the man who did most to introduce
analytical methods in the calculus was John Wallis (1616-1703)' Though
h" iid ,rot begin to learn mathematics until he was about twenty-his
university eduJtion at Cambridge was devoted to theology-he became
professor'ofgeometryatOxfordandtheablestBritishmathematicianofthe
,"it to Newton. ln his Arithmetica l4fnitorum ( 1655)' he aPpli{
""rr,,r.y, and the method of indivisibles to effect many quadratures and
^rutyri,
obtain broad and useful results.
the
One of Wallis's notable results, obtained in his efforts to calculate
the
area of the circle analytically, was a new expression for z' He calculated
area bounded by the axes, tire ordinate at a\d the curve for
the functions
'c,
y : (l - x')o,! : (l - *\',U : (1 - *)',! : (l -
")""'
and obtained the areas
l. , | 3^ 3 I
,,* -;*',* -ix" + f,x',r -'s*'* i*u - i*""'
respectively. When * : l, these areas are
(4)
-2 B
I _-
48
" 3' --15' 105-
Now the circle is given bY Y : (l - tr2)1t2. Using induction
and interpola-
tion, Wallis calculated its area, and by further complicated reasoning
arrived at
t 2'2'4'4'6'6'B'8"'
,: T.T:r:i:i:1 .1:g='
354 THE CREATION OF THE CALCULUS
o
)o b
)t
)2
)t )---.-
Figure I7.l I Jal X3
Gregory of St. Vincent in his Opas Geometricum (1647), gave the basis
for the important connection between the rectangular hyperbola and the
logarithm function. He showed, using the method of exhaustion, that if for
the curve ofy : llx (Fig. l7.ll) the x, are chosen so that the areas a,b,c,
d, . . . are equal, then the y, are in geometric progression. This means that
the sum of the areas from xs to x,, which sums form an arithmetical pro-
gression, is proportional to the logarithm ofthey, values or, in our notation,
Ii"*
: r'*,'
This agrees with our familiar calculus result, because g : llx. The observa-
tion that the areas can be interpreted as logarithms is actually due to Gregory,s
pupil, the Belgian Jesuit Alfons A. de Sarasa ( 16l 8-67), in his So/zreo
Problematis a Mersenno Propositi ( 1649)
. About 1665 Newton also noted the
connection between the area under the hyperbola and logarithms and in-
cluded this relation in his Method of Fluxions. He expanded l/(I + r) by the
binomial theorem and integrated term by term to obtain
.x2xa
roge(r+4:x-r+A
Nicholas Mercator, using Gregory's results, gave the same series independ-
ently (though he did not state it explicitly) in his Logarithmotee hm a of 1668.
Other men soon found series which, as we would put it, converged more
rapidly. The work on the quadrature ofthe hyperbola and its relation to the
logarithm function was done by many men, and much of it was com-
municated in letters, so that it is hard to trace the order of discovery and to
assign credit.
Up to about 1650 no one believed that the length ofa curve could equal
exactly the length of a line. In fact, in the second book of La Glomitie,
Descartes says the relation between curved lines and straight lines is not nor
ever can be known. But Roberval found the length ofan arch ofthe cycloid.
The architect Christopher Wren (1632-1723) rectified the cycloid byshowing
355
Figure 17.12
(Fig.l7.l2)thatarcPA:2PT.EWilliamNeile(1637-70)alsoobtained
of Wallis, rectified the
iroiO; ,t" iength of an arch and, using a suggestion
,"rni"..bi"ut pirabola (!" : ax') '' Fermat' too' calculated some lengths of
curves. These *"r, t rrrully used an inscribed polygon to approximate
the
curve, found. the sum of the segments, then let the number of segments
professor
become infinite as each got smaller. James Gregory (1638-75), a
at St. Andrews and Edinburgh (whose work was known slightly to his
contemporaries but not known generally until a memorial volume, edited by
H. W. burnbull, appeared in 1939), gave in his Geometriae Pars Uniacrsalis
(Universal Part of Geometry, 1668) a method of rectifying curves'
Further results on rectification were obtained by Christian Huygens
(1629-95). In particular, he gave the length ofarc of the cissoid' He also
contributed to ihe work o., u..u, and volumes and was the first to give results
ontheareasofsurfacesbeyondthatofthesphere.Thusheobtainedthe
areas of portions of the surfaces ofthe paraboloid and hyperboloid'
Huygens
obtained all these results by purely geometric methods, though he did use
arithmetic, as Archimedes did occasionally, to obtain quantitative answers'
Therectificationoftheellipsedefiedthemathematicians.Infact,
James Gregory asserted that the rectification ofthe ellipse and the hyperbola
-could -be
not achieved in terms of known functions. For a while mathe-
maticians were discouraged from further work on this problem and no new
results were obtained until the next century'
We have been discussing the chief contributions of the predecessors of
on
Newton and Leibniz to the four major problems that motivated the work
the calculus. The four problems were regarded as distinct; yet relationships
among them had been- noted and even utilized' For example' Fermat
had
,rr.d tlh" very same method for finding tangents as for finding the maximum
value ofa function. Also, the problem ofthe rate ofchange ofa function
with
respect to the independent variable and the tangent problem were readily
,"", ,o be the same. In flact, Fermat's and Barrow's method of finding
tangents is merely the geometrical counterpart of finding the rate of
change'
B,rt" th. -ajo.featr.e of the calculus, next to the very concepts of the
6. The method was published by wallis it Tractatus Duo (1659 : Olna, 1' 550-69)'
Wren gave onlY the result.
7. Neil-e's work was published by Wallis in the reference in footnote 6'
356 THE oREATToN oF THE cALcuLUs
derivative and of the integral as a limit of a sum, is the fact that the integral
can be found by reversing the differentiation process or, as we say, by finding
the antiderivative. Much evidence ofthis relationship had been encountered,
but its significance was not appreciated. Torricelli saw in special cases that
the rate problem was essentially the inverse ofthe area problem. It was, in
fact, involved in Galileo's use of the fact that the area under a velocity-time
graph gives distance. Since the rate of change of distance must be velocity,
the rate of change of area, regarded as a "sum," must be the derivative of
the area function. But Torricelli did not see the general point. Fermat, too,
knew the relationship between area and derivative in special cases but did
not appreciate its generality or importance. James Gregory , in his Geometriae
of 1668, proved that the tangent and area problems are inverse problems
but his book went unnoticed.In Geometrical Lectures, Barrow had the relation-
ship between finding the tangent to a curve and the area problem, but it
was in geometrical form, and he himself did not recognize its significance.
Actually an immense amount of knowledge of the calculus had accumu-
lated before Newton and Leibniz made their impact. A survey of even the
one book by Barrow shows a method of finding tangents, theorems on the
differentiation of the product and quotient of two functions, the differentia-
tion ofpowers ofx, the rectification ofcurves, change ofvariable in a definite
integral, and even the differentiation of implicit functions. Though in
Barrow's case the geometric formulation made the discernment of the general
ideas difficult, in Wallis's Arithmetica Infinitorum comparable results were in
algebraic form.
One wonders then what remained to be achieved in the way of major
new results. The answer is greater generality of method and the recognition
ofthe generality ofwhat had already been established in particular problems.
The work on the calculus during the first two thirds of the century lost itself
in details. Also, in their efforts to attain rigor through geometry, many men
failed to utilize or explore the implications ofthe new algebra and coordinate
geometry, and exhausted themselves in abortive subtle reasonings. What
ultimately fostered the necessary insight and the attainment of generality was
the arithmetical work of Fermat, Gregory of St. Vincent, and Wallis, the
men whom Hobbes criticized for substituting symbols for geometry. James
Gregory stated in the preface to Geometriae that the true division of mathe-
matics was not into geometry and arithmetic but into the universal and the
particular. The universal was supplied by the two all-embracing minds,
Newton and Leibniz.
tually one man sharp enough to distinguish the valuable ideas of his pre-
d.c.rro"s from the welter of suggestions and pronouncements, imaginative
enough to fit the bits into a new account, and audacious enough to build a
master plan takes the culminating and definitive step' In the case of the
calculus, this was Isaac Newton.
Newton (1642-1727) was born in the hamlet of Woolsthorpe, England,
where his mother managed the farm left by her husband, who died two
months before Isaac was born. He was educated at local schools of low
educational standards and as a youth showed no special flair, except for an
interest in mechanical devices. Having passed entrance examinations with a
deficiency in Euclidean geometry, he entered Trinity college of cambridge
Universiiy in 1661 and studied quietly and unobstrusively' At one time he
almost changed his course from natural philosophy (science) to law' Ap-
parently .."Jirrirg very little stimulation from his teachers, except possibly
ilu..o*, he experimented by himself and studied Descartes's Giomitrie, as
well as the works of Copernicus, Kepler, Galileo, Wallis, and Barrow'
the university was
Just after Newton finished his undergraduate work
.lorJ do*r, because the plague was widespread in the London area' He left
Cambridgeandspenttheyears1665and1666inthequietofthefamily
home at woolsthorpe. There he initiated his great work in mechanics,
mathematics, and optics. At this time he realized that the inverse square law
of gravitation, a concePt advanced by others, including Kepler, as far back
lOt2, was the key to an embracing science of mechanics; he obtained a
",
general method for treating the problems of the calculus; and through ex-
that white light, such as
ieriments with light he made the epochal discovery
sunlight, is really composed of all colors from violet to red' "All this,"
N.*io, said latei in life, "was in the two plague years of 1665 and 1666, for
in those days I was in the prime of my age for invention, and minded mathe-
matics and philosophy [science] more than at any other time since"'
Ne*torrsaidnothingaboutthesediscoveries'HereturnedtoCambridge
in 1667 to secure a master's degree and was elected a fellow of Trinity college.
In 1669 Isaac Barrow resigned his professorship and Newton was appointed
in Barrow's place as Lucasian professor of mathematics' Apparently he was
not a successful teacher, for few students attended his lectures; nor was the
originality of the material he presented noticed by his colleagues' Only
Bri".o* and, somewhat later, the astronomer Edmond Halley (1656-1742)
recognized his greatness and encouraged him'
AtfirstNewtondidnotpublishhisdiscoveries.Heissaidtohavehad
an abnormal fear of criticism ; De Morgan says that " a morbid fear of
opposition from others ruled his whole life." when in 1672 he did publish
his *o.k on light, accompanied by his philosophy ofscience, he was severely
-most
criticized by of his contemporaries, including Robert Hooke and
Huygens, who had different ideas on the nature of light' Newton was so
958 THE oREATIoN oF THE cALcuLUs
taken aback that he decided not to publish in the future. However, in 1675
he did publish another paper on light, which contained his idea that light
was a stream of particles-the corpuscular theory of light. Again he was met
by a storm of criticism and even claims by others that they had already
discovered these ideas. This time Newton resolved that his results would be
published after his death. Nonetheless he did publish subsequent papers and
several famous books, the Pincipia, the Opticks (English edition 1704, Latin
edition 1706), and the Aithmctica Unbersalis (17O7).
From 1665 on he applied the law of gravitation to planetary motion;
in this area the works of Hooke and Huygens influenced him considerably.
In 1684 his friend Halley urged him to publish his results, but aside from his
reluctance to publish Newton lacked a proof that the gravitational attraction
exerted by a solid sphere acts as though the sphere's mass were concentrated
at the center. He says, in a letter to Halley of June 20, 1686, that until
1685 he suspected that it was false. In that year he showed that a sphere
whose density varies only with distance to the center does in fact attract an
external particle as though the sphere's mass were concentrated at its
center, and agreed to write up his work.
Halley then assisted Newton editorially and paid for the publication'
In 1687 the first editionof the Philosophiae Naturalis Principia Mathematica
(Tlu Mathematical Principlcs of Natural Philwophy) appeared. There were two
subsequent editions, in 1713 and 1726, the second edition containing im-
provements. Though the book brought Newton great fame, it was very
difficult to understand. He told a friend that he had purposely made it
difficult "to avoid being bated by little smatterers in mathematics." He no
doubt hoped in this way to avoid the criticism that his earlier papers on
light had received.
Newton was also a major chemist. Though there are no great discoveries
associated with his work in this area, one must take into account that
chemistry was then in its infancy. He had the correct idea of trying to explain
chemical phenomena in terms of ultimate particles, and he had a profound
knowledge of experimental chemistry. In this subject he wrote one major
paper, "De natura acidorum " (written in 1692 and published in l7l0). In
the Philosophical Transactions ofthe Royal Society of 1701, he published a
paper on heat that contains his famous law on cooling. Though he read the
works of alchemists, he did not accept their cloudy and mystical views. The
chemical and physical properties of bodies could, he believed, be accounted
for in terms of the size, shape, and motion of the ultimate particles; he
rejected the alchemists' occult forces, such as sympathy, antipathy, congruity,
and attraction.
In addition to his work on celestial mcchanics, light, and chemistry,
Newton worked in hydrostatics and hydrodynamics. Beyond his superb
experimental work on light, he experimented on the damping of pendulum
THE WORK OF NEWTON 359
motion by various media, the fall of spheres in air and water, and the flow of
water from jets. Like most men of the time Newton constructed his own
equipment. fue built two reflecting telescopes, even making the alloy for the
frames, molding the frames, making the mountings, and polishing the
lenses'
Aite. ,.rviig u, a professor for thirty-five years Newton became depressed
and suffered a nervous breakdown. He decided to give up research and in
16g5acceptedanappointmentaswardenoftheBritishMintinLondon'
During his twenty-seven years at the mint, except for work on an occasional
probleim, he did no research' He became president of the Royal Society
in
i703, unoffice he held until his death; he was knighted in 1705'
it is evident that Newton was far more bngrossed in science than in
mathematics and was an active participant in the problems of his time.
He
considered the chief value of his scientific work to be its support of
revealed
religion and was, in fact, a learned theologian, though he never took orders'
He"thought scientific research hard and dreary but stuck to it because it
g.rre .rrii.r"e of God's handiwork. Like his predecessor Barrow, Newton
t"r.rr.d to religious studies later in life. In The Chronology oJ Arcbnt Kingdoms
Amended,he tied to date accurately events described in the Bible and
other
religious documents by relating them to astronomical events' His major
relilious work was the obsentations tfion thz proplucies of Danful and tlu
Apialypse of st. John. Biblical exegesis was a phase of the rational approach
to
."figi# thit was popular in the Age of Reason; Leibniz, too, took a hand in it'
So far as the caiculus is concerned, Newton generalized the ideas
already
advanced by many men, established full-fledged methods, and showed the
interrelationships of several of the major problems described above' Though
he learned -u& ., a student of Barrow, in algebra and the calculus he
was
more influenced by the works of Wallis' He said that he was led to his
discoveries in analysis by the Arithmctica hfinitarum; certainly in his own work
on the calculus he made progress by thinking analytically' However' even
Newton thought the geometry was necessary fior a rigorous proof'
In 1669 Newton circulated among his friends a monograPh entitled
De Analysi per Aequationes Numero Terminorum ltfrnitas (On Analysis by Means
of oquJtions *iti, Infinite Number of Terms) ; it was not published until
",
l7l I . He suPposes that he has a curve and that the area z (Fig' 17' 13) under
this curve is given bY
(5) z: axn,
! : max^-t'
Thus, in our language, the rate of change of area at any x is the y-value of
the curve at that value ofx. Conversely, if the curve is g : max^-1, the
area under it is z : axn.
In this process Newton not only gave a general method for finding the
instantaneous rate of change ofone variable with respect to another (z with
respect to x in the above example), but showed that area can be obtained by
reversing the process of finding a rate of change. Since areas had also been
expressed and obtained by the summation of infinitesimal areas, Newton
also showed that such sums can be obtained by reversing the process offinding
a rate of change. This fact, that summations (more properly, limits of sums)
can be obtained by reversing differentiation, is what we now call the funda-
mental theorem of the calculus. Though it was known in special cases and
dimly foreseen by Newton's predecessors, he saw it as general. He applied
the method to obtain the area under many curves and to solve other problems
that can be formulated as summations.
After showing that the derivative ofthe area is they-value and asserting
that the converse is true, Newton gave the rule that, if the y-value be a sum
of terms, then the area is the sum of the areas that result from each of the
terms. In modern terms, the indefinite integral of a sum of functions is the
sum of the integrals of the separate functions.
His next contribution in the monograph carried further his use of infinite
series. To integrate y : azl(b + r), he divided a2 by b * x and obtained
a2
,!:---J-----L..-
a2x a2x2 a2x3
" b b2' b3 b4
He says of this infinite series that a few of the initial terms are exact enough
for any use, provided that D be equal to .r repeated some few times'
Likewise, to integrate y : ll0 * x2) he uses the binomial expansion
to write
y:l-x2+x4-xo+xa-...
and integrates term by term' He notes that if, instead, y is taken to be
ll(x2 + 1), then by binomial expansion one would obtain
! : x-2 - x-a +*-6 _ *-8 +...
and now one can integrate term by term. He then remarks that when 'r is
small enough the first expansion is to be used; but when x is large, the
second is to be used. Thus he was somewhat aware that what we call con-
vergence is important, but had no precise notion about it'
Newton iealized that he had extended term by term integration to
infinite series but says in the De Analysi.z
And whatever the common Analysis performs by Means of Equations of
a finite Number of Terms (provided that can be done) this can always
perform the same by Means of infinite Equations so that I have not made
any question of giving this the name of Analysis likewise' For the reason-
ings in this are no less certain than in the other; nor the equations less
exact; albeit we Mortals whose reasoning Powers are confined within
narrow limits, can neither exPress, nor so conceive all the Terms of these
Equations, as to know exactly from thence the quantities we lvant'
Thus far in his approach to the calculus Newton used what may be
described as the method of infinitesimals. Moments are infinitely small
quantities, indivisibles or infinitesimals. of course the logic of what Newton
did i. ro, clear. He says in this work that his method is "shortly explained
rather than accurately demonstrated."
Newton gave a second, more extensive, and more definitive exposition
of his ideas in the book Methodus Fluxionum et scrierum hfinitarum, written in
1671 but not published until 1736. In this work he says he regards his
variables as generated by the continuous motion ofpoints, lines, and planes,
rather than as static aggregates of infinitesimal elements, as in the earlier
paper. A variable quantity he now called a fluent and its rate of change, the
hr*io.r. His notation is * and/ for fluxions of the fluents t andy' The fluxion
of .i is ,f , etc. The fluent of which x is the fluxion is l, and the fluent of the
latter is *.
In this second work Newton states somewhat more clearly the funda-
mental problem of the calculus: Given a relation between two fluents, find
the relaiion between their fluxions, and conversely. The two variables whose
relation is given can represent any quantities. However, Newton thinks of
them as chlnging with iime because it is a useful way of thinking, though'
262 THE CREATToN oF THE cALcuLUs
!*!io:(x**o)",
and then proceeds as in the earlier paper. He expands the right side by
using the binomial theorem, subtracts y : x', divides through by o, neglects
all terms still containing o, and obtains
g : ssn- t1.
In modern notation this result can be written
Idt: n*"-'4,dt
and since dyldx : (dyldt) l@xldt), Newton, in finding the ratio of dyldt to
dxldt or ! to i, has found dgldx.
The method of fluxions is not essentially different from the one used in
the De Analgsi, nor is the rigor any better; Newton drops terms such as i*a
and **o*o (he writes *3oo) on the ground that they are infinitely small com-
pared to the one retained. However, his point of view in the Method of
Fluxions is somewhat different. The moments io and go change with time o,
whereas in the fust paper the moments are ultimate fixed bits of x and z.
This newer view follows the more dynamic thinking of Galileo; the older
used the static indivisible of Cavalieri. The change served, as Newton put it,
only to remove the harshness from the doctrine of indivisibles; however, the
moments *o and /o are still some sort of infinitely small quantities. Moreover,
i and !, which are the fluxions or derivatives with respect to time of * and
y, are never really defined; this central problem is evaded.
Given a relation between * andy, finding the relation between x and, y
is more difficult than merely integrating a function of x. Newton treats
several types: (l) when *, 11, and x or ! ate present; (2) when i, 1!, x, and, y
are present; (3) when *, y, 2, and. the fluents are present. The first type is
the easiest and, in modern notation, calls for solving dgldx : /(r). Of the
second type, Newton treats yl* : I - 3* i g I x2 | xg andsolves it by a
successive approximation process. He starts with 1il*: I
- 3r * x2 as a
first approximation, obtains y as a function ofr, introduces this value ofy on
the right side of the original equation, and continues the process. Newton
describes what he does but does not justify it. Of the third type, he treats
2i - a +1ix:0. He assumes a relation between x and y, say x:y2, so
that i : 21jy. Then the equation becomes 4!C - Z - !i!' : 0, from which
he gets 2y2 + (y" 13) : z. Thus, if the third type is regarded as a partial
differential equation, Newton obtains only a particular integral.
THE WORK OF NEWTON 363
Newton'snewconcePt,themethodofprimeandultimateratio,amounts
to this. He considers the function ! : x". To find the fluxion of y or 'rn, let
r "by flowing" become r + o. Then .r' becomes
o
(x + o)*: xn * noxn-L *o': o'*o-' *""
Figure 17.14
,_ (l + i2\3t2
!
where * is taken as l. He also gave this same quantity in polar coordinates.
Finally, he included a brief table of integrals.
Newton did not publish his basic papers in the calculus until long after
he had written them. The earliest printed account of his theory of fluxions
appeared in Wallis's Algebra (2nd ed. in Latin, 1693), of which Newton
wrote pages 390 to 396. Had he published at once he might have avoided the
controversy with Leibniz on the priority of discovery.
Newton's first publication involving his calculus is the great Mathe-
matical Principles of Natural Philosophy.s So far as the basic notion of the
8. The third edition was translated into English by Andrew Motte in 1729. This edition,
revised and edited by Florian Cajori, was published by the University of California Press
in 1946.
THE woRK oF NEwroN 365
can be diminished without end. In the first and third editions of the Principia
Newton says, ,,IJltimate ratios in which quantities vanish are not, strictly
speaking, ratios of ultimate quantities, but limits to which the ratios of these
quantities, decreasing without limit, approach, and which, though they can
*^" ,.u"". than any given difference whatever, they can neither pass over
e
nor attain before the quantities have diminished indefinitely." This is the
clearest statement he ever gave as to the meaning of his ultimate ratio.
Apropos ofthe preceding quotation, he also says, "By the ultimate velocity
is meant that with which the body is moved, neither before it arrives at its
last place, when the motion ceases, nor after I but at the very instaut when it
a..i',res. . . . And, in like manner, by the ultimate ratio of evanescent quan-
tities is to be understood the ratio of quantities, not before they vanish, nor
after, but that with which they vanish."
ln the Principia Newton used geometrical methods of proof' However, in
what are called the Portsmouth Papers, containing unpublished work, he
used analytical methods to find some of the theorems' These papers show
that he also obtained analytically results beyond those he was able to trans-
late into geometry. One reason he resorted to Seometry is believed to be that
the proofi would be more understandable to his contemporaries. Another is
that he admired Huygens's geometrical work immensely and hoped to equal
it. In these geometrical proofs Newton uses the basic limit processes of the
calculus. Thus the area under a curve is considered essentially as the limit
of the sum of the approximating rectangles, just as in the calculus today'
llowever, instead of calculating srrch areas, he uses this concept to compare
areas under diflerent curves.
He proves that, when AR and BR (Fig' 17'15) are the perpendiculars
to the tingent s at A arrd .B of the arc ACB, the ultimate ratio, when 'B
approaches- and coincides with z{, of any two of the quantities chord lB'
,eCA, arrd AD, is l. Ilence he says in Corollary 3 to Lemma 2 of Book I'
"ii
"And therefore in all our reasoning about ultimate ratios, we may freely use
any one ofthese lines for any other." He then proves that when B approaches
ani coincides with A, the ratio of any two triangles (ateas) RAB, RACB'
andRADwillbel...Andhenceinallreasoningsaboutultimateratios,we
may use any one of these triangles for any other'" Also, (Fig' 17'16) let BD
uni CEb" perpendicular to lE (which is not necessarily tangent to arc ABC
at,4). When B and C apptoach and coincide with l, the ultimate ratio of the
areas ACE and ABD will equal the ultimate ratio of AE2 to AD2'
The Principi.a contains a wealth of results, some of which we shall note'
Law II. The change [in the quantity] of motion is proportional to the motive
power impressed; and is made in the direction of the right line in which that
force is impressed.
the law was first fully recognized by Euler. This law incorporates the key
change from the mechanics of Aristotle, which affirmed that force causes
velocity. Aristotle had also affirmed that a force is needed to maintain
velocity. Law I denies this'
Law III. To every action there is always opposed an equal reaction' ' ' '
We shall not digress into the history of mechanics excePt to note that
the first two laws are more explicit and somewhat generalized statements of
the principles of motion previously discovered and advanced by Galileo and
Descartes. The distinction between mass, that is, the resistance a body offers
to a change in its motion, and weight, the force gravity exerts on the mass of
any object, is also due to these men; and the vector character of force
generalires Galileo,s principle that the vertical and horizontal motions of a
projectile, for example, can be treated independently.
Book I of the Principia begins with some theorems of the calculus,
including the ones involving ultimate ratios cited above. It then discusses
motion under central forces, that is, forces that always attract the moving
object to one (fixed) point (the sun in practice), and proves in Proposition I
that equal areas are swept out in equal time (which encompasses Kepler's
law of areas). Newton considers next the motion ofa body along a conic
section and proves (Props. I l, 12, and 13) that the force must vary with the
inverse square of the distance from some fixed point' He also proves the
converse, which contains Kepler's first law. After some treatment of cen-
tripetal force, he deduces Kepler's third law (Prop. l5)' There follow two
,""1ior* devoted to properties of the conic sections. The principal problem is
the construction of conics that satisfy five given conditions; in practice these
are usually observational data. Then, given the time an object has been in
motion along a conic section, he determines its velocity and position' He
takes up the motion of the apse lines, that is, the lines joining the center of
attraction (at one focus) to the maximum or minimum distance of a body
moving along a conic that is itself rotating at some rate about the focus'
section 10 is devoted to the motion of bodies along surfaces with special
reference to pendulum motion. Here Newton gives due acknowledgment to
Huygens. In connection with the accelerating effect ofgravity on motions, he
invertigates geometrical proPerties of cycloids, epicycloids, and hypocycloids
and gives the length of the epicycloid (Prop. 49).
in Section I I Newton deduces from the laws of motion and the law of
gravitation the motion of two bodies, each attracting the other in accordance
*ith th. gravitational force. Their motion is reduced to the motion of one
around the fixed second body. The moving body traverses an ellipse'
He then considers the attraction exerted by spheres and spheroids of
uniform and varying density on a particle. He gives (Sec' 12, Prop' 70) a
geometrical proof that a thin homogeneous spherical shell exerts no force on
368 THE CREATToN oF THE cALcuLUs
a particle in its interior. Since this result holds for a thin shell, it holds for a
sum of such shells, that is, for a shell of finite thickness. (He proves later
[Prop. 91, Cor. 3] that the same result holds for a homogeneous ellipsoidal
shell, that is, a shell contained between two similar ellipsoidal surfaces,
similarly placed.) Proposition 71 shows that the attraction of a thin homo-
geneous spherical shell on an external particle is equivalent to the attraction
that would be exerted if the mass ofthe shell were concentrated at the center,
so that the shell attracts the external particle toward the center and with a
force varying inversely as the square ofthe distance from the center. Proposi-
tion 73 shows that a solid homogeneous sphere attracts a particle inside with
a force proportional to the particle's distance from the center. As for the
attraction that a solid homogeneous sphere exerts on an external point,
Proposition 74 shows that it is the same as if the mass of the sphere were
concentrated at its center. Then if two spheres attract each other, the first
attracts every particle of the second as if the mass of the first were concen-
trated at its center. Thus the first sphere becomes a particle attracted by the
distributed mass of the secondl hence the second sphere can also be treated
as a particle with its mass concentrated at its center. Thus both spheres can
be treated as particles with their masses concentrated at their respective
centers. All these results, original with Newton, are extended to spheres
whose densities are spherically symmetric and to other laws of attraction in
addition to the inverse square law.
Newton next takes up the motion of three bodies, each attracting the
other two, and obtains some approximate results. The problem of the motion
of three bodies has been a major one since Newton's time and has not been
solved as yet,
The second book of the Principia is devoted to the motion of bodies in
resisting media such as air and liquids. It is the beginning of the subject of
hydrodynamics. Newton assumes in some problems that the resistance of the
medium is proportional to the velocity and in others to the square of the
velocity of the moving body. He considers what shape a body must have to
encounter least resistance (see Chap. 24, sec. I ) . He also considers the motion
of pendulums and projectiles in air and in fluids. A section is devoted to the
theory of waves in air (e.g., sound waves) and he obtains a formula for the
velocity of sound in air. He also treats the motion of waves in water. Newton
continues with a description of experiments he made to determine the
resistance fluids offer to bodies moving in them. One major conclusion is
that the planets move in a vacuum. In this book Newton broke entirely new
ground; however, the definitive work on fluid motion was yet to be done
Book III, entitled On the System oJ thz World, contains the application of
the general theory developed in Book I to the solar system. It shows how
the sun's mass can be calculated in terms of the earth's mass, and that the
mass of any planet having a satellite can be found in the same way. He
THE woRK oF NEwroN 369
calculates the average density ofthe earth and finds it to be between 5 and 6
times that of water (today's figure is about 5'5)'
Heshowsthattheearthisnotatruespherebutanoblatespheroidand
spheroid
calculates the flattening; his result is that the etlipticity ofthe oblate
is l/230 (the figure today is l/297). From the observed oblateness of any
planet, the lengih ofits day is then calculated. Using the amount offlattening
and the notioi of centripetal force, Newton comPutes the variation of the
earth's gravitational attraction over the surface and thus the variation
in the
*eight if an object. He proves that the attractive force of a spheroid is not
the same as if the spheroid's mass were concentrated at its center'
He then u""olr.rt, for the precession of the equinoxes' The explanation is
basedonthefactthattheearthisnotsphericalbutbulgesoutalongthe
equator. Consequently the gravitational attraction of the moon on the earth
does ,ot effectively act on the center ofthe earth but forces a
periodic change
in the direction ofthe earth's axis of rotation' The period of this change was
calculated by Newton and found to be 26,000 years, the value obtained by
Hipparchus ty inference from observations available to him'
Newton explained the main features of the tides (Book I' Prop' 66'
Book III, Props. 36, 37). The moon is the main cause; the sun, the second'
Using the sun-'s mass he calculated the height of the solar tides' From the
obseied heights of the spring and neap tides (sun and moon in full con-
junction
-estimate or
lull opposition) he determined the lunar tide and made an
of the mass of the moon. Newton also managed to give some
approximate treatment of the effect of the sun on the moon's motion around
the.a.th. He determined the motion of the moon in latitude and longitude;
the motion of the apse line (the line from the center of the earth to the
maximum distance of the moon) ; the motion of the nodes (the points in
which the moon's path cuts the plane of the earth's orbit; these points
regress, that is, moie slowly in a direction opposite to the motion
of the
mlon iiself) I the evection (a periodic change in the eccentricity ofthe moon's
orbit) ; the annual equation (the effect on the moon's motion of the daily
change in distance beiween the earth and the sun); and the periodic
change
in thI inclination of the plane of the moon's orbit to the plane of the earth's
orbit. There were seven known irregularities in the motion of the moon and
Newton discovered two more, the inequalities of the apogee (apse line) and
of the nodes. His approximation gave only half of the motion of the apse line'
Clairaut in 1752 improved the calculation and obtained the full 3' of
rotation of the apse line; however, much later John Couch Adams found the
correct calculation in Newton's papers. Finally Newton showed that
the
comets must be moving under the gravitational attraction of the sun because
their paths, determinid on the basis of observations' are conic sections'
N.*ton devoted a great deal of time to the problem of the moon's motion
was needed to
because, u, *" ,rot"l in the preceding chapter, the knowledge
370 THE CREATION OF THE CALCULUS
Figure 17.17
I, )
B
He also observed, of course, that if the original sequence starts from 0, the
sum of the first differences is the last term of the sequence.
To relate these facts to the calculus he had to think of the sequence of
numbers as the y-values of a function and the difference of any two as the
difference of two nearby y-values. Initially he thought of r as representing
the order of the term in the sequence and, g as representing the value ofthat
term.
The quantity dr, which he often writes as a, is then I because it is the
difference ofthe orders of two successive terms, and dy is the actual difference
in the values of two successive terms. Then using omn. as an abbreviation
for the Latin omnia, to mean sum, and using I for dy, Leibniz concludes that
omn. / : y, because omn. I is the sum of the first differences of a sequence
whose terms begin with 0 and so gives the last term. However, ornn. yl
presents a new problem. Leibniz obtains the result that olrrn. yl is y2l2 by
thinking in terms of the function y : ,,. Thus, as Figure 17.17 shows, the
area of triangle ABC is the sum of the yl (for "small" /) and it is also y212.
Leibniz says, " Straight lines which increase from nothing each multiplied
by its corresponding element of increase form a triangle." These few facts
already appear, among more complicated ones, in papers of 1673.
In the next stage he struggled with several difficulties. He had to make
the transition from a discrete series of values to the case where dy arrd, dx are
increments ofan arbitrary functiony ofr. Since he was still tied to sequences,
wherein x is the order of the term, his a or /x was I ; so he inserted and omitted
a freely. When he made the transition to the dy and dx of any function, this
a was no longer 1. Iilowever, while still struggling with the notion of summa-
tion he ignored this fact.
Thus in a manuscript of October 29, 1675, Leibniz starts with
_t
(7) omn. z/ : omn.omn. l1r
'a
TrrE WORK OF LErBNrz 373
(B) ornn.4:$'
Hence from (7) and (B)
q: !{t,+*: t,H
Leibniz says that this result is admirable.
Another theorem of the same kind, which Leibniz derived from a
geometrical argument, is
(10) omn. xl : , omn. / - omn'omn. /,
[.on:w-lua..
Now Leibniz lets I itself in (10) be r, and obtains
omn. *2 : ,r omn. .r - omn.Omn. .r.
But omn. x, he says, is x2l2 (he has shown that omn. yl is y2/2). Hence
Figure 17.18
numerous that the difference between this sum and the true area under the
curve could be neglected, and at other times as a sum of the ordinates or
y-values. This latter concePt of area was common' especially among the
indivisibilists, who thought that the ultimate unit of area and the y-value
were the same.
With respect to differentiation, even after recognizing that dg and dx
can be arbitrarily small quantities, Leibniz had yet to overcome the funda-
mental difficulty that the rutio dgldx is not quite the derivative in our sense.
He based his argument on the characteristic triangle, which Pascal and
Barrow had also used. This triangle (Fig. 17.18) consists of dy, dt, and the
chord PQ, which Leibniz also thought of as tlu curae bctween P and Qand part
ol the tangent al T. Though he speaks of this triangle as indefinitely small, he
maintains nevertheless that it is similar to a definite triangle' namely, the
triangle SIU formed by the subtangent SU, the ordinate at f, and the
length of tangent S7- Hence dg and dx are ultimate elements, and their ratio
has a definite meaning. In fact, he uses the argument that, from the similar
triangles PftQ and SUT, dgldx : TUISU.
In the manuscript of November 11, 1675, Leibniz shows how he can
solve a definite problem. He seeks the curve whose subnormal is inversely
proportional to the ordinate. In Figure l7.l8, the normal is TV and the
subnormal p is uV. From the similarity of triangles PRQ and TUV, he has
dvP
dxs
pd*:ydy.
But the curve has the given property
,c
b: -,b
376 TI{E CREATION OF THE CALCULUS
* :# 0,.
Then
I*:lt*
ug
x:TE'
Leibniz also solved other inverse tangent problems.
In a paper ofJune 26, 1676, he realizes that the best method of finding
tangents is to find dgldx, wherc dy and. dx are differences and. dgldx is the
quotient. IIe ignores dt.dx and higher powers of /.r.
By November of 1676, he is able to give the general rules dx" :
nx"-r dx fot integral and fractional n and, J x" : x"+r ln * l, and says, "The
reasoning is general, and it does not depend upon what the progressions
of the x's may be." Here x still means the order of the terms of a sequence.
In this manuscript he also says that to differe ntiate \/iJn7jE, let
a + bz + czL : x, differentiate l-tt, and. multiply by dxldz. This is the
chain rule.
ByJuly ll, 1677 , Leibniz could give the correct rules for the differential
ofsum, difference, product, and quotient of two functions and for powers and
roots, but no proofs. In the manuscript of November Ll, 1675, he had
struggled with d (ua) and d(ula), and thought that d (uo) : du da.
In 1680, dr has become the difference of abscissas and, dg the differences
in the ordinates. IIe says, ". . . now these d,r and, dy are taken to be infinitely
small, or the two points on the curve are understood to be a distance apart
that is less than any given length. . . . " He calls dy the ('momentaneous
increment" in y as the ordinate moves along the r-axis. But PQ in Figure
l7.lB is still considered part ofa straight line. It is "an element ofthe curve
or a side of the infinite-angled polygon that stands for the curve. . . ." He
continues to use the usual differential form. Thus, if y : d2lx, therr
- a2-
: _7dx.
dg
He also says that differences are the opposite to sums. Then, to get the area
under a curve (Fig. 17.19), he takes the sum of the rectangles and says one
can neglect the remaining " triangles, since they are infinitely small com-
THE WORK OF LEIBNIZ 3?7
Figure 17.19
dg:dx : g: subtangent.
This definition ofdy presumes some expression for the subtangent; hence the
definition is not complete. Moreover, Leibniz's definition of tangent as a line
joining
- two infinitely near points is not satisfactory.
H" also gives in this paper the ruies he had obtained in 1677 for the
differential of the sum, product, and quotient of two functions and the rule
for finding d (x") . ln this last case he sketches the proof for positive integral a
but says th" .ri" is true for all n; fot the other rules he gives no proofs' He-
makes applications to finding tangents, maxima and minima, and points of
inflection. This paper, six pages long, is so unclear that the Bernoulli
brothers called it "an enigma rather than an explication'"l3
Y: \E-7 * Jl2x-xz
l-=4-^
as the equation of the cycloid. His point here is to show that by his methods
and notation some curves can be expressed as equations not obtainable in
other ways. He reaffirms this in his Historia where he says that his dx,, ddx
(second difference), and the sums that are the inverses of these differences
can be applied to all functions of x, not excepting the mechanical curves of
Vieta and Descartes, which Descartes had said have no equations. Leibniz
also says that he can include curves that Newton could not handle even with
his method of series.
In the 1686 paper as well as in subsequent papers,ls Leibniz gave the
differentials of the logarithmic and exponential functions and recognized
exponential functions as a class. He also treated curvature, the osculating
circle, and the theory of envelopes (see Chap. 23). In a letter to John
Bernoulli of 1697, he differentiated under the integral sign with respect to a
parameter. He also had the idea that many indefinite integrals could be
evaluated by reducing them to known forms and speaks of preparing tables
for such reductions-in other words, a table of integrals. He tried to define
the higher-order differentials swch as ddy (d2g) and dddy (dsy), but the defini-
tions were not satisfactory. Though he did not succeed, he also tried to find
a meaning for d"y where c is any real number.
With respect to notation, Leibniz worked painstakingly to achieve the
best. His dx, dy, and dyldx are, of course, still standard. He introduced the
notation log x, dn for the zth differential, and even d-L and d-" for t and,
the zth iteration of summation, respectively.
In general Leibniz's work, though richly suggestive and profound, was
so incomplete and fragmentary that it was barely intelligible. Fortunately,
the Bernoulli brothers, James and John, who were immensely impressed and
stirred by Leibniz's ideas, elaborated his sketchy papers and contributed an
immense number of new developments we shall discuss later. Leibniz agreed
that the calculus was as much theirs as his.
Both also arithmetized the calculus; that is, they built on algebraic
concepts. The algebraic notation and techniques used by Newton and Leib-
niz not only gave them a more effective tool than geometry, but also per-
mitted many diff.."t t geometric and physical problems to be treated by the
same technique. A major change from the beginning to the end of the
seventeenth century was the algebraicization of the calculus' This is com-
parable to what vieta had done in the theory ofequations and Descartes and
Fermat in geometry.
The third vital contribution that Newton and Leibniz share is the
reduction to antidifierentiation of area, volume, and other problems that
were previously treated as summations. Thus the four main problems-
rates, tangents, maxima and minima, and summation-were all reduced to
differentiation and antidifferentiation.
The chid distinction between the work of the two men is that Newton
used the infinitely small increments in .r and y as a means of determining
the fluxion or derivative. It was essentially the limit of the ratio of the in-
crements as they became smaller and smaller. On the other hand, Leibniz
dealt directly with the infinitely small increments in r and y, that is, with
differentials, and determined the relationship between them. This difference
reflects Newton's physical orientation, in which a concePt such as velocity is
central, and Leibniz's philosophical concern with ultimate particles of
matter, which he called monads. As a consequence, Newton solved area and
volume problems by thinking entirely in terms of rate of change' For him
differentiation was basic; this process and its inverse solved all calculus
problems,
-center
and in fact the use of summation to obtain an area, volume, or
of gravity rarely apPears in his work. Leibniz, on the other hand'
thought first in terms of summation, though of course these sums were
evaluated by antidifferentiation.
A third distinction between the work of the two men lies in Newton's
free use of series to represent functions; Leibniz preferred the closed form.
In a letter to Leibniz of 1676, Newton stressed the use ol series even to solve
simple differential equations. Though Leibniz did use infinite series, he
repliied that the real goal should be to obtain results in finite terms, using the
trigonometric and logarithmic functions where algebraic functions would
,rot ...r". He recalled to Newton James Gregory's assertion that the rectifi-
cation ol the ellipse and hyperbola could not be reduced to the circular and
logarithmic functions and challenged Newton to determine by the use of
series whether Gregory was correct. Newton replied that by the use of series
he could decide whether some integrations could be achieved in finite terms,
but gave no criteria. Again, in a letter of l7l2 to John Bernoulli, Leibniz
objected to the expansion offunctions into series and stated that the calculus
should be concerned with reducing its results to quadratures (integrations)
and, where necessary, quadratures involving transcendental functions'
38o rrrE 0REATToN oF TrrE cALcuLUs
z : dx tb:ddg : dg ds:ddx
THE SOUNDNESS OF TI{E CALCULUS 383
Figure 17.20
z
dxlds dslds
: 7\i&:l-xtds,'
which are more llamiliar forms. James also gave the result in polar coordi
nates, I
John produced a now-famous theorem for obtaining the limit approached
by a fraction whose numerator and denominator approach 0. This theorem
was incorporated by Guillaume F. A. I'Hospital (1661-1704), a pupil of
pctits
John, in an influential book on the calculus, the Analyse cbs irfinhrunt
(1696), and is now known as L'Hospital's rule.
As to the ultimate meani ngs of dg, dx and dg ldx, Leibniz remained vague'
He spoke ofldr as the difference in x values between two infinitely near points
u.rd tf th. tangent as the line joining such points. He dropped differentials
of higher ordei with no justification, though he did distinguish among the
vario-us orders. The infinitely small d* and dg were sometimes described as
vanishing or incipient quantities, as opposed to quantities already formed'
These indefinitety small quantities were not zero, but were smaller than any
finite quantity. Alternatively he appealed to Beometry to say that a higher
differential is to a lower one as a point is to a linel? ot that dx is to * as a
point to the earth or as the radius of the earth to that of the heavens. The
iatio of two infinitesimals he thought of as a quotient of inassignables or of
indefinitely small quantities, but one which could nevertheless be expressed
in terms of definite quantities such as the ratio of ordinate to subtangent.
A flurry of atta&s and rebuttals was initiated in books of 1694 and 1695
by the Dutch physician and geometer Bernard Nieuwentijdt (1654-17t B) '
Aithough he uimitted that in general the new methods led to correct results,
he criticired the obscurity and pointed out that sometimes the methods led
to absurdities. He complained that he could not understand how the infinitely
small quantities differed from zero and asked how a sum of infinitesimals
could be finite. He also challenged the meaning and existence of differentials
of higher order and the rejection of infinitely small quantities in portions of
the arguments.
L.ib.rir, in a draft of a reply to Nieuwentijdt, probably written in 1695,
and in an article in the Acta Eruditorum of 1695,18 gives various answers' He
speaks of overprecise,' critics and says that excessive scrupulousness should
,.
,lot us to ieject the fruits of invention. He then says his method differs
"ur..
from Archimedes; only in the expressions used, but that his own are better
adapted to the art of discovery. The words "infinite" and "infinitesimal"
signify merely quantities that one can take as large or as small as one wishes
in order to show that the error incurred is less than any number that can be
assigned-in other words, that there is no error, one can use these ultimate
thiJgs-that is, infinite and infinitely small quantities-as a tool, much as
algebraists used imaginary roots with great profit'
Leibniz's argument thus far was that his calculus used only ordinary
mathematical concepts. But since he could not satisfy his critics, he enun-
ciated a philosophical principle known as the law of continuity, which was
practicaliy the same as one already stated by Kepler' In 1687, in a letter to
iierre Bayle,le Leibniz expressed this principle as follows: " In any supposed
transition, ending in any terminus, it is permissible to institute a general
reasoning, in which the final terminus may also be included'" To support
17. Math. Schriften, 5, 322 ff.
18, Acta bud., 1695, 310-16 : Math. Schriftetr, 5, 320-28'
19. Math. SchriJten, 5' 385.
386 TIIE CREATION OF THB CALCULUS
Figurc 17.21
On these suppositions, all the rules ofour algorithm, as set out in the
Acta Eruditorum for October 1684, can be proved without much trouble'
Leibniz then goes over these rules. He introduces the quantities (d)y and
(d)x and carries out the usual processes of differentiation with them. These
he calls assignable or definite nonvanishing quantities. After obtaining the
final result, he says, we can replace (d)y and (d)x by the evanescent or un-
assignable quantities dy and, dx, by making "the supposition that the ratio of
the evanescent quantities dy and dx is equal to the ratio of (d)g and (d)x'
because this supposition can always be reduced to an undoubtable truth"'
Leibniz's principle of continuity is certainly not a mathematical axiom
today, but he emphasized it and it became important later. He gave many
arguments that are in accordance with this principle. For example, in a
letter to Wallis,2o Leibniz defended his use of the characteristic triangle as a
form without magnitude, the form remaining after the magnitudes had been
reduced to zeto, and challengingly asked, "Who does not admit a form
without magnitude ? " Likewise, in a letter to Guido Grandi,zl he said the
infinitely small is not a simple and absolute zero but a relative zero, that is,
an evanescent quantity which yet retains the character of that which is dis-
appearing. However, Leibniz also said, at other times, that he did not
believe in magnitudes truly infinite or truly infinitesimal.
Leibniz, less concerned with the ultimate justification of his procedures
than Newton, felt that it lay in their effectiveness. He stressed the procedural
or algorithmic value of what he had created. Somehow he had confidence
that if he formulated clearly the rules of operation and these were proPerly
applied, reasonable and correct results would be obtained, however doubtful
might be the meanings of the symbols involved.
It is apparent that neither Newton nor Leibniz succeeded in making
clear, let alone precise, the basic concepts ofthe calculus: the derivative and
the integral. Not being able to grasp these properly, they relied upon the
coherence of the results and the fecundity of the methods to push ahead
without rigor.
Several examples may illustrate the lack of clarity even among the great
immediate successors of Newton and Leibniz. John Bernoulli wrote the first
text on the calculus in 169l and 1692. The portion on the integral calculus
was published in 1742;22 the Part on the differential calculus, Die Dffirential'
rechiung, was not published until 1924. However, the Marquis de l'Hospital
did publish a slightly altered French version (already referred to) under his
own name in 1696. Bernoulli begins the Diferentialrechnung with three Postu-
lates. The first reads: "A quantity which is diminished or increased by an
this limit process yields irrationals not obtainable as roots of rationals. But
these insights of Wallis and Gregory were ignored in their century.
The foundations of the calculus remained unclear. Adding to the con-
fusion was the fact that the proPonents of Newton's work continued to speak
of prime and ultimate ratios, while the followers of Leibniz used the in-
finitely small non-zero quantities. Many of the English mathematicians,
perhaps because they were in the main still tied to the rigor of Greek geom-
etry, distrusted all the work on the calculus. Thus the century ended with
the calculus in a muddled state.
Bibliography
Armitage, A.: Edmond Halley, Thonr'as Nelson and Sons, 1966'
Auger, L.: {Jn Saaant miconnu: Gilles Personz de Roberaal (1602-1675), A' Blanchard,
1962.
Ball, W. W. R. : I Short Account of ttu History of Matlumatits, Dover (reprint), 1960,
pp. 309-70.
Baron, Margaret E.z The Origins of the Infinitesimal Calculnr, Pergamon Press, 1969'
Bell, Arthur E.: Neutoni.an Scizrce, Ddward Arnold, 1961.
Boyer, Carl B.: The Coneepts of the Caleulus, Dover (reprint), 1949'
A History oJ Mattumatics, John Wiley and Sons, 1968, Chaps' 18-19'
Brewster, David; Memoirs of the Life, lrritings and Discoaetits of Sir Isau Newtan,
2 vols., 1855, Johnson Reprint Corp., 1965.
Cajori, Florian: A History of the Coneeptions of Lini* and Fluxions in Great Bitain
.1from Newlon to Woodhouse, Open Court, 1919.
A History oJ Mathematics, Macrnillan, 1919, 2nd ed., pp' l8l-220'
Cantor, Moritz: Vorlesungen iiber Geschichte der Mathematik,2nd ed., B' G' Teubner,
1900 and 1898, Vol. 2, pp.82l-922; Vol. 3, pp. 150-316'
Child, J. M.: The Gnmetri.cal Lcctures of Isaac Banow, Open Court, 1916'
The Early Mathematical Manuscipts of Leibniz, Open Court, 1920'
Cohen, I. B.: Isaac Newlon's Palers and l*lters on Natural Phihsophy, Harvard Uni-
-: versity Press, 1958.
Coolidge, Julian L. : The Mathematics of Great Amatcurs, Dover (reprint), 1963,
Chaps. 7, ll, and 12.
De Morgan, Augustus: Essays on the Life and Work of Neuton, Open Court, l9l4'
Fermat, Pierre d,e: (Euarcs, Gauthier-Villars, 189l-1912, Vol. I' pp' 133-79,
Vol. 3, pp. 12l-56.
Gibson, G. A. : "James Gregory's Mathematical Work," Proc. Edinbwgh Math'
Soc., 41, 1922123, 2-25.
Huygens, C.: CEuores comphtes,22 vols., Soci6t6 Hollandaise des Sciences' Nyhoff
1888-1950.
Leibniz, G. W.t (Euurcs, Firmin-Didot, 1859-75.
Malhematische Schriften, ed. C. I. Gerhardt, 7 vols., Ascher-Schmidt,
1849-63. Reprinted by Georg Olms, 1962.
More, Louis T.: Isaac Newtan, Dovet (reprint), 1962.
3go THE CREATION OF THE CALCULUS
Montucla, J. F,: Histoire fus mathimatiques, Albert Blanchard (reprint), 1960, Vol. 2,
pp. 102-77,34A4O3; Vol.3, pp. 102-38.
Newton, Sir Isaac: Tlu Mathematical Works, ed. D. T. Whiteside, 2 vols., Johnson
Reprint Corp., 196,1-67, Vol. I contains translations of the three basic
PaPenr on the calculus.
Matlumatical Papcrs, ed. D. T. Whiteside, 4 vols., Cambridge University
Press, 1967-71.
Mathematical Principles of Natural Phi.lcsophg, ed. Florian Cajori, 3rd ed.,
University of California Press, 1946.
-: Opticks, Dover (reprint), 1952,
Pascal, B.: (Eut H^chette, 1914-21.
es,
Scott,
-: Joseph F.: Tlu Matlumatical Work oJ John Wallis, Oxford University Press,
1938.
A Histary of Mathcmalics, Taylor and Francis, 1958, Chaps, 10-l l.
Smith, D. E.: A Source Book in Mathematics, Dover (reprint), 1959, pp. 605-26,
Struik, D. J.: A Source Book in Matlumatics, 1200-1800, Ilarvard IJniversity Press,
1969, pp. 188-316, 32,1-28.
Thayer, H, S.: Newton's Philosophy of Nature, Hafner, 1953.
Turnbull, H. W.: Thc Mathematical Discoueries of Newtan, Blackie and Son, 1945.
.............._: Jarnes Gregory Terccntenary Mcmorial Volume, Royal Society of Edinburgh,
1939.
Turnbull, H. W. andJ. F, Scott: The Coneslondence of Isaac Newtnn,4 vols., Cam-
bridge University Press, 1959-1967.
Walker, Evelyn: I Study of the Traift des indivisibles of Gilles Personz de Robertal,
Columbia University Press, 1932.
Wallis, John: Opra Mathematiea, 3 vols., 1693-99, Georg Olms (reprint), 1968.
Whiteside, Derek T. : " Patterns of Mathematical Thought in the Seventeenth
Century," Archiae for History of Exact Sciences, l, 1961, pp. 179-388.
Wolf, Abraham: A History oJ Science, Technology and Philosophg in the 16th and ITth
Cenluics,2nd ed., George Allen and Unwin, 1950, Chaps. 7-14.
IB
Mathematics as of r1oo
39r
392 MATHEMATTCS AS OF rTOO
became not only an effective methodology for its own ends but also the
superior approach to the solution of geometric problems. The greater effec-
tiveness of analytical methods in the calculus decided the competition, and
algebra became the dominant substance of mathematics.
It was Wallis and Newton who saw clearly that algebra provided the
superior methodology. Unlike Descartes, who regarded algebra asjust tech-
nique, Wallis and Newton realized that it was vital subject matter. The work
of, Desargues, Pascal, and La Hire was depreciated and forgotten, and the
geometric methods of Cavalieri, Gregory of Saint Vincent, Huygens, and
Barrow were superseded. Pure geometry was eclipsed for about a hundred
years, becoming at best an interpretation ofalgebra and a guide to algebraic
thinking through coordinate geometry. It is true that excessive reverence for
Newton's geometrical work in the Principia, reinforced by the enmity against
the Continental mathematicians engendered by the dispute between Newton
and Leibniz, caused the English mathematicians to persist in the geometrical
development of the calculus. But their contributions were trivial compared
to what the Continentals were able to achieve using the analytical approach.
What was evident by 1700 was explicitly stated by no less an authority than
Euler, who, in his Introductio in Analgsin Infnitorurn (1748), praises algebra as
far superior to the synthetic methods of the Greeks.
It was with great reluctance that mathematicians abandoned the geo-
metric approach. According to Henry Pemberton (169+-1771), who edited
the third edition of Newton's Principia, Newton not only constantly expressed
great admiration for the geometers of Greece but censured himself for not
following them more closely than he did. In a letter to David Gregory
(1661-1708), a nephew ofJames Gregory, Newton remarked that " algebra
is the analysis of the bunglers in mathematics." But his own Arithmetica
Unilersalis of 1707 did as much as any single work to establish the supremacy
of algebra. Here he set up arithmetic and algebra as the basic science,
allowing geometry only where it made demonstrations easier. Leibniz, too,
noted the growing dominance of algebra and felt obliged to say, in an un-
published essay,l " Often the Beometers could demonstrate in a few words
what is very Iengthy in the calculus . . . the view of algebra is assured, but it
is not better."
Another, more subtle, change in the nature of mathematics had been
unconsciously accepted by the masters. Up to 1550 the concepts of mathe-
matics were immediate idealizations of or abstractions from experience. By
that time negative and irrational numbers had made their appearance and
were gradually gaining acceptance. When, in addition, complex numbers,
an extensive algebra employing literal coefficients, and the notions of deriva-
tive and integral entered mathematics, the subject became dominated by
L Couturat, L.: Opuscules et Jfragments inidits de Leibniz, 1903, reprinted by Georg Olms,
1961, p. l8l.
THE TRANSFORMATION OF MATIIEMATICS 393
character, and so failed to realize that a basis for the axiomatic development
other tJran self-evident truths was needed. It is true that the new concepts
were far more subtle than the old ones; and the proper axiomatic basis, as
we now know, could not have been readily erected.
How could critical mathematicians, well-versed in Greek mathematics,
have been content to operate on a heuristic basis ? They were concerned
with major and in some cases pressing problems of science, and the mathe-
matics they employed handled these problems. Rather than seeking full
comprehension of the new creations or trying to erect the requisite deductive
structure, they salved their consciences with their successes. An occasional
recourse to philosophical or mystical doctrines succeeded in cloaking some
difficulties so that they were no longer visible.
One new goal in particular characterizes the mathematics of the seven-
teenth and succeeding centuries-generality of methods and results. We have
already noted the value placed on generality of method by Vieta, in his
introduction of literal coefficients; by the projective geometers; by Fermat
and Descartes in the exploration of curvesl and by Newton and Leibniz in
the treatment of functions. As flar as generality of results is concerned, the
accomplishments were limited. Many were just affirmations, such as that
every polynomial equation of the nth degree has n roots, or that every equa-
tion of the second degree in ,r and y is a conic. Mathematical methods and
notation were still too limited to permit the establishment of general results,
but this became a goal of the mathematical efforts.
when we enter upon physics, we comPare those proportions with the phe-
nomena of Nature. . . . " Here physics relers to experimentation and obser-
vation. Newton's mathematics would be regarded as mathematical physics
today.
Charles II in 1662 and adopted the name ofthe Royal Society ofLondon for
the Promotion of Natural Knowledge. This society was concerned with
putting mathematics and science to use and regarded dyeing, coinage,
gunnery, the refinement of metals, and population statistics as subjects of
interest. Finally, the Berlin Academy of Sciences, which Liebniz had advo-
cated for some years, was opened in 1700 with Leibniz as its first President.
In Russia, Peter the Great founded the Academy ofSciences ofSt. Petersburg
in 1724.
The academies were important, not only in making possible direct
contact and exchange of ideas, but because they also supported journals.
The first of the scientific journals, though not sponsored by an academy,
was the Journal dr Sgauans or Journal des Saoants, which began publication in
1665. This journal and the Philosophical Transadions of the Royal Sociely, which
began publication in the same year, were the first journals to include
mathematical and scientific articles. The French Acad6mie des Sciences ini-
tiated the publication Histoire de l'Acadimie Royale des Sciences attec les Mhnoircs
de Mathimatique et de Phgsique. It also published the Mhnoires dt Mathhnatique
et de Phgsique Prisentis d l'Acadhnie Rogale des Scierues par Diaers Sgattans et Lus
dans ses Assemblies, also known as tt,e Mirnoires des Sauants Etrangcrs. Another
of the early scientific journals, the Acta Eruditorum, was begun in 1682 and,
because it was published in Latin, soon acquired international readership.
The Berlin Academy of Sciences sponsored the Histoire de l'Acadhnit Royalc das
Seiences et Belles-lettres (whose title for some years was the Miscellawa
Berolinensia).
The academies and their journals opened new outlets for scientific
communication; these and later journals became the accepted medium for
publication ofnew research. The academies furthered research in that most
of them supported scientists. For example, Euler was supported by the
Berlin Academy from l74l to 1766 and Lagrange from 1766 to 1787. The
St. Petersburg Academy supported Daniel and Nicholas Bernoulli at various
times, and Euler from 1727 to l74l and again from 1766 to his death in
1783. The founding of academies by the European governments marls also
the official entry of governments into the area of science and the support of
science. The usefulness of science had received recognition.
The institutions that a modern person would expect to play the major
role in the creation and dissemination of knowledge-the universities-were
ineffective. They were conservative and dogmatic, controlled by the official
religions of the respective countries and very slow to incorporate new
knowledge. On the whole they taught just a modicum of arithmetic, algebra,
and geometry. Though there were some mathematicians at Cambridge
University in the sixteenth century, from 1600 to 1630 there were none. In
fact, in England during the early seventeenth century' mathematics did not
form part of the curriculum. ft was regarded as devilry. Wallis, who was
MATHEMATTCS AS OF rTOO
398
born in 1616, says of the education common during his childhood, " Mathe-
matics at that time with us was scarce looked on as academical but rather
mechanical-as the business of tradesmen." He did attend Cambridge
University and study mathematics there, but learned far more from inde-
pendent study. Though prepared to be a professor of mathematics, he left
Cambridge " because that study had died out there and no career was open
to a teacher of that subject."
Professorships in mathematics were founded first at Oxford in 1619 and
later at Cambridge. Prior to that, there had been only lecturers oflow status.
The Lucasian professorship at Cambridge, which Barrow was the first to
hold, was founded in 1663. Wallis himself became a professor at Oxford in
1649 and held the chair until 1702. One obstacle to the enlistment of able
professors was that they had to take holy orders, though exceptions were
made, as in the case of Newton. The British universities generally (including
also London, Glasgow, and Edinburgh) had roughly the same history: from
about 1650 to 1750, they were somewhat active and then declined in activity
until about 1825.
The French universities ofthe seventeenth and eighteenth centuries were
inactive in mathematics. Not until the end of the eighteenth century, when
Napoleon founded first-class technical schools, did they make any contri-
bution. At German universities too, the mathematical activity of those two
centuries was at a low level. Leibniz was isolated and, as we noted earlier,
he railed against the teachings of the universities. The University of G<ittin-
gen wzur founded in 1731, but rose only slowly to any position of importance
until Gauss became a professor there. The university centers of Geneva and
Basel in Switzerland were exceptions in the period we are surveying; they
could boast of the Bernoullis, Hermann, and others. The Italian universities
were of some importance in the seventeenth century but lost ground in the
eighteenth. When one notes that Pascal, Fermat, Descartes, Huygens, and
Leibniz never taught at any university and that though Kepler and Galileo
did teach for a while, they were court mathematicians for most of their lives,
one sees how relatively unimportant the universities were.
Bibli.ographE
Hahn, Roger: Thc Anatanry of a Scitntifc Instituti.on: The Pais Acadcny oJ Scimus'
1666-1803, University of California Press, 1971.
Hall, A. Rupert: The Scientific Reaolution, 1500-1800, Longmans, Green, 1954,
Chap. 7.
Hall, A. Rupert, and Marie Boas: The Concspondtnce of Henry Oldcnbwg, 4 vols.,
University of Wisconsin Press, 1968.
Hartley, Sir Harold: The Rogal Society: Its Origins awl Foundas, The Royal Society,
1960.
Ornstein, M.: The Rolc of Scientific Societies in tlu Sctcntunth Ccz!2ry, University of
Chicago Press, 1938.
Purver, Margery: Ttu Rogal Sociely, Conccpt and Crcati.on, Massachusetts Institute of
Technology Press, 1967'
Wolt Abraham: A History of Scince, Technology aal Philosophy in ,hc l6th and lTth
Centuri.es,2nd ed., George Allen and Unwin, 1950, Chap.4.
r9
Calculus in the Eighteenth Century
l. Introduction
The greatest achievement ofthe seventeenth century was the calculus. From
this source there stemmed major new branches of mathematics: differential
equations, infinite series, differential geometry, the calculus of variations,
functions of complex variables, and many others. Indeed, the beginnings of
some of these subjects were already present in the works of Newton and
Leibniz. The eighteenth century was devoted largely to the development of
some ofthese branches ofanalysis. But before this could be accomplished, the
calculus itself had to be extended. Newton and Leibniz had created basic
methods, but much remained to be done. Many new functions of one
variable and functions of two or more variables had either to be recognized
explicitly or created; the techniques of differentiation and integration had to
be extended to some of the existing functions and to others yet to be intro-
duced; and the logical foundation of the calculus was still missing. The first
goal was to expand the subject matter of the calculus and this is the subject
of the present chapter and the next one,
The eighteenth-century men did extend the calculus and founded new
branches of analysis, though encountering in the process all the pangs,
errors, incompleteness, and confusion of the creative process. The mathe-
maticians produced a purely formal treatment of calculus and the ensuing
branches of analysis. Their technical skill was unsurpassed I it was guided,
however, not by sharp mathematical thinking but by intuitive and physical
insights. These formal efforts withstood the test of subsequent critical
examination and produced great lines of thought. The conquest of new
domains of mathematics proceeds somewhat as do military conquests. Bold
dashes into enemy territory caPture stronghoids. These incursions must then
be followed up and supported by broader, more thorough and more cautious
operations to secure what has been only tentatively and insecurely grasped.
4o0
TNTRODUCTTON 4Ot
geometry of curves and surfaces, the theory of numbers, series, and the
calculus of variations, This mathematics he applied to the entire domain of
physics. He created analytical mechanics (as opposed to the older geometrical
mechanics) and the subject of rigid body mechanics. He calculated the
perturbative effect of celestial bodies on the orbit of a planet and the paths
of projectiles in resisting media. His theory of the tides and work on the
design and sailing of ships aided navigation. In this area his Scientia Naaalis
(1749) and Thiorie complitt de la constradion et dz la man@uore dts aaisseaux
(1773) arc outstanding. He investigated the bending of beams and calculated
the safety load ofa column. In acoustics he studied the propagation ofsound
and musical consonance and dissonance. His three volumes on oPtical
instruments contributed to the design of telescopes and microscopes. He was
the fust to treat the vibrations of light analytically and to deduce the
equation of motion taking into account the dependence on the elasticity and
density of the ether, and he obtained many results on the refraction and
dispersion of light. In the subject of light he was the only physicist of the
eighteenth century who favored the wave as opposed to the Particle theory.
The fundamental diflerential equations for the motion of an ideal fluid are
his; and he applied them to the flow of blood in the human body. In the
theory of heat, he (and Daniel Bernoulli) regarded heat as an oscillation of
molecules, and his Essag on Fire (1738) won a prize. Chemistry, geography,
and cartography also interested him, and he made a map of Russia. The
applications were said to be an excuse for his mathematical investigations;
but there can be no doubt that he liked both.
Euler wrote texts on mechanics, algebra, mathematical analysis,
analytic and differential geometry, and the calculus of variations that were
standard works for a hundred years and more afterward. The ones that will
concern us in this chapter are the two-vol urne Intro&uti,o in Analgsin l4finitorum
(1748), the first connected Presentation of the calculus and elementary
analysis; the more comprehensive Institutiones Calculi Diferentialis ( I 755) ; and
the three-volum e Institutiones Calculi Intcgralis (1768-70); all are landmarks.
All of Euler's books contained some highly original features' IIis mechanics,
as noted, was based on analytical rather than geometrical methods. He gave
the first significant treatment of the calculus of variations. Beyond texts he
published original research papers of high quality at the rate ofabout eight
hundred pages a year during most ofthe years ofhis life. The quality ofthese
papers may be judged from the fact that he won so many prizes for them that
these awards became an almost regular addition to his income. Some of the
books and four hundred of his research papers were written after he became
totally blind. A current edition of his collected works, when comPleted, will
contain seventy-four volumes.
Unlike Descartes or Newton before him or Cauchy after him, Euler did
not open up new branches of mathematics' But no one wrls so prolific or
THE FUNCTTON CONCEPT 4O3
could so cleverly handle mathematicsl no one could muster and utilize the
resources of algebra, geometry, and analysis to produce so many admirable
results. Euler was superbly inventive in methodology and a skilled technician.
One finds his name in all branches of mathematics : there are formulas of
Euler, polynomials of Euler, Euler constants, Eulerian integrals, and Euler
lines,
One might suspect that such a volume of activity could be carried on
only at the expense of all other interests, But Euler married and fathered
thirteen children. Always attentive to his family and its welfare he instructed
his children and his grandchildren, constructed scientific games for them,
and spent evenings reading the Bible to them. He also loved to express
himself on matters of philosophy; but here he exhibited his only weakness,
for which he was often chided by Voltaire. One day he was forced to confess
that he had never studied any philosophy and regretted having believed that
one could understand that subject without studying it. But Euler's spirit for
philosophic disputes remained undampened and he continued to engage in
them. He even enjoyed the sharp criticism he provoked from Voltaire.
Surrounded by universal respect-well merited by the nobility of his
character-he could at the end of his life consider as his pupils all the
mathematicians of Europe. On September 7, 1783, after having discussed
the topics of the day, the Montgolfiers,l and the discovery of lJranus,
according to the oft-cited words of J. A. N. C, de Condorcet, "He ceased to
calculate and to live,"
l. Thc Montgolfiers were two brothers who in 1783 first successfully madc an ascent in
a balloon filled with heated air.
4O4 CALCULUS IN THE ETGHTEENTH CENTURY
I o'd*
)a2-72
James Bernoullis had used the change of variable
b'-t2
*: a_b\ f;
this converts the integral to the form
tdt
Jmr
which is readily integrable as a logarithm function. John Bernoulli noticed
in 1702 and published in the Mhnoires of the Academy of Sciences of that
year e the observation that
o'--!l-L I \.
- a_xl,
so that the integration can be performed at once' Thus the method of partial
-4_xz-Z\o+x-
fractions was introduced. This method was also noted indePendently by
Leibniz in the Acta Eruditorum of 1702.10
In correspondence between John Bernoulli and Leibniz, the method
was applied to
rtu
J ;al-t- +,'
However, since the linear factors of ax2 + ,, + c could be complor, the
method of partial fractions led to integrals of the form
ld*
J-*+ a
in which d at least was a complex number. Both Leibuiz and John Bernoulli
nevertheless integrated by using the logarithm rule and so involved the
Iogarithms of complex numbers. Despite the confusion about complex
numbers, neither hesitated to integrate in this manner. Leibniz said the
presence of complex numbers did no harm.
John Bernoulli employed them repeatedly. In a paper published in
170211 he pointed out that, just as adzl(b2 - z') goes over by means of the
-dt
(r) 41 :4,
-* lC
then log ( -r) : log r; and since log I : 0, so is log ( - l). Leibniz countered
that d(log x) : dxlx holds only for positive x. A second round of corre-
spondence and disagreement took place between Euler and John Bernoulli
during the years 1727-31. Bernoulli maintained his position, while Euler
disagreed withit, though, at the time, he had no consistent position of his
own.
The final clarification of what the logarithm of a complex number is
became possible by virtue of related developments that are themselves
significant and that led to the relationship between the exponential and the
trigonometric functions. In l7l4 Roger Cotes (1682-1716) published 13 a
theorem on complex numbers, which, in modern notation, states that
Euler also answers. For example, Bernoulli had argued that since (-a)':a',
then log (-o)' : log a3 and so 2log (-") :2loga or log (-a) : lo9 a.
Euler counters that, since (dt/=11t : aa, then log a : Log (a\/ a7 :
log a * log \/= and so in this case, presumably log l-1 would have to
be 0. But, Euler says, Bernoulli himself has proved in another connection
that tog \/= : Gi nl2.
Leibniz had argued that since
from which one sees at least that log (-l) is not 0 (in fact, Leibniz had
said that log ( - l) is nonexistent). Euler's answer to this argument is that
from
I
i-i':I-**x2-xi*xa"''
for x : -3 one obtains
I
-2:t+3+9+27+...
and for *: I one obtains
I
i:t-l+l-l+...,
so that by adding the left and right sides
0:2 +2 + 10 + 26 +....
Hence, says Euler, the argument from series proves nothing.
After rduting Leibniz and Bernoulli, Euler gives what is, by present
standards, an incorrect argument, He writes
!6:ev:(r*1)',
wherein i is an infinitely Iarge number.l8 Then
l:n: I +!,
18. In his carlier work Euler used i (the first lettet of bStitus) for an infinitcly large
quantity. I.Jfter l7?7 he used i for y'li'
ELLIPTIC INTIGR.A'IJ 4rt
and so
g:i(xLtt-l).
Since xllr, "the root with infinitely large exponent i," has infinitely many
complex values, y has such values, and since y : log.r, then so does log x'
In fact Euler now writes le
x: a * b\/=: c(cos{ + y'JsinC)'
Letting c be ec he has
4. Elliptir Integrak
John Bernoulli, having succeeded in integrating some rational functions by
ihe method of partial fractions, asserted in the Acta Eruditorvm of 1702 that
the integral of any rational function need not involve any other tran-
scendental functions than trigonometric or logarithmic functions. Since the
denominator ofa rational function can be an zth degree polynomial in *, the
correctness of the assertion depended on whether any polynomial with real
coefficients can be expressed as a product of first and second degree factors
with real coefficients. Leibniz in his paper in the Acta of 1702 thought this
was not possible and gave the example of xa a aa- He pointed out that
{-- I and, - \/= as ordinary complex numbers he would have seen his
error. Nicholas Bernoulli (1687-1759), a nephew ofJames andJohn, pointed
out in the Acta Eruditorum of l7l9 that xa + oo : (a' + x')' - 2a2x2 :
(az + x2 + ax\/-2) (a2 + x2 - axl/2) ; thus the function ll(x4 + aa) could
be integrated in terms of trigonometric and logarithmic functions.
The integration of irrational functions was also considered. James
Bernoulli and Leibniz corresponded on this subject, because such integrands
were being encountered frequently. James in 169420 was concerned with the
elastica, the shape assumed by a thin rod when forces are applied to it-as,
for example, at its ends. For one set of end-conditions he found that the
equation of the curve is given by
. (x2 + ab\ dx
-
' \f-at_ (rc\ ab)z'
he could not integrate it in terms of the elementary functions. In connection
with this work he introduced the lemniscate, whose rectangular coordinate
equation is (r2 * y')' : a'(x' - 92) and. whose polar coordinate equation
is 12 : a2 cos 2d. James tried to find the arc length, which from the vertex
to an arbitrary point on the curve is given by
, : Jof' ---!-_
at
a,.
1/ - ,n
and surmised that this integral, too, could not be integrated in terms of the
elementary functions. Seventeenth-century attempts to rectify the ellipse,
whose arc length is important for astronomy, led to the problem of evaluating
':of'-W
Jo {(t - t2)(t - k2t2)
when the equation of the ellipse is taken as
*' .- lj ,-
a2 t bz--
and in the integrand k : (a' - b')lo'and tr : xla.The problem of finding
the period of a simple pendulum led to the integral
r:4 -{gJo
F- l"t2 d6
1/-1- 1rz;"24
Such irrational integrands also occurred in finding the length of arc of a
hyperbola, the trigonometric functions, and other curves. These integrals
were already known by 1700; and others involving such integrands kept
occurring throughout the eighteenth century. Thus Euler, in a definitive
treatment of the elastica in the appendix to his I 744 book on the calculus of
variations, obtained
,.,- @+Fx+Yxz)dx
u-t/-on-1oap*1-r*ry
where the constants are for us immaterial. Like his predecessors, he resorted
to series in order to obtain physical results'
The class of integrals comprised by the above examples is known as
elliptic, the name coming from the problem of finding the length of arc of
an ellipse. The eighteenth-century men did not know it, but these integrals
cannot be evaluated in terms of the algebraic, circular, logarithmic, or
exponential functions.2l
The first investigations of elliptic integrals were directed not so much
toward attempts to evaluate the integrals as toward the reduction of the
more complicated ones to those arising in the rectification of eltpse and
hyperbola. The reason for this approach is that from the geometrical Point
of view, which dominated at that time, the integrals for the elliptic and
hyperbolic arcs seemed to be the simplest ones. A new point of view was
opened up by the observation that the difierential equation
(7) J@) dx : !f@) ds,
21. This was proved by Liouville (Jour, de I'Ecole Polg., 14, 1833, 12'[-93)'
22. Acta Erud., Oct. 1698,462tr'= Opua, 1,249'53.
23. Gionalc dzi Iztteruti d'Italia, Vols. 19 fr'
414 CALCULUS IN THE EIGHTtsENTH CENTT'RY
Figurc 19.l
where #o and ,1 are the abscissas of rR and Rr, and P.R and Pr8, are the
tangents at P and P1 respectively. Likewise
i- rn |14---------::-:arcQQt
dz
- (QrS, -
m + 2J.o \n +I;ifi 8S).
then the sum of the two definite integrals would be 0, and we would have
(e) tc QQr - arc PPr: (QrS, - QS) - (&.B1 - PR).
A solution of (8) for m : 4 is
(lo) I.a.: r.
Then for m:4, ol the curve g : xgl3az, the difference of two arcs whose
end-values * and z are related by (10) is expressible as a straight line segment.
Fagnano also obtained integrals of (B) for m : 6 and, m : 3.
Fagnano showed further that on the ellipse, as well as on the hyperbola,
one can find infinitely many arcs whose difference can be expressed alge-
braicalln even though the individual arcs cannot be rectified. Thus in 1716
he showed that the difference of any two elliptic arcs is algebraic. Analytically
he had
(r 1)
\/ffi1
""":d)t . ffi or:o
t/Jzz + e
415
Figure 19.2
(14) arcJD+arcDG:fi+c.
(To identify this with the integrals, let I be the parameter [atus rectum]
of the ellipse. let b-2a:i, l:2a8, f: -2a' g:2as' Then z is
o7-ro"- 2*; l!r;--2a\hxz.) whenr : o, arc JD vanishes and the algebraic
i"r* ir, (14) vanishes. ny (1i), z : a, arrd so arc DG becomes arc Dl' This
is the value of C. Then one can say
arcJD*arcGD:+r*arcDA
arc JD - ur.GA:42 r.
One result from this work,'{ still called Fagnano's theorem and obtained
in 1716, states the following: Let
# *orJ,: '
24. OPcre, 2, 287-92.
416 CALCULUS IN THE EIGHTEENTH CENTURY
Figure 19.3
From l7l4 on Fagnano also concerned himself with the rectification of the
lemniscate by means of elliptic and hyperbolic arcs.
In l7l7 and, 1720 Fagnano integrated other combinations of differen-
':als. Thus he showed
that
(18) _L:_L
ll-xa {l-ra'
has the integral
(te) -: -^17t-v
*i
or
(20) x2 *!2 I x2g2:1.
One way of stating the result is: Between two integrals that express arcs ofa
lemniscate (for which a : l), an algebraic relation exists, even though each
integral separately is a transcendental function ofa new kind.
Fagnano then proceeded to establish a number of similar relations in
order to obtain special results about the lemniscate.25 For example, he
25. Gionale dci l*uetati d,Itatia. 30, l7l8, 87 ff. : O?eru,2, gO4_tZ.
ELLIPTIC INTEGRALS 417
Figure 19.4
showed that if
d* 2du
(21)
t/l - xn {l-yn
then
(22)
tt-t-_:^ : {-, - *,
y{2 {l+x
or by solving for *,
From a variety ofsuch results Fagnano showed how to find the points on the
lemniscate (thevaluesofrinr2 : a2 cos 2d) that divide the quadrant, that is,
the arc CQA in Figure 19.4, into z equal parts for certain values of n' He
also showed how, given an arc CS, one can find the point f on this arc that
divides C,5 into two equal parts' Further, he found the points on arc CQA
which, when joined to C, divide the area under CQA and the horizontal axis
into two, three, and five parts; and given the chords that divide this area
into n equal parts, he found the chords bisecting each of these parts'
Thus Fignano had done more than answer Bernoulli's question; he
had shown that the same remarkable algebraic property that characterized
the integrals representing logarithmic and inverse trigonometric functions
held for at least certain classes of elliptic integrals.
About 1750 Euler noted Fagnano's work on the ellipse, hyperbola, and
lemniscate and began a series of investigations of his own. fn a paper'
" Observationes de Comparatione Arcuum Curvarum lrrectificabilium," '8
Euler, after repeating some of Fagnano's work, showed how to divide the
area of a quadrant of the lemniscate into z + I parts if it has already been
divided inio z parts. He then points out that his and Fagnano's work has
furnished some useful results on integration, for the equation (18) has,
besides the'obvious integral 7 : y, the additional particular integral
26. Novi Comm. Acad. Sci. Petrop.,6, 175617,5&{4, pub. 176l = Olerut (l), 20, 80-107'
4I8 ciALcULUs IN TIIE EIGIITE'ENTH oE,NTURY
(26) .:----|-;FF-
u\/T-=V + ct/|1 u+
07\ f* d* : f ! dx tc drc
J o 1/1i J o m - Jo
where a is a constant, is also a complete integral of (lB). Hence *, g, and, c
must be related by (25). Thus the addition theorem says that if (27) holds -'=-,ci
for the elliptic integrals therein, then the upper limit x is an algebraic
symmetric function, namely (26), of the arbitrarily chosen upper limits y
and c of the other two integrals. The addition theorem applies to more
general integrals, as we shall see.
27. Nooi Conm. Acad. Sci. Parop.,6, t75617,37-57, pub. 176l : Olcra, (l),20,58-79.
ELLIPTIC INTEORAIS 4I9
lv d,
|
fx dtc
(28) :-nl
)n <rf1-xe Jo1/l-xa
theny is an algebraic function of ,. This result is called Euler's multiplication
theorem for tf,e eiliptic integral li a4t/C7. From this result the com-
--
where x and Y are polynomials of the fourth degree of which four coefficients
(the same in Xand I) can be expressed in terms of the five in (29) with the
help of an arbitrary constant. Then (29) is the complete integral of (30),
and when (30) is speciatized to (IB), then (29) becomes (25)' Euler remarks
that it is wonderful that, even though an integral of dxl{T cannot be
obtained in terms of circular or logarithmic functions, the equation (30) is
satisfied by an algebraic relation. He then generalizes the results to
mtk
---: ndu
mlnfanOnal,
(31)
\/x \/Y
--,
where X and I are fourth degree polynomials with the same coefficients'
The result is also in r,rtler,s Institutiancs calculi Integtalis,zs where he shows
what the results mean geometrically in terms of ellipse, hyperbola, and
lemniscate.
From these results Euler was able to proceed with what is now called
the addition theorem for elliptic integrals of the first kind. consider the
elliptic integral
fdx
t-
(32)
J \/@
28. Noai Comm. Acad. Sci. Pcttop.,7, 175819,3-48, pub. 176l : Olaa, (1), 20' 153-200'
29. Vol. 1, Sec. 2, Chap. 6 = Olna, (l), ll, 391-423'
420 CALCULUS IN THE EIGHTEENTH CENTURY
(34)
I dtc
t-
J \/86
is equated to a third integral of the same form, and if further the lower limit
of integration and the coefficients under the radical are the same for all
three integrals, then the upper limit of integration ofthe third integral is an
algebraic function of the two other upper limits, the common lower limit,
and the corresponding values of {E@ at the two upper limits and the lower
limit.
Euler went further. Just as Fagnano's treatment of the difference of
two lemniscate arcs led him to the general elliptic integral of the first kind,
so Fagnano's treatment of the difference of two elliptic arcs (see [l l]) led
Euler to an addition theorem for integrals ofa second kind.so He expressed
regret that his methods were not extensible to higher roots than the square
root or to radicands ofhigher than fourth degree. He also saw a great defect
in his work in that he had not obtained his complete algebraic integrals by a
general method of analysis; consequently his results were not naturally
related to other parts of the calculus.
The definitive work on elliptic integrals was done by Adrien-Marie
Legendre (1752-1833), a professor at the Ecole Militaire who served on
several governmental committees; he later became an examiner of students
at the Ecole Polytechnique. Up to his death in lB33 he never ceased to work
with passion and regularity. His name lives in a great number of theorems,
very varied, because he tackled most diverse questions. But he had neither
the originality nor the profundity of Lagrange, Laplace, and Monge.
Legendre's work gave birth to very important theories, but only after it was
taken over by more powerful minds. He ranks just after the three con-
temporaries just mentioned.
30, Noai Comm, Acad. Sci. Pettop., 7, 175819, 3-48, pub. 176l : Opera, (l), 20, 153-200
:
and Inst. Cal. Integ., l, fl645 Opera, (l), I l, '1T645.
ELLIPTIC INTEGRAI,S 421
Euler's addition theorems were the main results of the theory of elliptic
integrals when Legendre took up the subject in 1786. For four decades he
*ur1h" only man who added investigations concerning these integrals to the
literature. i{e devoted two basic papers to the subject,3l then wrote the
Exerciccs de calcul intlgrat (3 vols., l8ll, l8l7, 1826), the Traiti des fonctions
elliptiquessz (2 vols., 1825-26), and three supPlements giving accounts of the
*oit of eU.t and Jacobi in 1829 and 1832. Euler's results, like Fagnano's,
were tied to geometrical considerations I Legendre concentrated on the
analysis proper.
i"g.rrai"" chief result, which is in lrris Trai.ti, was to show that the
general elliptic integral
P(*) *,
IJ t/R(x\
(3s)
where P(r) is any rational function of x and,R(.r) is the usual general fourth
degree polynomial, can be reduced to three types:
(36) ldx
J1/-t- xr1/l - ti?
I
I
xz dx
(37)
J 'n-:Arn-=eP'
dx
(38)
Legendre designated these three types as elliPtic inte$als of the fust, second,
and third kinds, resPectivelY.
He also showed that by further
-. transformations these three integrals can
be reduced to the three forms
(3e) F(k,6) : J: d6
0<r<l
\/l - k2 sinz 6
(40) E(k,$):
I: \/T-=-FR6a6, 0</c<l
where z is any constant. In these forms one sees that the values of the integrals
from{:0 io{: tl2 ate repeated but in reverse order from $: '.l2to
31. Hist. de l'Acad.des Sci., Paris, 1786, 616-43 and 6'1'['{3'
32. The use ofthe word,,function" in this text is misleading. Hc studied elliPtic integrals
and at times those with variable upper limis. These are, of course, functions of the upper
limits. But the term,.elliptic functions " refers today to the functions introduced later by
Abel and Jacobi.
422 _
CALCULUS rN THE ETGTTTEENTH CENTURY
(42)
(43)
The quantity,t is called the modulus of each elliptic integral. If the limits of
integration are $ : zrl2 or x: l, then the integrals are called complete;
otherwise, incomplete.
There was much merit in Legendre's work on elliptic integrals. He
drew many inferences, previously unstated, from the work ofhis predecessors
and otganized the mathematical subject; but he did not add any basic
ideas, nor did he attain the new insight of Abel and Jacobi (Chap. 27,
sec. 6) who inverted these integrals and so conceived of the elliptic
functions.
Legendre did get to know the work ofAbel andJacobi and praised them, with
much humility and, probably, some bitterness. In devoting the supplements
to his 1825 work to tleir new ideas, he understood very well that this material
threw into the shade all he had done on the subject. He had overlooked one
of the great discoveries of his epoch.
(46)
;:(#\#)(#)(H)
In the notation l(z * l) : z!, introduced later by Legendre' Euler also
showed that l(z + t) : zl(z) and so obtained I(3/2L I(5/2)' and so forth'
Euler couid have used (45) as his generalization ofthe factorial concePt.
In f,rct, it is often introduced today in the equivalent form, which Euler also
gave, namely,
.. ml (n
* 1)"
(47)
;'ji 6ry0, + 2)..'(n + n)
But the connection with wallis's result led Euler to take uf an integral
already considered by Wallis, namely,
wherein for Euler e and n are now arbitrary' Euler evaluated this integral by
expanding (l - *)" by the binomial theorem and obtained
(4e)
tt x"(1 - x)" dx
: # _
w+a . #6h _,,i,;,ru)c!r.nl, *
For z : O, 1,2, 3,. . . the sums on the right side are respectively
(5oi ;
I 1 1.2
+-t'GT rjGTT)'G +TGTECTT)'
l-2.3
424 CALCULUS IN THE EIGHTEENTII CENTURY
(51)
(52)
",
: I' (-rog x)* dx.
This integral has meaning for almost all arbitrary z. It is called the second
Eulerian integral, or, as Legendre later called it, the gamma function, and
is denoted by I(n + l). [Gauss let r(n) : l(z + l)]. Later, in lTBl (pub.
1794), Euler gave the modern form, which is obtained from (52) by letting
,: -log *,
(s3) I(z + l) : *"r-" d,*.
[*
The integral (48), which Legendre called the fust Eulerian integral,
became standardized as the beta function,
' B(*,, :
I:in-L(t -
(54) x),,-t 4r.
B(m,n):W
In his Exercices de calcul intigral, Legendre made a profound study of the
Eulerian integrals and arrived at the duplication formula
36. Nooi Comrn. Acad. Sci. Petrop., 16, 1771,91-139, pub. 1772 : Operu, (l), 17, 316-57.
37. Comm. Soc. Gott.,II, l8l3 - Werhe,3, 123-162, p. 149 in part.
TrrE oALCULUS OF FUNCTIONS OF SEVERAL VARTABLES 425
38. Mlm. de l'Acatl. des Sci., Pa*, 1739, 425-36, and 1740, 293-323'
39. Comm. Acatl. Sci. Petro!.,7, 173415, 174'93, pub. l7'tO = Olera, (l), 22, 3G56'
426 cALcuLUs rN THE ErcrrrEENTH cENTrrRy
^6, lf cdtdu
)) 6 a-p _1 g--==n
taken over the ellipse (xala') 4 (y'lb') : 1. This was evaluated by Euler in
1738 by repeated integration.ao He integrated with respect to y and used
infnite series to expand the new integrand as a function ofx.
By 1770 Euler did have a clear conception of the definite double in-
tegral over a bounded domain enclosed by arcs, and he gave the procedure
for evaluating such integrals by repeated integration.al Lagrange, in his
work on the attraction by ellipsoids of revolution,as expressed the attraction
as a triple integral. Finding it difficult to evaluate in rectangular coordinates,
he trarxformed to spherical coordinates. He introduced
x:o+rsin{cos0
y:b+rsin/sind
z: c * rcosit
where a, b, arl.d c are the coordinates of the new origin, 0 is the longitude
angle, $ the colatitude, and 0 < 6 < n, O < 0 < 2zr. The essence of the
transformation of the integral is to replace dx dy dz by r2 sin 0 d0 d$ dr. Tht*
Lagrange began the subject of the transformation of multiple integrals. In
fact, he gave the general method, though not very clearly. Laplace also gave
the spherical coordinate transformation almost simultaneously.ao
were treated either as infinitesimals, that is, quantities not zero but not of
any finite size, or sometimes, as zero.
Brook Taylor (16B5-173I), who was secretary of the Royal Society
lrom I 714 to I 7 18, in his Methodus hurementorum Directa et Inoersa (1715) ,
sought to clarify the ideas of the calculus but limited himself to algebraic
fu.rctiorrs and algebraic differential equations. He thought he could always
deal with finite increments but was vague on their transition to fluxions.
Taylor's exposition, based on what we would call finite differences, failed
to obtain many backers because it was arithmetical in nature when the
British were trying to tie the calculus to geometry or to the physical notion
of velocity.
Some idea of the obscurity of the eighteenth-century efiorts and their
Iack of success may also be gained from Thomas Simpson's (17l04l) A New
Treatise on Fluxions (1737). After some preliminary definitions, he defines a
fluxion thus: "The magnitude by which any flowing quantity would be
uniformly increased in a given portion of time with the generating celerity at
anyproposedpositionorinstant(was[were]itfromt}rencetocontinue
irrvariable) is the fluxion of the said quantity at that Position or instant." In
our language Simpson is defining the derivative by saying that itis (dy ldt) Lt'
Some artho.s gavi up. The French mathematician Michel Rolle at one point
taught that the calculus was a collection of ingenious fallacies'
The eighteenth century also witnessed new attacks on the calculus' The
strongest w;s made by Bishop George Berkeley (1685-1753), who feared the
gro*irrg threat to religion posed by mechanism and determinism' In 1734 he
puulistred The Analyst, or A Discourse Addrxscd to an Itfidel Mathematician,
Wurein It is examined uhether tlu Object, Prhuiples, and Infnaucs of tlu morhm
Analysis afe ,n\re distbutly corceiacd, or more caidently dchued, than Rcligious
Mysteries and Points of Faith. " First cast out the beam out of thitu oum Eyc; and thm
shilt thou see clearly to cast out the ,notc out oJ thg brotlur's Eye'" (T;oe "infidel"
was Edmond Halley.)aa
Berkeley rightly pointed out that the mathematicians were proceeding
inductively rather than deductively. Nor did they give the logic or reatons
for their siep.. H. criticized many of Newton's arguments; for example, in
the latter,s be euadraUra, where he said he avoided the infinitely small, he
gave .f the increment denoted by a, expanded (x + o)", subtracted the *n,
Ji*rid"d by a to find the ratio of the increments ofx" and r, and then dropped
terms involving o, thus obtaining the fluxion of *". Berkeley said that
Newton first did give * an increment but then let it be zero' This, he said, is
a defiance of the law of contradiction and the fluxion obtained was really
0/0. Berkeley attacked also the method of differentials, as presented by
I'i{ospital and others on the Continent. The ratio of the differentials, he
44. George BerLel ey: Ttu Wotk, G. Bell and Sons, 1898, Vol' 3, l-51'
428 cALcuLUs rN THE ETcHTEET{TH cENTURy
said, should determine the secant and not the tangent; one undoes this
error by neglecting higher differentials. Thus, ..by virtue of a twofold
mistnte you arrive, though not at a science, yet at the truth,,, because errors
were compensating for each other. He also picked on the second differential
d (dr) because it is the difference of a quantity dx that is itself rhe least dis-
cernible quantity. He says, "In every other science men prove their con-
clusions by their principles, and not their principles by the conclusions."
As for the derivative regarded as the ratio of the evanescent increments
iny and x or dy and dr, these were "neither finite quantities, nor quantities
infinitely small, nor yet nothing." These rates of change were but,.the ghosts
of departed quantities. Certainly . . . he who can digest a second or third
fluxion . . . need not, methinks, be squeamish about any point in Divinity.,,
He concluded that the principles of fluxions were no clearer than those of
Christianity and denied that the object, principles, and inferences of the
modern analysis were more distinctly conceived or more soundly deduced
than religious mysteries and points of faith.
A reply to the Aralgst was made by James Jurin (1684-1750). In 1734
he published Gcometry, No Friend to hfidtlitg, in which he maintained that
fluxions are clear to those versed in geometry. He then tried ineffectually to
explain Newton's moments and fluxions. For example, Jurin defined the
limit of a variable quantity as "some determinate quantity, to which the
variable quantity is supposed continually to approach,' and to come nearer
than any given difference. However, then he added, .,but never to go
beyond it." This definition he applied to a variable ratio (the difference
quotient). Berkeley's crushing answer, entitled A Defensc of Fruthinking in
Matlunatics (1735),45 indicated that Jurin was trying to defend what he did
not understand. Jurin replied, but did not clarify matters.
Benjamin Robins (1707-51) then entered the fray with articles and a
book, ..4 Discourse Concerning the Nature and Certaintg of Sir Isaat Newton, s
Mcthod of Fluxions and of Prime and Ultimate Ratios (1735). Robins neglected
the moments of Newton's first paper but emphasized fluxions and prime and
ultimate ratios. He defined a limit thus: "We define an ultimate magnitude
to be a limit, to which a varying magnitude can approach within any degree
of nearness whatever, though it can never be made absolutely equal to it.,,
Fluxions he considered to be the right idea, and prime and ultimate ratios
as only an explanation. He added that the method of fluxions is established
without recourse to limits despite the fact that he gave explanations in terms
of a variable approaching a limit. Infinitesimals he disavowed.
To answer Berkeley, Colin Maclaurin (1698-1746), in his Treatise of
to establish the rigor of the calculus. It was a
Fluxions (1742), attempted
commendable effort but not correct. Like Newton, Maclaurin loved geom-
45. George Berkeley: The Works, G. Bell and Sons, 1898, Vol. 3,53J9.
THE ATTEMPIS rO SUPPLY RrCOR rN THE CALCULUS 429
etry, and therefore tried to found the doctrine of fluxions on the geometry
of the Greeks and the method of exhaustion, particularly as used by Archi-
medes. He hoped thereby to avoid the limit concept. His accomplishment
was to use geometry so skillfully that he persuaded others to use it and
neglect analysis.
The Continental mathematicians relied more upon the formal manipu-
lation of algebraic expressions than on geometry. The most important
representative of this approach is Euler, who rejected geometry as a basis for
the calculus and tried to work purely formally with functions, that is, to
argue from their algebraic (analytic) representation.
He denied the concept of an infinitesimal, a quantity less than any
assignable magnitude and yet not 0. In tris Institutiones of 1755 he argued,a6
z2 zB z4
(57) log" (l + z) -234 J--- + ...
Replacing z by dxlx give
- dx dtz
dv:;-*+# drca
Since all the terms beyond the first one are evanescent, we have
d(tog O :4.
We should keep in mind that Euler's texts were the standard of his day.
What Euler did contribute in his formalistic approach was to free the calculus
from geometry and base it on arithmetic and algebra. This step at least
prepared the way for the ultimate justification of the calculus on the basis
of the real number system.
Lagrange, in a paper of I 772 a7 and in his ThCorfu dzsforutions analytiquzsao
made the most ambitious attempt to rebuild the foundations of the calculus.
The subtitle ofhis book reveals his folly. It reads: " Containing the principal
theorems of the differential calculus without the use of the infinitely small,
or vanishing quantities, or limits and fluxions, and reduced to the art of
algebraic analysis of finite quantities."
Lagrange criticizes Newton's approach by pointing out that, regarding
the limiting ratio of arc to chord, Newton considers chord and arc equal not
before or after vanishing, but when they vanish. As Lagrange correctly points
out, "That method has the great inconvenience of considering quantities in
the state in which they cease, so to speak, to be quantities; for though we can
always properly conceive the ratios of two quantities as long as they remain
finite, that ratio offers to the mind no clear and precise idea, as soon as its
terms both become nothing at the same time." Maclauin's Treatise of
Flu*ions, he says, shows how difficult it is to demonstrate the method
of fluxions. He is equally dissatisfied with the little zeros (infinitesimals) of
Leibniz and Bernoulli and with the absolute zeros of Euler, all of which,
" although correct in reality are not sufficiently clear to serve as foundation
ofa science whose certitude should rest on its own evidencc."
47. Nouo. Mlm, de l'Acad. de Bcrlin, 1772, pub. 1774 : (Euwcs, 3, 441-76.
48. 1797;2nd ed., l8l3 : (Euares,9.
THE ATTEMPIS TO SUPPLY RICOR IN TIIE CALCULUS 43t
Lagrange wished to give the calculus all the rigor of the demonstrations
of the ancients and proposed to do this by reducing the calculus to algebra,
which, as we noted above, included infinite series as extensions of poly-
nomials. In fact, for Lagrange, function theory is the part of algebra con-
cerned with the derivatives of functions. Specifically Lagrange proposed to
use power series. with becoming modesty he remarks that it is strange that
this method did not occur to Newton.
He now wants to use the fact that any function ;f(*) can be expressed
thus:
(58) f(z + h) :f(r) + ?h i qh2 a rho * rla * "'
wherein the coefficients, ?,4,r,'.. involve , but are independent of fr'
However, he wishes to be sure before proceeding that such a power series
expansion is always possible. Of course, he says, this is known through any
,rumbe. of familiar examples, but he does agree that there are exceptional
cases. As exceptional cases Lagrange has in mind those in which some deriva-
tive of;f(x) bicomes infinite and those in which the function and its deriva-
tives become infinite. These exceptions happen only at isolated Points;
hence, with Lagrange they do not count. In a similar cavalier fashion he
deals with a secorrd difficulty. Lagrange and Euler accepted without question
that a series expansion containing integral and fractional powers of I was
surely possible, but Lagrange wished to eliminate the need for fractional
porr".t. Fractional Powers, he believed, could arise only if/(r) contained
radicals. But these, ioo, he dismisses as exceptional cases' Hence he is ready
to proceed with (58).
By a somewhat involved but purely formal argument Lagrange con-
cludes that we get 2q from y' in the same way that we get P fromrf(r), and a,
similar conclusi,on holds ficr the other coefficients r, J, " ' in (58)' Hence if
we let, be denoted by .f '(*) and designate by -f'(*) a function derived from
f '(x) as f '(*) is derived from;f(x), then
p : "f'(r), t: L-r'@), ,: {!r"ti,-..,
and so (58) gives
Lagrange has yet to show how one derives p or f' (x) from;f(r). Here he
uses (58) and neglects all terms after the second. Then,f(x + h) -f(x) : ph.
He divides by and concludes that p : f' (rc) .
i
Actually Lagrange's assumption that a function can be expanded in
a power series is one weak point in the scheme. The criteria now known
for such an expansion involve the existence of derivatives, and this is what
Lagrange sought to avoid. His arguments to justify the power series only
added to the confusion about which functions can be so expanded. Even if
such an expansion is possible, Lagrange shows how to calculate the coeffi-
cients only ifwe can get the first one, that is,rf'(r); and here he does the same
crude thing his predecessors did. Finally, the question of the convergence of
the series (58) is really not discussed. He does show that for i small enough
the last term kept is greater than what is neglected. He also gives in this
book the Lagrange form of the remainder in a Taylor expansion (Chap. 20,
sec. 7), but it plays no role in the above development. Despite all these weak-
nesses, Lagrange's approach to the calculus met great favor for quite some
time. Later it was abandoned.
Lagrange believed he had dispensed with the limit concept. He did
agreeae that the calculus could be founded on a theory of limits but said the
kind of metaphysics one must employ is foreign to the spirit of analysis.
Despite the inadequacies of his approach, he did contribute, as did Euler,
to divorcing the foundations of analysis from geometry and mechanics; in
this his influence was decisive. While this separation is not pedagogically
desirable, since it bars intuitive understanding, it did make clear that
logically analysis must stand on its own feet.
Toward the end of the century the mathematician, soldier, and ad-
ministrator Lazare N. M. Carnot (1753-1823) wrote a popular, widely sold
book, Rrflexions sur la mitaphysique du calcul hfinitisimal (1797), in which he
sought to make the calculus precise. He tried to show that the logic rested on
the method of exhaustion and that all the ways of treating the calculus were
but simplifications or shortcuts whose logic could be supplied by founding
them on that method. After much thought he, like Berkeley, ended up
concluding that errors in the usual arguments of the calculus were com-
pensating for each other.
Among the multitude of efforts to rigorize the calculus, a few were on the
right track. The most notable of these were d'Alembert's and, earlier, Wallis's.
D'Alembert believed that Newton had the right idea and that he himself
was merely explaining Newton's meaning. In his article " Diff€rentiel "
in the famous Encydopidie ou Dictionnaire Raisonni des Scimces, des Arts, et des
Mitins (1751-80), he says, " Newton has never regarded the differential
calculus as a calculus of infinitesimals, but as a method of prime and ulti-
mate ratios, that is to say, a method of finding the limit of these ratios." But
d'Alembert defined a differential as "an infinitely small quantity or at least
smaller than any assignable magnitude." He did believe the calculus of
Leibniz could be built up on three rules ofdifferentials; however, he favored
the derivative as a limit. In his pursuit of the use of limits he, like Euler,
argued that 0/0 may be equal to any quantity one wishes.
In another article, " Limite, " he says : " The theory of limits is the true
metaphysics of the calculus. . . . It is never a question of infinitesimal quan-
tities in the differential calculus: it is uniquely a question of limits of finite
quantities. Thus the metaphysics of the infinite and infinitely small quan-
tities, larger or smaller than one another, is totally useless to the differential
calculus." Infinitesimals were merely a manner of speaking that avoided the
lengthier description in terms of limits. In fact, d'Alembert gave a good
approximation to the correct definition of limit in terms of a variable quan-
tity approaching a fixed quantity more closely than any given quantity,
though here too he talks about the variable never reaching the limit. How-
ever, he did not give a formal exposition of the calculus that incorporated
and utilized his basically correct views.
He, too, was vague on a number of Points; for example, he defined the
tangent to a curve as the limit of the secant when the two Points of inter-
section become one. This vagueness, especially in his statement of the notion
of limit, caused many to debate the question of whether a variable can reach
its limit. Since there was no explicit, correct Presentation, d'Alembert
advised students of the calculus, " Persist and faith will come to you."
Sylvestre-Frangois Lacroix (1765-1843), in the second edition (lBI0-
l8lg) of his Traiti du calcul dffirentiel et du calcul intCgtal had the idea more
explicitly that the ratio of two quantities, each of which approaches 0, can
approach a definite number as a limit. He gives the ratio axl(ax + r') and
notes that this ratio is the same as al(a * *), and the latter approaches I as
* approaches 0. Moreover, he points out that I is the limit even when x
approaches 0 through negative values, Ilowever, he also speaks of the ratio
of the limits when these are 0 and even uses the symbol 0/0' He does intro-
duce the differential dy ofa function y : -f(r) in terms ofthe derivative; that
is, dg : f'(x) dx. Hence if g : axs, dy : 3*'d*. He first used the term
" differential coefficient" for the derivative; thus 3a*2 is the difierential
coefficient.
Almost every mathematician of the eighteenth century made some
effort or at least a pronouncement on the logic of the calculus, and though
one or two were on the right track, all the eflorts were abortive. The dis-
tinction between a very large number and an infinite " number" was hardly
made. It seemed clear that a theorem that held for any z must hold for z
infinite. Likewise a difference quotient was replaced by a derivative, and a
sum of a finite number of terms and an integral were hardly distinguished'
434 CALCULUS rN THE ETGHTEENTH CENTURY
Mathematicians passed from one to the other freely. In 1755, in his Inslitu-
tioncs, Euler distinguished between the increment in a function and the
differential of that function and between a summation and the integral, but
these distinctions were not immediately taken up. All the efforts could be
summed up in Voltaire's description of the calculus as "the art of numbering
and measuring o<actly a Thing whose Existence cannot, be conceived."
In view of the almost complete absence of any foundations, how could
the mathematicians proceed with the manipulation of the variety of func-
tions ? In addition to their great reliance upon physical and intuitive mean-
ings, they did have a model in mind-the simpler algebraic functions, such
as polynomials and rational functions. They carried over to all functions the
prop€rties they found in these explicit, concrete functions: continuity, the
existence of isolated infinities and discontinuities, expansion in power series,
and the existence of derivatives and integrals. But when they were obliged,
largely through the work on the vibrating string, to broaden the concept of
function, as Euler put it, to any freely drawn curves (Euler's mixed or
irregular or discontinuous functions), they could no longer use the simpler
functions as a guide. And when the logarithmic function had to be extended
to negative and complex numbers, they really proceeded without any
reliable basis at all; this iS why arguments on such matters were common.
The rigorization of the calculus was not achieved until the nineteenth
century.
Bibliography
Bernoulli, James l Operu, 2 vols., 1744, reprinted by Birkhaiiser, 1968.
3stn6r'lli, John: Opera Omnia, 4 vols., 1742, reprinted by Georg Olms, 1968.
Boyer, Carl B,z Tlu Conccpts oJ tlu Calculus, Dover (reprint), 1949, Chap. 4.
Brill, A. and M. Ndther: "Die Entwicklung der Theorie der algebraischen Funk-
tionen in dlterer and neuercr Zeit," Jahres. dar Deut. Math.-Vercin.,3, 189213,
107-566.
Cajori, Florian: " History of tle Exponential and Logarithmic Concep*," Amar.
Math. Monthly, 20, 1913, 5-14, 3547, 75-44, 107-17, 148-51, 173-42,
205-10.
A History oJ tlu Corceptions of Limits and Fluxions in Great Bitain rtom Newton
to Woodhousc, Open Court, 1919,
"The History of Notations of the Calculus," Annals of Math., (2),25, 1923,
t-46.
-:
Cantor, Moritz : Voilesungm ilbcr Gcschirhte dn Mathematik, B. G. Teubner, 1898 and
1924, Vols. 3 and, 4, relevant sections,
Davis, PhilipJ.: " Leonhard Euler's Integral: A Historical Profile of the Gamma
Function," Aner, Math. Monthly,66, 1959, 849-69.
Euler, Leonhard l Opera Omnia, B. G. Teubner and Orell Fiissli, l9l l-; see refer-
ences to specific volumes in the chapter.
43s
l. Introduti.on
Infinite series were in the eighteenth century and are still today considered
an essential part of the calculus. Indeed, Newton considered series insepa-
rable from his method of fluxions because the only way he could handle even
slightly complicated algebraic functions and the transcendental functions
wan to expand them into in$nite series and differentiate or integrate term by
term. Leibniz in his first published papers of 1684 and 1686 also emphasized
" general or indefinite equations." The Bernoullis, Euler, and their contem-
poraries relied heavily on the use ofseries. Only gradually, as we pointed out
in the preceding chapter, did the mathematicians learn to work with the
elementary functions in closed form, that is, as simple analytical expressions.
Nwertheless, series were still the only representation for some functions and
the most effective means of calculating the elementary transcendental
functions.
The successes obtained by using infinite series became more numerous
as the mathematicians gradually extended their discipline' The difficulties
in the new concept were not recognized, at least for a while. Series were just
infinite polynomials and appeared to be treatable as such. Moreover, it
seemed clear, as Euler and Lagrange believed, that every function could be
expressed as a series.
$6
INITIAL W,ORK ON INFINTTE SERIES 437
the distance traveled by moving bodies when the velocity changes from one
period of time to another. Oresme, who had considered a few such series,
even proved in a tract, Quastiones Super Geometriam Euclidis (c' 1360), that the
harmonic series
llll
t+r+1+4+5+'..
is divergent by the method used today, namely, to replace the series by the
series of lesser terms
i*i*(}.i) *(i.*****) .
and to note that the latter series diverges because we can obtain as many
groups of terms each of magnitude 1/2 as we please. However, one must not
conclude that Oresme or mathematicians in general began to distinguish
convergent and divergent series.
lr his Varia R.esponsa (1593, Opera,347435) Vieta gave the formula for
the sum of an i4finite geometric progression. He took from Euclid's Elemnts
that the sum ofz terms ofal I az * " ' * oo is given by
sn- dt _dt.
Sr.
- At llg
Then if a1la, > l, a, approaches 0 as z becomes infinite, so that
a?
s-:f,-r_ar'
In the middle of the seventeenth century Gregory of Saint Vincent, in
his Opus Geotnetricum ( I 647), showed that the Achilles and the Tortoise
paradox could be resolved by summing an infinite geometric series. The
finiteness of the sum showed that Achilles would overtake the tortoise at a
definite time and place. Gregory gave the first e:<plicit statement that an
infinite series represents a magnitude, namely, the sum of the series, which
he called the limit of the series. He says the " terminus of a progression is the
end of the series to which the progression does not attain, even if continued
to infinity, but io which it can approach more closely than by any given
interval." He made many other statements that are less accurate and less
clear, but he did contribute to the subject and influenced many pupils'
Mercator and Newton (Chap. 17, sec. 2) found the series
arc cos .r
Figure 20.1
difficulty but could not explain it. Newton oltained many other series for
algebraic and transcendental functions. Thus, to obtain the series for arc
sin x, in 1666, he used the fact (Fig. 20.1) that the area OBC: (l 12)
arc sin r, so arc sin rc : I; \/T=7 dx - x{T17/2. He got the result by
expanding the right side into series, integrating term by term, and combining
the two series. He also obtained the series for arc tan *. In his De Analysi of
1669 he gave the series for sin *, cos r, arc sin x, and, ex. Some of these he got
from others by inverting a series, that is, solvingit for the independent variable
in terms of the dependent variable. His method of doing this is crude and
inductive. Nevertheless, Newton was immensely pleased with his derivation
of so many series.
Collins received Newton's De Analysi in 1669 and communicated the
results on series to James Gregory on December 24, 1670. Gregory answered
(Turnbull, Conespondence, l, 52-58 and 61-64) on February 15, 167l that he
had obtained other series, among them
James Bernoulli in 17022 derived the series for sin x and cos * by using
expressions he had derived for sin nc in terms of sin a and then letting c ap-
proach 0 vrhile z becomes infinite, so that zcr approaches x while z sin cr, which
equals zd sin c/c, also approaches .r. Wallis had mentioned in the Latin
edition of his Algebra (1693) that Newton had given these series again in
1676; Bernoulli noted this remark but flailed to acknowledge Newton's
priority. Moreover, de Moivre gave a proof of Newton's results in the
Philosophical Transactions of 1698;8 though Bernoulli used and referred to this
journal in other work, he gave no indication that he was aware through this
source of Newton's work.
One ofthe major uses ofseries beyond their service in differentiation and
integration is to calculate special quantities, such as 7r and a, and the loga-
rithmic and trigonometric functions. Newton, Leibniz, James Gregory,
Cotes, Euler, and many others were interested in series for this purPose'
However, some series converge so slowly that they are almost useless for
calculation. Thus Leibniz in 1674a obtained the famous result
7t-llI
4:t-5+5-1+...
However, it would require about 100,000 terms to comPute z, even to the
accuracy obtained by Archimedes. Likewise, the series for log (l { r) con-
verges very slowly, so that many terms have to be taken into account to
achieve an accuracy of a few decimal places. This series was transformed in
various ways to produce more rapidly converging series. Thus James
Gregory (Exercitationes Geometrhae, 1668) obtained
;ffi+i-}:"+1""+l"u+ '
which proved to be more useful for the calculation of logarithms. The
problem of transforming a series into another that converges more rapidly
was pursued by many men throughout the eighteenth century. One such
transformation, due to Euler, is given in Section 4.
Still another use of series was initiated by Newton. Given the implicit
functionrf(r, y) : 0, to work with y as a function of * one wants the explicit
function. There may be several such explicit functions, as is evident in the
trivial case of x2 a gz - I : 0, which ias two solutions y : t \n-=V,
both emanating from the point (1,0). In this simple case the two solutions
can be expressed in terms of closed analytic expressions. But generally each
expression for y must be expressed as an infinite series in r. However, these
series are not necessarily power series, Particularly if the points at which the
2. Opera, 2, 921-29.
3. Vol. 20, 190-93.
4. Math. Schri,ftn, 5,88-92; also Acta Etud., 1682 : Math. Schtifttn,5, ll8-22'
440
Eruditomm of 1694; and though Taylor knew this result he did not refer to it.
His own " proof" was different. What he did amounts to letting c be A* in
the Gregory-Newton formula. Then, for o<ample, the third term on the
right side of (l) becomes
(3)
1,2
Taylor concluded that when Ar : 0, this term becomes hlf'(a) l2l, and, so
the entire Gregory-Newton formula becomes
Of course Taylor's method was not rigorous, nor did he consider the question
of convergence.
Taylor's theorem for a : 0 is now called Maclaurin's theorem. Colin
Maclaurin, who succeeded James Gregory as professor at Edinburgh, gave
this special case in his Treatisc of Fluxions (1742) and. stated that it was but a
special case of Taylor's result. However, historically it has been credited to
Maclaurin as a separate theorem. Incidentally, Stirling gave this special case
for algebraic functions in l7l7 and for general functions in his Methodus
Difncntialis of 1730.
Maclaurin's proof of his result is by the method of undetermined
coefficients. He proceeds as follows. Let
(5) .f("):A+Bz *Czz l Dzo + "'.
Then
: B + 2Cz * 3Dz2 I ...
"f'(") :2c
l'k) + 6Dz + .. .
from which
(e)
ll +"'+ I
n*l'n12 F
-J--
and says this sum is larger ttrar. (nz - n)'(llnz) because there ate nz - n
terms and each is at least as large as the last. But
(n,-n)(*J:,-;
Ilence if we add 1/z to (9)
l-r I - I +"'+-)I'I
n' n+l'n*2
Thus, he says, we can go from one group of terms to another, each group
having a sum greater than l. Hence we can obtain a finite number of terms
whose sum is as large as we please; and therefore the sum of the whole series
must be infinite. Consequently, he also points out, the sum of an infinite
series whose "last" term vanishes can be hfinite; this is contrary to his earlier
belief and the belief of many eighteenth-century mathematicians, including
Lagrange.
5. Opta, l, 375-402.
6. Otera, l, 392.
4+* rNFrNrrE sERrEs
m*n
: !*(, . o*)-' : !_ _ # *,#" _...,
7. Opcra, l, 517-42.
8, Opera,2,745-64.
THE MANIPULATION OF SERIES 445
(ll)
tltl
2mrnmm--+--...,
which he describes as a not inelegant paradox.
In the second paper on series he replaced the general term by a sum or
difference of two other terms, and then performed other operations that lead
to specific results. This replacement is comect for absolutely convergent series
but not for conditionally convergent ones. Hence he got wrong results
which he also described as paradoxes.
One ofJames's very interesting results deals with the series of reciprocals
of the zth powers of the natural numbers, that is, with I + ll2 + 1/3' +
I 14 + .. . . James proved that the sum of the odd-numbered terms is to the
sum of the even-numbered terms as 2o - I is to I' This is correct fot n Z 2.
However, James did not hesitate to apply it for the case n : 1 ur.6 n : ll2.
This last result he found paradoxical.
Another result on series due to James says that the sum of the series
I + ll\/2 + lt\/S f.' . is infinite because each term is greater than the
corresponding term of the harmonic series. Ilere the comparison test was
used effectively.
The series that provoked th€ greatest discussion and controversy, which
occurs when I and' m in (l l) are 1, is
(12) I - I + I - 1+...
It seemed clear that by writing the series as
(r3) (r - l) + (t - l) + (l - l) +...
the sum should be 0. It also seemed clear that by writing the series as
I - (l - 1) - (1 - l) - (1 - l) -. . .
the sum should be l. However, if one denotes the sum of (12) by S, then
S: 1 - S, so that S: ll2; and this is in fact Bernoulli's result in (11)'
Guido Grandi (167I-1742), a professor of mathematics at the University of
Pisa, in his little book Quadratura Circuti et Hgperbolae (The Quadrature of
Circles and Hyperbolas, 1703), obtained the third result by another method'
He set r : I in the expansion
I :l-**x2-xs+...
(14)
+r
and obtained -l
|:r-r+l-r+'...
446 rNnNrrE sERrEs
Grandi therefore maintained that I /2 was the sum of the series (12). He
also argued that since the sum of (12) in the form (13) was 0, he had proved
that the world could be created out of nothing.
In a letter to Christian Wolf (1678-1754), published in the Acta,s
Leibniz also treated the series (12). He agreed with Grandi's result but
thought it should be possible to obtain it \,vithout resorting to his argument.
Instead, Leibniz argued tiat if one takes the first term, the sum of the first
two, the sum of the first three, and so forth, one obtains l, 0, 1, 0, I, . . ..
Thus I and 0 are equally probable; one should therefiore take the arithmetic
mean, which is also the most probable value, as the sum. This solution was
accepted by James and John Bernoulli, Daniel Bernoulli, and, as we shall
see, Lagrange. Leibniz conceded that his argument was more metaphysical
than mathematical but went on to say that there was more metaphysical
truth in mathematics than was generally recognized. However, he was
probably much more influenced by Grandi's argument than he himself
realized. For when, in later correspondence, Wolf wished to conclude that
l-2+4- B+tU...:+
l-3+g- 27+tr...:i
by using an extension of Leibniz's own probability argument, Leibniz
objected. He pointed out that series that have sums have decreasing terms,
and (12) is at least a limit of series with decreasing terrns, as is evident from
(ta) by letting .r approach I from below.
Really extensive work on series began about 1730 with Euler, who
aroused tremendous interest in the subject, But there was much confusion
in his thinking. To obtain the sum of
1- I + I - 1+ I -'..
Euler argued that since
(15)
*:l+r+x2+*s,
tlrenwhent:-1,
I
(16)
2
1- I + I - I +...
so that the sum is I/2.
Also, when x : -2, (15) shows that
I
(17) ::l-2+22-2s+...;
5
hence the sum of the right-hand series is 1/3. As a third example, since
(lB)
lr: r -2+3-4+...
Again, since
l-x : : |- * +"',
ffi (I - tc)(l + tc)-z 3x 5rz - 7xs
(20) co:l*2+3+4+5+..'.
This Euler accepted. Moreover one sees from (15), for .r : 2, that
(21) -l:1+2+4+8+....
Since the right-hand side of (21) should exceed the right-hand side of (20),
the sum I + 2 + 4 + B +... should exceed o. According to (21), it yields
be a sort of limit between the positive
- I . Euler concluded that co mustrespect
and negative numbers and in this resembles 0.
Apropos of (19), Nicholas Bernoulli (1687-1759) said, in a letter to
Eulerofl743, that the sum ofthis series t - 3 + 5 - 7 + "'is -oo(-I)-'
Euler,s result of 0 he called an unsolvable contradiction. Bernoulli also
noted that from (15) one gets' for x : 2,
-l:l+2+4+B+...;
and from
I
(22)
| -x-x' : I + x + 2rc2 + 1xs +...,
forx:lonegets
-l:l+l+2+3+....
The fact that two diflerent series give - I is also an unsolvable contradiction,
for otherwise one could equate the two series.
In one paper Euler did point out that series can be used only for values
448 rYFrNrrE sERrEs
*:x+x2+r3+...
and
xl .ll1
l+-+--*--*"'.
x-l I --
t x tc'
x
But the two left sides add up to 0, while the two right sides add up to the
original series,
fn an earlier paper,11 Euler started with the series
. fi3 tt5
(24) /:srnx:*-31 +51 -...
or
.xxsxi
(2s) *
' - , rt, - st, * "': o'
lll..,o
-t.+F+*+ :m
10. Comm. Acad. Sci. Petrop., ll, 1739, 116-27, pub. 1750 : Olns, (l), 14, 95M3.
ll. Comm. Acad. Sci. Pebop.,7, 1734135, 123-34, pub. 1740 : Opera, (l), 14, 23-86.
12. He used the symbol p for z until 1739; r had been introduced by William Jones in
1706.
THE MANIPULAnON OF SERIES 449
llll ,6
n2
12,22',32',42
--r-r--L-
llll
_-J--r--!-
t4
14,24',34'44 ' 90'
>i":(-t1'-' ffiu'^'
Y=1
where the Bro are the Bernoulli numbers (see below). The connection with
the Bernoulli numbers was actually established by Euler a little later in his
Institutiones of 1755.16 He also gave in the l74O paper the sum )f=, (l/vn)
for the first few odd values of z but got no general expression for all odd z'
Euler also worked on harmonic series, that is, series such that the
reciprocals of the terms are in arithmetic progression. In particular he
showed 17 how one can sum a finite number of terms of the ordinary harmonic
series by using the logarithm function. He starts with
l: .*(#) . L* - L*. +^
- -
l4ttlt
: IocE "'
B - * - fr - m - s.248'r
I n*l I I I I
;: l%--n- -Tnl- 3o:s
a ,rr-t - -srru
-....
By adding and noting that each log term is a difference of two logarithms,
one gets
18. Novi Comm. Acad. Sci. Pebo!., 5, 175415,20t37, pub. 1760 : Opna, (l), 14,585-617.
TIIE MANIPULATION OF SERIES 45r
(31) xzg'*y:x.
But this diflerential equation has the integrating factor x2e-7t', so that
f x.-rlt
(32) Y :,'l' Jo? dt
is a solution that can be shown by L'Hospital's rule to vanish with x' Euler
considered the series (30) to be the series expansion of the function in (32)
and (32) as the sum of the series (30). In fact he lets * : I and obtains
The remarkable fact about the series (30) is that it can be used to obtain
good numerical values for the function (32) because, given a value of .r, if
we neglect all terms beyond a certain one, the absolute value ofthe remainder
can be shown to be smaller than the absolute value ofthe first ofthe neglected
terms. Hence the series can be used to obtain good numerical approximations
to the integral. Euler was using divergent series to advantage. The full
significance ofwhat these divergent series accomplished was not aPPreciated
for another 150 years. (See Chap. 47.)
Another famous result of Euler's in the area of series should be noted.
ln Ars Conjcctandi Jarnes Bernoulli, treating the subject of probability,
introduced the now widely used Bernoulli numbers. He sought a formula
for the sums of the positive integral powers of the integers and gave the
following formula without demonstration :
B;
(3s) Zru : ["rut a, - lval --r(o)] + u'@) -.r'(o)l
BiLr@)
+ - -f'(o)l+' '' + pr1fuzr1,lf"*-"1o) -Jr(ek-1)(0)l + ftk
where
Here z and * are positive integers. Pr**r(*) is the (2ft + l)th Bernoulli
polynomial (which also appears in Bernoulli's Ars Conjectandi), which is given
by
(37) Pa@) : Exk + B, *k-r B- xk-Z B"
iG=lIi * 2.6= 4. + "' + Ii'
wherein B, : -ll2 and, Brn*r: 0 for k : 1,2,.... The series
- t>
(n) y<zr - o(o)l
(38) 17<zrc -
Ze
is divergent for almostalll(r) that occur in applications. Nevertheless, the
remainder B* is less than the first term neglected and so the series in (35)
gives a useful approximation to
)lt.
The Bernoulli numbers B, are often defined today by a relation given
later by Euler,so namely,
used. Given a series )f;=o 6o, he wrote it as If,=o (- l)"o,' Then by a number
of formal algebraic steps he showed that
(40)
wherein the Ao denotes the nth finite difference (sec. 3). The advantage of
this transformation, in modern terms, is to convert a convergent series into
a more rapidly converging one. However, for Euler, who did not usually
distinguish convergent and divergent series, the transformation could also
t urrrfir- divergent series into convergent ones' If one applies ('[0) to
(41) I - I + I - I +...,
then the right side of ('10) yields l/2. Likewise for the series
(42) l-2+22-2s+24...
(40) gives
:+
(43) i,-,,"r,:It,l *|t-,1 *|t,l-1*r-,1
tn"r"."rrn. are, of course, the same as those Euler got above (see [16] and
tl7]) by taking the sum of the series to be the value of the function from
which the series is derived.
The spirit of Euler's methods should be clear' He is the great manipu-
lator and iointed the way to thousands ofresults later established rigorously.
one other famous series must be mentioned. In his Methofus Di,fermtialifa
James Stirling gave the series we now write
as
5. Tigorurutrir Series
The eighteenth-century mathematicians also worked extensively with
trigonometric series, especially in their astronomical theory. The usefulness
of such series in astronomy is evident from the fact that they are periodic
functions and astronomical phenomena are largely periodic. This work
was the beginning ofa vast subject whose full significance was not appreci-
ated .in the eighteenth century. The problem that launched the use of
trigonometric series was interpolation, particularly to determine the positions
of the planets between those obtained by observation. The same series were
introduced in the early work on partial differential equations (see Chap. 22)
but curiously t-he two lines of thought were kept separate even though the
same men worked on both problems.
By a trigonometric series is meant any series of the form
(46)
i" - 2 (ancos nx a 0,, sin z*)
(50) z+f,zz+|,'+"':o.
This equation, in view of the series for a, is
e"-l: O.
Next he determines the roots of this last equation. fle starts with the
equation
(r +f)": r'
+( + i) "i"'l +'j-
The roots of (50) are not affected if we divide each flactor by 4 sins bln (fot
the respective ft), and so the quadratic factors are
zzZ
l+-+
4rr,#n
For z : @, the term zlnls 0. The quantity sin furln is replaced by furln, and
so the factors become
-2
1 -L -:-.
' ' 4tP#
26. Vol. I, Chap. 14.
456 INFINITE SERIES
To such a factor in the auxiliary equation (50) there correspond the roots
z : t i2kr, and hence the integral
a* sin 2kztx a A* cos 2krx
of (49). The linear factor z mentioned above gives rise to a constant integral.
Since;f(O) : I is an initial condition, Euler finally obtains
-f(*):f(x - l) +x(x)
rx -
+ 2 i sin2z",* *G) sin2nzt{ d('
?' J"
Here we have a function expressed as a trigonometric series in the year
1750-51. Euler maintained that his was the most general solution of the
interpolation problem. If so, it surely included the representation of poly-
nomials by trigonometric series. But, as we shall see in Chapter 22, Euler
denied this in the arguments on the vibrating string and related problems.
ln 1754 d'Alembert 27 considered the problem of the expansion of the
reciprocal of the distance between two planets in a series of cosines of the
multiples of the angle between the rays from the origin to the planets, and
here too the definite integral expressions for the coefficients in Fourier
series can be found.
In another work Euler obtains trigonometric series representations of
functions in a totally difierent fashion.2s lfe starts with the geometric series
g
),a"(cos* + isinx)n, i: \/=
r!=0
27 , Rtchzrches sur difiCrens points importans du systCme du monde, 1754, Vol. II, p. 66.
28. Noai Comm. Acad. Sci. Petrop.,5, 175415, 164-204, pub. 1760 : Opera, (l), 14,542-a4;
see also Opera, (l), 15, 435-97, for another method.
TR.IGONOMETRIC SERIES 457
asinx € - .
Z,"$nn,
T=i,cos;TA: n=''
So f;ar his results are not surprising. He now lets a : tI and obtains, for
example,
I
(51) t* cos 'r * cos 2* + cos 3r + cos 4* + ""
i:
(Actually the series are divergent.) He then integrates and obtains
and oth6r such equatiors. Daniel Bernoulli, who had also given expansions
such as (52), (53), and (54), recognized that the series represent the functions
only for certain ranges ofr values.
ln 1757, while studying perturbations caused by the sun, Clairaut 2e
took a far bolder step. He says he will represent azy function in the form
(s5) f@ : Ao 12),1'^cosnx.
,r=1
Hc rcgards the problem as one of interpolation and so uses the function
values at thc *-values
2zt 4n 6n
T' T' T'"'
and after some manipulations obtains
n":iz4H
e":tr)r(w).o"w.
By lctting & become infinite, Clairaut arrives at
29. Hisr, dt l'Acad. ths Sci., Patis, 1754,545 ff., pub. 1759.
30. Mitc. Taw., I, 1759 = (Euorcs, l, 1lO.
31. Iagraryc, CEwn*, 13, 116.
CONTINUED FRACTIONS 459
whereas the right side would be an even one. The problem was not resolved
in the eighteenth century.
ln Euler, working on a problem in astronomy' actually obtained
1777,32
the coefficients of a trigonometric series by using the orthogonality of the
trigonometric functions, the method we use today. That is, from
(57) r@ :? + )o*","ff
he deduced that
6. Conti,nued Fradions
We have already noted (Chap. 13, sec. 2) the use of continued fractions to
obtain approximations to irrational numbers. Euler took up this subject'
In his first paper on it,33 entitled "De Fractionibus Continuis," he derived
a number of interesting results, such as that every rational number can be
expressed as a finite continued fraction. He then gave the expansions
llllllll
e - | : t * i* fo fu IT 4+frTT6T...,
32. Nora Aata Acad. Sci. Pctrop., ll, f793, ll4-32, pub. 1798 = Olctq (l), 16, Part l,
333-55.
33. Comm. Acad. Sci. P.fio!.,9, 1737,9U137, pub' 1744 = OIna, (l), 14, 187-215'
46o TNrlr.trTE sERrEs
the relation between logx and the area under y : l/1, demonstrated the
convergence of the series for log 2 and log 5/4 by comparison with a geo-
metric series. Newton andJames Gregory, who made much use of numerical
values of series to calculate logarithmic and other function tables and to
evaluate integrals, were aware that the sums of series can be finite or infinite.
The terms " convergent " and " divergent " were actually used by James
Gregory in 1668, but he did not develop the ideas. Newton recognized the
,r""J to consider convergence but did no more than affirm that power series
converge for small values of the variable at least as well as the geometric
series. He also remarked that some series can be infinite for some values of
x and so be useless, as, for example, the series fot g : \/-4x- *' at tc : a'
Leibniz, too, felt some concern about convergence and noted in a
letter of Octob er 25, 1713, to John Bernoulli what is now a theorem, that a
series whose terms alternate in sign and decrease in absolute value mono-
tonically to zero converges.ss
Maclaurin, irt his Treatise of Fluxiotts ( I 742), used series as a regular
method for integration. He says, " When a fluent cannot be represented
accurately in algebraic terms, it is then to be expressed by a converging
series." That the terms of a convergent series must continually decrease
and become less than any quantity howsoever small that can be assigned he
also recognized.
..In that case a few terms at the beginning of the series will
be nearly equal to the value of the whole." In the Treatisc Maclaurin gave
the integral test (independently discovered by cauchy) for the convergence
of an ininite seriert ), {(n) converges ifand only if J.- {(x) is finite, provided
that {(r) is finite and of the same sign for a < x < o. Maclaurin gave it in
geometrical form.
Some ideas about convergence were also expressed by Nicholas
Bernoulli (1687-1759) in letters to Leibniz of l7l2 and 1713. In a letter of
April 7, l7l3,8s Bernoulli says the series
(l + })" : t + n* **+ x2 +"'
has no sum when r is negative and numerically Sreater than I ifz is frac-
tional and has an even denominator, That is, the (arithmetical) divergence of
a series is not the only reason for a series not to have a sum' Thus for * > I
both series
(l - x)-rra: t *+. + # *'+#*3 +"'
(t - x;-rrz : , *L- +#4*' +!#4.6.rs + "'
38, Math, Schriften, 3, 922-23. Leibrriz also gave an incorrect proof in a letter to John
of
January 10, l7l4: Math. Schiftcn,3,926.
39. Leibniz: Math. Schrifien' 3, 98G-84.
462 rNFrNrrE sERrEs
are divergent, but the fust series has a possible value and the second an
imaginary value. One cannot distinguish the two by examining the series
because the rcmaindzrs are missing. Ilowever, Nicholas did not set up a clear
concept of convergence. In a reply ofJune 28, I 713, Leibniz ao uses the term
" advergent " for series that converge (roughly in our sense) and agrees that
non-advergent series may be impossible or infinitely large.
There is no doubt that Euler saw some of the difficulties with divergent
series, and in particular the difficulty in using therri for computations, but
he certainly was unclear about the concepts of convergence and divergence.
He did recognize that the terms must become infinitely small for con-
vergence. The letters described below tell us, indirectly, something of his
views.
Nicholas Bernoulli (1687-1759), in correspondence with Euler during
174243, had challenged some of Euler's ideas and work. He pointed out
tlrat Euler's use in his paper of 1734135 (see sec. 4) of
.J3s5
srnr:r_fi+Sl + :(,-#)(,-#x,-#)
does yield
tt2 .lll
t +p+
6 U+ *--z*...,
but a proof of the convergence of the basic series in s is missing. In a letter
of April 6, 1743,4t he says he cannot imagine that Euler can believe a
divergent series gives the exact value of some quantity or function. He points
out that the remainder is lacking. Thus l/(1 - x) cannot equal I 1x 1
i + . . . because the remainder, namely, .16 +1/(l - x), is missing.
In another letter of I 743, Bernoulli says Euler must distinguish between
a 6nite sum and a sum of an infinite number of terms. There is no last term
in the latter case. Hence one cannot use for infinite polynomials (as Euler
did) the relation between the roots and coefficients of a polynomial of finite
degree. For polynomials with an infinite number of terms one cannot speak
of the sum of the roots.
Euler's answers to these letters of Bernoulli are not known. In writing
to Goldbach on August 7,1745,42 Euler refers to Bernoulli's argument that
divergent series such as
+l - 2 + 6 - 24 + 120 - 72O -...
have no sum but says that these series have a definite aaluz. He notes that
we should not use the term "sum" because this refers to actual addition.
He then states the general principle which explains what he means by a
40. Math, Schriftzn, 3,986.
41. Fuss: Cotcslondance, 2, 701 tr.
42, Fuss: Corrcspondarcc, l, 324,
THE IRoBLEM oF coNvERcENcE AND DTvERGENcE +65
definite value. He points out that the divergent series come from finite
algebraic exp.essions and then says that the value of the series is the wluc oJ
ttu algebrait expressi.on ftom uhich tlv series comes. ln the paper of 1754155
(sec. +;, he adds, " Whenever an infinite series is obtained as the development
of sorne closed expression, it may be used in mathematical operations as the
equivalent of thai expression, even for values of the variable lor which the
series diverges." He repeats the first principle in his Institutiones of !755:
Let us say, therefore, that the sum of any infinite series is the finite
expression, by the expansion of which the scries is generated' In this sense
- *
thi sum of the infinite series I x xz -*3 +"' will be l/(l
+ x)'
because the series arises from the expansion ofthe fraction, whatever number
is put in place of x. If this is agreed, the new definition of the word sum
coLcides with the ordinary meaning when a series converges; and since
divergent series have no sum in the Proper sense of the word, no incon'
veniencecanarisefromthisterminology.Finally,bymeansofthisdefinition,
weczrnpreservetheutilityofdivergentseriesanddefendtheirusefromall
objections.ao
It is fairly certain that Euler meant to limit the doctrine to power series.
In writing to Nicholas Bernoulli in 1743, Euler did say that he had had
grave doubts as to the use of divergent series but that he had never been led
into error by using his definition of sum.{4 To this Bernoulli replied that the
same series might arise from the expansion of two different functions and, if
(in- the
so, the sum would not be unique.a6 Euler then wrote to Gotdbach.
,,
leiter of Augus t I , l74b) : Bernoulli gives no examples and I do not believe
it possible that the same series could come from two truly different algebraic
e*|ressions. Hence it follows unquestionably that any series, divergent or
convergent, has a definite sum or value."
There is an intereting sequel to this argument' Euler rested on his
contention that the sum of series such as
(58) I - 1+ 1- I + 1...
could be the value of the function from which the series comes. Thus the
above series comes from l/(l + r) when * : 1, and so has the value l/2'
However, Jean-Charles (Frangois) Callet ( 17't't-99), in an unpublished
me*o""rrdrr- submitted to Lagrange (Lagrange approved it for publication
in the Mhnoires of the Academy of Sciences of Paris but it was never pub-
Iished), pointed out some forty years Iater that
l+r+...+*'r l-{
\JJ/ T;.VT::.+-"-t:T-*"
: I - { * xo - *o+a * tc?o -,.,.
+3. Paragraphs 108-l l.
44. Opra Posthuma, l, 536.
45. April 6, 1743; Fnssz Cortcslondaruc, 2,701 tr.
464 rNFrNrrE sERrEs
Hence for 3 : I * < ,), since the left side is mln, the sum of the right
(and
side must also be mln, where z and n are at our disposal.
Lagrange ao considered Callet's objection and argued that it was
incorrect. He -used Leibniz's probability argument thus: Suppose rrr : 3
and z : 5. Then thefll series on the right side of (59) is
I +0+0-xs +0+*6 +0+0-ro +0 + rro 10-....
Now if one takes for x : l, the sum of the first term, the first two, the first
three,. , ., then in each five of these partial sums three are equal to I and
two are equal to 0. Hence the most probable value (mean value) is 3/5;
and this is the value of the series in (59) for m : 3 and z : 5. Incidentally,
Poisson, without mentioning Lagrange, repeats Lagrange's argument.a?
Euler did say that great care should be exercised in the summation of
divergent series. He also made a distinction between divergent series and
semiconvergent series, such as (58), that oscillate in value as more and more
terms are added but do not become infinite. Certainly he recognized the
distinction between convergent series and divergent ones. fn one case
(1747), where he used infinite series to calculate the attraction that the
earth, as an oblate spheroid, exerts on a particle at the pole, he says the
series " converges vehemently."
Lagrange, too, showed some awareness of the distinction between con-
vergence and divergence. In his earlier writings he was indeed lax on this
matter. one paperas he says that a series will represent a number if it
fn
converges to its extremity, that is, ifits zth term approaches 0. Later, toward
the end of the eighteenth century, when he worked with Taylor's series, he
gave what we call Taylor's tJreorem,{s namely,
Ro:f<n+D(x + 0h)
#.
and 0 is between 0 and I in value. This expression for rRo is still known as
Lagrange's form of the remainder. Lagrange said that the Taylor (infinite)
4$. Mlm. de l'Acad. des Sci., Inst, Frarce,3, 1796,l-ll,pub. 1799; this article does not
appear in the (Euotes.
47. Jour. de PEcole Poly., 12, 1A23, 404-509. If one insists on using the full lorzer series,
then Lagrange's argument makes more sense, It can be rigorized by applying Erobenius's
definition of summability (Chap. 47, sec.4).
48. Hisl. de l'Acad, de Berlin,24, l77O = (Euares,3,5-73, p.6l in particular.
49. Thlorb des fonctions,2nd ed., 1813, Chap. 6 : CEuares, 9, 69-a5. The mean value
theorem of the differential calculus, .f(b) -f(r) = f'(6)(b - c), is due to Lagrange
(1797). Later it was used to derive Taylor's theorem as in modern books,
THE PROBLBM OF CONVERGENCE AND DTVERGENCE 465
t+f+l*l*...
tztut3n'4n'
converges when z > I and diverges when z < l' He also gave (1776) the
well-known test for convergence and divergence, now known as the ratio
test and attributed to Cauchy. The ratio of the (z + l)st to the zth term is
formed, and if the limit as z -> o is less than l, the series converges; if
greater than l, the series diverges. No conclusion may be drawn when the
limit is l.
ThoughLacroixsaidseveralnonsensicalthingsaboutseriesin-the
1797 edition of his influenti al TraitC du calcul difdrentiel et du calcul intigral,
he was more cautious in his second edition. Speaking of
x*2xB
I+-+--*-*"',
aa'd-
he says tlrat one should speak of the series as a deaelopment of tle function
because tJre series does not always have the aafue of the function to which it
belongs.62 The series, he says, gives the value of the function only for
lrl < lrl. He continues with a thought already expressed by Euler, that
the infinite series is nevertheless tied in with the function for all *. In any
analytical work involving the series we would be right to conclude that we
are dealing with the function. Thus if we discover some property of the series,
we may be sure this property holds for the function. To perceive the truth
of this assertion, it is sufficient to observe that the series verifies the equation
that characterizes the function. For example, for y : 4@ - *) we have
a-(a-x)y:O.
But if one substitutes the series for y in this last equation, he will see that the
series also satisfies it. One knows, Lacroix continues, that it would be the
same for any other example; and he points to the great number presented
in the te:<t.
It is fair to say that in the eighteenth-century work on infinite series the
formal view dominated. On the whole, the mathematicians even resented
any limitations, such as the need to think about convergence. Their work
produced useful results, and they were satisfied with this pragmatic sanction.
They did exceed the bounds of what they could justi$r, but they were at
least prudent in their use of divergent series, As we shall see, the insistence on
restricting the use of series to convergent ones won out during most of the
nineteenth century. But the eighteenth-century men were ultimately
vindicated; two vital ideas that they glimpsed in infinite series were later to
gain acceptance. The first was that divergent series can be useful for
numerical approximations of functions I the second, that a series may
represent a function in analytical operations, even though the series is
divergent.
Bibliographg
Bernoulli, James t Ars Conjcctandi, 1713, reprinted by Culture et Civilisation, 1968.
Opera, 2 vob., 1744, reprinted by Birkhaiiser, 1968.
Bernoulli, John: Opera Omnia, 4 vols., 1742, reprinted by Georg Olms, 1968.
Burkhardt,
-: H.: " Trigonometrische Reihen und Integrale bis etwa 1850," Ercgk.
fur math. Wiss., B. G. Teubner, 19l.l-15, 2, Part l, pp. 825-1354.
" Entwicklungen nach oscillirenden Functionen," Jahrcs. der Deut. Math.-
Vercin., Yol. 10, 1908, pp. l-1804.
"Uber den Gebrauch divergenter Reihen in der Zeit 175O-1860," Math.
Ann., 70, 191I, 189-206.
Cantor, Moritz : Vorhsungcn ilber Geschichtz br Matlumalik, B' G' Teubner, 1898,
Vol. 3, Chaps. 85,86, 97, 109, 110.
Dehn, M., and E. D. Hellinger: " Certain Mathematical Achievements of James
Gregory," Amcr. Math. Monthlg,50, 1943' 149-63'
Euler, Leonhard: Op*a Omnia, (l), Vols. 10, 14, and 16 (2 parts), B' G' Teubner
and Orell Fiissli, 1913, 1924, 1933, and 1935'
Fuss, Paul Heinrich von'z Cotespondarce matWmatiqw dz quclquzs cillbres
't Plrysiqut
Reprint Corp', 1967'
gionltrcs du XVIIDru silclc,2 vols., 1843, Johnson
Hofmain, Joseph E' : "Uber Jakob Bernoullis Beitregg zur Infinitesimal-
mathematik," L'Eueigrcmcnt Mathimatiqu, (2), 2, 1956, 6l-17l; also
published separately by Irstitut de Mathdmatiques, Gen0ve, 1957'
Montucla, l. F.: Histoirc dcs mathimatiques, A. Blanchard (reprint), 1960, Vol' 3,
pp.20H3.
neiffn.A.:Geschichtederuuallitlunfuilun,H.LauppscheBuchhandlung,lSS9;
Martin Siindig (rePrint), 1969.
Schneider, Ivo: "Der Mathematiker Abraham de Moivre (1667-1754)," Archiac
-2
2t
Ordinary Differential Equations
in the Eighteenth Century
A traveler who refuses to pass over a bridge until he has
personally tested the soundness of every part of it is not likely
to go far; something must be risked, even in mathematics.
HORACE LAIrlB
l. Motiaati.ons
The mathematicians sought to use the calculus to solve more and more
physical problems and soon found themselves obliged to handle a new class
of problems. They wrought more than they had consciously sought. The
simpler problems led to quadratures that could be evaluated in terms of the
elementary functions. Somewhat more difficult ones led to quadratures
which could not be so expressed, as was the case for elliptic integrals (Chap.
19, sec. 4). Both of these types fall within the purview of the calculus.
However, solution of the still more complicated problems demanded
specialized techniques; thus the subject of differential equations arose.
Several classes of physical problems motivated the investigations in
differential equations. Problems in the area now generally known as the
theory of elasticity were one class. A body is elastic if it deforms under the
action of a force and recovers its original shape when the fiorce is removed.
The most practical problems are concerned with the shapes assumed by
beams, vertical and horizontal, when loads are applied. These problems,
treated empirically by the builders of the great medieval cathedrals, were
approached mathematically in the seventeenth century by men such as
Galileo, Edme Mariotte (1620?-84), Robert Hooke (1635-1703), and
Wren. The behavior of beams is one of the two sciences Galileo treats in
the Dialogucs Concerning Tuo New Scienccs. Hooke's investigation of springs
led to his discovery of the law that states that the restoring force of a spring
that is stretched or contracted is proportional to the stretch or contraction.
The men of the eighteenth century, armed with more mathematics, began
their work in elasticity by tackling such problems as the shape assumed by
an inelastic but flexible rope suspended from two fixed points, the shape of
468
MOTrvATroNs 469
an inelastic but flexible cord or chain suspended from one fixed point and
set into vibration, the shape assumed by an elastic vibrating sEing held
fixed at its ends, the shape of a rod when fixed at its ends and subject to a
Ioad, and the shape when the rod is set into vibration.
The pendulum continued to interest the mathematicians. The exact
differential equation for the circular pendulum, d20ldt2 + ng sin d : 0,
defied treatment, and even the apProximate one obtained by replacing
sin 0 by d had yet to be treated analytically. Moreover, the period of a
circular pendulum is not strictly independent of the amplitude of the motion,
and the search was undertaken for a curve along which the bob ofa pendulum
must swing for the period to be strictly independent of the amplitude.
Huygens had solved this geometrically by introducing the cycloid; but the
analytical solution was yet to be fashioned.
The pendulum was closely connected with two other major investiga-
tions of the eighteenth century, the shape of the earth and the verification
of the inverse square law of gravitational attraction. The approximate
period of a pendulum, T - 2rlTii, was used to measure the force of
gravity at various points on the surface of the earth because the period
depends on the acceleration g determined by that force. By measuring
successive lengths along a meridian, each length corresponding to a change
of one degree in latitude, one can, with the aid of some theory and the
values ofg, determine the shape ofthe earth, In fact by using the observed
variation in the period at various places on the earth's surface, Newton
deduced that the earth bulges at the equator.
After Newton had, by his theoretical argument, concluded rhat the
equatorial radius was l/230 longer than the polar radius (this value is 30
percent too Iarge), the European scientists were eager to confirm it. One
method would be to measure the length of a degree of latitude near the
equator and near a pole. If the earth were flattened, one degree of latitude
would be slightly longer at the poles than at the equator.
Jacques Cassini (1677-1756) and members of his family made such
measurements and in I 720 gave an opposite result. They found that the
pole-to-pole diameter was I /95 longer than the equatorial diameter. To
settle the question the French Academy of Sciences sent out two expeditions
in the 1730s, one to Lapland under the mathematician Pierre L. M. de
Maupertuis and the other to Peru. Maupertuis's party included a fellow
mathematician Alexis-Claude Clairaut. Their measurements confirmed that
the earth was flattened at the poles; Voltaire hailed Maupertuis as the
" flattener of the poles and the Cassinis." Actually Maupertuis's value was
I / t 78, which was less accurate than Newton's. The question of the shape
of the earth remained a major subject and for a long time it was oPen as
to whether the shape was an oblate spheroid, a prolate spheroid, a general
ellipsoid, or some other figure of revolution.
The related problem, veri$ing the law of gravitation, could be handled
47O oRDINARY DTFFERENTTAL EqUATTONS rTOO-rBOO
if the shape of the earth were known. Given the shape, one could determine
the centripetal force needed to keep an object on or near the surface of the
rotating earth. Then, knowing t}le acceleration g due to the force ofgravity
at the surface, one could test whether the full force of gravity, which supplies
the centripetal acceleration and g, is indeed the inverse square law. Clairaut,
one of the men who questioned the law, believed at one time that it should
be of the form F : Alrz + Blra. The two problems of the law of attraction
and the shape of the earth are further intertwined, because when the earth
is treated as a rotating fluid in equilibrium, the conditions for equilibrium
involve the attraction of the particles of the fluid on each other.
The physical field of interest that dominated the century was astronomy.
Newton had solved what is called the tv/o-body problem, that is, the motion
of a single planet under the gravitational attraction of the sun, wherein
each body is idealized to have a point mass. He had also made some steps
toward treating the major three-body problem, the behavior of the moon
under the attraction of the earth and sun (Chap. 17, sec. 3). However, this
was just the beginning of the efforts to study the motions of the planets and
their satellites under the gravitational attraction of the sun and the mutual
attraction of all the other bodies. Even Newton's work in the Priluipia,
though constituting in effect the solution of differential equations, had to
be translated into analytical form, which was done gradually during the
eighteenth century. It was begun, incidentally, by Pierre Varignon, a fine
French mathematician and physicist, who sought to free dynamics from the
encumbrance of geometry. Newton did solve some differential equations
in analytical form, for example in his Method of Fluxions of 167l (Chap. 17,
sec. 3); and in his Tradatus of 1676, he observed that the solution of
drgld* : f(*) is arbitrary to the extent of an (n - I)st degree polynomial
in r. In the third edition of the Prircipia, Proposition 34, Scholium, he
confines himself to a statement of which shapes of sudaces of revolution
offer least resistance to motion in a fluid; but in a letter to David Gregory
of 1694 he explains how he got his results and in the explanation uses
differential equations.
Among problems of astronomy, the motion of the moon received the
greatest attention, because the common method of determining longitude
of ships at sea (Chap. 16, sec. 4), as well as other methods recommended in
the seventeenth century, depended on knowing at all times the direction
of the moon from a standard position (which from late in the century was
Greenwich, England). It was necessary to know this direction of the moon
to within 15 seconds of angle to determine the time at Greenwich to within
I minute; even such an error could lead to an error of 30 kilometers in the
determination of a ship's position. But with the tables of the moon's position
available in Newton's time such accuracy was far from attainable. Another
reason for the interest in the theory of the motion of the moon is that it
FIRST ORDER ORDINARY DIFFERENTIAL EqUATIONS 471
could be used to Predict eclipses, which in turn were a check on the entire
astronomical theory.
The subject of ordinary differential equations arose in the problems
just sketched. As mathematics developed, the subject of partial differential
-equations
led to further work in ordinary differential equations' So did the
branches now known as differential geometry and the calculus of variations.
In this chapter we shall consider the problems leading directly to the basic
early work in ordinary differential equations, that is, equations involving
derivatives with respect to only one independent variable'
ds{W- : dr\G-
l. (Euvres, l0' 512-14.
2. Marh. Schnfun, 5,306.
3. Acta Erud.,1690, 217-19 = Olcra, 1,421-24'
472 oRDINARY DIFFERENTIAL EqUATTONS r 7OO-rBOO
Figure 2l.l
Bernoulli concluded from the equality of the differentials that the integrals
(the word is used for the first time) must be equal and gave
Yt*3! \/Fr=a : *G
as the solution. The curve is, of course, the cycloid.
fn the same paper of 1690 James Bernoulli posed the problem of
finding the curve assumed by a flexible inextensible cord hung freely from
two fixed points, the curve Leibniz called the catenary. The problem had
been considered as far back as the fifteenth century by Leonardo da Vinci.
Galileo thought the curve was a parabola, Iluygens affirmed that this was
not correct and showed, largely by physical reasoning, that if the total
load of cord and any weights suspended from it is uniform per horizontal
foot, the curve is a parabola. For the true catenary the weight per foot alcng
tlu cable is uniform.
ln the Acta for June of 1691, Leibniz, Huygens, and John Bernoulli
published independent solutions. Huygens's was geometrical and unclear.
John Bernoulli a gave a solution by the method of the calculus. The full
explanation is in his calculus text of 1691. It is the one now given in calculus
and mechanics texts and is based on the equation
dy
(r) :s_
dtc c
where s is arc length from B to some arbitrary point I (Fig. 2l.l) and c
depends upon the weight per unit length of the cord. This differential
equation Ieads to what we now write asy : c cosh (x/r). Leibniz too obtained
this result by calculus methods.
John Bernoulli was immensely proud that he had been able to solve
the catenary problem and that his brother James, who had proposed it,
(3) )n'*:-i{@'-Y')',
5. Johann Berno ulli, Der Briefwechscl oon Johann Bcmozl/i, Birkhiiuser vcrlag, 1955, 97-98.
474 oRDTNARY DTFFERENTTAL BqUATTONS r 7OO-rBOO
Figure 2I.2
which he left as characteristic integrals for the curve. (Equation [2] can
be integrated to yield * + @=7 : alog[a + \f=T)til.)
Leibniz hit upon the technique of separating variables in ordinary
diflerential equations and communicated it in a letter to Huygens of 1691.
Thus he solved an equation of the form y(drcldil :-f(x)e@), by writing
d*lf@ : S@) dSlC, and then was able to integrate both sides, He did not
formuLate tlle general method. He also reduced (1691) the homogeneous
diflerential equations of first order, S' :"f@lx), to quadratures. He let
! : ox arl.d substituted in the equation. The equation is t}len separable.
Both these ideas, separation of variables and solution of homogeneous
equations, were explained more fully byJohn Bernoulli in the Acta Eruditorum
of 1694. Then Leibniz, in 1694, showed how to reduce the linear first order
ordinary diflerential equation !' + P(x)g: Q(*) to quadratures. His
method ut:lizsd a change in the dependent variable. In general, Leibniz
solved only first order ordinary diflerential equations.
James Bernoulli then proposed in the Acta of 16956 the problem of
solving what is now called Bernoulli's equation:
(4)
*:'or, + Q@)v".
8. Opera, l, 266.
9. Phil. Trans., 29, l7 16, 399-.100.
lO. Acta Erad., 1717,349 ff. Also in John Bernoulli, Opera,2,275-79.
476 oRDTNARY DTFFERENTTAL EeuATroNs r Too-r8oo
3. Singular Solutions
Singular solutions are not obtainable from the general solution by giving a
definite value to the constant of integration; that is, they are not particular
solutions. This was observed by Brook Taylor in his Methodus Incrementorumr2
while solving a particular first order second degree equation. Leibniz in
1694 had already noted that an envelope of a family of solutions is also a
solution. Singular solutions were more fully explored by Clairaut and Euler.
Clairaut's work of 173413 dealt with the equation that now bears his
narne,
(7) !: xy' +"f@,).
Let y' be denoted by y'. Then
(B) s: 'c!
+lQ).
ll. Comm. Acad. Sci. Pebo!'7, 1734135, 174-93, pub. 17,10 : Otera, (l), 22, 36-56.
t2. 1715, p.26.
13. Hist, de l'Acad. des Sci., Paris, 1734, 196-215.
SINGULAR SOLTITIONS 477
?:? + {x + f'(P)}&*.
Then
(e)
fr= o and x +f'(p) :0.
The equation dpldx : 0 leads to g' : c,and then from the original equation
we have
(10) y:cx+lk).
This is the general solution and is a family of straight lines. The second
factot, x + .f'(P) :0, may be used together with the original equation to
eliminate 1; this yields a new solution, which is the singular solution. To see
that it is the envelope of the general solution we take (10) and diflerentiate
with respect to c. Then
(l l) x*-f'(c):o.
The envelope is the curve that results from eliminating c between (10) and
(t l). But these two are exactly the same as the two equations that yield the
singular solution. The fact that the singular solution is an envelope was not
yet appreciated, but Clairaut was explicit that the singular solution was not
included in the general solution.
Clairaut and Euler had given a method of finding the singular solution
by working from the differential equation itseli that is, by eliminating y'
fri:om J@,y,g'): 0 and aflay' :0' This fact and the {irct that singular
solutions are not contained in the general solution puzzled Euler' In his
Institutiones of 17681a he gave a criterion for distinguishing the singular
solution from a particular integral, which could be used when the general
solution was not known. D'Alembert 15 sharpened this criterion. Then
Laplace ls extended the notion of singular solutions (he called them partic-
ular integrals) to equations of higher order and to differential equations in
three variables.
Lagrange 17 made a systematic study of singular solutions and their
connection with the general solution' He gave the general method of obtain-
ing the singular solution from the general solution by elimination of the con-
stant in a clear and elegant way that surpasses Laplace's contribution' Given
the general solution V(x,y, a) : 0, Lagrange's method was to filed dglda,
14. Vol. l, pp. 393 tr
15. Hitt, de l'Acad. des Sci., Paris, 1769,85 ff., pub. 1772.
16. Hisl'. dc I'Acad. ths Sei., Paris, 1772, Pafi l, 344 tr., pub. 1775 = CEtNr.s 8, 32ffi'
17. Nouo. Mlm. tte I'Acad. de Bulin, 1774, pttb. 1776 = (Ewres' 4, 5-108.
478 oRDINARY DTEFERENTIAL EQUATIONS r TOO-r BOO
set equal to 0, and eliminate cr from this equation and, V : 0. The same
it
Procedure can be used with dzlda: 0. He also gave further information
on Clairaut's and Euler's method of obtaining the singular solution from
the differential equation. Finally, Lagrange gave the geometrical interpreta-
tion of the singular solution as the envelope of the family of integral curves.
There are a number of special difficulties in the theory of singular solutions
which he did not recognize. For example, he did not realize that other
singular curves, which may appear in the equation obtained by eliminating
y' from f(x,g,y') : 0 and afl as' :0 are not singular solutions, or that a
singular solution may contain a branch that is a particular solution. The
full theory of singular solutions was developed in the nineteenth century
and given its present form by Cayley and Darboux in 1872.
where 7 is the tension in the string, o : mlg, m is the mass Per unit length,
and g is the acceleration of gravity.
In his effort to treat the vibrating string, John Bernoulli, in a letter of
1727 to his son Daniel and in a paper,le considered the weightless elastic
string loaded with z equal and equally spaced masses' He derived the
fundamental frequency of the system when there ate 1,2,. '.,6 masses.
(There are other frequencies of oscillation of the system of masses') John
recognized that the force on each mass is -r( times its displacement, and
solved dzxldtz : - Kx, thus integrating the equation of simple harmonic
motion by analytic methods. He then passed to the continuous string which,
like Taylor, he proved must have the shape of a sine curve (at any instant)
and calculated the fundamental frequency. Here he soLved, dzyldxz : -ky'
Neither Taylor nor John Bernoulli treated the higher modes of elastic
vibrating bodies.
ln I72B Euler began to consider second order equations. His interest
in these was aroused partly by his work in mechanics. He had worked, for
example, on pendulum motion in resisting media, which leads to a second
order differential equation. For the king ofPrussia he worked on the effect of
the resistance of air on projectiles. Here he took over the work ofthe English-
man Benjamin Robins, improved it, and wrote a German vemion (1745).
This was translated into French and English and used by the artillery.
He also considered 20 a class of second order equations that he reduced
to first order by a change of variables. For example, he considered the
equation
(12) af dxc : y" dgo-'d'y
or in derivative form
ldg\o-z 72, a*ttl
(13)
\e) -d*: s"'
Euler introduced the new variables I and z by means of the equations
(14) y : e"t(a), *:edo
wherein a is a constant to be determined. The equations (I4) may be
regarded as parametric equations for x and y in terms of a, so that one can
now calculate ilgldx and dzyldxz and by substitution in (13) obtain a second
order equation in I as a function a. Euler then fixes s so as to eliminate the
exponential factor, and a no longer apPears explicitly. A further transforma-
tion, namely, z : daldt, reduces the second order equation to first order'
The details of this method are not worth pursuing because they apply
to just one class ofsecond order equations, but historically this piece of work
19. Comm. Acad. Sci. PertoP.,3, 1728, 13-28, pub. 1732 = Olera,3' 198-210'
20. Comm. Acad. Sci. Parop.,3, 1727,124-37, pub' 1732 = Oleta, (l),22, l-14'
480 oRDTNARY DIFFERENTTAL EqUATTONS r 7OO-rBOO
Figure 21.3
(ls)
"!*e*) +y:0,
and the solution is an infinite series, which (in modern notation) can be
expressed as
(16) y : Aro\liii),
where.Io is the zero-th order Bessel function (of the first kind).za Moreover,
c is such that
(t7) Joe\/-qd) : o,
where / is the length of the chain. He asserts that (17) has infinitely many
roots, which become smaller and smaller and approach 0, and he gives the
largest value for c. For each c there is a mode of oscillation and a charac-
teristic frequency.
IIe now says, " Nor would it be difficult to derive from this theory a
theory of musical strings agreeing with those given by Taylor and by my
father . . . Experiment shows that in musical strings there are intersections
[nodes] similar to those for vibrating chains." Actually, Bernoulli here
goes beyond Taylor and his father in recognizing the higher modes or
harmonics of a vibrating string.
His paper on the hanging chain also treats the oscillating chain of
non-uniform thickness; here he introduces the differential equation
(lB)
"*(^r!) *,ff : o,
where g(.r) is the distribution of the weight along the chain. For g(x) : x2ll2
he gives a series solution that can be expressed in modern notation as
with
Jt?\/-2qd) : o.
y: +rl)*.
Aq-*trln(2{-q), o: -("
The z here is general so that Euler is introducing Bessel functions of arbitrary
real index. He also gives the integral solution
Io'(t - t2)<2't-r)t2."rn(zrJ@) a,
!:A
This is perhaps the earliest case of a solution of a second order differential
equation expressed as an integral.
In a paper of 173927 Euler took up the difierential equations of the
harmonic oscillator * * kx : 0 and the forced oscillation of the harmonic
oecillator
(20) Mii + Kx : Fsin arct.
(22)
H: ,"r*l * a1@)v * a2(x)vz
dr dtc*t,
-^4:* l1
du ,t2 : nAfi+n-t'
(23)
dx'xn
-r-
He then showed how to solve (23) for special values of z by the method of
separation of variables for ordinary differential equations. Later, several
of the Bernoullis determined other values of z for which solution of (23) by
separation of variables was possible.
Riccati's work is significant not only because he treated second order
diflerential equations but because he had the idea of reducing second order
equations to first order. This idea of reducing the order of an ordinary
and showed that ifone knows a particular integral z, then the transformation
z:a+u-r
produces a linear equation. Moreover, if one knows two particular integrals,
one can reduce the problem of solving the original differential equation to
quadratures.
D'Alembertsl was the first to consider the general form (22) of the
" for this form. He
Riccati equation and to use the term " Riccati equation
started with
(25)
dzs: - AzxzrzS
7F -ZaLc
and let
30. Novi Comm. Acad, Sci. Petrop.,8, 1760/61, 3-63, pub. 1763 : OPta, (l),22, ?34-94,
and 9, 1762163, 15,[-69, pub. 1764 = O?era, (l'),22, 403-20.
31. Hist. de l'Acarl. de Berlin, 19, 1763, 242 tr., pub. 1770.
HrGrrER ORDER EqUATTONS 485
(2s) o : Ay + a4* +
"#,
* ,# *...* , *,ry,,
where the coefficients are constants. The equation is called homogeneous
because the term independent of y and its derivatives is 0. He points out
that the general solution must contain n arbitrary constants and that the
solution will be a sum of z particular solutions, each multiplied by an arbi-
trary constant. Then he makes the substitution
y:",.pU,4,
r constant, and obtains the equation in r,
A+Br+Crz+"'!Lr":0,
which is called the characteristic or indicial or auxiliary equation' When 4
is a simple real root of this equation, then
,.*olloo4
is a solution of the original differential equation. When the characteristic
equation has a multiple root q, Euler lets y : eq"u(x) and substitutes in the
differential equation. He finds that
(30) ! : ea, (a * px + yxz +...* r*k-r;
32. Misc. Beroli.n., 7, 1743, 19T242 : O|cra, (l),22' 10849.
486 oRDTNARv DTrFT,RENTTAL EeuATroNs r Too-r8oo
(31) ,&**afl+U:x(x),
he multiplies through by eux dx and obtains
*.(''' *
''#')
By differentiating this quantity and comparing with the original equation
he finds that
(33) A-Ba*Ca2:O.
Thus a, A', and B'are found and the original equation is reduced to
33. Nooi Comm. Acad. Sci. Petrop.,3, 1750151,3-35, pub. 1753 = Olaa, (l),22, l8l-213.
34. Misc. Taur., 3, 1762165, 179-86 = Gwreq l, 471-78.
HTGHER ORDER EqUATTONS 487
This led, as we shall see, to the concePt of the adjoint equation. Lagrange
starts with
(36) 0:2+3x-2y+x2+x29,
Newton assumes
(37) !: Ao * A$ * Aaxz +....
Then
(38) !: At + 2A2x * 3A"x2 1" '
Substitution of (37) and (38) in (36) and equating coefficients of like powers
of x yields
Ar:2 - 2Ao, 2A2:3 - 2Ay 3A": 1 * Ao - 2Az + "'.
Thus we determine the 11 except for le. The fact that
is undetermined lo
and that therefore there is an infinite number of solutions was noted, but the
significance of an arbitrary constant was not fully appreciated until about
1750. Leibniz solved some elementary differential equations by the use of
infinite seriess6 and also used the above method of undetermined coemcients.
Euler put the method of series in the fore from about 1750 on, to solve
differential equations that could not be integrated in closed form. Though
he worked with specific differential equations, and the details of what he
did are often complicated, his method is what we use today. He assumes a
solution of the form
s:i(A+Bx+Cxz+"'),
substitutes for y and, its derivatives in the differential equation, and deter-
mines tr and the coefficients A, B, C,.. . from the condition that each power
of x in the resulting series must have a zero coefficient. Thus the ordinary
differential equation which arose from his work on the oscillating mem-
branesT (see Chap. 22, sec. 3), namely,
now called the Bessel equation, Euler solved by an infinite series. He gives
the solution
rs +...
The above form of (39) and of the solution (40) he gave again in his main
paper on this subject, written in 1778.3e He had written other papers on
what he called the hypergeometric series, but there. the term referred to
another series originally introduced by Watlis. The term " hypergeometric,"
to describe the diflerential equation (39) and the series (40), is due to
Johbnn Friedrich Pfatr (1765-1S25), Gauss's friend and teacher. The series
(40) for y is now denoted by F(a, b, c; z). In this notation Euler gave the
famous relations
(41)
40. Hitt. dc I'Acod. de Barlin, 6, 1750, 185-217, pub. 1752 = Opcra, (2), 5, 8l-108.
49t
(r,1, <)
Figure 21.4
In this case ofone body moving under the attracting force of another (fixed)
body, the two diflerential equations can be combined into one involving
x and g or r and 0 because, for example, the second polar equation can be
integrated to yield r2 dlldt : C and the value of d0lilt cat be substituted
in the first equation. It turns out t}rat the moving body describes a conic
section with the position of the first body as a focus.
If the two bodies move, each subject to the attraction of the other,
then the differential equations are slightly different. Let ra, and z2 be the
masses of two spherical bodies with spherically symmetric mass and with
m1 * m2 : M. Choose a fixed coordinate system (usually the center is takcn
at the center of mass ofthe two bodies) and let (xr,yr, zr) be the coordinates
of one body and (x2, y2, zs) the coordinates of the other; let r be the distance
. Then the system of equations
that describes their motion is
Priruipia (Book I, Section I l). However, the analytical work was not taken
up for sonie time. In mechanics the French followed Descartes's system
until Voltaire, after visiting London in 1727 , returned to champion New-
ton's. Even Cambridge, Newton's-own university, continued to teach natural
philosophy from the text of Jacques Rohault (1620-75), a Cartesian. In
addition, the most eminent mathematicians of the late seventeenth century
Leibniz, and John Bernoulli-were opposed to the concept of
-Huygens,
gravitation and hence to its application. Analytical methods of treating
planetary motion were undertaken by Daniel Bernoulli, who received a prize
from the French Academy of Sciences for a paper of 1734 on the two-body
problem. It was handled completely by Euler in his book Theoria Motuum
P lawlamm ct Comctantm,at
If we have z bodies, each spherical and with spherically symmetric
mass distribution (density a function of radius), they will attract each other
as tlrough their masses were at their centers. Let mr, f,zs . . . t fln represent
the masses and (xe ys zr) the (variable) coordinates of the ith mass with
respect to a fixed system of axes; let 4, be the distance from m1 to mr. Then
the x-components of the forces acting orr lz1 ar€
k,.rl
- Ls
rmtm2(x, - *r), -i"mrms(x, - ,s),. . ., -* mtmo(xt - xn),
with similar expressions for the y- and z-components of the force. Each body
has such components of force acting on it.
The differential equations of the motion of the l'th body are then
directions. The first is a search for whatever general theorems one can
deduce, which may at least shed some light on the motions' The second is
a search for approximate solutions that may be useful for a period of time
subsequent to-roln. instant at which data may be available; this is known
as the method of Perturbations.
The first type of investigation produced some theorems on the motion
of the center of mass of z bodies, which were given by Newton in his Principia.
For example, the center of mass of the z bodies moves with uniform speed
in a straight line. The ten integrals mentioned above, which are consequences
of what are called conservation laws of motion, also constitute theorems
of the first type. These integrals were known to Euler. There are also some
exact resrlts for special cases of the problem of three bodies, and these are
due to one of the masters of celestial mechanics, Joseph-Louis Lagrange'
Lagrange (1736-l8t3) was of French and Italian extraction' As a
boy he was inimpressed with mathematics, but while still at school he read
an essay by Halley on the merits of Newton's calculus and became excited
abor,t the subject.' At the age of nineteen he became a professor of mathe-
matics at the Royal Artillery School of Turin, the city of his birth. He soon
contributed so much to mathematics that even at an early age he was
recognized as one of the period's greatest mathematicians' Though Lagrange
*o.i'"d in many branches of mathematics-the theory of numbers, the
theory of algebraic equations, the calculus, differential equations and the
of variations-and in many branches of physics, his chief interest
"alcrirrs
was the application of the law of gravitation to planetary motion' He said'
in 1775,..The arithmetical researches are those which have cost me most
trouble and are perhaps the least valuable'" Archimedes was Lagrange's
idol.
Lagrange's most famous work, his Mhanique analytiqut (l7BB; second
edition, lBIl-15; posthumous edition, 1853), extended, formalized, and
crowned Newton's work on mechanics, Lagrange had once complained that
Newton was a most fortunate man, since there was but one universe and
Newton had already discovered its mathematical laws' However, Lagrange
had the honor ofmaking apparent to the world the perfection ofthe Newton-
ian theory. Though the MCeanique is a classic of science and is significant
also for the theory and use of ordinary differential equations, Lagrange
had trouble finding a Publisher'
The particular exact solutions obtained in the three'body problem
*e.. giren by Lagrange in a prize paper of 1772, Essai sur le problimc des
trois cirps.az One oi these solutions states that it is possible to set these bodies
i, motio., so that their orbits are similar ellipses all described in the same
time and with the center of mass of the three bodies as a common focus'
Another solution assumes the three bodies are started from the three summits
of an equilateral triangle. They will then move as though attached to the
triangle which itself rotates about the center of mass of the bodies. The third
solution assumes the three bodies are projected into motion from positions
on a straight line. For appropriate initial conditions they will continue to be
fixed on that line while the line rotates in a plane about the center of mass
of the bodies. These three cases had no physical reality for Lagrange, but
the equilateral triangle case was found in 1906 to apply to the sun, Jupiter,
and an asteroid named Achilles.
The second type of problem involving z bodies deals, as already noted,
with approximate solutions or the theory of perturbations. Two spherical
bodies acted upon by their mutual gravitational attraction move along
conic sections. This motion is said to be unperturbed. Any departure from
such motions, whether in position or velocity, however caused, is perturbed
motion. If there are two spheres but there is resistance from the medium
in which they move, or if the two bodies are no longer spherical but, say,
oblate spheroids, or if more than two bodies are involved, then the orbits
of the bodies are no longer conic sections. Before the use of telescopes the
perturbations were not striking. In the eighteenth century the calculation
of perturbations became a major mathematical problem, and Clairaut,
d'Alembert, Euler, Lagrange, and Laplace all made contributions. Laplace's
work in this area was the most outstanding.
Pierre-Simon de T aplace (1749-1827) was born to reasonably well-to-do
parents in the town of Beaumont, Normandy. It seemed likely that he would
become a priest but at the University of Caen, which he entered at the age
of sixteen, he took to mathematics. He spent five years at Caen and while
there wrote a paper on the calculus of finite differences. After finishing his
studies, Laplace went to Paris with letters of recommendation to d,Alembert,
who ignored him. Then Laplace wrote d,Alembert a letter containing an
exposition of the general principles of mechanics; this time d,Alembert
took notice, sent for Laplace, and got him the position of professor of
mathematics at the Ecole Militaire in Paris.
Even as a youth Laplace published prolifically. A statement made in
the Paris Academy of Sciences shortly after his election in l77B pointed
out that no one so young had presented so many papers on such diverse and
difficult subjects. In l7B3 he replaced Bezout as an examiner in artillery
and examined Napoleon. During the Revolution he was made a member
of the Commission on Weights and Measures but was later expelled, along
with Lavoisier and others, for not being a good republican. Laplace retired
to Melun, a small city near Paris, where he worked on his celebrated and
popralar Exposition du systime du monde (l7gi). After the Revolution he became
a professor at the Ecole Normale, where Lagrange by this time was also
teaching, and served on a number of government committees. Then he
sYsrEMs oF DIFFERENTTAL EqUATIONS 49s
predicted in a paper read to the Paris Academy on November 14, 1758, that
ihe perihelion would occur on April 13, 1759' He remarked that the exact
time was uncertain to the extent of a month because the masses of Jupiter
and Saturn were not known precisely and because there were slight per-
turbations caused by other planets. The comet reached its perihelion on
March 13.
Tocomputeperturbationsthemethodcalledvariationoftheelements
or variation ol parameters--or variation of the constants of integration-
was created and is the most effective one' We are obliged to confine ourselves
to its mathematical principles; we shall therefore examine it without taking
into account the full physical background'
The mathematical method of variation of parameters for the problem
of three bodies goes back to Newton's Principia' After treating the motion
of the moon about the earth and obtaining the elliptical orbit, Newton
took account of the effects of the sun on the moon's orbit by considering
variations in the latter. The method was used in isolated instances to solve
nonhomogeneousequationsbyJohnBernoulliintheActaEruditorumof
I697 a3 an-cl by Eulerln I 739, in treating the second order equation y' * kzg
:
X (x). It was first used to treat perturbations of planetary motions by Euler
pup". of l74},aa which treated the mutual perturbations of Jupiter
ir'hi,
and Satuin and won a ptize from the French Academy' Laplace wrote
many papers on the method.as It was fully developed by Lagrange in two
papers.aB
Themethodofvariationofparametersforasingleordinarydifierential
equation was applied by Lagrange to the zth-order equation
PY + Q!' + Rg" + "'* Vg<"\ - X,
9,3s7-80.
46, Mhn. de I'Acad.
Nout,. de Berlin, 5, 1774,201ff', and 6, 1775' 190 ff' = (Euorcs' 4'
We have given, in the first part of this work, the general principles ofthe
equilibrium and motion of bodies. The application of these principles
47. Mlrn. de l'Acad. d,es Sci., Inst. Frante, 1808,267 tr, : (Euares, 6, 713-68.
SITMMARY 499
Lap|aceconcludedthatnatureorderedthecelestialmachine..foran
eternal duration, upon the same principles which prevail so admirably-
upon the earth, for ihe preservation of individuals and for the perpetuity of
the species."
is the mathematical methods for solving differential equations were
improved and as new physical facts about the Planets were acquired' efforts
we^re made throughout the nineteenth and twentieth centuries to obtain
better results on th. ,r".ious subjects Laplace mentions, in particular on
the z-body problem and the stability of the solar system'
B. Summary
As we have seen, the attemPt to solve physical problems, which at first
involved no more than quadratures, Ied gradually to the realization that a
new branch of mathematics was being created, namely, ordinary differential
equations.Bythemiddleoftheeighteenthcenturythesubjectofdifferential
became an independent discipline and the solution of such equa-
"{r.tiorrs
tions an end in itself,
The nature ol what was regarded and sought as a solution gradually
changed. At first, mathematicians looked for solutions in terms of elementary
frnctiorrs ; soon they were content to exPress an answer as a quadrature that
might noi be effected. When the major attempts to find solutions in terms of
eleirentary functions and quadratures failed, mathematicians became
content to seek solutio.ns in infinite series'
The problem of sllution in closed form was not forgotten' but instead
of attempiing to solve in that manner the particular differential equations
Bibli.ograplry
Bernoulli, farnes: Opera,2 vols., 1744, reprinted by Birkhaiiser, 1968'
Bernoulli, John: Opera Omnia,4 vols., 1742, reprinted by Georg Olms, 1968'
Berry, Arthur: A Short History of Astronomg, Dover (reprint), 1961, Chaps' 9-11'
Cantor, Moritz : Vorlesungen iiber Geschichte der Mathematik, B. G. Teubner, 1898
atd 1924, Vol.3, Chaps. 100 and ll8, Vol. 4,Sec.27.
Delambre, J. B. J.: Histoire de l'astronomie moderne, 2 vols., 1821, Johnson Reprint
Corp., 1966.
Euler, Leonhard: Opera Omnia, Orell Fiissli, Series l, Vols. 22 and 23, 1936 and
1938; Series 2, Vols. 10 and ll, Part l, 1947 afi' 1957 '
Hofmann, J. E. : " Uber Jakob Bernoullis Beirage zur Infinitesimal-mathematik,"
L'Enseignement Mathimatique, (2),2,61-171. Published separately by Institut
de Mathdmatiques, Geneva, 1957.
Lagrange, Joseph-Louis : (Emres de Lagrange, Gauthier-Villars, I 868-l 873, relevant
papers in Vols' 2, 3, 4, and 6.
Micanique analglique, 1788; 4th ed., Gauthier-Villars, 1889' The fourth
edition is an unchanged reproduction of the third edition of 1853'
Lalande, J. de: Traiti d'aslronomie,3 vols., 1792, Johnson Reprint Corp'' 1964'
Laplace, Pierre-Simon: (Euates complites, Gauthier-Villars, 189l-1904, relevant
papers in Vols' 8, 11 and 13.
(Euorcs comPllles,
Traiti de micanique cileste, 5 vols., 1799-1825' Also in
Vols. l-5, Gauthier-Villars, 1878-82. English trans' of Vols. l-4 by
Nathaniel Bowditch, 1829-39, Chelsea (reprint), 1966'
Exposition du systime du monde, lst ed., 1796, 6th ed' in CEwres comPl2tcs ,
Gauthier-Villars, 1884, Vol. 6.
Montucla,J.F.:Hisloiredesmathimaliquts,lS02,AlbertBlanchard(reprint),
1960, Vol.3, 163-200; Vol.4, l-125.
Todhunter, of the Mathematical Theories oJ Attraction and the Figarc of
l.: A Hi.story
the Earth, 1873, Dover (reprint)' 1962.
Truesdell, Clifford E.: Introtluction to Leonhardi Euleri Opera Omnia, Vol' X ct XI
Seriei Secundae, in Euler, Opera Omnia, (2), ll, Part 2, Orell Fiissli, 1960'
22
Partial Differential Equations
in the Eighteenth Century
Mathematical Analysis is as extensive as nature herself.
JOSEPH FOURIER
l. Introdwtion
As in the case of ordinary differential equations, the mathematicians did
not consciously create the subject of partial differential equations' They
continued to explore the same physical problems that had led to the former
subject; and as they secured a better grasp of the physical principles under-
lying the phenomena, they formulated mathematical statements that are
now comprised in partial differential equations. Thus, whereas the displace-
ment of a vibrating string had been studied separately as a function of time
and as a function of the distance of a point on the string from one end, the
study of the displacement as a function of both variables and the attempt to
comprehend all the possible motions led to a partial differential equation.
The natural continuation of this study, namely, the investigation of the
sounds created by the string as they propagate in air, introduced additional
partial differential equations. After studying these sounds the mathematicians
took up the sounds given offby horns ofall shapes, organ pipes, bells, drums,
and other instruments.
Air is one type offluid, as the term is used in physics, and happens to be
compressible. Liquids are (virtually) incompressible fluids. The laws of
motion of such fluids and, in particular, the waves that can propagate in
both became a broad field ofinvestigation that now constitutes the subject of
hydrodynamics. This field, too, gave rise to partial differential equations.
Throughout the eighteenth century, mathematicians continued to work
on the problem of the gravitational attraction exerted by bodies of various
shapes, notably the ellipsoid. While basically this is a problem of triple
integration, it was converted by Laplace into d problem of partial difieren-
tial equations in a manner we shall examine shortly.
502
TIIE WAVE EQUATION 5O3
By analyzing the force on the *th mass, Bernoulli had shown that ifyn is the
displacement of the ,tth mass, then
whire a2 : tTlM, Tis the tension in the string (which is taken to be constant
as the string vibrates), and M the total mass. D'Alembert replaced y" by
g(t, x) and, llnby L.x. Then
029(t, x) _ 02y(t, x)
(r) --z;-- - u^r -dli-,
where a2 is now flo, o being the mass per unit length. Thus what is now
called the wave equation in one spatial dimension appears for the first time.
Since the string is fixed at the endpoints * : 0 and x : l, the solution
must satisfy the boundary conditions
(2) y(r, 0) : 0, g(t, l) : 0.
ag\' x)l
(3) g(0, x) - rt/"\
\*) t --7i- lr=o -
o
Since for arry ,c, ax ! t: at' for some value of tr', we may say that for any.r
and t
(6) g(x + at) : -g(x + at).
The condition
(B) x\l
-i-1,-,:
ail(L
u
3. Nooa Acta Erud., 1749, 512'27 : Opera, (2), 10, 50-62; also in French by Euler'
Hist. de I'Acad. de Bcrlin,4, 1748,69-85 : Opera, (2), l0' 63-77 '
506 PARTIAL DTFFERENTTAL EqUATTONS r 7OO-r8OO
Figure 22.1
4. Hiit. di l'Acad. de Bcrlin, 9, 1753, 196-222, pub. 1755 : Ofera, (2), 10, 232-54.
5. ll,
Novi Cothm. Acad. Sci. Petro!,, 1765, 67-102, pub. 1767 :
Olua, (l),23,74-91,
TrrE wAvE EqUATTON 5O7
Then, if this curve [y : f(*)] is used to represent the initial function, after
the time I the ordinate that will answer to the abscissa x of the string in
vibration will be (cf. [3] and [12])
\- hrt
. ---r-
(le) (*) : 2 4n Stn
"f LtL
because there are enough constants a, to make the series fit any curve. Hence,
he asserts, a/l subsequent motions would be
: s- nfix nfict
(20) g(t, x)
Zo"srn-7-cos-7-.
7 . Comm. Acad. Sci. Pctro!., 12, l?40, 97-108, pub. 1750.
8. Comm. Acad. Sci. Peho!., 13, 1741143, 167-96, pub. 1751.
9. Hist. de I'Acad. de Berlin, 9, 1753, 147-72 and I 73-95, pub' 1755.
THE ll/A\rE EQUATTON 5O9
Thus every motion corresponding to any initial curve is no more than a sum
of sinusoidal periodic modes, and the combination has the frequency of the
fundamental. However, he gives no mathematical arguments to back up his
contentions; he relies on the physics. In this paper of 1753 Bernoulli states:
(24)
.L -"' EI Y(x\
i .i. i ,in '4I vb\
-'"'-'-ilx and L " a*
C=l q=l
other for the most part, and those which remain are so disfigured and
altered as to become absolutely unrecognizable. It is truly annoying
that so ingenious a theory ... is shown false in the principal case, to
which all the small reciprocal motions occurring in nature may be
related.
15. Min. de l'Acad, dcs Sci., Paris, l?79,207-309, pub' 1782 : CEuurcs, 10, l-89'
514 PARTTAL DIEFERENTTAL EQUATTONS r 7OO-rBoo
must be periodic in .r. Ilowever, he failed to realize that, given any arbitrary
function in, say, 0 I x 1 l, this function could be repeated in every interval
[nl, (n + l){ for integral z and so be periodic. Of course, such a periodic
function might not be representable by one (closed) formula. Euler and
Lagrange were, at least in their time,;justified in believing that not all
" discontinuous " functions could be represented by Fourier series, yet equally
right in believing (though they did not have proof) that the initial curve can
be very general. It need not be analytic, nor need it be periodic. Bernoulli
did adopt the correct position on physical grounds but could not back it
up with the mathematics.
One of the very curious features of the debate on the trigonometric
series representation of functions is that all the men involved knew that non-
periodic functions can be represented (in an interval) by trigonometric
series. Reference to Chapter 20 (sec. 5) will show that Clairaut, Euler, Daniel
Bernoulli, and others had actually produced such representations; many of
their papers also had the formulas for the coefficients of the trigonometric
series. Practically all of this work was in print by 1759, the year in which
Lagrange presented his basic paper on the vibrating string. He could then
have inferred that any function has a trigonometric expansion and could
have read off the formulas for the coefficients, but failed to do so. Only in
1773, when the heat of the controversy was past, did Daniel Bernoulli
notice that the sum of a trigonometric series may represent different alge-
braic expressions in different intervals. Why did all these results have no
influence on the controversy concerning the vibrating string? It may be
explained in several ways. Many ofthe results on the representation of quite
general functions by trigonometric series were in papers on astronomy, and
Daniel Bernoulli may not have read these and so could not point to them in
ddense of his position. Euler and d'Alembert, who must have known
Clairaut's work of 1757 (Chap. 20, sec. 5), were probably not inclined to
study it, since it refuted their own arguments. Also, this astronomical work by
Clairaut was soon superseded and forgotten. On the other hand, whereas
Euler used trigonometric series, as in his work on interpolation theory, to
represent polynomial expressions, he did not accept the general fact that
quite arbitrary functions could be so represented; the existence of such a
series representation, where he used it, was assured by other means.
Another issue, how a partial diflerential equation with analytic coeffi-
cients (e.g. constants) could have a non-analytic solution, was not really
clarffied. In the case of ordinary differential equations, if the coefficients are
analytical, the solutions must be. However, this is not true for partial
differential equations. Though Euler was correct in saying that solutions
with corners are admissible (and he did insist on it), determination of the
singularities that are admissible in the solution of partial diflerential equa-
tions was still far in the future.
EXTENSTONS OF THE WAVE EQUATION 5I5
:
'r : *,(' . *) {T-1oo, k 1,2,3, "'
t7. Noai Comm. Acatl. Sci. Petrop.,9,1762163,246-304, pub. 1764 : Olaa, (2),10,293-3+3'
516 PARTTAL DTFFERENTTAL EQUATIONS r 7OO-rBOO
Thus the ratio of two successive frequencies is the same as for a string of
uniform thickness, but the fundamental frequency is no longer inversely
proportional to the length.
/ In this paper of 1762163 Euler also considered the vibrations of a string
composed of two lengths , a and. D, of different thicknesses m and n. He
derived the equation for the frequencies ar of the modes. These turn out to
be solutions of
uta nta\-ofi :0,
(2e) tntaln +
-mt1
and he solves for <., in special cases. The solutions of (29) are called the
characteristic values or eigenvalues of the problem. These values are, as we
shall see, of prime importance in the theory of partial differential equations.
It is almost evident from (29) that the characteristic frequencies are not
integral multiples of the fundamental one.
However, Euler took up this question again in another paper on the
vibrating string of variable thickness,ls and starting with (28) he shows that
there are functions c(r) for which the frequencies of the higher modes are
not integral multiples of the fundamental.
D'Alembert, too, took up the string of variable thickness.lo Here he
used a significant method of solution that he had introduced earlier for the
string of constant density. In this earlier attempt at the vibrating-sring
problem d'Alembert had introduced the idea ofseparation ofvariables, which
is now a basic method of solution in partial differential equations|To
solve
x) _ -2 azlU,
02y(t,
--ZtT--'--aF- tc)
d'Alembert sets
y : h(t) s(x),
substitutes this in the differential equation, and obtains
L h"(t) _ e'@)
(30)
a2 h(t) e@)
He then argues, as we do now, that since g'/g does not vary when tr does, it
must be a constant, and by the like argument applied to h" lh, this expression
too must be a constant. The two constants are equal and are denoted by l.
Thus he gets the two separate ordinary differential equations
h'(t) - a2Ah(t) :0
(31)
e'@)-Ag(x):s.
18. Mise. Taur., 3, 1762165,25-59, pub. 1766 : Olera, (2), 10, 397-425.
19. Hist. de l'Acad. de Berlin, 19, 1763' 242 ff., pub. 1770.
20. H;st. de I'Acad. de Berlin,6, 1750, 335-60, pub. 1752.
EXTENSTONS OF TrrE WAVE EQUATION 517
Since a and ,4 are constants, each of these equations is readily solvable, and
d'Alembert gets
y(t,x) : h(t) g(") : fMsorT't + Ne-o{a4ffPeti'x 1 gr-r7'1.
The end-conditions, y(1, 0) : 0 and y(t, l) : 0, led d'Alembert to assert
that g(x) must be of the form ,t sin .Rx and that i(t)
must be of the same form
because y(r, *)
must be periodic in l. He left the matter there. Daniel Ber-
noulli had used the idea of separation of variables in 1732 in his treatment
of the vibrations of a chain suspended from one end, but d'Alembert was
more explicit, despite the fact that he did not comPlete the solution.
In his 1763 paper d'Alembert wrote the wave equation as
*": *al*
UI, U*
Thus the results are the same as for the " weightless " string (where the
gravitational force is ignored), except that the oscillation takes place about
the parabolic figure of equilibrium
(tl2)gx(x - l)
u: -
"c'
21. Acta Acart. Sci --;-
Petrop.,l, 1781, 178-90, pub. 1784 = Olera, (2),11,324-54, but dating
from 1774.
5r8 PARTTAL DTPFERENTTAL BquATroNs rToo-r8oo
/ e, *" shall see in a moment, Euler had introduced all the Bessel
functions of the first kind in a paper on the vibrating drum (see also Chap.
21, secs. 4 and 6) and in this lTBl paper he remarks that it is possible to
express any motion by a series of Bessel functions (despite the fact that he
had argued against Daniel Bernoulli's claim, in the vibrating-string problem,
that any function can be represented as a series of trigonometric functions).
Papers on the vibrating string and the hanging chain, of which the
above are just samples, were published by many other men up to the
end of the century. The authors continued to disagree, correct each other,
and make all sorts of errors in doing so, including contradicting what they
themselves had previously said and even proven. They made assertions,
contentions, and rebuttals on the basis of loose arguments and often just
personal predilections and convictions. Their references to paPers to Prove
their contentions did not prove what they claimed. They also resorted to
sarcasm, irony, invective, and self-praise. Mingled with these attacks were
seeming ag?eements expressed in order to curry favor, particularly with
d'Alembert, who had considerable influence with Frederick II of Prussia
and as director of the Berlin Academy of Sciences.
The second order partial differential equation problems described thus
far involved only one space variable and time. The eighteenth century did
not go much beyond this. In a paper of 175922 Euler took up the vibration
of a rectangular drum, thus considering a two-dimensional body. He ob-
tained for the vertical displacement z of the surface of the drum 1l
l02z 022
-L
022
(33)
c2 atz -axz -.0!''
wherein x and, g represent the coordinates of any point on the drum and a is
deterrnined by the mass and tension. Euler tried
z: a(x,y) sin (<,rI * o)
- :'-a2u* ;-=
02a 02a
U
ox' * -'
o!'
This equation has sinusoidal solutions of the form
where
a2 F',.y',
7:A*F'
Noai Comm. Acad. Sci. Petrop., lO, 176+,243-60, pub. 1766: Opela, (2),10'344-59.
EXTENSTONS OF THE r^/AVE EqUATTON 5I9
The dimensions of the drum are a and b, so that 0 < x < aand0 <y < b.
When the initial velocity is 0, B and. C may be taken to be 0. If the boundaries
are fixed, then p : mr and, y : ntr where m and n are integers. Then,
since ar : 2rru where r is the frequency per second, he obtains readily that
the frequencies are
t lT7
v:2cnl7+P'
He then considers a circular drum and transforms (33) to polar coor-
dinates (a highly original step), obtaining
(34)
I 022 022 l0z I 022
c20t2-0r2,r0r'12062
--L---!---
He now tries solutions of the form
(35) z : u(r) sin (art * l) sin (B{ + B)
so that z(r) satisfies
:
"(:,) u{' - mh ff;)' .,?m,imr?) (;;)'. )'
which wb would write now as
"(1')
: (i)"'"u *'t" (i')'
Since the edge r: a must remain 6xed,
(37) ,,(:.) : o.
It also follows from (35), since z must be of period 2tr in $, that p is an in-
teger. Euler asserts that for a fixed B there are infinitely many roots a, so
that infinitely many simple sounds result. However, he did not calculate
these roots. He did attempt to find a second solution of (36) but liniled to do
so. The theory of the vibrating membrane was derived independently by
Poisson2s and is often credited solely to him.
Euler, Lagrange, and others worked on the propagation ofsound in air.
Euler wrote on the subject of sound frequently from the time he was twenty
where a is again the divergence of the displacement (x, y, z). Euler gives
plane wave and spherical wave solutions using the kind of substitution just
indicated for cylindrical waves. The basic equation for spherical waves is
4. Pokntial Tluory
The dcvelopment of the subject ofpartial differential equations was furthered
by another class of physical investigations. One of the major problems of the
eighteenth century was the determination of the amount of gravitational
attraction one mass exerts on another, the prime cases being the attraction
of the sun on a planet, ofthe earth on a particle exterior or interior to it, and
of the earth on another extended mass. When the two masses are very far
apart compared to their sizes, it is possible to treat them as point masses;
but in other cases, notably the earth attracting a particle, the extent of the
earth must be taken into account. Clearly the shape of the earth must be
known if one is to calculate the gravitational attraction its distributed mass
exerts on a particle or another distributed mass. Although the precise shape
remained a subject for investigation (Chap. 21, sec. l), it was already clear
by 1700 that it must be some form of ellipsoid, perhaps an oblate spheroid
(an ellipsoid generated by revolving an ellipse around the minor axis). For
the solid oblate spheroid the force of attraction both on an external and on
an internal particle cannot be calculated as though the mass were concen-
trated at the center.
In a prize paper of l74O on the tides, and in his Treatise of Fluxions
(1742), Maclaurin proved that, for a fluid of uniform density under constant
angular rotation, the oblate spheroid is an equilibrium shape. Then Mac-
laurin proved synthetically that, given two confocal homogeneous ellipsoids
27. Nooi Comm. Acad. Sci. Petrop., 16,1771,281425, pub. 1772 : Ofera, (2), 13,262-369.
POTENTIAL TI{EORY 523
of revolution, the attractions of the two bodies on the same particle external
to both, provided the particle be on the prolongation ofthe axis of revolution
or in the plane of the equator' will be proportional to the volumes. Some
other limited results were also established geometrically in the nineteenth
century by James Ivory (1765-1842) and Michel Chasles-
The geometrical approach to the problem of gravitational attraction
used by Newton, Maclaurin, and others is good only for special bodies and
special locations of the attracted masses. This approach soon Save way to
analytical methods, which one finds first in papers by Clairaut before 1743
and especially in his famous book Thioic dc la fgurc de la tcrrc ( I 743) , in
which he considers both the shape of the earth and gravitational atEaction.
Let us first note some facts about the analytical formulation. The
gravitational force exerted by an extended body on a unit mass P regarded
as a particle is the sum of the forces exerted by all the small masses that make
up the body. If d{ dl d( is a small volume of the body (Fig. 22.2), so small
that it may be regarded as a particle centered at the Point (f, l, (), and if
P has the coordinates (*,y, r), the attraction exerted by the small mass of
density p on the unit particle is a vector directed from P to the small mass
and, in view of the Newtonian law of gravitation' the components of this
vector are (Chap. 21, sec. 7)
--c z-f
-rr* i* d(dqd{, -kpc+d(&1d(, -rpTd{fidt'
where ,t is the constant in the Newtonian law and
"r-:-olll ,+ddndL
(42) on+dtd,tdt
^:-rlll
f":-rlll ,+d€d,td(,
wherein the integral is extended over the entire attracting body. These
integrals are finite and correct also when P is inside the attracting body'
Instead of treating each component of the force separately, it is possible
to introduce one function V(x, y, z) whose partial derivatives with respect to
x, y, and z respectively are the three components of the force' This function
is
By differentiating under the integral sign with resPect to x,y, and z which
are involved in r, one obtains
av l- 0v l- av 1.
fi:7J*, @:1t", E:EJ-
and these equations also hold when P is inside the attracting body' The
function Z is called a potential function. When problems involving the three
componentslf , f, and f" can be reduced to the problem of working with tr/,
there is the advantage of working with one function instead of three.
If one knows the distribution of mass inside the body, which means
knowing p as a function of (, 1, and (, and if one knows the precise shape of
the body, one can sometimes calculate Zby actually evaluating the integral'
However, for most shapes of bodies this triple integral is not integrable in
terms of simple functions. Moreover, we do not know the true distribution ol
mass inside the earth and other bodies. Hence Z must be determined in
other ways. The principal lact about Z is that for points (*, g, ") outside the
attracting body, it satisfies the partial differential equation
(44)
a2v* a2v* ;-=
-:-=
a2V
: U'
otc' =-=
ag' o z'
in which we note that p does not appear. This differential equation is known
as the potential equation and as Laplace's equation.
The idea that a force can be derived from a potential function, and
even the term "potential function," were used by Daniel Bernoulli in
POTENTIAL THEORY 525
Hgdrodynamica (1738). The Potential equation itself appears for the first
time in one of Euler's major papers composed in 1752, " Principles of the
Motion ol Fluids." 28 In dealing with the components u, a, and p of the ve-
locity of any point in a fluid, Euler had shown that u dx + o dy + w dz must
be an exact differential. He introduces the function S such that d.S :
udx * odg + u dz. Then
as
U:.;-,
a,s W:;-'as
A: -;-,
ox oy oz
But the motion of incompressible fluids is subject to what is called the law
of continuity, namely,
0u 0o 0w
(45) :-*;-*-:U'
ox o.u oz
which expresses mathematically the fact that no matter is destroyed or
created during the motion. Then it follows that
a2S a2S A2S
--L--!-:tt oy'
ox' oz'
How to solve this equation generally, Euler says, is not known I so he con-
siders just special cases where ,S is a polynomial in x, y, and z. The function
,S was later (1868) called by Helmholtz the velocity Potential. In a paper
published in 17622s Lagrange reproduced all of these quantities, which he
took over from Euler without acknowledgment, though he did improve the
order of the ideas and the expressions'
Before we can investigate the work done to solve the potential equation
in behalf of gravitational attraction, we must review some eflorts to evaluate
this attraction directly by means of the integrals (42) or the equivalents in
other coordinate systems.
In a paper written in l7B2 but published in 1785, entitled " Recherches
sur I'attraction des sph€roides," 30 Legendre, interested in the attraction
exerted by solids of revolution, proved the theorem: If the attraction of a
solid of revolution is known for every external point on the prolongation of
its axis, then it is known for every external point' He first expressed the
component ofthe force of attraction in the direction ofthe radius vector r by
means of
28. Noai Comn. Acad. Sci. Petro!., 6, 1756157, 271-311, pub. 1761 = Opcra, (2), 12,
133-68.
29. Misc. Taur., 221760/61, 196-298, pub. 1762 : (Euures, 1, 365-'168.
30. MCn. des sao. itrangers, 10, 1785, 41 l-34.
526 PARTIAL DTFFERENTTAL EqUATTONS r 7OO-r 8OO
Figurc 22.3 t
where (Fig. 22.3) r is the radius vector to the attracted point, r'is the radius
vector to any point of the attracting body, and y is the angle formed at the
center ofthe body by the two radii vectors. The { coordinate of the external
point can be taken to be 0 because the solid is a figure of revolution around
the z-axis. Then he expanded the integrand in powers of r'/r. This is done
by writing the denominator as
u
[, - (, ': .", ,- t)]'"
The quantity in the brackets can be put into the numerator and then ex-
panded by the binomial theorem with the quantity in parentheses as the
second term of the binomial. Legendre obtained for the integrand, apart
from the volume element, the series
The coefficients Pz, P4, . . . are rational integral functions of cos 7. These
functions are what we now call the Legendre polynomials or Laplace coeffi-
cients or zonal harmonics. Legendre gave the form of the functions so that
the general Po, namely,
P(r,0,0) :4 t
L ffiP,,(.,o' e1\'
where
*
o": TlM R2n+a pr,(cos 0,) sin 0,d0,.
f"
The value of this integral depends uPon the shape of the meridiran curves
R:-f(0').
From the above result, and on the basis of a communication from
Laplace, Legendre then obtained the expression for the potential function
for this problem, and from the potential derived the component of the force
of attraction perpendicular to the radius vector.
In a second paper written in 1784,32 Legendre derived some propertics
of the functions Pro. Thus
(48)
li.f{*')r,-ti
*:o
for each rational integral function of x2 whose dcgree in .C is less than z. If
z is any positive integer,
(4e)
2).'.("-2n+2)
Il*"r,^a-: + (z
OLl"tn 1) + 3). ..(z * 2m + l)
If m and z are positive integers,
rt (0 fotmln,
(50) l- P2"@)P2-@) :
dx \ r_ fo,^ :
ro [+.+r
n.
31. This cxpression is derived as follows : In vicw of the equations of trancformation from
spherical to r€ctangular coordinates x = 7 sin 0 cos $,y: r sin 0 sin $, z: r coc 0, the
rectangular coordinales of P arc (t sin d cos {, r sin d sin {, z cos d) and the rcctangular
coordinates of Q are (r'sin 0' cos {', r' sin 0' sin $',t' cos 0'). Then using tbe distancc formula
we can express PQ. But by the law of cosines PQ : r' I r'2 ! 2't.'cos 7. Equating ttrc
tv/o expressions for PQ gives the above cxpression for cos 7.
32. Mln. dc t'Acart. dcs Sci., Park, 1784,370-{.9, pub. 1787.
528 PARTTAL DTFTERENTIAL EqUATTONS r 7OO-r8OO
He also proved that the zeros ofeach of the Pro are real, different from each
other, symmetric with respect to 0, and in absolute value less than 1. AIso
for 0 < r < l, P2"(.r) < l.
Then, with the help of the orthogonality condition (50), he proves (by
integration of the series term by term) that a given function of x2 can be
expressed in only one way in a series of functions Prn(x).
Finally, using these and other properties of his polynomials, Legendre
returns to the main problem of gravitational attraction and using the
expression (43) for the potential and the condition for equilibrium of a
rotating fluid mass, he obtains the equation for the meridian curve of such a
mass in the form of a series of his polynomials. He believed that this equation
included all possible equilibrium figures for a spheroid of revolution.
Now Laplace enters the picture. He had written several papers on the
force of attraction exerted by volumes of revolution (1772, pub. 1776;
1773, pub. 1776; and. I775, pub. 1778), in which he worked with the com-
ponents of the force but not the potential function. The article by Legendre
of 1782, published in 1785, inspired a famous and remarkable fourth paper
by Laplace, " Thdorie des attractions des sph6roides et de la figure des
planetes." 33 Without mentioning Legendre, Laplace took up the problem
of the attraction exerted by an arbitrary spheroid as opposed to Legendre's
figures of revolution. By a spheroid Laplace meant any surface given by one
equatiorr in r, 0, and $.
He starts with the theorem that the potential Zof the force an arbitrary
body exerts on an external point, expressed in spherical coordinates r, d, {
with p : cos d, satisfies the potential equation
I AzV
----j- t 02(rV\ : n
.:-!-:--Z
*(u - -,ry*)
(51) -r- -----j-
'l-p'A6' 0r2
Laplace does not say here how he obtained the equation. In a later paperoa
he gives the rectangular coordinate form (44). One may be fairly sure that
he possessed the rectangular form first and derived the spherical coordinate
form from it. In fact, both forms had already been given by Euler and
Lagrange, but Laplace does not mention them. He may not have known
their work, though this is doubtful.
In the l7B2 paper Laplace sets
33. Min. de I'Acad. des Sci., Pais, 1782, 113-96, pub. 1785 : CEuaret 10, 339-419.
34. Min. de l'Acad. des Sci., Paris, 1787, 249-67, pub. 1789 = CEuarest ll, 275-92.
POTENTTAL THEORY 529
where Ur : tl^(0, and' substitutes this in (51)' Then the individual U"
$),
satisfy36
. iT?* * + r)u:
*"lu - Au*f
(s3) o'
^"
With the help of Legendre's Prn he is able to show that
Now Laplace uses this result and (52) to caiculate the potential of a
spheroid ditrering little from a sphere. lle writes the equation of the surface
of the spheroid as
where c is small and y on the spheroid is a function of 0' and {'' Laplace
assumes that y(0, {) can be expanded in a series of functions
(56) U:Yo+Yl+Y2*.",
where the Io are functions of d and { and satisfy the differential equation
(53). The result he obtains here is first that
(57) u _2n*117
+L "i'
'r.- 4iult
These Y,, then, may be used in (52). Also the expansion (56) may be recast
as
(58) v(p, 4) : +,) {2" * r) il,i;" Y,(p', i)P*(p, 6, p', 6') dp' ttq',
where p' : cos 0'. With the value of y he now has an expression for r in
(55)
and with this and the U" in (54) he obtains I'/in (52)'
Laplace does not consider here the general problem of the development
ofazy function of 0 and 4 into aseries ofthe Yo' In what he does do here and
in laier papers he presumes that such an expression is possible and is unique'
equation
35. If we ignore the middle term ({ is absent) the resulting ordinary differential
is what we now call Legendre's differcntial equation,
The P"(x) satisfy this equation. On the other hand the Uo (and the
I" in [57]) rcgardcd as
functions'of the two variables p : cos 0 and { satisfy (53)' The Uo zrd Yo are called by
the Germans spherical functions and by Lord Kelvin sphcrical harmonics or spherical
surface harmonics.
53O PARTTAL DTFFERENTTAL EqUATTONS r TOO-rBOO
(5e)
i:,J;' u^(p,6)(t^o'd) : o, n * n'
rr (0form / n,
(60)
rI-r ' ' :1 2
r^1x1r"61
"" "' ax form: n.
LET,
Then he too introduces the spherical functions. That is, he lets f, be the
cocfficient ofz"in the expansion of (l - 2zt + z2) -u2 where tr : cos dcos d'
* sin 0 sin d' cos ({ - {'). Then, letting p : cos d, p' : cos 0' and g :
O - i' , he shows that
r,(,) : P,Q,)P*Q,) .
#\l!#rysin dsin 0'cos,lt
+
6=Tp,# 6-TT)W4#9sin2 gsin2 0'cos2,lt *"',
the higher terms containing higher derivatives of P,. This equation is equiv-
alent to
P.(cos 0 cos 0' + sin 0 sin 0' cos($ - d)) :
Pf;(cos d)Pf;(cos 0') cosn($ - g'),
1t - *'1^''ff'
The Pf;(*) thus introduced io* called the associated Legendre polyno-
"..
mials. Then Legendre Proves that
(62)
?a frn
tlo(p',,i)PoQ", i, t ',6) dP' d$' : ffi
U,6, f,
J_, J"
and that
(63)
Jl,J.'t"'tulr2
dp'ds = #
The fact that the Po(x) satisfy Legendre's differential equation is used in this
PaPer.
Many other special results involving the Legendre polynomials and
the spherical harmonics were obtained by Legendre, Laplace, and others'
A basic result is the formula of Olinde Rodrigues (1794-185l), given in
t816,37
| dt(xz l)n
(64) n\al - 2nnt d* -
,Dt_\_
Laplace,s work on the solution of the potential equation for the attract-
ing force of spheroids was the beginning of a vast amount of work on this
sJject. Eq"alty important was his and Legendre's work on the Legcndre
pol!r,o-i.L P,(*), the associated Legendre polynomials Pf(*), and the
spherical (surface) harmonics Y^(p, $), because rather arbitrary functions
c-an be expressed in terms of infinite series of the Po, Pf; and' the f"' These
series of functions are analogous to the trigonometric functions, which Daniel
Bernoulli claimed could also be used to represent arbitrary functions. The
choice of class offunctions depends on the differential equation being solved
and on the initial and boundary conditions. Of course far more had to be
done and was to be done with these functions to render them more usdul
in the solution of partial differential equations.
(67)
aP aQ aP: -:-,
: 0x aR aQ aR
dl =-,
=- =-
0z 0x 0z 0y
Clairaut showed how to solve (65) by a method still used in modern texts.
The interest in equation (65) stemmed from the fact that if P, Q, R are com-
ponents of velocity in fluid motion, then (65) has to be an exact diflerential.
If (65) is not an exact differential, then Clairaut also showed that it
may be possible to find an integrating factor, that is, a function p.(x,g, z) such
that when multiplied into (65), it makes the new left side an exact differential.
Clairautse and later d'Alembert (Trail,i de l',iquilibre et du moatement des
fluillcs, 1744) gave a necessary condition that it be integrable (with the aid of
an integrating liactor). This condition (which is also sufficient) is
of a'
40. For arbitrary { it is not generally possible to actually carry out the elimination
The general integral is a concept and amounts to a collection of particular solutions'
4l- Noua. Mdn, de l'Acatt. d.e Berlin, 1772 : CEuuros' 3, 549-75'
534 PARTTAL DTFFERENTTAL EqUATTONS r Too-r8OO
If one puts into the last of these three equations the values of 0Ml0x ar..d,
dMlAg from the 6rst t!r,o, one obtains
(73) ?-?-o?+o?:0.
oc ox ' oz 'oz
This is tlre condition (68) for the integrability of (72), a condition known
before, as Lagrange remarls. In (73) q can be taken as the given function Q
of *, g, z, and p so that explicitly the equation becomes
(77) p!+o!*a9:0.
ox -og oz
What Lagrange shows readily is that if z(r, !, z) : c is a solution of (76),
then.,ir : u(x, g, z) is a solution of (77) and, conversely. Hence the problem
of solving (76) is equivalent to that of solving (77). The equation (77) in
turn is related to the system of ordinary differential equations
dxdudzduOdzR
: :
(78)
V O ? ot -=a*: =P and dx: P'
In fact, iflf: y1*,n, z) and.f: a(*,A, z) are two independent solutions of
(77), then u : h and o : cz are a solution of (78) and conversely, Hence,
if we can find the solutions u : ct and t) : cz of (78), -f : u and,J: z will
be solutions of (77) and, u : c and o : c will be solutions of (76). Moreover,
(7e)
ff:r, fl:r,, ff:rr,+t-r,,
!,: -'f' -l"t' !,:-t"-t'o'
The solution is effected by finding any one integral of (79), say u(x,g, z,!,
d : A. One solves this and;t: 0 simultaneously forp and 4 and substitutes
ior p and q in dz : pdx + qdy (see [72]). Then one integrates by the method
used for (65).
Lagrange's method is often called Cauchy's method of characteristics
becrrrse the g.neralization to z variables of the method of arriving at (79)
used by Lagrange and charpit for a differential equation in two independent
variabies p..r"ti. difficulties which were surmounted by Cauchy in IBl9''r
45. Hitt. de l'Acad. des Sci., Paris, 1784,85-lt7, t18-92, pub. 1787.
MONGE AND THE TITEORY OF CHARACTERISTICS 537
Figure 22.4
sent a curve called a characteristic curve. This curve is also the curve of
intersection of two " consecutive" members of the subfamily. The set of
characteristic curves fills out the envelope; that is, the envelope touches
each member of the subfamily along a characteristic. The general integral
is the aggregate of surfaces (envelopes of one-parameter families), each
generated by a set of characteristic curves. The envelope of all the solutions
of (80), that is, the singular solution, obtained by eliminating a and D from
(80) and the partial derivatives of (80) with respect to a and b respectively,
isz: tX.
The characteristic curve appears in another way. Consider two sub-
families of spheres whose envelopes are tangential along any one sphere. We
might call such envelopes consecutive envelopes. The curve of intersection
of these two consecutive envelopes is the same characteristic curve on the
sphere as the one obtained by considering consecutive members of either
subfamily. Any one sphere may belong to an infinity of different subfamilies
whose envelopes are all different, and so there will be different characteristic
curves on the same sphere. All are great circles in vertical planes.
Monge gave the analytical form of the differential equations of the
characteristic curves, which amounts to the fact that equations (79) dcter-
mine the characteristic curves of (69). (Monge used total differential equa-
tions to express the equations of the characteristic curves.)
Monge also introduced (1784) the notion of a characteristic cone. At
any point (x,y, z) of space (Fig. 22.a) one may consider a plane whose normal
has the direction numbers 1D, g, - l. For a fixed (r, y, z), the set ofy' and 4
which satisfy
(84) F(x,9, z, p, q) : 0
nu# * u
ffi * rffo, * o* * r# * Fz * G: o,
whcrc l, B, .. .
, G are functions of x and y. This equation is commonly
written as
(85) Ar * Bs +Ct + DP + Eq *Fz* G:0,
where the letters r, s, t, p, znd q have the obvious meanings. Laplace showed
in1773a1 that equation (85) can, by a change of variables, be reduced to
the form
(86) s+a|+bq+cz*g:0,
where a, b, c,and g are functions ofxand,y only, provided that 82 - 4AC *
0, He then solved the equation in terms of an infinite series.
ln his Failhs d'arulgn, Monge considered the nonlinear equation
(87) rtr *.Ss + Tt: V
,16. This is the tidc of thc third cdition of his Fatrilhs d'analya.
47. Hitt. dt f Acad, das Sci., Paris,1773, pub. 1777 (hprcs,9'
= 5-{Il.
MONGE AND NONLINEAR SECOND ORDER EQUATIONS 539
in which rt, S, f, and Zare functions of *,y, z, p, and 4, so that the equation
is linear oniy in the second derivatives r, J, and l' This type of
equation arose
in Lagrangels work on minimal surfaces, that is, surflaces ofleast area bounded
by giien rlpu"" wherein the specific differential equation is (l + q'3)r
".r*"r, Qhap' 24, sec' 4)' Though Monge had
-ipqs +'(l + p2)t:0. (See also (87), in the present work (1795) he
uf..dy dorr" ,o-" work on equation
** ubl" to solve it elegantly by the method we shall sketch'
By using the immediate flacts
(88) 62:pdxqqdg
(Be) dP: rdx + sdY
(e0) dq: sdx + tdy
equation
and eliminating r and I from (87), (89), and (90) he obtained the
(91) s(Rdgz - Sdxdy + Tdxz) - (Rdvdp + Tdxdq - Vdxdi : A'
and then
(e5) ut : 6(uz),
where / is arbitrary, is a first order partial differential equation' The
equation (95) is called an intermediate integral' Its general solution is the
so'lution oi 1421. If,n" other equation in (94) can be used together with
(88)
and (93), we get another function
not take time to cover. Monge's integration of the equation for minimal
surfaces was one of his claims to glory.
For the equation (87) also Monge introduced the theory of characteris-
tics. The total differential equation of the characteristics is (92), that is,
Rdg'-Sdxdg+Tdxz:0.
This equation, which appears in his work as early as l784,a8 defines at each
point of an integral surface two directions that are the characteristic direc-
tions at that point. Through each point on an integral surface there pass two
characteristic curves, along each of which two consecutive integral surf;aces
touch each other.
48. Hisl. dc I'Acad. dcs Sci., Paril, 1784' llA-92, pub. 1787.
49. Noui Comm. Acad. Sci Petrop.,6,l756157,27l-311, pub. l76l : Olera, (2), 12, 133-68.
50. Hkr. de I'Acad. dc Berlin, ll, 1755, 274-315, pub. 1757 = O\cra, (2)' 12' 54-91.
FrRST ORDER SYSTEMS 5+r
In time dt, the particle at (x,y, z) will travel a distance z dI in the r-direction,
a dt in the y-direction, and, w dt in the z-direction. Then the actual changes
dx, dy, and dz in the expression for du are given by these quantities, so that
a* : *.u
dx dt + *.a
oy dt + **
02
dt + *aJ
dt
or
du 0u Au ux- 0u 0u
(eB)
a : u -;-
oxI0--o! * oz * at'
and there are the corresponding expressions for duldt and dwldt. These
quantities give what is called now the convective rate ofchange of the velocity
at (rc, g, z) or the convective acceleration. By calculating the forces acting
on the particle at (x, g, z) and applying Newton's second law, Euler obtains
the system of differential equations
lAp du
'^ pAx At
(ee) , -iff:#
- IAP dw
^ - iaz:7i'
Euler also generalized d'Alembert's differential equation of continuity
(45) and obtained for compressible flow the equation
(100)
ap . a$u) , aGu) ,a(pu) _n
-:-f.....:--l-_:-.r--;--v.
ot ox oll oz
There are four equations and five unknowns, but the pressuref as a function
of the density, the equation of state, must be specified.
In the 1755 paper Euler says, "And if it is not permitted to us to Pene-
trate to a complete knowledge concerning the motion of fluids, it is not to
mechanics, or to the insufficiency of the known principles of motion, that
we must attribute the cause. It is analysis itself which abandons us here,
since all the theory of the motion of fluids has just been reduced to the solu-
tion of analytic formulas." Unfortunately analysis was still too weak to do
much with these equations. He then proceeds to discuss some special solu-
tions. He also wrote bther papers on the subject and dealt with the resistance
encountered by ships and ship propulsion. Euler's equations are not the
final ones for hydrodynamics. He neglected viscosity, which was introduced
seventy years later by Navier and Stokes (Chap. 28, sec. 7).
Lagrange, too, worked on fluid motion. In the first edition of his
Mhanique analgtique, which contains some of the work, he gave Euler's
542 PARTTAL DTFFERENTTAL EqUATTONS rTOO-rBOO
basic equations and generalized them. Here he gave credit to d'Alembert but
none to Euler. He too says the equations of fluid motion are too difficult to
be handled by analysis. Only the cases of infinitely small movements are
susceptible of rigorous calculation'
In the area of systems of partial differential equations, the equations of
hydrodynamics were, in the eighteenth century, the main inspiration for
mathematical research on this subject. Actually the eighteenth century
accomplished little in the solution of systems.
BibliographE
Burkhardt, H.: "Entwicklungen nach oscillirenden Funktionen und Integration
der Differentialgleichungen der mathematischen Physik," Jahres. dcr Dcat.
Math.-Verein,, 10, 1908, l-1804.
Burkhardt, H., and W. Franz Meyer: " Potentialtheorie," Emy*. der Malh. Wiss.,
B. G. Teubner, 1899-1916, 2, 1'7b,464-503.
Cantor, Moritz z Vorhsungcn frbu Gcschichle tler Mathcmatik, B. G. Teubner, 1898
and 1924; Johruon Reprint Corp., 1965, Vol' 3, 858-78, Yol.4'87T1047.
Euler, Leonard: Opta Omnia, Orell Fiissli, (l), Vols. 13 (1914) and 23 (1938); (2),
Vols. I0, ll, 12, and 13 (1947-55).
Lagrange, Joseph-Louis: (Euorcs, Gauthier'Villars, 1868-70, relevant papers in
Vols. 1, 3, 4, 5.
Laplace, Pierre-Simon : CEuans compl)tcs, Gauthier-Villars, 1893-94, rclevant
papers in Vols' 9 and 10.
Montucla, J. F.: Histoirc des mathimatiqucs (1802), Albert Blanchard (reprint),
1960, Vol. 3, pp. 342-52.
Langer, Rudolph E.: "Fourier Series: The Genesis aad Evolution of a Theory,"
Amcr. Marh. Monthlg,54, No' 7, Part 2, 1947.
Taton, Rend: L'(Euare scbntifqu de Monge, Presses Universitaires de France, 1951.
Todhunter, lsaac: A History of ttu Mathzmatiral Theorbs of Attraction and tlu Figttc of
tlu Earth (1873), Dover (reprint), 1962.
Truesdell, Clifford E.: Introduction to Izortudi Eulai Opcra Omnia Vol. X ct XI
Serfui Suundae, in Euler, Opaa Omnia, (2), I l, Part 2, Orell Fiissli' 1960.
Edito/ s Intro&utraz in Euler, Opcra Omnia, (2), Vol. 12, Orell Fiissli, 1954'
Editor's Introduetioz in Euler, Opcra Omnia, (2), Vol. 13, Orcll Fiilsli, 1956'
23
Analytic and Differential Geometry
in the Eighteenth Century
Geometry may sometimes appear to take the lead over
analysis but in fact Precedes it only as a servant goes before
the master to clear the path and light him on his rvay.
JAMES JOSEPH SYLVESTER
l. Introduction
The exploration of physical problems led inevitably to the search for greater
knowledge of curves and surfaces, because the paths of moving objects are
curves and the objects themselves are three-dimensional bodies bounded
by surfaces. The mathematicians, already enthusiastic about the method of
coordinate geometry and the power of the calculus, approached geometrical
problems with these two major tools. The impressive results of the century
were obtained in the already established area of coordinate geometry and
the new field created by applying the calculus to geometrical problems,
namely, differential geometry.
544
BASIC ANALYTIC GEOMETRY 545
Line of nodes
gcomctry. Hc presents much of what had already been done and then
studies thc general s€cond degree equation in three variables
(l) af + by2 + cz2 + dxg + cxz+fyz+ Cx + lry * kz: l.
IIe now seels to use change of axes to reduce this equation to the forms that
result from having the principal axes of the quadric surfaces represented by
( I ) as the coordinate axes. He introduces the transformation from the
Euler uses this transformation to reduce (l) to canonical forms and obtains
six distinct cases: cone, cylinder, ellipsoid, hyperboloid of one and two
shees, hyperbolic paraboloid (which he discovered), and parabolic cylinder.
Like Descartes, Euler maintained that classification by the degree of the
equation was the correct principle; Euler's reason was that the degree is
invariant under linear transformation.
Aftcr continuing work on this problem of change of axes, he wrote
another paper,s in which he considers the transformation that will carry
3. Novi Comm. Acad. Sci. Petro!., 15, 1770,75-106' pub. l77l : Ofuila, (l)' 6, 287-315.
HIGHER PLANE CURVES 547
,c2+ gz * zs into x,2 * y'2 + z'2. Here he-and Lagrange a little later,
in a paper on the attraction of spheroids a--gave the symmetric form for
t}te rotation of axes, the homogeneous linear orthogonal transformation
x: \x, * Py, + vz,
!:i'x'*1t'g'+v'z'
z: \, x, + lt'y, + v"z',
where
|: + .tr,r : I
.u ),e lp + )r,p, * l, t", : O
p2 +p,2*p,,: l lyatrr,*)\,y":0
uzay,2 !yD2:l pu * 1t'v' t 1t'v' :0.
The authors show that every plane section of a second degree surface is a
second degree curve, and that parallel planes cut out similar and similarly
placed curves. These results parallel Archimedes' geometric theorems. The
authors also show that the hyperboloid of one sheet and the hyperbolic
paraboloid are ruled surfaces, that is, each can be generated in two different
ways by the motion of a line or each surfiace is formed by two systems of
lines. The result on the one-sheeted hyperboloid was known by 1669 to
Christopher Wren, who said that this figure could be Senerated by revolving
a line about another not in the same plane. With the work of Euler, Lagrange,
and Monge, analytic geometry became an independent and full-fledged
branch of mathematics.
t \t
\/
O) t a\ ).(
Figure 23.2
The first extensive study of higher plane curves was made by Newton.
Impressed by Descartes's plan to classify curves according to the degree of
their equations and then to study systematically each degree by methods
suited to that degree, Newton undertook to study third degree curves. This
work appeared in his Enumcratio Linearum Tertii Ordinis, which was published
in 1704 as an appendix to the English edition of his Opticks but had been
composed by 1676. Though the use of negative *- and y-values appears in
works of La Hire and Wallis, Newton not only uses two axes and negative
x- and y-values but plots in all four quadrants.
Newton showed how all curves comprised by the general third degree
equation
(3) axs + bxzy a cxyz t dgs + ex2 + fxg + gg2 + htc +jy + k:0
can, by a change of axes, be reduced to one of the following four forms:
(a) rgz * e! : a# + bx2 + cx + d
(b) x!:ax,+bx2+cx+d
(c) !2:ax'+bx2+cx*d
(d) !:ax'+bxz+cx+d.
The third class, which Newton called diverging parabolas, contains five
species of curves whose types are shown in Figure 23.2, The species are
distinguished by the nature of the roots of the cubic right-hand member,
as follows: all real and distinct; two roots complex; all real but two equal
and the double root greater or less than the simple root; and all three equal.
Newton affirmed that every cubic curve can be obtained by projection of
one of these five types from a point and then by a section of the projection.
Newton gave no proofs of many of the assertions in his Enumeratio. ln
his Lineae, James Stirling proved or reproved in other ways most of the
assertions but not the projection theorem, which Clairaut6 and Frangois
Nicole ( 1683-1 758) ? proved. Also, whereas Newton recognized seventy-two
6. Mim, de l'Acad. des Sci., Paris, 1731, +90-93, pub. 1733.
7. Min. de I'Acad. des Sci., Paris, 1731, 49.t-510, pub. 1733.
HIGHER PLANE CURVES 549
species' of third degree curves, Stirling added four more and abb6 Jean-Paul
de Gua de Malves, in a little book of 1740 entitled Usage d.e l'analgse de
Descartts pour dicouarir sans Ie secours tlu calcul difircntial . . . , added two more'
Newton's work on third degree curves stimulated much other work on
higher plane curves. The topic of classifying third and fourth degree curves
in accordance with one or another principle continued to interest mathema-
ticians of the eighteenth and nineteenth centuries. The number of classes
found varied with the methods of classification.
As is evident from the figures of Newton's fi.ve species of cubic curves'
the curves of higher-degree equations exhibit many peculiarities not found
in first and second degree curves. The elementary peculiarities, called
singular points, are inflection points and multiple points. Before proceeding,
let us see what some of them look like.
Inflection points are familiar from the calculus. A point at which there
are two or more tangents which may coincide, is called a multiple point'
At such a point two or more branches ofthe curve intersect. If two branches
intersect at the multiple point, it is called a double point. If three branches
intersect then the point is called a triple point, and so on.
If we take the equation of an algebraic curve
"f(*,v)
: o,
;f being a polynomial in x andy, we can by a translation always remove the
constant term. If this is done and if there are first degree terms in;f, say
ap ! b1y, then arx 'l bry :0 gives the equation of the tangent to the
curve at the origin. The origin is not in this case a multiple point. If there
are no first degree terms, arld if a2x2 a b2xy I cry2 ate the second degree
terms, then several cases arise. The equation arxz + b2xg * czyz :0 lllay
represent two distinct lines. These lines are tangents at the origin (this can
be proven), and since there are two distinct tangents the origin is a double
point; it is called a node. Thus the equation of the lemniscate (Fig. 23.3) is
(4) o'(!' - x') + (!" f x2)e : S
and the second degree terms yield U2 - xz : 0. Then g : x and ! : -x
are the equations of the tangents. Likewise, the folium of Descartes (Fig'
23.4) has the equation
(5) f +ys:3a1,,
and the tangents at the origin, which is a node, are given by r : 0 and
v:0.
When the two tangent lines are coincident, the single line is considered
as a double tangent and the two branches of the curve touch each other
at the point of tangency, which is called a cusp. (Sometimes cusps are
550
Lemnircatc
Figurc 23.3, Lemnircatc Figure 23.4. Folium of Descartes
included zrmong the double points.) Thus the semicubical parabola (Fig.
23.5)
(6) ry,z:f
has a cuspat the origin, and thc equation of the two coincident tangents
foyr : 0. On the curve (y - i)' : f (Fig. 23.6) the origin is a cusp.
Here both branches lie on the same side of the double tangent, which is
I : 0. De Gua in his Usage had, tried to show that this type of cusp could
not occur, but Euler 8 gave many examples, A cusp is also called a stationary
point or point of retrogression because a point moving along the curve
must come to rest before continuing its motion at a cusP.
When the two tangent lines are imaginary, the double point is called a
conjugate point. Thc coordinates of the point satisfy the equation of the
curve but the point is isolated from the rest of the curve. Thus the curve
(Fig. 23.7) of gz : *(2x - l) has a conjugate Point at the origin. The
equation of the double tangent there is !2 : -x2 and the tangents are
imagrnary.
The curve of ays - 3ax2g : *a (Fig. 23.8) has a triple point at thc
origin. The equation of the three tangents is
aYs-3a*2Y:O
ory:0 andy: +xnft,.
The curve of aga - ax'y' : xE (Fig. 23.9) has a quadruple Point at
the origin. The origin is a combination of a node and a cusp. The tangents
arey :0,9 : O,g : +x.
Curves of the third degree (order) may have a double point (which
may be a cusp) but no other multiple point. There are of coursc cubics with
no double point.
To return to the history proper, many of these special or singular
points on curves were studied by Leibniz and his sucoesson. The analytical
conditions for such points, such as that f : 0 at an inflection point and
that 3i is indeterminate at a dotrble point, werc known wen to the foundcrs
of the calculus.
Clairaut in the l73l book referred to above assumed that a third dcgrec
curve cannot have more than three real inflection points aud must have at
least one. De Gua in Usagc proved that if a third degree curve has three
real inflection points, a line through two of thern passes through the third
Figure 23.9
552 ANALYTTC AND DIFFERENTIAL GEOMETRY rTOO-r8OO
The total neighborhood of a point (*0, ys) of an algebraic plane curve can
be expressed by a finite number of developments
(7) U -lo: a1(x - xo)qttoo I a2(x - rco)cztqo +"'.
These developments convergein some interval about ro and all the 4, have
no common factors. The points given by each development are called a
branch of the algebraic curve.
The intersections of a curve and line and of two curves is another topic
that received a great deal of attention. Stirling, in lrris Lineae of 1717, showed
that an algebraic curve of the zth degree (in x and, g) is determined by
n(n + 3) 12 of its points because it has that number of essential coefficients'
He also asserted that any two parallel lines cut a given curve in the same
number of points, real or imaginary, and he showed that the number of
branches of a curve that extend to infinity is even. Maclaurin's work, Geometria
Organica, founded the theory of intersections of higher plane curves' He
generalized on results for special cases and on this basis concluded that an
equation of the mth degree and one of the zth degree intersect in mz points'
In 1748 Euler and Cramer sought to prove this result, but neither gave
a correct proof. Eulerlo relied upon an argument by analogy; realizing
that his argument was not complete, he said one should apply the method
to particular examples. Cramer's " proof" in his book of 1750 relied entirely
on examples and was certainly not acceptable. Both men took into account
points of intersection with imaginary coordinates and infinitely distant
points and noted that the number zzz will be attained only if both
"o*-o,
types of points are included and if any factor, such as at * Dy, common
to both curves is excluded. Ilowever, both faited to assign the proper multi-
plicity to several types of intersections. ln 1764 Etienne Bezout (173G43)
gave a better proof of the theorem, but this was also incomplete in the count
of the multiplicity assigned to points at infinity and multiple Points' The
proper count of the multiplicity was settled by Georgcs-Henri Halphen
(1844-89) in 1873.11
In his book of 1750 Cramer took up a paradox noted by Maclaurin in
his Geometria concerning the number of points common to two curves' A
curve ofdegree z is determined by n(n + 3)/2 points. Two rth degree curves
meet in z2 points. Now if z is 3, the curve should, by the first statement, be
determined by 9 points. But since two third degree curves meet in 9 points,
these 9 points do not determine a unique third degree curve. A similar
paradox arises when z :4. Cramer's explanation of the paradox, now
iefe.red to as his, was that the z2 equations that determine the z3 points
of intersection are not independent. All cubics that Pass through 8 fixed
lO. Mdn. tlc I'Acad. tle Berlin,4, l7+8,234-48 = Oierd, (l), 26' 46-59'
ll. Bull. Soc. Math. de Frarce, l, 1873, 13G48;2, 1873,34-52;3, 1875,76-92: (Euarcs,
points on a given cubic must pass through the same ninth fixed point. That
is, the ninth point is dependent on the first B. Euler gave the same explanation
in 1748.13
In 1756 Matthieu B. Goudin (1734-1917) and Achille-Pierre Dionis
du Sdjour (1734-94) wrote the Traitl dts courbcs algCbriquer. Its new features
are that a curve oforder (degree) z cannot have more than n(n - l) tangents
with a given direction, nor more than z asymptotes. As had Maclaurin,
they pointed out that an asymptote cannot cut the curve in more than
a - 2 points.
The two best eighteenth-century compendia of results on higher plane
curyes are the second volume of Euler's Infioductia (1748) and Cramer's
Lignes curbcs algibrigras. The latter book has a unity ofviewpoint, is excellently
set forth, and contains good examples. The work was often cited, even to
the poiirt of crediting Cramer with some results that were not original with
him.
12. Mhn. dc I'Acad. dc Bnlin,4, l7+8,219-33, pub. 1750 : OPcra, (l), 26' 33-45.
PLANE CURVES 555
Figure 23.10
5. Plane Cumes
The first applications of the calculus to curves dealt with plane curves.
Some of the concepts subsequently treated by the calculus were introduced
by Christian Huygens, who used purely geometrical methods. His work
in this direction was motivated by his interest in light and in the design
of pendulum clocks. In 1673, in the third chapter of }:ris Horohgium Oscilla-
torium, he introduced the involute ofa plane curve C. Imagine a cord wraPped
around C from P, to the right (Fig. 23. l0). The end at P1 on C is held fixed
and the other unwound while the cord is kept taut. The locus C' of the free
end is an involute of C. Huygens proved that at the free end the cord is
perpendicular to the locus C'. F,ach point of the cord also describes an
involute; thus C" is also an involute, and Huygens proved that the involutes
cannot touch one another. Since the cord is tangent to C at the Point where
it just leaves C, it follows that every orthogonal trajectory of the family of
tangents to a curve is an involute of the curve'
Huygens then treated the evolute of a plane curve. Given a fixed
normal at a point P on a curve, as an adjacent normal moves toward it
the point of intersection of the two normals attains a limiting position on
the fixed normal, which is called the center of curvature of the curve at P-
The distance from the point on the curve along the fixed normal to the
limiting position was shown by Huygens to be (in modern notation)
556 ANALYTIC AND DIFFERENTIAL GEOMETRY ITOO_IBOO
Figure 23.1 I
This length is the radius of the curvature of the curve at P. The locus of
the centers of curvature, one on each normal, is called the evolute of the
original curve. Thus the curve C above is the evolute of any one of its
involutes. In this work Huygens proved that the evolute of a cycloid is a
cycloid or, more precisely, the evolute of the left half of the lower cycloid
in Figure 23.11 is the right half of the upper cycloid. This theorem was
proved analytically by Euler in l764.to The significance of the cycloid lor
Huygens's work on pendulum clocls is that a pendulum bob swinging
along a cycloidal arc takes exactly the same time to complete swings of
large and of small amplitude. For this reason the cycloid is called the
tautochrone.
Newton, too, in his Gcomctria Analytica (published in I 736 though most
of it was written by 1671) introduces the center of curvature as the limiting
point of intersection of a normal at P with an adjacent normal. He then
states that the circle with center at the center of curvature and radius equal
to the radius of curvature is the circle of closest contact with the curve at P;
that is, no other circle tangent to the curve at P can come between the
curve and the circle of closest contact. This circle of closest contact is called
the osculating circle, the term " osculating " having been used by Leibniz
in a paper of l686.1a The curvature of this circle is the reciprocal of its
radius and is the curvature of the curve at P. Newton also gave the formula
for the curvature and calculated the curvature of several curves, including
the cycloid. He noted that at a point ofinflection a curve has zero curvature.
These results duplicate those of Huygens, but probably Newton wished to
show that he could use analytical methods to establish them.
In 169l John Bernoulli took up the subject ofplane curves and produced
some new results on envelopes. The caustic of a family of light rays, that is,
the envelope of the family, had been introduced by Tschirnhausen in I682.
In the Acta Eruditorum of 1692 Bernoulli obtained the equations of some
caustics, for example, the caustic of rays reflected from a spherical mirror
when a beam of parallel rays strikes it.ls Then he tackled the problem posed
13. Noui Comm. Acad. Sci. Petrop., 10, 176+, l?9-98, pub. 1766 = Opera, (l),27, 38+-+00.
14. Acta Erud., 1686, 289-92 : Math, Schrrfkn, T , 326-29.
15. OPera, l, 52-59.
SPACE CURVES 55/
to him by Fatio de Duillier, to find the envelope of the family of parabolas
that are the paths of cannon balls fired from a cannon with the same initial
velocity but at various angles ofelevation. Bernoulli showed that the envelope
is a parabola with focus at the gun. This result had already been established
geometrically by Torricelli. ln the Acta Eruditorum of 1692 and 169416
Leibniz gave the general method of finding the envelope of a family of
curves. If the family is given by (in our notation) f (x, y, ,) : 0, where c is
the parameter of the family, the method calls for eliminating a between
-f : 0 and, AflAa : 0. L'Hospital's text, L'Analyse d'es itfiniment pelils (1696),
helped to perfect and spread the theory of plane curves.
6. Space Curues
Clairaut launched the theory of space curves, the first major development
in three-dimensional differential geometry. Alexis-Claude Clairaut ( I 713-65)
was precocious. At the age of twelve he had already written a good work on
curves. In I 731 he published Recherche sur les courbes d doublc courbure, which
was written in 1729 when he was but sixteen. In this book he treated the
analytics of surfaces and space curves (sec. 2). Another paper by Clairaut
led to his election to the Paris Academy of Sciences at the unprecedented
age of seventeen. In 1743 he produced his classic work on the shape of the
earth. Here he treated in more complete form than Newton or Maclaurin
the shape a rotating body such as the earth assumes under the mutual
gravitational attraction of its parts. He also worked on the problem of
three bodies, primarily to study the moon's motion (Chap. 21, sec. 7), and
wrote several papers on it, one of which won a prize from the St. Petersburg
Academy in 1750. In 1763 he published his Thiorie de la lune. Clairaut had
great personal charm and was a figure in Paris society.
In his I 731 work he treated analytically fundamental problems of
curves in space. He called space curves " curves of double curvature "
because, following Descartes, he considered their projections on two PerPen-
dicular planes. The space curve then partakes of the curvatures of the two
curves on the planes. Geometrically he thought of a space curve ,rs the
intersection ol two surfaces; analytically the equation of each surface was
expressed as an equation in three variables (sec. 2). Clairaut then studied
tangents to curves of double curvature. fle saw that a sPace curve can have
an infinity of normals located in a plane perpendicular to the tangent. The
expressions for the arc length ofa space curve and the quadrature of certain
areas on surfaces are also due to him.
Though Clairaut had taken a few steps in the theory of space curves,
very little had been done in this subject or in the theory ol surfaces by 1750.
This is reflected in Euler's fttro d.uctio of 1748, where he presented the differen-
tial geometry of planar and spatial figures. The first was rather complete,
but thc second was scanty.
The next major step in the differential geometry of space curyes was
taken by Euler. A great deal of his work in differential geometry was moti-
vated by his use of curves and surfaces in mechanics. flis Mechanica ( I 736),17
written when he was twenty-nine, is a major contribution to the analytical
foundation of mechanics. He gave another treatment of the subject in his
Tluoria Motus Cor\orun Solidorum seu Rigidorum (1765).18 In this book he
derived the currently used polar coordinate formulas for the radial and
normal components of acceleration of a particle moving along a plane
cuwe, namely,
dzr td^2
q:4iz d20 zVV'
4e: r77 _rLd|.
-'\a) ' +
Figure 23.12
the two neighboring tangents of the two points that are ds apart along the
curve. Euler's definition of the radius of curvature of the curve is
ds'
a'
He then derives an analytical expression for the radius of curvature:
ds2 r ]--l
16r P:lffil:v6p1 -
The plane through ds,and the center o is Euler's definition of the osculating
plane at (*,!, Bernoulli, who introduced the term, regarded the
"1. John
pl.r" u, determined by three " coincident " points. Its equation, as given
by Euler, is
where I is determined by the point (x,y, z) on the curve through which the
osculating plane passes. This equation is equivalent to the ooe we write
today in vector notation as
(R - r).r'; r' : 0,
where r(s) is the position vector \dth resPect to some point in space of the
point on ih" at which the osculating plane is determined, and R is
"r*i
the position vector of any point in the osculating plane' In vector form r is
given by
r(s)i +y(s)j + z(s)k
and R has the form Xi + yi I Zk, whete (X, Y, Z) are the coordinates
of R.
Clairaut had introduced the idea that a space curve has two curvatures'
One of these was standardized by Euler in the manner just described' The
other, now called '" torsion " and representing geometrically the rate at
which a curve departs from a plane at a point (*, y,
'),
was formulated
explicitly and analytically by Michel-Ange Lancret (177'1-1807), an
and mathematician who was a student of Monge and worked in
"rriir"".
his spirit. He singled out21 three principal directions at any point of a
21. Mdn. diocrs Savans, l, 1806, 416-54.
560 ANALYTIo AND DTFFERENTTAL GEoMETRY r.7oo-r8oo
curve. The first is that of the tangent. " Successive" tangents lie ih a plane,
the osculating plane. The normal to the curve that lies in the osculating
plane is the principal normal, and the perpendicular to the osculating plane,
the binormal, is the third principal direction. Torsion is the rate of change
of the direction of the binormal with respect to arc length; Lancret used
the terminology, flexion of successive osculating planes or successive bi-
normals.
Lancret represented a curve by
* : 6Q),{(z) v :
and called /p the angle between successive normal planes and dv that
between successive osculating ones. Then, in modern notation,
dr':!. 4!:!.
dspdsr
where p is the radius of curvature and r is the radius ol torsion.
Cauchy improved the lormulation of the concepts and clarified much
of the theory of space curves in his famous Legons sur les applications du calcul
itfrnitesimal d la giomitrie (1826).22 He discarded constant infinitesimals, the
ds's, and straightened out the confusion between increments and differen-
tials. He pointed out that when one writes
ds2:dx2+dy2+dzz
one should mean
(f)':H'*(*)'.(*)'
Cauchy preferred to write a surface as w(x,y, z) : 0 instead ofthe unsym-
metric form z : "f(x,y), and he wrote the equation ofa straight line through
the point ((, l, () as
k:! : P
,{@, +-ATT,\-'.
c(, cos p,
Then he proves that, if the direction cosines of the tangent are cos
and cos y,
/(cos a) tr
*..:__E_: , " :---k B): P,fl'
cos d(cos cos
P,
(e) ,, d (cos y) cos v
-dsP
where p is the radius ol curvature already introduced and cos '\, cos p, and
cos y are the direction cosines ofa normal, which he takes to be the
principal
one. He shcws next that
lda
i:a
where is the angle between adjacent tangents'
ro
He introduces the osculating plane as the plane of the tangent and
principal normal. The normal to this plane is the binormal, and its direction
cosirr.t L, cos M, and cos I[ are given by the formulas
"ot
cos I cos M cos N
6F;=-A;AT: V;T; --Et du: 7;F;671'
He can then Prove that
L ,\ d cos M
:-;,
cos 14 d cos 'ly' cos /
(r0) a-:
d cos cos
---A, ---E-
",
where is the torsion, and that the torsion equals do//s, where O is the
l/r
angle between osculating Planes.
Formulas (9) and ( 10) are two of the three famous Serret-Fr6net
formulas, the third being
where I /r is the torsion arrd I lp is the curvature. These formulas (9), ( t0),
and (l l), which give the derivatives of the direction cosines of the tangent,
binormal, and normal respectively, were published by Joseph Alfred Serret
(l8l9+5) in 185138 and Fr6deric-Jean Frdnet (1816-1900) in l852.2a The
significance of curvature and torsion is that they are the two essential
properties of space curves. Given the curvature and torsion as functions
of arc length along the curve, the curve is completely determined except
for position in space. This theorem is readily proven on the basis of the
Serret-Fr6net formulas.
representation x(t,u), y(t, a), and z(l,z) such that the partial derivatives
satisfy the conditions (14).
Euler then investigated the relationship between space curves and
developable surfaces and showed that the family of tangents to any space
curve fills out or constitutes a developable surface. He tried unsuccessfully
to show that every developable surface is a ruled surf;ace, that is, one gener-
ated by a moving straight line, and conversely. The converse is, in fact,
not true.
The subject of developable surfaces was taken up independently by
Gaspard Monge. With Monge geometry and analysis supported each other'
He embraced simultaneously both aspects of the same problem and showed
the usefulness of thinking both geometrically and analytically' Because
analysis had dominated the eighteenth century, despite some analytic
geometry and differential geometry by Euler and Clairaut, the effect of
Mo.rg." double view was to put geometry on at Ieast an equal basis with
analysis and then to inspire the revival of pure geometry. He is the first real
innovator in synthetic geometry after Desargues.
His extensive work in descriptive geometry (which primarily serves
architecture), analytic geometry, differential geometry, and ordinary and
partial differential equations won the admiration and envy of Lagrange'
Th. Irtt"., after listening to a lecture by Monge, said to him, "You have
just presented, my dear colleague, many elegant things. I wish I could have
doni them." Monge contributed much to physics, chemistry, metallurgy
(problems of forges), and machinery. In chemistry he worked with Lavoisier'
lidorg" sr* the need for science in the development ofindustry and advocated
industrialization as a path to the betterment of life. He was inspired by an
active social concern, perhaps because he knew the hardships of humble
origins. For this reason, too, he supPorted the French Revolution and served
u. Mirrirt". of the Navy and as a member of the Committee on Public Health
in the succeeding governments. He designed armaments and instructed
governmental staff in technical matters' His admiration for Bonaparte
seduced him into following the latter in his counter-revolutionary measures.
Monge helped to organize the Ecole Polytechnique and as a professor
there lounded a school of geometers. IIe was a great teacher and a force in
inspiring nineteenth-century mathematical activity. His vigorous and fertile
lectures communicated enthusiasm to his students, among whom were at
least a dozen of the most famous men of the early nineteenth century'
Monge created results in three-dimensional differential geometry which
go far beyond Euler's. His paper of 1771, Mhnoire sur les diaeloppies, les rayons
d, ,ourburr, et les difirents genres d'infexions fus courbes d double courbure, published
much later,3a was followed by his Feuilles d'analgse appliquie d la giomitrie
(1795, second ed. l80l). The Feuilles was as much differential geometry as
34. Mim. diaers Saaans, 10, 1785, 5ll-50'
566 ANALyrrc AND DIFFERENTTAL oEoMETRY IToo-IBoa
z*xZyy - z?n : 0.
two points distant from the surface by tJre amount of the two principal
curvatures. Each set ofpoints determined by one of the principal curyatures,
one on each of the normals to a line of curvature, lies on the edge of regres-
sion of the developable surface formed by that set ol normals, so that the
normals are taDgert to tlat edge. The edges of regression of one family of
developable surfaces all constitute a surface called a center surface. There
are then two such center surfaces (Fig. 23.13). The envelope of each family
of developable surfaces is called a focal surface.
4O. Noai Comm. Acad. Soi. Parol., l+, 1769, t0't-28, pub. l77O: Olcra, (l)' 28' 99-119.
41. Acta Acad. Sci. Petrop.,l, 1777, lO7'32 and 133-42, pub' 1778 : Olera, (l),28,
248-75 atd 276-87.
42. Nouu, Mtim. de I'Acad, dc Berlin, 1779, 16l-210' pub. l78l : CEuttrest 4,637-92.
BTBLTOGRAPTTY 57|
the earth's surface onto a plane area that transform latitude and longitude
circles into circular arcs.
Further progress in the mapping problem and in conformal mapping
in particular awaited the extension of differential geometry and complex
function theory.
Bibliogmphy
Ball, W. W. Rouse: "On Newton's Classification of Cubic Curves," Pnccedhgs
hndon Mathematical Societg, 22, 1890, 10,H3.
oJ thc
Bernoulli, John: Opera Omnia, 4 vols., 1742, reprint by Georg OIms, 1968.
Berzolari, Luigi: "Allgemeine Theorie der hiiheren ebenen algcbraischen Kurven,"
bugk, der Math. WLss.,B. G. Teubner, 1903-15, III C4,313-455.
Boyer, Carl B.: History of Analytic Gconutry, Scripta Mathematica, 1956, Chapo.
6-8.
A History of Mathematics, John Wilcy and Sons, 1968, Chaps. 2O and 21.
Brill, A., and Max Noether: " Die Entwicklung der Thcoric der algebraischen
Funktionen in ilterer und neuerer Zeit," Jahres. der Dcut. Math.-Vach., 3,
r8s2/3, 107-56.
Cantor, Moritz t Vorlcsungm iber Geschichtc dcr Matlumatik, B. G. Teubner, 1898
ard 1924; Johnson Reprint Corp., 1965, Vol.3, 18-35,748-829; Vol.4,
37s-88.
Chasles, M.t Ap*gu historiquc sur l'origitu cl lc deuloppnunt dcs mithodcs cn gionitrit
(1837), 3rd ed., Gauthier-Villars, 1889, pp. 142-252-
Coolidge,Julian L. : " The Beginnings of Analytic Geometry in Three Dimensions,"
Anur. Math. Monthly, 55, 1948, 76-86.
A History of Geometrical Methods, Dover (reprint), 1963, pp. 13+-40,
318-46.
Tfu Mathcmatics d Great Amakurs. Dover (reprint), 1963, Chap. 12.
A H;stnry of Conic Sections and Quadric Surfaccs, Dover (reprint), 1968.
Euler, Leonhardt Opera Omnia, Orcll Fiissli, Series l, Vol. 6 (1921), Vols. 26-29
(1953-56); Series 2, Vols. 3 and 4 (1948-50), Vol. 9 (1968).
Huygens, Christian: Horclogium Oscillatorium (1673), reprint by Dawsons, 1966;
also in Huygens, (Euwcs Compl2us, 18,27438.
Hofmann, Jos. E. : "UberJakob Bernoutlis Beiriige zur Infinitcsimalmathematik,"
L' Enscigument MatMnatiqu, (2), 2, 61-17 1, 956 ; published separately by
1
For since the fabric of the universe is most Perfect and the
work of a most wise Creator, nothing at all takes place in the
universe in which some rule of maximum or minimum does
not appear. LEONHARD EULER
r:1."#ffi*
Jli
s74 THE CALCULUS Or VARTATTONS r TOO-r8OO
by choosing the proper function y(*) for the shape of the curve that is to
be rotated around the x-axis (Fig. 24. t). The peculiar feature ofthis problem
(and of calculus of variations problems generally) is that it poses an integral
whose value depends upon an unknown function y(r) which appears in
the integrand and which is to be determined so as to make the integral a
minimum or a maximum.
Newton's solution, though it did use the idea of introducing a change
in the shape of a part of the meridian arc y(x), which is almost what the
essential method of the calculus of variations involves, is not typical of the
technique ofthe subject and so we shall not look into it. It may be ofinterest
that the parametric equations of the proper y(r) are
where y' is the parameter. Of this work Newton says, "This proposition I
conceive may be of use in the building of ships." Problems of this nature
have become important in the design not only of ships but submarines and
airplanes.
In the Acta Eruditorum of June 16961 John Bernoulli proposed as a
challenge to other mathematicians the now llamous brachistochrone problem'
The problem is to determine the path down which a particle will slide from
one given point to another not directly below in the shortest time. The initial
velocity o1 at P1 Gig. 2a.4 is given; friction and air resistance are to be
neglected. In modern form, this problem is to minimize the integral "I which
represents the time of descent where
and the answer is of course a line segment' In the eighteenth century the
geodesic problem of most interest concerned the shortest Paths on the surface
If ,h" whose precise shape was not known, though the mathematicians
"r.itt,
believed it was some form of ellipsoid and most likely a figure of revolution.
The early work on geodesics already noted (Chap' 23, sec' 7) did not use
themethodofthecalculusofvariationsbutitwasclearthatspecialdevices
would not be powerful enough to treat the general geodesic problem'
2. Acta Erud., 1697, 206-t I = Opcta, I' 187-93.
3. Acta Erutl., 1697, 2ll-17 = Opotd' 2, 768-78'
4. open, l; 187-93.
576 THE cALcuLUs oF vARrATroNs r Too-r8oo
J : j*'ftr,v,u) d*
and call for finding the y(*) that extends from (rr,yr) to (xr, yr) and that
minimizes or maximizes J. Another class of problems, called isoperimetrical
problems, also entered the history of the calculus of variations at the end
of the seventeenth century. The progenitor of this cldss of problems, of all
closed plane curves with a given perimeter to find the one that bounds
maximum area, may date back to pre-Greek times. There is a story that
Princess Dido of the ancient Phoenician city of Tyre ran away from her
home to settle on the Mediterranean coast of North Africa. There she
bargained for some land and agreed to pay a fixed sum for as much land
as could be encompassed by a bull's hide. The shrewd Dido cut the hide
into very thin strips, tied the strips end to end and proceeded to enclose
an area having the total length of these strips as its perimeter. Moreover,
she chose land along the sea so that no hide would be needed along the
shore. According to the legend Dido decided thar the length of hide should
form a semicircle-the correct shape to enclose maximum area.
Apart from the work ofZenodorus (Chap. 5, sec. 7), there was practically
no work on isoperimetrical problems until the end ofthe seventeenth century.
In a move to challenge and embarrass his brother, James Bernoulli posed
a rather complicated isoperimetrical problem involving several cases in the
Acta Eruditorum of May l697.sJames even offered John a prize of fifty ducats
for a satisfactory solution. John gave several solutions, one of which was
obtained in 1701,6 but all were incorrect. James gave a correct solution.?
The brothers quarreled about the correctness of each other's solutions.
Actually James's method, as in the case of the brachistochrone problem,
was a major step toward the general technique soon to be fashioned. In
l7l8 John 8 considerably improved his brother's solution.
Analytically the basic isoperimetric problem is formulated thus. The
possible curves are represented parametrically by
x:x(t), s:yU), \<t<t2
and because they are closed curves, x(t) : tc(t2) and.y(tr) : y(tr). Moreover
no curve must intersect itself. The problem then calls for determining the
x(t) and, y(t) such that the length
t: f" t/@f 1 1yf at
JI,
5. Page 214.
6. Min, de I'Acad. des. Sci., Paris, 1706,235: Olera, 1,424.
7. Acta Erud., 1701,213 ff, : Oicra, 2,897-920.
8. M,in. de I'Aeqd. du Sci., Paris, 1718, 100 ff, : Opera,2,23549.
THE EARLY WORK OF EULER s77
(2) r, - !bu) : o.
This notation must be understood in the following sense. The integrand
"f(x,A,y') is to be regarded as a function of the independent variables x,
!, arrdy'insofar as rf, and fu, are concerned. However dJr,ldx rnust be taken
to be the derivative of Ju, wherein fo, depends on * through x, y, and g'.
That is, Euler's differential equation is equivalent to
(3) f, - fr'* - fr'uY' -.fr'u'!' : o.
ll. Comm. Aead. Sci. Petrop.,8, 1736, 159-90, pub. l74l : O?erq (l), 25, 54-80.
THE PRINCIPLE OF LEAST ACTION 579
The differential equation (3) is not the proper one when the integrands
of the integrals to be minimized or maximi zed, are more complicated than
-the
in (l). In years from 1736 to 1744 E;uler improved his methods and
obtained the differential equations analogous to (3) for a large number of
problems. These results he published in 1744 in a book, Methodus Inaeniendi
Lioro, Curro, Maximi Minimiae Proprictat'e Gaudentes (The Art of Finding
Curved Lines Which Enjoy Some Maximum or Minimum Property)'l2
Euler's work in his Method,us was cumbersome because he used geometric
considerations, successive differences, and series and he changed derivatives
to difference quotients and integrals to finite sums' He failed, in other
words, to makJ most eflective use of the calculus' But he ended with simple
and elegant formulas applicable to a large variety of problems; and he
treated i large number of e*"mpl.s to show the convenience and generality
of his methoJ. One example deals with minimal surfaces of revolution. Here
the problem is to determine the plane curve y : J@) lying between ('ro, yo)
and (xr,y1) such that when revolved around the *-axis it generates the
surface of least area. The integral to be minimized is
(4) n:l.'z,y{T1-sad*.
Euler proved that the function;t(x) must be an arc ofa catenary; the surface
so gerirrted is called a catenoid. In an appendix to his 1744 book Euler
.lro' g.\re a definitive solution of the elastic-rod problem referred to above.
He ,r-ot orrly deduced that the shape of the rod took the form of an elliptic
integral, but also gave solutions for different kinds of end-conditions. This
boof brought hirrr immediate fame and recognition as the greatest living
mathematician.
With this work the calculus of variltions came into existence as a new
branch of mathematics. However, geometrical arguments were used exten-
sively, and the combined analytical and geometrical arguments were not
only complicated but hardly provided a systematic general method' Euler
was fully aware of these limitations'
Figure 24.4
takes, is shorter than any other path, such as PR'Q, which it could conceiv-
ably take. Since the light takes the shortest path, if the medium on the
upper side of the line.R.R'is homogeneous, then the light travels with constant
velocity and so takes the path requiring least time. Heron applied this
principle of shortest path and least time to problems of reflection from
concave and convex spherical mirrors.
Basing their case on this phenomenon of reflection and on philosophic,
theological, and aesthetic principles, philosophers and scientists after Greek
times propounded the doctrine that nature acts in the shortest possible way
or, as Olympiodorus (6th cent. A.b.) said in his Catoptrica, " Nature does
nothing superfluous or any unnecessary work." Leonardo da Vinci said
nature is economical and her economy is quantitative, and Robert Grosseteste
believed that nature always acts in the mathematically shortest and best
possible way. In medieval times it was commonly accepted that nature
behaved in this manner,
The seventeenth-century scientists were at least receptive to this idea
but, as scientists, tried to tie it to phenomena that supported it. Fermat
knew that under reflection light takes the path requiring least time and,
convinced that nature does indeed act simply and economically, affirmed
in letters of 1657 and 166213 his Principle of Least Time, which states that
light always takes the path requiring least time. He had doubted the correct-
ness of the Iaw of refraction of light (Chap. 15, sec. 4) but when he found
in 16611{ that he could deduce it from his Principle, he not only resolved
his doubts about the law but felt all the more certain that his Principle was
correct.
Fermat's Principle is stated mathematically in several equivalent forms.
According to the law of refraction
srn, a7
srn,r t2
where a, is the velocity of light in the first medium and ,2 in the second. The
ratio of rl to z, is denoted by z and is called the index of refraction of the
13. (Eut res, 2, 354-59, 457-63.
14. (Euures, 2, 457-63,
THE PRTNcTPLE oF LEAsr AcrroN 58,
second medium relative to the first, or, if the first is a vacuum, a is called
the absolute index of refraction of the nonvacuous medium. If, denotes
the velocity of light in a vacuum, then the absolute index z : c/z where a is
the velocity of light in the medium. If the medium is variable in character
lrom point to point, then z and a are functions of x, y, and z. Hence the time
required for Iight to travel from a point P1 to a point P, along a curve *(o),
y(o), z(o) is given by
where o, is the value of o at Pl and o2 the value at P2. Thus the Principle
states that the path light actually takes in traveling from P, to P, is given
by the curve which makes .f a minimum.ls
By the early eighteenth century the mathematicians had several im-
pressive examples of the llact that nature does attempt to maximize or
minimize some important quantities. Huygens, who had at first objected
to Fermat's Principle, showed that it does hold for the propagation of light
in media with variable indices of refraction. Even Newton's fust law of
motion, which states that the straight line or shortest distance is the natural
motion of a body, showed nature's desire to economize, These examples
suggested that there might be some more general principle. The search for
such a principle was undertaken by Maupertuis.
Pierre-Louis Moreau de Maupertuis (1698-1759), while working with
the theory of light in 1744, propounded his famous Principle of Least Action
in a paper entitled "Accord des diffErentes lois de la nature qui avaient
jusqu'ici paru incompatibles." 18 He started from Fermat's Principle, but in
view of disagreements at that time as to whether the velocity of light was
proportional to the index of refraction as Descartes and Newton believed,
or inversely proportional as Fermat believed, Maupertuis abandoned least
time. In fact he did not believe that it was always correct.
Action, Maupertuis said, is the integral of the product of mass, velocity,
and distance traversed, and any changes in nature are such as to make the
action least. Maupertuis was somewhat vague because he failed to specify
the time interval over which the product of m, a, an,d J was to be taken and
because he assigned a different meaning to action in each ofthe applications
he made to optics and some problems of mechanics.
Though he had some physical examples to support his Principle,
Maupertuis advocated it also for theological reasons. The laws of behavior
of matter had to possess the perfection worthy of God's creation; and the
15. There are instances, as for example in the reflcction of light from a concave mirror,
where light takes the path requiring maximum time. This fact was known to Fermat and
was explicitly stated by William R. Hamilton.
16. M,in. de l'Acad. des Sci.., Paris, 1744,
582 THE cALcuLUs oF vARrATroNs rToG-IBoo
least action principle seemed to satisry this criterion because it showed that
nature was economical. Maupertuis proclaimed his principle to be a universal
law of nature and the fust scientific proof of the existence of God. Euler,
who had corresponded with Maupertuis on this subject between 1740 and
1744, agreed with Maupertuis that God must have constructed the universe
in accordance with some such basic principle and that the existence ofsuch
a principle evidenced the hand of God.
In the second appendix to his 1744 book Euler formulated the Principle
of Least Action as an exact dynamical theorem' He limited himself to the
motion of a single particle moving along plane curves. Moreover, he supposed
that the speed is dependent upon position or, in modern terms, that the
force is derivable from a potential. Whereas Maupertuis wrote
mtts : min.,
Euler wrote
alods:0,
by which he meant that the rate of change of the integral for a change in
the path must be zero. He also wrote that, since ds : tt dt,
oltPdt:o.
J
Just what Euler meant by the rate of change of the integral was still vague
here even though he applied the principle correctly in specific problems by
using his technique of the calculus of variations. At least he showed that
Maupertuis's action was least for motions along plane curves.
Euler went further than Maupertuis in believing that all natural
phenomena behave so as to maximize or minimize some function, so that
the basic physical principles should be expressed to the effect that some
function is maximized or minimized' In particular this should be true in
dynamics, which studies the motions of bodies propelled by forces. Euler
was not too Iiar from the truth.
Figure 24.5
(7) Ar:6r+|av+fiav+"'
where 5J indicates the integral of first degree terms in Ey and 6y', ELI
indicates the integral of the second degree terms, and so forth' Thus
v : Jxr
l*' U, sg + f,,. sg,) dx
6'J : I-"
{f,,(6v)' + 2f"",(sil':,/) + 1"."'(s!')'\ d*.
18. " Elementa Calculi Variationum," .Voui Comm. Acad, Sci' Pettop', l0' 1764, 5l-93,
pub. 1766 : O?cra, (l),25, 14l-76.
584 THE CALCULUS OF VARTATIONS I TOO-I8OO
that is, the order of the operations d and 6 can be interchanged. This is
correct, though the reason was not clear to Lagrange's contemporaries
and Euler clarified it later. [It is easily seen to be correct ilwe write y + 3y
asg t n(x), where n(.r) is the variation ofy(r). Then 69 : y + n(x) - g :
z(x) and 6y' : C' + n'(x) - S' : n'(x). But z'(x) : dn(x)ldx : d(iy)ldx.l
Using (B) Lagrange writes the first variation as
(10) r"-!h(-f,,):o.
Thus Lagrange arrived at the same ordinary differential equation for
y(*) that Euler had obtained. Lagrange's method of deriving (10) (except
for his use of differentials) and even his notation are used today. Of course,
(10) is a necessary condition on y(x) but not sumcient.
In this paper of 1760/61 Lagrange also deduced for the first time
end-conditions that must be satisfied by a minimizing curve for problems
with variable endpoints. He found the transversality conditions that must
hold at the intersections of the minimizing curve with the fixed curves or
19. The fact that the coefficient ofDy must be 0 was intuitively accepted or incorrectly
proven by every writer on the subject for one hundred years after Lagrange's work.
Even Cauchy's proof was i4adequate. The first correct proof was given by Pierre FridCric
Sarrus (1798-1861) (M,in. ditcrs Sauans, (2), 10, 1848, l-128). The result is now known
as the fundamental lemma of the calculus ofvariations.
TI{E METIIODOLOGY OF LAGRANGE
t:fQ)
(1 1) t : !1ru,u, z, p, q) dx dy
wherein z is a function of x and y, p : 0 z I 0x, and q : 0 z | 0y. The integration
is over some area in the *y-plane' The problem, then, is to find the function
z(x, y) that maximizes or minimizes the value of.L One of the most important
problems that comes under this class of double integrals is to find the surliace
of least area among all surfaces whose boundary is fixed in some way. Thus
one might be given two closed non-intersecting curves in space and seek
the surface of minimum area bounded by these two curves' As a special
case of the minimal surface problem, the two curves can be circles parallel
to the y z-plane (Fig.2a.7) and with centers on the x-axis. Then the possible
minimal surfaces are ngcessarily surfaces of revolution bounded by the two
curves and the problem is to find the surface of revolution of minimum area.
This last problem, as we noted above, had already been solved by Euler in
1744. However, this special case ofa surface of revolution can be treated
by the theory applicable to the integral (11).
By a method similar to the one he had used for the simpler integral (6),
Lagrange obtained the differential equation that the function z(*,y)
minimizing (ll) must satisfy. If we use the common notation
0z 0z 022 022 022
s, @:
T*: P, q: l, A*,:',
-0x0y: "
20. Misc. Taur., 4, 1766169 : CEuures,2, 3743.
586 THE cALcuLUs oF vARrATroNs r Too-r8oo
This equation was given by Lagrange in his 1760/61 paper (though not
quite in this form) and is a major analytical result in the theory of minimal
surfaces. Geometrically, as Meusnier pointed out in a paper of 1785,21 this
partial differential equation expresses the fact that at any point on the
minimizing surface the principal radii of curvature are equal and opposite
or that the mean curvature, that is, the average of the principal curvatures,
is zero.
fn a hter paper (177O)22 r.agrange also considered single and multiple
integrals in which higher derivatives than the first ones appear in the inte-
grand. This topic has been well developed since Lagrange's time and is
now standard material in the calculus of variations, However, since the
principles are not basically different from the cases already considered, we
shall not go into this extension of the subject. The content of Lagrange's
papen on the calculus ofvariations is incorporated in his Micaniquz annlgtique.
The calculus ofvariations was not well understood by the contemporaries
of Lagrange and Euler. Euler explained Lagrange's method in numerous
writings and used it to re-prove a number of old results. Though he realized
that the calculus of variations was a new branch or technique, which he
says is symbolized by the new operational symbol E, he, like Lagrange,
tried to base the logic of the calculus of variations on the ordinary calculus.
Euler's idea28 was to introduce a parameter I such that the curves of the
family considered in a variations problem would vary with l, that is, for
each , of some range there would be a curvey,(r). Then, says Euler, whereas
dy : (dgldx) dx, 6y : (dyldt) dt. Hence the variation Ey is expressed by a
partial differentiation with respect to t. He then formulated the technique
21. Mim. dio*s Saaans, 10, 1785,477-45.
22. Nouo, Min. de l'Acad, de Berlin, l77O : (Euarc\ 3, 157-46.
23. Noti Comm. Acad. Sci. Parop., 16, 1771,35-70, pub. 1772 = Olcra, (l), 25, 208-35.
LAGRANGE AND LEAST ACTION 587
(ls) r:f,n@'+!i2+22).
Lagrange also supposed that the forces acting to cause the motion are
derivable from a potential function V, which depends otr x, y, and z, An
additional condition, then, is that 7 + Z: const.r that is, the total energy
is constant. Lagrange's action is
(16)
r;,"
and his Principle of Least Action states that this action must be a minimum
or a maximum, that is
(17) t
f' rat: o.
24, Mln, de I'Acad. des Sci. de St. Pctus., +, l8ll, 1842, pub. l8l3 : Olcra, (l),25'
293-313.
25. Mhn. de l'Acad. dx Sci. de St. Pcters., 8, l8l7/18, 17-45, pub. 1822 : Olna, (l)' 25,
314-42.
588 THE cALcuLUs oF vARTATIoNs I Too-IBoo
(lB)
*,(#)*{*:o
and the two corresponding equations with y and z. These equations are the
equivalent of Newton's second law of motion.
Lagrange made the further step of introducing what are now called
generalized coordinates. That is, in place of rectangular coordinates one
may use polar coordinates or, in fact, any set of coordinates qL, q2, qs,
which are needed to fix the positioh of the particle (or extended mass).
Then
: x(qb q2, qs)
tc
! : !(4b 82, Qs)
z : z(Qb 42, Q;,
where the 4r are now functions of t. In terms of the new coordinates, 7
becomes a function of the 4, and {, while Z becomes a function of the q,.
Then the equations (lB) become
(le)
*,(#\ - T,* Yq,: o, i : 1,2,3'
This is a set of 3 simultaneous second order ordinary differential equations
in the 91. They are the Euler (characteristic) equations for the action integral.
Il z coordinates are needed to fix the position of the moving object, for
example, 2 particles require 6 coordinates, then equations (19) are replaced
by z equations.26
These generalized coordinates need not have either geometrical or
physical significance. One speaksofthem today as coordinates in a configura-
tion space and then the qr(t) are the equations of a path in configuration
space. Thus Lagrange recognized that the variational principle, namely,
that the action must be a minimum or maximum, can be used with any set
26. Lagrange is explicit that the number ofvariables in Iandin Zare the number required
to determine the position of the mechanical system. Thus if there are ly' independent
particles and each requires three coordinates (*r, !r, zr) to describe its path in sPace, then
3N coordinates are required. In this case there will be 3iy' coordinates 41, 3-l[ equations
relating the Cartesian coordinates to the 4, and 3iy' equations (19). The number of
independent coordinates or the number ofdegrees offreedom, as physicists put it, depends
on the system being treated and on the constraints in the motion.
THE SECOND VARTATTON 589
27. Hist. de I'Acad. des Sci., Paris, 1786' 7-37, pub. 1788.
5go THE CALCULUS OF VARTATTONS r TOO-rBOO
in the ordinary calculus the sign ofJ'@) at a value ofx for which;['(r) : 0
determines whether/(*) has a maximum or a minimum, Legendre considered
the second variation 62J, recast its form, and concluded that J is a maximum
for the curve y(x) which satisfies Euler's equation and passes through
(*0, yo) and (xr,yr) provided. that fo,r, < 0 at each x along y(x). Likewise,
J is a minimum for a y(r) satisfying the first two conditions provided that
Ju,r, > 0 at each x along y(x). Legendre then extended this result to more
general integrals than (6). However, Legendre realized in 1787 that the
condition on;f,,r, was just a necessary condition on y(r) in order that it
be a maximizing or minimizing curve. The problem of finding sufficient
conditions that a curve y(*) maximize or minimize an integral such as (6)
was not solved in the eighteenth century.
Bi.bliograplry
Bernoulli, James: Opcra,2 vob., 1744, reprint by Birkhaiiser, 1968.
Bernoulli, John: OPcra Omnia,,4 vols., 1742, Georg Olms (reprint), 1968.
Bliss, Gilbert A.z Thc Cahulus of Vaiations, Open Court, 1925.
"The Evolution of Problems in the Calculus of Variationsr" Amer. Math.
Monthls, 43, 1936, 59M09.
Cantor, Moritz: Voilesungcn ilber Geschichtc der Mathematik, B. G. Teubner, 1898
zrd 1924, Vol. 3, Chap. ll7, znd Vol. 4, 1066-74.
Caratheodory, C. : Introduction to Series ( I ), Vol. 24 of Euler's Opera Omnia,
viii-lxii, Orell Fiissli, 1952. AIso in C. Caratheodory: Gesammeile mathematische
ScbiJtea, C. H. Beck, 1957, Vol. 5, pp. 107-74.
Darboux, Gaston: LcAons sw la tMoric ginbale des surJaces,2nd ed., Gauthier-Villars,
1914, Vol. l, Book III, Chaps. l-2.
Euler, Leonhard z Opcra Omnia, (l), Vols. 24-25, Orcll Fiissli, 1952.
Hofmann, Joseph E. : "Uber Jakob Bernoullis Beitriige zur Infinitesimalmathe-
lmatik," L'Enscignemat Mathimatiqw, (2), 2, 1956, 6l-17l; published
separately by Institut de mathdmatiques, Geneva, 1957.
Huke, Aline: An Histoical and Citical Study of tlu Fundatuntal Itmma in the Calculus of
Vaiations, University of Chicago Contributions to the Calculus of Variations,
University of Chicago Press, Vol, l, 1930, pp. 45-160.
Lagrange, Joseph-Louis: CEuares de Lagrangc, Gauthier-Villars, 1867-69, relevant
papers in Vols. l-3.
Micanique analgti.que, 2 vols., 4th ed., Gauthier-Villars, 1889.
Lecat, Maurice: Bibliographb du calcul des tariatians depuis hs oiginos.iusqu'i 1850,
Gand, 1916.
Montucla, f. F.:, Histoire des mathimatiquzs, 1802, Albert Blanchard (reprint),
1960, Vol. 3, 643-58.
Porter, Thomas Isaac: "A History of the Classical Isoperimetric Problem,"
Uniocrsitg of Chicago Conlribulians lo lhe Calculus of Varialioru, University of
Chicago Press, 1933, Vol.2, pp.475-517.
Smith, David E.: A Source Book in Mathcmatdcs, Dover (reprint), 1959, pp. 6,14-55.
BTBLTOCRAPITY 59r
Struih D.!.: A Souru Book i.n Mathcnatics, 1200-1800, Harvard University Press,
1969, pp. 391-413.
Todhunter, Isaac: A History of tlw Cahulas of Variations during tlu Ninetcenth Century,
1861, Chelsea (reprint), 1962.
Woodhouse, Robert: A History qf the Calculus of Vaiations i.n the Eighteenth Ccntury,
1810, Chelsea (repdnt), 1964.
25
Algebra in the Eighteenth Century
592
STATUS OF TIIE NU}iiBER SYSTEM s93
algebra or were again discarded from it: for if this were done, there is
good reason to imagine, the objections which many learned and in-
genious men now make to algebraic computations, as being obscure
and perplexed with almost unintelligible notions, would be thereby
removed; it being certain that Algebra, or universal arithmetic, is, in
its own nature, a science no less simple, clear, and capable of demon-
stration, than geometry.
Certainly negative numbers were not really well understood until
modern times. Euler, in the latter half of the eighteenth century, still believed
that negative numbers were greater than m. He also argued that ( - I ) .
(-l) : *l because the product must be +l or -l and since l.(-l):
-1, then (-l).(-l) : +1. Carnot, the noted French geometer, thought
the use of negative numbers led to erroneous conclusions. As late as 1831
Augustus De Morgan (1806-71), professor of mathematics at University
College, London, and a famous mathematical logician and contributor to
algebra, in his On the Study and Difuulties oJ Mathetnatias, said,"The imaginary
expression | - a and, the negative expression - 6 have this resemblance, that
either of them occurring as the solution of a problem indicates some in-
consistency or absurdity. As far as real meaning is concerned, both are
equally imaginary, since 0 - a is as inconceivable as \/=."
De Morgan illustrated this by means of a problem. A father is 56; his
son is 29. When will the father be twice as old as the son ? He solves 56 * x :
2(29 + x) and obtains x : -2. Thus the result, he says, is absurd. But, he
continues, if we change tc to -x and solve 56 - x : 2(29 - x), we get x : 2.
He concludes that we phrased the original problem wrongly and thus were
led to the unacceptable negative answer. De Morgan insisted that it was
absurd to consider numbers less than zero.
Though nothing was done in the eighteenth century to clariry the
concept of irrational numbers, some progress was made in this subject. In
1737 Euler showed, substantially, that e and a2 are irrational and Lambert
showed that z is irrational (Chap. 20, sec. 6). The work on the irrationality
ofzlwas motivated largely by the desire to solve the problem ofsquaring the
circle. Legendre's conjecture that r rnay not be a root of an algebraic
equation with rational coefficients led to a distinction between types of
irrationals. Any root, real or complex, of any algebraic (polynomial) equation
with rational coefficients is called an algebraic number. Thus the roots of
aoxn a arxn-L +... + aa-tx I ao : 0,
where the a, are rational numbers, are called algebraic numbers, Conse-
quently, every rational number and some irrationals are algebraic numbers,
for any rational number c is a root of x - c: 0, and l/i i" u root of
x2 - 2 :0. Those numbers that are not algebraic are called transcendental
because, as Euler put it, "they transcend the power of algebraic methods."
594 ALGEBRA IN THE EIGHTEENTH CENTURY
Because all concgivable numbers are either greater than zero or less
than 0 or equal to 0, then it is clear that the square roots ofnegative
numbers cannot be included among the possible numbers [real num-
bers]. Corsequently we must say that these are impossible numbers.
ADd this circumstance leads us to the concept of such numbers, which
by their nature are impossible, and ordinarily are called imaginary or
fancied numbers, because they exist only in the imagination.
By the time De Morgan wrote the above lines, the concePts of complex
numbers and complex functions were well on the way to clarification. But
the diffusion of the new knowledge was slow. Certainly throughout the
eighteenth and the first half of the nineteenth centuries the meaning of
complex numbers was debated hotly. All the arguments of John Bernoulli,
d'Alembert, and Euler were continually rehashed. Even twentieth-century
textbooks on trigonometry supplemented presentations that employed
complex numbers by proofs not involving V=I.
We should note here another point whose significance is almost in
inverse proportion to the conciseness with which it can be stated: In the
THE THEORY OF EqUATTONS 597
eighteenth century no one worried about the logic of the real or complex
number systems. What Euclid had done in Book V of the E/aments to establish
the properties of incommensurable magnitudes was disregarded. That this
exposition had been tied to geometry, whereas by now arithmetic and alge-
bra were independent of geometry, accounts in part for this disregard.
Moreover this logical development, even if suitably modified to free it ol
geometry, could not establish the logical foundations of negative and complex
numbers; and this fact too may have caused mathematicians to desist from
any attempt to found the number system rigorously. Finally, the century
was concerned primarily to use mathematics in science, and since the rules
of operation were intuitively secure at least for real numbers no one really
worried about the foundations. Typical is the statement of d'Alembert in his
article on negative numbers in t}re Eruyclopidie. The article is not at all clear
and d'Alembert concludes that "the algebraic rules of operation with nega-
tive numbers are generally admitted by everyone and acknowledged as
exact, whatever idea we may have about these quantities." The various
types of numbers, never properly introduced to the world, nevertheless
gained a firmer place in the eighteenth-century mathematical community.
out that complex roots occur in conjugate pairs, so that the product
of *- (a+bt/-t1 and x- (a-bt/--t!, wherein a+bl-l and
a - bli are a conjugate pair, gives a quadratic expression with real
coefficients. Euler then showed that this was true for Bernoulli's example' But
Goldbach, too, rejected the idea that every polynomial with real coefficients
can be factored into real factors and gave the example xa + 72x - 2O.
Euler then showed Goldbach that the latter had made a mistake and that
he (Euler) had proved his theorem for polynomials up to the sixth degree.
However, Goldbach was not convinced, because Euler did not succeed in
giving a general proof of his assertion.
The kernel ofthe problem of factoring a real polynomial into linear and
quadratic factors with real coefficients was to prove that every such poly-
nomiral had at least one real or complex root. The proof of this fact, called
the fundamental theorem of algebra, became a major goal.
Proofs offered by d'Alembert and Euler were incomplete. ln 17721
Lagrange, in a long and detailed argument, " completed " Euler's proof.
But Lagrange, Iike Euler and his contemporaries, applied freely the ordinary
properties of numbers to what were supposedly the roots without establishing
that the roots must at worst be complex numbers. Since the nature of the
roots was unknown, the proof was actually incomplete.
The first substantial proof of the fundamental theorem, though not
rigorous by modern standards, was given by Gauss in his doctoral thesis
of 1799 at Helmstddt.2 He criticized the work of d'Alembert, Euler' and
Lagrange and then gave his own proof. Gauss's method was not to calculate
a root but to demonstrate its existence. He points out that the complex roots
a * ib of P (x + ig) : 0 correspond to points (a, b) of the plane, and if
P(x + iy) : u(r, y) + ia(x, y) then (a, D) must be an intersection ol the
cnrves z : 0 and 7 : 0. By a qualitative study ofthe curves he shows that a
continuous arc of one joins the points of two distinct regions separated by
the other. Then the curvc t,l : 0 must cut the curve u : 0. The argument
was highly original. However, he depended on the graphs of these curves,
which were somewhat complicated, to show that they must cross' In this
same paper Gauss proved that the zth degree polynomial can be expressed as
a product of linear and quadratic factors with real coefficients.
Gauss gave three more proofs of the theorem. In the second proofg
he dispensed with geometrical arguments. In this proof he also showed that
the product ofthe differences ofeach two roots (which we, following Sylvester,
call the discriminant) can be expressed as a linear combination of the poly-
nomial and its derivative, so that a necessary and sufficient condition that
the polynomial and its derivative have a common root is that the discriminant
l. Noua. Mim. de I'Acad. de Balin, 1772' 222 ff. : CEuares, S, +79-516,
2. Werke,3, l-30; also reproduced in Euler, Opeta, (l)' 6, 15l-69.
3. Comm. Soc. Gott.,3, l8l4/15' lO7-42 : Werke,3,33-56,
THE THEORY OF EqUATIONS 599
vanish. However, this second proof assumed that a polynomial cannot change
signs at tu,o different values ofx without vanishing in between. The proof of
this fact lay beyond the rigor of the time'
The third proofa used in effect what we now call the Cauchy integral
theorem (Chap. 27, sec. 4).5 The fourth proofo is a variation of the first as
far as method is concerned. However, in this proof Gauss uses complex
numbers more freely because, he says, they are now common knowledge.
It is worth noting that the theorem was, in many proofs, not proved in all
generality. Gauss's first three proofs and later proofs by Cauchy, Jacobi,
and Abel assumed that the (literal) coefficients represented real numbers,
whereas the full theorem includes the case of complex coefficients. Gauss's
fourth proof did allow the coefficients of the polynomial to be complex
numbers.
Gauss's approach to the fundamental theorem of algebra inaugurated
a new approach to the entire question of mathematical existence. The Greeks
had wisely recognized that the existence of mathematical entities must be
established before theorems about them can be entertained. Their criterion
of existence was constructibility. In the more explicit formal work of the
succeeding centuries, existence was established by actually obtaining or
exhibiting the quantity in question. For example, the existence of the
solutions of a quadratic equation is established by exhibiting quantities that
satisfy the equation. But in the case of equations ofdegree higher than four,
this method is not available. Of course a proof of existence such as Gauss's
may be of no help at all in computing the object whose existence is being
established.
While the work that ultimately showed that every polynomial equation
with real coemcients has at least one root was under way, the mathematicians
also pushed hard to solve equations of degree higher than four by algebraic
processes. Leibniz and his friend Tschirnhausen were the first to make
serious efforts. LeibnizT reconsidered the irreducible case of the third degree
equation and convinced himself that one could not avoid the use of complex
numbers to solve this type. He then tackled the solution of the fifth degree
equation but without success. Tschirnhausens thought he had solved this
problem by transforming the given equation into a new one by means of a
transform ! : P(*), where P(x) is a suitable fourth degree polynomial.
4. Comm. Soc. Gott., 3, 1816 : Werkc, 3, 5944.
5. For a discussion of Gauss's third proof sce M. Bocher, " Gauss's Third Proof of the
Fundamental Theorem of Algcbra," Amr. Math. Soc., Bull., l, 1895' 205-9. A translation
of the third proof can be found in H. MeschkowsLi, Ways oJ Thoughl oJ Grcat Maiundiciotrs,
Holden-Day, 1964.
6, Abhand. dtr Ges. der Wist. zu Gott., 4, 1848150,3-34 : Werkc,3,73-102.
7. Der Brbfuechsel oon Gottfried Willulm Leibniz mit Matlumatikcm, Georg Olms (reprint),
1961, Vol. l, 547--64.
8. Acta Erud.,2, 1683,204-07.
6oo ALGEBRA IN THE EIGIITEENTH CENTURY
This transformation eliminated all but the .r5 and constant terms of the
equation. But Leibniz showed that to determine the coefficients of P(r) one
had to solve equations of degree higher than five, and so the method was
useless.
For a while the problem of solving the zth degree equation centered on
the special case ta - I :0, called the binomial equation. Cotes and De
Moivre showed, through the use of complex numbers, that the solution ofthis
problem amounts to the division of the circumference of a circle into z equal
parts. To obtain the roots by radicals (trigonometric solutions are not
necessarily algebraic) it is sufficient to consider the case ofr an odd prime,
because if n : ?m, wherep is a prime, then one can consider (r'il)e - 1 : 0.
Ifthis equation can be solved for x', then x' - A can be considered where I
is any of the roots of the preceding equation. Alexandre-Th€ophile Vander-
monde (1735-96) affirmed in a paper of l77ls that every equation ol the
form xo - I : O, where z is a prime, is solvable by radicals. However,
Vandermonde merely verified that this is so for values of n up to I l' The
decisive work on binomial equations was done by Gauss (Chap. 31, sec' 2)'
The major effort toward solving equations of degree higher than four
concentrated on the general equation, and toward this end some subsidiary
work on symmetric functions proved important. The expression r1x2 *
x2ro + xsrr is a symmetric function of xy x2, and xs because replacement
throughout of any x, by an *, and *1 by .r, leaves the entire expression un-
altered. The interest in symmetric functions arose when the seventeenth-
century algebraists noted and Newton proved that the various sums of the
products of the roots of a polynomial equation can be expressed in terms
of the coefficients. For example, for z : 3, the sum of the products taken
two at a time,
ataz+0zas+0sal
is an elementary symmetric function; and if the equation is written as
the sum equals rr. The progress made by Vandermonde in the l77l paper
was to show that any symmetric function of the roots can be expressed in
terms of the coefficients ol the equation.
The outstanding work of the eighteenth century on the problem of the
solution of equations by radicals, after efforts by many men including Euler,lo
was by Vandermonde in the l77l paper and Lagrange, in his massive paper
11 Vandermonde's
" Rdflexions sur la r€solution alg6brique des 6quations."
9. Hist. de l'Acad, dcs Sci., Pari"s, 1771,365-416, pub. 1774.
lO. Comm, Acad. Sci. Petrop.,6, 113213,216-31, pub' 1738 : Opeta, (l)' 6' l-19 and Noai
Comn. Acad. Sci. Pebop.,9' 1762163' 70-98, pub. 1764 : Olerq (l)' 6, 170-96'
ll. Noua. Mim. de l'Acatl. de Beilin, 1770, 134-215, pub' 1772 ar'd 1771, 138-254, pub'
1773 = CEuures, 3,205-421,
THE THEORY OF EqUATTONS 6or
ideas are similar but not so extensive nor so clear. We shall therefore Present
Lagrange's version. Lagrange set himself the task of analyzing the methods
ofsolving third and fourth degree equations, to see why these worked and to
see what clue these methods might furnish for solving higher-degree equations.
For the third degree equation
(r) *3+nx+P:O
Lagrange noted that if one introduces the transformation (Chap. 13, sec. 4)
(2) Y:s - (nl3s),
when x,, .rr, and rs are taken in particular orders. Examination of this
expression enables us to perceive two properties of the reduced equation in
y. First, the roots xr, xr, and x, in the expression for g are not one fixed choice
for x1, x2,, and x., and the expression is, so to speak, ambiguous. Thus any of
the three x-values can be .r1, any of the other two xr, etc. But there are 3 !
permutations of the .r's. Hence there are six values for g, and y should satisfy
a sixth degree equation. Thus the degree of the reduced equation is deter-
mined by the number of permutations of the roots in the proposed equation.
In the second place, the relation (5) also shows why one can reduce the
sixth degree reduced equation to the second degree equation. For among the
six permutations, three (including the identity) come from interchanging
all the *1 and three from interchanging just two and keeping one fixed.
But then in view of the value of <.r, the six values ofy that result are related by
(6) gr : (,)2Y2 : @Ysi Y4:o2Y5:u,YB
and by cubing,
ci:cZ:Y8, yl:yE:s3.
Another way of putting this result is that the function
(xL+@xz+etzxs)a
can take on only two values under all six permutations of x1, .rr, and *., and
this is why the equation that y satisfies proves to be a quadratic in y3. More-
over the coefficients of the sixth degree equatiot that y satisfies are rational
functions of the coefficients of the original cubic.
In the case of the general fourth degree equation in x, Lagrange con-
siders
y:xfiz+xz,t4,
This function of the four roots takes on only three distinct values under all
24 possible permutations of the four roots. Hence there should be a third
degree equation that g satisfies and the coefficients of this equation should
be rational functions of those of the original equation. These statements do
apply to the fourth degree equation.
Lagrange then takes up the general ath degree equation
(7) xn * at*n-L +. . . + an-r* * a,,: O'
*':-b+(:'-x')'2
Lagrange's proof of this proposition also shows how to express { as a rational
function of f.
Lagrange's second proposition states: Ifa function 6(tr, ttz,.,., rn)
of the roots of the general equation does not admit all the permutations
admitted by a function {(xr, xr, . . . , x,) but takes on under the Permutations
that f admits r different values, then { is a root ofan equation of degree r
whose coefficients are rational functions of I and ofthe given general equa-
tion of degree z. This equation ofdegree r can be constructed. Thus *, - x,
does not admit all the permutations .rr + x, admits but takes on the two
values ,rr - x2 and xz - xt. Then *, - x2 is a root of a second degree
equation whose coefficients are rational functions of x, * x2 and of 6 and c.
As a matter of fact, x, - x2 is a root of
t2_(b2_4c):g
because 62 - 4ac : (*, - *r)'. With the value of this root, namely \/F4,
we can find x1 by means ofthe preceding equation for xr.
Likewise for the equation xs + pt + 4 : 0 the expression (the /)
(xL+.Dx2+a2xs)3,
where <,r : ( - I + i\/3) 12, takes on two values under the six possible
permutations of the roots, whereas xr + x2 * x3 (the ry') admits all six
permutations. I[ the two values are denoted by ,4 and B, one can show
that I and B are roots of a second degree equation whose coefficients are
6o4 ALOEBRA IN THE EIGHTEENTII CENTURY
rational inp and g (since trt + tb * r. : 91. If we solve the second degree
equation and the roots are A and B, we can then find xr, x2, and .x" from
xL * x. :6
+ x2
x1 4 ax2 + t't2t": ilV
x1 * a2*2 * <ox": f/-3.
For the fourth degree equation, Lagrange started with the function
(8) xfi2 * nsx4,
12. Lagrange used the word "resolvent" for the special forms ofthe {1 functions and not
for the equations which the {1 satisfy. Thus for the cubic equation, xr + .Dxz * <rr2xs is
one form of Lagrange's resolvent.
THE THEORY OF EqUATIONS 6o5
Lagrange's method worked for the general second, third, and fourth
degree equations. He tried to solve the fifth degree equation in this way but
found the work so difficult that he abandoned it. Whereas for the cubic he
had but to solve an equation of the second degree, for the quintic he had to
solve a sixth degree equation. Lagrange sought in vain to find a resolving
function (in his sense of the term) that would satisfy an equation of degree
less than five. Ilowever his work did not give any criterion for picking {,'s
that would satisfy algebraically solvable equations. Also his method applied
only to the general equation because his two basic propositions assume that
the roots are independent.
Lagrange was drawn to the conclusion that the solution of the general
higher-degree equation (for z > 4) by algebraic operations was likely to be
impossible. (For special equations of higher degree he offered little.) He
decided that either the problem was beyond human capacities or the nature
of the expressions for the roots must be different from all those thus far
known. Gauss, too, in his Disquisitiones of 1801, declared that the problem
could not be solved.
Lagrange's method, despite its lack of success, does give insight into the
reason for the successes when z < 4 and failures when z > 4; this insight
was capitalized on by Abel and Galois (Chap. 3l). Moreover, Lagrange's
idea that one must consider the number of values that a rational function
takes on when its variables are permuted led to the theory of permutation
or substitution groups. In fact, he had in effect the theorem that the order o[
a subgroup (the number of elements) must be a divisor of the order of the
group. In Lagrange's work, which preceded any work on grouP theory, this
result takes the form that the number r ofvalues that /, takes on is a divisor
ofn!.
Influenced by Lagrange, Paolo Ruffini (1765-1822), a mathematician,
doctor, politician, and an ardent disciple of Lagrange, made several attempts
during the years 1799 to l8l3 to prove that the general equation of degree
higher than the fourth could not be solved algebraically. ln his Teoria
generale delle equazioni,Lo Ruffini succeeded in proving by Lagrange's own
method that no resolving function (in Lagrange's sense) existed that would
satisly an equation of degree less than five. In fact he demonstrated that no
rational function of n elements existed that took on 3 or 4 values under
permutations of the z elements when z > 4. Then he boldly undertook to
prove, in his Riflessioni intorno alla soluzione dtlle equazioni algebraiclu gmeruli,la
that the algebraic solution of the general equation of degree z > 4 was
impossible. This effort was not conclusive, though at first Ruffini believed it
was correct. Ruffini used but did not prove the auxiliary theorem, now
known as Abel's theorem, that if an equation is solvable by radicals, the
!3. 1799 : Olere Mat., 1, l-324.
14. l8l3 : Olere Mat.,2, 155-268.
606 ALGEBRA IN THE BIGIITEENTH cENTURY
expressions for the roots can be given such a form that the radicals in them
are rational functions with rational coefficients of the roots of the given
equation and the roots of unity.
is divisible by z.
22. Jour. fiir Math., 15, 1836, l0l-24 : Gesam. We*e,3,297-320.
23. Jour. fiir Math.,22, 1841, 178-83.
24. Comm. Acad. Sci. Petrop.,8, 1736, 14l-46, pub. l74l : Olera, (l),2,33-37.
25. Noai Comm. Acad. Sci. Petrop.,8, 176011,74-104, pub. 1763 : O?erq (l), 3,531-55.
THE THEoRY oF NUMBERs 6o9
34. Comm. Acad. Sci. Petrop,, 10, 1738, 125-+6, pub. 1747 : Opera, (l), 2, 38-59; also in
Algcbra (1770), Part II, Ch. 13, arts, 202-8 :
O?erq (l), l, +3H3.
35. NoaiComm. Acad. Sci. Petrop.,8, 1760/1, 105-28, pub. 1763 :Opera, (l),2,556-75.
36. "De numeris amicabilibus," Opuscula uarii argumenli,2, 1750,23-107 : O?era, (l),2,
86- 162.
37. "De numeris amicabilibus," Corznr. Atith.,2, 1849,627-36: Olera losluma, l, 1862,
85-100 = Opera, (t),5, 353-65.
38. Noxt. Mim. de I'Acad. de Berlin,2, 1771,125 ff., pub. 1773 :
(Eures, 3, 425-38.
39. Noui Comm. Acad. Sci. Petrop., 11,1765, 28-66, pub. 1767 :Ofiera, (l), 3, 73-l I l.
THE THEoRY oF NUMBERS 6ll
where the coefficients are integers, was also tackled. Euler gave incomplete
classes ofsolutions; then Lagrangea2 gave the complete solution. In the next
volume ol the Mimoiresa3 he gave a simpler proof.
The most original and perhaps the most consequential discovery of the
eighteenth century in the theory of numbers is the law of quadratic recipro-
city. It uses the notion of quadratic remainders or residues. In the language
introduced by Euler in the 1754/55 paper and adopted by Gauss, if there
exists an r such that x2 - p is divisible by q, then p is said to be a quadratic
residue of 4; if there is no such .r, p is said to be a quadratic nonresidue ofg.
Legendre (lB0B) invented a symbol that is now used to represent either
state of affairs. The symbol is (p/q) and means the following: For any number
p and any prime q,
Qldklil : (- l)(e-lxc-1'i{'
This means that if the exponent ol ( - I ) is even, p is a quadratic residue of 4
and g is a quadratic residue ofp or neither is a quadratic residue ofthe other.
When the exponent is odd, which occurs whenp and g are of the form 4k + 3,
one prime will be a quadratic residue of the other but not the second of the
first.
The history of this law is detailed. Euler in a paPer of l7B3a{ gave four'
theorems and a fifth summarizing theorem that states the law of quadratic
reciprocity very clearly. However, he did not prove these theorems. The
work on this paper dates from 1772 and incorporates even earlier work.
Kronecker observed in 1875 as that the statement of the law is actually
contained in a much earlier paper by Euler.a6 However, Euler's " proof"
was based on calculations. In l7B5 Legendre announced the law indepen-
dently, though he cites another paper by Euler, in the same volume of the
Opuscula, in his own paper on the subject. His proofa? was incomplete. In his
Thnrie dcs nombresEa he again stated the law and gave another prooL How-
ever, this one, too, was incomplete, because he assumed that there are an
infinite number of primes in certain arithmetic progressions. The desire to
find what is behind the law and to derive its many implications has been a
key theme in investigations since 1800 and has led to important discoveries,
some of which we shall consider in a later chapter.
The eighteenth-century work on the theory of numbers closes with
Legendre's classic Thiorie of 1798. Though it contains a number of interesting
results, in this domain as in others (such as the subject of elliptic integrals),
Legendre made no great innovations. One could reproach him for presenting
a collection ofpropositions from which general conceptions could have been
extracted but were not. This was done by his successors.
Bibliography
Cajori, Florian: " Historical note on the Graphical Representation of Imaginaries
Before the Time of Wessel," Amer, Math, Monthly, 19, 1912, 167-71.
Cantor, Moritz: Vorhsungen ilbcr Geschichtz der Malhematik, B. G. Teubner, 1898 and
1924;Johnson Reprint Corp., 1965, Vol. 3, Chap. 107; Vol.4, pp. 153-98.
Dickson, Leonard E.: History of tht Theory of Numbers, 3 vols., Chelsea (reprint),
I95 t.
" Fermat's Last Theorem," Annals of Math., 18, 1917, l6l-87.
Euler, Leonhard : Opera Omnia, ( I ), Vols. l-5, Orell Fiissli, l9l l-44.
Volktdndigc Anhitung zur Algebrit (1770) : Opera Ornnia, (l), l.
Fuss, Paul H. von, ed.: Corrcspondance
nathimalique et physique de quelques cilibres
giomitres du XVIIIime siicle,2 vols, (1843), Johnson Reprint Corp., 1967.
Gauss, Carl Friedrich: Werke, Kilnigliche Gesellschaft der Wissenschaften zu
Gdttingen, 1876, Vol. 3, pp.3-121.
Gerhardt, C. l.: Dcr Briefwechsel aon Gottfried Wilhelm Leibniz mit Mathematikern,
Mayer und Miiller, 1899; Georg Olms (reprint), 1962.
Heath, Thomas L.: Diophantus of Alexandria, 1910, Dover (reprint), 1964, pp. 267
380.
Jones, P, S. : " Complex Numbers: An Example of Recurring Themes in the
45. Werke,2,3-10.
46. Comtn. Acad. Sci. Petrop., 14, 174+146, l5l-81, pub. l75l = Olera, (l),2, 191-222.
+7, Hist. de I'Acad. des Sci., Paris, 1785,465-559, pub. 1788.
48. 1798,214-26;2nd ed., 1808, 198-207.
BTBLTocRAPHY 613
6r4
THE RrsE oF ANALYSn 615
took his work on excavation and fiIl and the design of windmill vanes as
seriously as any problem of differential geometry or differential equations.
J. F. (Jean-Etienne) Montucla (1725-99), in his Histoire des mathimatiques
(2nd ed., 1799-1S02) divided mathematics into two parts, the one "com-
prising those things that are pure and abstract, the other those that one
calls compound, or more ordinarily physico-mathematics." His second part
comprised fields that can be approached and treated mathematically, that
is, mechanics, optics, astronomy, military and civil architecture, insurance,
acoustics, and music. Under optics he included dioptrics and even optometry'
catoptrics, and perspective. Mechanics included dynamics and statics,
hydrodynamics and hydrostatics. Astronomy covered geography, theoretical
astronomy, spherical astronomy, gnomonics (e.g. sundials), chronology, and
navigation. Montucla also included astrology, the constructon of obser-
vatories, and the design of ships.
the distinction between values of* for which the series converged and values
for which it diverged did not seem to demand attention. And even though
they recognized that some series, such as I + 2 + 3 +. . ., had an infinite
sum, they preferred to try to give meaning to the sum rather than question
the summation.
Likewise the somewhat free use of complex numbers rested on the
confidence in symbols, Since second degree expressions, ar2 * bx * c,
could be expressed as a product of linear factors when the zeros were real,
it was equally clear that there should be linear factors when the zeros were
complex. The formal operations of the calculus, differentiation and anti-
diflerentiation, were extended to new functions despite the fact that the
mathematicians were conscious of their own lack of clarity about the ideas.
This reliance upon formalism blinded them somewhat. Thus the difficulty
they had in broadening their notion of function was caused by their adher-
ence to the conviction that functions must be expressible by formulas.
The eighteenth-century men were fully aware of the mathematical
requirement of proof. We have seen that Euler did try to justify his use of
divergent series and Lagrange, among others, did offer a foundation for the
calculus. But the few efforts to achieve rigor, significant because they show
that standards of rigor vary with the times, did not logicize the work of the
century; and the men almost willingly took the position that what cannot
be cured must be endured. They were so intoxicated with their physical
successes that most often they were indifferent to the missing rigor. Very
striking is the extreme confidence in the conclusions while the theory was so
ill-assured. Because the eighteenth-century mathematicians were willing to
plunge ahead so boldly without logical support, this period has been called
the heroic age in mathematics.
Perhaps because the few efforts to rigorize the calculus were unsuccessful
and additional questions o[ rigor raised by subsequent analytical work were
hopclessly beyond resolution, some mathematicians abandoned the effort
to secure it and, Iike the fox with the grapes, consciously derided the rigor
of the Greeks. Sylvestre-Frangois Lacroix (1765-1843), in the second edition
(1810-19) of his three-volume Traiti du calcul difuentful et du calcul intigral,
says, in the preface to Volume I (p. I I ), " Such subtleties as the Greeks
worried about we no longer need." The typical attitude of the century was:
Why go to the trouble of proving by abstruse reasoning things which one
never doubts in the first place, or of demonstrating what is more evident by
means of what is less evident ?
Even Euclidean geometry was criticized, on the ground that it offered
proofs where none were deemed to be needed. Clairaut said in his Eliments
de giomittic (1741),
that the sum of the sides of a triangle which is enclosed within another
is smaller than the sum of the sides of the enclosing triangle. This
geometer had to convince obstinate sophists who glory in rejecting the
most evident truths; so that geometry must, like logic, rely on formal
reasoning in order to rebut the quibblers. . . . But the tables have turned.
All reasoning concerned with what common sense knows in advance,
serves only to conceal the truth and to weary the reader and is today
disregarded.
6. A Glance Ahead
As we know, by the end of the eighteenth century the mathematicians had
created a number of new branches of mathematics. But the problems of
these branches had become extremely complicated and, with few exceptions,
no general methods had been devised to treat them. The mathematicians
began to feel blocked. Lagrange wrote to d'Alembert on September 21,
I 78 l, " It appears to me also that the mine [of mathematics] is already very
deep and that unless one discovers new veins it will be necessary sooner or
later to abandon it. Physics and chemistry now offer the most brilliant
riches and easier exploitation; also our century's taste aPPears to be entirely
in this direction and it is not impossible that the chairs of geometry in the
Academy will one day become what the chairs of Arabic presently are in
the universities." 2 Euler and d'Alembert agreed with Lagrange that mathe-
matics had almost exhausted its ideas, and they saw no new great minds on
the horizon. This fear was expressed even as early as 1754 by Diderot in
Thoughts on the Interpretation of Nature : " I dare say that in less than a century
we shall not have three great geometers [mathematicians] Ieft in Europe.
This science will very soon come to a standstill where the Bernoullis, Mauper-
tuis, Clairauts, Fontaines, d'Alemberts and Lagranges will have left it. . . .
We shall not go beyond this point."
Jean-Baptiste Delambre (1749-1822), who was permanent secretary of
the mathematics and physics section of the Institut de France, in a report
(Rapport historique sur le progris des sciences mathimatiqucs depuis 1789 ct sur leur
itat actuel, Paris, said, " It would be difficult and rash to analyze the
l8l0)
chances which the future offers to the advancement of mathematics; in
almost all its branches one is blocked by insurmountable difficulties;
perfection of detail seems to be the only thing which remains to be done. All
these difficulties appear to announce that the power of our analysis is
practically exhausted.. . . "
The wiser prediction was made in lTBl by Condorcet, who was im-
pressed by Monge's work.
...in spite of so many works often crowned by success, we are far
from having exhausted all the applications of analysis to geometry,
and instead of believing that we have approached the end where
these sciences must stop because they have reached the limit of the
forces ofthe human spirit, we ought to avow rather we are only at the
first steps of an immense career. These new [practical] applications,
independently of the utility which they may have in themselves, are
necessary to the progress of analysis in generall they give birth to
questions which one would not think to propose; they demand that
one create new methods. Technical processes are the children ofneed;
one can say the same for the methods of the most abstract sciences.
But we owe the latter to needs of a more noble kind, the need to dis-
cover the new truths or to know better the laws of nature,
Thus one sees in the sciences many brilliant theories which have
remained unapplied for a long time suddenly becoming the foundation
of most important applications, and likewise applications very simple
in appearance giving birth to ideas of the most abstract theories, for
which no one would have felt the need, and directing the work of
geometers [mathematicians] to these theories. . ..
Bibliography
Boutroux, Piete: L'Ideal scientifque d,es mathimaticiens, Libraire Felix Alcan, 1920'
Brunschvicg, L6on: Les Etales dr la philosophie mathimatiqw, Presses Universitaires de
France, 1947, Chaps. l0-12'
Hankins, Thomas L,: Jean d' Alembert : Science and Enlighlenment, Oxford University
Press, 1970.
Hille , Einar: " Mathematics and Mathematicians from Abel to Zermelo," Matlu'
matics Magazine, 26, 1953, 127 -46.
Montucla, J. F.: Histoire ths mathimatiques' 2nd ed., 4 vols., 1799-1802, Albert
Blanchard (reprint), 1960.
27
Functions of a Complex Variable
The shortest path between two truths in the real domain passes
through the complex domain. JACqUES HADAMARD
l. Introdtction
From the technical standpoint, the most original creation of the nineteenth
century was the theory of functions of a complex variable. The subject is
often referred to as the theory offunctions, though the abbreviated descrip-
tion implies more than is intended. This new branch of mathematics domi-
nated the nineteenth century almost as much as the direct extensions of the
calculus dominated the eighteenth century. The theory of functions, a most
fertile branch of mathematics, has been called the mathematical joy of the
century. It has also been acclaimed as one of the most harmonious theories
in the abstract sciences.
626
THE BEGTNNTNGS orcoMpLEx ruNcrloN THEoRY 627
These equations are now called the Cauchy-Riemann equations. The equa-
tions (2) say (Chap. 19, sec. 6) that q dx * p dg and, ! dx - q dy are exact
differentials of certain functions. Then the expressions (we shall use i for
y'=I, though this was done only occasionally by Euler and made common
practice by Gauss)
(5) | zP'1a,: v
where zisreal. He sets z : x* iy so that Zbecomes P + iQ' Then
(9) AM AN AN AM
-oy:
- ax' ay:-il'
Thus by substituting z: xi iy in Z (z), " one obtains two functions M and
lY which possess the remarkable property that 0Ml0y: -aNlAx and,
0Ml0x : ANIAy; P and Q have similar properties." Ilere Euler stresses
that M and .ly', the real and imaginary parts of a complex function, satisly
the Cauchy-Riemann equations. Howwer, his main point is to use the
integrals (B) to calculate (5), for P equals the original Z. To reduce the
integrals in (8) to integrals of functions of one variable, Euler replaces
z:x* jy in (5) by z: r(cos d + isin d) and keeps d constant. This
amounts to integrating along a ray through the origin ofthe complex plane.
He then uses his method to evaluate some integrals.
Laplace too used complex functions to evaluate integrals. In a series
of papers that start in l7B2 and end with his famous Thiorie analgtique des
probabilitCs (l8l2), he passes from real to complex integrals, much as Euler
did, to evaluate the real integrals. Laplace claimed priority because
Euler's papers were published later than his own. However, even the
papers of 1793 and 1797 mentioned above were read to the St. Petersburg
Academy in March of 1777 . Incidentally, in this work Laplace introduced
what we now call the Laplace transform method of solving differential
equations.
The work of Euler, d'Alembert, and Laplace constituted significant
progress in the theory of functions. However, there is an essential limitation
in their work. They depended upon separating the real and imaginary
parts of;f(* * ig) to carry out their analytical work. The complex function
was not really the basic entity. It is clear that these men were still very
uneasy about the use of complex functions. Laplace, in his IBl2 book,
remarks, " This transition from real to imaginary can be regarded as a
heuristic method, which is like the method of induction long used by
mathematicians. However if one uses the method with great care and
restraint, one will always be able to prove the results obtained." He does
emphasize that the results must be verified.
J)
Figure 27.1
complex numbers and of the algebraic operations with these numbers. That
many men-Cotes, De Moivre, Euler, and Vandermonde-really thought
of complex numhers as points in the plane follows from the fact that all, in
attempting to solve x" - | : 0, thought of the solutions
fln
"o"2kn
* i
"in2ko
as the vertices of a regular polygon. Euler, for example, replaced x and y,
which geometrically he visualized as a point in the coordinate plane, by
x * ig and then represented the latter by r(cos 0 + i sin 0), which in turn
he plots as polar coordinates r and 0. Hence one could say that the plotting
of complex numbers as coordinates of points in the plane was known by
1800. However, decisive identification of the two was not made, nor did the
algebraic operations with complex numbers have as yet any geometrical
meaning. Also missing was the idea of plotting the values of a complex
function u * io of x f zy as points in another plane.
ln 1797 the self-taught Norwegian-born surveyor Caspar Wessel
( 1745-l B I S) wrote a paper entitled " On the Analytic Representation of
a
-l I
Figure 27.2
such that Ort is to OQ as OP is to the real unit, and the angle made by OR
and the r-axis is the sum of the angles made by OP and OQ. Clearly Wessel
thought in terms of vectors rather than associating complex numbers with
points in the plane. He applies his geometric representation of vectors to
problems of geometry and trigonometry. Despite its great merit, Wessel's
paper went unnoticed until 1897, when it was republished in a French
translation.
A somewhat different geometric interPretation of complex numbers
was given by a Swiss, Jean-Robert Argand (1768-1822). Argand, who was
also self-taught and a bookkeeper, published a small book, Essai sur une
manihc de rcpriscntcr hs quantitis imaginaires dans lcs constructions giomitriques
(1806).3 He observes that negative numbers are an extension of positive
numbers that results from combining direction with magnitude. He then
asks, Can we extend the real number system by adding some new concePt ?
Let us consider the sequence 1,.r, - 1. Can we find an operation that turns
I into *, and then repeated on r turns x into - I ? If we rotate OP (Fig.27.2)
counterclockwise about O through 90'and then repeat this rotation, we do
go from P to Q by repeating an operation twice. But, Argand notes, this is
precisely what happens if we multiply I by t/ -t and then multiply this
product by {=1 ; that is, we obtain - I . Hence we can think of VJ as a
rotation through 90o counterclockwise, say, and - l1l as a clockwise
rotation through 90o.
To utilize this operational meaning of complex numbers, Argand
decided that a typical line segment OB (Fig.27.3) emanating from the
origin, which he calls a directed line, should be represented as r(cos c f I
sin c), where r is the length. He also regarded the complex number a * Di
as symbolizing the geometrical combination OB of a and Di. Argand, like
Wessel, showed how complex numbers can be added and multiplied geo-
merically and applied these geometrical ideas to Prove theorems of trigo-
2. A number of papers on Argand's idea and ideas of other writers on the geometrical
representation of complex numbers can be found in Volume 4 (1813-14) and Volume 5
(1814-15) of Gcrgonne's Annales dcs Malhimaliqus.
cEoMETRToAL RErR.EsENTATToN oF coMpLEx NUMBERs 63t
3. Werlcc,8, 90-92.
4. Comm. Soc. Gott., 3, 1832
'Chapter
: Werke,2,9Fl48; the main content of thb PaPcr will bc
discussed in 34, sec. 2.
5. Werkt,2, 169-78.
6, Werke,2, 174 tr,
631 FUNcrIoNs oF A coMPLEx VARTABLE
into tlu domain oJ arithmetic [italics added]." He also says that if the units 1
- l, and t/41 hua not been given the names positive, negative, and imag-
inary units but were called direct, inverse, and lateral, people would not
have gotten the impression that there was some dark mystery in these
numbers. The geometrical representation, he says, puts the true meta-
physics of imaginary numbers in a new light' He introduced the term
" complex numbers " as opposed to imaginary numbers,T and used i for
\/=.
8. Weilu,8, 90-92.
TuE FouNDATToN oF coMpLEx FUNcrroN TrrEoRtr 6g9
(10)
I',_,+-
Here he sets , : eto, where d runs from (2n + l)r to 0, and obtains, by
treating the integral as a limit of a sum, the value - (2n + l)il.
Then he notes that the value ofan integral need not be the same when
taken over an imaginary path as when over a real path. He gives the
example
f@ cos ar
(11) d*,
J-* *,
: t * ik, where ,t is constant
where a and b are positive constants. He lets ,t -t\
and positive, and obtains the values r(e-"o - e"b) l2r for k > b and
re-ob for k < b. The second value is also the correct one fot k : 0. Thus
for two different values of i, which means two different paths, one gets two
different results. Poisson was the first man to carry out integrations along a
path in the complex plane.
Though these observations of Gauss and Poisson are indeed significant,
neither published a major paper on complex function theory. This theory
was founded by Augustin-Louis Cauchy. Born in Paris in 1789, in 1805 he
entered the Ecole Polytechnique, where he studied engineering. Because he
was in poor health he was advised by Lagrange and Laplace to devote him-
self to mathematics. He held professorships at the Ecole Polytechnique, the
Sorbonne, and the Colldge de France. Politics had unexpected effects on his
career. He was an ardent Royalist and supporter of the Bourbons. When in
1830 a distant branch ofthe Bourbons took control in France, he refused to
swear allegiance to the new monarchy and resigned his professorship at the
Ecole Polytechnique. He exiled himself to Turin and taught Latin and
Italian for some years. In lB3B he returned to Paris, where he served as
professor in several religious institutions, up to the time when the govern-
ment that took over aftcr the revolution of 1848 did away with oaths of
allegiance. In lB4B Cauchy took the chair of mathematical astronomy in
the Facult€ des Sciences of the Sorbonne. Though Napoleon III restored
the oath in 1852, he allowed Cauchy to forgo it. To the condescending
gesture of the Emperor he responded by donating his salary to the poor of
Sceaux, where he lived. Cauchy, an admirable professor and one of the
greatest mathematicians, died in 1857.
Cauchy- had universal interests. He knew the poetry of his time and
was the author of a work on Hebrew prosody. In mathematics he wrote
Figure 27.4
over seven hundred papers, second only to Euler in number. His works in
a modern edition fill twenty-six volumes and embrace all branches of
mathematics. In mechanics he wrote important works on the equilibrium of
rods and of elastic membranes and on waves in elastic media. In the theory
of light he occupied himself with the theory of waves that Fresnel had
started and with the dispersion and polarization of light. He advanced the
theory of determinants immensely and contributed basic theorems in
ordinary and partial differential equations.
Cauchy's first significant paper in the direction of complex function
theory is his " Mdmoire sur la thdorie des int€grales d6finies." This paper
was read to the Paris Academy in lBl4. However, it was not submitted for
publication until lB25 and was published in l827.1o In the publication
Cauchy added two notes that pretty surely reflect developments between
1814 and 1825 and the possible influence of Gauss's work during that period.
Let us consider the paper proper for the present. Cauchy says in the prefiace
that he was led to this work by trying to rigorize the passage from the real
to the imaginary in processes used by Euler since 1759 and by Laplace since
1782 to evaluate definite integrals, and in fact Cauchy cites Laplace, who
observed that the method needed rigorization. But the paper proper does
not treat this problem. It treats the question of the interchange of the order
of integration in double integrals that arose in hydrodynamical investiga-
tions, Euler had said in I77011 that this interchange was permissible when
the limits for each of the variables under the integral sign were independent
of each other, and Laplace apparently agreed because he used this fact
repeatedly.
Specifically, Cauchy treats the relation
(12) dv dx : !""[."ta,a * o,
J*.['.lt*,r)
wherein xo, go, X, and y are constants (Fig.
27 .a). This interchange of the
order of integration holds whetJ@, y) is continuous in and on the boundary
lO. Min. des sao. dtrangers, (2), l,lS27,599-799 : CEwres, (l), l, 319-506.
ll. Noui Comm. Aead. Sci. Petrop., 14, 1769, 72-103, pub. 1770 : Olera, (l), 17, 289-315'
THE FOUNDATION OF COMPLEX FUNCTION THEORY 6ss
of the region. Then he introduces two functions V(x,y) and S(x,y) such
that
(t 3)
av as _avas
and =- : -;-.
0g Ax ox og
Euler had already shown in 1777 how to obtain such functions (see [5],
[B], and [9]). Now Cauchy considers anf(x,g) which is given by AVIAy :
0Sl0x. ln (12) he replacesrfon the left side by aVlay, and;fon the right side
by 3S/0x, so that
(14)
E"!1,# * o* : I',ff"q** *,
and by using the second equation in (13) he obtains
(ts)
!l"ll"T, * o* : - J',|.j.{u. *
"
These equalities can be used to evaluate double integrals in either order of
integration; however they do not involve complex functions. When Cauchy
says in his Introduction 12 that he will " establish rigorously and directly the
passage from the real to the imaginary (complex)," it is equations (13) that
he has in mind. Cauchy statesls that these two equations contain the whole
theory of the passage lrom the real to the imaginary.
All of the above is in the body of the l8l4 paper, and really gives no
explicit indication of how complex function theory is involved. Moreover,
though Cauchy used complex functions to evaluate definite real integrals in
the same manner as had Euler and Laplace, this use did not involve com-
plex functions as the basic entity. As late as l82l, in his Cours d'analgse,ta
he says
cos + G1 sin a
d
cosD * VJsinD
cos (a 1 b) + t/-t sin (a + D)
v(x,y6)fdx : - s(x6,flf
[..vt*,Y) - J" tsrx,vl
(16) dy
and
He now had the idea that he could combine these two equations and thus
make a statement aboutF(z) : F(x + iy) : S * lZ' Thus, by multiplying
(16) through by i and adding the two equations, he obtains
(rB) Il.'r-, +
iv)i dY *
Il"'a + iY) dx
:
+ ivo) dx + + iv)idv'
ff"ro !,'"n{x
This last result is the Cauchy integral theorem for the simple case of complex
integration around the boundary of a rectangle (Fig. 27.4). One can
express the result thus:
,.^
(le) IADCpk) a, : I
J J ABC
F(z) dz.
lx+lY
(20) JQ) dz,
J*o *,ro
where z : x* ig, and defines the integral carefully as the limit of the sum
where *s, xr, . . . , X, and y6, yy . . . , Y are the points of subdivision along a
path from (xo,y) to (X, Y). Here x + iyis definitely a point of the complex
plane and the integral is over a complex path. He also shows that if we set
x: iU),y: {(t), where I is real, then the result is independent of the
choice of $ and, t!, that is, independent of the path, provided that no dis-
continuity of/(z) lies between two distinct paths. This result generalizes the
result for rectangles.
Cauchy formulates his theorem thus: If/(r + ey) is finite and continu-
ous for ro < t < X andys < y < Y, then the value of the integral (20) is
independent of the form of the function. * : C(r) and y : 'y'(t). His proof
of this theorem uses the method of the calculus of variations. He considers
as an alternative path $(t) + eu(t),,t'(t) + ea(t), and shows that the first
variation of the integral with respect to e vanishes. This proof is not satis-
factory. In it Cauchy not only uses the existence of the derivative of f (z)
but also the continuity ol the derivative, though he does not assume either
fact in the statement of the theorem. The explanation is that Cauchy
believed a continuous function is always differentiable and that the deriva-
tive can be discontinuous only where the function itself is discontinuous.
Cauchy's belief was reasonable in that in the earlier years of his work a
function meant for him, as for others of the eighteenth and early nineteenth
centuries, an analytic expression, and the derivative is given at once by the
usual formal rules of differentiation.
In the lB25 paper Cauchy is clearer about a major idea he had already
touched on in the Paper proper of lB14 and in a footnote to that paper. He
considers what happens whenf (z) is discontinuous inside or on the boundary
16. Bull. des Sci. Math., 7, 187+,265-30+, and 8, 1875, 43-55, t48-59; this paper is not in
Cauchy's Gzare.r.
698 FUNcrroNs oF A coMpLEx VARTABLE
of the rectangl e (Fig. 27 .4). Then the value of the integral along two differ-
ent paths may be different. It at z, : a + ib,f(z) is infinite, but the limit
r:l11,e_2,)f(z)
exists, that is, iflhas a simple pole at zr, then the difference in the integrals
is +2tt/ - lF. Thus for the function ;[ (z) : t l(t * z2), which is infinite
at 2 : 1/-l,so thata : 0 and D : 1,
(2r) F : lim
x+(y-\\/= : -\/=
--- 2--'
;:g t, + fu + t'1t/--t11x + (y - t)t/ -t1
The quantity F itself is what Cauchy called the risidu intigral (integral
residue) in his Exercices fu mathtimatique.lT Also, when a function has several
poles in the region bounded by the two paths of integration, Cauchy points
out that one must take the sum of the residues to get the difference in the
integrals over the two paths. In this particular section on residues his two
paths still form a rectangle, but he takes a very large one and lets the sides
become infinite in length to include all the residues.
In the Ezcrcires 18 Cauchy points out that the residue ofrf(z) at z, is also
the coefficient of the term (. - -1 in the development of;f(z) in powers
of z - zr. Much later, in a paper "r)of l84l,re Cauchy gives a new expression
for the residue at a pole, namely,
where Z is the first value for which;t(z) is discontinuous and ;f(z) is the
largest value of lf(z)l fot all z whose absolute value equals lZl. Thus
Cauchy gives a powerful and easily applied criterion for the expansibility
of a function in a Maclaurin series which uses a comparison series now called
a majorant series.
In the proof of the theorem he first shows that
:
I (X - ?\ o. dv I " da + ! " a*
(22) I
* * ds : [ -"d* + |,, du,
I I (fr *)
20. Conp. Rcwl., 4., 1837, 216-218 = (Euurcs, O) ' a, 38-a2; see also Exerciccs d'analysc ct de
phgsique nathimatique, Yol, 2, 1841, 48-l l2 : CEurtres, (2), 12, +8-112,
21. Conp. ReruL.,23, 1846, 251-55 : CEutrest (l)' 10, 70-74.
64o FUNcrIoNs oF A coMpLEx VARTABLE
where the double integrals are over the area and the single integrals over
the bounding curve. Now, in view of the Cauchy-Riemann equations
(cf. [3]), the left sides are 0 and the two right sides are the integrals that
appear in
(23) az : 2,iE[_f(z)],
.[rct
where E[1[Q)l is his notation for the sum of the residues.
He also took up the subject of the integrals of multiple-valued func-
tions.23 In the first part of the paper, where he treats integrals of single-
valued functions, he does not state much more than what Gauss had pointed
out in his letter to Bessel apropos of I axlx o, I ar1\ * x2). The integrals
are indeed multiple-valued and their values depend on the path of
integration.
But Cauchy goes further to consider multiple-valued functions under
the integral sign. Here he says that il the integrand is an expression lor the
roots of an algebraic or transcendental equation, for example, J u3 dz where
tos : z, and if one integrates over a closed path and returns to the starting
point, then the integrand now represents another root. In these cases the
value of the integral over the closed path is not independent of the starting
22. " Sur les intdgrales dans lesquelles la fonction sous le signe J change brusquement de
valeur," Comp. Rend.,23, 1846,537 and 557-69 : (Euures, (l), 10, 133-34 and 135-43.
23. " Considdrations nouvelles sur les int6grales difinies qui s'itendent e tous les points
d'une courbe fermde," Comp. Rend.,23, 18+6,689-702 : @wres, (l), 10, 153-68.
THE FOUNDATION OF COMPLEX FUNCTION THEORY 64,
point; and continuation around the path produces different values of the
integral. But if one goes around the path enough times so that z, returns to
its original value, then the values ofthe integral will repeat and the integral
is a periodic function of z. The periodicity modules (indiees de piriodicittl) ol
the integral are not any longer, as in the case of single-valued functions,
representable by residues. Cauchy's ideas on the integrals of multiple-
valued functions were still vague.
For about twenty-five years, from 1821 on, Cauchy singlehandedly
developed complex function theory. In lB43 fellow countrymen began to
take up threads of his work. Pierre-Alphonse Laurent (1813-54), who
worked alone, published a major result obtained in I 843. He showed
ea
24. Conp. Rend., 17 , 1843,348-49; the full paper was published in tt,e Jour' dc I'E'cob Poly',
23, 1863,75-20+.
25. Werlw, l, 5l-66.
26. Jour. de Math., 15, 1850, 365-480.
642 FUNcrroNs oF A coMpLEx VARTABLE
27. Con!. Rend.,32, 1851,68-75and 162-64 = (Euares, (l), 11,292-300 and 30,$-5.
28. @uares, (l), ll.
\^/EIERSTRASS'S APPROACH TO FUNCTION THEORY 6+s
6. Elliptic Functions
Paralleling the development ol the basic theorems of complex function
theory during the first halfofthe century was a special development dealing
with elliptic and later Abelian functions. There is no doubt that Gauss
obtained a number ofkey results in the theory of elliptic functions, because
many of these were found after his death in papers he had never published.
However, the acknowledged founders of the theory of elliptic functions are
Abel and Jacobi.
Niels Henrik Abel (1802-29) was the son of a poor pastor. As a student
in Christiania (Oslo), Norway, he had the luck to have Berndt Michael
Holmbtie (1 795-1850) as a teacher. The latter recognized Abel's genius
and predicted when Abel was seventeen that he would become the greatest
mathematician in the world. Alter studying at Christiania and at Copen-
hagen, Abel received a scholarship that permitted him to travel. In Paris
he was presented to Legendre, Laplace, Cauchy, and Lacroix, but they
ignored him. Having exhausted his funds, he departed for Berlin and spent
the years 1825-1827 with Crelle. He returned to Christiania so exhausted
that he found it necessary, he wrote, to hold on to the gate ofa church. To
earn money he gave lessons to young students. He began to receive attention
ELLTPTTc FUNcrIoNs 64s
through his published works, and crelle thought he might be able to secure
him a professorship at the University of Berlin. But Abel became ill with
tuberculosis and died in I829.
Abel knew the work of Euler, Lagtange, and Legendre on elliptic
integrals and may have gotten suggestions for the work he undertook from
,.-i.k. made by Gauss, especially in his Disquisitiones Arithmeticac ' He
himself started to write papers in 1825. He presented his major Paper on
integrals to the Academy ol Sciences in Paris on October 30, 1826, for
pubiication in its journal. This paper, " M6moire sur une propri€t6 g6ndrale
d'u.,e clas.. tres-dtendue de fonctions transcendantes," contained Abel's
great theorem (sec. 7). Fourier, the secretary of the Academy at the time,
iead the introduction to the paper and then referred the paper to Legendre
and Cauchy for evaluation, the latter being chiefly responsible' The paper
was long and difficult, only because it contained many new ideas' Cauchy
laid it aside to favor his own work. Legendre forgot about it. After Abel's
death, when his fame was established, the Academy searched for the paper,
found it, and published it in lB4l.2e Abel published other papers in crclle's
Journal and, Gergonne,s Annales on the theory of equations and elliptic
functions. These appeared from lB27 on. Because Abel's main paper of 1826
was not published until I B4l, other authors, learning the more limited
theorems published in between these dates, obtained independently many
of Abel's 1826 results.
The other discoverer of elliptic functions was Carl Gustav Jacob
Jacobi(1804-51).UnlikeAbel,helivedaquietlife'BorninPotsdamtoaa
jewish iamily, he studied at the I;niversity of Berlin and in lB27 became
"p.of.r.o, at kOnigsberg. In lB42 he had to give up his post because of ill
irealth. He was given a pension by the Prussian government and retired to
Berlin, where ne aiea in 1851. His fame was great even in his own lifetime,
and his students spread his ideas to many centers'
for many years' His
Jacobi taught the subject of elliptic functions
.pp.:ou"h to it became the model according to which the theory of functions
itr.r*u.developed.Healsoworkedinfunctionaldeterminants(Jacobians),
ordinary and partial differential equations, dynamics, celestial mechanics,
fluiddynamics,andhyperellipticintegralsandfunctions'Jacobihasoften
been labeled a pure mathematician, but, like almost all mathematicians
of
his own and preceding centuries, he took most seriously the investigation of
nature.
While Abel was working on elliptic functions, Jacobi, who had also
read Legendre,s work on elliptic integrals, started work in 1827 on the corre-
spondirr! functions. He submitted a paper to the Astronomische Nachrichtenso
*itho,t"p.oof,. Almost simultaneously, Abel published independently his
29. Min. des sao. itrangers, 7 , l8+1, 176-264 : CEuores, 145-21 '
l
" Recherches sur les fonctions elliptiques." 31 Both had arived at the key
idea of working with inverse functions of the elliptic integrals, an idea
Abel had had since 1823. Jacobi next gave proofs of the results he had
published in lB27 in several atticles of Crelle's Journal for the years 1B2B to
1830. Thereafter, both published on the subject of elliptic functions; but
whereas Abel died in 1829, Jacobi lived until l85l and was able to publish
much more. In particular, Jacobi's Fundamenta Noua Theoriac Furutionum
Elliptitarum of 182932 became a key work on elliptic functions.
Through letters from Jacobi, Legendre became familiar with Jacobi's
and Abel's work. He wrote toJacobi on February 9, 1828, " It is a great satis-
faction to me to see two young mathematicians so successfully cultivate a
branch of analysis which has long been my favorite field, but not at all
received as it deserves in my own country." Legendre then published three
supplements (1829 and 1832) to his Traiti d.cs fonctions elliptiques (2 vols.,
1825-26), in which he gave accounts ofJacobi's and Abel's work.
The general elliptic integral concerns
f_
(2s) u: )R(x,{P1x1'1,
where P(.r) is a third or fourth degree polynomial with distinct roots and
R(x, y) is a rational function of x and y. The efforts to deduce some general
facts about the function z of r had to fail because the very meaning of the
integral was limited for Euler and Legendre. The coefficients of P(x) were
real, and the range of * was real and moreover did not contain a root of
P(*) : 0. With more knowledge of the theory of complex functions some
advance might have been made in learning something about u as a function
of *, but this knowledge was not available. As it turned out, Abel and
Jacobi had a better idea.
To be specific, Legendre had introduced (Chap. 19, sec. 4) the elliptic
integrals F(k,i), E(k,il, and r(n,k,il. lt was Abel who, about 1826,
observed that if, to consider F(k, $) for example, one studied
(26)
":l: \rT=-ew=wa dx
_t'Jo d6
1-- 7z
1/ 6'
"inz
where x : sin d, then one encountered the same difficulties as when one
studied
u:f' dx : arcsinr.
Jo 1/1 - *z
The nicer relationships come from studying r as a function of rz. Hence Abel
proposed to study * as a function of z in the case of the elliptic integrals.
Since x : sin d, one can also study / as a function ofz.
31. Jour. fir Math.,2, 1827, l0l-€1, and 3, 1828, l6G-90 : (Euares,263-388.
32. Werk, 1,49-239.
ELLIPTIC FUNCTIONS 6+l
33 the notation
Jacobi introduced
4: o*u
for the function $ of u defined by (26). He also introduced
cos{ : cosamu and A6 : Lo*u : \/T---iEl;, E.
This notation was abbreviated by Gudermann to
,: sind: sin arnu: snu, cos/: cos arnu : cnu,
L,$ : L,amu: dnu.
sind:itand, .orO:;!,
cos I
^(s,r)
:*#,
33. Fundamenta Noaa, 1829.
6+8 FUNCTIONS OF A COMPLEX VARIABLE
where d : am i u, so that
t_ lx dx
u: E(x) - J, \/R(4
where y2 : .lt(r) is a polynomial of degree four, Euler had obtained the
addition theorem (Chap. 19, sec. 4)
E(x)+E(x,):E(xs)
where is a known rational function of xr, tr, gy and y2, and y : \/F6.
*.
Abel thought that for the inverse lunction * : 6(u) there might be a simple
addition theorem, and this proved to be the case. This result too appeared
in his lB27 paper. Thus for real u and tt
: snucnadna + snoenudnu
(28) sn(u + u)
| - k2snz u sn2 u
with analogous formulas lor cn(u a o) and dn(u 1 a). These are the addition
theorems for elliptic functions and the analogues of the addition theorems
for elliptic integrals.
Having defined the elliptic functions for real and imaginary values of
the arguments, Abel was able with the addition theorems to extend the
definitions to complcx values. For, if z : u ! ir,sn z : sn(u a iz) nowhas
a meaning in view of the addition theorem, in terms of the sn, cn, and dn
of u and of la separately.
6+g
z+4K+2iK',
Figure 27.5
(30) 0( z\ : !
.L ,-nzt+rntz.
65o FUNcrroNs oF A coMpLEx vARTABLE
where z and , are complex and, Re(t) > 0. The series converge absolutely
and uniformly in any bounded region of the z-plane. Jacobi introduced
four theta functions and then expressed sn u, cn u, and dn u in terms of these
functions. The theta functions are the simplest elements out of which the
elliptic functions can be constructed. He also obtained various expressions
for the theta functions in the form of infinite series and infinite products.
Further study of ideas in Abel's work led Jacobi to relations between the
theta functions and the theory of numbers. This connection was taken up
Iater by Hermite, Kronecker, and others. The pursuit of relations among
many different forms of theta functions was a major activity for mathema-
ticians of the nineteenth century. It was one of the many fads that sweep
through mathematics regularly.
In an important paper of 1835sa Jacobi showed that a single-valued
function of a single variable which for every finite value of the argument
has the character of a rational function (that is, is a meromorphic function)
cannot have more than two periods, and the ratio ol the periods is neces-
sarily a nonreal number. This discovery opened up a new direction ofwork,
namely, the problem of finding all doubly periodic functions. As early as
184435 Liouville, in a communication to the French Academy of Sciences,
showed how to develop a complete theory of doubly periodic elliptic func-
tions starting from Jacobi's theorem. This theory was a major contribution
to elliptic functions. In the double periodicity Liouville had discovered an
essential property of the elliptic functions and a unifying point ol view lor
their theory, though the doubly periodic functions are a more general class
than those designated by Jacobi as elliptic. However, all the fundamental
properties of elliptic functions do hold for the doubly periodic funoions.
Weierstrass took up the subject of elliptic funcrions about 1860. He
learned Jacobi's work from Gudermann, and Abel's work through his
papers. These impressed him so much that he constantly urged his pupils
to read Abel. For his teacher's certificate he took up a problem assigned to
him by Gudermann, namely, to represent the elliptic functions as quotients
of power series. This he did. In his lectures as a professor he constantly
reworked his theory of elliptic functions.
Legendre had reduced the elliptic integrals to three standard forms
involving the qquare root of a fourth degree polynomial. Weierstrass
arrived at three different forms involving the square root of a third degree
polynomial,36 namely
d*
I .\/4;;=*r .f xdx .f d*
J - t" J \/4F - t,-- r"'J e - a)\/4F=V,*- t"'
34. Jour. far Math., 13, 1835, 55-78 : Werke,2,23-50.
35. Con!. Rend., 19, 1844, l26l-63, and 32, 1851, 450-52.
36. Sitzungsber. Akad. Wiss. zu Berlin,l882, 443-51 = Werke,2,245-55; see also Werkc, 5.
HYPERELLIPTIc INTEGRALS AND ABEL'S TIIEORBM 65,
which he obtained by solving the differential equation for /x/dz given by the
above integral, and then used the addition theorem for p(u), in a manner
similar to Abel's, to obtain the full function. Weierstrass's work completed,
remodeled, and filled with elegance the theory of elliptic functions.
Though we shall not enter into specific details, we cannot leave the
subject of elliptic functions without mentioning the work of Charles Hermite
(1822-1901), who was a professor at the Sorbonne and at the Ecole Poly-
technique. From his student days on, Hermite constantly occupied himself
with the subject of elliptic functions. In lB92 he wrote, " I cannot leave the
elliptic domain. Where the goat is attached she must graze." He produced
basic results in the theory proper and studied the connection with the theory
of numbers. He applied elliptic functions to the solution o[ the fifth degree
polynomial equation and reated problems of mechanics involving the
functions. He is known also for his proof of the transcendence of a and his
introduction of the Hermite polynomials.
(31 )
In6'v;a''
where rR(x, y) is a rational function of x and y, y2 : P(x), and the degree of
P(*) is at least five. When P(r) is of degree five or six, the integrals were
called ultraelliptic in the mid-nineteenth century. To emphasize complex
values it is common to write
a';
(32)
InP' 4
and P(z) is often written as
(33) u2 : P(z) : A(z - er)... (z - e).
Of course z is a multiple-valued function of z.
Among the integrals of the form (32) there are some that are every-
where finite. These basic ones are
fdz f zdz I z!-rdz
(34) u,:
);'"r: ) u'"')ttp:J
"'
where a is given by (33) and p : (n - 2)12 or (n - l)/2 according as z is
even or odd. For n : 6 (and so p : 2), there are two such integrals. The
general integrals (32) have at most poles and logarithmically singular
points, that is, singular points that behave like log z at z : 0. Those in-
tegrals of the first class, that is, those which are everywhere finite and so
have no singular points, can always be expressed in terms of the I integrals
(34), which are linearly independent.
For the case n : 6 (and p: 2) integrals of the second class are
exemplified by
f z2 dz I z3 dz
(3s)
)@,)f,ppi
where P(z) is a polynomial of the sixth degree. The integrals of the first and
second kind for n : 6 have four periods each.
The hyperelliptic integrals are functions of the upper limit z if the
lovVer limit is fixed. Suppose we denote one such function by ra. Then, as in
the case of the elliptic integrals, one can raise the question as to what is the
inverse function z of w. This problem was tackled by Abel, but he did not
solve it. Jacobi then tackled it.3? Let us consider with Jacobi the particular
hyperelliptic integrals
z
(36) ttt : tz dz
I '-----:, and u :t-,f z dz
J o t/ P(z) J" \/pe)
37. Jour. fir Math., 9, 1832, 394-403 : Wulee, 2, 7-16, and Jour, fiir l[ath., 13, 1835,
55-78 : Werke,2,25-50 and 516-21.
HYPERELLIPTIC INTEGRALS AND ABEL,S THEOREM 6sg
Irzt dz fzz dz
(37)
J" '\/VA ' Jo !@Gi -agt
f zr --l
z dz zz- zdz
t- u2.
(38)
l" t/F@ t t/ P(z) -
Jacobi succeeded in showing that the symmetric functions zL + 22 and
z1Z2 zte each single-valued functions of zo1 and ar, with a system ofllozr
periods. The lunctions z, and z, of the two variables w1 and' w2 can then
te obtained. He also gave an addition theorem for these functions' Jacobi
left many points incomplete. "For Gaussian rigor," he said, "we have no
time."
The study of a generalization of the elliptic and hyperelliptic integrals
was begun by Galois, but the more significant initial steps were taken by
Abel in his 1826 paper. He considered (32), that is,
a''
(3e)
I nP' z1
(40) f(u, z) : g.
Equations (39) and (40) define what is called an Abelian integral, which then
includes as special cases the elliptic and hyperelliptic integrals'
Though Abel did not carry the study of Abelian integrals very far, he
proved a key theorem in the subject. Abel's basic theorem is a very broad
generalization of the addition theorem for elliptic integrals (Chap' 19,
sec. 4). The theorem and proof is in his Paris paper of 1826 and the state-
ment of it is in Crelle's Journal for 1829.38 Let us consider the integral
(41)
I n6'4 ax'
(43)
1""o"
*(*,r) dx * ... + [<'^*^t R@,g) dx
with fixed (but arbitrary) lower limits is expressible by rational functions of
*1'lt. ,.,.r, and y, and logarithms of such rational functions, with the
addition of a sum of a certain number 1 of integrals
...
(45)
!i'*,.,,, dx, + * J" R(x,y) dx
: Ii *r-,r, * * I: R@,s) dx * Rr(xt,!t,...t xmtuat A,y(A), B,y(B))
+) const. log Rr(x r, g r, . . . t x ms mt A, y (A), B, y (B)),
!
Figure 27.6
The branch-cut need not be the positive *-axis but it must, in the present
case, join 0 and oo. The points 0 and o are called branch-points because
the branches of w2 : z are interchanged when z describes a closed path
about each.
The function u2 : z d\dt consequently, its associated Riemann surface,
are especially simple. Let us consider the function w2 : z3 - z. This
function also has two branches that become equal at z : O, z : l, z : -1,
and z : oo. Moreover, though we shall not present the full argument, all
four of these points are branch-points, because if z makes a circuit around
any one of them the value of zo changes frorn that of one branch to that of
another. The branch-cuts can be taken to be the line segments from 0 to l,
0 to - l, I to co, and - I to o. As z crosses any one of these cuts, the value
of zz changes from those it takes on one branch to those of the second.
For more complicated multiple-valued functions the Riemann surface
is more complicated, An z-valued function requires an z-sheeted Riemann
surface. There may be many branch-points, and one must introduce
branch-cuts joining each two. Moreover the sheets that come together at
one branch-point need not be the same as those that come together at
another. If& sheets coincide at a branch-point, the order ofthe point is said
to be ,t - l. However, two sheets of a Riemann surface may touch at a
point but the branches of the function may not change as z goes completely
around the point. Then the point is not a branch'point.
It is not possible to represent Riemann surfaces accurately in three-
dimensional space. For example, the two sheets for w2 : z must intersect
along the positive r-axis if represented in three dimensions, so that one must
be cut along the positive r-axis, whereas the mathematics requires smooth
passage from the first sheet into the second and then, after a circuit around
z : 0, back to the first sheet again.
Riemann surllaces are not merely a way of portraying multiple-valued
functions but, in effect, make such functions single-valued on the surface as
6S8 FUNcrroNs oF A coMpLEx VARTABLE
(48) ,:[?H*.4*0,)
Thus a is also determined, and so is u. It is important to note that for
Riemann the domain of z was any Part of a Riemann surface, including
possibly the whole surface. In his doctoral thesis he considered surfaces
with boundaries and only later used closed surfaces, that is, surfaces
without boundaries, such as a torus.
To determine u, Riemann's essential tool was what he called the
Dirichlet principle, because he learned it from Dirichlet; but he extended it
to domains on Riemann surfaces and, moreover, prescribed singularities for
a in the domain and prescribed jumps (conditions 2 and 3 above). The
Dirichlet principle says that the function u that minimizes the Dirichlet
integral
+ (it)')a.av
II{e)',
satisfies the potential equation. The latter isin fact the necessary condition
that the first variation of the Dirichlet integral vanish (see also Chap' 28,
66o FUNCTIoNS oF A coMpLEx VARTABLE
sec. 4). Since the integrand in the Dirichlet integral is positive and so has a
lower bound which is greater than or at worst zero, Riemann concluded
that there must be a function a that minimizes the integral and so satisfies
the potential equation. Thus the existence ol the function rz and therefore
by (aB) of;f(z) which belongs to the Riemann surface and may even have
prescribed singularities and complex jumps (periodicity modules) was
assured as far as Riemann was concerned.
Once the existence of functions on a given Riemann surface as their
domain has been established, one can show that there is a fundamental
equation that can be associated with the given surface; that is, there is an
:
"f(u, r) 0 that has the given surface as its surfiace. Just how the surface
corresponds to the relationship between u and z Riemann does not state.
Actually this ;f(zo, z) : 0 is not unique. As a matter of fact, from every
rational function u, of w and z on the surface one can obtain through
-f (w, ,) : 0 another equation , (rr,,l , z) : O which, if irreducible, has the
same Riemann surface. This is a feature of Riemann's method.
To investigate further the kinds of functions that can exist on a Rie-
mann surface, it is necessary to become acquainted with Riemann,s notion
of the connectivity of a Riemann surface. A Riemann surface may have
boundary curves or be closed like a sphere or a torus. If it is the Riemann
surface ofan algebraic function, that is, if;f(a;, z) : 0 defines zr., as a func-
tion of z and;f is a polynomial in zo and z, then the surface is closed. If /is
irreducible, that is, cannot be expressed as a product of such polynomials,
then the surface consists ofone piece or is said to be connected.
A plane or a sphere is a surface such that any closed curve divides it
into two parts so that it is not possible to pass continuously from a point in
one part to a point in the second without crossing the closed curve. Such a
surface is said to be simply connected. If, however, it is possible to draw
some closed curve on a surface and the curve does not disconnect the
surface, then the surface is not simply connected. For example one may
draw two different closed curves on the torus (Fig.27.7), and even the
presence of both does not disconnect the surface.
Riemann wished to assign a number that indicated the connectivity
of his surface. He regarded the poles and branch-points as part ofthe surface
and because he had algebraic functions in mind, his surfaces were closed.
By removing a small portion of one sheet the surface had a boundary curve
C. He then thought of the surface as cut by a non-self-intersecting curve
which runs from the boundary C to another point of the boundary C. Such
a curve is called a cross-cut (Querschnitt). This cross-cut and C are regarded
as a new boundary and a second cross-cut can be introduced that starts at
one point ofthe (new) boundary and ends at another and does not cross the
(new) boundary.
A sufficient number of these cross-cuts is introduced so as to cut up a
RIEMANN AND MULTIPLE-VALUED FUNCTIONS 66r
Figure 27.
Riemann surface that may be multiply connected into one simply connected
surface. Thus, ifa surface is simply connected, no cross-cut is necessary and
the surf;ace has connectivity (Grundzahl) l. A surface is called doubly con-
nected if by one appropriate cross-cut it is changed into a single simply
connected surface. Then the connectivity is 2. A plane ring and a spherical
surface with two holes are examples. A surface is called triply connected
when by two appropriate cross-cuts it is converted into a single simply
connected surface. Then the connectivity is 3. An example is the (surface of
a) torus with a hole in it. In general a surface will be said to be .tr[-ply
connected or has connectivity i[ if by il - I appropriate cross-cuts it can
be changed into a single surface that is simply connected. A spherical surf,ace
with z holes in it has connectivity N.
It is now possible to relate the connectivity of a Riemann surliace
(with one boundary) and the number of branch-points. Each branch-point,
r,, say, is to be counted according to the multiplicity of the number of
branches of the function which interchange at that point. If the number
is wr, i : 1,2, . . ., r, then the multiplicity of r, is wt - l. Suppose the surface
has 4 sheets. Then the connectivity N is given by
N:>u,-2q*3.
I
One can show that the connectivity .l[ of a closed surf;ace with a single
boundary is2p + l. Hence
2e :>w, - 2q * 2.
The integer p is called the genus of the Riemann surface and of the associ'
ated equationlf(w, z) : 0. This relation was established by Riemann'
A special case of considerable importance is the surface of
w' : (. - a)(z - az)... (. - o),
662 FUNcrroNs oF A coMpLEx VARTABLE
!: [+-
ln t
when z is even
when z is odd.
[2
Given a Riemann surface determined by;f(za, z) : 0, we know that zo
is a single-valued function of the points on the surface. Then every rational
function ofar and z is also a single-valued function of position on the surface
(because we can replace ro in the rational function by its value in terms of
z). AIso the branch-points of this rational function, though not its poles, are
the same as those of.7f, Conversely, one can prove that every single-valued
function of position on the surface having poles of finite order is a rational
function of u and z.
Even in the case of simple single-valued functions defined on the
ordinary plane, the integrals ofsuch functions can be multiple-valued. Thus
f2 dz
u+
J"w: n1r
where ul is the value of the integral along, say, a straight line path from.
0 to z, and n depends upon how the path from 0 to z circles + i. Likewise,
when considering functions single-valued on a Riemann surface, say a
rational function of ar and z on the surface, the integral of such a function
may be multiple-valued. This does indeed occur. If one then introduces
cross-cuts to make the surface simply connected, and if the integral takes a
path from zlto 22, each time the path crosses a cross-cut a constant value I
is added to the basic value Uof the integral for a path on a simply connected
portion of the surface. If the path should cross the cross-cut m times in the
same direction, then the value rnl is added to U. The constant I is called a
periodicity modulus. Each cross-cut introduces its own periodicity modulus,
and if the connectivity of the surface is .try' 1 l, there are .tr[ linearly inde-
pendent periodicity moduli. Let these be f r, Ir,. . . , .Io. Then the value of
the integral of the original single-valued function taken over its original
path is
where m1, fltb.. .t zo are integers. The d are generally complex numbers,
ABELIAN INTEORALS AND FUNCTIONS 669
w+ )^,.,,
r=1
where the are integers and the <,r, are periodicity moduli for this integral.
zn,
The integrals of the second kind have algebraic but not logarithmic
infinities. An elementary integral of the second kind has an infinity of first
order at one point on the Riemann surface. If E(z) is a value of the integral
at one point on the surface (the upper limit of the integral), then all the
values of the. integral are included in
2p
E(z) +2"*,,
where the z, are integers and the e, are the periodicity moduli for this
integral. Two elementary integrals with an infinity at a common point on
the Riemann surface differ by an integral of the first kind. From this we can
infer that there are y' 1 I linearly independent elementary integrals of the
second kind with an infinity at the same point on the Riemann surface.
40. Vol.54, 1857, l15-55 : Wcrhe,88-144.
66+ FUNCTIONS OF A COMPLEX VARIABLE
They did so because they had reason to distrust the Dirichlet principle.
Weierstrass, in a paper read in l870,al pointed out that the existence ofa
function that minimizes the Dirichlet integral was not established. Riemann
himself was of another mind. He recognized the problem of establishing the
existence of a minimizing function for the Dirichlet integral belore Weier-
strass made his statement, but declared that the Dirichlet principle was just
a convenient tool that happened to be available; the existence ofthe function
z, he said, was nevertheless correct. Helmholtz's remark on this point is also
interesting: ". . . for us physicists the [use ofthe] Dirichlet principle remains
a proof." a2
Another of the new investigations in complex function theory that
Riemann launched is the inversion of Abelian integrals, that is, to deter-
mine the function z of u when
u : R(2, u) dz
Jo
)t)
Figure 27.8
where c ar,d c' are determinable from the position ofthe polygon and where
a, b, c, .. ., correspond to A, B, C, . . '. The mapping has proved to be very
useful in solving the potential (Laplace) equation.
where
Bibliograplry
Abel, N. H.: vols., 1881, Johnson Reprint Corp., 1964.
(Euares complites,2
Mimorial publii d l'occasion du centinaire dt sa naissance, Jacob Dybwad,
Kristiania, 1902.
Brill, A., and M. Noether: " Die Entwicklung der Theorie der algebraischen
Functionen in ?ilterer und neuerer Zeit," Jahtes. der Deut. Math.-Verein., 3,
1892i3, 109-556, 155 86 in particular.
Brun, Viggo: "Niels Henrik Abel. Neue biographische Funde," Jour. fiit Math.,193,
1954,239-49.
Cauchy, A. L,; (Ewres complites,26 vols., Gauthier-Villars, 1882-1938, relevant
papers.
Crowe, Michael l.: A History of Vector Analgsis, University of Notre Dame Press,
1967, Chap. 1.
Enneper, A.: Elliptische Funkliontn: Theorie und Geschichte,2nd ed., L' Nebert, 1890.
Hadamard, Jacques: Nolice sur les lraaaux scienlifques de M- Jacques Hadamatd,
Gauthier-Villars, 1901.
Jacobi, C. G. J.: Gesammelte Werke, T vols' and Supplement, G. Reimer, 1881-91;
Chelsea reprint, 1968.
Jorrrdair.r, Philip E. B. : " The Theory of Functions with Cauchy and Gauss,"
Bibliotecha Mathematica, (3), 6, 1905, 190-207 .
Klein, Felix: Vorlesungen tiber die Entwicklung der Mathematik im 19. Jahrhundert,
2 vols., Chelsea (reprint), 1950.
L€vy, Paul, et al. : " La Vie e t l'ceuvre de J. Hadamard," L'Enseignement Mathi-
matique, (2), 13, 1967, l-72.
Markuschewitsch, A. I.: Skizzen zur Geschichte der analgtischen Funktionen, Y EB
Deutscher Verlag der Wissenschaften, 1955.
Mittag-Leffier, G. : " An Introduction to the Theory of Elliptic Functions,"
Annak of Math., 24, 192213' 271-351.
67o ruNcrroNs oF A coMpLEx VARIABLE
................._:"Die ersten .10 Jahre des Lebens von Weierstrass," Acta Math.,39, 1923,
l-57.
Ore, O. : Nicls Hcnrik Abcl, Matlumatieian Exlratrdinary, University of Minnesota
Press, 1957.
Osgood, W. F. : " Allgemeine Theorie der analytischen Funktionen," Encgk. dtr
Math. Wiss., B. G. Teubner, 1901-21, II Bl, l-114.
Reichardt, flarx, ed.: Gauss: Lcben und Werk, Haude und Spenersche Verlags-
buchhandlung, 1960; B. G. Teubner, 1957, l5l-82.
Riemann, Bernhard: Gcsammclte mathematisclu l,l/erke, 2nd, ed., Dover (reprint),
1953.
Schlesinger, L.: "Uber
Gauss' Arbeiten zu Funktionenlehre," Nathrichten Kdnig.
Gdtt., 1912, Beiheft, l-143; also in Gauss's Wcrke, 102,77 tr.
Ges. dcr Wiss. zu
Smith, David Eugene : A Sourcc Book in Matlumatics,2 vols., Dover (reprint), 1959,
pp. 55-66, ,10,1-10.
Staectel, PauI: " Integration durch imaginiires Gebiet," Bibliotccha Matlumalita,
(3), l, 190'0, 109-28.
Valson, C. A.: La Vic et hs traaauc du baron Cauchg,2 vols., Gauthier-Villars, 1868,
Weierstrass, Karl: Matlumatische Walur T vols., Mayer und Mi.iLller, 1895-1924.
28
Partial Differential Equations
in the Nineteenth Century
The profound study of nature is the most Grtile source of
mathematical discoveries. JosEPrr ForrEER
l. Introduction
The subject ofpartial differential equations, which had its beginnings in the
eighteenth century, burgeoned in the nineteenth. As physical science ex-
panded, in both the variety and depth of the phenomena investigated, the
number of new types of differential equations increased I and even the tyPes
already known, the wave equation and the potential equation, were
applied to new areas of physics. Partial differential equations became and
remain the heart of mathematics. Their importance for physical science is
only one of the reasons for assigning them this central place. From the
standpoint of mathematics itself, the solution of partial differential equations
created the need for mathematical developments in the theory of functions,
the calculus ofvariations, series expansions, ordinary differential equations,
algebra, and differential geometry. The subject has become so extensive
that in this chapter we can give only a few of the major results.
We are accustomed today to classifying partial differential equations
according to tyPes. At the beginning of the nineteenth century, so little
was known about the subject that the idea of distinguishing the various
types could not have occurred. The physical problems dictated which
equations were to be pursued and the mathematicians passed freely from
one type to another without recognizing some differences among them that
we now consider fundamental. The physical world was and still is indifferent
to the mathematicians' classification.
67,
672 PARTIAL EquATroNs rBoo-r9oo
(2) H:.T
subject to the boundary conditions
(3) 7(0,,) : 0, T(l,t) :0, for t > 0,
and the initial condition
(4) T(x,O):f(x) for0<r</.
To solve this problem Fourier used the method ofseparation ofvariables.
He let
(s) T(x, t) : 4@){(t).
On substitution in the differential equation, he obtained
!'\n
k'6@): {'a).tu)
He then argued (cf. [30] of Chap. 22) that each of these ratios must be a
constant, -) say, so that
(6) 6"@) + Ak2$(x) : o
and
(7) *',(t) + rl(,) : 0.
6@):bsin(y'-l*r+c)'
The condition that {(0) :0 implies that c:0' The condition /(l) :0
imposes a limitation on ,\, namely that 17 must be an integral multiple of
rlkl. Hence there are an infinite number of admissible values ,\, of tr or
(ll) f@):>b,sin'ff.
Fourier then faced the question, Can f (z) be represented as a trigonometric
In particular, can the D, be determined ?
'series ?
Fourier proceeded to answer these questions. Though by this time he
was somewhat conscious of the problem of rigor, he proceeded formally in
the eighteenth-century spirit. To follow Fourier's work we shall, for
simplicity, let I : r. Thus we consider
(12)
"t@:>b"sinvx, for0 < x < z.
v=1
Fourier takes each sine function and expands it by Maclaurin's theorem into
a polr'er series; that is, he uses
(13)
By equating coefficients of like powers of* in (14) and (15), Fourier finds
thatjf(")(0) : 0 for even,t and beyond that
L
) vzn-Lbn: (_ l )z- y(2" - 1)(0), n:1,2,3,....
Now the derivatives of f (x) are known, because;f(r) is a given initial con-
dition. Hence the 6, are an infinite set of unknowns in an infinite system of
linear algebraic equations.
In a previous problem, wherein he faced this same kind of system,
Fourier took the first & terms and the right-hand constant of the first ft
equations, solved these, and by obtaining a general expression {ot b",r,
which denotes the approximate value of D, obtained from the first it equa-
tions, he boldly concluded that 6, : Iimr- - i,,s. However, this time he
had much difficulty in determining the 0,. He took several different;f(r)'s
and showed how to determine the D, by very complicated procedures that
involved divergent expressions. Using these special cases as a guide he
obtained an expression for D, involving infinite products and infinite sums.
Fourier realized that this expression was rather useless, and by further bold
and ingenious, though again often questionable, steps finally arrived at the
formula
(16) u" : ?,
!" *l sin vs ds.
\i
(le) J@) -ao -,
-
(, l
,/.
ay cos vt(.
!=1
(20)
Figure 28. I
Fourier ne"er gave any complete proof that an " arbitrary" function
can be represented by a series such as (20). In the book he gives some loose
arguments, and in his final discussion of this point (paragraphs 415, 416,
and,423) he gives a sketch ofa proof. But even there Fourier does not state
the conditions that a function must satisfy to be expansible in a trigono-
metric series. Nevertheless Fourier's conviction that this was possible is
expressed throughout the book. He also saysa that his series are convergent
no matter what /(*) may be, whether' or not one can assign an analytic
expression to f (x) and, whether or not the function follows any regular law.
Fourier's conviction that any function can be expanded in a Fourier series
rested on the geometrical evidence described above. About this he says in
his book (p. 206), " Nothing has appeared to us more suitable than geo-
metrical constructions to demonstrate the truth of the new results and to
render intelligible the forms which analysis employs for their expressions."
Fourier's work incorporated several major advances. Beyond furthering
the theory of partial differential equations, he forced a revision in the
notion of function. Suppose the function ! : x is represented by a Fourier
series (20) in the interval (-n, n). The series repeats its behavior in each
interval of length 22. F{ence the function given by the series looks as shown
in Figure 28.1. Such functions cannot be represented by a single (finite)
analytic expression, whereas Fourier's predecessors had insisted that a
function must be representable by a single expression. Since the entire
function ! : x for all x is not represented by the series, they could not see
how an arbitrary function, which is not periodic, could be represented by
the series, though Euler and Lagrange had actually done so for particular
nonperiodic functions. Fourier is explicit that his series can represent func-
tions that also have different analytical expressions in different parts of the
interval (0, z) or (- r, n), whether or not the expressions join one another
continuously. He points out, finally, that his work settles the arguments on
solutions of the vibrating-string problem in favor of Daniel Bernoulli.
F-ourier's work marked the break lrom analytic lunctions or functions de-
velopable in Taylor's series. It is also significant that a Fourier series
(2r)
where the qo are determined by the boundary conditions and the anare de-
termined by the initial conditions, Fourier now regards the q, as abscissas of
a curve and the an as the ordinates ofthat curve. Then a, : Q(il where Q
is some function of q. He then replaces (21 ) by
and seeks to determine Q. He goes back to the formula for the coefficients
where {(r) would usually be the initial functron. By a " limiting process "
that replaces a,, by Q and n by 4, he obtains
where F(*), an even function, is the given initial temperature in the infinite
domain. Then by using (23) in (22) and by an interchange of the order of
integration, which Fourier does not bother to question, he has
U:-
2f-| F(") *[:, -kq2t cos qx cos qa dq.
rJo
Fourier then does the analogous thing for an odd F(.r), and so finally
obtains
(24)
"::l:" F(") d' I"* ' -kc2' cos q(x
- a) dq.
This form shows that F (x) can be resolved into an infinite number ol
harmonic components with continuously varying frequency ql2zr and with
amplitude (l t2n) J': - F (a)e - too da, whereas the ordinary Fouricr serics
resolves a given function into an infinite but discrete set ol harmonic
components.
Cauchy's derivation of the Fourier integral is somewhat similar. The
paper in which it appeared, " Th6orie de la propagation des ondcs,"
receivcd the prizc of thc Paris Academy for lBl6.5 This paper is the first
large investigation of waves on the surface o[ a fluid, a subject initiated by
5. Mim. diaers satans, l, 1827,3-312: (Euues, (l), l, 5-318; sec also Cauchy, Noriu.
Bull. de la Soc. : (Euures, (2), 2,223-27.
Phil., 1817, l2l-2+
THE POTENTIAL EQUATION AND GREEN'S THEOREM 68r
F(*):
(27)
I; cos mx J(m) dm.
(28) -f (*)
::I: cos mu F (u) dtt.
where a is the surface normal ofthe body directed inward and do is a surface
element. Theorem (32), incidentally, was also proved by Michel Ostro-
gradsky ( 180l-61), a Russian mathematician, who presented it to the St.
Petersburg Academy of Sciences in lB2B.8
Green then showed that the requirement that Z and each of its first
derivatives be continuous in the interior ofthe body can be imposed instead
ofa boundary condition on the derivatives of Z. In light of this fact, Green
represented V in the interior of the body in terms of its value Z on the
boundary (which function would be given) and in terms ofanother function
U which has the properties: (a) U must be 0 on the surface; (b) at a fixed
but undetermined point P in the interior, U becomes infinite as I /r where r
is the distance of any other point from P; (c) U must satisfy the potential
equation (30) in the interior. If U is known, and it might be found more
readily because it satisfies simpler conditions than V, then V can be
represented at every interior point by
8. Jour. ftu Math.,39, 1850, 73-89; ++, rc52,356-74; and 47,1854, 16l-221 = Grccn's
Matfumaticol Palers, 187 l, 3-l 15.
9. Mim. Acatl. Sci. St. Petas., (6), l, 1831, 39-53.
68+ eARTTAL DTFTERENTTAL EeuATroNs r8oo-rgoo
where the integral extends over the surface, and 0Ul7n is the derivative of
U in the direction perpendicular to the surface and into the body. It is
understood that the coordinates of P are contained in 0Ul0n and are the
arguments at P. This function U, introduced by Green, which Riemann
later called the Green's function, became a fundamental concept of partial
differential equations. Green himsellused the term " potential function " for
this special function U as well as for V. His method of obtaining solutions of
the potential equation, as opposed to the method of using series of special
functions, is called the method of singularities. There is unfortunately no
general expression for the function [,/, nor is there a general method for
finding it. Green was content in this matter to give the physical meaning of
U for the case of the potential created by electric charges.
Green applied his theorem and concepts to electrical and magnetic
problems. He also took up in 183310 the problem of the gravitational
potential o[ ellipsoids of variable densities. In this work Green showed that
when Z is given on the boundary of a body, there is just one function that
satisfies LV :0 throughout the body, has no singularities, and has the
given boundary values. To make his proof, Green assumed the existence of
a function that minimizes
(33)
ilI l(X)'.(e#)'.(Y.)'l*
This is the first use of the Dirichlet principle (cf. Chap. 27, sec. 8).
In this lB35 paper Green did much of the work in z dimensions instead
of three and also gave important results on what are now called ultra-
spherical functions, which are a generalization to n variables of Laplace's
spherical surface harmonics. Because Green's work did not become well
known for some time, other men did some of this work independently.
Green is the first great English mathematician to take up the threads of
the work done on the Continent after the introduction of analysis to
England. His work inspired the great Cambridge school of mathematical
physicists which included Sir William Thomson, Sir Gabriel Stokes, Lord
Rayleigh, and Clerk Maxwell.
Green's achievements were lollowed by Gauss's masterful work of
1839,11 "Allgemeine Lehrsiitze in Beziehung auf die im verkehrten Ver-
hiiltnisse des Quadrats der Entfernung wirkenden Anziehungs-und Ab-
stossungs-krafte " (General Theorems on Attractive and Repulsive Forces
Which Act According to the Inverse Square of the Distance). Gauss proved
rigorously Poisson's result, namely, that LV - -4np at a point inside the
acting mass, under the condition that p is continuous at that point and in a
10. Trans. Canb. Phil. Soc., 53, 1835, 395-430 : Mathematiral Papeu, 187-222.
ll. Resultate aus det Beobachtungen des magwlischen Vereins, Yol. 4, 1840 : Werke, 5, 197-242.
THE POTENTIAL EQUATION AND GREEN'S THEOREM 6Bs
small domain around it. This condition is not fulfilled on the surf;ace of the
acting mass. On the surface the quantities A2Vl0x2, A2Vl0g2, and 02Vl0z2
have jumps.
The work thus far on the potential equation and on Poisson's equation
assumed the existence of a solution. Green's prool of the existence of a
Green's function rested entirely on a physical argument. From the existence
standpoint the fundamental problem of potential theory was to show the
existence of a potential function Z, which William Thomson about lB50
called a harmonic function, whose values are given on the boundary of a
region and which satisfies LV : O in the region. One might establish this
directly, or establish the existence of a Green's function U and from that
obtain tr/. The problem of establishing the existence of the Green's function
or ol Zitsellis known as the Dirichlet problem or the first boundary-value
problem ol potential theory, the most basic and oldest existence problem
of the subject. The problem of finding a tr/ satisfying LV : 0 in a region
when the normal derivative of Z is specified on the boundary is called the
Neumann problem, alter Carl G. Neumann (1832-1925), a professor at
Leipzig. This problem is called the second fundamental problem ofpotential
theory.
One approach to the problem of establishing the existence of a solution
of L,V:0, which Green had already used (see [33]), was brought into
prominence by William Thomson. ln 1847 12 Thomson announced the
theorem or principle which in England is named after him and on the
Continent is called the Dirichlet principle because Riemann so named it.
Though Thomson stated it in a somewhat more general form, the essence of
the principle may be put thus: Consider the class ofall functions U that have
continuous derivatives of the second order in the interior and exterior
domains T and T'respectively separated by a surface S. The U's are to be
continuous everywhere and assume on S the values ofa continuous function
f The lunction V that minimizes the Dirichlet integral
(3+)
IilKXr.(H)',.(#)1^
is the one that satisfies L,V : 0 and takes on the value;fon the boundary,S.
The connection between (34) and Atrl is that the first variation of lin the
sense of the calculus of variations is AZ, and this must be 0 for a minimizing
Z. Since for real U,l cannot be negative, it seemed clear that a minimizing
function Z must exist, and it is then not diflicult to prove it is unique. The
Dirichlet principle is then oza approach to the Dirichlet problem ofpotential
theory.
12. Jour. de Math.,12, 1847, 493-96 : Cambridge and Dublin Moth. Jour.,3, 1848, 8,f-87 :
Math, and Plqsical Papers, 1,93-96.
686 nARTTAL DTFFERENTTAL EquATroNs rSoG-Igoo
t : -f (x,y), 1 : eQc's)
where
{:t+i,1
produces another analytic function G(() : C(E * i1), and its real and
imaginary parts also satisfy AZ(f, ?) : 0. Now if the original Potential
13. Chap.27,zec.9.
cuRvrLrNEAR cooRDrNATEs 687
5. Curoilinear Coordinates
Green introduced a number of major ideas whose significance extended far
beyond the potential equation. Gabriel LamC (1795-1870), a mathema-
tician and engineer concerned primarily with the heat equation, introduced
another major technique, the use of curvilinear coordinate systems, which
could also be used for many types of equations. Lamd pointed out in lB331a
that the heat equation had been solved only for conducting bodies whose
surfaces are normal to the coordinate planes r: const., y: const., and
z : const. Lam€'s idea was to introduce new systems of coordinates and the
corresponding coordinate surflaces. To a very limited extent this had been
done by Euler and Laplace, both of whom used spherical coordinates p, 0,
and /, in which case the coordinate surfaces p : const,, 0 : const., and
d : co.rst. are spheres, planes, and cones respectively. Knowing the equa-
tions that transform from rectangular to spherical coordinates, one can, as
Euler and Laplace did, transform the potential equation from rectangular
to spherical coordinates.
The value of the new coordinate systems and surfaces is twofold. First,
a partial differential equation in rectangular coordinates might not be
separable into ordinary differential equations in this system but might be
separable in some other system. Secondly, the physical problem might call
for a boundary condition on, say, an ellipsoid. Such a boundary is repre-
sented simply in a coordinate system wherein one family of surfaces consists
of ellipsoids, whereas in the rectangular system a relatively complicated
equation must be used. Moreover, after separation of variables in the proper
coordinate system is employed, this boundary condition becomes applicable
to just one of the resulting ordinary differential equations.
14. Jour. dr l:Ecoh Poly., 14, 1833, 19,*-251,
688 rARTTAL DTFFERENTTAL EquATroNs r8oo-I9oo
Lam6 introduced several new coordinate systems for the express pur-
pose of solving the heat equation in these systems.rs His chief system was the
three f,amilies of surfaces given by the equations
x2 u2 22
p+f,_6r+Xr=T-l:o
*1
* =!',=+ =t' .-l:o
*1
* + ^"
^Y',= ' ,2 _c2^
y2'v2_b2 - l:0.
where tr2 > e' > p' > b2 > v2. These three families are ellipsoids, hy-
perboloids ofone sheet, and hyperboloids of two sheets, all of which possess
the same foci. Any surface of one family cuts all the surfaces of the other two
orthogonally, and in fact cuts them in lines of curvature (Chap. 23, sec. 7).
Any point in space accordingly has coordinates (tr, g., r), namely the tr, p,
and v of the surfaces, one from each family, which go through that point.
This new coordinate system is called ellipsoidal, though Lamd called it
elliptical, a term now used for another system.
Lam6 transformed the heat equation for the steady-state case (tem-
perature independent of time), that is, the potential equation, to these
coordinates, and showed that he could use separation of variables to reduce
the partial differential equation to three ordinary differential equations. Of
course these equations must be solved subject to appropriate boundary
conditions. In a paper of 183916 Lam6 studied further the steady-state
temperature distribution in a three-axis ellipsoid and gave a complete solu-
tion of the problem treated in his lB33 paper. In this lB39 paper he also
introduced another curvilinear coordinate system, nor r' called the sphero-
conal system, wherein the coordinate surfaces are a family of spheres and
two families of cones. This system too Lam6 used to solve heat conduction
problems. Lamd wrote many other papers on heat conduction using ellip-
soidal coordinates, including a second one ofl lB39 in the same volume ofthe
Journal de Mathimatiques, in which he treats special cases of the ellipsoid.lT
The subject of mutually orthogonal families of surfaces had such
obvious importance in the solution of partial differential equations that it
became a subject of investigation in and for itself. In a paper of 183418
Lam6 considered the general properties of any three families ol mutually
orthogonal surfaces and gave a procedure for expressing a partial differen-
(38)
02u 02u 02u I 02u
axz ' au2 ' azz 42 aP
-t-l-:--
wherein 4 is a constant. The solution z is given by
(39) u(x,y, z, t)
: *1" / cos 0,s t
[i" r,(x I / sin d,
atsin arsin
wherein 0 and. $ are the usual spherical coordinates. The domain ofintegra-
tion is the surface of a sphere So, with radius aI about the point P with
coordinates x, y, and, z,
23. Mhn. de l'Acad. dcs Sci., Paris, (2), 3, 1818, l2l-76.
THE llIA\rE AND REDUCED WAVE EqUATTONS 69r
Figure 28.2
Figure 28.3
* :;(,fi - ",],) *
,*
,:l("*-.*) * u,,
Then Green's theorem states
of air in a tube (organ pipe) with an oPen end, gave the first general
investigation of its solutions.2? He was concerned with the acoustical prob-
lem in which ro is the velocity potential of a harmonically moving air mass,
rt is a constant determined by the elasticity of the air and the oscillation
frequency, and l, which equals 2n lk, is the wavelength' By applying Green's
theorem he showed that any solution of Ara * h2w : 0 that is continuous in
a given domain can be represented as the effect of single and double layers
of excitation points on the surface of the domain. Using e-rk'l4nr as one of
the functions in Green's theorem, he obtained
Pu: "t'
wherein we have let u : wetot so that ,t: olc. Then (46) may be written as
Darboux in his Thiorit genCrale des surJaces fo both cite Riemann's paper of
1858/59. If the given equation is
L(u\ : A*
ox' * rr 3*
ox oy
* c*oy'+ o*ox+ E*
oy
+ Fu : o,
wherein the coefficients are functions of x all.d y, one integrates the product
ll(u) over an arbitrary domain R of the xy-plane under the assumption that
u, a, and their first and second partial derivatives are continuous. Then
integration by parts yields the generalized Green's theorem, which states
that
L(u) : 64
ox* b!.
* o*- og
+,u : o,
then
circle. Then the generalized Green's theorem gives, when the circle is
contracted to (f, a) :
where z is positive if directed to the outside of the domain; the single integral
is taken counterclockwise over the boundary. Since z satisfi.es L(u) : 0, if
we know u and if u and 0ul0n are given (both are not arbitrary) on the
boundary, then we have u expressed as a single integral. The function a is
called a Green's function, though often the condition that a vanish on the
boundary of .R is added in the definition of the Green's function. Various
specializations and generalizations of this use of Green's theorem have been
developed.
Dw-AP ta0 *
P Dt:Pz,-Tz + i17 lt\w
-0u0u0u
0:'-*;-*;-,
ox o! oz
31. M,im de l' Acad. des Sci., Paris, (2) 1827 , 37 5-94.
'
SYSTEMS OF PARTIAL DIFFERENTIAL EQUATIONS 6g7
where A has the usual meaning; p is the density ofthe fluid;y', the pressure;
u, o, and u are the components of velocity of the fluid at any ,(, y, z, and
t; X, Y, and Z are the components of an external force; the constant p,
which depends on the nature of the fluid, is called the coefficient of vis-
cosity; and the derivative DlDt has the meaning explained in Chapter 22,
section 8. For an incompressible fluid, 0 : 0.
The equations were also obtained in lB29 by Poisson.s2 They were
then rederived in lB45 on the basis of the mechanics of continua by George
Gabriel Stokes ( I B 19-1903), professor ol mathematics at Cambridge Uni-
versity, in his essay "On the Theories of the Internal Friction of Fluids in
Motion."33 Stokes endeavored to account for the frictional action in all
known liquids, which causes the motion to subside by converting kinetic
energy into heat. Fluids, by virtue of their viscosity, stick to the surfaces of
solids and thus exert tangential forces on them.
The subject of elasticity was founded by Galileo, Hooke, and Mariotte
and was cultivated by the Bernoullis and Euler. But these men dealt with
specific problems. To solve them they concocted ad hoc hypotheses on how
beams, rods, and plates behaved under stresses, pressures, or loads. The
theory proper is the creation of the nineteenth century. From the beginning
of the nineteenth century on a number of great men worked persistently to
obtain the equations that govern the behavior of elastic media, which
includes the air. These men were primarily engineers and physicists. Cauchy
and Poisson are the great mathematicians among them, though Cauchy was
an engineer by training.
The problems of elasticity include the behavior of bodies under stress
wherein one considers what equilibrium position they will assume, the
vibrations of bodies when set in motion by an initial disturbance or by a
continuously applied force, and, in the case of air and solid bodies, the
propagation of waves through them. The interest in elasticity in the nine-
teenth century was heightened by the appearance about 1820 of a wave
theory of light, initiated by the physician Thomas Young (1773-1829) and
by Augustin-Jean Fresnel (l7BB-1827), an engineer. Light was regarded as
a wave motion in ether, and ether was believed to be an elastic medium.
Hence the propagation of light through ether became a basic problem.
Another stimulus to a strong interest in elasticity in the early nineteenth
century was Ernst F. F. Chladni's (1756-1827) experiments (1787) on the
vibrations of glass and metal, in which he showed the nodal lines. These
should be related to the sounds given off by, for example, a vibrating
drumhead.
The work to obtain basic equations of elasticity was long and full of
()+,,)ff*nou:,?u
(51)
. .A0 02o
(i+p)5-n+rA.tt:PAp
The first two equations are the primary ones and amount to six scalar
(non-vectorial) partial differential equations. The displacement current is
the term AQE)lAt
By working with just these equations, Maxwell predicted that electro-
magnetic waves travel through space and at the speed of light. On the basis
of the identity of the two speeds, he dared to assert that light is an electro-
magnetic phenomenon, a prediction that has been amply confirmed since
his time.
No general methods for solving any of the above systems of equations
are known. However, the nineteenth-century men gradually realized that
in the case of partial differential equations, whether single equations or
systems, general solutions are not nearly so useful as the solutions for
specific problems where the initial and boundary conditions are given, and
where experimental work might also aid one in making useful simplifying
assumptions. The writings of Fourier, Cauchy, and Riemann furthered this
realization. The work on the solution of the many initial- and boundary-
value problems to which specializations of these systems gave rise is enor-
mous, and almost all of the mathematicians of the century undertook such
problems.
B. Existence Theorems
As the eighteenth- and nineteenth-century mathematicians created a vast
number of types of differential equations, they found that methods of
solving many of these equations were not available. Somewhat as in the
case of polynomial equations, where efforts to solve equations of degree
higher than four failed and Gauss turned to the proof of existence of a root
(Chap. 25, sec. 2), so in the work on differential equations the failure to find
explicit solutions, which of course ipso;facro demonstrate existence, caused
the mathematicians to turn to proof of the existence of solutions. Such
proofs, even though they do not exhibit a solution or exhibit it in a useful
form, serve several purposes. The differential equations were in nearly all
cases the mathematical formulation of physical problems. No guarantees
were available that the mathematical equations could be solved; hence a
38. For the meaning of curl and div see Chap. 32, scc. 5.
700 PARTIAL EquATroNs rBoo-r9oo
proof of the existence of a solution would at least insure that a search lor a
solution would not be attempting the impossible. The proof of existence
would also answer the question: What must we know about a given physical
situation, that is, what initial and boundary conditions insure a solution and
preferably a unique one? Other objectives, perhaps not envisaged at the
beginning of the work on existence theorems, were soon recognized. Does
the solution change continuously with the initial conditions, or does some
totally new phenomenon enter when the initial or boundary conditions are
varied slightly ? Thus a parabolic orbit that obtains for one value of the
initial velocity ofa planet may change to an elliptic orbit as a consequence
ofa slight change in the initial velocity. Such a difference in orbit is physi-
cally most significant, Further, some of the methodologies ol solution, such
as the use of the Dirichlet principle or Green's theorem, presupposed the
existence of a particular solution. The existence of these pariicular solutions
had not been established.
Before we give some brief indications of the work on existence theorems,
it may be helpful to note a classification ofpartial differential equations that
was actually made rather late in the century. Though some eflorts toward
classification by reducing thcse equations to normal or standard forms had
been made by Laplace and Poisson, the classification introduced by Du
Bois-Reymond has now become standard. In 1BB93e he classified the most
general homogeneous linear equation of second order
Tdxz-Sdxdg-tRdY2:9.
The characteristics are imaginary, real and distinct, or real and coincident
according as
€:i@,y), n:*@,!),
39. Jour. fnr Math., 104, 1889, 241-301.
EXISTENCE THEOREMS 7OI
the above equation can always be transformed into one of the three types of
normal lorms
k\ R'*+P'Y+A'?*zu:o
oE' ol o7l
respectively. The two families 6@,5) : const. and {(*, y) : const' are the
equations of two families of characteristics.
The supplementary conditions that can be imposed differ for the three
types ofequations. In the elliptic case (a) one considers a bounded domain of
thi ry-plane and specifies the value of a on the boundary (or an equivalent
condition) and asks for the value of u in the domain. For the initial-value
problem of the hyperbolic differential equation (b) one must specify a and
Auf An on some initial curve. There may also be boundary conditions' The
proper initial conditions for the parabolic case (c) were not sPecified at this
time, though it is now known that one initial condition and boundary
conditions can be imposed. This classification ofpartial differential equations
was extended to equations in more independent variables, higher-order
equations and to systems. Though the classification and the supplementary
conditions were not known early in the century, the mathematicians
gradually became aware ofthe distinctions and these figured in the existence
theorems they were able to Prove.
The work on existence theorems became a major activity with Cauchy,
who emphasized that existence can often be established where an explicit
solution is not available. In a series of papersao Cauchy noted that any
partial differential equation of order greater than one can be reduced to a
system of partial differential equations, and he treated the existence of a
solution for the system. He called his method the calcul des limites but it is
known today as the method of majorant functions. The essence of the method
is to show that a power series in the independent variables with a definite
domain of convergence does satisfy the system of equations. We shall illus-
trate the method in connection with Cauchy's work on ordinary differen-
tial equations (Chap. 29, sec. 4). His theorem covers only the case of
analytic coefficients in the equations and analytic initial conditions'
To obtain some concrete idea about Cauchy's work, we shall consider
what it implies for the second order equation in two independent variables
(55) r : f(2, x,Y, P, q, s, t)
40. Comp. Rend., 14, 1842, lO2C_i25 : (Euores, (l), 6, 461-67, and Conp' Rcnd'' 15, 1842'
4,1-59, 85-101, l3l-38 : (Euures, (l),7, 17-33' 33-49, 52-58.
7O2 PARTTAL DIFFEREMnAL EqUATIONS IEOO-r9OO
it is necessary to solve for r before it can be put in the form (55). To consider
a simple but vital case, if the equation is
"
: n# + 28
ffi * rf:', * rX * rYE + Fz : o,
(s7)
\-1
Z
t,k
At*-a*,
a*u
02u
*1u'#,* ":'
where the coefficients are functions of the independent variables *11 r21 :rrd
x., then the exceptional case occurs when the initial surface ,S satisfies the
first order partial differential equation
o'
(58)
2o" *9,#,-
41. Jour. fw Math., 80, 1875, l-32.
42. Bull. Soc. Math, th Framc,26, 1098, 129-34.
703
Along such surfaces two solutions of (57) can be tangent and even have
higher-order contact. This property is the same as for the characteristic
curves of the first order equationlf(x, !, u, !, 8) : 0 (Chap. 22, sec. 5), and
so these surfaces S are also called characteristics. Physically the surfaces S
are wave fronts.
This theory of characteristics for the case of two independent variables
was known to Monge and to Andrd-Marie Amptre (1775-1836). fts exten-
sion to the case of second order equations in more than two independent
variables was first made by Albert Victor Bicklund (1845-1922),43 but was
not widely known until it was redone by Jules Beudon.aa
ln his Legons sur la Propagation des ondes (1903), Jacques Hadamard
(1865-1963), the leading French mathematician of this century, generalized
the theory of characteristics to partial differential equations of any order.
As an example, let us consider a system ofthree partial differential equations
of the second order in the dependent variables (, 1, and ( and the inde-
pendent variables xr,x2,...,*o. The Cauchy problem for this system is:
Given the values of {, 1, l, and A{lfu,7ql{x", and 0fl0x* on a "surface"
Mn-, of n - I dimensions, to find the functions (, 1, and' (. The values of
the second and higher derivatives of t, n, and ( may then be comPuted
unless Mr-, satisfies a first order partial differential equation of the sixth
degree, say H :0. All " surfaces " satisfying H :0 are characteristic
" surflaces." According to the theory of first order partial differential equa-
tions, the differential equation 11 : 0 has characteristic lines (curves)
defined by
d*, : dr, dxn -,
aHlah- aHlaPz- aHlaP*_r'
where Pr, Pr, . . . , Pn -, are the partial derivatives of x,, with respect to
r15 !(27 . taken along the " surface " Mn-r. These lines are called the
.., *o-1
bicharacteristics of the original second order system. In the theory of light
they are the rays.
The characteristics now play a vital role in the theory of partial dif-
ferential equations. For example, Darbouxas has given a powerful method
ofintegrating second order partial differential equations in two independent
variables that rests on the theory of characteristics. It converts the problem
to the integration of one or more ordinary differential equations and
embraces the methods of Monge, Laplace, and others.
Another class of existence theorems dealt with the Dirichlet problem,
that is, establishing the existence ofa solution of L'V : 0 either directly or
by means of the Dirichlet principle. The first existence proof of the Dirichlet
46. Schwarz's method is sketched in Felix Klein, Vorlesurgen iiber die Entwicklung d.er
Mathenatik im 19. Jahrhundert, Chelsea (reprint), 1950, l, p. 265, and given fully in A. R.
Forsyth, Theory of Functions, Dover (reprint), 1965, 2, Chap. 17. Many references are given
in the latter source.
47. Monalsber. Beiliner Akad., 1870,767-95: Gu. Math. Abh.,2, 144-71.
48. Kdn;glich Sdchsischen Ges. der Wiss. zu Leipzig, 1870, 49-56, 264-321,
49. The method is described in O. D. Kellogg, Foundations of Potential Theory, .lwlius
Springer, 1929,281 tr
50. Second cd., 1884, 238 tr
51. Amer. Jour. of Math., 12, 1890, 2ll-94 : (Euares, 9, 28-113.
52. Jahres. der Deut. Mqth.-Verein., 8, 1900, l8,l-88 : Gu. Abh.,3, l0-14,
53. Math. Ann., 59, 1904, 16l-86 = Gcs. Abh., 3, 15-37.
705
54. Comp. Rend., 107,1888, 939-41, Jour. de Math., (4), 6' 1890, l4F2l0' Jotr. dc Math.,
(5), 2, 1896, 29F304.
55. Jour dc I'Enh Poly.,60, 1890,89-105.
56. Jour. fnr Math., 104, 1889, 241-30t.
76 PARTIAL DTFFER ENTIAL EqUATTONS rBOO-rgOO
For the continuous initial conditions z and dal0n (and, z in the second case),
the solutions will be regular or satisfy the differential equation everywhere
in the rectangles described above. Discontinuities in a and 1ulAn will
propagate along the characteristics in the rectangle.
A great deal of work was done in the second halfofthe century on the
existence of eigenvalues for Au + k2u : 0 considered in a domain D. The
main result is that for a given domdin and under any one of the three
boundary conditions u :0, 0ul0n :0, 0ul0n * hu :0 (/z > 0 when the
positive normal is directed outside the domain), there are always an infinite
number of discrete values of *2, for each of which there is a solution. In two
dimensions the vibrations of a membrane fixed along its boundary illus-
trate this theorem. The values of * are the frequencies of the infinitely many
purely harmonic vibrations. The corresponding solutions give the deforma-
tion of the membrane in carrying out its characteristic oscillations.
The first major step was the proof by Schwarz 67 of the existence of the
6rst eigenfunction of
au + {f(x,g)u: o,
that is, the existence of a U, such that
LU,+k?f(x,y)u,:o
and U, : 0 on the boundary of the domain considered. His method gave
a procedure for finding the solution and permitted the calculation of &f.
Picard 68 then established the existence of the second eigenvalue *!.
Schwarz also showed in the l8B5 paper that when the domain varies
continuously, the value of *f, the first characteristic number, also varies
continuously; and as the domain becomes smaller, ,tf increases unboundedly.
Thus a smaller membrane gives off a higher first harmonic.
In lB94 Poincardse demonstrated the existence and the essential
properties of all the eigenvalues of
(60) Lu+lu:-f,
,\ complex, in a bounded, three-dimensional domain, with z : 0 on the
boundary. The existence of a was demonstrated by a generalization of
Schwarz's method. He proved next that u(,\) is a meromorphic function of
the complex variable ,\ and that the poles are real; these are just the
eigenvalues lo. Then he obtained the characteristic solutions U,, that is,
57, Acta Soc. Fennicae, 15, 1885, 315-62: G*. Math. Abh., l, 223-69.
58. Conp. Rend., 117,1893, 502-7.
59. Rcndiconti del Circoh Mahmatito di Palermo, S, 1894, 57-155 : (Euarcst 9, 123-96.
BIBLIOCRAPI{Y 707
The /rf are the characteristic numbers (eigenvalues) and determine the
frequencies of the respective characteristic solutions.
Physically Poincar€'s result has the following significance' The func-
tionrflin (60) can be thought of as an applied force. The free oscillations of
a mechanical system are those at which the forced oscillations degenerate
and become infinite. In fact, (60) is the equation ol an oscillating system
excited by a periodically varying force ol amplitudel and the character-
istic solutions are the free oscillations of the system, which, once excited,
continue indefinitely. The lrequencies of the free oscillations, which are
proportional to the *,, are calculated by Poincard's method as the values of
y'l for which the forced oscillation u becomes infinite'
At the end of the century the systematic theory of boundary- and
initial-value problems for partial differential equations, which dates from
Schwarz's fundamental lBB5 paper, was still young' The work in this area
expanded rapidly in the twentieth century.
Bibliograplry
Bacharach, Max: Abriss der Geschichtc der Potentiahtuuie, Vandenhoeck and
Ruprecht, 1883.
Burkhardt,H.:..EntwicklungennachoscillirendenFunctionenundlntegration
der Differentialgleichungen der mathematischen Physik," Jahres' dcr Deut'
Math.-Verei.n., Vol. 10, 1908, l-1804.
Burkhardt, H. and W. Franz Meyer: " Potentialtheorie ," Ercyk' dcr Math' Wiss''
B. G. Teubner, 1899-1916, II A7b,464-503'
Cauchy, Augustin-Louis: (Euares complites, Gauthier-Villars, 1890, (2), I'
Fourier, Joslph: ?/ra Analytital Theory of Heat (1822), Dover reprint of English
translation, 1955.
(Euores, 2 vols., Gauthier-Villars, 1888-90; Georg olms (reprint), 1970'
Green, George: Essay on the Applicalion of Mathematical Analysis to the Thtorics of
-i Etectrixity and Magnctism, 1828, reprinted by Weziita-Melins Aktiebolag'
1958; also in Ostwald's Klassiker 16l (in German), Wilhelm Engelmann'
1895.
Mathematieal Papers, Macrnillan, l87l; Chelsea (reprint), 1970'
Heine,Eduard:HandbuchderKugelfunktionen,2vols',1878-gl'PhysicaVerlag
-: (reprint), 1961.
Helmholtz, Hermann von : " Theorie der Luftschwingungen in Riihren mit offenen
Enden," Oslwald's .Klassitter der exakl,en Wissenschqften, Wilhelm Engelmann'
1896.
Klein, Felix: Vorlesungcn nber die Entwicklung der Matlumatik im 19' Jahthunderl' 2
l. Introduction
Ordinary differential equations arose in the eighteenth century in direct
response to physical problems. By tackling more complicated physical
phenomena, notably in the work on the vibrating string, the mathematicians
arrived at partial differential equations. In the nineteenth century the roles
of these two subjects were somewhat reversed. The efforts to solve partial
differential equations by the method of separation of variables led to the
problem of solving ordinary differential equations. Moreover, because the
partial differential equations were expressed in various coordinate systems
the ordinary differential equations that resulted were strange ones and not
solvable in closed form. The mathematicians resorted to solutions in infinite
series which are now known as special functions or higher transcendental
functions as opposed to the elementary transcendental functions such as
sin .r, e*, and log *.
After much work on the extended variety ol ordinary differential
equations, some deep theoretical studies were devoted to tyPes of such
equations. These theoretical investigations also differentiate the nineteenth-
century work from that'of the eighteenth century. The contributions of the
new century were so vast that, as in the case ofpartial differential equations,
we cannot hope to survey all of the major developments. Our topics are a
sample of what was created during the century.
7o9
7to oRDINARY DTFFERENTTAL EQUATIONS rBOO-I9OO
(he wrote.I[ and his,t is our x). Bessel also obtained the series
(2)
and many other relations involving the Bessel function of the first kind.
The generalization of the Bessel function Jr(r) to complex z and * was made
by several men2 with (2) remaining as the correct form.
Since there should be two independent solutions for a second order
equation, many mathematicians sought it. When z is not an integer this
second solution is J-"(x). For integral z the second solution was given by
l, Abh. Konig. Akad. der W*s. Berlin, 1824, 1-52, pub, 1826 Werke, l, 8+109.
=
2. Chiefly by Eugen C. J. Lommel (1837-99) in his Srudirr iber die Bcssel'sclun Furutiorcn
(1868).
SERIES SOLUTTONS AND SPECTAL FUNCTTONS 7rt
Carl G. Neumann.o However, the form adopted most commonly today was
given by Hermann Hankel (1B39-73),{ namely,
Y.(z) :ZcoffP#{"- G1
+ 2v - >L- Zil
$ 11r72111 -"+2t(n - r - I) !
z-,rT
t=O
6 also gave the expansion of an analytic
where 7 is Euler's constant. Neumann
function;[(z), namely,
but he did in unpublished material.lo Of course the equation and the series
solution
F(a'B,y;1): r(y)l(y-a-P)
t(y-a)t(y-p)
He also established the convergence of the series (cL Chap. 40, sec. 5). The
notation F(", F, y; x) is due to Gauss.
Another class of special functions was introduced by Lam6.11 In
Chapter 28, sec. 5, we pointed out that Lam6, working on a steady-state
temperature distribution in an ellipsoid, separated Laplace's equation in
ellipsoidal coordinates p, p., and, v. This process gives the same ordinary
differential equations in each of the three variables, namely,
)2 E( o\ dE(p)
(4) (p'- h')(p'- 721 :-:-Y1 + p(2p, - h2 - k2)'
dp
8.See, for cxample, E. W. Hobson, The Theory oJ Spheri.cal and Ellipsoidal Harmonics,
1931, Chelsea (reprint), 1955.
9.Comm. Soc. Sci. Gott.,2, l8l3 : Wuke,3, 123-62,
10. Werke, 3, 207-30.
ll. Jour. dc Math.,2, 1837, 147-831'4, 1839, 126-63.
SERTES SOLUTIONS AND SPECIAL FUNCTIONS 713
The solutions E(p) zre called Lamd lunctions or ellipsoidal harmonics. For
integral z these functions fall into four classes of the form
E'"(P) :* alPn-2 *"'
aoP"
Fi$) : dp
(2n + I)E\G)
T: W
and such functions are called Lamd functions ofthe second kind. These were
12 and Heine.13
introduced by Liouville
The differential equations
# *, - 2k2 cos2,t)i -- o
ft-A-2k2cosh2t){:o
arose in Mathieu's work on the vibrations of an elliptical membrane la and
they also arise in problems on the potential of elliptical cylinders when
separation ol variables is applied to Az * kzu : 0 expressed in elliptical
cylindrical coordinates in two dimensions. These elliptic coordinates,
incidentally, are related to rectangular coordinates by the equations
r: icosh f cos ?, y:isinhfsin?,
where x : + h, ! : 0 are the foci of the conlocal ellipses and hyperbolas of
the lamily of ellipses and the family of hyperbolas in the elliptic coordinate
system. The variety olforms in which Mathieu's equation is written and the
many notations for the solutions by different authors Present a confusing
picture. The functions defined by either differential equation were called by
Heine functions of the elliptic cylinder and are now called Mathieu functions.
Mathieu and Heine first got series expressions for the solutions. Then they
sought to fix the parameter a so that one class of solutions is periodic and of
period 22. The problem of finding periodic solutions, which are the most
important ones for physical applications, was pursued throughout the
century. In 188315 Gaston Floquet (1847-1920) published a complete dis-
cussion of the existence and properties of the periodic solutions of a linear
differential equation of the zth order having periodic coefficients which have
the same period <u, The general properties of the solutions having been
12. Jour. d,e Math., 10, 1845,222-28.
13. Jour. far Math., 29, 1845, 18F208.
14. Jour. de Math., (2),13, 1868, 137-201.
15. Ann. de l'Ecole Norm. Su!., (2), 12, 1883, 47-48.
714 oRDINARY DIFFERENTTAL EQUATTONS I SOO-r gOO
3. Sturm-Liouaille Theory
The problems involving partial differential equations of mathematical physics
usually contain boundary conditions such as the condition that the vibrating
string must be fixed at the endpoints. When the method of separation of
variables is applied to a partial differential equation this equation is resolved
into two or more ordinary differential equations and the boundary conditions
on the desired solution become boundary conditions on one ordinary
differential equation. The ordinary equation generally contains a Parameter,
which in fact results from the separation ofvariables procedure, and solutions
can usually be obtained for particular values ofthe parameter. These values
are called eigenvalues or characteristic values and the solution for any one
eigenvalue is called an eigenfunction. Moreover, to meet the initial condition
or conditions of the original problem it is necessary to express a given
functionrf(x) in terms of the eigenfunctions (see, for example, I l] of Chap.
28).
These problems of determining the eigenvalues and eigenfunctions ofan
ordinary differential equation with boundary conditions and of expanding a
given function in terms ofan infinite series ofthe eigenfunctions, which date
from about I 750, became more prominent as new coordinate systems r,vere
introduced and new classes of functions such as Bessel functions, Legendre
polynomials, Lam6 functions, and Mathieu functions arose as the eigen-
functions of ordinary differential equations. Two men, Charles Sturm
(1803-55), professor of mechanics at the Sorbonne, and Joseph Liouville
(1809-82), a friend of Sturm and professor of mathematics at the Colldge de
France, decided to tackle the general problem for any second order ordinary
differential equation. Sturm had been working since lB33 on problems of
partial diflerential equations, primarily on the flow of heat in a bar of
variable density, and hence was fully aware of the eigenvalue and eigen-
function problem.
The mathematical ideas he applied to this problem le are closely
related to his investigations of the reality and distribution of the roots of
algebraic equations. His ideas on differential equations, he says, came from
the study of diflerence equations and a passage to the limit.
19. Jour. dc Math., 1,1836, 106-86 and 373-444.
716 oRDINARY DTFFERENTTAL EQUATTONS rEOO-r gOO
(a) The problem has a non-zero solution only when tr takes on any one
ofa sequence of values ln of positive numbers which increase to co.
(b) For each ln the solutions are multiples of one function u,, which
one can normalize by the conditio cr'" dx :
" l! 1 .
f(x) : \c*t*(x)
where
J" cf'a*
: Z'l
obtains. This Iast equality, called the ,;;, equality, had already been
demonstrated purely formally by Marc-Antoine Parseval ( ?-1836) in I 799 21
20. Jour. de Math., l, 1836, 253-65; 2, 1837, 16-35 and 418-36.
21. Min. des saa. ihangers, (2), l, 1805, 639-48.
EXISTENCE THEOREMS 717
for the set of trigonometric functions. From it there follows the inequality
demonstrated by Bessel in 1B2822 also for trigonometric series, namely,
o-.
Z,z = J)vatr
n=1
Actually the Sturm-Liouville results were not satisfactorily established
in all respects. The proofthat;t(.r) can be represented as an infinite sum oflthe
eigenfunctions was inadequate. One difficulty was the matter of the com-
pleteness ofthe set of eigenfunctions, which for a continuous../(x) on (4,6) is
the condition (e) above and which loosely means that the set ofeigenfunctions
is large enough to represent " any" f (x). Also the question of the sense in
which the series ! cnzn(r) converges to;f(x), whether pointwise, uniformly,
or in some more general sense, was not covered though Liouville did give
convergence proofs in some cases, using theory developed by Cauchy and
Dirichlet.
4. Existence Theorems
We have already noted under this same topic in the chapter on nineteenth-
century partial differential equations that as mathematicians found the
problem of obtaining solutions for specific differential equations more and
more difficult they turned to the question, Given a difierential equation does
it have a solution for given initial conditions and boundary conditions ? The
same movement, of course to be expected, occurred in ordinary differential
equations. That the question of existence was neglected so long is partly due
to the fact that differential equations arose in physical and geometrical
problems, and it was intuitively clear that these equations had solutions.
Cauchy was the first to consider the question ofthe existence ofsolutions
of differential equations and succeeded in giving two methods. The first,
applicable to
(6) !' : f(x,!),
was created sometime between 1820 and 1830 and summarized in his
E xercic e s d' analg sc.23
This method, the essence of which may be lound in Euler,za utilizes the
same idea as is involved in the integral as a limit of a sum. Cauchy wished
to show that there is one and only oney :;f(.r) that satisfies (6) and which
meets the given initial condition that yn : f@). He divided (xo, x) into z
parts Axo, A*rr..., A,tn-, and formed
!t+t : !t * f(xsY) Mu
22. Astronom. Nach,, 6, 1828, 33348.
23. Vol. l, 1840,327 ff. : CEuures, (2), 11,399-465.
24. Inst. Cal. Int., 1,1768,493.
7t8 oRDTNARv DTFFERENTTAL EeuATIoNs I8oa-Igoo
t,t:lo+)f@,,v,)Lxu
l-O
Now Cauchy shows that as z becomes infinite yo converges to a unique
function
! -- lo * li"ta,a *
and that this function satisfies (6) and the initial conditions.
Cauchy assumed thatf (x, g) and fu are continuous for all real values of
x and, g in the rectangle determined by the intervals (xo, x) and (ys,y). ln
1876 Rudolph Lipschitz (1832-1903) weakened the hypotheses of the
theorem.sE His essential condition was that for all (r, yr) and (x, 92) in
the rectangle l* - *ol < o, lS - lol 3 b, that is, for any two points with
the same abscissa, there is a constant I( such that
F(*,s):2#**n,
which is the expansion of
(9) r'
F(x,u\ : -,---4=-.
(t - xla)(l - slb)
One shows next that the series solution of
dY
(10)
a: F(x, y)
namely,
-2 -3
(l l) Y:Y'sx+Ylh.+rifr+...,
which is derived from (10) in the same way that (B) is derived from (7),
dominates term for term the series (8). Hence if ( I I ) converges then (B) does.
To show that (ll) converges one solves (10) explicitly using the value ofF
in (9) and shows that the series expansion of the solution, which must be
(l l), converges.
The method does not in itselfdetermine the precise radius ofconvergence
of the series for y. Numerous efforts were therelore devoted to showing that
the radius can be extended. However, the papers do not give the full domain
of convergence and are of little practical importance.
A third method of establishing the existence of solutions of ordinary
differential equations, probably known to Cauchy, was first published by
Liouville 28 for a second order equation. This is the method of successive
28. Jour. de Math., (l), 3, 1838, 561-6t4.
72O ORDTNARY DTFFERENTTAL EqUATTONS rBOO-r9OO
!' : -f(x,!),
wherein;f(x, y) is analytic in * and y and whose solution g : -f(r) is to pass
through (*o,yi, the method is to introduce the sequence of functions
yflx) : uo + at
["of{t,to)
ca@) : vo + dt
!*"lU,t,UD
These power series are unique. They give a regular solution of the system
and take on the initial values. In this generality the result can be found in
Cauchy's " M6moire sur I'emploi du nouveau calcul, appeld calcul des
31 Thus
limites, dans I'int6gration d'un systdme d'dquations diff6rentielles."
the idea was to content oneself with establishing the existence ofand obtaining
the solution in the neighborhood of one point in the complex plane.
Weierstrass obtained the same result in the same year (1842) but did not
publish it until his Werke came out in 1894.32
(14) v@:>)to'(o)(.-o)'
To Cauchy's result Fuchs added that the series is absolutely and uniformly
convergent within any circle having d as a center and in which the pr(z) are
analytic. It follows that the solutions can possess singularities only where the
coefficients are singular.
The study of solutions in the neighborhood of singular points was
initiated by Briot and Bouquet.3a Since their results for first order linear
equations were soon generalized we shall consider the more general treat-
ments-
To get at the behavior of solutions in the neighborhood of singular
points Riemann proposed an unusual approach. Though the pr(z) in (l 3) are
assumed to be single-valued functions analytic except at isolated singular
points, the solutions y,(z), analytic except possibly at the singular points, are
not in general single-valued over the entire domain of z-values. Let us
suppose that we have a fundamental set of solutions, ylz), i:1,2,...,n,
that is, z independent solutions of the kind specified in the above theorem.
Then the general solution is
!'o:c*At*"'*cmln'
That is, the yy .. ., yn undergo a certain linear transformation when each is
carried around a closed path enclosing a singular point. Such a transforma-
tion arises for any closed path around each of the singular points or com-
bination of singular points, The set of transformations forms a group,3s
which is called the monodromy group of the differential equation, a term
introduced by Hermite.ss
Riemann's approach to obtaining the character of solutions in the
neighborhood ofsingular points appeared in his paper of lB57 " Beitrdge zur
Theorie der durch die Gauss'sche Reihe F(c, p,7, r) darstellbaren Func-
tionen." 3? The hypergeometric differential equation, as Gauss knew, has
three singular points, 0, l, and co. Now Riemann showed that for complex.r,
to obtain conclusions about the behavior of the particular solutions around
singular points of the second order equation, one does not need to know the
differential equation itself but rather how two independent solutions behave
as the independent variable traces closed paths around the three singular
points. That is, we must know the transformations
35. By Riemann's time the algebraic notion ofa group wris known. It will be introduced
in this book in Chap. 31. However, all that one needs to know here is that the application
of two successive transformations is a transformation ofthe set and that the inverse ofeach
transformation belongs to the set.
36. Conp. Rend.,32, 1851, 458-61 = CEuares, l, 27G-80.
37. Werke, 67-83.
38. Werke, 379-90.
724 ORDTNARY DIFFERENTTAL EQUATTONS ISOO-I9OO
where s is some number (which can be further specified) and the {, are
single-valued lunctions in the neighborhood ofz : a which may have poles
39. Jour, far Math.,66, 1866, l2l-60; 68, 1868, 35il-85.
THE THEORY OF SINGULARITIES 725
of finite order. His answer was that a necessary and sufficient condition is
that p,(z) : (z - a)-'P(z), where P(z) is analytic at and in the neighbor-
hood of z : a. Thus Pr(z) has a pole of order one and so on. Such a point
a is called a regular singular point (Fuchs called it a point of determinateness).
Fuchs also studied a more specialized class ofequations of the form (13).
A homogeneous linear equation of this type is said to be of Fuchsian tgpe
when it has at worst regular singular points in the extended complex plane
(including the point at o). In this case the pr(z) must be rational functions
of z. For example, the hypergeometric equation has regular singular points
at z : 0, l, and co.
But the study of integrals of differential equations in the neighborhood
ol given point does not necessarily furnish the integrals themselves. The
a
study was taken as the point of departure lor the investigation of the full
integrals. Since the great researches olFuchs, mathematicians have succeeded
in extending the variety oflinear ordinary differential equations that can be
integrated explicitly. Previously only the nth-order linear equations with
constant coefficients and Legendre's equation
:
"* *A + zn-rPlz)# . "'+ zPn-,(z)4* + P,(z)w o,
in which tl,e Pt(z) are analytic at and around z : O. ln this case one can
obtain the z fundamental s6lutions in the form of series about z : 0 and
show that the series converge for some range of z-values. The series are of
the lorm
- : ] ,,.0 *u
a'
and the p and the c, are determinable for each solution. The result is due to
Georg Frobenius (l849-191 7).ao
40. Jour.far Math.,76, 187+,214-35 = Ges. Abh., l, 8'[-105.
726 oRDTNARY DTFFERENTTAL EeuATroNs rEoo-rgoo
The Riemann problem was also taken up during the latter part of the
nineteenth century, but unsuccessfully until Hilbert in I90541 and Oliver D.
Kellogg (1878-1932),42 with the help of the theory ol integral equations,
which was developed in the meantime, gave the first complete solution. They
showed that the generating transformation of the monodromy group can be
prescribed arbitrarily.
6. Automorphic Functions
The theory of solutions of linear differential equations was tackled next by
Poincar6 and Felix Klein. The subject they introduced is called automorphic
functions, which, though important for various other applications, play a
major role in the theory of differential equations.
Henri Poincard (1854-1912) was a professor at the Sorbonne. His
publications, almost as numerous as Euler's and Cauchy's, cover a wide
range of mathematics and mathematical physics. His physical researches,
which we shall not have occasion to discuss, included capillary attraction,
elasticity, potential theory, hydrodynamics, the propagation of heat,
electriciry, optics, electromagnetic theory, relativity, and above all, celestial
mechanics. Poincard had penetrating insight, and in every problem he
studied he brought out its essential character. He focused sharply on a
problem and examined it minutely. He also believed in a qualitative study
of all aspects of a problem.
Automorphic functions are generalizations of the circular, hyperbolic,
elliptic, and other functions of elementary analysis. The function sin z is
unchanged in value if z is replaced by z + 2m, where m is any integer. One
can also say that the function is unaltered in value if z is subjected to any
transformation of the group z' : z * 2mtr. The hyperbolic lunction sinh z
is unchanged in value if z be subjected to any transformation of the group
z' : z * 2tmi. An elliptic function remains invariant in value under
transformations of the group z' : z + m@ + m'a' where <rr and <a' are the
periods of the function. All ofthese groups are discontinuous (a term intro-
duced by Poincar6); that is, all the transforms of any point under the
transformations of the group are finite in number in any closed bounded
domain.
The term automorphic function is now used to cover functions that are
invariant under the group of transformations
(16)
, az+b
cz+d
41, Proc. Third Internat. Math. Cong., 1905, 233-40; and Nachri.chhn Kdnig. Gu. der Wiss. zu
GAU., 1905,307-88. Atso in D. Hilbert, Grundzilge einer allgemeinen Theorie der liwaren
Integralglci.chungen, t 9 I 2, Chelsea (reprint), I 953, 8 l-l 08.
42. Math. Ann.,60, 1905, 42.+-33.
AtrroMoRPHrc FUNCTIONS 727
(17)
#, * 0,#, * o,r :0,
where p, and p, were at first rational functions of z. A special case is the
hypergeometric equation
d2q.-y-(a*F*l)drl oF _:n
(18)
dz2' z(l-z) -
dz' z(z-l)''
with the three singular points 0, l, and oo.
Riemann, in his lectures of l85B-59 on the hypergeometric series and
in a posthumous paper of 1867 on minimal surfaces, and Schwarz'8
independently established the following. Let 1, and 12 be any two particular
solutions ofthe equation (17). All solutions can then be expressed as
q: mt11 * nt1s.
When z traverses a closed path around a singular point, r11 and r2, go over
into
f1 : aq * b12, \Z: ab * dns
and by letting z traverse closed paths around all the singular points one
obtains a whole group of such linear transformations, which is the monodromy
group of the differential equation.
Now let ((z) : nrllr. The quotient ( as z traverses a closed path is
transformed to
al+b
(le) ,- _ rt+d
From (17) we find that ( satisfies the differential equation
(20)
tr -;ff) :2Pz-|n-n.
43. Jour. fir Math., 75, 1873, 292-335 = Ces. Abh.,2, 2ll-59.
728 oRDTNARY DTFFERENTIAL EQUATIONS IBOO-I9OO
Figure 29. I
If we take for the p1 and pzir, ( 17) the particular lunctions in (18) we obtain
44. Acta Math' l, 1882' l-62 and 191-294 : CEw)res,2, 108-68, 169-257.
AUTOMORPHIC FUNCTIONS 729
az + b
(22) cz a'
where a, b, c, and, d are rea'! and ad - 6c : 1. These transformations, which
leave the circle and its interior invariant, form a group called the Fuchsian
group. Schwarz's function {(() constitutes the simplest example of a Fuchsian
iunction. Thus Poincard demonstrated the existence ol a class of auto-
morphic functions more general than the elliptic modular functions'as
Poincar6's construction of automorphic functions (in the second paper
of 1BB2) was based on his theta series. Let the transformations ofthe group
(22) be
, arz+ b1
(23) - crztdi
Let 27, 22,. . . be the transforms of z under the various transformations of
the group. Let H(z) be a rational function (aside from other minor con-
ditions). Then Poincar6's theta series is the function
One can show that O(rt) : Qp + d,)2^0(z). Now let 0r(z) and 0r(z) be
two theta series with the same m. These series are not only uniform functions
but entire. Then
(25) F(z) :
is an automorphic function ofthe group (23). Poincard called the series (24)
a theta-fuchsian series or a theta-kleinian series according as the grouP to
which it belongs is Fuchsian or Kleinian (the latter will be described in a
moment).
45. In this work on Fuchsian groups Poincar6 used non-Euclidean Seometry (Chap' 36)
and showed that the study of Fuchsian groups reduces to that of the translation group of
Lobatchevskian geometry.
73O ORDTNARY DTFFERENTTAL EqUATTONS rBOO-rgOO
Fuchsian functions are of two kinds, one existing in the entire plane, the
other existing only in the interior of the fundamental circle. The inverse
function ofa Fuchsian function is, as we saw above, the ratio of two integrals
of a second order linear differential equation with algebraic coefficients.
Such an equation, which Poincar6 called a Fuchsian equation, can be
integrated by means of Fuchsian functions.
Then Poincar6a6 extended the group of transformations (22) to com-
plex coefficients and considered several types of such groups, which he
named Kleinian. We must be content here to note that a group is Kleinian
if, essentially, it is not finite and not Fuchsian but, of course, is of the form
(22) and, discontinuous in any part of the complex plane. For these Kleinian
groups Poincard obtained new automorphic functions, that is, functions
invariant under the Kleinian groups, which he called Kleinian functions.
These functions have properties analogous to the Fuchsian ones; however,
the fundamental region for the new functions is more complicated than a
circle. Incidentally, Klein had considered Fuchsian functions while Lazarus
Fuchs had not. Klein therefore protested to Poincar6. Poincar€ responded by
naming the next class of automorphic functions that he discovered Kleinian
because, as someone wryly observed, they had never been considered by
Klein.
Then Poincar6 showed how to express the integrals of zth-order linear
equations with algebraic coefficients having only regular singular points with
the aid of the Kleinian functions. Thus this entire class of linear differential
equations is solved by the use of these new transcendental functions of
Poincar6.
three-body problem were found until George William Hill (lB3B-1914)' the
first great American mathematician, did his work on lunar theory. In 1877
Hill published privately a remarkably original paper on the motion of the
moon's perigee.a? He also published a very important paper on the motion
of the moon in tine American Journal oJ Matfumatics.a8 His work founded the
mathematical theory ol homogeneous linear differential equations with
periodic coemcients.
Hill's first fundamental idea (in his lB77 paper) was to determine a
periodic solution of the differential equations for the motion of the moon
that approximated the actual observed motion' He then wrote equations for
variations from this periodic solution that led him to a fourth order system
ol linear differential equations with periodic coefficients. Knowing some
integrals, he was able to reduce his fourth order system to a single linear
differential equation of the second order
(26)
ff * ,p1*: o,
with d(l) periodic of period z'and even. The form of Hill's equation can be
put, by expanding d(t) in a Fourier series, as
(27)
# * rro" * 2q1 cos2t + 2qzcos 4, + " ') : 0.
(28)
ff * -)uE" :0.
He then let
,= -,,
where ;r, and D; were to be determined. By substituting this value of x in
(28) and setting the coefficients of each power of ( equal to 0 he obtained
the doubly infinite system of linear equations
47. This was reprinted in Acta Malh.,8, 1886, l-36 : Coll. Malh' Wo*s, 1,243-70'
48. Amer. Jour. i1f Math.,l, 1878, 5-26, 129-47,245-60 : Coll' Math' Work, l,2U-335'
732 oRDTNARY DIFFERENTTAL EqUATIONS rBOO-I9OO
where
Ljl:0,+2j),-qo.
Hill set the determinant of the coefficients of the unknown D, equal to 0.
He first determined the properties of the infinitely many solutions for pr and
gave explicit formulas for determining the /r. With these values olg. he then
solved the system of an infinite number of linear homogeneous equations in
the infinite number of b, for the ratio of the D, to 6o. Hill did show that the
second order differential equation has a periodic solution and that the
motion of the moon's perigee is periodic.
Hill's work was ridiculed until Poincar6ae proved the convergence of
the procedure and thereby put the theory of infinite determinants and
infinite systems of linear equations on its feet. Poincar6's attention to and
completion of Hill's efforts gave prominence to Hill and the subjects involved.
:
49. Bult. Soc. Math. de France, 13, 1885, l9-2?; 14, 1886, 77-90 CEuutes,5, 85-94,
95-t 07.
50. Jour. de Math., (3), 7, l8Sl, 375422;8, 1882, 251-96; (4), l, 1885, 167-2*4;2, 1886,
l5l-217 : CEuares, 1,3-84, 90-161, 167-221.
NONLINEAR: THE QUALITATTVE THEORY 733
29.2
Figure 29.2 Figure 29.3
(2e) *:#3,
where P and Q are analytic in x andy. This form was chosen partly because
some problems of planetary motion led to it and partly because it was the
simplest mathematical type with which to commence the kind of investiga-
tion Poincar6 had in mind. The solution of (29) is of the form;f(r, !) : O,
and this equation is said to define a system of trajectories. In place of
.f @,il : 0 one can consider the parametric form *
: x(t), y : y(t).
In the analysis of the kinds ofsolutions equation (29) can have, Poincar6
found that the singular points of the differential equation (the Points at
which P and Q both vanish) play a key role. These singular points are
undetermined or irregular in Fuchs's sense. Here Poincar6 took up earlier
work by Briot and Bouquet (sec. 5) but limited himself to real values and to
studying the behavior of the entire solution rather than just in the neigh-
borhood ofthe singular points. He distinguished four types ofsingular points
and described the behavior of solutions around such points.
The first type of singular point is the focus (foyer), the origin in Figure
29.2, and the solution spirals around and approaches the origin as runs
from -o to o. This type of solution is considered stable. The second kind
'
of singular point is the saddle point (col). It is the origin of Figure 29'3 and
the trajectories approach this point and then depart from it. The lines ll'
and BB' are the asymPtotes of the trajectories. The motion is unstable'
The third type of singular point, called a node (naud), is a point where an
infinity ofsolutions cross, and the fourth, called a center, is one around which
closed trajectories exist, one enclosing the other and all enclosing the center.
734 oRDTNARY DIFFERENTTAL EQUATTONS rBOO-r9OO
Among many results, Poincar€ found that there can be closed curves
which do not towh any of the curves satisfying the differential equation. He
called these closed curves cycles without contact. A curve satisfying the
diflerential equation cannot meet such a cycle in more than one point, and
so ifit crosses the cycle it cannot re-cross it. Such a curve, if it is an orbit ofa
plianet, represents unstable motion.
Beyond cycles without contact there are closed curves that Poincar6
cdled limit cycles. These are closed curves which satisfy the differential
equation and which other solutions approach asymptotically, that is,
without ever reaching the limit cycle. The approach can be from without
or within the limit cycle C (Fig. 29.4). For some differential equations of the
type (29) he determined the limit cycles and the regions in which they exist.
In the case of limit cycles the trajectories approach a periodic curve, and the
motion is again stable. If, however, the direction of the motions were away
from the limit cycle the motion outside would be unstable and the motion
inside would be a contracting spiral.
In the third of his papers on this subject, Poincar6 studied first order
equations of higher degree and of the form F(r,y,S') : 0, where F is a
polynomial in x, g, and, y'. To study these equations Poincar6 regarded x, y,
and g' as three Cartesian coordinates and considered the surf,ace defined by
the diferential equation. If this surface has genus 0 (form ofa sphere), then
the integral curves have the same properties as in the case of first degree
differential equations. For other genuses the results on integral curves can be
quite different. Thus for a torus many new circumstances arise. Poincard did
not complete this study. In the fourth paper (1886) he studied second order
equations and obtained some results analogous to those for first order
equations.
While continuing his work on the types of solutions of the differential
equation (29), Poincard considered a more general theory directed to the
NONLTNEAR: THE qUALITATM THEORY 735
(30)
*: *'r*""''xn'P)' i:1,2,...,n.
He developed the X, in powers of the small parameter p and, supposing that
the system had for p : 0 a known periodic solution
4: 6lt), i:1,2,...,n
of period 7, he proposed to find the periodic solution of the system vihich
for p : 0 reduces to the {,(l). The existence of periodic solutions for the
three-body problem had been discovered by Hill, and Poincar€ made use of
this fact.
The details of Poincard's work are too specialized to consider here' He
first generalized earlier work of Cauchy on solutions of systems of ordinary
differential equations wherein the latter had used his calcul dts limitts.
Poincard then demonstrated the existence of the periodic solutions he sought
and applied what he learned to the study of periodic solutions ofthe three-
body problem for the case where the masses of two of the bodies (but not
that of the sun) are small. Thus one obtains such solutions by supposing
that the two small masses move in concentric circles about the sun and are
in the same plane. One can obtain others by supposing that for /, : 0 the
orbits are ellipses and that their periods are commensurable. With these
solutions, and by using the theory he had developed for the system, he
obtained other periodic solutions. In sum he showed that there is an infinity
of initial positions and initial velocities such that the mutual distaaces of the
three bodies are periodic functions ofthe time. (Such solutions are also called
periodic.)
Poincard in this paper of 1890 drew many other conclusions about
periodic and almost periodic solutions of the system (30). Among them is the
very notable discovery for such a system of a new class ofsolutions previously
unknown. These he called asymptotic solutions. There are two kinds. In the
first the solution approaches the periodic solution asymptotically as I
approaches - @ or as I approaches + o. The second kind consists ofdoubly
asymptotic solutions, that is, solutions that approach a periodic solution as I
approaches -o and *o. There is an infinity of such doubly asymptotic
solutions. All of the results in this paper of 1890 and many others can be
found also in Poincari's Les Mithodes nouuelles de la muanique cileste.62
Poincard's work on the problem of stability ofthe solar system was only
partially successful. The stability is still an open question. As a matter of
51. Acta Math., 13, 1890, l-270 : (Euares, 7,262479.
52. Three volumes, 1892-99.
n6 oRDTNARY DIFFERENTTAL EQUATIONS r BOO-I9OO
Figure 29.5 1: +l
fact so is the question of whether the orbit of the moon is stable; most
scientists today believe it is not.
The stability of the solutions of (29) can be analyzed by means of what
is called the characteristic equation, namely,
(31)
- A P*(xo,!i I :
lQ*@o,ao) o.
I Q,@o,sd Pu@o,vs) - tl
where (xo, yo) is a singular point of (29). The stability in the neighborhood
of (*o,yo), according to a theorem of the distinguished Russian mathe-
matician Alexander Liapounofl (1857-1918) depends Lrpon the roots of this
characteristic equation.se The analysis of possible cases is detailed and
includes many more types than those given in the discussion above of
Poincari's work. The basic result, according to Liapounoff, whose work on
stability problems continued into the early years of this century, is that the
solutions are stable in the neighborhood of a singular point when and only
when the roots of the equation (31) in f have negative real parts.
The qualitative study ofnonlinear equations was advanced by Poincar6's
introduction of topological arguments (in the first of the four papers in the
Journal de Mathimatiqrcs). To describe the nature of a singular point he
introduced the notion of an index. Consider a singular point Po and a simple
closed curve C surrounding it. At each intersection ofC with the solutions of
(32)
dg:
dx
53. Aan. Fac. Sci. de Toulouse, (2), 9, 1907, 203-474; originally published in Russian in
1892.
NONLINEAR: THE QUALITATIVE THEORY ?37
returned to its original value). The quantity 1is the index of the curve. It
can be proved that the index ofa closed curve that contains several singular-
ities is the algebraic sum of their indices. The index of a closed trajectory is
* I and conversely.
The nature of the trajectories can be determined by the characteristic
equation, and so the index .I of a curve should be determinable by knowing
just the differential equation. One can prove that
BibliograplA
Acta Matlumatica, Vol. 38, 1921. This entire volume is devoted to articles on
Poincard's work by various leading mathematicians.
Bochcr, M. : " Randwertaufgaben bei gewiihnlichen Differentialgleichungen,"
Encgk. dd Math. Wiss., B. G. Teubner, 1899-1916, 1I, A7a' 43743.
.............._: *norrnd,ry Problems in One Dimension," Intemat. Cong. of Math., Proc.,
Cambridge, 1912, l, 163-95.
BurLhardt, H.: " Entwicklungen nach oscillirenden Functionen und Integration
der Differentialgleichungen der mathematischen Physik," Jahres, d.er Deut.
Math.-Vcrcin., 10, 1908, l-1804.
Cauchn A.L.; (Euarcs compbtzs, (l), Vols.4,7, and 10, Gauthier-Villars, 1884' 1892,
and 1897.
Craig,T.: "Some of the Developments in the Theory of Ordinary Differential
Equations Between 1878 and 1893," lY. f. Math. Soc. Bull-,2, 1893, ll9-34.
Fuchs, Lazarus : Gesammelte matlumali.sclu Werk, 3 vols., 190't-09, Georg Olms
(reprint), 1970.
Heine, Eduard : Handbtuh dcr Kugelfunktinrun, 2 vols., I878-B l, Physica Verlag
(reprint), 1961.
Hilb, E. : " Lineare Differentialgleichungen im komplexen Gebiet," Emyk, der
Math. Wiss., B. G. Teubner, 1899-1916, II' 85.
" Nichtlineare Differentialgleichungen," Enryk' dtr Math. Wi.ss., B. G.
Teubner, 1899-1916, II, 86.
Hill, George rN.: Colhcted Mathcmatical Works,4 vols., 1905, Johnson Reprint Corp.,
1965.
Klein, Felix: Vmlcsungen nbcr db Entwicklung dtr Mathematik im 19. Jahrhundert,
Vol. I, Chelsea (reprint), 1950.
Gcsammeltc matlumatisclu Abhandlungen, Jtlius Springer, 1923, Vol. 3.
Painlev6, P.: " Le Probldme moderne de l'intdgration des 6quations diff6rentielles,"
-z Thhd Intcrnat. Math. Cong. in Heidelberg, 1904, 86-99, B. G. Teubner, 1905.
" Gew6hnliche Differentialgleichungen, Existenz der Ltisungen," Ency,t.
det Math. Wiss., B. G. Teubner, 1899-1916, ll, A4a.
Poincard, Henri: (Euares, l,
-i 2, and 5, Gauthier-Villars, 1928, 1916, and 1960.
Riemann, Bernhard: Gesammelte mathematische Werlce,2nd ed., 1892, Dover (reprint),
1953.
Schlesinger, L. : " Bericht iiber die Entwickelung der Theorie der linearen Differen-
tialgleichungen seit 1865," Jahres. der Deut, Math.-Verein., 18, 1909, 133-266.
Wangerin, A.: " Theorie der Kugelfunktionen und der verwandten Funktionen,"
Encgk. der Malh. Wiss., B. G. Teubner, 1899 1916, II, Al0.
Wirtinger, W.: " Riemanns Vorlesungen [ber die hypergeometrische Reihe und
ihre Bedeutung ,- Thi,rd Internat. Math. Cong. in Hei.delberg, 1904, B. G.
Teubner, 1905, 12l-39.
3o
The Calculus of Variations in the
Nineteenth Century
l. Introduction
As we have seen, the calculus of variations was founded in the eighteenth
century chiefly by Euler and Lagrange. Beyond mathematical and physical
problems of various sorts there was one leading motivation for the study,
namely, the Principle of Least Action, which in the hands of Maupertuis,
Euler, and Lagrange became a leading principle of mathematical physics.
The nineteenth-century men contlnued the work on least action and the
greatest stimulus to the calculus of variations in the first half of the century
came from this direction. Physically, the interest was in the science of
mechanics and particularly in problems of astronomy.
739
740 THE cALcuLUs oF vARrATroNs r8oo-rgoo
keep the subject active, but no major new mathematical ideas of the calculus
of variations are to be noted in this area or in Gauss's famous contribution
to mechanics, The Principle of Least Constraint.2
The first new point worth noting is due to Poisson. Using Lagrange's
generalized coordinates, he followed up immediately two papers by
Lagrange and starts with3 Lagrange's equations (Chap. 24, sec. 5)
(2) '*,:#;
so that the equations of motion read
_d t-
(3)
il
taL
\a41 -ff,:o' i:t'2'"''n'
He also introduces
(4)
AL AT
: dA,:
?, uq,'
and so from (3) he has
universe, are now generally rejected. And the rejection appears just, for this,
among other reasons, that the quantity Pretended to be economized is in fact
often lavishly expended." Because in some phenomena of nature, even simPle
ones, the action is maximized, Hamilton preferred to speak of a principle of
stationary action.
In a series of papers of the period lB24 to lB32 Hamilton developed his
mathematical theory ofoptics and then carried over ideas he had introduced
there to mechanics. He wrote two basic papers.s It is the second of these which
is more pertinent. Here he introduces the action integral, namely, the time
integral of the difference between kinetic and potential energies
(6) t:
!::,',"'
(r v) dt. -
The quantity T - V is called the Lagrangian function though it was
introduced by Poisson; Pl standsfor q!, ql, . . ., qLn, and P 2for 91", 1LD, . . ., 4?'.
Now Hamilton generalizes the principle of Euler and Lagrange by allowing
comparison paths that are not restricted except that the motion along them
must start at PL at time 11 and end at P2 at time tr. Also, the law of con-
servation of energy need not hold whereas in the Euler-Lagrange principle
conservation of energy is presupposed, and as a consequence the time
required by an object to traverse any one of the comparison paths differs
from the time taken to traverse the actual path.
The Hamiltonian principle of least action asserts that the actual motion
is the one that makes the action stationary. For conservative systems' that is,
where the components of force are derivable from a potential that is a
function of position only, 7 + tr: const. Hence 7 - V :27 - const.
and so Hamilton's principle reduces to Lagrange's, but, as noted, Hamilton's
principle also holds for nonconservative systems. Also the potential e\ergy V
can be a function of time and even of the velocities; that is, in generalized
coordinates V : V(qt, . . ., Qo, 4t. . ., 8", t).
If, setting T - V equial to L,we write the action integral (6) as
(7) , :
L(q,,- . ., 8o,8t, - . ., En, t) itt,
/,:'
with the condition that all comparison 4,(t) must have the same given
values at t, and tr, then the problem is to determine the 41 as functions of,
from the condition that the true 4i make the integral stationary. The Euler
equations, which express the condition that the first variation of .S is 0,
become a system of simultaneous second order ordinary differential
equations, namely,
(B)
aL d tal
_l- :0, k:1,2,...,n,
-0q* il\aEk
-
5. Phil. Trans.,1834, Part II,247-308; 1835, Part 1,95-l+4 : Math. Palcrs,2' 103-2ll'
742 TrrE oALCULUS OF VARTATTONS rSOO-I9OO
and the equations are to be solved in tt < t < fr. These equations are still
called the Lagrangian equations of motion even though I is now a different
function. The choice of coordinate system is arbitrary and usually utilizes
generalized coordinates, This is an essential advantage of variational
principles.
Now introduce (see (a))
?, : ui,'
AL
(s) aL i,:?,
: aq,, i:1,2,...,n.
?, aqt
(l l) .aH
: -dl,' .aH
bt: --;-, 1:1,2,...rn,
9' o4t
equations. They are the variational equations (Euler equations) for the
integral
'
:[:; e v) * : 1," : [{2m, - H(tuo,.o}a,.
Qt : Q,Q" c" t)
(12)
Pt : P/?u q" t)
such that
t
f' ()w, - H(p, t,, o) at : o
goes over by the transformation (12) into
rf E P,e, - K(n,0,,o)a,: o,
P,: - N4;
(13) a,: ffi,
where.I((P1, Qr, t) is the new Hamiltonian. This path leads to
0r:0, P, :0,
or Q1 and Pi are constants. Moreover,
(14) a+ff:0,
and it can be shown that
aa
?,: Ai,'
Hence by (14), in view of the variables in fI,
o--oaoo,
't-
and that h : Ft, a constant' because P, : 0' Hence one solves the algebraic
equations
'aa:
AQ
-P" i: t'2'"''n
for the 4,. These solutions
Figure 30.1 A
emanating from I with constant velocity tl but with varying angles of fire are
extremals of the problem of minimizing or maximizing the action integral
nfB
i Jno
*'
The problem of minimizing the action between two points ,4 and B does in
general have two solutions, the parabola AA'B and the parabola ABA'.It
is also the case that the family of parabolas through A has an envelope that
touches the two parabolas at A' and, A'.The conjugate Point on AA'B is A"
and the one on ABA' is ,4'. According to Jacobi's condition, the extremal
AA'B could not furnish a maximum or a minimum but the extremal ABA'
could.
Jacobi reconsidered the second variation 6'./ (Chap. 24, sec. 4). If we
writc y * el (*) in place of Lagrange's y * 6y and ifa and D are the abscissas
of I and .B then
(18)
{t"-*r"Y-fiu",,,"):o
and where the partial derivatives are evaluated along an extremal joining the
two endpoints A and B. Now z(r) is required to pass through l. Then all
other points on the extremal y(r) through A and B at which rz(x) vanishes
are the conjugate points of z4 on that extremal. lf u : F#r I B2u, is the
general solution of the accessory equation (lB), then one can show that
u(x) u(a)
;M:;M'
MATIIEMATICAL EXTENSIONS 747
where d is the abscissa of the point u4, is the equation for the abscissas of
all points conjugate to l.
Jacobi also showed that one need not solve the accessory equation.
Since one must solve the Euler equation in any case, lety : y(x, c1, ,2) be the
general solution of that equation, that is, the flamily of extremals. Then zr can
be taken tobe 1ylAcy and z2 to be 0gl0c2.
From his work on conjugate points Jacobi drew two conclusions' The
first was that ilalong the extremal from I to B a conjugate point to 14 occurs
then a maximum or a minimum is impossible. In this conclusion Jacobi was
essentially correct.
On the basis of his considerations of conjugate points Jacobi also con-
cluded that an extremal (a solution of Euler's equation) taken between I and
.B for which;f,,r, > 0 along the curve and for which no conjugate point
exists between A and. B (or at B) furnishes a minimum for the original
integral. The corresponding statement with fr'r, < 0, he asserted' holds for
a maximum. Actually, these sufficient conditions were not cotrect, as we
shall see in a few moments. In this lB37 paper Jacobi stated results and gave
brief indications of proofs. The full proofs of the correct statements were
supplied by later workers.
Aside from the specific value ofJacobi's results for the existence of a
maximizing or minimizing function, his work made clear that Progress in the
calculus of variations could not be guided by the theory of maxima and
minima of the ordinary calculus.
For thirty-five years both ofJacobi's conclusions were accepted as cor-
rect. During this period the paPers on the subject were imprecise in statement
and dubious in proof; problems were not sharply ficrmulated and all sors of
errors were made. Then Weierstrass undertook work on the calculus of
variations. He presented his material in his lectures at Berlin in 1872 but did
not publish it himself. His ideas aroused a new interest, stimulated further
activiry in the subject, and sharpened the thinking, as did WeiersEass's
work in other domains.
Weierstrass's first point was that the criteria for a minimum or a
maximum hitherto established-Euler's, Legendre's, and Jacobi's-were
limited because the supposed minimizing or maximizing curve y(x) was
compared with other curves y(r) t et(x), wherein it was actually supposed
that al(r) and et'(z), or what Lagrange called 8y and 6y', were both small
along the ,-range from ,4 to -8. That is, y(*) was being compared with a
limited class ofother curves, and by satisfying the three criteria it did better
than any other one of these comparison curyes. Such variations et(x) were
called by Adolf Kneser (1862-1930) weak variations. However, to find the
curve that really maximizes or minimizes the integral J one must compare it
widn all other curves joining ,4 and B, including those whose derivatives may
not approach the derivatives of the maximizing (or minimizing) curve as
748 THE CALCULUS OF VARTATIONS rBOO-I9OO
and his result was: The fourth necessary condition that y(x) furnish a
minimum is that E (x, y, y', i) > O along the extremal y (x) for every finite
value off. For a maximum.E < 0.
weierstrass then (1879) turned his attention to sufficient conditions for
a maximum (or a minimum) when strong variations are permitted' To
formulate his sufficient conditions it is necessary to introduce Weierstrass's
concept of a field. Consider any one-parameter flamily (Fig' 30'3) of ex-
tremais y : @(x, y) in which the particular extremal joining ,4 and B is
includeJ, say for 7:7o. Aside from some details on the continuity and
differentiability of O(r, y), the essential fact about this family of extremals
is that in u ..gion about the extremal through .'4 and B there Passes through
any point 1*,g7 of the region one and only one extremal of the family' A
famiiy of extremals satislying this essential condition is called a field'
Given a field surrounding the extremal C6 joining A and' B (Fig' 30'a)
then if at every point (x,y) lying between x : a arrd x : b and in the region
covered by the field, E(x,y,p(x,y),fi) > 0, where p(r,y) denotes the slope
at (x, g) oithe extremal passing through (x, y) and rt is any finite value, then
C6 minimizes the integral J with respect to any other C lying within the field
and joining A ard B. (For a maximum, .E < 0.)
RELATED PROBLEMS 749
Figure 30.4 A
t : !*'lt*,u,a) d*.
Some mention has been made of other problems, the isoperimetric problems,
the problems of several functions ofone variable such as arise in the Principle
of Least Action, and the case of multiple integrals, which Lagrange first
treated and which occurs in the minimal surface problem (Chap. 24,
sec. 4). There are numerous types ofrelated problems, such as those in which
the minimizing or maximizing curve is treated in a parameteric representa-
tion * : x(i) and y : y(r)-this problem v/as thoroughly discussed by
Weierstrass-and problems largely in dynamics in which the variables that
appear in the integrand are restricted by auxiliary or subsidiary equations'
Bibliographg
Dresden, Arnold: "Some Recent Work in the Calculus of Variations," Amet Math.
Soc. Bull., 32, 1926,475-521.
Duren, W. L., Jr.: "The Development of Sufficient Conditions in the Calculus of
Variations," Unioctsity oJ Chicago Conlibutions to tlu Calculus of Varialions, l,
1930,245-349, University of Chicago Press, 1931.
Hamilton, W. R.: Tire Matlwmatical Paperc,3 vols., Cambridge University Press,
l93l, 19,10, and 1967.
Jacobi, C. G. J.: G. Reimer, 1886 and 1891, Chelsea (reprint),
Gesammclte We*.c,
1968, Vols. 4 and 7.
Vorlesurgen ilber Dynamik (1866), Chelsea (reprint), 1968. AIso in Vol. B of
Jacobi's Gesamrueltc Werke,
McShane, E.J.: " Recent Developments in the Calculus of Variatioru,," Amer. Math.
Soc, Semicenlennial Publirations, II, 1938, 69-97.
BIBLIOGRAPHY 75I
Porter, Thomas Isaac: "A History of the Classical Isoperimetric Problem,"
tlni.aersily oJ Chicago Contributions to tlu Calculus of Variatinns, ll, 475-517,
University of Chicago Press, 1933.
Prange, Georg: "Die allgemeinen Integrationsmethoden der analytischen
Mechanik," Encgk. derMath. Wiss.,B. G. Teubner, 1904-35, IV, 2, 509J04'
Todhunter, lsaac: A History of thz Calculus oJ Variations in the Ni'ruteenth Ccntury,
Chelsea (reprint), 1962.
Weierstrass, Karl: Werlce, Akademische Verlagsgesellsch aft, 1927, Yot' 7'
3r
Galois Theory
l. Introduction
The basic problem of algebra, the solution of polynomial equations, con-
tinued to occupy the center ofthe stage in the algebra ofthe early nineteenth
century. In this period, the broad question of which equations are solvable
by algebraic operations was definitely and comprehensively answered by
Galois. Moreover, he not only created the first significant coherent body of
algebraic theory but he introduced new notions which were to be developed
into still other broadly applicable theories of algebra. In particular the
concepts of a group and a field emerged from his work and Abel's.
2. Binornial Equations
We have already discussed (Chap. 25, sec. 2) the fruitless efforts of Euler,
Vandermonde, Lagrange, and Ruffini to solve algebraically equations of
degree greater than 4 and the binomial equation x" - I :0. A major
success w,rs achieved by Gauss. In the last section of his Disquisitiones
Aithmeticatr Gauss considered the equation
(l) xe - |: o,
where p is a prime.2 This equation is often called the cyclotomic equation or
the equation for the division of a circle. The latter term refers to the fact
that the roots of this equation are, by de Moivre's theorem,
(2) #r:cosY*i"ink2!0,
'pp k:1,2,...,!,
l. 1801, We*e, l.
2. The case takes care of *" - I : 0 for if z : pq, let ! : xq. But/, - I : 0
ofp prime
f,q : const. can be solved ifq is a prime and ifnot g can be decomposed
is solvable. Hence
in the same manner that z is.
752
BINOMIAL EQUATIONS 753
and the complex numbers ,, when plotted geometrically are the vertices of
a p-sided regular polygon that lie on the unit circle.
Gauss showed that the roots of this equation may be rationally exPressed
in terms of the roots of a sequence of equations
(3) Zr:0,22:o,...,
whose coefficients are rational in the roots of the preceding equations of the
sequence. The degrees ol the equations (3) are precisely the prime factors
p - t. There is a Zrlor each factor even if repeated. Moreover, each of the
"t
Z, 0 canbe solved by radicals and so equation (1) can also be so solved'
--
This result is of course of great significance for the problem of solving
the general nth-degree equation algebraically. It shows that some equations
ofhigh degree can be solved by radicals, for example a fifth degree equation
if5 is a factor ofp - | or a seventh degree equation if 7 is a factor'
The result is of major importance also fior the geometric problem of
constructing regular polygons ofp sides. lf p - | contains no factors other
than 2, then the polygon is constructible with straightedge and compass
because the degrees of the equations (3) are each 2 and each of its roots is
constructible in terms of its coefficients. Thus we are able to construct all
polygons of a prime number of sides I if p - | is a power of 2. Such primes
are 3, 5, 17, 257,65537, . . .. Alternatively, a regular polygon can be con-
structed if p is a prime ol the form 22' + I .3 Gauss remarks (Art' 365) that
although the geometric construction of regular polygons of 3, 5, and 15
sides and those immediately derivable from them-for example, 2, 2'3,
2n.5,2.15, wherein z is a positive integer-was known in Euclid's time, in
an interval of 2000 years no new constructible polygons had been discovered
and geometers had been unanimous in declaring that no others could be
constructed.
Gauss thought that his result might lead to all sorts of attempts to find
new constructibL polygons of a prime number of sides. He warns then: "As
often as p - I contains other prime {lactors besides 2, we arrive at higher
equations, namely, to one or more cubic equations if 3 enters once or oftener
as a factor ofp - l, to equations of the filth degree ifp - I is divisible by 5,
etc. And *. pro,r. with all rigor that these higher equations cannot be
"u,
avoided or madelo depend upon equations of lower degree; and although
the limits of this work do not permit us to give the demonstration here, we
still thought it necessary to note this fact in order that one should not seek to
construciother polygons [of prime number of sides] than those given by our
theory, as for example, polygons of 7 , ll, 13, l9 sides, and so employ his time
in vain."
Gauss then considers (Aat. 366) polygons of any number of sides z and
asserts that a regular polygon of z sides is constructible if and only if
3. Inaprimeof the form 2r * l, p is necessarily ofthe form 2"rbwt22b + l is not nec€s-
sarily prime.
754 GALOTS THEORY
n : 2br'?e ' ' 'po, where h,92, . . ., pa zre distinct primes of the form
22^ + | and where I is any positive integer or 0. The sufficiency of this
condition does follow readily from Gauss's work on polygons of a prime
number of sides but the necessity is not at all obvious and was not proven by
Gauss.a
The construction of regular polygons had interested Gauss since 1796
when he conceived the first proof that the l7-sided polygon is constructible.
There is a story abut this discovery which may bear repeating. This con-
struction problem was already famous. One day Gauss approached his
professor A. G. Kiistner at the IJniversity of Goitingen with the proof that
this polygon is constructible. Kastner was incredulous and sought to dismiss
Gauss, much as university teachers today dismiss angle-trisectors. Rather
than take the time to examine Gauss's proof and find the supposed error in it,
Kiistner told Gauss the construction was unimportant because practical
constructions were known. Of course Kiistner knew that the existence of
practical or approximate constructions was irrelevant for the theoretical
problem. To interest Kiistner in his proof Gauss pointed out that he had solved
a seventeenth degree algebraic equation. K2istner replied that the solution
was impossible. But Gauss rejoined that he had reduced the problem to
solving an equation of lower degree. "Oh well," scoffed Kiistner, "I have
already done this." Later Gauss repaid Kiistner, who also prided himself on
his poetry, by lauding K?istner as the best poet among mathematicians and
the best mathematician among poets.
The other subjects he neglected. Galois sought to enter the Ecole Poly-
technique but possibly because he failed to explain in sufficient detail the
questions he had to answer orally at the entrance examination or because the
examining professors did not understand him he was rejected in two tries.
He therefore entered the Ecole Priparatoire (the name then for the Ecole
Normale and a much inferior school at that time.) During the lB30 Revolu-
tion, which drove Charles X from the throne and installed Louis Philippe,
Galois publicly criticized the director of his school for failing to support
the Revolution and was expelled. He was twice arrested for political offenses,
spent most of the last year and a half of his life in prison, and was killed in a
duel on May 31, 1832.
During his first year at the Ecole Galois published four papers. In 1829
he submitted two papers on the solution of equations to the Academy of
Sciences. These were entrusted to Cauchy, who lost them. In January of
lB30 he presented to the Academy of Sciences another carefully written
paper on his research. This was sent to Fourier, who died soon after, and
this paper too was lost. At the suggestion of Poisson Galois wrote (l83l) a
new paper on his research. This article, "Sur les conditions de r6solubilit6
des 6quations par radicaux,"s the only finished article on his theory of
the solution of equations, was returned by Poisson as unintelligible, with the
recommendation that a fuller exposition be written. The night before his
death Galois drew up a hastily written account of his researches which he
entrusted to his friend, August Chevalier. This account has been preserved.
In 1846 Liouville edited and published in the Journal dt Mathimatiquess
part of Galois's papers including a revision olthe I83l paper. Then Serret's
Cours d'alglbrc superiture (3rd ed.) of l866 gave an exposition of Galois's ideas.
The first full and clear presentation of Galois theory was given in 1870 by
Camille Jordan in his book Traiti des substitutions et des iquations algibriques.
Galois approached the problem of characterizing equations solvable by
radicals by improving on Lagrange's ideas, though he also derived some
suggestions from Legendre's, Gauss's, and Abel's work. He proposed to
consider the general equation, which is, of course,
(4) xo * arxo-r +...+ ao-p * an: O
(:" :^ :: :)
To perform the first substitution and then the second one is equivalent to
performing the third substitution
f: :::, :)
because, for example, by the first substitution *, is replaced by xs; by the
second substitution *, is replaced by ra; and by the third substitution *,
goes directly into ra. One says that the product of the first two substitutions
taken in the orderjust indicated is the third substitution. There are in all 4!
possible substitutions, The set ofsubstitutions is said to form a group because
the product of any two substitutions is a member of the set. This notion,
which is not of course a formal definition of an abstract group, is due to
Galois.
To secure some grasp of Galois's ideas let us consider the equationlo
x4+Pxz*q:o,
where p and q are independent. Let R be the field formed by rational
expressions inp and 4 and with coefficients in the field of rational numbers, a
typical expression being (3p' - 4q)lk'- 7p). One says with Galois that iR
is the field obtained by adjoining the letters or indeterminates p and ? to
the rational numbers. This field rR is the field or domain of rationaliry of the
coefficients of the given equation eind the equation is said to belong to the
field rt. Like Abel, Galois did not use the terms field or domain of rationality
but he did use the concept.
10. Since Galois's own presentation of his ideas was not clear and he introduced so many
new notions, we shall utilize an example due to Verriest (see the bibliography at thc end
of the chapter) to help make Galois's theory clear,
758 GALors THEoRY
We happen to know that the roots of the fourth degree equation are
*t: ^2- -
l-P-{P'-4q
,s:{-------T-,
Then it is true that the two relatiors with coefficients in .R
*z Es tCt *s
E: E_
(;
f:tz xs :) xt xB :,),
xz ts *z *g
Ec: (;: Eg: (;,
tc rt ;) x1 *4 ;)
*z *g *z *g
Et: Ea:
[: It )Ct :), [: xs *t :),
*z *s Xz lts Xl\
Ea: Ez:
f: *t *z ;) f: *a nz -J
leave the two relations true in X.lr One could show that these eight are the
only substitutions of the twenty-four which leave invariant a/l relations in R
among the roots. These eight are the group of the equation in .R. They are a
subgroup ofthe full group. That is, the group ofan equation with respect to
a field .R is the group or subgroup of substitutions on the roots which leave
invariant a/J the relations with coefficients in -R among the roots of the given
equation (whether general or particular). One can say that the number of
substitutions that leave all relations in B invariant is a measure of our
ll. For the general equation of degree z, that is, with z independent quantities as co-
efficicnts, a function of the roots is inuaiant under or unaltered by a substitution on the
roots if and only ifit remains identical with the original function. If the coefficients are all
numcrical then a function is unaltered if it remains numerically the same. Thus for
ro +.r2 + x * I :0 the roots are ,r1 : -1,x2: r, and x3: -i. Consider x!. The
substitution xo for x2 givcs x!. This has the same numerical value as x!. Then r! is not
altered by the substitution, If the coefficients contain somc numerical values and some
independent quantities then a function of the roots remains unaltered by a substitution on
the roots if the function remains numerically the same for all values of the independent
quantities (in thc domain to which these may be limited) and the numcrical values which
the roots may zulsume.
GALOIS'S THEORY OF SOLVABILITY 7s9
ignorance of the roots because we can not distinguish them under these
eight substitutions.
Now consider x? - xZ which equals l-p- 4q. We adjoin this radical
to .R, and form the field rR', that is, we form the smallest field containing .R
and,lf--il.16r.,
(6) x? -xZ: '{F=Zo
is a relation in.R', Since xr + x2 : 0 and xs * xa : 0 we also have
x? : xZ and' xl : fi.
Then in view ofthese last two facts we can say that the first four ofthe above
eight substitutions leave the relation (6) in n' true, but the last four do not.
Then the four substitutions, if they leave eaery true relation in R' among the
roots invariant, are the group ofthe equation in rR'. The four are a subgroup
ofthe eight,
Now suppose we adjoin to R' the quantity 1/(-j@ w5s1s
D : \/FA and form the field.R". Then
Xg-X4:l
is a relation inft'. This relation remains invariant only under the first two
substitutions E and. El but not under the rest of the eight. Then the group of
the equation in ft'consists ofthese two substitutions, provided every relation
in rt" among the roots remains invariant under these two substitutions. The
two are a subgroup of the four substitutions.
If we now adjoin to.R" the quantity lT-iTDll we get it'. In.tB'we
have
X1 -fi2:l -P+t)
2
Now the only substitution leaving all relations in l?'true is just.E and this is
the group of the equation in R-.
We may see from the above discussion that the group of an equation is a
key to its solvability because the group expresses the degree of indistinguish-
ability of the roots. It tells us what we do not know about the roots.
There were many groups, or strictly a group ofsubstitutions and succes-
sive subgroups, involved above. Now the order of a group (or subgroup) is the
number of elements in it. Thus we had groups of the orders 24, B, 4,2, and l.
The order ofa subgroup always divides the order ofthe group (sec. 6). The
indzx of a subgroup is the order of the group in which it lies divided by the
order ofthe subgroup. Thus the index of the subgroup of order B is 3.
760 cAlors THEoRY
The above sketch merely shows the ideas Galois dealt with. His work
proceeded as follows: Given an equation general or particular, he showed
first how one could find the group G ol this equation in the field of the co-
efficients, that is, the group of substitutions on the roots that leaves in-
variant every relation among the roots with coefficients in that field. Of
course one must find the group of the equation without knowing the roots.
In our example above, the group of the quartic equation was of order B and
the field of the coefficients was R. Having found the group G of the equation,
one seeks next the largest subgroup H in G. In our example this was the
subgroup of order 4. If there should be two or more largest subgroups we
take any one. The determination of H is a matter ol pure group theory and
can be done. Having found //, one can find by a set ofprocedures involving
only rational operations a function { ofthe roots whose coefficients belong
to rR and which does not change value under the substitutions in 11but does
change under all other substitutions in G. In our example above the function
was *f - rfi. Actually an infinity of such functions can be obtained. One must
of course find such a function without knowing the roots. There is a method
of constructing an equation in ft one of whose roots is this function {. The
degree ofthis equation is the index of Ilin G. This equation is called a partial
resolvent.l2 In our example the equation is 12 - (p'- 4q):0 and its
degree is Bl4 or 2.
One must now be able to solve the partial resolvent to find the root {.
In our exampl e $is t/ p2 --fi. One adjoins d to R and obtains a new field
.R'. Then the group of the original equation with respect to the field R' can
be shown to be H.
We now repeat the procedure. We have the group 11, of order 4 in our
example, and the field R'. We seek next the largest subgroup in 1L In our
example, it was the subgroup of order 2. Let us call this subgroup r(. One
can now obtain a function of the roots of the original equation whose co-
efficients belong to lt'and whose value is unchanged by every substitution
in l( but is changed by other substitutions in 1L In our example above this
was rs - xo. To find this function, /, say, without knowing the roots, one
constructs an equation in R' having the function /, as a root. In our example
this equation is t2 - 2(-p - \/F -4) : 0. The degree of this equation
is the index of ( with respect to H, that is, 4f2 or 2. This equation is the
second partial resolvent.
Now one must be able to solve this resolvent equation and get a root,
the function {r, and adjoin this value to .R' thereby forming the field ,R".
With respect to .R" the group of the equation is I(.
We again repeat the process. We find the largest subgroup Z in 1(. In
our example this was just the identity substitution E. We seek a function ol
12. This use of the word "resolvent" is diflerent from Lagrange's use.
cAr.ors's THEoRY oF soLvABrLrrY 76t
the roots (with coefficients in R") which retains its value under E but not
under the other substitutions of r(. In our example such a function was
fit - xz, To obtain this {, without knowing the roots we must construct
an equation in -R" having the function $2 as a root. In our example the
equation is t2 - 2(-p + lt - +q) : 0. The degree ol this equation is
the index of I in K. In our example this index was 2/l or 2. This equation is
the third partial resolvent. We must solve this equation to find the value {r.
By adjoining this root to R" one gets a field .l?'. Let us suppose that we
have reached the final stage wherein the group ofthe original equation in rR"
is the identity substitution, .8.
Now Galois showed that when the group of an equation with respect to
a given field is just d then the roots of the equation are members of that
field. Hence the roots lie in the field R", and we know the field in which the
roots lie because .R' was obtained from the known field ,R by successive
adjunction of known quantities, There is next a straightforward process for
finding the roots by rational operations in R'.
Galois gave a method of finding the group of a given equation, the
successive resolvents, and the groups of the equation with respect to the
successively enlarged fields of coefficients that result from adjoining the roots
of these successive resolvents to the original coefficient field, that is, the
successive subgroups of the original group. These processes involve consider-
able theory but, as Galois pointed out, his work was not intended as a
practical method of solving equations.
And now Galois applied the above theory to the problem of solving
polynomial equations by rational operations and radicals. Here he introduced
another notion of group theory, Suppose IIis a subgroup of G. If one multi-
plies the substitutions of H by any element g of G then one obtains a new
collection ol substitutions which is denoted by gI1, the notation indicating
that the substitution g is performed first and then any element of /Jis applied.
If gH : Hg for every g in G then 11 is called a normal (self-conjugate or
invariant) subgroup in G.
We recall that Galois's method of solving an equation called lor finding
and solving the successive resolvents. Galois showed that when the resolvent
that serves to reduce the group ofan equation, say from G to Il, is a binomial
equation *e : ,4 of prime degreep, then.F/is a normal subgroup in G (and of
indexl), and conversely if flis a normal subgroup in G and of prime indexp
then the corresponding resolvent is a binomial equation of degree p or can
be reduced to such. Ifall ofthe successive resolvents are binomial equations,
then, in view olGauss's iesult on binomial equations, we can solve the original
equation by iadicals. For, we know that we can pass from the initial field to
the final field in which the roots lie by successive adjunctions of radicals.
Conversely if an equation is solvable by radicals then the set of resolvent
equations must exist, and these are binomial equations.
762 oAlors THEoRY
property that each of its roots is a rational function of two ol them with
coefficients in .R. Such an equation is now called Galoisian. The simplest
example of a Galoisian equation is xp - A : 0. The notion is an extension
of Abelian equations.
As a postscript Hermite 13 and Kronecker in a letter to Hermite 14 and
in a subsequent paperls solved the general quintic equation by means of
elliptic modular functions. This is analogous to the use of trigonometric
functions to solve the irreducible case of the cubic equation.
17. Memorie ddla Societd ltaliana delle Scienze, 10, 1803, 385-409.
18. Jour. de l'Ecole Poly., 10, 1815, :
l-28 (Euures, (2), l, 64-90.
TerG GALoIs THEoRY
correspondence between the elements of the two groups such that il a'b : c
in the first one, then for the corresponding elements in the second, a' ,b' : c' .
He also introduced the notions of simple and composite groups. A group
having no invariant subgroup is simple; otherwise it is composite. Apropos
of these notions, Galois expressed the conjecture 1e that the smallest simple
group whose order is a composite number is the group ol order 60.
Unfortunately Galois's work did not become known until Liouville
published parts ofit in 1846, and even that material was not readily under-
standable. On the other hand, Lagrange's and Ruffini's work on substitution
groups, couched in the Ianguage of the values which a function of z letters
can take on, became well known. Hence the subject of the solution of equa-
tions receded into the background, and when Cauchy returned to the theory
of equations he concentrated on substitution groups. During the years 1844
to 18,16 he wrote a host of papers. In the major paper2o he systematized
many earlier results and proved a number of special theorems on transitive
primitive and imprimitive groups and on intransitive groups. In particular
he proved the assertion by Galois that every finite (substitution) group whose
order is divisible by a prime p contains at least one subgroup of order p.
The major paper was followed by a great number of others published in the
Comptes Rndus of the Paris Academy for the years 184446.2t Most of this
work was concerned with the formal (that is non-numerical) values that
functions of z letters can take on under interchange of the letters and with
finding functions that take on a given number of values.
After Liouville published some of Galois's work, Serret lectured on it at
the Sorbonne and in the third edition of his Cours gave a better textual
exposition of Galois theory. The work of clarifying Galois's ideas on the
solvability of equations and the development of the theory of substitution
groups therealter proceeded hand in hand. In his text Serret gave an im-
proved form of Cauchy's lBl5 result. Ifa function ofz letters has less than p
values where 1 is the largest prime smaller than n, then the function cannot
have more than two values.
One ofthe problems Serret stressed in the 1866 text asks for all the groups
that one can form with z letters. This problem had already attracted the
attention of Ruffini, and he, Cauchy, and Serret himselfin a paPer of 185022
gave a number of partial results, as did Thomas Penyngton Kirkman
(1806-95). Despite many efforts and hundreds ofllimited results the problem
is still unsolved.
After Galois, Camille Jordan (1838-1922) was the first to add signifi-
19. (Euores, 1897 ed., 26.
20. Exercices d'analyse et de lhysigue math;malique, 3, 1844, l5l-252 : (Ewres, (2), 13'
t7t-282.
21. (Ewres, (l), Vols. 9 and 10.
22. Jota. de Math., (l), 15, 1850,45-70'
THE THEORY OF SUBSTITUTION GROUPS 762
But the more useful representation ofall types of groups was introduced by
Jordan. He sought to represent substitutions by linear transformations of the
form
IL
Since the substitution groups are finite some restriction had to be placed on
the transformations so that the group of translormations would be finite.
Galois had considered such transformations 28 and had limited them so that
the coefficients and the variables take values in a finite field ofprime order.
Yl:"$t' i:1'2'"''n
where the e1's arepth roots of unity. The theorem was proved by a number of
men.3o This was the beginning of a vast amount of research on the deter-
mination of the possible linear substitution groups of a given order and of
binary and ternary form (two and three variables). Also the determination
of the subgroups of given linear substitution groups and the algebraic
expressions left invariant by all the members ola group or subgroup stirred
up much research.
Directly after noting Bravais's paper Jordan undertook the first major
investigation of infnite groups. In his paper " M6moire sur les groupes de
mouvements," sl Jordan points out that the determination of all the groups
of movements (he considered only translations and rotations) is equivalent
to the determination ol all the possible systems of molecules such that each
movement of any one group transforms the corresponding system of molecules
into itself. He therefore studied the various types ofgroups and classified them.
The results are not as significant as the f,act that his paper initiated the study
of geometric transformations under the rubric of groups and the geometers
were quick to pick up this line of thought (Chap. 38, sec. 5).
One other development ol the middle nineteenth century is both note-
worthy and instructive. Arthur Cayley, very much influenced by Cauchy's
work, recognized that the notion of substitution group could be generalized.
In three paperss2 Cayley introduced the notion of an abstract group- He used
a general operator symbol 0 applied to a system of elements *,y,2,... and
spoke of g so applied producing a function *',!', Z',. ' . of x,y, 2,.... He
pointed out that in particular d may be a substitution' The abstract group
contains many operatorc 0, $, . . .. 0+ is a compound (product) of two
operations and the compound is associative but not necessarily commutative.
His general definition of a group calls for a set of operators l, c, B, . . . , all of
them different and such that the product ofany two of them in either order
or the product of any one and itself belongs to the set.33 He mentions
matrices under multiplication and quaternions (under addition) as con-
stituting groups. Unfortunately, Cayley's introduction of the abstract group
29. Jour. far. Math., 84, 1878, 89-215, p. I 12 in particular: CEa)rcs,2, 13-139, p' 36 in
particular.
30. See E. H. Moore, Math. Ann.,50, 1898, 215.
31. Annali di Mat., (2),2, 1868/69, 167-215 and 322-45 = CEuures, 4,231-302'
32. Phil. Mag., (3), 34, 1849, 527-29 : Coll. Math. Papers,l,42?-24 and (4)'7,1854,4047
and 408-9 : Palerst 2' 123-30 and l3l-32.
33. Pa!*s,2, 124. )
77o GALOIS THEORY
Bibliograplry
Ab€l, N. H.z (Euwcs compDks (l8Sl), 2 vols., Johnson Reprint Corp., 1964.
Bachmann, P. : " Uber Gauss' zahlentheoretische Arbeiten," Nachrichlen Kiinig, Ges.
dcr Wiss. zu Gott., l9l l, 455-518. AIso in Gauss's Werke, lO, Part 2, l-69.
BurLhardt, H. : " Endliche discrete Gruppen, Encgk. der Math. Wiss,, B. G. Teubner,
1903-15, I, Part l, 208-26.
................._:
"Die Anliinge der Gruppentheorie und Paolo Ruffini," Abhandlungcn zur
Matlumatik, Heft 6, 1892, I 19-59.
Gcschichte dcr
Burns, Josephine E.: "The Foundation Period in the History of Group Theory,"
Anur. Math. Monthl.y,20, 1913, l4l-48.
Dupuy, P.: "La Vie d'Evariste Galois," Ann. & l'Ecole Norm. Sup., (2), 13, 1896,
197-266.
Galois, Evariste: " (Euvres," Jour. dc Math., I l, 18'16, 3814+4.
(Eutrcs matMnatignes, Gauthier-Villars, 1897.
Eaits ct mimoircs matMmatiques (ed. by R. Bourgne and J.-P' Azra),
Gauthier-Villars, I 962.
Gauss, C. F.: Disquisitiorus Arithneticae (1801), Werhc, Vol. l, Kiinig. Ges. der Wiss.,
zu Giittingen, 1870, English translation by Arthur A. Clarke, S.J., Yale
University Press, 1966.
Hobson, E. W.: Squaring tlu Circle and Othcr Monograplu, Chelsea (reprint), 1953.
H6lder, Otto: " Galois'sche Theorie mit Anwendungen," Eruyk. der Math, Wiss.,
B. G. Teubner, 1898-1904, I, Part 1,480-520.
Infeld, Leopold: Whom the Gods laac: Tlu Story of Etariste Galoi,r, McGraw-Hill,
1948.
Jordan, Camille: (Euorcs, 4 vols., Gauthier-Villan, 196l-64.
Trait| dcs substitutiotu ct des iquati.ons alg€briques (1870), Gauthier-Villars
(reprint), 1957.
Kiernan, B. M.: "The Development of Galois Theory from Lagrange to Artin,"
Archioc for Hi.story oJ Exaa Scierces, 8, 1971, +0-154.
Lebesgue, lJenfi:. Notite sw la aie et les traoaux de Canille Joilan, Gauthier-Villars,
1923. Also in Lebesgue's Notices d'histoire des mathimatiques, pp. '$'N{5,
Institut de Math6matiques, Gendve, 1958.
Miller, G. A.: " History of the Theory of Groups to 1900," Collectcd Works, Yol. 1,
427-67, University of Illinois Press, 1935.
Pierpont, James: " Lagrange's Place in the Theory of Substitutions ," Atruer. Math.
Soc. Bull., l,
1894/95, 196-204.
" Early History of Galois's Theory of Equations," Amer. Math, Soc. Bull., 4,
1898, 332-40.
77r
Smith, David Eugene: A Sourcc Book in Mathemalics, Dover (reprint), 1959, Vol. l,
232-52, 2s3-60, 261-66, 278-45.
Verriest, G.: (Euares matMmatiques d'Eoaristc Galois (1897 ed'),2nd ed., Gauthier-
Villan, l95l.
Wiman, A.: "Endliche Gruppen linearer Substitutionen," Enryk. der Math. Wiss.,
B. G. Teubner, 1898-1904, l, Parr 1,522-54.
Wussing, H.L.: Die Gcusis des abslrakten Grappenbcgrifes, VEB Deutscher Verlag der
Wiss., 1969.
32
Quaternions, Vectors, and Linear
Associative Algebras
literal expressions possessed a logic of its own, which accounted for its
effectiveness and correctness. Flence in the I830s the mathematicians
tackled the problem of justifying the operations with literal or symbolic
expressions.
This problem was first considered by George Peacock (1791-1858),
professor of mathematics at Cambridge University. To justify the operations
with literal expressions that could stand for negative, irrational, and complex
numbers he made the distinction between arithmetical algebra and sym-
bolical algebra. The former dealt with symbols representing the positive
integers and so was on solid ground. Here only operations leading to positive
integers were permissible. Symbolical algebra adopts the rules of arithmetical
algebra but removes the restrictions to positive integers. All the results
deduced in arithmetical algebra, whose expressions are general in form but
particular in value, are results likewise in symbolical algebra where they are
general in value as well as in form, Thus a'an : an+n holds in arithmetical
algebra when rn and a are positive integers and it therefore holds in sym-
bolical algebra for all m and z. Likewise the series for (a * D)n when z is a
positive integer, il it be exhibited in a general form without reference to a
final term, holds for all z. Peacock's argument is known as the principle of
the permanence of form.
The explicit formulation of this principle was given in Peacock's
" Report on the Recent Progress and Present State of Certain Branches of
Analysis," l in which he does not merely report but dogmatically affirms. In
symbolical algebra, he says:
l. The symbols are unlimited both in value and in representation.
2. The operations on them, whatever they may be, are poasible in
all cases.
3. The laws of combination of the symbols are of such a kind as to
coincide universally with those in arithmetical algebra when the symbols
are arithmetical quantities, and when the operations to which they are
subject are called by the same names as in arithmetical algebra.
In this paper Gregory emphasized the commutative and distributive laws' the
t".*, lurlt g b."In irrt.oirr""d by Frangois-Joseph Servois (1767-1847) '4
The thJory of algebra as the science of symbols and the laws of their
combinations was carried further by Augustus De Morgan, who wrote several
papers on the structure of algebra.E Ic;is Tigonometry and Double Alge,bra
double algebra meant the algebra
if6+S1 ufro contains his views. The words
of numbers, whereas single algebra meant negative numbers' Prior
"o*pl.*
2. 184245; lst ed., 1830.
3. Transactions of tlu Royat Society of Edinburgh, 14, 1840, 208-16'
4. Ann. dc Math.,5, l814/15' 93-140.
5. Trans, Canb. Phil. Soc., 1841, 1842, 1844' and 1847'
PERMANENCE OF FORM I l5
to single algebra is universal arithmetic, which covers the algebra of the
positive real numbers. Algebra, De Morgan maintained, is a collection of
meaningless symbols and operations with symbols. The symbols are 0, 1, *,
-, X , *, ( )( \ and letters. The laws ol algebra are the laws which such
symbols obey, for example, the commutative law, the distributive law, the
laws of exponents, a negative times a positive is negative, a - a : 0,
a + a : 1, and derived laws. The basic laws are arbitrarily chosen.
By the middle ofthe nineteenth century the axioms ofalgebra generally
accepted were :
Figure 32. I
complex number a * bi is not a genuine sum in the sense that 2 + 3 is' The
*" of th" plus sign is a historical accident and Di cannot be added to a' The
complex number a * bi is no more than an ordered couple (a, D) of real
,rrrrrib"... The peculiarity which i oi y'J introduces in the operations with
complex numbers is incorporated by Hamilton in the definitions of the
operations with ordered couples' Thus, if a * bi and c * di arc two complex
numbers then
(a,b) t (c,d): (a + c,b + d)
(a, b). (c,d) : (* - bd, ad + bc)
5 + 6i, represents the sum ofthe vectors added by means ofthe parallelogram
law. What the complex numbers do for vectors in a plane is to supply an
algebra to represent the vectors and operations with vectors. One need not
carry out the operations geometrically but can work with them algebraically
much as the equation ol a curve can be used to represent and work with
curves.
This use of complex numbers to represent vectors in a plane became
well known by 1830. However, the utility of complex numbers is limited.
Ifseveral forces act on a body these forces need not lie in one plane. To treat
these forces algebraically a three-dimensional analogue of complex numbers
is needed. One could use the ordinary Cartesian coordinates (x,y, z) of a
point to represent the vector from the origin to the point but there were no
operations with the triples of numbers to represent the operations with
vectors. These operations, as in the case of complex numbers, would seemingly
have to include addition, subtraction, multiplication, and division and
moreover obey the usual associative, commutative, and distributive laws so
that algebraic operations could be applied freely and effectively. The
mathematicians began a search for what was called a three-dimensional
complex number and its algebra.
Wessel, Gauss, Servois, M<ibius, and others worked on this problem.
Gauss ? wrote a short unpublished note dated l8l9 on mutations of space.
He thought of the complex numbers as displacements ; 4 + Di was a dis-
placement of a units along one fixed direction followed by a displacement of
D units in a perpendicular direction. Hence he tried to build an algebra ofa
three-component number in which the third component would represent a
displacement in a direction perpendicular to the plane of a * bi. He arrived
at a non-commutative algebra but it was not the effective algebra required
by the physicists, Moreover, because he did not publish it, this work had
little influence.
The creation of a useful spatial analogue of complex numbers is due to
William R. Hamilton (1805-65). Next to Newton, Hamilton is the greatest of
the English mathematicians and like Newton he was even greater as a
physicist than as a mathematician. At the age ol five Hamilton could read
Latin, Greek, and Hebrew. At eight he added Italian and French; at ten he
could read Arabic and Sanskrit and at fourteen, Persian. A contact with a
lightning calculator inspired him to study mathematics. He entered Trinity
College in Dublin in lB23 where he was a brilliant student. In 1822, at the
age of seventeen he prepared a paper on caustics which was read before the
Royal Irish Academy in lB24 but not published. Hamilton was advised to
rework and expand it. In lB27 he submitted to the Academy a revision
entitled "A Theory olSystems of Rays," which made a science of geometrical
7. Wrke, 8, 35742.
quATERNroNs, vEcroRs' AND LTNEAR ALGEBRAS
7?8
optics. Here he introduced what are called the characteristic functions of
optics. The paper was published in 1828 in the Transattions oJ the Rogal Irish
Academg.s
In t827, while still an undergraduate he was appointed Professor of
Astronomy at Trinity college, an appointment that carried with it the title
of Royal Astronomer of Ireland. His duties as professor were to lecture on
science and to manage the astronomical observatory. He did not do much
with the latter but he was a fine teacher.
In 1830 and lB32 he published three supplements to "A Theory of
Systerns of Rays." In the third papere he predicted that a ray of light
propagating in special directions in a biaxial crystal would give rise to a cone
r"fra"t"d rays. This phenomenon was confirmed experimentally by
"f
Humphrey Lloyd, a friend and colleague. Hamilton then carried over to
dynamics his ideas on optics and in the field of dynamics wrote two very
famous papers (chap. 30) in which he used the characteristic function con-
cept which he had developed for optics. He also gave a system of complete
and rigorous integrals for the difierential equations of motion of a system of
foaies. ffi" major mathematical work was the subject of quaternions, which
we shall discuss shortly. His final form of this work he presented in his
Izcturcs on Quahrnions (1853) and in a two-volume posthumously published
Eltrurns of Quateniorc (1866)
Hamilton could use analogy skillfully to reason from the known to the
unknown. Though he had a fine intuition, he did not have great flashes of
ideas but worked long and hard on special problems to see what could be
general. IIe was Patient and systematic in working out many specific
ixamples and was willing to undertake tremendous calculations to check or
p.orr" point. However, in his publications one finds only the polished
" general results.
compressed
-H"
** deeply religious and this interest was most imPortant to him'
Next in order of importance were metaphysics, mathematics, poetry,
physics, and general literature. He also wrote Poetry' He thought that the
and imagi-
ieometrical ideas created in his time, the use of infinite elements
iary elements in the work of Poncelet and Chasles (Chap. 35), were akin to
poeiry. Though he was a modest man, he admitted and even emphasized that
love of fame moves and cheers great mathematicians.
Hamilton'sclarificationofthenotionofcomplexnumbersenabledhim
to think more clearly about the problem ol introducing a three-dimensional
analogue to rePresent vectors in space' But the immediate effect w-as to
frustrite his efforts. All of the numbers known to mathematicians at this time
possessed the commutative Property of multiplication and it was natural for
equal and that the coemcients of their i, j, and k units shall be respectively
equal. Two quaternions are added by adding their scalar parts and adding
the coefficients of each of the i, j, and k units to form new coefficients for
those units. The sum of two quaternions is therefore itself a quaternion.
All the familiar algebraic rules of multiplication are supposed valid in
operating with quaternions except that in lorming products of the units
i, j, and k the following rules, which abandon the commutative law, hold:
jk:i, kj:-i, ki:j, ik: _j, ij:k, ji:-k
(3) i2:i2:k2:_1.
Thus if
p:3 +2i+6j + 7k and 9:4+6i +Bj +9k
then
pq : (3 + 2i + 6j + 7k)(4 + 6i + Bi + 9k) : - I I I + 24i + 72: + 35k
whereas
(4) v: j9 +:ft+u{.
Atc
-Vu: 0u.
(5) --^l
ox
+ 4*i + !u.
This vector, which varies lrom point to point of space, is now called the
gradient ofz. It represents in magnitude and direction the greatest space rate
of increase of z.
Also letting v : zri + o2i + z.k denote a continuous vector Point
function, wherein a1, a2, and u" are functions of x, g, and z, Hamilton intro-
duced
which departed even more from the usual properties of real and complex
numbers. This realization lvas necessary before vector algebra and analysis
could be developed because vectors violate more ordinary laws of algebra
than do quaternions (sec. 5). More generally, Hamilton's work led to the
theory of linear associative algebras (sec. 6). Hamilton himself began work on
hypernumbers which contain n components or z-tuples,l2 but it was his work
on quaterniotrs which stimulated the new work on linear algebras.
where the al and B, are real numbers and where cr, er, arld a, are primary or
qualitative units represented geometrically by direct line segments ol unit
length drawn from a common origin so as to determine a right-handed
orthogonal system of axes. The apl are multiples of the primary units and are
represented geometrically by lengths d, along the respective axes, while c is
represented by a directed line segment in space whose projections on the
axes are the lengths cr1. The same is true for the p, and p. Grassmann called
the directed line segments or line-vectors Slreafte.
12. Trans. Royal lrish Academy,2l, 1848, 199-296 : Mdth. Palers,III, 159-226.
GRASSMANN'S CALCULUS OF EXTENSTON 783
The addition and subtraction of these hypernumbers are defined by
These brackets are called units of the second order and are not reduced by
Grassmann (whereas Hamilton does) to units of the first order, that is, to the
r,, but are dealt with as though they were equivalent to first order units with
lrrrr] : e", and so forth.
From these definitions it follows that the inner product crlp of cr and B
is given by
"lF
: "rP,
* u2P2 * ,,3P, and
"lF
: Fl".
The numerical value or magnitude a of a hypernumber cc is defined as
l& : '/A + q', +-AB. Thus the magnitude of a is numerically equal to
the length of the line-vector which represents it geometrically. If d denotes
the angle between the line-vectors a and p, then
ow:- *"\ab
*tr , ab,+ 5&\
abp,q, +",8, :
ab)-"".abcoso.
With the aid of the outer product rule (9) the outer product P of the
hypernumbers a and p can be expressed as follows:
Hence the magnitude lP| ol the outer product [cp] is represented geo-
metrically by the area of the parallelogram constructed upon line-vectors
which are the geometrical representations ofa and B. This area together with
a unit line-vector normal to it, whose direction is chosen so that if cr is
rotated into B about the normal, then the normal will point in the direction
of a right-handed screw rotating from a to p, is now called a vectorial area.
Grassmann's term was Plangriisse,
Grassmann's inner product of two primary hypernumbers for three
dimensions is equivalent to the negative of the scalar part of Hamilton's
quaternion product of two vectors; and again in the three-dimensional case,
Grassmann's outer product, if we replace le2es] by e, and so forth, is precisely
Hamilton's quaternion product of two vectors. However, in the theory of
quaternions the vector is a subsidiary part of the quaternion whereas in
Grassman's algebra the vector appears as the basic quantity.
Another product was formed by Grassmann by taking the scalar (inner)
product of a hypernumber y with the vector (outer) product [cB] of two
hypernumbers a and B. This product Q for the three-dimensional case is
: ["F]y : ("z!s -
"s{r)yt * "t|)yz * ("rF, - az!)yz.
Q ("rF, -
In determinant fiorm
(12) a:
l:'; xi)
Consequently Q can be interpreted geometrically as the volume ofa paral-
lelepiped constructed on the line-vectors that represent a, B, and, y. ^the
volume may be positive or negative.
Grassmann considered (for z-component hypernumbers) not only the
two types ofproducts described above but also products ofhigher order. In a
paper of 1855,13 he gave sixteen different types of products for hyper-
numbers. He also gave the geometrical significance of the products and made
applications to mechanics, magnetism, and crystallography.
It might seem as though Grassmann's treatment of hypernumbers of z
parts was needlessly general since thus far at least the useful instances of
hypernumbers contained at most four parts. Yet Grassmann's thinking
helped to lead mathematicians into the theory ol tensors (Chap. 4B), for
tensors, as we shall see, are hypernumbers. Other geometrical ideas and the
idea of invariance had yet to make themselves known to mathematicians
before the day oftensors was to come. Though thinking about hypernumbers
13. Jour. far Math., 49, 1855, 10-20 and 123-41 : Gu. Math. und Phys. Werkc, 2, Part l,
t99-2t7.
FROM QUATERNIONS TO \IECTORS 785
14. Proc. London Math. Soc.,3, 1871,224-32 : The Scientilc Palers' Yol, 2,257-66.
786 qUATERNIONS, VECTORS, AND LINEAR ALGEBRAS
written by E. B. Wilson and based on Gibbs's lectures. The book, Gibbs and
l3a)
i.i:j.j:k.k:1
i.j: j.i: i.k: k.i: j.k: k.j:0.
(
Figure 32.2
be a scalar and not equal to the vector v, On the other hand if g were a
scalar then, though qv' is defined as qa'i + qb'i + q/k, it would be rare
that qa' : a, qb' : 6 and qc' : c where a, b, and c are the coefficients in v.
Despite the absence of a quotient the scalar product is useful.
The physical significance of the scalar product is readily shown to be
the following: If v' is a force (Fig. 32.2) whose direction and magnitude are
represented by the line segment from O to P' then the effectiveness of this
force in pushing an object at O, say, in the direction of OP, where OP
represents v, is the projection ol OP' on OP or OP'cos { where { is the angle
between OP' and OP. If OPis of unit length, the projection of OP' is precisely
the value of the product v'v'.
The second kind of product of vectors, called the vector product, is
defined as follows: We again multiply v and v' as polynomials but this time
let
ixi:ixi:kxk:0
(l3b) ixj:k, jxi:-k, jxt:i, kxj:-i,
kXi:j, iXk:-j.
Thus the product, which is indicated by v t v', is
v Xv' : (bc' - b'c)i * (ca' - ac')j + (ab' - b'a)k.
The vector product of two vectors is a vector. Its direction can readily
be shown to be perpendicular to that ofv and v' and pointed in the direction
that a right-hand screw moves when it is turned from v to v' through the
smaller angle. The vector product ol two parallel vectors is zero, though
neither factor is. Moreover, this product, like the quaternion product, is not
commutative. Further, it is not even associative. For example, i X j X i
mean (ixj) xj : k xj : -iori x (jxi) : i x0 : 0.
"u.t
There is no inverse to vector multiplication. For the quotient of v by
v' to be a vector q we would have to have
v:v'Xq
FROM qUATERNIONS TO VECTORS 789
and this would require, whatever q be, that v' be perpendicular to v which
may not be the case to start with. If 4 were a scalar it would be accidental
that qa'i + qb'i + qc'k would equal v.
The vector product, like the scalar product, is suggested by physical
situations. Let OP and PP' in Figure 32.3 be the lengths and directions ofv
and v'. Ilv'is a force whose magnitude and direction are those of PP', the
measure of the moment of lorce exerted by v' around O is the length of the
vector v ; v'and is usually taken to have the direction ofv X v'.
The algebra of vectors is extended to variable vectors and a calculus of
vectors. For example, the variable vector v(I) : a(t)i + D(t)j + e(l)k'
wherea(l), b(t), andc(t) are functions of l, is a vector function. Ifthe various
vectors one gets for various values of, are drawn from O as origin (Fig. 32.4)
the endpoints of these vectors will trace out a curve. Ilence the vector
lunction ofa scalar variable I plays a role analogous to the ordinary function
g : x2 + 7,
say, and a calculus of vectors has been developed for these vector functions
just as there is one for ordinary functions.
The concepts of gradient z,
(14) o":*i+alasi*?*o,
where a is a scalar function of x, y, and' z, the divergence ofa vector function
Azt--)- ----:
(15) V.n:%+
ax
---- Aa.
ov ' oz'
where a1, ur, and a, are the components ofv, and the curl ofv,
4) * : I I"(pcos,\ * Q cos p *
I I [,(q* * E * "
o, .R cos z) /s.
Here P, Q, and R are functions of x, y, and z and. are components ofa vector,
and ,\, p, andy are the direction cosines of the normal to the surface S which
bounds the volume Zover which the left-hand integral is taken. This theorem,
known as the divergence theorem (also Gauss's theorem and Ostrogradsky,s
theorem) can be expressed in vector form thus: If F is the vector whose
components are P, Q, and.R and n is the normal to,S then
(t7)
i/i".*:llF.nds.
Likewise Stokes's theorem, which was first stated by him as a question in an
examination for the Smith prize at Cambridge in 1854,18 states in scalar
form that
(18)
JJ
*" F.n ds : Ir.4*
*,
where r(s) is the vector whose components are x(s), y(s), and z(s).
When Maxwell wrote expressions and equations of electromagnetism,
especially the equations which now bear his name, he usually wrote out the
components of the vectors involved in grad z, div v, and curl v, However,
Heaviside wrote Maxwell's equations in vector form (Chap. 28, (52)).
It is true that calculations with vectors and vector functions are olten
made by resort to Cartesian components but it is highly important to think
also in terms of the single entity the vector and in terms of gradient, diver-
gence, and curl. These have a direct physical significance, to say nothing ol
the fact that complicated technical steps can be perlormed directly with the
and the fourteen equations obtained from these last seven by permuting each
set of three subscripts cyclically; e.g. e2%: et; eset: e2.
A general number (octonion) r is defined by
x: tco * *f1 *...* xzez,
wherein the *, are real numbers. The norm of x, N(x), is by definition
N(r) : xZ + x? +. . .+ xl.
The norm ofa product equals the product ofthe norms. The associative law
of multiplication fails in general (as does the commutative law of multiplica-
tion). Right- and left-hand division, except by zero, are always possible and
unique, a fact overlooked by Cayley and proved by Leonard Eugene
Dickson.2o In later papers Cayley gave other algebras of hypernumbers
differing somewhat from the one above.
Hamilton in his Lectures on Quaternions2l also introduced biquaternions,
that is, quaternions with complex coefficients. He noted that the product law
does not hold for these biquaternions; that is, two non-zero biquaternions
may have a product of zero.
William Kingdon Clifford ( 1845-79), professor of mathematics and
mechanics at University College, London, created another type of hyper-
number,22 which he also called biquaternions. If q and Q are rcal quaternions
and if crr is such that a2 : l, and ar commutes with every real quaternion,
then 4 + <,rQ is a biquaternion. Clifford's biquaternions satisfy the product
law of multiplication but the multiplication is not associative. In the latter
work Clifford introduced the algebras which bear his name. There are
Clifford algebras with units, 1,0t,...,an_1 such that the square of each
er : - | and. eie, : -e j4 for i I j. Each product of two or more units is a
new unit and so there are 2" different units. All products are associative. A
form is a scalar multiplied by a unit and an algebra is generated by the
sum and product of forms.
The flood of new systems of hypernumbers continued to rise and the
variety became enormous. In a paper " Linear Associative Algebra" read in
19. Phil. Mag., (3), 26, 1845, 210-13 and 30, 1847,257-58 : Coll. Math. Palers, t, t27
and 301.
20. Arner. Math. Soc. Trans., 13, 1912,59-73.
21. 1853, p. 650.
22. Proc. Lond. Math. Soc.,4, 1873,381-95 : Coll. Math. Papcrs, l8l-200 and Amer. Jour.
of Math., l, 1878, 350-58 : Coll. Math. Papers,266-76.
LINEAR ASSOCIATIVE ALGEBRAS 793
extensions of the complex number system ilwe wish to preserve at least some
of its algebraic properties. Had Hamilton known these theorems he would
have saved years oflabor in his search for a three-dimensional vector algebra.
The study of linear algebras with a finite and even infinite number of
generating (primary) units and with or without division continued to be an
active subject well into the twentieth century. Such men as Leonard Eugene
Diclson and J. H. M. Wedderburn contributed much to the subject.
Bi.bliograplry
Clifford, W. K.: Collccted Mailumatical Papcrs (882) Chelsea (reprint), 1968.
'
Collins, Joseph V.: "An Elementary Exposition of Grassmann's Ausdehnungslehre,"
Anur. Math. Monthl'y,6, 1899' several partsl and 7, 1900, several parts.
Coolidge, Julian L.: A History of Georutrical Methods, Dover (reprint), 1963, pp.
25244.
Crowe, IVlichael J,z A History of Vector Analysis, University of Notre Dame Press,
1967.
Dickson, Leonard E.:. Liruar Algebros, Cambridge University Press, 1914.
Gibba, Josiah W., and E. B. Wilson: Vector Analysi.s (1901), Dover (reprint), 1960.
Grassmann, H. G.: Dit liwah Ausdehnungslehre (1844), Chelsea (reprint), 1969.
......-.'...-...._: Gcsammelte matlumalisclu und plrysi.kalischc Werke, 3 vols., B. G. Teubner,
l. Introduction -
Though determinants and matrices received a great deal of attention in the
nineteenth century and thousands of papers were written on these subjects
they do not constitute great innovations in mathematics. The concept of a
vector, which from the mathematical standpoint is no more than a collection
ofordered triples, nevertheless has a direct physical significance as a force or
velocity, and with it one can write down at once mathematically what the
physics states. The same applies with all the more cogency to grailient,
divergence, and curl. Likewise though mathematically tlgltk is no more than
a symbol for a lengthy expression involving the limit of Ag/Ar, the derivative
is in itselfa powerful concept that enables us to think directly and creatively
about physical happenings. Thus though mathematics superficially regarded
is no more than a language or a shorthand, its most vital concepts are those
that supply keys to new realms of thought. By contrast, determinants and
matrices are solely innovations in language. They are shorthand expressions
for ideas which already exist in more expanded form. In themselves they say
nothing directly that is not already said by equations or transformations
albeit in lengthier fashion. Neither determinants nor matrices have influenced
deeply the course of mathematics despite their utility as comPact expressions
and despite the suggestiveness of matrices as concrete groups for the discern-
ment of general theorems ofgroup theory. Nevertheless, both concePts have
proved to be highly useful tools and are now part of the apparatus of
mathematics.
795
756
above or below the main diagonal are zero) is the product ofthe elements on
the main diagonal.
One of the consistent workers in determinant theory over a period ol
more than filty years was James Joseph Sylvester (1814-97)' After winning
the second wranglership in the mathematical tripos he was nevertheless
barred from teaching at Cambridge University because he was Jewish' From
I B4l to 1845 he was a professor at the University of Virginia' He then
returned to London and served from lB45 to lB55 as an actuary and a
lawyer. The relatively lowly position of professor in a military academy in
Woolwich, England, was offered to him and he served there until lBTl' A
few years of miscellaneous activity were followed by a professorship at Johns
Hopkins University where, starting in 1876, he lectured on invariant theory'
Helnitiated research in pure mathematics in the United States and founded
the American Journal of Mathematics.ln lBB4 he returned to England, and at
the age of seventy became a professor at Oxford UniverSity, a post he held
until his death.
Sylvester was a lively, sensitive, stimulating, passionate, and even
excitable person. His speeches were brilliant and witty and he presented his
ideas enthusiastically and with fire. In his papers he used glowing language'
He introduced much new terminology and jokingly likened himself to
Adam who gave names to the beasts and the flowers' Though he related
many diverse fields, such as mechanics and the theory of invariants, he was
not given to systematic and thoroughly worked-out theories' In fact he
frequently published guesses, and though many ofthese were brilliant others
*.r" i.r"o.r."t. He ruelully acknowledged that his friends on the Continent
" complimented his powers of divination at the expense of his judgment'"
His chief contributions were to combinatorial ideas and to abstractions from
more concrete develoPments.
One of Sylvester's major accomplishments was an improved method of
eliminating x from an zth degree and an mth degree polynomial' He called
it the dialytic method.a Thus to eliminate x from the equations
asxs * afi2 * arx I Ag :0
b oxz + brx + b2 :0
he formed the fifth order determinant
Ag A1 A2 as0
Oaodl a2 ag
(3) bo br b2 00
0DobL b2o
00bo br bz
4. Phil. Mag.,l6, 1840, 132-35 and 21,1842,534-39: Coll. Math' Papet, 1,5't-57 and
86-90.
798 DETERIIITNANTS AND MATRTCES
#:Zn,*r,,
t,t
where the prime denotes differentiation with respect to ,.
Determinants were employed in another connection, namely, in the
change of variables in a multiple integral. Special results were found first by
Jacobi (1832 and lB33). Then Eugtne Charles Catalan (l8l4-94) in 1839?
gave the result familiar today to students of the calculus. Thus the double
integral,
(4)
[[r@,y1a.as
under the change of variables given by
(5) x:f(u,o), y: g(u,a)
becomes
where G(2, o) : F(x(u, a), y(u, u)). The determinant in (6) is called the
Jacobian of x and, y with respect to u and a or the functional determinant.
Jacobi devoted a major papers to functional determinants. In this paper
Jacobi considers z functions each of which is a function of
ur,...,un
xts xz> from such z functions, the r,
. . . , xn and. raises the question of when,
can be eliminated so that the z,'s are connected by one equation. If this is
not possible the functions z, are said to be independent. The answer is that
if the Jacobian of the z, with respect to the x, vanishes the functions are not
independent and conversely. He also gives the product theorem forJacobians.
That is, if the u, are lunctions of yrand,theyrarc functions of the *,, then the
Jacobian of the z, with respect to the ,i is the product of the Jacobian of the
z, with respect to the g, and the Jacobian of the y, with respect to the x,.
*,:2b,,u,, i:1,2,...,n,
with non-vanishing determinant. Sylvester's law states that the number s of
positive terms and r - s negative ones is always the same no matter what real
transformation is used. Regarding the law as self-evident, Sylvester gave no
proof.
The law was rediscovered and proved byJacobi.lz I[a form is positive
or zero for all real values of the variables, it is called positive definite. Then
all the signs in (B) are positive, and r : z. It is called semidefinite if it can
take on positive. and negative values (in which case r < z), and negative
definite when the form is always negative or zero (and r : /,). These terms
were introduced by Gauss in his Disquisitiones Arithmeticae (sec.271).
The further study of the reduction of quadratic forms involves the
notion ofthe characteristic equation ofa quadratic form or ofa determinant.
A quadratic form in three variables was usually written in the eighteenth
century and first half of the nineteenth century as
In the latter notation the form has associated with it the determinant
lat-, ttrz a,,l
(10) lo* dzz or"l, ar!: dn.
lr", osz o".l
The characteristic equation or latent equation ofthe form or the determinant
is
larr - tr atz ars I
A: S a,*,x.- B : ) b"*'*'
occur in the greatest common divisor D,(I) of the ith order minors for each i
are the elementary diaisors ol lA + ,\Bl or of any general determinant. The
quotients of D,(,\) by D,-r()) for each i are called the invariant factors ol
lA + fBl. Sylvester did not prove that the invariant factors constitute a
complete set of invariants for the two quadratic forms.
Weierstrass 25 completed the theory of quadratic forms and extended the
theory to bilinear forms, a bilinear forrn being
atzclyr + apxl!2*,.,+ annlnyn.
lfi ll:i
A series of results due to many men including Kronecker and Cayley led to
the general results now usually stated in terms of matrices but which in the
middle of the nineteenth century were stated in terms of augmented and
unaugmented determinants. The general results on m equations in z un-
knowns with zn greater than, equal to, or less than z-the equations can be
homogeneous (the constant terms are zero) or nonhomogeneous-are stated,
for example, in Charles L.
Dodgson's (Lewis Carroll, lB32-lB9B) ln
Elementarg Theorg of Delerninants (1867).In later texts one finds the present
condition: In order that a set of zn nonhomogeneous linear equations in z
unknowns may be consistent it is necessary and sufficient that the highest
order non-vanishing minor be of the same order in the unaugmented and
25. Mottatsber. Berliner Akad., 1868, 310-38 : Werke,2, l9-+4.
26. Phil. Trans., l5l, 186l, 293-326 : Coll. Math. Papers, 1,367-409.
Bo4
augmented arrays, or in terms of matrix language that the ranks of the two
matrices be the same.
New results on determinants were obtained throughout the hineteenth
century. As an illustration there is the theorem proved by Hadamard in
1893,27 though known and proved by many others before and after this date.
Ifthe elements of the determinant D : la,rl satisfy the condition loul . A
then
lDl < Aa ' n"rz.
The above theorems on determinants are but a small sample of
the multitude that have been established. Beyond a great variety of
other theorems on general determinants there are hundreds of others on
determinants of special form such as symmetric determinants (au: ai,
skew-symmetric determinants (au : - ar,), orthogonants (determinants of
orthogonal coordinate transformations), bordered determinants (determi-
nants extended by the addition of rows and columns), compound deter-
minants (the elements are themselves determinants), and many other special
types.
4. Matri.cu
One could say that the subject of matrices was well developed before it was
created. Determinants, as we know, were studied from the middle of the
eighteenth century onward. A determinant contains an array of numbers
and usually one is concerned with the value of that array, given by the
definition of the determinant. However, it was apparent from the immense
amount of work on determinants that the array itself could be studied and
manipulated for many purposes whether or not the value of the determinant
came into question. It remained then to recognize that the array as such
should be given an identity independent of the determinant. Thelarray itself
is called a matrix. The word matrix was first used by Sylvester2s when in
fact he wished to refer to a rectangular array of numbers and could not use
the word determinant, though he was at that time concerned only with the
determinants that could be formed from the elements of the rectangular
array. Later, as we have already noted in the preceding section, augmented
arrays were used freely without any mention of matrices. The basic properties
of matrices, as we shall see, were also established in the development of
determinants.
It is true, as Arthur Cayley insisted (in the 1855 paper referred to
below), that logically the idea of a matrix precedes that of a determinant but
historically the order was the reverse and this is why the basic properties of
27. Bull. du Sci. Math., (2), 17, 1893, 240-46 : @uures, 1,239-45.
28. Phil. Mag., (3), 37, 1850, 363-70 : Coll. Math. Papers, l, 145-51.
MATRT.E. Bo5
matrices were already clear by the time that matrices were introduced. Thus
the general impression among mathematicians that matrices were a highly
original and independent creation invented by pure mathematicians when
they divined the potential usefulness of the idea is erroneous. Because the
uses of matrices were well established it occurred to Cayley to introduce
them as distinct entities. He says, "I certainly did not get the notion of a
matrix in any way through quaternions; it was either directly from that of a
determinant or as a convenient way of expression ofthe equations:
x':ax*b!
Y':cx+dY'"
And so he introduced the matrix
(, ')
which represents the essential information about the transformation. Because
Cayley was the first to single out the matrix itself and the first to publish a
series ofarticles on the subject he is generally credited with being the creator
of the theory of matrices,
Cayley, born in lB21 of an old and talented English family, showed
mathematical ability in school. His teachers persuaded his flather to send him
to Cambridge instead of putting him in the family business. At Cambridge
he was senior wrangler in the mathematical tripos and won the Smith
prize. He was elected a fellow of Trinity College in Cambridge and assistant
tutor, but left after three years because he would have had to take holy
orders. He turned to law and spent the next fllteen years in that Profession.
During this period he managed to devote considerable time to mathematics
and published close to 200 papers. It was during this period, too, that he
began his long friendship and collaboration with Sylvester.
In 1863, he was appointed to the newly created Sadlerian professorship
of mathematics at Cambridge. Except for the year lBB2, spent at Johns
Hopkins University at the invitation of Sylvester, he remained at Cambridge
until his death in 1895. He was a prolific writer and creator in various
subjects, notably the analytic geometry of z dimensions, determinant theory,
linear transformations, skew surfaces, and matrix theory. Together with
Sylvester, he was the founder ofthe theory ofinvariants. For these numerous
contributions he received many honors.
Unlike Sylvester, Cayley was a man of even temper, sober judgment, and
serenity. He was generous in help and encouragement to others. In addition
to his fine work in law and prodigious accomplishments in mathematics,
he found time to develop interests in literature, travel, painting, and
architecture.
806 DETERMINANTS AND MATRICES
(:::)and(:::)
The sum of two matrices is defined as the matrix whose elements are the
sums of the corresponding elements of the two summands. He notes that this
definition applies to any two z by z matrices and that addition is associative
and convertible (commutative). If m is a scalar and A a matrix then rzl is
defined as the matrix whose elements are each zz times the corresponding
element of l.
The definition of multiplication of two matrices Cayley took directly
from the representation of the effect of two successive transformations. Thus
if the transformation
x':atx*atz!
y':azLtc+az\!
is followed by the transformation
x':bnx'*bn!'
U':bzfi'*bzz!'
then tlre relation between *n , g' an.d .r, y is given by
lM-xll:a,
where lM - r1l is the determinant of the matrix M - xI and .I is the unit
matrix. Thus if
,:(:'),
the characteristic equation (Cayley does not use the term though it was
introduced for deterrrtinants by Cauchy [sec. 3]) is
(13) x'-(a+d)x+(ad-bc):O.
The roots of this equation are the characteristic roots (eigenvalues) of the
matrix.
In the lB5B paper Cayley announced what is now called the Cayley-
Hamilton theorem for square matrices of any order. The theorem states that
if M is substituted for x in ( 13) the resulting matrix is the zero matrix.
Bo8 DETERMINANTS AND MATRICES
Cayley states that he has verified the theorem for the 3 by 3 case and that
further proof is not necessary. Hamilton's association with this theorem rests
on the fact that in introducing in hii kclures on Quatemionssr the notion of a
linear vector function r' of another vector r, a linear transformation from
x, y, and, z to x', y', and, z' is involved. He proved that the matrix of this
transformation satisfied the characteristic equation ofthat matrix, though he
did not think formally in terms of matrices.
Other mathematicians found special properties of the characteristic
roots of classes of matrices. Hermites2 showed that if the matrix M : M*,
where ll1* is the transpose of the matrix formed by replacing each element
of M by its complex conjugate (such M's are now called Hermitian), then
the characteristic roots are real. In 186l Clebsch 33 deduced from Hermite's
theorem that the non-zero characteristic roots of a real skew-symmetric
matrix are pure imaginaries. Then Arthur Buchheim3a (1859-88) demon-
strated that if M is symmetric and the elements are real, the characteristic
roots are real, though this result was already established for determinants by
Cauchy.ss Henry Taber (1860-?) in another paperso asserted as evident
that if
xn - mr*'t-L * fllz*n-2 -...+ nn:O
M, then the determinant
is the characteristic equation of any square rnatrix
of M
is mn and if by a principal minor of a matrix we understand the deter-
minant of a minor whose diagonal is part of the main diagonal of M, then mt
is the sum of the principal i-rowed minors. In particular, then, m1, which
is also the sum of the characteristic roots, is the sum of the elements along
the main diagonal. This sum is called the trace of the matrix. The proofs of
Taber's assertions were given by William Henry Metzler (1863-?).3?
Frobenius3s raised a question related to the characteristic equation. He
sought the minimal polynomial-the polynomial of lowest degree-which
the matrix satisfies. He stated that it is formed lrom the factors of the
characteristic polynomial and is unique. It was not until 19043e that Kurt
Hensel (186l-1941) proved Frobenius's statement of uniqueness. In the
same article, Hensel also proved that ifrf(x) is the minimal polynomial of a
matrix rM and g(x) is any other polynomial satisfied by M, then f (x) divides
s@).
3t. 18s3, p. 566.
32. Comp. Rend.,4l, 1855, l8l-83 : CEuues, l, 479-gl.
33. Jow. far Math.,62, 1863,232-45.
34. Musonger of Math., (2), 14, 1885, 14344.
35. (Euorer, (2), 9, 174-91.
36. Amer. Jour. of Malh., 12, 1890, 337-96.
37 . Amer. Jour. o-f Math., 14, l89l192, 326_77 .
38. Jour. fiir Math., 84, 1878, l-63 : Gu. Ahb., l, 343-405.
39. Jour. far Math., 127,1904, l16-66.
uATRrcEs Bog
100
,(:
&r0
0 0 0 ... ,\,
denote an /,th order matrix. Jordan showed that "I can be transformed to a
similar matrix having the form
00
Jz 0 "'
(:
00 )
This is the Jordan canonical, or normal, form of a matrix.
The similarity transformation from I to B was also treated by Frobenius
under the name of contragredient translormation in his IBTB paper. In the
0 ...
There are many basic theorems on congruent matrices. For example, if S is
symmetric and ^S1 is congruent to S, then 51 is symmeEic and if S is skew
then S, is skew.
In his lB92 paper in the Arnerican Joumal of Matlwmatics Metzler intro-
duced transcendental functions of a matrix, writing each as a power series
in a matrix, He established series for 0M, a-M,log M, sin M, and sin-L M.
Thus
eM: ) Malnl.
4
The ramifications of the theory of matrices are numerous. Matrices have
been used to represent quadratic and bilinear forms. The reduction of such
forms to simple canonical forms is the core of the work on the invariants of
matrices. They are intimately connected with hypernumbers and Cayley in
his lB5B paper developed the idea of treating hypernumbers as matrices.
Both determinants and matrices have been extended to infinite order.
Infinite determinants were involved in Fourier's work of determining the
coefficients ofa Fourier series expansion ofa function (Chap, 28, sec. 2) and
in Hill's work on the solution of ordinary differential equations (Chap. 29,
sec. 7), Isolated papers on infinite determinants were written between these
two outstanding nineteenth-century researches but the major activity
postdates Hill's.
Infinite matrices were implicitly and explicitly involved in the work of
Fourier, Hill, and Poincard, who completed Hill's work. However, the great
impetus to the study of infinite matrices came from the theory of integral
equations (Chap. 45). We cannot devote space to the theory of determinants
and matrices of infinite order.a6
46. See the reference to Bernkopf in the bibliography at end of chapter.
8rz
In the elementary work on matrices the elements are ordinary real
numbers though a great deal of what was done on behalf of the theory
of numbers was limited to integral elements. However, they can be com-
plex numbers and indeed most any other quantities. Naturally, the properties
the matrices themselves possess depend upon the properties of the elements.
Much late nineteenth- and early twentieth-century research has been devoted
to the properties of matrices whose elements are members of an abstract field,
The importance of matrix theory in the mathematical machinery of modern
physics cannot be treated here, but in this connection a prophetic statement
made by Tait is of interest. " Cayley is forging the weapons for future
generations of physicisu."
Bibli.ographg
Bernkopt Michael: "A History of Infinite Matrices," Archioe for History of Exact
Sciences, 4, 1968, 308-58.
Caylen Arthur: The Collccted Matlumatical Papers, 13 vols., Cambridge University
Press (1889-97), Johnson Reprint Corp., 1963.
Feldman, Richard W., Jr. : (Six articles on matrices with various titles), The
Matlumatics Teaehcr, 55, 1962, 482-A4,589-90, 657-59; 56, 1963, 37-38,
I0l-2, 163-64.
Frobenius, F. G.: Gesammehc Abhandlungen,3 vols., Springer-Verlag, 1968.
Jacobi, C. G. l.: Cesammelte We*.c, Georg Reimer, 1884, Vol. 3.
MacDuffee, C. C.: Tlu Theory of Matices, Chelsea, 1946.
Muir, Thomas: Tlu Tluory of Dckrminants in the Historical Order of Deaelopmznt
(1906-23), 4 vols., Dover (reprint), 1960.
List of writings on the theory of matrices, Anur. Jour. of Math.,20, 1898,
225-28.
Sylvester, James Joseph: The Collected Mathematical Papers, 4 vols., Cambridge
University Press, l90t[-12.
Weierstrass, Karlz Mathematische Werke, Mayer und Miiller, 1895, Vol. 2.
3+
The Theory of Numbers in the
Nineteenth Century
It is true that Fourier had the opinion that the principal
object of mathematics was public use and the explanation of
natural phenomena; but a philosopher like him ought to
know that the sole object of the science is the honor of the
human spirit and that under this view a problem of [the
theory of] numbers is worth as much as a problem on the
system of the world. . c. c. J. JAcoBI
l. Introduction
Up to the nineteenth century the theory of numbers was a series of isolated
though often brilliant results. A new era began with Gauss's Disquisitiones
Arithmetimet which he composed at the age of twenty. This great work had
been sent to the French Academy in 1800 and was rejected but Gauss
published it on his own. In this book he standardized the notation; he
systematized the existing theory and extended it; and he classified the
problems to be studied and the known methods olattack and introduced new
methods. In Gauss's work on the theory of numbers there are three main
ideas : the theory of congruences, the introduction of algebraic numbers, and
the theory of forms as the leading idea in Diophantine analysis. This work not
only began the modern theory of numbers but determined the directions ol
work in the subject up to the present time. The Disquisitioner is difficult to
read but Dirichlet expounded it.
Another major nineteenth-century development is analytic number
theory, which uses analysis in addition to algebra to treat problems involving
the integers. The leaders in this innovation were Dirichlet and Riemann.
8r3
Br4 TuE THEoRY oF NUMBERs ISoo-r9oo
This equation has no solutions because 2.r is even and 2x - 25 is odd. Hence
2x - 25 cannot be a multiple of 12. The basic theorem on polynomial
congruences, which Gauss re-proves in the second section, had already been
established by Lagrange.s A congruence of the ath degree
a*n I Bx"-L +... + Mx * N : 0modulol
whose modulus is a prime number p which does not divide I cannot have
more than z noncongruent roots.
In the third section Gauss takes up residues of powers. Here he gives a
proof in terms of congruences of Fermat's minor theorem, which, stated in
terms of congruences, reads: If p is a prime and a is not a multiple ofp then
aP-1 = I modulo p.
where a and z are relatively prime. This subject was continued by many men
after Gauss.
The fourth section of Disquisitiones treats quadratic residues. lf p is a
prime and a is not a multiple of p and if there exists an x such that x2 = a
modp, then a is a quadratic residue of p; otherwise a is a quadratic non-
residue ofp, After proving some subordinate theorems on quadratic residues
Gauss gave the first rigorous proof of the law of quadratic reciprocity (Chap.
2. Hisr. de I'Acad. de Berlin,24, 1768, 192 ff., pub. 1770 : CEuures,2, 655-726.
TI{E TTIEORY OF CONGRUENCES 8r5
25, sec. 4). Euler had given a complete statement much like Gauss's in one
paper of his Opuscula Analytica of l7B3 (Chap. 25, sec. 4). Nevertheless in
article l5l of his Disquisitiones Ganss says that no one had presented the
theorem in as simple a form as he had. He refers to other work of Euler
including another paper in
the Opuscula and to Legendre's work of 1785.
Of these papers Gauss rightly that the proofs were incomplete.
says
Gauss is supposed to have discovered a proofofthe law in 1796 when he
was nineteen. He gave another proof in the Disquisitiones and later published
four others. Among his unpublished papers two more were found. Gauss
says that he sought many proofs because he wished to find one that could be
used to establish the biquadratic reciprocity theorem (see below). The law
of quadratic reciprocity, which Gauss called the gem of arithmetic, is a basic
result on congruences. After Gauss gave his proofs, more than fifty others
were given by later mathematicians.
Gauss also treated congruences of polynbmials, If .,{ and B are two
polynomials in r with, say, real coefficients then one knows that one can
find unique polynomials Q and .R such that
A: B.Q + R,
where the degree ofR is less than the degree of B. One can then say that two
polynomials A, and A, are congruent modulo a third polynomial P if they
have the same remainder rR on division by P.
Cauchy used this idea 3 to define complex numbers by polynomial
congruences. Thus if;t(x) is a polynomial with real coemcients then under
division by x2 +|
l@): o + &xmodx2 + I
because the remainder is of lower degree than the divisor. Here a and D are
necessarily real by virtue of the division process. If g(x) is another such
polynomial then
cQc):c*dxrr;.odx2+l'
Cauchy now points out that if At, Az, and .B are any polynomials and if
AL: BQt + R1 and Az: BQz * Rz,
then
Al + Az = R1 * rR2 mod -8, and ArA2 = RlRs mod 8.
We can now see readily that
"f(i):ooaa,i+a2i2*"'
that
f(i) : oo - az * a4 -..- * (o, - as * as -''')i modulo i2 + l.
Hence any expression involving complex numbers behaves as one of the
form c * di and one has all the apparatus needed to work with complex
expressions. For Cauchy, then, the polynomials in i, with his understanding
about i, take the place of complex numbers and one can put into one class
all those polynomials having the same residue modulo i2 + l. These classes
are the complex numbers.
It is interesting that in 1847 Cauchy still had misgivings about y'J.
He says, " In the theory of algebraic equivalences substituted for the theory
olimaginary numbers the letter i cetses to represent the symbolic sign r/J,
which we repudiate completely and which we can abandon without regret
since one does not know what this supposed sign signifies nor what sense to
attribute to it. On the contrary we rePresent by the letter i a real quantity
but indeterminate and in substituting the sign = lor : we translorm what
has been called an imaginary equation into an algebraic equivalence relative
to the variable i and to the divisor e2 + l. Since this divisor remains the
same in all the formulas one.can dispense with writing it."
In the second decade of the century Gauss proceeded to search for
reciprocity laws applicable to congruences ofhigher degree. These laws again
involve residues of congruences. Thus for the congruence
*+: qnodp
one can define 4 as a biquadratic residue ofp ifthere is an integral value ofr
satisfying the equation. He did arrive at a law of biquadratic reciprocity
(see below) and a law of cubic reciprocity. Much of this work appeared in
papers from l80B to l8l7 and the theorem Proper on biquadratic residues
was given in papers of 1828 and 1832.4
4. Comm, Soc. Gott.,6, 1828, and 7,1832 : Werkt,2,6F92 and 93-148; also pp' 165-78'
THE THEORY OF CONGRUENCES Br7
(ff). :, - 1)(1/4x'{-1)(1/4)(8-1)(E)4
8rB THE THEoRY oF NUMBERS IBoo-I9oo
The symbol (c/B)1 has the following meaning: If p is any complex prime
and ,t is any biquadratic residue not divisible by p, then (klp), is the power
i" of i which satisfies the congruence
llNp'rrt4 : I modulo y'
3. Algebraic Numbers
The theory of complex integers is a step in the direction ofa vast subject, the
theory of algebraic numbers. Neither Euler nor Lagrange envisioned the
rich possibilities which their work on complex integers opened up' Neither
did Gauss.
The theory grew out of the attempts to prove Fermat's assertion about
xn + g" : 2", The cases z : 3,4, ar.d 5 have already been discussed (Chap'
25, sec, 4). Gauss tried to prove the assertion fot n :7 but failed' Perhaps
because
-he was disgusted with his llailure, he said in a letter of I 816 to
Heinrich W. M. Olbers (1758-1840), " I confess indeed that Fermat's
theorem as an isolated proposition has little interest for me, since a multitude
of such propositions, which one can neither prove nor refute, can easily be
and all four flactors can readily be shor,vn to be prime integers. Then unique
decomposition does not hold. Let us introduce, for this domain, the ideal
numbers o:12, pr:0 + \/=)l\/-2, Fr:0 - t/-sltl1. we see
that 6 : c2p.p2. Thus 6 is now uniquely expressed as the product ol four
factors, all ideal numbers as far as the domain a + bl4 is concerned.la
In terms of these ideals and other primes factorization in the domain is
unique (apart from factors consisting of units). With ideal numbers one can
prove some of the results of ordinary number theory in all domains that
previously lacked unique factorization.
Kummer's ideal numbers, though ordinary numbers, do not belong to
the class of algebraic numbers he had introduced. Moreover, the ideal
numbers were not defined in any general way. As far as Fermat's theorem
is concerned, with his ideal numbers Kummer did succeed in showing that
it was correct for a number of prime numbers. In the first hundred integers
only 37, 59, and 67 were not covered by Kummer's demonstration. Then
Kummer in a paper of 1857 15 extended his results to these exceptional
primes. These results were further extended by Dimitry Mirimanoff (1861-
1945), a professor at the University of Geneva, by perfecting Kummer's
method.l8 Mirimanoff proved that Fermat's theorem is correct for each z
up to 256 if x, y, and z are prime to that exPonent rr.
Whereas Kummer worked with algebraic numbers formed out of the
roots of unity, Richard Dedekind (lB3l-1916), a pupil of Gauss, who spent
fifty years of his life as a teacher at a technical high school in Germany,
approached the problem ofunique factorization in an entirely new and fresh
manner. Dedekind published his results in supplement l0 to the second
edition of Dirichlet's Zahlentheorie ( l87l ) which Dedekind edited. He
extended these results in the supplements to the third and fourth editions of
the same book.l7 Therein he created the modern theory ofalgebraic numbers.
Dedekind's theory of algebraic numbers is a generalization of Gauss's
complex integers and Kummer's algebraic numbers but the generalization is
somewhat at variance with Gauss's complex integers. A number r that is a
root of
(2) asxl * alxn-r +'''+ ar-it * ao: 0,
whe.e\he a,'s are ordinary integers (positive or negative), and that is not
a root ofsuch an equation ofdegree less than z is called an algebraic number
of degree z. Il the coefficient of the highest power of .r in (2) is l, the solutions
are called algebraic integers oldegree z. The sum, difference, and product of
14. With the introduction of the ideal numbers, 2 and 3 are no longer indecomposable,
for 2 = o' and 3: FrFz.
15, Abh. Kani.g. Akad. der Wiss. Berlin, 1858, +l-7+,
16. Jour. fiir Math., l2B, 1905, 4F68.
17. 4th ed., 1894 : wcrke,3,2-222.
Bzr
where the 11 are any integers of the field I(. This ideal is denoted by
(or, or, .. .The zero ideal consists ofthe number 0 alone and accordingly
, crn) .
is denoted by (0). The unit ideal is that generated by the number l, that is,
(1). An ideal I is called principal if it is generated by the single integer c,
so that (a) consists of all the algebraic integers divisible by c. In the ring of
the ordinary integers every ideal is a principal ideal.
An example of an ideal in the algebraic number field a * b1/=,
where a and D are ordinary rational numbers, is the ideal generated by the
integers 2 and I + \/=. This ideal consists of all integem of the form
2p + (l + \/=)r, where p and, v are arbitrary integers of the field' The
ideal also happens to be a principal ideal because it is generated by the
number 2 alone in view of the fact that (l + l/-- S)Z must also belong to
the ideal generated by 2.
Two ideals (or, or, .. . , cr) and (Fr, Fr, .. . , po) are equal if every
member of the former ideal is a member ol the latter and conversely. To
tackle the problem of factorization we must first consider the product of two
ideals. The product of the ideal A : (ar,' . ., cr") and the ideal B : (Br, . . ., Br)
of l( is defined to be the ideal
AB : (a$y dr\z, uz|y. . .,
""F ).
dt|r. . .,
It is almost evident that this product is commutative and associative. With'
this definition we may say that .,{ divides .B if there exists an ideal C such that
824 THE THEoRy oF NUMBERS r8oo-rgoo
whcrein a, b, and a are integral, is a binary form because two variables are
involved, and it is a quadratic form because it is of the second degree' A
number M is said to be represented by the form if for specific integral values
of a, b, c, x, and, y thc above expression equals M. One problem is to find the
set of numbers M that are rePresentable by a given form or class of forms.
The converse problem, glven M and given a,'b, and, c or some class of a, 6,
and a, to find the values of * and y that represent ,t1, is equally imPortant.
The latter problem belongs to Diophantine analysis and the former may
equally well be considered part of the same subject.
Euler had obtained some special results on these problems. However,
Lagrange made the key discovery that if a number is representable by one
form it is also representable by many other forms, which he called equivalent.
The latter could be obtained from the original form by a change ofvariables
(4) x:a*'l9y', y:yx'1-69'
wherein d, P, y, and 6 are integral and cr6 - Fy : 1.26 In particular,
Lagrange showed that for a given discriminant (Gauss used the word
determinant) b2 - u there is a finite number of forms such that each form
vrith that discriminant is equivalent to one of this finite number' Thus all
forms with a given discriminant can be segregated into classes, each class
consisting of forms equivalent to one member of that class. This result and
some inductively established results by Legendre attracted Gauss's attention.
In a bold step Gauss extracted from Lagrange's work the notion of equiva-
lence of forms and concentrated on that. The fifth section of his Disquisitiones,
by far the largest section, is devoted to this subject.
25. Noua Mim. dc I'Acad. dc Bcrlin, 1773,263-312; and 1775,323 ff. -- (Euttres,3' 693-795 '
THE THEORY OF FORMS Bz7
F:axz*2bxg+cgz
be transformed by means of (4) into the form
If now (cD - Fi' : l, the discriminants of the two forms are equal. Then
the inverse of the transformation (4) will also contain integral coefficients
(by Cramer's rule) and will transform F' into F. F and F' are said to be
equivalent. If cr6 - Fy : l, F and F' are said to be properly equivalent,
and if cE - Fy : - 1, then F and F' are said to be improperly equivalent.
Gauss proved a number of theorems on the equivalence of forms. Thus
if F is equivalent to ,F'' and .F.' to F' then F is equivalent to F'. If F is equiva-
lent to F', any number M representable by F is representable by F' and in
as many ways by one as by the other. He then shows, if Fand F' are equiva-
lent, how to find all the transformations from F into F'. He also finds all the
representations ofa given number M by a form F, provided the values ofr
and y are relatively prime.
By definition two equivalent forms have the same value for their
discriminant D : b2 - ac. However, two forms with the same discriminant
are not necessarily equivalent. Gauss shows that all forms with a given D
can be segregated into classes I the members of any one class are properly
equivalent to each other. Though the number of forms with a given D is
infinite, the number of classes for a given D is finite. In each class one fofm
can be taken as representative and Gauss gives criteria for the choice of a
simplest representative. The simplest form of all those with determinant D
has a : l, b : 0, c : -D. This he calls the principal liorm and the class to
which it belongs is called the principal class.
Gauss then takes up the composition (product) of forms. If the form
by the substitution
where the coemcients are integers, and undertakes a study very much like
what he has just done for binary forms. The goal as in the case of binary
forms is the representation of integers. The theory of ternary forms was not
carried far by Gauss.
The objective of the entire work on the theory of forms was, as already
noted, to produce theorems in the theory of numbers. In the course of his
treatment of forms Gauss shows how the theory can be used to prove any
number of theorems about the integers including many that were previously
proved by such men as Euler and Lagrange. Thus Gauss proves that any
prime number of the form 4n * | can be represented as a surn of squares in
one and only one way. Any prime number of the form Bz * I or Bz + 3
can be represented in the form x' + 2!2 (for positive integral ,r and y) in
one and only one way. He shows how to find all the representations of a
given number M by the given form ax2 + 2bxg + cy2 provided the dis-
criminant D is a positive non-square number. Further ifFis a primitive form
(the values of a, b, and r are relatively prime) with discriminant D and' if p
is a prime number dividing D, then the numbers not divisible by p which
can be represented by F agree in that they are all either quadratic residues
ofy' or non-residues ofP.
Among the results Gauss drew from his work on ternary quadratic
forms is the first proof of the theorem that every number can be represented
as the sum of three triangular numbers. These, we recall, are the numbers
l815 which established the general result first asserted by Fermat that every
integer is the sum of ft or fewer fr-gonal numbers'26 (The general *-gonal
number is n * (n2 - n)(k - 2)12.)
The algebraic theory ofbinary and ternary quadratic forms as presented
by Gauss has an interesting geometrical analogue which Gauss himself
initiated. In a review which appeared in the Gduingische Gelehrte Anzeigen of
183027 of a book on ternary quadratic forms by Ludwig August Seeber
(1793-1855) Gauss sketched the geometrical representation of his forms and
classes of forms.28 This work is the beginning of a development called the
geometrical theory of numbers which first gained prominence when Hermann
Minkowski (1864-1909), who served as professor of mathematics at several
universities, published his Geometrie der Zahlen (1896).
The subject of forms became a major one in the theory of numbers ol
the nineteenth century. Further work was done by a host of men on binary
and ternary quadratic forms and on forms with more variables and of higher
degree.2s
t..' T@)
xltog x
.A..
where 0.922 < AL < I and I < A2 < 1.105, but did not prove that the
function tends to a limit. This inequality was improved by many mathe-
32. OPuscula Analytica,2, 1783.
33. Min. de I'Acad. des Sci., Paris, 1785, 465-559, pub. 1788.
3*. Thiorie dcs rcmbres,2nd ed., p. 404'
35. Abh. Kinig. Akad. der Wiss., Berlin, 1837, 45-81 and 108-10 : Werke, 1,30742.
36. Abh. Kdnig. Akad. der Wiss., Berlin, 1841, l4l-Ol : Werke,2' 509-32.
37. llerke, 2, 4++-+7.
38. Min. Acad. Sci. St. Peters.,7, 1854, l5-33; also Jour. de Math., (l), 17, 1852.366-90:
(Ewres, l, 5l-70.
ANALYTIC NUMBER THEORY 83r
maticians including Sylvester, who among others doubted in lBBl that the
function had a limit. In his work Tchebycheff used what we now call the
Riemann zeta function,
{(") _
-.Ln'
s-l
though he used it for real values ofz. (This series is a special case ofDirichlet's
series.) Incidentally he also proved in the same Paper that for n > 3 there is
always at Ieast one prime between n and 2n - 2.
The zeta function for real z apPeafi in a work of Eulerss where he
introduced
I=J(,
Here the pn's are prime numbers. Euler used the function to Prove that the
sum of the reciprocals of the prime numbers diverges. For s an even positive
integer Euler knew the value of ((s) (Chap. 20, sec. 4). Then in a paper read
in 1749a0 Euler asserted that for real s
39. Comm. Acad. Sci. Pebop',9,1737, 160-88, pub. 1744 : O\era, (l), 14,2lM4'
40. Hist. de l'Acad. Berlin, 17,1761,83-106, pub' 1768
de :
Oleru, (l), 15' 70-90'
41. Monatsbcr. Berliner Akd.' 1859, 671-80 : Werke' 145-55'
42. In l9l4 Godfrey H. Hardy proved (Conp. Reul., 158, 1914, l0l2-14 = Coll' Popas'Z,
6-9) that an infinity of zeros of {(z) lie on the line r = }.
43. Bull. Soc. Math. dc France, 14, 1896, 199-220 : (Euovcs, 1,189-210'
83e THE TrrEoRy oF NUMBERs r8oo-r9oo
and proved the prime number theorem at the same time.aa This theorem is a
central one in analytic number theory.
Bibliographg
Bachmann, P.: " Uber Gauss' zahlentheoretische Arbeiten," Nathrichten Kdnig.
Ges. dcr Wiss. zu Gdtt., l9l l, 455-508; also in Gauss: Werle, lO2, l-69.
Bell, Eric T.: Tlu Deoclopment of Mathematics.,2nd ed., McGraw-Hill, 1945, Chaps.
9-10.
Carmichael, Robert D. : " Some Recent Researches in the Theory of Numbers,"
Amn. Math. Monthlg,39, 1932, 139-60.
Dedekind, Richard: Uber dil Thzorie dcr ganzzn algebaischen Zahlcn (repint of tJne
eleventh supplement to Dirichlet's Zahlentheorie), F, Vieweg und Sohn, 1964.
Gcsammelte matlumatischc l[e*e, 3 vols., F. Vieweg und Sohn, 1930-32,
Chelsea (reprint), 1968.
"Sur la thdorie des nombres entiers algdbriques," Bull. des Sci. Math,, (l),
I l, 1876, 278-48; (2), t, 1877, 1741,69-92, 14444,20748 : Ges. math.
We*c, 3, 263-96.
Dictson, Leonard E.: History oJ the Theory of Numbers, 3 vols., Chelsea (reprint),
1951.
Studbs in the Tluory of Numbers (1930), Chelsea (reprint), 1962.
" Fermat's Last Theorem and the Origin and Nature of the Theory of
Algebraic Numbers," Annals. of Math., (2), 18, 1917, l6l-87.
..................._ et al.: Algebraic Numbns, Repoil of Committee ott Algebraic Numbers,
-: National
Research Council, 1923 and 1928; Chelsea (reprint), 1967.
Dirichlet, P. G. L.: Werke (1889-97); Chelsea (reprint), 1969, 2 vols.
Dirichlet, P. G. L., and R. Dedekind,: Vorlesungen iiber Zahlentheorie, 4th ed., 1894
(contains Dedekind's Supplement); Chelsea (reprint), 1968.
Gauss, C. F.: Disquisitiones Arilhmeticae, trans. A. A. Clarke, Yale University Press,
1965.
Hasse, H.: "Bericht i-iber neuere Untersuchungen und Probleme aus der Theorie
der algebraischen Zahlkiirper," Jahres. der Deut. Math.-Vnein., 35, 1926,
l-55 and 36, 1927, 233-311.
Hilbert, David: "Die Theorie der algebraischen Zahlk6rper," Jahres. der Deut.
Math.-Verein., 4, 1897,175-546 : Gesammelte Abhandlungen, l, 63-363.
Klein, Felix: Vorlesungen iiber die Enlwicklung der Mathematik im 19. Jahrhundtrt,
Chelsea (reprint), 1950, Vol. 1.
Kronecker, Leopold: Werke, 5 vols. (1895-1931), Chelsea (reprint), 1968. See
especially, Vol.2, pp. l-10 on the law of quadratic reciprocity.
Grundzilge einer arithmetischen Theorie der algebraischen Gri)ssen, G. Reimer,
1BB2 : Jour. filr Math.,92, lBBll82, l-122 : lilerke, 2, 237-388.
Landau, Edmund : Handbuch der lxhre aon der Verteilung der Primzahlen, B. G. Teubner,
I909, Vol. l, pp. l- 55.
Mordell, L. J. : "An Introductory Account of the Arithmetical Theory of Algebraic
44. Ann. Soc. Sci. Bruxelles, (l), 20 Part II, 1896, 183-256, 281-397.
Bge
Numbers and its Recent Development," Arrur. Math. Soc. Bull., 29, 1923,
44s-63.
Reichardt, Ilans, ed. : C. F. Gauss, I*ben und Werk, }{aude und Spenersche Verlags-
buchhandlung, 1960, pp. 38-91; also B. G. Teubner, 1957.
Scott, J. F.; A History of Mathematics, Taylor and Francis, 1958, Chap. 15.
Smith, David E,: A Source Book in Mailumatics, Dover (reprint), 1959, Vol. I'
to7-48.
Smith, H. J. S.: Collected Mathematical Papers, 2 vols. (1890-94), Chelsea
(reprint), 1965. Vol. I contains Smith's rteparl on the Theory of Numbers,
which is also published separately by Chelsea, 1965.
Vandiver, H. S. : " Fermat's Last Theorem," Amer, Math, Monthlg,53, 1946, 555-78.
3s
The Revival of Projective
Geometry
The doctrines of pure geometry often, and in many questions'
give a simple and natural way to penetrate to the origin of
truths, to lay bare the mysterious chain which unites them,
and to make them known individually, luminously and
completely. MICIIEL CIrASLES
8z+
THE RENEWAL oF TNTEREsT rN cEoMETRy 8gs
l. Now. Mim. de I'Acad. da Beilin, 1773, 12148, pub. 1775 = CDw)rcs,3, 617-58'
8g6 THE REvrvAL oF pRoJECTTvE GEoMETRv
processes neglects all the small steps which geometry continually makes. The
quick and perhaps penetrating steps of analysis do not reveal the sense of
what is accomplished. The connection between the starting point and the
final result is not clear. Chasles asks, "Is it then sufficient in a philosophic
and basic study of a science to know that something is true if one does not
know why it is so and what place it should take in the series oftruths to which
it belongs ? " The geometric method, on the other hand, permits simple and
intuitively evident proofs and conclusions.
There was another argument which, first voiced by Descartes, still
appealed in the nineteenth century. Geometry was regarded as the truth
about space and the real world. Algebra and analysis were not in themselves
significant truths even about numbers and functions. They were merely
methods of arriving at truths, and artificial at that. This view of algebra and
analysis was gradually disappearing. Nevertheless the criticism was still
vigorous in the early nineteenth century because the methods of analysis
were incomplete and even Iogically unsound. The geometers rightly ques-
tioned the validity of the analytic proofs and credited them with merely
suggesting results. The analysts could retort only that the geometric proofs
were clumsy and inelegant.
The upshot ofthe controversy is that the pure geometers reasserted their
role in mathematics. As if to revenge themselves on Descartes because his
creation ofanalytic geometry had caused the abandonment ofpure geometry,
the early nineteenth-century geometers made it their objective to beat
Descartes at the game of geometry. The rivalry between analysts and
geomdiers grew so bitter that Steiner, who was a pure geometer, threatened
to quit writing fpr Crelle's Journal fiir Mathematik if Crelle continued to pub-
lish the analytical papers of Pliicker.
The stimulus to revive synthetic geometry came primarily from one man,
Gaspard Monge. We have already discussed his valuable contributions to
analytic and differential geometry and his inspiring Iectures at the Ecole
Polytechnique during the years 1795 to 1809. Monge himself did not intend
to do more than bring geometry back into the fold of mathematics as a
suggestive approach to and an interpretation of analytic results. He sought
only to stress both modes of thought. However, his own work in geometry
and his enthusiasm for it inspired in his pupils, Charles Dupin, Frangois-
Joseph Servois, Charles-Julien Brianchon, Jean-Baptiste Biot (1774-1862),
Lazare-Nicholas-Marguerite Carnot, and Jean-Victor Poncelet, the urge to
revitalize pure geometry. \
Monge's contribution to pure geometry was his I:il$i de giomCtrie
This subject shows how to project orthogonally a three-
descriptioe (1799).
dimensional object on two planes (a horizontal and a vertical one) so that
from this representation one can deduce mathematical properties of the
object. The scheme is useful in architecture, the design of fortifications, per-
SYNTHETIC EUCLIDEAN GEOMETRY 827
spective, carpentry, and stonecutting and was the first to treat the projection
ol a three-dimensional figure into two two-dimensional ones. The ideas
and method of descriptive geometry did not prove to be the avenue to subse-
quent developments in geometry or, for that matter, to any other part of
mathematics.
Figure 35. I
Figurc 35.2
spheres with the plane are the foci of the conic section, and the intersections
of the planc with the planes of the circles along which the spheres touch the
cone are the directrices of the conic.
Another interesting theme pursued in the nineteenth century was the
solution of maximum and minimum problems by purely geometric methods,
that is, without relying upon the calculus of variations. Of the several
theorems Jacob Steiner proved by using synthetic methods, the most
famous result is the isoperimetric theorem: Of all plane figures with a given
perimeter the circle bounds the greatest area, Steiner gave various proofs.a
Unfortunately, Steiner assumed that there exists a curve that does have
maximum area. Dirichlet tried several times to persuade him that his
proofs were incomplete on that account but Steiner insisted that this was
self-evident. Once, however, he did write (in the first of the 1842 papers):6
"and the proof is readily made if one assumes that there is a largest figure."
The proof of the existence of a maximizing curve bamed mathe-
maticians for a number of years until Weierstrass in his lectures of the 1870s
at Berlin resorted to the calculus ofvariations.o Later Constantin Caratheo-
dory (1873-1950) and StudyT in a joint paper rigorized Steiner's proofs
without employing that calculus. Their proofs (there were two) were direct
rather than indirect as in Steiner's method. Hermann Amandus Schwarz,
who did great work in partial differential equations and analysis and who
4. Jour. filr Math., 18, 1838, 281-96; and 24, 1842,83-162, 189-250; the 1842 papers are
in his Gcs, We*2, 2, 177-308.
5. Ges. Wcrkc, 2, 197.
6. Werke, 7, 25?-64, 301-2.
7. Math. Ann., 68, 1909, 133-40 : Caratheodory, Ges. math. Schtiften, 2, 3-ll.
SYNT}IETIC EUCLIDEAN GEOMETRY Bgg
Figure 35.3
8. K6ilg. G.s. der Wiss. zu Gdtt., 1884, I-13 -- Ges. math. Abh.'
Nachtichten 2' 32740-
9. Monatsben Berliner Akad-, 1837, l+4 : Ges. Werke,2, 93 and 729-31.
10. Paper unpublished, Ges. Math. Abh.' 2, 344-45.
ll. Schwarz's proof can be found in Richard Courant and Herbert Robbins, lUhat Is
Mathemalics?, Oxford University Press, 1941, pp. 346-49. A proof using the calculus was
given by J. F. dc' Toschi di Fagnano (l7lF97) in the Acla Eruditotum, 1775, p.297.
There were less elegant geometrical proofs before Schwarz's.
12. For a proof and references to published proofs see H. S. M. Coxeteq lilrodwtion lo
Geomctry, Joho Wiley and Sons, 1961, pp.23-25.
84o THE RE\/ML OF PROJECTM GEOMETRY
Figure 35.4
Figure 35.5
it the Traitc des propriltis projectiaes des fgures (1822). This work was his
as
chief contribution to projective geometry and to the erection of a new
discipline. In his later life he was obliged to devote a great deal of time to
government service, though he did hold professorships for limited periods.
Poncelet became the most ardent champion of synthetic geometry and
even attacked the analysts. He had been lriendly with the analyst Joseph-
Diez Gergonne (1771-1859) and had published papers in Gergonne's
Annahs dc Mathimatiqucs, but his attacks were soon directed to Gergonne also.
Poncelet was convinced of the autonomy and importance of pure geometry.
Though he admitted the power of analysis he believed that one could give
the same power to synthetic geometry. In a paper of l8l8, published in
Gergonne's Annales,rs he said that the power of analytic methods lay not in
the use of algebra but in its generality and this advantage resulted from the
fact that the metric properties discovered ficr a typical figure remain ap-
plicable, other than fior a possible change of sign, to all related figures which
spring from the typical or basic one. This generality could be secured in
synthetic geometry by the principle of continuity (which we shall examine
shortly).
Poncelet was the first mathematician to appreciate fully that projective
geometry was a new branch of mathematics with methods and goals of its
own. Whereas the seventeenth-century projective geometers had dealt with
specific problems, Poncelet entertained the general problem o[ seeking all
properties of geometrical figures that were common to all sections of any
projection of a figure, that is, remain unaltered by projection and section'
This is the theme that he and his successors took up. Because distances and
angles are altered by projection and section Poncelet selected and developed
the theory ofinvolution and of harmonic sets ofpoints but not the concept of
cross ratio. Monge had used parallel projection in his work; like Desargues,
Pascal, Newton, and Lambert, Poncelet used central projection, that is,
projection from a point. This concept Poncelet elevated into a method
of approach to geometric Problems' Poncelet also considered projective
transformation from one space hgure to another, ofcourse in purely geometric
form. Here he seemed to have lost interest in projective ProPerties and was
more concerned with the use of the method in bas-relief and stage design.
His work centers about three ideas. The first is that of homologous
figures. Two figures are homologous if one can be derived from the other by
one projection and a section, which is called a perspectivity, or by a sequence
of projections and sections, that is, a projectivity. In working with homol-
ogous figures his plan was to find for a given figure a simpler homologous
figure and by studying it find properties which are invariant under projection
15. Ann. tlc Math.,8, l8l7/18, l4l-55. This paper is reprinted in Poncelet's Applications
d'arulgsc ct de giomitrie (1862-64)' 2,464-76.
THE RErv'ML OF SYNTHETIC PROJECTM CEOMETRY B4z
and section, and so obtain properties of the more comPlicated figure. The
essence of this method was used by Desargues and Pascal, and Poncelet in
his Traiti praised Desargues's originality in this and other respects.
Poncelet's second leading theme is the principle of continuity. In his
Traiti he phrases it thus: "Ifone figure is derived from another by a con-
tinuous change and the latter is as general as the former, then any property of
the first figure can be asserted at once for the second figure." The determina-
tion of when both figures are general is not explained. Poncelet's principle
also asserts that if a figure should degenerate, as a hexagon does into a
pentagon when one side is made to approach zero, any property of the
original figure will carry over into an appropriately worded statement for the
degenerate figure.
The principle was really not new with Poncelet. In a broad philosophical
sense it goes back to Leibniz, who stated in 1687 that when the differences
between two cases can be made smaller than any datum in the given, the
differences can be made smaller than any given quantity in the result. Since
Leibniz's time the principle was recognized and used constantly. Monge
began the use of the principle of continuity to establish theorems. He wanted
to prove a general theorem but used a special position of the figure to Prove
it and then maintained that the theorem was true generally, even when some
elements in the figures became imaginary. Thus to prove a theorem about a
line and a surface he would prove it when the line cuts the surface and then
maintain that the result holds even when the line no longer cuts the surf;ace
and the points of intersection are imaginary. Neither Monge nor Carnot,
who also used the principle, gave any justification ofit.
Poncelet, who coined the term " principle of continuity," advanced the
principle as an absolute truth and applied it boldly in his Traiti. To "demon-
strate" its soundness he takes the theorem on the equality of the products of
the segments of intersecting chords ofa circle and notes that when the point
ofintersection moves outside the circle one obtains the equality ofthe Products
of the secants and their external segments. Further, when one secant becomes
a tangent, the tangent and its external segment become equal and their
product continues to equal the product of the other secant and its external
segment. All ofthis was reasonable enough, but Poncelet applied the principle
to prove many theorems and, like Monge, extended the principle to make
assertions about imaginary figures. (We shall note some examples later.)
The other members of the Paris Academy of Sciences criticized the
principle of continuity and regarded it as having only heuristic value.
Cauchy, in particular, criticized the principle but unfortunately his criticism
was directed at applications made by Poncelet wherein the principle did
work, The critics also charged that the confidence which Poncelet and others
had in the principle really came from the fact that it could be justified on an
algebraic basis. As a matter of fact, the notes Poncelet made in prison show
B++ THE REvrvAL oF pRoJEcrrvE GEorlrETRy
that he did use analysis to test the soundness ol the principle. These notes,
incidentally, were written up by Poncelet and published by him in two
volumes entitled Applications d'analgse et de giomltrie ( 1862-64) which is
really a revision ol his Traiti of lB22, and in the later work he does use
analytic methods. Poncelet admitted that a proof could be based on algebra
but he insisted that the principle did not depend on such a proof. However, it
is quite certain that Poncelet relied on the algebraic method to see what
should be the case and then afErmed the geometric results using the principle
as a justification.
Chasles in his Apergu defended Poncelet. Chasles's position was that the
algebra is an a posteriori proof of the principle. However, he hedged by
pointing out that one must be careful not to carry over from one figure to
another any property which depends essentially on the elements being real or
imaginary. Thus one section of a cone may be a hyperbola, and this has
asymptotes. When the section is an ellipse the asymptotes become imaginary.
Ilence one should not prove a result about the asymptotes alone because
these depend upon the particular nature of the section. Also one should not
carry the results for a parabola over to the hyperbola because the cutting
plane does not have a general position in the case of the parabola. Then he
discusses the case of two intersecting circles which have a common chord.
When the circles no longer intersect, the common chord is imaginary. The
fact that the real common chord passes through two real points is, he says,
an incidental or a contingent property. One must define the chord in some
way which does not depend on the fact that it passes through real points
when the circles intersect but so that it is a permanent property of the two
circles in any position. Thus one can define it as the (real) radical axis, which
means that it is a line such that from any point on it the tangents to the two
circles are equal, or one can define it by means ofthe property that any circle
drawn about any point ofthe line as a center cuts the two circles orthogonally.
Chasles, too, insisted that the principle of continuity is suited to treat
imaginary elements in geometry. He first explains what one means by the
imaginary in geometry. The im4ginary elements pertain to a condition or
state ofa figure in which certain parts are nonexistent, provided these parts
are real in another state ofthe figure. For, he adds, one cannot have any idea
of imaginary quantities except by thinking of the related states in which the
quantities are real. These latter states are the ones which he called " acci-
dental" and which furnish the key to the imaginary in geometry. To prove
results about imaginary elements one has only to take the general condition
of the figure in which the elements are real, and then, according to the
principle of accidental relations or the principle of continuity, one may
conclude that the result holds when the elements are imaginary- " So one sees
that the use and the consideration of the imaginary are completely justified."
The principle of continuity was accepted during the nineteenth century as
THE REvrvAL oF syNTHETrc pRoJEcrrvE cEoMETRy B+S
intuitively clear and therefore having the status of an axiom. The geometers
used it freely and never deemed that it required proof.
Though Poncelet used the principle of continuity to assert results about
imaginary points and lines he never gave any general definition of such
elements. To introduce some imaginary points he gave a geometrical
definition which is complicated and hardly perspicuous. We shall under-
stand these imaginary elements more readily when we discuss them from an
algebraic point ofview. Despite the lack of clarity in Poncelet's approach he
must be credited with introducing the notion of the circular points at
infinity, that is, two imaginary points situated on the line at infinity and
common to any two circles.lB He also introduced the imaginary spherical
circle which any two spheres have in common. He then proved that two
real conics which do not intersect have two imaginary common chords, and
two conics intersect in four points, real or imaginary'
The third leading idea in Poncelet's work is the notion ofpole and polar
with respect to a conic. The concept goes back to Apollonius and was used
by Desargues (Chap. 14, sec. 3) and others in the seventeenth-century work
on projective geometry. AIso Euler, Legendre, Monge, Servois, and Brian-
chon had already used it. But Poncelet gave a general formulation of the
transformation from pole to polar and conversely and used it in his TraitC of
lB22 and in his " M€moire sur la th6orie g6n€rale des polaires rdciproques "
presented to the Paris Academy in 182411 as a method of establishing many
theorems.
One of Poncelet's objectives in studying polar reciprocation with
respect to a conic was to establish the principle of duality. The workers in
projective geometry had observed that theorems dealing with figures lying
in one plane when rephrased by replacing the word "point" by "line" and
"line" by "point" not only made sense but proved to be true. The reason
for the validity of theorems resulting from such a rephrasing was not clear,
and in fact Brianchon questioned the principle. Poncelet thought that the
pole and polar relationship was the reason.
llowever, this relation required the mediation of a conic. Gergonne 18
insisted that the principle was a general one and applied to all statements and
theorems except those involving metric properties. Pole and polar were not
needed as an intermediary supporting device. He introduced the term
"duality" to denote the relationship between the original and new theorem.
He also observed that in three-dimensional situations point and plane are
dual elements and the line is dual to itself.
To illustrate Gergonne's understanding of the principle of duality let us
examine his dualization of Desargues's triangle theorem. We should note
16. Traitd, l, 48.
17. Jour. far Math.,4, 1829, l-71.
lB. Ann. de Math., 16, 1825-26, 209-31.
8+6 THE REvrvAL oF PRoJEcrrvE GEoMETRY
first what the dual ofa triangle is. A triangle consists ofthree points not all on
the same line, and the lines joining them. The dual figure consists of three
Iines not all on the same Point,,and the points joining them (the points of
interscction). The dual figure is again a triangle and so the triangle is called
self-dual. Gergonne invented the scheme of writing dual theorems in double
columns with the dual alongside the original proposition.
Now let us consider Desargues's theorem, where this time the two
triangles and the point O lie in one plane , and let us see what results when we
interchange point and line. Gergonne in the 1825-26 paper already referred
to wrotc this theorem and its dual as follows:
Figure 35.6
Figure 35.7
conic. Moreover, the conic passes through P1 and, P2 (Fig. 35.7)' In this
manner Steiner built up the conics or second degree curves by means of the
simpler forms, pencils of lines. Flowever, he did not identify his conics with
sections of a cone.
He also built up the ruled quadrics, the hyperboloid ofone sheet and the
hyperbolic paraboloid, in'a similar manner, making projective correspond-
ence the basis of his definitions. Actually his method is not sufficiently
general for all of projective geometry.
In his proofs he used cross ratio as a fundamental tool. However, he
ignored imaginary elements and referred to them as "the ghosts" or the
'ishadows of geometry." He also did not use signed quantities though
M<ibius, whose work we shall soon examine, had already introduced them'
Steiner used the principle ol duality from the very outset of his work'
Thus he dualized the definition of a conic to obtain a new structure called
a line curve. If one starts with two projectively (but not perspectively)
related pencils of points then the family of lines (Fig. 35.8) joining the
corresponding points of the two pencils is called a line conic. To distinguish
such families of lines, which also describe a curve' the usual curve as a locus
of points is called a point curve. The tangent lines of a point curve are a line
curve and in the case of a conic do constitute the dual curve. Conversely
every line conic envelopes a point conic or is the collection of tangents of a
point conic.
With Steiner's notion of a dual to a point conic, one can dualize many
theorems. Let us take Pascal's theorem and form the dual statement. We
shall write the theorem on the left and the new statement on the right.
Pascal's Thcorem Dual of Pascal's Theorem
ar.d' If we take six lines a, b, c, d, e, and'
If we take six points, A, B,C, D, E,
.F on the point conic, then the lines ;fon the line conic, then the points
which join Aall,dB and D and E which join a and b and dande are
meet in a point P; the lines joined by the line 1; the points
which join B and C and E andF which join D and e and e andf are
meet in a point Q; joined bY the line q;
the lines which join C ar,d D and F the points which join c atd d and f
and ,4 meet in a point .R. and a are joined by the line r'
The three points P, Q, and R lie on The three lines p, q, and r are on
one line l. one Point I.
Figure 35.8
Figurc 35.9
added the condition that the transformation must preserve cross ratio but
this latter fact can be proved. The transformation that carries points into
lincs and lines into points he called a correlation.
Though he defended pure geometry Chasles thought analytically but
presented his proofs and results geometrically. This approach is called the
" mixed method " and was used later by others.
By 1850 the general concepts and goals of projective geometry as
distinguished from Euclidean geometry were clear; nevertheless the logical
relationship of the two geometries was not clarified. The concept of length
was used in projective geometry from Desargues to Chasles. In fact the
concept of cross ratio was defined in terms of length. Yet length is not a
projective concept because it is not invariant under projective transformation.
Karl Georg Christian von Staudt (1798-1867), a professor at Erlangen who
was interested in foundations, decided to free projective geometry from
dependence on length and congruence. The essence of his plan, presented in
his Gcomctrir dn Lage (Geometry of Position, 1847), was to introduce an
analogue of length on a projective basis. His scheme is called "the algebra
of throws." One chooses three arbitrary points on a line and assigns to them
the symbols 0, l, o. Then by means of a geometric construction (which in
itself comes from M<ibius)-a " throw "-a symbol is attached to any
arbitrary point P.
To see what the construction amounts to in Euclidean geometry
suppose we start with the points labelled 0 and I on a line (Fig. 35.9).
Through the point M on a parallel line draw 0M and then draw I iy' parallel
to 0M. Now draw lM and draw .l[2 parallel to lM. lt is immediate that
0l : l2 because opposite sides ofa parallelogram are equal. Thus one carries
over the length 0l to 12 by a geometric construction.
Now to treat the projective case we start with three points 0, l, oo (Fig.
35.10). The point oo lies on 1.", the line at infinity, but this isjust an ordinary
line in projective geometry. Now pick a point M and draw a " parallel " to
THE REVTAL oF syNTrrETrc pRoJEcrrvE GEoMETRv 85,
the line 0I through ,r11. This means that the line through .&/ must meet the
line 0l at co and so we draw Mo. Now draw 0M and prolong it to /-. Next
we draw through I a " parallel " to 0M. This means the " parallel " through
I must meet 0M on I*. We thereby get the line lP and this determines .tr[.
Now draw lM and prolong it until it meets /- at Q. The line through iI
parallel to lM is Q.l[ and where it meets 0l we get a point which is labeled 2.
By this type of construction we can attach " rational coordinates " to
points on the line 0lo. To assign irrational numbers to points on the line one
must introduce an axiom of continuity (Chap. 4l). This notion was not well
understood at that time and as a consequence von Staudt's work lacked
rigor.
Von Staudt's assignment of coordinates to the points of a line did not
involve length. His coordinates, though they were the usual number symbols,
served as systematic identification symbols for the points. To add or subtract
such " numbers " von Staudt could not then use the laws of arithmetic.
Instead he gave geometric constructions that defined the operations with
these symbols so that, for example, the sum of the numbers 2 and 3 is the
number 5. These operations obeyed all the usual laws of numbers. Thus his
symbols or coordinates could be treated as ordinary numbers wen though
they were built up geometrically.
With these labels attached to his points von Staudt could define the
cross ratio offour points. If the coordinates of these points are x1, r21 .rsl zrrd
xn, then the cross ratio is defined to be
*t-*slxr-x,
xr - xal x, - x4'
Thus von Staudt had the fundamental tools to build up projective geometry
without depending on the notions of length and congruence.
A harmonic set of four points is one for which t}re cross ratio is - l. On
the basis of harmonic sets von Staudt gave the fundamental definition that
two pencils of points are projectively related when under a one-to-one
correspondence of their members a harmonic set corresponds to a harmonic
set. Four concurrent lines form a harmonic set if the points in which they meet
an arbitrary transversal constitute a harmonic set of points. Thus the pro-
jectivity of two pencils of lines can also be defined. With these notions von
Staudt defined a collineation of the plane into itself as a one-to-one trans-
formation of point to point and line to line and showed that it carries a set
of harmonic elements into a set of harmonic'elements.
The principal contribution of von Staudt in his Geomctric dzr Lagc was to
show that projective geometry is indeed more fundamental than Euclidean.
Its concepts are logically antecedent. This book and his Bcitriigc zur Gcorrurtc
der Lagc (Conributions to the Geometry of Position, 1856, 1857, 1860)
revealed projective geometry as a subject independent of distance. However,
852 THE RE\/ML OF PROJECTM GEOMETRY
he did use the parallel axiom ol Euclidean geometry, which from a logical
standpoint was a blemish because parallelism is not a projective invariant.
This blemish was removed by Felix Klein.2l
21. Math. Ann.,6, 1873, 11245: Ges. Abh.,l,3ll-43. See also Chap' 38, sec' 3'
22. Published 1827 : lilerke, l, l-388.
Bss
Figure 35.1 I
M6bius introduced signed elements in geometry not only for the line
segments but for areas and volumes. Consequently he was able to give a
complete treatment of the notion of the signed cross ratio of four points on a
Iine. He also showed that the cross ratio of four Iines of a pencil can be
expressed in terms of the sines of the angles at the vertex P (Fig' 35' I 1) by
:0
#.r,*f{,-;*ff".;
can be interpreted, when 17, lz, ls are the running coordinates, as the
equation of thc tangent at the point (xr, xr,4) or, when xy *27 *e are the
running coordinates, as the equation of the polar line of the arbitrary point
(4, ,lz, li with respect to the conic.
Using homogeneous coordinates Pltcker gave the algebraic formulation
of the infinitely distant line, the circular Points at infinity, and other notions.
In tlre homogeneous coordinate system (xb xr, xs) the equation of the
infinitely distant line is r. : g. This line is not excePtional in projective
gcomctry, but in our visualization of the geometric elements each normal
point of tlre Euclidean plane is at a finite position given by x : xrlxs and
! : xzlxs and so we are obliged to think of points on rs : 0 as infinitely
distant.
Tbe equation of a circle
(r-o)'+(y-b)2:r'
becomes on the introduction of homogeneous Cartesian coordinates 11, *2,
and 4 through x : : Yr126"
xrlxa andY
(*, - ax")' *(*r- bxs)z:72fi'
Sincc the equation of the infinitely distant line is r, = g, the intersection of
this linc with the circle is given by
and now require that the line pass through the points (xu ly zt) and (1, i, 0)
then the resulting nonhomogeneous equation of the line is
x-x6*i(Y-Ci:O
wherein ro : xJzyand,yo : gr/zr. Likewise the equation of the line through
(xtgt zt) and (1, -i,0) is
)t-xo-i(Y-S;:0'
Each of these lines is perpendicular to itself because the slope equals its
negative reciprocal. Sophus Lie called them crazy lines; they are now called
isotropic lines.
Phicker's efforts to treat duality algebraically led him to a beautiful
idea, line coordinates.2* Ifthe equation ofa line in homogeneous coordinates
is
uxttU+wz:0,
and if x, y, and z are fixed quantities, u, a, and w or any three numbers
proportional to them are the coordinates ofa line in the plane. Then just as
an equation;f(* b $s) : 0 represents a collection ofpoints, so./(z, a, w) : g
x2,
represents a collection of lines or a line curve.
With this notion of line coordinates Plircker was able to give an algebraic
formulation and proof of the principle of duality. Given any equation
f(r, s, t) : 0 if we interpret r, s, and , as the homogeneous coordinates
xr, x2, and. x" ofa point then we have the equation ofa point curve, whereas
if we interpret them as u, a, and' zo we have the dual line curve' Any property
proved by algebraic Processes for the point curve will, because the algebra
is the same under either interpretation ofthe variables, give rise to the dual
property of the line curve.
In this second lB30 paper and in Volume 2 ofhis Entwickclungcz Plficker
also pointed out that the very same curve regarded as a collection of poins
be regarded as the collection oflines tangent to the curve because the
".rr.1ro
tangents determine the shape as much as the Points do. The family of tan-
geris is a line curve and has an equation in line coordinates' The degree of
this equation is called the class of the curve, whereas the degree of the
equation in point coordinates is called the order.
wherelisaparameter.
Using this scheme Phicker gave a clear explanation of Cramer's
paradox (Chap. 23, sec. 3). A general curve C,, is determined by n(n + 3)12
points bdcause this is the number of essential coefficients in its equation. On
the other hand, since two Co's intersect in z2 points, through the n(n + 3)12
points in which two Co's intersect an infinite number of other Co's pass. The
apparent contradiction was explained by Pliicker.2s Any two ,rth degree
curves do indeed meet in z2 points. However, only (nl2) (n + 3) - I points
are independent. That is, if we take two zth degree curves through the
(nl2) (n + 3) - I points any other nth degree curve through these points
will pass through the remaining (n - l)(n - 2)12 of the z2 points. Thus
when z :4, 13 points are independent. Any two curves through the 13
points determine 16 points but any other curve through the 13 will pass
through the remaining three.
Phicker then took up28 the theory of intersections of an rath degree
curve and an nth degree curve. He regarded the latter as fixed and the curves
intersecting it as variable. Using the abridged notation C" for the expression
for the zth degree curve and similar notation for the others he wrote
C^:Ci"+A^-nCn:0
for thecase where rn > n so that A^-n is a polynomial of degree m - z.
From this equation Plfrcker got the correct method of determining the inter-
25. Annales de Math., 19, 1828, 97-106 = l[iss, Abh,, l, 7U82,
26. Jour. fiir Math., 16, 1837, +7-54.
HIGHER PLANE CURVES AND SURFACES 8sz
sections ofa C, with all curves ofdegree z. Since according to this equation
m - n + I linearly independent curves (the number of coefficients it A^-")
pass through the intersections of Ci, and Cn, Pliicker's conclusion was,
given mn - (n - 1) (n - 2)12 arbitrary Points on Cn; the remaining
(n - l)(n - 2) 12 of the zn intersection points with a C- are determined.
The same result was obtained at about the same time by Jacobi.27
ln his Sgstetn of lB34 and more explicitly it his Theorie der algebraisclun
Curucn (1839) Pliicker gave what are now called the Phicker formulas which
relate the order n and the class,t ofa curve and the simple singularities' Let
d be the number of double points (singular points at which the two tangents
are distinct) and r the number of cusps. To the double points there corre-
spond in the line curve double tangents (a double tangent is actually tangent
at two distinct points) the number of which is, say, t. To the cusps there corre-
spond osculating tangents (tangents which cross the curve at inflection points)
the number of which is, say, zc'. Then Pliicker was able to establish the follow-
ing dual formulas:
k:n(n - l) - 2d-3r n:k(k - l) - 2t-3w
w:3n(n-2) -Gd-Br r:3k(k-2) -Gt -Bw'
The number of any one element includes real and imaginary cases'
In the case when z : 3, d :0, and r : 0, then ro, the number of in-
flection points, is 9. Up to Phicker's time, De Gua and Maclaurin had proved
that a line through two inflection points of the general curve of third order
goes through a third one, and the fact that a general C" had three real in-
flection points had been assumed lrom Clairaut's time on. In his Syskrz of
1834 Phicker proved that a C, has either one or three real inflection points,
and in the latter case they lie on one line' He also arrived at the more
general result which takes complex elements into account. A general Cs has
nine inflection points of which six are imaginary. To derive this result he
used his principle of counting constants to show that
C,:fgh-13,
where rf, g,h, and I are linear forms, and derived ttre result of De Gua and
Maclaurin. He then showed (with incomplete arguments) that the nine
inflection points ofC3 lie three on a line so that there are twelve such lines'
Ludwig Otto Hesse (l8ll-74), who served as a proGssor at several univer-
sities, completed Pliickerts proof2s and showed that the twelve lines can be
grouped into four triangles.
As an additional example of the discovery of general properties of
curves we shall consider the problem ofthe inflection points ofan zth degree
curve;f(r, y) :0. Pliicker had expressed the usual calculus condition for
an inflection point when y : -f (x), namely, dzyldxz :0, in the form
aPProPriate to f(x,y): 0 and obtained an equation of degree 3n - 4.
Since the original curve and the new curve must have r(3n - 4) inter-
sections, it appeared that the original curve has n(3n - 4) points of in-
flection. Because this number was too large Plircker suggested that the curve
of the equation of degree 3z - 4 has a tangential contact with each of the n
infinite branches of the original curve;f - 0, so that 2z ofthe common points
are not inflection points, and so obtained the correct number, 3n(n - 2)-
Hesse clarified this point2e by using homogeneous coordinates; that is, he
replaced xby x1lxs, andy by x2lxs, and by using Euler's theorem on homo-
geneous functions he showed that Plficker's equation lor the inflection Points
can be written as
x:rz+P, !:sz+o
in which the four parameters t, p, s, and, o fix the line. Now lines can be used
to build up all of space, since, for example, planes are no more than col-
lections of lines, and points are intersections of lines. Phicker then said that
if lines are regarded as the fundamental element of space, space is four-
dimensional because four parameters are needed to cover all of space with
lines. The notion of a four-dimensional space of points he rejected as too
Bibliograplry
Berzolari, Luigi: "Allgemeine Theorie der h6heren ebenen algebraischen Kurven,"
Encyk. der Math. Wiss., B. G' Teubner, 1903-15, III C4,313-455'
32. Cambritlge and Dublin Math. Jour.,4, 1849, l18-32 : Mdth. Palqs' l,'*45-56'
33. Math. Ann., 4, l87l , 28+-345.
3+. Morahber. Baliner Akad., 1864, 246-60, 49F99.
86o THE REVIvAL oF PRoJEcTIvE GEoMETRY
Boyer, Carl B.: History of Analglic Geonutry, Scripta Mathematica, 1956, Chaps. 8-
9.
Brill, A., and M. Noether: " Die Entwicklung der Theorie der algebraischen
Functionen in ilterer und neuerer Zeit," Jahres. d.er Deut. Math.'Verein,,3,
1892/3, 109-566, 287-312 in particular.
Cajori, Florian: A History of Mathematits,2nd ed., Macmillan, 1919, pp. 286-302,
309-14.
Coolidge, Julian L.: A Treatisc on the Circle and the Sphere , Oxford University Press,
1916.
A Histoty oJ Geomehiral Methods, Dover (reprint), 1963, Book I, Chap.5 and
Book II, Chap. 2.
A Histary of thc Conb Scctions and Quadric Surfaccs, Dover (reprint), 1968.
" The Rise and Fall of Projective Geometry," American Malhematical
-: Monthly, 41, 1934, 217-28.
Fano, G.: " Gegensatz von synthetischer und analytischer Geometrie in seiner
historischen Entwicklung im XIX. Jahrhundert," Encyk. der Math. Wiss,,
B. G. Teubner 1907-10, III AB4a, 221-88.
Klein, Felix: Elementary Mathzmatics fron an A&tanced Stand|oint, Macmillan, 1939;
Dover (reprint), 1945, Geometry, Part 2.
Kotter, Ernst: " Die Entwickelung der synthetischen Geometrie von Monge bis auf
- Staudt," 1847, Jahrcs. der Deut. Math.-Vercin., Vol. 5, Part II, 1896 (pub.
l90l), l-486.
M6bius, August F.: Der barycentrische Calcul (1827), Georg Olms (reprint), 1968.
Also in Vol. I of Gesammelte Weilez, pp, l-388.
Gesammaltc Wnkc, 4 vols., S. Hirzel, 1885-47; Springer-Verlag (reprint),
1967.
Pliiclrer, Jtriius Gesammelte wisnnschdtliche Abhand.lungen, 2 vols., B. G. Teubner,
l89s-96.
: " Projektive Geometrie," Encyk. der Math. Wiss., B, G. Teubner,
Schoenflies, A.
1907-10, III AB5, 389-480.
Smith, David Eugene: A Source Book in Mathemalics, Dover (reprint), 1959' Vol. 2,
pp. 3l$-23, 33145, 670-76.
Steiner, Jacob: Gcometrical Constructbns Wirh a Ruler (a translation ofhis 1833 book),
Scripta Mathematica, 1950.
Gesanmelte Werla,2 vols., G. Reimer, l88l-82; Chelsea (reprint), 1971.
Zacharias, M, : " Elementargeometrie and elementare nicht-euklidische Geometrie
in synthetischer Behandlung," Encyk. der Math. Wiss., B. G. Teubner,
1907-10, III, AB9, 859-l 172.
36
Non-Euclidean Geometry
l. Introduction
Amidst all the complex technical creations of the nineteenth century the
most profound one, non-Euclidean geometry, was technically the simplest'
This ireation gave rise to important new branches of mathematics but its
most significait implication is that it obliged mathematicians to revise
radicalf their understanding of the nature of mathematics and its relrtion
to the physical world. It also gave rise to problems in the foundations of
is still struggling. As we shall
-athe-aiic. with which the twintieth century of a long series of efforts in
see, non-Euclidean geometry was the culmination
the area of Euclidean geometry. The fruition of this work came in the early
nineteenth century during the same decades in which projective geometry
was being revived and extended. flowever, the two domains were
not related
to each other at this time.
86I
86r
2. See, for example, the reference to Bonola in the bibliography at the end ofthc chapter.
NON-EUCLIDEAN GEOMETRY
theorem." Proclus Points out that while it is necessary to believe that two
lines will tend toward each other on the side of the transversal where the
sum of the interior angles is lgss than two right angles, it is not so clear that
these two lines will actually intersect at a finite point. This conclusion is only
probable. For, he continues, there are certain curves that approach each other
more and more but do not actually meet. Thus, a hyperbola approaches but
does not meet its asymPtote. Hence might not this be true of the two lines in
Euclid's postulate ? He then says that up to a certain sum of the interior
angles on one side of the transversal the two lines might indeed meet; how-
ever, for a value slightly greater but still less than two right angles, the lines
might be asymptotic.
Proclus based his own proof of the parallel postulate on an axiom which
Aristotle used to prove the universe is finite. The axiom says, " Il from one
point two straight lines forming an angle be produced indefinitely' the
successive distances between the said straight lines [perpendiculars from one
onto the other] will ultimately exceed any finite magnitude." Proclus' proof
was essentially correct but he substituted one questionable axiom for another.
Nasir-Eddin (1201-74), the Persian editor of Euclid, likewise gave a
" proof" of Euclid's parallel postulate by assuming that two non-parallel lines
approach each other in one direction and diverge in the other. Specifically,
if AB and CD (Fig. 36.2) are two lines cut by GH, JK, LM, . . ., if these latter
are perpendicular to AB, and if the angles 1,3,5,... are obtuse while
2,+,6,... are acute thenGH > JK > LM '... This fact, Nasir-Eddin says,
is clearly seen.
Wallis did some work on the parallel axiom in 1663 which he published
in 1693.8 First he reproduced Nasir-Eddin's work on the parallel axiom which
he had translated for him by a professor of Arabic at Oxford. Incidentally,
this was how Nasir-Eddin's work on the parallel axiom became known to
Europe. Wallis then criticized Nasir-Eddin's proof and offered his own proof
of Euclid's assertion. His proof rests on the exPlicit assumption that to each
triangle there is a similar one whose sides have any given ratio to the sides
of the original one. Wallis believed that this axiom was more evident than
3. Opera,2, 669-78.
THE RESEARcH ON THE PARALLEL AXIOM 86S
every triangle. Also if the sum is less than two right angles in one triangle
it is so in every triangle. Then he gives the proofthat if the sum ofthe angles
of any triangle is two right angles, Euclid's parallel postulate holds. This
work on the sum ofthe angles ofa triangle was also fruitless because Legendre
failed to show (without the aid ofthe parallel axiom or an equivalent axiom)
that the sum of the angles ofa triangle cannot be less than two right angles.
The efforts described thus far were mainly attempts to find a more self-
evident substitute axiom for Euclid's parallel axiom and many ofthe proposed
axioms did seem intuitively more self-evident. Consequently their creators
were satisfied that they had accomplished their objective. However, closer
examination showed that these substitute axioms were not really more
satisfactory. Some of them made assertions about what happens indefinitely
Iiar out in space, Thus, to require that there be a circle through any three
points not in a straight line requires larger and larger circles as the three
points approach collinearity. On the other hand the substitute axioms that
did not involve "infinity" directly, for example, the axiom that there exist
two similar but unequal triangles, were seen to be rather complex assump-
tions and by no means preferable to Euclid's own parallel axiom.
The second group of efforts to solve the problem of the parallel axiom
sought to deduce Euclid's assertion from the other nine axioms. The deduc-
tion could be direct or indirect. Ptolemy had attempted a direct proof. The
indirect method assumes some contradictory assertion in place of Euclid's
statemcnt and attempts to deduce a contradiction within this new body of
ensuing theorems. For example, since Euclid's parallel axiom is equivalent
to the axiom that through a point P not on a line / there is one and only one
parallel to /, there are lwo alternatives to this axiom. One is that there are no
parallels to / through P and the other is that there is more than one parallel
to I through P. If by adopting each of these in place of the "one parallel "
axiom one could show that the new set led to a contradiction, then these
alternatives would have to be rejected and the "one parallel " assertion
would be proaed.
The most significant effort of this sort was made by Gerolamo Saccheri
(1667-1733), a Jesuit priest and professor at the University of Pavia. He
studied the work of Nasir-Eddin and Wallis carefully and then adopted his
own approach. Saccheri started with the quadrilateral (Fig. 36.3) ABCD in
which z4 and B are right angles and AC : BD.lt is then easy to prove that
+C : 4D. Now Euclid's parallel axiom is equivalent to the assertion that
angles C and D are right angles. Hence Saccheri considered the two possible
alternatives :
Saccheri proved that angles C and D must be right angles. Thus with this
hypothesis he did deduce a contradiction.
Saccheri next considered the second hypothesis and proved many
interesting theorems. He continued until he reached the following theorcm:
Given any point .,4 and a line D (Fig. 36.a), on the hypothesis of the acute
angle there exist in the pencil (family) of lines through I two lines I and
4 which divide the pencil into two Parts. The first of these consists of the
lines which intersect D, and the second consists of those lines (lying in angle
cr) which have a common perpendicular with 6 somewhere along D. The
lines p and 4 themsclves are asymptotic to 6. From this result and a lengthy
subsequent argument Saccheri deduced that, and D would have a com,mon
perpendicular at their common point, which is at infinity. Even though he
had not obtained any contradiction, Saccheri found this conclusion and
others so repugnant that he decided the hypothesis ofthe acute angle must be
false.
There remained only the hypothesis that angles Cand D of Figure 36.3
are right angles. Saccheri had previously proved that when C and D arc
right angles, the sum of the angles of any triangle equals l80o and that'this
fact implies Euclid's parallel postulate. He therefore felt justified in con-
cluding that Euclid was upheld and so published his Eucli.dts ab Onni Nmo
Vintlicatus (Euclid Vindicated from All Faults, 1733). Howevet, since
Saccheri did not obtain a contradiction on the basis of the acutc angle
hypothesis the problem of the parallel axiom was still open.
The efforts to find an acceptable substitute for the Euclidean axiom on
parallels or to prove that the Euclidean assertion must be a theorem were so
numerous and so futile that in 1759 d'Alembert called the problem of the
parallel axiom "the scandal of the elements of geometry'"
introduced for the first time the thought that experience rather than
self-evidence substantiated the axioms. Kltigel expressed doubt that the
Euclidean assertion could be proved. He realized that Saccheri had'not
arrived at a contradiction but merely at results that seemed at variance with
experience.
Klngel's paper suggested work on the parallel axiom to Lambert. In his
book, Tlvoie der Parallellinien written in 1766 and published in I 786,6
Lambert, somewhat like Saccheri, considered a quadrilateral with three
.
right angles and considered the possibilities of the fourth angle being right,
obtuse, and acute. Lambert did discard the obtuse angle hypothesis because
it Ied to a contradiction. However, unlike Saccheri, Lambert did not conclude
that he had obtained a contradiction from the acute angle hypothesis.
The consequences Lambert deduced from both the obtuse and acute
angle hypothesis, respectively, even though the former did lead to a contra-
diction, are noteworthy. His most remarkable result is that under either
hypothesis the area of a polygon of n sides is proportional to the difference
between the sum of the angles and,2n - 4 right angles. (Saccheri had this
result for a triangle.) He also noted that the obtuse angle hypothesis gave
rise to theorems just like those which hold for figures on the surface of a
sphere. And he conjectured that the theorems that followed from the acute
angle hypothesis would apply to figures on a sphere of imaginary radius.
This led him to write a paperT on the trigonometric functions of imaginary
angles, that is, dl, where I is a real angle and i : \/=, which in effect
introduced the hyperbolic functions (Chap. 19, sec. 2). We shall see a little
more clearly later just what Lambert's observations mean.
Lambert's views on geometry were quite advanced. He realized that
any body ofhypotheses which did not lead to contradictions offered a possible
geometry. Such a geometry would be a valid logical structure even though it
might have little to do with real 6gures. The latter might be suggestive of a
particular geometry but do not control the variety of logically developable
geometries. Lambert did not reach the more radical conclusion that was
introduced somewhat later by Gauss.
Still another forward step was made by Ferdinand Karl Schweikart
(1780-1859), a professor ofjurisprudence, who devoted spare time to mathe-
matics. He worked on non-Euclidean geometry during the period in which
Gauss devoted some thought to the subject but Schweikart arrived at his
conclusions independently. He was, however, influenced by Saccheri's and
Lambert's work. In a memorandum ol lBl6 which he sent to Gauss in lBlB
for approval, Schweikart actually distinguished two geometries. There is the
geometry of Euclid and a geometry based on the assumption that the sum
of the angles of a triangle is not two right angles. This latter geometry he
6. Magazin.ftu reine und angewandte Mathematik, 1786, 137-64, 325-58.
7 . H*t. de I'Acad. de Berlin, 24, 1768, 327
-54, pub. 1770 : O?era Mathematica, 2, 245-69.
THE cREATToN oF NoN-EUcLTDEAN GEoMETRY 86g
called astral geometry, because it might hold in the space of the stars, and its
theorems were those which Saccheri and Lambert had established on the
basis of the acute angle hypothesis.
Franz Adolf Taurinus (1794-lS7 4), a nephew of Schweikart, took up
his uncle's suggestion to study astral geometry. Though he established in his
Geometriae Prima Elemmta ( 1826) some new results, notably some analytic
geometry, he concluded that only Euclid's geometry could be true of
physical space but that the astral geometry was hgically consistetut. Taurinus
also showed that the formulas which would hold on a sphere of imaginary
radius are precisely those that hold in his astral geometry.
The work of Lambert, Schweikart, and Taurinus constitutes advances
that warrant recapitulation. All three and other men such as Kliigel and
Abraham G. Kiistner (1719-1800), a professor at Gitttingen' were convinced
that Euclid's parallel axiom could not be proven, that is, it is independent of
Euclid's other axioms. Further, Lambert, Schweikart, and Taurinus were
convinced that it is possible to adopt an alternative axiom contradicting
Euclid's and build a logically consistent geometry. Lambert made no
assertions about the applicability ofsuch a geometry; Taurinus thought it is
not applicable to physical space; but Schweikart believed it might aPPly to
the region of the stars. These three men also noted that the geometry on a
real sphere has the properties of the geometry based on the obtuse angle
hypothesis (if one leaves aside the contradiction which results from the latter)
and the geometry on a sphere of imaginary radius has the properties of the
geometry based on the acute angle hypothesis. Thus all three recognized the
existence of a non-Euclidean geometry but they missed one fundamental
point, namely, that Euclidean geometry is not the only geometry that
describes the properties of physical sPace to within the accuracy for which
experience can vouch.
does. The latter is not the necessary geometry of physical (pace; its physical
trutfi cannot be guaranteed on any a priori grounds. This realization, which
did not call for any technical mathematical development because this had
already been done, was first achieved by Gauss.
Carl Friedrich Gauss (1777-1855) was the son of a mason in the German
city of Brunswick and seemed destined for manual work. But the director of
the school at which he rcceived his elementary education was struck.by
Gauss's intelligence and called him to the attention of Duke KarI Wilhelm.
The Duke sent Gauss to a higher school and then in I 795 to the University
of G6ttingen. Gauss now began to work hard on his ideas. At eighteen he
invented the method of least squ,rres and at nineteen he showed that the
l7-sidcd regular polygon is constructible. These successes convinced him that
he should turn from philology to mathematics. In l79B he transferred to the
University of Helmstiidt and there he was noticed by Johann Friedrich
Pfaff who became his teacher and friend. After finishing his doctor's degree
Gauss returned to Brunswick where he wrote some of his most famous papers.
This wort earned for him in 1807 the appointment as professor of astronomy
and director of the observatory at Gtittingen. Except for one visit to Berlin
to attcnd a scientific meeting he remained at G<ittingen for the rest of his life.
He did not like to teach and said so. However, he did enjoy social life, was
married twice, and raised a liamily.
Gauss's fust major work was his doctoral thesis in which he proved the
fundamental theorem of algebra. In l80l he published the classic Dis-
Arithmeticac. His mathematical work in differential geometry, the
" Disq u:sitiones Generales circa Superficies Curvas" (General Investigations
of Curved Surfaces, lB27), which, incidentally, was the outcome of his
interest in surveying, geodesy, and map-making, is a mathematical landmark
(Chap. 37, sec. 2). He made many other contributions to algebra, complex
functions, and potential theory. In unpublished papers he recorded his
innovative work in two major fields: the elliptic functions and non-Euclidean
g€ometry.
His interests in physics were equally broad and he devoted most of his
energy to them. When Giuseppe Piazzi (1746-1826) discovered the planet
Ccres in l80l Gauss undertook to determine its path. This was the beginning
of his work in astronomy, the activity that absorbed him most and to which
he devoted about twenty years. One ofhis great publications in this area is his
Tluoia Motus Corporam Coelcstiun (Theory of Motion of the Heavenly Bodies,
lB09). Gauss also earned great distinction in his physical research on
theoretical and experimental magnetism. Maxwell says in his Electricitg and
Magnetism that Gauss's studies of magnetism reconstructed the whole science,
the instruments used, the methods of observation, and the calculation of
results. Gauss's papers on terrestrial magnetism are models of physical
research and supplied the best method of measuring the earth's magnetic
THE cREATToN oF NoN-EUcLTDEAN GEoMETRY 87t
field. His work on astronomy and magnetism opened up a new and brilliant
period of alliance between mathematics and physics.
Though Gauss and Wilhelm Weber (1804-91) did not invent the idea
of telegraphy, in 1833 they improved on earlier techniques with a practical
device which made a needle rotate right and left depending upon the direction
of current sent over a wire. Gauss also worked in optics, which had been
neglected since Euler's days, and his investigations of 1838-41 gave a totally
new basis for the handling of optical problems.
The universality of Gauss's activities is all the more remarkable because
his contemporaries had begun to confine themselves to specialized investiga-
tions. Despite the fact that he is acknowledged to be the greatest mathe-
matician at least since Newton, Gauss was not so much an innovator as a
transitional figure from the eighteenth to the nineteenth century. Although
he achieved some new views which did engage other mathematicians he was
oriented more to the past than to the future. Felix Klein describes Gauss's
position in these words : We could have a tableau of the development of
mathematics if we would imagine a chain of high mountains representing
the men of the eighteenth century terminated by an imposing summit-
Gauss-then a large and rich region filled with new elements of life. Gauss's
contemporaries aPPreciated his genius and by the time of his death in 1855
he was widely venerated and called the " prince of mathematicians."
Gauss published relatively little of his work because he polished what-
ever he did partly to achieve elegance and partly to achieve for his demon-
strations the maximum of conciseness without sacrificing rigor, at least the
rigor of his time. In the case of non-Euclidean geometry he published no
definitive work. He said in a letter to Bessel ofJanuary 27,1829, that he
probably would never publish his findings in this subject because he feared
ridicule, or, as he put it, he feared the clamor of the Boeotians, a figurative
reference to a dull-witted Greek tribe. Gauss may have been overly cautious,
but one must remember that though some mathematicians had been
gradually reaching ihe climax of the work in non-Euclidean geometry the
intellectual world at large was still dominated by Kant's teachings. What we
do know about Gauss's work in non-Euclidean geometry is gleaned from his
letters to friends, two short reviews in the Gdttingische Gchhrte Anzeigan of l816
and lB22 and some notes of t83l found among his papers after his death'8
Gauss was fully aware of the vain efforts to establish Euclid's Parallel
postulate because this was common knowledge in Gtittingen and the whole
history of these efforts was thoroughly familiar to Gauss's teacher Kistner'
Gauss told his friend Schumacher that as far back as 1792 (Gauss was then
fifteen) he had already grasped the idea that there could be a logical geometry
in which Euclid's parallel postulate did not hold. By 1794 Gauss had found
8. Werke,I, 157-268, contains all of the above and the l€tter discussed bclow'
8zz
that in his concept ofnon-Euclidean geometry the area ofa quadrangle must
be proportional to the difference between 360' and the sum of the angles.
However, at this later date and even up to I 799 Gauss was still trying to
deduce Euclid's parallel postulate from other more plausible assumptions and
he still believed Euclidean geometry to be the geometry of physical space
even though he could conceive of other logical non-Euclidean geometries.
However, on December 17, 1799, Gauss wrote to his friend the Hungarian
mathematician Wolfgang Farkas Bolyai ( I 775-1 856),
As for me I have already made some progress in my work. However, the
path I have chosen does not lead at all to the goal which we seek [deduction
of the parallel axiom], and which you assure me you have reached. It
seems rather to compel me to doubt the truth of geometry itself. It is true
that I have come upon much which by most people would be held to
constitute a proofl but in my eyes it proves as good as nothing. For
example, if we could show that a rectilinear triangle whose area would be
greater than any given area is possible, then I would be ready to prove
the whole of [Euclidean] geometry absolutely rigorously.
Most people would certainly let this stand as an axioml but I, no!
It would, indeed, be possible that the area might always remain below a
certain limit, however far apart the three angular points of the triangle
were taken.
This passage shows that by 1799 Gauss was rather convinced that the parallel
axiom cannot be deduced from the remaining Euclidean axioms and began
to take more seriously the development of a new and possibly applicable
geometry.
From about l8l3 on Gauss developed his new geometry which he first
called anti-Euclidean geometry, then astral geometry, and finally non-
Euclidean geometry. He became convinced that it was Iogically consistent
and rather sure that it might be applicable. In reviews of lBl6 and lB22 and
in his letter to Bessel of lB29 Gauss reaffirmed that the parallel postulate
could not be proved on the basis of the other axioms in Euclid. His letter to
Olbers written in 1817e is a landmark. In it Gauss says, "I am becoming
more and more convinced that the [physical] necessity of our [Euclidean]
geometry cannot be proved, at least not by human reason nor for human
reason. Perhaps in another life we will be able to obtain insight into the
nature of space, which is now unattainable. Until then we must place
geometry not in the same class with arithmetic, rvhich is purely a priori,
but with mechanics."
To test the applicability of Euclidean geometry and his non-Euclidean
geometry Gauss actually measured the sum of the angles of the triangle
formed by three mountain peaks, Brocken, Hohehagen, and Inselsberg. The
9. Werke, 8, 177.
THE CREATION OF NON-EUCLIDEAN GEOMETRY 879
sidesof this triangle were 69, 85, and 197 km. He found ro that the sum
exceeded lB0" by 14185. The experiment proved nothing because the
experimental error was much larger than the excess and so the correct sum
could have been l80o or even less. As Gauss realized, the triangle was a
small one and since in the non-Euclidean geometry the defect is proportional
to the area only a large triangle could possibly reveal any significant departure
from an angle sum of 180'.
We shall not discuss the specific non-Euclidean theorems that are due
to Gauss. He did not write up a full deductive presentation and the theorems
he did prove are much like those we shall encounter in the work of
Lobatchevsky and Bolyai. These two men are generally credited with the
creation of non-Euclidean geometry. Just what is to their credit will be
discussed later but they did publish organized presentations of a non-
Euclidean geometry on a deductive synthetic basis with the full understanding
that this new geometry was logically as legitimate as Euclid's.
Nikolai Ivanovich Lobatchevsky (l 793-1856), a Russian, studied at the
University of Kazan and from lB27 to 1846 was professor and rector at that
university. He presented his views on the loundations of geometry in a paper
before the department of mathematics and physics of the University in 1826.
However, the paper was never printed and was lost. He gave his approach
to non-Euclidean geometry in a series of papers, the first two of which were
published in Kazan journals and the third in the Journal fiir Matlumatik.rt
The first was entitled " On the Foundations of Geometry" and appeared in
1829-30. The second, entitled " New Foundations of Geometry with a
Complete Theory ol Parallels " (1835-37), is a better presentation of
Lobatchevsky's ideas. He called his new geometry imaginary geometry for
reasons which are perhaps already apparent and will be clearer later. In
lB40 he published a book in German, Geometrische Untersuchungen zur Tluoric
12). In
der Parallelliniea (Geometrical Researches on the Theory of Parallels
this book he laments the slight interest shown in his writings. Though he
became blind he dictated a completely new exposition of his geometry and
published it in 1855 under the title PangiomCtrie.
John (J6nos) Bolyai ( I 802-60), son of Wolfgang Bolyai, was a Hungarian
army officer. On non-Euclidean geometry, which he called absolute geom-
etry, he wrote a twenty-six-page paPer "The Science of Absolute Spac6."ra
This was published as an appendix to his father's book Tentamen Juacntubm
Studiosam in Elementa Mathepos (Essay on the Elements of Mathematics for
Figure 36.5
angle z(a) with lB are the parallels and z(a) is called the angle ofparallelism.
Lines through C other than the parallels and which do not meet AB ate
called non-intersecting lines, though in Euclid's sense they are parallel to
AB and. so in this sense Lobatchevsky's geometry contains an infinite number
of parallels through C.
If zr(a) : ztl2 then the Euclidean axiom results. If not, then it follows
that r(a) increases and approaches rf2 as c decreases to zero' and zr(c)
decreases and approaches zero as a becomes infinite. The sum of the angles
of a triangle is always less than z, decreases as the area of the triangle
increases, and approaches zr as the area approaches zero. If two triangles arc
similar then they are congruent.
Now Lobatchevsky turns to the trigonometric part of his gcometry.
The first step is the determination of r(a). The result, if a complete central
angle is 2t, isLT
(r)
from which it follows that zr(0) : nl2 and r'( +o) = 0. The relation (l) is
significant in that with each length * it associates a deinite angle z(*).
When .r : I, tan ln(l)l2J: a-l so that z(l) : 4O'24'. Thus the unit of
Iength is that length whose angle of parallelism is 40"24' . This unit of length
does not have direct physical significance. It can be physically one inch or
one mile. One would choose the physical interPretation which would make
the geometry physically applicable.ls
Then Lobatchevsky deduces formulas connecting sides and angles of
the plane triangles of his geometry. In a paper of lB34 he had defined cos x
17. This is a special formulation. In his 18'10 work Lobatchevsky gives what amounB to
the form usually given in modern t€xts and which Gauss also has, namcly,
Figure 36.6
and sin x for real x as the real and imaginary parts of ar". Lobatchevsky's
point was to give a purely analytical foundation for trigonometry and so
make it independent of Euclidean geometry. The main trigonometric
formulas of his geometry are (Fig. 36.6)
Also whereas in the usual spherical geometry the area of a triangle with
angles l, B, and C is 12 (A + B + C - zr), in the non-Euclidean geometry
o,:l@\m, \
for the element of arc on a curve g -- f (x) at the point (.t, y). Then the entire
circumference of a circle of radius r can be calculated' It is
C:n(e'-e-').
The expression for the area of a circle proves to be
A: n(erl2 - e- tl2)2
CLAIMS OF LOBATCHEVSKY AIYD BOLYAI TO PRIORITY 8ll
fss
Figure 36.7
He also gives theorems on the area ofplane and curved regions and volumes
o(solids.
The formulas of Euclidean geometry result from the non-Euclidean
formulas when the magnitudes are small. Thus if we use the fact that
e,:t*r*fi*...
and neglect for small r all but the first two terms then, for example,
C : t(e, -'-,) - fll + r - (1 - r)) : 2nr.
In the first paper (1829-30) Lobatchevsky also considered the applica-
bility of his geometry to physical space. The essence of his argument rests on
the parallax of stars. Suppose E, and. E, (fig. 36.7) are the positions of the
earth six months apart and S is a star. The parallaxp of S is the difference in
the directions of ErS and ErS measured, say, from the perpendicular E1S'.
If Er.R is the Euclidean parallel to .E S, then since E,.SE2 is an isosceles
triangle, n 12 - j.SEfi2 is half of the change in direction of the star, that is,
pl2. This angle is l'.24 for the star Sirius (Lobatchevsky's value). As long as
such an angle is not zero, the line from .8, to the star cannot be the parallel
to LS because the line cuts 7S. If, however, there were a lower bound to the
various parallaxes of all the stars then any line from E1 making a smaller
angle with Er,S' than this lower bound could be taken to be a parallel to 7S
through E, and this geometry would be equally useful so far as stellar
measurements are concerned. But then Lobatchevsky showed that the unit
of length in his geometry would have to be, physically, more than a half a
million times the radius of the earth's path. In other words, Lobatchevsky's
geometry could be applicable only in an enormously large triangle.
parallel axiom could not be deduced from the other nine axioms both
Lobatchevsky and Bolyai may have decided to try their hand at the problem.
Having failed, they could appreciate all the more readily the wisdom of
Gauss's views on the subject.
As for the technical content contributed by Lobatchevsky and John
Bolyai, though they may have created this independently oftheir predecessors
and of each other, Saccheri's and Lambert's work, to say nothing of
Schweikart's and Taurinus's, was widely known in Gottingen and was
certainly known to Barqels and Wolfgang Bolyai. And when Lobatchevsky
refers in his lB35-37 paper to the futility of the efforts over two thousand
years to settle the question of the parallel axiom, by inference he admits to
the knowledge of the earlier work.
Bibliogra!lry
Bonola, Roberto: Non-Eutlidean Geometry, Dover (reprint), 1955.
Dunnington, G. W.: Carl Friedrich Gauss, Stechert-Hafner, 1960.
Engel, F., and P. Staeckel: Die Theoriz der Parallellinien wn Euklid bis auf Gauss,
2 vols., B. G. Teubner, 1895. )
I|rkunden zur Geschichtc fur nichteuklidisclun Geometric, B. G. Teubner,
1899-1913, 2 vols. The 6rst volume contains the translation from Russian
into German of Lobatchevsky's 1829-30 and 1835-37 papers. The second
is on the work of the two Bolyais.
Enriques, F. : " Prinzipien der Geometrie," Encyk. dzr Math. Wiss., B. G. Teubner,
1907-10, III ABr, l-129.
Gauss, Carl F.: Werlu, B. G. Teubner, 1900 and 1903, Vol. 8, 157-268; Vol. 9,
297458.
Heath, Thomas L.: Euclid's Elements, Dover (reprint), 1956, Vol. I' pp.202-20.
Kagan, V. : Lobatchzuskg and. his Contri.buti.on to Scitrce, Foreign Language Pub. House,
Moscow, 1957.
Lambert, J. H.: Opera Mathemalica,..2 vols., Orell Fussli, l9'1G-'18.
Pasch, Moritz, and Max Dehn: Vi\sungcn i)ber neture Geomttric, 2nd ed., Julius
Springer, 1926, pp. 185-238.
Saccheri, Gerolamo: Ettclides ab Omni Nacoo Vindicatus, English trans. by G. B,
Halsted in Amer. Math. Monthly, Vots. l-5, 189'S-98; also Opcn Court Pub. Co.,
1920, and Chelsea (reprint), 1970.
Schmidt, Franz, and Paul Staeckel: Brizfucchsel zwischen Carl Fricdli.h Gauss uzd
Wolfgang Bolyai.B. G. Teubner, 1899; Georg Olms (reprint)' 1970.
Smith, David E.: A Source Book in Mathematirs, Dover (reprint), 1959, Vol.2,
pp.35l-88.
Sommerville, D. M. Y. : The Elenunts of Non-Euclidean Gcomctry, Dover (reprint),
1958.
Staeckel, P. :
" Gauss als Geometer," Naehrichtcn Kdnig. Gcs, dcr Wiss. zu Gdtt.,
1917, Beiheft, pp.25-142. Also in Gauss: Wcrke , X2.
von Walterhausen, W. Sartorius: Carl Fricdrich Galss, S. Hirzel, 1856; Springer-
Verlag (reprint), 1965.
Zact,arias, M. : " Elementargeometrie und elementare nicht-euLlidische Geometrie
in synthetischer Behandlung," Encyk. dcr Math. Wiss.' B. G. Teubner,
l9l,s-31, III AB9, 859-1172.
37
The Differential Geometry of
Gauss and Riemann
l. Introduction
We shall now pick up the threads of the development of differential geometry,
particularly the theory of surfaces as founded by Euler and extended by
Monge. The no<t great steP in this subject was made by Gauss.
Gauss had devoted an immense amount of work to geodesy and map-
making starting in 1816. His participation in actual physical surveys, on
which he published many PaPers, stimulated his interest in differential
geomctry and led to his definitive 1827 paper " Disquisitiones Generales
circa Superficies Curvas " (General (Investigations of Curved Surfaces) '1
However, bcyond contributing this dbfinitive treatment of the differential
geometry of surfaces lying in three-dimensional space Gauss advanced the
totally new concept that a surface is a space in itself. It was this concept
that Riemann generalized, thereby opening up new vistas in non-Euclidean
geometry.
Gauss's point of departure was to use this parametric representation for the
systematic study of surfaces. From these parametric equations we have
(2) dt:.atlu a a'da, dg: bdu * b'ila, dz:cduic'da
l. Comm. Soc, Go,t.,6, f 828, 99-l'16 : Wcrla,4,217-58.
BBe
OAUSS,S DIFFERENTIAL OEOMETRY 88s
wherein d : xt!, o,' : *u, and so forth. For convenience Gauss introduces
the determinants
where
E:a2+b2*c2, F:aa'+bb'+cc', G:a'2 +b'' +c'2.
The angle between two curves on a surface is another fundamental
quantity. A curve on the surface is determined by a relation between z and
a, for then x, g, and. z become functions of one parameter, u or u, and the
equations (l) become the parametric representation of a curve. One says in
the language ofdifferentials that at a point (u, u) a curve or the direction ofa
curve emanating from the point is given by the ratio du:da. If then we have
two curves or two directions emanating from (2, u), one given by tlu:fu anl.d
the other by du'tda', and if 0 is the angle between these directions, Gauss
shows that
points. Gauss evaluates this ratio by noting first that the tangent plane at
(X, Y, Z) on the sphere is parallel to the one at (x,y, z) on the surface. Hence
the ratio of the two areas is the ratio of their projections on the respective
tangent planes. To find this latter ratio Gauss performs an amazing number
of differentiations and obtains a result which is still basic, namely, that the
(total) curvature r( of the surface is
K
LN_M2
(6) "- -- EG-F2
wherein
lx"" lut
"',:ri." lxuu !u,
:l*, lu ",1,
zuul zuul
i;"
:l
u
l-, !t, ",1 ': h ',i ',".1
Then Gauss shows that his i( is the product of the two principal curvatures
at (x,g, z), which had been introduced by Euler. The notion of mean
curvature, the average of the two principal curvatures, was introduced by
Sophie Germain in 1831.3
Now Gauss makes an extremely imPortant observation. When the
surface is given by the parametric equations (l), the properties of the
surface seem to depend on the functions rc, !, z. By fixing u, say u : 3, and
letting u vary, one obtains a curve on the surface. For the various possible
fixed values of z one obtains a family of curves' Likewise by fixing u one
obtains a family of curves. These two families are the parametric curves on
the surf;ace so that each point is given by a pair of numbers (e,d), say'
where z : c and a : d are the parametric curves which pass through the
point. These coordinates do not necessarily denote distances any more than
latitude and longitude do. Let us think ofa surface on which the parametric
curves have been determined in some way. Then, Gauss affirms, the geo-
metrical properties of the surface are determined solely by the E, F, and G
in the expression (4) for /s2. These functions of z and tt ate all that matter'
It is certainly the case, as is evident from (4) and (5), that distances and
angles on the surface are determined by the E, F, and G. But Gauss's funda-
mental expression lor curvature, (6) above, depends upon the additional
quantities L, M, and N. Gauss now Proves that
alad L, M, and .tr[ are related to E, F, ar:d G through (7). This surfirce is
uniquely determined except for position in space. (For real surfaces with real
coordinates (u, a) we must have EG - F2 > 0, E > 0, and G > 0). Bonnet's
theorem is the analogue of the corresponding theorem on curves (Chap. 23'
sec. 6).
The fact that the properties ofa surface depend only on the E, F, and G
has many implications some of which were drawn by Gauss in his 1827
paper. For example, if a surface is bent without stretching or contracting'
the coordinate lines z : const. and , : const. will remain the same and so
ds will remain the same. Hence all properties of the surface, the curvature in
particular, will remain the same. Moreover, if two surfaces can be put into
one-to-one correspondence with each other, that is, if z' : $(u,a)' tt' :
,lt(u, u), where u' and a' are the coordinates of points on the second surliace'
and if the element of distance at corresponding points is the same on the
two surflaces, that is, if
E duz + 2F du da * G dtt2 : E' du'2 * 2F' du' du' + G' dv'z
dza
(e)
E:
I
"\7")
&t\3
+ (2n - re)' + (t'2p.)!,- t,
Figure 37.1
ll
JJt
xae:cr* ..z+do-ni
that is, the integral of the curvature over a geodesic triangle is equal to the
excess of the sum of the angles over l80o or, where the angle sum is less
than z, to the defect from I80". This theorem, Gauss says, ought to be
counted as a most elegant theorem. The result generalizes the theorem of
Lambert (Chap. 36, sec. 4), which states that the area ofa spherical triangle
equals the product of its spherical excess and the square of the radius, for
in a spherical triangle 1( is constant and equals l/R2.
One more important piece of work in Gauss's differential geometry
must be noted. Lagrange (Chap. 23, sec. B) had treated the conformal
mapping of a surface of revolution into the plane' In 1822 Gauss won a
prize offered by the Danish Royal Society of Sciences for a paper on the
problem of finding the analytic condition for transforming any surface
conformally onto any other surface.s His condition, which holds in the
neighborhood of corresponding points on the two surfaces, amounts to the
fact that a function P + iQ, which we shall not specify further, of the pa-
rameters T and u by means of which one surface is represented is a
functionrf of iD * iq which is the corresponding function of the parameters I
and z by which the other surface is represented and P - iQ isf'(p - ;q)'
where.rf is;f or obtained from;f by replacing i by - 2. The functionlf depends
on the correspondence between the two surfaces, the correspondence being
specified by T : T(t, u) and' U(1, z). Gauss did not answer the question of
whether and in what way a finite portion of the surface can be mapped
conformally onto the other surface' This problem was taken up by Riemann
in his work on complex functions (Chap. 27, sec. I0).
Gauss's work in differential geometry is a landmark in itself' But its
implications were far deeper than he himself appreciated. Until this work,
'lilerke,4, 189-216.
8.
B8B THE DIFEERENTIAL CEOMETRY OF GAUSS AND RIEMANN
9. Abh. dcr :
Ges. der l4/iss. zu Giitt., 13, 1868, l-20 Wcrke, 2nd ed., 272-87. An English
translation can be found in W. K. Clifford's Collected Mathematical Paprs. Also in Noturc,
8, 1873, l,$-36, and in D. E. Smith, A Source Book in Mothunatics, 4ll-25.
10. Werke,2nd ed., 1892, 391-404'
89o THE DIFTERENTIAL GEOMETRY OF GAUSS AND RIEMANN
(10) dsz :
))sud*,a*,,
wherein the 91, are functions of the coordinates x1, !t2, . , .,ltn, Et! : gn, a\d
the right side of (10) is always positive for all possible values of the /r1. This
expression for dsz is a generalization of the Euclidean distance formula
tk2 : dx? + dxA +...+ dxf;.
Though Riemann in his paper of 1854 did not set forth explicitly the
following definitions he undoubtedly had them in mind because they parallel
what Gauss did for surfaces. A curve on a Riemannian manifold is given by
the set of z functions
(lt) \ : x(t), tz : xz?),. .', x,r : xn(t).
Then the length ofa curve between I: a and I : p is defined by
ll. For the meaning of the brace symbol see (19) bclow. Riemann did not give these
equations explicitly.
B9Z THE DIFFERENTIAL GEOMETRY OF GAUSS AND RIEMANN
2 d,?
(14) *"
).2
--(l
l=l
+(al+))x!)2
-
Riemann thought that the curvature o must be positive or zero, so that when
a > 0 we get a spherical space and when a : 0 we get a Euclidean space
and conversely. He also believed that if a space is infinitely extended the
RTEMANN's AppRoActr ro cEoMETRY BgS
curvaturs mtrt be zqo. He did, however, suggest that there might be a real
surflace of constant negative curvature'12
To elaborate on Riemann, for cr : a2 > 0 and' n : 3 we get a three-
dimensional spherical geometry though we cannot visualize it. The space is
finite in extent but boundless' All geodesics in it are of constant length,
namely, 2rla, and return upon themselves. The volume of the space is
2n2la3, For a2 > 0 and n:2 we get the space of the ordinary spherical
surface. The geodesics are ofcourse the great circles and are finite. Moreover,
any two intersect in two points. Actually it is not clear whether Riemann
regarded the geodesics of a surface of constant positive curvature as cutting
in one or two points. He probably intended the latter. Felix Klein pointed
out later (see the next chapter) that there were two distinct geometries
involved.
Riemann also points out a distinction, of which more was made later,
between boundlessness [as is the case for the surface ofa sphere] and infinite-
ness of space. Unboundedness, he says, has a greater empirical credibility
than any other empirically derived fact such as infinite extent.
Toward the end of his paper Riemann notes that since physical space is
a special kind of manifold the geometry of that sPace cannot be derived only
from general notions about manifolds. The properties that distinguish
physical space from other triPly extended manifolds are to be obtained only
from experience. He adds, "It remains to resolve the question of knowing in
what measure and up to what point these hypotheses about manifolds are
confirmed by experience." In particular the axioms of Euclidean geometry
may be only approximately true of physical space. Like Lobatchevsky,
Riemann believed that astronomy will decide which geometry fits space'
He ends his paper with the prophetic remark: " Either therefore the reality
which underlies space must form a discrete manifold or we must seek the
ground of its metric relations outside it, in the binding forces which act on
it. . . . This leads us into the domain of another science, that of physics, into
which the object ofour work does not allow us to go today."
This point was developed by William Kingdon Clifford'ra
I hold in fact: (l) That small portions ofspace are ofa nature analogous
to little hills on a surface which is on the average flat' (2) That this
property of being curved or distorted is continually passed on from one
portion of space to another after the manner of a wave' (3) That this
variation of the curvature of space is really what haPpens in that
phenomenon which we call the motion of matter whether ponderable or
ethereal. (4) That in this physical world nothing else takes place but this
variation, subject, possibly, to the law of continuity'
12. Such surfaces were already known to Ferdinand Minding (Jour' ftr Math', 19' 1839,
370-87, pp.378-80 in particular), including the very one later callcd thc pscudosphere
(see Chap. 38, sec. 2). See also Gauss, We*e,8,265.
13. Proc. Camb. Phit. Soc., 2, 1870, 157-58 : Math. Pa?ers,2V22'
894 THE DIFFERENTIAL GEOMETRY OF GAUSS AND RIEMANN
The ordinary laws of Euclidean geometry are not valid for a space whose
curvature changes not only from place to place but because of the motion of
matter, from time to time. He added that a more exact investigation of
physical laws would not be able to ignore these " hills " in space. Thus
Riemann and Clifford, unlike most other geometers, felt the need to associate
matter rvith space in order to determine what is true of physical space. This
line of thought leads, of course, to the theory of relativity.
ln lis Pariserurbail ( I 86l ) Riemann returned to the question of when a
given Riemannian space whose metric is
: t,P,
(lB) rar,r: tifl
^r:i(w.* -*)
(ls) rla : {if} : t"p, :)s'^lf)
^}
where g'^ is the cofactor divided by g ofg,1 in the determinant of g. Riemann
also introduced what is now known as the Riemann four index symbol
(22) o : sD
) s,r d*, tu, - z aaf su dxt dxt + dd)su dx, dxt.
Here one understands that the d and 6 oPerate formally on the expressions
following them (and / and E commute) so that
Now let
(25) (: -[E^,
8g6 THE DIFFERENTIAL GEOMETRY OF GAUSS AND RIEMANN
The overall conclusion is that the necessary and sufficient condition that a
given ds2 can be brought to the form (for n : 3)
(26) ds'z : 4 dx! + c, dx! + cs dx!,
where the c1's are constants, is that all the symbols ("F, y6) be zero. In case
the e,'s are all positive the d,r'can be reduced to dg! + dyl + dg!, that is, the
space is Euclidean. As we can see from the value of [O], when 1( is zero, the
space is essentially Euclidean.
It is worth noting that Riemann's curvature for an z-dimensional
manifiold reduces to Gauss's total curvature ofa surface. In fact when
ds2 : gr dx? + 2kz dx, dxs * 92 dxZ,
of the 16 symbols ("F, yA), 12 are zero and for the remaining four we have
(12, 12) : -(12,21) : -(21,12) : (21,21). Then Riemann's r( reduces
to
k _ (t2, 12) .
c
By using (20) this expression can be shown to be equal to Gauss's expression
for the total curvature of a surface.
F: > g,,dxidx1
1,,
and
F' : AdX2 +28dXdY +CdY2
and suppose that x and g may be expressed as functions of Xand Iso that F
becomes F' under the transformation. Of course dx : QIIAX) dX +
@xll\ dY. Now when x, y, dx, and dg are replaced in F by their values in X
and I and when one equates coemcients in this new form of Fwith those of
F' one obtains
He lets S: lg-1. Then if A,, is the cofactor of g," in the determinant let
gn : Lrolg. He, lile Riemann, introduces independently the four index
rymbol (without the comma)
f s,,*%nfi: r'",.
Thcse equations are the necessary and sufficient conditions that a trans-
formation exist for which F : F'.
Partly to treat the integrability of this set of equations and partly
because Christoffel wishes to consider forms of degree higher than two in the
dr,, he performs a number of differentiations and algebraic steps which show
that
(27) (,sly)',:
,z..rtrrr#'il*%,#,,
where c, B, y, and 6 take all values from I to n. There are n2(nz - l)ll2
equations of this form. These equations are the necessary and sufficient
conditions for the equivalence of two differential forms of fourth order.
Indeed let d(7,x, d<ztx, d$tx, dtr>x be four sets ofdifferentials ofx and likewise
for the y's. Then if we have the quadrilinear form
Gr : (gkhi)d<rrxotl@)x*d,@)xod,@)x,
>
g,k,h,t
the relations (27) are necessary and sufficient that G, : Gi, where Gl is the
analogue of G. in the y variables.
This theory can be generalized to p-ply differential forms. In lact
Christoffel introduces
It;"'lt
where the term in parentheses is defined in terms of the 9,, much as the four
index symbol is and the symbol ? is used to distinguish the. differentials of the
set of*, from the set obtained by applying ?. He then shows that
- ) t{;;,, rx&...,r)
These (p * l)-index symbols are the coefficients of the Go*, form. The
procedure Christoffel uses here is what Ricci and Levi-Civita later called
covariant differentiation (Chap. a8).
Whereas Christoffel wrote only one key paper on Riemannian geometry,
Rudolph Lipschitz, professor of mathematics at Bonn University, wrote a
great number appearing in the Journal fiir Mathtmatik from 1869 on. Though
there are some generalizations of the work of Beltrami and Christoffel, the
essential subject matter and results are the same as those of the latter two
men. He did produce some new results on subspaces of Riemannian and
Euclidean n-dimensional spaces.
The ideas projected by Riemann and developed by his three immediiate
successors suggested hosts of new problems in both Euclidean and Rieman-
nian differential geometry. In particular the results already obtained in the
Euclidean case for three dimensions were generalized to curves, surfaces, and
higher-dimensional forms in z dimensions. Of many results we shall citc just
one.
In 1886 Friedrich Schur (1856-1932) proved the theorem named after
him.16 In accordance with Riemann's approach to the notion of curvature,
Schur speaks ofthe curvature ofan orientation ofspace. Such an orientation
is determined by a pencil of geodesics pa * AB where cr and p are the
directions of two geodesics issuing from a point. This pencil forms a surface
and has a Gauss curvature which Schur calls the Riemannian curvature of
that orientation. His theorem then states that if at each point the Riemannian
curvature of a space is independent of the orientation then the Riemannian
curvature is constant throughout the space. The manifold is then a space of
constant curvature.
to another such expression with preservation ofthe value ofdsa that different
coordinate representations can be obtained for the very same manifold.
However, the geometrical properties of the manifold must be independent of
the particular coordinate system used to represent and study it. Analytically
these geometrical properties would be represented by invariants, that is,
expressions which retain their form under the change of coordinates and
which will consequently have the same value at a given point. The invariants
of interest in Riemannian geometry involve not only the fundamental
quadratic form, which contains the differentials dx1 and dxp but may also
contain derivatives of the coefficients and ofother functions. They are there-
fore called differential invariants.
To use the two-dimensional case as an example, if
(32) dsz : E dus * 2F duda + G da2
^'4:#tt^U*#,).*U*#)
.*"u*#))
Incidentally in this same work Lam6 gave conditions on when the ds' given
by (36) determines a curvilinear coordinate system in Euclidean space and,
if it does, how to change to rectangular coordinates.
The investigation of invariants for the theory of surfaces was first made
by Beltrami.l8 }fe gave the two differential invariants
: E*-Bu{r(^4)' - zra},ff +
^,6 "(#)')
and
| (a lG6"-Fd,\ a l-F6"+Ed,\l
u,e:-^/Ec-=-Fatu\;7rt-nc_eil-d,\Gffi )l'
These have geometrical meaning. For example in the case of ArC, if AIC : 1,
the curves $(u,o) : const. are the orthogonal trajectories ofa family of
geodesics on the surflace.
The search for differential invariants was carried over to quadratic
diflerential forms in z variables. The reason again was that these invariants
are independent of particular choices of coordinates; they rePresent intrinsic
properties of the manifold itself. Thus the Riemann curvature is a scalar
invariant.
Beltrami, using a method given byJacobi,lo succeeded in carrying over
to n-dimensional Riemannian spaces the Lam6 invariants.2o Let g as usual
be the determinant of the g,1 and let g" be the cofactor divided by g of gu
in g. Then Beltrami showed that Lam6's first invariant becomes
a,(d) :S,ua4a{.
f;- 0x,0x1
18. Gi.or, tli Mat.,2, 1864,267-82, and, succeeding papers in Vols. 2 and 3 = Olcru Mat.,
l, 107-98.
19. Jour.ffrr Math.,36, 1848, I13-34: We*e,2, 193'216.
20. Memorie dcll' Accademia dclle Scienze dell' Istituto di Bologna, (2), 8, 1868' 551-90 :
Opeft Mat., 2, 7+-118.
go2 THE DTFTERENTIAL GEOUETRY OF GAUSS AND RIEMANN
This is the general form for the square of the gradient of{. For the second of
Lam€'s invariants Beltrami obtained
A,(C) :t"1-",@1-#)
Hc also introduced the mixed differential invariant
A,@{)
aia{.
:S ,u dxr4xt
fi"
This is the general form of the scalar product of the gradiens of { and ry'.
Of course the form ds2 is itselfan invariant under a change ofcoordinates.
From this, as we found in the previous siction, Christoffel derived higher
order differential forms, his G. and G, which are also invariants. Moreover,
he showed how Iiom G, one can derive Gu*r, which is also an invariant. The
construction of such invariants was also pursued by Lipschitz. The number
and variety are extensive. As we shall see this theory of differential invariants
was the inspiration for tensor analysis.
Bibliograplry
Bcltrami, Eugenio: Ofotc matcnaticlu, 4 vols., Ulrico Hoepli, 1902-20.
CXifford, William K.: Matlumatiral Palas, Macmillan, 1882; Chelsea (reprint),
1968.
Coolidgc, Julian L.: A HirW of Gcomctital Methods, Dover (reprint), 1963,
pp. 35${7.
buykhpilb da Mailumalisclun Wissmsehaftcn, III, Teil 3, various articles, B. G.
Tcubncr, 1902-7.
Gaurs, Carl F.: Weilu, 4, 192-216, 217-58, Kiinigliche Gesellschaft der Wissen-
schaften zu Gottingen, 1880. A translation, " General Investigations of
Curved Surfaces, " has been reprinted by Raven Pres, 1965.
Helmholtz, Hermann von: "Uber die taeiichlichen Grundlagen der Geometrie,"
WisscnschaJtlithc Abhandhmgen, 2, 6 I 0-l 7.
"Uber die Tatsachen, die der Geometrie zum Grunde lieger," Nachrichtcn
Kdnig. Gcs. tb Wiss. zu Gdtt., 15, 1868, 193-221; Wiss. Abh., 2, 618-39.
" Uber den lJrsprung Sinn und Bedeutung der geometrischen Siitze";
English translation, " On the Origin and Significance of Geometrical
Axioms," in Helmholtz: Popular Scicntfu lzctures, Dover (reprint), 1962'
22&49. Also in James R. Newman: The ltorld of Matlumatits, Simon and
Schuster, 1956, Vol. 1, 647-68.
Jammer, Max: Conceqts of Spa*, Harvard University Press, 1954.
Killing, W. : Diz nicht-euklidisclun Raumformen in analgtischu Behandlung, B. G.
Teubner, 1885,
Klein, F.: Vorksungcn nbcr dir Entwicklung der Mathcmatik im 19. Jahrhundcrt, Chelsea
(reprint), 1950, Vol. 1,6-62; YoL 2, 147-2O6.
BIBLIOGRAPHY 9O3
Pierpont, James: " Some Modern Views of Space," Amer. Math. Soe . Bull., 32,
1926, 225-s8.
Riemann, Bernhard: Gesammeltc mathematische Wnke, 2rrd ed., Dover (reprint),
1953, pp. 272'87 and 391-404.
Russell, Bertrand: An Essay on the Foundations of Geomztry (1897), Dover (reprint),
1956.
Smith, David E.: ASource Book in Mathematics, Dover (reprint), 1959' Vol. 2, 41 l-25,
463-75. This contains translations of Riemann's 1854 paper and Gauss's
1822 paper.
: " Gauss als Geometer," Nachrichten Kdnig. Gu. d* lUiss. zu Giit'',
Staeckel, P.
1917, Beiheft, 25-140; also in Werlce, l0z.
Weatherburn, C. E. : " The Development of Multidimensional Differential
Geometry," Australian and Ncw Zealand Ass'n for the Adttanccment of Scbue,
21, 193s, 12-28.
3B
Projective and Metric Geometry
But it should always be required that a mathematical subject
not be considered exhausted until it has become intuitively
evident . . . FELI)( KLEIN
l. Introduction
Prior to and during the work on non-Euclidean geometry, the study of
projective properties was the major geometric activity. Moreover, it was
evident from the work of von Staudt (Chap. 35, sec. 3) that projective
geometry is logically prior to Euclidean geometry because it deals with
qualitative and descriptive properties that enter into the very formation of
geometrical figures and does not use the measures of line segments and
angles. This fact suggested that Euclidean geometry might be some special-
ization of projective geometry. With the non-Euclidean geometries now at
hand the possibility arose that these, too, at least the ones dealing with spaces
of constant curyature, might be specializations of projective geometry. Hence
the relationship of projective geometry to the non-Euclidean geometries,
which are metric geometries because distance is employed as a fundamental
concept, became a subject of research. The clarification of the relationship
of projective geometry to Euclidean and the non-Euclidean geometries is the
great achievement of the work we are about to examine. Equally vital was the
establishment of the consistency of the basic non-Euclidean geometries.
90.4
SURFACES AS MODELS OF NON-EUCLIDEAN GEOMETRY 905
Figure 38.1
\/F-=7
z: klog
k+
- \/r=7
and the equation of the surface is
z: ktos{*=7
yrc'+!' - t/-p- az - Yz'
The curvature of the surface is -llkz. Thus the pseudosphere is a model for
a limited portion of the plane of Gauss, Lobatchevsky, and Bolyai' On the
pseudosphere a figure may be shifted about and just bending will make it
confo"- to the surface, as a plane figure by bending can be fitted to the
surface of a circular cylinder.
l. Jour. filr Math., 19, 1839, 370-87.
2. Annali di Mat., 7, 1866, f85-204 : O?ere Mat., l, 262-40'
3. Gior. di Mat., 6, 1868,248-312 : O?$e Mst., l, 374.405'
906 PRoJEcrrvE AND METRTC GEoMETRY
4, Amer. Math, Soc., Ttars.,2, 1901, 86-99 = Ges. Abh.,2, +3748. The proof and further
historical details can be found in Appendix V of David Hilbert's Gtundlagen du Gcomebie,
7th cd., B. G. Tcubner, 1930. The theorem presupposes that thc lines of hyperbolic
geometry would be the geodesics of thc surface and thc lengths and angles would be the
Euclidean lengths and angles on the surface.
5. Nacbichacn Kdnig, G*. dcr Wiss. zt Giitt., 1899' 't*-55; Math' Ann.,53, 1900, 8l-l 12; and
s4, r901,505-17.
PROJBCTIVE AND METRIC GEOMETRY 907
(r) 6-o'-e:|rcs(uu',ww'),
where (uu', z'a') is the cross ratio of the four lines.T What is significant about
the expression (l) is that it may be taken as the definition of the size of an
angle in terms ofthe projective concept ofcross ratio. The logarithm function
is, of course, purely quantitative and may be introduced in any geometry.
Independently of Laguerre, Cayley made the next step. He approached
geometry from the standpoint of algebra, and in fact was interested in the
geometric interpretation of quantics (homogeneous polynomial forms), a
subject we shall consider in Chapter 39. Seeking to show that metrical
notions can be formulated in projective terms he concentrated on the relation
of Euclidean to projective geometry. The work we are about to describe is in
his " Sixth Memoir upon Quantics."s
Cayley's work proved to be a generalization of Laguerre's idea. The
latter had used the circular points at infinity to define angle in thc plane.
The circular points are really a degenerate conic. In two dimensions Cayley
introduced any conic in place of the circular points and in three dimensions
he introduced any quadric surface. These figures he called the absolute.
Cayley asserted that all metric properties of figures are none other than
projective properties augmented by the absolute or in relation to the
absolute. He then showed how this principle led to a new o<pression for
angle and an expression for distance between two points.
He starts with the fact that the points of a plane are represented by
homogeneous coordinates. These coordinates are not to be regarded as
distances or ratios of distances but as an assumed fundamental notion not
F(*,y) : Z ou*r,.
I'l =l
G(u,u) : *,,,,,: o,
)
t,t =r
whete Alt is the cofactor of arl in the determinant lal of the coefficients of F.
Cayley now defines the distance 6 between two points r and y, where
x : (xr, xr, xs) and g : (!b!2,!s), by the formula
F(x.u\
(2) E : arc cos
lF\jpffifi-uz'
The angle { between two lines whose line coordinates are u : (ur, u2, ur) ar,d
a : (ot ur, a.) is defined by
These general formulas become simple if we take for the absolute the
particular conic xf + xZ+ x3:0. Then if (av a2, a.) and (b1,b2,br) are
the homogeneous coordinates of two points, the distance between them is
given by
14t
\.,/ --urr.r-"@
--- tffia2,
a az, I o\{b? + bZ + bZ
and the angle { between two lines whose homogeneous line coordinates are
(ur, u2, ur) and (u1, a2, us) is given bY
(5) cos{:ffi
With respect to the exPression for distance if we use the shorthand that
tr! : xrlt * xzlz * x3y" then, if a : (at a2, as), band a are three points on
a line,
ab + arc
-----:---
bc
---------= : arc COS
ac
---F---F'
ZrC COS COS
t/ aa{ bb ! bb! cc ! aaY cc
PROJECTTVE AND METRIC CEOMETRY 9O9
That is, the distances add as they should. By taking the absolute conic to be
the circular points at infinity, (1, i,0) and (1, -r,0), Cayley showed that
his formulas for distance and angle reduce to the usual Euclidean formulas.
It will be noted that the expressions for length and angle involve the
algebraic expression for the absolute. Generally the analytic expression ofany
Euclidean metrical property involves the relation of that property to the
absolute. Metrical properties are not ProPerties ofthe figure per se but ofthe
figure in relation to the absolute. This is Cayley's idea of the general pro-
jective determination of metrics. The place of the metric concept in Pro-
jective geometry and the greater generality of the latter were described by
Cayley as, " Metrical geometry is part of projective geometry."
Cayley's idea was taken over by Felix Klein ( 1849-1925) and generalized
include the non-Euclidean geometries. Klein, a professor at Gtittingen,
so as to
was one of the leading mathematicians in Germany during the last part of the
nineteenth and first part of the twentieth century. During the years 1869-70
he learned the work of Lobatchevsky, Bolyai, von Staudt, and Cayley;
however, even in 1B7l he did not know Laguerre's result. It seemed to him
to be possible to subsume the non-Euclidean geometries, hyperbolic and
double elliptic geometry, under projective geometry by exploiting Cayley's
idea. He gave a sketch of his thoughts in a paper of lBTl e and then developed
them in two papers.lo Klein was the first to recognize that we do not need
surfaces to obtain models of non-Euclidean geometries.
To start with, Klein noted that Cayley did not make clear just what he
had in mind for the meaning of his coordinates. They were either simply
variables with no geometrical interPretation or they were Euclidean
distances. But to derive the metric geometries from projective Seometry it $'as
necessary to build up the coordinates on a projective basis. Von Staudt had
shown (Chap. 35, sec. 3) that it was Possible to assign numbers to points by
his algebra of throws. But he used the Euclidean parallel axiom. It seemed
clear to Klein that this axiom could be dispensed with and in the 1873 paper
he shows that this can be done. Ilence coordinates and cross ratio of four
points, four lines, or four planes can be defined on a purely projective basis.
Klein's major idea was that by specializing the nature of Cayley's
absolute quadric surface (if one considers three-dimensional geometry) one
could show that the metric, which according to Cayley depended on the
nature of the absolute, would yield hyperbolic and double elliptic geometry.
When the second degree surface is a real ellipsoid, real elliptic paraboloid, or
real hyperboloid of two sheets one gets Lobatchevsky's metric geometry, and
when the second degree surface is imaginary one gets Riemann's non-
Euclidean geometry (of constant positive curvature). If the absolute is
9. Nachichten Kdnig. Gx, der W*s. zu Gatt., 1871,419-33 = Ges- Math. Abh., l' 244-53.
10. Math. Ann., 4, 1871, 573425; and 6, 1873, 11245 : Gx. Math' Abh., l' 254-305'
31t43.
9to PROJECTTVE AND METRTC GEOIUETRY
Figurc a8.2
where the quantity in parentheses denotes the cross ratio of the four points
and a is a constant. This cross ratio can be expressed in terms of the co-
PROJECTTVE AND METRIC CEOMETRY 9II
ordinates of the points. Moreover, if there are three points Pr, P2, Pr on the
line then it can readily be shown that
(P,Pr, QrQr)'(P,P"' QrQ) : (Pis, QtQz)
so that PtP, * PzPs :
PtPs.
Likewise if z and are two lines (Fig. 38.3) one considers the tangents,
u
and ro from their point of intersection to the absolute (the tangents may be
imaginary lines) ; then the angle between u and' u is defined to be
6 : c'log (ua, tu),
where again c' is a constant and the quantity in parentheses dcnotcs the
cross ratio of the four lines.
To express the values of d and { analytically and to show their depend-
ence upon the choice of the absolute, let the equation of the absolute be
given byFand G above. By definition
: Z
g
F*o otr*ryt.
t,! -r
One can now show that if .r : (rcr, xz, xs) and ! : (!y !lz, ys) are thc
coordinates of P1 and P, then
4: r1orF*o
+ {FFF-4,'.'
" F*, - \/Fly - F,;Fy,
Likewise, if (u1, u2, z3) and (oba2,ts) are the coordinates of the two lines,
then one can show, using G, that
The constant c' is generally taken to be i/2 so as to make C real and a complctc
central angle 2zr.
Klein used the logarithmic expressions above for angle and distancc and
showed how the metric geometries can be derived from projcctive Seomctry'
9r2 PROJECTIVE AND METRIC GEOMETRY
Thus ifone starts with projective geometry then by his choice ofthe absolute
and by using the above expressions for distance and angle one can get the
Euclidean, hyperbolic, and elliptic geometries as special cases' The nature of
the metric geometry is fixed by the choice of the absolute. Incidentally,
Klein's expressions for distance and angle can be shown to be equal to
Cayley's.
If one makes a projective (i.e. linear) transformation of the projective
plane into itself which transforms the absolute into itself (though points on
the absolute go into other points) then because cross ratio is unaltered by a
linear transformation, distance and angle will be unaltered. These particular
Iinear transformations which leave the absolute fixed are the rigid motions or
congruence transformations of the particular metric geometry determined by
the absolute. A general projective transformation will not leave the absolute
invariant. Thus projective geometry proper is more general in the trans-
formations it allows.
Another contribution of Klein to non-Euclidean geometry was the
observation, which he says he first made in l87l11 but published in l\74,t2
that there are two kinds of elliptic geometry. In the double elliptic geometry
two points do not always determine a unique straight line. This is evident
from the spherical model when the two points are diametrically opposite. In
the second elliptic geometry, called single elliptic, two points always deter-
mine a unique straight line. When looked at from the standpoint of differ-
ential geometry, the differential form ds2 of a surface of constant positive
curvature is (in homogeneous coordinates)
: dx?+dxZ+dx?
ds2
1r+@214)(x?+xl+xl))2
In both cases a2 > 0. However, in the first type the geodesics are curves
of finite length 2trf a or if .R is the radius, 2rR, and are closed (return upon
themselves). In the second the geodesics are of length tla or tiR and are still
closed.
A model of a surface which has the properties of single elliptic geometry
and which is due to Klein,l3 is provided by a hemisphere including the
boundary. However, one must identily any two points on the boundary
which are diametrically opposite. The great circular arcs on the hemisphere
are the " straight lines" or geodesics ofthis geometry and the ordinary angles
on the surface are the angles of the geometry. Single elliptic geometry
(sometimes called elliptic in which case double elliptic geometry is called
spherical) is then also realized on a space of constant positive curvature.
ll. Math. Ann.,4, 1871,604. See also, Math. Ann.,6, 1873, 125; and, Math. Ant.,37,1890'
554-57 .
12. Math. Am., 7, 1874, 549-57 ; 9, 1876, 476-82 : Ges. mqth. Abh., 2' 63-77 .
the geometry of the surface of the sphere provided that line in the double
elliptic geometry is interpreted as great circle on the surface of the sphere. If
there should be contradictory theorems in this double elliptic geometry then
there would be contradictory theorems about the geometry of the surface
of the sphere. Now the sphere is part of Euclidean geometry. Hence if
Euclidean geometry is consistent, then double elliptic geometry must also
be so. To the mathematicians of the 1870s the consistency of Euclidean
geometry was hardly oPen to question because, apart from the views of a
few men such as Gauss, Bolyai, Lobatchevsky, and Riemann, Euclidean
geometry was still the necessary geometry of the physical world and it was
inconccivablc that there could be contradictory properties in the geometry
of the physical world. However, it is imPortant, especially in the Iight of later
developments, to realize that this proof of the consistency of double elliptic
geometry depends upon the consistency of Euclidean geometry.
The metlrod of proving the consistency of double elliptic geometry
could not be used for single elliptic geometry or ficr hyperbolic geometry.
The hemispherical model of single elliptic geometry cannot be realized in
three-dimensional Euclidean geometry though it can be in four-dimensional
Euclidean geometry. If one were willing to believe in the consistency of the
latter, then one might accept the consistency of single elliptic geometry.
Howwer, though z-dimensional geometry had already been considered by
Grassmann, Riemann, and others, it is doubtful that any mathematician of
thc l870s would have been willing to affirm the consistency of four-
dimensional Euclidean geometry.
The case for the consistency of hyperbolic geometry could not be made
on any such grounds. Beltrami had given the pseudospherical interpretation,
which is a surface in Euclidean space, but this serves as a model for only a
linited region of hyperbolic geometry and so could not be used to establish
the consistency of the entire geometry. Lobatchevsky and Bolyai had con-
sidered this problem (Chap. 36, sec. B) but had not been able to settle it.
As a matter of fact, though Bolyai proudly published his non-Euclidean
geometry, there is evidence that he doubted its consistency because in papers
found after his death he continued to try to prove the Euclidean parallel
axiom.
The consistency of the hyperbolic and single elliptic geometries was
established by new models. The model for hyperbolic geometry is due to
Beltrami.l6 However, the distance function used in this model is due to Klein
and the model is often ascribed to him' Let us consider the two-dimensional
cr$e.
Within the Euclidean plane (which is part of the projective plane) one
selects a real conic which one may as well take to be a circle (Fig. 3B'4)'
15. Annali di MaL, 7, 1866, 185-204 : Ol*c Mat., l, 262-80; Gior. di Mat', 6' 1868,
284-312 = Op.re Mdt., l' 37+-q5.
MODEf,Ii AND THE CONSISTENCY PROBLEM 9r5
Figure 38.
4(a, b) : lrog
(ab, nn),
where zl and n are the conjugate imaginary tangents from the vertex ofthe
angle to the circle and (ab, nn ) is the cross ratio of the four lines a, b, m, and n.
The constant I /2i ensures that a right angle has the measure tl2. The
definition of distance between two Points is given by formula (8), that is,
d : c log (PP' , XY) with e usually taken to be t/2. According to this formula
as P or P' approaches X or Y, the distance PP' becomes infinite. Hence in
terms of this distance, a chord is an infinite line of hyperbolic geometry'
Thus with the projective definitions of distance and angle size, the
points, chords, angles, and other figures interior to the circle satisfly the
axioms of hyperbolic geometry. Then the theorems of hyperbolic geometry
also apply to these figures inside the circle. In this model the axioms and
theorems of hyperbolic geometry are really assertions about special figures
and concepts (e.g. distance defined in the manner of hyperbolic geometry) of
Euclidean geometry. Since the axioms and theorems in question apply to these
figures and concepts, regarded as belonging to Euclidean geometry, all of
the assertions of hyperbolic geometry are theorems of Euclidean 8eometry.
III then, there were a contradiction in hyperbolic Seometry, this contra-
diction would be a contradiction within Euclidean geometry. But if Euclidean
geometrg is consistent, then hyperbolic geometry must also be. Thus the
9r6 PROJECTIVE AND METRIC GEOMETRY
Figurc 38.5
and again the determinant of the a,, must not be zero. The invariants under
the projective group are, for example, linearity, collinearity, cross ratio,
harmonic sets, and the property of being a conic section.
One subgroup of the projective grouP is the collection of affine trans-
formations.le This subgroup is defined as follows: Let any line /- in the
projective plane be fixed. The points of l- are called ideal points or points
at infinity and /- is called the line at infinity. Other points and lines of the
projective plane are called ordinary points and these are the usual points of
the Euclidean plane. The affine group ofcollineations is that subgroup ofthe
projective group which leaves l. invariant (though not necessarily pointwise)
and affine geometry is the set of properties and relations invariant under the
affine group. Algebraically, in two dimensions and in homogeneous co-
ordinates, the'affine transformations are represented by equations (9) above
but in which lst : asc: 0 and with the same determinant condition' In
nonhomogeneous coordinates affine transformations are given by
x':,rrx+a,ny+4n o,,l
lo'
lrn
* o.
!':azfi*azz!*azs azzl
geomctry
Euclidean to-.*
PROJECTTVE AND METRTC GEOMETRY
92O
Klein also considered more general geometries than projective' At this
time ( 1872) algebraic geometry was gradually being distinguished as a
separate discipline and he characterized this geomery by introducing the
transformations which in three dimensions and in nonhomogeneous
coordinates read
20. Klein does speak, in the case of differential geometry, of the group of transformations
which leave the expression for dsz invariant. This leads to diflerential invariants (Ges. math.
Abh., t,487).
TIIE REALITY OF NON-EUCLIDEAN CEOMETRY 927
less, Cayley affirmed in his lB59 paper (sec. 3) that, " Projective geometry
is all geometry and reciprocally." 2s Bertrand Russell in his Essay on the
Foundations oJ Geometrg (1897) also believed that projective Seometry was
necessarily the a priori form of any geometry of physical space. Hermann
Hankel, despite the attention he gave to history,2s did not hesitate to say in
1869 that projective geometry is the royal road to all mathematics. Our
examination of the developments already recorded shows clearly that
mathematicians can readily be carried away by their enthusiasms.
Bibliographg
Beltrami, Eugenio: Opue matcmatiche, Ulrico Hoepli, 1902, Vol. l.
Bonola, Roberto: Non-Euclidean Geometry, Dover (reprint), 1955, pp. 129-264.
Coolidge, Julian L. : A History oJ Geomttrical Methods, Dover (reprint), 1963' pp.
68-87.
Klein, Felix: Gesammelte mathcmati.sche Abhandlungcn, lrt lius Springer, l92l-23' Vols.
I and 2.
Pasch, Moritz, and Max Dehn: Vorlcsungen iiber newte Geomelrie, 2nd ed.' Julius
Springer, 1926, pp. 185-239.
Pierpont, James: " Non-Euclidean Geometry. A Retrospect," Arncr. Iv[ath. Soc.
Bull., 36, 1930, 66-76.
Russell, Bertrand: An Essay on thc Foundalions of Geotrulry (1897), Dover (reprint),
1956.
25. Cayley used the term " descriptive geomctry" for projcctive geometry.
26. Die Entuicklung det Mathcmalik in den lctzten Jaluhurulnlcn (The Developmcnt of Mathc-
matics in the Last Few Centuries), 1869; 2nd ed', 1884.
39
Algebraic Geometry
In these days the angel of topology and the devil of abstract
algebra fight for the soul of each individual mathematical
domain. HERMANN WEYL
l. Background
While non-Euclidean and Riemannian geometry were being created, the
projective geometers were pursuing their theme. As we have seen, the two
areas were linked by the work of Cayley and Klein. After the algebraic
method became widely used in projective geometry the problem of recog-
nizing what properties of geometrical figures are independent of the co-
ordinate representation commanded attention and this prompted the study
of algebraic invariants.
The projective properties of geometrical figures are those that are
invariant under linear transformations of the figures. While working on these
properties the mathematicians occasionally allowed themselves to consider
higher-degree transformations and to seek those properties of curves and
surfaces that are invariant under these latter transformations. The class of
transformations, which soon superseded linear translormations as the
favorite interest, is called birational because these are expressed algebraically
as rational functions of the coordinates and the inverse transformations are
also rational functions of their coordinates. The concentration on birational
transformations undoubtedly resulted from the fact that Riemann had used
them in his work on Abelian integrals and functions, and in fact, as we shall
see, the first big steps in the study of the birational transformation of curves
were guided by what Riemann had done. These two subjects formed the
content ofalgebraic geometry in the latter part ofthe nineteenth century.
The term algebraic geometry is an unfortunate one because originally
it referred to all the work from the time of Fermat and Descartes in which
algebra had been applied to geometry; in the latter part of thc nineteenth
century it was applied to the study of algebraic invariants and birational
transformations. In the twentieth century it relbrs to the last-mentioned
field.
924
TIIE THEORY OF 925
In three variables the forms are called ternary; in four variables, quaternary;
etc. The definitions below apply to forms in z variables.
Suppose we subject the binary form to a transformation ?" of the form
(2). Under 7 the form;[(xr, xr) is transformed into the form
F(Xb X; -- AoX! + AtX!-rX2 +.. .+ A"Xg.
926
The coefficients of F will differ from those oflf and the roots of F : 0 will
differ from the roots of;f : 0. Any function .I of the coefficients of;f which
satisfies the relationship
whose vanishing expresses the fact that J, and f, represent harmonic pairs of
points.
Beyond invariants ofa form or system of forms there are covariants. Any
function C of the coefficients and variables of;fwhich is an invariant under
I except for a power olthe modulus (determinant) of Tis called a covariant
ofl Thus, for binary forms, a covariant satisfies the relation
C (Ao, Ab .. ., Ao, Xr, Xr) -- r'C (ao, ay "'1an5 *17 x2).
lx #l
is a simultaneous covariant of weight I of the two forms. Geometrically, the
Jacobian set equal to zero represents a pair of points which is harmonic to
each ofthe original pairs represented byrfand / and the harmonic ProPerty
is projective.
The Hessian of a binary form introduced by Hesse,l
arf a'f I
dfrl
is a covariant of weight 2. Its geometric meaning is too involved to warrant
space here (Cf. Chap. 35, sec. 5). The concept ol the Hessian and its
covariance applies to any form in n variables.
The work on algebraic invariants was started in l84l by George Boole
(1815-64) whose results 2 were limited. What is more relevant is that Cayley
was attracted to the subject by Boole's work and he interested Sylvester in
the subject. They were joined by George Salmon (1819-1904), who was a
professor of mathematics at Trinity College in Dublin from l8'10 to 1866 and
the., bec"-e a professor of divinity at that institution. These three men did
so much work on invariants that in one of his letters Hermite dubbed them
the invariant trinity.
In l84l Cayley began to publish mathematical articles on the algebraic
side ofprojective geometry. The lB4l paper ofBoole suggested to Cayley the
computation of invariants of nth degree homogeneous functions. He called
the invariants derivatives and then hyperdeterminants; the term invariant is
due to Sylvester.s Cayley, employing ideas of Hesse and Eisenstein on deter-
minants, developed a technique for generating his "derivatives'" Then he
published ten papers on quantics in the Philosophieal Transactions from 1854
and
8s:
l;,:, I
are invariants. To these results Sylvester and Salmon added many more.
Another contributor, Ferdinand Eisenstein, who was more concerned
with the theory of numbers, had already found for the binary cubic form 5
l: *? * Sbxlx, + 3cxrxl + dxl
that the simplest covariant of the second degree is its Hessian 11 and the
simplest invariant is
3b2c2 + 6abcd - 4bsd - 4ac3 - a'd',
which is the determinant of the quadratic Hessian as well as the discriminant
ofl Also the Jacobian of f and FI is another covariant of order three. Then
Siegfried Heinrich Aronhold (l8l9-84), who began work on invariants in
1849, contributed invariants for ternary cubic forms.6
The first major problem that confronted the founders ofinvariant theory
was the discovery of particular invariants. This was the direction of the
work from about 1840 to 1870. As we can see, many such lunctions can be
constructed because some invariants such as the Jacobian and the Hessian
are themselves lorms that have invariants and because some invariants
taken together with the original form give a new system of forms that have
simultaneous invariants. Dozens of major mathematicians including the few
we have already mentioned computed particular invariants.
The continued calculation of invariants led to the major problem of
invariant theory, which was raised alter many special or particular in-
variants were found; this was to find a complete system of invariants' What
4. Coll. Math. Papers, 2, 4, 6, 7, 10.
5. Jour..fiir Math.,27, 1844, 89-106, 319-21.
6. Jour. ftir Math., 55, 1858, 97-l9l ; and 62, 1863, 281-345.
THE THEORY OF ALOEBRATC INVARTANTS 929
this means is to find for a form ofa given number ofvariables and degree the
smallest possible number of rational integral invariants and covariants such
that any other rational integral invariant or covariant could be expressed as a
rational integral function with numerical coefficients of this complete set.
Cayley showed that the invariants and covariants found by Eisenstein for the
binary cubic form and the ones he obtained for the binary quartic form are a
complete system for the respective cases.T This left open the question of a
complete system for other forms.
The existence of a finite complete system or basis for binary fiorms of
any given degree was first established by Paul Gordan (1837-1912), who
devoted most of his life to the subject. His results is that to each binary form
-f(rr, rr) there belongs a finite complete system olrational integral invariants
and covariants. Gordan had the aid of theorems due to Clebsch and the result
is known as the Clebsch-Gordan theorem. The proof is long and difficult.
Gordan also proved s that any finite system of binary forms has a finite com-
plete system of invariants and covariants. Gordan's proofs showed how to
compute the complete systems.
Various limited extensions of Gordan's results were obtained during the
next twenty years. Gordan himself gave the complete system for the ternary
quadratic form,lo for the ternary cubic form,ll and for a system of two and
three ternary quadratics. 12 For the special ternary quartic.ri x, + xfu" + xlx,
Gordan gave a complete system of 54 ground forms.13
In l886 Franz Mertens (1840-1927)14 re-proved Gordan's theorem for
binary systems by an inductive method. He assumed the theorem to be true
for any given set of binary forms and then proved it must still be true when
the degree of one of the forms is increased by one. He did not exhibit ex-
plicitly the finite set ofindependent invariants and covariants but he proved
that it existed. The simplest case, a linear form, was the starting point of the
induction and such a form has only powers ol itself as covariants.
Hilbert, after writing a doctoral thesis in lBB5 on invariants,ls in lBBB16
also re-proved Gordan's theorem that any given system ofbinary forms has a
finite complete system of invariants and covariants. His proof was a modifica-
tion of Mertens's. Both proofs were far simpler than Gordan's. But Hilbert's
proof also did not present a process for finding the complete system.
7. Phil. Trans., 146, 1856, l0l-26: Coll. Math, Papers, 2, 250-75.
8. Jour. far Math.,69, 1868, 323-54.
9. Math. Ann.,2, 187O, 227-BO.
10. R. Clebsch and F. Lindemann, Vorlesungen iiber Geometriz, I' 1876' p. 291.
ll. Math. Aru., I,1869,56-89, 90-128.
12. Clebsch-Lindemann, P. 2BB.
13. Math. Ann., 17,1880, 217-33.
14. Jour. Jih Math., 100, 1887, 223-30.
15. Math. Ann.,30, 1887, l5-29 : Ges. Abh.,2, 102-16.
16. Math. Ann.,33, l8B9'223-26: Ges. Abh.,2' 162-64.
ALGEBRAIC GEOMETRY
93O
In 1888 Hilbert astonished the mathematical community by announcing
a totally new approach to the problem of showing that any form of given
degree and given number ofvariables, and any given system of forms in any
given number of variables, have a finite complete system of independent
rational integral invariants and covariants.l? The basic idea of this new
approach was to forget about invariants for the moment and consider the
question: If an infinite system of rational integral expressions in a finite
numbcr ofvariables be given, under what conditions does a finite number ol
these expressions, a basis, exist in terms of which all the others are expressible
as linear combinations with rational integral functions of the same variables
as coefficients ? The answer is, Always. More specifically, Hilbert's basis
theorem, which precedes the result on invariants, goes as follows: By an
algebraic form we understand a rational integral homogeneous function in n
variablcs with coefficients in some definite domain of rationality (field).
Given a collection of infnitely many forms of any degrees in the z variables,
then there is a finite number (a basis) Fr,Fr,...,F such thatanyformFof
the collection can be written as
F : Aft * Afz +...+ A^F^
where r4r, Ar,. . ., in the z variables (not necessarily in
z{,. are suitable forms
thc infinite system) with coefficients in the same domain as the coefficients of
the infinite system.
In the application of this theorem to invariants and covariants, Hilbert's
result states that for any form or system of forms there is a finite number of
rational integral invariants and covariants by means of which every other
rational integral invariant or covariant can be expressed as a linear com-
bination of the ones in the finite set. This finite collection of invariants and
covariants is the complete invariant system.
Hilbert's existence proof was so much simpler than Gordan's laborious
calculation of a basis that Gordan could not help exclaiming, " This is not
mathematics; it is theology." However, he reconsidered the matter and said
later, "I have convinced myself that theology also has its advantages'" In
fact hc himself simplified Hilbert's existence proof.l8
In the 1880s and '90s the theory of invariants was seen to have unified
many areas of mathematics. This theory was the " modern algebra" of the
period. Sylvester said in 1864:10 "As all roads lead to Rome so I find in my
orn r, at least that all algebraic inquiries, sooner or later, end at the
Capitol""""of modern algebra over whose shining portal is inscribed the Theory
of Invariants." Soon the theory became an end in itself, independent of its
17 . Math. Aru.,36,1890, 47!534 : Gcs. Abh.,2, 199-257 and in succceding papers until
'
1893.
18. Nuhrichha Konig. Gcs. det tilks. z/ Gafi., 1899,24M2'
19. Phil. Truns., 154, 1864, 579-666 :
Coll. Math. Polrs, 2, 37G-*79, p' 380'
THE THEORY OT ALGEBRAIC INVARIANTS 93r
She also gave a complete system ofcovariant forms for a ternary quartic, 331
in all. In 1910 she extended Gordan's result to n variables.2l
The subsequent history ofalgebraic invariant theory belongs to modern
abstract algebra. The methodology of Hilbert brought to the fore the ab-
stract theory of modules, rings, hnd fields. In this language Hilbert proved
that every modular system (an ideal in the class of polynomials in n variables)
has a basis consisting of a finite number of polynomials, or every ideal in a
polynomial domain of z variables possesses a finite basis provided that in the
domain of the coefficients of the polynomials every ideal has a finite basis.
20. Jour, fir Math., 134, 1908, 23-90.
21. Jour.fnr Math' 139, l9l I, I 18-54.
g32 ALGEBRAIC GEOMETRY
From l9l I to l9l9 Emmy Noether produced many papers on finite bases
for various cases using Hilbert's technique and her own. In the subsequent
twentieth-century development the abstract algebraic.viewpoint dominated'
As Eduard Study complained in his text on invariant theory' there was lack
of concern for specific problems and only abstract methods were pursued'
, r2x ,: r2y
(7)
x" +y' '
q
x'+g'
-;----:------6.
whete M is (r,y) and M'is (x',y'). Under this transformation circles trans-
form into circles or straight lines and conversely. Inversion is a transforma-
tion that carries the entire plane into itselfand such birational transformations
THE CONCEPT OF BIRATIONAL TRANSFORMATIONS 933
Figure 39.
22. Theorie der Kreisoerwdndschqft (Theory of Inversion), Abh. Knnig. Suh, Ges. dcr Wiss.,2
1855, 529-65 :Werke,2, 243-3+5.
23. Jour. de Math., 10, 1845, 36,1-67.
24. Jour. de Math., 12, 1847, 26190.
25. Gior. di Mat., l,
1863, 305-ll :
Obue, 1,5,t-61; and 3, 1865, 269-40, 363-76 =
Opere, 2, 193-218.
26. Math. Ann., 3, 1871, 165-227 , p. 167 in particular.
g34 ALGEBRATC GEOMETRY
and ra the equationf (w, z) 7 0 represents a curve in the real Cartesian plane.
z and w may still have complex values satisfyin9 -f @, z) : 0 but these are
not plotted. This view of real curves with complex points was already
familiar from the work in projective geometry. To the theory of birational
transformations of the surflace corresponds a theory of birational transforma-
tions of the plane curve. Under the reinterpretation just described the branch
points of the Riemann surface correspond to those points of the curve where
a Iine x : const. meets the curve in two or more consecutive points, that is,
is either tangent to the curve or passes through a cusp. A double point ofthe
curve corresponds to a point on the surface where two sheets just touch
without any further connection. Higher multiple points on curves also
correspond to other peculiarities of Riemann surfaces.
In the subsequent account we shall utilize the following definitions
(Cf. Chap. 23, sec. 3): A multiple point (singular point) P of order t > I of
an zth degree plane curve is a point such that a generic line through P cuts
the curve in n - k points. The multiple point is ordinary if the i tangents at
P are distinct. fn counting intersections ofan nth degree and an zth degree
curve one must take into account the multiplicity of the multiple points on
each curve. If it is i on the curve C" and k on C' and if the tangents at P of
C" are distinct from those of C', then the point of intersection has multi-
plicity it,t. A curve C' is said to be adjoint to a curve C when the multiple
points of C are ordinary or cusps and if C' has a point of multiplicity of
order t - I at every multiple point ofC of order t.
Clebschso first restated Abel's theorem (Chap. 27, sec. 7) on integrals
of the first kind in terms of curves. Abel considered a fixed rational function
.R(x, y) where x and y are related by any algebraic curve;f(r, y) : 0, so that
y is a function of x. Suppose.(Fig. 39.2);t: 0 is cut by another algebraic
curve
4(*,9, or, a2, - . ., a*) : O,
IW**
where / is a polynomial (an adjoint) of precisely the degree a - 3 which
vanishes at the double points and cusps of;[ : Q.
Clebsch's next contributionsl was to introduce the notion of genus as a
concept for classi$ing curves. If the curve has d double points then the genus
p: (l l2)(n - l)(, - 2) - d. Previously there was the notion of the
deficiency of a curve (Chap. 23, sec. 3), that is, the maximum possible
number of double points a curve of flegree n could possess, namely,
(n - l) (n - 2)/2, minus the number it actually does possess. Clebsch
shov"'ed 32 that for curves with only ordinary multiple points (the tangents
are all distinct) the genus is the same as the deficiency and the genus is an
invariant under birational transformation of the entire plane into itself.33
31. Jour. f'nr Math., 64, 1865, 43-65.
32. Jour. fiir Math,, 64, 1865, 98-100.
33. If the multiple (singular) points are of order I then the genus p of a curve C is
(z - l)(z - 2)12 - (l 12) ) rr(r, - l), where the summation extends over all multiple
points. The genus is a more refined concept.
THE UNIFORMIZATION PROBLEM 937
proof. Both Klein and Poincar6 continued to work hard to prove this
theorem but no decisive result was obtained for twenty-five years. In 1907
Poincar€a2 and Paul Koebe (1882-1945)a3 independently gave a proof of
this uniformization theorem. Koebe then extended the result in many
directions. With the theorem on uniformization now rigorously established
an improved treatment of algebraic functions and their integrals has become
possible.
P4:Sllb!s,!e), i:1,2,3,
42. Acta Math.,3l, 1908, 143 : @uwes,4, 70-139.
43. Math. Ann.,67, 1909, 145-224.
ALGEBRAIC GEOMETRY
940
transforms an integral ofthe first kind into an integral ofthe first kind so that
y' is invariant. They also gave new proofs of Abel's theorem (by using
function-theoretic ideas and methods) .
Their work was not rigorous' In particular they, too' in the Pliicker
tradition counted arbitrary constants to determine the number of inter-
section points of a C^ with a C". Special kinds of double points were not
investigited. The significance of the Clebsch-Gordan work for the theory of
algebraic functions was to express clearly in algebraic form such results as
Abel's theorem and to use it in the study of Abelian integrals' They put the
algebraic part of the theory of Abelianintegrals and functions more into the
foreground and in particular establisbLd the theory of transformations on its
own foundations.
Clebsch and Gordan had raised many problems and left many gaps' The
problems lay in the direction of new algebraic investigations for a purely
algebraic theory of algebraic functions. The work on the algebraic approach
was continued by Alexander von Brill and Max Noether from lB71 on; their
key paper was published in lB74.aa Brill and Noether based their theory on a
celebrated residual theorem (Rcstsatz) which in their hands took the place
of Abel's theorem. They also gave an algebraic proof of the Riemann-Roch
theorem on the number of constants which appear in algebraic functions
F(w, z) which become infinite nowhere except in rll prescribed points of
a Co. According to this theorem the most general algebraic function which
fulElls this condition has the form
F : CrFr + CzFs +... * CuFu * Cu*,
where
P:n-?*r,
t is the number of linearly independent functions { (of degree z - 3) which
vanish in the z prescribed points, andp is the genus of the C,. Thus if the C"
is a Cl without double points, thenp : 3 and the {'s are straight lines' For
this case when
m:l,thent :2 and,p.: I -3 +2 :0;
m:2,thent: l, andp:2 -3 + l:0;
m:3,then z: I or0andP: I or 0.
When p : 0 there is no algebraic function which becomes infinite in the
given points. When rn : 3, there is one and only one such function provided
the three given points be on a straight line. If the three points do lie on a
line z : 0, this line cuts the Ca in a fourth point. We choose a line z : 0
through this point and then Ft : ula.
,H. ,,Uber die algebraischen Funktionen und ihre Anwendung in der Geometrie," Math.
Am., 7, 187*,269-310'
THE ALCEBRAIC-GEOMETRIC APPROACH 94r
so that the equation ofany other structure containing the original one can be
represented by
M.FL + ...+ MpFt, : o,
where the M's are arbitrary homogeneous integral forms whose degree must
be so chosen that the left side of the equation is itself homogeneous'
Hilbert following Dedekind called the collection of the MrF, a module
(the term is now ideal and module now is something more general). One can
state Hilbert's result thus: Every algebraic structure of .Rn determines the
vanishing of a finite module.
53. "Uber die Theorie der algebraischen Formen," Mart. Aan.,36, 1890, 47t-53+ =
Gcs. Abh.,2, 199-257 .
54. ComP. Rtnl., 67, 1868, t238-39.
944
double integrals which play the role of Abelian integrals in the theory of
curves. Clebsch noted that flor an algebraic surface;[(.t, y , z) : 0 of degree
zr with isolated multiple points and ordinary multiple lines, certain surfaces
of degree z - 4 ought to play the role which the adjoint curves of degree
m - 3 play with respect to a curve of degree zz. Given a rational function
R(x,g, z) where r,y, and. z are related by;[(r,y, z) :0, if one seeks the
double integrals
I I *o''' z) dx dv'
which always remain finite when the integrals extend over a two-dimensional
region of the four-dimensional surface, one finds that they are of the form
IIwfz**
where Q is a polynomial of degree n - 4.Q : 0 is an adjoint surface which
passes through the multiple lines of;f : 0 and has a multiple line of order
k - I at least in every multiple line ofrf of order,t and has a multiple point
of order q - 2 at least in every isolated multiple point of;fof order 4. Such
an integral is called a double integral ofthe first kind. The number oflinearly
independent integrals of this class, which is the number of essential con-
stants in Q@,y, ,), is called the geometrical genus 1, of f : g. If the surface
has no multiple lines of points
,o:@-r)(*:2)(*-s).
Max Noetherss and Hieronymus G. Zeuthen (1839-1920) 58 proved that p,
is an invariant under birational transformations of the surface (not of the
whole space).
Up to this point the analogy with the theory of curves is good. The
double integrals ofthe first kind are analogous to the Abelian integrals ofthe
first kind. But now a first diflerence becomes manifest. It is necessary to
calculate the number of essential constants in the polynomials Q of degree
z - 4 which behave at multiple points of the surface in such a manner that
the integral remains finite. But one can find by a precise formula the number
of conditions thus involved only for a polynomial of sufficiently large degree
.ly'. If one puts into this formula N : m - 4, one might find a number
diflerent from po. CayleysT called this new number the numerical (arith-
metic) genusl, of the surface. The most general case is where pn : pr. When
55. Math. Ann., 2, 1870,293-316.
56. Moth. Ann., 4, l87l,2l-49.
57. Phil. Tratts., 159, 1869, 201-29 : Coll. Math. Papers, 6, 329-58; and Math. Ann', 3,
l87l, 526-29 : Coll. Math. Paper, 8, 394-97.
THE ALGEBRAIC GEOMETRY OF SURFACES 945
the equality does not hold one hasln < p, and the surface is called irregular;
otherwise it is called regular. Then Zeuthenss and Noetherss established the
invariance of the number 1" when it is not equal to pr.
Picardso developed a theory of double integrals of the second kind.
These are the integrals which become infinite in the manner of
(l 3)
IIW*#)*r
where U and V are rational functions ol x,g, and z andf(x,9, z) : 0. The
number of different integrals of the second kind, different in the sense that
no linear combination of these integrals reduces to the form (13), is finite;
this is a birational invariant ofthe surfacelf: 0. But it is not true here, as in
the case ofcurves, that the number ofdistinct Abelian integrals olthe second
kind is 2p. This new invariant of algebraic surfaces does not appear to be tied
to the numerical or the geometrical genus.
Far less has been accomplished for the theory of surfaces than for curves.
One reason is that the possible singularities of surfaces are much more com-
plicated. There is the theorem ofPicard and Georges Simart proven by Beppo
Levi (1875-1928) s1 that any (real) algebraic surface can be birationally
transformed into a surface free of singularities which must, however, be in a
space of five dimensions. But this theorem does not Prove to be too helpful.
In the case ofcurves the single invariant number, the genusy', is capable
of definition in terms of the characteristics of the curve or the connectivity
of the Riemann surface. In the case ofl(x, y, z) : 0 the number of character-
izing arithmetical birational invariants is unknown.s2 We shall not attemPt
to describe further the few limited results for the algebraic geometry of
surfaces.
The subject of algebraic geometry now embraces the study of higher-
dimensional figures (manifolds or varieties) defined by one or more algebraic
equations. Beyond generalization in this direction, another type, namely, the
use of more general coefficients in the defining equations, has also been
undertaken. These coefficients can be members of an abstract ring or field
and the methods of abstract algebra are applied. The several methods of
pursuing algebraic geometry as well as the abstract algebraic formulation
introduced in the twentieth century have led to sharp differences in language
and methods ofapproach so that one class ofworkers finds it very difficult to
understand another. The emphasis in this century has been on the abstract
algebraic approach. It does seem to offer sharp formulations of theorems and
Bibliograplry
Bater, H. F.: "On Some Recent Advances in the Theory of Algebraic Surfaces,"
Proc. Ian. Math. Soc., (2), 12, l9l2-13, l-40.
Berzolari, L.: "Allgemeine Theorie der hiiheren ebenen algebraischen Kurven,"
hqk. der Math. Wiss., B. G. Teubner, 1903-15, III C4, 313-455.
"Algebraische Transformationen und Korrespondenzen," Ercyk. der
Math. Wiss., B. G. Teubner, 1903-15, III, 2, 2nd hzlf B, 1781-2218. Useful
for results on higher-dimensional figures.
Bliss, G. A. : " The Reduction of Singularities of Plane Curves by Birational
Transformations," Amcr. Math. Soc. Bull.,29, 1923, 16l-83.
Brill, 4., and M. Noether: " Die Entwicklung der Theorie der algebraischen
Funltionen," Jalves. dcr Dcut, Math.-Vcrcin, 3, 1892-93, 107-565.
Castclnuovo, G., and F. Enriques: " Die algebraischen Fliichen vom Gesichtspunkte
der birationalcn Transformationen," Encgk. dcr Marh. Wiss., B. G. Teubner,
l9o3-15, III C6b, 674-76a.
"Sur quelques rdcents rdsultats dans la th6orie des surfaces alg6briques,"
Math. Am.,'18, 1897' 241-916.
Caylcy, A.:
-: Colbctcd Matlpnatical Papcrs, lohnson Reprint Corp., 1963, Vols. 2, 4,
6,7, 10, 189l-96.
Cf,cbsch, R. F. A. : " Versuch einer Darlegung und Wi-irdigung seiner Wissen-
schaftlichen Leistungen," Math. Ann.,7, 1874, l-55. An article by friends
of Clebsch.
Cootidge, Julian L.: A History of Gconutrical Methods, Dover (reprint), 1963, pp.
t95-230,278-92.
Cremona, Lt:j,gi: Ofuc mallumaticlu,3 vols., Ulrico Hoepli, l9l,t-17.
Henscl, Kurt, and Georg Landsberg: Thcorie dcr algebraischen Futktionen cirur
Variabch (1902), Chelsea (reprint), 1965, pp. 694-702 in particular.
Hilbcrt, David,:. Gesamnullc Abhandhmgcn, Julius Springer, 1933, Vol. 2.
Klein, Felix: Vmhsungcn iiber dir Entwickhmg dcr Matlumatik i.m 19 Jahrhundtrt, l,
155-66, 295-3 l9 ; 2, 2-26, Chelsea (reprint), 1950.
Meyer, Franz W. : " Bericht iiber den gegenwiirtigen Stand der Invariantentheorie,"
Jahrcs. dcr Deut. Malh.-Vcrcin., l, 1890-91, 79-292.
National Research Council: Scleckd Topics in Algcbraic Geomctry, Chelsea (reprint),
1970.
Noether, Emmy: "Die arithmetische Theorie der algebraischen Funktionen einer
Veriinderlichen in ihrer Beziehung zu den iibrigen Theorien and zu der
Zahlentheorie," Jahres. da Deul. Math.-Vcreiz., 28, 1919, 182-203.
+o
The Instillation of Rigor in Analysis
But it would be a serious error to think that one can find
certainty only in geometrical demonstrations or in the
testimony of the senses. A. L. cAUcItY
l. Introduction
By about lB00 the mathematicians began to be concerned about the looseness
in the concepts and proofs ofthe vast branches of analysis. The very concePt
ol a function was not clear; the use of series without regard to convergence
and divergence had produced paradoxes and disagreements; the controversy
about the representations offunctions by trigonometric series had introduced
further confusion I and, of course, the fundamental notions of derivative and
integral had never been properly defined. All these difficulties finally brought
on dissatisflaction with the logical status of analysis.
Abel, in a letter of 1826 to Professor Christoffer Hansteen,l complained
about " the remendous obscurity which one unquestionably finds in analysis'
It lacks so completely all plan and system that it is peculiar that so many men
could have studied it. The worst ofit is, it has never been treated stringently.
There are very few theorems in advanced analysis which have been demon-
strated in a logically tenable manner. Everywhere one finds this miserable
way ofconcluding from the special to the general and it is extremely peculiar
that such a procedure has led to so few of the so-called paradoxes'"
Several mathematicians resolved to bring order out ofchaos' The leaders
of what is often called the critical movement decided to rebuild analysis
solely on the basis of arithmetical concepts. The beginnings of the movement
coincide with the creation of non-Euclidean geometry. An entirely difierent
group, except for Gauss, was involved in the latter activity and it is therefore
diffi"rtt to t.o"" any direct connection between it and the decision to found
analysis on arithmetic. Perhaps the decision was reached because the hope of
l. CEuarcs,2, 263-65.
947
g4B rHE TNSTTLLATTON OF RrGOR rN ANALYSTS
concepts for the establishment of the calculus (except for a theory of real
numbers), but his work went unnoticed for half a century. He denied the
existence of infinitely small numbers (infinitesimals) and infinitely large
numbers, both of which had been used by the eighteenth-century writers. In
a book of I8l7 whose long title starts with Rein analytischer Beweis (see the
bibliography) Bolzano gave the proper definition of continuity, namely,
,f(x) is continuous in an interval if at any ,r in the interval the difference
f(x + a) - f(x) can be made as small as one wishes by taking ro sufficiently
small, He proves that polynomials are continuous.
Cauchy, too, tackled the notions of limit and continuity. As with
Bolzano the limit concept was based on purely arithmetical considerations.
In the Cours ( I 82l ) Cauchy says, " When the successive values attributed to a
variable approach indefinitely a fixed value so as to end by differing from it
by as little as one wishes, this last is called the limit of all the others. Thus,
for example, an irrational number is the limit of diverse fractions which
furnish closer and closer approximate values of it." This example was a bit
unfortunate because many took it to be a definition of irrational numbers in
terms of limit whereas the limit could have no meaning if irrationals were
not already present. Cauchy omitted it in his lB23 and lB29 works.
In the preface to his l82l work Cauchy says that to speak of the con-
tinuity of functions he must make known the principal ProPerties of in-
finitely small quantities. " One says fCours, p. 5] that a variable quantity
becomes infinitely small when its numerical value decreases indefinitely in
such a way as to converge to the limit 0." Such variables he calls infinites-
imals. Thus Cauchy clarifies Leibniz's notion of infinitesimal and frees it of
metaphysical ties. Cauchy continues, " One says that a variable quantity
becomes infinitely large when its numerical value increases indefinitely in
such a manner as to converge to the limit @." However, o means not a fixed
quantity but something indefinitely large.
Cauchy is now prepared to define continuity ofa function. In the Cours
(pp. 34-35) he says. "Letf(x) be a function of the variable r, and suppooe
that, for each value of .r intermediate between two given limits [bounds]' this
function constantly assumes a finite and unique value. If, beginning with a
value of x contained between these limits, one assigns to the variable * an
infinitely small increment c, the function itself will take on as an increment
the difference;f(: + c) -;t(x) which will depend at the same time on the
new variable cl and on the value of*. This granted, the function;f(x) will be'
between the two limits assigned to the variable x, a continuous function of the
variable if,, for each value of , intermediate bettveen these two limits, the
numerical value of the differencerf('r + u) - f (x) decreases indefinitely with
that of cr. In other words, tlu fwution f(x) will tcmain continuotr with rcspcct tt x
between tlu giaen limits, if,, betuen tluse limi*, an ilfiniuly snull hwcnvnt of tlu
aarinble alwags prodrues an irfrnitely small incremcnt of tlu fwctioil itsclf.
952 THE INSTTLLATTON OF RrCOR IN ANALYSIS
ll. Considery: sin(l/x) forxf O and y : Q for r : 0. This function goes through all
values from one assumed at a negative value of x to one assumed at a positive value of x,
However, it is not continuous at r: 0.
FUNCTIONS AND THEIR PROPERTIES 953
and introduced the theorem that if for z large enough we can make the
difference Fn*, - F.n less than any given quantity, no matter how large r is,
then there exists a fixed magnitude X such that the sequence comes closer
and closer to X, and indeed as close as one wishes. His determination of the
quantity X was somewhat obscure because he did not have a clear theory of
the real number system and of irrational numbers in particular on which to
build. However, he had the idea of what we now call the Cauchy condition
for the convergence of a sequence (see below).
In the course of the proof Bolzano established the existence of a least
upper bound for a bounded set of real numbers. His precise statement is: If
a property M does not apply to all values ofa variable quantity x, but to all
those that are smaller than a certain z, there is always a quantity U which is
the largest of those of which it can be asserted that all smalleru Possess the
property M.The essence of Bolzano's proof of this lemma was to divide the
bounded interval into two parts and select a Particular part containing an
infinite number of members of the set. He then repeats the Process until he
closes down on the number which is the least upper bound of the given set
of real numbers. This method was used by Weierstrass in the l860s, with
due credit to Bolzano, to prove what is now called the Weierstrass-Bolzano
theorem. It establishes for any bounded infinite set ofpoints the existence ofa
point such that in every neighborhood of it there are points of the set.
Cauchy had used without proof (in one of his proofs of the existence of
roots of a polynomial) the existence of a minimum of a continuous function
defined over a closed interval. Weierstrass in his Berlin lectures proved for
any continuous function of one or more variables defined over a closed
bounded domain the existence of a minimum value and a maximum value
of the function.
In work inspired by the ideas of Georg Cantor and Weierstrass, Heine
12 and
defined uniform continuity for functions of one or several variables
then proved 13 that a function which is continuous on a closed bounded
interval ol the real numbers is uniformly continuous. Heine's method
introduced and used the following theorem: Let a closed interval la, bl arrd
a countably infinite set A of intervals, all in [a, D], be given such that every
point x of a < x < b is an interior point of at least one of the intervals of A.
(The endpoints a and D are regarded as interior points when a is the left-
hand end ofan interval and D the right-hand end ofanother interval.) Then a
set consisting of a fnite number of the intervals of A has the same ProPerty'
namely, every point ofthe closed interval [a, D] is an interior Point ofat least
one of this finite set of intervals (a and, b can be endpoints).
Emile Borel (l87l-1956), one of the leading French mathematicians of
this century, recognized the importance of being able to select a finite
number of covering intervals and first stated it as an independent theorem
12. Jour. far Math.,7l, 1870, 353-65.
13. Jour..fir Morh.,74, 1872, 172-88.
954 Tr{E TNSTTLLATTON OF RrGOR rN ANALYSTS
for the case when the original set of intervals A is countable.la Though many
German and French mathematicians refer to this theorem as Borel's, since
Heine used the property in his proof of uniform continuity the theorem is also
known as the Heine-Borel theorem. The merit of the theorem, as Lebesgue
pointed out, is not in the proof of it, which is not difficult, but in perceiving
its importancc and enunciating it as a distinct theorem. The theorem applies
to closcd sets in any number of dimensions and is now basic in set theory.
The extension of the Heine-Borel theorem to the case where a finite set
of covering intervals can be selected from an uncountably infinite set is
usually credited to Lebesgue who claimed to have known the theorem in
1898 and published it in his hgons *r l'intlgration (l90t). However, it was
first published by Pierre Cousin (1867-1933) in 1895.15
3. Tlu Deriaatiae
D'Alembert was the first to see that Newton had essentially the correct
notion of tJre derivative. D'Alembert says explicitly in the Ercgclopidic that
the dcrivative must be based on the limit of the ratio of the differenccs of
dependent and independent variables. This version is a reformulation of
Newton's prime and ultimate ratio. D'Alembert did not go further because
his thoughts were still tied to geometric intuition. His successors of the next
fifty years still failed to give a clear definition ofthe derivative. Even Poisson
believcd that there are positive numbers that are not zero, but which are
snallcr than any given number however small.
Bolzano was the 6rst (l8l7) to define the derivative of;t(x) as the
quantity /'(*) which the ratio Nf@ + Lx) --f(*)ll&* approaches in-
definitely closely as A* approaches 0 through positive and negative values.
Bolzano emphasized that J'@) was not a quotient of zeros or a ratio of
evanGcent quantities but a number which the ratio above approached,
llnhis RlsmC dts hgonsLo Cauchy defined the derivative in the same man-
ner as Bolzano. He then unified this notion and the Leibnizian differentials
by dcfining dx to be any finite quantity and. dg to be f'(x) d*.1? In other
words, one introduces two quantities dx and. dy whose ratio, by definition, is
;f'(x). Difierentials have meaning in terms of the derivative and are merely
an auxiliary notion that could be dispensed with logically but are convenient
as a way ofthinking or writing. Cauchy also pointed out what the differential
cxpressions used throughout the eighteenth century meant in terms of
derivatives.
He then clarified the relation between Ay/A* andf'(x) through the mean
value theorem, that is, Ay : f'(x + 0A*) A.r,whereO < 0 < l.Thetheorem
itself was known to Lagrange (Chap. 20, sec. 7). Cauchy's proofofthe mean
value theorem used the continuity of/'(*) in the interval A*.
Though Bolzano and Cauchy had rigorized (somewhat) the notions of
continuity and the derivative, Cauchy and nearly all mathematicians of
his era believed and many texts "proved" for the next fifty years that a
continuous function must be differentiable (except of course at isolated
points such as x : 0 for g : Llx). Bolzano did understand the distinction
between continuity and differentiability. In his Funktionmlehra, which he
wrote in lB34 but did not complete and publish,ls he gave an example of a
continuous function which has no finite derivative at any Point. Bolzano's
example, like his other works, was not noticed.le Even if it had been published
in 1834 it probably would have made no impression because the curve did
not have an analytic representation, and for mathematicians of that period
functions were still entities given by analytical expressions.
The example that ultimately drove home the distinction between con-
tinuity and differentiability was given by Riemann in the Habilitations-
schrift, the paper of 1854 he wrote to qualify as a Priaatdozent at Giittingen,
" LJber die Darstellbarkeit einer Function durch eine trigonometrische
Reihe," (On the Representability of a Function by a Trigonometric Series).2o
(The paper on the foundations of geometry (Chap. 37, sec. 3) was given as a
qualifying lccture.) F':iernann defined the following function. Let (x) denote
the difference between .r and the nearest integer and let (x) : 0 if it is
halfway between two integers. Then - ll2 < (x) < l/2. Now;t(r) is defined
as
r@):+ *? *$ *
This series converges for all values of *. However, for x : pl2n where p is an
integer prime to 2n, f (x) is discontinuous and has a jump whose value is
# l\nz. At all other values of *, ;[(r) is continuous. Moreover, 1f(.r) is dis-
continuous an infinite number of times in every arbitrarily small interval.
Nevertheless, ;t(x) is integrable (sec. 4). Moreover, f (r) : J,f{") A, ir
"""-
tinuous for all x but fails to have a derivative where;[(x) is discontinuous.
This pathological function did not attract much attention until it was
published in
1868.
An even more striking distinction between continuity and differentia-
bility was demonstrated by the Swiss mathematician Charles Cell6rier
18. Schiftcx, l, Prague, 1930. It was edited and publishcd by K. Rychlik, Prague, 1930.
19. In 1922 Rychlik proved that the function was nowhere differentiablc. Sce Gcrhard
Kowalcwski, "Uber Bolzanos nichtdiffercnzierbare stetige Funktion," Acta Math., 44,
1923, 315-19. This article contains a description of Bolzano's function.
20. Abh. der Gcs. dcr Wkl zu Gdtt., 13, 1868, 87-132 : Wcrkc,227-$4.
956 THE rNsrrLLATIoN oF RIcoR rN ANALYSIS
f@:;a-ositranx
in which a is a large positive integer. This was not published, however, until
1890.'r The example that attracted the most attention is due to Weierstrass.
As far back as 186l he had affirmed in his lectures that any attemPt to
prove that differentiability follows from continuity must fail. He then gave
the classic example of a continuous nowhere differentiable function in a
lecture to the Berlin Academy on July 18, 1872.22 Weierstrass communicated
his example in a letter of lB74 to Du Bois-Reymond and the example was
first published by the latter.23 Weierstrass's function is
4. The Intcgral
Newton's work showed how areas could be found by reversing diflerentiation.
This is of course still the essential method. Leibniz's idea of area and volume
as a "sum" of elements such as rectangles or cylinders [the definite integral]
was neglected. When the latter concept was employed at all in the eighteenth
century it was loosely used.
Cauchy stressed defining the integral as the limit ofa sum instead ofthe
inverse of differentiation. There was at least one major reason for the change.
Fourier, as we know, dealt with discontinuous functions, and the formula for
the coefficients of a Fourier series, namely,
,li->/(f,)kt-tct-r),
where (, is any value of x in[*,-r, *,]. The definition presupposes that/(r) is
continuous over [xo, Xf and that the length of the largest subinterval
approaches zero. The definition is arithmetical. Cauchy shows the integral
exists no matter how the .r, and f1 are chosen. However, his proof was not
rigorous because he did not have the notion ofuniform continuity. He denotes
the limit by the notation proposed by Fourier IX,f@ a, in place of
[rut arff::i1
often employed by Euler for antidifferentiation.
Cauchy then defines
F(,) : li"tat or
and shows that F(x) is continuous in [xo, X]. By forming
F(x + etti----F//'\ :;).
t rz+h
-f(*) d*
and using the mean value theorem for integrals Cauchy proves that
F'(x) : f(x).
25. Riaurui,8l-a4 - @uaru, (2), 4, 122-27.
958 THE rNsrrLLATroN oF RrGoR rN ANALYS$
[rc]*:!-*ldx+c.
He points out that
!'t'at:-f(o) --f(o)
presupposes ;f'(*) continuous. Cauchy then treats the singular (improper)
integrals where;f(r) becomes infinite at some value of * in the interval of
integration or where the interval of integration extends to co. For the case
whererf(r) has a discontinuity at , : c at which value;f(r) may be bounded
or not Cauchy defines
llrat *: lHJ.-"'r@) dx +
I\l".""tat
o.
when these limits exist. When er : s2 we get what Cauchy called the principal
value.
The notions of area bounded by a curve, length of a curve, volume
bounded by surfaces and areas of surfaces had been accepted as intuitively
understood, and it had been considered one of the great achievements of the
calculus that these quantities could be calculated by means ofintegrals. But
Cauchy, in keeping with his goal of arithmetizing analysis, dtfined these
geometric quantities by means of the integrals which had been formulated
to calculate them. Cauchy unwittingly imposed a limitation on the concePts
he defined because the calculus formulas impose restrictions on the quantities
involved. Thus the formula for the length of arc of a curve given by S : "f(*)
is
,:l'47aaY6
and this formula presupposes the differentiability of;f(r). The question of
what are the most general definitions ofareas, lengths ofcurves, and volumes
was to be raised later (Chap. 42, sec. 5).
Cauchy had proved the existence of an integral for any continuous
integrand. He had also defined the integral when the integrand has jump
discontinuities and infinities. But with the growth of analysis the need to
consider integrals of more irregularly behaving functions became manifest.
The subject ofintegrability was taken up by Riemann in his paper of 1854 on
trigonometric series. fle says that it is important at least for mathematics,
though not for physical applications, to consider the broader conditions
under which the integral formula for the Fourier coefrcients holds.
TTIE INTEGRAL 959
s: j/{,,)a*,,
l=1
where *, is any value of r in Ax,, approach a unique limit (that the integral
exists) as the maximum Ar, approaches zero is that the sum of the intervals
A*, in which the oscillation of/(r) is greater than any given number I must
approach zero with the size of the intervals.
Riemann then points out that this condition on the oscillations allows
him to replace continuous functions by functions with isolated discontinuities
and also by functions having an everywhere dense set of points of dis-
continuity. In fact the example he gave of an integrable function with an
infinite number of discontinuities in every arbitrarily small interval (sec. 3)
was offered to illustrate the generality of his integral concept. Thus Riemann
dispensed with continuity and piecewise continuity in the definition of the
integral.
In his 1854 paper Riemann with no further remarks gives another
necessary and sufficient condition that a bounded function;f(x) be integrable
on la, b). It amounts to first setting up what are now called the upper and
lower sums
S : MrAxl *. .'* MnMo
s : mrLtt 1"'* mnLxn
where z, ar,d, M1 are the least and greatest values of ;f(.r) in A*,. Then
letting D, : Mr - n1, Riemann states that the integral of J@) over la, bf
exists if and only if
for all choices of A.r, filling out the interval [a, D]. Darboux completed this
formulation and proved that the condition is necessary and sufficient.2?
There are many values of S each corresponding to a partition of [a, D] into
Ax,. Likewise there are many values of .r. Each S is called an uPper sum and
each s a lower sum. Let the greatest lower bound of the S be J and the least
upper bound of the s be .L It follows that / < J. Darboux's theorem then
states that the sums S and s tend respectively to "I and I when the number of
26. For brevity we use Ar1 for the subintervals and their length.s.
27. Ann. dcI'kole Norm. Sup., (2), 4, 1875, 57-112.
960 THE INSTTLLATIoN oF RrGoR rN ANALYSIS
dx:f(h) - f(o)
['ral
when .,7f is merely integrable in the Riemann-Darboux sense. Darboux's
argrrment was that
5. Infinite Series
The eighteenth-century mathematicians used series indiscriminately. By the
end of the century some doubtful or plainly absurd results from work with
infinite series stimulated inquiries into the validity of operations with tlem.
Around l8l0 Fourier, Gauss, and Bolzano began the exact handling of
infinite series. Bolzano stressed that one must consider convergence and
criticized in particular the loose proof of the binomial theorem. Abel was the
most outspoken critic of the older uses of series.
In his lSll paper and his Analytical Theory of l/aat Fourier gave a
satislactory definition of convergence of an infinite series, though in general
he worked freely with divergent series. In the book (p. 196 of the English
edition) he describes convergence to mean that as z increases the sum ofz
terms approaches a fixed value more and more closely and should differ from
it only by a quantity which becomes less than any given magnitude. More-
over, he recognized that convergence ofa series offunctions may obtain only
Bolzano in his lBlT publication had the correct notion of the condition
for the convergence of a sequence, the condition now ascribed to Cauchy.
Bolzano also had clear and correct notions about the convergence of series.
But, as we have already noted, his work did not become widely known.
Cauchy's work on the convergence of series is the first extensive sig-
nificant treatment of the subject. ln his Cours d'analyse Cauchy says, " Let
Sa: Uo + ur + uz *... * ut_t
be the sum of the first z terms [of the infinite series which one considers], z
designating a natural number. If, for constantly increasing values of z, the
sum rn approaches indefinitely a certain limit .r, the series will be called
conoergent, and the limit in question will be called the surn of the series.ss On
the contrary, if while a increases indefinitely, the sum s,, does not approach
a fixed limit, the series will be called dioergent and will have no sum."
After defining convergence and divergence Cauchy states (Cozrs, p. 125)
the Cauchy convergence criterion, namely, a sequence {So} converges to a
limit S if and only if S,,*, - So can be made less in absolute value than any
assignable quantity for all r and sufficiently large a. Cauchy proves this
condition is necessary but merely remarks that if the condition is fulfilled, the
convergence ofthe sequence is assured. He lacked the knowledge ofproperties
of real numbers to make the proof.
Cauchy then states and proves specific tests for the convergence of
series with positive terms. He points out that 4, must approach zero. Another
test (Cours, 132-35) requires that one find the limit or limits toward which the
expression (u,,)l/n tends as z becomes infinite, and designate the greatest of
these limits by,t. Then the series will be convergent if /c < I and divergent
if tr > l. He also gives the ratio test which uses lil:ro-- uoarlun If this limit
is less than I the series converges and if greater than l, the series diverges.
Special tests are given if the ratio is l. There follow comparison tests and a
logarithmic test. He proves that the sum un + t,n of two convergent series
converges to the sum of the separate sums and the analogous result for prod-
uct. Series with some negative terms, Cauchy shows, converge when the series
of absolute values of the terms converge, and he then deduces Leibniz's test
for alternating series.
Cauchy also considered the sum of a series
Lagrange was the first to state Taylor's theorem with a remainder but
Cauchy in his 1823 and lB29 texts made the important point that the infinite
Taylor series converges to the function from which it is derived if the re-
mainder approaches zero. He gives the example s- x' 1 s- 1tx2 of a function
whose Taylor series does not converge to the function. In his lB23 text he
gives the example e-rtxz of a function which has all derivatives at * : 0 but
has no Taylor expansion around * : 0. Here he contradicts by an example
Lagrange's assertion in lis Thiorie dzs fonetions (Ch. V., Art. 30) that if
/(*) has at *6 all derivatives then it can be expressed as a Taylor series which
converges to;f(*) for x rl.eal. xo. Cauchy also gaveao an alternative form for
the remainder in Taylor's formula.
Cauchy here made some additional missteps with respect to rigor. In his
Cours d'analyse (pp. t 3l-32) he states that F(.r) is continuous if when
.F(*) : Ifl z"(*) the series is convergent and the un(x) arc continuous. In his
Rlstml dts hgonsaL he says that if the u,(x) are continuous and the series
converges then one may integrate the series term by term; that is,
fb
I Fdz: ZI""r-.
t" 1
He overlooked the need for uniform convergence. He also asserts for con-
tinuous functions a2 that
d,c:
*tto,u) tg^*
Cauchy's work inspired Abel. Writing to his former teacher Holmboe
from Paris in 1826 Abel said aa Cauchy "is at present the one who knows how
mathematics should be treated." In that yearaa Abel investigated the domain
of convergence of the binomial series
(2)
. sin 2x sin 3x
srn.r-l-+--3-...
which is discontinuous when x : (2n + l)z and z is integral, though the
individual terms are continuous.a6 Then by using the idea of rniform
convergence he gave a correct proof that the sum ofa uniformly convergent
series of continuous functions is continuous in the interior of the interval of
convergence. Abel did not isolate the property of uniform convergence ofa
series.
The notion of uniform convergence of a series 2f u"(x) requires that
given any e, there exists an try' such that for all z > lf, lS(x)
- \i u^(x)l < e
for all x in some interval. S (.r) is of course the sum of the series. This notion
was recognized in and for itself by Stokes, a leading mathematical physicist,az
and independently by Philipp L. Seidel (1821-96).48 Neither man gave the
precise formulation. Rather both showed that if a sum of a series of con-
tinuous functions is discontinuous at xo then there are values ofx near ro for
which the series converges arbitrarily slowly. Also neither related the need
for uniform convergence to the justification of integrating a series term by
term. In fact, Stokes acceptedae Cauchy's use of term-by-term integration.
Cauchy ultimately recognized the need for uniform convergenceso in order
to assert the continuity ofthe sum ofa series ofcontinuous functions but even
he at that time did not see the error in his use of term-by-term integration of
series.
Actually Weierstrass 51 had the notion of uniform convergence as early
as 1842. fn a theorem that duplicates unknowingly Cauchy's theorem on the
existence of power series solutions of a system of first order ordinary dif-
ferential equations, he affirms that the series converge uniformly and so
constitute analytic functions of the complex variable. At about the same time
45. CEuores, l, 224.
46. The series (2) is the Fourier expansion ofr/2 in the interval -zr < x < a. Hcncc the
series represents the periodic function which is r/2 in each 2z intcrval. Thcn thc scrics
converges to rl2 wheIf, r approaches (2n * l)r from the left and thc serics convcrgcl to
-zl2 whera r approaches (2n * l)z from the right.
^i0c.,85, 18.18,53H3: Math. and Phys. Papcrs, 1,2rc4l3.
47. Trars. Camb. Phil.
48. Abh. der Bager. Akad. der Wiss., 1847149,379-94.
49. Papers, 1,2+2,255,268, and 283.
50. Comp. Rerd.,36, 1853, 45,+-59: (Ew,rcs, (l), 12, 30-36.
51. Werke, l, 67-45.
966 THE rNs'flLLATroN oF RrcoR rN ANALYSTS
Weierstrass used the notion of uniform convergence to give conditions for the
integration ofa series term by term and conditions for differentiation under
the integral sign.
Through Weierstrass's circle of students the importance of uniform
convergence was made known. Heine emphasized the notion in a paper on
trigonometric series.63 Heine may have learned of the idea through Georg
Cantor who had studied at Berlin and then came to Halle in 1867 where
Heine was a professor of mathematics.
'Weierstrass
During his years as a high-school teacher also discovered
that any continuous function over a closed interval of the real axis can be
exprcsscd in that interval as an absolutely and uniformly convergent series
of polynomials. Weierstrass included also functions of several variables.
This rcsult 53 aroused considerable interest and many extensions of this
result to the representation of complex functions by a series of polynomials or
a series of rational functions were established in the last quarter ol the
ninctcenth ccntury.
It had been assumed that the terms ofa series can be rearranged at will.
In a papcr of 18375{ Dirichlet proved that in an absolutely convergent series
one may group or rearange terms and not change the sum. He also gave
exarnples to show that the terms ofany conditionally convergent series can be
rearrauged so that the sum is altered. Riemann in a paper written in lB54
(see below) proved that by suitable rearrangement ofthe terms the sum could
be any given number. Many more criteria for the convergence of infinite
scries were developed by leading mathematicians from the lB30s on through-
out the rest of the century,
6. Fouricr Snizs
As we know Fourier's work showed that a wide class of functions can be
rcprcsentd by trigonometric series. The problem of finding precise con-
ditions on the functions which would possess a convergent Fourier series
rcmained open. Efforts by Cauchy and Poisson were fruitless.
Dirichlet took an interest in Fourier series after meeting Fourier in
Paris during the years 1822-25. In a basic paper "Sur la convergence des
sdries trigonomdtriques" 55 Dirichlet gave the first set of sufuimt conditions
that the Fourier series representing a given;f(x) converge and converge to
;t(r). The proof given by Dirichlet is a refinement ofthat sketched by Fourier
in the concluding sections of his Analytical Tluory oJ Heat. Consider;[(*) either
52. Jour.ftu Math.,7l' 1870, 353-65.
53- Sitzungsbcr. Akatl. Wbs. zu Boilin, 1885, 633-39, 789-905 :
Weilse' 3' l-??.
54. Abh, Kdnig. Akad. dq Wiss., Berlin, 1837, 45-gl = Wetkc, l' 313-342 : Jour. dc Math,,
4, 1839,393422.
55. Jotr. ftu Math., 4, 1829, 157-69 : Werke, l, ll7-32.
FOURIER SERIES s67
given periodic with period 2zr or given in the interval f-n, n) and defined to
be periodic in each interval of length 2z to the left and right of l-n, r].
Dirichlet's conditions are :
(") -f(*) is single-valued and bounded.
(b) is piecewise continuous; that is, it has only a finite number of
"f(r)
discontinuities in the (closed) period.
@) J@) is piecewise monotone; that is, it has only a finite number of
maxima and minima in one period.
approach zero as n tends to infinity. The theorem showed too that for
bounded and integrable;t(x) the convergence olits Fourier series at a point
in [-zr, zr] depends only on the behavior of/(*) in the neighborhood ofthat
point. However, the problem offinding necessary azd sufficient conditions on
/(r) so that its Fourier series converges to .it(r) was not and has not been
solved.
Riemann opened up another line of investigation. He considered
higonometric series but did not require that the coefficients be determined by
the formula (3) for the Fourier coefficients. He starts with the series
.o.3
(4)
)c"sin * *? + f bncosnx
t1
and defines
P1 : )e^6'
a=0
Of course ;1"(r) has a value only for those values of r for which the series
converges. Let us refer to the series itself by O. Now the terms of
() may
apploach zero for all r or for some f,. These two cases are treated separately
byRiemann.
Ifao and 6,n approach zero, the terms of f) approach zero for all x. Let
F(x) be the function
r(,): c+c'x*n"+- A,-+- -#
which is obtained by two successive integrations of O. Riemann shows that
J0(.r) converges for all r and is continuous in *. Then F(*) can itself be
integrated. Riemann now proves a number of theorems about F(r) which
lead in turn to necessary and sufficient conditions for a series ofthe form (4)
to converge to a given function;f(x) of period 2zr. He then gives a necessary
and sufficient condition that the trigonometric series (4) converge at a
particular value of*, with 4,, and D,, still approaching 0 as n approaches co'
Next he considers the alternate case where limn- - An depends on the
value of .r and gives conditions which hold when the series cl is convergent
for particular values of; and a criterion for convergence at particular values
of ,.
He also shows that a given /(.r) may be integrable and yet not have a
Fourier series representation. Further there are nonintegrable functions to
FOURTER SERTES 969
which the series Q converges for an infinite number of values of # taken
between arbitrarily close limits. Finally a trigonometric series can converge
for an infinite number of values of r in an arbitrarily small interval even
though an and Dn do not approach zero for all r.
The nature of the convergence of Fourier series received further atten-
tion after the introduction of the concept of uniform convergence by Stokes
and Seidel. It had been known since Dirichlet's time that the series were, in
general, only conditionally convergent, if at all, and that their convergence
depended upon the presence ofpositive and negative terms. Heine noted in a
paper of 18705? that the usual proof that a bounded;f(x) is uniquely
represented between -z and zr by a Fourier series is incomplete because the
series may not be uniformly convergent and so cannot be integrated term by
term. This suggested that there may nevertheless exist nonuniformly con-
verging trigonometric series which do represent a function. Moreover, a
continuous function might be representable by a Fourier series and yet the
series might not be uniformly convergent. These problems gave rise to a new
series of investigations seeking to establish the uniqueness ofthe representa-
tion of a function by a trigonometric series and whether the coefficients are
necessarily the Fourier coemcients. Heine in the above-mentioned paper
proved that a Fourier series which represents a bounded function satisrying
the Dirichlet conditions is uniformly convergent in the portions of the interval
l- o, n) which remain when arbitrarily small neighborhoods of the points of
discontinuity of the function are removed from the interval. In these neigh-
borhoods the convergence is necessarily nonuniform. Heine then proved that
if the uniform convergence just specified holds for a trigonometric series
which represents a function, then the series is unique.
The second result, on uniqueness, is equivalent to the statement that if
a trigonometric series of the form
do-.S
(s) (ao cos nx * bnsin nx)
is uniformly convergent and represents zero where convergent, that is, except
on a finite set P ofpoints, then the coemcients are all zero and ofcourse then
the series represents zero throughout l-zr, n).
The problems associated with the uniqueness of trigonometric and
Fourier series attracted Georg Cantor, who studied Heine's work. Cantor
began his investigations by seeking uniqueness criteria for trigonometric
series representations of functions. He proved sB that when rf(*) is repre-
sented by a trigonometric series convergent for all x, there does not exist a
different series of the same form which likewise converges for every * and
57. Jour. fir Math.,7l,l870, 353-65.
58. Jour. filr Math., 72, 1870, 139-42 : Ges. Abh., 8V83.
97O THE TNSTTLLATTON OF RIGOR rN ANALYS$
repnBents the same rf(*). Another paper6e gave a better proof for this last
result.
The uniqueness theorem he proved can be restated thus: If, for all *,
there is a convergent rePresentation of zero by a trigonometric series, then
the coefficients an and bo are zero. Then Cantor demonstrates in the lBTl
paper that the conclusion still holds even if the convergence is renounced for a
finite number of* values. This paper was the first ofa sequence of papers in
60 the
which Cantor treats the sets of exceptional values of *. He extended
uniqueness result to the case where an infinite set of exceptional values is
permitted. To describe this set he fust defined a pointp to be a limit of a set
of p"it tt S if every interval containing p contains infinitely many points of
.S. Thcn he introduced the notion of the derived set of a set of points. This
derived set consists of the limit points of the original set. There is, then, a
second derived set, that is, the derived set of the derived set, and so forth' If
the zth derived set of a given set is a finite set of points then the given set is
said to be of the zth kind or zth order (or ofthe first species). Cantor's final
answer to the question of whether a function can have two different trigono-
metric series representations in the interval l-o, nf or whether zeto canr
have a non-zero Fourier representation is that if in the interval a trigono-
metric series adds up to zero for all x except those of a point set of the zth
kind (at which one knows nothing more about the series) then all the
cocfficients of the series must be zero. In this 1872 paper Cantor laid the
foundation of the theory of point sets which we shall consider in a later
chapt€r. The problem of uniqueness was pursued by many other men in the
last part of the nineteenth century and the early part of the twentieth.sl
For about fifry years after Dirichlet's work it was believed that the
Fourier series of any function continuous in [-2, z] converged to the
function, But Du Bois-Reymond 8e gave an example of a function continuous
in (-2, zr) whose Fourier series did not converge at a Particular point' He
also constructed another continuous function whose Fourier series fails to
convcrge at the Points of an everywhere dense set. Then in 187563 he proved
that if a trigonometric series of the form
converged to/(r) in l-r, o) and ifl(*) is integrable (in a sense even more
general than Riemann's in that;t(*) can be unbounded on a set of the first
59. Jour. f* Math., ?3, 1871,2944: Ges. Abh.,84-a6.
60. Math. Aa*, 5, 1872, 12!-32: Gcs. Abh.,92-1O2.
61. Dctails can befoundin E. W. Hobson, Thc Tluory of Frttuliots oJ a Rcal Variable,Yol' 2,
656-98.
62. Nactvichtctt K6nie. Gu. thr Wiss. zu Gdtl., 1873, 57 LA2.
63. Abh. dcr Bq*. Akad. der Wiss., 12, 1876' I 17-66.
FOTTRTER SERTES g7r
species) then the series must be the Fourier series for;f(*). He also showed 0a
that any Fourier series of a function that is Riemann integrable can be
integrated term by term even though the series is not uniformly convergent.
Many men then took up the problem already answered in one way by
Dirichlet, namely, to give sufficient conditions that a function f (x) have a
Fourier series which converges to f(x). Several results are classical. Jordan
gave a sufficient condition in terms of the concept of a function of bounded
variation, which he introduced.66 Let f(x) be bounded in fa, bf and let
d : to, !(11 . . .5 2to-11 to : b be a mode of division (partition) of this
interval. Letgoryr,. . .tlr-tltt be the values of;t(*) at these points. Then,
for every partition
lY,*, - Y,l.
To every mode of subdividing [a, D] there is a I. When corresponding to all
possible modes of division of [a, D], the sums t have a least upper bound then
;[is defined to be of bounded variation in fa, bf.
Jordan's sufficient condition states that the Fourier series for the integ-
rable function;f(*) converges to
*l-,vav dx :2aZ +f 1
t"t + b1),
and ifl(r) and g(x) and their squares are Riemann integrable then
whete a, bn an, and p, are the Fourier coefficients ofl(r) and g(x) respectively.
Logic sometimes makes monsters. For half a century we have seen a mass
of bizarre functions which appear to be forced to resemble as little as
possible honest functions which serve some purpose. More of continuity,
or less of continuity, more derivatives, and so forth. Indeed from the
point ofview oflogic, these strange functions are the most general; on the
other hand those which one meets without searching for them, and which
follow simple laws appear as a particular case which does not amount to
more than a small corner.
In former times when one invented a new function it was for a
practical purpose; today one invents them purposely to show up defects
in the reasoning of our fathers and one will deduce from them only that.
Charles Hermite said in a letter to Stieltjes, "I turn away with fright and
horror from this lamentable evil offunctions which do not have derivatives."
Another kind of objection was voiced by Du Bois-Reymond.o8 His
concern was that the arithmetization of analysis separated analysis from
geometry and consequently from intuition and physical thinking. It reduced
analysis " to a simple game of symbols where the written signs take on the
arbitrary significance ofthe pieces in a chess or card game."
The issue that provoked the most controversy was the banishment of
divergent series notably by Abel and Cauchy. In a letter to Holmbo6, written
in 1826, Abel says,6e
The divergent series are the invention of the devil, and it is a shame to
base on them any demonstration whatsoever. By using them, one may
draw any conclusion he pleases and that is why these series have produced
so many fallacies and so many paradoxes....I have become prodi-
giously attentive to all this, for with the exception of the geometrical
series, there does not exist in all of mathematics a single infinite series the
sum of which has been determined rigorously. In other words, the things
which are most important in mathematics are also those which have the
lcast foundation.
However, Abel showed some concern about whether a good idea had been
overlooked because he continues his letter thus: "That most of these things
a^re oorrect in spite of that is extraordinarily surprising. I am trying to find
a rea$)n for this; it is an exceedingly interesting question." Abel died young
and so nwcr did pursue the matter.
Cauchy, too, had some qualms about ostracizing divergent series. He
says in the introduction of his Cozrs (1821), "I have been forced to admit
diverse propositions which appear somewhat deplorable, for example, that a
divcrgcnt scries cannot be summed." Despite this conclusion Cauchy
continued to use divergent series as in notes appended to the publication in
1827 7o of a prize paper written in l8l5 on water waves. He decided to look
into the question ofwhy divergent series proved so useful and as a matter of
fact he ultimately did come close to recognizing the reason (Chap. a7).
The Frcnch mathematicians accepted Cauchy's banishment of divergent
scries. But thc English and the Germans did not. In England the Cambridge
school defended the use of divergent serics by appeating to the principle of
pcrmanence of form (Chap. 32, sec. l). In connection with divergent series
the principle was first used by Robert Woodhouse (1773-1827). ln Thz
Priwiphs of Analytir Cahtlation (lBO3, p. 3) he points out that in the equation
(6)
*:l+'+12+...
the equality sign has "a more extendd signification " than just numerical
cquality. flence the equation holds whether the series diverges or not.
Peacock, too, applied the principle ofpermanence of forms to operations
with divergent series.?l On page 267 he says, " Thus since for r < l, (6)
above holds, then forr: I wedogetco : I + I + I +....Forr > l we
get a negative number on the left and, because the terms on the right con-
tinually incrcase, a quantity more than co on the right." This Peacock
accepts. The point he tries to make is that the series can represent ll(l - r)
for all r. He says,
If the operations of algebra be considered as general, and the symbols
which are subject to them as unlimited in value, it will be impossible to
avoid the formation of divergent as well as convergent seriesl and if such
series be considered as the results of operations which are definable,
apart from the series themselves, then it will not be very important to
70. MAn. drs saa. Itrangcrs, l, W27,3-?12; see (Euarus,(l), l, 238, 277,2A6.
7l. Rrprlon lfu Racnt Progrcss and Prcsent Stak oJ Certain Branclus of Aaalytk, Brit. Assu. for
Adv. of Science, 3, 1833, 185-352.
THE STATUS OF ANALYS$ 97s
enter into such an examination of the relation of the arithmetical values
of the successive terms as may be necessary to ascertain their convergence
or divergence I for under such circumstances, they must be considered as
equivalent forms representing their generating function, and as possessing
for the purposes of such operations, equivalent proPerties. . . . The
attempt to exclude the use of divergent series in symbolical operations
would necessarily impose a limit upon the universality of algebraic
formulas and operations which is altogether contrary to the spirit of
science. . . . It would necessarily lead to a great and embarrassing
multiplication of cases: it would deprive almost all algebraical oPerations
of much of their certainty and simplicity.
I
| -r
72. Trans, Camb. Philo. Soc.,8, Part II, 1844, 182'203' pub. 1849.
73. London, 1842, p. 566.
976 THE TNSTTLLATToN oF RrcoR rN ANALysrs
" algebraic " or " syntactical " properties of the function and these properties
are carried beyond the domain of convergence of the element. The process
of analytic continuation uses this fact. Actually there was sound mathe-
matical substance in the concept of divergent series which accounted for
their usefulness. But the recognition ofthis substance and the final acceptance
of divergent series had to await a new theory of infinite series (Chap. 47).
Bibliograplry
Abel, N. H.z (Euores complites,2 vols., 1881, Johnson Reprint Corp., 1964.
Minoial publii d I'occasion du centenaire de sa naissance, Jacob Dybwad, 1902.
Letters to and from Abel.
Bolzano, B.: Rein analytischzr Beueis des Lehrsatzes, dass zwischen je zwei. Wcrtlun, db cin
entgegongesetztes Resultat geudhren, uenigstens eine reele Wurzel der Gleichung
Gottheb Hass, Prague, l8l7 : Abh. Kdnigl. Bdhm. Gcs. dn Wiss., (3),5,
liege,
l8l4-17, pub. l8l8 : Ostwald's Klassikn der exakten Wisscnschafttn #153,
1905, 3-43. Not contained in Bolzano's Schriften.
Paradoxes of the l4fnite, Routledge and Kegan Paul, 1950. Contains a
historical survey of Bolzano's work.
Schriften,5 vols., Kdniglichen Bdhmischen Gesellschaft der Wissenschaften,
r930-48.
Boyer, Carl B.:.The Concepts of the Calculus, Dover (reprint), 1949, Chap. 7.
Burkhardt, H.: "Uber den Gebrauch divergenter Reihen in der Zeit von 1750"-
186O," Math. Ann.,70, l9l l, 169-206.
" Trigonometrische Reihe und Integrale," Etugk. dn Math. Wiss.,ll Al2,
819-1354, B. G. Teubner, 190.t-16.
Cantor, Georg: Gesammclte Abhandlungen (1932), Georg Olms (reprint), 1962.
Cauchy, A. L.: CEuares, (2), Gauthier-Villars, 1897-99, Vols. 3 and 4.
Dauben, J. W.: "The Trigonometric Background to Georg Cantor's Theory of
Sets," Archite for History of Exad Sciences, T, 1971, l8l-216.
Dirichlet, P. G. L. : Wnke, 2 vols. Georg Reimer, 1889-97, Chelsea (reprint),
1969.
Du Bois-Reymond, Paul : Zu;ei Abhandlungen iiber unzndliche und tigononutrisclu
Reihen (1871 and 1874), Ostwald's Klassiker ll85; Wilhelm Engelmann,
1913.
Freudenthal, H.: " Did Cauchy Plagiarize Bolzano?," Archiae for History of Exact
7, 197 l, 37 5-92.
Sciznres,
Gibson, G. A.: " On the History of Fourier Series," Pror. Edinburgh Math. Soc., ll,
1892/93, 137-66.
Grattan-Guinness, f.: "Bolzano, Cauchy and the 'New Analysis' ofthe Nineteenth
Century," for Histnry of Exact Scienees, 6, 1970,372-+00,
Archiae
Tlu Deuelopment of the Foundations of Mathematical Analgsis Jrom Eabr a
Riemann, Massachusetts Institute of Technology Press, 1970.
Hawkins, Thomas \N.,Jr.: Lebesgue's Theory of Integration: Its Origins aad Dcoclopmcnt,
University of Wisconsin Press, 1970, Chaps. l-3.
978 THE rNsrrLLATroN oF RrGoR rN ANALYSIS
Manhcim, Jerome H.: fhc Gctusis of Point Set Topology, Macmillao, 1964, Chaps.
14.
Pcsin, Ivan N,z Classical and Modaa Inkgration Tluorits, Academic Press, 1970,
Chap. 1.
Pria$hcim, A. : " Irrationalzahlen und Konvergenz unendlichen Prozxe," Eruyk.
da Math. Wiss.,lL3, 47-l+7, B. G. Teubner, 1898-1904.
Reiff, R. : Gcschirhrc d.a urundlirhzn Rzihcrr, H. Lauppsche Buchhandlung, I 889 ;
Martin Siindig (reprint), 1969.
Ricmann, Berahard: Gesammcltt ndlumatisclu Wcrlu,2nd'. ed. (1902), Dover
(reprint), 1953.
Schl.<inger, L.: "Uber Gauss'Arbeiten zur Funktionenlehre," Nachrichtzn Kdnig.
Gas. dcr Wiss. zu Gott.,l9l2, Beiheft, l-43. Also in Gauss's Wcrke, l0z, 77 tr.
Schocofica, Arthur M. : ." Die Entwicklung der Lehre von den Punktmannig-
faltigLciten," Jahrcs. dcr Dcut. Math.-Vacin, 82, 1899, l-250.
Singh, A. N.: "The Thcory and Construction of Non-Differentiable Functions,"
in E. W. Hobson : Squcing thc Circh and Otlu Mowgraplu, Chelsea (reprint),
1953.
Smith, David E.z A Souu Book in Matlumatics, Dover (reprint), 1959, Vol. l,
286-91, Vol. 2,635-37.
Stolz, O.: "8. Bolzanoe Bcdeutung in der Geschichte der Infinitesimalrechnung,"
Math. Arm., t8, 1881, 255-79.
Wcicrrtrass, Kartlt Mztfunatinlu Werkc, 7 vols., Mayer und MiiLller, 1894-1927.
Young, Grace C. : " On Infinite Derivativa," Quart. Jour. of Math., 47 , 1916,
t27-7s.
4r
The Foundations of the Real and
Transfinite Numbers
l. Introduction
One of the most surprising facts in the history of mathematics is that the
logical foundation of the real number system was not erected until the late
nineteenth century. Up to that time not even the simplest properties of
positive and negative rational numbers and irrational numbers were logcally
established, nor were these numbers dehned. Even the logical foundation
of complex numbers had not been long in existence (Chap. 32, sec. l), and
that foundation presupposed the real number system. In view of the exten-
sive development of algebra and analysis, all of which utilized the real
numbers, the failure to consider the precise structure and properties of the
real numbers shows how illogically mathematics progresses. The intuitive
understanding of these numbers seemed adequate and mathematicians iryere
content to operate on this basis.
The rigorization of analysis forced the realization that the lack of
clarity in the number system itself had to be remedied. For example, Bol-
zano's proof (Chap. 40, sec. 2) that a continuous function that is negative
for x : a and positive for * : b is zero for some value of .r between a and b
floundered at a critical point because he lacked an adequate understanding
of the structure of the real number system. The closer study of limits also
showed the need to understand the real number system, for rational numberr
can have an irrational limit and conversely. Cauchy's inability to prove the
sufficiency of his criterion for the convergence of a sequence likewise resulted
from his lack of understanding of the structure of the number system. The
study of discontinuities of functions representable by Fourier series revealed
the same deficiency. It was Weierstrass who first pointed out that to establish
carefully the properties of continuous functions he needed the theory of the
arithmetic continuum.
979
9BO FOUNDATIONS OF REAL AND TRANSFINITE NUMBERS
t. Werke,8, 177.
2. Werke , 8, 201.
ALGEBRAIC AND TRANSCENDENTAL NUMBERS 98I
Up to 1844, the question of whether there were any transcendental
irrationals was open. In that year Liouvilles showed that any number of
the form
a, d6 a-
--:- -L ----L r ----:- r...
lo , lo2! , lo3! ,
l.-21
lclq'
.y
has no solutions : z and hence for y, <n. Then x
in integers p and q for p
is transcendentalif for a fixed M and for every positive integer p the inequality
has a solution ?lq.By showing that his irrationals satisfy this last criterion,
Liouville proved they were transcendental.
The next big step in the recognition ofspecific transcendental numbers
was Hermite's proof in l$73a that e is transcendental. After obtaining this
result, Hermite wrote to Carl Wilhelm Borchardt (1817-80), ..I do not
dare to attempt to show the transcendence of z. If others undertake it, no
one will be happier than I about their success, but believe me, my dear
friend, this cannot fail to cost them some effort."
That zr is transcendental had already been suspected by Legendre
(Chap. 25, sec. l). Ferdinand Lindemann (1852-1939) proved this in
3. Conp. Rend., 18, l8+4,910-ll, and Jour. de Math., (l), 16, 185t, 133-42.
4. Comp. Rend.,77, 1873, 18-24, ?4-79, 226-33, 285-93 = (Ewres,2, 150-81.
982 FoUNDATIoNS oF REAL AND TRANSFINITE NUMBERS
18825 by a method that does not differ essentially from Hermite's. Linde-
mann established that lf xy x2,. , ., x,t are distinct algebraic numbers, real
or complex, and pr, pr,. . . , p,L are algebraic numbers and not all zero, then
the sum
lrfr+pzfa+...+pnf"
cannot be 0. If we take z : 2, ?, : l, and.r, : 0, we see that r*1 cannot
be algebraic for an *, that is algebraic and nonzero. Since x1 can be I, , is
transccndental. Now it was known that rt" + I : 0; hence the number izr
cannot be algebraic. Then zr is not, because i is, and the product of two
algebraic numbers is algebraic. The proof that zr is transcendental disposed
of the last point in the famous construction problems of geometry, flor all
constructible numbers are algebraic.
One mystery about a fundamental constant remains- Euler's constant
(Chap. 20, sec. 4)
":,1,I(r
*|* +;-bc,)'
which is approximately 0.577216 and which plays an important role in
analysis, notably in the study of the gamma and zeta functions, is not known
to be rational or irrational.
That is, the formation of fundamental sequences of real numbers does not
create the need for still newer types of numbers that can serve as limits of
these fundamental sequences, because the already existing real numbers
suffice to provide the limits. In other words, from the standpoint of providing
limits lor fundamental sequences (or what amounts to the same thing,
sequences which satisfy Cauchy's criterion ofconvergence), the real numbers
are a complete system.
Dedekind's theory of irrational numbers, presented in his book ol lB72
mentioned above, stems from ideas he had in lB5B. At that time he had to
give lectures on the calculus and realized that the real number system had
no logical foundation. To prove that a monotonically increasing quantity
that is bounded approaches a limit he, like the other authors, had to resort to
geometrical evidence. (He says that this is still the way to do it in the first
treatment of the calculus, especially if one does not wish to lose much time.)
Moreover, many basic arithmetic theorems were not proven. He gives as
an example the fact that {2.\/3 : \/6 had not yet been strictly demon-
strated.
He then states that he presupposes the development of the rational
numbers, which he discusses briefly. To approach the irrational numbers
he asks first what is meant by geometrical continuity. Contemporary and
earlier thinkers-for example, Bolzano-believed that continuity meant the
existence ol at least one other number between any two, the property now
known as denseness. But the rational numbers in themselves form a dense
set. Hence denseness is not continuity.
Dedekind obtained the suggestion for the definition of an irrational
number by noting that in every division of the line into two classes of points
such that every point in one class is to the left of each point in the second,
there is one and onlg one point that produces the division. This lact makes the
line continuous. For the line it is an axiom. He carried this idea over to
the number system. Let us consider, Dedekind says, any division of the
rational numbers into two classes such that any number in the first class is
less than any number in the second. Such a division ofthe rational numbers
he calls a cut. Ifthe classes are denoted by l, and Ar, then the cut is denoted
by (Ar, Ar). From some cuts, namely, those determined by a rational
number, there is either a largest number in.4, or a smallest number in lr.
Conversely every cut in the rationals in which there is a largest number in
the first class or a smallest in the second is determined by a rational number.
But there are cuts that are not determined by rational numbers. If we
put into the first class all negative rational numbers and all positive ones
whose squares are less than 2, and put into the second class all the other
rationals, then this cut is not determined by a rational number. To each
such cut "we create a new irrational member a which is fully defined by
this cut; we will say that the number a corresPonds to this cut or that it
986 FOUNDATIONS OF REAL AND TRANSFINITE NUMBERS
brings about this cut." Hence there corresponds to each cut one and only
one either rational or irrational number.
Dedekind's language in introducing irrational numbers leaves a little
to be desired. He introduces the irrational a as corresponding to the cut and
defined by the cut. But he is not too clear about where a comes from. He
should say that the irrational number cr is no more than the cut. In fact
Heinrich Weber told Dedekind this, and in a letter of IBBB Dedekind replied
that the irrational number a is not the cut itself but is something distinct,
which corresponds to the cut and which brings about the cut. Likewise,
while the rational numbers generate cuts, they are not the same as the cuts.
He says we have the mental power to create such concePts'
He then defines when one cut (Ar, Ar) is less than or greater than
another cut (Br, Br). After having defined inequality, he points out that the
real numbers possess three provable properties: (l) If c > p and p > y,
then a > y. (2) lf d and y are two different real numbers, then there is an
inflnite number of different numbers which lie between a and y. (3) If cr is
any real number, then the real numbers are divided into two classes l, and
Ar, each of which contains an infinite number of members, and each member
of ,4, is less than a and each member of l, is greater than c. The number a
itself can be assigned to either class. The class of real numbers now Possesses
conlirutily, which he expresses thus: If the set of all real numbers is divided
into two classes l, and A2 such that each member of l, is less than all
members of 12, then there exists one and only one number o which brings
about this division.
He defines next the operations with real numbers. Addition ol the
cuts (,41, Ar) and, (Br, Br) is defined thus: If c is any rational number, then
we put it in the class Cr if there is a number arin A, and a number b, in B,
such that ar + bL > c. All other rational numbers are put in the class C'r.
This pair of classes C, and C, forms a cut (Cr, C2) because every member ol
C, is less than every member of Cr. The cut (Cr, C'r) is the sum of (,4r, Ar) and
(Br, Br). The other operations, he says, are defined analogously. He can
now establish properties such as the associative and commutative properties
of addition and multiplication. Though Dedekind's theory of irrational
numbers, with minor modifications such as the one indicated above, is
logically satisfactory, Cantor criticized it because cuts do not appear
naturally in analysis.
There are other approaches to the theory of irrational numbers beyond
those already mentioned or described. For example, Wallis in 1696 had
identified rational numbers and periodic decimal numbers. Otto Stolz
(1842-1905), in his Vorlesungen ilber allgemeine Arithmetik,ta showed that every
irrational number can be represented as a nonperiodic decimal and tl.ris
fact can be used as a defining Property.
14. 1886, l, 109-19.
TIIE THEORY OF RATIONAL NUMBERII 987
integers, a few believed that the whole numbers were so fundamental that
no logical analysis of them could be made. This position was taken by
Kronecker, who was motivated by philosophical considerations into which
we shall enter more deeply later. Kronecker too wished to arithmetize
analysis, that is, to found analysis on the integers; but he thought one could
not go beyond recognition of knowledge of the integers. Ol these man has a
fundamental intuition. " God made the integers," he said, " all else is the
work of man."
Dedekind gave a theory of the integers in his Was sind und was sollen die
Zahlen.L6 Though published in IBBB, the work dates from 1872 to 1878. He
used set-theoretic ideas, which by this time Cantor had already advanced,
and which were to assume great importance. Nevertheless, Dedekind's
approach was so complicated that it was not accorded much attention.
The approach to the integers that best suited the axiomatizing proclivi-
ties of the late nineteenth century was to introduce them entirely by a set
of axioms. Using results obtained by Dedekind in the above-mentioned
work, Giuseppe Peano (lB5B-1932) first accomplished this in his Arithmetices
Principia Nooa Methodo Exposita (lBBg).1? Since Peano's approach is very
widely used we shall review it.
Peano used a great deal of symbolism because he wished to sharpen the
reasoning. Thus e means to belong tol = mst" implies; tr[6 means the class
ofnatural numbers; and a* denotes the next natural number after a. Peano
used this symbolism in his presentatron of all of mathematics, notably in his
Formulario mathematieo (5 vols., 1895-1908). He used it also in his lectures,
and the students rebelled. He tried to satisfy them by passing all of them,
but that did not work, and he was obliged to resign his professorship at the
University of Turin.
Though Peano's work influenced the further development ol symbolic
logic and the later movement of Frege and Russell to build mathematics
on logic, his work must be distinguished from that ol Frege and Russell.
Peano did zal wish to build mathematics on logic. To him, logic was the
servant of mathematics.
Peano started with the undefined concepts (cf. Chap.42, sec. 2) of
"set," "natural numbersr" "successorr" and " belong to." His five axioms
for the natural numbers are:
(1) I is a natural number.
(2) I is not the successor ofany other natural number.
(3) Each natural number a has a successor.
(4) Ilthe successors ofa and D are equal then so are a and. b.
16. The Nature and Meaning of Numbers : Werkc, 3, 335-91'
17. Opere scelte, 2, 20-55, and Riaista di Matematica, l, 1891, 87-102, 256-57 : Oiere
scelte, 3, 8V109.
THE THEORY OF RATIONAL NUMBERS g8g
* | : a*
a
a+(b+):(a+b)+.
Likewise, multiplication was defined by the statement that to each pair of
natural numbers a and, b there is a unique product such that
a'l : a
a.b*:(a.b)+a.
He then established all the familiar properties of natural numbers.
From the natural numbers and their properties it is straightforward to
define and establish the properties of the negative whole numbers and the
rational numbers. One can first define the positive and negative integers
as a new class of numbers, each an ordered pair of natural numbers. Thus
(a, 6) where a and b are natural numbers is an integer. The intuitive meaning
of (a, b) is a - b. Hence when a > b, the couple represents the usual positive
integer, and when a < b, the couple represents the usual negative integer.
Suitable definitions of the operations of addition and multiplication lead
to the usual properties of the positive and negative integers.
Given the integers, one introduces the rational numbers as ordered
couples of integers. Thus ifl and B are integers, the ordered couple (1, B)
is a rational number. Intuitively (A, B) is AlB. Again suitable definitions of
the operations of addition and multiplication of the couples lead to the
usual properties of the rational numbers.
Thus, once the logical approach to the natural numbers was attained,
the problem of building up the foundations of the real number system was
completed. As we have already noted, generally the men who worked on
the theory ofirrationals assumed that the rational numbers were so thoroughly
understood that they could be taken for granted, or made only a minor
gesture toward clarifying them. After Hamilton had grounded the complex
numbers on the real numbers, and alter the irrationals were defined in
terms of the rational numbers, the logic of this last class was finally created.
The historical order was essentially the reverse of the logical order required
to build up the complex number system.
99O FOUNDATTONS OF REAL AND TRANSFINITE NUMBERS
I. Axiotns of Conncction
Ir. From the number a and the number 6 there arises through addition a
definite number c; in symbols
a*b:c or c:a*b.
Ir. Ifa and D are given numbers, then there exists one and only one number
* and also one and only one number y so that
a+x:b and yta:b.
Is. There is a definite number, denoted by 0, so that for each a
a*0:a and 01-a:a.
I.. From the number a and the number 6 there arises by another method,
by multiplication, a definite number e; in symbols
ab:c or c:ab.
I". If a and D are arbitrary given numbers and 4 not 0, then there exists
one and only one number ,, and also one and only one number y such
that
att: b and ga:b.
16. There exists a definite number, denoted by l, such that for each a we
have
a.l : a and l.a: a.
18. Jahres. der Deut. Math.-Verein-,8,1899, 180-84; this article is notin Hilbert's Gesammelte
Abhandlungen. It is in his Grundlagen der Geometriz, Tth ed., Appendix 6.
OTHER APPROACIIES TO THE REAL NUMBER SYSTEM 99r
lI. Axioms of Calculation
llr.at(b+c):(a+b)+c.
llr.atb:b+a.
Il". a(bc) : (ab)c.
lln. a(b * c) : ab + ac.
II.. (a * b)c : ac + bc.
Il5. ab : ba.
lll. Axiom.c of Order
IIIr. Il a and D are any two different numbers, then one of these is always
greater than the other; the latter is said to be smaller; in symbols
a>b and b<a.
IlI2. If a > D and b > c then a > c.
IIIs. Ifa > 6 then it is always true that
a+c>b+c and c*a>c*b.
llln. lf a> b and c > 0 then ac > bcandca > cb.
lY . Axioms of Continuity
IVr. (Axiom of Archimedes) lf a > 0 and D > 0 are two arbitrary numbers
then it is always possible to add a to itself a sufficient number of times
so that
a+a+...*a>b.
IVr. (Axiom of Completeness) It is not possible to adjoin to the system of
numbers any collection of things so that in the combined collection the
preceding axioms are satisfied; that is, briefly put, the numbers form a
system of objects which cannot be enlarged with the preceding axioms
continuing to hold.
Hilbert points out that these axioms are not independent; some can
be deduced from the others. He then affirms that the objections against the
existence olinfinite sets (sec. 6) are not valid for the above conception ofthe
real numbers. For, he says, we do not have to think about the collection of
all possible laws in accordance with which the elements of a fundamental
sequence (Cantor's sequences of rational numbers) can be formed. We have
but to consider a closed system of axioms and conclusions that can be
deduced from them by a finite number of logical steps. He does point out
that it is necessary to prove the consistency ol this set of axioms, but when
this is done the objects defined by it, the real numbers, exist in the mathema-
tical sense. Hilbert was not aware at this time of the difficulty of proving
the consistency of axioms for real numbers.
992 FOUNDATTONS OF REAL AND TRANSFTNTTE NUMBERS
Despite attacks such as this, the completion of the work on the real
numbers seemed to mathematicians to resolve all the logical problems the
subject had faced. Arithmetic, algebra, and analysis were by far the most
extensive part of mathematics and this part was now securely grounded.
Figure 4l.l
Cantor proved that the set of rational numbers is enumerable. He gave one
proof in 1874.23 However, his second proof24 is the one now most widely
used and we shall describe it.
The rational numbers are arranged thus :
1234
I I lT
t2 3
,2 ,
\/
t2
55
I
(4
It will be noted that all those in any one diagonal have the same sum of
numerator and denominator. Now one starts with l/l and follows the arrows
assigning the number I to l/1, 2 to 211,3 to ll2, 4 to 113, and so on. Every
rational number will be reached at some stage and to each one a finite
integer will be assigned. Hence the above set of rational numbers (in which
some appear many times) is in one-to-one correspondence with the integers.
Then if duplicates are eliminated the set of rational numbers will still be
infinite and necessarily enumerable since this is the smallest infinite set.
Still more surprising is Cantor's proof in the lB74 paper just referred
to that the set of all algebraic numbers, that is, the set ol all numbers that
are solutions of all algebraic equations
asxn I alxn-L + "'+ an : 0,
height 2 are labeled from z1 + 1 to n2; and so forth. Because each algebraic
number will be reached and be assigned to one and only one integer, the set
of algebraic numbers is enumerable.
In his correspondence with Dedekind in 1873, Cantor posed the
question of whether the set of real numbers can be put into one-to-one
correspondence with the integers, and some weeks later he answered in the
negative. He gave two proofs. The first (in the lB74 article just referred to)
is more complicated than the second,2s which is the one most often used
today. It also has the advantage, as Cantor pointed out, ofbeing independent
of technical considerations about irrational numbers.
Cantor's second proof that the real numbers are uncountable (non-
enumerable) begins by assuming that the real numbers between 0 and I are
countable (enumerable) . Let us write each as a decimal and let us agree
that a number such as l/2 will be written as .4999. . .. If these real numbers
are countable, then we can assign each one to an integer z, thus:
I +-+ 0. a, a.s,z atz .. .
Now let us define a real number between 0 and I thus: Let b : 0.btb2h. . .,
where 6" : 9 if atr: I and b* : I il axx * l. This real number differs
from any of those in the above correspondence. However, this was supposed
to contain all the real numbers between 0 and l. Hence there is a contra-
diction.
Since the real numbers are uncountable and the algebraic numbers
are countable, there must be transcendental irrationals. This is Cantor's
nonconstructive existence proof, which should be compared with Liouville's
actual construction of transcendental irrationals (sec. 2).
In 1874 Cantor occupied himself with the equivalence of the points of
a line and the points of -Rn (z-dimensional space) and sought to prove that
a one-to-one correspondence between these two sets was impossible. Three
years later he proved that there is such a correspondence. ffe wrote to
Dedekind,26 " I see it but I do not believe it."
The idea2? used to set up this one-to-one correspondence can be
exhibited readily if we set up such a correspondence between the points
of the unit square and the points of the segment (0, I ) . Let (.r, y) be a point
of the unit square and z a point of the unit interval. Let r and y be repre-
sented by infinite decimals so that in a finite decimal terminating in zero,
we replace the 0 by an infinite sequence of 9's. We now break up ,r and g
into groups of decimals, each group ending with the first nonzero digit
encountered. Thus
x:.300203046"'
9: '01 6 07 8 09 ".'
Form
z:.30l 00260307MB 609 "'
by choosing as the grouPs in z the first grouP from x, then the first from y,
and so forth. If two r's or y's differ in some digit then the corresponding z's
will differ. Hence to each (r, y) there is a unique z. Given a z, one breaks
up its decimal representation into the groups just described and forms the
x and y by reversing the above Process. Again two different z's will yield
two diflerent pairs (x, y) so that to each z there is a unique (x, y). The
one-to-one correspondence just described is not continuous; roughly stated,
this means that neighboring z-points do not necessarily go into neighboring
(.r, y)-points, nor conversely.
Du Bois-Reymond objected to this proof.28 "It appears repugnant to
common sense. The fact is that this is simply the conclusion of a type of
reasoning which allows the intervention of idealistic fictions, wherein one
lets them play the role of genuine quantities even though they are not even
limis of representations of quantities. This is where the paradox resides."
28. Page 167, French ed. (1887) of his Dic allgemeine Funktionentheorie (1882).
29. Math. Ann., 46, 1895, 481-512, and 49, 1897, 207-46 : Ges. Abh., 282-351; an
English translation of these two papers may be found in Georg Cantor, Coilibutions lo lhe
Founding ofa Theory of Transjnile Numbers, Dover (rcprint), no date.
30. 1883, Ges. Abh., 165.
TRANSFTNTTE CARDTNALS AND ORDTNALS 999
that would be a contradiction of the very definition of4. On the other hand
if a did not contain z, it should because z1 was by definition the set ol all
m's which were not contained in the corresponding subsets of N. Hence the
assumption that there is a one-to-one correspondence between the elements
m of M and of N, which consists of all the subsets of M, leads to a contra-
diction. Then the cardinal number of the set consisting of all the subsets of
a given set is larger than that of the given set.
Cantor defined the sum of two cardinal numbers as the cardinal number
of the set that is the union ofthe (disjoined) sets represented by the summands.
Cantor also defined the product of any two cardinal numbers. Given two
cardinal numbers a and B, one takes a set M represented by c and a set -lf
represented by B and forms the pairs of elements (m, n) where m is any
element of M andn of iy'. Then the cardinal number of the set of all possible
pairs is the product ofc and P.
Powers of cardinal numbers are also defined. If we have a set M of m
objects and a set .l[ of z objects, Cantor defines the set m', and this amounts
to the set of permutations o[ zn objects n at a tirrre with repetitions of the m
objecs permitted. Thus if m : 3 and ra : 2 and we have mv m2, ms, the
permutations are .
fltflt frzflz filsflt
the cardinal number of the set of numbers in the set. The ordinal number
of the set of positive integers in their natural order is denoted by ar. On the
other hand, the set of positive integers in decreasing order, that is
...r4r3r2, I
is denoted by *<..r. The set of positive and negative integers and zero in the
usual order has the ordinal number *a * a.
Then Cantor defines the addition and multiplication of ordinal numbers.
The sum of two ordinal numbers is the ordinal number of the first ordered
set plus the ordinal number of the second ordered set taken in that specific
order. Thus the set ofpositive integers followed by the first five integers, that
is,
1,2,3,.'-,1,2,3,4,5,
has the ordinal number ar * 5. AIso the equality and inequality of ordinal
numbers is defined in a rather obvious way.
And now he introduces the full set of transfinite ordinals, partly for
their own value and partly to define precisely higher transfinite cardinal
numbers. To introduce these new ordinals he restricts the simply ordered
sets to well-ordered sets.3z A set is well-ordered if it has a first element in
the ordering and if every subset has a first element. There is a hierarchy of
ordinal numbers and cardinal numbers. In the first class, denoted by 21,
are the finite ordinals
1,2,3,..,.
In the second class, denoted by Zr, are the ordinals
Each of these ordinals is the ordinal of a set whose cardinal number is No.
The set of ordinals in Z, has a cardinal number. The set is not enumer-
able and so Cantor introduces a new cardinal number Nr as the cardinal
number of the set Zr. N, is then shown to be the next cardinal after No.
The ordinals of the third class, denoted by Zs, arc
Now Cantor had shown that given any set' it is always possible to create
a new set, the set ofsubsets ofthe given set, whose cardinal
number is larger
than that of the given set. If No is the given set, then the cardinal number
of the set of subsits is 2xo. Cantor had proved that 2xo
: c, where c is the
cardinal number ol the continuum. On the other hand he introduced N'
through the ordinal numbers and proved that N, is the next cardinal after
No. Ii"rrc" Nr < c, but the question as to whether Nr: c' known as the
hypothesis, Cantor, despite arduous efforts, could not answer'
"Jrr,it,rrr-
Inalistofo,rtst"rrdirrgproblemsPresentedatthelnternationalCongressof
Mathematicians in 1900, Hilbert included this (Chap' 43, sec' 5; see
also
Chap. 51, sec.8).
'Forgerr.ralsets.7l1and,lfitispossiblet}.atMcannotbeputintoone-to-
orre co.rJpondence with any subset of 'trf and -1y' cannot be put into one-to-one
with a subset of M' In this case' though M and N have
"o*oporrd.t "e :
cardinal numbers c and B, say, it is not Possible to say that F o' " < F'
numbers are not comparable' For well-
ot a > p.That is, the two cardinal
ordered sets cantor was able to prove that this situation cannot arise. It
seemed paradoxical that there should be non-well-ordered sets whose
could
cardinal numbers cannot be compared' But this problem, too' Cantor
not solve.
Ernst Zermelo (lB7l-1953) took up the problem ofwhat to do about
the
comparison of the cardinal numbers of sets that are not well-ordered, In
and in l9083a gave a second proof, that every set can be
190+g" he proved,
well-orderid (in some rearrangement) ' To make the proof he had to use
what is now known as the axiom of choice (Zermelo's axiom)' which states
that given any collection of nonempty, disjoined sets, it is possible to choose
3rrstJt . from each set and so make up a new set' The axiom of
-.-b..
""hoi"",
the well-ordering theorem, and the fact that any two sets may be
p' either
compared as to size (that is, il their cardinal numbers are c and
a :- F, @ < B, or a > p) are equivalent principles'
Bibliography
Becker, Oskar: Grundlagen dcr Malhemalik in geschichtlicher Entuicklung, Verlag Karl
Alber, 1954, pp. 217-316.
Boyer, Carl B.: A History oJ Mathematics, John Wiley and Sons, 1968, Chap. 25.
Cantor, Georg: Gesammeltc Abhandlungcn, 1932, Georg Olms (reprint), 1962.
......'........._ : Contributions lo tlu Founding of the Theory of Transfnite Numbers
, Dover
(reprint), no date. This contains an English translation of Cantor's two
key papers of 1895 and 1897 and a very helpful introduction by P. E. B.
Jourdain.
Cavailles, Jean: Philosophie malhimalique, Hermann, 1962. Also contains the
Cantor-Dedekind correspondence translated into French.
Dedckind, R.: Essays on thc Theory of Numbers, Dover (reprint), 1963. Contains an
English translation of Dedekind's " Stetigkeit und irrationale Zahlen" and.
" Was sind und was sollen die Zahlen." Both essays are in Dedekind's
We*e, 3, 31't-34 and 335-91.
Fraenlel, Abraham A.: " Georg Cantor," Jahres. der Deut. Math.-Verern., 39, 1930,
189-266. A historical account of Cantor's work.
Helmholtz, Hermann von:' Counting and, Measuring, D. Van Nostrand, 1930. English
translation of Helmholtz's Ztihlen und ll[essen, Wissenschaftliche Abhandlungen,
3,356-91.
Ivlanheim, Jerome H. : The Genesis of Point Set Topologg, Macmillan, 1964, pp.
76-l10.
Meschkowski, Herbert: Wags of Thought of Great Mathematicians, Hold'en-Day,
1964, pp. 9l-104.
Eaolution of Mathematical Thoughl, Holden-Day, 1965, Chaps. '1-5.
Problcmc des Unendliclun: lil'erk und Leben Georg Cantors, F. Vieweg und
.....................-:
Sohn, 1967.
Noether, E. and J. Cavaillts: Briefuechsel Canlor-Dedekind, Hermann, 1937.
Peano G. : Opere sccltc, 3 vols., Edizioni Cremonese, 1957-59.
Schoenflies, Arthur M. : Die Entwickelung der Mengenlehre und ihrc Anwendungen, two
pars, B. G. Teubner, 1908, 1913.
Smith, David Eugene: A Source Book in Mathematics, Dover (reprint), 1959, Vol. I'
pP. 35-45,99-106.
Stammler, Gerhard: Der Zahlbegrif seit Gauss, Georg Olms, 1965.
42
The Foundations of Geometry
Geometry is nothing if it be not rigorous . . . . The methods
of Euclid are, by almost universal consent, unexceptional in
point of rigor. n. l. s. surrH (1873)
roo5
r006 TITE FOUNDATIONS OF GEOMETRY
Figurc 42.1 C
Figure 42.2 o
Then the triangles marked I are congruent, and OF : OE. The triangles
marked III are also congruent, and OB: OC. Consequently the triangles
marked II are congruent, and FB : EC. From the triangles marked I we
have AF : lE. Then AB : AC and the triangle is isosceles.
One might question the position of the point O, and indeed one can
show that it must lie outside the triangle on the circumscribed circle. If,
however, one draws Figure 42.2, it can again be " proved " that triangle
IBC is isosceles. The flaw is that of the two points E and F one must lie
inside and the other outside of the respective sides of the triangle. But this
means that we must be able to determine the correct position of F with
respect to A and B and .E'with respect to A and C before starting the proof.
Of course one should not rely upon drawing a correct figure to determine
the locations of E and 4 but this was precisely what Euclid and the mathe-
maticians up to 1800 did. Euclidean geometry was supposed to have oflered
accurate proofs of theorems suggested intuitively by figures, but actually it
offered intuitive proofs of accurately drawn figures.
Though criticisms of the logical structure of Euclid's Elcmmts were
launched almost from the time it was written, they were not widely known
or the defects were regarded as minor. The Elements was generally taken to
be the model of rigor. However, the work on non-Euclidean geometry made
mathematicians aware of the full extent of the deficiencies in Euclid's
structure, for in carrying out proofs they had to be especially critical of
what they were accepting. The recognition of the many deficiencies finally
obliged the mathematicians to undertake the reconstruction of the founda-
tions of Euclidean geometry and of other geometries that contained the same
weaknesses. This activity became extensive in the last third ofthe nineteenth
century,
precisely when this becomes necessary that a gap occurs in the deduction
and (when it is not possible to supply the deficiency by modifying the
reasoning) we are forced to admit the inadequacy of the propositions
invoked as the means of Proof.
Pasch did believe that the concepts and axioms should bear on experience,
but logically this was irrelevant.
In his Vorlesungen Pasch gave axioms for projective geometry, but
many of the axioms or their analogues were equally important for the
axiomatization of Euclidean and the non-Euclidean geometries when built
up as independent subjects. Thus he was first to give a set of axioms for the
order of points on a line (or the concept of betweenness). Such axioms must
also be incorporated in a complete set for any one ol the metric geometries.
We shhll see below what the order axioms amount to.
His method of building projective geometry was to add the point, line,
and plane at infinity to the proper points, Iines, and planes. Then he intro-
duced coordinates (on a geometric basis), using the throw construction of
von Staudt and Klein (Chap. 35, sec. 3), and finally the algebraic rePresen-
tation of projective transformations. The non-Euclidean and Euclidean
geometries were introduced as special cases on a geometric basis by distin-
guishing the proper and improper lines and Points A la Felix Klein.
A more satislactory approach to projective geomery was given by
Peano.s This was followed by the work of Mario Pieri (186G-1904)' "I
Principii della geometria di posizione";6 Federigo Enriques (l87l-1946,
Lezioni di geometria proiettiaa, l89B) ; Eliakim Hastings Moore (1862-1932);7
Friedrich H. Schirr (1856-1932) ;8 Alfred North Whitehead (1861-1947,
The Axioms of Projedioe Geometry); e and Oswald Veblen ( 1880-1960) and
John W. Young (1879-1932).10 The last two men gave a completely inde-
pendent set. The classic text, Veblen and Young's Projcctiae Geometry (2 vols.,
l9l0 and l91B), carries out Klein's organization of geometry by starting
with projective geometry on a strict axiomatic basis and then specializing
this geometry by choosing different absolute quadrics (Chap. 38, sec. 3)
to obtain Euclidean and the several non-Euclidean geometries. Their
axioms are general enough to include geometries with a finite number of
points, geometries with only rational points, and geometries with complex
points.
One more point about many ol the axiomatic systems for projective
ll. Memorie della Reqle Accademia delle Scienze di Torino, (2), 49, 1899, 173-222.
FOUNDATIONS OF EUCLIDEAN GEOMETRY IOII
l. Axioms of Connection
Ir. To each two points A and B there is one line a which lies on z4 and B.
Ir. To each two points A and B there is not more than one line which lies
on A and B.
I.. On each line there are at least two Points. There exist at least three
points which do not lie on one line.
Io. To any three points A, B, arrd C which do not lie on one line there is a
plane c which lies on [contains] these three points. On each plane there
is [at least] one point.
Iu. To any three points A, B, and C not on one line there is not more than
one plane containing these three Points.
I.. If two points of a line lie on a plane a then every point of the line lies
on c.
Ir. If two planes c and p have a point ,4 in common, then they have at
least one more point .B in common.
I.. There are at least four points not lying on the same plane.
The second group o[ axioms supplies the most serious omission in
Euclid's set, namely, axioms about the relative order of points and lines:
IIn. (Pasch's Axiom) Let A, B, and C be the three points not on one line
and let a be any line in the plane of A, B, and C but which does not
go through (lie on) l, B, or C. If a goes through a point of the segment
IOI2 THE FOUNDATIONS OF GEOMETRY
Let a be a line and ,4 a point not on a. Then in the plane of a and, A there
exists at most one line through I which does not meet d.
The existence of at least one line through,4 which does not meet d can
be proved and hence is not needed in this axiom.
Y. Axioms of Continuity
Vr. (Axiom of Archimedes) lf AB andCD areany two segments, then there
exists on the line ,4.B a number of points Ar, Ar, .. . , ln such that the
FOUNDATTONS OF EUCLIDEAN GEOMETRY ror3
This axiom amounts to requiring enough points on the line so that the
points can be put into one-to-one correspondence with the real numbers.
Though this fact had been used consciously and unconsciously since the
days of coordinate geometry, the logical basis for it had not previously
been stated.
With these axioms Hilbert proved some of the basic theorems of Euclid-
ean geometry. Others completed the task of showing that all of Euclidean
geometry does follow from the axioms.
The arbitrary character of the axioms of Euclidean geometry, that is,
their independence of physical reality, brought to the fore another problem,
the consistency of this geometry. As long as Euclidean geometry was regarded
as the truth about physical space, any doubt about its consistency seemed
pointless. But the new understanding of the undefined concePts and axioms
required that the consistency be established. The problem was all the more
vital because the consistency of the non-Euclidean geometries had been
reduced to that of Euclidean geometry (Chap. 38, sec.4). Poincard brought
this matter up in 189812 and said that we could believe in the consistency of
an axiomatically grounded structure if we could give it an arithmetic
interpretation. Hilbert proceeded to show that Euclidean geometry is
consistent by supplying such an interpretation.
He identifies (in the case of plane geometry) point with the ordered
pair of real numbers 13 (a, b) and line with the ratio (u:a:u) in which z and
a are r,ot both 0. A point lies on a line if
ua+ob+&--O.
Congruence is interpreted algebraically by means of the expressions for
translation and rotation ol analytic geometry; that is, two figures are
congruent if one can be transformed into the other by translation, reflection
in the r-axis, and rotation.
After every concept has been interpreted arithmetically and it is clear
that the axioms are satisfied by the interpretation, Hilbert's argument is
that the theorems must also apply to the interpretation because they are
Iogical consequences ofthe axioms. Ifthere were a contradiction in Euclidean
geometry, then the same would hold of the algebraic formulation ofgeometry,
gave a set of axioms based on the undefined concepts of point and order'
Hc showed that each of his axioms is independent of the others, and he
also established another property, namely, categoricalness. This notion was
first clearly stated and used by Edward V. Huntington (1874-1952) in a
paper devoted to the real number system.15 (He called the notion sufficiency')
A set of axioms Pr, Pr,. . ., Pn connecting a set of undefined symbols
Sr, Sr,. . ,, S, is said to be categorical if between the elements of any two
assemblages, each of which contains undefined symbols and satisfies thc
14, Amcr. Moth. Soc. Trans.,5, 1904, 343-84.
15. Amcr, Malh. Soc. Trans.,3, 1902,26+-79.
SOME RELATED FOUNDATIONAL WORK ror5
axioms, it is possible to set up a one-to-one correspondence ofthe undefined
concepts which is preserved by the relationships asserted by the axioms;
that is, the two systems are isomorphic. In effect categoricalness means that
all interpretations ofthe axiom system differ only in language. This property
would not hold il, for example, the parallel axiom were omitted, because
then Euclidean and hyperbolic non-Euclidean geometry would be noniso-
morphic interpretations of the reduced set of axioms.
Categoricalness implies another property which Veblen called disjunc-
tive and which is now called completeness. A set of axioms is called complete
il it is impossible to add another axiom that is independent of the given
set and consistent with the given set (without introducing new primitive
concepts). Categoricalness implies completeness, for if a set I of axioms
were categorical but not complete it would be possible to introduce an
axiom S such that S and not-S are consistent with the set l. Then, since
the original set I is categorical, there would be isomorphic interpretations
of I together with S and I together with not-S. But this would be impossible
because the corr.esponding propositions in the two interpretations must
hold, whereas S would apply to one interpretation and not-.S to the other.
Figure 42.3 o
P, P.
t I, t 2 r 3 t 4 t
Pr Pa P! P. P5
P1 P,
Figure 42.4
Travel through each subsquare so that the path shown corresPonds to the
unit segment. Now divide the unit square into 16 subsquares numbered as
shown in Figure 42.5 and join the centers of the l6 subsquares as shown.
We continue the process of dividing each subsquare into fiour parts,
numbering them so that we can traverse the entire set by a continuous Path.
The desired curve is the limit of the successive polygonal curves formed at
each stage. Since the subsquares and the parts of the unit segment both
contract to a point as the subdivision continues, we can see intuitively that
each point on the unit segment maps into one point on the square' In fact,
if we fix on one point in the unit segment, saf t :2/3, then the image of
this point is the limit of the successive images of t -- 213 which appear in the
successive polygons.
These examples show that the definition of a curve Jordan suggested
is not satisfactory because a curve, according to this definition, can fill out
a square, The question of what is meant by a curve remained oPen. Felix
Klein remarked in lB9B30 that nothing was more obscure than the notion
ofa curve. This question was taken up by the topologists (Chap. 50, sec. 2).
l6 l3 t2 _lll
I f--
L_ __lt4
l5
9 _l t0
I-_
2 _-l3 L_ _l7
I
I-_
L_ __l
t + .5 6
Figure 42.5
L: lo fi + u'2\rt2 dt('
J"'
where y :;f(*), calls, at the very
least, for the existence of the derivative.
Hence, the concept no longer applies to the non-differentiable functions.
Various efforts to generalize the concept of length of curve were made by
Du Bois-Reymond, Peano, Ludwig Scheeffer (1859-85), and Jordan, using
either generalized integral definitions or geometric concepts' The most
general definition was formulated in terms of the notion ol measure, which
we shall examine in ChaPter 44.
A similar difficulty was noted for the concept of area of a surface. The
concept favored in the texts of the nineteenth century was to inscribe in
the surface a polyhedron with triangular faces. The limit of the sum ol the
areas of these triangles when the sides approach 0 was taken to be the area
of the surface. Analytically, if the surface is representedby
y : g(u, a), y: r!(u,a), 2 : y(u, u)
third an equilateral triangle of side s/3 and delete the base of each triangle.
There will be twelve such triangles. Then on the outside segments of the
resulting figure, construct a new equilateral triangle of side s/9. There will
be 48 of these triangles. The perimeters of the successive figures are 9s, l2s,
16.r, . . . and these perimeters become infinite. However, the area of the
limiting figure is finite. For, by the well-known formula for the area of an
equilateral triangle in terms of its side, namely, if the side isJ, the area is
y'5. The area of
1i'1+1 t4, then the area of the original triangle is t(3s)'z/al
the first three triangles added is 3' (s'z14)\/3. Since the side of the next
triangles added is ,/3 ..rd there are 12, these areas are l2(slq2\nft :
(s'z 13)\/3. The sum of the areas is
: tfr a 1_ffi
.( 9r,
o_z_v"r :E
(1tgs,:F _5",r,l5.
The Peano and Hilbert curves also raised the question of what we mean
by dimension. The square is two-dimensional in itself, but, as the continuous
image of a curve, should be one-dimensional' Moreover, Cantor had shown
that the points ola line segment can be put into one-to-one correspondence
with the points ofa square (Chap. 41, sec. 7). Though this correspondence
is not continuous from the line segment to the square or the other way, it
did show that dimension is not a matter of multiplicity of points. Nor is it a
matter of the number of coordinates needed to fix the position of a point, as
Riemann and Helmholtz had thought, because the Peano curve assigns a
unique (r, y) to each value of ,.
to22 THE FOUNDA'I]ONS OF GEOMETRY
Bibliograplry
Beckcr, Oskar: Grundlagm der Mattumatik in gcschiehtlichet Entwicklung, Karl Alber,
1954, 199_212.
Enriqucs, Federigo: " Prinzipicn der Geometric," Ercyk. der Math' Wiss', B' G'
Teubner, 1907-10, III ABl, l-129.
Hilbert, Davidz Gruwllagcn der Geomttrb, 7th ed., B. G. Teubner, 1930'
Pasch, M. and NI. Dehn: Vorhsungen iibcr aewrc Gcomctric,2nd ed', Julius Springer,
1926, l8!27t.
Pcano, Giuseppc t Opac scelte,3 vols., Edizioni Cremonese, 1957-59'
Reichardt, Hans: C. F. Gauss, Ltbcn und ll'crke, Haude und Spenersche, 1960,
I I l-50.
Schmidt, Arnold : " Zu Hilberts Grundlegung der Geometrie ," in Hilbert's
Gcsammclk Abhandlungen, 2' 404-14.
Singh, A. N. : " The Theory and Construction of Non-Differentiable Functions,"
in E. W. Hobson: Squaring lhe Citclc and Othn Monographs, Chelsea (reprint),
1953.
+3
Mathematics as of Igoo
t023
t024 MATHEMATICS AS OF I9OO
Despite the fact that geometry too had been rigorized, one consequence
of the rigorization movement was that number and analysis took precedence
o.r., g.o-.try. The recognition by mathematicians, during and after the
creation ofnon-Euclidean geometry, that they had unconsciously relied upon
intuition in accepting the prools of Euclidean geometry made them learful
that they would continue to do so in all geometric reasoning' Hence they
preferred a mathematics built upon number. Many favored going further
l. Comp. Rendu du Deuxiime Congris Internat. des Math., 1900, pub' 1902' pp' l2l-22'
2. Amir. Math. Soc. Bull.,ll, 1904/05,417; see also Chap. 40, sec' 7, and Chap' 41, secs' 5
and 9-
r026 MATHEMATICS AS OF I9OO
4. Abh. Malh. Seminar der Hamburgct (Jaia., 1,1922, 157-77 : Ccs, Abh.,3, 157-77.
5. Math. Ann.,78, 1918, 405-15 : Gcs, Abh.,3, 145-56'
r028 MATHEMATICS AS OF I9OO
geometry were sound as long as geometry was identified with the study of
physical space.
But gradually and unwittingly mathematicians began to introduce
concepts that had little or no direct physical meaning. Of these, negative
and complex numbers were the most troublesome. It was because these two
types of numbers had no "reality" in nature that they were still suspect at
the beginning olthe nineteenth century, even though freely utilized by then'
The geometrical representation of negative numbers as distances in one
direction on a line and of complex numbers as points or vectors in the com-
plex plane, which, as Gauss remarked of the latter, gave them intuitive
meaning and so madc them admissible, may have delayed the realization
that mathematics deals with man-made concepts. But then the introduction
of quaternions, non-Euclidean geometry, complex elements in geometry,
n-dimensional geometry, bizarre functions, and transfinite numbers forced
the recognition of the artificiality of mathematics.
In this connection the impact of non-Euclidean geometry has already
been noted (Chap. 36, sec. B) and the impact ol z-dimensional geometry
must now be observed. The concept aPpears innocuously in the analytical
work of d'Alembert, Euler, and Lagrange. D'Alembert suggested thinking
of time as a fourth dimension in his article " Dimension " in the Encyclopidie.
Lagrange, in studying the reduction of quadratic forms to standard forms,
casually introduces lorms in z variables. He, too, used time as a fourth
dimension in his fuIie anique analgtique ( I 7BB) and in his Thioric fus Jbnttions
analytiques (1797). He says in the latter work, "Thus we may regard mech-
anics as a geometry of four dimensions and analytic mechanics as an extension
ofanalytic geometry." Lagrange's work put the three sPatial coordinates and
the fourth one representing time on the same footing' Further, George Green
in his paper of l82B on potential theory did not hesitate to consider potel-
tial problems in n dimensions ; he says of the theory, " It is no longer confined,
as it was, to the three dimensions of space."
These early involvements in z dimensions were not intended as a study
of geometry proper. They were natural generalizations of analytical work
that was no longer tied to geometry' In part, this introduction of z-dimen-
sional language was intended only as a convenience and aid to analytical
thinking. It was helpful to think of (xr,xr,..., xn) as a point and of an
equation in z variables as a hypersurface in z-dimensional space, for by
thinking in terms of what these mean in three-dimensional geometry one
might secure some insight into the analytical work. In fact Cauchyz actually
emphasized that the concept of r-dimensional space is useful in many
analytical investigations, especially those of number theory.
However, the serious study of n-dimensional geometry, though not
However after about I 850, the view that mathematics can introduce
and deal with rather arbitrary concepts and theories that do not have im-
mediate physical interpretation but may nevetheless be useful, as in the case
olquaternions, or satisly a desire for generality, as in the case of a-dimensional
geometry, gained acceptance. Hankel in ltis Theorie der complexen Zahlen-
systeme (1867 , p. l0) defended mathematics as " purely intellectual, a pure
theory of forms, which has for its object, not the combination of quantities
or ol their images, the numbers, but things of thought to which there could
correspond effective objects or relations even though such a correspondence
is not necessary."
In delense of his creation of transfinite numbers as existing, real definite
quantities, Cantor claimed that mathematics is distinguished from other
fields by its freedom to create its own concepts without regard to transient
reality. He saids in 1883, " Mathematics is entirely free in its development
and its concepts are restricted only by the necessity ofbeing noncontradictory
and coordinated to concepts previously introduced by precise definitions. . . .
The essence of mathematics lies in its freedom." He preferred the term " free
mathematics" over the usual form, "pure mathematics."
The new view of mathematics extended to the older, physically grounded
branches. ln his [Jnioersal Algebra ( I B9B), Alfred North Whitehead says (p.
l1),
. . . Algebra does not depend on Arithmetic for the validity of its laws of
transformation. If there were such a dependence it is obvious that as soon
as algebraic expressions are arithmetically unintelligible all laws respecting
them must lose their validity. But the laws of Algebra, though suggested
by Arithmetic, do not depend on it. They depend entirely on the con-
ventions by which it is stated that certain modes ofgrouping the symbols
are to be considered as identical. This assigns certain properties to the
marks which form the symbols of Algebra.
4. The Loss d
Truth
The introduction and gradual acceptance ofconcepts that have no immediate
counterparts in the real world certainly forced the recognition that mathe-
matics is a human, somewhat arbitrary creation, rather than an idealization
o[the realities in nature, derived solely from nature. But accompanying this
recognition and indeed propelling its acceptance was a more profound
discovery-mathematics is not a body of truths about nature' The develop-
ment that raised the issue of truth was non-Euclidean geometry, though its
impact was delayed by the characteristic conservatism and closed-mindedness
of all but a few mathematicians. The philosopher David Hume (171l-76)
had already pointed out that nature did not conform to fixed patterns and
necessary laws; but the dominant view, expressed by Kant, was that the
properties of physical space were Euclidean. Even Legendre in his Eliments
de giomitrie of 179* still believed that the axioms of Euclid were sellevident
truths.
With respect to geometry, at least, the view that seems correct today was
first expressed by Gauss. Early in the nineteenth century he was convinced
that geometry was an empirical science and must be ranked with mechanics,
whereas arithmetic and analysis were a priori truths. Gauss wrote to Bessel
in 1830,10
However Gauss seems to have had conflicting views, because he also ex-
pressed the opinion that all o[ mathematics is man-made. In a letter to
Bessel of November 21, l8l l, in which he speaks of functions of a complex
variable he says,ll "One should never lorget that the functions, like all
mathematical constructions, are only our own creations, and that when the
definition with which one begins ceases to make sense, one should not ask:
What is it, but what is it convenient to assume in order that it remain
significant ? "
Despite Gauss's views on geometry, most mathematicians thought there
were basic truths in it. Bolyai thought that the absolute truths in geometry
were those axioms and theorems common to Euclidean and hyperbolic
geometry. He did not know elliptic geometr-y, and so in his time still did not
appreciate that many of these common axioms were not common to all
geometries.
In his lB54 paper "On the Hypotheses Which Underlie Geometry,"
Riemann still believed that there were some propositions about space that
were a priori, though these did not include the assertion that physical space
is truly Euclidean. It was, however, locally Euclidean.
Cayley and Klein remained attached to the reality of Euclidean
geometry (see also Chap. 38, sec. 6). In his presidential address to the
British Association for the Advancement of Science,l2 Cayley said, ". . . not
that the propositions of geometry are only approximately true, but that they
remain absolutely nue in regard to that Euclidean sPace which has so long
been regarded as being the physical space ofour experience." Though they
themselves had worked in non-Euclidean geometries, they regarded the
latter as novelties that result when new distance functions are introduced in
Euclidean geometry. They failed to see that non-Euclidean is as basic and
as applicable as Euclidean geometry.
In the 1890s Bertrand Russell took up the question of what properties
of space are necessary to and are presupposed by experience. That is, ex-
perience would be meaningless ifany ofthese a priori properties were denied.
In his .Esray on the Foundations of Geometry (1897), he agrees that Euclidean
geometry is not a priori knowledge. He concludes, rather, that projective
geometry is a priori for all geometry, an understandable conclusion in view
of the importance of that subject around 1900. He then adds to projective
geometry as a priori the axioms common to Euclidean and all the non-
Euclidean geometries. The latter facts, the homogeneity of space, finite
dimensionality, and a concept of distance make measurement Possible. The
facts that space is three-dimensional and that our actual space is Euclidean
he considers to be empirical.
That the metrical geometries can be derived from projective geometry
by the introduction of a metric Russell regards as a technical achievement
having no philosophical significance. Metrical geometry is a logically sub-
sequent and separate branch of mathematics and is not a priori. With respect
to Euclidean and the several basic non-Euclidean geometries, he departs from
Cayley and Klein and regards all these geometries as being on an equal
footing. Since the only metric spaces that possess the above properties are
the Euclidean, hyperbolic, and single and double elliptic, Russell concludes
that these are the only possible metrical geometries, and of course Euclidean
is the only physically applicable one. The others are of philosophical
import'ance in showing that there can be other geometries. With hindsight it
is now possible to see that Russell had replaced the Euclidean bias by a pro-
jective bias.
12. Report of the Bit. lssn. /or thc Adtt. ol Sci., 1883, 3-37 : Coll, Math. Palers' ll, 429-59.
ro34 MATHEMATICS AS OF I9OO
intermediate one and yet the ear might distinguish the original two. Here
things equal to the same thing are not equal to each other. One cannot add
the values of electrical resistance connected in parallel to obtain the total
resistance, nor can one combine in any way the indices ol refraction o[
different media.
By the end of the nineteenth century, the view that all the axioms of
mathematics are arbitrary prevailed. Axioms were merely to be the basis for
the deduction ofconsequences. Since the axioms were no longer truths about
the concepts involved in them, the physical meaning of these concePts no
longer mattered. This meaning could, at best, be a heuristic guide when the
axioms bore some relation to reality. Thus even the concepts were severed
from the physical world. By 1900 mathematics had broken away from reality;
it had clearly and irretrievably lost its claim to the truth about nature, and
had become the pursuit of necessary consequences of arbitrary axioms about
meaningless things.
The loss of truth and the seeming arbitrariness, the subjective nature of
mathematical ideas and results, deeply disturbed many men who considered
this a denigration of mathematics. Some therefore adopted a mystical view
that sought to grant some reality and objectivity to mathematics. These
mathematicians subscribed to the idea that mathematics is a reality in
itsell, an independent body oftruths, and that the objects of mathematics are
given to us as are the objects of the real world. Mathematicians merely
di."o,r.. the concepts and their ProPerties. Hermite, in a letter to Stieltjes,r6
said, "I believe that the numbers and functions of analysis are not the
arbitrary product ofour minds; I believe that they exist outside ofus with the
same character of necessity as the objects of objective reality; and we find or
discover them and study them as do the physicists, chemists and zoologists"'
Hilbert said at the International Congress in Bologna in l928,ro "How
would it be above all with the truth ofour knowledge and with the existence
and progress of science if there were no truth in mathematics ? Indeed in
professional writings and public lectures there often aPpears today a skepti-
cism and despondency about knowledge; this is a certain kind of occultism
which I regard as damaging."
Godfrey H. Hardy (1577-1947), an outstanding analyst of the twentieth
ccntury, said in 1928,1? " Mathematical theorems are true or false; their
truth or lalsity is absolutely independent of our knowledge of them. ln some
sense, mathematical truth is part of objective reality." He expressed the same
view in his book A fuIathematician' s Apology (1967 ed., p. 123): "I believe that
mathematical reality lies outside us, that our function is to discover or observe
coveries. This study offers not only the advantage ofa well-determined goal
but the advantage of excluding vague questions and useless calculations. It
is a means of building analysis itself and of discovering the ideas which matter
most and which science must always preserve. The fundamental ideas are
those which represent the natural happenings." He also stressed the applica-
tion ol mathematics to socially useful problems.
Though Jacobi had done first-class work in mechanics and astronomy,
he took issue with Fourier. l{e wrote to Legendre onJuly 2, 1830,t4 "1,;.
true that Fourier is of the opinion that the principal object of mathematics is
the public utility and the explanation ol natural phenomena; but a scientist
like him ought to know that the unique object ol science is the honor ofthe
human spirit and on this basis a question of [the theory of] numbers is worth
as much as a question about the planetary system. . .."
Throughout the century, as more men became disturbed by the drift
to pure mathematics, voices were raised in protest. Kronecker wrote to
Helmholtz, "The wealth of your practical experience with sane and interest-
ing problems will give to mathematics a new direction and a new impetus. , . .
One-sided and introspective mathematical speculation leads into sterile
fields."
Felix Klein, in his Mathematical Theorg oJ the Top, (1897, pp. 1-2) stated,
"It the great need of the present in mathematical science that the pure
is
science and those departments of physical science in which it finds its most
important applications should again be brought into the intimate association
which proved so fruitful in the works of Lagrange and Gauss." And Emile
Picard, speaking in the early part of this century (La Science modtme ct son
itat actuel, 1908), warned against the tendency to abstractions and pointless
problems.
Somewhat later, Felix Klein spoke out again.re Fearing that the freedom
to create arbitrary structures was being abused, he emphasized that arbitrary
structures are " the death of all science. The axioms of geometry are . . . not
arbitrary but sensible statements which are, in general, induced by space
perception and are determined as to their precise content by expediency."
To justify the non-Euclidean axioms Klein pointed out that visuali zation c-an
verify the Euclidean parallel axiom only within certain limits. On another
occasion he pointed out that " whoever has the privilege of freedom should
also bear responsibility." By " responsibility " Klein meant service in the
investigation of nature.
Despite the warnings, the trend to abstractions, to generalization of
existing results for the sake of generalization, and the pursuit of arbitrarily
chosen problems co'niinued. The reasonable need to study an entire class of
18. Ges. Werke, l, +5+-55.
19. Elementary Mathematics from an Aduanccd Standpoint, Macmillan, 1939; Dover (reprint),
1945, Vol. 2, p. 187.
IO38 MATTTEMATICS AS OF I9OO
7. A Glance Ahead
The pace of mathematical creation has expanded steadily since 1600, and
this is certainly true of the twentieth century. Most of the fields pursued in
the nineteenth century were further developed in the twentieth. However,
the details of the newer work in these fields would be of interest only to
specialists. We shall therefore limit our account of twentieth-century work
to those fields that first became prominent in this period. Moreover, we shall
consider only the beginnings ofthose fields. Developments ofthe second and
third quarters of this century are too recent to be properly evaluated. We
have noted many areas pursued vigorously and enthusiastically in the past,
which were taken by their advocates to be the essence of mathematics, but
which proved to be passing fancies or to have little consequential impact on
the course of mathematics. However confident mathematicians of the last
half-century may be that their work is of the utmost importance, the place
of their contributions in the history of mathematics cannot be decided at the
present time.
Biblio.qraftut
Fang, J. : Hilberl, Paideia Press, 1970. Sketches of Hilbert's mathematical work.
Hardy, G. H.: A Mathematician's Apology, Cambridge Univenity Press, 1940 and
1967.
Helmholtz, H. von: Counting and Measuring, D. Van Nostrand, 1930. Translation
of Zahlen und Messen : Wisscnschqftlichc Abhandlungcn,3, 356-91.
" Uber den Ursprung Sinn und Bedeutung der geometrischen Siitze";
English translation : " On the Origin and Significance of Geometrical
--: Axioms," in Helmholtz: Popalar Seicnlifu Lccturcs, Dovet (reprint), 1962, pp.
223-49. Also in James R. Newman: The World oJ Matlumalics, Simon and
Schuster, 1956, Vol. l, pp. 647-68. See also Helmholtz's Wisnnschdtlichc
A b handlungcn, 2, 640-60.
Hilbert, David: "Sur les problimes futurs des math6matiques," Conptcs Rendus du
Deuxilmc Congrls Intcrnational dcs Malhimaticicns, Gauthier-Villars, 1902,
58-l14. Also in German, in Nachrichtzn Kdnig. Gcs. dcr Wiss. zu Gdu., 19O0,
253-97, and, in Hilbert's Gesammcltc Abhandlungcn, 3, 290-329. English
translation in Amer. l[ath. Soc. Bull., 8, l90l12, 437-79.
Kle in, Felix : " Uber Arithmetisirung der l\{athematik," Gcs. Math. Abh.,2,23240.
English translation in Amcr. It[ath. Soc. Bull., 2, 189516, 241-49.
Pierpont, James: "On the Arithmetization of Mathematics," Ancr. Math. Soc. Bull.,
s, 1898/9,394-406.
Poincard, Henri: The Foundatio.ns of ,9cience , Science Press, 1913. See especially pp.
43-91.
Reid, Constance: Hilbert, Springer-Verlag, 1970. A biography.
4+
The Theory of Functions of
Real Variables
L. Tlu Oigins
The theory offunctions ofone or more real variables grew out ofthe attemPt
to understand and clarify a number ofstrange discoveries that had been made
in the nineteenth century. continuous but non-differentiable functions,
series of continuous functions whose sum is discontinuous, continuous
functions that are not piecewise monotonic, functions possessing bounded
derivatives that are not Riemann integrable, curves that are rectifiable but
not according to the calculus definition of prc length, and nonintegratle
functions thai are limits of sequences of integrable functions-all seemed to
contradict the expected behavior ol functions, derivatives, and integrals'
Another motivation for the further study of the behavior of functions came
from the research on Fourier series. This theory, as built up by Dirichlet,
Riemann, Cantor, Ulisse Dini (1S45-1918), Jordan, and other mathe-
maticians of the nineteenth century, was a quite satisfiactory instrument lor
applied mathcmatics. But the properties of the series, as thus far developed'
f.iied to give a theory that could satisfy the pure mathematicians' Unity,
.y--.try, and completeness of relation between function and series were
still wanting.
The research in the theory of functions emphasized the theory of the
integral because it seemed that most ol the incongruities could be resolved
by f,roadening that notion. Hence to a large extent this work may be
regarded as a direct continuation of the work of Riemann, Darboux, Du
Bois-Reymond, Cantor, and others (Chap. 40, sec. 4)'
ro4o
EARLY WORK ON CONTENT AND MEASURE ro41
+
)fG)(S@'*,) - d(",))
wherein the xo, *r, . . ., tr,tL ate a partition of fa, b] and f1 is within [*1, x,* t].
He then showed that when;[is continuous in fa, bf and the maximum sub-
interval of the partitions approaches 0, the sums approach a limit which he
denoted AV IJf*l @(*). Though he used this integral in his own work,
Stieltjes did not push further the integral notion itself, except that for the
interval (0, o) he defined
His integral concept was not taken up by mathematicians until much later,
when it did find many applications (see Chap. 47, sec.4).
l. Ann. Fac. Sci. de Toulouse, 8, 1894, J.l'122' and 9, 1895, A.l-47 = (Euwcs comllltcs,2,
402-559.
ro42 THE THEORY OF FUNCTIONS OF REAL VARIABLES
measure $/ere introduced to extend the notion oflength to sets of points that
are not full intervals of the usual straight line.
The notion of content is based on the following idea: Consider a set E
of points distributed in some manner over an interval [a, D]. To be loose for
the moment, suppose that it is possible to enclose or cover these points by
small subintervals of [c, D] so that the points of E are either interior to one
of the subintervals or at worst an endpoint. We reduce the lengths of these
subintervals more and more and add others if necessary to continue to
enclose the points of d while reducing the sum of the lengths of the sub-
intervals. The greatest lower bound of the sum of those subintervals that
cover points of E is called the (outer) content of E. This loose formulation
is not the definitive notion that was finally adopted, but it may serve to
indicate what the men were trying to do.
A notion of (outer) content was given by Du Bois-Reymond in his
Dh allgancinc Funktionentluorie (lBB2), Axel Harnack (lB5l-BB) in his Dre
Elancntt da Di.ffcrcntial- und Integralrechnang (lBBl), Otto Stolz,2 and Cantor.s
Stolz and Cantor also extended the notion of content to two and higher-
dimensional sets using rectangles, cubes, and so forth in place of intervals.
The use of this notion of content, which was unfortunately not satis-
factory in all respects, nevertheless revealed that there were nowhere dense
sets (that is, the set lies in an interval but is not dense in any subinterval of
that interval) of positive content and that functions with discontinuities on
such Eets were not integrable in Riemann's sense. Also there were functions
with bounded nonintegrable derivatives. But mathematicians of this time,
the 1880s, thought that Riemann's notion of the integral could not be
extended.
To overcome limitations in the above theory of content and to rigorize
the notion of area of a region, Peano (Applicazioni geomelriche del calcolo
itfinittsimalc, 1887) introduced a fuller and much improved notion of content.
He introduced an inner and outer content for regions. Let us consider two
dimensions. The inner content is the least upper bound of all polygonal
regions contained within the region.R and the outer content is the greatest
lower bound of all polygonal regions containing the region rt. If the inner
and outer content are equal, this common value is the area. For a one-
dimensional set the idea is similar but uses intervals instead of polygons.
Peano pointed out that ifl(x) is non-negative in [a, D] then
rb : li :
dx Ct@) u"d C"(R),
),f J"
where the first integral is the least upper bound of the lower Riemann sums
ol fon la, D] and the second is the greatest lower bound ofthe upper Riemann
sums and C1(ft) and C are the inner and outer content of the region ,R
"(R)
bounded above by the graph ofl Thus,lis integrable if and only ifR has
content in the sense that Cr(R) : C"(R).
Jordan made the most advanced step in the nineteenth-century theory
of content (itcndue). He too introduced an inner and outer content,a but
formulated the concePt somewhat more effectively. His definition for a set
ofpoints E contained in la, bf starts with the outer content. One covers E by
a finite set of subintervals of fa, bf such that each point of .E is interior to or an
endpoint of one of these subintervals. The greatest lower bound of the sum
of all such sets ofsubintervals that contain at least one point ofE is the outer
content of .8. The inner content olE is defined to be the least upper bound ol
the sum of the subintervals that enclose only points of E in [a, D]. If the inner
and outer content of E are equal, then E has content. The same notion was
applied by Jordan to sets in z-dimensional space excePt that rectangles and
the higher-dimensional analogues replace the subintervals. Jordan could now
prove what is called the additivity proPerty: The content of the sum of a
fnite number of disjoined sets with content is the sum of the contents of the
separate sets. This was not true for the earlier theories of content, except
Peano's.
Jordan's interest in content derived from the attemPt to clarify the
theory of double integrals taken over some plane region E. The definition
usually adopted was to divide the plane into squares rR,, by lines parallel to
the coordinate axes. This partition of the plane induces a partition of E
into E11's. Then, by definition,
I f@,il at : Lx'Lt;'o
lim ) f@,,y,)a(R,,),
JE
-'
where a(R,r) denotes the area of R11 and the sum is over all rR,, interior to E
and all l?,, that contain any points of E but may also contain points outside
ofE. For the integral to exist it is necessary to show that the rRt, that are not
entirely interior to E can be neglected or that the sum of the areas of the
R,, that contain boundary points ol E approaches 0 with the dimensions of
the R,r. It had been generally assumed that this was the case and Jordan
himself did so in the first edition of his Cours d'analyse (Yol.2, 1883). However
the discovery ofsuch peculiar curves as Peano's square-filling curve made the
mathematicians more cautious. If E has two-dimensional Jordan content,
then one can neglect the.Ri, that contain the boundary points ofE Jordan
was also able to obtain results on the evaluation of double integrals by
repeated integration.
The second edition of Jord an's Cours d'analyse (Vol. I, 1893) contains
Jordan's treatment of content and its application to integration. Though
4. Jour. de Math., (4),8, 1892, 69-99 = CEuatcst 4, +27-57,
roM TIIE THEORY OF FUNCTIONS OF REAL VARIABLES
s:f 1n1,,1,
The sums S and s have a greatest lower bound J and a least upper bound d
respectively. Lebesgue showed- that for bounded measurable functions
I : J, and this common value is the Lebesgue integral ofrt(r) on E. The
notation is
I: dx.
)"f(x)
If E consists of the entire interval a < x < 6, then we use the notation
il -f @) a*, but the integral is understood in the Lebesgue sense. If/(x) is
Lebesgue integrable and the value of the integral is finite, then..f(.r) is said
to be summable, a term introduced by Lebesgue. An f(x) that is Riemann
r046 THE THEORY OF FUNCTIONS OF REAL VARIABLES
r,11
//(,){X"l*d*
: o,
6. Atti l,
della Accad. dei Linrci, Rendiconti, (+), 1885, 321-26, 532-37, 566-69,
7. 8.g., Ann. de l'Ecole Norm, Sup., (3), 20, 1903, 453-85.
THE LEBESGUE INTEGRAL to47
I r, :
)1" ,ruls<a
d,x 2aoao + ) @^"^ + b,P,),
where a,, and 6n, and cn and Bn are the Fourlier coefficients tor/('t) and g(r)
whose squares are Lebesgue integrable in [-2, zr]. Despite these advances
in the theory of Fourier series, there is no known ProPel'ty of an f(x)
Lebesgue integrable in [-zr, rr] that is necessary and sufficient for the con-
vergence of its Fourier series.
Lebesgue devoted most of his efforts to the connection between the
notions of integral and of primitive functions (indefinite integrals). When
Riemann introduced his generalization ofthe integral, the question u'as posed
whether the correspondence between definite integral and primitive function,
valid for continuous functions, held in the more general case. But it is
possible to give examples of functions;f integrable in Riemann's sense and
such that IIttD a, does not have a derivative (not even a right or leflt
derivative)'it certain points' Conversely Volterra showed in l88l 10 that a
function .F'(x) can have a bounded derivative in an interval l that is not
integrable in Riemann's sense over the interval. By a subtle analysis Lebesgue
shoied that if/is integrable in his sense in fa, bl, then F (x) : I: "f (t) dt has a
derivative equal to;f(.r) almost everywhere, that is, except on a set of
measure zero (Legons sur l'intlgration). Conversely, ifa function g is differenti-
able in la, bf and if its derivative g' : f is bounded, then;f is Lebesgue
integrabL urd th" formula s@) - s@): Jitttl /t holds. Ilowever, as
Lebesgue established, the situation is much more complex ifg' is not bounded'
In this case g' is not necessarily integrable, and the first problem is to
characterize the functions g for which g' exists almost everywhere and is
11
integrable. Limiting himself to the case where one of the derived numbers
ofg is finite everywhere, Lebesgue showed that g is necessarily a function of
bounded variation (Chap. 40, sec. 6). Finally Lebesgue established (in the
1904 book) the reciprocal result. A function g ofbounded variation admits a
derivative almost everywhere and g' is integrable. But one does not neces-
sarily have
(1) s@) - s@) : [- s'Q) dt;
(") .f (r, y) as a function of x and as a function ofy is summable for almost all
g and, x, respectively;
lO. Gior. di Mat,, 19, 1881, 333-72 : OPete Mat., l, 16-48'
ll. The two right derived numbers ofg are the two lirnis,
I*.*t'!l#@',I'*.i:l'!l#'
The two left derived numbers are defined similarly.
12. Arti della Acead. dei Lincei, Rendiconti, (5), 16, 1907, 608-14'
ri
(b) the set of points (ro,yo) for which either;f(x, g) orJ@o,y) is not sum-
mable has measure 0;
(1) It is completely additive; that is, F()f E") : >f F(E ) where the E,,
are pairwise disjoined measurable sets.
(2) It is absolutely continuous in the sense that F(E) tends to 0 with the
measure of E,
The essential part of this paper of Lebesgue was to show the converse of this
proposition, that is, to define a derivative ofF(E) at
point P of z-dimensional
a
space. Lebesgue arrived at the following theorem: Il F(E) is absolutely
continuous and additive, then it possesses a finite derivative almost every-
where, and F is the indefinite integral of that summable function which is the
derivative ofF where it exists and is finite but is otherwise arbitrary at the
remaining points.
The principal tool in the proof is a covering theorem due to Vitali,l4
which remains fundamental in this area of integration' But Lebesgue did not
stop there. He indicated the possibility ofgeneralizing the notion offunctions
of bounded variation by considering functions F(E) where .E is a measurable
set, the functions being completely additive and such that )" lF(E ) |
5. Gmtralizntions
We have already indicated the advantages oflebesgue integration in general-
izing older results and in formulating neat theorems on series. In subsequent
chapters we shall meet additional applications of Lebesgue's ideas. The
immediate developments in the theory of functions were many extensions of
the notion of integral. Of these we shall merely mention one by Johann
Radon (1887-1956), which embraces both Stieltjes's and Lebesgue's integral
and is in fact known as the Lebesgue-Stieldes integral'16 The generalizations
cover not only broader or different notions of integrals on point sets of n-
dimensional Euclidean space but on domains of more general spaces such
as spaces of functions. The applications of these more general concepts are
now found in the theory of probability, spectral theory, ergodic theory, and
harmonic analysis (generalized Fourier analysis).
Bibliographg
Borel, Emile: Notitc sur bs traaaux scitnlifques de M. Emilc Borel,Znd ed., Gauthier-
Villars, 1921.
Bourbaki, Nicolas: Elhrunts d'histoirc de malMmatiques, Hermann, 1960, pp. 246-59.
Collingwood, E. F.: " Emile Borel," Jour. Lon' Math.Soc.,34, 1959, 488-512.
FrCchet, M.: "La Vie et l'cuvre d'Emile Borel," L'Enseignemcnt Matlunatiqu, (2),
ll, 1965, l-94.
Hawkins, T. W., Jr. : Lebesguc's Tluory oJ Integration: Ik Origins and Deaehpnenl,
University of Wisconsin Press, 1970, Chaps. 't-6.
Hildebrandt, T. H. : " On Integrals Related to and Extensions of the Lebesgue
Integral," Amer. Math. Soc. Bull.,24, 1918, ll3-77.
Jordan, Camille: (Euvres, 4 vols., Gauthier-Villars, 196l-64.
Lebesgue, Henri: Measwc awl the Inlegral, Holden-Day, 1966, pp. 176-94. Trans-
lation of the French Ia Mesurc des grandeurs,
Noticc sur les traoaux scicntifqucs dc M. Henri l*besguc, Edouard Privat, 1922.
Itgons sur I'inligration et la rcchtrclu des Jonctions friml'ldza.r, Gauthier-Villars,
1904,2nd ed., 1928.
McShane, E. J.: " Integrals Devised for Special Purposes," Aner. Math. Soc. Bull.,
69, 1963,597-627.
Pesin, Ivan M.: Classical and Modern Integration Thcorizs, Academic Press, 1970.
Plancherel, Michel: " Le Ddveloppement de la thdorie des sdries trigonomdtriques
dans le dernier quart de sihcle," L'Enscignzmcnl Mathimatiquc, 24, 1924125,
t9-58.
Riesz, F.: " L'Evolution de la notion d'intCgrale depuis Lebesgue," Annahs dc
I' Instilut Fowin, l, 1949,2942.
+5
Integral Equations
l. Introduction
An integral equation is an equation in which an unknown function appears
under an integral sign and the problem of solving the equation is to deter-
mine that function. As we shall soon see, some problems of mathematical
physics lead directly to integral equations, and other problems, which lead
fust to ordinary or Partial differential equations, can be handled more
expeditiously by converting them to integral equations. At first, solving
integral equations was described as inverting integrals. The term integral
equations is due to Du Bois-Reymond.l
As in other branches of mathematics, isolated problems involving
integral equations occurred long before the subject acquired a distinct
status and methodology. Thus Laplace in 17822 considered the integral
equation for g(t) given by
for large enough a. Another of the noteworthy results that really belong to
the history ol integral equations stems from Fourier's famous I B I I paPer on
the theory ol heat (Chap. 28, sec. 3). Flere one finds
r o52
INTRODUCTION ro53
Figure 45. I
u(t) _21*
-;J" cos (xt)f(x) dx.
-l I la ds
"
,
-- \/4J, {-*- g
-
Now s can be expressed in terms of f. Suppose s is a((). Then the whole time
-.
of descent T frorr P to O is given by
Jk):l.$aE.
Jo{x-E -
Abel obtained the solution
Thus ifwe can solve this integral equation we shall have obtained that
solution ofthe differential equation (3) that satisfies the initial conditions (4).
Liouville obtained the solution by a method of successive substitutions
attributed to Carl G. Neumann, whose work Unterruthungen iiber das logarith'
misclu und Newton'sche Potential (1877) came thirty years later' We shall not
describe Liouville's method because it is practically identical with the one
given by Volterra, which is to be described shortly.
6. Jour. de l'Ecole Polg., 13, 1832, l-69.
7. Jout. de Math.,2, 1837, 16-35,
INTRODUCTION r055
The integral equations treated by Abel and Liouville are of basic types.
Abel's is of the form
where 0l0n denotes the normal derivative to the boundary, then {(s) must
satisfy the integral equation
(10)
"r@
: root + Luf.olt fi("gfr:i).,
an integral equation of the second kind. These equations were solved by
Neumann for convex areas in lrris (Jntersttchunge, and later publications.
Another problem of partial differential equations was tackled through
integral equations. The equation
(ll) Az * lu:"f(*,y)
in the study of wave motion when the time dependence of the corre-
arises
sponding hyperbolic equation
I
Lu-iuu:f@,y),
usually taken to be e-''t, is eliminated. It was known (Chap. 28, sec. 8) that
the homogeneous case of (11) subject to boundary conditions has nontrivial
solutions only for a discrete set of l-values, called eigenvalues or character-
istic values. Poincar6 in 18948 considered the inhomogeneous case (11) with
complex I. He was able to produce a function, meromorphic in tr, which
represented a unique solution of (l l) for any tr which is not an eigenvalue,
and whose residues produce eigenfunctions for the homogeneous case, that
is, when.,/ : Q.
On the basis of these results, Poincard in 1896e considered the equation
integral equations. He wrote papers on the subject from 1884 on and prin-
cipal ones in 1896 and l897.1o Volterra contributed a method of solving
integral equations of the second kind,
where the kernel r( (later called the solving kernel or resolvent by Hilbert) is
rt(s, t) : lb
-r(s, t) * J. ((r, r)K(r, t) dr
-l'nl Ls
tt, .... rrL
where rr, rz,,..,rn run independently over all the values from I to z. By
expanding the determinant of the coefficients in ( I 8), and then letting n
become infinite, Fredholm obtained the determinant
+ I:J:l*,r,'u, K((,,
(,) x(E,' il)l,o,dtr-"'.
lK(t,,y) K(t,, €,) K(t,, il)l
Fredholm called D(r, y, l) the first minor of the kernel K because it plays
the role analogous to first minors in the case of z linear equations in z un-
knowns. He also called the zeros of the integral analytic function D(f) the
roots of r((.t,y). By applying Cramer's rule to the system of linear equations
(18) and by letting z become infinite, Fredholm inGrred the form of the
solution of ( 16). He then proved that it was correct by direct substitution and
could assert the following results: If l is not one of the roots of r(, that is, if
D(A) + 0, (16) has one and,only one (continuous) solution, namely,
and the inhomogeneous equation (16). It is almost evident from (21) that
when I is not a root ofr( the only continuous solution of (22) is u = 0. Hence
he considered the case when tr is a root of r(. Let tr : trr be such a root. Then
(22) has the infinite number of solutions
where the rr's are arbitrary constants; the ur,u2,. . .,un, called principal
solutions, are linearly independent; and n depends upon lr. The quantity z
is called the index of ,\1 [which is not the multiplicity of trt as a zero of D(])].
Fredholm was able to determine the index of any root ,\, and to show that the
index can never exceed the multiplicity (which is always finite). The roots
of D(f) : 0 are called the characteristic values of K(x, y) and the set ofroots
is called the spectrum. The solutions of (22) corresponding to the character-
istic values are called eigenfunctions or characteristic functions.
And now Fredholm was able to establish what has since been called the
Fredholm alternative theorem. In the case where I is a characteristic value
offf not only does the integral equation (22) have z independent solutions
but the associated or adjoint equation which has the transposed kernel,
namely,
u(x) : d{,
^t*r*,x)u({)
also has z solutions {r(r), . . ., rlt,(*) for the same characteristic value and
then the nonhomogeneous equation (16) is solvable if and only if
(23)
[' rul*,o
dx : o, i : 1,2,' ' ', n.
These last few results parallel very closely the theory of a system of linear
algebraic equations, homogeneous and nonhomogeneous'
s(^):l*it-l)odn^n,
n=l
wherein x(r) and y(r) are arbitrary continuous functions of r on [0, l], and
wherein now .r(r) : r((s, r) and y(z) : K(r, t). He then proves that if k is
defned by
i(-(s. r) : A*(ljr,(l)
fr
t)
-
for values of ,\ for which D(f) I 0, then
ft
: ,((s, t) - AJo K(s, t)K(r, t) dr'
Finally if { is taken to be
then { is a solution ot(24). The proofs ofvarious stePs in this theory involve a
number of limit considerations on expressions which occur in Hilbert's
treatment of the finite system of linear equations.
Thus far Hilbert showed that for any continuous (not necessarily
symmetric) kernel r((s, I) and for any value of tr such that 5()) I 0, there
exists the solving function (resolvent) r((r, t), which has the property that
(25) solves equation (24).
Now Hilbert assumes.l((s, t) to be symmeric, which enables him to use
facts about symmetric matrices in the finite case, and shows that the zeros
of E(I), that is, the eigenvalues of the symmetric kernel, are real. Then the
zeros of 6(l) are ordered according to increasing absolute values (for equal
THE WORK OF I{ILBERT r063
absolute values the positive zero is taken first and any multiplicities are to be
counted). The eigenfunctions of (24) are now defined by
d*(,) : (*#h,D),,,
r*1,r*;,,,*;,
where sr is chosen so that A*(,\*;s*,s*) * 0 and tr* is any eigenvalue of
K(s, t).
The eigenfunctions associated with the separate eigenvalues can be
chosen to be an orthonormal (orthogonal and normalized 13) set and for each
eigenvalue ,\* and for each eigenfunction belonging to tr*
(26)
22r"**,
be an z-dimensional quadratic form in the a variables *L, x2,..., xn, This
can be written as (.Kx, r) where r( is the matrix of the *ro, I stands for the
vector (rr, xz, . . ., xn) and (Kx, x) is the inner product (scalar product) ofthe
two vectors Kx and x. Suppose 1( has the z distinct eigenvalues lt, ,\2, . . ., trn.
Then for any fixed )" the equations
6- : @y.,i8,"',6b,
which is a unique solution up to a constant multiple. It is then possible, as
Hilbert showed, to write
(28) K(x,x):z;tr#
wherein the parentheses again denote an inner product of vectors.
Hilbert's generalized principal axis theorem reads as follows: Let
1((s,I) be a continuous symmetric function ofs and t. Let /e(s) be the normal-
ized eigenfunction belongin! to the eigenvalue l, of the integral equation
(24). Then for arbitrary continuous *(s) and y(s), the following relation
holds:
(2e)
i" f ^U,
t),c(s)y(t) ** :2f (J'o,t,r,r,, ,r([ +,r)vo) d,),
(30) ,f(,) :
r K(s, t) s(t) dt
then
K\x, y)
.\-:
Z koox,lq,
p,c=7
THE WORK OF IIILBERT r065
and by passing to the limit of results for bilinear and quadratic forms in 2z
and z variables respectively, obtains basic results. The details ofthe work are
considerable and we shall only note some ol the results. Hilbert first obtains
an expression for a resolvent form 1((); *, r), which has the peculiar feature
that it is the sum of expressions, one for each ofa discrete set ofvalues ofl,
and of an integral over a set of ,\ belonging to a continuous range' The
discrete set of ) values belongs to the point spectrum ofI(, and the continuous
set to the continuous or band sPectrum. This is the first significant use of
continuous spectra, which had been observed for partial differential equations
by Wilhelm Wirtinger (b. 1865) in 1896.14
To get at the key result for quadratic forms, Hilbert introduces the notion
of a bounded form. The notation (.r, x) denotes the inner (scalar) product of
the vector (rr, rr, . . . , *,, . .. ) with itself and (x,y) has the analogous
meaning. Then the form K(x, g) is said to be bounded lf lK(x, g)l < M fot
all r and y such that (*, *) < I and (y,y) < L Boundedness implies con-
tinuity, which Hilbert defines for a function of infinitely many variables.
Hilbert's key result is the generalization to quadratic forms in infinitely
many variables of the more familiar principal axis theorem of analytic
geometry. He proves that there exists an orthogonal transformation 7 such
that in the new variables xi, where x' : Tx,.t( can be reduced to a "sum of
squares." That is, every bounded quadratic form K(x, r) : 23,o=t kr{rxo
can be transformed by a unique orthogonal transformation into the form
(33) K(x.t\:5.*,r?
2 "
I d"(p't),
* JG, tL
I=1
where the k, are the reciprocal eigenvalues of K. The integral, which we shall
not describe further, is over a continuous range ofeigenvalues or a continuous
spectrum.
To eliminate the continuous spectrum, Hilbert introduces the concept
of complete continuity. A function F(*t, xz, . . .) of infinitely many variables
is said to be completely continuous at a : (ar, as,. . .) if
lim F(o, + ey a2 I
6',O.
s2, ' ' ') : F (ar, a2,' ' ')
ci-o'
whenever €lt E2t ." ,' , are allowed to run through any value system ef), ef), . . .
having the limit
lim ei]) : 0, lim ef) : 0, "'.
hi co lt+ o
(34) K(x,x):2r,*,
I
where the /r, are reciprocal eigenvalues and the (xr,*r,,..) satisly the
condition that )f *f is finite.
Now Hilbert applies his theory of quadratic forms in infinitely many
variables to integral equations. The results in many instances are not new
but are obtained by clearer and simpler methods. Hilbert starts this new
work on integral equations by defining the important concept ofa complete
orthogonal system of functions {{r(s)}. This is a sequence of functions all
defined and continuous on the interval [a, D] with the following properties:
(a) orthogonality:
ui : J"g,(s)u(s)
ds
is called the Fourier coefficient of z("r) with respect to the system {de}.
Hilbert shows that a complete orthonormal system can be defined for
any finite interval fa, bl, tor example, by the use of polynomials. Then a
generalized Bessel's inequality is proved, and finally the condition
lo
u'(s) d' : S ,I,
is shown to be equivalent to completeness.
Hilbert now turns to the integral equation
",,
: I: lo 16,,16,1,16og) ds dt.
It follows that
2
p,tl= 7
"',,. ff Kzg,t) dsdt.
Also, if
lb
o, : l,$,(s)f(s)
ds'
that is, if the a, are the " Fourier" coefficients of;f(s), then 2?=, a? < a.
Hilbert next converts the above integral equation into a system of in-
finitely many linear equations in infinitely many unknowns. The idea is to
look at solving the integral equation for {(s) as a problem of finding the
" Fourier" coefficients of {(s). Denoting the as yet unknown coefficients by
*s )(2, . . . , he gets the following linear equations:
sr
(36) *o+ Zapdq:ap, P:1,2,"'.
He proves that if this system has a unique solution, then the integral equation
has a unique continuous solution, and when the linear homogeneous system
associated with (36) has z linearly independent solutions, then the homo-
geneous integral equation associated with (35),
has a unique solution for all;f or the associated homogeneous equation has
n linearly independent solutions. In the latter case (39) has a solution if and
only if the orthogonality conditions (38) are satisfied'
Hilbert turns next to the eigenvalue problem
K(x', x') _s
-L
,2
llpqp ,
where the pe are tt,Le reciprocal eigenvalues of the quadratic form K(x' x)'
The eigenfunctions {{r(s)} for the kernel r((s, I) are now defined by
: :
Z,(l((s)) 2,*ll .l((s, t)Oo(l) dt y.,$r(s)
where the Oo(t) are a given complete orthonormal set. The {r(s) [as distinct
from the Oq(r)] are shown to form an orthonormal set and to satisfy
:
[' x$, t1g1t1 dt f(s),
thenlf is representable as a series in the eigenfunctions of .I( which is uniformly
and absolutely convergent (cf. t3l]). Hilbert uses this result to show that the
homogeneous equation associated with (40) has no nontrivial solutions
."""pi ut the eigenvalues l' Then the Fredholm alternative theorem takes
the iorm: For I I I, equation (40) has a unique solution' For I : trr,
equation (40) has a solution if and only if the n, conditions
J d,*,k).f(') : o, j : 1,2,. . ., no
TrrE woRK oF HTLBERT 1069
are satisfied where the are the z, eigenfunctions associated with trr.
$r*/s)
Finally, he proves anew the extension of the principal axis theorem :
(41)
fr(oatu*) * oav * '\z: o
where G(r, f) is the Green's function for (41), that is, a particular solution of
*,(t4-) + qQ)u: o
ithad required great mathematical efforts (Chap. 28, sec. B) to prove the
existence of the lowest oscillating frequency for a membrane. With integral
equations, constructive proofofthe existence ofthe whole series offrequencies
and of the actual eigenfunctions was obtained under very general conditions
on the oscillating medium. These results were first derived, using Fredholm's
theory, by Emile Picard.ls Another noteworthy result due to Hilbert is that
the development ofa function in the eigenlunctions belonging to an integral
equation of the second kind depends on the solvability of the corresponding
integral equation of the first kind. In particular, Hilbert discovered that the
success of Fredholm's method rested on the notion of complete continuity,
which he carried over to bilinear forms and studied intensively, Here he
inaugurated the spectral theory of bilinear symmetric forms.
After Hilbert showed how to convert problems of differential equations
to integral equations, this approach was used with increasing frequency to
solve physical problems. Here the use of a Green's function to convert has
been a major tool. Also, Hilbert himself showed,16 in problems of gas dy-
namics, that one can go directly to integral equations. This direct recourse
to integral equations is possible because the concept of summation proves as
fundamental in some physical problems as the concept of rate of change
which leads to differential equations is in other problems. Hilbert also
emphasized that not ordinary or partial differential equations but integral
equations are the necessary and natural starting point for the theory of
expansion of functions in series, and that the expansions obtained through
differential equations are just special cases of the general theorem in the
theory of integral equations.
dx:
l" ruto,al
ap
for each $, and ar. The function;f Proves to be Lebesgue square integrable'
This theorem establishes a one-to-one correspondence between the set of
Lebesgue square integrable functions and the set of square summable
,.q.r.i"., through the mediation of any orthonormal sequence of Lebesgue
square integrable functions.
With ihe introduction of Lebesgue integrable functions, Riesz was
also able to show that the integral equation of the second kind
ff*@) - -f^(*))2 dx
: o'
,l11-[
and {J} is said to converge in the mean to;f if
:y:['"c-n126:o'
The integrals are Lebesgue integrals. The function;f is uniquely determined
to within a function defined over a set of measure 0, that-.is, a function
g(x) * O,called a null function, which satisfies the condition .fi s'@) d* : O'
19. Com!. Rmd., 144, 1907,1022-24.
t072 INTEGRAL EQUATIONS
which had arisen in Riesz's 1907 worls, is called the moment problem
(Lebesgue integration is understood). In 191021 Riesz sought to generalize
this problem. Because in this new work Riesz used the Hdlder inequalities
\l",t,l = (Z r,,r)"'(i
',,,1"'
and
- ru)) dt : o'
Y!)rnat
strong convergence implies weak convergence. (The modern definition of
weak iorvergence, if {fr} belongs to Le and if;[ belongs to I? and if
dx :
;,a J. rrr,l - f,(x))g(x)
o
J@ : ^lzf'
,Y 7JO cos (x$u($ dt,
which can be regarded as an integral equation of the first kind and has as
its solution the inverse transform
has just two eigenvalues * I and each has an infinite number of eigenfunc-
tions, These cases are now studied under the heading of singular integral
equations. Such equations cannot be solved by the methods applicable to
the Volterra and Fredholm equations. Moreover, they exhibit a curious
property, namely, there are continuous intervals of l-values or band spectra
for which there are solutions. The first significant paper on this subject is
due to Hermann Weyl (1885-1955).22
The subject of existence theorems for integral equations has also been
given a great deal of attention. This work has been devoted to linear and
nonlinear integral equations. For example, existence theorems for
Bibliograplry
Bernkopf, IVt.:
,.The Development of Function Spaces with Particular Reference
to their Origins in Integral Equation Theory," Archioe fot Hislory of Exact
Sciences, 3, 1966, l-96.
Bliss, G. A. : " The Scientific Work of E. H. Moore," Amer' Math' Soc' Bull', 40,
1934,501-14.
Bocher, M.: An Introduction lo the Slud.g of Integtal Equatians, 2nd ed', Cambridge
University Press, 1913.
Bourbaki, N.z Eliments d'hisloire des mathimati,ques, Hermann, 1960, pp' 230-45'
Davis, Harold T.: The Present Stale oJ Integral Equations, Indiana university Press,
1926.
Hahn, H.: " Bericht iiber die Theorie der linearen Integralgleichungen," Jahres'
der Deut. Math.-Verein., 20, l9l l, 69-l l7'
Hellinger, E.; Hilberts Arbeiten i)ber Integralgleichungen und unendliche Ghichungs'
systeme, in Hilbert's Gcsam. Abh.,3, 94-145, Julius Springer, 1935'
" Begriindung der Theorie quadritischer Formen von unendlichvielen
Veriinderlichen," Jour. fiit Math., 1361 1909, 210-71'
Hellinger, E., and O. Toeplitz: " Integralgleichungen und Gleichungen mit
unendlichvielen lJnbekannten," Encgk. der Malh' Wiss', B' G' Teubner,
1923-27, Vol.2, Part 3,2nd halt 1335-1597'
Hilbert, D.: Grundziige iiner allgemeiwn Theoric dcr liruarcn Integralglcichungcn, 1912,
Chelsea (rePrint), 1953.
Reid, Constance : Hilbert, Springer-Verlag, 1970'
Volterra, Yito: Apere matematithe,5 vols., Accademia Nazionale dei Lincei, 195'1-62'
Weyl, Hermann : Gesammelte Abhandlungen,4 vols, Springer-Verlag, 1968'
+6
Functional Analysis
J: dx,
)"F(x,s,y')
the integral can be regarded as operating on a class of functions y (x) ot
which that one is sought which happens to maximize or minimize the integral.
The area of differential equations offers another class of operators. Thus the
differential operator
L:!;+p@)!*+ilx),
acting on a class y(,r) of functions, converts these to other functions, Of
course to solve the differential equation, one seeks a particular y(r) such
that Z acting on that y(r) yields 0 and perhaps satisfies initial or boundary
conditions. As a final example ol operators we have integral equations. The
right-hand side of
"f@)
: I' K@,Y)u(x) dx
to76
THE TIIEORY OF FUNCTIONALS r077
l. Atri della Reale Accadenia dei Lincei, (4), 3, 1887, 97 105, 14l-46' 153-58 : O?ue
matematiche, l, 294-314, and others of the same and later years.
2. Bult. Soc. Math. de France,30, 1902, 40-43 : (Euores, 1,401-4.
3, We*e, p. 30.
4. Memorie della Reale Accademia dei Lincei, (3), 18, 1883, 521-86.
5. Atti della Accad. dei Lincei, Rendiconti, (4), 5, 1889, 342 48.
6- Verhandlungen des erslen inlernationalen Mathematiker-Koagtcsses, Teubner, 1898, 201-2'
roTB FUNCTIONAL ANALYSIS
7. Verhandlungcn, 204-5.
8. Comp. Rend., 136, 1903, 351-54 : (Euures, 1,405-8'
9, Rendiconti del Circolo Matematieo di Palermo,22, 1906, l-74'
THE THEORY OF FUNCTIONALS t079
decreasing sequence, that is, E,*, is contained in.E',, of closed subsets ofan
extremal set, then the intersection of all the .En is not empty.
Frichet now considers functionals (he calls them functional operations)'
These are real-valued functions defined on a set E. He defines the continuity
ofa functional thus: A functional Uis said to be continuous at an element
A of E il lirr- U (A*) : U(A) for all sequences {1,} contained in E and
converging to L He also introduced semicontinuity olfunctionals, a notion
introduced for ordinary functions by Ren6 Baire (1874-1932) in 1899.10 t/
is upper semicontinuous in .E il U (A) > Iim sup U (A^) for the {l'} just
described. It is lower semicontinuous if U (,4) < lim inf U (A^)."
With these definitions FrCchet was able to prove a number of theorems
on functionals. Thus every functional continubus on an extremal set -E is
bounded and attains its maximum and minimum on E. Every upper semi-
continuous functional defined on an extremal set -E is bounded above and
attains its maximum on E.
Frdchet introduced next concepts for sequences and sets of functionals
such as uniform convergence, quasi-uniform convergence' compactness, and
equicontinuity. For example, the sequence of functionals {U,,} converges
uniformly to U if given any positive quantity e, lU"(A) - U(A)I < e for z
sufficiently large but independent of A in E. He was then able to Prove
generalizations to functionals of theorems previously obtained for real
functions.
Having treated the class of spaces Z, Fr€chet defines more specialized
spaces such as neighborhood spaces, redefines the concePts introduced for
spaces with limit points, and proves theorems analogous to those already
described but often with better results because the spaces have more
properties.
Finally he introduces metric spaces. In such a sPace a function playing
the role of distance (icart) and' denoted by (A, B) is defined for each two
points ,4 and -B of the space and this function satisfies the conditions:
(a) (A, B) : (8, A) > 0;
(b) (1, B) : 0 if and onty if A : B;
(c) (A,B) + (B,C) > (A,C).
Condition C is called the triangle inequality' Such a space, he says, is of
class d. For such spaces, too, Frdchet is able to prove a number of theorems
about the spaces and the functionals defined on them much like those
previously
- proven for the more general spaces.
Frdchet gave some examples of function spaces' Thus the set of all real
functions of a real variable, all continuous on the same interval 1 with the
icart of any two functionslf and g defined to be max l"f @- S(") l, is a sPace
of class d. Today this hart is called the maximum norm.
Another example offered by Fr6chet is the set of all sequences of real
numbers. lf x: (xy.r2,...) and ! : (!y!2,...) are any two sequences,
the icart of x and y is defined to be
(*,y):zi#;5
This, as Fr6chet remarks, is a space of countably infinite dimensions'
Using his spaces d, Fr6chet 12 succeeded in giving a definition of
continuity, diflerential, and differentiability of a functional. Though these
were not entirely adequate for the calculus of variations, his definition of
differential is worth noting because it is the core of what did prove to be
satisfactory. IIe assumes that there exists a linear functional Z(a(.r)) such
that
F[s(x) +,r(,)] : F(y) + LQ) + eM(fi,
where 4(x) is a variation on y(.r), M(fi is the maximum absolute value
of 1(x) on fa, bf, and. e approaches 0 with M. Then Z(a) is the differential of
F(y). He also presupposes the continuity of F(y), which is more than can be
satisfied in problems of the calculus of variations.
Charles Albert Fischer (lSB4-1922) 1a then improved Volterra's defini-
tion of the derivative of a functional so that it did cover the functionals
used in the calculus of variations; the differential ofa functional could then
be defined in terms of the derivative.
So far as the basic definitions of properties of functionals needed for the
calculus of variations are concerned, the final formulations were given by
Elizabeth Le Stourgeon (lBBl-1971).14 The key concept, the differential
of a functional, is a modification of Fr6chet's. The lunctional F(y) is said
to have a differential atyo(*) if there exists a linear functional Z(r7) such that
for all arcs y6 * q in the neighborhood ofyo the relation
F(yo + il : F(Yo) + L(fi + M(d"Qt)
holds where M (q) is the maximum absolute value of r7 and 4' on the interval
[a,6] e(7) is a quantity vanishing with -,M(1). She also defined second
1nd
differehtials.
Both Le Stourgeon and Fischer deduced from their definitions of
differentials necessary conditions for the existence of a minimum ol a
functional that are of a type applicable to the problems ol the calculus of
variations. For example, a neccssary condition that a flunctional F(y) have
of functionals needed for the latter field are rather special and do not hold
for functionals in general. In addition, the nonlinearity of these functionals
created difficulties that were irrelevant to the functionals and operators that
embraced integral equations. While concrete extensions of the theory of
solution of integral equations were being made by Schmidt, Fischer, and
Riesz, these men and others also began work on the corresPonding abstract
theory.
The first attempt at an abstract theory of linear functionals and oper-
ators was made by the American mathematician E' H. Moore starting in
1906.16 Moore realized that there were features common to the theory of
linear equations in a finite number of unknowns, the theory of infinitely
many equations in an infinite number of unknowns, and the theory of linear
integral equations. He therefore undertook to build an abstract theory,
called General Analysis, which would include the above more concrete
theories as special cases. His approach was axiomatic. We shall not present
the details because Moore's influence was not extensive, nor did he achieve
effective methodology. Moreover, his symbolic language was strange and
difficult for others to follow.
The first influential step toward an abstract theory oflinear functionals
and opcrators was taken in 1907 by Erhard Schmidt 18 and Fr6chet.17
Hilbert, in his work on integral equations, had regarded a function as given
by its Fourier coefficients in an expansion with respect to an orthonormal
sequence of functions. These coefficients and the values he attached to the
rj in his theory of quadratic forms in infinitely many variables are sequences
{xn} such that )fl xl is finite. However, Hilbert did not regard these sequences
as coordinates ofa point in space, nor did he use geometrical language. This
step was taken by Schmidt and Fr6chet. By regarding each sequence {rn} as
a point, functions were represented as points of an infinite-dimensional
space. Schmidt also introduced complex as well as real numbers in the
sequences {x,}. Such a space has since been called a Hilbert space. Our
account follows Schmidt's work.
The elements of Schmidt's function spaces are infinite sequences of
complex numbers, z : {zr\, with
15. Sec, for example, Atti del IV Congresso Internazionale dei Matematici (l9OA),2' Rcale
Accadcmia dei Lincei,l909, 98-l14, and Amer. Math. Soc. Bull., 18, l9lll12,334-62.
16, Rcndiconti del Circolo Malcmatico di Palermo,25, 1908, 53-77 .
17. Nouvcllcs Annales de Mathlmatiques, S, t908, 97-116, 289-317.
LINEAR FUNCTIONAL ANALYSIS roB3
lf,=, zow* so that llzll : \4r2. (The modern practice is to define (2, rr)
as ) zSitr.) Two elements z and w of the space are called orthogonal if and
only if (2, a) : O.Schmidt then proves a generalized Pythagorean theorem,
namely, if 21, 22,. . . , z,t are n mutually orthogonal elements of the space,
then
IL: s.
L'p
implies
t
L l(.,
2o)l' < ll,ll'
Also Schwarz's inequality and the triangle inequality are proved for the
norm.
A sequence of elements {2"} is said to converge strongly to z if llz" - zll
approaches 0 and every strong Cauchy sequence, i.e. every sequence for
which ll z, - zo ll approaches 0 as p and' q approach o, is shown to converge
to an eiement z so that the space of sequences is complete. This is a vital
property.
Schmidt next introduces the notion of a (strongly) closed subspace. A
subset I of his space I/ is said to be a closed subspace if it is a closed subset
in the sense of the convergence just defined and if it is algebraically closed;
that is, it w1 and u2 are elements of l, then a1a1 * a2u2, where a1 and a2
are any complex numbers, is also an element of l. Such closed subspaces are
shown to exist. One merely takes any sequence {z'} of linearly independent
elements and takes all finite linear combinations of elements of {2,}' The
closure of this collection of elements is an algebraically closed subspace'
Now let A be any fixed closed subspace. Schmidt first proves that if z
is any element of the space, there exist unique elements a;1 and zo, such that
z : wr * rl, where w, is in A and a]2 is orthogonal to l, which means that
zr..r, is orthogonal to every element in ,4. (This result is today called the
projection ih.o..-; zr, is the projection of z in ,4.) Further, llro2ll :
min lly - zll where y is any element in ,4 and this minimum is assumed
only for y -- ur. llwrll is called the distance between z znd A'
ro84
In 1907 both Schmidt and Fr6chet remarked that the space of square
summable (Lebesgue integrable) functions has a geometry completely
analogous to the Hilbert space of sequences. This analogy was elucidated
when, some months later, Riesz, making use of the Riesz-Fischer theorem
(Chap. 45, sec. 4), which establishes a one-to-one correspondence between
Lebesgue measurable square integrable functions and square summable
sequences ofreal numbers, pointed out that in the set Z2 ofsquare summable
functions a distance can be defined and one can use this to build a geometry
of this space of functions. This notion of distance between any two square
summable functions of the space 12, all defined on an interval la, bl, was
in fact also defined by Frichet rs as
where the integral is understood in the Lebesgue sense; and it is also under-
stood that two functions that differ only on a set ofmeasure 0 are considered
equal. The square of the distance is also called the mean square deviation
of the functions. The scalar product o[;f and g is defined to be (/ g) :
[l-f@e@ /x. Functions for which ("f, d
: 0 are called orthogonal. The
Schwarz inequality,
t JG)sG) dx <
and other properties that hold in the space of square summable sequences'
apply to the function space ; in particular this class of square summable
functions forms a complete space. Thus the space of square summable
functions and the space of square summable sequences that are the Fourier
coemcients of these functions with respect to a fixed complete orthonormal
system of functions can be identified.
So far as abstract function spaces are concerned, we should recall
(Chap. 45, sec. 4) the introduction of Zp spaces, I < P < o, by Riev.
ih.r" ,p"".. are also complete in the metric d("fr,"f) : (ll tt, - -frlo d*)'t' .
Though we shall soon look into additional creations in the area of
abstract spaces, the next developments concern functionals and operators.
In the 1907 paper just referred to, in which he introduced the metric or
icart for functions in Z2 space, and in another paper of that year,ls Frdchet
proved that lor every continuous linear functional U(f) defined on 12 there
is a unique u(x) in L2 such that fot every J in L2
for given ft, assigns;f to g and satisfies some additional conditions. In the
notation ,4 for the abstract operator and the notation C: Af' linearity
means
#nerc (Afr, f2) denotes the inner or scalar product of two functions of the
,p."., thi., ,4 is said to be self-adjoint. In the case of integral equations, if
dY'
I:k@'Y)r(Y)
then
'':
(Ar,,r,) : t:t:k@'Y)r1@)J'(x) ds dx
(.f,, AJ,) : I: I: k@, Y)f'(Y)f'(x) dY dx'
xg, t14py dt
[b
as a transformation acting on the functiond(t). He called it a functional
transformation and denoted it by f(d(r)). Moreover, since the d(') with
which Riesz was concerned were in Lt space, the transformation takes
functions into the same or another space. In particular a transformation or
operator that takes functions of Lp into functions ofzp is called linear in zp
iiit satisfies (3) and if 2-is bounded; that is, there is a constant M such that
for all;tin I" that satisfy
r lf(x)1'?dx < L
then
(4)
t 7'(f(x))g(x) dx
(5)
[' r1y@y1g1*) * : I: f@){,(x) dx.
The adjoint or transpose of 7, denoted by 7t, is now defined to be that
operator in Zq, depending for fixed 7 only on g, which assigns to g the ry' of
equation (5); that is, T*(S) : f. (In modern notation 7* satisfies (A g) :
(f, f*d.) 7"* is a linear transformation in Zq and ilf*il : ilf fl.
Riesz now considers the solution ol
(6) r($(x)) : f(x),
where 7is a linear transformation in Le,Jis known, and { is unknown. He
proves that (6) has a solution if and only if
was given during the years 1920 to 1922 by Stef,an Banach (1892-1945),
Hans Hahn (1879-1934), Eduard Helly (1884-1943), and Norbert Wiener
(1894-1964). Though there is much overlap in the work of these men and
the question of priority is difficult to settle, it is Banach's work that had the
greatest influence. His motivation was the generalization ofintegral equations.
The essential feature of all this work, Banach's in particular,2s was to
set up a space with a norm but one which is no longer defined in terms ofan
inner product. Whereas in L2, llfll : ("f,f)''' it is not possible to define the
norm of a Banach space in this way because an inner product is no longer
available.
Banach starts v/ith a space .E of elements denoted by x, g, 2,.. ., whereas
a, b, c,. ..
denote real numbers. The axioms for his space are divided into
three groups. The first group contains thirteen axioms which specify that
E is a commutative group under addition , that E is closed under multipli-
cation by a real scalar, and that the familiar associative and distributive
relations are to hold among various operations on the real numbers and
the elements.
The second group of axioms characterizes a norm on the elements
(vectors) of.E. The norm is a real-valued function defined on .E and denoted
by ||"il. For any real number a and, any.r in E the norm has the following
proPerties :
familiar facts for his spaces. One of the key theorems states: Let {r,} be a
set of elements of E such that
\ llr,ll < -.
Then )f,=r.f,p converges in the norm to an element.z of E.
After proving a number of theorems, Banach considers operators defined
on the space but whose range is in another space Er, which is also a Banach
space. An operator .F is said to be continuous at .ro relative to a set I if
.F(.t) is defined for all x of A; xo belongs to I and to the derived set of ,4;
and whenever {*,} is a sequence of I with limit xo, then F(x,) approaches
F(xo). He also defines unilorm continuity of F relative to a set I and then
turns his attention to sequences ol operators. The sequence {F,} of operators
is said to converge in the norm toFon a set ,4 if for every x in A, lim F"(x) :
F (*).
An important class of operators introduced by Banach is the set of
continuous additive ones. An operator tr' is additive if for all x and y,
F(x + y): F(x) + F(y). An additive continuous operator proves to have
the property that F (ax) : aF (x) for any real number a. If F is additive and
continuous at one element (point) of E, it follows that it is continuous
everywhere and it is bounded ; that is, there is a constant M, depending
onlyonF,sothat llF(r) l] < Mllxll for every.t of -8. Another theorem asserts
that if {f',} is a sequence of additive continuous operators and if F is an
additive operator such that for each x, lil;: Fn@) : F(*), then F is contin-
uous and there exists an rVI such that for all n, llF"(x) il < Ull*ll.
In this paper Banach proves theorems on the solution of the abstract
formulation of integral equations. If F is a continuous operator with domain
and range in the space E and if there exists a number M, O < M < 1, so
that for all x' and x' in E, llr(r') - r(r') ll < M ll*' -.r"ll, then there
exists a unique element x in E that satisfies F(t) : v. More important is
the theorem: Consider the equation
(8) +hF(x):s
where y is a known function in .8, F, is an additive continuous operator with
domain and range in E, and ft is a real number. Let M be the least upper
bound of all those numbers M' satisfying llf (") ll < M'llxll tor all r. Then
for every y and every value of i satisfying lhMl < 1 there exists a lunction r
satisfying (B) and
where F(")(y) : ,io(F('-1)(y)). This result is one form of the spectral radius
theorem, and it is a generalization of Volterra's method of solving integral
equations.
Banach 19292{ introduced another important notion in the subject
in
of functional analysis, the notion of the dual or adjoint space of a Banach
space. The idea was also introduced independently by Hahn,2s but Banach's
work was more thorough. This dual space is the space of all continuous
bounded linear functionals on the given space. The norm for the space of
functionals is taken to be the bounds of the functionals, and the space proves
to be a complete normed linea1 space, that is, a Banach space' Actually
Banach's work here generalizes Riesz's work on LD arrd Zq spaces, where
C : pt(p - I), because the I{ space is equivalent
to the dual in Banach's
sense of the L space. The connection with Banach's work is made evident
by the Riesz representation theorem (2). In other words, Banach's dual
space has the same relationship to a given Banach space E that Lq has to Lp'
Banach stars with the definition of a continuous linear functional, that
is, a continuous real-valued function whose domain is the space E, and
proves that each functional is bounded. A key theorem, which generalizes
onc in Hahn's work, is known today as the Hahn-Banach theorem' Letp be
a real-valued functional defined on a complete normed linear space R and
Iet p satis$ for all x and Y in fi
Jlultsr'<,t
a, : o, i:1'2,"',n
be satisfied, where gjl),. . ., grn) is a set of linearly independent solutions of
fL
0 : s(r) - K(t,s)g(t) dt.
^oJo
(o,b) :20,u-
p=l
then, iflf corresponds to a and g to b, (f, g) : (o, b).
26. Math. Ann., 102, 1929/30, 49-131, and 370-427 : Coll. Works,2,3-143.
THE A)iIOMATIZATION OF I{ILBERT SPACE ro93
(A) 11 is a linear vector space. That is, there is an addition and scalar
multiplication defined on .FI so that if f, and 1f, are elements of I/ and a1
and. a, are any complex numbers, then arf, I a2f2 is also an element of 1/.
Bibliography
Bernkopll M.: "The Development of Function Spaces with Particular Reference
to their Origins in Integral Equation Theory," Archiae.for Histary of Exact
Sciences, 3, 1966, l-96.
"A History of Infinite Matrices," Arehiae for Hislory of Exact Scitrces, 4,
1968, 308-58.
Bourbaki, N.: Eliments d'histoire dcs mathimatiqucs, Hermann, 1960, 230-45.
Dresden, Arnold: "Some Recent Work in the Calculus of Variations," ,{zrr,
Math. Soc. Bull., 32, 1926,475-521.
FrCchet, M.': Notice sur les traaaux scientifques dt M. Maurice Frichat, Hermann, 1933.
Hellinger, E. and O. Toeplitz: " Integralgleichungen und Gleichungen mit
unendlichvielen Unbekannten," Euyk. dcr Math. Wiss., B. G. Teubner,
1923-27, Vol.2, Part III,2nd halt 1335-1597.
Hildebrandt, T. H. : " Linear Functional Transformations in General Spaces,"
Amer. Math. Soc. Bull.,37, 1931, lA5-212.
Ldvy, Paul: "Jacques Hadamard, sa vie et son @uvre," L'Enscigtutunt MarMma-
tique, (2), 13, 1967, l-24.
McShane, E. J. : " Recent Developmens in the Calculus of Variations," lnrar.
M ath. Soc. Semicentcnnial Publications, 2, I 938, 69-97.
Neumann, John von: Collected Works, Pergamon Press, 1961, Vol. 2.
Sanger, Ralph G.: " Functions of Lines and the Calculus of Variations," Uniursiq
qf Chhago Contributions to the Calculus of Variations for 1931-32, University of
Chicago Press, 1933, 193-293.
Tonelli, L.: "The Calculus of Variations," Anur. Math. Soc. Bull.,3l, 1925, 16:l-72.
Volterra, Yito: Opere matematil:\u,5 vols., Accademia Nazionale dei Lincei, 195M2.
47
Divergent Series
It is indeed a strange vicissitude of our science that those
series which early in the century were supposed to be banished
once and for all from rigorous mathematics should, at its
close, be knocking at the door for readmission.
JAMES PIERPONT
l. Introduction
The serious consideration from the late nineteenth century onward of a
subject such asdivergent series indicates how radically mathematicians have
revised their own conception of the nature of mathematics. whereas in the
first part of the nineteenth century they accepted the ban on divergent
series on the ground that mathematics was restricted by some inner require-
ment or the dictates ofnature to a fixed class of correct concepts, by the end
of the century they recognized their freedom to entertain any ideas that
seemed to offer any utility.
We may recall that divergent series were used throughout the eighteenth
century, with more or less conscious recognition of their divergence, because
they did give useful approximations to lunctions when only a lew terms were
,rt"d. Afti. the dawn of rigorous mathematics with Cauchy, most mathema-
ticians followed his dictates and rejected divergent series as unsound.
However, a few mathematicians (Chap. 40, sec. 7) continued to defend
divergent series because they were impressed by their usefulness, either for
the computation of functions or as analytical equivalents of the functions
from which they were derived. Still others defended them because they
l
were a method of discovery. Thus De Morgan says, "We must admit that
many series are such as we cannot at Present safely use, except as a means of
discovery, the results of which are to be subsequently verified and the most
l. Trans. Camb. Phil. Soc., 8, Part II, 1844, 182-203, pub' 1849'
ro96
togT
determined rejector of all divergent series doubtless makes this use of them
in his closet. . . ."
Astronomers continued to use divergent series even after they were
banished because the exigencies of their science required them for purposes
of computation. Since thc first few terms ofsuch series offered useful numerical
approximations, the astronomers ignored the fact that the series as a whole
were divergent, whereas the mathematicians, concerned with the behavior
not ofthe first ten or twenty terms but with the character ofthe entire series,
could not base a case for such series on the sole ground of utility.
However, as we have already noted (Chap. 40, sec. 7), both Abel and
Cauchy were not without concern that in banishing divergent series they
were discarding something useful. Cauchy not only continued to use them
(see below) but wrote a paper with the title "Sur l'emploi l6gitime des
s6ries divergentes," 2 in which, speaking of the Stirling series lor log I(r)
or logrn! (Chap. 20, sec. 4), Cauchy points out that the series, though
divergent for all values of x, can be used in computing log f (x) when x
is large and positive. In fact he showed that having fixed the number n of
terms taken, the absolute error committed by stopping the summation at
the nth term is less than the absolute value of the next succeeding term,
and the error becomes smaller as x increases. Cauchy tried to understand
why the approximation furnished by the series was so good, but failed.
The usefulness of divergent series ultimately convinced mathematicians
that there must be some feature which, if culled out, would reveal why they
furnished good approximations. As Oliver Heaviside put it in the second
volume of Electromagnetic Theory ( 1899), " I must say a few words on the
subject of generalized differentiation and divergent series. . . . It is not easy
to get up any enthusiasm after it has been artificially cooled by the wet
blanket of the rigorists. . .. There will have to be a theory of divergent
series, or say a larger theory offunctions than the pres:nt, including conver-
gent and divergent series in one harmonious whole." When Heaviside made
this remark, he was unaware that some steps had already been taken.
The willingness of mathematicians to move on the subject of divergent
series was undoubtedly strengthened by another influence that had gradually
penetrated the mathematical atmosphere: non-Euclidean geometry and the
new algebras. The mathematicians slowly began to appreciate that mathe-
matics is man-made and that Cauchy's definition of convergence could no
longer be regarded as a higher necessity imposed by some superhuman
power. In the last part ofthe nineteenth century they succeeded in isolating
the essential property of those divergent series that furnished useful approxi-
mations to functions. These series were called asymptotic by Poincar6, though
during the century they were called semiconvergent, a term introduced by
Legendre in his Essai de la thCorie dcs nombres (179S, p' 13) and used also for
oscillating series.
The-theory of divergent series has two major themes' The first is the
one already briefly described, namely, that some of these series may,
for a
fixed number of terms, approximate a function better and better as the
variable increases. In fact, Legendre, in his Traiti des fonctions elliptiques
(1825-28), had already characterized such series by the property that the
er.o, by stopping at any one term is ofthe order ofthe first term
"o--itt.d
omitted. The second theme in the theory of divergent series is the concept of
summability. It is possible to define the sum of a series in entirely new ways
that give finite sums to series that are divergent in Cauchy's sense'
I 6{r){"l,a\'r*
when s is large depends on the values of u(x) near its stationary points'
that
:
is, the valuei of * for which u'(r) 0' LaPlace used this observation to
Prove that
s! -ss+ltrc-s\/z;(, -*o*#,- ),
a result that can be obtained as well from Stirling's approximation to
log s !
where g may be complex, h and t are real, and * is large and positive; he
remarked that the main contribution to the integral comes from the im-
mediate neighborhood of those points in the range of integration in which
i(l) attains its absolute maximum. The contribution ol which Laplace
spoke is the first term of what we would call today an asymptotic series
evaluation of the integral .Il h(t) has just one maximum at r:4, then
Laplace's result is
f x) -
(
s ( a) e
* h
"' J:,*' r,,
as r approaches o.
If in place of (l) the integral to be evaluated is
(2) "f(4
: s(t)et,h(,> dt
['
when I and x are real and .r is large, then lrtrtrrtrl is a constant and Laplace's
method does not apply. In this case a method adumbrated by Cauchy in
his major paper on the propagation of waves,6 now called the principle of
stationary phase, is applicable. The principle states that the most relevant
contribution to the integral comes from the immediate neighborhoods of
the stationary points of h(t), that is, the points at which n'(t) : g. 1'5i.
principle is intuitively reasonable because the integrand can be thought of
as an oscillating current or wave with amplitude lg(r)1. If , is time, then the
velocity of the wave is proportional to xh'(t), and if n'(l) 10, the velocity
ofthe oscillations increases indefinitely as r becomes infinite. The oscillations
are then so rapid that during a full period g(t) is approximately constant
and xh(t) is approximately linear, so that the integral over a full period
vanishes. This reasoning fails at a value of I where h'(t) :0. Thus the
stationary points of h(t) are likely to furnish the main contribution to the
asymptotic value of/(x).If z is a value of t at which h'(t) : g^ 6h'(r) > O,
then
"f{4 - JZ-1'1,1
g(r)ctxhG)+t"t4
as * becomes infinite.
This principle was used by Stokes in evaluating Airy's integral (see
:
i. ,t (; .'\ a" f, - u ""'im' -
Nsir!2m',
where
I 1.3 1.3.5.7 r 1.3.5
M: -i---; * --------=-
mlf - m'n" m'n- -"'' "^/-----!...
- m3o2 m1t4
(4) t: I:.(se)',',',
d., . : (qo?)'tny
and obtained
d2w
__r
(5) )\2u : rut
dtz '
where
Liouville now argued that for sufficiently large values of I the first approxi-
mation to the solutions of (5) should be
(6) to N c7 cos l, + c, sin trr,
Ifthe values of zz and I given by (4) are now used to obtain the approximate
solution ol (3), we have from (6) that
for large lzl. This integral represents the strength of diffracted light near a
caustic. Airy had given a series for I'l in powers of m which, though conver-
gent for all rn, was not useful for calculation when lrzl is large. Stokes's
method consisted in forming a differential equation of which the integral
is a particular solution and solving the differential equation in terms of
divergent series that might be useful for calculation (he called such series
semiconvergent).
After showing that tJ : (tl2)\ta W satisfies the Airy differential equation
(l l) t!+iu:0,
dn' J
n: @'''
*,
Stokes proved that for positive z
where
(r3) R: l
I.5.7.II .l I.5.7.II.13.I7 19.23....
1.2.162.3n3 | .2'3 '4 ' l6a '32no
^ t .s 1 .5.7. 11 ' 13' 17
(14) - +" '
": 1.2.3:mretx3)3i'z
The quantities A and"r-u1rr,1*
B that make U yield the integral were determined
by a special argument (wherein Stokes uses the principle of stationary phase)'
He also gave an analogous result for z negative.
The series (12) and the one for negative z behave like convergent series
for some number of terms but are actually divergent. Stokes observed that'
they can be used for calculation. Given a value ofz, one uses the terms from
the first up to the one that becomes smallest for the value of n in question'
He gave a qualitative argument to show why the series are useful for
numerical work.
Stokes encountered a special difficulty with the solutions of (lt) for
positive and negative z. He was not able to Pass from the series for which z
is positive to the series for which n is negative by letting rl vary through 0
because the series have no meaning lot n :0. He therefore tried to pass
from positive to negative a through complex values ol z, but this did not
yield the correct series and constant multipliers.
What Stokes did discover, after some struggle,2o was that if for a certain
range of the amplitude of n a general solution was rePresented by a certain
iinear combination of two asymptotic series each of which is a solution, then
in a neighboring range of the amplitude of z it was by no ineans necessary
for the same linear combination of the two fundamental asymptotic expan-
sions to represent the same general solution. He found that the constants
of the linear combination changed abruptly as certain lines given by amp'
z : const. were crossed. These are now called Stokes lines.
Though Stokes had been primarily concerned with the evaluation of
integrals, it was clear to him that divergent series could be used generally
to solve differential equations. Whereas Euler, Poisson, and others had
solved individual equations in such terms, their results appeared to be
tricks that produced answers for specific physical problems. Stokes actually
gave several examples in the 1856 and 1857 papers.
The above work on the evaluation of integrals and the solution of
differential equations by divergent series is a sample of what was done by
many mathematicians and physicists.
.atdz,
.f (*) -*
-An+-+-=-l-"'
Iim
,+0 ,l [ro, - T''']
Though in the case of some asymptotic series one knows what error is
co..r-itteJ by stopping at a definite term, no such information about the
numerical ...o. i. k.ro*r, for general asymptotic series' However, asymptotic
series can be used to give rather accurate numerical results for large
x by
employing only those terms for which the magnitude of the terms decreases
as one'tak1s more and more terms. The order of the magnitude of
the error
at any stage is equal to the magnitude of the first term omitted'
.f(r)-oo+f,+ff+',
then
1o,
r, f(z) dz - C * aox + allogx - -r?-"' -!2
c"*x"(Ao.** )
and the series satisfy the differential equation formally' He also showed
that
to each such series there corresponds an exact solution in the form an of
integral to which the series is asymptotic'
-Poincar6's
results are included in the following theorem due to Jakob
Horn (1867-19't6).23 He treats the equation
(18) ,<"t 1 ar(x)!@ - 1) +.. .* a^(x) g" : o,
y, - strtrtso,>+, r : 1,2,.' ., n,
Birkhoff in this paper did not consider equation (lB) but the more general
system
IL
la11 - 6srl.l :0
has distinct roots. F{e gave asymptotic series solutions for the y, which hold
in various sectors ofthe complex plane with vertices at x : 0'
Whereas Poincar€'s and the other expansions above are in powers of
the independent variable, further work on asymPtotic series solutions of
differentll equations turned to the problem first considered by Liouville
(sec. 2) wherein a parameter is involved. A general result on this problem
was given by Birkhoff.26 He considered
for large lpl and for r in the interval [a, ]]' The functions a1(x, p) are supposed
analytic in the complex parameter p at- p : oo and have derivatives of all
ordeis in the real variable 'r. The assumptions on a1(x, p) imply that
ar(x, P) :2ourrro-'
l=o
and that the roots of wr(x), wz(x), ..., unlc) of the characteristic equation
u" + an-t,o(x)un-r + "'* as.(x) : o
are distinct lor each x. He proves that there are z independent solutions
zr(x, P),' ' ', zo@, P)
where
where r ranges over [a, D]. The WKBJ approximation for large ), in view
of (7), gives two solutions for x > 0 and two fior x < 0' It breaks down at
the value ofx where 4 : 0. Such a point is called a transition point, turning
point, or Stokes point. The exact solutions ol (23) are, however, finite at
such a point. The problem is to relate the WKBJ solutions on each side ol
the transition point so that they represent the same exact solution over the
interval la, bl in which the differential equation is being solved' To put
the problem more specifically, consider the above equation in which q(r)
is real for real x and such that q(*) : 0, q'(x) * 0 for r : 0' suppose also
that q(x) is negative for x positive (or the reverse). Given a linear combination
of the two WKBJ solutions for * < 0 and another for .r > 0, the question
arises as to which ofthe solutions holding for x > 0 should be joined to that
for .r < 0. Connection formulas provide the answer.
27
The scheme for crossing a zero of 4('r) was first given by Lord Rayleigh
and extended by Richard Gans (lBB0-1954),28 who was familiar with
Rayleigh's work. Both ol these men were working on the propagation of
light in a varying medium.
27. Proc. Roy. Soc., A 86, 1912, 207-26 : Sci. Papcrs,6, 7l-90'
28. Annalen der Phys., (4), 47, 1915, 709-36.
SUMMABILITY r r09
(24)
ffi + \'x(x)s :0,
where ris real, I is large and real, and X(x) has a simple zero a\ say,
r : 0. He derived formulas connecting the asymptotic series solutions of
(24) for ,r > 0 and .t < 0 by using an approximating equation fot (24),
namely,
which replaces X(x) in Q\ by a linear function in r' The solutions of (25) are
(26) Yr(x) : xrt2Jrtrc(t)
where ( : (213)\xsr2. The asymptotic expansions for large r of these solu-
tions can be used to join the asymptotic solutions of (24) on each side of
l : 0. There are many details on the ranges of x and that must be con-
sidered in the joining process but we shall not enter into^ them. Extension of
the work on connection formulas for the cases where X(x) in (24) has
multiple zeros or several distinct zeros and for more complicated second
order and higher-order equations and for complex x and ,\ has appeared in
numerous paPers.
The theory of asymPtotic series, whether used for the evaluation of
integrals or the approximate solution of differential equations, has been
extended vastly in recent years. What is especially worth noting is that the
mathematical development shows that the eighteenth- and nineteenth-
century men, notably Euler, who perceived the great utility of divergent
series and maintained that these series could be used as analytical equivalents
of the functions they represented, that is, that operations on the series
corresponded to operations on the functions, were on the right track. Even
though these men failed to isolate the essential rigorous notion, they saw
intuitively and on the basis of results that divergent series were intimately
related to the functions they represented.
4. Summability
The work on divergent series described thus far has dealt with finding
asymptotic series to represent functions either known explicitly or existing
implicitly as solutions of ordinary differential equations. Another problem
that mathematicians tackled from about lBB0 on is essentially the converse
ol finding asymptotic series. Given a series divergent in Cauchy's sense,
29. Proc. Lordon Math. Soc., (2),23, 1922-24,428-36.
DT\IERGENT SERIES
IIIO
can a .,sum" be assigned to the series ? If the series consists of variable
terms this.,sum,, would be a function for which the divergent series might
or might not be an asymptotic expansion. Nevertheless the function might
be tafen to be the " sum " of the series, and this " sum " might serve some
useful purposes' even though the series will certainly not converge to
it
o.rrrryrrotbeusabletocalculateapproximatevaluesofthefunction'
To some extent the problem of summing divergent series was actually
undertakenbeforeCauchyintroducedhisdefinitionsofconvergenceand
divergence.Themathematiciansencountereddivergentseriesandsought
*ro,rt, them much as they did fior convergent series, because the distinction
between the two types was not sharply drawn and the only question was'
What is the appropriate sum ? Thus Euler's principle (Chap' 20, sec' 7) that
a power se.ies e*parrion ofa function has as its sum the value ofthe
function
from which the series is derived gave a sum to the series even for values of
.r for which the series diverges in Cauchy's sense' Likewise, in his transfor-
mation ofseries (Chap. 20, 4), he converted divergent series to convergent
sec.
ones without doubting that there should be a sum for practically
all series'
However,afterCauchydidmakethedistinctionbetweenconvergenceand
divergence, the problem of summing divergent series was broached on a
differint level. The relatively naive assignments in the eighteenth century of
sums to all series were no longer acceptable. The new definitions prescribe
what is now called sammabititg, to distinguish the notion from convergence
in Cauchy's sense.
Wittr hindsight one can see that the notion of summability was really
what the eighte;th- and early nineteenth-century men were advancing'
This is whaiEuler's methods of summing just described amount to' In fact
in his letter to Goldbach of August 7, 1745, in which he asserted that the
sum of a power series is the value of the function from which the series
is
that every series must have a sum, but since the
derived, Euler also asserted
word sum implies the usual process of adding, and this process does not
lead to the sum in the case ofa divergent series such as I - 1! + 2! - 3!
*..., we should use the word value for the "sum" ofa divergent series'
Poisson too introduced what is now recognized to be a summability
notion. Implied in Euler's definition of sum as the value of the function from
which the series comes is the idea that
(27)
fo- : lim S
x-1_ .L
,r=o
a-r".
"
(28)
?-2 (ancos no * bo sin zd),
(2s)
?. > (an cos n0 + bnsin nl)r"
and define the sum of (28) to be the limit of the series (29) as r approaches
I from below. Of course Poisson did not appreciate that he was suggesting a
definition of a sum of a divergent series because, as already noted, the
distinction between convergence and divergence was in his time not a
critical one.
The definition used by Poisson is now called Abel summability because
it was also suggested by a theorem due to Abel,31 which states that if the
power series
f(*) : S a-.rn
"
" =t,
has a radius of convergence r and converges for *: r, then
)o-:,l1is":,1*u*#
if this limit inus for the series I - I + I - I + "', the S, have the
"**o
values l, 112,213,214,315, ll2, 417, l/2,. '. so that limn-- S" : l/2' If
) ao converges, then Frobenius's "sum" gives the usual sum. This idea of
averaging the partial sums of a series can be found in the older literature'
It was used for special types of series by Daniel Bernoulli3s and Joseph L'
Raabe (t B0l -59).34
Shortly after Frobenius published his paper, H<ilder3s produced a
generalization. Given the series ) 4,,, let
slo) : s,
,l', : # (s[o> a 5!or +...+ r[o))
I
"rrl
:
, *1 (sf;-t) + rf -') +...
i/(9-1) +.r(r-1) * "' +* ra
J(t-l))-
)'
32. Jour. far Math., 89, 1880, 26244 : Ges. Abh., 2,8-10.
33. Comm. Acad. Sci. Petrop., 16, 1771,7l-90.
34. Jour. fiir Math., 15, 1836, 355-64.
35. Math. Ann.,20, 1882, 535-49.
SUMMABILITY I I 13
(32) , : t,j:,f'
if this limit exists for some r. H<ilder's definition is now known as summability
(H,r)-
Hcilder gave an example. Consider the series
I
-O .T: -l + 2x - 3x2 + 4x3 -""
This series diverges when r : l. However fot x : l,
Jo: -lrsr: l,s:: -2, ss :2,s+: -3'" "
Then
, : ,l'* DF,
'C(t)
(33) rintegral and > 0,
where
l)
r(r + r(r + l)...(r+n-l)
SY):sn + ffn-r + - O sn-z*" '* -
ro
and ----;i"--
Dr,: (r + l)(r + 2)"'(r + z)
nl
r'fl+v:,,
for which he obtained the integral solution
f- xe-t
(36) s: Jo T1;tdt.
Then by using rules he derived for converting convergent series into con-
tinued fractions Euler translormed (35) into
(37)
t( x x 2x 2x 3x 3x
ITTT TTITiTiT I+
This work contains two features. On the one hand, Euler obtained an
integral that can be taken to be the "sum" of the divergent scries (35) ;
the latter is in fact asymptotic to the integral. On the other hand he showed
how to convert divergent series into continued fractions. In fact he used the
continued fraction for x : I to calculate a value lor the series (34).
There was incidental work of this nature during the latter part ol the
eighteenth century and a good deal of the nineteenth, of which the most
noteworthy is due to Laguerre.al He proved, first of all, that the integral
38. Noui Comm. Acad. Sci. Petrop.,5, 175415,205-37, pub. 1760 :Opera, (l), 14, 585 617,
and Nooa Acta Acad. Sci. Petrop.,2, 1784,36-45, pub. 1788 : Opera, (l), 16, 34-43.
39. Nooi Comtn. Acad. Sci. Petrop.,5, 175415,205 37, pub. 1760 = Opera, (l), 14, 585 617.
40. Euler's Opera Poslhuma, l, 545 49.
4l- Bull. Soc. Math. de France, T, l8?9, 72-Bl : CEuures, l,428-37.
rr15
(36) could be expanded into the continued fraction (37). He also treated
the divergent series
(38) I + .r + 2l xz + 3!x3 +...
Since
Jo
f- ,-.,
f* ,-. d" + * Jo dz * xz f- ,-"", dz * . . ..
Jo
or
(40)
lllll I
alz* a2* a|z+ aa+ asz+ a2n* a2nq lzj
where the ao are positive real numbers and z is complex. He then shows
that when the series Zf,=, ao diverges, the continued fraction (40) converges
42, " Recherches sur quelques sdries semi-convergentes," Ann. de l'Ecolc Norn. Sup.' l3),
3, 1886, 201-58 : CEwres complites,2,2 58.
43. Ann. Fac. Sci. de Toulouse, S, 1894, J. l-122, and 9, 1895, A. l-47 : CEuures amPlltes,
2,402-5s9.
IIT6 DfVERGENT SERIES
to a function F(z) which is analytic in the complex Plane except along the
negative real axis and at the origin, and
(41)
When ) a,n converges, the even and odd partial sums of (40) converge to
distinct limits tr'r(z) and F2(z) where
Fr(z):l:ffi, Fr(z):l:*
Now it was known that the continued fraction (.10) could be formally
developed into a series
(42)
CO C',C' C3,
zz2'z?24'
with positive C,. The correspondence (with some restrictions) is also recip-
rocal. To every series (42) there corresponds a continued fraction (40)
with positive an. Stieltjes showed how to determine the C, from the an and
in the case where ) an is divergent he showed that the ratio Cn/Cn-1 increases.
If it has a finite limit ,\, the series converges for lzl > l, but if the ratio
increases without limit then the series diverges for all z,
The relation between the series (42) and the continued fraction (40)
is more detailed. Although the continued fraction converges if the series
does, the converse is not true. When the series (42) diverges one must
distinguish two cases, according as ) an is divergent or convergent. In the
former case, as we have noted, the continued fraction gives one and only
one functional equivalent, which can be taken to be the sum ofthe divergent
series (42). When ) a,, is convergent two different functions are obtained
from the continued fraction, one from the even convergents and the other
from the odd convergents. But to the series (42) (now divergent) there
corresponds an infinite number of functions each of which has the series as
its asymptotic development.
Stieltjes's results have also this significance: they indicate a division
of divergent series into at least two classes, those series for which there is
properly a single functional equivalent whose expansion is the series and
those for which there are at least two functional equivalents whose expansions
are the series. The continued fraction is only the intermediary between
the series and the integral; that is, given the series one obtains the integral
through the continued fraction. Thus, a divergent series belongs to one or
more functions, which functions can be taken to be the ium of the series
in a new sense of sum.
Stieldes also posed and solved an inverse problem. To simplify the
SUMMABILITY ttrT
statement a bit, let us suppose that $(u) is differentiable so that the integral
(41) can be written as
Hence knowing the C, one must determine .rf(z) satisfying the infinite set
of equations (43). This is what Stieldes called the " problem of moments."
It does not admit a unique solution, lor Stieltjes himself gave a function
f (u) : e- {n sin {i
which makes C, : 0 for all n. lf the supplementary condition is imposed
that f(u) shall be positive between the limits of integration, then only a
singlel'(a) is possible.
The systematic development of the theory of summable series begins
with the work ofBorel from lB95 on. He first gave definitions that generalize
Cesiro's. Then, taking off from Stieltjes's work, he gave an integral defini-
tion.aa If the process used by Laguerre is applied to any series of the form
(44) ao + ag * arx2 1-. -
having a finite radius ofconvergence (including 0), we are led to the integral
(4s)
where
r, - "F (zx) dz
e-'F(zx) dz
)o
is absolutely convergent and the successive integrals
J"' -l a2 a", ^ :
'lu^F,(?| 1, 2, "'
|
have a sense. Borel then shows that a divergent series, ifabsolutely summable,
can be manipulated precisely as a convergent series' The series, in other
words, represents a function and can be manipulated in place ofthe function.
Thus the sum, difference, and product of two absolutely summable series
is absolutely summable and represents the sum, difference, and product
respectively of the two functions represented by the separate series. The
analogous llact holds for the derivative of an absolutely summable series.
Moreover the sum in the above sense agrees with the usual sum in the case
of a convergent series, and subtraction of the first,t terms reduces the "sum"
ol the entire series by the sum of these ,t terms' Borel emphasized that any
satisfactory definition of summability must Possess these properties, though
not all do. He did not require that any two definitions necessarily yield the
same sum.
These properties made possible the immediate application of Borel's
theory to diflerential equations. If, in fact,
P(*,Y'Y'," ',gott) : o
is a diflerential equation which is holomorphic in x at the origin and is
algebraic in y and its derivatives, any absolutely summable series
ds*d.1ltIarx2+--.
which satisfies the differential equation formally defines an analytic function
that is a solution of the equation. For example, the Laguerre series
I + , + 2l x2 + 3!xs + .. .
rrrg
satisfies formally
,,*^+(x-l)u:-1.
and so the function (cf. (39))
J@) : f- e-z dz
)o 7-=-u
must be a solution of the equation.
Once the notion of summability gained some acceptance, dozens of
mathematicians introduced a variety of new definitions that met some or
all of the requirements imposed by Borel and others. Many of the definitions
of summability have been extended to double series. Also, a variety of
problems has been formulated and many solved that involve the notion of
summability. For example, suppose a series is summable by some method'
What additional conditions can be imposed on the series so that, granted
its summability, it will also be convergent in Cauchy's sense ? Such theorems
are called Tauberian after Alfred Tauber (b. 1866). Thus Tauber proveda6
that if ) a" is Abelian summable to s and nan approaches 0 as z becomes
infinite, then ) a,, converges to J,
The concept of summability does then allow us to give a value or sum
to a great variety of divergent series. The question of what is accomplished
thereby necessarily arises. If a given infinite series were to arise directly
in a physical situation, the appropriateness of any definition of sum would
depend entirely on whether the sum is physically significant, just as the
physical utility of any geometry depends on whether the geometry describes
physical space. Cauchy's definition ofsum is the one that usually fits because
it says basically that the sum is what one gets by continually adding more
and more terms in the ordinary sense. But there is no Iogical reason to
prefer this concept ofsum to the others that have been introduced. Indeed
the representation of functions by series is greatly extended by employing
the newer concepts. Thus Leopold Fej€r (1880-1959), a student of H. A'
Schwarz, showed the value of summability in the theory of Fourier series.
In I904a6 Fej6r proved that if in the interval [-4 r), f(x) is bounded and
(Riemann) integrable, or if unbounded the integral I!,-f@ dr is absolutely
convergent, then at every point in the interval at which /(r + 0) and
"f(* - O) exist, the Frobenius sum of the Fourier series
4o-.S (ao cos nx + 6n sin z*)
Bibliograplry
Borel, Emile: Nolice sur les lraoaux scientifques de M. Emile Borel,2nd ed., Gauthier-
Villars,
1921.
Legons sur les siries diuergentes, Gauthier-Villars, 1901.
--2
Burkhardt, H.: " Trigonome trische Reihe und Integrale," Encyk. der Math. Wiss,,
B. G. Teubner, 190.t-16, II Al2,819 1354.
"Uber den Gebrauch divergenter Reihen in der Zeit von 1750-1860,"
Math. Ann.,70, 19l l, 169-206.
Carmichael, Robert D.: " General Aspects of the Theory of Summable Series,"
Amer. Math. Soc. Bull.,25, l9l8/19, 97-131.
Collingwood E. F.: "Emile Borel," Jour. Lon. l[ath. Soc,,34, 1959, 488 512.
Ford, W. B.: "A Conspectus of the Modern Theories of Divergent Series," Amer.
Math. Soc. Bull.,25, l9l8/19, l-15.
BIBLIOGRAPI{Y II2I
Hardy, G. H.: Diuergent Srrias, Oxford University Press, 1949. See the historical
notes at the ends of the chapters.
Hurwitz, W. A.: "A Report on Topics in the Theory of Divergent Series," Amer.
Math. Soc. Bull., 28, 1922, 17 36.
Knopp, K.: " Neuere lJntersuchungen in der Theorie der divergenten Reihen,"
Jahres. der Deut. Math.-Verein., 32, 1923, 43-67 .
Langer, Rudolf E. : " The Asymptotic Solution of Ordinary Linear Differential
Equations of the Second Order," Amer. Math. Soc. Bull., 40, 193+, 545-82.
McHugh, J. A. M.: "An Historical Survey of Ordinary Linear Differential
Equations with a Large Parameter and Turning Points," Archiae for History
oJ Exact Sciences, T, 1971,277-324.
Moore, C. N.: "Applications of the Theory of Summability to Developments in
Orthogonal Functions," Amer. Math. Soc. Bull.,25, l9l8/19,258-76.
Plancherel, N{ichel: "Le DCveloppement de la thdorie des sCries trigonomCtriques
dans le dernier quart de sitcle," L'Enseignement Mathdmatiqut, 24, 1924125,
l9-58.
Pringsheim, A.: " Irrationalzahlen und Konvergenz unendlicher Prozesse," Encyk.
der Math. Wiss., B. G. Teubner, 1898-1904, IA3, 47-146.
Reiff, R.: Geschichte der unendlhhen Reihen, H, Lauppsche Buchhandlung, 1889,
Martin Sandig (reprint), 1969.
Smail, L. L.: History and Sgnopsis of the Theory of Summable Inf,nite Processes, University
of Oregon Press, 1925.
Van Vleck, E. B.: " Selected Topics in the Theory of Divergent Series and Con-
tinued Fractions," The Boston Colloquium of the Amer. Math. Soc., 1903,
Macmillan, 1905, 75-187.
48
Tensor Analysis and Differential
Geometry
l. Bull. des Sci. Math., (2), 16, 1892, 167-89 : Open, 1,288-310.
tr22
THE NOTION OF A TENSOR r r23
Nine years later Ricci and his famous pupil Tullio Levi-Civita (1873-
l94l ) collaborated on a comprehensive paper, " Methods of the Absolute
Differential Calculus and Their Applications." 2 The work of Ricci and
Levi-Civita gave a more definitive formulation of this calculus. The subject
became known as tensor analysis after Einstein gave it this name in 1916.
In view of the many changes in notation made by Ricci and later by Levi-
Civita and Ricci, we shall use the notation which has by now become rather
standard.
(r) f e,a*'
is a diflerential invariant under transformations ofl the form
(2) x' : -f,(Yr,Y',...'!").
It is assumed that the functions.fl possess all necessary derivatives and that
the transformation is reversible, so that
(3) yt : gi(x', x2, . . ., x").
Under the transfiormation (2) the expression (l) becomes
(+) 5
L
A,(r',y',...,!")dvi.
However, ,4-, does not equal lr. Rather
.. : aA ?xt ox2+...+ AnW
o a*n
(5) o,
A!,- a,W * ^rW
wherein it is understood that the riin the l, are replaced by their values in
terms of the yi. Thus the A, can be related to the /4i by the specific law of
translormation (5) wherein the first derivatives of the translormation are
involved.
Ricci's idea was that instead of concentrating on the invariant differen-
tial form ( I ), it would be sufficient and more expeditious to treat the set
Ab A2r' ' ', An
and to call this set of components a tensor provided that, under a change of
coordinates, the new set of components
Ab 42,' ' ', An
are related to the original set by the law of transformation (5). It is this
explicit emphasis on the set or system of functions and the law of trans-
formation that marks Ricci's approach to the subject of differential invari-
ants. The set of 17, which happen to be the components of the gradient of
the scalar function l, is an example ofa covariant tensor of rank 1. The
notion of a set or system of functions characterizing an invariant quantity
was in itself not new because vectors were already well known in Ricci's
time; these are represented by their comPonents in a coordinate system and
are also subject to a law of transformation if the vector is to remain, as it
should be, invariant under a change of coordinates. However, the new
systems that Ricci introduced were far more general and the emphasis on
the law of transformation was also new.
As another example of the point of view introduced by Ricci, let us
consider the expression for the element of distance. This is given by
Under change of coordinates the value of the distance /s must remain the
same on geometrical grounds. However, if we perform the transformation
(2) and write the new expression in the form
-a
: Z
(7) d" G,,dYt ds',
then . . . , x") will not equal G,,(g', . .. , y') (when the values
gi(*r, of the yt
represent the same point as the.rtdo). What does hold is that
(8)
"': ,2,*# *u"
when the xr in the g,, are replaced by their values in terms of the yr. To see
that (8) holds we have but to replace dxr in (6) by
a*,:* 4a,*-
?roY-
and. N in (6) by
a*, :* 4a,,
?-,0s"
and extract the coefficient of dyh dgt. Thus, though Gn, is not at all g*r, we
do know how to obtain Gu, from the gr,. The set of z2 coefficients g,, of the
THE NOTION OF A TENSOR r r25
then
Ifwe now regard the tlxk as a set of quantities constituting a tensor, then we
see that, of course, dyi A dx!, but we can obtain the dy' from the d,t' by the
law of transformation illustrated in (10). The set of elements dxk is called a
contravariant tensor of rank l, the term contravariant pointing to the
presence of the 1yt lLxk in the transformation, as opposed to the derivatives
AxtlOgr which appear in (8) and (5)' Thus the very differentials of the
transformation variables form a contravariant tensor of rank l.
Correspondingly, we can have a tensor ofrank 2 that transforms contra-
variantly in both indices. If the setof functions Akt(xr, x2,..., x"), k, l:
1,2,. . .,n, transform under (9) so that
* og'alt *,
(l l) "zrn-
- .L oxk oxt" '
^,
: i s,"^"
ar:)er:,
wherein on the right side we add the components. We could show that the
set of quantities.Bf is a tensor ofrank 2, covariant in the index i and contra-
variant in the index i.
To sum up, a tensor is a set of functions (components), fixed relative
to one frame of reference or coordinate system, that transform under change
of coordinates in accordance with certain laws. Each component in one
coordinate system is a linear homogeneous function of the components in
another coordinate system. If the components of one tensor equal those of
another when both are expressed in one coordinate system, they will be equal
in all coordinate systems. In particular, if the components vanish in one
system they vanish in all. Equality oftensors is then an invariant with respect
to change of reference system. The physical, geometrical, or even purely
mathematical significance which a tensor possesses in one coordinate system
is preserved by the transformation so that it obtains again in the second
coordinate system. This property is vital in the theory of relativity, wherein
each observer has his own coordinate system. Since the true physical
laws are those that hold for all observers, to reflect this independence of
coordinate system these laws are expressed as tensors.
With the tensor concept in hand, one can re-express many of the
concepts ol Riemannian geometry in tensor form. Perhaps the most im-
portant is the curvature of the space. Riemann's concept of curvature
(Chap. 37, sec. 3) can be formulated as a tensor in many ways. The modern
COVARIANT DIFFERENTIATION tt27
s,t;,:) s,'tt.
l=r
In this notation the curvature tensor is (cf. [20] of Chap. 37)
wherein the brackets denote Christoffel symbols of the first kind and the
braces, symbols of the second kind. Either form is now called the Riemann-
Christoffel curvature tensor. By reason of certain relationships (which we
shall not describe) among the components, the number of distinct com-
ponents of this tensor is n2 (n2 - l)/12. For n : 4, which is the case in the
general theory of relativity, the number of distinct components is 20. In a
two-dimensional Riemannian space there is just one distinct component'
which can be taken to be rRlrrr. In this case Gauss's total curvature ( proves
to be
x : !,,*,
e
where g is the determinant of the gi1 or €rgzz - g!2. If all comPonents
vanish the space is Euclidean.
From the Riemann-Christoffel tensor Ricci obtained by contraction
what is now called the Ricci tensor or the Einstein tensor. The comPonents
R,, are )!=1Rhk. This tensor for n : 4 was used by Einsteins to express the
curvature of his space-time Riemannian geometry.
3. Coaariant Dffirentiation
Ricci also introduced into tensor analysisa an operation that he and Levi-
Civita later called covariant diflerentiation. This operation had already
appeared in Christoffel's and Lipschitz's work.s Christoffel had given a
method (Chap. 37, sec. 4) whereby from differential invariants involving
3. Zeit. ffir Math. und Phys.,62, l9l4' 225-61.
4. Atti. della Accad. dei Lincei, Rendiconti, (4), 3, 1887, 15-18 : O\ere, l, 199-203.
5. Jour. filr Math.,70, 1869, 46-70 and 24145, and 7l-102.
I I28 TENSOR ANALYSIS AND DIFFERENTIAL GEOMETRY
A:,:#+)e,st,;1,
t=1
and this is a mixed tensor of the second order. For the mixed tensor with
components A! the avariant derivative is
Il one takes the derivatives with respect to the coordinates, r and 0 say, of
these components, then the rate of change expressed by these derivatives
reflects the change in the components caused by the coordinate system and
not by any change in the vectors themselves. The coordinate systems used in
Riemannian geometry are curvilinear. The effect ol the curvilinearity of the
coordinates is given by the Christoffel symbol of the second kind (here
denoted by braces). The full covariant derivative ofa tensor gives the actual
rate ofchange ofthe underlying physical or geometrical quantity represented
by the original tensor as well as the change due to the underlying coordinate
system.
In Euclidean sPaces, where the ds2 can always be reduced to a sum of
squares with constant coefficients, the covariant derivative reduces to the
o.dirr".y derivative because the christoffel symbols are 0. Also the covariant
derivative ol each g,, in a Riemannian metric is 0. This last fact was proved
by Ricci 6 and is called Ricci's lemma.
The concept of covariant differentiation enables us to exPress readily for
tensors generalizations of notions already known in vector analysis but now
treatabl-e in Riemannian geometry. Thus il the Ar(xr, tc2 , ' ' ' ', x") are the
components of an z-dimensional vector ,4, then
fl
(14) o : ) e"Att,
where gtl has been defined above, is a differential invariant' When the
fundamlntal metric is a rectangular Cartesian coordinate system (in
Euclidean space), the constants grl : 0 except when i : /' and in this case
thecovariantandordinaryderivativesareidentical.Then(14)becomes
0:>#,
and this is the z-dimensional analogue in Euclidean space of what is called
the divergence in three dimensions. Hence (14) is also called the divergence
of the ten;or with components 1,. One can also show, by using (14), that if
A is a scalar point function then the divergence of the gradient of u4 is
given by
n r- I S 2
oxl di
( 15) "2,, - ,2 z< A\,
where
* az
AI: ,L 6,,, ?xt S ,,,2.
- L" "'',
6. Atti delta Accad. dei Linczi, Rendiconti' (4),5' 1889, I 12-18 : OPerc, 1,268_75-
r r3() TENSOR ANALYSIS AND DIFFERENTIAL GEOMETRY
This expression for Arl is also Beltrami's expression for L2A in a Riemannian
geometry (Chap. 37, sec.5).
Though Ricci and Levi-Civita devoted much of their l90l paper to
developing the technique of tensor analysis, they were primarily concerned
to find differential invariants. They posed the following general problem:
Given a positive differential quadratic form / and an arbitrary number of
associated functions S, to determine all the absolute differential invariants
that one can form from the coefficients of{, the functions S, and the deriva-
tives of the coefficients and functions up to a definite order zz. They gave a
complete solution. It is sufficient to find the algebraic invariants of the system
consisting of the fundamental differential quadratic form {, the covariant
derivatives of any associated functions S up to order m, and, for m > l, a
certain quadrilinear form Gn whose coemcients are the Riemann expressions
(ih,jk) and its covariant derivatives up to order m - 2.
They conclude their paper by showing how some partial differential
equations and physical laws can be expressed in tensor lorm so as to render
them independent of the coordinate system. This was Ricci's avowed goal.
Thus tensor analysis was used to express the mathematical invariance of
physical laws many years before Einstein used it for this purpose.
4. Parallel Displanment
From l90l to 1915, research on tensor analysis was limited to a very small
group of mathematicians. However, Einstein's work changed the picture.
Albert Einstein (1879-1955), while engaged as an engineer in the Swiss
patent office, greatly stirred the scientific world by the announcement of his
restricted or special theory of relativity.T In l9l4 Einstein accepted a call to
the Prussian Academy of Science in Berlin, as successor to the celebrated
physical chemist Jacobus Van't Hoff (1852-l9l l). Two years later he
announced his general theory of relativity.s
Einstein's revolutionary views on the relativity of physical phenomena
aroused intense interest among physicists, philosophers, and mathematicians
throughout the world. Mathematicians especially were excited by the nature
of the geometry Einstein found it expedient to use in the creation of his
theories.
The exposition of the restricted theory, involving the properties offour-
dimensional pseudo-Euclidean manifolds (space-time), is best made with the
aid ofvectors and tensors, but the exposition ofthe general theory, involving
properties of four-dimensional Riemannian manifolds (space-time), demands
the use of the special calculus of tensors associated with such manifolds.
7. Annalen der Phys., 17, 1905, 891-921; an English translation can be found in the Dover
edition ofA. Einstein, The Priwiple of Relatiaily (1951).
8, Atnahn der Plgsik,49, 1916, 769-822.
PARALLEL DISPLACEMENT r r3r
Fortunately, the calculus had already been developed but had not as yet
attracted particular notice from physicists'
Actually Einstein's work on the restricted theory did not use Riemannian
geometry or tensor analysis's But the restricted theory did not involve the
action of gravitation. Einstein then began to work on the problem of dis-
pensing with the force ofgravitation and accounting for its effect by imposing
a strrrciu.e on his space-time geometry so that objects would automatically
move along the same paths as those derived by assuming the action of the
gravitational force. In l9l I he made public a theory that accounted in this
for a gravitational lorce that had a constant direction throughout
^u.r.r..
space, knowing of course that this theory was unrealistic. Up to this time
Einstein had used only the simplest mathematical tools and had even been
suspicious of the need for " higher mathematics," which he thought was
often introduced to dumbfound the reader. However, to make Progress on
his problem he discussed it in Prague with a colleague, the mathematician
Georg Pick, who called his attention to the mathematical theory of Ricci and
Levi-Civita. In Zurich Einstein found a friend, Marcel Grossmann ( l87B-
1936), who helped him learn the theory ; and with this as a basis, he succeeded
in formulating the general theory of relativity.
To represent his four-dimensional world of three space coordinates and
a fourth coordinate denoting time, Einstein used the Riemannian metric
wherein .r4 represents time. The gi, were chosen so as to reflect the presence
of matter in various regions. Moreover, since the theory is concerned with
the determination of lengths, time, mass, and other physical quantities by
different observers who are moving in arbitrary fashion with respect to each
other, the "points" of space-time are represented by different coordinate
systems, one attached to each observer. The relation of one coordinate
system to another is given by a transformation
9. The metric is ds2: dxz + dy2 + dzz - c2 tlt2. T}l.is is a space of constant curvaturc'
Any section by a ptane , : const. is Euclidean.
t t32 TENSOR ANALYSIS AND DIFFERENTIAL GEOMETRY
10. Rendiconti del Circolo Matematico di. Palermo, 42, l9l7, 173-205.
ll. Math. Ann.,78, 1918, 187-217.
CENERALIZATIONS OF RIEMANNIAN GEOMETRY I I33
P tangent to the surface; but if{ is the coJatitude of P, it will make an angle
2r - with the original vector.
2tr cos {
Ifone uses the general definition of parallel displacement along a curve
of a Riemannian space, he obtains an analytic condition. The differential
equation satisfied by the components X@ of a contravariant vector under
parallel transler along a curve is (summation understood)
(17)
# * rUr,4xaff:0, u:1,2,...,n,
wherein it is presupposed that the ut(t), i : 1,2,..., z, define a curve. For
a covariant vector Xo the condition is
( l8)
ff - ra,n*,ff :0, a : 1,2,. . ., n.
These equations can be used to define parallel transfer along any curve C.
The solution determined uniquely by the values of the components at a
definite P is a vector having values at each point of C and by definition
parallel to the initial vector at P. Equation (lB) states that the covariant
derivative ofX, is 0.
Once the notion ofparallel displacement is introduced, one can describe
the curvature of a space in terms of it, specifically in terms of the change in
an infinitesimal vector upon parallel displacement by infinitesimal steps.
Parallelism is at the basis ofthe notion of curvature even in Euclidean space,
because the curvature of an infinitesimal arc depends on the change in
direction of the tangent vector over the arc.
(20)
d'r' - S ,, dx^ dxu : n i: 1,2,. . ., n,
ds2 ' L-n'ds ds
12. Mathematische kitschrift,2, 1918, 38't-41 I : Ges. Abh., 2' l-28:
13, Proccedings of thz National Academy qf Scierucs, 8, 1922, 19-23.
BIBLIOGRAPHY I I35
with li,, : fl,,r, a family of curves called paths. These are the
defines
geodesics of the geometry. (In Riemannian geometry equations (20) are
precisely those for geodesics.) Given the geodesics in the sense just described,
one can build up the geometry of paths in a manner quite analogous to that
pursued in Riemannian geometry.
A different generalization of Riemannian geometry is due to Paul
Finsler (l89,t-) in his (unpublished) thesis of l9l8 at Gdttingen. The
Riemannian d.r2 is replaced by a more general function F (x, dx) of the
coordinates and differentials. There are restrictions on F to insure the
possibility of minimizing the integral J F @, (dxldt)) dt and obtaining thereby
the geodesics.
The attempts to generalize the concept of Riemannian geometry so as to
incorporate electromagnetic as well as gravitational phenomena have thus
far failed. However, mathematicians have continued to work on the abstract
geometries.
Bibliography
Cartan, E.: "Les rdcentes gdndralisations de la notion d'espace," Bull. des Sci.
M ath., 48, 1924, 294-320.
Pierpont, James: " Some Modern Views of Space," Amer. Math, Soc. Bull., 32, 1926,
225-58.
Ricci-Curbastro, G.: Opere,2 vols., Edizioni Cremonese, 1956-57.
Ricci-Curbastro, G., and T. Levi-Civita: " M6thodes de calcul diff6rentiel absolu
et leurs applicatiots," Math. Ann.,54, 1901, 125-201.
Thomas, T. Y. : " Recent Trends in Geometry," Amer. Math. Soc. ScmicenEnnial
Publications II, 1938, 98 135.
Weatherburn, C. E.: The Deoelo|ment of Multidimcnsional Difcrcntial Geomctry,
Australian and New Zealand Association for the Advancement of Science,
2t, t933, t2-28.
Weitzenbock, R,: " Neuere Arbeiten der algebraischen Invariantentheorie.
Differentialinvarianten," Eruyk. der Math. Wiss., B. G. Teubner, l9O2-27,
III, Part III, El, l-71.
Weyl, H.: Mathematische Analgse des Raumproblems (1923), Chelsea (reprint), 1964.
+9
The Emergence of Abstract Algebra
Perhaps I may without immodesty lay claim to the appella-
tion of Mathematical Adam, as I believe that f have given
more names (passed into general circulation) of the creatures
of the mathematical reason than all the other mathematicians
of the age combined. J. J. SYLVESTER
r r36
ABSTR,ACT GROUP TIIEORY 1137
properties, the foremost of .these being that the operation applied to two
elements of the set produces a third element of the set' The same advantages
could be achieved for the various collections that formed rings and fields.
Though the idea olworking with abstract collections preceded the axiomatics
of Pasch, Peano, and Hilbert, the latter development undoubtedly acceler-
ated the acceptance of the abstract approach to algebras.
Thus arose abstract algebra as the conscious study of entire classes of
algebras, which individually were not only concrete but which served
purposes in specifi.c areas as substitution groups did in the theory ofequations.
The advantage of obtaining results that might be useful in many specific
areas by considering abstract versions was soon lost sight of, and the study of
abstract structures and the derivation of their properties became an end in
itself.
Abstract algebra has been one of the favored fields of the twentieth
century and is now a vast area. We shall present only the beginnings of the
subject and indicate the almost unlimited opportunities for continued
research. The great difficulty in discussing what has been done in this field is
terminology. Aside from the usual difficulties, that different authors use
different terms and that terms change meaning from one period to another,
abstract algebra is marked and marred by the introduction of hundreds o[
new terms. Every minor variation in concept is distinguished by a new and
often imposing sounding term. A complete dictionary of the terms used
would fill a large book.
element 4'such that aa' : a'a: e. When the operation is commutative the
group is called commutative or Abelian and the operation is called addition
and denoted by *. The element a is then denoted by 0 and called the zero
element. If the operation is not commutative it is called multiplication and
the element e is denoted by I and called the identity.
The abstract group concePt and the properties that should be attached
to it were slow in coming to light. We may recall (Chap. 31, sec. 6) that
Cayley had proposed the abstract group in 1849, but the merit ofthe notion
was not recognized at the time. In lB58 Dedekind,l far in advance of his
time, gave an abstract definition of finite groups which he derived from
permutation groups. Again in 1877 2 he observed that his modules ofalgebraic
numbers, which call for cr 1 B and cr - p belonging when cr and B do, can be
generalized so that the elements are no longer algebraic numbers and the
operation can be general but must have an inverse and be commutative.
Thus he suggested an abstract finite commutative group. Dedekind's under-
standing of the value of abstraction is noteworthy. He saw clearly in his
work on algebraic number theory the value of structures such as ideals and
fields. He is the effective founder of abstract algebra.
Kronecker,s taking off from work on Kummer's ideal numbers, also
gave what amounts to the abstract definition of a finite Abelian group
similar to Cayley's 1849 concept. He specifies abstract elements, an abstract
operation, the closure proPerty, the associative and commutative properties,
and the existence ofa unique inverse element ofeach element. He then proves
some theorems. Among the various powers of any element 0, there is one
which equals the unit element, l. If z is the smallest exponent for which 0'
equals the unit element, then to each divisor pr of u there is an element {
for which 6u : l.lf 0p and {' both equal I and p and o are the smallest
numbers for which this holds and are relatively prime, then (06)" : l.
Kronecker also gave the first proof of what is now called a basis theorem.
There exists a fundamental finite system of elements 0L, 02, %,. . ' such that
the products
0!,0120\". . ., hi: 1,2,3,"',nt,
represent all of the group just once. The lowest possible values
elements
fl1. n21 ns,... that correspond to 01,02,0.,'.. (that is, for which
qit:1)
are such that each is divisible by the following one and the product ntn2na' ' '
equals the number z of elements in the group. Moreover all the prime factors
of n ane in nr.
In 1878 Cayley wrote four more Papers on finite abstract groups.a In
l. Werke, 3, 439-46.
2. Bull. des Sci. Math., (2), l, 1877, 17-41, p.4l in particular : Werke,3,262-96.
3. Monatsber. Berliner Akad., 1870, 881-89 : lilerke, 1,271-82.
4. Math. Ann., 13, 1878, 561-65; Proc, London Math, Soc.,9, 1878, 126-33; Amer. Jour. of
Math., I,1878, 50-52 and 174-76; all in Vol. l0 of his Collected Math. PaPe$.
ABSTRACT GROT'P THEORY I I39
these, as in his 1849 and 1854 papers, he stressed that a group can be con-
sidered as a general concept and need not be limited to substitution groups'
though, he points out, every (finite) group can be represented as a substitu-
tion group. These papers of Cayley had more influence than his earlier ones
because the time was ripe for an abstraction that embraced more than
substitution groups.
In a joint paper, Frobenius and Ludwig Stickelberger (1850-1936) s
made the advance of reeognizing that the abstract grouP concept embraces
congruences and Gauss's composition of forms as well as the substitution
groups of Galois. They mention groups of infinite order.
Though Eugen Netto, in his book Substitutinnentfuorfu und ihre Anwendung
aqf die Atgebra (lBB2), confined his treatment to substitution groups, his
wording of his concepts and theorems recognized the abstractness of the
concepts. Beyond putting together results established by his predecessors,
Netto treated the concepts of isomorphism and homornorphism. The former
means a one-to-one correspondence between two groups such that if a'D : a,
where a, b, and c are elements of the first group, then a''b' : c', where
a', b', and, c' are the corresponding elements of the second group. A homo-
morphism is a many-to-one correspondence in which again a'b: c imPlies
a'.b' : c'.
By lBB0 new ideas on groups came into the picture. Klein, influenced
by Jordan's work on permutation groups, had shown in his Erlanger Pro-
gramm (Chap. 38, sec. 5) that infinite transformation groups, that is, groups
with infinitely many elements, could be used to classify geometries. These
groups, moreover, are continuous in the sense that arbitrarily small trans-
formations are included in any group or, alternatively stated' the parameters
in the transformations can take on all real values. Thus in the transformations
that express rotation of axes the angle 0 can take on all real values. Klein and
Poincard in their work on automorphic functions had utilized another kind
of infinite group, the discrete or noncontinuous group (Chap. 29, sec. 6).
Sophus Lie, who had worked with Klein around 1870, took up the
notion of continuous transformation groups, but for other Purposes than the
classification of geometries. He had observed that most of the ordinary
differential equations that had been integrated by older methods were
invariant under classes ofcontinuous transformation groups, and he thought
he could throw light on the solution of diflerential equations and classify
them.
ln
lB74 Lie introduced his general theory of transformation groups.s
Such a group is represented by
(l) xi : f,(xr, x2,. . .
5. Jour. filr. Math., 86, l8?9, 21742 = Flobenius, Ces. Abh., l, 545-90.
6. Nachri.chten Kinig. Ges. der Wiss. zu Gdtt., 1874,52942 : Gcs. Abh.,5, l-8,
r r40 THE EMERGENCE OF ABSTRACT ALGEBRA
where the..7f, are analytic in the.r1 and ar. The a, are parameters as opposed to
the 4, which are variables, and (xr, xz, . . ., x*) stands for a point in z-dimen-
sional space. Both the parameters and the variables may take on real or
complex values. Thus in one dimension the class of transformations
, atc+ b
*:A_+a'
where a, b, c, and d take on all real values, is a continuous group. The groups
represented by (l) are called finite, the word finite referring to the number
of parameters. The number of transformations is, of course, infinite' The
oni-dimensional case is a three-parameter group because only the ratios of
a, b, and c to d matter, In the general case the product of two transformations
(r ;)
is another representation, which is called the sum of the separate rePresenta-
tions. Likewise if
,, : (; ';r)
is another representation, when Bo and D, are non-singular matrices oforders
m and. n respectively, the .Bo and D, are also representations, and of lower
order than E, Eo is called a graduated representation ; it and any representa-
tion equivalent to it (F;rEsFs is an equivalent representation ifF is non-
singular and of the order of E ) is called reducible. A rePresentation not
equivalent to a graduated one is called irreducible' The basic idea of an
irreducible representation consisting of a set ol linear transformations in z
variables is that it is a homomorphic or isomorphic representation in which
it is impossible to choose m < n linear functions of the variables which are
transformed among themselves by every operation of the group they repre-
sent. A representation that is equivalent to the sum of irreducible represen-
tations is called completely reducible.
Every finite group has a particular representation called regular. Sup-
pose the elements are labeled gu gz, . . . , g*. Let a be any one of the g's. We
consider an z by n matrix. Suppose a& : €t. Then we place a I in the (i, j)
place of the matrix. This is done for all g, and the fixed a. All the other
elements of the matrix are taken to be 0. The matrix so obtained corresponds
to a. There is such a matrix for each g of the group and this set of matrices is
a left regular representation. Likewise by forming the Products gia we get a
right regular representation. By reordering the g's of the group we can get
other regular representations. The notion of a regular representation was
introduced by Charles S. Peirce in 1879.28
The representation of substitution groups by linear transformations of
the form
*i:)o,1*,, i: 1,2,...,n,
the first part of the twentieth. Frobenius 2s introduced for finite grouPs the
notions of reducible and completely reducible representations and showed
that a regular representation contains all the irreducible representations. In
other papers published from lB97 to 1910, some in conjunction with Schur,
he proved many other results, including the fact that there are only a few
irreducible representations, out of which all the others are composed.
Burnsideso gave another major result, a necessary and sufficient condi-
tion on the coefficients of a group of linear transformations in z variables in
order that the group be reducible. The fact that any finite group of linear
transformations is completely reducible was first proved by Heinrich Maschke
(1853-1908).31 Representation theory for finite groups has led to important
theorems for abstract groups. In the second quarter of this century' rePresen-
tation theory was extended to continuous grouPs, but this development will
not be pursued here.
Aiding in the study of group representations is the notion of group
character. This notion, which can be traced back to the work of Gauss,
Dirichlet, and Heinrich Weber (see note 35), was formulated abstractly by
Dedekind for Abelian groups in the third edition of Dirichlet's Vorlesungen
ilber Zahlentheorie (1879). A character of a group is a function x(s) defined
on all the elements s such that it is not zero for any s and *(ss') : r(s)r(s').
Two characters are distinct if x(s) * r'(s) for at least one s of the group.
This notion was generalized to all finite groups by Frobenius. After
having formulated a rather complex definition,S2 he gave a simpler defini-
tion,33 which is now standard. The character function is the trace (sum ofthe
main diagonal elements) of the matrices of an irreducible representation of
the group. The same concept was applied later by Frobenius and others to
infinite groups.
Group characters furnish in particular a determination of the minimum
number ofvariables in terms ofwhich a given finite group can be represented
as a linear transformation group. For commutative grouPs they permit a
determination of all subgroups,
Displaying the usual exuberance for the current vogue, many mathe-
maticians of the late nineteenth and early twentieth centuries thought that
all mathematics worth remembering would ultimately be comprised in the
theory of groups. Klein in particular, though he did not like the formalism of
abstract group theory, favored the group concept because he thought it would
unify mathematics. Poincard was equally enthusiastic. He said,3a ". . . the
29. Sitzungsber. Akad. Wi,ss- zu Beilin, 1897, 99'1-1015 : Ges. Abh.,3, 82-103.
30. Proc, Loulon Math. Soe,, (2),3, 1905, 430-34.
31. Math. Am.,52, 1899, 363-68.
32. Si.tzungsber. Akad. Wi.ss. zu Berlin, 1896,985-1021 : Gu. Abh,'3, l-37.
33. Sitzungsber. Akad. Wiss. zu Beilin, 1897, 99'l-1015 : Ges. Abh.,3, 82-103.
34. Acta Math.,38, 1921, 145.
r r46 THE EMERGENCE OF ABSTRACT ALGEBRA
theory of groups is, so to say, the whole of mathematics divested of its matter
and reduced to pure form."
, : iP",
a
35. Mart. Ann.,43,1893, 521-49. For grouPs see Math. Ana.,20, 1882, 301-29'
36. Ann. Math. Soc. Trans,,4, 1903' 13-20, and 6, 1905, 198-204'
!7. Amer. Math. Soc. Trans.,4, 1903, 3l-37, and 6, 1905, l8l-97.
THE ABSTRACT THEORY OF FIELDS t r47
(2) g
Z,,P',
wherep is a prime and the coefficients, the c,'s, are ordinary rational numbers
reduced to their lowest form whose denominator is not divisible by 1. Such
expressions need not in general have values as ordinary numbers. However,
by definition they are mathematical entities.
Hensel defined the four basic operations with these numbers and showed
that they are a field. A subset of the 1-adic numbers can be put into one-to-
one correspondence with the ordinary rational numbers, and in fact this
subset is isomorphic to the rational numbers in the full sense of an isomor-
phism between two fields. In the field of p-adic numbers, Hensel defined
units, integral 1-adic numbers, and other notions analogous to those of the
ordinary rational numbers.
By introducing polynomials whose coefficients are p-adic numbers,
Flensel was able to speak of p-adic roots of polynomial equations and extend
to these roots all of the concepts of algebraic number fields. Thus there are
p-adic integral algebraic numbers and more general p-adic algebraic
numbers, and one can form fields of p-adic algebraic numbers that are
extensions ofthe " rational" p-adic numbers defined by (2). In fact, all ofthe
ordinary theory of algebraic numbers is carried over to p-adic numbers.
Surprisingly perhaps, the theory of p-adic algebraic numbers leads to results
on ordinary algebraic numbers. It has also been useful in treating quadratic
forms and has led to the notion of valuation fields.
The growing variety of fields motivated Ernst Steinitz (187l-1928),
who was very much influenced by Weber's work, to undertake a compre-
hensive study of abstract fields; this he did in his fundamental paper, Alge-
braischen Theorie der Kiirper.3B All fields, according to Steinitz, can be divided
into two principal types. Let r( be a field and consider all subfields of .r( (for
example, the rational numbers are a subfield of the real numbers). The
elements common to all the subfields are also a subfield, called the prime 6eld
P of r(. Two types of prime fields are possible. The unit element e is contained
in P and, therefore, so are
er2er, , ., ner. , ..
These elements are either all different or there exists an ordinary integer I
such that pe : 0. In the first case P must contain all fractions nelme, and
since these elements form a field, P must be isomorphic to the field of rational
numbers and l( is said to have characteristic 0.
If, on the other hand, pe : 0, it is readily shown that the smallest such
1 must be a prime and the field must be isomorphic to the field of
integral
residues modulo p, that is, 0, 1,. ..,y' - 1. Then ,I( is said to be a field of
characteristic p. Any subfield of r( has the same characteristic' Then pa :
Pca : O; that is, all expressions in r( can be reduced modulo p'
From the prime field P in either of the types just described, the original
field r( can be obtained by the process of adjunction. The method is to take
an element a in -r( but not in P and to form all rational functions R(a) of a
with coefficients in P and then, if necessary, to take 6 not in rR(a) and do the
same with 6, and to continue the process as long as necessary'
If one starts with an arbitrary field r( one can make various types of
adjunctions. A simple adjunction is obtained by adjoining a single element r'
The enlarged field must contain all expressions of the form
(3) ao + ay* *"'* an!(o,
where the ai are elements of ff' If these expressions
are all different, then the
extended field is the field 1( (x) of all rational functions of x with coefficients
in r(. Such an adjunction is called a transcendental adjunction and r( (x) is
called a transcendental extension. If some of the expressions (3) are equal,
one can show that there must exist a relation (using c for r)
Multiplyingtheset(5)byywillproducemoresuchquantities,untilwe
p' - I or ae"-r : I and so
obtain all o}the form (4). Hence rz must divide
ac':a.Thep'valuesoftheform(4)constituteafinitefield'Galoishad
shown in this concrete situation that the number of elements in a Galois
field of characteristic p is a power of1.
E. fI. Moore{o showed that any finite abstract field is isomorphic to a
p
Galois field of order p", p a prime. There is such a field for every prime and
positive integer z. The characteristic of each field isp' Joseph H' M' Wedder-
Lurn (IASZ-1S+8), a professorPrinceton lJniversity,{l and Dickson
at
provcd simultaneously that any finite field is necessarily commutative (in the
multiplication oPeration). A great deal of work has been done to determine
the s;cture of ihe additive groups contained in Galois fields and the struc-
ture of the fields themselves.
4. Rings
Though the structures rings and ideals were well known and utilized in
Dedef,nd,s and Kronecker's work on algebraic numbers, the abstract thegry
isentire\aproductofthetwentiethcentury.Thewordidealhadalready
been adopted (Chap. 34, sec. 4). Kronecker used the word "order" for
ring;
the latter term was introduced by Hilbert'
Beforediscussingthehistory,itmaybewelltobeclearaboutthemodern
meanings of the concepts. An abstract ring is a collection of elements that
form a Jo-mrrtative group with respect to one operation and are subject to a
second operation applicable to any two elements' The second operation
is
then
where r belongs to R and z is any whole number' If R has a unit element'
ta + na : ra-+ nea : (r + ne)a: r'a, where r' is now any element ol 'R'
The ideal generated by one element is called a principal ideal' Any ideal
other thani and .R is a proper ideal' Similarly if ar, ar, ' ' ', a^ ate n
given
elements of a ring .R with unit element the set of all sums /1d1 * ' ' ' * rna^
o)n',:Z*t,
and the product of two elements of the algebra is defined by
E *,,)
E u,,,\ :) *,u,,,,,,
i,J
5. Non-associatiae Algebras
Modern ring theory, or, more properly, an extension of ring theory, also
includes non-associative algebras. The product operation is non-associative
and non-commutative. The other properties oflinear associative algebras are
applicable. There are today several important non-associative algebras.
Historically the most important is the type called a Lie algebra' It is custom-
ary in such algebras to denote the product of two elements a and D by la, bl.
In place ofthe associative law the product operation satisfies two conditions,
la,b): -lb,al and fa, fb,cll + lb,lc,al) + fc,la,bll:0.
47. Math. Ann., 60, 1905, 20-l 16.
48. Math. Ann., 83, 1921,2+-66.
II54 THE EMERGENCE OF ABSTRACT ALGEBRA
The second proPerty is called the Jacobi identity. Incidentally, the vector
product of two vectors satisfies the two conditions.
An ideal in a Lie algebra I is a sub-algebra Zr such that the product of
any element of tr and any element of I, is in Ir. A simple Lie algebra is one
that has no nontrivial ideals. It is semi-simple if it has no Abelian ideals'
Lie algebras arose out of Lie's efforts to study the structure of his
continuous transformation groups. To do this Lie introduced the notion of
infinitesimal transformations.ro Roughly speaking, an infinitesimal trans-
formation is one that moves points an infinitesimal distance. Symbolically it
is represcnted by Lie as
(7) xi: q * 6rXr(xr, *a,''', xn),
Let ao be the value of the parameter for which $ and' $ are the identity
transformation so that
x-$(x,g,ao), g -$(x,g,as).
Ifao is changed to do + 64, then by Taylor's theorem
\:4(x,t,o") +HEa*...
Ut : *(x,!, "r) + HDa * .' .
and
(9) $$ : {(x,g) 6a, $s : q@,Y) 6a.
If Ea is 6t we get the form (7) or (B). The equations (9) represent an infini-
tesimal transformation of the group.
49. Archiv for Mathematik Naturuidcnskab, l, 1876, 152-93 = Ges. Abh',5,42-75'
NON-ASSOCIATIVE ALGEBRAS I I55
5O. Math. Ana.,3l, 1888, 252-90, and in Vols. 33, 34, and 36.
51. 1894; 2nd ed., Vuibert, Paris, 1933 : CEwres, Part I, Vol. l, 137-286.
52. Ann. de I'Ecolc Norm. Sgp., 31, 1914, 263-355 = CEwres, Part I, Vol. l, 399-491.
53. Bull. Soc. Math. de France,4l, 1913, 53-96 = (Euorcs, Part I, Vol. l, 355-98.
r 156 TIIE EMERGENCE OF ABSTRACT ALGEBRA
54. Mathematische Zeitschrift, 23, 1925, 27l-309, aod 24, 1926, 328-95 : Ges. Abh.' 2'
543-6+7 .
tt57
abstract algebra. This is the case with large parts of number theory, including
algebraic numbers.
However, abstract algebra has subverted its own role in mathematics.
Its concepts were formulated to unify various seemingly diverse and dis-
similar mathematical domains as, for example, group theory did. Having
formulated the abstract theories, mathematicians turned away from the
original concrete fields and concentrated on the abstract structures. Through
the introduction of hundreds of subordinate concepts, the subject has mush-
roomed into a welter of smaller developments that have little relation to each
other or to the original concrete fields. Unification has been succeeded by
diversification and specialization. Indeed, most workers in the domain of
abstract algebra are no longer aware ofthe origins ofthe abstract structures,
nor are they concerned with the application of their results to the concrete
fields.
Bibliography
Artin, Emil: "The Influence of J. H. M. Wedderburn on the Development of
Modern Algebra," Amer. Math. Soc. Bull., 56, 1950, 65-72.
Bell, Eric T.: " Fifty Years of Algebra in America, 1888-1938," Amcr. Math. Soc.
Semicentennial Publications, II, 1938, l-34.
Bourbaki, N.: Eliments d'histoire des mathimatiques, Hermann, 1960' pp. I 10-28.
Cartan, Elie : " Notice sur les travaux scientifiques," (Euares complihs, Gauthier-
Villars, 1952-55, Part I, Vol. I, pp. l-98.
Dicke, Auguste: Emmy Noether, 1882-1935, Birkhiiuser Verlag, 1970.
Dickson, L. E.: "An Elementary Exposition of Frobenius's Theory of Group-
Characters and Group-Determinants," Annals of Math., 4, 1902,2549.
Liwar Algebras, Cambridge University Press, 1914.
Algebras and Their Arithmetics (1923), G. E. Stechert (reprint), 1938.
Frobenius, F. G.: Cesammelte Abhandlungen,3 vols., Springer-Verlag, 1968.
Hawkins, Thomas : " The Origins of the Theory of Group Characters," Archiac Jor
-:
History oJ Exact Sciences, T, 1971, 142-70.
Maclane, Saunders: "Some Recent Advances in Algebra," Amcr. Math. MonNy,
46, 1939, 3-19. Also in Albert, A. A., ed.: Studizs in Modcrn Algcbra, The
Math. Assn. of Amer., 1963, pp, 9-34.
" Some Additional Advances in Algebra," in Albert, A. A., ed.: Studics ;n
Modern Algebra, The Math. Assn. of Amer., 1963, pp.35-58.
Ore, Oystein: "Some Recent Developments in Abstract Algebra," Aner. Math.
Soc. Bull., 37, 1931, 53748.
"Abstract Ideal Theory," Amn. Math. Soc. Bull., 39, 1933, 72845.
Steinitz, Ernst: Algebraische Theorie d.er Kiirper, W. de Gruyter, l9l0; 2nd rev. ed-,
1930; Chelsea (reprint), 1950. The first edition is the same as the articlc in
Jour. fir Math., 137,1910, 167-309.
Wiman, A.: " Endliche Gruppen linearer Substitutionen," Eneyk. dcr Math. Wiss.,
B. G. Teubner, 1898-1904, l, Part l, 522-54.
Wussing, H. L,: Die Cenzsis dts abstrakten Grupfienbegrifies, VEB Deutscher Verlag
der Wisseruchaften, 1969.
5o
The Beginnings of Topology
I believe that we lack another analysis properly geometric or
linear which expresses location directly as algebra expresses
magnitude. G. }lI. LEIBNIZ
r r58
I I59
that to each point of one space there is associated a unique point of the
second or image space and that given any neighborhood of an image point
there is a neighborhood of the original point (or each original point if there
are many) whose points map into the neighborhood in the image space.
This concept is no more than a generalization of the e - E definition of a
continuous function, the e specifying the neighborhood of a point in the
image space, and the E a neighborhood of the original point. A homeo-
morphism between two spaces S and ?"is a one-to-one correspondence that
is continuous both ways; that is, the transformations from S to ?1 and from
Z to .S are continuous. The basic task of point set topology is to discover
properties that are invariant under continuous transformations and homeo-
morphisms. All of the properties mentioned above are topological invariants.
Hausdorff added many results to the theory of metric spaces. In
particular he added to the notion of completeness, which Fr€chet had
introduced in his 1906 thesis. A space is complete if every sequence {4,,} that
satisfies the condition that given e, there exists an .trI, such that la,, - a^l <
"
lor all m and z greater than 1[, has a limit point. Hausdorff proved that
every metric space can be extended to a complete metric space in one and
only one way.
The introduction of abstract sPaces raised several questions that
prompted much research. For example, if a space is defined by neighbor-
hoods, is it necessarily metrizable; that is, is it possible to introduce a metric
that preserves the structure of the space so that limit Points remain limit
points ? This question was raised by Fr6chet. One result, due to Paul S.
Urysohn (1898-1924), states that every normal topological space can be
metrized.2 A normal space is one in which two disjoined closed sets can
each be enclosed in an open set and the two open sets are disjoined. A
related result of some importance is also due to Urysohn. He proved 3 that
every separable metric sPace, that is, every metric space in which a countable
subset is dense in the space, is homeomorphic to a subset of the Hilbert
cube; the cube consists of the space of all infinite sequences {.r,} such that
0 < r, ( l/i and in which distance is defined by d: t/flt@t= gF.
The question of dimension, as already noted, was raised by Cantor's
demonstration of a one-to-one correspondence of line and plane (Chap. 41,
sec. 7) and by Peano's curve, which fills out a square (Chap. 42, sec. 5).
Frdchet (already working with abstract spaces) and Poincar6 saw the need
for a definition of dimension that would apPly to abstract spaces and yet
grant to line and plane the dimensions usually assumed for them. The def-
inition that had been tacitly accepted was the number ofcoordinates needed
to fix the points of a space. This definition was not applicable to general
spaces.
2. Math. Ann., 94, 1925,262-95.
3. Math. Ann.,94, 1925, 309-15.
I I62 TIIE BECINNINGS OF TOPOLOGY
fields, mathematicians have not hesitated to plunge freely and broadly into
point set topology.
12. Leibniz, Math. Schriften, I Abt., Vol. 2, 1850, 19-20: Gcrhardt, Dcr Brbfucclucl
oon lzibniz m;t Malhematikern,l, 1899, 568 : Chr. Huygens, (Euu. Comp,,8' No. 2192.
13. Noui Comm. Acad. Sci. Petrop.,4, 1752-53, 10940, pub. 1758 : Olna, (l)' 26' 7l-93.
14. Noai Comm. Acad. Sci. Petrop,, 4, 1752-53, 140-60, pub. 1758: Olnd, (l),26,9'+-108.
15. Jour. de I'Ecol. Poly.,9, l8l?,6&{6 and 87-98 = @uurcs, (2), 1,7-38.
r r64 THE BEGINNINGS OF TOPOLOGY
Figure 50.3
Figure 50.4
On the Mdbius band the perpendicular will return to the Point on the
" opposite side" with reverse direction.
Still another problem that was later seen to be topological in nature is
called the map problem. The problem is to show that four colors suffice to
color all maps so that countries with at least an arc as a common boundary
are differently colored. The conjecture that four colors will always suffice
was made in lB52 by Francis Guthrie (d. lB99), a little-known professor of
mathematics, at which time his brother Frederick communicated it to De
Morgan. The first article devoted to it was Cayley's;22 in it he says that he
could not obtain a proof. The Proof was attemPted by a number of mathe-
maticians, and though some published proofs were accepted for a time,
they have been shown to be {iallacious and the problem is still open'
The greatest impetus to topological investigations came from Riemann's
work in complex function theory. In his thesis of l85l on complex functions
and in his study of Abelian functions,2s he stressed that to work with func-
tions some theorems of analysis situs were indispensable. In these inves-
tigations he found it necessary to introduce the connectivity of Riemann
..rrf""o. Riemann defined connectivity in the following manner: " If upon
the surface F [with boundaries] there can be drawn z closed curves
dr, dz,. .. , 4,, which neither individually nor in combination completely
bound a part of this surface F, but with whose aid every other closed curve
forms the complete boundary of a part of fl the surface is said to be
(z + l)-fold connected." To reduce the connectivity of a surface (with
boundaries) Riemann states,
By means ofa crosscut lQttaschnittl, that is, a line lying in the interior of
the surface and going from a boundary point to a boundary point, an
(z + I )-fold connected surface can be changed into an z-fold connected
one, F', The parts of the boundary arising from the cutting play the role
of boundary even during the further cutting so that a crosscut can pass
through no Point more than once but can end in one of its earlier points'
. . . To apply these considerations to a surface without boundary, a
closedsurface,wemustchangeitintoaboundedonebythespecialization
of an arbitrary point, so that the first division is made by means of this
point and a crosscut beginning and ending in it, hence by a closed curve'
Riemann gives the example of an anchor ring or torus (fig' 50'5) which is
three-fold connected [genus I or one-dimensional Betti number 2] and can
be changed into a simply connected surliace by means of a closed curve alc
and a crosscut aD'c',
Riemann had thus classified surfaces according to their connectivity
and, as he himself realized, had introduced a topological property' In
22. Proceedings of the Royal Geogrdphical Societg, l, 1879, 25+61 : Coll' Msth' Papers' ll,
74.
23. Jour. fir Malh.,54, 1857, 105-10 : Werke,9l-96; see also ll/crlu,479-82'
THE BEGINNINGS OF COMBINATORIAL TOPOLOGY t 167
50.5
Figure 50.5 Figure 50.6. Sphere with p holes
terms of genus, the term used by the algebraic geometers of the latter part
of the nineteenth century, Riemann had classified closed surfaces by means
of their genus 1, 2p being the number of closed curves floop cuts or .Riic*-
kehrschnitte) needed to make the surface simply connected and, 2p + I to
cut the surface into two distinct pieces. He regarded as intuitively evident
that if two closed (orientable) Riemann surfaces are topologically equivalent
they have the same genus. He also observed that all closed (algebraic)
surfaces of genus zero, that is, simply connected, are topologically (and con-
formally and birationally) equivalent. Each can be mapped topologically
on a sphere.
Since the structure of Riemann surfaces is complicated and a topo-
logically equivalent figure has the same genus, some mathematicians sought
simpler structures. William K. Clifford showed 2a that the Riemann surliace
of an z-valued function with a branch-points can be transformed into a
sphere with p holes where 1D : (w 12) - n * | (Fig. 50.6). There is the
likelihood that Riemann knew and used this model. Klein suggested another
topological model, a sphere with p handles (Fig. 50.7).,6
The study of the topological equivalence of closed surfaces was made
by many men. To state the chief result it is necessary to note the concept of
orientable surfaces. An orientable surface is one that can be triangulated and
each (curvilinear) triangle can be oriented so that any side common to two
triangles has opposite orientations induced on it, Thus the sphere is orientable
but the projective plane (see below) is non-orientable. This fact was discovered
by Klein.26 The chief result, clarified by Klein in this paper, is that two
orientable closed surfaces are homeomorphic if and only if they have the
same genus. For orientable surfaces with boundaries, as Klein also pointed
out, the equality of the number of boundary curves must be added to the
above condition. The theorem had been proved by Jordan.27
The complexity of even two-dimensional closed figures was emphasized
by I(lein's introduction in l8B2 (see sec' 23 of the reference in note 25)
of the surface now called the Klein bottle (Fig. 50.8). The neck enters the
bottle without intersecting it and ends smoothly joined to the base along C.
Along D the surface is uninterrupted and yet the tube enters the surface. It
has no edge, no inside, and no outside; it is one-sided and has a one-
dimensional connectivity number of 3 or a genus ol 1. It cannot be con-
structed in three dimensions.
The projective plane is another example of a rather complex closed
surface. Topologically the plane can be represented by a circle with diametri-
cally opposite points identified (Fig. 50.9). The infinitely distant line is
represented by the semicircumference CAD. The surface is closed and its
connectivity number is I or its genus is 0. It can also be formed by pasting
the edge of a circle along the edge of a M<ibius band (which has just a single
edge), though again the figure cannot be constructed in three dimensions
without having points coincide that should be distinct.
Still another impetus to topological research came from algebraic
geometry. We have already related (Chap. 39, sec. B) that the geometers
Figure 50.8
27. Jour. de Math-, (2), I l, 1866, 105-9 : (Euores,4. 85-49.
r r69
Figure 50.9
had turned to the study of the four-dimensional " surfaces " that represent
the domain of algebraic functions of two complex variables and had intro-
duced integrais on these surflaces in the manner analogous to the theory of
algebraic functions and integrals on the two-dimensional Riemann surfaces.
To study these four-dimensional figures, their connectivity was investigated
and the fact learned that such figures cannot be characterized by a single
number as the genus characterizes Riemann surfaces. Some investigations
by Emile Picard around 1890 revealed that at least a one-dimensional and
a two-dimensional connectivity number would be needed to characterize
such surfaces.
The need to study the connectivity of higher-dimensional figures was
appreciated by Enrico Betti (1823-92), a professor of mathematics at the
University of Pisa. He decided that it was just as useful to take the step
to z dimensions. He had met Riemann in Italy, where the latter had gone
for several winters to improve his health, and from him Betti learned about
Riemann's own work and the work of Clebsch. Betti 28 introduced connec-
tivity numbers for each dimension from 1 to z - 1. The one-dimensional
connectivity number is the number of closed curves that can be drawn in
the geometrical structure that do not divide the surface into disjoined
regions. (Riemann's connectivity number was I greater.) The two-dimen-
sional connectivity number is the number of closed sufaces in the figure
that collectively do not bound any three-dimensional region of the figure.
The higher-dimensional connectivity numbers are similarly defined. The
closed curves, surfaces, and higher-dimensional figures involved in these
definitions are called cycles. (If a surface has edges, the curves must be
crosscuts; that is, a curve that goes from a point on one edge to a point on
another edge. Thus the one-dimensional connectivity number of a finite
hollow tube (without ends) is l, because a crosscut from one edge to the
other can be drawn without disconnecting the surface.) For the four-dimen-
sional structures used to represent complex algebraic functions;fl( x, y, z) : Q,
Betti showed that the one-dimensional connectivity number equals the
three-dimensional connectivity number.
28. Annali di Mat., (2),4, 1870-71, 1,10-58.
tlTo THE BEGTNNTNGS OF TOPOLOGY
O9
Figure 50.10
2,,rr
bounds and not all the c1 vre zera.
THE COMBINATORIAL WORK OF POINCAR6 t r73
Figure 50.I I
lq: dc - Yq,t - lq * |
n(,r') :i ,-,,-"-.
k=O
N(rj : )r-r-o-.
k=0
Figure 50.
homotopy group. The idea arises from considering the distinction between
simply and multiply connected plane regions' In the interior of a circle all
closed curves can be shrunk to a point. However, in a circular ring some
closed curves, those that surround the inner circular boundary, cannot be
shrunk to a point, whereas those closed curves that do not surround the inner
boundary can be shrunk to a point.
The more precise notion is best approached by considering the closed
curves that start and end at a point yo of the complex. Then those curves
that can be deformed by continuous motion in the space of the complex
onto each other are homotopic to each other and are regarded as one class'
Thus the closed curves starting and ending atyo (Fig. 50'12) in the circular
ring and not enclosing the inner boundary are one class. Those which,
starting and ending at go, do enclose the inner boundary are another class'
Those which, starting and ending at yo enclose the inner boundary z times,
are another class.
It is now possible to define an operation ofone class on another, which
geometrically amounts to starting at yo and' traversing any curve of one
class, then traversing any curve of the second class. The order in which
the curves are chosen and the direction in which a curve is traversed are
distinguished. The classes then form a group' called the fundamental
group of the complex with respect to the base pointyo. This non-commutative
group is now denoted by r{K,yo) where r( is the complex. In reasonable
complexes the group does not actually depend uPon the point. That is, the
groups at yo and yr, say, are isomorphic. For the circular ring the funda-
mental group is infinite cyclic. The simply connected domain usually
referred to in analysis, e.g. the circle and its interior, has only the identity
element for its fundamental group. Just as the circle and the circular ring
difler in their homotopy groups, so it is possible to describe higher-dimen-
sional complexes that differ markedly in this respect.
Poincar6 bequeathed some major conjectures. In his second supplement
he asserted that any two closed manifolds that have the same Betti numbers
t t76 THE BEGINNINGS OT TOPOLOGY
5. Combinatorial Inoariants
The problem of establishing the invariance of combinatorial properties is
to show that any complex homeomorphic to a given complex it the point
set sense has the same combinatorial properties as the given complex' The
proof that the Betti numbers and torsion coefficients are combinatorial
invariants was first given by Alexander.3? His result is that if K and 'I(1 are
any simplicial subdivisions (not necessarily rectilinear) of any two homeo-
morphic (as point sets) polyhedra P and' Pr, then the Betti numbers and
33. Readiconti del Circolo Matematico di Paletmo, 18, 1904, 45-l f0 : CEuores, 6, 435-98'
34. Amer. Math. Soc. Trans.,2O, 1919, 339-42'
35. The generalized conjecture has been proved for z > 5 by Stephen Smale (zlzcr'
Math. Soc. Bull.,66, 1960, 373-75), John R. Stallings (i6r,/.' 485-88) and E' C' Zeeman
(ibid., 67, 1961, 270).
i6. The Hauptaermulung has been shown to be true for finite simplicial complexes (which
are more general than manifolds) of dimension less than three and to be false for such
complexes of dimension greater than five. It is correct for manifolds of dimension less
than or equal to three and an open problem for manifolds of dimension four or greater'
See John i:l.lilnor, Annals of Math., (2), 74, 1961, 575-90.
37, Amer, Math. Soc. Trans., 16, 1915, 148-54.
FIXED POINT THEOREMS t t77
torsion coefficients of P and P1 are equal. The converse is not true, so that
the equality of the Betti numbers and torsion coefficients of two complexes
does not guarantee the homeomorphism of the two complexes.
Another invariant of importance was contributed by L. E. J. Brouwer.
Brouwer became interested in topology through problems in function theory.
He sought to prove that there are 39 - 3 classes of conformally equivalent
Riemann surfaces of genus g > I and was led to take up the related topo-
logical problems. He proved 38 the invariance of the dimension of a complex
in the following sense: If 1( is an z-dimensional, simplicial subdivision of a
polyhedron P, then every simplicial subdivision ofP and every such division
of any polyhedron homeomorphic to P is also an z-dimensional complex.
The proofs of this theorem, as well as Alexander's theorem, are made by
using a method due to Brouwer,ss namely, simplicial approximations of
continuous transformations' Simplicial transformations (simplexes into
simplexes) are themselves no more than the higher-dimensional analogues
of continuous transformations, while the simplicial approximation of
continuous transformations is analogous to linear approximation applied to
continuous functions. If the domain in which the approximation is made is
small, then the approximation serves to represent the continuous transforma-
tion lor the purposes of the invariance proofs.
in the interval 0 < r < I and the initial condition !:0 at *:0. The
solution {(.r) satisfies the equation
c(") : Jo
F(x,f(x)) dx
metric spaces as opposed to figures that are complexes to start with. The
basic schemes were introduced by Paul S. Alexandroff (l8g6-),6s Leopold
Vietoris (l89l-),5? and Eduard dech (1893-1960).ss The details will not be
given, since they involve a totally new approach to homology theory.
However, we should note that this work marks a step toward the fusion of
point set and combinatorial topology.
Bibliography
Bouligand, Georges: Lcs D6fuitions
motlernzs th la dimension, Hermann, 1935.
Dehn, M., and P, Heegard: "Analysis Situs ," Ercgk. der Math. I4ziss., B, G. Teubner,
1907-10, III AB3, 1s3-220.
Franklin, Philip: "The Four Color Problem,', Scripta Mothemarr'ea,6, 193g, 149-56,
t97-2t0_
Hadamard, J. : " L'(Euvre mathdmatique de Poincar6, Acta Math., 38, lg2l,
20347.
Manheim, J. H.: Tlu Gcneis of Point Sct Topologg, Pergamon press, 1964.
Osgood, William F. : " Topics in the Theory of Functions of Several Complex
Variables," M adison Collo quium, Amertcan Mathematical Society, l g l d
pp. lll-230.
PoincarC, Henri: (Eatres, Gauthier-Villars, 1916-56, Vol. 6.
Smith, David E.: A Source Book in Mathematis, Dover (reprint), 1959, Vol. 2,
404-10.
Tietze, H,, and L, Vietoris: " Beziehungen zwischen den verschiedenen Zweigen
der Topologie," Entgk. der Math. Wiss., B. G. Teubner, lgl,+-31, III ABl3,
t4t-237.
Zoretti, L,, and A. Rosenthal: "Die Punktmengen,,' Encyk. da Math. Wiss.,B. G.
Teubner, 1923-27, II CgA, 855-1030.
l. Intro&ution
By far the most profound activity of twentieth-century mathematics has been
the research on the foundations. The problems thrust upon the mathema-
ticians, and others that they voluntarily assumed, concern not only the nature
of mathematics but the validity of deductive mathematics.
Several activities converged to bring foundational problems to a head
in the first part of the century. The first was the discovery of contradictions,
euphemistically called paradoxes, notably in set theory. One such contra-
diction, the Burali-Forti paradox, has already been noted (Chap' 41, sec' 9)'
A number of others were discovered in the first few years of this century'
CIearIy the discovery of contradictions disturbed the mathematicians
deeply. Another problem that had gradually been recognized and that
emerged into the open early in this century was the consistency of mathe-
matics (Chap. 43, sec. 6). In view of the paradoxes of set theory consistency
had to be established especially in this area.
During the latter part of the nineteenth century a number of men had
begun to reconsider the foundations of mathematics and, in particular, the
relationship of mathematics to logic. Research in this area (about which
we shall say more later) suggested to some mathematicians that mathematics
could be founded on logic. Others questioned the universal application o[
logical principles, the meaningfulness of some existence proofs, and even the
,"liu.r"" upon logical proof as the substantiation of mathematical results.
r r8z
THE PARADOXES OF SET THEORY r r83
Controversies that had been smoldering before 1900 broke out into open
fire when the paradoxes and the consistency problem added fuel. Thereupon
the question of the proper foundation for all mathematics became vital and
of widespread concern.
the complement of .r, that is, all non-r, in some large universe of objects
might not be safe.
Zermelo's development of set theory was improved by Abraham A.
Fraenkel (lBgl-1965).? Additional changes were made by von Neumann'8
The hope of avoiding the paradoxes rests in the case ofthe Zermelo-Fraenkel
system on restricting the types of sets that are admitted while admitting
enough to serve the foundations of analysis. Von Neumann's idea was a
Iittle more daring. He makes the distinction between classes and sets. Classes
are sets so large that they are not contained in other sets or classes, whereas
sets are more restricted classes and may be members of a class' Thus sets
are the safe classes. As von Neumann pointed out, it was not the admission
of the classes that led to contradictions but their being treated as members
of other classes.
Zermelo's formal set theory, as modified by Fraenkel, von Neumann,
and others, is adequate for developing the set theory required for practically
all of classical analysis and avoids the paradoxes to the extent that, as yet,
no one h:rs discovered any within the theory. However, the consistency of
the axiomatized set theory has not been demonstrated. Apropos ofthe open
question of consistency Poincar€ remarked, "We have put a fence around
the herd to protect it from the wolves but we do not know whether some
wolves were not already within the fence."
Bcyond the problem of consistency, the axiomatization of set theory
used the axiom of choice, which is needed to establish parts of standard
analysis, topology, and abstract algebra. This axiom was considered objec-
tiorrable by a number of mathematicians, among them Hadamard, Lebesgue'
Borel, and Baire, and in l9(X, when Zermelo used it to prove the well-
ordering theorem (Chap. 41, sec. B), a host ofobjections flooded thejournals.s
The questions of whether this axiom was essential and whether it was
independent of the others were raised and remained unanswered for some
time (see sec. B).
The axiomatization ofset theory, despite the fact that it left open such
questions as consistency and the role of the axiom of choice, might have put
mathematicians at ease with respect to the paradoxes and have led to a
decline in the interest in foundations. But by this time several schools of
thought on the foundations of mathematics, no doubt stirred into life by the
paradoxes and the problem of consistency, had become active and conten-
tious. To the proponents of these philosophies the axiomatic method as
practiced by Zermelo and others was not satisfactory. To some it was objec-
tionable because it presupposed the logic it used, whereas by this time logic
itself and its relation to mathematics was under investigation. Others, more
radical, objected to the reliance upon any kind of logic, particularly as it
was applied to infinite sets. To understand the arguments that various
schools of thought propounded, we must go back in time somewhat.
10. Pub. 1690: G. W. Leibniz: Die lhilosophischen Schiften, ed. by C' I' Gerhardt,
1875-90, Yol. 4, 27-102.
THE RISE OF MATHEMATICAL LOGIC r rBg
of the M's, and d of the hI's are in I and D of the M's are in .8, then there
are at least (a + b - m) A's tb,at are.B's. The point olDe Morgan's observa-
tion is that the terms may be quantified. He was able as a consequence to
introduce many more valid forms of the syllogism. Quantification also
eliminated a defect in Aristotelian logic. The conclusion " Some l's are
B's" which in Aristotelian logic can be drawn from "All l's are.B's" implies
the existence of l's but they need not exist.
De Morgan also initiated the study of the logic of relations. Aristotelian
logic is devoted primarily to the relationship "to be" and either asserts
or denies this relationship. As De Morgan pointed out, this logic could not
prove that if a horse is an animal, then a horse's tail is an animal's tail. It
certainly could not handle a relation such as ,r lovesy. De Morgan introduced
symbolism to handle relations but did not carry this subject very far.
In the area of symbolic logic De Morgan is widely known for what are
now called De Morgan's laws. As he stated them,11 the contrary of an
aggregate is the compound ofthe contraries of the aggregates; the contrary
of a compound is the aggregate of the contraries of the components. In
logical notation these laws read
I- (.r +e) : (I - '()(t - !).
t-xs:(l-") +(r-y).
The contribution of symbolism to an algebra of logic was the major
step made by George Boole (l 815-1864), who was largely self-taught and
became professor ol mathematics at queens College in Cork. Boole was
convinced that the symbolization of language would rigorize logic. His
Mathematie al Analysis of Logic, which appeared on the same day as De
Morgan's Formal Logic, and his An Inaestigation of the Laws of Thought (1854)
contain his major ideas.
Boole's approach was to emphasize extensional logic, that is, a logic
of classes, wherein iets or classes were denoted by x,y, 2,.., whereas the
symbols X, Y, 2,.. . represented individual rnembers. The universal class was
denoted by I and the empty or null class by 0. He used ry to denote the
intersection of two sets (he called the operation election), that is, the set of
elements common to both x and g, and * + y to denote the set consisting
of all the elements in ,r and in y. (Strictly for Boole addition or union
applied only to disjoined sets; W. S. Jevons [ 835-82] generalized the
concept.) The complement of r is denoted by I - *. More generally x - y
is the class of x's that are not y's. The relation of inclusion, that is, x is
contained in y, he wrote as x! : x. The equal sign denoted the identity of
the two classes.
Boole believed that the mind grants to us at once certain elementary
processes of reasoning that are the axioms of logic. For example, the law
ll. Trans. Camb. Phil. Soc., !0, 1858, 173-230.
r r90 THE FOUNDATIONS OF MATHEMATICS
The theory of types seeks to avoid the paradoxes, which arise because a
collection of objects contains a member that itself can be defined only in
terms of the collection. The resolution by Russell and Whitehead of this
difficulty was to require that " whatever involves all members of a collection
must not itself be a member ol the collection'" To carry out this restriction
in the Principia, they specify that a (logical) function cannot have as one of
its arguments anything defined in terms of the function itself. They then
discuss the paradoxes and show that the theory ol types avoids them.
However, the theory of types leads to classes of statements that must
be carelully distinguished by type. If one attempts to build mathematics in
accordance with the theory of types, the development becomes exceedingly
complex. For example, in the Principia two objects a and b are equal if for
every property P(x), P(a) and P(b) are equivalent propositions (each implies
the other). According to the theory of types, P may be of diflerent types
because it may contain variables of various orders as well as the individual
objects a or D, and so the definition of equality must aPPly for all types of P;
in other words, there is an infinity of relations of equality, one for each type
of property. Likewise, an irrational number defined by the Dedekind cut
proves to be ofhigher type than a rational number, which in turn is ofhigher
type than a natural number, and so the continuum consists of numbers of
different types. To escape this complexity, Russell and Whitehead introduced
the axiom of reducibility, which affirms the existence, for each propositional
function of whatever type, ol an equivalent propositional function of type
zeto.
Having treated propositional functions, the authors take up the theory
of classes. A class, loosely stated, is the set of objects satisflying some proposi-
tional function. Relations are then expressed as classes of couples satisfying
propositional functions of two variables. Thus "xjudgesy" expresses a rela-
tion. On this basis the authors are prepared to introduce the notion of
cardinal number.
The definition of a cardinal number is of considerable interest. It
depends upon the previously introduced relation of one-to-one correspond-
ence between classes. If two classes are in one-to-one correspondence, they
are called similar. The relationship of similarity is proven to be reflexive,
symmetric, and transitive. All similar classes possess a common property,
and this is their number. However, similar classes may have more than one
common property. Russell and Whitehead get around this, as had Frege,
by defining the number of a class as the class of all classes that are similar
to the given class. Thus the number 3 is the class of all three-membered
classes and the denotation of all three-membered classes is lx, y, z| with
x + y + z. Since the definition of number Presupposes the concept of one-
to-one correspondence, it would seem as though the definition is circular.
The authors point out, however, that a relation is one-to-one if, when *
r r96 THE FOUNDATIONS OF MATHEMATICS
and r'have the relation toy, then x and, x'are identical, and when x has the
relation to y and y' , then y and y' are identical. Hence the concept of one-to-
one correspondence does not involve the number l.
Given the cardinal or natural numbers, it is possible to build up the
real and complex number systems, functions, and in fact all of analysis.
Geometry can be introduced through numbers. Though the details in the
Prircipia differ somewhat, our own examination of the foundations of the
number system and of geometry (Chaps. 4l and 42) shows that such con-
structions are logically possible without additional axioms.
This, then, is the grand program of the logistic school. What it does
with logic itself is quite a story, which we are skimming over here briefly.
What it does for mathematics, and this we must emphasize, is to found
mathematics on logic. No axioms of mathematics are neededl mathematics
becomes no more than a natural extension of the laws and subject matter
of logic. But the postulates of logic and all their consequences are arbitrary
and, moreover, formal. That is, they have no content; they have merely
form. As a consequence, mathematics too has no.content, but merely form.
The physical meanings we attach to numbers or to geometric concePts are
not part of mathematics. It was with this in mind that Russell said that
mathematics is the subject in which we never know what we are talking
about nor whether what we are saying is true. Actually, when Russell
started this program in the early part ofthe century, he (and Frege) thought
the axioms of logic were truths. But he abandoned this view in the 1937
edition of the Principles of Mathematics.
The logistic approach has received much criticism. The axiom of
reducibility aroused opposition, for it is quite arbitrary. It has been called
a happy accident and not a logical necessity; it has been said that the axiom
has no place in mathematics, and that what cannot be proved without it
cannot be regarded as proved at all. Others called the axiom a sacrifice of
the intellect. Moreover, the system of Russell and Whitehead was gever
completed and is obscure in numerous details. Many efforts were made
later to simplify and clarify it.
Another serious philosophical criticism of the entire logistic position
is that if the logistic view is correct, then all of mathematics is a purely
formal, logico-deductive science whose theorems follow from the laws of
thought. Just how such a deductive elaboration of the laws of thought can
represent wide varieties of natural phenomena such as acoustics, electro-
magnetics, and mechanics seems unexplained. Further, in the creation of
mathematics perceptual or imaginative intuition must supply new concepts,
whether or not derived from experience. Otherwise, how could new knowledge
arise? But in the Principia all concepts reduce to logical ones.
The formalization of the logistic program apparently does not represent
mathematics in any real sense. It presents us with the husk, not the kernel.
THE INTIIITIONIST SCIIOOL I I97
non-o<istent " ?
which results from the passage o[ time, is abstracted from all special occur-
rences. The remaining empty form [the relation of n to n * l] of the common
content of all these twonesses becomes the original intuition of mathematics
and repeated unlimitedly creates new mathematical subjects." Thus by
unlimited repetition the mind lorms the concept ol the successive natural
numbers. This idea that the whole numbers derive from the intuition of time
had been maintained by Kant, William R. Hamilton in his "Algebra as a
Science of Time," and the philosopher Arthur Schopenhauer.
Brouwer conceives of mathematical thinking as a process of construction
that builds its own universe, independent of the universe of our experience
and somewhat as a free design, restricted only in so far as it is based upon the
fundamental mathematical intuition. This fundamental intuitive concept
must not be thought ofas an undefined idea, such as occurs in postulational
theories, but rather as something in terms oJ which all undefined ideas that
occur in the various mathematical systems are to be intuitively conceived, if
they are indeed to serve in mathematical thinking'
Brouwer holds that "in this constructive process' bound by the obliga-
tion to notice with care which theses are acceptable to the intuition and
which are not, lies the only possible loundation for mathematics." Mathe-
matical ideas are imbedded in the human mind' prior to language, logic, and
czpaience, The intuition, not experience or logic, determines the soundness
and acceptability of ideas. It must of course be remembered that these
statements concerning the role of experience are to be taken in the philo-
sophical sense, not the historical sense.
The mathematical objects are for Brouwer acquired by intellectual
construction, wherein the basic numbers l, 2, 3, . . . furnish the Prototype
of such a construction. The possibility of the unlimited repetition of the
empty form, the steP from z to z * I , leads to infinite sets. However,
Brouwer's infinite is the potential infinity of Aristotle, whereas modern
mathematics, as founded for example by Cantor, makes extensive use of
actually infinite sets whose elements are all prdsent "at once."
In connection with the intuitionist notion of infinite sets, Weyl, who
belonged to the intuitionist school, says that
in 1904. FIis motives at that time were to provide a basis for the number
system without using the theory of sets and to establish the consistency ol
arithmetic. Since his own proofolthe consistency of geometry reduced to the
consistency of arithmetic, the consistency of the latter was a vital open
question. He also sought to combat Kronecker's contention that the irra-
tionals must be thrown out. Hilbert accepted the actual infinite and praised
Cantor's work (Chap. 41, sec. 9). He wished to keep the infinite, the pure
existence proofs, and concepts such as the least upper bound whose definition
appeared to be circular.
Hilbert presented one paper on his views at the International Congress
of 1904.1? He did no more on this subject for fifteen years; then, moved by
the desire to answer the intuitionists' criticisms of classical analysis, he took
up problems of the foundations and continued to work on them for the rest
of his scientific career. He published several key papers during the nineteen-
twenties. Gradually a number of men took up his views.
Their mature philosophy contains many doctrines. In keeping with the
new trend that any foundation lor mathematics must take cognizance olthe
role of logic, the formalists maintain that logic must be treated simultane-
ously with mathematics. Mathematics consists of several branches and each
branch is to have its own axiomatic foundation. This must consist ol logical
and mathematical concepts and principles. Logic is a sign language that puts
mathematical statements into formulas and expresses reasoning by formal
processes. The axioms merely express the rules by which formulas follow
from one another. All signs and symbols of operation are lreed from their
significance with respect to content. Thus all meaning is eliminated from the
mathematical symbols. In his 1926 paper 18 Hilbert says the objects of
mathematical thought are the symbols themselves. The symbols are the
essence; they no longer stand for idealized physical objects. The formulas
may imply intuitively meaningful statements, but these implications are not
part of mathematics.
Hilbert retained the law of excluded middle because analysis depends
upon it. He said,le " Forbidding a mathematician to make use of the prin-
ciple of excluded middle is like forbidding an astronomer his telescope or a
boxer the use of his fists." Because mathematics deals only with symbolic
expressions, all the rules of Aristotelian logic can be applied to these formal
expressions. In this new sense the mathematics of infinite sets is possible.
Also, by avoiding the explicit use of the word "all," Hilbert hoped to avoid
the paradoxes.
To formulate the logical axioms Hilbert introduced symbolism for
17. Proc. Third Inlcrnat. Congress oJ Math., Heidelberg, 1904, l7'[-85 : Grundlagen der Geom.,
7th ed.,247-61; English trans. in Monist, 15, 1905, 338-52.
18. Math. Ann.,95, 1926, l6l -90 : Grundlagen der Geometrie,7th ed.,262-88. See note 20.
19. Weyt, Amer. Math. Soc. Bull.,50, 1944,637 : Ces. Abh., +, 157.
I2o5
concepts and relations such as "and," "or," " negation," "there exists,"
and the like. Luckily the logical calculus (symbolic logic) had already been
developed (for other purposes) and so, Hilbert says, he has at hand what he
needs. All the above symbols are the building blocks lor the ideal expressions
formulas.
-theTo handle the infinite, Hilbert uses, aside from ordinary noncontrover-
sial axioms, the transfinite axiom
The major result (G6del's incompleteness theorem) states that if any formal
theory 7 adequate to embrace number theory is consistent and if the axioms
of the formal system of arithmetic are axioms or theorems of 7, then 7" is
incomplete. That is, there is a statement S olnumber theory such that neither
S nor not-S is a theorem ofthe theory. Now either S or not-S is true; there is,
then, a true statement of number theory which is not provable. This result
applies to the Russell-Whitehead system, the Zermelo-Fraenkel system, and
Hilbert's axiomatization of number theory. It is somewhat ironic that Hilbert
in his address at the International Congress in Bologna ol 1928 (see note 22)
had criticized the older proofs ofcompleteness through categoricalness (Chap'
42, sec. 3) but was very confident that his own system was complete. Actually
the older proofs involving systems containing the natural numbers were
accepted as valid only because set theory had not been axiomatized but was
used on a naive basis.
Incompleteness is a blemish in that the formal system is not adequate
to prove all the assertions frameable in the system. To add insult to injury,
there are assertions that are undecidable but are intuitively true in the
system. Incompleteness cannot be remedied by adjoining S or - S as an
axiom, for Godel proved that any system embracing number theory must
contain an undecidable proposition. Thus while Brouwer made clear that
what is intuitively certain falls short ol what is mathematically proved,
G6del showed that the intuitively certain goes beyond mathematical proof'
One of the implications of Gcidel's theorem is that no system of axioms
is adequate to encompass, not only all of mathematics, but even any one
significant branch of mathematics, because any such axiom system is
incomplete. There exist statements whose concepts belong to the system'
which cannot be proved within the system but can nevertheless be shown to
be true by nonformal arguments, in fact by the logic of metamathematics'
This implication, that there are limitations on what can be achieved .by
axiomatization, contrasts sharply with the late nineteenth-century view that
mathematics is coextensive with the collection of axiomatized branches'
Godel's result dealt a death blow to comprehensive axiomatization. This
inadequacy of the axiomatic method is not in itsell a contradiction, but it
was surprising, because mathematicians had expected that any true state-
ment could certainly be established within the framework of some axiomatic
system. Of course the above arguments do not exclude the possibility of new
methods of proof that would go beyond what Hilbert's metamathematics
permit.
Hilbert was not convinced that these blows destroyed his program' He
argued that even though one might have to use concepts outside a formal
system, they might still be finite and intuitively concrete and so acceptable'
Hilbert was an optimist. He had unbounded confidence in the power of
man's reasoning and understanding. At the talk he gave at the l92B
r 2()8 THE FOUNDATIONS Otr' MATHEMATICS
22. Atti Del Congresso lnternazionale Dei Matematici,l, l35-41 : Grundlagen der Geometri.e, Tth
ed.,313-23.
23. Jour. fiir Math., 154, 1925, l.
24. Abh. Math- Seminar der Hamburger Llnitt., |, 1922. 157-77 : Ges. Abh., 3, 157-77 .
r 2og
Bourbaki, offer this encouragement: 27 " There are now twenty-five centuries
during which the mathematicians have had the practice of correcting their
errors and thereby seeing their science enriched, not impoverished; this gives
them the right to view the future with serenity."
Whether or not the optimism is warranted, the presegrt state of mathe-
matics has been aptly described by Weyl : 28 " The question of the ultimate
foundations and the ultimate meaning of mathematics remains open; we do
not know in what direction it will find its final solution or even whether a
final objective answer can be expected at all. 'Mathematizing' may well be
a creative activity of man, like language or music, of primary originality,
whoe€ historical decisions defy complete objective rationalization. "
Bibliography
Bcclrer, Oskar: Cnmdlagen dcr Matlumati* in geschirhtlirlur Entwicklung, Verlag Karl
Alber, 1956, 317--401.
B€th, E. W.z Mailunatical Thought: An Introduction to thc Philosophy of Mathematics,
Gordon and Breach, 1965.
Bochcnsli, I. M.: A History oJ Formal loglc, University of Notre Dame Press, 1962;
Chelsea (reprint), 1970.
Boole, George: An Intcstigation of thc Laus of Thought (1854), Dover (reprint), 1951.
Tlu Matlumatital Analysis of Ingic (1847), Basil Blackwell (reprint), 1948.
Colhctcd Iagical Wo*s, Operr Court, 1952.
BourbaLi, N.: Elimcnts d'histoire des mathimatiqws, 2nd ed., Hermann, 1969, I l-64,
Brouwer, L. E. J. : " Intuitionism and Formalism," Amer. Math. Soe , Bull,, 20,
l9l3ll4, 8l-96. An English translation of Brouwer's inaugural address as
profcseor of mathematics at Amsterdam.
Church, Alonzo: "The Richard Paradox," Amer. Math. Monthly,4l, 1934,356-61.
Cohen, Paul J., and Reuben Hersh : " Non-Cantorian Set Theory," Scienlifu
Anwban, Dec. 1967, 10'[-16.
Couturat, L.: La Logiqu dc lzi.bniz d'apris des documents inidits, Alcan, 1901.
De Morgan, Augustus: On lhe Syllogism and Other Logical Witings, Yale University
Press, 1966. A collection of his papers edited by Peter Heath.
Dresden, Arnold: "Brouwer's Contribution to the Foundations of Mathematics,"
Ama. Math. Soc. Bull.,30, 1924,31-40.
Enriques, Federigo: The Historit Deoelopmmt of Logir, Henry Holt, 1929.
Fraenkel, A. A.: "The Recent Controversies About the Foundations of Mathe-
matics," Scripta Mathematira, 13, 1947, 17-36.
Fraenkel, A. A., and Y. Bar-Hillel: Foundatitns oJ Set Tluory, North-Holland, 1958.
Frege, Gottlob: Thc Foundations of Arithmetic, Blackwell, 1953, English and German;
also English translation only, Harper and Bros., 1960.
The Basi.c Laws of Arithmctir, University of California Press, 1965.
Gerhardt, C. I., ed.: Die phitosophischen Schiften uon G. W.bibniz, 1875-80, Vol. 7.
Giidel, Kurt: On Formallg Undeeidablc Propositions qf Principia Mathematica czl
Related Systems, Basic Books, 1965.
" What Is Cantor's Continuum Problem ?," Amer. Math. Monthly, 54, 19+7,
sl s-25.
The Consistency of the Axiom of Choice and of tht Genetaliz2d Contiiluum Hyqothetis
with the Axioms o-f Set Theory, Princeton University Press, 1940; rev. ed., 1951'
Kneale, William and Martha: The Deuelopment of Logic, Oxford University Press,
r962.
I(neebone, G. T.: Mathematical Logic and the Foundations of Matlumatirs, D' Van
Nosrand, 1963. See the Appendix especially on developments since 1939'
Leibniz, G. W.: Iagbat Papcrs, edited and translated by G. A. R. Parkinson,
Oxford University Press, 1966.
Lewis, C. l.: A Suroey of Synbolic Logic, Dover (reprint), 1960' pp. l-117.
Meschkowski, Herbert: Probleme des tlnzndlichen, Werk und Itben Georg Cantors,
F. Vieweg und Sohn, 1967.
Mostowski, Andrzej: Thirty Ycars of Foundational Studdas, Barnes and Noble, 1966.
Nagel, E., and.f. R. Newman: Gddel's Proof, New York University Press, 1958.
Poincar6, Henri: The Foundations of Scicnce, Science Press, 1946, 44H5. This is a
reprint in one volume of Sciente and Hypothesis, The Valut oJ Scierce, and Scieme
and Melhod.
Rosser, J. Barkley: "An Informal Exposition of Proofs of Gtidel's Theorems and
Church's Theorem," Journal of Symbolic Logic,4, 1939, 53-60.
Russell, Bertrand: Tht Principles of Malhemalics, George Allen and Unwin, 1903;
2nd ed., 1937.
Scholz, Heinrich: Concise History of Logic, Philosophical Library, 1961.
Styazhkin, N. l.: History of Mathematical Logic from lzibniz to Paazo, Massachusetts
Institute of Technology Press, 1969.
Van Heijenoort, Jean: From Frege to Giidel, Harvard University Press, 1967. Trans-
lations of key papers on Iogic and the foundations of mathematics.
Weyl, Hermann: " Mathematics and Logic," Amu. Math. Monthlg, 53, 1946,
2-13 : Gu. Abh., 4,26U79.
Phihsopfu of Mathemalics and Natural Schrce, Princeton University Press,
1949.
Wilder, R. L.: "The Role of the Axiomatic Method," Amet. Math. Monthlg,74,
1967, rtF27 .
Abbreviations
Journals whose titles have been written out in full in the text'are not listed here.
Abh. der Bayer. Akad. der l4zrss. Abhandlungen der Kdniglich Bayerischen Akademie
der Wissenschaften (Miinchen)
Abh. der Ges. der Wiss, zu Gdtt. Abhandlungen der Kiiniglichen Gesellschaft der
Wissenschalten zu Gottingen
Abh. Kdnig.tAkad. dtr Wiss,, Berlin Abhandhngen der Kdniglich Preussischen
Akademie der Wissenschaften zu Berlin
Abh. Ki;niglich Bdhm. Ges. der Wiss. Abhandlungen der Kdniglichen Btihmischen
Gesellschaft der Wissenschaften
Abh. Math. Seminar der Hamburger Uzda. Abhandlungen aus dem Mathematischen
Seminar Hamburgischen Universitiit
Acta Acad. Sci, Petrop, Acta Academiae Scientiarum Petropolitanae
Acta Erud. Acta Eruditorum
Acta Math. Acta Mathematica
Acta Soc. Fennicae Acta Societatis Scientiarum Fennicae
Amer. Jour. of Malh, American Journal of Mathematics
Amer. Math. Monthlg American Mathematical Monthly
Amcr. Math- Soc. Bull. American Mathematical Society, Bulletin
Amer, Math. ,9oc. Trans. American Mathematical Society, Transactions
Ann. de l'Ecole Norm. ,Sup. Annales Scientifiques de l'Ecole Normale Sup6rieure
Ann. de Math. Annales de Math6matiques Pures et Appliqudes
Ann. Fac. Sci. de Toulouse Annales de la Facult6 des Sciences de Toulouse
Ann. Soc. Sci. Bruxelles Annales de la Soci6t6 Scientifique de Bruxelles
Annali di Mat. Annali di Matematica Pura ed Applicata
Annals of Math. Annals of Mathematics
Astronom. iy'acrl. Astronomische Nachrichten
Aui Accad. Torino Atti della Reale Accademia delle Scienze di Torino
Aui della Accad. dei Lincei, Rendiconli Atti della Reale Accademia dei Lincei, Rendi-
conti
Brit. Assn. for Ada. o.f Sci. British Association for the Advancement of Science
Bull. des Sci. Math. Bulletin des Sciences Math6matiques
Bull. Soc, Math. de Ftance Brlletin de la Soci6t6 Mathdmatique de France
Cambridge and Dublin Math. Jour. Cambridge and Dublin Mathematical Journal
Comm, Acad. Sci, Pelrop. Commentarii Academiae Scientiarum Petropolitanae
,213
r2t4 ABBREVTATTONS
l2l9
r220 NAME INDEX
Bernoulli, Daniel, 4or, 446, 479'8r, 492' Brill, Alexander von, 938, 94o
bo7-ro, br2,52r, b24, b78,677, rrt2 Brillouin, Ldon, r toz
B€rnoulli, James: calculus, 380-83, 4o7, Briot, Charles A. 4.,642, 7r8,722
4rz; calculus of variations, 51877; co' Brouncker, Lord William, 255, e78, 468'69
ordinate Seom€try, 32o; differential ge- Brouwer, Luitzen E. 1., rt3z, 116z, rr7r,
ometry, 56r; infinite series, 4Eg, 442-46, r r77-79, r r99-rro3, rzoS
Mere, Chevalier de. See Gombaud, An- function concept, 339-4o; infinite series,
toine 437-42, 46ti compared with Leibniz,
Mersenne, Marin, z7g-76,278, zg5, 3o4, 478 378-8o; methodology of science, 33r, 334;
Mertens, Franz, 929 philosophy of science, 2rg, 616, 620
Metrodorus, r38 Nicholas of Cusa, Cardinal, z4r
Metzler, William Henry,8o8,8rr Nichomachus, 135-38, eor
Meusnier,Jean-Baptiste-Marie-Charles, Nicole,Frangois,548
563-64,586 Nicomedes, r r7-r 8, e86
Mezir-iac, Claude Bachet de, z6r,274 Nieuwentijdt, Bernard, 385
Mill, John Stuart, 86r Niibeling, A. Georg, 1162
Miller, George A., r r42 Noether, Emmy, 93r-32, r t53, r t8o
Milnor, John, rr5r, r176 Noether, Max, 933-34, g4o-4r,944
Minding, Ferdinand, 6o8,886,893,9o5 Novikov, P. S., rr43
Minkorvski, Hennann, 8e9, 99r
Mirimanoff, Dimiry, 8eo Ohm, Martin, 976, 982
Mittag-Lemer, Gtista, 668 Olbers, Heinrich W. M., 8r8, 872, 98o
Mitbius, Augustus Ferdinand, j77, 846, Oldenburg, Henry, 2?9,37o
852-53,9r8, ro28, 1164-66 Olympiodorus, 58o
Mohr, George, :34, 84o Oresme, Nicole, 2rc'rt,24r, 437
Moivre, Abraham de, 4og, 453-94, 6oo Ostrogradsky, Michel, 683, 789-9o
Motien, Theodor, r r44 Oughtred, William, 258
Monge, Gaspard, 46 596-4o, b+7, b6b-69, Ozanam, Jacques, 3e3, rozS
6r4, 6r6-r7, 8or, 896, 84o
Monte, Guidobaldo del, r:3, 234, 286 Pacioli, Luca, 224-37, zbo-bt, 260
Montmort, Pierre R6mond de,473 Painlevi, Paul,737
Montucla, J. F. (Jean-Etienne), 6r? Pappus, 26, 98, 57, t2Z-29, 168, t74, 223,
Moore, Eliakim H,, r5, roo9, ror8, ro8z, roo5
-42, rrio
r r4r Parent, Antoine,545
Morland, Samuel, 258 Parmenides, 17, r5o
Morlcy, Frank,839-4o Parseval, Marc-Antoine, 716-17
Motre, Andrew,364 Pascal, Blaise, 2rg, 252, 288, 272-75, 395,
Mourraille, J. Raymond, g8r roz6, rosS; biography, 195-97; calculus,
Mtiller, Iohannes (Regiomontanus) 298- g5o, gb2-53,383-84; projective geometry,
39 zg7-3or,84o
Muller, J. H. T., 984 Pasch, Moritz, roo8'9, t t37
Murphy, Robert, Trr Peacock, George, 6zz, 773-15, 974-75, 98o
Mydorge, Claude, 295,8o4 Peano, GiusePPe, 988-89, roog-ro, ror4,
ror8, ro2o, ro38, ro4:-49, rrg7, rr6e
Nabu-rimanni, 4 Peckham, John, er3
Napier, John, 256-58 Peirce, Benjamin, 793, ro23
Nasir-Eddin, rgr, 196,864, 866 Peirce, Charles S., 793, rr44, rrgr
Nave, Annibale della, e63 Peletier, Jacques, roo5
Nalier, Clau<le L. M. H., 696-98 Pemberton, Henry, 392
Neile, William, 98, 355 Pericles, 3?
Nelson, Leonard, r 183-84 Peurbach, George, 238
Netto, Eugen E., ror8, l l3g Peyrard, Fran$ois, 57
Neugebauer, otto, 47 Pfaff, Johann Friedrich, 489, 87o
Neumann, Carl Gottfried, 666, 685, 7o4, Philolaus, 28, t47'48
Zrr, ro54 PhiloPonus,:rr
Neumann, John von, rogz-95, r186, rzo5 Piazzi, Giuseppe,8To
Newton, Sir Isaac: biography, 465-69; alge- Picard, (Charles) Emile, 668, 705'6, 72o,
bra, 2S2, 2b4, 27r-72,28r, 600, 60?; as- 945, to25, to37, to4o, ro7o, 1169
tronomy, gig-6g, 47o; calculus, 354, 359- Pick, Georg, rrSr
65, 384, 387, 556; calculus of variations, Pieri, Mario, roog'ro
Ei3-24; coordinate geometry, 548-49; dif- Piero della Francesca, 233, 235
ferential equatiors, 47o, 47b,4gr -93, 497; PierPont, James, ro96
1226 NAME INDEX
1229
r 23o SUBJECT INDEX
metry, 186, 3oo, 3o2, 3o8, 322; of sci- Optics, 88, 166-68, r96, rre-t3, e85-86, 3o7,
ence,223,395-35 3r4-r1, ib1, g7g-8r , 74o
Methodus Fluxionum et Serierum Infini- Ordinal number, rooo-r
tarum, g0r-62,364, 38r, 439, 47o Ordinary difierential equation, 468-Eoo,
Meusnier's theorem, 864 578, 7o9-A8; adjoint, 482; Bernoulli's
Miletus,2g,22 474; Bessel's, 488-89, St9; Clairaut's 4?6-
Minimal surface, 539, 579, 585-86, 75o 7T exact,476; existence theorems, 7r7-
Miibius band, r r6i-66 2r, tr?9-jg; first order, 4gr, 47r-78;
Modular system, 828 Fuchsian, 7zr-zz. 724-26; higher order,
Module,93r,94g 484-87; hypergeometric, 489, 7rz,' 7t3:
Moment problem , rc72, ttt7 Lami's, 7zr-zzi Legendre's, 529, 7rr;
Morley's theorem, 839-4o linear, 485-87, 73o-32; Mathieu's, 7r3-
Multiple integrals, 4sS-26, 798, ro48-49 r4; method of series, 488-89, 7og-rs;
Multiple points of curves. See Singular nonlinear, 483-84, 732-38; periodic solu-
points of curves tions, ir5-t4, ?go-92; Riccati, 488-84;
Multiple-r'alued complex functions, 64r- second order, 4?8-84; singular solutions
42,655-62 ot, 476-78; systems of, 4go-92, 735, 742t
Museum of Alexandria, rol variation of parameters, 497-gg; Weber's,
Music, r48, 478-79, 48r, 5r5, 5zr-rr, 693' 7t4; see also Asymptotic series, Auto-
g4i see also Vibrating-string problem morphic functions; Qualitative theory
of ordinary difierential equations;
Sturm-Liouville theory; Summability
Napier's rule, z4o
Orthogonal system of functions, 716, to66
Navier-Stokes equations, 696-97
Orthogonal trajectories, 474-75
Navigation, rr9, 25o, 286, gg6-97, 47o'7t
Osculating circle, 556
N-dimensional geometry, 782, 8go, r028-31
Osculating plane, 599, 56t
N€ative number, r43, r85, rga, r5e-53,
Ostrogradsky's theorem. See Divergence
592-93 theorem
Neumann problem, 685
Oval of Descartes, 3r5-ro
Newton's laws of motion, 866-62, 4go
Newton's parallelogram, 439-40, 852
P-adic fields, r r46-47
Newton-Raphson method, 38r
Pappus's theorem, rz8, 297-98
Nine-point circle, 897
Pappus-Guldin tlreorem, r 29
Non-Archimedian geometry, ror6
Pappi, 16, 20, 2b, rgg
Non-Euclidean geometry, 1eg, 86t-8r, 947,
Parabolic cylinder functions, 7 r4
ror4, ro25; applicability, 8j2-72, 817, Paradoxes of set theory, r r82-85
92l-22; axioms for, ror5; consistency, Parallel axiom, &), 117, 852,863-67, 9t6,
88o, 9r3-r7; hyperbolic, 9o5-6; implica-
ror2, !or4
tions, 879-80; models, 888, 9o5-6, 9ta-
Parallel displacement, r r30-33
r7; priority of creation, 877-79; single Parseval inequality, rogg
and double elliptic, 9o4, 906, gre-r3;
Riemannian geometry
see also
Parseval s theorem, 7r6-t7, 97r, to47
Partial derivative, 425
Non-Riemannian geometries, r t 33-35
Partial differential equation, 962, 502-43,
Normal, S60-6r
Number, 29-30; amicable, gr, e?8, 610; 56?-68, 6?r-io?; classification, 7oo-7or;
existence theorems, 685, 699-707, rr?8-
hexagonal, 3r-32: pentagonal, 3r; perfect,
79; first order, 53r-35; Hamilton-Jacobi,
3r, ?8, r37, 2?8, 6ro; polygonal, t37, j44t lJ,eal equation, 672-75, 679, 687-89;
277-78, Azg; prime, see Prime number;
Helmholtz, 693-96, ro56; norrlinear, 536-
square, Bo; triangular, zg-3o, 8:8; see
. also Complex number, Irrational num- 40; Poisson's, 682, 684-85; Potential, 524-
mer, Negative number; Theory of num- 529, 659, 68r-87, ?o3-5; reduced rvave
bers.
cquation, 693-94; scParation of vari'
ables, 5r6-r7, 673-74; systems of., 54o'42,
696-99; total, 532; wave equation, 5os-
Operator, rc76-77, ro9z, ro85-89, rog4; Her- 2c,6qo-94
mitian, roge-93 Pascal triangle, 27e-?3
Optichs,958 Pascal's theorem, e9?-98, 848
r 236 SUBJECT INDEX
Pasch's axiom, ror r-rz r98-99, z8z, A83-89, 393-94, 426-94, 617-
Peano curve, ror8, r 16r rg, ro24-26i indirect method of, 33,
Peano's axioms, 988-89 44-45
Pell's equation, e78, 6to-r r Proportion, 22, r37-98, s37; Eudoxian
Pendulum motion, g3?, 469, 47r-72, 479, theory of, 68
556 Pseudosphere, 893, 9o5
Periodicity modules, 64r, 662 Ptolemy dynasty, rcie-g
Permanence of form, 773-75 Puiseux's theorem, 5S2-58
Permutation. se" Substitution Pure and applied mathematics, r036-38
Permu tations and combinations, :73 Pythagorean number philosophy, z r g
Persia, 4, ro Pythagorean theorem, \o, 20, gg, 6Z-64,
Perslrcctive, egr-94, 286-87 r84
Pi, to-rr, r9, rg4-35, 25t, 255, 353, 439, Pythagorean triples, ro, 3r-32, g4
448, 893, 98o-8c Pythagoreans, 27-14, 49, t47-bo
Picard's theorems, 668
Platonic school, 4r-48 Quadratic equation, 8-9, rg, 186-87, r92-
Pliicker formulas, 852 93; solved geometrically, 76-77
Poincard conjecture, r r75-76 Quadratic form,799-8o; reduction to stan-
Poincare-Bendixson theorem, ?37 dard form, 299, 80r-2; infinite, r069-66;
Poinere's last theorem, r r78 see also Law of inertia
Point at in6nity, 290 Quadratic reciprocity, 6rr-r2, 8rg-r5, 8r7
Polar coordinates, gtg Quadratrix, g9-40, 48
Pole and polar, 96-97, 2914, a98-99, 845 Quadrature, 4s
Polyhedra, regular, 47, 85-86 Quadric surface, ro8-ro, r68, 545-46, 848
Positional nohtion, 5-7, r85 Quadrivium, r46, r49-50, 20r-P
Potential tieory, 522-rg, 659, 68r-87, ro55- Qualitative theory of ordinary differential
56; equation, 524-29, 689, 682-87, ?o3-5; equations, 732-38, rrlo
function, 524, 68e-86 Quantics,928
Power series,64g-44; see also Taylor's Quantitative yersus qualitative knowl-
theorem edge, 933-94
Precession of the equinoxes, r58, 369 quartic equations, 267-?o
Primary and secondary qualities, 326, 329 Quaternion, 779-Bz, 7gr, tozg
Prime number, j8, 277, 609, 8go-32; see Quintic equation, 763
also Theory of numbers; Prime num-
ber theorem Rate of change, instantaneous, 344,360
Prime number theorem, 830-32 Reduction of singularities, 94r-4:
Principia Mathernatica, t rgz Reformation, g r8
Principlb of continuity, 385-87, 84r, 843- Relativity, 894, r r3o-3r
45 Religious motivation, rrg-2o, BBg
Principle of duality, 845-46, 848-49, 855 Residue,638,64o
Principle of Irast Action, 58t-8e, 587-89, Resolvent equation, 604, 760
6zo, 739'45 Resultant, 6o6-8, 797-98
Principle of Least Time, 3r5, 58o-8t Revival of Greek works, 2ob-j,2t6-r7
Principle of Stationary Phase, rogg Ricci tensor, r rz7
Printing, srT Ricci's lemma, r rz9
Probability,273 Riemann four index symbol,8g4, rrz5
Projectile motion, 286, 476, 479 Riemann hypothesis, 83r
Projection, z3z, 287 Riemann mapping lheorem, 666
Projective geometry, 283, 285"3or, 834- Riemann problem, 724, 726, 1o6,9
59, rm7-ro; algebraic, 852-59; and met- Riemann surface, 656-62, gl4-3b, gl7
ric geometry, go4-:3; relation to Eu' Riemann zeta function, 89 r
clidean Beometry, 850-52, 9og, ro33; rela- Riemannian geometry, 889-99, r r26-2?,
tion to non-Euclidean geometry, gog-r2, r r3r-39; applicability, 893
rogg; see also Algebraic invariants Riemann-Irbesguc lemma, l 046-47
Projective plane, rgo, r r68 Riemann-Roch theorem, 66b, 94o
Proof, r4, 20-22, 34, 44-46, bo, t44, 17r, Riesz representation theorem, ro85
SUBJECT INDEX r237
Ricsz-Fischer theorem, 1072, ro84 Subgroup, Tir8; inyariant, see normal; nor-
Rigor, 947-?7, ro25, r2o9; see also Proof mal, 76r; self-conjugate, see Normal
Ring, 8er-zz, 93r, I r50-53 Substitution, 602- Z, j 4j -t 8
Romans, ro6, r78'8o Sumcriarrs,4
Rule of false position, r8-rg Summability, 464, ro98, rrog-zo; Abcl,
Ruled surfaccs, 567, 859 rrrr; Borel, rrrT-rg; Cesiro, rrr3; Fro-
benius, rrr2, rrrg; Hiilder, rrr:-r3; Sti-
Scholastics,2oT eltjes, rrrg-r6
Schwarz-Christoffel mapping, 666-67, 687 Summation convention, r r2?
Schwarz's inequality, ro83, ro93 Superposition, 87
Section, 232, e87 Sylow's theorcm, ?68, rr4r
Seleucid period,4, toz Symbolic logic. See Mathematical logic
Semicubical parabola, 98, 55o Symbolism, ro, r39-40, r43-44, r86, r9z,
Serret-FreDet formulas, 56r -62 259-63,94o,3?8
Set, 9?o, gg4-roo3, rrSg; closed, 995-to78, Symmetric functions, 600, 602
rr6o; derived, g7o, lo78; enumerable,
995-97; first species, 97o; inRnite, 992- Tangcrlt, g42-46
roog; limit point of, 97o; oPen, 995, Tauberian theorem, rrrg
1160; perf€ct, gg5; Power of, 995; well- Tautochrone. See Isochrone
ordered, roor; sea also Space, abstract Taylor's theorem , 442, 464-65, 964
Shape of the earth, 469'7o,522 Tensor analysis, 784, rt22-22
Sieve of Eratosthenes, r38 Thiorie analytique de la chaleur, 672,
Simplex, r t7r q6r, 966
Singular poirrts of curves, 549'52,935,94r' Thiorie d.es lorctions analytiques, 4o6,
42; conjugate point, 55o; cusP, 549-50; 43o-gz, 948,964, ro:9
double point, S49; multiPle point,549; Theory of algebmic equations, z7o-j2, l8r,
node,54g 597-600
Singularities of differential equations, 72r' Theory of forms, 826-19
26, 733, 13? Theory of numbers, rc,29-ga, b3, 77-80,
Slide rule, :98 tZ6-49, 274-78,608-12, 8r3-22, 928; ana-
Sophists,27,3?-42 lytic,829-32; J"e arso Biquadratic reci-
Sound. 482-83, gto-:2, 69r procity; Cubic reciprocity; Pell's equa-
Space: abstract, ro78-8o, rt59-63; com- tion; Prime number; Prime number
pact, ro78, rr6(); comPlete, rr6r; con- theorem; Quadratic reciprocity; Theory
nected, r160; extremal, to?8; function, of forms
ro78-8o, ro8*-84, rr78; interior Point of, Theory of perturbations, 494-98
ro78; limit point of, r160; metric, ro79, Theory of surfaces, 56:-69, 859; cubic, 859;
r 160-6r; metrizable, r16r: neighbor- difierential geometric, 882-88; isomet-
trood, ro7g, rr6o-6r; normal, to87-89; ric, 886; quartic, 859; see also Alge-
perfect, ro78; separable, rr6oi see also braic geometry; Kummer surface; Quad-
Banach space; Hilbert sPace; Set ric surface
Space curves, 2Bb, bb7-62, b66,941 Theory of types, r r9S
Special functions, 4zz'26, 7o9, 7 15 Theta functions, 64q'io
Specialization, ro24 Three-body problem, 169, 492-gZ, 496-97,
Spherical circle, 845, 854
Spherical functions, b29-3o, 7rr-rP Topology, 9:o, r r58-8r; combinatorial,
Spherical harmonics. ser SPherical func- r r59, r169-8r; point bet, rrsg-68
tions Torsion: of a complex, rrTZ-74, rt76-7i,
Stability of the solar system, 73o, 735-36 r r8o; oI a curve, 559-62
Stereometry,23S Tractatus d,e Quad,ratura Curuarum,, 36t-
Sieltjes integral, ro4t, to85 6z
Stirling series, 4bg, to97 Tractrix, 982, 471-74, }ob
Stokes line, r ro3, r ro8 Transcendental number, 593-94, 98o-8e
Stokes' thcorcm, 7go Transfinite number, 992- loo3
Straight line, infinitencss of, 863 Transformation of coordinates, 4e6, b46-42
Sturm-Liouville theory, 7r5-r7 Translation of books, eo6, 2eo
r238 SUBJECT rNDEX