Professional Documents
Culture Documents
Dmth403 Abstract Algebra
Dmth403 Abstract Algebra
'
0%
Edited by:
Richa Nandra
ABSTRACT ALGEBRA
Edited By
Richa Nandra
Printed b y
EXCEL BOOKS PRIVATE LIMITED
A-45, Naraina, Phase-I,
New Delhi-110028
for
Lovely Professional University
Phagwara
SYLLABUS
Abstract Algebra
Objectives:
To learn about the structure as group, ring and field.
To gain knowledge about homomorphisms, isomorphisms, cosets, quotient groups, and the isomorphism theorems,
rings, ideals, ring homeomorphisms, isomorphisms and its theorems.
To learn about fields, quotient fields and field extensions Galois Theory also.
Unit 2: Groups 20
Richa Nandra, Lovely Professional University
Unit 3: Subgroups 37
Richa Nandra, Lovely Professional University
CONTENTS
Objectives
Introduction
1.1 Sets
1.2 Cartesian Product
1.3 Relations
1.4 Functions
1.5.2 Divisibility in Z
1.6 Summary
1.7 Keywords
Objectives
Introduction
In this unit, we will discuss some basic ideas concerning sets and functions. These concepts are
elementary to the study of any branch of mathematics, in particular of algebra. In the unit, we
discuss some basic number theory. The primary aim of this section is to assemble a few facts that
will be required in the rest of the course. We also hope to give you a glimpse of the elegance of
number theory. It is this sophistication that led the mathematician Gauss to call number theory
the "queen of mathematics". Let us start explaining these concepts one-by-one.
You must have used the word 'set' off and on in your conversations to describe any collection.
In mathematics, the term set is used to describe any well defined collection of objects, that is,
every set should be so described that given any object it should be clear whether the given object
belongs to the set or not.
For instance, the collection N of all natural numbers is well defined, and hence is a set. But the
collection of all rich people is not a set, because there is no way of deciding whether a human
being is rich or not.
If S is a set, an object a in the collection S is called an element of S. This fact is expressed in
symbols as a S (read as "a is in S" or "a belongs to S"). If a is not in S, we write a S. For example,
3 R the set of real numbers. But 1 R .
A set with no element in it is called the empty set, and is denoted by the Greek letter (phi). For
example, the set of all natural numbers less than 1 is .
Roster Method: In this method, we list all the elements of the set within braces. For instance, the
collection of all positive divisors of 48 contains 1, 2, 3, 4, 6, 8, 12, 16, 24 and 48 as its elements. So
this set may be written as (1, 2, 3,4, 6, 8, 12, 16, 24, 48).
Convention 1: The order in which the elements of the set are listed is not important.
Convention 2: No element is written more than once, that is, every element must be written
exactly once.
1 1
For example, consider the set S of all integers between 1 and 4 . Obviously, these integers
2 4
are 2, 3 and 4. So we may write S = (2, 3, 4).
We may also write S = (3, 2, 4), but we must not write S = (2, 3, 2, 4). Why? Isn’t this what
Convention 2 says?
The roster method is sometimes used to list the elements of a large set also. In this case we may
not want to list all the elements of the set. We list a few, enough to give an indication of the rest
of the elements. For example, the set of integers lying between 0 and 100 is (0, 1, 2 ,........., 100),
and the set of all integers is Z = (0, +1, !2, ........ }.
Another method that we can use for describing a set is the Set Builder Method.
Set Builder Method: In this method we first try to find a property, which characterises the
elements of the set, that is, a property P which all the elements of the set possess, and which no
other objects possess. Then we describe the set as
Z = {x | x is an integer}.
Some other sets that you may be familiar with are Notes
a
Q, the set of rational numbers = a, b Z, b 0
b
Subsets: Consider the sets A = (1, 3, 4) and B = (1, 4). Here every element of B is also an element
of A. In such a case, that is, when every element of a set B is an element of a set A, we say that B
is a subset of A, and we write this as B A.
It is obvious that if A is any set, then every element of A is certainly an element of A. So, for
every set A, A A.
Now consider the set S = (1, 3, 5, 15) and T = (2, 3, 5, 7}. Is S T? No, because not every element
of S is in T; for example, 1 S but 1 T. In this case we say that S is not a subset of T, and denote
it by S T.
Note that if B is not a subset of A, there must be an element of B which is not an element of A. In
mathematical notation this can be written as ‘ x B such that x’ A’.
We can now say that two sets A and B are equal (i.e., have precisely the same elements) if and
only if A B and B A.
Let us now look at some operations on sets. We will briefly discuss the operations of union,
intersection and complementation on sets.
Union: If A and B are subsets of a set S, we can collect the elements of both to get a new set. This
set is called their union. Formally, we define the union of A and B to be the set of all those
elements of S which are in A or in B. We denote the union of A and B by A B. Thus,
A B=(X S | X A or x B).
Again, if A = {l, 2, 3, 4) and B = (2, 4, 6, 8), then A B = {l, 2, 3, 4, 6, 8). Observe that 2 and 4 are in
both A and B, but when we write A B, we write these elements only once, in accordance with
Convention 2 given earlier.
Now we will extend the definition of union to define the union of more than two sets.
If A1, A2, A3, ..........,Ak are k subsets of a set S, then their union A1 A2 . ...... Ak is the set of
elements which belong to at least one of these sets. That is,
k
The expression A1 A2 ........ Ak is often abbreviated to A . i
i 1
Notes If is a collection of subsets of a set S, then we can define the union of all members of by
Now let us look at another way of obtaining a new set from two or more given sets.
Intersection: If A and B are two subsets of a set S, we can collect the elements that are common to
both A and B. We call this set the intersection of A, and B (denoted by A B. So,
A B = { x S | X " A and x B } .
Now suppose A = {1, 2) and B = (4, 6, 7). Then what is A B? We observe that, in this case, A and
B have no common elements, and so A B = , the empty set.
When the intersection of two sets is , we say that the two sets are disjoint (or mutually disjoint).
For example, the sets (1, 4) and (0, 5, 7, 14) are disjoint.
The definition of intersection can be extended to any number of sets. Thus, the intersection of k
subsets A1, A2 ,....., Ak of a set S is
In general, if is a collection of subsets of a set S, then we can define the intersection of all the
members of by
Apart from the operations of unions and intersections, there is another operation on sets, namely,
the operation of taking differences.
Differences: Consider the sets A = { 1, 2, 3] and B = [2, 3, 4]. Now the set of all elements of A that
are not in B is {1}. We call this set the difference A \ B. Similarly, the difference B \A is the set of
elements of B that are not in A, that is, {4}.
Thus, for any two subsets A and B of a set S,
A\B = { x S | x A and x B }
When we are working with elements and subsets of a single set X, we say that the set X is the
universal set. Suppose X is the universal set and A X. Then the set of all elements of X which are
not in A is called the complement of A and is denoted by A’, AC or X \ A.
Thus,
Ac = {x X | x A }.
For example, if X = [a, b, p, q , r) and A = {a, p, q], then Ac = (b, r).
An interesting set that can be formed from two given sets is their Cartesian product, named after
the French philosopher and mathematician Rene Descartes (1596 - 1650). He also invented the
Cartesian co-ordinate system.
Let A and B be two sets. Consider the pair (a, b), in which the first element is from A and the Notes
second from B. Then (a, b) is called an ordered pair. In an ordered pair the order in which the two
elements are written is important. Thus, (a, b) and (b,a) are different ordered pairs. Two ordered
pairs (a, b) and (c, d) are called equal, or the same, if
a = c and b = d.
Definition: The Cartesian product A x B, of the sets A and B, is the set of all possible ordered pairs
(a, b), where a A, b B.
For example, if A = {l , 2 , 3} and B = (4, 6), then
A × B = { (1, 4), (1, 6), (2, 4), (2, 6), (3, 4), (3, 6) }.
Also note that
B × A = { (4, 1), (4, 2), (4, 3), (6, 1), (6, 2), (6, 3) } and A x B B x A.
(ii) If A has m elements and B has n elements, then A x B has mn elements. B x A also has mn
elements. But the elements of B x A need not be the same as the elements of A x B, as you have just
seen.
We can also define the Cartesian product of more than two sets in a similar way. Thus, if A1, A2,
A3, .......... An, are n sets, we can define their Cartesian product as
R x R x R = { (al, a2, a3) | ai R for i = 1, 2, 3 ), and so on. It is customary to write R2 for R x R and
Rn for R x .......... x R (n times).
Now, you know that every point in a plane has two coordinates, x and y. Also, every ordered
pair (x, y) of real numbers defines the coordinates of a point in the plane. So, we can say that R2
represents a plane. In fact, R2 is the Cartesian product of the x-axis and the y-axis. In the same way
R3 represents three-dimensional space, and Rn represents n-dimensional space, for any n 1.
Note that R represents a line.
1.3 Relations
You are already familiar with the concept of a relationship between people. For example, a
parent-child relationship exists between A and B if and only if A is a parent of B or B is a parent
of A.
Again, if Q is the set of all rational numbers and R is the relation ‘is greater than’, then 3 R 2
(because 3 > 2).
Example: Consider the relation R on Z given by ‘aRb if and only if a > b’. Determine
whether R is reflexive, symmetric and transitive.
Solution: Since a > a is not true, aRa is not true. Hence, R is not reflexive.
If a > b, then certainly b > a is not true. That is, aRb does not imply bRa. Hence, R is not
symmetric.
Since a > b and b > c implies a > c, we find that aRb, bRc implies aRc. Thus, R is transitive.
Example: Let S be a non-empty set. Let (S) denote the set of all subsets of S, i.e.,
(S) = (A | A S}. We call p (S) the power set of S.
Define the relation R on (S) by
If A B, B need not be contained in A. (In fact, A B and B A A = B.) Thus, R is not symmetric.
A very important property of an equivalence relation on a set S is that it divides S into a number
of mutually disjoint subsets, that is, it partitions S. Let us see how this happens.
Let R be an equivalence relation on the set S. Let a S. Then the set { b S | aRb } is called the
equivalence class of a in S. It is just the set of elements in S which are related to a. We denote it
by [a].
This is
[1] = { n | 1 R n ; n N }
Similarly,
(i) [1] and [6] are not disjoint. In fact, [1] = [6]. Similarly, [2] = [7], and so on.
(ii) N = [I] U [2] U [3] U [4] U [5], and the sets on the right hand side arc mutually disjoint.
Theorem 1: Let R be an equivalence relation on a set S. For a S, let [a] denote the equivalence
class of a. Then
(a) a [a],
(c) S= [a]
a S
aRa V a S. a [a].
(b) Firstly, assume that b " [a]. We will show that [a] [b] and [b] [a]. For this, let x [a].
Then xRa.
We also know that aRb. Thus, by transitivity of R, we have xRb, i.e., x [b]. [a] [b]. We can
similarly show that [b] [a].
[a] = [b].
Conversely, let x S. Then x [x] by (a) above. [x] is one of the sets in the collection whose union
is [a] .
a S
Thus, S [a]
a S
and [a] S , proving (c).
a S
[a] = [b].
Note that, in Theorem 1, distinct sets on the right hand side of (c) are mutually disjoint because
of (d). Therefore, (c) expresses S as a union of mutually disjoint subsets of S; that is, we have a
partition of S into equivalence classes.
Let us look at some more examples of partitioning a set into equivalence classes.
Notes
Example: Let S be the set of straight lines in R × R. Consider the relation on S given by
‘L1 R L2 iff L1 = L2 or L1 is parallel to L2. Show that R is an equivalence relation, What are the
equivalence classes in S?
Solution: R is reflexive, symmetric and transitive. Thus, R is an equivalence relation. Now, take
any line L1 (see Figure 1.1).
Let L be the line through (0, 0) and parallel to L1. Then L [L1]. Thus, [L] = [L,]. In this way the
distinct lines through (0, 0) give distinct equivalence classes into which S is partitioned. Each
equivalence class [L] consists of all the lines in the plane that are parallel to L.
In the next section we will briefly discuss a concept that you may be familiar with, namely,
functions.
1.4 Functions
Recall that a function f from a non-empty set A to a non-empty set B is a rule which associates
with every element of A exactly one element of B. This is written as f : A B. If f associates with
a A, the element b of B, we write f(a) = b. A is called the domain of f, and the set f(A) = { f(a) |
a A] is called the range of f. The range of f is a subset of B.
Note that
If we define g : A B by g(1) = 1, g(2) = 1, g(3) = 4, then g is also a function. The domain of g Notes
remains the same, namely, A. But the range of g is {1, 4}.
Remark: We can also consider a function f : A B to be the subset { (a, f(a)) | a A } of A × B.
Now let us look at functions with special properties.
Definition: A function f : A B is called One-one (or injective) if f associates different elements
of A with different elements of B, i.e., if a1, a2 A and al a,, then f(al) f(a2). In other words, f is
1-1 if f(a1) = f(a2) a1 = a2.
In the examples given above, the function f is one-one. The function g is not one-one because 1
and 2 are distinct elements of A, but g(1) = g(2).
Now consider another example of sets and functions.
Let A = (1, 2, 3), B = { p, q, r }. Let f : A B be defined by f(1) = q, f(2) = r, f(3) = p. Then f is a function.
Here the range of f = B = codomain of f. This is an example of an onto function, as you shall see.
Definition: A function f : A B is called onto (or surjective) if the range of f is B, i.e., if, for each
b B, there is an a A such that f(a) = b. In other words, f is onto if f(A) = B.
For another important example of a surjective function, consider two non-empty sets A and B.
We define the function 1 : A × B A : 1((a, b)) = a. l is called the projection of A × B onto A. You
can see that the range of 1 is the whole of A. Therefore, 1 is onto. Similarly, : A × B B : 2 ((a,
b)) = b, the projection of A × B onto B, is a surjective function.
Consider the following example that you will use again and again.
Example: Let A be any set. The function IA : A A : IA(a) = a is called the identity function
on A. Show that I, is bijective.
Solution: For any a A, IA (a) = a. Thus, the range of IA is the whole of A. That is, IA is onto.
IA is also 1-1 because if a1, a2 A such that a1 a2, then IA (a1) IA (a2).
Thus, IA is bijective.
If f : A B is a bijection, then we also say that the sets A and B are equivalent. Any set which is
equivalent to the set { 1, 2, 3 ,............, n}, for some n N, is called a finite set. A set that is not finite
is called an infinite set.
Definition: Let A and B be two sets and f : A B be a function, Then, for any subset S of B, the
inverse image of S under f is the set
f–1(S) = { a A | f(a) S }.
f–1({ 1, 2, 3 }) = { n N | f(n) { 1 , 2 , 3 } }
= { n N | n + 5 { 1,2,3 }}
= , the empty set.
We now give some nice theorems involving the inverse image of a function.
If f : A B and g : C D are functions and if the range of f is a subset of C, there is a natural way
of combining g and f to yield a new function h : A # D. Let us see how.
Example: Let A = {1, 2, 3}, B = {p, q, r} and C = {x, y}. Let f : A B be defined by f(1) = p,
f(2) = p, f(3) = r. Let g : B C be defined by g(p) = x, g(q) = y, g(r) = y. Determine if f o g and g o
f can be defined.
Solution: For f o g to be defined, it is necessary that the range of g should be a subset of the
domain of f. In this case the range of g is C and the domain of f is A. As C is not a subset of A,
f o g cannot be defined.
Since the range of f, which is (p, r), is a subset of B, the domain of g, we see that g o f is defined.
Also g o f : A C is such that
g o f(l) = g(f(1)) = g(p) = x,
We now come to a theorem which shows us that the identity function behaves like the number
1 R does for multiplication. That is, if we take the composition of any function f with a suitable
identity function, we get the same function f.
Proof: Since both f and IA are defined from A to A, both the compositions f o IA and IA o f are
defined. Moreover, V x A,
In the case of real numbers, you know that given any real number x + 0, $ y % 0 such that xy = 1.
y is called the inverse of x. Similarly, we can define an inverse function for a given function.
Definition: Let f : A B be a given function. If there exists a function g : B A such that f o g =
IB and g o f = IA, then we say that g is the inverse of f, and we write g = f –1.
Note that in this example f adds 3 to x and g does the opposite – it subtracts 3 from x. Thus, the
key to finding the inverse of a given function is : try to retrieve x from f(x).
For example, let f : R R be defined by f(x) = 3x + 5. How can we retrieve x from 3x + 5? The
x5
answer is “first subtract 5 and then divide by 3”. So, we try g(x) = (x) . And we find
3
f(x) 5 (3x 5) 5
g o f(x) = g(f(x)) = x.
3 3
(x 5)
Also, f o g(x) = 3(g(x)) + 5 = 3 5 x V x R.
3
Let’s see if you’ve understood the process of extracting the inverse of a function,
Do all functions have an inverse? No, as the following example shows.
Example: Let f : R R be the constant function given by f(x) = 1 V x " R. What is the
inverse of f ?
In view of this example, we naturally ask for necessary and sufficient conditions for f to have an
inverse. The answer is given by the following theorem.
Theorem 4: A function f ; A B has an inverse if and only if f is bijective.
Proof: Firstly, suppose f is bijective. We shall define a function g : B A and prove that g = f–1.
Let b B. Since f is onto, there is some a " A such that f(a) = b. Since f is one-one, there is only one
such a A. We take this unique element a of A as g(b). That is, given b B, we define g(b) = a,
where f(a) = b.
Note that, since f is onto, B = { f(a) | a A). Then, we are simply defining g : B A by g(f(a)) =
a. This automatically ensures that g o f = IA.
Now, let b B and g(b) = a. Then f(a) = b, by definition of g. Therefore, f o g(b) = f(g(b)) = f(a) =
b. Hence, f o g = IB.
Conversely, suppose f has an inverse and that g = f –1. We must prove that f is one-one and onto.
Note g o f is 1 – 1 f is 1 – 1
In this section we will spell out certain factorisation properties of integers that we will use
throughout the course. For this we first need to present the principle of finite induction.
We will first state an axiom of the integers that we will often use implicitly, namely, the
well-ordering principle. We start with a definition.
Definition: Let S be a non-empty subset of Z. An element a S is called a least element (or a
minimum element) of S if a b V b S. For example, N has a least element, namely, 1. But Z
has no least element. In fact, many subsets of Z, like 2Z, (-1, -2, -3, ...... ), etc., don’t have least
elements.
The following axiom tells us of some sets that have a least element.
You may be surprised to know that this principle is actually equivalent to the principle of finite
induction, which we now state.
(i) 1 S, and
Then S = N.
This theorem is further equivalent to:
(i) 1 S, and
Then S = N
We will not prove the equivalence of the well-ordering principle and Theorems 5 and 6 in this
course, since the proof is slightly technical.
Let us rewrite Theorems 5 and 6 in the forms that we will normally use.
Theorem 5: Let P(n) be a statement about a positive integer n such that
(i) P(1) is true, and
(ii) if P(m) is true for all positive integers m < k, then P(k) is true.
n 2 (n 1)2
Example: Prove that 13 + 23 + ............... + n3 = for every n N.
4
n 2 (n 1)2
Solution: Let Sn = 13 + ............ + n3, and let P(n) be the statement that S n .
4
12 2 2
Since S 1 , P(I) is true.
4
(n 1)2 n 2
Now, suppose P(n – 1) is true, i.e., S n 1
4
= Sn–1 + n3.
(n 1)2 n 2
= n 3 , since P(n – 1) is true.
4
n 2 (n 1)2 4n
=
4
n 2 (n 1)2
=
4
Thus, P(n) is true.
Now, use the principle of induction to prove the following property of numbers that you must
have used time and again.
1.5.2 Divisibility in Z
Proof: We will first prove that q and r exist. Then we will show that they are unique. To prove
their existence, we will consider three different situations : a = 0, a > 0, a < 0.
This shows that P(n) is true. Hence, by Theorem 5, P(n) is true, for any n N. That is, for a > 0, a
= qb + r, q, r Z, 0 r < b.
Case 3 (a < 0): Here (-a) > 0. Therefore, by Case 2, we can write
( q)b, if r 0
i.e., a
(–q – 1)b + (b - r), if 0 < r < b
This proves the existence of the integers q, r with the required properties.
Now let q’, r’ be in Z such that a = qb + r and a = q’b + r’, where 0 5 r, r’ < b. Then r – r’ = b(q’ – q).
Thus, b | (r – r’). But | r – r’ | < b. Hence, r – r’ = 0, i.e., r = r’ and q = q’. So we have proved the
uniqueness of q and r.
In the expression, a = qb + r, 0 & r < b, r is called the remainder obtained when a is divided by b.
Definition: Let a, b Z. c Z is called a common divisor of a and b if c | a and c | b. For example, Notes
2 is a common divisor of 2 and 4. You know that 1 and –1 are common divisors of a and b, for any
a, b Z. Thus, a pair of integers do have more than one common divisor. This fact leads us to the
following definition.
Definition: An integer d is said to be a greatest common divisor (g.c.d. in short) of two non-zero
integers a and b if
But then d = ma + nb = malc + nb1c = (ma1 + nb1)c. Thus, c | d, So we have shown that d is a g.c.d.
In fact, it is the unique positive g.c.d. (a,b).
For example, the g.c.d. of 2 and 10 is 2 = 1.2 + 0.10, and the g.c.d, of 2 and 3 is 1 = (–1)2 + l(3).
Definition: If (a,b) = 1, then the two integers a and b are said to be relatively prime (or coprime)
to each other.
Using Theorem 8, we can say that a and b are coprime to each other iff there, exist m, n Z such
that 1 = ma + nb.
Definition: A natural number p ( 1) is called a prime if its only divisors are 1 and p. If a natural
number n (% 1) is not a prime, then it is called a composite number.
Notes For example, 2 and 3 are prime numbers, while 4 is a composite number.
Now consider the number 50. We can write 50 = 2 × 5 × 5 as a ,product of primes. In fact we can
always express any natural number as a product of primes. This is what the unique prime
factorisation theorem says.
Theorem 10 (Unique Prime Factorisation Theorem): Every integer n > 1 can be written as
n = p1.p2 ........ pn, where pl, ........ pn are prime numbers. This representation is unique, except for
the order in which the prime factors occur.
Proof: We will first prove the existence of such a factorisation. Let P(n) be the statement that
n + l is a product of primes. P(1) is true, because 2 is a prime number itself.
Now let us assume that P(m) is true for all positive integers m < k. We want to show that P(k) is
true. If (k+l) is a prime, P(k) is true. If k+l is not a prime, hen we can write k + l = m, m2, where
1 < m, < k + l and 1 < m2 < k + l. But then P(ml – 1) and P(m2 – 1) are both true. Thus, ml = p1p2. ....
pr, m2 = q1q2 .........qs, where pl,p2, .......Pr, ql, q2, .......qs are primes. Thus,
k + 1 = pl p2 ... pr q1 q2 .... qs, i .e., P(k) is true. Hence, by Theorem 6, P(n) is true for every n N.
pl, p2, ......p1. ql, q2, ......, qs are primes. We will use induction on t.
If t = 1, then p1 = q1, q2 ....... qs. But p1 is a prime. Thus, its only factors are 1 and itself.
Now suppose t > 1 and the uniqueness holds for a product of t–1 primes. Now p1 | q1q2 ......qs and
hence, p1 | qi for some i, By re-ordering q1 ....... q, we can assume that p1 | q1. But both
p1 and q1 are primes. Therefore, p1 = ql. But then p2 ..... p, = q2 ....... q,. So, by induction, t–1 = s–1
and p2 ,........ pt are the same as q2. ......q,, in some order.
The primes that occur in the factorisation of a number may be repeated, just as 5 is repeated in
the factorisation 50 = 2 × 5 × 5. By collecting the same primes together we can give the following
corollary to Theorem 10.
Corollary: Any natural number n can be uniquely written as n = plml p2m2 .....Prmr where for i = 1,
2, ....... r, each mi N and each pi is a prime with 1 < p1 < p2 < .... < p,.
As an application of Theorem 10, we give the following important theorem, due to the ancient
Greek mathematician Euclid.
Therefore, no pi divides n. But since n > 1, Theorem 10 says that n must have a prime factor. We
reach a contradiction. Therefore, the set of primes must be infinite.
(b) {x | x < 5}
(c) {1, 2, 3,...}
(a) A = {x | 6x = 24 and 3x = 1}
(b) B = {x | x + 10 = 10}
4. Let D E. Suppose a D and b E. Which of the following statements must be true?
(a) cD
(b) bD
(c) aE
(d) aD
5. Let A = {x | x is even}, B = {1, 2, 3... 99, 100}, C = {3, 5, 7, 9}, D = {101, 102} and E = {101, 103,
105}. Which of these sets can equal S if S A and S and B are disjoint?
(d) D (e) E
(c) AB
(d) AB
7. Which set S does the power set 2S = {, {1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}, {1, 2, 3}} come from?
(a) {{1},{2},{3}}
(b) {1, 2, 3}
The definition of a function, a 1-1 function, an unto function and a bijective function.
The composition of functions.
The well-ordering principle, which states that every subset of N has a least element.
The principle of finite induction, which states that : If P(n) is a statement about some n N
such that
Properties of divisibility in Z, like the division algorithm and unique prime factorisation.
1.7 Keywords
Empty Set: A set with no element in it is called the empty set, and is denoted by the Greek letter
(phi). For example, the set of all natural numbers less than 1 is .
Roster Method: It is sometimes used to list the elements of a large set also. In this case we may
not want to list all the elements of the set.
Union: If A and B are subsets of a set S, we can collect the elements of both to get a new set. This
set is called their union.
1. Let C = {1, 2, 3, 4} and D = {1, 3, 5, 7, 9}. How many elements does the set C D contain?
How many elements does the set CD contain?
2. Let U = {1, 2, 3... 8, 9}, B = {1, 3, 5, and 7} and C = {2, 3, 4, 5, 6}. How many elements does the
set (B C)’ contain? How many elements does the set (C – B)’ contain?
3. Let S = {a, b}. How many elements does the power set 2S contain?
3. Set B is not an empty set because it contains one element. The only element of the set B is
zero. B = {0}
4. a E is true, because a D and D E means that every element from D is contained in E.
5. The correct answer is E, because E consists of even numbers as elements and the intersection
of sets S and B is a null set.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
2.2 Group
2.6 Keywords
Objectives
Describe the cancellation laws and laws of indices for various groups
Introduction
The theory of groups is one of the oldest branches of abstract algebra. It has many applications
in mathematics and in the other sciences. Group theory has helped in developing physics,
chemistry and computer science. Its own roots go back to the work of the eighteenth century
mathematicians Lagrange, Ruffini and Galois.
In this unit, we will study about the group theory in detail. We surge fine groups and give some
examples. After that we understand details of some properties of groups that the elements of a
group satisfy. Let us discuss all these one by one.
As you all know common operations of addition and multiplication in R, Q and C. All these
operations are examples of binary operations. It can be defined as:
Definition: Let S be a non-empty set. Any function * : S × S S is called a binary operation
on S.
So, a binary operation associates a unique element of S to every ordered pair of elements of S.
For a binary operation * on S and (a, b) S × S, we denote *(a, b) by a*b.
We will use symbols like +, –, ×, , o, * and A to denote binary operations.
Let us look at some examples.
(i) + and x are binary operations on Z. In fact, we have + (a, b)= a + b and × (a, b) = a × b a,
b Z. We will normally denote a × b by ab.
(ii) Let (S) be the set of all subsets of S. Then the operations and are binary operations
on (S), since A B and A B are in (S) for all subsets A and B of S.
(iii) Let X be a non-empty set and F(X) be the family of all functions f : X X. Then the
composition of functions is a binary operation on F(X), since fog F(X) f, g F(X).
Note A binary operation on S is always closed on S, but may not be closed on a subset
of S.
Task For the following binary operations defined on R, determine whether they are
commutative or associative. Are they closed on N?
(a) x y = x + y – 5
(b) x * y = 2(x + y)
xy
(c) x y =
2
for all x, y R.
Notes As you are familiar with the equation such as a(b + c) = ab + ac and (b + c)a = bc +ba a, b, c R.
As this equation explains that multiplication distributes over addition in R. In general we can
define this as.
Definition: If o and * are two binary operations on a set S, then we say that * is distributive over
o if a, b, c S, we have a * (b o c) = (a * b) o (a * c) and (b o c) * a = (b * a) o (c * a).
ab b c ab ac
For example, let a * b = a, b R. Then a(b a c) = a = ab * ac, and (b * c)a
2 2 2
bc ba ca
= a ba * ca a,b,c R.
2 2
Let us now look deeper at some binary operations. You know that, for any a " R, a + 0 =a, 0 + a
= a and a + (–a) = (–a) + a = 0. We say that 0 is the identity element for addition and (–a) is the
negative or additive inverse of a.
Definition: Let *.be a binary operation on a set S. If there is an element e S such that a S,
a * e = a and e * a = a, then e is called an identity element for *.
(b) Suppose there exist a, b S such that s * a = e = a * s and s * b = e = b * s, e being the identity
element for *, Then
a=a*e=a*(s*b)
= (a * s) * b, since * is associative.
=e*b=b.
That is, a = b.
This uniqueness theorem allows us to say the identity element and the inverse, henceforth.
A binary operation may or may not have an identity element. For example, the operation of
addition on N has no identity element.
Similarly, an element may not have an inverse with respect to a binary operation. For example, Notes
2 " Z has no inverse with respect to multiplication on Z, does it?
Now let us consider the following examples.
Example: Let S be a non-empty set. Consider (S), the set of all subsets of S. Are and
commutative or associative operations on (S)? Do identity elements and inverses of elements
of (S) exist with respect to these operations?
When the set S under consideration is small, we can represent the way a binary operation on S
acts by a table.
Let S be a finite set and * be a binary operation on S. We can represent the binary operation by
a square table, called an operation table or a Cayley table. The Cayley table is named after the
famous mathematician Arthur Cayley (1821-1895).
To write this table, we first list the elements of S vertically as well as horizontally, in the same
order. Then we write a * b in the table at the intersection of the row headed by a and the column
headed by b.
For example, if S = (–1, 0, 1) and the binary operation is multiplication, denoted by., then it can
be represented by the following table.
. –1 0 1
–1 (–1) . (–1) (–1) . 0 (–1) . 1
=1 =0 = –1
0 0 . (–1) 0.0 0.1
=0 =0 = –1
1 1 . (–1) 1.0 1.1
= –1 =0 =1
Notes Conversely, if we are given a table, we can define a binary operation on S. For example, we can
define the operation * on S = {1, 2, 3} by the following table.
* 1 2 3
1 1 2 3
2 3 1 2
3 2 3 1
Again, (2 * 1) * 3 = 3 * 3 = 1 and 2 * (1 * 3) = 2
(2 * 1) * 3 2 * (1 * 3). , * is not associative.
Definition: Let * be a binary operation on a non-empty set S and let a,, . . . . . .,ak+1, S. We define
the product a, * . ... .. * ak+1, as follows:
If k = 1, a, * a2 is a well defined element in S.
Theorem 2: Let a,, ...... ,am+n be elements in a set S with an associative binary operation *. Then
(a, * ..,...* a,) * (a,,, * ...... * am+n) = a1 * ...... * am+n.
Proof: We use induction on n. That is, we will show that the statement is true for n = 1. Then,
assuming that it is true for n – 1, we will prove it for n.
If n = 1, our definition above gives us
Then
= a, * ....... a, by definition.
We will use Theorem 2 quite often in this course, without explicitly referring to it.
Now that we have discussed binary operations let us talk about groups.
After understanding the concept of binary operations. Let us start defining group.
Definition: Let G be a non-empty set and * be a binary operation on G. We say that the pair
(G, * ) is a group if
G 1) * is associative:’
G 2) G contains an identity element e for * , and
G 3) every element in G has an inverse in G with respect to *.
We will now give some examples of groups.
Now, multiplication in Z is associative and 1 Z is the multiplicative identity. But does every
element in Z have a multiplicative inverse? No. For instance, 0 and 2 have no inverses with
respect to ‘.’ Therefore, (Z,.) is not a group.
Note that (Z,.) is a semigroup since it satisfies GI. So, there exist semigroups that aren’t groups!
Actually, to show that (G, *) is a group it is sufficient to show that * satisfies the following
axioms.
G 1’) * is associative.
What we are saying is that the two sets of axioms are equivalent. The difference between them
is the following:
In the first set we need to prove that e is a two-sided identity and that the inverse b of any a G
satisfies a * b = e and b * a = e. In the second set we only need to prove that e is a one-sided identity
and that the inverse b of any a G only satisfies a * b = e.
Clearly, if * satisfies GI, G2 and G3, then it also satisfies Gl’, G2' and G3'. The following theorem
tells us that if * satisfies the second set of axioms, then it satisfies the first set too.
Theorem 3: Let (G, * ) satisfy Gl’, G2’ and G3’. Then e * a = a a G. Also, given a G, if bG
such that a * b = e, then b * a = e. Thus, (G, *) satisfies G1, G2 and G3.
To prove this theorem, we need the following result.
Lemma 1: Let (G, * ) satisfy Gl’, G2' and G3'. If a G such that a *a = a, then a = e.
Notes Now (a * a) * b = a * b = e.
b * a = e. Now,
(b * a) * (b * a) = (b * (a * b)) * a = (b * e) * a = b * a .
Therefore, by Lemma 1, b * a = e. Therefore, G3 is true.
Now we will show that G2 holds. Let a G. Then by G2', for a G, a * e = a. Since G3 holds,
b G such that a * b = b * a = e. Then
e * a = (a * b) * a = a * (b * a) = a * e = a .
That is, G2 also holds.
Example: Let G = { ±1, i }, i = 1. Let the binary operation be multiplication. Show
that (G) is a group.
1 –1 i –i
1 1 –1 i –i
–1 –1 1 –i i
i i –i –1 1
–i –i i 1 –1
This table shows us that a.l = a a G. Therefore, 1 is the identity element. It also shows us that
(G) satisfies G3. Therefore, (G) is a group.
Definition: If (G, *) is a group, where G is a finite set consisting of n elements, then we say that
(G, *) is a Finite group of order n. If G is an infinite set, then we say that (G,*) is an infinite group.
Now let us discuss an example of a non-commutative (or non-abelian) group. Before doing this
example recall that an m x n matrix over a Set S is a rectangular arrangement of elements of S in
m rows and n columns.
a b
If A = then ad-bc is called the determinant of A and is written as det
Note c d
A or |A|
Notes
Example: Let G be the set of all 2 x 2 matrices with non-zero determinant. That is,
a b
G a, b, c, d r, ad – bc 0
c d
a b p q ap br aq bs
A and p r s in G, A.P cp dr cq ds
c d
1 0
We also know that matrix multiplication is associative and is the multiplicative identity.
0 1
a b
Now, for A = in G. the mamx
c d
d b
ad bc ad bc * 1 0
B is such that det B 0 and AB .
c a ad bc 0 1
ad bc ad bc
Thus, B = A–1. (Note that we have used the axiom G3' here, and not G3.) This shows that the set of
all 2 × 2 matrices over R with non-zero determinant forms a group under multiplication. Since
1 2 0 1 2 1
3 and
4 1 0 4 3
0 1 1 2 3 4
1
0 3 4 1 2
Note that each element fa,b in T is represented by a point (a, b) in R2. Show that (T, o) is a group,
where o denotes the composition of functions.
Note that fa,b o fc,d = fc,d o fa,b fa,b, fc,d T. Therefore, (T, o) is abelian.
Before understanding the properties of group lets first give notational conventions.
Convention: Let us, we will denote a group (G, *) by G, if there is no danger of confusion. We
will also denote a * b by ab, for a, b G, and say that we are multiplying a and b. The letter e will
continue to denote the group identity.
(b) For a, b G, ab G. Therefore, (ab)-1 G and is the unique element satisfying (ab) (ab)-1
= (ab)-l (ab) = e.
= (a e) a-1
= aa-1
=e
-1 -1
Similarly, (b a ) (ab) = e.
You know that whenever ba = ca or ab = ac for a, b, c in R*, we can conclude that b = C. That is,
we can cancel a. This fact is true for any group.
Proof: We will first show that these linear equations do have solulions in G, and then we will
show that the solutions are unique.
For a, b G, consider a-1 b G. We find that a(a-1 b) = (aa-1) b = eb = b. Thus, a-1 b satisfies the
equation ax = b, i.e., ax = b has a solution in G.
But is this the only solution? Suppose x1, x2 are two solutions of ax = b in G. Then ax, = b = ax2. By
the left cancellation law, we get xl = x2. Thus, a-1 b is the unique solution in G.
Similarly, using the right cancellation law, we can show that ba-1 is the unique solution of
ya = b in G.
2 3 1 5
Example: Consider A ,B 0 4 in GL, (R)
1 2
2 3
A–1 =
1 2
2 2
A–1B = X.
1 3
Example: Let S be a non-empty set. Consider (S) with the binary operation of
symmetric difference A, given by
Show that ((S), A) is an abelian group. What is the unique solution for the equation Y A=B?
Solution: A is an associative binary operation. This can be seen by using the facts that
A\B=A BC, (A B)C = AC BC, (A B)C = AC BC
Notes and that and are commutative and associative. A is also commutative since A A B
= B A A, B (S).
For A, B in ((S), A) we want to solve Y A A = B. But we know that A is its own inverse. So, by
Theorem 6, Y = B A A-1 = B A A is the unique solution. What we have also proved is that (B A A)
A A = B for any A, B in (S).
Definition: Let G be a group. For a G, we define
(i) a0 = e.
Notes When the notation used for the binary operation is addition, an becomes
na. For example, f a any a Z,
na = 0 if a = 0,
na = a + a+ ... +a (n times) if n > 0,
Proof: We prove (a) and (b), and leave the proof of (c) to you.
(an)-1 = (a-1)n.
Also, (a-1)n = a-n, by definition.
(an)-1 = [a-(n)]-1
= ad
Also, (a-1)n = (a-l)-(-n)
= [(a-1)-1]-n, by the case n > 0
= a*.
So, in this case too,
(an)-1 = a-n = (a-1)n.
(b) If m = 0 or n = 0, then am+n = am . an. Suppose m 0 and n 0.
Case 1 (m > 0 and n > 0): We prove the proposition by induction on n.
Then, am . an = am(an-1 . a) = (am . an-1) a = am+n-1 . a = am+n . Thus, by the principle of induction, (a) holds
for all m > 0 and n > 0.
Case 2 (m < 0 and n < 0): Then (-m) 0 and (-n) > 0. Thus, by Case 1, a-n . a-m = a-(n+m) = adwn). Taking
inverses of both the sides and using (a), we get,
The cases when m < 0 and n > 0 are similar to Cases 3 and 4. Hence, awn = am . an for all a G and
m, n Z.
Consider the set of integers, Z, and n N. Let us define the relation of congruence on Z by : a is
congruent to b modulo n if n divides a-b. We write this as a b (mod n). For example, 4 1
(mod 3), since 3 | (4-1).
Similarly, (-5) 2(mod 7) and 30 0 (mod 6).
is an equivalence relation, and hence partitions Z into disjoint equivalence classes called
congruence classes modulo n. We denote the class containing r by r.
Thus, r ={ m Z | m r (modn) }.
So an integer m belongs to r for some r, 0 r < n, iff n | (r-m), i.e., iff r–m = kn, for some k Z.
r = { r + kn | k
Now, if m n, then the division algorithm says that m = nq + r for some q, r Z, 0 r < n. That
is, m r (mod n), for some r = 0, .,..., n-1. Therefore, all the congruence classes modulo n are
0, 1, ....., n 1. Let Z n {0,1, 2, ....., n 1}. We define the operation + on Zn by a b a b.
Notes
Is this operation well defined? To check this, we have to see that if a b c d in Zn, then
a b c d.
Now, a b (mod n) and c d (mod n). Hence, there exist integers kl and k2 such that a – b = k1n
and c – d = k2n. But then (a + c) – (b + d) = (a – b) + (c – d) = (k1 + k2)n.
ac bd.
addition is commutative in Z
(ii)
a b c b c a (b c)
a b c (a b) c (a b) c a, b, c Z n ,
But (Z,, .) is not a group. This is because every element of Zn, for example Q, does not have a
multiplicative inverse.
But, suppose we consider the non-zero elements of Zn, that is, Z *n ,. . Is this a group? For example,
Z *4 1, 2, 3
is not a group because * is not even a binary operation on Z *4 , since
We will now discuss the symmetric group briefly. In Next Unit we will discuss this group in
more detail.
Let X be a non-empty set. We have seen that the composition of functions defines a binary Notes
operation on the set F(X) of all functions from X to X. This binary operation is associative.
IX, the identity map, is the identity in F(X).
Now consider the subset S(X) of F(X) given by
S(X) = {f F(X) | f is bijective }.
So f S(X) iff f-l : X X exists. Remember that f o f-1 = f-1 . o f = IX. This also shows that f-1 S(X).
Now, for all f, g in S(X),
(g o f) o (f-1 o g-1) = IX = (f-1 o g-1) o (g o f), i.e., g o f S(X).
Thus, o is a binary operation on S(X).
Let us check that (S(X), o) is a group.
Suppose we want to construct an element f in Sn. We can start by choosing f(1). Now, f(1) can
be any one of the n symbols 1,2, ..... n. Having chosen f(l), we can choose f(2) from the set
{ l,2 ........ n } \ { f(l) }, i.e., in (n – 1) ways. This is because f is 1 – 1. Inductively, after choosing f(i),
we can choose f(i + l) in (n – i) ways. Thus, f can be chosen in (1 × 2 × .... × n) = n ! ways, i.e.,
Sn contains n ! elements.
1 2 .......... n
.......... f(n)
f(1) f(2)
1 2 3 4
For example, represents the function f : (1, 2. 3. 4) {1. 2, 3. 4) : f (1) = 2, f(2) = 4,
2 4 3 1
f (3) = 3, f (4) = 1. The elements in the top row can be placed in any order as long as the order of
the elements in the bottom row is changed accordingly.
2 1 3 4
Thus, also represents the same function f.
4 2 3 1
Definition: We say that f Sn is a cycle of length r if there are x,, ...., x, in X = { 1, 2, ....., n) such that
f(xi) = xi+1 for 1 i r – 1, f(xr) = x1 and f(t) = t for t # x, ..., x,. In this case f is written as (x1 .... x,).
For example, by f = (2 4 5 10) S10, we mean f (2) = 4, f (4) = 5, f (5) = 10, f (10) = 2 and f (j) = j for
j # 2, 4, 5, 10.
1 2 3 4 5 6 7 8 9 10
i.e., f
1 4 3 5 10 6 7 8 9 2
Notes
Note In the notation of a cycle, we don’t mention the elements that are left fixed
2 5
by the permutation. Similarly, the permutation is the cycle (1 2 5 3 4 ) in S5.
5 3
Now let us see how we calculate the composition of two permutations. Consider the following
example in S5.
1 2 3 4 5 1 2 3 4 5
° =
2 5 4 3 1 5 3 4 1 2
1 2 3 4 5
=
(1) (2) (3) (4) (5)
1 2 3 4 5
=
(5) (3) (4) (1) (2)
1 2 3 4 5
= 1 4 3 2 5 (2 4).
In this sub-section we will show that the set of complex numbers forms a group with respect to
addition. Some of you may not be acquainted with some basic properties or complex numbers.
Consider the set C of all ordered pairs (x, y) of real numbers. i.e.. we take C = R × R. Define
addition (+) and multiplication (.) in C as follows:
(x1, y1) + (x2, y2) = (x1 + x2, y1 + y2) and
This gives us an algebraic system (C, +, .) called the system of complex numbers. We must
remember that two complex numbers (x1, y1) and (x2, y2) are equal iff x1 = x2 and y1 = y2.
You can verify that + and . are commutative and associative.
Moreover,
x y
(x, y). 2 2
, 2
x y x y2
x ( y) y x
= x . 2 2
y. 2 2
,x. 2 2
y 2
x y x y x y x y2
= (1, 0)
x y
Thus, 2 2
, is the multiplicative inverse of (x, y) in C.
x y x y2
2
Thus, (C, +) is a group and (C*,.) is a group. (As usual, C* denotes the set of non-zero complex
numbers.)
Now let us see what we have covered in this unit.
Self Assessment
2.5 Summary
We proved and used the cancellation laws and laws of indices for group elements.
In this unit we discussed the group of integers modulo n, the symmetric group and the
group of complex numbers.
Binary Operation: A binary operation on S is always closed on S, but may not be closed on a subset
of S.
Abelian Group: If (G, *) is a group, where G is a finite set consisting of n elements, then we say
that (G, *) is a Finite group of order n. If G is an infinite set, then we say that (G,*) is an infinite
group.
1 2 3 4
1. Obtain the identity element, if it exists, for the operations .
2 4 3 1
1 2 3 4
2. For x R, obtain x-1 (if it exists) for each of the operations .
2 4 3 1
(b) (1 3 2)
Show that (1 2) ° (1 3 2)–1 (1 2)-1 ° (1 3 2)-1. (This shows that in Theorem 4(b) we can’t write
(ab)-1 = a-1b-1.)
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
3.1 Subgroups
3.2 Properties of Subgroups
3.4 Summary
3.5 Keywords
Objectives
Define subgroups
Explain the intersection, union and product of two subgroups
Introduction
In the last unit, you have studied about the algebraic structures of integers, rational numbers,
real numbers and complex numbers. You have got an idea that, not only is Z Q R C, but
the operations of addition and multiplication coincide in these sets. In the present unit, you will
go through more examples of subsets of groups which are groups in their own right. Such
structures are rightfully named subgroups. We will discuss some of their properties also. We
will see some cases in which we obtain a group from a few elements of the group. In particular,
we will study cases of groups that can be built up by a single element of the group.
3.1 Subgroups
In the previous unit, you have already read the concept of group. You also noted that group (Z+),
(Q+) and (R+) are the member of a bigger group (C+) complex number. These all groups that
contained in bigger group are not just subsets but groups.
Now, if (H,*) is a subgroup of (G,*), can the identity element in (H,*) be different from the
identify element in (G,*)? Let us see. If h is the identity of (H,*), then, for any a H, h * a = a * h
= a. However, a H G. Thus, a * e = e * a = a, where e is the identity in G. Therefore, h * a =
e * a.
By right cancellation in (G,*)w,e get h = e.
Thus, whenever (H, *) is a subgroup of (G,*), e H.
Remark 1: (H,*) is a subgroup of (G, *) if and only if
(i) e H,
(ii) a, b H a * b H,
(iii) a H a-1 H.
We would also like to make an important remark about notation here.
Remark 2: If (H,*) is a subgroup of (G,*), we shall just say that H is a subgroup of G, provided that
there is no confusion about the binary operations. We will also denote this fact by H G.
Now let us first discuss an important necessary and sufficient condition for a subset to be a
subgroup.
1
|z1 z2–1| = |z1| |z2–1| = |z1| 1.
|z 2 |
Notes
Example: Consider G = M2×3 (C), the set of all 2 × 3 matrices over C. Check that (G,+) is an
abelian group. Show that
0 a b
S a, b, c C is a subgroup of G.
0 0 c
a b c p q r a p b q c r
d e f s t u d s e t f u
0 0 0 a c
You can see that + is a binary operation on G. 0 = is the additive identity and d f
0 0 0
a b c
is the inverse of G.
d e f
0 a b 0 d e
0 0 c , 0 0 f S , we see that
0 a b 0 d e 0 a d b e
0 0 c , 0 0 f 0 0 c f
S.
= S I G.
Example: Consider the set of all invertible 3 × 3 matrices over R, GL 3 (R). That is,
A GL3(R) iff det (A) 0. Show that SL3 (R) = (A GL3(R) | det(A) = I ) is a subgroup of (GL3(R),.).
1
det (AB-1) = det (A) det(B-1) = 1, since det (A) = 1 and det (B) = I.
det(B)
AB-1 SL3(R)
SL3(R) I GL3(R).
Example: Any non-trivial subgroup of (Z, +) is of the form mZ, where m N and
mZ = { mt | t Z) = { 0, m, ± 2m, ± 3m,....... ).
Solution: We will first show that mZ is a subgroup of Z. Then we will show that if H is a
subgroup of Z, H # {0}, then H = mZ, for some m N.
Notes Now, 0 mZ. Therefore, mZ . Also, for mr, ms mZ, mr-ms = m(r-s) mZ.
Therefore, mZ is a subgroup of Z.
Note that m is the least positive integer in mZ.
Now, let H (0) be a subgroup of Z and S = { i | i > 0, i H).
Since H # {0), there is a non-zero integer k in H. If k > 0, then k S. If k < 0, then (-k) S, since
(–k) H and (–k) > 0.
Hence, S .
Clearly, S N. Thus, by the well-ordering principle S has a least element, say s. That is, s is the
least positive integer that belongs to H.
Now sZ H. Why? Well, consider any element st sZ.
If t = 0, then st = 0 H.
If t > 0, then st = s + s + ..... + s (t times) H.
Now, let m H. By the division algorithm m = ns + r for some n, r Z, 0 r < s. Thus, r = m – ns.
But H is a subgroup of Z and m, ns H. Thus, r H. By minimality of s in S, we must have r = 0,
i.e., m = ns. Thus, H sZ.
Equating the modulus of both the sides of (1). we get rn = 1, i.e., r =l.
On comparing the arguments of both sides of (1), we see that 0 + 2nk (k Z) and n are
arguments of the same complex number. Thus, n can take any one of the values 2k, k Z. Does
2nk
this mean that as k ranges over Z and ranges over we get distinct nth roots of 1? Let us
n
2nk 2zk 2 m 2 m 2nk 2nm
find out. Now, cos + i sin = cos i sin if and only if 2nt for
n n n n n n
some t Z. This will happen iff k = m + nt, i.e., k m (mod n). Thus, corresponding to every r
2 r 2 r
in Z, we get an nth root of unity, z = cos i sin , 0 r n; and these are all the nth roots
n n
of unity.
For example, If n = 6, we get the 6th roots of 1 as z 0 , z l , z 2 , z 3 , z 4 and z 5 , where zj =
2 j 2 j
cos i sin , j 0,1, 2, 3, 4, 5. In Figure 3.1 you can see that all these lie on the unit circle (i.e.,
6 6
the circle of radius one with centre (0, 0)). They form the vertices of a regular hexagon.
2 2
Now, let = cos i sin . Then all the nth roots of 1 are 1, , 2 , ......., n-1 , since
n n
2 j 2 j
j cos i sin for 0 j n 1 (using De Moivre’s theorem).
n n
Finally, if i U,, then 0 n – i n – 1 and i . n–i = n = 1; i.e., n-i is the inverse of oi for all
1 i < n. Hence, Un is a subgroup of C*.
Note that Un, is a finite group of order n and is a subgroup of an infinite group, C*. So, for every
natural number n we have a finite subgroup of order n of C*.
Before ending this we will introduce you to a subgroup that you will use off and on.
z(G) = {G G | xg = gx x G}.
Thus, Z(G) is the set of those elements of G that commute with every element of G.
Also, for any a, b Z(G) and for any x G, (ab)x = a(bx) = a(xb) = (ax)b = (xa)b = x (ab).
ab Z(G).
After discussing the term subgroup let us start understanding the important properties of
subgroup.
Example: In earlier example we have seen that my subgroup of Z is of the form mZ for
some m N. Let mZ and kZ be two subgroups of Z. Show that rnZ is a subgroup of kZ iff k | m.
Conversely, suppose k | m.
Then, m = kr for some r Z. Now consider any n mZ, and let t Z such that n = mt.
Hence, mZ kZ.
Thus, mZ kZ iff k | m.
Thus, H K .
Hence, H K is a subgroup of G.
The whole argument of Theorem 4 remains valid if we take a family of subgroups instead of just Notes
two subgroups. Hence, we have the following result.
of G.
Now question arises that does the union of two or more subgroup is again a subgroup. Lets see
its true or not. Consider the, two subgroups 2Z and 3Z of Z. Let S = 2Z 32. Now, 3 32 S,
2 22 S, but 1 = 3 – 2 is neither in 2Z nor in 3Z. Hence, S is not a subgroup of (Z, +). Thus, if A
and B are subgroups of G, A B need not be a subgroup of G. But, if A B, then A B = B is a
subgroup of G. The next exercise says that this is the only situation in which A B is a subgroup
of G.
Let us now see what we mean by the product of two subsets of a group G.
= 62.
In this example we find that the product of two subgroups is a subgroup. But is that always so?
Consider the group
S3 = {I, (1 2), (1 3), (2 3), (1 2 3), (1 3 2)}, and its subgroups H = { I, (1 2) } and K = { I, (1 3)).
1 2 3 1 2 3
(Remember, (1 2) is the permutation and (1 2 3) is the permutation 2 3 1 .
2 1 3
= { I, (1 3), (1 2), (1 3 2) }
HK is not a subgroup of G, since it is not even closed under composition. (Note that (1 3) (1 2)
= (1 2 3) HK.)
So, when will the product of two subgroups be a subgroup? The following result answers this
question.
Proof: Firstly, assume that HK G. We will show that HK = KH; Let hk HK. Then
(hk)-1 = k-1 h-1 HK, since HK G.
Therefore, k-1 h-1 = h1 kl for some hi H, k1 K. But then hk = (k-1 h-1)-1 = k1-1 h1-1 KH. Thus,
HK KH.
Now, we will show that KH HK. Let kh KH. Then (kh)-1 = h-1 k-1 HK. But HK G. Therefore,
((kh)-1)-1 HK, that is, kh HK. Thus, KH HK.
Hence, we have shown that HK = KH.
Now, (kk1-1) h1-1 KH = HK, Therefore, 3 h2k2 HK such that (kk1-1)h1-1 = h2k2.
Then, ab-1 = h(h2k2) = (hh2)k2 HK.
Thus, by Theorem 1, HK G.
The following result is a nice corollary to Theorem 5.
Corollary: If H and K are subgroups of an abelian group G, then HK is a subgroup of G.
Let G be any group and S a subset of G. Consider the family F of all subgroups of G that contain
S, that is,
F = { H | H G and S H } .
Note that
(i) S H.
HF
(ii) H
H F
is the smallest subgroup of G containing S. (Because if K is a subgroup of G
Definition: If S is a subset of a group G, then the smallest subgroup of G containing S is called the
subgroup generated by the set S, and is written as <S>.
Thus, <S> = { H | H G, S H }.
If <S> = G, then we say that G is generated by the set S, and that S is a set of generators of G.
If the set S is finite, we say that G is finitely generated.
We will give an alternative way of describing <S>. This definition is much easier to work with
than the previous one.
n n n
<S> = a1 1 a2 2 ..... ak k |ai S for 1 i k, n 1 , ..., n k Z .
n n n
Proof: Let A = a1 1 a2 2 ..... ak k |ai S for 1 i k, n 1 , ..., n k Z .
Now, let us see why <S> A. We will show that A is a subgroup containing S. Then, by the Notes
definition of <S>, it will follow that <S> A.
Since any a S can be written as a = a1, S A.
Since S , A .
y = bm m2 mr
1 b 2 .... b r
1
aj , bj S for 1 i k, 1 j r.
1
Then xy-1 = a n1 a n2 ..... a kn
1 2 k
b m1
1 b m2 2 .... bmr r
Note that, if (G, +) is a group generated by S, then any element of G is of the form n1 al + n2 a2 +
..... + nr ar, where a1, a2,....., a, S and n1, n2, ....., nr, Z.
For example, Z is generated by the set of odd integers S = { ± 1, f3, ± 5, ......). Let us see why. Let
m Z. Then m = 2rs where r 0 and s S. Thus, m <S>. And hence, <S> = Z.
Definition: A group G is called a cyclic group if G = < {a) > for some a E G. We usually write
< {a) > as < a >.
Then < (1 2) > = { I, (1 2)), since (1 2)2 = I, (1 2)3 = (1 2), and so on.
Proof: Let G = < a > = { an | n Z). Then, for any x, y in G, there exist m, n Z such that x = am, y
= an. But, then, xy = am. an am+n = an+m = an. am = yx. Thus, xy = yx for all x, y in G.
That is G is abelian.
Note Theorem 7 says that every cyclic group is abelian. But this does not mean that
every abelian group is cyclic.
Notes
Example: Consider the set K4 = {e, a, b, ab] and the binary operation on K4 given by the
table.
× e a b ab
e e a b ab
a a e ab b
b b ab e a
ab ab b a e
This group is called the Klein 4-group, after the pioneering German group theorist Felix Klein.
Therefore, < a > = { e, a }. Similarly, < b > = { e, b } and < ab > = { e, ab).
Now, Theorem 8 says that every subgroup of a cyclic group is cyclic. But the converse is not true.
That is, we can have groups whose proper subgroups are all cyclic, without the group being
cyclic.
Consider the group S3, of all permutations on 3 symbols. Its proper subgroups are
A = <I>
B = <(1 2)>
C = <(1 3)>
D = <(2 3)>
E = <(1 2 3)>.
As you can see, all these are cyclic. But, you know that S3 itself is not cyclic.
Now we state a corollary to Theorem 8, in which we write down the important point made in the Notes
proof of Theorem 8.
Corollary: Let H {e} be a subgroup of < a >. Then H = < an >, where n is the least positive integer
such that an H.
Self Assessment
(a) (b)
(c) (d)
4. Let G be a group, H be subgroup of G and be subgroup of H then k is a .................. of G.
3.4 Summary
Any subgroup of a cyclic group is cyclic, but the converse need not be true.
3.5 Keywords
Notes Cyclic Groups: Let G be any group and S a subset of G. Consider the family F of all subgroups of
G that contain S, that is, F = { H | H G and S H }.
2 2 2 2
= i and = i.
2 2 2 2
a b
K =
c d d a, c 2b,ad bc 0
1 0
10. Compute the centralizer in GL2 (R) of the matrix .
1 0
m b
11. Let G be the subgroup of GL2 (R) defined by G = 0 1 m 0 .
1 1 1 0
12. Let A = and B = 0 1 . Find the centralizers C(A) and C(B), and show that C(A)
0 1
C(B) = Z(G), where Z(G) is the center of G.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
4.1 Cosets
4.2 Lagrange's Theorem
4.3 Summary
4.4 Keywords
Objectives
Introduction
In the last unit, you have studied about the subgroup and different properties of subgroups. In
this unit, you will learn the concept of cosets and also see how a subgroup can partition a group
into equivalence classes. You can use cosets to prove a very useful result about the number of
elements in a subgroup. In the present era, this elementary theorem is known as Lagrange's
theorem, though Lagrange proved it for subgroups of S only. Let us understand these concepts
with the help of examples and theorem.
4.1 Cosets
First of all we will discuss cosets. Cosets means the product of two subset of a particular group.
In a case when one of the subsets consists of single element only, we will go through a situation
i.e.,
H(x) = {hx | h H}.
Hx = {hx | h H}
a right coset of H in G. The element x is a representative of Hx.
xH={xh | h H} . Notes
Note that, if the group operation is +, then the right and left cosets of H in (G,+) represented by
x G are
H + x = { h + x | h H} and x + H = { x + h | h H }, respectively.
In general, the distinct right cosets of H (= nZ) in Z are H, H + 1, ....., H + (n – 1). Similarly, the
distinct left cosets of H (= nZ) in Z are H, 1 + H, 2 + H, ....., (n – 1) + H.
After understanding the concept of cosets. Let us discuss some basic and important properties of
cosets.
Then
(a) X HX
(b) Hx = H x H.
(c) Hx = H xy–1 H.
Conversely, let us assume that x H. We will show that Hx H and H Hx. Now any element
of Hx is of the form hx, where h H. This is in H, since h H and x H. Thus, Hx H. Again, let
h H. Then h = (hx-l) x Hx, since hx-1 H.
H HX.
Notes H = Hx.
Note Along the lines of the proof of Theorem 1, we can prove that if H is a subgroup of
G and x, y G, then
(a) x xH.
(b) xH = H x H.
(c) x H = yH x-1y H.
Example: Let G = Sg = {I, (1 2), (1 3), (2 3), (1 2 3), (1 3 2)} and H be the cyclic subgroup of G
generated by (1 2 3). Obtain the left cosets of H in G.
Solution: Two cosets are
For the other cosets you can apply Theorem 1 to see that
(1 2)H = (2 3)H = (1 3)H and
(1 2 3)H = H = (1 3 2)H.
1 0 0 1 0 i i 0
I , A 1 0 , B i 0 , C 0 i and i 1.
0 1
You can check that the following relations hold between the elements of Q8:
I2 = I , A2 = B2 = C2 = –I,
Show that the subgroup H = < A > has only two distinct right cosets in Q8.
That is , ~ is transitive.
Thus, ~ is an equivalence relation.
[ x l = { y G | y ~ x } = { y G | xy-1 H}.
Now, we will show that [x] = Hx. So, let y [X}. Then Hy = Hx, by Theorem 1. And since y Hy,
y Hx.
Therefore, [x] Hx.
Therefore, hx ~ x. That is, hx [x]. This is true for any hx Hx. Therefore, Hx [x].
Note that what Theorem 2 and the remark above say is that any subgroup H of a group G
partitions G into disjoint right cosets.
To understand this theorem first we have to define the order of a finite group, after that we will
show that the order of any subgroup divides the order of the group.
Definition: The order of a finite group G is the number of elements in G. It is denoted by o(G).
For example, o(S3) = 6 and o(A3) = 3. Remember, A3 = {I, (1 2 3), (1 3 2)}!
You can also see that o(Zn) = n. And, you know that o(Sn) = n!.
Now, let G be a finite group and H be a subgroup of G. We define a function f between the set of
right cosets of H in G and the set of left cosets of H in G by
f : { Hx | x G } { yH | y G } : f(Hx) = x–1H.
Definition: Let H be a subgroup of a finite group G. We call the number of distinct cosets of H in
G the index of H in G, and denote it by | G : H |.
Thus, we see that | S3 : A3 | = 2.
Note that, if we take H = {e}, then | G : {e} | = o(G), since {e}g = {g) g G and {e)g {e}g’
if g g’.
Now let us look at the order of subgroups. In last unit you saw that the orders of the subgroups
of S3 are 1, 2, 3 and 6. All these divide o(S3) = 6. This fact is part of a fundamental theorem about
finite groups. Its beginnings appeared in a paper in 1770, written by the famous French
mathematician Lagrange. He proved the result for permutation groups only. The general result
was probably proved by the famous mathematician Evariste Galois in 1830.
Theorem 3 (Lagrange): Let H be a subgroup of a finite group G. Then
and | G : H | = r.
We know that | Hx1 | = | Hx2 | = ... = | Hxr | = o(H).
Thus, the total number of elements in the union on the right hand side of ( I ) is
You will see the power of Lagrange’s theorem when we get down to obtaining all the subgroups
of a finite group.
For example, suppose we are asked to find all the subgroups of a group G of order 35. Then the
only possible subgroups are those of order 1, 5, 7 and 35. So, for example, we don’t need to waste
time looking for subgroups of order 2 or 4.
In fact, we can prove quite a few nice results by using Lagrange’s theorem. Let us prove some
results about the order of an element.
Definition: Let G be a group and g G. Then the order of g is the order of the cyclic subgroup
< g >, if < g > is finite. We denote this finite number by o(g). If < g > is an infinite subgroup of G,
we say that g is of infinite order.
Now, let g G have finite order. Then the set {e, g, g2, ...} is finite, since G is finite. Therefore, all Notes
the powers of g can’t be distinct. Therefore, gr = gs for some r > s. Then gr-s = e and r-s N. Thus,
the set { t N | gt = e } is non-empty. So, by the well ordering principle it has a least element. Let
n be the least positive integer such that gn= e.
Then
< g > = {e, g, g2, ...., gn-1}.
Therefore, o(g) = o(< g >) = n.
That is, o(g) is the least positive integer n such that gn = e.
Note If g (G, + ), then o(g) is the least positive integer n such that ng = e.
Now suppose g G is of infinite order. Then, for m 11. gm gn. (Because, if gm = gn, then
g m-n = e, which shows that < g > is a finite group.) We will use this fact while proving
Theorem 5.
Theorem 4: Let G be a group and g G be of order n. Then gm = e for some m N iff n | m.
Proof: We will first show that gm = e ! n (m.F or this consider the set S = { r Z | gr = e }.
Now, n S. Also, if a, b S, then ga = e = gb. Hence, ga-b = ga (gb)-1 = e. Therefore, a-b S. Thus,
S Z.
Note So, from last unit, we see that S = nZ. Remember, n is the least positive integer
in S!
We will now use Theorem 4 to prove a result about the orders of elements in a cyclic group.
n
o(gm) = m 1, ..., n 1. ((n,m) is the g.c.d. of n and m.)
n, m
Proof: (a) An element is of infinite order iff all its powers are distinct. We know that all the
powers of g are distinct. We have to show that all the powers of gm are distinct. If possible, let
(gm)t = (gm)w, Then gmt = gmw, But then mt = mw, and hence, t = w. This shows that the powers of gm
are all distinct, and hence gm is of infinite order.
(b) Since o(g) = n, G = {e, g, ........ gn-1 ) . < gm >, being a subgroup of G, must be of finite order. Thus,
n
gm is of finite order. Let o(gm) = t. We will show that t = .
m
n,
Let d = (n, m). We can then write n = n1d, m = m1d, where (m1, n1,) = 1.
n n
Then n 1 .
d n, m
n
t n1 .
n, m
n
i.e., o(g m ) .
(n, m)
12
Using this result we know that o (4) in Z 12 is 3.
12, 4
Theorem 6: Every group of prime order is cyclic.
Proof: Let G be a group of prime order p. Since p 1, a G such that a e. Theorem 4, o(a) | p.
Therefore, o(a) = 1 or o(a) = p. Since a e, o(a) 2.
Thus, o(a) = p, i.e., o(< a >) = p. So, < a > G such that o(< a >) = b(G). Therefore, < a > = G, that is,
G is cyclic.
Using Theorems 3 and 6, we can immediately say that all the proper subgroups of a group of
order 35 are cyclic.
Now let us look at groups of composite order.
Theorem 7: If G is a finite group such that o(G) is neither 1 nor a prime, then G has nontrivial
proper subgroups.
Proof: If G is not cyclic, then any a G, a e, generates a proper non-trivial subgroup < a >.
Now, suppose G is cyclic, say C = < x >, where o(x) = mn (m, n 1).
Then, (xm)n = xmn = e. Thus, by Theorem 4, o(xm) n < o(G).
We first define the Euler phi-function, named after the Swiss mathematician Leonard Euler
(1707-1783).
For example, (2) = I and (6) = 2 (since the only positive integers < 6 and relatively prime to 6 are Notes
1 and 5).
We will now prove a lemma, which will be needed to prove the theorem that follows it. This
lemma also gives us examples of subgroups of Z,, for every n 2 2.
n | ar – 1
ar 1 (mod n),
a r 1.
1
ar
Further, a G, because if a and n have a common factor other than 1; then this factor will ‘divide
ar + bn = 1. But that is not possible.
Since G consists of all those r Z, such that r < n and (r, n) = I, o(G) = (n).
Lemma and Lagrange’s theorem immediately give us the following result due to the
mathematicians Euler and Pierre Fermat.
Proof:
1. Leonhard Euler published a proof in 1789. Using modern terminology, one may prove the
theorem as follows: the numbers a which are relatively prime to n form a group under
multiplication mod n, the group G of (multiplicative) units of the ring Z/nZ. This group
has (n) elements. The element a : = a (mod n) is a member of the group G, and the order
o(a) of a (the least k > 0 such that ak = 1) must have a multiple equal to the size of G.
(The order of a is the size of the subgroup of G generated by a, and Lagrange’s theorem
states that the size of any subgroup of G divides the size of G.)
Thus for some integer M > 0, M· o(a) = (n). Therefore, a(n) = ao(a)·M = (ao(a))M = 1M = 1. This
means that a(n) = 1 (mod n).
Notes 2. Another direct proof: If a is coprime to n, then multiplication by a permutes the residue
classes mod n that are co prime to n; in other words, (writing R for the set consisting of the
(n) different such classes) the sets { x : x in R } and { ax : x in R } are equal; therefore, the two
products over all of the elements in each set are equal. Hence, P a(n)P (mod n) where P is
the product over all of the elements in the first set. Since P is coprime to n, it follows that
a(n) 1 (mod n).
Self Assessment
(a) A1 (b) A3
(c) A4 (d) A5
4.3 Summary
Any subgroup partitions a group into disjoint left (or right) cosets of the subgroup.
The definition of the order of a group and the order of an element of a group.
The proof of Lagrange’s theorem, which slates that if H is a subgroup of a finite group G,
then o(G) = o(H) | G : H |. But, if m | o(G), then G need not have a subgroup of order.
The following consequences of Lagrange’s theorem:
Coset: Let H be a subgroup of a group G, and let x G. We call the set Hx = {hx | h H} a right
coset of H in G.
Lagrange: Let H be a subgroup of a finite group G. Then o(G) = o(H) | G : H |. Thus, o(H) divides
o(G) and | G : H | divides o(G).
1. Obtain the left and right cosets of H = < (1 2) > in S3. Show that Hx xH for some x S3.
2. Show that K = {I, –I} is a subgroup of Q8. Obtain all its right cosets in Q8.
0 1
(c) 1 0 Q8, (d) 3 Z4
(e) 1 R?
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
5.3 Summary
5.4 Keywords
Objectives
Introduction
In earlier units, you have studied about the term subgroups and cosets. In this unit, we will
discuss a special class of subgroups known as normal subgroups. You will also come to know
about that the cosets of such a subgroup form a group with respect to a suitably defined operation.
These groups are called quotient groups. After discussing these concepts, we will also discuss
some examples related to this concept.
In the last unit, you have studied about coset of a subgroup also introduced with a fact that left
coset aH, not be same as the right coset Ha.
But this fact is true for certain subgroup for which Ha and aH represented by the same element
coincide.
In group theory, these types of subgroup are very important and this type of a subgroup has a
special name. This subgroup is referred to normal subgroup.
For example, any group G has two normal subgroups, namely, {e} and G itself. Can you see
why? Well, {e}x = {x} = x{e}, for any x G, and Gx = G = xG, for any x G.
H Z.
Above example is a special case of the fact that every subgroup of a commutative group is a
normal subgroup.
Let us now prove a result that gives equivalent conditions for a subgroup to be normal.
Theorem 1: Let H be a subgroup of a group G. The following statements are equivalent.
(a) H is normal in G.
(b) g -l Hg H g G.
(c) g -l Hg H g G.
Proof: We will show that (a) (b) (c) (a). This will show that the three statements are
equivalent.
(a) (b) : Since (a) is true, Hg = gH g E G. We want to prove (b). For this, consider
g-1hg = g-1gh1 = h1 H
(b) holds.
(b) (c) : Now, we know that (b) holds, i.e., for g G, g-1Hg H. We want to show
= g-1 (ghg-1) g
H g-1Hg.
g-1Hg = H g G.
Remark: Theorem 1 says that H G g 1Hg H g e G. This does not mean that
g hg = h h H and g G .
g-1Hg H. Thus, H G.
Theorem 2 says that if G is abelian, then all its subgroups are normal. Unfortunately, the converse
of this is not true. That is, there are non-commutative groups whose subgroups are all normal.
We will give you an example after doing Theorem 3. Let us first look at another example of a
normal subgroup.
Example: Consider the Klein 4-group, K4, given in table below. Show that both its
subgroups < a > and < b > are normal.
× e a b ab
e e a b ab
a a e ab b
b b ab e a
ab ab b a e
Solution: Consider the table of the operation given in table. Note that a and b are of order 2.
Therefore, a = a-1 and b = b-1. Also note that ba = ab.
Now, let H = < a > = {e, a}. We will check that H K4, that is, g-1hg H g K4 and h E H.
H K4.
In above Example, both < a > and < b > are of index 2 in K4. We have the following result about Notes
such subgroups.
Theorem 3: Every subgroup of a group G of index 2 is normal in G.
Proof: Let N G such that | G : N | = 2. Let the two right cosets of N be N and Nx, and the two
left cosets be N and yN.
Thus, Nx xN.
Nx = xN, and N G.
We will use this theorem to show that, for any n 2, the alternating group A, is a normal
subgroup of Sn.
In fact, if you go back b, you can see that A4 S4, since Lagrange’s theorem implies that
o(S 4 ) 4!
|S 4 : A 4 | 2.
o(A 4 ) 12
Consider the quaternion group Q8, which we discussed earlier. It has the following 6 subgroups:
Ho = (I}, HI = {I, – I}, H2 = (I, – I, A, – A), H3 = {I, – I, B, – B},
g 1H 1g H1 g Q 8 . H1 Q8 .
So far we have given examples of normal subgroups. Let us look at an example of a subgroup
that isn’t normal.
= (3 2 1) (2 3) = (1 3) < (1 2)>
In earlier unit we proved that if H I G and K H, then K I G. That is, ‘’ is a transitive relation. But
‘ ’ is not a transitive relation. That is, if H N and N G, it is not necessary that H G.
Proof: From Theorem 4 of Unit 3, you know that H K G. We have to show that
g-1xg H K x H K and g G.
Thus, H K G.
LetH=<x>andK=<y>.
Solution: Note that the elements of G are, of the form xi $, where i = 0, 1 and j = 0, 1, 2, 3
| G : K | = 2. Thus, by Theorem 3, K G.
Note that we can’t apply Theorem 2, since G is non-abelian (as xy = y-1x and y y-l).
y4 = x
e = x, a contradiction.
Take y to be a rotation of the Euclidean plane about the origin through , and x the reflection
2
about the vertical axis.
Here we will use a property of normal subgroups to create a new group. This group is analogous
to the concept of quotient spaces given in the Linear Algebra course.
Let H be a normal subgroup of a group G. Then gH = Hg for every g G. Consider the collection
of all cosets of H in G. (Note that since H G, we need not write ‘left coset’ or ‘right coset; simply
‘coset’ is enough.) We denote this set by G/H. Now, for x, y H, we have
As this definition seems to depend on the way in which we represent a coset. Let us discuss this
in detail. Suppose C1 and C2 are two cosets, say C1 = Hx and C2 = Hy. Then C1C2 = Hxy. But C1 and
C2 can be written in the form Hx and Hy in several ways. So, you may ask : Does C1C2 depend on
the particular way of writing C1 and C2?
In other words, if C1 = Hx = Hx1 and C2 = Hy = Hy1, then is C1C2 = Hxy or is C1C2 = Hx1y1?
Actually, we will show you that Hxy = Hx1y1, that is, the product of cosets is well-defined.
Hxy = Hx1y1.
So, we have shown you that multiplication is a well-defined binary operation on G/H.
Notes Proof: We have already observed that the product of two cosets is a coset.
o(G)
o(G /H)
o(H)
Also note that if (G, +) is an abelian group and H G, then H G. Further, the operation on
G/H is defined by (H + x) + (H + y) = H + (x + y).
Example: Obtain the group G/H, where G = S3 and H = A3 = {I, (1 2 3), (1 3 2)}.
You know that G/H is a group of order 2 whose elements are H and (1 2) H.
o(Z/nZ) = n.
Note that addition in Z/nZ is given a + b = a + b
Definition: Let G be a group and x, y G. Then x-1y-1 xy is called the commutator of x and y. It is
denoted by [x, y].
The subgroup of G generated by the set of all commutators is called the commutator subgroup
of G. It is denoted by [G, G].
[G, G] G.
For the rest of the proof let us denote [G, G] by H, for convenience.
Now, for x, y G,
Thus, since xy x-1 y-1 H x, y G, HxHy = HyHx x, y G. That is, G/H is abelian.
Note We have defined the quotient group G/H only if H G. But if H G we can
still define G/H to be the set of all left (or right) cosets of H in G. But, in this case G/H will
not be a group.
H G.
Self Assessment
(a) (b)
(c) = (d)
5.3 Summary
We discussed here:
The definition and examples of a normal subgroup.
Every subgroup of an abelian group is normal.
Every subgroup of index 2 is normal.
If G is abelian, then every quotient group of G is abelian. The converse is not true.
The quotient group corresponding to the commutator subgroup is commutative.
5.4 Keywords
Dihedral Group, D8: It is the group of symmetries of a square, that is, its elements represent the
different ways in which two copies of a square can be placed so that one covers the other.
Quotient Group: If C1 = Hx = Hx1 and C2 = Hy = Hy1, then is C1C2 = Hxy or is C1C2 = Hx1y1?
Actually, we will show you that Hxy = Hx1y1, that is, the product of cosets is well-defined.
2. Consider the subgroup SL2(R) = {A E GL2(R) | det(A) = 1} of GL2(R). Using the facts that det
1
(AB) = det (A) det (B) and det (A–1) = , prove that SL2(R) GL2(R).
det(A)
3. Consider the group of all 2 × 2 diagonal matrices over R*, with respect to multiplication.
How many of its subgroups are normal.
archives.math.utk.edu
CONTENTS
Objectives
Introduction
6.1 Homomorphisms
6.2 Isomorphisms
6.4 Summary
6.5 Keywords
Objectives
Introduction
In the last unit, you have studied about the normal groups and the concept of quotient group.
In this unit, we will discuss various properties of those functions between groups which preserve
the algebraic structure of their domain groups. These functions are called group
Homomorphisms. This term was introduced by the mathematician Klein in 1893. This concept
is analogous to the concept of a vector space homomorphism, as you studied in the earlier unit.
In this unit, you will also get an idea about a very important mathematical idea—isomorphism.
6.1 Homomorphisms
Let us start our study of functions from one group to another with an example.
1, if n is even
f : Z {1, –1} by f(n) =
1, if n is odd,
then you can see that f(a + b) = f(a).f(b) a, b Z. What we have just seen is an example of a
homomorphism, a function that preserves the algebraic structure of its domain.
Definition: Let (G1, *1) and (G2,*2) be two groups. A mapping f : G1 — G2 is said to be a group
homomorphism (or just a homomorphism), if
Note that a homomorphism f from G1 to G2 carries the product x *1 y in G1 to the product Notes
f(x) *2 f(y) in G2.
Note The word ‘homomorphism’ is derived from two Greek words ‘homos’,
meaning ‘link’, and ‘morphe’, meaning ‘form’.
Im f = {f(x) 1 x G1}.
(ii) the kernel of f to be the set
Example: Consider the two groups (R, +) and (R*,.). Show that the map exp : (R, +) (R*,.)
: exp(r) = er is a group homomorphism. Also find Im exp and Ker exp.
Note that examples takes the identity 0 of R to the identity 1 of R*. example also carries the
additive inverse – r of r. to the multiplicative inverse of exp (r).
Example: Consider the groups (R, +) and (C, +) and define f : (C, +) (R, +) by f(x + iy) =
x, the real part of x + iy. Show that f is a homomorphism. What are Im f and Ker f?
f((a + ib) + (c + id)) = f((a + c) + i(b + d)) = a + c = f(a + ib) + f(c + id)
Ker f = { x + iy C | f ( x + iy ) = 0 } = { x + iy C | x = 0 }
Note that f carries the additive identity of C to the additive identity of R and ( – z) to – f(z), for any
z C.
Notes In Examples 1 and 2 we observed that the homomorphisms carried the identity to the identity
and the inverse to the inverse. In fact, these observations can be proved for any group
homomorphism.
Then
(a) f(el) = e2, where e1 is the identity of G1 and e2 is the identity of G2.
(b) Now, for any x G1, f(x) *2 f(x-1) = f(x *1x-1) = f(e1) = e2.
Note that the converse of Theorem 1 is false. That is, if f : G1 G2 is a function such that f(e1) = e2
and [f(x)]-1 = f(x-l) = f(x-1) x G1, then f need not be a homomorphism. For example, consider
f : Z Z : f(0) = 0 and ,
n 1 n 0
f(n)
n 1 n 0
Since f(l + 1) f(1) + f(l), f is not a homomorphism. But f(e1) = e2 and f(n) = - f(- n) n Z.
Let us look at a few more examples of homomorphisms now. We can get one important class of
homomorphisms from quotient groups.
Example: Let H G. Consider the map p : G G/H : p(x) = Hx. Show that p is a
homomorphism. Also show that p is onto. What is Ker p?
= (xG|Hx=H}
= H.
In this example you can see that Ker p G. You can also check that Theorem 1 is true here.
Let us briefly look at the inclusion map in the context of symmetric groups. Consider two
natural numbers m and n, where m I n.
Then, we can consider Sm Sn, where any Sm, written as
1 2 .... m
, is considered to be the same as
(1) (2) .... (m)
1 2 .... m m 1 .... n
Sn, i.e., (k) = k for m + 1 k n.
(1) (2) .... (m) m 1 .... n
1 2 3 4
For example, under i : S3 S4, (1 2) goes to .
2 1 4 4
We will now prove some results about homomorphisms. Henceforth, for convenience, we shall
drop the notation for the binary operation, and write a * b as ab.
Now let us look at the composition of two homomorphisms. Is it a homomorphism? Let us see.’
Thus g, f is a homomorphism.
Theorem 3: Let f : G1 G2 be a group homomorphism. Then
xy-1 Ker f.
Therefore, by Theorem 1 of Unit 3, Ker f G1. Now, for any y G1 and x E Ker f,
f(y-1xy) = f(y-1) f(x)f(y)
= e2 .
Ker f G1.
Now, let x2, y2 Im f. Then x1, y1 G1 such that f(x1) = x2 and f(y1) = y2.
x2y2-1 = f(x1) f(y1-1) = f(x1y1-1) Imf.
Im f G2.
Using this result, we can immediately see that the set of purely imaginary numbers is a normal
subgroup of C.
Consider : (R, +) (C*,.) (x) = cos x + i sin x. We have seen that(x + y) = (x)(y), that is, is a
group homomorphism. Now (x) = 1 iff x = 2n for some n Z . Thus, by Theorem 3, Ker =
(2 n | n Z) is a normal subgroup of (R. +). Note that this is cyclic, and 2n is a generator.
Similarly, Im is a subgroup of C*. This consists of all the complex numbers with absolute value
1, i.e., the complex numbers on the circle with radius 1 unit and centre (0, 0).
You may have noticed that sometimes the kernel of a homomorphism is {e} and sometimes it is
a large subgroup. Does the size of the kernel indicate anything? We will prove that a
homomorphism is 1 – 1 iff its kernel is {e}.
Theorem 4: Let f : G1 G2 be a group homomorphism. Then f is injective iff Ker f = {e1}, where
e1 is the identity element of the group G1.
Proof: Firstly, assume that f is injective. Let x Ker f. Then f(x) = e2, i.e., f(x) = f(e1). But f is 1 – 1.
x = e1 .
Thus, Kerf = {e1}.
So, by using Theorem 4, we can immediately say that any inclusion i : B G is 1-1, since
Ker i = {e}.
Example: Consider the group T of translations of R2. We define a map : (R2 + ) (T, o) by
4 (a, b) = fa, b. Show that is an onto homomorphism, which is also 1-1.
Now, any element of T is -f(a, b). Therefore, is surjective. We now show that is also injective.
is 1-1.
So we have proved that ! is a homomorphism, which is bijective.
And now let us look at a very useful property of a homomorphism that is surjective.
Theorem 5: Iff : G1 G2 is an onto group homomorphism and S is a subset that generates G1,
then f(S) generates G2.
Proof: We know that
Let x G2, Since f is surjective, there exists y G1 such that f(y) = x. Since y G1, y = x1 r1 ...x m rm ,
for some m N, where xi S and ri Z, 1 i m.
So far you have seen examples of various kinds of homomorphisms-injective, surjective and
bijective. Let us now look at bijective homomorphism in particular.
6.2 Isomorphisms
Note The word ‘isomorphisms’ is derived from the Greek word ‘ISOS’ meaning
‘equal’.
a b
Example: Consider the set G =
b a a, b R .
a b
Show that f : G C : f = a + ib is an isomorphism.
b a
Notes Solution: Let us first verify that f is a homomorphism. Now, for any two elements
a b c d
b a and d c in G,
a b c d a c b d
r f (a c) i(b d)
b a d c (b d) a c
= (a + ib) + (c + id)
a b c d
= f f
b a d c
Therefore, f is a homomorphism.
a b a b 0 0
Now, Ker f = f a ib 0 b a a 0,b 0 0 0
b a
The following result is one of the consequences of isomorphic groups being algebraically alike
Now if x has finite order, then o(x) = o(< x >) = o(< f(x) >) = o(f(x)), proving (i)
To prove (ii) assume hat x is of infinite order. Then < x > is an infinite group.
Therefore, < f(x) > is an infinite group, and hence, f(x) is of infinite order. So, we have proved (ii).
However, the order of any real number different from ±1 is infinite: and o(1) = 1, o(–1) = 2. Notes
So we reach a contradiction. Therefore, our supposition must be wrong. That is, R* and C* are
not isomorphic.
You must have noticed that the definition of an isomorphism just says that the map is bijective,
i.e., the inverse map exists. It does not tell us any properties of the inverse. The next result does
so.
Theorem 7: If f : G1 G2 is an isomorphism of groups, then f-1 : G2 G is also an isomorphism.
Proof: You know that f-1 is bijective. So, we only need to show that f-1 is a homomorphism. Let a’,
b’ G2 and a = f-1 (a’), b = f-1 (b’). Then f(a)= a’ and f(b)= b’.
Therefore, f(ab) = f(a) f(b) = a’b’. On applying f-1, we get
f-1 (a’b’) = ab = f-1 (a’) f-l (b’), Thus,
Theorem 7 says that if GI G2, then G2 G1. We will be using this result quite often.
In abstract algebra, a group isomorphism is a function between two groups that sets up a one-to-
one correspondence between the elements of the groups in a way that respects the given group
operations. If there exists an isomorphism between two groups, then the groups are called
isomorphic. From the standpoint of group theory, isomorphic groups have the same properties
and need not be distinguished.
Given two groups (G, *) and (H, ), a group isomorphism from (G, *) to (H, ) is a bijective
group homomorphism from G to H. Spelled out, this means that a group isomorphism is a
bijective function f : G H such that for all u and v in G it holds that
f(u * v) = f(u) f(v).
The two groups (G, *) and (H, ) are isomorphic if an isomorphism exists. This is written:
(G, *) (H, )
Often shorter and more simple notations can be used. Often there is no ambiguity about the
group operation, and it can be omitted:
GH
Sometimes one can even simply write G = H. Whether such a notation is possible without
confusion or ambiguity depends on context. For example, the equals sign is not very suitable
when the groups are both subgroups of the same group.
Conversely, given a group (G, *), a set H, and a bijection f : G H, we can make H a group
(H, ) by defining
f(u) f(v) = f(u * v).
Notes Intuitively, group theorists view two isomorphic groups as follows: For every element g of a
group G, there exists an element h of H such that h ‘behaves in the same way’ as g (operates with
other elements of the group in the same way as g). For instance, if g generates G, then so does h.
This implies in particular that G and H are in bijective correspondence. So the definition of an
isomorphism is quite natural.
An isomorphism of groups may equivalently be defined as an invertible morphism in the
category of groups, where invertible here means has a two-sided inverse.
Examples:
1. The group of all real numbers with addition, (, +), is isomorphic to the group of all
positive real numbers with multiplication (+, ×):
(, +) (+, ×)
/ S1
An isomorphism is given by
f(x + ) = e2x1
for every x in .
3. The Klein four-group is isomorphic to the direct product of two copies of 2 = /2
(see modular arithmetic), and can therefore be written 2 × 2. Another notation is Dih2,
because it is a dihedral group.
4. Generalizing this, for all odd n, Dih2n is isomorphic with the direct product of Dihn and Z2.
5. If (G, *) is an infinite cyclic group, then (G, *) is isomorphic to the integers (with the
addition operation). From an algebraic point of view, this means that the set of all integers
(with the addition operation) is the ‘only’ infinite cyclic group.
Some groups can be proven to be isomorphic, relying on the axiom of choice, but the proof does
not indicate how to construct a concrete isomorphism.
1. The group (, +) is isomorphic to the group (, +) of all complex numbers with addition.
Properties
The kernel of an isomorphism from (G, *) to (H, ), is always {eG} where eG is the identity
of the group (G, *)
If (G, *) is a locally finite group that is isomorphic to (H, ), then (H, ) is also locally Notes
finite.
Cyclic Groups
Consequences
From the definition, it follows that any isomorphism f : G H will map the identity element of
G to the identity element of H,
f(eG) = eH
that it will map inverses to inverses,
f(u–1) = [f(u)]–1
and more generally, nth powers to nth powers,
f(un) = [f(u)]n
for all u in G, and that the inverse map f–1 : H G is also a group isomorphism.
The relation “being isomorphic” satisfies all the axioms of an equivalence relation. If f is an
isomorphism between two groups G and H, then everything that is true about G that is only
related to the group structure can be translated via f into a true ditto statement about H, and vice
versa.
Self Assessment
6.5 Keywords
Homomorphism: Homomorphism is derived from two Greek words ‘homos’, meaning ‘link’,
and ‘morphe’, meaning ‘form’.
Inclusion Map: Let H be a subgroup of a group G. Show that the map i : H G, i(h) = h is a
homomorphism. This function is called the inclusion map.
1. Show that f : (R*,.) (R, 4) : f(x) = inx, the natural logarithm of x, is a group homomorphism.
Find Ker f and Im f also.
2. Is f : (GL3(R)3,) (w*,.) : f(A) = det(A) a homomorphism? If so, obtain Ker f and Im f.
Define f : (R, +) (S,.) L f(x) = eInx, where n is a fixed positive integer. Is f a homomorphism?
If so find Ker f.
Notes
CONTENTS
Objectives
Introduction
7.3 Summary
7.4 Keywords
Objectives
Introduction
After understanding the concept of isomorphisms. Let us prove some result about the relationship
between homomorphisms and quotient groups. The first result is the Fundamental Theorem of
Homomorphism for groups. It is called ‘fundamental’ because a lot of group theory depends
upon this result. This result is also called the first isomorphism theorem.
G1/Ker f Im f.
In particular, if f is onto, then G1/Ker f G2.
Proof: Let Ker f = H. Note that H G1. Let us define the function
At first glance it seems that the definition of depends on the coset representative. But we
will show that if x, y G1 such that Hx = Hy, then (Hx) = (Hy). This will prove that is a
well-defined function.
(Hx) = (HY).
f(x) = f(y)
f(x) [f(y)l-1 = e2
f(xy1) = e2
xy-1 Ker f = H.
Hx = Hy
Thus, , is 1-1.
Also, any element of Im f is f(x) = (Hx), where x G1.
Im = Im f.
So, we have proved that is bijective, and hence, an isomorphism. Thus, G1/Ker f = Im f.
The diagram says that if you first apply p, and then , to the elements of G1, it is the same as
applying f to them. That is,
p = f.
Also, note that Theorem 1 says that two elements of G1 have the same image under f iff they
belong to the same coset of Ker f.
One of the simplest situations we can consider is IG : G G. On applying Theorem 1 here, we see
that G/{e} G. We will be using this identification of G/{e) and G quite often.
1, if n is even
Example: Consider f : Z ({1, - 1),.) : f(n)
1, if n is odd.
At the beginning, you saw that f is a homomorphism. Obtain Ker f and Im f. What does Theorem
1 say in this case?
Solution: Let Ze and Zo denote the set of even and odd integers, respectively. Then
Ker f = {n Z | f(n) = 1 } = Z,
Im f = {f(n) | n Z ) = { l , – 1}
{ Ze, Zo } { 1, -1 }.
Example: Show that GL2(R)/SL2(R) R*, where SL2(R) = {A GL2(R) | det (A) = 1 }.,
Solution: We know that the function
f : GL2(R) R* : f(A) = der(A) is a homomorphism. Now, Ker f = SL2(R).
1 0
Also, Im f = R*, since any r R* can be written as det .
0 1
Now-we will use the Fundamental Theorem of Homomorphism to prove a very important
result which classifies all cyclic groups.
f is a homomorphism because
Now, we have two possibilities for Ker I Ker f = {0) or Ker f {0}.
Case 1 (Ker f = {0)): In this case f is 1-1. Therefore, f is an isomorphism. Therefore, by Theorem 7
of unit 6, f-1 is an isomorphism. That is, G (Z, +).
Case 2 (Ker f # {0)): Since Ker f Z, we know that Ker f = nZ, for some n N. Therefore, by the
Fundamental Theorem of Homomorphism, Z/nZ G.
Proof: We must first verify that the quotient groups H/(H K) and (HK)/K are well defined. Notes
You know that H K H. You know that HK G. Again, you know that K HK. Thus, the
given quotient groups are meaningful.
Now, for x, y H,
f(xy) = xyK = (xK) (yK) = f(x) f(y).
Therefore, f is a homomorphism.
We will show that Im f = (HK)/K. Now, take any element hK Im f. Since h H, h HK
Im f = (HK)/K.
Finally, Ker f = { h H | f(h) = K } = { h H hK = K }
={hH|h K}
=HK.
(H + K) / K H/H K.
Theorem 4: Let H and K be normal subgroups of a group G such that K H. Then (G/K)/(H/K)
G/H.
Proof: We will define a homomorphism from G/K onto G/H, whose kernel will turn out to be
H/K.
Consider f : G/K G/H : f(Kx) = Hx. f is well-defined because Kx = Ky tor x, y G
7.2 Automorphisms
Aut G = { f : G G | f is an isomorphism }.
You have already seen that the identity map IG Aut G. You know that Aut G is closed under the
binary operation of composition. Iff E Aut G, then f-1 Aut G. We summarise this discussion in
the following theorem.
An isomorphism from a group (G,*) to itself is called an Automorphisms of this group. Thus it
is a bijection f : G G such that
f(u) * f(v) = f(u * v).
Notes An automorphism always maps the identity to itself. The image under an automorphism of a
conjugacy class is always a conjugacy class (the same or another). The image of an element has
the same order as that element.
The composition of two automorphisms is again an automorphism, and with this operation the
set of all automorphisms of a group G, denoted by Aut(G), forms itself a group, the automorphism
group of G.
For all Abelian groups there is at least the automorphism that replaces the group elements by
their inverses. However, in groups where all elements are equal to their inverse this is the
trivial automorphism, e.g. in the Klein four-group. For that group all permutations of the three
non-identity elements are automorphisms, so the automorphism group is isomorphic to S3 and
Dih3.
In Zp for a prime number p, one non-identity element can be replaced by any other, with
corresponding changes in the other elements. The Automorphisms group is isomorphic to
Zp – 1. For example, for n = 7, multiplying all elements of Z7 by 3, modulo 7, is an automorphism
of order 6 in the automorphism group, because 36 = 1 (modulo 7), while lower powers do not
give 1. Thus this automorphism generates Z6. There is one more automorphism with this property:
multiplying all elements of Z7 by 5, modulo 7. Therefore, these two correspond to the elements
1 and 5 of Z6, in that order or conversely.
The automorphism group of Z6 is isomorphic to Z2, because only each of the two elements 1 and
5 generate Z6, so apart from the identity we can only interchange these.
The automorphism group of Z2 × Z2 × Z2 = Dih2 × Z2 has order 168, as can be found as follows.
All 7 non-identity elements play the same role, so we can choose which plays the role of (1,0,0).
Any of the remaining 6 can be chosen to play the role of (0, 1, 0). This determines which
corresponds to (1, 1, 0). For (0, 0, 1) we can choose from 4, which determines the rest. Thus we
have 7 × 6 × 4 = 168 automorphisms. They correspond to those of the Fano plane, of which the 7
points correspond to the 7 non-identity elements. The lines connecting three points correspond
to the group operation: a, b, and c on one line means a + b = c, a + c = b, and b + c = a. See also
general linear group over finite fields.
For Abelian groups all automorphisms except the trivial one are called outer automorphisms.
Non-Abelian groups have a non-trivial inner automorphism group, and possibly also outer
Automorphisms.
n = 1 or n = –1.
fg : G G : fg(x) = gxg-1.
Y = (gg-1)y(gg-1)
= (g-1yg)g-1
= fg(g-1yg) lm fg.
Thus, f, is an automorphism of G.
Definition: fg is called an inner automorphism of G induced by the element g in G. The subset of
Aut G consisting of all inner automorphism of G is denoted by Inn G.
For example, Let us compute fg(1). fg(l 3) and fg(1 2 3), where g = (1 2). Note that g-1 = (1 2) = g.
fg(l 3) = (1 2) (1 3) (1 2) = (2 3).
fg(l 2 3) = (1 2)(1 2 3)(1 2) = (1 3 2).
= g(hxh-1) g-1
= (gh)x (gh)-1
= fgh(x)
Thus, fgh = fg o fh, i.e., Inn G is closed under composition. Also fe = IG belongs to Inn G.
Thus, fg-1 = (fg)-1. That is every element of Inn G has an inverse in Inn G.
This proves that Inn G is a subgroup of Aut G.
Now, to prove that Inn Aut G, let Aut G and fg Inn G. Then, for any x G
Inn G Aut G.
Now we will prove an interesting result which relates the cosets of the centre of a group G to
lnn G. Recall that the centre of G, Z(G) = { x G | xg = gx g G }.
Theorem 7: Let G be a group. Then G/Z(G) Inn G.
Proof: As usual, we will use the powerful Fundamental Theorem of Homomorphism to prove
this result.
We define f : G Aut G : f(g) = fg.
= [(g) o f(h).
Next, Im F = ( fg, 1 g G ) = Inn G.
Finally, Ker f = ( g G | f, = IG }
= { g G [ fg(x) = x x G }
= { g G | gx = xg x G }
= Z(G).
Self Assessment
2. The word isomorphisms is derived from Greek word ISOS meaning .................
7.3 Summary
Any infinite cyclic group is isomorphic to (Z, +). Any finite cyclic group of order n is Notes
isomorphic to ( Z , +).
The set of automorphism of a group G, Aut G, is a group with respect to the composition
of functions.
7.4 Keywords
Group Homomorphism: Iff : G1 G2 and g : G2 G3 are two group homomorphisms, then the
composite map g . f : G1 G3 is also a group homomorphism.
1. Let G be a group and H G. Show that there exists a group G1 and a homomorphism
f : G G1 such that Ker f = H.
2. Show that the homomorphic image of a cyclic group is cyclic i.e., if G is a cyclic group and
f : G G’ is a homomorphism, then f(G) is cyclic.
3. Show that Z = nZ, for a fixed integer n,
(Hint: Consider f : (Z, +) (nZ, +) : f(k) = nk)
4. Is f : Z Z : f(x) = 0 a homomorphism? An isomorphism?
1. (b) 2. (a)
archives.math.utk.edu
CONTENTS
Objectives
Introduction
8.5 Summary
8.6 Keywords
8.7 Review Questions
Objectives
Introduction
In earlier classes, you have studied about the symmetric group. As you have often seen in
previous units, the symmetric groups S, as well as its subgroups, have provided us a lot of
examples. The symmetric groups and their subgroups are called permutation groups. It was the
study of permutation groups and groups of transformations that gave the foundation to group
theory. In this unit, we will prove a result by the mathematician Cayley, which says that every
group is isomorphic to permutations group. This result is what makes permutation groups so
important.
In earlier units, you have studied that a permutation on n non-empty set X is a bijective function
from X onto X. We denote the set of all permutations on X by S(X).
Suppose X is a finite set having n elements. For simplicity, we take these elements to be
1, 2, . . . , n. Then, we denote the set of all permutations on these n symbols by Sn.
1 2 .... n
f .
f(1) f(2) .... f(n)
Now, there are n possibilities for f(l), namely, 1, 2, . . . , n. Once f(1) has been specified, there are
(n – 1) possibilities for f(2), namely, {1, 2, . . . , n} \ {f(1)}. This is because f is 1-1. Thus, there are
n(n – 1) choices for f(1) and f(2). Continuing in this manner, we see that there are n! different
ways in which f can be defined. Therefore, S, has n! element.
Now, let us discuss at the algebraic structure of S(X), for any set X. The composition of
permutations is a binary operation on S(X). To help you regain practice in computing the
composition of permutations, consider an example.
1 2 3 4 1 2 3 4
Let f = and g be in S 4 .
2 4 1 3 4 1 3 2
1 2 3 4
fog=
3 2 1 4
Figure 8.1: (1 2 3 4) o (1 4 2) in S4
Theorem 1: Let X be a non-empty set. Then the system (S(X), 0 ) forms a group, called the
symmetric group of X.
Thus, Sn is a group of order n!. We call Sn, the symmetric group of degree n. Note that if f Sn, then
Notes Remark: From now we will refer to the composition of permutations as multiplication of
permutations. We will also drop the composition sign. Thus, we will write f o g as fg.
The two-line notation that we have used for a permutation is rather cumbersome. In the next
section we will see how to use a shorter notation.
2 4
Consider the permutation f = . Choose any one of the symbols say 1.
1 2
Figure 8.2: (1 3 4 2)
1 3
2 4
Such a permutation is called a 4-cycle, or a cycle of length 4. Figure 8.2 can give you an indication
as to why we give this name.
Definition: A permutation f Sn, is called an r-cycle (or cycle of length r) if there are r distinct
integers i1, i2,, i3, . . . , ir lying between 1 and n such that
In particular, 2-cycles are called transpositions. For example, the permutation f = (2 3) S3 is a Notes
transposition. Here f(1) = 1, f(2) = 3 and f(3) = 2.
Later you will see that transpositions play a very important role in the theory of permutations.
1 2 .... n
Now consider any 1-cycle (i) in S,. It is simply the identity permutation I , since
1 2 .... n
it maps i to i and the other (n - 1) symbols to themselves.
Let us see some examples of cycles in S3 (1 2 3) is the 3-cycle that takes 1 to 2, 2 to 3 and 3 to 1. There
are also 3 transpositions in S3, namely, (1 2), (1 3) and (2 3).
Now, can we express any permutation as a cycle? No. Consider the following example from S5.
Let g be the permutation defined by
1 2 3 4 5
g= .
3 5 4 1 2
If we start with the symbol 1 and apply the procedure for obtaining a cycle to g, we obtain
(1 3 4) after three steps, Because, g maps 4 to 1, we close the brackets, even though we have not
yet written down all the symbols. Now we simply choose another symbol that has not appeared
so far, say 2, and start the procedure of writing a cycle again. Thus, we obtain another cycle (2 5).
Now, all the symbols are exhausted.
g = (1 3 4) (2 5).
We call this expression for g a product of a 3-cycle and a transposition. In Figure 8.3 we represent
g by a diagram which shows the 3-cycle and the 2-cycle clearly.
Figure 8.3: (1 3 4) (2 5)
Because of the arbitrary choice of symbol at the beginning of each cycle, there are many ways of
expressing g. For example,
g = (4 1 3) (2 5) = (2 5) (1 3 4) = (5 2) (3 4 1).
That is, we can write the product of the separate cycles in any order, and the choice of the starting
element within each cycle is arbitrary.
So, you see that g can’t be written as a cycle; it is a product of disjoint cycles.
Definition: We call two cycle disjoint if they have no symbol in common. Thus, disjoint cycles
move disjoint sets of elements, (Note that f ! S,, moves a symbol i if f(i) i. We say that f fixes
i if f(i) = i.)
So, for example, the cycles (1 2) and (3 4) in S4 are disjoint. But (1 2) and (1 4) are not disjoint, since
they both move 1.
Notes Note that if f and g are disjoint, then fg=-gf, since f and g move disjoint sets of symbols.
Now let us examine one more example. Let h be the permutation in S5 defined by
1 2 3 4 5
h= .
4 2 3 5 1
because each of the symbols 2 and 3 is left unchanged by h. By convention, we don’t include the
1-cycles (2) and (3) in the expression for h unless we wish to emphasize them, since they just
represent the identity permutation. Thus, we simply write h = (1 4 5).
The same process that we have just used is true for any cycle. That is, any r-cycle (i1 i2 . . . . . ir) can
be written as (i1 ir) (i1 i1) . . . . . (i1 i2), a product of transpositions.
Now we will use Theorem 2 to state a result which shows why transpositions are so important
in the theory of permutations.
Theorem 2: Every permutation in Sn (n 2) can be written as a product of transpositions.
Proof: The proof is really very simple. By Theorem 1 every permutation, apart from I, is a
product of disjoint cycles. Also, you have just seen that every cycle is a product of transpositions.
Hence, every permutation, apart from I, is a product of transpositions.
Also, I = (1 2) (1 2). Thus, I is also a product of transpositions. So, the theorem is proved.
Let us see how Theorem 3 works in practice. This is the same as (1 4) (1 2) (1 3) (1 5).
1 2 3 4 5 6
Similarly, the permutation
3 6 4 1 2 5
= (1 3 4) (2 6 5) = (1 4) (1 3) (2 5) (2 6).
The decomposition given in Theorem 3 leads us to a subgroup of Sn that we will now discuss.
You have seen that a permutation in Sn can be written as a product of transpositions. But all such
representations have one thing in common – if a permutation in Sn is the product of an odd
number of transpositions in one such representation, then it will be a product of an odd number
of transpositions in any such representation. Similarly, if f Sn is a product of an even number
of transpositions in one representation, then f is a product of an even number of transpositions
in any such representation. To see this fact we need the concept of the signature or sign function.
f(i) – f(i)
sign f
i ,j 1 j–i
Notes
3 – 2 1 2 1 3
= 1.
1 2 1
1 2 3 – 2 3 1
= 1.
1 2 1
Proof: By definition,
n
sign fog f(g(j)) – f(g(i))
i , j 1 j–i
i j
Now, as i and j take all possible pairs of distinct values from 1 to n, so do g(i) and g(j), since g is
a bijection.
t (q) – t p – 1
= 1.
q–p q-p
t(i) t(j) i j
1, if i, j p, q.
ij ij
The remaining factors contain either p or q, but not both. These can be paired together to form
one of the following products.
Notes
t(i) – t(p) t(q) – t(i) i – q p – i
= . l, if q > i > p,
i–p q–i i–p q–i
Taking the values of all the factors of sign t, we see that sign t = –1.
(b) Let f S,. By Theorem 3 we know that f = t1,t2 .... t, for some transpositions t1, ..... tr in Sn.
sign f = sign (t1 t2 . . . . t,)
= (sign t1) (sign t2) . . . . . sign (tr), by Theorem 3.
{1, – 1} Im {sign}
be two factorisations of f into a product of transpositions. Then either both r and s are even
integers, or both are odd integers.
Proof: We apply the function sign: Sn {1, –1} to f = t1t2 . . . . tr.
sign (t1’ t2’ . . . ts’) = (–1) substituting t1’ t2’. . . ts’ for f.
So, we have shown that for f S, the number of factors occurring in any factorisation of f into
transposition is always even or always odd. Therefore, the following definition is meaningful.
Definition: A permutation f Sn, is called even if it can be written as a product of an even sign
number of transposition. f is called odd if it can be represented as a product of an odd number of
transpositions.
For example, (1 2) S3 is an odd permutation. In fact, any transposition is an odd permutation.
On the other hand, any 3-cycle is an even permutation, since
(i j k) = (i k) (i j)
A, = (f Sn, | f is even).
n!
We’ll show that A,, Sn, and that o(An) = , for n 2.
2
n!
Notes
Theorem 6: The set A,, of even permutations in S,, forms a normal subgroup of Sn, of order .
2
= (f Sn | f is even)
= A.
A Sn.
o(S n )
o(Sn/An) = 2, that is, 2.
o(A n )
o(S n ) n !
o(An) = .
2 2
Note that this theorem says that the number of even permutations in S, equals the number of
odd permutations in S,.
Theorem 6 leads us to the following definition.
Definition: A,, the group of even permutations in Sn, is called the alternating group of
degree n.
Let us look at an example that you have already seen in previous units, A3. Now, Theorem 6 says
3!
that o(A3) = 3. Since (1 2 3) = (1 3) (1 2), (1 2 3) A3. Similarly,
2
A fact that we have used in the example above is that an r-cycle is odd if r is even, and even if r
is odd. This is because (i1i2 .... ir,) = (i1 i,) (i1 ir-1) . . . . . . (i1 i2), a product of (r – 1) transpositions.
Now, for a moment, let us go back to Unit 4 and Lagrange’s theorem. This theorem says that the
order of the subgroup of a finite group divides the order of the group. We also said that if
n |o(G), then G need not have a subgroup of order n. Now that you know what A4 looks like, we
are in a position to illustrate this statement.
We will show that A4 has no subgroup of order 6, even though 6 | o (A4). Suppose such a
subgroup H exists. Then o(H) = 6, o (A4) = 12. (A4 : H | = 2. H A4 (see Theorem 3, Unit 5).
Now, A4/H is a group of order 2.
Notes Similarly, (1 3 2)2 = (1 2 3) H. By the same reasoning (1 4 2), (1 2 4), (1 4 3), (1 3 4), (2 3 4), (2 4 3)
are also distinct element of H. Of course, I H.
Thus, H contains at least 9 elements.
o(H) 9. This contradicts our assumption that o(H) = 6.
Therefore, A4 has no subgroup of order 6.
We use A4 to provide another example too. (See how useful A4 is!) In earlier unit we’d said that
if H N and N G, then H need not be normal in 6. Well, here’s the example.’
(1 2 3)-1 (1 2) (3 4) (1 2 3) = (1 3) (2 4) H.
And now let us see why permutation groups are so important in group theory.
Most finite groups that first appeared in mathematics were groups of permutations. It was the
English mathematician Clayley who first realised that every group has the algebraic structure
of a subgroup of S(X), for some set X. In this section we will discuss Cayley’s result and some of
its applications.
Theorem 7 (Cayley): Any group G is isomorphic to a subgroup of the symmetric group S(G).
fa is 1-1, since
fa(x) = fa(y) ax = ay x = y x, y E G.
fa is onto, since any x E G is f, (a – ’x).
:. fa S(G) a G.
Now, Ker f = (a G | fa = IG )
= ( a G | fa(x)=x x G }
= ( a G |a x = x x G }
= {e}.
G/Ker f Im f S(G),
that is, G is isomorphic to a subgroup of S(G).
As an example of Cayley’s theorem, we will show you that the Klein 4-group K4 is isomorphic
to the subgroup V4 of S4. The multiplication table for K4 is
. e a b c
e e a b c
a a e c b
b b c e a
c c b a e
Self Assessment
1. If .................. is a group of order n!. Then we call S, the symmetric group of define n.
(a) Sn (b) Sn
1
(c) S n (d) Sn-1
2. Every permutation is Sn (n ..................) can be written as produce of transposition
(a) –1 (b) 1
(c) 0 (d) 2
8.5 Summary
The symmetric group S(X), for any set X, and the group S,, in particular.
Symmetric Group: Let X be a non-empty set. Then the system (S(Xj, 0) forms a group, called the
symmetric group of X.
Permutation: A permutation f S, is called an r-cycle (or cycle of length r) if there are r distinct
integers i1, i2, i3, . . . , ir lying between 1 and n.
4. (a) 5. (d)
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
9.4 Summary
9.5 Keywords
Objectives
Introduction
In the last unit, we have studied about permutation group. This unit will provide you the
information related to 15 finite groups and direct products. Let us understand all these one by
one.
In this section, we will discuss a very important method of constructing new groups. We will
first see how two groups can be combined to form a third group. Then we will see how two
subgroups of a group can be combined to form another subgroup.
In this sub-section we will construct a new group from two or more groups that we already
have.
Let (G1, *1) and ((G2, *2) be two groups. Consider their Cartesian product G = G1 × G2 = {(x, y) |
x G1, y G2}.
Notes Can we define a binary operation on G by using the operations on G1 and G2? Let us try the
method, namely, component-wise multiplication. That is, we define the operation * on G by
(a,b) * (c, d)=(a*1 c, b*2d) a, c G1, b, d G2.
So, you have proved that G = G1 × G2 is a group with respect to *. We call G the external direct
product of (G1, *1) and (G2, *2).
We can similarly prove that G1 × {e2} G1 × G2 and (G1 × G2)/(G1 × {e2}) G2.
So, far we have seen the construction of G1 × G2 from two groups G1 and G2. Now we will see
under what conditions we can express a group as a direct product of its subgroups.
Let us begin by recalling from Unit 5 that if H and K are normal subgroups of a group G, then HK
is a normal subgroup of G. We are interested in the case when HK is the whole of G. We have the
following definition.
Definition: Let H and K be normal subgroups of a group G. We call G the internal direct product Notes
of H and K if
G = HK and H K = {e}.
We write this fact as G = H × K.
For example, let us consider the familiar Klein 4-group
For another example, consider Z10. It is the internal direct product of its subgroups H = {0, 5}
(i) Z10= H + K, since any element of Z10 is the sum of an element of H and an element of K, and
(ii) H K= {0} .
Now, can an external direct product also be an internal direct product? What does it say? It says
that the external product of G1 × G2 is the internal product (G1 × {e2}) × ({e1} × G2).
For example, look at the group G generated by {a, b, c}, where a2 = e = b2 = c2 and ab = ba, ac = ca,
bc = cb. This is the internal direct product of < a >, < b > and < c >. That is G Z2 × Z2 × Z2.
Now, can every group be written as an internal direct product of two or more of its proper
normal subgroups? Consider Z. Suppose Z = H × K, where H, K are subgroups of Z.
You know that H = < m > and K = < n > for some m, n Z. Then mn H K. But if H × K is a direct
product, H K = {0}. So, we reach a contradiction. Therefore, Z can’t be written as an internal
direct product of two subgroups.
By the same reasoning we can say that Z can’t be expressed as H1 × H2 × ..... × Hn, where Hi Z
i = 1, 2, .... , n.
When a group is an internal direct product of its subgroups, it satisfies the following theorem.
Theorem 1: Let a group G be the internal direct product of its subgroups H and K. Then
Notes Proof: (a) We know that G = HK. Therefore, if x G, then x = hk, for some h H, k K. Now
suppose x = h1k1 also, where h1 H and k1 K. Then hk = h1k1.
(b) The best way to show that two elements x and y commute is to show that their commutator
x-1y-1 xy is identity. So, let h H and k K and consider h-’k-’hk. Since K G, h-1k-1 h K.
h-1k-1hk K.
Now let us look at the relationship between internal direct products and quotient groups.
Theorem 2: Let H and K be normal subgroups of a group G such that G = H × K. Then G/H K
and G/K H.
Theorem 3: Let G be a finite group and H and K be its subgroups such that G = H X K.
In Unit 4 we proved Lagrange’s theorem, which says that the order of a subgroup of a finite
group divides the order of the group. We also said that if G is a finite cyclic group and m | o(G),
then G has a subgroup of order. But if G is not cyclic, this statement need not be true, as you have
seen in the previous unit. In this context, in 1845 the mathematician Cauchy proved the following
useful result.
Theorem 4: If a prime p divides the order of a finite group G, then G contains an element of
order p.
The proof of this result involves a knowledge of group theory that is beyond the scope of this
course. Therefore, we omit it.
Theorem 5: If a prime p divides the order of a finite group G, then G contains a subgroup of
order p.
Proof: Just take the cyclic subgroup generated by an element of order p. This element exists
because of Theorem 4.
So, by Theorem 5 we know that any group of order 30 will have a subgroup of order 2, a
subgroup of order 3 and a subgroup of order 5. In 1872 Ludwig Sylow, a Norwegian
mathematician, proved a remarkable extension of Cauchy’s result. This result, called the first Notes
Sylow theorem, has turned out to be the basis of finite group theory. Using this result we can
say, for example, that any group of order 100 has subgroups of order 2, 4, 5 and 25.
Theorem 6 (First Sylow Theorem): Let G be a finite group such that o (G) = pnm, where p is a
prime, n 1 and (p, m) = 1. Then G contains a subgroup of order pk k = 1, . . . , n.
We shall not prove this result or the next two Sylow theorems either. But, after stating all these
results we shall show how useful they are.
The next theorem involves the concepts of conjugacy and Sylow p-subgroups which we now
define.
Definition: Two subgroups H and K of a group G are conjugate in G if g G such that
K = g-1Hg and then K is called a conjugate of H in G.
And now let us see how many Sylow p-subgroups a group can have.
Theorem 8 (Third Sylow Theorem): Let G be a group of order pnm, where (p, m) = 1 and p is a
prime. Then np, the number of distinct Sylow p-subgroups of G, is given by np = 1 + kp for some
k 0. And further, np | o(G).
We would like to make a remark about the actual use of Theorem 8.
Remark: Theorem 8 says that np, 1 (mod p). (np, pn) = 1. Also, since np | o(G), using Theorem 9
of Unit 1 we find that np | m. This fact helps us to cut down the possibilities for n,, as you will see
in the following examples.
Similarly, we know that G has a subgroup of order 5. The total number of such subgroups is 1,6
or 11 and must divide 3. Thus, the only possibility is 1. So G has a unique subgroup of order 5, say
K. Then K G and K is cyclic.
G = HK.
So, G = H × K Z3 X Z5 = Z15,
Notes
Example: Show that a group G of order 30 either has a normal subgroup of order 5 or a
normal subgroup of order 3, i.e. G is not simple. A group G is called simple if its only normal
subgroups.
Solution: Since 30 = 2 × 3 × 5, G has a Sylow 2-subgroup, a Sylow 3-subgroup and a Sylow
5-subgroup. The number of Sylow 5-subgroups is of the form 1 + 5k and divides 6. Therefore, it
can be 1 or 6. If it is 1, then the Sylow 5-subgroup is normal in G.
On the other hand, suppose the number of Sylow 5-subgroups is 6. Each of these subgroups are
distinct cyclic groups of order 5, the only common element being e. Thus, together they contain
24 + 1 = 25 elements of the group. So, we are left with 5 elements of the group which are of order
2 or 3. Now, the number of Sylow 3-subgroups can be 1 or 10. We can’t have 10 Sylow
3-subgroups, because we only have at most 5 elements of the group which are of order 3. So, if
the group has 6 Sylow 5-groups then it has only 1 Sylow 3-subgroup.
Now let us use the powerful Sylow theorems to classify groups of order 1 to 10. In the process we
will show you the algebraic structure of several types of finite groups.
Here, we will apply the results of the above discussion to study some finite groups. In particular,
we will list all the groups of order 1 to 10, up to isomorphism.
We start with proving a very useful result.
Theorem 9: Let G be a group such that o(G) = pq, where p, q are primes such that p > q and
q | p – 1. Then G is cyclic.
Proof: Let P be a Sylow p-subgroup and Q be a Sylow q-subgroup of G. Then o(P) = p and
o(Q) = q. Now, any group of prime order is cyclic, so P = < x > and Q = < y > for some x, y G.
By the third Sylow theorem, the number np of subgroups of order p can be 1 , 1+ p, 1 + 2p, . . . , and
it must divide q. But p > q. Therefore, the only possibility for np is 1. Thus, there exists only one
Sylow p-subgroup, i.e., P. Further, by Sylow’s second theorem P G.
Again, the number of distinct Sylow q-subgroups of G is nq, = 1 + kq for some k, and n, | p. Since
p is a prime, its only factors are 1 and p. n, = 1 or nq = p. Now if 1 + kq = p, then q | p – 1. But
we started by assuming that 9 | p - 1. So we reach a contradiction. Thus, nq = 1 is the only
possibility. Thus, the Sylow q-subgroup Q is normal in G.
Now we want to show that G = P × Q. For this, let us consider P Q. The order of any element of
P Q must divide p as well as q, and hence it must divide (p, q) = 1.
Now if q | p – 1, then does o(G) = pq imply that G is cyclic? Well, consider S3. You know that o(S3)
= 6 = 2.3, but S3 is not cyclic. In fact, we have the following result.
Theorem 10: Let G be a group such that o(G) = 2p, where p is an odd prime, Then either G is cyclic
or G is isomorphic to the dihedral group D2p of order 2p.
Proof: As in the proof of Theorem 9, there exists a subgroup P = < x > of order p with
xy yx. y-1xy x.
Now, since P = < x > G, y-1xy P. y-1xy = x’, for some r = 2 ,... . , p - 1.
2
r
Therefore, y-2xy2 = y-1(y-1xy) = y–1xry = (y-1xy)r = (xr)r = x
2
x = x r , since o(y) = 2.
21
xr 6.
So, G = PQ = < {x, y | xp = e, y2 = e, y-1xy = x-1 > , which is exactly the same algebraic structure as
that of D2p.
G = D2p = {e, x, x2, ... , xp-1, y, xy, x2 y, . . . . , xp-1y]
Now, from Theorem 6 of Unit 4, we know that if o(G) is a prime, then G is cyclic. Thus, groups
of orders 2, 3, 5 and 7 are cyclic. This fact allows us to classify all groups whose orders are 1, 2, 3,
5, 6, 7 or 10. What about the structure of groups of order 4 = 22 and 9 = 32? Such groups are covered
by the following result.
Theorem 12: Let G be a group such that o(G) = p2, where p is a prime. Then either G is cyclic or
G = Zp × Zp, a direct product of two cyclic groups of order p.
o(H) = p.
Therefore, y G such that y H. Then, by the same reasoning, K = < y > is of order p. Both H
and K are normal in G, since G is abelian.
We want to show that G = H × K. For this, consider H K. Now H K H.
G = H × K Zp × Zp .
So far we have shown the algebraic structure of all groups of order 1 to 10, except groups of order
8. Now we will list the classification of groups of order 8.
If G is an abelian group of order 8, then
(ii) G Z4 × Z2, or
(iii) G Z2 × Z2 × Z2 .
Self Assessment
2. Let H and k be normal subgroups of a group G such that G = H × k. Then G/H ...................
and G/k H
(a) k (b) H
3. Let G ................... be and H and k be its subgroup such that G = H × k. Thus O(G) = O(H) o(k).
4. If a prime p divides the order of a finite group G, then G contains an element of ................... Notes
(a) P (b) G
(c) Q (d) R
5. If a prime P divides the order of a finite group G, then G contains a ................... of order P.
(a) subgroup (b) normal
(c) cycle (d) permutation
9.4 Summary
The statement and application of Sylow’s theorems, which state that: Let G be a finite
group of order pnm, where p is a prime and p | m. Then
G is abelian.
G is cyclic or G Zp × Zp.
The classification of groups of order 1 to 10, which we give in the following table.
External Direct Product: Let (G1, *1), (G2, *2), . . . . . , (Gn, *n) be n groups. Their external direct
product is the group (G, *), where
G = G1 × G2 ..... × Gn and
Sylow p-subgroup: Let G be a finite group and p be a prime such that pn | o(G) but pn+1 o(G), for
some n 1. Then a subgroup of G of order pn is called a Sylow p-subgroup of G.
1. Show that the binary operation * on G is associative. Find its identity element and the
inverse of any element (x, y) in G.
2. Show that G1 × G2 = G2 × G1, for any two groups G1 and G2.
3. Show that G1 × G2 is the product of its normal subgroup H = G1 × {e2} and K = {e1} × G2. Also
show that (G1 × {e2}) ({e1} × G2) = {(e1, e2)}.
4. Prove that P(G1 × G3) = Z(G1) × Z(G2), where Z(G3) denotes the centre of G (see Theorem 2
of unit 3).
5. Let A and B be cyclic groups of order m and n, respectively, where (m, n) = 1. Prove that
A × B is cyclic of order mn.
(Hint: Define f : Z Zm × Zn : f(r) = (r + mZ, r + nZ). Then apply the Fundamental theorem
of Homomorphism to show that Zm × Zn Zmn.
6. Let H and K be normal subgroups of G which satisfy (a) of Theorem 1. Then show that
G = H × K.
7. Use Theorem 2 to prove Theorem 3.
Notes
CONTENTS
Objectives
Introduction
10.1 Definition
10.2 Properties
10.3 Notation
10.4 Summary
10.5 Keywords
Objectives
Introduction
A group for which the elements commute (i.e., AB = BA for all elements A and B) is called a finite
abelian group. All cyclic groups are finite abelian, but a finite abelian group is not necessarily
cyclic. All subgroups of a finite abelian group are normal. In a finite abelian group, each element
is in a conjugacy class by itself, and the character table involves powers of a single element
known as a group generator. In Mathematica, the function finite abelian group
[{n1, n2 ...}] represents the direct product of the cyclic groups of degrees n1 n2 ...
10.1 Definition
A finite abelian group is a set, A, together with an operation “•” that combines any two elements
a and b to form another element denoted a • b. The symbol “•” is a general placeholder for a
concretely given operation. To qualify as a finite abelian group, the set and operation,
(A, •), must satisfy five requirements known as the finite abelian group axioms:
Closure
Associatively Notes
Identity Element
There exists an element e in A, such that for all elements a in A, the equation e • a = a • e = a holds.
Inverse Element
For each a in A, there exists an element b in A such that a • b = b • a = e, where e is the identity
element.
Commutatively
For all a, b in A, a • b = b • a.
More compactly, a finite abelian group is a commutative group. A group in which the group
operation is not commutative is called a “non-finite abelian group” or “non-commutative group”.
You should notice that any field is a finite abelian group under addition. Furthermore, under
multiplication, the set of non-zero elements of any field must also form a finite abelian group.
Of course, in this case the two operations are not independent–they are connected by the
distributive laws.
The definition of a finite abelian group is also useful in discussing vector spaces and modules.
In fact, we can define a vector space to be a finite abelian group together with a scalar multiplication
satisfying the relevant axioms. Using this definition of a vector space as a model, we can state the
definition of a module in the following way.
10.2 Properties
Let us assume that, If n is a natural number and x is an element of a finite abelian group G written
additively, then nx can be defined as x + x + ... + x (n summands) and (–n)x = –(nx). In this way, G
becomes a module over the ring Z of integers. In fact, the modules over Z can be identified with
the finite abelian groups.
Theorems about finite abelian groups can often be generalized to theorems about modules over
an arbitrary principal ideal domain. A typical example is the classification of finitely generated
finite abelian groups which is a specialization of the structure theorem for finitely generated
modules over a principal ideal domain. In the case of finitely generated finite abelian groups,
this theorem guarantees that a finite abelian group splits as a direct sum of a torsion group and
a free finite abelian group. The former may be written as a direct sum of finitely many groups
of the form Z/pkZ for p prime, and the latter is a direct sum of finitely many copies of Z.
If f, g : G H are two group homomorphisms between finite abelian groups, then their sum
f + g, defined by (f + g)(x) = f(x) + g(x), is again a homomorphism. (This is not true if H is a non-
finite abelian group.) The set Hom (G, H) of all group homomorphisms from G to H thus turns
into a finite abelian group in its own right.
Somewhat kind to the dimension of vector spaces, every finite abelian group has a rank. It is
defined as the cardinality of the largest set of linearly independent elements of the group. The
integers and the rational numbers have rank one, as well as every subgroup of the rationals.
There are two main notational conventions for finite abelian groups: ‘+’ additive and ‘.’
multiplicative.
Addition x+y 0 nx –x
Multiplication x * y or xy e or 1 xn x–1
Generally, the multiplicative notation is the usual notation for groups, while the additive notation
is the usual notation for modules. The additive notation may also be used to emphasize that a
particular group is abelian, whenever both abelian and non-finite abelian groups are considered.
Multiplication Table
To verify that a finite group is abelian, a table (matrix) - known as a Cayley table - can be
constructed in a similar fashion to a multiplication table. If the group is G = {g1 = e, g2, ..., gn} under
the operation “, the (i, j)’th entry of this table contains the product gi . gj. The group is abelian if
and only if this table is symmetric about the main diagonal.
This is true since if the group is abelian, then gi . gj = gj . gi. This implies that the (i, j)’th entry of the
table equals the (j, i)’th entry, thus the table is symmetric about the main diagonal.
Examples:
1. For the integers and the operation addition “+”, denoted (Z,+), the operation + combines
any two integers to form a third integer, addition is associative, zero is the additive
identity, every integer n has an additive inverse, –n, and the addition operation is
commutative since m + n = n + m for any two integers m and n.
3. Every ring is a finite abelian group with respect to its addition operation. In a commutative
ring the invertible elements, or units, form an abelian multiplicative group. In particular,
the real numbers are a finite abelian group under addition, and the non-zero real numbers
are a finite abelian group under multiplication.
4. Every subgroup of a finite abelian group is normal, so each subgroup gives rise to a quotient
group. Subgroups, quotients, and direct sums of finite abelian groups are again abelian.
In general, matrices, even invertible matrices, do not form a finite abelian group under
multiplication because matrix multiplication is generally not commutative. However, some
groups of matrices are finite abelian groups under matrix multiplication - one example is the
group of 2 x 2 rotation matrices.
Example: Find all finite abelian groups of order 108 (up to isomorphism).
Solution: The prime factorization is 108 = 22 · 33. There are two possible groups of order 4: Z4 and Notes
Z2 × Z2 . There are three possible groups of order 27: Z27 , Z9 × Z3 , and Z3 × Z3 × Z3 . This gives us
the following possible groups:
Z4 × Z27
Z2 × Z2 × Z27
Z4 × Z9 × Z3
Z2 × Z2 × Z9 × Z3
Z4 × Z3 × Z3 × Z3
Z2 × Z2 × Z3 × Z3 × Z3 .
Example: Let G and H be finite abelian groups, and assume that G × G is isomorphic to
H × H. Prove that G is isomorphic to H.
Solution: Let p be a prime divisor of |G|, and let q = pm be the order of a cyclic component of G.
If G has k such components, then G × G has 2k components of order q. An isomorphism between
G × G and H × H must preserve these components, so it follows that H also has k cyclic components
of order q. Since this is true for every such q, it follows that G H
Example: Let G be a finite abelian group which has 8 elements of order 3, 18 elements of
order 9, and no other elements besides the identity. Find (with proof) the decomposition of G as
a direct product of cyclic groups.
Solution: We have |G| = 27. First, G is not cyclic since there is no element of order 27. Since there
are elements of order 9, G must have Z9 as a factor. To give a total of 27 elements, the only
possibility is G Z9 × Z3.
Check: The elements 3 and 6 have order 3 in Z9, while 1 and 2 have order 3 in Z3. Thus, the
following 8 elements have order 3 in the direct product: (3, 0), (6, 0), (3, 1), (6, 1), (3, 2), (6, 2),
(0, 1), and (0, 2).
Example: Let G be a finite abelian group such that |G| = 216. If | 6 G | = 6, determine G up
to isomorphism.
Solution: We have 216 = 23 · 33, and 6G Z2 × Z3 since it has order 6. Let H be the Sylow
2-subgroup of G, which must have 8 elements. Then multiplication by 3 defines an automorphism
of H, so we only need to consider 2H. Since 2H Z2, we know that there are elements not of order
2, and that H is not cyclic, since 2 Z8 Z4. We conclude that H Z4 × Z2.
A similar argument shows that the Sylow 3-subgroup K of G, which has 27 elements, must be
isomorphic to Z9 × Z3.
G Z4 × Z2 × Z9 × Z3 .
(If you prefer the form of the decomposition, you can also give the answer in the form G Z36 × Z6.)
Example: Apply both structure theorems to give the two decompositions of the finite
abelian group Z 216
Notes Since 27 is a power of an odd prime, it follows that Z 27 is cyclic. This can also be shown directly
by guessing that 2 is a generator.
Since Z 27 has order 33 - 32 = 18, an element can only have order 1, 2, 3, 6, 9 or 18. We have
22 = 4,
23 = 8,
26 82 10, and
29 23 · 26 8 · 10 -1,
so it follows that 2 must be a generator.
To give the first decomposition, states that any finite abelian group is isomorphic to a direct
product of cyclic groups of prime power order. In this decomposition we need to split Z18 up into
cyclic subgrops of prime power order, so we finally get the decomposition
Z 216 Z2 × Z2 × Z2 × Z9.
On the other hand, the second decomposition, where any finite finite abelian group is written as
a direct product of cyclic groups in which the orders any component is a divisor of the previous
one. To do this we need to group together the largest prime powers that we can. In the first
decomposition, we can combine Z2 and Z9 to get Z18 as the first component. We end up with
Z 216 Z18 × Z2 × Z2
as the second way of breaking Z 216 up into a direct product of cyclic subgroups.
Example: Let G and H be finite abelian groups, and assume that they have the following
property. For each positive integer m, G and H have the same number of elements of order m.
Prove that G and H are isomorphic.
Solution: We give a proof by induction on the order of |G|. The statement is clearly true for
groups of order 2 and 3, so suppose that G and H are given, and the statement holds for all
groups of lower order. Let p be a prime divisor of |G|, and let Gp and Hp be the Sylow
p-subgroups of G and H, respectively. Since the Sylow subgroups contain all elements of order
a power of p, the induction hypothesis applies to Gp and Hp. If we can show that Gp Hp for all p,
then it will follow that G H, since G and H are direct products of their Sylow subgroups.
Let x be an element of Gp with maximal order q = pm. Then < x > is a direct factor of Gp, so there
is a subgroup G’ with Gp = < x > × G’. By the same argument we can write Hp = < y > × H’, where
y has the same order as x.
Now consider < xp > × G’ and < yp > × H’. To construct each of these subgroups we have removed
elements of the form (xk, g’), where xk has order q and g’ is any element of G’. Because x has
maximal order in a p-group, in each case the order of g’ is a divisor of q, and so (xk, g’) has order
q since the order of an element in a direct product is the least common multiple of the orders of
the components. Thus to construct each of these subgroups we have removed (pm – pm-1) · |G’|
elements, each having order q. It follows from the hypothesis that we are left with the same
number of elements of each order, and so the induction hypothesis implies that < xp > × G’ and
< yp > × H’ are isomorphic. But then G’ H’, and so Gp Hp, completing the proof.
Proposition: Every finite abelian group has a natural structure as a module over the ring Z. Notes
As with vector spaces, one goal is to be able to express a finite abelian group in terms of simpler
building blocks. For vector spaces we can use one-dimensional spaces as the building blocks; for
finite abelian groups, it seems natural to use the simple finite abelian groups.
Recall that in an arbitrary group G, a subgroup N G is called a normal subgroup if gxg–1 N,
for all x N and all g G. Then G is said to be a simple group if its only normal subgroups are
{1} and G. If the group A is abelian, then all subgroups are normal, and so A is simple iff its only
subgroups are the trivial subgroup (0) and the improper subgroup A. The same definition is
given for modules: a nonzero module M is a simple module if its only submodules are (0) and
M. When you view a finite abelian group as a Z-module, then, of course, the two definitions
coincide.
Note Any cyclic finite abelian group is isomorphic to Z or Zn, for some n.
Outline of the Proof: Let A be a cyclic finite abelian group that is generated by the single
element a. Define the group homomorphism f : Z A by setting f(n) = na, for all n Z. Note that
f maps Z onto A since f(Z) = Za = A. If f is one-to-one, then A is isomorphic to Z. If f is not
one-to-one, we need to use the fundamental homomorphism theorem and the fact that every
subgroup of Z is cyclic to show that A is isomorphic to Zn, where n is the smallest positive
integer such that na = 0.
Proposition: A finite abelian group is simple iff it is isomorphic to Zp, for some prime number p.
Proof: First, let A be a finite abelian group isomorphic to Zp, where p is a prime number. The
isomorphism preserves the subgroup structure, so we only need to know that Zp has no proper
nontrivial subgroups. This follows from the general correspondence between subgroups of Zn
and divisors of n, since p is prime precisely when its only divisors are ±1 and ±p, which correspond
to the subgroups Zp and (0), respectively.
Conversely, suppose that A is a simple finite abelian group. Since A is nonzero, pick any
nonzero element a A. Then the set Za = {na | n Z} is a nonzero subgroup of A, so by
assumption it must be equal to A. This shows that A is a cyclic group. Furthermore, A can’t be
infinite, since then it would be isomorphic to Z and would have infinitely many subgroups. We
conclude that A is finite, and hence isomorphic to Zn, for some n. Once again, the correspondence
between subgroups of Zn and divisors of n shows that if Zn is simple, then n must be a prime
number.
Example: The group Z4 is not a semisimple Z-module. First, Z4 is not a simple group.
Secondly, it cannot be written non-trivially as a direct sum of any subgroups, since its subgroups
lie in a chain Z4 2Z4 (0), and no two proper nonzero subgroups intersect in (0).
Notes The function defined in the example is a special case of a more general result that is usually
referred to as the Chinese remainder theorem (this result is given more generally for rings. The
proof of the next proposition makes use of the same function.
Proposition: If k = mn, where m and n are relatively prime integers, then Zk is isomorphic to
Zm Zn.
Outline of the Proof: Define f : Zk Zm Zn by f([x]k) = ([x]m, [x]n), for all x Z. Here I have been
a bit more careful, by using [x]k to denote the congruence class of x, modulo k. It is not hard to
show that f preserves addition. The sets Zk and Zm Zn are finite and have the same number of
elements, so f is one-to-one iff it is onto, and therefore proving one of these conditions will give
the other. (Actually, it isn’t hard to see how to prove both conditions.) Showing that f is one-to-
one depends on the fact that if x is an integer having both m and n as factors, then it must have
mn as a factor since m and n are relatively prime. On the other hand, the usual statement of the
Chinese remainder theorem is precisely the condition that f is an onto function.
Corollary: Any finite cyclic group is isomorphic to a direct sum of cyclic groups of prime power
order.
The corollary depends on an important result in Z: every positive integer can be factored into a
product of prime numbers. Grouping the primes together, the proof of the corollary uses induction
on the number of distinct primes in the factorization.
This basic result has implications for all finite groups. The cyclic group Zn also has a ring
structure, and the isomorphism that proves the corollary is actually an isomorphism of rings,
not just of finite abelian groups. To use this observation, suppose that A is a finite finite abelian
group. Let n be the smallest positive integer such that na = 0 for all a A. (This number might be
familiar to you in reference to a multiplicative group G, where it is called the exponent of the
group, and is the smallest positive integer n such that gn = 1 for all g G.)
You can check that because na = 0 for all a A, we can actually give A the structure of a
Zn-module.
Next we can apply a general result that if a ring R can be written as a direct sum R = I1 . . .In
of two-sided ideals, then each Ij is a ring in its own right, and every left R-module M splits up
into a direct sum M1 . . . Mn, where Mj is a module over Ij . Applying this to Zn, we can write
Zn as a direct sum of rings of the form Zpk , where p is a prime, and then the group A breaks up
into A1 . . . An, where each Aj is a p-group, for some prime p. (Recall that a group G is a
p-group if every element of G has order p.) This argument proves the next lemma. (You can also
prove it using Sylow subgroups, if you know about them.)
Theorem 1: Any finite abelian group is isomorphic to a direct sum of cyclic groups of prime
power order.
A proof of the fundamental structure theorem, let us first discuss some of the directions it
suggests for module theory. First of all, the hope was to construct finite abelian groups out of
ones of prime order, not prime power order. The only way to do this is to stack them on top of
each other, instead of having a direct sum in which the simple groups are lined up one beside the
other. To see what I mean by “stacking” the groups, think of Z4 and its subgroups Z4 2Z4 (0).
It might be better to picture them vertically.
Z4 Notes
|
2Z4
|
(0)
The subgroup 2Z4 = {0, 2} Z2 is simple, and so is the factor module Z4/2Z4 Z2. This having Z2
stacked on top of Z2, and the group is structured so tightly that you can’t even find an isomorphism
to rearrange the factors.
A module M is said to have a composition series of length n if there is a chain of submodules M
= M0 M1 . . . Mn = (0) for which each factor module Mi–1/Mi is a simple module. Thus, we
would say that Z4 has a composition series of length 2. This gives a measurement that equals the
dimension, in the case of a vector space. It is also true that the length of a cyclic group of order
pn is precisely n. It can be shown that if M has a composition series of length n, then every other
composition series also has length n, so this is an invariant of the module. Furthermore, the
same simple modules show up in both series, with the same multiplicity.
The idea of a composition series is related to two other conditions on modules. A module is said
to satisfy the ascending chain condition, or ACC, if it has no infinite chain of ascending submodules;
it is said to satisfy the descending chain condition, or DCC, if it has no infinite chain of descending
submodules. Modules satisfying these conditions are called Noetherian or Artinian, respectively.
A module has finite length iff it satisfies both the ACC and DCC. As an example to keep in mind,
let’s look at the ring of integers, which has ACC but not DCC. Since mZ nZ iff n | m,
generators get smaller as you go up in Z, and larger as you go down. Any set of positive integers
has a smallest element, so we can’t have any infinite ascending chains, but, for example, we can
construct the infinite descending chain 2Z 4Z 8Z ... .
The cyclic groups of prime power order play a crucial role in the structure of finite abelian
groups precisely because they cannot be split up any further. A module M can be expressed as a
direct sum of two submodules M1 and M2 iff M1 M2 = (0) and M1 + M2 = M. In the case of a cyclic
group of prime power order, the subgroups form a descending chain, and so any two nonzero
subgroups have a nonzero intersection. A module is called indecomposable if it cannot be
written as a direct sum of two nonzero submodules. With this terminology, the cyclic groups of
prime power order are precisely the indecomposable finite abelian groups. The major results in
this direction are (the Krull-Schmidt theorem), which show that any module with finite length
can be written as a direct sum of indecomposable submodules, and this decomposition is unique
up to isomorphism and the order of the summands.
After this rather lengthy preview, or review, as the case may be, it is time to move on to study
general rings and modules. The next results present a proof of the structure theorem for finite
abelian groups, but you should feel free to skip them.
(a) Let a A be an element of maximal order, and let b + Za be any coset of A/Za. Then there
exists d A such that d + Za = b + Za and Zd Za = (0).
(b) Let a A be an element of maximal order. Then there exists a subgroup B with A Za B.
Proof: (a) The outline of part (a) is to let s be the smallest positive integer such that sb Za. Then
we solve the equation sb = sx for elements x Za and let d = b – x.
Notes Using o(x) for the order of an element x, let s be the order of b + Za in the factor group G/Za.
Then sb Za, and we can write sb = (qt)a for some exponent qt such that t = p for some and
p | q. Then qa is a generator for Za, since q is relatively prime to o(a). Since s is a divisor of the
order of b, we have o(b)/s = o(sb) = o((qt)a) = o(a)/t, or simply, o(b) . t = o(a) . s. All of these are
powers of p, and so o(b) o(a) implies that s|t, say t = ms. Then x = (qm)a is a solution of the
equation sb = sx. If d = b – x, then d + Za = b + Za and so sd = sb – sx = sb – sb = 0. Therefore,
Zd Za = (0), since nd Za implies n(b – x) = nb – nx Za. Thus, nb Za implies n(b + Za) = Za
in G/Za, so s|n and nd = 0.
(b) The outline of this part is to factor out Za and use induction to decompose A/Za into a direct
sum of cyclic groups. Then part (a) can be used to choose the right preimages of the generators
of A/Za to generate the complement B of Za.
We use induction on the order of A. If |A| is prime, then A is cyclic and there is nothing to prove.
Consequently, we may assume that the statement of the lemma holds for all groups of order less
than |A| = p. If A is cyclic, then we are done. If not, let Za be a maximal cyclic subgroup, and use
the induction hypothesis repeatedly to write A/Za as a direct sum B1 B2 . . . Bn of cyclic
subgroups.
We next use part (a) to choose, for each i, a coset ai + Za that corresponds to a generator of Ai such
that Zai Za = (0). We claim that A Za B for the smallest subgroup B = Za1 + Za2 + · · · + Zan
that contains a1, a2, . . ., an.
First, if x Za (Za1 +· · · + Zan), then x = m1a1 +· · · + mnan Za for some coefficients m1, . . . ,mn.
Thus x + Za = (m1a1 + · · · + mnan) + Za = Za, and since A/Za is a direct sum, this implies that
miai + Za = Za for each i. But then miai Za, and so miai = 0 since Zai Za = (0). Thus x = 0.
Next, given x A, express the coset x + Za as (m1a1 +· · · + mnan) + Za for coefficients m1, . . ., mn.
Then x xZa, and so x = ma + m1a1 + · · · + mnan for some m.
Uniqueness is shown by induction on |A|. It is enough to prove the uniqueness for a given
p-group. Suppose that
Zp1 Zp2 · · · Zpn = Zp1 Zp2 · · · Zpm
where 1 2 . . . n and 1 2 . . . m. Consider the subgroups in which each element has
been multiplied by p. By induction, 1 – 1 = 1 – 1, . . ., which gives 1 = 1, . . ., with the possible
exception of the i’s and j’s that equal 1. But the groups have the same order, and this determines
that each has the same number of factors isomorphic to Zp. This completes the proof.
Self Assessment
1. A .................. is a set, A together with an operations “.”. That combines any two elements a
and b to form another element denoted a.b.
2. In a finite abelian group, each element is in a conjugacy class by itself and the character Notes
table involve powers of a single element known as a ..................
(a) group generator (b) group connector
(c) group and subgroup (d) normal group element
3. In mathematica, the function finite abelian group {n1, n2, .... } represents .................. product
of the cyclic group of degree n1n2 ..................
(a) direct (b) indirect
(c) single (d) external
4. In commutative ring .................. the elements, or unit, from an abelian multiplication
groups.
5. Every subgroup of a finite abelian group is normal, so each subgroup gives rest to a
.................. group.
(a) cyclic (b) permutation
10.4 Summary
A finite abelian group is a set, A, together with an operation “•” that combines any two
elements a and b to form another element denoted a • b. The symbol “•” is a general
placeholder for a concretely given operation. To qualify as a finite abelian group, the set
and operation, (A, •), must satisfy five requirements known as the finite Abelian group
axioms.
Generally, the multiplicative notation is the usual notation for groups, while the additive
notation is the usual notation for modules. The additive notation may also be used to
emphasize that a particular group is abelian, whenever both abelian and non-finite abelian
groups are considered.
For the integers and the operation addition “+”, denoted (Z,+), the operation + combines
any two integers to form a third integer, addition is associative, zero is the additive
identity, every integer n has an additive inverse, “n, and the addition operation is
commutative since m + n = n + m for any two integers m and n.
10.5 Keywords
Finite Abelian Group: A finite abelian group is a set, A, together with an operation “•” that
combines any two elements a and b to form another element denoted a • b.
Multiplication: The multiplicative notation is the usual notation for groups, while the additive
notation is the usual notation for modules.
Cyclic Group: Every cyclic group G is abelian, because if x, y are in G, then xy = aman = am + n =
an + m = anam = yx.
1. Compute all possible finite abelian groups of order n. What is the largest n for which it
will work?
2. Find all finite abelian group of order less than or equal to 40 up to isomorphism.
archives.math.utk.edu
CONTENTS
Objectives
Introduction
11.3 Keywords
Objectives
Introduction
In the last unit, you have studied about finite abelian group. If G is a group and X is an arbitrary
set, a group action of an element g G and x X is a product, gx giving in x many problem in
algebra may best be attached in group actions. In this unit, you will get the information related
to conjugate elements.
Proposition 1:
(b) Conjugacy of subgroups defines an equivalence relation on the set of all subgroups of G.
Definition: Let G be a group. For any element x in G, the set
{ a in G | axa-1 = x }
{ a in G | aHa-1 = H }
Notes Proposition 2: Let G be a group and let x be an element of G. Then C(x) is a subgroup of G.
Proposition 3: Let x be an element of the group G. Then the elements of the conjugacy class of x
are in one-to-one correspondence with the left cosets of the centralizer C(x) of x in G.
Example: Two permutations are conjugate in Sn if and only if they have the same shape
(i.e., the same number of disjoint cycles, of the same lengths). Thus, in particular, cycles of the
same length are always conjugate.
Theorem 1: [Conjugacy class Equation] Let G be a finite group. Then
| G | = | Z(G) | + [ g : C(x) ]
where the sum ranges over one element x from each nontrivial conjugacy class.
Definition: A group of order pn, with p a prime number and n 1, is called a p-group.
Theorem 2: [Burnside] Let p be a prime number. The center of any p-group is nontrivial.
Corollary 1: Any group of order p2 (where p is prime) is abelian.
Theorem 3: [Cauchy] If G is a finite group and p is a prime divisor of the order of G, then G
contains an element of order p.
Example: Prove that if the center of the group G has index n, then every conjugacy class of
G has at most n elements.
Solution: The conjugacy class of an element a in G has [G : C(a)] elements. Since the center Z(G)
is contained in C(a), we have [G : C(a)] [G : Z(G)] = n. (In fact, [G : C(a)] must be a divisor of n.)
Example: Find all finite groups that have exactly two conjugacy classes.
Solution: Suppose that |G| = n. The identity element forms one conjugacy class, so the second
conjugacy class must have n-1 elements. But the number of elements in any conjugacy class is a
divisor of |G|, so the only way that n-1 is a divisor of n is if n = 2.
Example: Let G = D12, given by generators a, b with |a|=6, |b|=2, and ba=a-1b. Let H =
{ 1, a , b, a3b }. Find the normalizer of H in G and find the subgroups of G that are conjugate to H.
3
Solution: The normalizer of H is a subgroup containing H, so since H has index 3, either NG (H)
= H or NG (H) = G. Choose any element not in H to do the first conjugation.
This computation shows that a is not in the normalizer, so NG (H) = H. Conjugating by any
element in the same left coset aH = { a, a4, ab, a4b } will give the same subgroup. Therefore, it
makes sense to choose a2 to do the next computation.
a2Ha-2 = { 1, a3, a2ba4, a2(a3b)a4 } = { 1, a3, a4b, ab }
Comment: It is interesting to note that an earlier problem shows that b, a2b, and a4b form one
conjugacy class, while ab, a3b, and a5 b form a second conjugacy class. In the above computations,
notice how the orbits of individual elements combine to give the orbit of a subgroup.
Example: Write out the class equation for the dihedral group Dn. Note that you will need
two cases: when n is even, and when n is odd.
Solution: When n is odd the center is trivial and elements of the form ai b are all conjugate. Notes
Elements of the form ai are conjugate in pairs; am a-m since a2m 1. We can write the class
equation in the following form:
|G| = 1 + ((n-1)/2) · 2 + n
When n is even, the center has two elements. (The element an/2 is conjugate to itself since it is
equal to a-n/2. This shows that Z(G) = { 1, an/2 }.) Therefore, elements of the form ai b split into two
conjugacy classes. In this case the class equation has the following form:
|G| = 2 + ((n-2)/2) · 2 + 2 · (n/2)
Example: Show that for all n 4, the centralizer of the element (1,2)(3,4) in Sn has order
8· (n-4)!. Determine the elements in the centralizer of ((1,2)(3,4)).
Solution: The conjugates of a = (1,2)(3,4) in Sn are the permutations of the form (a,b) (c,d). The
number of ways to construct such a permutation is
n(n-1)/2 · (n-2)(n-3)/2 · 1/2 ,
and dividing this into n! gives the order 8 · (n-4)! of the centralizer.
We first compute the centralizer of a in S4. The elements (1, 2) and (3, 4) clearly commute with
(1, 2) (3, 4). Note that a is the square of b = (1, 3, 2, 4); it follows that the centralizer contains
< b >, so b3 = (1, 4, 2, 3) also belongs. Computing products of these elements shows that we must
include (1, 3)(2, 4) and (1, 4)(2, 3), and this gives the required total of 8 elements.
To find the centralizer of a in Sn, any of the elements listed above can be multiplied by any
permutation disjoint from (1, 2)(3, 4). This produces the required total |C(a)| = 8 · (n-4)!.
Self Assessment
Let G be a group, and let x,y be elements of G. Then y is said to be a conjugate of x if there
exists an element a in G such that y = axa-1.
If H and K are subgroups of G, then K is said to be a conjugate subgroup of H if there exists
an element a in G such that K = aHa-1.
Conjugacy of elements defines an equivalence relation on any group G.
Conjugacy of subgroups defines an equivalence relation on the set of all subgroups of G.
Let G be a group. For any element x in G, the set
{ a in G | axa-1 = x }
{ a in G | aHa-1 = H }
is called the normalizer of H in G, denoted by N(H).
Let x be an element of the group G. Then the elements of the conjugacy class of x are in
one-to-one correspondence with the left cosets of the centralizer C(x) of x in G.
11.3 Keywords
{ a in G | axa-1 = x }
is called the centralizer of x in G, denoted by C(x).
1. Compute the G-equivalence classes of X for each of the G-sets X = {1, –2, 24, 5, 6} and
G = {(1), (1, 2) (3, 4, 5) ; (1 2) (3 4 5), (1 2) (3 8 4)} for each x X verify |G| = |Ox| |Gx|.
5. Let |G| = pn and suppose that |Z(G)| = pn-1 for p prime. Prove that G is abelian.
6. Let G be a group with order pn, where p is prime and X a finite G-set. If XG = {x X : gx = x
for all g G} is the set of elements in X fixed by the group actions, then prove that
|X| = |XG| (modp).
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
12.2 Summary
12.3 Keywords
Objectives
Introduction
We already know that the converse of Lagrange’s Theorem is false. If G is a group of order m and
n divides m, then G does not necessarily possess a subgroup of order n. For example, A4 has
order 12 but does not possess a subgroup of order 6. However, the Sylow Theorems do provide
a partial converse for Lagrange’s Theorem: in certain cases they guarantee us subgroups of
specific orders. These theorems yield a powerful set of tools for the classification of all finite
non-abelian groups.
We will use the idea of group actions to prove the Sylow Theorems. Recall for a moment what
it means for G to act on itself by conjugation and how conjugacy classes are distributed in the
group according to the class equation. A group G acts on itself by conjugation via the map (g, x)
gxg-1. Let x1,...,xk be representatives from each of the distinct conjugacy classes of G that consist
of more than one element. Then the class equation can be written as
|G| = |Z(G)| + [G : C(x1)] + ... + [G : C(xk)],
where Z(G) = {g G : gx = xg for all x G} is the center of G and C(xi) = {g G : gxi = xig} is the
centralizer subgroup of xi.
We now begin our investigation of the Sylow Theorems by examining subgroups of order p,
where p is prime. A group G is a p-group if every element in G has as its order a power of p,
where p is a prime number. A subgroup of a group G is a p-subgroup if it is a p-group.
In this handout, we give proofs of the three Sylow theorems which are slightly different from
the ones in the book. Recall the following lemma:
Lemma: Let p be a prime number, and let G be a p-group (a finite group of order pk for some
k 1) acting on a finite set S. Then the number of fixed points of the action is congruent to |S|
modulo p.
We make the following definition: if G has order pkm with p | m, a Sylow p-subgroup of G is
a subgroup of order pk.
Theorem (Sylow’s First Theorem): If G is a finite group of order n = pkm with p prime and
p | m, then G has a subgroup of order pk. In other words, if Sylp(G) denotes the set of Sylow
Proof. The proof is by induction on |G|, the base case |G| = 1 being trivial. If there exists a
proper subgroup H of G such that p | [G : H], then a Sylow p-subgroup of H is also a a Sylow
p-subgroup of G and we’re finished by induction. So without loss of generality, we may assume
that p | [G : H] whenever H < G. From the class equation, it follows that p | |ZG|. By Cauchy’s
theorem, there exists a subgroup N ZG of order p, which is necessarily normal in G. Let G =
G/N, so | G | = pk–1m. By induction, G has a subgroup P of order pk–1. Let P be the subgroup
of G containing N which corresponds to P by the first isomorphism theorem. Then
|P| = |P/N| . |N| = pk–1 . p = pk,
Proof: We show more precisely that if H is any subgroup of G of p-power order and P is any
Sylow p-subgroup of G, then there exists x G such that H xPx–1. (This implies the theorem,
since if H Sylp(G) then |H| = |P| = |xPx–1|, which implies that H = xPx–1, so that H is conjugate
to P.) Note that H acts on G/P (the set of left cosets of P in G) by left multiplication. Let Fix denote
the elements of G/P fixed by this action. Then |Fix| |G/P| (mod p) by the Lemma. Since
p | m = |G/P|, |Fix| 0, and thus Fix 0 ;. Let xP be a left coset fixed by the action. Then
hxP = xP h H x–1 Hx P,
Theorem (Sylow’s Third Theorem): If G is a finite group and p is a prime number, let
np = |Sylp(G)|. Then np | |G| and np 1 (mod p).
Proof: We consider the action of G on Sylp(G) by conjugation. By the second Sylow theorem, this
action is transitive, so there is just one orbit. Hence np, which is the size of this orbit, divides |G|.
To prove the congruence np 1 (mod p), we fix a Sylow p-subgroup P Sylp(G) and consider the
action of P on Sylp(G) by conjugation. Let Fix denote the set of fixed points of this action. Note
that Q Fix P NG(Q), and in particular P Fix. If Q Fix, then P, Q NG(Q) are both Sylow
p-subgroups of NG(Q), so they are conjugate in NG(Q) (again by the second Sylow theorem). But
Q is a normal subgroup of NG(Q), so P = Q. Thus Fix = {P}, and in particular |Fix| = 1. By the
Lemma, np 1 (mod p) as desired.
Notes The more precise fact established in our proof of Sylow’s Second Theorem yields the following
useful result:
Corollary: If G is a finite group and p is a prime number, then any subgroup of G of p-power
order is contained in some Sylow p-subgroup.
Since G acts transitively by conjugation on Sylp(G), and the stabilizer of P Sylp(G) is NG(P), we
deduce that np = [G : NG(P)] for any P Sylp(G).
Therefore:
Corollary: If G is a finite group and p is a prime number, let n p be the number of Sylow
p-subgroups of G. Then the following are equivalent:
1. np = 1.
2. Every Sylow p-subgroup of G is normal.
Example: By direct computation, find the number of Sylow 3-subgroups and the number
of Sylow 5-subgroups of the symmetric group S5. Check that your calculations are consistent
with the Sylow theorems.
Solution: In S5 there are ( 5 · 4 · 3 ) / 3 = 20 three cycles. These will split up into 10 subgroups of
order 3. This number is congruent to 1 mod 3, and is a divisor of 5 · 4 · 2.
There are ( 5! ) / 5 = 24 five cycles. These will split up into 6 subgroups of order 5. This number
is congruent to 1 mod 5, and is a divisor of 4 · 3 · 2.
Example: How many elements of order 7 are there in a simple group of order 168?
Solution: First, 168 = 23 . 3 . 7. The number of Sylow 7-subgroups must be congruent to 1 mod 7
and must be a divisor of 24. The only possibilities are 1 and 8. If there is no proper normal
subgroup, then the number must be 8. The subgroups all have the identity in common, leaving
8 · 6 = 48 elements of order 7.
Example: Prove that a group of order 48 must have a normal subgroup of order 8 or 16.
Solution: The number of Sylow 2-subgroups is 1 or 3. In the first case there is a normal subgroup
of order 16. In the second case, let G act by conjugation on the Sylow 2-subgroups. This produces
a homomorphism from G into S3. Because of the action, the image cannot consist of just 2
elements. On the other hand, since no Sylow 2-subgroup is normal, the kernel cannot have 16
elements. The only possibility is that the homomorphism maps G onto S3, and so the kernel is a
normal subgroup of order 48 / 6 = 8.
Example: Let G be a group of order 340. Prove that G has a normal cyclic subgroup of
order 85 and an abelian subgroup of order 4.
Solution: First, 340 = 22 . 5 . 17. There exists a Sylow 2-subgroup of order 4, and it must be abelian.
No divisor of 68 = 22 . 17 is congruent to 1 mod 5, so the Sylow 5-subgroup is normal. Similarly,
then Sylow 17-subgroup is normal. These subgroups have trivial intersection, so their product
is a direct product, and hence must be cyclic of order 85 = 5 . 17. The product of two normal
subgroups is again normal, so this produces the required normal subgroup of order 85.
Notes
Example: Show that there is no simple group of order 200.
Solution: Since 200 = 23 . 52, the number of Sylow 5-subgroups is congruent to 1 mod 5 and a
divisor of 8. Thus there is only one Sylow 5-subgroup, and it is a proper nontrivial normal
subgroup.
Example: Show that a group of order 108 has a normal subgroup of order 9 or 27.
Solution: Let S be a Sylow 3-subgroup of G. Then [G:S] = 4, since |G| = 22 33, so we can let G act
by multiplication on the cosets of S. This defines a homomorphism µ : G -> S4, so it follows that
| µ(G) | is a divisor of 12, since it must be a common divisor of 108 and 24. Thus | ker(µ) | 9,
and it follows that ker(µ) S, so | ker(µ) | must be a divisor of 27. It follows that | ker(µ) | = 9
or | ker(µ) | = 27.
Example: If p is a prime number, find all Sylow p-subgroups of the symmetric group Sp.
Solution: Since |Sp| = p!, and p is a prime number, the highest power of p that divides |Sp| is p.
Therefore, the Sylow p-subgroups are precisely the cyclic subgroups of order p, each generated
by a p-cycle. There are (p-1)! = p! / p ways to construct a p-cycle (a1, . . . , ap). The subgroup
generated by a given p-cycle will contain the identity and the p-1 powers of the cycle. Two
different such subgroups intersect in the identity, since they are of prime order, so the total
number of subgroups of order p in Sp is (p-2)! = (p-1)! / (p-1).
Example: Prove that if G is a group of order 56, then G has a normal Sylow 2-subgroup
or a normal Sylow 7-subgroup.
Solution: The number of Sylow 7-subgroups is either 1 or 8. Eight Sylow 7-subgroups would
yield 48 elements of order 7, and so the remaining 8 elements would constitute the (unique)
Sylow 2-subgroup.
Solution: Suppose that N contains the Sylow p-subgroup P. Then since N is normal it also
contains all of the conjugates of P. But this means that N contains all of the Sylow p-subgroups
of G, since they are all conjugate. We conclude that N and G have the same number of Sylow
p-subgroups.
Example: Prove that if G is a group of order 105, then G has a normal Sylow 5-subgroup
and a normal Sylow 7-subgroup.
Solution: The notation np(G) will be used for the number of Sylow p-subgroups of G. Since 105
= 3 · 5 · 7, we have n3(G) = 1 or 7, n5(G) = 1 or 21, and n7(G) = 1 or 15 for the numbers of Sylow
subgroups. Let P be a Sylow 5-subgroup and let Q be a Sylow 7-subgroup. At least one of these
subgroups must be normal, since otherwise we would have 21 · 4 elements of order 5 and 15 · 6
elements of order 7. Therefore, PQ is a subgroup, and it must be normal since its index is the
smallest prime divisor of |G|, so we can apply the result in the previous problem. Since PQ is
normal and contains a Sylow 5-subgroup, we can reduce to the number 35 when considering the
number of Sylow 5-subgroups, and thus n5(G) = n5(PQ) = 1. Similarly, since PQ is normal and
contains a Sylow 7-subgroup, we have n7(G) = n7(PQ) = 1.
(a) 7 (b) 8
(c) 9 (d) 48
12.2 Summary
Let G be a finite group and p a prime such that p divides the order of G. Then G contains a
subgroup of order p.
(First Sylow Theorem) Let G be a finite group and p a prime such that pr divides |G|.
Then G contains a subgroup of order pr.
Let P be a Sylow p-subgroup of a finite group G and let x have as its order a power of p.
If x-1Px = P. Then x P.
Let H and K be subgroups of G. The number of distinct H-conjugates of K is
[H : N(K) H].
(Second Sylow Theorem) Let G be a finite group and p a prime dividing |G|. Then all
Sylow p-subgroups of G are conjugate. That is, if P1 and P2 are two Sylow p-subgroups,
there exists a g G such that gP1g-1 = P2.
12.3 Keywords
Cauchy: Let G be a finite group and p a prime such that p divides the order of G. Then G contains
a subgroup of order p.
First Sylow Theorem: Let G be a finite group and p a prime such that pr divides |G|. Then G
contains a subgroup of order pr.
1. What are the order of all Sylow p-subgroups where G has order 18, 24, 54 and 80?
2. Find all the Sylow 3-subgroups of S4 and show that they are all conjugate.
3. Show that every group of order 45 has a normal subgroup of order 9.
4. Let H be a Sylow p-subgroup of G. Prove that H ps the only Sylow p-subgroup of G
contained in N(H).
5. Prove that no group of order 96 is simple.
6. If H is normal subgroup of a finite group G and |H| = pk for some prime p, show that H is
a contained in every Sylow p-subgroup of G.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
13.3 Keywords
Objectives
Introduction
In the earlier unit, you have studied about the conjugate elements and Sylow’s Theorem. This
unit will equip you with more information related to solvable group.
Definition: The group G is said to be solvable if there exists a finite chain of subgroups
G = N0 N1 ··· Nn such that
(i) Ni is a normal subgroup in Ni-1 for i = 1, 2, ... ,n,
(iii) Nn = {e}.
Proposition: A finite group G is solvable if and only if there exists a finite chain of subgroups
G = N0 N1 ... Nn such that
(iii) Nn = {e}.
for elements a, b in G.
The smallest subgroup that contains all commutators of G is called the commutator subgroup or Notes
derived subgroup of G, and is denoted by G’.
Proposition: Let G be a group with commutator subgroup G’.
(a) The subgroup G’ is normal in G, and the factor group G/G’ is abelian.
(b) If N is any normal subgroup of G, then the factor group G/N is abelian if and only if
G’ N.
Definition: Let G be a group. The subgroup (G’ )’ is called the second derived subgroup of G. We
define G(k) inductively as (G(k-1))’, and call it the k th derived subgroup.
Theorem 2: A group G is solvable if and only if G(n) = {e} for some positive integer n.
Corollary: Let G be a group.
Definition: Let G be a group. A chain of subgroups G = N0 N1 ... Nn such that
(iii) Nn = {e}
is called a composition series for G.
The factor groups Ni-1 / Ni are called the composition factors determined by the series.
Theorem 3: [Jordan-Hölder] Any two composition series for a finite group have the same
length. Furthermore, there exists a one-to-one correspondence between composition factors of
the two composition series under which corresponding composition factors are isomorphic.
Example: Let p be a prime and let G be a non-abelian group of order p3. Show that the
center Z(G) of G equals the commutator subgroup G’ of G.
Solution: Since G is non-abelian, we have |Z(G)| = p. (The center is nontrivial, and if |Z(G)| =
p2, then G/Z(G) is cyclic, the text implies that G is abelian.) On the other hand, any group of
order p2 is abelian, so G/Z(G) is abelian, which implies that G’ Z(G). Since G is nonabelian, G’ {e},
and therefore G’ = Z(G).
Case 2: If x = ai and y = ajb, then xyx-1y-1 = aiajba-iajb = aiajaibajb = aiajaia-jb2 = a2i, and thus each even
power of a is a commutator.
Case 3: If x = ajb and y = ai, we get the inverse of the element in Case 2.
Example: Prove that any group of order 588 is solvable, given that any group of order 12
is solvable.
We have 588 = 22 · 3 · 72. Let S be the Sylow 7-subgroup. It must be normal, since 1 is the only
divisor of 12 that is 1 mod 7. By assumption, G / S is solvable since | G / S | = 12. Furthermore,
S is solvable since it is a p-group. Since both S and G / S are solvable, it follows from Corollary
that G is solvable.
Example: Let G be a group of order 780=22 · 3 · 5 · 13. Assume that G is not solvable. What
are the composition factors of G? (Assume that the only nonabelian simple group of order 60 is
A5.)
The Sylow 13-subgroup N is normal, since 1 is the only divisor of 60 that is 1 mod 13. Using the
fact that the smallest simple nonabelian group has order 60, we see that the factor G/N must be
simple, since otherwise each composition factor would be abelian and G would be solvable.
Thus the composition factors are Z13 and A5.
Theorem-[Jordan-Hölder] Any two composition series for a finite group have the same length.
Furthermore, there exists a one-to-one correspondence between composition factors of the two
composition series under which corresponding composition factors are isomorphic.
Let |G| = N. We first prove existence, using induction on N. If N = 1 (or, more generally, if G is
simple) the result is clear. Now suppose G is not simple. Choose a maximal proper normal
subgroup G1 of G. Then G1 has a Jordan-Hölder decomposition by induction, which produces a
Jordan-Hölder decomposition for G.
To prove uniqueness, we use induction on the length n of the decomposition series. If n=1 then
G is simple and we are done. For n > 1, suppose that
G G1 G2 Gn = {1}
and
G G1 G2 Gm=1
are two decompositions of G . If G1 = G1 then we’re done (apply the induction hypothesis to G1),
so assume G1/G1 . Set H : = G1 G1 and choose a decomposition series H H1 Hk = {1} for H.
By the second isomorphism theorem, G1/H=G1G1/G1=G/G1 (the last equality is because G1G1
is a normal subgroup of G properly containing G1). In particular, H is a normal subgroup of G1
with simple quotient. But then
G1 G2 ... Gn
and
G1 H ... Hk
are two decomposition series for G1, and hence have the same simple quotients by the induction
hypothesis; likewise for the G1 series. Therefore, n=m. Moreover, since G/G1=G1/H and
G/G1=G1/H (by the second isomorphism theorem), we have now accounted for all of the
simple quotients, and shown that they are the same.
13.2 Summary
The group G is said to be solvable if there exists a finite chain of subgroups G = N0 N1
··· Nn such that
(i) Ni is a normal subgroup in Ni-1 for i = 1, 2, ..., n,
(ii) Ni-1 / Ni is abelian for i = 1, 2, ..., n, and
(iii) Nn = {e}.
A finite group G is solvable if and only if there exists a finite chain of subgroups G = N0
N1 ... Nn such that
(i) Ni is a normal subgroup in Ni-1 for i = 1, 2, . . ., n,
(ii) Ni-1 / Ni is cyclic of prime order for i = 1, 2, . . ., n, and
(iii) Nn = {e}.
Let p be a prime number. Any finite p-group is solvable.
Let G be a group. An element g in G is called a commutator if
g = aba-1b-1
for elements a,b in G.
The smallest subgroup that contains all commutators of G is called the commutator
subgroup or derived subgroup of G, and is denoted by G’.
Let G be a group. A chain of subgroups G = N0 N1 ... Nn such that
(i) Ni is a normal subgroup in Ni-1 for i = 1, 2, . . ., n,
(ii) Ni-1 / Ni is simple for i = 1, 2, . . ., n, and
(iii) Nn = {e}
is called a composition series for G.
The factor groups Ni-1 / Ni are called the composition factors determined by the series.
Commutator Subgroup: The smallest subgroup that contains all commutators of G is called the
commutator subgroup or derived subgroup of G, and is denoted by G’.
Let G be a group. A chain of subgroups G = N0 N1 ... Nn such that
Z60 { 3 } { 15 } { 0 }
Z60 { 4 } { 20 } { 0 }
3. If G has a composition series and if N is a proper normal subgroup of G, Show the n exists
a composition series containing N.
4. Let N be a normal subgroup of G. If N and G/N have composition series, then G must also
have a composition series.
5. Let N be a normal subgroup of G if N and G/N are solvable groups. Show that G is also
solvable group.
6. Prove that G is a solvable group if and only if G has a series of subgroups G = Pn Pn-1
... P1 P0 = { e }
where pi is normal in pi+1 and the order pi+1/pi is prime.
Notes
CONTENTS
Objectives
Introduction
14.4 Summary
14.5 Keywords
Objectives
Define and give examples of commutative rings, rings with identity and commutative
rings with identity
Introduction
With this unit, we start the study of algebraic system with two binary operations satisfying
certain properties. Z, Q and R are examples of such a system, which we shall call a ring.
Now, you know that both addition and multiplication are binary operations on Z. Further, Z is
an abelian group under addition. Though it is not a group under multiplication, multiplication
is associative. Also, addition and multiplication are related by the distributive laws
for all integers a, b and c. We generalise these very properties of the binary operations to define
a ring in general. This definition is given by the famous algebraist Emmy Noether.
After defining rings we will provide several examples of rings. You will also learn about
some propertics of rings that follow from the definition itself. Finally, we shall discuss certain
types of rings that are obtained when we impose more restrictions on the “multiplication” in
the ring.
As the contents suggest, this unit lays the foundation for the rest of this course. So make sure that
you have attained the following objectives before going to the next unit.
You are familiar with Z, the set of integers. You also know that it is a group with respect to
addition. Is it a group with respect to multiplication too? No. But multiplication is associative
and distributes over addition. These properties of addition and multiplication of integers allow
us to say that the system (Z, +, .) is a ring. But, what do we mean by a ring?
Definition: A non-empty set R together with two binary operations, we mean usually called
addition (denoted by f) and multiplication (denoted by .), is called a ring if the following axioms
are satisfied:
R 1) a + b = b + a for all a, b in R, i.e., addition is commutative.
R 2) (a + b) + c = a + (b + c) for all a, b, c in R, is., addition is associative.
R 4) For each a in R, there exists x in R such that a + x =: 0 = x + a, i.e., every elements of R has
an additive inverse.
R 5) (a . b).c = a.(b . c) for all a, b, c in R, i.e., multiplication is associative.
R 6) a.(b + c) = a . b + a . c, and
(atb).i=a.d+b.c
for all a, b, c in R,
i.e., multiplication distributes over addition from the left as well as the right.
The axioms RI-R4 say that (R, +) is an abelian group. The axiom R5 says that multiplication is
associative. Hence, we can say that the system (R, +, .) is a ring if
(i) (R, +) is an abelian group,
Let us look at some examples of rings now. You have already seen that Z is a ring. What about
the sets Q and R? Do (Q, +, .) and (R, +, .) satisfy the axioms R1 – R6? They do.
So far the examples that we have considered have been infinite rings, that is, their underlying
sets have been infinite sets. Now let us look at a finite ring, that is, a ring (R, +. .) where R is a
Notes finite set. Our example is the set Z,,. Let us briefly recall the construction of Z,, the set of residue
classes modulo n.
If a and b are integers, we say that a is congruent to b modulo n if a – b is divisible by n; in
symbols, a b (mod n) if n I (a – b). The relation ‘congruence modulo n’ is an equivalence
relation in Z. The equivalence class containing the integer a is
a = { b Z ( a - b is divisible by n }
= { a + m p | m Z }.
It is called the congruence class of a modulo n or the residue class of a modulo n. The set of all
equivalence classes is denoted by Z,,. So
Z , , = {0,1,2,..., n 1}.
a b a b and
a . b ab a, b Zn.
To help you regain some practice in adding and multiplying in Z,,, consider the following
Cayley tables for Zn.
For any a, b c Z n ,
We define ‘+’ and ‘.’ in Z + iZ to be the usual addition and multiplication of complex numbers. Notes
Thus, foram + in and s + it in Z + iZ,
(m + in) + (s + it) = (m + s) + i(n + t), and
(m + in) . (s + it) = (p – nt) + i(mt + ns).
Verify that Z + iZ is a ring under this addition and multiplication. (This ring is called the ring of
Gaussian integers, after the mathematician Carl Friedrich Gauss.)
Solution: Check that (Z + iZ, +) is a subgroup of (C, I–). Thus, the axioms RI-R4 are satisfied. You
can also check that
((a + ib) . (c + id)) . (m + in) = (a + ib) . ((c + id) . (m + in))
Example: Let X be a non-empty set, (XI ) be the collection of all subsets of X and A
denote the symmetric difference operation. Show that ((X), A, n) is a ring.
Solution: For any two subsets A and B of X,
A B = (A\B) (B\A)
It is clear that ( (X), A) is an abelian group. You also know that is associative. Now let us see
if distributes over A.
Let A, B, C E (X). Then
A (B C) = A [(B\C) (C\B)]
= [A (B\C)][A (C\ B)], since n distributes over U.
= [(A B)\(A C)][(A C)\(A B)], since distributes over complementation.
= (A B) A (A C).
So, the left distributive law holds.
Also, (B C) A = A (B C), since is commutative.
= (A B) P (A C)
= (B A) A ( C A).
Therefore, the right distributive law holds also.
Therefore, ( (X), A, ) is a ring.
So’ far you have seen examples of rings in which both the operations defined on the ring have
been commutative. This is not so in the next example.
Notes Show that M2(R) is a ring with respect to addition and multiplication of matrices.
Solution: You can check that (M2(R), +) is an abelian group. You can also verify the associative
property for multiplication. We now show that A . (B + C) = A . B A- A . C for A, B, C in M2(R).
a11 b11 a 12 c 21 a 11 c11 a 12 c 21 a11b12 a12 b22 a11 c12 a12 c22
=
a 21 b 11 a 22 c 21 a 21 c 11 a 22 c 21 a21b12 a22 b22 a 21 c12 a 22 c22
= A.B + A.C
In the same way we can obtain the other distributive law, i.e., (A + B) . C = A. C + B . C A, B,
C M2(R).
Thus, M2(R) is a ring under matrix addition and multiplication.
Note Multiplication over M2(R) is not commutative. So, we can’t say that the left
distributive law implies the right distributive law in this case.
Example: Consider the class of all continuous real valued functions defined on the closed
interval [0, 1]. We denote this by C [0, 1]. If f and g are two continuous functions on [0, 1], we
define f + g and fg as
for every x [0, 1]. From the Calculus course you know that the function f + g and fg are defined
and continuous on [0, 1], i.e., if f and g C[0, 1], then both f + g and f .g are in C [0, 1]. Show that
C [0, 1] is a ring with respect to + and
Solution: Since addition in R is associative and commutative, so is addition in C [0, 1]. The
additive identity of C [0, 1] is the zero function. The additive inverse off C [0, 1] is (–f), where
(–f)(x) = – f(A) x [0, 1]. See figure 14.1 for a visual interpretation of (- f). Thus, (C [0, 1], +) is
an abelian group. Again, since multiplication in R is associative, so is multiplication in C [0, 1].
= (f.g)(x) + (f.h)(x)
Since multiplication is commutative in C [0, 1], the other distributive law also holds. Thus, R6 is
true for C [0, 1]. Therefore, (C [0, 1], +, .) is a ring.
This ring is called the ring of continuous functions on [0, 1].
Show that (End A, +,.) is a ring. (This ring is called the endomorphism ring of A.)
Solution: Let us first check that + and . defined by (1) are binary operations on End A.
For all a, b A,
(f + g) (a + b) = f(a + b) f g(a + b)
= f(g(a) + g(b))
= f(g(a)) + f(g(b))
= (f . g)(a) + (f . g) (b)
Thus, f + g and f . g End A.
Now let us see if (End A, +, .) satisfies Rl-R6.
Since + in the abelian group A is associative and commutative, so is + in End A. The zero
homomorphism on A is the zero element in End A. (– f) is the additive inverse of f E End A. Thus,
(End A, +) is an abelian group.
You also know that the composition of functions is an associative operation in End A.
= f(g(a) + h(a))
= f(g(a)) + f(h(a))
= (f . g) (a) + (f . h) (a)
= (f . g + f . h) (a)
f.(g + h) = f . g + f . h.
Note It is not commutative since fog need not be equal to gof for f, g End A.
Example: Let (A, +,.) and (B, + , ) be two rings. Show that their Cartesian product
A X B is a ring with respect to and * defined by
Solution: We have defined the addition and multiplication in A X B componentwise. The zero
element of A X B is (0, 0). The additive inverse of (a, b) is (–a, – b), where – b denotes the inverse
of b with respect to – .
Since the multiplications in A and B are associative, * is associative in A × B. Again, using the fact
that R6 holds for A and B, we can show that R6 holds for A × B. Thus, (A × B, 0, *) is a ring.
In this section we will prove some simple but important properties of rings which are immediate
consequences of the definition of a ring. As we go along you must not forget that for any ring R,
(R, +) is an abelian group. Hence, the results obtained for groups in the earlier units are applicable
to the abelian group (R, +). In particular,
(i) the zero element, 0, and the additive inverse of any element is unique.
(i) Now, 0 + 0 = 0
a(0 + 0) = a0
a0 + a0 = a0, applying the distributive law.
Now, ab + [– (ab)] = 0 and ab + a(– b) = 0. But you know that the additive inverse of an
element is unique.
In the same manner, using the fact that a + (–a) = 0, we get – (ab) = (– a)b.
Thus, a(– b) = (– a)b = – (ab) for all a, b R.
(iii) For a, b R,
The definition of a ring guarantees that the binary operation multiplication is associative and,
along with +, satisfies the distributive laws. Nothing more is said about the properties of
multiplication. If we place restrictions on this operation we get several types of rings. Let us
introduce you to two of them now.
Definition: We say that a ring (R, +, .) is commutative if . is commutative, i.e., if ab = ba for all a,
b R.
ae = ea = a for all a R.
Can you think of such a ring? Aren’t Z, Q and R examples of a ring with identity? Notes
Definition: We say that a ring (R, +, .) is a commutative ring with unity, if it is a commutative
ring and has the multiplicative identity element 1.
Thus, the rings Z, Q, Rand C are all commutative rings with unity. The integer 1 is the
multiplicative identity in all these rings.
We can also find commutative rings which are not rings with identity. For example, 2Z, the ring
of all even integers is commutative. But it has no multiplicative identity.
Similarly, we can find rings with identity which are not commutative. For example, M2(R) has
1 0
the unit element .
0 1
1 0 0 1
if A = and B , then
2 0 0 2
1 0 0 1 0 1
AB = and
2 0 0 2 0 2
0 1 1 0 2 0
BA = and
0 2 2 0 4 0
Thus, AB BA.
Now, can the trivial ring be a ring with identity? Since 0 . 0 = 0, 0 is also the multiplicative
identity for this ring. So (( 0 ), +, .) is a ring with identity in which the additive and identities
coincide. But, if R is not the trivial ring we have the following result.
Theorem 2: Let R be a ring with identity 1. If R { 0 } then the elements 0 and 1 are distinct.
Now let us go back when will A × B be commutative? A × B is commutative if and only if both
the rings A and B are commutative. Let us see why. For convenience we will denote the operations
in all three rings A, B and A × B by + and . . Let (a, h) and (a’, b’) A × B.
Then (a, b) . (a’, b’) = (a’, b’) . (a, b)
We can similarly show that A × B is with unity iff A and B are with unity. If A and B have
identities e1 and e2 respectively, then the identity of A × B is (e1, e2).
Now we will give an important example of a non-commutative ring with identity. This is the
ring of real quaternions. It was first described by the Irish mathematician William Rowan
Hamilton (1805-1865). It plays an important role in geometry, number theory and the study of
mechanics.
Notes
Example: Let H = ( a + bi + cj + dk | a, b, c, d R ), where i, j, k are symbols that satisfy
i2 = – 1 = j2 = k2, ij = k = – ji, jk = i = – kj ki = j = – ik.
We define addition and multiplication in H by
(a + bi + cj + dk) + (ai + bli + cij + d1k )
= (a + a1) (b + b1)i + (c – c1)j t (d + d1)k, and
(a + bi + cj + dk) (a1 + bli + cj+ d1k) = (aal – bb1 – cc1 – dd1) + (ab1+ ha1 + cd1 – dc1)i + (ac1 – bdl + ca1
+ db1)j – (ad1 + bc1 – cb1 + da1)k
(This multiplication may seem complicated. But it is not so. It is simply performed as for
polynomials, keeping the relationships between i, j and k in mind.)
Show that H is a ring.
I = 1 + 0i + 0j + 0k is the unity in H.
Do you agree that H is not a commutative ring? You will if You remember that ij ji, for
example.
So far, in this unit we have discussed various types of rings. We have seen examples of commutative
and non-commutative rings. Though non-commutative rings are very important for the sake of
simplicity we shall only deal with commutative rings henceforth. Thus, from now on, for us a
ring will always mean a commutative ring. We would like you to remember that both + and .
are commutative in a commutative ring.
Self Assessment
1. For each a in R. There exists X in R such that a + X = :0 = ................ i.e. every elements of R
has an additive inverse.
4. The ................ for addition and multiplication and the generalised distributive law.
(a) law of indices (b) Ring
(a) na + nb (b) an + bn
(c) nab + nba (d) an + bn-1
14.4 Summary
a(– b) = – (ab) = (– a) b,
(– a) (– b) = ab,
a(b – c) = ab – ac,
(b – c)a = ba – ca
a, b, c in a ring R.
The laws of indices for addition and multiplication, and the generalised distributive law.
Commutative rings, rings with unity and commutative rings with unity.
Henceforth, we will always assume that a ring means a commutative ring, unless otherwise
mentioned.
14.5 Keywords
Ring: A non-empty set R together with two binary operations, usually called addition (denoted
by f) and multiplication (denoted by .), is called a ring if the following axioms are satisfied.
1. Write out the Cayley tables for addition and multiplication in Z6* , the set of non-zero
elements of Z6. Is (Z 6* , ,'.) a ring? Why?
2. Show that the set Q 2Q {p 2q|p,q Q} is a ring with respect to addition and
multiplication of real numbers.
a 0
3. Let R = a,b are real numbers . Show that R is a ring under matrix addition and
0 b
multiplication.
a 0
4. Let R = a,b are real numbers . Prove that R is a ring under matrix addition and
b 0
multiplication.
7. Prove that the only ring R in which the two operations are equal (i.e., a + b = ab a,
b R) is the trivial ring.
x x
8. Show that the set of matrices
x x x R is a commutative ring with unity.
9. Let R be a Boolean ring (i.e., a2 = a a R). Show that a = –a a R. Hence show that R
must be commutative.
archives.math.utk.edu
CONTENTS
Objectives
Introduction
15.1 Subrings
15.2 Summary
15.3 Keyword
Objectives
Introduction
In this unit, we will study various concepts in ring theory corresponding to some of those that
we have discussed in group theory. We will start with the notion of a subring, which corresponds
to that of a subgroup, as you may have guessed already.
Then we will take a close look at a special kind of subring, called an ideal. You will see that the
ideals in a ring play the role of normal subgroups in a group. That is, they help us to define a
notion in ring theory corresponding to that of a quotient group, namely, a quotient ring.
After defining quotient rings, we will look at several examples of such rings. But you will only
be able to realise the importance of quotient rings in the future units.
We hope that you will be able to meet the following objectives of this unit, because only then
you will be comfortable in the future units of this course.
15.1 Subrings
In last unit we introduced you to the concept of subgroups of a group. In this unit we will
introduce you to an analogous notion for rings. Remember that for us a ring means a
commutative ring.
In the previous unit you saw that, not only is Z Q, but Z and Q are rings with respect to the
same operations. This shows that Z is n subring of Q, as you will now realise.
Notes Definition: Let (R, +, .) be a ring and S be a subset of R. We say that S is a subring of R, if
(S, +, .) is itself a ring, i.e., S is a ring with respect to the operations on R.
For example, we can say that 2Z, the set of even integers, is a subring of Z.
Before giving more examples, let us analyse the definition of a subring. The definition says that
a subring of a ring R is a ring with respect to the operations on R. Now, the distributive,
commutative and associative laws hold good in R. Therefore, they hold good in any subset of
R also. So, to prove that a subset S of R is a ring we don’t need to check all the 6 axioms R1-R6
for S. It is enough to check that
(i) S is closed under both + and . ,
(ii) 0 S, and
(iii) for each a S, – a S.
(a) x – y S x, y S; and
(b) xy S x, y S.
Proof: We need to show that S is a subring of R according to our definition iff S satisfies (a) and
(b). Now, S is a subring of R iff (S, f ) (R, f ) and S is closed under multiplication, i.e., iff (a)
and (b) hold.
So, we have proved the theorem.
We have already noted that Z is a subring of Q. In fact, you can use Theorem 1 to check that Z is
subring of R, C and Z + iZ too. You can also verify that Q is a subring of R, C and
Q 2Q {a 2| , Q}.
Example: Consider Z6, the ring of integers modulo 6. Show, that 3Z6 = (3.0, 3.1, ....., 3.5) is
a subring of Z6.
Solution: Firstly, do you agree that 3Z6 = (0, 3)? Remember that 6 = 0, 9 = 3, and so on. Also,
0 – 5 = – 3 = 5. Thus, x - y 3Z6 x, y 3Z6. You can also verify that xy 3Z6 x, y 3Z6. Thus,
by Theorem 1, 3Z6 is a subring of Z6.
Notes
Example: Consider the ring (X). Show that S = { , X ) is a subring of (X).
For each proper subset of X we get a subring of (X). Thus, a ring can have, several subrings. Let
us consider two subrings of the ring Z2.
Solution: You can recall the ring structure of Z2. Both S and D are non-empty. Both of them
satisfy (a) and (b) of Theorem 1. Thus, S and D are both subrings of Z2.
We would like to make a remark here which is based on the examples of subrings that you have
seen so far.
Remark: (i) If R is a ring with identity, a subring of R may or may not be with identity. For
example, the ring Z has identity 1, but its subring nZ (n 2) is without identity.
(ii) The identity of a subring, if it exists, may not coincide with the identity of the ring. For
example, the identity of the ring Z × Z is (1, 1). But the identity of its subring Z × {0} is (1, 0).
Using Example we can immediately say that mZ n is a subring of Zn m E Z. This also shows
us a fact that we have already seen : nZ is a subring of Z n Z.
Now let us look at some properties of subrings. From Unit 3 you know that the intersection of
two or more subgroups is a subgroup. The following result says that the same is true for
subrings.
Now, let x, y S1 S2. Then x, y E S1 and x, y S2. Thus, by Theorem 1, x – y and xy are in S1 as
well as in S2, i.e,, they lie in S1 S2.
On the same lines as the proof above we can prove that the intersection of any family of subrings
of a ring R is a subring of R.
Now let us look at the Cartesian product of subrings.
Notes Theorem 3: Let S1 and S2 be subrings of the rings R1 and R2, respectively. Then S1 × S2 is a subring
of R1 × R2.
Proof: Since S1 and S2 are subrings of R1 and R2, S1 and S2 . S1 × S2 .
Now, let (a, b) and (a’, b’) S1 × S2. Then a, a’ E S1 and b, b’ E S2. As S1 and S2 are subrings, a – a’,
a. a’ S1 and b – b’, b b’ S2.
(We are using + and . for both R1 and R2 here, for convenience.) Hence,
(a, b) – (a’, b’) = (a – a’, b – b’) S1 × Sz, and
(a, b) . (a’, b’) = (aa’, bb’) S1 × S2.
Thus, by Theorem 1, S1 × S2 is a subring of R1 × R2.
Self Assessment
15.2 Summary
Let (R, +, .) be a ring and S be a subset of R. We say that S is a subring of R, if (S, +, .) is itself
a ring, i.e., S is a ring with respect to the operations on R.
For example, we can say that 2Z, the set of even integers, is a subring of Z.
Before giving more examples, let us analyse the definition of a subring. The definition
says that a subring of a ring R is a ring with respect to the operations on R. Now, the
distributive, commutative and associative laws hold good in R. Therefore, they hold good
in any subset of R also. So, to prove that a subset S of R is a ring we don’t need to check all
the 6 axioms R1-R6 for S. It is enough to check that
15.3 Keyword
Subring: Let (R, +, .) be a ring and S be a subset of R. We say that S is a subring of R, if (S, +, .) is
itself a ring, i.e., S is a ring with respect to the operations on R.
a 0 a 0
1. Show that S = a, b Z . is a subring of R = a, b R . Does S have a unit
0
b 0
b
element?
If yes, then is the unit element the same as that of R?
2. For any ring R, show that {0} and R are its subrings.
5.
Show that a, 3 and 0, 2, 4 are proper ideal of Z6.
archives.math.utk.edu
CONTENTS
Objectives
Introduction
16.3 Keywords
Objectives
Introduction
In earlier unit, you have studied normal subgroups and the role that they play in group theory.
You saw that the most important reason for the existence of normal subgroups is that they allow
us to define quotient groups. In ring theory, we would like to define a similar concept, a
quotient ring. In this unit, we will discuss a class of subrings. These subrings are called ideals.
While exploring algebraic number theory, the 19th century mathematicians Dedekind, Kronecker
and others developed this concept. Let us see how we can use it to define a quotient ring.
Consider a ring (R, + , .) and a subring I of R. As (R, +) is an abelian group, the subgroup, I is
normal in (R, +), and hence the set R/I = ( a + 1 | a R }, of all cosets of I in R, is group under the
binary operation + given by
(a + I) + (b + I) = (a + b) + I ..... (1)
for all a + I, b + I R/I. We wish to define. on R/I so as to make R/I a ring. You may think that
the most natural way to do so is to define
(a + I) . (b + I) = a b + I a + 1, b + I R ..... (2)
But, is this well defined? Not always. For instance, consider the subring Z of R and the set of
cosets of Z in R. Now, since 1 = 1 – 0 Z , 1 + Z = 0 + Z.
But this is a contradiction. Thus, our definition of multiplication is not valid for the set R/Z.
But, it is valid for R/I if we add some conditions on I: What should these conditions be? To Notes
answer this, assume that the multiplication in (2) is well defined.
Then, (r + I). (0 + I) = r . 0 + I = 0 + I = I for all r R.
Now, you know that if x I, then x + I = 0 + I = I.
As we have assumed that is well defined, we get
(r + I) (x + I) = (r + I) . (0 + I) = 0 + I whenever r R, x I.
i.e., rx + I = I whenever r R, x I
Thus, rx I, whenever r R, x I.
So, if ‘ . ’ is well defined we see that the subring I must satisfy the additional condition that
rx I whenever r R and x I.
We will prove that this extra condition on I is enough to make the operation a well defined one
and (R/I, +, .) a ring. In this unit we will consider the subrings I of R on which we impose the
condition rx I whenever r R and x I.
Over here we would like to remark that we are always assuming that our rings are commutative.
In the case of non-commutative rings the definition of an ideal is partially modified as follows.
(i) a – b I a, b I, and
(ii) ra I and ar I a I, r R.
Now let us go back to commutative rings. From the definition we see that a subring I of a ring
R is an ideal of R iff ra I r R a and a I.
Let us consider some examples. You saw that for any ring R, the set (0) is a subring. In fact, it is
an ideal of R called the trivial ideal of R. Other ideals, if they exist, are known as non-trivial
ideals of R.
You can also verify that every ring is an ideal of itself. If an ideal I of a ring R is such that I R,
then I is called a proper ideal of R.
For example, if n 0,1, then the subring nZ = { nm | m Z ) is a proper non-trivial ideal of Z. This
is because for any z Z and nm nZ, z(nm) = n(zm) nZ.
Example: Let X be an infinite set. Consider I, the class of all finite subsets of X. Show that
I is an ideal of (X).
Solution: I = { A | A is a finite subset of X }. Note that
(i) I, i.e., the zero element of (X) is in I,
(ii) A – B = A + (–B) = A + B, as B = –B in (X) = A B.
Thus, if A, B I, then A - B is again a finite subset of X, and hence A – B I.
(iii) AB = A B. Now, whenever A is a finite subset of X and B is any element of (X), AB
is a finite subset of X. Thus, A I and B P (X) AB I .
Hence, I is an ideal of (X).
Notes
Example: Let X be a set and Y be a non-empty subset of X. Show that
I = { A (x) | AY = } is an ideal of (X).
In particular, if we take Y = {x0}; where X0 is a fixed element of X, then
Next, iff M and g C [0, 1] then (fg) (1/2) = f(1/2) g (1/2) = 0 g(1/2) = 0, so fg M.
Example: For any ring R and al, a2 R, show that Ra1 + Ra2 = { x1a1 + x2a2 | x1, x2 R )
is an ideal of R.
Solution: Firstly, 0 = 0a1 t 0a2. 0 Ra1 + Ra2.
This method of obtaining ideals can be extended to give ideals of the form { x1a1 + x2a2 + ... + xnan
| xi R } for fixed elements a1..,... , a, of R. Such ideals crop up again and again in ring theory. We
give them a special name.
Definition: Let a1, ....., a, be given elements of a ring R. Then the ideal generated by a1, ....., a,, is
Ra1 + Ra2 + ... + Ran = (x1al + x2a2 + ... + xnan | x, E R ). a1, ....., a,, are called the generators of this ideal.
When n = 1, the ideal we get is called a principal ideal. Thus, if a R, then Ra = < a > is a principal Notes
ideal of R. In the next unit you will be using principal ideals quite a lot.
Definition: An element a of a ring R is called nilpotent if there exists a positive integer n such
that a” = 0.
2 2
For example, 3 and 6 are nilpotent elements of Z9, since 3 9 0 and 6 36 0. Also, in any
ring R, 0 is a nilpotent element.
Example: Let R be a ring. Show that the set of nilpotent elements of R is an ideal of R.
This ideal is called the nil radical of R.
m n
Now, (a b)m n mn
C r a r ( b)m n r
r0
So, N is an ideal of R.
Let us see what the nil radicals of some familiar rings are. For the rings Z, Q, R or C, N = {0}, since
the power of any non-zero element of these rings is non-zero.
Proof: We know that I R. We want to prove that R I. Let r E R. Since 1 E I and I is an ideal of
R, r = r . l I. So, R I. Hence I = R.
Using this result we can immediately say that Z is not an ideal of Q. Does this also tell us whether
Q is an ideal of R or not’? Certainly Since 1 Q and Q R, Q can’t be an ideal of R.
Now let us shift our attention to the algebra of ideals. In the previous section we proved that the
intersection of subrings is a subring. We will now show that the intersection of ideals is an ideal.
We will also show that the sum of ideals is an ideal and a suitably defined product of ideals is an
ideal.
(a) IJ,
(b) I + J = { a + b | a I and b J }, and
(c) IJ = { x R | x is a finite sum a1b1 + ... + ambm, where ai 1 and bi J } are ideals of R.
Proof: (a) From Theorem 2 you know that I J is a subring of R. Now, if a l J, then a I and
a J. Therefore, ax I and a J for all x in R. So ax I J for all a I J and x R. Thus, I J
is an ideal of R.
(b) Firstly, 0 = 0 + 0 l + J I + J = f.
Thus, I + J is an ideal of R.
y = a’1b’1 + ... + a’nb’n for some al, ..., a, a’1,.. ., a’, I and bi,.., bm, b’1,...., , b’n J.
So, x – y IJ.
Finally, let x IJ say x = a1b1 + ... + anbn with a, 1 and b, E J. Then, for any r E R
Thus, IJ is an ideal of R.
Over here, we would like to remark that if we define IJ = { ab | a E I, b J }, then IJ need not even
be a subring, leave alone being an ideal. This is because if x, y E IJ, then with this definition of IJ
it is not necessary that x – y IJ.
Let us now look at the relationship between the ideals obtained. Let us first look at the following
particular situation:
In fact, these inclusions are true for any I and J. We show the relationship in figure 16.1.
You have studied quotient groups. You know that given a normal subgroup N of a group G, the
set of all cosets of N is a group and is called the quotient group associated with the normal
subgroup N. Using ideals, we will now define a similar concept for rings. At the beginning we
said that if (R, +, .) is a ring and I is a subring of R such that
(R/I, +, .) is a ring, where + and . are defined by
(X + I) + (y + I) = (x + y) + I and
(x+I).(y+I)=xy+I x+I,y+IR/I,
then the subring I should satisfy the extra condition that rx I whenever r R and x I, i.e.,
I should be an ideal. We now show that if I satisfies this extra condition then the operations that
we have defined on R/I are well defined.
From group theory we know that (R/I, +) is an abelian group. So we only need to check that is
well defined, i.e., if
a + I = a’ + I, b + I = b’ + I, then ab + I = a’b’ + I.
Now, since a + I = a’ + I, a – a’ I.
( x + I ) . ( y + I ) = x y + I x,y R.
Proof: As we have noted earlier, (R/I, +) is an abelian group. So, to prove that R/I is a ring we
only need to check that . is commutative, associative and distributive over +.
Now,
(i) . is commutative : (a + I). (b + I) = ab + I = ba + I = (b + I), (a 4- I) for all a + I,b + I R/I.
(ii) . is associative : ‘ ’ a, b, c R
((a + I). (b + I)). (c + I) = (ab + I). (C + I)
= (ab)c + I
= a(bc) + I
= (a + I) . ((b + I) . (c + I))
= a(b + c) + I
= (ab + ac) + I
= (a + I). (b + I) + (a + I).(c -1 I)
Thus, R/I is a ring.
Let us look at some examples. We start with the example that ‘gave rise to the terminology
‘R mod I’.
Example: Let R = Z8. Show that I = {0, 4} is an ideal of R. Construct the Cayley tables for
+ and, in R/I.
Notes
Solution: I = 4 R, and hence is an ideal of R. From group theory you know that the number 8 of
o(R) 8
elements in R/I = o(R/I) = 4.
o(I) 2
Self Assessment
(a) (b)
(c) (d)
(a) (b)
(c) (d)
5. A normal subgroup N of a group G, the set of all cosets of N is a group and is called
................. associated with the normal subgroup N.
16.2 Summary
Notes Over here we would like to remark that we are always assuming that our rings are
commutative. In the case of non-commutative rings the definition of an ideal is partially
modified as follows.
(i) a – b I a, b I, and
(ii) ra I and ar I a I, r R.
Now let us go back to commutative rings. From the definition we see that a subring I of a
ring R is an ideal of R iff ra I r R a and a I.
You can also verify that every ring is an ideal of itself. If an ideal I of a ring R is such that
I R, then I is called a proper ideal of R.
For example, if n 0,1, then the subring nZ = { nm | m Z ) is a proper non-trivial ideal
of Z. This is because for any z Z and nrn nZ, z(nm) = n(zm) nZ.
An element a of a ring R is called nilpotent if there exists a positive integer n such that
a” = 0.
2 2
For example, 3 and 6 are nilpotent elements of Z9, since 3 9 0 and 6 36 0. Also,
in any ring R, 0 is a nilpotent element.
16.3 Keywords
Quotient Group: A normal subgroup N of a group G, the set of all cosets of N is a group and is
called the quotient group associated with the normal subgroup N.
1. Let S be a subring of a ring R. Can we always define a ring homomorphism whose domain
is R and kernel is S? Why?
2. Prove Theorem 8.
(a) if g o f is 1 – 1, then so is f.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
17.1 Homomorphisms
17.2 Properties of Homomorphisms
17.4 Summary
17.5 Keyword
Objectives
Introduction
You have studied about the functions between groups that preserve the binary operation. You
also saw how useful they were for studying the structure of a group. In this unit, we will discuss
functions between rings which preserve the two binary operations. Such functions are called
ring homomorphisms. You will see how homomorphisms allow us to investigate the algebraic
nature of a ring.
Finally, we will show you the interrelationship between ring homomorphism, ideals and
quotient rings.
17.1 Homomorphisms
Definition: Let (R1, +, . ) and (R1,+ . . ) be two rings and f : R1 – R2 be a map. We say that f is a ring Notes
homomorphisms if
f(a + b) = f(a) 4 – f(b), and
f(a . b) = f(a) . f(b) for all a, b in R1.
Note that the + and . occurring on the left hand sides of the equations in the definition above are
defined on R1, while the + and . occurring on the right hand sides are defined on R2.
So, we can say that f : R1 – R2 is a homomorphism if
(i) the image of a sum is the sum of the images, and
(ii) the image of a product is the product of the images.
Thus, the ring homomorphism f is also a group homomorphisms from (R1,+ ) into (R2, +).
Just as we did in Unit 6, before giving some examples of homomorphisms let us define the
kernel and image of a homomorphism. As is to be expected, these definitions are analogous to
the corresponding ones in Unit 6.
Definition: Let R1 and R2 be two rings and f : R1 – R2 be a ring homomorphism. Then we define
(i) the image of f to be the set lm f = {f(x) | x R1},
Example: Let R be a ring. Show that the identity map IK is a ring homomorphism. What are
Ker IR and Im IR?
Ker IR = { x R | IR(x) = 0 }
=.{x R | x = 0)
= {0}
Im IR = {(IR(x) [ x R ]}
={x | x R ]
= R.
Example: Let s N. Show that the map f : Z – Z, given by f(m) = m for all m Z is a
homomorphism. Obtain Ker f and Im f also.
= {m Z | m= 01
= (m Z | m 0 (mod s))
= sz.
Im f = (f(m) | m Z)
= ( m | m Z)
= Zs ,
showing that f is an epimorphism.
Example: Consider the map f : Z6 – Z3 : f(n (mod 6)) = n(mod 3). Show that f is a ring
homomorphism. What is Ker f?
= n (mod 3) + m(mod 3)
Before discussing any more examples, we would like to make a remark about terminology. In
future we will use the term ‘homomorphism’ for ‘ring homomorphism’. You may remember
that we also did this in the case of group homomorphisms.
Example: Consider the ring C[0, 1] of all real valued continuous functions defined on the
closed interval [0, 1].
1 1
f(fg) = (fg) (1/2) = f g (f)(g).
2 2
Thus, is a homomorphism.
a 0
Example: Consider the ring R = a, b R under matrix addition and multiplication.
0 b
n 0
Show that the map I : Z 13 : f(n) = is a homomorphism.
0 n
Solution: Note that f(n) = nI, where I is the identity matrix of order 2. Now you can check that f(n
+ m) = f(n) + f(m) and f(nm) = f(n) f(m) n, m Z. Thus, f is a homomorphism.
= ((A\B) (B\AY))
= ((A\B)Y) ((B\A)Y)
= ((AY)\(BY))((BY)\(AY))
= (f(A)\ f(B)(f(B) \f(A))
f(AB) = (AB)Y
= (AB)(YY)
I = f(A)f(B).
So, f is a ring homomorphism from (X) into (Y).
Therefore, Im f = (Y).
Notes
Tasks 1. Let A and B be two rings. Show that the projection map P : A × B A : p(x, y)
= x is a homomorphism. What are Ker p and Im p?
2. Is f : Z + 2Z Z + 2Z : f(a + 2b ) = a – 2b a homomorphism?
Having discussed many examples, let us obtain some basic results about ring homomorphisms.
Let us start by listing some properties that show how a homomorphism preserves the structure
of its domain. The following result is only a restatement of Theorem 1 of Unit 6.
Proof: Since f is a group homomorphism from (R1, + ) to (R2, + ), we can apply Theorem 1 of
Unit 6 to get the result.
Firstly, since T , f-1 (T) . Next, let a, b f -1(T). Then f(a), f(b) T
f(a) – f(b) T and f(a) f(b) T
f-1(T) is a subring.
Now, it is natural to expect an analogue of Theorem 2 for ideals. But consider the inclusion i : Z
– R : i(x) = x. You know that 22 is an ideal of Z. But is i(2Z) (i.e., 22) an ideal of R? No. For example,
1 1 1
2 22, R, but 2. 2Z. Thus, the homomorphic image of an ideal need not be an ideal.
4 4 2
But, all is not lost. We have the following result.
(a) Iff is surjective and I is an ideal of R1, then f (I) is an ideal of R,.
Secondly, take any f(x) f(1) and r R2. Since f is surjective, s R1 such that f(s) = r. Notes
Then
rf(x) = f(s) f(x) = f(sx) f(I), since sx I.
Thus, f(1) is an ideal of R2.
Now, consider an epimorphism f : R S and an ideal I in R. By Theorem 3 you know that f(1) is
an ideal of S and f-1(f(I)) is an ideal of R. How are I and f-1(f(I)) related? Clearly, I f-1(f(1)).
Can f-1(f(T)) contain elements of R\I? Remember that Ker f f-1(f(1)) also. Thus,
I + Ker f f-1(f(1)). In fact, I + Ker f = f-1(f(1)). Let us see why.
Let x f-1(f(l)). Then f(x) f(1). Therefore, f(x) = f(y) for sdme y I. Then
f(x – y) = 0.
(b) the mapping 1 – f(I) defines a one-to-one correspondence between the set of ideals of R
containing Ker f and the set of ideals of S.
Proof: We have proved (a) in the discussion above. Let us prove (b) now.
Let A be the set of ideals of R containing Ker f, and B be the set of ideals of S.
Define : A B : 4(I) = f(I).
f-1(f(I1)) = f-1(f(I2))
I1 = I2, by (a).
Thus, is bijective.
And now let us look closely at the sets Ker f and Im f, where f is a ring homomorphism. In Unit
6 we proved that iff : G1 – G2 is a group homomorphism then Ker f is a normal subgroup of G1
and Im f is a subgroup of G2. We have an analogous result for ring homomorphisms, which you
may have already realised from the examples you have studied so far.
Notes Proof: (a) Since (0) is an ideal of R2, by Theorem 3(b) we know that f-1({o}) is an ideal of R1. But
f-1({o}) = Ker f.
(b) Since R1 is a subring of R1, f(R1) is a subring of R2, by Theorem 2(a). Thus, Im f is a subring of
R2 .
This result is very useful for showing that certain sets are ideals. For example, from Theorem 5
you can immediately say that {0,3} is an ideal of Z6. As we go along you will see more examples
of this use of Theorem 5.
Let us look a little more closely at the kernel of a homomorphism. In fact, let us prove a result
analogous to Theorem 4 of Unit 6.
Proof: f is injective iff f is an injective group homomorphism from (R1, +) into (R2, + ). This is true
iff Ker f = {0}, by Theorem 4 of Unit 6. So, our result is proved.
So far we have seen that given a ring homomorphism f : R — S, we can obtain an ideal of R,
namely, Ker f. Now, given an ideal I of a ring R can we define a homomorphism f so that
Ker f = I?
The following theorem answers this question. Before going to the theorem recall the definition
of quotient rings.
Theorem 7: If I is an ideal of a ring R, then there exists a ring homomorphism f : R R/I whose
kernel is I.
Proof: Let us define f : R R/I by f(a) = a + I for all a R. Let us see iff is a homomorphism. For
this take any a, b R. Then
Thus, f is a homomorphism.
= {a R | a I} = I .
We call the homomorphism defined in the proof above the canonical (or natural) homomorphism
from R onto R/I.
Now let us look at the behaviour of the composition of homomorphisms. We are sure you find
the following result quite unsurprising.
Theorem 8: Let R1, R2 and R3 be rings and f : R1 — R2, and g : R2 R3 be ring homomorphisms.
Then their composition gof : R1 R3 given by (gof (x) = g(f(x)) for all x R1 is a ring
homomorphism.
The proof of this result is on the same lines as the proof of the corresponding result in
Unit 6.
We discussed group isomorphisms and various results involving them. In this section we will
do the same thing for rings. So, let us start by defining a ring isomorphism.
Definition: Let R1 and R2 be two rings. A function f : R1 R2 is called a ring isomorphism (or
simply an isomorphism) if
(i) f is a ring homomorphism,
(ii) f is 1 – 1, and
(iii) f is onto.
Thus, a homomorphism that is bijective is an isomorphism.
Remark: Two rings are isomorphic if and only if they are algebraically identical. That is,
isomorphic rings must have exactly the same algebraic properties. Thus, if R1 is a ring with
identity then it cannot be isomorphic to a ring without identity. Similarly, if the only ideals of
R1 are {0} and itself, then any ring isomorphic to R1 must have this property too.
And now, let us go back to Unit 6 for a moment. Over there we proved the Fundamental Theorem
of Homomorphism for groups, according to which the homomorphic image of a group G is
isomorphic to a quotient group of G, Now we will prove a similar result for rings, namely, the first
isomorphism theorem or the Fundamental Theorem of Homomorphism for rings.
Theorem 9 (The Fundamental Theorem of Homomorphism): Let f : R S be a ring
homomorphism. Then R/Ker f Im f. In particular, iff is surjective, then R/Ker f S.
Proof: Firstly, note that K/Ker f is a well defined quotient ring since Ker f is an ideal of R. For
convenience, let us put Ker f = I. Let us define
= (x + 1)(y + 1)
Thus, is a ring homomorphism.
Therefore, Z/mZ Z,
(Note that we have often used the fact that Z/mZ and Zm are the same.)
Let us now apply Theorem 9 to prove that any ring homomorphism from a ring R onto Z is
uniquely determined by its kernel. That is, we can’t have two different ring homomorphisms
from R onto Z with the same kernel. (Note that this is not true for group homomorphisms. In
fact, you know that Iz and – Iz are distinct homomorphisms from Z onto itself with the same
kernel, {0}. To prove this statement we need the following result.
Theorem 10: The only non-trivial ring homomorphism from Z into itself is Iz.
Corollary: Let R be a ring isomorphic. to Z. If f and g are two isomorphisms from R onto Z, then
f = g.
Proof: The composition f.g-’ is an isomorphism from Z. onto itself. Therefore, by Theorem 10,
fog-1 = Iz, i.e., f = g.
Since Ker f = Ker g, r and g are isomorphisms of the same ring onto Z. Thus, by the corollary
above, r = g.
Also, the canonical maps r : R R/Ker f and g : R R/Ker g are the same since Ker f = Ker g.
f = r o f = g o g = g.
Let us halt our discussion of homomorphisms here and briefly recall what we have done in this
unit. Of course, we have not finished with these functions. We will be going back to them again
and again in the future units.
Self Assessment
1. If R1 + R2 be two rings and f : R1 R2 be a ring ................. then we define imf = {f(x) | x R1}.
Notes 5. The only ................. ring homomorphism from Z into itself is Z2.
(a) trivial (b) non-trivial
(c) direct (d) indirect
17.4 Summary
1. The definition of a ring homomorphism, its kernel and its image, along with several
examples.
2. The direct or inverse image of a subring under a homomorphism is a subring.
3. Iff : R - S is a ring homomorphism, then
(i) Im f is a subring of S,
(ii) Ker f is an ideal of R,
(iii) f-1(1) is an ideal of R for every ideal I of S.
(iv) iff is surjective, then f(I) is an ideal of S.
4. A homomorphism is injective iff its kernel is {0}.
5. The composition of homomorphisms is a homomorphism.
6. The definition and examples of a ring isomorphism.
7. The proof and applications of the Fundamental Theorem of Homomorphism which says
that iff : R S is a ring homomorphism, then R/Ker f Im f.
17.5 Keyword
Notes
CONTENTS
Objectives
Introduction
18.3 Summary
18.4 Keywords
Objectives
Introduction
In the earlier units, we have introduced you to rings, and then to special rings whose speciality
lay in the properties of their multiplication. In this unit, we will introduce you to yet another
type of ring, namely, an integral domain. You will see that an integral domain is a ring with
identity in which the product of two non-zero elements is again a non-zero element. We will
discuss various properties of such rings.
Next, we will look at rings like Q, R, C, and Z,, (where p is a prime number). In these rings, the
non-zero elements form an abelian group under multiplication. Such rings are called fields.
These structures are very useful, one reason being that we can “divide” in them.
Related to integral domains and fields are certain special ideals called prime ideals and maximal
ideals. In this unit, we will also discuss them and their corresponding quotient rings.
As you will soon realise, this shows that 2 (and 3) is a zero divisor, i.e., 0 is divisible by 2
(and 3 ).
So, let us see what a zero divisor is.
Definition: A non-zero element a in a ring R is called a zero divisor in R if there exists: a Notes
non-zero element b in R such that ab = 0.
Now do you agree that 2 is a zero divisor in Z,? What about 3 in Z4? Since 3 x 0 for every
Now let us look at an example of a zero divisor in C[0, l]. Consider the function
f C[0, 1] given by
1
x 2 , 0 x 1/2
f(x)
0,1/2 x 1
0, 0 x 1/2
g(x)
x 1/2,1/2 x 1
Then g C[0, 1], g 0 and (fg) (x) = 0 x [0,1]. Thus, fg is the zero function. Hence, f is a zero
divisor in C[0, 1].
For another example, consider the Cartesian product of two non-trivial rings A and B. For every
a 0 in A, (a, 0) is a zero divisor in A × B. This is because, for any b 0 in B. (a . 0) (0.b) = (0.0).
Now let us look at the ring (X), where X is a set with at least two elements, Each non-empty
proper subset A of X is a zero divisor because A.XC = AAC = , the zero element of (X).
Thus, an integral domain is a non-zero ring wilh identity in which the product of two non-zero
elements is a non-zero element.
This kind of ring gets its name from the set of integers, one of its best known examples. Other
examples of domains that immediately come to mind are Q, R and C. What about C[0,1]? You
have already seen that it has zero divisors. Thus C[0,l] is not a domain.
Note Several authors often shorten the term ‘integral domain’ to ‘domain’. We will
do so too.
Notes
What we have shown is that if a 0 and b 6, then ab 6. Thus, Zp is without zero divisors, and
hence, is domain.
Conversely, we will show that if p is not a prime, then Zp is not a domain, So, suppose p is not
a prime. If p = 1, then Z , is the trivial ring, which is not a domain.
If p is composite number and m | p, you know that m Zp is a zero divisor. Thus, Zp has zero
divisors. Hence, it is not a domain.
Now consider a ring R. We know that the cancellation law for addition holds in R, i.e., whenever
acb = acc in R, then b = c. But, does ab = ac imply b = c? It need not. For example, 0.1 = 0.2 in Z but
1 # 2. So, if a = 0, ab = ac need not imply b = c. But, if a # 0 and ab = ac, is it true that b = c’? We will
prove that this is true for integral domains.
Theorem 2: A ring R has no zero divisors if and only if the cancellation law for multiplication
holds in R (i.e., if a, b, c R such that a 0 and ab = ac, then b = c.)
Proof: Let us first assume that R contains no zero divisors. Assume that a, b, c R such that
a 0 and ab = ac. Then a(b – c) = ab – ac = 0. As a 0, and R has no zero divisors, we get b – c = 0,
i.e., b = c.
Now let us introduce a number associated with an integral domain in fact, with any ring.
For this let us look at Z4 first. We know that 4x 0 x Z 4 . In fact, 8x = 0 and 12 x = 0 also for
any x Z4.
But 4 is the least element of the set { n N | nx = 0 x Z4 ). This shows that 4 is the
characteristic of Z4, as you will see now.
Definition: Let R be a ring. The least positive integer n such that nx = 0 x R is called the
characteristic of R. If there is no positive integer n such that nx = 0 x R, then we say that the
characteristic of R is zero.
Now let us look at a nice result for integral domains. It helps in considerably reducing our
labour when we want to obtain the characteristic of a domain.
Theorem 3: Let m be a positive integer and R be an integral domain. Then the following Notes
conditions are equivalent.
(a) m1 = 0.
(b) ma = 0 for all a R.
(c) ma = 0 for some a 0 in R.
(c) (a) : Let ma = 0 for some a 0 in R. Then 0 = ma = m (la) = (ml) a. As a 0 and R is without
zero divisors, we get m1 = 0.
What Theorem 3 tells us is that to find the characteristic of a domain we only need to look at the
set in {n.1 | n N}.
Let us look at some examples.
(iii) You have already seen that chat Z, = n. Thus, for any positive integer n, there exists a ring
with characteristic n.
Now let us look at a peculiarity of the characteristic of a domain.
We will now see what algebraic structure we get after we impose certain restrictions on the
multiplication of a domain.
18.2 Field
Let (R, +, .) be a ring. We know that (R, +) is an abelian group. We also know that the operation
is commutative and associative. But (R,.) is not an abelian group. Actually, even if R has identity,
(R,.) will never be a group since there is no element a R such that a.0 = 1. But can (R\{0}) be a
group? It can, in some cases. For example, from Unit 2 you know that Q* and R* are groups with
respect to multiplication. This allows us to say that Q and R are fields, a term we will now define.
Notes Thus, for a, system (R,+,.) to be a field it must satisfy the ring axioms R1 to R6 as well as the
following axioms.
(i) is commutative,
(ii) R has identity (which we denote by 1) and 1 0, and
(iii) every non-zero element x in R has a multiplicative inverse, which we denote by x–1.
Just as a matter of information we would like to tell you that a ring that satisfies only (ii) and (iii)
above, is called a division ring or a skew field or a non-commutative field. Such rings are very
important in the study of algebra, but we will not be discussing them in this course.
Let us go back to fields now. The notion of a field evolved daring the 19th century through the
research of the German mathematicians Richard Dedekind and Leopold Kronecker in algebraic
number theory. Dedekind used the German word Körper, which means field, for this concept.
This is why you will often find that a field is denoted by K.
As you may have realised, two of the best known examples of fields are R and C. These were the
fields that Dedekind considered. Yet another example of a field is the following ring.
Solution: From Unit 14 you know that F Q 2Q is a commutative ring with identity 1 + 2.
1 1 1 2b a fib
a 2b =
a 2b (a 2b)(a 2b) a 2 2b 2
1 ( b)
= 2 2
2 2 F
a 2b a 2b 2
Can you think of an example of a ring that is not a field? Does every non-zero integer have a
multiplicative inverse in Z? No. Thus, Z is not a field.
By now you have seen several examples of fields. Have you observed that all of them happen to
be integral domains also? This is not a coincidence. In fact, we have the following result.
Proof: Let F be a field. Then F {0} and 1 B. We need to see if F has zero divisors. So let a and
b be elements of F such that ab = 0 and a 0. As a 0 and P is a field, a-1 exists.
Hence, b = I . b = (a – la) b = ad-1 (ab) = a-1 0. Hence, if a 0 and ab = 0, we get b = 0, i.e., F has no
zero divisors. Thus, F is a domain.
Theorem 5 may immediately prompt you to ask if every domain is a field. You have already
seen that Z is a domain but not a field. But if we restrict ourselves to finite domains, we find that
they are fields.
Proof: Let R = {a, = 0, a1 = 1, a2,....., a,] be a finite domain. Then R is commutative also. To show
that R is a field we must show that every non-zero element of R has a multiplicative inverse.
So, let a = aj be a non-zero element of R (i.e., i 0). Consider the elements aa1, ..., aan. For every
j 0, aj 0; and since a 0, we get aaj 0.
Hence, the set { aa1, ..., aan } G (a,, ..., a,}.
Also, aa, , aa ,..., aa, are all distinct elements of the set {a,, ...., a,}, since aaj = nak aj = aj, using the
cancellation law for multiplication.
Thus, {aa1, ...., aan} = [a; ,...., an}.
In particular, a, = aaj, i.e., 1 = aaj for some j. Thus, a is invertible in R. Hence every non-zero
element of R has a multiplicative inverse. Thus, M is a field.
Using this result we can now prove a theorem which generates several examples of finite fields.
Theorem 7: Zn is a field if and only if n is a prime number.
Proof: From theorem 1 you know that Zn is a domain if and only if n is a prime number. You also
know that Zn has only n elements. Now we can apply Theorem 6 to obtain the result.
Theorem 7 unleashes a load of examples of fields : Z2, Z3, Z5, Z7,, and so on. Looking at these
examples, and other examples of fields, can you say anything about the characteristic of a field?
In fact, using Theorems 4 and 5 we can say that.
So far the examples of finite fields that you have seen have consisted of p elements, for some
prime p. In the following exercise we give you an example of a finite field for which this is not
so.
Theorem 9: Let R be a ring with identity. Then R is a field if and only if R and {0} are the only
ideals of R.
Proof: Let us first assume that R is a field. Let I be an ideal of R. If I {0), there exists, a non-zero
element x I. As x 0 and R is a field, xy = 1 for some y R. Since x I and I is an ideal, xy I,
i.e., 1 I.
Conversely, assume that R and { 0 } are the only ideals of R. Now, let a 0 be an element of R.
Then you know that the set Ra = [ra | r R] is a non-zero ideal of R. Therefore, Ra = R.
Now, 1 R = Ra. Therefore, 1 = ba for some b R, i.e., a-1 exists. Thus, every non-zero element
of R has a multiplicative inverse. Therefore, R is a field.
Using Theorem 9, we can obtain some interesting facts about field homomorphisms (i.e., ring
homomorphisms from one field to another). We give them to you in the form of an exercise.
Now that we have discussed domains and fields, let us look at certain ideals of a ring, with
respect to which the quotient rings are domains or fields.
Self Assessment
(a) 1 (b) 2
(c) 0 (d) 3
4. A ring (R, +,.) is called a .................... if (R | { 0 }.) is an abelian group.
(a) field (b) domain
(c) range (d) ideal
18.3 Summary
18.4 Keywords
Zero Divisor: A non-zero element a in a ring R is called a zero divisor in R if there exists: a non-
zero element b in R such that ab = 0.
1. Let n N and m | n | < m < n. Then show that m is a zero divisor in Zn.
4. Let R be a ring and a R be a zero divisor. Then show that every element of the principal
ideal Ra is a zero divisor.
5. In a domain, show that the only solutions of the equation x2 = x are x = 0 and x = 1.
archives.math.utk.edu
CONTENTS
Objectives
Introduction
19.3 Summary
19.4 Keywords
Objectives
Introduction
Finally, we shall see how to construct the smallest field that contains a given integral domain.
This is essentially the way that Q is constructed from Z. We call such a field the field of quotients
of the corresponding integral domain.
In this unit, we have tried to introduce you to a lot of new concepts. You may need some time to
grasp them. Take as much time as you need. But by the time you finish it, make sure that you
have attained the knowledge of following topics.
In ‘Z’ we know that if p is a prime number and p divides the product of two integers a and b, then
either p divides a or p divides b. In other words, if ab pZ, then either a pZ or b pZ. Because
of this property we say that pZ is a prime ideal, a term we will define now.
You can see that {0} is a prime ideal of Z because ab {0} a {0} or b {0}, where a,b Z.
Notes
Conversely, assume that Rp is a prime ideal. Then Rp R, Thus, 1 Rp, and hence, p does not
have a multiplicative inverse. Now suppose p divides ab, where a, b R. Then ab = rp far some
r R, i.e., ab Rp.
As Rp is a prime ideal, either a Rp or b Rp. Hence, either p | a or p | b. Thus, p is a prime
element in R.
Theorem 2 is very useful for checking whether an element is a prime element or not, or for
finding out when a principal ideal is a prime ideal.
Prime ideals have several useful properties.
Now consider the ideal 22 in Z. Suppose the ideal nZ in Z is such that 2Z nZ Z. Then n | 2.
n= 1or n = 2. nZ = Z or nZ = 2Z.
This shows that no ideal can lie between 2Z and Z. That is, 22 is maximal among the proper
ideals of Z that contain it. So we say that it is a “maximal ideal”. Let us define this expression.
Definition: A proper ideal M of a ring R is called a maximal ideal if whenever I is an ideal of R
such that M I R, then either I = M or I = R.
Thus, a proper ideal M is a maximal ideal if there is no proper ideal of R which contains it. An
example that comes to mind immediately is the zero ideal in any field F. This is maximal
because you know that the only other ideal of F is F itself.
To generate more examples of maximal ideals, we can use the following characterisation of such
ideals.
Theorem 3: Let R be a ring with identity. An ideal M in R is maximal if and only if R/M is a field.
Proof: Let us first assume that M is a maximal ideal of R. We want to prove that R/M is a field.
For this, it is enough to prove that R/M has no non-zero proper ideals. So, let I be an ideal of
R/M. Consider the canonical homomorphism : R R/M : (r) = r + M. Then, you know that
-1 (I) is an ideal of R containing M, the kernel of . Since M is a maximal ideal of R. 1(I) = M or
-1(I) = R. Therefore, I = (-1 (I)) is either (M) or (R). That is, I = {0} or I = R/M, where; = O +M
= M. Thus, R/M is a field.
Conversely, let M be an ideal of R such that R/M is a field. Then the only ideals of R/M are
{0} and R/M. Let I be an ideal of R containing M. Then, as above (1) = {0} or, (I) = R/M.
Notes
Example: Show that 2Z 12 is a maximal ideal of Z12, whereas (0, 4,8) is not.
Solution: You know that Z12 = Z/12Z and 2Z 12 = 2Z/12Z. We see that Z12/ 2Z 12 = (Z/12Z)/
(2Z/12Z) = Z2, which is a field. Therefore, 2Z 12 {0, 2, 4,6, 8, 10} is maximal in Z12.
Now {0, 4, 8} = 4Z 12 2Z 12 Z 12 .
We first introduced you to a special ideal of a ring, called a prime ideal. Its speciality lies in the
fact that the quotient ring corresponding to it is an integral domain.
Then we discussed a special kind of prime ideal, i.e., a maximal ideal.
a
Consider Z and Q. You know that every element of Q is of the form , where a Z and b Z*.
b
a
Actually, we can also denote by the ordered pair (a, b) Z × Z*. Now, in Q we know that
b
a c
= - iff ad = bc. Let us put a similar relation on the elements of Z × Z*
b d
a c ad bc a c a c a c
and . , Q.
b d bd b d bd b d
Keeping these in mind we can define operations on Z × Z*. Then we can suitably define an
equivalence relation on Z × Z* to get a field isomorphic to Q.
We can generalise this procedure to obtain a field from any integral domain. So, take an integral
domain R. Let K be the following set of ordered pairs:
K= {(a,b) ) a , b R and b 0)
We define a relation ~ in K by
(a, b) ~ (c, d) if ad = bc.
(ii) Let (a, b), (c, d) K such that (a, b) ~ (c, d). Then ad = bc, i.e., cb = da. Therefore, (c, d) ~
(a, b). Thus, ~ is symmetric.
(iii) Finally, let (a,b), (c,d), ( u, v) K such that (a,b) – (c,d) and (c,d) ~ (u,v ). Then ad = bc and
cv = du. Therefore, (ad) v = (bc)v = bdu, i.e., avd =bud. Thus, by the cancellation law for
multiplication (which is valid for a domain), we get av = bu, i.e., (a,b) – (u,v). Thus, – is
transitive.
Notes Let us denote the equivalence class that contains (a,b) by [a,b]. Thus,
So, let [a,b] = [a’,b’] and [c,d] = [c’,d’]. We have to show that [a,b] + [c,d] = [a’,b’] + [c’,d’],
i.e., [ad+bc,bd] = [a’d’+b’c’,b’d’].
Now, (ad+bc) b’d’ – (a’d’ + b’c’) bd
= (0) dd’ + (0)bb’, since (a,b) - (a’, b’) and (c,d) ~ (c’,d’).
=0
Hence, [ad+bc,bd] = [a’ d’ + b’c’,brd’], i.e., + is well defined.
= ab’cd’ – ba’dc’ = ba’cd’ – ba’ cd’, since ab’ = bar and cd’ = dc’
=0
= [a,b] + [cv+ud,dv]
Self Assessment
Notes 2. An ideal P of a ring R with identity is a prime ideal of R. If and only if the .................. R/P
is an integral domain.
(a) polynomial ring (b) subring
(c) quotient ring (d) ideal ring
3. If x R, it has multiplicative inverse iff RX = ..................
(a) R (b) RX-1
(c) XR (d) X
4. A proper ideal m of a ring R is called maximal ideal of whenever I is an ideal of R such that
m .................. I .................. R then either I = m or I = R.
(a) , (b) ,
(c) , (d) ,
5. If R be a ring with identity. An ideal M in R is maximal if and only if .................. is a field.
(a) R.M (b) R/M
(c) M/R (d) R+M
19.3 Summary
19.4 Keywords
Prime Ideal: A ideal P of a ring R is called a prime ideal of R if whenever ab P for a, b R, then
either a P or b P.
Proper Ideal: A proper ideal M of a ring R is called a maximal ideal if whenever I is an ideal of
R such that M I R, then either I = M or I = R.
Maximal Ideal: Every maximal ideal of a ring with identity is a prime ideal.
1. Let F be a field. Show that F, with the Euclidean valuation d defined by d(a) = 1 a
F/{0}, is a Euclidean domain.
2. Let F be a field. Define the function
d : F(x)\{0} N {0} : d(f(x)) = deg f(x).
Show that d is a Euclidean valuation on F[x], and hence, F[x] is a Euclidean domain.
archives.math.utk.edu
CONTENTS
Objectives
Introduction
20.3 Summary
20.4 Keywords
Objectives
Introduction
In the last unit, you have studied about field and integer domain. In this unit, you will study
about Principal Ideal Domains.
In earlier classes you have seen that Z and F[x] satisfy a division algorithm. There are many
other domains that have this property. Here we will introduce you to them and discuss some of
their properties. Let us start with a definition.
Definition: Let R be an integral domain. We say that a function d : R \ (0) NU (0) is a Euclidean
valuation on R if the following conditions are satisfied:
!
Caution The difference between a unit in R and the unity in R. The unity is the identity
with respect to multiplication and is certainly a unit. But a ring can have other units ton, as
you have just seen in the case of Z.
Now, can we obtain all the units in a domain? You know that every non-zero element in a field
F is invertible. Thus, the set of units of F’ is F \ {0}. Let us look at some examples.
Since deg f(x) and deg g(x) are non-negative integers, this equation can hold only if deg f(x) = 0
= deg g(x). Thus, f(x) must be a non-zero constant, i.e., an element of F\ {0}. Thus, the units of F[x]
are the non-zero elements of F. That is, the units of F and F[x] coincide.
Example: Find all the units in R = a b 5|a, b Z .
Solution: Let a b 5 be a unit in R. Then there exists
c d 5 R such that
a b
5 c d 5 = 1
In the previous section you have proved that every ideal of F[x] is principal, where F is a field.
There are several other integral domains, apart from Euclidean domains, which have this property.
We give such rings a very appropriate name.
Definition: We call an integral domain R a principal ideal domain (PID, in short) if every ideal
in R is a principal ideal.
Thus, Z is a PID. Can you think of another example of a PID? What about Q and Q[x]? In fact, by
Theorem 2 all Euclidean domains are PIDs. But, the converse is not true. That is, every principal
ideal domain is not a Euclidean domain.
b
For example, the ring of all complex numbers of the form a
2
1 i 19 , where a, b Z, is a
On the contrary, suppose that 3 f(x) Z[x] such that < 2, x > = < f(x) >. Clearly, f(x) 0.
Also, 3 g(x), h(x) Z[x] such that
Then (2) shows that deg h(x) = 1. Let h(x) = ax+b with a,b Z.
Then x =f(x) h(x) = n(ax+b).
Comparing the coefficients on either side of this equation, we see that na = 1 and nb = 0. Thus, n
is a unit in Z, that is, n = If I.
Therefore, 1 < f(x) > = < x,2 >. Thus, we can write
I = x (a0 +a1x+ ...+arxr ) + 2(b0+b1x+ .... +bsxs), where ai,bj Z i = 0, l,.. ...., r and j = 0, 1,...,s.
Now, on comparing the constant term on either side we see that 1 = 2b0. This can’t be true, since
2 is not invertible in Z. So we reach a contradiction.
Thus, < x,2 > is not a principal ideal.
Notes We will now discuss some properties of divisibility in PIDs. If R is a ring and a,b R, with a 0,
then a divides b if there exists c R such that b = ac.
Definition: Given two elements a and b in a ring. R, we say that c R is a common divisor of a
and b if c | a and c | b.
An element d R is a greatest common divisor (g.c.d, in short) of a, b R if
(i) d | a and d | b, and
(ii) for any common divisor c of a and b, c | d.
For example, in Z a g.c.d of 5 and 15 is 5 , and a g.c.d of 5 and 7 is 1.
We will show you that if the g.c.d of two elements exists, it is unique up to units, i.e., if d and d
are two g.c.ds of a and b, then d=ud’ , for some unit u.
Proof: So, let d and d’ be two g.c.ds of a and b. Since d is a common divisor and d’ is a g.c.d, we
get d | d’ . Similarly, we get d’|d. Thus, we see that d and d’ are associates in R. Thus, the g.c.d of
a and b is unique up to units.
Theorem 3 allows us to say the g.c.d instead of a g.c.d. We denote the g.c.d of a and b by (a,b).
(This notation is also used for elements of R × R. But there should be no cause for confusion. The
context will clarify what we are using the notation for.
How do we obtain the g.c.d of two elements in practice? How did we do it in Z? We looked at the
common factors of the two elements and their product turned out to be the required g.c.d.
We will use the same method in the following example.
Solution: By the quadratic formula, we know that the roots of p(x) are 2 and –5, and the roots of
q(x) are 2 and –1/3.
The g.c.d of p(x) and q(x) is the product of the common factors of p(x) and q(x), which is (x – 2).
Theorem 4: Let R be a PID and a, b R. Then (a, b) exists and is of the form ax + by for some x,y
R.
Proof: Consider the ideal <a, b>. Since R is a PID, this ideal must be principal also. Let d R such
that <a, b> = <d>. We will show that the g.c,d of a and b is d.
Thus, we have shown that d = (a,b), and d= ax+by for some x.y R. Notes
The fact that F[x] is a PID gives-us the following corollary to Theorem 4.
Corollary: Let F be a field. Then any two polynomials f(x) and g(x) in F[x] have a g.c.d which is
of the form a(x)f(x)+b(x)g(x) for some a(x), b(x) F[x].
1 3 ( x) 2
For example, (c), (x–1) = (x – 2x2 + 6x – 5) + (x – 2x + 1).
5 5
Now you can use Theorem 4 to prove the following exercise about relatively prime elements in
a PID, i.e., pairs of elements whose g.c.d is 1.
Let us now discuss a concept related to that of a prime element of a domain.
Definition: Let R be an integral domain. We say that an element x R is irreducible if
Thus, an element is irreducible if it cannot be factored in a non-trivial way, i.e., its only factors
are its associates and the units in the ring.
So, for example, the irreducible elements of Z are the prime numbers and their associates. This
means that an element in Z is prime iff it is irreducible.
Another domain in which we can find several examples is F[x], where F is a field. Let us look at
the irreducible elements in R[x] and C[x], i.e., the irreducible polynomials over R and C. Consider
the following important theorem about polynomials in C[x]. You have already come across this
in the Linear Algebra course.
We will not prove these results here but we will use them often when discussing polynomials
over R or C. You can use them to solve the following exercise.
Let us now discuss the relationship between prime and irreducible elements in a PID.
Then (x,a) = 1, since the only factor of x is itself, up to units. Thus, xb, Thus, x is prime.
Notes Now, why do you think we have said that Theorem, 7 is true for a PID only? You can see that one
way is true for any domain. Is the other way true for any domain? That is, is every irreducible
element of a domain prime? You will get an answer to this question.
a–b I a, b I.
Thus, I is an ideal of R. Since R is a PID, I = <a> for some a R. Since a I, a I, for some m N.
Then I I,. But I, I. So we. see that I = Im.
Theorem 9: Let R be a PID. An ideal < a > is a maximal ideal of R iff a is a prime element of R.
Proof: If < a > is a maximal ideal of R, then it is a prime ideal of R. Therefore, a is a prime element
of R.
Conversely, let a be prime and let I be an ideal of R such that < a > I. Since R is a PID, I = < b
> for some b R. We will show that b is a unit in R.
Now, < a > < b > a = bc for some c R. Since a is irreducible, either b is an associate of a or
b is a unit in R. But if b is an associate of a, then <b> = <a>, a contradiction. Therefore, b is a unit
in R. Therefore, I = R.
What Theorem 9 says is that the prime ideals and maximal Ideals coincide in a PID.
Now, take any integer n. Then we can have n = 0, or n = ± 1, or n has a prime factor. This property
of integers is true for the elements of any PID, as you will see now.
Theorem 10: Let R be a PID and a be a non-zero non-invertible element of R. Then there is some
prime element p in R such that a.
Proof: If a is prime, take p = a. Otherwise, we can write a =albl, where neither a, nor b1 is an
associate of a. Then < a > < a1 >. If a1 is prime, take p = a1. Otherwise, we can write a1 = a2b2,
where neither a2 nor b2 is an associate of a,. Then <a1> < a2 >. Continuing in this way we get
an increasing chain
By Theorem 8, this chain stops with some < a, >. Then a, will be prime, since it doesn’t have any
non-trivial factors. Take p = a,, and the theorem is proved.
And now we are in a position to prove that any non-zero non-invertible element of a PID can be
uniquely written as a finite product of prime elements (i.e., irreducible elements).
Theorem 11: Let Rt be a PID. Let a R such that a 0 and a is not a unit. Then a = p1, p2....pr, where
p1,p2.... pr, are prime elements of R.
Proof: If a is a prime element, there is nothing to prove. If not, then P1 | a, for some prime p1 in
R, by Theorem 10. Let a = plal. If p1a1. If a1 is a prime, we are through. Otherwise P2 | a, for some
prime p2 in R. Let a1, = p2a2. Then a = p1p2a2. If a2 is a prime, we are through. Otherwise we
continue the process. Note that since al is a non-trivial factor of a, <a> <a1>. Similarly, <a1>
< a2 >. So, as the process continues we get an increasing chain of ideals,
in the PID R. Just as in the proof of Theorem 10, this chain ends at < a, > for some m N, and a,
is irreducible.
Hence, the process stops after m steps, i.e., we can write a = p1p2 ... pm, where pi is a prime element
of R i = 1, .... m.
Thus, any non-zero non-invertible element in a PID can be factorised into a product of primes.
What is interesting about this factorisation is the following result that you have already proved
for Z in Unit 1.
Theorem 12: Let R be a PID and a 0 be non-invertible in R. Let a = p1p2....pn = q1q2....qm, where
pi and qj are prime elements of R. Then n = m and each pi is an associate of some qj for 1 i | n,
1 j | m.
Before going into the proof of this result, we ask you to prove a property of prime elements that
you will need in the proof.
Notes Proof: Since p1p2, ...p, = q1q2 ... ,.qm, p1 | q1q2. ... qm,.
Thus, p1 | qj for some j = 1. .... ..,m. By changing the order of the qi, if necessary, we can assume
that j = 1, i.e., p1 | q. Let q1 = plul. Since q1 is irreducible, u1 must be a unit in R. So p1 and q1 are
associates. Now we have
p1p2 = Pn (p1u1)q2....qm.
Cancelling p1 from both sides, we get
p2p3...pn = u1q2...,qm.
Now, if m > n, we can apply the same process to p2, p3, and so on.
Then we will get
1 = u1u2 .... un qn+1 .... qm.
This shows that qn+1 is a unit. But this contradicts the fact that qn+1 is irreducible.
Thus, m n.
Interchanging the roles of the ps and qs and by using a similar argument, we get n m.
Thus, n = m.
During the proof we have also shown that each pi is an associate of some qj, and vice versa.
What Theorem 12 says is that any two prime factorisations of an element in a PID are identical,
apart from the order in which the factors appear and apart from replacement of the factors by
their associates.
Thus, Theorems 11 and 12 say that every non-zero element in a PID R, which is not a unit, can be
expressed uniquely (up to associates) as a product of a finite number of prime elements.
For example, x2 – 1 R[x] can be written as (x-1)(x+1) or (x-1) (x-1) or [2(x-tl)] [2(x-1)] in R[x].
The property that we have shown for a PID in Theorems 11 and 12 is true for several other
domains also. Let us discuss such rings now.
Self Assessment
(a) n (b) 2
(c) 4 (d) 5
20.3 Summary
The g.c.d. of any two elements a and b in a PID R exists and is of the form ax + by for some
x, y R.
The Fundamental Theorem of Algebra: Any non-constant polynomial over C has all its
roots in C.
20.4 Keywords
Euclidean Domain: An integral domain D is called a Euclidean domain if for each non-zero
element x in D there is assigned a non-negative integer (x) such that
(i) (ab) (b) for all non-zero a,b in D, and
(ii) for any non-zero elements a,b in D there exist q,r in D such that a = bq + r, where either
r = 0 or (r) < (b).
UID: Let R be a commutative ring with identity. A non-zero element p of R is said to be
irreducible if
2. Will any quotient ring of a PID be a PID? Why? Remember that a PID must be an integral
domain.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
21.3 Keyword
Objectives
Introduction
In this unit, we shall look at special kinds of integral domains. These domains were mainly
studied with a view to develop number theory. Let us say a few introductory sentences about
them.
You saw that the division algorithm holds for F[x], where F is a field. You saw that it holds
for Z. Such integral domains are called Euclidean domains.
We shall look at some domains which are algebraically very similar to Z. These are the principal
ideal domains, so called because every ideal in them is principal.
Finally, we shall discuss domains in which every non-zero non-invertible element can be uniquely
factorised in a particular way. Such domains are very appropriately called unique factorisation
domains. While discussing them, we shall introduce you to irreducible elements of a domain.
While going through the unit, you will also see the relationship between Euclidean domains,
principal ideal domains and unique factorisation domains.
Here we shall look at some details of a class of domains that include PDs.
Definition: We call an integral domain R a unique factorisation domain (UFD, in short) if every
non-zero element of R which is not a unit in R can be uniquely expressed as a product of a finite
number of irreducible elements of R.
Also, since any Euclidean domain is a PID, it is also a UFD. You directly proved that Z is a UFD.
Why don’t you go through that proof and then try and solve the exercises.
Now we give you an example of a domain which is not a UFD (and hence, neither a PID nor a
Euclidean domain).
Consider the ideals (pi) . (qi). Relabel so that p1 generates a minimal ideal amongst these
(in other words, (p1) does not strictly contain another one of the ideals). Now we show (p1) = (qi)
for some i. Suppose not. Then (p1) does not contain any qi, thus qi is nonzero modulo (p1) for
all i, which is a contradiction because the left-hand side of the above equation is zero modulo
(p1).
Relabel so that (p1) = (q1). Then p1 = uq1 for some unit u. Cancelling gives up2 ... pr = q2 ... qz. The
element up2 is also irreducible, so by induction we have that factorization is unique.
The converse of the above theorem is not always true. Consider the ring [x]. The ideal (2, x) is
not principal: suppose (2, x) = (a) for some a. Since this ideal contains the even integers, a must
be some integer (multiplication never reduces the degree of an element), and in fact it must be
(an associate of) 2. But (2) does not contain polynomials with odd coefficients, so (2, x) = (2).
You can check that this function has the property that
9 = 3.3 = ( 2 + –5 ) ( 2 – –5 ).
You have already shown that the only units of Z [ 5 ] are 1 and –1. Thus, no two of 3, 2+ 5
and 2 – 5 are associates of each other.
Also, each of them is irreducible. For suppose any one of them, say 2 + 5 , is reducible. Then
f ( 2 + –5 ) = f() f(),
Since f(), f() N and a, are not units, the only possibilities are f() = 3 = f().
Similarly, we can show that 3 and 2– 5 are irreducible. Thus, the factorisation of 9 as a product
of irreducible elements is not unique. Therefore, Z[ 5 ] is not a UFD.
From this example you can also see that an irreducible element need not be a prime element.
Now let us discuss some properties of a UFD. The first property says that any two elements of a
UFD have a g.c.d. and their g.c.d. is the product of all their common factors. Here we will use the
fact that any element a in a UFD R can be written as
a p1 r1 p 2 r2 ...p n rn
where the pis are distinct irreducible elements of R. For example, in Z[x] we have
x3 – x2 – x + 1 = (x – 1) (x + l) (x – 1 ) = (x – 1)2 (x + 1).
So, let us prove the following result.
Notes where p1, p2, ..., pn are distinct irreducible elements of R and ri and si are non-negative integers
i = 12, ..., n.
(If some pi does not occur in the factorisation of a, then the corresponding ri = 0. Similarly, if
some pi is not a factor of b, then the corresponding si = 0. For example, take 20 and 15 in Z. Then
20 = 22 × 30 × 5 and 15 = 20 × 31 × 51.)
Now, let c | a and c | b. Then every irreducible factor of c must be an irreducible factor of a and
of b, because of the unique factorisation property.
Therefore, c | d.
Proof: We know that every prime in R is irreducible. So let us prove the converse.
If (a,b) = a, a| b.
and a is irreducible, a must be one of the pis or one of the qjs. Since a | b, a pi for any i.
Therefore, a = qj for some j. That is, a | c.
Hence, a is prime.
We will not prove this result here, even though it is very useful to mathematicians. But let us
apply it. You can use it to solve the following exercises.
Lemma: Let D be a unique factorization domain, and let p be an irreducible element of D. If a,b
are in D and p|ab, then p|a or p|b.
Lemma: Let Q be the quotient field of D, and let f(x) be a polynomial in Q[x]. Then f(x) can be
written in the form f(x) = (a/b)f*(x), where f*(x) is a primitive element of D[x], a,b are in D, and
a and b have no common irreducible divisors. This expression is unique, up to units of D.
Lemma: Let D be a unique factorization domain, let Q be the quotient field of D, and let f(x) be
a primitive polynomial in D[x]. Then f(x) is irreducible in D[x] if and only if f(x) is irreducible in
Q[x].
Theorem 6: If D is a unique factorization domain, then so is the ring D[x] of polynomials with
coefficients in D.
Corollary: For any field F, the ring of polynomials
F[x1 , x2 , ... , xn]
Self Assessment
1. If R is a UFD and a R, with a 0 and being a .................., then a can be written as a product
of finite number of irreducible elements.
(a) invertible (b) non-invertible
21.2 Summary
Every PID is a UFD, but the converse is not true. Thus, Z, F’ and F[x] are UFDs, for any
field F.
21.3 Keyword
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
22.3 Summary
22.4 Keywords
Objectives
Introduction
In the earlier units, you must have come across expressions of the form x+1, x2+2x+1, and so on.
These are examples of polynomial. You have also dealt with polynomials in the course of Linear
Algebra. In this unit, we will discuss sets whose elements are polynomials of the type a0 + a, x +
... + anxn, where a0, a1,......, a,, are elements of a ring R. You will see that this set, denoted by R [x],
is a ring also.
You may wonder why we are talking of polynomial rings in a block on domains and fields. The
reason for this is that we want to focus on a particular case, namely, R [x], where R is a domain.
This will turn out to be a domain also, with a lot of useful properties. In particular, the ring of
polynomials over a field satisfies a division algorithm, which is similar to the one satisfied by
Z. We will prove this property and use it to show how many roots any polynomial over a field
can have.
As we have said above, you may already be familiar with expressions of the type 1 + x, 2 + 3x +
4x2, x5-1, and so on. These are examples of polynomials over the ring Z. Do these examples
suggest to you what a polynomial over any ring R is ? Let’s hope that your definition agrees
with the following one.
Definition: Let a,, + a, x + . . . + a,, xn be a polynomial over a ring R, where a, 0. Then we call the
integer n the degree of this polynomial, and we write
n
deg a i x i n, of a, , 0.
i 0
Notation: We will denote the set of all polynomials over a ring R by R[x]. (Please note the use of
the square brackets [ ]. Do not use any other kind of brackets because R [x] and R (x) denote
different sets.)
n i
Thus, R[x] = a 1 x a i R i 0, 1,...n, where n 0, n Z .
i 0
Notes
Thus, an example of an element from Z4 [x] is f(x) = 2 x2 + 3 x + i.
Now, for ring R, we would like to see if you can define operations on the set R [x] so that it
becomes a ring. For this purpose we define the operations of addition and multiplication of
polynomials.
Definition: Let f(x) = a,, + a1x + .. + a, xn and g (x) = b0 + b, x + .. + bmxm be two polynomials in
R[X]. Let us assume that m 2 n. Then their sum f(x) + g(x) is given by
f(x) + g(x) = (a,, + b0) + (a, + b1)x + .. + (an+ b,) xn + bn+1 xn+l .. + bmxm.
For example, consider the two polynomials p(x), q(x).in Z[x] given by
p(x) = 1 + 2x + 3x2, q(x) = 4 + 5x + 7x3
Then
p(x) + q(x) = (1+4) + (2+5)x + (3+0) x2 + 7x3 = 5 + 7x + 3x2 + 7x3.
Definition: If f(x) = a,, + a1x + .. + a, xn and g(x) = b0 + b, x + .. + bmxm are two polynomials in R [x],
we define their product f(x). g(x) by
Here a, = 1, a, = –1, a2 = 0, a3 = 2, b0 = 2, b, = 5, b2 = 7.
5
i
Thus, p(x) q(x) = c x , where
i=0
i
c0 = a0b0 = 2,
c1 = a1b0 + a0b1 = 3,
c2 = a2b0 + al b1 + a0b2 = 2,
Here, deg (p(x). q(x)) = 2 < deg p (x) + deg q (x) (since deg p (x) = 1, deg q (x) = 2).
By now you must have got used to addition and multiplication of polynomials. We would like
to prove that for any ring R, R[x] is a ring with respect to these operations. For this we must note
that by definition, + and . are binary operations over R [x].
Now let us prove the following theorem. It is true for any ring, commutative or not,
Theorem 1: If R is a ring, then so is R[x], where x is an indeterminate.
Proof: We need to establish the axioms R1 – R6 of Unit 14 for (R[x], + , .).
(i) Addition is Commutative: We need to show that
p(x) + q(x) = q(x) + p(x) for any p(x) , q(x) R [x].
Let p (x) = a0 + a1x + ... + a,xn, and
q(x) = b0 + b1x + ... + bmxm be in R[x].
Then, p (x) + q(x) = c0 + c1x + ... + c1xt,
where ci = ai + bi and t = max (m,n).
Similarly,
q(x) + p(x) = d0 + d1x + ... + dsxs,
where di = bi + ai, s = max (n, m) = t.
So we have
p(x) + q(x) = q(x) + p(x).
(ii) Addition is Associative: Again, by using the associativity of addition in R, we can show
that if p(x), q(x), s(x) R[x], then
{p(x) + q (x)} + s(x) = p(x) + {q(x) + s(x)}.
(iii) Additive Identity: The zero polynomial is the additive identity in R [x]. This is because,
for any p(x) = a0 + a, x+ ... + anxn R[x],
0 + p(x) = (0 + a,) + (0 +a1)x + ... +(0 + an)xn
= a0 + al x + ... +anxn
= p(x)
(iv) Additive Inverse: For p (x) = a, + a1x +... + anxn R[x], consider the polynomial –p(x) = Notes
–a, –a1x – ... –anxn, – ai being the additive inverse of ai in R. Then
p(x) + (–p(x)) = (a,, –a,,) + (a1 – a1) x + ... + (a1 – an)xn
= 0 + 0.x + 0.x2+ ... + 0.xn
= 0.
Therefore, – p(x) is the additive inverse of p(x).
(v) Multiplication is Associative:
Let p(x) =a, +a1x + ... +anxn,
q(x) = b0 + b1x+ ... +bmxm,
and t (x) = d0 +d1x + ... + drxr, be in R [x]
Then
p(x) . q(x) = c0 + c1 x + .. . + csxs, where s = m + n and
Therefore,
{p(x) . q(x)} t (x) = e0 + e1 x + ... +e1xt,
where t = s + r = m+n+r and
ek = ckd0 + ck-1d1 + ... + c0dk
= (akb0 + ... + a0bk)d0 + (ak-1,b0 + ... + a0bk-1) d1 + ... + a0b0dk.
Similarly, we can show that the coefficient of xk (for any k 0) in p(x) (q (x) t(x))
is akb0d0 + ak-1, (b1d0 + b0d1) + ... + a0(bkd0 + bk-1, d1 + ... + b0dk)
= ek, by using the properties of + and . in R.
Hence, {p(x).q(x)} . t(x) = p(x) . {q (x). t (x)}
(vi) Multiplication Distributes over Addition:
Let p(x) = a0 +a1x + ... + anxn,
q(x) = b0 + blx + ...+ bmxm
and t(x) = d0 + d, x + . . . + dr xr be in R[x],
The coefficient of xk in p (x). (q(x) + t (x)) is
ck = ak (b0 + d0) + a(b1 + d1) + (b1 + d1) + ... + a, (bk + dk).
And the coefficient of xk in p (x) q (x) + p (x) t(x) is
(akb0 + ak-1b1 + ... + a0bk) + (akd0 + ak-1d1 + ... +a0dk),
= ak(b0 + d0) +ak-1 (b + d1) + ... +a0 (bk + dk) = ck
This is true k 0.
Notes Note that the definitions and theorem in this section are true for any ring. We have not restricted
ourselves to commutative rings. But, the case that we are really interested in is when R is a
domain. In the next section we will progress towards this case.
In the previous section you must have realised the intimate relationship between the operations
on a ring R and the operations on R [x]. The next theorem reinforces this fact.
Theorem 2: Let R be a ring.
(a) If R is commutative, so is R [x].
(b) If R has identity, so does R [x].
= bka0 + bk-1a14 ... + b1ak-1 + b0ak, since both addition and multiplication are commutative in R.
In the following exercise we ask you to check if the converse of Theorem 2 is true.
Now let us explicitly state a result which will help in showing us that R is a domain iff R [x} is a
domain, This result follows just from the definition of multiplication of polynomials.
Theorem 3: Let R be a ring and f (x) and g (x) be two non-zero elements of R [x]. Then deg (f(x) g
(x)) deg f(x) + deg g (x), with equality if R is an integral domain.
Since an+1, an+2, ... and bm+1, bm+2. .. . are all zero, Notes
cm+n = anbm
Now, if R is without zero divisors, then anbm 0, since a, 0
and b 0. Thus, in this case,
deg (f(x) g (x)) = deg f(x) + deg g (x).
On the other hand, if R has zero divisors, it can happen that a,b, = 0. In this case,
deg (f (x) g (x)) < m+n = deg f(x) + deg g(x).
Thus, our theorem is proved.
The following result follows immediately from Theorem 3.
Conversely, let us assume that R [x] is without zero divisors. Let a and b be non-zero elements
of R. Then they are non-zero elements of R [x] also. Therefore, ab 0. Thus, R is without zero
divisors. So, we have proved the theorem.
Now, you have seen that many properties of the ring R carry over to R’[x]. Thus, if F is a field, we
should expect F[x] to be a field also, But this is not so. F[x] can never be a field.
This is because any polynomial of positive degree in F|x| does not have a multiplicative
inverse. Let us see why.
Let f (x) F [x] and deg f (x) = n > 0. Suppose g (x) F [x] such that f (x) g (x) = 1. Then
0 = deg 1 = deg (f(x) g (x)) = deg f(x) + deg g (x), since F [x] is a domain.
We reach a contradiction.
But there are several very interesting properties of F [x], which are similar to those of Z, the set
of integers. In the next section we shall discuss the properties of division in F [x].
Self Assessment
(a) a0x0 + a1x1 + a2x2 + ...... anxn (b) a0x1 + a2x2 + a3x3 + ...... anxn
(c) a-1x + a-1x2 + a-1x3 ...... a-1xn (d) a0x-1 + a1x-1 + a2x-3 ...... anx-n
Notes 2. The degree of the zero polynomial to be ................. thus degree 0 = .................
(a) – , – (b) ,
(c) , –1 (d) –1,
3. 3x2 + 4x + 5 is a polynomial of degree ................., whose coefficients belong to the ring of
integers Z its leading coefficients is .................
(a) 4, 5 (b) 2, 3
(c) 2, 4 (d) 2, 5
4. x2 + 2x4 + 6x + 8 is a polynomial of degree ................. with coefficient 2.
(a) 4 (b) 5
(c) 6 (d) 8
(a) = (b)
(c) (d)
22.3 Summary
22.4 Keywords
1. Identify the polynomials from the following expressions. Which of these are elements of
Z[x]?
2 1
(a) x6 + x5 + x4 + x2 + x + 1 (b) x x2
x2 x
1 1 1
(c) 3x 2 2x 5 (d) 1 x x2 x 3
2 3 4
2. Calculate Notes
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
23.3 Keywords
Objectives
Introduction
In the last unit, you have studied about polynomials rings. In this unit, we will discuss the
division of algorithm.
Theorem 1 (Division Algorithm): Let F be a field. Let f(x) and g(x) be two polynomials in F[x],
with g(x) 0. Then
(a) there exist two polynomials q(x) and r (x) in F [X] such that
f (x) = q (x) g (x) + r (x), where deg r(x) < deg g (x).
Then f(x) = 0. g(x) + f (x), where deg f(x) < deg g (x).
Therefore, the algorithm is true for f(x), and hence for all polynomials in F[x].
(b) Now let us show that q(x) and r(x) are uniquely determined.
If possible, let
f(x) = q1(x) g(x) + r1(x), where deg r1(x) < deg g(x).
and
Then
q1(x) g(x)+r1(x) = q2(x) g(x)+r2(x), so that
deg r2(x) < deg g(x) and deg r1(x) < deg g(x).
But this contradicts Equation (1). Hence. Equation (1) will remain valid only if
Notes Thus, we have proved the uniqueness of q(x) and r(x) in the expression f(x) = q(x) g(x)+r(x). Here
q(x) is called the quotient and r(x) is called the remainder obtained on dividing f(x) by g(x).
Now, what happens if we take g(x) of Theorem 1 to be a linear polynomial? We get the remainder
theorem. Before proving it let us set up some notation.
Notation: Let R be a ring and f(x) R[x]. Let
f(x) = a0 + alx + ... +anxn.
Then, for any r R, we define
that is, f(r) is the value of f(x) obtained by substituting r for x.
Thus, if f(x) = 1 + x + x2 Z[x], then
f(2) = 1 + 2 + 4 = 7 and f(0) = 1 + 0 + 0 = 1.
Let us now prove the remainder theorem, which is a corollary to the division algorithm.
Theorem 2 (Remainder Theorem): Let F be a field. If f(x) P[x] and b F, then there exists a
unique polynomial q(x) F[x] such that f(x) = (i-b) q(x)+f(b).
Proof: Let g(x) = x-b. Then, applying the division algorithm to f(x) and g(x), we can find unique
q(x) and r(x) in F[x], such that
f(b) = (b – b) q(b) + a
= 0.q(b) + a= a
Thus, a = f(b).
Therefore, f(x) = (x-b) q(x)+f(b).
x 2 + x + 1) x 4 + x3 + 5x 2 x
x 4 x3 x2
4x2 x
4x2 4x 4
5x 4
Now, since the degree of the remainder -5x- 4 is less than deg .(x2+x+1), we stop the process. We Notes
get
x4 + x3 + 5x2 – x = (x2 + x + 1) (x2 + 4) – (5x + 4)
Here the quotient is x2 + 4 and the remainder is – (5x+4).
Now, let us see what happens when the remainder in the expression f = qg + r is zero.
Self Assessment
1. Let F be a field. Let f(x) and g(x) be two polynomials is f[x], with g(x) 0, then the
polynomial q(x) and r(x) an ...................
2. If deg f(x) < deg g(x) we can chosen q(x) = 0. Then f(x) = 0.g(x) + f(x) where degf(x) ...................
deg g(x).
(a) < (b) >
(c) (d)
4 3 2
3. x + x + 5x – x is equal to ...................
4. ................... theorem said that let F be a field, if F[x] P[x] and b F, then there exists a
unique polynomial q(x) F[x] such that f(x) = (i - b) q(x) + F(b)
(a) remainder theorem (b) division algorithm
23.2 Summary
The division algorithm in F[x], where F is a field, which states that if f(x), g(x) F(x),
g(x) 0, then there exist unique q(x), r(x) F[x] with f(x) = q(x) g(x)+r(x) and deg r(x)
< deg g(x).
A non-zero polynomial of degree n over a field F can have at the most n roots.
23.3 Keywords
Division Algorithm: Let F be a field. Let f(x) and g(x) be two polynomials in F[x], with g(x) 0.
Remainder Theorem: Let F be a field. If f(x) P[x] and b F, then there exists a unique polynomial
q(x) F[x] such that f(x) = (i-b) q(x)+f(b).
(b) f = x 3 + 2x 2 – x + 1, g = x + 1 in Z 3 [x].
(c) f = x3 – 1, g = x – 1 in R[x].
p
2. You know that if p, q Z, q 0, then can be written as the sum of an integer and a
q
fraction * with | m | < | q |. What is the analogous property for elements of F[x]?
archives.math.utk.edu
CONTENTS
Objectives
Introduction
24.3 Summary
24.4 Keywords
24.5 Review Questions
Objectives
Introduction
We have discussed various kinds of integral domains, including unique factorisation domains.
Over there you saw that Z[x] and Q[x] are UFDs. Thus, the prime and irreducible elements
coincide in these rings. In this unit, we will give you a method for obtaining the prime
(or irreducible) elements of Z[x] and Q[x]. This is the Eisenstein criterion, which can also be used
for obtaining the irreducible elements of any polynomial ring over a UFD.
After this, we will introduce you to the field extensions and subfields. We will use irreducible
polynomials for obtaining field extensions of a field F from F[x]. We will also show you that
every field is a field extension of Q or Z, for some prime p. Because of this, we call Q and the Zps
prime fields. We will discuss these fields briefly.
Finally, we will look at finite fields. These fields were introduced by the young French
mathematician Evariste Galois while he was exploring number theory. We will discuss some
properties of finite fields which will show us how to classify them.
Before reading this unit ,we suggest that you go through the definitions of irreducibility.
We introduced you to irreducible polynomials in F[x], where F is a field. We also stated the
Fundamental Theorem of Algebra, which said that a polynomial over C is irreducible iff it is
linear. You also learnt that if a polynomial over R is irreducible, it must have degree 1 or
degree 2. Thus, any polynomial over R of degree more than 2 is reducible. And, using the
quadratic formula, we know which quadratic polynomials over R are irreducible.
Now let us look at polynomials over Q. Again, as for any field F, a linear polynomial over Q is
irreducible. Also, by using the quadratic formula we can explicitly obtain the roots of any
quadratic polynomial over Q, and hence figure out whether it is irreducible or not. But, can you
tell whether 2x7 + 3x5 – 6x4 + 3x3 + 12 is irreducible over Q. This criterion was discovered by the
nineteenth century mathematician Ferdinand Eisenstein. In this section we will build up the
theory for proving this useful criterion.
Let us start with a definition.
Definition: Let f(x) = a, + a1x + . .. + anxn Z[x]. We define the content of f[x] to be the g.c.d. of the
integers a0, a1,,..., a,.
We say that f(x) is primitive if the content of f(x) is 1.
For example, the content of 3x2 + 6x + 12 is the g.c.d. of 3, 6 and 12, i.e., 3. Thus, this polynomial
is not primitive. But x5 + 3x2 + 4x – 5 is primitive, since the g.c.d of 1, 0, 0, 3, 4, –5 is 1.
We will now prove that the product of primitive polynomials is a primitive polynomial. This
result is well known as Gauss’ lemma.
Theorem 1: Let f(x) and g(x) be primitive polynomials. Then so is f(x) g(x).
p | bs.
Now consider
= arbs + (a0br+s + aIbr+s-1 + ... + ar-1 bs+1 + ar+1bs-1 + ... + ar+s b0)
By our choice of r and s, p | a0, p | a1, ..., p | ar-1, and p | b0, p | b1, ..., p | bs-1. Also p | cr+s,
Therefore, p | cr+s – (a0br+s +... + ar-1 bs+1 + ar+1 bs-1 + ... + ar+s b0)
i.e., p | ar b,.
Notes
But p | ar and p | bs. So we reach a contradiction. Therefore, our supposition is false. That is, our
theorem is true.
3 3 1 2 1
Consider any polynomial over Q, say f(x) = x + x + 3x + . If we take the l.c.m of the
2 5 3
denominators, is., of 2, 5, 1 and 3, i.e., 30 and multiply f(x) by it, what do we get? We get
30f(x) = 45x3 + 6x2 + 90x + 10 Z[x]
Using the same process, we can multiply any f(x) Q[x] by a suitable integer d so that df(x),
Z[X]. We will use this fact while relating irreducibility in Q[x] with irreducibility in Z[x].
Theorem 2: If f(x) Z[x] is irreducible in Z[x], then it is irreducible in Q[x].
Proof: Let us suppose that f(x) is not irreducible over Q[x]. Then we should reach a contradiction.
So let f(x) = g(x) h(x) in Q[x], where neither g(x) nor h(x) is unit, i.e., deg g(x) > 0, deg h(x) > 0. Since
g(x) Q[x]. m Z such that mg(x) Z[x]. Similarly, n Z such that nh(x) Z[x]. Then,
Therefore, f(x) = rf1(x) = (rg1(x)) h1(x) in Z[x], where neither rp1(x) nor h1(x) is a unit. This
contradicts the fact that f(x) is irreducible in Z[x].
What this result says is that to check irreducibility of ii polynomial in Q[x], it is enough to check
it in Z[x]. And, for checking it in Z[x] we have the terrific Eisenstein’s criterion, that we mentioned
at the beginning.
Theorem 3 (Eisenstein’s Criterion): Let f(x) = a0 + alx + ... + a,,xn Z[x]. Suppose that for some
prime number p;
(i) P | an,
(iii) p2 | a0.
Now a0 = b0c0. We know that p | a0. Thus, p | b0c0, p | b0 or p | c0. Since p2 | a0, p cannot divide
both b0 and c0. Let us suppose that p | b0 and p k CJ
Now let us look at a,, = b, c,. Since p | a, we see that p | bm and p | cr. Thus, we see that for some
i, p | bi. Let k be the least integer such that p | bk. Note that 0 < k m < n.
Therefore, p|ak.
Since p|ak and p|b0, p | b1, ..., p | bk–1, we see that p(ak – (b0ck + .... + bk–1c1), i. e.,
You also know that if R is a ring, and M is a maximal ideal of R, then R/M is a field.
Thus, Q[x] /<x3 – p> is a field.
Solution: To start with, we would like you to note that f(x) = g(x) h(x) in Z[x] iff f(x + 1) = g(x + 1)
h(x + 1) in Z[x]. Thus, f(x) is irreducible in Z[x] iff f(x + l) is irreducible in Z[x].
xp 1
Now, f(x) =
x1
x 1 p 1
f(x + 1) =
x
1 p p
= (x + C1 xp-1 + ... + pCp-1 x + 1 – 1), (by the binomial theorem)
x
Now apply Eisenstein’s criterion taking p as the prime. We find that f(x+l) is irreducible. Notes
But, we can generalise the fact that irreducibility in Z[x] implies irreducibility in Q[x]. This is not
only true for Z and Q; it is true for any UFD R and its field of quotients F. Let us state this
relationship explicitly.
Theorem 4: Let R be a UFD with field of quotients F.
(i) If f(x) R[x] is an irreducible primitive polynomial, then it is also irreducible in F[x].
(ii) (Eisenstein’s Criterion) Let f(x) = a0 + a1x + ... + a, xn R[x] and p R be a prime element
such that p | a,, p2 | a0 and p | ai for 0 i < n. Then f(x) is irreducible in F[x].
The proof of this result is on the same lines as that of Theorems 2 and 3. We will not be doing it
here. But if you are interested, you should try and prove the result yourself.
Now, we have already pointed out that if F is a field and f(x) is irreducible over F, then F[x]/
<f(x)> is field. How is this field related to F? That is part of what we will discuss in the next
section.
We shall discuss subfields and field extensions. To start with let us define these terms. By now
the definition may be quite obvious to you.
(ii) the set of all non-zero elements of S forms a subgroup of the group of non-zero elements
of F under multiplication.
Theorem 5: A non-empty subset S of a field F is a subfield of F if and only if
(i) a S, b S a – b S, and
(ii) a S , b S , b 0 ab-1 S.
Now, let us look at a particular field extension of a field F. Since F[x] is an integral domain, we
can obtain its field of quotients. We denote this field by F(x). Then F is a subfield of F(x). Thus,
F(x) is a field extension of F. Its elements are expressions of the form f,( x) where f(x), g(x) F[x]
and g(x) # 0.
g(x)
There is another way of obtaining a field extension of a field F from F[x]. We can look at quotient
rings of F[x] by its maximal ideals. You know that an ideal is maximal in F[x] iff it is generated
by an irreducible polynomial over F. So, F[x]/<f(x)> is a field iff f(x) is irreducible over F.
Notes Now, given any f(x) F[x], such that deg f(x) > 0, we will show that there is a field monomorphism
from F into F[x]/d(x)>. This will show that F[x)/<f(x)> contains an isomorphic copy of F; and
hence, we can say, that it contains F. So, let us define 0 : F F[x]/d(x)>: (a) = a + <f(x)>.
Well, we have looked at field extensions of any field F. Now let us look at certain fields, one of
which F will be an extension of.
Let us consider any field F. Can we say anything about what its subfields look like? Yes, we can
say something about one of its subfields. Let us prove this very startling and useful fact.
Theorem 6: Every field contains a subfield isomorphic to Q or to Zp, for some prime number p.
You know that char F = 0 or char F = p, a prime. So let us look at these two cases separately.
Case 1 (char F = 0): In this case f is one-one. Z = f(Z). Thus, f(Z) is an integral domain contained
in the field F. Since F is a field, it will also contain the field of quotients of f(Z). This will be
isomorphic’ to the field of quotients of Z, i.e., Q. Thus, F has a subfield which is isomorphic to Q.
Let F be a field.
(i) If char F = 0, then F has a subfield isomorphic to Q.
Because of this property of Q arid Zp (where p is a prime number) we call these fields prime
fields.
Thus, the prime fields are Q, Z2, Z3, Z5, etc. Notes
We call the subfield isomorphic to a prime field (obtained in Theorem 6), the prime subfield of
the given field.
Let us again reword Theorem 6 in terms of field extensions. What it says is that every field is a
Weld extension of a prime field.
Now, suppose a field F is an extension of a field K. Are the prime subfields of K and F isomorphic
or not? To answer this let us look at char K and char F. We want to know if char K = char F or not.
Since F is a field extension of K, the unity of F and K is the same, namely, 1. Therefore, the least
positive integer n such that n.1 = 0 is the same for F as well as K. Thus, char K = char F. Therefore,
the prime subfields of K and F are isomorphic.
A very important fact that a field is a prime field iff it has no proper subfields.
You have dealt a lot with the finite fields Zp. Now we will look at field extensions of these fields.
You know that any finite field F has characteristic p, for some prime p. And then F is an extension
of Z. Suppose P contains q elements. Then q must be a power of p. That is what we will prove
now.
Theorem 7: Let F be a finite field having q elements and characteristic p. Then q = pn, some
positive integer n.
The proof of this result uses the concepts of a vector space and its basis.
Proof: Since char F = p, F has a prime subfield which is isomorphic to Zp. We lose nothing if we
assume that the prime subfield is Zp. We first show that F is a vector space over Zp with finite
dimension.
(ab). v = a. (b.v)
1.v = v.
Now, we know that (F, +) is an abelian group. We also know that the multiplication in F will
satisfy all the conditions that the scalar multiplication should satisfy. Thus, F is a vector space
over 2,. Since F is a finite field, it has a finite dimension over Zp. Let dim Zp F = n. Then we can
find a,. .., an, a F such that
Notes Similarly, each of b2, b3, ...., bn has p choices. And, corresponding to each of these choices we get
a distinct element of F. Thus, the number of elements in F is p × p × ... × p (n times) = pn.
The utility of this result is something similar to that of Lagrange’s theorem. Using this result we
know that, for instance, no field of order 26 exists. But does a field of order 25 exist? Does
Theorem 7 answer this question? It only says that a field of order 25 can exist. But it does not say
that it does exist. The following exciting result, the proof of which is beyond the scope of this
course, gives us the required answer. This result was obtained by the American mathematician
E.H. Moore in 1893.
Theorem 8: For any prime number p and n N, there exists a field with pn elements. Moreover,
any two finite fields having the same number of elements are isomorphic.
Self Assessment
4. Let F be a finite having of elements and characteristics P, then q = .............., some positive
integer n.
(a) p-1 (b) pn
24.3 Summary
Eisenstein’s irreducibility criterion for polynomials over Z and Q. This states that if Notes
f(x) = a0 + a, x + . . . + anxn Z[x] and there is a prime p Z such that
p | ai i = 0 , 1 . ..., n – 1.
p | a, and
p | a0,
The number of elements in a finite field F is pn’, where char F = p and dim zpF = n.
Given a prime number p’and n N, there exists a field containing pn elements. Any two
finite fields with the same number of elements are isomorphic.
n
p
If F is a finite field with pn elements, then x x is a product of pn linear polynomials
over F.
24.4 Keywords
Eisenstein’s Criterion: Let f(x) = a0 + alx + ... + a,,xn Z[x]. Suppose that for some prime number
p; (i) P | an, (ii) p | a0, p | a1,...p|an–1
2. Prove that any polynomial f(x) Z[x] can be written as dg(x), where d is the content of f(x)
and g(x) is a primitive polynomial.
3. For any n N and prime number p, show that xn – p is irreducible over Q[x]. Note that this
shows us that we can obtain irreducible polynomials of any degree over Q[x].
4. If a0 + a1x + ... + a, xn Z[x] is irreducible in Q[x], can you always find a prime p that satisfied
the conditions (i), (ii) and (iii) of Theorem 3?
6. Let p be a prime: integer. Let a be a non-zero non-unit square-free integer, i.e., b 2 | a for
any b Z. Show that Z[x]/<xp + a> is an integral domain.
archives.math.utk.edu
CONTENTS
Objectives
Introduction
25.3 Keywords
Objectives
Introduction
You have seen when we can say that an element in a ring divides another element. Let us recall
the definition in the context of F[x], where F is a field.
Definition: Let f(x) and g(x) be in F[x], where F is a field and g(x) 0. We say that g(x) divides
f(x)(or g(x) is a factor of f(x), or f(x) is divisible by gi(x)) if there-exists q(x) F[x] such that
We write g(x) | f(x) for ‘g(x) divides f(x)’, and g(x) | f(x) for ‘g(x) does not divide f(x)’.
Now, if f(x) F[x] and g(x) F[x], where g(x) 0, when g(x) | f(x)? We find that g(x) | f(x) if
r(x) = 0.
Definition: Let F be a field and f(x) F[x]. We say that an element a F is a root (or zero) of f(x)
if f(n) = 0.
3 2 1 1
Similarly, –1 is a root of f(x) = x x x Q[x], since
2 2
1 1
f(–1) = –1 +1 0.
2 2
Notes Let F be a field and f (x) F[x]. Then a P is a root of f(x) if and only if (x–a) | f(x)).
For example, 3 is a root of multiplicity 2 of the polynomial (x–3)2 (x+2) Q[x]; and (–2) is a root
of multiplicity 1 of this polynomial.
Now, is it easy to obtain all the roots of a given polynomial? Any linear polynomial ax+b F[x]
will have only one root, namely, -a-1b. This is because ax+b = 0 iff x = -a-1b.
In the case of a quadratic polynomial ax2 + bx + c F[x], you know that its two roots are obtained
by applying the quadratic formula
b b 2 4ac
2a
For polynomials of higher degree we may be able to obtain some roots by trial and error. For
example, consider f(x) = x5 – 2x + 1 R[x]. Then, we try out x = 1 and find f(1) = 0. So, we find that
1 is a zero of f(x). But this method doesn’t give us all the roots of f(x).
As we have just seen, it is not easy to find all the roots of a given polynomial. But, we can give
a definite result about the number of roots of a polynomial.
Theorem 1: Let f(x) be a non-zero polynomial of degree n over a field F:Then f(x) has at most n
roots in F.
then
f(x) = a0 + a1x, where a0, a, F and a, 0.
Now assume that the theorem is true for all polynomials in F[x] of degree n. We will show that
the number of roots of f(x) n.
If f(x) has no root in F, then the number of roots of f(x) in F is 0 S n. So, suppose f(x) has a root
a F.
Hence, by the induction hypothesis g(x) has at most n–1 roots in F, say a1,....,an-1. Now,
Using this result we know that, for example, x3–1 Q[x] can’t have more than 3 roots in Q.
In Theorem 1 we have not spoken about the roots being distinct. But an obvious corollary of
Theorem 1 is that
if f(x) F[x] is of degree n, then f(x) has st most n distinct roots in F.
Example: Prove that x 3 5x Z 6 [x] has more roots than its degree. (Note that Z6 is not
a field.)
Solution: Since the ring is finite, it is easy for us to run through all its elements and check which
of them, are roots of
f(x) x 3 5x.
In fact, every element of Z6 is a zero of f(x). Thus, f(x) has 6 zeros, while deg f(x) = 3.
So far, we have been saying that a polynomial of degree n over F has at most n roots in Fa. It can
happen that the polynomial has no root in F. For example, consider the polynomial x2 + 1 R[x].
You know that it can have 2 roots in R, at the most. But as you know, this has no roots in R (it has
two roots, i and –i, in C).
We can find many other examples of such polynomials in R[x]. We call such polynomials
irreducible over R. We shall discuss them in detail in the next units.
Now let us end this unit by seeing what we have covered in it.
Definition: Let F be a set on which two binary operations are defined, called addition and
multiplication, and denoted by + and · respectively. Then F is called a field with respect to these
operations if the following properties hold:
(i) Closure: For all a,b in F the sum a + b and the product a . b are uniquely defined and belong
to F.
(ii) Associative Laws: For all a,b,c in F,
a + b = b + a and a · b = b· a.
Notes (v) Identity Elements: The set F contains an additive identity element, denoted by 0, such that
for all a in F,
a + 0 = a and 0 + a = a.
The set F also contains a multiplicative identity element, denoted by 1 (and assumed to be
different from 0) such that for all a in F,
a· 1 = a and 1· a = a.
(vi) Inverse Elements: For each a in F, the equations
a + x = 0 and x + a = 0
have a solution x in F, called an additive inverse of a, and denoted by -a. For each nonzero
element a in F, the equations
a· x = 1 and x· a = 1
have a solution x in F, called a multiplicative inverse of a, and denoted by a-1.
Definition: Let F be a field. For am, am-1 , . . . , a1, a0 in F, an expression of the form
has degree n, written deg(f(x)) = n, and an is called the leading coefficient of f(x). If the leading
coefficient is 1, then f(x) is said to be monic.
Two polynomials are equal by definition if they have the same degree and all corresponding
coefficients are equal. It is important to distinguish between the polynomial f(x) as an element
of F[x] and the corresponding polynomial function from F into F defined by substituting elements
of F in place of x. If f(x) = amxm + · · · + a0 and c is an element of F, then f(c) = amcm + · · · + a0. In fact,
if F is a finite field, it is possible to have two different polynomials that define the same polynomial
function. For example, let F be the field Z5 and consider the polynomials x5 - 2x + 1 and 4x + 1. For
any c in Z5, by Fermat’s theorem we have c5 c (mod 5), and so
c5 - 2c + 1 - c + 1 4c + 1 (mod 5),
and
the sum of f(x) and g(x) is defined by just adding corresponding coefficients. The product f(x)g(x)
is defined to be
k
ck = i 0a i
bk i .
This definition of the product is consistent with what we would expect to obtain using a naive Notes
approach: Expand the product using the distributive law repeatedly (this amounts to multiplying
each term be every other) and then collect similar terms.
Proposition: If f(x) and g(x) are non-zero polynomials in F[x], then f(x)g(x) is non-zero and
deg(f(x)g(x)) = deg(f(x)) + deg(g(x)).
Corollary: If f(x),g(x),h(x) are polynomials in F[x], and f(x) is not the zero polynomial, then
f(x)g(x) = f(x)h(x) implies g(x) = h(x).
Definition: Let f(x),g(x) be polynomials in F[x]. If f(x) = q(x)g(x) for some q(x) in F[x], then we say
that g(x) is a factor or divisor of f(x), and we write g(x) | f(x). The set of all polynomials divisible
by g(x) will be denoted by < g(x) >.
Lemma: For any element c in F, and any positive integer k,
(x - c) | (xk - ck).
Theorem 3: Let f(x) be a non-zero polynomial in F[x], and let c be an element of F. Then there
exists a polynomial q(x) in F[x] such that
Definition: Let f(x) = amxm + · · + a0 belong to F[x]. An element c in F is called a root of the
polynomial f(x) if f(c) = 0, that is, if c is a solution of the polynomial equation f(x) = 0 .
Corollary: Let f(x) be a non-zero polynomial in F[x], and let c be an element of F. Then c is a root
of f(x) if and only if x-c is a factor of f(x). That is,
f(c) = 0 if and only if (x-c) | f(x).
Corollary: A polynomial of degree n with coefficients in the field F has at most n distinct roots
in F.
Self Assessment
1. Let F be a field and f(x) F[x] then we say that an element a F is a root of f(x) of
f(n) = ...............
(a) 1 (b) 2
(c) 0 (d) –1
1 1
3. ............... is a root of f(x) = x3 + x2 + x Q[x]
2 2
(a) 1 (b) 2
(c) –1 (d) –2
(a) 2 (b) 3
(c) 1 (d) more
25.2 Summary
If f(x) and g(x) are non-zero polynomials in F[x], then f(x)g(x) is non-zero and deg(f(x)g(x))
= deg(f(x)) + deg(g(x)).
If f(x),g(x),h(x) are polynomials in F[x], and f(x) is not the zero polynomial, then f(x)g(x) =
f(x)h(x) implies g(x) = h(x).
Let f(x),g(x) be polynomials in F[x]. If f(x) = q(x)g(x) for some q(x) in F[x], then we say that
g(x) is a factor or divisor of f(x), and we write g(x) | f(x). The set of all polynomials
divisible by g(x) will be denoted by < g(x) >.
(x - c) | (xk - ck).
Let f(x) be a non-zero polynomial in F[x], and let c be an element of F. Then there exists a
polynomial q(x) in F[x] such that
f(x) = q(x)(x - c) + f(c).
Moreover, if f(x) = q1(x)(x - c) + k, where q1(x) is in F[x] and k is in F, then q1(x) = q(x) and
k = f(c).
Let f(x) = amxm + · · + a0 belong to F[x]. An element c in F is called a root of the polynomial
f(x) if f(c) = 0, that is, if c is a solution of the polynomial equation f(x) = 0 .
Let f(x) be a non-zero polynomial in F[x], and let c be an element of F. Then c is a root of f(x)
if and only if x-c is a factor of f(x). That is,
25.3 Keywords
Field: Let F be a set on which two binary operations are defined, called addition and multiplication,
and denoted by + and · respectively. Then F is called a field with respect to these operations.
Identity Elements: The set F contains an additive identity element, denoted by 0, such that for all
a in F,
a + 0 = a and 0 + a = a.
have a solution x in F, called an additive inverse of a, and denoted by -a. For each non-zero
element a in F, the equations
a· x = 1 and x· a = 1
1. Let F be a field and f(x) F[x] with deg f(x) 1. Let a F. Show that f(x) is divisible by
x – a iff f(a) = 0.
2. Find the roots of the following polynomials, along with their multiplicity.
1 2 5
(a) f(x) x x 3 Q[x] (b) f(x) x 2 x 1 Z 3 [x]
2 2
(b) f (b) = b b F.
4. Let p be a prime number. Consider xp 1 1 Zp [x]. Use the fact that Zp is a group of order
p to show that every non-zero element of Zp is a root of xp-1 – 1. Thus, show that xp-1 – 1 =
(x 1)(x 2)...(x p 1).
5. The polynomial x4 + 4 can be factored into linear factors in Z5[x]. Find this factorisation.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
26.3 Keywords
Objectives
Introduction
Beginning with a field K, and a polynomial f(x) K, we need to construct the smallest possible
extension field F of K that contains all of the roots of f(x). This will be called a splitting field for
f(x) over K. The word “the” is justified by proving that any two splitting fields are isomorphic.
Let F be an extension field of K and let u F. If there exists a non-zero polynomial f(x) K[x]
such that f(u) = 0, then u is said to be algebraic over K. If not, then u is said to be transcendental
over K.
If F is an extension field of K, and u F is algebraic over K, then there exists a unique monic
irreducible polynomial p(x) K[x] such that p(u) = 0. It is the monic polynomial of minimal
degree that has u as a root, and if f(x) is any polynomial in K[x] with f(u) = 0, then
p(x) | f(x).
Alternate proof: The proof in the text uses some elementary ring theory. Then decided to include
a proof that depends only on basic facts about polynomials.
Assume that u F is algebraic over K, and let I be the set of all polynomials f(x) K[x] such that
f(u) = 0. The division algorithm for polynomials can be used to show that if p(x) is a non-zero
monic polynomial in I of minimal degree, then p(x) is a generator for I, and thus p(x) | f(x)
whenever f(u) = 0.
Furthermore, p(x) must be an irreducible polynomial, since if p(x) = g(x)h(x) for g(x); h(x) K[x],
then g(u)h(u) = p(u) = 0, and so either g(u) = 0 or h(u) = 0 since F is a field. From the choice of p(x)
as a polynomial of minimal degree that has u as a root, we see that either g(x) or h(x) has the
same degree as p(x), and so p(x) must be irreducible.
In the above proof, the monic polynomial p(x) of minimal degree in K[x] such that p(u) = 0 is Notes
called the minimal polynomial of u over K, and its degree is called the degree of u over K.
Let F be an extension field of K, and let u1, u2, ..., un F. The smallest subfield of F that contains
K and u1, u2,..., un will be denoted by K(u1, u2,..., un). It is called the extension field of K generated
by u1, u2,...., un. If F = K(u) for a single element u F, then F is said to be a simple extension of K.
Let F be an extension field of K, and let u F. Since K(u) is a field, it must contain all elements of
the form
a 0 + a 1 u + a 2 u 2 + ... + a m u m
,
b 0 + b1 u + b 2 u 2 + ... + b n u n
where ai, bj K for i = 1,..., m and j = 1,... n. In fact, this set describes K(u), and if u is transcendental
over K, this description cannot be simplified. On the other hand, if u is algebraic over K, then the
denominator in the above expression is unnecessary, and the degree of the numerator can be
assumed to be less than the degree of the minimal polynomial of u over K.
If F is an extension field of K, then the multiplication of F defines a scalar multiplication,
considering the elements of K as scalars and the elements of F as vectors. This gives F the
structure of a vector space over K, and allows us to make use of the concept of the dimension of
a vector space. The next result describes the structure of the extension field obtained by adjoining
an algebraic element.
(b) If the minimal polynomial of u over K has degree n, then K(u) is an n-dimensional vector
space over K.
Alternate proof: The standard proof uses the ring homomorphism : K[x] F defined by
evaluation at u. Then the image of is K(u), and the kernel is the ideal of K[x] generated by the
minimal polynomial p(x) of u over K. Since p(x) is irreducible, ker() is a prime ideal, and so
K[x] = ker() is a field because every nonzero prime ideal of a principal ideal domain is maximal.
Thus K(u) is a field since K(u) 245= K[x]= ker().
The usual proof involves some ring theory, but the actual ideas of the proof are much simpler.
To give an elementary proof, define : K[x]= {p(x)} K(u) by ([f(x)]) = f(u), for all congruence
classes [f(x)] of polynomials (modulo p(x)). This mapping makes sense because K(u) contains u,
together with all of the elements of K, and so it must contain any expression of the form a0 +a1u+
... + amum, where ai K, for each subscript i. The function is well-defined, since it is also
independent of the choice of a representative of [f(x)]. In fact, if g(x) K[x] and f(x) is equivalent
to g(x), then f(x) – g(x) = q(x)p(x) for some q(x) K[x], and so f(u) – g(u) = q(u)p(u) = 0, showing
that ([f(x)]) = ([g(x)]).
Since the function simply substitutes u into the polynomial f(x), and it is not difficult to show
that it preserves addition and multiplication. It follows from the definition of p(x) that
is one-to-one. Suppose that f(x) represents a nonzero congruence class in K[x]= {p(x)}. Then
p(x) | f(x), and so f(x) is relatively prime to p(x) since it is irreducible. Therefore, there exist
polynomials a(x) and b(x) in K[x] such that a(x)f(x) + b(x)p(x) = 1. It follows that [a(x)][f(x)] = [1]
for the corresponding equivalence classes, and this shows that K[x] /{p(x)} is a field. Thus the
image E of in F must be subfield of F. On the one hand, E contains u and K, and on the other
hand, we have already shown that E must contain any expression of the form a0 + a1u + ... + amum,
where ai K. It follows that E = K(u), and we have the desired isomorphism.
(b) It follows from the description of K(u) in part (a) that if p(x) has degree n, then the set
B = {1, u, u2,..., un-1} is a basis for K(u) over K.
Notes Let F be an extension field of K. The dimension of F as a vector space over K is called the degree
of F over K, denoted by [F : K]. If the dimension of F over K is finite, then F is said to be a finite
extension of K. Let F be an extension field of K and let u F. The following conditions are
equivalent: (1) u is algebraic over K; (2) K(u) is a finite extension of K; (3) u belongs to a finite
extension of K.
Never underestimate the power of counting: the next result is crucial. If we have a tower of
extensions K E F, where E is finite over K and F is finite over E, then F is finite over K, and
[F : K] = [F : E][E : K]. This has a useful corollary, which states that the degree of any element of
F is a divisor of [F : K].
Let K be a field and let f(x) = a0 + a1x + ... + anxn be a polynomial in K[x] of degree n > 0. An
extension field F of K is called a splitting field for f(x) over K if there exist elements r1, r2,..., rn
F such that
In this situation we usually say that f(x) splits over the field F. The elements r1, r2,..., rn are roots
of f(x), and so F is obtained by adjoining to K a complete set of roots of f(x). An induction
argument (on the degree of f(x)) can be given to show that splitting fields always exist. Theorem
states that if f(x) K[x] is a polynomial of degree n > 0, then there exists a splitting field F for f(x)
over K, with [F : K] n!.
The uniqueness of splitting fields follows from two lemmas. Let : K L be an isomorphism of
fields. Let F be an extension field of K such that F = K(u) for an algebraic element u F. Let p(x)
be the minimal polynomial of u over K. If v is any root of the image q(x) of p(x) under , and
E = L(v), then there is a unique way to extend to an isomorphism : F E such that (u) = v and
(a) = (a) for all a K. The required isomorphism : K(u) L(v) must have the form
Self Assessment
2. The monic polynomial P(x) of minimal degree in K[x] such that P(u) = 0 is called is
............... of r over K and its degree is called the degree of u over K.
(a) maximal polynomial (b) minimal polynomial
4. The splitting field over the field K of a polynomial f(x) K[x] is unique up to ............... Notes
26.2 Summary
If F is an extension field of K, and u F is algebraic over K, then there exists a unique monic
irreducible polynomial p(x) K[x] such that p(u) = 0. It is the monic polynomial of minimal
degree that has u as a root, and if f(x) is any polynomial in K[x] with f(u) = 0, then
p(x) | f(x).
Alternate Proof: The proof in the text uses some elementary ring theory. I’ve decided to include
a proof that depends only on basic facts about polynomials.
Let F be an extension field of K, and let u1, u2, ..., un F. The smallest subfield of F that contains
K and u1, u2,..., un will be denoted by K(u1, u2,..., un). It is called the extension field of K generated
by u1, u2,...., un. If F = K(u) for a single element u F, then F is said to be a simple extension of K.
Let F be an extension field of K, and let u F. Since K(u) is a field, it must contain all elements of
the form
a 0 + a 1 u + a 2 u 2 + ... + a m u m
,
b 0 + b1 u + b 2 u 2 + ... + b n u n
where ai, bj K for i = 1,..., m and j = 1,... n. In fact, this set describes K(u), and if u is transcendental
over K, this description cannot be simplified. On the other hand, if u is algebraic over K, then the
denominator in the above expression is unnecessary, and the degree of the numerator can be
assumed to be less than the degree of the minimal polynomial of u over K.
The uniqueness of splitting fields follows from two lemmas. Let : K L be an isomorphism of
fields. Let F be an extension field of K such that F = K(u) for an algebraic element u F. Let p(x)
be the minimal polynomial of u over K. If v is any root of the image q(x) of p(x) under , and
E = L(v), then there is a unique way to extend to an isomorphism : F E such that (u) = v and
(a) = (a) for all a K. The required isomorphism : K(u) L(v) must have the form
The second lemma is stated as follows. Let F be a splitting field for the polynomial f(x) K[x].
If : K L is a field isomorphism that maps f(x) to g(x) L[x] and E is a splitting field for g(x)
over L, then there exists an isomorphism : F E such that (a) = (a) for all a K. The proof uses
induction on the degree of f(x), together with the previous lemma.
The splitting field over the field K of a polynomial f(x) K[x] is unique up to isomorphism.
26.3 Keywords
Splitting Field: Beginning with a field K, and a polynomial f(x) K, we need to construct the
smallest possible extension field F of K that contains all of the roots of f(x). This will be called a
splitting field for f(x) over K.
Notes Extension Field: Let F be an extension field of K and let u F. If there exists a nonzero polynomial
f(x) K[x] such that f(u) = 0, then u is said to be algebraic over K. If not, then u is said to be
transcendental over K.
In the above proof, the monic polynomial p(x) of minimal degree in K[x] such that p(u) = 0 is
called the minimal polynomial of u over K, and its degree is called the degree of u over K.
4. Find the degree of the splitting field over Z2 for the polynomial (x3 + x + 1)(x2 + x + 1).
5. Let F be an extension field of K. Show that the set of all elements of F that are algebraic
over K is a subfield of F.
6. Let F be a field generated over the field K by u and v of relatively prime degrees m and n,
respectively, over K. Prove that [F : K] = mn.
7. Let F E K be extension fields. Show that if F is algebraic over E and E is algebraic over
K, then F is algebraic over K.
8. Let F K be an extension field, with u F. Show that if [K(u) : K] is an odd number, then
K(u2) = K(u).
9. Find the degree [F : Q], where F is the splitting field of the polynomial x3 – 11 over the field
Q of rational numbers.
archives.math.utk.edu
CONTENTS
Objectives
Introduction
27.1 Separability
27.2 Summary
27.3 Keywords
Objectives
Introduction
In the last unit, you have studied about the splitting field and extension field. This unit will
provide you information related to separable extension.
27.1 Separability
Separability of a finite field extension L/K can be described in several different ways. The
original definition is that every element of L is separable over K (that is, has a separable
minimal polynomial in K[X]). We will give here three descriptions of separability for a finite
extension and use each of them to prove two theorems about separable extensions.
Theorem 1: Let L/K be a finite extension. Then L/K is separable if and only if the trace function
TrL/K : L K is not identically 0.
Theorem 2: Let L/K be a finite extension. Then L/K is separable if and only if the ring K K L
has no non-zero nilpotent elements. When L/K is separable, the ring K K L is isomorphic to
[ L:K ]
K .
Example: Consider the extension Q( 2 )=Q. Since Q( 2 ) Q[X]/(X2 – 2), tensoring with
2
Q gives Q Q Q 2 ; Q[X]/(X 2) Q[X]/((X 2 )(X 2 ) Q Q,
which is a product of 2 copies of Q (associated to the 2 roots of X2 2) and has no nilpotent
elements besides 0.
Notes
Example: Consider the extension F2( u )/F2(u). Since F2( u ) F2[X]/(X2 – u),
Theorem 3: Let L/K be a finite extension. Then L is separable over K if and only if any derivation
of K has a unique extension to a derivation of L.
For above two proofs, the reader should be comfortable with the fact that injectivity and
surjectivity of a linear map of vector spaces can be detected after a base extension: a linear
map is injective or surjective if and only if its base extension to a larger field is injective or
surjective.
Each of the three theorems above will be proved and then lead in its own way to proofs of the
following two theorems.
Theorem 4: If L = K(a1,....., ar) and each ai is separable over K then every element of L is separable
over K (so L/K is separable).
Theorem 5: Let L/K be a finite extension and F be an intermediate field. If L/F and F/K are
separable then L/K is separable.
We will use our new viewpoints to define separability for arbitrary (possibly non-algebraic)
field extensions.
We want to show L/K is separable if and only if TrL/K : L K is not identically 0. The trace map
is either identically 0 or it is onto, since it is K-linear with target K, so another way of putting
Theorem 1 is that we want to show L/K is separable if and only if the trace from L to K is onto.
Proof: We might as well take K to have positive characteristic p, since in characteristic 0 all
finite field extensions are separable and the trace is not identically 0 : TrL/K(1) = [L : K] 0 in
characteristic 0.
If L/K is separable, by the primitive element theorem we can write L = K() where is separable
over K. To show the trace is surjective for finite separable extensions, it suffices to prove surjectivity
of the trace map on K()/K when K is any base field and is separable over K.
If L/K is inseparable, then there must be some a L which is inseparable over K. Since
TrL/K = TrK()/K o TrL/K(), it success to prove the trace map on K()=K vanishes when is inseparable
over K.
For both cases of the field extension K()/K ( separable or inseparable over K), let have
minimal polynomial (X) in K[X]. Write (X) = (Xpm) where m is as large as possible, so (X)
is separable. Thus (X) is separable if and only if m = 0.
Notes
Consider now the extension of scalars up to K of the trace map TrK(a)/K : K(a) K:
Since tensoring with a field extension preserves injectivity and surjectivity of a linear map,
Since K() K[X] /((X)) as K-algebras, K( ) K[X]/( (X)) as K -algebras, and thus is isomorphic
m
to the direct product of the rings K [X]/ (X p – i ). The trace is the sum of the traces to K on each
m m
K[X]/(X p – i ). Let’s look at the trace from K[X]/(X p – i ). to K .
m m m m
In K [X], X p – i = (X – i )p . Then K[X]/(X p – i ) = K[Y]/(Y p ), where Y = X i. If m = 0, then
m m
K [Y ] / (Yp ) = K , so the trace to K is the identity. If m > 0, any element of K [Y ]/ (Yp ) is the
sum of a constant plus a multiple of Y , which is a constant plus a nilpotent element (since Y
r m
mod Yp is nilpotent). Any constant in K [Y ]/ (Yp ) has trace 0 since pm = 0 in K (because
m
m > 0). A nilpotent element has trace 0. Thus the trace to K of any element of K [Y ] / (Yp ) is 0.
To summarize, when is separable over K (i.e., m = 0), the trace map from K() to K is onto since
it is onto after extending scalars to K . When a is inseparable over K (i.e., m > 0), the trace map
is identically 0 since it vanishes after extending scalars.
Since i is separable over K and the minimal polynomial of i over Li-1 divides its minimal
polynomial over K, i is separable over Li-1. Therefore TrLi-1(i)/Li-1 : Li Li-1 is onto from the
proof of Theorem 1, so the composite map TrL/K : L K is onto. Therefore L/K is separable by
Theorem 1.
Corollary: Theorem 1 implies Theorem 5.
Proof: By Theorem 1 and the hypothesis of Theorem 5, both TrL/F and TrF/K are onto. Therefore,
their composite TrL/K is onto, so L/K is separable by Theorem 1.
Proof: We will begin with the case of a simple extension L = K(). Let (X) be the minimal
polynomial of over K, so L K[X]/((X)) as K-algebras and
K K L K[X]/ (X)
as K -algebras. This ring was considered in the proof of Theorem 1, where we saw its structure
is different when (X) is separable or inseparable. If (X) is separable in K[X], then K[X]/( K (X))
Notes
is a product of copies of the field K , so it has no non-zero nilpotent elements. If (X) is inseparable,
m
then K [X]/((X)) is a product of copies of rings K [Y ]/( Yp ) with m > 0, which all have
nonzero nilpotents.
linear factors in K .
If L/K is inseparable, then some a L is inseparable over K. Tensoring the inclusion map
K() L up to K , we have an inclusion
K K K() K K L.
The ring K K K() has a non-zero nilpotent element by the first paragraph of the proof, so
K K L does as well.
Corollary: The proof of Theorem 2 implies Theorem 4.
K K L (K K L1 ) L1 L
as K -algebras. Therefore
[L 1 :K ]
K K L K L 1 L (K L1 L)[L1 :K ]
Since L = L1(2,... r) with each i separable over L1, we can run through the same computation
for K L2 L as we did for K K L, and we get K L1 L (K L2 L)[L2 :L1 ] , so
K KL ( K KF)F L
( K F L)[F:K]
[L:F][F:K] Notes
K since L=F is separable
K [L:K]
Thus L/K is separable by Theorem 1.2.
Theorem 6: Let L/K be an extension of fields, and L be algebraic over K. Then is separable
over K if and only if any derivation on K has a unique extension to a derivation on K().
Proof: When L is separable over K, Corollary B.10 shows any derivation on K extends
uniquely to a derivation on K().
Now suppose L is inseparable over K. Then ’(X) = 0, where (X) is the minimal polynomial
of over K. In particular ’() = 0. We are going to use this vanishing of ’() to construct a
nonzero derivation on K() which extends the zero derivation on K.
Then the zero derivation on K has two lifts to K(): the zero derivation on K() and this other
derivation we will construct.
Define Z : K() K() by Z(f()) = f’(), where f(X) K[X]. Is this well-defined?
It is left to the reader to check Z is a derivation on K(). This derivation kills K, but Z() = 1, so
Z extends the zero derivation on K while not being the zero derivation itself.
The reader can check more generally that when is inseparable over K and K() is arbitrary
the map f() f’() is a derivation on K() that extends the zero derivation on K and sends
to . So there are many extensions of the zero derivation on K to K(): one for each element of
K().
We need a lemma to put inseparable extensions into a convenient form for our derivation
constructions later.
Lemma: Let L/K be a finite inseparable field extension. Then there is an L and intermediate
field F such that L = F() and is inseparable over F.
Proof: Inseparable field extensions only occur in positive characteristic. Let p be the characteristic
of K. Necessarily [L : K] > 1. Since L/K is inseparable, there is some L that is inseparable over
K.
Write L = K(1,.... r). We will show by contradiction that some i has to be inseparable over K.
Assume every i is separable over K. Then we can treat L/K as a succession of simple field
extensions as in (2.2), where Li = Li-1(i) with i separable over Li-1. By Theorem, any derivation
on Li-1 extends to a derivation on Li, so any derivation on K extends to a derivation on L.
Moreover, this extended derivation on L is unique. To show that, consider two derivations D and
D’ on L that are equal on K. Since L1 = K(1) and 1 is separable over K, the proof of Corollary
B.10 tells us that D and D’ both send L1 to L1 and are equal on L1. Now using L1 in place of K, D and
D’ being equal on L1 implies they are equal on L2 since L2 = L1(2) and 2 is separable over L1. We
can keep going like this until we get D = D’ on Lr = L. As a special case of this uniqueness, the only
derivation on L which vanishes on K is the zero derivation on L.
Notes Now replace K as base field with K(), over which the i’s are of course still separable. Then any
derivation on K() extends uniquely to a derivation on L. But in the proof of Theorem we saw
there is a non-zero derivation Z on K() that vanishes on K, and an extension of that to a
derivation on L is non-zero on L and is zero on K. We have a contradiction of the uniqueness of
extensions, so in any set of field generators {1,...., r}, some i must be inseparable in K.
Choose a generating set {1,....,r} with as few inseparable elements as possible. At least one i
is inseparable over K and we may assume that r is one of them. Set = r and F = K(1,....r-1)
(so F = K if r = 1). Then L = F(). We will show by contradiction that must be inseparable
over F, which is the point of the lemma.
Suppose is separable over F. Then is separable over the larger field F(p) since its minimal
polynomial over F(p) divides its minimal polynomial over F. Since is a root of Xp – p
F(p)[X], its (separable) minimal polynomial in F(p)[X] is a factor of this, so that polynomial
k k+1
must be X – . Therefore, F(p). Taking pk-th powers for any k 0, ap F(ap ), so
k k+1
F(ap ) F(ap ).
k k+1
p p
The reverse inclusion is obvious, so F(a ) = F(a ) for all k 0. Therefore,
k
L = F() = F( p ) = K( 1 ,..., r-1 , rpk )
k
p
for any k 0. We can pick k so that is separable over K (why?). Then the generating set
k
{ 1 ,..., r 1 ,apr } has with one less inseparable element among the field generators. This
contradicts the choice of generators to have as few members in the list as possible that are
inseparable over K, so has to be inseparable over F.
Proof: Assume L/K is separable, so by the primitive element theorem L = K() where is
separable over K. Any derivation on K can be extended (using Theorem) uniquely to a derivation
on L.
If L/K is inseparable, then Lemma lets us write L = F() with inseparable over F, and
F K. The, by a construction used in the proof of Theorem, f() f’() with f(X) F[X] is a
nonzero derivation on L which is zero on F, and thus also zero on the smaller field K. This shows
the zero derivation on K has a non-zero extension (and thus two extensions) to a derivation on L.
Lemma: Let L/K be a finite extension and F be an intermediate extension such that F/K is
separable. Then any derivation F L which sends K to K has values in F.
Proof: Pick F, so is separable over K. Now use Corollary B.10 to see the derivation F L
sends to an element of K() F.
Proof: To prove L/K is separable, we want to show any derivation on K has a unique extension
to a derivation on L. Since F/K is separable, a derivation on K extends to a derivation on F. Since
L/F is separable, a derivation on F extends to a derivation on L.
For uniqueness, let D1 and D2 be derivations on L which extend the same derivation on K. Since Notes
D1(K) K and D2(K) K, we have D1(F) F and D2(F) F by Lemma. Then D1 = D2 on F since
F/K is separable, and D1 = D2 on L since L/F is separable.
When L/K is an algebraic extension of possibly infinite degree, here is the way separability is
defined.
Definition: An algebraic extension L/K is called separable if every finite subextension of L=K is
separable. Equivalently, L=K is separable when every element of L is separable over K.
This definition makes no sense if L/K is not an algebraic extension since a non-algebraic extension
is not the union of its finite subextensions.
Theorem 1 has a problem in the infinite-degree case: there is no natural trace map. However, the
conditions in Theorems 2 and 3 both make sense for a general L/K. (In the case of Theorem 2,
we have to drop the specification of K K L as a product of copies of K , and just leave the
statement about the tensor product having no non-zero nilpotent elements.) It is left to the
reader to check for an infinite algebraic extension L/K that the conditions of Theorems 2 and 3
match Definition.
The conditions in Theorems 2 and 3 both make sense if L/K is not algebraic, so they could each
potentially be used to define separability of a completely arbitrary field extension. But there is
a problem: for transcendental (that is, non-algebraic) extensions the conditions in Theorems 2
and 3 are no longer equivalent. Indeed, take L = K(u), with u transcendental over K. Then
K K L = K(u) is a field, so the condition in Theorem 2 is satisfied. However, the zero derivation
on K has more than one extension to K(u): the zero derivation on K(u) and differentiation with
respect to u on K(u).
Definition: A commutative ring with no nonzero nilpotent elements is called reduced.
A domain is reduced, but a more worthwhile example is a product of domains, like F3 × Q[X],
which is not a domain but is reduced.
Definition: An arbitrary field extension L/K is called separable when the ring K K L is reduced.
Using this definition, in characteristic 0 all field extensions are separable. In characteristic p, any
purely transcendental extension is separable. The condition in Theorem 3, that derivations on
the base field admit unique extensions to a larger field, characterizes not separable field extensions
in general, but separable algebraic field extensions.
A condition equivalent to that in Definition is that F K L is reduced as F runs over the finite
extensions of K.
The condition that K K L is reduced makes sense not just for field extensions L/K, but for any
commutative K-algebra. Define an arbitrary commutative K-algebra A to be separable when
the ring K K A is reduced. This condition is equivalent to A F being reduced for every finite
extension field F/K.
Example: Let A = K[X]/(f(X)) for any non-constant f(X) K[X]. The polynomial f(X) need
not be irreducible, so A might not be a field. It is a separable K-algebra precisely when f(X) is a
separable polynomial in K[X].
Notes Traces
Theorem 7: In the above notation, TrA/K = TrL/K TrA/L. In particular, if a L, then TrA/K(a) =
[A : L]TrL/K(a).
Proof: Let (e1; : : : ; em) be an ordered L-basis of A and (f1; : : : ; fn) be an ordered K-basis
For a A, let
m n
ae j c ije i , cij fs bijrs fr ,
i 1 r 1
Thus
= Tr L /K (c ii )
i
= b iirr
i r
= TrA/K(a).
Theorem 8: Let A and B be finite-dimensional K-algebras. For (a; b) in the product ring A × B,
Tr(A×B)/K(a, b) = TrA/K(a) + TrB/K(b).
[m a ] 0
matrix , whose trace is TrA/K(a) + TrB/K(b).
0 [m b ]
Theorem 9: Let A be a finite-dimensional K-algebra, L/K be a field extension, and B = LKA be Notes
the base extension of A to an L-algebra. For a A, TrB/L(1a) = TrA/K(a).
n
Proof: Let e1,....., en be a K-basis of A. Write aej = i1
c ije i , so the matrix for ma in this basis is (cij).
n
(1 a) (1 ej) = 1 aej = c (1 e ),
i 1
ij i
so the matrix for m1a on B is the same as the matrix for ma on A. Thus TrA/K(a) = TrB/L(1 a).
Remark: Because m1a and ma have the same matrix representation, not only are their traces the
same but their characteristic polynomials are the same.
Theorem 10: Let A be a finite-dimensional K-algebra. For any field extension L/K, the base
extension by K of the trace map A K is the trace map L K A L. That is, the function
idTrA/K : LK A L which sends an elementary tensor x a to xTrA/K(a) is the trace map
Tr(LKA)/L.
Proof: We want to show Tr(LKA)/L(t) = (idTrA/K)(t) for all t LKA. The elementary tensors
additively span LK A so it succes to check equality when t = x a for x K and a A. This means
we need to check Tr(LKA)/K(x a) = xTrA/K(a).
n
Pick a K-basis e1,..., en for A and write aej = i1
c ije i with cij K. The elementary tensors
1 e1,..., 1 en are an L-basis of LK A and
n n
(x a)(1 e j ) x ae j c ij (x e i ) c ij x(1 e i )
i 1 i 1
by the definition of the L-vector space structure on LKA. So the matrix for multiplication by
x a in the basis {1 ei} is (cijx), which implies
n n
Tr( L KA ) /L (x a) cii x x cii xTrA /K (a).
i1 i 1
Derivations
A derivation is an abstraction of differentiation on polynomials. We want to work with
derivations on fields, but polynomial rings will intervene, so we need to understand derivations
on rings before we focus on fields.
Let R be a commutative ring and M be an R-module (e.g., M = R). A derivation on R with values
in M is a map D : R M such that D(a + b) = D(a) + D(b) and D(ab) = aD(b) + bD(a). Easily, by
induction D(an) = nan-1D(a) for any n 1. When M = R, we will speak of a derivation on R.
Example: For any commutative ring A, differentiation with respect to X on A[X] is a derivation
on A[X] (R = M = A[X]).
Example: Let R = A[X] and M = A as an R-module by f(X)a := f(0)a. Then D: R M by D(f)
= f’(0) is a derivation.
This is the application of D coefficentwise to f(X). The operation f fD is a derivation on R[X] (to
check the product rule, it suffices to look at monomials).
Notes If R = F2[u] and D is the usual u-derivative on F2[u], then the polynomial f(X) = (u3 + u)X4 + uX3
+ u2X + 1 in R[X] has fD(X) = (u2 + 1)X4 + X3.
Any element of R satisfying D(a) = 0 is called a D-constant, or just a constant if the derivation is
understood. The constants for a derivation form a subring. For instance, from the product rule,
taking a = b = 1, we obtain D(1) = 0.
Example: The set of all constants for X-differentiation on K[X] is K when K has characteristic 0 and
K[Xp] when K has characteristic p.
Example: If D: R R is a derivation and f fD is the corresponding derivation on R[X], its ring of
constants is C[X], where C is the constants for D.
We will generally focus on derivations from R to R, although it will be convenient to allow
R-modules as the target space for derivations in Corollary, which is used in the main text in the
proofs of Theorem 3 and Lemma.
Example: Let’s check that any derivation on K[X] which has the elements of K among its constants
has the form D(f) = hf’ for some h K[X]. (When h = 1, this is the usual X-derivative.)
When K is among the constants of D, D is K-linear: D(cf) = cD(f) + fD(c) = cD(f). Therefore, D is
determined by what it does to a K-basis of K[X], such as the power functions Xn. By induction,
D(Xn) = nXn-1D(X) for all n 1. Therefore, by linearity, D(f) = f’(X)D(X) for every f K[X]. Set
h = D(X).
Theorem 11: Let R be a domain with fraction field K. Any derivation D: R K uniquely extends
to D (a/b) = (bD(a) – aD(b))/b2.
: K K, given by the quotient rule: D
Therefore in K,
To see, conversely, that this formula does give a derivation D on K, first we check it is well-
defined: if a/b = c/d (with b and d nonzero), then ad = bc, so
= 0 since ad = bc.
That D
satisfies the sum and product rules is left to the reader to check.
Theorem 12: Let L/K be a finite extension of fields, and D: K K be a derivation. Suppose Notes
a L is separable over K, with minimal polynomial (X) K[X]. That is, (X) is irreducible in
K[X], () = 0, and ’() 0. Then D has a unique extension from K to a derivation on the field
K(), and it is given by the rule
D D(a)
D(f()) = f ( ) – f'( )
'(a)
r
i
D() = D(f()) = (D(c )
i + ci (i i-1D( ))) = f D ( ) + f'( )D( ).
i=0
Taking f(X) = (X) to be the minimal polynomial of over K, f() = 0, so if D has an extension to
K() then (B.2) becomes
0 = D() + ’()D(),
which proves (since ’() 0) that D() must be given by the formula –D()/’(). Plugging this
formula for D(), shows D() must be given by the formula. Since was a general element of
K(), this proves D has at most one extension to a derivation on K().
Now, to show the formula works, we start over and define
D D ( )
D(f()) : = f (a) – f'( ) .
'( )
Suppose f1() = f2() for f1; f2 K[X]. Then f1(X) f2(X) mod (X), say
f1(X) = f2(X) + (X)k(X)
for some k(X) K[X]. Differentiating both sides with respect to X in the usual way,
D (a) D ( )
–f'1 ( ) f '2 (a) D ( )k( ).
' a '( )
We want to add f1D ( ) to both sides. First, apply D to the coefficients in (B.3), which is a derivation
on K[X], to get
Therefore,
D (a) D D (a)
f1D ( ) f2' ( ) f2 ( ) D ( )k( ) f2' ( ) D ( )k( )
'(a) '(a)
D ( )
f2D ( ) f2' (a) .
'( )
This proves the formula for a derivation on K() is well-defined. It is left to the reader to check
this really is a derivation.
Example: In contrast with Theorem 12, consider K = Fp(u) and L = K() where is a root
of Xp – u K[X]. This is an inseparable irreducible polynomial over K. The u-derivative on K
does not have any extension to a derivation on L. Indeed, suppose the u-derivative on K has an
extension to L, and call it D. Applying D to the equation ap = u gives
pp-1D() = D(u).
The left side is 0 since we’re in characteristic p. The right side is 1 since D is the u-derivative on
Fp(u). This is a contradiction, so D does not exist.
Corollary: Let L/K be a finite extension of fields. For any derivation D: K L and L
which is separable over K, D has a unique extension to a derivation K() L. If D(K) K then
D(K()) K().
Proof: Follow the argument in the proof of Theorem 12, allowing derivations to have values in
L rather than in K(). The formula for D(f()) still turns out to be the same as in (B.1). In
particular, if D(K) K then the extension of D to a derivation on K() actually takes values in
K().
Self Assessment
2. An arbitrary field extension ................... is called separable when the ring K u L is reduced.
5. If D : R R is a derivatives and f F0 is the corresponding derivation on R[x] from its ring Notes
of constants in C[x], where ................... is the constant for ..................., f(x) =
1
a x 1 in R[x] and fd(x) D(ai )X i
u
(a) D, C (b) C, D
(c) X, C (d) C, D
27.2 Summary
Let L/K be a finite extension. Then L is separable over K if and only if any derivation of K
has a unique extension to a derivation of L.
If L = K(a1,....., ar) and each ai is separable over K then every element of L is separable over
K (so L/K is separable).
Let L/K be a finite extension and F be an intermediate field. If L/F and F/K are separable
then L/K is separable.
An arbitrary field extension L/K is called separable when the ring K K L is reduced.
27.3 Keywords
Separability: Separability of a finite field extension L/K can be described in several different
ways.
Commutative Ring: A commutative ring with no nonzero nilpotent elements is called reduced.
Domain: A domain is reduced, but a more worthwhile example is a product of domains, like
F3 × Q[X], which is not a domain but is reduced.
CONTENTS
Objectives
Introduction
28.4 Summary
28.5 Keywords
Objectives
Introduction
In the last unit, you have studied about extension field. This unit will provide information
related to Galois theory.
This gives the definition of the Galois group and some results that follow immediately from the
definition. We can give the full story for Galois groups of finite fields.
We use the notation Aut(F) for the group of all automorphisms of F, that is, all one-to-one
functions from F onto F that preserve addition and multiplication. The smallest subfield containing
the identity element 1 is called the prime subfield of F. If F has characteristic zero, then its prime
subfield is isomorphic to Q, and if F has characteristic p, for some prime number p, then its
prime subfield is isomorphic to Zp. In either case, for any automorphisms of F we must have
(x) = x for all elements in the prime subfield of F.
To study solvability by radicals of a polynomial equation f(x) = 0, we let K be the field generated
by the coefficients of f(x), and let F be a splitting field for f(x) over K. Galois considered
permutations of the roots that leave the coefficient field fixed. The modern approach is to
consider the automorphism determined by these permutations. The first result is that if F is an
extension field of K, then the set of all automorphism : F F such that (a) = a for all a K is
a group under composition of functions. This justifies the following definitions.
This result can be used to compute the Galois group of any finite extension of any finite field, but
first we need to review the structure of finite fields. If F is a finite field of characteristic p, then it
is a vector space over its prime subfield Zp, and so it has pn elements, where [F : Zp] = n. The
structure of F is determined by the following theorem.
Theorem 2: If F is a finite field with pn elements, then F is the splitting field of the polynomial
n
xp – x over the prime subfield of F.
n
The description of the splitting field of xp – x over Zp shows that for each prime p and each
positive integer n, there exists a field with pn elements. The uniqueness of splitting fields shows
that two finite fields are isomorphic iff they have the same number of elements. The field with
pn elements is called the Galois field of order pn, denoted by GF(pn). Every finite field is a simple
extension of its prime subfield, since the multiplicative group of nonzero elements is cyclic, and
this implies that for each positive integer n there exists an irreducible polynomial of degree n in
Zp[x].
n
p
If F is a field of characteristic p, and n Z+, then {a F | a = a} is a subfield of F, and this
observation actually produces all subfields. In fact, Proposition 6.5.5 has the following statement:
Let F be a field with pn elements. Each subfield of F has pm elements for some divisor m of n.
Conversely, for each positive divisor m of n there exists a unique subfield of F with pm elements.
If F is a field of characteristic p, consider the function : F F defined by (x) = xp. Since F has
characteristic p, we have (a + b) = (a + b)p = ap + bp = (a) + (b), because in the binomial expansion
of (a + b)p each coefficient except those of ap and bp is zero. (The coefficient (p!)/(k!(p – k)!)
contains p in the numerator but not the denominator since p is prime, and so it must be equal to
zero in a field of characteristic p.) It is clear that preserves products, and so is a ring
homomorphism. Furthermore, since it is not the zero mapping, it must be one-to-one. If F is
finite, then must also be onto, and so in this case is called the Frobenius automorphism of F.
n n-1 n
p p p
Note that n(x) = x (Inductively, n(x) = (n-1(x))p = (x ) p = x .) Using an appropriate power
of the Frobenius automorphism, we can prove that the Galois group of any finite field must be
cyclic.
Theorem 3: Let K be a finite field and let F be an extension of K with [F : K] = m. Then
Gal(F/K) is a cyclic group of order m.
Outline of the proof: We start with the observation that F has pn elements, for some positive
n
p
integer n. Then K has pr elements, for r = n/m, and F is the splitting field of x x over its
prime subfield, and hence over K. Since f(x) has no repeated roots, to conclude that |Gal(F/K)|
= m. Now define : F F to be the rth power of the Frobenius automorphism. That is, define
r Notes
(x) = x p . To compute the order of in Gal(F/K), first note that m is the identity since m(x) =
rm n
xp = xp x for all x F. But cannot have lower degree, since this would give a polynomial
with too many roots. It follows that is a generator for Gal(F/K).
In computing the Galois group of a polynomial, it is important to know whether or not it has
repeated roots. A field F is called perfect if no irreducible polynomial over F has repeated roots.
This section includes the results that any field of characteristic zero is perfect, and that any finite
field is perfect.
In the previous section, we showed that the order of the Galois group of a polynomial with no
repeated roots is equal to the degree of its splitting field over the base field. The first thing in this
section is to develop methods to determine whether or not a polynomial has repeated roots.
Let f(x) be a polynomial in K[x], and let F be a splitting field for f(x) over K. If f(x) has the
factorization f(x) = (x – r1 )m 1 ... (x – rt )m t over F, then we say that the root ri has multiplicity mi.
If mi = 1, then ri is called a simple root.
t
Let f(x) K[x], with f(x) = k=0
a k x k . The formal derivative f’(x) of f(x) is defined by the formula
t
f'(x) = k=0
ka k x k-1 , where kak denotes the sum of ak added to itself k times. It is not difficult to
show from this definition that the standard differentiation formulas hold. Proposition shows
that the polynomial f(x) K[x] has no multiple roots iff it is relatively prime to its formal
derivative f’(x). Proposition shows that f(x) has no multiple roots unless char(K) = p 0 and f(x)
has the form f(x) = a0 + a1xp + a2x2p + ... + anxnp.
A polynomial f(x) over the field K is called separable if its irreducible factors have only simple
roots. An algebraic extension field F of K is called separable over K if the minimal polynomial
of each element of F is separable. The field F is called perfect if every polynomial over F is
separable.
Theorem states that any field of characteristic zero is perfect, and a field of characteristic
p > 0 is perfect if and only if each of its elements has a pth root in the field. It follows immediately
from the theorem that any finite field is perfect.
To give an example of a field that is not perfect, let p be a prime number, and let K = Zp. Then in
the field K(x) of rational functions over K, the element x has no pth root. Therefore, this rational
function field is not perfect.
The extension field F of K is called a simple extension if there exists an element u F such that
F = K(u). In this case, u is called a primitive element. Note that if F is a finite field, then Theorem
shows that the multiplicative group Fx is cyclic. If the generator of this group is a, then it is easy
to see that F = K(a) for any subfield K. Theorem shows that any finite separable extension is a
simple extension.
Here we study the connection between subgroups of Gal(F/K) and fields between K and F. This
is a critical step in proving that a polynomial is solvable by radicals if and only if its Galois
group is solvable.
Self Assessment
1. If F has characteristics zero, then its prime subfield is isomorphic to Q and if F has
characteristics P, for some prime number P, then its prime subfield is ................ to Zp.
2. Let F be extension field of K. The set { Q Aut(F) | Q(a) = a for all a K } is Galois group Notes
is denoted by ................
(a) Gal(F/K) (b) Gal(u/F)
(c) Gal-1(K/F) (d) Gal(k × F)
3. Let K be a finite field and let F be an extension of K with [F : k] = m. Then Gal(F/k) is a
................ group of order m.
(a) cyclic (b) polynomial
(c) permutation (d) finite
4. A polynomial f(x) over the field k is called ................ if its irreducible factors have only
simple roots.
5. The ................ F of K is called simple extensions. If then exist an element u F. Such that
F = K(u).
(a) finite field (b) extension field
28.4 Summary
Let K be a field, let f(x) K[x], and let F be a splitting field for f(x) over K. Then Gal(F/K)
is called the Galois group of f(x) over K, or the Galois group of the equation f(x) = 0 over
K.
It states that if F is an extension field of K, and f(x) K[x], then any element of Gal(F/K)
defines a permutation of the roots of f(x) that lie in F. The next theorem is extremely
important.
Let K be a field, let f(x) K[x] have positive degree, and let F be a splitting field for f(x)
over K. If no irreducible factor of f(x) has repeated roots, then j Gal(F=K)j = [F : K].
This result can be used to compute the Galois group of any finite extension of any finite
field, but first we need to review the structure of finite fields. If F is a finite field of
characteristic p, then it is a vector space over its prime subfield Zp, and so it has pn elements,
where [F : Zp] = n. The structure of F is determined by the following theorem.
n
If F is a finite field with pn elements, then F is the splitting field of the polynomial xp – x
over the prime subfield of F.
Let K be a finite field and let F be an extension of K with [F : K] = m. Then Gal(F/K) is a
cyclic group of order m.
(The fundamental theorem of Galois theory) Let F be the splitting field of a separable
polynomial over the field K, and let G = Gal(F/K).
Prime Subfield: If F is a finite field with pn elements, then F is the splitting field of the polynomial
n
xp – x over the prime subfield of F.
The Fundamental Theorem of Galois Theory: Let F be the splitting field of a separable polynomial
over the field K, and let G = Gal(F/K).
(b) Find automorphisms of F that have fixed fields Q( 2 ), Q(i), and Q( 2 i),
respectively.
12. Let F be a finite, normal extension of Q for which |Gal(F=Q)| = 8 and each element of
Gal(F/Q) has order 2. Find the number of subfields of F that have degree 4 over Q.
13. Let F be a finite, normal, separable extension of the field K. Suppose that the Galois group
Gal(F/K) is isomorphic to D7. Find the number of distinct subfields between F and K. How
many of these are normal extensions of K?
14. Show that F = Q(i, 2 ) is normal over Q; find its Galois group over Q, and find all
intermediate fields between Q and F.
3
15. Let F = Q( 2 , 2 ). Find [F : Q] and prove that F is not normal over Q.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
29.3 Summary
29.4 Keywords
Objectives
Introduction
In the last unit, you have studied about Galois theory. In this unit, you will get information
related to computing the Galois groups.
Definition: Let G be a group acting on a set S. We say that G acts transitively on S if for each pair
of elements x,y in S there exist an element g in G such that y = gx.
If G is a subgroup of the symmetric group Sn, then G is called a transitive group if it acts
transitively on the set { 1, 2, ... , n }.
Proposition: Let f(x) be a separable polynomial over the field K, with roots r1 , ... , rn in its
splitting field F. Then f(x) is irreducible over K if and only if Gal(F/K) acts transitively on the
roots of f(x).
Lemma: Let p be a prime number, and let G be a transitive subgroup of Sp. Then any nontrivial
normal subgroup of G is also transitive.
Lemma: Let p be a prime number, and let G be a solvable, transitive subgroup of Sp. Then G
contains a cycle of length p.
Proposition: Let p be a prime number, and let G be a solvable, transitive subgroup of Sp. Then G
is a subgroup of the normalizer in Sp of a cyclic subgroup of order p.
Let f(x) be a polynomial of degree n over the field K, and assume that f(x) has roots r1, r2, ... , rn in Notes
its splitting field F. The element of F defined by
= (ri - rj)2,
where the product is taken over all i, j with 1 i < j n, is called the discriminant of f(x).
It can be shown that the discriminant of any polynomial f(x) can be expressed as a polynomial in
the coefficients of f(x), with integer coefficients. This requires use of elementary symmetric
functions, and lies beyond the scope of what we have chosen to cover in the book.
We have the following properties of the discriminant:
(i) 0 if and only if f(x) has distinct roots;
(ii) belongs to K;
(iii) If 0, then a permutation in Sn is even if and only if it leaves unchanged the sign of
1 i < j n(ri - rj )
Proposition: Let f(x) be a separable polynomial over the field K, with discriminant , and let F be
its splitting field over K. Then every permutation in Gal(F/K) is even if and only if is the square
of some element in K.
We now restrict our attention to polynomials with rational coefficients. The next lemma shows
that in computing Galois groups it is enough to consider polynomials with integer coefficients.
Then a powerful technique is to reduce the integer coefficients modulo a prime and consider the
Galois group of the reduced equation over the field GF(p).
Lemma: Let f(x) = xn + an-1 xn-1 + · · · + a1 x + a0 be a polynomial in Q[x], and assume that
ai = bi / d for d, b0, b1, ... , bn-1 in Z.
Then dn f(x/d) is monic with integer coefficients, and has the same splitting field over Q as f(x).
If p is a prime number, we have the natural mapping : Z[x] > Zp[x] which reduces each coefficient
modulo p. We will use the notation p(f(x)) = fp(x).
Theorem [Dedekind]: Let f(x) be a monic polynomial of degree n, with integer coefficients and
Galois group G over Q, and let p be a prime such that fp(x) has distinct roots. If fp(x) factors in Zp[x]
as a product of irreducible factors of degrees n1, n2, ... , nk, then G contains a permutation with the
cycle decomposition
(1,2, ... ,n1) (n1+1, n1+2, ... , n1+n2) · · · (n-nk+1, ... ,n),
Self Assessment
Notes 3. Let P be a prime number and G be a solvable, transitive subgroup of Sp. Then G is a
subgroup of the normalizer in Sp of a cyclic subgroup of order ..................
(a) P (b) G
(c) Sp (d) S
4. If f(x) be a polynomial of degree n over the field k and assume that f(x) has roots r1, r2,...rn
in its splitting field F. Then element of F defined by
(a) = (r1 – r3)2 (b) = 2(r1 – rj)2
(c) = (ri – rj)-2 (d) = 3(r1 – rj)3
29.3 Summary
Let f(x) be a separable polynomial over the field K, with roots r1 , ... , rn in its splitting field
F. Then f(x) is irreducible over K if and only if Gal(F/K) acts transitively on the roots of
f(x).
Let p be a prime number, and let G be a transitive subgroup of Sp. Then any non-trivial
normal subgroup of G is also transitive.
Let p be a prime number, and let G be a solvable, transitive subgroup of Sp. Then G
contains a cycle of length p.
Let p be a prime number, and let G be a solvable, transitive subgroup of Sp. Then G is a
subgroup of the normalizer in Sp of a cyclic subgroup of order p.
Let f(x) = x n + a n-1 x n-1 + · · · + a 1 x + a 0 be a polynomial in Q[x], and assume that
ai = bi / d for d, b0, b1, ... , bn-1 in Z.
Then dn f(x/d) is monic with integer coefficients, and has the same splitting field over Q as
f(x).
If p is a prime number, we have the natural mapping : Z[x] > Zp[x] which reduces each
coefficient modulo p. We will use the notation p(f(x)) = fp(x).
Let f(x) be a monic polynomial of degree n, with integer coefficients and Galois group G
over Q, and let p be a prime such that fp(x) has distinct roots. If fp(x) factors in Zp[x] as a
product of irreducible factors of degrees n1, n2, ... , nk, then G contains a permutation with
the cycle decomposition
(1,2, ... ,n1) (n1+1, n1+2, ... , n1+n2) · · · (n-nk+1, ... ,n),
29.4 Keywords
Transitive Group: If G is a subgroup of the symmetric group Sn, then G is called a transitive
group if it acts transitively on the set { 1, 2, ... , n }.
Separable Polynomial: Let f(x) be a separable polynomial over the field K, with roots r1 , ... , rn
in its splitting field F. Then f(x) is irreducible over K if and only if Gal(F/K) acts transitively on
the roots of f(x).
1. Give the order and describe a generator of the Galois group of GF (729) over GF(9).
2. Determine the Galois group of each of the following polynomials in Q[x]; hence, determine
the solvability of each of the polynomials
3. Find a primitive element in the splitting field of each of the following polynomials in
Q[x].
4. Prove that the Galois group of an irreducible quadratic polynomial is isomorphic to Z2.
5. Prove that the Galois group of an irreducible cubic polynomial is isomorphic to S3 or Z3.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
30.3 Keywords
Objectives
Introduction
In the last unit, you have studied about computing Galois theory and groups. In this unit, you
will get information related to fundamental theorem.
is a subfield of F.
Definition: Let F be a field, and let G be a subgroup of Aut (F). Then
is called the G-fixed subfield of F, or the G-invariant subfield of F, and is denoted by FG.
Proposition: If F is the splitting field over K of a separable polynomial and G = Gal(F/K), then
FG = K.
Lemma [Artin]: Let G be a finite group of automorphisms of the field F, and let K = FG. Then
[F : K] | G |.
Let F be an algebraic extension of the field K. Then F is said to be a normal extension of K if every
irreducible polynomial in K[x] that contains a root in F is a product of linear factors in F[x].
Example: The Galois group of GF(pn) over GF(p) is cyclic of order n, generated by the
automorphism defined by (x) = xp, for all x in GF(pn). This automorphism is usually known as
the Frobenius automorphism of GF(pn).
Let F be the splitting field of a separable polynomial over the field K, and let G = Gal(F/K).
H = Gal(F/FH).
(ii) If E is a subfield of F that contains K, then the corresponding subgroup of G is
H = Gal(F/E), and
E = F H.
In the statement of the fundamental theorem we could have simply said that normal subgroups
correspond to normal extensions. In the proof we noted that if E is a normal extension of K, then
(E) E for all in Gal(F/K). In the context of the fundamental theorem, we say that two
intermediate subfields E1 and E2 are conjugate if there exists in Gal(F/K) such that (E1) = E2.
The next result shows that the subfields conjugate to an intermediate subfield E correspond to
the subgroups conjugate to Gal(F/E). Thus E is a normal extension if and only if it is conjugate
only to itself.
Let F be the splitting field of a separable polynomial over the field K, and let E be a subfield such
that K E F, with H = Gal(F/E). If is in Gal(F/K), then
Gal(F/(E)) = H -1.
[Fundamental Theorem of Algebra]: Any polynomial in C[x] has a root in C.
Solution: The given condition is equivalent to the condition that F is the splitting field over K of
a separable polynomial. Since we must have G = Gal (F/K), the fundamental theorem of Galois
theory implies that the subfields between F and K are in one-to-one correspondence with the
subgroups of F. Because G is a finite group, it has only finitely many subgroups.
Notes
Example: Let F be the splitting field over K of a separable polynomial. Prove that if
Gal (F/K) is cyclic, then for each divisor d of [F:K] there is exactly one field E with K E F and
[E:K] = d.
Solution: By assumption we are in the situation of the fundamental theorem of Galois theory, so
that there is a one-to-one order-reversing correspondence between subfields of F that contain K
and subgroups of G = Gal (F/K). Because G is cyclic of order [F:K], there is a one-to-one
correspondence between subgroups of G and divisors of [F:K]. Thus for each divisor d of [F:K]
there is a unique subgroup H of index d. By the fundamental theorem, [FH: K] = [G:H], and so E
= F^H is the unique subfield with [E:K] = d.
Comment: Pay careful attention to the fact that the correspondence between subfields and
subgroups reverses the order
Example: Let F be a finite, normal extension of Q for which | Gal (F/Q) | = 8 and each
element of Gal (F/Q) has order 2. Find the number of subfields of F that have degree 4 over Q.
Solution: Since F has characteristic zero, the extension is automatically separable, and so the
fundamental theorem of Galois theory can be applied. Any subfield E of F must contain Q, its
prime subfield, and then [E:Q] = 4 iff [F:E] = 2, since [F:Q] = 8. Thus the subfields of F that have
degree 4 over Q correspond to the subgroups of Gal (F/Q) that have order 2. Because each
nontrivial element has order 2 there are precisely 7 such subgroups.
Example: Let F be a finite, normal, separable extension of the field K. Suppose that the
Galois group Gal (F/K) is isomorphic to D7. Find the number of distinct subfields between F and
K. How many of these are normal extensions of K?
Solution: The fundamental theorem of Galois theory converts this question into the question of
enumerating the subgroups of D7, and determining which are normal. If we use the usual
description of D7 via generators a of order 7 and b of order 2, with ba = a -1 b, then a generates a
subgroup of order 7, while each element of the form ai b generates a subgroup of order 2, for
0 i < 7. Thus there are 8 proper nontrivial subgroups of D7, and the only one that is normal is
< a >, since it has |D7| / 2 elements. As you should recall from the description of the conjugacy
classes of D7 conjugating one of the 2-element subgroups by a produces a different subgroup,
showing that none of them are normal.
Example: Show that F = Q ( 2 ,i) is normal over Q; find its Galois group over Q, and find
all intermediate fields between Q and F.
Solution: It is clear that F is the splitting field over Q of the polynomial (x2 + 1)(x2 – 2), and this
polynomial is certainly separable. Thus, F is a normal extension of Q.
It follows that the Galois group is isomorphic to Z2 × Z2. Since the Galois group has 3 proper
nontrivial subgroups, there will be 3 intermediate subfields E with Q E F.
The existence of 3 nontrivial elements begins with the splitting field of x4+1 over Q.
Self Assessment
30.2 Summary
H = Gal(F/FH).
(ii) If E is a subfield of F that contains K, then the corresponding subgroup of G is
H = Gal(F/E), and
E = F H.
Notes Let F be the splitting field of a separable polynomial over the field K, and let E be a
subfield such that K E F, with H = Gal(F/E). If is in Gal(F/K), then
Gal(F/(E)) = H -1.
[Fundamental Theorem of Algebra] Any polynomial in C[x] has a root in C.
30.3 Keywords
Normal Extension: Let F be an algebraic extension of the field K. Then F is said to be a normal
extension of K if every irreducible polynomial in K[x] that contains a root in F is a product of
linear factors in F[x].
Frobenius Automorphism: The Galois group of GF(pn) over GF(p) is cyclic of order n, generated
by the automorphism defined by (x) = xp, for all x in GF(pn). This automorphism is usually
known as the Frobenius automorphism of GF(pn).
1. Compute each of the following Galois groups. Which of these field extensions are normal
field extensions? If the extension is not normal, find a normal extension of Q in which the
extension field is contained.
(a) G(Q( 30 )/Q) (b) G(Q( 4 5 )/Q)
2. Let F K E be field. If E is a normal extension of F, show that E must also be a normal
extension of K.
3. Let G be the Galois group of a polynomial of degree n. Prove that |G| divides n!.
4. Let F E. If f(x) is solvable over F, show that f(x) is also solvable over E.
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
31.6 Summary
31.7 Keywords
Objectives
Introduction
To study solvability by radicals of a polynomial equation f(x) = 0, we let K be the field generated
by the coefficients of f(x), and let F be a splitting field for f(x) over K. Galois considered
permutations of the roots that leave the coefficient field fixed. The modern approach is to
consider the automorphisms determined by these permutations. We note that any automorphism
of a field F must leave its prime subfield fixed.
Proposition: Let F be an extension field of K. The set of all automorphisms : F > F such that (a)
= a for all a in K is a group under composition of functions.
Definition: Let K be a field, let f(x) be a polynomial in K[x], and let F be a splitting field for f(x)
over K. Then Gal(F/K) is called the Galois group of f(x) over K, or the Galois group of the
equation f(x) = 0 over K.
Notes Proposition: Let F be an extension field of K, and let f(x) be a polynomial in K[x]. Then any
element of Gal(F/K) defines a permutation of the roots of f(x) that lie in F.
Let f(x) be a polynomial in K[x] with no repeated roots and let F be a splitting field for f(x) over
K. If : K > L is a field isomorphism that maps f(x) to g(x) in L[x] and E is a splitting field for g(x)
over L, then there exist exactly [F:K] isomorphisms : F -> E such that (a) = (a) for all a in K.
Theorem: Let K be a field, let f(x) be a polynomial in K[x], and let F be a splitting field for f(x)
over K. If f(x) has no repeated roots, then |Gal(F/K)| = [F:K].
Corollary: Let K be a finite field and let F be an extension of K with [F:K] = m. Then Gal
(F/K) is a cyclic group of order m.
If we take K = Zp, where p is a prime number, and F is an extension of degree m, then the
generator of the cyclic group Gal(F/K) is the automorphism : F -> F defined by (x) = xp, for all
x in F. This automorphism is called the Frobenius automorphism of F.
A symmetric function on n variables x1,..., xn is a function that is unchanged by any permutation
of its variables. In most contexts, the term “symmetric function” refers to a polynomial on n
variables with this feature (more properly called a “symmetric polynomial”). Another type of
symmetric functions is symmetric rational functions, which are the rational functions that are
unchanged by permutation of variables.
The symmetric polynomials (respectively, symmetric rational functions) can be expressed as
polynomials (respectively, rational functions) in the elementary symmetric polynomials. This
is called the fundamental theorem of symmetric functions.
A function f(x) is sometimes said to be symmetric about the y-axis if f(–x) = f(x). Examples of such
functions include |x| (the absolute value) and x2 (the parabola).
and 1, ..., n are elementary symmetric functions, and t = X e11 , ..., X enn are special terms.
Furthermore, any special term t has a total degree n(n – 1)/2, and a maximal variable degree
n – 1.
For fixed n, the set of all symmetric polynomials in n variables forms an algebra of Notes
dimension n. The coefficients of a univariate polynomial f(x) of degree n are algebraically
independent symmetric polynomials in the roots of f, and thus form a basis for the set of all such
symmetric polynomials.
There are four common homogeneous bases for the symmetric polynomials, each of which is
indexed by a partition (Dumitriu et al., 2004). Letting l be the length of , the elementary
functions e, complete homogeneous functions h, and power-sum functions p are defined for
l = 1 by
e = x j 1 ...x j ...(2)
1 1
j1 j2 ... j 1
n
mj
h =
1
m 1 ... mn l1 j 1
x ...(3)
n
p =
1
x
j1
. ...(4)
l
s = s i ...(5)
i1
1
m = x ( 1) x s 2 ( 2 ) ... x m (m ) , ...(6)
S
where S is the set of permutations giving distinct terms in the sum and is considered to be
infinite.
As several different abbreviations and conventions are in common use, care must be taken when
determining which symmetric polynomial is in use.
The elementary symmetric polynomials k (x1, ..., xn) (sometimes denoted k or e) on n variables
{x1, ..., xn} are defined by
Notes Alternatively, j(x1,..., xn) can be defined as the coefficient of xn-j in the generating function
(x x ).
1 i n
i ...(13)
For example, on four variables x1, ..., x4, the elementary symmetric polynomials are
1(x1, x2, x3, x4) = x1 + x2 + x3 + x4 ...(14)
n
p
Sp(x1, ..., xn) = x .
k 1
k ...(18)
The relationship between * and 1,...,p is given by the so-called Newton-Girard formulas. The
related function sp(1, ..., n) with arguments given by the elementary symmetric polynomials
(not xn) is defined by
sp(1,...,n) = (–1)p–1 Sp (x1,...,xn) ...(19)
n
= ( 1)p 1 x kp . ...(20)
k 1
It turns out that sp (1, ...,n) is given by the coefficients of the generating function
sk k
ln (1 + 1t + 2 t2 + 3 t3 + ...) = kt ...(21)
k 1
1 2 2 1 3 3
= 1 t ( 1 2 2 )t ( 1 3 1 2 3 3 )t ...
2 3
s2 = 21 2 2 ...(23)
s3 = 13 – 3 1 2 3 3 ...(24)
s4 = 14 4 12 2 2 22 4 1 3 4 4 . ...(25)
1 1 0 0 0
2 2 1 1 0 0
p1 3 3 2 1 1 0
sp = ( 1) ...(26)
4 4 3 2 1 0
1
p p p1 p2 p3 1
n
S1(x1,..., xn) = x
k 1
k 1 ...(27)
Some regular polygons are easy to construct with compass and straightedge; others are not. This
led to the question being posed: is it possible to construct all regular n-gons with compass and
straightedge? If not, which n-gons are constructible and which are not?
Carl Friedrich Gauss proved the constructability of the regular 17-gon in 1796. Five years later,
he developed the theory of Gaussian periods in his Disquisitiones Arithmeticae. This theory
allowed him to formulate a sufficient condition for the constructability of regular polygons.
A regular n-gon can be constructed with compass and straight edge if n is the product of a power
of 2 and any number of distinct Fermat primes.
Gauss stated without proof that this condition was also necessary, but never published his proof.
A full proof of necessity was given by Pierre Wantzel in 1837. The result is known as the Gauss–
Wantzel theorem.
Figure 3.1
There are 31 known numbers that are multiples of distinct Fermat primes, which correspond to
the 31 odd-sided regular polygons that are known to be constructible. These are 3, 5, 15, 17, 51,
85, 255, 257, …, 4294967295. As John Conway commented in The Book of Numbers, these numbers,
when written in binary, are equal to the first 32 rows of the modulo-2 Pascal’s triangle, minus
the top row. This pattern breaks down after there, as the 6th Fermat number is composite, so the
following rows do not correspond to constructible polygons. It is unknown whether any more
Fermat primes exist, and is therefore unknown how many odd-sided constructible polygons
exist. In general, if there are x Fermat primes, then there are 2x”1 odd-sided constructible polygons.
General Theory
In the light of later work on Galois Theory, the principles of these proofs have been clarified.
It is straightforward to show from analytic geometry that constructible lengths must come from
base lengths by the solution of some sequence of quadratic equations. In terms of field theory,
such lengths must be contained in a field extension generated by a tower of quadratic extensions.
It follows that a field generated by constructions will always have degree over the base field that
is a power of two.
In the specific case of a regular n-gon, the question reduces to the question of constructing a
length
cos(2/n).
This number lies in the n-th cyclotomic field — and in fact in its real subfield, which is a totally
real field and a rational vector space of dimension
½(n),
where (n) is Euler’s quotient function. Wantzel’s result comes down to a calculation showing
that (n) is a power of 2 precisely in the cases specified.
As for the construction of Gauss, when the Galois group is 2-group it follows that it has a
sequence of subgroups of orders
1, 2, 4, 8, ...
that are nested, each in the next something simple to prove by induction in this case of an abelian
group. Therefore, there are subfields nested inside the cyclotomic field, each of degree 2 over the
one before. Generators for each such field can be written down by Gaussian period theory.
For example for n = 17 there is a period that is a sum of eight roots of unity, one that is a sum of Notes
four roots of unity, and one that is the sum of two, which is cos(2/17).
Each of those is a root of a quadratic equation in terms of the one before. Moreover, these
equations have real rather than imaginary roots, so in principle can be solved by geometric
construction: this because the work all goes on inside a totally real field.
In this way the result of Gauss can be understood in current terms; for actual calculation of the
equations to be solved, the periods can be squared and compared with the ‘lower’ periods, in a
quite feasible algorithm.
Compass and straightedge constructions are known for all constructible polygons. If n = p· q
with p = 2 or p and q co-prime, an n-gon can be constructed from a p-gon and a q-gon.
If p = 2, draw a q-gon and bisect one of its central angles. From this, a 2q-gon can be
constructed.
If p > 2, inscribe a p-gon and a q-gon in the same circle in such a way that they share a
vertex. Because p and q are relatively prime, there exists integers a,b such that ap + bq = 1.
Then 2a/q + 2b/p = 2/pq. From this, a p·q-gon can be constructed.
Thus one only has to find a compass and straightedge construction for n-gons where n is a
Fermat prime.
The construction for an equilateral triangle is simple and has been known since Antiquity.
Constructions for the regular pentagon were described both by Euclid and by Ptolemy.
Although Gauss proved that the regular 17-gon is constructible, he didn’t actually show
how to do it. The first construction is due to Erchinger, a few years after Gauss’ work.
The first explicit construction of a regular 257-gon was given by Friedrich Julius Richelot
(1832).
A construction for a regular 65537-gon was first given by Johann Gustav Hermes (1894).
The construction is very complex; Hermes spent 10 years completing the 200-page
manuscript. (Conway has cast doubt on the validity of Hermes’ construction, however.
Figure 31.2
Other Constructions
It should be stressed that the concept of constructible as discussed in this article applies specifically
to compass and straightedge construction. More constructions become possible if other tools are
allowed. The so-called neusis constructions, for example, make use of a marked rulers.The
constructions are a mathematical idealization and are assumed to be done exactly.
As you know this field with 4 elements can be constructed as F = Z2[x] / < x2+x+1 >. Letting a be
the coset of x, we have F = {0, 1, a, 1+a}. Any automorphism of F must leave 0 and 1 fixed, so the
only possibility for an automorphism other than the identity is to interchange a and 1+a. Is this
an automorphism? Since x2+x+1 0, we have x2 -x-1 x+1, so a2 = 1+a and (1+a)2 = 1+2a+a2 = a. Thus
the function that fixes 0 and 1 while interchanging a and 1+a is in fact the Frobenius automorphism
of F.
It is clear that the splitting field can also be obtained by adjoining first 2 and then i, so it can
also be expressed as Q( 2 , i).
Notes
(ii) Find automorphisms of F that have fixed fields Q( 2 ), Q(i), and Q( 2 i), respectively.
Solution: These subfields of Q( 2 , i) are the splitting fields of x2-2, x2+1, and x2+2, respectively.
Any automorphism must take roots to roots, so if is an automorphism of Q( 2 , i), we must
have ( 2 ) = ± 2 , and (i) = ± i. These possibilities must in fact define 4 automorphisms of the
splitting field.
Example: Find the Galois groups of x3 – 2 over the fields Z5 and Z11.
Solution: The polynomial is not irreducible over Z5, since it factors as x3-2 = (x+2)(x2-2x-1). The
quadratic factor will have a splitting field of degree 2 over Z5, so the Galois group is cyclic of
order 2.
A search in Z11 for roots of x3-2 yields one and only one: x = 7. Then x3-2 can be factored as x3-2 =
(x-7)(x2+7x+5), and the second factor must be irreducible. The splitting field has degree 2 over
Z11, and can be described as Z11[x] / < x2+7x+5 >. Thus the Galois group is cyclic of order 2.
Example: Find the Galois group of x4-1 over the field Z7.
Solution: We first need to find the splitting field of x4-1 over Z7. We have x4-1 = (x-1)(x+1)(x2+1).
A quick check of ±2 and ±3 shows that they are not roots of x2+1 over Z7, so x2+1 is irreducible
over Z7. To obtain the splitting field we must adjoin a root of x2+1, so we get a splitting field
Z7[x] / < x2+1 > of degree 2 over Z7.
Example: Find the Galois group of x3-2 over the field Z7.
Solution: In this case, x3-2 has no roots in Z7, so it is irreducible. We first adjoin a root a of x3-2 to
Z7. The resulting extension Z7(a) has degree 3 over Z7, so it has 73 = 343 elements, and each
element is a root of the polynomial x343-x. Let b> be a generator of the multiplicative group of
the extension. Then (b114)3 = b342 = 1, showing that Z7(a) contains a non-trivial cube root of 1. It
follows that x3-2 has three distinct roots in Z7(a): a, ab114, and ab228, so therefore Z7(a) is a splitting
field for x3-2 over Z7. Since the splitting field has degree 3 over Z7, it follows the Galois group of
the polynomial is cyclic of order 3.
Self Assessment
1. Galois considered ................... of the roots that leave the coefficient field fixed.
(a) polynomial (b) permutation
Notes 3. Any automorphism of a field f must leave its prime ................... fixed.
(b) Q 1 2 ,Q 2 i, Q 3 2i 0
(c) Q = Q2 = Q3
(d) Q1 = Q2–1 = Q3–1
31.6 Summary
Let F be an extension field of K. The set of all automorphisms : F > F such that (a) = a for
all a in K is a group under composition of functions.
Let F be an extension field of K. The set
Let K be a field, let f(x) be a polynomial in K[x], and let F be a splitting field for f(x) over
K. Then Gal(F/K) is called the Galois group of f(x) over K, or the Galois group of the
equation f(x) = 0 over K.
Let F be an extension field of K, and let f(x) be a polynomial in K[x]. Then any element of
Gal(F/K) defines a permutation of the roots of f(x) that lie in F.
Let f(x) be a polynomial in K[x] with no repeated roots and let F be a splitting field for f(x)
over K. If : K > L is a field isomorphism that maps f(x) to g(x) in L[x] and E is a splitting
field for g(x) over L, then there exist exactly [F:K] isomorphisms : F -> E such that (a) = (a)
for all a in K.
Let K be a field, let f(x) be a polynomial in K[x], and let F be a splitting field for f(x) over
K. If f(x) has no repeated roots, then |Gal(F/K)| = [F:K].
Let K be a finite field and let F be an extension of K with [F:K] = m. Then Gal(F/K) is a
cyclic group of order m.
If we take K = Zp, where p is a prime number, and F is an extension of degree m, then the
generator of the cyclic group Gal(F/K) is the automorphism : F -> F defined by
(x) = xp, for all x in F. This automorphism is called the Frobenius automorphism of F.
31.7 Keywords
Galois Group of the Equation: Let K be a field, let f(x) be a polynomial in K[x], and let F be a Notes
splitting field for f(x) over K. Then Gal(F/K) is called the Galois group of f(x) over K, or the
Galois group of the equation f(x) = 0 over K.
1. Let p be prime. Prove that there exists a polynomial f(x) Q[x] of degree p with Galois
group isomorphic to Sp. Conclude that for each prime p with p 5 there exists a polynomial
of degree p that is not solvable by radicals.
2. Let p be a prime and Zp(t) be the field of rational functions over Zp. Prove that f(x) = xp – t
is an irreducible polynomial in Zp(t)[x]. Show that f(x) is not separable.
3. Let E be an extension field of F. Suppose that K and L are two intermediate fields. If there
exists an element G(E/F) such that (K) = L, then K and L are said to be conjugate fields.
Prove that K and L are conjugate if and only if G(E/K) and G(E/L) are conjugate subgroups
of G(E/F).
6. Let F be a field such that char F 2. Prove that the splitting field of f(x) = ax2 + bx + c is
F( ), where a = b2 – 4ac.
7. Prove or disprove: Two different subgroups of a Galois group will have different fixed
fields.
8. Let K be the splitting field of a polynomial over F. If E is a field extension of F contained
in K and [E : F] = 2, then E is the splitting field of some polynomial in F[x].
www.maths.tcd.ie/
archives.math.utk.edu
CONTENTS
Objectives
Introduction
32.3 Summary
32.4 Keywords
Objectives
Introduction
In most results, in this section we will assume that the fields have characteristic zero, in order to
guarantee that no irreducible polynomial has multiple roots. When we say that a polynomial
equation is solvable by radicals, we mean that the solutions can be obtained from the coefficients
in a finite sequence of steps, each of which may involve addition, subtraction, multiplication,
division, or taking nth roots. Only the extraction of an nth root leads to a larger field, and so our
formal definition is phrased in terms of subfields and adjunction of roots of xn-a for suitable
elements a.
For a polynomial f(x) in K[x], the polynomial equation f(x) = 0 is said to be solvable by radicals
if there exists a radical extension F of K that contains all roots of f(x).
Proposition: Let F be the splitting field of xn - 1 over a field K of characteristic zero. Then
Gal(F/K) is an abelian group.
Theorem 1: Let K be a field of characteristic zero that contains all nth roots of unity, let a be an Notes
element of K, and let F be the splitting field of xn-a over K. Then Gal(F/K) is a cyclic group whose
order is a divisor of n.
Theorem 2: Let p be a prime number, let K be a field that contains all pth roots of unity, and let
F be an extension of K. If [F:K] = |Gal(F/K)| = p, then F = K(u) for some u in F such that up is in
K.
Lemma: Let K be a field of characteristic zero, and let E be a radical extension of K. Then there
exists an extension F of E that is a normal radical extension of K.
Theorem 3: Let f(x) be a polynomial over a field K of characteristic zero. The equation
f(x) = 0 is solvable by radicals if and only if the Galois group of f(x) over K is solvable.
Sn is not solvable for n 5, and so to give an example of a polynomial equation of degree n that
is not solvable by radicals, we only need to find a polynomial of degree n whose Galois group
over Q is Sn.
Lemma: Any subgroup of S5 that contains both a transposition and a cycle of length 5 must be
equal to S5 itself.
Theorem 4: There exists a polynomial of degree 5 with rational coefficients that is not solvable
by radicals
Example: Let f(x) be irreducible over Q, and let F be its splitting field over Q. Show that if
Gal (F/Q) is abelian, then F = Q(u) for all roots u of f(x).
Solution: Since F has characteristic zero, we are in the situation of the fundamental theorem of
Galois theory. Because Gal (F/Q) is abelian, every intermediate extension between Q and F must
be normal. Therefore, if we adjoin any root u of f(x), the extension Q(u) must contain all other
roots of f(x), since it is irreducible over Q. Thus Q(u) is a splitting field for f(x), so Q(u) = F.
x9-1 = (x3-1)(x6+x3+1)
= (x-1)(x2+x+1)(x6+x3+1).
This polynomial satisfies Eisenstein’s criterion for the prime 3, which implies that the factor
x6+x3+1 is irreducible over Q. The roots of this factor are the primitive 9th roots of unity, so it
follows that [F:Q] = 6. Gal (F/Q) is isomorphic to a subgroup of Z9× Since Z9× is abelian of order
6, it is isomorphic to Z6. It follows that Gal (F/Q) Z6.
Comment: The Galois group of xn-1 over Q is isomorphic to Zn× and so the Galois group is cyclic
of order (n) iff n = 2, 4, pk, or 2pk, for an odd prime p.
Example: Show that x4-x3+x2-x+1 is irreducible over Q, and use it to find the Galois group
10
of x -1 over Q.
Notes Solution: We can construct the splitting field F of x10-1 over Q by adjoining a primitive 10th root
of unity to Q. We have the factorization
x10-1 = (x5-1)(x5+1)
= (x-1)(x4+x3+x2+x+1) (x+1)(x4-x3+x2-x+1).
Substituting x-1 in the last factor yields
(x-1)4-(x-1)3+(x-1)2-(x-1)+1
= (x4-4x3+6x2-4x+1) - (x3-3x2+3x-1) + (x2-2x+1) - (x-1) + 1
= x4-5x3+10x2-10x+5.
This polynomial satisfies Eisenstein’s criterion for the prime 5, which implies that the factor
x4-x3+x2-x+1 is irreducible over Q.
The roots of this factor are the primitive 10th roots of unity, so it follows that [F:Q] = 4. The proof
of Theorem 1 shows that Gal (F/Q) Z10× and so the Galois group is cyclic of order 4.
Example: Show that p(x) = x5-4x+2 is irreducible over Q, and find the number of real
roots. Find the Galois group of p(x) over Q, and explain why the group is not solvable.
Solution: The polynomial p(x) is irreducible over Q since it satisfies Eisenstein’s criterion for
p = 2. Since p(-2) = -22, p(-1) = 5, p(0) = 2, p(1) = –1, and p(2) = 26, we see that p(x) has a real root
between -2 and -1, another between 0 and 1, and a third between 1 and 2. The derivative
p’(x) = 5x4-4 has two real roots, so p(x) has one relative maximum and one relative minimum,
and thus it must have exactly three real roots. It follows as in the proof of Theorem 2 that the
Galois group of p(x) over Q is S5, and so it is not solvable.
Self Assessment
(c) –1 (d) –2
2. Let K be a field of characteristic zero and let be a .................. of K. Thus there exists an
extension of F of that is normal radical extension.
3. There exists a polynomial of degree .................. with rational co-efficients that is not solvable
by radical.
(a) 4 (b) 5
(c) 6 (d) 7
4. Any subgroup of S5 that contains both a transposition and cycle of length .................. must
be equal to S5 itself.
(a) 4 (b) 5
(c) 3 (d) 6
Let p be a prime number, let K be a field that contains all pth roots of unity, and let F be an
extension of K. If [F:K] = |Gal(F/K)| = p, then F = K(u) for some u in F such that up is in K.
Let K be a field of characteristic zero, and let E be a radical extension of K. Then there exists
an extension F of E that is a normal radical extension of K.
Let f(x) be a polynomial over a field K of characteristic zero. The equation
f(x) = 0 is solvable by radicals if and only if the Galois group of f(x) over K is solvable.
Any subgroup of S5 that contains both a transposition and a cycle of length 5 must be equal
to S5 itself.
There exists a polynomial of degree 5 with rational coefficients that is not solvable by
radicals
32.4 Keywords
Solvable by Radicals: For a polynomial f(x) in K[x], the polynomial equation f(x) = 0 is said to be
solvable by radicals if there exists a radical extension F of K that contains all roots of f(x).
xp 1
p (x) xp 1 x p 2 ... x 1
x1
is irreducible over for every prime p. Let w be a zero p(x), and consider the field ().
(a) Show that , 2,...,p-1 are distinct zeros of p(x), and conclude that they are all the
zeros of p(x).
(e) If G(E/F) is an even permutation of the roots of f(x), show that () = .
archives.math.utk.edu