Professional Documents
Culture Documents
Block 01 Groups
Block 01 Groups
BOOK
When we give you exercises to do, whether from the textbook or any other ones, we
place the icon EXERCISES in the margin next to them. When we give a note or
clarification of comment, we place NOTE in the wide margin next to it. At the
completion of the note we have placed the symbol to indicate the completion of the
note. Finally, the end of a proof of a theorem is shown by placing a at the end of the
last line, and the end of an example is shown by placing *** after its solution.
You may find the Preface and Note for the Teachers on Pages xiii-xvii of the textbook
useful as a guide to read the book. 3
Study Guide-I We hope you will enjoy reading this course. If you have any queries, corrections or
comments do not hesistate to contact us through e-mail at the addresses
svenkat@ignou.ac.in and pksinclair@ignou.ac.in
You can also write to us at the following address:
The Course Coordinator
MMT-003, Algebra
School of Sciences
IGNOU
Maidan Garhi
New Delhi-110068
All the best!
4
BLOCK INTRODUCTION
In this block, you will continue the study of Algebra from where you stopped in your
undergraduate course in group theory (See Block 1 and 2, MTE-06 of the course
material.). This study would be done largely from Chapters and of ‘Algebra’, by
Michael Artin, supported by the material given in this block.
You would first study group actions, which is very important not only in abstract areas
of mathematics like the study of Riemann Surfaces, but also in many topics in Discrete
Mathematics which have applications. In the first unit of this block, we have illustrated
this through examples. We use this concept to study the conjugacy classes of a group.
We also use the technique built around group actions in proving Sylow Theorems.
The next concept we discuss is the ’symmetric group’. Apart from proving the well
known result that the alternating group An is simple, we also discuss how to count the
number of conjugacy classes in a symmetric group and how to find them.
In the third unit, we discuss what are called classical groups in mathematics literature.
You already know that invertible matrices over any field form a group with respect to
matrix multiplication. This group is known in mathematics literature as The General
Linear Group. From your study of Linear Algebra in undergraduation and in
MMT-002, you are also familiar with unitary and orthogonal matrices. The set of
unitary matrices form a subgroup of the general linear group over the field of complex
numbers. In unit 3 we focus on this group and its subgroup, the group of special unitary
matrices, which are unitary matrices with determinant one. Apart from these groups we
discuss the special linear group, the group of invertible matrices with determinant one.
The aim of the study is to illustrate some of the concepts like orbits, stabilisers etc. that
we have studied earlier.
The fourth unit has two main topics for discussions, namely, free groups and free
abelian groups. In this unit, we define a free group and prove some elementary results
about free groups. In the sections on free abelian groups we begin by discussing some
basic facts about finitely generated free abelian groups and end the unit with the
structure theorem on finitely generated free abelian groups.
Unit 5 focusses on semigroups, a concept that is usually marginalised in the
undergraduate studies. However, as you will see in the unit, semigroups have a variety
of important applications in the study of automata, linguistics and various social
sciences. This is why we have offered you this opportunity to study them. This is the
first unit which is not a ’wrap’ around some material in the textbook.
The last unit in this block is on congruences. This a part of number theory and we use
this topic to illustrate some of the results that we have studied earlier. Apart from some
of the standard results in congruences like the Chinese Remainder Theorem, we also
prove the quadratic reciprocity law. In the recent times some nice applications of
congruences have been found. We discuss some of them in this unit.
5
Study Guide-I
NOTATION & SYMBOLS
See page 601 of Artin for Notations and Symbols not in this list.
6
UNIT 1 GROUP OPERATIONS
Structure Page No.
1.1 Introduction 7
Objectives
1.2 Group Actions 7
1.3 Conjugacy Classes 13
1.4 Summary 13
1.5 Solutions/Answers 14
Sections in Artin’s book related to this unit:
Chapter Section
5 5, 6 and 7
6 1 and 3
1.1 INTRODUCTION
Permutation groups and matrix groups are among the most basic classes of groups.
These groups are equipped with a natural “action” on a set. For example, a group of
permutations of {1, . . . , n}, i.e., a subgroup G of the symmetric group Sn , acts naturally
on the set {1, . . . , n} in the sense that given any σ ∈ G and i ∈ {1, . . . , n}, we can
associate another element of {1, . . . , n}, namely σ (i). Likewise a subgroup G of the
multiplicative group GLn (F) of n × n nonsingular matrices with entries in a field F acts
naturally on the set Fn of n-tuples of elements of F simply by matrix multiplication.
The notion of group action (or as Artin calls them, group operation) is an abstraction of
the above basic examples. In general, a group G acts (or operates) on a set S if there is a
map
G×S → S
satisfying certain natural properties [See axioms (5.1) on page 176 of Artin.]. It turns
out that a group can have several different actions on different sets and understanding
these can give a better insight into the structure of the group
In this unit we will discuss, in Section 1.2 below, several examples of group actions and
some basic concepts and properties of group actions, in general. Next, in Section 1.3,
we will discuss and illustrate two important tools in studying finite groups, namely, the
so called Counting Formula and the class equation. Further applications of group
actions will be discussed in the next unit.
Objectives
After studying this unit along with Sections 5 and 6 of Chapter 5 and Sections 1 and 3
of Chapter 6 from the Artin’s book, you will be able to
• define group action (or group operation, as per the terminology used by Artin);
• check whether a given map G × S −→ S, where G is a group and S is a set, is a group
action;
• to find the stabilisers and orbits for given actions;
• understand and apply class equation for a finite group.
So, this gives another example of action of Z, this time on the set of all points on the
plane with integer co-ordinates.
∗∗∗
Example 6: Consider the collection of all partitions of {1, 2, 3, 4, 5} into two subsets of
cardinality 3 and 2. The number of such partitions is simply the number of ways of
choosing the 3 numbers for the first set,
and so it is given by the binomial coefficient
C(5, 3) = 15. A typical partition, say, {3, 2, 5}, {4, 1} , may be represented in a
diagram of two rows of boxes filled with numbers: (in each row we arrange the
numbers in increasing order).
2 3 5
1 4
Fig. 1: A partition of {1, 2, 3, 4, 5} into two subsets of size 3 and 2.
The group S5 acts on this collection of 15 such diagrams by acting on each box of the
diagram. For example, since 1 and 5 are interchanged under the transposition (1 5), the
positions of 1 and 5 in Fig. 1 are changed to the ones in Fig. 2.
2 3 1
5 4
Fig. 2: Changed partition under the action of (1 5).
{{i11 , i12 , · · · , i1n1 } , {i21 , i22 · · · , i2n2 } , · · · , {ik1 , ik2 , · · · , iknk }} (3)
be a partition of {1, 2, . . . , n} into k subsets, the i-th subset having ni elements and
n1 + n2 + · · · + nk = n. Then, we have an action of Sn on the set of all partitions of this
type. It is easy to check that there are
C (n, n1 ) C (n − n1 , n2 ) C (n − n1 − n2 , n3 ) · · · C (n − n1 − n2 − · · · − nk−1 , nk )
n!
= (4)
n1 !n2 ! · · · nk ! 9
Study Guide-I partitions of the type in Eqn. (3). Note that the order in which we choose the subsets
doesn’t matter, i.e. it doesn’t matter whether we select the subset with n1 elements first
or the subset with n2 elements first etc., because the final formula in the RHS of
Eqn. (4) is independent of the order in which we choose the subsets.
∗∗∗
Try the following exercises to check your understanding of group actions.
|x − y| = |m(x) − m(y)| ∀ x, y ∈ R2 .
In other words, m preserves distances. Note that a rigid motion is necessarily a 1-1
map.(Why?)
A characterisation of group of rigid motions given in Chapter 5, Section 2, pages
157–162. You may browse through it if you are interested.
NOTE Please correct equation (5.3) on page 177.
The equation should be
Os = s0 ∈ S s0 = gs for some g ∈ G
NOTE Please refer to page 177, lines 9—11.
Note that, since a rigid motion preserves length, it preserves the lengths of the sides of a
triangle. So, it maps a triangle to a congruent triangle.
The concepts of orbit, stabiliser and transitive action are discussed in page 177 of
Artin’s book. Here are some examples to help you understand these concepts.
Example 7: Consider the action of G = GLn (F) on Fn given by left multiplication.
The element 0 is left n
fixed by all the elements. Let v ∈ F be any t
non-zero vector.
For a row vector v, vt Suppose A = aij ∈ GLn (F) is in Ge1 , where e1 = (1, 0, . . . , 0) . Then, the first column
denotes its transpose. of A is the vector e1 , and so A is of the form
n−1 entries
1
z }| {
∗ ∗ ···∗
0
B
0 (5)
..
.
0
1 2 3 4 5
row: ↓ ↓ ↓ ↓ ↓.
3 2 5 4 1
The stabiliser subgroup of any such partition above is also easy to describe: it consists
of those permutations that do not mix the contents of one row with another. They are all
isomorphic to S3 × S2 .
In general, the stabiliser of a partition of type given in Eqn. (3) is isomorphic to
Sn1 × Sn2 × · · · × Snk where n1 , n2 , . . . , nk is a partition of n.
∗∗∗
Here are some exercises to test your understanding of the concepts of transitivity and
stabiliser.
E2) Find thestabilisers for the following elements under the action given: EXERCISES
a) Stabiliser of the vector e1 = (1, 1, . . . , 1)t ∈ Fn under left multiplication by
elements in GLn (F).
1 1
b) Stabiliser of the element ∈ M2 (R) under left multiplication by
0 0
elements in GL2 (R).
E3) Which of the actions amongst b) and c) in Example 1 are transitive? Give reasons
for your answer. Describe the orbits under the action given in b), Example 1.
After finishing Sec. 5, try exercises 2, 4, 5 and 9 in page 192 of Artin’s book under
the heading ‘5. Abstract Symmetry: Group Operations.’ EXERCISES
Read Section 6 of chapter 6. BOOK
Please refer to the example that starts from 7th line from the bottom in page 178
and end at 6th line from the top in page 179. NOTE
Cayley’s theorem tells us that we can embed D3 in S6 . However, from the discussion
mentioned above, we see that we can do better; we can actually embed D3 in S3 , a
smaller permutation group. We did this by considering the action of D3 on the cosets of
one of its subgroups. So, the question arises: When can we find such an embedding?
Let G be a finite group and let H be its subgroup. As we saw, there is an action of G on
the (left) cosets of H in G.
g · xH = gxH (6)
τ : G −→ S(G/H) (7)
If G is a group and H is a Coming back to our original question, we see that if G has a subgroup H which is not
subgroup of G, [G : H] normal and doesn’t contain any non-trivial subgroup of G, then we can embed G in Sn
denotes the index of H in G. with n = [G : H]. In particular, if H is a subgroup of G which is simple and non-normal,
we can embed G in Sn with n = [G : H].
For notational convenience, let us write [G : H] for the index of H in G. If the map τ
defined in Eqn. (7) is injective, then |G| divides [G : H]!. Put differently, if |G| doesn’t
divide [G : H]!, then τ is not injective and so its kernel is non-trivial. Since the kernel of
a group homomorphism is a normal subgroup, G contains a non-trivial normal
subgroup which is contained in H. Let us summarise this as a proposition.
Proposition 2: let G be a finite group and H be its subgroup such that |G| doesn’t
divide [G : H]!. Then H contains a non-trivial normal subgroup of G.
Try the next exercise which is an application of Proposition 2.
EXERCISES E4) If H is a (proper) subgroup of G of index p where p is the smallest prime dividing
|G|, then H is a normal subgroup of G.
(order of the group) = (order of the stabiliser) × (order of the orbit) (8)
There are qn − 1 vectors in Fn \ {0} and the orbit of e1 contains all the qn − 1 elements.
If we know the order of the stabiliser of e1 we can find the order of the group
G = GLn (F). Note that, there are qn−1 ways of choosing the last n − 1 entries in the
first row of the matrix in Eqn. (5) whereas B can be any element of GLn−1 (F). So, the
order of the stabiliser of e1 is qn−1 |GLn−1 (F)|. Now, using Eqn. (8), we get
You can find the order of GLn (F) by successively using Eqn. (9). We leave the details
as an exercise to you.
∗∗∗
Here are some exercises for you.
Read Section 1 of Chapter 6, starting from line 3 of page 198 and section 6. BOOK
Definition 2: A group G is simple if it has no subgroups other than {1} and G itself.
Lemma 1:
a) Let G be a group and N a normal subgroup of G. If N contains an element x ∈ G,
then it contains the conjugacy class Cx of x in G. In other words, a normal
subgroup is a union of conjugacy classes in G.
b) The order of N is the sum of the orders of the distinct conjugacy classes of G that
are contained in it.
Do exercises 1, 5, 9, 13 under ‘3. Operations on Subsets in pages 230 and 231. EXERCISES
We conclude this unit here. We will summarise the contents of the unit in the next
section.
1.4 SUMMARY
1.5 SOLUTIONS/ANSWERS
E1) You can check easily that the identity matrix acts as the identity map. To check
the associativity, we compute:
0 0 0
a b a b aa + bc0 ab0 + bd0
z= z
c d c0 d0 ca0 + dc0 cb0 + dd0
(aa0 + bc0 )z + ab0 + bd0
=
(ca0 + dc0 )z + cb0 + dd0
a0 z + b0
0 0
a b
0 0 z= 0
c d c z + d0
0 0
a ac0 z+b
z+d0 + b a(a0 z + b0 ) + b(c0 z + d0 )
0 0
a b a b
∴ z = =
c d c0 d0 c ca0 z+d
0 z+b0
0 +d c(a0 z + b0 ) + d(c0 z + d0 )
Regrouping the terms, RHS in the equation above is
(aa0 + bc0 )z + ab0 + bd0
=
(ca0 + dc0 )z + cb0 + dd0
0
aa + bc0 ab0 + bd0
= z
ca0 + dc0 cb0 + dd0
E2) a) Writing G = GLn (F), the stabiliser of e1 is
( )
n
Ge1 = aij ∈ GLn (F) ∑ aij = 1, for i = 1, 2, · · · , n.
j=1
a b 1 1
b) If is in the stabiliser of , then check that a = 1, c = 0, and
c d 0 0
b, d are arbitrary.
E3) Neither action is transitive because a singular matrix can never be carried to a
non-singular matrix under either action. Regarding the action in b), since any
invertible matrix can be reduced to the identity matrix through row reduction, the
invertible elements are in a single orbit that contains the identity matrix. If a
matrix is not invertible, its orbit contains a unique matrix in the reduced row
echelon form. (Refer to Section 2, Chapter 2 of Artin’s book, particularly
Proposition (2.18) in page 16.)
Regarding the action in c), the orbits are simply the similarity classes of matrices.
We can describe the orbits under the conjugacy action using the rational canonical
form if the field F is not algebraically closed. If F is algebraically closed, we can
describe the orbits using the Jordan form also.
E4) Let K be the kernel of the homomorphism in Eqn. (7). We will show that H = K.
The subgroup K is non-trivial because |G| - p!. To see this note that (|G|, p!) = p
and |G| 6= p. So, there are prime factors of |G| that do not divide p!. So, |G| - p!.
Next, we need to show that [H : K] = 1. We have [G : K] | [G : H]! = p!. But,
[G : K] = [G : H][H : K] = p [H : K]. Thus [H : K] | (p − 1)! and therefore all its
prime divisors are < p. Since p is the smallest prime divisor of |G|, it follows that
14 [H : K] = 1 or H = K.
E5) a) When n = 1, |GL1 (F)| = |F∗ | = q − 1.Therefore, the result is true for n = 1. Group Operation
Suppose the result is true for n:
n(n−1)
|GLn (F)| = q (qn − 1) qn−1 − 1 · · · (q − 1) (10)
2
Then,
n(n+1)
=q qn+1 − 1 qn − 1 qn−1 − 1 · · · q − 1
2
Q9) a) Using row echelon form, the orbits are of the form GL2
(C)x where x is
1 α
either the identity matrix or a matrix of the form .
0 0
b) Using Jordan canonical form, the orbits are of the form
α 1 α 0 α 0
, ,
0 α 0 α 0 β
B1 B2
−1
PAQ = P[A1 | A2 ]
B3 B4
B1 B2
= [PA1 | PA2 ]
B3 B4
= [PA1 B1 + PA2 B3 | PA1 B2 + PA2 B4 ] (11)
Q1) It is those elements g ∈ G such that g.aH = aH in G/H. This happens if and only
if a−1 ga ∈ H, that is g ∈ aHa−1 . So the stabilizer is the subgroup aHa−1 of G.
Q2) We are given that left multiplication by x fixes every coset of the cyclic
subgroup H generated by x in G. That is, xgH = gH for all g ∈ G. So,
x2 gH = x(xgH) = xgh. Repeatedly applying this we see that xn gH = gH for all
g ∈ G, n ∈ Z. That is, g−1 xn g ∈ H for all g ∈ G. As H is the group generated by
x, this is precisely the statement of normality of H.
∑ 1 + ∑ Cg
=
g∈Z g6∈Z
= |Z| + ∑ |C| 17
Study Guide-I where the second sum is over the conjugacy classes with more than one element.
Q4) Suppose |G| = pn , n ≥ 1. For each s ∈ S, |Os | = [G : Gs ]. Since [G : Gs ] | |G|,
[G : Gs ] = pk , 0 ≤ k ≤ n. We have
s fixed by G ⇔ G = Gs
⇔ [G : Gs ] = 1
So, we have to show that [G : Gs ] = 1 for at least one s. If this is not true, each
orbit will have pk elements for some 0 < k ≤ n, i.e. order of each orbit will be
divisible by p. So, the number of elements in S, which is the sum of the order of
the orbits, will also be divisible by p, which is a contradiction to the assumption
p - |S|.
Q6) 1 + 1 + 1 + 2 + 5 is not possible because |Z| = 3 and |Z| must divide 10.
1 + 2 + 2 + 5 is possible. Check that this is the class equation of D5 .
1 + 2 + 3 + 4 is not possible because there is a conjugacy class of order 3 and 3
does not divide 10, the order of the group.
1 + 1 + 2 + 2 + 2 + 2 is not possible. Here |Z| = 2. So, 2 | |Z(g)| for any g ∈ G,
|G|
therefore |Cg | = |Z(g)| | 5. So, there cannot be any conjugacy class with 2
elements.
1 0 a b
Q7) Let A = and X = be any matrix in GL2 (F5 ). Then
0 2 c d
0 −b
AX − XA = . So, AX = XA iff b = 0, c = 0. Since ad − bc 6= 0, this
c 0
implies that
a 0
∗
Z(A) = a, d ∈ F5
0 d
Q1) Let
S = { {a, b}| a, b ∈ D3 }
There are C(6, 2) = 15 subsets of D3 with two elements, i.e. |S| = 15.
What are the possible sizes of orbits? We know that the size of an orbit of an
element is the index of its stabiliser in the group by counting formula. Since D3
has order 6, the possible sizes for orbits are 1, 2, 3 and 6.
The elements of S which have orbit of length 1 are the subsets of order 2 of D3
which are stabilised by conjugation. But, according to Proposition 3.2, any
subset of D3 stabilised under conjugation should be a union of conjugacy classes.
Using the notation in Proposition 3.6 in page 165, we know that the conjugacy
classes in D3 are {1}, {x, x2 }, {y, xy, x2 y}.(See page 198.) So, using Proposition
3.2 in page 203, the only set of two elements fixed by D3 is the conjugacy class
{x, x2 }.(Why?)
Consider sets of the form {1, g} where g ∈ D3 . There are 5 such sets. They are
divided into 2 orbits:
{{1, x}, {1, x2 }} and {{1, y}, {1, xy}, {1, x2 y}}
Now, consider the subsets that do not contain identity and contain x or x2 , but not
18 both. There are six such subsets, three containing x and three containing x2 .
1) Show that the stabiliser of {x, y} in D3 is Z(x) ∩ Z(y). Group Operation
2) Show that Z(x) = {1, x, x2 } and Z(y) = {1, y} and deduce that the
stabiliser of {x, y} is trivial.
There are three more sets, {y, yx}, {y, yx2 } and {yx, yx2 } and they form a single
orbit because there are no orbits of length one other than {x, x2 }.
Q5) No. Take G = D3 , and U = x, x2 , y, yx, yx2 . Then |G| and |U| are coprime, but
Q13) a) Let H = {1, g}. Then, H is the union of conjugacy classes of G. The
element 1 will be in a conjugacy class with just one element. So, the other
element g is also in a conjugacy class with just one element. Thus, g is in
the centre of H.
b) The argument is similar. Again the identity element will be in a conjugacy
class with one element. The order of the every conjugacy class of G that is
contained in H is at most p − 1 and hence strictly less than p. Since p is the
smallest prime divisor of the order of G, the order of the conjugacy class
has to be 1.
19
Study Guide-I
20
UNIT 2 SYLOW THEOREMS AND
THE SYMMETRIC GROUP
Structure Page No.
2.1 Introduction 21
Objectives
2.2 Sylows Theorems 21
Applications of Sylow Theorems
2.3 Symmetric Group 23
Conjugacy Classes in Sn
An is Simple
2.4 Summary 28
2.5 Solutions/Answers 28
Sections in Artin’s book related to this unit:
Chapter Section
6 4,5 and 6
2.1 INTRODUCTION
In your degree course in algebra, you have learnt Lagrange’s theorem which says that
the order of any subgroup of a finite group divides the order of the group. It is natural to
ask if the converse is true, i.e., if G is a finite group and d a positive integer such that d
divides the order of G, then does G have a subgroup of order d.
It turns out that the converse is not true, in general. For example, the alternating group
A4 of order 12 has no subgroup of order 6.(See, for example, page 38 of Unit 2 of
MTE-06.) In Sec. 2.3 of this unit, we will discuss a theorem due to the Norwegian
mathematician Sylow which states that for any prime p, if pn is the highest power of p
dividing the order of G, then G has a subgroup of order pn .
The study of the symmetric group is important because every finite group is a subgroup
of a symmetric group. Symmetric groups are also a rich source of examples and
counter examples. In your degree classes, you would have already learned some P. L. M. Sylow
elementary properties of symmetric groups. In Sec. 2.3 of this unit, we will discuss (1832–1918)
some more results about symmetric groups. In particular, we will see how to find the
order of conjugacy classes in the symmetric group.
Objectives
After reading this unit together with the prescribed material from the Artin’s book, you
will be able to
• apply Sylow theorems to prove the existence of groups of specified order;
• explain the relationship between conjugacy classes in Sn and partitions of n and use
the relationship to describe the conjugacy classes in Sn ;
• explain when does a conjugacy class in Sn of an element in An splits into two
conjugacy classes in An ;
• calculate conjugacy classes and class equation of Sn and An ;
ψ : G −→ S6
given by
and hence GKi = N(Ki ) has cardinality 3. But, since |Ki | = 3 and Ki ⊂ N (Ki ), we
obtain N(Ki ) = Ki .
22 NOTE Lines 17 and 18 from the top in page 210 of Artin’s book. ‘Since the only element
common to the subgroups Ki is the identity element, only the identity element stabilises Sylow Theorems and The
Symmetric Group
all of these subgroups.’
Explanation: If g ∈ G stabilises all the Ki ,
4 4
g∈ N (Ki ) = Ki = {1},
\ \
i=1 i=1
where the first equality follows since N (Ki ) = Ki for 1 ≤ i ≤ 4. So, the identity element
is the only element that stabilises all the Ki .
Lines 20 and 21 on page 210 of Artin’s book.‘Since G has four subgroups of order 3, NOTE
it contains eight elements of order 3 and they certainly generate the group.’
Explanation: The group generated by elements of order 3 has order at least 8 and its
order divides 12, the order of the group, by Lagrange’s Theorem. So, the subgroup
generated by elements of order 8 in G has order 12.
Attempt all the exercises under ‘5. The Groups of Order 12.’ in page 232 of
Artin’s book. EXERCISES
In the next section, we will discuss the symmetric group.
You would have already studied about symmetric groups in your degree course. It is an
important class of groups. In this section, we will discuss the conjugacy classes of
symmetric groups. In particular, we will determine the conjugacy classes in A4 , S4 and
A5 . We will also show that An is simple for n ≥ 5.
But, before we do that, we ask you to read the material on symmetric groups in Artin’s
book. You may be already familiar with many of the results from your degree course.
However, reading the material will help you to quickly recapitulate what you have
studied earlier and also help you in familiarising yourself with the notation used in the
Artin’s book.
The definition of sign of a permutation in Artin’s book is different from the one given
in MTE-06 and in many other books. So, browse through pages 24, 25 and 26 of
Artin’s book to familiarise yourself with this definition. NOTE
Attempt exercises 2, 3, 4, 5, 6, 7, 8, 10, 15 and 16 for this section in pages 232
and 233. EXERCISES
Important: Note that, as in Artin’s book, we read the permutations left to right. In
other words, the product of two permutations σ τ means that we apply σ first and τ
next.
In view of the fact that every permutation in Sn can be uniquely written as a product of
disjoint cycles[Prop. (6.6) on page 213 of Artin], we make the following definition.
p = σ1 σ2 · · · σh
e1 + e2 + · · · + eh = n of n with e1 ≥ e2 ≥ · · · ≥ eh ≥ 1 (1)
Then, we can always find a cycle p ∈ Sn with cycle-type (e1 , e2 , . . . , eh ). (We will
always write the parts of a partition in non-increasing order.) In this terminology,
Prop. (6.10) on page 214 of Artin can be summarised as follows.
Proposition 1: Two permutations in Sn are conjugate to each other if and only if they
have the same cycle type.
We have the following corollary:
The number p(n) grows
rapidly with n. Finding an Corollary 1: The number of distinct conjugacy classes in Sn = p(n) where p(n)
exact formula for p(n) is not
easy. A remarkable formula
denotes the number of partitions of n.
has been obtained with Having found the number of conjugacy classes in Sn , let us now find the number of
essential contributions by elements in a conjugacy class. As you would expect, this also depends on the cycle
the Indian mathematician structure of the partition corresponding to the conjugacy class. Let us denote the
S. Ramanujan; this is known conjugacy class of an element σ ∈ Sn by [σ ]. To determine the number of elements in a
as the Hardy-Ramanujan- conjugacy class in Sn , we need to know the number of permutations that correspond to
Rademachar formula, which a given partition. Again, we will make use of group actions for counting.
can be found, for example,
in the book ‘Theory of Suppose the cycle-type of a permutation σ is (e1 , e2 , . . . , eh ). For i = 1, 2, . . . n, let αi be
Partitions’ by G. Andrews. the number of times i occurs in (e1 , e2 , . . . , eh ). Note that, some of the αi s could be zero.
We can then write Eqn. (1) as
and we may write the partition (e1 , e2 , . . . , eh ) as 1α1 2α2 . . . nαn . We will drop the i’s for
which αi = 0 from the product. For example, we can write the partition
(4, 3, 2, 2, 1, 1, 1) of n = 14 as 13 · 22 · 33 · 4.
We know that |[σ ]| = [Sn : N(σ )]. We also know that |Sn | = n!, so we have to calculate
|N(σ )|, the order of the normaliser of σ , i.e.
|{τ ∈ Sn |τσ = σ τ }|
So, by the orbit-stabiliser relation, the number of elements in the conjugacy classes Cσ
corresponding to the partition
|k + k +
{z· · · + k} + (k
|
− 1) + (k − 1) + · · · + (k − 1) + · · · + 2| + 2 +
{z } {z· · · + 2} + 1| + 1 +
{z· · · + 1}
αk times αk−1 α2 times α1 times
(3)
is
n!
|Cσ | = k k
(4)
24 ∏ i ∏ αi !
αi
i=1 i=1
Let us now find the conjugacy classes in the group S4 . The partitions of 4 and the Sylow Theorems and The
Symmetric Group
number of elements in the conjugacy classes are given below:
So, the number of conjugacy class in S4 is five and the class equation is
24 = 1 + 3 + 6 + 6 + 8
[(1)], [(1 2)], [(1 2 3)], [(1 2)(3 4)], [(1 2 3 4)]
Let us now write down the class equation for A4 . Since A4 is a normal subgroup it is
the union of S4 -conjugacy classes of elements that contain an even permutation. Note
that a cycle of length k can be written as a product of k − 1 transpositions for k ≥ 3:
You can use this to verify that the S4 -conjugacy classes in A4 that contain an even
permutation are
But, in general, the elements that are conjugates in Sn need not be conjugates in An . So,
some of the Sn -conjugacy classes may split into smaller conjugacy classes in An . We
leave it to you to check that the Sn -conjugacy class of an element σ ∈ An will split into
at most two An -conjugacy classes and this happens precisely when stabiliser of σ in Sn
contains only even permutations.(See Exercise 1.) So, the question arises: When will
the stabiliser of σ ∈ An in Sn will contain only even permutations?
Proposition 2: Let σ ∈ An . Then, Z(σ ) has only even permutations if and only if the
parts in the partition of n corresponding to σ are odd and distinct.
Proof: Necessity: Suppose the partition corresponding to σ has an even part. This
means that the decomposition of σ into disjoint cycles contains a cycle of even length,
say
σ = (1 2 . . . 2k)( )( ) · · · (6)
σ = (1 2 . . . k)(k + 1 . . . 2k)( )( ) · · ·
Let us now use Proposition 2 to identify the conjugacy classes in the list in Eqn. (6)
which will split into two when we consider the An -conjugacy classes. As you can see,
[(1 2 3)], which corresponds to the partition 4 = 3 + 1 will split into two conjugacy
classes in A4 . Since this Sn -conjugacy class has eight elements, it will split into two
conjugacy classes [(1 2 3)] and [(1 2 4)] of four elements each. So, the class equation of
A4 is
1+3+4+4
We can find the conjugacy classes of S5 by using the method we used for S4 .
120 = 1 + 10 + 15 + 20 + 20 + 30 + 24
[(1)], [(1 2)], [(1 2)(3 4)], [(1 2 3)], [(1 2 3)(4 5)], [(1 2 3 4)], [(1 2 3 4 5)]
We leave it to you to find the class equation and the conjugacy classes of A5 .
Here are some exercises for you to check your understanding of our discussion on
conjugacy classes of Sn and An .
EXERCISES E1) Let g ∈ An be an even permutation. Let C1 and C2 , respectively, denote the
conjugacy classes of g in Sn and An and let H1 denote the centraliser of g in Sn .
Using the orbit-stabiliser relationship show that |C1 | = |C2 | if H1 contains an odd
permutation and |C1 | = 2 |C2 | if H1 doesn’t contain an odd permutation.
E2) Find the elements in the two conjugacy classes of size four each in A4 .
E3) Write down the class equation and the conjugacy classes of A5 .
2.3.2 An is Simple
In this subsection, we will prove that An is simple. We first prove the following result.
The last term in the above equation is (i j r)(r j s). This completes the proof.
Theorem 1: The alternating group An is simple for all n ≥ 2, except in the case n = 4.
Proof: Recall that a group with no proper subgroups is called a simple group.
We know that A2 is the trivial group and A3 has order 3. So, both the groups are simple
by Lagrange’s theorem.
We have already seen that A4 has a normal subgroup of order four.
Now let us consider An for n ≥ 5.
Step I: If N is a normal subgroup of An containing a 3-cycle, then N = An .
Proof of Step I: Let (a1 a2 a3 ) be any three cycle in N. In view of Proposition 3, it is
enough to show that N contains every 3-cycle. Suppose (b1 b2 b3 ) is any 3-cycle.
Choose a4 , a5 (resp. b4 , b5 ) in {1, 2, 3, . . . n} distinct from a1 , a2 and a3 (resp. b1 , b2 , b3 )
and let g be a permutation of the form
a1 a2 a3 a4 a5 · · ·
b1 b2 b3 b4 b5 · · ·
σ = (a1 , a2 , a3 , . . .) · · · (7)
We conclude this unit here. In the next section, we will summarise what we have learnt.
2.4 SUMMARY
2.5 SOLUTIONS/ANSWERS
E1) Let H1 and H2 , respectively, denote the stabiliser of g under the conjugation action
of Sn and An , respectively. Then H2 = H1 ∩ An by definition. We have
|Sn | |An |
|C1 | = and |C2 | =
|H1 | |An ∩ H1 |
This can be re-written as
n!
|C1 | |H1 | = n! and |C2 | |H2 | = (9)
2
If |C1 | = |C2 |, then |H1 | = 2 |H2 |. So, H1 * An , i.e., H1 contains an odd
permutation.
Conversely, suppose H1 contains an odd permutation. In this case H1 An = Sn
(Why?). From the isomorphism
H1 H1 An
'
H1 ∩ An An
it follows that |H1 | = 2 |H2 | (How ?). Hence, from Eqn. (9), it follows that
|C1 | = |C2 | (How?).
E2) The elements in the conjugacy class [(1 2 3)] are (2 4, 3), (1 2 3), (1 3 4), (1 4 2).
The elements in the conjugacy class [(1 2 4)] are (2 3 4), (1 2 4), (1 3 2) and
28 (1 4 3).
E3) The conjugacy classes in S5 that contain only the even permutations are Sylow Theorems and The
Symmetric Group
[(1)], [(1 2)(3 4)], [(1 2 3)], [(1 2 3 4 5)]
The only conjugacy class that will split into two is [(1 2 3 4 5)] and it will split
into two conjugacy classes of 12 elements each. The class equation is
60 = 1 + 15 + 20 + 12 + 12
Solutions to selected exercises in ‘4. The Sylow Theorems.’ in page 231 and
232 of Artin’s book.
Q1) The number of subgroups of order 5 is of the form 1 + 5k and it divides 4. So, it
has to be 1. Therefore, there is a unique 5-Sylow subgroup. Since any element of
order 5 generates a group of order 5, all the elements of order 5 must be in this
unique subgroup of order 5. Therefore, the group has 4 elements of order 5.(Why?)
Q2) Let G be a group of order pq. Suppose that p < q without loss of generality. The
number of q- Sylow subgroups is of the form 1 + kq and divides p. Since q > p,
k = 0 and so the Sylow group of order q is normal.
Q3) Let G be a group of order p2 q. If q < p, the argument used in the previous problem
can be used to prove that the group order p2 is normal. So, let us assume that q > p.
Let nq be the number of q-Sylow subgroups of the G. Then nq is of the form
1 + kq, k ≥ 0 and nq | p2 . So, nq is 1, p or p2 . If nq = 1, we are done. But, nq
cannot be p because q > p, so nq = 1 + kq > p. So, nq = 1 + kq = p2 . So,
q | p1 − 1 = (p − 1)(p + 1). Since q - p − 1(Why?), q | p + 1. Then, p + 1 = tq > tp
or (t − 1)p < 1. So, t = 1 and q = p + 1. This is not possible if p > 2 and we are
done in this case. So, let us assume that p = 2 so that |G| = 12. The number of
2-Sylow subgroups of G is of the form 1 + 2k and divides 3. So, the only
possibilities are 1 and 3. If it is 3, G acts on the 2-Sylow subgroups by
conjugation. So, there is a homomorphism G −→ S3 . Since 12 - 6, the map is not
injective and so it has a non-trivial kernel which is a normal subgroup of G.
Q4) Check that the order of the group is 21 by considering the number of possibilities
for a and c and applying the multiplication principle from elementary
combinatorics.(See the first
unit in Block 2 of MTE-13.) Then, check that
1, 1 1 0
x= ,y= satisfy the conditions in (4.9)b, where we use i to denote
0, 1 0 4
the residue class of i (mod 7).
Q5) Let D10 = xi yj x2 = y10 = 1, xyx = y−1 and H = hxi × hy5 i where the product
is the internal direct product. (Why is hxi × hy5 i a group?) Then H is a 2-Sylow
subgroup of D10 . We leave it to you to find the conjugates of H.
1 a
Q6) Check that a ∈ Fp is a p-Sylow subgroup.
0 1
Q9) It is enough to prove that any group G of order pe has a subgroup of order pr for all
r ≤ e.(Why?) It is easy to prove this when G is cyclic. We leave it to you to fill in
the details. Let us therefore assume that G is not a cyclic group and apply induction
on e. If e = 1, the result is trivially true. Suppose it is true for all groups of order
pn with n < e. We have to prove that the result is true for groups of order pe .
Let us first show that G has a proper normal subgroup of order, say pk , for some
k < e. If G is abelian, then choose any element g 6= 1 in G and consider the group
N = hgi. Then, |N| = pk for k < e since we have assumed that G is not cyclic. If G
is not abelian, then Z(G) 6= G and Z(G) 6= {1} since G is a group of prime power
order. We have that Z(G) is a normal subgroup of G and it has order pk for some k.
So, in any case, G has normal subgroup N of order pk with k < e. By our induction
hypothesis, for any r ≤ k, N has a subgroup of order pr and therefore G also has a
subgroup of that order. Let e > r > k, and let r = k + v, say, where v > 1. Now,
G
consider the group N which is a group of order pm with m = e − k < e and so it has 29
Study Guide-I a subgroup of order pv . The inverse image of this subgroup under the canonical
G
map G → N will give a subgroup of order pk+v in G. The proof is now complete
by induction.
Solutions to selected exercises in ‘5. The Groups of Order 12.’ in page 232
of Artin’s book.
Q1) For all the abelian groups, the class equation is |1 + 1 + 1{z+ · · · + 1}. This leaves
12 times
us with three non abelian groups given in Theorem (5.1) in page 209.
Let K1 , K2 , K3 and K4 have the same meaning they have in the discussion of
case 2 in page 210 of Artin’s
4 book.
If g 6= 1 is any element not in H, then it is in
one
3 of the K
i s because ∪i=1 i \ {1} = 8 and the intersection of H with
K
∪i=1 Ki \ {1} is the empty set because H ∩ Ki = {1}. Let x ∈ K1 , x 6= 1. If g,
g1 in H are such that gxg−1 = g1 xg−1 −1
1 , since K1 = hxi, gg1 ∈ N(K1 ) = K1 , But,
H ∩ K1 = {1}, so g = g1 . So, the remaining 8 elements are in two conjugacy
classes of 4 elements each. Thus, the class equation of A4 is 1 + 3 + 4 + 4.
Let us now consider the dihedral group D6 of order 12. This is exercise 8 a) in
page 229. Here are some more details that may enable to solve the exercise in
case you haven’t solved it earlier. Let
D6 = xi yj x2 = y6 = 1, xyx = y−1
According to the discussion in last paragraph in page 210 of Artin’s book, this
corresponds to the case where H is the Klein four group. In this case the
subgroup of order 6, hyi, is normal and it is the union of conjugacy classes.
Further, since hyi ⊂ Z(g) for each g ∈ hyi, each conjugacy class of G contained
in hyi will have either one element or two elements. Since
xyx = xyx −1 = y−1 = y5 and xy4 x−1 = y−4 = y2 , there are two conjugacy
You can check that the remaining six elements are in two conjugacy classes of
order three. Determine also the conjugacy classes.
Q2) First dispose off the case p = 2. Then, modify the proof of Corollary(4.4) in
page 205 of Artin’s book to prove the result.
Q4) a) The 11-Sylow subgroup is normal. Let us write H for the 11-Sylow
subgroup. Let K be a 5-Sylow subgroup of G. Let H = hxi and K = hyi.
30 Since H is normal, yxy−1 ∈ H. So, yxy−1 = xr , 1 ≤ r ≤ 10.
b) Let φy (x) = yxy−1 . Then φy is an automorphism of H and o φy = o(r)
Sylow Theorems and The
Symmetric Group
where r denotes the residue class of r (mod 11). Since o φy = o(y) = 5,
o(r) = 5. Since Z∗11 is a cyclic group of order 10, the set of elements in Z∗11
that satisfy g5 = 1 are precisely the squares in Z∗11 . So, the allowed values of
r are the set of squares in Z∗11 which is {1, 3, 4, 5, 9}.
1 1
c) Let H be the subgroup of GL2 (F11 ) generated by where 1 is the
0 1
residue class of 1 (mod 5). Let
1 0
K= c ∈ {1, 3, 4, 5, 9}
0 c
Check that
−1 c
1 0 1 1 1 0 1 c 1 1
= =
0 c−1 0 1 0 c−1 0 1 0 1
Q2) Suppose conjugating (1 2 3)(4 5) by σ will yield (2 4 1)(3 5). Then, comparing
the two permutations, 1σ = 2, 2σ = 4, 3σ = 1, 4σ = 3, 5σ = 5. So, σ is the
permutation
1 2 3 4 5
i.e., σ = (1 2 4 3).
2 4 1 3 5
Q4) a) The five cycle (1 2 3 4 5) has order five in S7 . The element (1 2 3 4 5)(6 7)
has order ten because (1 2 3 4 5) has order five and (6 7) has order two and
they commute. Suppose there is an element α which has order 15. Consider
the action of hαi on the set {1, 2, 3, 4, 5, 6, 7}. Every element in the set must
have stabiliser different from {1} because the set has only seven elements
while hαi has order 15. So, the stabiliser of any element has order 3, 5 or
15. Accordingly, the possible orders for the orbits are 1, 3 and 5. Since the
sum of the elements in the orbit add up to seven, the only possibilities are
1 + 3 + 3 and 1 + 1 + 5. In the first case, the stabilisers have order 15, 5 and
5. So, α 3 will belong to the stabilisers of all the elements and so it acts
trivially on the set {1, 2, 3, 4, 5, 6, 7}. So, α 3 = 1. This means that α has
order 5, a contradiction. We leave the other case to you.
b) Let σ be any element of Sn and let σ = c1 c2 . . . ck be its decomposition into
disjoint cycles of length greater than 1. If ai is length of the cycle ci , then
a1 + a2 + · · · + ak < n since each ci is of the form (i1 , i2 , i3 , . . . , ik ) with each
ij ∈ {1, 2, 3, . . . , n} and the cycles are disjoint. Since any two cycles ci and cj
commute, (c1 c2 . . . ck )m = cm m m
1 c2 . . . ck for any integer m. It follows that, if
m
m = lcm (a1 , a2 , · · · , ak ), (c1 c2 . . . ck ) = 1. So, the order of c1 c2 . . . ck
divides lcm (a1 , a2 , · · · , ak ).
Therefore, the maximum possible order of an element in Sn is
To prove the result for n = 2, (a1 , a2 ) = q−1 (1, 2)q for some permutation q. So,
0 1
1 0 .
In−2
where In−2 is the identity matrix of size n − 2. This matrix has determinant −1.
To complete the inductive proof use the relation
where the cycle ci has length ni and r = (n1 − 1) + (n2 − 1) + · · · + (nk − 1).
Q6) We have already proved in problem 4 that, if ai is the order of the cycle ci and
the the cycles are disjoint, the order of the permutation σ = c1 c2 · · · ck divides
lcm (a1 , a2 , . . . , ak ). It remains to show that lcm (a1 , a2 , . . . , ak ) divides the order of
the permutation σ . For this, it is enought to show that a1 , a2 , . . ., ak divide the
order of σ . Let us show that a1 , say, divides the order of σ . Suppose
c1 = (i1 , i2 , . . . , ia1 ). If we show that cm 1 acts as the identity permutation i1 , i2 , . . .,
ia1 , it will follow that a1 divides m. To show that cm 1 fixes i1 , i2 , ia1 , note that
σ m = cm 1 when restricted to the set {i ,
1 2 i , . . . , i a1 } because all the other cycles c2 ,
c3 ,. . .,ck act as identity permutation on the set {i1 , i2 , . . . , ia1 } since the cycles are
pairwise disjoint. Since σ m is identity, it follows that cm 1 is identiy on the set
{i1 , i2 , . . . , ia1 }. Also, c1 leaves fixed every j ∈ {1, 2, 3, . . . , n} \ {i1 , i2 , . . . , ia1 }
since the cycles are disjoint. Thus, cm 1 is the identity permutation and we are
done.
Q7) No, consider the 5-cycle (2 1 3 4 5). This 5-cycle is a conjugate of the 5-cycle
(1 2 3 4 5), but it is not a power of the 5-cycle (1 2 3 4 5).(Check this! Compare
how the powers of (1 2 3 4 5) and (2 1 3 4 5) act on 1 and 2.)
Q8) This is a standard result in combinatorics. A permutation that does not fix any
index fixed is called a derangement. One way of finding the number of
derangements is using the principle of inclusion and exclusion. See page 45 of
Block 2 of the IGNOU course MTE-13, Discrete Mathematics, for the
derivation. It is available online at http://www.egyankosh.ac.in/
32 bitstream/123456789/11595/1/Unit-6.pdf.
Q10) a) We first show that a cycle of length k can be written as a product of k − 1 Sylow Theorems and The
Symmetric Group
transpositions for k ≥ 3:
33
Study Guide-I
34
UNIT 3 SPECIAL GROUPS
Structure Page No.
3.1 Introduction 35
Objectives
3.2 Definitions 35
3.3 The Special Unitary Group 36
3.4 The Special Linear Group 36
3.5 Summary 37
3.6 Solutions/Answers 37
Sections in Artin’s book related to this unit:
Chapter Section
8 1,2 and 4
3.1 INTRODUCTION
In this unit, we will discuss an important class of groups called classical groups. You
are already familiar with three of the classical groups, GLn (R) the general group of
n × n invertible matrices, the orthogonal group and the unitary groups. Recall that the
real(resp. complex) orthogonal group O(n)(R)(resp. O(C)) is the group of n × n
real(resp. complex) matrices P such that Pt P = I and the Unitary group Un is the group
of complex matrices P such that P∗ P = 1 where P∗ is the conjugate transpose of P. In
Sec. 3.2, we will discuss the definitions of various groups that we will study in this unit.
In Sec. 3.3, we will discuss the special unitary group, the subgroup formed by the
unitary matrices of determinant one. In Sec. 3.4, we discuss the special linear group,
the group of real matrices with determinant one.
Objectives
After studying this unit, you should be able to
• define the Symplectic, Orthogonal and Unitary groups;
• explain how to identify SU2 with the sphere S3 ;
• describe the conjugacy classes in SU2 in terms of latitudes of SU2 ;
• explain how the longitudes of SU2 are subgroups which are conjugate to each other;
• explain how to identify SL2 (R) with S1 × R.
3.2 DEFINITIONS
In this section, you will begin the study of the so called classical groups. You will learn
the definitions of the various groups, like the Orthogonal group, the Special Orthogonal
group and the Special Unitary group, that you will study in the later sections of the unit.
E1) Show that, if P ∈ SP2n (R) or P ∈ Op,q (R) where p + q = n, det(P) = ±1. Deduce EXERCISES
that SP2n (R) is a subgroup of GL2n (R) and Op,q is a subgroup of GLn (R).
We conclude this section here. In the next section, we will discuss the special unitary
35
group.
Study Guide-I
3.3 THE SPECIAL UNITARY GROUP
Read section 2 of Chapter 8 in Artin’s book. Attempt problems 1, 2, and 3 under ‘2.
BOOK
The Special Unitary Group’ in page 301 of Artin’s book.
The group of n × n real matrices with determinant one is called the Special Linear
Group. In this section, we will study the group of 2 × 2 real matrices with determinant
1. Note that, unlike the Special Unitary Group which is defined over the field of
complex numbers, the definition of the special linear group, matrices of determinant is
1, makes sense over any field. In particular, it is defined over the finite fields also.
However, we restrict ourselves to the real and complex fields in our course.
Read section 4 of chapter 8 in Artin’s book. Do problems 1 and 3, except for 3(c), in
BOOK
‘4.The Special Linear Group SL2 (R)’ on page 302.
a r a
h.e1 = and hence =
c 0 c
a b a 0 1 0
t
hh = =
0 a−1 b a−1 0 1
that is,
2
a + b2 ba−1 1 0
=
a−1 b a−2 0 1
√ a
!
1 a 1
Qe1 = a2 +c2 =√ =√ Pe1 .
√ c a2 + c2 c a2 + c2
a2 +c2
Further,
√ √
√ a √ c a2 + c2 √ab+cd a2 + c2 √ab+cd
! ! !
a b
Q−1 P = a2 +c2 a2 +c2 = a2 +c2 = a2 +c2
− √a2c+c2 √ a
a2 +c2
c d 0 √ad−bc 0 √ 1
a2 +c2 a2 +c2
So, Q−1 P ∈ H.
Note that a2 + c2 6= 0 since
ad − bc = 1. So, P Q and P Q−1 P are both continuous.
Thus, f−1 (P) = Q, Q−1 P is continuous.
3.5 SUMMARY
3.6 SOLUTIONS/ANSWERS
0 0 0 1
Q11) Let
A B
P=
C D
be any 2n × 2n matrix where A, B, C and D are n × n matrices. We have
t
A Ct
t
P = .
Bt Dt
0 −I
Let P be the matrix where I is the n × n identity matrix and 0 is the
I 0
0 I
n × n zero matrix. Then P = t . So,
−I 0
0 I 0 I
t 0 −I
P JP =
−I 0 −I 0 I 0
0 I
−I 0 0 −I
= =
0 −I I 0 −I 0
At A
! !
0 0
Let P = . Then, Pt = t . Therefore,
0 A−1 0 A−1
A
! t
0 0 I A
t 0
P JP = t
0 A−1 −I 0 0 A−1
A
!
0 At 0 I
0
= t =
− A−1 0 0 A−1 −I 0
t t
since − A−1 At = − AA−1 = −I.
Let P be a 2n × 2n matrix of the form
I B
P=
0 I
I 0
where B is a n × n matrix with B = Bt . Then Pt = . We have
B I
I B 0 I I 0
t
P JP =
0 I −I 0 B I
38
Special Groups
−B I I 0 −B + Bt I
= = =J
−I 0 Bt I −I 0
since −B + Bt = 0.
Solutions to selected exercises in ‘2. The Special Unitary Group SU2 ’, in page 301
of Artin’s book.
a b c d
Q1) Let P = and Q = . Write
−b a −d c
We have
ac − bd ad + bc e f
PQ = = (say)
−bc − ad −bd + ac −f e
We have
a b
Q2) We have to find an invertible matrix A = ∈ SU2 . Note that, since
−b a
a −b
2 2 −1
A ∈ SU2 , a, b has to satisfy |a| + |b| = 1. Also, A = .
b a
a b cos θ − sin θ a −b 0
λ1 (θ )
= ∀ θ ∈ R (2)
−b a sin θ cos θ b a 0 λ2 (θ )
Solutions to selected exercises in ‘4. The Special Linear Group SL2 (R)’, in page
302 of Artin’s book.
a b
Q1) Let us first determine the stabiliser H. Let h = ∈ SL2 (C) be in H. By
c d
definition, we have
a b 1 r
= where r > 0
c d 0 0
But,
a b 1 a
=
c d 0 c
So, a = r > 0 and c = 0. Since det(h) = 1, we obtain
r b
h=
0 1r
a b
since det(h) = 1. Now, if P = ∈ SL2 (C), we set
c d
√ 2a 2 √ −c
|a| +|c| |a|2 +|c|2
Q= . Check that Q ∈ SU2 (C) and Q−1 P ∈ H. Check
√ 2 2 √ 2a 2
c
|a| +|c| |a| +|c|
that f : SL2 (C) −→ SU2 (C) × H given by P Q, Q−1 P and f−1 , given by
Chapter Section
6 7 and 8
12 4
4.1 INTRODUCTION
We have seen that the dihedral group Dn , the group of symmetries of a regular polygon
with n sides, isomorphic to the group generated by two elements x and y which satisfy
the relations yxy = x−1 , xn = 1 and y2 = 1. In this Unit, we will discuss the formal
background behind such representations. In Sec. 4.2, we introduce the concept of free
groups. In the next section, Sec. 4.3, we will see that any group can be realised as the
quotient of a free group. In Sec. 4.4, we will prove the structure theorem for finitely
generated abelian groups which says that we can, in a unique way (upto isomorphism),
write a finitely generated abelian group as a direct sum of cyclic groups.
Objectives
After studying this unit, you should be able to
• define a free group;
• explain the description of groups by generators and relations;
• explain the mapping property of the free groups;
• state and apply the structure theorem for finitely generated abelian groups.
Attempt Exercises 1 and 3 on Page 233 under ‘7. The Free Group.’ EXERCISES
Attempt Exercises 1, 7, 8 and 9 on page 234 under ‘8. Generators and Relations’. EXERCISES
41
Study Guide-I
4.4 STRUCTURE THEOREM FOR FINITELY GENERATED
ABELIAN GROUPS
You are already familiar with the concept of basis of a vector space. If V is a
n-dimensional vector space with a basis {e1 , e2 , . . . , en }, then we can write any element
x of V in the form x = ∑ni=1 xi ei for some scalars x1 , x2 , . . ., xn . In this section, we are
going to consider this notion in the setting of an abelian group.
Since we will be discussing only abelian groups in rest of this unit, we will use + for
the group operation.
Superficially, it would seem that the analogue of finite dimensionality for a vector space
is the finite generation of abelian groups. We know that every finite dimensional vector
space over any field has a basis consisting of finitely many elements. But, as we will
see later, not every finitely generated abelian group has a basis. Let us now formally
define the notion of a basis for an abelian group. It is along the expected lines.
ei = (0, 0, . . . , 1, 0, . . . , 0)
where the ith coordinate is 1, then Zn is a free abelian group with basis e1 , e2 , . . ., en .
∗∗∗
Pn = a0 + a1 X + a2 X2 + · · · + an Xn ∈ Z[X] a0 , a1 , . . . , an ∈ Z
E2) Let G be a finitely generated, free abelian group. Hom(G, Z) be the set of all
group homomorphisms from G to Z. Then, we can give the structure of an abelian
group on Hom(G, Z) by defining (φ + ψ)(g) = φ (g) + ψ(g) for g ∈ G. Show that
Hom(G, Z) is a free abelian group of rank n under this operation.
Once we fix a basis S = {e1 , e2 , . . . , en } in a free abelian group G, we can talk of the
coordinate vectors of an element x with respect to the basis S. If x = ∑ni=1 xi ei , then we
call (x1 , x2 , . . . , xn ) the coordinate vector of G with respect to S. Also, the map
x (x1 , x2 , . . . , xn )
If x = ∑ni=1 xi ei , then
! !
n n n m n m
φ (x) = ∑ xi φ (ei ) = ∑ xi ∑ aij fj =∑ ∑ aij xi fj = ∑ yj fj ,
i=1 i=1 j=1 j=1 i=1 j=1
where
y1 a11 a12 a1n x1
...
y2 a21 a22 ... a2n x2
.. = ..
.. .. ..
. . .
... . .
ym am1 am2 . . . amn xn
Proposition 1: Let G be a finitely generated free abelian group. Then, any two bases
have the same number of elements.
But,
n
ei = 0.e1 + 0.e2 + · · · + 0.ei−1 + 1.ei + 0.ei+1 + · · · + 0.en = ∑ δik ek ,
k=1
We can now define the rank of a finitely generated free abelian group, which is the
analogue of dimension of a vector space.
Definition 6: Let G be a finitely generated free abelian group. Then, the rank of G is
the number of elements in a basis of G. By convention, we will say that the trivial
group has rank zero.
We will prove the structure theorem for finitely generated abelian groups. This says that
any finitely generated abelian group can be written as a direct product of cyclic groups.
The precise statement of the theorem is as follows:
The plan of the proof is as follows: We first prove that any finitely generated abelian
group is isomorphic to a quotient FF21 , where F1 and F2 are free abelian groups. We then
show that we can choose a basis {w1 , w2 , . . . , wm } for F1 and a basis {u1 , u2 , . . . , un } for
F2 with n ≤ m such that ui = di wi for 1 ≤ i ≤ n. From this, we deduce Eqn. (1). We
then prove the uniqueness of representation.
Before we can discuss the proof of the result, we have to prove some elementary results
44 related to free abelian groups.
Proposition 2: Let G be a free abelian group of finite rank n, and H be a subgroup of G. Free Groups
Then, H is a free abelian group of rank at most n.
Proof: If n = 0, both G and H are trivial, and so they have rank 0. Therefore, the result
is true in this case.
If the rank of G is 1, G is an infinite cyclic group. In fact, if G = hvi is any cyclic group
and H is a subgroup, H = hdvi for some d ∈ N. We leave this to you as an exercise.
Since H = hdvi, H is also a free abelian group of rank at most 1. It has rank 0 if d is
zero and rank 1 if d 6= 0. This proves the result for n = 1.
Now, assume that the result is true for all groups of rank ≤ n − 1. Suppose, n, the rank
of G, is greater than one and let H be a subgroup of G. Let {e1 , e2 , . . . , en } be a basis of
G over Z. Let K be the free abelian subgroup of G generated by {e1 , e2 , . . . , en−1 }. Let
us denote the image of x ∈ G under the natural map
G
φ : G −→ ,x x+H
H
by x. We claim that K has rank n − 1 and G K is a free abelian group of rank 1 generated
by en = en + K. Let us see why this is so.
Let x ∈ K G
, where x ∈ G. Then, x = ∑ni=1 ai ei and so x = an en + K since e1 , e2 ,
. . . , en−1 ∈ K. If an en = 0, an en ∈ K and we can write an en as a linear combination of e1 ,
e2 , . . . en−1 , i.e. ∑ni=1 bi ei = 0 with bn = an . Since we can represent zero uniquely as
∑ni=1 0ei , this implies that all the bi s are zero. In particular, an = 0.
Let H be the image of H under the natural homomorphism φ : G −→ G H . If H = {0},
then H ⊂ K and by induction, H is a free abelian group of rank at most n − 1.
If H 6= {0} then H is a free abelian group of rank 1, generated by den for some d ∈ N.
By induction hypothesis, H ∩ K is a free group of rank at most n − 1. Let {f1 , f2 , . . . , fm }
be a basis for H ∩ K over Z, where m ≤ n − 1. Then, we claim that {f1 , f2 , . . . , fm , fm+1 }
is a basis for H over Z, where we write fm+1 = den . Let us see why this is true.
Let us consider an element x ∈ H and let x be its image in G K . Since fm+1 generates H,
x = afm+1 , a ∈ Z. Since x − afm+1 ∈ K and both x and fm+1 are in H,
x − afm+1 ∈ H ∩ K. Since {f1 , f2 , . . . , fm } is a basis for H ∩ K, x − afm+1 = ∑m i=1 ai fi or
x = ∑mi=1 a i fi + afm+1 . This proves that the set {f 1 , f 2 , . . . , f m , f m+1 } generates H over Z.
To prove that the set {f1 , f2 , . . . , fm , fm+1 } forms a basis, we have to show that, if
∑m+1 m+1
i=1 ai fi = 0, ai ∈ Z, then ai = 0, i = 1, 2, . . . , m + 1. If ∑i=1 ai fi = 0, we have
m
am+1 fm+1 = − ∑i=1 ai fi ∈ K since fi ∈ K. So, am+1 fm+1 = 0. Since fm+1 is a basis for H,
am+1 = 0. So, ∑m i=1 ai fi = 0. Since {f1 , f2 , . . . , fm } is a basis for H ∩ K over Z, it follows
that ai = 0 for 1 ≤ i ≤ m. Hence H is a free abelian group with basis {f1 , f2 , . . . , fm+1 }
where m + 1 ≤ n − 1 + 1 = n since m ≤ n − 1. This completes the proof.
Remark 3: The result in Proposition 2 is not true for non-abelian groups. A free
non-abelian group of rank n can have a subgroup of rank greater than n. For example, if
G is the free group on two generators, the subgroup H generated by u = x2 , v = y2 ,
w = xy is isomorphic to the free group on three generators.
Proof:
a) Since G is finitely generated, there are elements x1 , x2 , . . ., xn which generate G.
Let F1 be a free group of rank n and let {e1 , e2 , . . . , en } be a basis of F1 . Let
φ : F1 −→ G be the group homomorphism given by ei xi . This map is onto since
xi generate G.(Why?) Let us call the kernel of this map F2 . Then F2 is a free
abelian group of rank ≤ n by Proposition 2 and FF12 ' G 45
Study Guide-I H
b) There is a subgroup H of F1 such that F2 ⊆ H ⊆ F1 and with ' H under the
F2
natural isomorphism φ : FF12 −→ G induced by φ . Since F1 is a free abelian group
of rank n, H is a free abelian group of rank at most n. The quotient of a finitely
generated abelian group is finitely generated.(See exercise 3 a).) So, FH2 is finitely
generated and therefore H is also finitely generated.
BOOK Read section 4 of chapter 12.
In the second paragraph of page 462 of the book, Artin mentions a final more serious
point in the proof of Theorem 4.11, namely, ensuring that S has a finite set of
generators. Proposition 2 on page 45 takes care of this since S is the subgroup of a
finitely generated abelian group W. So, no gap remains in the proof of Theorem 4.11.
Suppose G is a finitely generated abelian group. Then, as in Proposition 2 on the
previous page, we can get free abelian groups F1 and F2 such that F2 ⊂ F1 , a map
φ : F1 −→ G with kernel F2 . Let us now apply Theorem 4.11 with W = F1 and S = F2 .
We get a basis {w1 , w2 , . . . , wm } of F1 and a basis {u1 , u2 , . . . , un } of F2 such that
ui = di wi , di ≥ 1 for 1 ≤ i ≤ n and di | di+1 for 1 ≤ i ≤ n − 1.
Let us now prove that G is of the form given in Eqn. (1) on page 44. Let r = m − n. Let
d1 , d2 , . . .,dn be as in the statement of Theorem 4.11. We define a map
ψ : Zd1 × Zd2 × · · · × Zdn × Zr −→ FF12 ' G by
n r
(a1 , . . . , an , b1 , b2 , . . . , br ) ∑ ai wi + ∑ bi wn+i (3)
i=1 i=1
So,
n r
ψ a01 , . . . , a0n , b01 , b02 , . . . , b0r = ∑ a0i wi + ∑ b0i wn+i
i=1 i=1
n r
= ∑ a0i wi + ∑ bi wn+i (∵ bi = b0i )
i=1 i=1
n n r
= ∑ ai wi + ∑ yi di wi + ∑ bi wn+i
i=1 i=1 i=1
n r
= ∑ ai wi + ∑ bi wn+i (∵ di wi = 0).
i=1 i=1
Suppose G ' G0 and G ' G00 too. Then, we have to show that r = r0 and di = d0i for
1 ≤ i ≤ n. To prove this, we need the notion of torsion group of an abelian group.
Definition 7: Let G be an abelian group. Then, the torsion group of G, written as Gtor
is the subgroup
{ g ∈ G| dg = 0 for some d ∈ N}
Our strategy for the proof of uniqueness is as follows: Since G0 ' G00 , we have
G0tors ' G00tors . (Check this.) We then prove that G0tors = Zd1 × Zd2 × · · · × Zdn . Similarly,
G00tors = Zd01 × Zd02 × · · · × Zd0n . From this, we deduce that d0i = di .
For any abelian group G and d ∈ N, let us write dG = {dg | g ∈ G}. We leave it to you
to check that, if f : G1 → G2 is a group isomorphism, f(dG1 ) = df(G1 ) = dG2 . We
prove that dn G0 is a free abelian group of rank r and dn G00 is a free abelian group of rank
r0 . Since dn G0 ' dn G ' dn G00 , r = r0 and we are done. Let us now carry out our strategy.
Let g ∈ G0 and suppose that g = (a1 , a2 , . . . , an , b1 , . . . , br ) where ai ∈ Zdi for 1 ≤ i ≤ n
and bi ∈ Z for 1 ≤ i ≤ r. If g ∈ G0tors , there is d ∈ N such that
Lemma 2: Suppose
where G ' H, di , d0i ≥ 1 for 1 ≤ i ≤ n. If di | di+1 and d0i | d0i+1 for 1 ≤ i ≤ n. Then
di = d0i for 1 ≤ i ≤ n.
Proof: We have | G |= ∏ni=1 di =| H |= ∏ni=1 d0i . We will prove the result by induction
on the order of G. If | G |= 1, all the di s, d0i s are 1 and we are done. Suppose the order
of G is greater than 1. It follows that there is a prime p such that p divides |G|. Then,
there is a t such that p divides dt+1 , . . . , dn but p does not divide d1 , d2 , . . . , dt and p
divides d0s+1 , . . . , d0n but p does not divide d01 , d02 , . . . , d0s . Here t ≤ n and s ≤ n. We have
pG ' pH. Also, check that
(
Zd if p - d
pZd =
Zd/p if p | d
So,
pG ' pZd1 × pZd2 × · · · × pZdn ' Zd1 × Zd2 × Zdt × Zdt+1 /p × · · · × Zdn /p . 47
Study Guide-I Similarly,
To complete the proof, we have to show that dn G0 is a free abelian group of rank r. For
this, consider any element g = (a1 , a2 , . . . , an , b1 , . . . , br ) where ai ∈ Zdi for 1 ≤ i ≤ n and
bi ∈ Z for 1 ≤ i ≤ r. Then,
dn g = (dn a1 , dn a2 , . . . , dn an , dn b1 , . . . , dn br ) = (0, 0, . . . , 0, dn b1 , . . . , dn br )
This means that dG0 is the free abelian group generated by f1 , f2 , . . .,fr where fi is the
n + r tuple having dn as the (n + i)th component and all the other components are 0.
There is another way of decomposing a finitely generated abelian group into a product
of cyclic groups. We now state the result.
where the p1 , p2 , . . .,ps are primes, not necessarily distinct. The value of r in Eqn. (4) is
unique and the powers pn11 , pn22 , . . .,pns s are uniquely determined up to the order of the
factors.
Lemma 3: Let C be a finite cyclic group. Then, we can write C as a product of cyclic
groups of prime power order.
Proof: We will prove the result by induction on the order of C. Suppose |C| = n. If
n = 1, there is nothing to prove. Let n > 1. If n is already a power there is nothing to
prove. Suppose n is not a prime power. Suppose n = mpk where (m, p) = 1. Then,
C ' Cpk × Cm where Cpk is a cyclic group of order pk and Cm is a cyclic group of order
m. We leave the proof of this fact as an exercise to you.
Since m < n, by induction hypothesis, we can write Cm as a product of cyclic groups of
prime power order. The result now follows.
Proof of Theorem 3: We can easily prove Theorem 3 now. We apply Lemma 3 to each
of the groups Zd1 , Zd2 , . . .,Zdk in Eqn. (4) to complete the proof of Theorem 3. Since
we can write each of these groups as a product of cyclic groups of prime power order,
we can write the product of these groups as a product of groups of prime power order.
Let us prove that decomposition in Eqn. (4) is unique. Suppose G ' G0 and G ' G00
where
0
G0 ' Zpn1 × Zpn2 × Zpns s × Zr and G00 ' G ' Zpm1 × Zpm2 × Zpms s × Zr . (5)
1 2 1 2
In the equation, Eqn. (5), p1 , p2 , . . . , ps are primes, not necessarily distinct. By allowing
48 some of the mi s and ni s to be zero, we can assume that the number of cyclic groups of
prime power order are the same and the same set of primes occur on both sides because Free Groups
we can add as many factors with ni = 0 or mi = 0 as necessary.
The proof that r = r0 is similar to the proof we gave for Theorem 2. We take
d = ∏si=1 p`i i where `i = max{ni , mi }. Then, as before, dG0 ' dG ' dG00 and we can
show that dG0 and dG00 are free abelian groups of rank r and r0 , respectively. The result
now follows as before.
Let us now prove the uniqueness of the orders of the cyclic groups of prime power
order that occur in Eqn. (4).
Let
pe111 pe112 ... pe11n
pe221 pe222 ... pe22n
.. .. .. .. (6)
. . . .
pekk1 pekk2 . . . pekkn
be the prime powers that occur in the decomposition of G0 . We are abusing the notation
a little here and using the notation p1 , p2 , . . .,pk for primes that are distinct. Also, we
assume that the powers of the primes along the rows are in ascending order, i.e.
ei1 ≤ ei2 ≤ · · · ≤ ei n−1 ≤ ein for i = 1, 2, . . . , k. We take some of the eij s to be zero, if
necessary, so that all the rows have the same number of elements.
For example, if G0 ' Z2 × Z4 × Z3 × Z5 , we would arrange the prime powers as follows:
2 22
30 3
50 5
Similarly, let
pf111 pf112 ... pf11n
pf221 pf222 ... pf22n
.. .. .. .. (7)
. . . .
pfkk1 pfkk2 . . . pfkkn
be the prime powers that occur in the decomposition of G00 where
fi1 ≤ fi2 ≤ · · · ≤ fi n−1 ≤ fin for i = 1, 2, . . . , k. Some of the fij s could be zero.
e f
Now, define di = ∏kj=1 pj ji , d0i = ∏kj=1 pj ji . Note that di is the product of the prime
powers that occur in the ith column on Eqn. (6) and d0i is the product of prime powers
that occur in the ith column of Eqn. (7). Then, since ei1 ≤ ei2 ≤ · · · ≤ ei n−1 ≤ ein and
fi1 ≤ fi2 ≤ · · · ≤ fi n−1 ≤ fin , it follows that di | di+1 and d0i | d0i+1 . Also, regrouping the
terms, we have
G0 ' Zpe11 × Zpe21 × · · · × Zpek1 × Zpe12 × Zpe22 × · · · × Zpek2 × · · ·
1 2 k 1 2 k
Proof: We will prove Eqn. (10). The proof of Eqn. (11) is similar. Since every finite
abelian group is also finitely generated, we can apply Theorem 2 on page 44 to G. But,
Zr is an infinite group for r > 0. So, in Eqn. (1) on page 44, r = 0 if G is a finite abelian
group. The result now follows Theorem 2.
The powers pn11 , . . .,pnkk in Theorem 3 are called the elementary divisors of the group
G. Let us now look at an example to understand Corollary 3.
Example 2: Consider the group
EXERCISES E4) Let Crs be a finite cyclic group of order rs where r and s are integers such that
(r, s) = 1. Then, Crs ' Cr × Cs .
E5) Find the elementary divisors and invariant factors of the group
4.5 SUMMARY
4. The structure theorem for finitely generated abelian groups and its application in
classifying abelian groups of a given order.
4.6 SOLUTIONS/ANSWERS
E2) Suppose that the G has rank n and let e1 , e2 , . . . en be a basis for G.
Define φ1 , φ2 , . . ., φn by
(
0 if i 6= j
φi (ej ) =
1 if i = j
Verify that φ1 , φ2 , . . ., φn forms a basis for Hom(G, Z). Note that Hom(G, Z) is
the analogue of the dual space of a vector space in the case of finitely generated
abelian groups.
G
E3) a) The group H is finitely generated. In fact, if e1 , e2 , . . .,en generate G, then
G
e1 , e2 , . . .,en generate H over Z.
b) No, the group G H need not be free. For example, Z is a free group over Z
generated by 1. But, the quotient Zn = Z/nZ is not a free group. If it is a free
abelian group, it will have a finite basis e1 , e2 , . . ., en .(This is because, if G is
a finitely generated group and S is another generating set, not necessarily
finite, then there is a finite subset of S that generates G. Here 1 generates
Z/nZ over Z.) But, in this basis there will be two representations for zero,
namely 0e1 + 0e2 + · · · + 0en and ne1 + ne2 + · · · + nen .
of groups. The proof is similar to the proof that C6 ' C2 × C3 , given in page 62 of
Artin’s book.
E5) The elementary divisors are 2, 2, 3, 3, 5, 5, 7, 7 and the invariant factors are 210
and 210.
E6) There will be four non-isomorphic abelian groups of this order.
Solutions to exercises 1 and 3 on Page 233 under ‘7. The Free Group.’
Q1) No. The product of two cyclic groups of infinite order is abelian. The free group
on two generators is not abelian. 51
Study Guide-I Q3) Let us write a ∼ b if a and b are conjugates. Given a conjugacy class C there is a
reduced word of smallest length, say k, in the conjugacy class. Suppose
x = x1 x2 · · · xk . Then, x1 6= x−1
k since x ∼ x2 x3 · · · xk−2 and C will have a word of
smaller length, contradicting our choice of x. So, the closed loop formed from x
will be reduced also.
Suppose x0 is another reduced word in C of length k. Then, we claim that x0 is a
cyclic permutation of x so that x and x0 will give the same reduced, closed, loop.
Let us prove in general that if x and x0 are two reduced words of the same length
and x0 = uxu−1 for some reduced word u, then x0 is a cyclic permutation of x.
The proof is by induction on the length of u. Suppose u is a reduced word of
length one. Let us write x = x1 x2 · · · xk and x0 = x01 x02 · · · x0k . Since x and x0 have
the same length, either u and x1 cancel each other or u−1 and xk cancel out each
other. So, u = x1 −1 or u−1 = xk −1 . Suppose, u = x1 −1 . Then, u−1 = x1 and
uxu−1 = x2 x3 · · · xk x1 = x0 . So, x2 = x01 , x3 = x02 , . . .,xk = x0k−1 and x1 = x0k .
Thus, x0 is a cyclic permutation of x. If u−1 = x−1 k , check that
0
x = xk x1 x2 · · · xk−1 .
Suppose that, the result is true whenever the length of u is less than n and we
have uxu−1 = x0 where u is a reduced word of length n, n > 1. Since the length
of uxu−1 is the same as x, there has to be some cancellations and this can happen
only if u = vx−1
1 or u
−1 = x−1 v−1 for some word v of length n − 1. In the first
k
case, we have
uxu−1 = vyv−1 = x0
we can ‘cut it open’ at any point read off clockwise to get a reduced word. Two
reduced words, obtained by cutting open the reduced word at two different
positions will be conjugates of each other. For example, if we cut the above word
before x1 and after x1 , we will get x1 x2 · · · xk and x2 x3 · · · xk x1 and these words
are conjugates of each other. Therefore, if C1 and C2 are two different conjugacy
classes and the reduced words of smallest length x ∈ C1 and x0 ∈ C2 yield the
same reduced, closed, loop, then x and x0 are conjugates of each other, so
C1 = C2 .
Q1) Let G be the group generated by a and b and let H be the subgroup generated by
bab3 and bab2 . It is enough to show that a and b are in H. We have
−1
bab2 bab3 = b−2 a−1 b−1 bab3 = b is in H. Since b and bab−2 in H, a is also in
H.
Q8) All the subgroups of the quarternion group are normal. Only the centre is
characteristic. None of the subgroups of order four are characteristic. For
example, the subgroup {±i, ±1} is mapped to {±j, ±1} by the automorphism φ
defined by φ (±i) = ±j, φ (±j) = ±i and φ (±k) = ∓k, φ (±1) = ±1. (Check that
φ is an automorphism.)
53
UNIT 5 APPLICATIONS OF SEMIGROUPS
Structure
5.1 Introduction
Objectives
5.2 Some Basic Concepts
Semigroups
Free Semigroups
5.3 Connections with (Semi)automata
5.4 Application to Formal Languages
5.5 Summary
5.6 Solutions/Answers To Exercises
5.1 INTRODUCTION
So far you have studied some aspects of groups from the textbook and the material in
this block. Now we look at simpler algebraic structures, i.e., semigroups and monoids.
In your undergraduate studies, you would have come across these algebraic objects in
passing. To start with, we recall their definition and some of their properties here.
Then, in the next two sections, we focus on the applications of semigroups in two
areas, namely, automata and formal languages, though there many other areas in
which the theory of semigroups is applied.
This unit is independent of the textbook by Artin. If you are interested in studying
more about the matter discussed here, you can refer to
Objectives
prove that different bases of a free semigroup/monoid must have the same
cardinality;
Commonly found examples of semigroups are (N, ), (N, ) (Z, ), (Z, ) and (R , ) .
Example 1: Let S . Show that the set of all mappings from S to S , Map (S, S) ,
is a semigroup w.r.t. the composition of mappings.
Example 2: Show that every non-empty set can be turned into a semigroup.
E2) Let S be a non-empty set, and Re l (S) the set of all relations on S , i.e., subsets
of S S . Define : Re l (S) Re l (S) Re l (S) by
‘x (R1 R 2 ) y iff z S s.t. x R 1 z and z R 2 y’ ,
i.e., ‘ ( x , y) R 1 R 2 iff z S s.t. ( x , z) R 1 and (z, y) R 2 ’.
Show that (Re l (S), ) is a semigroup.
[This is called the relation semigroup.]
E3) If (X, ) is a semigroup, then define (X) to be the set of non-empty subsets of
X. Show that ( (X), ) is a semigroup, where
A B a b,a A, b B A, B (X).
[This is called the power semigroup of X.]
You also know that a semigroup (S, ) is a group if (S, ) is a monoid and every
element in S is invertible w.r.t. .
Now, within a monoid, we have a very natural group. Can you guess what it is?
You can check that (G S , ) is a group, and hence the name ‘unit group’ is
appropriate. Why do you think it is called ‘group kernel’ also?
Now, in the case of groups you studied subgroups and group homomorphisms. We
can define analogous objects for semigroups too.
E5) Find the group kernels of (N {0}, ), (Z, ), { (S), } and Map(S, S) , where
S.
E7) Prove that ( ({1, 2, 3}), ) and ( ({a, b, c}), ) are isomorphic semigroups.
3
Can you think of some examples of generating sets? For instance, given any
semigroup (S, ) , is S S ? In fact, it is. So, every semigroup has a generating
set, but the fewer the generators, the easier it is for us to ‘see’ the elements of the
semigroup. For instance, (N, ) N . But (N, ) 1 also, since any element
of N is a finite sum 1 1 1 . In fact, (N, ) is an example of a finitely
generated semigroup, as you will just see.
E8) Prove Theorem 1. (Note that an analogous statement is true for monoids.)
The union of
subsemigroups need not E9) Give an example of a semigroup S and two subsemigroups S1 , S 2 of S such
be a subsemigroup.
that S1 S 2 is not a subsemigroup of S .
Let us now look at a particular kind of semigroup which has extensive applications in
computer science.
4
Example 4: If B {b} , then show that the free semigroup on B is Applications of
Semigroups
FB {b, b b b 2 , b 3 , } , which is isomorphic to (N, ) .
E12) Are (N, ), (N {0}, ) and (R , ) free semigroups? Give reasons for your
answers.
E13) Define a submonoid and a free monoid on a set B , along the same lines as the
definitions related to semigroups.
You may well ask if given B , there always exists FB . The following theorem
tells us about this.
Theorem 2 (Existence): For any set B , there exists a semigroup F which is free
on B .
Proof: Let F {b1b 2 b n | bi B, n N} , the set of all formal products (or strings)
of elements of B .
So, by definition, b1b 2 b n c1c 2 c m iff n m and bi ci i 1 , , n.
Let x , y F . So, x b1 b 2 b r and y c1c 2 c s , for some b i and c j in B .
Define x y b1 b 2 b r c1c 2 c s , i.e., the binary operation on F is just the If x F, then
concatenation (or juxtaposition). x b1b 2 b n for some
Then (F, ) is a semigroup containing B .
b i B. The length of
Now, let f : B G be any map, where G is a semigroup.
x, denoted by l (x), is n.
Define h : F G : h (b1 b r ) f (b1 ) f (b 2 ) f (b r ) .
Then h is a homomorphism, and h extends f .
Suppose h is any other homomorphism from F to G that extends f . Then, for any
c1 c 2 c s F ,
h (c1c 2 c s ) h (c1 ) h (c 2 ) h (c s )
f (c1 ) f (c 2 ) f (c s )
h (c1 c 2 c s ) .
Therefore, h h .
This shows that h is unique, and F is free on B .
Theorem 1 tells us that given a set we can always find a free semigroup on it. The
next question that arises is: can a free semigroup have more than one basis? If so, are
these bases related?
5
In the case of groups you have seen that a free group can have any number of distinct
bases. But, the cardinality of all these bases must be the same. This is also true for
semigroups, as we now prove.
Proof of lemma: Let us prove this diagrammatically, using Fig. 1. The situation here
is as in Fig. 2, where is an isomorphism from F to F .
F ~ F
|
 ̄  ̄
B
B B
~ B
Fig. 2: | : B
B
h
F F
i
2
i f
1 U U
B B
Fig. 3: h extends i 2 f : B F
The reason for discussing semigroups in this course is that they have several
applications. For instance, in biology they are being used for classifying organisms
vis-à-vis the hereditary laws. They are also useful for studying the DNA protein-
coding problem. Semigroups are also being used in some of the social sciences to
study various aspects of social and financial networks.
However, in this unit we shall consider the close relationship that semigroups and
monoids have with applications pertaining to automata and to formal languages.
So, as you can see, a semiautomaton doesn’t have an output function. For this, we
extend this object to an automaton.
Now, an automaton is called finite (or Mealy) if all the sets involved are finite. In
practical examples, there are usually a collection of only two switching states– on and
off. In such a situation, S will be Z 2 Z 2 Z 2 . And A1 and A 2 will look
similar too.
7
Solution: Let us take
s1 : Wasima is angry a 1 : Hari argues calmly
s 2 : Wasima is bored a 2 : Hari shouts
s 3 : Wasima is pleased a 3 : Hari cooks their favourite dishes
b1 : Wasima shouts
b 2 : Wasima is quiet.
Then take S {s1 , s 2 , s3 } , A1 {a1 , a 2 , a 3 } and A 2 {b1 , b 2 } . Define the functions
and by the following tables.
a1 a2 a3 a1 a2 a3
s1 s1 s1 s3 s1 b2 b1 b2
s2 s2 s1 s3 s2 b2 b2 b2
s3 s3 s1 s3 s3 b2 b2 b2
0 1 0 1
s0 s0 s1 s0 0 1
s1 s1 s0 s1 0 1
Give a situation that could be described by (S, A1 , A 2 , , ) .
What E18 tells us is that any semigroup gives rise to a semiautomaton. In fact, the
converse is also true, but we will not be proving it over here. These facts may give
you some sense of how the study of semigroups can help us understand
(semi)automata.
8
Applications of
5.4 APPLICATION TO FORMAL LANGUAGES Semigroups
You may be familiar with two or three languages that are spoken around you, like
English, Hindi, etc. These are examples of ‘natural’ languages. Apart from such
languages, mathematicians and computer scientists have defined ‘formal’ languages.
These languages also have alphabets and words, but the words may not mean anything
to a listener or reader. Such languages focus on syntax, and are useful for studying
linguistic patterns as well as the syntax of programming languages.
So, as you can see from the above, a formal language is about the form (i.e., the
syntax) and not about meaning. Further, the mathematical theory of formal languages
doesn’t study individual languages, but the classes of language, and the mechanisms
that describe these classes. Noam Chomsky, American linguist and philosopher, has
presented an hierarchy of these classes, viz., L1 L 2 L 3 L 4 , where
L1 : Regular languages, characterised by finite state automata; Fig. 4: Noam Chomsky
(born: 1928)
L 2 : Context-free languages, characterised by pushdown automata;
L 3 : Context-sensitive languages, characterised by linear bounded automata;
L 4 : Computable languages, characterised by Turing machines.
Let us see what the semigroups/monoids are that characterise these languages. For
this, let us consider Chomsky’s approach to formal languages. This is based on the
use of grammar, which we now define.
The relation is the heart of the grammar. It tells us how one string transforms into
another. For x FV , y V* , we will write (x, y) also as x y. For instance, if
w1 uxv and w 2 uyv , then x y applied to w1 gives us w 2 . This is also
9
indicated
⇒w by saying that w1 derives w 2 , or that w 2 is derived from w1 , and denoted
by w1 2.
E21) Find a grammar with the alphabet set A that generates A* , i.e., the universal
language.
10
Applications of
Semigroups
Now, let us look at what kind of grammars make up the classes L1 to L 4 in
Chomsky’s hierarchy, mentioned above.
Example 10: Show that L xy n n 0 is a regular language.
Remark: You can check that these languages are also context-free.
Now, there is a lot of work going on in the area of characterising the various languages
in terms of semigroups. We will state, without proof, a result here to help us get an
example of a language which is not regular.
L a n P , where P is a periodic subset of N 0 .
n
periodic if k, n 0 N
such that p n k p n is
Using result, this we can immediately say that a n n N 0 is not regular since
2 constant n n o .
n 2
n N 0 is not a periodic set.
11
E22) Show that L (G1 ) L (G2 ), where
Different grammars can G1 a, b , g 0 , g 0 g 0 g 0 , g 0 aa , g 0 and
G2 a, b , g 0 , g 0 ag 0 a, g 0 aa , g 0 .
generate the same
language.
E23) Check whether a n n 3(mod 4) is regular.
You have seen that a language is a subset of a free monoid. There are some operations
that can be applied on the set of languages over an alphabet A . These are
With this we come to the end of our discussion on semigroups, monoids and their
applications.
5.5 SUMMARY
In this unit we have discussed the following points.
2) What the group kernel (or the unit group) of a monoid is.
5.6 SOLUTIONS/ANSWERS
E1) All three sets are non-empty, and closed w.r.t. the operations given. However,
the first two are semigroups, and (Z, ) is not, since ‘–’ is not an associative
operation.
E7) Define : ({1, 2, 3}) ({a, b, c}) : () , (1) a, (2) b, (3) c , and
extend elementwise.
The subsets of {1, 2, 3} are , {1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}, {1, 2, 3} .
So, under , the images of these are
, {a},{b}, {c}, {a, b}, {a, c}, {b, c}, {a, b, c} .
13
You can check that (S1 S2 ) (S1 ) (S2 ) subsets S1 , S 2 of {1, 2, 3} .
Also, is clearly a monomorphism and an epimorphism. Hence, is an
isomorphism.
E12) (N, ) is not free. [Had it been free, it would be a free commutative semigroup.
So it would be isomorphic to (N, ) . But (N, ) has no identity element,
while (N, ) does. So, we reach a contradiction.]
On the same lines you can show that (N {0}, ) and (R, ) are not free
semigroups.
14
Applications of
Semigroups
E15) Let {a 1 , a 2 , , a r } generate ( Z n , ) , where a i N . Take F to be the free
semigroup on {a 1 , a 2 , , a r } . Define
: F Z n : (x1x 2 x k ) (x1 x 2 x k ) mod n , where x 1 x 2 x k is a
string in F .
Then is surjective, and ( xy) ( x ) ( y) x , y F .
a1 a2 a3
s0 s0 s0 s0
s1 s1 s0 s1
s2 s2 s1 s2
s10 s10 s9 s10
a1 a2 a3
s0 b1 b3 b3
s1 b1 b2 b3
s2 b1 b2 b3
s10 b1 b2 b3
E18) i) Firstly, S .
Next, a b S a , b S .
Thirdly, (a b) c ((a , b), c) and a (b c) (a, (b, c) for
a , b, c S .
Therefore, need not be associative.
So, (S, ) need not be a semigroup.
E19) i) Starting with g 0 , any derivation will lead to a or b and to no other string.
So, L(G ) a, b.
15
ii) Here, any string can be , a, aba.
So, L(G ) , a, aba .
E21) G = A, g 0 , g 0 , g 0 g 0 a a A , g 0 , where g 0 A.
Then L (G ) A* .
E22) In both cases L aa n N .
n
E23) Since n n 3(mod 4) is a periodic set, a n n 3(mod 4) is regular.
16
UNIT 6 CONGRUENCES AND APPLICATIONS
Structure Page No.
6.1 Introduction 71
Objectives
6.2 Basic Results on Congruences 71
6.3 The Chinese Remainder Theorem 75
6.4 The Quadratic Reciprocity Law 78
6.5 Applications of Congruences 87
Primality Testing
Error Checking Using Congruences
6.6 Summary 91
6.7 Solutions/Answers 92
6.1 INTRODUCTION
In this unit, we will discuss congruences and their applications. Gauss, in his book
Disquisitiones Arithmeticae formulated the notion of congruences and introduced the
notation that we use for congruences at present. With the help of the notion of
congruences he revolutionised number theory and changed it from a collection of
isolated results, due to other mathematicians like Euler, Fermat, Lagrange and
Legendre, into a coherent subject. He not only reformulated many results known earlier
in terms of congruences, he also proved many new results. In the recent times, P. Fermat
congruences have led to many interesting applications in computing. In Sec. 9.2, we (1601–1665)
prove basic results regarding congruences using basic concepts from algebra that you
have studied in your degree course. In Sec. 9.3, we will prove the Chinese remainder
theorem, which has many applications, and derive some of its consequences. One of the
results in the study of congruences, which is important from both theoretical and
applications point of view, is the quadratic reciprocity law. In Sec. 9.4, we will prove
quadratic reciprocity which was proved rigorously by Gauss although the result was
known earlier to Euler and Legendre. In Sec. 9.5, we will discuss some applications of
congruences to primality testing and in checking whether an ISBN number is valid or
not.
Objectives C. F. Gauss
(1777–1855)
After studying this unit, you should be able to
• solve linear congruences ( mod n);
• use the Chinese Remainder Theorem to solve simultaneous linear congruences;
• calculate the legendre symbol;
• solve the equation x2 − a = 0 (mod p), when p is a prime and a and p are odd
numbers coprime to each other, using quadratic reciprocity;
• explain how congruences are used for checking primality and for checking whether
an ISBN number is valid number or not.
Proposition 2: a ∈ Z
(n) is a unit if and only if (a, n) = 1.
In the proof of Proposition 2, we showed that, if (a, n) = 1 and u and v are such that
ua + vn = 1, then u is the inverse of a. Translated in terms of congruences, this means
that u is a solution to the equation ax ≡ 1 (mod n). So, to find a−1 , we have to find u
and v such that au + vn = 1.
Recall that, in general, if (p, q) = d, then we can find d using Euclidean algorithm. The
same algorithm can be modified to find x and y such that xp + yq = d. The general
procedure is as follows: There is nothing to do if p = q because d = p = q in this case.
So, suppose p < q. Then, we can find q1 and d1 , 0 ≤ d1 < p such that q = p.q1 + d1 . We
stop if d1 = 0 because p | q and so (p, q) = p. Again, we can find q2 and d2 , 0 ≤ d2 < d1
such that p = q2 d1 + d2 . If d2 = 0, we stop. The gcd is d1 . Otherwise, we find q3 and d3
such that d1 = q3 d2 + d3 and so on. Since d1 > d2 > d3 > · · · , and di are non-negative,
for some n, we must have dn = 0(Why?). Then, dn−1 is the gcd of p and q.
We can modify the same algorithm to find x and y such that xp + yq = d. All we need is
to do some additional ‘book keeping’. We have q = pq1 + d1 , so,
d1 = −q1 p + q = x1 p + y1 q, say. Again, we have p = d1 q2 + d2 , so
d2 = p − d1 q2 = p − q2 (−q1 p + q) = (1 + q1 q2 ) p − q2 q = x2 p + y2 q
But,
Using Eqn. (4) and Eqn. (5), we can easily calculate xi , yi recursively for all i ≥ 3.
Example 1: We have (93, 141) = 3. Find u and v such that 93u + 141v = 3 using
Euclidean algorithm.
Solution: The steps of the Euclidean algorithm are as follows: We have
141 = 1 · 93 + 48 , so, q1 = 1, d1 = 48. Therefore, from Eqn. (4),
x1 = −1, y1 = 1
x2 = 2, y2 = −1
48 = 1 · 45 + 3, So, q3 = 1, d3 = 3, x3 = x1 − q3 x2 = −1 − 2 = −3,
y3 = y1 − q3 y2 = 1 − 1 · 1 = 2. We note that d3 | d2 and so d4 = 0. So, we can stop here.
We have x3 = −3 and y3 = 2, so 3 = −3 · 93 + 2 · 143, i.e. u = −3, v = 2.
∗∗∗ 73
Study Guide-I Let us now look at an example to see how to solve congruences of the type in Eqn. (2).
Example 2: Find a solution to the equation 3x ≡ 5 (mod 7).
Solution: Here (3, 7) = 1. So, we can find u, v ∈ Z such that 3u + 7v = 1. We have
−1
7 = 2 · 3 + 1. So, 1 = 7 − 2 · 3. Therefore, we can take u = −2, v = 1. Hence, 3 = −2.
We have,
−1
x≡3 · 5 ≡ −2 · 5 ≡ −10 ≡ 4 (mod 7)
Thus, x = 4 is a solution to the congruence 3x ≡ 5 (mod 7).
∗∗∗
We next prove a result regarding cancellation of a constant occurring in both the sides
of a congruence.
What can we say about the solution to Eqn. (2) in general? Here is the result.
Proof: Let d = (a, n). If x ∈ Z is a solution to Eqn. (2), then n | (ax − b). Since d | n,
d | (ax − b). Since d | a, d also divides b.
Conversely, if d | b, n | (ax − b) if and only if dn da x − db . So, Eqn. (2) has a solution if
Note that the units in Zn form a group, usually denoted by Z∗n . From Proposition 2, it
follows that
Z∗n = a ∈ Zn \ {0} (a, n) = 1
Proof: For any finite group G and any a ∈ G, we have a|G| = 1. In the case of Z∗n , we
have aφ (n) = 1 ∀a ∈ Z∗n . If a ∈ Z and (a, n) = 1, then a ∈ Z∗n and aφ (n) = 1. Translating
this in the language of congruences, aφ (n) ≡ 1 (mod n).
As it stands, Eqn. (8) doesn’t tell us much regarding the computation of φ (n). Later, we
will see an expression for φ (n) in Eqn. (17). However, when p is a prime, we get the
following interesting result immediately.
We close this section here. In the next section, we will see how to solve simultaneous
congruences, for example, pairs of congruences of the type x ≡ 3 (mod 11), x ≡ 2
(mod 7).
x ≡ 1 (mod 3)
x ≡ 3 (mod 5)
x ≡ 6 (mod 7)
We will see how to solve this using the Chinese Remainder Theorem in Example 4.
∗∗∗ 75
Study Guide-I Theorem 4: If n1 , n2 , . . ., nk are pairwise relative prime integers (i.e. (ni , nj ) = 1 if
i 6= j) and a1 , a2 , . . .,ak are any integers, there is a solution x0 to the following
simultaneous congruences:
x ≡ a1 (mod n1 )
x ≡ a2 (mod n2 )
.. (10)
.
x ≡ an (mod nk )
If x0 and x00 are two solutions, then x0 ≡ x00 (mod N), where N = n1 n2 · · · nk .
Proof: Let us first solve a special case of Eqn. (10). Let us fix an i and suppose that
ai = 1 and aj = 0, for j 6= i. Let
Ni = ∏ nj
j6=i
Then, (Ni , ni ) = 1 and we can find integers a and b such that aNi + bni = 1. This gives
the congruences
For each i, 1 ≤ i ≤ k we find an xi satisfying Eqn. (13) and Eqn. (14). We can use the
xi s to get an x satisfying Eqn. (10) by taking x = a1 x1 + a2 x2 + · · · + ak xk . Then,
x ≡ ai xi ≡ ai (mod ni ) for 1 ≤ i ≤ k since aj xj ≡ 0 (mod ni ) if j 6= i.
If x0 , x00 two solutions to the simultaneous congruences in Eqn. (10), x0 ≡ ai (mod ni )
and x00 ≡ ai (mod ni ), so x0 ≡ x00 (mod ni ) or ni |(x0 − x00 ) for each i. Since ni are
pairwise coprime, N = ∏ ni also divides x0 − x00 , i.e. x0 ≡ x00 (mod N).
Remark 4: In the proof of Theorem 4, we saw that we have to construct xi such that
xi ≡ 0 (mod nj ) for j 6= i and xi ≡ 1 (mod ni ). We constructed such an xi by taking the
solution a to the congruences in Eqn. (11) and Eqn. (12) and multiplying it by Ni . The
−1
congruence in Eqn. (11) implies that a ≡ Ni in Zni . So, if we choose N0i such that
−1
N0i = Ni in Zni , the congruence in Eqn. (13) is satisfied for a = N0i . For all j 6= i, since
Ni ≡ 0 (mod nj ), N0i Ni ≡ 0 (mod nj ). So, we choose xi such that xi = Ni N0i in ZNi ,
multiply the xi by ai and sum them up to get a solution to the congruence in Eqn. (10).
So, if Eqn. (10) is solvable, x = ∑ki=1 ai Ni N0i is a solution to it, where
−1
N = ∏ nj N i = ∏ nj N0i = Ni in Zni
j j6=i
To find the smallest non-negative solution, we take the smallest non-negative residue of
x (mod N).
x ≡ 1 (mod 3)
x ≡ 3 (mod 5)
76 x ≡ 6 (mod 7)
Solution: Let us take n1 = 3, n2 = 5 and n3 = 7. Then N = 105. Also, Congruences and
Applications
−1
N1 = 35 ≡ 2 (mod 3) 2 = 2 in Z3 N01 = 2
−1
N2 = 21 ≡ 1 (mod 5) 1 = 1 in Z5 N02 = 1
−1
N3 = 15 ≡ 1 (mod 7) 1 = 1 in Z7 N03 = 1
So,
So, the minimum number of students in the class is the smallest non-negative residue of
223 (mod 105) which is 13.
∗∗∗
Let us now use Theorem 4 to find more about the structure of Zn . Let n be a natural
number n = pα1 1 pα2 2 · · · pαk k . Then, since (n) ⊂ (pαi ) for 1 ≤ i ≤ k, we have ring
homomorphisms φi : Zn −→ Zpαi . Putting together the φi s, we have a ring
i
homomorphism
g(m) = (a1 , a2 , . . . , ak )
m ≡ a1 (mod pα1 1 )
m ≡ a2 (mod pα2 2 )
..
.
m ≡ ak (mod pαk k )
map is also injective because the Chinese RemainderTheorem also says that if m, m0
are two solutions to the congruences x ≡ ai mod pαi i , then m ≡ m0 (mod n).
Corollary 6: Let n be a natural number n = pα1 1 pα2 2 · · · pαk k . Then, the map g in
Proposition 6 induces an isomorphism
Further,
k
1
φ (n) = n ∏ 1 − (17)
i=1 pi
Also,
Note that the LHS and RHS of Eqn. (20) are trivial rearrangements of each other.
Let us now check Eqn. (19). Now,
{a | 0 ≤ a ≤ pα − 1, p | a} = kp | 0 ≤ k < pα−1
and
kp | 0 ≤ k < pα−1 = pα−1
x≡2 (mod 5)
x≡4 (mod 7)
x≡3 (mod 11)
We close this section here. In the next section, we will discuss solution of quadratic
congruences, i.e. congruences of the type x2 ≡ a (mod n).
In this section, we will prove the quadratic reciprocity law which was proved by Gauss
in his path breaking work Disquisitiones Arithmeticae. When he did this work, he was
not even 18 years old. The result was known to Euler, Legendre and other
mathematicians, but none of them were able to prove it. Gauss called the result
L. Euler ‘Theorem Aureum’ meaning ‘Golden theorem’. He gave several proofs of the theorem.
(1707–1783) Many proofs were given by others also. The proof we will give is due to Eisenstein, one
of the gifted students of Gauss.
Let us consider the congruence x2 ≡ m (mod n) where m and n are odd. Suppose
x2 ≡ m (mod pα1 1 )
x2 ≡ m (mod pα2 2 )
..
.
x2 ≡ m (mod pαk k )
If a ≡ 0 (mod p), this is easy to prove. So, let us assume that a 6≡ 0 (mod p). We can
prove Proposition 7 by starting with a root of x2 ≡ a (mod p) and repeatedly applying
the following lemma.
If α 0 = α + vpk then Eqn. (24) is satisfied. So, if we can find a v such that α 0 = α + vpk
satisfies Eqn. (23), we are done. Let us put α 0 = α + vpk in Eqn. (23) and see if we can
solve for v. Note that p2k | (α 0 − α)2 , so pk+1 | (α 0 − α)2 . So, Eqn. (23) reduces to
where u is defined as in Eqn. (21). From the congruence in Eqn. (25) it follows that
u + 2αv ≡ 0 (mod p)
or
2αv ≡ −u (mod p) (26)
We can solve the last equation for v since (2α, p) = 1. This is because, if p | α, from
the congruence α 2 ≡ a (mod pk ), it will follow that p | a, a contradiction to our choice
of a.
(
a 1 if a is a quadratic residue
= (27)
p −1 if a is not a quadratic residue
a a
Remark 5: Note that pis 1 or −1 according as a is a square in Z∗p or not. So, p is
0
determined by the residue class of a modulo p. Therefore, pa = ap if a ≡ a0
(mod p).
a 1 2 3 4 5 6
a2 1 4 2 2 4 1
a
(p−1)
a 2 ≡ (mod p) (28)
p
a ∗
In particular, a p induces a group homomorphism Zp −→ {1, −1}.
Lemma 5: Let G be a cyclic group of order n and suppose d | n. Then, G has a unique
subgroup of order d given by
x ∈ G xd = 1 (29)
Further,
n n o
x ∈ G xd = 1 = x d x ∈ G (30)
n p−1 o
x ∈ Z∗p x 2 = 1 = x2 x ∈ Z∗p
Let us now look at an example that explains how to use Eqn. (28) for finding the
legendre symbol.
6: Find the
Example
3 19
following legendre
6
symbols:
a) b) c)
7 41 11
Solution:
7−1 3
a) We have 3 2 ≡ 33 ≡ 6 ≡ −1 (mod 7). So,
7 = −1.
41−1
b) We have to find 19 2 ≡ 1920 (mod 41). We have
e ∑ ai xi = ∑ ψ (ai ) xi
ψ
p q
p−1 q−1
= (−1) 2 2 (31)
q p
Remark 6: The quadratic reciprocity is stated often in the following form also.
p q
p−1 q−1
= (−1) 2 2 (32)
q p
This follows form Eqn. (31) because pq = ±1.
Proposition 8: We have
(
2 p2 −1 1 if p ≡ ±1 (mod 8)
= (−1) 8 = (33)
p −1 if p ≡ ±3 (mod 8)
Proposition 9: We have
(
−1 1 if p ≡ 1 (mod 4)
= (34)
p −1 if p ≡ 3 (mod 4)
7 p p
p−1 3
p
p−1 p−1
= (−1)3( 2 ) = (−1) 2 = (−1) 2 (35)
p 7 7 7
p−1
We want to know the primes for which the RHS of Eqn. (35) is 1. It is 1 if both (−1) 2
p
and 7 are −1 or both are 1.
p−1
Let us first consider the case where (−1) 2 = 1 and p7 = 1. From Proposition 9 we
must have p ≡ 1 (mod 4). Also, from the table of squares in Example 5, we have
a ≡ 1, 2 or 4 (mod 7). So, p should satisfy one of the following set of congruences:
Remark 7: Note that, if we don’t have equation Eqn. (35), to check whether
x2 − 7
splits into linear factors modulo a prime p, we will be forced to find p7 for each p.
However, with the help of Eqn. (35), we are able to reduce this to checking whether p is
in one of the finitely many residue classes modulo p, which is much easier to do!
For example, if we want to check whether x2 − 7 splits in 263081503 or not, we need
263081503−1
not compute 7 2 = 7131540751 (mod 263081503). We find that 263081503 ≡ 27
(mod 28) and 27 figures in the list of residue classes we have obtained in Example 7.
So, x2 − 7 splits into linear factors modulo 263081503!
Let us now prove quadratic reciprocity. The proof is along the lines of the proof in the
book Course in Arithmetic by J. P. S ERRE, pages 9—10. For proving quadratic
reciprocity, we need some preliminary results.
Let p be a prime. Let S be any set such that Z∗p is the disjoint union of S and −S where
−S = {−s|s ∈ S}. The set {1, 2, . . . , p−1
2 } has this property. So, we will choose
S = {1, 2, . . . , p−1 ∗
2 }. For s ∈ S and a ∈ Zp , either sa or −sa is in S. So, we can write
sa = es (a)sa where es (a) = ±1 and sa ∈ S. Note that es (a) = 1 if as ∈ S and es (a) = −1,
if as ∈ −S. For example, let us take p = 7, S = {1, 2, 3}. If a = 6, s = 3,
sa = 18 ≡ 4 ≡ −3 ≡ (−1)3 (mod 7). So, e3 (6) = −1 and 63 = 3 in this case. 83
Study Guide-I Proposition 10(Gauss Lemma): For any prime p and a ∈ Z, p - a
a
= ∏ es (a) (36)
p s∈S
Proof: If s and s0 are two distinct elements of S, then sa 6= sa0 . If sa = sa0 , then
es (a)as = es0 (a)as0 or es (a)s = es0 (a)s0 . Therefore, s = ±s0 , which contradicts the choice
of S. So, s sa is a bijection of S to itself. Multiplying the equalities as = es (a)sa , we
get
! !
p−1
a 2
∏ s = ∏ es (a) ∏ sa = ∏ es (a) ∏ s
s∈S s∈S s∈S s∈S s∈S
Hence
!
p−1
a 2 = ∏ es (a)
s∈S
where ai,n ∈ Z.
The proof is not difficult. First, verify it for n = 1, 2, 3. You will be able to prove the
lemma with the insight gained from this. We leave it to you as an exercise.
We also need the following trigonometric lemma.
Lemma 7: We have
sin(2` + 1)x 2 2πj
2
= (−4) ∏ sin x − sin
`
(38)
sin x 1≤j≤` 2` + 1
Let us substitute eix for x in Eqn. (39). Then, LHS of Eqn. (39) becomes
1
x2`+1 − x2`+1 e(2`+1)ix − e−(2`+1)ix sin(2` + 1)x
= = (40)
x − 1x eix − e−ix sin x
Let us write
`−1
P(T) = (−4)` T` + ∑ (−4)j aj,` Tj (41)
84 j=1
Then, Congruences and
Applications
sin(2` + 1)x
= P(sin2 x) (42)
sin x
So, we have
2πj
2 2πj
sin(2` + 1) 2`+1
P sin = 2πj
= 0 for 1 ≤ j ≤ ` (43)
2` + 1 sin 2`+1
In other words,
2πj
sin2 , 1≤j≤`
2` + 1
are the roots of the polynomial P(T). So,
2 2πj
`
P(T) = (−4) ∏ T − sin
`
j=1 2` + 1
Proof of Quadratic reciprocity: Let p and q be distinct, odd primes. As before, let
p−1
S = {1, 2, . . . , }
2
From Proposition 10, Gauss lemma, we get
q
= ∏ es (q)
p s∈S
2π 2π
sin qs = es (q) sin sq
p p
0
(Note that, if a ≡ a0 (mod p), then sin 2πa
p = sin
2πa
. This is because we can write
p
2πa0 0
a = a0 + pr for some r ∈ Z and so sin 2πa
p = sin 2rπ + p = sin 2πa
p . So, it makes
sense to write sin 2πs ∗
p for s ∈ Zp .)
Multiplying these equations and taking into account that s sq is a bijection, we get
q sin 2πqs
p
= ∏ es (q) = ∏ 2πs
p s∈S s∈S sin p
q 2 2πs 2 2πt
q−1
= ∏(−4) ∏ sin2 − sin
p s∈S t∈T p q
2 2πs 2 2πt
(q−1)(p−1)
= (−1) 4
∏ sin p − sin q
s∈S,t∈T
p 2πt 2πs
(q−1)(p−1)
2 2
q
= (−1) 4 ∏ sin q − sin p
s∈S,t∈T
The factors giving qp and qp are identical up to sign. Since there are (p−1)(q−1)
4 of
these, we have
q p
(p−1)(q−1)
= (−1) 4
p q
Proof of Proposition 8: We use Gauss lemma, Proposition 10, to prove this. Let us
p−1 p−1
take a = 2 and S = {1, 2, . . . , 2 }. We have es (2) = 1 if 2s ≤ 2 and es (2) = −1
2
otherwise. From this, we get p = (−1)n(p) where n(p) is the number of integers s
p−1 p−1
such that 4 <s≤ 2 .
Therefore
(
2 1 if p ≡ 1 (mod 8)
=
p −1 if p ≡ 5 (mod 8)
We close this section here. In the next section we will discuss some applications of
86 congruences to primality testing and in checking bar codes.
Congruences and
Applications
6.5 APPLICATIONS OF CONGRUENCES
In this section, we will discuss some applications of congruences. First, we will discuss
some applications of congruences to primality testing. Then, we will discuss the use of
check digits to check errors in International Standard Book Number(ISBN), which is a
unique number assigned to books.
The next result gives a necessary and sufficient condition for a number to be a prime.
Proposition 11: An integer p > 1 is a prime if and only if (p − 1)! ≡ −1 (mod p).
Proof: Suppose p is a prime. Then, Zp is a finite field and so Z∗p is cyclic. If p = 2, the
result is trivially true. So, let us assume that p > 2 is an odd prime. Note that, if
a = a−1 , a2 = 1 and so a = −1 or 1 since these are the only elements in Z∗p that satisfy
x2 = 1. This is because Z∗p is a cyclic group, there is a unique cyclic subgroup of order
2 and all the elements in Z∗p that satisfy x2 = 1 must lie in this subgroup. Since the
cyclic subgroup generated by −1 has order 2, this must be the unique subgroup of order
2. So, for all the other elements of Z∗p , a 6= a−1 . Therefore the terms in the product
∏ a can be grouped into pairs of the form a a−1 . So, this product is 1. Therefore,
a6=1,−1
∏ a = 1.−1. ∏ a = −1 (44)
a∈Z∗p a6=1,−1
From Eqn. (44) and Eqn. (45), we get (p − 1)! = −1 in Z∗p . Translating this in the
language of congruences, we get the result.
Conversely, suppose that (p − 1)! ≡ −1 (mod p), i.e. p | (p − 1)! + 1. If p is not a
prime, it has a prime factor q, 1 < q < p. Since q occurs in the product (p − 1)!, it
doesn’t divide (p − 1)! + 1. But, since q | p and p | (p − 1)! + 1, q has to divide
(p − 1)! + 1, a contradiction.
Proposition 12: If (a, n) = 1 and an−1 6≡ 1 (mod n) for some a, then n is not a prime.
Proposition 13: Let p be an odd prime and let (a, p) = 1. Suppose p − 1 = r2t with r
odd. Then, a satisfies at least one of the following conditions:
i) ar ≡ 1 (mod p)
i
ii) a2 r ≡ −1 (mod p) for some i, 0 ≤ i < t.
Proof: If ar ≡ 1 (mod p), we are done. Suppose ar 6≡ 1 (mod p). Consider the set
m
n o
S = m | a2 r ≡ 1 (mod p)
t
We have a2 r = ap−1 ≡ 1 (mod p). So, S 6= 0/ because t ∈ S. Also, 0 6∈ S because we
have assumed ar 6≡ 1 (mod p). Let t0 = min S. Then, t0 ≥ 1. Also, by choice of t0 ,
t −1
t −1 2 t t −1
ar2 0 6≡ 1 (mod p). However, ar2 0 = ar2 0 ≡ 1 (mod p), so ar2 0 ≡ −1
(mod p).
We will prove the implication only in one direction in this course. We will only show
that Mp is a prime if Mp divides sp−1 .
√ √
Let ω = 2 + 3 and ω = 2 − 3. You can check that ωω = 1.
Lemma 8:
m−1 m−1
Sm = ω 2 + ω2
You can easily prove this by induction. We leave this to you as an exercise.
For an odd prime q, let X denotes the set {(a, b) | a, b ∈ Zq }. We can define binary
operations, addition and multiplication, as follows:
(a, b) + (c, d) = (a + c, b + d)
(a, b)(c, d) = (ac + 3bd, ad + bc)
Then, X is a commutative ring with an identity element and it has q2 elements. So, X∗ ,
88 the group of units in X, has at most q2 − 1 elements.
Proposition 14: Mp is a prime if Mp divides Sp−1 . Congruences and
Applications
p−2 p−2
Proof: If Mp | Sp−1 , from Lemma 8, it follows ω 2 + ω 2 ≡ 0 (mod Mp ), so
p−2 p−2 p−2
ω 2 + ω 2 = NMp for some integer N. Multiplying this by ω 2 we find that
p−1 p−2
ω2 = NMp ω 2 − 1. (46)
Squaring
p
p−2
2
ω 2 = NMp ω 2 − 1 (47)
Suppose Mp is composite. 2
√ Then, there is a prime divisor q of√Mp , q odd, with q ≤ Mp .
Consider
√ the ring {a + b 3 | a, b ∈ Z}. Then, the map f : Z[ 3] −→ X,
a+ b 3 (a, b) gives aring homomorphism. Consider f(ω). Since q | Mp ,
p−1
= 0. So, from Eqn. (46) and Eqn. (47), f (ω)2
p−1 p−1
f NMp ω 2 = NMp f ω 2 = −1
2p
and f (ω) = 1. (Here we need that fact that q is an odd prime.) So, f(ω) ∈ X∗ has
order 2p . The order of f(ω) divides |X∗ | so 2p ≤ q2 − 1. But, q2 − 1 ≤ Mp − 1 = 2p − 2
and we have a contradiction.
Here are some exercises that asks you to fill in the details in the proof of Proposition 14.
You know that data is transmitted as strings of 0s and 1s. Suppose we transmit a string
x1 x2 . . . xn where each xi is 0 or 1. We add one more digit xn+1 so that
x1 + x2 + · · · + xn ≡ xn+1 (mod 2)
so that
x1 + x2 + · · · + xn + xn+1 ≡ 0 (mod 2)
Suppose we transmit the string x1 x2 . . . xn+1 and the recipient receives the string
y1 y2 . . . yn+1 . The recipient checks if y1 + y2 + · · · + yn+1 ≡ 0 (mod 2). If it is not, at
least one of the bits xi has been changed and the recipient can ask us to transmit the
string again.
Of course, this method can detect only one error, i.e. if one of the zeros has been
changed to 1 or a 1 has been changed to 0. If two bits are changed during transmission,
the test can’t detect it.
Every book published recently has a unique number associated to it called the ISBN
number. This is a sequence of 9 digits x1 , x2 , . . . , x9 together with a check digit
x10 ∈ {0, 1, 2, . . . , 9, X}. We use the single digit X to represent 10. We can find the
check digit of the number from the first 9 digits and by comparing it with the check
digit, we can check whether the ISBN number is correct or not.
Suppose we order a book over the telephone and give the dealer the ISBN number of
the book. How can the dealer make sure that he has correctly noted down the ISBN
number? Let us see how the dealer can check whether the ISBN number is correct
using the arithmetic modulo 11. 89
Study Guide-I Let us now look at an example to know how to use the check digit.
Example 8: The ISBN number of the prescribed text book for the course, Artin’s
Algebra book is 81-203-0871-9. The last digit is the check digit. Check whether the
number is correct using the check digit.
Solution: Let us first find the check digit from the first 9 digits and see if it matches
with the check digit for the book which is the last digit, namely 9. We have
8 1 2 0 3 0 8 7 1
l l l l l l l l l
x1 x2 x3 x4 x5 x6 x7 x8 x9
EXERCISES E9) Check whether the ISBN number 0-387-97329-X is a valid ISBN number.
Another way to to look at computation is as follows: We can look at the ISBN Number
as an element of Z911 by putting it in the form (8, 1, 2, 0, 3, 0, 8, 7, 1). If (a1 , a2 , . . . , a9 )
and (b1 , b2 , . . . , b9 ) are in Z911 , consider ‘dot product’:
(a1 , a2 , . . . , a9 ) · (b1 , b2 , . . . , b9 ) = a1 b1 + a2 b2 + · · · + a9 b9
Let us write α = 1, 2, · · · , 9 . Then, the check digit is the number i, 0 ≤ i ≤ 10, such
that i ≡ α · x (mod 11) where x is the 9 digit ISBN number regarded as an element of
Z911 .
From 2007, all the new books will have a 13 digit ISBN number instead of 10 digits.
The check digit is calculated as follows:
Suppose x1 x2 . . . x12 are the first 12 digits of the ISBN number:
i) Starting from the left, multiply the first digit by 1, the second digit by 3, the third
digit by 1 again, fourth digit by 3 again, etc. and add them up. In other words, we
multiply the odd digits by 1 and even digits by 3 and add them up.
ii) Reduce the number modulo 10 and subtract it from 10, i.e. take the additive
inverse modulo 10 of the number. This will give the check digit.
Let us look at an example now.
Example 9: Calculate the check digit of the ISBN number 978-0-11-000222. We have
(9 + 8 + 1 + 0 + 0 + 2) + 3(7 + 0 + 1 + 0 + 2 + 2) = 20 + 36 = 56 ≡ 6 (mod 10) and
−6 ≡ 4 (mod 10). So, the check digit is 4.
90 ∗∗∗
We regard the 12 digit ISBN number as an element of Z12 9
10 instead of Z11 . We let
Congruences and
12
β = (1, 3, 1, 3, 1, 3, 1, 3, 1, 3, 1, 3). If x ∈ Z10 is any ISBN number, the check digit is the Applications
Then, a single position error ai → a0i is undetectable if and only if αi (ai − a0i ) ≡ 0
(mod `) and a transposition error that
interchanges ith position and jth position is
undetectable if and only if ai − aj αi − αj ≡ 0 (mod `)
Proof: Suppose there is a single error in the ith position, ai has been replaced by a0i . Let
a be the correct number and let b be the number with ith digit changed. Then the
difference α · (a − b) is (ai − a0i )αi . We will not be able to detect this error if and only if
(ai − a0i )αi ≡ 0 (mod `).
Suppose there is a transposition error where . . . ai ai+1 . . . aj aj+1 . . . is replaced by
. . . aj ai+1 . . . ai aj+1 . . .. Once again, let a be the correct number and
b be the wrong
number. The difference α(a − b) = αi ai + αj aj − αj ai + αi aj = αi − αj ai − aj .
6.6 SUMMARY
In this Unit, we have discussed the following:
So,
52 = 25 ≡ 3 (mod 11)
4 2
5 ≡ 3 ≡ 9 (mod 11)
55 ≡ 9 × 5 = 45 ≡ 1 (mod 11)
5
92 So, 11 = 1. We leave part b) to you.
1
2n+1
E6) Proof:Note that, the lemma says that x2n+1 − x2n+1 is the sum of x − x1 with Congruences and
Applications
a polynomial in
1 1 3 1 2n−1
x− , x− ,..., x−
x x x
with integer coefficients.
We apply induction on n. For n = 1, we have
1 1 3 1
3
x − = x− +3 x−
x x x
So, the result is true for n = 1.
Suppose for all k ≤ n − 1, we have
where ai,k ∈ Z.
1 2n+1 n
1
x− = x2n+1 + ∑ (−1)i C(2n + 1, i)x2n+1−i −i
x i=1 x
2n
1 1
+ ∑ (−1)i C(2n + 1, i)x2n+1−i − 2n+1
i=n+1 x −i x
1 1 2n+1 n
∴ x2n+1 − = x− − ∑ (−1)i C(2n + 1, i)x2n+1−2i
x2n+1 x i=1
2n
− ∑ (−1)i C(2n + 1, i)x2n+1−2i (50)
i=n+1
is a polynomial in
1 1 3 1 2n−1
x− , x− ,..., x−
x x x
with integer coefficients.
We now group the term in the first sum corresponding to i = 1, which is
−C(2n + 1, 1)x2n−1 , with the term corresponding to i = 2n in the second sum
which is
1
(−1)2n−m+1 C(2n + 1, 2n − m + 1)x2n+1−(2n−m+1)
x2n−m+1
= −(−1)m C(2n + 1, 2n − m + 1)x2n+1−(2n−m+1)−(2n−m+1)
= −(−1)m C(2n + 1, m)x−(2(n−m)+1) (53)
Grouping the terms in Eqn. (52) and Eqn. (53) together, we get the term
(−1)m C(2n + 1, m) x2(n−m)+1 − x−(2(n−m)+1)
x2(n−i)+1 − x−(2(n−i)+1)
is a polynomial in
1 1 3 1 2(n−i)+1
x− , x− ,..., x−
x x x
E7) Since ω + ω = 4, it is true for m = 1. Apply induction on m using the fact that
m−1 2
m−1 m m
ω2 + ω2 − 2 = ω2 + ω2
since ωω = 1.
E8) We have a, b c, d = ac + 3bd, ad + bc = ca + 3db, da + cb = c, d a, b
94
Errata Errata
Page Replace By
i i
88 a2 r ≡ −1 (mod p) for some i, 0 ≤ i ≤ a2 r ≡ −1 (mod p) for some i, 0 ≤ i <
t. t.
95