Groups, Rings, and Modules Overview
Groups, Rings, and Modules Overview
Lent 2005
A
L TEXed by Sebastian Pancratz
ii
These notes are based on a course of lectures given by Prof. C.J.B. Brookes in Part IB of
the Mathematical Tripos at the University of Cambridge in the academic year 20042005.
These notes have not been checked by Prof. C.J.B. Brookes and should not be regarded
as ocial notes for the course. In particular, the responsibility for any errors is mine
please email Sebastian Pancratz (sfp25) with any comments or corrections.
Contents
Introduction i
1 Groups 1
1.1 Basic concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2 Rings 17
2.1 Denitions and examples . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.3 Integral domains, elds of fractions, maximal ideals and prime ideals . . 23
3 Modules 41
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
3.4 Modules over F[X] for a eld F normal forms for matrices . . . . . . 51
Introduction
Groups
This part of the course builds on the Part IA course Algebra & Geometry. The rst
punchline we aim for will be Sylow's theorems. We'll especially think p-groups, that is,
groups of order pa .
Rings
A ring is a set R together with two operations + and ×, where multiplication is distrib-
utive over addition. This includes elds, but also the integers Z and polynomial rings
C[X]. We will concentrate on commutative rings.
Number theory
The Gaussian integers are Z[i] ⊂ Q[i] ⊂ C. This is an example of adjoining roots of
integral polynomials to Q, or Z. These rings analogous to integers do not necessarily
have unique factorization. In fact, we do get unique factorization for Gaussian integers
since Euclid's algorithm works.
Algebraic geometry
A family J of polynomials in C[x1 , . . . , xn ] in n variables induces a subset of Cn which
consists of(λ1 , . . . , λn ) giving you zero for all polynomials in J , that is, a set of common
roots. Hilbert's basis theorem implies that all but nitely many of the polynomials in J
are redundant. The set of common zeros of J is equal to the set of common zeros of a
nite subset of J .
Modules
Modules are a generalisation of vector spaces, using scalars from a ring rather than a
eld. Our aim is a structure theorem for rings in which Euclid's algorithm works, for
example Z. We also consider the structure of algebraic groups, which is important in
algebraic topology, as well as C[X] and the Jordan normal form.
Lecture 1
ii Introduction
E-mail
The lecturer can be contacted at brookes@[Link].
Books
• There is a long list of recommended books in the schedules.
Lecture 1
Chapter 1
Groups
1.1 Basic concepts
Denition (Group) . A set G is a group if there is a binary operation G × G → G,
satisfying
(iii) There exists e∈G such that, for all g ∈ G, eg = ge = g (identity element).
Example. (i) Additive groups, e.g. (Z, +), (R, +), (C, +).
(ii) Matrix groups, such as the general linear group GL(Rn ), i.e. the group of invertible
n×n n
matrices, or the special linear group SL(R ), i.e. matrices with determinant 1.
(iv) Cyclic groups of order n, i.e. {e, g, . . . , g n−1 }, and dihedral groups D2n of order
2n, i.e. the symmetries of the regular n-gon.
g1 ∼ g2 ⇐⇒ g1−1 g2 ∈ H.
gH = {gh : h ∈ H}.
All these cosets are of the same size |H|. Counting the elements of G we obtain
Lecture 1
2 Groups
|G| = |H||G : H|
Note 1. You can also do the same with the equivalence relation g1 ∼ g2 ⇐⇒ g2 g1−1 ∈
H, obtaining right cosets as equivalence classes. The number of right cosets is equal to
the number of left cosets.
Denition (Order). The order o(g) of g ∈ G is the least n ∈ N such that g n = e. Note
this implies that if g
m =e then o(g) | m.
Proof. The set {e, g, . . . , g o(g)−1 } is a subgroup of G. Now apply Theorem 1.1.
Example. In abelian groups all subgroups are normal. In general, {e} and G are always
normal. In the dihedral group D2n , the subgroup of rotations is normal but {e, τ }, for
a reection τ, is not normal if n ≥ 3.
Proposition 1.3. K / G.
Let Then the set of cosets of K in G form a group under the
operation g1 K.g2 K = g1 g2 K .
Proof. We need to check that the denition is well-dened, that is, if g1 K = g10 K and
g2 K = g20 K then g1 g2 K = g10 g20 K . (See Algebra & Geometry.)
The identity element is K, the inverse of gK is g −1 K . Associativity is inherited from
G.
Lecture 2
3
Proof. (i) Let K = {g ∈ G : θ(g) = e}. Then the left coset gK is the set {g1 ∈ G :
θ(g1 ) = θ(g)} since
θ(g1 ) = θ(g)
⇐⇒ θ(g −1 g1 ) = e
⇐⇒ g −1 g1 ∈ K
⇐⇒ g1 ∈ gK.
Similarly, the right coset Kg is the set {g1 ∈ G : θ(g1 ) = θ(g)}. Thus Kg = gK .
(ii) We need to check that θ(g1 )θ(g2 )−1 ∈ Im θ for all g1 , g 2 . But this is θ(g1 g2−1 ).
Observe that `isomorphic to' denes an equivalence relation on groups. The notation
used is ∼
=.
g1 K = g2 K =⇒ g2−1 g1 ∈ K
=⇒ θ(g2−1 g1 ) = e
=⇒ θ(g2 )−1 θ(g1 ) = e
=⇒ θ(g1 ) = θ(g2 ).
Φ is a homomorphism,
This is a group homomorphism, θ(r1 +r2 ) = θ(r1 )θ(r2 ). Its kernel is (Z, +). Theorem1.5
implies R/Z ∼
= Im θ = {e2πir : r ∈ R}. The map Φ as above is Φ(Z + r) = e2πir .
Lecture 2
4 Groups
θ(h1 h2 ) = h1 h2 K
= h1 Kh2 K
= θ(h1 )θ(h2 ).
So Theorem 1.5 implies that H/ ker θ ∼
= Im θ. Now
ker θ = {h ∈ H : hK = K} = H ∩ K
Im θ = {hK : h ∈ H} = (HK)/K,
cosets of K in HK . Thus H/(H ∩ K) ∼
= (HK)/K , as required.
In fact, there is a bijection between subgroups of G/K and the subgroups of G containing
K. This is given by the maps
Take a normal subgroup K/G and form the quotient group G/K , with elements the
cosets gK and multiplication dened by g1 Kg2 K = g1 g2 K . The correspondence
restricts to
that is,
L/K / G/K ⇐⇒ L / G.
Lecture 3
5
g1 K = g2 K
=⇒ g1−1 g2 ∈ K ≤ L
=⇒ g1 L = g2 L.
θ is a homomorphism,
Clearly, Im θ = G/L. Also ker θ = {gK : gL = L} = L/K and so Theorem 1.5 implies
(G/K)/(L/K) ∼
= G/L.
Denition (Simplicity) . G is simple if the only normal subgroups in it are {e} and G
itself.
Proof. `Normal' for abelian groups is easy to understand as all subgroups are normal
because of commutativity. Take a non-trivial element g 6= e in a simple abelian group
G.
Then either g is of innite order and G contains the subgroup {. . . , g −1 , e, g, g 2 , . . . }.
Simplicity implies that G = {. . . , g −1 , e, g, g 2 , . . . }. But then there is also the subgroup
{. . . , g −2 , e, g 2 , g 4 , . . . }, a contradiction.
Or g is of nite order and G has the subgroup {e, g, g 2 , . . . , g o(g)−1 }. Simplicity implies
that G = {e, g, g 2 , . . . , g o(g)−1 }. But if o(g) = rs, say, then we also have the subgroup
{e, g r , g 2r , . . . , g r(s−1) }. Now simplicity implies that there are no non-trivial factorisa-
tions of o(g) so G = {e, g, . . . , g
p−1 } for some prime p.
We shall see shortly that the alternating group A5 on {1, . . . , 5} is simple. In fact, An
is simple for n ≥ 5, but we don't need to know the proof. Note that |A3 | = 3, so
A3 is isomorphic to C3 and simple. |A4 | = 12 and A4 contains the normal subgroup
V = {e, (12)(34), (13)(24), (14)(23)} / A4 , so A4 is not simple.
There are nitely many families of nite simple groups, and 26 groups that do not t
into those general families, called the sporadic ones.
Note|A5 | = 60 and |A6 | = 360. There is another nite simple group of order 168, GL3 (Z
mod 2), the group of invertible 3 × 3 matrices entries {0, 1} modulo 2. (This is left as
an exercise.)
Lecture 3
6 Groups
Theorem 1.8. Let G be a nite group. Then there are subgroups H1 , . . . , H s such that
G = H1 ≥ H2 ≥ · · · ≥ Hs = {e}
Remark. The simple groups Hi /Hi+1 are essentially unique, although there may be
lots of ways of picking the subgroups Hi , called composition factors.
Remark. The terminology comes from Galois theory, see the relevant Part II course.
You all know a formula for solving quadratic equations. You can do something similar
involving roots for cubics and quartics, the process is called `solution by radicals'. But
you cannot necessarily do it for a quintic, e.g. t5 − 6t + 3 = 0. This can be shown by
dening a Galois group associated with the polynomial. If you can solve the polynomial
by radicals then the Galois group is soluble.
|Sn | = n!,
n!
|An | = .
2
We can quickly see that the alternating group An / Sn is a normal subgroup. There are
two left cosets of An , Sn \ An = gAn for all permutations g and two right cosets of An ,
S n \ An = An g . Thus the left cosets are equal to the right cosets.
More generally, for any set Ω you can dene the symmetric group Sym G of all bijective
maps Ω → Ω.
Lecture 4
7
Example. Sn , An , D2n ≤ Sym Ω where Ω is the set of vertices of the regular n-gon.
Denition. An action of a group G on a set Ω is a map G × Ω → Ω, (g, α) 7→ g ∗ α,
often also written as g(α), such that
(ii) g1 ∗ (g2 ∗ α) = g1 g2 ∗ α,
(iii) e ∗ α = α.
Lemma 1.9. Let G act on Ω. Then for g∈G the map Φg : α 7→ g ∗ α is a permutation
of Ω and the map
Φ : G → Sym Ω, g 7→ Φg
is a homomorphism, a permutation representation of G.
The image Φ(G) Ω
is a subgroup of Sym Ω, denoted G . The kernel of Φ, ker Φ = {g ∈
G : g ∗ α = α for all α ∈ Ω}, also denoted G(Ω) , is a normal subgroup and G/GΩ ∼ = GΩ .
Example. (i) Let G be the symmetry group of the cube, |G| = 48, and Ω the set of
diagonals, |Ω| = 4. G acts on Ω,
G Ω = S4 ,
G(Ω) = {e, symmetry sending each vertex to the opposite one}.
g ∗ g1 = gg1 .
(iii) For a subgroup H of G, let Ω be the set of left cosets of H. Consider the action
of G on Ω given by
g ∗ g1 H = gg1 H.
T −1
The kernel is g1 ∈G g1 Hg1 since, for all g1 ∈ G,
gg1 H = g1 H
⇐⇒ g1−1 gg1 H = H
⇐⇒ g1−1 gg1 ∈ H
⇐⇒ g ∈ g1 Hg1−1 .
Lecture 4
8 Groups
Proof. Let K be the kernel of the action of G on Ω, where Ω is the set of left cosets
of H. Lemma 1.9 implies that G/K ∼
= GΩ , a subgroup of Sn . Lagrange's theorem 1.1
implies subgroups of Sn have order dividing n!. Since K ≤ H and H is of index n in G
we have that |G/K| ≥ n.
Now assume G is non-abelian simple. Then K = {e}, and so G∼
= GΩ subgroup of Sn .
But An / Sn and so GΩ ∩ An / GΩ . Simplicity implies that either
GΩ ∩ An = {e} or G Ω ∩ An = G Ω .
GΩ /(GΩ ∩ An ) ∼
= (GΩ An )/An ≤ Sn /An
and hence |GΩ | ≤ 2 since GΩ ∩ An = {e}, a contradiction. Finally, we have n>4 since
A4 has no non-abelian simple subgroups.
Let us recall some denitions from the Part IA course Algebra & Geometry. Let G act
on Ω.
Denition. The orbit on Ω containing α is G(α) = {g ∗ α : g ∈ G}. The stabiliser of
α is Gα = {g ∈ G : g ∗ α = α}.
Theorem 1.12 (Orbit-stabiliser theorem). Let G act on Ω. Then |G : Gα | = |G(α)|.
g ∗ x = gxg −1 .
G(Ω) = {g ∈ G : ∀x ∈ Ω gxg −1 = x}
Lecture 5
9
= {g ∈ G : ∀x ∈ Ω gx = xg}
\
= CG (x)
x∈G
Lemma 1.13.
P
Let G be a nite group. Then 1= 1/|CG (x)|, summing over distinct
conjugacy classes.
Proof.
P
We count the elements of G. |G| = |cclG (x)|, so
X |G|
|G| =
|CG (x)|
since |cclG (x)| = |G|/|CG (x)|. Now divide by |G|.
Example (Conjugacy classes of Sn and An ). Recall from Algebra & Geometry that two
permutations are conjugate in Sn precisely when they have the same cycle type when
expressed in disjoint cycle form, e.g. 22 1 denotes the cycle type of double transpositions
in S5 , e.g. (1 2)(3 4)(5).
In S5 , we have the following cycle types:
cycle type 15 22 1 3 12 5 2 13 32 41
# elements 1 15 20 24 10 20 30
even permutations odd permutations
If all permutations that commute with x are even, i.e. CSn (x) = CAn (x),
|Sn | 2|An |
|cclSn (x)| = = = 2|cclAn (x)|.
|CSn (x)| CAn (x)
If there is an odd permutation commuting with x, i.e. |CSn (x) : CAn (x)| = 2,
|Sn | |An |
|cclSn (x)| = = = |cclAn (x)|.
|CSn (x)| CAn (x)
Example. Consider A5 , take x ∈ A5 . (1 2)(3 4)(5) commutes with (1 2),
S5 and
(1 2 3)(4)(5) commutes with (4 5). But conjugacy classes of 5 cycles do not split into 2
conjugacy classes of size 12. Consider (1 2 3 4 5) and its conjugates.
Lecture 5
10 Groups
But the size of conjugacy classes divides |G| = pn . So |cclG (x)| = pa for some 0 ≤ a ≤ n.
But p divides |G|. So p divides the number of conjugacy classes of size 1. But {x} is a
conjugacy class if and only if
∀g ∈ G gxg −1 = x
⇐⇒ x ∈ Z(G)
Proof. Suppose gZ generates G/Z(G) and thus each coset in Z is of the form (gZ)r =
gr Z for some r. Thus any
r
element x ∈ G is of the form g z for some z ∈ Z, r ∈ N.
x1 x2 = (g r1 z1 )(g r2 z2 )
= g r 1 g r 2 z 1 z2
= (g r2 z2 )(g r1 z1 )
= x2 x1
Lecture 5
11
Proof. By induction on a. The result is clearly true for a = 1. Suppose a > 1 and
pick an element x ∈ Z(G), x 6= e, noting that by Theorem 1.15 the centre is non-
trivial. By taking a suitable power of it, we may assume that x is of order p. Hence
K = {e, x, . . . , xp−1 } is a subgroup of
G, and it is normal in G since x is central.
Consider the quotient G/K .pa−1 and we can apply the inductive hypothesis
Its order is
to get L/K ≤ G/K of order p
b−1 . The correspondence between subgroups of G/K and
G∼
= Cd1 × Cd2 × · · · × Cdk
with di+1 | di and d1 · · · dk = |G|.
Remark (Remark on proof ). One can be less sophisticated. As the rst step, consider
an abelian p-group G hxi generated by x ∈ G of maximal order. If
and the subgroup
hxi = G we are done. Otherwise, if hxi 6= G, the claim is that there exists a subgroup
H ≤ G with hxiH = G, hxi ∩ H = {e}, i.e. G ∼ = hxi × H .
Example. The abelian groups of order 8 are C8 , C4 × C2 and C2 × C2 × C2 . The abelian
groups of order 16 are C16 , C8 × C2 , C4 × C4 , C4 × C2 × C2 and C2 × C2 × C2 × C2 .
Lemma 1.20. If m and n are coprime then Cm × Cn ∼
= Cmn .
Example. C2 × C3 ∼
= C6 . Abelian groups of order 24 = 3 × 8 are C24 , C12 × C2 , and
C6 × C2 × C2 .
Lecture 6
12 Groups
(iii) The number np of Sylow p-subgroups satises np ≡ 1 (mod p) and divides |G|,
and hence np | m.
Remark. G acts on the set Ω of Sylow p-subgroups via conjugation, g ∗ P = gP g −1 . By
Sylow's second theorem, there is exactly one orbit. The stabiliser, called the normaliser
of P, is NG (P ) = {g ∈ G : gP g −1 = P }. By the orbit-stabiliser theorem, the size of the
orbit is np = |G : NG (P )|. Thus np | |G|.
Lemma 1.22. If np = 1 then the unique Sylow p-subgroup is normal in G.
Proof. Take the unique Sylow p-subgroup P . Then gP g −1 is also a Sylow p-subgroup.
Thus gP g
−1 =P for all g ∈ G.
Corollary 1.23. Let G be a non-abelian simple group. Then |G| divides np !/2, and
np ≥ 5 once we have shown that all non-abelian subgroups of S4 are not simple.
Proof. G is acting on the set Ω of Sylow p-subgroups where |Ω| = np . IfG is non-abelian
simple then by Theorem 1.1 G is isomorphic to a subgroup of Anp , and np ≥ 5 subject
to proviso.
Proof. n11 ≡ 1 (mod 11) and n11 | 12. Assume G is simple and so n11 6= 1. Then n11 =
12. n3 ≡ 1 (mod 3) and n3 | 44. By simplicity, n3 6= 1 and n3 6= 4 by Corollary 1.23, so
n3 = 22.
But since n11 = 12 and n3 = 22 there are 12 · (11 − 1) elements of order 11 and 22 · (3 − 1)
elements of order 3. But this gives too many elements, a contradiction!
Lecture 7
13
This is a typical argument. We use Sylow's theorem to estimate the number of subgroups
and then count elements.
Then also
2
x = g 2 xg −2 = gxr g −1 = (gxg −1 )r = xr .
So r2 ≡ 1 (mod p). This gives two choices. Suppose r ≡ 1 (mod p), then g and x
commute so gx is of order 2p and hence G ∼
= C2p . Otherwise suppose r ≡ −1 (mod p),
then gxg
−1 = x−1 = xp−1 and G ∼ D . In the latter case, x is a rotation and g is a
= 2p
reection.
There is a problem on the examples sheet for the case |G| = 15.
Proof. Consider the set Ω of subsets of G of size pa and let {g1 , . . . , gpa } ∈ Ω. G acts on
Ω via left multiplication,
|G| pa m
|Σ| ≥ = = m.
pa pa
So we have two types of orbits Σ, (a) |Σ| = m, (b) |Σ| > m. By the orbit-stabiliser
theorem, |Σ| divides |G| = pa m. So for orbits Σ of type (b), where |Σ| > m, we deduce
that p | |Σ|.
The next step is to show that |Ω| is not divisible by p,
a
p m pa m pa m − 1 pa m − pa + 1
|Ω| = = · · · .
pa p a pa − 1 1
Lecture 7
14 Groups
pa m − k pa m
= = m.
pa − k pa
pa m − k pa m − pb q
=
pa − k pa − pb q
pa−b m − q
= a−b .
p −q
Note that b < a, so p does not divide the numerator after this cancellation process,
observing that b < a =⇒ k < pa . So after the cancellation for the product, p does not
divide the numerator. Thus p - |Ω|, as required.
But p - |Ω| and p divides the size of orbits of type (b), thus we do have orbits of type
(a).
|G| pa m
m = |Σ| = =
|P | |P |
where P is the stabiliser of an element chosen in Σ. So |P | = pa and thus there is a
a
subgroup of order p .
In fact, we will prove the following. If P is a Sylow p-subgroup and Q ≤ G with |Q| = pb
for 1 ≤ b ≤ a, then there exists g1 ∈ G such that Q ≤ g1 P g1−1 . In particular, if Q is of
order pa then Q = g1 P g1−1 .
Proof. This time consider the action of Q on the set of left cosets of P via left multipli-
cation,
g ∗ g1 P = gg1 P
for g ∈ Q, g1 ∈ G. We consider the orbits. By the orbit-stabiliser theorem, the size of
an orbit divides |Q|, thus is either 1 or a power of p.
The number of cosets is equal to the index of P in G,
|G| pa m
= a = m.
|P | p
In particular, m is equal to the number of orbits of size 1 plus pk times the number of
orbits of size k.
Counting cosets, we see that we must have an orbit of size 1. Suppose {g1 P } is such an
orbit. Thus gg1 P = g1 P for all g ∈ Q and hence g1−1 gg1 P = P . So g1−1 gg1 ∈ P and
g ∈ g1 P g1−1 . −1
Thus Q ≤ g1 P g1 , as required.
Lecture 7
15
Proof. Consider the action of P on the set of Sylow p-subgroups of G via conjugation.
By the orbit-stabiliser theorem, orbits are of size 1 or of size divisible by p. We need to
show that there is exactly one orbit of size 1.
Visibly {P } is an orbit of size 1. Are there any others?
Suppose {Q} is an orbit of size 1. Thus gQg −1 = Q for all g∈P and so P ≤ NG (Q).
But we observed last time that the normaliser of a Sylow subgroup has a unique Sylow
subgroup. (This is a consequence of Sylow's second theorem applied to the normaliser.)
So P = Q, since both are Sylow subgroups of NG (Q).
Thus {P } is the only orbit of size 1.
Counting Sylow p-subgroups in G,
But p divides the size of every other orbit. Therefore, np ≡ 1 (mod p).
Recall the denition of a soluble group, a group for which the composition factors can
be chosen to be abelian simple.
Theorem (1904, Burnside). If |G| = pa q b , where p and q are primes, then G is soluble.
Theorem (1937, Hall) . G is soluble if and only if whenever |G| factorises as |G| = mn
with m and n coprime there is a subgroup of order m.
Theorem (1963, Feit, Thompson, Odd Order Theorem). If |G| is odd then G is soluble.
Lecture 8
Lecture 8
Chapter 2
Rings
2.1 Denitions and examples
Denition (Ring). A set R with two operations, + and ·, forms a ring if
r1 (r2 + r3 ) = r1 r2 + r1 r3 ,
(r1 + r2 )r3 = r1 r3 + r2 r3 .
In this course all rings are assumed to be commutative, i.e. multiplication is commutative.
Lecture 8
18 Rings
f (X) = an X n + · · · + a1 X + a0
X
(f + g)(X) = (ai + bi )X i ,
i
i
X X
(f g)(X) = aj bi−j X i
i j=0
ai X i , g(X) = bi X i .
P P
where f (X) =
ai X i
P
R is a subring of R[X] by identifying R with the constant polynomials f (X) =
with ai = 0 for all i ≥ 1.
f (X) = a0 + a1 X + a2 X 2 + · · ·
with the same addition and multiplication as for R[X] but note that there is no restriction
demanding that all but nitely many coecients be zero.
Laurent polynomials,
X
f (X) = ai X i ,
i∈Z
X X
(f g)(X) = aj bi−j X i
i j
with the condition that all but nitely many ai are zero, e.g. X −1 + X .
Laurent series, X
ai X i
i∈Z
with the condition that all but nitely many of the ai for i≤0 are zero. Addition and
multiplication are dened in the same way as for Laurent polynomials.
For example, the set of continuous functions f : R → R forms a subring of the ring of
all functions R → R. R → R contains the subring
The set of continuous functions of
polynomial functions, f : R → R, a 7→ f (a), where f (X) ∈ R[X].
We can similarly consider Laurent polynomial functions, power series functions C→C
and Laurent series functions C → C.
Lecture 9
19
(iii) θ(1R ) = 1S .
Warning. An ideal in general is not a subring. If 1∈I then r = 1r ∈ I for all r∈R
and so I=R if 1 ∈ I. So the only ideal which is a subring of R is R itself.
Example. (i) In a eld F, the only ideals are {0} and F. (Multiply any non-trivial
element in the ideal by its inverse to see that 1 is in the ideal.)
Lecture 9
20 Rings
Note that this is the smallest ideal of R containing a. Such an ideal is called a principal
ideal.
More generally, the ideal generated by a1 , . . . , ak is
(a1 , . . . , ak ) = a1 R + . . . ak R
X k
= ai ri : ri ∈ R .
i=1
Examples of principal ideals include nZ in Z and (X) in C[X], where (X) is the ideal
of polynomials with zero constant term.
Proposition 2.3. Let I / R be an ideal. Then the quotient ring R/I has elements
consisting of the cosets r + I with the operations
Proof. From Proposition 1.3 we have already checked that R/I forms a group under
addition as dened. So we need to check that multiplication is well-dened. If
r1 + I = r10 + I, r2 + I = r20 + I
then
r10 = r1 + a1 , r20 = r2 + a2
for some a1 , a2 ∈ I , and therefore
where the second term is contained in I by the strong multiplicative closure property.
Thus r10 r20 + I = r1 r2 + I .
The multiplicative identity is 1+I since (1 + I)(r + I) = r + I .
Associativity, closure under multiplication and distributivity follow from the respective
properties in the ring R. (Check.)
Lecture 9
21
(a + I) + (b + I) = (a + b) + I,
(a + I)(b + I) = ab + I.
Therefore, C[X]/I ∼
= C.
Proposition 2.4 (Euclid's algorithm for polynomials) . Let F be a eld and consider
polynomials f (X), g(X) ∈ F[X]. Then we can write
Proof. Write
n
X m
X
i
f (X) = ai X , g(X) = bi X i
i=0 i=0
with deg f (X) = n and deg g(X) = m.
If n < m then set q(X) = 0 and r(X) = f (X). Now assume n ≥ m, and argue by
induction on n. Set f1 (X) = f (X) − an b−1
m X
n−m g(X) so deg f (X) < deg f (X).
1 If
m=n then deg f1 (X) < m and we can apply our rst case to f1 (X),
f (X) = an b−1
m g(X) + f1 (X).
R[X]/(X 2 + 1) ∼
= C,
a + bX + I 7→ a + bi.
Lecture 10
22 Rings
Proof. Let
θ : R → S/J, r 7→ r + J.
This is a ring homomorphism. (Check.) We have that
ker θ = {r ∈ R : r + J = J} = R ∩ J / R,
Im θ = {r + J : r ∈ R} = (R + J)/J ≤ S/J.
additive subgroups of R additive subgroups
←→
containing ideal I of R/I
subrings of R subrings
←→ ,
containing I of R/I
Lecture 10
23
ideals ofR ideals
←→ .
containing I of R/I
θ : R/I → R/J, r + I 7→ r + J.
This is well-dened as in Theorem 1.7. One can check it is a ring homomorphism and
θ(1 + I) = 1 + J .
The rst isomorphism theorem implies that R/ ker θ ∼
= Im θ ≤ R/J . Noting that
φ: Z → R
1 7→ 1R
m 7→ 1| + ·{z
· · + 1}
m times
The rst isomorphism theorem implies Z/ ker φ ∼
= Im φ ≤ R. Im φ is the prime subring
of R. ker φ / Z is an ideal and so is of the form nZ for some n.
Example. Z is an integral domain. All elds are integral domains. Subrings of integral
domains are integral domains, e.g. Z[i] ≤ C.
Denition (Principal ideal domain). A principal ideal domain is an integral domain in
which all ideals are principal ideals, i.e. of the form (a) = aR for some a ∈ R.
Example. Z is a principal ideal domain. We will see that any integral domain where
Euclid's algorithm applies is a principal ideal domain.
Lecture 11
24 Rings
Proof. If
X X
f (X) = ai X i , g(X) = bi X i
with deg f = m and deg g = n then f (X)g(X) has degree m+n since am bn 6= 0 as R is
an integral domain. Therefore,
(ii) F is a eld,
(iii) every element of F has the form ab−1 where a∈R and b−1 is the multiplicative
inverse in F of b ∈ R.
Example. With the notation as above, R = Z, F = Q.
a c ad + bc
+ =
b d bd
and multiplication by
a c ac
· = .
b d bd
One can easily check that these operations are well-dened.
Lecture 11
25
R may be identied with the subring of elements of the form r/1. The multiplicative
identity is 1/1. a/b has multiplicative inverse b/a if a 6= 0,
ab ab 1
= = .
ba ab 1
Lemma 2.11. A non-zero ring R is a eld if and only if its only ideals are {0} and R.
Proof. Suppose that R is a eld and take an ideal I / R. If I 6= {0} then pick a 6= 0 ∈ I .
Then 1 = aa−1 ∈ I . Hence I = R.
Conversely, assume that {0} and R are the only ideals. Take a ∈ R with a 6= 0. Then
(a) 6= {0} and so (a) = R. Hence 1 ∈ (a) and there exists r such that ar = 1.
Lemma 2.12. For an ideal I / R, the quotient ring R/I is a eld if and only if I is
maximal in R.
Proof. R/I is a eld if and only if I/I and R/I are the only ideals in R/I . By the usual
correspondence, this is equivalent to that the only ideals in R containing I are I and R,
which is the statement that I is maximal in R.
Lemma 2.13. For an ideal I / R, the quotient ring R/I is an integral domain if and
only if I is a prime ideal in R.
Example. In Z, the maximal ideals are pZ where p is a prime number. Prime ideals
are pZ for p prime and {0}.
Proof. Recall that we have a unique ring homomorphism φ : Z → R. The prime subring
Im φ ∼
= Z/nZ, where n is the characteristic of R. If n properly factorises as n = st then
working modulo n we have st = 0 (mod n) and so we have zero divisors.
Lecture 12
26 Rings
Remark. These denitions do depend on the ring R, e.g. 2 is prime and irreducible in
Z, but not in Q. 2X is irreducible in Q[X] but not in Z[X], as 2X = 2 · X and 2, X are
not units in Z[X].
Example.
√ To show that the converse of Lemma 2.16 does not hold in general, consider
R = Z[ −5] ≤ C. R is an integral domain since it is a subring of a eld. Dene a norm,
√
N (a + b −5) = a2 + 5b2 ∈ Z≥0 .
Lecture 12
27
(ii) F[X] with F a eld is a Euclidean domain with φ(f (X)) = deg f . (Section 2.4
gives Euclid's algorithm here.)
√
(iii) R = Z[i] = {a + bi : a, b ∈ Z} ≤ C is an integral domain. As with Z[ −5] we can
2 2
dene a norm via N (z) = z z̄ for z ∈ R. Thus N (a + bi) = a + b . In this case
the norm is a Euclidean function.
Remark. Similar arguments apply for other subrings of C and one can sometimes show
that the norm is a Euclidean function but one needs that any point in the complex plane
√
is distance less than 1 from a lattice point of R. Note for Z[ −5] this is not true.
Example. Z, F[X] for a eld F and Z[i] are all PIDs. But
Z[X] is not a PID.
Xi
P
To see this, consider the ideal I = (2, X) = ai : a0 is even / Z[X]. Suppose that
I = (f (X)) for some f (X) ∈ Z[X], in particular 2 = f (X)g(X) for some g(X) ∈ Z[X].
We deduce that f (X) has to be a constant polynomial, that is, of degree 0, and must
be ±1 or ±2.
Also X = f (X)h(X) for some h(X) ∈ Z[X]. So f (X) cannot be ±2. So f (X) = ±1.
Hence (f (X)) = Z[X]. This is a contradiction, since I 6= Z[X], e.g. 1 6∈ I .
Example (Minimal polynomials of matrices with entries in a eld F). Let ∆ ∈ Mn×n (F).
Consider I = {f (X) ∈ F[X] : f (∆) = 0}.
We can check that I / F[X] is an ideal. But F[X] is a PID, so I = (m(X)), an by
multiplyingm(X) by a unit in F we may assume m(X) is monic. This is the minimal
polynomial of ∆. (See the course Linear Algebra where Euclid's algorithm is explicitly
used.)
Lecture 13
28 Rings
(i) every non-zero element that is not a unit may be expressed as a product of nitely
many irreducibles,
Proof. Let p be an irreducible and suppose p | ab. Suppose p - a. The ideal (p, a) must
be principle in a PID, (p, a) = (d), say.
1 = rp + sa
and so
b = rpb + sab.
Therefore, p | b.
Remark. In fact, d is a highest common factor of p and a. Highest common factors are
unique up to units, see Example Sheet 3.
Lemma 2.21 (Ascending chain condition, ACC) . Let R be a PID and Ij / R with
I1 ⊂ I2 ⊂ I3 ⊂ · · · . Then, for some n ∈ N, In = In+i for all i ≥ 0.
Proof.
S
I = j Ij / R is an ideal, check. (For example, if a ∈ Ij and b ∈ Ik ,
The union
j ≤ k then a ∈ Ik , B ∈ Ik , so a + b ∈ Ik ⊂ I .) R is a PID, so I = (a) for some a ∈ I .
We must have a ∈ In for some n ∈ N. For all i ≥ 0,
Lecture 13
29
Proof of (2.18). (i) Let a ∈ R, non-zero and not a unit. Assume it cannot be fac-
torised as a product of nitely many irreducibles. So in particular a itself is not
irreducible, hence a = a1 b1 with a1 , b1 non-zero and not units. We may assume
that a1 cannot be factorised as a product of nitely many irreducibles, so write
a1 = a2 b2 and continue. (If both a1 , b1 can be factorised into nitely many irre-
ducibles, then so can a.)
We obtain (a1 ) ( (a2 ) ( (a3 ) ( · · · with inequality at each stage, for if (ai ) =
(ai+1 ) then ai and ai+1 would be associates and bi+1 a unit.
p1 · · · pm = up1 q2 · · · qn .
Cancellation in R gives
p2 · · · pm = (uq2 )q3 · · · qn .
Note that (uq2 ) is irreducible. Continuing in this way gives the result.
Remark. The proof of (ii) depends just on Lemma 2.20, that irreducibles are prime,
and (i) depends on the ACC.
There are several properties which follow quickly from the denition of a UFD.
We have already shown the `only if ' direction for integral domains.
a = p1 · · · p m , b = q1 · · · qn
Lecture 14
30 Rings
n
Y
ui pj ij
j
where u is a unit. But the tj can be at most mini {nij } if d0 | ai for each i. Thus d0 | d.
Lemma 2.22 (Gauss' lemma) . Let R be a UFD with F its eld of fractions. Suppose
f (X) ∈ R[X] is primitive. Then f (X) is irreducible in R[X] if and only if f (X) is
irreducible in F[X].
g(X) = b0 + b1 X,
h(X) = c0 + c1 X + c2 X 2 ,
b0 c0 = 1,
b1 c2 = 1.
So b0 = ±1, b1 = ±1. This is a contradiction since ±1 is not a root of X 3 + X + 1.
Lecture 14
31
Lemma 2.23. If f (X) and g(X) are primitive in R[X] then so is f (X)g(X).
Proof. Let
f (X) = a0 + a1 X + · · · + am X m ,
g(X) = b0 + b1 X + · · · + bn X n
be primitive. Suppose the product f (X)g(X) is not primitive. So there is a prime p∈R
dividing c(f (X)g(X)) but p - c(f (X)) and p - c(f (X)).
Let k and l be such that
p | a0 , p | a1 , . . . , p | ak−1 , p - ak ,
p | b0 , p | b1 , . . . , p | bl−1 , p - bl .
p divides all of these terms apart from ak bl which it does not divide. So p - ck+l , contra-
dicting that p | c(f (X)g(X)). Thus f (X)g(X) is primitive.
Consider the polynomial ring R[X] and let F be the eld of fractions of R.
Corollary 2.24. For f (X), g(X) ∈ R[X], the content c(f (X)g(X)) is an associate
of c(f (X))c(g(X)). (Recall that highest common factors and hence contents are only
dened up to associates.)
where f1 (X)g1 (X) is primitive by Lemma 2.23. So a highest common factor of the
coecients of f (X)g(X) is c(f (X))c(g(X)).
Proof of Lemma 2.22, Gauss' lemma. Take f (X) primitive in R[X]. Suppose it fac-
torises in R[X] as a product of non-units in R[X]. Since f (X) is primitive, neither of
these non-units is in R, the factors have degrees greater than 0. So f (X) factorises as a
product of non-units in F[X].
Lecture 15
32 Rings
Conversely, suppose that f (X) = g(X)h(X) with g(X), h(X) ∈ F[X] non-units and
hence not constant. Multiply by a ∈ R and b ∈ R repectively such that ag(X), bh(X) ∈
R[X]. Thus
Write
where g1 (X), h1 (X) are primitive in R[X]. Note that g1 (X), h1 (X) are not constant and
hence not units. Now Lemma 2.24 implies that ab is an associate of c(ag(X))c(bh(X))
by considering contents. So
Remark. A similar argument shows that if f (X) ∈ R[X], not necessarily primitive, and
f (X) = g1 (X)h(X) with g1 (X) primitive in R[X], h(X) ∈ F[X] then we can deduce
that in fact
f (X) = g1 (X)h0 (X)
with h0 (X) ∈ R[X].
We can see this as follows. There exists b ∈ R with bh(X) ∈ R[X]. Consider bf (X) =
g1 (X)(bh(X)). Write bh(X) = c(bh(X))h1 (X) with h1 (X) primitive. Consider contents,
b | c(g1 (X)bh(X)) and so b | c(bh(X)) since g1 (X) is primitive. Cancellation as in the
previous result gives f (X) = g1 (X)h0 (X) for some h0 (X) ∈ R[X].
Thus if
I = g1 (X)F[X] / F[X]
and
J = g1 (X)R[X] / R[X]
Proof. Let f (X) ∈ R[X] be non-zero and a non-unit. Write f (X) = c(f (X))f1 (X) with
f1 (X) primitive. (The rst step is to show that we need only look at primitives.)
Lecture 15
33
irreducibles is primitive by Lemma 2.23 and so the content of f (X) is an associate of the
product of the irreducibles in R. The essentially unique factorisation of c(f (X)) means
that this product is essentially the same as the previous one.
Cancellation implies that the product of the primitives is an associate of f1 (X). Thus
we may assume that f (X) is primitive.
ai pi (X) = ci qi (X)
with ci = c(ai pi (X)), qi (X) primitive in R[X] (and irreducible in R[X] by Gauss'
lemma). So
Considering contents and using the assumption that f (X) is primitive, we have
a1 · · · ak = uc1 · · · ck
for some unit u ∈ R since qi (X) is primitive by Lemma 2.23, i.e. q1 (X) · · · qk (X) is
primitive.
Cancellation gives
a product of irreducibles in R[X]. This shows the existence part. For the uniqueness of
the factorisation assume that
with qi (X), ri (X) irreducible in R[X]. We are assuming that f (X) is primitive and so
each ri (X) must be primitive.
By Gauss' lemma, ri (X) is irreducible in F[X]. Uniqueness of factorisation in F[X]
implies that k = l, and after reordering we have qi (X) = ui ri (X) with ui a unit in F[X],
hence in F.
We can write ui = ai /bi for some ai , bi ∈ R with bi 6= 0. Then bi qi (X) = ai ri (X). But
qi (X) and ri (X) are primitive and hence bi and ai must be associates being the content
of bi qi (X) = ai ri (X). Cancelling gives that qi (X) and ri (X) are associates in R[X].
Thus factorisation is essentially unique in R[X].
Lecture 16
34 Rings
g(X) = rk X k + · · · + r0 ,
h(X) = sl X l + · · · + s0
aj = r0 sj + · · · + rj s0 .
By Eisenstein's criterion, f (X) is irreducible in Z[X] and hence in Q[X]. So f (X) does
not have any roots in Q, as a root would yield a linear factor of f (X) in Q[X]. Thus p
has no nth roots in Q.
where p is prime. The claim is that this is irreducible in Z[X], and hence in Q[X].
Note that (X − 1)f (X) = X p − 1. We make the substitution X =1+Y and get
Y f (1 + Y ) = (1 + Y )p − 1,
p−1 p p−2 p
f (1 + Y ) = Y + Y + ··· +
1 p−1
p p
where
i are binomial coecients. Eisenstein's criterion does now apply, we have p| i ,
p
p2 - p−1 = p. We deduce that f (1 + Y ) is irreducible and hence f (X) is irreducible in
Z[X], and hence in Q[X].
Z[i] = {a + ib : a, b ∈ Z}.
Lecture 16
35
The norm
The units have to be of norm 1, hence the only units are ±1, ±i.
Recall from an earlier example that Z[i] is a Euclidean domain and hence a PID and a
UFD. Thus the irreducibles and primes are the same, by Lemma 2.16 and Lemma 2.20.
What are they?
• 2 = (1 + i)(1 − i).
• 5 = (1 + 2i)(1 − 2i).
Lemma 2.28. A prime number p in Z is irreducible (and prime) in Z[i] if and only if
p is not of the form x2 + y 2 with x, y ∈ Z − {0}.
Proof. Let us rst show that these are actually irreducibles (and primes).
(i) If p ≡ 3 (mod 4) then it is not of the form x2 + y 2 since squares modulo 4 are 0
2 2
or 1 and so x + y ≡ 0, 1 or 2 (mod 4). By Lemma 2.28, p is irreducible (and
prime) in Z[i].
(ii) For example, p = 2 = (1 + i)(1 − i) is of the correct form, 1±i are irreducibles of
norm 2.
Consider the nite eld of p elements {0, 1, . . . , p − 1} = Fp ∼
= Z/pZ. Its multi-
plicative group Fp \ {0} is cyclic. If not then the structure theorem for abelian
groups implies we can nd a subgroup of form Cm × Cm for some m > 1.
Thus there are at least m2 elements satisfying am = 1. These are all roots of
Xm −1 in Fp [X] which is a UFD. X
m − 1 has a unique expression up to ordering
and associates as a product of at most m irreducibles in Fp [X].
Lecture 16
36 Rings
z = α1 · · · αs ,
say. But we know what the irreducibles are from Proposition 2.29. Either N (αj ) = p2j
with
Q pj ≡ 3 (mod 4) or N (αj ) = pj with pj = 2 or pj ≡ 1 (mod 4). Thus n = N (z) =
N (αj ) is of the required form.
Conversely, if n = pn1 1 · · · pnk k with nj even for pj ≡ 3 (mod 4) then we can replace any
pj = 2 or pj ≡ 1 (mod 4) by pj = αj ᾱj and any other primes pj = αj = ᾱj and so
p2j = αj ᾱj for some irreducible αj . Thus n is of the form z z̄ for some z = x + iy and so
n = x2 + y 2 .
or
Lecture 17
37
For example,
Proposition 2.31. For an algebraic integer α this kernel is a principal ideal generated
by a monic irreducible polynomial in Z[X].
Proof. By denition, f (α) = 0 for some monic f (X) ∈ Z[X]. So we can pick fα (X) ∈ I
of minimal degree ≥ 0, and we may assume fα (X) is primitive in Z[X]. The claim is
that this is the required polynomial.
with r(X) = 0 or deg r(X) < deg fα (X). Clearing denominators, there exists a ∈ Z\{0}
such that
ah(X) = aq(X)fα (X) + ar(X)
with aq(X) ∈ Z[X] and ar(X) ∈ Z[X]. But ar(α) = 0 so ar(X) ∈ I . Minimality
of degree of fα (X) implies that ar(X) = 0 so r(X) = 0. So ah(X) = aq(X)fα (X).
Consider the contents of both sides,
a | c(ah(X)),
c(aq(X)fα (X)) = c(aq(X))
since fα (X) is primitive. So a divides all the coecients of aq(X). So q(X) ∈ Z[X].
Thus h(X) ∈ (fα (X)) / Z[X].
We claim that fα (X) is prime (and irreducible).
If fα (X) = f1 (X)f2 (X) then 0 = f1 (α)f2 (α) and so fi (X) ∈ I for some i. fα (X) | fi (X)
for some i.
Finally, we show that fα (X) is monic.
Lecture 18
38 Rings
Proof. fα (X) is irreducible in Z[X] and monic. By Gauss' lemma, Lemma 2.22, fα (X)
is irreducible in Q[X]. α is a root. So fα (X) = X − α. Thus α ∈ Z.
Fp [X]/(f¯α (X)) ∼
= Z[X]/(p, fα (X)) ∼
= Z[α]/(p)
where f¯α (X) is the polynomial in Fp [X] obtained from fα (X) by taking coecients
modulo p.
Fp /(X 2 + 1) ∼
= Z[i]/(p).
Lemma 2.33. A ring R satises the ACC if and only if all ideals in R are nitely
generated.
Proof. Suppose that all ideals are nitely generated and consider a chain
I1 ⊂ I2 ⊂ · · ·
S S
with Ij / R. Then the union j Ij /R is an ideal too. So by supposition j Ij is nitely
generated. There exists N such that all these nitely many generators lie in IN .
S
If m≥N then Im ⊂ j Ij ⊂ IN . Also IN ⊂ Im and so Im = IN .
Conversely, assume the ACC and let J / R. Take a1 ∈ J with a1 6= 0. If J 6= (a1 ) pick
a2 ∈ J \ (a1 ). If J 6= (a1 , a2 ) pick a3 ∈ J \ (a1 , a2 ) etc. We are producing a chain
Lecture 18
39
Consider
j
X
i
Ij = aj ∈ R : ∃f (X) ∈ J f (X) = ai X ∈ J ∪ {0},
i=0
degree less than m. Repeating this process yields q1 (X), . . . , qk (X) ∈ R[X] such that
f (X) − (q1 (X)f1 (X) + · · · + qk (X)fk (X)) ∈ J of degree less than N .
Now consider polynomials in J of degree less than N . For j < N there is a nite
S set Sj
of polynomials in J of degree j whose leading coecients generate Ij . Let S = j<N Sj ,
a nite set.
Lecture 18
Lecture 18
Chapter 3
Modules
3.1 Introduction
Denition. Let R be a commutative ring. A set M is an R-module with operation +,
with (M, +) forming an abelian group, and also has a map
R × M → M, (r, m) 7→ m
satisfying
(r1 + r2 )m = r1 m + r2 m
r(m1 + m2 ) = rm1 + rm2
r1 (r2 m) = (r1 r2 )m
1m = m
| + ·{z
na = a · · + a} for n≥1
n times
0a = 0
(−n)a = −(an) for n≥1
f (X).v = f (α)(v)
Lecture 19
42 Modules
rs = (rs)
(vii) Let R be the prime subring of a nite eld F. Here R ∼= Fp for some prime p.
(Question 8 on Example Sheet 3 shows that F has cardinality pn for some n.) We
can view F as an Fp -vector space.
Remark. There exists exactly one eld of cardinality pn for every prime p and n ≥ 1,
up to isomorphism. For example,
Cardinality Field
4 F2 [X]/(X 2 + X + 1),
8 F2 [X]/(X 3 + X + 1)
Denition. If N ≤M then the quotient module M/N has elements m+N with the
operation + dened as for quotients of the additive groups,
r(m + N ) = rm + N
Proof. Left to the reader. The isomorphism theorem for additive subgroups shows that
M/ ker θ ∼
= Im θ for the additive groups. So we only need to check the multiplicative
properties.
Lecture 19
43
submodules of submodules of M
←→ .
M/N containing N
M/L ∼
= (M/N )/(L/N ).
Example. In the special case of R = F a eld and vector spaces, we have quotient
spaces V /W where W ≤ V and there are isomorphism theorems for linear maps (F-
module homomorphisms).
Ann(m) = {r ∈ R : rm = 0}.
The annihilator of M is
Ann(M ) = {r ∈ R : ∀m ∈ M rm = 0}
\
= Ann(m).
m∈M
r1 m = 0, r2 m = 0 =⇒ 0 = r1 m + r2 m = (r1 + r2 )m,
r1 m = 0 =⇒ (rr1 )m = 0.
Rm ∼
= R/ Ann(m)
Lecture 20
44 Modules
m + N = r1 m1 + · · · + rk mk + N
= r1 (m1 + N ) + · · · + rk (mk + N ).
for some r1 , . . . , r k ∈ R .
If R = C[X1 , X2 , . . . ] then the submodules are the ideals, and we have shown that there
is a submodule which is not nitely generated inside the cyclic module R.
r1 m1 + · · · + rk mk = 0 =⇒ r1 = · · · = rk = 0.
(i) S generates M,
Lecture 20
45
Proof. [(i) =⇒ (ii)] Suppose that S generates M freely. We need to show independence.
Let N= Rk and dene
ψ : S → N, mj 7→ (0, . . . , 1, . . . ),
θ(r1 m1 + · · · + rk mk ) = (r1 , . . . , rk ).
So if r1 m1 + · · · + rk mk = 0 then r1 = · · · = rk = 0.
The two implications (ii) =⇒ (iii) and (iii) =⇒ (i) are left as an exercise.
Example. For R = Z, M = Z, the set {2, 3} is a generating set, but contains no basis
for M. For example, {2} is independent, but is not a generating set.
θ : Rk → M, (r1 , . . . , rk ) 7→ r1 m1 + · · · + rk mk .
Proof. One can dene determinants for square matrices with coecients just as in Linear
Algebra by
X
det A = sgn(π)A1,π(1) · · · An,π(n) ,
π∈Sn
det AB = det A det B,
A adj(A) = (det A)I.
Thus A is invertible if and only if det A is a unit in R. Assume that n≥m with unique
expressions
X X
vj = Aij ui , uk = Bjk vj .
i j
So
XX
uk = Bjk Aij ui
j i
XX
= Aij Bjk ui
i j
Lecture 20
46 Modules
X
= (AB)ik ui .
i
(ER1) Add c times ith row to j th row. (Achieved by multiplying A on the left by I + C
where Ckl = 0 except for Cij = c.)
(ER2) Interchange rows i and j , i 6= j . (Multiplying by I+C where Ckl = 0, Cii =
Cjj = −1, Cij = Cji = 1.)
(ER3) Multiplying row i by a unit c. (Multiplying by C diagonal, all diagonal entries 1
apart from Cii = c.)
All of these are achieved by multiplication on the left by suitable matrices. These
operations are reversible as the corresponding matrices are invertible.
We could similarly consider elementary column operations (EC1), (EC2), (EC3), which
are achieved by multiplying on the right by suitable invertible matrices.
Denition. Two m×n matrices are equivalent if one can get from one to the other via
a sequence of elementary operations.
IfA, B are equivalent then there exists an invertible m × m matrix Q and an invertible
n × n matrix P with B = QAP −1 . We can see this by doing the bookkeeping on the
accumulation of row and column operations.
Theorem 3.7 (Smith normal form). Let A be an m×n matrix over a Euclidean domain
R. Then by a sequence of elementary operations, we can put it into the form
d1 0
..
.
dr
0
..
.
0 0
Lecture 21
47
Proof. Consider the 1×1 minors, i.e. the entries of A. Under an elementary operation,
Aij either stays the same or is replaced by Aij + cAik or Aij + cAkj or multiplied by a
unit or replaced by some other entry Akl , and so we deduce that the ideal generated by
the entries of the new matrix is contained in the ideal generated by the entries of the old
matrix. Now the other direction follows from the fact that the operations are revertible.
We thus obtain equality.
Keep using these two cases until we cannot do so any longer, i.e. when Ak divides all
entries in rst row and column. Then subtract suitable multiples of the rst column
from the others and substract suitable multiples of the rst row from the others to get
matrix of the form
d 0
0 C
with d 6= 0 for some (m − 1) × (n − 1) matrix C.
Case 3. If we have a matrix of the form
d 0
0 C
but d does not divide some entry of C then d - Aij , say. Use Euclid's algorithm so that
Aij = qd + r with r 6= 0 and φ(r) < φ(d). Add column 1 to column j . Subtract q times
row 1 from row i so that we have replaced Aij by r . Switch columns j and 1, rows i and
1, so r appears in top left hand corner.
We are now in a position to apply Case 1 and Case 2 again to reduce the φ-value of
the top left hand corner. Repeat the whole process (Cases 1, 2 and 3) until we can't
anymore, i.e. when the matrix is of the form
d 0
0 C
Lecture 21
48 Modules
2 −1 1 −1 1 0 1 0
A= .
1 2 3 2 3 5 0 5
1 0 .)
(i) Add column 2 to column 1. (Multiply on the right by 11
1 1 .)
(ii) Add column 1 to column 2. (Multiply on the right by 01
1 0
(iii) Subtract 3× row 1 from row 2. (Multiply on the left by −3 1 .)
Thus
1 0 1 0 1 1
= A .
0 5 −3 1 1 2
Lemma 3.9. Let M be a free R-module of rank m, i.e. there is a basis of cardinality
m, and R be an Euclidean domain and suppose that N ≤M is a submodule. Then N
is nitely generated.
I = {r ∈ R : (r1 , r2 , . . . , rm ) ∈ N } / R.
Lecture 22
49
Proof. Use Lemma 3.9 to see that N is nitely generated by x1 , . . . , xn , say. So write
these as columns of a matrix A, an m × n matrix.
By Theorem 3.7 on Smith normal forms, we know that we can put A into the form
d1 0
..
.
dr
, di 6= 0, d1 | d 2 | · · · | d r
0
..
.
0 0
Proof. The set {d1 v1 , . . . , dr vr } freely generates N since {v1 , . . . , vm } are free generators
of Rm .
R R R
M∼
= ⊕ ⊕ ··· ⊕ ⊕ R··· ⊕ R
(d1 ) (d2 ) (dr )
Remark. (i) We may assume that all the dk are non-units, for if dk is a unit then
R/(dk ) ∼
= {0} which may be thrown away.
Lecture 22
50 Modules
θ : Rm → M, (r1 , . . . , rm ) 7→ r1 m1 + · · · + rm mm
and Im θ = M {m1 , . . . , mm }
since generates M and N = ker θ ≤ Rm . By the isomor-
phism theorem, M∼
= Rm /N .
There exists a basis {v1 , . . . , vm } of Rm such that {d1 v1 , . . . , dr vr } is a generating set
for N. So
Rm /N ∼
= R/(d1 ) ⊕ · · · ⊕ R/(dr ) ⊕ R ⊕ · · · R,
as required.
Example. Take R = Z. The theorem tells us about nitely generated Z-modules, that
is, nitely generated abelian groups. Consider an abelian group A, written additively,
generated by a, b, c subject to relations
2a + 3b + c = 0,
a + 4b = 0,
5a + 6b + 7c = 0.
1 0 0
0 1 0 .
0 0 21
(Check.) So A∼
= Z3 /N ∼
= Z/(1Z) ⊕ Z/(1Z) ⊕ Z/(21Z) ∼
= Z/(21Z), a cyclic abelian group
of order 21.
Theorem 3.13 (Structure theorem for nitely generated abelian groups) . A nitely
generated abelian group is isomorphic to
Proof. Set R=Z in Theorem 3.12 and write the group operation multiplicatively.
Remark. For a nite group there are no copies of C∞ , see Theorem 1.19.
R/(d) ∼
= R/(pn1 1 ) ⊕ · · · ⊕ R/(pns s ),
Lecture 23
51
n
Proof (cf. (Lemma 1.20).
Split o one summand R/ pj j at a time using the following
lemma.
We have Ann(r1 m) = (r2 ), Ann(r2 m) = (r1 ) using that we have good factorisation in
R. Set
M1 ∼
= R/(r2 ), M2 ∼
= R/(r1 ).
M1 ⊕ M2 → M, (m1 , m2 ) 7→ m1 + m2 .
M∼
= N1 ⊕ N2 ⊕ · · · ⊕ Nt
n
where each Nj ∼
= R/(pj j ) and pj is prime (and irreducible) and nj ≥ 1, or Nj ∼
= R.
Proof. Use primary decomposition from Lemma 3.14 to split the components in Propo-
sition 3.12 as direct sums of modules of this form.
n
Note that the pj are not necessarily distinct. The pj j are the elementary divisors and
they are unique up to ordering. (Proof omitted.)
g(X).v = g(α)(v)
Example. Cyclic modules over F[X], e.g. V = M ∼ = F[X]/(f (X)). Here f (X) is a
polynomial of least degree such that f (α) = 0. We may assume that f (X) is monic, it
is the minimal polynomial for α.
Lecture 23
52 Modules
(ii) f (X) = (X − λ)r . Then (α − λ)r = 0. Consider β = α − λ.ι then the minimal
r
polynomial of β is X . So there exists a vector space basis of M = V such that β
is represented by
0 0 0
1 0
..
0 1 .
..
. 0
0 1 0
and so α is represented by
λ 0 0
1 λ
..
0 1 . .
..
. λ
0 1 λ
(iii) For a general f (X), for a generator m with Ann(m) = (f (X)) as in (i), we can
pick a vector space basis m, Xm, . . . , X r−1 m where
and
Lecture 23
53
C(f1 ) 0
C(f2 )
.
..
.
0 C(fs )
Proof. Theorem 3.12 splits M as a direct sum of cyclic modules of the right form, and
since M is nite dimensional as a F-vector space there are no components isomorphic
to F[X]. Now use Example (iii) to represent α.
Remark. (i) The invariant factors fi (X) are unique up to associates. (This is not
quite proved here).
Now we can use Proposition 3.14 on primary decomposition to split the Mi as direct
sums of cyclic modules with annihilators generated by powers of irreducibles.
M∼
= M 1 ⊕ · · · ⊕ Ms
where Mj ∼
= C[X]/((X − λj )aj ) for some λj ∈ C. Here, λ1 , . . . , λ s are not necessarily
distinct.
Lecture 24
54 Modules
the powers of X −λ increase as we pass from f1 (X) to fs (X). Thus the size of
λ-blocks increases as we pass from M1 to Ms , so the largest λ-block arises from
Ms .
Recall that the minimal polynomial of α is
Y
fs (X) = (X − λ)aλ .
λ distinct
Then aλ is the size of the largest λ-block.
Y
f1 (X) · · · fs (X) = (X − λ)bλ
λ distinct
where bλ is the sum of the sizes of λ-blocks. Observe that the λ are the eigenvalues
of α.
Lecture 24
55
(z1 , z2 , . . . ) 7→ (z2 , z3 , . . . ).
Lecture 24