Introduction to Algebraic Structures
Introduction to Algebraic Structures
ALGEBRAIC
STRUCTURES
Tom Denton
Introduction to Algebraic Structures
Tom Denton
This open text is disseminated via the Open Education Resource (OER) LibreTexts Project ([Link] and like the
hundreds of other open texts available within this powerful platform, it is licensed to be freely used, adapted, and distributed. This
book is open licensed which allows you to make changes, save, and print this book as long as the use is non-commercial. The
applicable license is indicated at the bottom of each page.
LibreTexts Reduces COST for Students and Provides CONTROL for Faculty
Instructors can adopt existing LibreTexts texts or Remix them to quickly build course-specific resources to meet the needs of their
students. Unlike traditional textbooks LibreTexts’ web based origins allow powerful integration of advanced features and new
technologies to support learning. Instructors can select the best fit for their classes. No additional IT technology is needed to use
LibreTexts, the system is available 24/7 and we are working on a hand-held, solar powered LibreTexts-in-a-Box system using an
inexpensive Raspberry pi which will bring the content to places with limited connectivity.
The LibreTexts mission is to unite students, faculty and scholars in a cooperative effort to develop an easy-to-use online platform for
the construction, customization, and dissemination of OER content to reduce the burdens of unreasonable textbook costs to our
students and society. The LibreTexts project is a multi-institutional collaborative venture to develop the next generation of open-access
texts to improve postsecondary education at all levels of higher learning by developing an Open Access Resource environment. The
project currently consists of 13 independently operating and interconnected libraries that are constantly being optimized by students,
faculty, and outside experts to supplant conventional paper-based books. These free textbook alternatives are organized within a central
environment that is both vertically (from advance to basic level) and horizontally (across different fields) integrated.
The LibreTexts libraries are Powered by MindTouch® and are supported by the Department of Education Open Textbook Pilot Project,
the UC Davis Office of the Provost, the UC Davis Library, the California State University Affordable Learning Solutions Program, and
Merlot. This material is based upon work supported by the National Science Foundation under Grant No. 1246120, 1525057, and
1413739. Unless otherwise noted, LibreTexts content is licensed by CC BY-NC-SA 3.0.
Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not
necessarily reflect the views of the National Science Foundation nor the US Department of Education
Have questions or comments? For information about adoptions or adaptions contact info@[Link]. More information on our
activities can be found via our Facebook ([Link] Twitter ([Link] or Blog
([Link]
Compiled on 11/13/2019
TABLE OF CONTENTS
An algebraic structure is a set (called carrier set or underlying set) with one or more finitary operations defined on it that satisfies a list
of axioms. Examples of algebraic structures include groups, rings, fields, and lattices.
0: INTRODUCTION
1: SYMMETRY
Groups arise in nature whenever we can find symmetry. For example, the human body has a lateral symmetry: if you imagine reversing
left and right, most people would look more-or-less the same.
1.0: Symmetry
1.1: Counting Symmetries
1.2: Symmetric Polynomials
1.3: Abstraction. What is a group?
2: GROUPS I
2.0: Symmetry and Functions
2.1: Definition of a Group
2.2: Integers Modulo n
2.3: Permutations
3: GROUPS II
3.0: Generating Sets
3.1: Visualizing Groups: Cayley Graphs
3.2: Subgroups
3.3: Cosets and Lagrage's Theorem
4: GROUPS III
4.0: Homomorphisms
4.1: Product Groups
4.2: Image and Kernel
5: GROUPS IV
5.0: Quotient Groups
5.1: Examples of Quotient Groups
5.2: Isomorphism Theorem
5.3: Classifying Finite Groups
6: GROUP ACTIONS
In this chapter, we examine group actions and some fun applications of group theory.
7: RINGS I
7.0: Juggling With Two Operations
7.1: Ring Homomorphisms
8: RINGS II
TOC.1 11/13/2019
8.0: The Problem of Division
8.1: Field of Fractions
8.2: Euclidean Domains
9: VECTOR SPACES
9.0: A Return to Linear Algebra
9.1: Linear Independence
BACK MATTER
FRONT MATTER
TitlePage
InfoPage
Table of Contents
TOC.2 11/13/2019
CHAPTER OVERVIEW
TOC.1 11/13/2019
0: INTRODUCTION
This is a set of notes I developed for an e-learning course in Algebraic Structures offered by Maseno, University in Western Kenya. The
idea is to introduce the key concepts of algebraic structures without assuming much background in higher mathematics. Math education
in Kenya is heavy on calculation (it's relatively easier to teach and evaluate), but often falls short when it comes to teaching students to
think creatively about mathematics, and really understand the subject as it relates to the world beyond the test. On the bright side, these
same students are usually very ready to take a more creative approach to mathematics: good skills in calculation provides at least a good
intuition for working with numbers, and gives a good foundation from which to build. Kenyan students are also generally very
enthusiastic when presented with interesting mathematics.
Give students a first encounter with algebraic structures: Groups, rings, fields, and vector spaces,
Create an intuition for how these objects appear 'in the world,' meaning both in the real world and in the broader scope of
mathematics,
Encourage students to engage with the material in a creative way, and
Teach/Reinforce important points from the foundations of mathematics, such as induction.
It's a lot to ask for a single ten-week term. Let's see where we get.
The notes themselves are divided into eleven 'chapters,' one for each week of Maseno's term, plus this introductory chapter. Taking a
cue from computer science, all numbering of chapters and sections starts at 0. As the course becomes fully developed, I will be inserting
videos for each section, giving an alternate presentation of the ideas. But the text is primary!
Here are some underlying principles that I believe strongly in, which also guide the formation of these notes.
We live in the future. Computers are somewhere between a million and a billion times faster at computation than humans are.
Therefore, we should focus our teaching on what humans do better than computers: Understanding, problem solving, and
placing things in context. It is often essential to understand how to compute things (indeed, otherwise we would not be able to
tell the computer how to do computations for us!), but computation should not be the aim of a course.
We live in the future. We can communicate at almost zero-cost at slightly less than the speed of light. Information is governed
by post-scarcity economics, and we need to treat it as such. This means we cannot treat information like a scarce resource to be
hoarded: we must share our infinite wealth freely. Thus, these notes will remain free, and will be distributed under the Gnu
Public License.
DESIGN PRINCIPLES
This book is also a programming project! As of this writing, I'm learning some modern web-programming tools; this book runs on
Django, HTML5, Javascript, JQuery, MathJAX, the Sage Cell Server, and probably more by the time I'm done. HTML5 support is
becoming more common in browsers, and should be an available standard for a long time to come.
Here is a list of design principles that I hope to adhere to for the final product:
An important principle of the book is to support multiple learning modes: there should be a combination of video and text for
every section.
Videos, for their part, should be no longer than five or ten minutes long. Likewise, sections of the text should be somewhere
south of 1000 words.
Each section should have at least one exercise, and these exercises should encourage both basic mechanical understanding and
encourage creative approaches to the material.
The finished work should be free and freely available.
The finished work should meet the standards of a Maseno University e-learning course; in particular, have at least ten 'topics' to
be digested at a rate of one-per-week, with clearly marked exercises to act as assignments.
Wherever reasonable, interactive elements (probably using Sage) should be included.
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
TOC.2 11/13/2019
CHAPTER OVERVIEW
1: SYMMETRY
Groups arise in nature whenever we can find symmetry. For example, the human body has a lateral symmetry: if you imagine reversing
left and right, most people would look more-or-less the same.
1.0: SYMMETRY
1.1: COUNTING SYMMETRIES
1.2: SYMMETRIC POLYNOMIALS
f is a symmetric polynomial if every way of switching around (i.e., permuting) the variables leaves f the same.
TOC.1 11/13/2019
1.0: SYMMETRY
Groups arise in nature whenever we can find symmetry. For example, the human body has a lateral symmetry: if you imagine reversing
left and right, most people would look more-or-less the same. (In fact, we see an example of this every time we look in a mirror.)
Figure 1: Da Vinci's famous sketch demonstrates lateral symmetry in the human body. (Source)
Another example is in the formation of crystals. In a crystal, the atomic structure arranges itself into a very symmetrical pattern, which
you can see even with the unaided eye. The symmetry of the atomic structure means the atoms are packed very regularly, which leads
to the nice shapes we see. In the late 1800's, mathematicians used group theory to classify all of the shapes of crystals that could ever
exist in the world.
Example 1.0.1:
Find some examples symmetry other than those that we talked about above.
So how can we use mathematics to study symmetry? Well, the first thing we learn about in mathematics is counting, so perhaps we
should try to count symmetries!
If we think of a perfectly symmetrical face, there are two symmetries: one from flipping left-to-right, and another from leaving the face
alone. Some might argue that the face only has one symmetry, and leaving it alone doesn't count. In fact, we could argue all day about
this point and make no progress until we became very precise about what is meant by 'a symmetry.'
In fact, doing nothing to an object is a way of moving it back onto itself. Thus, we will say that a symmetrical face has two symmetries.
Let's consider some more mathematical objects. A line segment always has two symmetries, just like a face. An equilateral triangle,
though, has six symmetries: three rotations (including the rotation by 0 ), and three rotations when flipped over. You can keep track of
∘
the various symmetries by labelling the corners of the triangle, and seeing where they end up after applying one of the symmetries.
(See the illustration.)
Exercise 1.1.1:
How many symmetries does a square have? How about an n-sided regular polygon?
We can also imagine an object which is symmetrical under some number of rotations, but which can't be flipped over. You can make
such an object in many ways; one way is to take a square (or any other regular polygon) and then add an identical 'bump' just to one
side of each corner. This object has rotational symmetry, but cannot be flipped.
In three dimensions, we have regular polyhedra. These are three dimensional objects with many symmetries! A tetrahedron has 24
symmetries, for example: twelve of these are rotations, and another 12 can be obtained by reflecting and then rotating.
Exercise 1.1.2:
There are five regular polyhedra: the tetrahedron, the cube, the octahedron, the dodecahedron, and the icosahedron. How many
symmetries does each one have? Try working it out directly for the smaller cases, then see if you can arrive at a formula for the
polyhedra with more sides.
Of course, some objects have an infinite number of symmetries. A circle is a good example of this: every rotation is a symmetry, and
there are infinitely many angles by which the circle may be rotated, all of which preserves its shape.
Thinking back to our regular tiling patterns, these also have infinitely many symmetries. All of them have translational symmetry,
since you can translate the whole picture back onto itself. And you can translate in one direction as many times as you like, so there's at
least one symmetry for every integer.
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
So far, we've considered geometric objects. Let's also have an example of something that isn't geometric. Let f be a polynomial in
some number of variables. For now, we'll stick with 3 variables, x, y, and z. We say that f is a symmetric polynomial if every way of
switching around (ie, permuting) the variables leaves f the same.
For example, the polynomial f(x, y, z) = x + y + z is symmetric: switching the x and the z, for example, gives z + y + x , which
is the same as f . As a more complicated example, you can check that g(x, y, z) = x y + x z + y x + y z + z x + z y
2 2 2 2 2
is also
2
symmetric.
On the other hand, h(x, y, z) = x + y + z is not symmetric, since switching x and z produces z + y + x , which is not equal
3 3 3 3
to h. This polynomial does have some symmetry, since switching x and y leaves h the same, but we save the name 'symmetric
polynomial' for the fully symmetric polynomials.
Exercise 1.2.0:
Let f be a symmetric polynomial with n variables. how many symmetries does f have?
If you haven't tried a problem like this before - working in n variables - it is extremely important to get some practice. Try writing
down some different symmetric polynomials with small numbers of variables. Is there a formula that describes the the number of
symmetries in terms of the number of variables?
Symmetric polynomials are really interesting things, and we'll see them again when we talk about rings and vector spaces!
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
The six rotational symmetries of the bumpy hexagon. The blue arrow represents the rotation r; r 6
= id .
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Topic hierarchy
TOC.1 11/13/2019
2.0: SYMMETRY AND FUNCTIONS
The red arrows describe the flip over the vertical axis. Do it twice, and you end up where you started: F ∘ F = id .
Now remember our 'bumpy' hexagon, which only had rotational symmetries. Call the identity e and let r be the clockwise rotation by
60 . All of the other rotations we can think of as r composed with itself some number of times; we'll just write this as r . So all of the
∘ k
symmetries of the bumpy hexagon are {e, r, r , r , r , r } . We notice that r ∘ r = r = e . This is quite interesting! In fact, we can
2 3 4 5 5 6
make a 'composition table' to keep track of what happens when we compose any two of the symmetries.
Exercise 2.0.0:
Write down all of the symmetries of an equilateral triangle. Make a 'composition table' of the symmetries, showing what happens
when any two of them are composed. Then make a list of each symmetry and its inverse. What do you observe?
CONTRIBUTORS
Consider an object X with some symmetries S . We've seen that we can compose any of the symmetries in S and obtain another
symmetry of X. We've also seen that these symmetries obey certain rules. We can now, at last, define a group.
An essential notion in mathematics is abstraction. Note that our definition certainly applies to any collection S of symmetries of an
object, but in fact there are other contexts where the definitions apply as well! The operation can be any way of combining two things
in S and getting another back; S doesn't need to be a collection of functions, and the operation doesn't need to be composition. A
group is defined purely by the rules that it follows! This is our first example of an algebraic structure; all the others that we meet will
follow a similar template: A set with some operation(s) that follow some particular rules.
For example, consider the integers Z with the operation of addition. To check that the integers form a group, we need to check four
things:
1. Addition takes two integers and gives another integer back. (Here we're checking the requirement that the operation is one from
S × S → S . Notice that the the output of the operation is always in S ! This is called closure of the operation.)
Exercise 2.1.1
An important note about inverses: An inverse means, roughly, that we can go back to where we started after applying an operation.
Algebraically, this means we can cancel elements. When we have something like gh = gk , we can multiply both sides on the left
by g to get h = k . We have to be careful to multiply on the same side on both sides, since groups aren't always commutative! If
−1
Exercise 2.1.2
Suppose that S is the collection of all symmetries of a geometric object X . Show that S is a group! (Then S is called, perhaps
unsurprisingly, the group of symmetries of X.)
Exercise 2.1.3
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Recall the 'bumpy' hexagon, which had rotational symmetry but no reflection symmetry. The group of symmetries of the bumpy
hexagon is called Z . In this section, we'll consider the general case, Z , which we can initially think of as the group of symmetries of
6 n
the following rule: For any a, b ∈ Z , a + b = (a + b)%n . For example, Z is the set {0, 1, 2, 3, 4}. Here, for example,
n n 5
Usually, we don't write + for the addition. From now on, whenever you see an expression like 4 + 3 , you will have to be mindful
n
of the context! If we consider 4 and 3 as plain old integers, the answer is 7. If they are integers mod 5, then the answer is 2!
3. The next definition is really just an easy way to think of the second definition. Imagine a distant planet where the clock has n hours
on it instead of 12 (or 24). Then, just as our hours 'wrap around' the circle beyond 12 o'clock, the hours wrap around at n. Now if
we imagine the clock is numbered 0 through n − 1 instead of 1 to n, we have exactly the situation of Z . n
4. Our last definition will identify Z with the n-th roots of unity, which are complex numbers. Recall that any complex number may
n
be written as re , where r is a positive real number and θ is any angle. Now let n and k be some positive integers, and consider
iθ
k k
xk xj = e n
2iπ
e n
2iπ
Because angles only matter up to adding/subtraction 2π, we see that the multiplication here
= e n
2iπ
.
'wraps around' just like the addition in Z . In fact, we can see this directly by drawing the n-th roots of unity in the complex plane!
n
They are just n points evenly spaced around the unit circle, and their multiplication exactly matches the addition in Z or on the n
extra-terrestrial clock.
All of these are somehow the same; but there's a question of how to formally show that two groups are the same. What do we mean by
the same? This is an important question to consider, which we will come back to later. For now, an exercise.
Exercise 2.2.0
Write out tables for n = 5 and n = 6 for:
1. composition of the rotations of the 'bumpy' n-gon,
2. addition in Z , n
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
of view, the objects we use don't actually matter; all we care about is the order they are arranged in. So usually we'll just talk about
permutations of the numbers 1 through n. You can think of each number as just counting the objects involved: first object, second
object, nth object.
Permutations arise in the world in a many, many ways. For example, suppose you are asked to list your preferences amongst a bunch of
presidential candidates. The list you make up, from favorite to least favorite, is a permutation of the candidates. In fact, you can use the
mathematics of permutations to learn interesting things about different kinds of voting systems.
Figure 2.3: Instant run-off voting uses a full permutation of the candidates to find a winner. (Source)
Another example is a deck of playing cards. In a standard deck, each card appears exactly once. When you shuffle the deck, you are
just creating a random permutation of the cards. One can use mathematics related to permutations to answer interesting questions about
cards. Like: 'How many times do I need to shuffle the deck before it is truly randomized?' The answer, by the way, seems to be 7 for a
standard riffle shuffle. But proving that is well beyond the scope of these notes!
Because permutations are so common, problems involving permutations tend to be very applicable! For example, suppose you have
two hundred students in a class and they all hand in an exam. The stack of exams they give you is a permutation of the students; most
likely, the list of student scores you keep is alphabetical. This suggests a problem: What is the fastest way to sort the exams? (In fact,
sorting is a fundamental problem in computer science.)
How many permutations are there of a set of n objects? Suppose we try to build a permutation by successively choosing objects. Then
there are n choices for the first object, n − 1 choices for the second, and so on, until there is only one choice for the last object. Then
to get the total number of possible permutations, we multiply these numbers together, and get n(n − 1)(n − 2) ⋯ 1 . This number, if
you haven't seen it before, is called n-factorial, written n!.
Exercise 2.3.0
Write out all of the permutations of the set {1, 2, 3, 4}. How many are there in all? Find a sensible way to organize your list!
Suppose we have some initial ordering of our objects. The letters {a, b, c}, for example, can be organized alphabetically. Then every
permutation we can think of as a mixing-up of this initial order. In this sense, the permutation is a special kind of function from the set
of objects back to itself. (By special, I mean it's a bijection, which is to say a one-to-one and onto function.) (TODO: Wikipedia link) A
permutation of these objects is then the list [σ(a), σ(b), σ(c)]; this list is called the one-line notation for σ.
These permutations-as-functions can be composed: if you think of two permutations σ and τ as different ways to mix up the set, you
can mix them up according to σ and then according to τ . Then the composition is specified by the list [τ (σ(a)), τ (σ(b)), τ (σ(c))] .
For example, if σ = [2, 3, 1, 4] and τ = [3, 4, 1, 2] , then τ ∘ σ = [4, 3, 1, 2] . (In particular, σ(1) = 2 , and τ (2) = 4 , so the first
entry of τ ∘ σ is 4. The other three entries are computed similarly.) On the other hand, σ ∘ τ = [3, 1, 4, 2] . This is different from
Composition of two permutations is again a permutation. Since each permutation contains every element of X exactly once, the
composition τ ∘ σ must also contain each element of X exactly once. Identity: The permutation [1, 2, … , n] acts as the identity.
Inverses: Roughly speaking, if you can mix things up, you can just as easily sort them back out. The 'sorting permutation' of σ is
exactly σ . Associativity: Suppose we compose three permutations, σ, τ , and ρ. Int he braid notation, this just means placing the
−1
three braids on top of each other top-to-bottom, and then 'forgetting' the two sets of intermediate dots. (TODO: a picture!) Associativity
is tantamount to forgetting the two sets of dots in two different orders; the resulting picture is the same either way, so composition of
permutations is associative!
Exercise 2.3.1
Carefully work through the above and check for yourself that permutations satisfy the definition of a group. For example, where it
is stated that the identity permutation has one-line notation [1, 2, … , n], you should check that this is actually the identity.
Likewise, how can you explicitly compute the inverse of a permutation explicitly?
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Topic hierarchy
TOC.1 11/13/2019
3.0: GENERATING SETS
We have now seen a few different kinds of groups: groups of symmetries of a geometric object, integers under addition, integers
modulo n, and permutations. We can easily visualize the objects related to the group - like the geometric object, numbers, or the braid
notation for permutations - but how can we visualize the group itself?
An excellent way to go about this is to identify a set of generators for the group. In a group we can always combine some elements
using the group operation to get another group element. Generators are some special elements that we pick out which can be used to
get to any other element in the group.
As an example, remember the dihedral group, the symmetries of an n-sided polygon. There are 2n symmetries in all, but we can build
up any of the symmetries using just a small rotation and a flip. For the symmetries of the equilateral triangle, we let ρ denote the
rotation by 120 degrees, and let f be the flip over one of the axes of the triangle. Then the six elements of the dihedral group are given
by: id, ρ, ρ , f, fρ, fρ . Thus, {f, ρ} is a set of generators for the dihedral group.
2 2
Definition 3.0.0:
Let G be a group, and S a subset of G . We say that S generates G (and that S is a set of generators for G ) if every element of G
can be expressed as a product of elements of S and their inverses.
We include the inverses of the generators in the definition because we know that every element has an inverse. If we think of the
integers under addition, we can write every positive number as a many-times sum of the number 1: for example, 5 is just
1 + 1 + 1 + 1 + 1 . If we allow inverses as well, we can then get every element of the group from a single generator: the inverse of 1
is −1 , so we can write (for example) −4 = (−1) + (−1) + (−1) + (−1) . (Including the inverses also means we don't need to
include the identity, since for any g, gg = e .)
−1
On the other hand, for any group G , we can certainly take G itself as a generating set! Then every element is considered a 'generator,'
so every element can be written as a (trivial) product of generators. This tells us that for any group we can find a generating set.
Usually, we try to find a generating set as small as possible. Sometimes, though, a larger generating set might be interesting if it helps
us to better understand the group in question.
Once we have a generating set for a graph G , we can produce a very nice visualization of the group called the Cayley graph. By graph,
we mean a number of points (called vertices) connected by some arrows (called edges). Graphs are good for keeping track of
relationships between things, and appear in many, many places in mathematics and in applications.
The Cayley graph of a group has one vertex for each element x in the group. Each vertex has one arrow coming out of it for each
generator g, pointing to the element gx. (This creates the left Cayley graph. The right Cayley graph has arrows pointing from x to xg.)
Usually we make the arrows different colors to correspond to the different generators; this is very useful for being able to visualize the
structure of the group!
For the dihedral group, we found a set of generators with two elements: the rotation and the flip over one of the axes. In fact, the
dihedral group has many different sets of generators of size two! We could have chosen the clockwise rotation instead of the counter-
clockwise rotation, for example. Or we could have chosen any of the other flips. But the resulting Cayley graph would have been
more-or-less the same.
Figure. 3.2: The Cayley graph for the dihedral group with generators given by two different flips.
Suppose we have a generator where g 2
= 1 . It's tedious to draw arrows in both directions from every element, so we sometimes omit
the arrow heads in this case.
Exercise 3.0.1
Identify generators for the permutation group S . Make a Cayley graph.
3
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
some others.
INTEGERS MODULO n
The integers modulo n only require a single generator to obtain the entire group. 1 is a nice choice of generator, as we know that every
number k in {0, 1, 2, … n} can be written as k ⋅ 1 , which is to say, the sum of 1 with itself k times. The Cayley graph for this
situation is simple: it's just n vertices, arranged in a loop with an arrow pointing from each number to the next. This creates a cycle!
When n = 8 , the cycle is this:
0 → 1 → 2 → 3 → 4 → 5 → 6 → 7 → 0 (3.1.1)
. Any group which is generated by a single element (including the usual integers!) is called a cyclic group. (This is yet another
interpretation of Z !)
n
We can choose other numbers than 1 as the generator, though! Take n = 8 , and consider the number 3. We can make our Cayley
graph by drawing a vertex for each number in {0, 1, 2, … , 8} and an arrow from each x to x + 3 . Then the cycle draws out as:
[0\rightarrow 3\rightarrow 6\rightarrow 1\rightarrow 4\rightarrow 7\rightarrow 2\rightarrow 5\rightarrow 0].
Here's a Cayley graph for Z shown with three generators. Any one of the three generators would work just fine. The red vertex is the
7
identity, 0. The green arrows are for the generator 1, blue for the generator 2, and green for the generator 3. What would happen if we
included the generators 4, 5, or 6?
Figure 3.1: Cayley graph for Z with three different generators. The identity is marked as the red dot.
7
Not every number is a generator of Z . For example, in Z , if we choose 4, the cycle is just:
n 8 0 → 4 → 0 . Since the cycle doesn't
contain every element of the group, we see that 4 doesn't generate the group on its own.
Exercise 3.1.0
Suppose k ∈ Z . Show that k generates Z if and only if k is relatively prime to n. (ie, the only common divisor of k and n is 1.)
n n
Exercise 3.1.1
Suppose k ∈ Z and k is not relatively prime to n. Is it possible to find another umber m not relatively prime to n such that k and
n
PERMUTATION GROUPS
The permutation group S has a number of interesting generating sets. We'll show a few of these generating sets for n = 3 and n = 4
n
consider an arbitrary permutation σ, written in list notation. If there are two adjacent entries that are out of order (big to the left of the
small), we can apply rotations until the two things sit in the first two entries (suppose we use k rotations to do this). Then we apply the
flip. And then we 'unrotate' k times to put the now-sorted numbers back. Then we find two more adjacent numbers and repeat. Once
there are no adjacent numbers out of order, then we must be at the identity! Then the reverse of the sequence of moves we just made
builds the permutation we wanted. Since the permutation was arbitrary, our two moves must generate the group.
Figure 3.2: A Cayley graph for S , generated by the rotation [4, 1, 2, 3] (in red) and reflection \([2,1,3,4]) (cyan).
4
A second set of generators is given by the set of all transpositions. These are all of the permutations that have two things switched and
everything else in order. For example, [4, 2, 3, 1] and [1, 2, 6, 4, 5, 3] are transpositions. Modifying the above argument, you can see
that the set of all transpositions are a generating set. There are more than 2 transpositions, so this isn't a minimal generating set. But it
is an interesting set of generators when studying the permutation group more closely.
Exercise 3.1.2
Yet a third set of generators is given by the simple transpositions. This is the set of transpositions { s } that just exchange i and i + 1
i
while leaving everything else alone. There are n − 1 simple transpositions. This is a very important set of generators in the further
study of permutations! But it shows up in one simple context, as well.
Exercise 3.1.3
What permutation of n things takes the longest to be sorted by Bubble Sort? How many simple transpositions are necessary to sort
that permutation?
Exercise 3.1.4
For the same 'long' permutation from the last exercise, sort the permutation using the first set of generators for S , the rotation and
n
the flip. How many steps are needed to sort the permutation this way?
We see that there's a trade-off between having a smaller set of generators and being able to write different group elements as products
of fewer generators. (Indeed, if we took the whole group as the generating set, every element could be written as a product of just one
generator! But this usually isn't so helpful for understanding the group...)
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Ostensibly, to check that a subset H is a subgroup, we would need to check all four properties of the group. That is, closure (ie, the
operation gives a map H × H → H ; products of things in H are always in H ), identity, the existence of inverses, and associativity.
In fact, since H has the same operation as G , we know that the operation in H is associative (since G is a group). Furthermore, if the
operation is closed and inverses exist, then we know that for any h ∈ H , h h = e must be in H . So really we only need to check
−1
two things:
1. Closure: gh ∈ H for all g, h ∈ H , and
2. Inverses: h ∈ H for all h ∈ H .
−1
generated by h.
Exercise 3.2.1
Let X be a geometric object. Show that the rotations of X back onto itself forms a subgroup of the group of symmetries of X. (Try
this in particular on a regular polygon and a regular polyhedron. What happens with a 'bumpy' polygon?)
Let G be a group, and g ∈ G . Consider a function f : G → G given by f (h) = g ⋅ h . (This is the 'left multiplication by g'
g g
function.) What happens if, for some h, k ∈ G , f (h) = f (k) ? Then gh = gk , so g gh = g gk , and h = k . This tells us that
g g
−1 −1
f is a one-to-one, or injective, function. If G has a finite number of elements, then f is also an onto function, and is thus a bijection
g g
If we consider G as a set, we can think of any left multiplication as a permutation of G . But the set of all left multiplications is itself a
group. This gives us what is known as Cayley's Theorem!
Exercise 3.2.3
Label the six symmetries of the equilateral triangle. Demonstrate that the symmetries of the triangle are a subgroup of S6 , the
permutations of 6 objects.
It is worth noticing that for any g in a group G , the powers of g generate a subgroup of G . The set { g i
∣ i ∈ Z} is closed under the
group operation, and includes the identity and inverses. This is called the cyclic subgroup generated by g.
Exercise 3,2,4
Find all of the subgroups of the permutation group S for three objects. Which subgroups are subgroups of other subgroups? Name
3
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
two different 'copies' of Z sitting inside of the dihedral group. Likewise, the light arrow loops look like five copies of Z . Both Z
5 2 5
and Z are subgroups of D , generated by the rotation and flip respectively. These 'copies' of the subgroups that we see in the Cayley
2 5
Dihedral group Cayley graph, generated by a flip and rotation. The Cayley graph for the dihedral group with generators given by a flip
and a rotation.
These are precisely the 'copies' of the subgroups that we saw in D . The elements of D can all be written as f r with i ∈ {0, 1}
5 5
i j
and j ∈ {0, 1, … , 4} . The rotation subgroup consists of the element R = {e, r, r , r , r } . Then R has two distinct cosets, R and
2 3 4
fR = {f, fr, f r , f r , f r } . For any g we choose, gR is equal to one of these two cosets! Likewise, if we consider the subgroup
2 3 4
F = {e, f} , there are five distinct cosets given by r F , where i ∈ {0, 1, 2, 3, 4}. Notice that the cosets evenly divide up the group;
i
Suppose that H is a subgroup of G . Let x, y ∈ G . Then either xH = yH or xH and yH have no elements in common.
Proof 3.3.2
any elements, then they are equal, and if they share no elements, they are tautologically disjoint!
So the order of R ⊂ D is |R| = 5 , and the order of the flip subgroup |F | = 2 . We can now prove Lagrange's Theorem!
5
Notice that every element of the group G shows up in some coset of H : since e ∈ H , we have g ∈ gH for every g. Therefore, every element of
the group shows up in exactly one coset of H . Also notice that every coset of H has the same number of elements as H . (If the size of gH were
less than |H | , there would be have to be two different elements h , h ∈ H with g h = g h . But cancelling the g's gives h = h , a
1 2 1 2 1 2
contradiction.)
Then the cosets of H break up G evenly into subsets of size |H | . Thus, |H | divides |G| , as desired.
Exercise 3.3.6
Find all of the subgroups of the permutation group S and the dihedral group D .
3 5
Exercise 3.3.7
Find a subgroup of the permutation group S with twelve elements. What are it's cosets?
4
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
4.0: HOMOMORPHISMS
4.1: PRODUCT GROUPS
4.2: IMAGE AND KERNEL
TOC.1 11/13/2019
4.0: HOMOMORPHISMS
When we think of the integers, it is useful not just to think of individual numbers, but relations between numbers; people do this
automatically, comparing two numbers to see which is bigger. And if neither number is bigger, we see that the two numbers are equal.
In fact, this is a very important way to approach mathematics: We consider not just objects but also how objects are related to one
another. So far, we have thought extensively about group as objects. But are there interesting relationships between groups? If so, can
we come up with a notion of when two groups are the same?
We can relate groups using special functions between different groups called homomorphisms. There's a way to think of relationships
between numbers as functions, too: if you have k cows and p chickens, we can try to make a one-to-one function from cows to
chickens. If it's impossible, we know the number of cows is larger. If there are chickens left over (ie, the function is not onto) then the
number of chickens is larger. But if the function is one-to-one and onto, then the two numbers p and k are equal.
We should look at lots of examples to build up some intuition! Take the integers with the operation of addition. Define phi:
Exercise 4.0.1
Find a function on the interval [0, 1] that changes the endpoints but is not a symmetry. In other words, find some
f : [0, 1] → [0, 1] such that f(0) = 1 , f(1) = 0 , but f is not length-preserving.
So symmetries are a special kind of function which preserve distances. When we try to relate groups to one another, we use special
kinds of functions between the groups.
HOMOMORPHISMS
A group is a set with an operation which obeys certain rules. So we'll consider functions that preserve the operation. That is, functions
for which it doesn't matter whether we perform our group operation before or after applying the function. More precisely:
We should look at lots of examples to build up some intuition! Take the integers with the operation of addition. Define ϕ : Z → Z by
ϕ(n) = 2n . Note that the definition of homomorphism works regardless of the symbol we're using for the group operation, and for Z
we use addition. Then to show that ϕ is a homomorphism, we need to check that ϕ(n + m) = ϕ(n) + ϕ(m) ; the operation before
applying ϕ is the same as the operation after applying ϕ. So we check! ϕ(n + m) = 2(n + m) , while ϕ(n) + ϕ(m) = 2m + 2n .
Since 2(n + m) = 2n + 2m , we see that ϕ is a homomorphism.
We can also have homomorphisms between groups where the operations are written differently! For example, there is a
homomorphism between the integers modulo n (mathbbZ ) and the nth roots of unity. Remember that mathbbZ is written with an
n n
ik
addition operation, while the nth roots of unity are written with multiplication. We define ρ by ρ(k) = e . Then we check that ρ is a
2π
homomorphism! Since the operations are written differently (addition and multiplication), we need to check whether
i( k+j) i( k) i( j) i( k+j)
ρ(k + j) = ρ(k)ρ(j) . This isn't so bad: ρ(k + j) = e . On the other hand, ρ(k)ρ(j) = e e
2π 2π
= e 2π 2π
. So this is a
homomorphism; in fact, it is an isomorphism, since the n-th roots of unity and Z have the same number of elements.
n
Isomorphisms are very special homomorphisms. If two groups are isomorphic, it is impossible to tell them apart using just the tools of
group theory. True, the two groups may look very different, but they are structurally identical. When we saw the integers modulo n,
we saw four different realizations of 'the same' group; they were all isomorphic.
Exercise 4.0.4
Let H be a subgroup of G . Define the inclusion ι : H → G by ι(x) = x for each x ∈ H . Show that ι is a homomorphism.
There are many things we can say about homomorphisms with just a little work. We'll prove two basic statements right away.
Proposition 4.0.5
Proof 4.0.6
1. Choose any element x ∈ G . Then rho(x) = ρ(1x) = ρ(1)ρ(x) . So rho(x) = ρ(1)ρ(x). Cancelling the rho(x) on both side leaves us
with 1 = ρ(1) .
2. We have ϕ(1) = 1 , so 1 = ϕ(xx ) = ϕ(x)ϕ( x ), giving us 1 = ϕ(x)ϕ( x ) . Then we can multiply both sides on the left by ϕ(x)
−1 −1 −1 −1
Exercise 4.0.7
1. Show that for any homomorphism ϕ, we have ϕ(x ) = ϕ(x ) .
n n
2. Show that if two finite cyclic groups have the same order, then they are isomorphic.
This tells us that group homomorphisms, in addition to preserving the group operation, also preserve inverses and exponents. Thus,
group homorphisms also preserve inverses and exponents!
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
look at products of groups and find a way to make new groups from the groups we already know.
A very famous group - though not a very complicated one - is the Klein Four-Group. This is the symmetry group of a rectangle. It has
a pair of generators, given by the flips over the horizontal and vertical axes.
{(0, 0), (1, 0), (0, 1), (1, 1)} , and operation given by just adding the elements coordinate-wise as elements of Z . (So that 2
(1, 0) + (1, 1) = (0, 1) . Then H is a group (check!), and is in fact isomorphic to the Klein Four-Group. It's an example of a product
group!
Let's be more precise and set a definition of a product group.
G and H .
Here's an example. Take G = Z and H = Z , and consider the product G × H . The product group has 18 elements: there are three
3 6
choices for the first coordinate and 6 choices for the second coordinate. Since we use addition as the operation in both of the coordinate
groups, we'll use addition as the operation in the product. So consider elements (2, 4) and (1, 3). Then (2, 4) + (1, 3) = (0, 1) ;
addition in the first coordinate is according to Z , and addition in the second coordinate is according to Z .
3 6
We should check that the product of any pair of groups G and H is actually a group.
1. The product group has an identity (1, 1): (1, 1) ⋅ (g, h) = (1g, 1h) = (g, h) .
2. Associativity follows from associativity of G and H .
3. Closure also follows from closure in G and H .
4. The inverse of (g, h) is (g , h ) .
−1 −1
So G × H really is a group.
We saw in the example that Z 3 × Z6 has 18 elements. This isn't a coincidence! For any finite groups G and H , the product group has
|G||H | elements.
An interesting question at this point is suggested by Lagrange's Theorem, which told us that the cardinality of any subgroup divides the
cardinality of the original group. We've seen that we can form product group sto 'multiply' groups: Is it also possible to 'divide' groups?
Over the next few sections, we'll develop ideas that will let us build quotient groups.
Exercise 4.1.1
Not all product groups are commutative. How many elements are in G = S4 × Z3 ? Identify the identity. Write down a few non-
identity elements and compute their respective products.
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Let's try an example. Recall the homomorphism ϕ : Z → Z , defined by ϕ(n) = 2n for any n ∈ Z . The image of ϕ is the set of all
even integers. Notice that the set of all even integers is a subgroup of Z. The kernel of ϕ is just 0.
Here's another example. Consider the map ϕ : Z → Z given by ϕ(n) = 2n . So ϕ(0) = 0 , ϕ(1) = 2 , and ϕ(2) = 4 . This is
3 6
actually a homomorphism (of additive groups): ϕ(a + b) = 2(a + b) = 2a + 2b = ϕ(a) + ϕ(b) . The image is the set {0, 2, 4},
and, again, the kernel is just 0.
And another example. There's a homomorphism ρ : Z → Z given by ρ(a) = a (divide by 3 and keep the remainder). Then
6 3
ρ(0) = 0 , ρ(1) = 1 , ρ(2) = 2 , ρ(3) = 0 , ρ(4) = 1 and finally ρ(5) = 2 . You can check that this is actually a homomorphism,
So the image is the set of everything in H which has something in G which maps to it. The kernel is the set of elements of G which
map to the identity of H . The kernel is a subset of G , while the kernel is a subset of H . In fact, both are subgroups!
Proposition 4.2.1
To see that the kernel is a subgroup, we need to show that for any g and h in the kernel, gh is also in the kernel; in other words, we
need to show that ρ(gh) = 1 . But that follows from the definition of a homomorphism: ρ(gh) = ρ(g)ρ(h) = 1 ⋅ 1 = 1 . We leave it
to the reader to find the proof that the image is a subgroup of H .
Exercise 4.2.2
Show that for any homomorphism ρ : G → H , ρ(G) is a subgroup of H .
We can use the kernel and image to discern important properties of ρ as a function.
Proposition 4.2.3
Proof 4.2.4
If we assume is injective, then we know (from the exercise in the last section) that ρ (1) = {1} . For the reverse direction, suppose
ρ
−1
(1) = {1} , and assume (for contradiction) that ρ is not injective. Then there exist x ≠ y with ρ(x) = ρ(y) . But then
−1
ρ
Let ρ : G → H be a homomorphism, and let K be the kernel of ρ. Then for any k ∈ K and x ∈ G , we have xkx −1
∈ K .
Proof 4.2.6
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Topic hierarchy
TOC.1 11/13/2019
5.0: QUOTIENT GROUPS
Previously we saw product groups; now we'll learn about quotient groups. The construction is a bit more involved than the construction
of product groups, just as division of natural numbers is a bit more complicated than multiplication...
Let's think for a moment about how quotients of natural numbers work, for the sake of building an imperfect analogy. When we write
= q , we have a numerator, denominator, and a quotient. The quotient q can be thought of as the number of times we can divide n
n
order of G .
Definition 5.0.0
The set of cosets of a subgroup H of G is denoted G/H .
Then we can try to take the cosets of H as the underlying set of our would-be quotient group Q . The question is whether we can now
identify a reasonable group operation on the set of cosets of H . The answer is 'sometimes!'
might cause it to fail? A problem arises because the set on which we're defining our new quotient group is the set of cosets, and it isn't
generally obvious which element to take as the representative of the coset; ie, there is more than one way to write a coset as gH , and
different choices might lead to different answers when we multiply our cosets.
Here's an example.
Take the group G = D , the symmetries of a pentagon generated by a flip f and a rotation r. Let H be the subgroup consisting of just
5
the identity and the flip f . Then H = {1, f} . This subgroup has five different cosets; suppose we want to multiply the cosets
C = {r, rf} and D = { r , r f} . Notice that there are two different ways to write C in the from gH : C = rH and C = rfH .
3 3
Each arises from a different choice of representative from H . The same is true for D: D = r H and D = r fH . Depending on the
3 3
choice of representatives, our rule for multiplying cosets then yields different answers. For example, rH ⋅ r H = H , but 3
rfH ⋅ r H = rf r H = r fH ≠ H .
3 3 2
Then we see that a more nuanced approach is necessary: in particular, our notion of a product shouldn't depend on a choice of coset
representative!
Exercise 5.0.1
Let R be the subgroup of D generated by the rotation r. Show that the product rule (aR) ⋅ (bR) does not depend on the choice of
5
coset representatives!
PRODUCTS OF COSETS
The initial idea for a product on cosets fell down because we were multiplying coset representatives, instead of thinking about how to
multiply the actual cosets. So let's try to define an actual product of cosets!
Earlier, we saw what we might call left cosets, of the form aH = {ah , ah , …} where h are all the elements of H . But we can
1 2 i
easily imagine right cosets as well, H a = { h a, h a, …} , and even double cosets aH b = {ah b, ah b, …}. More generally, we
1 2 1 2
can define product of sets: if A = { a , a , …} , then AH is the set obtained by taking products of elements of A and H in every
1 2
possible way: AH = {ah|a ∈ A, h ∈ H } . We can use the product of sets to compute explicit products of cosets.
Proposition 5.0.2
If H is a subgroup of G , then H H = H .
Proof 5.0.3
Since H is closed under the group operation, every element of H H is in H . Furthermore, since 1 ∈ H , every element h in H appears in HH
when examining products like (aH )(bH ), we have aH bH = abH H = abH. Then defining a product on cosets
(aH )(bH ) = abH makes sense, and will end up giving a nice group structure. The identity is 1H , associativity follows from the
We should also check that this product doesn't depend on choice of coset representative. Suppose aH = xH , and consider the product
(aH )(bH ) = aH H b = aH b . Then notice that (aH )(bH ) = aH b = abH , and (xH )(bH ) = xH b = (aH )b = abH . Thus, we
have aH = bH .
NORMAL SUBGROUPS
If we look closely at what we've just done, we didn't actually need G to be commutative: all that we needed was aH = Ha for every
a ∈ G . For example, we know this is true for the kernel of any homomorphism from the proposition in Section 4.2.
Let G be a commutative group. Then every subgroup H of G is a normal subgroup, and G/H is a group.
We have already noticed that the kernel of any homomorphism is a normal subgroup. We can also define the quotient map
π : G → G/H , defined by π(a) = aH for any a ∈ G . So long as the quotient is actually a group (ie, H is a normal subgroup of
Corollary 5.0.7
Exercise 5.0.8
|G|
Let G be a finite group and H a subgroup with |H |
= 2 . Show that H is a normal subgroup of G .
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Let nZ denote the set of integers divisible by n: nZ = {… , −3n, −2n, −n, 0, n, 2n, 3n, …} . This forms a subgroup: 0 is always
divisible by n, and if a and b are divisible by n, then so is a + b . Since every subgroup of a commutative group is a normal subgroup,
we can from the quotient group Z/nZ.
To see this concretely, let . Then the cosets of 3Z are 3Z, 1 + 3Z , and 2 + 3Z . We can then add cosets, like so:
n = 3
(1 + 3Z) + (2 + 3Z) = 3 + 3Z = 3Z. The last equality is true because 3Z = {… , −6, −3, 0, 3, 6, …} , so that
3 + 3Z = {… , −3, 0, 3, 6, 9, …} = 3Z .
Exercise 5.1.0
Write out addition tables for Z/5Z as a quotient group, and check that it is isomorphic to Z as previously defined.
5
ways to interpret the alternating group, but they mainly come down to the idea of the sign of a permutation, which is always ±1 . The
set {1, −1} forms a group under multiplication, isomorphic to Z . The sign of a permutation is actually a homomorphism. There are
2
easily show that such a matrix has determinant equal to ±1 . Since the determinant is a multiplicative function -
det(M N ) = det(M ) det(N ) - we can observe the the determinant is a homomorphism from the group of permutation matrices
permutation is (−1) . i
3. Count crossings. Draw a braid notation for the permutation where no more than two lines cross at any point and no line intersects
itself. Then count the number of crossings, c. Then s(σ) = (−1) . The alternating group is the subgroup of S with s(σ) = 1 .
c
n
(To prove that this method of counting works, one needs a notion of Reidemeister moves, which originally arise in the fascinating
study of mathematical knots.)
Exercise 5.1.1
Find the inversion number for every permutation in S , and then sort the permutations by their inversion number.
4
Exercise 5.1.2
Show that each of the three definitions of the sign of a permutation give a homomorphism from S to {1, −1} . (For the third
n
definition, a sketch of a proof will suffice. Be sure to argue that different braid notations for the same permutation give the same
sign, even if the total number of crossings is different.)
We call a permutation with sign +1 a positive permutation, and a permutation with sign −1 a negative permutation.
Exercise 5.1.3
Show that there are n!
2
positive permutations in S .
n
Exercise 5.1.5
In fact, the alternating group has exactly two cosets. The quotient group S
n / An is then isomorphic to Z .
2
QUOTIENTS OF PRODUCTS
Amongst the natural numbers, if we do something like (m ∗ n)/m , we expect to get n back again. Is this also true for products and
quotients of groups?
If we consider a product group G × H with identity (1, 1), we can take the subgroup given by elements (1, h) for h ∈ H .
Remembering the injection map ι : H → G × H , we notice that this subgroup is just ι(H ). The cosets of ι(H ) are given by
(g, 1)ι(H ) , for g ∈ G . Then the quotient G × H /ι(H ) is isomorphic to G .
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
The proof is just to build a correspondence between the cosets of the kernel gK and elements of the image I . Indeed, in any coset gK
all elements map to the same element of the image. ρ(gk) = ρ(g)ρ(k) = ρ(g)1 = ρ(g) for any k ∈ K .
This suggests a homomorphism from the set of cosets to the image: set ϕ(gK) = ρ(g) . This is a homomorphism, since
ϕ(ghK) = ρ(gh) = ρ(g)ρ(h) = ϕ(gK)ϕ(hK) .
The map ϕ is also one-to-one: if ϕ(gK) = ϕ(hK) , we have ρ(g) = ρ(h) , so that 1 = ρ(g −1
h) , meaning g
−1
h ∈ K . Then
h) ∈ gK , which tells us that gK = hK , since cosets are either equal or disjoint.
−1
h = g(g
The map ϕ is onto, since any element in the image may be written as ρ(g) for some g, which is also the image of gK under ϕ .
Therefore, the map ϕ is an isomorphism.
TODO: Pictures!
This theorem is often called the "First Isomorphism Theorem." There are three isomorphism theorems, all of which are about
relationships between quotient groups. The third isomorphism theorem has a particularly nice statement: (G/N )/(H /N ) ∼ G/H ,
which one can relate to the the numerical identity
n
m
n
= . (5.2.1)
p
p
m
A PRELUDE TO CATEGORIES
Some of the important objects in this chapter are similar to things you've probably encountered in linear algebra: kernel, image, and
product. This isn't an accident: vector spaces are also algebraic structures. For almost any algebraic structure, we begin with the actual
structure, and then consider special functions between the objects with that kind of structure. In groups, these are homomorphisms, and
in linear algebra the special maps are linear functions.
In most of these contexts, the idea of kernel and image still make sense, and one can form products and quotients, and even derive a
version of the isomorphism theorems.
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
An interesting question, then, is 'Which groups have no quotients?' We've seen that we can form a quotient group whenever there is a
normal subgroup.
We'll now actually classify all of the finite simple groups, and discuss some of the history of the non-commutative case.
A finite commutative group is simple if and only if it has prime order p. In this case, it is isomorphic to the cyclic group, Z . p
Proof 5.3.2
If a finite commutative group has prime order then it has no proper subgroups, by Lagrange's theorem. Then it must be simple.
For the other direction, we assume G is a finite commutative simple group. G must be cyclic, or else we could form a proper subgroup by taking
powers of a generator. So G ∼ Z for some n. But if n is not prime we can find a subgroup using a proper divisor of n. Then G ∼ Z for some
n p
prime p.
Theorem 5.3.3
Every finite commutative group is a direct product of cyclic groups of prime order.
Proof 5.3.4
Let A be a commutative group with n elements. Take any element x not equal to the identity in A ; we know that there is some minimal integer
m for which x = 1 . Then A has a subgroup of order m generated by x, isomorphic to Z . As a result, we have A ∼ A ⊗ Z , where A
m
m 1 m 1
One can extend this trick to some infinite groups: those which have a finite number of generators. (Such groups, unsurprisingly, are
called finitely-generated.) This gives rise to the Fundamental theorem of finitely-generated commutative groups.
Exercise 5.3.5
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
TOC.1 11/13/2019
6.0: GROUP ACTIONS
Group actions bring us back to our original view of groups as measures of symmetry. We begin with a definition.
Definition 6.0.1
Let G be a group. A set S is a (left) G -set if there is a function from G × S → S (which we will write as g ⋅ s for g ∈ G, s ∈ S )
satisfying:
1. (gh) ⋅ s = g ⋅ (h ⋅ s) for all g, h ∈ G, s ∈ S , and
2. 1 ⋅ s = s for all s ∈ S .
An analogous definition can be written for a right G -set; a right G -set has a function from S × G → S .
These conditions say, in plain language, that elements of G move objects in the set to other objects, in a way which respects the group
operation. (ie, it doesn't matter whether you perform the operation before or after applying the action.) Furthermore, the identity fixes
every element of the set.
It's best to start with some easy combinatorial examples.
1. Let G be a group and S be any set. The trivial action of G on S is given by g⋅ s = s for every g ∈ G, s ∈ S . One can easily
check that the conditions for a group action hold!
2. Consider S and a set S of n labelled objects. Then the permutations of the objects constitute an action of S on S .
n n
3. But S can also act on sets with more than n elements! Consider a regular deck of playing cards; each card has one of four suits
n
(Clubs, Spades, Hearts, or Diamonds) and are numbered 1 to 13. (Where a Jack is 11, Queen is 12 and King is 13.) We can write
each card in a short form: 4D is short for 'four of diamonds.' Then S acts on the deck of cards by permuting the suits of the cards.
4
For example, consider the permutation σ which transposes hearts and diamonds while leaving clubs and spades alone. Then
σ(4D) = 4H , and σ(12C) = 12C .
We can be a bit more explicit about the action on suits by identifying C with 1, S with 2, H with 3, and D with 4. Then the
permutation σ is equivalent to [1, 2, 4, 3] in one-line notation.
On the other hand, S acts on the values of the cards while leaving the suits alone. For convenience, we write the action on values
13
of cards on the right and the action on suits on the left. Let τ be the permutation in S which sends i to i + 1 , and 13 to 1. Then
13
4. And S can act on sets with fewer than n elements. Consider a coin, with a 'heads' side (H ) and a 'tails' side (T ). We can define an
n
action of S where σ flips the coin if the sign of σ is negative, and leaves the coin alone if the sign of σ is positive. Here the set
n
For the example of S acting on a coin, the Cayley graph will just have two vertices, H and T , and arrows according to the action of
n
the generators. If we consider the generating set of simple transpositions (which, we'll recall, exchange i and i + 1 and leave
everything else alone), the Cayley graph will have arrows from H to T and back for each simple transposition. (Because all simple
transpositions have sign −1 .)
Exercise 6.0.1
Sn has another generating set consisting of the transposition [2, 1, 3, 4, … , n] and the rotation [n, 1, 2, … , n − 1] . Draw the
Cayley graph of S acting on the coin with these generators. (Note that the Cayley graph may be different for different n.)
n
Let's build a Cayley graph for the cards, but with a smaller set of cards to make things easier to draw. Imagine our deck only has two
suits - Spades and Diamonds - and only numbers 1 through 4. Then let S act on the left by permuting numbers. The Cayley graph is
4
below.
Figure 6.2: Cayley diagram for S acting on some cards numbered 1 through 4, whose suits are all either diamonds (D) or spades (S).
4
Notice that the action is broken up into two different connected pieces: Exchanging numbers will never allow us to change suits, but
we can switch around numbers freely. As a result, the suits each form a 'block' from which the other blocks can't be reached. These
blocks are called orbits; we'll study them intensively in the next section.
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
There are a few questions that come up when encountering a new group action. The foremost is 'Given two elements s and t from the
set S , is there a group element such that g ⋅ s = t ?' In other words, can I use the group to get from any element of the set to any other?
In the case of the action of S on a coin, the answer is yes. But in the case of S acting on the deck of cards, the answer is no. In fact,
n 4
this is just a question about orbits. If there is only one orbit, then I can always find a group element to move from any object to any
other object. This case has a special name.
Equally important is the stabilizer of an element, the subset of G which leaves a given element s alone.
For example, the stabilizer of the coin with heads (or tails) up is A , the set of permutations with positive sign. In our example with S
n 4
acting on the small deck of eight cards, consider the card 4D. The stabilizer of 4D is the set of permutations σ with σ(4) = 4 ; there
are six such permutations.
In both of these examples, the stabilizer was a subgroup; this is a general fact!
Proposition 6.1.3
Proof 6.1.4
Let g, h ∈ G . Then gh ⋅ s
s = g ⋅ (h ⋅ s) = g ⋅ s = s . Thus, gh ∈ G . If g
s ∈ Gs , then so is g −1
: By definition of a group action, 1 ∈ G , so:
s
s = 1⋅s = g
−1
g ⋅s = g
−1
s .
Thus, G is a subgroup.
s
Figure 6.1: H is the subgroup of S with σ(1) = 1 for all σ in H . This illustrates the action of S on cosets of H .
4 4
Exercise 6.1.5
1. Show H is a subgroup of S .
i n
2. Now let n = 5 and Sketch the Cayley graph of the coset action of S on H and H . 5 1 3
The coset action is quite special; we can use it to get a general idea of how group actions are put together.
Proposition 6.1.6
Proof 6.1.7
In fact, we can generalize this idea considerably. We're actually identifying elements of the G -set with cosets of the stabilizer group,
which is also a G -set; in other words, defining a function ϕ between two G -sets. The theorem says that this function preserves the
group operation: ϕ(g ⋅ s) = g ⋅ ϕ(s) .
Definition
Let S, T be G -sets. A morphism of G -sets is a function ϕ : S → T such that ϕ(g ⋅ s) = g ⋅ ϕ(s) for all g ∈ G, s ∈ S . We say
the G -sets are isomorphic if ϕ is a bijection.
Now we can use LaGrange's theorem in a very interesting way! We know that the cardinality of a subgroup divides the order of the
group, and that the number of cosets of a subgroup H is equal to |G|/|H |. Then we can use the relationship between cosets and orbits
to observe the following:
Theorem 6.1.10
For a somewhat obvious example, considering S acting on the numerical values of playing cards, we can observe that any given card
13
is fixed by a subgroup of S isomorphic to S (switching around the other twelve numbers in any way doesn't change affect the
13 12
given card). Then the size of the orbit of the card is |S |/|S | = 13 . That's a number we could have figured out directly by
13 12
Let G be a finite group, and S a finite G-set. Then S is isomorphic to a union of coset actions of G on subgroups.
For example, S acting on a full deck of cards decomposes as a union of four orbits, each isomorphic to the coset action of S
13 13 on a
subgroup isomorphic to S . 12
In short, to understand all possible G -sets, we should try to understand all of the subgroups of G . In general, this is a hard problem,
though it's easy for some cases.
Exercise 6.1.12
1. For n = 15 , draw Cayley graphs of the coset action of Z 15 on each of it's cosets.
Exercise 6.1.13
Sn acts on subsets of N = {1, 2, 3, … , n} in a natural way: if U = { i1 , … , ik } ⊂ N , then σ ⋅ U = {σ(i1 ), … , σ(ik )} .
1. Decompose the action of S on the subsets of {1, 2, 3, 4} into orbits.
4
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
all. This is a very useful thing to count: It's useful for counting things 'up to symmetry.' We'll denote the orbits of S by S/G , and thus
the number of orbits is |S/G| . The notation should be read 'S -mod-G '; it's useful when two things are 'the same' in S if they are
related by an application of an element of G .
For example, let's suppose we wanted to count all of the ways of painting the sides of a cube with three colors. We would naturally
think of two ways of painting as being the same if we could rotate the cube to line up the colors in the same way. Then the group G
might be symmetries of the cube, and the set S would be all ways of painting the cube in fixed position: What we're really trying to
count is |S/G| .
Let S denote the set {s ∈ S ∣ g ⋅ s = s} . This is like the stabilizer in reverse: we're collecting up all of the elements of the set S that
g
are fixed by g.
Theorem 6.2.0:
(Burnside's Lemma) The number of orbits in a G -set S is |S/G| = |G|
1
∑
g∈G
|S
g
| .
(Note that there's ample evidence that Burnside didn't actually invent Burnside's lemma; we include the name because it's what
everyone knows it by.)
Proof 6.2.1:
Let G ⋅ s denote the orbit of s under G . First notice that the sum of the size of the fixed sets S is equal to the sum of the size of the
g
stabilizer groups G : Both are counting the number of pairs (g, s) such that g ⋅ s = s . Then:
s
g 1
∑ |S | = ∑ | Gs | = ∑ |G|/|G ⋅ s| = |G| ∑ = |G| ∑ 1 = |G||S/G|
g∈G s∈S s∈S s∈S |G⋅s| S /G
The types of rotational axis for a cube which produce symmetries. From left to right, a 'face' axis, a 'vertex' axis, and an 'edge' axis.
There are 3 colorings of the base cube to consider in all; for each symmetry, we determine the number of colorings that the symmetry
6
fixes.
1. The identity permutation. This permutation fixes all 3 colorings. 6
2. Face rotations by ±90 . These are formed by rotating around the axis through the center of two opposite faces. There are six such
∘
rotations. In order to fix a coloring, the coloring must have the four 'moving' sides all colored with the same color. The other two
sides may be colored in any way. Thus, each of these symmetries fixes 3 colorings. 3
3. Face rotations by ±180 . Rotation by 180 about the axis through opposite faces. There are three such symmetries. Each requires
∘ ∘
that opposite 'moving' faces be the same color, while the 'fixed' faces may be colored arbitrarily. Thus, there are 3 fixed points for 4
these rotations.
4. Vertex rotations by ±120 . Rotation through an axis between opposite vertices of the cube. There are four such axes, with two non-
∘
trivial rotations through each such axis, for a total of eight symmetries in this class. To fix a coloring, the coloring must have same-
colored faces touching the vertices of rotation. There are thus 3 fixed points for these rotations.
2
5. Edge rotations by ±180 . Rotation through the axis connecting the centers of opposite edges. There are six such rotations. To fix a
∘
coloring, such a rotation needs all opposite sides to have the same color. Thus, each one fixes 3 colorings. 3
We can then use Burnside's Lemma to find the total number of colorings up to rotation:
1
G
(1 ⋅ 3
6
+6 ⋅ 3
3
+3 ⋅ 3
4
+8 ⋅ 3
2 3
+ 6 ⋅ 3 ) = 57 .
Exercise 6.2.2
Exercise 6.2.3
A chess-board is an 8 × 8 grid. Suppose we color the 64 squares on the board with n different colors. Suppose the board is made
of a thin cloth, so that the colors 'bleed through' and are visible on both sides of the board.
Up to rotation or flip, how many colorings of the board are possible? How about for a k × k board?
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Topic hierarchy
TOC.1 11/13/2019
7.0: JUGGLING WITH TWO OPERATIONS
We'll now start looking at algebraic structures with more than one operation. Typically, these structures will have rules governing the
different operations, and additional rules for how the operations interact. We'll begin by looking at rings, which have two operations,
usually written as addition and multiplication, related by the distributive property.
There are many reasons to study ring theory, often having to do with generalizing the properties that we observe in many of the rings
we deal with in day-to-day life, like the integers and the rational numbers. By making precise the algebraic structures that (for
example) the integers satisfy, we can figure out what makes our favorite facts about the integers true, and easily see where those same
facts hold true.
It's also an area where most of the real pay-off comes later. Understanding ring theory is essential for algebraic geometry in particular,
which is a major force in modern mathematics. The basic idea of algebraic geometry is to study geometry using zeroes of polynomials:
for example, a line in the plane can be thought of as the zeroes of the polynomial f(x, y) = y − mx − b where m and b are
constants. In other words, to understand properties of geometry, it is helpful to understand properties of polynomials. And polynomials
are an example of a ring, as we'll see.
Definition 7.0.0
A ring is a set R with operations + and ⋅ such that:
1. R is a commutative group under + ,
2. (Distributivity) For all r, s, t ∈ R , we have r ⋅ (s + t) = r ⋅ s + r ⋅ t , and (s + t) ⋅ r = s ⋅ r + t ⋅ r .
Exercise 7.0.1
Show, using the definition of a ring, that for any ring R with additive identity 0, we have 0 ⋅ r = 0 for every r ∈ R .
This is the most general type of ring. There are many different types of ring which arise from placing extra conditions, especially on
the multiplicative operation. In fact, ring theory is kind of a zoo, divided up into the study of different 'species' of rings. Possibly the
most important rings to study are commutative, associative rings with unity, which we define now.
Definition 7.0.2:
Let R be a ring, and r, s, t ∈ R . Then R is:
1. Associative if the multiplication operation is associative: r ⋅ (s ⋅ t) = (r ⋅ s) ⋅ t ,
2. A ring with unity if there is a multiplicative identity 1, such that 1 ⋅ r = r = r ⋅ 1 ,
3. Commutative if the operation ⋅ is commutative: r ⋅ s = s ⋅ r .
Usually we'll deal with associative rings with unity; in fact, when we write 'ring' we'll mean an associative ring with unity unless
otherwise noted. As a result, 'commutative ring' will mean a ring that is commutative, associative and with unity.
There are numerous examples of rings! Here are some familiar examples.
1. Integers. The integers are a commutative group under addition, and have the distributive property. Additionally, the integers are
associative and commutative under multiplication, and have a multiplicative identity, 1. Thus, the integers are an commutative
associative ring with unity.
2. Rational Numbers, Real Numbers, Complex Numbers. All of these familiar number systems are examples of commutative
associative rings with unity.
3. Integers modulo n, Z . The multiplication operation works just as the addition operation does: do the normal multiplication, and
n
then divide by n and keep the remainder: a ⋅ b = (ab) . This is an associative and commutative operation, and there is a
multiplicative identity.
4. Matrices. Recall that matrix addition is just entry-by-entry, and that the multiplication of matrices adds and multiplies the entries
according to a certain rule: if M and N are matrices, then (M N ) = ∑ M N . Since this only uses addition and
i,j k i,k k,j
multiplication, we can thus form matrices with entries in any ring R , since R has notions of addition and multiplication. The set of
all m × n matrices with entries in R is denoted M m×n(R).
0 1 0 2 0 3
As an example, consider the matrices M = ( ) and N = ( ) with entries from Z5 . Then M +N = ( ) ,
2 3 3 4 0 2
3 4
and M ⋅ N = ( ) .
4 1
have (x + 1)(x + 1) = x + 1 .
2
Polynomials in many variables also form rings. We usually just write R[x, y] or R[x, y, z] if we're using two or three variables, or
more generally, R[x , x , … , x ] for more variables.
1 2 n
6. Rings of Functions. Many spaces of functions have a ring structure. For example, if we consider differentiable functions R → R ,
we can add and multiply functions: (f + g)(x) = f(x) + g(x) and (f ⋅ g)(x) = f(x)g(x) . Sums and products of differentiable
functions are also differentiable, so they are closed. The functions form an additive group, and there's a multiplicative identity: the
constant function defined by 1(x) = 1 .
There are numerous such function spaces: the space of continuous functions, integrable functions, infinitely differentiable
functions, and so on. And yes, polynomials. The study of rings of functions is important in mathematical analysis.
Exercise 7.0.3
1. Generate two 'random' matrices M and N in M (Z ) . Compute M + N , M N , and N M .
3,3 6
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Definition 7.1.0
Let R and S be rings. Then ϕ : R → S is a homomorphism if:
1. ϕ is homomorphism of additive groups: ϕ(a + b) = ϕ(a) + ϕ(b) , and
2. ϕ preserves multiplication: ϕ(a ⋅ b) = ϕ(a) ⋅ ϕ(b) .
If the homomorphism is a bijection, then it is an isomorphism.
Examples:
1. We have the inclusion homomorphism ι : Z → Q , which just sets ι(n) = n . This map clearly preserves both addition and
multiplication.
2. Consider the map ϕ : Z → Z sending k to k. We've seen that this is a homomorphism of additive groups, and can easily check
n
ρ(0) = 0, ρ(3) = 1 .) This can be shown, using the same argument as above, to be a ring homomorphism.
3. The evaluation map e is a function from R[x] to R . For any polynomial f ∈ R[x] and k ∈ R , we set e (f) = f(k) . This is a
k k
ring homomorphism! Let f(x) = a x + ⋯ a x , and g(x) = b x + ⋯ b x , where the a , b ∈ R . (We'll also allow
n
n
0
0
n
n
0
0
i i
leading coefficients to be zero in order to make it easy to add f and g formally.) We then check the ring homomorphism conditions:
a. We have:
n 0
ek (f + g) = ek ((an + bn )x + ⋯ + (a0 + b0 )x ) (7.1.1)
n 0
= (an + bn )k + ⋯ + (a0 + b0 )k (7.1.2)
n 0 n 0
= an k + ⋯ a0 k + bn k + ⋯ b0 k (7.1.3)
b. Since we know that ek is an additive homomorphism, we only need to check that it is multiplicative on monomials. But that's
easy:
n m n+m
ek ((ax )(b x )) = ek (ab x ) (7.1.5)
n+m n m
= ab k = ek (ax )ek (b x ). (7.1.6)
Exercise 7.1.1
Definition 7.1.2
Let R and S be rings. Define the direct product R × S as the set {(r, s) ∣ r ∈ R, s ∈ S} with coordinate-wise operations:
(r1 , s1 ) + (r2 , s2 ) = (r1 + r2 , s1 + s2 ) , and (r , s ) ⋅ (r , s ) = (r ⋅ r , s ⋅ s ) .
1 1 2 2 1 2 1 2
Of course, one should verify that this is a ring by checking the ring axioms.
Exercise 7.1.3
1. Show that for any rings R and S that the product R × S is a ring.
2. Show that the inclusion map ι : R → R × S given by ι(r) = (r, 0) is a ring homomorphism.
3. Show that the projection π : R × S → R given by π((r, s)) = r is a ring homomorphism.
Definition 7.1.4
Let R be a ring. A subset S of R is a subring if S is itself a ring using the same operations as R . (We don't require that S has a
multiplicative identity, though.)
For example, take R[x], the polynomial ring over R. The set of degree 0 polynomials is closed under addition and multiplication;
indeed, this set is just a copy of R. Thus, R is a subring of R[x].
On the other hand, consider the set of all polynomials of degree greater than or equal to 2 in Z[x], which we'll denote P . This is ≥2
closed under addition (the sum of two polynomials has degree equal to the max of their degrees), and is closed under multiplication
(the degree of the product is the sum of the degrees). Thus, it is a subring. However, the multiplicative identity in R[x] is 1, which has
degree 0. So there is no unit in $P_{\geq 2}.
Another example: Take 2Z ⊂ Z , the set of even integers. This set is closed under addition and multiplication, and is thus a subring.
(The sum and product of two even integers is still even.) However, the even integers don't have the number 1, and so there is no unit in
2Z.
Exercise 7.1.5
Let P ≥2
n
denote all polynomials in Zn [x] with degree ≥ 2 . Is P 4
≥2
a subring of Zn [x]? Why or why not?
Definition 7.1.6
Let ϕ : R → S be a ring homomorphism. The kernel of ϕ is {r ∈ R ∣ ϕ(r) = 0} , which we also write as ϕ −1
. The image of
(0)
Let ϕ : R → S be a ring homomorphism. Then the kernel of ϕ is a subring of R and the image of ϕ is a subring of S .
Proof 7.1.8
Since ϕ is a homomorphism of commutative additive groups, we know that the kernel and image are closed under addition. The kernel is closed
under multiplication, because if ϕ(a) = ϕ(b) = 0 , then ϕ(ab) = ϕ(a)ϕ(b) = 0. The image is closed because if x, y ∈ ϕ(R) , then there exist
a, b ∈ R such that ϕ(a) = x, ϕ(b) = y . Then xy = ϕ(a)ϕ(b) = ϕ(ab) ∈ ϕ(R) .
Just as kernels of group homomorphisms were special kinds of subgroups, kernels of ring homomorphisms are special kinds of
subrings.
Proposition 7.1.10
Proof 7.1.11
For any x ∈ K , we have ϕ(x) = 0 . Then ϕ(rx) = ϕ(r)ϕ(x) = ϕ(r)0 = 0 . Similarly, ϕ(xr) = 0 . Thus, K is a two-sided ideal.
Ideals are playing exactly the same role as normal subgroups in the groups context; in fact, an ideal is a normal subgroup of the
additive group of the ring. In particular, we can form cosets and consider the quotient R/I . Since it's an additive group, cosets of an
ideal I are of the form r + I = {r + x|x ∈ I } .
Theorem 7.1.12
Proof 7.1.13
We know that under addition R/I is a commutative group. So we just need to show that the multiplication distributes over addition. For this we
have:
((r + I) + (q + I))(s + I) = rs + qs + I = (r + I)(s + I) + (q + I)(s + I) .
One can also check that the multiplication is associative and commutative if R is associative and commutative. Likewise, if R has a unit, then
1 + I acts as a unit in R/I .
Let R and S be rings, and ϕ : R → S a homomorphism. Then the image of ϕ is isomorphic to R/I .
Proof 7.1.15
To prove the isomorphism theorem, build a homomorphism from R/I to the image of ϕ, just as we did for groups, and show that it is a bijection.
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Topic hierarchy
TOC.1 11/13/2019
8.0: THE PROBLEM OF DIVISION
Let's consider the problem of division. To get at a notion of division in general rings, let's recap what we know about division for
familiar number systems.
Definition 8.0.0:
For numbers x, y, we say that y divides x if there exists a number z such that x
y
= z if x = zy . We call z the quotient of x by y .
We can try to offload the problem of division to a problem of finding multiplicative inverses. If y has a multiplicative inverse, then
division by y is easy: we can set z = xy , so that zy = x y y = x . If every element other than 0 has a multiplicative inverse, then
−1 −1
R is called a field. You should already know three examples of fields: Z, R , and C . Part of the reason for the importance of fields is
that most of the basic facts in linear algebra work for any field.
Since the field must already be a commutative, associative ring with unity, we see that the set F ∖ {0} is a group! Then another way to
define a field is as a ring that is a commutative group under addition, and where F ∖ {0} is a commutative group as well.
For example, we've already seen the group Q , which is just Q with the 0 removed. Since this is a commutative group, Q is a field.
×
Example 8.0.2
For which values of n is Z a field?
n
All of this works fine in R , Q and C : in these rings, for every x and y , we can find a unique number z such that zy = x . In other
rings, though, things can go wrong in a number of different ways.
1. The first problem that could arise is that y has no multiplicative inverse. For example, in Z , there is no number z such that
6
2. It could be that for a given x and y , there is no quotient z = . An example of this occurs in Z, where (for example) there is no
x
number .
2
3. It could happen that the quotient z exists but is not unique. For example, consider the product ring Z × Z . Let x = (4, 0) and
y = (2, 0) . Then for any integer k, (2, k) ⋅ y = x .
4. There's also a problem if the ring R is not commutative. It could occur that yz = x but zy ≠ x . Which 'side' of x is our division
happening on?
We'll see that the different ways of resolving these questions give rise to definitions of different kinds of rings.
ZERO-DIVISORS
We'll first consider the question of multiplicative inverses. For a start, in any non-zero ring, 0 does not have a multiplicative inverse:
For any x we have have x ⋅ 0 = 0 , so it can't be the case that x ⋅ 0 = 1 . This situation is familiar from working with the rational and
real numbers. But there can be other elements without a multiplicative inverse.
For example, consider Z . The elements 1 and 5 have multiplicative inverses: 1 ⋅ 1 = 1 and 5 ⋅ 5 = 1 . But none of the other
6
elements have a multiplicative inverse! For example, if we multiply 2 times each element of Z , we get the list [0, 2, 0, 2, 0, 2]. Since
6
1 isn't in the list, 2 has no multiplicative inverse. Something interesting is happening in that list of multiples of 2 , though: there are
many zeroes!
Definition 8.0.3
Let x ∈ R . Then x is a zero-divisor if there exists y such that x ⋅ y = 0 .
Exercise 8.0.4
Describe all of the zero-divisors in the ring Z × Z .
As a result, the presence of zero-divisors means that there are non-invertible elements in the ring, and thus throws our division project
into jeopardy. Furthermore, zero divisors also contribute to non-uniqueness of division: if ry = 0 and x = zy , then we also have
x = (z + r)y , so that both z and z + r can be considered as solutions to .
x
To give a concrete example of this phenomenon, consider again Z6 . What is the quotient 4
2
? Obviously, 2 is an answer, since
2⋅2 = 4 . But we also have 2⋅5 = 4 , so we could also write 4
2
= 5 just as easily. Notice that (2 + 3) ⋅ 2 = 4 + 0 = 4 ; this is
exactly the case described above.
Interestingly, for elements which are neither invertible nor zero-divisors, we still have a cancelation law:
Corollary 8.0.7
Proof 8.0.8
r
is unique if it exists.
Then we see that the presence of zero-divisors is a major impediment to doing division in rings. Rings without zero-divisors will then
be nice to work with!
Every field is an integral domain, since every non-zero element of a field is invertible. The primary example of an integral domain that
is not a field is the integers: There are no non-zero integers where xy = 0 , but most integers don't have multiplicative inverses, so Z is
not a field.
Then we seem to have an answer to the problem of division for commutative rings:
1. The best-case scenario is when every element has an inverse. Such rings are called division rings, or (if the ring is also
commutative) fields.
2. The next-best case is when there are no zero divisors. These are the integral domains.
In the next two sections, we'll look at two different ways to 'solve' a division problem in an integral domain. The first way is to
introduce fractions, which allow us to find inverses for any element of the ring. The second - available only in some rings - allows us to
do division with a remainder.
Exercise 8.0.11
Show that M n,n (Q) , the ring of n × n matrices, is not an integral domain.
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
2
. But when we include , we also have to include all possible sums and products in order to ensure that we still have a ring; the
1
operations of addition and multiplication need to be closed, of course. So in addition to , we also need to include every number
1
2
, 2
n
m
with n ∈ Z and m ∈ N in order to ensure that the set is closed under multiplication and addition. Call this set R. Then R is a
commutative ring with unity. It's also an integral domain, but still not a field, since, for example, 3 has no multiplicative inverse.
In that case, we can go ahead and include the multiplicative inverse of every positive integer, along with all possible sums and products
of those inverses. The resulting ring, of course, is the rational numbers, Q .
We would like to extend this construction to an arbitrary integral domain: Starting from an integral domain D, we introduce inverses
and the appropriate sums and products until every element has an inverse. In fact, this involves copying the whole notion of fractions.
First we construct a ring D . For an integral domain D, D is the set D × (D ∖ {0}) . Each pair (a, b) in D can be thought of as
′ ′ ′
fractions ; note that we disallow b = 0 . The operation + defined by (a, b) + (x, y) = (ay + bx, by) and multiplication is defined
a
by (a, b) ⋅ (x, y) = (ax, by) . Then D is a commutative ring; we leave it as an exercise to show this is true.
′
Exercise 8.1.0
Let D be an integral domain. Show that D
′
is an integral domain. (In particular, check all of the ring axioms, and then show that
there are no zero-divisors in D .) ′
With rational numbers, it is important to notice that many different fractions are the same: currently our ring D is much too large! For
′
example, we haven't introduced any mechanism for cancellation of numerator and denominator: (a, b) ⋅ (b, a) = (ab, ab) ≠ 1 in D . ′
We'll construct the actual ring of fractions as a quotient of D . To construct a quotient, we only need to identify a suitable ideal I ; the
′
quotient will then just be D /I . The ideal should contain everything that is 'equivalent to 0' in the ring of fractions. Thinking by
′
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
f = (2 x
3
− 2x + 3)g − x . Here 2 x 3
− 2x + 3 is the whole part and −x is the remainder.
In both the integer division and the polynomial division, the key ingredient is a way of ordering the elements of the ring: in the
integers, we order by the usual ordering of the integers, and with polynomials we order by the degree of the polynomial.
Any given ring can have many different norms. The norm on the integers is simply the absolute value of the integer; it is a positive
norm. The norm on polynomials is the degree of the polynomial.
with n(r) < n(b) . The element q is called the quotient and r is the remainder.
A Euclidean domain then has the same kind of partial solution to the question of division as we have in the integers.
In fact, Euclidean domains further have a Euclidean algorithm for finding a common divisor of two elements. The Euclidean algorithm
is performed by starting with two elements f and g for which we wish to find a common divisor. Dividing f by g gives a quotient q 0
and a remainder r . We then divide g by r and obtain a new quotinet q and a new remainder, r . We then repeat this process to get
0 0 1 1
quotients q , q , … q and remainders r , r , … r . Each remainder has smaller norm than the previous, so this process must
2 3 k 2 3 k
The final quotient, qk divides both g and f : You can see this by writing f = q g + r , and then expanding r :
0 0 0
f = q0 (q1 r0 + r1 ) + r0 . If we imagine the process ending at this point, so that r = 0 , we then have r divides both f and g. On
1 0
the other hand, if the process doesn't terminate, we can expand r = q r + r . Then f = q (q (q r + r ) + r ) + q r + r
0 2 1 2 0 1 2 1 . If2 1 2 1 2
the process terminates, then r = 0 , and r divides every term, and thus divides f and g. If the process doesn't terminate, we repeat
2 1
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Topic hierarchy
TOC.1 11/13/2019
9.0: A RETURN TO LINEAR ALGEBRA
We've now seen numerous examples of algebraic structures, which we can think of as sets with some operations which satisfy some
axioms. Here's a partial list:
1. Groups,
2. Commutative groups,
3. Group actions,
4. Rings,
5. Commutative rings,
6. Integral domains,
7. Fields,
8. and others...
In this chapter, we'll examine vector spaces as algebraic structures. Vector spaces are massively important because these are algebraic
structures where the tools of linear algebra are available. Linear algebra is, in some ways, the branch of mathematics which is best
developed: when a problem in science is converted into a linear algebra problem, we have a pretty good chance of being able to solve
it. This is why, for example, the technique of linearization which comes up in differential equations and modeling is so important.
In fact, viewing vector spaces as algebraic structures does two things for us.
1. This viewpoint helps us identify more situations as linear algebra situations, allowing us to use our linear algebra tools in a broader
set of circumstances, and
2. Abstracting allows us to better identify precisely what tools we are using when we prove statements in linear algebra, so we can
identify exactly which situations those tools are applicable in. As with rings, there are more than one kind of vector space, and
some vector spaces are more 'friendly' than others.
So let's see the definition.
(As an aside: There's a another way to think of vector spaces as well. For any ring R, there is a concept of an R-module which is
similar to a group action: a module is a set with a ring action. That is to say, a ring pushing around objects in the set in a way that is
compatible with both of the ring operations. From this viewpoint, a vector space is just a k-module, where the underlying set is a
commutative group itself. As a result, R-modules is a generalization of vector spaces.)
As is traditional, we list some examples. Note that the vector space is a set and a field: usually, the choice of field is derived from
context, but we'll be specific if the context is non-obvious. Often, we say that 'V is a vector space over k' to mean that V is the
commutative group and k is the field.
1. k is the vector space whose underlying set is lists of n elements of k, with coordinate-wise addition and k acting by scalar
n
multiplication. This gives rise to the familiar spaces R and C . But we also know about finite fields now: Z where p is prime is
n n n
p
Exercise 9.0.1
1. For each of the above examples of vector spaces, write some example elements and give examples of addition and scalar
multiplication in that vector space.
2. Prove that each of these examples is a vector space.
Exercise 9.0.2
Show that the set of continuous functions from R → R is a vector space over R . (Be sure to explicitly identify what the operations
of addition and scalar multiplication are.)
What extra condition would we need for a vector space V over k in order for the notion of continuous functions V → k to make
sense?
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Definition 9.1.0
Let S be a set with elements s . A linear combination of elements { s , s , … , s } is given by any finite sum
i 1 2 n ∑
s∈S
cs s with
coefficients c ∈ k . (If S is an infinite set, then all but finitely many c must be equal to 0.)
s s
Definition 9.1.1
Let S be a set of vectors in a vector space V . Then we say that S is linearly dependent if there exists a linear combination of
elements of S equal to 0.
Example 9.1.1
Let R be the vector space of sequences of elements of R . (ie, the space of sequences r = (r , r , r , …) , with coordinate-wise
∞
1 2 3
addition and the usual scalar multiplication.) Let r ∈ R be the sequence with (e ) = 1 and (e ) = 0 for all j ≠ i . Let n be
i
∞
i i i j
the element (−1, −1, −1, …) . Now, let S be the set of all the e and n. This is actually a linearly independent set. You might
i
note that the sum of all of the elements in S (with all coefficients in the sum equal to 1) seems to be the 0-vector. But this is an
infinite sum, and is thus not considered a linear combination of elements of S .
CONTRIBUTORS
Tom Denton (Fields Institute/York University in Toronto)
Ring homomorphisms are significant because they define mappings between rings that preserve addition and multiplication operations, maintaining the structural integrity of the ring. These mappings allow for transferring properties and operations from one ring to another, often revealing insights into the nature of the rings and assisting in classifying and comparing different rings. They also play a fundamental role in constructing theories around kernel and image, quotient rings, and the application of isomorphism theorems .
Ideals in ring theory function similarly to normal subgroups in group theory as they provide a basis for constructing quotient structures. In rings, an ideal is a subset closed under addition and multiplication by ring elements, allowing the formation of quotient rings analogous to how normal subgroups allow construction of quotient groups. Just as normal subgroups help in analyzing group properties and relations, ideals are crucial for understanding ring structure, facilitating operations like homomorphisms and transformations in a ring .
Generators are crucial for understanding group structures as they provide a minimal set of elements from which the entire group can be derived using group operations. Unlike individual elements, which provide no such guarantees, generators can be combined to form every element of the group. They effectively distill the group's structure into its simplest form and allow for a more efficient representation of the group's operations. A smaller set of generators might require more complex compositions to achieve the same group elements, illustrating a trade-off between simplicity and operational efficiency .
No, counting the number of symmetries alone is insufficient to deduce the properties of a group of symmetries. For instance, an equilateral triangle and a regular hexagon with bumps both have six symmetries. However, the symmetries of the triangle include reflections, while the hexagon can only rotate. This demonstrates the importance of considering the nature of the symmetries, not just their count, when characterizing the group properties .
A subgroup is distinguished from the broader group by being a subset that itself forms a group with the same operations. The associative property inherent to the group operation is naturally inherited by the subgroup, so no additional validation is necessary for associativity. Subgroups have their identity elements as the same identity element of the larger group and maintain closure and inverses within themselves, leading to the creation of smaller group structures with the same foundational properties as the parent group .
Symmetry in groups when applied to geometric objects like polygons includes considering different kinds of symmetries such as rotational and reflectional symmetries. For instance, a pentagon possesses rotational symmetries, which form a group of rotations. The concept can be extended to consider subgroups, such as identifying rotation functions as a group within the larger group of a pentagon's symmetries. This is because the set of rotations forms a closed operation under the group conditions, making it a subgroup .
Both the DRY (Don't Repeat Yourself) methodology in computer science and mathematical generalization aim to minimize redundancy and repetition by focusing on overarching principles and structures. In computer science, this involves reducing duplicative code to ensure easier maintenance and greater efficiency. Similarly, mathematical generalization seeks to unify diverse theorems and proofs into broader concepts, such as the use of kernels and quotient constructs shared across groups, rings, and modules, promoting deeper understanding and simplifying complex problems by emphasizing underlying commonalities .
Lagrange's theorem states that the order (or size) of a subgroup is a divisor of the order of its parent group. This theorem highlights the numerical relationship between a group and its subgroups, asserting that the group can be partitioned into disjoint subsets known as cosets, each of which has the same number of elements as the subgroup. As a result, the group's total element count is evenly divisible by the subgroup's size, aiding in understanding the structure and properties of both groups and their subgroups .
Cayley graphs enhance the visualization and understanding of group structures by providing a graphical representation of the group elements and their relations through generators. Each vertex represents a group element, and directed edges represent the application of group operations. This allows one to easily see how different elements of the group are connected, visualize subgroup structures, and understand complex relationships among group elements, such as generators and their resulting powers or products .
The mathematical principle of abstraction in group theory is demonstrated by generalizing the specific concept of symmetry into an abstract structure called a group. Just as the number three abstracts the idea of 'three-ness' across different sets, a group abstracts the symmetrical properties of different objects. This abstraction allows mathematicians to define and explore symmetries mathematically without being confined to the specific details of any one object's symmetries, creating a universal framework for analyzing symmetrical behaviors .