PMath 453: Functional Analysis Notes
PMath 453: Functional Analysis Notes
Laurent W. Marcoux
Thanks to Adina Goldberg, Hayley Reid, Wanchun Shen, Erlang Surya and
Zhenyuan Zhang for catching those typos and mistakes that I thought I had hidden
really well, and for suggesting alternate proofs to some results presented in the third
edition!
I’ve begun to enlarge the “Appendix” sections for each Chapter, and to add
exercises at the end of each Chapter. The “Appendix” sections are meant to be
“cultural”. Students are not required to read these to understand the remainder of
the text, and material appearing in the Appendix sections will not appear on the
exam. The exercises are meant to be much easier than the assignment exercises,
and are there to help solidify some of the definitions and concepts appearing in the
notes.
The biggest change, mathematically speaking, in the notes is that I have now
provided a proof of the fact that the only compact, quasinilpotent, normal operator
acting on an infinite-dimensional, separable, complex Hilbert space is the zero op-
erator. (Yes, I am aware of the fact that this holds even if N is not compact.) The
usual proof of this result uses Beurling’s Spectral Radius formula, and it took me a
few hours to come up with a proof that avoided that result in the setting I needed
to consider. I have nothing against Beurling nor his Spectral Radius Formula in
general (in fact, you might say that I appreciate it even more now), except that I
didn’t have the time to cover it, and therefore I had to find a “work-around” that
did not require any functional calculus.
This set of notes is now undergoing its third iteration. The mathematical content
outside of the appendices is mostly stabilized, and now begins the long and lonely
hunt for typos, poor grammar, and awkward sentence constructions.
Please feel free to contact me if you find any mistakes – mathematical or other-
wise – in these notes.
i
ii
This set of notes has now undergone its second incarnation. I have corrected as
many typos as I have found so far, and in future instalments I will continue to add
comments and to modify the appendices where appropriate. The course number for
the Functional Analysis course at Waterloo has now changed to PMath 753, in case
anyone is checking.
The comment in the preface to the “first edition” regarding caution and buzz
saws is still à propos. Nevertheless, I maintain that this set of notes is worth at least
twice the price1 that I’m charging for them.
For the sake of reference: excluding the material in the appendices, and allowing
for the students to study the last section on topology themselves, one should be able
to cover the material in these notes in one term, which at Waterloo consists of 36
fifty-minute lectures.
My thanks to Xiao Jiang and Ian Hincks for catching a number of typos that I
missed in the second revision.
The following is a set of class notes for the PMath 453/653 course I taught at the
University of Waterloo in 2008. As mentioned on the front page, they are a work in
progress, and - this being the “first edition” - they are replete with typos. A student
should approach these notes with the same caution he or she would approach buzz
saws; they can be very useful, but you should be thinking the whole time you have
them in your hands. Enjoy.
I would like to thank Paul Skoufranis for having pointed out to me an embar-
rassing number of typos. I am glad to report that he still has both hands and all of
his fingers.
1If you were charged a single penny for the electronic version of these notes, you were robbed.
You can get them for free from my website.
iii
From the moment I picked your book up until I laid it down I was
convulsed with laughter. Someday I intend reading it.
Groucho Marx
Reading this book is like waiting for the first shoe to drop.
Ralph Novak
Thank you for sending me a copy of your book. I’ll waste no time
reading it.
Moses Hadas
Sometimes you just have to stop writing. Even before you begin.
Stanislaw J. Lec
Contents
i
1. Normed Linear Spaces 1
2. An introduction to operators 21
3. Hilbert space 38
4. Topological Vector Spaces 54
5. Seminorms and locally convex spaces 69
6. The Hahn-Banach theorem 89
7. Weak topologies and dual spaces 110
8. Extremal points 126
9. The chapter of named theorems 134
10. Operator Theory 141
11. Appendix – topological background 166
Bibliography 173
Index 175
v
1. NORMED LINEAR SPACES 1
I don’t like country music, but I don’t mean to denigrate those who do.
And for the people who like country music, denigrate means ‘put down’.
Bob Newhart
1.1. It is expected that the student of this course will have already seen the
notions of a normed linear space and of a Banach space. We shall review the
definitions of these spaces, as well as some of their fundamental properties. In both
cases, the underlying structure is that of a vector space. For our purposes, these
vector spaces will be over the field K, where K = R or K = C.
subspace of cK K
0 (N), because it is not closed. In fact, it is dense in c0 (N).
For (xn )∞
n=1 ∈ `pK (N), we set
∞
X 1/p
k(xn )n kp = |xn |p .
n=1
In most contexts, the underlying field K is understood, and we shall write only
`p (N), or even `p , 1 ≤ p ≤ ∞.
The last two examples have one especially nice property not shared by cK
00 (N)
and PK ([0, 1]), namely: they are complete.
1.11. Example. Let D ⊆ C denote the open unit disc D = {z ∈ C : |z| < 1},
and T denote the unit circle T = {z ∈ C : |z| = 1}. Consider the disc algebra
A(D) := {f ∈ C(D) : f |D is holomorphic}.
The function
kpk∞ := sup |p(z)|
z∈D
is easily seen to define a norm on A(D).
It follows from elementary Complex Analysis that the space C[z] of polynomials
(with domain restricted to D) is dense in A(D), and that A(D) is complete with
respect to this norm. That is, A(D) is a Banach space.
4 L.W. Marcoux Functional Analysis
Furthermore, from the Maximum Modulus Principle, we see that the map
Γ : A(D) → C(T)
f 7→ f |T
is isometric, and so we can (and do) identify A(D) with the algebra
1.12. Definition. Let H be an inner product space over K; that is, there exists
a map
h·, ·i → K
which, for all x, x1 , x2 , y ∈ H and λ ∈ K, satisfies:
(i) hx1 + x2 , yi = hx1 , yi + hx2 , yi;
(ii) hx, yi = hy, xi;
(iii) hλx, yi = λhx, yi;
(iv) hx, xi ≥ 0, with equality holding if and only if x = 0.
(Of course, when K = R, the complex conjugation in (ii) is superfluous.) Recall
that the canonical norm on H induced by the inner product is given by
1.13. Example. Recall that `2K (N) is a Hilbert space with inner product
∞
X
h(xn )n , (yn )n i = x n yn .
n=1
1.14. It is easy to see that if X is a normed linear space, then the vector space
operations
σ : X×X → X µ: K×X → X
and
(x, y) 7→ x + y (λ, x) 7→ λx
of addition and scalar multiplication are continuous (from the respective product
topologies on X × X and on K × X to the norm topology on X). The proof is left
as an exercise for the reader. In particular, therefore, if 0 6= λ ∈ K, y ∈ X, then
σy : X → X defined by σy (x) = x + y and µλ : X → X defined by µλ (x) = λx are
homeomorphisms.
As a simple corollary to this fact, a set G ⊆ X is open (resp. closed) if and only
if G + y is open (resp. closed) for all y ∈ X, and λG is open (resp. closed) for all
0 6= λ ∈ K. We shall return to this in a later section.
1.15. New Banach spaces from old. We now exhibit a few constructions
which allow us to produce new Banach spaces from simpler building blocks.Q
Let (Xn , k · kn )∞
n=1 denote a countable family of Banach spaces. Let X = n Xn .
(a) For each 1 ≤ p < ∞, define
∞ ∞
X X 1/p
⊕p Xn = {(xn )n ∈ X : k(xn )n kp = kxn kpn < ∞}.
n=1 n=1
Then ∞ p
P
n=1 ⊕p Xn is a Banach space, referred to as the ` -direct sum of
the (Xn )n .
(b) With p = ∞,
∞
X
⊕∞ Xn = {(xn )n ∈ X : k(xn )n k∞ = sup kxn kn < ∞}.
n=1 n≥1
P∞
Again, n=1 ⊕∞ Xn is a Banach space - namely the `∞ -direct sum of the
(Xn )n .
(c) We also define
c0 (X) = {(xn )n ∈ X : xn ∈ Xn , n ≥ 1 and lim kxn kn = 0}.
n→∞
The norm on c0 (X) is k(xn )n k∞ = supn≥1 kxn kn , andPequipped with this
norm, c0 (X) is easily seen to be a closed subspace of ∞
n=1 ⊕∞ Xn .
1.16. Definition. Let X be a vector space equipped with two norms k · k and
||| · |||. We say that these norms are equivalent if there exist constants κ1 , κ2 > 0
so that
κ1 kxk ≤ |||x||| ≤ κ2 kxk for all x ∈ X.
1.18. Example. Let X = C([0, 1], C), and consider the norms
kf k∞ = sup{|f (x)| : x ∈ [0, 1]}
and Z 1
kf k1 = |f (x)|dx
0
on X. If, for each n ≥ 1, we set fn to be the function fn (x) = xn , then kfn k∞ = 1,
R1 1
while kfn k1 = 0 xn dx = n+1 . Clearly k · k1 and k · k∞ are inequivalent norms on
X.
1.19. Proposition. Two norms k·k and |||·||| on a vector space X are equivalent
if and only if they generate the same metric topologies.
Proof. Suppose first that k · k and ||| · ||| are equivalent, say κ1 kxk ≤ |||x||| ≤ κ2 kxk
for all x ∈ X, where κ1 , κ2 > 0 are constants. If x ∈ X and (xn )n is a sequence in
X, then it immediately follows that
lim kxn − xk = 0 if and only if lim |||xn − x||| = 0.
n→∞ n→∞
That is, the two notions of convergence coincide, and thus the topologies are equal.
Conversely, suppose that the metric topologies τk·k and τ|||·||| , induced by k · k
and ||| · ||| respectively, coincide. Then G = {x ∈ X : kxk < 1} is an open nbhd of 0
in (X, ||| · |||), and so there exists δ > 0 so that H = {x ∈ X : |||x||| < δ} ⊆ G. That
is, |||x||| < δ implies kxk < 1. In particular, therefore, |||x||| ≤ δ/2 implies kxk ≤ 1,
so that that ||y|| ≤ (2/δ)|||y||| for all y ∈ X. By symmetry, there exists a constant
κ2 > 0 so that |||y||| ≤ κ2 kyk for all y ∈ X.
Thus k · k and ||| · ||| are equivalent norms.
2
The following result provides a very practical tool when trying to decide whether
or not a given normed linear space is complete. We remark that the second half of
the proof uses the standard fact that if (yn )n is a Cauchy sequence in a metric space
(Y, d), and if (yn )n admits a convergent subsequence with limit y0 , then the original
sequence (yn )n converges to y0 as well.
By the continuity of q,
X
q(x0 ) = q( (xn + mn ))
n
X
= q(xn + mn )
n
X
= q(xn ).
n
Appendix to Section 1.
1.29. This course assumes that the reader has taken at least enough Real Anal-
ysis to have seen that (`pK (N), k · kp ) is a normed linear space for each 1 ≤ p ≤ ∞.
Having said that, let us review Hölder’s Inequality as well as Minkowski’s Inequality
in this setting, since Hölder’s Inequality is also useful in studying dual spaces in the
next Section. The reader will recall that Minkowski’s Inequality is the statement
that the p-norm is subadditive; that is, that the p-norm satisfies condition (iii) of
Definition 1.2. We remark that both inequalities hold for more general Lp -spaces.
Our decision to concentrate on `p -spaces instead of their more general counterparts
is an attempt to accommodate the background of the students who took this course,
as opposed to a conscious effort to avoid Lp -spaces.
1.30. Lemma. Let a and b be positive real numbers and suppose that
1
1 < p, q < ∞ satisfy p + 1q = 1. Then
a b
1 1
ap bq ≤
+ .
p q
Proof. Let 0 < t < 1 and consider the function
f (x) = xt − tx + t − 1,
defined on (0, ∞). Then
f 0 (x) = txt−1 − t = t(xt−1 − 1).
Thus f (1) = 0 = f 0 (1). Since f 0 (x) > 0 for x ∈ (0, 1) and f 0 (x) < 0 for x ∈ (1, ∞),
it follows that
f (x) < f (1) = 0 for all x 6= 1.
t
That is, x ≤ (1 − t) + tx for all x > 0, with equality holding if and only if x = 1.
Letting x = a/b, t = 1/p yields
1 1 1 −1
−1
ap bq
= ap b p
a 1
= ( )p
b
1 1 a
≤ (1 − ) + ( )
p p b
1 a 1
= ( )+ .
p b q
Multiplying both sides of the equation by b yields the desired inequality.
2
12 L.W. Marcoux Functional Analysis
Proof. The cases where p = 1 or p = ∞ are routine and are left to the reader.
First let us suppose that kxkp = kykq = 1. Applying the previous Lemma to
our sequences x and y yields, for each n ≥ 1,
1 1
|xn yn | = (|xn |p ) p (|yn |q ) q
1 1
≤ |xn |p + |yn |q ,
p q
so that
X X
|zn | = |xn yn |
n n
1X 1X
≤ |xn |p + |yn |q
p n q n
1 1
= kxkpp + kykqq
p q
= 1.
In general, if x ∈ `p and y ∈ `q , let u = x/(kxkp ), v = v/(kykq ) so that
kukp = 1 = kvkq and so
1 X X
|xn yn | = |un vn |
kxkp kykq n n
≤ 1.
Thus
kzk1 ≤ kxkp kykq .
2
Proof. Again, the cases where p = 1 and where p = ∞ are left to the reader.
Suppose therefore that 1 < p < ∞. Observe that if a, b > 0, then
a+b p
≤ ap + bp ,
2
1. NORMED LINEAR SPACES 13
and similarly
X
|xn + yn |p−1 |yn | ≤ kykp k(|xn + yn |p−1 )n kq .
n
Now
!1
X q
= |xn + yn |(pq−q)
n
!1
X q
p
= |xn + yn |
n
= k(xn + yn )n kp/q
p .
Hence
X
kx + ykpp = |xn + yn | |xn + yn |p−1
n
X
≤ (|xn | + |yn |) |xn + yn |p−1
n
≤ (kxkp + kykp ) k(|xn + yn |p−1 )n kq
= (kxkp + kykp ) k(xn + yn )n kp/q ,
from which we get
kx + ykp = kx + ykp−p/q
p ≤ kxkp + kykp .
2
Let us now examine a couple of examples of useful Banach spaces whose defi-
nitions require a somewhat better background in Analysis than we are assuming in
the main body of the text.
14 L.W. Marcoux Functional Analysis
1.34. Example. The geometric theory of real Banach spaces is an active and
exciting area. For a period of time, the following question was open [Lin71]: does
every infinite-dimensional Banach space contain a subspace which is linearly home-
omorphic to one of the spaces `p , 1 ≤ p < ∞ or c0 ? In 1974, B.S. Tsirel’son [Tsi74]
provided a counterexample to this conjecture. In this example, we shall discuss
the broad outline of the construction of the Tsirel’son space, omitting the proofs of
certain technical details.
(b) {en }∞ P⊆ B.
n=1
(c) If x = ∞ n=1 xn en ∈ B, y = (yn )n ∈ c0 and |yn | ≤ |xn | for all n ≥ 1, then
y ∈ B. (This is a hereditary property.)
(d) If v1 < v2 < · · · < vr lie in B, then 21 Pr ((v1 , v2 , ..., vr )) ∈ B.
(e) For every x ∈ B there exists n ∈ N for which 2Pn (x) ∈ B.
Our first goal is to construct a set K which has all five of these properties.
Let L1 = {rej : −1 ≤ r ≤ 1, j ≥ 1} and for n ≥ 1, set
1
Ln+1 = Ln ∪ { Pr ((v1 , v2 , ..., vr )) : r ≥ 1, v1 < v2 < · · · < vr ∈ Ln }.
2
Let K denote the pointwise closure of ∪n≥1 Ln . It can be shown that K ⊆ c0 . We
set D = co(K) denote the closed convex hull of K (with the closure taking place in
c0 ).
The Tsirel’son space T is then defined as span D. The norm on T is given by
the Minkowski functional which we shall encounter later when studying locally
convex spaces. It is given by kxkT = inf{r ∈ (0, ∞) : x ∈ rD}, where rD = {ry :
y ∈ D}. As we shall later see, the definition of this norm ensures that D is precisely
the unit ball of T .
Although we shall not prove it here, (T, k · kT ) is a Banach space which does not
contain any copy of c0 or `p , 1 ≤ p < ∞.
1.35. Example. Another Banach space of interest to those who study the
geometry of said spaces is James’ space.
For a sequence (xn )n of real numbers, consider the following condition, which
we shall call condition J: for all k ≥ 1,
(xn1 − xn2 )2 + (xn2 − xn3 )2 + · · · + (xnk−1 − xnk )2 < ∞.
sup
n1 <n2 <···<nk
It can be shown that J is a Banach space when equipped with this norm.
1.36. Example. Let X be a locally compact topological space and let B denote
the σ–algebra of Borel subsets of X. Let µ be a positive measure on X, so that
µ : B → R ∪ {∞}
satisfies
(a) µ(∅) = 0;
(b) µ(B) ≥ 0 for all B ∈ B;
16 L.W. Marcoux Functional Analysis
Proof.
Let (q(xn ))∞
n=1 be a Cauchy sequence in X/M. For each n ≥ 1, there exists
kn > 1 so that i, j ≥ kn implies kq(xi ) − q(xj )k < 2−n . Without loss of generality,
we may assume that kn > kn−1 for all n ≥ 2.
Set zn := xkn , n ≥ 1 and let m1 = 0. For n > 1, choose mn ∈ M so that
k(zn−1 + mn−1 ) − (zn + mn )k < 2−(n−1) .
That this is possible follows from the definition of the quotient norm along with the
inequality of the second paragraph. If we now define yn := zn + mn , n ≥ 1, then
q(yn ) = q(zn ) = q(xkn ), and for n2 > n1 ,
2 −n1
nX
kyn1 − yn2 k ≤ kyn1 +j − yn1 +j−1 k
j=1
2 −n1
nX
1
≤ ( )(n1 +j−1)
2
j=1
∞
X 1
≤ ( )(n1 +j)
2
j=0
1
= ( )n1 −1 ,
2
from which it follows that (yn )∞
n=1 is Cauchy in X. Since X is complete, y :=
limn→∞ yn ∈ X, and since the quotient norm is contractive, q(y) = limn→∞ q(yn ) =
limn→∞ q(xkn ). Since (q(xn ))∞
n=1 is Cauchy, q(y) = limn→∞ q(xn ), which proves
that every Cauchy sequence in X/M converges - i.e. that X/M is complete.
2
Theorem. Suppose that X is a normed linear space and that M is a closed subspace
of X. Suppose furthermore that X/M and that M are both complete. Then X is also
complete.
P P
Proof. Suppose that n xn P is an absolutely summable series in X, so that n kxn k <
∞. It suffices to prove that n xn exists in X.
P PN ∞
Note that the fact that n kxn k is finite implies that the sequence
P∞ ( n=1 xn )N =1
is Cauchy. Indeed, let ε > 0 and choose M > 0 such that n=M kxn k < ε. If
t ≥ s ≥ M , then
t
X s
X t
X t
X ∞
X
k xn − xn k = k xn k ≤ kxn k ≤ kxn k < ε.
n=1 n=1 n=s+1 n=s+1 n=M
This will be useful later on.
18 L.W. Marcoux Functional Analysis
Let q : X → X/M denote the canonical quotient map. Then q is linear and
contractive, so
X X
kq(xn )k ≤ kxn k < ∞.
n n
N1
X
ky − xn − m1 k < 2δN1 = 1/2.
n=1
N2
X
k y− xn − m1 − m2 k < 2δN2 = 1/4.
n=1
Nk
X k−1
X
mn − mk k < 2δNk = 1/2k .
k y− xn −
n=1 n=1
In particular, limk→∞ k y − N
P k Pk
n=1 xn − n=1 mn k = 0.
Observe that for any k ≥ 1, it follows from the triangle inequality that
Nk k Nk−1 k−1
X X X X
kmk k ≤ k y − xn − mn k + k y − xn − mn k + kxNk k
n=1 n=1 n=1 n=1
1 1
< k + k−1 + kxNk k.
2 2
Thus
∞ ∞ ∞ ∞
X X 1 X 1 X
kmk k ≤ + + kxNk k < ∞.
2k 2k−1
k=1 k=1 k=1 k=1
1. NORMED LINEAR SPACES 19
P∞
Since M is assumed to be complete, it follows that m0 := k=1 mk ∈ M exists.
Also,
Nk
X k
X
−y + m0 = lim y − xn − mn + (−y + m0 )
k→∞
n=1 n=1
Nk
X k
X
= lim − xn + (m0 − mn )
k→∞
n=1 n=1
Nk
X
= lim − xn ,
k→∞
n=1
so that limk→∞ N
P k
n=1 xn = m0 − y ∈ X exists.
PN PNk
But then the Cauchy sequence ( n=1 xn )∞ ∞
N =1 has a subsequence ( n=1 xn )k=1
which converges to m0 − y, and so the original sequence must also converge to the
same limit. That is,
∞
X N
X Nk
X
xn = lim xn = lim xn = m0 − y ∈ X.
N →∞ k→∞
n=1 n=1 n=1
Since every absolutely summable series in X is summable, X is complete.
2
∗
Question 1.
Let ∅ 6= X be a compact, Hausdorff space. Prove that for each ∅ 6= Ω ⊆ X, the
function
νΩ : C(X, K) → R
f 7→ supx∈Ω |f (x)|
defines a seminorm on C(X, K), and that it is a norm if and only if Ω is dense in X.
Question 2.
Prove that (C([0, 1], K), k · k∞ ) is a Banach space.
Question 3.
Prove that the disc algebra A(D) defined in Example 1.11 is a Banach space.
2. AN INTRODUCTION TO OPERATORS 21
2. An introduction to operators
Some people are afraid of heights. Not me, I’m afraid of widths.
Steven Wright
2.1. The study of mathematics is the study of mathematical objects and the
relationships between them. These relationships are often measured by functions
from one object to another. Of course, when both objects belong to the same
category (be it the category of vector spaces, groups, rings, etc), it is to be expected
that the most important maps between these objects will be morphisms from that
category. In this Section we shall concern ourselves with bounded linear operators
between normed linear spaces. These bounded linear maps, as we shall soon discover,
coincide with those linear maps which are continuous in the norm topology. Since
normed linear spaces are vector spaces equipped with a norm topology, the bounded
linear operators are the natural morphisms between them.
It should be pointed out that Banach spaces can be quite complicated to analyze.
For this reason, many people working in this area often study the structure and
geometry of these spaces without necessarily emphasizing the study of the linear
maps between them. In the next Section we shall examine the notion of a Hilbert
space. These are amongst the best-behaved Banach spaces, and their structure
is relatively well understood. For this reason, fewer people study Hilbert spaces
alone; Hilbert space theory tends to focus on the theory of the bounded linear maps
between them, as well as algebras of such bounded linear operators.
We would also be remiss if we failed to point out that not everyone on the planet
restricts themselves to bounded (i.e. continuous) linear operators. Differentiation
has the grave misfortune of being an unbounded linear operator, but nevertheless
it is hard to avoid if one wishes to study the world around one - or around one’s
friends, acquaintances, enemies, and every other one. Indeed, in applied mathemat-
ics and physics, it is often the case that the unbounded linear operators are the more
interesting examples. Having said that, we shall leave it to the disciples of those
schools to wax poetic on these topics.
Our interest in bounded operators stems from the fact that they are precisely
the continuous operators from X to Y.
2.4. Computing the operator norm of a given operator T is not always a simple
task. For example, suppose that H = (C2 , k · k2 ) is a two-dimensional Hilbert
space with standard orthonormal basis {e1 = (1, 0), e2 = (0, 1)}.
Let T : H → H
1 2
be the map whose matrix with respect to this basis is , so that T (x, y) =
3 4
(x + 2y, 3x + 4y). By definition,
kT k = sup{kT zk : z ∈ C2 , kzk ≤ 1}
p p
= sup{ |x + 2y|2 + |3x + 4y|2 : x, y ∈ C, |x|2 + |y|2 ≤ 1},
which involves non-linear equations. For Hilbert spaces of low dimension – say,
less than dimension 5 – alternate methods exist (but won’t be developed just yet).
2. AN INTRODUCTION TO OPERATORS 23
Instead, we turn our attention to special classes of operator which are simple enough
to allow us to obtain interesting results. p √
So as to satisfy the curious reader, we mention that the norm of T is 15 + 221.
Mf : X → X
.
g 7→ f g
Mf : H → H
.
g 7→ f g
ZX
1
≤ kf k2∞ |g(x)|2 dµ 2
X
= kf k∞ kgk2 ,
n ≥ 1 and
Z
1
kMf gn k2 = |f (x)gn (x)|2 dµ 2
X
Z
1
= |f (x)|2 dµ 2
En
Z
1
≥ (kf k∞ − 1/n)2 dµ 2
En
Z
1
|gn (x)|2 dµ 2
= kf k∞ − 1/n
X
= kf k∞ − 1/n kgn k2 .
From this we see that kMf k ≥ kf k∞ − 1/n. Since n ≥ 1 was arbitrary,
kMf k ≥ kf k∞ , and so kMf k = kf k∞ .
Observe that the computation of the norm of the operator depended
very much upon the underlying norms of the spaces involved.
(c) As a special case of this phenomenon, let X = N and suppose that dµ is
counting measure. Then H = `2 (N) and f ∈ `∞ (N). As we are wont to
do when dealing with sequences, we denote by dn the value f (n) of f at
n ∈ N, so that f ≡ (dn )∞ ∞
n=1 ∈ ` (N). It follows that Mf (xn )n = (dn xn )n
2
for all (xn )n ∈ ` (N). By considering the matrix [Mf ] of Mf with respect
to the standard orthonormal basis (en )n for H, we see that
d1
d2
[Mf ] = .
d3
..
.
Thus, Mf , often denoted in this circumstance as D = diag{dn }n , is referred
to as a diagonal operator . The above calculation shows that
kDk = kMf k = kf k∞ = sup{|f (n)| : n ≥ 1} = sup{|dn | : n ≥ 1}.
Finally, consider H = `2 (Z), and with (un )n ∈ `∞ (Z), define the linear map
U : H → H via
U (xn )n = (un−1 xn−1 )n .
Again, U is bounded with kU k = sup{|un | : n ∈ Z}, and U is referred to as a
bilateral weighted shift. The reader should ask himself/herself why we do not
refer to “forward” and “backward” bilateral shift operators.
2.8. Notation. The set of bounded linear operators from the normed linear
space X to the normed linear space Y is denoted by B(X, Y). If X = Y, we abbreviate
this to B(X). We now fulfil an earlier promise by proving that the map T 7→ kT k
does indeed define a norm on B(X, Y).
2.9. Proposition. Let X and Y be normed linear spaces. Then B(X, Y) is a
vector space and the operator norm is a norm on B(X, Y).
Proof. Since linear combinations of continuous functions between topological spaces
are continuous, B(X, Y) is a vector space.
As to the second assertion: for R, T ∈ B(X, Y) and k ∈ K,
(i) kT k = sup{kT xk : kxk ≤ 1} ≥ 0;
(ii) kT k = 0 if and only if kT xk/kxk = 0 for all x 6= 0, which in turn happens
if and only if T x = 0 for all x ∈ X; i.e. if and only if T = 0.
(iii)
kkT k = sup{kkT xk : kxk ≤ 1}
= sup{|k| kT xk : kxk ≤ 1}
= |k| sup{kT xk : kxk ≤ 1}
= |k| kT k.
(iv)
kR + T k = sup{k(R + T )xk : kxk ≤ 1}
≤ sup{kRxk + kT xk : kxk ≤ 1}
≤ sup{kRxk + kT yk : kxk, kyk ≤ 1}
= kRk + kT k.
This completes the proof.
2
26 L.W. Marcoux Functional Analysis
2.10. Theorem. Let X and Y be normed linear spaces and suppose that Y is
complete. Then B(X, Y)
P is complete, and as such it is a Banach space.
Proof. Suppose that ∞ n=1 Tn is an absolutely summable series in B(X, Y). Given
x ∈ X,
X∞ X∞ X∞
kTn xk ≤ kTn k kxk = kxk kTn k < ∞,
n=1 n=1 n=1
compete, ∞
P
and thus, since Y is ∈ Y exists. Moreover,
n=1 Tn x P∞ the linearity of each
Tn implies that the map T : X → Y defined via T x = n=1 Tn x is linear, while
kxk ≤ 1 implies from above that kT xk ≤ ∞
P
n=1 kTn k. Hence
∞
X
kT k ≤ kTn k < ∞,
n=1
implying that T is bounded.
Finally,
N
X ∞
X
kT x − Tn xk = k Tn xk
n=1 n=N +1
X∞
≤ kxk kTn k,
n=N +1
P∞
from which it easily follows that T = limN →∞ N
P
n=1 Tn . That is, the series n=1 Tn
is summable. By Proposition 1.22, B(X, Y) is complete.
2
As a particular case of Theorem 2.10, consider the case where Y = K, the base
field.
2.11. Definition. Let X be a normed linear space. The dual of X is B(X, K),
and it is denoted by X∗ . The elements of X∗ are referred to as continuous linear
functionals or – when no confusion is possible – as functionals on X.
Since K is complete, Theorem 2.10 implies that X∗ is again a Banach space.
As such, we may consider the dual space of X∗ , namely X(2) = X∗∗ := (X∗ )∗ ,
known as the double dual of X, and more generally, the nth -iterated dual spaces
X(n) = (X(n−1) )∗ , n ≥ 3. All of these are Banach spaces.
Before proceeding to some examples, let us first introduce some notation and
terminology which will prove useful.
2.12. Definition. A collection {en }∞
n=1 in a Banach space X is said to be
a Schauder basis if every x ∈ X can be written in a unique way as a norm
convergent series
∞
X
x= xn en
n=1
for some choice xn ∈ K, n ≥ 1.
2. AN INTRODUCTION TO OPERATORS 27
2.13. Example.
(a) For each n ≥ 1, let en denote the sequence en = (0, 0, ..., 0, 1, 0, 0, ...) ∈ KN ,
where the unique “1” occurs in the nth position. Then {en }n is a Schauder
basis for c0 and for `p , 1 ≤ p < ∞. We shall refer to {en }n as the standard
Schauder basis for c0 and for `p .
Observe that it is not a Schauder basis for `∞ . Indeed, `∞ does not
admit any Schauder basis (why not?).
(b) It is not as obvious what one should choose as the Schauder basis for
(C[0, 1], R). It was Schauder [Sch27] who first discovered a basis for this
space. The description of such a basis is non-trivial.
equipped with the supremum norm k(xn )n k∞ = sup{|xn | : n ≥ 1}. We claim that
c∗0 is isometrically isomorphic to `1 = `1 (N). To see this, consider the map
Θ : (`1 , k · k1 ) → (c0 , k · k∞ )∗
,
z := (zn )n 7→ ϕz
where ϕz ((xn )n ) = ∞
P
n=1 xn zn for all (xn )n ∈ c0 . That Θ is linear is readily seen.
That the sum converges absolutely is also easy to verify.
If k(xn )n k∞ ≤ 1, then |xn | ≤ 1 for all n ≥ 1, so that
∞
X ∞
X X
|ϕz ((xn )n )| = | xn zn | ≤ |xn zn | ≤ |zn | = kzk1 .
n=1 n=1
Hence kϕz k ≤ kzk1 . On the other hand, if we set v[n] = (w1 , w2 , w3 , ..., wn , 0, 0, 0, ...)
for each n ≥ 1 (where wj = zj /|zj | if zj 6= 0, while wj = 1 if zj = 0), then v[n] ∈ c0 ,
kv[n]k∞ = 1 for all n ≥ 1, and
n
X
ϕz (v[n]) = |zj |.
j=1
From this it follows that kϕz k ≥ kzk1 . Combining these two estimates yields kϕz k =
kzk1 .
Thus Θ is an isometric injection of `1 into c0 . There remains to prove that Θ is
surjective.
To that end, suppose that ϕ ∈ c∗0 . Let {en }n denote the standard Schauder
basis for c0 , and for each n ≥ 1, let wn = ϕ(en ). Observe that if βn := wn /|wn | for
wn 6= 0, and βn := 0 if wn = 0, then
n
X
v[n] := βn en ∈ c0
k=1
28 L.W. Marcoux Functional Analysis
2.15. Example. Let 1 ≤ p < ∞. Recall from your Real Analysis courses that
there exists an isometric linear bijection Θ : (`q , k · kq ) → (`p , k · kp )∗ , where – as
always – q is the Lebesgue conjugate of p satisfying 1/p + 1/q = 1.
The map is defined via:
Θ : `q → (`p )∗
z 7→ ϕz ,
q
P
where for z = (zn )n ∈ ` , we have ϕz ((xn )n ) = n xn zn .
We normally abbreviate this result by saying that the dual of `p is `q , when
1 ≤ p < ∞. We refer the reader to the Appendix to Section 2 for a proof of this
result.
2.16. Example. The above example can be extended to more general measure
spaces. Let 1 ≤ p < ∞, and suppose that µ is a σ-finite, positive, regular Borel
measure on Lp (X, µ). Again, the map
Θ : Lq (X, µ) → Lp (X, µ)∗
g 7→ ϕg ,
where ϕg (f ) = X f gdµ defines a linear, isometric bijection between Lq (X, µ) and
R
Lp (X, µ)∗ .
If we drop the hypothesis that µ is σ-finite, the result still holds for 1 < p < ∞.
For reasons we shall discuss in the next Section, when p = 2, we often consider
the related map
Ω : L2 (X, µ) → L2 (X, µ)∗
g 7→ ϕg ,
R
where ϕg (f ) = X f gdµ defines a conjugate-linear, isometric bijection between
L2 (X, µ) and L2 (X, µ)∗ .
2. AN INTRODUCTION TO OPERATORS 29
2.18. Proposition. Let X be a normed linear space. Then there exists a con-
tractive linear map J : X → X∗∗ .
Proof. Let z ∈ X and define a map zb : X∗ → K via zb(x∗ ) = x∗ (z). It is routine to
check that zb is linear, and if kx∗ k ≤ 1, then |b
z (x∗ )| = |x∗ (z)| ≤ kx∗ k kzk, so that
kb ∗∗
z k ≤ kzk; in particular, zb ∈ X .
It is also easy to verify that the map
J : X → X∗∗
z 7→ zb
is linear, and the first paragraph shows that J is contractive.
2
Appendix to Section 2.
2.20. Although norms of operators can be difficult to compute, there are cases
where useful estimates can be obtained.
Consider the Volterra operator
Thus kV k ≤ 1. If 1(x) := 1 for all x ∈ [0, 1], then 1 ∈ C([0, 1], K), k1k∞ = 1,
and V 1 = j, where j(x) = x, x ∈ [0, 1]. But then kV 1k∞ = kjk∞ = 1, showing
that kV k ≥ 1, and hence kV k = 1.
Far more interesting (and useful) is the computation of kV n k for n ≥ 2.
Let us first general the construction of the operator V . We may consider the
function k : [0, 1] × [0, 1] → C defined by
0 if x < y,
k(x, y) =
1 if x ≥ y.
Then
Z x Z 1
(V f )(x) = f (y)dy = k(x, y)f (y)dy.
0 0
Now
(V 2 f )(x) = (V (V f ))(x)
Z 1
= k(x, t) (V f )(t)dt
0
Z 1 Z 1
= k(x, t) k(t, y) f (y)dy dt
0 0
Z 1 Z 1
= f (y) k(x, t) k(t, y)dt dy
0 0
Z 1
= f (y) k2 (x, y)dy,
0
R1
where k2 (x, y) = 0 k(x, t) k(t, y)dt is a new kernel for the integral operator V 2 .
Note that
Z 1
|k2 (x, y)| = | k(x, t) k(t, y)dt|
0
Z x
= | k(x, t) k(t, y)dt|
y
= (x − y) for x > y,
while for x < y, k2 (x, y) = 0.
In general, since x − y < 1 − 0 = 1, we get
Z 1
n
(V f )(x) = f (y) kn (x, y)dy, where
0
Z 1
kn (x, y) = k(x, t) kn−1 (t, y)dt, and where
0
1 1
|kn (x, y)| ≤ (x − y)n−1 ≤ .
(n − 1)! (n − 1)!
It follows that
kV n k = sup kV n f k∞
kf k=1
Z 1
= sup k f (y) kn (x, y)dyk∞
kf k=1 0
≤ sup kf k∞ kkn (x, y)k∞
kf k=1
≤ 1/(n − 1)!.
A simple consequence of these computations is that
1
lim kV n k n = lim 1/(n − 1)! = 0.
n→∞ n→∞
We shall have more to say about this in the Appendix to Section 3.
32 L.W. Marcoux Functional Analysis
2.21. Example. Let {e1 , e2 , ..., en } denote the standard basis for Kn , so that
ej = (0, 0, ..., 0, 1, 0, ..., 0), where the 1 occurs in the j th position, 1 ≤ j ≤ n. Suppose
that 1 ≤ p ≤ ∞, and that Kn carries the p-norm from Example 1.7.
Let [tij ] ∈ Mn (K), and define the map
T : Kn →
7 Kn
x → 7 [tij ]x.
It is instructive, while not difficult, to prove that if every row and every column
of [tij ] has at most one non-zero entry, then
kT k = max |tij |.
1≤i,j≤n
2.23. Example. We now return to the proof of the fact that the dual of `p is
`q when 1 < p < ∞, as stated in Example 2.15.
so that x ∈ `p , and
X
|βz (x)| = xn z n
n
X
= |zn |q
n
= kzkq/p
q kzkq
= kxkp kzkq ,
so that kβz k ≥ kzkq , and therefore kβz k = kzkq .
Consider the map Θ defined via:
Θ : `q → (`p )∗
z 7→ βz ,
with βz defined as above. Then Θ is easily seen to be linear, and from above, it is
isometric (hence injective). There remains only to show that Θ is surjective.
To that end, let ϕ ∈ (`p )∗ . For each n ≥ 1, let zn := ϕ(en ), where {en }n is the
standard Schauder basis for `p . Set z[n] := nk=1 zk ek , and x[n] := nk=1 αk zkq−1 ,
P P
where – as before – αk is chosen so that |αk | = 1 and xk zk ≥ 0 for all k ≥ 1. Then
z[n] ∈ `q and x[n] ∈ `p for all n ≥ 1.
34 L.W. Marcoux Functional Analysis
kz[n]kq−1
q = |zk |q
k=1
n
!1− 1
X q
q/p p
= (|zk | )
k=1
n
!1/p
X
= |xk |p
k=1
= kx[n]kp .
Thus
|ϕ(x[n])| = kx[n]kp kz[n]kq
≤ kx[n]kp kϕk for all n ≥ 1.
It follows that kz[n]kq ≤ kϕk for all n ≥ 1, so that if z := (zn )n , then z ∈ `q with
kzkq ≤ kϕk.
Finally, ϕ(y) = βz (y) for all y ∈ `p , so that ϕ = βz = Θ(z), proving that Θ is
surjective, as required.
2.24. Example. Let X be a compact, Hausdorff space and consider the Banach
space C(X, C), equipped with the norm kf k∞ := supx∈X |f (x)|.
The Riesz Representation Theorem states that the dual C(X, C)∗ of C(X, C)
can be identified with the set M(X) of all finite, regular Borel measures on X. Given
a measure µ ∈ M(X), we associate to it the linear functional ϕµ defined by
Z
ϕµ (f ) = f (x)dµ(x).
X
It is worth noting that the functional ϕµ sends positive functions to positive
functions precisely if µ is a positive measure. Such functionals play a significant role
in the theory of C ∗ -algebras, of which (C(X, C), k · k∞ ) is an example.
2. AN INTRODUCTION TO OPERATORS 35
I handed in a script last year and the studio didn’t change one word.
The word they didn’t change was on page 87.
Steve Martin
36 L.W. Marcoux Functional Analysis
Question 1.
Why does (`∞ , k · k∞ ) not admit a Schauder basis?
Question 2.
Let H = `2 (N), and let (wn )n ∈ `∞ (N). Denote by W the unilateral forward
shift determined by
W (xn )n = (0, w1 x1 , w2 x2 , w3 x3 , . . .), x = (xn )n ∈ H.
Prove that kW k = supn |wn |.
Question 3.
Let H = `2 (N) and let {en }∞
n=1 denote the standard onb for H. Given T ∈ B(H),
denote by [T ] = [ti,j ] the matrix for T relative to {en }∞
n=1 , where ti,j = hT ej , ei i
for all i, j ≥ 1.
(a) Prove that if T ∈ B(H), then supi,j≥1 |ti,j | < ∞.
(b) Prove the converse to (a), or find a counterexample to show that it is false.
(c) Let T ∈ B(H). Extend the result of Question 2 and of Example 2.5 (c) by
proving that if each row and each column of [T ] contains only one non-zero
entry, then kT k = supi,j≥1 |ti,j |.
(d) Prove that in general, given T ∈ B(H), kT k ≥ supi,j≥1 |ti,j |.
(e) Let N ≥ 1 and let EN := {e1 , e2 , . . . , eN } be an onb for CN . Let QN
denote the operator whose matrix relative to EN is [QN ] = [1i,j ]1≤i,j≤N .
Find kQN k.
Question 4.
Let [ai,j ]1≤i,j be a matrix with finite support; i.e. suppose that the set
{(i, j) : ai,j 6= 0}
is finite. Let {en }n denote the standard onb for `2 (N), and set H0 := span{en }n .
Note that we are not using closed linear spans, and so H0 is a dense linear sub-
manifold of H, consisting of finitely-supported
P sequences. Define a linear map
A0 : H0 → H0 by setting A0 eN = i≥1 ai,N ei for all N ≥ 1, and extending by
linearity to all of H0 . (Make sure that you know why and how you can do this!)
(a) Prove that A0 is continuous on H0 .
(b) Let M be any dense linear submanifold of H, and suppose that
T0 : M → M is a continuous linear map. Show that T0 extends to
a unique continuous linear map T ∈ B(H) with kT k = kT0 k. That is,
there exists (a unique) T ∈ B(H) with kT k = kT0 k and T |M = T0 .
(c) Conclude that A0 extends to a continuous linear map A ∈ B(H). What
can you say about the range of A?
2. AN INTRODUCTION TO OPERATORS 37
Question 5.
Let J : c0 → `∞ denote the canonical embedding of c0 into `∞ = (c0 )∗∗ , as
defined in Proposition 2.18. Prove that
J((xn )n ) = (xn )n
for all x = (xn )n ∈ c0 .
Question 6.
Let 1 < p < ∞. By Example 2.15, we see that `q ' (`p )∗ , and thus that
` ' (`q )∗ ' (`p )∗∗ . By identifying `p with (`p )∗∗ under this isomorphism, we may
p
Question 7.
Let (X, k · kX ), (Y, k · kY ), and (Z, k · kZ ) be Banach spaces. Let T ∈ B(X, Y) and
R ∈ B(Y, Z). Prove that R T ∈ B(X, Z) and that kR T k ≤ kRk kT k. Do we always
have equality? Can we ever have equality?
Question 8.
Let (X, k · k) be a Banach space. An operator T ∈ B(X) is said to be invertible
if there exists R ∈ B(X) such that R T = I = T R, where I ∈ B(X) is the identity
map Ix = x for all x ∈ X. We say that R is the inverse of T , and we typically
write T −1 instead of R to denote the inverse of T .
Prove that if T ∈ B(X) is invertible, then T is bounded below; that is, there
exists δ > 0 such that kT xk ≥ δ kxk for all x ∈ X.
Is the converse true? Prove it or find a counterexample to show that it is false.
38 L.W. Marcoux Functional Analysis
3. Hilbert space
3.1. In this brief Chapter, we shall examine a class of very well-behaved Banach
spaces, namely the class of Hilbert spaces. Hilbert spaces are the generalizations
of our familiar (two- and) three-dimensional Euclidean space. There are two basic
approaches to studying Hilbert spaces. If one is interested in Banach space geometry
– and many people are – then one often tries to compare other Banach spaces to
Hilbert spaces. As an example of such a phenomenon, we mention the calculation
of the Banach-Mazur distance between Banach spaces, which we define in the
Appendix to this Section.
In the second approach, one decides that because Hilbert spaces are so well-
behaved, they are in some sense “understood”, and for this reason they are “less
interesting” to study than the set of bounded linear operators acting upon them.
One can then study the operators individually or in sets which have no algebraic
structure – this kind of analysis belongs to Single Operator Theory. Alternatively,
one can various Operator Algebras, of which there are myriads of examples. The
literature dealing with operators and operator algebras is vast.
3.2. Recall that a Hilbert space H is a vector space equipped with an inner
product h·, ·i so that that the induced norm kxk := hx, xi1/2 gives rise to a complete
normed linear space, i.e. a Banach space. [When the corresponding normed linear
space is not complete, we refer only to inner product spaces.]
In any inner product space we have the Cauchy-Schwarz Inequality:
|hx, yi|2 ≤ kxk2 kyk2 ,
for all x, y ∈ H. We say that x and y are orthogonal if hx, yi = 0, and we write
x ⊥ y.
3.3. Example.
(a) If (X, µ) is a measure space, then L2 (X, µ) is a Hilbert space, with the
inner product given by
Z
hf, gi = f (x)g(x)dµ(x).
X
P∞
(b) `2= {(xn )n : xn ∈ K, n ≥ 1 and n=1 |xn |
2 < ∞} is a Hilbert space, with
the inner product given by
X
h(xn )n , (yn )n i = xn yn .
n
3. HILBERT SPACE 39
The reader with a background in measure theory will recognize that the second
example is merely a particular case of the first. While these are the canonical inner
products on these spaces, they are not the only ones.
For example, if (rn )n is any sequence of strictly positive integers, one can define
a weighted `2 space relative to this sequence by setting
X
`2(rn )n := {(xn )n ∈ KN : rn |xn |2 < ∞}
n
with inner product X
h(xn )n , (yn )n i = rn xn yn .
n
Proof. Both of these results follow immediately from the definition of the norm in
terms of the inner product.
2
The Parallelogram Law is a useful tool to show that many norms are not Hilbert
space norms.
3.5. Theorem. Let H be a Hilbert space, and K ⊆ H be a closed, non-empty
convex subset of H. Given x ∈ H, there exists a unique point y ∈ K which is closest
to x; that is,
kx − yk = dist(x, K) = min{kx − zk : z ∈ K}.
kn + km
as ∈ K because K is assumed to be convex.
2
We deduce from this that the sequence {kn }∞
n=1 is Cauchy, and hence converges
to some k ∈ K, since K is closed and H is complete. Clearly limn→∞ kn = k implies
that d = limn→∞ kkn k = kkk.
40 L.W. Marcoux Functional Analysis
3.7. Remarks.
(a) Given any non-empty subset S ⊆ H, let
S ⊥ := {y ∈ H : hx, yi = 0 for all x ∈ S}.
It is routine to show that S ⊥ is a norm-closed subspace of H. In particular,
⊥
S⊥ ⊇ span S,
the norm closure of the linear span of S.
(b) Recall from Linear Algebra that if V is a vector space and W is a (vector)
subspace of V, then there exists a (vector) subspace X ⊆ V such that
3. HILBERT SPACE 41
kΘ(y)k ≥ kyk for all y ∈ H, and Θ is isometric as well. It immediately follows that
Θ is injective, and there remains only to prove that Θ is surjective.
Let ϕ ∈ H∗ . If ϕ = 0, then ϕ = Θ(0). Otherwise, let M = ker ϕ, so that
codim M = 1 = dim M⊥ , since H/M ' K ' M⊥ . Choose e ∈ M⊥ with kek = 1.
Let P denote the orthogonal projection of H onto M, constructed as in Re-
mark 3.7. Then, as I − P is the orthogonal projection onto M⊥ , and as {e} is an
orthonormal basis for M⊥ , by Remark 3.7 (d), for all x ∈ H, we have
x = P x + (I − P )x = P x + hx, eie.
3.9. Remark. The fact that the map Θ defined in the proof the Riesz Rep-
resentation Theorem above induces an isometric, conjugate-linear automorphism of
H is worth remembering.
3.11. Example.
(a) If H = `2 then the standard Schauder basis {en }∞ 2
n=1 for ` as defined in
Example 2.13 is an orthonormal basis for H.
(b) If H = L2 (T, dm), where T = {z ∈ C : |z| = 1} and dm represents
normalised Lebesgue measure, then {fn }n∈Z is an orthonormal basis for
L2 (T, dm), where fn (z) = z n for all z ∈ T and for all n ∈ Z.
for all m ≥ 1. This is absurd. Thus |Fk | < ∞ for all k ≥ 1. But then
{e ∈ E : hx, ei =
6 0} = ∪k≥1 Fk
is countable.
(b) This is left as a (routine) exercise for the reader.
2
Unlike the situation in Banach spaces, where two non-isomorphic Banach spaces
can have Schauder bases of the same cardinality, the case of Hilbert spaces is as nice
as one can imagine.
3.21. Theorem. Two Hilbert spaces H1 and H2 over K are isomorphic if and
only if they have the same dimension.
Proof. Suppose first that H1 and H2 are isomorphic, and let U : H1 → H2 be
a unitary operator. Let {eλ }λ∈Λ be an orthonormal basis for H1 . We claim that
{U eλ }λ∈Λ is an orthonormal basis for H2 .
Indeed, hU eα , U eβ i = heα , eβ i = δα,β (the Kronecker delta function)), kU eα k =
keα k = 1 for all α ∈ Λ, and
H2 = U H1 = U (span {eλ }λ∈Λ ) = span {U eλ }λ∈Λ ,
by the continuity of U .
Hence dim H2 = |{U eλ }λ∈Λ | = |Λ| = dim H1 .
Conversely, suppose that dim H2 = dim H1 . Then we can find a set Λ and
orthonormal bases {eλ : λ ∈ Λ} for H1 , and {fλ : λ ∈ Λ} for H2 . Consider the map
U : H1 → `2 (Λ)
x 7→ (hx, eλ i)λ∈Λ ,
where `2 (Λ) := f : Λ → K : λ∈Λ |f (λ)|2 < ∞ . This is an inner product space
P
using the inner product hf, gi = λ∈Λ f (λ)g(λ). The proof that `2 (Λ) is complete
P
3.22. Corollary. The spaces `2 (N), `2 (Q), `2 (Z) and L2 ([0, 1], dx) (where dx
represents Lebesgue measure) are all isomorphic, as they are all infinite dimensional,
separable Hilbert spaces.
3. HILBERT SPACE 49
Appendix to Section 3.
3.23. When dealing with Hilbert space operators and operator algebras, one
tends to focus upon complex Hilbert spaces. One reason for this is that the spectrum
provides a terribly useful tool for analyzing operators. For X a normed linear space,
let I (or IX if we wish to emphasize the underlying space) denote the identity map
Ix = x for all x ∈ X.
3.24. Definition. Let X and Y be Banach spaces, and let T ∈ B(X, Y). We
say that T is invertible if there exists a (continuous) linear map R ∈ B(Y, X) such
that RT = IX and T R = IY .
If T ∈ B(X), we define the spectrum of T to be:
3.26. As mentioned in paragraph 3.1, Hilbert spaces arise naturally in the study
of Banach space geometry. In this context, much of the literature concerns real
Hilbert spaces.
For example, for each n ≥ 1, let Qn denote the set of n-dimensional (real)
Banach spaces. Given Banach spaces X and Y in Qn , we denote by GL(X, Y) the
set of all invertible operators from X to Y. We can define a metric δ on Qn via:
δ(X, Y) := log inf{kT k kT −1 k : T ∈ GL(X, Y)} .
It can be shown that (Qn , δ) is a compact metric space, known as the Banach-
Mazur compactum. One also refers to the quantity
d(X, Y) = inf{kT k kT −1 k : T ∈ GL(X, Y)}
as the Banach-Mazur distance between X and Y, and it is an important problem
in Banach space geometry to calculate Banach-Mazur distances between the n-
dimensional subspaces of two infinite-dimensional Banach spaces, say V and W.
Typically, one is interested in knowing something about the asymptotic behaviour
of these distances as n tends to infinity.
We mention without proof two interesting facts concerning the Banach-Mazur
distance:
(a) If X, Y and Z are n-dimensional Banach spaces, then
d(X, Z) ≤ d(X, Y) d(Y, Z).
(b) A Theorem of Fritz John shows that (with `2n := (Rn , k · k2 )),
√
d(X, `2n ) ≤ n for all n ≥ 1.
It clearly follows from these two properties that d(X, Y) ≤ n for all X, Y ∈ Qn .
3.27. We mentioned earlier in this section that the Parallelogram Law is useful
in determining that a given norm is not induced by an inner product. In fact, in
can be shown that a norm on a Banach space X is the norm induced by some inner
product if and only if the norm satisfies the Parallelogram Law.
3.28. In Lemma 3.16, we invoked the claim that a Cauchy net in our Hilbert
space H necessarily converges. Since we have defined completeness in terms of
sequences instead of nets, the following result may be helpful.
3.29. Proposition. Let (X, d) be a metric space. The following are equivalent:
(i) (X, d) is complete as a metric space; i.e. every Cauchy sequence (xn )∞
n=1
converges in X.
(ii) Every Cauchy net (xλ )λ∈Λ converges in X.
Proof.
3. HILBERT SPACE 51
(i) implies (ii). Suppose that every Cauchy sequence in X converges, and let
(xλ )λ∈Λ be a Cauchy net in X; that is, given ε > 0, there exists λ0 ∈ Λ so
that α, β ≥ λ0 implies d(xα , xβ ) < ε.
Choose λ1 ∈ Λ so that α, β ≥ λ1 implies that d(xα , xβ ) < 1. Choose
λ2 ≥ λ1 so that α, β ≥ λ2 implies that d(xα , xβ ) < 21 . (Note that by defini-
tion, we can always find γ2 ∈ Λ so that α, β ≥ γ2 implies that d(xα , xβ ) < 12 ;
by choosing λ2 ≥ λ1 , γ2 , it follows that α, β ≥ λ2 implies d(xα , xβ ) < 21 , as
required.)
Arguing as above, for n ≥ 2, we can find λn+1 ≥ λn so that α, β ≥ λn+1
1
implies that d(xα , xβ ) < n+1 .
Consider the sequence (xλn )∞ n=1 . It is easily seen that this sequence is
Cauchy, and so by hypothesis, x = limn→∞ xλn ∈ X exists. Let ε > 0 and
choose N > 2ε so that n ≥ N implies that d(xλn , x) < 2ε .
If λ ≥ λN , then
d(xλ , x) ≤ d(xλ , xλN ) + d(xλN , x)
1 ε
< +
N 2
< ε.
That is, limλ xλ = x.
(ii) implies (i). Since every sequence in X is also a net, this is clear.
2
Question 2.
Let H be a complex Hilbert space, and let M ⊆ H be a closed subspace of H.
Denote by P the orthogonal projection of H onto M. Given T ∈ B(H), we may
write the operator matrix for T relative to the decomposition H = M ⊕ M⊥ as
follows:
T T12
T = 11 ,
T21 T22
where
• T1 1 = P T PM ,
• T1 2 = P T (I − P )|M⊥ ,
• T2 1 = (I − P )T P |M and
• T2 2 = (I − P )T (I − P )|M⊥ .
(a) Prove that the operator matrix of P relative to this decomposition is
I 0
.
0 0
R11 R12
(b) Show that if R ∈ B(H) has the decomposition R = relative to
R21 R22
the same decomposition of H, then
R11 T11 + R12 T21 R11 T12 + R12 T22
RT = .
R21 T11 + R22 T21 R21 T12 + R22 T22
In other words, this behaves just like multiplication of scalar matrices.
(c) Prove that kT k ≥ kTij k, 1 ≤ i, j ≤ 2.
(d) Prove that
kT k ≤ max(kT11 k, kT22 k) + max(kT12 k, kT21 k).
(e) We say that M is invariant for T if and only if T M ⊆ M. Prove that M
is invariant for T if and only if T21 = 0, which in turn happens if and only
if (I − P )T P = 0.
(f) Prove that M is reducing for T if and only if T12 = 0 and T21 = 0, which
in turn happens if and only if T P = P T .
(g) Suppose that κ ∈ C and that R T = T R. Prove that ker (T − κI) is
invariant for R. (We say that ker (T − κI) is hyper-invariant for T . That
is, it is invariant for all operators that commute with T .)
3. HILBERT SPACE 53
Question 3.
Suppose that H is a Hilbert space and that T ∈ B(H). Let {eα }α∈Λ be an onb
for H. Prove that if R ∈ B(H) and Reα = T eα for all α ∈ Λ, then T = R.
In other words, the action of a bounded linear operator on H is entirely deter-
mined by its action on an onb.
Question 4.
Prove that if F ∈ B(H) has rank N ≥ 1, then F is a sum of N rank-one
operators.
54 L.W. Marcoux Functional Analysis
Someday I want to be rich. Some people get so rich they lose all
respect for humanity. That’s how rich I want to be.
Rita Rudner
4.2. Definition. Let W be a vector space over the field K, and let T be a
topology on W. We say that the topology T is compatible with the vector space
structure on W if the maps
σ : W ×W → W
(x, y) 7→ x + y
and
µ: K×W →W
(k, x) 7→ kx
are continuous, where K × W and W × W carry their respective product topologies.
A topological vector space (abbreviated TVS) is a pair (W, T ) where W is
a vector space with a compatible Hausdorff topology. Informally, we refer to W as
the topological vector space.
4.3. Remark. Not all authors require T to be Hausdorff in the above definition.
However, for all spaces of interest to us, the topology will indeed be Hausdorff.
Furthermore, one can always pass from a non-Hausdorff topology to a Hausdorff
topology via a natural quotient map. (See Appendix T.)
4.4. Example. Let (X, k · k) be any normed linear space, and let T denote the
norm topology. Suppose (x, y) ∈ X×X and ε > 0. Choose a net (xα , yα )α∈Λ ∈ X×X
so that limα (xα , yα ) = (x, y). Then there exists α0 ∈ Λ so that α ≥ α0 implies
kxα − xk < ε/2, kyα − yk < ε/2. But then α ≥ α0 implies kxα + yα − (x + y)k ≤
kxα − xk + kyα − yk < ε. In other words, σ(xα , yα ) tends to σ(x, y) and so σ is
continuous.
Similarly, if (k, x) ∈ K × X, then we can choose a net (kα , xα )α∈Λ so that
limα (kα , xα ) = (k, x). But then we can find α0 ∈ Λ so that α ≥ α0 implies |kα −k| <
1, and so |kα | < |k|+1. Next we can find α1 so that α ≥ α1 implies |kα −k| < ε/2kxk,
4. TOPOLOGICAL VECTOR SPACES 55
4.6. Remark. In the assignments we shall see how to construct a TVS which is
not a normed linear space. See also the discussion of Fréchet spaces in the Appendix
to Section 5.
4.9. Example. Let (X, k · k) be a normed linear space. For all δ > 0, Vδ (0) =
{x ∈ X : kxk < δ} is a balanced nbhd of 0, and if U ∈ U0X , then there exists δ > 0
such that Vδ (0) ⊆ U .
The following result shows that the assumption that a TVS topology be Haus-
dorff may be replaced with a weaker assumption - namely that points be closed (i.e.
that T be T1 ) - from which the Hausdorff condition follows.
4.12. Proposition. Let V be a vector space with a topology T for which
(i) addition is continuous;
(ii) scalar multiplication is continuous; and
(iii) points in V are closed in the T -topology.
Then T is a Hausdorff topology and (V, T ) is a TVS.
Proof. Let x 6= y ∈ V. Then {y} is closed (i.e. V\{y} is open) and so we can
find an open nbhd U ∈ Ux of x so that y 6∈ U . As above, by continuity of addition,
translation is a homeomorphism of V and so U = x+U0 for some open nbhd U0 of 0.
Also by continuity of addition, there exists an open nbhd V of 0 so that V + V ⊆ U0 .
By continuity of scalar multiplication, −V is again an open nbhd of 0, and hence
W = V ∩ (−V ) is an open nbhd of 0 as well, with W + W ⊆ V + V ⊆ U0 .
4. TOPOLOGICAL VECTOR SPACES 57
4.15. Definition. Let (V, T ) be a TVS, and let (xλ )λ be a net in V. We say
that (xλ )λ is a Cauchy net if for all U ∈ U0 there exists λ0 ∈ Λ so that λ1 , λ2 ≥ λ0
implies that xλ1 − xλ2 ∈ U .
We say that a subset K ⊆ V is Cauchy complete if every Cauchy net in K
converges to some element of K.
We pause to verify that if (xλ )λ is a net in V which converges to some x ∈ V,
then (xλ )λ is a Cauchy net. Indeed, let U ∈ U0 and choose a balanced, open nbhd
N ∈ U0 so that N + N = N − N ⊆ U . Also, choose λ0 ∈ Λ so that λ ≥ λ0 implies
that xλ ∈ x + N . If λ1 , λ2 ≥ λ0 , then xλ1 − xλ2 = (xλ1 − x) − (xλ2 − x) ∈ N − N ⊆ U .
Thus (xλ )λ is a Cauchy net.
4.16. Example. If (X, k·k) is a normed linear space, then X is Cauchy complete
if and only if X is complete. Indeed, the topology here being a metric topology, we
need only consider sequences instead of general nets.
58 L.W. Marcoux Functional Analysis
Proof. The map τ is clearly a bijection, and the continuity of scalar multiplication
in a TVS makes it continuous as well. We shall demonstrate that the inverse map
τ −1 (ke) = k is also continuous. To do this, it suffices to show that if limλ kλ e = 0,
then limλ kλ = 0. (Why? )
Let δ > 0. Then δe 6= 0, and as V is Hausdorff, we can find a nbhd U of 0 so
that δe 6∈ U . By Proposition 4.10, U contains a balanced nbhd V of 0. Obviously,
δe 6∈ V . Since limλ kλ e = 0, there exists λ0 so that λ ≥ λ0 implies that kλ e ∈ V .
Suppose that there exists β ≥ λ0 with |kβ | ≥ δ. Then
δ
δe = kβ e ∈ V,
kβ
as V is balanced. This contradiction shows that λ ≥ λ0 implies that |kλ | < δ. Since
δ > 0 was arbitrary, we have shown that limλ kλ = 0.
2
that Y is complete (check! ) and therefore closed, by Lemma 4.17. By the arguments
of paragraph 4.18, V/Y is a TVS and the canonical map qY : V → V/Y is continuous.
Moreover, {q(et1 ), q(et2 ), ..., q(etr )} is a basis for V/Y. Since r < m, our induction
hypothesis once again shows that the map
ρY : P V/Y → Kr
r
j=1 kj q(etj ) 7→ (k1 , k2 , ..., kr )
is continuous. Thus
γ := ρY ◦ q : P V → Kr
n Pn
i=1 ki ei 7→ ρY ( i=1 ki q(ei )) = (kt1 , kt2 , ..., ktr )
is also continuous, being the composition of continuous functions.
To complete the proof, we first apply the above argument with F = {em } to get
that
γ1 : P V → K
n
i=1 i i 7→ km
k e
is continuous, and then to F = {e1 , e2 , ..., em−1 } to get that
γ2 : P V → Km−1
n
i=1 ki ei 7→ (k1 , k2 , ..., km−1 )
is continuous. Since τ −1 = (γ1 , γ2 ), it too is continuous, and we are done.
2
Suppose that X is locally compact and Hausdorff, and that x0 ∈ X. Then for
all U ∈ Ux0 , there exists K ∈ Ux0 so that K is compact and K ⊆ U . Choose G ∈ T
so that x0 ∈ G ⊆ K. Then G ⊆ K = K ⊆ U , and so G is compact. That is, if X is
Hausdorff and locally compact, then for any U ∈ Ux0 , there exists G ∈ T so that G
is compact and x0 ∈ G ⊆ G ⊆ U .
4.26. Corollary. Let (X, k · k) be a NLS. Then the closed unit ball X1 of X is
compact if and only if X is finite-dimensional.
Proof. First suppose that X1 is compact. If U ∈ U0X is any nbhd of 0, then there
exists δ > 0 so that kxk < 2δ implies that x ∈ U . But then Xδ ⊆ U , and Xδ = δX1
is compact, being a homeomorphic image of X1 . By definition, X is locally compact,
hence finite-dimensional, by Theorem 4.25.
Conversely, suppose that X is finite-dimensional. By Theorem 4.25, X is locally
compact. By hypothesis, there exists a compact nbhd K of 0. As above, there exists
δ > 0 so that Xδ ⊆ K. Since Xδ is a closed subset of a compact set, it is compact.
Since X1 = (δ −1 )Xδ is a homeomorphic image of a compact set, it too is compact.
2
4.27. Definition. Let (V, TV ) and (W, TW ) be topological vector spaces and
suppose that f : V → W is a (not necessarily linear) map. We say that f is
uniformly continuous if, given U ∈ U0W there exists N ∈ U0V so that x − y ∈ N
implies that f (x) − f (y) ∈ U .
4.28. The definition of uniform continuity given here derives from the fact that
the collection B = {B(U ) = {(x, y) : x − y ∈ U } : U ∈ U0W } defines what is
known as a uniformity on the TVS (W, TW ) whose corresponding uniform topology
coincides with the initial topology T . The interested reader is referred to the book
by Willard [Wil70] and to the books of Kadison and Ringrose [KR83] for a more
complete development along these lines. We shall focus only upon that part of the
theory which we require in this text.
4.29. Let us verify that in the case where (X, k · kX ) and (Y, k · kY ) are normed
linear spaces, our new notion of uniform continuity coincides with our metric space
notion.
Observe first that if (Z, k · k) is a general normed linear space and δ > 0, then
x − y ∈ VδZ (0) if and only if kx − yk < δ.
Suppose f : X → Y is uniformly continuous in the sense of Definition 4.27.
Given ε > 0, VεY (0) = {y ∈ Y : kykY < ε} ∈ U0Y and so there exists N ∈ U0X such
that x − y ∈ N implies f (x) − f (y) ∈ VεY (0). But N ∈ U0X implies that there exists
δ > 0 so that VδX (0) ⊆ N . Thus kx − ykX < δ implies that x − y ∈ N , and thus
f (x) − f (y) ∈ VεY (0), i.e. kf (x) − f (y)kY < ε. Thus is the standard (metric) notion
of uniform continuity in a normed space.
Conversely, suppose that f : X → Y is uniformly continuous in the standard
metric sense. Let U ∈ U0Y . Then there exists ε > 0 so that VεY (0) ⊆ U . By
hypothesis, there exists δ > 0 so that kx − ykX < δ implies kf (x) − f (y)kY < ε, and
hence x − y ∈ VδX (0) implies that f (x) − f (y) ∈ VY (0) ⊆ U . That is, f is continuous
in the sense of Definition 4.27.
4.30. Definition. If (V, TV ) and (W, TW ) are topological vector spaces and
C ⊆ V, then f : C → W is uniformly continuous if for all U ∈ U0W there exists
N ∈ U0V such that x, y ∈ C and x − y ∈ N implies f (x) − f (y) ∈ U .
4.31. Example. Now (R, | · |) is a normed linear space, and so the comments of
paragraph 4.29 apply. The function f : (0, 1) → R defined by f (x) = x2 is therefore
uniformly continuous, whereas g : R → R defined by g(x) = x2 is not.
Another way in which uniform continuity in the TVS setting extends the notion
of uniform continuity in the metric setting is evinced by the following:
4.32. Proposition. Let (V, TV ) and (W, TW ) be topological vector spaces and
f : V → W be uniformly continuous. Then f is continuous on V.
Proof. Let x0 ∈ V. Let U ∈ UfW(x0 ) . Then by paragraph 4.11, U = f (x0 ) + U0
where U0 ∈ U0W . By hypothesis, there exists N0 ∈ U0V so that x − x0 ∈ N0 implies
that f (x) − f (x0 ) ∈ U0 . That is, x ∈ x0 + N0 implies f (x) ∈ f (x0 ) + N0 = U . Since
N := x0 + N0 ∈ UxV0 , we see that f is continuous at x0 . But x0 ∈ V was arbitrary,
and so f is continuous on V.
2
4.33. Theorem. Let (V, TV ) and (W, TW ) be topological vector spaces over K.
Suppose that T : V → W is linear. The following are then equivalent:
(a) there exists x0 ∈ V so that T is continuous at x0 ; and
(b) T is uniformly continuous on V.
Proof. By Proposition 4.32, it suffices to prove that (a) implies (b). To that end,
suppose that T is continuous at x0 and let U0 ∈ U0W . Then U := T x0 + U0 ∈ UTWx0 .
By continuity of T at x0 , there exists N ∈ UxV0 so that T (N ) ⊆ U . But N = x0 + N0
for some N0 ∈ U0V . Now if z ∈ N0 , then x0 + z ∈ N , and so T (x0 + z) = T x0 + T z ∈
T x0 + U0 . That is, T z ∈ U0 .
In particular, if x−y ∈ N0 , then T (x−y) = T x−T y ∈ U0 , and so T is uniformly
continuous on V.
2
4.34. Given vector spaces V and W over C, denote by VR and WR the same
spaces of vectors, viewed as vector spaces over R. Observe that if (V, TV ) and
(W, TW ) are topological vector spaces over C and T : V → W is conjugate-linear
(i.e. T (kx) = k x for all x ∈ V and k ∈ C), then T : VR → WR is linear (over R).
That is, T (kx + y) = kT x + T y for all x, y ∈ VR and k ∈ R. Moreover, the topologies
TV and TW are real-vector space topologies for VR and WR respectively (as well as
C-vector space topologies for V and W).
From this it follows that T : V → W is continuous if and only if T : (VR , TV ) →
(WR , TW ) is continuous, and by the above Theorem, this happens precisely when T
is uniformly continuous on V (or equivalently on VR , since all scalars appearing in the
definition of uniform continuity are real and since the definition of uniform continuity
does not depend upon scalar multiplication, but only upon “subtraction”).
64 L.W. Marcoux Functional Analysis
4.37. Proposition. Suppose that V and W are topological vector spaces and
that W is Cauchy complete. If X ⊆ V is a linear manifold and T0 : X → W is
continuous and linear, then T0 extends to a continuous linear map T : X → W.
Proof. Let x ∈ X and choose (xλ )λ∈Λ1 in X so that limλ xλ = x. Clearly, if T is to
be continuous, we shall need T x = limλ T xλ . The issue is whether or not this limit
exists and is independent of the choice of (xλ )λ .
Now (xλ )λ is a Cauchy net. Take U ∈ U0W . Since T0 is continuous, there exists
N ∈ U0X such that w ∈ N implies that T0 w ∈ U . Hence, there exists λ0 such
that λ1 , λ2 ≥ λ0 implies that xλ1 − xλ2 ∈ N and therefore that T0 xλ1 − T0 xλ2 =
4. TOPOLOGICAL VECTOR SPACES 65
T0 (xλ1 − xλ2 ) ∈ U . Thus (T0 xλ )λ is also a Cauchy net. Our assumption that W is
Cauchy complete implies that there exists z (depending a priori upon (xλ )λ ) such
that z = limλ T0 xλ .
Suppose that (yβ )β∈Λ2 ∈ X and that limβ yβ = x. Arguing as above, there exists
z2 ∈ W so that z2 = limβ T0 yβ . If we set
y(λ,β) := yβ , λ ∈ Λ1
x(λ,β) := xλ , β ∈ Λ2 ,
4.38. Corollary. Suppose that X and Y are Banach spaces and that M ⊆ X is
a linear manifold. If T0 : M → Y is bounded, then T0 extends to a bounded linear
map T : M → Y, and kT k = kT0 k.
Proof. Invoking Proposition 4.37, there remains only to show that kT k = kT0 k.
That kT k ≥ kT0 k is clear.
Conversely, given x ∈ M with kxk = 1 and ε > 0, there exists y ∈ M so that
kyk = 1 and kx − yk < ε. Then
kT0 y − T xk = kT y − T xk ≤ kT k ky − xk < kT kε.
(Recall that T is bounded since T is continuous!) Since ε > 0 was arbitrary,
sup{kT0 yk : y ∈ M, kyk = 1} ≥ sup{kT xk : kxk = 1},
66 L.W. Marcoux Functional Analysis
Appendix to Section 4.
4.39. We have only touched upon the basics of the theory of topological vector
spaces. Indeed, our interests lie much closer to the study of normed linear spaces
and Banach spaces. Our reason for developing the theory of TVS’s to this extent
is that there are many topologies that one associates to Banach spaces that are not
necessarily norm topologies, including weak and weak∗ -topologies which we shall
study in Chapter 7 and beyond.
One can develop a theory of normed linear spaces, and deal with each of these
topologies on an ad hoc basis, however we feel that the price we pay in introducing
this more general approach is compensated by having our versions of the Hahn-
Banach Theorem hold in any “locally convex space”, a special kind of TVS whose
topology, as we shall see in the next Chapter, is induced by a separating family of
seminorms.
Question 1.
(a) Give an example of two homeomorphic metric spaces (X, dX ) and (Y, dY )
such that X is complete, but Y is not complete.
(b) Why is this not an issue in Corollary 4.22? What is it about the “nature”
of the homeomorphism between W and Kn in the proof of Proposition 4.20
that ensures that W is complete?
Question 2.
Let (V, T ) be a TVS. Prove the following.
(a) If C ⊆ V is convex, then so is C.
(b) If E ⊆ V is balanced, then so is E.
Question 3.
Let (V, T ) be a TVS, and let W ⊆ V be a closed subspace. Prove that scalar
multiplication is continuous in the quotient topology Tq on V/W.
5. SEMINORMS AND LOCALLY CONVEX SPACES 69
The secret of life is honesty and fair dealing. If you can fake that,
you’ve got it made.
Groucho Marx
5.1. Our main interest in topological vector spaces is to develop the theory
of locally convex topological vector spaces, which appear naturally in defining cer-
tain weak topologies naturally associated with Banach spaces, including the Banach
space of all bounded operators on a Hilbert space. Locally convex spaces are also
the most general spaces for which (in our opinion) interesting versions of the Hahn-
Banach Theorem will be shown to apply. As we shall see in this section, there is an
intimate relation between locally convex topological vector spaces and separating
families of seminorms on the underlying vector spaces, a phenomenon to which we
now turn our attention.
5.2. Definition. Let V be a vector space over K. A seminorm on V is a map
p : V → R satisfying
(i) p(x) ≥ 0 for all x ∈ V;
(ii) p(λx) = |λ|p(x) for all x ∈ V, λ ∈ K;
(iii) p(x + y) ≤ p(x) + p(y) for all x, y ∈ V.
It follows from this definition that a norm on V is simply a seminorm which
satisfies the additional property that p(x) = 0 if and only if x = 0.
5.4. Example. Let V = C([0, 1], C). For each x ∈ [0, 1], the map
px : V → R
defined by setting px (f ) = |f (x)| is a seminorm on V which is not a norm.
5.7. The Minkowski functional. Let V be a TVS and suppose that E ∈ U0V
is convex. As we saw in the previous Chapter (see Remark 4.7), any nbhd of 0 in V
is absorbing, and thus there exists r0 > 0 so that x ∈ r0 E. This allows us to define
the map
pE : V → R
x 7→ inf{r ∈ (0, ∞) : x ∈ rE},
which we call the gauge functional or the Minkowski functional for E.
Note: the name is misleading, since the map is clearly not linear - its range is
contained in [0, ∞). By convexity of E, if x ∈ rE and 0 < r < s, then x = re for
some e ∈ E, so x = (1 − rs )0 + rs (se) ∈ co(sE) = sE. In particular, x ∈ sE for all
s > pE (x).
5.9. It is clear from the definition that every seminorm (and hence every norm)
on a vector space is a sublinear functional on that space. The converse is false in
general.
For example, the identity map κ : R → R is a sublinear functional on R. It is
not a seminorm since it is not even a non-negative valued function.
5. SEMINORMS AND LOCALLY CONVEX SPACES 71
5.12. Example. Recall from Example 5.4 that for each x ∈ [0, 1],
px : C([0, 1], C) → R
f 7→ |f (x)|
is a seminorm. Now B1 (0) := {f ∈ C([0, 1], C) : kf k∞ < 1} is open and f ∈ B1 (0)
implies that px (f ) = |f (x)| ≤ kf k∞ < 1.
Thus each such px is continuous on C([0, 1], C).
5.15. Corollary. Let (V, T ) be a LCS. Then V admits a nbhd base at 0 con-
sisting of balanced, open, convex sets.
5.16. Example. Let (X, k · k) be a normed linear space. For each ε > 0, the
argument of paragraph 5.6 shows that Bε (0) = {x ∈ X : kxk < ε} is convex. Since
{Bε (0) : ε > 0} is a nbhd base at 0 for the norm topology, (X, k · k) is a LCS.
More concretely, (Kn , k · k2 ) is a LCS, as is any Hilbert space H. So is B(H).
We have already seen that the quotient of a TVS by one of its closed subspaces
is a TVS. Let us first obtain the same result for locally convex spaces.
5.19. Example. Let W = C([0, 1], C) and consider Γ = {px : x ∈ Q ∩ [0, 1]},
where - as before - px (f ) = |f (x)| for all f ∈ W.
If 0 6= f ∈ W, then there exists y ∈ [0, 1] so that f (y) 6= 0. By continuity of f ,
there exists a nbhd N of y such that f (y) 6= 0 for all y ∈ N . Thus there exists a
rational number q ∈ N so that 0 6= f (q) and hence pq (f ) 6= 0. Thus Γ is a separating
family of seminorms.
74 L.W. Marcoux Functional Analysis
5.22. The above result says that a separating family of seminorms on a vector
space W gives rise to a locally convex topology on W. Our next goal is to show that
all locally convex spaces arise in this manner.
5.23. Theorem. Suppose that (V, TV ) is a LCS. Then there exists a separating
family Γ of seminorms on V which generate the topology TV .
Proof. By Corollary 5.15, (V, TV ) admits a nbhd base C0 at 0 consisting of balanced,
open, convex sets. By Proposition 5.10, for each E ∈ C0 , the Minkowski functional
pE is a seminorm and E = {x ∈ V : pE (x) < 1}.
Let Γ = {pE : E ∈ C0 }. We first show that Γ is separating. Indeed, suppose
that 0 6= x ∈ V. Since TV is Hausdorff by hypothesis, there exists G ∈ C0 so that
x 6∈ G (this is actually a bit weaker than the statement that TV is Hausdorff, but
certainly implied by it). Since G ∈ C0 , pG ∈ Γ. But x 6∈ G implies that pG (x) ≥ 1,
and hence pG (x) 6= 0. Thus Γ is separating. This is required before passing to the
next step.
By Theorem 5.21,
B = {N (x, F, ε) : x ∈ V, ε > 0, F ⊆ Γ finite}
is a base for a locally convex topology TΓ on V. Our goal, of course, is to prove that
TV = TΓ .
Let E ∈ C0 be a TV -open, balanced, convex nbhd of 0. Since E = N (0, pE , 1) ∈ B,
it follows that TΓ contains a nbhd base at 0 for the topology TV . Since both topologies
76 L.W. Marcoux Functional Analysis
are TVS-topologies, they are determined by their nbhd bases at any point (for eg.,
at 0), and from this it follows that TΓ ⊇ TV .
On the other hand, each pE ∈ Γ is bounded above by 1 on E, and E is a TV -
open nbhd of 0. By Proposition 5.11, pE is continuous on (V, TV ). It follows that
N (0, pE , ε) = p−1
E (−ε, ε) ∈ TV for all ε > 0. Thus TV contains a nbhd subbase for
TΓ at 0, and arguing as before, we get that TΓ ⊆ TV .
Hence TV = TΓ , and the topology TV is determined by the family Γ of seminorms.
2
5.24. Example. Let (X, k · k) be a normed linear space. The norm topology on
X is the metric topology induced by the metric d(x, y) = kx − yk. That is, a nbhd
base at x0 ∈ X for the norm topology is
Bx0 = {Vε (x0 ) : ε > 0}
= {{y ∈ X : ky − x0 k < ε} : ε > 0}
= {N (x0 , k · k, ε) : ε > 0}.
Thus we see that the norm topology on X is exactly the locally convex topology
generated by Γ = {k · k}. Observe that since k · k is a norm, 0 6= x ∈ X implies that
kxk =
6 0, and thus Γ is indeed separating, as required.
5.25. In Corollary 5.15, we saw that any LCS (V, T ) admits a nbhd base at 0
consisting of open, balanced, convex sets.
In fact, each N (0, {p1 , p2 , ..., pm }, ε) is balanced, open and convex for all choices
of m ≥ 1, p1 , p2 , ..., pm ∈ Γ and ε > 0, where Γ is a separating family of seminorms
which generate T . It is clear from Theorems 5.21 and 5.23 that the collection of
such sets is a nbhd base at 0 for T .
Proof.
• Suppose first that (xλ )λ converges to x in the T -topology. Given p ∈ Γ
and ε > 0, the set N (x, p, ε) ⊆ T and so there exists λ0 so that λ ≥ λ0
implies that xλ ∈ N (x, p, ε). That is, λ ≥ λ0 implies that p(x − xλ ) < ε.
Thus limλ p(x − xλ ) = 0.
Alternatively, one may argue as follows: suppose that (xλ )λ converges
to x in the T -topology. Given p ∈ Γ, we know that p is continuous in the
5. SEMINORMS AND LOCALLY CONVEX SPACES 77
There is, however, a situation where we can say a bit more than this. Let V
be a vector space and let (Xα , k · kα )α∈A be a collection of Banach spaces (in fact,
normed linear spaces will do). For each such α, suppose that Tα : V → Xα is a
linear map. Suppose furthermore that the family {Tα }α is separating in the sense
that if 0 6= x ∈ V, then there exists α ∈ A so that 0 6= Tα x ∈ Xα . Then each of the
functions
pα : V → [0, ∞)
x 7→ kTα xkα
is easily seen to be a seminorm. It is routine to verify that the fact that {Tα }α
is separating implies that Γ = {pα }α is a separating family of seminorms. Let T
denote the LCS topology on V generated by Γ. By Proposition 5.26, a net (xλ )λ
converges to x ∈ (V, T ) if and only if
lim pα (x − xλ ) = lim kTα (x − xλ )kα = 0 for all α ∈ A.
λ λ
78 L.W. Marcoux Functional Analysis
Since this is nothing more than the statement that each Tα is continuous, we find
that in this case, the T topology on V coincides with the weak topology generated
by the family {Tα }α∈A . This is still not the same as the weak topology generated
by the family Γ, however.
5.28. Example. Let H = `2 (N) and recall P that H is a Hilbert space when
equipped with the inner product h(xn ), (yn )i = ∞
n=1 xn yn .
Recall also that B(H) is a normed linear space with the operator norm kT k :=
sup{kT xk : x ∈ H, kxk ≤ 1}.
From above, we see that the norm topology on B(H) admits as a nbhd base at
T ∈ B(H) the collection
{N (T, k · k, ε) : ε > 0} = {Vε (T ) : ε > 0},
and that this is the locally convex topology generated by the separating family
Γ = {k · k} of (semi)norms.
Convergence of a net of operators (Tλ )λ to T ∈ B(H) in the norm topology (i.e.
limλ kTλ − T k = 0) should be thought of as uniform convergence on the closed unit
ball of H.
This is certainly not the only interesting topology one can impose upon B(H).
Let us first consider the topology of “pointwise convergence”.
Thus the SOT is the topology of pointwise convergence. That is, it is the weakest
topology that makes all of the evaluation maps T 7→ T x, x ∈ H continuous.
A nbhd base for the SOT at the point T ∈ B(H) is given by the collection
{N (T, {x1 , x2 , ..., xm }, ε) : m ≥ 1, xj ∈ H, 1 ≤ j ≤ m, ε > 0}
where, for m ≥ 1, F := {xj ∈ H : 1 ≤ j ≤ m} and ε > 0, we have
N (T, F, ε) = {R ∈ B(H) : kRxj − T xj k < ε, 1 ≤ j ≤ m}.
The weak operator topology (WOT) Next, for each pair (x, y) ∈ H × H,
consider the map
qx,y : B(H) → R
T 7→ |hT x, yi|.
Again, it is routine to verify that each qx,y is a seminorm but not a norm on B(H).
The locally convex topology on B(H) generated by ΓW OT := {qx,y : (x, y) ∈
H × H} is called the weak operator topology on B(H) and is denoted by WOT.
A net (Tλ )λ ∈ B(H) converges to T ∈ B(H) in the WOT if and only if
lim |h(Tλ − T )x, yi| = lim |hTλ x, yi − hT x, yi| = 0
λ λ
for all x, y ∈ H. In other words, the WOT is the weakest topology that makes all
of the functions T 7→ hT x, yi, x, y ∈ H continuous.
A nbhd base for the WOT at the point T ∈ B(H) is given by the collection
{N (T, {x1 , x2 , ..., xm , y1 , y2 , ..., ym }, ε) : m ≥ 1, xj , yj ∈ H, 1 ≤ j ≤ m, ε > 0},
where, for m ≥ 1, F := {(xj , yj ) ∈ H × H : 1 ≤ j ≤ m} and ε > 0, we have
N (T, F, ε) = {R ∈ B(H) : |hRxj − T xj , yj i| < ε, 1 ≤ j ≤ m}.
(a) implies [(b)] Suppose that p is continuous on V. Then M := p−1 ((−1, 1)) =
p−1 ([0, 1)) is a T -open nbhd of 0, and as such, it must contain a basic nbhd
N := N (0, {p1 , p2 , ..., pm }, ε) for some p1 , p2 , ..., pm ∈ Γ and ε > 0. It
follows that if pj (x) < ε for 1 ≤ j ≤ m, then x ∈ N ⊆ M , and hence
p(x) < 1.
More generally, let y ∈ V and let ry = max(p1 (y), p2 (y), ..., pm (y)).
• If ry = 0, then for all k > 0, pj (ky) = 0 < ε, 1 ≤ j ≤ m, so that from
above, p(ky) = kp(y) < 1. But then
p(y) = 0 ≤ 1 max(p1 (y), p2 (y), ..., pm (y)).
• If ry > 0, then x = 2rεy y satisfies pj (x) < ε, 1 ≤ j ≤ m, and so
ε
2ry p(y) = p(x) < 1. That is,
2ry 2
p(y) < = max(p1 (y), p2 (y), ..., pm (y)).
ε ε
We conclude that with κ = max(1, 2ε ),
p(y) ≤ κ max(p1 (y), p2 (y), ..., pm (y))
for all y ∈ V.
(b) implies [(a)] Suppose that (b) holds. Now N := N (0, {p1 , p2 , ..., pm }, 1) is
an open nbhd of 0 in the T -topology. If x ∈ N , then pj (x) < 1 for all
1 ≤ j ≤ m, and so p(x) ≤ κ. But then p is bounded above on the T -open
nbhd N of 0, and hence is continuous by Proposition 5.11.
2
5.30. Proposition. Let (V, TV ) and (W, TW ) be locally convex spaces. Let ΓV
and ΓW denote separating families of seminorms which generate the corresponding
locally convex topologies on V and W respectively. Finally, let T : V → W be a
linear map.
The following are equivalent:
(a) T is continuous.
(b) For all q ∈ ΓW there exists κ > 0 and p1 , p2 , ..., pm ∈ ΓV so that
q(T x) ≤ κ max(p1 (x), p2 (x), ..., pm (x)) for all x ∈ V.
Proof.
(a) implies (b): Suppose that T is continuous and that q ∈ ΓW . Clearly q is
continuous as well. It is routine to verify that q◦T is a seminorm on V. Since
the composition of continuous functions is continuous, q ◦ T is a continuous
seminorm on V, and the result now follows from Proposition 5.29.
(b) implies (a): Conversely, suppose that for all q ∈ ΓW there exists κ > 0 and
p1 , p2 , ..., pm ∈ ΓV so that
q(T x) ≤ κ max(p1 (x), p2 (x), ..., pm (x)) for all x ∈ V.
As before, we observe that q ◦ T is a seminorm on V for all q ∈ ΓW .
Moreover, by Proposition 5.29, each such q ◦ T is continuous.
5. SEMINORMS AND LOCALLY CONVEX SPACES 81
Appendix to Section 5.
5.34. Example. Most, but certainly not all metrics we deal with are translation
invariant. For example, if d(x, y) = |x−y| for x, y ∈ R, then d is obviously translation
invariant.
On the other hand, the metric d on R defined via:
d(x, y) = |x3 − y 3 |
for all x, y ∈ R is not translation invariant, since d(0, 1) = 1 6= 7 = d(1, 2).
5.37. Example.
(a) Let V = C ∞ (R) denote the vector space of all functions f : R → R which
are infinitely differentiable at each point x ∈ R. Let Γ = {pn,k }n,k≥0 , where
for f ∈ V,
pn,k (f ) := sup{|f (n) (x)| : x ∈ [−k, k]}.
Let T denote the LCS topology on V generated by the separating family Γ
of seminorms. Then (V, T ) is a Fréchet space.
A sequence (fk )k in V converges to f ∈ V if and only if
(n)
lim sup{|fj (x) − f (n) (x)| : x ∈ [−k, k]} = 0
j
for all n ≥ 0, k ≥ 0.
(b) If (X, k · k) is a normed linear space, then with Γ = {k · k}, X becomes a
Fréchet space.
5.39. The wot– and sot– topologies. In Example 5.28, we defined the weak-
operator topology (wot) and the strong-operator topology (sot) on B(H). As we
mentioned there, a net (Tλ )λ converges to T ∈ B(H) if and only if it converges
pointwise; that is, for each x ∈ H, limλ Tλ x = T x.
Let us now turn our attention to wot–convergence.
Let {eα : α ∈ Ω} be an onb for an infinite-dimensional Hilbert space H. Cor-
responding to any X ∈ B(H) is a matrix [X] = [xα,β ]α,β∈Ω defined by xα,β :=
hXeβ , eα i for all α, β ∈ Ω. (You will have definitely seen the finite-dimensional ver-
sion of this phenomenon, where we write [T ] = [ti,j ] ∈ Mn (C), with ti,j := hT ej , ei i
for some onb {e1 , e2 , . . . , en } for H = Cn .)
There is an important difference to note when passing from finite-dimensional
Hilbert spaces to infinite-dimensional Hilbert spaces, namely: in the infinite-dimen-
sional setting, not every matrix [xα,β ] is the matrix of a bounded linear operator X.
For example, if H is infinite-dimensional, there is no bounded linear map X ∈ B(H)
such that xα,β = 1 for all α, β ∈ Ω. Even in the case where H is infinite-dimensional
but separable, there are no known necessary and sufficient conditions to identify
84 L.W. Marcoux Functional Analysis
X = λI + K.
We mention in passing that an operator K ∈ B(H) for which there exists an
onb {en }∞
n=1 relative to which
1
∞ 2
X
2
kKk2 := |hKen , em i| < ∞
n,m=1
0 0 0 ... 0 ...
0 0 0 ... 0 . . .
..
0 . . .
. 0 0 . . .
0 . . . 0 N2 0 . . . .
[RN ] =
0 . . . . . . 0 0 . . .
0 . . . . . . 0 0 . . . . . .
.. .. .. .. ..
. . . . 0 .
That is, there is N 2 in the (N, N ) entry, and 0’s elsewhere. By our work in Chapter 2,
kRN k = N 2 for each N ≥ 1.
We leave it as an exercise for the reader to show that (RN )∞ N =1 does not converge
to anything in the weak-operator topology.
86 L.W. Marcoux Functional Analysis
Applying Example 11.19 of the online notes to the nbhd system U0sot of 0 in the
sot, we see that the latter forms a directed set using the relation
U1 ≤ U2 if U2 ⊆ U1 ,
and that if we choose an element XU ∈ U , U ∈ U0sot , then
lim XU = 0.
U ∈U0sot
Recall also from your Assignments that any sot-open nbhd of 0 ∈ B(H) contains
an infinite-dimensional sot-closed subspace of B(H). Of course, a non-zero subspace
of B(H) contains operators of any possible norm.
Each U ∈ U0sot then contains a basic sot-open nbhd of 0 of the form
N (0, F, ε) = {T ∈ B(H) : kT x − 0k < ε, x ∈ F },
where F = {x1 , x2 , . . . , xN } ⊆ H is a finite set, and ε > 0.
Choose an operator RU ∈ N (0, F, ε) ⊆ U with kRU k > 1ε . (Alternatively, we
may ask that kRU k ≥ |F |, the cardinality of F .)
Since RU ∈ U , U ∈ U0sot , we see from above that
lim RU = 0.
U ∈U0sot
Since ε > 0 can be made arbitrarily small (starting with any δ > 0, let 0 6= x ∈ H
and consider U = N (0, {x}, δ)), the net (RU )U is not bounded.
We also remark that there does not exist an unbounded sequence (Tn )∞ n=1 in
B(H) which converges in the wot to 0.
Indeed, suppose otherwise; that is, suppose that (Tn )n is a sequence and that
for each x, y ∈ H,
lim hTn x, yi = 0.
n→∞
Then, for each x ∈ H, (Tn x)∞n=1 is a sequence in H which converges weakly to
0. (This uses the Riesz Representation Theorem for Hilbert spaces to identify a
functional β ∈ H∗ with a functional βy (z) := hz, yi∀z ∈ H.) By Corollary 7.17 of
the online notes, supn≥1 kTn xk < ∞.
Of course, if we had a sequence (Tn )∞n=1 converging in the sot to 0, then for
each x ∈ H, limn kTn xk = 0, implying again that supn≥1 kTn xk < ∞.
Either way (i.e. wot or sot convergence to 0), we can apply the Uniform
Boundedness Principle: for each x ∈ H,
κx := sup kTn xk < ∞,
n≥1
5. SEMINORMS AND LOCALLY CONVEX SPACES 87
Question 1. This question is based upon Example 5.42 of the Appendix. Let
H = `2 and denote by {en }∞n=1 the standard onb for H. For each N ≥ 1, set
RN = N (eN ⊗ eN ), where for x, y ∈ H, x ⊗ y ∗ denotes the rank-one operator
2 ∗
Question 2.
Let H be an infinite-dimensional, separable, complex Hilbert space and let
{en }∞
n=1 be an onb for H. For each N ≥ 1, let PN denote the orthogonal pro-
jection of H onto span {e1 , e2 , . . . , eN }.
Prove that the sequence (Pn )∞ n=1 converges strongly to the identity; that is, that
it converges to the identity operator in the sot.
Question 3.
Let H be an infinite-dimensional, separable, complex Hilbert space, and denote
by F(H) the set of finite-rank operators in B(H). Prove that F(H) is dense in B(H)
in the wot.
Is it dense in B(H) in the sot?
6. THE HAHN-BANACH THEOREM 89
When I wake up in the morning, I just can’t get started until I’ve had
that first, piping hot pot of coffee. Oh, I’ve tried other enemas...
Emo Philips
6.4. Remarks.
(a) Recall from basic linear algebra that every linear functional on Kn is of this
form for some choice of k1 , k2 , ..., kn ∈ K. As such, every linear functional
on Kn is continuous.
(b) Recall from Proposition 4.20 that if V is an n-dimensional TVS with basis
{e1 , e2 , ..., en }, then V is homeomorphic to Kn via the map
P n n
i=1 ki ei 7→ (k1 , k2 , ..., kn ). Since the product topology on K is in turn
equivalent to the norm topology induced by the infinity norm, it follows
from (a) above that every linear functional on a finite-dimensional TVS is
continuous.
90 L.W. Marcoux Functional Analysis
6.5. Example. Let us next consider c00 (K) = {(xn )∞ n=1 : xn ∈ K for all n ≥
1 and xn = 0 for all but finitely many n’s}. Recall that this forms a normed linear
space when equipped with the norm
Define
f : c00 (K) → P K
∞ .
(xn )n 7→ n=1 xn
1 1 1
yn = ( , , ..., , 0, 0, 0, ...)
n n n
For a number of the results we shall obtain below, we shall assume that the
underlying field is R. In order to translate the results to the case of complex vector
spaces, the following Lemma will be useful.
For x ∈ V,
N
X
0 = (g − kj f j )(x + N )
j=1
N
X
= g(x) − kj fj (x),
j=1
PN
so that g = j=1 kj fj .
2
The first part of the above Proposition shows that if f and g are distinct linear
functionals on a vector space V, then they have the same kernel if and only if one
functional is a non-zero multiple of the other.
6.8. Definition. Let V be a vector space over K. A hyperplane M in V is a
linear manifold for which dim (V/M) = 1.
6.11. Example.
(a) Let X = (C([0, 1], C), k · k∞ ), and let δ 1 : X → C be the map δ 1 (f ) := f ( 12 ),
2 2
f ∈ X. Then ker δ 1 = {f : [0, 1] → C|f is continuous and f ( 21 ) = 0}. This
2
is clearly closed.
0 if i 6= j
(b) Let V = c0 (C), and let ek = (δ1k , δ2k , δ3k , ...), where δij = .
1 if i = j
Let z = (1, 1/2, 1/3, 1/4, ...). Then {z, e1 , e2 , e3 , ...} is linearly independent
in V, and as such it can be extended to a Hamel basis (i.e. a vector space
basis) for V, say
B = {z, e1 , e2 , e3 , ...} ∪ {bλ : λ ∈ Λ}.
Given v ∈ V, say v = αz + ∞
P P
k=1 βk ek + λ∈Λ γλ bλ for some α, βk , γλ ∈
K with only finitely many coefficients not equal to zero, define g(v) = α.
It is clear that this defines a linear functional on V. Since ker g is a
subspace containing ek , k ≥ 1, and since span {ek }∞ k=1 is dense in V, ker g
is dense in V. Since g 6= 0, and since ker g is dense in V, we see that ker g
is not closed.
If (X, Ω, µ) is σ-finite, then the same conclusion holds in the case where p = 1
and q = ∞.
Recall that if X is a locally compact space, then MK (X) denotes the space of
K-valued regular Borel measures on X with the total variation norm.
The key issue is that we can “reverse engineer” this process. Suppose that s ∈ R
satisfies both (1) and (2), namely
(3) f (m2 ) − p(−z + m2 ) ≤ s ≤ −f (m1 ) + p(z + m1 ) for all m1 , m2 ∈ M.
96 L.W. Marcoux Functional Analysis
• If t > 0, then
hs (tz + m) = ts + f (m)
≤ t(−f (m/t) + p(z + (m/t))) + f (m)
= p(tz + m) for all m ∈ M,
while
• if t < 0, then
hs (tz + m) = ts + f (m)
≤ t(f (−m/t) − p(−z − (m/t))) + f (m)
= −f (m) + (−t)p(−z − (m/t)) + f (m)
= p(tz + m) for all m ∈ M.
• If t = 0, then hs (tz + m) = hs (m) = f (m) ≤ p(m) = p(tz + m) for all
m ∈ M.
There remains to show, therefore, that we can find s ∈ R which satisfies (3) (or
equivalently, which satisfies both (1) and (2)). Now this can be done if
f (m2 ) + f (m1 ) ≤ p(−z + m2 ) + p(z + m1 ) for all m1 , m2 ∈ M.
But
f (m1 ) + f (m2 ) = f (m1 + m2 ) ≤ p(m1 + m2 )
≤ p(m2 − z) + p(z + m1 )
for all m1 , m2 ∈ M, and so we can choose
s0 := sup{f (m2 ) − p(−z + m2 ) : m2 ∈ M}.
Letting g = hs0 completes the proof.
2
and satisfies g0 (y) ≤ p(y) for all y ∈ Y0 . This contradicts the maximality of (Y, g).
Hence Y = V, and g has the required properties.
2
The complex version of this theorem can now be established.
Now suppose that K = C. Let f1 = Ref . Then by Lemma 6.6, |f1 (m)| ≤ p(m)
for all m ∈ M. By the argument of the first paragraph of this proof, there exists
an R-linear functional g1 : V → R so that g1 |M = f1 and |g1 (m)| ≤ p(m) for all
m ∈ M. Let g = (g1 )C denote the complexification of g1 , as obtained in Lemma 6.6.
Then g : V → C is C-linear, g|M = f , and by part (c) of that Lemma,
|g(x)| ≤ p(x) for all x ∈ V.
2
6.22. Corollary. Let (V, T ) be a LCS and {xj }m j=1 be a linearly independent
set of vectors in V. If {kj }m
j=1 ∈ K are arbitrary, then there exists g ∈ V ∗ so that
g(xj ) = kj , 1 ≤ j ≤ m.
Proof. Let M = span{xj }m j=1 , so that M is a finite-dimensional subspace of V.
Define f : M → K via
Xm Xm
f( aj xj ) = aj kj .
j=1 j=1
Then f is linear on M, and thus, by Corollary 4.35, it is continuous. By Corol-
lary 6.20, there exists g ∈ V ∗ so that g|M = f . Hence g(xj ) = kj , 1 ≤ j ≤ m.
2
A special case of the above Corollary which is worth pointing out is the following.
Let us now describe some quantitative versions of the above results, in the setting
of normed linear spaces.
6.33. Proposition. Let (V, T ) be a LCS over the field K and let ∅ 6= G ⊆ V
be an open, convex subset of V with 0 6∈ G. Then there exists a closed hyperplane
M in V such that G ∩ M = ∅.
Proof. Let us first consider the case where K = R.
Fix x0 ∈ G and let H = x0 −G. Then H ∈ U0V is open and convex. Let pH denote
the Minkowski functional on H. By Proposition 5.10, H = {x ∈ V : pH (x) < 1}.
Observe that 0 6∈ G implies that x0 6∈ H. Thus pH (x0 ) ≥ 1. Let W = Rx0 , and
define f : W → R via f (kx0 ) = kpH (x0 ). Clearly f ∈ W # . Moreover,
• if k ≥ 0, then f (kx0 ) = kpH (x0 ) = pH (kx0 ), while
• if k < 0, then f (kx0 ) = kpH (x0 ) < 0 ≤ pH (kx0 ).
It follows from the Hahn-Banach Theorem 01 (Theorem 6.18) that there exists a
linear functional g : V → R with g|W = f and g(x) ≤ pH (x) for all x ∈ V. Suppose
that y ∈ H. Then Re g(y) = g(y) ≤ pH (y) < 1.
By Proposition 6.12, g is continuous on V. Thus M := ker g is a closed hyper-
plane in V. [Note: obviously g 6= 0 since f 6= 0.]
Suppose z ∈ G. Then x0 − z ∈ H, so g(x0 ) − g(z) = g(x0 − z) ≤ pH (x0 − z) < 1.
On the other hand, g(x0 ) = f (x0 ) = pH (x0 ) ≥ 1, and so
g(z) > g(x0 ) − 1 ≥ 0, and z 6∈ M.
Thus G ∩ M = ∅.
6.36. Definition. Let (V, T ) be a TVS over the field R. By an open half-
space (resp. closed half-space) we shall mean a subset S ⊆ V for which there
exist a non-zero continuous linear functional f : V → R and k ∈ R so that
S = {x ∈ V : f (x) > k}
(resp. S = {x ∈ V : f (x) ≥ k}.)
We say that two subsets A and B of V are separated if we can find closed half-
spaces SA and SB so that A ⊆ SA , B ⊆ SB and SA ∩SB is a closed affine hyperplane
of V. We say that A and B are strictly separated if we can find disjoint open
half-spaces SA and SB with A ⊆ SA and B ⊆ SB .
Note that if f ∈ V ∗ and k ∈ R, then S = {x ∈ V : f (x) < k} is also an open
half-space, since g = −f ∈ V ∗ and S = {x ∈ V : g(x) > −k}. A similar statement
holds for closed half-spaces.
6. THE HAHN-BANACH THEOREM 103
6.37. Example.
(a) Consider R2 equipped with the Euclidean norm. Let A = {(x, y) ∈ R2 :
x < 0 and y ≥ 1/x2 }, B = {(x, y) ∈ R2 : x > 0 and y ≥ 1/x2 }. Then
f : R2 → R, f (x, y) = x defines a continuous linear functional on R2 . Let
SA = {(x, y) ∈ R2 : f (x) < 0} and SB = {(x, y) ∈ R2 : f (x) > 0}. Then
SA , SB are disjoint open half-spaces with A ⊆ SA and B ⊆ SB . Hence A
and B are strictly separated.
(b) With R2 , f A, SA and SB as above, set C = {(0, y) : y ∈ R}. Then
A ⊆ SA = {(x, y) ∈ R2 : f (x) ≤ 0}, C ⊆ SB = {(x, y) ∈ R2 : f (x) ≥ 0}
and SA , SB are closed half-spaces for which SA ∩SB = {(x, y) ∈ R2 : x = 0}
is a closed (affine) hyperplane.
Thus A and C are separated. One can check that A and C are not
strictly separated.
6.39. Remark. If A were open but not B, then G = ∪b∈B A − b would still be
a union of open sets and hence would still be open. The conclusion would be there
there exist a continuous linear functional g : V → R and a constant k ∈ R such that
A ⊆ {x ∈ V : g(x) > k} and B ⊆ {x ∈ V : g(x) ≤ k}.
Proof. Thinking of (V, T ) as a vector space over R, we may apply Theorem 6.38
(i.e. the HB04-R) above to obtain a continuous R-linear functional fR : V → R and
a constant k ∈ R so that
fR (a) > k > fR (b) for all a ∈ A, b ∈ B.
Let fC (x) = fR (x) − ifR (ix) be the complexification of fR . By Lemma 6.6, fC is
continuous and
RefC (a) > k > RefC (b) for all a ∈ A, b ∈ B.
Thus f = fC is the desired C-linear functional.
2
6.42. Corollary. Let (V, T ) be a LCS over R and ∅ 6= A ⊆ V. Then the closed,
convex hull of A, co(A), is the intersection of the closed half spaces that contain A.
Proof. Let Ω = {S : A ⊆ S, S ⊆ V is a closed half-space}. Since each S ∈ Ω is
closed and convex, B = ∩S∈Ω S is again closed and convex. Clearly A ⊆ B, and so
the closed convex hull of A is also a subset of B.
If z 6∈ co(A), then {z} and co(A) are disjoint, non-empty, closed and convex
subsets of V. Since {z} is compact, we can apply Theorem 6.41 (i.e. HB05) to
obtain a linear functional f ∈ V ∗ and α, β ∈ R such that
f (z) ≥ α > β ≥ f (y)
for all y ∈ co(A). Thus if S0 = {x ∈ V : f (x) ≤ β}, then S0 is a closed half-space of
V, A ⊆ co(A) ⊆ S0 , and z 6∈ S0 . Thus ∩S∈Ω S ⊆ co(A), proving that
co(A) = ∩S∈Ω S.
2
106 L.W. Marcoux Functional Analysis
Appendix to Section 6.
6.44. The proof of Proposition 6.33 also gives us an indication of how one may
try to interpret the Hahn-Banach Theorem 01, namely Theorem 6.18, geometrically.
Let V be a vector space over R and let p be a sublinear functional on V. Suppose
that M is a linear manifold in V and f : M → R is a linear functional with
f (m) ≤ p(m) for all m ∈ M.
Let H = {x ∈ V : p(x) < 1}. It is routine to verify that H is convex. Since
0 6= f , there exists m0 ∈ M such that f (m0 ) > 1. This forces p(m0 ) ≥ f (m0 ) > 1,
and so m0 6∈ H. Let K = m0 − H = {m0 − h : h ∈ H}. Clearly K is also convex.
Since m0 6∈ H, 0 6∈ K. In fact, we claim that K ∩ ker f = ∅.
Suppose otherwise: let k ∈ K ∩ ker f . Then k ∈ M and k = m0 − h for some
h ∈ H, which forces h ∈ H ∩ M, since k, m0 ∈ M. But
0 = f (k) = f (m0 − h) = f (m0 ) − f (h) > 1 − p(h) > 0,
a contradiction. Thus K ∩ ker f = ∅. Theorem 6.18 then says that we can extend
f to a linear functional g : V → R with g|M = f , such that
g(x) ≤ p(x) for all x ∈ V.
A similar analysis to that above shows that K ∩ ker g = ∅. In other words,
one can translate the “unit ball” H of V (as measured by the sublinear functional
p – note, p doesn’t even have to be a non-negative-valued function, and hence the
interpretation of H as a “unit ball” here is very, very loose) so that it doesn’t
intersect the linear manifold ker f in such a way that the manifold may be extended
to a hyperplane (ker g) which also doesn’t intersect the translation.
6. THE HAHN-BANACH THEOREM 107
Part (b) of Proposition 6.7 admits a second proof (by induction). We thank
W. Shen for pointing this out and providing the following proof.
Question 1.
Let V be a vector space over C, and let f : V → R be an R-linear functional.
Let fC denote the C-linear functional defined in Lemma 6.6, namely
fC (x) := f (x) − if (ix), x ∈ V.
Prove that kf k = ∞ if and only if kfC k = ∞.
Question 2.
For those of you who have studied Lebesgue measure: let 1 ≤ p < ∞, and let q
denote the Lebesgue conjugate of p, and dm denote Lebesgue measure on the real
line. Let f ∈ Lp ([0, 1], K). Prove that there exists g ∈ Lq ([0, 1], K) such that
(i) kgkq = 1, and
(ii)
Z 1
f gdm = kf kp .
0
This is typically one of the first things that one proves about Lp -spaces in any course
on measure theory. Hopefully you can now see why.
Question 3.
Find a linear functional ϕ ∈ (`∞ )∗ such that ϕ|c0 ≡ 0. That is, describe ϕ
explicitly.
Question 4.
Let 1 ≤ p ≤ ∞. Consider x = (1, 2) ∈ (C2 , k · kp ). Find a linear functional
ϕp ∈ (C2 , k · kp )∗ such that kϕp k = 1 and
ϕp (x) = kxkp .
110 L.W. Marcoux Functional Analysis
Last week I stated that this woman was the ugliest woman I had ever
seen. I have since been visited by her sister and now wish to withdraw
that statement.
Mark Twain
Thus (xλ )λ converges to x weakly. It follows that the weak topology on V induced
by V ∗ is weaker than the initial topology: in other words, σ(V, V ∗ ) ⊆ T .
Proof. That L is contained in (V, σ(V, L))∗ was shown in Paragraph 7.1.
Suppose now that µ ∈ V # is σ(V, L)-continuous. Then the map pµ : V → R
satisfying pµ (x) = |µ(x)| defines a σ(V, L)-continuous seminorm on V . By Proposi-
tion 5.29, there exist ρ1 , ρ2 , ..., ρn ∈ L and 0 < κ ∈ R so that
pµ (x) = |µ(x)| ≤ κ max(|ρ1 (x)|, |ρ2 (x)|, ..., |ρn (x)|) for all x ∈ V.
It follows that ker µ ⊇ ∩nj=1 ker ρj . By Proposition 6.7, µ ∈ span {ρj }nj=1 ⊆ L.
2
7.5. Remark.
• We first remark that if Ω ⊆ V # is a separating family of linear functionals
but is not a linear manifold, then after setting L = span Ω, one can verify
that the σ(V, L)-topology on V agrees with the σ(V, Ω)-topology, and hence
that
L = (V, σ(V, Ω))∗ .
• It follows from Theorem 7.4 that the only weakly continuous linear func-
tionals on a locally convex space (V, T ) are the elements of (V, T )∗ .
112 L.W. Marcoux Functional Analysis
7.7. Remark. It follows that a base for the weak∗ -topology on V ∗ is given by
B = {N (ϕ, F, ε) : ϕ ∈ V ∗ , ε > 0, F ⊆ V finite},
where
N (ϕ, F, ε) = {ρ ∈ V ∗ : |b
x(ρ − ϕ)| = |ρ(x) − ϕ(x)| < ε, x ∈ F }.
Moreover, a net (ρλ )λ in V converges in the weak∗ -topology to a functional ρ ∈ V ∗
∗
if and only if limλ ρλ (x) = ρ(x) for all x ∈ V. In other words, convergence in the
weak∗ -topology on V ∗ is convergence at every point of V.
By Theorem 7.4, a functional ϕ is weak∗ -continuous on V ∗ if and only if ϕ = x
b
for some x ∈ V.
7.8. Proposition. Let (V, TV ) and (W, TW ) be locally convex spaces, and sup-
pose that T : (V, TV ) → (W, TW ) is a continuous linear operator. Then T is contin-
uous as a linear map between V and W when they are equipped with their respective
weak topologies.
Proof. Suppose that (xλ )λ is a net in V which converges weakly to x ∈ V. We must
show that the net (T xλ )λ converges weakly to T x in W. Now, if ρ ∈ W ∗ , then ρ ◦ T
is continuous with respect to the TV topology on V, and hence ρ ◦ T ∈ V ∗ . But the
weakly continuous linear maps on V coincide with V ∗ , and therefore ρ ◦ T is weakly
continuous on V, i.e. limλ ρ ◦ T (xλ ) = ρ ◦ T (x), as was to be shown.
2
When V is a LCS and C ⊆ V is convex, we get a particularly nice result con-
cerning the weak topology.
7.9. Theorem. Let C be a convex set in a LCS (V, T ). Then the closure of C
in (V, T ) coincides with its weak closure in (V, σ(V, V ∗ )).
Proof. First observe that we can always view (V, T ) and (V, σ(V, V ∗ )) as locally
convex spaces over R. Since C is assumed to be convex already, Corollary 6.42
implies that the closure of C in (V, T ) (resp. in (V, σ(V, V ∗ ))) is the intersection of
the T -closed (resp. σ(V, V ∗ )-closed) half spaces which contain C.
But a closed half space in a LCS corresponds to (a constant and) a continuous
linear functional on that space. Since (V, T ) and (V, σ(V, V ∗ )) share the same dual
7. WEAK TOPOLOGIES AND DUAL SPACES 113
space, namely V ∗ , it follows that they also share the same closed half-spaces, and
hence the closure of C in these two topologies must coincide.
2
7.10. Let (X, k · k) be a Banach space. Recall from Proposition 2.18 that the
canonical embedding
J : X → X∗∗
x 7→ x b,
where x ∗ ∗ ∗ ∗
b(x ) := x (x) for all x ∈ X , is a contractive map. In Corollary 6.31 we saw
that – as a consequence of the Hahn-Banach Theorem – J is in fact an isometry.
By Theorem 7.4 and Remark 7.5, J(X) corresponds exactly to the weak∗ -
continuous linear functionals on X∗ .
Proof. For each m ≥ 1, let Em,h = {x ∈ X : |h(x)| ≤ m}, and let Em = ∩h∈H Em,h .
Since each Em,h is closed (as h ∈ H implies that h is continuous), so is Em . Also,
for any x ∈ X, there exists m > Mx , and so x ∈ Em . Thus
X = ∪∞
m=1 Em .
114 L.W. Marcoux Functional Analysis
kT xk ≤ M for all T ∈ A, x ∈ G.
Now ∅ 6= G open implies that there exists z ∈ G and δ > 0 so that Vδ (z) = {x ∈
X : kx − zk < δ} ⊆ G. Consider y ∈ Vδ (0) and T ∈ A.
Then
kT yk ≤ kT (y + z)k + k − T zk
≤ M + kT zk
≤ 2M,
Proof. Suppose that S is bounded by M > 0. If x∗ ∈ X∗ , then |x∗ (s)| ≤ kx∗ k ksk ≤
M kx∗ k < ∞.
Conversely, if sup{|x∗ (s)| : s ∈ S} < ∞, then sup{|b
s(x∗ )| : s ∈ S} < ∞ for all
∗ ∗
x ∈ X . By the Uniform Boundedness Principle,
sk : s ∈ S} = sup{ksk : s ∈ S} < ∞.
sup{kb
2
Hence kT k ≤ lim inf n kTn k ≤ M < ∞, which completes parts (b) and (c).
2
116 L.W. Marcoux Functional Analysis
then
(iii) sup kyn∗ k < ∞; and
(iv) ky ∗ k ≤ lim inf kyn∗ k.
Proof.
(a) Consider the natural isometric embedding Γ : X → X∗∗ sending y to yb,
where yb(x∗ ) = x∗ (y) for all x∗ ∈ X∗ .
Letting Y = K, we may apply the Banach-Steinhaus Theorem above
to the sequence (c xn )∞ ∞
n=1 , noting that convergence of (xn )n=1 in the weak
topology is simply the statement that limn→∞ x cn (x∗ ) = x
b(x∗ ) exists for all
∗ ∗
x ∈ X . Since Γ is an isometry, the result readily follows.
(b) Again, this is an immediate application of the Banach-Steinhaus Theorem,
replacing X in that Theorem with X∗ , and Y with K.
2
Before considering our next example, let us first recall a result from Measure
Theory, alternately referred to as the Riesz Representation Theorem or the Riesz-
Markov Theorem.
Proof. Suppose first that {fn }∞ n=1 converges weakly to f . By Corollary 7.17,
supn kfn k < ∞. Let δx : C(X) → K be the evaluation functional, δx (f ) = f (x) for
all x ∈ X. Then δx is linear and for f ∈ C(X), |δx (f )| = |f (x)| ≤ kf k, so that
kδx k ≤ 1 and δx ∈ C(X)∗ . (In fact, δx corresponds to the point mass measure at x.)
Thus limn→∞ δx (fn ) = δx (f ), i.e. limn→∞ fn (x) = f (x).
Conversely, suppose that (i) and (ii) hold. If ρ ∈ C(X)∗ , then by the Riesz
Representation Theorem above, there exists µ ∈ M(X) with kµk = kρk so that
Z
ρ(f ) = f dµ,
X
for all f ∈ C(X). By the Lebesgue Dominated Convergence Theorem,
Z Z
ρ(f ) = f dµ = lim fn dµ = lim ρ(fn ).
X n→∞ X n→∞
7.25. Example. By identifying c0 (K)∗ with `1 (K) and `1 (K)∗ with `∞ (K), we
see that the unit ball (c0 (K))1 of c0 (K) is weak∗ -dense in the closed unit ball of
`∞ (K), and thus c0 (K) is weak∗ -dense in `∞ (K).
Of course, c00 (K) is norm dense in c0 (K), and so c00 (K) is also weak∗ -dense in
∞
` (K).
Culture: Although we shall not have time to prove this, the non-commutative
analogue of the above statement is that the set of finite rank operators F(H) on an
infinite-dimensional Hilbert space is weak∗ -dense in B(H).
Let us now establish a relation between compactness and reflexivity of a Banach
space.
Thus x∗λ (xn ) converges to x∗ (xn ) for all n ≥ 1 if and only if (x∗λ )λ converges in the
weak∗ -topology to x∗ . Hence the weak∗ -topology on X∗1 is metrizable.
Next, assume that X∗1 is weak∗ -metrizable. Then we can find a countable se-
quence {G∗n }∞ ∗ ∗ ∞ ∗
n=1 of weak -open nbhds of 0 ∈ X1 so that ∩n=1 Gn = {0}. There is no
∗ ∗
harm in assuming that each Gn is a basic weak -open nbhd, so for each n ≥ 1 there
exists εn > 0 and a finite set Fn ⊆ X so that
Let F = ∪∞ ∗ ∗ ∗ ∗ ∗
n=1 Fn . If x ∈ X1 , x (F ) = 0, then x ∈ Gn for all n ≥ 1, and therefore
x∗ = 0. That is, if Y = spank·k F , then Y is separable and x∗ ∈ X∗1 , x∗ |Y = 0 implies
that x∗ = 0. By the Hahn-Banach Theorem [Corollary 6.29], Y = X.
2
Observe that M⊥ and ⊥ N are linear manifolds in their respective spaces. More-
over, both are norm-closed and hence Banach spaces in their own right.
122 L.W. Marcoux Functional Analysis
7.33. It is a worthwhile exercise to think about the relationship between the an-
nihilator M⊥ of a subspace M of a Hilbert space H and the orthogonal complement
of M in H, for which we used the same notation.
In particular, one should interpret what Theorem 7.32 says in the Hilbert space
setting, where M⊥ refers to the orthogonal complement of M.
If you had a face like mine, you’d punch me right on the nose, and
I’m just the fella to do it.
Stan Laurel
124 L.W. Marcoux Functional Analysis
Appendix to Section 7.
7.34. Proposition. Suppose that 1 < p < ∞. A sequence (xn )∞ n=1 in ` (N)
p
(i.e. each xn = (xn1 , xn2 , xn3 , ...) ∈ `p (N) converges weakly to z = (z1 , z2 , z3 , ...) ∈
`p (N)) if and only if
(i) supn≥1 kxn k < ∞, and
(ii) limn→∞ xnk = zk for all k ∈ N.
7. WEAK TOPOLOGIES AND DUAL SPACES 125
Question 1.
Let X be a Banach space and C ⊆ X be convex. Prove the following.
k·k weak
(a) C = C .
(b) C is norm-closed if and only if C is weakly closed.
Question 2.
Let X be a Banach space. Prove that X1 is weakly metrizable if and only if X∗
is separable.
Question 3.
Give a direct proof (i.e. without appealing to Goldstine’s Theorem) that c0 is
weak∗ -dense in `∞ .
Question 4.
Let (X, τ ) be a topological space. A function f : X → R is said to be lower
semicontinuous if for all a ∈ R, f −1 (a, ∞) is open in X.
(a) Show that every continuous function from X into R is lower semicontinuous.
(b) Let (X, k · k) be a Banach space. Prove that the function
f : (X, σ(X, X∗ )) → R
x 7→ kxk
is lower semicontinuous. That is, the norm on X is lower semicontinuous
for the weak topology on X.
(c) Let (X, k · k) be a Banach space. Prove that the function
g : (X∗ , σ(X∗ , X)) → R
x∗ 7→ kx∗ k
is lower semicontinuous. That is, the norm on X∗ is lower semicontinuous
for the weak∗ - topology on X.
126 L.W. Marcoux Functional Analysis
8. Extremal points
8.1. The main result of this section is the Krein-Milman Theorem, which asserts
that a non-empty, compact, convex subset of a LCS has extreme points; so many,
in fact, that we can generate the compact, convex set as the closed, convex hull of
these extreme points.
Extreme points of convex sets appear in many different contexts in Functional
Analysis. For example, it is an interesting exercise (so interesting that it may appear
as an Assignment question) to calculate the extreme points of the closed unit ball
B(Cn )1 of the locally convex space B(Cn ), where Cn is endowed with the Euclidean
norm k · k2 and B(Cn ) is given the operator norm.
Recall that a linear map T ∈ B(Cn ) is said to be positive and we write T ≥ 0
if hT x, xi ≥ 0 for all x ∈ Cn . An equivalent formulation of this property says that
T is positive if there exists an orthonormal basis for Cn with respect to which the
matrix [T ] of T is diagonal, and all eigenvalues of T are non-negative real numbers.
A linear functional ϕ ∈ B(Cn )∗ is said to be positive if ϕ(T ) ≥ 0 whenever T ≥ 0.
For example, if {e1 , e2 , ..., en } is an orthonormal basis for Cn , then the so-called
normalized trace functional
n
1X
τ (T ) := hT ek , ek i
n
k=1
for T ∈ B(Cn ) can be shown to be a positive linear functional of norm one.
The state space S(B(Cn )) of B(Cn ), consisting of all positive, norm-one linear
functionals on B(Cn ) – called states – forms a non-empty, compact, convex subset
of B(Cn )∗ . The extreme points of the state space are called pure states. For
example, if x ∈ Cn and kxk = 1, then the map
ϕx : B(Cn ) → C
T 7→ hT x, xi
defines a pure state. States on B(Cn ) (and more generally states on so-called
C ∗ -algebras) are of extreme importance in determining the representation theory
of these algebras. This, however, is beyond the scope of the present manuscript.
8.2. Definition. Let V be a vector space and C ⊆ V be a convex set. A point
e ∈ C is called an extreme point of C if whenever there exist x, y ∈ C and t ∈ (0, 1)
for which
e = tx + (1 − t)y,
it follows that x = y = e. We denote by Ext(C) the (possibly empty) set of all
extreme points of C.
8. EXTREMAL POINTS 127
8.3. Example.
(a) Let V = C. Let D = {w ∈ C : |w| < 1} denote the open disk. It is easy
to see that D is convex. However D has no extreme points. If w ∈ D, then
|w| < 1, so there exists δ > 0 so that (1 + δ)|w| < 1. Let x = (1 + δ)w,
y = (1 − δ)w. Then x, y ∈ D and w = 12 x + 12 y.
(b) With V = C again, let D = {w ∈ C : |w| ≤ 1}. Then every z ∈ T :=
{z ∈ C : |z| = 1} is an extreme point of D. The proof of this is left as an
exercise.
(c) Let V = R2 , and let p1 , p2 , p3 be three non-collinear points in V . The
triangle T whose vertices are p1 , p2 , p3 has exactly {p1 , p2 , p3 } as its set of
extreme points.
8.6. Example.
(a) Let V = R2 and let p1 , p2 , p3 be three non-collinear points in V . Denote
by T the triangle whose vertices are p1 , p2 , p3 . Then T is a face of itself.
Also, each line segment pi pj is a face of T . Finally, each extreme point pj
is a face of T .
(b) Let V = R3 and C be a cube in V , for e.g.,
C = {(x, y, z) ∈ R3 : 0 ≤ x, y, z ≤ 1}.
Then C has itself as a face. Also, the 6 (square) sides of the cube are faces.
The 12 edges of the cube are also faces, as are the 8 corners. The corners
are extreme points of the cube.
128 L.W. Marcoux Functional Analysis
The following result is a crucial step in the proof of the Krein-Milman Theorem.
8. EXTREMAL POINTS 129
Appendix to Section 8.
8.13. Convexity and convex hulls. It was anticipated that everyone would
have seen the concept of convexity and of convex hulls. Hopefully, therefore, the
following remarks will serve as a refresher for everyone.
Let V be a vector space over K. Recall that a subset E of V is said to be
convex if x, y ∈ E and 0 ≤ t ≤ 1 implies that tx + (1 − t)y ∈ E. A simple finite
P is equivalent toPrequiring that if n ≥ 1,
induction argument shows that this definition
tk ∈ [0, 1], xk ∈ E for all 1 ≤ k ≤ n and nk=1 tk = 1, then nk=1 tk xk ∈ E.
A sum of the form
n
X
t k xk
k=1
Pn
where 0 ≤ tk ≤ 1 and k=1 tk xk is called a convex combination of the xk ’s.
If {Eλ }λ is a collection of convex subsets of V, then F := ∩λ Eλ is once again
convex. For if x, y ∈ F and 0 ≤ t ≤ 1, then for each λ, we have that x, y ∈ Eλ and
Eλ convex implies that tx + (1 − t)y ∈ Eλ , whence tx + (1 − t)y ∈ ∩λ Eλ = F .
Let us define
n
X n
X
F ={ tk hk : n ≥ 1, tk ≥ 0, hk ∈ H, 1 ≤ k ≤ n, tk = 1}.
k=1 k=1
But
n m n m
!
X X X X
rtk + (1 − r)sj = r tk + (1 − r) sj = r + (1 − r) = 1,
k=1 j=1 k=1 j=1
I want to go back to Brazil, get married, have lots of kids, and just
be a couch tomato.
Ana Beatriz Barros
8. EXTREMAL POINTS 133
Question 1.
Prove that there does not exist a Banach space X such that X∗ = (c0 , k · k∞ ).
Question 2.
Let 1 ≤ N be an integer. Find the extreme points of the closed unit ball of
(B(CN ), k · k), where k · k denotes the operator norm.
Question 3.
Let H = `2 and let {en }∞n=1 denote the standard onb for H. Let S ∈ B(H) be
the unilateral forward shift defined by Sen = en+1 , n ≥ 1.
Prove or disprove that S an extreme point of the closed unit ball of B(H).
Question 4.
Find the extreme points of the closed unit ball of each of the following spaces:
(a) (`1 , k · k1 );
(b) (`p , k · kp ), 1 < p < ∞;
(c) (`∞ , k · k∞ ).
134 L.W. Marcoux Functional Analysis
Thus
kT xkY ≤ kxkX + kT xkY = k(x, T x)k1 = kπG−1 (x)k1 ≤ kπG−1 k kxkX
Appendix to Section 9.
Question 1.
Let X be a Banach space and T ∈ B(X). Recall that a closed subspace Y of X
is said to be invariant for T if T Y ⊆ Y.
Suppose that Y and Z are invariant for T , and that Z is a topological complement
for Y. Prove that there exists an idempotent E ∈ B(X) such that ET = T E.
Question 2.
Let H be a complex Hilbert space and M ⊆ H be a closed subspace. Prove
that E = E 2 ∈ B(H) is an idempotent with ran E = M if and only if there exists
X ∈ B(M⊥ , M) such that relative to the decomposition H = M ⊕ M⊥ , we have
I X
E= .
0 0
Question 3.
Let H be a Hilbert space and let E = E 2 ∈ B(H) be an idempotent operator.
Prove that there exists an invertible operator S ∈ B(H) such that P = S −1 ES is
an orthogonal projection.
Question 4.
Let (X, d) be a metric space and H ⊆ X. We say that H is nowhere dense
(or meager, or thin) if G := X \ H is dense in X. In other words, the interior of
H is empty.
We say that a subset H of a metric space (X, d) is of the first category in
(X, d) if there exists a sequence (Fn )∞
n=1 of closed, nowhere dense sets in X such
that
H ⊆ ∪∞ n=1 Fn .
Otherwise, H is said to be of the second category.
10.1. Much of the work that has been done in Banach space theory has focussed
on the study of the geometric structure of Banach spaces. For example, people have
been interested in how “close” two Banach spaces are to being isomorphic as Banach
spaces (say, in terms of the Banach-Mazur distance between them), and they have
been interested in finding subspaces of a Banach space which have or are close to
having a prescribed geometric structure. Other interesting questions in this area
involve the study of how well finite-dimensional subspaces of one Banach space can
be embedded in a second Banach space, and yet others involve the search for nice
“bases” for a Banach space (for eg., the search for Schauder bases for quotients of
Banach spaces), or the renorming of Banach spaces by equivalent norms. This list
is anything but inclusive.
In contrast, the study of Hilbert spaces focusses very much on the structure of the
bounded linear operators acting on the space. This is in part because Hilbert spaces
are so well-behaved. This means that one pretends to understand the underlying
space H, and instead focusses upon understanding the more complicated structure,
namely B(H). Because Hilbert spaces are so well-behaved, there is a chance of
getting interesting and deep results about subsets and subalgebras of B(H).
Of course, one can also study the space B(X) of bounded linear operators acting
on a Banach space X. Here one is often interested in how the structure of the
underlying space X determines the operators in B(X). In this Section we shall
examine the notion of compactness of operators on a Banach space. We begin,
however, with the notion of the Banach space adjoint of an operator.
Let X and Y be Banach spaces. Let T ∈ B(X, Y). Given y ∗ ∈ Y∗ , the map
x 7→ y ∗ (T x)
is a linear functional on X. Let us denote by T ∗ y ∗ ∈ X# the functional on X
determined by this formula, namely:
T ∗ y ∗ (x) = y ∗ (T x) for all x ∈ X.
Observe that
kT ∗ y ∗ (x)k = ky ∗ (T x)k ≤ ky ∗ k kT k kxk,
and so kT ∗ y ∗ k ≤ kT k ky ∗ k < ∞, implying that T ∗ y ∗ ∈ X∗ . Furthermore, the map
T ∗ Y∗ → X∗
y ∗ 7→ T ∗ y ∗
is easily seen to be linear (by definition of linear combinations of functionals on X).
142 L.W. Marcoux Functional Analysis
Many authors adopt the following notation for the action of a functional on a
vector, namely: given x ∈ X and x∗ ∈ X∗ , they write hx, x∗ i to denote x∗ (x). In
this notation, the equation T ∗ y ∗ (x) = y ∗ (T x) for all x ∈ X, y ∗ ∈ Y∗ becomes:
Again, since this is true for all x ∈ X and then for all z ∗ ∈ Z∗ , we find that
(R ◦ T )∗ = T ∗ ◦ R∗ .
2
10. OPERATOR THEORY 143
10.4. Proposition. Let X = Cn and A ∈ B(X) ' Mn . Then the matrix of the
Banach space adjoint A∗ of A with respect to the dual basis coincides with At , the
transpose of A.
Proof. Recall that X∗ ' X. We then let {ei }ni=1 be a basis for X and let {fj }nj=1 be
the corresponding dual basis for X∗ ; that is, fj (ei ) = δij , where δij is the Kronecker
delta function. Let x ∈ X. Define λj = fj (x), 1 ≤ j ≤ n.
Writing the matrix of A ∈ B(X) as [aij ], we have
0 a1j
0 a2j
. .
. .
n
0 aj−1 j X
Aej = [aij ] =
1 ajj
= akj ek .
k=1
0 aj+1 j
. .
. .
0 anj
Thus aij = fi (Aej ).
Now A∗ ∈ B(X∗ ) ' Mn , and so we can also write the matrix [αij ] for A∗ with
respect to {fj }nj=1 . Paralleling the above computation for [αij ], we obtain:
n
X
∗
A fj = αkj fk ,
k=1
and thus
αij = (A∗ fj )(ei ) = fj (Aei ) = aji .
In particular, the matrix for A∗ with respect to {fj }nj=1 is simply the transpose of
the matrix for A with respect to {ej }nj=1 .
2
The fact that a Hilbert space is isometrically isomorphic (via a conjugate-linear
map) to its own dual allows us to define a separate notion of an adjoint for operators
acting on these spaces. The crucial difference between the Hilbert space adjoint and
the Banach space adjoint of an operator T acting on a Hilbert space H is that the
Hilbert space adjoint will operate on the same Hilbert space H, while the Banach
space adjoint will be an operator in B(H∗ ). While H∗ is a Hilbert space isomorphic
to H, it is not H itself.
10.5. Theorem. Let H be a Hilbert space and let T ∈ B(H). Then there exists
a unique operator T ∗ ∈ B(H), called the Hilbert space adjoint of T , satisfying
hT x, yi = hx, T ∗ yi
for all x, y ∈ H.
144 L.W. Marcoux Functional Analysis
10.8. Remark. For a Hilbert space H and A, B ∈ B(H), it is easy to see that
we have (AB)∗ = B ∗ A∗ . Indeed, given x, y ∈ H,
hx, (AB)∗ yi = hABx, yi = hBx, A∗ yi = hx, B ∗ A∗ yi,
from which the result follows. The adjoint operator
∗ : B(H) → B(H)
is an example of an involution on a Banach algebra. Namely, for all α, β ∈ C and
A, B ∈ B(H), we obtain
(i) (αA)∗ = αA∗ ;
(ii) (A + B)∗ = A∗ + B ∗ ; and
(iii) (AB)∗ = B ∗ A∗ .
(iv) (A∗ )∗ = A.
10.11. Definition. Let X and Y be Banach spaces, and T ∈ B(X, Y). Then T
is said to be compact if T (X1 ) is compact in Y. The set of compact operators from
X to Y is denoted by K(X, Y), and if Y = X, we simply write K(X).
10.12. Proposition. Let X and Y be Banach spaces, and T ∈ B(X, Y). The
following are equivalent:
(a) T is compact;
(b) T (F ) is compact in Y for all bounded subsets F of X;
(c) If (xn )n is a bounded sequence in X, then (T xn )n has a convergent subse-
quence in Y;
(d) T (X1 ) is totally bounded.
Proof. This is left as an Assignment exercise.
2
T ◦ K(X1 ) = T (K(X1 ))
⊆ T (K(X1 ))
= T (K(X1 )).
Since T ◦ K(X1 ) is a closed subset of the compact set T (K(X1 )), it is compact as
well. Thus T K ∈ K(X, Z).
Now if W1 is the unit ball of W, then
but R(W1 ) is bounded since R is, and so by Proposition 10.12, KR(W1 ) is compact.
Thus KR ∈ K(W, Y).
2
10.18. Proposition. Let X and Y be Banach spaces. Then F(X, Y) ⊆ K(X, Y).
Proof. Suppose F ∈ F(X, Y). Then F X1 is closed and bounded in ran F , but ran F
is finite dimensional in Y, as F is finite rank. Thus F X1 is compact in ran F , and
thus compact in Y as well, showing that F is compact.
2
10.19. Proposition. Let X be a Banach space. Then K(X) = B(X) if and only
if X is finite dimensional.
Proof. If dim X < ∞, then B(X) = F(X) ⊆ K(X) ⊆ B(X), and equality follows.
If K(X) = B(X), then I ∈ K(X), so I(X1 ) = X1 is compact. In particular, X is
finite dimensional.
2
10.20. Theorem. Let X and Y be Banach spaces and suppose K ∈ K(X, Y).
Then K ∗ ∈ K(Y∗ , X∗ ).
Proof. Let ε > 0. Then K(X1 ) is totally bounded, so we can find x1 , x2 , . . . , xn ∈ X1
such that if x ∈ X1 , then kKx − Kxi k < ε/3 for some 1 ≤ i ≤ n. Let
R : Y∗ → (Cn , k · k∞ )
y ∗ 7→ (y ∗ (K(x1 )), y ∗ (K(x2 )), . . . , y ∗ (K(xn ))).
Then R ∈ F(Y∗ , Cn ) ⊆ K(Y∗ , Cn ), and so R(Y∗1 ) is totally bounded. Thus we can
find y1∗ , y2∗ , . . . , ym
∗ ∈ Y∗ such that if y ∗ ∈ Y∗ , then kRy ∗ − Ry ∗ k < ε/3 for some
1 1 j
1 ≤ j ≤ m. Now
kRy ∗ − Ryj∗ k = max |y ∗ (K(xi )) − yj∗ (K(xi ))|
1≤i≤n
= max |K ∗ (y ∗ )(xi ) − K ∗ (yj∗ )(xi )|.
1≤i≤n
10.21. The set of compact operators acting on a Hilbert space is more tractable
in general than the set of compact operators acting on an arbitrary Banach space.
One of the reasons for this is the characterization given below.
10. OPERATOR THEORY 149
10.22. Theorem. Let H be a Hilbert space and let K ∈ B(H). The following
are equivalent:
(i) K is compact;
(ii) K ∗ is compact;
(iii) There exists a sequence {Fn }∞n=1 ⊆ F(H) such that K = limn→∞ Fn .
Proof.
(i) ⇒ (iii) Let B1 denote the unit ball of H, and let > 0. Since K(B1 ) is
compact, it must be separable (i.e. it is totally bounded). Thus M = ran K
is a separable subspace of H, and thus possesses an orthonormal basis
{en }∞
n=1 .
Let Pn denote the orthogonal projection of H onto span {ek }nk=1 . Set
Fn = Pn K, noting that each Fn is finite rank. We now show that K =
limn→∞ Fn .
Let x ∈ H and consider y = Kx ∈ M, so that limn→∞ kPn y − yk = 0.
Thus limn→∞ kFn x − Kxk = limn→∞ kPn y − yk = 0. Since K is com-
pact, K(B1 ) is totally bounded, so we can choose {xk }m k=1 ⊆ B1 such that
m
K(B1 ) ⊆ ∪k=1 B(Kxk , /3), where given z ∈ H and δ > 0, B(z, δ) = {w ∈
H : kw − zk < δ}.
If kxk ≤ 1, choose i such that kKxi − Kxk < /3. Then for any n > 0,
kKx − Fn xk
≤ kKx − Kxi k + kKxi − Fn xi k + kFn xi − Fn xk
< /3 + kKxi − Fn xi k + kPn k kKxi − Kxk
< 2/3 + kKxi − Fn xi k.
Choose N > 0 such that kKxi − Fn xi k < /3, 1 ≤ i ≤ m for all n > N .
Then kKx − Fn xk ≤ 2/3 + /3 = . Thus kK − Fn k < 3 for all n > N .
Since > 0 was arbitrary, K = limn→∞ Fn .
(iii) ⇒ (ii) Suppose K = limn→∞ Fn , where Fn is finite rank for all n ≥ 1.
Note that Fn∗ is also finite rank (why?), and that kK ∗ − Fn∗ k = kK − Fn k
for all n ≥ 1, which clearly implies that K ∗ = limn→∞ Fn∗ , and hence that
K ∗ is compact.
(ii) ⇒(i) Since K compact implies K ∗ is compact from above, we deduce that
K ∗ compact implies (K ∗ )∗ = K is compact, completing the proof.
2
We can restate the above Theorem more succinctly by saying that K(H) is the
norm closure of the set of finite rank operators on H. This is an extraordinarily
useful result.
10.23. Remark. Contained in the above proof is the following interesting
observation. If K is a compact operator acting on a separable Hilbert space H, then
for any sequence {Pn }∞n=1 of finite rank projections tending strongly (i.e. pointwise)
to the identity, kK − Pn Kk tends to zero. By considering adjoints, we find that
kK − KPn k also tends to zero.
150 L.W. Marcoux Functional Analysis
Let > 0, and choose N > 0 such that n ≥ N implies kK − KPn k < /2 and
kK − Pn Kk < /2. Then for all n ≥ N we get
kK − Pn KPn k ≤ kK − KPn k + kKPn − Pn KPn k
≤ kK − KPn k + kK − Pn Kk kPn k
< /2 + /2 = .
It follows that if H has an orthonormal basis indexed by the natural numbers,
say {en }∞n=1 , then the matrix for K with respect to this basis comes within of
the matrix for PN KPN . In other words, K “virtually lives” on the “top left-hand
corner”.
Alternatively, if H has an orthonormal basis indexed by the integers, say {fn }n∈Z ,
and we let Pn denote the orthogonal projection onto span {ek }nk=−n , then the matrix
for K with respect to this basis can be arbitrarily well estimated by a sufficiently
large but finite “central block”.
10.25. Example. Let H = L2 ([0, 1], dx), and consider the function k(x, t) ∈
L2 ([0, 1]×[0, 1], dm),
where dm represents Lebesgue planar measure. Then we define
a Volterra operator
V : L2 ([0, 1], dx) → L2 ([0, 1], dx)
R1
(V f )(x) = 0 f (t) k(x, t) dt.
(The classical Volterra operator has k(x, t) = 1 if x ≥ t, and k(x, t) = 0 if x < t.)
Now for f ∈ L2 ([0, 1], dx) we have
Z 1
kV f k2 = |V f (x)|2 dx
0
Z 1 Z 1 2
= f (t) k(x, t)dt dx
0 0
Z 1 Z 1 2
≤ |f (t) k(x, t)|dt dx
0 0
Z 1 Z 1
≤ kf k22 |k(x, t)|2 dtdx by the Cauchy-Schwartz Inequality
0 0
= kf k22 kkk22 ,
so that kV k ≤ kkk2 .
Let A denote thePalgebra of continuous functions on [0, 1] × [0, 1] which can be
resolved as g(x, t) = ni=1 ui (x) wi (t). Then A is an algebra which separates points,
contains the constant functions, and is closed under complex conjugation. By the
10. OPERATOR THEORY 151
Stone-Weierstraß Theorem, given > 0 and h ∈ C([0, 1] × [0, 1]), there exists g ∈ A
such that kh − gk2 ≤ kh − gk∞ < . But since C([0, 1] × [0, 1]) is dense (in the
L2 -topology) in L2 ([0, 1] × [0, 1], dm), A must also be dense (in the L2 -topology) in
L2 ([0, 1] × [0, 1], dm).
Let > 0. For k as above, choose g ∈ A such that kk − gk2 < . Define
V0 : L2 ([0, 1], dx) → L2 ([0, 1], dx)
R1
V0 f (x) = 0 f (t) g(x, t)dt.
so that V0 f ∈ M.
Thus V can be approximated arbitrarily well by elements of the form V0 ∈
F(L2 ([0, 1], dx), and so V is compact.
showing that N ∗ x ∈ ker (p(N, N ∗ ) − αI), i.e that ker (p(N, N ∗ ) − αI) is
invariant for N ∗ .
By Exercise 10.27, ker (p(N, N ∗ ) − αI) is reducing for N .
(c) Suppose that α 6= β ∈ C. Let x ∈ ker (N − αI) and y ∈ ker (N − βI). Then
αhx, yi = hN x, yi = hx, N ∗ yi = hx, βyi = βhx, yi.
Since α 6= β, we must have x ⊥ y.
2
It follows that
N ∗ N = lim Fn∗ Fn .
n
154 L.W. Marcoux Functional Analysis
An 0
But relative to H = Hn ⊕ Hn⊥ , we may write Fn = for some An ∈ B(Cn ),
0 0
whence
A∗n An 0
Fn∗ Fn = .
0 0
Now A∗n An is a positive semidefinite matrix, and therefore it is diagonalizable. More-
over, limn kFn k = kN k = 1. Thus
lim kA∗n An k = lim kAn k2 = lim kFn k2 = kN k2 = 1.
n n n
10.31. Let (X, k · k) be a complex Banach space and T ∈ B(X). Recall that the
spectrum of T is the set
σ(T ) = {α ∈ K : (T − αI) is not invertible in B(X)}.
We do not yet have the machinery to be able to prove that σ(T ) is a non-empty,
compact set contained in {z ∈ C : |z| ≤ kT k}. (The fact that the underlying field
for X is C is required to prove that σ(T ) 6= ∅)
We denote the set of eigenvalues of T by σp (T ) and refer to this as the point
spectrum of T . In general, σp (T ) may be empty or non-empty. Clearly σp (T ) ⊆
σ(T ).
10. OPERATOR THEORY 155
the series), or we may as well assume that Ω = N, in which case the series may be
written
X∞
N= αn Pn .
n=1
That n 6= m ∈ Ω implies that Pn Pm = 0 is the statement that ker (N − αn I) ⊥
ker (N − αm I) if αn 6= αm , which we proved in Proposition 10.29.
Recall from Proposition 10.29 that each Mn is reducing for N and that by
Theorem 10.32, H = ⊕n∈Ω Mn . Thus
N = ⊕n∈Ω Nn ,
where Nn is the compression of N to the reducing subspace Mn . But for x ∈ Mn ,
(N − αn Pn )x = (N − αn I)x = 0. Thus
N = ⊕n∈Ω αn Pn .
P
In the case where Ω is finite, this yields that N = n∈Ω αn Pn , as there are no
convergence issues.
Suppose that Ω is infinite.
Case One. 0 6∈ σp (N ).
Then dim Mn < ∞ for all n ∈ Ω, and H = ⊕n∈Ω Mn implies that H is separable.
As we saw above, we may (and do) assume that Ω = N.
(n) (n) (n)
Choosing an orthonormal basis Bn = {e1 , e2 , ekn } for each Mn , n ≥ 1, we
find that B := ∪n=1 Bn is an onb for H and that relative to this basis we have
N = diag(α1 , α1 , . . . , α1 , α2 , α2 , . . . , α2 , α3 , α3 , . . .),
where αn is repeated exactly kn = dim Mn = rank Pn times.
Let QM := M
P
n=1 Pn , so that is a finite-rank projection for each M ≥ 1. Note
that (QM )∞
M =1 converges in the sot to I ∈ B(H). By Remark 10.23,
M
X X
N = lim QM N QM = lim αn Pn = αn Pn .
M →∞ M →∞
n=1 n
Case Two. 0 ∈ σp (N ).
Again, we may assume that Ω = N and that α1 = 0. Since M0 = ker N is
reducing for N , we may write
N1 0
N=
0 N0
relative to the decomposition H = M1 ⊕ M⊥ 0
1 . Note that N is compact, normal and
0
0 6∈ σp (N ). By Case One,
∞
X
N0 = αn Pn .
n=2
10. OPERATOR THEORY 157
But then
∞
X ∞
X
N = 0P1 + αn Pn = αn Pn .
n=2 n=1
2
The proof of this result is beyond (but not much beyond) the scope of this
course. (We simply didn’t have the time for it.) Nevertheless, two things are worth
pointing out.
Firstly, the above limit exists!! This is anything but obvious, and is interesting
in its own right.
Secondly, since kan k ≤ kakn for all n ≥ 1, we immediately see that
spr(a) ≤ kak.
This shows that σ(a) is always bounded. Another non-trivial fact is that σ(a) 6= 0.
This relies on a Banach-space version of Liouville’s Theorem.
Using Beurling’s Spectral Radius Formula, we may easily obtain:
= sup kN xk2
kxk=1
= kN k2 .
10. OPERATOR THEORY 159
n n
By induction, kN 2 k ≥ kN k2 for all n ≥ 1. The reverse inequality follows
immediately from the submultiplicativity of the norm in a Banach algebra. Thus
n n
kN 2 k = kN k2 for all n ≥ 1. By Beurling’s Spectral Radius Formula, Theo-
rem 10.36,
n n
spr (N ) = lim kN 2 k1/2 = kN k.
n→∞
2
This is the standard proof of this result. The proof of Proposition 10.30 given
in the notes above is original, and clearly was only devised to circumvent the fact
that we do not have Beurling’s Spectral Radius Formula.
10.39. Corollary. Let A be a unital Banach algebra, and let a ∈ A. Then ρ(a)
is open and σ(a) is compact.
Proof. Clearly ρ(a) = {λ ∈ C : (a − λ1) is invertible } is open, since A−1 is.
Indeed, if a − λ0 1 is invertible in A, then λ ∈ ρ(a) for all λ ∈ C such that |λ − λ0 | <
k(a − λ0 )−1 k−1 . Thus σ(a) is closed.
If |λ| > kak, then λ1 − a = λ (1 − λ−1 a) and kλ−1 ak < 1, and so (1 − λ−1 a) is
invertible. This implies
(λ1 − a)−1 = λ−1 (1 − λ−1 a)−1 .
Thus σ(a) is contained in the disk Dkak ({0}) of radius kak centred at the origin.
Since it both closed and bounded, σ(a) is compact.
2
10.42. Definition. Let A be a unital Banach algebra and let a ∈ A. The map
R(·, a) : ρ(a) → A
λ 7→ (λ1 − a)−1
1 1 (µ − t) − (λ − t) (µ − λ)
− = = .
λ−t µ−t (λ − t) (µ − t) (λ − t) (µ − t)
Noting that R(λ, a) and R(µ, a) clearly commute, we obtain the desired equation
by simply subtracting the second equation from the first.
2
since inversion is continuous on ρ(a). Thus the limit of the Newton quotient exists,
and so R(·, a) is analytic.
2
10. OPERATOR THEORY 161
since R(·, a) is a continuous function on this compact set. The conclusion is that
R(·, a) is a bounded, entire function. By Theorem 10.41, the resolvent function must
be constant. This obvious contradiction implies that σ(a) is non-empty.
2
Recall that a division algebra is an algebra in which each non-zero element is
invertible.
10.46. Theorem. [Gelfand-Mazur] If A is a Banach algebra and a division
algebra, then there is a unique isometric isomorphism of A onto C.
Proof. If b ∈ A, then σ(b) is non-empty by Corollary 10.45. Let β ∈ σ(b). Then
β1 − b is not invertible, and since A is a division algebra, we conclude that β1 = b;
that is to say, that σ(b) is a singleton.
Given a ∈ A, σ(a) is a singleton, say {λa }. The complex-valued map φ : a 7→ λa
is an algebra isomorphism. Moreover, kak = kλa 1k = |λa | = kφ(a)k, so the map is
isometric as well.
If φ0 : A → C were another such map, then φ0 (a) ∈ σ(a), implying that φ0 (a) =
φ(a).
2
Proof. First observe that if A is not unital, then we can always embed it isomet-
rically into a unital Banach algebra A+ . Since both the left and right hand sides
of the above equation remain unchanged when a is considered as an element of A+ ,
we may (and do) assume that A is already unital.
Now σ(an ) = (σ(a))n , and so spr(an ) = (spr(a))n . Moreover, for all b ∈ A, the
proof of Corollary 10.39 shows that spr(b) ≤ kbk. Thus
spr(a) = (spr(an ))1/n ≤ kan k1/n for all n ≥ 1.
This tells us that spr(a) ≤ inf n≥1 kan k1/n .
On the other hand, R(·, a) is analytic on ρ(a) and hence is analytic on {λ ∈ C :
|λ| > spr(a)}. Furthermore, if |λ| > kak, then
R(λ, a) = (λ − a)−1
= λ−1 (1 − λ−1 a)−1
X∞
= an /λn+1 .
n=0
Let φ ∈ A∗ . Then φ ◦ R(·, a) is an analytic, complex-valued function,
∞
X
[φ ◦ R(·, a)](λ) = φ(an )/λn+1
n=0
and this Laurent expansion is still valid for {λ ∈ C : |λ| > kak}, since the series
for R(·, a) is absolutely convergent on this set, and applying φ introduces at most
10. OPERATOR THEORY 163
a factor of kφk to the absolutely convergent sum. Since [φ ◦ R(·, a)] is analytic on
{λ ∈ C : |λ| > spr (a)}, the complex-valued series converges on this larger set.
From this it follows that the sequence {φ(an )/λn+1 }∞ n=1 converges to 0 as n
tends to infinity for all φ ∈ A , so therefore is bounded for all φ ∈ A∗ . It is now a
∗
consequence of the Uniform Boundedness Principle that {an /λn+1 }∞ n=1 is bounded
in norm, say by Mλ > 0, for each λ ∈ C satisfying |λ| > spr(a). That is:
kan k ≤ Mλ |λn+1 |
for all |λ| > spr(a). But then, for all |λ| > spr(a),
1/n
lim sup kan k1/n ≤ lim sup Mλ |λn+1/n | = |λ|.
n≥1 n≥1
Combining this estimate with the above yields spr(a) = limn→∞ kan k1/n .
2
Question 1. Let X and Y be Banach spaces, and let S, T ∈ B(X, Y). Prove that
for all k1 , k2 ∈ K,
(k1 S + k2 T )∗ = k1 S ∗ + k2 T ∗ .
Here, the ∗ -map refers to the Banach space adjoint.
Question 2.
Let H be a complex Hilbert space and M be a closed subspace of H. Let P be
the orthogonal projection of H onto M, and T ∈ B(H).
(a) Prove that relative to the decomposition H = M ⊕ M⊥ , we have
I 0
P = .
0 0
(b) Prove that P = P 2 = P ∗ .
T1 T2
(c) More generally, write T = relative to the decomposition H =
T3 T4
M ⊕ M⊥ . Prove that
∗
T1 T3∗
∗
T = .
T2∗ T4∗
(d) Prove that M is invariant for T if and only if (I − P )T P = 0, and M is
reducing for P if and only if T P = P T . Conclude that M is reducing for
T if and only if both M and M⊥ are invariant for T .
Question 3.
This problem is more challenging than the others, and should rightfully be an
assignment question.
Let H be a complex, separable Hilbert space and suppose that K ∈ K(H). Note
that K ∗ K is compact and selfadjoint, hence normal. By the Spectral Theorem for
Normal operators (see Corollary 10.34), there exists an onb {en }∞n=1 for H such
that
K ∗ K = diag (d1 , d2 , d3 , . . .).
(a) Prove that dn ≥ 0 for all n ≥ 1.
We denote by |K| the diagonal operator (relative to this onb)
|K| = diag (s1 , s2 , s3 , . . .),
1
where sn = (dn ) , n ≥ 1. Note that in the same way that for complex numbers we
2
Let {fn }n denote a second onb for H. Recall that for x, y ∈ H, we defined the
rank-one operator x ⊗ y ∗ ∈ B(H) via x ⊗ y ∗ (z) = hz, yix, z ∈ H.
(b) Prove that n sn fn ⊗ e∗n converges in norm in B(H).
P
(c) Prove that if K ∈ K(H) has singular numbers (sn )n , then
sn = dist(K, Fn−1 ),
where Fn−1 = {F ∈ B(H) : rank F ≤ n − 1}. That is,
sn = inf {kK − F k : rank F ≤ n − 1}.
166 L.W. Marcoux Functional Analysis
11.3. Example.
(i) Let X be any set. Then τ = {∅, X} is a topology on X, called the trivial
topology on X.
(ii) At the other extreme of the topological spectrum, if X is any non-empty
set, then τ = P(X), the power set of X, is a topology on X, called the
discrete topology on X.
(iii) Let X = {a, b}, and set τ = {∅, {a}, {a, b}}. Then τ is a topology on X.
(iv) Let (X, d) be a metric space. Let
τ = {G ⊆ X : for all g ∈ G there exists δ > 0 such that
bδ (g) := {y ∈ X : d(x, y) < δ} ⊆ G}.
Then τ is a topology, called the metric topology on X induced by d.
This is the usual topology one thinks of when dealing with metric spaces,
but as we shall see, there can be many more.
11. APPENDIX – TOPOLOGICAL BACKGROUND 167
The following result from [Wil70] illustrates the importance of nbhd systems.
11.5. Theorem. Let (X, τ ) be a topological space, and x ∈ X. Then:
(a) If U ∈ Ux , then x ∈ U .
(b) If U, V ∈ Ux , then U ∩ V ∈ Ux .
(c) If U ∈ Ux , there exists V ∈ Ux such that U ∈ Uy for each y ∈ V .
(d) If U ∈ Ux and U ⊆ V , then V ∈ Ux .
(e) G ⊆ X is open if and only if G contains a nbhd of each of its points.
Conversely, if in a set X a non-empty collections Ux of subsets of X is assigned
to each x ∈ X so as to satisfy conditions (a) through (d), and if we use (e) to define
the notion of an open set, the result is a topology on X in which the nbhd system at
x is precisely Ux .
Because of this, it is clear that if we know the nbhd system of each point in X,
then we know the topology of X.
There are a number of natural separation axioms that a topological space might
satisfy.
11.8. Theorem. Let (X, d) be a metric space. Then X, equipped with the
metric topology, is T4 .
The importance of neighbourhood bases is that all open sets can be constructed
from them, as we shall soon see.
11.12. Example. Consider (X, d) be a metric space equipped with the metric
topology τ . For each x ∈ X, fix a sequence {rn (x)}∞n=1 of positive real numbers
such that limn→∞ rn (x) = 0 and consider Bx = {Vrn (x) : n ≥ 1}. Then Bx is a nbhd
base at x for each x ∈ X.
11. APPENDIX – TOPOLOGICAL BACKGROUND 169
11.13. Definition. Let (X, τ ) be a topological space. A base for the topology is
a collection B ⊆ τ so that for every G ∈ τ there exists C ⊆ B so that G = ∪{B : B ∈
C}. That is, every open set is a union of elements of B. Note that if C is empty,
then ∪{B : B ∈ C} is also empty, so we do not need to include the empty set in our
base. A subbase for the topology is a collection S ⊆ τ such that the collection B of
all finite intersections of elements of S forms a base for τ .
11.17. Definition. Let (X, τ ) be a topological space. The net (xλ )λ is said to
converge to x ∈ X if for every U ∈ Ux there exists λ0 ∈ Λ so that λ ≥ λ0 implies
xλ ∈ U .
We write limλ xλ = x, or limλ∈Λ xλ = x.
This mimics the definition of convergence of a sequence in a metric space.
170 L.W. Marcoux Functional Analysis
11.18. Example.
(a) Since N is a directed set under the usual order ≤, every sequence is a net.
Any subsequence of a sequence is also a subnet. The converse to this is
false, however. A subnet of a sequence need not be a subsequence, since
its domain need not be N (or any countable set, for that matter).
(b) Let A be a non-empty set and Λ denote the power set of all subsets of A,
partially ordered with respect to inclusion. Then Λ is a directed set, and
any function from Λ to R is a net in R.
(c) Let P denote the set of all finite partitions of [0, 1], partially ordered by
inclusion (i.e. refinement). Let f be a continuous function on [0, 1]; then
to P = {0 P= t0 < t1 < · · · < tn = 1} ∈ P, we associate the quantity
LP (f ) = ni=1 f (ti−1 )(ti − ti−1 ). The map P 7→ LP (f ) is a net (P is a
R1
directed set), and from Calculus, limP ∈P LP (f ) = 0 f (x)dx.
11.20. Definition.
Let (X, τX ) and (Y, τY ) be topological spaces. We say that a function f : X → Y
is continuous if f −1 (G) is open in X for all G ∈ τY .
That this extends our usual notion of continuity for functions between metric
space is made clear by the following result:
11.21. Proposition. If (X, dX ) and (Y, dY ) are metric spaces with metric space
topologies τX and τY respectively, then the following are equivalent for a function
f :X →Y:
(a) f is continuous on X, i.e. f −1 (G) ∈ τX for all G ∈ τY .
(b) limn f (xn ) = f (x) whenever (xn )∞
n=1 is a sequence in X converging to
x ∈ X.
As we shall see in the Assignments, sequences are not enough to describe conver-
gence, nor are they enough to characterize continuity of functions between general
topological spaces. On the other hand, nets are sufficient for this task, and serve
as the natural replacement for sequences. (The following result also admits a local
version, which we shall also see in the Assignments.)
11. APPENDIX – TOPOLOGICAL BACKGROUND 171
The main and most important result concerning weak topologies induced by a
family of functions is the following:
11.24. Proposition.
(a) If τ is a topology on X and if fγ : (X, τ ) → (Yγ , τγ ) is continuous for all
γ ∈ Γ, then σ(X, F) ⊆ τ . In other words, σ(X, F) is the weakest topology
on X under which each fγ is continuous.
(b) Let (Z, τZ ) be a topological space. Then g : (Z, τZ ) → (X, σ(X, F)) is
continuous if and only if fγ ◦ g : Z → Yγ is continuous for all γ ∈ Γ.
11.25. Definition. Let {(Xα , τα )}α∈Λ be a collection of topological spaces. The
Cartesian product of the sets Xα is
Πα∈Λ Xα = {x : Λ → ∪α Xα | x(α) ∈ Xα for each α ∈ Λ}.
As with sequences, we write (xα )α for x.
From this it follows that {πα−1 (Uα ) : Uα ∈ Bα , α ∈ Λ} is a subbase for the product
topology, where Bα is a fixed base (or indeed even a subbase will do) for the topology
on Xα .
It is perhaps worth pointing out that it follows from the Axiom of Choice that
if for all α ∈ Λ we have Xα 6= ∅, then X 6= ∅.
We leave it to the reader to verify that the product topology on Rn = Πnk=1 R is
just the usual topology on Rn .
Bibliography
[Dav88] K.R. Davidson. Nest algebras. Triangular forms for operators on a Hilbert space, volume
191 of Pitman Research Notes in Mathematics. Longman Scientific and Technical, Harlow,
1988.
[KR83] R.V. Kadison and J.R. Ringrose. Fundamentals of the theory of operator algebras I: Ele-
mentary theory. Academic Press, New York, 1983.
[Lin71] J. Lindenstrauss. The geometric theory of the classical banach spaces. Actes du Congrès
Intern. Math., 1970, Paris, 2:365–372, 1971.
[Sch27] J. Schauder. Zur Theorie stetiger Abbildungen in Funktionalraumen. Math. Z., 26:47–65,
1927.
[Sou78] A.R. Sourour. Operators with absolutely bounded matrices. Math. Z., 162:183–187, 1978.
[Sun78] V.S. Sunder. Absolutely bounded matrices. Indiana Univ. Math. J., 27:919–927, 1978.
[Tsi74] B.S. Tsirel’son. Not every Banach space contains an imbedding of `p or c0 . Functional
Anal. Appl., 8:138–141, 1974.
[Whi66] R. Whitley. Projecting m onto c0 . The Amer. Math. Monthly, 73:285–286, 1966.
[Wil70] Stephen Willard. General Topology. Addison-Wesley Publishing Co., Reading, Mass.-
London-Don Mills, Ont., 1970.
173
Index
strong operator, 78
subbase for a, 169
trivial, 166
weak, 111, 171
weak operator, 79
weak∗ , 112
total variation, 14
total variation norm, 94
trace functional, 126
translation invariant metric, 82
trivial topology, 166
Tsirel’son space, 14, 15
Tsirel’son, B.S., 14
TVS, 54
Twain, Mark, 110
Tychonoff’s Theorem, 117
unconditionally summable, 44
uniform boundedness principle, 113, 114
uniform continuity on a subset, 63
uniform norm, 3
uniform space, 83
uniformity, 62
uniformly continuous function, 62
unilateral backward weighted shift operator,
24
unilateral forward weighted shift operator,
24
unitary operators, 46
variation, 16, 29
bounded, 29
total, 14
vector subspace, 2
Volterra operator, 30, 49
Youngman, Henny, 67