Quantum Physics Lecture Notes
Quantum Physics Lecture Notes
Quantum Physics
Script to the undergraduate lecture in spring 2022
Script to the graduate lecture in spring 2025
Maurits W. Haverkort
©(2020 / 2025) Maurits W. Haverkort All rights reserved
2
Contents
Preface 11
3
Contents
4 Wave mechanics 83
4.1 Time evolution of a delta function . . . . . . . . . . . . . . 83
4.2 Wave packets . . . . . . . . . . . . . . . . . . . . . . . . . . 90
4.2.1 The wave function in position and momentum space 91
4.2.2 Gaussian wave packet with finite momentum . . . . 92
4.2.3 Group and phase velocity . . . . . . . . . . . . . . . 94
4.2.4 Moment expansion in position and momentum space 97
4.2.5 Heisenberg’s uncertainty relation . . . . . . . . . . . 103
4
Contents
5
Contents
6
Contents
7
Contents
8
Contents
9
Contents
10
Preface
This script is written for the lecture Quantum Physics (Theoretical Physics
4) taught in the Summer semester 2020 and 2022 at Heidelberg University.
There are many good textbooks available on the theory of Quantum
Physics [1, 2, 3, 4, 5] and this script is not meant to replace any of those.
Based on these textbooks, and several scripts of lectures taught in different
years, I have created a set of lectures to introduce the theory of Quantum
Physics. Reading these notes should be accompanied by visiting the lectures
and by solving problems. I can not stress enough how important it is to
practice by solving problems yourselves. Ten years from now it will not
matter much if you can remember an equation. It will matter a lot if you
can solve a practical problem encountered in your profession or research.
You only learn this by practicing a lot.
We start our lecture with a few experimental examples showing the
limits of classical physics. At the end of the 19th century there were multiple
experiments showing the limits of classical physics. Understanding these
experiments motivates the need to introduce the quantization of energy,
the correspondence between particles and waves, and the need of a linear
equation to describe the quantum state of matter. This script introduces
several of these experiments. In the lecture we will only go into depth on
one or two examples as I assume that you are familiar with most of them.
In the next two chapters I will introduce the Schrödinger equation. Just
as Newton did not derive his laws of mechanics, Schrödinger by no means
derived his equation. We will give a reasonable background (although not
historically correct) on how one might come to the idea to propose this
11
Contents
12
Contents
13
Contents
14
Chapter 1
15
Chapter 1. The limits of classical physics
tury, science and physics had seen many years of steady progress. People
understood many physical phenomena and had equations with predictive
power. After many years of progress in the understanding of the natural
world many people had great confidence in the laws of physics. Based on
empirical findings the classical laws of mechanics, electro magnetism and
thermodynamics were formed. A few equations describe the motion of small
objects on earth, the movements of the planets, electro magnetic phenom-
ena, the transport of heat or the expansion of a gas when heated. These
findings were not only philosophical in nature. Being able to predict the po-
sition of the planets and their moons allowed one to better navigate at sea.
Newton’s laws had a profound influence on the development of structural
engineering. The understanding of thermodynamics allowed one to opti-
mize steam engines. The understanding of electro-magnetism contributed
to the development of electrical lighting. The improved understanding of
the natural sciences was a key ingredient to the industrial revolution and
enormously changed our way of life.
For many people in the late 19th century it was therefore hard to be-
lieve that anything could be wrong with classical physics, condensed in the
laws by Newton, Maxwell and thermodynamics. Some even went so-far as
to claim that physics was finished and one only needs to fill in the details.
It might have been shocking for many people to find more and more exper-
iments that seemed to indicate that something is wrong with the classical
picture. Many of these experiments are not related to our daily observation.
This might be a reason why these contradictions of classical physics have
gone unnoticed for so long. Classically, motion is continuous and particles
can have any velocity, mass or energy as desired. The experiments listed in
this chapter challenged that picture in such fundamental ways that people
started to look at alternative theories.
During the first quarter of the 20th century, several theories of Quantum
physics emerged. When Max Planck presented his Nobel lecture in 1920, he
did this with the title ”The genesis and present state of development of the
quantum theory” [16]. He was less modest in the translation of his original
speech, published in 1922 with the title ”The origin and development of the
quantum theory” [17]. Max Planck speaks about the quantum theory four
16
years before Schrödinger published his version of quantum theory we use to-
day. There were several theories of quantum physics produced between 1850
and 1926 [18]. Many contain some truth. A partial breakthrough came with
the matrix mechanics introduced by Heisenberg in 1925 [13]. Heisenberg
builds on previous experimental findings and theoretical considerations. He
specifically focused on the photon absorption spectra of atoms in magnetic
(Zeeman effect) and electric (Stark effect) fields. Based on these experi-
ments, although he never mentioned it in these words, Heisenberg basically
came to the conclusion that quantum states must be described by a com-
plex vector in a Hilbert space, following all rules of linear algebra. This
theory was rapidly developed further by Born, Heisenberg, and Jordan in
Göttingen [14, 15].
In his first publication, especially in the last paragraph, Heisenberg is
rather careful if his theory is sufficiently well developed to be the final the-
ory of quantum mechanics. As we now know it is not. Within a year,
Schrödinger proposed a differential equation to describe quantum mechan-
ics. Having a differential equation that determines the motion of particles
placed the understanding of quantum mechanics on the same footing as clas-
sical mechanics or electro dynamics. With Schrödingers equation, quantum
physics quickly became a well accepted theory. As Schrödingers differential
equation is linear, we can describe its solutions as complex vectors on a
Hilbert space that follow all rules of linear algebra. Schrödinger’s equa-
tion is thus in accordance with Heisenberg’s theory, but now also provides
explicit representations of the matrices and vectors present in Heisenbergs
equations. A dispute between Schrödinger and Heisenberg on how to inter-
pret the equations of quantum mechanics and which physical picture they
might represent remained till the death of Schrödinger.
In this chapter, I will provide a few examples of the experiments that
lead to the introduction of quantum physics. They are all beautiful by
themselves. Understanding these experiments in detail helps to develop
an intuitive feeling of quantum mechanics. We all have seen a ball fall as
described by the laws of classical mechanics. We have less experience with
observing single electrons scattering off the surface of a solid. Not having
that experience makes quantum mechanics less intuitive. In the end, the
17
Chapter 1. The limits of classical physics
18
1.1. Black body radiation
quency of the emitted and absorbed light. Kirchhoff derived this relation
by considering two of such bodies in thermal equilibrium in vacuum. If one
of them would absorb more than it radiates, its temperature would rise, si-
multaneously cooling the other object. This would violate the observations
of thermodynamics, from which we know that all objects strive towards
thermal equilibrium.
Using geometrical arguments, one can relate the amount of radiation
passing through a small opening of a cavity to the energy density in the
cavity.
4
u(λ, T ) = E(λ, T ) (1.1)
c
with E(λ, T ) the emissive power defined as the energy emitted at wave-
length λ per unit area, per unit time and u(λ, T ) the energy density inside
the cavity per unit of wavelength. Instead of working with the energy den-
sity as a function of wavelength, one can work with the energy density per
unit of frequency as a function of frequency. With λ = c/ν we have
dλ
u(ν, T ) = u(λ, T ) (1.2)
dν
c
= u(λ, T ).
ν2
At the end of the 19th century, several experiments were done to quan-
tify the radiation of a black body. Theoretical considerations were only
partially successful at explaining these observations.
Stefan observed that the total energy radiated by a black body and
thus the total energy density inside a cavity scales with T 4 [20]. Soon after
Stefan’s publication Boltzmann [21] explained this behaviour. He assumed
the second law of thermodynamics and the relation between the photon
pressure and energy inside the cavity as deduced by Maxwell. We thus find
that
Z ∞
4
u(ν, T )dν = σT 4 , (1.3)
0 c
19
Chapter 1. The limits of classical physics
2k4 π 5
with σ = 15cB2 h3 the Stefan-Boltzmann constant. The relation between the
Stefan-Boltzmann constant and the Boltzmann constant kB , the Planck
constant h, and the speed of light c was only derived later.
Wien considered what would
happen if the walls of a cavity con- 40 40
Planck
30 30 Wien
u(ν,T)
20 20
with α and β fit parameters to the experiment. For large frequencies (ν) or
small wave lengths (λ) this approximates the experimental black body radi-
ation well. For small frequencies or large wave lengths there is a substantial
deviation between the theory of Wien and the experimental data.
Rayleigh and Jeans [23, 24, 25] derived the black body radiation as a
function of frequency using very general arguments of statistical physics,
classical mechanics and classical electro dynamics. They found, as we will
discuss in more detail below
8π kB T
u(ν, T ) = ν 3 . (1.6)
c3 ν
20
1.1. Black body radiation
For low frequencies or large wavelengths, this equation agrees well with
experiment. For large frequencies or short wavelengths it does not agree
with experiment. Another major problem with Rayleigh-Jeans’ formula
is that the total energy density and thus the total emitted black body
radiation diverges
Z ∞
8π kB T
ν3 3 dν = ∞, (1.7)
0 c ν
not in agreement with the Stefan-Boltzmann equation. This is a major
problem as there is nothing wrong with the argumentation of Rayleigh and
Jeans. If one assumes the laws of classical mechanics, electrodynamics, and
statistical physics, the Rayleigh Jeans formula for black body radiation is
the result.
Max Planck published in 1900 a set of papers resolving the discrepancy
between the two theoretical models for black body radiation [26, 27, 28, 29,
30]. Planck’s work does not directly cite the theoretical work of Rayleigh
or Wien, but looks at the experimental data, which can be fitted to a
function of the form derived by Rayleigh at low frequencies of the light
and to function of the form derived by Wien at high frequency of the light.
In order to build a logical theory that can be derived from simple physical
principles, Planck had to assume that energy is quantized. For photons with
frequency ν the total energy of a ray can only occur in integer multiples of
hν. The physical consequences of this quantization of energy only became
clear during the next years and the work of Albert Einstein. Max Planck did
not start with the assumption that energy needs to be quantized. There is
no logical ground for such an assumption in classical physics. The first thing
Planck did is to interpolate the functions of Wien and Rayleigh. Wien’s
function for the energy density inside a cavity worked for high frequencies,
Rayleigh’s function for low frequencies. Planck found the following function
8πhν 3 1
u(ν, T ) = , (1.8)
c3 e khν
BT − 1
with h and kB the Planck and Boltzmann constant that needed to be fit
21
Chapter 1. The limits of classical physics
22
1.1. Black body radiation
nλ = 2L, (1.9)
~k = ( nx π , ny π , nz π ), (1.10)
L L L
with ni ∈ N+ .
In order to calculate the density
of modes, we consider the number
of modes with a wave vector smaller L ky
π
than kN . We can consider a sphere 8
in k space of radius kN and deter- 7
mine how many modes fit into this 6
sphere. The spacing between two 5
kN
modes is π/L in the x, y and z 4
direction. The volume per mode 3
is thus (π/L)3 . The volume of a 2
sphere with radius kN in k-space 1
is 4/3πkN 3 . The number of modes
1 2 3 4 5 6 7 8 L kx
with |~k| < kN is thus π
1 4 πk 3
N (kN ) = 2 3 N3 (1.11)
8 π Figure 1.4: Number of modes with
L
3
V kN wave vector smaller than kN
= .
3π 2
The factor of 1/8 comes from the
fact that we only consider modes
with positive wave-length. The factor of 2 comes from the fact that photons
23
Chapter 1. The limits of classical physics
dn(k) 1
= 2 k2 . (1.12)
dk π
ck
With ν = 2π we can express the density of modes per frequency as
dn(ν) dn(k) dk
= (1.13)
dν dk dν
1 2πν 2 2π
= 2
π c c
8πν 2
= 3 .
c
24
1.1. Black body radiation
dn(ν)
u(ν, T ) = hEi (1.14)
dν
8πν 2
= 3 kB T.
c
1 − kEnT
pn = e B , (1.15)
Z
25
Chapter 1. The limits of classical physics
n=0 e
B
hνx
hEi = (1.20)
1−x
26
1.2. The photoelectric effect
hν
= (1.21)
x−1 −1
hν
= hν . (1.22)
e kB T − 1
With this average energy per mode, the energy density inside the cavity
becomes
dn(ν)
u(ν, T ) = hEi (1.23)
dν
8πν 2 hν
= 3 hν ,
c e kB T − 1
such that we find for the emissive power (the energy emitted at frequency
ν per unit area per unit time) of a black body
2πν 2 hν
E(ν, T ) = 2 hν . (1.24)
c e kB T − 1
27
Chapter 1. The limits of classical physics
28
1.3. The third law of thermodynamics and Einsteins model of
the specific heat of solids
by the positive charge of the ion. In a solid we call the minimal energy
difference between a bound electron and a free electron with zero kinetic
energy the work function W . The maximal kinetic energy of a photon-
electron is thus
Ek = hν − W (1.25)
From this equation we can draw a few conclusions. If the frequency of the
light is such that the energy of each photon hν is smaller than the work
function W , there will be no photon current. The maximal kinetic energy
of the photo electrons is linearly proportional to the frequency of the light.
The slope of Ek as a function of ν is equal to Planck’s constant h. The
intensity of the photon current (number of electrons produced) does not
depend on the frequency (ν), but on the intensity of the light.
These predictions were confirmed in 1914 by Millikan [35]. In 1921,
Einstein got the Nobel prize for laying one of the most fundamental corner
stones of quantum physics. All objects, both matter and light can macro-
scopically be described by a continuous density. Microscopically, one can
not divide them in infinitely small parts but needs to consider their particle
nature.
Besides the discrepancy of black body radiation with the laws of classical
mechanics and statistical physics, also the third law of thermodynamics
conflicts with the notions of classical physics. In 1905 Nernst proposed,
based on experimental considerations, that the change in entropy of a sys-
tem (S) as a function of temperature (T ) is zero if cooled to zero Kelvin
29
Chapter 1. The limits of classical physics
[36] 1 .
dS
= 0. (1.26)
dT T =0
dQ = T dS. (1.28)
We thus find for the relation between the heat capacity and change of
entropy as a function of temperature
dS
CV = T . (1.29)
dT
From the third law of thermodynamics it follows that the heat capacity, at
constant volume of a system, must go to zero faster than T . Experimentally
the heat capacity indeed follows this rule.
In order to see why this conflicts with the theories of classical mechanics
and statistical physics we can use the results of the equipartition theorem
1
Nernst wrote a series of papers and talks in which he proposed this formula. In his
first papers he did not use the term entropy, but focused on possible useful work and
heat produced in different chemical reactions. Although historically not quite correct, we
here use the modern notation and definition of the third law of thermodynamics using
entropy.
30
1.3. The third law of thermodynamics ...
31
Chapter 1. The limits of classical physics
With 3 spatial directions and N particles, we find that the total kinetic
energy is equal to
D E 3
(N )
Ekin = N kB T. (1.33)
2
The heat capacity due to the kinetic energy of N particles becomes
3
C V = N kB . (1.34)
2
For ideal gasses at large enough temperatures, the heat capacity is in-
deed constant. In this regime the heat capacity can be used to determine
the Boltzmann constant. At low temperatures the heat capacity of all ob-
jects experimentally measured goes to zero. The experiment is in agreement
with the third law of thermodynamics. One might argue that in solids or
at low temperatures things are more complicated as atoms interact with
each other. One could try to add interaction terms to the Hamiltonian
to compensate for the heat capacity due to the kinetic energy. There is,
however, no easy way to do this, nor is it logical why interactions between
particles should have this feature for all systems encountered in nature.
Einstein, charmed by the idea of discretizing the available energy, pro-
posed that also for the specific heat of solids one should not allow all values
of the kinetic energy, but assume that the energy comes in multiples of
some basic quantity, E0 [37]. All forms of energy are quantized.
Once we quantized the energy, we can use the same derivation we used
for the derivation of Planck’s law to calculate the total energy of the system
as a function of temperature.
3 E0
hEi = E0 . (1.35)
2
e kB T
−1
d
CV = hEi (1.36)
dT
32
1.4. The Compton effect
2
3 E
E 0
= kB E .
2 0 − 2k 0T
kB T e 2kB T
−e B
For T E0 the heat capacity becomes equal to the classical limit of 3/2kB .
For temperatures smaller than E0 the heat capacity falls off to zero, which
is indeed observed experimentally.
Einstein assumed that each atom in a solid can oscillate independent of
its neighbours. This is not very realistic. A more appropriate consideration
of a solid and the treatment of its specific heat was given by De Bye in
1912 [38]. He starts his paper with a sentence stating that experimental
observations of the temperature dependence of the specific heat of solids
clearly show that one can not assume that energy is equally spread over
classical modes as one would obtain from classical physics. In order to
describe the specific heat, one needs to quantize the energy. The energy
unit this time not being hν as for photons, but a material specific constant
which we know as the De Bye temperature θD (or energy kB θD ).
33
Chapter 1. The limits of classical physics
radiation with very short wavelengths. When high energy electrons (several
10’s of keV) hit a solid, high energy photons or x-rays are produced. These
rays were discovered in 1895 by Röntgen [40]. Directly afterwards, x-rays
were intensively studied, as they were very useful for imaging objects that
are otherwise hidden. Bragg [41] had established that one can use x-rays to
do diffraction experiments from solid crystals. If one knows the wavelength
of the light this allows one to determine atomic distances and the atomic
structure in crystals. For crystals with known atomic distances one can use
Bragg scattering to determine the wave length of the x-rays. In 1923, the
x-ray wavelengths, produced by different materials when bombarded with
high energy electrons in x-ray tubes, were well characterized.
In order to calculate the amount
of light scattered from a particle,
one can solve Maxwell’s equations.
One assumes the particle to be a
dielectric. Electric charges within
the particle will start to oscillate
in the electric field of the incoming
radiation. These oscillating elec-
tric charges will in turn produce
an electric field which radiates the
outgoing scattered light. In 1923,
this theory was well known and for Figure 1.5: Figure from the original
scattering of photons in the optical paper of Compton [39] showing the
wave length these equations work scattering of a photon on an electron
really well. For the inelastic scat- by the angle θ (left) and the momen-
tering of x-rays in solids this the- tum transfer relations that need to be
ory seemed to fail. Compton starts fulfilled (right).
his paper with the sentence: ”J. J.
Thomson’s classical theory of the
scattering of X-rays, ..., has been found incapable of explaining many of
the more recent experiments” [39].
Knowing the results of Planck and Einstein relating electromagnetic
waves to particles with energy E = hν and momentum |~ p| = hν
c , Comp-
34
1.4. The Compton effect
ton used a different starting point to describe his inelastic scattering ex-
periments from Thomson. He assumed photons to be particles scattering
inelastic from electrons by the laws of classical physics of energy and mo-
mentum conservation. Given an incident photon with energy E0 = hν0
and momentum |~ p0 | = hνc 0 , one can scatter from an electron at rest by an
angle θ. The resulting energy and momentum of the photon areEθ = hνθ
hνθ 1
and |~
pθ | = c . For the electron we have the energy Ee = mc2 √
1−β 2
pe | = √mβc 2 , with β = ve /c. The momentum vectors for
and momentum |~
1−β
the particles involved are shown in figure 1.5. The equation of momentum
conservation is
!2
hν0 2 hνθ 2
mβc hν0 hνθ
p = + +2 cos(θ). (1.37)
1 − β2 c c c c
35
Chapter 1. The limits of classical physics
36
1.6. The Zeeman effect and the Stern-Gerlach Experiment
The examples in the previous sections showed the dual nature of light and
matter. Some experiments are described well by treating light and matter
as particles, other by treating light and matter as waves. The Zeeman ef-
fect and the Stern-Gerlach experiment show an other property of quantum
physics. In order to explain these experiments, it follows that quantum
states can be considered as superpositions of other quantum states. This
accounts for things like entanglement and is a reason that quantum com-
puting might become much more efficient than classical computing.
Zeeman observed that atomic absorption and emission line spectra in a
magnetic field broaden [46]. With better resolution, one can observe that
they split into multiple discrete lines [47]. Single lines where observed to
split either into double lines (doublets), triple lines (triplets), or higher mul-
tiplicities, depending on the material. In order to explain this phenomenon,
it was concluded that the magnetic state of an electron or atom consists of
a discrete set of possibilities. There are two internal micro states, repre-
senting the magnetic moment for a system that shows two lines (doublet),
three internal micro states representing the magnetic moment for a triplet
etc. With the use of these finite number of internal micro states, one can
generate a magnetic moment in any spacial direction possible. We can ex-
emplify this for a doublet, i.e. for example an electron with spin 12 in a
magnetic field. For a doublet, the energy difference of the two observed
lines show that the magnetic moments of these two states are either paral-
lel or anti-parallel to the applied magnetic field. This is true independent
of the direction of the magnetic field. The two states related to the two
separate lines in the spectra for a field in one direction thus must be linear
combinations of the two states that emerge for a field in an other direction.
In three papers published in 1921 to 1922 [48, 49, 50], Stern and Gerlach
tested the assumption on the quantization of magnetization into a set of
discrete states with a simple experiment. They were looking for an experi-
ment that does not depend on spectroscopy. Obtaining results from spectra
37
Chapter 1. The limits of classical physics
F~ = ∇(~ ~
µ · B), (1.41)
~ r) = B
If we take the field to be B(~ ~ 0 + zB1 ẑ, we find a force to be in the z
direction and of the form
∂
Fz = (zµz B1 ) = µz B1 . (1.42)
∂z
In the following, we will assume that we always take the field gradient to
be parallel to the direction of the field.
In the Stern Gerlach experiment, silver atoms are created by heating a
piece of metallic silver in an oven. Hot silver metal evaporates and produces
38
1.6. The Zeeman effect and the Stern-Gerlach Experiment
a silver vapour above the metal. Through a small opening in the oven the
silver vapour can move into the measurement chamber. As the measure-
ment chamber has been evacuated from air, the pressure difference allows
the silver atoms to come out in a beam-like fashion. This procedure of
creating a ray of silver atoms has no favoured orientation for the magnetic
moments of the silver atoms. The magnetic moment of the silver atoms
should be considered randomly orientated. If the atom traverses through
an inhomogeneous field as described above, one would expect that the sil-
ver atoms with the moment down (µz = −µ) would accelerate down-wards,
those with their moment up (µz = +µ) upwards and those with their mo-
ments perpendicular to the z direction (µz = 0) have their paths unchanged
by the magnetic field. In total, one would expect a semicircular distribution
centred around z = 0 representing the possible ways the magnetic moment
of the silver atoms could be orientated with respect to the field gradient.
The experimental result is different. In figure 1.6 we show the original
results of Stern and Gerlach. Without a magnetic field they focus the
silver atoms in a relatively sharp line. With an inhomogeneous field, the
line splits into two separate lines. The magnetic moment of the silver atom
is either parallel or anti-parallel to the field gradient, but seemingly never
perpendicular.
Although not performed by Stern and Gerlach, one can repeat such an
experiment multiple times. First of all we should notice that the choice of
the z direction is arbitrary. One can put the field gradient and magnetic
force in any direction acting on any component of the magnetic moment.
For a field and field gradient in the ~r direction, one should observe a split-
ting in the ~r direction of the ray of silver atoms. If the silver atoms pass
through an inhomogeneous field in the z direction with a field gradient in
the z direction, we observe a splitting into silver atoms with their magnetic
moment either parallel (+) or anti-parallel (−) to the magnetic field. We
can now block the silver atoms with their magnetic moment anti-parallel to
the magnetic field and make the silver atoms with their magnetic moment
parallel to the field pass through an other inhomogeneous magnetic field.
If this field is again in the z direction, we see no further splitting. This
situation is depicted in panel (a) of figure 1.7. The silver atoms are, after
39
Chapter 1. The limits of classical physics
the first pass through a field with gradient in the z direction, polarized in
the z direction. This polarization is conserved such that in a second pass
were we only enter with silver atoms that have their moment parallel to
the field, we will only observe silver atoms with their moment parallel to
the field.
If we polarize the silver atoms to have their moment in the positive z
direction, we can perform a Stern-Gerlach experiment with the field and
gradient in the x direction. This is depicted in panel (b) of figure 1.7. A
silver atom with its moment in the z direction has no moment in the x
direction. At the same time, the atom has to bend either in the x+ or
x− direction. Because of symmetry considerations, it is easy to see that
the silver atoms split equally into silver atoms with their moment either
40
1.6. The Zeeman effect and the Stern-Gerlach Experiment
41
Chapter 1. The limits of classical physics
with
0 1
σx = , (1.47)
1 0
0 −i
σy = ,
i 0
1 0
σz = .
0 −1
With the use of this mathematical framework, we can explain the Stern-
Gerlach experiment. After the silver atoms moved through a field and field
gradient in the z direction, we have two sets of silver atoms. The silver
atoms with their moments parallel to the field are in the state
1
. (1.48)
0
The silver atoms with their moments anti-parallel to the field are in the
state
0
. (1.49)
1
You should convince yourselves that with the use of equation 1.46 this
indeed yields the correct orientation of the magnetic moment. If the field
and gradient would have been in the x direction, we would have found the
polarized states to be
p p
1/2 1/2
p , and p . (1.50)
1/2 − 1/2
42
1.6. The Zeeman effect and the Stern-Gerlach Experiment
43
Chapter 1. The limits of classical physics
44
1.7. Atomic physics
and solids has come a long way since the 18th century and transformed in
an accurate science with quantitative predictive power.
Chemistry
Atomic physics studies the proper-
ties of the different elements. We
know about 118 different atomic el-
ements, with several different iso-
topes. In the periodic table they
are listed according to their atomic
number and grouped in rows and
columns according to their chemical
properties. Elements in the same
column have the same atomic va-
lence. Valence can be defined as the Figure 1.8: Elements in the peri-
number of hydrogen atoms that can odic table ordered according to their
combine with an element in a bi- atomic number and filling of different
nary hydride or twice the number atomic shells.
of oxygen atoms combining with an
element in its oxide or oxides. The atomic mass of each element is roughly
twice the atomic number.
In 1789, Lavoisier [51] wrote a text book discussing the chemical prop-
erties of the different elements. Naturally at that point in time not all
elements were known but basic chemical principles started to emerge. At
a given temperature and pressure, two different gasses have roughly the
same number of molecules per volume. A cubic cm of oxygen gas reacts
with two cubic cm of hydrogen gas, whereas a cubic cm of fluorine gas
only reacts with one cubic cm of hydrogen gas. This allows one to classify
the elements according to how they react with hydrogen. Based on these
and similar considerations Mendeleev arranged the elements in the peri-
odic table. His ordering allowed him to predict undiscovered elements. He
predicted the existence of the element Gallium and some of its chemical
properties. Gallium was subsequently found in the year 1875.
45
Chapter 1. The limits of classical physics
Why the periodic table has the structure it has was hard to understand.
With the publication of Schrödingers equation (combined with the relativis-
tic corrections of Dirac), one can immediately understand the structure of
the periodic table. With the Schrödinger equations, one can calculate the
magnetic, optical, and chemical properties of the elements. As a side note,
it turns out that the Schrödinger equation is highly non-trivial to solve
accurately for atoms with more than two electrons, and even harder for
molecules or solids. For complex systems including molecules and solids,
getting qualitative results is sometimes even a challenge, as we will discuss
in chapter 9 on many particle systems.
46
1.7. Atomic physics
47
Chapter 1. The limits of classical physics
48
1.7. Atomic physics
Diffraction
49
Chapter 1. The limits of classical physics
50
1.7. Atomic physics
51
Chapter 1. The limits of classical physics
nλ = 2πr. (1.55)
h
λ= =⇒ (1.56)
mv
nh
= 2πr =⇒ (1.57)
mv
nh
mvr = . (1.58)
2π
With l = mvr the angular momentum of the electron quantized as
integer multiples of ~ = h/2π.
E2 − E1
ν= , (1.59)
h
under the assumption that E1 > E2 . For an hydrogen-like atom (H,
or He+ , Li2+ , but also the Alkali metals with Z ≈ 1), we can calculate
the energy by stating that in a Bohr orbital the Coulomb force needs
to provide a centrifugal force that generates a circular orbit. From
Newtonian mechanics and electrostatics we know
Ze2 mv 2
= . (1.60)
4π0 r2 r
If one would use Maxwell’s equations to solve the electro dynamical
problem of a charge traveling in a point like potential, this would
52
1.7. Atomic physics
53
Chapter 1. The limits of classical physics
54
Chapter 2
In 1926 Erwin Schrödinger published 4 articles [7, 8, 9, 10] laying the fun-
daments of Quantum physics as we know it today. In his first paper he
conjectured an equation that describes the discrete spectrum of the Hy-
drogen atom. These papers are very well written. If you are interested in
the original argumentation I can recommend you to read these. An his-
torical overview of the introduction of quantum mechanics by Schrödinger
is given in the Arxive paper by Norbert Straumann [64]. This paper also
refers to letters written by Schrödinger. If you read the original articles of
Schrödigner I would recommend to read the article by Straumann as well.
In this chapter we will not follow the historical line of thought but
motivate the Schödinger equation by the equivalence between momentum
and wave-length as postulated by De Broglie [65]. In the end it must
be clear that the only reason we use the Schrödinger equation is that its
predictions agree astoundingly well with experiment.
55
Chapter 2. Schrödinger’s conjecture and the postulates of
quantum mechanics
2.1 Motivation
According to De Broglie there is a relation between the momentum of a par-
ticle and the corresponding wave-length this particle displays in a diffraction
experiment.
h
|~
p| = , (2.1)
λ
with p~ = (px , py , pz ) the momentum of the particle, h the Planck constant
and λ the wavelength. If one performs a diffraction experiment (i.g. elec-
trons passing through two slits with distances comparable or smaller than
the De Broglie wavelength) one needs to represent the particle by a wave.
We can represent a plane wave by a function of space coordinates ~r and
time coordinate t as:
~
ψ(t, ~r) = ei(k·~r−ω t) , (2.2)
with the wave-vector ~k = p~/~ and the angular frequency ω = 2πν. Using
the relation between energy and frequency E = hν = ~ω we have:
i
ψ(t, ~r) = e ~ (~p·~r−E t) (2.3)
We can obtain the energy E and the momentum p~ from the function ψ(t, ~r)
∂ ∂ ∂
by differential operators 1 . The gradient (∇ = ∂x x̂ + ∂y ŷ + ∂z ẑ) of the
wave-function results in the momentum times the wave-function
~
p~ ψ(t, ~r) = ∇ψ(t, ~r). (2.4)
i
The time derivative of the wave-function results in the energy times the
wave-function
∂
E ψ(t, ~r) = i~ ψ(t, ~r). (2.5)
∂t
1
Operators in Quantum mechanics map a function (ψ(t, ~r) in our example) to an other
function.
56
2.2. The wave function and the Hamiltonian defining the
micro state and its time evolution
p~2
E= + V (~r). (2.8)
2m
The Schrödinger equation for a particle in an external potential becomes:
∂ ~2
i~ ψ(t, ~r) = − ∆ψ(t, ~r) + V (~r)ψ(t, ~r). (2.9)
∂t 2m
57
Chapter 2. Schrödinger’s conjecture and the postulates of
quantum mechanics
The classical Hamiltonian function specifies the energy of the particles un-
der consideration. The quantum mechanical equivalent is obtained by re-
placing the momentum p~ by the momentum operator p ~ = ~i ∇. With this
substitution the Hamilton function, H becomes the Hamilton operator, H
~2
p
H= + V (~r) (2.11)
2m
~2
=− ∆ + V (~r).
2m
A quantum mechanical state of a system is defined by its wave-function
ψ(t, ~r). If one knows the function at time t = 0 as ψ(0, ~r) = ψ0 (~r) one
can calculate the wave-function at any specific time with the Schrödinger
equation:
∂
i~ ψ(t, ~r) = Hψ(t, ~r). (2.12)
∂t
~2
Hi = − ∆i + V (~ri ), (2.13)
2mi
with index i labelling the different particles at position ~ri with mass mi .
The total Hamiltonian is given by the sum of the Hamiltonians of each
particle
N
X
H= H i. (2.14)
i=1
58
2.3. The Schrödinger equation for many particle states
This is however not the most general form of the Hamiltonian we encounter
in nature. In general particles can interact with each other. In this case
the potential plus interaction term V is not given as a sum of potentials
acting on the single particles
N
X
V (~r1 , . . . , ~rN ) 6= V (~ri ). (2.15)
i=1
For most realistic systems the energy will depend on the position of the
individual particles as well as on the distance between the particles
N
X
V (~r1 , . . . , ~rN ) = V1 (~ri ) (2.16)
i=1
N,N
X
+ V2 (|~r1 − ~rj |)
i=1,j>i
That said, we at this point do not need to go into detail on the specific
interactions between possible particles. It P
is enough to note that the sum
over the single particle potential energies N i=1 V (~
ri ) for the total poten-
tial energy should be replaced by a more general function V (~r1 , . . . , ~rN )
that includes the potential energy of each particle due to an externally ap-
plied field, plus the potential energy due to interaction terms between the
particles. The result is the many-particle Hamiltonian operator
N
X ~2
H= − ∆i + V (~r1 , . . . , ~rN ). (2.17)
2mi
i=1
59
Chapter 2. Schrödinger’s conjecture and the postulates of
quantum mechanics
the solutions to those that fulfil the Pauli principle. The wave-function
must be totally symmetric (Bosons) or anti-symmetric (Fermions) under
the permutation of the coördinates of identical particles. For two particles
this yields for Bosons
60
2.4. The interpretation of the wave-function
61
Chapter 2. Schrödinger’s conjecture and the postulates of
quantum mechanics
62
2.4. The interpretation of the wave-function
63
Chapter 2. Schrödinger’s conjecture and the postulates of
quantum mechanics
We now briefly run ahead and make a statement we will prove and
discus later, in section 6.5. Given the quantum state described by the wave
function ψ(t, ~r1 , . . . , ~rn ). It follows from the Schrödinger equation, that if
we for this state measure an observable, described by operator O and find as
a result the value o, then if we repeat this experiment many times, starting
from the same state ψ(t, ~r1 , . . . , ~rn ) we will find a distribution of values
ρ(o). The possible values of o are given by the eigenvalues (spectrum) of
the operator O. The frequency with which these values are found are given
by the norm square of the inner product of the eigenstates of operator O
and the many particle wave function of the system. More on this later,
after we introduced the proper math.
Finding a distribution of values for a repeated experiment on an en-
semble of macroscopically many particles is not surprising. Thermal fluc-
tuations are expected in classical systems and described by statistical me-
chanics. Despite the similarities between thermal and quantum fluctuations
there are however important differences. Firstly, thermal fluctuations av-
erage out over time, whereby quantum fluctuations all happen at the same
time (again more on this later). Secondly thermal fluctuations disappear
if one considers an isolated single particle. Quantum fluctuations remain,
also for a single particle.
The quantum mechanical description and the fluctuations of observ-
ables, described above is not restricted to the case of macroscopically large
values of N . The previous equations also work for the case where N = 1, i.e.
a single particle. Quantum mechanics, or at least the Schrödinger equation,
predicts that for a single particle, observables are given by a distribution
of values. Schrödinger in his original interpretation of quantum mechanics
took this quite literally and proposed to replace the concept of point par-
ticles by wave packets, with finite, but small widths [11]. Observations on
a single particle would then see the full distribution function of values.
If one looks at the physics of a single atom Schrödinger’s original inter-
pretation has several appealing features. For these cases Schrödinger’s pic-
ture provides a natural explanation for several experiments. Experiments
where one performs single atom diffraction observe the Fourier transform of
a spatial distribution of the atomically bound electron charge and spin den-
64
2.4. The interpretation of the wave-function
sity, known as the atomic form factor. Similar arguments and observations
are true for experiments whereby the typical length scale in the experiment
is of the same order as the (de Broglie) wave length of the particles involved
and the temperature (kB T ) is small compared to the low energy excitations
in the system. Typical examples are scanning tunnelling microscopy and
spectroscopic variants of this method where single atoms and the orbital
structure of the electronic states is seen in the experimental results. In
order to determine the typical de Broglie wave length one should realize
that the kinetic energy of a particle at rest, i.e. a stationary state is not
necessarily zero.
If one on the other hand does experiments whereby the typical length
scale is much larger than the wave length of the particles involved something
seems to go wrong with Schrödinger’s original proposal for the interpreta-
tion of quantum mechanics. Cosmic rays originating from far away stars
do not appear as spread out wave packets. A cosmic particle appears in
a bubble chamber as a well localized point like particle on a clear trajec-
tory. It is possible to perform experiments on single particles whereby the
outcome is not a distribution function, but a single value. Before we will
touch the philosophical question on what this has to mean for the inter-
pretation of the wave function we will have a look at what possible single
values are allowed for such a measurement. For a microscopic ensemble of
N non-interacting particles we know that the solution of the Schrödinger
equation (section 6.5) dictates that we observe for the observable values of
an operator O a distribution function ρ(o) given by the eigenvalues of oper-
ator O with frequency given by the norm square of the inner product of the
eigenvectors of operator O with the N particle wave-function of the system.
If we have an apparatus that measures a single unique value for each par-
ticle, then in order to be in agreement with the solution for the ensemble
of N particles these values must be given by o, the eigenvalues of operator
O. The probability to obtain the eigenvalue o is given by the distribution
function ρ(o). An experiment that beautifully exemplifies this is the double
slit diffraction experiment. With a low flux electron beam one can register
each electron separately. As such one can see how the diffraction pattern
builds up as a distribution from individually events [44, 45].
65
Chapter 2. Schrödinger’s conjecture and the postulates of
quantum mechanics
66
Chapter 3
Mathematical properties of
the Schrödinger equation
67
Chapter 3. Mathematical properties of the Schrödinger
equation
~2
=− ∆ + V (~r).
2m
Where we used that the momentum operator is given as
~p = −i~∇. (3.3)
as the particle density. This definition for the particle density only makes
physical sense if the total number of particles is conserved. Furthermore, for
ρ(t, ~r) to be a physically sensible density one needs the continuity equations
to be fulfilled. The continuity equation states that density cannot disappear
at one point in space ~r0 to instantaneously reappear at a distant point ~r,
which we can formalize using the -δ definition of continuity. For any small
spacial distance = |~r − ~r0 |, there is a small time period δ = t − t0 during
which the density stays within a distance of around its original position
~r0 . Density flows in a continuous motion from one position to another. In
68
3.1. Continuity equation, density conservation and current
density
This equation needs to be valid for any volume V such that the equality
needs to hold not only for the integral, but also for the integrants.
∂
ρ(t, ~r) = −∇ · ~j(t, ~r). (3.9)
∂t
The change in density at point ~r as a function of time is equal to minus the
density that flows out of this point at time t.
In equation 3.6 we defined the particle density in terms of the wave
function. In quantum mechanics the continuity equation not only holds for
69
Chapter 3. Mathematical properties of the Schrödinger
equation
We now need to find a proper definition of the current density ~j(t, ~r) such
that equation 3.9 is fulfilled. We start with the left hand side of equation
3.9. For the time derivative of the density in point ~r we find
∂ ∂
ρ(t, ~r) = (ψ(t, ~r)∗ ψ(t, ~r)) (3.11)
∂t ∂t
∂ ∗ ∗ ∂
= ψ(t, ~r) ψ(t, ~r) + ψ(t, ~r) ψ(t, ~r) .
∂t ∂t
Next, we replace the time derivative of the wave function by a spatial
derivative using the Schrödinger equation. We need both the time derivative
of the wave function as well as the time derivative of the complex conjugate
of the wave function. The complex conjugate of a function that is a solution
of the Schrödinger equation is by itself not necessarily a solution of the
Schrödinger equation. We can find the relation between the time derivative
of the complex conjugate of a wave function and its spatial derivatives by
looking at the complex conjugate of the Schrödinger equation.
~2
∂
i~ ψ(t, ~r) = − ∆ + V (~r) ψ(t, ~r) =⇒ (3.12)
∂t 2m
∗ ∗
~2
∂
i~ ψ(t, ~r) = − ∆ + V (~r) ψ(t, ~r) =⇒ (3.13)
∂t 2m
~2
∂ ∗
−i~ ψ(t, ~r) = − ∆ + V (~r) ψ(t, ~r)∗ , (3.14)
∂t 2m
whereby in the last step we used that the potential V (~r) is real.
We can substitute this into equation 3.11 to find
~2
∂ 1 ∗
ρ(t, ~r) = − ∆ + V (~r) ψ(t, ~r) ψ(t, ~r) (3.15)
∂t −i~ 2m
70
3.1. Continuity equation, density conservation and current
density
~2
∗ 1
+ ψ (t, ~r) − ∆ + V (~r) ψ(t, ~r)
i~ 2m
~
=− (ψ(t, ~r)∗ ∆ψ(t, ~r) − ψ(t, ~r)∆ψ(t, ~r)∗ )
2mi
~
= −∇ · (ψ(t, ~r)∗ ∇ψ(t, ~r) − ψ(t, ~r)∇ψ(t, ~r)∗ ) .
2mi
We now define the current density as
~j(t, ~r) = ~ (ψ(t, ~r)∗ ∇ψ(t, ~r) − ψ(t, ~r)∇ψ(t, ~r)∗ ) (3.16)
2mi
1
= (ψ(t, ~r)∗ p pψ(t, ~r)∗ ) ,
~ψ(t, ~r) − ψ(t, ~r)~
2m
such that the continuity equation for the density defined for a single par-
ticle in quantum mechanics is obeyed. This leaves the question if the cur-
rent density as defined in equation 3.16 is a sensible definition. A sensible
definition requires that the obtained theory is internally mathematically
consistent. This is given by construction. It furthermore requires that the
theory describes the experimental observations. The density and current
as defined in equation 3.10 and 3.16 describe experiments with a very high
accuracy.
We can now also prove that the total density of a system is conserved
if certain boundary conditions of ψ(t, ~r) are considered. From equation 3.7
we immediately seeR that if the current through a surface S that encloses
volume V is zero ( S ~j(t, ~r) · d~n = 0) the total number of particles within
this volume is constant in time.
For infinite volumes V = Rd we have conservation of particle number if
the density is finite and the surface integral for V → Rd converges to zero.
This sets restrictions on the possible wave functions we can allow.
Z Z
ψ(t, ~r)∗ ψ(t, ~r)dd~r = ψ(t = 0, ~r)∗ ψ(t = 0, ~r)dd~r (3.17)
Rd Rd
if
Z
ψ(t = 0, ~r)∗ ψ(t = 0, ~r)dd~r < ∞. (3.18)
Rd
71
Chapter 3. Mathematical properties of the Schrödinger
equation
3.2 Linearity
The Schrödinger equation is a linear partial differential equation. It is a
multi variable differential equation with partial derivatives. It is linear
while it only depends linearly on partial derivatives of ψ(t, ~r), i.g. there are
∂
no quadratic terms of the form ∂t ψ(t, ~r) ∂r∂ i ψ(t, ~r), ∂r∂ j ψ(t, ~r) ∂r∂ i ψ(t, ~r), or
other higher order polynomial functions of the partial derivatives. Linear
differential equations have the property that if ψ1 (t, ~r) and ψ2 (t, ~r) are both
solutions of the differential equation then ψ(t, ~r) = c1 ψ1 (t, ~r) + c2 ψ2 (t, ~r),
with c1 and c2 complex numbers is also a solution of the differential equa-
tion.
For the Schrödinger equation we can prove the latter statement. Let
ψ1 (t, ~r) and ψ2 (t, ~r) be solutions of equation 3.1
∂
i~ ψ1 (t, ~r) = Hψ1 (t, ~r), (3.19)
∂t
and
∂
i~ ψ2 (t, ~r) = Hψ2 (t, ~r). (3.20)
∂t
72
3.3. Hermitian and Self-adjoint
We now find for the Schrödinger equation on ψ(t, ~r) = c1 ψ1 (t, ~r)+c2 ψ2 (t, ~r):
∂ ∂ ∂
i~ (c1 ψ1 (t, ~r) + c2 ψ2 (t, ~r)) = c1 i~ ψ1 (t, ~r) + c2 i~ ψ1 (t, ~r) (3.21)
∂t ∂t ∂t
= c1 Hψ1 (t, ~r) + c2 Hψ2 (t, ~r)
~2
= c1 − ∆ + V (~r) ψ1 (t, ~r)
2m
~2
+ c2 − ∆ + V (~r) ψ2 (t, ~r)
2m
~2
=− ∆ (c1 ψ1 (t, ~r) + c2 ψ2 (t, ~r))
2m
+ V (~r) (c1 ψ1 (t, ~r) + c2 ψ2 (t, ~r))
= H (c1 ψ1 (t, ~r) + c2 ψ2 (t, ~r)) ,
2
As the Hamiltonian is linear we can proof this for the two terms, − 2m
~
∆
and V (~r) in the Hamiltonian separately. For the potential energy term we
find
Z Z
∗ d
φ(t, ~r) V (~r)ψ(t, ~r)d ~r = V (~r)φ(t, ~r)∗ ψ(t, ~r)dd~r (3.24)
Rd Rd
73
Chapter 3. Mathematical properties of the Schrödinger
equation
Z
= V (~r)∗ φ(t, ~r)∗ ψ(t, ~r)dd~r (3.25)
Rd
Z
= (V (~r)φ(t, ~r))∗ ψ(t, ~r)dd~r, (3.26)
Rd
In our case, both ψ(t, ~r) and φ(t, ~r) are in DH . Specifically we need that
ψ(t, ~r) and φ(t, ~r) as well as their first and second derivative are square
integrable. We thus find for the boundary conditions
∂ ∗
lim φ(t, ~r) ψ(t, ~r) = 0, (3.28)
|ri |→∞ ∂ri
and
∂
lim φ(t, ~r)∗ ψ(t, ~r) = 0, (3.29)
|ri |→∞ ∂ri
for all but a finite set of values of ~r. The later set of points do not contribute
to the integral over this function and we can neglect the boundary term.
We can now proof the Hermiticity of the Hamiltonian for the kinetic
energy term
Z ∞ 2 Z ∞
∗ ∂ ∂ ∗ ∂
φ(t, ~r) 2 ψ(t, ~r)dri = − φ(t, ~r) ψ(t, ~r)dri (3.30)
−∞ ∂ri −∞ ∂ri ∂ri
Z ∞ 2
∂ ∗
= 2 φ(t, ~
r) ψ(t, ~r)dri
−∞ ∂ri
Z ∞ 2 ∗
∂
= 2 φ(t, ~
r) ψ(t, ~r)dri
−∞ ∂ri
74
3.4. Time evolution operator
for all φ(t, ~r) in DA and ψ(t, ~r) in DA† . If operator A is Hermitian and the
domains of A and A† are equivalent we call the operator A self-adjoint.
For self-adjoint operators we can write that A = A† .
75
Chapter 3. Mathematical properties of the Schrödinger
equation
Next we integrate ~r over Rd and change the order of the integral and time
derivative. For this to be allowed we need to assume that ψ(t, ~r) and φ(t, ~r)
as well as their first and second derivative are square integrable. We find
Z Z
∂ ∗
i~ d
φ(t, ~r) ψ(t, ~r)d ~r = φ(t, ~r)∗ Hψ(t, ~r)dd~r (3.38)
∂t Rd Rd
Z
− (Hφ(t, ~r))∗ ψ(t, ~r)dd~r
Rd
= 0,
With
Z
∗
U (t, t0 )φ(t0 , ~r) U (t, t0 )ψ(t0 , ~r) dd~r =
(3.41)
Rd Z
φ(t0 , ~r)∗ U (t, t0 )† U (t, t0 )ψ(t0 , ~r)dd~r,
Rd
76
3.6. Uniqueness of the time evolution and determinacy of the
wave function
we conclude that
77
Chapter 3. Mathematical properties of the Schrödinger
equation
with E a time and position independent constant. The latter follows from
the fact that the left hand side of the previous equation is independent of
position and the right hand side is independent of time and the equation
needs to be valid for all positions and time.
We thus find two independent eigenfunction equations
∂
i~ χ(t) = Eχ(t), (3.49)
∂t
and
78
3.7. Stationary states and Eigenfunctions
Z Z
E φ(~r)∗ φ(~r)dd~r = E ∗ φ(~r)∗ φ(~r)dd~r =⇒
Rd Rd
E = E∗.
E is a real constant and associated with the energy of the state.
For the time dependent part we can solve the eigensystem equation to
find
i
χ(t) = c1 e− ~ Et , (3.52)
with c1 and arbitrary complex constant.
The wave function is normalized such that
Z
χ(t)∗ φ(~r)∗ χ(t)φ(~r)dd~r = 1. (3.53)
Rd
We can take χ(t) outside the spatial integral and find that
Z
∗
χ(t) χ(t) φ(~r)∗ φ(~r)dd~r = 1 (3.54)
Rd
holds for all times. From which it follows that
χ(t)∗ χ(t) = |c1 |2 , (3.55)
and
Z
φ(~r)∗ φ(~r)dd~r = |1/c1 |2 . (3.56)
Rd
For convenience we normally choose
c1 = 1. (3.57)
There are operators that do not have eigenfunctions that are square-
integrable. These contain important physical systems. In this case, we
can still use the eigenfunctions, but not directly as a representation for
the wave function of a physical system. One solution, which we will see
later, is to use these eigenfunctions as a basis to generate functions that are
square-integrable. Another option is to consider not an infinite-dimensional
system, but contain the entire world in a finite box. One can then look at
the evolution of the solutions as one takes the limit of the box size to
infinite.
79
Chapter 3. Mathematical properties of the Schrödinger
equation
~2
− ∆φ(~r) = (E − V (~r))φ(~r), (3.58)
2m
or
2m
∆φ(~r) = (V (~r) − E)φ(~r). (3.59)
~2
If V (~r) is finite ∀~r we know that the second partial derivative of φ(~r) is
finite. From this, it follows that the wave function φ(~r) and its first partial
derivatives ∇φ(~r) need to be continuous.
Not all potentials we encounter will be finite. The Coulomb potential
acting on a particle due to another charged particle at position ~r0 is of the
form
1
V (~r) ∝ . (3.60)
|~r − ~r0 |
This potential diverges for ~r = ~r0 . A divergent potential leads to a diver-
gence in the second partial derivative of φ(~r) and thus to a discontinuity
in the gradient of φ(~r). The wave function φ(~r) is continuous, but it has
a cusp at the point of the divergence of the potential. With the use of
Gauss divergence theorem, we can relate the cusp in the wave function to
the residue of the pole in the diverging potential.
Without loss of generality, we can shift the coordinate system such that
the divergence we want to investigate is at ~r = 0. In order to determine
the size of the cusp in the wave function we investigate the derivative of
the wave function with respect to the radial coordinate r = |~r|. In order
for the derivative of the wave function to be continuous, it must hold that
∂ ∂
ψ(~r) = − ψ(−~r), (3.61)
∂r ∂r
80
3.8. Continuity of the wave function and its derivative
or equivalently
Z 2π Z π
1
lim ∇ψ(~r) · r̂ sin(θ)dθdφ = 0. (3.64)
R0 →0 4π 0 0 r=R0
The integral of the gradient of the wave function in the radial direction over
a sphere with radius R0 around the point ~r = 0 in the limit R0 → 0 has
to be zero for the wave function to be smooth. If this integral is non zero,
there must be a cusp in the wave function.
In order to relate this integral to the divergence of the potential, we use
Gauss’s theorem and relate the surface integral to a volume integral.
Z 2π Z π
1
lim ∇ψ(~r) · r̂ sin(θ)dθdφ = (3.65)
R0 →0 4π 0 0 r=R0
Z 2π Z π Z R0
1 1
lim 2 ∇ · ∇ψ(~r) sin(θ)r2 drdθdφ.
R0 →0 4π R0 0 0 0
~2 ∇2
− ψ(~r) + V (~r)ψ(~r) = Eψ(~r), (3.66)
2m
and thus find that
Z 2π Z π
1 ∂
ψ(~r) sin(θ)dθdφ (3.67)
4π 0 0 ∂r r=0
Z 2π Z π Z R0
1 1 2m
= lim 2 (V (~r) − E) ψ(~r) sin(θ)r2 drdθdφ.
R0 →0 4π R0 0 0 0 ~2
81
Chapter 3. Mathematical properties of the Schrödinger
equation
to find
Z 2π Z π
1 ∂
ψ(~r) sin(θ)dθdφ (3.69)
4π 0 0 ∂r r=0
Z 2π Z π Z R0
1 1 2m Z
= lim 2 2
+ c0 + . . . ψ(~r) sin(θ)r2 drdθdφ
R0 →0 4π R0 0 ~ |~
r |
0 0
1 1 2m 2 4π 3
= lim 2 ~2 2πZR0 + c0 R0 + . . . ψ(~r = 0)
R0 →0 4π R0 3
m
= Z 2 ψ(~r = 0).
~
For a divergent Coulomb potential, there is a cusp in the wave function at
the divergence, proportional to the residue of the pole times m/~2 times
the value of the wave function at the divergence.
82
Chapter 4
Wave mechanics
83
Chapter 4. Wave mechanics
ρ(x) = 0 ∀x 6= 0. (4.2)
The distribution that fulfils these requirements is the Dirac delta function,
or more accurately the Dirac delta distribution. As argued in appendix A,
we can define this distribution as a limiting case of a sequence of functions.
A Gaussian function with the width going to zero would suffice
r
1 − x22
δ(σ, x) = e 2σ . (4.3)
2πσ 2
The limit of σ → 0 should be taken outside the integral (see appendix A)
such that
Z ∞ Z ∞
δ(x)dx = lim δ(σ, x)dx (4.4)
−∞ σ→0 −∞
= 1.
84
4.1. Time evolution of a delta function
In order to prevent several quirks of the Dirac delta function we will con-
tinue here with one of the functions that can be used to define the delta
function. In particularly we will use the Gaussian function as defined in
equation 4.4. The results we derive in this chapter are valid for any value
of σ, but for a localized particle we are interested in the case where σ is
very small and approaches zero. The wave function at t = 0 of our particle
localized at x = 0 is thus given as
1/4
1 x2
ψ(t = 0, x) = e− 4σ2 , (4.7)
2πσ 2
for very small values of σ.
We can calculate the wave function at a later time t > 0, by solving the
partial differential equation defined by the Schrödinger equation
∂ ~2
i~ ψ(t, x) = − ∆ψ(t, x), (4.8)
∂t 2m
with the boundary condition for ψ(t, x) at t = 0 as given in equation 4.7.
In order to solve the time evolution we will use a method called spectral
decomposition. For a plane-wave we can relatively easy solve the time
evolution according to the Schrödinger equation. The function
At time t = 0 we have
1/4 2
r Z ∞
1 − x2 1
2
e 4σ = c(k) eik x dk, (4.11)
2πσ 2π −∞
85
Chapter 4. Wave mechanics
from which we can determine c(k). On the left hand side we have a Gaus-
sian function. On the right hand side we see the Fourier transform of the
function c(k). We now can use the equation that the Fourier transform (See
the appendix C.3.2 for a derivation) of a Gaussian function is a Gaussian
function
Z ∞ r
1 −σ 2 k2 ixk 1 − x22
√ e e dk = e 4σ , (4.12)
2π −∞ 2σ 2
to find
1/4 √
1 2 2
c(k) = 2
2σ 2 e−σ k (4.13)
2πσ
2 1/4
2σ 2 2
= e−σ k .
π
Next we substitute the equation for c(k) into the expansion of ψ(t, x) on
plane waves (eq. 4.10) to obtain
r ∞ 1/4
2σ 2 ~tk2
Z
1 −σ 2 k2 i k x− 2m
ψ(t, x) = e e dk. (4.14)
2π −∞ π
With β = ix and α = σ 2 + i 2m
~t
we find
1/4 s x2
σ2 −
π ( i~t
4 σ 2 + 2m )
ψ(t, x) = i~t
e (4.16)
2π 3 σ2 + 2m
!1/4 x2
1 −
4σ 2 1+ i~t 2
( )
= e 2mσ
i~t 2
2πσ 2 1+ 2mσ 2
86
4.1. Time evolution of a delta function
With the knowledge of the wave function we can calculate the particle
density ρ(t, x). (i.e. the distribution function of the particle density one
would find if one repeats such an experiment an infinite number of times).
We find that for all times the particle distribution will be given by a Gaus-
sian. We defined σ to be the width (standard deviation) of the Gaussian
at time t = 0. We find that at time t the particle distribution spreads and
the width becomes
s
t2 ~2
σ(t) = 2
σ + . (4.18)
4m2 σ 2
∂σ(t) t~2
= . (4.19)
∂t 4m2 σ 2 σ(t)
The rate depends on the inverse of σ 2 . The more localized we prepare the
initial wave function the faster the particle distribution will spread.
For our particle that we prepared to be at t = 0 exactly at position
x = 0, this result is somewhat problematic. We would like to take the limit
of σ to 0. This would yield for any finite time t > 0 an infinite width. The
particle instantly spreads from a delta distribution to a distribution that
is constant over the whole space and integrates to 1 over the whole space.
This also means that at times slightly larger than zero the particle can be
at a very large distance from the starting position. To do so its velocity
needs to be very large. A very large velocity means the particle must have
a very large energy.
We can actually calculate the energy of the particle. We have seen in eq.
2.5 that for a particle whose wave function is given by a single plane wave
87
Chapter 4. Wave mechanics
i
such that ψk (t, x) = e ~ (px−Et) the time derivative of the wave-function
is proportional to the energy times the wave-function i~∂/(∂t)ψk (t, x) =
Eψk (t, x). In section 3.7 we discussed stationary states and found that the
previous relation for plane waves holds for all stationary states ψstat (t, x). In
particular we found that ψstat (t, x) is an eigenstate of both the Hamiltonian
as well as the time derivative operator:
∂
ı~ ψstat (t, x) = Eψstat (t, x), and (4.20)
∂t
Hψstat (t, x) = Eψstat (t, x).
For both operators the eigenvalue is E, which we associate with the energy
of the state. Furthermore we know that for normalized states we have
Z ∞
ψstat (t, x)∗ ψstat (t, x)dx = 1, (4.21)
−∞
such that
Z ∞
∂
ψstat (t, x)∗ i~ ψstat (t, x)dx = E. (4.22)
−∞ ∂t
We can expand ψ(t, x) on plane waves and insert equation 4.10 for the
wave-function
Z ∞ r Z ∞ !∗
2t
1 i k x− ~k
E= c(k) e 2m
dk (4.24)
−∞ 2π −∞
r Z ∞ !
~k2 t
∂ 1 i k x− 2m
× i~ c(k) e dk dx
∂t 2π −∞
88
4.1. Time evolution of a delta function
2 −k2 )
∞ ∞ ∞
c(k2 )∗ c(k1 ) ~2 k12 i
Z Z Z ~t(k1 2
(k1 −k2 )x− 2m
= √ √ e dk2 dk1 dx.
−∞ −∞ −∞ 2π 2π 2m
R∞ i(k1 −k2 )x dx
We can separate the integral over x and realize that √1
2π −∞ e
√
is the Fourier transform of 1, which (see Appendix C.3.3) is 2π times a
Dirac delta function δ(k1 − k2 ).
Z ∞Z ∞
~2 k12 −i ~t(k12 −k22 )
E= δ(k1 − k2 )c(k2 )∗ c(k1 ) e 2m dk2 dk1 (4.25)
−∞ −∞ 2m
Z ∞
~2 k 2
= c(k)∗ c(k) dk.
−∞ 2m
The first thing to notice is that the energy is conserved, i.e. it does not
depend on time. Actually the reason to define equation 4.23 as the energy
of the state is that it is conserved in time. A particle prepared in a given
Gaussian wave packet has a fixed amount of energy that does not change
2 k2
with time. Furthermore we notice that ~2m is the energy we associated
with a plane wave of momentum k. At the same time c(k) is the expansion
coefficient of our Gaussian wave packet on this plane wave. We find that
2 k2
each plane wave with momentum k contributes an energy ~2m multiplied
∗
by c(k) c(k), i.e. the norm-square of the weight in the expansion of the
Gaussian wave packet on this plane wave.
If we insert the values of c(k) for the expansion of the delta distribution
(equation 4.14) we find
Z ∞r 2
2σ −2σ2 k2 ~2 k 2
E= e dk (4.26)
−∞ π 2m
~2
=
8mσ 2
The sharper we make the initial distribution the larger the energy we have
in the system. If we want to localize p the particle exactly at x = 0 such
that ρ(t = 0, x) = δ(x) (ψ(t = 0, x) = δ(x)) we need an infinite amount
of energy. Our result that a particle prepared with a delta function dis-
tribution for its density instantly spreads to the whole space now becomes
understandable. The particle also has an infinite amount of kinetic energy.
89
Chapter 4. Wave mechanics
90
4.2. Wave packets
91
Chapter 4. Wave mechanics
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
0.6 0.6 0.6 0.6
2
2
2
-0.4 -0.5 -0.4 -0.5 -0.4 -0.5 -0.4 -0.5
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.6 -0.6 -0.6 -0.6
-1.0 -1.0 -1.0 -1.0
-10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10
x k x k x k x k
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
0.6 0.6 0.6 0.6
2
2
2
-0.4 -0.5 -0.4 -0.5 -0.4 -0.5 -0.4 -0.5
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.6 -0.6 -0.6 -0.6
-1.0 -1.0 -1.0 -1.0
-10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10
x k x k x k x k
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
0.6 0.6 0.6 0.6
2
2
2
-0.4 -0.5 -0.4 -0.5 -0.4 -0.5 -0.4 -0.5
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.6 -0.6 -0.6 -0.6
-1.0 -1.0 -1.0 -1.0
-10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10
x k x k x k x k
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
0.6 0.6 0.6 0.6
2
2
2
-0.4 -0.5 -0.4 -0.5 -0.4 -0.5 -0.4 -0.5
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.6 -0.6 -0.6 -0.6
-1.0 -1.0 -1.0 -1.0
-10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10
x k x k x k x k
Figure 4.1: Time evolution of differently shaped wave packets with zero
momentum.
wave function in momentum space (φ(t, k)). The different panels show the
wave function at different times. The black lines show the norm of the wave
function, the blue and orange the real and imaginary part. For plotting we
took σ = 1, m = 1, ~ = 1 and t runs from 0 to 10. Both wave packets
spread over time in real space. The wave function in momentum space does
not broaden. φ(t, k) changes its phase for each k value, but not its norm
|φ(t, k)|, as a function of t.
92
4.2. Wave packets
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
0.6 0.6 0.6 0.6
2
2
2
-0.4 -0.5 -0.4 -0.5 -0.4 -0.5 -0.4 -0.5
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.6 -0.6 -0.6 -0.6
-1.0 -1.0 -1.0 -1.0
-10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10
x k x k x k x k
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
, Re(ϕ(t,k)), Im(ϕ(t,k))
0.6 0.6 0.6 0.6
2
2
2
-0.4 -0.5 -0.4 -0.5 -0.4 -0.5 -0.4 -0.5
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ϕ(t,k)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.6 -0.6 -0.6 -0.6
-1.0 -1.0 -1.0 -1.0
-10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10 -10 -5 0 5 10
x k x k x k x k
Figure 4.2: Time evolution of a Gaussian wave packet in free space with
momentum k = 5, mass m = 1, ~ = 1, and width given by σ = 1. The time
evolution runs from 0 to 5.
∞
r Z
1
ψ(t, x) = φ(t, k)eikx dk (4.30)
2π −∞
!1/4 x2 −4ik0 xσ 2 +2ik0
2 tσ 2 ~/m
1 − i~t
= e
2
4σ 1+
2mσ 2
( ) ,
i~t 2
2πσ 2 1+ 2mσ 2
in real space.
The wave function of a particle represented by a Gaussian wave packet
with finite momentum k0 looks surprisingly complicated. If we compare
the wave packet with a finite momentum to the wave packet with zero
momentum k0 = 0 we find that firstly, both are represented by a Gaussian
with modified width that spreads over time. The wave packet at finite
momentum has an additional phase proportional to k0 x for one term and
to k02 t for the second term. The equation simplifies if we look at the particle
distribution
93
Chapter 4. Wave mechanics
2
k0 ~
x− m t
−
2
1 2σ 2 1+ ( t~
2mσ 2
)
=√ q 2 e .
t~
2πσ 1 + 2mσ 2
v = k0 ~/m (4.32)
= p/m.
This is the velocity momentum relation one would expect for a classical
particle. An example of such a wave packet can be seen in figure 4.2.
94
4.2. Wave packets
defined as
vp = E/p. (4.34)
This is not the velocity with which the envelope of the wave packet moves.
The latter is called the group velocity (vg ), which we will discuss in more
detail below.
The difference between the phase and group velocity can be seen in
figure 4.2. Figure 4.2 shows the propagation of a wave packet. In black, we
see the particle distribution. It moves with a constant velocity of vg = p/m
as one would expect for a classical particle. The real and the imaginary
part of the wave function seem to move slower than the norm of the wave
function. If one follows a zero crossing of either the real or imaginary part
one finds that it only moves with half the velocity vp = p/(2m).
We can derive the group velocity of a wave packet for an arbitrary
shaped wave and arbitrary dispersion relation E(p) or ω(k). A general
wave packet with a wave vector centered around k0 has the form
r Z ∞
1
ψ(t, x) = c(k − k0 )ei(kx−ω(k)t) dk. (4.35)
2π −∞
95
Chapter 4. Wave mechanics
This describes a monochrome plane wave with phase velocity vp = ω(k0 )/k0 .
The real and the imaginary part of ψ(t, x) form waves that travel through
the wave packet with velocity vp . For the particle distribution, this phase
modulation does not matter.
The second factor that dependents on x occurs inside the Fourier trans-
form of c(k − k0 ). This factor modulates the wave packet. The space
dependent factor is given as
∂ω(k)
i(k−k0 ) x− ∂k
t
e k0
. (4.39)
The velocity of this wave is defined as the group velocity and given by
∂ω(k)
vg = , (4.40)
∂k k0
Besides the terms defining the group and phase velocity of the wave
packet, there is a term that depends on the second and higher derivatives
of the energy-momentum dispersion. These terms are responsible for the
deformation of the shape of the wave packet in time. For systems that
have a linear energy-momentum dependence, i.e. E = c0 + c1 p one can
make wave packets that stay localized. For other energy-momentum rela-
tions, the wave packet will generally deform over time. Assuming different
energy-momentum relations than E = pc or E = p2 /(2m) is not just a
mathematical exercise. A wave traveling through a material will gener-
ally have a different energy-momentum dispersion than a wave traveling
96
4.2. Wave packets
The zeroth moment relates to the total particle density in the system,
which is one in our case. The first moment gives the center of the particle
97
Chapter 4. Wave mechanics
distribution or the expected value for its position / momentum. The higher
moments relate to the spread of the distribution. If we define the central
moment with respect to the expected value as
Z ∞
(n)
µ̃x (t) = (x − µ(1) n
x ) ρx (t, x)dx (4.45)
Z−∞
∞
(n) (1)
µ̃k (t) = (k − µk )n ρk (t, k)dk,
−∞
then the second central moment is the moment of inertia of the distribution,
or equivalent, its variance. The third and fourth moments are referred to
as the skewness and kurtosis of the distribution.
In the following we will assume that both ρx (t, x) and ρk (t, k) tend to
zero fast enough as |x| → ∞ or |k| → ∞ such that the integrals are well
behaved and the moments of the distribution finite.
98
4.2. Wave packets
Z ∞
= ψ(t, x)∗ ψ(t, x)dx
−∞
=1
We thus find the same normalization for the particle momentum distribu-
tion (ρk (t, k)) as well as for the particle position distribution (ρx (t, x)).
99
Chapter 4. Wave mechanics
Z ∞ Z ∞ Z ∞
1 0
= kψ(t, x0 )∗ ψ(t, x)eik(x −x) dx0 dxdk
2π
Z−∞ −∞
∞ Z ∞
−∞
Z ∞
1 0 ∗ 0
= ψ(t, x ) ψ(t, x) keik(x −x) dkdx0 dx.
−∞ −∞ 2π −∞
100
4.2. Wave packets
and
Z ∞
(2)
µk (t) = φ(t, k)∗ k 2 φ(t, k)dk (4.57)
−∞
Z ∞
∂2
= ψ(t, x)∗ − 2 ψ(t, x) dx.
−∞ ∂x
The second central moments can be calculated as
µ̃(2) (2) (1)
x (t) = µx (t) − (µx (t))
2
(4.58)
(2) (2) (1)
µ̃k (t) = µk (t) − (µk (t))2 .
These are known as the variances of the distribution and contain infor-
mation on the squared width of the distribution. The square root of the
variance is the standard deviation of the distribution
q
(2)
σx (t) = µ̃x (t), (4.59)
101
Chapter 4. Wave mechanics
and
q
(2)
σk (t) = µ̃k (t). (4.60)
For a Gaussian wave packet with zero velocity (k0 = 0), normalisation
(0) (1)
µx = 1, first moment µx = 0, and standard deviation σx at t = 0 we
have
2
1 − 21 x
ρx (t = 0, x) = √ e σx
, (4.61)
σx 2π
and
2
1 − 21 k
ρk (t = 0, k) = √ e σk
, (4.62)
σk 2π
with
1
σk = . (4.63)
2σx
The relation between σx and σk for general times and wave packets of
different shapes are known as Heisenberg’s uncertainty relations. We will
discuss them after we briefly note the calculation of general moments of the
particle distribution function.
102
4.2. Wave packets
and
∂ (1)
g(x) = −i − µk ψ(t, x). (4.69)
∂x
such that
Z ∞ Z ∞
∗ ∗
f (x)f (x)dx = (x − µ(1) (1)
x )ψ (t, x)(x − µx )ψ(t, x)dx (4.70)
−∞ −∞
103
Chapter 4. Wave mechanics
Z ∞
= ψ ∗ (t, x)(x2 − 2µ(1) (1) (1)
x x + µx µx )ψ(t, x)dx
−∞
= µ̃(2)
x ,
and
Z ∞ Z ∞ ∗
∗ ∂ (1)
g (x)g(x)dx = −i − µk ψ(t, x) (4.71)
−∞ −∞ ∂x
∂ (1)
−i − µk ψ(t, x) dx
∂x
Z ∞
∗ ∂ (1) ∂ (1)
= ψ (t, x) −i − µk −i − µk ψ(t, x)dx
−∞ ∂x ∂x
!
∂ 2
Z ∞
∗ (1) ∂ (1) (1)
= ψ (t, x) −i − 2µk −i + µk µk
−∞ ∂x ∂x
ψ(t, x)dx
(2)
= µ̃k ,
whereby for the second step we used that the momentum operator is Her-
mitian, which we can proof with the use of integration by parts.
To proof Heisenbergs uncertainty relation we note that the Cauchy-
Schwarz inequality has on the right hand side the modulus squared of a
complex number. This can be expressed as
104
4.2. Wave packets
105
Chapter 4. Wave mechanics
Using the momentum instead of the wave vector in our equations we have
~
σx σp ≥ . (4.77)
2
Instead of σ for the standard deviation of the particle or momentum dis-
tribution several authors use ∆x and ∆p as variables.
If we prepare a macroscopic large number of particles at time t = 0 in a
given wave function we can measure at a later time the particle momentum
and position distribution. It does not matter how we prepare our state,
we will always find that the standard deviation of the position distribution
times the standard deviation of the momentum distribution is larger than
~
2 . In particular, this is true at t = 0, i.e. right at the time where we
prepared our particle wave function. One can never prepare an ensemble of
particles with a product of the standard deviation of energy and momentum
distribution that is smaller than ~2 . For a single particle one can always
think of repeating the experiment nearly infinite many times. This again
will produce an ensemble of particles and for these, we can define a position
and momentum distribution. The product of the standard deviation of
these distributions built from single events will be larger than ~2 .
106
Chapter 5
Wave mechanics in
exemplary potentials
107
Chapter 5. Wave mechanics in exemplary potentials
V0
(
0 x0 ≤ 0
V (x) = (5.1)
V0 x0 > 0. x = x0
dk 2 ~2
V0 = . (5.2)
2m
Solutions to the Schrödinger equation that represent physical particles
localized in space are wave packets. If we locate the wave packet far away
from the step in the potential, i.e. at µ x0 or µ x0 , with µ the center,
or first moment, of the wave packet, then we should find similar solutions
for the wave packet and its time evolution as we did for the case without
a potential step 4. As a possible solution, we can assume a Gaussian wave
packet with a width σ |x − x0 | and finite momentum k0 . If we choose
the centre of the wave packet to be left of the potential step (µ x0 ) and
k0 > 0 the particle should travel towards the potential step and at time
t = µ2m
~k0 pass through or reflect from the step potential. Whereby we used
that the center of the wave will propagate with the group velocity vg = km0~
.
In order to understand what happens when the particle hits the potential
step, we can have a look at the two limiting cases where µ x0 or µ x0 .
The kinetic energy of the wave packet on the left-hand side of the potential
step is
k02 ~2
E= . (5.3)
2m
108
5.1. Potential step
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
32 ℏ2 32 ℏ2 32 ℏ2 32 ℏ2 32 ℏ2 32 ℏ2 32 ℏ2
V= V= V= V= V= V= V=
2m 2m 2m 2m 2m 2m 2m
0.4 0.4 0.4 0.4 0.4 0.4 0.4
2
-0.4 -0.4 -0.4 -0.4 -0.4 -0.4 -0.4
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.6 -0.6 -0.6 -0.6 -0.6 -0.6 -0.6
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
V= V= V= V= V= V= V=
2m 2m 2m 2m 2m 2m 2m
0.4 0.4 0.4 0.4 0.4 0.4 0.4
2
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.4 -0.4 -0.4 -0.4 -0.4 -0.4 -0.4
0.6 72 ℏ2
0.6 72 ℏ2
0.6 72 ℏ2
0.6 72 ℏ2
0.6 72 ℏ2
0.6 72 ℏ2
0.6 72 ℏ2
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
V= V= V= V= V= V= V=
2m 2m 2m 2m 2m 2m 2m
0.4 0.4 0.4 0.4 0.4 0.4 0.4
2
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.4 -0.4 -0.4 -0.4 -0.4 -0.4 -0.4
Once the wave packet traveled past the potential step we should expect
that the kinetic energy is reduced by the potential energy of the step. The
kinetic energy of the wave packet past the potential step becomes
(k02 − dk 2 )~2
E − V0 = . (5.4)
2m
The central wave vector of the wave packet once the wave packet is centred
around a position µ > x0 therefor is
q
k1 = k02 − dk 2 . (5.5)
This requires that E > V0 or k0 > dk for the wave to propagate past the
potential step. For the case where V0 > E one would classically expect the
particle to reflect from the potential step. We will discuss the quantum
mechanical solution for both cases below.
Before we will investigate analytical solutions we will have a look at
some numerical results. In figure 5.2 we show numerical solutions to the
Schrödinger equation for an initial Gaussian wave packet traveling across a
potential step. The different columns represent different times. The size of
the potential step increases from top to bottom, with E > V0 for the top
panels, E = V0 for the middle panels, and E < V0 for the bottom panels. In
109
Chapter 5. Wave mechanics in exemplary potentials
all cases, we see that besides a transmitted wave packet there is a reflected
wave packet. This is similar to the transmission of light through an interface
between two materials with different dielectric constants. For the solutions
of the Maxwell equations, one finds for light propagating through such a
boundary a transmitted and reflected wave. For potential steps with a
potential energy smaller than the kinetic energy of the wave packet, we
find a large transmitted wave and a small reflected wave. The larger the
potential step the larger the reflected wave. For potential steps with a
potential energy larger than the kinetic energy of the original wave packet,
one finds complete reflection and an exponentially decaying wave above the
potential step.
In order to quantify the amount of transmission and reflection we can
have a look at the stationary states of the Schrödinger equation. In chapter
4 we wrote our wave packets as linear combinations of plane waves. Without
a potential a plane wave is a stationary solution of the Schrödinger equation.
That is, the plane wave is an eigenstate of the Hamiltonian. With a position
dependent potential, plane waves are no longer stationary solutions of the
Schrödinger equation. Following section 3.7 we can find the stationary
states of the Schrödinger equation and use these to determine the particle
currents related to these stationary states. This is a similar exercise as one
would encounter in a lecture on electro magnetism where one calculates the
transmittance and reflectivity of an electro magnetic wave from a surface
of a dielectric material. The Schrödinger equation for a potential step is
∂
i~ ψ(t, x) = Hψ(t, x), (5.6)
∂t
with
~2
H=− ∆ + V (x), (5.7)
2m
and V (x) defined in equation 5.1. We can separate the time and position
dependence and write
110
5.1. Potential step
This leads to two eigenvalue equations (see section 3.7). The solution for
χ(t) is
i
χ(t) = e− ~ Et (5.9)
~2
− ∆φ(x) + V (x)φ(x) = Eφ(x). (5.10)
2m
For x < x0 the solution of 5.10 are plane waves of the form
with
2mE
k02 = , (5.12)
~2
and c1 and c2 complex constants. For x > x0 the solution of 5.10 are plane
waves of the form
with
2m(E − V0 )
k12 = , (5.14)
~2
= k02 − dk 2 ,
and c3 and c4 complex constants. For the case where E > V0 , k12 is posi-
tive and the wave propagates on both sides of the potential step. For the
case where dk > k0 , k12 is negative and k1 becomes imaginary. In that case
φ+ (x) becomes a linear combination of an exponentially decaying and expo-
nentially growing function. The part that exponentially grows for x → ∞
can not be used to form square integrable wave packets and will get a
zero coefficient. The exponentially decaying wave is physical and shows the
111
Chapter 5. Wave mechanics in exemplary potentials
and
∂ ∂
φ− (x) = φ+ (x) =⇒ (5.16)
∂x x=x0 ∂x x=x0
−ik0 x0
k0 (c1 eik0 x0 − c2 e ) = k1 (c3 eik1 x0 − c4 e−ik1 x0 ).
Solving these sets of equations we find for each energy two possible
solutions
k0 − k1
c2 = c1 ei2k0 x0 , (5.17)
k0 + k1
2k0
c3 = c1 ei(k0 −k1 )x0 ,
k0 + k1
c4 = 0,
and
k0 + k1
c2 = c1 ei2k0 x0 , (5.18)
k0 − k1
c3 = 0,
2k0
c4 = c1 ei(k0 +k1 )x0 .
k0 − k1
A general solution is a linear combination of these two solutions.
112
5.1. Potential step
, Re(ψ(t,x)), Im(ψ(t,x))
V=
2m
dk = 5, or dk = 7. For dk ≥ k0 we 0.5
2
function (infinite decay length for -0.5
|ψ(t,x)
dk = k0 ). For dk < k0 we find for -1.0
V=
2m
wave. 0.5
These standing waves near po-
0.0
tential steps are observed in na-
ture (see figure 5.4). If one
2
-0.5
|ψ(t,x)
0.5
tal changes its thickness by one
atomic layer. At these steps 0.0
-0.5
be seen. These are related
|ψ(t,x)
113
Chapter 5. Wave mechanics in exemplary potentials
gions. From the different currents we can calculate the transmitted and
reflected portion of a wave with wave vector k. In section 3.1 we defined
the current density as
1
j(t, x) = (ψ(t, x)∗ pψ(t, ~r) − ψ(t, x)pψ(t, x)∗ ) , (5.19)
2m
i
~ ∂
with p = i ∂x . For a plane wave, ψ(t, x) = e ~ (px−Et) the current density is
given as
∗ ~ ∂ i
1 i
(px−Et) (px−Et) i
(px−Et) ~ ∂ i (px−Et) ∗
j(t, x) = e ~ e~ − e~ e~
2m i ∂x i ∂x
(5.20)
p
=
m
~k
= .
m
With these equations we can
calculated the reflected and trans-
mitted currents for the stationary
solutions of the Schrödinger equa-
tion with a potential step. We as-
sume a particle flux moving from
x = −∞ towards the potential step.
This particle current is partially re-
flected and partially transmitted.
In terms of the two solutions pre-
viously found we need the solution
with c4 = 0 as no particles enter the
described setup from x = +∞. Un- Figure 5.4: STM image of a copper
der the condition that k1 ∈ R, i.e. surface (Property of IBM)
V0 < E we find that the current for
ψ− (t, x) is
~k0
|c1 |2 − |c2 |)2
j− (t, x) = (5.21)
m
114
5.1. Potential step
2 !
~k0 k0 − k1
= |c1 |2 1−
m k0 + k1
~ 4k02 k1
= |c1 |2
m (k0 + k1 )2
and the current for ψ+ (t, x) is
~k1
|c3 |2 )
j+ (t, x) = (5.22)
m
2
~k1 2 2k0
= |c1 |
m k0 + k1
~ 4k02 k1
= |c1 |2
m (k0 + k1 )2
We see that the current for x < x0 is equal to the current for x >
x0 . This needs to be the case as for a stationary state we do not want
particle density to collect at the potential step. The ratio of reflected flux
to incoming flux is
m
~k0
|c2 |2
R = ~k (5.23)
0
m |c1 |2
2
k0 − k1
= ,
k0 + k1
and the ratio of transmitted flux to incoming flux is
~k1 2
m |c3 |
T = ~k (5.24)
2
m |c1 |
0
4k0 k1
= .
(k0 + k1 )2
For V0 > E we find that k1 becomes complex and
k0 − i|k1 | ∗
k0 − i|k1 |
R= (5.25)
k0 + i|k1 | k0 + i|k1 |
115
Chapter 5. Wave mechanics in exemplary potentials
= 1.
j+ (t, x) = 0. (5.27)
116
5.2. Potential barrier
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
42 ℏ2 42 ℏ2 42 ℏ2 42 ℏ2 42 ℏ2 42 ℏ2 42 ℏ2
V= V= V= V= V= V= V=
2m 2m 2m 2m 2m 2m 2m
0.4 0.4 0.4 0.4 0.4 0.4 0.4
2
-0.4 -0.4 -0.4 -0.4 -0.4 -0.4 -0.4
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.6 -0.6 -0.6 -0.6 -0.6 -0.6 -0.6
-5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25
x x x x x x x
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
0.6 52 ℏ2
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
V= V= V= V= V= V= V=
2m 2m 2m 2m 2m 2m 2m
0.4 0.4 0.4 0.4 0.4 0.4 0.4
2
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.4 -0.4 -0.4 -0.4 -0.4 -0.4 -0.4
-5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25
x x x x x x x
0.6 62 ℏ2
0.6 62 ℏ2
0.6 62 ℏ2
0.6 62 ℏ2
0.6 62 ℏ2
0.6 62 ℏ2
0.6 62 ℏ2
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
, Re(ψ(t,x)), Im(ψ(t,x))
V= V= V= V= V= V= V=
2m 2m 2m 2m 2m 2m 2m
0.4 0.4 0.4 0.4 0.4 0.4 0.4
2
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
|ψ(t,x)
-0.4 -0.4 -0.4 -0.4 -0.4 -0.4 -0.4
-5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25 -5 0 5 10 15 20 25
x x x x x x x
already guess what should happen. When the particle hits the potential
barrier part of the wave packet is reflected and a part is transmitted. When
the barrier is at a higher energy than the kinetic energy of the particle one
would classically expect the particle to be fully reflected. This is almost,
but not completely the case in quantum mechanics. Inside the barrier, the
wave function exponentially decays. What is left over of the wave function
after this exponential decay at the other end of the potential barrier will
generate a new wave packet that corresponds to part of the particle density
that is transmitted through the potential barrier.
In order to quantify the amount of particles that is transmitted and
reflected form a potential barrier we can look at the stationary states. We
follow the same line of argument as in the previous section where we solved
the stationary states of a potential step. We define ψ(t, x) = χ(t)φ(x),
χ(t) = e−i/~Et and
ik0 x + c e−ik0 x x ≤ −x
φ− (x) = c1 e
2 0
φ(x) = φ0 (x) = c3 eik1 x + c4 e−ik1 x −x0 ≤ x ≤ x0 (5.28)
ik
φ+ (x) = c5 e 0 x x0 ≤ x,
q q
with k0 = 2mE
~2
and k1 = 2m(E−V ~2
0)
.
117
Chapter 5. Wave mechanics in exemplary potentials
, Re(ψ(t,x)), Im(ψ(t,x))
V=
2m
2
scale with c1 . From these equations -0.5
|ψ(t,x)
we can calculate the current in the -1.0
-0.5
|c5 |2
|ψ(t,x)
T = (5.29)
|c1 |2 -1.0
-4 -2 0 2 4
1
= 2 , x
1 k0 k1
1+ 4 k1 − k0 sin2 (2k1 x0 ) 1.0
62 ℏ2
, Re(ψ(t,x)), Im(ψ(t,x))
V=
2m
0.5
|c2 |2
2
-0.5
R= (5.30)
|ψ(t,x)
|c1 |2
-1.0
= 1 − T. -4 -2 0 2 4
x
118
5.3. Delta function potential
119
Chapter 5. Wave mechanics in exemplary potentials
Schrödinger equation
∂ 2 φ(x) 2m
2
= 2 (V (x) − E)φ(x). (5.34)
∂x ~
As long as 2m
~2
(V (x) − E)φ(x) is finite the second derivative of φ(x) is finite
and φ(x) as well as its first derivative are continues. For V (x) = 2x0 V0 δ(x)
we need to realize that at x = 0 the potential is not finite. We thus need
to reanalyze the continuity of the wave function and its derivative.
whereby the 0+ in the limit means that is a positive number. When V (x)
and φ(x) are continues functions around x = 0 the limit of the integral
on the right converges to zero and we find that the derivative of φ(x) is a
continues function at x = 0. For the case where V (x) is given by a delta
function at x = 0 we find
Z
dφ(x) dφ(x) 2m
lim − = 2 lim (2x0 V0 δ(x) − E)φ(x)dx (5.36)
→0+ dx dx − ~ →0+ −
2m
= 2 2x0 V0 φ(0).
~
There is a discontinuity in the derivative of the wave function proportional
to the wave function at the point of the delta function and the strength of
the delta function potential (2x0 V0 ).
120
5.3. Delta function potential
c1 + c2 = c3 + c4 (5.37)
2m
−ik(c1 − c2 ) + ik(c3 − c4 ) = 2 2x0 V0 (c1 + c2 ).
~
There are two sets of solutions. For odd functions ψ(0) = 0 and the delta
function potential has no effect
121
Chapter 5. Wave mechanics in exemplary potentials
122
5.4. Particle in a box
We can combine the even and odd solutions into a single set of functions
(q nπ(x+x0 )
1
x0 sin( 2x0 ) |x| ≤ x0
φn (x) = , (5.47)
0 |x| ≥ x0
with n ∈ N+ .
The energy of these states is given as
R x0
φn (x)∗ Hφn (x)dx
E = −xR x00
∗
(5.48)
−x0 φn (x) φn (x)dx
123
Chapter 5. Wave mechanics in exemplary potentials
∗ − ~2 d2 φ (x) dx
R x0
−x0 φ n (x) 2m dx 2 n
= R x0
∗
−x0 φn (x) φn (x)dx
∗ ~2 π 2 n2
R x0
−x0 φn (x) 8mx20 φn (x)dx
= R x0
∗
−x0 φn (x) φn (x)dx
~2 π 2 n2
=
8mx20
We find that the more nodes the function has the higher its (kinetic)
energy is. This is a general property of wave functions. We can relate the
kinetic energy density at point x to the the norm square of the slope of the
wave function at point x.
Z ∞
~2 d2
Ekin =− φ(x)∗ 2 φ(x)dx (5.49)
2m −∞ dx
2
∞ Z ∞
~2
~ ∗ d d d
=− φ(x) φ(x) + φ(x)∗ φ(x)dx
2m dx −∞ 2m −∞ dx dx
2 Z ∞ 2
~ d
= φ(x) dx
2m −∞ dx
Whereby we used that the wave function is square integrable such that
d 2
limx→±∞ φ(x) = 0. As the integrant dx φ(x) is a strict positive function
we see that the kinetic energy increases if the norm of the slope of wave
function is large.
The set of stationary states that we find form an orthonormal set. (In
the next chapter we will generalise this statement ). Orthonormality of a
set of functions is defined as
Z ∞
φn (x)∗ φm (x)dx = δnm . (5.50)
−∞
We can prove the orthonormality of the set of stationary states found for a
124
5.4. Particle in a box
n
x0 2x0
We will not discuss the convergence of this expansion here but come back
to this in the next chapter. For now, we assume that there is a set of cn ’s
such that the left and right-hand sides become equal. We can calculate cn
by multiplying equation 5.52 with ψm (t, x)∗ = φm (x)∗ χm (t)∗ and integrate
over x.
Z ∞ Z ∞ X
ψn (t, x)∗ Ψ(t, x)dx = φn (x)∗ χn (t)∗ cm φm (x)χm (t)dx (5.53)
−∞ −∞
X Zm ∞
= cm χn (t)∗ χm (t) φn (x)∗ φm (x)dx
m −∞
X
= cm χn (t)∗ χm (t)δnm
m
= cn χn (t)∗ χn (t)
= cn .
125
Chapter 5. Wave mechanics in exemplary potentials
126
0
x < −x0
V (x) = V0 −x0 < x < x0
0 x0 < x
(5.55)
127
Chapter 5. Wave mechanics in exemplary potentials
-1
Tan[ x0 k1 ]
2 m x20 V0
ℏ2 k21 x20
-V0 large
2
-
1
-V0 small
0
0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0
x0 k1 / π
sin k1 x0
Figure 5.11: Graphical solution for the equation k0 = k1 cos k1 x0 , yielding
the discrete set of k vectors for the even bound states.
3
-1
-V0 large
-CoTan[ x0 k1 ]
2 m x20 V0
ℏ2 k21 x20
2
-
1
-V0 small
0
0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0
x0 k1 / π
128
5.5. Potential well
129
Chapter 5. Wave mechanics in exemplary potentials
tan(x0 k1 ), (5.64)
for the odd solutions in blue. The allowed vectors can be found at the
intersections of these graphs. This is done in figure 5.11 and figure 5.12 for
the odd and even solutions respectively. r
2mx2 V
In figure 5.11 we see in blue tan(x0 k1 ) and in red − ~2 k20x20 − 1. In
1 0
figure 5.13 we plotted a set of eigenfunctions for a potential well with
− 2mV
~2 kx2
0
= 100/π 2 , a value chosen to generate an aesthetic pleasing plot.
For states with V0 − E −V0 , i.e. those deeply bound in the potential
well we find solutions that are similar to the stationary states in a box.
For states where the binding energy is nearly zero E . 0, we find that the
eigenstates have exponential tails that substantially enter into the region
outside the potential well.
130
5.5. Potential well
The sum runs from 1 to N for a potential well with N bound states. As
N is finite it becomes immediately clear that in this case not all possible
wave packets can be represented as bound states inside the potential well.
Besides the bound states, there are also infinitely many stationary states
that are unbound. These functions can be found in the section on the
(b)
potential barrier. We can now use the set of bound (φn (x)) and unbound
(u)
(φk (x)) states to build our general wave packet.
N Z ∞
(u) (u)
X
Ψ(t, x) = cn φ(b) (b)
n (x)χn (t) + c(k)φk (x)χk (t)dk (5.67)
n=1 −∞
Naturally, such a wave packet will partially stay inside the well (the
part that is made of the bound stationary states) and partially propagate
towards infinity (the part that is made of the unbound stationary states
of a potential well). An important finding is that even if one finds a set
(b)
of stationary states (φn (x)) these might not be sufficient to express all
physically relevant wave packets as a sum over these states. We will discuss
the completeness of a set of stationary states, i.e. the question if we can
express all possible physical wave packets as a convergent sum over the set
of stationary states in the next chapter.
131
Chapter 5. Wave mechanics in exemplary potentials
∂ ~2 ∂ 2 1
i~ ψ(t, x) = − ψ(t, x) + kx2 ψ(t, x). (5.68)
∂t 2m ∂x2 2
In order to describe the dynamics of a quantum particle in a harmonic
potential, we will find solutions for the stationary states and use these to
build wave packets. For stationary states, we can write the wave function
as a product of a time and position-dependent part. The time-dependent
part is trivial to solve and yields χ(t) = e−i/~Et . The eigenvalue equation
for the position-dependent part is
~2 d2 φ(x) 1
− + mω 2 x2 φ(x) = Eφ(x), (5.69)
2m dx2 2
whereby we replaced k by the angular
q eigenfrequency of an undamped
k
classical harmonic oscillator ω = m . In order to reduce the writing
further we introduce the dimensionless variables
2E
= (5.70)
~ω
and
r
mω
y= x. (5.71)
~
With these substitutions we obtain the differential equation
d2 φ(y)
+ ( − y 2 )φ(y) = 0. (5.72)
dy 2
132
5.6. Harmonic oscillator
Within this section, we will use two different methods to solve this dif-
ferential equation. The first method uses general concepts on how to solve
partial differential equations, the second method uses operator methods
that we will formalize in the next chapter. After we found the stationary
states we discuss the resulting wave packets we can create.
Which we can test to find that this is an exact solution for the differential
equation
d2 φ∞ (y)
+ (1 − y 2 )φ∞ (y) = 0. (5.75)
dy 2
As − 1 y 2 for y → ±∞ we find that φ∞ (y) is a valid solutions in the
limit of y 2 → ∞ for all values of .
For our full solution we write
2 /2
φ(y) = h(y)e−y . (5.76)
133
Chapter 5. Wave mechanics in exemplary potentials
Substituting the expansion for hn (y) into equation 5.78 we find the relation
134
5.6. Harmonic oscillator
= 2n + 1, (5.82)
Combining all together we find that our normalized stationary states are
given as
s
1 2
φn (y) = n
√ Hn (y)e−y /2 , or (5.85)
2 n! π
r
1 mω 41
r
mω mω 2
φn (x) = n
Hn x e− 2~ x ,
2 n! ~π ~
with energy
1
E = ~ω(n + ). (5.86)
2
135
Chapter 5. Wave mechanics in exemplary potentials
~2 d2 mω 2 2
H=− + x , (5.87)
2m dx2 2
and
Z ∞ Z ∞
d ∗ d
φ(x) ϕ(x)dx = (− φ(x))∗ ϕ(x)dx (5.91)
−∞ dx −∞ dx
136
5.6. Harmonic oscillator
which hold for all φ(x) and ϕ(x), ∈ DH (see section 3.3). From this we see
that
Z ∞ Z ∞
∗
φ(x) a− ϕ(x)dx = (a+ φ(x))∗ ϕ(x)dx, (5.92)
−∞ −∞
or
a†+ = a− . (5.93)
Justifying the notation in terms of a† and a in equation 5.88.
We now solve for the eigenfunctions of H by first finding the ground-
state, i.e. the eigenstate with the lowest energy and then using our op-
erators
R∞ a† and a to build all other eigenstates. For all φ(x) ∈ DH with
2
−∞ |φ(x)| dx = 1, we have
Z ∞ Z ∞
∗ ∗ † 1
φ(x) Hφ(x)dx = ~ω φ(x) a a + φ(x)dx (5.94)
−∞ 2
−∞
Z ∞
1
= ~ω (aφ(x))∗ (aφ(x))dx +
−∞ 2
1
≥ ~ω.
2
In the last step we used that the integral over a real function that is equal
or larger than zero for all values of its argument is equal or larger than zero.
More precisely the integral can only be zero if aφ(x) = 0. As equation 5.94
defines the energy of the system we find that E ≥ 12 ~ω for all states. For
the state with the lowest eigenenergy we find that E = 21 ~ω. This state is
a solution to the differential equation aφ(x) = 0:
aφ0 (x) = 0 (5.95)
r r r !
1 mω ~ d
x+ φ0 (x) = 0.
2 ~ mω dx
137
Chapter 5. Wave mechanics in exemplary potentials
138
5.6. Harmonic oscillator
such that the energy is bound from below and the ground state φ0 (x) is
the state with lowest energy.
Given that φn (x) are eigenstates of the operator H = ~ω(a† a + 1/2)
with eigenenergies ~ω(En = n + 1/2) one finds that the states φn (x) are
also eigenstates of the operator N = a† a with eigenvalues n.
The operators a† and a relate the state φn to the state φn±1 . These
states differ in energy by a constant ±~ω. The operator a† that brings
139
Chapter 5. Wave mechanics in exemplary potentials
αn 2
= n + 1, (5.110)
αn+1
140
5.6. Harmonic oscillator
or
r
1
αn = . (5.111)
n!
To conclude we find
r
1 † n
φn (x) = (a ) φ0 (x), (5.112)
n!
√
aφn (x) = nφn−1 (x), (5.113)
and
√
a† φn (x) = n + 1φn+1 (x). (5.114)
with cn a set of complex numbers defining the state Ψ(t, x). We now follow
the same line of argumentation we used for a particle in a box (around
equation 5.52) and can calculate the expansion coefficients cn as
Z ∞
cn = ψn (t, x)∗ Ψ(t, x)dx. (5.117)
−∞
141
Chapter 5. Wave mechanics in exemplary potentials
This derivation requires that the eigen states of the Hamiltonian are or-
thogonal, i.e. that
Z ∞
φn (x)∗ φm (x)dx = δm,n , (5.118)
−∞
which one can check using the explicit representation of our eigen functions.
In the next chapter we will prove that non-degenerate eigenstates (En 6=
Em ) of any hermitian operator are orthogonal to each other.
relate to the deformation in time of the wave packet (see equation 4.37). For
a harmonic potential, the equidistant spacing of the eigenenergies allows us
to create a coherent state. We can define a wave packet that oscillates back
and forward in the harmonic potential trap like a classical particle, without
the density of the wave packet deforming.
We take
Ψ(t = 0, x) = φ0 (x − x0 ). (5.119)
with
Z ∞
cn = φn (x)∗ φ0 (x − x0 )dx. (5.121)
−∞
142
5.6. Harmonic oscillator
143
Chapter 5. Wave mechanics in exemplary potentials
It follows that
√
r
mω
cn+1 n + 1 = x0 cn , (5.125)
2~
or
r r n
1 mω
cn = x0 c0 . (5.126)
n! 2~
The value of c0 can be determined by directly integrating
Z ∞
c0 = φ0 (x)∗ φ0 (x − x0 )dx (5.127)
−∞
mωx20
= e− 4~ .
144
5.6. Harmonic oscillator
∞ r n
X
− mω (x0 e−iωt )2 1 mω
× e 4~ √ x0 e−iωt φn (x)
n! 2~
n=0
1 −2iωt −1 mω x2 −2iωt
= e 4 ((e )
~ 0 ) φ (x − x e−iωt )
0 0
If we now look at the particle density ρ(t, x) = Ψ(t, x)∗ Ψ(t, x) we find
r
mω − mω (x−x0 cos(ωt))2
ρ(t, x) = e ~ (5.132)
~π
145
Chapter 5. Wave mechanics in exemplary potentials
5.7 Summary
We have seen in this chapter several examples of solutions of the Schrödinger
equation with simple potentials. In order to understand the dynamics of a
particle, we define a wave packet (Ψ(t, x)) that represents this particle. We
can calculate the dynamics of a wave packet once we know the stationary
states:
with En the energy of the state. For the position dependent part φ(x) we
need to solve the eigenvalue equation of the Hamiltonian
Once we know the eigenstates of the Hamiltonian we can use the sta-
tionary states to build general wave packets
∞
X
Ψ(t, x) = cn ψ(t, x), (5.136)
n=0
with
Z ∞
cn = φn (x)∗ Ψ(t = 0, x)dx. (5.137)
−∞
146
Chapter 6
Linear algebra
In this chapter we will formalize the methods used in the previous two
chapters to solve problems in quantum mechanics. For a particle in free
space we have seen that we can Fourier transform the wave function Ψ(t, ~r)
to get
d Z
1 ~
φ(t, ~k) = √ Ψ(t, ~r)e−ik·~r dd~r. (6.1)
2π R d
The function φ(t, ~k) defines the Fourier components with wave vector ~k
of the wave packet Ψ(t, ~r). The time evolution of the Fourier components
can straightforwardly be calculated. For a particle in free space V (~r) is
constant and plane waves are stationary states such that
φ(t, ~k) = φ(~k)χ(t), (6.2)
~
with φ(~k) = c(~k)eik·~r , c(~k) a complex-valued function independent of ~r and
~
t, and χ(t) = e−iE(k)t . We can use these time-dependent Fourier compo-
nents φ(t, ~k) to write an arbitrary wave packet. For arbitrary potentials
plane waves are no longer the stationary solutions of the Schrödinger equa-
tion. For those systems a Fourier transform is not as efficient to solve the
Schrödinger equation. The stationary solutions of the Schrödigner equa-
tion are always given by the eigenstates of the Hamiltonian, but not always
147
Chapter 6. Linear algebra
6.1 States
In classical physics a microstate is described by all positions and momenta
of all particles involved. In quantum physics the microstate is defined by
the wave function. We will often drop the term micro in the reference to a
quantum (micro) state. A state in quantum mechanics is always considered
to be the description on a microscopic level.
We characterize the state of a quantum system using the notation in-
troduced by Dirac and write it as:
|ψi . (6.3)
We will call this the state vector ”ket” belonging to ψ. The ket for a wave
function ψ(t, ~r) is given as |ψi. We do not need to know the wave function
to write down a ket. For a Harmonic oscillator we have a set of normalized
stationary states that we can label by the quantum number n. The kets for
these states are |ni. We can work with these kets and know their associated
energy due to the lowering and raising operators, even if we do not know an
exact representation of the wave function. The fact that we can write down
the ket representing a state for which we do not know the wave function
becomes even more apparent if we look at the state of a Silver atom. We
have seen in chapter 1 where we discussed the Stern-Gerlach experiment
that the magnetic moment of a Silver atom is described by two internal
states. These two states are related to the angle the magnetic moment has
with respect to an external magnetic field. For the states of a Silver atom,
148
6.1. States
we can write the kets representing these states as |αi and |βi, without
detailed knowledge on the wave function they represent.
with c ∈ C. For c 6= 0, |αi and c |αi describe the same state. The multipli-
cation with c = 0 is special as this results in a null ket.
• Abelian
149
Chapter 6. Linear algebra
• Associative
(|αi + |βi) + |γi = |αi + (|βi + |γi) ∀ |αi , |βi , |γi ∈ V (6.9)
• Existence of a 0-vector
• Existence of an inverse-vector
• Associative
• Distributive
The property that 0 |αi = 0 |βi = 0 follows from the above definitions.
150
6.1. States
151
Chapter 6. Linear algebra
6.1.6 Basis
The basis B of a vector space V is defined as a set of linearly independent
vectors with the dimension (n) of B equal to the dimension (n) of V .
The equation
n
X
a |αi − bi |βi i = 0, (6.20)
i=1
has the trivial solution a = 0 and bi = 0 ∀i. If this was the only solution
then vector |αi would be linearly independent of the vectors |βi i and the
set B would not be the largest possible set of linearly independent vectors
152
6.1. States
As the set {|β1 i , . . . , |βn i} is the basis of V we know that the only solution
for equation 6.22 is the one where bi − ci = 0 ∀i. We thus find that
bi = ci ∀i and there is only one solution possible for the projection of a
state onto the basis.
This is an important mathematical result that has physical implications.
For a Silver atom we can define two state vectors and describe the magnetic
orientation for any Silver atom one can encounter in nature as a linear
superposition of these two state vectors. This expansion is unique and
complete.
153
Chapter 6. Linear algebra
All properties derived for the ket vectors in the previous section also hold
for the bra vectors. In particular if the vectors {|β1 i , . . . , |βn i} form a basis
for an n dimensional vector space V , then the vectors {hβ1 | , . . . , hβn |} form
a basis for the n dimensional vector space V † .
c = hα|βi , (6.24)
• Linearity
154
6.2. Hilbert space
• Positive inner product of the vector with its own dual representation
155
Chapter 6. Linear algebra
We can prove this by realizing that any state vector can be written as a
linear combination of the basis state vectors (equation 6.19). If we mul-
tiply equation 6.19 with hβj | from the left the relation for the expansion
coefficients bi follow.
156
6.2. Hilbert space
157
Chapter 6. Linear algebra
V = Va ⊗ Vb , (6.39)
B = {|α1 i |β1 i , |α1 i . . . , |α1 i |βnb i , . . . |αna i |β1 i , |αna i . . . , |αna i |βnb i}.
(6.40)
The choice on how to write down the product states is a matter of prefer-
ence.
158
6.3. Relation between bra ket notation and wave functions
n = na nb . (6.43)
|ψi . (6.45)
For a single particle at position ~r we can define the state vectors |~ri.
The vectors |~ri define the basis of an infinite dimensional vector space. A
general state |ψi can be expressed on the basis |~ri as
Z
|ψi = ψ(t, ~r) |~ri d~r, (6.46)
Rd
with ψ(t, ~r) the wave function representing the state |ψi. In order to express
the wave function in terms of an inner product of two state vectors we can
multiply equation 6.46 from the left with h~r1 |
Z
h~r1 |ψi = ψ(t, ~r) h~r1 |~ri d~r (6.47)
Rd
159
Chapter 6. Linear algebra
Z
= ψ(t, ~r)δ(~r1 − ~r)d~r
Rd
= ψ(t, ~r1 ).
The wave function ψ(t, ~r) at point ~r describes the complex coefficient of the
vector for the position basis state |~ri. Similarly the state |~r1 i corresponds
to the wave function δ(~r − ~r1 ).
With the relation between states and wave functions we can define the
inner product of two wave functions. Starting from the inner product of
two states hψ1 |ψ2 i we insert equation 6.46 twice to get
Z † Z
hψ1 |ψ2 i = ψ1 (t, ~r1 ) |~r1 i d~r1 ψ2 (t, ~r2 ) |~r2 i d~r2 (6.48)
Rd Rd
Z Z
= ψ1∗ (t, ~r1 )ψ2 (t, ~r2 ) h~r1 |~r2 i d~r1 d~r2
d d
ZR ZR
= ψ1∗ (t, ~r1 )ψ2 (t, ~r2 )δ(~r1 − ~r2 )d~r1 d~r2
d d
ZR R
= ψ1∗ (t, ~r)ψ2 (t, ~r)d~r.
Rd
We can relate the inner product of two state vectors to the integral over
the wave functions. We can generalize the previous equations to N particles.
Two N particle quantum states can be represented by the wave functions
ψ1 (t, ~r1 , . . . , ~rN ) and ψ2 (t, ~r1 , . . . , ~rN ). These states can equivalently be
represented by the state vectors |ψ1 i and |ψ2 i. The inner product between
these two states is given as
Z Z
hψ1 |ψ2 i = ... ψ1 (t, ~r1 , . . . , ~rN )∗ ψ2 (t, ~r1 , . . . , ~rN )d~r1 . . . d~rN . (6.49)
Rd Rd
160
6.4. Operators
6.4 Operators
Besides state vectors, we have operators in quantum mechanics. An oper-
ator takes one state vector and transforms it into another state vector. In
the previous chapters we have already encountered several operators acting
on wave functions. Due to the possible mapping between wave functions
and state vectors ψ(t, ~r) ⇐⇒ |ψi we can identify operators acting on
wave functions and rewrite them as operators acting on state vectors. An
operator acting on a wave function takes one wave function as its argument
and produces a wave function as its output. The Schrödinger equation
∂
(i~ ∂t ψ(t, ~r) = Hψ(t, ~r)) is given by two operators. The Hamilton operator
∂
H on the right-hand side as well as the time derivative operator i~ ∂t on
the left-hand side of the equal sign. We have also seen the position oper-
ator ~ r = ~r as well as the momentum operator p ~ = −i~∇ in the previous
chapters.
Operators are not only used for the description of the Schrödinger equa-
tion, they are also related to the values obtained in an experiment, called
observables. We have discussed the measurement of the position and ve-
locity (momentum) of a particle and have seen that these are given by
distribution functions derived from the wave function in coordinate or mo-
mentum space. In the last section of this chapter, we will generalize this
to the observation made for any type of experiment. Here we first define
several properties of operators.
161
Chapter 6. Linear algebra
A + B = B + A, (6.55)
and associative
A + (B + C) = (A + B) + C (6.56)
with a, b ∈ C.
Operators act on a ket vector from the left and on a bra vector from
the right. Given the Hilbert space V with element |αi we can define the
dual Hilbert space V † with element hα|. For an operator O acting between
Hilbert spaces V and V † we define
162
6.4. Operators
For two state vectors |αi and |βi in Hilbert space V as well as an operator
acting between V and V † we have
D E D E∗
β O α = α O† β . (6.60)
O † = O. (6.62)
For finite sized Hilbert spaces all Hermitian operators are self-adjoint.
AB = BA + [A, B] (6.63)
163
Chapter 6. Linear algebra
(AB)† = B † A† (6.65)
This operator projects any state vector to the state vector |βi with a com-
plex coefficient given by the inner product hα|γi.
This operator is equivalent to the identity operator and leaves all state
vectors unaltered. For a general state |αi ∈ V we can write
n
X
|αi = aj |βj i . (6.69)
j=1
164
6.4. Operators
165
Chapter 6. Linear algebra
We can write the expectation value of an operator with the use of matrix
multiplication as
o1,1 o1,2 ... o1,n a1
o2,1 o2,2
... o2,n a2
hγ|O|αi = c∗1 c∗2 . . . c∗n · .
.. · .. . (6.78)
.. ..
.. . . . .
on,1 on,2 ... on,n an
Whereby one must take care to use the same dual basis for both the operator
representation as well as for the representation of the bra and ket state
vectors.
166
6.4. Operators
with
ui,j = hβj |γi i . (6.83)
For the back transformation we can also express the basis vectors of B in
the basis vectors of C
X
|βj i = ũj,i |γi i , (6.84)
i
167
Chapter 6. Linear algebra
with
We can define the matrix U with matrix elements ui,j such that a basis
transformation from B to C is given as
C = UB (6.86)
B = U †C (6.87)
whereby we used the dagger as symbol for the Hermitian conjugate which we
take within a Hilbert space equivalent to the complex conjugate transpose
of a matrix.
In particularly we find that
B = U † U B, (6.88)
U †U = 1 (6.89)
or
U † = U −1 . (6.90)
168
6.4. Operators
X
= ci |γi i .
i
b = UT c (6.94)
∗
c = U b.
Vectors on a basis transform with the complex conjugate of the matrix with
which the basis states themselves transform.
169
Chapter 6. Linear algebra
For an arbitrary choice of basis, we can write this as a matrix equation with
O an operator represented as an n by n matrix and 1 an n by n dimensional
unity matrix acting on a coefficient vector ψi of |ψi i
(O − λ1)ψi = 0. (6.97)
This equation has the solution ψi = 0 which is not the one we are after, and
solutions for all values of λ where the determinant of the matrix (O − λ1)
is zero. For an n-dimensional matrix the determinant
|O − λ1| = 0, (6.98)
We also have
D E
ψi O † ψi = hoi ψi |ψi i (6.101)
170
6.4. Operators
= o∗i .
For a Hermitian operator these are equivalent. We find that the eigenvalues
for a Hermitian operator are real. This is a restatement of the results from
equation 3.51 in section 3.7.
Orthonormality of an eigenbasis
For a Hilbert space of dimension n, we can define operators that act within
that Hilbert space as n by n matrices. The eigenvalues of such a matrix are
given by the zeros of an n-th order characteristic polynomial, which has n
solutions. We find n eigenvectors |ψi i. In this section we will show that
these eigenvectors define an orthonormal basis for V .
If |ψi i is an eigenvector of O then c |ψi i with c ∈ C is also an eigenvector
of O. We can choose the normalization of our eigenvectors such that
Either the two eigenstates are orthogonal (hψi |ψj i = 0) or they have the
same eigenvalue (oi = oj ). States with the same eigenvalue are called
degenerate states.
For an operator without degenerate eigenstates, the normalized eigen-
vectors are all orthogonal to each other. For an n dimensional Hilbert space
there are n eigenstates and they define an orthonormal basis for V .
171
Chapter 6. Linear algebra
U ∗ U T = 1. (6.106)
OA = U ∗ OB U T , (6.107)
172
6.4. Operators
A = U B, (6.110)
and oi 6= oi0 ∀ i 6= i0 . Note that we have chosen the indices such that i labels
states with different eigenvalues and j labels different states with the same
eigenvalue. For each oi we will take Ni as the number of linear independent
eigenstates of O that have eigenvalue oi . Ni is called the degeneracy or
multiplicity of the eigenstates with eigenvalue oi of operator O.
A general state |ψk i can be expanded on the eigenbasis as
(k)
X
|ψk i = ci,j |ωi,j i . (6.112)
i,j
173
Chapter 6. Linear algebra
(k)
We can obtain the expansion coefficients ci,j with the help of the projection
operator that projects a state onto the state |ωi,j i
174
6.5. Distributions and expectation values
X (k)
Y 1
= ci0 ,j (oi0 − oi00 ) ωi0 ,j .
oi − oi00
i0 ,j i00 6=i
The operator P i projects a state onto the Hilbert space Hoi . Note that we
can express P i in terms of O, but can not express P i,j in terms of operator
O.
175
Chapter 6. Linear algebra
complex measurement apparatus can yield a single value for the measure-
ment of an observable. But if it does, the values and their frequency follow
the rules outlined above.
We start with the experimental observation that the density of a particle
is given by ρ(t, ~r) = ψ ∗ (t, ~r)ψ(t, ~r). With this, we mean that if we have
an ensemble of N particles, each given by the same state |ψi then ρ(t, ~r)
defines the ensemble-averaged particle density. Whereby it is irrelevant if
we measure all particles at once or one after the other. We now need to
understand what this implies for the measurement of the position of a single
particle and what this implies for the measurements of other operators than
the position operator for a single particle.
If we know that the average over the measurement of the position of
many particles starting from the same state ψ(t, ~r) yields the distribution
function ρ(t, ~r) then the probability to find the position ~r to be in the
volume V for a single measurement of the position is given by the value
r)dd~r. The more involved problem is to understand what this implies
R
V ρ(t, ~
for the measurement of other physical properties.
Instead of the position of a particle, we can look at the velocity, or
momentum, of a particle. For this we can Fourier transform the wave func-
tion from position space to momentum space (see section 4.2.4 or equation
4.42). As a result, we obtain the wave function as a function of the momen-
tum of the particle φ(t, p~). Consistent with the definition of the particle
density distribution we take the momentum distribution of a particle to be
ρ(t, p~) = φ∗ (t, p~)φ(t, p~). For a single particle given by the state φ(t, p~) you
will
R measure the momentum p~ to be in the volume V with the probability
ρ(t, p
~)ddp
~ .
V
We can quantify the density distribution ρ(t, ~r) or the momentum dis-
tribution ρ(t, p~) by the moments of these distributions. In section 4.2.4 we
defined the nth moment of the momentum distribution as the expectation
value of the operator (~p)n (see equation 4.44). In equation 4.53 and 4.54 we
defined two equivalent ways to determine the moments of the momentum
distribution. This concept is so important that we repeat these equations
176
6.5. Distributions and expectation values
177
Chapter 6. Linear algebra
X
= oni |ωi ihωi | .
i
µ(n) n
o (t) = hψ(t)|O |ψ(t)i (6.127)
X
= hψ(t)|ωi i oni hωi |ψ(t)i
i
X
= |hωi |ψ(t)i|2 oni .
i
The function ρi (t, E) is known as the partial density of states. Note that
Z ∞ Z ∞X
ρi (t, E)dE = |hφj |ψi (t)i|2 δE,Ej dE (6.130)
−∞ −∞ j
X
= |hφj |ψi (t)i|2
j
178
6.5. Distributions and expectation values
X
= hψi (t)|φj i hφj |ψi (t)i
j
R E+dE
The integral E DOS(E)dE gives the information on the number of
states one finds in the energy interval E to E + dE. The density of states is
a very useful concept to describe many properties in physics. In section 1.1
we calculated the black body radiation. In order to do so, we determined
the density of electromagnetic modes, which is the density of states, inside
a cavity. As you will see in later lectures, the density of states plays a
central role in understanding many physical properties, from the stability
of Neutron stars to the functioning of semiconductor solar cells.
179
Chapter 6. Linear algebra
180
Chapter 7
181
Chapter 7. Symmetry and conservation laws
182
7.1. Symmetry and commutation relations
φk (x) = eikx ,
χk (t) = e−iE(k)t/~ ,
~2 k 2
E(k) = + V0 .
2m
It is tempting to think that for a system that is translational invariant
the equation T x0 H = H should hold. This is not true. We can easily see
that T x0 H 6= H by evaluating both T x0 H and H appleid to a stationary
state.
but
The energy of the shifted state is the same as the energy of the unshifted
state (E(k)). The shifted state is (for non degenerate states) given by the
same state ray, but not by the same state vector. What is true is that
T −1
x0 HT x0 = H, (7.7)
with T −1x0 T x0 = I. We can shift the state, apply the Hamiltonian and
shift it back and find the same state as we would have obtained if we just
applied the Hamiltonian to the original state. Multiplying equation 7.7 by
T x0 yields that
HT x0 = T x0 H. (7.8)
[T x0 , H] = 0. (7.9)
183
Chapter 7. Symmetry and conservation laws
[O, H] = 0. (7.10)
O † = O −1 . (7.11)
∂ ∂ D i i
E
hψ(t)|O|ψ(t)i = ψ(t = 0) e+ ~ Ht Oe− ~ Ht ψ(t = 0) . (7.14)
∂t ∂t
We changed the time dependence from the wave function to the opera-
tor, which sometimes is called the Heisenberg representation (contrasting
184
7.2. Symmetry and conservation laws
∂ ∂ D E
hψ(t)|O|ψ(t)i = ψ(t = 0) e+iHt/~ Oe−iHt/~ ψ(t = 0) . (7.15)
∂t ∂t
∂ D E
= ψ(t = 0) e+iHt/~ e−iHt/~ O ψ(t = 0) .
∂t
∂
= hψ(t = 0)|O|ψ(t = 0)i .
∂t
= 0.
∂ +iHt/~ −iHt/~
e Oe = 0. (7.16)
∂t
185
Chapter 7. Symmetry and conservation laws
186
7.2. Symmetry and conservation laws
187
Chapter 7. Symmetry and conservation laws
= e−~r0 ·∇ .
is conserved for all values of ~r0 . With ρp~ (t, p~) = φ∗ (t, p~)φ(t, p~) the momen-
tum distribution of the particle. If the Fourier transform of the momentum
distribution of the particle is a conserved quantity, then the momentum dis-
(n)
tribution itself is a conserved quantity. In particular we find that µr̂0 ·~p , the
nth moment of the momentum distribution in the ~r0 direction (see section
4.2.4) is conserved.
∂
If [H, T ~r0 ] = 0 for all ~r0 in Rd then ∂t ρp~ (t, p~) = 0. It follows that
ρp~ (t, p~) = ρp~ (t = 0, p~). For systems with translational invariance the mo-
mentum distribution of a particle is conserved. In particular the first mo-
ment of this distribution is conserved. Translation symmetry leads to mo-
mentum conservation. This is a result we, without commenting, already
obtained in chapter 4 on wave mechanics. The propagating wave packets
discussed in that chapter had a momentum distribution, or equivalent a
wave vector distribution ρk (t, k) that is independent of time.
In chapter 2 we defined the momentum operator as ~p = −i~∇. We here
find that if a system has translational symmetry, then the expectation value
of the operator −i~∇ is conserved. This is an other reason to define the
operator −i~∇ as the operator representing the momentum of the particle.
188
7.2. Symmetry and conservation laws
Note that this is not a proof that the momentum operator needs to be
~p = −i~∇, but shows the internal consistency of our definitions.
T δt : t 7→ t − δt, (7.27)
translates the state by a time δt. If the potential in the Hamiltonian is time
independent we find that
[H, T δt ] = 0, (7.28)
and
∂
hΨ(t)|T δt |Ψ(t)i = 0. (7.29)
∂t
The value hΨ(t)|T δt |Ψ(t)i is a conserved quantity for all states |Ψ(t)i. We
here want to ask which physical observable this conserved quantity corre-
sponds to.
Following the same line of reasoning as we did in the previous section
(7.2.1) we want to express T δt as a differential operator. For a system
that is translational invariant we know that the potential is a constant
and the stationary states of the Hamiltonian are plane wave functions. A
system that is invariant under time translation can have an arbitrary, time
independent potential. The stationary states are not necessarily given by
plane waves. The stationary states do define a complete set that we can use
as a basis. We will write our states |Ψ(t)i as a sum over stationary states.
Any state |Ψ(t)i ∈ DH can be expanded on the set of stationary states as
X Z
Ψ(t, ~r) = cn φn (~r)χn (t) + c(κ)φ(κ, ~r)χ(κ, t)dκ. (7.30)
n
189
Chapter 7. Symmetry and conservation laws
Whereby φn (~r)χn (t) form the discrete set of stationary states and φ(κ, ~r)χ(κ, t)
form a continues set of stationary states of the Schrödinger equation. For
the stationary states we know that
i i
χn (t) = e− ~ En t , χ(κ, t) = e− ~ E(κ)t , (7.31)
Hφn (~r) = En φn (~r), Hφ(κ, ~r) = E(κ)φn (~r).
190
7.2. Symmetry and conservation laws
D ∂
E D i
E
We find that both Ψ(t) e−δt ∂t Ψ(t) and Ψ(t) e−δt ~ H Ψ(t) are con-
served quantities of motion. AsDthey are conservedE quantities of motion for
∂ n
any value of δt we know that Ψ(t) ∂t Ψ(t) and hΨ(t)|H n |Ψ(t)i are
= 0. (7.34)
∂ ∂ X ∗
hΨ(t)|H n |Ψ(t)i = c cn hφn1 |H n |φn2 i χ∗n1 (t)χn2 (t)
∂t ∂t n n n1 2
1 2
Z Z
∂
+ c(κ1 )∗ c(κ2 ) hφκ1 |H n |φκ2 i χ∗ (κ1 , t)χ(κ2 , t)dκ1 dκ2
∂t
∂ X ∗
= c cn δn ,n (En2 )n χ∗n1 (t)χn2 (t)
∂t n n n1 2 1 2
1 2
191
Chapter 7. Symmetry and conservation laws
Z Z
∂
+ c(κ1 )∗ c(κ2 )δ(κ1 − κ2 ) (E(κ2 ))n χ∗ (κ1 , t)χ(κ2 , t)dκ1 dκ2
∂t
Z
∂ X ∂
= |cn1 |2 (En1 )n + |c(κ1 )|2 (E(κ1 ))n dκ1
∂t n ∂t
1
= 0. (7.35)
The energy of a single stationary state φn (~r)χn (t) (φ(κ, ~r)χ(κ, t)) is En
(E(κ)). A general state Ψ(t, ~r) can be written as a linear superposition
of all stationary states as given in equation 7.30. The energy of Ψ(t, ~r) is
determined by the expansion coefficients of Ψ(t, ~r) on the stationary states
X Z
EΨ = |cn |2 En + |c(κ)|2 E(κ)dκ. (7.36)
n
192
7.3. Symmetry groups
193
Chapter 7. Symmetry and conservation laws
∀O 1 , O 2 ∈ G O 1 O 2 ∈ G. (7.39)
• Composition is associative
∀O 1 , O 2 , O 3 ∈ G (O 1 O 2 )O 3 = O 1 (O 2 O 3 ). (7.40)
∃O 1 = I ∈ G : O 1 O 2 = O 2 ∀O 2 ∈ G. (7.41)
∀O 2 ∈ G ∃O 1 ∈ G : O 1 O 2 = O 2 O 1 = I. (7.42)
194
7.4. Eigenfunctions and irreducible representations
195
Chapter 7. Symmetry and conservation laws
The previous paragraph made several definitions about groups and rep-
resentations. We can use these definitions to make several important state-
ments about the eigenstates of the Hamiltonian. The eigenfunctions of the
Hamiltonian are related to the eigenfunctions of the symmetry operators
of the system. The relations between the eigenstates of the Hamiltonian
and the eigenstates of the symmetry operators, or the irreducible repre-
sentations, become much easier if we assume that the eigenvalues of the
symmetry operators and the eigenvalues of the Hamiltonian are all non-
degenerate. We call the eigenvalues on of operator O degenerate if several
linearly independent eigenstates have the same eigenvalue. We call two
states degenerate (with respect to operator O) if they are linearly indepen-
dent and have degenerate eigenvalues:
For a state that is degenerate (with other states), we define the multiplicity,
or degeneracy N as the number of linear independent states |ψn i with
eigenvalue on , such that all other eigenstates of O have an eigenvalue that
is different from on .
We will restrict our proofs to non-degenerate states, the statements
made can be extended to the case with degeneracies. If [H, O] = 0 and
|ψi is a non-degenerate eigenstate of O then |ψi is also an eigenstate of H.
Given
we find
196
7.4. Eigenfunctions and irreducible representations
The state |ψi and H |ψi are both eigenstates of the operator O with the
same eigenvalue o. As the state |ψi is a non-degenerate eigenstate of opera-
tor O we know that |ψi and H |ψi describe the same state. Not necessarily
the same state vector, but the same state ray. We find
197
Chapter 7. Symmetry and conservation laws
we find
p2
H= . (7.52)
2m
198
7.4. Eigenfunctions and irreducible representations
The eigenstates of the Hamiltonian Hψk (x) = Ek ψk (x) are given by plane
waves
with eigenvalues
~2 k 2
E(k) = . (7.54)
2m
The Hamiltonian commutes with the translation operator T x0 : x 7→
x − x0 and the inversion operator ix0 : x 7→ x0 − x. These two symmetry
operators define a group. The eigenstates and eigenvalues of the translation
operator become evident if we act with the translation operator on a plane
wave
Such that the states eikx are eigenvectors of the translation operator with
eigenvalue e−ikx0 . For infinitesimal small x0 each eigenstate of T x0 has its
own eigenvalue such that the states are non-degenerate. We thus find that
the eigenstates of the translation operator are also the eigenstates of the
Hamiltonian. The eigenstates and eigenvalues of the inversion operator ix0
are given as
Z ∞
1 1
ix0 c(k) eik(x− 2 x0 ) ± e−ik(x− 2 x0 ) dk = (7.56)
−∞
Z ∞
1 1
± c(k) eik(x− 2 x0 ) ± e−ik(x− 2 x0 ) dk,
−∞
with c(k) a complex function. The inversion symmetry operator only dis-
tinguishes between even and odd states. If we look at the eigenstates of the
Hamiltonian we find that the state with momentum k is degenerate with
the state with momentum −k. The state eikx is not an eigenstate of the
inversion symmetry operator. The state eikx is neither even nor odd. We
can make a linear combination between the degenerate states at k and −k
199
Chapter 7. Symmetry and conservation laws
2 2 k
to generate an even and an odd state with energy ~2m . This is possible
because [T x0 , H] = 0 and [ix0 , H] = 0, but [T x0 , ix00 ] 6= 0.
All non-degenerate eigenstates of the symmetry operator T x0 are si-
multaneously eigenstates of the Hamiltonian. Linear combinations of the
degenerate eigenstates of the Hamiltonian with momentum k and −k are
eigenstates of the inversion operator ix0 .
R : ~r 7→ R~r (7.58)
The rotation matrices for a rotation around the x, y or z axis are given as
1 0 0
Rx (α) = 0 cos(α) − sin(α) , (7.59)
0 sin(α) cos(α)
cos(β) 0 sin(β)
Ry (β) = 0 1 0 , (7.60)
− sin(β) 0 cos(β)
200
7.5. Rotational symmetry and ...
and
cos(γ) − sin(γ) 0
Rz (γ) = sin(γ) cos(γ) 0 . (7.61)
0 0 1
201
Chapter 7. Symmetry and conservation laws
and
i
Ry (β) = e− ~ (zpx −xpz )β . (7.67)
∂ D i
E
ψ(t) e− ~ Li θ ψ(t) = 0. (7.70)
∂t
For such a system we can define a distribution function for the measurement
of the angular momentum ρ~l(t, ~l), which is time-independent.
202
7.5. Rotational symmetry and ...
203
Chapter 7. Symmetry and conservation laws
P
operators {O 1 , O 2 , . . .}, such that the operator O = i ci O i for some val-
ues ci is non-degenerate. For the first operator of this set we can take one
of the components of the operator L. ~ For a further operator we can search
for an operator that commutes both with L ~ and H. In order to find an
operator that commutes with the operator L ~ as well as with the Hamilto-
nian we can have a look at polynomial functions of L. ~ These polynomial
~
functions in L will, by construction, also commute with H. The simplest
symmetric combination we can make is L2 = L ~ · L.
~ For the commutator of
2
Lz with L we find
L+ = Lx + iLy , (7.75)
and
L− = Lx − iLy , (7.76)
204
7.5. Rotational symmetry and ...
~
as the raising and lowering operators. The angular momentum operator L
†
is Hermitian L~ = L.~ We thus find that
L†+ = L− . (7.77)
The reason why we call these the raising and lowering operators will become
clear later in this chapter. For the commutation relations we find
and
[L± , L2 ] = 0. (7.81)
and
205
Chapter 7. Symmetry and conservation laws
and
we have
or
206
7.5. Rotational symmetry and ...
207
Chapter 7. Symmetry and conservation laws
r cos θ
shown in figure 7.2 rs y
θ
in
ᵠ sin
ᵠ θ
x = r cos ϕ sin θ (7.96)
os
rc
r sin ᵠ sin θ
y = r sin ϕ sin θ x
such that
∂ ∂
Lz = −i~ x −y (7.98)
∂y ∂x
208
7.5. Rotational symmetry and ...
∂
= −i~ ,
∂ϕ
and
∂ ∂ ∂ ∂
L± = ~ −i y −z ± z −x (7.99)
∂z ∂y ∂x ∂z
∂ cos θ ∂
= ~e±iϕ ± +i .
∂θ sin θ ∂ϕ
209
Chapter 7. Symmetry and conservation laws
state, with half-integer values of the angular momentum. These states are
however not representable by a wave function that assigns a single complex
number to each point in space. In section 8.6.1 we will encounter the half-
integer angular momentum to represent spin and the accompanying wave
function to be given by a function that assigns a complex vector to each
point in space.
We used the conventional (Condon Shortley) phases for the spherical har-
monics. The functions are normalized such that the integral over the norm
square of the wave function is one. We can find the states for general m by
applying the raising and lowering operators to the two states with extremal
m = ±l. The general solutions are known as the spherical harmonics. For
m ≥ 0 we have
r s
2l + 1 (l − m)!
Yl,m (θ, ϕ) = Pl,m (cos(θ))eimϕ , (7.106)
4π (l + m)!
210
7.5. Rotational symmetry and ...
211
Chapter 7. Symmetry and conservation laws
212
7.5. Rotational symmetry and ...
213
Chapter 7. Symmetry and conservation laws
of the quantization axis, one finds a set of functions that are linear com-
binations of the states we have found for the specific choice of quantizing
in the z direction, i.e. taking the Lz operator. The quantum number l
labels the total angular momentum. The total angular momentum l distin-
guishes states that belong to different irreducible representations. States
with a different total angular momentum l will generally have a different
energy. The quantum number m labels the degenerate states within this
representation. We can thus make linear combinations of states with the
same l but different m without loss of generality. The spherical harmonics
are complex functions We can make linear combinations of the complex
spherical harmonics that are real. These real functions are known as the
Tesseral harmonics
We define the tesseral Harmonics Zl,m (θ, ϕ) as:
Yl,m (θ, ϕ) m=0
1 m
Zl,m (θ, ϕ) = √2 (Yl,−m (θ, ϕ) + (−1) Yl,m (θ, ϕ)) m > 0 (7.110)
√i (Yl,m (θ, ϕ) − (−1)m Yl,−m (θ, ϕ)) m < 0
2
such that all functions are real. We plot the tesseral harmonics for l ≤ 3 in
figure 7.5.
214
7.5. Rotational symmetry and ...
~ 2 = ~r2 × ~p2 .
L
~ of particle 1 and 2
We will define the total angular momentum operator L
as the sum of the angular momentum operators of the individual particles.
~ =L
L ~1 +L
~ 2. (7.112)
In this section we will show that the operator L ~ represents the physical
sensible total angular momentum. We will furthermore show how to diag-
~ relate to the eigenstates
onalize this operator and how the eigenstates of L
~ 1 and L
of L ~ 2.
~ =L
Commutation relations of L ~1 +L
~2
~ =L
For operator L ~ 1 +L ~ 2 to be a useful definition of the total angular mo-
mentum it must fulfill the commutation relations of an angular momentum
~ 1 and L
operator. As L ~ 2 act on different particles with different coordinates
we have
L+ = L + 1 + L+ 2 (7.115)
L− = L − 1 + L− 2 ,
215
Chapter 7. Symmetry and conservation laws
which have the same commutation relations as those for the angular mo-
mentum operators acting on a single particle. With these relations we can
derive all equations we found for the properties of the angular momentum
operators in section 7.5.2. In particularly we find that the operator L2 has
eigenstates with eigenvalue l(l + 1)~2 that are 2l + 1 fold degenerate. The
operator Lz has simultaneous eigenstates with the operator L2 . One can
use the eigenstates of Lz to label the degenerate states of the operator L2
uniquely from m = −l to m = l.
Similar we find for particle 2 with total angular momentum l2 the basis of
the Hilbert space to be
The basis of the Hilbert space for all product states is given by the tensor
product of B1 and B2
B = B1 ⊗ B2 (7.118)
= {|l1 −l1 l2 −l2 i , . . . |l1 l1 l2 −l2 i , |l1 −l1 l2 −l2 +1i , . . . ,
|l1 l1 l2 −l2 +1i , . . . , |l1 l1 l2 l2 i}.
216
7.5. Rotational symmetry and ...
Note that we will use all three of the notations |l1 m1 i⊗|l2 m2 i = |l1 m1 l2 ml2 i =
|l1 m1 i |l2 m2 i for product states, depending on the situation at hand. For a
given l1 there are 2l1 +1 states |l1 m1 i with different m1 possible. Similar for
a given l2 there are 2l2 + 1 states |l2 m2 i with different m2 possible. The re-
sult is that for a given l1 and l2 there are (2l1 + 1)(2l2 + 1) states |l1 m1 l2 m2 i
with different m1 and m2 possible. The dimension of the Hilbert spaces are
2l1 + 1 and 2l2 + 1 for particle 1 and 2 respectively and (2l1 + 1)(2l2 + 1)
for the tensor product space of the two particles simultaneously.
Instead of labeling the states as product states |l1 m1 i ⊗ |l2 m2 i we can
arrange the states in the product basis according to their total angular
momentum l and m into the states |lmi. The states |lmi are eigenvectors
of L2 and Lz . We now want to find the allowed values of l and m for the
states |lmi such that we generate a basis for the product Hilbert space.
We immediately know that we need to find (2l1 + 1)(2l2 + 1) of such states
|lmi as that is the dimension of the product Hilbert space. Classically we
would expect that l relates to the length of the angular momentum vector.
If classically ~l1 and ~l2 are parallel we find that |~l| = |~l1 | + |~l2 |, whereas when
~l1 and ~l2 are anti-parallel we find that |~l| = |~l1 | − |~l2 |. We thus could expect
that quantum mechanically the total angular momentum l is given by
l ∈ [|l1 − l2 |, l1 + l2 ] . (7.119)
In order to prove that this is indeed the case for the quantum mechanical
description of a system with spherical symmetry, we use the fact if a state
|lmi with total angular momentum l is in the basis of the Hilbert space,
then all 2l + 1 states with m ∈ [−l, l] must be in the Hilbert space. We
will furthermore count the number of states with total angular momentum
m = m1 + m2 in the basis B. We know that
Lz |l1 m1 l2 m2 i = (L1z + L2z ) |l1 m1 l2 m2 i (7.120)
= (m1 + m2 ) |l1 m1 l2 m2 i
= m |l1 m1 l2 m2 i ,
with m = m1 + m2 . As such we can count the number of states |l1 m1 l2 ml2 i
with a given eigenvalue m = m1 + m2 of operator Lz in the basis B and
know the number of states |lmi with a given value of m.
217
Chapter 7. Symmetry and conservation laws
Table 7.1: Product basis of two states |l1 m1 i and |l2 m2 i for fixed value of
l1 and l2 (l2 < l1 ) sorted according to m = m1 + m2 . Either given as states
on the product basis |l1 m1 l2 m2 i = |m1 m2 i or on the basis listing total
angular momentum |lmi = |li. The value of n = 2 min(l1 , l2 ) labels the
border between different behaviour of the number of states as a function of
m. For m < 0 the table is similar as the table for m > 0.
In the third column of table 7.1 we show the possible states |lmi for
different values of l at a fixed value of m. The maximum value for m is
l1 + l2 . The maximum value of l is thus l1 + l2 . There are two states with
m = l1 + l2 − 1 in basis B. There are thus two eigenstates |lmi of L2 and
218
7.5. Rotational symmetry and ...
lX
1 +l2
Dim(B) = 2l + 1 (7.121)
l=|l1 −l2 |
in agreement with the dimension found for the tensor product basis of B1
and B2 .
l1
X l2
X
|lmi = Cllm
1 m1 l2 m2
|l1 m1 l2 m2 i , (7.122)
m1 =−l1 m2 =−l2
219
Chapter 7. Symmetry and conservation laws
and
lX
1 +l2 l
X
|l1 m1 l2 m2 i = Cllm
1 m1 l2 m2
|lmi , (7.123)
l=|l1 −l2 | m=−l
with Cllm1 m1 l2 m2
real numbers known as the Clebsch-Gordan coefficients.
Note that in principle the basis transformation could be complex. As we
will show below it turns out that the Clebsch-Gordan coefficients are real
for the coupling of angular momentum. If we multiply equation 7.122 from
the left with hl1 m1 l2 m2 | we find
Cllm
1 m1 l2 m2
= hl1 m1 l2 m2 |lmi . (7.124)
The Clebsch-Gordan coefficients tell you how the states |l1 m1 i and |l2 m2 i
couple to form the state |lmi, an eigenstate of the total angular momentum
operators L2 and Lz .
Cllm
1 m1 l2 m2
=0 if m 6= m1 + m2 . (7.125)
Cllm
1 m1 l2 m2
=0 if l ∈
/ [|l1 − l2 |, l1 + l2 ]. (7.126)
220
7.5. Rotational symmetry and ...
221
Chapter 7. Symmetry and conservation laws
and
Cll11+l 2 l1+l2
l1 l2 l2 = 1. (7.132)
222
7.5. Rotational symmetry and ...
Next we express the state |lli in terms of the product states |l1 m1 l2 m2 i
and Clebsch-Gordan coefficients
min(l1 ,l+l2 )
X
|lli = Clll1 ql2 l−q |l1 ql2 l−qi (7.134)
q=max(−l1 ,l−l2 )
Combining equation 7.133 and 7.134 yields for the state |lmi the ex-
pression
l−m
1
|lmi ∝ L− |lli (7.135)
~
l−m
1
= (L1− + L2− ) |lli
~
l−m
X l − m 1 r l−m−r
1
= L1− L2− |lli
r ~ ~
r=0
l−m
X X l − m r l−m−r
ll 1 1
= Cl1 ql2 l−q L1− L2− |l1 qi |l2 l−qi
q
r ~ ~
r=0
l−m min(l1 ,l+l2 )
X X l−m
= Clll1 ql2 l−q
r
r=0 q=max(−l1 ,l−l2 )
r−1
Yp p
× l1 + q − p l1 − q + p + 1 |l1 q − ri
p=0
l−m−r−1
Y p p
× l2 + l − q − p l2 − l + q + p + 1 |l2 m + r − qi .
p=0
We now multiply equation 7.135 from the left with hl1 m1 l2 m2 | and find
with q − r = m1 the expression
l1 l−m
X Y 1
Cllm
1 m1 l2 m2
=δm1 +m2 ,m √ √ (7.136)
q=m p=1
l + m + p l − m − p + 1
1
223
Chapter 7. Symmetry and conservation laws
Y1 −1
q−m
p p
× l1 + q − p l1 − q + p + 1
p=0
Y 1 −1
l−m−q+m
p p
× l2 + l − q − p l2 − l + q + p + 1
p=0
l−m ll
× C .
q − m1 l1 ql2 l−q
If we know the values for Clll1 ql2 l−q we found a closed expression for all
Clebsch-Gordan coefficients.
As a next step we express Clll1 ql2 l−q in terms of Clll1 l1 l2 l−l1 with the use of
the recurrence relations between Clebsch-Gordan coefficients. Acting with
the ladder operator L± on the state |lmi yields
√ √
L± |lmi = ~ l ∓ m l ± m + 1 |lm ± 1i , (7.137)
m01 m02
q q
0 0
l2 ∓ m2 l2 ± m2 + 1 l1 m1 l2 m2 ±1 Cllm
0 0
0
1 m l2 m
0 .
1 2
We now multiply both sides with hl1 m1 l2 m2 | from the left and find
√ √
l ∓ m l ± m + 1 hl1 m1 l2 m2 |lm ± 1i = (7.139)
p p p p
l1 ∓ m1 + 1 l1 ± m1 Cllm
1 m1∓1l2 m2
+ l2 ∓ m2 + 1 l2 ± m2 Cllm1 m1 l2 m2∓1
.
224
7.5. Rotational symmetry and ...
225
Chapter 7. Symmetry and conservation laws
(7.146)
We now combine equation 7.136, 7.142 and 7.146 to come to the final
result
l1 l−m
X Y 1
Cllm
1 m1 l2 m2
= δm1 +m2 ,m √ √ (7.147)
q=−m1 p=1
l + m + p l − m − p + 1
Y1 −1 p
q−m
p
× l1 + q − p l1 − q + p + 1
p=0
Y 1 −1
l−m−q+m
p p
× l2 + l − q − p l2 − l + q + p + 1
p=0
l−m
×
q − m1
l1 −q−1 √ √
Y l2 − l + l1 − r l + l2 − l1 + r + 1
l1 −q
× (−1) √ √
r=0
r + 1 2l1 − r
√ √ 2 −1/2
0
l1 l1 −q
Y−1
X l2 − l + l1 − r l + l2 − l1 + r + 1
× √ √ .
q 0 =−l1 r=0
r + 1 2l1 − r
This result can, with the aid of some algebra, be rewritten. The stan-
dard form in the literature one commonly finds is
s
(l1 + l2 − l)! (l1 − l2 + l)! (−l1 + l2 + l)!
Cllm
1 m1 l2 m2
= δm1 +m2 ,m (7.148)
(l1 + l2 + l + 1)!
X √
× (−1)t t! (l1 +l(2l+1) (l1 +m1 )! (l1 −m1 )! (l2 +m2 )! (l2 −m2 )! (l+m)! (l−m)!
2 −l−t)! (l1 −m1 −t)! (l2 +m2 −t)! (l−l2 +m1 +t)! (l−l1 −m2 +t)!
.
t
226
7.5. Rotational symmetry and ...
Whereby the sum over t includes all integers for which the factorials are
not taken of negative numbers.
If one defines a projector operator 6.4.8 that projects states from the full
Hilbert space for two electrons with angular momentum l1 and l2 onto the
subspace of eigenvectors of the operator L2 /~2 and Lz /~ with eigenvalues
l(l + 1) and m we obtain an expression containing only a single Clebsch-
Gordan coefficient.
lX 0
1 +l2 l
0 0
X
0 0
P lm |l1 m1 l2 m2 i = Cll1m
m1 l2 m2 P lm l m (7.150)
l0 =|l1 −l2 | m0 =−l0
= Cllm
1 m1 l2 m2
|lmi .
As Cllm
1 m1 l2 m2
= 0 when m 6= m1 + m2 we do not really need a projection
onto the eigenstates of Lz /~ but can use that a projection on the eigenstates
227
Chapter 7. Symmetry and conservation laws
lX 0
1 +l2 l
0 0
X
0 0
P l |l1 m1 l2 m2 i = Cll1m
m1 l2 m2 Pl l m (7.151)
l0 =|l 1 −l2 | m0 =−l0
This expression still contains the unknown state |lmi. In order to re-
move this we take the inner product of two coupled state projected to the
same total angular momentum l
D E
l1 m01 l2 m02 P †l P l l1 m1 l2 m2 = δm0 ,m01 +m02 δm,m1 +m2 (7.152)
0
× Cllm 0
1 m l2 m
lm
0 Cl m l m
1 1 2 2
lm0 lm
1 2
228
7.5. Rotational symmetry and ...
For the rotation around the x or y direction the relation is not so simple.
The commutator between Lx or Ly and Lz is non-zero. Therefore also the
commutators
229
Chapter 7. Symmetry and conservation laws
We first of all note that a rotation of an angular momentum state does not
change the total angular momentum l. This is a direct result from the fact
that all components of L ~ commute with L2 . The numbers dl 0 define, for
mm
each l a square matrix d of dimension 2l + 1. The matrix d is known as a
Wigner d matrix. The elements of the Wigner d matrix are given as
D 1
E
dlm0 m (θ) = lm0 e 2~ (L− −L+ )θ lm (7.158)
We can relate the operator R(α, β, γ) to the Wigner D matrix, Dl (α, β, γ),
with matrix elements
l 0
Dm 0 m (α, β, γ) = lm R(α, β, γ) lm . (7.160)
We note that
l l
Dm 0 m (0, β, 0) = dm0 m (β), (7.161)
for a rotation around the y axis. Furthermore for a rotation around the z
axis we have
l −im α0
Dm 0 m (α, 0, 0) = e δm,m0 , (7.162)
and
l −imγ
Dm 0 m (0, 0, γ) = e δm,m0 , (7.163)
such that
0
−im α l
l
Dm 0 m (α, β, γ) = e dm0 m (β)e−imγ . (7.164)
230
7.5. Rotational symmetry and ...
Unitarity
The Wigner D matrix defines a unitary transformation between the states
|lmi and the rotated states |lm0 i. As such that matrix Dl (α, β, γ) is unitary
l
X
l
Dm 1m
l
(α, β, γ)Dm 2m
(α, β, γ)∗ = δm1 ,m2 . (7.166)
m=−l
For a rotation and inverse rotation acting on the state |lmi we find
X
l 0
R(α, β, γ) |lmi = Dm 0 m (α, β, γ) lm , (7.167)
m0
X
R(α, β, γ)−1 |lmi = l
Dmm 0 (α, β, γ)
∗
lm0 .
m0
At the same time we know that R(α, β, γ) represents a rotation such that
231
Chapter 7. Symmetry and conservation laws
We thus find
m0
q
2l+1
We now realize that Yl,m (0, 0) = δm,0 4π and find
r
2l + 1
Yl,m (θ, φ) = l
Dm0 (φ, θ, γ)∗ , (7.173)
4π
or
r
4π
l
Dm0 (α, β, γ)∗ = Yl,m (β, α). (7.174)
2l + 1
Clebsch-Gordan series
One property of the Wigner D matrix that we will need in the next section
l1 l2
is the Kronecker product of two matrices Dm 0 m (α, β, γ)Dm0 m (α, β, γ). In
1 1 2 2
order to derive this product we start from the expansion of a product state
|l1 m1 i |l1 m1 i into states of angular momentum |lmi
X
|l1 m1 i |l2 m2 i = Cllm
1 m1 l2 m2
|lmi . (7.175)
lm
232
7.5. Rotational symmetry and ...
X
l2 0
R(α, β, γ) |l2 m2 i = Dm 0 m (α, β, γ) l2 m2 .
2
2
m02
We find
X
0
Dml1 l2
0 m (α, β, γ)Dm0 m (α, β, γ) l1 m1
1 2
l2 m02 = (7.177)
1 2
m01 m02
XX
0
Cllm
1 m1 l2 m2
l
Dm 0 m (α, β, γ) lm .
lm m0
We now multiply from the left with hl2 m02 | hl1 m01 | to get
l1 l2
Dm 0 m (α, β, γ)Dm0 m (α, β, γ) = (7.178)
1 1 2 2
0
XX
Cllm C lm 0 0 Dm
1 m1 l2 m2 l1 m l2 m
l
0 m (α, β, γ).
1 2
lm m0
233
Chapter 7. Symmetry and conservation laws
The integrals over two spherical harmonics yield the orthogonality con-
ditions between the spherical harmonics
Z 2π Z π
Yl∗0 ,m0 (θ, φ)Yl,m (θ, φ) sin(θ)dθdφ = δm,m0 δl,l0 . (7.182)
0 0
234
7.5. Rotational symmetry and ...
state |nlmi represented by the wave function ψn,l,m (~r) = Rnl (r)Yl,m (θ, φ),
with Rnl (r) some radial function depending on the quantum numbers nl
and Yl,m (θ, φ) describing the angular dependence. We will often need to
evaluate integrals of the form
o = nlm O n0 l0 m0 . (7.184)
we find
XZ ∞
0 0 0 ∗
nlm O n l m = Rnl (r)O k,q (r)Rn0 l0 (r)dr (7.186)
k,q 0
Z 2π Z π
∗
× Yl,m (θ, φ)Yk,q (θ, φ)Yl0 ,m0 (θ, φ) sin(θ)dθdφ
0 0
s
XZ ∞ (2k + 1)(2l0 + 1) lm
∗
= Rnl (r)O k,q (r)Rn∗ 0 l0 (r)dr l0
Ckql0 m0 Ck0l 00.
0 4π(2l + 1)
k,q
235
Chapter 7. Symmetry and conservation laws
For the potential energy we have the operator H (pot.) = V (~r) and matrix
elements
(pot.)
Hi,j = hni li mi |V (~r)|nj lj mj i . (7.188)
4π
Al,m = p p (7.190)
(2l + 1) (l − m)! (l + m)!
× ∂zl−|m| (sign(m)∂x + i∂y )|m| V (r, θ, φ) .
r=0
With this expansion we find for the potential energy matrix elements
(pot.)
Hi,j = hni li mi |V (~r)|nj lj mj i (7.191)
∞ X
X l D E
= Al,m ni li rl nj lj hli mi |Yl,m (θ, φ)|lj mj i
l=0 m=−l
s
∞ X
l
X D
l
E (2l + 1)(2lj + 1) li mi li 0
= Al,m ni li r nj lj Clmlj mj Cl0l j0
.
4π(2li + 1)
l=0 m=−l
~
Expansion of eik·~r on spherical Harmonics
For the interaction ofE particles with light we encounter the matrix elements
D
~ ~
ni li mi eik·~r nj lj mj . In order to evaluate these we can expand eik·~r on
spherical harmonics. In principle we can use equation 7.189 and 7.190 and
expand the plane wave in powers of r. For large r this leads to a divergent
interaction. This is true for general potential expansions as discussed in
236
7.5. Rotational symmetry and ...
237
Chapter 7. Symmetry and conservation laws
1
Expansion of |~
r1 −~r2 | on spherical harmonics
Many interactions between two particles fall off as 1/r with r the distance
between the two particles. In order to evaluate the matrix elements of
the interaction between two particles we thus want to expand the function
1
|~
r1 −~r2 | on spherical harmonics. One finds
∞ X
l
1 X 1 min(r1 , r2 )l ∗
= Yl,m (θ1 φ1 )Yl,m (θ2 φ2 ). (7.196)
|~r1 − ~r2 | 2l + 1 max(r1 , r2 )l+1
l=0 m=−l
For the calculation of matrix elements, one needs to take into account
that the wave function of two fermions (bosons) needs to be antisymmetric
(symmetric) under the exchange of coordinates. We will discuss this in
more detail in chapter 9.
The operator R(θ, n̂) rotates the state |φi by an angle θ around the unit
vector n̂ and is given as
i ~
R(θ, n̂) = e− ~ θL·n̂ . (7.198)
238
7.5. Rotational symmetry and ...
Whereby the first line follows from the definition of the rotated state |φ0 i
~ being a vector operator. Note
and the second line from the definition of a O
that in the second line the expectation value is a vector and R(α, β, γ)
a matrix such that the multiplication is a matrix vector multiplication.
Equation 7.199 is valid for all states |ψi such that we find for the vector
operator
R† (θ, n̂)OR(θ,
~ ~
n̂) = R(α, β, γ)O. (7.200)
For the left hand side of equation 7.200 we find for an infinitesimal small
rotation
i~ ~ − δθ i L
R† (δθ, n̂)OR(δθ,
~ n̂) = (1 + δθ L · n̂)O(1 ~ · n̂) (7.202)
~ ~
~ + δθ i L
=O ~ · n̂O
~ −O
~L~ · n̂ .
~
Combining the left and right hand side of equation 7.200 for infinitesimal
small rotations we find that for vector operators the following commutation
relations must hold
~ · n̂, O]
[L ~ = −i~ n̂ × O.
~ (7.203)
239
Chapter 7. Symmetry and conservation laws
[Li , O i ] = 0.
We can use the commutation relations for the definition of a vector operator.
Any operator with 3 components that fulfills the commutation relations of
equation 7.203 is a vector operator. Some well known vector operators
are the angular momentum L, ~ the position operator ~r and the momentum
operator ~p.
As we will see in the next section, the rotation properties of products
of vector operators becomes much nicer if we do not work with Cartesian
vector operators, but with spherical vector operators. We define
r
1
O −1 = (O x − iO y ) (7.205)
2
O0 = Oz
r
1
O +1 = (−O x − iO y ) .
2
The spherical vector operators rotate like a spherical harmonic with l = 1.
This becomes clear if one writes down the explicit equations for the spherical
harmonics with l = 1.
r r
4π 1 x y
Y1,−1 (θ, φ) = −i (7.206)
3 2 r r
r
4π z
Y1,0 (θ, φ) =
3 r
r r
4π 1 x y
Y1,1 (θ, φ) = − −i .
3 2 r r
We can express a rotation of a spherical vector operator with the use of
a Wigner D matrix.
X
R† (α, β, γ)O m R(α, β, γ) = Dml=1
0 m (α, β, γ)O m0 . (7.207)
m0
For the commutator of the angular momentum operator with a spherical
vector operator we find
[Lz , O m ] = ~mO m (7.208)
240
7.5. Rotational symmetry and ...
√ √
[L± , O m ] = ~ 1 ± m 1 ± m + 1O m±1 .
Tensor operators
We can generate a rank k Cartesian tensor operator by taking the tensor
product of k Cartesian vector operators. The elements of tensor operator
T are given as
(k)
T i,j,... = O i O 0j . . . . (7.209)
(k)
X
R† T i,j,... R = Rii0 O i0 Rjj 0 O 0j 0 . . . , (7.210)
i0 ,j 0 ,...
(k=2)
T 0,0 =O ~ 0,
~ ·O (7.211)
~ (k=2) = O
T ~ 0,
~ ×O (7.212)
1
rotates like a vector operator. In total a rank 2 tensor operator has 9 ele-
ments. We see that we can create one linear combination of these elements
that rotates like a scaler, 3 linear combinations of the tensor elements rotate
like a vector and the other 5 are not determined yet.
The rotation properties of tensor operators are much nicer if we look at
spherical tensors instead of Cartesian tensors. If we couple two spherical
241
Chapter 7. Symmetry and conservation laws
vectors operators, then we couple two objects that rotate like spherical
harmonics with l = 1. The coupling of these two operators will yield 3 sets
of operators that rotate like a state with angular momentum l = 0, l = 1,
or l = 2. The l = 0 operator rotates like a scalar, the l = 1 operators like
a spherical vector, and the l = 2 operators are the 5 elements of a rank 2
Cartesian tensor we did not identify before. We define the spherical tensor
operators of rank 1 to be equal to the spherical vector operators
T (k=1)
q = O m=q . (7.213)
The coupling of two spherical tensors can be done with the use of the
Clebsch-Gordan coefficients
X kq
T (k)
q = Ck1 q1 k2 q2 T (k1) (k2 )
q1 T q2 . (7.214)
q1 q2
We will call k the rank of the spherical tensor operator, which has a slightly
different meaning than the rank of a Cartesian tensor operator. The indices
k and q are similar to the indices l and m used for states with angular
momentum l and z projection m.
(k)
The spherical tensor operator T q has 2k + 1 elements with q ∈ [−k, k].
The tensor rotates like a spherical harmonic with angular momentum l = k.
As such the rotation can be expressed with the use of a Wigner D matrix
(k)
X
R† (α, β, γ)T (k)
q R(α, β, γ) = Dql=k
0 q (α, β, γ)T 0 .
q (7.215)
q0
242
7.5. Rotational symmetry and ...
µkq 0 0
lml0 m0 describes the transition from state |l m i to state |lmi due to the
(k)
operator T q . For a given l, l0 and k there are (2l + 1) × (2l0 + 1) × (2k + 1)
different matrix elements µkq lml0 m0 . These matrix elements with different m,
0
m , or q are all related to each other. The Wigner-Eckart theorem states
that
D E D E
0 0 (k) 0
lm T (k)
q l m = C lm
0 0
l m kq l T l , (7.218)
state |l0 m0 i with state |kqi to state |lmi. Whereby we now used the Clebsch-
Gordan coefficients not to couple two states with each other, but to describe
the rotational dependence of the product of a spherical tensor operator with
a state.
Equation 7.218 is valid for all values of m, m0 and q, such that we can
pick a convenient value for them to calculate the reduced matrix elements.
For example
D E 1 D (k)
E
l T (k) l0 = l0 l0 T 0 l0 0 , (7.219)
Cl0 0k0
and
D
0 0
E Cllm
0 m0 kq
D
(k)
E
lm T (k)
q lm = l0 l0 T 0 l0 0 . (7.220)
Cl0 0k0
243
Chapter 7. Symmetry and conservation laws
If we combine
D equationE7.221 and 7.222 we find a recurrence relation be-
(k)
tween lm L± T q l0 m0 for different values of m and m0
√ √ D
(k) 0 0
E
l ± m l ∓ m + 1 lm ∓ 1 T q l m = (7.223)
√ √ D
0 0
E
l0 ∓ m0 l0 ± m0 + 1 lm T (k)
q l m ± 1
D E
(k)
+ k ∓ q k ± q + 1 lm T q±1 l0 m0 .
p p
244
7.5. Rotational symmetry and ...
with Dm l
0 m (θ, n̂) the Wigner D matrix. A rotation can mix the degenerate
states of L2 , but does not mix states with different l. We can assign to each
irreducible representation, i.g. the set of all 2l + 1 states |lmi with different
m for a fixed l, a character χ. The character is a set of (complex) numbers
that indicates how the representation, i.g. the states |lmi, behave under
rotation. The element of the character of an irreducible representation for
a specific rotation is given by the trace of the Wigner D matrix for that
rotation
l
X
l
χ(l, θ, n̂) = Dmm (θ, n̂). (7.225)
m=−l
245
Chapter 7. Symmetry and conservation laws
of one dimensional real functions f (x) and their behaviour under inversion
symmetry ix7→−x . The eigenfunctions of Ix7→−x are even or odd functions
with eigenvalue ±1
ix7→−x fg (x) = +fg (x) (7.226)
ix7→−x fu (x) = −fu (x).
Each function f (x) can be written as a sum of an even (gerade) and odd
(ungerade) contribution
f (x) = fg (x) + fu (x). (7.227)
Looking at the behavior of the even and odd functions separately often
greatly simplifies the problem at hand. We can recast the previous state-
ments in the language of groups. The inversion operator combined with
the identity operator forms a group. This group has two irreducible rep-
resentations. The even and odd representation. The character of the even
representation is χg = {1, 1} the character for the odd representation is
χu = {1, −1}. Whereby the two elements of the character refer to the sign
one obtains if one acts with the identity operator or with the inversion
operator on the irreducible representation.
It is useful to calculate the character for the irreducible representations
of the rotation group. We first of all notice, that, while we have rotational
symmetry, it should not matter around which axis we rotate. We thus
find that χ(l, θ, n̂) = χ(l, θ, ẑ) = χ(l, θ). We can choose to calculate the
character for a rotation around the ẑ direction. For a rotation around the
ẑ direction we have
i
R(θ, ẑ) |lmi = e− ~ Lz θ |lmi (7.228)
−imθ
=e |lmi .
l
The Wigner D matrix for a rotation around z is diagonal, with Dmm (θ, ẑ) =
e−imθ . The trace of the Wigner D matrix yields
l
X
χ(l, θ) = e−imθ (7.229)
m=−l
246
7.5. Rotational symmetry and ...
sin((l + 12 )θ)
= .
sin( 12 θ)
247
Chapter 7. Symmetry and conservation laws
248
Chapter 8
Before Schrödinger introduced his equation, atomic physics was poorly un-
derstood. The Bohr model based on results from atomic spectroscopy is
able to describe some of the phenomena in atomic physics. It was, from the
moment the Bohr model was introduced, well understood that the Bohr
model has major physical internal problems (see section 1.7). The Bohr
model can be used to describe the energy levels, but the model has little
predictive power. Schrödinger in his first paper [7] derived the excitation
spectrum of the Hydrogen atom. These results made his theory an immedi-
ate success. With the knowledge of the previous chapters, we are now able
to reproduce these results and explain some of the properties of a Hydrogen
atom.
249
Chapter 8. The Hydrogen atom
for the stationary solutions of the Schrödinger equation. These are the
eigenstates of the Hamiltonian. The Hamiltonian is given by
1 2 1 2 e2 1
H= ~pp + ~pe − , (8.1)
2mp 2me 4π0 |~re − ~rp |
with mp (me ) the mass of the proton (electron), ~rp (~re ) the position of the
proton (electron) and ~pp = −i~∇p (~pe = −i~∇e ) the momentum operator
for the proton (electron). The last term represents the Coulomb interaction.
As a first step we replace the two particle differential equation by two
decoupled differential equations related to the centre of mass and the rel-
ative distance between the proton and electron. We define the centre of
mass and relative coordinates as
~ = mp~rp + me~re
R (8.2)
mp + me
~r = ~re − ~rp .
∂ ∂~ri ∂ ∂R~i ∂
= + (8.3)
∂~rp,i ~i
∂~rp,i ∂~ri ∂~rp,i ∂ R
∂ mp ∂
=− + ,
∂~ri mp + me ∂ R ~i
∂ ∂~ri ∂ ∂R~i ∂
= +
∂~re,i ~i
∂~re,i ∂~ri ∂~re,i ∂ R
∂ me ∂
=+ + ,
∂~ri mp + me ∂ R ~i
such that
∂ ∂ ∂
= + , (8.4)
~
∂ Ri ∂~rp,i ∂~re,i
250
8.1. The central potential
∂ 1 ∂ ∂
= mp − me .
∂~ri mp + me ∂~re,i ∂~rp,i
The centre of mass and relative momentum operators become
~ = ~pp + ~pe = −i~∇R
P (8.5)
mp~pe − me~pp
~p = = −i~∇r .
mp + me
With
M = me + mp (8.6)
me mp
µ= ,
me + mp
we find for the Hamiltonian
1 ~2 1 2 e2 1
H= P + ~p − . (8.7)
2M 2µ 4π0 |~r|
There is no interaction between the centre of mass and relative coordi-
nates and we can separate the solution for the Schrödinger equation in a
product wave function for each coordinate separately. With
~ = φ(~r)ϕ(R),
Φ(~r, R) ~ (8.8)
and,
1 ~2
H R~ = P (8.9)
2M
1 2 e2 1
H ~r = ~p − ,
2µ 4π0 |~r|
we find
~ = Etot φ(~r)ϕ(R)
(H R~ + H ~r )φ(~r)ϕ(R) ~ =⇒ (8.10)
H ~r φ(~r) H R~ ϕ(R)~
+ = Etot =⇒
φ(~r) ~
ϕ(R)
251
Chapter 8. The Hydrogen atom
~ = E ~ ϕ(R)
H R~ ϕ(R) ~
R
H ~r φ(~r) = E~r φ(~r)
E~r + ER~ = Etot .
~ = 1 ~ ~
ϕ(R) eiK·R , (8.11)
(2π)3/2
with energy
~ 2
~2 |K|
ER~ = . (8.12)
2M
This is a plane wave. One can make a wave packet from these plane waves to
describe the position and velocity of the Hydrogen atom as a free quantum
particle.
The degrees of freedom related to the center of mass are generally not
the ones we are interested in. If we look at excitations of an atom we look at
the internal degrees of freedom represented by the relative coordinates. In
order to find the eigenstates of the Hamiltonian for the relative coordinates,
we need to do a bit more work.
The operator H ~r is invariant under rotation (It has the same form
as the operator given in equation 7.57). We thus know that angular mo-
mentum commutes with the Hamiltonian. We furthermore know that the
angular momentum is a conserved quantity, i.e. the expectation value of
the angular momentum operator (and all powers of the angular momen-
tum operator) for an arbitrary wave packet Epropagating in this potential
~ n Ψ(t, ~r) = 0). Last but not least we
D
∂
are time independent ( ∂t Ψ(t, ~r) L
know that the eigenstates of the Hamiltonian are also eigenstates of the
angular momentum operator L2 . The eigenstates of L2 have eigenvalues
l(l + 1) and a degeneracy 2l + 1. We label the different degenerate states
with equal l by the projection of the angular momentum on the z axis going
from m = −l to m = +l. The easiest way to continue with our quest to
find the eigenstates of the Hamiltonian is to write the Hamilton operator in
252
8.2. Angular solution
~2 1 ∂ ∂2
2 ∂ 1 ∂ ∂ 1
H ~r = − r + 2 sin θ + 2 2
2µ r2 ∂r ∂r r sin θ ∂θ ∂θ r sin θ ∂ϕ2
(8.13)
e2 1
− ,
4π0 r
whereby the first line of this equation represents the kinetic energy in spher-
ical coordinates and the second line the potential or Coulomb energy be-
tween the electron and proton. With the use of the angular momentum
operator (L~ = ~r × ~p)
1 ∂2
~ 2 = −~2 1 ∂ ∂
L sin θ + (8.14)
sin θ ∂θ ∂θ sin2 θ ∂ϕ2
we have
~2 L2 e2 1
1 ∂ 2 ∂
H ~r = − r − 2 2 − . (8.15)
2µ r2 ∂r ∂r ~ r 4π0 r
253
Chapter 8. The Hydrogen atom
The angular wave functions shown in figure 7.3 or 7.5 are besides eigen-
states of the angular momentum operator also the solutions for the atomic
orbitals. One uses the term orbital for a wave function that is relatively
well localized, similar to the term wave packet. The term orbital is mostly
used for stationary states, the term wave packet for states that show some
dynamics and as such are superpositions of stationary states. The atomic
orbitals play a very important role in understanding the chemical and phys-
ical properties of molecules and solids. Why, if, and how atoms bind and
form molecules or solids and what the properties of the resulting materials
are is intimately related to these angular wave functions.
~2 1 ∂ e2 1
2 ∂ l(l + 1)
Hr = − r − − , (8.18)
2µ r2 ∂r ∂r r2 4π0 r
With
∂ 1 ∂
R(r) = 2 r P (r) − P (r) , (8.21)
∂r r ∂r
and thus
∂2
∂ 2 ∂
r R(r) = r 2 P (r), (8.22)
∂r ∂r ∂r
254
8.3. Radial solution
and given that rH r R(r) = EP (r), we find the Hamiltonian acting on P (r)
to be:
~2 ∂ 2 l(l + 1)~2 e2 1
H=− + − , (8.23)
2µ ∂r2 2µr2 4π0 r
such that
HP (r) = EP (r). (8.24)
Form equation 8.23 we see that we are effectively looking for the stationary
solutions of a particle in a one-dimensional potential. The mass of the
electron is replaced by the reduced mass µ. The Coulomb potential is
2
modified by the additional term l(l+1)~
2µr2
.
In order to solve this eigenvalue equation, we use the same method as
we did for the solution of the Harmonic oscillator (see section 5.6.1). We
first look at the solution in the limit where r → ∞ and r → 0 and then use
this solution multiplied with a polynomial of order n as a trial solution for
the full eigenvalue equation. One might wonder why we need to test also
the limit for r → 0, as we did not test this limit for the harmonic oscillator.
One should realize that the harmonic oscillator defines a potential that
is analytic over the full domain of the function. This is not true for the
Coulomb potential. The Coulomb potential diverges for ~r = 0. If the
potential diverges we know that the derivative of φ(~r) at ~r = 0 might show
a discontinuity. Following the solution for the delta function potential (see
section 5.3.1) we should expect a cusp in φ(~r) at r = 0 that is proportional
to the value of φ(~r = 0).
For r → 0 we know that E and 1/r are small compared to 1/r2 . We can
thus neglect the terms proportional to E and 1/r and only keep the terms
proportional to 1/r2 in equation 8.23. In the limit r → 0 we thus find the
eigenvalue equation
∂2 l(l + 1)
− 2
P0 (r) + P0 (r) = 0. (8.25)
∂r r2
This equation is satisfied if
P0 (r) = rl+1 , (8.26)
255
Chapter 8. The Hydrogen atom
or
The solution that behaves like rl+1 is called the regular solution and can
lead to a wave function that is square-integrable. The solution that behaves
like r−l is called the irregular solution. For l > 0 the irregular solution is
ill-defined for ~r → 0. The radial function P (r) diverges, but the angular
function can have any phase such that the wave function is discontinuous.
As such the irregular solutions can not be used to build quantum states.
The irregular solution for l = 0 will be discussed later in this chapter 8.4.
We find that for small r, P (r) should behave like rl+1 (R(r) behaves like
rl ).
Next, we focus on the limit where r → ∞. In this limit we can neglect
the potential terms and need to find a solution for
~2 ∂ 2
− P∞ (r) = EP∞ (r). (8.28)
2µ ∂r2
with k 2 = 2µE
~2
we find two separate types of solutions depending if E > 0
for the free, scattering states and E < 0 for the bound states:
(
c1 eikr + c2 e−ikr E > 0
P∞ (r) = , (8.29)
e−|k|r E<0
256
8.3. Radial solution
This initial guess function has the correct behaviour for r → 0 and r → ∞.
With αn,l,q = αj=q−l−1 we can rewrite this trial function as
n
X
Pn,l (r) = αn,l,q rq e−|k|r , (8.31)
q=l+1
We now extend the sum to start at q = 1 and set all αn,l,q that were not in
the original sum to zero. With αn,l,q = 0 ∀q ≤ l, q > n we have
∞
X l(l + 1) − q(q + 1)
αn,l,q+1 (8.33)
2
q=1
!
µe2
+ |k|q − αn,l,q rq−1 = 0.
4π0 ~2
257
Chapter 8. The Hydrogen atom
This equation is valid for all values of r. As such each term needs to be
zero. We find
µe2
l(l + 1) − q(q + 1)
αn,l,q+1 = − |k|q αn,l,q . (8.34)
2 4π0 ~2
As αn,l,n+1 = 0 and αn,l,n 6= 0 we have
µe2
− |k|n = 0, (8.35)
4π0 ~2
or
µe2
|k| = . (8.36)
n4π0 ~2
Which yields for the energy
2 2
e µ 1
En = − . (8.37)
4π0 2~2 n2
This is the same result as Bohr obtained (see equation 1.65) within the Bohr
model and describes the experimental findings as described by Rydberg (see
equation 1.54). It is this result that Schrödinger obtained in his first paper
and shows the power of his equation.
With the energy of each eigenstate given by the quantum numbers n, l
and m we can calculate the wave function. Equation 8.34 defines the rela-
tion between the coefficients of the polynomial part of the radial function
Pn,l (r) given by equation 8.31. If we do not worry about the normalization
we can take αn,l,q=l+1 = 1 and use equation 8.34 to calculate the full radial
wave function. The radial wave function is given by an exponential function
times a polynomial. The polynomial contains the terms rl+1 to rn . These
polynomials are so important that they have gotten a name. We call them
(up to some scaling and a factor rl+1 ) the associated Laguerre polynomials
L2l+1
n−l−1 . In terms of these polynomials we find for the normalized radial
wave function
s
2 3 (n − l − 1)! 2r l − aµr n 2l+1
2r
Rn,l (r) = e Ln−l−1 , (8.38)
naµ 2n(n + l)! aµ n aµ n
258
8.3. Radial solution
2
with aµ = 4π 0~
µe2
the (reduced) Bohr radius.
The Bohr radius aµ (5.29 10−11 meter) defines a typical distance in
atomic physics and accordingly a length scale for the distances between
atoms in molecules and solids. We furthermore find that the radial function
scales with n and for larger n we find a more extended wave function. For
all l, except for l = 0 we find that R(r = 0) = 0. The divergence in the
potential at ~r = 0 leads to a cusp in the wave function for l = 0, but not
for states with finite angular momentum l > 0. Last but not least, the
Laguerre polynomials have n − l − 1 zeros.
259
Chapter 8. The Hydrogen atom
260
8.4. Cusp of the wave function at the origin
Note that Rk,l (r) is real. This can be seen from the series expansion in
equation 8.39 and 8.44 with c real. It is not so obvious in equation 8.45.
261
Chapter 8. The Hydrogen atom
∂ 1
Rn,l=0 (r) = αn,l=0,q=2 − αn,l=0,q=1 . (8.52)
∂r r→0 aµ n
We now use the relation between αn,l=0,q=2 and αn,l=0,q=1 as found in the
recurrence relation 8.34
1 1
αn,l=0,q=2 = − − αn,l=0,q=1 , (8.53)
aµ aµ n
and get
∂ 1
Rn,l=0 (r) =− αn,l=0,q=1 (8.54)
∂r r→0 aµ
1
=− Rn,l=0 (r)|r→0
aµ
µe2
=− Rn,l=0 (r)|r→0 .
4π0 ~2
The cusp in the wave function at ~r = 0 is proportional to the value of the
function at ~r = 0. It is furthermore multiplied by µ/~2 the effective mass
divided by ~2 and the residue of the pole in the potential e2 /(4π0 ). This
is the same result as we obtained in section 3.1 based on Gauss’s theorem.
262
8.5. The periodic table
inferred before the Schrödinger equation was known. Pauli in 1925 pub-
lished several papers where he compared atomic spectroscopy in magnetic
fields for different atoms. For a state with angular momentum l, the Zee-
man effect lifts the degeneracy of these 2l + 1 states. For a spectroscopic
transition one observes that a single absorption line without magnetic field
transforms into 2l + 1 lines in a magnetic field. Pauli related the spec-
troscopic line splitting due to the Zeeman effect to the positions of atoms
in the periodic table [73, 74]. Pauli concluded that in order to explain
the periodic table one can use the Bohr model and fill each orbital in this
model with a maximum of two electrons. This assumption remains valid
if one wants to explain the periodic table with the use of the Schrödinger
equation. Each atomic orbital with the quantum numbers n, l, and m
can at maximum host two electrons. This assumption is not part of the
Schrödinger equation and needs to be added as a separate side condition.
The number two for the filling per orbitals seems at first sight arbitrary.
In statistical physics one normally counts each degree of freedom or state
once. An atomic orbital with the quantum numbers n, l, and m seems to
have two possible internal quantum states. The degeneracy of these states
can be lifted by magnetic fields, as observed by doublet lines in absorption
spectroscopy, or in the Stern Gerlach experiment. An additional quantum
number σ was introduced; the spin of the electron [75]. We will label this
additional degree of freedom with the quantum number σ. The values of
σ can be ± 21 . The spin degree of freedom couples to magnetic fields. It
thus seems natural to relate this degree of freedom to an internal angular
momentum. The size of this angular momentum needs to be such that the
degeneracy is 2. We find
2 = 2lspin + 1, (8.55)
1
lspin = ,
2
for the internal spin-angular momentum of an electron.
In order to explain the periodic table, we find that for multi-particle
systems not all solutions of the Schrödinger equation represent physical
states. Only solutions of the Schrödinger equation where electrons occupy
263
Chapter 8. The Hydrogen atom
264
8.6. The introduction of electron spin
265
Chapter 8. The Hydrogen atom
266
8.6. The introduction of electron spin
the basis of the corresponding Hilbert space for each point in space as
representing the states by vectors labeled with |l, mi. It is worth noting
that several equivalent notations used are
or
1 0
B={ , }. (8.59)
0 1
The angular momentum operators on this basis are (see section 7.5.2)
1 1 0
Sz = ~ , (8.60)
2 0 −1
0 1
S+ = ~ , (8.61)
0 0
and
0 0
S− = ~ . (8.62)
1 0
We used the letter S for spin angular momentum instead of L, which
we will reserve for orbital angular momentum from now on. Both are
angular momentum operators. With the use of S x = 21 (S + + S − ) and
S y = −i 21 (S + − S − ) we furthermore have
1 0 1
Sx = ~ , (8.63)
2 1 0
and
1 0 −i
Sy = ~ . (8.64)
2 i 0
267
Chapter 8. The Hydrogen atom
These matrices are, besides a factor of ~/2 called the Pauli matrices.
Next we have a look on how these spin states behave under rotations.
i
With Ri (θ) = e− ~ S i θ (equation 7.69) for a rotation around the i direction
we have
cos 2θ −i sin 2θ
Rx (θ) = , (8.65)
−i sin 2θ cos 2θ
θ
− sin 2θ
cos 2
Ry (θ) = θ , (8.66)
sin 2 cos 2θ
iθ
!
e− 2 0
Rz (θ) = iθ . (8.67)
0 e2
which for each point ~r defines a complex vector within the Hilbert space
with basis B = {|↑i , |↓i}. For the normalization of this state we find
Z
ψ(t, ~r)† ψ(t, ~r)dd~r (8.71)
Rd
268
8.6. The introduction of electron spin
Z
= ψ↑ (t, ~r)∗ ψ↑ (t, ~r) + ψ↓ (t, ~r)∗ ψ↓ (t, ~r)dd~r.
Rd
The vector field S(~~ r) defines a vector to each point in space. This vector
is real and defines the spin direction of the electron at this point in space.
The length of this vector is related to the local charge density. For states
that can be written in the form of equation 8.70 we find
q
~ ~r) · S(t,
S(t, ~ ~r) 1
†
= ~. (8.73)
ψ(t, ~r) ψ(t, ~r) 2
Both the spin density and the charge density are functions of position. Their
quotient (without integration) is a constant. This relation generally holds
for states describing single particles. For compound particles (for example
an atom with several electrons) with total spin 1/2, the wave function de-
pends on several coordinates and the above equation not necessarily holds.
269
Chapter 8. The Hydrogen atom
A = πr2 , (8.75)
270
8.6. The introduction of electron spin
1 1 ∂ ~ · S.
~
HSOC = V (r)L (8.79)
2m2 c2 r ∂r
This is known as the spin-orbit interaction. As discussed before, the Maxwell
equations for this situation are slightly troublesome. According to the
271
Chapter 8. The Hydrogen atom
Maxwell equations one can not have a persistent rotating current. The
real motivation for the spin-orbit interaction comes from the Dirac equa-
tions, which to first order in (v/c)2 yields for a single electron in a spherical
potential the same result. The Hamiltonian for the hydrogen atom with
spin-orbit interaction becomes
~p2 e2 1 e2 ~ ~
H= − + L · S. (8.80)
2µ 4π0 r 2m2 c2 4π0 r3
There are other relativistic effects that are of the same order of mag-
nitude as the spin-orbit interaction. These are the mass velocity term and
the Darwin term, which modify the kinetic energy and the potential energy.
The mass velocity and Darwin term make quantitative changes, whereas the
spin-orbit coupling makes a qualitative change. The spin-orbit interaction
reduces the symmetry of the Hamiltonian. Without spin-orbit interaction
the Hamiltonian commutes with both L and S. With spin-orbit interaction
it commutes only with L + S as we will show below. Without spin-orbit
coupling an nl shell has a (2s + 1)(2l + 1) fold degeneracy with s = 12 . With
spin-orbit coupling this degeneracy is lifted.
The spin-orbitals
In order to diagonalize the Hamiltonian of the hydrogen atom with spin-
orbit interaction, we can have a look at the commutator of the Hamiltonian
with the angular momentum operators. We know that without spin-orbit
interaction the Hamiltonian commutes with both L ~ and S.
~ We find however
that
~ · S,
[L ~ S z ] = Lx [S x , S z ] + Ly [S y , S z ], (8.81)
and
~ · S,
[L ~ Lz ] = S x [Lx , Lz ] + S y [Ly , Lz ]. (8.82)
The orbital and spin angular momentum operators do not commute with
the Hamiltonian 8.80. Therefore we know that the orbital angular momen-
tum nor the spin angular momentum are conserved quantities. If we write
272
8.6. The introduction of electron spin
L ~ = Lz S z + 1 (L+ S − + L− S + ) ,
~ ·S (8.83)
2
we see that the spin-orbit coupling operator can transfer angular momen-
tum from the spin sector to the orbital sector and vice versa. For a con-
served quantity we should look at the total angular momentum J ~ of an
electron rotating around the nucleus with orbital angular momentum L ~
~
and spin S.
~ =L
J ~ + S.
~ (8.84)
~ · S,
[L ~ J~ ] = 0. (8.85)
and
~ ] = 0.
[H, J (8.87)
and
273
Chapter 8. The Hydrogen atom
jjz
With the use of the Glebsch-Gorden coefficients Css z llz
we can write these
states as linear combinations of the eigenstates of the orbital angular mo-
mentum and spin angular momentum.
X jj
|lsjjz i = Csszz llz |ssz llz i , (8.90)
sz ,lz
whereby |ssz llz i are the tensor product states of the eigenstates of the spin
|ssz i and orbital |llz i angular momentum. The eigenstates of the orbital
angular momentum are given by the spherical harmonics. The eigenstates
of the spin are given by a vector in a complex Hilbert space, which we denote
as |↑i and |↓i. Using explicit values of the Glebsch-Gordon coefficients for
s = 1/2 we find for the case where j = l + 1/2
r r
l + 1/2 + jz l + 1/2 − jz
|lsjjz i = Yl,jz −1/2 (θ, ϕ) |↑i + Yl,jz +1/2 (θ, ϕ) |↓i
2l + 1 2l + 1
(8.91)
274
8.6. The introduction of electron spin
a cut in the yz, zx, and xz plane. Spins parallel to the x-axis are red,
parallel to the y-axis are green, and parallel to the z-axis blue. For the s
spin-orbital with j = 1/2, the spin is coaligned for each point in space. For
the p spin-orbital with j = 1/2 the magnetization direction rotates once
around its axis as a function of θ going from 0 to π.
Spin-orbit coupling relates the direction of the spin orientation to the
spatial direction around the atom. For atoms embedded in a solid, where
the local environment is no longer spherical this leads to magnetic anisotropy.
Magnetic anisotropy creates magnetic moments that are larger in one spa-
tial direction than in another. Magnetic anisotropy is very important for
the fabrication of permanent magnets. Without anisotropy one can always
rotate the magnetization direction at very little energy cost. The intriguing
entanglement of spin direction and spatial direction induced by spin-orbit
coupling lifts in a non-spherical environment this degeneracy and makes
a Ferromagnet stable. These magnetization distributions around an atom
can be observed in experiments. If one diffracts a particle (photon, elec-
275
Chapter 8. The Hydrogen atom
tron, neutron) from an atom then one becomes an image of this object.
Either measured in momentum (Fourier) space as an interference pattern,
or with the help of lenses directly in real space. The momentum-dependent
scattering of an atom is known as the (magnetic) atomic form factor.
276
8.6. The introduction of electron spin
277
Chapter 8. The Hydrogen atom
278
Chapter 9
N particle systems
279
Chapter 9. N particle systems
In our notation we included the variables ~r1 and ~r2 in the bracket notation.
We normally do not need to do this. As we will show later in this section,
280
9.1. The Schrödinger equation alone is not ...
?
|ψHe (~r1 , ~r2 )i = |ψ1s↑ (~r1 )ψ1s↓ (~r2 )i (9.2)
?
|ψHe (~r1 , ~r2 )i = |ψ1s↓ (~r1 )ψ1s↑ (~r2 )i .
281
Chapter 9. N particle systems
in the z direction
r
1
|ψ1s↑x i = (|ψ1s↑z i + |ψ1s↓z i) (9.3)
2
r
1
|ψ1s↓x i = (|ψ1s↑z i − |ψ1s↓z i)
2
To exemplify the relation between these two basis sets we can have a look
at the spin operator S x as a matrix on these two basis sets:
1 0 1 hψ1s↑z |
S x = ~ |ψ1s↑z i |ψ1s↓z i · · (9.4)
2 1 0 hψ1s↓z |
1 1 0 hψ1s↑x |
= ~ |ψ1s↑x i |ψ1s↓x i · ·
2 0 −1 hψ1s↓x |
Returning to the problem of the wave function of the He atom, we can write
one of the two possible solutions as
?
|ψHe (~r1 , ~r2 )i = |ψ1s↑z (~r1 )ψ1s↓z (~r2 )i (9.5)
r r +
1 1
= (ψ1s↑x (~r1 ) + ψ1s↓x (~r1 )) (ψ1s↑x (~r2 ) − ψ1s↓x (~r2 ))
2 2
1
= |ψ1s↑x (~r1 )ψ1s↑x (~r2 )i − |ψ1s↑x (~r1 )ψ1s↓x (~r2 )i
2
+ |ψ1s↓x (~r1 )ψ1s↑x (~r2 )i − |ψ1s↓x (~r1 )ψ1s↓x (~r2 )i .
After rotating the basis one finds that two electrons that appear to be in
different spin orbitals for a quantization in the z direction are at least partly
in the same spin orbital for a quantization in the x direction.
We can guarantee that two electrons are not in the same spin orbital,
independent of the basis used, when we make the wave function anti sym-
metric under exchange of the particle coordinates. We find
r
1
|ψHe (~r1 , ~r2 )i = |ψ1s↑x (~r1 )ψ1s↓x (~r2 )i − |ψ1s↓x (~r1 )ψ1s↑x (~r2 )i (9.6)
2
282
9.1. The Schrödinger equation alone is not ...
r +
1
= ψ1s↑x (~r1 )ψ1s↓x (~r2 ) − ψ1s↓x (~r1 )ψ1s↑x (~r2 )
2
= |ψ1s↑x ψ1s↓x i
283
Chapter 9. N particle systems
For atomic Au a Single Slater determinant state has more terms than we
have atoms in the universe. If we want to write down a wave function
for a macroscopic piece of solid we need about 1023 atoms. If each atom
has one electron we need a wave function with 1023 electrons. In this case
the wave function is given by the determinant of a 1023 by 1023 matrix.
36
Such a matrix has 1023 ! ≈ 1010 terms. Calculating expectation values
36 2
of such an object would require to evaluate 1010 integrals. Quantum
mechanics yields a description of the world so complex that even writing
down the state of a moderately sized system becomes impossible. The use
of Slater determinants for states is however not the most compact notation,
such that there is partly a solution to the immense complexity encountered.
This we will discuss in the next section.
284
9.1. The Schrödinger equation alone is not ...
Instead of first creating particle 2 and then particle 1 we could also have
changed the order. This should give the same state, i.e. the same ray, but
285
Chapter 9. N particle systems
We find that the order in which we create the particles changes the sign
of the wave function. We can generate the desired behaviour if we define
the following anti-commutation behaviour for our creation operators
a†1 a†2 . . . a†i a†j . . . a†N |0i = −a†1 a†2 . . . a†j a†i . . . a†N |0i . (9.15)
we find that
†
a†τ |0i = h0| aτ . (9.17)
286
9.1. The Schrödinger equation alone is not ...
{aα , aβ } = aα aβ + aβ aα (9.19)
= 0.
D E
0 aα a†β 0 = δα,β . (9.20)
aτ |0i = 0, (9.21)
and
287
Chapter 9. N particle systems
= δα,β .
0 = aτ aτ a†τ (9.25)
= −aτ a†τ aτ + aτ
= +a†τ aτ aτ
= 0.
We call O (0) , O (1) , O (2) , . . . the zero, one, two, etc. particle operator. The
p2
potential energy V (~ri ), and the kinetic energy 2mi are given by a one particle
1
operator. The Coulomb interaction between two electrons ∝ |~ri −~ rj | , as well
as the angular momentum operator L2 = ( i (~ri × ~pi ))2 are given by two
P
particle operators.
288
9.1. The Schrödinger equation alone is not ...
In the last step we used that |φβ ihφα | = a†β aα . If these operators act on a
many particle state where orbital α is unoccupied both operators yield 0. If
they act on a many particle state where orbital α is occupied the occupied
orbital α is replaced by the orbital β.
In order to write the two particle operator in second quantization we
define the complete set of two particle states a†α a†β |0i = |φα φβ i for α ∈
[1, ∞] and β ∈ [1, α − 1]. Note that we need to take β < α as a†β a†α =
−a†α a†β . P
The identity operator written on a basis of two particle states
becomes β<α |φα φβ ihφβ φα | = 1, such that
X
O (2) = |φα φβ ihφβ φα | O (2) |φδ φγ ihφγ φδ | (9.28)
β<α,δ<γ
X D E
= φβ φα O (2) φδ φγ |φα φβ ihφγ φδ |
β<α,δ<γ
D E
φβ φα O (2) φδ φγ a†α a†β aγ aδ .
X
=
β<α,δ<γ
289
Chapter 9. N particle systems
We can rewrite the sum to include all values of β and δ by writing out the
two particle expectation value of operator O (2) .
ZZ
1 X
O (2) = φ∗α (~r1 )φ∗β (~r2 ) − φ∗β (~r1 )φ∗α (~r2 ) O (2) (~r1 , ~r2 )
2 R3
β<α,δ<γ
× (φδ (~r1 )φγ (~r2 ) − φγ (~r1 )φδ (~r2 )) d3~r1 d3~r2 a†α a†β aγ aδ (9.29)
X Z Z
= φ∗α (~r1 )φ∗β (~r2 )O (2) (~r1 , ~r2 )φδ (~r1 )φγ (~r2 )d3~r1 d3~r2
β<α,δ<γ R3
X ZZ
+ φ∗β (~r1 )φ∗α (~r2 )O (2) (~r1 , ~r2 )φγ (~r1 )φδ (~r2 )d3~r1 d3~r2
β<α,δ<γ R3
X ZZ
− φ∗β (~r1 )φ∗α (~r2 )O (2) (~r1 , ~r2 )φδ (~r1 )φγ (~r2 )d3~r1 d3~r2
β<α,δ<γ R3
ZZ !
X
− φ∗α (~r1 )φ∗β (~r2 )O (2) (~r1 , ~r2 )φγ (~r1 )φδ (~r2 )d3~r1 d3~r2
β<α,δ<γ R3
1
× a†α a†β aγ aδ
2
X ZZ
= φ∗α (~r1 )φ∗β (~r2 )O (2) (~r1 , ~r2 )φδ (~r1 )φγ (~r2 )d3~r1 d3~r2
β<α,δ<γ R3
X ZZ
+ φ∗α (~r1 )φ∗β (~r2 )O (2) (~r1 , ~r2 )φδ (~r1 )φγ (~r2 )d3~r1 d3~r2
β>α,δ>γ R3
X ZZ
+ φ∗α (~r1 )φ∗β (~r2 )O (2) (~r1 , ~r2 )φδ (~r1 )φγ (~r2 )d3~r1 d3~r2
β>α,δ<γ R3
ZZ !
X
+ φ∗α (~r1 )φ∗β (~r2 )O (2) (~r1 , ~r2 )φδ (~r1 )φγ (~r2 )d3~r1 d3~r2
β<α,δ>γ R3
1
× a†α a†β aγ aδ
2
290
9.1. The Schrödinger equation alone is not ...
ZZ
1 X
= φ∗α (~r1 )φ∗β (~r2 )O (2) (~r1 , ~r2 )φδ (~r1 )φγ (~r2 )d3~r1 d3~r2
2 R3
αβγδ
× a†α a†β aγ aδ
1 XD ∗ E
= φα (~r1 )φ∗β (~r2 ) O (2) (~r1 , ~r2 ) φδ (~r1 )φγ (~r2 ) a†α a†β aγ aδ .
2
αβγδ
with
with
1
Uα,β,γ,δ = hφα (~r1 ) φβ (~r2 )| O (2) (~r1 , ~r2 ) |φδ (~r1 )φγ (~r2 )i . (9.33)
2
With this notation the calculation of expectation values reduces to the
repeated use of commutation relations. Most importantly we never need
to explicitly implement the Pauli principle as a Slater determinant with N !
terms.
291
Chapter 9. N particle systems
[bα , bβ ] = bα bβ − bβ bα (9.35)
= 0,
and
292
9.2. Complexity
For a state with N electrons we can define single Slater determinant wave
functions as
Y
|ψi (~r1 , ~r2 , . . . , ~rN )i = a†τ |0i . (9.38)
τ ∈Di
9.2 Complexity
In the previous section we have seen that one is generally not able to write
down single Slater determinant states. For a moderate number of electrons
there are just too many terms in such an expression. We also showed that
by changing notation we can write single Slater determinant states as a
string of creation operators acting on the vacuum. The work we need to
do to write such a state scales linearly with the number of electrons. The
Pauli principle is implemented via the anti-commutation relations between
the creation and annihilation operators.
Sadly the single Slater determinant representable states are generally
not the eigenstates of the problem. The eigenstates are linear superpositions
of single Slater determinant states. The number of determinants needed to
represent a many particle system scales badly with system size. As an
293
Chapter 9. N particle systems
294
9.3. Entanglement and correlations
troduces a set of quasi particles that emerge from the interaction of many
particles. These quasi particles are described by an effective Hamiltonian
which is valid in a low energy regime. As these quasi particles are often non-
interacting the resulting low energy Hamiltonian is easier to solve. These
approximations sometimes work and allow us for example to predict the
properties of semi conductors and with that the electronics we use in our
devices.
What all of these models and approximations have in common is that
they strive to find ways to deal with the complexity that emerges from
the simple Schrödinger equation. Although the basic laws that govern the
interactions between particles are extremely well understood the resulting
states that emerge for a large number of these particles are extremely rich.
This richness in these equations allows for the complex structures we en-
counter in every day life. The complex behaviour of many particle systems
is often not easy to predict by extrapolating the behaviour of a few parti-
cles. In the words of P.W. Anderson: ”The ability to reduce every thing
to simple fundamental laws does not imply the ability to start from those
laws and reconstruct the universe.” [67, 68].
295
Chapter 9. N particle systems
and has dimension 6. For a given two particle state we can calculate the
occupation of the four different one particle states with the operator a†τ aτ ,
with τ ∈ {A, B, C, D}. In matrix form on basis B these operators are
1 0 0 0 0 0 1 0 0 0 0 0
0 1 0 0 0 0 0 0 0 0 0 0
† 0 0 1 0 0 0 † 0 0 0 0 0 0
aA aA =
0
aB aB = (9.42)
0 0 0 0 0
0
0 0 1 0 0
0 0 0 0 0 0 0 0 0 0 1 0
0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0
0 1 0 0 0 0 0 0 0 0 0 0
† 0 0 0 0 0 0 † 0 0 1 0 0 0
aC aC =
0
aD aD = .
0 0 1 0 0
0
0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 1 0
0 0 0 0 0 1 0 0 0 0 0 1
From the matrix representation we can directly see that the eigen values of
the operators a†τ aτ are 0 and 1. Following section 6.5 we thus know that
in an experiment where one measures the occupation of state τ such that
a single number is the outcome of such an experiment this value can either
be 0 or 1. The frequency with wich one finds these values, if one repeats
the experiment very often, is determined by the wave function.
We now use the fact that we can represent a solution of the Schrödinger
equation as a superposition of other solutions of the Schrödinger equation.
We consider the following two states which we assume to be solutions of
the Schrödinger equation
296
9.3. Entanglement and correlations
or as wave functions
p
ψ1 (~r1 , ~r2 ) = 1/2(ψA (~r1 )ψB (~r2 ) − ψB (~r1 )ψA (~r2 )) (9.44)
p
ψ2 (~r1 , ~r2 ) = 1/2(ψC (~r1 )ψD (~r2 ) − ψD (~r1 )ψC (~r2 )).
is also a solution of the Schrödinger equation. For state |ψi we find that
the expectation value for the occupation of each of the one particle states
is 1/2, i.e.
D E 1
ψ a†τ aτ ψ = , (9.47)
2
for τ ∈ {A, B, C, D}. With the combined knowledge that for a single mea-
surement we either find the value 0 or 1 we know that in 50% of all cases we
will find the value 0 and in 50% of the cases the value 1 for the occupation
of the state A, B, C and D.
There is a strong correlation between the occupation we find for the
different one particle states. We can have a look at the operator a†A aA a†B aB
and the operator a†A aA a†C aC . If we calculate the expectation value of these
operators for the state |ψi we find
D E 1
ψ a†A aA a†B aB ψ = , (9.48)
D E 2
ψ a†A aA a†C aC ψ = 0.
297
Chapter 9. N particle systems
1
S z = ~ a†k↑ ak↑ − a†k↓ ak↓ + a†−k↑ a−k↑ − a†−k↓ a−k↓ (9.49)
2
S + = a†k↑ ak↓ + a†−k↑ a−k↓
S − = a†k↓ ak↑ + a†−k↓ a−k↑
298
9.3. Entanglement and correlations
299
Chapter 9. N particle systems
9.4 Temperature
We have seen in the early chapters of this script that a particle in motion
can be described by a wave packet. For the Hydrogen atom we used the
eigenstates of the Hamiltonian with negative energy as the bound states.
The eigenstates of the Hamiltonian are stationary such that the states are
stable over time. Being able to describe stable states with angular momen-
tum for the electron for the hydrogen atom, solved a big internal problem
of the Bohr model.
What we have not discussed is temperature. There are currently two
competing ways to model temperature. On a macroscopic scale both de-
scriptions yield the same result. We will discuss both theoretical models.
Based on some general argumentation we can derive what the macroscopic
limit for averaged values in a system with finite temperature needs to be.
(See also the section on Black body radiation 1.1 and Einsteins model for
the heat capacity 1.3.) Once we know the macroscopic averaged limit we
can discuss the two different microscopic mechanisms to realize the statis-
tical macroscopic state.
We find a correct description of systems at finite temperature if we use
Boltzmann statistics. Boltzmann statistics tells one what the time averaged
probability is to find a system in a state with energy E at temperature T .
For Boltzmann statistics to be applicable we need to assume that the system
300
9.4. Temperature
In this equation we used brackets to indicate the average over time. This
yields the thermodynamical properties of a system. We realize that in a
book on quantum mechanics this notation can be confusing. We use the
same bracket notation for inner products of vectors in a complex Hilbert
space within the Dirac notation. As will become clear later in the lecture or
future lectures there are clear parallels between the Dirac bracket and the
temperature averaging. There is also a major difference. The Dirac bracket
integrates over states in a complex vector space. The temperature bracket
integrates over time. It is important to distinguish the difference between
temperature averages and quantum averages over distribution functions.
We choose to add the index t for temperature averaging, indicating that
one averages over fluctuations in time (contrary to building an average
over quantum fluctuations in configurational space. More on that later in
this lecture and other lectures on Green’s / correlations functions or path
integrals).
301
Chapter 9. N particle systems
lectures we defined
For this integral we assumed that a > 0, otherwise the function would not
be normalizable.
We can relate a to the temperature of the system. We define temper-
ature in such a way that the average energy of an atom in an ideal gas is
3
hEit = kB T, (9.58)
2
302
9.4. Temperature
with kB the Boltzmann constant. We can also relate the average energy to
the average velocity square. For a particle with mass m we find
m 2
hEit = vx t + vy2 t + vz2 t (9.59)
2
3
= m vx2 t .
2
With the probability to find the particle with the velocity ~v 2 given by p we
can calculate vx2 t as
∞
Z r
a −av~x 2
vx2 t = vx2 e (9.60)
−∞ π
1
= .
2a
Combining the two expressions for hEit yields
m
a= . (9.61)
2kB T
1 − 2km~v2T
p(~v 2 ) = e B . (9.62)
Z
Instead of making the probability a function of the particles velocity we can
look at the probability to find the particle with energy E = 21 m~v 2 . This
yields
1 − k ET
p(E) = e B . (9.63)
Z
303
Chapter 9. N particle systems
304
9.4. Temperature
with cn,m some set of complex numbers whose exact values determine the
system, but are irrelevant for the current discussion. The coefficients cn,m
contain the information on how much the product of system state |αn i and
bath state |βm i contributes to the thermal state |ψi. If we calculate an
305
Chapter 9. N particle systems
The sum where n 6= n0 relates to the case where two different system
eigenstates |αn i and |αn0 i are multiplied with the same bath state |βm i.
If we assume that the bath
P to∗ be large and randomly entangled with the
system of interest then 0
m cn0 ,m cn,m becomes zero unless n = n. We
retrieve the Boltzmann equation if we choose
X
c∗n,m cn,m = pn (9.72)
m
1 − kEnT
= e B .
Z
A quantum system obtains Boltzmann statistics for its local expectation
values when the system entangles with a heat bath. The local system is in
a quantum super position of different eigenstates. Each term in this super
position is represented with a weight given by Boltzmann statistics. Each
term in this superposition is entangled with different states of the heat
bath, such that there is no interference between the different terms if one
looks at expectation values of operators acting on the system. The thermal
average one makes in this description of temperature is not over time, but
is an average over different configurations of local states that entangle with
states of the heat bath. This microscopic mechanism for temperature is not
able to describe Brownian motion, thermal noise and thermal fluctuations.
This microscopic model to describe temperature does allow for non thermal
states to thermalize.
306
9.4. Temperature
307
Chapter 9. N particle systems
X
= pn hψn |O|ψn i .
n
308
Chapter 10
Perturbation theory
For many interesting cases in quantum mechanics, we are not able to find
analytical solutions to the Schrödinger equation. This is not only true for
the complex many-particle systems but even for a single particle traveling
through a potential. The number of potentials for which we have analytical
solutions of the Schrödinger equation is rather limited. Sadly enough we
have already discussed a large part of the analytically solvable problems.
With the use of numerical methods, one is able to solve with good accuracy
basically all single-particle problems of interest, if the Hamiltonian is time-
independent. Time-dependent potentials can lead to numerical instabilities
in the differential equation that need to be solved. With some attention one
normally can find numerical solutions for a single particle traveling through
a time-dependent potential. For many-particle systems we often do not have
analytical, nor numerically accurate solutions. Instead of trying to find
exact analytical solutions or numerical accurate solutions one can look at
approximate solutions. Perturbation theory is an approximate analytical
method or series expansion that can help to find solutions to otherwise
intractable problems. Even for systems that can be solved numerically,
perturbation theory can help to give insight into the physical reason why
and how things happen.
The general idea behind perturbation theory is that we split the Hamil-
309
Chapter 10. Time independent perturbation ...
tonian (H) of a system that does not have a simple solution into a part
that we can solve (H 0 ) and a part that we treat as a correction to the
solutions of the solvable part (H 1 ). We then make a series expansion of
the true solution in the perturbing Hamiltonian. We can use perturbation
theory for example if we want to calculate the eigenstates of the hydrogen
atom in an external electromagnetic field. We can first solve the solutions
of the Schrödinger equation of the hydrogen atom without an external elec-
tromagnetic field and then treat the field as a perturbation to the solutions
without a field. The same is true when we look at relativistic corrections.
For most problems we can first solve the Schrödinger equation and then
treat the relativistic effects as captured in the Dirac equation or by quan-
tum electrodynamics as a perturbation. Even for simpler systems such
as the Harmonic oscillator one often finds that these only approximately
resemble experimental situations and one needs to take anharmonic per-
turbations into account.
H = H 0 + H 1, (10.2)
with
E E
H 0 φ(0)
n = En(0) φ(0)
n , (10.3)
310
an eigenvalue equation that we know how to solve. Within perturbation
theory we will express the eigenstates and eigenenergies of H using a power
series, or Taylor series, expansion in H 1 around the point H 0 .
We will for now assume that H 0 has no degenerate eigenstates, i.e.
En(0) 6= Em
(0)
∀n, m 6= n. (10.4)
H λ = H 0 + λH 1 , (10.5)
with
311
Chapter 10. Time independent perturbation ...
∞
X E
|φλn i = λi ϕ(i)
n ,
i=0
E
(i) (i)
with En and ϕn the expansion coefficients we need to determine. For
λ = 1 we find
∞
X
En = En(i) (10.8)
i=0
X∞ E
|φn i = ϕ(i)
n .
i=0
E
(i) (i)
As En and ϕn appear as the ith order in λ they also appear as the ith
E
(i) (i)
order in H 1 in the expansion. The energy En and state ϕn are thus
the ith order corrections to the energy and state in powers of H 1 . Given
that equation 10.7 can be calculated as a Taylor series expansion we find
that the ith order corrections to the energy and to the state vector of the
nth eigenstate are
1 ∂i
En(i) = Eλn (10.9)
i! ∂λi λ=0
E 1 ∂i
ϕ(i)
n = |φλn i
i! ∂λi λ=0
312
find that
313
Chapter 10. Time independent perturbation ...
E X∞ E
= φ(0)
n + λ i
ϕ (i)
n
i=1
∞ ∞
!
X D E E X E
i
= λ φ(0)
n ϕ(i)
n φ(0)
n + λ i
φ̃ (i)
n .
i=0 i=1
D E
P∞ (0) (i)
We now divide |φλn i by i=0 λi φn ϕn . We will call the new, unnor-
malized state |ψλn i. We find
1
|ψλn i = P D E |φλn i (10.15)
∞ (0) (i)
i=0 λi φn ϕn
∞
E X λi E
= φ(0)
n + D E φ̃(i)
n .
j φ(0) ϕ(j)
P∞
i=1 j=0 λ n n
D E
φ(0)
n ψn = 1, (10.17)
1
|φn i = p |ψn i , (10.18)
hψn |ψn i
and thus
D E 1
φ(0)
n φ n = p . (10.19)
hψn |ψn i
314
With these definitions we are able to derive the perturbation expansion.
We insert the expansion for the wave function |ψλn i in powers of λ as given
in equation 10.16 and the energy expanded in powers of λ as given in
equation 10.7, into the eigenvalue equation for H λ (equation 10.6) and find
∞
X E X∞ X
∞ E
(H 0 + λH 1 ) λi φ(i)
n = λ i j (j)
λ En φ (i)
n . (10.20)
i=0 i=0 j=0
∞
X E E X∞ X
i E
λi H 0 φ(i)
n + H 1 φ (i−1)
n = λ i (i−j)
En φ (j)
n . (10.21)
i=0 i=0 j=0
E E i
X E
H 0 φ(i)
n + H1 φ(i−1)
n = En(i−j) φ(j)
n . (10.22)
j=0
E Xi−1 E E
H 0 − En(0) φ(i)
n = En
(i−j)
φ (j)
n − H 1 φ (i−1)
n . (10.23)
j=0
We can express the ith order correction term to the wave function as a sum
over the 0th to i − 1th terms.
(i)
In order to calculate the ithDeigenenergy correction
D En weD multiply
(0) (0) (0) (0)
equation 10.23 from the left with φn . Using that φn H 0 = φn En
we find
i−1
X D E D E
0= En(i−j) φ(0)
n φn
(j)
− φ(0) (i−1)
n H1 φn . (10.24)
j=0
315
Chapter 10. Time independent perturbation ...
D E E
(0) (j) (j)
We now use that φn φn = δj,0 as we orthogonalized all states φn
E
(0)
to φn to find
D E
φ(0) (0)
n H 0 φn i=0
En(i) = D (0) E . (10.25)
φn H 1 φ(i−1)
n i>0
The ith order correction to the energy is given by the matrix element of H 1
between the i − 1th correction to the state and the 0th order state around
which we make the perturbative E series expansion.
(i)
In order to determine φn we return to equation 10.23 but now mul-
D
(0)
tiply it from the left with φm such that we obtain
D E Xi−1 D E D E
φ(0)
m H 0 − En
(0) (i)
φ n = En
(i−j)
φ (0) (j)
m φ n − φ (0)
m H 1 φ (i−1)
n . (10.26)
j=0
D D
(0) (0) (0)
We use that φm H 0 = φn Em to get
D E i−1
X D E D E
(0)
(Em − En(0) ) φ(0)
m φ(i)
n = En(i−j) φ(0)
m φn
(j)
− φ(0) (i−1)
m H 1 φn .
j=0
(10.27)
E
(0)
As φn forms a complete basis set (Eigenstates of H 0 ) we can write
E X ED E
φ(i6
n
=0)
= φ (0)
m φ (0) (i)
m φ n . (10.28)
m6=n
D E
(0) (i)
We can remove the case where m = n from the sum as φn φn = 0 for
(0)
i 6= 0. Next we use the fact that our state of interest φn is non degenerate
(0) (0) (0) (0)
such that En 6= Em for all m 6= n and divide equation 10.27 by En −Em .
316
E E
(i) (0)
With the combination of the expansion of φn on φm (equation 10.28)
we find
i−1
E X E 1 D E X E
φ(i)
n = φ(0)
m (0) (0)
φ(0)
m
H 1 φ(i−1)
n − En(i−j) φ(j)
n
.
m6=n En − Em j=0
(10.29)
This equation expresses the ith order corrections to the perturbative state
(0)
around φn in terms of the eigenenergy E and eigenstate correctionsEof lower
(i) (i)
order. Once we obtain the states φn we can sum the states φn to find
the unnormalized eigenstates of H
∞
X E
|ψn i = φ(i)
n . (10.30)
i=0
317
Chapter 10. Time independent perturbation ...
(0) 2
D E
(0)
X φ m H φ
1 n
En(2) = (0) (0)
, (10.35)
m6=n E n − Em
φ(2)
n = φ(0)
m2 (10.36)
(0) (0) (0) (0)
m1 6=n m2 6=n En − E m2 E n − Em1
D ED E
X E φ(0)
m H 1 φn
(0) (0)
φn H 1 φn
(0)
(0)
− φm
(0) (0) (0) (0)
m6=n En − Em En − Em
There are two very important conclusions to draw from these equations.
The first conclusion is that ”levels repel each other” when additional inter-
actions are added. If we look at equation 10.35 for the second-order energy
correction then we find that for a state |φn i the states with an energy larger
(0)
than En will lead to a lowering of the energy and states with an energy
(0)
smaller than En will lead to an increase of the state energy. The second
conclusion immediately follows from the first. For the groundstate there
are only states with larger energies in the Hilbert space. We thus find that
the ground-state energy including the 1-st order perturbative correction is
higher than the true ground-state energy.
318
10.1.1 Perturbation example on two-level system
The first example for perturbation theory we discuss uses a model E system E
(0) (0)
with two internal states. As a basis we take the states φ1 and φ2 .
D E D E
(0) (0) (0) (0)
These states have an energy φ1 H φ1 = E1 and φ2 H φ2 =
D E
(0) (0)
E2 > E1 . Furthermore there is a coupling φ1 H φ2 = t between
them. The model described here could be a magnetic atom with a doublet
state (Silver atom) in a strong magnetic field in the z direction responsible
for the energies E1 and E2 and a small field in the x direction responsible
for the interaction t. The full Hamiltonian is
D
(0)
(0)
E
(0)
E φ 1
H = φ1 φ2 · H · D (0) , (10.38)
φ2
with
0 0
H0 = , (10.41)
0 ∆
(0) (0)
∆ = E2 − E1 , and
0 1
H1 = t . (10.42)
1 0
319
Chapter 10. Time independent perturbation ...
state. This will simplify the notation later on. In principle we do not need
to do perturbation theory to find the eigenstates and eigenenergies of H.
We can solve this problem exactly and find
(0) 1 p
E1 = E1 + ∆ − ∆2 + 4t2 (10.43)
2 r
4t2
t √
2 +4t2 2
+1
1 (∆+ ∆ )
|φ1 i = r , − √
4t2 ∆ 2 + 4t2
√
2 +4t2 2
+1
(∆+ ∆ )
(0) 1 p
2 2
E2 = E1 + ∆ + ∆ + 4t
2
r
4t2
t √ 2 + 1
(∆+ ∆2 +4t2 ) 1
|φ2 i = √ ,r .
∆2 + 4t2 4t2
√ 2 + 1
(∆+ ∆ +4t )
2 2
(0) t2 t4 2t6
+ 3 − 5 + O t8
E1 = E1 − (10.44)
∆ ∆ ∆
1 2 1 3 3
|φ1 i = 1, 0 + t 0, − +t − , 0 + t 0, (10.45)
∆ 2∆2 2∆3
11 31
+ t4 , 0 + t5 0, − 5 + O t6
8∆ 4 8∆
(0) t2 t4 2t6
− 3 + 5 + O t8
E2 = E2 + (10.46)
∆ ∆ ∆
320
1 2 1 3 3
|φ2 i = 0, 1 + t , 0 + t 0, − 2 + t − ,0 (10.47)
∆ 2∆ 2∆3
11 31
+ t4 0, + t5 , 0 + O t6 .
8∆ 4 8∆ 5
The different termsE for the states in equation 10.45 and 10.47 are equal to
(i)
the states ϕn as defined in equation 10.7. The order i of the expansion
E
(i)
is given by the power of t, i.e. ϕn is the term proportional to ti . The
E
(i)
states ϕn are not orthonormal to each other. There are infinitely many
E
(i)
states ϕn but the dimension of the Hilbert space is only 2.
With the use of perturbation theory, i.e. following equation 10.29 and
10.25 we find
(0) t2 t4 2t6
+ 3 − 5 + O t8
E1 = E1 − (10.48)
∆ ∆ ∆
1 2 3 1
|ψ1 i = 1, 0 + t 0, − + t 0, 0 + t 0, 3 (10.49)
∆ ∆
2
+ t4 0, 0 + t5 0, − 5 + O(t6 ).
∆
(0) t2 t4 2t6
− 3 + 5 + O t8
E2 = E2 + (10.50)
∆ ∆ ∆
1 2 3 1
|ψ2 i = 0, 1 + t , 0 + t 0, 0 + t − 3 , 0 (10.51)
∆ ∆
2
+ t4 0, 0 + t5 , 0 + O(t6 ).
∆5
321
Chapter 10. Time independent perturbation ...
E
(i)
is given by the power of t, i.e. φn is the term proportional to ti . The
E E
(i) (0)
states φn for i > 0 are orthogonal to the state φn . For i > 0 and
E E
(i) (j)
j > 0 the states φn and φn are not orthogonal to each other. Actually
they are all linear dependent on each other as the E dimension of the Hilbert
(0)
space is 2 and they need to be orthogonal to φn .
|φn i and |ψn i as determined from a Taylor series or perturbation series
represent the same state. They differ in normalization such that hφn |φn i =
1 and
4t2
hψn |ψn i = √ 2 + 1. (10.52)
∆+ ∆2 + 4t2
Equivalently we can write for the overlap with the unperturbed state
D E
ψn φ(0)
n = 1, (10.53)
and
D E 1
φn φ(0)
n =r . (10.54)
4t2
√ 2 +1
(∆+ ∆2 +4t2 )
Range of convergence
We now discuss a very important point of perturbation theory. The range
of convergence. For which values of t and ∆ is the above series expansion
valid. For that we can have a look at the Taylor series expansion of E1 =
(0) √
E1 + 21 ∆ − ∆2 + 4t2 . A seemingly harmless function that is smooth
and infinitely differentiable on the real axis. For the convergence of a Taylor
series expansion we need to look for poles or branch cuts in the complex
√
plain. The function z z ∈ C has a branch cut for z = x with x ∈ R and
x < 0. For x < 0 we find
√ p p
i −x = lim x + iy = − lim x − iy. (10.55)
y→0+ y→0+
322
Figure 10.1: Real (left) and imaginary (right) part of the ground-state
energy of a two level system as given in equation 10.43 for ∆ = 1 and
t = x + iy. In blue for the exact solution and in yellow for a 20th order
perturbation theory. The range of convergence is t = 21 .
For t = iτ we hit the branch cut of the square root when τ > ∆/2. The
Taylor series and thus also the perturbation expansion has a range of con-
vergence for t of ∆/2. The above equations are valid when
∆
t≤ . (10.56)
2
In figure 10.1 we show the energy of the two level ground-state 21 (∆ −
√
∆2 + 4t2 ) as well as the 20th order perturbation theory approximation to
the energy as a function of t in the complex plane. Although the function
is well behaved on the real axis the branch cut for |Im(t)| > 12 yields a finite
convergence radius, also on the real axis.
In general one can state that for a perturbation series to be valid the
coupling strength between two states needs to be small compared to the
energy between these states. There are many interesting cases where this
is not realized.
323
Chapter 10. Time independent perturbation ...
We furthermore take
D E
(0) (0)
φn0 ,k0 φn,k = δk,k0 δn,n0 , (10.58)
E E
(0) (0)
to guarantee orthonormality. For two eigenstates φn0 ,k0 and φn,k with
n0 6= n we take
(0)
En0 6= En(0) . (10.59)
We group all degenerate states together with the same quantum number n.
(0)
The quantum number k labels different states with the same energy En .
For each quantum number n there are Nn orthonormal states with dif-
ferent k. These states form a basis for a Hilbert space. On this finite
dimensional Hilbert space we can express the Hamiltonian H = H 0 + H 1
as a finite dimensional matrix. In order to implement perturbation the-
ory with degenerate states we need to diagonalize
E this finite dimensional
(0)
matrix. We will call the eigenstates φn,κ . For the Hamiltonian matrix
elements we find
D E
D E δκ0 , κ φ(0)
n,κ H 0 + H 1 φ
(0)
n,κ n0 = n
(0)
φn0 ,κ0 H 0 + H 1 φ(0)
n,κ = D E . (10.60)
φ(0)0 0 H 0 + H 1 φ
(0)
n,κ n 0 6= n
n ,κ
324
10.3. Time dependent perturbation theory
We do not have to sum over states with the same n, but different κ. The
states we perturb around already diagonalized the full Hamiltonian
E within
(0)
the Hilbert space with a basis defined by the states φn,k for different k
and the same n.
325
Chapter 10. Time independent perturbation ...
We now insert this expansion of our time dependent state into the Schrödinger
equation 10.63 to find
X ∂
(0) i (0) (0)
−i ~t En −i ~t En
i~ cn (t) e − En cn (t)e |φn i = (10.67)
n
∂t ~
326
10.3. Time dependent perturbation theory
(0)
X t
cn (t)e−i ~ En En(0) + H 1 (t) |φn i ,
n
Simplifying yields
∂ i X t (0) (0)
ck (t) = − cn (t)e−i ~ (En −Ek ) hφk |H 1 (t)|φn i . (10.69)
∂t ~ n
i X t
Z
t0 (0) (0)
ck (t) = ck (0) − cn (t0 )e−i ~ (En −Ek ) φk H 1 (t0 ) φn dt0 . (10.70)
~ n 0
(0)
If H 1 (t) is ”small” we can assume that ck (t) ≈ ck (0). We define ck (t) =
ck (0) and the recursive definition
327
Chapter 10. Time independent perturbation ...
where we can express the first order time derivative of ck (t) at time t = 0
as
∂ i X
ck (t) =− cn (0) hφk |H 1 (0)|φn i , (10.72)
∂t t=0 ~ n
tn = t0 + nδt. (10.76)
We find
328
10.3. Time dependent perturbation theory
N
!
Y
=T U (tn , tn−1 ) .
n=1
Whereby in the last line we introduced the time ordering operator T . Given
the product of two time dependent operators, the time ordering operator
places the operator acting at a later time left of the operator acting at an
earlier time. For example
(
A(t1 )A(t2 ) t1 > t2
T (A(t1 )A(t2 )) = . (10.78)
A(t2 )A(t1 ) t2 > t1
The operators H(tn ) and H(tm ) do not necessarily commute such that
i i i
e− ~ δtH(tn ) e− ~ δtH(tm ) 6= e− ~ δt(H(tn )+H(tm )) . (10.81)
However the time ordering operator T will reorder the operators such that
operators at a later time will be left of operators acting on an earlier time.
As such we find
i i
i
T e− ~ δtH(tn ) e− ~ δtH(tm ) = T e− ~ δt(H(tn )+H(tm )) . (10.82)
329
Chapter 10. Time independent perturbation ...
i Rt
H(t0 )dt0
−
= T e ~ t0 .
Although this is a nice and compact analytical expression for the time evolu-
tion operator we will continue our argumentation with equation 10.80. We
can replace the time evolution operator with a time-dependent Hamiltonian
for a long interval by infinitely many evolutions of an infinitesimally small
interval. Whereby for each infinitesimal small interval the Hamiltonian is
constant in time.
As discussed in the previous section we often deal with a situation
whereby the Hamiltonian is given by a ”large” time independent part H 0
and a ”small” time dependent part H 1 (t) with
Using this dependence of the Hamiltonian we can rewrite the time evolution
operator as
N
!
− ~i δt(H 0 +H 1 (tn ))
Y
U (t, t0 ) = lim T e (10.85)
N →∞
n=1
N !
Y i
= lim T 1 − δt (H 0 + H 1 (tn )) + O(δt2 )
N →∞ ~
n=1
N !
Y i i
= lim T 1 − δtH 1 (tn ) 1 − δtH 0 + O(δt2 )
N →∞ ~ ~
n=1
N !
Y i − ~i δtH 0 2
= lim T 1 − δtH 1 (tn ) e + O(δt ) .
N →∞ ~
n=1
330
10.3. Time dependent perturbation theory
Z t
i i i
− e− ~ (t−t1 )H 0 H 1 (t1 )e− ~ (t1 −t0 )H 0 dt1
~ t0
Z tZ t1
1 i i
− e− ~ (t−t2 )H 0 H 1 (t2 )e− ~ (t2 −t1 )H 0 H 1 (t1 )
~2 t1 t0
i
e− ~ (t1 −t0 )H 0 dt1 dt2
+ ...
∞
−i 2
Z Z
X i
= ··· e− ~ (t−tn )H 0 H 1 (tn ) . . .
~ t≥tn ≥...≥t1 ≥t0
n=0
i i
H 1 (t2 )e− ~ (t2 −t1 )H 0 H 1 (t1 )e− ~ (t1 −t0 )H 0 dt1 dt2 . . . dtn .
The time evolution from the state |ψ(t0 )i to |ψ(t)i for a system with Hamil-
tonian H 0 +H 1 (t) can be written as a sum of operators containing different
orders of H 1 (t). To 0th order in H 1 (t) the time evolution operator is given
by the time evolution operator given by the Hamiltonian H 0 . To 1st order
in H 1 (t) the time evolution operator from t0 to t is given by the time evo-
lution operator given by H 0 from t0 to t1 , multiplied by H 1 (t1 ), multiplied
by the time evolution operator given by H 0 from t1 to t. The nth order is
given by n + 1 time intervals where the time evolution operator is given by
H 0 and n interactions with H 1 (ti ) in between these intervals.
331
Chapter 10. Time independent perturbation ...
U (t, t0 ) = (10.89)
t0 t
U 0 (tj+1 , tj ) = (10.90)
tj tj+1
i
− H 1 (tj ) = H1 . (10.91)
~ tj tj
The product of two equations is represented by writing the pictures next
to each other, connecting the lines at the left and right end of the pictures.
If a variable stands in the middle of a picture we assume that we integrate
over this variable. The integration boundaries are such that the arrows
always point towards a later time. The second term in equation 10.86 can
be written as
i t − i (t−t1 )H 0
Z
i
− e ~ H 1 (t1 )e− ~ (t1 −t0 )H 0 dt1 = H1
~ t0 t0 t1 t1 t
(10.92)
= (10.93)
t0 t t0 t
+ H1
t0 t1 t1 t
+ H1 H1
t0 t1 t1 t2 t2 t
+ ...
332
10.3. Time dependent perturbation theory
=
τj tj τi ti
We remark that we generally read equations from right to left, i.e. the
equation hτi |U (ti , tj )|τj i defines to complex amplitude to travel from state
|τj i at time tj to state |τi i at time ti . Feynman diagrams are read from
left to right, in the direction of the arrow. We will generally reserve arrows
pointing to the left for anti-particles or holes traveling in the positive time
direction. Similar we define for the matrix elements of operator H 1 (t)
i
− hτi |H 1 (tk )|τj i = τi H1 τj (10.97)
~ tk tk
333
Chapter 10. Time independent perturbation ...
H1
=
τj tk τi tk
We find for the complex amplitude to travel from state |ψi at time t0 to
state |φi at time t
= (10.98)
ψt0 φt ψt0 φt
+ H1
ψt0 τ1 t1 τ2 t1 φt
+ H1 H1
ψt0 τ1 t1 τ2 t1 τ3 t2 τ4 t2 φt
+ ...,
= U (φ, t; ψ, t0 ), (10.99)
ψt0 φt
D i
E
= τi e− ~ (ti −tj )H 0 τj , (10.100)
τj tj τi ti
and
i
H1 = − hτi |H 1 (tk )|τj i . (10.101)
τj tk τi tk ~
We implicitly integrate over all time variables that correspond to internal
vertices, i.e. the places where two basic diagrams connect. We furthermore
implicitly sum over all quantum states |τi ihτi |. For continuous basis sets
the sum over |τi ihτi | is replaced by an integral.
334
10.3. Time dependent perturbation theory
measure it. The way measurements work is that we have to make a small
change to the system to be able to observe something. The formal way to
describe measurements is with the use of response theory. We will discuss
response theory in chapter 13. With the use of the perturbation theory we
can derive a very important result from response theory, the transition rate
between states due to a small perturbation H 1 (t).
We assume the following situation. Before t = 0, i.e. for t < 0 the
dynamics of the system is described by the time independent Hamiltonian
H 0 . At time t = 0 we start the measurement and the Hamiltonian changes
to H 0 + H 1 (t). The Hamiltonian H 1 (t) can describe an electric field over
a conducting wire if we want to measure the resulting current, or an oscil-
lating electromagnetic field if we want to measure the optical conductivity.
The Hamiltonian valid for all times is given as
H = H 0 + θtH 1 (t), (10.102)
with θ(t) the Heaviside step function
(
0 t<0
θ(t) = . (10.103)
1 t>0
Note that for t = 0 we can define the function to be 0, 1, 1/2, or another
value one likes. We normally are interested in the integral over this function
such that a single point does not matter.
We want to describe how a wave packet defined at t < 0 changes its
propagation due to H 1 (t). Instead of considering a general wave packet we
can look at the propagation of the eigenstates of H 0 . Due to the linearity
of the Schrödinger equation we can always write the wave packet and its
time evolution as a sum over the eigenstates |φn i of operator H 0 . If we
start at t0 < 0 in the state |φn i we find at time t the state
|ψ(t)i = U (t0 , t) |φn i . (10.104)
If we, at time t measure the energy of this state we find the distribution
function (see equation 6.129 and the discussion in section 6.5)
X
ρ(t, E) = | hφm |ψ(t)i |2 δE,Em (10.105)
m
335
Chapter 10. Time independent perturbation ...
X
= |hφm |U (t0 , t)|φn i|2 δE,Em
m
X
= |U (m, t; n, 0)|2 δE,Em .
m
In the last step we used that for t0 < 0 the Hamiltonian that governs
the dynamics is given by H 0 such that for t0 < 0 we find U (n0 , 0; n, t0 ) =
i
−δn,n0 ~i e− ~ t0 En and we can replace t0 by 0. We can now define the transition
rate from state |φn i to state |φm i as
1
Γm,n (t) = |U (m, t; n, 0)|2 (10.106)
t
1 2
= + H1 + ... .
t φ n 0 φ m t φ n 0 φ τ1 t 1 φτ2 t1 φm t
With
i
= δn,m e− ~ tEn , (10.107)
φn 0 φm t
we find
1 i
Γm,n (t) = δn,m e− ~ tEn (10.108)
t
2
i t − i (t−t1 )Em
Z
i
− e ~ hφm |H 1 (t1 )|φn i e− ~ (t1 )En + . . . dt1 .
~ 0
336
10.3. Time dependent perturbation theory
1 1 D EZ t i
Γm,n (t) = 2 φm H 1 φn e− ~ t1 (En −Em +~ω) dt1 (10.111)
t~ 0
D EZ t i 2
† − ~ t1 (En −Em −~ω)
+ φm H 1 φn e dt1
0
1 D E 1 − e ~i t(En +~ω−Em )
= φm H 1 φn
t Em − En − ~ω
D E 1 − e ~i t(En −~ω−Em ) 2
†
+ φm H 1 φn
Em − En + ~ω
D E 2 t sin( t (E + ω~ − E )) 2
2~ n m
= φm H 1 φn t
~2 2~ (E n + ω~ − E m)
D E 2 t sin( t (E − ω~ − E )) 2
† 2~ n m
+ m H1 n 2 t
2~ (En − ω~ − Em )
~
+ cross terms.
The time dependent transition rate Γm,n (t) has several terms that os-
cillate in time t. If we look at the limit where t is large, these oscillating
parts will cancel. For En ± ~ω − Em = 0 a contribution remains that does
not oscilate. We find
We find first of all that the transition rate between the state |φn i and |φm i
is proportional to the matrix element |hm|H 1 |ni|2 . We furthermore see that
337
Chapter 10. Time independent perturbation ...
one can only have a transition if Em − En = ±~ω. The system can undergo
a transition that costs an energy Em − En only if the frequency of the
driving force is equal to ~ω. If we consider excitations with light, one can
only absorb a photon if the angular frequency of the light, times Planck’s
constant, matches the energy of the excitations. We furthermore see that
the system can also emit a photon if the energy of the state En > Em . The
energy of the emitted photon is En − Em . These are exactly the original
conditions proposed by Einstein for the quantization of light, that lead to
the development of quantum mechanics.
338
Chapter 11
Interaction with
electromagnetic fields
~p2
H= + V (~r), (11.1)
2m
339
Chapter 11. Interaction with electromagnetic fields
d~ri ∂H
= (11.2)
dt ∂~
pi
d~
pi ∂H
=− .
dt ∂~ri
We start from the classical Hamiltonian for a particle in a potential H =
~2
p
2m + V (~r). We now can, with the use of Hamiltons equations, derive
the equations of motion for the particle. We find from the first Hamilton
equation the relation between momentum and velocity of the particle
d~r p~
= , (11.3)
dt m
d~
r
or p~ = m~v , with ~v = dt the velocity of the particle. The second Hamilton
equation yields
d~
p
= −∇H (11.4)
dt
= −∇V (~r)
= F~ .
Combining the two equations yields F~ = m~a, with F~ the force acting on
2
the particle due to the potential V (~r) and ~a = ddt2~r the acceleration of the
particle.
In the next section we will derive the equations of motion from the
Hamilton formalism for a classical charged particle in an electromagnetic
field. We will find a change in the form of the total momentum derived
from the first Hamilton equation. This changed definition of momentum
combined with the second Hamilton equation yields the Lorentz equations.
The Lorentz equations give the relation between the force on the particle
and the electromagnetic fields. In the second section of this chapter, we
will extrapolate the classical Hamilton formalism for electromagnetic fields
to the quantum world. As a result we will add a classical electromagnetic
340
11.1. Electro magnetic fields in Hamilton formalism
field into the Schrödinger equation. From the consideration of black body
radiation, we know that we need to quantize electromagnetic fields. We
will do this in the third section of this chapter. The third section should
be seen as an outlook for future lectures on quantum field theory.
F~ = q E
~ + q~v × B.
~ (11.5)
The equations of motion for a charged particle subject to the Lorentz force
are
F~ = m~a (11.6)
~ + q~v × B.
= qE ~
In this section we want to proof that the Hamiltonian that generates this
force is given by
1 2
~ + qΦ,
H= p~ − q A (11.7)
2m
~ the magnetic vector potential, such that
with Φ the potential and A
~
~ ~r) = −∇Φ(t, ~r) − ∂ A
E(t, (11.8)
∂t
341
Chapter 11. Interaction with electromagnetic fields
~ ~r) = ∇ × A.
B(t, ~
dA~i q ~
~ · ∂ A − q ∂Φ .
m~ai + q = p~ − q A (11.13)
dt m ∂~ri ∂~ri
dA~i ∂A~i ~i · ∂~
r
With dt = ∂t + ∇A ∂t we find
∂Φ ∂A~i q ~
~ · ∂ A − q∇A
~ i · ∂~r
m~ai = −q −q + p~ − q A (11.14)
∂~ri ∂t m ∂~ri ∂t
342
11.2. Classical fields in the Schrödinger equation
~
= qE~ i + q~v · ∂ A − q∇A~ i · ~v
∂~ri
~
= q Ei + q ~v × B ~
i
∂
i~ |ψ(t)i = H |ψ(t)i . (11.15)
∂t
The Hamiltonian for several interacting charged particles with spin subject
to an external electromagnetic field will be written as a sum of several
terms related to the different interactions between charged particles and
electromagnetic fields.
Following the classical Hamiltonian for the kinetic energy and Lorentz
forces between a charged particle and electric field we define the quantum
Hamiltonian as
1 2
~ ~r) + qΦ(t, ~r).
H Lorentz = ~p − q A(t, (11.16)
2m
For N charged particles in an electro magnetic field, one should, besides
the Lorentz force, also consider the Coulomb forces between the particles.
We already encountered these in the previous chapter where we discussed
the hydrogen atom. The Coulomb Hamiltonian between two particles at
343
Chapter 11. Interaction with electromagnetic fields
q1 q2 1
H Coulomb = . (11.17)
4π0 |~r1 − ~r2 |
344
11.3. Gauge invariance
then
q
ψ 0 (t, ~r) = ei ~ Λ(t,~r) ψ(t, ~r) (11.25)
345
Chapter 11. Interaction with electromagnetic fields
equations and modifying the Hamiltonian such that the Maxwell equations
are retrieved one can investigate how a theory can be developed that is
invariant under a local phase change of the wave function. This property
is known as a Gauge symmetry.
Before we look at local phase changes of the wave function, i.e. a Gauge
symmetry, let us discus an almost obvious symmetry in quantum mechanics,
namely a global phase change of the wave function. The symmetry operator
that changes the global phase of a wave function is
The operator T θ multiplies the wave function with a phase. This operator
obviously commutes with the Hamiltonian of the Schrödinger equation
[T θ , H] = 0. (11.28)
∂
hψ(t)|T θ |ψ(t)i = 0. (11.29)
∂t
The operator T θ acting on a state just multiplies the state by a complex
number. We thus find that
∂
hψ(t)|ψ(t)i = 0. (11.30)
∂t
In other words the fact that a phase change of the wave-function does not
change the system leads to a conservation of the norm of the wave-function.
For the Schrödinger equation the norm of the wave-function is related to
the total particle number. A system that is invariant under a global phase
change of the wave-function thus obeys particle number conservation. For
massive charged particles the mass and charge of the system is proportional
to the total number of particles. We thus also find that for a system where
346
11.3. Gauge invariance
the Hamiltonian commutes with a global phase change of the wave function
the total mass and the total charge is conserved.
For a Gauge transformation we do not change the global phase, but
change locally the phase of the wave function. For a continues, differentiable
function Λ(t, ~r) we define the phase change as equation 11.21, ψ(t, ~r)0 =
q
ei ~ Λ(t,~r) ψ(t, ~r). The Gauge symmetry operation that generates this local
phase change is
q
T Λ(t,~r) ψ(t, ~r) = ei ~ Λ(t,~r) ψ(t, ~r). (11.31)
The operator T Λ(t,~r) does not commute with the Hamiltonian defined in
equation 11.22. We can however modify the Hamiltonian, and all other
operators such that all observables remain unchanged for a local phase
transformation. If we change all operators O by
q q
O 0 = ei ~ Λ(t,~r) Oe−i ~ Λ(t,~r) , (11.32)
then
q q q q
ψ10 (t, ~r)∗ O 0 ψ20 (t, ~r) = ψ1 (t, ~r)∗ e−i ~ Λ(t,~r) ei ~ Λ(t,~r) Oe−i ~ Λ(t,~r) ei ~ Λ(t,~r) ψ2 (t, ~r)
(11.33)
= ψ1 (t, ~r)∗ Oψ2 (t, ~r),
i.e. all observables are invariant under a simultaneous local phase transfor-
mation of the wave-function and all operators.
For operators that are functions of the position operator we know that
[~r , T Λ(t,~r) ] = 0 such that
q q
~r 0 = ei ~ Λ(t,~r)~r e−i ~ Λ(t,~r) (11.34)
= ~r .
Operators that only depend on the position of a particle, or on functions
of the position operator do not change under a Gauge transformation.
Next we investigate how the momentum operator transforms under a
Gauge transformation
q q
~p0 = ei ~ Λ(t,~r)~pe−i ~ Λ(t,~r) (11.35)
347
Chapter 11. Interaction with electromagnetic fields
The momentum operator does not transform back to it selves under a Gauge
transformation. Without additional modifications to the Schrödinger equa-
tion quantum mechanics would not be invariant under a local phase change
of the wave function. We can make the equation invariant if one looks at
the Gauge transformation of ~p − q A(t, ~ ~r), with A(t,~ ~r) the electromagnetic
vector field. The electromagnetic vector field is a function of ~r and as such
does not change under a local phase transformation. The operator ~p does
change under a Gauge transformation. The additional term, −q∇Λ(t, ~r)
is the gradient of a continues function. The electromagnetic vector field
can be changed by the gradient of a continues function without changing
physical observables, such that the combined operator
~ 0 (t, ~r) = ei ~q Λ(t,~r) ~p − q A(t,
~p0 − q A ~ ~r) e−i ~q Λ(t,~r) (11.36)
= ~p − q A(t, ~ ~r) + ∇Λ(t, ~r) ,
348
11.3. Gauge invariance
∂ 1 2
i~ ψ 0 (t, ~r) = ~ ~r) + ∇Λ(t, ~r)
~p − q A(t,
∂t 2m
∂Λ(t, ~r)
+q Φ(t, ~r) + ψ(t, ~r)0 .
∂t
q
The local change in the phase between ψ(t, ~r) and ψ(t, ~r)0 = ei ~ Λ(t,~r) ψ(t, ~r)
changes the effect of the position and time derivatives. These changes can
be absorbed into the definitions of the electro magnetic vector field A(t, ~ ~r)
and Φ(t, ~r). As a result if ψ(t, ~r) is a solution of the Schrödinger equation
then ψ 0 (t, ~r) is a solution of the transformed Schrödinger equation, and
both describe the same physics. We call a theory for which the physics and
dynamics of the system does not change, i.e. is invariant under a continues
local transformation a Gauge theory. We have seen that in order to turn
the Schrödinger equation that describes electrons into a Gauge theory we
needed to add the electro magnetic field in order to absorb the changes in
the derivatives under a local phase change. This is a general property we
will find in later lectures when describing other elementary particles. All
particles and forces in the standard model are described by Gauge theories
of the form we introduced in this section.
349
Chapter 11. Interaction with electromagnetic fields
(σ)
with ~k the wave vector of the field, c~ (t) the time-dependent expansion
k
(σ)
coefficients of the field on the plane wave basis, and ~~ the polarization
k
vector of the electromagnetic field. We can choose two possible polarization
vectors that are perpendicular to each other and perpendicular to the wave
(σ)
vector ~~ · ~k = 0. The field is zero outside the volume V and in order to
k
be continuous also zero at the boundary of the box. As a result the allowed
wave vectors are discrete
~k = (nx , ny , nz ) 2π (11.42)
L
with n ∈ Z. The quantization of the wave vector is not the one leading to
the definition of photons but needed to make the wave functions normaliz-
able. The quantization of ~k will vanish once we take the limit of L → ∞.
350
11.4. Quantized fields in the Schrödinger equation
The operator b† ~ creates a photon with polarization σ and wave vector ~k,
σ,k
the operator b ~ annihilates a photon from the same state. The commu-
σ,k
tation relations for the photon creation and annihilation operators are
[b ,b ] =0 (11.44)
σ,~k σ 0 ,~k0
[b† ~ , b† 0 ~ 0 ] =0
σ,k σ ,k
[b ~ , b† 0 ~ 0 ] = δ~k,~k0 δσ,σ0 .
σ,k σ ,k
The operators for the electro and magnetic field can be derived from this
and are respectively
r
~ω ~ ~
~σ b ~ eik·~r − ~σ∗ b† ~ e−ik·~r
XX
~ r) = i
E(~ (11.46)
σ
2V 0 σ,k σ,k
~k
r
~ω ~ ~ ~
k × ~σ b ~ eik·~r − ~k × ~σ∗ b† ~ e−ik·~r
XX
~ r) = i
B(~
σ
2V 0 σ,k σ,k
~k
351
Chapter 11. Interaction with electromagnetic fields
1 ~ r)
2
H= ~p − q A(~ (11.48)
2m
q ~ ~
−g S · B(~r)
2m
XX † 1
+ ~ω b ~b + .
σ,k σ,~k 2
~k σ
The first term in this operator describes the interaction between the
particle and the electro magnetic field. The operator A(~~ r) allows for the
creation (emission) and annihilation (absorption) of photons. Eigenstates
of this Hamiltonian are states with a single particle and a variable number
of photons. The electromagnetic field generated from a moving particle
follows naturally from this Hamiltonian. The second term describes the
interaction of the particle spin with the magnetic field. The last term de-
scribes the energy of the photons. The Hamiltonian for multiple charged
particles follows by summing the first two terms of equation 11.48 for each
particle. The last term describes the electromagnetic photon field, which is
the same for all particles. The Coulomb interaction between two particles
can be seen as the interaction of one particle with the electro magnetic
field generated by the other particle. As such we do not need to add the
Coulomb interaction to this Hamiltonian as we did for the case of the in-
teraction between charged particles and a classical electro magnetic field.
Each particle will generate an electro magnetic field by creating and an-
nihilating photons. The other particles will move through this field and
352
11.5. Zeeman effect
interact with this field by creating and annihilating photons. The resulting
interaction is the Coulomb force.
The future study of this Hamiltonian is part of quantum field theory,
which is one of the lectures you can follow in your future studies. We
should note that equation 11.48 is not fully Lorentz invariant. This can
lead to problems and incorrect answers. The correct way would be to do
the quantization of the electromagnetic field on top of the Dirac equation.
Done properly this leads to the equations of quantum electro dynamics.
~ =∇×A
B ~ (11.50)
= Bz ẑ.
1 ~ −g q S
2
~ ·B ~
H= ~p − q A (11.51)
2m 2m
2
1 1 q
= ~p − q (xBz ŷ − yBz x̂) − g S z Bz
2m 2 2m
1 2 q2 2 2 q q
Bz (x + y 2 ) −
= ~p + xpy − ypx Bz − g S z Bz
2m 8m 2m 2m
1 2 q2 2 2 q q
= ~p + Bz (x + y 2 ) − L z Bz − g S z Bz .
2m 8m 2m 2m
353
Chapter 11. Interaction with electromagnetic fields
The first term is the kinetic energy, the second term is related to the in-
ternal energy of the magnetic field, the third and fourth terms define the
interaction of the orbital and spin magnetic moments in the system with
the external magnetic field. In chapter 8.6.2 we inferred the interaction
between the magnetic moments of the hydrogen atom with a magnetic field
from classical arguments. In this section we have given it a theoretical
foundation.
354
Chapter 12
Relativistic effects
355
Chapter 12. Relativistic effects
Klein Gordon equation was published, this equation does not yield predic-
tions for the relativistic corrections to the hydrogen atomic energy levels in
agreement with experiment. In the end it was Dirac who proposed the cor-
rect form for the relativistic generalization of the Schrödinger equation for
electrons [81]. With his equation Dirac was able to include the spin of elec-
trons in a natural way and find relativistic corrections to the eigenenergies
of the hydrogen atom.
In this chapter we will first describe the general idea behind the Dirac
equation. After we formulated the Dirac equation we will have a look at the
exact solutions for a particle in free space. We then test if the Dirac equation
is compatible with the Schrödinger equation and look at the non-relativistic
limit of the Dirac equation. Once we convinced ourselves that the Dirac
equation gives a mere correction to the Schrödinger equation we will derive
the solution for the hydrogen atom. Once we know the eigenstates of the
hydrogen atom we will discuss the Lamb shift [82]. This is a small shift
of the electronic levels of the hydrogen atom due to the electro-magnetic
vector potential. The challenge of calculating the Lamb shift will bring us
to the formulation and basic ideas behind quantum electro dynamics. We
will not go into the details of the calculation, but will lay out the general
framework.
356
12.1. The Dirac equation
∂ ~2 2
i~ φ(~r, t) = − ∇ φ(~r, t). (12.2)
∂t 2m
The simplest assumption for a relativistic extension of the Schrödinger
equation would be to start from the relativistic energy momentum relation
for a massive particle
E 2 = c2 p~2 + m2 c4 . (12.3)
∂
We again can substitute E by the operator i~ ∂t and p~ by the operator
−i~∇. We let both sides act on a wave function φ(~r, t) and obtain the
differential equation
∂2
−~2 φ(~r, t) = (−c2 ~2 ∇2 + m2 c4 )φ(~r, t). (12.4)
∂t2
The resulting equation is the Klein Gordon equation. As Schrödinger al-
ready noted, this equation does not describe the relativistic corrections for
the energy levels of the hydrogen atom correctly.
In order to come to his equation, Dirac focused on two problems within
the Schrödinger equation. Firstly, the Schrödinger equation is not Lorentz
invariant, secondly the equation does not describe the spin of the electrons.
In the words of Dirac[81]: ”The discrepancies consist of duplexity phenom-
ena, the observed number of stationary states for an electron in an atom
357
Chapter 12. Relativistic effects
being twice the number given by the theory. To meet the difficulty, Goudsmit
and Uhlenbeck have introduced the idea of an electron with a spin angular
momentum of half a quantum and a magnetic moment of one Bohr magne-
ton [75]. This model for the electron has been fitted into the new mechanics
by Pauli [83] and Darwin [84].” Pauli and Darwin incorporated the spin of
an electron into the Schrödinger equation by writing the wave-function for
an electron as a two component vector. Each component of this vector is
a solution of its own Schrödinger equation. Magnetic fields and the spin-
orbit coupling can mix the two different components of the wave-function.
The doubled Schrödinger equation that incorporates the spin in the form
of Pauli and Darwin becomes:
∂
i~ ∂t 0 φ↑ (~r, t)
∂ · = (12.5)
0 i~ ∂t φ↓ (~r, t)
p2
!
~ e ~ ~
2m + V (~
r ) − 2m B · L 0
p2
~ e ~ ~
0 r) − 2m
2m + V (~ B·L
1 e Bz Bx + iBy
−g ~
2 2m Bx − iBy −Bz
!
1 Lz Lx + iLy φ↑ (~r, t)
+ζ ~ · ,
2 Lx − iLy −Lz φ↓ (~r, t)
with B ~ =
~ an external magnetic field, V (~r) an external electric potential, L
~r × ~p the angular momentum of the particle, and g and ζ experimentally
determined constants.
With these considerations in mind Dirac looked for a relativistic exten-
sion of the Schrödinger equation that correctly describes the energy levels
of an hydrogen atom. Dirac noted that the Schrödinger equation is linear
in the derivative of time. He searched for a similar form for the relativistic
equation. Dirac thus set himself the task to find an equation linear in the
energy that would reproduce the equation that E 2 = m2 c4 + c2 p~2 . For this
he wrote
X3
E=c αi p~i + α4 mc2 , (12.6)
i=1
358
12.1. The Dirac equation
αi αj + αj αi = 2δi,j . (12.8)
359
Chapter 12. Relativistic effects
The three Hermitian matrices that fulfill equation 12.11 are the Pauli ma-
trices:
0 1 0 −i 1 0
σx = , σy = , and σz = . (12.12)
1 0 i 0 0 −1
0 0 0 −i
0 0 i 0
0 −i 0 0 ,
α2 = αy = (12.14)
i 0 0 0
360
12.1. The Dirac equation
0 0 1 0
0 0 0 −1
α3 = αz =
1 0
, (12.15)
0 0
0 −1 0 0
1 0 0 0
0 1 0 0
α4 = . (12.16)
0 0 −1 0
0 0 0 −1
We now can replace the energy and momentum in equation 12.6 by opera-
tors as done within the Schrödinger equation. The Dirac equation becomes
3
!
∂ X ∂ 2
i~ φ(~r, t) = −i~c αi + α4 mc φ(~r, t). (12.17)
∂t ∂~ri
i=1
or
3
!
∂ X
i~ φ(~r, t) = c αi~pi + α4 mc2 φ(~r, t) (12.18)
∂t
i=1
α · ~p + α4 mc2 φ(~r, t),
= c~
α · ~p + α4 mc2 .
H D = c~ (12.19)
361
Chapter 12. Relativistic effects
The components of the wave function describe the 4 basis states that can
be used to create an electron or positron with spin up or spin down. The
first two components are called the large component, the second two the
small component of the wave function. The first and third index refer to
spin up, the second and forth to spin down. As we will show later in this
chapter only for a particle at rest the four basis states of the vector refer
to eigenstates of the Dirac equation and only in this case the basis states
corresponds to an electron or positron with either spin up or down. For
other situations physical particles are linear combinations of the 4 basis
states.
362
12.3. Properties of the Dirac α matrices and alternative
notations
αi0 = u† αi u, (12.25)
such that the new Dirac matrices αi0 can be used to factorize the quadratic
energy momentum dispersion relation. The unitary transformation u repre-
sents a basis transformation of the Dirac four-vectors. In the choice made
by Dirac the components relate to the large and small part of the wave
function with either spin up or down in the z direction. After a transfor-
mation of α to α0 by the unitary matrix u the new basis is given by uT
times the old basis.
One property that all choices of the Dirac matrices have in common is
their anti-commutation relations. By doing 32 matrix multiplications it is
straightforward to show that for all unitary matrices u
and thus
γ4 = α4 , (12.28)
γ1 = iα1 α4 , α1 = iγ4 γ1 ,
γ2 = iα2 α4 , α2 = iγ4 γ2 ,
γ3 = iα3 α4 , α3 = iγ4 γ3 .
363
Chapter 12. Relativistic effects
The Dirac matrices can be written as two by two block matrices of two
by two matrices. Each block then refers to the large or small component of
the Dirac four-vector. Within a block the components refer to spin up or
down. By doing so one can rewrite the Dirac matrices with the use of the
Pauli matrices. One finds for i ∈ [1, 2, 3]
0 σi
αi = , (12.30)
σi 0
and
1 0
α4 = . (12.31)
0 −1
364
12.4. Solutions for a free particle
r 0
E + mc2 1 r 1 i
φ2 (~r, t) = e ~ (~p·~r−Et) ,
(~px −i~py )c
2E E+mc2 V
−~
pz c
E+mc2
p
~z c
r E−mc2 r
E− mc2 x py2)c
(~
p +i~ 1 i (~p·~r−Et)
φ3 (~r, t) = E−mc e~ ,
2E 1 V
0
px −i~
(~ py )c
r E−mc2 r
E− mc2 −~pz c 1 i (~p·~r−Et)
φ4 (~r, t) = 2
e~ .
E−mc
2E V
0
1
The variable E is the eigenvalue of the Dirac Hamiltonian. For state φ1 (~r, t)
and state φ2 (~r, t) we find
p
E1,2 = p2 c2 + m2 c4 . (12.35)
365
Chapter 12. Relativistic effects
energy E. The states given by wave functions φ3 (~r, t) and φ4 (~r, t) describe
366
12.6. The non-relativistic limit
electron holes in the vacuum, i.e the positrons. The charge of a hole is
positive and it costs energy to create them.
To summarize we find that for p~ = 0 the state given by the wave function
φ1 (~r, t) describes an electron with spin up in the z direction, the wave
function φ2 (~r, t) describes an electron with spin down in the z direction, the
wave function φ3 (~r, t) describes a positron with spin up in the z direction,
and the wave function φ4 (~r, t) describes a positron with spin down in the
z direction. For finite momentum we still have 4 particles and 4 basis
states. The eigenstates of the Dirac equation at finite momentum do not
correspond to the basis states, but are given as superpositions of these
states. An electron with spin up is thus given as a linear superposition of
the 4 basis states used to write down the Dirac equation.
and
φ↑,s (~r, t)
φs (~r, t) = , (12.38)
φ↓,s (~r, t)
367
Chapter 12. Relativistic effects
for the large and small part of the wave function. With the use of the Pauli
matrices
0 1
σ1 = , (12.39)
1 0
0 −i
σ2 = ,
i 0
1 0
σ3 = ,
0 −1
whereby we introduced the vector notation for the Pauli matrices such that
3
X
~ r, t)) =
~σ · (~p + eA(~ ~ i (~r, t)).
σi (~pi + eA (12.41)
i=1
− E − V (~r) − mc2
c~σ · ~p φl (~r, t)
· = 0. (12.42)
E − V (~r) + mc2
−c~σ · ~p φs (~r, t)
368
12.6. The non-relativistic limit
The second equation shows that once, for stationary states, the large part
of the wave function is known also the small part is uniquely determined. If
we substitute the second equation into the first we find a single differential
equation for the large part of the wave function
c2
E − V (~r) − mc2 φl (~r, t) = ~σ · ~p
~σ · ~p φl (~r, t). (12.47)
E − V (~r) + mc2
The result is
c2
E (0) − V (~r) φl (~r, t) = ~σ · ~p ~σ · ~p φl (~r, t). (12.49)
E (0) − V (~r) + 2mc2
We now assume that the kinetic energy of the electron E (0) − V (~r) is small
compared to the energy needed to create a particle hole excitation (electron
positron pair production) at zero momentum i.e. 2mc2 . In this limit we
can make a Taylor series in (E (0) − V (~r))/(2mc2 ) for the right hand side of
the equation. For the energy dependent term this yields
∞
!n
c2 1 X E (0) − V (~r)
= (−1)n . (12.50)
E (0) − V (~r) + 2mc2 2m 2mc2
n=0
369
Chapter 12. Relativistic effects
If we keep the terms linear in E (0) we find for the Dirac equation on the
large part of the wave function
!
1 E (0) − V (~
r , t)
E (0) − V (~r) φl (~r, t) = ~σ · ~p 1 − ~σ · ~p φl (~r, t).
2m 2mc2
(12.51)
If we rearrange terms, realizing that ~p and V (~r) do not commute, and use
that (~σ · ~p)2 = ~p2 we find
1 2 1 1 2 (0)
~p + V (~r) + ~σ · ~p V (~r)~σ · ~p − ~p E φl (~r, t)
2m 4m2 c2 4m2 c2
= E (0) φl (~r, t). (12.52)
370
12.6. The non-relativistic limit
Whereby ~p2 is an operator, such that φe (~r, t) is not just a scaling of φl (~r, t)
but a different linear independent function. For φl (~r, t) we can write
−1/2
1 2
φl (~r, t) = 1+ ~p φe (~r, t) (12.59)
4m2 c2
!
~p2
≈ 1− φe (~r, t).
8m2 c2
We can now substitute the relation between the large part of the wave
function and the electron wave function into the Dirac equation 12.52 and
371
Chapter 12. Relativistic effects
find the Dirac equation acting on the electron wave function to first order
in 4m12 c2 to be
1 2 1 1 2 (0)
~p + V (~r) + ~σ · ~p V (~r)~σ · ~p − ~p E
2m 4m2 c2 4mc2
! !
~p2 (0) ~p2
× 1− φe (~r, t) = E 1− φe (~r, t). (12.60)
8m2 c2 8m2 c2
In order
to simplify
this equation we multiply the equation from the left
p2
~
with 1 − 8m2 c2
!
~p2
1 2 1 1 2 (0)
1− ~p + V (~r) + ~σ · ~p V (~r)~σ · ~p − ~p E
8m2 c2 2m 4m2 c2 4mc2
! ! !
~p2 ~p2 ~
p 2
× 1− φe (~r, t) = 1 − E (0) 1 − φe (~r, t).
8m2 c2 8m2 c2 8m2 c2
(12.61)
1
Keeping terms to first order in mc 2 we find
1 2 1 1
~p − ~p4 + V (~r) + ~σ · ~p V (~r)~σ · ~p (12.62)
2m 8m3 c2 4m2 c2
~p2 ~p2
− V (~r) − V (~r) 2 2 φe (~r, t) = E0 φe (~r, t)
8m2 c2 8m c
For the forth term of equation 12.62 we can write
1 1
~σ · ~p V (~r)~σ · ~p = (~σ · (~p V (~r))~σ · ~p + V (~r)~σ · ~p ~σ · ~p)
4m2 c2 4m2 c2
1
−i~~σ · (∇V (~r))~σ · ~p + V (~r)~p2
=
4m2 c2
1
~ σ · ~p + V (~r)~p2
= −ie~~σ · E~
4m2 c2
1
~ · ~p + ~e~σ · E
~ × ~p + V (~r)~p2 .
= −ie~E
4m2 c2
(12.63)
372
12.6. The non-relativistic limit
In the last step we used the identity for the Pauli matrices that
~ σ · ~p = E
~σ · E~ ~ · ~p + i~σ · E
~ × ~p, (12.64)
which one can derive using the commutation relations of the Pauli matrices,
or check to be valid by writing out all terms. For the fifth term of equation
12.62 we write
1 1
− 2 2
~p2 V (~r) = ~2 ∇2 V (~r) (12.65)
8m c 8m2 c2
1
= ~2 ∇ ((∇V (~r)) + V (~r)∇)
8m2 c2
1
~2 (∇2 V (~r)) + 2(∇V (~r))∇ + V (~r)∇2
= 2 2
8m c
1 2 2
~ · ~p − V (~r)~p2 .
= ~ (∇ V (~
r )) + 2ie~E
8m2 c2
The resulting Hamiltonian is
1 2 1 4 ~2 e~ ~ × ~p.
H= ~p − 3 2
~
p + V (~
r ) + 2 2
(∇2 V (~r)) + ~σ · E
2m 8m c 8m c 4m2 c2
(12.66)
373
Chapter 12. Relativistic effects
~ = ∂
eE V (r)r̂. (12.67)
∂r
~ due to a spherical potential into equation
If we substitute the electric field E
12.66 we find for the spin-orbit coupling term
1 1 ∂ ~ · L,
~
H SOC = V (r) S (12.68)
2m2 c2 r ∂r
for a state with quantum numbers τ we can discuss the effect of the spin
orbit coupling Hamiltonian on the angular part of the wave function. The
integrals over the radial wave function determine the coupling strength
1 1 ∂
ζτ = Rτ (r) V (r) R τ (r) (12.70)
2m2 c2 r ∂r
of the spin-orbit coupling. The operator acting on the angular part of the
wave function then becomes
~ · L.
H SOC = ζτ S ~ (12.71)
374
12.6. The non-relativistic limit
0.500
0.100 5p
5d
ℏ2 ζnl [eV]
0.050 4f
4p
4d
0.010
3d
0.005 3p
2p
0.001
0 20 40 60 80
Element number Z
Figure 12.1: The atomic values for the spin-orbit coupling constant ~2 ζn,l
for the valence electrons of the different elements.
375
Chapter 12. Relativistic effects
and numerically. With the radial wave function and the effective potential
one can calculate the spin-orbit coupling constant, ζnlj as given in equation
12.70.
In figure 12.1 we show the size of the spin-orbit coupling constant for the
valence electrons of the elements in the periodic table. We approximated
the N = Z electron ground state as the single Slater determinant repre-
sentable state with lowest energy. For Boron, with one electron in the 2p
shell the coupling constant is the smallest and slightly larger than 1 meV.
One generally finds that the heavier the element the larger the spin-orbit
coupling constant is. We furthermore find that each time a new shell starts
the coupling constant is considerably reduced. We can understand these
trends from basic considerations. Spin-orbit coupling is a relativistic effect.
It should become larger when the electrons have large velocities. The larger
the nuclear charge, the faster a classical electron will rotate around the nu-
cleus and thus the more important relativistic corrections become. If one
starts with a new shell, then the electrons of the previous shell will be closer
to the nucleus and thus shield the nuclear potential. The effective potential
seen by the outer electrons is thus smaller, thereby reducing the angular
velocity of the electrons. Although a picture based on classical behaviour
of the electrons, it does give a basic feeling of what happens.
The numerical values presented here agree well with experimental val-
ues. These values are however notably different from theoretical values one
obtains when neglecting the interactions between the electrons. One can
compare the numerical results including the interaction between electrons
to the values one finds for a single electron in a potential of the form
Ze
V (r) = . (12.72)
4πr
This is the potential one finds for an effective H atom with nuclear charge
Z. The radial wave function in this case is given as
l+ 3 s
Z 2 (n − l − 1)! − arZn sl+1 2rZ
Rn,l (r) = 2l+1 n−2−l rl e 0 Ln−l−1 ( ). (12.73)
a0 (n + l)! a0 n
376
12.7. Symmetries and conservation laws for the Dirac
equation
This value is about 10 times larger for the last element of each sub shell than
the numerical values we obtained. Furthermore is about 100 times larger
than the numerical values for the first element of each sub shell. This
discrepancy is due to the screened effective potential as described above.
The analytical result does explain the jumps in the trend each time a new
sub-shell is started. The coupling constant scales with Z 4 , but also with
1/n3 and 1/l3 .
with ~p = −i~∇, and V (~r, t) = −eΦ(~r, t). By the Noether theorem sym-
metry operators that commute with the Hamiltonian relate to a conserved
quantity (see chapter 7 for more detail).
377
Chapter 12. Relativistic effects
378
12.7. Symmetries and conservation laws for the Dirac
equation
∂
ρ(~r, t) = −∇ · ~j(~r, t), (12.78)
∂t
with for the Schrödigner equation the density defined as
In order to define the current density we can look for a definition that fulfills
the continuity equation. We have
∂
−∇ · ~j(~r, t) = ρ(~r, t) (12.82)
∂t
∂
= φ(~r, t)† φ(~r, t)
∂t
∂ † † ∂
= φ(~r, t) φ(~r, t) + φ(~r, t) φ(~r, t)
∂t ∂t
379
Chapter 12. Relativistic effects
380
12.7. Symmetries and conservation laws for the Dirac
equation
p~ = γm~v . (12.89)
~j(~r, t) = p~
(12.91)
E/c2
p~
=
mγ
= ~v .
The current density of a plane wave solution represent the velocity of the
particle.
381
Chapter 12. Relativistic effects
which is non-zero.
For the spin angular momentum we define the Pauli matrices acting on
a Dirac four-vector as
0 1 0 0
1 0 0 0
Σx = iα3 α2 =
0 0 0 1 ,
(12.93)
0 0 1 0
382
12.7. Symmetries and conservation laws for the Dirac
equation
0 −i 0 0
i 0 0 0
0 0 0 −i ,
Σy = iα1 α3 =
0 0 i 0
1 0 0 0
0 −1 0 0
Σz = iα2 α1 =
0 0 1 0 .
0 0 0 −1
The spin operator on the same basis as the one defined for the Dirac α
matrices then has the matrix representation
~ = 1 ~Σ.
S ~ (12.94)
2
With these matrices and the definition of the spin operator on the basis
of a Dirac four-vector we can calculate the commutation relations between
the spin operator and the Dirac Hamiltonian
h i 1
~ = c~
HD, S 2
α · ~p + mc α4 , ~Σ ~ (12.95)
2
3 iα3 α2
X 1
= c αi~pi + mc2 α4 , ~ iα1 α3
2
i=1 iα2 α1
3 α3 α2
1 X
= i~c ~pi αi , α1 α3
2
i=1 α2 α1
αy ~pz − αz ~py
= i~c αz ~px − αx~pz
αx~py − αy ~px
α × ~p.
= i~c~
Neither the spin nor the orbital angular momentum is conserved within
the Dirac equation. Comparing equation 12.92 and 12.95 it is straight-
forward to see that the sum of the spin and orbital angular momentum
383
Chapter 12. Relativistic effects
~ =L
J ~ +S ~ does commute with H D
h i
~ = 0.
HD, J (12.96)
The operator L ~ · ~p acting on any state gives zero. If the orbital angular
momentum along a certain direction is zero, then the total angular momen-
tum along this direction is given by the spin angular momentum. Whilst
the total angular momentum is conserved we should expect that the spin
angular momentum along the direction the particle travels is a conserved
quantity. This is known as the helicity of the particle. For systems where
~ r, t) and Φ(~r, t) are constants such that [H D , ~p] = 0 we find that
A(~
h i h i
HD, S ~ · ~p = H D , S~ · ~p (12.98)
αy ~pz − αz ~py
= i~c αz ~px − αx~pz · ~p
αx~py − αy ~px
= 0.
384
12.7. Symmetries and conservation laws for the Dirac
equation
385
Chapter 12. Relativistic effects
~2
2 ~ ~
[H D , K] = H D , α4 S·J − (12.101)
~ 4
~2 ~2
2 ~ ~ 2 ~ ~
= [H D , α4 ] S·J − − α4 S·J − , HD
~ 4 ~ 4
~2
2 ~ ~ 2 h~ i
~,
= [H D , α4 ] S·J − − α4 S, HD · J
~ 4 ~
whereby we used that for 3 operators A, B and C it holds that [A, BC] =
[A, B]C − B[C, A] and that [H D , J~ ] = 0. The commutator between H D
~ is given in equation 12.95. The commutator between H D and α4 is
and S
α · ~p, α4 ]
[H D , α4 ] = [c~ (12.102)
" 3 #
X
= c αi~pi , α4
i=1
3
X
=c ~pi [αi , α4 ]
i=1
3
X
= −2cα4 αi~pi
i=1
= −2cα4 α
~ · ~p,
~2
4 ~ ~ ~
[H D , K] = − cα4 α
~ · ~p S · J − α × ~p) · J
+ α4 2ic (~ (12.103)
~ 4
386
12.7. Symmetries and conservation laws for the Dirac
equation
3 X
3 3
X
~ j + i (~ ~ −
X ~
= −2cα4 αi~pi Σj J α × ~p) · J αi~pi
2
i=1 j=1 i=1
α1 α3 α2 α2 α3 α2 + α3 α3 α3 α2 − α2
= −2cα4 i~p · α1 α1 α3 − α3 α2 α1 α3 ~
α3 α1 α3 + α1 · J
α1 α2 α1 + α2 α2 α2 α1 − α1 α3 α2 α1
~
−~ α · ~p
2
~ +α ~
= 2cα4 iα1 α2 α3~p · J ~ · ~p
2
~ ~ ~
= 2cα4 iα1 α2 α3~p · (L + S) + α ~ · ~p
2
~ +α ~
= 2cα4 iα1 α2 α3~p · S ~ · ~p
2
~ ~ +α
= 2cα4 iα1 α2 α3~p · Σ ~ · ~p
2
α3 α2
~
= 2cα4 −α1 α2 α3 α3 α2 · ~p + α ~ · ~p
2
α3 α2
α1
~
= 2cα4 − α2 · ~p + α ~ · ~p
2
α3
= 0.
387
Chapter 12. Relativistic effects
h i h i
K, J ~ = 0, which is what we wanted to show in the first place. For α4 , J
~
we find
h i h i
~ = α4 , L
α4 , J ~ +S ~ (12.105)
h i
~
= α4 , S
1 h ~
i
= ~ α4 , Σ
2
= 0.
388
12.8. Solutions for the hydrogen atom
In summary we have shown that the operator K = α4 ~2 S ~ − ~2
~ ·J
4
commutes with both the Dirac Hamiltonian H D and the total angular
~.
momentum of the system J
389
Chapter 12. Relativistic effects
390
12.8. Solutions for the hydrogen atom
∂ ∂x ∂ ∂y ∂ ∂z ∂
−i~ = −i~ − i~ − i~ (12.116)
∂r ∂r ∂x ∂r ∂y ∂r ∂z
∂x ∂y ∂z
= p + p + p
∂r x ∂r y ∂r z
1
= ~r · ~p
r
We now write out the αr as a matrix
0 0 z x − iy
1 0 0 x + iy −z
αr = , (12.117)
r z x − iy 0 0
x + iy −z 0 0
(12.118)
391
Chapter 12. Relativistic effects
Z X − iY 0 0
i X + iY −Z 0 0
= ,
r 0 0 Z X − iY
0 0 X + iY −Z
with (X, Y, Z) = ~r × ~p. We now multiply this equation from the left with
αr to find
(ixy + y 2 + z 2 )px +
0 0 −xzpx − yzpy + (x2 + y 2 )pz (−xy − ix2 − iz 2 )py −
(x − iy)zpz
(−ixy + y 2 + z 2 )px +
i~ 1
0 0 (−xy + ix2 + iz 2 )py − xzpx + yzpy − (x2 + y 2 )pz
αr Σ · (~r × ~p) = 2 (x + iy)zpz
.
(ixy + y 2 + z 2 )px +
r r
−xzpx − yzpy + (x2 + y 2 )pz (−xy − ix2 − iz 2 )py − 0 0
(x − iy)zpz
(−ixy + y 2 + z 2 )px +
(−xy + ix2 + iz 2 )py − xzpx + yzpy − (x2 + y 2 )pz
0 0
(x + iy)zpz
(12.120)
We now add equation 12.118 to equation 12.120 together. We find for the
right hand side of equation 12.111 the expression
i 1 0 0 (x + y + z )p (x + y + z )(p − ip )
2 2 2
z
2 2 2
x y
r r (x + y + z )(p − ip ) (x + y + z )p
2 2 2
x
z
y 0 2 20
x
2
z
y
0 0 pz (px − ipy )
=
0
pz
0
(px − ipy )
(px − ipy )
0
pz
0
(px − ipy ) pz 0 0
~ · ~p,
=α (12.121)
which proofs that equation 12.111 indeed represents the kinetic energy term
of the Dirac equation in spherical coordinates. Putting it all together we
find for the Dirac equation represented on spherical coordinates
i
H D = cαr pr + α4 K + α4 mc2 + V (r). (12.122)
r
392
12.8. Solutions for the hydrogen atom
393
Chapter 12. Relativistic effects
Σ1 Σ1 = −α3 α2 α3 α2 (12.125)
= 1,
Σ1 Σ2 = −α3 α2 α1 α3 (12.126)
= −α2 α1
= iΣ3 ,
~ ×L
Note that L ~ 6= 0, but
Ly Lz − L z L y
~ ×L
L ~ = L z Lx − L x Lz (12.128)
L x Ly − L y L x
[Ly , Lz ]
= [Lz , Lx ]
[Lx , Ly ]
~
= i~L,
such that
K 2 = L2 − ~Σ ~ + 2~Σ
~ ·L ~ + ~2
~ ·L (12.129)
~ + ~2 .
~ ·L
= L2 + ~Σ
J 2 = (L + S)2 (12.130)
394
12.8. Solutions for the hydrogen atom
~ ·L
= L2 + 2 S ~ + S2
= L2 + ~Σ ~ + 3 ~2 ,
~ ·L
4
such that we, by comparison of K 2 to J 2 find the expression,
1
K 2 = J 2 + ~2 (12.131)
4
With this relation we can rewrite κ in terms of j. If ψ(~r) is an eigenstate
of H D , then ψ(~r) is also an eigenstate of K and J 2 . With eigenvalues −κ~
and j(j + 1) respectively. We now can use the expression of K 2 in terms
of J 2 to find
K 2 ψ(~r) = κ2 ~2 (12.132)
1
= j(j + 1)~2 + ~2 .
4
It follows that
1
κ2 = j(j + 1) + , (12.133)
4
or
1
κ=± j+ . (12.134)
2
395
Chapter 12. Relativistic effects
2~ ~
K→ S · L + ~2 (12.136)
~
1 2
J − L2 − S 2 + ~2
=
~
1 2 2 1 2
= J −L + ~ ,
~ 4
whereby we used in the last step that the spin of an electron is 1/2 ,
such that S 2 = 3/4. If we now use this operator to calculate, in the non-
relativistic limit, the expectation value of K for an eigenstate with quantum
numbers l, s, and j = l ± 1/2 we find
1 1 1
hψlsj |K|ψlsj i = ~ (l ± )(l + 1 ± ) − l(l + 1) + (12.137)
2 2 4
1 1
= ~ ±(l + ) + .
2 2
whereby κ < 0 when j > l and κ > 0 when j < l, i.e. κ = −1 corresponds
to the s1/2 state, κ = +1 corresponds to the p1/2 state, κ = −2 corresponds
to the p3/2 state, κ = +2 corresponds to the d3/2 state, etc.
Note that for the relativistic case L does not commute with the Hamil-
tonian. The operator K does commute with the Hamiltonian. We can
connect the non-relativistic limit to the relativistic case by changing the
396
12.8. Solutions for the hydrogen atom
speed of light c form infinite to its real value. If we do so the wave func-
tions will change in a continues fashion. The eigenvalues of operator K for
eigenstates of the Dirac equation, will not change when changing c form
infinite to its real value. Such that our discussion that the sign of κ is
different for the two solutions with the same expectation value of J 2 still
holds also for the relativistic solution.
We now have a unique set of quantum numbers to label the different
angular states. Next we want to determine the angular wave functions. For
this we will show that although the eigenstates of the Dirac Hamiltonian are
not eigenstates of the operator L2 , but the small and large part separately
are eigenstates of L2 , albeit with different eigenvalues.
For the operator L2 we can write
L2 = (J − S)2 (12.139)
2 2
= J + S − 2S · J ,
with K = α4 ~2 S · J − 14 ~2 we find
1
L2 = J 2 + S 2 − α4 ~K − ~2 (12.140)
2
2 1 2
= J − α4 ~K + ~ .
4
We take ψκ,jz (θ, φ) to be an eigenfunction of the Dirac Hamiltonian. We
find
1
L2 ψκ,jz (θ, φ) = j(j + 1)~2 + α4 ~2 κ + ~2 ψκ,jz (θ, φ). (12.141)
4
397
Chapter 12. Relativistic effects
and
12 0
α4 = . (12.144)
0 −12
j ∓ 21 j ∓ 21 + 1 ψκ,jz (θ, φ)
(A) (B)
Both ψκ,jz (θ, φ) and ψκ,jz (θ, φ) are eigenstates of L2 albeit with different
eigenvalues. We find
(
j − 12 κ < 0
lA = , (12.146)
j + 12 κ > 0
and
(
1
j+ 2 κ<0
lB = 1
. (12.147)
j− 2 κ>0
For κ = −1 we find that the large part of the wave function has the angular
dependence of a s1/2 state, the small part of the wave function has the
angular dependence of a p1/2 state. For κ = +1 this is reversed and the
large part of the wave function has the angular dependence of a p1/2 state,
the small part of the wave function has the angular dependence of a s1/2
398
12.8. Solutions for the hydrogen atom
state. For κ = −2 we find that the large part of the wave function has the
angular dependence of a p3/2 state, the small part of the wave function has
the angular dependence of a d3/2 state, etc.
In order to find explicit functions of the angular dependence we realize
that the angular dependence is given by the coupling of a state with l =
j ± 1/2 and s = 1/2 to a state with total angular momentum j. In general
we can express this coupling with the use of Clebsch-Gordan coefficients,
see section 7.5.7 and equation 7.148 for explicit values. In general we can
write
1/2
j,jz
X
|lsjjz i = Cl,lz =jz −sz ,s,sz
|l, jz − sz i |ssz i . (12.148)
sz =−1/2
399
Chapter 12. Relativistic effects
with l± = j ± 1/2 and s = 1/2. With the use of the notation jz± = jz ± 21
we find
r
l± + 12 ∓jz
∓ 2(l± + 12 ) Yl± ,jz− (θ, ϕ)
Y± = r . (12.154)
l± + 12 ±jz
Y + (θ, ϕ)
2(l + 1 ) l± ,jz
± 2
As the last step we return to the original Dirac equation. We have found
solutions for the angular dependence of the small and large part of the wave
function. Following the argumentation in the non-relativistic limit, the full
solution is a product of angular and radial solutions. We realize that the
radial dependence for spin up and down is the same. The radial dependence
of the large and small part can be very different. We will call the radial
dependence of the large part g(r) and of the small part if (r). The reason
for the phases of the radial functions will become clear once we find explicit
solutions. With these definitions the Eigenstates of the Dirac Hamiltonian
for a problem with spherical symmetry become
Yl− ,j,jz (θ, φ)g(r)
ψκ<0,jz (~r) = (12.155)
Yl+ ,j,jz (θ, φ)if (r),
Yl+ ,j,jz (θ, φ)g(r)
ψκ>0,jz (~r) = .
Yl− ,j,jz (θ, φ)if (r)
400
12.8. Solutions for the hydrogen atom
12.8.3 Operator αr
In order to solve the radial equations for the large and small part we return
to the equation 12.122 for the Dirac Hamiltonian in spherical coordinates.
We know that the eigenstates of this Hamiltonian are eigenstates of K with
eigenvalues −~κ and we replace the operator by its eigenvalues.
i
H D = cαr pr − ~ α4 κ + α4 mc2 + V (r). (12.157)
r
Our wave function factorizes into a radial and angular part. However the
Dirac Hamiltonian, although we replaced the operator K by its expectation
value still contains a part that depends on the angular coordinates. The
operator αr is defined as
1
αr = α~ · ~r (12.158)
r
1
0 r~σ· ~r
= 1 ,
σ · ~r
r~ 0
with
e−iϕ sin(θ)
1 cos(θ)
~σ · ~r = . (12.159)
r eiϕ sin(θ) − cos(θ)
The eigenvalues of αr2 are clearly 1. The eigenvalues of αr are thus either
+1 or −1. The value one finds depends on the phase choice of the spherical
401
Chapter 12. Relativistic effects
harmonics. For the choice we make we find the eigenvalue c = −1. The
proof of this involves some recouping of angular momenta and can be found
below.
Using the notation Y± = Yl=j± 1 ,j,jz (θ, φ) we find
2
1
Y± 0 r~σ · ~r Y±
αr = 1 · (12.161)
Y∓ σ · ~r
r~ 0 Y∓
Y±
=c ,
Y∓
with c = +1 or c = −1. It follows that
1
cY± = ~σ · ~rY∓ (12.162)
r
e−iϕ sin(θ)
cos(θ)
= iϕ Y∓
e sin(θ) − cos(θ)
r √
4π Y 1,0 (θ, ϕ) 2Y 1,−1 (θ, ϕ)
= √ Y∓
3 − 2Y1,1 (θ, ϕ) −Y1,0 (θ, ϕ)
We now insert the explicit angular dependence in terms of the spherical
harmonics for the spin up and down part of the wave function, whereby
we suppressed the notation of the arguments of the spherical harmonic
functions. We get
r
√ l∓ + 12 ±jz
± 2(l∓ + 12 ) Yl∓ ,jz − 12
r
4π Y1,0 2Y1,−1
cY± = √ · r (12.163)
3 − 2Y1,1 −Y1,0 l∓ + 1 ∓jz
2
Y
2(l∓ + 12 ) l∓ ,jz + 2
1
r r
l∓ + 12 ±jz l∓ + 12 ∓jz
r
4π ± 1 Y1,0 Yl∓ ,jz − 1 + Y Y
(l + 1 ) 1,−1 l∓ ,jz + 2
1
= r 2(l∓ + 2 ) 2
r ∓ 2
3 l∓ + 12 ±jz l∓ + 12 ∓jz
∓ (l + 1 ) Y1,1 Yl∓ ,jz − 1 − 2(l + ) 1 Y1,0 Yl∓ ,jz + 1 .
∓ 2 2 ∓ 2 2
402
12.8. Solutions for the hydrogen atom
and find
s r
l∓ −1 m l∓ 3
Y1,m1 Yl∓ ,m2 = − C1m 1 l∓ m2
Yl −1,m (12.166)
2l∓ − 1 4π ∓
s r
l∓ +1 m l∓ + 1 3
+ C1m 1 l∓ m2
Yl +1,m .
2l∓ + 3 4π ∓
(12.167)
403
Chapter 12. Relativistic effects
= Yl± ,m .
r
l± + 12 ∓jz
∓ 2(l± + 12 ) Yl± ,jz− (θ, ϕ)
cY± = − r . (12.169)
l± + 12 ±jz
Y + (θ, ϕ)
2(l + 1 ) l± ,jz
± 2
Comparing this to equation 12.154 we find that c = −1, i.e. the eigenvalue
of αr for all the eigenstates of K is −1.
404
Chapter 13
Response theory
405
Chapter 13. Response theory
back to the original ground state when the external perturbation is turned
off. At the same time, extremely large fields applied for very short times
can still be described by response theory.
We will start this chapter with a short general introduction of response
functions and their Volterra expansion. We will then focus on the calcu-
lation of the response functions from the microscopic description of the
system. We will first calculate the response of an electron bound to an
atom due to an oscillating electrical field (i.e. light), before we generalize
the response theory to any system and arbitrary operator representing an
observable. The specific example gives a very clear and intuitive feeling of
what response theory is about. We will do the example once on a classical
system, representing the bound electron by a charged particle on a spring
and once for the hydrogen atom described by the Schrödinger equation. We
can already guess what the possible responses are for such a system. We
assume a positive charge at position z = 0 and the electron at position z.
At rest the average position of the electron will be at z = 0. If we move
the electron away from z = 0 we have generated a dipole moment p = −ez.
In order for the electron to get to the position z, starting at the position
z = 0, it needs to move. The movement of the electron is related to a cur-
rent. In an oscillating electric field the induced current will also oscillate.
In order for the velocity to change we need to accelerate the electron. An
accelerating charge produces an electro magnet field, such that we get scat-
tering of the light. The response functions of relevance are related to the
position, velocity and acceleration of the electron, i.e. the induced dipole
moment, the induced current and the resulting scattering of light, due to
the external oscillating electrical field. The reason we will do this both for
a classical and quantum model is that the response of the system in both
descriptions is very similar, making a direct comparison useful.
Within response theory we want to calculate the response of a system
x(t) in time as a function of some external action on the system F (t). We
can think of these as the position x of a particle influenced by an external
force F . In general the position we find at time t depends on the position
at t → −∞ and the force applied at all times t0 < t. We can express x(t)
406
in terms of a Volterra series expansion
x(t) = χ(0) (t) (13.1)
Z t
+ χ(1) (t; t1 )F (t1 )dt1
−∞
Z t Z t
+ χ(2) (t; t1 , t2 )F (t1 )F (t2 )dt1 dt2
−∞ −∞
+ ...
∞ Z
X t Z t n
Y
(0) (n)
=χ (t) + ... χ (t; t1 , t2 , . . . , tn ) F (tj )dtj ,
n=1 −∞ −∞ j=1
whereby χ(n) is called the n-th order response function, or n-th order sus-
ceptibility. For most of our systems it does not matter if we do an experi-
ment today or tomorrow. The susceptibility therefore should only depend
on the difference between tj and t
χ(n) (t; t1 , t2 , . . . , tn ) = χ(n) (t − t1 , t − t2 , . . . , t − tn ). (13.2)
With τn = t − tn we find
∞ Z ∞
X Z ∞ n
Y
(0)
x(t) = χ (t) + ... χ(n) (τ1 , τ2 , . . . , τn ) F (t − τj )dτj . (13.3)
n=1 0 0 j=1
For a system that is time translational invariant we should find that en-
ergy is a conserved quantity. For most force fields we encounter in physics
the energy is related to a frequency and it thus makes sense to Fourier
transform equation 13.3 to the frequency domain. Even without the phys-
ical background of the Noether theorem that states that for each symme-
try operation there should be a conserved quantity one should see that
Fourier transforming equation 13.3 will simplify the math. Equation 13.3
describes a convolution of the susceptibility and the time dependent force.
The Fourier transform of a convolution yields a simple product. We define
the Fourier transforms:
Z ∞
x(ω) = x(t)eiωt dt (13.4)
−∞
407
Chapter 13. Response theory
Z ∞
1
x(t) = x(ω)e−iωt dω
2π −∞
Z ∞
F (ω) = F (t)eiωt dt
−∞
Z ∞
1
F (t) = F (ω)e−iωt dω,
2π −∞
and find
Z ∞
x(ω) = χ(0) (t)eiωt dt (13.5)
−∞
Z ∞X∞ Z ∞ Z ∞ n
Y
(n)
+ ... χ (τ1 , τ2 , . . . , τn ) F (t − τj )dτj eiωt dt
−∞ n=1 0 0 j=1
(0)
= χ (ω)
Z ∞Z ∞
+ θ(τ1 )χ(1) (τ1 )F (t − τ1 )eiω(t−τ1 ) eiωτ1 dτ1 d(t − τ1 )
−∞ −∞
+ ...,
whereby we introduced the the Heaviside step function θ(τj ) such that we
can take the integration boundaries from −∞ to ∞ for the integral over τ1 ,
used that eiω(t−τ1 ) eiωτ1 = eiωt , and shifted the integration boundaries for t
by τ1 . With
Z ∞
(1)
χ (ω) = θ(τ1 )χ(1) (τ1 )eiωτ1 dτ1 , (13.6)
−∞
we find
This chapter will deal with the question on how to calculate χ(n) for
various actuators and responses of different materials. We will start with
two model examples and then continue with a general treatment.
408
13.1. Classical example
FS = −kx. (13.8)
409
Chapter 13. Response theory
F0
x= cos(ωt) + A0 cos(ω0 t − φ0 ), (13.12)
mω 2 − mω02
with
r
k
ω0 = , (13.13)
m
the eigenfrequency of the harmonic oscillator. The term oscillating at fre-
quency ω0 are oscillations possible without a driving field. This term is
known as the transient solution and determined by the initial conditions.
The motion with frequency ω0 is not related to the response of the system
due to the external driving field. Without loss of generality for the deriva-
tion of the response function we can choose the initial settings such that
A0 = 0. With A0 = 0 the only movement of the particle is related to the
response of the system to the external driving force. The term that oscil-
lates with the frequency of the driving force is known as the steady-state
solution.
F0
A= (13.15)
mω02 − mω 2
410
13.1. Classical example
Amplitude of oscilation depending of driving frequency ω Phase of oscilation depending of driving frequency ω
1.0
6
]
ω20 m
F0
5 0.8
Phase ϕ [units of π]
Amplitude A [units of [
4 0.6
3
0.4
2
0.2
1
0 0.0
0.0 0.5 1.0 1.5 2.0 0.0 0.5 1.0 1.5 2.0
ω [units of ω0 ] ω [units of ω0 ]
Figure 13.2: The amplitude and phase of the response of the particle to an
oscillating driving force.
F0 1
= ,
m (ω0 − ω)(ω0 + ω)
and φ the phase of the oscillation with respect to the driving force
φ = π θ(ω − ω0 ), (13.16)
and θ(ω − ω0 ) the Heaviside step function that is zero for ω < ω0 and 1 for
ω > ω0 .
The amplitude and phase of the oscillation are shown in figure 13.2.
The amplitude is largest if the frequency of the driving force is equal to
the eigenfrequency of the harmonic oscillator (it diverges at this frequency)
1
and falls of as (ω0 −ω)(ω0 +ω)
away from the resonance. When |ω0 − ω| 1
we can take (ω0 + ω) to be roughly constant and the amplitude falls of like
1 1
ω . For the case where ω ω0 the amplitude falls of as ω 2 .
411
Chapter 13. Response theory
1
Epot = kx2 (13.18)
2
F2 ω02
= 0 cos2 (ωt).
2m (ω 2 − ω02 )2
Absorption of energy
412
13.1. Classical example
Half of the time the field adds energy to the oscillator, the other half of the
time the oscillator adds energy to the driving field. On average there is no
net absorption of energy by the particle in the driven harmonic oscillator.
413
Chapter 13. Response theory
q q 2
with ω0 = m k
and ω00 = |ω02 − γ2 |.
We find that the transient solution is exponentially damped by γ. We
furthermore find that when ω0 < γ/2 the oscillator is over-damped and
the transient solution behaves in total as an exponentially decay instead
of an oscillating function. The frequency with which the transient solution
oscillates is shifted from ω0 to ω00 . The movement of the particle due to
the driving field is at the same frequency of the driving force (ω), however
with a non-trivial phase shift.
Figure 13.3: The amplitude and phase of the response of the particle to an
oscillating driving force for the damped harmonic oscillator.
ω02 − ω 2
F0
A cos(φ) = (13.24)
m γ 2ω2 + ω2 − ω2 2
0
414
13.1. Classical example
F0 γω
A sin(φ) = . (13.25)
m γ 2ω2 + ω2 − ω2 2
0
415
Chapter 13. Response theory
Absorption of energy
We can calculate the absorption of energy by the oscillator from the time
averaged power exerted by the field. The power is given as the movement
against a force exerted by the field
dx
P =F (13.31)
dt
F02 1
= 2
2m (ω − ω0 )2 + γ 2 ω 2
2
416
13.1. Classical example
d2 z dz
m 2
= −kz − mγ + F0 e−iωt , (13.33)
dt dt
and x = Re(z). The gain in using complex variables is that the phase
shift is now much easier derived and allows us to write the response as a
multiplication of a response function times the driving force. Setting the
amplitude of the transient solution to zero we find
1
z= F0 e−iωt . (13.34)
m ω02 − ω 2 − iγω
417
Chapter 13. Response theory
418
13.1. Classical example
Susceptibility
We define the electric susceptibility for a single harmonic oscillator χs to
relate the induced electric dipole moment p to the applied external electric
field:
p ∝ χs (ω)E. (13.36)
The index s is there to remember that we calculate the susceptibility not for
a medium, but for a single oscillator. Instead of a relation between induced
dipole moment as a function of the electric field we can equally well define
the susceptibility as a relation between the particle position and the applied
force. The induced dipole moment is proportional to the position
p = −ez, (13.37)
F = −eE. (13.38)
419
Chapter 13. Response theory
We thus find that the susceptibility for a single harmonic oscillator also
relates the position of the particle to the applied force:
z ∝ χs (ω)F. (13.39)
Conductivity
The electric conductivity (σs ) for our single harmonic oscillator relates the
induced current to the external electrical field
j = σs (ω)E (13.41)
d
with j = −ev the current of the electron, v = dt x the velocity of the
electron and E the applied electric field. We can relate the current to the
time derivative of the induced dipole moment such that
d
j= p. (13.42)
dt
420
13.1. Classical example
d
σs (ω)E = p (13.43)
dt
d
∝ χs (ω) E
dt
∝ −iωχs (ω)E.
F v = Ej (13.45)
= Eσs (ω)E,
to find the total absorbed energy per unit time. The absorption coefficient is
defined as the absorbed energy per unit time divided by the energy density
in the electric field such that we find
!
~ ∗ · σs · E
E ~
α = Re . (13.46)
~ 2
|E|
421
Chapter 13. Response theory
the definition of our response functions, such that we need to take the
real part to obtain the observables. In some fields it is common to define
σs (ω) ∝ ωχs (ω), which changes the phase of the conductivity. In that case
we find that the absorption coefficient is given by the equation
!
~ ∗ · σs · E
E ~
α = −Im . (13.47)
~ 2
|E|
Scattering length
Photons can scatter of an electron. We define the scattering length to relate
the outgoing or scattered field to the applied field:
d2
Eout ∝ p(t). (13.50)
dt2
Using that p ∝ χs (ω)E(t, ω) we find that
d2
Eout ∝ χs (ω) E(t, ω) (13.51)
dt2
= −ω 2 χs (ω)E(t, ω).
422
13.2. Quantum example
ω2
1 1
= − .
2ω00 ω − ω00 + i γ2 ω + ω00 + i γ2
423
Chapter 13. Response theory
or in matrix form
ε1s 0
H0 = , (13.55)
0 ε2p
and
!
a1s
H0 = a†1s a†2p · H0 · , (13.56)
a2p
with ε1s = −1 Rydberg and ε2p = −1/4 Rydberg, as the eigenenergy for
the hydrogen atomic wave functions in the Schrödinger equation is equal
to −1/n2 Rydberg .
~ = E0 ẑ sin(ωt). Such that the Hamiltonian
The electric field is given as E
for the electromagnetic field is given as
with z the position operator. In matrix form we find for the position
operator
hφ1s |z|φ1s i hφ1s |z|φ2pz i
z= , (13.58)
hφ2pz |z|φ1s i hφ2pz |z|φ2pz i
and
!
a1s
z = a†1s a†2p · z · , (13.59)
a2p
We can evaluate the matrix elements with the use of the explicit form
of the hydrogen wave functions. With
424
13.2. Quantum example
3/2
−r/a0 1
R1s (r) = 2e ,
a0
5/2
1 1
R2p (r) = √ re−r/(2a0 ) ,
2 6 a0
we find for the matrix representing the position operator
0 1
z = z0 , (13.61)
1 0
with
128 √
z0 = 2a0 (13.62)
243
≈ 0.744a0 . (13.63)
Instead of z0 one often encounters the dipole moment between the 1s and
2p hydrogen wave function defined as µ = −ez0 .
Both the s and pz orbitals are centred around z = 0, which is reflected
by the zero values on the diagonal of the position operator. The eigenstates
of the position operator are
r
1
φ±z (~r) = (φ1s (~r) ± φ2pz (~r)) , (13.64)
2
with eigenvalues, i.e. the centre of the charge of the resulting orbitals, ±z0 .
On a basis of the 1s and 2pz orbital we can still describe the electric
field as a driving force that drives the electrons away from z = 0. As we
truncated the one particle basis we can no longer bring the electron to
±∞, but find a maximum distance of z0 that the electron can be moved
away from the origin. If this is a sufficient approximation depends on the
experiment under consideration.
In order to describe the full response of the system we should not look
at the eigenstates of the position operator only. For the state
425
Chapter 13. Response theory
0.5
0.4
0.3
ϕ 1 s , ϕ 2 pz
0.2
0.1
0.0
-0.1
-10 -5 0 5 10
z [a0 ]
φ(r) as a function of α
φ(r) as a function of β
Figure 13.4: The possible wave-functions on a basis of the 1s and 2pz hy-
drogen orbital. Top row from left to right, the 1s and 2pz wave-function
along the line x = y = 0, isosurface of the 1s and 2pz orbitals and isosur-
face of the 1s ± 2pz orbital. Middle rows isosurface of the wave function
cos(α)1s + sin(α)2pz shown in two different perspectives and the surface
defined by the equation r = | cos(α)Ys (θ, φ) + sin(α)Ypz (θ, φ)|. Bottom row,
surface defined by r = | cos(α)Ys (θ, φ) + eiβ sin(α)Ypz (θ, φ)| as a function of
β for α = π/2.
426
13.2. Quantum example
If we calculate the expectation value for the position operator for this gen-
eral state we find
hφα,β |z|φα,β i = z0 eiβ + e−iβ sin(α) cos(α) (13.68)
427
Chapter 13. Response theory
= z0 cos(β) sin(2α).
The phase of the pz orbital with respect to the s orbital allows one to
move the centre of the electron wave function in the z direction. In the
bottom row of figure 13.4 we show the wave function φα,β (~r) for α = π/2
as a function of β. The contour plotted is given by the equation r =
| cos(α)Ys (θ, φ) + eiβ sin(α)Ypz (θ, φ)| and the surface colour corresponds to
the phase of the wave function.
If we start at t = 0 in the state φ(t = 0, ~r) = φα,β (~r) without an external
electromagnetic field, then the state at finite time is given as
φ(t, ~r) = e−iε1s t/~ cos(α)φ1s (~r) + e−iε2p t/~ eiβ sin(α)φ2pz (~r). (13.69)
As εs 6= ε2p we find that the relative phase between the 1s and 2pz contri-
bution to the wave function changes as a function of time. Starting in a
state φα,β (~r) the electron thus oscillates around z = 0 with an amplitude
z0 sin(2α) and angular frequency ω0 = (ε2p − ε1s )/~.
In order for the electron to oscillate around z = 0 there must be a
restoring force that pulls the electron back to z = 0 if it is displaced.
We can calculate this force from the Hamiltonian H 0 . We can express
hφα,β |H 0 |φα,β i as a function of hφα,β |z|φα,β i. When doing so we should
realize that the amplitude of the oscillation is defined by α, the phase
within an oscillation by β. For a classical harmonic oscillator the kinetic
energy is zero when the particle is furthest away from the centre. In our
model this corresponds to β = 0. The expectation value of the energy and
position for β = 0 are given as
and
428
13.2. Quantum example
The first term describes the potential of an Harmonic oscillator, the higher
order terms are anharmonic corrections to the potential. We should expect
these anharmonic corrections as the electron in an hydrogen atom is trapped
in a bare potential that is of the form −1/r and not equal to r2 .
Once we have established the similarity (not the equivalence) between
the quantum and classical model we can have a look at the time dependent
solution of the quantum problem. For this we need to solve the time depen-
dent Schrödinger equation. The time dependent Hamiltonian on a basis of
the 1s and 2pz orbital is
!
a
H(t) = a†1s a†2p · H(t) · 1s , (13.75)
a2p
429
Chapter 13. Response theory
with α depending on the starting conditions. For the case with an external
field we will take the trail soltution
with |a(t)|2 + |b(t)|2 = 1. If we insert the trail solution into the Schrödinger
equation we find a set of two coupled equations
d
i~ a(t) = V sin(ωt)e−iω0 t b(t) (13.80)
dt
d
i~e−iω0 t b(t) − iω0 b(t) = V sin(ωt)a(t) + ~ω0 e−iω0 t b(t).
dt
These equations can either be solved numerically, or approximately analyt-
ical. With the use that sin(ωt) = 21 i e−iωt − eiωt we have
d 1
i~ a(t) = iV e−i(ω0 +ω)t − e−i(ω0 −ω)t b(t), (13.81)
dt 2
or
Z
1
a(t) = V e−i(ω0 +ω)t − e−i(ω0 −ω)t b(t)dt. (13.82)
2~
For ω ≈ ω0 the term with e−i(ω0 +ω)t oscillates much faster than the term
with e−i(ω0 −ω)t . As a result the integral over the first term will be much
430
13.2. Quantum example
We find that the phase between the 1s and 2pz contribution to ψ(t, ~r)
oscillates with frequency ω, i.e. the frequency of the driving field. This
will induce an oscillation of the electron and induced dipole moment with
the frequency of the driving force. We furthermore see that the coefficient
of the 2pz contribution is proportional to sin(Ωt). This will induce a time
dependence in the amplitude of the oscillations. We can quantify this by
looking at the time dependent expectation value of the position operator
z.
431
Chapter 13. Response theory
The electron oscillates with frequency ω and the amplitude is modified with
the frequency 2Ω.
We now want to focus on the linear response of the system. For this we
assume that V is small and make a series expansion of hφ(t)|z|φ(t)i. We
find
sin(ωt) − sin(ω0 t)
hφ(t)|z|φ(t)i = z0 V + O(V 2 ). (13.87)
~(ω − ω0 )
432
13.2. Quantum example
diverges for large ω. We know that for massive particles there should not
be a response at large frequencies. There is thus clearly something not fully
correct with the approximations we made to solve the quantum mechanical
model.
The problem is in the rotating wave approximation. We neglected the
term proportional to e−i(ω+ω0 )t and only kept the term e−i(ω−ω0 )t in the
differential equation. This approximation is not completely valid. In order
to correct this we can, based on the solution found in the rotating wave
approximation, guess a solution for the time dependent wave function. For
the linear response we only need a correct solution to linear order in V ,
which makes such a guess feasible.
If we look at φ(t) obtained from the rotating wave approximation to
linear order in V we find
ω+ω0 !
e−i 2 t sin( ω−ω
2
0
t)
φ(t) = 1, V . (13.91)
~(ω − ω0 )
This equation does not fulfill the original Schrödinger equation to linear
order in V . If we test this function as a solution of the Schrödinger equation
we find,
d 1 iωt
i~ φ(t) − H(t)φ(t) = 0, ie V , (13.92)
dt 2
433
Chapter 13. Response theory
is missing. The time dependent wave function in the rotating wave approx-
imation has a projection of 1 onto the 1s basis orbital and a projection that
is linear in V and depends on ω on the 2pz basis orbital. If we subtract
a second term to the projection on the 2pz orbital that is also linear in V
whereby we change ω to −ω we find the function
ω+ω0 −ω+ω0 !
e−i 2 sin( ω−ω
t
2 t)
0
e−i 2 t sin( −ω−ω
2
0
t)
φ(t) = 1, V −V (13.93)
~(ω − ω0 ) ~(−ω − ω0 )
V −iω0 t
= 1, −iωe + iω cos(ωt) + ω0 sin(ωt) ,
~(ω 2 − ω02 )
2ω0
χ∝− (13.94)
− ω02
ω2
1 1
∝− + .
ω − ω0 ω + ω0
434
13.3. General response derived in the frequency domain
with E0 = hφ0 |H 0 |φ0 i. We can now look at the response of this system to
a perturbation proportional to an operator that couples state 0 and 1
0 µ1
O= . (13.96)
µ∗1 0
with
Starting from this state we find, to linear order in λ, the time dependent
state to be
−iωe−iω1 t + iω cos(ωt) + ω1 sin(ωt)
|φ(t)i = |φ0 i + λµ∗1 2 2 |φ1 i + O(λ2 ).
~(ω − ω1 )
(13.100)
We note that the state is only normalized to linear order in λ. We find that
the occupation of the ground state |φ0 i is not altered in linear order in λ,
i.e. hφ0 |φ(t)i = 1. The linear response defines how one starts to deviate
from the state |φ0 i but the deviation always stays small. With the use of
|φ(t)i we can calculate the time dependent expectation value of operator
O. To linear order in λ we find:
2 1 1
hφ(t)|O|φ(t)i = λ sin(ωt)|µ1 | − , (13.101)
~(ω − ω1 ) ~(ω + ω1 )
such that the susceptibility becomes
hφ(t)|O|φ(t)i
χ(1) (ω) = (13.102)
λ sin(ωt)
435
Chapter 13. Response theory
2 1 1
= |µ1 | − .
~(ω − ω1 ) ~(ω + ω1 )
In order to generalize this result we can enlarge the Hilbert space from
2 to n + 1 states. We assume that |φ0 i does not couple to a single state,
but to n different states. On the eigenbasis of H 0 the Hamiltonian H 0 and
operator O can be written as
n
~ωi a†i ai ,
X
H 0 − E0 = (13.103)
i=1
and
n
µ∗i a†i a0 + µi a†0 ai .
X
O= (13.104)
1=1
with
−iωe−iωi t + iω cos(ωt) + ωi sin(ωt)
f (t; ω, ωi ) = . (13.106)
~(ω 2 − ωi2 )
436
13.3. General response derived in the frequency domain
hφ(t)|O|φ(t)i
χ(1) (ω) = (13.108)
E0 sin(ωt)
n
X
2 1 1
= |µi | − .
~(ω − ωi ) ~(ω + ωi )
i=1
Each state contributes separately to the response and thus the suscepti-
bility. On the eigenbasis of H 0 the response of the total system is given by
the sum of the response of the state |φ0 i coupling to each state separately.
This is a general feature of linear response theory in a quantum mechanical
system and extremely simplifies the treatment of response theory.
The previous derivation assumed the Hamiltonian to be on its eigen-
basis. In this case the matrix representation of H 0 is diagonal and the
response due to each state decouples from the response to all other states.
In a next step we can generalize the previous form of the susceptibility to
general Hamiltonians. We first note that
n
X
O |φ0 i = µ∗i |φi i . (13.109)
i=1
1 1
− , (13.111)
~(ω − ωi ) ~(ω + ωi )
437
Chapter 13. Response theory
and
(1) 1 1
χ (ω) = φ0 O H1 − H1 O φ0 . (13.114)
~ω − H 0 + E0 ~ω + H 0 − E0
438
13.4. General response derived in the time domain
439
Chapter 13. Response theory
∞ !
δt n
X 1
× −iH(t)
n! ~
n=0
∞ !
δt n
X 1
=T −i(H(t + δt)H(t))
n! ~
n=0
δt
= T e−i(H(t+δt)+H(t)) ~ ,
with T the time ordering operator that places operators acting on a later
time left of operators acting on an earlier time.
With the use of the time ordering operator we find for equation 13.120
P
−i N H(t0 ) δt0
|φ(t)i = lim T e j=1 ~ |φ0 i (13.123)
N →∞
i Rt 0 0
= T e− ~ 0 H(t )dt |φ0 i .
Even for time dependent Hamiltonians we can write the time evolution
operator as a single exponential, however we need to order different powers
in the Hamiltonian according to the time at which they act.
In order to solve for |φ(t)i we go back to equation 13.120 and expand
the exponential. We can stop at linear order in δt0 as we take the limit of
N → ∞ such that δt0 goes to zero. We find
N
Y i 0 0 02
|φ(t)i = lim 1 − H(t )δt + O(δt ) |φ0 i (13.124)
N →∞ ~
j=1
N
Y i 0 i 0 0
= lim 1 − H 0 δt − H1 (t )δt |φ0 i .
N →∞ ~ ~
j=1
N
Y i 0
|φ(t)i = lim 1 − H 0 δt |φ0 i (13.125)
N →∞ ~
j=1
440
13.4. General response derived in the time domain
N N
X Y i 0 i 0 0
+ lim 1 − H 0 δt − H 1 (nδt )δt
N →∞ ~ ~
n=1 j=n+1
n−1
Y !
i 0
× 1 − H 0 δt |φ0 i
~
j=1
+ ...
i
=e− ~ H 0 t |φ0 i
i t − i H0 (t−t1 )
Z
i
− e ~ H 1 (t1 )e− ~ H 0 t1 dt1 |φ0 i
~ 0
Z t Z t2
1 i i i
− 2 e− ~ H 0 (t−t2 ) H 1 (t2 )e− ~ H 0 (t2 −t1 ) H 1 (t1 )e− ~ H 0 t1 dt1 dt2 |φ0 i
~ 0 0
+ ....
441
Chapter 13. Response theory
The first term originates from the zero-th order expansion of the bra and
ket state. The second and third term consider either the bra or ket state
in first order in H 1 (t) and the other state in 0-th order. The expectation
value simplifies to
D i i
E
hφ(t)|O|φ(t)i = φ0 e ~ H 0 t Oe− ~ H 0 t φ0 (13.127)
Z tD
i i i i i
E
− φ0 e ~ H 0 t Oe− ~ H 0 t e ~ H 0 t1 H 1 (t1 )e− ~ H 0 t1 φ0 dt1
~ 0
i tD
Z E
i i i i
+ φ0 e ~ H 0 t1 H 1 (t1 )e− ~ H 0 t1 e ~ H 0 t Oe− ~ H 0 t φ0 dt1
~ 0
D i i
E
= φ0 e ~ H 0 t Oe− ~ H 0 t φ0
i t D h i H 0 t1
Z i E
i i i
+ φ0 e ~ H 1 (t1 )e− ~ H 0 t1 , e ~ H 0 t Oe− ~ H 0 t φ0 dt1 .
~ 0
This equation simplifies if we use the definition of the operator in the
interaction picture:
i i
O H (t) = e ~ H 0 t Oe− ~ H 0 t , (13.128)
for time dependent operators. With this notation we can write a compact
equation for the time dependent expectation value
i t
Z
hφ(t)|O|φ(t)i = hφ0 |O H (t)|φ0 i + hφ0 |[H 1 (t1 )H , O H (t), ]|φ0 i dt1
~ 0
(13.130)
We now can compare the result for the expectation value of the opera-
tor O with the definition of the response function as given by the Volterra
series in equation 13.1. The 0-th order term in the Volterra series of the sus-
ceptibility does not dependent on the time dependent perturbation H 1 (t)
442
13.4. General response derived in the time domain
and is given by
i t
Z
hφ(t)|O|φ(t)i = hφ0 |[H 1H (t1 ), O H (t)]|φ0 i f (t1 )dt1 . (13.133)
~ 0
443
Chapter 13. Response theory
444
13.5. Relation between frequency and time domain
This integral is zero for t < 0 as limR0 →∞ eR0 sin(Θ)t = 0 for t < 0. We thus
find that
For t > 0 we can look at the contour integral around the lower half
complex plane. Using the same argumentation as before we can conclude
that in the lower half complex plane the integral over the arc is zero for
t > 0 such that the contour integral is equal to the integral over the real
axis.
I
1
R±
G (t > 0) = e−iωt GR± (ω)dω (13.142)
2π lower half
−iωt 1
= −i res e .
ω ∓ H 0 ± E0 + i0+
The Green’s operator has poles in the lower half complex plane at ω =
±H 0 ∓ E0 with residue 1. We thus find
445
Chapter 13. Response theory
13.6 Dissipation
In equation 13.112 we derived the linear susceptibility for a general operator
O. We were able to express this as a ground-state expectation value. The
problem with this equation is that the operator from which we take the
expectation value is Hermitian. The resulting susceptibility is thus real. If
we then calculate the conductivity and from that the dissipation we find
that there is no absorption of energy in the system. This is indeed the
correct answer. When driven by an oscillatory force, a finite system might
absorb some energy during some time, but will restore the energy into the
driving field during some other time, such that there is no total dissipation.
Based on the classical result we can add dissipation to the system by
shifting the poles in the susceptibility from the real axis slightly into the
complex plain
(1) 1 1
χ (ω) = φ0 O − O φ0 .
~ω − H 0 + E0 + i γ2 ~ω + H 0 − E0 + i γ2
(13.146)
Although this works, this is not a completely satisfactory way to add dis-
sipation to our theory.
A finite size system described by quantum mechanics does not have
dissipation. We need an infinite system to get absorption. There are two
conceptual ways we can introduce an infinite basis. We either can assume
that the operator O does not couple state |φ0 i to a single state, but to
infinite many within a fixed energy interval, or we can assume that the
state |φ0 i couples to a single state O |φ0 i. This state O |φ0 i then couples
446
13.6. Dissipation
to infinite many states in a finite energy interval around the energy of the
state O |φ0 i. Both descriptions are mathematically equivalent and describe
the same process.
An example for the first method to include dissipation, would be a
model where we have an energy band of states that can be excited from
the ground-state. We can make a simple model assuming that we have N
states in the interval ωmin
p to ωmax , equidistantly spaced in energy with a
coupling strength µi = µ 1/N . We then take the limit of N to infinity to
generate an excitation into the continuum. The Hamiltonian for this model
is given as
N
i − 1/2
a†i ai .
X
H0 = ωmin + (ωmax − ωmin ) (13.147)
N
i=1
The coupling of the ground-state to the excited states by the time depen-
dent perturbation is given as H 1 (t) = H 1 (t) sin(ωt) with
N
µi a†i a0 + a†0 ai .
X
H1 = (13.148)
i=1
447
Chapter 13. Response theory
this model when the excitation energy is such that one can excite state |φ0 i
into the band of excited states. As a result this model shows dissipation
when the system is excited with light for which ω < ωmax and ω > ωmin .
Instead of looking at a single state that couples to a band of states we
can make a model where a single ground-state can be excited into a single
excited state. The excited state then couples to a continuum, which will
generate the dissipation. As an exemplary model for the second method to
induce dissipation we can look at the absorption of light in the hydrogen
atom. We will slightly simplify the coupling of light to our discrete states
as we do not need to calculate the exact form of the line broadening and
lifetime, but want to elude the general mechanism that induces dissipation
in quantum systems. As a model we look at the hydrogen atom where
we can excite the 1s state into the 2p state. If we include photons as
particles in our basis we reproduce the two level example discussed in the
previous section on a basis of the 1s orbital plus a photon and the 2p orbital
without a photon. These two states are coupled by the dipole moment of
the electron between the 1s and 2p orbital and the photon field at the
position of the atom. The resulting susceptibility is purely real with a pole
at ~ω ± ~ω0 with ~ω0 the energy difference between the 1s and 2p state
and ~ω the energy of the photon present in combination with the electron
in the 1s orbital. We can now extend the basis set with the infinite number
of states given by the electron in the 1s orbital and a photon with energy
~ωi . To be precise one should quantize the photons by their wave vector as
there are many states with the same energy possible, for the current model
quantizing by the energy is good enough. We will assume that the coupling
strength between the√2p state and the 1s state plus a photon with energy
~ωi is equal to µ(ωi ) δωi . The infinite set of states with the electron in
the 1s orbital and one additional photon now enter as an extension of the
Hilbert space only coupling the the state with the electron in the 2p orbital
and thus as a self energy in the susceptibility. We find
(1) 2 1 1
χ (ω) = µ − , (13.150)
ω − ω0 − Σ+ (ω) ω + ω0 + Σ− (ω)
448
13.7. Kramers Kronig relations
with
Z ∞
1
Σ± (ω) = ± lim µ2 (ωi ) dωi . (13.151)
γ→0+ 0 ω ∓ ωi + i γ2
449
Chapter 13. Response theory
that a contour integral around the upper half complex plain of χ(ω), σ(ω)
and f (ω) yields zero.
I Z +∞ Z π
χ(ω)dω = χ(ω)dω + lim χ(reiφ )rdφ (13.152)
γ −∞ r→∞ 0
= 0.
to χ(ω) and one pole at the real axis at ω = ω 0 . If we now use the Cauchy’s
residue theorem we find
Z ∞
χ(ω 0 ) χ(ω 0 )
I
0 0
0
dω = P 0
dω (13.154)
γ ω −ω −∞ ω − ω
= iπ res(χ(ω 0 ), ω)
= iπ χ(ω),
with P the principle value integral. This yields the Kramers Kronig rela-
tions
Z ∞
χ(ω 0 )
0
P 0
dω = iπχ(ω), (13.155)
−∞ ω − ω
or equivalent
Z ∞
Re(χ(ω 0 )) 0
P dω = −πIm(χ(ω)), (13.156)
ω0 − ω
Z−∞∞
Im(χ(ω 0 )) 0
P 0
dω = πRe(χ(ω)).
−∞ ω − ω
450
Appendix A
The Dirac delta function, δ(~r) is an object that is extremely useful for many
parts of physics. The function is defined on Rd with d ∈ N, and is zero for
all values, except for |~r| = 0. We furthermore define the function to be
normalized such that
Z
δ(~r)dd~r = 1. (A.1)
Rd
There is one problem. There is no function that can fulfill these prop-
erties. The integral over a function that is zero on its domain, except for a
singular point, is zero. As a solution we will define the Dirac delta function
as a special limiting case of a series of functions.
We can define a function δ(, ~r) on ~r ∈ Rd , ∈ R, > 0, which is con-
tinues on some finite compact domain around ~r = 0 and has the following
properties
δ(, ~r) ≥ 0, and (A.2)
n
lim |~r| δ(, ~r) = 0, ∀n ∈ N, and
|~
r|→∞
451
Appendix A. The Dirac delta function (distribution)
Z
lim δ(, ~r)dd~r = 1.
→0+ Rd
There are many functions δ(, ~r) that fulfil these properties. We can
take for example a rectangle distribution
( d
1
|~ri | ≤ /2 ∀i ∈ [1, d]
δ(, ~r) = , (A.3)
0 0
but that
Z
lim δ(, ~r)dd~r = 0. (A.6)
+
Rd →0
although one can find the later statement often in the literature. The
limit should be understood to be taken after one integrates over the delta
function. The integral Rd δ(~r)dd~r is defined as
R
Z Z
δ(~r)dd~r = lim δ(, ~r)dd~r, (A.8)
Rd →0+ Rd
for any of the generating functions one chooses. The final result does not
depend on the choice of the generating function used, which is meant with
the statement δ(~r) = lim→0+ δ(, ~r).
452
A.1. Properties of the Dirac delta function
A.1.1 Convolution
As δ(~r − ~r0 ) = 0 for all ~r 6= ~r0 it follows that for all continues functions f (~r)
for which the integral over δ(, ~r)f (~r) is convergent we have
Z
δ(~r0 − ~r)f (~r0 )dd~r0 = f (~r). (A.9)
Rd
A.1.2 Scaling
If we scale the delta function we find, for c ∈ R
d
1
δ(c~r) = δ(~r), (A.10)
|c|
in particular
A.1.3 Derivative
d n
The n-th derivative of the delta function dx (n) (x) is well defined
n δ(x) = δ
453
Appendix A. The Dirac delta function (distribution)
This is true for all functions f (x) and thus not only the integrals, but also
the integrants must be equivalent
454
Appendix B
Gaussian Integral
In several of the worked out problems and examples we encounter the fol-
lowing integral
Z ∞ r
ax2 +bx+c b2
c− 4a π
e dx = e , (B.1)
−∞ −a
for a, b, and c some complex numbers with Re(a) < 0. In this appendix we
will proof this relation.
We start by rewriting the exponent
Z ∞ Z ∞
b2 b 2
ax2 +bx+c c− 4a
e dx = e ea(x+ 2a ) dx (B.2)
−∞ −∞
b
2
b
Z ∞+ 2a
c− 4a 2
=e eau du,
b
−∞+ 2a
b b
with u = x + 2a . When 2a is real the change in integration limits does
b
not matter, while limx→∞ x = limx→∞ x + a for a ∈ R. However if 2a
is complex the path of integration in the complex plain actually becomes
different. As such we can not neglect the change in integration boundaries
2
without further commenting. Luckily we know that the integrant, eau has
no poles in the complex plain. The integral over a closed contour thus
455
Appendix B. Gaussian Integral
b b
vanishes. If we take the contour S : limR→∞ (−R, R, R + 2a , −R + 2a , −R)
then we find
I
2
eau du = 0 (B.3)
S
= 0,
while the integrand becomes zero along this path. We thus find that
b
Z ∞+ 2a Z ∞
2 2
eau du = eau du. (B.5)
b
−∞+ 2a −∞
456
∞
1 2
= π ear .
a 0
for a, b, and c some complex numbers if Re(a) < 0, which is the equation
we set out to derive.
457
Appendix B. Gaussian Integral
458
Appendix C
Fourier Transform
The Fourier transform between time (t) and energy (E) or angular fre-
quency (ω = E/~) coordinates is defined as
r Z ∞
1
g(ω) = f(t)eiωt dt. (C.2)
2π −∞
r Z ∞
1
f(t) = g(ω)e−iωt dω.
2π −∞
The sign definitions are chosen such that for the description of a plane
wave with wave vector k0 and angular frequency ω0 we can take g(ω) =
δ(ω − ω0 ), and g(k) = δ(k − k0 ), with δ the Dirac delta function (Appendix
A), such that the functions in frequency and wave vector domain peak at
459
Appendix C. Fourier Transform
the values of the plane wave they describe. The functions in the position
and time domain become
r Z ∞ ! Z
∞
1 ikx 1 −ikx
f (x)f(t) = g(k)e dk g(ω)e dω (C.3)
2π −∞ 2π −∞
r Z ∞ ! r Z !
∞
1 ikx 1 −iωt
= δ(k − k0 )e dk δ(ω − ω0 )e dω
2π −∞ 2π −∞
1 ikx −iωt
= e e
2π
1 i(kx−ωt)
= e .
2π
Which has the correct behaviour for a plane wave propagating in the posi-
tive x direction in time for positive wave vectors.
∞
r Z
1
f (x) = g(k)eikx dk (C.4)
2π −∞
!
∞ ∞
r Z r Z
1 1 0 −ikx0 0
= f (x )e dx eikx dk
2π −∞ 2π −∞
Z ∞ Z ∞
1 0
= f (x0 )eik(x−x ) dx0 dk.
2π −∞ −∞
460
C.2. Properties of the Fourier transform
With y = x − x0 we find
Z ∞ !
∞
r Z r
1 1 iky
f (x) = f (x − y) e dk dy. (C.5)
−∞ 2π −∞ 2π
and
∞
r Z
1
g2 (k) = f2 (x)e−ikx dx, (C.8)
2π −∞
we find
∞
r Z
1
g1 (k) + g2 (k) = (f1 (x) + f2 (x)) e−ikx dx. (C.9)
2π −∞
461
Appendix C. Fourier Transform
C.2.2 Derivative
For all n-fold differentiable functions f (x) that are integrable and fulfill
dm
that limx→±∞ dx m f (x) = 0 for all m ∈ N and m ≤ n we find that
Z ∞ n Z ∞
1 d −ikx n 1
√ f (x) e dx = (ik) √ f (x)e−ikx dx. (C.10)
2π −∞ dxn 2π −∞
We can proof this using integration by parts
Z ∞ n n−1 ∞
1 d −ikx 1 d −ikx
√ f (x) e dx = √ f (x) e (C.11)
2π −∞ dxn 2π dxn−1 −∞
Z ∞ n−1
1 d d −ikx
−√ n−1
f (x) e dx
2π −∞ dx dx
Z ∞
dn−1
1
= (ik) √ n−1
f (x) e−ikx dx.
2π −∞ dx
d m
Whereby we used that limx→±∞ dx m f (x) = 0 for all m ∈ N and m ≤ n.
C.2.3 Convolution
The Fourier transform of the convolution of two functions is the product
of the Fourier transform of each of these functions. Given two integrable
functions f1 (x) and f2 (x) their convolution f (x) is given as
Z ∞
f (x) = f1 (x − x0 )f2 (x0 )dx0 . (C.12)
−∞
462
C.3. The Fourier transform of some useful functions
Next we Fourier transform both the left and√right hand side. Whereby we
realize that the right hand side is currently 2π times the inverse Fourier
transform of g1 (k)g2 (k). We find
1
Z ∞ √
√ f (x)e−ikx dx = 2πg1 (k)g2 (k), (C.17)
2π −∞
which is the relation we wanted to proof.
463
Appendix C. Fourier Transform
r
1
= .
2π
and
464
C.3. The Fourier transform of some useful functions
465
Appendix C. Fourier Transform
We now use the properties of the delta function and its derivative, in par-
ticular equation A.15 and A.9 to find.
Z ∞ n
n n n d ikx
x = i (−1) n
e δ(k)dk (C.27)
−∞ dk
Z ∞
n n n
= i (−1) (ix) δ(k)eikx dk
−∞
n
=x .
466
Appendix D
Cauchy-Schwarz inequality
For two square integrable functions f (x) and g(x) we find the following
Cauchy-Schwarz inequality
Z ∞ Z ∞ Z ∞ 2
f ∗ (x)f (x)dx g ∗ (x)g(x)dx ≥ f ∗ (x)g(x)dx . (D.1)
−∞ −∞ −∞
467
Appendix D. Cauchy-Schwarz inequality
R∞ 2
∗
−∞ g (x)f (x)dx
Z ∞
= h∗ (x)h(x)dx + R∞
∗
−∞ −∞ g (x)g(x)dx
R∞ 2
∗
−∞ g (x)f (x)dx
≥ R∞
∗
,
−∞ g (x)g(x)dx
468
Appendix E
Fundamental Theorem of
Algebra
469
Appendix E. Fundamental Theorem of Algebra
470
Appendix F
471
Appendix F. The eigensystem of a Matrix
472
Appendix G
Euler angles
473
Appendix G. Euler angles
474
Bibliography
475
Bibliography
476
Bibliography
[24] J. Jeans, “XI. On the partition of energy between matter and aether,”
The London, Edinburgh, and Dublin Philosophical Magazine and Jour-
nal of Science, vol. 10, p. 91, 1905.
[31] A. Einstein, “Über einen die Erzeugung und Verwandlung des Lichtes
betreffenden heuristischen Gesichtspunkt,” Annalen der Physik,
vol. 322, p. 132, 1905.
477
Bibliography
[34] H. Herz, “Ueber einen Einfluss des ultravioletten Lichtes auf die elec-
trische Entladung,” Annalen der Physik, vol. 267, p. 983, 1887.
[37] A. Einstein, “Die Plancksche Theorie der Strahlung und die Theorie
der spezifischen Wärme,” Annalen der Physik, vol. 327, p. 180, 1907.
478
Bibliography
[46] P. Zeeman, “On the influence of magnetism on the nature of the light
emitted by a substance,” Philosophical Magazine. 5-th series, vol. 43,
p. 262, 1897.
[53] G. D. Liveing and J. Dewar, “On the Spectra of Sodium and Potas-
sium,” Proceedings of the Royal Society of London, vol. 29, p. 398,
1879.
479
Bibliography
480
Bibliography
481
Bibliography
482
The rate at which the particle distribution's width spreads is inversely proportional to the square of the initial width, \( σ^2 \). Thus, the more localized the initial wave function, the faster the distribution spreads .
The Schrödinger equation, even with relativistic corrections, struggles to describe systems like the helium atom accurately due to ignoring the Pauli exclusion principle, leading to incorrect predictions of multiple degenerate ground states not observed experimentally .
Interference patterns form when a Gaussian wave packet broadens to a width similar to the size of the box, indicating that the packet propagates as a free wave packet and bounces off the walls of the box. These patterns lead to intricate structures due to reflection and interference .
Goudsmit and Uhlenbeck introduced electron spin to resolve discrepancies in atomic models where the observed number of stationary electron states was twice the number given by the Schrödinger theory, addressing phenomena like the Zeeman effect, and fitting electron behavior into quantum mechanics .
Unitary transformations between basis states, such as \(|lm⟩\ and \(|l1m1l2m2⟩\), are significant as they provide a method to shift between equivalent basis sets in Hilbert spaces, ensuring that the physical interpretation remains consistent across different mathematical representations .
Coherent states in a harmonic oscillator behave like classical particles, oscillating without deformation of the wave packet density. This behavior is due to the equidistant energy levels in the harmonic potential, allowing a linear superposition of stationary states that maintains its shape over time .
Clebsch-Gordan coefficients describe how states \(|l1m1⟩\ and \(|l2m2⟩\) couple to form the state \(|lm⟩\), which is an eigenstate of the total angular momentum operators \(L^2\) and \(L_z\). They facilitate the transformation between different basis representations in quantum mechanics .
The Pauli exclusion principle, which prevents identical fermions from occupying the same quantum state, is justified as without it, multiple quantum states would imply incorrect predictions such as a higher heat capacity for helium gas, contradicting experimental observations .
The challenge in deriving the Dirac equation includes ensuring Lorentz invariance and correctly describing electron spin since the Schrödinger equation was neither linear in time nor accounted for electron spin .
The general linear response derived in the frequency domain implies that each state in a quantum mechanical system independently contributes to the total susceptibility, greatly simplifying the treatment of systems under perturbation by enabling separate consideration of state pair interactions .