Relativistic Quantum Mechanics Overview
Relativistic Quantum Mechanics Overview
Mechanics
Written by two of the most prominent leaders in particle physics, Relativistic Quantum Mechan-
ics: An Introduction to Relativistic Quantum Fields provides a classroom-tested introduction to
the formal and conceptual foundations of quantum field theory. Designed for advanced under-
graduate- and graduate-level physics students, the text only requires previous courses in classical
mechanics, relativity and quantum mechanics.
The introductory chapters of the book summarise the theory of special relativity and its applica-
tion to the classical description of the motion of a free particle and a field. The authors then explain
the quantum formulation of field theory through the simple example of a scalar field described
by the Klein–Gordon equation as well as its extension to the case of spin ½ particles described by
the Dirac equation. They also present the elements necessary for constructing the foundational
theories of the standard model of electroweak interactions, namely quantum electrodynamics
and the Fermi theory of neutron beta decay. Many applications to quantum electrodynamics and
weak interaction processes are thoroughly analysed. The book also explores the timely topic of
neutrino oscillations.
Logically progressing from the fundamentals to recent discoveries, this textbook provides stu-
dents with the essential foundation to study more advanced theoretical physics and elementary
particle physics. It will help them understand the theory of electroweak interactions and gauge
theories.
View the second and third books in this collection: Electroweak Interactions and An Introduction
to Gauge Theories.
Key Features of the New Edition:
Besides a general revision of text and formulae, three new chapters have been added.
• Chapter 17 introduces and discusses double beta decay processes with and without
neutrino emission, the latter being the only process able to determine the Dirac or
Majorana nature of the neutrino (discussed in Chapter 13). A discussion of the limits
to the Majorana neutrino mass obtained recently in several underground laboratories
is included.
• Chapter 18 illustrates the calculation of the mass spectrum of “quarkonia” (mesons
composed by a pair of heavy, charm or beauty quarks), in analogy with the positronium
spectrum discussed in Chapter 12. This calculation has put into evidence the existence
of “unexpected” states and has led to the new field of “exotic hadrons”, presently under
active theoretical and experimental scrutiny.
• Chapter 19 illustrates the Born-Oppenheimer approximation, extensively used in the
computation of simple molecules, and its application to the physics of exotic hadrons
containing a pair of heavy quarks, with application to the recently observed doubly
charmed baryons.
Relativistic Quantum
Mechanics
An Introduction to Relativistic
Quantum Fields
Second Edition
Any third party material in this book is not included in the OA Creative Commons license, unless indicated
otherwise in a credit line to the material. Please direct any permissions enquiries to the original rightsholder.
Reasonable efforts have been made to publish reliable data and information, but the author and publisher cannot
assume responsibility for the validity of all materials or the consequences of their use. The authors and publish-
ers have attempted to trace the copyright holders of all material reproduced in this publication and apologize to
copyright holders if permission to publish in this form has not been obtained. If any copyright material has not
been acknowledged please write and let us know so we may rectify in any future reprint.
The Open Access version of this book, available at [Link], has been made available under a Cre-
ative Commons Attribution-NonCommercial (CC-BY-NC) 4.0 International license.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted,
or utilized in any form by any electronic, mechanical, or other means, now known or hereafter invented, includ-
ing photocopying, microfi lming, and recording, or in any information storage or retrieval system, without writ-
ten permission from the publishers.
For permission to photocopy or use material electronically from this work, access [Link] or contact
the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923, 978-750-8400. For works
that are not available on CCC please contact mpkbookspermissions@[Link]
Trademark notice: Product or corporate names may be trademarks or registered trademarks and are used only for
identification and explanation without intent to infringe.
DOI: 10.1201/9781003436263
Typeset in CMR10
by KnowledgeWorks Global Ltd.
Publisher’s note: Th is book has been prepared from camera-ready copy provided by the authors.
Contents
Preface xi
The authors xiii
v
vi Contents
Bibliography 331
Index 339
Preface
xi
xii Preface
Omar Benhar, born in 1953, is an INFN research director and teaches gauge
theories at the University of Rome, “La Sapienza”. He has worked extensively
in the USA as a visiting professor, at the University of Illinois and the Old
Dominion University, as well as an associate scientist at the Thomas Jefferson
National Accelerator Facility. Since 2013, he has served as an adjunct profes-
sor at the Centre for Neutrino Physics of Virginia Polytechnic Institute and
State University. He is the author of more than a hundred scientific papers
on the theory of many-particle systems, the structure of compact stars and
electroweak interactions of nuclei.
xiii
CHAPTER 1
THE SYMMETRIES OF
SPACE-TIME
• our house (over intervals which are short compared to the Earth’s period
of rotation),
• the galaxy. . .
DOI: 10.1201/9781003436263-1 1
This chapter has been made available under a CC BY NC license.
2 Relativistic Quantum Mechanics
Let us consider two events which differ by Δx and Δt, which we suppose to
be infinitesimal. We define the squared invariant length of the interval (Δt, Δx)
by the quantity1 :
If |Δx| = c|Δt|, the two events are connected through the propagation of a
light ray which leaves from the first event and arrives in exact coincidence with
the second. Clearly this coincidence must occur in all reference frames and,
given the invariance of the speed of light, this implies that the difference should
1 Δs is often referred to as the line element.
The Symmetries of Space-Time 3
ζ 2 − δ 2 = α 2 − 2 = 1
ζ − αδ = 0. (1.9)
α = ζ = cosh(θ)
δ = = sinh(θ), (1.10)
Comments.
• From (1.11) we see that the speed of a physical system cannot exceed c.
• Newton hypothesised a universal time, which flows equally in every
frame of reference. If, instead of using principle (3), we set t = t, the
special Lorentz transformations reduce to Galilean transformations:
x = x − v · t
t = t
P : x = −x; t = t. (1.14)
The proper transformations are, for their part, constituted of two discon-
nected components, which are distinguished according to whether or not they
invert the direction of time. Events with coordinates (t, 0) in O describe, as a
function of t, the history of a stationary clock at the origin of O. If we apply
the relation (1.1), we find:
t = Λ00 · t,
therefore, the sign of Λ00 determines whether the clocks of O run in the same
direction as in O. The transformations characterised by:
T : x = x; t = −t. (1.16)
Combining both P and T transformations, total inversion, I, is obtained,
which is a non-orthochronous but also proper transformation:
The laws of physics are invariant under proper and orthochronous Lorentz
transformations. The corresponding group is usually denoted by L↑+ .
6 Relativistic Quantum Mechanics
• Events with s = 0. They are found on the surfaces of two cones (light
cones), which represent the trajectories of light rays emerging from A
(future cone) or converging on A (past cone). They are known as lightlike.
• Events with s > 0, t > 0. They fill the interior of the future light cone;
they are events which can be influenced by event A, since they can be
reached by physically realisable signals originating from A which travel
at speeds less than c.
• Events with s > 0, t < 0. They fill the interior of the past light cone;
they are events which can influence event A, by means of physically
realisable signals which travel at speeds less than c. These events and
those of the preceding paragraph are known as timelike (see Problem 1).
• Events with s < 0. These are outside the light cones; they are events
which cannot have any causal relationship with A, because signals with
speeds exceeding c would be required. The events in this region form the
absolute present of A. They are known as spacelike events (see Problem
2).
(x · y) = gμν xμ y ν = xμ yμ = y μ xμ
(x · y ) = (x · y) . (1.18)
f = f (y),
f (αy + βz) = αf (y) + βf (z)
8 Relativistic Quantum Mechanics
with α and β numbers (in our case, real numbers). It is easy to show that
every element f of V can be written as:
f (y) = fμ y μ = fμ y μ = (f · y), (1.24)
μ
f (y ) = f (y), if y = Λy .
Using the relation (A.5) it is immediately seen that the preceding equation
requires two transformation rules:
such that:
x = x − v · t
t = t .
R · RT = 1
The Symmetries of Space-Time 9
Sect. 1.2
1. Show that if an event has s > 0 it is possible to find an inertial frame
in which xμ = (t, 0). (Consequently, these events are called timelike).
2. Show that if an event has s < 0 it is possible to find an inertial frame
in which xμ = (0, x). (Consequently, these events are called spacelike).
3. Show that if an event has s = 0 it is possible to find an inertial frame
in which xμ = (|x|, x) and x aligned along the z axis. (These events are
called lightlike).
CHAPTER 2
d xμ (τ ) dxμ (t)
uμ = =γ , (2.3)
dτ dt
10 DOI: 10.1201/9781003436263-2
This chapter has been made available under a CC BY NC license.
The Classical Free Particle 11
pμ = muμ (2.6)
from which:
1 p2
p0 = (mc)2 + (p)2 (mc2 + ) (2.9)
c 2m
where the final step is valid for low velocity. The temporal component of pμ ,
in this limit, is equal to the kinetic energy divided by c plus a constant related
to the rest mass. In classical mechanics, the energy is always defined to within
a constant, so we can identify p0 with the energy of the particle divided by c:
pμ = ( , p). (2.10)
c
Thus we arrive at the important conclusion that in a relativistic formula-
tion momentum and energy are parts of a single physical entity, the energy-
momentum 4-vector (or 4-momentum) pμ .
The rules determining the components of pμ after transformation of the
coordinate system follow immediately from the nature of the 4-vector pμ . In
the specific case of a special Lorentz transformation along the x-axis, we have
(the quantities with accents refer to the IF O , those without an accent to O):
p1 = γ(p1 − β ) (2.11)
c
= γ(−βp1 + )
c c
p2 = p2 ; p3 = p3 .
12 Relativistic Quantum Mechanics
|p| = p0 ; (2.14)
β = 1,
the particle travels at the speed of light in any IF, for any state of motion.
Obviously photons, the quanta of light, have this property; the photon is a
particle with zero rest mass.
Despite the fact that (2.7) varies smoothly as m → 0, particles with mass
m > 0 are intrinsically different from those of zero mass, however, small the
value of m; the limit m → 0 is intrinsically discontinuous.
The simplest way to obtain this result is by verifying that the symmetry
group for the momentum (the little group introduced by Wigner) is different
in the two cases.
One sees therefore that when letting m → 0 the little group changes discontin-
uously from O(3) to O(2). In quantum mechanics, the little group determines
the value of the spin of the particle. For a particle with mass, the spin co-
incides with the angular momentum at rest and the states therefore form a
representation of the group of 3-dimensional rotations; a particle of spin S
possesses 2S + 1 spin states.
Conversely, a particle of zero mass can never be at rest, and its intrinsic
angular momentum is defined by rotations around p, the elements of O(2),
which allow one-dimensional representations corresponding to Sz = 12 n for
a given integer n. If we include parity, P , as a good symmetry, two states
should exist with Sz = ± 12 n. For example, the photon has two spin states,
corresponding to Sz = ±1 and not three, as we would have expected for a
particle of spin 1.
For a free particle, the only invariant non-trivially constant is the proper
time, dτ and therefore it must be true that:
v 2 (t)
L(x(t), v(t)) dt = −α dτ = −α 1− dt (2.18)
c2
with α a constant. The value of α is determined by the non-relativistic limit,
in which L should tend to the kinetic energy of the particle, 12 mv 2 , to within
an irrelevant additive constant. For small values of v(t) we find:
αv 2
L → −α + ; (2.19)
2c 2
→ α = mc2
v(t)2
L = −mc2 1− . (2.20)
c2
The results in (2.21) confirm the definition of the 4-momentum given in (2.6).
The Lagrange equations which follow from (2.20) and (2.17) are:
d i
p = 0; (i = 1, 2, 3) (2.22)
dt
Furthermore, from condition (2.7) we find:
p0 dp0 − p · dp = 0 (2.23)
and therefore:
d 0
p = 0. (2.24)
dt
Finally, we have obtained the covariant equations of motion:
d μ d μ
p =0= p (2.25)
dt dτ
which express the conservation of momentum and energy.
The Classical Free Particle 15
The total energy, E0 , in this frame of reference defines the rest mass of the
system
E0 = M0 c2 (2.29)
while the energy in another IF, E0 , is given by an analogous formula to (2.12):
We now look more closely at M0 . In the limit in which the gas particles
are non-relativistic, we have:
1 T
M0 = i mi + 2 (2.31)
c2 i c
p + p → 2 H + e+ + ν (2.34)
2 3
H + p → He + γ (2.35)
3 3 4
He + He → He + p + p. (2.36)
The positrons annihilate with the electrons of the medium releasing energy.
Therefore the total thermal energy released by each 4-proton reaction is:
< Eν > is the average energy carried away by neutrinos, which leave the
Sun undisturbed. The neutrino energy is continuously distributed between
zero (assuming negligible neutrino mass) and the Q value in the formation of
deuterium:
Sect. 2.4
1. Knowing that: (i) the Sun contains around N = 1056 protons in the core,
i.e. available for fusion; (ii) the solar constant (flux of solar energy onto
the Earth) is K0 = 3.3 · 10−2 cal cm−2 s−1 ; (iii) the Earth–Sun distance
is 8 light-minutes; (iv) the energy carried by neutrinos is negligible,
estimate the lifetime of the Sun from (2.37) and (2.38).
CHAPTER 3
THE LAGRANGIAN
THEORY OF FIELDS
where we have denoted the derivatives of the field with respect to the coordi-
nates as φμ :
∂φ
φμ (x) = .
∂xμ
18 DOI: 10.1201/9781003436263-3
This chapter has been made available under a CC BY NC license.
The Lagrangian Theory of Fields 19
The function L is given the name Lagrangian density or, more simply for
brevity, the Lagrangian and depends on the fields (the dynamic variables) and
their derivatives. The time derivatives are the generalisation of velocity, while
the dependence of the Lagrangian on spatial derivatives allows to couple the
degrees of freedom among nearby points in space.
We have considered a possible explicit dependence of the Lagrangian on
the space-time coordinates, to allow for the effect of possible agents external to
the system of fields. For an isolated system, this dependence cannot be present
and the Lagrangian depends on the coordinates only through the fields and
their derivatives.
In terms of L:
S= d4 x L(φ, φμ , x) (3.4)
V4
where V4 is the region of space-time limited by the hypersurfaces Γ1 : t = t1
and Γ2 : t = t2 .
We assume the values of the fields are fixed on Γ1,2 . The principle of the
least action states that:
• the evolution of the field between these values is given by the function
φ(x) = φ̄(x, t) which minimises S, with boundary conditions fixed on
Γ1,2 .
We note that the Lagrangian density is not unique. Because the principle
of least action stipulates that the fields have defined values on the boundaries
of V4 , we can add to the Lagrangian the divergence of any 4-vector without
changing the minimum of the action or, hence, the equations of motion.
To derive the differential equations which determine the evolution of the
field, we set:
φ(x) = φ̄(x) + δφ(x); δφ(x, t1 ) = δφ(x, t2 ) = 0. (3.5)
The condition of least action becomes the equation:
∂L ∂L
δS = 0 = d4 x [ δφ + δ(∂μ )φ] (3.6)
∂φ ∂∂μ φ
∂L ∂L
= d4 x [ δφ + ∂μ δφ]
∂φ ∂∂μ φ
∂L ∂L ∂L
= d4 x [ − ∂μ ( )]δφ + d4 x ∂μ ( δφ)
∂φ ∂∂μ φ ∂∂μ φ
because δ(∂μ )φ = ∂μ δφ.
Given that the field variations vanish at the edges of the region of integra-
tion, the final term in (3.6) is zero. Moreover, (3.6) must apply for arbitrary
variations δφ, so the function in square brackets must cancel identically in x.
Thus we find the Euler–Lagrange equations:
∂L ∂L
∂μ ( )= (3.7)
∂∂μ φ ∂φ
20 Relativistic Quantum Mechanics
Therefore
• the relativistic invariance of the action requires that the Lagrangian den-
sity should itself be invariant.
We then find:
∂L ∂L ∂L
δH = d3 x ∂t φ δπ + πδ(∂t φ) − δφ − · δ(∇φ) − δ(∂t φ)
∂φ ∂∇φ ∂∂t φ
∂L
− d3 x δt =
∂t
3 ∂L ∂L
= d x ∂t φ δπ − ∂t π δφ − ∇ · ( δφ) − d3 x δt. (3.12)
∂∇φ ∂t
where we have used the Euler–Lagrange equations in the form:
∂L ∂L ∂L
−∇ = ∂t = ∂t π (3.13)
∂φ ∂∇φ ∂∂t φ
We can discard the 3-divergence from the differential of the Hamiltonian,
so we obtain:
∂L
δH = d3 x (∂t φ δπ − ∂t π δφ) − d3 x δt. (3.14)
∂t
On the other hand, from the fact that the Hamiltonian density depends
on φ, ∇φ and π, we obtain (continuing to discard the 3-divergences):
∂H ∂H ∂H ∂H
δH = d3 x δπ + −∇· δφ + d3 x δt. (3.15)
∂φ ∂φ ∂∇φ ∂t
In this case, the symbol y which appears in the first term is simply a
dummy variable, and f (x) is the argument of the functional. For the functional
derivative, it is easily shown that:
δf (y)
= δ(x − y). (3.23)
δf (x)
If we introduce the concept of the functional derivative, the second of
Hamilton’s equations takes an aspect more similar to that of the case of a
finite number of degrees of freedom. Applying the definition (3.19), Hamilton’s
equations (3.17) can be put in the form2 :
δH
∂t φ(x) =
δπ(x)
δH
∂t π(x) = − . (3.24)
δφ(x)
which appear in the preceding equations are equal to dim[H] -dim[φ] (or π) - [length3 ], or,
equivalently, to dim[H] -dim[φ] (or π).
The Lagrangian Theory of Fields 23
Comment. The quantum canonical field variables, φ(x) and π(x), are op-
erators (i.e. infinite dimensional matrices) depending upon the space-time co-
ordinates x = (x, x0 ). In the limit → 0, these operators have to go into the
classical field variables, φ(x) and π(x), commuting functions of x. The limit
(at equal time):
[φ(x, 0), π(y, 0)]
lim→0 (3.30)
i
is classically a function which obeys the algebraic properties of the commu-
tator. In view of the antisymmetry of the Poisson bracket and of the Jacobi
identity (3.29), one is naturally led to assume that:
[φ(x, 0), π(y, 0)]
lim→0 = {φ(x, 0), π(y, 0)} = δ (3) (x − y). (3.31)
i
24 Relativistic Quantum Mechanics
Thus, the Heisenberg equal-time relations for the canonical variables q and
p, translated into field theory:
[φ(x, 0), π(y, 0)] = iδ (3) (x − y) (3.32)
guarantee that quantum field theory goes into classical field theory for van-
ishing Planck’s constant.
Extending the case of the scalar field, we can define tensor fields, functions
of xμ provided with a certain number of upper (covariant) indices, ns , and
lower (contravariant) ng , Fμν11,μ
,ν2 ,...
2 ,...
(x) whose transformation, rules are the same
as (3.38):
(F )νμ11,ν 2 ,...
,μ2 ,... (x ) = (Λ
−1 λ1
)μ1 (Λ−1 )λμ22 ...Λνρ11 Λνρ22 ...Fλρ11,λ
,ρ2 ,...
2 ,...
(x). (3.39)
The Lagrangian Theory of Fields 25
0123 = +1 (3.40)
μ1 μ2 μ3 μ4 = 0 (two equal indices)
μ1 μ2 μ3 μ4 = ±1 : μ1 , μ2 , μ3 , μ4 = even/odd permutations of 0123.
invariant (they transform like the scalar field in (3.34)). These invariant combi-
nations are the building blocks with which to construct the Lagrangian density,
which describes the field dynamics.
and therefore:
1 μν
L1 = F Fμν = (E)2 − (B)2 (3.50)
2
1
L2 = F̄ μν Fμν = (E · B).
2
The form of (3.50) suggests to consider the complex vectors:
The squares of the two 3-vectors are separately conserved by Lorentz trans-
formations, which therefore transforms their components among each other.
The parity transformation obviously exchanges Z + with −Z − .
Example 2. Tensors with two symmetric indices, both covariant (or con-
travariant): T μν (or Tμν ). In this case, the projection with gμν gives an in-
variant and the space factorises into the space of tensors symmetric and with
zero trace, gμν T μν = 0, of nine dimensions, and into a unidimensional space
of tensors of the form g μν T .
(1/2, 1/2)⊗(1/2, 1/2) = (1+0, 1+0) = [(1, 1)⊕(0, 0)]⊕[(1, 0)⊕(0, 1)]. (3.52)
if φ(x), x and φ (x ), x are the components of the field and the coordinates
associated with an event in the two IFs, equations (3.33) and (3.39).
We can consider equation (3.53) from another viewpoint, as the invariance
(or symmetry) of the action for the transformation which substitutes φ(x), x
with φ (x ), x in the same frame of reference:
x → x ;
φ(x) → φ (x ). (3.54)
Viewed in this way, the relativity principle simply expresses the symme-
try of the action under the symmetry group of (proper and orthochronous)
Lorentz transformations and we can study the consequences of this symmetry
at the same time as other possible symmetries of the action which can take
the general form (3.54), with different realisations of the transformation rules.
For simplicity, we have omitted in (3.55) the indices connected to the field,
φ, which are implied. We denote δT φ as the total variation of the field. We
can decompose δT φ in the following way:
The total variation is the sum of the functional variation, δφ, and of a
translation by δxμ .
The Poincaré group transformations and, a fortiori, the transformations
associated with internal symmetries leave the size of d4 x invariant. This re-
quires that:
∂x 4 ∂δxλ 4
d4 x = || ||d x = det(δμλ + )d x = d4 x. (3.57)
∂x ∂xμ
If we use the identity:
valid at least to terms of higher order in the infinitesimal matrix , the invari-
ance condition on the volume of integration takes the form:
1 = 1 + ∂μ δxμ ; or (3.59)
μ
∂μ δx = 0
Under these conditions, the invariance of the action, equation (3.53) simply
requires the invariance of the Lagrangian density:
Using the equations of motion and (3.59), we obtain the conservation equa-
tion:
∂L
∂μ [( )δφ + Lδxμ ] = 0. (3.63)
∂ ∂μ φ
We can express the infinitesimal variations of the fields and coordinates
as linear combinations of the infinitesimal parameters which characterise the
transformation:
δxμ = A (ΔA )μ (x); δφ = A (ΣA )φ (3.64)
A A
where ΔA and ΣA are matrices which represent the generators of the infinites-
imal transformations of the coordinates and the fields. Because (3.63) must
be satisfied for any arbitrary values of the infinitesimal parameters, we obtain
the conservation equations:
∂L
∂μ (J A )μ = ∂μ [( )ΣA φ + (ΔA )μ L] (3.65)
∂ ∂μ φ
for the currents associated with each infinitesimal generator. The result (3.65)
is Noether’s theorem:
where nμ and nμ are the normals to the two hypersurfaces, which point in the
time direction of the two reference frames, and I represents the contribution
of the lateral surfaces of our 4-volume. If we let these surfaces tend to infinity,
I → 0, and we then have:
nμ J μ dΓ0 = nμ J μ dΓ1 ; (3.71)
Γ0 Γ1
i.e. d3 x J 0 (x, t) = d3 x J 0 (x , t ).
sμ = ∂λ T λμ . (3.72)
xμ = xμ + aμ (3.74)
32 Relativistic Quantum Mechanics
Invariance under translations thus requires that L does not depend explic-
itly on x.
From (3.75), we derive the form of the functional variation of the fields:
The conserved current derived from (3.66) and (3.78) is a tensor of rank
two which is called the canonical energy-momentum tensor
∂L ν
T μ,ν = ∂ φ − g μν L; (3.79)
∂∂μ φ
∂μ T μ,ν = 0.
The infinitesimal parameters μν are not independent, because the matrices
Λ must be such that they leave the metric tensor invariant:
ΛT gΛ = g; i.e. (3.85)
λμ gλν + gμλ λν = μν + νμ = 0,
where we have kept the terms to first order in and we have defined a new in-
finitesimal tensor, μν , with two contravariant indices. Condition (3.85) states
that this tensor should be antisymmetric in the two indices. Equation (3.83)
is rewritten as:
The components associated with spatial rotations (αβ = ij) give as con-
served charge the total angular momentum of the field, for example:
J 1 = d3 x M 0[23] = d3 x[(x2 T 0,3 − x3 T 0,2 ) + Σ0[23] ]. (3.91)
Comment. This might suggest to identify the first term in (3.91) as due
to the orbital angular momentum and the second to the intrinsic angular
momentum of the field (which in quantum mechanics would correspond to
the spin of the field quanta). In fact, in the case of the scalar field, the second
term is absent. However, the two terms individually are ambiguous, as we
will see in the following section. For example, we can redefine the energy-
momentum tensor so as to eliminate completely the second term. The only
quantities uniquely defined are the constants of the motion, J k .
That K i is a constant simply expresses the fact that the barycentre of the
energy, for an isolated system, moves with uniform rectilinear motion.
3 i 00
d xxθ
< x >= 3 00
i
(3.101)
d xθ
3 0i
d xθ
< xi >= ct · 3 00 + constant.
d xθ
4. Write the complex scalar field in terms of its real and imaginary compo-
nents: φ = φ1 + iφ2 . Prove that the theory embodied by the Lagrangian
of Problem 1 is identical to the one of Problem 3.
5. Given the Lagrangian density
1 1 λ
L= ∂μ φ∂ μ φ − m2 φ2 − φ3
2 2 3!
– derive the equations of motions for the field φ;
– derive the expression of the canonical energy-momentum tensor
T μν applying Noether’s theorem in the case of translation invari-
ance;
– obtain the Hamiltonian density H, and verify that H = T 00 .
xμ → x = eα xμ
μ
Sect. 3.2
1. Using eq. (3.19), prove that:
δφ(x) δπ(x)
= = δ(x − y)
δφ(y) δπ(y)
δφ(x) δπ(x)
= =0
δπ(y) δφ(y)
and that:
Sect. 3.3
1. Demonstrate that the charges corresponding to the momentum tensor
in (3.99) agree with those of the canonical tensor (3.90).
CHAPTER 4
KLEIN–GORDON FIELD
QUANTISATION
∂V
2φ + m2 φ + = 0. (4.2)
∂φ
In the case V = 0, we find the Klein–Gordon equation:
(2 + m2 )φ = 0. (4.3)
which constitutes the simplest relativistic field equation.
To construct the Hamiltonian, we calculate the conjugate momentum den-
sity (recall that x0 = ct):
∂L 1
π= = 2 ∂t φ (4.4)
∂∂t φ c
so that:
1 2 2
H = π ∂t φ − L = [c π + (∇φ)2 + m 2 φ2 + V (φ)]. (4.5)
2
DOI: 10.1201/9781003436263-4 39
This chapter has been made available under a CC BY NC license.
40 Relativistic Quantum Mechanics
The stability of the field requires that the Hamiltonian should be bounded
from below, for variations of φ in all possible configurations. Restricting our-
selves to spatially constant configurations, we see from (4.5) that V (φ) must
be a function bounded from below.
Hamilton’s equations are:
∂t φ = c2 π; (4.6)
∂V
∂t π = −(m2 φ + − ∇ · ∇φ).
∂φ
Substituting the first into the second we find, of course, the relativistically
invariant equation (4.2).
(2 + m2 )φ = 0
which we can solve with periodic boundary conditions at the edges of a large
cubic spatial volume of side L:
k μ k μ − m2 = 0
where N is a normalisation factor which we will shortly define, and the sum
is over all vectors with integer components. In the second term we can sum
over −n and define c(−k) = b∗ (k):
The φ given by (4.11) is real and depends on two real functions of k, the real
and imaginary parts of a(k). This corresponds to the fact that, to completely
determine φ, it is necessary to provide two types of initial conditions: the
values of φ(x, 0) and ∂t φ(x, 0).
To conclude this section, we can state explicitly the relation between a(k)
and the initial conditions. We consider the system of functions fk (x), solutions
of the K–G equation for positive frequencies:
μ
fk (x) = N e−ikμ x (4.12)
φ1 + iφ2
φ= √
2
the Lagrangian (4.14) reduces to the sum of two identical Lagrangians for
the real fields φ1 and φ2 . The symmetry (4.15), in this new representation,
corresponds to an orthogonal rotation of the fields φ1,2 among themselves:
φi = Oij φj OT O = 1. (4.16)
The solutions of (4.18) are obtained starting from the Green’s function for the
problem, the solution to the equation in the presence of a pointlike source
described by a Dirac delta function localised at the origin of space-time:
as is easily verified.
There are, of course, an infinite number of Green’s functions, each deter-
mined by the particular boundary conditions which are assigned to (4.19). The
solutions differ among themselves by a solution to the homogeneous equation,
so that the general solution to (4.18) is written:
φ(x) = d4 x G(x − x )J(x ) + φ0 (x) (4.21)
(2 + μ2 )F (x) = 0.
Using the residue theorem, it is easily seen that the integral is equal to zero
if the path does not include either of the two singularities, or a combination
of the two homogeneous solutions, represented by the residues of the integral
around each singularity.
We denote by C + a path which encircles clockwise (only once!) the point
k 0 = +ω(k) and we define:
1 i
iΔ(+) (x) = 4
d4 k e−i(k·x) 2 =
(2π) C + k − μ2
1 i
= d3 k dk 0 0 e−i(k·x) ;
(2π)4 C+ (k − ω)(k 0 + ω)
3 3
(+) 1 d k −i(ωt−k·x) 1 d k −i(k·x)
iΔ (x) = 3
e = 3
e . (4.26)
(2π) 2ω (2π) 2ω
where we have put g̃(k) = g̃(ω(k), k), and g and h are two independent func-
tions. Equation (4.29) gives a representation of the general solution of the
homogeneous equation which reproduces what was given earlier, in (4.13) in
the complex case (in which g̃(k) and g̃(−k) are independent) and in the con-
tinuum limit, in which:
g̃(k) = 2ω(k)V a(k). (4.30)
√ that in the continuum limit, V → ∞, the a(k) must tend to zero like
(Note
1/ V if we wish for field configurations in which the total energy of the field,
rather than the energy density, remains finite).
For t < 0 the integration path must be closed in the upper part of the
complex plane (η > 0). To obtain a null result, the path closed in this
way should not contain the singularity, which must therefore be below
the path of integration. Conversely, when t > 0 and the integration path
is closed in the lower half-plane, the path, turning clockwise, includes
the two singularities. The result is therefore:
1 i
iΔret (x) = d4 k 2 = θ(t)[iΔ(+) (x) + iΔ(−) (x)].
(2π)4 Imk0 >0 k − μ2
(4.31)
• The symmetric condition, that G should vanish for t > 0, takes us to
the advanced Green’s function:
1 −1
iΔadv (x) = 4
d4 k 2 = −θ(−t)[iΔ(+) (x) + iΔ(−) (x)].
(2π) Imk0 <0 k − μ2
(4.32)
46 Relativistic Quantum Mechanics
k 0 = −ω(k) → k 0 = −ω + i;
k 0 = +ω(k) → k 0 = +ω − i.
L = ∂μ φ † ∂ μ φ − m 2 φ † φ (4.34)
(∂μ ∂ μ + m2 )φ = 0 (4.35)
with α constant.
It follows from Noether’s theorem that:
θμν = ∂ μ φ† ∂ ν φ − g μν L (4.38)
i † μ
J μ (x) = [φ (∂ φ) − (∂ μ φ† )φ]. (4.40)
From (4.35) the conjugate momenta and the Hamiltonian are quickly
found:
∂L
π= = ∂t φ † ; π † = ∂ t φ
∂∂t φ
H = π † π + ∇φ† · ∇φ + m 2 φ† φ; H = d 3 x H
i
Q = d3 xJ 0 (x, t) = d3 x φ† (∂ 0 φ) − (∂ 0 φ† )φ (4.41)
Canonical quantisation should replace classical Poisson brackets for the
canonical variables (cf. Chapter 3) with the equal time commutators according
to the rule:
[A, B]
{A, B} → . (4.42)
i
We obtain the equal time commutators from equation (3.26) :
[φ, Q] = +φ (4.46)
The solutions to the K–G equation are of the form found in section 4.1.
To be exact, the K–G equation, being linear in φ, has the same solutions,
regardless of whether φ is a complex number or an operator. These are the
amplitudes of the normal oscillation modes, a(k) and b(k), which now become
linear operators with commutation rules determined by (A.25). (We normalise
a(k) and b(k) so as to eliminate from their commutation rules.):
φ(x) = Σk [a(k)e−i(kx) + b† (k)ei(kx) ]. (4.47)
2ωV
Inverting equation (4.47) we find:
a(k) = i d3 x[fk∗ · (∂t φ) − (∂t fk∗ ) · φ]t=0 =,
b(k) = i d3 x[fk∗ · (∂t φ† ) − (∂t fk∗ ) · φ† ]t=0
1
fk = e−ikx , (4.48)
2ω(k)V
• the vacuum state in which all the oscillators are at the lowest level.
Mathematically, |0 > is determined by the condition of being annihilated
by the application of any destruction operator:
as (p)|0 >= br (q)|0 >= 0, for any s, r, p, q; (4.50)
Normal Products. The version of the classical Lagrangian and other ob-
servables of the field (energy, momentum, etc) suffer from an intrinsic am-
biguity, because we must convert the products of classical quantities (which
commute) into products of linear operators (in general, non-commuting). We
can remove this ambiguity by defining, as we have just done, the quantum
operator products in a way which ensures that their vacuum value is equal to
zero. When we impose this condition, we speak of having normal products or
normal ordering.
To formalise this condition, we observe that the field operators are sums
of two components, characterised by the sign of the exponential in t in the
plane waves:
Sect. 4.3
1. Given the real scalar field φ(x),
2. Find the momentum operator associated with the real scalar field φ,
defined as
P i = d3 x T 0i
which leaves the Lagrangian density of a free massless scalar field invari-
ant.
ELECTROMAGNETIC-
FIELD QUANTISATION
52 DOI: 10.1201/9781003436263-5
This chapter has been made available under a CC BY NC license.
Electromagnetic-Field Quantisation 53
Equation (5.10) expresses the fact that the dual tensor does not contain
sources, unlike F μν . Because the conversion from F to F̃ requires the inter-
change of electric and magnetic fields, equation (5.10) implies the absence of
magnetic monopoles, the magnetic analogue of electric charge.
F μν = ∂ ν Aμ − ∂ μ Aν (5.11)
Aμ → Aμ = Aμ + ∂ μ f (5.15)
the new vector potential gives rise to the same observables E and B, for any
function f.
54 Relativistic Quantum Mechanics
2f = 0 (5.18)
permits non-trivial solutions (as we have seen already in the case of the
Klein–Gordon equation). This is in accord with a count of the degrees of
freedom. Taking account of (5.16) we find three degrees of freedom for Aμ ,
but the preceding count says that there should be two.
Unfortunately, the final condition cannot generally be given in covariant
form. This is the origin of numerous problems, which will be confronted and
resolved in what follows.
∂ν F μν = 2Aμ − ∂ μ (∂ν Aν ) = J μ
which, if Aμ satisfies the Lorenz condition (5.16), reduces to the wave equation:
∂ ν ∂ ν Aμ = J μ (5.19)
To characterise the solutions of (5.19), we can use the results of the pre-
ceding section, in the limit of zero mass, μ2 → 0. As we have seen, the Green’s
functions of the Klein–Gordon equation contain singularities in the Fourier
transform, localised at the points k 0 = ± μ2 + (k)2 . To discuss the electro-
magnetic field, it is desirable to begin with a fictitious mass, λ, which is small
1 Proposed in 1867 by the Danish mathematical physicist, Ludvig Valentin Lorenz, not
to be confused with H. A. Lorentz of the homonymous transformations.
Electromagnetic-Field Quantisation 55
but non-zero, to prevent singularities from coalescing at the same point when
k → 0. The limit λ → 0 can be taken at the end of the calculations.
From (5.19), we see that the Green’s function for the vector potential
satisfies the equation:
−2Gμν (x) = −g μν δ (4) (x) (5.21)
or, after the Fourier transform:
k 2 G̃μν (k) = −g μν . (5.22)
The solution of (5.19) is therefore written (in the Lorenz gauge, with Feyn-
man boundary conditions):
d4 k −g μν ˜ −i(k·x)
μ
A (x) = limλ→0 Jν (k) e + Aμ0 (x) (5.23)
(2π)4 k 2 − λ2 + i
where Aμ0 satisfies the free wave equation.
As noted in the previous paragraph, the Lorenz condition does not com-
pletely determine the gauge and we can impose a further condition. To con-
tinue, we must identify an appropriate basis on which to project the four
components of Aμ .
We fix the 4-vector k μ = (ω(k), k) and, correspondingly, a space-time
coordinate system identified by the following four vectors:
μ1,2 = (0, 1,2 ); k · (1,2) = 0;
k
μ3 = (0, );
|k|
μ0 = η μ = (1, 0). (5.24)
The normalisation conditions are:
μα νβ gμν = gαβ (5.25)
and the completeness conditions are:
km kn
Σi=1,2 m n
i i = (δ
mn
− );
|k|
Σα gαα μα να = g μν . (5.26)
These 4-vectors form a basis in which any other vector can be expanded.
Using the second completeness equation from (5.26), we rewrite the Feynman
Green’s function in (5.23) in the following way (with the limit of zero mass
understood):
−g μν −(Σα gαα μα να )
=
k 2 + i k 2 + i
(Σ1,2 μi νi ) (μ3 ν3 − η μ η ν )
= + . (5.27)
k 2 + i k 2 + i
56 Relativistic Quantum Mechanics
kμ ω μ
μ3 = − η (5.28)
|k| |k|
and consequently:
kμ kν ω ω2 μ ν
μ3 ν3 − η μ η ν = 2
− 2
(k μ η ν + η μ k ν ) − (1 − )η η . (5.29)
(k) (k) (k)2
Explicitly:
d4 k 1 k i k j ˜j
i
A (x) = (δ ij
− )J (k)e−i(k·x) ;
(2π)4 k 2 + i |k|2
d4 k 1
A0 (x) = ρ̃(k)e−i(k·x) . (5.31)
(2π)4 |k|2
We note that:
∇ · A(x) = 0;
−∇ · ∇A0 (x) = ρ(x)
Electromagnetic-Field Quantisation 57
In general, we can fix the gauge so that the electric field is divided into
transverse and longitudinal parts:
E = EL + ET ;
∇ · ET = 0; ∇ · EL = ρ. (5.32)
From the preceding results we obtain explicitly:
1 1
E L (x, t) = −∇A = −∇ d3 y
0
ρ(y, t);
4π |x − y|
∂ ∂ d4 k 1 k(k · J˜)
ET = − A = − 4 2
e−ikx [J˜ − ]. (5.33)
∂t ∂t (2π) k + i |k|2
1 v2
L(x) = − Fμν F μν + δ (3) [x − x(t)](−mc2 1− ) − jμ Aμ (x). (5.38)
4 c2
From this result, the field equations are obtained as before, in the form:
1
∂ν F μν = j μ = q uμ Aμ [x(t)] δ (3) [x − x(t)] (5.39)
γ
58 Relativistic Quantum Mechanics
2
where uμ is the 4-velocity and, as usual, γ = 1/ 1 − vc2 .
To derive the equations of motion of the charge, we calculate the conjugate
momentum (where L = d3 xL):
∂L
= mγ v + qA[x(t), t] (5.40)
∂v
from which:
d ∂L d ∂A 1
= (mγv) + q + q (v · ∇)A (5.41)
dt ∂v dt ∂t c
and the generalised force:
∂L
= −q[∇A0 − Σi v i ∇Ai ]. (5.42)
∂x
The equations of motion are therefore:
d ∂L ∂L
= ;
dt ∂v ∂x
d d ∂A 1
(mγv) = (p) = −q − q∇A0 + q [−(v · ∇)A + Σi v i (∇Ai )] =
dt dt ∂t c
1 1
= qE + q v × ∇ × (A) = qE + q v × B. (5.43)
c c
These are the equations for the spatial components of the 4-momentum, p,
of the charge. The equation for the time component is obtained by multiplying
the previous equation by p and using the relations:
p0 dp0 = p · dp;
v/c = β = p/p0 .
We therefore find:
d dp0
=c = q v · E. (5.44)
dt dt
Equation (5.44) expresses the conservation of energy: the energy acquired
by the particle in a unit of time is equal to the power provided by the electric
field (the Lorentz force, the second term in the second line of (5.43), does not
perform work).
The preceding equations can be put in the covariant form, noting that:
1 0μ
v·E = F uν
γ
dp1 q 1 1μ
= qF 01 + (v 2 F 12 − v 3 F 31 ) = F uμ .
dt c cγ
Electromagnetic-Field Quantisation 59
We therefore obtain:
dpμ q 1 μν
= F uν ; or
dt cγ
dpμ q
= F μν uν (5.45)
dτ c
in terms of the proper time. In the case of multiple particles, we obtain an
equation of the type (5.45) for each particle, while in equations (5.39) we
should include the contribution of each particle to the current.
With the boundary conditions that there should be no external fields and
fields at infinity, the Maxwell–Lorentz equations (5.39, 5.45) describe the time
evolution of a system of charged particles, each under the action of the field
generated by itself and by the other particles.
Assuming that the other forces are negligible, this system of equations
describes the behaviour of matter in terms of its elementary constituents. The
Maxwell–Lorentz equations are the first example of a Theory of Everything, a
theory which would describe all the phenomena found in Nature.
The systematic description of the properties of matter based on equations
(5.45) was dealt with by Lorentz [4] at the start of the twentieth century, and
represented a fundamental step forward in the understanding of the structure
of matter.
For the behaviour of matter on the laboratory scale, the hypothesis that
electromagnetic forces should dominate is completely adequate. On astronom-
ical scales, it is necessary to take into account gravitational forces, which can
be inserted into the picture by extending the principle of special relativity to
Einstein’s general relativity. The Maxwell–Lorentz–Einstein equations give an
accurate description of phenomena on macroscopic scales, not yet superseded.
On a microscopic scale, at atomic and subatomic dimensions (10−8 cm),
the Maxwell–Lorentz picture must be replaced by quantum electrodynamics
(QED). It is a very noteworthy fact that the Maxwell–Lorentz equations,
once converted to the framework of quantum field theory, maintain their form
essentially unaltered and are capable of describing the properties of condensed
matter and of atoms extraordinarily accurately.
At nuclear and subnuclear levels (below 10−13 cm) other forces enter into
play: the nuclear forces described for the first time in a covariant and quantum
manner by Yukawa, at the beginning of the 1930s, and the weak interactions,
identified by Fermi, also at the start of the 1930s, as responsible for the decay
of the neutron and β radioactivity.
In conclusion:
g μν αβ
e.m. = −gρσ F
Θμν ρμ σν
F + F Fαβ . (5.48)
4
The explicit forms of the energy and momentum densities obtained from
(5.48) are:
1
Ee.m. = Θ00
e.m. = (E · E + B · B);
2
i
Pe.m. = Θ0i
e.m. = (E × B)i . (5.49)
L = Le.m. + Lq + Lint ;
1
Lq = −mc2 δ (3) [x − x(t)];
γ
1
Lint = −jμ Aμ = q δ (3) [x − x(t)]uμ Aμ .
γ
The overall energy-momentum tensor is obtained by differentiating with
respect to the degrees of freedom of the field and of the particle. We restrict
ourselves to consideration of the spatial integral of the time components (Lq +
Electromagnetic-Field Quantisation 61
Lint = d3 x(Lq + Lint ):
0,0
E tot = d3 xTtot
∂L 0 ∂(Lq + Lint )
= d3 x[ ∂ Aβ − g 00 Le.m. ] + · v − (Lq + Lint )
∂ 0 Aβ ∂v
= d3 xTe.m.
0,0
+ mc2 γ + qA0 [x(t), t] (5.50)
∂(Lq + Lint )
pitot = d3 xTe.m.
0i
+
∂v i
= d3 xTe.m.
0i
+ mγv i + qAi [x(t), t]. (5.51)
μ,ν
The tensor Te.m. is no longer conserved. Using the equation of motion for
μν
F , (5.39), we find:
dpνe.m.
= d3 x∂0 Te.m.
0,ν
= d3 x∂μ Te.m.
μ,ν
=
dt
1
= − d3 xjβ ∂ ν Aβ + d3 x Fμσ (∂ μ F σν + ∂ σ F νμ + ∂ ν F μσ ).
2
In the parentheses of the second term σ = μ. If we also have ν = σ = μ,
the bracketed term is zero for the homogeneous Maxwell’s equations, (5.8). If
instead ν = σ = μ, it vanishes because of the antisymmetry of F μν . In any
case, therefore:
dE e.m.
= − d3 x jβ ∂ 0 Aβ = −q∂0 A0 + qv · (∂0 A);
dt
dpie.m.
= qv · (∂i A)
dt
from which:
dE tot dE e.m. dE q
= + =
dt dt dt
d(mc2 γ)
= −q∂0 A0 + qv · (∂0 A) + + q∂0 A0 + qv · ∂A0 =
dt
d(mc2 γ)
= −qv · E + =0 (5.52)
dt
as a consequence of the equation of motion of the particle, (5.44). Similarly,
from the equation of motion, (5.43), we find:
dpitot dpie.m. dpiq
= + =
dt dt dt
dmγv i
= qv · (∂i A) + + q∂0 Ai + v · ∇Ai =
dt
dmγv i
= − qE i − q(v × B)i = 0. (5.53)
dt
62 Relativistic Quantum Mechanics
The total derivative can be omitted and we can combine the last term
with the expression for the energy-momentum of the particle. In this way we
obtain:
1 mc2
E tot = d3 x(E 2 + B 2 ) +
2 1 − vc2
2
mv i
pitot = d3 x(E × B)i + . (5.54)
2
1 − vc2
In the expressions of (5.54) apparently, only the energy of the free particle
appears. In the case of more than one particle, it can be asked where the
energy from the electrostatic interaction between the particles has gone. The
answer is that this energy is absorbed into the first term, as can be seen in the
following way. We separate the electric field into longitudinal and transverse
components, according to (5.32). We find:
1 1
d3 x E 2 = d3 x(EL2 + ET2 ) =
2 2
1 1
= d3 x ET2 + d3 x (∇Φ) · (∇Φ) =
2 2
1 1
= d3 x ET2 − d3 xΦ(∇ · ∇Φ) =
2 2
1
= d x ET + d3 x Φρ
3 2
(5.55)
2
where Φ is the electrostatic potential. The second term in the final expression
is the electrostatic energy which, for a system of pointlike particles, leads to
the expression:
1 1 q i qj
VCoul = Σij . (5.56)
2 4π |xi (t) − xj (t)|
We find:
1 1 1 qi qj mc2
Etot = d3 x(ET2 + B 2 ) + Σij + . (5.57)
2 2 4π |xi − xj | 2
1 − vc2
Electromagnetic-Field Quantisation 63
The first term contains only the degrees of freedom of the radiation field,
while the second and third contain the degrees of freedom of the particles.
Starting from (5.54) the unintegrated energy-momentum tensor can finally
be derived, in the form:
Θμν = Θμν μν
e.m. + θ ;
1
θμν = mc2 uμ uν δ (3) [x − x(t)].
γ
Δt
Δ(jA) = Δ3 xΔt(jμ Aμ ) = q uμ Aμ (x(t), t) = qΔτ [uμ Aμ (x(τ ), τ )] (5.58)
γ
where τ is the proper time of the particle. We have obtained an invariant
result for any 4-vector Aμ , therefore j μ must transform like a 4-vector. Similar
reasoning holds for the other densities.
H p = p · v − (Lp + Lint ) =
v2
= p · v + qA0 − qv · A + mc2 1− =
c2
v2
= qA0 + v · π + mc2 1− .
c2
Eliminating v we find, finally:
H p = qA0 + (mc2 )2 + c2 (p − qA)2 . (5.60)
H p → H p − qA0 ;
p → p − qA (5.62)
pμ → pμ − qAμ . (5.63)
We must add this result to the Hamiltonian of the particle, (5.60). Con-
sidering the case of several charged particles, we find (with Aμi = Aμ [xi (t), t]):
1
H tot = d3 x(ET2 + B 2 )−
2
1
− d3 x EL2 + Σi qA0i + Σi (mi c2 )2 + c2 (pi − Ai )2 } =
2
1 1 qi qj
= d3 x(ET2 + B 2 ) + Σij +
2 2 4π|xi − xj |
+ Σi (mi c2 )2 + c2 (pi − Ai )2 . (5.66)
dx ∂H p
= = ;
dt ∂p m
dp ∂H ∂V ∂V
=− =− = − 2 2x. (5.69)
dt ∂x ∂x ∂r
Now we introduce a constant magnetic field directed along the z-axis,
generated by the vector potential:
B
A= (−y, x, 0); ∇ × A = B.
2
According to the minimal substitution, the new Hamiltonian is:
(p + eA)2
H= + . . . + V (r2 , . . .). (5.70)
2m
66 Relativistic Quantum Mechanics
and in addition:
B
A = Ax + iAy = i ζ. (5.72)
2
Hamilton’s equations for the coordinates transverse to B are:
dζ ∂H ∂H p
= +i = + iωL ζ
dt ∂px ∂py m
where we have introduced the Larmor frequency:
eB
ωL = . (5.73)
2m
We define a new variable χ(t), setting:
are (i = 1, 2, 3):
π 0 = F 00 = 0, π i = F 0i = ∂ i A0 − ∂ 0 Ai = E i . (5.81)
∂φ(x)
2A(x) = j(x) − ∇ , (5.88)
∂t
as follows from (5.3), while (5.1) implies that the scalar potential A0 = φ, the
solution of Poisson’s equation:
For this reason, the gauge (5.87) is called the Coulomb gauge.
In principle, the term containing φ on the right-hand side of (5.88) can
be obtained from (5.90). However, separating the longitudinal and transverse
components of the current, by writing:
with
∇ × j L = 0, ∇ · jT = 0 (5.92)
Electromagnetic-Field Quantisation 69
∂φ(x)
∇ = j L (x) , (5.93)
∂t
which means that the source term in the equation which determines the vector
potential A is reduced to the transverse component of the current, j T (x).
In the absence of charges and currents, φ(x) ≡ 0 and (5.88) becomes:
2A = 0 . (5.94)
ωk = |k| . (5.96)
The gauge condition (5.87) implies that the polarisation vector has the
property:
k · k = 0 , (5.97)
that is, for every wave vector k, k lies on the plane perpendicular to k.
Therefore we can define two real unit vectors k1 and k2 , such that (r, r =
1, 2)
kr · kr = δrr (5.98)
and
kr · k = 0 . (5.99)
The general solution of (5.94) which satisfies the gauge condition (5.87) is
obtained from a linear combination of uk (x, t) (we recall that A(x, t) is a real
function)
2
A(x, t) = [ckr ukr (x, t) + c∗kr u∗kr (x, t)]
k r=1
2
= [ckr (t)ukr (x) + c∗kr (t)u∗kr (x)] , (5.100)
k r=1
where xkr and pkr are the canonical classical variables, which satisfy the
equations of motion:
ẍkr (t) = −ωk2 xkr (t), p̈kr (t) = −ωk2 pkr (t) . (5.109)
Comparing (5.106) with (5.108) we see immediately that H coincides with
Hosc if
1 1
ckr = (ωk xkr + ipkr ) , c∗kr = (ωk xkr − ipkr ) , (5.110)
2ωk 2ωk
or if
xkr = ckr + c∗kr , pkr = −iωk (ckr − c∗kr ) . (5.111)
This result suggests to interpret H as the energy of a set of classical
oscillators. The path to the quantum case is straightforward. The energy of a
system of quantum oscillators has the form:
ωk †
H= akr akr + akr a†kr , (5.112)
2
kr
which agrees with the energy of the electromagnetic field (5.106) if the coef-
ficients of the expansion of A in a Fourier series are interpreted as quantum
operators defined by the equations:
1 1
ckr = √ akr , c∗kr = √ a† . (5.113)
2ωk 2ωk kr
The vector potential A can therefore be expressed in terms of the creation
and destruction operators a†kr and akr
1 !
A(x) = √ kr akr (t)e−ik·x + a†kr (t)eik·x (5.114)
kr
2V ωk
1 !
= √ kr akr e−i(ωk t−k·x) + a†kr ei(ωk t−k·x)
kr
2V ωk
= A+ (x) + A− (x) , (5.115)
72 Relativistic Quantum Mechanics
and is itself an operator in the Hilbert space whose state vectors are:
Because A(x) is linear in the creation and destruction operators, from the
commutation rules
!
[akr , Nk r ] = akr , a†k r ak r = δrr δkk akr (5.121)
and ! !
a†kr , Nk r = a†kr , a†k r ak r = δrr δkk a†kr , (5.122)
it follows that the operator Nkr does not commute with A(x) nor, therefore,
with the electric and magnetic fields E and B. This result implies that the
values of E and B and nkr cannot be simultaneously measured with arbitrary
precision. Moreover, from the linearity of A(x) in akr and a†kr , it also follows
that the expectation values E and B in the states (5.116) are all zero.
Classically the momentum of an electromagnetic wave is given by the
Poynting vector (compare with equation (5.54))
∂A
p = d3 x E × B = − d3 x × (∇ × A) . (5.123)
∂t
Electromagnetic-Field Quantisation 73
that, substituted in the equation analogous to (3.91) for J 3 , gives the result
$ 3 %
∂ ∂
J 3 = d3 x Ȧi x1 − x2 Ai − (Ȧ1 A2 − Ȧ2 A1 ) . (5.128)
i=1
∂x 2 ∂x 1
Now substituting (5.114) into (5.128) we see that J 3 satisfies the commu-
tation rules
!
J 3 , a†kr = i 1kr a†k2 − 2kr a†k1 , (5.129)
where we have chosen the axis x3 along the direction of the wave vector k.
Note that only the second term in the integrand of (5.128), which we can
interpret as spin angular momentum, gives a non-zero contribution to the
commutator (5.129).
We now define two new operators
1 1
a†kR = √ a†k1 + ia†k2 , a†kL = √ a†k1 − ia†k2 , (5.130)
2 2
which create circularly polarised photons; that is, photons whose state of po-
larisation is described by the vectors
1 1
kR = √ (k1 + ik2 ) , kL = √ (k1 − ik2 ) . (5.131)
2 2
Setting k1 ≡ (1, 0, 0) and k2 ≡ (0, 1, 0) and rewriting (5.129) in terms of
the new operators we obtain the commutation rules
! !
J 3 , a†kR = a†kR , J 3 , a†kL = −a†kL , (5.132)
from which it follows that the third component of the angular momentum of
the state with a photon is given by
!
J 3 a†kR |0 = J 3 , a†kR |0 = a†kR |0 , (5.133)
!
J 3 a†kL |0 = J 3 , a†kL |0 = −a†kL |0 . (5.134)
This result shows that the photon has spin |J | = 1, as required by the
vector nature of the electromagnetic field, and that the two projections J3 =
±1 corresponds to the circularly polarised states. The absence of the state
with J3 = 0 is a consequence of the transversality condition (5.87), which
reduces the number of degrees of freedom by one unit.
Electromagnetic-Field Quantisation 75
Z
e
Hint = (p · A)
i=1
m i
where e and m are the electron charge and mass, while A is the radiation
field.
Using the dipole approximation, which amounts to replacing eik·x → 1
in the plane-wave expansion of the field A, evaluate the transition matrix
element associated with photon emission and absorption
where
|i = |A, nkr , |f = |B, nkr ± 1 .
In the above equations, nkr is the number of photons in the state of
momentum k and polarisation specified by the index r, whereas A and
B denote the quantum numbers of the atomic initial and final states,
respectively.
3. Obtain the commutation rules satisfied by the components of the electric
and magnetic fields (i = 1, 2, 3)
for x0 = y0 .
4. The basis of eigenstates of the electromagnetic Hamiltonian, specified
by the number of photons with a given wave number, k, and polarisa-
tion, r, provides a useful representation when the number of photons is
76 Relativistic Quantum Mechanics
small. The states produced by a classical light source, however, are best
described using coherent states.
Consider the coherent state associated with a single oscillation mode of
the electromagnetic field
∞ n
2 a†
|α = e−|α| √ |n
n=0 n!
a|α = α|α .
∂ μ Aμ = 0 .
Aμ → Aμ = Aμ − ∂μ Λ .
Electromagnetic-Field Quantisation 77
E 2 = p 2 + m2 . (6.4)
(2 + m2 )ψ = 0 , (6.5)
∂ρ
= ∇j , (6.6)
∂t
with
∂ψ ∂ψ ∗
ρ = ψ∗ −ψ (6.7)
∂t ∂t
78 DOI: 10.1201/9781003436263-6
This chapter has been made available under a CC BY NC license.
The Dirac Equation 79
and
j = −∇ (ψ ∗ ∇ψ − ψ∇ψ ∗ ) . (6.8)
However, the ρ which appears in equation (6.6) cannot be identified with
the probability density, in analogy with the similar quantity obtained from the
Schrödinger equation, because it does not have the required property of being
always positive-definite. To be convinced of this it is sufficient to substitute
i∂/∂t → E in (6.7). The result,
ρ = E|ψ|2 ,
shows that ρ can be either positive or negative, following from the fact that
the Klein–Gordon equation has solutions for the energy with both signs
E = ± p 2 + m2 ,
Note that in the non-relativistic limit E ≈ m > 0, and the familiar result
ρ ∝ |ψ|2 is recovered.
In addition, the presence of the second-time derivative conflicts with the
postulate of quantum mechanics stating that the wave function contains all
the information on the state of a physical system, which must therefore be
completely determined by its initial value.
Because of these problems the Klein–Gordon equation was initially aban-
doned, until Pauli and Weisskopf [7] suggested that the solution should be
interpreted as a quantum field, instead of the wave function of a particle.
Note that wave functions of the type (6.10) are encountered in non-relativistic
quantum mechanics. For example, the wave function of a particle of spin 12 is
a two-component vector.
Because the Hamiltonian is a hermitian operator, it must be true that
α = α† and β = β † . Moreover, from the requirement that (6.9) should be
compatible with the Klein–Gordon equation, or
(α · p + βm)(α · p + βm) = E 2 = p2 + m2 ,
it follows that:
{αi , β} = 0 (6.12)
αi2 2
=β =1. (6.13)
It is helpful to introduce a new set of matrices γ μ (μ = 0, 1, 2, 3) defined
as (i = 1, 2, 3)
γ 0 = β, γ i = βαi = γ 0 αi , (6.14)
which satisfies the anticommutation rules
{γ μ , γ ν } = 2g μν , (6.15)
(γ 0 )2 = 1, (γ i )2 = −1 , (6.16)
α i α j α j = αi = −αj αi αj
T r(αi ) = −T r(αj αi αj ) = −T r(αj αj αi ) = −T r(αi ) ,
thus
T r(αi ) = 0 . (6.20)
Using the same procedure it can also be shown that from (6.11) it follows that
T r(β) = 0 . (6.21)
M = M0 11 + Mi σi
with
1 0
11 = .
0 1
A matrix M that is independent of σi must have M0 = 0, but in this case
it obviously cannot anticommute with the σi . The smallest dimension which
the matrices αi and β can have is N = 4. It can easily be verified that the
4 × 4 matrices (written in terms of 2 × 2 blocks)
0 σi 1 0
αi = , β= , (6.24)
σi 0 0 −1
satisfy (6.11)–(6.13).
82 Relativistic Quantum Mechanics
{ ν } = S −1 γ μ SS −1 γ ν S + S −1 γ ν SS −1 γ μ S = S −1 {γ μ , γ ν } S = 2g μν .
γμ, γ
6.1.1 Spin
The Hamiltonian operator for the Dirac equation
H = α · p + βm (6.26)
L=x×p . (6.27)
[H, L3 ] = [αi pi , x1 p2 − p2 x1 ]
= α1 p2 [p1 , x1 ] − α2 p1 [p2 , x2 ]
= −i(α1 p2 − α2 p1 ) = 0 .
The constants of the motion associated with the invariance under rotation
of (6.9) are the components of the total angular momentum, defined as
1
J = L + Σ;
2
σ 0
Σ= , (6.28)
0 σ
which do commute with the Hamiltonian (6.26). To see this, we define the
antisymmetric tensor
i
σ μν = [γ μ , γ ν ] , (6.29)
2
whose components σ ij (i, j = 1, 2, 3) are
i j
i [σ , σ ] 0 σk 0
σ =−
ij
= ijk
= ijk Σk .
2 0 [σ i , σ j ] 0 σk
Thus we obtain
i i
Σ3 = (γ1 γ2 − γ2 γ1 ) = − (α1 α2 − α2 α1 ) .
2 2
The Dirac Equation 83
Comment. The appearance of spin explains why the wave function which
satisfies the Dirac equation must be a multidimensional vector. However for
spin 12 a two-component wave function was expected, while the minimum
dimension of the Dirac matrices is N = 4. The doubling of the components is
due to the necessity of the presence of an antiparticle, as we shall see later.
x = Λμ ν x ν .
μ
Note that we do not know a priori the form of S(Λ). The relativistic
invariance of the Dirac equation requires that it should be possible to determine
S(Λ) so that:
• the transformations are in accord with the combination rule (6.32),
• they lead to a ψ which satisfies the Dirac equation in O if ψ satisfies it in
O.
We now consider the Dirac equation in O:
λ ∂ λ ∂
0 = iγ − m ψ(x) = iγ − m S −1 (Λ)ψ (x ) =
∂xλ ∂xλ
μ
λ ∂x ∂
= iγ − m S −1 (Λ)ψ (x ) =
∂xλ ∂x μ
∂
= iγ Λ λ μ − m S −1 (Λ)ψ (x ).
λ μ
(6.33)
∂x
Multiplying by the matrix S(Λ), we obtain:
μ ∂
i
γ − m ψ (x ) = 0;
∂x μ
μ = Λμ ν S(Λ)γ ν S −1 (Λ).
γ (6.34)
Equation (6.34) agrees with the Dirac equation in system O if the matrices
are identical to γ μ , or if S(Λ) satisfies the relation:
γμ
Λμ ν = δ μ ν + μ ν ,
with
μ ν = g μα αν ;
αβ = −βα . (6.36)
Now we write
S =1+T , S −1 = 1 − T
with T infinitesimal:
1
T = αβ T αβ , (6.37)
2
and T αβ antisymmetric.
Substituting in equation (6.35) we obtain (to first order in )
Λμ ν γ ν = γ μ + μ ν γ ν = S −1 γ μ S = γ μ + (γ μ T − T γ μ ) .
The Dirac Equation 85
We note that γ5 is hermitian while relations similar to (6.17) hold for the
other matrices of the algebra
Γ = γ 0 Γ† γ 0 . (6.47)
Using the results of the previous paragraph one easily obtains transforma-
tion rules for the bilinear forms of the type
ψ † γ 0 Γψ = ψ̄Γψ , (6.48)
in terms of the adjoint spinor ψ̄, which transforms according to (6.45). The
covariant bilinear forms (6.48) perform an important role, because they have
definite transformation properties under Lorentz transformations. They are
the basic ingredients with which to construct observable quantities and in-
variant Lagrangian densities.
We consider, for example, the continuity equation (6.6) which is obtained
from the Dirac equation. It is easily shown that
ρ = ψ † ψ = ψ̄γ 0 ψ, j = ψ † αψ = ψ̄γψ .
The bilinear form ψ̄γ μ ψ is a 4-current associated with the particle de-
scribed by the Dirac equation and j 0 = ρ is the probability density, whose
volume integral is a conserved quantity.
We can now proceed to determine the transformation rules for the bilinear
forms.
• ψ̄ΓS ψ = ψ̄ψ transforms like a scalar, because
ψ̄ ψ = ψ̄S −1 Sψ = ψ̄ψ .
• ψ̄Γμν μν μ ν
T ψ = ψ̄σ ψ = ψ̄ i[γ , γ ]/2 ψ transforms like the elements of an
antisymmetric tensor, because
ψ̄ γ μ γ ν ψ = ψ̄S −1 γ μ SS −1 γ ν Sψ = Λμ α Λν β ψ̄γ α γ β ψ .
The Dirac Equation 87
ψ̄ γ 5 γ μ ψ = det(Λ) Λμ ν ψ̄γ 5 γ ν ψ ,
that is, like the components of a four-vector in the case of proper Lorentz
transformations and in the opposite way under parity transformations.
< B|U (Λ)† U (Λ)|A > = < B|A > or < B|U (Λ)† U (Λ)|A > = < B|A >∗ .
(6.51)
We can exclude the second case by continuity: when Λ tends towards the
unity transformation, |ΛA > must tend to |A >, so U (Λ) tends to unity,
satisfying the first condition and not the second. Finally
• Lorentz transformations are represented by quantum states of unitary
operators.
The density inside the integral is not invariant. Instead, the form of S(Λ)
implies that it should be the time component of a 4-current, conserved by the
Dirac equation:
∗
ψA (x, t)ψB (x, t) = ψ̄A (x, t)γ 0 ψB (x, t) = JA,B
0
;
μ
∂μ JA,B =0.
But as we saw in Section 3.5, the space integral of the time component
of a conserved 4-current is a relativistic invariant. Therefore, for any Lorentz
transformation:
< ΛA|ΛB > = < A|B >
The Dirac Equation 89
which is exactly the invariance condition for the scalar product. The non-
invariance of ψ ∗ ψ is in the proportion required to balance exactly the non-
invariance of the size of d3 x, so that the S(Λ) matrices impose on the physical
states a unitary representation of the Lorentz group. This result will be derived
explicitly in Section 7.4.
6.1.3 Boost
This expression denotes a specific Lorentz transformation, which corre-
sponds to the passage from the given frame of reference, O, to a frame O
moving along the positive x-axis at a velocity β with respect to O.
The transformation rule between the coordinates of O and O involves only
x and x1 , and is written:
0
where:
β = tanh θ. (6.53)
β = δθ
1,0 = −0,1 = δθ .
The matrix S(Λ), which determines the transformation between ψ(x) and
ψ (x ) is therefore, according to (6.39):
i
S(Λ) = 1 − αβ σ αβ =
4
iδθ 10
=1− σ . (6.54)
2
Using the definition:
i 1 0
σ 10 = [γ , γ ] = iγ 1 γ 0 = −iα1
2
90 Relativistic Quantum Mechanics
we find:
δθ
S(Λ) = 1 − α1 . (6.55)
2
Transformations with finite rapidity, θ = tanh−1 β, are obtained by com-
bining infinitesimal transformations, which corresponds to multiplying to-
gether the relevant matrices (6.55). We put:
θ
δθ = ; (N large)
N
and find:
N
θ θ
S(Λ) = 1 − α1 → e − 2 α1 . (6.56)
2N
n=∞ n
− θ2 α1 1 θ
e = − α1 =
n=0
n! 2
1 θ 2k
k=∞
k=∞
1
θ
2k+1
= − + α1 −
(2k)! 2 (2k + 1)! 2
k=0 k=0
θ θ
= cosh − α1 sinh (6.57)
2 2
and finally using the well-known hyperbolic trigonometric relationships:
θ cosh θ + 1 θ cosh θ − 1
cosh2 = ; sinh2 = (6.58)
2 2 2 2
to obtain:
θ 1 − tanh θ2 σ1
S(Λ) = cosh =
2 − tanh 2 σ1
θ
1
$ %
γ+1 1 − γ+1
βγ
σ1
= . (6.59)
2 − γ+1 σ1
βγ
1
(γ 0 E − m)u(p) = 0 , (6.63)
or, explicitly:
⎛ ⎞⎛ ⎞
E−m 0 0 0 u1
⎜ 0 E − m 0 0 ⎟ ⎜ u2 ⎟
⎜ ⎟⎜ ⎟
⎝ 0 0 −E − m 0 ⎠ ⎝ u3 ⎠ = 0 . (6.64)
0 0 0 −E − m u4
Equation (6.64) has eigenvalues E (1) = E (2) = m and E (3) = E (4) = −m,
whose corresponding eigenvectors are:
⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 0 0 0
⎜ 0 ⎟ ⎜ 1 ⎟ ⎜ 0 ⎟ ⎜ 0 ⎟
u =⎜
(1) ⎟ (2) ⎜ ⎟ (3) ⎜ ⎟ (4)
⎝ 0 ⎠ , u = ⎝ 0 ⎠ , u = ⎝ 1 ⎠ , u = ⎝ 0 ⎠ . (6.65)
⎜ ⎟
0 0 0 1
Like the Klein–Gordon equation, the Dirac equation also has negative en-
ergy, as well as positive energy, solutions. The two positive energy solutions
correspond to the two possible states of a particle with spin 12 . We will later
discuss the meaning of the solutions with negative energy.
In the general case of p = 0 we write the 4-spinor u(r) (p) in terms of two
two-component spinors uA and uB
uA
u(r) (p) = . (6.66)
uB
E 2 − m2 = (σ · p)2 = p2 . (6.68)
92 Relativistic Quantum Mechanics
or:
vr (p) = rs u(−)
s (−p), (r, s = 1, 2) (6.74)
where rs is the completely antisymmetric two-index tensor1 and the sum over
repeated indices is understood. Furthermore we put
ur (p) = u(+)
r (p) . (6.75)
It immediately follows from the definitions that ur and vr satisfy the equa-
tions:
(p/ − m)ur = 0 , (p/ + m)vr = 0 . (6.76)
The normalisation of the 4-spinors is defined by writing the solution of
(6.62) in the form
⎧
1/2 ⎨ ur (p)e−ipx (positive energy)
m
ψ(x) = N × (6.77)
V Ep ⎩
vr (p)eipx (negative energy)
and requiring that
d3 x ψ † (x)ψ(x) = 1 , (6.78)
which gives
Ep
ur † (p)ur (p) = vr † (p)vr (p) = . (6.79)
m
In this way the result N = [(Ep + m)/2m]1/2 is obtained.
With this choice of normalisation, the orthonormality relations for the
4-spinors are:
Ep
ur † (p)us (p) = vr † (p)vs (p) = δrs
m
†
ur † (p)vs (−p) = u(+)
r (p)u(−)
s (p) = 0 (6.80)
and, with simple formal manipulation2 we obtain from (6.76) the relations
where (ur )α (p) is the α component of the 4-spinor. Equation (6.83) is easily
obtained from the 4-spinor definition, which implies
+ p/ + m
(ur )α (p)(ūr )β (p) = Λp αβ = (6.84)
r
2m αβ
p/ − m
− (vr )α (p)(v̄r )β (p) = Λ−
p αβ = − . (6.85)
r
2m αβ
The operators Λ+ −
p and Λp are projectors of, respectively, the positive and
negative energy states, i.e.
Λ+ −
p u r = ur , Λ p v r = v r , (6.86)
Λ+ −
p v r = Λp u r = 0 (6.87)
2 The first line of (6.81) is obtained by multiplying (6.76) on the left by ū(p)γ μ and
summing with the adjoint of the same equation multiplied from the right by γ μ u(p); the
second is the outcome of a similar rearrangement of the equation for v(p) in (6.76).
94 Relativistic Quantum Mechanics
For later use, we give here the expressions for normalisation of the wave
function and the expectation values of the energy and momentum in terms of
the amplitudes of the normal modes of oscillation which appear in (6.88).
To carry out the calculations, we treat the components of the spinors u
and v as commuting numbers, but avoid exchanging the order in which the
coefficients a and b appear. With the help of the orthogonality conditions
(6.80) we find:
N = d3 x ψ † (x, t)ψ(x, t) =
= [ar (p)(ar (p))∗ + (br (p))∗ br (p)]; (6.89)
p,r
<E>= d3 x ψ † (x, t)[−iα · ∇ + βm]ψ(x, t) =
= E(p)[ar (p)(ar (p))∗ − (br (p))∗ br (p)]; (6.90)
p,r
<P >= d3 x ψ † (x, t)(−i∇)ψ(x, t) =
= p[ar (p)(ar (p))∗ − (br (p))∗ br (p)]. (6.91)
p,r
From (6.90) we see that the normalisation factor m/EV has been chosen
to ensure that the energy of the oscillation mode p, r is equal to E(p) for unit
oscillation amplitude.
for the positive and negative energy solutions. Therefore, for the first we find
(compare with the definitions (6.65)–(6.72)):
pσ1
E+m 1 χr
u(+)
r (p) = u(p) = pσ1
E+m =
2m E+m 1 0
χr
= pσ1 (r = 1, 2) , (6.96)
E+m χr
Note. As equation (6.96) shows, for small velocities the lower components of
u(p) are of the order of β = v/c = p/m with respect to the upper components.
In this limit, upper and lower components are referred to as the big and small
components of u(p), respectively. The roles are exchanged for v(p), as shown
by equation (6.97).
i∂ μ → i∂ μ + eAμ , (6.98)
or
∂
β i + eφ − βα · (p + eA) − m ψ = 0 . (6.100)
∂t
96 Relativistic Quantum Mechanics
and thus
σ · (p + eA)
η≈ ϕ. (6.110)
2m
The Dirac Equation 97
a photon obeys the selection rule ΔLz = ±1, 0. For atoms in which Lz is a
good quantum number, in the presence of a magnetic field the spectral line
splits into three components separated by ±eB/2m, 0, which coincides exactly
with the Larmor frequency (5.73). However, in complex atoms, Lz and Sz are
not individually diagonal and the difference between the levels involved in
the transition is also a function of the gyromagnetic spin ratio [9]. Equation
(6.113) correctly describes the behaviour of the electron; one finds g = 2 as
originally hypothesised by Goudsmit and Uhlenbeck to explain the anomalous
Zeeman effect.
The result (6.114) constitutes an extraordinary success for Dirac’s the-
ory, in which the spin-magnetic field term, which in non-relativistic quantum
mechanics must be added ad hoc to the Schrödinger equation, emerges in
a natural way from the quantum minimal substitution applied to the Dirac
equation of the free particle.
In general:
1
[αi , αj ] = iijk Σk (6.120)
2
from which, using the definition of pr , equation (6.117), and the orbital angular
momentum, L = x × p, we can rewrite equation (6.118) as:
i i
αr (α · p) = pr + (L · Σ + 1) = pr + β k (6.121)
r r
where we have defined the hermitian operator k:
k = β(L · Σ + 1) . (6.122)
The overall Hamiltonian for the electron in the Coulomb field of the proton
is written:
α
H = αr pr + iαr β(r)−1 k + βm − (6.126)
r
with α 1/137 the fine structure constant.
1
Spinors with Definite Angular Momentum. For a spin particle, a given
2
value of the total angular momentum can be obtained starting from two values
of the orbital angular momentum:
Even if Lz is not conserved, see equation (6.28), the value of the total
orbital angular momentum is a constant of the motion, since it is related to
the behaviour of the wave function under parity. For inversion of the axes,
the state with total orbital angular momentum l has a factor (−1)l , which
assumes opposite values for the orbital momenta in (6.127).
We take the value of l which corresponds to the plus sign in (6.127). The
two-dimensional spinor with angular momentum j, jz , is easily constructed
combining the spherical harmonics Ylm and Ylm+1 with the up and down
spinors:
l−m
a=
2l + 1
l+m+1
b=− . (6.129)
2l + 1
Similarly, we define the spinor corresponding to the minus sign in (6.127):
which gives:
l+m+1
c=
2l + 1
l−m
d= . (6.131)
2l + 1
3 Known as Clebsch–Gordan coefficients; see [13].
The Dirac Equation 101
where r−1 F and r−1 G are radial wave functions. As well as j and jz , we can
characterise the spinor u with the value l of the orbital momentum of the
upper component, which in the hydrogen atom is dominant, knowing that the
lower component will be related to the orbital momentum l given by (6.127),
and write simply:
1
uj,l,jz = r F (r)χ(l) . (6.133)
1
r G(r)χ(l )
Radial Equations. The two spinors, χ(±) are connected by the operator
σr = r−1 σ · x. The relation follows from the fact that the operator σr is
rotationally invariant, so cannot change the total angular momentum, and
has negative parity, therefore must exchange l and l :
while:
1 (+)
1 (+)
β r F (r)χ = r F (r)χ (6.138)
1 (−) 1
r G(r)χ − r G(r)χ(−)
Therefore, the spinors (6.132) are not eigenstates of the spin-orbit operator,
Σ · L, but they are eigenstates of k, owing to the multiplication by β, which
makes the eigenvalue of χ(l ) equal to that of χ(l). We also note the relations
used in the literature:
k(j) = −l, for l = j + 1/2; k(j) = l + 1, for l = j − 1/2. (6.145)
The set on the right-hand side has determinant zero, therefore we can
eliminate aν−1 and bν−1 and obtain the relation:
General ν. For general values of the recurrence relations (B.27) give rise
to the series (6.150) starting from a0 and b0 . For ν → ∞, we obtain:
making use of (6.154). Summing the series, we obtain (the same holds for G):
which is not acceptable; the series must terminate at a certain value ν = ν̄.
Setting ν = ν̄ + 1 in (B.27) and making the first term vanish, we find:
−
bν̄ = aν̄ (6.157)
1
and, substituting in (6.154):
m2
E2 = α2
(6.159)
1+ (s+ν̄)2
and, finally:
m
E= . (6.160)
√ α2
1+
(ν̄+ (j+1/2)2 −α2 )2
b0 −α
= (6.161)
a0 s + k(j)
The Dirac Equation 105
1
Table 6.1 Levels of the hydrogen atom in the Dirac equation for n = ν̄ + j + ≤ 4.
2
The energies increase from the bottom towards the top and from left to right. Levels
with the same values of j and ν̄ are degenerate. In the last column, the number of
states in the row, equal to 2n2 , is given (see Eq. (B.35)).
ν̄ 3 2 1 0 number
of states
1
n = ν̄ + j + 2
4 S1/2 , P1/2 P3/2 , D3/2 D5/2 , F5/2 F7/2 2 · 16
3 −− S1/2 , P1/2 P3/2 , D3/2 D5/2 2·9
2 −− −− S1/2 , P1/2 P3/2 2·4
1 −− −− −− S1/2 2·1
a0 + 1 b0 = 0. (6.162)
The latter equation clearly requires that a0 and b0 should have opposite
signs, which can be obtained from the preceding equation only if:
T r(γ μ ) = 0; (6.165)
μ ν μν
T r(γ γ ) = 4g . (6.166)
γ5 = iγ 0 γ 1 γ 2 γ 3 . (6.167)
Odd Number of γ Matrices. The rule (6.165) implies that the γ matri-
ces have an even number of dimensions and an equal number of +1 and −1
eigenvalues. Its generalisation is that
T r (γ μ γ ν γ ρ γ σ ) .
We can advance the first matrix by one place using the anticommutation
rule:
T r (γ μ γ ν γ ρ γ σ ) = −T r (γ ν γ μ γ ρ γ σ ) + 2g μν T r (γ ρ γ σ ) .
6 We recall that the completely antisymmetric Levi–Civita tensor is defined by
0123 =
+1 = −0123 , equation (3.40).
The Dirac Equation 107
ψ → S(Λ)ψ ,
with μν
i
S[Λ(ω)] = e− 4 ωμν σ
and
i μ ν
σ μν = [γ , γ ] .
2
The non-vanishing components of the antisymmetric tensor ωμν corre-
spond to a boost along the direction of the i-axis (ω0i , with i = 1, 2, 3)
and to rotations around the k-axis, perpendicular to the (i, j)-plane
(ωij ).
Sect. 6.3
QUANTISATION OF
THE DIRAC FIELD
Figure 7.1 A cloud chamber photograph by Anderson (C. D. Anderson, Phys. Rev.
44 (1933), 406 [8]) with one of the first images of a positron. The positron travels
from bottom to top, as shown by the fact that the track has a lower curvature in
the upper part of the trajectory, owing to the loss of energy as the positron passes
through the layer of lead, whose side view is visible in the middle of the chamber.
From this information, it can be deduced that the particle has a positive charge,
while the mass is consistent with the mass of an electron.
the order of the photon momentum, k = 1/λ, such that the uncertainty in the
position and the momentum satisfy the uncertainty relation:
1
Δx ∼ λ = . (7.3)
Δp
When the photon energy exceeds the value ω = k ∼ 2m a new phenomenon
occurs: the creation of an electron-positron pair. In the language of the Dirac
sea, an electron in a negative energy state absorbs the photon and passes to a
positive energy state leaving behind a hole. Now we have two electrons in the
state and it is no longer possible to speak of the electron position.
In order for the locations of the two electrons to be confused, they must be
separated by a distance of the order of the photon wavelength. Therefore the
effect of non-locality begins when we arrive at a precision in the coordinate
measurement of the electron of the order of:
1
Δx ∼ λ = 4.0 · 10−11 cm. (7.4)
m
The length defined by (7.4) is known as the Compton wavelength, λC . For
the proton, λC ∼ 0.2 fermi =0.2 · 10−13 cm.
The same result is found if we try to construct ever smaller wave packets
from the solutions to the Dirac equation. Using the positive energy solutions,
which obviously do not form a complete set, we cannot obtain dimensions for
the packet below λC .
The conclusion is that the representation of x, and therefore the wave
function, simply does not exist for relativistic particles. Even for the electron,
however, there is still a representation of the momentum which is sufficient for
practical purposes. The way in which these purposes are satisfied is provided
by second quantisation.
According to the second quantisation, more modernly known by the name
of relativistic quantum field theory, the object which satisfies the Dirac equa-
tion is a quantised field, mathematically described by a linear operator which
is a function of the location in space-time, ψ(x, t).
The field is a dynamic quantum variable in the Heisenberg representation.
As an operator, the field acts on the space of physical states which, also in
the Heisenberg representation, are constant in time.
The difficulty connected with solutions with negative energy (now, more
properly, negative frequency) is solved because the time dependence of an
operator in the Heisenberg representation is an exponential whose sign is un-
defined. The sign can be positive or negative, according to the energy difference
of the states between which the operator induces transitions.
In the second quantisation, the quantities referred to in (6.89), (6.90) and
(6.91) are also linear operators, rather than average values represented by
complex numbers. As we will see in the following section, the first of these
quantities must be identified with the charge associated with the particles,
Quantisation of the Dirac Field 113
which must be opposite for the particles destroyed by ψ (+) to those created
by ψ (−) , as happens for the electron and positron.
It is reasonable to identify (6.90) and (6.91) with the Hamiltonian and
the total momentum of the field. The operator structure of the creation and
destruction operators and the further physical characteristics of the associated
particles are obtained from the requirement that the Hamiltonian has a lower
bounded spectrum (stability). The stability condition leads unambiguously to
Fermi–Dirac statistics for the particles created or destroyed by the components
of the Dirac field.
The field in x can be obtained from the field at the origin by a space-time
translation according to the equation (see Appendix A):
μ μ
ψ(x) = eiP μ x ψ(0)e−iP μ x (7.8)
or:
We consider the second term in the Hamiltonian. The operator bb† is semi-
positive definite1 . This prevents the assignment of commutation relations to
the creation and destruction operators equal to those of the quantum harmonic
oscillator. In this case, for fixed p, we have:
where N (p) is the occupation number of the mode p. The right-hand side
can take arbitrarily negative values and the resulting Hamiltonian will be
unbounded from below.
To obtain a consistent theory, the operator b† b must be limited, as happens
for the Fermi oscillator2 which satisfies anticommutation rules. In this case we
obtain:
−bb† = +b† b − 1 = +N (p) − 1 (7.16)
which has eigenvalues 0, −1 (−1 for the vacuum). Using equation (7.16) for
all the values of p and r, we obtain:
H= Ep Nr (p) + N̄r (p) , (7.17)
pr
where Nr (p) = a†r (p)ar (p) and N̄r (p) = b†r (p)br (p) are the number operators
for the particles and antiparticles. In going from (7.13) to (7.17) we have
omitted a constant term which corresponds to the zero point energy of the
oscillators. Physically, this is equivalent to choosing the value 0|H|0 as the
zero of the energy scale.
1 Actually,
< A|bb† |A >= n < A|b|n >< n|b† |A >= n | < A|b|n > |2 ≥ 0, whatever
the state |A >.
2 The Fermi oscillator is defined by the anticommutation rule {b, b} = {b† , b† } =
We can therefore see that the particles created by b† , while they have
mechanical properties, mass and spin, equal to those created by a† , have
opposite charge.
with:
i α β
σ αβ =
[γ , γ ] (7.25)
2
Moreover, the Lagrangian (7.21) possesses a global symmetry, associated
with the transformation of the fields by a constant phase
ψ(x) → eiα ψ(x); ψ̄ → e−iα ψ̄(x) (7.26)
From (7.22), (7.24) and (7.26) we immediately find:
• the canonical energy-momentum tensor:
T μ,ν = iψ̄γ μ ∂ ν ψ − g μν L , (7.27)
Using the anticommutation rule for the γ matrices, we can make the op-
erators ∂μ γ μ operate on the fields, where they give ±im. It is easy to see that
the terms proportional to m cancel and we are left with the results of the
anticommutations, from which:
1 i i
(S1 −S2 −S3 )αβ = ψ̄(∂ α γ β −3∂ β γ α )ψ− [(∂ α ψ̄)γ β ψ+(∂ β ψ̄)γ α ψ] . (7.35)
2 4 4
Summing this result with (7.27) leads to (7.29).
or
0
{ψα (x, t), ψ̄β (y, t)} = γαβ δ (3) (x − y) . (7.37)
to find:
∂ψ
i = [ψ(x), H] = (−i α · ∇ + βm)ψ . (7.39)
∂t
The Heisenberg equations of motion reproduce the Dirac equation, as they
should. We can also calculate the commutator of the field with the conserved
charge, with the result:
[ψ(x), Q] = +ψ(x) . (7.40)
The Space of States. Proceeding in a similar way to what was done for
the Klein–Gordon equation, we can derive from (6.88) the as (p) and bs (p)
118 Relativistic Quantum Mechanics
{as (p), a†s (p )} = {bs (p), b†s (p )} = δs,s δp,p (7.42)
We see that the Hilbert space of the theory is that of a set of Fermi
oscillators. Explicitly, it consists of:
• the state |0 >, the vacuum, which vanishes following the application of
the destruction operators:
• the states with given occupation numbers, which are obtained by apply-
ing the creation operators a† and b† to the vacuum:
|n1 , n2 , . . . ; m1 , m2 , · · · >=
= [a†s1 (p1 )]n1 [a†s2 (p2 )]n2 . . . [b†r1 (q1 )]m1 [b†r2 (q2 )]m2 . . . |0 > .
(7.45)
As can be seen from the expression for the momentum, the a and b oper-
ators destroy relativistic particles of mass m and spin 12 . The quantum states
are those of a perfect gas made up of fermions of two different types, with
equal mechanical properties.
Quantisation of the Dirac Field 119
< q |[ψ, Q]|q >= (q − q ) < q |ψ|q >= + < q |ψ|q >
or : q = q − 1. (7.46)
In both cases, the action of the field reduces the conserved charge by one
unit, therefore:
where ψP,N denote the Dirac fields associated with the proton and neutron.
The value +1 is conventionally assigned to the baryon number of the proton
and neutron (and thus −1 to the antiproton and antineutron), while the lighter
particles, electrons and neutrinos, have zero baryon numbers. Many other
particles are now known which decay into protons or neutrons and therefore
also have non-zero baryon number.
3 The same result is obtained for the Klein–Gordon field starting from (4.46).
4 Electric charge conservation alone would allow the decay of the proton into e positron
and a neutral meson: P → e+ π 0 . The present non-observation of this decay leads to the
limit pf the proton’s lifetime: τP > 1.6 1034 years, see [16].
120 Relativistic Quantum Mechanics
p = Λ μ ν p ν .
μ
(7.52)
U (Λ1 )U (Λ2 ) = {U (L(Λ1 q))U (R1 )U (L−1 (q))}{U (L(Λ2 p))U (R2 )U (L−1 (p))
therefore,
if : Λ1 → R1 ; Λ2 → R2 ;
then : Λ1 Λ2 → R1 R2
where n3 is the unit vector along the z-axis. The little group is the one-
dimensional group of rotations around the z-axis.
We leave to the reader the task of constructing the corresponding repre-
sentations, to show the result already given in Section 2.2.
7.5 MICROCAUSALITY
As we saw in Section 1.3, given an event x, space-time divides into distinct
regions regarding the causal connection of different events y with event x. The
region located outside the two light cones originating from x represents the
absolute present of x. These events are characterised by the fact that the
interval y − x is spacelike, or (y − x)2 < 0. For brevity, we write y ∼ x.
Measurements carried out on two observables localised at x and y respec-
tively cannot influence one another when x ∼ y, because this implies propa-
gation of signals faster than the limit set by the speed of light in a vacuum.
From quantum mechanical principles it follows that, under these conditions,
the corresponding operators commute among themselves:
Given that Σ3 has eigenvalues ±1, the equation simply expresses the fact
that the field is associated with a spin 12 particle. However, for θ = 2π, when
x = x, we find:
ψ = −ψ. (7.63)
This result is obviously absurd for an observable quantity, which should
return to itself after a 2π rotation.
However, if ψ cannot be an observable, the physical quantities constructed
from ψ, such as the energy density or charge density, must be. On the basis of
equation (7.63), we can conclude that homogeneous functions of even powers
of the field are good candidates to be observables.
The same conclusion is reached by starting from the canonical anticom-
mutation rules. We consider as an example the commutators of two charge
densities: 0
j (x), j 0 (y) for x ∼ y. (7.64)
5 A complex scalar field is not observable, not being hermitian, but it’s real and imaginary
Repeatedly applying the identify (7.38), and similarly for the commutators,
we find:
a → a; a∗ → a†
) *
a, a† = 1 (7.67)
we find the absurd result that the Hamiltonian is a multiple of the identity
operator so that every observable has to be a constant of the motion. The
same holds for the electromagnetic field and the photons, cf. equation (5.112).
For particles of spin j ≤ 1 we can therefore state the spin-statistics theo-
rem:
• identical particles with integer spin obey Bose–Einstein statistics, while
those with half-integer spin conform to Fermi–Dirac statistics.
We can generalise the theorem to assemblies of arbitrary spin composed
from spin 12 particles. In fact, the particles which we know with spin greater
than 1 are all of this type:
• atomic nuclei, with mass number A and atomic number Z, composed of
Z protons and A − Z neutrons,
• atoms, composed of Z electrons and a nucleus with mass number A; in
all Z + A spin 12 particles,
• subnuclear particles, classified as baryons, made of three quarks which
are fundamental spin 12 particles, and mesons, composed of a quark-
antiquark pair.
Quantisation of the Dirac Field 125
The effective field which creates and destroys these assemblies (for exam-
ple the iron nucleus) is obtained starting from ψ N , if ψ is the field of the
constituent.
After rotations by 360◦ , each field ψ gains a factor −1, or:
while under the exchange of constituents of two identical assemblies, the state
has a phase factor:
Therefore the assemblies take a sign appropriate for either B–E or F–D
statistics, according to whether N is even or odd, or whether j is integer or
half-integer, in agreement with the theorem.
In a relativistic theory, a state with a definite j value can in general be
a superposition of one state with N constituents plus an indefinite number
of fermion–antifermion pairs. This corresponds to adding to Ψ components of
the type ψ N (ψψ † )m , with m variable. However, because ψψ † corresponds to
integer angular momentum and the pairs are equivalent to 2m fermions, the
result does not change. The same holds if we introduce the expression for Ψ
the derivatives of ψ or ψ † . This corresponds to introducing orbital angular
momenta which, having integer eigenvalues, do not change the integer or half-
integer property of j. Finally, the same result arises if, as in nuclei or atoms, the
constituents are groups of different kinds of fermions; each fermion contributes
a factor −1 to (7.68) and the same factor to (7.69).
prove that:
{as (p), a†s (p )} = {bs (p), b†s (p )} = δs,s δp,p
LD = ψ̄(iγμ ∂ μ − m)ψ .
Sect. 7.5
1. Consider the bilinear densities
JA = ψ̄(x)ΓA ψ(x)
where ΓA , A = 1, · · · 16 is one of the Dirac matrices introduced in
Sect. 6.1.2.
Prove the formula for the equal time commutator:
[JA (x, 0), JB (y, 0)] = δ (3) (x − y)ψ̄(y, 0) γ 0 ΓA , γ 0 ΓB ψ(y, 0) .
This shows, in particular, that the time component of the conserved
current Jμ has vanishing equal time commutators with all the current’s
components, a fact at the basis of the so-called Ward identities, see the
problems in Chapter 14.
CHAPTER 8
FREE FIELD
PROPAGATORS
gives the quantum amplitude for the simplest observable process involving a
quantised field:
(i) creation of a particle by a source located at y and corresponding ab-
sorption at x, if x0 > y 0 , or
(ii) creation of an antiparticle by a source located at x and corresponding
absorption at y, if y 0 > x0 .
The function (8.2) is known as the propagator of the corresponding field
and its exact form depends on the mass and the spin of the quanta associated
with the field.
In view of the anticommutation relations satisfied by the Dirac field, the
time-ordered product of fermion fields must be antisymmetric:
+
ψ(x)ψ̄(y) x0 − y 0 > 0
T {ψ(x)ψ̄(y)} = , (8.3)
ψ̄(y)ψ(x) x0 − y 0 < 0
When the particle associated with the field has a charge, as happens if
the field is complex, the charge flows in both cases from y towards x. The
propagator can be represented by a line directed from y to x.
iΔ(+) (x) =< 0|[φ(+) (x), (φ† )(−) (0)]|0 >; (8.6)
iΔ(−) (x) =< 0|[φ(−) (x), (φ† )(+) (0)]|0 > . (8.7)
We can quickly convince ourselves that these two functions are exactly the
solutions of the homogeneous Klein–Gordon equation which we introduced,
with the same name, in Section 4.2, equations (4.26) and (4.27). For exam-
ple, starting from the expansion (4.47) and taking account of the canonical
commutators, we find:
1
iΔ(+) (x) =< 0|[φ(+) (x), (φ† )(−) (0)]|0 >= e−i(kx) =
2ω(k)V
k
1 d3 k −i(kx)
= e
(2π)3 2ω(k)
which agrees with (4.26).
For the propagator, we therefore find:
which agrees with the solution of the inhomogeneous equation with the Feyn-
man boundary conditions referred to in equation (4.33) with the same name:
130 Relativistic Quantum Mechanics
Figure 8.1 Following the i prescription of Feynman, the poles of the Green’s func-
tion move in the complex plane as shown in the figure. The path of integration to
obtain iDF , equation (8.10), is now the real axis. The figure shows the way we can
close the path to obtaining iDF = iΔ(+) for x0 > 0. For x0 < 0 the path should be
closed in the upper half-plane.
• The Feynman boundary conditions are the correct ones to create the
quantum propagator.
In summary, if the scalar field satisfies the equation:
(−2 − μ2 )φ = 0 (8.9)
the Feynman propagator is given, in terms of its Fourier transform, by
† d4 k i
iDF (x) =< 0| T {φ(x)φ (0)} |0 >= e−i(kx) . (8.10)
(2π)4 k 2 − μ2 + i
As described in Section 4.2 the integration in the complex plane of k 0 is
carried out along the real axis and the prescription for i in (8.10) moves the
poles of the integrand as shown in Fig. 8.1.
Unlike the commutators of the fields, which satisfy the microcausality con-
dition, the propagator does not vanish for spatial separations. To be exact:
iDF (x, 0) = limt→0+ iΔ(+) (x, t) = limt→0− iΔ(−) (x, t) =
1 1
= d3 k eik·x = 0. (8.11)
(2π)3 2 k + μ2
2
For large values of r = |x| the propagator decreases exponentially (for a de-
tailed study of the functions Δ(±) (x) and DF (x), see Bogoliubov–Shirkov [2]):
1 2 μ1/2 −μr
iDF (r, 0) e (8.12)
8π 2 π r3/2
Free Field Propagators 131
∂t T {φ(x)φ† (0)} = T {∂t φ(x)φ† (0)} + δ(t)[φ(x, 0), φ† (0)] = T {∂t φ(x)φ† (0)}
∂t2 T {φ(x)φ† (0)} = T {∂t2 φ(x)φ† (0)} + δ(t)[∂t φ(x, 0), φ† (0)] =
= T {∂t2 φ(x)φ† (0)} − iδ 4 (x) (8.17)
obtained with two different sequences of limits; (8.13) is obtained in the limit t → 0 followed
by μ → ∞, while in (8.14) μ → ∞ for a general x; the two limits do not commute.
132 Relativistic Quantum Mechanics
We separate into positive and negative frequency parts in the fields and
omit terms with operators which vanish when applied to the vacuum. We
obtain:
Using the 4-spinor completeness relations, (6.84) and (6.85), we can rewrite
equations (8.20) and (8.21) in the form
(+) 1 m p/ + m −ip(x )
iS (x) = e =
V ps Ep 2m
1 d3 p −i(px)
= (i∂/ + m) e =
(2π)3 2E(k)
= (i∂/ + m)[iΔ(+) (x)]. (8.22)
Similarly:
1 m p/ − m ip(x)
iS (−) (x) = e
V ps Ep 2m
1 d3 p +i(px)
= −(i∂/ + m) 3
e =
(2π) 2E(k)
= (i∂/ + m)[iΔ(−) (x)] (8.23)
Taking account of the minus sign in the time-ordered product, the Feyn-
man propagator (8.19) is then (with the integration path in the plane p0 of
Fig. 8.1):
iSF (x) = θ(x0 )iS (+) (x) − θ(−x0 )iS (−) (x) =
1 i
= (i∂/ + m)[ d4 p 2 e−i(px) ] =
(2π)4 p − m2 + i
1 i(p/ + m) −i(px)
= d4 p 2 e . (8.24)
(2π)4 p − m2 + i
Sometimes, using the relation:
(see Chapter 5), would seem to indicate that the function DFμν (x) represents
the propagation of virtual photons in four different polarisation states, de-
scribed by the vectors λ . In addition to the two transverse polarisation states
which describe real photons, there is one longitudinal polarisation state, de-
scribed by a vector parallel to q, and one state of polarisation in time, de-
scribed by the timelike vector η μ ; cf. equation (5.24).
However, by carrying out the decomposition (5.27), it can easily be shown
that only photons in transverse states of polarisation contribute to the pole
at q 2 → 0, so only those states are observable at large distances from the
interaction vertex.
The role of the other terms in (8.29) becomes clear if we note that in the
calculation of the probability amplitudes of physical processes, the electro-
magnetic field propagator always appears saturated by the electromagnetic
current jμ , in the form jμ (−q)DFμν (q)jν (q). From the current conservation
condition, q μ jμ = 0, it follows that the terms proportional to q μ or q ν in
DFμν (x − y) gives zero contribution to the physical amplitudes.
134 Relativistic Quantum Mechanics
g μν 2 → −g μν q 2 = K μν (q) (8.31)
specified by a parameter ξ that can take arbitrary values. DFμν (q) is obtained
by inverting the operator
μν 2 1
Kμν = −g q − − 1 qμ qν . (8.34)
ξ
1 qμ qν
DFμν (q) = 2
−g μν + (1 − ξ) 2 . (8.36)
q q
Free Field Propagators 135
Sect. 8.4
1. Starting from the Lagrangian:
1 1
L = − Fμν F μν − (∂μ Aμ )2 , F μν = ∂ ν Aμ − ∂ μ Aν
4 2ξ
INTERACTIONS
The theory of the free field describes an unchangeable world in which the
energy and momentum of every particle of the system are conserved separately.
The variety of phenomena which we observe requires instead some form
of interaction between the fields. In this case, as we saw in the classical limit,
Section 5.3, particles can exchange energy and momentum giving rise to scat-
tering processes or to the emission and absorption of light; the Sun can shine,
the sky can be blue and our eyes can perceive the external world via photons
absorbed by the retina.
In a relativistic theory, not only photons, but also particles associated with
matter, like electrons, protons and others, can be created or annihilated. For
a system isolated from the rest of the world, the elementary processes must
respect the conservation laws imposed by the symmetry of the system: en-
ergy, momentum, angular momentum as well as possible internal conserved
charges; for example, electric charge. This requires that the interaction La-
grangian, which we add to the free field Lagrangian should be invariant un-
der the transformations of special relativity: the Poincaré group, comprised of
translations in space-time and proper Lorentz transformations, L↑+ .
Invariance under the Poincaré group still permits a wide variety of forms
for the interactions, for example between the field of the electron and the
electromagnetic field. In principle, we should identify the correct interaction by
an iterative comparison between prediction and experiment (trial and error).
One starts from a type of interaction which explains at least part of the
initial data and then the theory is extended to other processes, successively
comparing predictions with the experimental data. When new data are found
to be in contradiction with the theory, it is falsified, in Popper’s terminology,
and nothing remains but to modify the interaction to bring it into agreement
with the complete set of old and new data1 .
1 Following Popper, we can say that theories are never confirmed, in that every set of data
inevitably has ranges of errors which make it compatible with very many, even an infinite
number of, theories for the interaction. However, a theory can be eliminated when the data
falsify it in favour of a theory in agreement with the overall data set. For example, classical
In this process of trial and error, heuristic a priori principles, such as the
requirement to lead to the classical theory in the limit of large systems or the
presence of certain further symmetries, provide a powerful aid to restrict the
form of the interaction, and therefore the choice of discriminating experiments,
naturally keeping in mind the necessity to give greater weight to the scrutiny
by experimental facts compared to heuristic criteria.
A result of great significance from the physics of the last century has been
the assignment of observed processes in the subatomic and subnuclear world
to the action of three different categories of interaction: electromagnetic, weak
and strong, as already pointed out in Section 5.3. The form of these interac-
tions, in terms of fundamental fields, is greatly restricted by symmetry princi-
ples which are the extension of the gauge symmetry encountered in the theory
of the electromagnetic field, Section 5.1, and of the phase transformations of
Sections 4.3 and 7.3.
In what follows, we will derive the form of the electromagnetic interac-
tion for fundamental spinor particles, such as the electron and the muon, the
Fermi interaction is suitable for describing β decay of the neutron and discuss
qualitatively several aspects of nuclear interactions, above all to point out
the difficulty of a description in terms of fields associated with the observed
nuclear particles, nucleons and pions.
We delay to later volumes [13, 14] a deeper treatment of the interactions
in terms of fundamental constituents (quark, leptons and gauge fields).
i∂ μ → i∂ μ − qAμ . (9.2)
D μ ψ = (∂ μ − ieAμ )ψ (9.5)
QED for the Charged Leptons. Equation (9.10) provides a strong argu-
ment for limiting the electromagnetic interaction of the electron purely to the
minimal substitution, which already well describes the magnetic moment of
the electron. The result is confirmed by the calculation of higher order cor-
rections to the quantity g − 2, first carried out by Schwinger, which shows
140 Relativistic Quantum Mechanics
that these corrections reproduce with high precision the small experimentally
observed deviations from the Dirac prediction of g = 2, cf. [14].
The same conclusion holds for the muon, μ, which has spin 12 and a mass
about 200 times that of the electron. The muon appears in all respects a heavy
version of the electron. The magnetic moment of the muon is known exper-
imentally with great precision and the deviations from the Dirac prediction
g = 2 are well described by higher-order corrections.
In 1976, another charged spin 12 lepton similar to the electron and muon,
the τ particle, was discovered.
The electron, μ, τ and the corresponding neutrinos are classified as lep-
tons, a family of particles not sensitive to nuclear forces. The most convincing
hypothesis in agreement with our experimental knowledge is to describe the
electromagnetic interactions of the three charged leptons with only the mini-
mal substitution, and therefore with the Lagrangian:
1
LQED = − Fμν F μν + ψ̄e (iD/ − me )ψe + ψ̄μ (iD/ − mμ )ψμ + ψ̄τ (iD/ − mτ )ψτ
4
λ
= Le.m. + L0e + L0μ + L0τ − eAλ Jlept ;
λ
Jlept = −{ψ̄e γ λ ψe + ψ̄μ γ λ ψμ + ψ̄τ γ λ ψτ } (9.11)
Table 9.1 Electromagnetic properties of the charged leptons, from the Particle Data
Group [?]. Numbers in parentheses denote the error on the last digit of each quantity.
m (MeV) g
e 0.510998902(21) 2.002319304374(4)
μ 105.658357(5) 2.0023318320(6)
τ 1776.99(28) 2.000(58)
Interactions 141
μ μ
with Jlept and Jnucl given by (9.11) and (9.16).
Today we know many particles in addition to the proton and neutron which
are sensitive to strong interactions, collectively called hadrons, cf. Section 9.3.
If we want to describe each of them with a quantised field, we must add their
free Lagrangian to L0,tot , and include their contribution to the total electro-
magnetic current. The Lagrangian (9.17) keeps its form with the extension of
μ
Jnucl to the overall hadron current.
A∗ → A + γ. (9.20)
In the case of the reaction (9.19), Fermi assumed that it was induced by
the basic process of neutron disintegration:
n → p + e + ν̄ (9.22)
We have assumed that all the particles can be represented by Dirac fields,
and GF is the Fermi constant. Conventionally the particle emitted in neutron
β − decay is an antineutrino.
Interactions 143
In modern notation, the Lagrangian for the decay of the neutron is instead
written as:
GF gA
LF = − √ ψ̄p γ μ (1 + γ5 )ψn ψ̄e γμ (1 − γ5 )ψν (9.24)
2 gV
The Sign of GF . In his work on β decays [18], Fermi wrote the Hamiltonian
density with a sign consistent with the relation Hint = −Lint = −LF . The
applications we will discuss in Chapter 15 depend only on | f |LF |i|2 . The sign
of GF is determined from the interference of the weak and electromagnetic
contributions in the forward–backward asymmetry in, for example, e+ e− →
μ+ μ− , cf. [13], and in the M SW effect on the propagation of neutrinos through
the body of the Sun, Chapter 16. The sign in (9.23) and (9.24) with GF > 0
is consistent with what is obtained if we derive the Fermi interaction from the
second order of the theory of the intermediate vector boson, cf. Chapter 15.
field, the quarks, confined inside hadrons according to the scheme in which
mesons are q q̄ states and baryons are qqq states.
The hadrons are accompanied by the family of leptons, particles which
do not feel the strong interaction, divided into charged leptons, whose elec-
trodynamic interactions were described in Section 9.1, and neutrinos, subject
only to the weak interaction. As already mentioned, three charged leptons are
known today: e, μ and τ and their associated neutrinos.
The fundamental forces acting on quarks and leptons are attributed to
the exchange of particles of spin 1, all similar to the photon, and of a spin
0 particle, the Higgs boson, coupled to the mass of the different particles. In
more detail (see [13] for an extended discussion):
• QED, Section 9.1, describes electromagnetic forces caused by the ex-
change of photons,
• the weak interactions are associated with the exchange of massive, elec-
trically charged vector bosons (W ± ) responsible for the interactions
identified by Fermi, Section 9.2, and a neutral boson (Z 0 ) responsible
for neutral current process,
• the primary strong interactions between quarks are due to the action
of fields similar to the photon, gluons, from the word ‘glue’ because of
their property of binding the quarks together to form hadrons,
• the field associated with the Higgs boson, the particle observed by the
ATLAS and CMS Collaborations at CERN in 2012 with a mass of ap-
proximately 125 GeV, has a fundamental role in breaking the symmetry
which connects the photon to the fields of the weak interaction, generat-
ing the mechanism by which the W ± and Z 0 particles, as well as quarks
and leptons, gain their masses.
Particle Names. Apart from the term hadron, recently coined from the
Greek (adrós = strong), the terms lepton, meson and baryon were born in a
historical period when the only known particles were the electron and the neu-
trino (leptons, from leptós = light), the Yukawa meson with intermediate mass
(from mesos = in between) and the nucleons (baryons, from barús = heavy).
Today we know of leptons and mesons which are much heavier than nucleons
and the names really reflect the interactions and the quantum numbers of
the particles. The name quark was given to the fundamental constituents of
hadrons by Gell–Mann, from a passage in Finnegans Wake by James Joyce
(... Three quarks for Muster Mark!), probably alluding to the fact that three
quarks are needed to describe the nucleon.
146 Relativistic Quantum Mechanics
TIME EVOLUTION OF
QUANTUM SYSTEMS
dXH (t)
i = [XH (t), H]. (10.14)
dt
To visualise the Heisenberg representation, we return to the analogy with
the motion of a classical system.
We can describe the state of the system at time t by giving the instan-
taneous position of the system in phase space, (pt , qt ); this corresponds to
the Schrödinger representation. Instead, the Heisenberg representation cor-
responds to describe the state of motion by giving the initial conditions,
150 Relativistic Quantum Mechanics
(pt0 , qt0 ), at an arbitrary but fixed time, t0 . The initial conditions completely
determine the trajectory in phase space and naturally, like the Heisenberg
state, they do not change with time.
There is an unusual aspect of the Heisenberg representation which is im-
plicit in what has been said but is worth underlining.
The Heisenberg state is independent of time t. However, the vector which
represents it depends implicitly on the value of time t0 , which was chosen to
fix the initial conditions (in other words, the time at which the Heisenberg
representation coincides with the Schrödinger representation). The choice of
t0 is arbitrary, but we must fix t0 in the same way for all states of motion, if
we wish to compare them with different states.
The vectors which represent the state of motion for a given choice of t0
differ from those relative to an alternative choice by a unitary transformation:
|A; t0 >H = e+iH(t0 −t0 ) |A; t0 >H . (10.15)
Equation (10.15) leaves invariant the expectation values (10.1). However,
as t0 changes, the vectors which represent the same state of motion can assume
a considerably different appearance.
The expectation values of the observables must be the same in the two
representations:
and therefore:
OI (t) = e+iH0 t OS e−iH0 t . (10.20)
Differentiating with respect to t, we obtain the equations of motion of the
states and the observables:
∂
i |A(t) >I = V0I (t)|A(t) >I (10.21)
∂t
dOI (t)
i = [H0 , OI (t)] (10.22)
dt
where V0I (t) is the interaction Hamiltonian in the interaction representation:
The observables change in time with the free Hamiltonian, while the time
variation of the states is due only to the interaction. It should be noted that
V0I depends explicitly on time since V0 and H0 generally do not commute.
Equation (10.21) defines a translation operator in time between t0 and t:
where we have denoted with |t >I the state at time t which becomes |t0 >I
at time t0 .
UI (t, t0 ) is a linear operator. Also, it is subject to the relation:
Clearly, we can also solve the equation of motion with a final condition,
i.e. determining the state |t >I which reduces to a given state |t1 > at a time
t1 > t. The corresponding operator, ŪI (t, t1 ), is defined by:
and it is not difficult to see that it is the hermitian conjugate of UI (t1 , t):
Obviously, |t >I differs from |t0 >I in terms that are at least of order V0I .
We obtain a solution to first order in V0I by simply substituting |t >I with
|t0 >I :
t
|t >I = |t0 >I −i dt V0I (t )|t0 >I +O(V02 ). (10.30)
t0
and so on.
Continuing the process indefinitely, we find the series:
+∞
t
|t >I = |t0 >I + (−i)n dt1 V0I (t1 )×
n=1 t0
t1 tn−1
dt2 V0I (t2 ) . . . dtn V0I (tn )|t0 >I =
t0 t0
+∞
t t1 tn−1
= [1 + (−i)n dt1 V0I (t1 ) dt2 V0I (t2 ) . . . dtn V0I (tn )]|t0 >I
n=1 t0 t0 t0
which gives a formal solution to the equation of motion with the initial con-
dition |t >I = |t0 >I , as is easily confirmed by substituting the result (10.32)
into equation (10.21).
T (V0I (t1 ), V0I (t2 ) . . . , V0I (tn )) = V0I (ta1 )V0I (ta2 ) . . . V0I (tan ) (10.33)
T (V0I (I1 ), V0I (t2 )) = θ(t1 − t2 )V0I (t1 V0I (t2 + θ(t2 − t1 )V0I (t2 )V0I (t1 )
(10.35)
and therefore it coincides exactly with the integral which appears in (10.32),
apart from a trivial change in the name of the variables. The instruction of
the time ordering in (10.36) is such that the integrals extended to the n!
regions are all equal to (10.37), so that, finally, we can rewrite UI (t, t0 ) in the
symmetric form:
t
n 1
UI (t, t0 ) = 1 + (−i) dt1 dt2 . . . dtn T (V0I (t1 )V0I (t2 ) . . . V0I (tn )).
n=1
n! t0
(10.38)
or:
The second alternative applies to the case of time reversal. All transfor-
mations which leave the direction of time unchanged should be represented
by unitary operators, and this is the case we will consider in this section. In
particular, operators that represent those groups of transformations which are
continuously connected to the identity transformation (which can always be
represented by the operator 1) should be unitary, as for example are spatial
translations and rotations, discussed in Section A.4.1, or translations in time,
Section 10.1.
Given an observable X, we define the transformed observable to be one
which has the same expectation value on the state |RA >, as X has on |A >,
so:
then:
[U (R), H] = 0 (10.47)
T † = T. (10.50)
The observables of quantum field theories, for example, the energy den-
sity, are local quantities, operators which we can think of as determined by
measurements in a small region of space-time.
We denote one of these operators as Π(x, t). The vector x is a numerical
variable (and not the position operator) since it is connected to the position
of the macroscopic apparatus which measures Π(x, t).
156 Relativistic Quantum Mechanics
Using equations (10.45) and (A.39), we find for the spatial variables:
Ht − P · x = P μ xν gμν = Pμ xμ . (10.53)
‘
CHAPTER 11
RELATIVISTIC
PERTURBATION
THEORY
Intuitively, the condition arises from the fact that when the time tends
towards infinity, particles separate indefinitely and their mutual interactions
tend to zero.
However, condition (11.1) is actually not trivial. For example, it is not
satisfied in the presence of long range forces, such as the electrostatic interac-
tion between electric charges. This case requires particular care to define the
probability of observable processes (cf. for example [11]).
Also in the case of short-range forces, to satisfy condition (11.1) we must
carefully define the interaction Hamiltonian, so as to subtract the interaction
of each particle with the field generated by the particle itself. The procedure of
subtraction of the effects of self-interaction will be discussed in detail in the
third volume of this series, titled Introduction to Gauge Theories [14].
Assuming the validity of (11.1), the state vector tends to a constant vector
in the limit t → +∞ and the same thing happens, with a different vector, if
we let t → −∞:
limt→±∞ |t >= | ± ∞ >= constant. (11.2)
For finite times, the state vector is given by the application of the linear
operator U (t0 , t) to the initial state |t0 >. From equations (11.2) it follows
equally that | + ∞ > depends linearly on | − ∞ >. We can therefore define a
linear operator independent of time, the scattering matrix, or S-matrix, which
transforms one to the other:
Unitarity. The state |f is one of the possible results of the scattering pro-
cess. Summing over all the states |fn of a complete basis, the total probability
must be unity:
P (i → fn ) = i| S |fn fn | S † |i = i| SS † |i = 1.
n n
Being the same for every |i, this result implies that the S-matrix must be
unitary:
SS † = S † S = 1. (11.6)
Comment. The fields which appear in (11.9) are operators in the interac-
tion representation, therefore they satisfy the free equations of motion and are
in fact, the operators given by expansions of the type referred to in (4.47)
and (7.12), with the canonical quantisation rules. If we add the convention
of defining LI as normal products of the fields, these rules completely deter-
mine the products which appear in the Dyson formula as long as we maintain
x1 = x2 = . . . = xn . When one or more points coincide, the products of LI
exhibit an unusual behaviour (for this aspect, cf. in particular [2]). The over-
coming of the difficulty requires a redefinition of LI and the parameters which
appear in it. The possibility to renormalise the theory in interactions will be
studied in [14]. We can disclose in advance that the renormalisation procedure
can be completed only for a restricted class of interactions: renormalisable in-
teractions. QED generated by the minimal substitution is in this category.
Conversely, the electrodynamic interaction with the addition of Pauli terms,
for example, the Lagrangian (9.16), is not renormalisable and must be con-
sidered a phenomenological theory, valid only in a restricted range of energy.
of the coordinates, which would remain unaltered under the unitary transformations in
(11.12).
Relativistic Perturbation Theory 161
Therefore, in conclusion:
< Pf in , β| S |Pin , α >= (2π)4 δ (4) (Pf in − Pin ) · F (Pf in , β, Pin , α) (11.15)
e+iαQ Se−iαQ = S; or
[Q, S] = 0.
q = q (11.16)
is satisfied.
1, 2, . . . | S |a, b =
= d4 x d4 x . . . 1, 2, . . . | χ†1 (x1 )χ†2 (x2 ) . . . [. . . ] χa (xa )χb (xb ) |a, b =
(+)
= 1, 2, . . . | (χ†1 )(−) (x1 )(χ†2 )(−) (x2 ) . . . [. . . ] χ(+)
a (xa )χb (xb ) |a, b.
(11.27)
Mi→f is called the Feynman amplitude of the process and is, from its
derivation, relativistically invariant. The factor V 2 which arises from (14.200)
is cancelled by the factors 1/V in the normalisation of the initial states. The
factors 1/V in the normalisation of the final states are balanced by corre-
sponding factors in the phase space of each final particle:
2 1 d 3 p 1 d 3 p2
Πi=1,2,... Ni (. . .) = . . . Πi=1,2,... n2i (. . .) (11.30)
(2π)3nf 2E1 2E2
f
where we have set 2Ei Ni2 = n2i = 2mi or 1 for fermions or bosons, respectively.
It should be noted that the momentum integration volume associated with
Relativistic Perturbation Theory 165
each particle is invariant for L↑+ transformations4 . as seen from the equality:
d3 p
= d4 p δ(p2 − m2 )θ(p0 ) (11.31)
2E(p)
Taking into account all these observations, we arrive at the expression
(Eb = mb for a fixed target):
(na nb )2 d 3 pi
dσ = (2π)4 δ (4) (Pf − Pi ) Πi n2 |Mi→f |2 . (11.32)
v · (4Ea mb ) (2π)3 2Ei i
We can see that the flux factor is also Lorentz invariant, by rewriting:
v · (Ea mb ) = |pa |mb = (Ea mb )2 − m2a m2b = (pa · pb )2 − p2a p2b , (11.33)
The flux is determined by the relative velocity between the two particles
(which can also be larger than c). The differential cross-section in the general
case of particles with velocity va and vb is given by:
(na nb )2 d 3 pi
dσ = (2π)4 δ (4) (Pf − Pi ) Πi n2 |Mi→f |2 .
|va − vb | · (4Ea Eb ) (2π)3 2Ei i
(11.35)
Γ(f )
B(f ) = (11.38)
Γ
represents the probability to find the state f among the decay products and
is called the branching ratio or branching fraction. Clearly:
B(f ) = 1. (11.39)
f
dΓ(a → 1 + 2 + · · · + nf ) =
1 n2a d 3 pi 2
= 3n
Π i n (2π)4 δ (4) (Pf − Pa )|M (a → 1 + 2 + · · · + nf )|2 .
(2π) f 2ma 2Ei i
(11.41)
and we are left with three variables, the energies, restricted by the energy
conservation relation:
E1 + E2 + E3 = M = const.
3. Prove that the boundary of the allowed region of the Fabri–Dalitz plot
corresponds to collinear configurations, namely θ12 = 0, π.
4. Determine the boundary of the Fabri–Dalitz plot for decay in three equal
mass particles (e.g. K 0 → π + π − π 0 ).
CHAPTER 12
THE DISCRETE
SYMMETRIES: P, C, T
x → −x, t → t (12.1)
t → −t, x → x (12.2)
which we denote as T .
The first two transformations are represented in the Hilbert space of states
by unitary operators while time inversion must act as an anti-unitary operator.
In this chapter, we will refer principally to the QED Lagrangian which, as we
will see, is invariant under all these transformations, which are therefore exact
symmetries of QED. At the end of the chapter, we will consider the Fermi
Lagrangian, the prototype description of the weak interactions, in which the
symmetries are individually violated, beginning with parity, and the only exact
symmetry is the product of all three, the T CP transformation.
12.1 PARITY
The action of parity on the field operators follows from (12.1): the trans-
formed components of the field in x must be a superposition of the field
168 DOI: 10.1201/9781003436263-12
This chapter has been made available under a CC BY NC license.
The Discrete Symmetries: P, C, T 169
For the electromagnetic field, we know that in the classical limit, the spatial
components of the vector potential are polar vectors, while the scalar potential
is just that, a scalar. Therefore (the indices repeated three times are not
summed):
PAμ (x, t)P = g μμ Aμ (−x, t). (12.4)
From here we find:
P∂ λ Aμ (x, t)P = g μμ g λλ ∂ λ Aμ (−x, t) (12.5)
from which follows the rule for the transformation of Maxwell’s tensor.
PF μν (x, t)P = g μμ g νν F μν (−x, t). (12.6)
In the case of the spinor, the general rule is written:
Pψ(x, t)α P = [P ψ(−x, t)]α = Pαβ ψ(−x, t)β (12.7)
where P is a matrix in the space of Dirac spinors to be defined so that ob-
servables constructed with the fields transform correctly under parity.
To respect the classical limit, the current must transform like the vector
potential:
P ψ̄(x, t)γ μ ψ(x, t)P = g μμ ψ̄(−x, t)γ μ ψ(−x, t). (12.8)
Substitution of (12.7) in the above equation yields
P ψ̄(x, t)γ μ ψ(x, t)P = g μμ ψ̄(−x, t)P † γ 0 γ μ P ψ(−x, t). (12.9)
• For μ = 0, we find:
P † P = 1. (12.10)
from which:
C † γ 0 C = −γ 0 . (12.23)
Now we require that terms with derivatives in the Dirac action, Skin ,
should maintain the same form when expressed in the conjugate fields:
Skin = d x iψ̄∂/ψ = − d4 x i(∂μ ψ † )γ 0 γ μ ψ
4
= + d4 x iψ(γ 0 γ μ )T (∂μ ψ † )
= + d4 x iψ̄c γ 0 C † (γ μ )T γ 0 C(∂μ ψc ). (12.24)
γ 0 C † (γ μ )T γ 0 C = γ μ . (12.25)
C † C = 1 → C † = C = C −1 (12.26)
and from equation (12.23) we see that C must anticommute with γ 0 . Further-
more, if we use the general relation:
γ 0 (γ μ )† γ 0 = γ μ (12.27)
172 Relativistic Quantum Mechanics
Cγ μ C = −(γ μ )∗ . (12.28)
C = iγ 2 . (12.29)
where we have assumed that the bilinears are defined to satisfy the hermiticity
relation:
γ 0 Γγ 0 = Γ† (12.32)
(This is the reason for the factor which appears in the pseudoscalar density).
Finally, we find that, under charge conjugation:
From (12.33), it can easily be shown that each of the Dirac bilinears takes
a characteristic sign under C, which we denote as ηC , where
ηC = +1, for S, P, A
ηC = −1, for V, T. (12.34)
If we compare the behaviour of the system which starts from A at the time
t = 0 with one which starts from AT we can express the previous statements
in the following way:
Equation (12.36) is shown graphically in Fig. 12.1, for the case of the
motion of a harmonic oscillator.
What happens in quantum mechanics? The time reversal transformation
must be carried out on the quantum states by an operator, which we denote
as T , such that (working in the Schrödinger representation):
However, the T operator must have very special properties, as we can see
from the following argument. Suppose that the state A is an energy eigenstate
with eigenvalue E. We expect that AT should also have the same energy (as
happens in classical mechanics). In that case:
as required.
we must have:
which gives two possibilities for the operator which carries out the transfor-
mation |A → |AT :
The action of the T operator on the vector potential is fixed by the clas-
sical limit. If T leaves the positions of charges unchanged and reverses their
The Discrete Symmetries: P, C, T 175
T ψ̄(x, t)γ μ ψ(x, t)T = T ψ(x, t)† T T γ 0 γ μ ψ(x, t)T =
∗ ∗
= T ψ(x, t)† T γ 0 γ μ T ψ(x, t)T = ψ(x, −t)† T † γ 0 γ μ T ψ(x, −t)
(12.50)
from which we find:
μ = 0 : T †T = 1
μ = i : T † (αi )∗ T = −αi (12.51)
where we have introduced the Dirac matrices, αi = γ 0 γ i . If we require, as
usual, that T 2 = 1, the previous conditions give:
T † T = T −1 = T
μ = i : T (αi )∗ T = −αi . (12.52)
In the Pauli representation, α1,3 are real and α2 imaginary, therefore T
must anticommute with α1,3 and commute with α2 . Thus, from (12.52) we
find:
T = iγ 1 γ 3 = σ2 (12.53)
The transformation with matrix σ2 leaves unchanged γ 0 , which is real, and
changes the signs of γ i , therefore:
T (γ μ )∗ T = g μμ γ μ (12.54)
From this, the transformation rules of the Dirac bilinears, which are formed
from products of the gamma matrices, can immediately be found. We find:
O(Γ)((x, t) = ψ̄(x, t)Γψ(x, t)
T O(Γ)(x, t)T = O(T Γ∗ T )(x, −t) = ηT O(Γ)(x, −t) (12.55)
176 Relativistic Quantum Mechanics
where:
ηT = +1 (S, P)
ηT = g μμ (V, A)
ηT = −g μμ νν
g (T). (12.56)
σ2 (σ i )∗ σ2 = −σ i (i = 1, 2, 3)
σ3 (σ2 ξs ) = −σ2 (σ3 ξs )∗ = ∓(σ2 ξs∗ )
∗
or
(σ2 ξs∗ ) = ξ−s = χs
and also
(σ2 χ∗s ) = χ−s = ξs . (12.58)
• The left-hand side, expanded from the solutions to the Dirac equation,
is:
d3 p m
3/2
Pas (p)Pe−ipx us (p) + Pbs (p)† Pe+ipx vs (p) .
(2π) E(p)
(12.60)
The Discrete Symmetries: P, C, T 177
where
We proceed as before:
• Right-hand side:
d3 p m
×
(2π)3/2 E(p)
bs (p)e−ipx (iγ 2 )αβ [(vs )β (p)]∗ + as (p)† e+ipx (iγ 2 )αβ [(us )β (p)]∗ .
178 Relativistic Quantum Mechanics
and, similarly:
where:
Figure 12.2 The Action of parity, P, and charge conjugation, C, on the electron and
positron states with different helicity. The T transformation does not change the
helicity since it changes both momentum and spin.
or:
and similarly:
Fig. 12.2 shows the action of the three symmetries on the electron and
positron states.
180 Relativistic Quantum Mechanics
Positronium is similar in every way to the hydrogen atom, except that the
reduced mass, μ, is about half the reduced mass of hydrogen:
m1 m2
μ= ;
m 1 + m2
1 1
μ e + e − me μ H . (12.81)
2 2
Because the reduced mass determines the bound state energy levels, the
positronium spectrum is scaled in energy by a factor two compared to that of
hydrogen.
Positronium is formed each time a positron comes to rest in matter. At
the end of its travel, the positron captures an electron from the surrounding
atoms and forms an excited state of positronium. In contrast to hydrogen,
The Discrete Symmetries: P, C, T 181
positronium is not stable. Once the electron and positron have reached the
ground state with L = 0, the annihilation probability is significant and the
state decays into two or more photons (cf. Landau and Lifshitz [11] for the
cross-section calculation).
For the ground state, there are two energy levels with n = 1, L = 0 and
S = 0 or 1, known respectively as parapositronium and orthopositronium. The
energy difference between them is very small because it results from magnetic
interactions between the spins.
On the basis of the results from the previous section, we can determine the
parity and charge conjugation properties of the positronium levels. Because
these operations commute with the Hamiltonian, they provide good quantum
numbers and determine the selection rules of the decays.
Parity of the Levels. We apply the P operator to the state (12.82) and use
(12.65) and the invariance of the vacuum:
P|n, L, S, J = dpR(n,L,S,J) (p) dΩp YmL (p̂)×
C(S, s3 |1/2, s; 1/2, s ) Pas (p)† PPbs (−p)† PP|0 =
s,s
= dpR(n,L,S,J) (p) dΩp YmL (p̂)×
C(S, s3 |1/2, s; 1/2, s )(−1)as (−p)† bs (+p)† |0 (12.83)
s,s
where the minus sign originates in the opposite parities of electron and
positron. Now we can let p → −p in the integral and use the parity of the
spherical harmonics to obtain:
To return to the starting expression we must (i) exchange a with b, (ii) let
p → −p, (iii) exchange s with s .
In the latter operation, we recall that the Clebsch–Gordan coefficients for
two spin 12 combinations are symmetric for the exchange of s with s in the
case S = 1, while they are antisymmetric for S = 0.
The three operations introduce a factor (i) −1; (ii) (−1)L ; (iii) (−1)S+1
respectively and therefore an overall factor ηC :
C|n, L, S, J = ηC |n, L, S, J; ηC = (−1)(−1)L (−1)S+1 = (−1)L+S . (12.86)
PSab (x, t)P = P ψ̄a (x, t)ψb (x, t)P = ηP (S)Sab (−x, t);
CSab (x, t)C = ηC (S)Sba (x, t);
T Sab (x, t)T = ηT (S)Sab (x, −t) (12.91)
where η are the ± signs characteristic of the transformation and the specific
covariant. In Table 12.1 the values of η for the three transformations and for
the combined θ = CP T operation are listed.
The CP T operation is clearly represented by an antilinear operator and
acts according to the simple rule:
Table 12.1 Summary of the properties of the Dirac covariants and of the electro-
magnetic field under P, C, T transformations, and θ = CP T .
S P V A T Aμ Fμν
ηP +1 -1 gμμ -gμμ gμμ gνν gμμ gμμ gνν
ηC +1 +1 -1 +1 -1 -1 -1
ηT +1 -1 gμμ +gμμ -gμμ gνν gμμ +gμμ gνν
ηCP T +1 +1 -1 -1 +1 -1 -1
θψ̄a Γψb θ† = θψa† (x)θ† (γ 0 Γ)∗ θψb (x)θ† = [γ5 ψa (−x)](γ 0 Γ)∗ γ5 ψb† (−x) =
= −ψb† (−x)γ5 (γ 0 Γ)† γ5 ψa (−x) = + ψ̄b γ 0 γ5 Γ† γ5 γ 0 ψa (−x) (12.96)
where we have repeatedly made use of the fact that γ5 is hermitian and anti-
commutes with γ 0 .
Combination with γ5 produces a minus sign for every factor γ μ in Γ, there-
fore:
†
γ 0 γ5 Γ† γ5 γ 0 = γ 0 (γ5 Γγ5 ) γ 0 =
= (−1)N γ 0 Γ† γ 0 = (−1)N Γ (12.97)
The CPT Theorem. The (−1)N factor applies to the total inversion of the
coordinates:
TI : xμ → −xμ . (12.99)
space from purely imaginary time, i.e. from the theory in the four-dimensional
Euclidean space. This is the origin of the CPT Theorem which, under very
general conditions, states that the operation θ, total inversion of the axes sup-
plemented with the operation of hermitian conjugation, is an exact symmetry
of any relativistic quantum field theory.
The CP T theorem is due to Luders and Pauli [19, 20]. We will give a
demonstration very close to the one of Bjorken and Drell [21]. Subsequently,
we will illustrate the most noteworthy consequences of the CP T theorem.
We consider a theory described by a Lagrangian density L(x). The con-
ditions characterising a relativistic quantum theory, the subject to which the
Lagrangian is to be constructed, are as follows.
The Lagrangian must be:
• hermitian,
• a local function of the fields and their derivatives, up to finite order,
calculated at the same point,
• a boson operator; fermion fields can appear in even numbers and they
can therefore be organised into Dirac bilinears, ψ̄a Γψb , where ψa and
ψb are fields associated with different types of fermion (e.g. electron and
neutrino),
• invariant under proper Lorentz transformations,
• a normal product of fields.
Under these conditions, we show that it must necessarily be true that:
θL(x)θ† = L(−x) (12.100)
from which it follows that the action is invariant under CP T :
† 4 †
θSθ = d x θL(x)θ = d x L(−x) = d4 (−x) L(−x) = S (12.101)
4
Proof. The general form of a Lagrangian density which satisfies the condi-
tions above can be written in the following way:
L(x) = ci Oi (x);
i
Oi (x) =: . . . Aμ (x) . . . (ψ̄a Γψb )(x) . . . ∂ ν . . . : (12.102)
where ci are complex coefficients. We now apply the CP T operation. On the
basis of Table 12.1 and the antilinearity of θ we obtain:
†
θL(x)θ† = (ci )∗ (−1)Ntot : . . . Aμ (−x) . . . (ψ̄a Γψb )(−x) . . . ∂ ν . . . :
i
(12.103)
186 Relativistic Quantum Mechanics
where Ntot is the total number of Lorentz indices which appear in the equation
(12.102).
Inside the normal product, we can commute the boson operators and put
them in the opposite order to which they appear in (12.102), which means we
can rewrite (12.103) as:
†
θL(x)θ† = (−1)Ntot [ci Oi (−x)] . (12.104)
i
where the final step follows from the fact that L is Hermitian.
We have written explicitly the hermitian conjugate of the first term, nec-
essary to make the Lagrangian real.
The interaction (12.106) clearly is NOT invariant under parity, since it is
produced by superpositions of polar and axial vectors, which change relative
sign under parity. For example:
gA μ gA μ
P ψ̄P γ μ + γ γ5 ψN (x, t)P = g μμ ψ̄P γ μ − γ γ5 ψN (−x, t).
gV gV
(12.107)
Instead, the CP transformation acts in the same way on vector and axial
currents:
gA μ gA μ
C ψ̄P γ μ + γ γ5 ψN C = (−g μμ )ψ̄N γ μ + γ γ 5 ψP
gV gV
C ψ̄e (γ − γ γ5 ) ψνe C = (−g )ψ̄νe (γ − γ γ5 ) ψe .
μ μ μμ μ μ
(12.108)
The Discrete Symmetries: P, C, T 187
θHθ† = H. (12.110)
Moreover, from Table 12.1 it follows that θ changes signs of 3-vectors, like
momentum, and changes the sign of each conserved charge that is present in
our theory (e.g. electric charge):
θP θ† = −P ; θQθ† = −Q (12.111)
θJ θ† = J (12.112)
Taking account of (12.111), the state θ|A must have the same mass, the
same value of s2 = s(s + 1) and of the spin component, but opposite electric
charge:
• the mass of the antiproton coincides with that of the proton within a
relative precision of 10−8 ,
• the masses and half-lives of K 0 and anti-K 0 mesons agree to within a
few parts in 10−18 ,
• the electron and positron have equal masses within one part in 10−8 ,
and their magnetic moments, apart from opposite signs, agree within
one part in 10−12 ,
• the magnetic moments of muon and antimuon agree within two parts in
10−8 .
The observation of any violation of the CP T theorem would imply the
necessity for improvement of the relativistic quantum field theory paradigm
and would have enormous conceptual importance.
5. Show that the Weyl Lagrangian is NOT invariant under parity. On this
basis, Pauli rejected Weyl’s theory (parity violation had not beeen dis-
covered yet).
Hint: the matrix P should obey P 2 = 1 and P σP = −σ. Does such a
matrix exist?
Sect. 12.2
1. Define the action of parity and charge conjugation on the two-component
Weyl field according to
i∂/ψ = 0. (13.1)
we can separate the solutions into two invariant sub-spaces by means of the
projection operators:
1 ± γ5
a(±) = . (13.2)
2
We define:
ψL = a(−) ψ; ψR = a(+) ψ. (13.3)
If, for example, ψR = 0 at time t0 , it will remain so at later times. The
field therefore divides into two irreducible and independent components.
In the Pauli representation of the γ matrices:
0 1
γ5 = (13.4)
1 0
The Dirac equation with m = 0 gives rise to two possible equations for the
two-dimensional spinors:
∂
i χ = ±(−i∇ · σ)χ. (13.6)
∂t
introduce only three anticommuting matrices and therefore the solution α = σ is acceptable,
the minimum dimensionality of the spinor is 2 and instead of the Dirac equation we obtain
the Weyl equation.
Weyl and Majorana Neutrinos 193
The spinors with negative energy and momentum −p along the z-axis are
instead:
+
(E<0) 1 χ
u+ (−p) = v− (p) = √ ;
2 χ+
(E<0) 1 −χ−
u− (−p) = v+ (p) = √ . (13.9)
2 χ−
If we define:
The scalar density has non-zero matrix elements between the vacuum state
and the state with a neutrino-antineutrino pair. To calculate these matrix
elements, we must examine the spinors with momentum −p:
1 χ∓
u± (−p) = √ ;
2 ±χ∓
1 ∓χ±
v± (−p) = √ ; (13.14)
2 χ±
The neutrino and antineutrino are created with the same helicity, in agree-
ment with the fact that the state must have zero angular momentum, and must
also conform with the rules described above. Fig. 13.1 summarises the states
created from the vacuum by the scalar density for each chirality, and those
created by the vector density. We leave to the reader the task of showing that
the only states which have matrix elements of the vector current different from
zero are those illustrated in the figure.
Figure 13.1 States created by the scalar and vector densities acting on the vacuum.
Weyl and Majorana Neutrinos 195
0 = α2 ; γ
γ 1 = −iΣ3 ; γ
2 = γ 2 ; γ
3 = iΣ1
σ2 0
5 = γ 2 γ 5 =
γ . (13.16)
0 −σ2
v(p) = u(p)∗
therefore the expansion of ψ with the condition that it should be real, takes the
form (we also use here the normalisation (13.8) to facilitate the limit m → 0):
1
ψ(x) = √ [ar (p)ur (p)e−i(px) + ar (p)† ur (p)∗ e+i(px) ]. (13.17)
p,r V
From here the anticommutation relations for the creation and destruction
operators a and a† can be recovered.
The neutrality of the Majorana fermions can also be seen from the form
of the conserved current. We find:
j μ = ψ̄γ μ ψ = ψ T γ 0 γ μ ψ =
= (ψ T ψ, ψ T αψ) = complex number (13.19)
(ψ T denotes a row vector with the same components as ψ). That the current
should be a complex number follows from the fact that, in all the components
of j, the spinors ψα ψβ are multiplied by matrices symmetric in α and β, and
therefore, in view of the anticommutation relations (13.18), produce a multiple
of the identity matrix. If we define the current so as to vanish on the vacuum
state, we will have Q = 0 in all the states.
A non-trivial result is obtained for the axial current:
Aμ = ψ̄γ μ γ5 ψ = ψ T γ 0 γ μ γ5 ψ (13.20)
are antisymmetric. However, the current (13.20) is conserved only in the limit
of zero mass. Using the Dirac equation (7.23) we find:
∂μ Aμ = 2mψ̄iγ5 ψ. (13.22)
The vanishing of the vector current excludes that the electron or proton,
whose electromagnetic current is certainly different from zero can be described
by a Majorana field. The neutron too, in view of the conservation of baryon
number must be described by a (complex) Dirac field.
In the Majorana representation, the matrices which represent the Lorentz
transformations S(Λ), equation (6.39), take a special form. In fact, with imag-
inary gamma matrices, the σ μν matrices are also imaginary and the S(Λ) are
real matrices.
If we examine a complex Dirac field, the spinor:
ψT γ 0 (13.23)
It then follows that we can construct with ψ a different mass term from
what appears in the Dirac Lagrangian:
LM = μ ψ T γ 0 ψ + μ ∗ ψ † γ 0 ψ † (13.25)
Weyl and Majorana Neutrinos 197
ψ̄(i∂/)ψ + μψ T γ 0 ψ + μ∗ ψ † γ 0 ψ † (13.26)
still describes spin 12 fermions with non-zero mass, as we will see, but there
is no longer invariant for global phase transformations of the field ψ. The
corresponding vector current ψ̄γ μ ψ is not conserved.
For massless fields the separation is Lorentz-invariant and the Dirac field
divides into irreducible components, each of which describes a Weyl fermion.
The matching of degrees of freedom is accounted by:
ν1 (x) = νL (x) + [νL (x)]† ; ν2 (x) = νR (x) + [νR (x)]† (MR) (13.29)
(13.30)
Equation (13.30) shows that the two spin components of the Majorana
fermion are made up of the right-handed antineutrino and the left-handed
neutrino, the components of the Weyl field νL (x). Similarly the two spin com-
ponents of ν2 are composed of the right-handed neutrino and the left-handed
antineutrino of the field νR ; see Fig. 13.2.
198 Relativistic Quantum Mechanics
The equations in (13.29) can be inverted, recalling that γ5T = −γ5 , from
which:
1 − γ5 1 − γ5 †
(νL )† = [νL ]
2 2
1 + γ5
=[ νL ]† = 0. (13.31)
2
We therefore obtain:
1 − γ5 † 1 + γ5
νL = ν1 ; (νL ) = ν1 . (13.32)
2 2
Equations (13.29) and (13.32) show the equivalence of the Weyl and Ma-
jorana theories for massless fermions.
The Dirac Lagrangian can be written in terms of Weyl or Majorana fields,
according to the equivalence:
where h.c. denotes the hermitian conjugate operator, in this case2 mD ν̄R νL .
2 We leave it to the reader to show that if we start from a complex m
D we can reduce
it to the real case with a redefinition of the relative phase between νL and νR and the
consequent redefinition of ν1 and ν2 .
Weyl and Majorana Neutrinos 199
Again using the antisymmetry of γ5 , it is easy to see that the Dirac term
takes the form, in terms of ν1,2 :
However, given the Majorana fields ν1,2 , we can also consider two new
mass terms:
1 1
LmM = M1 ν1T γ 0 ν1 + M2 ν2T γ 0 ν2 . (13.36)
2 2
Individually, the two terms correspond to a Majorana mass for the left-
handed and right-handed neutrino, for example:
1 1
M1 ν1T γ 0 ν1 = M1 (νLT γ 0 νL + h.c.). (13.37)
2 2
Overall, the neutrino masses are described by a symmetric matrix:
1
Lm = ζ T γ 0 Mζ
2
ν1 M1 mD
ζ= ; M= . (13.38)
ν2 mD M2
ν1 → ν1 = eiαγ5 ν1 ;
ν2 → ν2 = eiβγ5 ν2 . (13.39)
However, the Majorana mass terms are not invariant, while the Dirac term
is invariant for the subgroup of transformations with α = −β:
1 1
LmM → M1 ν1T γ 0 e2iαγ5 ν1 + M2 ν2T γ 0 e2iβγ5 ν2 ;
2 2
T 0 i(α+β)γ5
LmD → mD ν1 γ e ν2 . (13.40)
The conserved current in this case is known as the lepton number and
distinguishes the neutrino from the antineutrino.
In the general case, in which at least one of M1 or M2 is non-zero, the
eigenvectors of the mass matrix are two Majorana fields, and there is neither
a conserved current nor a difference between neutrino and antineutrino.
The relationships between the Weyl, Dirac and Majorana neutrino states
and masses are illustrated in Fig. 13.2. The general particle-antiparticle sym-
metry requires that the Dirac masses which connect νL with νR or ν̄L with
ν̄R takes the same mD value.
m2D
ζ (−) ν1 ; m . (13.42)
M2
For sufficiently large M2 with fixed mD , the light neutrino has a tiny mass
and is a good approximation to a Majorana–Weyl neutrino. As we will see,
this situation describes the neutrino of β-decay very well.
APPLICATIONS: QED
causes the transition of the electron from the initial state |i = |pr to the
final state |f = |p r , where p ≡ (E, p) and p ≡ (E , p ) are the 4-momenta
and r and r the spin projections.
The corresponding S-matrix has the form
m 1/2 m 1/2
Sif = f |S|i = ie
d4 x ei(p −p)x ūr (p )A/(x)ur (p)
VE VE
m 1/2 m 1/2
= ie d3 x ei(q+p−p )x
VE V E
i(E −E)t d3 q
× dt e ūr (p )A/(q)ur (p)
(2π)3
m 1/2 m 1/2
= (2π) δ(E − E ) Mif , (14.4)
VE V E
where V is the normalisation volume and
Mif = ie ūr (p)A/(p − p)ur (p) . (14.5)
We note that in (14.4) only the δ-function associated with the conservation
of energy appears. Momentum is not conserved, because the static source,
which generates the field, breaks translational invariance. The relation which
expresses the conservation of energy, which implies
|p| = |p | , (14.6)
clearly shows that the momentum absorbed by the source is neglected.
The differential cross section for the process is obtained from the formulae
given in Chapter 11:
V V d 3 p
dσ = Wif , (14.7)
v (2π)3
where Wif is the transition probability per unit time
|Sif |2 m 2
Wif = = (2π) δ(E − E ) |Mif |2 , (14.8)
T VE
Applications: QED 203
and T is the duration of the interaction. Using equations (14.7) and (14.8),
and the relation:
|p|2 d|p| = |p|E dE , (14.9)
from (14.7) we obtain
VE m 2 V
dσ = (2π) δ(E − E ) |Mif |2 |p|E dE dΩp . (14.10)
|p| VE (2π)3
where q = p − p.
We now consider the case in which the external field is the Coulomb field
generated by an atomic nucleus, which we suppose to be pointlike. Therefore
we have:
Z e
Aμ (x) ≡ , 0, 0, 0 , (14.12)
4π |x|
with the Fourier transform (FT) given by (cf. Problem 1):
e
Aμ (q) ≡ Z 2 , 0, 0, 0 . (14.13)
|q|
Substituting (14.13) into (14.11), summing over the projections of the elec-
tron spin in the final state and averaging over those of the initial state of the
electron we obtain
dσ me 2 e2 1
= Z2 |ūr (p )γ 0 ur (p)|2
dΩp 2π |q|4 2
rr
(2mαZ)2 1 1
= T r (p/ + m)γ 0 (p/ + m)γ 0 , (14.14)
|q|4 (2m)2 2
where α = e2 /4π and the sums over r and r are carried out using the com-
pleteness relations satisfied by the Dirac spinors. The calculation of the trace
in (14.14) is easily done using the result
T r (p/ + m)γ 0 (p/ + m)γ 0 = T r(p/ γ 0 p/γ 0 ) + m2 T r(γ 0 γ 0 )
= 4pμ pν (2g μ0 g ν0 − g μν g 00 ) + 4m2 (14.15)
= 4[2EE − (pp ) + m ] = 4[EE + (p · p) + m2 ] .
2
204 Relativistic Quantum Mechanics
dσ (αZ)2
= 2 [E 2 + (p · p ) + m2 ]
dΩp |q|4
(αZ)2 2 |p|2 2 θ
= 2 2E 1 − 2 sin (14.16)
|q|4 E 2
(αZ)2 θ
= 2 2E 2 1 − v 2 sin2 .
|q|4 2
where v and θ are, respectively, the velocity and scattering angle of the elec-
tron, i.e. the angle between the vectors p and p . Using the relation
θ
|q|2 = |p − p|2 = 2|p|2 (1 − cos θ) = 4E 2 v 2 sin2 , (14.17)
2
we can rewrite equation (14.17) in the form originally obtained by Mott
dσ (αZ)2 2 2 θ
= 1 − v sin . (14.18)
dΩp 4E 2 v 4 sin4 (θ/2) 2
Coulomb Divergence. The integral of the Mott cross section, (14.20), over
solid angle diverges for θ = 0, preventing definition of the total cross-section.
To be exact, for small values of θ
dσ αZ 2
d cos θ 4 2 4 4 θdθ . (14.21)
dΩp 4E v θ
The origin of this divergence can be traced to the trend towards singularity,
for |q| → 0, of the FT of the Coulomb field, equation (14.13), or to the too
slow fall off of the Coulomb field itself, as |x| → ∞, equation (14.12). Due to
this slow decrease, even particles which pass very far from the charge at the
origin feel its effect; the asymptotic states, in the interaction representation,
do not truly tend to a constant and this prevents us from rigorously defining
the S-matrix.
Fortunately, isolated electric charges do not exist in Nature. The positive
charge of each nucleus, in ordinary matter, is screened by the negative charges
of the atomic electrons. Also in an ionised plasma, due to the long range of
the electrostatic forces, the positive charges attract negative charges around
them so that, on average, the plasma is electrically neutral. The result is that
the integrand in (14.21) is suppressed for small values of θ.
We recall that the FT at |q| is sensitive to the values of the function for
|x| |q|−1 . Therefore, if we consider scattering on an atom, the presence of
the external electrons is felt at values of |q| such that:
|q|R ≤ 1 , (14.22)
where R is the atomic radius, or for:
1
θ≤ . (14.23)
REv
In this θ region the differential cross section is suppressed by the fact that
the electron sees a gradually decreasing total charge and the total cross-section
is finite.
so that the total charge is still given by Ze. The electrostatic potential of the
nucleus is now given by:
Ze ρ(y)
A0 (x) = d3 y , (14.25)
4π |x − y|
206 Relativistic Quantum Mechanics
Ze
A0 (q) = F (q);
|q|
F (q) = d3 x eiq·x ρ(x). (14.26)
M κ ! M
p |Jpμ |p =
√ ū(p ) γ μ + iσμν q ν u(p) = √ J μ (p , p) ,
V EE 2M V EE
(14.28)
where we have introduced the momentum transfer:
q μ = pμ − pμ . (14.29)
We can ask what would be the most general form of J μ (p , p) compatible
with a conserved polar 4-vector:
qμ J μ (p , p) = 0 . (14.30)
F1 (0) = 1 , (14.39)
F1 (q 2 ) = Fa (q 2 ); F2 (q 2 ) = Fb (q 2 ) − Fa (q 2 ) (14.42)
or, for q 2 = 0:
Fb (0) immediately gives the total magnetic moment (in Bohr magneton
units).
The choice (14.41) is best suited for the calculation of radiative corrections
to the anomalous magnetic moment of the electron; see [14].
A particularly convenient choice to describe the elastic electron–nucleon
cross section is given by the Sachs form factors
GE = F1 − τ F2 , GM = F1 + F2
q2
τ =− , (14.44)
4M 2
which are known, respectively, as the electric and magnetic form factors. The
normalisation conditions follow from equation (14.44):
The matrix element for the proton current between proton states is pa-
rameterised in terms of the form factors introduced in the previous section,
equation (14.38):
M2
< p |Jpμ (0)|p >= 2
ū(p )Γμ (p , p)u(p) ,
V Ep E p
i
Γμ (p , p) = F1 (q 2 )γμ + F2 (q 2 ) σμν q ν . (14.48)
2M
Applications: QED 209
We have written only the terms of the T -product which have the right
annihilation and creation operators to destroy the initial particles and create
the final particles, summarising the others, which are not essential, in the
ellipsis dots.
|e, p, 0γ >= a† (e)a† (p)|0 >= |e > |p > |0γ >= a† (e)|0e > a† (p)|0p > |0γ > ,
where we have denoted the different momenta using the names of the relevant
particles, with, for example, the state with zero electrons being |0e >, and,
naturally:
|0 >= |0e , 0p , 0γ >= |0e > |0p > |0γ > .
The current of each particle operates only on the corresponding states.
Finally, we can take the currents outside the T -product and write:
< f |S|i > = (ie)2 d4 x d4 y < e |Jeμ (x)|e > .
Fig. 14.2 (a) gives a space-time picture of the scattering process. The initial
electron propagates up to a point x where it is destroyed by the current,
which creates the final electron at the same point. Also, at x a photon which
propagates from point x to point y, or from y to x, is emitted or absorbed,
while at point y the proton current changes the initial proton into the final
one. The amplitude of the propagation of the photon between x and y is given
by the T -product on the vacuum of the electromagnetic fields and the product
of the three factors in the integral of (14.50) represents the overall amplitude
of the history which corresponds to values of x and y.
The observation of the final particles does not determine the points x and
y where the interaction took place, therefore, according to the principles of
quantum mechanics, we must sum over the amplitudes of each history to have
the total amplitude for the process, integrating over the values of x and y.
210 Relativistic Quantum Mechanics
and similarly:
M2
< p |Jpν (y)|p >= ei(p −p)y ū(p )Γν (p , p)u(p) . (14.52)
V 2 E p Ep
1
dσ = (2π)4 δ (4) (p + e − p − e)×
4
m2e M 2 d 3 p d 3 e e 4
× Hμν Lμν . (14.55)
Ee Ee Ep Ep (2π)6 (q 2 )2
1
The factor 4 comes from the averaging over the initial spins and we have
set:
Hμν = ūr (p )Γμ us (p)ūs (p)Γν ur (p ) =
r,s
Lμν = ūp (e )γ μ uq (e)ūq (e)γ ν up (e ) =
p,q
/e ν /e μ lμν
= Tr γ γ = . (14.57)
2me 2me 4m2e
Here and in what follows we neglect the electron mass in the numerators.
To obtain the differential cross section in the variables describing the final
electron, which are those which are normally observed, we must integrate over
the momentum of the final proton using the three-dimensional δ-function for
momentum conservation, which fixes:
p = e − e (14.58)
e2
α= , (14.62)
4π
we find: 2
α2 E e X
dσ = dΩ , (14.63)
16q 4 Ee M2
where
X = lμν hμν , (14.64)
and dΩ is the solid angle of the final electron.
or the energy, Ee , of the final electron. From its definition, we find:
θ
q 2 = 2Ee Ee (1 − cos θ) = 4Ee Ee sin2 . (14.66)
2
In general W 2 is used to denote the squared mass of the system recoil-
ing against the final electron. In the elastic scattering process which we are
considering we must have:
W 2 = M 2 = (p + e − e )2 = M 2 + q 2 + 2M (Ee − Ee ) ,
or:
q2
Ee = Ee + ,
2M
and therefore, using equation (14.66):
Ee
Ee = 2Ee
. (14.67)
1+ M cos θ
Qμ = (p + p )μ . (14.69)
Applications: QED 213
We find:
Qμ
Γ(p , p) = A(q 2 ) γ μ + B(q 2 ) ,
2M
A = F1 + F2 , B = −F2 . (14.70)
With this arrangement, the trace (14.68) is easily calculated, with the
result
+
2 q2 2 2
hμν = 2 Q Q (A + B) −
μ ν
B − Π (q)A
μν
,
4M 2
Πμν (q) = q μ q ν − q 2 g μν . (14.71)
The trace of the electron is obtained from (14.71) with the obvious substi-
tutions:
Qμ → E μ = (e + e )μ , M → me 0 ,
A → 1, B → 0 , (14.72)
yielding
lμν = 2 [Qμ Qν − Πμν (q)] .
Finally, we find:
+
q2
X = 4 (Qμ E μ )2 + q 2 Q2 (A + B) − 2
B 2 + 2(q 2 )2 A2 .
4M 2
From the preceding equations, we can explicitly calculate2 :
(Qμ E μ )2 + q 2 Q2 = 4M 2 (Ee + Ee )2 + q 2 (4M 2 − q 2 )
q2 θ
= 16M 2 Ee (Ee + ) + 4M 2 q 2 = 16M 2 Ee Ee cos2 .
2M 2
Finally setting:
θ
2(q 2 )2 = 8(−q 2 )Ee Ee sin2 ,
2
we find:
+
X θ q2 −q 2 2 θ
= 64Ee Ee cos2 F12 − F 2
+ (F 1 + F 2 ) 2
tan .
M 2 2 4M 2 2 2M 2 2
(14.73)
Substituting into (14.63), we obtain the Rosenbluth formula:
dσ α2 cos2 θ2 E e
=
dΩ 4Ee2 sin4 θ2 Ee
+
2 q2 2 −q 2 2 2 θ
× F1 − F + (F1 + F2 ) tan . (14.74)
4M 2 2 2M 2 2
2 Using the relation (p e) = (pe ).
214 Relativistic Quantum Mechanics
dσ α2 cos2 θ
2
= 4 , (14.76)
dΩ 0 4Ee2 sin θ2
e + γ → e + γ (14.77)
The amplitudes of diagrams (1) and (2) are easily computed to be:
∗ p/ + k/ + m
M1 = ū μ (ieγ μ )i (ieγ ν )ν u , (14.82)
(p + k)2 − m2
and
p/ − k/ + m
M2 = ū μ ∗ (ieγ μ )i (ieγ ν )ν u , (14.83)
(p − k )2 − m2
where u = us (p) and u = us (p ) are the four-spinors associated with the
electrons e and e , and μ = μr (k ) and = μr (k ) are the polarisation
vectors of the photons γ and γ .
Before going into the detailed calculations, we comment on some properties
of amplitudes with external photons implied by invariance under gauge trans-
formations, which are useful for the calculation of the sum over polarization
states of the photons.
where
Aμ (x) = r,μ eikx . (14.86)
ikx
Choosing Λ(x) = Ce we find
that is:
kμ M μ = 0 . (14.89)
This result allows us to make the summation
2
|M|2 = (Mμ )∗ Mν ∗r,μ r,ν , (14.90)
r r=1
Calculation of |M1 |2 and |M2 |2 . To sum over the spins of the electrons,
we use the relation equation (6.84), Section 6.1.4:
p/ + m
us (p)us (p) = , (14.93)
s
2m
e4 1
= Tr (p/ + m)γ μ (p/ + k/ + m)γ ν (p/ + m)γν (p/ + k/ + m)γμ .
(s − m ) 16m2
2 2
Applications: QED 217
with
which is easily obtained from the identity, valid for any two 4-vectors a and b:
C = −8mTr p/ (s̃/ + m)(s̃/ + m) = −16m2 Tr p//s̃ = −64m2 (p s̃) , (14.103)
which substituting
C = −32m2 (s + m2 ) . (14.105)
A+C +D =0 , (14.107)
218 Relativistic Quantum Mechanics
from which it follows that the only term which contributes to the trace (14.96)
is B, which we can rewrite as
B = Tr (p/ (s̃/p//s̃ + m2 p/ p/) . (14.108)
The calculation is carried out by using the relation
Tr (a//b/cd/) = aλ bμ cν dρ Tr (γ λ γ μ γ ν γ ρ ) (14.109)
= 4aλ bμ cν dρ (g λμ g νρ + g λρ g μν − g λν g μρ )
= 4[(ab)(cd) + (ad)(bc) − (ac)(bd)] .
The result is
B = 16[2(p s̃)(ps̃) − (pp )s + m2 (pp )]
= 32{(pk)(pk ) + m2 [(pp ) + (p k)]} , (14.110)
which can be written in terms of the variables s and u by noting that
m2 − u s + m2
(pk ) = , [(pp ) + (p k)] = . (14.111)
2 2
Finally we find
B = 8 4m4 + (s − m2 )(m2 − u) + 2m2 (s − m2 ) . (14.112)
From equations (14.94) and (14.96) we find, in conclusion:
1 2e4 4
|M1 |2 = 4m − (s − m2 )(u − m2 ) + 2m2 (s − m2 ) .
4m2 (s − m2 )2
(14.113)
The expression for |M2 |2 , see equation (14.83), is obtained from (14.113)
with the substitution s u:
1 2e4 4
|M2 |2 = 2 2 2
4m − (u − m2 )(s − m2 ) + 2m2 (u − m2 ) .
4m (u − m )
(14.114)
Summing Up. Putting together the results (14.113), (14.114) and (14.115)
we find, finally:
$ 2
2 4 m2 m2
|M1 + M2 |2 = e + (14.116)
m2 s − m2 u − m2
m2 m2 1 u − m2 s − m2
+ + − + .
s − m2 u − m2 4 s − m2 u − m2
Applications: QED 219
M1 + M2 = μ ν (Mμν μν
1 + M2 ) , (14.117)
and we see from equation (14.88) that it must satisfy the condition
kμ kν (Mμν μν
1 + M2 ) = 0 . (14.118)
We consider the first term on the left-hand side of (14.118), which we can
rewrite using the definitions (14.82) and (14.83), with k 2 = 0 and p2 = m2 ,
kμ kν Mμν
1 = ū k
/u. (14.121)
−ū k/(p/ − k/ + m) = −ū k/(p/ + m) = −ū [2(p k) − (p/ − m)k/] = −2(p k)ū ,
(14.123)
we obtain
kμ kν Mμν
2 = −ū k/ u = −kμ kν Mμν 1 , (14.124)
confirming that the scattering amplitude for e + γ → e + γ is invariant under
gauge transformations. We note that only the sum M1 +M2 is invariant, while
the amplitudes corresponding to the two processes illustrated in the Feynman
diagrams of the figure, considered separately, are not.
4 Without loss of generality, we may choose real polarisation vectors.
220 Relativistic Quantum Mechanics
Klein–Nishina Cross Section. Now we will use the result (14.116) to obtain
the cross section in the laboratory frame, defined by the relations
p ≡ (m, 0), p ≡ (E , p ) ,
k ≡ (ω, k), k ≡ (ω , k ) ,
with |k| = ω and |k| = ω , from which it follows that
The integration over ω with the δ-function is carried out using the rule
dF −1
δ[F (ω )] = δ(ω − ω0 ) , (14.133)
dω
with F (ω0 ) = 0. Recalling that E = m2 + |k − k |2 , we find
d 1
dω (m + ω − E − ω ) = E (ω − ω cos θ) + 1 (14.134)
(p k ) (pk) mω
= = = ,
Eω Eω Eω
and therefore
dσ e4 ω 1 mω ω ω
= + − sin2 θ , (14.135)
dΩ 32π 2 ω mE E ω ω ω
or 2
dσ α2 ω ω ω
= + − sin2 θ . (14.136)
dΩ 2m2 ω ω ω
The cross section (14.136) was obtained by Klein and Nishina in 1929 and
it provides an accurate description of the Compton effect, observed experi-
mentally for the first time in 1923.
We now consider equation (14.136) in the non-relativistic limit: ω/m 1.
Solving equation (14.126) for ω we obtain the relation
ω
ω = , (14.137)
1+ ω
m (1 − cos θ)
which shows that, in the limit which interests us here, ω/ω → 1 and equation
(14.136) becomes
dσ α2
= (1 + cos2 θ) . (14.138)
dΩ 2m2
To obtain the total cross section we carry out the angular integral
α2 2 8π α2
σ = 2π d cos θ (1 + cos θ) = . (14.139)
2m2 3 m2
Equation (14.139) is the Thomson cross section, which describes the inter-
action of the classical electromagnetic field with an electron.
Eω − p · k = Eω − p · k + ωω (1 − cos θ) , (14.143)
|p| E 2 − m2 1 1
β= = = 1− ≈1− 2 , (14.146)
E E2 γ 2 2γ
1+β 2 z
ωmax =ω ≈ ω m2 =E , (14.147)
1 − β + 2ω
E 2E 2 +
2ω
E
1+z
with
4ωE
z= . (14.148)
m2
Now we consider the case in which p and k are still oppositely directed,
i.e. φ = π, but the momenta of the particles in the final state are such that
Applications: QED 223
This technique is used for the production of beams of photons for nuclear
physics experiments. The first beam of this type, of energy ∼80 MeV, was
obtained at the end of the 1970s at the Frascati Laboratory, using electrons
from the Adone storage ring with energy E = 1.5 GeV and photons of energy
ω = 2.45 eV. A photon beam obtained in this way possesses the important
property of having a high degree of polarisation. This is a consequence of
the fact that for relativistic electrons helicity is a good quantum number.
224 Relativistic Quantum Mechanics
γ 1 + γ 2 → e− + e + , (14.151)
in which the collision between the two photons results in the creation of an
electron-positron pair. Comparing the Feynman diagrams which describe this
process to order α2 , represented in Fig. 14.5, to those for Compton scattering,
Fig. 14.3, one sees that the only difference consists in the substitution
From this it follows that the expression for |M1 + M2 |2 written in terms
of Mandelstam variables s, t and u, equation (14.116), can also be used to
obtain the transition probability for the process (14.151) by setting
π α2 1+β
σ= 2
(1 − β 2 ) (3 − β 4 ) ln − 2β(2 − β 2 ) , (14.157)
2m 1−β
where 1/2
m2
β= 1− 2 , (14.158)
ω
and, obviously, the energy of the photons must satisfy the condition ω > m.
Equation (14.157) can easily be generalised to the case of any reference
frame in which the photons move along opposing paths in the same overall
direction, with energies ω1 and ω2 , by making the substitution
1/2
m2
β → 1− , (14.159)
ω1 ω2
m2
ω1 ≥ . (14.160)
ω2
The cross section for the process (14.151) applies to the collisions of high
energy photons in cosmic rays with the thermal background radiation (cosmic
microwave background, CMB) which permeates the universe. The mean free
path of a photon with energy ω1 can be estimated from the relation
1
λ(ω1 ) = , (14.161)
σργ
Comment. Setting ω2 ≈ 6 × 10−4 eV, the value which is obtained from the
spectral distribution of a black body at a temperature of T = 2.7 K, and us-
ing the result of recent measurements which give ργ = 410 cm−3 , a threshold
energy of ω1 ≈ 4 × 1014 eV = 400 TeV is obtained. At energies just above
the threshold the cross section has a value σ ≈ 10−26 cm2 , and the mean free
path of λ ≈ 31.5 kpc turns out to be comparable with the dimensions of our
galaxy.
e+ + e − → γ1 + γ 2 . (14.162)
The cross section calculation is carried out using the Mandelstam variables
πα2 λ2 + 4λ + 1 λ+3
σ= 2 2
ln (λ + λ2 − 1) − √ , (14.166)
m (λ + 1) λ −1 λ2 − 1
with λ = Ecm /m. In the non-relativistic limit equation (14.166) becomes
1 πα2
σ≈ , (14.167)
vrel m2
√
where vrel = 2λ λ2 − 1 is the relative velocity of the electron–positron sys-
tem.
From the cross section of the process (14.162) the mean lifetime of para-
positronium (i.e. the bound state of an electron and positron with spin zero)
can be obtained, for which the decay into two photons dominates5 (cf. Chap-
ter 12).
The mean lifetime is obtained from the product of the probability of the
annihilation process and the flux, equal to vrel |ψ(0)|2 , where ψ(r) is the nor-
malised wave function of the positronium ground state. Taking into account
5 As shown in Chapter 12, the spin one state, orthopositronium, has negative charge
conjugation and cannot annihilate into two photons, resulting in a much longer lifetime.
Applications: QED 227
that only one of the four spin states of the e+ e− system is available for anni-
hilation into two photons, we obtain
4πα2 1
Γ = 4|ψ(0)|2 vrel σ ≈ |ψ(0)|2 = mα5 (14.168)
m2 2
which corresponds to a lifetime
τ = Γ−1 ≈ 1.2 × 10−10 s (14.169)
in agreement with the experimental result quoted in Chapter 12.
with the factor 2 in front of the integrals to allow for the fact that two identical
contributions are obtained with the exchange x y. Obviously the field
operators ψe and ψμ , which describe the electron and muon (and relevant
antiparticles) commute, both with each other and the electromagnetic field.
We can therefore rewrite the time-ordered product in the form
T {jeλ (x)Aλ (x)jμν (y)Aν (y)} = jeλ (x)jμν (y)T {Aλ (x)Aν (y)} , (14.173)
where jeλ =: ψ̄e γ λ ψe : and jμν =: ψ̄μ γ ν ψμ : are the electromagnetic currents
of the electron and the muon. Because neither the initial nor the final state
contain photons, the contribution of the electromagnetic field to the transition
matrix element is given by
We consider the electron current (spin indices are omitted to simplify the
notation)
jeλ (x) = Nq Nq : a†q ūe (q)eiqx + bq v̄e (q)e−iqx γ λ
q,q
!
× aq ue (q )e−iq x + b†q ve (q )eiq x : , (14.177)
with Nq = m/V Eq . The only term which gives a non-zero contribution to
the matrix element (14.176) is the one containing the destruction operators
ap and bp , i.e.
Np Np bp ap v̄e (p )γ λ ue (p) e−i(p+p ) . (14.178)
Similarly, the only term which contributes to the muon current matrix
element is
Nk Nk a†k b†k ūμ (k)γ ν vμ (k ) e+i(k+k ) . (14.179)
We therefore obtain (ue = ue (p), ūe = ūe (p ), . . .)
and
k/ + M k/ + M
uμ ūμ = , − vμ v̄μ = . (14.185)
2M
2M
k k
From the term corresponding to the electron current, we obtain
1
v̄e γ λ ue ūe γ ν ve = − 2 Tr (p/ − m)γ λ (p/ + m)γ ν , (14.186)
4m
s,s
which, substituted together with the analogous expression for the muon cur-
rent in (14.183), gives the result
1 e4 1
|Mif |2 = |Mif |2 = ×
4
4 (p + p ) 4
s,s k,k
1 1
2
Tr (p/ − m)γ λ (p/ + m)γ ν Tr [(k/ + m)γλ (k/ − m)γν ] .
4m 4M 2
(14.187)
Using the rules for the calculation of products of γ-matrix traces, we find
Tr (p/ − m)γ λ (p/ + m)γ ν = Tr(p/ γ λ p/γ ν ) − m2 Tr(γ λ γ ν )
= p pσ Tr(γ ρ γ λ γ σ γ ν ) − 4m2 g λν
ρ
1 2
= 4 p pν + p pλ − g λν (pp ) + m2
λ ν
,
(14.188)
230 Relativistic Quantum Mechanics
and, similarly
) *
Tr [(k/ + M )γλ (k/ − M )γν ] = 4 kλ k ν + kν k λ − gλν (kk ) + M 2 .
(14.189)
From now on we set to zero the mass of the electron; m ∼ 0.5 MeV,
which is negligible compared to the mass of the muon, M ∼ 105 MeV and
to the 4-momenta of the particles which participate in the process. With this
approximation, we obtain
Tr (p/ − m)γ λ (p/ + m)γ ν Tr [(k/ + M )γλ (k/ − M )γν ]
= 32 (pk )(p k) + (pk)(p k ) + (pp )M 2 , (14.190)
k ≡ (E, 0, |k| sin θ, |k| cos θ), k = (E, 0, −|k| sin θ, −|k| cos θ), (14.193)
where√θ is the angle between the momenta of the electron and muon and
|k| = E 2 − M 2 . From the definitions it follows that
|k|
(pk ) = (p k) = E 2 1 + cos θ , (14.194)
E
|k|
(pk) = (p k ) = E 2 1 − cos θ , (14.195)
E
(p + p )2 = 2(pp ) = 4E 2 . (14.196)
Substituting into (14.191) we obtain the expression
2 1 1 4 M2 M2
|Mif | = e 1 + 2 + 1 − 2 cos2 θ , (14.197)
4m2 4M 2 E E
TV
| f |S (2) |i|2 = (2π)4 δ (4) (p + p − k − k )×
V4
e4 M2 M2
1 + 2 + 1 − 2 cos2 θ . (14.198)
16E 4 E E
Applications: QED 231
|v − v | |v| 2 |p| 2
=2 = = . (14.199)
V V V E V
The result is
V V e4 ! V V
dσ = 4
(2π)4 δ (4) (p + p − k − k ) 4
...... d3 k d3 k .
2 V 16E (2π)3 (2π)3
(14.200)
The integration over k can be carried out thanks to the function
δ (3) (p + p − k − k ), while to integrate over k we use the relation
d3 k = dΩk |k |2 d|k | = dΩk |k |E dE , (14.201)
which, substituted into (14.200), gives the result
1 1 e4 !
dσ = 2
δ(2E − 2E ) 4
. . . . . . dΩk |k |E dE , (14.202)
2 (2π) 16E
or (with α = e2 /4π)
dσ α2 |k| !
= ...... , (14.203)
dΩk 16E 2 E
or
1/2
dσ α2 M2 M2 M2
= 2
1− 2 1 + 2 + 1 − 2 cos2 θ , (14.204)
dΩk 4Ecm E E E
where the total energy in the centre of mass frame is Ecm = 2E. Finally,
carrying out the angular integration we obtain the total cross section
1 1/2
dσ α2 M2 8 1 M2
σ = 2π d cos θ = 2π 2 1− 2 1+ .
−1 dΩk 4Ecm E 3 2 E2
(14.205)
Obviously, equation (14.205) makes sense only if
M2
1− >0, (14.206)
E2
or if the threshold condition for μ+ μ− pair production, E 2 > M 2 or Ecm
2
>
2
4M , is satisfied. In the limit E → ∞
4 α2
σ→ π 2 . (14.207)
3 Ecm
The trend of equation (14.207) with energy was easily predictable, because
the cross section has dimensions of an area, i.e. in natural units with =
c = 1 the inverse of energy squared. In the limit E → ∞ the only energy
scale available is E, all the masses being negligible. To second order in the
interaction Lagrangian, we therefore find
α2
σ∝ 2
, (14.208)
Ecm
in agreement with equation (14.207).
232 Relativistic Quantum Mechanics
Sect. 14.4
1. Starting from the Dyson formula and using translation invariance, show
that the Compton scattering amplitude to order α can be concisely
written as:
3. Using equations (14.116), (14.140) and (14.141), show that the differen-
tial cross section for Compton scattering on a moving electron can be
cast in the form
$ 2
dσ α2 2 m2 m2 m2 m2
= 2ω 4 + + 4 +
dΩ (s − m2 )2 s − m2 u − m2 s − m2 u − m2
u − m2 s − m2
− + , (14.209)
s − m2 u − m2
APPLICATIONS: WEAK
INTERACTIONS
n → p + e− + ν̄ . (15.1)
The interaction Lagrangian which describes the process (15.1) and (9.24)
is recalled here for convenience:
GF gA GF
LF = − √ ψ̄p γ μ (1 + γ5 )ψn ψ̄e γμ (1 − γ5 )ψν = − √ H μ Lμ . (15.2)
2 g V 2
All the particles are represented by Dirac fields and GF is the Fermi constant.
The Lagrangian is the product of two operators: the nuclear current H μ ,
which induces the transition between heavy particles, n → p, and the lepton
current Lμ , which creates the lepton pair from the vacuum. Assuming the va-
lidity of (15.2) we calculate the mean lifetime of the neutron and the electron
asymmetry with respect to the spin of the neutron. Comparing with exper-
imental values we can determine the two constants which appear in (15.2).
The nuclear matrix element can be calculated in the limit in which the
proton is non-relativistic, given the small n-p mass difference compared to
the mass of the proton. In this limit, only the currents which correspond to
diagonal Dirac matrices survive, i.e.
γ0 , γ i γ5 .
G2F μ ν ∗
|M |2 = [h (h ) ][lμ lν∗ ] . (15.4)
2
The Nuclear Part. In the sum over the proton spin in the nuclear part of
|M |2 the projector of the two spin states is used:
(χp )a (χ†p )b = δab (a, b = 1, 2) . (15.5)
spin p
with
gA
aμ = (1, · σ) . (15.6)
gV
where we have used A + B = 1 and the polarisation along the 3-axis is:
P = σ3 = T r(ρσ3 ) = A − B .
Applications: Weak Interactions 235
Lepton Part. In the lepton part, we sum over all the spins, which are usually
not observed, using the formulae (6.84) and (6.85) for the projectors of the
solutions for positive (electron) and negative (antineutrino) energy. We obtain
(lμ lν∗ ) = [ūe (pe )γμ (1 − γ5 )vν (pν )] vν (pν )† (1 − γ5 )γν † γ 0 ue (pe )
spin e,ν spin e,ν
= [ūe (pe )γμ (1 − γ5 )vν (pν )] [v̄ν (pν )γν (1 − γ5 )ue (pe )] ,
spin e,ν
We can also express dΓ as a function of the electron energy and the angle
between the neutrino and the neutron spin direction. The neutrino momentum
is reconstructed by measuring the proton momentum as well as that of the
electron, using pν = −pe − pp . It is easily seen that the terms in P λ are
symmetric for exchange of e → ν while terms in P λ2 change sign. Therefore,
with vν = 1, we obtain:
G2F
dΓ = (1 + 3λ2 ) [1 + Aνn (P cosθνn )] pν Eν pe Ee dEe dcosθνn , (15.12)
4π 3
λ(1 − λ)
Aνn = −2 . (15.13)
(1 + 3λ2 )
Table 15.2 Properties of charged leptons are determined by the weak interactions,
from the Particle Data Group [12]. The numbers in brackets denote the error on
the last digit of each quantity. In the last two columns B(l), denotes the fraction of
lepton decays, with the emission of a l− ν̄l pair, with l = e, μ.
Figure 15.1 Eliminating λ from equations (15.11) and (15.13) one finds the second
degree consistency relation in the asymmetries, equation (15.17), which is repre-
sented by the ellipse shown in the figure, in the plane (Aen , Aνn ). The experimental
point is indicated by the dot and, indeed, it lies on the ellipse, showing that the
consistency condition is quite well obeyed.
Finally, from this simple analysis we find a value for gA /gV in (15.15) very
close to that adopted by the Particle Data Group [?] listed in Table 15.1.
Substituting the latter value into (15.14), we find the Fermi constant to be:
Comment. The existence of a correlation between the neutron spin and the
electron direction of flight shows that the weak interaction violates spatial
reflection symmetry. Under this operation, the electron momentum (polar
vector) changes sign while the neutron spin (axial vector) does not change. If
the final state is an eigenstate of parity, as happens if P commutes with the
Hamiltonian, we must have ve · σ n = 0, and therefore an average value of
cosθen equal to zero.
The existence of parity violation in weak interactions, and therefore in β
decays, was hypothesised by Lee and Yang [23] in 1956 to resolve the so-called
θ − τ puzzle, the decay of K mesons into both two and three π mesons. The
first experimental observation of the spin asymmetry was in β decays of nuclei
by Wu and collaborators in 1957 [24].
For a neutrino mass exactly equal to zero, g(Ee ) vanishes linearly at the
endpoint, while if mν = 0 the curve vanishes with an infinite derivative. The
effect allows an estimate of the neutrino mass, or at least an upper limit,
with greater sensitivity the smaller the endpoint value, which in the case of
the neutron is Δm 1.29 MeV. A particularly favourable nuclide is tritium,
which decays according to the scheme:
3
H → 3 He + e− + ν̄
0.05
0.04
0.03
0.02
0.01
Figure 15.2 Calculated shape of g(Ee ) in the decay of tritium, near to the endpoint.
The upper curve corresponds to a zero neutrino mass, the lower curve to m = 10 eV.
mνe ≤ 3 eV.
G(μ)
Lμ−dec = − √ ψ̄νμ γ λ (1 − γ5 )ψμ ψ̄e γλ (1 − γ5 )ψνe
2
+ hermitian conjugate. (15.21)
G(μ) is a new constant analogous to the Fermi constant introduced for the
neutron. In the muon rest frame, the S-matrix element for the decay is:
me mνμ mνe
νμ , e, ν̄e |S|μ = (2π)4 δ (4) (p − p − q − q ) M (i → f ) ,
E(p )E(q)E(q )V 4
Applications: Weak Interactions 241
where < .. > denotes the average over the muon spin. The tensor Lμν is the
same as introduced in (15.8), while, for a state of definite spin:
M μν = T r (χχ† )γ ν p/ γ μ (1 − γ5 ) .
In general, we must replace the spinor product with a density matrix,
similarly to what was done for the neutron:
† ρ(P ) 0
< (χ)α (χ )β >→ , (15.22)
0 0
1 + P σ3
ρ(P ) = ,
2
where P is the polarisation of the muon along the 3-axis. To calculate M μν
we must use the identity (6.170):
M μν = p S ν + p S μ − g μν p α S α − iναμρ p α S ρ ,
μ ν
S μ = T r [ργ μ (1 − γ5 )] .
Multiplying by Lμν we find:
M μν Lμν = 4(qp )S α qα , (15.23)
where the momenta are attributed as in (15.20). From this:
(G(μ) )2 d3 q (4)
3 3
d p d q
ρ σ
dΓ = 5
· 32 · q ρ S σ δ (p − q − p − q ) p q .
(2π) 2Ee 4Eνe Eνμ
If we do not observe the neutrinos, we must integrate the quantity in
brackets. The size of the integration volume being invariant, the result is a
tensor in the indices ρ, σ constructed with g ρσ and the vector Q = p − q, the
only variable which remains after the integration. We therefore set:
d3 p d 3 q ρ σ
p q = δ (4) (p − q − p − q )
ρ σ
p q
4Eνe Eνμ
= A(Q2 )g ρσ + B(Q2 )Qρ Qσ .
2 2
To determine A and B, we note the relations (p = q = 0):
1 Q2
gρσ · p q = (p + q )2 =
ρ σ
·I ,
2 2
1
Qρ Qσ · p q = (Q2 )2 · I ,
ρ σ
4
d3 p d 3 q
I = 1 = δ (4) (p − q − p − q ) .
4Eνe Eνμ
242 Relativistic Quantum Mechanics
from which:
T r [ρq/(1 − γ5 )] = Ee (1 + ve P cosθe ) .
We normalise the electron energy to the maximum value it can have in the
decay, Emax = mμ /2, putting:
Ee 2x
x= = ,
Emax mμ
Integrating over the remaining variables, we find from (15.24) the total
decay rate:
2
1 G(μ) m5μ
Γ= = . (15.25)
τ 192π 3
Both the electron spectrum and the spin asymmetry are in perfect agree-
ment with the experimental data. Comparing equation (15.25) with the ob-
served value of the lifetime, Table 15.2, we find in addition:
The kinematic limits of the decay in the Ee –Eνμ plane are given by the
condition cosθeνμ = ±1. Simplifying to the case of particles of zero mass, we
find:
mμ = Ee + Eνμ + |Ee ± Eνμ | .
With the positive sign, we have:
mμ
Ee + Eν μ = therefore :
2
mμ mμ mμ
Ee = x (0 ≤ x ≤ 1) , Eνμ = (1 − x) , Eν̄e = . (15.27)
2 2 2
With the negative sign, we have two solutions:
mμ
Ee = therefore :
2
mμ mμ
Eν μ =y (0 ≤ y ≤ 1) , Eν̄e = (1 − y) , (15.28)
2 2
or:
mμ
Eνμ = therefore :
2
mμ mμ
Ee = x (0 ≤ x ≤ 1) , Eν̄e = (1 − x) . (15.29)
2 2
Finally, the extreme configurations are three collinear configurations, in
which one particle takes the maximum energy, mμ /2, and the other two divide
the remaining half between them (see Fig. 15.3).
Figure 15.3 The V –A interaction implies that, in the limit of zero mass, particles
have negative helicity and antiparticles positive helicity. Therefore in the configu-
ration in which Meνμ = 0 the amplitude has to vanish because of conservation of
angular momentum.
the same current. In this way, we arrive at the current × current expression:
G λ †
LW = − √ JW (JW )λ ,
2
λ λ
JW = JN + Jeλ + Jμλ . (15.30)
νe + nucleus → e + . . . , → μ + . . . (15.31)
νμ + nucleus → μ + . . . , → e + . . . (15.32)
The selection rules follow from the invariance of the Lagrangian (15.30) for
global phase transformations carried out separately on the νe , e fields and on
the νμ , μ fields. The symmetry implies the conservation of two types of lepton
charge: electron number and muon number
The observation of the selection rule (15.32) at the beginning of the 1960s
allowed the existence of the muon neutrino, νμ = νe , to be established.
The current × current theory finds spectacular confirmation in the decay of
the τ lepton. If we add to the weak current (15.30) the term which corresponds
246 Relativistic Quantum Mechanics
G2 · m5τ
Γ(τ → ντ + l− + νl ) =
192π 3
mτ 5
=( ) Γμ (l = e, μ) , (15.35)
mμ
or:
B(e) = B(μ);
1 B(l) mμ 5
τ (τ ) = = −
= B(l)( ) τ (μ) = 2.86 · 10−13 s ,
Γτ Γ(τ → ντ + l + νl ) mτ
in excellent agreement with the values for the leptonic decay branching ratios,
B(e), B(μ) and the lifetime given in Table 15.2.
Alternatively, from the experimental values of the τ lifetime and the lep-
tonic branching ratios we can derive a new value of the Fermi constant. We
find:
G(τ ) = 1.15 · 10−5 GeV2 , (15.36)
in excellent agreement with the value determined from the muon lifetime,
(15.26).
where we have described the intermediate boson with a complex vector field;
(0)
LIV B is the free Lagrangian and g is a new coupling constant that we assume
to be small.
Treating Lint as a perturbation, we have:
(i)2
S = 1 + i d4 xLint + d4 xd4 yT [Lint (x)Lint (y)] + . . . . (15.39)
2
248 Relativistic Quantum Mechanics
where the ellipsis dots represent terms with the emission or absorption of two
intermediate bosons (still more forbidden) and we have used the fact that
in the Dyson formula we use free fields and therefore the currents and fields
commute.
In equation (15.41) the intermediate boson propagator appears which, be-
cause of its large mass, can be approximated by a δ-function, cf. equation
(8.14) and Problem 1 below. In this limit we obtain the product of the cur-
rents:
−G † μ
S = 1 + i d4 x √ JW (x)JW μ (x) + . . . (15.41)
2
=1+i d4 x LW (x) + · · · ,
where again the ellipsis dots represent terms irrelevant at low energy and we
have put:
G g2
√ = 2 >0. (15.42)
2 MW
In the limit of large mass of the intermediate boson and with a definite
sign3 for G, the terms of order g 2 in the S-matrix agree with the first order
term in the Fermi constant of the current × current Lagrangian (15.30)!
As well as giving the physical dimension of G (recall that g is dimensionless,
like the electric charge, in natural units), equation (15.42) gives a valuable clue
for constructing a unified theory; the true coupling constant g can be of the
same order of magnitude as the electric charge if MW is sufficiently large. We
require
4π
g 2 e2 = 4πα 0.091 (15.43)
137
and we use the value of the Fermi constant (15.26). We find:
√ 2
2 2g 2
MW = (100 GeV) . (15.44)
G
3 This is irrelevant for the applications just illustrated but essential to give the correct
sign for the interaction energy of neutrinos with matter, cf. Chapter 16.
Applications: Weak Interactions 249
with g, g1 of order e.
(0) 1 †
LIV B = − W μν Wμν − M 2 Wμ (W μ )†
2
W μν = ∂ ν W μ − ∂ μ W ν
NEUTRINO
OSCILLATIONS
Since the 1970s the systematic observation of neutrinos of both natural and
artificial origin has been made possible by the development of experimental
apparatus of large dimensions (tons or thousands of tons) which signal, with
different methods, the occurrence of a neutrino interaction.
Fig. 16.1 shows the general principle of these measurements. A source pro-
duces neutrinos or antineutrinos from β decay and the detector, at a distance
L from the source, signals the interaction of the neutrino by means of the ob-
servation of a charged lepton produced in the inverse β process; see Table 16.1
for the sources and decays utilised.
In the historic experiment of Cowan and Reines [32] in 1956, the source
was the Savannah River nuclear reactor, which produced a calculable flux of
antineutrinos from neutron decays. The detector was a tank containing water.
The antineutrinos produced positrons via the reaction:
ν̄ + p → e+ + n , (16.1)
where 109m Cd denotes a metastable excited state of 109 Cd. A scintillator ma-
terial dissolved in the water transformed the γ rays into light flashes detected
with a system of photomultipliers.
Figure 16.1 Production and detection of neutrinos via direct and inverse β processes.
where m is the neutrino mass matrix and we have used the ultra-relativistic
approximation.
Referring to Fig. 16.1, the amplitude to observe in the detector a neutrino
of flavour j, originating from the source as a neutrino of flavour i (i, j = e, μ)
is calculated from the evolution matrix, e−iHt . We approximate:
t = L , |p| = Eν ,
m2
−iHL −iEν L −i 2Eaν L
A(i → j) = j|a a|e |b b|i = e j|ae a|i ,
a,b a
(16.6)
Three Flavours. The formulae in (16.3) and (16.7) are easily generalised to
the case of three lepton flavours. We write:
|νi = Uia |a (i = e, μ, τ ; a = 1, 2, 3) . (16.10)
a
(m2b − m2a )
Δab = , Δ12 + Δ23 = Δ13 . (16.14)
2Eν
1 We recall that c ∼ 197 MeV·fm and that E(GeV) = 109 E(eV).
Neutrino Oscillations 255
In this case there are two differences between independent masses and
therefore two different spatial scales over which the oscillations evolve.
In general terms, the 3 × 3 unitary matrix, U , is parameterised by three
angles and a complex phase, which implies a violation of CP symmetry (of
which more later). The parameterisation commonly used in the literature is
obtained in the following way:
The most general form of a complex vector νe in terms of three complex
vectors ν1,2,3 is written:
νe = cos θ13 [cos θ12 ν1 + sin θ12 ν2 ] + eiδ sin θ13 ν3 . (16.15)
Proof. In general, we expect a phase factor in front of each of the three terms on
the right-hand side. A redefinition of the phase of the left-hand side νe → eiα νe
redefines the three phases for a constant value of the phase, which we choose so that
the coefficients of ν1 are real. Now we can redefine the phases of ν2 and ν3 only, in a
way so that the phase of νe does not change, therefore ν2 → eiβ ν2 and ν3 → e−iβ ν3 .
By this we can also reduce the phase of ν2 to zero and we are left with one phase
for the coefficient of ν3 , as in (16.15).
The most general form of vectors νμ and ντ will be that of two orthogonal
combinations of ν and ν , which we can choose in terms of a third angle
without having to introduce new phases (the reasoning is the same as that
used to have all real coefficients in equation (16.3)):
Overall, the expression for the unitary matrix U , for three flavours, is:
⎛ ⎞
c13 c12 c13 s12 eiδ s13
U = ⎝ −c23 s12 + s23 s13 c12 e−iδ c23 c12 + s23 s13 s12 e−iδ −s23 c13 ⎠ .
s23 s12 + c23 s13 c12 e−iδ −s23 c12 + c23 s13 s12 e−iδ −c23 c13
(16.19)
Figure 16.2 The fusion reactions in the Sun (Bethe cycles) [37].
Figure 16.3 Spectrum of solar neutrinos in the Solar Standard Model [38]. Energy
in MeV. Reprinted from Ref. [12] with permission. © APS (1998).
Neutrino Oscillations 257
νe + n → e + p , (16.20)
ν + d → ν + p + n (NC) , (16.21)
n + p → d + γ, γ + e → γ + e .
In reaction (16.21) all the flavours into which the original νe can be trans-
formed are active. SN O also measures charged current reactions and scatter-
ing on electrons:
ν + d → e− + p + p (CC) , (16.22)
ν + e − → ν + e− (ES) . (16.23)
The SN O results prove that there is no solar deficit for the neutrino elastic
scattering processes, confirming the oscillatory nature of the deficit observed
in the charged current reactions.
On 23 February 1987, at 7:35 am Universal Time, neutrino events were ob-
served in two large underground detectors: Kamiokande in Japan and IM B 7
2 Homestake Gold Mine, Lead, South Dakota, USA.
3 GALLEX, INFN Gran Sasso National Laboratory, Italy.
4 SAGE, Baksan Neutrino Observatory, Caucasus, Russia.
5 Kamioka Observatory, Hida-city, Toyama, Japan.
6 Sudbury Neutrino Observatory, Creighton Mine, Sudbury, Ontario, Canada.
7 Irvine-Brookhaven-Michigan experiment for detection of proton decays, Fairport mine,
Table 16.2 Observed deficit in solar neutrino experiments. For SNO, cf. Fig. 16.5.
in the USA. A few hours later optical signals from a supernova located in our
galaxy, at a distance of 170,000 light years, were detected. The correlation
between the neutrino events and the light signals allowed a significant limit
to be set on the difference between the speed of neutrinos of ∼10 MeV energy
and the speed of light.
Cosmic rays produce neutrinos of energy around 1 GeV starting from π
and K mesons produced in high layers of the atmosphere, via the decay chain
emitted by a group of nuclear reactors located in Japan and South Korea. The
flux is dominated by a few of the reactors, situated at an average distance
of 180 km. KAM LAN D succeeded in detecting a reduction of about 40%
compared to the expected flux, and a distortion of the spectrum in agreement
with the oscillation hypothesis.
Finally, starting from the Kamiokande observations, beams of neutrinos
produced by accelerators have been developed so that they can be detected
in underground experiments placed at long distances from the source (long-
baseline neutrino experiments).
At present there are three active beams of this type:
our considerations to the first interaction, which arises from the Hamiltonian
density:
G
Hνe −e (x) = −LF ermi (x) = + √ [ēγμ (1 − γ5 )νe ] [ν̄e γ μ (1 − γ5 )e] . (16.25)
2
The sign is consistent with the sign found in Section 15.4, when we derived
the Fermi Lagrangian starting from the exchange of the charged intermediate
boson.
The calculation is greatly simplified if we exchange the fields ē and ν̄e ,
by expressing the Hamiltonian as the product of diagonal bilinears, relative
to the field of the electron and νe . This transformation, known as the Fierz
transformation, has the property of leaving invariant the V –A interaction:
(ēL γμ νL )(ν̄L γ ν eL ) ,
by the completeness of the Dirac matrices. On the other hand, from the presence of
the projectors 1 − γ5 , the Γ(i) matrices must anticommute with γ5 (cf. Chapter 13)
or Γ(i) = V, A. Taken between the left-handed fields, γμ and γμ γ5 give the same
result, except for a sign. Therefore, if we take account of the sign changes from the
exchange of e and νe due to Fermi statistics, the relation to be proven is:
T r[γ ρ γμ (1 − γ5 )] [γ μ (1 − γ5 )]δ = 4γ ρ (1 − γ5 ),
Q. E. D.
Neutrino Oscillations 261
Matrix Element. The energy density of the neutrino is found by taking the
matrix element of equation (16.27) between the states with the initial and
final electron at rest, and the initial and final neutrino with equal momentum,
p:
G
EW = + √ e(p = 0)|ēγ μ (1 − γ5 )e|e(p = 0)
2
× νe (p)|ν̄e γμ (1 − γ5 )νe |νe (p) . (16.29)
To a good approximation in the Sun there is one proton for every electron,
therefore the density of 1 g/cm3 corresponds to NA = 6.6 · 1023 electrons/cm3 ,
from which:
ρSun (r = 0)
(ρe )Sun (r = 0) 3 NA · 162(c)3
162 g/cm
ρSun (r = 0)
= 3 8.18 · 1011 eV3 , (16.34)
162 g/cm
from which
ρSun (r = 0) eV2
V = 1.35 · 10−5 3 . (16.35)
162 g/cm MeV
MSW Effect. In the (νe , νμ ) basis, we can express the neutrino mass matrix
in terms of Δm212 = m22 − m21 and the vacuum mixing angle:
i|m2 |j = i|am2a a|j = [U (m2 )diag U † ]ij =
a
Δm212 cos(2θ12 ) sin(2θ12 )
=− + ··· , (16.36)
2 sin(2θ12 ) − cos(2θ12 )
where we have omitted a term proportional to the unity matrix. Similarly, we
write the potential as:
√ 1 0
V = + 2Gρe
0 0
√
2G 1 0
=+ ρe + ··· , (16.37)
2 0 −1
from which:
m2
ΔE = +V
2E
$ ν √ %
Δm212 cos(2θ12 ) 2G Δm212 sin(2θ12 )
= − + ρe σ3 + σ1 + · · · ,
4Eν 2 4Eν
(16.38)
where σ1,3 are the familiar Pauli matrices and we have omitted terms propor-
tional to the identity matrix.
If Δm212 cos(2θ12 ) > 0, equation (16.38) shows that a critical value of the
density exists for which the coefficient of σ3 vanishes and, correspondingly, the
mixing angle becomes maximal, even if the vacuum mixing angle is very small.
In these conditions a νe → νμ resonant conversion, the Mikheyev–Smirnov–
Wolfenstein effect [40, 41], can occur. The critical density is:
Δm212 cos(2θ12 )
ρ̄e = √ . (16.39)
2 2GEν
The reactions which generate the neutrinos take place in the central region
of the Sun (r = 0) where the density is maximum; in their journey towards the
surface, the neutrinos cross regions whose density decreases until it vanishes
at the surface of the Sun, r = RSun . The mixing matrix at the exit from the
Sun is obtained by solving the Schrödinger equation along the trajectory. We
can consider two extreme cases [16]:
• the potential term is negligible:
√
Δm212 cos(2θ12 ) 2G
>> ρe (r = 0) or ρe (r = 0) << ρ̄e . (16.40)
4Eν 2
The neutrino deviates further from the critical condition, the matter
effect is negligible and the formulae for oscillations in vacuum apply.
Neutrino Oscillations 263
νe (r = RSun ) ∼ ν2 . (16.42)
Solar Neutrinos. The first proposal to explain the deficit observed at Home-
stake, advanced by Pontecorvo and Gribov, was the “just so” solution, in
which the oscillation phase is of order π/2 just at the orbit of the Earth. Re-
ferring to the values from GALLEX (deficit R ∼ 0.58%, E ∼ 0.5 MeV) we
264 Relativistic Quantum Mechanics
Table 16.3 Oscillation parameters for νe –ν determined by the KAM LAN D exper-
iment [42].
find:
The just so solution appears in the lower part of Fig. 16.4, but is not
consistent with all the other information, in particular with the KAM LAN D
observations, which require an oscillation length of terrestrial scale. For Δm212
much larger than the just so value, the oscillation in vacuum averages to a
value independent of distance:
1
P (νe → νe ) → 1 − sin2 (2θ12 ) (vacuum oscillation) . (16.46)
2
Referring to the value of GALLEX we obtain:
This solution corresponds to a vertical line with tan2 θ12 = 0.43, from the
just so solution upwards, roughly the dark region of Fig. 16.4.
For a more precise analysis, it is necessary to take account of the MSW
effect in the Sun, equation (16.38). Using the KAM LAN D parameters, Ta-
ble 16.3, and the values of the energy, Eν = 0.5, Eν = 7 MeV for gallium and
boron, respectively, we find:
Figure 16.4 Allowed regions from the measurement of the probability of non-
oscillation; νe of solar origin and ν̄e produced on Earth [16]. Reprinted from Ref. [12]
with permission. © APS (1998).
266 Relativistic Quantum Mechanics
Figure 16.5 SNO results [43]. The measurement of charged current (CC) and neutral
current (NC, ES) reactions allows separation of the contributions of νe and the
superposition νμ –ντ , and to compare them with the predictions of the Standard
Model and the predicted spectrum for neutrinos from boron. Reprinted from Ref. [12]
with permission. © APS (1998).
Using the parameters from Tables 16.1 and 16.4 with Eν ∼ 1 GeV, we
find:
therefore:
∗ 2
P (νμ → νe ) = |Ue3 Uμ3 | 2[1 − cos(Δ31 L)]
2 2 2 Δm223 L
= sin θ23 sin (2θ13 ) sin . (16.54)
4Eν
and neutrino and antineutrino states are obtained by applying the relevant
creation operators, a† and b† , to the vacuum.
Since the mixing of the fields is described by a unique matrix which we
denote as U ∗ , it follows that:
• the mixing of the neutrino states is described by the matrix U ,
We see that:
formations. For a field theory discussion of the C and P symmetries and of time reversal,
T see Chapter 12.
Neutrino Oscillations 269
and the second condition from equation (16.14) to eliminate Δ13 . We find:
∗
ΔP = −2Im [(Uμ1 Ue1 )(Uμ3 U ∗ e3)]
× {sin(Δ12 L) − sin[(Δ12 + Δ23 )L] + sin(Δ23 L)}
∗ ∗
= −2Im [(Uμ1 Ue1 )(Uμ3 Ue3 )]×
× {sin(Δ12 L)[1 − cos(Δ23 L)] + sin(Δ23 L)[1 − cos(Δ12 L)]}
=F ×G , (16.61)
Majorana Neutrino. Everything which has been said so far holds for both
Dirac or Majorana neutrinos since, for masses which are anyway << E, the
V –A interaction inhibits transitions between particles and antiparticles, of
the type νe → e+ , cf. Chapter 13. The observation of neutrinoless double–β
decay is the characteristic signal of a Majorana neutrino with mass, and is
actively being sought in several laboratories, for example Gran Sasso. The
present situtation is illustrated in the next Section.
Mass Hierarchy. As is seen from equation (16.51) and in Table 16.1, the
data do not define the sign of Δm213 ; therefore, we have two possible orderings
of the neutrino masses:
normal order : m21 < m22 < m23 ,
inverted order : m23 < m21 < m22 . (16.63)
In the case of quark masses, normal ordering holds, and in addition each
mass squared is much less than the squared mass which follows. If this scheme
were to be repeated for neutrinos, we would have:
0 ∼ m21 << Δm212 ∼ m22 << Δm223 ∼ m23 , (16.64)
but they could also have squared masses roughly equal to each other and much
larger than the differences:
m21 ∼ m22 ∼ m23 >> Δm2 . (16.65)
The absolute values of the masses can be obtained with accurate measure-
ments of the electron spectrum in β decay of tritium, Chapter 15, or from the
observation of neutrinoless double-β decay.
Neutrino Oscillations 271
π + → μ+ + ν μ (π + in flight) ,
μ+ → ν̄μ + e+ + νe (μ+ at rest) . (16.66)
From the energy and distance scales involved, the interpretation of the
result as a νμ → νe or ν̄μ → ν̄e oscillation requires a value of Δm2 very
different from the values observed with atmospheric neutrinos:
NEUTRINOLESS
DOUBLE-BETA DECAY
where N and N denote the parent and daughter nucleus, respectively; see
Section 15.1.
The stability of a nucleus of mass number A against beta decays is driven
by the dependence of its mass, MA , on the charge number Z. In the vicinity
of the minimum, the available experimental data turns out to be accurately
1 In her paper, Goeppel-Mayer actually reports that the possible occurrence of double-
N (A, Z) → N (A, Z + 2) + e− −
1 + e2 , (17.6)
N (A, Z) → N (A, Z − 2) + e+
1 + e+
2 , (17.7)
first discussed by Furry in 1939 [49]. The decays proceed mainly from the
ground state of a parent nucleus with spin-parity 0+ to the 0+ ground state
of the daughter nucleus, although in some instances the transition to excited
0+ or 2+ states is also energetically allowed.
The occurrence of 2β0ν decay processes—which is forbidden in the Stan-
dard Model of weak interactions outlined in Chapter 15—is only possible if
neutrinos behave as massive Majorana particles, the properties of which are
discussed in Chapter 13.
Neglecting the recoil energy of the daughter nucleus, the Q-value of double
beta decays reduces to
where MN and MN are the masses of the initial and final nuclei, respectively,
and me is the electron mass. Note that, because 2β2ν decays lead to the
appearance of a final state comprising four leptons, the sum of the kinetic
energies of the two charged leptons, Ee1 +Ee2 , exhibits a continuous spectrum
extending from zero to Q. In the case of 2β0ν decays, on the other hand, there
are only two final-state leptons, and Ee1 + Ee2 = Q.
Lρ = ψ e γρ (1 − γ5 )ψν , (17.10)
and
gA
H ρ = ψ N γ ρ (1 − γ5 )τ + ψN . (17.11)
gV
In the above equation, the nucleon is described by the doublet
ψp
N= , (17.12)
ψn
Neutrinoless Double-Beta Decay 275
where ψp and ψn denote the proton and neutron fields, respectively, while
τ + = (τ1 + iτ2 )/2—with the τi being Pauli matrices—is the operator raising
the lower component of the doublet.
The S-matrix element describing the decay (17.4) can be written in the
form
S2ν = d4 x1 d4 x2 e1 , e2 , ν 1 , ν 2 , N |T {LF (x1 )LF (x2 )} |N (17.13)
where Pf and Pi are the four-momenta of the initial and final states, re-
spectively, and M2ν denotes the transition ampliude. The expression of S2ν
involves the lepton tensor
with2 ,
1 1
Jρ (xi ) = ei , ν i | Lρ (x) |0 = ei γρ (1 − γ5 )νi ei(pei +pνi )xi , (17.14)
2 E ei E ν i
with |ΨN and |ΨN being the ground states of the parent and daughter
nuclei, respectively. The nuclear current is defined as
ρ
A
HA = Hiρ . (17.16)
i=1
with Hiρ being the current of Eq. (17.11) associated with the i-th nucleon.
Figure 17.2 provides a diagrammatic representation of the reaction (17.4),
in which beta decays are associated with the exchange of the positively-
charged W bosons, as described by the IVB Lagrangian LIV B of Eq. (15.38).
The Fermi Lagrangian LF can be obtained from LIV B in the MW →
∞ limit, safely applicable to the description of nuclear beta decays; see
Section 15.4.
The calculation of H ρσ is carried out by rewriting the nuclear tensor in
the form ρ
H ρσ = ΨN | HA (x1 ) |Ψm Ψm | HA
σ
(x2 ) |ΨN , (17.17)
m
2 Here we use the spinor normalisation of Eq. (13.8), which turns out to be more conve-
where 2
gV
M2ν = GT
M2ν − F
M2ν . (17.21)
gA
F GT
In the above equation, M2ν and M2ν —referred to as Fermi and Gamow-
Telller matrix elements—describe the nuclear transition amplitudes involving
Neutrinoless Double-Beta Decay 277
the vector and axial-vector weak currents, respectively. Using the approxima-
tion of Eq. (17.19) and treating the nucleons as non relativistic particles they
can be written in the form
# + #A +
ΨN | A j=1 τj |Ψm Ψm | k=1 τk |ΨN
F
M2ν = , (17.22)
m
Em − (MN + MN /2)
# + #A +
ΨN | A j=1 σ j τj |Ψm · Ψm | k=1 σ k τk |ΨN
GT
M2ν = , (17.23)
m
Em − (MN + MN )/2
with σ j being the matrix associated with the spin of the j-th nucleon.
The phase-space factor is obtained from
4
1 G4 g A
G2ν = dΩ2ν F0 (Z + 2, Ee1 )F0 (Z + 2, Ee2 ) , (17.24)
ln 2 64π 7 m2e
with
dΩ2ν = Eν21 dEν1 Eν22 dEν2 |pe1 |Ee1 dEe1 |pe2 |Ee2 dEe2 d cos θ
× δ(EN + Ee1 + Eν1 + Ee2 + Eν2 − EN ) ,
where θ is the angle between the momenta of the emitted electrons. The Fermi
factor F0 (Z, Ei ) appearing in Eq. (17.24) takes into account the effect of the
Coulomb attraction between the emitted electron and the daughter nucleus,
having charge number Z + 2. The corresponding expression is
E 2πα(Z + 2)
F0 (Z + 2, E) = , (17.25)
|p| 1 − e−2πα(Z+2)
with α being the fine-structure constant. Note that nuclear matrix elements
defined according to the above equations turn out to be dimensionless.
The occurrence of 2β2ν decay has been unambiguously observed in several
nuclei—ranging from 48 Ca to 150 Nd—using different detection techniques. The
measured values of the half-life lie in the range 1018 < 2ν < 23
∼ T1/2 ∼ 10 years [50].
Precision measurements of 2β2ν reactions can provide information useful to
constrain the models employed to perform calculations of the nuclear matrix
elements, which are also needed for the determination of the 2β0ν decay rates
to be discussed in the next section.
Figure 17.3 Diagrammatic representation of the neutrinoless double beta decay re-
action (17.6).
1) The antineutrino emitted from the upper leptonic vertex could not be
absorbed by the lower vertex, which can only absorb a neutrino;
2) The helicity of the antineutrino emitted from the upper leptonic vertex
would be positive, while the lower leptonic vertex can only absorb a
neutrino with negative helicity.
As a consequence, the occurrence of 2β0ν decay entails two necessary condi-
tions:
1) The neutrino must be a Majorana fermion, in which case ν e = νe and
the total lepton number is not conserved;
2) The neutrino mass mνe must be non zero, in which case a neutrino
of negative helicity can be emitted from the upper leptonic vertex with
amplitude mνe /Eνe , and absorbed by the lower leptonic vertex with unit
amplitude.
For convenience, we recall here the expression of the field describing massive
Majorana fermions, discussed in Chapter 13
1
ψν (x) = √ ar (p)ur (p)e−ipx + a†r (p)ur (p)eipx , (17.26)
p,r
2V Eν
Comparison between the diagrams of Figs. 17.2 and 17.3 shows that the
nuclear parts of the 2β0ν and 2β2ν amplitudes are identical. The lepton part
of Eq. (17.27)
whith the Uej being elements of the neutrino mixing matrix. Let us now
assume that the neutrino be a Majorana particle, so that
ψνTk = −ψ νk C , (17.30)
we find that the lepton part of the 2β0ν amplitude is proportional to the
effective Majorana mass, defined as
3
2
mee = Uek mk . (17.34)
k=1
280 Relativistic Quantum Mechanics
The sum over intermediate nuclear states is performed using the closure ap-
proximation, which amounts to replacing the energy E = Em + Ee2 − EN ,
appearing in the denominator of the above equation, with an average value
E ∼ 10 MeV. The sum # can then be performed exploiting completeness
of the set {Ψm }, implying m |Ψm Ψm | = 11. This procedure drastically
simplifies the calculation of the nuclear matrix elements, which reduce to the
form
A
MF
0ν = ΨN | τj+ τk+ H(rjk )|ΨN , (17.37)
j,k=1
A
MGT
0ν = Ψ N | (σ j · σ k ) τj+ τk+ H(rjk )|ΨN , (17.38)
j,k=1
where rjk is the distance between the two nucleons undergoing beta decay. The
function H(rjk ), called neutrino potential, originates from the integration of
Eq. (17.35) over the space components of the virtual neutrino momentum,
which can be performed using the approximation ωq ≈ |q|. The resulting
expression is
2RA +∞ sin(qr)
H(r) = dq , (17.39)
πr 0 q + E
with RA = 1.2 A1/3 being the radius of the parent nucleus.
Collecting the nuclear and leptonic parts together we can cast the expres-
sion of the 2β0ν half-life in the form
!−1 |mee |
2
0ν 2
T1/2 = |M0ν | G0ν (17.40)
m2e
Neutrinoless Double-Beta Decay 281
where, mee is the effective Majorana mass, defined in Eq. (17.34), and M0ν
is the dimensionless nuclear transition matrix element, including the contri-
butions of Fermi and Gamow-Teller transitions
2
gV
M0ν = M0ν −GT F
M0ν . (17.41)
gA
Table 17.1 Current limits on the 2β0ν decay half-life and the effective
Majorana mass, reported by experiments performed using different nu-
clei.
0ν
Nucleus Experiment T1/2 mee
[years] [meV]
76
Ge GERDA [53] > 1.8 × 1026 < 79 − 180
76
Ge MAJORANA [54] > 8.3 × 1025 < 113 − 269
136
Xe KamLAND-Zen [55] > 2.3 × 1026 < 36 − 156
136
Xe EXO-200 [56] > 3.5 × 1025 < 93 − 286
130
Te CUORE [57] > 2.2 × 1025 < 90 − 305
100
Mo CUPID-Mo [58] > 1.8 × 1024 < 280 − 490
82
Se CUPID-0 [59] > 4.6 × 1024 < 263 − 545
48
Ca CANDLES-III [60] > 5.6 × 1022 < 1600 − 2900
Here, cij = cos θij , sij = sin θij , with θij being a neutrino mixing angle, and
δ is the phase associated with the violation of the CP symmetry discussed in
Section 16.1. The additional phases η1 and η2 which also describe CP violation,
only appear in the case of Majorana neutrinos.
The effective Majorana masses obtained from the above equations using
the results of the global analysis of oscillation data of Esteban et al. [63] are
shown in Fig. 17.4.
0
10
-1
10
mee (eV)
-2
10
-3
10
-4
10
0.0001 0.001 0.01 0.1 1
mlight (eV)
Figure 17.4 The effective Majorana neutrino mass mee as a function of the lightest
neutrino mass, mlight . The lower and upper bands correspond to normal and inverted
order; that is, mlight = m1 , and mlight = m3 , respectively. Reprinted from Ref. [12]
with permission. © APS (1998).
CHAPTER 18
A LEAP FORWARD:
CHARMONIUM
mesons = q q̄ ,
baryons = qqq . (18.2)
charm) or one unpaired c quark (charmed particles) have been observed from
1974 onwards, with the properties anticipated in [28].
In 1973, Kobayashi and Maskawa [29] have proposed the existence of a
further pair of quarks, to explain the CP-violation observed in K decays.
Hadrons with beauty and top quarks have been first observed in 1976 and
1994, respectively.
Heavy quarks decay by Weak Interactions into a lighter quark plus particles
with total baryon number B = 0. Thus they all have the same baryon number
(1/3) of the lightest ones, which is determined by the fact that proton and
neutron are made by three quarks. Indeed charmed and beauty baryons, with
composition (qq c) and (qq b) have been observed, as well as doubly charmed
baryons (qcc) (q and q denote light quarks).
QCD. The first baryon with S = 0 heavier than the proton is the so called
3-3 resonance Δ++ (1232)2 (in parenthesis the mass in MeV). The quark model
composition of Δ++ in its s3 = +3/2 state is:
Δ++ ↑ ↑ ↑
s3 =+3/2 = (u u u ) . (18.3)
q → q a : q = u, d, s and a = 1, 2, 3 . (18.4)
The further hypothesis is done that the observed hadrons are invariant
under the new symmetry, i.e. they are colour singlets. In the case of (18.3),
this implies:
1952 in π + + p collisions.
3 The name colour is given in analogy to the flavor quantum numbers exhibited by quarks,
where the λA are eigth 3 × 3 hermitian and traceless matrices (the generators
of SU (3)colour ).
with gs the strong quark gluon coupling, g Aμ (x), A = 1, . . . , 8 the gluon fields,
and m the quark masses. Unlike photons in QED, gluons interact among them-
selves, with 3-gluon and 4-gluons interactions of order gs and gs2 , respectively.
Quark Confinement. Gluons self-interaction, in the presence of a cou-
pling constant gs = O(1) is supposed to be at the origin of quark confinement,
the fact that quarks are permanently confined inside colour singlet hadrons.
Indeed, quarks have never been observed in isolation5 .
To visualise this phenomenon, consider a (colour singlet) meson made by
a quark-antiquark pair bound by colour forces. For weak coupling, we have
colour lines of force that start from the quark, like in electrostatics, and have
4 A gauge theory based on a non-commutative symmetry like SU (3)
colour is called a
Yang-Mills theory, after the names of the authors who first worked out these theories in
1954 [71].
5 The lightest quark must be stable due to its fractional electric charge. Isolated quarks
produced in the collisions of cosmic rays or at particle accelerators have been intensively
and unsuccessfully searched in the years nineteen sixties.
A Leap Forward: Charmonium 289
Figure 18.1 Inside hadrons, colour fields condense in “strings” that go from the quark to
the antiquark (mesons) or to a three-string vertex (baryons). The energy of the string is
proportional to the lenght. If we apply a force to the quark, e.g. from a collision with an
external electron, the string is stretched until string energy reaches the q q̄ threshold and it
gives rise to two mesons (baryon plus meson), still with confined quarks.
to land all in the antiquark, since the meson is colour neutral. It has been
supposed that, going to a strong coupling, gluon self-attraction makes the
force-lines to condense into a string that goes from the quark to the antiquark,
with an associated energy which increases (e.g. linearly) with the length. In
the case of baryons, the strings originating from the three quarks merge into
a colour invariant triple vertex with their colour indices saturated by the
antisymmetric tensor abc as in Eq. (18.5).
If we give energy and momentum to one quark, e.g. from a collision with
an external electron, the string is stretched until string energy reaches the
q q̄ threshold. At this point an additional quark pair is created, see Fig. 18.2,
and a state with two mesons (or a baryon plus meson) is produced, still with
confined quarks.
Asymptotic Freedom. In 1973 David Gross and Franck Wilczek [72]
and, indipendently, David Politzer [73], computed the asymptotic behaviour
of the SU (3)colour coupling, Eq. (18.7), for large values of the momentum
transfer. Unlike what happens in all cases previously studied (QED, Yukawa
and λφ4 interaction) they found that the colour coupling gs , defined to be
very large in the confinement region q < 1 GeV, decreases logarithmically for
large momentum transfer:
gs2 (q) C
αs (q) = ≈ → 0 (q 2 → ∞) . (18.8)
4π ln q 2
Asymptotically, quark behave as free particles!
The unexpected result agreed with the scaling relations observed in the
experimental e − p deep inelastic cross-sections since the first data in 1968.
The scaling behaviour had been anticipated by J. Bjorken, on the basis of
quark current commutators, and was explained by R. Feynman in 1971 as
290 Relativistic Quantum Mechanics
indicating that the proton, seen at very large momentum transfer, behaves
like a cloud of free, point-like particles that Feynman called partons.
Experimental investigations of deep inelastic scattering of electrons and
neutrinos off protons and neutrons, in the years 1970s and 1980s, have shown
that the proton seen at large momentum can indeed be described as an inco-
herent mixture of free quarks and antiquarks with different flavours, with an
additional component of neutral partons, to be identified with gluons. Each
parton is characterised by structure functions that describe the probability to
find that particular constituent (i.e. u or d quarks or gluons) with a fraction
x of proton’s momentum.
Owing to the logarithmic approach to asymptotic freedom, Eq. (18.8),
at large but finite energies there are deviations from free-parton behaviour,
which have been accurately computed in QCD, see Ref. [74], and compared
to experimental data at increasingly large energies.
The q dependence of the coupling, the so-called running coupling, has
been computed in QCD to leading logarithmic (LLO) and next-to-leading
logarithmic order (NLLO) to be6 :
6 The running coupling is determined by the renormalization group equation
∂α(t)
= β(α), t = ln q 2 , (18.9)
∂t
where β(α) = β1 α2 + β2 α3 , β1 and β2 are constants computed in [75] to leading logarithmic
order (LLO) and next-to-leading logarithmic order (NLLO), respectively, and reported in
(18.16). The solution of (18.9) is:
αs (q 2 ) q2
dα q2
= D(αs (q 2 )) = dt = ln , (18.10)
β1 α2 + β 2 α3 Λ2QCD
where − ln Λ2QCD is an integration constant and D(α) is the primitive of the integrand in
(18.10): D (α) = (β1 α2 + β2 α3 )−1 :
1 β2 αβ2
D(α) = − − 2 ln( ). (18.11)
αβ1 β1 β1 + αβ2
To LLO (and β1 < 0):
1 q2 1
= (−β1 ) ln 2 , i.e. αs1 (q 2 ) = . (18.12)
αs1 (q 2 ) ΛQCD q2
(−β1 ) ln Λ2
QCD
To find αs to NLLO we use (18.11) inserting αs1 in the NLLO correction, to find
1 β2 β2
αs (q 2 ) = 2 1 + α s1 (q 2
) ln α s1 (q 2
) , (18.13)
(−β1 ) ln 2q β1
ΛQCD
β1
which gives Eq. (18.15). To find the value of ΛQCD given MZ , we use Eq. (18.11) yielding
β2
1+ α
β1 sM Z
ln( β
β
2
αsM Z )
1
ΛQCD = MZ Exp − = 0.244 GeV , (18.14)
2(−β1 ) αsM Z
for 5 quark flavours with masses m << MZ .
A Leap Forward: Charmonium 291
Figure 18.2 Taken from Ref. [76], the figure shows the strong coupling αs = gs2 /4π as a
function of momentum transfer, compared to recent data from LHC and other sources.
1 β2 1 β2 1
αs (q 2 ) = q2
+ q2
ln[ q2
],
(−β1 ) ln Λ2 β1 [(−β1 ) ln
Λ2QCD
]2 β1 (−β1 ) ln
Λ2QCD
QCD
18.2 CHARMONIA
The critical parameter that regulates the passage from strong to weak
regime, ΛQCD = 0.25 GeV, Eq. (18.15), can be compared with the quark
masses given in Table 18.1. Quarks u and d fall in the strong coupling region,
s marginally out, but heavy quarks are definitely in the weak coupling region.
The result suggests the possibility to compute the mass spectrum of c̄c and b̄b
mesons with methods similar to those employed for atomic spectra.
After the discovery of asymptotic freedom, T . Applequist and D. Politzer
observed that the Coulomb-like interactions associated to the exchange of
gluons would produce a series of c̄c bound states [77], in all similar to the
positronium states introduced in Section 12.5.2. By analogy, Applequist and
Politzer proposed to name charmonium each of these states and observed
that the smallness of αs at the charm quark mass scale would make their
spectrum calculable in perturbation theory. As it happens for positronium,
the hyperfine (spin-spin) interaction is expected to split the ground state into
“Paracharmonium” (J = 0) and “Orthocharmonium” (J = 1).
292 Relativistic Quantum Mechanics
in [79].
8 −4/3 is a group theoretical factor reflecting the fact that the cc̄ pair, like all hadrons,
with χ(r), the reduced radial wave function, satisfying the boundary condition
1 χ(r) 2
|ψ(0)|2 = lim ( ) =0. (18.23)
4π r→0+ r
Defining
16παs
V̄SS = |ψ(0)|2 (18.24)
9Mc2
one has:
1
+1/2, (S = 1)
< VSStot >S−wave = V̄SS × . (18.25)
−3/2, (S = 0)
with S being the total c̄c spin. To first order in this potential, we compute the
integral
+∞
1 4αs k χ(r) 2
V̄LS = 2 3
− r2 ( ) dr . (18.27)
2Mc 0 r r r
The integral is convergent at the origin for L ≥ 1 since (χ(r)/r)2 ∼ r2L for
r → 0. We restrict to P -waves and find
⎧
⎨ −2 (J = 0)
δmLS =< VLStot >P −wave = V̄LS (L · S) = V̄LS × −1 (J = 1) . (18.28)
⎩
+1 (J = 2)
294 Relativistic Quantum Mechanics
Tensor Interaction.
One defines the tensor interaction starting from (see [83, 84])
1 d 2 VV 1 dVS
VT = 2
3 2
− ,
6Mc dr r dr
1 8αs k
VT tot = −3VT [Nij Si Sj ] = 2 3
+ [Nij Si Sj ] , (18.29)
2Mc r r
with:
1 ri
Nij = (n̂i n̂j − δij ), n̂i = . (18.30)
3 r
The average value of Nij in a state with definite orbital angular momentum
L is9 :
1 2
< (n̂i n̂j − δij ) >L = a(L) Li Lj + Lj Li − δij L(L + 1) , (18.31)
3 3
1
a(L) = − , (18.32)
4L(L + 1) − 3
so that:
1 8αs k
< VT tot >L =< + >×
2Mc2 r3 r
1
×2a(L) (S·L)2 − S(S + 1)L(L + 1) . (18.33)
3
The effect, of course, vanishes for L = 0 or S = 0. For L = S = 1, we compute
the integral
1 8αs k
V1 =< 2 3
+ >P −wave =
2Mc r r
∞
1 8αs k χ(r) 2
= 2 3
+ r2 ( ) dr , (18.34)
0 2Mc r r r
and find
2 −8 (J = 0) −8 (J = 0)
δmT =< VT tot >P −wave = V1 × = V̄T ;
15 +1 (J = 1, 2) +1 (J = 1, 2)
1 1 8αs k
V̄T = < + >P −wave . (18.35)
15 Mc2 r3 r
9 This result is reported in the book of Landau and Lifshitz [86]. The r.h.s of (18.31) is
the only symmetric and traceless tensor with two indices, which can be formed with Li , the
only vector remaining after integration over r. To obtain (18.32), one multiplies both sides
of (18.31) by Li (on the left) and Lj (on the right), summing over i and j. The result follows
r×p
from the fact that n̂ is orthogonal to L = and from angular momentum commutators.
A Leap Forward: Charmonium 295
M (n, L, S, J) = E(n, L) +
+δmSStot (n, L, J) + δmLStot (n, L, J) + δmT tot (n, L, J) (18.36)
The usual spectroscopic notation for the states with quantum numbers
n, L, S, J is (see [84])
2S+1
n L J : 1 1 S0 , 1 3 S1 , . . . . (18.37)
It is also usual to denote the S-wave, spin singlet and triplet states with the
symbols ηc and ψ respectively, P -wave states with χcJ , (S = 1, J = 0, 1, 2)
and hc , (S = 0, J = 1).
Parity and charge conjugation properties of charmonia are the same as
those of positronium (Eqs. (12.84) and (12.86)); that is:
4 E( 2P )
E( 2S)
E( 1P)
3 E( 1S )
V(r)
2
1
(1S) (1P)
0
(2P)
(2S)
0 1 2 3 4 5 6 7
r (GeV -1 )
Table 18.2 Values of the constants needed for hyperfine, spin-orbit and tensor corrections,
in GeV, see Eqs. (18.24), (18.27), (18.35).
– V̄SS V̄LS V̄T
1S 0.055 0 0
1PS 0 0.013 0.011
2S 0.041 0 0
A Leap Forward: Charmonium 297
Figure 18.4 Black lines. Expected and observed ground and excited levels of Charmonia
(c̄c mesons) up to masses of 4 GeV. J P C quantum numbers are indicated at the bottom
of the plot and particles are identified by the denominations introduced in the text. The
observed masses are reported in parenthesis, next to the particle name, and are taken
from [12]. Charmonia h(2P )c and χ(2P )c1 have not been identified yet. The mass values
obtained with the Cornell potential plus hyperfine, spin-orbit and tensor corrections are
reported for each particle, with the ”th” label. Gray lines. The particle named X(3872),
J P C = 1++ , was the first example of ”exotic hadron” found among hidden charm particles,
followed by the electrically charged Z(3900), Z(4020) and Z(4430), J P C = 1+− .
decays [94]
Γ(X(3872) → ρ0 ψ(3097)
= (2.9 ± 0.4)10−1 , (18.41)
Γ(X(3872) → ωψ(3097))
Sect. 17.2
1. Using the radial equation for χ(r), Eq. (B.16), prove that:
χ(r)
∼ rL , for r → 0+
r
where L is the orbital angular momentum.
3. Compute the masses of ψ(3S) and ψ(4S) and compare to the computed
and observed values reported in [84].
CHAPTER 19
THE
BORN-OPPENHEIMER
APPROXIMATION FOR
THE DOUBLY
CHARMED BARYON
Molecules and crystals are made of two kinds of particles with very differ-
ent masses, light electrons and heavy nuclei, both moving in the fields gener-
ated by Coulomb forces. Different masses entail different scales for the space
variation of the wave functions and this is the basis of an approximation,
the Born-Oppenheimer approximation, widely used in the theory of complex
atomic systems. A recent illustration of the Born-Oppenheimer approximation
in QED is found in Weinberg’s book on Quantum Mechanics [101].
The similarity with QED, has led several authors to apply the Born-
Oppenheimer approximation to hadronic systems which contain both heavy
and light quarks, and treat them as molecule-like systems bound by QCD
forces [104–107]. We illustrate here an application of the Born-Oppenheimer
approximation (BOA), to estimate the mass of the recently observed double
charm baryon, Ξ++cc = (ccu).
First, consider the heavy particles as classical sources with fixed coordi-
nates and quantum numbers, and find the ground state of the light particles,
solving the eigenvalue equation:
Then search for solutions of the complete Schrödinger equation for wave func-
tions of the form:
Φ = Ψ(xA , xB ) f0 (xA , xB , x1 , x2 ) .
In the equation:
The remarkable fact about Eq. (19.6) is that the wave function f0 has dropped
out: what matters is only the energy of the light quarks in presence of the
heavy sources, which makes the BO approximation amenable to numerical
calculations (see e.g. [102] for a review of recent Lattice QCD calculations).
The BO approximation requires all quantum numbers of the heavy parti-
cles to be fixed when computing the light quarks energy, E. Besides position
and flavour, we have to fix spin and colour quantum numbers. In the two
cases of interest, the c quarks are colour triplets with spin 1/2, with colours
combined to produce a colour 3̄ representation and total spin 11 .
λA λA
A
T1+2 = T1A + T2A = ( ) 1 ⊗ 12 + 1 1 ⊗ ( ) 2 , (19.10)
2 2
1 For the (c c q) baryon, color 3̄ is required to make a colour singlet with the colour triplet
1 2
q and spin 1 is required by Fermi statistics, since c1 and c2 are already antisymmetrised by
¯ not consider here, we would chose the
colour as in Eq. (18.5). For the tetraquark (c1 c2 ūd),
charm pair to be a colour anti-triplet, because it is the only attractive channel, see below,
and spin 1 follows from Fermi statistics as before.
The Born-Oppenheimer Approximation for the Doubly 303
1. quark-antiquark:
2. quark-quark
Figure 19.1 Schematic representation of the doubly charmed baryon within the BO ap-
proximation.
The formula given in Eq. (18.22) follows since λ = −4/3 for quark-antiquark
in colour singlet, as indicated above.
Ξ++ +
cc (3621) → Ξc + π
+
(19.17)
Quark composition is (ccu) and, given that the cc pair must be in total spin
Scc = 1, as noted before, the baryon total angular momentum may be J =
1/2, 3/2. There is no convincing evidence of a lighter baryon of this kind4 and
we assume provisionally J = 1/2.
state with the light quark bound to c(xA ) with a Cornell-like potential, see
Eq. (19.15)
2 αs 1
VA = − + kr + V0 ; r = |x − xA | , (19.18)
3 r 2
where V0 is the constant introduced in the Cornell potential in Sect. 18.2.2,
to fix the zero of the energy.
Of course, one has to also consider the degenerate state, with q bound to
c(xB ). We assume the light quark ground level, in the presence of the two
heavy sources, to be the superposition of the two states, see Fig. 19.1:
ψ(x) + φ(x) 1 R(|x − xA |) + R(|x − xB |)
f0 (x) = =√ , (19.19)
2(1 + S) 4π 2(1 + S)
where ψ and φ are the wave functions of the two states. We indicate explic-
itly in Fig. 19.1 that the radial wave functions are the same function of two
different variables: the distances of q from the source in xA or in xB . In cor-
respondence, ψ and φ are not orthogonal and we denote by S the overlap
integral:
S = d3 x ψ(x)φ(x) (real wave functions assumed) , (19.20)
2 1
Hpert = − αS . (19.22)
3 |x − xB |
Treating (19.22) as a first order perturbation, the energy of the light quark is:
E(R) = E0 + V0 + ΔE(R) , (19.23)
where
ΔE(R) = f0 |Hpert |f0 =
2αS 1
=− 2 [I1 (R) + I2 (R)] . (19.24)
3 2(1 + S)
306 Relativistic Quantum Mechanics
The factor 2 in the numerator of Eq. (19.24) arises because there are two equal
contributions, from the souces A and B, see Fig. 19.1.
The I1,2 are functions of R defined in terms of the orbital wave functions
ψ and φ:
1
I1 (R) = d3 ξ |ψ(ξ)|2 , (19.25)
|ξ − xB |
1
I2 (R) = d3 ξ ψ(ξ)φ(ξ) , (19.26)
|ξ − xB |
where the vector ξ originates from A, taken in the origin, and |xB | = R. Ana-
lytic expressions for S, I1,2 are given in [103] for the hydrogen wave functions.
We shall evaluate them numerically.
The complete expression of E, to be inserted in the BO potential, requires
the addition of quark masses, so that:
that is
and we obtain:
e.g. Ref. [100], the difference being interpreted as due to the different colour field con-
figurations in the two cases. For comparison, 2Mcbar + Mqbar = 3.782 GeV [100].
The Born-Oppenheimer Approximation for the Doubly 307
1/R, the charm quark in xB sees the other two particles as a 3̄ source with
which it combines to form a colour singlet. Therefore, the cc interaction must
contain at large distances a confining string potential, with the same strength
as the Cornell potential introduced in (18.17).
To take this effect into account, we add to the BO potential a linearly rising
term determined by the string tension k of charmonium, see Sect. 18.2.1
and leave, for the moment, the onset point R0 as a free parameter. The Born-
Oppenheimer potential then reads
2 αs
VBO (R) = − + ΔE(R) − ΔE(0) + Vconf (R, R0 ) +
3R
+2Mcmes + Mqmes . (19.33)
We set |xA | = 0, |xB | = R and note that φ(xA ) = ψ(xB ) = ψ(R) and
φ(xB ) = ψ(xA ) = ψ(0); thus, we find
For a Ξcc with spin J = 1/2, we have 2s · (sA + sB ) = −2 and the spin
potential, to be added to the BO potential (19.33) is
16παs 1
VBOspin = + ψ(0) + ψ(R)]2 . (19.35)
9Mq Mc 2(1 + S)
2 αs
VBOs (R) = − + ΔE(R) − ΔE(0) + Vconf (R, R0 ) + VBOspin +
3R
mes mes
+2Mc + Mq . (19.36)
0.4
0.4
0.3
RBO (R)
R(1S) 0.2
0.2
0.0
0.1 2A 3/2 Exp [-A r] 2 4 6 8 10
R(GeV-1 )
VBO (R)
-0.2
0.0
2 4 6 8 10
r(GeV-1 )
Figure 19.2 Left: Single exponential (dashed curve) fitted to the radial wave function of
the (qc) orbital (solid curve). Right: BO potential (no spin) and radial BO wave function.
correction to the eigenvalue EBO of the Schrödinger equation with the spinless
potential (19.33):
4παs
Ehf (sA , sB ) = + |Ψ(0)|2 , (19.37)
9Mc2
Table 19.1 Eigenvalues of the Born-Oppenheimer equation without and with the hyperfine
potential and the resulting M (Ξcc ) mass, for R0 = 7 ± 2 GeV−1 see text . Energy and mass
in MeV.
R0 (GeV −1 ) EBO , Eq. (19.33) EBOs , Eq. (19.36) Ehf (cc) 2Mcm + Mqm M (Ξcc )
5 48 34 1.58 3642 3678
7 22 9 1.1 ” ” 3652
9 11 1.6 0.9 ” ” 3645
We report in Fig. 19.2 (right) the no-spin BO potential, Eq. (19.33), and
the corresponding radial wave function, Ψ(R).
Table 19.1 summarises the eigenvalues of the Born-Oppenheimer equation
without, Eq. (19.33), and with, Eq. (19.36), the qc hyperfine potential for
R0 = 7 ± 2 GeV−1 . The hyperfine cc interaction is computed with Eq. (19.37).
To obtain M (Ξcc ) the quark mass combination 2Mcmes +Mqmes has been added.
In conclusion, we find:
+26 −1
M (Ξ++
cc , J = 1/2) = 3652−7 for R0 = 7 ± 2 GeV , (19.40)
to be compared to M (Ξ++ cc )exp = 3621. The agreement with the observed
Ξ++
cc mass is reasonable (given the error of the BO approximation estimated
in the next Section) at the expense, however, of having introduced the ad-hoc
parameter R0 . The real measure of the effectiveness of the Born-Oppenheimer
approximation has to wait for the comparison of this formula with the different
levels of doubly charmed baryons predicted by the quark model, as it was the
case for the charmonium spectrum vs. the Cornell potential.
Basic Elements of
Quantum Mechanics
• The superposition of |A > with itself (the case β = 0) gives rise to the
same physical state; |A > and α|A > represent the same state for all
values of α = 0.
• In the case in which α and β are both = 0, the result of a measurement
of X on the state |C > can be a or b; only one of these two values can
be the outcome of the measurement.
• It is not possible to predict which of the two values will be the result of
a given measurement. However, if the states |A > and |B > are correctly
normalised (in the sense defined below), the frequencies with which the
outcomes a and b occur are in the ratio |α|2 /|β|2 .
< C|αA + βB >= α < C|A > +β < C|B > (A.3)
< αA + βB|C >= α∗ < A|C > +β ∗ < B|C > (A.4)
With this final constraint, the space H is a Hilbert space, and the quantity
< A|A > is the square of the norm of the vector |A >.
Besides H, we can consider a new vector space, the dual space H∗ , defined
as the space of linear functionals (with complex values) defined on H. It is not
difficult to be convinced that the elements of H∗ have a one-to-one mapping
onto those of H. In effect, according to a well known theorem due to Riesz,
every linear functional f (|A >) can be written as:
with |f > a fixed ket. Therefore we can make the functional f (|A >) in H∗
correspond to |f > in H. (A.5) explains the Dirac notation for the elements
of H∗ , according to which we denote with:
< f| (A.6)
Basic Elements of Quantum Mechanics 313
the element of H∗ which corresponds to the ket |f >. The vectors < f | are
known by the name bra. As a result of (A.4), the bra < f | depends antilinearly
on the ket |f > and the scalar product (A.2) can be interpreted as the product
of the bra < B| with the ket |A > (bra* ket = bracket = product, from which
the terms bra and ket are derived).
Turning to the probabilistic interpretation of the results of the measure-
ment of X on the state |C > in equation (A.1), we assume that:
• If the vectors |A > and |B > have unity norm, the probability of ob-
taining a (or b) from the measurement of X on |C > is equal to |α|2 (or
|β|2 ).
for every < A| and |B >. This complex number depends antilinearly on |B >.
Therefore we can write:
.
Moreover, because |V > depends linearly on the ket |A > which corre-
sponds to bra < A|, we can also write:
(X † )† = X (A.10)
(αX)† = α∗ X † (A.11)
1 In what follows we will assume for simplicity that the operators are defined in all H.
314 Relativistic Quantum Mechanics
with:
cn =< hn |A >
< A|A >= |cn |2 . (A.15)
n
P2 = P (A.16)
P |h1 > = |h1 > (A.17)
P |V > = 0, if < h1 |V >= 0 (A.18)
Basic Elements of Quantum Mechanics 315
i.e. if |V > is orthogonal to |h1 >. It is easy to see that P can be written
formally as:
P = |h1 >< h1 | (A.19)
(assuming < h1 |h1 >= 1). Thus:
and the completeness condition for the base set of eigenvectors of H is ex-
pressed as:
|hn >< hn | = 1 (A.21)
n
.
Similarly,
< A|A >=< A|1|A >= < A|hn >< hn |A >= |cn |2 .
n n
• The sum of the probabilities for all possible cases must be equal to one.
If |A > is normalised to unity, equation (A.15) shows that:
This result gives a physical meaning to the scalar product of two vectors.
Let us suppose that |A > and |B > correspond to states in which two
different quantities assume definite values, xa for the observable X in
|A > and yb for the observable Y in |B >. If |A > and |B > are
normalised to unity, the probability that a measurement of Y in |A >
has the result yb is given by the squared modulus of the corresponding
scalar product:
For this reason, the scalar product is also known as the probability
amplitude.
[x, p] = i. (A.25)
2 This means that each of these measurements is carried out on a new replica of the
system, prepared in the state |A > using appropriate experimental apparatus.
Basic Elements of Quantum Mechanics 317
dx|x >< x| = 1 (A.27)
dp|p >< p| = 1. (A.29)
The wave function of a given ket, |A >, is the component of |A > in the
base set of the eigenvectors |x >:
with:
< B|A >= dx < B|x >< x|A >= dxψB (x)∗ ψA (x) (A.31)
from which:
1 =< A|A >= dx < A|x >< x|A >= dx|ψA (x)|2 (A.32)
or:
(U (a)ψA )(x) =< x|U (a)|A >=< x − a|A >= ψA (x − a). (A.40)
from which
L = x × p. (A.45)
U (R) = e−in·L
R = e−in·T (A.48)
320 Relativistic Quantum Mechanics
where n is the same vector which appears in equation (A.46) and the 3 × 3
matrices, T , are given by:3
R1 R2 = e−in1 ·T e−in2 ·T
= e−i[(n1 +n2 )·T +(i/2)[n1 ·T ,n2 ·T ]+...]
= e−i[(n1 +n2 −(1/2)n1 ×n2 )·T +...] (A.51)
where the dots represent higher order terms in n1 or n2 . On the other hand,
we have:
where the last equalities follow from the composition rule (A.47). For com-
parison, we see that the operators L must obey the commutation relations
(A.46). The fact that the canonical commutators lead to these relations shows
that the operators U , equation (A.44), provide a representation of the rotation
group, but there could be (and there are, as we will see) other independent
representations.
The considerations we have just explained lead to a very general definition
of angular momentum.
For any physical system, we can (operationally) identify operators U (R)
which describe the action of a rotation on that system and for which the
equations (A.46) hold. For this system, we define the angular momentum
from the equation:
U (R) = 1 − in · J (A.53)
for infinitesimal transformations. From what was seen earlier, the components
of J automatically satisfy commutation relations analogous to (A.46):
3 For example, it is easily shown that the matrix in equation (A.43) can be written as
e−iθT3 with (T3 )12 = −(T3 )21 = i132 , and all other elements equal to zero.
4 This relation follows directly from expanding the exponentials in powers of n and n .
1 2
Basic Elements of Quantum Mechanics 321
U (a) = 1 − ia · P (A.55)
A.4.2 Spin
The simplest example of the definition of angular momentum just given is
that of a particle with spin. In this case, the kets which represent a particle
localised in x are characterised by a further quantum number σ such that the
effect of a rotation, as well as turning x into Rx, is that of producing a linear
combination of the states corresponding to various values of σ ∗5 :
and furthermore the relation (A.56) implies that the matrices S(R) must
themselves provide a representation of the rotation group (for infinitesimal
rotations):
S = 1 − in · S (A.59)
with S being three suitable matrices in the space σ which satisfy the angular
momentum commutation rules:
and furthermore:
[U (R)ψ]σ (x) =< σ, x|U (R)|A >= S(R−1 )σσ ψσ (R−1 x)
The operator associated with the generators of rotations, the total angular
momentum, is the sum of two mutually commuting terms: the orbital angular
momentum, L, and the spin angular momentum, S.
J = L + S. (A.64)
The Non-Relativistic
Hydrogen Atom
p2
H = H0 + V = +V (B.1)
2m
and with the canonical commutation rules:
[r, pr ] = i. (B.6)
The factorisation we seek can be obtained starting from the expression for
the square of the orbital angular momentum1 :
A = r2 p2 − i(x · p) = r2 p2 − irpr + 1
B = (p · x)(x · p) + i(x · p) = (x · p)(x · p) − 2i(x · p)r =
i i
= r(pr + )r(pr + ) − 2irpr + 2 = (rpr + i)(rpr + i) − 2irpr + 2 =
r r
= r2 p2r − irpr + 1 (B.8)
from which:
L · L = r2 (p2 − p2r )
or:
L·L
p2 = p2r + . (B.9)
r2
(Being invariant under rotation, r commutes with all components of L).
In terms of the Laplacian operator, we recover the well known formula:
∂ ∂ ∂ 1 ∂ 1 L·L
− =− + + + =
∂xi ∂xi ∂r r ∂r r r2
1 ∂ ∂ L·L
= − 2 r2 + 2 . (B.10)
r ∂r ∂r r
p2 α p2 L·L α
H= − = r + 2 − . (B.11)
2mr r 2mr r r
with:
1 1 ∂ 2 ∂ l(l + 1) α
− r + − R(r) = ER(r). (B.14)
2mr r2 ∂r ∂r 2mr r2 r
Boundary Conditions. The condition that χ(r) does not extend into non-
physical negative values of the radial variable is imposed by assigning a con-
stant potential, V0 , to the region r < 0 and making V0 → +∞. In this situation
it is well known that:
such that the wave function is normalisable, as is necessary for a bound state:
+∞
d3 x|ψ(x)|2 = dΩ|Ylm |2 r2 dr|R(r)|2 = dr |χ(r)|2 < +∞. (B.19)
0
326 Relativistic Quantum Mechanics
l(l + 1) 2
−χ (ρ) + − χ(ρ) = χ(ρ). (B.21)
ρ2 ρ
with s > 0 to satisfy the condition (B.17). Inserting this expression into (B.21),
the term in a0 ρs generates a term proportional to ρs−2 which should not exist,
if we wish the series to start with a positive power. This term is:
(we have discarded the root with s < 0). Now, the condition that the term in
ρs−1 should also be absent determines a1 starting from a0 , the cancellation of
the term in ρs determines a2 in terms of a1 , etc. Proceeding in this way, one
arrives at the solution which satisfies (B.25).
For general values of , however, the solution found in this way tends
asymptotically to the form (B.22), which is not normalisable unless B = 0.
We now show that this can occur only if the series terminates for a finite value
of ν.
The Non-Relativistic Hydrogen Atom 327
We write the various terms of equation (B.21) in the form (B.23) and we
identify the coefficients of ρν−1 . One finds:
√ l(l + 1)
−χ(ρ) → −(s + ν + 1)(s + ν)aν+1 + 2 −(s + ν)aν + aν−1 +
ρ2
→ l(l + 1)aν+1
2
− → −2aν
ρ
−χ(ρ) → −aν−1 (B.26)
total →
√
0 = [l(l + 1) − (s + ν + 1)(s + ν)]aν+1 + 2[ −(s + ν) − 1]aν . (B.27)
or:
1
=− . (B.32)
(l + ν̄ + 1)2
l = 0, · · · , n − 1 (B.34)
328 Relativistic Quantum Mechanics
l=n−1
N (En ) = (2l + 1) = n2 . (B.35)
l=0
E1 = −Ry : 1S,
E2 = −Ry/4 : 2S, 2P,
E3 = −Ry/9 : 3S, 3P, 3D
···
B.4 EIGENFUNCTIONS
We start from the radial equation, Eq. (B.21), assuming the eigenvalue
= −1/n2 found in Eq. (B.33). For r → +∞ the radial equation reads
1
χ = χ (B.36)
n2
that is χ(ρ) = e±ρ/n . A normalizable wave function requires choosing the
minus sign and we set
ρ
χ(ρ) = ρl+1 P (ρ)e− n (B.37)
so that
2 2
P = w , P = ( )2 w , etc. (B.42)
n n
and find the equation:
As discussed in Ref. [86] (p. 119 and Mathematical Appendix d) this equation
admits solutions2 which are exponentially bound at infinity for n and l non-
negative integers and3
n≥l+1 (B.44)
In these cases, the solutions are polinomials related to the Laguerre polynomi-
als. Not surprisingly, we have re-obtained the condition previously found by
a direct calculation based on the power expansion of the reduced radial wave
function, Eqs. (B.31) and (B.32).
The lowest radial wave functions. In terms of the polynomials P (ρ)
defined in (B.37), one has
R10 = 2e−ρ ;
1 ρ
R20 = √ (1 − )e−ρ/2 ;
2 2
1
R21 = √ ρe−ρ/2 ;
2 6
2 2ρ 2ρ2 −ρ/3
R30 = √ (1 − + )e ;
3 3 3 27
8 ρ
R31 = √ ρ(1 − )e−ρ/3 ;
27 6 6
4
R32 = √ ρ2 e−ρ/3 .
81 30
331
332 Bibliography
[36] Z. Maki, M. Nakagawa, and S. Sakata, Prog. Theor. Phys. 28, 870 (1962).
[37] [Link] chain
reaction
[38] J. N. Bahcall, A. M. Serenelli, and S. Basu, ApJ 621, L85 (2005); e-print
arXiv astro-ph/0412440.
[39] V. N. Gribov and B. Pontecorvo, Phys. Lett. B 28, 493 (1969).
[40] L. Wolfenstein, Phys. Rev. D 17, 2369 (1978).
[41] S. P. Mikheyev and A. Y. Smirnov, Prog. Part. Nucl. Phys. 23, 41 (1989).
[42] S. Abe et al. [KamLAND Collaboration], Phys. Rev. Lett. 100, 221803
(2008); e-print arXiv:0801.4589 [hep-ex].
[43] O. Q. R. Ahmad et al. [SNO Collaboration], Phys. Rev. Lett. 89, 011301
(2002); e-print arXiv nucl-ex/0204008.
[44] G. L. Fogli, E. Lisi, A. Marrone, A. Palazzo, and A. M. Rotunno, Phys.
Rev. D 84, 053007 (2011); e-print arXiv:1106.6028 [hep-ph].
[45] T. Schwetz, M. Tortola, and J. W. F. Valle, New J. Phys. 13, 109401
(2011); e-print arXiv:1108.1376 [hep-ph].
[46] W. C. Louis [LSND Collaboration], Prog. Part. Nucl. Phys. 40, 151
(1998).
[47] W. C. Louis [LSND Collaboration], Prog. Part. Nucl. Phys. 40 (1998)
151.
[48] M. Goeppel-Mayer, Phys. Rev. 48, 512 (1935).
[49] W.H. Furry, Phys. Rev, 56, 1184 (1939).
[50] R. Saakyan, Ann. Rev. Nucl. Part. Sci. 63, 503 (2013).
[51] N. Aghanim et al. (Plank Collaboration), Astronomy & Astrophysics
641, A6 (2020).
[52] M. Aker et al. (The KATRIN Collaboration), Nature Physics 18, 160
(2022).
[53] M. Agostini et al. (GERDA Collaboration), Phys. Rev. Lett. 125, 252502
(2020).
[54] I.J. Arnquist et al. (Majorana Collaboration), Phys. Rev. Lett. 130,
062501 (2023).
[55] S. Abe et al. (KamLAND-Zen Collaboration). Phys. Rev. Lett. 130,
051801 (2023).
334 Bibliography
[56] G. Anton et al. (EXO-200 Collaboration), Phys. Rev. Lett. 123, 161802
(2019).
[57] D.Q. Adams et al. (The CUORE Collaboration), Nature 604, 53 (2022).
[61] J. Engel and J. Menéndez, Rep. Prog. Phys. 80, 046301 (2017).
[62] O. Benhar, R. Biondi, and E. Speranza, Phys. Rev. C 90, 065504 (2014).
[72] D. J. Gross and F. Wilczek, Phys. Rev. D 8 (1973) 3633; Phys. Rev. D
9 (1974) 980.
[98] R. Aaij et al. [LHCb], Phys. Rev. Lett. 122, no.22, 222001 (2019)
doi:10.1103/PhysRevLett.122.222001 [arXiv:1904.03947 [hep-ex]].
[107] L. Maiani, A. D. Polosa and V. Riquer, Phys. Rev. D 100 (2019) no.7,
074002 doi:10.1103/PhysRevD.100.074002 [arXiv:1908.03244 [hep-ph]].
339
340 Index
V–A interaction, 143, 240, 243, 246, Z boson, 145, 249, 259
247, 260 Zeeman effect, 65, 98
vacuum, 110 zero point energy, 114
energy, 49