Understanding the Renormalisation Group
Understanding the Renormalisation Group
We’ve built up the technology of field theory and path integrals, and I’ve promised you
that this is sufficient to understand what happens at a second order phase transition.
But so far, we’ve made little headway. All we’ve seen is that as we approach the critical
point, fluctuations dominate and the Gaussian path integral is no longer a good starting
point. We need to take the interactions into account.
Let’s start by painting the big picture. As in the previous section, we’re going to
consider a class of theories based around a single scalar field (x) in d dimensions. (We
will consider more general set-ups in Section 4.) The free energy takes the now familiar
form,
Z
d 1 1
F[ ] = d x r · r + µ2 2
+g 4
+ ... (3.1)
2 2
In what follows, we will look at what happens as we approach the critical point from
above T ! Tc . All the important temperature dependence in (3.1) is sitting in the
quadratic term, with
µ2 ⇠ T Tc (3.2)
In contrast to the previous section, we will allow µ2 to take either sign: µ2 > 0 in the
disordered phase, and µ2 < 0 in the ordered phase where h i = 6 0.
There is one important change in convention from our earlier discussion: we have
rescaled the coefficient of the gradient term to be 1/2; we will see the relevance of this
shortly. All other terms have arbitrary coefficients.
– 53 –
So far we’ve focussed on just a few couplings, as shown in the free energy (3.1).
Here we’re going to expand our horizons. We’ll consider all possible terms in the free
energy, subject to a couple of restrictions. We’ll insist that the free energy is analytic
around = 0, so has a nice Taylor expansion, and we will insist on the Z2 symmetry
! , so that only even powers of arise. This means, for example, that we will
include the term 6 and (r2 )2 and 14 and 137 (r · r )r2 and so so. Each of these
terms comes with its own coupling constant. However, we don’t include terms like 17
because this violates the Z2 symmetry, nor 1/ 2 because this is not analytic at = 0.
Next, consider the infinite dimensional space, parameterised by the infinite number
of coupling constants. We will call this the theory space (although I should warn you
that this isn’t standard terminology). You should have in your mind something like
this:
Note that I’ve written the exponent as e F rather than e F . This is because the overall
power of does nothing to a↵ect the physics; all the relevant temperature dependence
is in the coefficient (3.2) while, for the quantities of interest near the critical point, this
overall factor can be set to ⇡ 1/Tc . You can think that we’ve simply rescaled this
into the field .
There is one more ingredient that we need to make sense of the path integral (3.3).
This is the UV cut-o↵ ⇤. Recall that, implicit in our construction of the theory is the
requirement that the Fourier modes k vanish for suitably high momenta
k = 0 for k > ⇤
– 54 –
This arises because, ultimately our spins sit on some underlying lattice which, in turn,
was coarse-grained into boxes of size a. The UV cut-o↵ is given by ⇤ ⇠ 1/a.
Until now, the UV cut-o↵ has taken something of a back seat in our story, although
it was needed to render some of the path integral calculation in the previous section
finite. Now it’s time for ⇤ to move centre stage. As we will explain, we can use the
cut-o↵ to define a flow in the space of theories.
Suppose that we only care about physics on long distance scales, L. Then we have
no real interest in the Fourier modes k with k 1/L. This suggests that we can write
down a di↵erent theory, that has a lower cut-o↵,
⇤
⇤0 =
⇣
for some ⇣. As long as ⇤0 1/L, the scale of interest, our theory can tell us everything
that we need to know. Moreover, we know, at least in principle, how to construct such
a theory. We write our Fourier modes as
+
k = k + k
There are several other names for these variables that are used interchangeably. The
modes k and + k are also referred to as low- and high-energy modes or, importing
language from quantum mechanics, slow and fast modes, respectively. In a rather
quaint nod to the electromagnetic spectrum, the short-distance, microscopic physics
that we care little about is often called the ultra-violet; the long-distance physics that
we would like to understand is the infra-red.
– 55 –
Here FI [ k , +
k ] involves the terms which mix the short and long-wavelength modes.
The partition function (3.3) can then be written as
Z Y Z Y Z Y
F0 [ + +
Z= d ke = F
d k e F0 [ k ]
d +k e
k ] e FI [ k , k ]
We write this as
Z
F 0[ ]
Z= D e
where F 0 [ ] is known as the Wilsonian e↵ective free energy. (In fairness, this term
is rarely used: you’re more likely to hear “Wilsonian e↵ective action” to describe the
analogous object in a path integral describing a quantum field theory.) We’re left with
a free energy which describes the long-wavelength modes, but takes into account the
e↵ects of the short wavelength modes. It is defined by
Z Y
F 0[ ] F0 [ + +
e =e F0 [ k ]
d +
k e
k ] e FI [ k , k ] (3.4)
⇤0 <k<⇤
In subsequent sections, we’ll put some e↵ort into calculating this object. However, at
the end of the day the new free energy F 0 [ ] must take the same functional form as
the original free energy (3.1), simply because we started from the most general form
possible. The only e↵ect of integrating out the high-momentum modes is to shift the
infinite number of coupling constants, so we now have
Z
0 1 0 1
F [ ]= d x d
r · r + µ0 2 2 + g 0 4 + . . . (3.5)
2 2
We would like to compare the new free energy (3.5) with the original (3.1). However,
we’re not quite there yet because, the two theories are di↵erent types of objects – like
apples and oranges – and shouldn’t be directly compared. This is because the theory
is defined by both the free energy and the UV cut-o↵ and, by construction our two
theories have di↵erent cut-o↵s. This means that the original theory F [ ] can describe
things that the new theory F 0 [ ] cannot, namely momentum modes above the cut-o↵
⇤0 .
It is straightforward to remedy this. We can place the two theories on a level playing
field by rescaling the momenta in the new theory. We define
k0 = ⇣k
– 56 –
Now k 0 takes values up to ⇤, as did k in the original theory. The counterpart of this
scaling in real space is
x
x0 =
⇣
This means that all lengths scales are getting smaller. You can think of this step as
zooming out, to observe the system on larger and larger length scales. As you do so,
all features become smaller.
There is one final step that we should take. The new theory F 0 [ ] will typically have
some coefficient 0 6= 1 in front of the leading, quadratic gradient term. To compare
with the original free energy (3.1), we should rescale our field. We define
0
p
= 0
k0 k
0
p
(x0 ) = 0 (x) (3.6)
We see that this procedure induces a continuous map from ⇣ 2 [1, 1) onto the space
of coupling constants. Our original coupling constants in (3.1) are those evaluated at
⇣ = 1. As we increase ⇣, we trace out curves in our theory space, that look something
like the picture shown in Figure 22
We say that the coupling constants flow, where the direction of the flow is telling us
what the couplings look like on longer and longer length scales. The equations which
describe these flows – which we will derive shortly – are known, for historic reasons, as
beta functions.
These, then are the three steps of what is known as the renormalisation group (RG):
• Rescale the fields so that the gradient term remains canonically normalised.
– 57 –
Figure 22: Flows in theory space; the arrows are in the direction of increasing ⇣.
You may wonder why we didn’t just include a coupling constant (⇣) for the gradient
term, and watch that change too. The reason is that we can always scale this away by
redefining . But is just a dummy variable which is integrated over the path integral,
so this rescaling can’t change the physics. To remove this ambiguity, we should pin
down the value of one of the coupling constants, and the gradient term (r )2 is the
most convenient choice. If we ever find ourselves in a situation where (⇣) = 0 for
some ⇣ then we would have to re-evaluate this choice. (We’ll actually come across an
example where it’s sensible to make a di↵erent choice in Section 4.3.)
The “renormalisation group” is not a great name. It has a hint of a group structure,
because a scaling by ⇣1 followed by a scaling by ⇣2 gives the same result as a scaling by
⇣1 ⇣2 . However, unlike for groups, there is no inverse: we can only integrate out fields,
we can’t put them back in. A more accurate name would be the “renormalisation
semi-group”.
– 58 –
briefly in Section 1.3, one constructs a magnetisation field m(x) by coarse-graining
over boxes of size a, each of which contains many lattice sites. One can ask how the
free energy changes as we increase a, a procedure known as blocking. Ultimately this
leads to the same picture that we built above.
Of course, we care little about most of these details so, in an attempt to simplify
our lives we perform a renormalisation group transformation, integrating out short dis-
tance degrees of freedom to generate a new theory which describes the long wavelength
physics. And then we do this again. And then we do this again. Where do we end up?
There are essentially two possibilities: we could flow o↵ to infinity in theory space,
or we could converge towards a fixed point. These are points which are invariant under
a renormalisation group transformation. (One could also envisage further possibilities,
such as converging towards a limit cycle. It turns out that these can be ruled out in
many theories of interest.)
Our interest here lies in the fixed points. The second step in the renormalisation
group procedure ensures that fixed points describe theories that have no characteristic
scale. If the original theory had a correlation length scale ⇠, then the renormalised
theory has a length scale ⇠ 0 = ⇠/⇣. (We will derive this statement explicitly below
when we stop talking and start calculating.) Fixed points must therefore have either
⇠ = 0 or ⇠ = 1.
In the disordered phase, with T > Tc , enacting an RG flow reduces the correlation
length. Pictorially, we have
RG flow
!
In this case, shrinking the correlation length is equivalent to increasing the temperature.
The end point of the RG flow, at ⇠ = 0, is the infinite temperature limit of the theory.
– 59 –
This is rather like flowing o↵ to infinity in theory space. As we will see, it is not
uncommon to end up here after an RG flow. But it is boring.
Similarly, in the ordered phase the RG flow again reduces the correlation length,
RG flow
!
Now the end point at ⇠ = 0 corresponds to the zero temperature limit; again, it is a
typical end point of RG flow but is dull.
Theories with ⇠ = 1 are more interesting. As we saw above, this situation occurs
at a critical point where the theory contains fluctuations on all length scales. Now, if
we do an RG flow, the theory remains invariant. In terms of our visual configurations,
RG flow
!
Note that the configuration itself doesn’t stay the same. (It is, after all, merely a
representative configuration in the ensemble.) However, as the fluctuations on small
distance scales shrink away due to RG, they are replaced by fluctuations coming in
from larger distance scales. The result is a theory which is scale invariant. For this
reason, the term “critical point” is often used as a synonym for “fixed point” of the
RG flow.
– 60 –
Figure 23: The critical surface. Irrelevant deformations from the critical point are shown
as dotted blue lines; the relevant deformation is is red.
can move away from the fixed point. If we move in some of these directions, the RG
flow will take us back towards the fixed point. These deformations are called irrelevant
because if we add any such terms to the free energy we will end up describing the same
long-distance physics.
In contrast, there will be some directions in which the RG flow will sweep us away
from the fixed point. These deformations are called relevant because if we add any such
terms to the free energy, the long-distance physics will be something rather di↵erent.
Examples of relevant and irrelevant deformations are shown in Figure 23. Much of the
power of universality comes from the realisation that the vast majority of directions
are irrelevant. For a given fixed point, there are typically only a handful of relevant
deformations, and an infinite number of irrelevant ones. This means that our fixed
points have a large basin of attraction, huge slices of the infinite dimensional theory
space all converging to the same fixed point. The basin of attraction for a particular
fixed point is called the critical surface.
Finally, it’s possible that our fixed point is not a point at all, but a line or a higher
dimensional surface living within theory space. In this case, if we deform the theory
in the direction of the line, we will not flow anywhere, but simply end up on another
fixed point. Such deformations are called marginal; they are rare, but not unheard of.
– 61 –
Why High Energy Physics is Hard
Universality is a wonderful thing if you want to understand the low-energy, long-
wavelength physics. It tells you that you can throw away many of the microscopic
details because they are irrelevant for the things that you care about.
In contrast, if you want to understand the high-energy, short distance physics then
universality is the devil. It tells you that you have very little hope of extracting any
information about microscopic degrees of freedom if you only have access to information
at long distances. This is because many di↵erent microscopic theories will all give the
same answer.
As we saw in Section 2.3, quantum field theory is governed by the same mathematical
structure as statistical field theory, and the comments above also apply. Suppose, for
example, that you find yourself living in a technologically adolescent civilisation that
can perform experiments at distance scales of 10 16 cm or so, but no smaller. Yet, what
you really care about is physics at, say, 10 32 cm where you suspect that something
interesting is going on. The renormalisation group says that you shouldn’t pin your
hopes on learning anything from experiment.
The renormalisation group isn’t alone in hiding high-energy physics from us. In
gravity, cosmic censorship ensures that any high curvature regions are hidden behind
horizons of black holes while, in the early universe, inflation washes away any trace
of what took place before. Anyone would think there’s some kind of conspiracy going
on....
3.2 Scaling
The idea that second order phase transitions coincide with fixed points of the renor-
malisation group is a powerful one. In particular, it provides an organising principle
behind the flurry of critical exponents that we met in Section 1.
As we explained above, at a fixed point of the renormalisation group any scale must
be washed away. This is already enough to ensure that correlation functions must take
the form of a power-law,
1
h (x) (0)i ⇠ (3.8)
rd 2+⌘
Any other function would require a scale on dimensional grounds. The only freedom
that we have is in the choice of exponent which we have chosen to parameterise as ⌘.
One of the tasks of the RG procedure is to compute ⌘, and we will see how this works
in Section 3.5.
– 62 –
However, even here there’s something of a mystery because usually we can figure out
the way things scale by doing some simple dimensional analysis. (If you would like to
refresh your memory, some examples of dimensional analysis can be found in Chapter
3 of the lectures on Dynamics and Relativity.) What does that tell us in the present
case?
We will measure dimension in units of inverse length. So, for example, [x] = 1
while [@/@x] = +1. The quantity F [ ] must be dimensionless because it sits in the
exponent of the partition function as e F . The first term is
Z
1
F [ ] = dd x r · r + . . .
2
From this we learn that
d 2
[ ]= (3.9)
2
Which, in turn, tells us exactly what the exponent of the correlation function must be:
⌘ = 0.
This is sobering. Dimensional analysis is one of the most basic tools that we have,
and yet it seems to be failing at critical points where experiment is showing that ⌘ 6= 0.
What’s going on?
We still haven’t explained why the scaling dimension di↵ers from engineering dimen-
sion. The culprit turns out to be the third step of the RG procedure (3.6) where the
field gets rescaled. In real space, this is viewed as coarse-graining over blocks of
larger and larger size a. As we do so, it dresses with this UV cut-o↵ scale ⇤ ⇠ 1/a,
often in a complicated and non-intuitive way. This means that the correlation function
(3.8) is actually
a⌘
h (x) (0)i ⇠
rd 2+⌘
– 63 –
which is in full agreement with naive dimensional analysis. We can work with usual
engineering dimensions if we keep track of this microscopic distance scale a. But it is
much more useful to absorb this into and think of a coarse-grained observable, with
dimension , that is the appropriate for measuring long distance correlations.
⌫ |T Tc |
⇠⇠t with t = (3.11)
Tc
Here t is called the reduced temperature, while ⌫ is another critical exponent that we
will ultimately have to calculate. Since ⇠ is a length scale, it transforms simply as
⇠ ! ⇠/⇣. In other words, it has scaling dimension ⇠ = 1. The meaning of the
critical exponent ⌫ is that the reduced temperature scales as t ! ⇣ t t, with
1
t = (3.12)
⌫
In what follows, our only assumption is that the correlation length ⇠ is the only length
scale that plays any role.
We start with the thermodynamic free energy, Fthermo (t), evaluated at B = 0. This
takes the form
Z
Fthermo (t) = dd x f (t)
Because Fthermo is scale invariant at the fixed point, f (t) must have scaling dimension
d, which immediately tells us that
f (t) ⇠ td⌫
There is an intuitive way to understand this. At T close to Tc , the spins are correlated
over distances scales ⇠, and can be viewed as moving as one coherent block. The free
energy Fthermo is extensive, and so naturally scales as F ⇠ (L/⇠)d ⇠ td⌫ .
– 64 –
From the thermodynamic free energy, we can compute the singular contribution to
the heat capacity near t = 0. It is
@ 2f
c⇠ ⇠ td⌫ 2
⇠t ↵
@t2
where the second relationship is there to remind us that we already had a name for the
critical exponent related to heat capacity. We learn that
↵=2 d⌫ (3.13)
This is called the Josephson relation or, alternatively, the hyperscaling relation.
The next critical exponent on the list is . Recall that this relates the magnetisa-
tion in the ordered phase – which we used to call m and have now called – to the
temperature as
⇠t
B = ) = ⌫(2 ⌘) (3.15)
⌫
which is sometimes called Fisher’s identity. Once again, there is an intuitive way to
understand this. The meaning of ⇠ is that the spins are no longer correlated at distances
– 65 –
r ⇠. This can be seen, for example, in our original formula (2.29). Using our earlier
expression (2.28) for the susceptibility, we have
Z ⇠
1
⇠ dd x d 2+⌘ ⇠ ⇠ 2 ⌘ ⇠ t ⌫(2 ⌘)
0 r
which again gives = ⌫(2 ⌘).
The final critical exponent relates the magnetisation to the magnetic field B when
we sit at the critical temperature t = 0. It should come as little surprise by now to
learn that this is again fixed by scaling analysis
B d+2 ⌘
⇠ B 1/ ) = = (3.16)
d 2+⌘
We end up with four equations, relating ↵ (3.13), (3.14), (3.15) and (3.16) to the
critical exponents ⌘ and ⌫. For convenience, let’s recall what values we claimed this
exponents take:
↵ ⌘ ⌫
4 d 1 1
MF 2 2
1 3 0 2
1 7 1
d=2 0 8 4
15 4
1
d=3 0.1101 0.3264 1.2371 4.7898 0.0363 0.6300
where we’ve used the result (2.14), including quadratic fluctuations, for the mean field
value of ↵. We see that the relations are satisfied exactly for d = 2 and to within the
accuracy stated for d = 3. However, there’s a wrinkle because they only agree with the
mean field values when d = 4!
This latter point is an annoying subtlety and will be explained in Section 3.3.2. Our
main task is to understand why the mean field values don’t agree with experiment when
d < 4.
– 66 –
We’re interested in operators which, in the vicinity of a given point, transform simply
under RG. Specifically, suppose that, under the rescaling x ! x0 = x/⇣, the operator
has a well defined scaling dimension, transforming as
O(x) ! O0 (x0 ) = ⇣ O
O(x) (3.18)
You can think of such operators as eigenstates of the RG process. From the free energy
(3.17), the scaling dimension of the coupling is
gO =d O
• Relevant if O <d
• Irrelevant if O >d
• Marginal if O =d
The tricky part of the story is that it’s not always easy to identify the operators O
which have the nice scaling property (3.18). As we’ll see in the examples below, these
are typically complicated linear combinations of the operators n and n (r )2 and so
on.
where we’ve added a subscript to the coefficient µ20 in anticipation the fact that this
quantity will subsequently change under RG flow.
Because the free energy is quadratic in , it has the property that there is no mixing
between the short and long wavelength modes, and so factorises as
+
F0 [ ] = F0 [ ] + F0 [ ]
– 67 –
Integrating over the short wavelength modes is now easy, and results in an overall
constant in the partition function
Z
F 0[ ] +
e = D + e F0 [ ] e F0 [ ] = N e F0 [ ]
This constant N doesn’t change any physics; it just drops out when we di↵erentiate
log Z to compute correlation functions. However, we’re not yet done with the our RG;
we still need to do the rescaling
k0 = ⇣k and 0
k0 =⇣ w
k (3.20)
where w is constant that we will determine. Written in terms of the rescaled momenta,
we have
Z ⇤/⇣ d
d k 1 2
F0 [ ] = (k + µ20 ) k k
(2⇡)d 2
Z ⇤ d 0 ✓ 02 ◆
d k 1 k
= + µ0 ⇣ 2w 0k0 0 k0
2
(2⇡)d 2⇣ d ⇣ 2
We can put this back in the form we started with if we take
d+2
w= (3.21)
2
leaving us with
Z ⇤
dd k 1 2
F00 [ 0 ] = (k + µ2 (⇣)) 0 0
k k
(2⇡)d 2
The only price that we’ve paid for this is that the coefficient of quadratic term has
become
This illustrates how the length scales in the problem transform under RG. Recall that
the correlation length (2.22) is ⇠ 2 ⇠ 1/µ2 . We see that, under an RG procedure,
⇠
⇠!
⇣
The fixed points obey dµ2 /d⇣ = 0. As we anticipated previously, there are two of them.
The first is µ2 = 1. This corresponds to a state which has infinite temperature. It is
not where our interest lies. The other fixed point is at µ2 = 0. This is known as the
Gaussian fixed point.
– 68 –
3.3.1 In the Vicinity of the Fixed Point
As we mentioned previously, we would like to classify fixed points by thinking about
what happens when you sit near them. Do you flow into the fixed point, or get pushed
away?
We already have the answer to this question in one direction in coupling space. If
we add the term µ2 2 , the scaling (3.22) tells us that µ2 gets bigger as we flow towards
the infra-red. This is an example of a relevant coupling: turning it on pushes us away
from the fixed point.
Here is another example: it is simple to repeat the steps above including the term
↵0 (r2 )2 in the free energy. Upon RG, this coupling flows as ↵(⇣) = ⇣ 2 ↵0 . It is an
example of an irrelevant coupling, one which becomes less important as we flow towards
the infra-red.
where, to keep the Z2 symmetry, we restrict the sum to n even. Here things are a
little more subtle because, once we turn these couplings, on the first step of the RG
procedure is no longer so simple. Integrating out the short distance modes will shift
each of these couplings,
We will learn how to calculate the gn in section 3.4. But, for now, let’s ignore this
e↵ect and concentrate on the second and third parts of the RG procedure, in which we
rescale lengths and fields as in (3.20). In this approximation, the operators n enjoy
the nice scaling property (3.18),
x0 = x/⇣ and 0
(x0 ) = ⇣ (x)
To restore the coefficient of the gradient term, we pick the scaling dimension
d 2
=
2
– 69 –
µ2 µ2
d<4 d>4
g g
Figure 24: RG flows when d < 4 Figure 25: RG flows when d > 4
For once, the scaling dimension coincides with the engineering dimension (3.9): =
[ ]. This is because we’re looking at a particularly simple fixed point. Note that this
is related to our earlier result (3.21) by = d w, with the extra factor of d coming
R d
from the d k in the definition of the Fourier transform.
where
gn (⇣) = ⇣ d n
g0,n = ⇣ (1 n/2)d+n
g0,n (3.24)
We see that the way these coupling scale depends on the dimension d. For example,
the coefficient for 4 scales as
g4 (⇣) = ⇣ 4 d g0,4
We learn that 4 is irrelevant for d > 4 and is relevant for d < 4. According to the
analysis above, when d = 4, we have g4 (⇣) = g0,4 and the coupling is marginal. In this
case, however, we need to work a little harder because the leading contribution to the
scaling will come from the corrections g4 that we neglected. We’ll look at this in the
next section.
– 70 –
We can tally this with our discussion in Section 3.2. The fact that the scaling
dimension coincides with the naive engineering dimension [ ] immediately tells us
that ⌘ = 0. Meanwhile, the scaling of µ2 ⇠ t is given by t = [g2 ] = 2, which tells us
that ⌫ = 1/2. From this we can use (3.13) - (3.16) to extract the remaining critical
exponents. These agree with mean field for d = 4, but not for d 4. (We will address
the situation in d > 4 in Section 3.3.2.)
g6 (⇣) = ⇣ 6 2d
g0,6
So far, this all looks rather trivial. However, things become much more interesting
at other fixed points. For example, around most fixed points n 6= n . Indeed,
n
around most fixed points neither nor will have well defined scaling dimension;
instead those operators to which one can assign a scaling dimension consist of some
complicated linear combination of the n . We will start to understand this better in
Section 3.4.
– 71 –
p
B
liquid
T
TC gas
T
Tc
Figure 26: The phase diagram of the Figure 27: The phase diagram of the
Ising model. liquid-gas system.
4
(Of course, mean field still retains the term in the ordered phase, where it is needed
to stabilise the potential.)
However, suppose that we sit outside of this class and turn on interactions n with n
odd. The leading order e↵ect is the magnetic field B that we included in our original
Ising model. This is always a relevant interaction. This means that if we want to hit
the critical point, we must tune this to zero.
It may be more natural to tune B = 0 in some systems that others. For example,
a magnet in the Ising class automatically has B = 0 unless you choose to submit it to
a background magnetic field. This means that it’s easy to hit the critical point: just
heat up a magnetic and it will exhibit a second order phase transition.
In contrast, in the liquid gas system, setting “B = 0” is less natural. Unlike in the
Ising model, there is no Z2 symmetry manifest in the microscopic physics of gases.
Instead, it is an emergent symmetry which relates the density of liquid and gas states
at the phase transition. Correspondingly, if we simply take a liquid and heat it up then
we’re most likely to encounter a first order transition, or no transition at all. If we want
to hit the critical point, we must now tune the two relevant operators: temperature µ2
and pressure, which corresponds to the linear term with coefficient B.
In both situations above we really need to tune two relevant couplings to zero to hit
the critical point. Of these, one is even under Z2 and one is odd under Z2 . Doing this
– 72 –
will allow us to hit a fixed point with two relevant deformations, one even one odd.
This is the Gaussian fixed point in d > 4 and is something else (to be described below)
in d < 4.
What about higher order interactions n with n odd. If we have to tune , do we not
also need to tune 3 ? It turns out that that the 3 interaction is redundant. If you have
a free energy with no Z2 symmetry, and all powers of n , then you can always redefine
your field as ! + c for some constant c. This freedom allows you to eliminate
the 3 term. Note that if your free energy enjoys the Z2 symmetry ! then it
prohibits you from making this shift.
However, there’s one tricky issue that we haven’t yet explained: the mean field
exponents agree with the scaling analysis of Section 3.2 only when d = 4. Comparing
the two results, we have
↵ ⌘ ⌫
4 d 1 1
MF 2 2
1 3 0 2
4 d d 2 d+2 1
Scaling 2 4
1 d 2
0 2
where we’ve used the result (2.14), including quadratic fluctuations, for the mean field
value of ↵. This agrees with the scaling analysis. However, for d > 4, the exponents
and di↵er. It turns out that the results from Landau mean field are correct, and
those from the scaling analysis are wrong. Why?
To understand this, let’s recall our scaling argument from Section 3.2. We set B = 0
and focus on the critical exponent . The magnetisation scales with the temperature
t = |T Tc |/Tc as
m⇠t
Here m is identified with the scalar field . Scaling analysis gives = t . But both
mean field and scaling analysis agree that t = 1/⌫ = 2 and = (d 2)/2, and this
gives = (d 2)/4, rather than the mean field result = 1/2.
– 73 –
However, we were a little quick in the scaling analysis because we neglected the
quartic coupling g4 . Mean field really told us (1.31),
✓ ◆1/2
t
m0 ⇠
g4
But both t and g4 scale under RG flow. The scaling dimension of g4 is g4 = 4 d and
now the mean field result, with = 1/2 is fully compatible with scaling.
There’s a more general lesson to take from this. It is tempting, when doing RG,
to think that we can just neglect the irrelevant operators because their coefficients
flow to zero as we approach the infra-red. However, sometimes we will be interesting
in quantities – such as the magnetisation above – which have the irrelevant coupling
constants sitting in the denominator. In this case, one cannot just blindly ignore
these irrelevant couplings as they a↵ect the scaling analysis. When this happens, the
irrelevant coupling is referred to as dangerously irrelevant.
Our field theory description will respect the D8 symmetry of the underlying lattice
model, together with the Z2 symmetry ! which ensures that fields come in pairs.
But this would appear to be much less powerful than the full O(2) continuous rotation
and reflection symmetry. Have we cheated?
Let’s see what kind of terms we might expect. First, there are some simple terms
that are prohibited by the dihedral symmetry. For example a lone term (@1 )2 would
break the x1 ! x2 discrete rotational symmetry and so would not appear in the free
energy. Similarly, a term @1 breaks the x1 ! x1 symmetry. (On top of this, it
is also a total derivative and so doesn’t contribute to the free energy.) The lowest
dimension term that includes derivatives and is compatible with the discrete symmetry
is
O2 ⇠ (@1 )2 + (@2 )2
– 74 –
But this term happens to be invariant under the full, continuous O(2) rotational sym-
metry. We should keep going. The first term that preserves D8 , but not SO(2), is
O4 ⇠ @14 + @24
There is no reason not to add such terms to the free energy and, in general, we expect
that these will be present in any field theoretic description that accurately describes
the microscopic physics. However, this operator has dimension O4 = d + 2 and so
is irrelevant. This means that it gets washed away by the renormalisation group, and
the long wavelength physics is invariant under the full O(2) symmetry. We say that
the continuous rotational symmetry is emergent. A similar argument holds for higher
dimensions.
We now repeat our RG procedure, but with a di↵erent starting point in theory space,
Z
d 1 1
F[ ] = d x r · r + µ20 2 + g0 4
2 2
The renormalisation group procedure tells us to split the Fourier modes of the field into
long and short wavelengths,
+
k = k + k (3.25)
where we take F0 [ ] to coincide with the quadratic terms (3.19), and the interaction
terms are
Z
FI [ ] = dd x g 0 4
Note that we’ve chosen to include, for example, ( )4 in the interaction terms rather
than F0 . This is a matter of convention.
– 75 –
The e↵ective free energy for k,defined in (3.4), is given by
Z
F 0[ F0 [ + +
e ]
=e F0 [ k ]
D +
k e
k ] e FI [ k , k ]
R
There is a nice interpretation of this functional integral D + k . We can think of it
FI [ k , + ] +
as computing the expectation value of e k , treating
k as the random variable
F0 [ + ]
with Gaussian distribution e k . In other words, we can write this as
D E
F 0[ ] F0 [ k ] FI [ k , + ]
e =e e k
+
where the subscript on h·i+ is there to remind us that we are averaging over the + k
modes only. We take the definition of the path integral to be suitably normalised so
that h1i+ = 1. Taking the log of both sides,
D E
0 FI [ k , + ]
F [ ] = F0 [ k ] log e k (3.26)
+
3.4.1 Order g0
⌦ ↵
At leading order in g0 , we need to compute FI [ k , +k ] + . The first order of business
is to expand the interaction terms (3.26) in Fourier modes. We have
Z Y4 X
+ d d ki d d
FI [ k, k] = g0 ⇥ (terms with ) ⇥ (2⇡) ( ki )
i=1
(2⇡)d i
There are five di↵erent “terms with ”, most of which do not give anything interesting.
These five terms are:
– 76 –
i) k1 k2 k3 k4 : This term doesn’t include any + k , the average is trivial. It carries
R d 4
over to give the term g0 d x in the e↵ective free energy.
ii) 4 k1 k2 k3 +
k4 : This term has just a single
+
k and so vanishes when averaged over
the Gaussian ensemble.
iii) 6 k1 k2 + +
k3 k4 : This term is interesting. We will look at it more closely below.
For now, note that the factor of 6 comes from the di↵erent choices of momentum
labels.
iv) 4 k1 + + + +
k2 k3 k4 : This term is cubic in k and, like any term with an odd number
of insertions, vanishes when averaged over the Gaussian ensemble.
+ + + +
v) k 1 k2 k 3 k 4 :
This term doesn’t include any k, it simply gives constant to the
free energy. It will not be important here.
We learn that we need to compute just a single term,
Z Y4 X
d d ki ⌦ + +↵ d d
hFI i+ = 6g0 ⇥ ⇥ (2⇡) ( ki ) (3.28)
i=1
(2⇡)d k1 k2 k3 k4 +
i
But this is the same kind of correlation function that we computed in Section 2.2: it is
given by
⌦ ↵ 1
+ +
k k0 + = (2⇡)d d (k + k0 ) G0 (k) with G0 (k) = (3.29)
k 2 + µ20
After playing around with the delta-functions, and relabelling momentum variables,
we’re left with our first correction to the free energy,
Z ⇤/⇣ d Z ⇤
d k dd q 1
hFI i+ = 6g0 d k k d 2 2
0 (2⇡) ⇤/⇣ (2⇡) q + µ0
R
where the limits on the dq integral reflect the fact that we’ve only integrated out the
short wavelength modes, whose momenta lie within this band. We see that, at order
g0 , we get a correction only to the quadratic term whose coefficient becomes
Z ⇤
2 02 2 dd q 1
µ0 ! µ = µ0 + 12g0 d 2 2
(3.30)
⇤/⇣ (2⇡) q + µ0
Finally, we should enact the rescaling steps of the renormalisation group. This takes
the same form as before (3.20),
d+2
k0 = ⇣k and 0
k =⇣ w
k/⇣ with w =
2
– 77 –
This gives the same scaling of the parameters that we saw in Section 3.3. We have
✓ Z ⇤ ◆
2 2 dd q 1
µ (⇣) = ⇣ µ20 + 12g0 and g(⇣) = ⇣ 4 d g0 (3.31)
⇤/⇣ (2⇡) q + µ20
d 2
The upshot of this calculation is that turning on a 4 coupling will give rise to a
quadratic 2 coupling under RG flow. This is typical of these kinds of calculations:
couplings of one type will induce others.
The coefficient of the 2 term is particularly important for our story, since the critical
point is defined to be the place where this vanishes. We see that it’s not so easy to
make this happen. You can’t simply set µ20 = 0 at some high scale and expect to hit
criticality. Indeed, the result (3.31) tells us that, at long wavelengths, the “natural”
value is µ2 ⇠ g0 ⇤d 2 , which is typically large. If you want to hit the critical point, you
must “fine tune” the original coefficient µ20 to cancel the new terms that are generated
by RG flow.
You might think that this calculation answers the question of what happens to the
theory in d = 4 when we turn on g 4 . It certainly tells us that turning on this coupling
will induce the relevant coupling µ2 2 and so take us away from the Gaussian fixed
point. However, a closer look at (3.31) reveals that it’s possible to turn on a combination
of g0 and µ20 , so that µ2 (⇣) remains zero. This combination is a marginal coupling. We
learn that, at this order, there remains one relevant and one marginal deformation.
1
F 0[ ]⇠ hFI2 i hFI i2 (3.32)
2
Expanding out hFI2 i, we find 256 di↵erent terms. We will see how to organise them
shortly, but for now we make a few comments before focussing on the term of interest.
Some of the terms in hFI2 i will result in corrections that cannot be written as a local
R d 2
free energy, but are instead of the form d xf ( ) for some f ( ). These terms will
be cancelled by the hFI i2 terms. This is a general phenomena which you can learn more
about in the lectures on Quantum Field Theory. In terms of Feynman diagrams, which
we will introduce below, these kind of terms correspond to disconnected diagrams.
– 78 –
The terms that we care about in hFI2 i are those which can be written as local cor-
rections to the free energy. Of immediate interest for us are a subset of terms in
R
FI2 ⇠ dd x dd y 4 (x) 4 (y), given by
✓ ◆2 Z ⇤/⇣ Y4 d Z ⇤ Y4 d
1 2 1 4 2 d ki d qj + + + +
hFI i+ ⇠ g0 ki h q1 q2 q3 q4 i + (3.33)
2 2 2 0 i=1
(2⇡)d ⇤/⇣ j=1 (2⇡)
d
Let’s explain what’s going on here. Each (x) is decomposed into Fourier modes and
+
. The same is true for each (y). In the above term, we have chosen two out of
4 4 +
the (x) and two out of the (y); the remaining terms are . Each combinatoric
4
factor 2 = 6 out front reflects the choice of picking two from 4 . Meanwhile,
R d R d
the two delta functions come from doing the d x and d y integrals respectively.
Matching the momenta in the arguments of the delta functions to the ± tells us that
we’ve picked two from the 4 (x) and two from 4 (y) (as opposed to, say, all
four from 4 (y)).
+ + + +
To proceed, we need to compute the four-point function h q1 q2 q3 q4 i + . To do this
we need a result known as Wick’s theorem.
Wick’s Theorem
As we proceed in our perturbative expansion, the integrals start to blossom. From the
form of the expansion (3.27), we can see that the integrand will involve expectation
values of the form h + +
k1 . . . kl i+ . There is a simple way to compute expectation values
of this type in Gaussian ensembles. This follows from:
Lemma: Consider n variables a drawn from a Gaussian ensemble. This means that,
for any function f ( ), the expectation value is
Z 1
1 1
·G 1
hf ( )i = dn f ( ) e 2
N 1
for some invertible n ⇥ n matrix G. The normalisation factor is N = det1/2 (2⇡G) and
ensures that h1i = 1. The following identity then holds:
⌦ ↵ 1
e Ba a
= e 2 Ba h a b iBb
(3.34)
– 79 –
Proof: This is straightforward to show since we can just evaluate both sides
Z 1
⌦ Ba a ↵ 1 1 1
e = dn y e 2 ·G +B·
N 1
Z 1
1 1 1 1 1 1
= dn y e 2 ( GB)·G ( GB) e 2 B·GB = e 2 B·GB = e 2 Ba h a b iBb
N 1
The Taylor expansion of the identity (3.34) gives us the expressions that we want.
The left-hand-side is
⌦ Ba a ↵ 1 1 1
e = 1 + Ba h a i + Ba Bb h a b i + Ba Bb Bc h a b c i + Ba Bb Bc Bd h a b c d i + . . .
2 3! 4!
Meanwhile, the right-hand-side is
1 1 1
e 2 Ba h a b iBb
= 1 + Ba Bb h a bi + Ba Bb Bc Bd h a b ih c d i + ...
2 8
Now we just match powers of Ba on both sides. We immediately learn that
Our real interest is in l even. Here we have to be a little careful because multiplying by
the string of B’s automatically symmetrises the products of h a b i over the a = 1, . . . , n
indices. So, for example, comparing the B 4 terms gives
h a b c di =h a b ih c d i +h a c ih b d i +h a d ih b c i (3.35)
This leaves us with a sum over all pairwise contractions. This result is known as Wick’s
theorem.
h + +
q q0 i + = (2⇡)d d (q + q0 ) G0 (q)
– 80 –
The trick is to see how these new delta functions combine with the original delta
functions in (3.33). There are two di↵erent structures that emerge. The first term,
gives (ignoring factors of 2⇡ for now)
Z Y
4
d d qj h + + + + d
q1 q2 i+ h q3 q4 i+ (k1 + k2 + q1 + q2 ) d
(k3 + k4 + q3 + q4 )
j=1
Z
⇠ dd q2 dd q4 G0 (q2 )G0 (q4 ) d
(k1 + k2 ) d
(k3 + k4 ) (3.36)
We’re still left with two delta functions over the k variables. This means that when
we go back to real space, this term does not become a local integral. Instead, if you
R d 2
follow it through, it becomes a double integral of the form d x (x)2 . As we
explained after (3.32), these terms are ultimately cancelled by corresponding terms in
hFI i2 . They are not what interests us.
+ + + +
Instead, we care about the second and third terms in the Wick expansion of h q 1 q 2 q 3 q4 i + .
Each of them gives a contribution of the form
Z Y
4
d d qj h + + + + d
q1 q3 i+ h q2 q4 i+ (k1 + k2 + q1 + q2 ) d
(k3 + k4 + q3 + q4 )
j=1
Z
⇠ dd q1 dd q2 G0 (q1 ) G0 (q2 ) d
(k1 + k2 + q1 + q2 ) d
(k3 + k4 q1 q2 )
Z
⇠ dd q G0 (q) G0 (|k1 + k2 + q|) d
(k1 + k2 + k3 + k4 ) (3.37)
R
where, in going to the last line, we have done the integral dd q2 and relabelled q1 = q.
Now we have just a single delta function over ki and, correspondingly, when we go back
to real space this will give a local contribution to the free energy. Indeed, the terms
(3.33) now become
✓ ◆2 Z ⇤/⇣ Y4 d X
1 2 4 2 d ki
hF i+ ⇠ g0 f (k1 + k2 ) (2⇡)d d ( ki ) (3.38)
2 I 2 0 i=1
(2⇡)d ki
i
where the factor of 12 in (3.33) has disappeared because we get two contributions from
the Wick expansion, each of which gives the same contribution (3.37). The remaining
R
integral over dd q is hidden in the function f (k), which is given by
Z ⇤
dd q 1 1
f (k1 + k2 ) = (3.39)
⇤/⇣ (2⇡) q + µ0 (k1 + k2 + q)2 + µ20
d 2 2
– 81 –
This is not as complicated as it looks. We can write it as
Z ⇤
dd q 1
f (k1 + k2 ) = d 2 2 2
(1 + O(k1 , k2 ))
⇤/⇣ (2⇡) (q + µ0 )
All the terms that depend on the external momenta k1 and k2 will generate terms in the
free energy of the form k 2 ( )4 ⇠ ( )2 r2 ( )2 . These are irrelevant terms that will
not be interesting for us other than to note that once we let loose the dogs of RG, we
will no longer sit comfortably within some finite dimensional subspace of the coupling
constants. Integrating out degrees of freedom generates all possible terms consistent
with symmetries; flowing to the IR allows us to focus on the handful of relevant ones.
The contribution (3.38) to the free energy is what we want. Translating back to real
space, we learn that the quadratic term gets corrections at this order. We have
Z ⇤
0 2 dd q 1
g0 ! g0 = g0 36g0 d 2 2 2
(3.40)
⇤/⇣ (2⇡) (q + µ0 )
The minus sign in (3.40) is important. It can be traced to the minus sign in (3.32),
and it determines the fate of the would-be marginal coupling g 4 in d = 4 dimensions.
Recall that, in d = 4, there is no contribution to the running of g(⇣) from the second
and third steps of RG. Here, the leading contribution comes from the first step and, as
we see above, this causes g(⇣) to get smaller as ⇣ increases. This means that the theory
in d = 4 is similar in spirit to those in d > 4, with 4 an irrelevant coupling.
Finally, just because g is marginally irrelevant in d = 4 does not mean that you can
turn it on and expect to flow back to the Gaussian fixed point. As depicted in the
diagram, the coupling mixes with µ2 . If you want to flow back to the Gaussian fixed
point, you need to turn on a particular combination of µ20 and g0 .
– 82 –
3.4.3 Feynman Diagrams
The calculation above needed some care. As we go to higher order terms in the ex-
pansion, the number of possibilities starts to blossom. Fortunately, there is a simple
graphical formalism to keep track of what’s going on.
Suppose that we’re interested in a term in the expansion (3.27) of the form g0p ( )n ( + l
).
This can be represented by one or more Feynman diagrams. Here is the game:
• Each k is represented by an external, solid line.
+
• Each k is represented by a dotted line.
• The dotted lines are connected with each other to form internal loops. They are
paired up in all possible ways, reflecting the pair contraction of Wick’s theorem.
No dotted lines can be left hanging which means that the diagrams only make
sense for l even.
in (3.29)
R
• For each internal loop, there is an integral dd q/(2⇡)d .
P
• Each vertex comes with a power of g0 (2⇡)d d ( i ki ) where the delta function
imposes momentum conservation, with the convention that all momenta are in-
coming.
k1 k4 Z
=g dd x ( )4 (3.41)
k2 k3
– 83 –
k1 k4
+ 4 + + + +
Similarly, the ( ) terms are k1 k 2 k 3 k 4 ⇠ k2 k3 , but because these are internal
lines we should join them up to get a diagram that looks like .
+ +
The interesting term at order g0 is k1 k2 + q q , which includes both k and k.
This was evaluated in (3.28). It is represented by the diagram
q
Z ⇤ Z
dd q 1
= 6g0 dd x ( )2
k1 k2 ⇤/⇣ (2⇡)d q 2 + µ20
where the integral arises because the momentum q of the + excitation running in the
loop is not determined by momentum conservation of the external legs. Such corrections
are said to arise from loop diagrams, as opposed to the tree diagrams of (3.41).
k1+ k 2 + q
k1 k4
Z ⇤/⇣ Y4 d X
d ki
= 36g02 ki f (k1 + k2 ) (2⇡)d d ( ki )
k2
q
k3 0 i=1
(2⇡)d i
where the function f (k1 + k2 ) was given in (3.39) and contains two propagators associ-
ated to the + fields running in the loop. Evaluating this leads to the correction (3.40),
as well as a slew of higher derivative couplings.
+
Any graph that we can draw will appear somewhere in the expansion of loghe FI [ k , k ] i+
given in (3.27). As we noted above, this is a cumulant expansion which has the rather
nice graphical interpretation that only connected graphs appear in the expansion. For
example, at order g02 , the disconnected graph that appears in the expansion of hFI2 i+
that looks like will be cancelled by the same disconnected graph appearing in
2
hFI i+ .
In the language of quantum field theory, it’s tempting to view the lines in the Feyn-
man diagrams as the worldlines of particles. There is no such interpretation in the
present case: they’re simply useful.
More Diagrams
We can now also look at other diagrams and see what role they play. For example,
you might be worried about the diagram . This is strictly vanishing, because
– 84 –
the incoming momentum for lone leg is forced to be equal to the intermediate
+
momentum for the propagator. Yet the momentum for and + can never be
equal .
There are, however, two further diagrams that we neglected which do have an inter-
esting role to play. Both of these are two loop diagrams. They will not a↵ect what
we’re going to do later but, nonetheless, are worth highlighting. The first diagram is
Z
dd k 1
= g02 C(⇤) k k
(2⇡)d 2
for some C(⇤) whose exact form will not be important. This gives rise to a shift in the
quadratic term, so that (3.30) is replaced by
Z ⇤
2 02 2 dd q 1
µ0 ! µ = µ0 + 12g0 d 2 2
+ g02 C(⇤)
⇤/⇣ (2⇡) q + µ 0
We’ve called the result of this diagram A(k, ⇤); again we won’t need its detailed form.
Importantly, it is now a function of the external momentum k. This means that it gives
rise to two e↵ects that are (in a technical sense) relevant. The first is that there is yet
another renormalisation of µ2 , this time depending on A(0, ⇤). The second is novel:
upon Taylor expanding A(k) = A(0) + 12 k 2 A00 (0) + . . ., we get a contribution to the the
gradient term, which is now
Z
0 1 0
F [ ] = dd x r · r + . . . with 0 = 1 2g02 A00 (0, ⇤)
2
This, in turn, means that we need a new rescaling of the field. To order g02 , we can
write this as
⇣ (d+2)/2
k0 = ⇣k and 0
k =
1 g02 A00 (0, ⇤) k/⇣
This last, additional step is known as field renormalisation. (Actually, that’s not com-
pletely true. It should be known as “field renormalisation”, but instead is known as
“wavefunction renormalisation”. This a terrible name, one that betrays the long and
deep confusion that permeated the origins of this subject. Even in the context of quan-
tum field theory, this rescaling has nothing to do with wavefunctions. It is a rescaling
of fields!)
– 85 –
Although we won’t compute this field renormalisation exactly, it is nonetheless im-
portant for this is what gives rise to the anomalous dimension of , and this was
underlying the whole scaling analysis of Section 3.2.
⇤
⇤0 = = ⇤e s
⇣
The renormalisation group transformation described above tells us that each coupling
changes with scale, gn = gn (s). The beta function is defined as
dgn
= n (gn )
ds
Note that s increases as we flow towards the IR. This means that a positive beta
function tells us that gn gets stronger in the IR, while a negative beta function means
that gn gets weaker in the IR. (As an aside: this is the opposite to how beta functions
are sometimes defined in quantum field theory, where one parameterises the flow in
terms of energy rather than length.)
Before we jump straight in, it’s useful to take a step backwards and build up the
beta functions. Let’s go back to our original scaling analysis around the Gaussian fixed
point (3.24), where the running of the couplings is given by gn (s) = e(d nd/2+n)s g0,n .
The beta functions are
✓ ◆
dgn 1
= d nd + n gn (3.43)
ds 2
Notice that, at this leading order, there’s no mixing between di↵erent couplings: turning
on one coupling gn does not induce another to flow. As we saw above, this state of
a↵airs no longer holds when we include interactions.
We now focus on the two most important couplings, µ20 and g0 . At order g0 , the RG
equations are given by (3.31); the additional correction at order g02 , given in (3.40),
means that these get replaced by
– 86 –
where
Z ⇤ Z ⇤
dd q 1 dd q 1
a = 12 and b = 36
⇤/⇣ (2⇡) q + µ20
d 2
⇤/⇣ (2⇡) (q + µ20 )2
d 2
Note that we have kept our original scaling dimensions in (3.44); the corrections in
scaling due to the diagram (3.42) will be subleading and not needed in what follows.
When we di↵erentiate µ2 (⇣) and g(⇣) to derive the beta functions, we will get two
terms: the first is (3.43) and comes from the scaling; the second comes from the cor-
rections, given by the integrals a and b. Di↵erentiating these integrals is particularly
easy. For small s, we we write
Z ⇤ Z ⇤
s d
dq f (q) ⇡ [⇤ ⇤e ]f (⇤) ⇡ ⇤f (⇤)s ) dq f (q) = ⇤f (⇤)
⇤e s ds ⇤e s
dµ2 3g ⇤4 dg 9 ⇤4
= 2µ2 + 2 2 and = g2 (3.45)
ds 2⇡ ⇤ + µ2 ds 2⇡ 2 (⇤2 + µ2 )2
These don’t (yet) contain any new physics, but it’s worth reiterating what information
we can extract from these equations.
First, the beta function for µ2 has two terms; the first term comes from the second
and third steps in RG (scaling), while the second comes from the first step in RG
(integrating out). Meanwhile, the beta function for g has only a single term. There
is no term linear in g because it was marginal under scaling, but it does receive a
contribution when we integrate out the high momentum modes at order g 2 . This
contribution is negative, which tells us the coupling is marginally irrelevant. (A repeat
of the warning above: this is the opposite convention to quantum field theory where one
flows in decreasing energy, rather than increasing length, which means that a marginally
irrelevant interaction is usually said to have a positive beta function. )
– 87 –
In statistical mechanics, the field (x) is, by construction, a coarse grained object:
at the microscopic level, it dissolves into constituent parts, whether spins or atoms or
something else. This has the practical advantage that we have no expectation that
the statistical field theory will describe physics on arbitrarily short distance scales.
In contrast, when we talk about quantum field theory in the context of high energy
physics, it is tempting to think of the fields as “fundamental”, a basic building block
of our Universe. We may then wish for the theory to make sense down to arbitrarily
small distance scales.
To avoid this problem, one makes the so-called bare couplings – things we’ve called
µ20 and g0 – depend on ⇤. This is not such a dumb thing to do; after all, these quantities
were defined at the cut-o↵ scale ⇤. The original game of renormalisation was to find a
way to pick µ20 (⇤) and g0 (⇤) such that all physical quantities remain finite as ⇤ ! 1.
It is by no means obvious that this is possible. Theories which can be rendered finite
in this way are said to be renormalisable.
– 88 –
sweeping us away from the Gaussian fixed point. But we seem to be no closer to
figuring out where we end up. All we know is that we’re not in Kansas anymore.
The difficulty is that we’re limited in what we can calculate. We can’t do the path
integral exactly in the presence of 4 interactions, and are forced to work perturbatively
in the coupling g. Yet, as we have seen, in dimension d < 4 the RG flow increases g,
taking us to a regime where perturbation theory is no longer valid.
β (g)
Figure 29: The beta function for g when d < 4. The arrows show the flow of g as we move
towards the infra-red. The fixed point lies within the regime of perturbation theory when
d = 4 ✏.
Nonetheless, the calculations that we did above do contain information about where
we might expect to end up. But to see it, we have to do something rather dramatic.
We will consider our theories not in d = 4 dimensions, but in
d=4 ✏
dimensions where ✏ is a small number, much less than 1. Clearly, this is an odd thing to
do. You could view it as an act of wild creativity or, one of utter desperation. Probably
it is a little bit of both. But, as we shall see, it will give us the insight we need to
understand critical phenomena.
– 89 –
equations (3.44): they are
dµ2 12⌦d 1 ⇤4
= 2µ2 + g̃ + . . .
ds (2⇡)d ⇤2 + µ2
dg̃ 36⌦d 1 ⇤4
= ✏g̃ g̃ 2 + . . .
ds (2⇡)d (⇤2 + µ2 )2
where ⌦d 1 is the area of the sphere Sd 1 . We’ve introduced the dimensionless coupling
g̃ = ⇤ ✏ g. Note that the beta function for g̃ now includes a term linear in g̃ arising
from the scaling.
It can be checked that the two-loop diagrams we neglected contribute only at order
2
✏ . This means that it’s consistent to truncate to the beta functions above. We could
use the general formula for the area of a sphere, ⌦d 1 = 2⇡ d/2 / (d/2), but this will give
corrections of order ✏2 , so instead we simply use ⌦3 = 2⇡ 2 . Similar comments apply to
(2⇡)d . We’re left with
dµ2 3 ⇤4
⇡ 2µ2 + 2 2 g̃
ds 2⇡ ⇤ + µ2
dg̃ 9 ⇤4
⇡ ✏g̃ g̃ 2
ds 2⇡ 2 (⇤2 + µ2 )2
The novelty of these beta functions is that they have two fixed points. There is the
Gaussian fixed point µ2 = g̃ = 0 that we discussed before. And there is a new fixed
point,
3 ⇤4 2⇡ 2 (⇤2 + µ2? )2
µ2? = g̃? and g̃? = ✏
4⇡ 2 ⇤2 + µ2? 9 ⇤4
Since we’re working to leading order in ✏, the solution is
3 2 1 2 2⇡ 2
µ2? = ⇤ g̃ ? = ⇤ ✏ and g̃ ? = ✏
4⇡ 2 6 9
This is the Wilson-Fisher fixed point. Importantly, when ✏ is small then the fixed
point g̃? is also small, so our calculation is self-consistent (although, since we are in a
fractional dimension, arguably unphysical!).
– 90 –
where, as with all our other calculations, the entries in the matrix hold only up to
O(✏2 ).
Now suppose that we increase ✏. The Wilson-Fisher fixed point moves to higher g,
and our perturbative approach breaks down. Nonetheless, it is not unreasonable to
suppose that the qualitative picture of the flows remains the same. Indeed, this is
thought to happen. Because the Wilson-Fisher fixed point has just a single relevant
operator, it means that we will generically end up there if we we’re willing to tune just
a single parameter, namely T ! Tc .
It is now a simple matter to compute the critical exponents in the epsilon expansion.
Recall from Section 3.2 that these are related to the scaling dimensions of various
terms. The relevant direction away from the Wilson-Fisher fixed point is temperature,
t = |T Tc |/Tc . Its dimension is determined by the way it scales as we approach the
critical point, t ! ⇣ t t = es t t. But this is precisely the eigenvalue t that we just
computed.
1 1 ✏
⌫= = + + O(✏2 )
t 2 12
We can then use the hyperscaling relation ↵ = 2 d⌫, given in (3.13), to compute the
critical exponent for the heat capacity
↵ ✏
c⇠t with ↵ = + O(✏2 )
6
– 91 –
To compute the other critical exponents, we need to evaluate the anomalous dimension
⌘. As we mentioned briefly above, this is related to the diagram and turns
2
out to be ⌘ = ✏ /6, which is higher order in the expansion. We then have, from (3.10),
d 2 ✏
⇡ =1
2 2
Equations (3.14), (3.15) and (3.16) then give
1 ✏ ✏
⇡ , ⇡1+ , ⇡3+✏
2 6 6
where all expressions hold up to corrections of order O(✏2 ).
↵ ⌘ ⌫
1 1
MF 0 2
1 3 0 2
✏=1 0.17 0.33 1.17 4 0 0.58
d=3 0.1101 0.3264 1.2371 4.7898 0.0363 0.6300
Our answers are embarrassingly close to the correct values given the dishonest method
we used to get there. One can, however, make this approach more respectable. The ✏
expansion has been carried out to order O(✏5 ). It is not a convergent series. Nonethe-
less, sophisticated resummation techniques can be used to make sense of it, and the
resulting expressions give a fairly accurate account of the critical exponents.
The real power of the epsilon expansion, however, is more qualitative than quan-
titative; it usually – but not always – gives a reliable view of the structure of RG
flows.
=[ ]=0
This means that the Gaussian fixed point has an infinite number of relevant deforma-
tions since n , for each n, is relevant.
– 92 –
It turns out that, in contrast to d = 3, there are actually an infinite number of
fixed points in d = 2. Roughly speaking, the nth fixed point can be reached from the
Gaussian fixed point by turning on
Z
1
F[ ] = d2 x (r )2 + g2(n+1) 2(n+1)
2
Of course, as we’ve seen above, the RG flow is not quite so simple. When we turn on
the coupling g2(n+1) 2(n+1) we will generate all other terms, including 2 and 4 and
so on. To reach the nth fixed point, we should tune all such terms to zero as we flow
towards the infra-red.
One can show that the nth fixed point has n relevant operators: schematically, these
can be thought of as 2 , 4 , . . . , 2n although, as above, there will be mixing between
these. By turning on the least relevant operator, one can flow from the nth fixed point
to the (n 1)th fixed point.
The results above are not derived using the ✏ = 4 d expansion which, unsurprisingly,
is not much use in d = 2. Instead, they rely on something new which we will briefly
describe in Section 3.6.
Kenneth Wilson
Renormalisation first entered physics in the context of quantum field theory, with
the need to make sense of the UV divergences that arise in quantum electrodynamics.
The theory, developed by Schwinger, Feynman, Tomonaga, Dyson and others, amounts
to finding a consistent way to subtract one infinity from another, leaving behind a finite
answer. This method yields excellent agreement with experiment but is, in the words
of Feynman, a “dippy process”, in which the infinities are not so much understood as
swept under a very large rug.
– 93 –
The first hint of something deeper – and the first hint of a (semi)-group action – was
seen in the work of Gell-Mann and Low in 1954. They realised that one could define
an e↵ective charge of the electron, e(µ) where µ denotes the energy scale at which the
experiment takes place. This interpolates between the physical charge, as µ ! 0, and
the so called bare charge at high energies.
Meanwhile, throughout the 50s and 60s, a rather di↵erent community of physicists
were struggling to understand second order phase transitions. It had long been known
that Landau theory fails at critical points, but it was far from clear how to make
progress, and the results that we’ve described in this course took several decades to
uncover. In Kings College London, a group led by Cyril Domb stressed the importance
of focussing on critical exponents; at Cornell University, Benjamin Widom showed
that the relationships between critical exponents could be derived by invoking a scale
invariance, albeit with little understanding of where such scale invariance came from;
and at the University of Illinois, Leo Kadano↵ introduced the idea of “blocking” in
lattice models, a real-space renormalisation group in which one worked with successively
coarser lattice models.
While many people contributed to these developments, the big picture, linking ideas
from particle physics, statistical physics and condensed matter physics, is due mostly
to. . .
The floodgates opened in 1971 when Wilson set out his grand vision of the renormal-
isation group and, with his colleague Michael Fisher, suggested the epsilon expansion
as a perturbative method to compute critical exponents in a paper charmingly titled
“Critical Exponents in 3.99 Dimensions”. Wilson used these methods to solve the
“Kondo problem” in which an isolated spin, sitting in a bath of mobile electrons, ex-
hibits asymptotic freedom, and he was among the first to understand the importance
of numerical approaches to solve statistical and quantum field theories, pioneering the
– 94 –
subject of lattice gauge theory. In 1982 he was awarded the Nobel prize for his contri-
butions to critical phenomena.
The answers to all these questions can be found in the emergence of a rich math-
ematical structure at the critical point. As we’ve seen throughout these lectures, the
basic story of RG ensures that physics at the critical point is invariant under scale
transformations
x! x (3.46)
More surprising is the fact that the physics is invariant under a larger class of sym-
metries, known as conformal transformations. These transformation consist of any
map
@ x̃i @ x̃j
x ! x̃(x) such that ij = (x) kl (3.47)
@xk @xl
for some function (x). Such conformal transformations have the property that they
preserve angles between lines.
The equation (3.47) has obvious solutions, such as translations and rotations, for
which (x) = 1. Furthermore, it is simple to see that the scaling (3.46) falls into the
class of conformal transformations, with (x) = 2 . However, it turns out that there is
one further, less intuitive transformation that obeys this condition. This is known as
the special conformal transformation and is given by
xi (x · x)ai
x0i = (3.48)
1 2(x · a) + (a · a)(x · x)
parameterised by an arbitrary vector a.
The first question that we should ask is: why are theories at the fixed points invariant
under the larger group of conformal transformations, rather than just scale transforma-
tions? The answer to this, which goes somewhat beyond this course, involves a deeper
understanding of the nature of the RG flows and hinges, crucially, on being able to
– 95 –
construct a quantity which decreases monotonically along the flow. This quantity is,
unhelpfully, called c in d = 2 dimensions, f in d = 3 dimensions and a in d = 4 di-
mensions, and the fact that it decreases monotonically is referred to as the c-theorem,
f -theorem and a-theorem respectively. Using this machinery, it is then possible to
prove that scale invariance implies conformal invariance. (The proof is more clearcut
in d = 2; it relies on some extra assumptions in higher d, and there is a general feeling
that there is more to understand here.)
The existence of an extra symmetry (3.48) brings a newfound power to the study
of fixed points. The d translational symmetries, 12 d(d 1) rotational symmetries, d
special conformal symmetries and single scale transformation combine to form the
conformal group, which can be shown to be isomorphic to SO(d + 1, 1). All fields and,
in particular, all correlation functions must fall into representations of this group, a fact
which restricts their form. In recent years, our understanding of these representations
has allowed new precision in the computation of critical exponents in d = 3 dimensions.
This programme goes by the name of the conformal bootstrap.
Conformal Symmetry in d = 2
In d = 2 dimensions, conformal symmetry turns out to be particularly powerful. The
group of finite conformal transformations follows the pattern in higher dimensions, and
is SO(3, 1) ⇠= SL(2, C). However, something rather special happens if you look at in-
finitesimal transformations where one finds that many many more are allowed. In fact,
there are an infinite number. This means that there is a powerful, infinite dimensional
algebra, known as the Virasoro algebra, underlying conformal theories in d = 2 dimen-
sions. This places much stronger constraints on these fixed points, ultimately rendering
many of them solvable without resorting to perturbation theory. This is the reason why
the critical exponents are rational numbers which can be computed exactly. This is
also what allows us to understand the structure of the infinite number of multi-critical
fixed points described in Section 3.5.2.
– 96 –