Wavefunction Measurement in Quantum Mechanics
Wavefunction Measurement in Quantum Mechanics
1 notes, part1 1 of 38
Quantum Mechanics
Introductory Remarks: Q.M. is a new (and absolutely necessary) way of predicting the
behavior of microscopic objects.
It is based on several radical, and generally also counter-intuitive, ideas:
1) Many aspects of the world are essentially probabilistic, not deterministic.
2) Some aspects of the world are essentially discontinuous
Bohr: "Those who are not shocked when they first come across quantum theory cannot
possibly have understood it."
Humans have divided physics into a few artificial categories, called theories, such as
• classical mechanics (non-relativistic and relativistic)
• electricity & magnetism (classical version)
• quantum mechanics (non-relativistic)
• general relativity (theory of gravity)
• thermodynamics and statistical mechanics
• quantum electrodynamics and quantum chromodynamics (relativistic versions of
quantum mechanics)
Each of these theories can be taught without much reference to the others. (Whether any
theory can be learned that way is another question.) This is a bad way to teach and view
physics, of course, since we live in a single universe that must obey one set of rules. Really
smart students look for the connections between apparently different topics. We can only
really learn a concept by seeing it in context, that is, by answering the question: how does
this new concept fit in with other, previously learned, concepts?
Each of these theories, non-relativistic classical mechanics for instance, must rest on a set of
statements called axioms or postulates or laws. Laws or Postulates are statements that are
presented without proof; they cannot be proven; we believe them to be true because they
have been experimentally verified. (E.g. Newton's 2nd Law, Fnet = ma , is a postulate; it
cannot be proven from more fundamental relations. We believe it is true because it has been
abundantly verified by experiment. )
Actually, Newton's 2nd Law has a limited regime of € validity. If you consider objects going
very fast (approaching the speed of light) or very small (microscopic, atomic), then this "law"
begins to make predictions that conflict with experiment. However, within its regime of
validity, classical mechanics is quite correct; it works so well that we can use it to predict the
time of a solar eclipse to the nearest second, hundreds of year in advance. It works so well,
that we can send a probe to Pluto and have it arrive right on target, right on schedule, 8 years
after launch. Classical mechanics is not wrong; it is just incomplete. If you stay within its
well-prescribed limits, it is correct.
Each of our theories, except relativistic Quantum Mechanics, has a limited regime of validity.
As far as we can tell (to date), QM (relativistic version) is perfectly correct. It works for all
situations, no matter how small or how fast. Well... this is not quite true: no one knows how
to properly describe gravity using QM, but everyone believes that the basic framework of
QM is so robust and correct, that eventually gravity will be successfully folded into QM
without requiring a fundamental overhaul of our present understanding of QM. String theory
is our current best attempt to combine General Relativity and QM (some people argue
"String Theory" is perhaps not yet really a theory, since it cannot yet make (many)
predictions that can be checked experimentally, but we can debate this!)
Roughly speaking, our knowledge can be divided into regimes like so:
small
QM Relativistic
(non-relativistic) QM
1/size
Mechanics Relativistic
(non-relativistic) Mechanics
big
0 c
speed
In this course, we will mainly be restricting ourselves to the upper left quadrant of this figure.
However, we will show how non-relativistic QM is completely compatible with non-
relativistic classical mechanics. (We will show how QM agrees with classical mechanics, in
the limit of macroscopic objects.)
So, C.M. studies ballistics, pendula, simple harmonic motion, macroscopic charged particles
in E and B fields, etc. Then, one might use the concept of energy (and conservation laws) to
make life easier. This leads to new tools beyond just Newton's laws: e.g. the Lagrangian, L,
and the Hamiltonian,H, describe systems in terms of different (but still conventional)
variables. With these, C.M. becomes more economical, and solving problems is often
simpler.
(At the possible cost of being more formal)
Of course, what one is doing is fundamentally the same as Newton's F=ma!
The equations of motion are given in these various formalisms by equations like:
∂H
d ∂L ∂L ∂x = − px
− =0 or , or F = ma
dt ∂x˙ ∂x ∂H = x
∂px
(If you've forgotten the Lagrangian or Hamiltonian approaches, it's ok for now…)
Just realize that
the general goal of C.M. is to find the equation of motion of objects:
€ Given initial conditions, find x(t) and p(t), position and momentum, as a function of time.
Then, you can add complications: E.g. allow for more complicated bodies which are not
pointlike, ask questions about rotation (introduce the moment of inertia, and angular moment
L=rxp), move to many-body systems (normal modes), etc…
Q.M. is about the same basic thing: Given a potential, what is the motion? It's just that
Q.M. tends to focus on small systems. (Technically, systems with small action, ∫ Ldt <≈ )
And the idea of "motion" will have to be generalized a bit (as we shall see soon!)
Having just completed C.M., your initial reaction may be "but, size doesn't € matter"! After
all, neither L nor H cares about size, and C.M. often deals with so called "point objects".
(Isn't a point plenty small?!) Unfortunately, it turns out that in a certain sense, everything you
learned in 2210 and 3210 is WRONG! To be a little more fair, those techniques are fine, but
only if applied to real-world sized objects. (As I said above, there's a regime of validity) Size
doesn't matter up to a point, but ultimately, C.M. breaks down: if you try to apply the 3210
Lagrangian (or Hamiltonian) formalism to an electron in an atom, or an atom in a trap, or a
quark in a proton, or a photon in a laser beam, or many other such problems, you will fail big
time!
It's not just that the equations are wrong. You can't patch them up with some clever
correction terms, or slight modifications of the equations, like relativity does at high speeds.
The whole MIND SET is wrong! You cannot ask for x(t) and p(t). It's not well defined! Point
particles do not exist. Particles have a wave nature, and waves have a particle nature. There
is a duality in the physical world, which is simply not classical.
So, we must start from scratch, and develop a whole new framework to describe small
systems. There are many new ideas involved. Some are formal and mathematical, some are
rather unintuitive, at least at first. I will try to motivate as much as possible, and we'll study
plenty of examples. Quantum mechanics comes from experiment! Feynman says that the one
essential aspect to learn Q.M. is to learn to calculate, and we will basically follow this idea.
Q.M. is great fun: very weird, sometimes mysterious. Philosophers still argue about what it
all means, but we will take a "physicist's view", mostly. Issues of interpretation can come
later. As a colleague of mine once explained, it's kind of like trying to learn Swahili slang.
First, you must learn a new language, and then you must learn a new culture, and only then
can you finally begin to truly understand the slang...
Alas, there is no agreement on the number, the ordering, or the wording of the Postulates of
Quantum Mechanics. Our textbook (Griffiths) doesn't even write them down in any
organized way. They are all in there, but they are not well-labeled, and not collected in any
one place. (Griffiths sometimes indicates Postulate by putting the statement in a box.)
Quantum Mechanics has (roughly) 5 Postulates. They cannot be stated briefly; when stated
clearly, they are rather long-winded. Compared to Classical Mechanics, quantum mechanics
is an unwieldy beast – scary and ugly at first sight, but very, very powerful.
As we go along, I will write the Postulates as clearly as I can, so that you know what is
assumed and what is derived. Writing them all down now will do little good, since we
haven't yet developed the necessary vocabulary. I will begin by writing partially correct, but
incomplete or inaccurate versions of each Postulate, just so we can get started. Later on,
when ready, we will write the rigorously accurate versions of the postulates.
Don't worry if these seem rather alien and unfamiliar at first - this is really the subject of the
entire course - we're just making our first pass, getting a kind of overview of where we're
heading. So let's start.
1/21/2009 © University of Colorado, Michael Dubson (mods by S. Pollock )
Ch. 1 notes, part1 5 of 38
The wave function Ψ(x) is not "the particle", or "the position of the particle", it is a
mathematical function which carries information about the particle. (Hang on!)
€
In this course, we will mostly be restricting ourselves to systems that contain a single particle
(like one €
electron). In such a case, the wavefunction can be written as a function of the
position coordinate of the particle, and the time: Ψ = Ψ( r ,t) .
Often, we will simplify our lives by considering the (rather artificial) case of a particle
restricted to motion in 1D, in which case we can write Ψ = Ψ(x,t) .
We may also consider a particular moment in time, and focus on just Ψ(x) .
€
In general, Ψ(x) is a complex function of x; it has a real and an imaginary parts.
So when graphed, it looks something like. €
€
€
Re[Ψ] Im[Ψ]
x x
There are many different ways to write the wavefunction describing a single simple
(spinless) particle in 1D at some time: , and others, to be explained later.
(Here x is position, and p is momentum).
If the particle has spin, then we have to include a spin coordinate m, in addition to the position coordinate in the
wavefunction . If the system has 2 particles, then the wavefunction is a function of two positions:
.
Postulate 2 has to do with operators and observables and the possible results of a
measurement. We will just skip that one for now!
Postulate 3 has to do with the results of a measurement of some property of the system and it
introduces indeterminacy in a fundamental way.
It provides the physical interpretation of the wavefunction.
This QM description is very, very different from the situation in classical mechanics. In
classical mechanics, the state of a one-particle system at any given instant of time is
determined by the position and the momentum (or velocity): r , p . So, a maximum of 6 real
numbers completely describes the state of a classical single-particle system.
(Only 2 numbers, x and p, are needed in 1-D)
In contrast, in QM, you need a function . To specify a function, you need an infinite
€
number of numbers. (And it's a complex function, so you need 2 × ∞ numbers!)
In classical mechanics, the particle always has a precise, definite position, whether or not you
bother to measure its position.
In quantum mechanics, the particle does not have a definite position, until you measure it.
In quantum mechanics, we are not allowed to ask questions like "What is the particle doing?"
or "Where is the particle?" Instead, we can only ask about the possible results of
measurements: "If I make a measurement, what is the probability that I will get such-and-
such a result?"
QM is all about measurement, which is the only way we ever truly know anything about the
physical universe.
The act of measuring the position changes the wavefunction according to postulate 4:
If you make a measurement of position, and find the value xo, then immediately after the
measurement is made, the wavefunction will be sharply peaked about that value, like so:
x xo x
(The
graph on the right should have a much taller peak because the area under the curve is the
same as before the measurement. The wavefunction should remain normalized. )
Postulate 1 states that the wavefunction is continuous. By this we mean that Ψ(x,t) is
continuous in space. It is not necessary continuous in time. The wavefunction can change
discontinuously in time as a result of a measurement.
For now, "measurement" is any kind of interaction between the microscopic system observed
and some macroscopic (many-atom) system, such as a screen, which provides information
about the observed property.
Postulate 5, the last one, describes how the wavefunction evolves in time, in the absence of
any measurements.
One's first reaction to Postulate 5 is "Where did that come from?" How on earth did
Schrödinger think to write that down? We will try to make this equation plausible (coming
soon!) and show the reasoning that lead Schrödinger to this Nobel-prize-winning formula.
But, remember, it's a Postulate, so it cannot be derived. We believe it is true because it leads
to predictions that are experimentally verified.
We label the possible results of the measurements with an index i. For instance, for heights
of adults, we might have x1=25 cm, x2 = 26 cm, etc (no adult is shorter than 25 cm). The list
{x1 , x2, ... xi,... } is the called the spectrum of possible measurement results. Notice that xi is
not the result of the ith trial (the common notation in statistics books). Rather, xi is the ith
possible result of a measurement in the list of all possible results.
N = total # of measurements.
ni = # times that the result xi was found among the N measurements.
Note that where the sum is over the spectrum of possible results, not over the N
different trials.
In the limit of large N (which we will almost always assume), then the probability of a
particular result xi is = (fraction of the trials that resulted in xi).
Again, this is called the expectation value of x (even though you might e.g. NEVER find any
€ particular individual whose height is the average or "expected" height!)
The brackets means "average over many trials". We would call this the "expectation value of x2".
A measure of the expected spread in measurements of x is the standard deviation σ, defined
as "the rms average of the deviation from the mean".
"rms" = root-mean-square = take the square, average that, then square-root that.
Let us disassemble and reassemble: The deviation from the mean of any particular result x is
. The deviation from the mean is just as likely to be positive as negative, so if
we average the deviation from the mean, we get zero: .
To get the average size of , we will square it first, before taking the average, and then
later, square-root it:
_________
Now we make the transition from thinking about discrete values of x (say x = 1, 2, 3, ...) to a
continuous distribution (e.g. x any real number). We define a probability density ρ(x):
∑P i =1 → ∫ ρ(x)dx = 1
i -∞
∞
x = ∑ x i Pi → x = ∫ x ⋅ ρ(x)dx
i -∞
Please look again at these equations, (on left and right): think about how they "match up" and
mean basically the same thing! (We'll use both sides, throughout this course.)
€ From Postulate 3, we make the identification and we have
So in QM, the expectation value of the position (x) of a particle (with given wave function
Ψ) is given by this (simple) formula for <x>. It's the "average of position measurements" if
you had a bunch of identically prepared systems with the same wave function Ψ.
More generally , for any function f = f(x), we have .
Griffiths gives an example (1.1) of a continuous probability distribution. Let's redo that
example, just slightly modified, to help make sense of it. (Take a look at it first, though)
We can define the probability distribution in space, ρ(x), and the probability distribution in
time, τ(t):
ρ(x)dx = Prob (frame chosen at random is the one at x → x+dx)
τ(t)dt = Prob (frame chosen at random is the one at t → t+dt)
t
dt dt
(To be precise, I should really be writing Δx instead of dx, and Δt instead of dt. In the end, I'll
take the limit Δt →0)
Notice all the dt's are the same size, but the dx's start out short and get longer and longer.
If you pick a particular t and dt (i.e. some particular frame) then corresponding to that (t,dt) is
a particular (x,dx).
The probability that that particular frame will be picked is what it is (all frames are equally
likely, after all):
Prob(t → t+dt) = Prob(x t→ x+dx)
which means
τ (t)dt = ρ (x)dx
_____
By the way, you might be uncomfortable treating dx/dt as though it was just a fraction Δx/Δt.
But, we often "pull apart" dx/dt and writ things like
dx
= f (x) ⇒ dx = f(x)dt , or
dt
dt 1
=
dx (dx /dt)
dx Δx
€ This makes sense if you remember that = lim
dt Δt →0 Δt
To physicists, dx/dt really is a tiny Δx (dx) divided by a tiny Δt (dt).
€
Wave functions Ψ are in general complex functions. So it's worth a quick review of complex
numbers, since we'll be dealing with this all term.
1
i = −1 , i⋅ i = -1 ⇒ i = -1/i ⇒ = -i
i
Any complex number z can always be written in either
Cartesian form: z = x+iy or Polar form: z = Ae iθ
€
€ Squaring complex numbers does NOT always yield a real result, and in general is quite
different than multiplying by the complex conjugate. i.e. the square of a complex number is
€ €
DIFFERENT from the square of the amplitude of that number.
e z1 +z2 = e z1 ⋅ e z2
(where z1, z2 are any 2 complex numbers)
i(α + β )
This means in particular that e = e iα ⋅ e iβ
€ (which in turn can be used to derive various trig identities, like e.g. that
cos(a+b) = cos(a)cos(b)-sin(a)sin(b): just look at the real part of the equation)
€
Also, if z1 = A1e iθ 1 , z2 = A2e iθ 2 then it is very quick and easy to find the product:
€ Any complex number z, written as a complicated expression, no matter how messy, can be
turned into its complex conjugate z* by replacing every i with -i, so e.g.
(5 + 6i)(−7i) (5 − 6i)(+7i)
z= ⇒ z* =
2i + 3e −iθ −2i + 3e iθ
__________________________
€
QM is all about solving a wave equation, for ψ(x,t). But before learning that, let's quickly
review classical waves. (If you've never learned about waves in an earlier physics class - take
a little extra time to be sure you understand the basic ideas here!)
Such a wave is sometimes called a “traveling wave packet”, since it’s localized at any
moment in time, and travels to the right at steady speed.
x
Example 2: A sinusoidal wave y = f(x) = Asin(2π )
λ
(This one is probably very familiar, but still think about it carefully. )
Soon we will discover, for some waves, another kind of velocity, the group velocity. Never mind for
now!)
I said that f(x-vt) represents a traveling wave – it should be reasonable from the above
pictures and discussion, but let’s see a formal proof – (next page)
________________________________________________________
Claim: y(x,t) = f(x ± vt) represents a rigidly shaped ("dispersionless") traveling wave.
The upper "+" sign gives you a LEFT-moving wave.
The - sign is what we've been talking about above, a RIGHT-moving wave.
Proof of Claim:
Consider such a traveling wave, moving to the right, and then think of a new, moving
coordinate system (x',y'), moving along with the wave at the wave's speed v.
Now, in the moving (x',y') frame, the moving wave is stationary, right? (Because we're
moving right along with it.) It's very simple in that frame:
In classical mechanics, many physical systems exhibit simple harmonic motion, which is
what you get when the displacement of a point object obeys the equation F = -kx, or
d 2 x(t)
2
= −ω 2 x(t) . (Hopefully that looks pretty familiar!)
dt
If you have a bunch of coupled oscillators (like a rope, or water, or even in free space with
oscillating electric fields), you frequently get a related equation for the displacement of the
medium, y(x,t), which is called the wave equation.
€
∂2y 1 ∂2y
In just one spatial dimension (think of a string), that equation is 2 = 2 2 .
∂x v ∂t
(If you’re curious, go back to your mechanics notes, it’s likely you spent a lot of time
deriving and discussing it!)
Theorem: Any (1D) traveling wave of the form y(x,t) =€f(x ± vt) is a solution of the wave
equation above.
Proof: We are assuming y(x,t) = f(φ), where φ=φ(x,t) = x-vt, and we're going to show
(no matter what function, f(φ), you pick!) that this y(x,t) satisfies the wave equation.
∂y df ∂φ df ∂φ
= = . This is just the chain rule, (and I used the fact that = 1.)
∂x dφ ∂x dφ ∂x
(Please make sense of where I write partials, and where I write full derivatives)
Now do this again:
∂ 2 y ∂ df d df ∂φ d 2 f ∂φ
€ 2
= = = 2 (1) (Once again using € = 1)
∂x ∂x dφ dφ dφ ∂x dφ ∂x
Similarly, we can take time derivatives, again using the chain rule:
∂y df ∂φ df ∂φ
€ = = −v (here I used the fact that = −v€, you see why that is?)
∂t dφ ∂t dφ ∂t
And again, repeat the time derivative once more:
∂ 2 y ∂ df d df ∂φ d df 2
2 d f
= −v = −v = −v (−v) = +v (2)
€ ∂ t 2 ∂ t dφ dφ dφ ∂t € dφ dφ dφ 2
d2 f
Using (1) and (2) to express two different ways gives what we want:
dφ 2
€ d2 f 1 ∂2y ∂2y
= =
dφ 2 v 2 ∂t 2 ∂x 2
€
That last equality is the wave equation, so we're done.
€ Again: ANY 1-D traveling wave of the form f(x ± vt) solves the wave equation, and the wave
equation is just a very basic equation satisfied by MANY simple, linear systems built up out
of coupled oscillators (which means, much of the physical world!)
d2E 1 ∂2E
Example 1: Maxwell's equations in vacuum give = (where "E" can be E_y or
dx 2 v 2 ∂t 2
E_z here) and v = 1/ ε0µ0 = c, the speed of light, 3E8 m/s.
So we’re saying that EM waves do NOT have to be "sinusoidal waves": they can be pulses,
or basically any functional shape you like€- but they will all travel with the same constant
speed c, and they will not disperse (or change shape) in vacuum.
€
Example 2:
∂2y 1 ∂2y
A wave on a 1-D string will satisfy = , where y represents the displacement of the
∂x 2 v 2 ∂t 2
Tension
rope (and x is the position along the rope), and the speed is given by v = .
(mass/length)
So here again, on such a string,
€ wave pulses of any shape will propagate without dispersion
(the shape stays the same), and the speed is determined NOT by the pulse, but by the
properties of the medium (the rope - it's tension and mass density)
________________________________________________€
Superposition Principle: If y1(x,t) and y2(x,t) are both separately solutions of the wave
equation, then the function y1+y2 is also a valid solution.
This follows from the fact that the wave equation is a LINEAR differential equation. (Look
back at the wave equation, write it separately for y1 and y2, and simply add)
€ Reminder: Functions are things which take numbers in, and give out numbers, like f(x) = y.
Here x is the "input number", and y is the "output number". That's what functions ARE.
€
Now we have something new (which will reappear many times this term), we have an
operator, which takes a function in and gives a function out.
Lˆ[y(x)] = g(x) (Here y(x) was the input function, the operator operates on this function,
and gives back a different function out, g(x).
€
1/21/2009 © University of Colorado, Michael Dubson (mods by S. Pollock )
Ch. 1 notes, part1 20 of 38
This leads to constructive and destructive interference, one of the defining characteristic
properties of waves. Following, you will find some notes from a freshman course reviewing
interference, in case you’ve forgotten. (For instance, how is that you get an interference
pattern from two slits?) The math and physics here will apply directly in quantum
mechanics, because particles also exhibit wavelike behaviour.
In general, wave-like effects with light are difficult to detect because of the small wavelength
of visible light (400 nm (violet) → 700 nm (red)) The problem is even tougher with
particles, it requires very special circumstances to demonstrate the wave nature of matter.
So, in many situations, light behaves like a ray, exhibiting no obvious wave-like behavior.
Newton (late 1600's) did not believe that light was a wave since he always observed ray-
behavior. Wave-like behavior was not clearly observed until around 1800, by Young.
Wave-like behavior of particles was not clearly observed until around 1925, in Davisson and
Germer's experiment (using a crystal of nickel as a "grating")
+ =
+ =
add subtract
λ
c
wall with infinitesimal hole
To see interference of light waves,
you need a monochromatic (single λ) light source, which is coherent (nice, clean plane
wave). This is not easy to make. Most light sources are incoherent (jumble of waves with
monochromatic
plane wave
c d
λ 2 slits
screen
How do we explain this? Consider the 2 slits as 2 coherent point sources of monochromatic
light. Two sources are coherent is they have the same wavelength λ (and therefore the same
frequency f ) and they emit peaks and troughs in sync, in phase.
Each slit (source) emits light in all forward directions, but let us consider only the parts of the
waves heading toward a particular point on the screen.
If the screen is far away (L >> d), then the rays from the two slits to the same point on the
screen are nearly parallel, both heading in the same direction, at the same angle θ.
θ
θ to screen
d
d
d sinθ
d sinθ θ
The ray from the lower slit has to travel further by an extra distance (d sinθ) to reach the
screen. This extra distance is called the path difference. When the path difference (p.d.) is
one full wavelength, or 2 full wavelengths, or an integer number of wavelengths, then the
waves will arrive in phase at the screen. There will be constructive interference and a bright
spot on the screen.
p.d. = d sin θ = mλ , m=0,1,2,…. (constructive interference)
But if the path difference is ½ wavelengths or 3/2 wavelengths, etc, then there will be
destructive interference at the screen and the screen will be dark there.
€
p.d. = d sin θ = (m + 1/2)λ , m=0,1,2,…. (destructive interference)
Notice that the formula p.d. = d sinθ is NOT a definition of path difference. It is a formula
for path difference in a specific situation, namely when the screen is "at infinity".
€
The definition of path diff. is: p.d. = (distance to one source) – (distance to other source)
A plot of brightness (intensity) vs. angle position on the screen:
I (intensity)
m=2 m=1 m=0 m=1 m=2
sin θ ≅ θ
−λ / d 0 λ/d 2λ / d
Young's experiment was the first real proof that light is a wave. If you believe that light is a
ray, there is no way to explain the destructive interference seen on the screen. In the ray-
view, when you hit a screen with two rays, the brightness of the 2 rays always adds and you
see a bright spot there. It is impossible to explain destructive interference of two light
sources, unless you admit that light is a wave. The same thing happened in the 1920's with
Davisson and Germer's experiments with electron diffraction, here with the even more
bizarre conclusion that forces you to admit that electrons are "waves" too!
I (intensity)
sin θ ≅ θ
−λ / D 0 λ/D 2λ / D
2λ / D
The angular width of the "central maximum" is θ = 2λ /D .
Notice that in the limit, D → λ (slit width becomes as small as the wavelength of light), the
central max becomes so broad, that we get spherical wave behavior.
€
λ λ
D>λ D≈λ
Diffraction Grating
A diffraction grating is an array of many narrow slits with a uniform inter-slit spacing d.
A grating with "500 lines per cm" has a slit separation of
of the same d, but the peaks are much sharper and much brighter.
€
As N (the number of slits) increases, the width of each peak decreases (and gets brighter).
Why?
2 slits: I (intensity)
d
sin θ ≅ θ
−λ / d 0 λ/d 2λ / d 3λ / d
......
d sin θ
−λ / d 0 λ/d 2λ / d 3λ / d
With just 2 slits, when we are near the maximum at the angle θ = λ / d, then the waves from
the two slits are nearly in phase and we have nearly complete constructive interference and
nearly maximum brightness. But with N-slits, when we are near the angle θ = λ / d, any two
adjacent slits are nearly in phase, but the next slit over is a little more out of phase, and the
next one over is even more out of phase. With many slits, if you are just a bit off the special
angle for maximum brightness, the phase differences among the slits quickly add up and
gives destructive interference. Another nice feature of the grating is that, with many slits for
the light to get through, the pattern on the screen is brighter than in the double-slit case
"Modern" physics means physics discovered after 1900; i.e. twentieth-century physics.
1900: Max Planck (German) tried to explain blackbody ("BB") radiation (that's radiation
from warm objects, like glowing coals) using Maxwell's equations and statistical mechanics
and found that he could not. He could only reproduce the experimentally-known BB
spectrum by assuming that the energy in an electromagnetic wave of frequency f is quantized
according to
, where n = 1, 2, 3, … and h = Planck's constant = 6.6×10−34 (SI units)
Planck regarding this as a math trick; he was baffled by its physical significance.
1905: Albert Einstein, motivated in part by Planck's work, invents the concept of a photon
(though the name "photon" came later!) to explain the photoelectric effect.
A photon is a quantum (packet) of electromagnetic radiation, with energy
.
h
Note the definition of ≡ =1.05x10-34 J s = 6.6x10-16 eV s (Called "hbar")
2π
It is important to remember that the Bohr model is simple, useful, and wrong. For instance, it
predicts that the ground state of the H-atom has angular momentum , when in fact, the
ground state of the H atom (s-state) has L = 0. The Bohr model is a semi-classical model,
meaning it combines aspects of classical and quantum mechanics. Semi-classical models are
frequently used by physicists because they are heuristically useful (easy to understand and
often give correct results). But they must always be used with extreme care, because the
microscopic world is really purely quantum. We insert classical mechanics into the
microscopic world not because it is correct, but because it is convenient.
1922: Louis de Broglie (French) proposes wave-particle duality. Theory and experiment
indicate that waves sometimes act like particles (photons). Perhaps, argues de Broglie,
particles can sometimes act like waves. For photons: .
According to Special Relativity (and Maxwell's equations) light of energy E carries
momentum . Hence, .
De Broglie argues that the same equations apply to particles and introduces the idea of matter
waves.
de Broglie relations:
Soon, there was indisputable experimental verification of the photon concept and n=5
of the de Broglie relations. In 1923, the American Arthur Holly Compton
observes the Compton Effect, the change in wavelength of gamma-rays upon collision with
electrons. This effect can only be explained by assuming that gamma-rays are photons with
energy and momentum .
Then, in 1927, Americans Davisson and Germer diffract a beam of electrons from a nickel
crystal, experimentally verifying that for electrons.
Late in 1925, Erwin Schrödinger, then Professor of Physics at Zurich University, gives a
colloquium describing de Broglie's matter wave theory. In the audience is physicist Peter
Debye, who called this theory "childish" because "to deal properly with waves, one has to
have a wave equation". Over Christmas break, Schrödinger begins developing his equation
for matter waves.
1927: Erwin Schrödinger (Austrian) constructs a wave equation for de Broglie's matter
waves. He assumes that a free particle (potential energy = V = 0) is some kind of wave
described by
Schrödinger searches for a wave equation that will reproduce this energy relation. He notes
that
and
so ...
This looks promising. To describe a particle with both KE and potential energy V = V(x),
Schrödinger added in the term V⋅Ψ , producing finally
The Schrödinger Equation is really an energy equation in disguise. When you look at the
S.E., you should try to see E = KE + PE. For a particle with frequency f (energy E = h f )
and wavelength λ (momentum p = h/λ ) in a potential V, this equation appears to correctly
The physicist Paul Dirac famously asserted that the Schrödinger Equation accounts for
"much of physics and all of chemistry". It is probably the most important equation of the 20th
Century. Its effect on technological progress has been much, much greater than the more
famous equation E = mc2.
Schrödinger was quite puzzled by the nature of the wave function. What is the physical
meaning of Ψ(x,t)? He wanted to think of it as some kind of physical matter wave, like an
electromagnetic wave E(x,t). But this interpretation could not explain a host of experimental
results, such as that fact that a particle with a large extended wave function is always found at
one small spot when a position measurement is made.
It was German theorist Max Born, who late in 1927 proposed that the wave function is a kind
of information wave. It provides information about the probability of the results of
measurement, but does not provide any physical picture of "what is really going on." Bohr,
Heisenberg, and others argued that questions like "what is really going on" are meaningless.
Humans live at the macroscopic level, excellently described by classical mechanics, and our
brains evolved to correctly describe macroscopic (classical) phenomena. When we ask "what
is going on", we are really asking for an explanation in terms that our brains can process,
namely, a classical explanation. The microscopic world is fundamentally different from the
classical world of large objects that we inhabit, and our brains' internal models simply don't
apply at the level of atoms. There may be no hope of understanding "what is really going on"
in atoms because our brains are not built for that job. All we can know are the results of
measurements made with macroscopic instruments.
This view, that the wave function provides probabilistic information, but not a physical
picture of reality, is part of the "Copenhagen interpretation" of Quantum Mechanics (so-
called because it was largely developed at Bohr's research institute in Copenhagen.) Einstein,
de Broglie, Schrödinger himself, and others were very dissatisfied with this view, and they
never accepted the Copenhagen interpretation.
• Ernest Rutherford received the 1908 Nobel prize in chemistry for experimental
investigations of radioactive decay, but never received the prize for discovery of the
nuclear atom.
• Albert Einstein never received the prize for either Special or General Relativity.
•
You can always look up constants (and you should locate your favorite collection!) but
having a few common constants at your fingertips is really pretty handy.
-34
Still, (e.g.) =1.05x10 J s is a number which I often forget (it's so small!)
The combination of this constant times the speed of light, c = 2000 eV Angstroms
(roughly) is somehow a simpler number for me to remember. And this combination
occurs often!
€
• An Angstrom is 10-10 m, or 0.1 nm, a very convenient distance in atoms, since that
happens to be the typical size of atomic orbits. €
• An eV is not a standard SI energy unit, but this one I always seem to remember from
E&M:
1eV = 1.6x10-19 J is a typical atomic energy scale. It's the energy gained when an charge
"e" (like a proton, or electron) moves through a potential difference of 1 Volt)
You use eV a lot in practical atomic problems.
Remember from E&M that electrical potential energy of two charges a distance r apart is
q1q2
,
4 πε0 r
e2
which tells us (using q=e) that has units of energy*distance, the exact SAME units
4 πε0
€ as c (!)
Sure enough, the following combination of constants (which is thus UNITLESS, a pure
e2
number) appears over€and over in physical problems: = 1/137.
€ 4 πε0c
This is a highly complex (and ill-defined!) problem, but we can make a simple, crude
estimate by making some simplifying assumptions.
1) Assume light from a bulb is visible, in the middle of the spectrum (yellow).
You would then have to go look up the wavelength of visible light, it's about 6000
Angstroms.
2) Assume that all the energy is going into visible light (ok assumption for a fluorescent
bulb).
You would then have to go look up the power of a bulb, though you can probably guess
that's about 100W for a bright one.
Each individual photon has energy E=hν, and so the total power (100 J/s) must be
producing a total of 100 J/s / (hν Joules/photon) That's it, that's how many photons/sec
should come out.
Now, ν=c/λ (which is handy, because λ=6000 Angstroms was what I looked up)
And 1eV = 1.6x10-19 J converts us to a more "photonic" energy scale
So the answer is
100 J/s / (1. 6x10-19 J/eV) * λ / hc, and then multiplying top and bottom by 2 π, I get
(200π / 1. 6x10-19) * (λ / c ), and now I put in λ=6000 Angstroms, and c = 2000 eV
Angstroms, to get about 1022 photons each second.
No wonder we never notice or care that they're discrete, that's a huge number!
€ €
Slightly less silly Example: What energy should electrons have if you want to do an
experiment to demonstrate their "wave nature"?
I am thinking that "showing wave nature" means generating some sort of interference,
and for THAT, you need to have a wavelength comparable to a "slit size".
The smallest slit I can imagine is ONE atom wide (imagine bouncing electrons off a
crystal, they can bounce off of one atom, or its neighbor, and those two "sources" will
interfere).
Thus, without doing any hairball calculations, I would roughly want
λ(electron) = atomic size = 1 Angstrom. (or so)
DeBroglie then tells me I want the electron to have momentum p = h / λ .
€
1/21/2009 © University of Colorado, Michael Dubson (mods by S. Pollock )
Ch. 1 notes, part1 38 of 38
And the energy associated with that momentum will be the good old freshman kinetic
h2
energy formula, E = p2/2m. So E = . That's it, I could plug in numbers!
2mλ2
But I can also multiply top and bottom by c^2, and divide top and bottom by (2 pi)^2, to
get
4 π 2 (c) 2
E= (this is the€SAME formula but now has combinations of constants I
2mc 2 λ2
remember!)
This I can do in my head. π^2 is about 10, so I have (without a calculator!)
E = 4*π^2 *(2000 eV Angstrom)^2 / 2 (511 keV)(1 Angstrom)^2
€ That’s about 40*4*10^6 eV^2 / (1000 * 1000 eV) = 160 eV.
Anything LOWER than that will have a smaller momentum, or LONGER wavelength,
and will diffract even better. So that's an upper bound. You don't need a high voltage
supply for this!
(This is the basic physics of the Davisson and Germer experiment that first showed that
deBroglie was not off his rocker!)