P409M Miller 2008 All PDF
P409M Miller 2008 All PDF
(PART I)
1
Outline
2
1. Curvilinear coordinate systems
Any point in the space can be written in terms of three coordinates ___________
and three basis vectors
Sometimes the coordinates are written as _______ and the unit vectors as
(I will use both notations, but avoid i, j and k).
Any vector A in this space, can be written in terms of these unit vectors as,
I will sometimes assume the Einstein Summation convention and omit the
summation sign Σ when I have two repeated indices (unless otherwise specified),
e.g.
3
These unit vectors are orthonormal (orthogonal and of unit length) if
Kronecker delta
where ǫijk is the Levi Civita symbol, which is +1 for an even permutation of 1, 2
and 3 and -1 for an odd permutation, and zero if any of the indices are the same,
e.g. 4
For example, the z-component is:
From this, and the definitions above, follow the gradient of a scalar field φ(r),
5
Exercise: Using the Einstein summation convention, prove
6
1.2 Curvilinear coordinates in physics problems
Very often, physical systems are described by classical (or quantum) field theories. We
have a field, e.g. , which obeys a differential equation over all space, e.g. the
Schrödinger Equation
The boundary condition is often very much simpler in one particular choice of
coordinates, making the solution simpler in these coordinates.
General
Solution scenario
coordinate independent
system
Boundary Specific
Conditions Solution
scenario
dependent
8
Example: Spherical Polar Coordinates
e.g. Consider a hollow sphere or radius R kept at a constant temperature T. The
temperature inside the sphere obeys Laplace’s equation,
However, the boundary condition is much more easily expressed in spherical polar
coordinates:
9
Example: Cylindrical Coordinates
e.g. Consider a wire with a uniform charge density ρ0. The electric potential generated
by the charge distribution is given by Poisson’s equation,
which is coordinate system independent (and becomes Laplace’s equation away from
the charge).
However, the charge density is much more easily expressed in cylindrical polar
coordinates (with the wire running along the z-axis):
10
1.3 Unit vectors and scale factors
Key Point: Write down the definition of basis vectors and scale factors for
general curvilinear coordinates
This may seem obvious, but the analogue is not true in general curvilinear
coordinates.
11
e.g. Imagine changing θ by and amount dθ in cylindrical coordinates:
for dθ small.
How far we move for a particular coordinate shift is known as a scale factor.
The direction we move for a particular coordinate shift is given by the basis
vector.
12
Imagine a general coordinate system described by qi (i=1,2,3) and related to Cartesians
via
basis vector
scale factor
The basis vector is the unit vector in the direction of , i.e. (no sum)
13
As before, the basis vectors are orthogonal if
Let’s call dsi the distance moved by changing the coordinate qi by an amount dqi
(no sum)
For orthogonal coordinates, we can use Pythogoras’ theorem again to write the total
displacement from altering all three coordinates:
So,
14
More generally, one may define the metric of the space gij according to
metric
(no sum)
So, in our case we have
ds
ds 15
Key Point: Derive the scale factors and unit vectors for various
coordinate systems
(no sum)
So,
These are already unit vectors, so are the basis vectors and the scale factors are,
( but since er changes with θ and φ, it is more convenient to use ex, ey and ez.)
So,
17
Normalize these vectors to find the basis vectors and scale factors:
basis vectors:
So, write ds =dx+dy+dz and plug in dx, dy and dz (above). The cross terms
vanish and we are left with
The lack of cross-terms tells us the coordinates are orthogonal, and the coefficients of the
diagonal terms are the square of the scale factors.
Exercise: Find the scale factors, unit vectors, volume element and infinitesimal
line element-squared for cylindrical coordinates.
19
1.3 The Gradient
Key Point: Derive an expression for the gradient in general orthogonal
curvilinear co-ordinate systems
The gradient in Cartesian coordinates was:
i.e. with respect to the distance moved NOT how much the parameter changes
For a curvilinear coordinates qi, recall the distance moved for a change dqi was
dsi = hq dqi, so the component of the gradient in the direction of eq is
i i
(no sum)
20
1.5 The Divergence, Curl and Laplacian Operators
Key Point: Derive expressions for div, grad, curl and the Laplacian in
general orthogonal curvilinear co-ordinate systems
Divergence
We may now use the form of the gradient to derive the divergence ∇ · A
21
Our unit vectors form a right-handed set, so
22
these have no divergence so can be pulled in front of the ∇
23
Curl
Curl works in much the same way. We may use the result we obtained earlier:
Remember that
so I will pick up contributions for from the terms linked by the arrows.
24
Similarly for the other terms.
25
Laplacian
The laplacian is ∇·∇ = ∇2, so we can fairly easily insert our expression for the
gradient into the expression we derived for the divergence:
but
So,
26
Key Point: Apply these expressions to various coordinate systems,
including spherical polar coordinates and cylindrical coordinates.
27
Exercise: Find the gradient, divergence, curl and Laplacian for cylindrical coordinates.
28
2. Partial Differential Equations in cylindrical and spherical polar
coordinates
We have already seen that physical systems are often described by scalar and vector
fields in space and time, that obey particular differential equations involving the
operators of the previous section.
Laplace’s Equation:
where E is the electric field, ρ(r) is the charge density (at position r) and ǫ0 is the
permittivity of free space.
30
Diffusion Equation
Used for heat flow, diffusion of materials, etc., when there are no sources.
For systems which have reached equilibrium (i.e. don’t change with time) the time
derivative term vanishes and we have Laplace’s equation again.
The rate at which heat transfers from one volume to another depends on the
temperature gradient, the area in contact and the heat conductivity of the material
31
The rate at which heat transfers from one volume to another is proportional to the
temperature gradient, the area in contact and the thermal conductivity of the material. For
a boundary of area A,
dσ
dA
thermal vector with magnitude equal
conductivity to infinitesimal area dA and
normal to the boundary
Gauss’ theorem relates an integral over the boundary to an integral over the volume:
Equating with the expression for Q (and assuming ρ and cp are constant too)
velocity of propagation
This is a very important equation since it governs the motions of waves, e.g. vibrating
strings, vibrations in solids, sound waves, water waves, electromagnetic waves.
33
permittivity of
Example: Maxwell’s equations (again!) in vacuum are free space
permeability of
free space
Taking the curl of the first one and inserting the second
But 0
Measuring ǫ0 and µ0 tells us that electric fields are waves travelling with speed
34
Helmholtz Equation and time-independent diffusion equation
Both of these equations are special cases of the wave equation and time-dependent
diffusion equation, when the dependence on time has been factored out.
35
Schrödinger Equation
Erwin Schrödinger,
1887-1961
The time-independent Schrödinger Equation:
total energy
kinetic energy 36
potential energy
2.2 The method of separation of variables
These equations have multiple dimensions in them. One useful method of solving them is
to separate the differential equation into separate differential equations for each
dimension.
There might not be a solution of this form, and often there will be solutions not of this
form, so this is not necessarily going to work, but there is no harm in trying!
no t dependence no r dependence
37
So (if our original assumption about was true) both sides must just be equal to a
constant
an arbitrary constant
(no t or r dependence)
the time-independent
diffusion equation
We can now solve each of these separately and cobble them together at the end.
38
Key Point: Apply the method of separation of variables to Laplace’s
Equation in spherical and cylindrical coordinates
39
(Same as on
previous slide)
no dependence no r or θ dependence
So, just like for the time separation, each side must be constant, say m2.
and
40
Finally, we have a set of three equations, one for each coordinate:
This one is a little bit too hard for now. The solution is an Associated Legendre Polynomial.
We will study these functions in detail later (section 3). For now, we will just write the general
solution as
where A is a constant.
The l and m here are labels corresponding to the differential equation. We saw earlier
that m must be an integer. The associated Legendre function similarly insist that lx≥ 0,
and |m| ≤ l.
43
The general solution to is therefore
Alm, Blm, Clm and Dlm are constants, which are determined by the boundary conditions
of the physical problem.
44
Key Point: Apply boundary conditions to general solutions to provide
solutions to physical problems.
A physical example
There are no sources, other than the sphere, so the electrostatic potential obeys
Laplace’s equation away from the sphere. We can solve Laplace’s equation to provide a
general solution.
These are most easily expressed in spherical polar coordinates, so that is the
coordinate system we should use.
We apply the boundary conditions to the general solution to obtain a specific solution.
45
The general solution to is, from before:
So,
46
The first few Legendre polynomials (associated Legendre polynomials with m = 0) are
Exercise: Show that the above functions with x = cosθ are solutions of
But
so, since the other Legendre polynomials are higher powers of x, l = 1, and the
electrostatic potential is
47
More formally,
We can then use the orthogonality relation for Legendre polynomials (which I will
prove later in the course):
So and for
48
Outside the sphere
we need \vi to drop to zero infinitely far from the sphere, since
this is what we would have if we removed the sphere.
The same arguments for the angular dependence hold as for inside, so
50
Separation of variables in cylindrical coordinates
You may write your solution in terms of Bessel functions Jn(kr), which are the
solutions of the differential equation
(k and n constants)
Although J-n(kr) is clearly also a solution to the radial equation (since the equation
only involves n2) it is not an independent solution for n an integer, since
51
2.3 Spherical harmonics and the Schrödinger Equation.
Key Point: Define spherical harmonics and understand their role in the
solution of Schrödinger’s equation.
We can separate this into equations for each of the coordinates as we did before, but
this time, let’s just separate into a radial and angular part.
52
We can’t solve the radial equation without knowing V(r), but we can solve the angular
part (since we have explicitly required the potential to have no angular dependence).
Angular:
The solutions to this angular part are the spherical harmonics (as we saw earlier):
53
The constant N is fixed by requiring an orthogonality condition:
measure
54
Hopefully, you know by now that the important operators in quantum mechanical
discussions of angular momentum are and . What are these in spherical polars?
and
55
Let’s apply these operators to the spherical harmonics:
and
and z-component .
56
3. Legendre Polynomials and Bessel Functions
57
Equating powers of t:
You can have fun working out the next few (they get harder) but it is rather tedious…
Note that these are not the associated Legendre polynomials we saw in the spherical
harmonics. They are only ‘ordinary’ Legendre polynomials, that is associated Legendre
polynomials with m=0, i.e.
58
Notice that the value at is rather simple:
But,
so,
Also,
Equating powers of t:
59
Key Point: Expand the Coulomb potential in Legendre polynomials
Consider a point charge q on the z-axis, a distance a from the origin. The electrostatic
potential at a point r will be:
This is the potential from an electric dipole and 2qa is known as the
electric dipole moment. 61
Any charge distribution will produce an electrostatic potential described by a power series
with terms like
.. dipole,
.. quadrupole,
.. octupole.
Sometimes these names are used to identify the individual terms, e.g. the term containing
Key Point: Derive recurrence relations from the Legendre polynomial generating
function.
but
This is the most efficient way to calculate the polynomials using a computer. Since we
know P0 and P1 we can calculate P2 and then P3 etc.
Exercise: Use this result to verify the first five Legendre polynomials given earlier.
64
We could have taken the derivative with respect to x instead:
but
Exercise: Show that the generating function satisfies the differential equation
Then we have
We can use the Legendre equation to show that the Legendre polynomials are orthogonal.
symmetric in
zero
67
For this tells us that , i.e. they are orthogonal over [-1,1].
using
(orthogonality for )
Equating powers of t:
68
Putting these together we have the orthogonally and normalization condition
The Legendre polynomials are also “complete” – we can write any (continuous) function
as a sum over Legendre polynomials. (This is just like a Fourier series being a sum over
sines and cosines.)
So,
Exercise: Write the Dirac delta function as a sum over Legendre Polynomials.
69
3.4 Series expansion and Rodrigues’ Formula
(To do this yourself, you will need to know things like ! (see the later discussion of the
Gamma function), and be able to rearrange the order of summation.)
But , so,
70
We can use this to derive Rodrigues’ formula by using
for
Exercise: Show that the above definition of are solutions of the associated
Legendre equation,
You may assume that are solutions of the “ordinary” Legendre equation.
72
3.6 Bessel Functions
(k and ν constants)
Friedrich Wilhelm Bessel
1784 - 1846
Just like the Legendre polynomials, they have a generating function, a differential
equation (above), recurrence relations, a series expansion and orthogonality relations.
We don’t have time to go into detail here, so I will just state them without proof.
73
Generating function:
Series expansion:
Bessel functions with negative (integer) index are related to those with positive integer
index by
74
Integral representation for :
The values of x for which vanishes are called the zeros (or roots) and are often
denoted , where m denotes which zero we are concerned with, i.e. .
We could, of course, use this to derive the individual polynomials, but this is very tedious.
It is better to derive recurrence relations.
76
Differentiate with respect to t:
Relabel:
77
Differentiate with respect to x:
Stick in g:
Relabel:
78
We can use these recurrence relations to derive the Hermite differential equation (much
easier than Legendre’s!).
Generating function:
80
4.2 Properties of Hermite polynomials
Exercise: Use this series to verify the first few Hermite polynomials.
81
Exercise:
82
Key Point: Write down the Hermite polynomial orthogonality condition.
zero symmetric in
83
if
We say that Hermite polynomials are orthogonal on the interval [-∞,∞] with a weighting
84
Exercise: For a continuous function, I can write . Show that
85
4.3 Hermite polynomials and the Quantum Harmonic Oscillator
Key Point: Solve the quantum harmonic oscillator in terms of Hermite polynomials.
Recall our earlier discussion of the time-independent Schrödinger equation. That was in
3-dimensions, but here I will simplify to one dimension again,
Notice that this looks awfully like the equation we just had on the previous slide:
Our reweighted Hermite polynomials are solutions of the Quantum Harmonic Oscillator!
86
Let’s write with so we get
Then
88
But why is the quantum harmonic oscillator quantized?
We have seen why , and how to move from one energy state to another
using ladder operators, but we still have no reason for why n must be an integer!
For integer n, this solution (or to be more precise, half of it) will truncate to give Hermite
polynomials.
For non-integer n, it does not truncate and one can show that the terms grow like .
These solutions do not satisfy the boundary condition as , so must be
discarded and the harmonic oscillator is quantized.
89
4.4 Laguerre polynomials and the hydrogen atom
Generating function:
Edmond Laguerre
1834-1886
Exercise: Starting from the generating function, prove the two recurrence relations
90
Following a similar method to that used for Legendre and Hermite polynomials, we can
show that the Laguerre polynomials are orthogonal over the interval [0,∞] with a
weighting , i.e.
(These results can be proven using similar methods to those used earlier for Legendre
and Hermite polynomials. If you are feeling assiduous feel free to do these as an
exercise.)
91
Associated Laguerre polynomials are obtained by differentiating “regular” Laguerre
polynomials (just as for Legendre).
92
Recall our investigation of the Schrödinger equation in spherical coordinates with V = V(r).
For the hydrogen atom (that is, with the wavefunction for an electron orbiting a
proton), the potential is the Coulomb potential,
93
To make the maths a wee bit cleaner, let’s make the following redefinitions:
, , , , with
Then
becomes
94
Exercise: Plug the above result into the radial equation to recover the associated
Laguerre equation for L(ρ).
Just as for the Hermite equation, solutions exist for non-integer λ-l-1 but these diverge as
r→∞ and must be discarded. The boundary conditions quantize the energy of the
Hydrogen atom.
Fixing λ to be an integer n,
Notice the 2n here. This is because we don’t quite have the orthogonality
condition for the associated Laguerre polynomials we had before - we have an
extra power of ρ. This result is most easily proven with a recurrence relation,
96
5. Gamma and Beta Functions, and Stirling’s approximation
Key Point: write down the three definitions of the gamma function and
understand how they are related.
The gamma function is often known as the “factorial function”, and is useful because:
So,
98
Now we can use and to derive other values for the
gamma function:
where n is an integer.
This is why the gamma function is often called the factorial function.
99
The second definition (known as Euler’s integral definition) is
So,
(see slide 81)
To show that this function is the same as that of the first definition, let’s define
writing t = nu
100
Since
then,
101
The third definition (due to Weierstrass)
Karl Weierstrass
is the Euler-Mascheroni constant 1815-1897
To prove this is the same function again, write the first definition as a product:
102
which is the Weierstrass definition.
Proof:
0.2
Now
2 3 4 5
-0.2
0.6
0.8
0.6
0.2
-4 -2 2 4
So,
We could have gone further with the expansion and found the next term:
105
This series converges very quickly. It is useful for calculating things like the entropies, which are
logarithms of large factorials.
n n! Error Error
1 1 0.92213 7.779% 0 -1 ∞
Exercise: Use Stirling’s approximation to calculate the number of ways in which you
can rearrange a deck of standard playing cards (i.e. 52!). Estimate your error by
considering the next term in the expansion. How does your calculation and error
correspond to the correct result?
106
5.3 The Beta Function
Key Point: Define the beta function and understand its relation to the gamma function
e.g. put
107
Also notice that it is symmetric. Defining (in the original definition)
where
The angular integral should also now be familiar, since it is a beta function:
109
5.3 The simple pendulum at large angles
In the past, you have always assumed that pendulums only have small angle oscillations.
But what is the period of oscillation of a pendulum for large angle swings?
110
I don’t really care about the tension, so the interesting equation is:
Use a trick:
So,
We can set the constant via a boundary condition: the pendulum is at rest at an angle
The period is the time taken to go from back to again, or 4 times the time
taken to go from to .
111
So the period is
This is an elliptic integral, and unfortunately we don’t have time to introduce its solutions,
the elliptic functions.
However, we can now do the special case of α = 90o. Then cosα = 0 and the period
becomes,
112