students_notes
students_notes
JOHANNESBURG
SCHOOL OF PHYSICS
PHYS1001/1006
LECTURE NOTES
SEMESTER 2
2018
Student Class
SECTION 3: INTRODUCTION TO WAVES
3.1 Wave Parameters
3.2 Types of Waves
3.3 Superposition of Waves
SECTION 5: OPTICS
5.1 Geometrical Optics (Plane Interfaces)
5.2 Geometrical Optics (Curved Interfaces)
5.3 Physical Optics
3. INTRODUCTION TO WAVES
Frequency f number of crests (or complete cycles) that pass a given point per unit time;
measured in s Hz (hertz)
Period T time for one complete cycle; this is the time for two successive corresponding
points on the wave (for example, two successive crests) to pass a fixed point.
Frequency and period are related:
1
f
T
If you measure the water level at a fixed time as a function of position rather than time you would get
a graph like that shown in the next diagram.
Wavelength distance between any two successive identical points on the wave
Wave velocity v velocity at which the wave crests appear to move
3-3
Transverse waves
Transverse waves are those in which the direction of vibration is at right angles to the direction of
propagation. Examples are:
(i) Electromagnetic waves, e.g. light, heat, radio waves. These have an electric component ( E )
and a magnetic component ( B ) which oscillate at right angles to each other and to the velocity
(ii) Water waves: the direction of propagation is horizontal, but the water molecules themselves
oscillate (approximately) vertically up and down.
(iii) Waves on a rope.
3-4
Longitudinal waves
Longitudinal waves are those in which the direction of vibration is parallel to the direction of propaga-
tion. Examples are:
(i) Sound waves: molecules of the medium through which the sound is travelling vibrate back and
forth parallel to the direction of propagation. Note that there is no bulk movement of the me-
dium.
(ii) Pressure waves in blood caused by the heart’s pumping action.
Earthquake waves have both transverse (S) and longitudinal (P) components. Liquids cannot transmit
transverse (shear) waves through their bulk, and the absence of the transverse component of earthquake
waves in regions of the earth’s surface indicates the presence of a liquid core in the earth. The diameter
of the liquid core can also be determined.
Complete destructive interference, as shown in the diagram, can occur only if the two amplitudes
are equal.
If the phase difference lies between 0 and 2 (i.e. between 0 and rad.), the resultant amplitude will
lie between two extremes, i.e. between A1 A2 and A1 A2 , see the middle diagrams.
3-5
2. Two waves of the same amplitude and nearly the same frequency travelling in the same
direction.
The two waves move regularly in and out of phase, interfering constructively at one instant and destruc-
tively a short while later. The resultant amplitude varies sinusoidally. This is the phenomenon of beats.
The beat frequency (the frequency of the amplitude) is equal to the difference between the frequencies
of the two waves, and the frequency of the resultant itself is equal to the mean of the two frequencies.
3-6
If the amplitudes of the two waves are not equal, the resultant amplitude can never be zero.
Most radio receivers make use this principle. The effect of beats is well known in music.
3. Two waves of the same frequency and amplitude travelling in opposite directions
These can be set up in a tube by allowing a sound wave to be reflected from one end of the tube (the
same is also true for a spring or string).
If the length of the tube or spring is an exact multiple of half wavelengths then standing waves are
formed – these are not travelling waves. At certain points 2 apart the resultant amplitude is zero;
these points are called nodes. Midway between the nodes are antinodes where the amplitude is a max-
imum.
4.1. ELECTROSTATICS
Coulomb’s law
In the 1780s Charles Coulomb (1736–1806) carried out a series of experiments to investigate the elec-
trostatic force between two static charged particles. He discovered that:
The two particles each experience a force; these forces have the same magnitude but opposite di-
rections (as required by Newton’s third law).
The forces act along the line joining the two charges.
The magnitude of the force between two static charges q1 and q2 is directly proportional to the product
of the magnitudes of the two charges q1 and q2 and inversely proportional to the square of the dis-
tance r between them:
q1q2 qq
F 2 or F k 1 2 2
r r
Note: in these and many subsequent expressions for the magnitude of a vector quantity, the magnitudes
of the charges should be used. The direction of the vector should be determined separately.
In SI units the electrostatic constant has magnitude k 8,99 × 10 N.m.C. It is convenient to introduce
another constant through k 1 4 0 where 0 is the permittivity of free space with value
0 8,85 10 N.m.C. Then Coulomb’s law becomes:
1 q1q2
F (Coulomb’s law)
4 0 r 2
where F is the force exerted on a small test charge q when placed at that point.
It follows from this definition that the direction of the electric field is that of the force exerted on
a positive charge placed in the field.
A field exists around a charge q even if no test charge is placed in the field to detect or measure it.
The magnitude of the field depends only on the charge distribution that creates it; it does not depend
on the charge q used to measure it (which will always cancel out in any expression for the field –
see the example below).
The SI unit of E is N.C (or more usually volts per metre, V.m, see later).
F 1 q q
4 0 r 2
Therefore, the magnitude of the electric field at a distance r from a point charge q is, from the defini-
tion E F :
q
1 q
E (Field due to a point charge q)
4 0 r 2
The field E is directed radially away from the charge if q is positive and towards it if q is negative.
The diagrams below illustrate the direction of the electric field created by a charge Q and resulting
electric force on a second charge q for four different cases; the field direction at the position of the
second charge is shown.
4-5
The Principle of Superposition states that the resultant field due to a number of point charges is the
vector sum of the fields due to each.
Field lines cannot cross one another since the field at a point can lie only in one direction.
Field lines must start on a positive charge and end on a negative charge; they can also either start
or end at infinity if there is an excess of one type of charge, as in three cases illustrated above.
The density of lines in any region indicates the strength of the field in that region. In the two cases
illustrated above on the left, the lines become further apart far from the charge, where the field is
weaker.
Electrostatic flux
This is a measure of the number of field lines passing through a surface situated in an electric field.
In fact, the number of lines passing through a surface perpendicular to the field lines is proportional
both to the field strength E at the surface and to the area A of the surface. That is, N EA: this product
is called the electric flux through the surface.
4-6
Note that this is not the most general definition of electrostatic flux – it is valid only if the field
has the same value everywhere on the surface. This condition will be satisfied in all the examples
considered in this course.
The flux is a maximum when the field is perpendicular to the surface and zero when the field and
surface are parallel.
Gauss’s law
This law, due to Karl Friedrich Gauss (1777–1855), enables the electric field created by a distribution
of charge to be calculated in a number of cases of sufficient symmetry.
To illustrate Gauss’s law consider a point charge q, for which it was previously shown that the magni-
q
tude of the field is E 1 2 .
4 0 r
4-7
The magnitude of the field is the same everywhere on the surface of the sphere, since r has the same
value everywhere on the surface.
The field is normal to the surface everywhere, since the field lines go out radially. Hence
Eperp = E.
Then the flux through the surface of the imaginary sphere is
q q
E Eperp A E A 1 2 4 r 2
4 0 r 0
Thus the flux through a spherical surface centred on a point charge q is E q 0 .
where is the flux through a closed surface and Q is the total charge enclosed by that surface.
The law holds for any distribution of charge, not just for a point charge.
The law is valid whatever the shape of the surface (the Gaussian surface) drawn around the charge
distribution. The surface must however be a closed surface.
The law is useful only if it is possible to draw a surface over which the flux can easily be calculated.
In this course we consider only cases where a surface can be drawn that is either perpendicular to
or parallel to the field.
The field E is normal to the surface everywhere and will by symmetry have the same magnitude
E everywhere on the surface, so the flux through the surface is E Eperp A E 4 r 2 .
The field is the same as if all of the charge were concentrated at the centre of the sphere (cf. gravitation:
the gravitational field of the Earth can be found by assuming all the Earth’s mass is concentrated at its
centre).
Consider a plane conducting surface with excess charge uniformly distributed over its surface. The
surface charge density (the charge per unit area of surface) is denoted σ.
The field outside the conductor is E . By symmetry, the field lines must be normal to the surface of the
conductor (except near the edges). In the diagram below the excess charge is assumed to be positive;
otherwise the direction of the field must be reversed.
The field inside the conductor is zero, so there is no flux through the bottom of the Gaussian surface.
The field is parallel to the curved side of the Gaussian surface and so the component of E normal
to the side is zero. Hence, there is no flux through the side.
The only contribution to the flux is through the top of the Gaussian surface and so E = EA, since
E is perpendicular to A.
The only charge enclosed by the Gaussian surface is that on the section of the plane conducting surface
inside the cylinder; this section has area A, so Q A.
Gauss’s law E Q 0 becomes E A A 0 , giving
The field is normal to the surface of the conductor and is independent of the distance from the surface
(this is strictly valid only if the distance from the surface is small compared with the distance from the
edge of the conducting surface).
Consider for the moment the situation where no other forces act on the test charge.
It follows that a charge will normally (if no other forces act on it) move towards a region of lower
electrical potential energy, just as a massive body placed in the earth’s gravitational field moves
towards a region of lower gravitational potential energy.
The change in electric potential energy that a charge in an electric field experiences is a result of
the work done on it by the electric force, just as the gravitational potential energy of an object in
a gravitational field is changed by the gravitational force.
Consider now the more general case, when other forces (such as gravity) may be acting on the charge
in addition to the electric force.
If a charge q moves from any point A to another point B in an electric field, the change in its electric
potential energy U is defined by
U U B U A WA B (electric potential energy defined)
where WA B is the work done by the electric field on the charge as it moves from A to B.
Note that the work done is independent of the path taken and depends only on the initial and final points
of the motion (as is the case for the gravitational force).
4-10
For the situation considered initially (where the charge q is positive and the only force acting is the
electric force):
The charge is losing electric potential energy so that U is negative.
The work done WA B on the charge by the field is positive, since the electric force and the displace-
ment of the charge are in the same direction.
Electric potential
A quantity more useful than electric potential energy is electric potential, which is defined as the elec-
tric potential energy per unit charge. Therefore the potential difference or p.d. between two points
A and B in an electric field is:
WAB
VB VA (potential difference defined)
q
where WA B is the work done by the field in taking a small test charge q from A to B.
Note that potential difference is determined solely by the details of the charge distribution that
creates the electric field; it does not depend on the magnitude of the test charge that is placed in the
field in order to measure it.
The unit of potential and potential difference is the volt V. From its definition we have
V = J.C. A p.d. of 1V exists between two points if 1 J of work is done against the field in taking
1 C of charge from the point at lower potential to the point at higher potential.
Potential differences are sometimes referred to as voltages.
It follows from this definition that if the potential difference V between two points in an electric field
is known, the magnitude of the work that must be done to move a charge q between the points can be
calculated from
W q V
Only differences in potential or changes in electric potential energy have a physical significance. In
order to define the potential at a point in a field, it is necessary to specify a reference point at which
the potential is taken to be zero. In general this point can be chosen for convenience (as is the case for
gravitational potential energy).
The reference point is often taken to be infinity as is done for a point charge – see below.
In an electric circuit, it could be a point connected to earth.
The general definition of electric potential is therefore:
The potential at any point in an electric field is the work done (by us,
against the electric force) per unit charge in bringing a positive test
charge from the reference point to that point in the field.
In general, when a charge q is accelerated through a potential difference V, the work done is
W q V . If the particle is an electron of charge q = 1,60 × 10 C and if the potential difference is
V 1 V, then the work done is W q V 1,60 1019 J. Hence
1 eV 1,60 1019 J
Work must be done on a small positive charge q to move it from the negative plate (b) to the positive
plate (a), along a field line. The force that the field exerts on the charge has magnitude
F E q (from E F )
q
and is in the same direction as the field. The work done by the field is therefore
Wb a Fd (since cos 1)
E q d
The potential difference between points a and b, which by definition is the work done per unit charge
Wba q , is therefore
Wb a E q d
Vab Va Vb Ed
q q
The magnitude of the potential difference between any two points a and b a distance d apart in a uniform
electric field of strength E is therefore given by
Vab
E (p.d. in a uniform field)
d
This equation leads to the unit V.m for the electric field.
Infinity is often chosen as the reference point for measuring electric potential energy (and therefore
electric potential) because the electrostatic force acting between two charges is zero when they are
an infinite distance apart.
The potential at some point in space due to a fixed point charge q is therefore the work that we must
do, per unit charge, in bringing a small positive test charge q from infinity to that point.
Calculation of the work done is complicated by the fact that the electric force does not remain constant
as the charge q is moved (from Coulomb’s law it depends on the separation of the two charges), so
that the simple definition of work done as the product of the force and displacement cannot be used.
However, using integral calculus it is easily shown that the work we must do to move the charge q
from infinity to a point a distance r from the point charge q is:
1 q q
W (P.E. of two point charges)
4 0 r
From the definition of electric potential energy, this is the potential energy of two point charges q and
q a distance r apart. For two charges of the same sign the potential energy is positive; it is negative if
the charges have different signs.
The potential at a distance r from a point charge q, by definition the work done per unit charge, is
therefore:
1 q
V (potential due to a point charge)
4 0 r
Since potential is a scalar quantity, to find the potential at a point due to several charges, simply
calculate the algebraic sum of the potentials due to the individual charges.
The equations just derived are illustrated in the diagram below.
In the upper part of the diagram, the electric potential at
point P a distance r from the point charge q1 is
V q1 4 0 r .
Note that there are several very similar equations for point charges.
1 q1q2 1 q 1 q 1 q1q2
F , E , V , U
4 0 r 2
4 0 r 2
4 0 r 4 0 r
Do not confuse them, and note that they apply only to point charges
(or spherical charge distributions).
They must never be used in connection with two charged parallel plates.
4-13
Equipotential surfaces
An equipotential surface is a surface over which the potential has the same value everywhere (this
may or may not coincide with a physical surface).
As a first example, we consider a point charge q (assumed positive in the diagram below); the potential
a distance r from a point charge q is V q 4 0 r .
No work is required to move a charge at constant speed on an equipotential surface, since the work
done W qV is zero when V = 0.
Equipotential surfaces are always perpendicular to the electric field, as in the example above.
If a charge is moved on the equipotential surface, it follows from above that no work is done on the
charge by the electric force. But the work done by a force can be zero only if the force is perpen-
dicular to the motion. Hence the force, and therefore the field, must be perpendicular to the motion,
i.e. to the surface.
The surface and the whole volume of a conductor is an equipotential under electrostatic conditions.
If it were not, there would be a potential difference between two points on or in the conductor and
then free charges would flow. This would violate the electrostatic conditions.
The surface (and volume) of an insulator is not necessarily an equipotential, since no charges are
free to move even if a potential difference exists.
As a second example, consider the field between two parallel conducting plates that carry equal but
opposite charges. As already discussed the electric field between the plates is uniform.
The diagram shows both the field lines and the surfaces
of constant potential. These are planes perpendicular to
the field and therefore parallel to the plates (which are
themselves equipotentials).
Capacitance
A capacitor can be charged by connecting it to a battery (which is a device that maintains a constant
potential difference between its terminals).
Electrons are pulled from the conductor connected to the positive terminal, leaving it with a net
positive charge, and transferred through the battery to the other conductor which then has an equal
negative charge.
This creates an electric field, and therefore potential difference, in the gap between the two conduc-
tors.
The charge-transfer process stops when the potential difference between the conductors equals the
p.d. provided by the battery.
If we measure the magnitude of the charge q on either conductor of a capacitor for different potential
differences V between the conductors, we find that the ratio q V is a constant for a given capacitor.
The diagram on the left indicates both the physical appearance of a parallel
plate capacitor and the pictorial representation of a capacitor in circuit dia-
grams.
This representation is used even if the capacitor is not of the parallel-plate
construction.
Consider two parallel conducting plates, each of area A, a small distance d apart, which have been
charged by connecting them to a battery.
4-15
The field due to the charged conducting plates is E 0 , where the surface charge density on each
plates has magnitude q A . Therefore the field between the plates is E q 0 A .
q q A
The field and p.d. between the plates are related through E V so that V , or 0 , giving
d 0 A d V d
0 A
C (parallel-plate capacitor)
d
One device that utilises directly the dependence of C on d is a computer keyboard. In some keyboards,
pressing a key decreases the plate separation of a capacitor directly below the key. The charge-flow
that results from the change in capacitance is detected and interpreted by the computer circuitry.
Variable capacitors are also used in the tuning circuits of radios (the frequency to which the circuitry
responds is determined by the capacitance in the circuit). One type of variable capacitor consists of
two interwoven sets of metal plates, one fixed and the other movable; the capacitance depends on the
size of the overlapping plate area.
For a capacitor with plates of area 1 cm2 and plate separation 1 mm, the capacitance is 8,9 pF, which is
very small. To increase the capacitance to about 1 nF we could:
(i) Increase the area of the plates by a factor of about 100. This has the obvious disadvantage of
increasing the size of the capacitor considerably.
(ii) Decrease the plate separation to about 10 m. However, for a fixed voltage this would, from
E = V/d, increase the electric field between the plates, possibly leading to a breakdown of the air
gap between the plates (which happens at a field of about 3 MV/m).
Another way to increase the capacitance, through the use of dielectrics, is described below.
Capacitors in combination
Circuits normally contain a number of interconnected capacitors, to meet the specific needs of the de-
vice. Two simple ways to connect capacitors, in series or in parallel, are illustrated below.
The equivalent capacitance of a network of capacitors is defined as the capacitance of a single
capacitor that has exactly the same effect in the external circuit, in that it would, for the same potential
difference, store the same amount of charge.
Series combination
We start with two uncharged capacitors of capacitances C1 and C2 connected in series.
4-16
Electrons are transferred from the left plate of the capacitor C1 through the battery to the right plate
of C2, leaving the left plate of the capacitor C1 positively charged and the right plate of C2 negatively
charged.
The excess positive charge on the left plate of the capacitor C1 attracts an equal amount of negative
charge onto the right plate of capacitor C1; this charge comes from the left plate of C2. Since the
two inner plates were initially uncharged, this leaves an excess positive charge of the same magni-
tude on the left plate of C2.
This process continues until the p.d. across the combination of capacitors equals the potential dif-
ference V provided by the battery. The charges on the plates are then as shown in the diagram above;
each of the capacitors in series has the same charge on its plates.
The potential differences across C1 and C2 will in general be different:
q q
V1 and V2 ,
C1 C2
Therefore
V 1 1
q C1 C2
or
1 1 1 (capacitors in series)
Ceq C1 C2
where Ceq is the equivalent capacitance of the combination of two capacitors. For any number of ca-
pacitors C1, C2, C3 ... in series
1 1 1 1
Ceq C1 C2 C3
Note that the combined capacitance is smaller than that of any individual capacitor.
4-17
In summary, two capacitors of capacitances C1 and C2 can be replaced by a single capacitor of capaci-
tance Ceq; for the same potential difference this will store the same amount of charge as the original
capacitors.
equivalent to
The charge on the equivalent capacitor is the same as the charge on each of the original two capacitors.
Parallel combination
Two capacitors that are initially uncharged are now connected in parallel with a battery. Because of
differences in potential between the battery terminals and the capacitor plates, charges will flow.
Electrons flow from the upper plates through the battery to the lower plates, leaving the upper plates
positively charged and the lower plates negatively charged.
This flow of charge stops when the p.d. across each capacitor equals the potential difference V
provided by the battery. Note that for capacitors in parallel, the p.d. across each is the same, whereas
the charges on each will in general be different:
q1 VC1 and q2 VC2
Therefore
q
C1 C2
V
Or
Ceq C1 C2 (capacitors in parallel)
where Ceq is the equivalent capacitance. For any number of capacitors in parallel C1, C2, C3....
4-18
Ceq C1 C2 C3
In summary, the two capacitors of capacitances C1 and C2 can be replaced by a single capacitor of
capacitance Ceq; for the same potential difference this will store the same amount of charge as the orig-
inal capacitors.
equivalent to
Note that the p.d. across the equivalent capacitor is equal to the p.d across each of the original two
capacitors. The charge on the equivalent capacitor is the sum of the charges on the original two capac-
itors.
As more charge accumulates on the plates, the p.d. between the plates increases, and the amount of
work that must be done to transfer the next element of charge therefore increases proportionately.
The total work done in charging the capacitor from zero p.d. to a final potential difference V can be
found by considering a graph of V versus q. From V 1 q , we see that plotting a graph of V against q
C
results in a straight line of slope 1 C that passes through the origin.
The final charge q on the plates is moved against an average potential difference of V V 2 , so that
The total work done can also be found from the area under
the curve: W 1
2
qV (recall the formula W 12 Fx for the
Using the relationship q VC two alternative formulas for the work done can be derived, yielding
q2 1
W 12 qV 1 CV 2 (energy stored in a capacitor)
2C 2
The work done becomes energy stored in the capacitor; it can be released if the capacitor is discharged.
4-19
Dielectrics
We have so far assumed that the gap between the plates of a capacitor is filled with air. In practice there
is usually a dielectric, i.e. an insulating medium, between the plates.
Common dielectric materials include plastic, rubber and waxed paper.
The main function of the dielectric is to increase the capacitance of the capacitor (how this happens
is discussed later).
This effect of the dielectric is measured by the dielectric constant (or relative permittivity), defined
by:
C
(dielectric constant defined)
C0
where C0 is the capacitance with vacuum between the plates, and C is the capacitance with the dielec-
tric completely filling the gap between the plates.
is a constant for a particular material.
1 for vacuum, 1,00059 for air and is usually in the range 2–10 for common materials.
However, can be much larger: for example for distilled water at room temperature 80 and for
barium strontium titanate ~ 10000.
Besides increasing the capacitance, the use of a dielectric has other benefits related to its dielectric
strength; this is the field at which the dielectric breaks down, producing sparking between the plates
and consequent loss of the stored charge. When a dielectric is present, the dielectric strength is typically
much higher than for air.
Examples of dielectric strengths are 3 MV.m1 for air, 16 MV.m1 for paper and 60 MV.m1 for
Teflon.
The dielectric insulates the plates from one another, allowing the capacitor to be charged to a higher
voltage without breaking down; a higher voltage means more charge can be stored.
Alternatively, because of the higher dielectric strength, the plates can be much closer together,
thereby increasing the capacitance even further.
This is shown in the diagram on the left, where E indicates the di-
rection of the applied field that causes the separation of charges.
In some materials (such as water), molecules are naturally polarized but randomly orientated.
When a slab of dielectric is placed in the electric field between the charged plates of a capacitor, the
dipoles that are created are aligned by the field.
However, if the capacitor is still connected directly to the battery when the dielectric is inserted, charges
can move to or from the plates.
Because the field between the plates has decreased, the p.d. between the plates initially decreases
(according to E V d ). The p.d. between the plates is then not equal to the p.d. provided by the
battery to which it is connected, so charges must move.
Negative poles on one face of the dielectric push more electrons off the positive plate of the capac-
itor, making it more positive. In the same way, the negative plate becomes more negative.
This flow of charges from one plate to the other, through the battery, continues until the p.d. be-
tween the plates is again equal to the voltage supplied by the battery (this normally happens within
a fraction of a second).
The net result is that the charge stored on the capacitor has increased. From C q V , with V un-
changed, this means that C has increased.
Note that in both cases the capacitance increases.
4-21
In a conductor there are charges which are free to move, so that if a potential difference exists
between two points in a conductor the charges will move between the points, i.e. a current will be
created.
The magnitude of the current is effectively the rate at which charge moves through the conductor. If an
amount of charge q passes a given point in the conductor in time t, the current I is given by:
q
I
t
This equation defines the unit of charge, the coulomb. The unit of current, the ampere or amp (A), will
be defined later in terms of the magnetic force between two current-carrying conductors.
The type of moving charges that form the current, called the charge carriers, depends on the conduct-
ing material.
In a metal only electrons are free to move, and they will be attracted to the positive (i.e. high po-
tential) end of a conductor when a potential difference is maintained across it.
In solutions containing electrolytes (e.g. salt water) and in ionised gases (as in a fluorescent lamp,
for example), ions of both sign are present and flow in opposite directions.
In some semi-conductors movement of what are essentially positive charges occurs.
Negative charges flowing in one direction produce exactly the same effect as positive charges flowing
in the opposite direction.
By convention, positive current flows from the high potential (positive) to the low potential (negative)
end of a conductor.
4-22
provided that the temperature of the conductor remains constant. This may be written
V IR (Ohm’s law for a conductor)
A large number of conductors (e.g. rectifiers, transistors, diodes) do not obey Ohm’s law; they are
referred to as non-ohmic conductors. Ohm’s law also does not hold for the conduction of electricity
through a gas.
The resistance of a non-ohmic conductor can still be calculated from the formula R = V/I, but it is not
constant and depends on the current flowing through the conductor. For example, rectifiers have re-
sistances that depend on both the direction and magnitude of the current, being extremely large in one
direction. This is illustrated in the diagram below which compares an ohmic device (i) with a semicon-
ducting diode (ii).
4-23
A resistor is a conductor whose resistance obeys Ohm’s law and whose function in an electrical circuit
is to provide a specified resistance, which may range from an ohm or less to millions of ohms.
A common type of resistor is the carbon-composition resistor, which ranges in size from about
1 to several M. It is constructed from a cylinder of carbon, in the form of graphite, mixed with
non-conducting impurities to increase its resistance.
Resistors for specialised use may be made by winding a fine wire around an insulating tube to get
a long wire into a small space.
A useful analogy
It is sometimes convenient to think of an electric circuit as being analogous to a pipe through which
water is driven by a pump.
A battery of specific voltage is analogous to a pump that delivers a specific pressure.
The current is analogous to the water volume flow rate.
Electrical resistance is analogous to the pipe’s resistance to the flow of water.
In the same way, if the resistance is increased in an electrical circuit, the voltage must be increased to
maintain the same current; otherwise the current will decrease throughout the entire circuit.
If there are no branches in a pipe through which water is
flowing, the flow rate will be the same at all points in the
pipe (from the equation of continuity).
If, however, the pipe branches the sum of the flow rates
in the branches must be equal to the flow rate in the main
pipe.
In the same way, in a simple electrical circuit the current will be the same at all points in the circuit. If
the circuit branches, the sum of the currents in the branches must equal the current in the main circuit.
This follows from the conservation of charge and is often referred to as Kirchhoff’s junction rule.
Resistivity
Ohm deduced from his experiments that the resistance of various samples of a particular material is
proportional to the length and inversely proportional to the cross sectional area of the sample:
4-24
l
R (resistivity defined)
A
where is the resistivity, which is a constant (at constant temperature) for a particular material.
From the equation RA/l, the unit of resistivity is .m.
As an example, the resistivity of copper at 20°C is 1,7 10 .m, to be compared with the resis-
tivity of glass, an insulator, which is of the order 10 .m. Graphite, a semi-conducting material
used in many resistors, has a resistivity of 3,5 10 .m
0 1 0
where 0 is the resistivity at some reference temperature 0 (often taken as 20° C).
The proportionality constant is called the temperature coefficient of resistivity, which is measured
in C. (Do not confuse this quantity with the coefficient of linear expansion, for which the same sym-
bol is used).
When materials are heated their resistivity changes, but so does the size of the specimen. For most
materials the coefficient of resistivity is much larger than the coefficient of linear expansion (for exam-
ple for aluminium, 4,3 10 C compared with 7,4 10 C). This implies that changes in speci-
men size with temperature can be ignored compared with changes in resistivity, and then
R l/A gives
R R0 1 0
Hence can be regarded as the temperature coefficient of both resistance and resistivity.
4-25
The equivalent resistance of any network of resistances is defined by R = V/I where V is the potential
difference across the network and I is the current flowing into and out of the network. The network
could be replaced by a single resistor of the equivalent resistance with no effect on the rest of the circuit.
Series combination
Elements of a circuit are said to be connected in series if the same current flows through each, as in
the diagram on the left. There can be no circuit branches between the resistors.
equivalent to
The current I is the same through both resistors, so that the potential difference across each resistor is:
V1 IR1 and V2 IR2
V
Hence the equivalent resistance is Req R1 R2 .
I
It follows that for any number of resistances in series:
Req R1 R2 R3
Parallel combination
Elements of a circuit are said to be connected in parallel if the same potential difference exists across
the elements.
equivalent to
The current flowing into the network splits, with I1 going through one resistor and I2 through the other.
The potential difference V is the same across both resistors, so that the current through each is:
V V
I1 and I 2
R1 R2
4-26
Note that the combined resistance is always smaller than that of any individual resistor.
emf
When a source is connected in a circuit (as in the circuit diagram below), current flows from the positive
to the negative terminal of the source around the external circuit.
4-27
For a continuous flow in the circuit, the current inside the source must flow from negative to positive
(i.e. “uphill”). This is possible because there is another force acting within the source in addition to the
electrostatic force that causes current flow in the external circuit. This force (with a chemical origin in
a cell) is larger than the electrostatic force and acts in the opposite direction to it.
A source is an energy converter, converting chemical energy, for example, into electrical energy. The
conversion is measured by the emf, , which may be defined as the electrical energy delivered by the
source per unit charge passing through it, or equivalently, as the work done per unit charge in moving
charge q through the source; i.e.
W
(definition of emf)
q
emf is measured in volts, and is often referred to loosely as the “voltage” of the source.
The emf is not a force, despite the origin of its name (electromotive force).
The emf of an ideal cell is equal to the p.d. delivered by the cell.
However, for all cells, and other sources of emf, there is a resistance to the movement of charge within
the cell. The cell is said to have an internal resistance, usually denoted r.
A real cell or battery is therefore represented as an emf in series with its internal resistance r, as
indicated in the diagram below.
If no current is drawn from the battery the pd. between its terminals, the terminal voltage, is equal
to the emf of the battery (this is often called the open-circuit voltage).
However, if a current I flows from the battery there will be a drop in potential across the internal
resistance, which is equal to Ir from Ohm’s law.
Thus the terminal voltage is:
Vab Ir (p.d. across terminals of cell)
assuming the current flows from b to a in the diagram above – see the later discussion. The potential
difference measured across the terminals of a source thus depends on the current drawn from the source.
The internal resistance of a battery is usually quite small. For example, a torch battery may have an
internal resistance of less than 0,1 (which increases considerably as the battery ages) and a car battery
has an even smaller internal resistance, typically about 0,005 when in good condition.
4-28
The rate at which energy is delivered to a circuit element is therefore, from P W/t,
where I is the current passing through it and V is the potential difference across it.
From P W/t the unit of power is J.s, which is defined as the watt (W).
What happens to the energy delivered to a circuit element depends on the nature of the device.
In a light bulb, which contains a tiny wire element, it becomes heat and light energy.
In devices such as electric heaters, toasters, kettles and hair dryers thermal energy is produced in
the resistance wire, which is referred to as the heating element.
In a motor it is converted to mechanical energy.
In a loudspeaker it is turned into sound energy.
The formula P IV is valid for any electrical device. For an ohmic conductor, the potential difference,
current and resistance are related by V IR, and the expression for power can be written in two alter-
native forms:
P I 2R V 2 R (power loss in an ohmic conductor)
The rate at which a source of emf converts energy (or supplies energy to the circuit) follows from the
definition of emf: W q . When combined with I q t this leads to W I t
Therefore, the power delivered to the circuit by the source is, from P W/t,
Usually the current in a circuit flows in the direction in which a source would normally drive it and the
source supplies energy to the circuit. However, in circuits containing more than one source, the current
may flow in the opposite direction to which a particular source would normally drive it. Then energy
may be stored in the source (this happens, for example, if the chemical reaction within a cell is reversi-
ble).
4-29
If the source is connected to more than one external resistor, R must be replaced by the equivalent
resistance of the external circuit.
Similarly, if the circuit contains more than a single source, the quantities and r must be suitably
modified.
Each term on the left-hand side is the potential difference across the corresponding circuit element.
It follows that the sum of all the voltage changes in going round the complete circuit must be zero.
This same principle can be applied to any closed current loop in a more complicated circuit and is
often referred to as Kirchhoff’s loop rule. Complicated circuits are not considered in this course.
To calculate the potential difference between two points in a circuit, we use the following:
(a) Current always flows from high potential (H) to low potential (L) in the external circuit.
4-30
(b) The positive terminal of a source is always higher in potential by an amount than the negative
terminal (the internal resistance must be considered separately).
Four simple cases are considered in the diagrams below. In each case, we go from point a to point b to
calculate Va – Vb Vab , the potential difference between points a and b:
Remember that charges cannot move directly across the gap between the capacitor’s plates, so that
no current flows through the capacitor. The current is created by charges moving from one plate
to the other through the rest of the circuit, via the cell and resistor.
At any instant during the charging process the circuit equation is found by moving clockwise around
the circuit from the switch:
Q
IR 0
C
where I is the current at that instant and Q is the charge on the capacitor plates at the same instant.
Note that the potential difference across the capacitor appears with a negative sign in this equation –
the left plate is connected to the positive terminal of the cell and is therefore at a higher potential than
the other plate; there is therefore a decrease in potential as we move across the capacitor in the direction
of the current.
4-31
At the moment the switch is closed, Q 0 so that from the circuit equation the initial current is
I0 = /R.
The charging process continues until the potential difference across the capacitor equals the emf of
the cell. At that time I = 0 and therefore Q = C.
Using integral calculus, it easy to discover how the current, the potential difference across the capacitor
and the charge on the capacitor each change with time. The variation of the voltage and current with
time is illustrated in the following diagrams.
Since Q CV with C constant, the charge Q on the capacitor increases at the same rate as V.
The rise and decay are exponential and involve the expression exp t / where
RC (time constant)
is called the time constant of the circuit. This is a measure of how quickly the capacitor becomes
charged – after a time t >> all values are steady, having reached their final values. For example, the
potential difference across the capacitor reaches more than 99% of its maximum value at t 5 and is
thereafter effectively constant.
As an example, if R 1,0 M and C 1 F then 1,0 s, and after about 5 seconds all values are
effectively constant.
After charging, the capacitor can subsequently be discharged using a circuit such as that in the diagram
below.
It is easily shown that the p.d. across the capacitor decreases with time from its initial value V0 according
to the equation
V V0 exp t
where again = RC. The charge Q on the capacitor also decreases at the same rate (from Q = CV).
Examples of the use of RC circuits include camera flash units, heart pacemakers and the windshield
wipers on a car. In the latter, the intermittent operation used in light rain is controlled by an RC circuit
with an adjustable time constant (through selecting different values of R).
4-32
Alternating current
The current driven by a battery will flow through a circuit in one direction only, and its magnitude will
be constant almost immediately the circuit is connected. This is referred to as direct current (DC).
On the other hand electric generators at power stations produce alternating current (AC), whose mag-
nitude changes sinusoidally with time, reversing direction many times per second. The two currents are
compared in the diagrams below.
where V0 is called the peak voltage and f is the frequency, i.e. the number of complete oscillations
made per second. In South Africa and most other countries f = 50 Hz is used.
If a potential difference V exists across a resistance R in an AC circuit, the current through the resistor
is, from Ohm’s law,
V
I I 0 sin 2 ft
R
where I0 = V0/R is the peak current, as indicated in the previous diagram. Note that the average current
is zero.
The power delivered to the resistance R at any instant is
P I 2 R I 02 R sin 2 2 ft
It follows that, as far as power is concerned, the important quantities in an AC circuit are the mean
values I 2 12 I 02 and V 2 12 V02 . The square roots of these quantities are called the root-mean-square
(rms) values of the current and voltage:
I0 V
I rms and Vrms 0
2 2
It is usually the rms voltage that is quoted for an AC supply; in South Africa Vrms is 230 volts.
by the ammeter, some of the current in the main circuit must be made to bypass the galva-
nometer. By choosing a shunt resistor with the appropriate resistance, we can construct an
ammeter of any desired range (as illustrated in the next lecture example).
A voltmeter is an instrument used to measure the p.d. between two points in a circuit; it must therefore
be connected in parallel with a circuit element.
If the current drawn by the meter is not negligible, the current in the circuit will change when the meter
is connected and the p.d. will be lower than it was before the meter was connected.
The voltmeter must therefore have a very high resistance.
This is achieved by connecting a resistor of high resistance in series with the galvanometer.
This resistor limits the current through the galvanometer, and in addition its magnitude determines the
range of voltages that the voltmeter can measure.
Wheatstone bridge
The Wheatstone bridge is a circuit that can be used for measuring accurately resistance (and also, suit-
ably modified, capacitance).
In the circuit shown here, P, Q, R and S are resistors. One resistor has an unknown resistance that is to
be measured, a second resistor has a resistance that can be varied in a controlled way, and the other two
resistors have fixed, known resistances.
At balance:
The same current I1 will flow in P and Q and the same current I2 will flow in R and S, since
IG 0.
Points b and d are at the same potential (otherwise a current would flow between them).
Hence, using Ohm’s law
Vab Vad I1 P I 2 R
and
4-35
Dividing:
P R
Q S
Slidewire bridge
This is a practical implementation of the Wheatstone bridge circuit. In the circuit below, the points a,
b, c and d refer to the same points with the corresponding labels in the Wheatstone bridge circuit.
Balance is achieved by moving the sliding contact at d along the slidewire ac until the galvanometer
reads zero. Then l1 and l2 are the lengths of the slidewire on either side of the balance point d.
l1 l
The resistance of length l1 is R1 and the resistance of length l2 is R2 2 , where A is the cross-
A A
sectional area of the wire and its resistivity.
At balance
X R1 l1 A l1
.
S R2 A l2 l2
Hence
l1
X S
l2
4.3. ELECTROMAGNETISM
Magnetic poles are in some ways similar to electric charges (there are two kinds, each exerts a force on
the other), but there are important differences. For example:
Isolated magnetic poles have not yet been detected although there are theoretical grounds for be-
lieving that they may exist. Note that cutting a bar magnet in half simply produces two smaller bar
magnets, each with its own north and south poles.
Magnetic fields
Earlier, we described the interaction between two charges by saying that one charge creates an electric
field, and that this field exerts an electric force on the second charge.
In the same way a magnetic field is said to exist at a point if a force acts on a magnetic pole (e.g. one
end of a compass needle) placed at that point. (A compass comprises a small bar magnet pivoted at its
centre of gravity so that it can rotate freely in a horizontal plane.)
The direction of the field at any location in space is the direction in which the north pole of a
compass needle would point at that location.
As is the case with the electric field, the magnetic field can be represented by magnetic field lines,
the tangent to the lines indicating the direction of the field and the density of lines indicating the
intensity of the field.
Unlike the situation for electric fields, the magnetic field lines do not indicate the direction of the
magnetic force (which is discussed below) and should therefore not be called lines of force.
Again in contrast to the situation for electric field lines, magnetic field lines do not start and end on
magnetic poles; they form continuous loops (which will pass through the poles).
The diagram below shows the magnetic field created by a bar magnet.
4-37
The following diagram shows the field created by a horseshoe magnet (the field inside the magnet is
not shown). In the configuration on the right, the field between the poles is almost uniform.
The Earth creates a natural magnetic field, with field lines that roughly resemble those of a bar magnet.
Since the north pole of a compass needle points north, the mag-
netic pole that is situated near the Earth’s geographic north pole
is actually a south pole magnetically, although it is often referred
to as the “north magnetic pole”.
The magnetic poles do not coincide with the geographic poles,
being about 1500 km apart at present, so that the deviation of
magnetic north as indicated by a compass from true north may
be considerable. This angle, the magnetic declination, varies
from point to point on the Earth’s surface.
The position of the poles is not fixed, the north pole moving at
about 40 km/year over the past few years. In fact, it is believed
that the field reverses its direction completely at irregular inter-
vals, reversals being about 300,000 years apart on average (the
last reversal was about 780,000 years ago).
As is obvious from the diagram, the Earth’s magnetic field is not tangential to the Earth’s surface in
general. The angle that the field makes with the horizontal is called the angle of dip, and again it varies
considerably with location on the surface of the Earth, being close to zero near the equator and almost
90° near the pole.
The Earth’s field is believed to be caused by the electric currents in the liquid part of its core, and is not
due to permanently magnetised material in the core.
4-38
Magnetic flux
Magnetic flux is defined in the same way as electrostatic flux. The flux M through a small surface of
area A perpendicular to a uniform magnetic field B is M BA.
The name flux density for B follows from the equation B M A .
From experiment, the direction of F is always perpendicular to the plane containing v and B . In
the diagram shown above F is into the diagram for a positive charge and out of the diagram for a
negative charge.
A single equation can be used to indicate both the magnitude and direction of the magnetic force:
F q(v B) (force on a moving charge)
If the handle of a right-handed corkscrew is turned from the direction of v to the direction of B the
corkscrew will move in the direction of F for a positive charge q. The force on a negative charge is in
the opposite direction.
No force acts on a charge moving parallel to a magnetic field: if v is parallel to B , then sin 0
and so F 0.
The force is a maximum for a charge moving perpendicular to the field (when 90).
Therefore:
The magnetic force on the particle is always perpendicular to its motion, to its left in this case.
The magnitude of the force, F q v B , is constant for a static, uniform field.
4-40
It follows that the path of the particle is circular, and it undergoes uniform circular motion (moving
counterclockwise around the circle in the case illustrated in the diagram above). From Newton’s second
law:
v2
F qv B m
r
where m is the mass of the particle and r is the radius of the circular path. The radius of the path is
therefore
mv
r
qB
If the particle has a negative charge, the force is in the opposite direction and the motion is clock-
wise around a circular path for the situation described by the diagram above.
If the initial direction of the particle is not perpendicular to the field, then the path followed by the
particle is a helix around the magnetic field lines.
There are several applications of the effect described above:
The mass spectrometer or spectrograph, a device to measure the mass (or charge to mass ratio) of
a particle or ionised atom.
Synchrotrons and cyclotrons, devices used to accelerate elementary particles to very large speeds
for research purposes.
The charges that form the current are moving through the con-
ductor with drift velocity v.
The total charge of the charge carriers in this length of conduc-
tor at any instant is Q.
All these charges will drift through the plane formed by the end
of the conductor in a time t given by t l v .
The current in the conductor is therefore I Q t .
The average force on a single charge q moving with drift velocity v in the conductor is, from above,
Fq q v B sin .
F Q v B sin
It l B sin
t
Therefore the force on the conductor is
F I l B sin
The force is at right angles to the plane containing l and B (this follows from the direction of the force
on each charge carrier), so this can be written
F I ( l B)
where l is a vector of length l along the conductor in the direction of the current I.
The direction of F can be found from the corkscrew rule.
If the current is in the direction of the field (or in the opposite direction), the magnetic force on the
wire is zero.
The phenomenon described here is the principle of operation of the speakers found in most sound sys-
tems. It is also used in the design of electric motors and meters, where it produces a torque on a loop of
wire.
The galvanometer
The force on a current-carrying conductor in a magnetic field can be used to rotate the conductor. In
meters (and electric motors) a couple is exerted on a coil in the field. Electrical energy is thereby con-
verted into mechanical energy, specifically kinetic energy of rotation.
A current flows through the loop in the direction shown, and the function of the galvanometer is to
measure this current.
The force on the left side of the loop is into the diagram and the force on the right side of the loop
is in the opposite direction, so that a couple is created that tries to rotate the loop about the central
pivot.
The magnitude of the forces and hence the couple is proportional to the magnitude of the current.
4-42
This rotation is opposed by a spring that exerts a moment roughly proportional to the angle through
which it is turned (Hooke’s law).
A pointer attached to the pivot moves across a scale in response to the rotation.
Since the angle of rotation is proportional to the current, the deflection on the scale is also propor-
tional to the current.
Ampere’s law
Gauss’s law is used in electrostatics to calculate electric fields. A similar type of calculation, due to the
French scientist André-Marie Ampère (1775–1836), may be used in electromagnetism to find the mag-
netic flux density B in situations with sufficient symmetry.
To find the magnetic flux density due to a collection of conductors carrying steady currents, we proceed
as follows.
An arbitrary closed loop is drawn around the current-carrying conductors.
The closed loop is divided into small elements of length s, and for each element we calculate the
product Btan s of s and the component of B parallel or tangential to s . These terms are then
summed around the entire loop.
The total current I total flowing through the surface bounded by the loop is calculated, taking into
account the directions of the currents.
Ampere’s law states that:
The proportionality constant 0 4 10 H.m–1 (H henry) is analogous to 0, and is called the
permeability of free space.
4-43
The law is only useful for calculating magnetic fields due to highly symmetric current configura-
tions. In this course we shall consider only cases in which we can draw a loop with B parallel or
perpendicular to the loop.
The uses of Gauss’s and Ampere’s laws to calculate electric and magnetic fields are contrasted in the
table below.
To calculate the flux density a distance r from a long, straight conductor carrying a current I, we draw
a closed circular loop of radius r around it.
4-44
Therefore
0 I
B (long straight conductor)
2 r
0 I
F I l B sin I l .
2 r
The ampere
The equation just derived is used in the definition of the ampere. Since 0 4 10 H.m, we have
0 2 2 107 H.m1 .
If two long, parallel conductors, 1 m apart and carrying the same cur-
rents, each experiences a magnetic force of 2 × 10 N per metre
length, the current flowing in each is defined to be 1 A.
The field due to the solenoid approximates that of a bar magnet; the end of the solenoid where the field
lines emerge (on the left in the diagram above) acts as a north pole and the other end acts as a south
pole. For this reason a solenoid is sometimes referred to as an electromagnet – it acts as a magnet only
when it carries a current.
The diagram below shows a length-wise cut through a long solenoid, through which a current I flows
(into the page for the turns at the top, and out of the page for the turns at the bottom). The loops are
again shown widely spaced for clarity.
To calculate the flux density within the solenoid, we draw a rectangular closed loop abcd far from the
ends of a solenoid, partly inside and partly outside the solenoid, as shown in the diagram. We apply
Ampere’s law to the closed loop abcd.
Outside the solenoid B 0, so that Btan 0.
Inside the solenoid B is perpendicular to the lines ad and bc, so Btan 0 at the sides of the loop.
The only contribution is from the line ab, where Btan B since B is parallel to ab. Then
( B tan s) B s BL
B L 0 LI N ,
l
giving
N
B 0 I (field inside a long solenoid)
l
This equation does not hold near the ends of the solenoid.
Solenoids are used to generate magnetic fields in many practical devices, particularly where a uniform
field is required:
- In TV sets horizontal and vertical electromagnets are used to control the direction of the electron
beams that hits the screen;
- in the control of electric door locks in cars and in most types of doorbells;
4-47
Induced emf
The diagram below illustrates one of the experiments carried out by Faraday.
The secondary circuit, on the right, contains only a coil wrapped around the same iron ring and a gal-
vanometer to detect any current in the circuit.
Faraday discovered that:
The galvanometer deflects strongly the instant the switch is closed.
There is no deflection when the switch remains closed.
The galvanometer deflects strongly in the opposite direction the instant the switch is opened again.
4-48
In other words, a steady current in the primary circuit produces no current in the secondary circuit. But
when the current in the primary starts flowing or stops flowing, a current is briefly induced in the sec-
ondary circuit.
When the current in the primary circuit starts to flow or stops flowing, the magnetic field in the primary
coil changes and therefore the magnetic flux through the secondary coil also changes.
Faraday concluded that although a steady magnetic field produces no effect,
a changing magnetic field produces an induced current in the secondary circuit;
the presence of the induced current implies the existence of an induced emf in the secondary coil.
The next diagram shows a further experiment conducted by Faraday to investigate electromagnetic in-
duction.
Exactly the same effect is observed if the magnet is held stationary and the coil is moved towards it or
away from it. Again, it is the fact that the magnetic flux through the coil is changing that produces the
induced current in the coil.
This creates an induced current in the coil. The direction of the induced current is determined by
the polarity of the induced emf.
The induced current produces an induced magnetic field (which should not be confused with the
changing magnetic field that created it). The direction of the induced field and induced current are
related by the right-hand screw rule.
The polarity of the induced emf and, consequently the direction of the induced current and magnetic
field, are given by Lenz’s law. This is a consequence of energy conservation, and was proposed by the
Russian scientist Heinrich Lenz (1804–1865) soon after Faraday formulated his law.
The current caused by the induced emf travels in the direction that creates a mag-
netic field with flux opposing the change in the original flux through the circuit.
In other words, the induced current attempts to maintain the original flux through the circuit.
If the flux is increasing, the induced current will produce a magnetic field which opposes the in-
creasing applied field (i.e. the induced field is in the opposite direction to the applied field).
If the flux is decreasing the field produced by the induced current reinforces the decreasing applied
field (i.e. the induced field is in the same direction as the applied field).
Lenz’s law can be used to find the polarity of the induced emf even if no current flows (because the
circuit is not closed) by finding the direction it would flow if the circuit were complete.
Lenz’s law can be illustrated by its application to the experiment of Faraday described above.
As the south pole of the magnet gets closer to the coil, the flux through the coil increases, inducing
a current in the coil.
By Lenz’s law, the induced current must produce a field that opposes the increase in the flux. Since
the field due to the magnet is increasing, this requires that the induced field through the loop is in
the opposite direction to the magnet’s field (i.e. from left to right in the diagram).
Application of the corkscrew rule then says that in order to produce such a field, the current in the
coil must be clockwise, as shown in the diagram.
Motional emf
We describe in this section a particular example of electromagnetic induction in which a motional emf
is produced; this is an emf induced in a conductor that is moving through a magnetic field.
4-50
To see the origin of the emf, consider a straight conductor of length l moving with constant velocity v
through a uniform magnetic field B .
The field is into the diagram below, and the motion is perpendicular to the field.
A magnetic force FM = q v B acts on free electrons in the conductor; these move downwards and
accumulate at the lower end of the conductor. The upper end of the conductor becomes positively
charged and the lower end negatively charged.
As a result of this charge separation an electric field is set up in the conductor, directed from positive
to negative, i.e. downwards in the diagram above. Electrons in the conductor will therefore experi-
ence an electrical force FE = qE which is in the opposite direction to the magnetic force. As the
separation of charges continues the magnitude of the electrical field will increase until the electrical
and magnetic forces on an individual charge balance.
If the ends of the rod were connected to an external circuit (at least partly outside the field), a current
would flow in the external circuit from the positively charged end of the rod to the negative end.
The system therefore acts as a source of emf, and we say that a motional emf is being induced in
the moving conductor. The polarity of the induced emf is as shown on the right of the diagram.
We can derive an expression for the magnitude of the induced emf as follows. As indicated, the flow of
charge in the conductor stops when the magnetic and electric forces on a charge are equal, qE qvB ,
i.e. when the magnitude of the induced electric field in the conductor is
E vB
Because the electric field in the conductor is uniform, the magnitude of the field is related to the p.d.
across the ends of the conductor by
V El vBl
where l is the length of the conductor. But the potential difference V across the terminals of a source
when no current is drawn is just the emf of the source, so that
Bl v (motional emf)
If the conductor is not moving at right angles to B , we must take the component of v perpendicular
to B (or vice versa).
We have found the magnitude of the motional emf without using Faraday’s law. This was possible
because in the situation described here the physical cause of the induced emf is clear. Use of Faraday’s
law would lead to the same formula for the emf.
4-51
The polarity of the induced emf as deduced above is also consistent with the prediction of Lenz’s law.
The transformer
A transformer is a device for increasing or decreasing AC voltage. Transformers are found
- in TV sets to give the high voltage needed for the picture tube;
- in devices for connecting to the mains portable radios, electric razors etc as well as rechargeable
devices such as cell phones (these devices may also contain special circuitry to convert AC to
DC);
- in power stations, to convert the output of the station to very high voltages.
The transformer in its simplest form consists of two coils of wire, called the primary and the secondary,
linked by a soft-iron laminated core.
Using Faraday’s law, it can be shown that these voltages are related by
VS N S
(transformer equation)
VP N P
where NS and NP are the number of turns in the coils in the secondary and primary circuits.
For a step-up transformer, NS > NP.
For a step-down transformer, NS < NP.
However the conservation of energy requires that the power output cannot exceed the power input.
Energy losses due to resistance in the coils and losses in the iron core are usually extremely small
(typically only a few percent) and will be ignored.
From P = IV, we have VP IP = VS IS or
I S VP N P
I P VS N S
So for example, if we increase the voltage by a factor of 10, the current will decrease by at least the
same factor.
4-52
Low voltages are more practical in the home because circuits are more easily insulated against
breakdown (and low voltages are intrinsically safer). The power supplied to homes is therefore AC
at 230 V, so the voltage must be reduced using step-down transformers at local sub-stations.
As already indicated, most domestic appliances operate at even lower voltages, and therefore have
step-down transformers built into their design.
ELECTRICITY AND MAGNETISM
LECTURE EXAMPLES
University of the Witwatersrand, Johannesburg
School of Physics
PHYS1001/6 (Physics I D)
Question 1
Two charges, q1 Q and q2 4Q , are separated by a distance a. A third charge is placed on the line
joining q1 and q2.
Where must this third charge be placed so that the net force exerted on it is zero? [a/3 from q]
Question 2
A sphere of mass 103 kg which carries a charge of 5,0 nC is released from rest from a small distance
above a fixed charge of 3,0 nC. Calculate the equilibrium separation of the charges.
[3,67 mm]
Question 3
Water drops of mass 4,0 10 kg, each carrying 2000 excess electrons, are sprayed parallel to the
surface of a horizontal metal plate.
(i) What surface charge density must the plate carry if the path of the drops is to remain horizontal?
Hint: consider the forces that act on the drop; ignore the upthrust due to the air.
(ii) If the charge density on the plate were doubled, what would be the upward acceleration of the
drops?
[1,1110 C.m; 10 m.s]
Question 4
A water droplet of radius r 0,050 mm carries a charge q such that the electric field E′ at its surface is
6,0 104 V.m. If it is placed between two parallel metal plates a distance d 10 mm apart, what
potential difference V must be applied to the plates to keep the drop from falling?
Hint: solve this problem by carrying out the following steps, working with symbols rather than numbers
until the last step.
(i) Find an expression for the charge q on the drop in terms of the field E′ on its surface.
(ii) Hence determine an expression for the electric force on the charge when placed between the
plates, in terms of the V and d.
(iii) Now calculate the value of V by considering all the forces acting on the droplet.
Density of water = 1000 kg.m.
[3,14 kV]
Question 5
A sphere of mass 0,10 g carrying a charge of 20 nC is suspended from one of a pair of vertical parallel
plates 50 mm apart by an insulating thread AB which is 120 mm long.
If the potential difference between the plates is gradually increased, calculate the value when the sphere
just touches the negative plate. Hint: draw a free-body diagram for the sphere when it is just touching
the plate, and then use the fact that the sphere is then in equilibrium. [1150 V]
Question 6
A square ABCD has sides 0,20 m long, and has charges 10 nC at A and 10 nC at C. Calculate the
magnitudes of the electric field and potential at D. [3,18 kV.m; 0 V]
Question 7
The copper sphere shown above carries a charge of 5,0 C. How much work must be done to bring an
additional charge of 10,0 nC from point P to the surface of the sphere? Hint: first write down expres-
sions for the potentials at P and on the surface of the charge, and then use the fact that the work done is
the product of the charge and the potential difference. [2,55 mJ]
Question 8
Calculate the charge on the 2 C capacitor in the circuit above. Hint: solve this problem by carrying out
the following steps.
(i) Calculate the equivalent capacitance of the circuit. [1,5 F]
(ii) What is the charge on the equivalent capacitor? [18 C]
(iii) Find the p.d. across the capacitors in parallel and hence calculate the charge on the 2 C capac-
itor. [6 V; 12 C]
Question 9
A 2 F capacitor is charged by a 12 V battery. It is then disconnected from the battery and its terminals
are connected to those of a 6 F capacitor which is initially uncharged.
(i) What was the charge on the 2 F capacitor before it was disconnected from the battery?
[24 C]
(ii) Calculate the final charge on each capacitor and the potential difference across each capacitor.
[6 C; 18 C; 3 V]
(iii) Calculate the final energy stored in each capacitor. Compare the total energy stored in the capac-
itors with the energy originally stored in the 2 F capacitor and account for any difference.
[9 J; 27 J; 144 J]
Question 10
A parallel-plate capacitor is charged in air. It is then electrically isolated and lowered into a liquid
dielectric. As a result:
(a) both the capacitance and potential difference between the plates decrease;
(b) both the capacitance and potential difference between the plates increase;
(c) the capacitance increases and the potential difference between the plates decreases;
(d) both the capacitance and the charge on the plates decrease;
(e) both the capacitance and the charge on the plates increase.
Which statement is correct? [(c)]
Question 11
Two identical parallel-plate capacitors of 2 nF each with air between their plates are connected in par-
allel and charged with a 10 V battery. The battery is then disconnected and the space between the plates
of one of the capacitors is filled with a material having a dielectric constant 4. Calculate:
(i) the initial charge on each capacitor, before the battery is disconnected, [20 nC]
(ii) the new charge on the capacitor filled with the dielectric, [32 nC]
(iii) the new voltage across each capacitor, [4 V]
(iv) the energy of the system before and after the dielectric is inserted. [200 nJ; 80 nJ]
University of the Witwatersrand, Johannesburg
School of Physics
PHYS1001/6 (Physics I D)
Question 1
A wire of resistance R is drawn out so that its length is twice the original length. If the resistivity and
density of the wire do not change in the process, what is the new resistance of the wire? [4R]
Question 2
Light bulbs with the wattage ratings shown are connected as in the diagram below. The wattage rating
is the power that would be dissipated if each bulb were connected on its own across the 120-V supply.
Calculate:
(i) the current delivered to the circuit by the battery,
(ii) the power dissipated in each bulb, and
(iii) the total power delivered by the battery.
Hint: first calculate the resistance of each light bulb, and then calculate the equivalent resistance of all
the bulbs. Assume that the resistance of the bulbs is constant for any voltage.
[85,1 mA; 6,95 W; 2,61 W; 0,244 W; 0,407 W; 10,2 W]
Question 3
A direct current is passed through a human limb in such a way that layers of skin, fat and muscle form
three resistances in series. Calculate the relative rate of heat dissipation in each layer. [500: 1,5: 1]
The thickness of each layer and its resistivity is given in the table below. Assume that each layer has
the same cross sectional area.
For the circuit above calculate (i) Vab , (ii) Vbc and (iii) Vac . Hint: what is the current through the
circuit? [6 V; 6 V; 0 V]
Question 5
A number of identical lamps rated 20 W, 10 V are connected in parallel with a source of emf 10 V and
internal resistance 2 . Assume that the resistance of the lamps is independent of temperature.
How many of them can be connected without blowing a fuse rated at 3 A connected in series with the
source? Note: the fuse will blow when the current through it exceeds its rating. What does this tell you
about the equivalent resistance of the lamps connected in parallel? [3]
Question 6
Calculate:
(i) the value of R if the reading on the ammeter is 50 mA, and
(ii) the reading on the voltmeter. [400 ; 8 V]
Hint; for part (i), first calculate the equivalent resistance of the parallel arrangement that includes the
ammeter. You should then be able to use the circuit equation to find the equivalent resistance of the
voltmeter and the resistance R, which are connected in parallel.
Question 7
The resistance of a resistor is measured using a voltmeter and an ammeter. When the voltmeter is con-
nected directly across the resistor, the readings obtained are 50 V and 0,55 A. When the voltmeter is
connected across both the ammeter and the resistor, the readings are 54,3 V and 0,54 A. The resistance
of the voltmeter is 1000 .
Calculate the resistances of the resistor and ammeter. [100 ; 0,556 ]
Hint: draw the segment of the circuit containing the voltmeter, ammeter and resistor for each of the two
cases. From the first you should be able to determine the resistance of the resistor.
Question 8
When a length of nichrome wire of resistance X is placed in the bridge circuit shown, the balance point
is at P. The wire is now replaced by another piece of nichrome wire of the same length but twice the
diameter.
Question 9
In the circuit below R1 10 , R2 20 , the resistance of the voltmeter is 100 , the resistance of the
ammeter is 5 and r, the internal resistance of the source, is 2 .
(i) When the voltmeter reads 5 V the ammeter reads 0,25 A. What is the true resistance of the resistor
R? [25 ]
(ii) What is the emf of the source and what is the current drawn from it?
Hint: from your answer to the first part you should be able to determine the p.d. across R2 and
hence the current through R2. [13,0 V: 0,563 A]
University of the Witwatersrand, Johannesburg
School of Physics
PHYS1001/6 (Physics I D)
Question 1
A particle of charge q enters a magnetic field B and moves counterclockwise in a circle in the plane of
the paper.
Question 2
A particle with mass and charge numerically equal to those of the electron enters a cloud chamber
within which there is a constant magnetic field of flux density 9,0 10 T. The particle enters at right
angles to the field, and travels in a circular path of radius 0,20 m in a counter-clockwise direction
(viewed in the direction of the field).
(i) Is the particle an electron or a positron (the antiparticle of the electron, with the same mass but
positive charge)? Explain. [positron]
(ii) Calculate the speed of the particle. [3,16 107 m·s1]
(iii) Through what potential difference must the particle have been accelerated to acquire this speed?
Hint: use the conservation of energy. [2,85 kV]
Question 3
A wire ab of length 0,50 m and mass 0,010 kg is suspended by a pair of flexible leads in a magnetic
field of flux density 0,40 T as shown in the figure.
What are the magnitude and direction of the current required to remove the tension in the supporting
leads? Hint: consider the forces acting on the wire. [0,50 A from a to b]
Question 4
Two long, parallel vertical wires 0,30 m apart are placed east-west of each other. The current in the
easterly one is 30 A and that in the other is 20 A. Both currents flow upwards.
The earth’s magnetic field is horizontal, directed north, and has flux density 20 T.
Calculate the resultant force per metre on each wire.
Hint: to find the resultant force on a wire you need to find the resultant field at the position of the wire;
this is the resultant of the field due to the other wire and the earth’s field (make sure you take into
account the direction of each). [1,0 103 N; 0 N]
Question 5
The long straight wire AB carries a current I1 20 A. A current I 2 10 A flows through a rectangular
loop whose long edges are parallel to the wire AB.
Calculate the magnitude and direction of the force exerted on the loop by the magnetic field due to the
current in AB. Note: the forces on the sections of the loop perpendicular to AB cancel.
[40 N towards AB]
Question 6
Electrons are travelling in a circular path of radius 1,0 mm at the centre of a long solenoid in which a
current of 1,0 A is flowing. The solenoid is 1,0 m long and has a total of 10 turns.
(i) What is the magnitude of the magnetic field inside the solenoid? [12,6 mT]
(ii) What electric field, and in what direction relative to the magnetic field due to the solenoid, must
be applied to make the electrons travel in a straight line?
Hint: if an electron travels in a straight line, what can you deduce about the electric and mag-
netic forces acting on the electron? [27,8 kV·m perpendicular to B]
Question 7
A wire loop with radius 0,10 m and resistance 2,0 is placed inside a long solenoid with the plane of
the loop perpendicular to the axis of the solenoid. The solenoid has a radius of 0,15 m and has 2500
turns per metre.
A current is switched on in the solenoid and reaches its maximum value in 0,010 s. If a current of
4,0 mA is induced in the loop, what is the maximum value of the current in the solenoid?
Hint: the easiest way to solve this problem is to work backwards from the induced current. You should
be able to calculate the emf induced in the loop and then, using Faraday’s law, the maximum field
produced by the solenoid. [0,811 A]
Question 8
The smaller loop in the diagram below is nearer to you, and initially far from the larger loop. It moves
away from you towards the larger loop, passes through it and continues on the far side.
Which graph below best shows the variation in the current induced in the larger loop? Assume clock-
wise to be positive when viewed from your position. [c]
Question 9
A skate-boarder decides to speed himself up with an electric motor, driven by the emf induced as one
axle of the skate-board moves through the earth’s magnetic field.
The horizontal axle is 0,10 m long, and it moves at 5,0 m.s through a field of flux density 30 T
inclined at 30 to the horizontal.
What power will be produced if the total resistance of the circuit is 10 ? Do you think the plan is
feasible? Hint: first calculate the emf induced in the skate-board. [5,6 pW]
PHYS 1001/1006 TUTORIALS
YEAR 2018
3rd BLOCK
TUTORIALS TO PREPARE
A tutorial test will then be given at the end of each session (approximately
10 minutes). Tutors are expected to give feedback on the tutorial test at the
beginning of the next session.
Question 1
[ 80 nC]
Question 2
Note that for this question 3 and 4 the gravitational force on the electron is sufficiently small com-
pared with the electric force that it can be ignored
Question 3
Two parallel conducting plates are 50 mm apart. An electron is fired with an initial velocity of
103 m.s1 from the positive plate directly towards the negative plate.
What must the charge density on the plates be if the electron’s distance of closest approach to the neg-
ative plate is 5 mm?
Question 4
An electron is released from rest from the negative of two oppositely-charged parallel plates, and strikes
the positive plate, 20 mm away, after 20 ns.
Calculate:
2
(i) the velocity of the electron when it strikes the positive plate,
(ii) the electric field strength between the plates, and
(iii) the p.d. between the plates.
Question 5
The diagram represents a hollow metal sphere of radius R carrying a total positive charge q. A small
charged body with a positive charge q is taken from point X to point Y through a very small hole in
the sphere.
Derive an expression for the work that must be done against the electric field.
Question 6
A parallel-plate capacitor is charged by connecting a battery across it. The electric field between the
plates is E and the charge on the capacitor plates is q.
Determine what happens to the magnitudes of E and q as the separation between the plates is decreased
while the capacitor remains connected to the battery. Do they increase, decrease or stay the same?
Question 7
A 2 F capacitor is charged to a potential difference of 200 V and then isolated. When it is connected
in parallel with a second capacitor which is initially uncharged, the common potential difference be-
comes 40 V.
What is the capacitance of the second capacitor?
Question 8
A battery is used to supply a potential difference of 6 V to the circuit shown below. Calculate the energy
stored in the 4 F capacitor.
3
Question 9
(i) Two capacitors of capacitances 0,1 F and a 0,2 F are connected to a 12 V battery as shown in
the diagram on the left. Calculate the energy stored in the 0,2 F capacitor.
(ii) The battery is disconnected and then a dielectric of dielectric constant 4 is inserted into the
0,2 F capacitor, as shown in the diagram on the right. Calculate the energy now stored in this
capacitor.
Explain whether you would do work or have work done on you in inserting the dielectric.
[ (i) 14.4 μJ, (ii) 6.4 μJ, work done on you]
4
Question 1
The diagram below shows part of a circuit. Which pairs of resistors are in series and which pairs are in
parallel?
Question 2
A set of bulbs are each rated at 20 V, 10 W. (This means that if the p.d. across a bulb is 20 V, it con-
sumes 10 W.)
(i) If the minimum voltage at which they will glow is 10 V, what maximum number can be con-
nected in series across a source of emf 250 V and internal resistance 400 and still glow?
(ii) What maximum number can be connected in parallel across a source of emf 20 V and internal
resistance 2 without blowing a 2 A fuse in series with the source?
Assume the resistance of the bulbs does not change with temperature.
[(i) 15, (ii) 5]
Question 3
For the circuit below, calculate:
(i) the potential differences Vab, Vac and Vbc;
(ii) the power delivered by or stored in each cell, and the power dissipated in each resistor;
(iii) the total energy delivered to the circuit in 1 minute.
5
Question 4
An unknown resistance is to be measured using a cell, voltmeter and ammeter, connected as shown
below. The two meters are not ideal, since the voltmeter resistance is not large and the ammeter re-
sistance is not small.
If the voltmeter reads 10 V when the ammeter reads 0,4 A, calculate the values of the resistances R and
RV. Compare the computed value of R with the ratio of the readings on the voltmeter and ammeter.
[20
Question 5
In the circuit below XY is a slidewire of total resistance 5 . The emf 1 of the battery is greater than
2 V, and the internal resistance of the battery of emf 2 is zero. With the sliding contact Z at the centre
of the slidewire no current flows through the galvanometer.
Calculate the currents through the 1 and 3 resistances and through the slidewire under these con-
ditions.
[1 A , 1.2 A]
6
Question 6
Question 1
The diagram below shows a simple mass spectrometer, used to measure the charge to mass ratio of ions.
Ions of mass m and charge q are produced with negligible initial velocity in the source chamber.
The ions are accelerated through a potential difference V and then enter a chamber within which there
is a uniform magnetic field B directed out of the diagram.
There they move in a semicircle, striking a detector at a distance d from the entry point.
q 8V
Prove that the charge to mass ratio is given by 2 2.
m Bd
Question 2
The wire loop in the diagram carries a current of 10 A in a magnetic field B = 0,2 T.
Determine the magnitude and direction of the net force on the loop.
[0, irrelevant]
8
Question 3
A rectangular wire loop rests on a horizontal surface and has dimensions 500 mm by 9 mm. It carries a
current of 10 A. Parallel to the longer side of the loop and attached to the same horizontal surface is a
conductor carrying a current of 30 A.
Calculate the force exerted on the loop by the long conductor if the conductor is 1 mm from the loop.
Note that forces exist on the sides of the loop of length 9 mm. However the forces on the left side and
right side are equal in magnitude but have opposite directions (since the current flows in opposite di-
rections in the two sides); they do not therefore contribute to the total force.
[0.027 N towards the conductor]
Question 4
You are asked to make a solenoid that is to produce a magnetic field of 2,5 mT at the mid-point of its
axis. The solenoid is to be made by winding insulated copper wire round a cardboard cylinder with an
outer radius of 20 mm and length of 200 mm.
(i) If a current of 2,0 A is to flow, how many turns must there be on the solenoid?
(ii) What length of copper wire will you need?
(iii) You have a battery of emf 6,0 V and internal resistance 1,0 with which to drive the current
through the solenoid. What must the diameter of the wire be?
The resistivity of copper is 1,72 108 .m.
[(i) 199 (ii) 25.0 m (iii) 0.523 mm]
Question 5
A solenoid is 0,5 m long and is wound with 10 layers of wire, each containing 1000 turns. At the centre
of the solenoid is a single loop of wire of area 106 m2. The plane of the loop makes an angle of 30
with the axis of the solenoid, as shown schematically below.
(ii) Explain whether the direction of the induced current is clockwise or counterclockwise as
you look down the axis of the solenoid from the left.
[ (i) 12.6 μV (ii) counterclockwise]
Question 6
Two coils A and B are connected as shown in the figure and placed close to each other. The switch is
initially open. The switch is then closed and left closed for a while before it is opened again.
Which of the following graphs best represents the flow of the induced current through the resistor R as
a function of time? The positive direction of the induced current is indicated in the figure on the left.
[ (c)]
Question 7
step-up transformer connected to a 204 V line is to supply 18 kV for a neon sign. To reduce shock
hazard, a fuse is to be inserted in the primary circuit. The fuse is to blow when the current in the sec-
ondary circuit exceeds 10 mA. Calculate,
(i) the turns ration of the transformer
(ii) power which must be supplied to the transformer when the secondary current is 10 mA
(iii) current rating that the fuse in the primary circuit should have.
[ (i) 75:1 (ii) 180 W (iii) 0.75 A ]
5-1
5. OPTICS
Visible light forms only a very small part of the whole spectrum of electromagnetic waves, as illustrated
in the diagram below.
The divisions between regions of the spectrum are not sharp and they overlap to some extent. The name
given to the radiation sometimes depends on its origin (as, for example, with rays which emanate from
the nucleus and X rays whose origin is the atom).
Our eyes are sensitive to electromagnetic radiation with wavelengths from about 400 nm (violet) to
about 700 nm (red). Visible light may be divided into regions:
Representative wavelengths Approximate limits
Violet 410 400 - 424
Blue 470 424 - 491
Green 520 491 - 575
Yellow 580 575 - 585
Orange 600 585 - 647
Red 650 647 - 700
The eye is not equally sensitive to all wavelengths, being greatest for wavelengths around 560 nm.
5-3
In a vacuum all electromagnetic waves (including light) travel with the same velocity, which we take
as c 3,00 × 10 m.s. In a medium the velocity of light is reduced to a value cn, where
c
cn
n
The frequency of light is determined by its source and remains constant when it passes from one
medium to another.
c f
cn f n
where n n is the wavelength of the radiation in the medium.
The wavelength of light therefore decreases by a factor n when it passes from a vacuum into a me-
dium of refractive index n.
Some useful values of the refractive index are given in the table below.
Medium Refractive index
Air (at 0C, 1 atm) 1.00029
Glass (zinc crown) 1.52
Water 1.33
Diamond 2.42
The actual value of cn (and hence n) for a particular medium depends slightly on the wavelength (and
therefore also on the colour) of the radiation, as discussed later.
For air nair 1,00029, which we can approximate as nair 1.
To illustrate and explain phenomena in geometrical optics, the ray approximation is used. Here we
introduce the concepts of wavefronts and rays; we will investigate the connection between light rays
and the wave nature of light when we discuss Huygens’ principle in a later section.
A wavefront is a surface passing through the points of a wave that have the same phase and ampli-
tude.
A ray is an imaginary line drawn perpendicular to the wavefront; it indicates the direction of travel
of the wave.
5-4
It is important to realise that a source does not emit rays – it emits waves in the form of wave
packets (or photons), as discussed later.
Rays are construction lines drawn to show the direction in which light travels.
IO incident ray
NOM normal to the interface
OS reflected ray
OR refracted ray
i angle of incidence
i angle of reflection
r angle of refraction
The reflected and refracted rays at a smooth interface obey the following relationships:
(i) The incident ray, the reflected ray, the refracted ray and the normal to the interface at the point
of incidence all lie in the same plane.
(ii) The angle of incidence i is equal to the angle of reflection i. This is the law of reflection.
(iii) The angle of refraction and the angle of incidence are related by:
The law of reflection follows from the fact that the incident and reflected waves are travelling in the
same medium and therefore travel at the same speed; consequently they travel equal distances in any
time interval.
In the diagram below, A1D1 represents the wave front at some instant and A2D2 represents the same
wave front at some later instant.
5-6
D1D2 A1A2 Two rays in the same medium travel the same distance in the same
time interval.
Angle A2 D1 90 The rays are perpendicular to wavefronts.
The triangles A1D2D1 and A1D2A2 are congruent, since the hypotenuse
A1D2 is common to the two triangles and A1A2 D1D2.
i Angle NA1D1 i = 90 (IA1 is perpendicular to A1D1) and
angle NA1D1 = 90 (NA1 is perpendicular to A1D2).
i Angle A2A1D2 i = 90 (NA1 is perpendicular to A1D2) and
angle A2A1D2 = 90 (in triangle A1D1D2).
Therefore, since i and i are each equal to angles which are equal to each other, we obtain i i.
Consider a parallel beam of light incident on a plane interface separating media of refractive indices n1
and n2. In the diagram below, all angles marked i are equal (as shown above) and similarly all angles
marked r are equal.
At some instant the wavefront is at AA; after time t it has reached BB. In this time one ray travels from
A to B at speed c2 whilst the other ray travels from A to B at speed c1. Therefore:
AB c1t and AB c2t
leading to
AB c1 n2
AB c2 n1
5-7
where the last equality follows from the definition of the refractive index: c1 c n1 and c2 c n2 .
In the diagram below, light is travelling from medium 1 to medium 3 via medium 2; the light is refracted
at both interfaces, which are parallel to each other.
Therefore, if the intervening layer is parallel-sided, the angle i3 at which the light emerges depends only
on the refractive indices of the first and last media and on the angle of incidence i1. The intervening
layer has no effect on the direction of the emerging ray.
Note that if n1 n3, then if follows from the equation n1 sin i1 n3 sin i3 that sin i1 sin i3 ; therefore
i1 i3.
An object within a medium, when viewed by an observer outside the medium, appears to be either
further away or closer than it really is; this is a result of refraction of the emerging light rays.
If the observer is in the optically less dense medium (i.e. smaller refractive index) the object seems
closer, as in the diagram below. This would be appropriate for an object at the bottom of a swimming
pool being viewed from above the surface of the water.
An object is situated at point R. To the eye the rays appear to come from point A, nearer to the interface
than R, because they are refracted away from the normal on going from medium 2 to medium 1 which
has a smaller refractive index. This follows from Snell’s law:
sin r nin
sin i nout
where nin is the refractive index of the medium in which the object is situated and nout is the refractive
index of the outside medium, in which the observer is situated.
We now assume that the object in the diagram is viewed from a point nearly vertically above it, so that
the rays are almost normal to the interface; then i and r are small and
sin i tan i and sin r tan r .
Snell’s law now becomes
nin tan r ON OR OR
,
nout tan i OA ON OA
or
This provides a method for measuring the refractive index of a transparent material.
5-9
It follows from Snell’s law that when light travels from an optically dense to a less dense medium (i.e.
smaller refractive index) it is refracted away from the normal and that the angle of refraction increases
as the angle of incidence is increased.
At some angle of incidence ic the angle of refraction becomes 90. If i is increased beyond ic the beam
is totally reflected and no light is transmitted into the optically less dense medium. The reflected ray
still obeys the normal laws of reflection.
At the critical angle, n1 sin ic n2 sin 90 n2 . Hence the critical angle is given by
n2
sin ic for n1 n2 (total internal reflection)
n1
For a glass/air interface with nglass 1,52 the critical angle is 41,0. Therefore, when i > 41,0 the light
will be totally reflected.
Note that total internal reflection can only occur when light goes from a medium of higher refractive
index to one of lower refractive index.
Total internal reflection is employed in the construction of the periscope used in submarines.
Fibre optics is based on total internal reflection. A fibre consists of a thin flexible transparent core of
glass or transparent plastic, surrounded by cladding, which is a material of lower refractive index than
the core.
If n2 n1 and the angle of incidence is large enough, light will be totally reflected each time it strikes
the interface, and will emerge from the end of the fibre with little loss. This will happen even if the fibre
is bent. In practice, a large number of fine fibres is used (for flexibility and good resolution), forming a
fibre bundle.
Optical fibres of this kind are used:
- in medicine, to examine the internal organs. Two fibre-optic cables are used, one to transmit
light into the body and illuminate the organ, the other to transmit the images back.
- in the communications industry, to carry high-speed Internet traffic, radio and television sig-
nals, and telephone calls.
5-10
The angle depends on A, n (the refractive index of the prism material) and i (the angle of incidence).
For a given prism, A and n are fixed and plotting versus i produces a curve similar to that in the next
diagram.
There is a particular value of the angle of incidence, imin, for which the deviation has its minimum value,
Dmin. This is called the angle of minimum deviation, where the angles imin and Dmin depend on A and
n.
At minimum deviation the ray travels symmetrically through the prism, i.e. it makes equal angles with
the two refracting faces, which we prove as follows.
5-11
The prism in the diagram above is set at the position of minimum deviation.
Imagine that the direction of the ray through the prism is reversed so that light is incident from the
right. From the diagram, the angle of deviation of the reversed ray is also Dmin (vertically opposite an-
gles are equal).
Since only one angle of incidence can produce this angle of deviation (see the previous graph), the angle
of incidence for the reversed ray must also be imin.
Thus this ray must travel symmetrically through the prism, as shown in the diagram.
Measurement of the angle Dmin provides a very accurate method for determining the refractive index of
a prism.
The method is accurate because the curve of versus i has a flat minimum and so even if the angle
of incidence is not exactly imin the deviation will be very close to Dmin.
The prism in the diagram below is set at the position of minimum deviation.
In the quadrilateral ABCD there are right angles at B and D so that within the quadrilateral
A + C 180°. But the angle within the quadrilateral at C and the angle exterior to the quadrilateral at C
must also sum to 180. Therefore the exterior angle at C must equal the refracting angle A, as shown.
For any triangle, the exterior angle equals the sum of the interior opposite angles. Hence
A D A Dmin
rmin and imin rmin min .
2 2 2
From Snell’s law at B:
nair sin imin nprism sin rmin
giving
A Dmin
sin
nprism 2
A
sin
2
5-12
Chromatic dispersion
Careful measurement shows that the refractive index (and therefore the speed of light) in anything but
a vacuum depends somewhat on the wavelength of the light, i.e. its colour. This property is called
dispersion and the medium is said to be dispersive.
As an example, the refractive index of light travelling in flint glass decreases smoothly from about 1,66
for a wavelength of 400 nm to about 1,62 for light of wavelength 700 nm.
It follows that a prism will deviate different colours through different angles, forming a spectrum.
If the light incident on the prism contains all visible wavelengths (white light), a continuous spectrum
is produced.
The angle between the emerging red and the violet rays is a measure of the dispersive power of the
prism – the larger the dispersive power the greater the separation. Diamond owes its brilliance partly
to its large dispersive power.
If the incident light contains only certain wavelengths (such as light from a mercury vapour lamp)
then the spectrum will contain a bright coloured line for each of the wavelengths that are present.
As we shall discuss later, the wavelengths emitted by a particular element are characteristic of the
element and can be used to uniquely identify it.
This is the basis of the prism spectrometer, which is commonly used to study the wavelengths
emitted by a light source. It is employed in biology and chemistry to identify molecules (using infra-
red light) and in astronomy to identify elements in distant stars.
Dispersion accounts for the colours of the rainbow, where water droplets are the dispersive medium.
The primary bow is formed by light which has been reflected once in the drops and the secondary by
light which has been reflected twice. For a more detailed explanation, see the prescribed textbook.
2. The new wavefront at any instant is the envelope (tangent) to the secondary wavelets at that
instant.
The principle is illustrated for two common situations in the following diagram (a plane wave on the
left and a spherical wave on the right). In each diagram, AB represents the wavefront at some instant
and CD represents the wavefront a time t later.
As can be seen, while the wave is travelling in a medium of uniform refractive index (so that its speed
does not change), the wavefronts remain plane or spherical, respectively.
Huygens’ Principle can also be used to determine what happens when a wave meets the interface with
another medium, with a different refractive index.
We first use Huygens’ Principle to illustrate the reflection of plane waves from the boundary between
two media.
The wavefront ABC is incident on the interface. After time t the wavefront is A1B1C1, and after a
further time t it is A2B2C2.
5-14
Refraction at an interface
The principle can also be applied to examine the change in direction of a plane wave as it travels from
one medium into another medium.
By using Huygens’ Principle to identify the position of the wavefronts in situations such as those illus-
trated here, and drawing rays perpendicular to the wavefronts, we can construct “ray diagrams” which
indicate only the direction of propagation of the wave.
In geometrical optics, where we are concerned only with the directional properties of light propagation,
we need draw only the ray diagram. This was the approach employed in the earlier discussion of reflec-
tion and refraction – it can be seen that this approach is underpinned by Huygens’ Principle.
5-15
5.2.1. Lenses
Any system in which curved interfaces separate transparent media of different refractive indices is a
lens. Examples are the eye and the glass lens in air. Lenses may also be made from some transparent
plastics.
When the lens is surrounded by air, light refracts from the air into the lens, crosses through the lens,
and then refracts back into the air. Each refraction can change the direction of the light, according to
Snell’s law.
Types of lenses
We deal only with thin lenses (the lens must be thin compared with the object and image distances and
the focal length, all defined below). These are usually circular and the two faces are portions of a sphere.
Each face can be concave, convex or plane. For example:
We confine our attention to rays that strike the lens close to its axis and that make small angles with the
axis; these are called paraxial rays.
Lenses can be converging (which are thicker at the centre than at the edges) or diverging (which are
thicker at the edges).
Rays parallel to the axis that pass through a converging lens all pass through a point on the axis on the
other side of the lens – the focal point.
Rays parallel to the axis appear to diverge from the focal point after passing through a diverging lens.
P is the point where the axis meets the lens, i.e. the pole of the lens, and F is the focal point.
5-16
There is a focal point on each side of the lens and these are always equidistant from the pole if there is
the same medium on each side, irrespective of whether the two faces of the lens have the same curvature.
The distance from the pole to the focal point is the focal length of the lens.
An optical system makes all rays passing through some point on or near the axis pass through some
other point on or near the axis, either directly or in projection. These are called conjugate points.
The point which is the source of the light is the object O and the point conjugate with it is the
image I.
The object is the point associated with incoming light (i.e. rays coming in to the lens).
The image is the point associated with outgoing light (i.e. rays going out from the lens).
An extended object can be thought of as being made up of many point objects. An image is formed of
each, collectively producing an extended image.
5-17
Objects and images can be real or virtual, as defined in the following table.
Real Virtual
Object Rays are diverging before Rays are converging be-
striking the lens fore striking the lens
Image Rays converge when Rays diverge when leav-
leaving the lens ing the lens
Real and virtual objects and images are illustrated in the following diagrams.
Real images are formed at the point at which rays of light actually intersect (as in the left-hand dia-
gram).
Virtual images are formed at the point from which the rays appear to originate (as in the right-hand
diagram).
A real image can be cast onto a screen, whereas a virtual image cannot. A virtual image can however
be seen when the eye focuses the diverging rays onto the retina.
Virtual objects are only of relevance when discussing combinations of lenses (see below).
Ray Tracing
To find the position of an image we must find the point where two or more rays from the object meet
again after passing through the lens. Any two of the following can be used:
(i) A ray parallel to the axis will converge to (converging lens) or diverge from (diverging lens) the
appropriate focal point.
(ii) A ray passing through the other focal point (either directly or in projection) will emerge from the
lens parallel to the axis.
(iii) A ray through the pole will continue undeviated.
Some examples are shown in the following diagrams.
5-18
The diagram below shows rays from a real, upright object incident on a converging lens. A real, inverted
image is formed. From the sign convention, u, v and f are all positive.
AO AO BI BI
tan and tan
OP u IP v
or
BI v
AO u
Similarly, from triangles QPF and BIF we have
QP AO BI BI
tan = and tan
PF f IF v f
or, combining equations
BI v v f
AO u f
Rearranging this equation, we obtain:
1 1 1
(lens equation)
f u v
Although derived for a specific situation, this equation holds for all types of lens, object and image
provided the sign convention introduced earlier is followed.
Magnification
height of image
m
height of object
By convention:
The magnification is positive when the object and image have the same orientation.
If the image is inverted relative to the object, the magnification is negative.
5-20
height of image BI v
.
height of object AO u
For the case considered here the image is inverted relative to the object, so that the sign convention
requires that the magnification is negative. Therefore, since u and v are positive:
v
m (linear magnification defined)
u
This equation holds for any kind of object, image and lens, provided the sign convention introduced for
u, v and f is followed.
Power of a lens
n
P (power of lens defined)
f
If the focal length f of the lens is in metres, its power is in dioptres.
The shorter the focal length the more powerful the lens.
P has the same sign as f.
For a lens in air, P 1 f
Combinations of lenses
The equations derived above allow us to find the position and magnification of a single lens. In practice,
most optical instruments consist of a system of two or more lenses.
If several lenses are used in combination, the image produced by one acts as an object for the next. The
following diagrams give several examples.
In each case the solid lines show the path of rays through the combination of lenses.
Rays do not follow the path indicated by dashed lines, which are included to aid location of virtual
objects and images.
The overall magnification produced by the combination of lenses is the product of the magnifications
of the individual lenses, m m1 m2 m3
In each case the magnitude of the object distance for a lens is the distance from the pole of the lens to
the position of the object, whether or not the rays actually pass through the object position, and similarly
for the image distance.
For example:
In case (ii), the image distance for lens 1 is the distance from the pole of lens 1 to the point I1 (and is
positive).
The object distance for lens 2 in case (iii) is the distance from the pole of the lens to point I1 (and is
positive).
5-22
The object distance for lens 2 in case (iv) is the distance from the pole of the lens to point I1 (and is
negative).
Spherical aberration
Rays far from the axis of a lens with spherical interfaces are not brought to a focus on the focal plane
but at points in front of the focal plane.
This causes a variation in image position with distance of incident rays from the axis, so that the image
is not sharp.
The effect can be reduced by:
employing apertures (referred to as stops) to cut off the rays far off-axis (as is done in a camera),
careful choice of curvatures of the two surfaces of the lens,
use of combinations of lenses.
Spherical aberration is small in the eye since:
the iris cuts out rays far from the axis,
the refracting surfaces are slightly hyperbolic,
the refractive index of the eye lens decreases with distance from centre (the edge is weaker than the
centre).
5-23
Chromatic aberration
Because of the slight variation of the refractive index with wavelength, when light of more than one
colour passes through a lens, different colours will be focussed at different points. Thus the lens will
form a spread-out coloured image of a white object.
For a converging lens the focal length is longer for red light than for blue light, as shown in the diagram
on the left (which, for clarity, exaggerates the difference).
The chromatic aberration of a diverging lens is opposite to that of the converging lens.
The effect can be reduced by combining two (or more) lenses made of different materials with different
refractive indices to form an achromatic doublet. One lens must be converging and the other diverging;
otherwise the dispersion will be increased. The effect cannot be completely eliminated simultaneously
for all colours but can be greatly reduced.
This is not a serious defect in the eye which focuses near the middle of the visible spectrum (about 556
nm).
Light enters the eye through the cornea (a tough transparent skin) where it is refracted. It then goes
through a flexible crystalline converging lens where it is refracted again. Most of the refraction actually
takes place at the cornea, not the lens. This refraction causes an image to be formed on the sensitive
retina, which transmits impulses to the brain via the optic nerve.
5-24
The iris, or coloured portion of the eye, is a muscular diaphragm that automatically adjusts the size of
the pupil, or circular opening in its centre, according to the intensity of the light falling on it.
Focussing of the image on the retina is effected by an alteration in the focal length of the eye lens. This
is called accommodation and is brought about by the ciliary muscles, which vary the thickness, and
hence the focal length, of the lens.
When the eye focuses on a distant object, the ciliary muscles are relaxed. This causes the eye lens to
become flattened, thereby increasing its focal length. For an object at infinity the focal length of the eye
is equal to the distance between the lens and retina, which is about 1,7 cm.
The normal eye can accommodate for clear vision of objects from the far point (infinity) down to the
near point, which increases with age but is on average about 25 cm from eye. Typically the distance to
the near point is about 18 cm at age 10, increasing to 500 cm or more at age 60.
A long-sighted (hyperopic) person can see distant objects clearly but the near point is farther than 25
cm from the eye. This is caused by a mismatch between the focusing power of the lens-cornea system
and the length of the eye.
Correction is effected by a converging spectacle lens which forms a virtual image of a close object at
the eye’s own near point. The eye then focuses on this image.
Presbyopia (literally “old-age vision”) is somewhat similar to long sightedness. As the eye ages, it
becomes less able to accommodate and the near point moves out. This is due to a weakening of the
ciliary muscles and a hardening of the lens material. The symptoms and correction are the same as for
long sightedness.
A short-sighted (myopic) person can focus on nearby objects but not on distant objects.
5-25
Correction is effected by a diverging lens which forms a virtual image at the eye’s own far point of an
object at infinity.
The angular magnification of an optical instrument is defined as the ratio of the angular size of an
object viewed with the instrument to the maximum angular size max that can be achieved without it:
m (angular magnification defined)
max
The simple magnifying lens
This is a single converging lens whose function is to increase the apparent size of a small object whose
distance from the eye can be varied.
The normal human eye can focus a sharp image of an object onto the retina if the object is no closer to
the eye than the eye’s near point, labelled P in the following diagrams.
h
tan
N
5-26
where we have used the small angle approximation (for a small object the angle must be small).
By moving the object closer to the eye, you can increase the angular size and hence the possibility of
distinguishing details of the object. However, because the object is then closer than the near point, it is
no longer in focus; the image on the retina is no longer clear.
Therefore the maximum angular size that can be achieved with the unaided eye is
h
max .
N
If the object is moved closer to the eye than the near point, the retinal image can be brought into focus
again by looking at the object through a converging lens (the magnifying lens), placed so that the object
is inside the focal point of the lens.
The lens then produces a virtual, upright and enlarged image I of the object, which is further from the
eye than the image. The focal length of the lens must be chosen so that the image I is beyond the eye’s
near point P; the eye is then able to focus on this image.
The angular size of the virtual image when the magnifying lens is placed directly in front of the eye is
h
tan ,
f
N
m (magnification of simple magnifier)
f
This formula is valid if the object is placed at the focal point of the lens so that the image produced by
the lens is at infinity. The magnification can be increased somewhat if the final image is formed closer
to the near point (but still beyond it); it becomes N f 1 if the final image is actually at the eye’s
near point.
In practice, the maximum magnification that can be achieved is about 4, because of aberrations of the
lens. This can be increased to about 20 by using two or more lenses to reduce aberrations.
In its simplest form, the microscope comprises two converging lenses of short focal length, the objective
and the eyepiece (with focal lengths less than about 1 cm and a few cm, respectively). The distance L
between the two lenses is much greater than either focal length.
5-27
The microscope is used to view small objects that are placed close to the objective. The object must be
placed just outside the focal point of the objective, which then produces a much enlarged, real and
inverted image I1 of the object. The distance L between the two lenses is adjusted so that this image is
formed just inside the focal point of the eyepiece.
The image I1 produced by the objective acts as an object for the eyepiece which produces the virtual
image I2 which is even further enlarged. This image I2 is the image seen by the eye.
The magnification M of the microscope is the product of the magnifications of the two lenses. The linear
magnification produced by the objective is
vo L fe
mo
uo fo
Since the image produced by the objective is just inside the focal point of the eyepiece, the latter acts
like a simple magnifying lens. The angular magnification of the eyepiece, for an image produced far
from the eye, is therefore
N
me
fe
where N is the distance to the eye’s near point. The overall magnification M mome of the microscope
is therefore
L fe N
M
fo fe
(ii) In practice, the magnification is limited by diffraction of light as it passes through the lenses, as
will be discussed later.
We discuss here the refracting type of astronomical telescope, which consists of two converging lenses,
the objective and the eyepiece, located at opposite ends of a long tube. This differs from the reflecting
telescope, which uses a lens and a curved mirror to form an image.
Rays from a very distant object are essentially parallel on reaching the telescope. A real, inverted image
I1 is therefore formed in the focal plane of the objective. This image acts as an object for the eyepiece,
which produces a final virtual, greatly magnified image I2. It is this image that the eye focuses on.
In normal relaxed viewing the distance between the two lenses is adjusted so that the focal planes of
objective and eyepiece coincide and the final image is at infinity (for clarity, the focal points are shown
slightly separated in the diagram). The two lenses are then separated by the distance fo fe which is the
length of the tube of the telescope.
The function of the telescope is to increase the angle which a distant object appears to subtend at the
eye, and therefore produce the same effect as if the object were either larger or else closer to the eye.
The telescope’s total angular magnification is given by M . From the diagram the angle subtended
at the eye by the final image is:
h
fe
where h is the size of the intermediate image I1, whereas the angle that would be subtended by the object
is:
5-29
h
fo
f
M o
fe
So for high magnification the objective should have a long focal length and the eyepiece a short one.
Note that the final image is inverted – this is not significant for an astronomical telescope. For a terres-
trial telescope, where it is desirable to have an upright image, an additional lens is used to invert the
image.
5-30
5.3.1. Interference
Two waves of the same wavelength travelling in the same direction can interfere constructively (if they
are in phase) or destructively (if they are exactly out of phase).
If two light waves interfere constructively at some points in space and destructively at others, bright
and dark regions can be observed; these are called interference fringes.
Certain conditions must be satisfied if visible interference effects are to occur.
(1) The two light waves must originate from a single source of light.
(2) The path difference between the two interfering waves must not be too great.
(3) The two waves must have identical wavelengths, and also equal amplitudes (otherwise com-
plete destructive interference cannot occur).
The first condition results from the fact that light radiation is not emitted from a source as a continuous
wave, but as a series of wave packets, each of approximate length 10 s. These wave packets all have
the same wavelength and frequency, but differ randomly in phase.
To produce a stable interference pattern, the two travelling waves that interfere must maintain a phase
difference between them that is constant in time. Such waves are said to be coherent.
If two different light sources are used to produce the two travelling waves, the light waves from one
source are emitted independently of the waves from the other source; the phase difference between them
cannot therefore be constant. Consequently the states of constructive or destructive interference will
have durations of the order of 10 s; the eye cannot follow such rapid changes and no interference
effects can be observed. Ordinary light sources are said to be incoherent.
One mechanism for producing two coherent light sources is shown in the diagram below (an alternative
is to use a laser – a source that automatically produces coherent light).
5-31
After passing through the slits A and B the two waves will produce an interference pattern that can be
observed on a viewing screen placed behind the slits. The following diagram shows the apparatus
viewed from above.
This means that the path difference must not be too large.
5-32
Double slit interference was investigated by Thomas Young (1773–1829). His experiments in 1801
proved that light is a wave; all types of wave, including sound waves and water waves can undergo
interference.
A parallel monochromatic beam of light falls normally on two parallel, narrow slits A and B a small
distance d apart. The two beams emerging from the slits are in phase.
The transmitted light is focused on a screen a distance D behind the slits (with D d) by means of a
converging lens. Note that the diagram below is not drawn to scale.
Consider two rays passing through slits A and B which subsequently reach point P on the screen. The
path difference p.d. for the two rays is, from the diagram,
BG d sin .
If the position of P is such that the path difference is an integral number of wavelengths (i.e.
0, λ, 2λ, ), waves from A and B arrive at P in phase and will interfere constructively producing a
bright fringe. For bright fringes, therefore, p.d. = m, m = 1, 2, 3, or
Therefore
D
xm m , m 0, 1, 2
d
The separation of two adjacent bright fringes is
m D m 1 D
xm xm1
d d
leading to
D
Fringe separation
d
For small the fringes are therefore of equal width and equally spaced.
Note that:
No energy has been lost in the dark fringes; it has been re-distributed to the bright fringes. The total
intensity falling on the screen is the same as if there had been no interference.
For the interference pattern to be visible, the fringe separation must not be too small. This requires
that the inter-slit spacing d is not too large compared with the wavelength of the light. In practice, d
will be a fraction of a millimetre, whereas the wavelength is of the order 500 nm.
Diffraction grating
The diffraction grating is based on an extension of two-slit interference. It consists of a very large num-
ber of slits instead of only two. Gratings are often made by ruling very fine lines, called rulings, on a
glass sheet (typically thousands of lines per cm); the untouched spaces between the lines act like slits.
The equations derived above for the double slit also apply in the case of the diffraction grating, with the
slit spacing d being interpreted as the distance between adjacent slits. For the diffraction grating, the
grating spacing is therefore
1
d
number of lines per unit length
The fringes produced by the grating are much sharper than those from two slits only and, because of
the redistribution of energy, the intensity at the peaks is consequently much larger. The peaks are effec-
tively extremely bright, well-separated lines.
5-34
This is illustrated in the diagram, which shows that intensity pattern for two slits, six slits, and a very
large number of slits. Note that since d is much smaller for a grating, the angular separation of the
intensity maxima is in practice much larger (and the peaks are much more intense).
Diffraction gratings are widely used to determine the wavelengths of light emitted by sources of light
ranging from lamps to stars: the wavelength can be deduced from the fringe separation using the formula
derived above.
If white light is used instead of monochromatic light, a different value of satisfies the bright-fringe
equation d sin m for each different wavelength (i.e. colour). Each bright fringe therefore becomes
a spectrum.
However, at the centre of the pattern (where m = 0) the path difference is zero for all wavelengths, so
all colours interfere constructively giving rise to a central white fringe.
The resultant pattern will be a central white fringe bordered symmetrically on either side by differently
spaced coloured fringes, i.e. a white-light spectrum is formed.
In a given order (i.e. m fixed) m is larger for larger (from d sin m m ). Therefore, red light is
deviated more than blue (this is opposite to the spectrum produced by a prism). In fact the red end of
one order can, under certain circumstances, overlap the blue end of the next (higher) order.
5-35
Thin-film interference
An interference pattern can also be produced if light is reflected from the two surfaces of a thin film of
transparent material surrounded by a medium of different refractive index; the thickness of the film
must be comparable with the wavelength of the light. Examples are soap bubbles and thin layers of oil
or petrol on water.
The path difference may be correct for the destructive interference of, say, red light and then the com-
plementary colour, blue-green, will be seen. (Two colours are said to be complementary if when added
together they produce white light.)
For other thicknesses of oil film, destructive interference for other colours will occur.
There are two factors (in addition to the path difference) that determine whether constructive or destruc-
tive interference will occur in thin-film interference.
(i) An electromagnetic wave travelling from one medium towards another medium of higher refrac-
tive index undergoes a phase change of 180 on reflection (this is equivalent to the wave being
inverted, or adding a half wave length to the path length). No phase change occurs if a wave is
reflected at the interface with a medium of lower refractive index.
The transmitted light does not experience any phase change.
(ii) The wave length of light in a medium of refractive index n is reduced compared to the wave
length in air by a factor n.
n
n
To illustrate this we consider the simple case of a film of transparent material of refractive index n
surrounded by air. Monochromatic light is assumed.
The equations that will be derived assume that light is incident normally at interfaces; in the diagrams
near-vertical rays are drawn, but they are shown separated for clarity.
When reflected from surface A, ray 1 undergoes a phase change of radians (or 180) with respect to
the incident ray, because the reflecting medium has a larger refractive index. Ray 2, which is reflected
from surface B, undergoes no phase change since the reflecting medium has a smaller refractive index.
Therefore ray 2 will be 180 out of phase compared with ray 1; this is equivalent to adding n 2 to the
path length. The effective path difference between the two rays is therefore 2t n 2 .
For destructive interference, i.e. dark fringes, the path difference must be an odd number of half wave-
lengths (i.e. n 2, 3n 2, 5n 2, ).
Therefore we require:
2t n 2 (m 12 )n , m 0,1,2,
and it follows from n n that the condition on the film thickness t for dark fringes is:
In summary:
If there is a phase change at either interface (but not both), the conditions for dark and bright fringes
are as derived above.
If there is a phase change at neither interface or at both interfaces, the conditions must be inter-
changed.
The best approach to solving any problem involving thin-film interference is to derive the neces-
sary condition, as is done above for the case of a thin film in air.
Other examples of thin-film interference are:
5-37
The colours in a soap bubble; gravity causes the film thickness to vary, so that an interference pattern
is produced.
The colours in the feathers of some birds (especially peacocks) and in butterfly wings.
Note that there are also other mechanisms producing colours in birds, e.g. pigments and light scat-
tering.
Coated lenses. A thin layer of a material of the correct thickness and refractive index is coated onto the
surface of a lens. Rays reflected from the air/coating and coating/lens interfaces interfere destruc-
tively, thereby greatly reducing the amount of light reflected and increasing the amount transmitted
by the lens. This gives a brighter and sharper image.
5.3.2. Diffraction
An interference pattern can also be produced when light passes through a single slit or aperture (or when
light passes around an obstacle) provided the size of the slit is not too large compared with the wave-
length of the light.
A diffraction pattern is created by interference between wavelets from different points along the slit.
Single-slit diffraction
Consider parallel monochromatic light falling normally on a narrow parallel-sided slit of width w. The
diffracted light is focused by a lens on a screen a distance L behind the slit, with L w.
5-38
AG
sin .
AC w
Then PB PC 2 , and so wavelets from B and C interfere destructively at P, giving zero intensity
at P.
The two secondary wavelets from just above B and C will also interfere destructively at P. The process
is repeated for successive pairs of secondary wavelets, until A and B are reached; each pair will interfere
destructively, and so the total intensity at P will be zero.
Hence, there is a dark fringe for sin . If is made smaller or bigger by a small amount then
w
wavelets cannot be paired off and the resulting intensity will no longer be zero.
2
We now consider a point P such that PA PC 2, equivalent to sin . Rays leaving the slit are
w
shown enlarged in the diagram below.
m
sin m , m 1, 2, 3... (not m 0) (single-slit diffraction, dark fringes)
w
The equation for dark fringes in the single-slit case is very like the one for bright fringes in the double-
slit case, and so care must be taken not to confuse them.
In between the dark fringes are bright fringes, with maxima approximately midway between the dark
fringes.
5-39
Note that the central maximum is twice as wide as the others and considerably brighter. These features
are shown in the diagram above.
If m is small then
xm
sin m tan m
L
and the distance to the m’th dark fringe is given by
L
xm m , m 1, 2, 3....
w
Note the following:
The diffraction caused by an obstacle in the path of light is the same as produced by an aperture of
the same shape and dimensions.
A circular aperture or obstacle (such as a lens in an optical instrument) gives a diffraction pattern of
concentric circular fringes, where the angular deviation of the first-order dark fringe is given by
sin 1, 22
w
Whenever a double slit is used to produce an interference pattern, then the single-slit diffraction
pattern (produced by either slit) will be superimposed on the double-slit interference pattern dis-
cussed earlier.
The single-slit pattern will be wider than the double-slit one (since we must have w < d) and may
cause missing orders in the double-slit pattern if a maximum of one pattern coincides with a minimum
of the other.
When light passes through a small aperture, it is diffracted and spread out. The smaller the aperture the
greater the spread, since sin w . A measure of the angular width of the image is 2 w , the width
of the central maximum of the interference pattern (remember that most of the light intensity is in the
central maximum – see the previous diagram).
5-40
An important property of any optical instrument, including the eye, is its ability to separate two images
of this kind. This is the resolving power of the instrument or eye.
If the two sources are too close to each other, their images will overlap to such an extent that they cannot
be resolved. It is resolution that sets an upper limit on the magnifying power of microscopes and other
optical instruments.
Rayleigh’s Criterion provides a way of estimating the resolving power of an optical instrument. Two
objects are said to be just resolved if the central maximum of the diffraction pattern due to one coincides
with the first minimum of the diffraction pattern due to the other (and vice-versa).
This is illustrated in the following diagrams. Two point sources of light at O1 and O2 each produce a
diffraction pattern on a screen behind a single slit.
The angle in this situation is such that sin for a slit of width w or sin 1,22 for a circular
w w
aperture of diameter w.
Plotting the total intensity for the two sources as a function of position on the screen (or the retina of
the eye) gives the following curves.
The conditions indicated refer to a rectangular slit; for a circular aperture a factor of 1,22 must be in-
cluded.
5.3.3. Polarization
The electric and a magnetic vectors associated with an electromagnetic wave, including light, are at
right angles to each other and also to the direction of wave propagation.
The polarization of light, which is a phenomenon that can be explained only if light is a transverse wave,
is associated with the electric vector (i.e. the magnetic vector is irrelevant as far as polarization is con-
cerned).
A wave is said to be plane polarized (or linearly polarized, or simply polarized) if the electric field
vibrates in the same direction at all times at a particular point in space.
In the diagram above, with the electric field pointing in the y direction and the velocity vector in the x
direction, the wave is said to be linearly polarized in the y direction. The xy plane, which is formed by
the electric field vector and the direction of propagation of the wave, is called the plane of polarization.
Light from a normal source is emitted in wave packets; the plane of vibration of the electric vector in
the waves varies randomly from one wave packet to the next, so normally light is unpolarized.
In the diagram below a light beam is viewed along the direction of propagation, perpendicular to the
page.
Polarized light can be produced in several ways, which include by absorption, by reflection and by
scattering, each of which we consider below.
Polarization by absorption
If a sufficiently narrow slit is placed in the path of the wave, the wave will pass through it if its direction
of vibration is parallel to the slit, or it will be stopped completely if its direction of vibration is at right
angles to the slit.
Note that the vibrations of a longitudinal wave are along the direction of propagation and so no orien-
tation of the slit would affect them.
Certain types of material behave in a similar way as far as the electric vector of a light wave is con-
cerned. Examples of such materials are the naturally occurring crystal, tourmaline, and the artificially
prepared Polaroid sheet (invented by E H Land in 1932).
The discovery that light can be polarized in this way provided the evidence for light being a transverse
wave.
The material from which Polaroid sheets are made is fabricated in thin sheets of long-chain hydrocar-
bons; these are stretched during the manufacturing process so that the molecules are aligned.
5-43
The electric vector of light passing through a sheet can be resolved into components parallel and per-
pendicular to the aligned molecules. The parallel component causes free electrons in a molecule to
oscillate along its length. The molecules therefore readily absorb from the light those components par-
allel to their length and transmit components perpendicular to their length. For this reason, the direction
perpendicular to the molecular chains is referred to as the transmission axis.
The transmitted intensity is less than the incident intensity since some light has been absorbed, but now
the electric vectors in all the wave packets emerging from the sheet vibrate in one plane only; the light
is plane-polarized.
A sheet of Polaroid can be used as a polarizer (to produce plane polarized light) or as an analyzer (to
determine the plane of polarization of polarized light).
If the two sheets of Polaroid in the diagram above have their transmission axes parallel, the light
transmitted by the polarizer will also be transmitted by the analyzer.
If however the two sheets of Polaroid have their axes at right angles (called crossed Polaroids), as
shown in the diagram, then no light is transmitted.
Some materials, for example sugars, cause the plane of polarisation to be rotated when light passes
through them; such materials are called optically active.
The angle through which the plane is rotated can be determined by placing a sample of material
between the polarizer and analyzer and measuring the angle through which the polarizer must be
rotated to again cut out all transmitted light.
Polarization by reflection
Light incident on a transparent medium is partially reflected and partially transmitted. If the medium is
a dielectric (including glass and water), the reflected and refracted rays are partially plane polarized,
the degree of polarization depending on the angle of incidence.
Light is said to be partially plane polarized when the electric fields oscillating along one direction
have greater amplitudes than those oscillating along other directions.
For the special case when the reflected and refracted rays are at 90 to each other, the reflected ray is
completely polarized. This is illustrated in the following diagram, where the dots and lines attached to
the rays indicate the direction of vibration of the electric vector.
5-44
The reflected light is completely polarized, with the electric field vector parallel to the surface.
The transmitted light is partially polarized, with the electric field having greater amplitude in the
plane of the diagram.
We determine the condition for this to happen as follows. As shown in the diagram above, unpolarized
light is incident on the material at angle of incidence i. Since the reflected and refracted rays are per-
pendicular
i r 90 r 90 i .
sin i sin i
n tan i
sin 90 i cos i
Complete polarization of the reflected ray therefore occurs when
Polarization by reflection occurs frequently in nature when, for example, sunlight is reflected from a
horizontal surface covered by water or snow. The reflected electric vector has a large horizontal com-
ponent. Sunglasses made from polarizing material can be used to reduce the glare from the reflected
light; the transmission axis of the lenses must be vertical to absorb the horizontal component of the
reflected light.
5-45
Polarization by scattering
Light scattered by particles small compared with the wavelength of the light (e.g. sunlight scattered by
air molecules and dust particles in the atmosphere) is partially plane polarized.
Bees and homing pigeons are believed to detect this polarized light and use it for navigation.
The scattered intensity increases with increasing frequency of light, I f 4, and so more blue light than
red light is scattered (this is Rayleigh scattering). This accounts for the blue of the sky and the red of
sunsets.
For information about the scattering process and how it polarizes light, see the textbook.
OPTICS
LECTURE EXAMPLES
University of the Witwatersrand, Johannesburg
School of Physics
Physics I D (PHYS1001/6)
1. A pole 3,0 m long stands vertically on the horizontal floor of a pool containing water to a depth
of 1,5 m. Rays from the sun make an angle of 40 with the pole.
Calculate the length of the shadow of the pole on the floor of the pool. [2,09 m]
Hint: consider the path of the ray from the sun that just skims the top of the pole, before passing
through the air/water interface to reach the floor of the pool.
2. A submarine is below the surface in clear water and a helicopter is vertically above it. Which
one of the following statements is true?
(a) As the helicopter descends the submarine seems to come closer to the surface.
(b) To an observer on the submarine the helicopter seems to be closer than it really is.
(c) If the helicopter flies horizontally total internal reflection will never prevent it from los-
ing sight of the submarine.
(d) When the submarine dives the critical angle for total internal reflection decreases and it
will therefore be less easy to spot.
(e) If the submarine moves horizontally, because of total internal reflection it will eventually
lose sight of the helicopter.
[c]
3. A hollow prism, containing air and made from parallel-sided sheets of glass, is immersed in
water. Light incident at an angle of 30 emerges at an angle of 10 as shown. The dashed lines
are perpendicular to the surfaces.
Calculate the refracting angle A of the prism if the refractive index of water is 1,33. [28,3]
Hint: first use Snell’s law to determine the angle of refraction at the first interface and then the
angle of incidence at the second interface. Then relate these angles geometrically to the refract-
ing angle A.
4. A ray strikes one face of an equilateral prism in air at an angle of incidence of 40,0. The re-
fractive index of the prism material is 1,80. Determine the subsequent path of the ray until it
emerges from the prism, showing all relevant angles on your sketch.
[Emerges at 40 to normal at lower interface]
5. When a beam of light is incident on X from the left, some light emerges as shown at 90 to the
incident beam.
Which one of the following could X represent (x indicates angles that are equal within a given
triangle)? [a]
6. A beam of light strikes a plane block of glass at an angle of 50 to the normal. The beam con-
tains two wavelengths of 500 nm and 700 nm. The wavelengths in the glass are found to be
338 nm and 476 nm respectively.
What is the angle between the refracted rays? [0,19]
Hint: first use the wavelengths given to determine the refractive index of glass for each of the
two beams into which the light beam splits.
University of the Witwatersrand, Johannesburg
School of Physics
Physics I D (PHYS1001/6)
1. In the diagram below which ray(s) is/are not correctly drawn, or are they all correctly drawn?
[4]
2. In a drive-in cinema the screen is situated 100 m from the film projector. The image on the
screen measures 6,0 m 4,0 m.
Calculate the focal length of the projection lens required if the frames on the film measure
72 mm 48 mm. [1,17 m]
Hint: what magnification must be produced by the lens?
4. A converging lens of focal length 250 mm is placed 300 mm in front of a diverging lens of fo-
cal length 250 mm. An object of height 5 mm stands 500 mm in front of the converging lens,
as shown below.
6. The diagram shows two lenses mounted next to a metre stick. The numbers give the positions
of the indicated points in mm. Subscripts 1 and 2 refer to objects and images for lenses 1 and 2
respectively.
(a) f1 0 f 2 0 (b) f1 0 f 2 0
(c) f1 0 f 2 0 (d) f1 0 f 2 0
(e) It is impossible to determine the type of lens from the data given.
7. A near-sighted man cannot clearly see objects more than 2,0 m away.
(i) What power spectacles (assumed to be in contact with his eyes) does he need to see dis-
tant objects? [0,50 dioptre]
(ii) If his near point without spectacles is 100 mm, what is it with them? [105 mm]
Hint: the spectacle lens must take a real object placed at his new near point and produce
an image at the eye’s own (unaided) near point, which the eye then focuses on.
University of the Witwatersrand, Johannesburg
School of Physics
Physics I D (PHYS1001/6)
1. A double-slit interferometer has a screen 1,0 m from the slits, and is illuminated by a parallel
beam of normally-incident light with a wavelength range 450 mm 600 mm.
(i) Is the red end of the spectrum of a given order further from the zero-order line than the
blue end? [Yes]
(ii) The interference pattern on the screen is such that 3,0 mm from the zero-order line the
red end of one order is found to be coincident with the blue end of the next order.
What is the separation of the slits? [0,60 mm]
2. A pair of narrow slits is illuminated normally with light of wavelength 640 nm. When a piece
of transparent material 12 m thick is placed opposite one slit the third-order bright image is
found where the ninth order used to be.
Calculate the refractive index of the material. [1,32]
Hint: first write down the expression for the angular position of the 9th order image. Then de-
rive the condition for the angular position of the 3rd order image when the slab of transparent
material has been inserted (remember that the slab changes the optical path difference).
3. A pair of narrow slits is illuminated normally with monochromatic light and its interference
fringes observed on a screen (with air in the gap between the screen and slits).
Water (of refractive index 4/3) now fills the whole space between slits and screen. What order
bright image will now fall where the ninth order used to be? [12th]
Note: the answer does not depend on the slit separation or wavelength of the incident light; use
symbols for these quantities.
4. A diffraction grating is ruled with 400 lines per mm, and is illuminated with light from atomic
hydrogen at normal incidence. The and lines of hydrogen have wavelengths of 656 nm and
410 nm respectively.
(i) Calculate the angular separation, in the second-order spectrum, between the and
lines. [12,5°]
(ii) What is the highest order that is possible for each of these lines? [: 3, : 6]
Hint: sin cannot exceed 1.
5. In order to reduce reflection from an optical surface, such as the surface of a lens, the surface is
often coated with a thin film of MgF2 (of refractive index 1,38).
(i) Determine the minimum thickness of a coating that will minimize reflection for normal
incidence of light near the middle of the optical spectrum, with 552 nm. [100 nm]
Hint: you must first decide what the condition for destructive interference is for the com-
bination of refractive indices shown in the diagram above.
(ii) What wavelength of light nearest 552 nm will undergo a maximum reflection for this
thickness of coating? [276 nm]
6. Light of wavelength 560 nm is incident normally on a single slit, producing a diffraction pattern
on a screen 1,00 m behind the slit.
When the separation of the slit and screen is increased to 1,25 m and the fringe pattern is refo-
cused, it is found that the third-order minimum has moved 5,0 mm from its previous position on
the screen.
Calculate the width of the slit. [84 m]
7. The rails on railways in South Africa are about 1,1 m apart. How high can a plane be above the
railway before the pilot can no longer resolve the rails?
Assume that the pupil of the eye has a diameter of 3,0 mm and that the wavelength of the light
is 550 nm. [4,9 km]
PHYS 1001/1006 TUTORIALS
YEAR 2018
4th BLOCK
TUTORIALS TO PREPARE
A tutorial test will then be given at the end of each session (approximately
10 minutes). Tutors are expected to give feedback on the tutorial test at the
beginning of the next session.
Question 1
The velocity of light in a vacuum can be determined by measuring the change in wavelength, from 1
to 2, when light passes from a medium of refractive index n1 to one of refractive index n2.
Derive an equation for the velocity of light in a vacuum, c, in terms of these quantities and the frequency
f of the light.
Question 2
A man whose eyes are 1,8 m above the ground stands 2,4 m from the edge of a brim-full swimming
bath 2,0 m deep.
How far from the wall nearest him is a point on the bottom of the swimming bath that he can just see?
Refractive index of water = 1,33.
[1.51 m]
Question 3
A small object is placed at the bottom of a tank of water which is 200 mm deep.
(i) Calculate its apparent depth below the water surface.
(ii) A block of flint glass 50 mm thick is placed over the object in the water so that the surface of
the water is 150 mm above the top of the block. Calculate the apparent depth of the object below
the water surface.
Refractive index of flint glass = 1,67. Refractive index of water = 1,33.
[ (i) 150 mm (ii) 143 mm]
Question 4
(i) Because of total internal reflection the frog cannot see the fly until the fly gets to a certain
minimum distance above the water surface.
(ii) To the frog the fly appears to be higher above the water surface than it really is.
(iii) To the fly the frog appears to be further below the water surface than it really is.
(iv) Because of total internal reflection the fly cannot see the frog if the fly is less than a certain
distance above the water surface.
(v) The path of the rays from the frog to the fly depends on the colour of the frog.
[ (ii), (v)]
Question 5
(i) Determine by calculation whether the ray will be totally-internally reflected at face 2.
(ii) Calculate what angle the ray will make with the normal when it emerges from face 3.
[ (i) It will be (ii) 30o ]
Question 6
Which one of the diagrams below is correct?
[e]
3
Question 7
Light is incident on one face of a glass prism in air, as shown in the diagram. The refractive index of
the prism is 1,60. The top face of the prism is covered with a parallel-sided layer of oil.
(i) Calculate the refracting angle A of the prism if the light is just totally reflected at the oil/air
interface when the angle of incidence i is 70°.
(ii) Calculate the refractive index of the oil if the light is just totally reflected at the glass/oil inter-
face when i is 13°.
Question 8
(i) For a prism with a refracting angle of 60° the angle of minimum deviation is 37,18°. Calculate
the refractive index of the prism material.
(ii) On either side of the position for minimum deviation there are two values of the angle of inci-
dence for which the angles of deviation are equal. For this prism, calculate the deviation for an
angle of incidence of 63,46°, and determine the other value of the angle of incidence which
gives the same deviation.
[ (i) 1,50 (ii) 40.00o , 36.54o]
4
Question 1
A transparency measuring 24 mm 36 mm is to be projected so that the image is 1,2 m 1,8 m. The
projector lens has a focal length of 50 mm.
(i) How far from the lens must the slide be placed?
(ii) How far from the lens must the screen be placed?
[ (i) 51 mm (ii) 2.55 m ]
Question 2
The diagram shows rays passing through two lenses 1 and 2.
Question 3
A telephoto lens for a camera consists of a converging lens of focal length 50 mm and a diverging lens
of focal length 50 mm placed 20 mm apart, with the diverging lens nearer the film.
(i) Calculate the distance from the diverging lens to the film if a bird 2 m from the converging lens
is in sharp focus on the film.
(ii) Calculate the height of the image if the bird is 300 mm high.
Question 4
A camera has a lens of focal length f1 = +50 mm.
(i) How large is the image formed by this lens of an object 10 mm high situated 250 mm in front of
the lens?
(ii) A diverging lens, with a focal length f2 = 50 mm, is now placed 25 mm in front of the converging
lens. How large is the new final image if the object is 250 mm in front of the diverging lens?
[ (i) -2.5 mm (inverted) (ii) -5 mm (inverted)]
Question 5
A near-sighted man cannot focus on objects further than 0.5 m from his eyes, and acquires spectacles
which are 20 mm from his eyes.
(i) Calculate the power of the spectacle lenses that he would need to see distant objects.
(ii) If his near point is 30 mm from his eyes without spectacle, what will it be with his spectacles.
[ (i) -2.08 dioptre (ii) 30.2 mm]
Question 6
A telescope consists of two converging lenses. The objective has focal length 1 m and the eyepiece
focal length 40 mm. The image of a star is formed at the minimum distance of distinct vision (250 mm).
(i) What is the separation of the lenses?
(ii) Explain whether the image is inverted or upright.
[(i) 1.03 m, (ii) Inverted]
6
Question 1
Light of wavelength 560 nm strikes a double slit AB with slit separation 1,4 m at an angle
= 30 as shown. An observer looks at the interference pattern formed on a screen placed on the oppo-
site side of the double slit.
At what angles to the normal AN, within the angle NAB, does he see interference maxima?
[5.7o, 30o, 64.2o]
Question 2
The diagram represents a double-slit interferometer set up in air and illuminated by parallel, normally-
incident light. The slits are 0,6 mm apart, and an interference pattern is formed on a screen 1 m from
the slits.
(i) When monochromatic light is used the fringe separation on the screen is 0,9 mm. What is
the wavelength of the light?
(ii) A thin parallel-sided sheet of mica, of thickness 13,6 m and refractive index 1,60, is placed
over the lower slit. What is the order of the bright fringe now at O?
(iii) The illumination is now changed to white light. Where is the white fringe formed on the
screen? Ignore dispersive effects in the mica.
[ (i) 540 nm, (ii) 15 (iii) 13.5 mm below O]
7
Question 3
White light, with wavelengths ranging from 400 nm to 700 nm, shines normally onto a grating ruled
with 500 lines per mm.
(i) What is the angular width of the first-order spectrum?
(ii) What is the order of the first spectrum whose red edge overlaps the violet end of the next spec-
trum?
[ (i) 90o, (ii) 2nd ]
Question 4
A thin wedge of air is trapped between two glass plates. It is illuminated from vertically above with
light consisting of two wavelengths, 400 nm and 600 nm.
For what thickness of the wedge in the range 100 nm to 700 nm will there be a dark fringe in the
reflected light? Assume that there are no interference effects within the glass itself.
[600 nm]
Question 5
Light of wavelength 633 nm shines from a laser on to a thin smear of blood on a glass slide. The first-
order dark ring formed on a screen 1 m from the slide has a diameter of 0,2 m.
Calculate the diameter of the diffracting blood corpuscles.
[7.7 μm]
Question 6
The Viking Mars Lander had to choose a suitable landing place on mars. In order to do this it had to be
able to resolve objects 500 mm apart.
From what distance from the surface of Mars did it have to send back pictures from its TV camera,
which had a 100 mm diameter aperture?
Assume that red light of wavelength 630 nm was used.
[65.1 km]
6-1
6. MODERN PHYSICS
The intensity of the radiation at a particular temperature is the area under the curve for that temper-
ature. According to Stefan’s law it depends on the fourth power of the temperature (in kelvin).
The peak in the curves shifts with temperature. As the temperature increases the peak moves to-
wards lower wavelengths (i.e. higher frequencies).
The wavelength max for maximum intensity at any temperature T (in kelvin) is given by a formula first
discovered empirically by Wien:
max T 2.90 103 m.K (Wien’s displacement law)
For a body with a surface at room temperature (about 27C or 300 K), max 10 m, which is in
the far infrared part of the spectrum, and the intensity of the radiation is very low.
The peak for an object at 1000 K is still in the infrared, with max 3 m 3000 nm, and we can
feel the radiation as heat. The intensity in the visible region is sufficient that the object appears to
glow red.
6-3
The surface temperature of the sun is about 5800 K, for which max 500 nm. The peak is therefore
well within the visible region of the electromagnetic spectrum (see the curve for 6000 K).
Classical physics was able to explain the existence of thermal radiation:
Electromagnetic theory predicts that an oscillating electric charge emits electromagnetic radiation,
so that thermal radiation was interpreted as being due to oscillations of charges in the molecules of
the hot body.
The thermally agitated charges can have a distribution of frequencies, so that the spectrum of radi-
ation emitted will be continuous in frequency and hence wavelength.
As the body becomes hotter, the frequency of the oscillations increases and thus the wavelength of
the radiation decreases.
Theoretical attempts to explain the shape of the spectrum were however unsuccessful.
In 1900, Max Planck (1858–1947) proposed an empirical formula that fitted the data. This formula
contained an adjustable parameter h, now called Planck’s constant, whose value Planck found by fitting
the formula to the experimental curve; the modern value is h J.s.
In order to explain his formula, Planck made the radical proposal that the energy of any molecular
vibration with frequency f is given by
E nhf , n 1,2,3, (quantum hypothesis)
i.e. the molecular energy of vibration is quantised.
This is Planck’s quantum hypothesis, which was the beginning of the development of quantum
theory.
Planck’s explanation was not generally accepted by scientists (including Planck himself) until Einstein
extended the quantum hypothesis a few years later to explain the photoelectric effect.
6-4
Provided light of a sufficiently high frequency is used, the ammeter indicates that a current is flowing.
Thus, electrons must be moving across the photocell from the metal plate P to the collector C.
The following effects are observed as the frequency and intensity of the incident light are varied.
If the frequency f of the incident light is below a certain minimum value f0, called the threshold
frequency, no electrons are emitted, even for a very intense beam of light.
For frequencies above f0, electrons are emitted almost instantaneously (less than s after the
surface is illuminated).
The number of electrons emitted per second (the current) is proportional to the intensity of the light,
as shown in the diagram below.
If the polarity of the supply is reversed and its voltage increased slowly, it is found that the current
drops. The electrons released from the plate P are being repelled by the collector C, which is now at a
negative potential relative to the plate.
At reverse voltage V, only those electrons whose kinetic energy on leaving the plate P exceeds eV can
reach the collector; those with smaller energies will be turned back.
6-5
The maximum kinetic energy of the electrons emitted from the plate, KEmax, is related to the stopping
voltage through the relationship
KEmax eVstop
We now discuss the photoelectric effect in terms of the wave theory of light, the standard theory at the
beginning of the 20th century, and particle theory of light introduced by Einstein in 1905. We assume
that the incident light is monochromatic.
Photoelectric effect and wave theory of light
Classically, the incident radiation contains an oscillating electric field. A negatively-charged electron
in the metal plate should oscillate in response to the field, and thereby absorb energy from the field. If
the amplitude of the electron’s oscillations is large enough, it should break free from the surface of the
target; i.e. it should be ejected from the target to form the observed current.
Wave theory predicts:
(a) The photoelectric effect should occur at any frequency of the incident light, provided the light
intensity is sufficiently high; i.e. there should be no cut-off frequency.
(b) The photoelectrons should require a significant amount of time to absorb from the incident radia-
tion enough kinetic energy to escape from the metal, particularly at low intensities.
(c) If the intensity of the incident light is increased, more energy is carried into the metal per unit
time; hence electrons of higher kinetic energy should be ejected, i.e. KEmax should increase with
intensity.
(d) The frequency of the incident light should not affect the kinetic energy of the ejected electrons –
KEmax should depend only on the intensity of the light.
6-6
Thus the wave theory of light can explain the formation of the photoelectrons but is unable to explain
many observed features of the photoelectric effect.
Photoelectric effect and particle theory of light
Einstein reasoned that if the energy of the molecular oscillations that produce light is quantised, as
suggested by Planck, then the energy of the resulting radiation should also be quantised.
If an oscillator initially in a state of energy nhf emits energy, it must make a transition to a state of
energy n 1 hf ; the conservation of energy requires that radiation of energy hf will be emitted.
Light should therefore travel through space in localised packets or quanta (now called photons)
each with an energy E hf, where f is the frequency of the light, which is equal to the frequency of
the oscillator.
In a monochromatic beam, all photons have the same energy hf. In the photocell, an electron is ejected
from the metal by a single collision with a single incident photon; the electron absorbs all the photon’s
energy hf and the photon disappears.
Some minimum energy W0 (called the work function of the metal, typically a few eV) is required to get
an electron just out of the surface of the metal. If hf < W0 an electron cannot be ejected.
If hf W0 then, from the conservation of energy, an electron will be emitted with kinetic energy given
by
hf KEmax W0 (photoelectric equation)
For electrons more tightly bound in the metal, the electron has a correspondingly smaller kinetic energy
when it emerges from the metal; thus the kinetic energy predicted by the photoelectric equation is the
maximum value possible.
The photon theory predicts:
(a) An increase in intensity of the light beam means that more photons are incident, so more electrons
will be ejected. But since the energy of each photon is not changed, the maximum kinetic energy
of the electrons is not changed; i.e. KEmax is independent of the intensity of the light.
(b) If the frequency of the light is increased, the maximum kinetic energy of the electrons increases
linearly according to KEmax = hf – W0.
(c) It follows from the equation KEmax = hf – W0 that no electrons can be emitted unless hf W0.
Therefore there is a cut-off frequency f0, where
W0
f0 (cut-off frequency)
h
(d) Since an electron is ejected as the result of a single collision with an incident photon, it does not
have to wait to absorb sufficient energy to escape; it receives the required amount of energy all at
once (provided hf W0).
These predictions explained the facts as known in 1905. Further experiments carried out by R.A. Mil-
likan in 1913–1914 had results fully in agreement with Einstein’s photon theory; in particular, Milli-
kan’s experiments were in agreement with the Einstein’s photoelectric equation.
Planck’s constant can be determined from the slope of a plot of KEmax against frequency; the value
obtained agrees with that deduced from the spectrum of black-body radiation.
6-7
It should be noted that the Bohr theory is very successful for hydrogen and ions with a single electron,
but is unable to explain the properties of atoms with two or more electrons with any accuracy.
Since energy must be conserved in this process, the energy of the quantum emitted is
Ephoton E Einitial Efinal
where Einitial is the energy of the upper electron state and Efinal that of the lower electron state. The energy
of the emitted photon is related to its frequency and wavelength by
hc
Ephoton hf (photon wavelength)
where h is Planck’s constant and c is the speed of light.
Since E for any pair of energy levels varies from one element to another, the frequencies emitted
by an element are characteristic of it and can be used to identify it.
The energy levels involved in optical transitions are generally those far out from the nucleus where
the energy-level differences are comparatively small (as opposed to X-ray line spectra which will
be discussed later).
Consider the example of hydrogen, referring to the energy-level diagram reproduced on the left.
The K shell would then be unpopulated, and the atom would after a very short time interval return
to its original state by the emission of electromagnetic radiation in the form of a photon of energy
10,2 eV (and wavelength 122 nm, from Ephoton hc ).
To be excited from the ground state to the n = 3 state, the atom would have to absorb
13,6 1,5 12,1 eV of energy.
The electron could subsequently move directly back to the ground state, or it could first move to
the n 2 state by emitting a photon of energy 3,4 1,5 1,9 eV and wavelength 656 nm.
Experiments carried out towards the end of the 19th century showed that the spectrum emitted by hy-
drogen contained radiation of just a few wavelengths, including those calculated above. In particular,
the radiation of 656 nm produces a bright red line when the spectrum is analysed.
The spectra described above are referred to as emission spectra; radiation is emitted during atomic
transitions. An absorption spectrum is produced when cooler vapour absorbs light of those frequen-
cies which the vapour itself would emit at higher temperatures. The absorption spectrum consists of
dark lines on a continuous coloured background, the dark lines resulting from photons absorbed from
the incident light.
For example:
6-11
If white light passes through sodium vapour, the sodium atoms will absorb from the incident light
photons of the right frequency to cause transitions in the sodium atoms.
The transmitted light will then consist of the continuous incident white light minus the frequencies
which have been absorbed.
Each dark line in the absorption spectrum of a particular element coincides exactly with the bright
line seen in the emission spectrum of the same element.
6.2.3. X-Rays
In 1895 Wilhelm Röntgen (1845–1923) discovered that when a beam of fast-moving electrons struck
the end of the tube in which they had been produced, highly penetrating radiation was emitted. He called
this radiation X-rays. In 1912 Max von Laue (1879–1960) established the wave nature of the radiation
by diffracting X-rays with a crystal (diffraction is a wave phenomenon).
Production of X-rays
X-rays are electromagnetic waves of very short wavelength (less than about m), i.e. very high
frequency and energy. They are usually produced by allowing high-energy electrons to strike a metal
target. A typical X-ray setup is shown below.
A current in the filament causes electrons to be emitted, and these are accelerated towards the target
which is held at a much higher potential than the filament. Some of the electrons incident on the target
lose kinetic energy in collisions with atoms of the target; this energy is emitted as X-rays, through
processes described below.
The accelerating voltage V that must be used in an X-ray tube is determined by how the X-rays will be
employed. For example, when used for medical diagnosis, voltages up to about 150 kV are appropriate,
whereas for medical therapeutic purposes voltages are in the approximate range 250 kV to 4 MV.
The target used in the X-ray tube must have particular properties:
The target must have a high melting point, since most of the kinetic energy of the electrons (about
99%) is dissipated inside the target as heat. The target is water- or oil-cooled because of the large
heat dissipation.
The target material must have a high atomic number, since the energy-level differences in light
atoms are too small to generate X-rays. Tungsten and molybdenum are often used in targets.
6-12
The target and filament are encased in an evacuated tube to prevent collisions between electrons
and air molecules.
There are usually two components to the spectrum produced by an X-ray tube: a continuous spectrum
and a line spectrum (first observed in 1908), which is superimposed on the continuous spectrum and is
not always present.
The continuous spectrum depends on the voltage applied to the tube.
The line spectrum is characteristic of the target material.
kinetic energy before the collision is eV. Therefore, from the conservation of energy in the collision,
the maximum photon energy is given by
hc
eV Emax
min
where min is the photon wavelength corresponding to the maximum photon energy:
hc
min
eV
It follows that the continuous spectrum should have a short wavelength cut-off, min, which is the same
for all target elements at a given accelerating voltage; this feature is illustrated in the previous diagram
for W and Mo targets.
Electrons that lose smaller amounts of energy in each of several collisions emit lower energy X-rays,
and therefore larger wavelengths than the minimum.
This transition is accompanied by the emission of a photon whose energy equals the energy differ-
ence between the two atomic shells. This is the same process that was described earlier for hydro-
gen, except that the energy differences are much larger (usually greater than 1 keV) so that wave-
lengths are much smaller (0,01 – 1,0 nm); X-rays are produced rather than radiation in the visible
range.
This first transition leaves a further vacancy in the higher-energy shell; this is filled by an electron
dropping from a yet-higher shell, and a second photon is emitted, and so on.
Many of the features described above can be illustrated by consideration of the spectrum for molyb-
denum at an accelerating voltage of 35 keV, shown in the next diagram.
The equation min hc eV predicts that for V 35 kV the continuous spectrum should have a
minimum at a wavelength of 35,5 pm, as is observed.
Sharp lines are observed with wavelengths of about 63 pm and 71 pm. From E hc , these cor-
respond to the emission of photons of energy 19,6 keV and 17,4 keV, respectively.
Using these and other measured transition energies, an energy level diagram can be constructed for
molybdenum. The lowest few levels are shown schematically in the diagram below.
The lines are labelled according to the energy levels involved in the transition. The K lines are formed
when an electron drops to the K shell, and so on. The lines are further labelled etc. to indicate
that the electron falls from the next highest level, the one above that, and so on.
6-15
Note that the process of stimulated emission produces two identical photons, the incident photon and
the emitted photon, and these are exactly in phase. These photons can stimulate or induce other atoms
to emit identical photons in a chain of similar processes. The many photons produced in this way are
the source of the intense, coherent light produced by a laser.
Several conditions are necessary for the operation of a laser:
As indicated, normally most atoms in a sample will be in the ground state; when light is incident
on a gas of these atoms the net result is the absorption of energy. For the laser to work we require
more atoms to be in an excited state rather than the ground state; this is called population inversion.
How this is achieved depends on the type of laser.
A further condition is the existence of a metastable state in the atom, an excited energy level whose
lifetime may be s or more instead of the usual s. This is necessary in order that stimulated
emission takes place before the excited state decays by spontaneous emission.
In addition, the emitted photons must be confined within the laser long enough to allow them to
induce further emissions from other atoms in metastable states. This is achieved by placing reflect-
ing mirrors at the two ends of the laser tube; one mirror is totally reflecting and the other is slightly
transparent to allow the laser beam to leave the laser.
A common type of laser is the helium-neon laser, invented in 1961, which contains a mixture of helium
and neon gases (in the ratio 20:80). The figure shows simplified energy-level diagrams for the two
atoms.
A high voltage applied to the laser tube causes electrons to sweep through the tube, colliding with
atoms of the gas and raising them to excited states, including the metastable state at energy
E3 = 20,61 eV in helium.
Neon contains a state at energy E2 = 20,66 eV, which is very close to the energy of the metastable
state in helium. Therefore, when a metastable helium atom collides with a neon atom in its ground
state, the excitation energy of the helium atom is often transferred to the neon atom which is left in
the state of energy E2.
6-17
The metastability of the helium level E3 ensures a ready supply of neon atoms in level E2. In this
way the state of neon with energy E2 becomes more heavily populated than the state at lower energy
E1, i.e. there is a population inversion.
The coherent beam of wavelength 632.8 nm that the laser emits results from transitions from the
state of neon with energy E2 to the state at energy E1 (which then decays rapidly to the ground state
via intermediate levels not shown in the diagram).
6-18
Some properties of atomic particles are compared in the following table. Note that the masses of the
neutron and proton are very similar, each having a mass about 1840 times that of the electron.
The atomic mass of an atom is its mass expressed in (unified) atomic mass units; this is defined in such
a way that the mass of the neutral 12C atom is exactly 12 u. In terms of the standard SI unit of mass:
1 u 1,661 × 10 kg
1 kg 6,022 × u
The mass of a nucleus is smaller than that of the corresponding atom by an amount that is approximately
(but not exactly) equal to the mass of the electrons in the atom.
A nucleus is described by three numbers:
(i) Atomic number Z: This is the number of protons in the nucleus, and equals the number of orbital
electrons in the neutral atom. Every element has a different Z, and this gives the position of the
element in the Periodic Table, and determines the chemical properties of the element.
(ii) Neutron number N: This is the number of neutrons in the nucleus.
(iii) Mass number A: This is the total number of nucleons (neutrons plus protons) in the nucleus:
A = Z + N. The mass number is usually the integer nearest to the atomic mass in u.
A A
The symbol for a nucleus is written Z X or Z X N , where X represents the chemical symbol for the
1 4 238
element; e.g. H , He ,
1 2 92 U.
Isotopes are forms of the same element which differ in the number of neutrons in the nucleus. They
have the same atomic number (which characterises the element) but different mass number.
For example, there are three isotopes of hydrogen: 11 H (ordinary hydrogen), 12 H (deuterium) and 13 H
(tritium).
(i) Hydrogen is the simplest and lightest atom; a single electron is in orbit around a nucleus consisting
of a single proton.
6-19
(ii) The next heaviest atom is deuterium; its nucleus (the deuteron) consists on a neutron and a proton,
and one electron is in orbit around the nucleus.
(iii) Tritium has an unstable nucleus (the triton) containing one proton and two neutrons, and a single
electron is in orbit about the nucleus.
Note that in each case the atom contains one proton and one electron; they are all forms of hydrogen
and have similar chemical properties; the nuclear properties are however quite different because of the
different numbers of neutrons.
Binding energy
If we add up the mass of the constituent nucleons of any nucleus, the total is greater than the mass of
the nucleus by an amount called the mass deficit; if we wish to separate the nucleus into its constituent
nucleons we would have to create this extra mass.
Consider the mass of 12C as an example: if the masses of 6 hydrogen atoms and 6 neutrons are added
together the total is 12,0989 u and not 12,0000 u which, by definition, is the mass of an atom of 12C.
(Note that we use the mass of the hydrogen atom here, rather than the mass of the proton, to allow for
the fact that the neutral atom contains 6 electrons in addition to the protons and neutrons).
Mass of 6 hydrogen atoms 6 × 1,007825 u = 6,04695 u
Mass of 6 neutrons 6 × 1,008665 u = 6,05199 u
Total mass of nucleons 12,09894 u
Mass of carbon-12 atom 12,00000 u
Thus in order to take the atom apart, we must create extra mass 0,09894 u.
Einstein had already in 1905 proposed the mass-energy equivalence: he showed that matter of mass m
can, under suitable conditions, be converted into energy E (and vice versa) given by
E mc 2
Using this equivalence, it is easily shown that
1 u = 931,5 MeV (mass-energy equivalence)
The energy equivalent of the mass deficit of a nucleus is called its binding energy B. It is equal to the
amount of work required to split the nucleus into its component protons and neutrons.
In the case of carbon-12, the energy equivalent of the mass difference 0,09894 u is the binding energy
of carbon-12 which, using the mass-energy conversion factor above, is 92,2 MeV – the energy required
to split the nucleus 12C into its component six protons and six neutrons.
Note:
The size of the binding energy is a measure of the strength of the force that holds the nucleons
together in the nucleus – the strong nuclear interaction.
If this force were weaker, nuclei would fly apart because of the repulsive Coulomb force be-
tween the protons.
We sometimes use the binding energy per nucleon, B/A, rather than the binding energy itself. For ex-
ample for carbon-12, B/A = 7,68 MeV.
Plotting B/A versus A for the elements gives the following curve:
6-20
For most nuclei (but not the very lightest), B/A 8 MeV. The curve peaks at about 8.7 MeV near
A = 60 and then decreases slowly as mass number increases.
This curve can be used to explain how energy is obtained from fission and fusion reactions (see later).
In these diagrams the direction of the magnetic field is into the page, as
indicated by the symbol ×.
rays emerging from a radioactive source are not deviated by the mag-
netic field, indicating that they are uncharged.
6-21
rays are deviated to the left, indicating that they carry a positive charge.
There is very little dispersion of the beam of particles, indicating that
there is little variation in their speeds.
The particles are deviated to the right, showing that they are negatively
charged, and are denoted .
There is a second type of radiation, less common, that would be devi-
ated to the left, since they carry positive charge. They are denoted .
The considerable dispersion in the paths shows that the particles have
widely varying speeds.
particles
particles are helium nuclei, i.e. helium atoms stripped of their orbital electrons. Thus an particle
consists of two protons and two neutrons bound together. Some properties are shown in the following
table.
Charge: Equal to two proton charges (+2e).
Mass: About 4 times that of the hydrogen atom.
Ionization: Intensely ionizing, i.e. they produce a large number of ion pairs per unit
distance as they travel through matter.
Range: Because of above strong interaction, their range in matter is small, e.g.
about 10–100 mm in air and about 1/100 mm in solids.
Velocity: Up to 1/20 that of light.
A general decay is written
A
Z P ZA42 D 42 He
where P denotes the parent nucleus and D the daughter nucleus. For example:
238
92 U 23490Th 24 He (1/ 2 4,5 109 yr) .
Note that total Z and total A remain unchanged during the reaction; i.e. charge is conserved and the
number of nucleons is conserved.
These two important conservation laws hold for all nuclear transformations, as do the conservation
of momentum and mass-energy.
Measurements show that during the decay the total mass does not remain constant – in fact it decreases
by an amount
M M P M D M He
where M represents an atomic mass. The lost mass is actually converted into energy, as determined by
E mc 2 , so the amount of energy released in the decay of the nucleus P is
M c2 M P M D M He c 2
6-22
No. of
alpha
particles
Kinetic Energy
The spectral line of highest energy corresponds to the daughter nucleus being left in the ground
state. If we ignore the recoil energy of the daughter nucleus, the kinetic energy of the particle
is M P M D M He c 2 .
The existence of the other lines indicates that the daughter nucleus has been left in an excited
state. The excitation energies of the daughter nucleus can be deduced from the spacing of the
lines.
particles
There are actually two types of decay; by far the most common is decay in which an electron is
emitted. Less common is decay, in which the particle emitted is the antiparticle of the electron, the
positron e; these are effectively positively-charged electrons.
Some properties of particles are listed in the following table.
Charge: Can be either negative ( e) or positive ( e)
Mass: Very light particles.
6-23
The spectrum of particles is quite unlike that of particles; it is a continuous spectrum rather than
a line spectrum. This is because the particle is emitted together with another particle, the neutrino ,
which carries off some of the energy released in the reaction. The particles are therefore emitted with
a large range in energies, from zero up to a maximum which is characteristic of the emitter, with the
neutrinos taking the rest of the energy.
The existence of the neutrino was predicted by Wolfgang Pauli in 1930, based on the apparent violation
of the conservation of energy and momentum in decay. It is uncharged, has zero rest mass (or a very
small mass) and interacts extremely little with matter; neutrinos were first detected experimentally in
1956.
There are two kinds of neutrino associated with decay, the neutrino itself, and the antineutrino . In
decay a neutron is turned into a proton:
n p e A
Z P Z 1A D e
decay leads to an increase of atomic number Z by one, with the mass number A unchanged. In
decay, Z decreases by one. Overall, the charge and the number of nucleons are conserved.
Examples of decay:
234
90 Th 234
91 Pa e ( 1/2 24,5 days)
234
91 Pa 234
92 U e
( 1/2 68 s)
Note that:
The electrons emitted in decay are not orbital electrons, but originate through the process
n p e inside the nucleus.
rays
These are electromagnetic waves of very short wavelength (less than about 0,1 nm).
6-24
As discussed in connection with decay, when some nuclei decay by or emission the daughter
nucleus may be left in an excited state. The daughter nucleus subsequently returns to its ground state
by emission of energy in the form of radiation.
This process is similar to the emission of X rays during atomic transitions, except that the energies
involved are usually much larger and the photon wavelengths consequently much smaller.
Since they are uncharged and cannot cause ionization, rays can penetrate several mm of lead. Their
intensity is gradually attenuated in their passage through matter.
The intensity of a beam of rays or X rays passing through an absorbing material decreases exponen-
tially with increasing thickness of material.
For a thin layer of absorber, the fractional decrease in in-
tensity is proportional to absorber thickness.
I dI
x or I
I dx
where is called the linear absorption coefficient.
The minus sign is inserted since the intensity decreases
with increasing thickness.
The coefficient is a constant for a particular material for or X rays of a given energy. It has SI unit
m.
The absorption can be described by other parameters, such as the half value layer x. This is the
thickness of absorber necessary to reduce the incident intensity to half its original value.
1
When x = x, then I I 0 2 . From I I 0 exp x we see that exp x1/ 2 , giving
2
ln 2 x1/ 2 or
ln 2
x1/ 2
A good absorber has a small value of x and a large value of .
Materials of large Z are generally good absorbers of and X radiation (e.g. lead shielding).
6-25
ln 2
1/ 2
Thus, a large value for the decay constant means a short half-life, and vice versa.
6-26
The diagram below shows a plot of the number of atoms remaining as a function of time for a particular
radioactive sample.
The half-lives of naturally occurring radioisotopes for and emission vary over a wide range: for
example 6,5 × 10 yr (for the decay of W) and 3,0 × 10 s (for the decay of Po). Half-lives
for emission can be much smaller.
Ma M A MB Mb
As discussed in connection with decay, during a nuclear reaction mass is converted into energy, or
vice versa, according to E mc2.
The conservation of mass-energy must be applied to all transmutation reactions, i.e.
If mass on LHS < mass on RHS Energy must be supplied to the system if the reaction is to
(endothermic reaction) take place (to create the additional mass). This is done by
giving sufficient kinetic energy to the incident particles.
If mass on LHS > mass on RHS Excess energy released in the reaction appears as kinetic
(exothermic reaction) energy of the particles produced.
Fission
In a fission reaction, a heavy isotope, e.g. U or Pu, splits into two fragments, each of mass number
around 100-120. As can be seen from the plot of binding energy against A shown earlier, in such a
process the value of B/A increases. This means that energy is released in a fission event, around 200
MeV per reaction.
During fission two or three neutrons are emitted by each nucleus undergoing fission. Each neutron can
cause a further fission, thereby producing a chain reaction. This is the basis of nuclear reactors and
atomic bombs.
The fission process was first observed in 1939. The first nuclear reactor was built in 1942 by an inter-
national team led by Enrico Fermi (1901-1954) and the first atomic bombs were exploded in 1945.
Many of the fission fragments produced by a bomb are serious health hazards. For example Sr, which
is a emitter with a half life of 29 years, is chemically similar to calcium and can ultimately find its
way into bone, where it irradiates the bone marrow.
The large numbers of neutrons produced in a reactor can be used to produce nuclear transformations in
suitable elements, producing radioisotopes for therapy, tracing etc.
Fusion
If two light nuclei (e.g. H, D or He) fuse, much more energy is released per unit mass than in fission
(since the left-hand part of the B/A curve is steeper than the right-hand part).
For a fusion reaction to take place, the reacting particles must be given high energies in order to over-
come the electrostatic force of repulsion between the fusing nuclei. This translates into very high tem-
peratures, about 107 K for hydrogen.
Fusion processes of this kind occur naturally in stars (fusion is in fact the main source of energy in the
Universe) and in the hydrogen bomb, which was developed in 1952. So far, fusion reactors have not
been developed, mainly because of the difficulties in maintaining the required high temperatures for
sufficiently long that a chain reaction can occur.
MODERN PHYSICS
LECTURE EXAMPLES
University of the Witwatersrand, Johannesburg
School of Physics
Physics I D (PHYS1001/6)
1. The longest wavelength that will cause photoelectrons to be emitted from a sodium surface is
583 nm.
If the surface is illuminated with light of wavelength 450 nm what is the maximum speed of the
photoelectrons emitted? [4,71105 m.s]
13,60
2. The energy levels in hydrogen can be expressed in the form E eV .
n2
(i) To what level n will a hydrogen atom be excited after absorbing a photon of energy 12,0
eV, assuming that it is initially in its ground (lowest-energy) state? [3]
(ii) Calculate the energies and wavelengths of the photons which could be emitted as the
atom returns to its ground state.
[1,89 eV, 10,20 eV, 12,09 eV; 658 nm, 122 nm, 103 nm]
4. Calculate the binding energy per nucleon (in MeV/nucleon) of: (i) the 32 He atom, and (ii) the 42
He atom.
[2,57 MeV/nucleon; 7,07 MeV/nucleon]
37 Rb, a
5. The rubidium isotope 87 emitter with a half-life of 4,75 1010 years, is used to deter-
mine the age of rocks and fossils.
Rocks containing fossils of early animals are found to contain a ratio of 87 87
38 Sr to 37 Rb of 0,0160.
87
Assuming that there was no 38 Sr present when the rocks were formed, calculate the age of
these fossils. [1,11 Gyr]
6. Freshly-bottled wine contains radioactive tritium which decays giving a count rate of 10 min1
per kg of wine. Calculate the age of wine which gives a count rate of 8,3 min1.kg1.
The half-life of tritium is 12,3 years. [3,3 yr]
7. 238
U decays into an isotope of thorium (Th) by emitting an particle with energy 4,19 MeV
and a ray with energy 0,048 MeV.
(i) Write down the complete equation for the reaction.
(ii) Calculate the atomic mass of the thorium isotope, assuming it to be at rest after the reac-
tion. Atomic mass of 238 U = 238,0508 u. [234,0436 u]
8. Explain, using the data below, why 84 Be decays spontaneously into two particles but 16
8 O
does not decay spontaneously into four particles.
Mass of: particle = 4,00260 u, 48 Be = 8,00531 u, 168 O = 15,99491 u.
Calculate the expected energy production from the consumption of 1,00 g of lithium, assuming
100% efficiency in the process.
Mass of 6 Li = 6,0151 u. [3,591011 J]
Hint: first calculate the mass decrease in the consumption of one atom of lithium.
PHYS 1001/1006 TUTORIALS
YEAR 2018
4th BLOCK
TUTORIALS TO PREPARE
A tutorial test will then be given at the end of each session (approximately
10 minutes). Tutors are expected to give feedback on the tutorial test at the
beginning of the next session.
Question 1
The rate of emission of energy of a certain kind of glow-worm is 1 W.
If the average wavelength of the light radiated is 570 nm (i.e. in the visible spectrum), what is the rate
of emission of photons by the glow-worm?
[2.9 1012 s-1]
Question 2
Sodium has a work function of 2,3 eV.
(i) What is the maximum wavelength of light that will cause photoelectrons to be emitted from
sodium?
(ii) What will the maximum kinetic energy of the photoelectrons be if light of wavelength
200 nm falls on a sodium surface.
[ (i) 540 nm (ii) 6.26 1019 J = 3.92 eV]
Question 3
Which one of the following statements about the photoelectric effect is true?
(a) The electrons are always emitted with zero energy.
(b) For fixed frequency above the threshold, the electron current is independent of the intensity of
the incident light.
(c) For fixed frequency above the threshold, the maximum kinetic energy of the emitted electrons
depends on the intensity of the incident light.
(d) For any intensity of light, the maximum kinetic energy of the emitted electrons depends on the
frequency of the incident light.
(e) The cut-off frequency depends on the intensity of the incident light.
[d]
Question 4
The wavelength of the yellow sodium line is 589,6 nm.
What is the difference in energy between the two energy levels involved in the transition?
[3.37 1019 J = 2.11 eV]
2
Questions 5
What accelerating voltage applied to a beam of electrons will just cause them to excite the characteristic
radiation, with wavelength 0,063 nm, in a molybdenum target?
[19.7 kV]
Question 6
The binding energy per nucleon for the nucleus 73 Li is 5,606 MeV. If the masses of the neutron and
hydrogen atom are 1,00866 u and 1,00783 u respectively, what is the atomic mass of 73 Li ?
[7.01598 u]
Question 7
(i) Give the atomic numbers and mass numbers of Pb, At and Bi in this series.
(ii) What is the mode of decay for the stage At to Bi? Explain your reasoning.
[ (i) Pb: Z = 82, A = 211; At: Z = 85, A = 215; Bi: Z = 83, A =211 (ii) α ]
Question 8
The intensity of a given X-ray beam is reduced by a factor of 8 by 12 mm of aluminium.
(i) Calculate the linear absorption coefficient of aluminium for this X-ray beam.
(ii) What thickness of aluminium would be required to reduce the intensity to 1% of its initial
value?
[ (i) 173 m-1 , (ii) 26.6 mm]
Question 9
A small volume of solution containing a radioactive isotope, of half-life 15 hours, had an activity of
185 Bq when injected into the bloodstream of a patient. After 30 hours the activity of 1 ml of blood was
9,25 103 Bq.
What is the volume of blood in the patient?
[ 5 l]
3
Question 10
The equation 14
6 C 7x N y z represents the -decay reaction involved in radioactive carbon dating.
Question 11
The sun radiates energy at the rate 6,46 107 W per square meter of surface area.
(i) If the energy emitted by the sun has its origin in nuclear fusion processes, calculate the rate at
which the mass of the sun is decreasing. Diameter of sun 1,39 106 km.
(ii) If all the energy comes from reactions in which two deuterium atoms fuse to form one helium
atom, calculate the mass of helium produced per second.
[ (i) 4.36 109 kg.s-1 (ii) 6.81 1011 kg.s-1]