Physics 3 RD Ed
PHYSICS
THIRD EDITION
22/05/2009 11:45:35 AM
All rights reserved except under the conditions described in the Copyright Act 1968 of Australia and subsequent amendments. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior permission of the publishers.
While every care has been taken to trace and acknowledge copyright, the publishers tender their apologies for any accidental infringement where copyright has proved untraceable. They would be pleased to come to a suitable arrangement with the rightful owner in each case.
This material has been developed independently by the publisher and the content is in no way connected with nor endorsed by the International Baccalaureate Organization.
www.ibid.com.au
First published in 2007 by IBID Press, Victoria Reprinted January 2008 Corrections, revised and reprinted May 2009 All copyright statements, IBO 2007 refer to the Physics guide published by the International Baccalaureate Organization in 2007. IBID Press express their thanks to the International Baccalaureate Organization for permission to reproduce its intellectual property.
Library Catalogue: Kerr G & Ruth P. 1. Physics, 3rd Edition 2. International Baccalaureate. Series Title: International Baccalaureate in Detail
26/05/2009 4:21:27 PM
EDITOR'S NOTES
This project has involved teachers, authors, proofreaders, artists and many other people on several continents. It has been done within an extremely tight timeframe and involved thousands of emails across the world and many different software applications. We are pleased, and trust that you will also be pleased, with the final product, which went to press with no known errors. However, we know from experience that some typographic and other errors have escaped our proofing process and will emerge as students and teachers start using the books and CDs. If you wish, you can help us and yourself in the following ways:
• Send us an email at [email protected] with details of any errors that you notice.
• Please visit www.ibid.com.au for errata sheets, which will be produced promptly and be freely available as necessary.
• Check our website and other publicity regarding our Student Guides to Internal Assessment and Volumes of Investigations for the Core, HL and Options in Biology, Chemistry and Physics. These materials are currently in preparation and are due for publication later this year.
THE AUTHORS
GREGG KERR
Gregg has a Bachelor of Education degree from Charles Sturt University, Australia, and a Master of Science in Education from the State University of New York, USA. He has taught in Germany, Hong Kong, Australia, Brunei, Thailand and he is presently Student Welfare Coordinator and Head of Science at the Utahloy International School Guangzhou, China. Gregg has taught IB Physics since 1988. He has been a member of the Physics Subject Committee, and is presently a Senior Examiner and a Senior Moderator in Physics. He has been an IB Physics workshop leader at conferences in Tokyo, Mumbai, Brisbane, Adelaide, Invercargill, Sydney, Singapore and Chiang Mai. I would like to thank my Bhutanese friend Kinga Tshering for all the suggestions he made to make this edition better. I would especially like to thank Ofelia and Angelica for giving me the support at home while I tore my hair out in writing my contribution to this third edition.
PAUL RUTH
Paul Ruth taught IB and A-level Nuffield physics for many years as Head of Physics at Sevenoaks School, Kent, England. He became involved with the examining side of IB in 1985 when he was appointed as an Assistant Examiner. Since 1990 he has been a member of the Senior Examining Team responsible for setting both the May and November examination papers. He was a member of the Syllabus Review team for both the 1998-2003 syllabus and for the new syllabus. He is also a senior moderator for Internal Assessment and Extended Essays. With thanks to my IB friends and colleagues for the many lively discussions and also for keeping me on my toes.
ACKNOWLEDGEMENTS
We wish to acknowledge the advice and assistance of the following people in the development and production of these materials to support the teaching of IB Physics.
AUTHORS
Gregg Kerr Paul Ruth
PROOF READERS
Paul Hadfield, Dr Paula Mills, Hiwa Jaldiani, Neville Lawrence
LAYOUT
Colin Flashman
CONTENTS
Chapter 1
1.1 1.2 1.3
Chapter 2 MECHANICS
2.1 Kinematics
2.2 Forces and dynamics
2.3 Work, energy and power
2.4 Uniform circular motion
Chapter 3 THERMAL PHYSICS
3.1 Thermal concepts
3.2 Thermal properties of matter
Chapter 4
4.1 4.2 4.3 4.4 4.5
Chapter 5 ELECTRIC CURRENTS
5.1 Electric potential difference, current and resistance
5.2 Electric circuits
Chapter 6
6.1 6.2 6.3
Chapter 7
7.1 7.2 7.3
19 april 09 TOC.indd 5 22/05/2009 11:44:28 AM
Chapter 8
8.1 8.2 8.3 8.4 8.5 8.6
Chapter 9 MOTION IN FIELDS
9.1 Projectile motion
9.2 Gravitational field, potential and energy
9.3 Electric field, potential and energy
9.4 Orbital motion
Glossary Index
The order of magnitude of large or small numbers can be difficult to comprehend at this introductory stage of the course. For example, $10^{23}$ grains of rice would cover Brazil to a depth of about one kilometre.
Period of visible light: $10^{-15}$ s
Shortest lived subatomic particle: $10^{-23}$ s
Passage of light across the nucleus: $10^{-23}$ s
Mass of proton: $10^{-27}$ kg
Mass of neutron: $10^{-27}$ kg
Mass of electron: $10^{-30}$ kg
19 april 09 Physics Ch01.indd 1 22/05/2009 11:46:29 AM
CORE
CHAPTER 1
The order of magnitude of quantities in the macroscopic world is also important when expressing uncertainty in a measurement. This is covered in section 1.2 of this chapter.
Examples
1.
The number 8 is closer to $10^1$ (10) than $10^0$ (1). So the order of magnitude is $10^1$. Similarly, 10 000 has an order of magnitude of $10^4$. However, $4.3 \times 10^3$ has an order of magnitude of $10^4$. The reason for this is if you use the log button on your calculator, the value of $4.3 \times 10^3 = 10^{3.633}$.
Exercise
1.1 (a)
2.
The order of magnitude of 4 200 000 is:
A. $10^4$
B. $10^5$
C. $10^6$
D. $10^7$
Therefore the order of magnitude is $10^4$. So, the normal mathematical rounding up or down above or below 5 does not apply with order of magnitude values. In fact, $10^{0.5} = 3.16$. This becomes our rounding value in determining the order of magnitude of a quantity.
2.
Give the order of magnitude of the following quantities:
(a) 20 000
(b) $2.6 \times 10^4$
(c) $3.9 \times 10^7$
(d) $7.4 \times 10^{15}$
(e) $2.8 \times 10^{-24}$
(f) $4.2 \times 10^{-30}$
Order of magnitude, for all its uncertainty, is a good indicator of size. Let's look at two ways of calculating the order of magnitude of the number of heartbeats in a human lifetime. The average relaxed heart beats at 100 beats per minute. Do you agree? Try the following activity:
Using a timing device such as a wristwatch or a stopwatch, take your pulse for 60 seconds (1 minute). Repeat this 3 times. Find the average pulse rate.
Now, using your pulse, multiply your pulse per minute (say 100) × 60 minutes in an hour × 24 hours in a day × 365.25 days in a year × 78 years in a lifetime. Your answer is $4.102 \times 10^9$. Take the log of this answer, and you find that it equals $10^{9.613}$. The order of magnitude is $10^{10}$.
Now let us repeat this, but this time we will use the order of magnitude at each step:
$10^2\ \text{beats min}^{-1} \times 10^2\ \text{min h}^{-1} \times 10^1\ \text{h day}^{-1} \times 10^3\ \text{day yr}^{-1} \times 10^2\ \text{yr}$
The order of magnitude is $10^{10}$. Do the same calculations using your own pulse rate. Note that the two uncertain values here are pulse rate and lifespan. Therefore, you are only giving an estimate or indication. You are not giving an accurate value.
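The heartbeat estimate can be checked with a short Python sketch. The function name `order_of_magnitude` is an illustrative choice, not something from the text; it simply rounds the base-10 logarithm, which places the rounding boundary at $10^{0.5}$ (about 3.16) rather than at 5.

```python
import math

def order_of_magnitude(value):
    """Return the exponent of the power of ten nearest to value on a log scale."""
    # Rounding log10(value) puts the cut-off at 10**0.5 (about 3.16), not at 5.
    return round(math.log10(abs(value)))

# Heartbeats in a lifetime, using the figures quoted in the text.
beats = 100 * 60 * 24 * 365.25 * 78
print(f"{beats:.3e}")              # 4.102e+09
print(order_of_magnitude(beats))   # 10

# The same estimate using only the order of magnitude of each factor.
exponents = [order_of_magnitude(x) for x in (100, 60, 24, 365.25, 78)]
print(exponents, sum(exponents))   # [2, 2, 1, 3, 2] 10
```

Both routes give an order of magnitude of $10^{10}$, as in the worked example.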
Give the order of magnitude of the following measurements:
(a) The mean radius of the Earth, 6 370 000 m.
(b) The half-life of a radioactive isotope 0.0015 s.
(c) The mass of Jupiter 1 870 000 000 000 000 000 000 000 000 kg.
(d) The average distance of the moon from the Earth is 380 000 000 m.
(e) The wavelength of red light 0.000 000 7 m.
The ratio of the diameter of the nucleus to the diameter of the atom is approximately equal to:
A. $10^{-15}$
B. $10^{-8}$
C. $10^{-5}$
D. $10^{-2}$
5.
What is the order of magnitude of:
(a) the time in seconds in a year.
(b) the time for the moon to revolve around the earth in seconds.
This can be estimated as:
$(2 \times 10^1)(5 \times 10^3)(1 \times 10^1)(1 \times 10^1)(1 \times 10^1)(5 \times 10^3) = 5 \times 10^{11}$
The calculator answer is $7.7 \times 10^{11}$. So our estimate gives a reasonable order of magnitude.
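A minimal Python sketch of this kind of estimate follows. It assumes that the product being estimated is 16 × 5280 × 12 × 12 × 12 × 5280, which is consistent with the rounded factors and the calculator answer quoted above; the helper name `round_to_one_sf` is illustrative only.

```python
import math

def round_to_one_sf(x):
    """Round a positive number to one significant figure."""
    exponent = math.floor(math.log10(x))
    # Note: Python's round() rounds exact halves to the nearest even digit.
    return round(x / 10**exponent) * 10**exponent

# Assumed factors of the product being estimated.
factors = [16, 5280, 12, 12, 12, 5280]

estimate = math.prod(round_to_one_sf(f) for f in factors)
exact = math.prod(factors)

print(f"estimate = {estimate:.0e}, exact = {exact:.1e}")
```

The estimate and the exact product differ in their leading digit but share the same power of ten, which is all an order-of-magnitude estimate is expected to deliver.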
Exercise
1.1 (b)
1.
A rough estimate of the volume of your body in cm$^3$ would be closest to:
A. $2 \times 10^3$
B. $2 \times 10^5$
C. $5 \times 10^3$
D. $5 \times 10^5$
2.
Estimate the:
(a) dimensions of this textbook in cm
(b) mass of an apple in g
(c) period of a heartbeat in s
(d) temperature of a typical room in °C
3.
Estimate the answer to: (a) (b) (c) 16 5280 5280 5280 12 12 12 3728 (470165 10-14) 278146 (0.000713 10-5) 47816 (4293 10-4) 403000
4.
The universe is considered to have begun with the Big Bang event. The galaxies that have moved the farthest are those with the greatest initial speeds. It is believed that these speeds have been constant in time. If a galaxy $3 \times 10^{21}$ km away is receding from us at $1.5 \times 10^{11}$ km y$^{-1}$, calculate the age of the universe in years.
$16 \times 5280 \times 12 \times 12 \times 12 \times 5280$
5. Give an estimate of the order of magnitude of the following:
(a) The length of your arm in mm.
(b) The quantity of milk you drink in a year in cm$^3$.
(c) The mass of your backpack that contains your school materials in g.
(d) The diameter of a human hair in mm.
(e) The time you spend at school in a year in minutes.
(f) The number of people in the country where you live.
Some quantities cannot be measured in a simpler form, and others are chosen for convenience. They have been selected as the basic quantities and are termed fundamental quantities. Figure 102 lists the fundamental quantities of the SI system together with their respective SI unit and SI symbol.
Quantity                     SI unit     SI symbol
length                       metre       m
mass                         kilogram    kg
time                         second      s
electric current             ampere      A
thermodynamic temperature    kelvin      K
amount of substance          mole        mol
luminous intensity           candela     cd
Figure 102 Fundamental quantities
Scientists and engineers need to be able to make accurate measurements so that they can exchange information. To be useful, a standard of measurement must be:
1. Invariant in time. For example, a standard of length that keeps changing would be useless.
2. Readily accessible so that it can be easily compared.
3. Reproducible so that people all over the world can check their instruments.
The standard metre, in 1960, was defined as the length equal to 1 650 763.73 wavelengths of a particular orange-red line of krypton-86 undergoing electrical discharge. Since 1983 the metre has been defined in terms of the speed of light. The current definition states that the metre is the length of path travelled by light in a vacuum during a time interval of $\frac{1}{299\,792\,458}$ second. The standard kilogram is the mass of a particular piece of platinum-iridium alloy that is kept in Sèvres, France. Copies of this prototype are sent periodically to Sèvres for adjustments. The standard second is the time for 9 192 631 770 vibrations of the cesium-133 atom. Standards are commonly based upon properties of atoms. It is for this reason that the standard kilogram could be replaced at some future date. When measuring lengths, we choose an instrument that is appropriate to the order of magnitude, the nature of the length, and the sensitivity required. For example, the orders of magnitude (the factor of 10) of the radius of a gold atom, a person's height and the radius of the solar system are $10^{-10}$, $10^0$ and $10^{12}$
respectively. The nature of a person's height is different from that of the radius of a gold atom in that the person's height is macroscopic (visible to the naked eye) and can be measured with, say, a metre stick, whereas the diameter of the atom is microscopic and can be inferred from electron diffraction.
The unit of electrical energy could be J or W h or kJ or kWh (kilowatt-hour). In atomic and nuclear physics the unit of energy could be J or eV (electronvolt) where 1 eV = $1.6 \times 10^{-19}$ J.
for units is also preferred in the SI system multiple or submultiple units for large or small quantities respectively. The prefix is combined with the unit name. The main prefixes are related to the SI units by powers of three. However, some other multiples are used.
3. Which one of the following is a fundamental unit?
A. Kelvin
B. Ohm
C. Volt
D. Newton
1 000 000 000 m = 1 Gm
1 000 000 dm$^3$ = 1 Mdm$^3$
0.000 000 001 s = 1 ns
0.000 001 m = 1 µm
4. Which of the following is measured in fundamental units?
A. velocity
B. electric charge
C. electric current
D. force
The main prefixes and other prefixes are shown in Figure 104.
Multiple     Prefix   Symbol      Multiple      Prefix   Symbol
$10^{24}$    yotta    Y           $10^{-1}$     deci     d
$10^{21}$    zetta    Z           $10^{-2}$     centi    c
$10^{18}$    exa      E           $10^{-3}$     milli    m
$10^{15}$    peta     P           $10^{-6}$     micro    µ
$10^{12}$    tera     T           $10^{-9}$     nano     n
$10^{9}$     giga     G           $10^{-12}$    pico     p
$10^{6}$     mega     M           $10^{-15}$    femto    f
$10^{3}$     kilo     k           $10^{-18}$    atto     a
$10^{2}$     hecto    h           $10^{-21}$    zepto    z
$10^{1}$     deca     da          $10^{-24}$    yocto    y
Figure 104 Preferred and some common prefixes
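As a rough illustration of how the prefixes are applied, the sketch below stores the exponents from Figure 104 in a dictionary and converts a prefixed value to the unprefixed unit. The names `SI_PREFIXES` and `to_base_unit` are hypothetical and chosen only for this example.

```python
# Power-of-ten exponent for each SI prefix symbol (values from Figure 104).
SI_PREFIXES = {
    "Y": 24, "Z": 21, "E": 18, "P": 15, "T": 12, "G": 9, "M": 6,
    "k": 3, "h": 2, "da": 1,
    "d": -1, "c": -2, "m": -3, "µ": -6, "n": -9, "p": -12,
    "f": -15, "a": -18, "z": -21, "y": -24,
}

def to_base_unit(value, prefix):
    """Convert a value given with an SI prefix to the unprefixed unit."""
    return value * 10.0 ** SI_PREFIXES[prefix]

print(to_base_unit(1, "G"))     # 1 Gm expressed in m
print(to_base_unit(6.3, "n"))   # 6.3 nm expressed in m
print(to_base_unit(2.24, "M"))  # 2.24 MJ expressed in J
```

This works directly for simple units. For a prefixed unit raised to a power (for example mm$^2$), the prefix factor must be raised to the same power, which this sketch does not attempt.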
5.
The density in g cm$^{-3}$ of a sphere with a radius of 3 cm and a mass of 0.54 kg is:
A. 2 g cm$^{-3}$
B. $2.0 \times 10$ g cm$^{-3}$
C. 0.50 g cm$^{-3}$
D. 5.0 g cm$^{-3}$
6.
Convert the following to fundamental S.I. units:
(a) 5.6 g
(b) 3.5 A
(c) 3.2 dm
(d) 6.3 nm
(e) 2.25 tonnes
(f) 440 Hz
7.
Convert the following to S.I. units:
(a) 2.24 MJ
(b) 2.50 kPa
(c) 2.7 km h$^{-1}$
(d) 2.5 mm$^2$
(e) 2.4 L
(f) 3.6 cm$^3$
(g) 230.1 M dm$^3$
(h) 3.62 mm$^3$
Exercise
1.2 (a)
8. Estimate the order of magnitude for the following:
(a) your height in metres
(b) the mass of a 250 tonne aeroplane in kilograms
(c) the diameter of a hair in metres
(d) human life span in seconds.
1.
Which of the following isotopes is associated with the standard measurement of time?
A. uranium-235
B. krypton-86
C. cesium-133
D. carbon-12
2.
Which one of the following lists a fundamental unit followed by a derived unit?
A. ampere, mole
B. coulomb, watt
C. ampere, joule
D. second, kilogram
9. Calculate the distance in metres travelled by a parachute moving at a constant speed of 6 km h$^{-1}$ in 4 min.
10. The force of attraction F in newtons between the earth with mass M and the moon with mass m separated by a distance r in metres from their centres of mass is given by the following equation: $F = G M m r^{-2}$
Many ammeters and voltmeters have a means of adjustment to remove zero offset error. When you click a stop-watch, your reaction time for clicking at the start and the finish of the measurement interval is a systematic error. The timing instrument and you are part of the system. Systematic errors can, on most occasions, be eliminated or corrected before the investigation is carried out.
Random uncertainties are due to variations in the performance of the instrument and the operator. Even when systematic errors have been allowed for, there exists error. Random uncertainties can be caused by such things as:
• vibrations and air convection currents in mass readings.
• temperature variations.
• misreadings.
• variations in the thickness of a surface being measured (thickness of a wire).
• not collecting enough data.
• using a less sensitive instrument when a more sensitive instrument is available.
• human parallax error (one has to view the scale of the meter in direct line, and not to the sides of the scale in order to minimise parallax error).
are definite sources of error but they are not considered as experimental errors.
error in the measurement. An accurate experiment has a low systematic error. Precision is an indication of the agreement among a number of measurements made in the same way indicated by the absolute error. A precise experiment has a low random error. However, you should be aware that repeating measurements may reduce the random uncertainty but at the same time the systematic error will not be reduced.
Suppose a technician was fine-tuning a computer monitor by aiming an electron gun at a pixel in the screen as shown in Figure 105.
Figure 105 (screen and target pixel; cases 1 to 4)
In case 1 there is low accuracy and precision. The technician needs to adjust the collimator to reduce the scattering of electrons, and to change the magnetic field so the electrons hit the pixel target. In case 2, the electron gun has been adjusted to increase precision but the magnetic field still needs adjustment. In case 3, both adjustments have been made. Can you give an explanation for case four?
4.
5.
A student measures the current in a resistor as 655 mA for a potential difference of 2.0 V. A calculator shows the resistance of the resistor to be 1.310 Ω. Which one of the following gives the resistance to an appropriate number of significant figures?
A. 1.3 Ω
B. 1.31 Ω
C. 1.310 Ω
D. 1 Ω
How many significant figures are indicated by each of the following:
(a) 1247
(b) 1007
(c) 0.034
(d) $1.20 \times 10^7$
(e) 62.0
(f) 0.0025
(g) 0.00250
(h) sin 45.2°
(i) $\tan^{-1} 0.24$
(j) $3.2 \times 10^{-16}$
(k) 0.0300
(l) $1.0 \times 10^1$
5.
Express the following in standard notation (scientific notation):
(a) 1250
(b) 30007
(c) 25.10
(d) an area of 4 km$^2$ in m$^2$
(e) an object of 12.0 nm$^2$ in m$^2$
Exercise
1.2 (b)
1.
Consider the following measured quantities:
(a) 3.00 ± 0.05 m
(b) 12.0 ± 0.3 m
Which alternative is the best when the accuracy and precision for a and b are compared?
      a                 b
A.    Low accuracy      Low precision
B.    Low accuracy      High precision
C.    High accuracy     Low precision
D.    High accuracy     High precision
6. Calculate the area of a square with a side of 3.2 m.
7. Add the following lengths of 2.35 cm, 7.62 m and 14.2 m.
8. Calculate the volume of a rectangular block 1.52 cm by 103.4 cm by 3.1 cm.
9. A metal block has a mass of 2.0 g and a volume of 0.01 cm$^3$. Calculate the density of the metal in g cm$^{-3}$.
10. Round off the following to three significant figures:
(a) 7.1249
(b) 2561
(c) 2001
(d) 21256
(e) 6.5647
The smallest uncertainty possible with any measuring device is half the limit of reading. However, most investigations generate an uncertainty greater than this. Figure 109 lists the uncertainty of some common laboratory equipment.
Metre rule                          ± 0.05 cm
Vernier calipers                    ± 0.005 cm
Micrometer screw gauge              ± 0.005 mm
50 cm$^3$ measuring cylinder        ± 0.3 cm$^3$
10 cm$^3$ measuring cylinder        ± 0.1 cm$^3$
Electric balance                    ± 0.005 g
Watch second hand                   ± 0.5 s
Digital timer                       ± 0.0005 s
Spring balance (0–20 N)             ± 0.1 N
Resistor                            ± 2%
11.
Determine the following to the correct number of significant figures: (a) (b) (3.74 1.3) 2.12 17.65 (2.9 + 3.2 + 7.1) 0.134
12.
Figure 109 Equipment uncertainties
Absolute uncertainty is the size of an error and its units. In most cases it is not the same as the maximum degree of uncertainty (as in the previous example) because it can be larger than half the limit of reading. The experimenter can determine the absolute error to be different to half the limit of reading provided some justification can be given. For example, mercury and alcohol thermometers are quite often not as accurate as the maximum absolute uncertainty.
Fractional (relative) uncertainty equals the absolute uncertainty divided by the measurement as follows. It has no units.
$\text{Relative uncertainty} = \frac{\text{absolute uncertainty}}{\text{measurement}}$
Percentage uncertainty is the relative uncertainty multiplied by 100 to produce a percentage as follows:
$\text{Percentage uncertainty} = \text{relative uncertainty} \times 100\%$
For example, if a measurement is written as 9.8 ± 0.2 m, then there is a
Figure 108 Linear measurement (scale marked from 0.1 cm to 0.5 cm)
The limit of reading is 0.05 cm and the uncertainty of the measurement is ± 0.025 cm. The length is stated as 0.47 ± 0.02 cm. (Uncertainties are given to 1 significant figure).
limit of reading = 0.1 m
uncertainty = 0.05 m
absolute uncertainty = 0.2 m
relative uncertainty = 0.2 ÷ 9.8 = 0.02
and percentage uncertainty = 0.02 × 100% = 2%
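The same arithmetic can be written as a small Python sketch. The function names are illustrative only, and the measurement 9.8 ± 0.2 m is the one used in the example above.

```python
def relative_uncertainty(value, absolute):
    """Absolute uncertainty divided by the measurement (no units)."""
    return absolute / abs(value)

def percentage_uncertainty(value, absolute):
    """Relative uncertainty expressed as a percentage."""
    return relative_uncertainty(value, absolute) * 100

rel = relative_uncertainty(9.8, 0.2)
pct = percentage_uncertainty(9.8, 0.2)
print(f"relative uncertainty = {rel:.2f}")     # relative uncertainty = 0.02
print(f"percentage uncertainty = {pct:.0f}%")  # percentage uncertainty = 2%
```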
Percentage uncertainty should not be confused with percentage discrepancy or percentage difference which is an indication of how much your experimental answer varies from the known accepted value of a quantity.
1.2.11
UNCERTAINTIES IN RESULTS
DETERMINATION
Example
Figure 110
Sample measurements
Solution
The sum of the readings = 46.602 and so the mean of the readings is 5.825. Then, the value for the thickness is 5.825 ± 0.007 mm.
This method can be used to suggest an appropriate uncertainty range for trigonometric functions. Alternatively, the mean, maximum and minimum values can be calculated to suggest an appropriate uncertainty range. For example, if an angle is measured as 30° ± 2°, then the mean value of sin 30° = 0.5, the maximum value is sin 32° = 0.53 and the minimum value is sin 28° = 0.47. The answer with correct uncertainty range is 0.50 ± 0.03.
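The maximum and minimum method for a trigonometric function can be sketched in Python as follows. The function name is hypothetical, and the sketch assumes the sine is either rising or falling throughout the angle range (it would need more care if the range straddled 90°).

```python
import math

def sine_with_uncertainty(angle_deg, delta_deg):
    """Return sin(angle) and half the spread between its extreme values."""
    best = math.sin(math.radians(angle_deg))
    high = math.sin(math.radians(angle_deg + delta_deg))
    low = math.sin(math.radians(angle_deg - delta_deg))
    # Assumes sine is monotonic between the two extreme angles.
    return best, abs(high - low) / 2

value, uncertainty = sine_with_uncertainty(30, 2)
print(f"sin(30° ± 2°) = {value:.2f} ± {uncertainty:.2f}")
```

For 30° ± 2° this reproduces the range quoted in the text, with sin 32° and sin 28° as the extreme values.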
First, we determine the product: 2.6 cm × 2.8 cm = 7.28 cm$^2$
Relative error 1 = 0.5 ÷ 2.6 = 0.192
Relative error 2 = 0.5 ÷ 2.8 = 0.179
Sum of the relative errors = 0.371 or 37.1%
Absolute error = 0.371 × 7.28 cm$^2$ or 37.1% × 7.28 cm$^2$ = 2.70 cm$^2$
Errors are expressed to one significant figure = 3 cm$^2$
The product is equal to 7.3 ± 3 cm$^2$
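The rule used here, adding the relative errors when two measurements are multiplied, can be sketched in Python. The function name is illustrative, and the two lengths are taken to be 2.6 ± 0.5 cm and 2.8 ± 0.5 cm, as implied by the relative errors in the worked solution.

```python
def multiply_with_uncertainty(a, da, b, db):
    """Multiply a ± da by b ± db, adding the relative uncertainties."""
    product = a * b
    relative = da / abs(a) + db / abs(b)
    return product, relative * abs(product)

area, d_area = multiply_with_uncertainty(2.6, 0.5, 2.8, 0.5)
print(f"area = {area:.2f} cm^2, absolute error = {d_area:.2f} cm^2")
```

The absolute error is then quoted to one significant figure, as in the worked solution.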
Exercise
1.2 (c)
1.
A student measures the mass m of a ball. The percentage uncertainty in the measurement of the mass is 5%. The student drops the ball from a height h of 100 m. The percentage uncertainty in the measurement of the height is 7%. The calculated value of the gravitational potential energy of the ball will have an uncertainty of: (use $E_p = mgh$)
A. 2%
B. 5%
C. 7%
D. 12%
2.
Figure 111 A micrometer screw gauge
In Figure 111, the reading on the micrometer screw gauge is 3.45 mm. You can see that the thimble (on the right of the gauge) is to the right of the 3 mm mark but you cannot see the 3.5 mm mark on the main scale. The vernier thimble scale is close to the 45 mark.
The electrical power dissipation P in a resistor of resistance R when a current I is flowing through it is given by the expression: $P = I^2 R$. In an investigation, I was determined from measurements of P and R. The uncertainties in P and in R are as shown below.
P: ± 4%
R: ± 10%
The mass of the Earth is stated as $5.98 \times 10^{24}$ kg. The absolute uncertainty is:
A. 0.005
B. 0.005 kg
C. $0.005 \times 10^{24}$ kg
D. $0.005 \times 10^{24}$
Figure 112
Vernier calipers
In Figure 112, the reading on the vernier calipers is 1.95 cm. The vertical line showing zero on the vernier scale lies between 1.9 cm and 2.0 cm. The vertical graduation on the vernier scale that matches the main scale best is the fifth graduation.
4.
12.
The energy E of an α particle is 4.20 ± 0.03 MeV. How should the value and uncertainty of $E^{-\frac{1}{2}}$ be stated?
Suggest an appropriate answer with uncertainty range for sin θ if θ = 60° ± 5°.
6.
13.
UNCERTAINTIES IN GRAPHS
1.2.12 Identify uncertainties as error bars in graphs.
1.2.13 State random uncertainty as an uncertainty range (±) and represent it graphically as an “error bar”.
1.2.14 Determine the uncertainties in the slope and intercepts of a straight-line graph.
IBO 2007
(a) What is the reading and error on the micrometer?
(b) The thickness of the wire being measured varies over its length. What sort of error would this be?
7.
A student records the following currents in amperes A when the potential difference V across a resistor is 12 V:
0.9 A, 0.8 A, 0.9 A, 0.7 A, 0.85 A, 0.8 A, 1.2 A, 0.8 A, 0.95 A, 0.75 A
(a) Would you disregard any of the readings? Justify your answer.
(b) Calculate the current and its uncertainty.
8. A spring balance reads 0.5 N when it is not being used. If the needle reads 9.5 N when masses are attached to it, then what would be the correct reading to record (with uncertainty)?
9. Five measurements of the length of a piece of string were recorded in metres as: 1.48, 1.46, 1.47, 1.50, 1.45. Record a feasible length of the string with its uncertainty.
10. A metal cube has a side length of 3.00 ± 0.01 cm. Calculate the volume of the cube.
11. An iron cube has sides 10.3 ± 0.2 cm, and a mass of 1.3 ± 0.2 g. What is the density of the cube in g cm$^{-3}$?
Figure 115 Extension of a spring
When a graph of force versus extension is plotted, the line of best fit does not pass through every point. An error bar can be used to give an indication of the uncertainty range for each point as shown in Figure 116. In the vertical direction, we draw a line up and down for each point to show the uncertainty range of the force value. Then we place a small horizontal marker line on the extreme uncertainty boundary for the point.
11.
In the horizontal direction, we draw a line left and right for each point to show the uncertainty range of the extension value. Then we place a small vertical marker line on the extreme uncertainty boundary for the point.
Figure 116 Error bars (± 5 N in the force direction, ± 0.2 cm in the extension direction)
(b) When all the points in Figure 115 are plotted on a graph, then the line of best fit with the appropriate error bars is shown in Figure 117. You can see that the line of best fit lies within the error bar uncertainty range. The line of best fit is interpolated between the plotted points. The line of best fit is extrapolated outside the plotted points.
(c)
(d)
Some of these features are shown in the graphs in Figure 118. Notice how two variables can be drawn on the same axis as in Figure 118 (b).
Figure 117 (force / N against extension, with the interpolated line of best fit)
Examples of graphs: (a) velocity / m s$^{-1}$ against time / s; (b) power / W and temperature / °C against time / s
1.
Choice of axes
Error bars will not be expected for trigonometric or logarithmic functions in this course.
A variable is a quantity that varies when another quantity is changed. A variable can be an independent variable, a dependent variable or a controlled variable. During an experiment, an independent variable is altered while the dependent variable is measured. Controlled variables are the other variables that may be present but are kept constant. For example, when measuring the extension of a spring when different masses are added to it, the weight force is altered and the extension from the spring's original length is measured. The force would be the independent variable plotted on the x-axis and the extension would be the dependent variable plotted on the y-axis. (The extension depends on the mass added). Possible controlled variables
2.
Scales
In order to convey the desired information, the size of the graph should be large, and this usually means making the graph fill as much of the graph paper as possible. Choose a convenient scale that is easily subdivided.
Each axis is labelled with the name and/or symbols of the quantity plotted and the relevant unit used. For example, you could write current/A or current (A). The graph can also be given a descriptive title such as graph showing the relationship between the pressure of a gas and its volume at constant temperature.
4.
Figure 119 Use of axes
It is not always clear which variable is the dependent and which is the independent. When time is involved it is the independent variable. In many electrodynamic and electromagnetic experiments the potential difference (voltage) or the current can be varied to see what happens to the other variable either could be the independent variable. Most experimental results will be plotted in the first quadrant. However, they can extend over the four quadrants as is the case with aspects of simple harmonic motion and waves, alternating current and the cathode ray oscilloscope to name a few. When you are asked to plot a graph of displacement against time or to plot a graph of force versus time, the variable first mentioned is plotted on the y-axis. Therefore displacement and force would be plotted on the y-axis in the two given examples. These days, graphs are quickly generated with graphic calculators and computer software. This is fine for quickly viewing the relationship being investigated. However, the graph is usually small and does not contain all the information that is required, such as error bars. Generally, a graph should be plotted on a piece of 1 or 2 mm graph paper and the scale chosen should use the majority of the graph paper. In the beginning of the course, it is good practice to plot some graphs manually. As the course progresses, software packages that allow for good graphing should be explored.
Points are plotted with a fine pencil cross or as a circled dot. In many cases, error bars are required. These are short lines drawn from the plotted points parallel to the axes indicating the absolute error of the measurement. Of course, you are strongly recommended to use a graphing software package. A typical graph is shown in Figure 120.
Figure 120 (potential difference / V against current / A)
5.
Lines of best fit
The line or curve of best fit is called the line of best fit. When choosing the line or curve of best fit it is practical to use a transparent ruler. Position the ruler until it lies along the ideal line. Shapes and curves can be purchased to help you draw curves. The line or curve does not have to pass through every point. Do not assume that the line should pass through the origin as lines with an x-intercept or y-intercept are common.
3.
Labels
Two equations that you will become familiar with in Chapter 2 are:
work (J) = force (N) × displacement (m)
distance (m) = speed (m s$^{-1}$) × time (s)
In these examples, the area under the straight line (Figure 122(a)) will give the values for the work done (5 N × 2 m = 10 J).
In Figure 122(b), the area enclosed by the triangle will give the distance travelled in the first eight seconds (i.e., ½ × 8 s × 10 m s$^{-1}$ = 40 m).
Figure 121 (potential difference / V against current / A, with the rise and run of the line marked)
Normally, the line of best fit should lie within the error range of the plotted points as shown in Figure 118. The uncertainty in the slope and intercepts can be obtained by drawing the maximum and minimum lines passing through the error bars. The line of best fit should lie in between these two lines. The uncertainty in the y-intercept can be determined as being the difference in potential difference between the best fit line and the maximum/minimum lines. The uncertainty in the slope can be obtained using the same procedure. However, do not forget that you are dividing. You will therefore have to add the percentage errors to find the final uncertainty. In the graph, the top plotted point appears to be a data point that could be discarded as a mistake or a random uncertainty.
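One common way to turn the maximum and minimum lines into a number is sketched below in Python. The end points and the sizes of the error bars are hypothetical values chosen only for illustration, and taking half the difference between the steepest and shallowest gradients is one convention among several.

```python
def slope(p, q):
    """Gradient of the straight line through points p and q."""
    return (q[1] - p[1]) / (q[0] - p[0])

# Hypothetical first and last data points: (current / A, potential difference / V),
# with assumed error bars of ±0.05 A and ±0.2 V on every point.
first, last = (1.1, 7.5), (2.3, 10.0)
dx, dy = 0.05, 0.2

best = slope(first, last)

# Steepest line: bottom-right corner of the first error box
# to the top-left corner of the last error box.
steepest = slope((first[0] + dx, first[1] - dy), (last[0] - dx, last[1] + dy))

# Shallowest line: top-left corner of the first error box
# to the bottom-right corner of the last error box.
shallowest = slope((first[0] - dx, first[1] + dy), (last[0] + dx, last[1] - dy))

uncertainty = (steepest - shallowest) / 2
print(f"slope = {best:.1f} ± {uncertainty:.1f} V/A")
```

The same three lines can be extended back to the y-axis to estimate the uncertainty in the intercept.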
Figure 123 A straight line graph
y = mx + c where y is the dependent variable, x is the independent variable, m is the slope or gradient given by
Figure 122 (a) force / N against displacement / m; (b) speed against time / s
STANDARD GRAPHS
1. Linear
$m = \frac{\text{rise}}{\text{run}}$
$k = \frac{\text{rise}}{\text{run}}$
2. Parabola
The parabola shows that y is directly proportional to $x^2$. That is, $y \propto x^2$ or $y = kx^2$ where k is the constant of proportionality.
Graphs: y against x (parabola) and y against $x^2$ (straight line)
$m = 2.08\ \mathrm{V\,A^{-1}}$
$V = \varepsilon - Ir$ or $V = -Ir + \varepsilon$. Because V and I are variables, then $m = -r$ and $b = \varepsilon$.

In the equation $s = ut + \tfrac{1}{2}at^2$, where
s = displacement in m
u = initial velocity in m s⁻¹
a = acceleration in m s⁻²
t = time in s
then, for a body starting from rest (u = 0), $s \propto t^2$ with $k = \tfrac{1}{2}a$ in m s⁻².

If $T = 2\pi\sqrt{l/g}$, where T and l are the variables and 2π and g are constants, then T plotted against l will not give a straight-line relationship. But if T is plotted against $\sqrt{l}$, or T² against l, the plot will yield a straight line. These graphs are shown below.

(i) T vs l
(ii) T vs $\sqrt{l}$
(iii) T² vs l
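The linearisation of the pendulum equation can be checked numerically. The sketch below is illustrative only: the lengths and the value of g used to generate the periods are assumptions, not data from the text. It shows that the gradient of T against l changes along the curve, while the gradient of T² against l is constant and equal to 4π²/g.

```python
import math

G_ASSUMED = 9.81  # m s^-2, assumed value used only to generate illustrative data


def period(length_m, g=G_ASSUMED):
    """Period of a simple pendulum, T = 2*pi*sqrt(l/g)."""
    return 2 * math.pi * math.sqrt(length_m / g)


def gradient(xs, ys):
    """Rise over run between the first and last points of the lists."""
    return (ys[-1] - ys[0]) / (xs[-1] - xs[0])


lengths = [0.2, 0.4, 0.6, 0.8, 1.0]           # hypothetical lengths in m
periods = [period(l) for l in lengths]         # T in s
periods_squared = [t ** 2 for t in periods]    # T^2 in s^2

# T against l: the gradient differs between the start and end of the curve
early_slope = gradient(lengths[:2], periods[:2])
late_slope = gradient(lengths[-2:], periods[-2:])

# T^2 against l: a straight line through the origin with gradient 4*pi^2/g
line_slope = gradient(lengths, periods_squared)
g_from_graph = 4 * math.pi ** 2 / line_slope

print(f"T vs l gradients: {early_slope:.2f} then {late_slope:.2f} (not constant)")
print(f"T^2 vs l gradient: {line_slope:.3f} s^2 m^-1, giving g = {g_from_graph:.2f} m s^-2")
```

The same approach applies to any relationship of the form y = kxⁿ: choose the quantity to plot so that the graph becomes a straight line, then read the constant from the gradient.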
3. Hyperbola

The hyperbola shows that y is inversely proportional to x or y is directly proportional to the reciprocal of x,

i.e., $y \propto \dfrac{1}{x}$ or $xy = k$

Figure 125
[Graph: y against 1/x for an inverse proportionality]

[Sinusoidal graph: amplitude / m against length / m, showing the wavelength λ and the amplitude A]
An example of an inverse proportionality is found in relating pressure, P, and volume, V, of a fixed mass of gas at constant temperature:

$P \propto \dfrac{1}{V}$, so $P = \dfrac{k}{V}$ or $PV = k$ (= constant)
[Sinusoidal graph: amplitude / m against time / s, from 0 to 0.5 s, showing the amplitude A]
The equations for these graphs will be explored in Chapter 4 when you will study oscillations and simple harmonic motion.
An inverse square law graph is also a hyperbola. The force F between electric charges at different distances d is given by:

$F = \dfrac{kq_1q_2}{d^2}$

A graph of F versus d has a hyperbolic shape, and a graph of F versus $\dfrac{1}{d^2}$ is a straight line.
4. Sinusoidal
A sinusoidal graph is a graph that has the shape of a sine curve. It can be expressed using degrees or radians. The wavelength λ is the length of each complete wave in metres and the amplitude A is the maximum displacement from the x-axis. In the top sinusoidal graph the wavelength is equal to 4 m and the amplitude is equal to 2 m. The frequency f of each wave is the number of waves occurring in a second, measured in hertz (Hz) or s⁻¹. The period T is the time for one complete wave. In the bottom sinusoidal wave, the frequency is 5 Hz, and the period is 0.2 s.
Therefore when lnN is plotted against time the slope of the straight line produced is equal to k.
Figure 131 Graphs of N against time / s (starting at N0) and ln N against time / s (starting at ln N0, with slope = k)
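The effect of taking logarithms can be sketched numerically. The defining equation does not appear in this part of the text, so the example assumes an exponential decay of the form N = N0 e^(−kt); under that assumption ln N = ln N0 − kt, and the straight line has a gradient of magnitude k (negative for a decay) and a y-intercept of ln N0. The values of N0 and k are hypothetical.

```python
import math

N0 = 1000.0  # hypothetical initial value
K = 0.25     # hypothetical constant in s^-1

times = [0, 2, 4, 6, 8]                                # time in s
values = [N0 * math.exp(-K * t) for t in times]        # assumed decay N = N0 e^(-kt)
ln_values = [math.log(n) for n in values]              # ln N for each reading

# Gradient (rise / run) and intercept of the ln N against time graph
slope = (ln_values[-1] - ln_values[0]) / (times[-1] - times[0])
intercept = ln_values[0]

print(f"slope = {slope:.3f} s^-1, so k = {-slope:.3f} s^-1")
print(f"intercept = {intercept:.3f}, so N0 = {math.exp(intercept):.1f}")
```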
Logarithmic Graphs (AHL)
Exercise 1.2 (d)

1. It can be shown that the pressure of a fixed mass of gas at constant temperature is inversely proportional to the volume of the gas. If a graph of pressure versus volume was plotted, the shape of the graph would be:
A. a straight line.
B. a parabola.
C. an exponential graph.
D. a hyperbola.

2. Newton showed that a force of attraction F of two masses m and M separated by a distance d was given by $F \propto \dfrac{Mm}{d^2}$. If m and M are constant, a graph of F versus d² would have which shape?
A. a parabola
B. a straight line
C. a hyperbola
D. an exponential shape

3. The resistance of a coil of wire R increases as the temperature is increased. The resistance R at a temperature θ can be expressed as $R = R_0(1 + \alpha\theta)$ where α is the temperature coefficient of resistance. Given the following data, plot a graph that will allow you to determine R0 and α.

R / Ω     23.8   25.3   26.5   28.1   29.7   31.7
θ / °C    15     30     45     60     80     100

5. It can be shown that $V = \dfrac{RE}{R + r}$ where E and r are constants. In order to obtain a straight line graph, one would plot a graph of
A. $\dfrac{1}{V}$ against R
B. V against R
C. $\dfrac{1}{V}$ against $\dfrac{1}{R}$
D. V against $\dfrac{1}{R}$

6. The magnetic force F between 2 magnets and their distance of separation d are related by the equation $F = kd^n$ where n and k are constants.
(a) What graph would you plot to determine the values of the two constants?
(b) From the graph how could you determine n and k?

7. The intensity I of a laser beam passing through a cancer growth decreases exponentially with the thickness x of the cancer tissue according to the equation $I = I_0 e^{-\mu x}$, where I0 is the intensity before absorption and μ is a constant for cancer tissue. What graph would you draw to determine the values of I0 and μ?
Example

The schematic diagram in Figure 134 demonstrates an experiment to determine Planck's constant. The wavelength (λ) of light from the light source incident on a metal photoemissive plate of a photoelectric cell is varied, and the stopping voltage Vs applied across the photoelectric cell is measured.

Figure 134 Determining Planck's constant (light source, vacuum tube, ammeter A, voltmeter V, variable source of voltage)

The following values were obtained for different light radiation colours.

Light radiation colour    Stopping voltage Vs / V (± 0.05 V)    λ / × 10⁻⁷ m (± 0.3 × 10⁻⁷ m)
Red                       1.20                                  6.1
Orange                    1.40                                  5.5
Yellow                    1.55                                  5.2
Green                     1.88                                  4.6
Blue                      2.15                                  4.2
Violet                    2.50                                  3.8

Figure 135 Data for Planck's constant

It can be shown that for this experiment:

$\dfrac{hc}{\lambda} = hf = \Phi + eV_s$

where h is Planck's constant, c is the speed of light (3 × 10⁸ m s⁻¹), λ is the wavelength in m, f is the frequency in Hz, Φ is the work function and e is the charge on an electron (1.6 × 10⁻¹⁹ C).

(a) Copy Figure 135, add 2 more columns and complete the frequency and the uncertainties columns for each colour of light radiation in the table.

Because the wavelength is given to two significant figures, the frequency can only be given to two significant figures. For division, to find the frequency from $f = c/\lambda$, the relative uncertainty in the frequency has to be calculated for each wavelength. For example, for dark red:

the frequency = 3 × 10⁸ ÷ 6.1 × 10⁻⁷ = 4.9 × 10¹⁴ Hz
the relative uncertainty = 0.3 × 10⁻⁷ ÷ 6.1 × 10⁻⁷ = 0.0492
the absolute uncertainty = 0.0492 × 4.9 × 10¹⁴ = 0.2 × 10¹⁴ Hz

In this case, the absolute uncertainty is not half the limit of reading, as the absolute uncertainty of the wavelength was given as ± 0.3 × 10⁻⁷ m. Remember that the minimum possible absolute uncertainty is half the limit of reading, which would be ± 0.05 × 10⁻⁷ m.

Light radiation colour    Vs / V (± 0.05 V)    λ / × 10⁻⁷ m (± 0.3 × 10⁻⁷ m)    Frequency / × 10¹⁴ Hz    Uncertainty / ± × 10¹⁴ Hz
Red                       1.20                 6.1                               4.9                      0.2
Orange                    1.40                 5.5                               5.5                      0.3
Yellow                    1.55                 5.2                               5.8                      0.3
Green                     1.88                 4.6                               6.5                      0.4
Blue                      2.15                 4.2                               7.1                      0.5
Violet                    2.50                 3.8                               7.9                      0.6

Figure 136 Data showing uncertainties

(b) Plot a fully labelled graph with stopping voltage on the vertical axis against the frequency on the horizontal axis. Allow for a possible negative y-intercept.

Now put in the error bars for each point and label the axes. There will be a negative y-intercept. Mark in the gradient and the y-intercept. The required graph is shown in Figure 137. Note the maximum and minimum lines and the line of best fit, the gradient of the straight line of best fit and the value of the negative y-intercept.
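The frequency and uncertainty columns asked for in part (a) can be generated with a short calculation. This is a minimal sketch, assuming f = c/λ with c = 3 × 10⁸ m s⁻¹ and that the relative uncertainty in the frequency equals the relative uncertainty in the wavelength; the function name is illustrative only.

```python
C = 3.0e8               # speed of light in m s^-1, as given in the example
DELTA_LAMBDA = 0.3e-7   # absolute uncertainty in each wavelength, in m

# Wavelengths from the data table, in m
wavelengths = {
    "Red": 6.1e-7,
    "Orange": 5.5e-7,
    "Yellow": 5.2e-7,
    "Green": 4.6e-7,
    "Blue": 4.2e-7,
    "Violet": 3.8e-7,
}


def frequency_with_uncertainty(wavelength_m):
    """Return (f, absolute uncertainty in f) for f = c / wavelength."""
    f = C / wavelength_m
    relative_uncertainty = DELTA_LAMBDA / wavelength_m  # same for f as for wavelength
    return f, relative_uncertainty * f


for colour, lam in wavelengths.items():
    f, df = frequency_with_uncertainty(lam)
    print(f"{colour:7s} f = {f / 1e14:.1f} x 10^14 Hz  +/- {df / 1e14:.1f} x 10^14 Hz")
```

Because the absolute uncertainty in the wavelength is the same for every colour, the relative uncertainty, and therefore the length of the horizontal error bar, grows as the wavelength gets shorter.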
Figure 137 Data for Planck's constant (graph of stopping voltage / V against frequency / × 10¹⁴ Hz)

(c) Calculate Planck's constant by graphical means and compare your value with the theoretical value of 6.63 × 10⁻³⁴ J s.

The equation given at the start of this example was:

$\dfrac{hc}{\lambda} = hf = \Phi + eV_s$

If we rearrange this equation in the form y = mx + c, the equation becomes:

$V_s = \dfrac{h}{e}f - \dfrac{\Phi}{e}$

Therefore, the gradient $= \dfrac{h}{e} = \dfrac{2.07\ \text{V}}{4.62 \times 10^{14}\ \text{s}^{-1}} = 4.5 \times 10^{-15}$ V s

$h = \text{gradient} \times e = 4.5 \times 10^{-15}\ \text{V s} \times 1.6 \times 10^{-19}\ \text{C} = 7.2 \times 10^{-34}$ J s

The accepted value of Planck's constant is 6.63 × 10⁻³⁴ J s.

The percentage discrepancy $= \dfrac{7.2 - 6.63}{6.63} \times 100\% = 8.6\%$

(d) Determine the minimum frequency of the photoelectric cell by graphical means.

(e) From the graph, calculate the work function of the photoemissive surface in the photoelectric cell in joules and electron-volts.

Exercise 1.2 (e)

An investigation was undertaken to determine the relationship between the length of a pendulum l and the time taken for the pendulum to oscillate twenty times. The time it takes to complete one swing back and forth is called the period T. It can be shown that

$T = 2\pi\sqrt{\dfrac{l}{g}}$

where g is the acceleration due to gravity. The data in the table below was obtained.

[Table columns include: Period T, T², Absolute error of T²]

(a) Copy the table and complete the period column for the measurements. Be sure to give the uncertainty and the units of T. Calculate the various values for T² including its units. Determine the absolute error of T² for each value. Draw a graph of T² against l. Make sure that you choose an appropriate scale to use as much of a piece of graph paper as possible. Label the axes, put a heading on the graph, and use error bars. Draw the curve of best fit. What is the relationship that exists between T² and l? Are there any outliers? From the graph determine a value for g.
1.3.1
1.3.2 Determine the sum or difference of two vectors by a graphical method.
1.3.3 Resolve vectors into perpendicular components along chosen axes.
© IBO 2007

Scalars are quantities that can be completely described by a magnitude (size). Scalar quantities can be added algebraically. They are expressed as a positive or negative number and a unit. Some scalar quantities, such as mass, are always positive, whereas others, such as electric charge, can be positive or negative. Figure 139 lists some examples of scalar and vector quantities.

Scalars               Vectors
distance (s)          displacement (s)
speed                 velocity (v)
mass (m)              area (A)
time (t)              acceleration (a)
volume (V)            momentum (p)
temperature (T)       force (F)
charge (Q)            torque (τ)
density (ρ)           angular momentum (L)
pressure (P)          flux density
energy (E)            electric field intensity (E)
power (P)             magnetic field intensity (B)

Figure 139 Examples of scalar and vector quantities

Vectors are quantities that need both magnitude and direction to describe them. The magnitude of the vector is always positive. In this textbook, vectors will be represented in heavy print. However, they can also be represented by underlined symbols or symbols with an arrow above or below the symbol. Because vectors have both magnitude and direction, they must be added, subtracted and multiplied in a special way. The basic mathematics of vector analysis will be outlined hereunder, and no mention will be made of i, j and k unit vectors.

Addition of vectors

From simple arithmetic it is known that 4 cm + 5 cm = 9 cm. However, in vector context, a different answer is possible when 4 and 5 are added. For example,

4 cm north (N) + 5 cm south (S) = 1 cm south

Suppose you move the mouse of your computer 4 cm up your screen (N), and then 5 cm down the screen (S). You move the mouse a total distance of 9 cm. This does not give the final position of the arrow moved by the mouse. In fact, the arrow is 1 cm due south of its starting point, and this is its displacement from its original position. The first statement adds scalar quantities and the second statement adds two vector quantities to give the resultant vector R.

The addition of vectors which have the same or opposite directions can be done quite easily:

1 N east + 3 N east = 4 N east (newton force)
200 μm north + 500 μm south = 300 μm south (micrometre)
300 m s⁻¹ north-east + 400 m s⁻¹ south-west = 100 m s⁻¹ south-west (velocity)

The addition of co-planar vectors that do not have the same or opposite directions can be solved by using scale drawings or by calculation using Pythagoras' theorem and trigonometry.

Vectors can be denoted by bold type, with an arrow above the letter, or with a tilde, that is, a in bold, a with an arrow above it, or a with a tilde, respectively. They are represented by a straight line segment with an arrow at the end. They are added by placing the tail of one to the tip of the first (placing the arrow head of one to the tail of the other). The resultant vector is then the third side of the triangle and the arrowhead points in the direction from the free tail to the free tip. This method of adding is called the triangle of vectors (see Figure 140).
[Vector diagram: a and b placed head to tail, with the resultant R = a + b drawn from the free tail to the free head]
The parallelogram of vectors rule for adding vectors can also be used. That is, place the two vectors tail to tail and then complete a parallelogram, so that the diagonal starting where the two tails meet becomes the resultant vector. This is shown in Figure 141.

Figure 141 Addition of vectors using parallelogram rule (R = a + b)

If more than two co-planar vectors are to be added, place them all head to tail to form a polygon. Consider the three vectors, a, b and c shown in Figure 142 (a). Adding the three vectors produces the result shown in Figure 142 (b).

Figure 142 (a) the vectors a, b and c; (b) the resultant R = a + b + c

Notice then that a + b + c = a + c + b = b + a + c = . . . That is, vectors can be added in any order, the resultant vector remaining the same.

Example

On an orienteering expedition, you walk 40 m due south and then 30 m due west. Determine how far and in what direction you are from your starting point.

Solution

Method 1

By scale drawing

Draw a sketch of the two stages of your journey. From the sketch make a scale drawing using 1 cm equal to 10 m (1 cm : 10 m).

Figure 143 Orienteering (A to B is 40 m south, B to C is 30 m west, angle CAB is 37°)

If you then draw the resultant AC, it should be 5 cm in length. Measure ∠CAB with a protractor. The angle should be about 37°. Therefore, you are 50 m in a direction south 37° west from your starting point (i.e., S 37° W).

Method 2

By calculation

Using Pythagoras' theorem, we have

$AC^2 = 40^2 + 30^2$, so $AC = \sqrt{40^2 + 30^2} = 50$ (taking the positive square root).

From the tan ratio, $\tan\theta = \dfrac{\text{opposite}}{\text{adjacent}}$, we have

$\tan\theta = \dfrac{BC}{AB} = \dfrac{30}{40} = 0.75$, so $\theta = \tan^{-1}(0.75) = 36.9°$

You are 50 m in a direction south 37° west from your starting point (i.e., S 37° W).
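The calculation method for the orienteering example can be checked with a few lines of code. This is a sketch only; the variable names are illustrative, and the angle is measured from due south towards the west, as in the worked solution.

```python
import math

south_m = 40.0  # first stage of the journey, due south
west_m = 30.0   # second stage of the journey, due west

# Pythagoras' theorem gives the magnitude of the resultant displacement
distance_m = math.hypot(south_m, west_m)

# tan(angle) = opposite / adjacent = west / south
angle_deg = math.degrees(math.atan2(west_m, south_m))

print(f"Resultant: {distance_m:.0f} m in a direction S {angle_deg:.1f} W")
```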
Subtraction of vectors

In Chapter 2, you will describe motion (kinematics). You will learn that change in velocity, Δv, is equal to the final velocity minus the initial velocity, v − u. Velocity is a vector quantity so Δv, v and u are vectors. To subtract v − u, you reverse the direction of u to obtain −u, and then you add vector v and vector −u to obtain the resultant Δv.

That is, Δv = v + (−u). Vectors v and u are shown. For v − u, we reverse the direction of u and then add head to tail.

Figure 144 (R = v + (−u) = v − u)

Example

A snooker ball is cued and strikes the cushion of the snooker table with a velocity of 5.0 m s⁻¹ at an angle of 45° to the cushion. It then rebounds off the cushion with a velocity of 5.0 m s⁻¹ at an angle of 45° to the cushion. Determine the change in velocity. (Assume the collision is perfectly elastic with no loss in energy.)

Solution

You can solve this problem by scale drawing or calculation. Draw a sketch before solving the problem, then draw the correct vector diagram.

[Vector diagram: vi = 5 m/s approaching the cushion at 45°; vf = 5 m/s leaving the cushion at 45°]

Notice that the lengths of the initial velocity vector, vi, and the final velocity vector, vf, are equal.

Using the vector diagram above we can now draw a vector diagram to show the change in velocity.

Using the same scale as that used for the 5.0 m s⁻¹ velocity vector, the change in velocity is 7.1 m s⁻¹ at right angles to the cushion. We could also use Pythagoras' theorem to determine the length (or magnitude) of the change in velocity vector, Δv. Because vf and −vi are at right angles to each other,

$|\Delta v|^2 = 5.0^2 + 5.0^2 = 50$, so $|\Delta v| = 7.1$ m s⁻¹

Multiplying vectors

A vector multiplied by a scalar gives a vector with the same direction as the vector and magnitude equal to the product of the scalar and the vector. For example:

3 × 15 N east = 45 N east
2 kg × 15 m s⁻¹ south = 30 kg m s⁻¹ south

The vector analysis of a vector multiplied by a vector is not required for the syllabus. However, you will encounter these situations when you study work, energy and electromagnetism. Two points will be made in an oversimplified manner:

1. Vectors can be multiplied to give a scalar answer. For example, force can be multiplied by a displacement to give work, which is a scalar. Finding the product in this manner is called the dot product, i.e.,

$\mathbf{U} \cdot \mathbf{V} = |\mathbf{U}|\,|\mathbf{V}| \cos\theta$

where θ is the angle between the directions of V and U.

Figure 147

2. The product of two vectors can also give a vector answer. For example, the force exerted on a proton moving with a velocity in a magnetic field is given by the equation $\mathbf{F} = q\mathbf{v} \times \mathbf{B}$ where q is the charge in coulombs, v is the velocity in metres per second, and B is the magnetic field strength in teslas. q is a scalar and v and B are vectors. The answer F is a vector. Finding the product in this manner is called the cross product, V × U. The magnitude of the cross product, V × U, is given by

$|\mathbf{V} \times \mathbf{U}| = |\mathbf{U}|\,|\mathbf{V}| \sin\theta$
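Although products of vectors are not required for the syllabus, the two kinds of product can be illustrated with component arithmetic. The sketch below uses hypothetical values for the force, displacement, velocity and field, and represents each vector as an (x, y, z) tuple; the helper functions are written out here for illustration and are not taken from any library.

```python
import math


def dot(u, v):
    """Dot product of two 3-component vectors (gives a scalar)."""
    return sum(a * b for a, b in zip(u, v))


def cross(u, v):
    """Cross product u x v of two 3-component vectors (gives a vector)."""
    return (
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    )


def magnitude(v):
    """Length of a vector."""
    return math.sqrt(dot(v, v))


# Work as a dot product: hypothetical force (N) and displacement (m)
force = (3.0, 4.0, 0.0)
displacement = (2.0, 0.0, 0.0)
work = dot(force, displacement)  # scalar, in J

# Magnetic force on a proton as a cross product, F = q v x B (hypothetical values)
q = 1.6e-19                   # charge in C
velocity = (2.0e5, 0.0, 0.0)  # m s^-1, along x
field = (0.0, 0.5, 0.0)       # T, along y
v_cross_b = cross(velocity, field)
magnetic_force = tuple(q * component for component in v_cross_b)

print(f"work = {work} J")
print(f"|v x B| = {magnitude(v_cross_b)} (equals |v||B| sin 90 degrees)")
print(f"F = {magnetic_force} N, at right angles to both v and B")
```

With the velocity along x and the field along y, the cross product points along z, which agrees with the right-hand rule.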
The direction of the answer, V × U, is at right angles to both V and U and can be found by curling the fingers of your right hand in the direction of V so that they curl towards U when you bend them. Your thumb is then pointing in the required direction.

Figure 148 (direction of V × U)

Exercise 1.3 (a)

1. Which of the following lines best represents the vector 175 km east (1 cm : 25 km)?
A. B. C. D. [lines shown in the diagram]

2. Which one of the following is a scalar quantity?
A. Force
B. Velocity
C. Momentum
D. Energy

4. The diagram below shows a boat crossing a river with a velocity of 4 m s⁻¹ north. The current flows at 3 m s⁻¹ west.

[Diagram: boat, 4 m s⁻¹; current, 3 m s⁻¹]

The resultant magnitude of the velocity of the boat will be:
A. 3 m s⁻¹
B. 4 m s⁻¹
C. 5 m s⁻¹
D. 7 m s⁻¹

5. Two vectors with displacements of 10 m north-west and 10 m north-east are added. The direction of the resultant vector is:
A. south
B. north-east
C. north
D. north-west

6. Add the following vectors by the graphical method:
(a) 4 m south and 8 m north
(b) 5 m north and 12 m west
(c) 6.0 N west and 6.0 N north
(d) 9.0 m s⁻¹ north + 4.0 m s⁻¹ east + 6.0 m s⁻¹ south.
7. Subtract the following vectors by either the graphical method or by calculation:
(a) 2 m east from 5 m east (i.e. 5 m east − 2 m east)
(b) 9 m s⁻² north from 4 m s⁻² south
(c) 4.0 N north from 3.0 N east
(d) 3.2 T east from 5.1 T south

8. Calculate the following products:
(a) 20 m s⁻¹ north by 3
(b) 12 by 5 N s north 12° east

9. If a cyclist travelling east at 40 m s⁻¹ slows down to 20 m s⁻¹, what is the change in velocity?

10. Find the resultant of a vector of 5 m north 40° west added to a vector of 8 m east 35° north.

Resolution of vectors

Suppose you have a vector that is at an angle to the horizontal direction. Then that vector consists of measurable horizontal and vertical components. In Figure 151, the vector F is broken into its components. Note that the addition of the components gives the resultant F.

Figure 151 (vector F at an angle θ to the horizontal, with horizontal component x and vertical component y)

From trigonometry, $\sin\theta = \dfrac{y}{F}$ and $\cos\theta = \dfrac{x}{F}$, so that $x = F\cos\theta$ and $y = F\sin\theta$.

Example

A sky rocket is launched from the ground at an angle of 61.0° with an initial velocity of 120 m s⁻¹. Determine the components of this initial velocity.

Solution

Taking the angle as measured from the horizontal:

horizontal component = 120 cos 61.0° = 58.2 m s⁻¹
vertical component = 120 sin 61.0° = 105 m s⁻¹
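Resolving a vector into perpendicular components can be written as a small reusable function. This is a minimal sketch, assuming the angle is measured from the horizontal; the function name is illustrative. It is applied here to the sky rocket example (120 m s⁻¹ at 61.0°).

```python
import math


def resolve(magnitude, angle_deg):
    """Return (horizontal, vertical) components of a vector.

    The angle is assumed to be measured from the horizontal, in degrees.
    """
    theta = math.radians(angle_deg)
    return magnitude * math.cos(theta), magnitude * math.sin(theta)


horizontal, vertical = resolve(120.0, 61.0)
print(f"horizontal = {horizontal:.1f} m s^-1, vertical = {vertical:.1f} m s^-1")

# Adding the components back together (Pythagoras) recovers the original magnitude
recombined = math.hypot(horizontal, vertical)
print(f"recombined magnitude = {recombined:.1f} m s^-1")
```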
Exercise 1.3 (b)

1. The vertical component of a vector of a 4.0 N force acting at 30° to the horizontal is
A. 4.3 N
B. 2 N
C. 4 N
D. 8.6 N

2. Calculate the horizontal component of a force of 8.4 N acting at 60.0° to the horizontal.

3. Calculate the vertical and horizontal components of the velocity of a projectile that is launched with an initial velocity of 25.0 m s⁻¹ at an angle of elevation of 65° to the ground.

4. Calculate the easterly component of a force of 15 N south-east.

5. Calculate the vector whose components are 5.0 N vertically and 12 N horizontally.

6. Calculate F in the diagram below if the sum of all the forces is zero.

[Diagram: forces of 12 N, 8.0 N, 15 N and F, labelled A, B, C and F, with angles of 45° and 25° marked]
1. $a^x a^y = a^{x+y}$
2. $a^x \div a^y = \dfrac{a^x}{a^y} = a^{x-y}$
3. $(a^x)^y = a^{xy}$
4. $a^x b^x = (ab)^x$
5. $a^0 = 1$, $1^x = 1$, $0^x = 0\ (x \neq 0)$, $\sqrt[x]{a} = a^{1/x}$

$\log x + \log y = \log(xy)$, x > 0, y > 0.
$\log x - \log y = \log\dfrac{x}{y}$, x > 0, y > 0.
$x \log y = \log y^x$, y > 0.
$a^x = y \Leftrightarrow x = \log_a y$

=      is equal to
/      divided by or in units of
<      is less than
>      is greater than
∝      is proportional to
≈      is approximately equal to
Δx     a small difference between two values of x
|x|    the absolute value of x
[Diagram: right-angled triangle ABC with sides a, b and c; b is the hypotenuse]

$\sin\theta = \dfrac{\text{opposite}}{\text{hypotenuse}} = \dfrac{a}{b}$

$\cos\theta = \dfrac{\text{adjacent}}{\text{hypotenuse}} = \dfrac{c}{b}$

$\tan\theta = \dfrac{\text{opposite}}{\text{adjacent}} = \dfrac{a}{c}$

$\tan\theta = \dfrac{\sin\theta}{\cos\theta}$, $\cos\theta \neq 0$

For very small angles, $\sin\theta \approx \tan\theta \approx \theta$, $\cos\theta \approx 1$

Sine rule: $\dfrac{a}{\sin A} = \dfrac{b}{\sin B} = \dfrac{c}{\sin C}$

Cosine rule: $a^2 = b^2 + c^2 - 2bc\cos A$

Area of triangle: $A = \tfrac{1}{2}ab\sin C$

Identities:

$\sin^2 A + \cos^2 A = 1$
$\sin(A - B) + \sin(A + B) = 2\sin A\cos B$
$\sin A + \sin B = 2\sin[(A + B)/2]\cos[(A - B)/2]$

Exercise 1.3 (c)

Make y the subject of the equation if x = 2y − 6.

Make v the subject of the equation given that: $F = \dfrac{mv^2}{r}$

$T = 2\pi\sqrt{\dfrac{l}{g}}$

Calculate the following:
(a) 162 + 163
(b) 25^1.5
(c) ( 2) 4
(d) (3) -2

Find the circumference and area of a circle of radius 0.8 cm.

Calculate the volume and surface area of a sphere of radius 0.023 m.

How many radians are there in:
A. 270°
B. 45°
ANGULAR MEASURE
Angles are measured in radians. One radian is the angle subtended by an arc with length equal to the radius. If s = r, then θ = s/r = 1 rad. Note then, that 2π rad = 360°, and 1 rad = 57.3°.
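The relationship between degrees and radians can be expressed as two short conversion functions. This is a sketch with illustrative function names; Python's standard library also provides math.radians and math.degrees, which do the same job.

```python
import math


def to_radians(angle_deg):
    """Convert degrees to radians using 2*pi rad = 360 degrees."""
    return angle_deg * math.pi / 180


def to_degrees(angle_rad):
    """Convert radians to degrees using 2*pi rad = 360 degrees."""
    return angle_rad * 180 / math.pi


def angle_from_arc(arc_length, radius):
    """Angle in radians subtended by an arc: theta = s / r."""
    return arc_length / radius


print(f"360 degrees = {to_radians(360):.4f} rad")
print(f"1 rad = {to_degrees(1):.1f} degrees")
print(f"arc equal to the radius: {angle_from_arc(0.5, 0.5)} rad")
```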
GREEK SYMBOLS

The Greek alphabet is commonly used in Physics for various quantities and constants. The capital and small letters and their names are given here for your convenience:

Letters    Name
Α α        alpha
Β β        beta
Γ γ        gamma
Δ δ        delta
Ε ε        epsilon
Ζ ζ        zeta
Η η        eta
Θ θ        theta
Ι ι        iota
Κ κ        kappa
Λ λ        lambda
Μ μ        mu
Ν ν        nu
Ξ ξ        xi
Ο ο        omicron
Π π        pi
Ρ ρ        rho
Σ σ        sigma
Τ τ        tau
Υ υ        upsilon
Φ φ        phi
Χ χ        chi
Ψ ψ        psi
Ω ω        omega
MECHANICS

2.1 Kinematics
2.2 Forces and Dynamics
2.3 Work, Energy and Power
2.4 Uniform Circular Motion

TOK Background

The late Richard Feynman described the process of Physics as akin to observing a vast chess game in which the boundaries of the chessboard cannot be seen. Furthermore, we have no idea why the game is being played or by whom. Nor do we know when the game started, nor will we ever see the end of the game. We don't know the rules of the game and our problem is to figure them out. By careful observation over a period of time we might, for example, discover the rule that governs the move of the bishops and if we are really clever we might even find the rule which governs the movement of the knights. Occasionally something really odd might happen, like two white queens appearing on the board at the same time. All our previous observations had led us to the conclusion that this could not be the case.

The chessboard in this analogy is the Universe and the chess pieces are the matter in the Universe. The rules that we discover are the laws of Physics and the observations we make of the pieces are the experiments that we carry out to establish the laws of Physics. The rules give the how and not the why. In other words they do not tell us why the pieces move but they help us understand the manner in which they move. And so it is with Physics. We will never know, for example, why when we push something it moves. However, we can give a very good description of how it will move under different circumstances. Physics is the science that describes how the Universe works.

Physics falls into two main categories. There is the Physics before 1926 (Classical Physics) and there is the Physics after 1926 (Quantum Physics). Most of the Physics that is studied in an IB course is Classical Physics. However, it is important to realise that ultimately our description of how the Universe works must be understood in terms of Quantum Physics because we know this to be (so far) the correct Physics. So you might ask why we spend so much time in teaching you the wrong Physics.

Well, it's not quite as bad as it sounds. For example, if we apply the laws of Classical Physics to the behaviour of electrons in solids we get the wrong answer. The laws of Quantum Physics give the right answer. On the other hand, if we apply the laws of Classical Physics and the laws of Quantum Physics to the behaviour of billiard balls, at slow enough speeds, both give the right answer. However, using Quantum Physics in this situation is rather like taking the proverbial sledgehammer to crack a walnut. In many of the situations that we encounter, Classical Physics will give us the right answer and so for this reason, and the fact that Quantum Physics is not easy to grasp on first acquaintance, we spend a lot of time teaching students Classical Physics.

If we plot the speed of things against size, then we can see the sort of areas pertinent to each of the main areas of Physics. Refer to Figure 201.
Figure 201 The different areas of Physics (speed / m s⁻¹ plotted against size, showing the regions of Classical Physics, Quantum Physics and Relativistic Physics)

You will note that there is a region that overlaps both Quantum and Classical Physics. This is Relativistic Physics and is the Physics we have to use when we are dealing with speeds close to that of the speed of light. So there is Relativistic Quantum Physics and Relativistic Classical Physics. The latter is discussed in more detail in Topic D.1 and D.2 and Option H.

The two great pillars upon which Classical Physics rests are Newtonian Mechanics and Electromagnetism. Mass and electric charge are the two basic properties that we associate with all matter; Newtonian Mechanics essentially deals with mass and Electromagnetism essentially deals with charge. The two corresponding pillars upon which Quantum Physics rests are Quantum Mechanics and Quantum Electrodynamics. Bridging both Quantum and Classical Physics are Relativistic Physics and Thermodynamics. This latter subject essentially deals with the relationship between heat and work and also such interesting questions as "how can order arise from disorder?"

Figure 202 The structure of this course (Physics divided into Classical Physics and Quantum Physics; branches shown include Electromagnetism (charge) and QED, J.1)

Figure 202 summarises the (essential) branches of Physics and also gives the appropriate syllabus reference in the IB Physics Syllabus.

At the present time we understand the two great pillars of Physics to be General Relativity (which describes space and time) and Quantum Physics (which describes everything else), and somewhere along the line Thermodynamics has to fit in as well. One of the great aims of physicists is to try and unify General Relativity and Quantum Mechanics into a single theory.
NEWTONIAN MECHANICS
In this part of the course we start our journey through Newtonian Mechanics, one of the great pillars of Classical Physics. The essential problem in Mechanics is this: if at any given instant in time we know the positions and velocities of all the particles that make up a particular system, can we predict the future positions and velocities of all the particles? This is the mechanics problem in its most general form. Specific examples are problems such as predicting solar eclipses, putting satellites into orbit, finding out how the position of an oscillating object varies with time and finding out where a snooker ball ends up when it is struck by another snooker ball. In 1687, Isaac Newton (1642-1727) published his Principia Mathematica in which he set out a method for solving these types of problems; hence the name Newtonian Mechanics.
2.1 KINEMATICS

2.1.1 Define displacement, velocity, speed and acceleration.
2.1.2 Explain the difference between instantaneous and average values of speed, velocity and acceleration.
2.1.3 Outline the conditions under which the equations for uniformly accelerated motion may be applied.
2.1.4 Identify the acceleration of a body falling in a vacuum near the Earth's surface with the acceleration g of free fall.
2.1.5 Solve problems involving the equations of uniformly accelerated motion.
2.1.6 Describe the effects of air resistance on falling objects.
2.1.7 Draw and analyse distance-time graphs, displacement-time graphs, velocity-time graphs and acceleration-time graphs.
2.1.8 Calculate and interpret the slopes of displacement-time graphs and velocity-time graphs, and the areas under velocity-time graphs and acceleration-time graphs.
2.1.9 Determine relative velocity in one and in two dimensions.
© IBO 2007

Introduction

Before we can solve such problems as outlined above we need some method of describing quantitatively the motion of particles.

Note. The terms quantitative and qualitative are used a lot in Physics and it is important to be clear as to their meaning. Suppose that the motion of something was described as follows: "it got faster and faster and then suddenly came to an abrupt stop a long way from where it started". This would be a qualitative description of the motion. On the other hand, suppose the motion is described as follows: "It started from rest and headed due north with a positive acceleration of 15 m s⁻² for 20 s, reaching a velocity of 300 m s⁻¹. After 20 s it acquired a negative acceleration of 50 m s⁻², again in a northerly direction, and came to rest having a displacement of 3.9 km due north from where it started." This is then a quantitative description of the motion.

The key words in the above quantitative description are displacement, velocity and acceleration, so let us look at these so-called kinematic concepts in more detail.
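The numbers in the quantitative description above are consistent with one another, as a short calculation shows. This is a sketch that assumes the acceleration is constant in each of the two phases and uses the equations of uniformly accelerated motion that are developed later in this chapter; the variable names are illustrative.

```python
# Phase 1: starts from rest, accelerates at 15 m s^-2 for 20 s
a1 = 15.0   # m s^-2
t1 = 20.0   # s
v_max = a1 * t1               # velocity reached, m s^-1
s1 = 0.5 * a1 * t1 ** 2       # displacement in phase 1, m

# Phase 2: acceleration of -50 m s^-2 until the object comes to rest
a2 = -50.0  # m s^-2
t2 = -v_max / a2                      # time taken to stop, s
s2 = v_max * t2 + 0.5 * a2 * t2 ** 2  # displacement in phase 2, m

total_displacement = s1 + s2
print(f"velocity after phase 1 = {v_max:.0f} m s^-1")
print(f"total displacement = {total_displacement / 1000:.1f} km due north")
```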
Figure 203 (particle P moving along the x-axis from (4,0) to (8,0))

Figure 204 Another method of calculating displacement

Figures 203 and 204 are two examples of calculating displacements. In Figure 203 the particle P is at the point (4,0) at some instant and at a certain time later it is at a point (8,0). Its displacement in this time interval is, therefore, 4 units in the positive x direction. In Figure 204, the particle is at the point (6, 2) at some instant and its displacement from the origin is represented by the vector A. At some interval later the particle has moved to the point (3, 5) and its displacement is represented by vector B. Its displacement in this time interval is therefore B − A.

Exercise 2.1 (a)

A racing car travels round a circular track of radius 100 m as shown in the diagram below.

[Diagram: circular track of radius 100 m, with points O and P marked]

The car starts at O. When it has travelled to P its displacement as measured from O is:
A. 100 m due East
B. 100 m due West
C. $100\sqrt{2}$ m South East
D. $100\sqrt{2}$ m South West

Speed

This is a concept with which you will all be familiar. Speed tells us the rate at which a moving object covers distance with respect to time. Hence we have

speed = distance ÷ time

Velocity

Velocity is speed in a given direction. It is therefore a vector quantity. To plot a course an airline pilot needs to know not just the speed of the wind but from which direction it is blowing, i.e. the wind velocity must be known.

Example

In still water a motor boat has a maximum speed of 5 m s⁻¹. The boat sets off to cross from one bank of a river to the other. The river flows with a speed of 2 m s⁻¹ and the motor boat engine is set to maximum. Calculate the velocity of the motor boat.

The solution to this problem involves simple vector addition.

[Vector diagram: boat velocity of 5 m s⁻¹ and river velocity of 2 m s⁻¹ at right angles, with the resultant at an angle θ to the original direction of the boat]

With reference to the above diagram the magnitude of the resultant velocity of the boat is

$\sqrt{5^2 + 2^2} = 5.4$ m s⁻¹

The direction can be measured relative to the original direction of the boat and is given by the angle

$\theta = \tan^{-1}\left(\dfrac{2}{5}\right) = 21.8°$ (≈ 22°)
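The vector addition in the motor boat example can be reproduced in a few lines. This is a sketch that assumes the boat is pointed straight across the river, so that its velocity in still water is at right angles to the flow of the river; the variable names are illustrative.

```python
import math

boat_speed = 5.0   # m s^-1, maximum speed in still water, directed across the river
river_speed = 2.0  # m s^-1, speed of the river, directed along the bank

# The two velocities are at right angles, so Pythagoras gives the resultant speed
resultant_speed = math.hypot(boat_speed, river_speed)

# Angle of the resultant relative to the original direction of the boat
angle_deg = math.degrees(math.atan(river_speed / boat_speed))

print(f"resultant speed = {resultant_speed:.1f} m s^-1")
print(f"direction = {angle_deg:.1f} degrees from the original direction of the boat")
```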
An experiment with free fall can help us understand this. Figure 207 shows a ball being dropped between two light gates which are connected to an electronic counter.
Acceleration
Acceleration is the rate of change of velocity in a given direction. (Change in velocity time taken). In the SI system the unit is metres per second per second. i.e. the change in velocity measured in m s-1 every second. We write this as m s-2. Since we define acceleration in terms of velocity it is therefore a vector quantity. It is important to understand that the word deceleration has no place in physics. If the acceleration of an object is positive then we understand its rate of change of velocity to be positive and it could mean that the speed of the body is increasing. A body that is slowing down will have a negative acceleration. However, do not think of acceleration as a slowing up or getting faster. If a car for example goes round a bend in the road at constant speed it is accelerating. Why? Because the direction of the car is changing and therefore its velocity is changing. If its velocity is changing then it must have acceleration. This is sometimes difficult for people to grasp when they first meet the physics definition of acceleration because in everyday usage acceleration refers to something getting faster. Similarly words like work and power which can have very flexible meanings in everyday usage are very precisely defined in physics. So beware. As we shall see later on in this chapter it is very important to keep in mind the vector nature of both velocity and acceleration.
Figure 207 An experiment in free fall (light gates A and B, a distance s apart, connected to an electronic counter)
The ball is dropped from a fixed point somewhere above the light gate A. The two light gates A and B connected to the counter will record the time t that it takes the ball to fall the distance s. The average speed of the ball as it falls between A and B is just s/t. The ball is of course accelerating as it falls so its speed is changing. Now imagine the light gate B closer to A and repeat the experiment. We would obtain a different value for the average speed. As we repeat the experiment several times, each time moving B closer to A, we will find that the values of the average speed obtained each time will be approaching some limiting value. This limiting value is actually the instantaneous speed of the ball as it passes A. When the distance between A and B becomes very small (as does the corresponding time of fall) then this distance divided by the time will very nearly be equal to the instantaneous speed at A. If we let the small distance equal $\Delta s$ and the time of fall equal $\Delta t$ then the average speed $v_{av}$ over this distance is
$v_{av} = \frac{\Delta s}{\Delta t}$
and the instantaneous speed v is given by
$v = \frac{\Delta s}{\Delta t}$ as $\Delta t \to 0$
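The limiting process described for the light-gate experiment can be tried numerically. The sketch below is only an illustration: it assumes an ideal ball falling from rest with g = 10 m s-2 (the value used elsewhere in this chapter), and the time at which the ball passes gate A (0.5 s) is a hypothetical choice, not a value from the experiment.

```python
G = 10.0  # assumed acceleration of free fall in m s-2


def distance_fallen(t):
    """Distance in metres fallen from rest after t seconds: s = (1/2) g t^2."""
    return 0.5 * G * t ** 2


def average_speed(t_a, delta_t):
    """Average speed between gate A (passed at time t_a) and gate B (passed delta_t later)."""
    delta_s = distance_fallen(t_a + delta_t) - distance_fallen(t_a)
    return delta_s / delta_t


T_A = 0.5  # hypothetical time in seconds at which the ball passes gate A

# Moving gate B closer to A corresponds to making delta_t smaller.
for delta_t in (0.1, 0.01, 0.001, 0.0001):
    speed = average_speed(T_A, delta_t)
    print(f"delta_t = {delta_t} s, average speed = {speed:.4f} m s-1")
```

As delta_t shrinks, the printed average speeds settle towards the instantaneous speed at A, which for this idealised fall is g multiplied by the time at which A is passed.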
If we are dealing with velocities then we must write the above equation in vector form. The magnitude of instantaneous velocity is the instantaneous speed of the object at the instant measured and the direction of the velocity is the direction in which the object is moving at that instant. The concept of instantaneous acceleration follows accordingly as
$a = \frac{\Delta v}{\Delta t}$ as $\Delta t \to 0$
where $\Delta v$ is the change in velocity in time $\Delta t$.
Figure 208 Variation of speed with time (speed v against time t, with v = V at t = T)
Since distance = speed × time, the distance travelled is the area under the speed-time graph. If the body starts from rest, then the distance s travelled in a time t = T is $s = \frac{1}{2}vT$ where v is the speed at time T. From the definition of acceleration, we have $a = v/T$, hence $s = \frac{1}{2}aT^2$. In general, the distance s travelled in any arbitrary time t is therefore $s = \frac{1}{2}at^2$. If the body starts with speed u, then clearly we have to add the extra area ut such that
$s = ut + \frac{1}{2}at^2$
We can eliminate the time from this equation by substituting $t = \frac{v - u}{a}$ from the first equation such that
$v^2 = u^2 + 2as$
This is the third equation of the set.
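A quick numerical check can make the link between the three equations of uniform motion concrete. The sketch below is illustrative only: the function names and the values of u, a and t are hypothetical choices, not taken from the text.

```python
def final_speed(u, a, t):
    """First equation: v = u + a t."""
    return u + a * t


def distance_travelled(u, a, t):
    """Second equation: s = u t + (1/2) a t^2."""
    return u * t + 0.5 * a * t ** 2


def final_speed_squared(u, a, s):
    """Third equation: v^2 = u^2 + 2 a s."""
    return u ** 2 + 2 * a * s


# Hypothetical values: initial speed in m s-1, acceleration in m s-2, time in s.
u, a, t = 5.0, 2.0, 3.0

v = final_speed(u, a, t)
s = distance_travelled(u, a, t)

# The third equation should agree with the first two, since it was
# obtained from them by eliminating the time t.
print("v squared from the first equation:", v ** 2)
print("v squared from the third equation:", final_speed_squared(u, a, s))
```

The two printed values agree because the third equation contains no information beyond the first two; it is simply more convenient when the time is not known.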
v = u + at
The sketch-graph in Figure 208 shows the variation with time t of the speed v of a body moving with constant acceleration a.
The following exercise demonstrates an alternative way of analysing data obtained using strobe photographic techniques. This method is based on the equations of uniform motion.

Figure 209 shows the results of an experiment in which the strobe photograph of a falling ball has been analysed. The strobe takes 20 pictures a second. The time between each picture is therefore 0.05 s. The distance column is the measured distance of each successive photograph of the ball from the origin. The error in the distance has been estimated from parallax error in reading from the scale against which the photographs have been taken and also in locating the centre of the ball in each photograph.

You are to plot a graph of s against $t^2$ and from the graph find a value of g. You should include error bars on the graph and use these to calculate the error in the value of g that you have determined.

time t/s (± 0.01 s)    distance s/cm (± 0.4 cm)
0                      0
0.05                   1.2
0.10                   4.8
0.15                   10.9
0.20                   19.4
0.25                   30.3
0.30                   43.7
0.35                   59.4
0.40                   77.6
0.45                   98.2
0.50                   121.2
0.55                   146.7
0.60                   174.6

Figure 209 Data For Free Fall

Some comments on g
In Chapter 6 we will see that the value of g varies with position and with height above the Earth's surface and also that, in the absence of air resistance, the acceleration of free falling objects is independent of their mass. This was first noted by Galileo who is reputed to have timed the duration of fall for different objects dropped from the top of the Leaning Tower of Pisa. The fact that the acceleration of free fall is independent of an object's mass has far reaching significance in Physics and is discussed in more detail in Topic H (General relativity).

If you carry out an experiment to measure g and obtain a value say of 9.4 m s-2 then make sure that you calculate your error using the correct method. For example, do not assume that the value of g is 9.8 m s-2 and hence compute your error in measurement as 4%. You do not know what the value of g is at your location and that is why you are measuring it. One correct way to calculate the error is along the lines indicated in the preceding exercise, i.e., using error bars.

1. A stone is dropped down a well and a splash is heard 2.4 s later. Determine the distance from the top of the well to the surface of the water.

2. A girl stands on the edge of a vertical cliff and she throws a stone vertically upwards. The stone eventually lands in the sea below her. The stone leaves her hand with a speed of 15 m s-1 and the height of the cliff is 25 m. Calculate
i. the maximum height reached by the stone.
ii. the time to reach the maximum height.
iii. the speed with which the stone hits the sea.
iv. the time from leaving the girl's hand that it takes the stone to hit the sea.

3. A sprinter starts off down a track at a speed of 10 m s-1. At the same time a cyclist also starts off down the track. The cyclist accelerates to a top speed of 20 m s-1 in 4.0 s. Ignoring the acceleration of the sprinter, determine the distance from the start that the cyclist will pass the sprinter.
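The graphical analysis asked for with the strobe data of Figure 209 can also be sketched numerically. The code below is one possible approach, not the book's prescribed method: it fits a least-squares straight line to s against t squared (the gradient is g/2 for a ball falling from rest), and then makes a crude error-bar estimate from the steepest and shallowest lines through the error bars of the first and last points. That second step considers only the ± 0.4 cm distance error and ignores the ± 0.01 s time error, which is a simplifying assumption.

```python
# Strobe data from Figure 209: times in seconds, distances in centimetres.
times_s = [0.05 * i for i in range(13)]
distances_cm = [0, 1.2, 4.8, 10.9, 19.4, 30.3, 43.7,
                59.4, 77.6, 98.2, 121.2, 146.7, 174.6]
DISTANCE_ERROR_CM = 0.4


def fit_line(xs, ys):
    """Ordinary least-squares straight line y = gradient * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    gradient = sxy / sxx
    return gradient, mean_y - gradient * mean_x


t_squared = [t ** 2 for t in times_s]
distances_m = [d / 100 for d in distances_cm]

# For s = (1/2) g t^2 the gradient of s against t^2 is g / 2.
gradient, intercept = fit_line(t_squared, distances_m)
g_best = 2 * gradient

# Crude error-bar estimate using only the first and last points.
error_m = DISTANCE_ERROR_CM / 100
run = t_squared[-1] - t_squared[0]
rise = distances_m[-1] - distances_m[0]
g_max = 2 * (rise + 2 * error_m) / run  # steepest line through the error bars
g_min = 2 * (rise - 2 * error_m) / run  # shallowest line through the error bars

print(f"g from best-fit line: {g_best:.2f} m s-2")
print(f"range from error bars: {g_min:.2f} to {g_max:.2f} m s-2")
```

Note that the uncertainty comes from the data themselves and not from comparing the result with an accepted value of g.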
Figure 210 (velocity against time for the bouncing ball, with the points O and A to H marked)
The ball leaves the hand at point O and accelerates uniformly until it hits the ground at A. At A it undergoes a large acceleration during which its velocity changes from positive to negative (being zero at B). The change in velocity between B and C is less than the change in velocity between A and B since the rebound velocity is lower than the impact velocity. The ball accelerates from C to D at which point it is at its maximum height and its velocity is zero. Notice that even though its velocity is zero its acceleration is not. The ball now falls back to the surface and hits the surface at point E. Neglecting air resistance the speed of the ball at points C and E will be the same. The process now repeats. The lines OA, CE and FG are parallel and the gradient of these lines is the acceleration of free fall, g. The lines AC, EF and GH are also parallel and the gradient of these lines is equal to the acceleration of the ball whilst it is in contact with the surface. The lines should not be vertical as this would mean that the acceleration would be infinite. The acceleration of the ball at points such as A, C and E is not equal to g. Sketching an acceleration-time graph taking into account the acceleration of the ball whilst it is in contact with the surface is a little tricky. However, you might like to try it as an exercise. For all other parts of the ball's motion it has a constant acceleration, g. The speed-time graph is interesting and this is left as a question at the end of the chapter. The displacement-time graph is also interesting and this is also left as a question at the end of the chapter.
Exercise 2.1 (c)
1.
Figure 212 Distance-time graph (distance / m against time / s)
If the velocity is not constant we can still find average speeds and instantaneous speeds from displacement-time graphs. To demonstrate this, let us look at the distance-time graph for a falling ball; this is shown in Figure 213.
(Sketch graphs C and D for question 1, each drawn against time.)
2. For the example of the stone dropped from a balloon and striking the ground, and taking into account air resistance, sketch
i. a distance-time graph.
ii. a velocity-time graph.
iii. an acceleration-time graph.
Figure 213 Distance-time graph for a falling ball (distance / m against time / s)
From the graph we see that the time it takes the ball to fall 1.0 m is 0.4 s. The average speed over this distance is therefore 2.5 m s-1. (Remember that speed is the magnitude of velocity.) To find, say, the instantaneous speed at 1.0 m we find the gradient of the curve at this point. To do this we draw the tangent to the curve at this point as shown. From the tangent that is drawn, we see that the slope of the line is 1.8 (= $\Delta s$) divided by 0.4 (= $\Delta t$) = 4.5 m s-1.
A glider bounces backwards and forwards between the buffers of a linear air-track. Neglecting friction, which one of the graphs below best represents how the
This is actually what we mean when we write an instantaneous speed as
$v_P = \frac{\Delta s}{\Delta t}$ as $\Delta t \to 0$
$v_P$ is the instantaneous speed at the point P and
Figure 215 (velocity / m s-1 against time / s for the falling ball)
gradient at any time t and hence the instantaneous speed). When sketching or plotting a displacement-time graph we have to bear in mind that displacement is a vector quantity. Consider for example, the situation of an object that leaves point A, travels with uniform speed in a straight line to point B, returns to point A at the same constant speed and passes through point A to a point C. If we ignore the accelerations at A and B and regard the point A as the zero reference point, then a sketch of the displacement-time graph will look like that shown in Figure 214.
The acceleration is the (change in velocity) ÷ time and in this case this is equal to the gradient of the straight line and is equal to 10 m s-2. This is a situation of constant acceleration but even when the acceleration is not constant the acceleration at any instant is equal to the gradient of the velocity-time graph at that instant. In section 2.1.2 we saw that we defined instantaneous acceleration as
$a = \frac{\Delta v}{\Delta t}$ as $\Delta t \to 0$
Students familiar with calculus will recognise that acceleration is the derivative
$a = \frac{dv}{dt}$
which can also be written as:
$a = \frac{d}{dt}\left(\frac{ds}{dt}\right) = \frac{d^2 s}{dt^2}$
We can also determine distances from speed-time graphs. This is easily demonstrated with the graph of Figure 216 which shows the speed-time graph for constant speed.
Figure 214 Displacement-time graph (displacement against time, with A, B and C and the times t1 and 2t1 marked)
Figure 216 (speed / m s-1 against time / s)
The distance travelled is just velocity × time. So at a constant speed of 20 m s-1 after 10 s the object will have travelled 200 m. This is of course equal to the area under the line between t = 0 and t = 10 s. If the velocity is not constant then the area under a velocity-time graph will also be equal to the displacement. So, for the falling ball, we see from the velocity-time graph, Figure 215, that the distance travelled after 1.0 s is equal to the area of the triangle of base 1.0 s and height 10 m s-1, which equals $\frac{1}{2} \times 1.0$ s $\times 10$ m s-1 = 5.0 m.

Example
A train accelerates uniformly from rest to reach a speed of 45 m s-1 in a time of 3.0 min. It then travels at this speed for a further 4.0 min at which time the brakes are applied. It comes to rest with constant acceleration in a further 2.0 min. Draw the speed-time graph for the journey and from the graph calculate
(a) the magnitude of the acceleration between 0 and 3.0 min
(b) the magnitude of the acceleration after the brakes are applied
(c) the total distance travelled by the train

Solution
(Speed-time graph for the journey: speed / m s-1 against time / s, with the points A, B, C, D, E and F marked.)
(a) the acceleration of the train in the first 3 minutes is the gradient of the line AB. Therefore, we have, $a = \frac{45}{180} = 0.25$ m s-2
(b) the acceleration of the train after the brakes are applied is the gradient of the line CD $= \frac{-45}{120} = -0.38$ m s-2, so its magnitude is 0.38 m s-2
(c) the distance travelled by the train is the total area under the graph.
Total area = area of triangle ABE + area BCFE + area of triangle CDF
= 4050 + 10800 + 2700 = 17550 m ≈ 18000 m = 18 km.

ACCELERATION-TIME GRAPHS
If the acceleration of an object varies it is quite tricky to calculate the velocity of the object after a given time. However, we can use an acceleration-time graph to solve the problem. Just as the area under a velocity-time graph is the distance travelled, the area under an acceleration-time graph is the change in speed achieved. For the falling ball the acceleration is constant with a value of 10 m s-2. A plot of acceleration against time will yield a straight line parallel to the time axis, Figure 218. If we wish, for example, to find the speed 3.0 s after the ball is dropped, then this is just the area under the graph:
speed = 10 m s-2 × 3.0 s = 30 m s-1
Figure 218 (acceleration / m s-2 against time / s)
We could of course have found the speed directly from the definition of uniform acceleration i.e. speed = acceleration × time.
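The graphical reasoning in the train example (gradients for the accelerations, area for the distance) can be checked with a short calculation. This is a sketch under stated assumptions: the corner points of the speed-time graph are taken from the wording of the example with minutes converted to seconds, and the function names are illustrative only.

```python
def area_under_graph(times, values):
    """Area under a graph made of straight-line segments (trapezium rule)."""
    area = 0.0
    for i in range(1, len(times)):
        width = times[i] - times[i - 1]
        area += 0.5 * (values[i] + values[i - 1]) * width
    return area


def gradient(t1, v1, t2, v2):
    """Gradient of the straight line joining (t1, v1) and (t2, v2)."""
    return (v2 - v1) / (t2 - t1)


# Corner points of the train's speed-time graph:
# rest -> 45 m s-1 after 3.0 min, constant speed for 4.0 min, rest after 2.0 min more.
times_s = [0.0, 180.0, 420.0, 540.0]
speeds = [0.0, 45.0, 45.0, 0.0]

accel_start = gradient(times_s[0], speeds[0], times_s[1], speeds[1])
accel_brake = gradient(times_s[2], speeds[2], times_s[3], speeds[3])
distance = area_under_graph(times_s, speeds)

print("acceleration while speeding up:", accel_start, "m s-2")
print("acceleration while braking:", accel_brake, "m s-2")
print("total distance:", distance, "m")
```

The same area function applies to an acceleration-time graph, in which case the area gives the change in speed rather than the distance.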
Example
The acceleration of an object increases uniformly at a rate of 3.0 m s-2 every second. If the object starts from rest, calculate its speed after 10 s.

Solution
A plot of the acceleration-time graph for this situation is shown in the graph below.
(Graph: acceleration / m s-2 against time / s.)
The speed attained by the object after 10 s is the area under the line. Therefore, speed = area enclosed by acceleration-time graph $= \frac{1}{2} \times 10 \times 30 = 150$ m s-1

To summarise:
• the gradient of a displacement-time graph is equal to the velocity
• the gradient of a velocity-time graph is equal to the acceleration
• the area under a velocity-time graph is equal to the distance
• the area under an acceleration-time graph is the change in velocity.

is 5 m s-1. Clearly the determination of speed (and therefore velocity and acceleration) depends on what it is measured relative to. Generally speaking, if the velocity of a particle A relative to an assigned point or reference frame O is $V_A$ and the velocity of a particle B relative to the same point is $V_B$, then the velocity of A relative to B is the vector difference $V_A - V_B$ (see Topic 1.3). The example that follows illustrates how relative velocity is very important in situations such as plotting a correct course for an aircraft or for a boat or ship.

Example
The Figure below shows the two banks of a river. A ferryboat operates between the two points P and Q that are directly opposite each other.
(Figure: the two river banks with P and Q 90 m apart; the river flows at 1.2 m s-1 parallel to the banks.)
The speed of flow of the river relative to the riverbanks is 1.2 m s-1 in the direction shown. The speed of the ferryboat in still water is 1.8 m s-1 in a direction perpendicular to the river banks. The distance between P and Q is 90 m. Clearly, if the ferryboat sets off from P directly towards Q, it will not land at Q. It is left as an exercise for you to show that the speed of the ferryboat relative to the riverbanks is 2.2 m s-1 and that it will land at a point 60 m downstream of Q. You should also show that, in order to land at Q, the ferryboat should leave P heading upstream at an angle of 48° to the riverbank and that the time taken to cross to Q is about 67 s. For an aircraft, the rate of water flow becomes the wind speed and for a ship at sea it becomes a combination of wind, tide and current speeds. The idea of relative measurement has far reaching consequences as will become clear to those of you who choose to study Special Relativity (Option D or Option H).
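The first part of the ferryboat exercise, where the boat points straight across the river, can be checked with a few lines of vector arithmetic. The sketch below assumes the two velocity components are at right angles (ferry across the river, current along it), as the example describes; the variable names are illustrative.

```python
import math

FERRY_SPEED = 1.8   # m s-1, speed in still water, pointed straight across the river
RIVER_SPEED = 1.2   # m s-1, speed of the current parallel to the banks
RIVER_WIDTH = 90.0  # m, distance between P and Q

# The velocity relative to the banks is the vector sum of the two
# perpendicular components, so its magnitude follows from Pythagoras.
resultant_speed = math.hypot(FERRY_SPEED, RIVER_SPEED)

# Only the component across the river carries the ferry towards the far bank.
crossing_time = RIVER_WIDTH / FERRY_SPEED

# During the crossing the current carries the ferry downstream.
drift_downstream = RIVER_SPEED * crossing_time

print(f"speed relative to the banks: {resultant_speed:.1f} m s-1")
print(f"crossing time: {crossing_time:.0f} s")
print(f"distance downstream of Q: {drift_downstream:.0f} m")
```

The key design point is that the two perpendicular components are treated independently: the current changes where the ferry lands but not how long the crossing takes.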
Exercise 2.2 (a)
This exercise is designed to help you distinguish between the concepts of mass and weight. Here are six different hypothetical activities:
(a) You weigh yourself using bathroom scales.
(b) You determine the mass of an object by using a chemical beam balance.
(c) You determine the density of lead.
(d) You drop a brick on your foot.
(e) You trap your fingers in a car door.
(f) You design a suspension bridge.
Suppose that you were able to carry out these activities on the Moon. Discuss how the result of each activity would compare with the result when carried out on Earth.
Force
The word force crops up very often in everyday usage but what actually does a physicist mean by force? One thing is for certain, a physicist will not be able to tell you what a force is and in many respects the question has little meaning in physics. To a physicist a force is recognised by the effect or effects that it produces. A force is something that can cause an object to
• deform, i.e. change its shape
• speed up
• slow down
• change direction.
The last three of these can be summarised by stating that a force produces an acceleration. So if you were to see an object that is moving along in a straight line with constant speed suddenly change direction, you would know immediately that a force had acted on it even if you did not see anything tangible pushing on the object. This is not as daft as it sounds; just think of an object falling to the ground. The fact that free falling objects accelerate means that a force is acting on them. As we have seen in Section 2.2.1, this force is the force of gravity that the Earth exerts on the object and is the weight of the body. We shall discuss the gravitational force in more detail in Topic 6.1.

Since a force can produce an acceleration, it is clearly a vector quantity. Hence, if two or more forces act on a particle, to find the resultant acceleration we have to find the resultant force. We use the word particle here because if forces act on a body they can produce a deformation as well as an acceleration. If they act on a particle, which ideally has no physical dimensions, then they can only produce an acceleration.

In Figure 222 the blue arrows show the tension forces set up in the spring. There is a force that opposes the pulling force and a force equal in magnitude to this force is also exerted by the spring on the fixed support.
Figure 222 (labels: tension force, pulling force, fixed pole)
Spring forces
An interesting and very important type of force arises in connection with springs. A typical spring can be made by taking a length of wire and winding it round a pencil such that each successive turn (coil) is in contact with the previous turn. However, most school laboratories possess a variety of commercially available springs. If you hold one end of a spring in one hand and pull the other end then clearly to extend the spring you have to exert a force on the spring. If you don't hold one end of the spring, then when you pull, it will accelerate in the direction that you are pulling it. Holding one end and pulling the other produces a tension force in the spring. This is illustrated in Figure 221.
One thing that you will note as you pull the spring is that the further you extend the spring, the greater the force that you have to exert in order to extend it even further. You can investigate how the force required to extend the spring varies with the extension e of the spring by simply hanging weights of different values on the end of a vertically suspended spring. (The force of gravity measured in newtons that acts on a mass M measured in kilograms can be found to a very good approximation by multiplying M by 10.) The results for a typical spring, in which the force F is plotted against the extension e, are shown in Figure 223.
Figure 223 Graph of force against extension of a spring (F / N against e / m, with the point X marked)
Up to the point X the force F is directly proportional to e. Beyond this point the proportionality is lost. If the point X is passed, the spring can become permanently deformed in such a way that when the weights are removed the spring will not go back to its original length. In the region of proportionality we can write
$F = ke$
Figure 221 Forces on a spring (labels: pull, fixed pole)
where k is a constant whose value will depend on the particular spring. For this reason k is called the spring constant. This spring behaviour is a specific example of a more general rule known as Hooke's law of elasticity after the 17th century physicist Robert Hooke. For this reason the region of proportionality is often referred to as the Hookean region or elastic region and point X is called the
elastic limit. We can see that a spring can be calibrated to measure force and no doubt your physics laboratory has several so-called newton meters. However, you will discover that these newton meters do not provide a particularly reliable method of measuring force.
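To illustrate how a spring constant might be extracted from measurements in the region of proportionality, here is a minimal sketch. The masses and extensions are hypothetical readings invented for illustration, not data from this chapter, and the weight of each mass is taken as 10 N per kilogram as suggested in the text. The gradient is found by a least-squares fit of F = ke through the origin, which is one reasonable choice among several.

```python
G = 10.0  # N per kg, the approximation for the force of gravity used in the text

# Hypothetical readings: hanging masses in kg and the extensions they produce in m.
masses_kg = [0.10, 0.20, 0.30, 0.40, 0.50]
extensions_m = [0.020, 0.041, 0.059, 0.080, 0.101]

# Force applied to the spring by each hanging mass.
forces_n = [m * G for m in masses_kg]


def spring_constant(forces, extensions):
    """Least-squares gradient of F against e for a line through the origin (F = k e)."""
    numerator = sum(f * e for f, e in zip(forces, extensions))
    denominator = sum(e ** 2 for e in extensions)
    return numerator / denominator


k = spring_constant(forces_n, extensions_m)
print(f"spring constant k = {k:.1f} N m-1")
```

The fit is only meaningful for readings taken below the elastic limit; points beyond X would have to be excluded before calculating k.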
Exercise 2.2 (b)
1. The graph below shows how the length of a spring varies with applied force.
(Graph: applied force F / N against the length of the spring.)
i. State the value of the unstretched length of the spring.
ii. Use data from the graph to plot another graph of force against extension and from this graph determine the spring constant.

Fundamental forces
A discussion on forces would not be complete without some reference to the so-called fundamental forces or interactions as physicists prefer to call them. Although at first sight there seems to be a bewildering number of different types of force (pushes, pulls, friction, electrical, magnetic etc.), physicists now recognise that all the different forces arise from just four fundamental forces.

Gravitational force
The weakest of the four is the gravitational force. As we have seen, this is the force that gives rise to the weight of an object. However, as mentioned, we also understand this force to act between all particles in the Universe.

Weak-interaction
A force which is about $10^{26}$ times stronger than gravity is the so-called weak-interaction. This is the interaction which is responsible for certain aspects of the radioactive decay of nuclei.

Electromagnetic interaction
The electromagnetic interaction is some $10^{37}$ times stronger than gravity and this is the force that exists between particles as a consequence of the electrical charge that they carry.

All four of these interactions are discussed in more detail throughout this book. However, a simple way of looking at them is to think of the gravitational force as being the force that accounts for planetary motion and the way that galaxies are put together. The electromagnetic interaction is the force that accounts for the way in which the electrons are arranged in atoms and as such is the force that accounts for all chemical and biological processes. The strong interaction accounts for the nuclear structure of the atom and the weak interaction accounts for how the nucleus comes apart. Physicists would like to unify all these forces into just one force. That is, they would like to find that all the interactions were just special cases of one fundamental interaction. There has been some success in unifying the weak and the electromagnetic interaction but that is the current situation. (See also Topic J.1)

Free-Body Diagrams
We have seen that we can represent the forces acting on a particle by lines with arrows, the lengths of which represent the relative magnitude of the forces. Such diagrams are a useful way to represent the forces acting on a body or particle and are called free-body diagrams. Figure 225 shows an object of mass M that is suspended vertically by a thread of negligible mass. It is then pulled to one side by a force of magnitude F and held in the position shown.
Figure 225 Forces acting on an object (labels: F, Mass = M)
The free-body diagram for the forces acting on the object in Figure 225 is shown in Figure 226.
Figure 226 (labels: T, F, Weight = Mg and the construction lines A and B)
The weight of the object is Mg and the magnitude of the tension in the thread is T. The object is in equilibrium (see 2.2.6) and so the net force acting on it is zero. This means that the sum of the vertical components of the forces must be zero, as must the sum of the horizontal components. Therefore, in the diagram the line A is equal in length to the arrow representing the force F and the line B is equal in length to the arrow representing the weight. The tension T, the resultant of A and B, is found by using the dotted line constructions. When producing a free-body diagram, there is no need to show these constructions. However, as well as being in the appropriate directions, the lengths of the arrows representing the forces should be approximately proportional to the magnitudes of the forces. The following example is left as an exercise for you.

Exercise 2.2 (c)
Draw a free-body force diagram of the forces acting on an aircraft which is flying horizontally with constant velocity.

Example
Two forces act on particle P as shown in the Figure below. (N stands for newton and is the SI unit of force as we shall see in the next section.) Determine the magnitude of the net force acting in the horizontal direction and the magnitude of the net force acting in the vertical direction and hence determine the resultant force acting on P.
(Figure: forces of 4.0 N and 6.0 N acting on P; the angle marked is 30°.)

Solution
The component of the 4.0 N force in the horizontal direction is 4.0 cos 30° = 3.5 N. Hence the magnitude of the net force in the horizontal direction is 6.0 − 3.5 = 2.5 N.
The component of the 4.0 N force in the vertical direction is 4.0 sin 30° = 2.0 N and this is the magnitude of the force in the vertical direction.
(Figure: vector addition of the 2.5 N horizontal and 2.0 N vertical components, giving the resultant R.)
The Figure above shows the vector addition of the horizontal and vertical components. The resultant R has a magnitude $\sqrt{2.5^2 + 2.0^2} = 3.2$ N and makes an angle $\theta = \tan^{-1}(2.0/2.5) = 39^\circ$ with the horizontal.
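The resolution into components used in this example can be reproduced in a few lines. The geometry is an assumption read off from the numbers in the solution: the 6.0 N force is taken to act horizontally, and the 4.0 N force is taken to act at 30° to the horizontal with its horizontal component opposing the 6.0 N force. Because the code does not round the intermediate values, its angle comes out slightly below the 39° obtained in the text from the rounded components.

```python
import math

ANGLE_DEG = 30.0  # angle of the 4.0 N force to the horizontal (assumed geometry)
FORCE_A = 4.0     # N, the inclined force
FORCE_B = 6.0     # N, the horizontal force (assumed to oppose FORCE_A horizontally)

angle = math.radians(ANGLE_DEG)

# Resolve the inclined force into horizontal and vertical components.
a_horizontal = FORCE_A * math.cos(angle)
a_vertical = FORCE_A * math.sin(angle)

# Net force in each direction.
net_horizontal = FORCE_B - a_horizontal
net_vertical = a_vertical

# Combine the perpendicular components into the resultant.
resultant = math.hypot(net_horizontal, net_vertical)
direction_deg = math.degrees(math.atan2(net_vertical, net_horizontal))

print(f"net horizontal force: {net_horizontal:.1f} N")
print(f"net vertical force: {net_vertical:.1f} N")
print(f"resultant: {resultant:.1f} N at {direction_deg:.0f} degrees to the horizontal")
```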
2.2.4 State Newton's first law of motion. 2.2.5 Describe examples of Newton's first law. 2.2.6 State the condition for translational equilibrium. 2.2.7 Solve problems involving translational equilibrium.
IBO 2007
Figure 229 Aristotelian and Galilean forces on a ball (1: forward thrust and gravity acting on the ball; 2: gravity only acting on the ball)
When a girl throws a ball towards another girl standing some metres away from her, it is tempting to think, as Aristotle did, that there must be a forward thrust to keep the ball moving through the air as shown in 1. However, if air resistance is neglected, the only force acting on the ball is gravity as shown in 2.
Static equilibrium
Consider the simple case of a book resting on a table. Clearly gravity acts on the book and without the intervention of the table, the book would fall to the ground. The table therefore exerts an equal and opposite force on the book. This force we call the normal reaction. The forces acting on the book are shown in Figure 231.
Figure 231 (labels: N, normal reaction; W = N; surface of table)

Dynamic equilibrium
Now consider the case where the book, or any other object, is pulled along the surface of the table with constant velocity. Gravity and the normal reaction are still acting but there is now a frictional force acting which is equal in magnitude but opposite in direction to the pulling force. The force diagram for this situation is shown in Figure 232.
Figure 232

Suppose that we were to pull the object with a force applied to it as shown in Figure 233.
Figure 233
In this situation, it is quite likely that the book will not move along the table but actually rotate in the direction shown by the arrow in Figure 233. This again demonstrates that in fact we should apply Newton's laws of motion to particles. Unless the pulling force acts through what we call the centre of mass of the object then the pulling force and the frictional force can produce rotation. This is quite a subtle point. However, in many situations in this chapter we will refer to objects and bodies when strictly speaking we mean particles. There is in fact a branch of mechanics known as Rigid Body Mechanics which specifically deals with the mechanics of extended bodies rather than particles. This is not covered in the IB course. However, we can still get quite a long way with mechanics by considering bodies to act as particles.

2.2.8 State Newton's second law of motion. 2.2.9 Solve problems involving Newton's second law.
IBO 2007
directly proportional to the force acting and is in the same direction as the applied force. Furthermore the constant of proportionality that relates the two is the inertial mass of the particle. We can therefore write that
$F = ma$
This is Newton's second law in its simplest form. There are, however, many situations where the mass of the system does not remain constant e.g. a firework rocket, sand falling on to a conveyor belt etc. It is therefore helpful to express the law in a more general form. We can express the acceleration a in terms of the rate of change of velocity i.e.
$F = m\frac{\Delta v}{\Delta t}$
We now define a quantity called the linear momentum p of the object as
$p = mv$
such that we can now write Newton's second law in the form
$F = \frac{\Delta p}{\Delta t}$
We shall discuss the concept of momentum in more detail in Section 2.2.10.

Although it is possible to verify Newton's Second Law directly by experiment its real validity is understood in terms of the experimentally verifiable results that it predicts. In a sense this law is the whole of Classical Mechanics and tells us that if we pay attention to the forces then we can find the acceleration and if we know the acceleration then we know the future behaviour of the particle. Newton in essence says find the force law governing a system and you will be able to predict its behaviour. Unfortunately, it is not always possible to know the force law. Newton himself gave one, his famous law of gravitation, which we will look at in Topic 6.1. However, in situations such as the collision of two billiard balls we do not know the force acting, nor do we know the force acting between the millions of molecules of a gas, solid or liquid. In situations such as these we have to find some other means of solving the problem and we look at some of these methods in Sections 2.2.13 and 2.3.6.

We can use Newton's Second Law to understand the equivalence between inertial and gravitational mass. A simple argument shows this to be so.
We assume that the gravitational force $F_G$ that the Earth exerts on an object is proportional to the gravitational mass $m_G$ of the object. We can write this as
$F_G = K m_G$
where K is a constant. The acceleration g of the object is given by Newton's second law:
$F_G = K m_G = m_I g$
But experiment shows that g is a constant and has the same value for all objects. Hence, it follows from the above equation that $m_G = m_I$ with g = K.
If we have an independent definition of inertial mass, then we can use the second law to define a unit of force. The SI unit of force is the newton (N) and it is that force which produces an acceleration of 1 m s-2 in a mass of 1 kg. This is an absolute definition in that it does not depend on the properties of any material or any outside influence such as pressure and temperature. The second law also enables us to quantify the relationship between mass and weight. As has been previously stated the acceleration of free fall is the same for all objects. If its value is g and an object has a mass m then from the second law we see that the gravitational force exerted on it by the Earth has a value mg. Hence mg is the weight of the object. If we take g to have a value of 10 m s-2 then a mass of 1 kg will have a weight of 10 N close to the surface of the Earth. On the Moon where the acceleration of gravity is about 1.7 m s-2 a mass of 1 kg will have a weight of about 1.7 N. In the rest of this section we will look at some examples of the application of the second law and give some exercises. Hopefully, this will help you gain familiarity with its use. Remember that it is a very, very important law.
Example

A person of mass 70 kg is strapped into the front seat of a car, which is travelling at a speed of 30 m s$^{-1}$. The car brakes and comes to rest after travelling a distance of 180 m. Estimate the average force exerted on the person during the braking process.

Solution

We can use the equation $v^2 = u^2 + 2as$ to calculate the magnitude of the average acceleration of the car. In this instance we have that $u = 30$, $v = 0$ and $s = 180$, so that

$$0^2 = 30^2 + 2 \times 180 \times a$$

$$a = -\frac{900}{360} = -2.5 \text{ m s}^{-2}$$

The negative sign indicates that the car is slowing down. From Newton's second law the magnitude of the force is $F = ma = 70 \times 2.5 = 175$ N. Hence the average force on the person is 175 N (in the opposite direction to the direction of motion).

Example

The diagram below shows a block of wood of mass 1.0 kg attached via a pulley to a hanging weight of mass 0.50 kg. Assuming that there is no friction between the block and the bench and taking $g$ to be 10 m s$^{-2}$, calculate the acceleration of the system.

[Figure: a 1.0 kg block on a bench, connected over a pulley to a hanging 0.50 kg mass]

Solution

The force acting on the system is the weight of the hanging mass, which is $0.50 \times 10 = 5.0$ N. The total mass being accelerated is 1.5 kg. Using Newton's second law, $F = ma$, we have

$$5.0 = (1.5)\,a$$

Hence $a = 3.3$ m s$^{-2}$.

Exercise 2.2 (d)

1. A person stands on bathroom scales placed on the floor of a lift. When the lift is stationary the scales record a weight of 600 N. The person now presses the button for the 6th floor. During the journey to the 6th floor the scales read 680 N, then 600 N, then 500 N and finally 600 N. Explain these observations and calculate any accelerations that the lift might have during the journey.

2. Three forces act on a particle of mass 0.500 kg as shown in the figure. Calculate the acceleration of the particle.

[Figure: the forces acting on the particle, one of which is labelled 5.00 N]
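The car-braking example can be checked numerically. The sketch below assumes constant deceleration, as the worked solution does; the function name `braking_force` is our own and is not part of the text.

```python
def braking_force(mass_kg: float, u: float, s: float) -> float:
    """Average force on a body of mass mass_kg brought to rest from speed u
    over a distance s, assuming constant acceleration.

    Uses v^2 = u^2 + 2 a s with v = 0, then F = m a.
    A negative result means the force opposes the motion.
    """
    a = -u ** 2 / (2.0 * s)  # acceleration in m s^-2 (negative: slowing down)
    return mass_kg * a


force = braking_force(70.0, 30.0, 180.0)
print("Average force on the person / N:", force)
```

The sign of the result carries the same information as the phrase "in the opposite direction to the direction of motion" in the solution.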
2.2.10 Define linear momentum and impulse.
2.2.11 Determine the impulse due to a time-varying force by interpreting a force-time graph.
2.2.12 State the law of conservation of linear momentum.
2.2.13 Solve problems involving momentum and impulse.

(Students should be familiar with elastic and inelastic collisions and explosions. Knowledge of the coefficient of restitution is not required.)

© IBO 2007

Also, in terms of the momentum change we have seen that we can express the second law as

$$F = \frac{\Delta p}{\Delta t}$$

From $F = \dfrac{\Delta p}{\Delta t}$ we see that $F\,\Delta t = \Delta p$.

The term $F\,\Delta t$ is called the impulse of the force and it is a very useful concept in solving certain types of problem, particularly in situations where the force acts for a short time such as kicking a football. We also see by expressing the second law in this way that an equivalent unit for momentum is N s.
Figure 236 (graph of force / N against time / s)
Since the area under the graph is equal to the impulse we can calculate the speed with which the football leaves the foot. The area equals

$$\tfrac{1}{2}(50 \times 0.14) + (50 \times 0.14) + \tfrac{1}{2}(50 \times 0.08) = 13.5 \text{ N s}$$

Suppose that the mass of the football is 0.40 kg; then from $F\,\Delta t = \Delta p = m\,\Delta v$, we have that

$$\Delta v = \frac{13.5}{0.40} = 34$$
That is, the change in velocity is 34 m s$^{-1}$. In actuality one is much more likely to use the measurement of the speed of the football to estimate the average force that is exerted by the foot on the football. The time that the foot is in contact with the ball can be measured electronically and the speed of the football can be computed by measuring its time of flight.

Here is another example in which we use the ideas of impulse and the rate of change of momentum.
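The idea that impulse is the area under a force-time graph can be sketched with the trapezium rule. The data points below are hypothetical and are not read from Figure 236; the function name `impulse_from_graph` is also our own. Only the football mass of 0.40 kg is taken from the text.

```python
def impulse_from_graph(times, forces):
    """Area under a force-time graph (in N s), using the trapezium rule
    between successive sample points."""
    total = 0.0
    for i in range(1, len(times)):
        mean_force = 0.5 * (forces[i] + forces[i - 1])
        total += mean_force * (times[i] - times[i - 1])
    return total


# Hypothetical pulse (NOT the data of Figure 236): the force rises linearly
# to 40 N in 0.10 s, stays at 40 N for 0.10 s, then falls to zero in 0.10 s.
times = [0.00, 0.10, 0.20, 0.30]   # s
forces = [0.0, 40.0, 40.0, 0.0]    # N

impulse = impulse_from_graph(times, forces)        # N s
mass = 0.40                                        # kg, football mass from the text
delta_v = impulse / mass                           # change in velocity, m s^-1
average_force = impulse / (times[-1] - times[0])   # N

print("impulse / N s:", round(impulse, 3))
print("change in velocity / m s^-1:", round(delta_v, 3))
print("average force / N:", round(average_force, 3))
```

Run in reverse, the last line mirrors the remark above: if the speed and the contact time are measured, the average force follows from the change in momentum divided by the contact time.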
Example
Water is poured from a height of 0.50 m on to a top pan balance at the rate of 30 litres per minute. Estimate the reading on the scale of the balance.
Solution
We shall assume here that the water bounces off the top of the balance horizontally. Again we can calculate the speed with which the water hits the balance. From the equation $v^2 = 2as$ we have $v^2 = 2 \times 10 \times 0.50 = 10$. So, $v = 3.2$ m s$^{-1}$. Since 1 litre of water has a mass of 1 kg, the mass of water arriving at the balance per second is $30/60 = 0.50$ kg s$^{-1}$. The rate of change of momentum is therefore $0.50 \times 3.2 = 1.6$ N. If the balance is calibrated in grams the reading will therefore be about 160 g.

Can you think of an example of where the theory of impulse has been used to good effect? One example is the reason why crash barriers between the opposite lanes on a highway are made of crumpling material. The crumpling means that on collision, a car will come to rest in a greater time. The change in momentum of the car would be the same as if the barrier were made of concrete but the $F$ in $F\,\Delta t$ is now smaller since $\Delta t$ is larger.
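The top pan balance estimate can be reproduced step by step. This is a sketch under the same assumptions as the worked solution (the water loses all of its vertical momentum, $g = 10$ m s$^{-2}$, and 1 litre of water has a mass of 1 kg); the function name `balance_reading_grams` is our own.

```python
import math


def balance_reading_grams(height_m, litres_per_minute, g=10.0):
    """Estimate the reading (in grams) of a balance struck by falling water.

    Assumes the water leaves the pan horizontally, so the vertical momentum
    lost per second equals (mass per second) x (impact speed).
    """
    speed = math.sqrt(2.0 * g * height_m)   # impact speed from v^2 = 2 g h
    mass_rate = litres_per_minute / 60.0    # kg s^-1, taking 1 litre = 1 kg
    force = mass_rate * speed               # rate of change of momentum, N
    return 1000.0 * force / g               # convert the force to a "mass" in g


reading = balance_reading_grams(0.50, 30.0)
print("Estimated balance reading / g:", round(reading))
```

Without the intermediate rounding of the speed to 3.2 m s$^{-1}$, the estimate comes out slightly below the 160 g quoted in the solution, which is well within the accuracy of the assumptions.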
Figure 237 The conveyor belt problem
[Figure: sand falls vertically on to a horizontal conveyor belt moving at 2.0 m s$^{-1}$]

Suppose that sand is poured vertically at a constant rate of 400 kg s$^{-1}$ on to a horizontal conveyor belt that is moving with a constant speed of 2.0 m s$^{-1}$ (see Figure 237). We wish to find the minimum power required to keep the conveyor belt moving with constant speed. In every second the horizontal momentum of the sand changes by $400 \times 2.0$ kg m s$^{-1}$. This means that the rate of change of momentum of the sand is 800 kg m s$^{-2}$. The force exerted on the conveyor belt by the sand is therefore 800 N. This force is the frictional force between the sand and the conveyor belt and it is this force which accelerates the sand to the speed of the conveyor belt. The power needed to keep the conveyor belt moving at this speed is therefore this force multiplied by the speed of the belt, i.e. the power $P = Fv = 1600$ W (see below). We can also work out the rate of change of kinetic energy of the sand: the change in KE every second is 800 J s$^{-1}$ (see below). This is quite interesting since we see that, whatever the nature of the sand or the belt, we always lose half the power in dissipation by the frictional force.

Example

Estimate the force exerted on a man who jumps off a wall of height 2.0 m and lands in soft earth. Explain why he would be likely to hurt himself if he landed on concrete.

Solution

We estimate the mass of the man as 70 kg and the time that he takes to come to rest on landing on the earth to be 2.0 s. We can find the speed with which he hits the ground from $v^2 = 2gh$, i.e.

$$v = 6.3 \text{ m s}^{-1}$$

His change in momentum on coming to rest is therefore $70 \times 6.3$ N s. This is equal to the impulse $F\,\Delta t$. Hence we see that

$$F = \frac{70 \times 6.3}{2.0} = 220 \text{ N}$$

Therefore total force = 220 + 700 = 920 N.

Admittedly this problem could have been solved by computing the acceleration of the man on landing and coming to rest from the equation $v = at$. However, this involves another step and is not as elegant a solution since it doesn't really get to the physics. If he were to land on concrete then he would come to rest much more quickly. However his change in momentum would be the same, hence the $F$ in the impulse $F\,\Delta t$ would be much greater.

Along with the law of conservation of energy (see 2.3.6) (strictly speaking mass-energy, see topic 7.3.4), the law of conservation of momentum is of great importance in Physics. Although Newton's laws are found not to work when applied to atoms and molecules and are also modified by Relativity theory, the law of conservation of momentum still stands. (If you were to invent a new theory, no matter how elegant the theory, if it violates conservation of energy and momentum then you can forget it.)

The beauty of a law such as the conservation of momentum is that we are able to predict an outcome without knowing the intricacies of what actually is going on. When two billiard balls collide, the forces that act during collision are very complicated and we have no idea of their spatial and time dependence. However, because we know that the forces are equal and opposite we are able to predict the outcome of the collision.

Figures 238-241 on the next page show some examples of collisions and their possible outcomes. (Remember, momentum is a vector quantity.) The outcome of a collision will depend on the mass of each particle, their initial velocities and also how much energy is lost in the collision. However, whatever the outcome, momentum will always be conserved. Any predicted outcome which violates the conservation of linear momentum will not be accepted.
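The conveyor belt problem discussed above lends itself to a short numerical check. The sketch below uses the figures given in the text (400 kg s$^{-1}$ and 2.0 m s$^{-1}$); the function name `conveyor_power` is our own.

```python
def conveyor_power(mass_rate, belt_speed):
    """Return (force, power_in, ke_rate) for sand falling vertically on to a
    belt moving horizontally at constant speed.

    force    : rate of change of horizontal momentum of the sand, in N
    power_in : power needed to keep the belt moving, P = F v, in W
    ke_rate  : rate at which the sand gains kinetic energy, in W
    """
    force = mass_rate * belt_speed
    power_in = force * belt_speed
    ke_rate = 0.5 * mass_rate * belt_speed ** 2
    return force, power_in, ke_rate


force, power_in, ke_rate = conveyor_power(400.0, 2.0)
print("force / N:", force)
print("power supplied / W:", power_in)
print("rate of gain of KE / W:", ke_rate)
print("fraction dissipated:", (power_in - ke_rate) / power_in)
```

Because the power supplied is $\dot{m}v^2$ and the kinetic energy gained per second is $\tfrac{1}{2}\dot{m}v^2$, the dissipated fraction is one half for any choice of mass rate and belt speed, which is the point made in the text.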
Exercise 2.2 (e)

1. A bullet of mass 9.0 g leaves the barrel of a rifle with a speed of $8.0 \times 10^2$ m s$^{-1}$. The mass of the rifle is 1.8 kg. If the rifle is free to move, calculate the speed with which it recoils.

2. Sarah is standing on the horizontal surface of a frozen pond. She throws a ball of mass 250 g. The ball leaves her hand with a horizontal speed of 8.0 m s$^{-1}$. As a result, Sarah moves with an initial speed of 5.0 cm s$^{-1}$. Estimate Sarah's mass.

3. In a particular thunderstorm, the hailstones and raindrops have the same mass and terminal velocity. Explain, with reference to Newton's Second Law, why the hailstones hurt more than raindrops when they hit you.

4. A football of mass 0.46 kg attains a speed of 7.7 m s$^{-1}$ when kicked. The toe of the football boot with which it is kicked is in contact with the ball for 0.26 s. Calculate
(a) the impulse given to the ball
(b) the average force exerted on the ball.
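In each case below the symbols denote speeds and the direction of $u_1$ is taken as positive.

Case 1: both particles move in the same direction before and after the collision.

Figure 238 Case 1

$$m_1 u_1 + m_2 u_2 = m_1 v_1 + m_2 v_2$$

Case 2: the particles approach each other head-on and move apart after the collision.

Figure 239 Case 2

$$m_1 u_1 - m_2 u_2 = -m_1 v_1 + m_2 v_2$$

Case 3: the particles approach each other head-on and $m_1$ is at rest after the collision ($v_1 = 0$).

Figure 240 Case 3

$$m_1 u_1 - m_2 u_2 = m_2 v_2$$

Case 4: the particles move in the same direction and are coupled together after the collision.

Figure 241 Case 4

$$m_1 u_1 + m_2 u_2 = (m_1 + m_2)\,v$$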
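Case 4, in which the two bodies couple together, is the simplest to compute because there is only one unknown. The sketch below uses hypothetical masses and velocities chosen only for illustration, and the function names are our own. Velocities are signed, so a body moving in the opposite direction is given a negative value.

```python
def coupled_velocity(m1, u1, m2, u2):
    """Common velocity after two bodies collide and stick together,
    from m1*u1 + m2*u2 = (m1 + m2)*v. Velocities are signed."""
    return (m1 * u1 + m2 * u2) / (m1 + m2)


def kinetic_energy(m, v):
    """Kinetic energy (1/2) m v^2 in joules."""
    return 0.5 * m * v ** 2


# Hypothetical values: a 2.0 kg body moving at +3.0 m/s hits a 1.0 kg body at rest.
m1, u1, m2, u2 = 2.0, 3.0, 1.0, 0.0
v = coupled_velocity(m1, u1, m2, u2)

ke_before = kinetic_energy(m1, u1) + kinetic_energy(m2, u2)
ke_after = kinetic_energy(m1 + m2, v)

print("common velocity / m s^-1:", v)
print("momentum before / kg m s^-1:", m1 * u1 + m2 * u2)
print("momentum after / kg m s^-1:", (m1 + m2) * v)
print("kinetic energy lost / J:", ke_before - ke_after)
```

Momentum is the same before and after, while some kinetic energy is lost, which is what distinguishes this kind of collision from an elastic one.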
Figure 242 The forces acting when a book rests on a table
[Figure: a book resting on a table, showing the forces X and Y acting between the book and the Earth and the forces A and B acting between the book and the table]

The table is actually incidental to the action of these forces. The two forces are the equal and opposite pair referred to in the third law such that

$$X + Y = 0$$

If the table is not there, the book falls towards the Earth and the Earth falls towards the book. Both the book and the Earth will accelerate in accordance with Newton's second law. However, considering that the Earth is some $10^{26}$ times more massive than the book, we do not observe the acceleration of the Earth.

The force A is the force that the table exerts on the book. The force B is the force that the book exerts on the table. These two forces are again an equal and opposite pair of forces referred to in the third law such that

$$A + B = 0$$

So what is the origin of these two forces? They actually arise from the interaction forces between the molecules of the book and the molecules of the table. The interaction force is complex but essentially between any two molecules there is either a force of repulsion or a force of attraction. Which force operates depends on the separation of the molecules and, in equilibrium, the two molecules will take up a separation at which the repulsion force balances the attraction force. If we push the molecules closer together the repulsion force becomes greater than the attraction force and if the push is released the molecules will move back to the equilibrium position. It is this repulsion force which stops you falling through the floor. If we pull the molecules further apart then the attraction force becomes greater than the repulsion force and if we release the pull, the molecules will move back to the equilibrium position. The attraction force is why you need to apply a force to stretch a spring and the repulsion force is why you need to apply a force to compress a spring.
The three situations are illustrated in Figure 243.

Figure 243

The magnitude of both the attraction and repulsion forces depends on the separation of the molecules. The repulsion force increases very rapidly as the molecules get closer together and this is why it requires a much greater force to compress certain solids than it does to extend them. So we see that the table can still break without violating the third law. When the table breaks we just have a re-arrangement of its molecular structure.

When students first encounter the third law they sometimes say things like "if, when a force is exerted on an object there is an equal and opposite reaction, how can anything ever move? Equal and opposite forces must mean the net force on everything is always zero." Think of the Earth and the book. When the table is not present, there is a net force on the book and it accelerates in accordance with Newton's second law. It is the net force on the system of the Earth and the book that is zero. Similarly if we think of a horse pulling a cart, the acceleration of the cart depends not on the forces that it might exert on something else, but on the forces that are exerted on it.

We have gone into the third law in some detail because it is an area of physics which is often misunderstood by students. Basically, remember to be careful when identifying the system in which the equal and opposite forces appear. Also remember that when computing the acceleration of a body, it is the forces that act on the body that are considered, not the forces that it exerts on other bodies.

It is the third law which enables physicists to give a logical definition of mass. This is something that you will not be expected to know for the examination and it is included for the inquisitive who are puzzled as to how two quantities, force and mass, can be defined from one equation, $F = ma$. The answer is that they are not. If a system consists of two isolated particles that exert equal and opposite forces on each other, then the ratio of their accelerations will be in the inverse ratio of their masses. One of the particles can be considered to be a standard mass and the acceleration of other particles interacting with this standard can be measured in order to determine their mass.

Exercise 2.2 (f)

In 1920, the Smithsonian published an original paper by Robert Goddard titled "A Method for Reaching Extreme Altitudes", in which Goddard included a small section stressing that rockets could be used to send payloads to the Moon. The next day, the New York Times wrote a scathing editorial denouncing his theories as folly. Goddard was ridiculed and made to look like a fool. The editorial stated that travel in space by rocket was impossible because a rocket would have no air to push against. Discuss, with reference to Newton's laws, why the editor of the New York Times was actually the fool.
idea and also note that getting something done usually involves a transfer of energy from one form to another. For example a car engine uses fuel in order to get the car moving and to keep it moving, you eat food in order to live, oil is burned in a power station to provide electrical energy and you can think of other examples.
Figure 244 Fuel used to lift a weight
[Figure: an engine lifts a weight $mg$ through a height $h$; the engine lifts a weight $2mg$ through a height $2h$]

In Figure 244 we imagine a situation in which we are using fuel in an engine whose function is to lift a weight $mg$ to a certain height. In the first diagram the engine lifts the weight to a height $h$ and in the second diagram the engine lifts a weight $2mg$ to a height $2h$. In the first situation let us suppose that the engine requires 1 dm$^3$ of fuel to complete the task. Then in the second situation we would guess that the engine would require 4 dm$^3$ of fuel. The amount of fuel used is a measure of the energy that is transferred and to complete the tasks we say that the engine has done work. The amount of work that is done will be a measure of the energy that has been transferred in the performance of the tasks.

Clearly the engine has done work against a force, in this situation the force of gravity. If we double the force, i.e. the weight, then the work done is doubled and if we double the distance through which the weight is moved then the work done is also doubled. If we double both together, then the work done is quadrupled. It seems that we can define work as:

Work = force × distance moved

However, we have to be careful in using this definition. Consider the situation below in which the engine lifts the weight up a slope to height $h$.

[Figure: the engine pulls the weight $mg$ a distance $s$ up a slope inclined at angle $\theta$, raising it through a height $h$]

We shall assume that the surface of the slope is frictionless. The work $W$ done by the engine using the above definition is

$$W = \text{force} \times s$$

where $s$ is the distance up the slope. The force this time is the component of the weight down the slope. Hence,

$$W = mg \sin\theta \times s$$

But $\sin\theta = \dfrac{h}{s}$, so that $s = \dfrac{h}{\sin\theta}$, meaning that

$$W = mg \sin\theta \times s = mg \sin\theta \times \frac{h}{\sin\theta} = mgh$$

That is, $W = mgh$.

This is just the amount of work that the engine would have performed if it had lifted the weight directly. Our definition of work therefore becomes

Work = magnitude of the force × displacement in the direction of the force

which is the same as

Work = magnitude of the component of the force in the direction moved × the distance moved

The SI unit of work is the newton metre and is called the joule, named after the 19th Century physicist James Prescott Joule.
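The result $W = mgh$ for the frictionless slope can be illustrated numerically by trying several slope angles. The weight and height used here are hypothetical values chosen for the illustration, and the function name `work_up_slope` is our own.

```python
import math


def work_up_slope(weight_n, height_m, angle_deg):
    """Work done against gravity in pulling a weight up a frictionless slope
    to a given vertical height: (component of weight along slope) x (distance)."""
    theta = math.radians(angle_deg)
    distance = height_m / math.sin(theta)             # s = h / sin(theta)
    force_along_slope = weight_n * math.sin(theta)    # mg sin(theta)
    return force_along_slope * distance


# Hypothetical values: a weight of 100 N raised through 5.0 m.
for angle in (10, 30, 60, 90):
    print(angle, "degrees:", round(work_up_slope(100.0, 5.0, angle), 6), "J")
```

A shallower slope needs a smaller force but a longer distance, and the product is the same for every angle.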
Figure 246 (a) and (b) Force vs displacement graphs

The area under each of the graphs is clearly equal to the work done. In Figure 246 (a) when the force $F$ undergoes a displacement $d$ the work done is $Fd$. In Figure 246 (b) when the force $F$ produces an extension $s$ then the work done is $\tfrac{1}{2}Fs$.

Example

A force of 100 N pulls a box of weight 200 N along a smooth horizontal surface as shown in the figure below.

[Figure: the 100 N force acts on the box at 45° to the horizontal]

Calculate the work done by the force
(a) in moving the box a distance of 25 m along the horizontal
(b) against gravity.

Solution

(a) The component of the force along the direction of motion, i.e., the horizontal component, $F_h$, can be determined by using the fact that

$$\cos 45^\circ = \frac{F_h}{100}$$

$$F_h = 100 \cos 45^\circ = 71 \text{ N}$$

That is, the component of the force along the direction of motion is 71 N. Therefore the work done is $F \times s = 71 \times 25 \approx 1780$. That is, the work done is 1780 J.

(b) There is no displacement by the force in the direction of gravity. Hence the work done by the force against gravity is zero.

Suppose that a constant frictional force of 50 N acts on the box. How much work is done against friction? Again, using the fact that $W$ = force × distance, we have that the work done is simply $50 \times 25 = 1250$ J.

Notice that in the above example we use the expressions "work done by" and "work done against". This occurs over and over again in physics. For example the engine lifting the weight does work against gravity. However, if
the weight is allowed to fall, the work is done by gravity. Strictly speaking we have a sign convention for work. The convention is that

Work done on a system is negative.
Work done by a system is positive.

In the example above, the box can be identified as the system. Although this convention is not always important in many mechanics situations, it is of great importance when we come to consider the relationship between heat and work (Chapter 10) and in the relation between field strength and potential (Chapter 9).

Exercise 2.2 (g)

A man drags a sack of flour of mass 50 kg at constant speed up an inclined plane to a height of 5.0 m. The plane makes an angle of 30° with the horizontal and a constant frictional force of 200 N acts on the sack down the plane. Calculate the work that the man does against:
(a) friction.
(b) gravity.

2.3.4 Outline what is meant by kinetic energy.
2.3.5 Outline what is meant by change in gravitational potential energy.
2.3.6 State the principle of conservation of energy.
2.3.7 List different forms of energy and describe examples of the transformation of energy from one form to another.
2.3.8 Distinguish between elastic and inelastic collisions.

© IBO 2007

So how do we find out just how much work a moving object is capable of doing? In the diagram below, a force $F$ moves an object of mass $m$ a distance $d$ along a horizontal surface. There is no friction between the object and the surface.

Figure 249 An example of kinetic energy
[Figure: a force $F$ moves an object of mass $m$ through a distance $d$]

We will assume that the object starts from rest. The force $F$ will accelerate the object in accordance with Newton's Second Law and the magnitude of the acceleration will be given by using the formula $F = ma$, from which we obtain $a = \dfrac{F}{m}$. We can use the equation $v^2 = u^2 + 2as$ to find an expression for the speed of the object after it has moved a distance $d$ ($= s$). That is,

$$v^2 = 0^2 + 2\left(\frac{F}{m}\right)d \quad\Rightarrow\quad v^2 = \frac{2Fd}{m}$$

from which

$$Fd = \tfrac{1}{2}mv^2$$

The work done ($Fd$) by the force $F$ in moving the object a distance $d$ in the direction of $F$ is now, therefore, expressed in terms of the properties of the body and its motion. The quantity $\tfrac{1}{2}mv^2$ is called the kinetic energy (KE) of the body. This is the energy that a body possesses by virtue of its motion. The kinetic energy of a body essentially tells us how much work the body is capable of doing. We denote the kinetic energy by $E_K$, so that

$$E_K = \tfrac{1}{2}mv^2$$

A very useful relationship exists between kinetic energy and momentum. We have that $p = mv$ such that $p^2 = m^2v^2$. Hence substituting for $v^2$, we have

$$E_K = \frac{p^2}{2m} \quad\text{and}\quad \frac{p^2}{m} = mv^2 = 2E_K$$
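The relationship between kinetic energy and momentum, $E_K = p^2/2m$, can be checked against the direct definitions $p = mv$ and $E_K = \tfrac{1}{2}mv^2$. The mass and speed below are hypothetical illustration values, and the function names are our own.

```python
import math


def kinetic_energy_from_momentum(p, m):
    """E_K = p^2 / (2 m)."""
    return p ** 2 / (2.0 * m)


def momentum_from_kinetic_energy(e_k, m):
    """Magnitude of the momentum, p = sqrt(2 m E_K)."""
    return math.sqrt(2.0 * m * e_k)


# Hypothetical values: a 2.0 kg body moving at 3.0 m/s.
m, v = 2.0, 3.0
p = m * v                 # momentum from its definition
e_k = 0.5 * m * v ** 2    # kinetic energy from its definition

print("E_K from definition / J:", e_k)
print("E_K from p^2/2m / J:", kinetic_energy_from_momentum(p, m))
print("p recovered from E_K / kg m s^-1:", momentum_from_kinetic_energy(e_k, m))
```

The second function returns only the magnitude of the momentum, since kinetic energy carries no information about direction.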
To demonstrate the so-called principle of energy conservation we will solve a dynamics problem in two different ways, one using the principle of energy conservation and the other using Newton's laws and the kinematics equations.
Example
An object of mass 4.0 kg slides from rest without friction down an inclined plane. The plane makes an angle of 30° with the horizontal and the object starts from a vertical height of 0.50 m. Determine the speed of the object when it reaches the bottom of the plane.
Solution
[Figure: the object at point A at the top of the 30° incline, where $v = 0$ and $h = 0.50$ m, with its weight $mg$ and the reaction force shown; at point B at the base of the incline $v = V$ and $h = 0$]

We set the zero level at point B, the base of the incline, so that $h = 0$ there and so that at point A, $h = 0.50$ m. At A, the object has no kinetic energy ($v = 0$) and at point B, the object has gained kinetic energy ($v = V$).

Method 1: Kinematic solution

The force down the plane is given by $mg \sin\theta = 20$ N. Using Newton's 2nd law ($F = ma$) gives the acceleration

$$a = \frac{20}{4.0} = 5.0 \text{ m s}^{-2}$$

(Note that we could have determined the acceleration by writing down the component of $g$ down the plane.)

Using $V^2 = u^2 + 2as$ with $u = 0$ and $s$ (the distance down the plane) given by

$$s = \frac{0.50}{\sin\theta} = \frac{0.50}{\sin 30^\circ} = 1.0 \text{ m}$$

we have

$$V^2 = 0^2 + 2 \times 5 \times 1 \quad\Rightarrow\quad V = \sqrt{10} = 3.2 \text{ m s}^{-1}$$
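The kinematic method above can be sketched as a short function, which also makes it easy to try other slope angles. The function name `speed_at_bottom` is our own, and $g = 10$ m s$^{-2}$ is assumed as in the worked solution.

```python
import math


def speed_at_bottom(height_m, angle_deg, g=10.0, u=0.0):
    """Speed at the bottom of a frictionless incline, found kinematically.

    a = g sin(theta) is the acceleration down the plane,
    s = h / sin(theta) is the distance travelled,
    and v^2 = u^2 + 2 a s gives the final speed.
    """
    theta = math.radians(angle_deg)
    a = g * math.sin(theta)
    s = height_m / math.sin(theta)
    return math.sqrt(u ** 2 + 2.0 * a * s)


v30 = speed_at_bottom(0.50, 30.0)
v60 = speed_at_bottom(0.50, 60.0)
print("speed for a 30 degree slope / m s^-1:", round(v30, 2))
print("speed for a 60 degree slope / m s^-1:", round(v60, 2))
```

The mass never enters the calculation, and the factors of $\sin\theta$ cancel, so the final speed depends only on $g$ and the vertical height.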
Method 2: Energy Principle

As the object slides down the plane its potential energy becomes transformed into kinetic energy. If we assume that no energy is lost we can write

$$\text{change in PE} = mgh = \text{gain in KE} = \tfrac{1}{2}mv^2$$

So that

$$\tfrac{1}{2}mV^2 = mgh \quad\Rightarrow\quad V = \sqrt{2gh}$$

Using the values of $g$ and $h$, we have that $V = 3.2$ m s$^{-1}$. That is, the object reaches a speed of 3.2 m s$^{-1}$.

Note that the mass of the object does not come into the question, nor does the distance travelled down the plane. When using the energy principle we are only concerned with the initial and final conditions and not with what happens in between. If you go on to study physics in more depth you will find that this fact is of enormous importance.

The second solution involves making the assumption that potential energy is transformed into kinetic energy and that no energy is lost. This is the so-called energy principle: it means that energy is conserved. Clearly in this example it is much quicker to use the energy principle. This is often the case with many problems and in fact with some problems the solution can only be achieved using energy considerations.

What happens if friction acts in the above example? Suppose a constant frictional force of 16 N acts on the object as it slides down the plane. Now, even using the energy principle, we need the distance down the plane so we can calculate the work done against friction. This is $16 \times 1.0 = 16$ J. The work done by gravity, i.e. the change in PE, is 20 J. The total work done on the object is therefore $20 - 16 = 4.0$ J. Hence the speed is now given by

$$\tfrac{1}{2}mv^2 = 4.0$$

to give $v = 1.4$ m s$^{-1}$.

So in this problem, not all the work done has gone into accelerating the object. We say that the frictional force has dissipated energy. If we are to retain the idea of energy conservation then we must account for this lost energy.

It was the great triumph of some late eighteenth and early nineteenth century physicists and engineers to recognise that this lost energy is transformed into thermal energy. If you rub your finger along the top of a table you will definitely feel it getting warm. This is where the lost energy has gone. It has in fact been used to make the molecules of the table and the molecules of your finger vibrate more vigorously.

Another thing to notice is that this energy is lost in the sense that we can't get it back to do useful work. If there were no friction between the object and the surface of the plane then, when it reached the bottom of the plane, work could be done to take it up to the top of the plane and this cycle could go on indefinitely. (We could actually set up the arrangement such that the object's KE at the bottom of the plane could be used to get it back to the same height again.) It is what we call a reversible process. The presence of friction stops this. If the object is dragged back up the plane you won't get the energy back that has been lost due to friction, you will just lose more energy. This is an irreversible process. We can now start to glimpse why, even though it is impossible to destroy energy, it is possible to run out of useful energy. Energy becomes, as we say, degraded.

The general principle of energy conservation finds its formulation in the First Law of Thermodynamics and the consequences of this law and the idea of energy degradation and its implication on World energy sources is discussed in much more detail in Chapter 8.

Thermal energy

This is essentially the kinetic energy of atoms and molecules. It is sometimes incorrectly referred to as heat. The term heat actually refers to a transfer of energy between systems.
Chemical energy

This is energy that is associated with the electronic structure of atoms and is therefore associated with the electromagnetic force. An example of this is combustion in which carbon combines with oxygen to release thermal energy, light energy and sound energy.

Nuclear energy

This is the energy that is associated with the nuclear structure of atoms and is therefore associated with the strong nuclear force. An example of this is the splitting of nuclei of uranium by neutrons to produce energy.

Electrical energy

This is energy that is usually associated with an electric current and is sometimes referred to incorrectly as electricity. For example the thermal energy from a chemical reaction (chemical energy) can be used to boil water and produce steam. The kinetic energy of the molecules of steam (thermal energy) can be used to rotate magnets and this rotation generates an electric current. The electric current transfers the energy to consumers where it is transformed into, for example, thermal and light energy (filament lamps) and kinetic energy (electric motors).

We shall learn later that these different forms of energy all fall into the category of either potential or kinetic energy and are all associated with one or other of the fundamental forces.

Energy can be transformed from one form into another and as far as we know energy can never be created nor can it be destroyed. This is perhaps one of the most fundamental laws of nature and any new theories which might be proposed must always satisfy the principle of energy conservation. A simple example of the principle is, as we have seen, to be found in the transformation of gravitational potential energy into kinetic energy.

do approximate quite well to being elastic. The collision of two snooker (pool) balls is very nearly elastic, as is the collision between two steel ball bearings. An interesting situation arises when the balls are of the same mass and one is at rest before the collision and the collision takes place along a line joining their centres as shown in Figure 251.

Figure 251 Rolling balls
[Figure: before the collision the two balls, each of mass $m$, have velocities $u_1 = u$ and $u_2 = 0$; after the collision they have velocities $v_1 = v$ and $v_2 = V$]

Suppose that the speed of the moving ball is $u$ and that the respective speeds of the balls after collision are $v$ and $V$. If we now apply the laws of momentum and energy conservation we have

conservation of momentum:

$$mu = mv + mV$$

conservation of energy:

$$\tfrac{1}{2}mu^2 = \tfrac{1}{2}mv^2 + \tfrac{1}{2}mV^2$$

From which we see that $u = v + V$ and $u^2 = v^2 + V^2$.

The only solution to these equations in which a collision actually takes place is $u = V$ and $v = 0$ (the other solution, $v = u$ and $V = 0$, corresponds to the balls not colliding at all). This means that the moving ball comes to rest after collision and the ball that was at rest moves off with the speed that the moving ball had before collision. This situation is demonstrated in that well known toy, Newton's Cradle.
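The equal-mass result above is a special case of the standard expressions for a one-dimensional elastic collision between masses $m_1$ and $m_2$. Those general expressions are not derived in this text; they are quoted here as an assumption so that the equal-mass case can be checked, and the function name `elastic_collision` and the second set of values are our own.

```python
def elastic_collision(m1, u1, m2, u2):
    """Final velocities (v1, v2) after a head-on elastic collision.

    Standard result obtained by solving conservation of momentum and
    conservation of kinetic energy together. Velocities are signed.
    """
    total = m1 + m2
    v1 = ((m1 - m2) * u1 + 2.0 * m2 * u2) / total
    v2 = ((m2 - m1) * u2 + 2.0 * m1 * u1) / total
    return v1, v2


# Equal masses, second ball at rest: the situation of Figure 251.
v, V = elastic_collision(1.0, 2.0, 1.0, 0.0)
print("moving ball after collision / m s^-1:", v)
print("struck ball after collision / m s^-1:", V)

# Hypothetical unequal masses, to check that both quantities are conserved.
m1, u1, m2, u2 = 2.0, 3.0, 1.0, 0.0
v1, v2 = elastic_collision(m1, u1, m2, u2)
print("momentum before and after:", m1 * u1 + m2 * u2, m1 * v1 + m2 * v2)
print("KE before and after:",
      0.5 * m1 * u1 ** 2 + 0.5 * m2 * u2 ** 2,
      0.5 * m1 * v1 ** 2 + 0.5 * m2 * v2 ** 2)
```

For equal masses the moving ball stops and the struck ball takes over its velocity, which is the behaviour seen in Newton's Cradle.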
2.3.9 Define power.
2.3.10 Define and apply the concept of efficiency.
2.3.11 Solve problems involving momentum, work, energy and power.

© IBO 2007
2.3.9 POWER
Suppose that we have two machines A and B that are used in lifting objects. Machine A lifts an object of weight 100 N to a height of 5.0 m in a time of 10 s. When machine B is used to perform the same task it takes 0.1 s. Our instinct
tells us that machine B is more powerful than machine A. To quantify the concept of power we define power as

Power = the rate of working

i.e.,

$$\text{power} = \frac{\text{work}}{\text{time}}$$

The unit that is used to measure power is the joule per second which is called the watt (W), after the 18th Century Scottish engineer James Watt.

In the example above, of our two machines, A will have a power output of 50 W and machine B a power output of 5000 W.

In the following example some sort of engine is used to pull an object at a constant speed along the horizontal.

Figure 252
[Figure: an engine pulls an object with a force $F$ at speed $v$ against a frictional force]

The pulling force $F$ (which is the tension in the rope) produced by the engine will be equal to the frictional force between the object and the floor. Suppose that the engine moves the object a distance $\Delta s$ in time $\Delta t$. The work done against the frictional force (i.e., the work done by the engine) is:

$$\Delta W = F\,\Delta s$$

The power $P$ developed by the engine is therefore

$$P = \frac{\Delta W}{\Delta t} = F\,\frac{\Delta s}{\Delta t}$$

Since $\Delta s / \Delta t$ is the speed $v$ of the object, this gives $P = Fv$.

Example

A diesel locomotive is pulling a train at its maximum speed of 60 m s$^{-1}$. At this speed the power output of the engine is 3.0 MW. Calculate the tractive force exerted by the wheels on the track.

Solution

Using the formula $P = Fv$, we have that $F = \dfrac{P}{v}$. Hence,

$$F = \frac{3\,000\,000}{60} = 50 \text{ kN}$$

Our answer is at best an approximation since the situation is in fact much more complicated than at first glance. The train reaches a maximum speed because as its speed increases the frictional force due to air resistance also increases. Hence, at its maximum speed all the energy produced by the motors is used to overcome air resistance, energy lost by friction between wheels and track and friction between moving parts of the motors and connected parts.

Again, we shall see that the concept of power and its application is discussed in much more detail in Chapter 8 and elsewhere in the syllabus.

2.3.10 EFFICIENCY

(Machines and efficiencies are discussed in more detail in Chapters 8 and 10.)

The efficiency of an engine is defined as follows:

$$\text{Eff} = \frac{W_{OUT}}{W_{IN}} = \frac{P_{OUT}}{P_{IN}}$$

where $W_{OUT}$ is how much useful work the engine produces and $W_{IN}$ is how much work (energy) is delivered to the engine. The ratio of these two quantities is clearly the same as the ratio of the power output of the engine to its power input.

To understand the idea of efficiency, we will look at the following example.

Example

An engine with a power output of 1.2 kW drags an object of weight 1000 N at a constant speed up an inclined plane that makes an angle of 30° with the horizontal. A constant frictional force of 300 N acts between the object and the plane and the object is dragged a distance of 8.0 m. Determine the speed of the object and the efficiency of the engine.
Solution
CORE
Engine
The component of the object's weight down the plane is 1000 sin 30° = 500 N. The total force against which the machine does work is therefore 500 + 300 = 800 N.
The machine lifts the object to a height h, where h = 8.0 sin 30° = 4.0 m. The useful work that is done by the machine is therefore 1000 × 4.0 = 4000 J. The actual work that is done by the machine is 800 × 8.0 = 6400 J. The efficiency of the machine is the useful work done divided by the actual work done, which is 4000/6400 = 0.63 (63%).
1. Calculate the momentum of a particle of mass 0.06 kg that has a kinetic energy of 3.2 J.
2. Estimate the minimum take-off power of a grasshopper (cicada).
3. An elastic band of length 2d is attached to a horizontal board as shown in the diagram below.
[Diagram labels: original position of elastic band; 2d]
A margarine tub has some weights attached to the bottom of the inside of the tub such that the total mass of the weights and the tub is M. The tub is placed at the centre of the band and is pulled back until the tub makes an angle θ with the band as shown. The tub is then released such that it is projected down the runway for a distance s before coming to rest. The problem is to deduce an expression for the speed with which the tub leaves the band. The force constant of the elastic band is k.
4. A railway truck, B, of mass 2000 kg is at rest on a horizontal track. Another truck, A, of the same mass moving with a speed of 5.0 m s⁻¹ collides with the stationary truck and they link up and move off together. Determine the speed with which the two trucks move off and also the loss of kinetic energy on collision.
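Returning to the inclined-plane example earlier in this section, the arithmetic can be sketched in a few lines of code. The speed is not worked out in the text above, so the last step rests on an assumption: that the whole 1.2 kW output is delivered through the 800 N pulling force, so that P = F v. The function name `incline_example` is illustrative only.

```python
import math


def incline_example(weight_n, friction_n, angle_deg, distance_m, power_out_w):
    """Return (speed, efficiency) for an object dragged up a rough incline at constant speed."""
    angle = math.radians(angle_deg)
    # Force the engine must supply: weight component down the plane plus friction.
    pulling_force = weight_n * math.sin(angle) + friction_n
    # Useful work is the gain in gravitational potential energy (weight x height).
    useful_work = weight_n * distance_m * math.sin(angle)
    # Actual work is the pulling force times the distance moved along the plane.
    actual_work = pulling_force * distance_m
    efficiency = useful_work / actual_work
    # Assumption: all of the power output goes through the pulling force, P = F v.
    speed = power_out_w / pulling_force
    return speed, efficiency


speed, efficiency = incline_example(1000.0, 300.0, 30.0, 8.0, 1200.0)
print(f"speed = {speed:.2f} m/s, efficiency = {efficiency:.1%}")
```

Under that assumption the speed comes out at about 1.5 m s⁻¹, and the efficiency at 62.5%, which is 63% to two significant figures.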
Solutions
$v = 2\sqrt{\frac{k}{m}}\,d\left(\frac{1}{\sin\theta} - 1\right)$
1.
The above example can form the basis for a useful experiment in which you can investigate the factors that affect the distance that the margarine tub will travel.
[Diagram: trucks A and B before the collision; A and B moving together with speed V after the collision]
2.
Estimates
u1 = 5.0 u2 = 0
As the trucks couple after the collision, the conservation of momentum law states that:
mass of grasshopper = 2.0 × 10⁻³ kg
height to which it jumps = 0.50 m
time it takes to develop take-off power = 200 ms
Calculation
energy needed to reach 0.50 m = mgh = 2.0 × 10⁻³ × 10 × 0.50 = 10⁻² J
power = energy / time = 10⁻² / 0.20 = 0.05 W
3. In the following example we tie together the ideas of energy transformations and the work done by a non-constant force.
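$m_1 u_1 + m_2 u_2 = (m_1 + m_2)V$ where $m_1 = m_2 = 2000$ kg,
so that V = (2000 × 5.0) / 4000 = 2.5 m s⁻¹. The kinetic energy before the collision is ½ × 2000 × 5.0² = 25 000 J and after the collision it is ½ × 4000 × 2.5² = 12 500 J, so 12 500 J of kinetic energy is lost.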
We first deduce an expression for the extension of the elastic band. From the geometry of the situation, the stretched length is $l = \frac{d}{\sin\theta}$. The extension, e, is therefore
By lost energy we mean that the energy has been dissipated to the surroundings. Some of it will be converted into sound and most will heat up the coupling between the trucks.
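The truck collision can be checked numerically. The sketch below applies conservation of momentum to two bodies that couple together and then compares the kinetic energy before and after; the function name `coupled_collision` is illustrative, and the motion is assumed to be along a single straight line.

```python
def coupled_collision(m1, u1, m2, u2):
    """Return (common speed, kinetic energy lost) when two bodies link up on collision."""
    # Conservation of momentum: m1*u1 + m2*u2 = (m1 + m2) * V.
    common_speed = (m1 * u1 + m2 * u2) / (m1 + m2)
    ke_before = 0.5 * m1 * u1**2 + 0.5 * m2 * u2**2
    ke_after = 0.5 * (m1 + m2) * common_speed**2
    return common_speed, ke_before - ke_after


# Truck A (2000 kg, 5.0 m/s) runs into stationary truck B (2000 kg).
v, lost = coupled_collision(2000.0, 5.0, 2000.0, 0.0)
print(f"V = {v} m/s, kinetic energy lost = {lost} J")
```

For these figures the common speed is 2.5 m s⁻¹ and half of the initial kinetic energy (12 500 J of 25 000 J) is dissipated.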
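$e = 2(l - d) = 2d\left(\frac{1}{\sin\theta} - 1\right)$
and the elastic potential energy stored in the band is $E_{elas} = \frac{1}{2}ke^2$.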
If we assume that all the energy is transferred to the tub when the tub is released we have that the kinetic energy of the tub, Ek , is such that Ek = Eelas so that
$\frac{1}{2}mv^2 = 2kd^2\left(\frac{1}{\sin\theta} - 1\right)^2$
Exercise
2.3
1.
Suppose that in Example 4 (page 64), after the collision, truck A and truck B do not link but instead truck A moves with a speed of 1 m s⁻¹ in the same direction as prior to the collision. Determine
(i) the speed of truck B after collision
(ii) the kinetic energy lost on collision
2.
A man drags a sack of flour of mass 100 kg at constant speed up an inclined plane to a height of 6.0 m. The plane makes an angle of 30° with the horizontal and a constant frictional force of 250 N acts on the sack down the plane. Determine the efficiency of the inclined plane.
3.
A car of mass 1000 kg is parked on a level road with its handbrake on. Another car of mass 1500 kg travelling at 10 m s⁻¹ collides with the back of the stationary car. The two cars move together after the collision in the same straight line. They travel 25 m before finally coming to rest. Determine the average frictional force exerted on the cars as they come to rest.
Problems on banked motion (aircraft and vehicles going round banked tracks) will not be included.
Let us think of the example where you whirl an object tied to a string about your head with constant speed. Clearly the force that produces the circular motion, in this case in a horizontal plane, is the tension in the string. If the string were to snap then the object would fly off at a tangent to the circle. This is the direction of the velocity vector of the object. The tension in the string acts at right angles to this vector and this is the prerequisite for an object to move in a circle with constant speed. If a force acts at right angles to the direction of motion of an object then there is no component of force in the direction of motion and therefore no acceleration in the direction of motion. If the magnitude of the force is constant then the direction of the path that the object follows will change by equal amounts in equal time intervals; hence the overall path of motion must be a circle. Figure 256 shows the relation between the direction of the velocity vector and the force acting on a particle P moving with constant speed in a circular path.
Figure 257 Centripetal Acceleration
In Figure 257, suppose that the particle moves from P to Q in time Δt. In the absence of a centripetal force the particle would reach the point X in this time. The force therefore effectively causes the particle to fall a distance h. For intersecting chords of a circle we have in this situation
Figure 256 Centripetal Force
$d^2 + (r - h)^2 = r^2$, so that $d^2 = 2rh - h^2$
Now suppose that we consider a very small time interval; then h² will be very small compared to 2rh. Hence we can write
The force causing the circular motion is called the centripetal force. This force causes the particle to accelerate towards the centre of the circle, and this acceleration is called the centripetal acceleration. However, be careful to realise that the centripetal acceleration is always at right angles to the velocity of the particle. If the speed of the particle is reduced then it will spiral towards the centre of the circle, accelerating rapidly as it does so. This is in effect what happens as an orbiting satellite encounters the Earth's atmosphere.
People sometimes talk of a centrifugal force in connection with circular motion. They say something along the lines that when a car goes round a bend in the road you feel a force throwing you outwards and this force is the centrifugal force. But there is no such force. All that is happening is that you are moving in accordance with Newton's laws. Before the car entered the bend you were moving in a straight line and you still want to keep moving in a straight line. Fortunately the force exerted on you by the side of the car as you push up against it stops you moving in a straight line. Take the side away and you will continue moving in a straight line as the car turns the bend.
$2rh = d^2$
However, d is the horizontal distance travelled in time Δt, such that d = vΔt. Hence
$h = \frac{d^2}{2r} = \frac{(v\,\Delta t)^2}{2r}$
However, h is the distance fallen in time Δt. So using $s = \frac{1}{2}at^2$ we have
$h = \frac{1}{2}a(\Delta t)^2$
So that
$\frac{1}{2}a(\Delta t)^2 = \frac{v^2(\Delta t)^2}{2r}$
from which we have
$a = \frac{v^2}{r}$
Below are some examples of forces which provide centripetal forces:
1. Gravitational force. This is discussed in detail in Topic 6.1.
2. Frictional forces between the wheels of a vehicle and the ground.
3. Magnetic forces. This force is discussed in Topic 6.3.
Then of course we have the example of objects constrained to move in circles by strings or wires attached to the object.
Example
A geostationary satellite is one that orbits Earth in 24 hours. The orbital radius of the satellite is 4.2 × 10⁷ m. Calculate the acceleration of the satellite and state the direction of the acceleration. (24 hours = 8.64 × 10⁴ s)
Solution
Let r be the orbital radius and T the period. The speed v of the satellite is then given by
$v = \frac{d}{t} = \frac{2\pi r}{T}$
The acceleration a is therefore given by
$a = \frac{v^2}{r} = \frac{4\pi^2 r}{T^2} \approx \frac{40 \times 4.2 \times 10^7}{(8.64)^2 \times 10^8} = 0.23$ m s⁻²
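The satellite calculation can be repeated without rounding 4π² to 40. The sketch below is only a numerical check of a = v²/r with v = 2πr/T; the function name is illustrative.

```python
import math


def centripetal_acceleration_from_period(radius_m: float, period_s: float) -> float:
    """Return a = v^2 / r for uniform circular motion of given radius and period."""
    # Speed is one circumference per period.
    speed = 2 * math.pi * radius_m / period_s
    return speed**2 / radius_m


# Geostationary satellite: r = 4.2e7 m, T = 24 h = 8.64e4 s.
a = centripetal_acceleration_from_period(4.2e7, 8.64e4)
print(f"a = {a:.3f} m/s^2")
```

Using the full value of π gives roughly 0.22 m s⁻², slightly below the 0.23 m s⁻² obtained with the approximation 4π² ≈ 40. In either case the acceleration is directed towards the centre of the orbit, that is, towards the centre of the Earth.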
Example
A model airplane of mass 0.25 kg has a control wire of length 10.0 m attached to it. Whilst held in the hand of the controller it flies in a horizontal circle with a speed of 20 m s⁻¹. Calculate the tension in the wire.
Solution
$F = \frac{mv^2}{r}$
where m is the mass of the particle. It is important to realise that the above equation is effectively Newton's Second Law as applied to particles moving in a circle. In order for the particle to move in a circle, a force must act at right angles to the velocity vector of the particle and the speed of the particle must remain constant. This means that the magnitude of the force must also remain constant. The effect of the centripetal force is to produce an acceleration towards the centre of the circle. The magnitude of the particle's linear velocity and the magnitude of the force acting on it will determine the circular path that a particular particle describes.
The tension in the wire is equal to $\frac{mv^2}{r}$, which in this situation is equal to $\frac{0.25 \times 400}{10} = 10$ N.
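The same centripetal-force relation, F = mv²/r, can be wrapped in a small helper to check the model airplane figures. The function name `centripetal_force` is an illustrative choice, and the wire is assumed to be horizontal so that its tension supplies the whole centripetal force.

```python
def centripetal_force(mass_kg: float, speed_m_per_s: float, radius_m: float) -> float:
    """Return the force (N) needed to keep a mass moving in a circle at constant speed."""
    if radius_m <= 0:
        raise ValueError("radius must be positive")
    return mass_kg * speed_m_per_s**2 / radius_m


# Model airplane: 0.25 kg on a 10.0 m wire at 20 m/s.
tension = centripetal_force(0.25, 20.0, 10.0)
print(f"Tension = {tension} N")
```

Because the force depends on v², doubling the speed at the same radius would need four times the tension, while doubling the radius at the same speed would halve it.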
An interesting situation arises when we have circular motion in a vertical plane. Consider a situation in which you attach a length of string to an object of mass m and then whirl it in a vertical circle of radius r. If the speed of the object is v at its lowest point then the tension in the string at this point will be $mg + \frac{mv^2}{r}$, and at the highest point the tension will be $\frac{mv^2}{r} - mg$.
We now see why this is the case.
Lowest point:
Example
Figure 258 Lowest Point
Solution
The resultant force on a mass m throughout its motion in a circle (as long as the speed is constant) is always $\frac{mv^2}{r}$. Taking the positive direction to be towards the centre of the circle, at its lowest point the resultant force is provided by the expression T − mg, so that
$T - mg = \frac{mv^2}{r}$, which gives $T = mg + \frac{mv^2}{r}$.
Figure 259 Highest Point
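Again, we have that the resultant force on a mass m throughout its motion in a circle (as long as the speed is constant) is always $\frac{mv^2}{r}$. At the highest point both the tension and the weight act towards the centre of the circle, so that $T + mg = \frac{mv^2}{r}$, which gives $T = \frac{mv^2}{r} - mg$.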
$R = \frac{mv^2}{r} - mg$; however, as $R \geq 0$, we have that $\frac{mv^2}{r} - mg \geq 0$, that is, $\frac{mv^2}{r} \geq mg$. So the minimum speed will be given by $g = \frac{v^2}{r}$, since at any lower speed mg will be greater than $\frac{mv^2}{r}$ and the motorbike will leave the track.
So in this case the speed will be 14 m s⁻¹.
It is also worthwhile noting that in circular motion with constant speed, there is no change in kinetic energy. This is because the speed, v, is constant and so the expression for the kinetic energy, $\frac{1}{2}mv^2$, is also always constant (anywhere along its motion). Another way to look at this is that since the force acts at right angles to the velocity of the particle, no work is done on the particle by the force.
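For the wall of death example, the condition g = v²/r at the top of the loop gives a minimum speed of v = √(gr). The sketch below evaluates this; the function name and the default g = 9.8 m s⁻² are assumptions made for illustration (the text does not say which value of g was used).

```python
import math


def minimum_top_speed(radius_m: float, g: float = 9.8) -> float:
    """Return the least speed (m/s) at the top of a vertical circle for contact to be kept."""
    # At the minimum speed the reaction force is zero, so mg = m v^2 / r.
    return math.sqrt(g * radius_m)


print(f"v_min = {minimum_top_speed(20.0):.1f} m/s")          # g = 9.8 m/s^2
print(f"v_min = {minimum_top_speed(20.0, g=10.0):.1f} m/s")  # g = 10 m/s^2
```

Either choice of g gives 14 m s⁻¹ to two significant figures for a radius of 20 m, and the result does not depend on the mass of the rider.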
A wall of death motorcyclist rides his motorcycle in a vertical circle of radius 20 m. Calculate the minimum speed that he must have at the top of the circle in order to complete the loop.
7. A sphere of mass m, attached to an inextensible string as shown in the diagram, is released from rest at an angle θ with the vertical. When the sphere passes through its lowest point, show that the tension in the string is given by mg(3 − 2 cos θ).
Exercise
2.4
1.
An object is travelling at a constant speed of 40 m s⁻¹ in a circular path of radius 80 m. Calculate the acceleration of the object.
A body of mass 5.0 kg, lying on a smooth horizontal table, is attached to an inextensible string of length 0.35 m, the other end of which is fixed to the table. The body is whirled at a constant speed of 2.0 m s⁻¹. Calculate
(a) the centripetal acceleration.
(b) the tension in the string.
(c) the period of motion.
8.
2.
m kg
3.
The radius of the path of an object in uniform circular motion is doubled. The centripetal force needed if its speed remains the same is
A. half as great as before.
B. the same as before.
C. twice as great as before.
D. four times as great as before.
9.
A 3 kg mass attached to a string 6 m long is to be swung in a circle at a constant speed, making one complete revolution in 1.25 s. Determine the tension that the string must be able to withstand if it is not to break when the circular motion is in
(a) a horizontal plane
(b) a vertical plane
4.
A car rounds a curve of radius 70 m at a speed of 12 m s⁻¹ on a level road. Calculate its centripetal acceleration.
A 500 g sphere is hung from an inextensible string 1.25 m long and swung around to form a conical pendulum. The sphere moves in a circular horizontal path of radius 0.75 m. Determine the tension in the string.
5.
A mass, m kg, is released from point A, down a smooth inclined plane and once it reaches point B, it completes the circular motion, via the smooth circular track B to C to D and then back through B, which is connected to the end of the incline and has a radius a / m.
A D 4a
1.25 m
D.
(b)
C.
(c)
B.
6.
Determine the maximum (constant) speed at which a car can safely round a circular curve of radius 50 m on a horizontal road if the coefficient of static friction (μ) between the tyres and the road is 0.7. (Use g = 10 m s⁻².) (HINT: if the normal reaction is N, the relationship is F = μN.)
Topic 2.2
1.
2.
2.
60
[Graph: speed / m s⁻¹ on the vertical axis, 0 to 15]
The strings are of the same length and the angle between them is 60°. Draw a free body diagram of the forces acting on the object. Calculate the tension in the strings.
3. When a person stands on bathroom scales the scale reads 60 kg. Suppose the person stands on the same scales when in an elevator (lift). The elevator accelerates upwards at 2.0 m s⁻². Determine the new reading on the scale.
The diagram shows two blocks connected by a string that passes over a pulley.
[Graph: time / s on the horizontal axis, 20 to 140]
Calculate the
i. acceleration of the car between 0 and 20 s and between 120 and 130 s.
ii. total distance travelled by the car during braking.
iii. total distance between the traffic lights.
3. A person drops a stone down a water well and hears a splash 2.0 s after it leaves his hand. Determine the depth of the well (g = 10 m s⁻²).
4. A girl throws a stone vertically upwards. The stone leaves her hand with a speed of 15.0 m s⁻¹. Determine (i) the maximum height reached by the stone and (ii) the time it takes to return to the ground after leaving her hand (g = 10 m s⁻²).
A
Block A has a mass of 2.0 kg. Block B has a mass of 4.0 kg and rests on a smooth table. Determine the acceleration of the two blocks.
5. Here are four statements about a book resting on a table.
A. The book exerts a force on the table.
B. The table exerts a force on the book.
C. The book exerts a force on the Earth.
D. The Earth exerts a force on the book.
4.
An object is thrown through the air. Ignoring air resistance, draw a free body diagram of the forces acting on the ball whilst it is in flight.
Which forces form a pair of forces as described by Newton's Third Law?
6. When a golfer strikes a golf-ball it is in contact with the club head for about 1 ms and the ball leaves the club head with a speed of about 70 m s⁻¹. If the mass of the ball is 50 g, estimate the maximum accelerating force exerted on the golf ball, stating any assumptions that you make.
A ball of mass 0.1 kg is dropped from a height of 2.0 m onto a hard surface. It rebounds to a height of 1.5 m and it is in contact with the surface for 0.05 s. Calculate the
i. speed with which it strikes the surface.
ii. speed with which it leaves the surface.
iii. change in momentum of the ball.
iv. impulse given to the ball on contact with the surface.
v. average force that the surface exerts on the ball.
it falls through a height of 2.5 m before striking the top of the pile. It stays in contact with the pile and drives it a distance of 0.40 m into the ground. Calculate the average force exerted by the ground on the pile by using
i. energy considerations
ii. the equations of uniform motion and Newton's Second Law.
(Assume that the mass of the pile driver is much greater than the mass of the pile.)
7
2.
A man slides a box of mass 50 kg at constant speed up an inclined slope to a height of 2.0 m. The slope makes an angle of 30° with the horizontal, it takes him 4 s to reach the height of 2.0 m, and a constant frictional force of 250 N acts on the block. Calculate
i. the work the man does against friction
ii. the work the man does against gravity
iii. the efficiency of the man-slope machine
iv. the power the man develops to push the block up the slope
A bullet of mass 0.02 kg is fired into a block of wood of mass 1.5 kg resting on a horizontal table. The block moves off with an initial speed of 8.0 m s⁻¹. Estimate the speed with which the bullet strikes the block.
The bullet in question 8 is fired from a rifle of mass 2.5 kg. Assuming that the bullet leaves the barrel of the rifle with the speed calculated above, find the recoil speed of the rifle if it is free to move. In reality the rifle is held and for a certain person the rifle recoils a distance of 0.12 m. Determine the average force that the person exerts on the rifle.
3.
This question is about calculating the power output of a car engine. Here is some data about a car that travels along a level road at a speed of 25 m s⁻¹.
Fuel consumption = 0.20 litre km⁻¹
Calorific value of the fuel = 5.0 × 10⁶ J litre⁻¹
Engine efficiency = 50%
Topic 2.3
1. The diagram shows a pile driver that is used to
pile driver
Determine:
i. the rate at which the engine consumes fuel
ii. the rate at which the fuel supplies energy
iii. the power output of the engine
iv. the power used to overcome the frictional forces acting on the car
v. the average frictional force acting on the car.
pile ground
Explain why:
(i) the power supplied by the engine is not all used to overcome friction
(ii) the fuel consumption increases as the speed of the car increases
drive a metal bar (the pile) into the ground. A particular pile driver has a mass of 500 kg and
THERMAL PHYSICS
3.1 Thermal concepts
3.2 Thermal properties of matter
PHLOGISTON/CALORIC THEORY
The concept of heat has been studied for many centuries. Aristotle (384-322 B.C.) considered fire one of the five basic elements of the Universe. Over 2000 years ago, Greek philosophers believed that matter was made of "atomos", elemental atoms in rapid motion, and that the result of this rapid motion was heat. It was understood that heat flowed from hot bodies to colder ones, somewhat analogous to water or another fluid flowing from a higher to a lower elevation. It is not surprising that the early theory of heat flow regarded heat as a type of fluid. Around the time of Galileo Galilei (1564-1642), this heat fluid was known as phlogiston, the soul of matter. Phlogiston was believed to have a negative mass, and, upon heating or cooling, the phlogiston was driven out or absorbed by an object.
Further refinements of the phlogiston theory were carried out by Antoine Lavoisier (1743-1794), and it became known as the caloric theory. Sir Isaac Newton (1642-1727) and other famous scientists supported the caloric theory. Calorists believed that a hot object had more caloric than a cold object. They explained expansion by saying that the caloric filled up the spaces between atoms, pushing them apart. The total amount of caloric was unchanged when a hot and a cold body came into contact. However, the caloric theory did not adequately explain some phenomena involving heat. It was difficult to understand how the conservation of caloric fluid applied to friction and the expansion of liquids and gases. Some calorists' answer to the friction concept was that latent heat was released, which implies that a change of state was involved. Others argued that during friction the material is "damaged" and that it "bleeds" heat. No satisfactory answers were forthcoming.
COUNT RUMFORD
Much of the credit for establishing the idea that heat was motion rather than a substance or caloric goes to Benjamin Thompson (1753-1814), also known as Count Rumford of Bavaria. During the American Revolution, he was a Tory or loyalist in the disputes between Britain and its American colonies, serving as a major in a company of militia. It is believed that he invented a cork flotation system for cannons while they were being transported by horses across rivers. He also designed a gun carriage that could be carried by three horses and could be assembled ready for firing in 75 seconds. He
CHAPTER 3
was knighted by King George III of England, and made a Count in 1791 by Theodor in his brief reign as elector of the Holy Roman Empire. In 1793, Thompson left England ultimately to take up a post with the aforementioned Theodor, elector of Bavaria. He was appointed a major general in the Bavarian army. He designed fortifications and worked as an administrator in munitions. It was here that he observed that a large amount of heat was generated in the boring of cannons. He read the following extracts before the Royal Society of London in 1798.
"Being engaged, lately, in superintending the boring of cannon, in the workshops of the military arsenal at Munich, I was struck with the very considerable degree of Heat which a brass gun acquires, in a short time, in being bored; and with the still more intense Heat (much greater than that of boiling water as I found by experiment) of the metallic chips separated from it by the borer."
"From whence comes the Heat actually produced in the mechanical operation above mentioned? Is it furnished by the metallic chips which are separated by the borer from the solid mass of metal? If this were the case, then, according to the modern doctrines of latent heat, and of caloric, the capacity for Heat of the parts of the metal, so reduced to chips, ought not only to be changed, but the change undergone by them should be sufficiently great to account for all the Heat produced."
Count Rumford was saying that the metal chips should have undergone some alteration in their properties after the production of so much thermal energy.
He noted that some cannon shavings were hot enough to glow, but he continued:
"But no such change had taken place; for I found, upon taking equal quantities, by weight, of these chips, and of thin slips of the same block of metal separated by means of a fine saw, and putting them, at the same temperature (that of boiling water), into equal quantities of cold water, the portion of water into which the chips were put was not, to all appearance, heated either less or more than the other portion, into which the slips of metal were put. From whence it is evident that the Heat produced [by boring the cannon] could not possibly be furnished at the expense of the latent Heat of the metallic chips."
Rumford further went on to explain that he had immersed cannons in water while they were being bored and noted the rate at which the temperature rose. His results showed that the cannon would have melted had it not been cooled. Rumford concluded that heat was not a caloric fluid in which caloric is conserved but rather a concept of motion. He argued that heat is generated when work is done, and that the heat will continue to be generated as long as work is done. He estimated a heat to work ratio within the order of magnitude accepted today. However, many scientists of the time were not convinced because Rumford could not give a clear explanation of exactly what heat was in terms of the accepted model for matter at that time. It would take another half century before Joule supplied the accepted answers.
that he could measure the temperature with a precision of a fraction of a degree.
handles
Example
[Figure 301 labels: handles; flywheel; to pulley with weights attached; moving vanes; calorimeter (containing water); fixed vanes; spindle; to pulley with weights attached]
Solution
Figure 301 Schematic diagram of Joule's paddlewheel experiment.
Joule arranged the vanes of the paddlewheel so that they would not interfere with the particles of water set in motion. He didn't want to bruise or damage the water particles so that they might "bleed" heat. In 1849 he published his results, in which he reported:
"the quantity of heat produced by friction of bodies, whether solid or liquid, is always proportional to the quantity of [energy] expended."
"the quantity of heat capable of increasing the temperature of a pound of water by 1° Fahrenheit requires for its evolution the expenditure of a mechanical energy represented by the fall of 772 pound through the distance of one foot."
Joule found that about 4.2 joules of work would yield one calorie of heat, where one calorie is the quantity of heat required to raise the temperature of one gram of water by 1 °C. A modern day value for the mechanical equivalent of heat is 4.18605 joules = 1 calorie. The experiments proved beyond doubt that mechanical work can produce heat and as such no caloric fluid can be created or destroyed. Furthermore, Joule reasoned that the temperature increase must be related to the energy of the microscopic motions of the particles. Finally, a paradigm shift in our way of reasoning had again proved that science is not the ultimate truth.
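Work done by gravity is given by W = mgh = 2.0 kg × 9.8 m s⁻² × 100 m = 1.96 × 10³ J
Heat energy produced is given by Q = m × c × ΔT = 10 g × 1 cal g⁻¹ °C⁻¹ × 46.8 °C = 4.68 × 10² calories
Mechanical equivalent of heat is given by W / Q = (1.96 × 10³ J) / (4.68 × 10² cal) = 4.19 J cal⁻¹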
Calculate the mechanical equivalent of heat for Joule's paddlewheel experiment if a mass of 2.0 kg falls through a height of 100 m, and increases the temperature of 10 g of water by 46.8 °C.
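A numerical check of this example is sketched below. It assumes, as the worked solution does, that all of the gravitational potential energy lost by the falling mass ends up heating the water, and it takes the specific heat capacity of water as 1 cal g⁻¹ °C⁻¹. The function name is illustrative.

```python
def mechanical_equivalent(mass_kg, height_m, water_g, delta_t_c, g=9.8, c_cal=1.0):
    """Return the mechanical equivalent of heat in joules per calorie."""
    # Work done by gravity on the falling mass, in joules.
    work_j = mass_kg * g * height_m
    # Heat gained by the water, in calories (Q = m c delta-T).
    heat_cal = water_g * c_cal * delta_t_c
    return work_j / heat_cal


ratio = mechanical_equivalent(2.0, 100.0, 10.0, 46.8)
print(f"Mechanical equivalent of heat = {ratio:.2f} J per calorie")
```

The result, about 4.19 J per calorie, is close to the value of about 4.2 J per calorie reported for Joule's experiments.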
3.1.2 State the relation between the Kelvin and Celsius scales of temperature.
3.1.3 State that the internal energy of a substance is the total potential energy and random kinetic energy of the molecules of the substance.
3.1.4 Explain and distinguish between the macroscopic concepts of temperature, internal energy and thermal energy (heat).
3.1.5 Define the mole and molar mass.
3.1.6 Define the Avogadro constant.
IBO 2007
points). For example, different thermometers will give different values for the boiling point of zinc (907 °C).
[Figure labels: vacuum; 100 °C; capillary tube; glass stem; 0 °C]
The lower fixed point is absolute zero and is assigned a value of 0 K. This is the point where molecular vibrations become a minimum: the molecules have minimum kinetic energy but molecular motion does not cease. The upper fixed point is the triple point of water. This is the temperature at which saturated water vapour, pure water and melting ice are all in equilibrium. For historical reasons, it is assigned a value of 273.16 K.
T in K = T in °C + 273.15
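The relation between the two scales can be sketched as a pair of conversion helpers. The offset used here is the standard value of 273.15, which places the triple point of water (273.16 K) at 0.01 °C; the function names are illustrative.

```python
KELVIN_OFFSET = 273.15  # standard difference between the Kelvin and Celsius scales


def celsius_to_kelvin(temp_c: float) -> float:
    """Convert a Celsius temperature to kelvin."""
    temp_k = temp_c + KELVIN_OFFSET
    if temp_k < 0:
        raise ValueError("temperature is below absolute zero")
    return temp_k


def kelvin_to_celsius(temp_k: float) -> float:
    """Convert a temperature in kelvin to degrees Celsius."""
    if temp_k < 0:
        raise ValueError("temperature is below absolute zero")
    return temp_k - KELVIN_OFFSET


print(f"Ice point   = {celsius_to_kelvin(0.0):.2f} K")
print(f"Steam point = {celsius_to_kelvin(100.0):.2f} K")
print(f"Triple point of water = {kelvin_to_celsius(273.16):.2f} °C")
```

A temperature difference has the same numerical value on both scales, since the conversion only shifts the zero.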
Exercise
A clinical thermometer, as shown in Figure 303, does not need the temperature range of a laboratory thermometer. It is designed so that the maximum temperature reading remains constant after the patient's temperature is taken. It has a small constriction to stop the mercury flowing back into the bulb. The mercury is then shaken back into the bulb after the temperature has been taken.
3.1 (a)
1.
At room temperature, an iron rod feels cooler when held in the hand than wood held in the same hand. This is because:
A. thermal energy tends to flow from the metal to the wood
B. wood has a higher specific heat capacity than the iron rod
C. wood has a lower specific heat capacity than the iron rod
D. the iron rod conducts thermal energy better than the wood
Figure 303 A Clinical Thermometer
In order to calibrate these thermometers, two fixed points are used to define the standard temperature interval. The ice point (the lower fixed point), marked at 0 °C, is the temperature of pure ice at standard atmospheric pressure and is in thermal equilibrium with the liquid in the bulb. The steam point (the upper fixed point), marked at 100 °C, is the temperature of steam at standard atmospheric pressure and is in thermal equilibrium with the liquid in the bulb. The scale between these values is marked with even spaces. The Celsius temperature scale, named after the Swedish astronomer Anders Celsius (1701-1744), is constructed in such a manner. Although thermometers constructed using thermometric properties are useful for everyday use, they are not accurate enough for scientific work. Unfortunately, two thermometers constructed using different thermometric properties do not necessarily agree with each other as they do not vary linearly over large temperature ranges. (They are of course in agreement at the lower and upper fixed
2.
Explain the difference between heat and temperature.
If you were travelling to Antarctica, deduce which would be the better thermometer to take: mercury or alcohol.
State one advantage and one disadvantage of a
i. mercury in glass thermometer
ii. constant volume thermometer.
3.
4.
5.
The triple point of water is 273.16 K. Express this as a Celsius temperature.
Determine the ice point and the steam point of pure water on the Kelvin scale.
6.
The standard fundamental temperature scale in the SI system is denoted by the symbol T and is measured in kelvin, K. It is the thermodynamic temperature scale used in scientific measurement.
7. 8. Define absolute zero.
Potential energy, U / J
Figure 305 Potential energy versus separation of particles
Work done = force × distance = F × Δr = change in potential energy
The bond energy is a form of chemical potential energy. It becomes significant in chemistry when a chemical reaction occurs, and bonds are broken and formed. The intermolecular forces of attraction between particles are due to the electromagnetic fundamental force, since the gravitational force is too small to be of any significance. Figure 304 indicates how the intermolecular electromagnetic force F between particles varies with the distance r between their centres. At distances greater than r₀ (less than 2.5 × 10⁻¹⁰ m) attraction takes place, and at distances closer than r₀ the particles repel. At r₀ the particles are in equilibrium. Any displacement from the equilibrium position results in a simple harmonic oscillation of a particle or molecule.
Force, F / N
$F = \frac{\Delta E_p}{\Delta r}$
In other words, the gradient of the potential energy curve at any point on the curve gives the force that must be applied to hold the molecules at that separation. We can classify the phases according to the sizes of the energy .
, the vibrations occur about fixed When less than __ 10 positions and the particles are in the solid phase. When , the particles have sufficient approximately equal to __ 10 energy to partly overcome the attractive forces and melting occurs. , a liquid can form. When greater When greater than __ 10 than , the particles have sufficient energy to leave the liquid and form a gas.
+
repulsion
The kinetic energy is mainly due to the translational, rotational and vibrational motion of the particles as depicted in Figure 306.
Figure 304 (axes: force, F / N against nuclear separation, r / m; repulsion at separations less than r₀, attraction at separations greater than r₀)
Figure 305 shows the relationship between the potential energy and the separation r of two molecules. At 0 K, the average separation of the particles' centres is r₀ and the overall force is zero. This is the point of minimum potential energy. Work will need to be done to move the particles apart and there will be an increase in potential energy.
Figure 306
the aroma of the coffee due to diffusion caused by the kinetic energy of the air molecules allowing the aroma molecules to spread. The expansion of solids, liquids and gases is a macroscopic property that allows us to understand that matter in a system has potential energy. When you heat a liquid it can be seen to expand, as in a thermometer, and this means that the potential energy of the system is increasing as the molecules move further apart. The compressibility of gases allows us to understand that the potential energy of the molecules is decreasing. Although the internal energy in the examples above can never be absolutely determined, the change in internal energy can be observed.
TEMPERATURE
At the macroscopic level, temperature is the degree of hotness or coldness of a body as measured by a thermometer. Thermometers are made using the thermometric properties of a substance such as: the expansion of a column of liquid in a capillary tube (laboratory and clinical thermometers). the electrical resistance of a wire (resistance and thermister thermometers). the difference in the rates of expansion of two metals in contact (bimetallic strips). the pressure of a gas at constant volume. the volume of a gas at constant pressure (gases expand by a greater amount and more evenly than liquids). the heating of two metal wires wound together (thermocouple thermometers rely on the two metals producing different currents). the colour of a solid heated to high temperatures (pyrometers).
Therefore, it is incorrect to refer to the thermal energy in a body. At the macroscopic level, thermal energy (heat) can be transferred from one body to another by conduction, convection, radiation or by combinations of these three. Thermal conduction is the process by which a temperature difference causes the transfer of thermal energy from the hotter region of the body to the colder region by particle collision without there being any net movement of the substance itself. Thermal convection is the process in which a temperature difference causes the mass movement of fluid particles from areas of high thermal energy to areas of low thermal energy (the colder region). Thermal radiation is energy produced by a source because of its temperature that travels as electromagnetic waves. It does not need the presence of matter for its transfer. Conduction can occur in solids, liquids and gases. In gases it occurs due to the collision between fast and slow moving particles where kinetic energy is transferred from the fast to the slow particle. The transfer of energy is very slow because the particles are far apart relative to solids and liquids. In liquids, a particle at higher temperature vibrates about its position with increased vibrational energy. Because the majority of the particles are coupled to other particles they also begin to vibrate more energetically. These in turn cause further particles
INTERNAL ENERGY
As already mentioned, internal energy is the sum total of the potential energy and the random kinetic energy of the molecules of the substance making up a system. In order to apply the Law of conservation to thermal systems, one has to assume that a system has internal energy. At the macroscopic level, it can be observed that molecules are moving. When pollen (a fine powder produced by flowers) is sprinkled on the surface of water and the setup is viewed under magnification, it can be seen that the pollen particles carry out zig-zag motion called Brownian Motion. Their motion is caused by the kinetic energy of the water molecules. Walking past a coffee shop you smell
to vibrate and thermal conduction occurs. This process is also slow because the particles have a large relative mass and the increase in vibrations is rather small. In solids, the transfer can occur in one of two ways. Most solids behave similarly to liquids. However, solids are held in their fixed positions more rigidly than liquids and the transfer of vibrational energy is more effective. However, again their large masses do not allow for large energy transfer. If a substance in the solid or molten form has mobile electrons, as is the case for metals, these electrons gain energy due to the temperature rise and their speeds increase much more than those held in their fixed positions in the lattice. Metals are said to be good conductors of heat but most other solids are good insulators. Saucepans for cooking are usually made of copper or aluminium because these metals conduct heat quickly when placed on a stove. The handle is made from a good solid insulator to reduce the conduction of heat. Generally, liquids and gases are not good thermal conductors. However, they can transfer heat readily by convection. Ocean currents, wind and weather patterns suggest that the mass movement of particles from one area to another can cause movement of particles on the grand scale. Figure 307 shows a potassium permanganate crystal placed in water inside a convection tube. Heat is applied for a short period of time and the direction of the purple trail is noted. Particles in a region of high thermal energy are further apart and hence their density is lower. In a region of low thermal energy the particles are closer together and the region is more dense. As a result, the less dense region rises as it is pushed out of the way by the more dense region and a convection current is produced. All thermal energy ultimately comes from the Sun in our solar system. It travels through 150 million km of mostly empty space. 
At the Earth's atmosphere the radiant energy is mainly reflected back into space. However, some is transmitted and absorbed, causing a heating effect. Just as the Sun emits thermal radiation, so does any source that produces heat, such as a light bulb or an electric heater. Thermal radiation is mainly electromagnetic waves in the infra-red region of the electromagnetic spectrum at temperatures below 1000 °C. Above this temperature, wavelengths of the visible and ultra-violet regions are also detected. Dull black bodies are better absorbers and radiators than transparent or shiny bodies.
The mole is the amount of substance that contains as many elementary particles as there are in 0.012 kg of carbon-12. Amedeo Avogadro (1776-1856) found that equal volumes of gases at the same temperature and pressure contained the same number of particles. One mole of any gas contains the Avogadro number of particles NA. It is now known that one mole of a gas occupies 22.4 dm³ at 0 °C and
Figure 307 Convection current.
Another way in which a fluid can move is by forced convection. In this case, a pump or fan system maintains the movement of a fluid. The cooling system in nuclear reactors operates on this principle.
101.3 kPa pressure (STP) and contains 6.02 × 10²³ particles. When using the mole, the atoms or molecules should be clearly stipulated. For example, one mole of copper atoms contains 6.02 × 10²³ copper atoms. One mole of nitrogen molecules (N₂) contains 6.02 × 10²³ nitrogen molecules and 12.04 × 10²³ nitrogen atoms. If we have one mole of NH₄NO₃, this contains:

1 mol of NH₄⁺ ions = 6.02 × 10²³ ammonium ions
1 mol of NO₃⁻ ions = 6.02 × 10²³ nitrate ions
2 mol of nitrogen atoms = 1.204 × 10²⁴ nitrogen atoms
4 mol of hydrogen atoms = 2.408 × 10²⁴ hydrogen atoms
3 mol of oxygen atoms = 1.806 × 10²⁴ oxygen atoms

The amount of substance (the moles) is related to the mass and the molar mass according to the following equation:

$$n = \frac{m}{M}$$

where n = amount of a substance in mol, m = the mass in g and M = the molar mass in g mol⁻¹. The molar mass can be obtained from the periodic table.

Example

1. Calculate the number of moles of oxygen molecules contained in 64.0 g of oxygen gas, O₂.
2. Calculate the number of oxygen molecules in part 1 of this example.
3. Determine the volume of oxygen gas that would be present at STP.
4. Calculate the mass in 0.75 mol of carbon dioxide gas.

Solution

1. n = m / M = 64.0 g / 32.0 g mol⁻¹ = 2 mol
2. Number of molecules = 2 mol × 6.02 × 10²³ mol⁻¹ = 1.204 × 10²⁴ molecules.
3. Volume = 2 mol × 22.4 dm³ mol⁻¹ = 44.8 dm³
4. m = n M = 0.75 mol × (12 + 16 + 16) g mol⁻¹ = 33 g
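The four parts of the mole example can be checked with a short Python sketch. The function and constant names are illustrative only and are not part of the text; the constants are the rounded values quoted in this section (6.02 × 10²³ particles per mole and 22.4 dm³ per mole at STP).

```python
# Minimal sketch of n = m / M and the related mole calculations.
# All names are hypothetical; constants are the rounded values used in the text.

AVOGADRO = 6.02e23        # particles per mole
MOLAR_VOLUME_STP = 22.4   # dm^3 occupied by one mole of gas at STP


def amount_of_substance(mass_g, molar_mass_g_per_mol):
    """Return the amount n in mol from n = m / M."""
    return mass_g / molar_mass_g_per_mol


def number_of_particles(n_mol):
    """Return the number of particles in n mol."""
    return n_mol * AVOGADRO


def gas_volume_at_stp(n_mol):
    """Return the volume in dm^3 of n mol of gas at STP."""
    return n_mol * MOLAR_VOLUME_STP


def mass_from_amount(n_mol, molar_mass_g_per_mol):
    """Return the mass in g from m = n M."""
    return n_mol * molar_mass_g_per_mol


# Parts 1 to 4 of the example (O2 has M = 32.0 g/mol, CO2 has M = 44 g/mol).
n_o2 = amount_of_substance(64.0, 32.0)
molecules_o2 = number_of_particles(n_o2)
volume_o2 = gas_volume_at_stp(n_o2)
mass_co2 = mass_from_amount(0.75, 12 + 16 + 16)

print(n_o2, molecules_o2, volume_o2, mass_co2)
```

Keeping each relation in its own small function mirrors the way the example reuses the result of part 1 in parts 2 and 3.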
Exercise 3.1 (b)

1. The internal energy of a substance is equal to:
A. the total potential energy stored in the bonds of a substance
B. the potential and kinetic energy of molecules in a substance
C. the energy stored in bonds and intermolecular forces of a substance
D. the translational, rotational and vibrational motion of particles in the substance

2. The number of moles of sodium chloride (NaCl) in 100 g of pure sodium chloride is (Mr of NaCl = 58.5 g mol⁻¹)
A. 5850 mol
B. 0.585 mol
C. 1.71 mol
D. 41.5 mol

3. Two different objects with different temperatures are in thermal contact with one another. When the objects reach thermal equilibrium, the direction of transfer of thermal energy will be
A. from the lower temperature object to the higher temperature object
B. half way between the temperatures of the two objects
C. from the higher temperature object to the lower temperature object
D. in many different directions

4. A sealed flask contains 16 g of oxygen (mass number 16) and also 8 g of hydrogen (mass number 1). The ratio of the number of atoms of hydrogen to the number of atoms of oxygen is
A. 16
B. 8
C. 4
D. 2

5. The number of molecules present in 0.5 mol SO₃ is
A. 3 × 10²³
B. 6 × 10²³
C. 12 × 10²³
D. 24 × 10²³

6. The number of atoms present in 0.5 mol SO₃ is
A. 3 × 10²³
B. 6 × 10²³
C. 12 × 10²³
D. 24 × 10²³

7. Calculate the approximate molar masses of each of the following molecules and compounds:
(a) Cl₂
(b) HCl
(c) CuSO₄
(d) Na₂CO₃
(e) CH₄

8. Calculate the mass of the given amounts of each of the following substances:
(a) 2.0 mole of iron, Fe
(b) 0.2 mole of zinc, Zn
(c) 2.5 mole of carbon dioxide, CO₂
(d) 0.001 mole of sulfur dioxide, SO₂
(e) 50 mole of benzene, C₆H₆

9. Calculate the amount of substance (number of mole) in:
(a) 100 g of copper
(b) 5.0 g of oxygen molecules
(c) 100 g of calcium carbonate, CaCO₃
(d) 4.4 g of carbon dioxide
(e) 13.88 g of lithium

10. A sample of aluminium sulfate Al₂(SO₄)₃ has a mass of 34.2 g. Calculate:
(a) the number of aluminium ions Al³⁺ in the sample
(b) the number of sulfate ions SO₄²⁻ in the sample

11. Classify the following as a macroscopic or microscopic property of a gas
(a) volume
(b) specific heat capacity
(c) kinetic energy of a particle
(d) pressure
(e) temperature
Example

The thermal capacity of a sphere of lead is 3.2 × 10³ J K⁻¹. Determine how much heat can be released if the temperature changes from 61 °C to 25 °C.

Solution

Q = thermal capacity × ΔT = 3.2 × 10³ J K⁻¹ × (61 − 25) °C = 3.2 × 10³ J K⁻¹ × 36 °C = 1.2 × 10⁵ J

ΔT is the change in temperature in kelvin, K. Water is used in car cooling systems and heating systems because of its high thermal capacity. A metal heat sink is

Figure 308 Front-on view of the metal blocks after a period of time.
The aluminium block melts the most wax and the lead melts the least. Therefore, the metals of the same mass give out different amounts of thermal energy in a certain time period. This can be explained from a microscopic viewpoint. The kilogram unit masses have different numbers of particles of different types and masses. The metal blocks were given the same amount of thermal energy when they were heated to 80 °C. When the thermal energy gained by each metal is distributed amongst its particles, the average energy change of each particle will be different for each metal. To obtain a characteristic value for the heat capacity of materials, equal masses of the materials must be considered. The physical property that includes the mass is called the specific heat capacity of a substance, c. Specific heat capacity or specific heat is the heat capacity per unit mass. It is defined as the quantity of thermal energy required to raise the temperature of a unit mass of a substance by one kelvin.
$$c = \frac{Q}{m\,\Delta T} \qquad\text{or}\qquad Q = m\,c\,\Delta T$$

Q = the change in thermal energy required to produce a temperature change, in joules, J.
m = mass of the material in grams (g) or kilograms (kg).
ΔT = the temperature change in kelvin, K.

Note that ΔT is always positive because heat always transfers from the higher temperature region to the lower temperature region.

For gases, the molar heat capacity at constant volume Cv and the molar heat capacity at constant pressure Cp are more commonly used. Molar heat capacity is the quantity of heat required to raise the temperature of one mole of the gas by one kelvin under the constant condition.

Figure 309 shows the specific heat capacity for some common substances at room temperature (except ice).

Substance | Specific heat / J kg⁻¹ K⁻¹
Lead | 1.3 × 10²
Mercury | 1.4 × 10²
Zinc | 3.8 × 10²
Brass | 3.8 × 10²
Copper | 3.85 × 10²
Iron | 4.7 × 10²
Aluminium | 9.1 × 10²
Sodium | 1.23 × 10³
Ice | 2.1 × 10³
Water | 4.18 × 10³

Figure 309

Example

Determine how much thermal energy is released when 650 g of aluminium is cooled from 80 °C to 20 °C.

Solution

Using the fact that Q = m c ΔT, we have,
Q = 0.650 kg × 9.1 × 10² J kg⁻¹ K⁻¹ × (80 − 20) K
= 3.549 × 10⁴ J
≈ 4 × 10⁴ J
That is, 4 × 10⁴ J of heat is released.

Example

An active solar heater is used to heat 50 kg of water initially at a temperature of 12 °C. If the average rate that thermal energy is absorbed in a one hour period is 920 J min⁻¹, determine the equilibrium temperature after one hour.

Solution

Quantity of heat absorbed in one hour = 920 J min⁻¹ × 60 min = 5.52 × 10⁴ J
Using the fact that Q = m c ΔT, we have
5.52 × 10⁴ J = 5.0 × 10¹ kg × 4.18 × 10³ J kg⁻¹ K⁻¹ × (Tf − 12) K
5.52 × 10⁴ J = 2.09 × 10⁵ J K⁻¹ × (Tf − 12) °C
5.52 × 10⁴ J = 2.09 × 10⁵ Tf − 2.51 × 10⁶ J
2.09 × 10⁵ Tf = 5.52 × 10⁴ J + 2.51 × 10⁶ J
Tf = 12.3 °C, that is, approximately 12 °C to two significant figures.
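Both worked examples rely on the single relation Q = m c ΔT, so they can be reproduced with a small Python sketch. The function names are illustrative assumptions, not anything defined in the text; the specific heat values are the ones used in the two solutions.

```python
# Minimal sketch of Q = m c ΔT applied to the two worked examples.
# Function and variable names are hypothetical.

C_ALUMINIUM = 9.1e2   # J kg^-1 K^-1, value used in the aluminium example
C_WATER = 4.18e3      # J kg^-1 K^-1, value used in the solar heater example


def heat_transferred(mass_kg, specific_heat, delta_t):
    """Return Q = m c ΔT in joules (ΔT taken as a positive difference)."""
    return mass_kg * specific_heat * delta_t


def final_temperature(mass_kg, specific_heat, t_initial, heat_j):
    """Return the final temperature after absorbing heat_j joules."""
    return t_initial + heat_j / (mass_kg * specific_heat)


# Aluminium example: 650 g cooled from 80 °C to 20 °C.
q_aluminium = heat_transferred(0.650, C_ALUMINIUM, 80 - 20)

# Solar heater example: 920 J per minute absorbed for 60 minutes by 50 kg of water at 12 °C.
q_solar = 920 * 60
t_final = final_temperature(50, C_WATER, 12, q_solar)

print(q_aluminium, q_solar, round(t_final, 1))
```

The solar heater result shows why the quoted answer is so close to the starting temperature: 5.52 × 10⁴ J is small compared with the thermal capacity of 50 kg of water.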
EXTENSION METHODS TO DETERMINE SPECIFIC HEAT CAPACITY
A calorimeter is a useful piece of equipment for investigations in Thermal Physics because it allows masses at different temperatures to be mixed with minimum energy loss to the surroundings. It is used for direct and indirect methods in determining the specific heat capacity of a substance. (The name of the instrument is derived from the Imperial unit, the calorie.)
Figure 310 Calorimeter being used to measure the heating effect of a current

Figure 310 illustrates the use of a calorimeter to determine the specific heat capacity of a liquid, in this case water. The heating coil is used to convert electrical energy to thermal energy. The electrical energy can be measured by a joulemeter or by using a voltmeter/ammeter circuit. The duration of time of electrical input is noted. The thermal energy gained by the calorimeter cup and the water is equal to the electrical energy lost to the calorimeter cup and water.

Electrical energy lost = V I t = [m c ΔT]calorimeter cup + [m c ΔT]water

where V is the potential difference across the heating coil in volts, V, I is the current in amperes, A, and t is the heating time. The specific heat capacity, c, of the calorimeter cup is obtained from published values. The other quantities are recorded and the specific heat capacity of the water is calculated.

In calorimeter investigations, heat losses to the surroundings need to be minimised. It is normal to polish the calorimeter cup to reduce loss of heat due to radiation. The calorimeter is also insulated with lagging materials such as wool or polystyrene to reduce heat loss due to conduction and convection.

After the power supply is switched off, the temperature should continue to rise for a period, and then level out indefinitely. However, heat is lost to the surroundings, and the maximum temperature that could be achieved, in theory, is never reached. Instead appreciable cooling occurs. One method used to estimate the theoretical maximum temperature is to use a cooling correction curve as shown in Figure 311. (Note that cooling correction is not required in the syllabus but is included for possible extended essays.)

Figure 311 (graph of temperature / °C against time / minutes, showing the actual curve, the theoretical curve, room temperature, the temperatures θ₁, θ₂ and θ₃, the times 2t and 3t, and the areas A₁ and A₂ under the actual curve; Δθ is the correction and δθ = θ₂ − θ₁)

$$\Delta\theta = \frac{A_1}{A_2}\,\delta\theta, \quad\text{so that}\quad \theta_3 = \theta_2 + \Delta\theta$$
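The energy balance for the electrical calorimeter method, V I t = [m c ΔT]cup + [m c ΔT]water, can be rearranged for the specific heat capacity of the liquid. The Python sketch below does this under the assumption of no heat loss to the surroundings. The function name and all of the readings are hypothetical and chosen only for illustration; they are not measurements from the text.

```python
# Minimal sketch: specific heat capacity of a liquid from the electrical method,
# assuming all electrical energy goes to the cup and the liquid (no heat loss).
# The function name and the readings below are hypothetical.


def specific_heat_of_liquid(volts, amps, seconds,
                            m_cup_kg, c_cup, m_liquid_kg, delta_t):
    """Solve V I t = m_cup c_cup ΔT + m_liquid c_liquid ΔT for c_liquid."""
    electrical_energy = volts * amps * seconds      # energy supplied, in J
    energy_to_cup = m_cup_kg * c_cup * delta_t      # energy gained by the cup, in J
    return (electrical_energy - energy_to_cup) / (m_liquid_kg * delta_t)


# Illustrative readings: 12 V, 2.0 A for 600 s; 0.050 kg copper cup
# (c = 3.85e2 J kg^-1 K^-1); 0.200 kg of water; temperature rise of 16.8 K.
c_water = specific_heat_of_liquid(12.0, 2.0, 600.0, 0.050, 3.85e2, 0.200, 16.8)

print(round(c_water))
```

In a real investigation the measured temperature rise is lower than the ideal value because of heat loss, so a calculation like this tends to overestimate the specific heat capacity unless a cooling correction is applied.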
A cooling correction is based on Newton's Law of Cooling. It states that the rate of loss of heat of a body is proportional to the difference in temperature between the body and its surroundings (excess temperature). A full explanation of this Law will not be given. If the power supply is switched off at time 2t minutes, then the temperature should continue to be recorded for a further t minutes. The correction to the temperature Δθ can be obtained from the graph as shown. The final temperature, θ₃, is then given as the final temperature of the thermometer plus the correction Δθ.

Another direct electrical method used to determine the specific heat capacity of a metal is shown in Figure 312. An immersion heater is placed into a metal block. The hole for the heater is lubricated with oil to allow even heat transmission. The electrical energy lost to the block is recorded for a given period of time and the specific heat of the metal is calculated. Cooling correction is more important in this case because the temperatures under which the investigation is carried out could be much higher than was the case when using a calorimeter.
Figure 312 Electrical method using an immersion heater and a metal block (diagram labels: thermometer, V, immersion heater, metal block, lagging)

A common indirect method to determine the specific heat capacity of solids and liquids is called the method of mixtures. In the case of a solid, a known mass of the solid is heated to a certain temperature, and then transferred to a known mass of liquid in a calorimeter whose specific heat capacity is known. The change in temperature is recorded and the specific heat of the solid is calculated from the results obtained. In the case of a liquid, a hot solid of known specific heat is transferred to a liquid of unknown specific heat capacity.

Example

A block of copper of mass 3.0 kg at a temperature of 90 °C is transferred to a calorimeter containing 2.00 kg of water at 20 °C. The mass of the aluminium calorimeter cup is 0.210 kg. Determine the final temperature of the water.

Solution

The thermal energy gained by the water and the calorimeter cup will be equal to the thermal energy lost by the copper. That is,

[m c ΔT]copper = [m c ΔT]water + [m c ΔT]cup

We also have that,

Thermal energy lost by the copper = (3.0 kg) (3.85 × 10² J kg⁻¹ K⁻¹) (90.0 − Tf) K
Thermal energy gained by the water = (2.0 kg) (4.18 × 10³ J kg⁻¹ K⁻¹) (Tf − 20.0) K
Thermal energy gained by the cup = (0.21 kg) (9.1 × 10² J kg⁻¹ K⁻¹) (Tf − 20.0) K

1.04 × 10⁵ − 1.155 × 10³ Tf = (8.36 × 10³ Tf − 1.67 × 10⁵) + (1.91 × 10² Tf − 3.82 × 10³)

That is, 9.71 × 10³ Tf = 2.75 × 10⁵
Giving Tf = 28.3 °C

The final temperature of the water is 28 °C.
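The method of mixtures balance above can be solved for the final temperature in one step: with no heat loss, the final temperature is the average of the initial temperatures weighted by each body's thermal capacity m c. The Python sketch below applies this to the numbers used in the solution. The function name is an illustrative assumption, and the cup is given the specific heat 9.1 × 10² J kg⁻¹ K⁻¹ that the solution uses.

```python
# Minimal sketch of the method of mixtures, assuming no heat is lost to the surroundings.
# Setting heat lost = heat gained gives
#   Tf = sum(m c T_initial) / sum(m c)
# The function name is hypothetical.


def equilibrium_temperature(bodies):
    """bodies is a list of (mass_kg, specific_heat, initial_temperature) tuples."""
    total_thermal_capacity = sum(m * c for m, c, _ in bodies)
    weighted_temperatures = sum(m * c * t for m, c, t in bodies)
    return weighted_temperatures / total_thermal_capacity


bodies = [
    (3.0, 3.85e2, 90.0),    # copper block
    (2.00, 4.18e3, 20.0),   # water
    (0.210, 9.1e2, 20.0),   # calorimeter cup, with the value used in the solution
]

t_final = equilibrium_temperature(bodies)
print(round(t_final, 1))
```

Writing the balance as a weighted average avoids expanding and collecting the Tf terms by hand, which is where arithmetic slips usually occur.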
Exercise 3.2 (a)

1. The amount of thermal energy required to raise the temperature of 1.53 × 10³ g of water from 15 K to 40 K is
A. 1.6 × 10⁷ J
B. 1.6 × 10⁵ J
C. 4.4 × 10⁷ J
D. 4.4 × 10⁵ J

2. The specific heat capacity of a metal block of mass m is determined by placing a heating coil in it, as shown in the following diagram. The block is electrically heated for time t and the maximum temperature change recorded is Δθ. The constant ammeter and voltmeter readings during the heating are I and V respectively. The electrical energy supplied is equal to VIt.

(Diagram labels: thermometer, V, immersion heater)

The specific heat capacity is best calculated using which one of the following expressions?
A. $c = \dfrac{m\,\Delta\theta}{VI}$
B. $c = \dfrac{VI}{m\,\Delta\theta}$
C. $c = \dfrac{VIt}{m\,\Delta\theta}$
D. $c = \dfrac{m\,\Delta\theta}{VIt}$

3. 5.4 × 10⁶ J of energy is required to heat a 28 kg mass of steel from 22 °C to 450 °C. Determine the specific heat capacity of the steel.

4. Liquid sodium is used as a coolant in some nuclear reactors. Describe the reason why liquid sodium is used in preference to water.

5. 6.00 × 10² kg of pyrex glass loses 8.70 × 10⁶ J of thermal energy. If the temperature of the glass was initially 95.0 °C before cooling, calculate its final temperature. (Take the specific heat capacity of pyrex glass to be 8.40 × 10² J kg⁻¹ K⁻¹)

6. A piece of wood placed in the Sun absorbs more thermal energy than a piece of shiny metal of the same mass. Explain why the wood feels cooler than the metal when you touch them.

7. A hot water vessel contains 3.0 dm³ of water at 45 °C. Calculate the rate at which the water is losing thermal energy (in joules per second) if it cools to 38 °C over an 8.0 h period.

8. Determine how many joules of energy are released when 870 g of aluminium is cooled from 155 °C to 20 °C.

9. If 2.93 × 10⁶ J is used to raise the temperature of water from 288 K to 372 K, calculate the mass of water present.

10. 1.7 MJ of energy is required to cool a 15 kg mass of brass from 400 °C to 25 °C. Determine the specific heat capacity of brass.

11. A piece of iron is dropped from an aeroplane at a height of 1.2 km. If 75% of the kinetic energy of the iron is converted to thermal energy on impact with the ground, determine the rise in temperature.

12. If 115 g of water at 75.5 °C is mixed with 0.22 kg of water at 21 °C, determine the temperature of the resulting mixture.

13. Describe an experiment that would allow you to determine the specific heat capacity of a metal.
(i) Sketch the apparatus.
(ii) Describe what measurements need to be made and how they are obtained.
(iii) State and explain the equation used to calculate the specific heat capacity of the metal.
(iv) Describe 2 main sources of error that are likely to occur in the experiment.
(v) Is the experimental value likely to be higher or lower than the theoretical value, if the experiment was carried out in a school laboratory? Explain your answer.

14. A heating fluid releases 4.2 × 10⁷ J kg⁻¹ of heat as it undergoes combustion. If the fluid is used to heat 250 dm³ of water from 15 °C to 71 °C, and the conversion is 65% efficient, determine the mass of the heating fluid that will be consumed in this process.

15. A large boulder of 125 kg falls off a cliff of height 122 m into a pool of water containing 120 kg of water. Determine the rise in temperature of the water. Assume that no water is lost in the entry of the boulder, and that all the heat goes to the water.

16. A thermally insulated container of water is dropped from a large height and collides inelastically with the ground. Determine the height from which it is dropped if the temperature of the water increases by 1.5 °C.
17. A piece of copper is dropped from a height of 225 m. If 75% of its kinetic energy is converted to heat energy on impact with the ground, calculate the rise in temperature of the copper. (Use the table of specific heat capacities to find the value for copper).

18. 5 kg of lead shot is poured into a cylindrical cardboard tube 2.0 m long. The ends of the tube are sealed, and the tube is inverted 50 times. The temperature of the lead increases by 4.2 °C. If the specific heat of lead is 0.031 kcal kg⁻¹ °C⁻¹, determine the number of work units in joules that are equivalent to the heat unit of 1 kilocalorie.

Figure 314 Arrangement of particles in solids, liquids and gases (panels: solid, liquid, gas)

An atom is the smallest neutral particle that represents an element as displayed in a periodic table of elements. Atoms contain protons, neutrons and electrons and an array of other sub-atomic particles. Atomic diameters are of the order of magnitude 10⁻¹⁰ m. Atoms can combine to form molecules of substances. In chemistry, the choice of the terms, e.g. atoms, molecules, ions, is specific to elements and compounds. In physics, the word particle is used to describe any of these specific chemistry terms at this stage of the course.

As previously mentioned, evidence for the constant motion of particles can be gained from observation of what is known as Brownian Motion. If pollen grains from flowers are placed on water and observed under a microscope, the pollen grains undergo constant random zig-zag motion. The motion becomes more vigorous as the thermal energy is increased with heating. A Whitley Bay smoke cell uses smoke in air to achieve the same Brownian motion. In both cases, the motion is due to the smaller particles striking the larger particles and causing them to move.

The large number of particles in a volume of a solid, liquid or gas ensures that the number of particles moving in all directions with a certain velocity is constant over time. There would be no gaseous state if the particles were losing kinetic energy.

A mutual attractive force must exist between particles, otherwise the particles of nature would not be combined as we know them. Further explanation of this assumption will be given later in this topic.

Matter is defined as anything that has mass and occupies space. There are four states of matter, which are also called the four phases of matter: solids, liquids, gases and plasma. Most matter on Earth is in the form of solids, liquids and gases, but most matter in the Universe is in the plasma state. Liquids, gases and plasma are fluids.

A plasma is made by heating gaseous atoms and molecules to a sufficient temperature to cause them to ionise. The resulting plasma consists then of some neutral particles but mostly positive ions and electrons or other negative ions. The Sun and other stars are mainly composed of plasma.

The remainder of this chapter will concentrate on the other three states of matter, and their behaviour will be explained in terms of their macroscopic and microscopic characteristics, of which some are given in Figures 315 and 316.

Characteristic | Solid | Liquid | Gas
Shape | Definite | Variable | Variable
Volume | Definite | Definite | Variable
Compressibility | Almost incompressible | Very slightly compressible | Highly compressible
Diffusion | Small | Slow | Fast
Comparative density | High | High | Low

Figure 315 Some macroscopic characteristics of solids, liquids and gases

Macroscopic properties are all the observable behaviours of that material such as shape, volume and compressibility.
The many macroscopic or physical properties of a substance can provide evidence for the nature and structure of that substance.

Characteristic | Solid | Liquid | Gas
Kinetic energy | Vibrational | Vibrational, rotational, some translational | Mostly translational, higher rotational, higher vibrational
Potential energy | High | Higher | Highest
Mean molecular separation (r₀) | r₀ | r₀ | 10r₀
Thermal energy of particles (ε) | < ε/10 | < ε, > ε/10 | > ε
Molecules per m³ | 10²⁸ | 10²⁸ | 10²⁵

Figure 316 Some microscopic characteristics of solids, liquids and gases

Microscopic characteristics help to explain what is happening at the atomic level, and this part of the model will be interpreted further at a later stage.

The modern technique of X-ray diffraction that will be studied in detail in a later chapter has enabled scientists to determine the arrangement of particles in solids. The particles are closely packed and each particle is strongly bonded to its neighbour and is held fairly rigidly in a fixed position to give it definite shape in a crystalline lattice. Some patterns are disordered, as is the case for ceramics, rubber, plastics and glass. These substances are said to be amorphous. The particles have vibrational kinetic energy in their fixed positions and the force of attraction between the particles gives them potential energy.

In liquids the particles are still closely packed and the bonding between particles is still quite strong. However, they are not held as rigidly in position and the bonds can break and reform. This implies that the particles can slowly and randomly move relative to each other to produce variable shape and slow diffusion. Particles in a liquid have vibrational, rotational and some translational kinetic energy due to their higher mean speeds. The potential energy of the particles in a liquid is somewhat higher than for a solid because the spacing between the particles is larger.

In gases the particles are widely spaced and the particles only interact significantly on collision or very close approach. Because of the rapid random zig-zag motion of the particles, a gas will become dispersed throughout any container into which it is placed. Diffusion (the spreading out from the point of release) can occur readily. Gases are compressible because the particles are widely spaced at a distance much greater than the size of the particles. The much higher mean speeds are due to an increased translational kinetic energy of the particles. Gases have a much higher potential energy than liquids because the particles are much further apart.
Figure 317 Heating curve for benzene (regions labelled SOLID, LIQUID and GAS; melting point 5.5 °C).

When the solid benzene is heated the temperature begins to rise. When the temperature reaches 5.5 °C the benzene begins to melt. Although heating continues, the temperature of the solid-liquid benzene mixture remains constant until all the benzene has melted. Once all the benzene has melted the temperature starts to rise until the liquid benzene begins to boil at a temperature of 80 °C. With continued heating the temperature remains constant until all the liquid benzene has been converted to the gaseous state. The temperature then continues to rise as the gas is in a closed container.
When water is left in a container outside, exposed to the atmosphere, it will eventually evaporate. Mercury from broken thermometers has to be cleaned up immediately due to its harmful effects. Water has a boiling point of 100 °C and mercury has a boiling point of 357 °C. Yet they both evaporate at room temperature.

The process of evaporation is a change from the liquid state to the gaseous state that occurs at a temperature below the boiling point.

The moving particle theory can be applied to understand the evaporation process. A substance at a particular temperature has a range of kinetic energies. So in a liquid at any particular instant, a small fraction of the molecules will have kinetic energies considerably greater than the average value. If these particles are near the surface of the liquid, they may have enough kinetic energy to overcome the attractive forces of neighbouring particles and escape from the liquid as a vapour. Now that the more energetic particles have escaped, the average kinetic energy of the remaining particles in the liquid has been lowered. Since temperature is proportional to the average kinetic energy of the particles, a lower kinetic energy implies a lower temperature, and this is the reason why the temperature of the liquid falls as evaporative cooling takes place.

Another way of explaining the temperature drop is in terms of latent heat. As a substance evaporates, it needs thermal energy input to replace its lost latent heat of vaporisation and this thermal energy can be obtained from the remaining liquid and its surroundings.

A substance that evaporates rapidly is said to be a volatile liquid. A liquid's volatility is controlled by a factor known as its equilibrium vapour pressure. There are forces that must be overcome before a particle can leave the surface of a liquid. Different liquids exert different vapour pressures that depend on the relative strengths of the intermolecular forces present in the liquids. Hydrofluorocarbons and CFCs used in refrigerators, and ether, chloroform and ethanol have relatively high vapour pressures. The values in Figure 318 compare the vapour pressure of some liquids at 293 K.

Substance | Vapour pressure / kPa
Ether | 58.9
Chloroform | 19.3
Ethanol | 5.8
Water | 2.3
Mercury | 0.0002

Figure 318
throughout the liquid. If the vapour pressure of the bubble is less than the atmospheric pressure the bubbles are crushed. However a point is reached when the pressures are equal. The bubble will then increase in size as it rises to the surface of the liquid.
Exercise 3.2 (b)

1. When smoke is strongly illuminated and viewed under a microscope it is possible to observe
A. all particles moving in straight lines
B. smoke particles being moved randomly by air molecules
C. smoke particles colliding with each other
D. air molecules in random motion

2. The internal energy of a monatomic gas such as neon is mainly due to
A. the potential energy holding the atoms in fixed positions
B. the vibrational energy of the atoms
C. the random translational energy of the atoms
D. the rotational energy of the atoms

3. For a given mass of a certain liquid, the magnitude of the thermal energy transfer is the same for the following two processes
A. freezing and sublimation
B. melting and evaporation
C. evaporation and condensation
D. sublimation and condensation

4. Which of the following is a unit of thermal energy?
A. watt
B. the product of the newton and the metre
C. the quotient of the watt and the second
D. the product of the joule and the second

Base your answers to Questions 5 and 6 on the following graph. The graph shows the temperature of an unknown substance of mass 10.0 kg as heat is added at a constant rate of 6300 J min⁻¹. The substance is a solid at 0 °C.

(Graph of temperature / °C against time / min)

5. The internal potential energy of the unknown substance increases without any change in internal kinetic energy from the beginning of the:
A. first minute to the end of the fourth minute
B. seventh minute to the end of the seventeenth minute
C. seventeenth minute to the end of the twenty first minute
D. nineteenth to the end of the twenty fifth minute

6. The specific heat capacity of the substance when it is solid is:
A. 63 J kg⁻¹ K⁻¹
B. 105 J kg⁻¹ K⁻¹
C. 126 J kg⁻¹ K⁻¹
D. 504 J kg⁻¹ K⁻¹

7. Give five macroscopic and five microscopic characteristics of the liquid/gas in a butane lighter.

8. Describe the components of internal energy in each of the following situations
(a) air at room temperature
(b) a jar of honey
(c) a melting ice cream.

9. Explain the difference between heat, thermal energy and temperature.

10. Does a block of ice contain any heat? Explain your answer fully.

11. Draw a fully labelled cooling curve for the situation when steam at 110 °C is converted to ice at −25 °C.
12. (a) Convert 63 °C to kelvin.
(b) Convert 52 K to degrees Celsius.

13. The temperatures of the same volume of air and water are raised by a small amount. Explain why a different amount of heat is required for each process.

14. If you increase the heat under a pot of water in which you are boiling potatoes, will the potatoes be cooked faster?

15. If you wanted to cool a bottle of soft drink at the beach, would you be better to wrap a wet towel around it or to put it into the seawater? Explain your answer.

16. Why is it important not to stand in a draught after vigorous exercise?

17. Describe and explain the process of evaporative cooling in terms of its microscopic properties.

18. A kettle made of stainless steel containing water is heated to a temperature of 95 °C. Describe the processes of thermal energy transfer that are occurring in the stainless steel kettle and the water.

When thermal energy is absorbed/released by a body, the temperature may rise/fall, or it can remain constant. If the temperature remains constant then a phase change will occur as the thermal energy must either increase the potential energy of the particles as they move further apart or decrease the potential energy of the particles as they move closer together. If the temperature changes, then the energy must increase the average kinetic energy of the particles.

[Figure 320 Macroscopic transformations between states of matter. Axis label: thermal energy added.]

Sublimation is a change of phase directly from a solid to a gas or directly from a gas to a solid. Iodine and solid carbon dioxide are examples of substances that sublime.

The quantity of heat required to change one kilogram of a substance from one phase to another is called the latent heat of transformation.

$Q = mL$

Q is the quantity of heat absorbed or released during the phase change in J, m is the mass of the substance in kg and L is the latent heat of the substance in J kg⁻¹.

L could be the latent heat of fusion $L_f$, the latent heat of vaporisation $L_v$ or the latent heat of sublimation $L_s$. The latent heat of fusion of a substance is less than the latent heat of vaporisation or the latent heat of sublimation. More work has to be done to reorganise the particles as they increase their volume in vaporisation and sublimation than the work required to allow particles to move from their fixed position and slide over each other in fusion. Figure 321 lists the latent heat of some substances.

Substance   Melting point / K   Latent heat of fusion / 10⁵ J kg⁻¹   Boiling point / K   Latent heat of vaporisation / 10⁵ J kg⁻¹
Oxygen      55                  0.14                                  90                  2.1
Ethanol     159                 1.05                                  351                 8.7
Lead        600                 0.25                                  1893                7.3
Copper      1356                1.8                                   2573                73
Water       273                 3.34                                  373                 22.5

Example

Calculate the heat energy required to evaporate 5.0 kg of ethanol at its boiling point.
Solution

Given that m = 5.0 kg and $L_v$ = 8.7 × 10⁵ J kg⁻¹, we then have

$Q = mL_v = 5.0 \times 8.7 \times 10^5 = 4.35 \times 10^6$ J $\approx 4.4 \times 10^6$ J

The heat energy required for the vaporisation is 4.4 × 10⁶ J.

Example

Determine the heat energy released when 1.5 kg of gaseous water at 100 °C is placed in a freezer and converted to ice at −7 °C. The specific heat capacity of ice is 2.1 × 10³ J kg⁻¹ K⁻¹.

Solution

The energy changes in this process can be represented as shown in Figure 322.

[Figure 322: gaseous water at 100 °C condenses to liquid water, the liquid water cools from 100 °C to 0 °C, it freezes, and the ice cools from 0 °C to −7 °C.]

Using

$Q = mL_V + mc\Delta T_{WATER} + mL_f + mc\Delta T_{ICE}$

$= m[L_V + c\Delta T_{WATER} + L_f + c\Delta T_{ICE}]$

$= 1.5\,[22.5 \times 10^5 + (4180 \times 100) + 3.34 \times 10^5 + (2100 \times 7)]$

$= 4.53 \times 10^6$ J

That is, the energy released is 4.5 × 10⁶ J or 4.5 MJ.

The latent heat of vaporisation can be found using a self-jacketing vaporiser as shown in Figure 323. The liquid to be vaporised is heated electrically so that it boils at a steady rate. The vapour that is produced passes to the condenser through holes labelled H in the neck of the inner flask. Condensation occurs in the outer flask and the condenser.

[Figure 323 Latent heat of vaporisation apparatus. Labels: ammeter A, holes H, vapour acting as a jacket, liquid under investigation, heating coil, water outflow, condenser.]

Eventually, the temperature of all the parts of the apparatus becomes steady. When this steady state is reached, a container of known mass is placed under the condenser outlet for a measured time t, and the measured mass of the condensed vapour m is determined. The heater current I is measured with the ammeter A and the potential difference V is measured with a voltmeter V. They are closely monitored and kept constant with a rheostat. The electrical energy supplied is used to vaporise the liquid and some thermal energy H is lost to the surroundings. Therefore:

$V_1I_1t = m_1L_V + H$

In order to eliminate H from the relationship, the process is repeated using a different heater potential difference and current. The vapour is collected for the same time t. The rate of vaporisation will be different but the heat lost to the surroundings will be the same as each part of the
apparatus will be at the same temperature as it was with the initial rate of vaporisation. Therefore:

$V_2I_2t = m_2L_V + H$

Subtracting the second relationship from the first eliminates H:

$(V_1I_1 - V_2I_2)\,t = (m_1 - m_2)\,L_V$

From this equation, the value of the latent heat of vaporisation of the unknown substance can be determined.

Exercise 3.2 (c)

1. The specific latent heat of fusion of ice is the heat required to
A. raise the temperature of ice from 0 °C to 10 °C
B. change 1 dm³ of ice at 0 °C to water at 0 °C
C. change 1 kg of ice at 0 °C to water at 0 °C
D. change the temperature of 1 kg by 10 °C

2. A substance changes from liquid to gas at its normal boiling temperature. What change, if any, occurs in the average kinetic energy and the average potential energy of its molecules?

     Average kinetic energy    Average potential energy
A.   constant                  increases
B.   increases                 constant
C.   increases                 decreases
D.   constant                  constant

3. Thermal energy is transferred to a mass of water in four steps. Which one of the four steps requires the most thermal energy?
A. 5 °C to 20 °C
B. 15 °C to 35 °C
C. 75 °C to 90 °C
D. 95 °C to 101 °C

4. Determine the amount of thermal energy that is required to melt 35 kg of ice at its melting point.

5. A 5.0 × 10² g aluminium block is heated to 350 °C. Determine the number of kilograms of ice at 0 °C that the aluminium block will melt as it cools.

6. Steam coming from a kettle will give you a nastier burn than boiling water. Explain why.

7. An immersion heater can supply heat at a rate of 5.2 × 10² J s⁻¹. Calculate the time that it will take to completely evaporate 1.25 × 10⁻¹ kg of water initially at a temperature of 21 °C.

8. A 3.45 kg sample of iron is heated to a temperature of 295 °C and is then transferred to a 2.0 kg copper vessel containing 10.0 kg of a liquid at an initial temperature of 21.0 °C. If the final temperature of the mixture is 31.5 °C, determine the specific heat capacity of the liquid.

9. A mass of dry steam at 1.0 × 10² °C is blown over 1.5 kg of ice at 0.0 °C in an isolated container. Calculate the mass of steam needed to convert the ice to water at 21.5 °C.

10. A freezer in a refrigerator takes 2.00 hours to convert 2.15 kg of water initially at 21.5 °C to just frozen ice. Calculate the rate at which the freezer absorbs heat.

11. Describe an experiment to determine the specific heat capacity of an unknown metal. Sketch the apparatus used and describe what measurements are made. State the main sources of error and explain how they can be minimised.

12. Calculate how much thermal energy is released when 1.2 kg of steam at 100 °C is condensed to water at the same temperature. ($L_v$ = 2.25 × 10⁶ J kg⁻¹)

13. Determine how much energy is released when 1.5 kg of gaseous water at 100 °C is placed in a freezer and converted to ice at −7 °C. (The specific heat capacity of ice is 2.1 × 10³ J kg⁻¹ K⁻¹.)

14. Describe an experiment that can be used to determine the latent heat of vaporisation of a liquid.
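The latent heat calculations in this section can be checked numerically. The following is a minimal sketch, not part of the original text: the function names are hypothetical, the latent heats are the values listed in Figure 321, the value 4180 J kg⁻¹ K⁻¹ for water is the one used in the worked solution, and the last function assumes that H is eliminated by subtracting the two vaporiser relationships $V_1I_1t = m_1L_V + H$ and $V_2I_2t = m_2L_V + H$.

```python
# Values taken from Figure 321 and the worked examples (SI units).
L_V_ETHANOL = 8.7e5   # latent heat of vaporisation of ethanol, J/kg
L_V_WATER = 22.5e5    # latent heat of vaporisation of water, J/kg
L_F_WATER = 3.34e5    # latent heat of fusion of water, J/kg
C_WATER = 4180.0      # specific heat capacity of liquid water, J/(kg K)
C_ICE = 2.1e3         # specific heat capacity of ice, J/(kg K)


def phase_change_heat(mass_kg, latent_heat):
    """Q = m L for a phase change at constant temperature."""
    return mass_kg * latent_heat


def steam_to_ice_heat(mass_kg, ice_temp_c):
    """Energy released when steam at 100 C becomes ice at ice_temp_c (<= 0 C)."""
    condense = mass_kg * L_V_WATER                    # gas -> liquid at 100 C
    cool_water = mass_kg * C_WATER * 100.0            # liquid cools 100 C -> 0 C
    freeze = mass_kg * L_F_WATER                      # liquid -> solid at 0 C
    cool_ice = mass_kg * C_ICE * (0.0 - ice_temp_c)   # ice cools 0 C -> ice_temp_c
    return condense + cool_water + freeze + cool_ice


def latent_heat_from_two_runs(v1, i1, m1, v2, i2, m2, t):
    """Solve (V1 I1 - V2 I2) t = (m1 - m2) Lv for Lv (assumed elimination of H)."""
    return (v1 * i1 - v2 * i2) * t / (m1 - m2)


if __name__ == "__main__":
    print(phase_change_heat(5.0, L_V_ETHANOL))   # ethanol example
    print(steam_to_ice_heat(1.5, -7.0))          # steam-to-ice example
```

Writing the steam-to-ice process as four separate terms mirrors the four stages of the worked solution, so each stage can be inspected on its own.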
certain limited conditions but they can condense to liquids, then solidify if the temperature is lowered. Furthermore, there are relatively small forces of attraction between particles of a real gas, and even this is not allowable for an ideal gas. Most gases, at temperatures well above their boiling points and pressures that are not too high, behave like an ideal gas. In other words, real gases vary from ideal gas behaviour at high pressures and low temperatures.

When the moving particle theory is applied to gases it is generally called the kinetic theory of gases. The kinetic theory relates the macroscopic behaviour of an ideal gas to the behaviour of its molecules. The assumptions or postulates of the moving particle theory are extended for an ideal gas to include:

• Gases consist of tiny particles called atoms (monatomic gases such as neon and argon) or molecules.
• The total number of molecules in any sample of a gas is extremely large.
• The molecules are in constant random motion.
• The range of the intermolecular forces is small compared to the average separation of the molecules.
• The size of the particles is relatively small compared with the distance between them.
• Collisions of short duration occur between molecules and the walls of the container and the collisions are perfectly elastic.
• No forces act between particles except when they collide, and hence particles move in straight lines.
• Between collisions the molecules obey Newton's Laws of motion.
3.2.9 PRESSURE
Investigations into the behaviour of gases involve measurement of pressure, volume, temperature and mass. Experiments use these macroscopic properties of a gas to formulate a number of gas laws. In 1643 Torricelli found that the atmosphere could support a vertical column of mercury about 76 cm high and the first mercury barometer became the standard instrument for measuring pressure. The pressure unit 760 mm Hg (760 millimetres of mercury) represented standard atmospheric pressure. In 1646, Pascal found that the atmosphere could support a vertical column of water about 10.4 m high. For our purposes in this section, pressure can be defined as the force exerted over an area.

Pressure = Force / Area

$P = \frac{F}{A}$

The SI unit of pressure is the pascal, Pa.

1 atm = 1.01 × 10⁵ N m⁻² = 101.3 kPa = 760 mmHg
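As a rough check on the barometer figures quoted above, the pressure at the base of a liquid column can be estimated from $P = \rho g h$. This relation and the densities used below are assumptions brought in for illustration (they are not stated in the text), and the function names are hypothetical.

```python
G = 9.81               # acceleration of free fall, m/s^2 (assumed value)
RHO_MERCURY = 13600.0  # density of mercury, kg/m^3 (assumed value)
RHO_WATER = 1000.0     # density of water, kg/m^3 (assumed value)


def pressure(force_n, area_m2):
    """P = F / A, in pascals."""
    return force_n / area_m2


def column_pressure(density, height_m, g=G):
    """Pressure at the base of a liquid column, P = rho * g * h, in pascals."""
    return density * g * height_m


if __name__ == "__main__":
    # 76 cm of mercury and 10.4 m of water should both be close to 1 atm.
    print(column_pressure(RHO_MERCURY, 0.76))
    print(column_pressure(RHO_WATER, 10.4))
```

Both columns come out within about one percent of the 1.01 × 10⁵ N m⁻² quoted for one atmosphere, which is why such different heights represent the same pressure.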
Based on these postulates the view of an ideal gas is one of molecules moving in random straight line paths at constant speeds until they collide with the sides of the container or with one another. Their paths over time are therefore zig-zags. Because the gas molecules can move freely and are relatively far apart, they occupy the total volume of a container. The large number of particles ensures that the number of particles moving in all directions is constant at any time.
The peak of each curve is at the most probable speed $v_p$; a large number of particles in a sample of gas have their speeds in this region. When the mathematics of statistical mechanics is applied it is found that the mean speed $v_{av}$ is higher than the most probable speed. Another quantity more often used is called the root mean square speed $v_{rms}$ and it is equal to the square root of the mean squared speed.
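At the microscopic level, temperature is regarded as the measure of the average kinetic energy per molecule associated with its movements. For gases, it can be shown that the average kinetic energy, $\overline{E_k} = \frac{1}{2}m\overline{v^2} = \frac{3}{2}kT$, where k is the Boltzmann constant and T is the absolute temperature.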
$v_{rms} = \sqrt{\overline{v^2}}$

The root mean square speed is higher than the mean speed. Other features of the graphs show that the higher the temperature, the more symmetric the curve becomes. The average speed of the particles increases and the peak is lowered and shifted to the right. The areas under the graphs only have significance when N is defined in a different way from above. Figure 326 shows the distribution of the number of particles with a particular energy N against the kinetic energy of the particles $E_k$ at a particular temperature. The shape of the kinetic energy distribution curve is similar to the speed distribution curve and the total energy of the gas is given by the area under the curve.

[Figure 325 Maxwell-Boltzmann speed distribution for the same gas at different temperatures. Marked speeds: $v_{mp}$, $\bar{v}$, $v_{rms}$.]

[Figure 326 Distribution of kinetic energies for the same gas at different temperatures. Axes: N against $E_k$, with curves for $T_1$ and $T_2$ where $T_2 > T_1$.]

The average kinetic energy of the particles of all gases is the same. However, gases have different masses. Hydrogen molecules have about one-sixteenth the mass of oxygen molecules and therefore have higher speeds if the average kinetic energy of the hydrogen and the oxygen are the same. Because the collisions are perfectly elastic there is no loss in kinetic energy as a result of the collisions.
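The comparison between hydrogen and oxygen can be made quantitative. If two gases have the same average kinetic energy, then $\frac{1}{2}m_1\overline{v_1^2} = \frac{1}{2}m_2\overline{v_2^2}$, so the ratio of their root mean square speeds is $\sqrt{m_2/m_1}$. The sketch below assumes only this equality; the function name is hypothetical.

```python
import math


def rms_speed_ratio(mass_a, mass_b):
    """Ratio v_rms(a) / v_rms(b) for two gases with equal average kinetic energy."""
    return math.sqrt(mass_b / mass_a)


if __name__ == "__main__":
    # Hydrogen molecules have about one-sixteenth the mass of oxygen molecules.
    print(rms_speed_ratio(1.0, 16.0))
```

With a mass ratio of 1 to 16, the hydrogen molecules move about four times as fast as the oxygen molecules at the same temperature.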
$P \propto \frac{1}{V}$ so that $PV = \text{constant}$

When the conditions are changed, with the temperature still constant,

$P_1V_1 = P_2V_2$

The readings of P and V must be taken slowly to maintain constant temperature because when air is compressed, it warms up slightly. When a pressure versus volume graph is drawn for the collected data a hyperbola is obtained, and when pressure is plotted against the reciprocal of volume a straight line (direct proportionality) is obtained. See Figure 327.

[Figure 327: graphs of pressure P / mm Hg against volume V / cm³, pressure P / mm Hg against 1/V / cm⁻³, and PV against P.]

The pressure that the molecules exert is due to their collisions with the sides of the container. When the volume of the container is decreased, the frequency of the particle collisions with the walls of the container increases. This means that there is a greater force in a smaller area leading to an increase in pressure. The pressure increase has nothing to do with the collisions of the particles with each other.

In 1787 Jacques Charles (1746-1823) performed experiments to investigate how the volume of a gas changed with temperature. Gay-Lussac (1778-1850) published more accurate investigations in 1802. A very simple apparatus to investigate Charles' Law is shown in Figure 328. A sample of dry air is trapped in a capillary tube by a bead of concentrated sulfuric acid. The capillary tube is heated in a water bath and the water is constantly stirred to ensure that the whole air column is at the same temperature.

[Figure 328 Apparatus for Charles' law. Labels: thermometer, capillary tube, bead of acid (e.g. sulfuric acid), air column, water bath, heat.]

The investigation should be carried out slowly to allow thermal energy to pass into or out of the thick glass walls of the capillary tube. When the volume and temperature measurements are plotted, a graph similar to Figure 329 is obtained.

[Figure 329 Variation of volume with temperature. Axes: volume V / cm³ against temperature T / °C (from −273) and T / K (from 0).]

Note from the extrapolation of the straight line that the volume of gases would theoretically be zero at −273 °C, called absolute zero. The scale chosen is called the Kelvin scale, K.

The Charles (Gay-Lussac) Law of gases states that: The volume of a fixed mass of gas at constant pressure is directly proportional to its absolute (Kelvin) temperature.
This can also be stated as: The volume of a fixed mass of gas increases by 1/273 of its volume at 0 °C for every degree Celsius rise in temperature provided the pressure is constant.

$V \propto T \Rightarrow V = kT$ so that $\frac{V_1}{T_1} = k$

Therefore,

$\frac{V_1}{T_1} = \frac{V_2}{T_2}$

As the temperature of a gas is increased, the average kinetic energy per molecule increases. The increase in velocity of the molecules leads to a greater rate of collisions, and each collision involves greater impulse. Hence the volume of the gas increases as the collisions with the sides of the container increase.

Experiments were similarly carried out to investigate the relationship between the pressure and temperature of a fixed mass of various gases. The essential parts of the apparatus shown in Figure 330 are a metal sphere or round bottomed flask, and a Bourdon pressure gauge. The sphere/flask and Bourdon gauge are connected by a short column of metal tubing/capillary tube to ensure that as little air as possible is at a different temperature from the main body of enclosed gas. The apparatus in Figure 330 allows the pressure of a fixed volume of gas to be determined as the gas is heated.

[Figure 330: apparatus for investigating pressure and temperature. Labels include: retort stand.]

The variation in pressure as the temperature is changed is measured and graphed. A typical graph is shown in Figure 331.

[Figure 331 Variation of pressure with temperature. Axes: pressure P / kPa against temperature T / °C (−273, 0, 100) and T / K (0, 273, 373).]

The Pressure (Amontons) Law of Gases states that: The pressure of a fixed mass of gas at constant volume is directly proportional to its absolute (Kelvin) temperature.

$P \propto T \Rightarrow P = kT$ so that $\frac{P_1}{T_1} = k$

Therefore,

$\frac{P_1}{T_1} = \frac{P_2}{T_2}$

As the temperature of a gas is increased, the average kinetic energy per molecule increases. The increase in velocity of the molecules leads to a greater rate of collisions, and each collision involves greater impulse. Hence the pressure of the gas increases as the collisions with the sides of the container increase.
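The three gas laws of this section share one pattern: a quantity held constant and a ratio or product that stays fixed. The sketch below is illustrative only; the function names are hypothetical, and the conversion to kelvin uses the rounded value 273 that appears in the text.

```python
ZERO_CELSIUS_IN_KELVIN = 273.0  # rounded value used in the text


def to_kelvin(t_celsius):
    """Convert a Celsius temperature to the Kelvin scale."""
    return t_celsius + ZERO_CELSIUS_IN_KELVIN


def boyle_pressure(p1, v1, v2):
    """Constant temperature: P1 V1 = P2 V2, solved for P2."""
    return p1 * v1 / v2


def charles_volume(v1, t1_celsius, t2_celsius):
    """Constant pressure: V1 / T1 = V2 / T2 (T in kelvin), solved for V2."""
    return v1 * to_kelvin(t2_celsius) / to_kelvin(t1_celsius)


def pressure_law_pressure(p1, t1_celsius, t2_celsius):
    """Constant volume: P1 / T1 = P2 / T2 (T in kelvin), solved for P2."""
    return p1 * to_kelvin(t2_celsius) / to_kelvin(t1_celsius)


if __name__ == "__main__":
    # Halving the volume at constant temperature doubles the pressure.
    print(boyle_pressure(100.0, 2.0, 1.0))
    # A gas occupying 273 cm^3 at 0 C gains about 1 cm^3 per degree Celsius.
    print(charles_volume(273.0, 0.0, 1.0))
```

The second call illustrates the alternative statement of Charles' law: the volume grows by 1/273 of its 0 °C value for each degree Celsius rise.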
INTRODUCTION
What we learn in this chapter about oscillations in mechanical systems and of the waves that oscillating systems may set up, forms the basis for gaining an understanding of many other areas of physics. A study of oscillations is important for many reasons, not least, safety in design. For example, oscillations may be instigated in a bridge as traffic passes over it. In a worst case scenario these oscillations can lead to structural damage in the bridge. Many types of machines (lathes, car engines etc.) are also subject to oscillations and again, depending on the nature of the oscillations, machines can be damaged. Of course oscillating systems may also be very useful. The oscillations of a simple pendulum may be used as an accurate timing device and the oscillations set up in a quartz crystal may be used as an even more accurate timing device. If an oscillating body causes other particles with which it is in contact to oscillate, then the energy of the oscillating body may be propagated as a wave. An oscillating tuning fork, vibrating string and vibrating reed cause the air molecules with which they are in contact to oscillate thereby giving rise to a sound wave that we may hear as a musical note. As we shall see, oscillating systems and waves are intimately connected.
On a fundamental level, all atoms and molecules are in effect oscillating systems. An understanding of these oscillations is crucial to understanding both the microscopic and macroscopic properties of a substance. For example, the dependence of specific heat capacity on temperature, (a topic well beyond the scope of an IB Physics Course) arises from studying atomic oscillations. Also, by analysing the oscillations of atoms and molecules, we gain an understanding of the interaction between matter and radiation. For example, we shall see in Chapter 8 that the Greenhouse Effect is essentially due to the interaction between infrared radiation and gases such as carbon dioxide. It must also be mentioned that the oscillations of electrically charged particles give rise to electromagnetic waves (light, radio waves, X-rays etc). This is examined in more detail in Option G on Electromagnetic Waves. However, the concept of electromagnetic waves is used in several other places in both the Core and AHL material. We look first at the oscillations in mechanical systems.
CHAPTER 4
4.1.2 Define the terms displacement, amplitude, frequency, period, phase difference.
4.1.3 Define simple harmonic motion (SHM) and state the defining equation as $a = -\omega^2 x$.
4.1.4 Solve problems using the defining equation for simple harmonic motion.
4.1.5 Apply the equations $v = v_0\sin\omega t$, $v = v_0\cos\omega t$, $v = \pm\omega\sqrt{x_0^2 - x^2}$, $x = x_0\cos\omega t$, $x = x_0\sin\omega t$, as solutions to the defining equation for simple harmonic motion.
4.1.6 Solve problems, both graphically and by calculation, for acceleration, velocity and displacement during simple harmonic motion.
© IBO 2007
[Figure 401 A simple oscillating system. A pendulum with bob positions C, E, X, D and B marked, together with the angles θ₀ and −θ and the displacement x.]

To set the pendulum oscillating, the bob is pulled up to a position such as B where the angle XPB is θ₀. It is then released. The bob will now oscillate between the positions B and C.
Displacement (x, θ)
This refers to the distance that an oscillating system is from its equilibrium position at any particular instant during the oscillation. In the case of the simple pendulum the displacement is best measured as an angular displacement. For example, in Figure 401 when the bob is at the position D, the displacement is the angle XPD = θ and when at E, is the angle XPE = −θ. The displacement when the bob is at X is θ = 0.
Examples of oscillating systems include:
• a boat at anchor at sea
• the human vocal cords
• an oscillating cantilever
• the Earth's atmosphere after a large explosion.
Amplitude (x₀, θ₀)
This is the maximum displacement of an oscillating system from its equilibrium position. For the simple pendulum in Figure 401 this is clearly θ₀.
Period (T)
This is the time it takes an oscillating system to make one complete oscillation. For the simple pendulum in Figure 401, this is the time it takes to go from X to B, B to C and then back to X.
Frequency (f)
This is the number of complete oscillations made by the system in one second.
The time for one complete oscillation is the period T. Therefore the number of oscillations made in one second is $\frac{1}{T}$. The number of oscillations made in one second is also defined as the frequency f, hence:

$f = \frac{1}{T}$     Equation 4.1

Phase difference

Suppose we have for instance, two identical pendulums oscillating next to each other. If the displacements of the pendulums are the same at all instances of time, then we say that they are oscillating in phase. If on the other hand the maximum displacement of one of them is θ₀ when the maximum displacement of the other is −θ₀, then we say that they are oscillating in anti-phase or that the phase difference between them is 180°. The reason for the specification in terms of angle will become clear in section 4.1.5. In general, the phase difference between two identical systems oscillating with the same frequency can have any value between 0° and 360° (or 0 to 2π radians). We shall see that the concept of phase difference is very important when discussing certain aspects of wave motion.

Radian measure

When dealing with angular displacements, it is often useful to measure the displacement in radians rather than in degrees. In Figure 402, the angle θ measured in radians is defined as the arc length AB (l) divided by the radius r of the circle, i.e.

$\theta\,(\text{rad}) = \frac{l}{r}$     Equation 4.2

[Figure 402 The radian. A circle of radius r with an arc AB of length l subtending the angle θ.]

If θ = 180° then l is equal to half the circumference of the circle, i.e. l = πr. Hence from equation 4.2, we have that

$\theta\,(180°) = \frac{\pi r}{r} = \pi$ (rad)

[Figure 403: a pendulum tracing its motion on moving paper. Labels: motion of pendulum, paper, direction of paper.]
As the pendulum oscillates, the paper is pulled at a constant speed in the direction shown. Figure 404 shows a particular example of what is traced on the paper by the marker pen.

[Figure 404 A sample trace. Axes: displacement / cm (−0.4 to 0.4) against time / s (0 to 5).]

The displacement is measured directly from the trace and the time is calculated from the speed with which the paper is pulled. There are several things to notice about the trace.

1. One complete oscillation is similar to a sine or cosine graph.
2. The period stays constant at about 0.8 s. (Oscillations in which the period is constant are called isochronous.)
3. The amplitude is decaying with time. This is because the pendulum is losing energy to the surroundings due to friction at the point of suspension and to air resistance.

Based on this sort of time trace, we can define a special type of oscillatory motion. Oscillators that are perfectly isochronous and whose amplitude does not change with time are called simple harmonic oscillators and their motion is referred to as simple harmonic motion (SHM). Clearly SHM does not exist in the real world as the oscillations of any vibrating system will eventually die out. Interestingly enough, the simple pendulum does not perform SHM for yet another reason, namely that the period is actually dependent on the amplitude. However, this only becomes noticeable as θ₀ exceeds about 40°. Although SHM does not exist in the real world, many oscillatory systems approximate to this motion. Furthermore, as part of the scientific method, it makes good sense to analyse a simple situation before moving onto more complex situations.

Angular frequency (ω)

A very useful quantity associated with oscillatory motion is angular frequency, ω. This is defined in terms of the linear frequency as

$\omega = 2\pi f$     Equation 4.3

Using equation 4.1 we also have that

$\omega = \frac{2\pi}{T}$     Equation 4.4

There is a connection between angular frequency and the angular speed of a particle moving in a circle with constant speed. The angular speed of the particle is defined as the number of radians through which the particle moves in one second. If the time for one complete revolution of the circle is T, then from equation 4.2 we have that

$\omega = \frac{2\pi}{T}$ or $T = \frac{2\pi}{\omega}$

There is actually a physical connection between angular speed and SHM in the respect that it can be shown that the projection of the particle onto any diameter moves with SHM. See Figure 405.

As the particle P moves round the circle, its projection N onto a diameter moves backwards and forwards along the diameter with SHM.

Figure 405 Circular and harmonic motion
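The link between circular motion and SHM can be explored numerically. The sketch below is an illustration under stated assumptions, not the book's method: the projection of a particle moving round a circle of radius r with angular speed ω is taken as $x = r\cos\omega t$, and its acceleration is estimated by a finite difference so that it can be compared with $-\omega^2 x$. All function names are hypothetical.

```python
import math


def angular_frequency(period):
    """omega = 2 pi / T (Equation 4.4)."""
    return 2.0 * math.pi / period


def projection(radius, omega, t):
    """Position of the projection N on a diameter, assuming x = r cos(omega t)."""
    return radius * math.cos(omega * t)


def numerical_acceleration(radius, omega, t, dt=1.0e-4):
    """Central-difference estimate of the second derivative of the projection."""
    x_before = projection(radius, omega, t - dt)
    x_now = projection(radius, omega, t)
    x_after = projection(radius, omega, t + dt)
    return (x_after - 2.0 * x_now + x_before) / dt ** 2


if __name__ == "__main__":
    omega = angular_frequency(0.8)   # period of about 0.8 s, as in the trace
    x = projection(0.05, omega, 0.1)
    print(numerical_acceleration(0.05, omega, 0.1), -omega ** 2 * x)
```

The two printed values agree closely, which is the sense in which the projection moves with an acceleration proportional to, and directed against, its displacement.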
Definition of SHM
If it were possible to remove all frictional forces acting on an oscillating pendulum, then the displacement-time graph for the motion would look like that in Figure 406. The amplitude does not decay with time. This therefore is a displacement-time graph for SHM.
[Figure 406: graph of displacement / cm against time for SHM.]

It turns out that if the acceleration a of a system is directly proportional to its displacement x from its equilibrium position and is directed towards the equilibrium position, then the system will execute SHM. This is the formal definition of SHM. We can express this definition mathematically as

$a = -\text{const} \times x$     Equation 4.5

The negative sign indicates that the acceleration is directed towards the equilibrium position. Mathematical analysis shows that the constant is in fact equal to ω² where ω is the angular frequency (defined above) of the system. Hence equation 4.5 becomes

$a = -\omega^2 x$     Equation 4.6

This equation is the mathematical definition of SHM. If a system is performing SHM, then to produce the acceleration, a force must be acting on the system in the direction of the acceleration. From our definition of SHM, the magnitude of the force F is given by

$F = -kx$     Equation 4.7

where k is a constant and the negative sign indicates that the force is directed towards the equilibrium position of the system. (Do not confuse this constant k with the spring constant. However, when dealing with the oscillations of a mass on the end of a spring, k will be the spring constant.)

TOK A Mathematical Perspective

Galileo stated that the book of nature is written in mathematical terms. In this respect, it is impossible to explain to somebody why the period of oscillation of a simple pendulum depends on the square root of its length without recourse to solving a second order differential equation.

From Newton's second law, F = ma, where a is the acceleration of the system. However, the acceleration is not constant. For those of you who have a mathematical bent, the relation between the force and the acceleration is written as

$-kx = m\frac{d^2x}{dt^2}$

This is a second order differential equation. The solution of the equation gives x as a function of t. The actual solution of the SHM equation is

$x = P\cos\omega t + Q\sin\omega t$

where P and Q are constants and ω is the angular frequency of the system and is equal to $\sqrt{\frac{k}{m}}$.

Whether a particular solution involves the sine function or the cosine function depends on the so-called boundary conditions. If for example x = x₀ (the amplitude) when t = 0, then the solution is $x = x_0\cos\omega t$. The beauty of this mathematical approach is that, once the general equation has been solved, the solution for all systems executing SHM is known. All that has to be shown to know if a system will execute SHM is that the acceleration of the system is given by Equation 4.5 or the force is given by equation 4.7. The physical quantities that ω depends on are determined by the particular system. For example, for a weight of mass m oscillating at the end of a vertically supported spring whose spring constant is k, then $\omega = \sqrt{\frac{k}{m}}$ or, from equation 4.4, $T = 2\pi\sqrt{\frac{m}{k}}$. For a simple pendulum, $\omega = \sqrt{\frac{g}{l}}$ where l is the length of the pendulum and g is the acceleration of free fall, such that $T = 2\pi\sqrt{\frac{l}{g}}$.
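The period relations for the two standard systems can be evaluated directly. This is a minimal sketch with hypothetical function names; it assumes the standard results $T = 2\pi\sqrt{m/k}$ for a mass on a spring and $T = 2\pi\sqrt{l/g}$ for a simple pendulum, with g = 9.8 m s⁻² taken as a default value.

```python
import math


def spring_period(mass_kg, spring_constant):
    """T = 2 pi sqrt(m / k) for a mass oscillating on a spring."""
    return 2.0 * math.pi * math.sqrt(mass_kg / spring_constant)


def pendulum_period(length_m, g=9.8):
    """T = 2 pi sqrt(l / g) for a simple pendulum (small amplitude)."""
    return 2.0 * math.pi * math.sqrt(length_m / g)


def general_solution(p, q, omega, t):
    """x = P cos(omega t) + Q sin(omega t), the general SHM solution."""
    return p * math.cos(omega * t) + q * math.sin(omega * t)


if __name__ == "__main__":
    # Quadrupling the length of a pendulum doubles its period.
    print(pendulum_period(1.0), pendulum_period(4.0))
    # With Q = 0 the boundary condition x = x0 at t = 0 is satisfied.
    print(general_solution(0.05, 0.0, 3.0, 0.0))
```

The first line of output shows the square-root dependence on length mentioned in the TOK note: a pendulum four times as long takes twice as long per oscillation.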
Example

1. A cylindrical piece of wood floats upright in water as shown in Figure 407.

[Figure 407 SHM of a floating piece of wood. Labels: push downwards, water, wood.]

The wood is pushed downwards and then released. The subsequent acceleration a of the wood is given by the expression

$a = -\frac{\rho g}{\sigma l}x$

where ρ = density of water, σ = density of the wood, l = length of wood, g = acceleration of free fall and x = displacement of the wood from its equilibrium position.

(a) Explain why the wood executes SHM.

Answer

The equation shows that the acceleration of the wood is proportional to its displacement from equilibrium and directed towards the equilibrium position.

(b) The length of the wood is 52 cm and it is pushed downwards a distance of 24 cm. Calculate the maximum acceleration of the wood. (ρ = 1.0 × 10³ kg m⁻³, σ = 8.4 × 10² kg m⁻³, g = 9.8 m s⁻²).

Answer

$a_{max} = \frac{\rho g x_0}{\sigma l} = \frac{1.0 \times 10^3 \times 9.8 \times 0.24}{8.4 \times 10^2 \times 0.52} = 5.4$ m s⁻²

(c) Calculate the period T of oscillation of the wood.

Answer

$\omega = \sqrt{\frac{\rho g}{\sigma l}} = 4.7$ rad s⁻¹

$T = \frac{2\pi}{\omega} = 1.3$ s

(d) State and explain in terms of the period T of oscillation of the wood, the first two instances when the acceleration is a maximum.

Answer

The displacement will be a maximum when t = 0 and again when $t = \frac{T}{2}$.

So the acceleration is a maximum at t = 0 and t = 0.65 s.

[Figure 408: (a) a weight of mass m at rest on the end of a spring; (b) the weight displaced by x₀. Labels: mg, weight of mass m, x₀.]

In Figure 408 (a), the equilibrium extension of the spring is e and the net force on the weight is mg − ke = 0.
070813 Physics Ch 4 final.indd 104 22/05/2009 11:50:16 AM
However, we have to bear in mind that t varies between 0 and 2 where cost is negative for t for and sint is negative for t in the range to 2. This effectively means that when the displacement from equilibrium is positive, the velocity is negative and so directed towards equilibrium. When the displacement from equilibrium is negative, the velocity is positive and so directed away from equilibrium The sketch graph in Figure 409 shows the variation with time t of the displacement x and the corresponding variation with time t of the velocity v. This clearly demonstrates the relation between the sign of the velocity and sign of the displacement.
is, the weight will execute SHM with a frequency (angular frequency) $\omega = \sqrt{k/m}$. The displacement of the weight x, determined by solving Equation 4.8, is given by

$x = x_0 \cos \omega t$    Equation 4.9
This is the particular solution of the SHM equation for the oscillation of a weight on the end of a spring. This system is often referred to as a harmonic oscillator. The velocity v of the weight at any instant can be found by finding the gradient of the displacement-time graph. From Equation 4.9, the displacement graph is a cosine function and the gradient of a cosine function is a negative sine function. The gradient of $x = x_0 \cos \omega t$ is in fact $-\omega x_0 \sin \omega t$, so

$v = -\omega x_0 \sin \omega t = -v_0 \sin \omega t$    Equation 4.10

where v0 is the maximum and minimum velocity, equal in magnitude to ωx0. Students familiar with calculus will recognise the velocity v as

$v = \dfrac{dx}{dt} = \dfrac{d}{dt}(x_0 \cos \omega t) = -\omega x_0 \sin \omega t$. Similarly,

$a = \dfrac{dv}{dt} = \dfrac{d}{dt}(-\omega x_0 \sin \omega t) = -\omega^2 x_0 \cos \omega t = -\omega^2 x$

Figure 409 (displacement x and velocity v against time t, from 0 to 2T; amplitudes x0 and v0)

We can also see how the velocity v changes with displacement x. From Equation 4.10, we have that

$v = -\omega x_0 \sin \omega t$

However we can express sin ωt in terms of cos ωt using the trigonometric relation

$\sin^2 \theta + \cos^2 \theta = 1$

so

$v = \pm \omega x_0 \sqrt{1 - \cos^2 \omega t}$

Remembering that $x = x_0 \cos \omega t$ and putting x0 inside the square root gives

$v = \pm \omega \sqrt{x_0^2 - x^2}$    Equation 4.11
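As a check on the algebra, the short sketch below compares the speed obtained from the time expression (Equation 4.10) with the speed obtained from the displacement expression (Equation 4.11) at several instants in one period. The amplitude of 5 cm and period of 1 s are illustrative values, and the function names are made up for this example.

```python
import math

x0 = 5.0                      # amplitude / cm (illustrative)
period = 1.0                  # period / s (illustrative)
omega = 2 * math.pi / period  # angular frequency / rad s^-1

def velocity_from_time(t):
    """Equation 4.10: v = -omega * x0 * sin(omega * t)."""
    return -omega * x0 * math.sin(omega * t)

def speed_from_displacement(x):
    """Magnitude of Equation 4.11: |v| = omega * sqrt(x0^2 - x^2)."""
    # max(..., 0.0) guards against tiny negative values from rounding.
    return omega * math.sqrt(max(x0**2 - x**2, 0.0))

# Sample one period and compare the two expressions for the speed.
samples = []
for i in range(9):
    t = i * period / 8
    x = x0 * math.cos(omega * t)  # Equation 4.9
    samples.append((t, x, velocity_from_time(t), speed_from_displacement(x)))

for t, x, v, speed in samples:
    print(f"t = {t:.3f} s  x = {x:+.2f} cm  v = {v:+.2f} cm/s  |v| from x = {speed:.2f} cm/s")

max_speed = max(speed for _, _, _, speed in samples)
```

Equation 4.11 only fixes the magnitude of the velocity; the sign has to be taken from the direction of motion, which is why the comparison is made between speeds.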
The velocity is zero when the displacement is a maximum and is a maximum when the displacement is zero. The graph in Figure 410 shows the variation with x of the velocity v for a system oscillating with a period of 1 sec and with an amplitude of 5 cm. The graph shows the variation over a time of any one period of oscillation.
Figure 410 Velocity-displacement graph (axes: velocity v / cm s⁻¹ against displacement x / cm)

Boundary Conditions

The two solutions to the general SHM equation are $x = x_0 \cos \omega t$ and $x = x_0 \sin \omega t$. Which solution applies to a particular system depends, as mentioned above, on the boundary conditions for that system. For systems such as the harmonic oscillator and the simple pendulum, the boundary condition that gives the solution $x = x_0 \cos \omega t$ is that the displacement x = x0 when t = 0. For some other systems it might turn out that x = 0 when t = 0. This will lead to the solution $x = x_0 \sin \omega t$. From a practical point of view, the two solutions are essentially the same; for example when timing the oscillations of a simple pendulum, you might decide to start the timing when the pendulum bob passes through the equilibrium position. In effect, the two solutions differ in phase by π/2. The table in Figure 411 summarises the solutions we have for SHM.

Figure 411 Common equations

We should mention that since the general solution to the SHM equation is in fact the sum of these two solutions, there are three solutions to the equation. This demonstrates a fundamental property of second order differential equations: that one of the solutions to the equation is the sum of all the other solutions. This is the mathematical basis of the so-called principle of superposition.

Examples

The graph in Figure 412 shows the variation with time t of the displacement x of a system executing SHM.
Figure 412 (axes: displacement x / cm, from −10 to 10, against time t / s, from 0 to 3.5)
Use the graph to determine the
(i) period of oscillation
(ii) amplitude of oscillation
(iii) maximum speed
(iv) speed at t = 1.3 s
(v) maximum acceleration
Solutions

(i) period = 2.0 s
(ii) amplitude = 8.0 cm
(iii) Using $v_0 = \omega x_0 = \dfrac{2\pi}{T} x_0$ gives $v_0 = \pi \times 8.0 = 25$ cm s⁻¹
(iv) $v = -v_0 \sin \omega t = -25 \sin(1.3\pi)$. To find the value of the sine function, we have to convert the 1.3π into degrees (remember ω, and hence ωt, is measured in radians). 1 rad = 180/π deg, therefore 1.3π rad = 1.3 × 180 = 234°, therefore v = −25 sin(234°) = +20 cm s⁻¹.
Or we can solve using $v = \pm \omega \sqrt{x_0^2 - x^2}$: from the graph at t = 1.3 s, x = −4.8 cm, therefore $v = \pi \sqrt{8.0^2 - 4.8^2} = 20$ cm s⁻¹
(v) Using $a_{max} = \omega^2 x_0$ gives $a_{max} = \pi^2 \times 8.0 = 79$ cm s⁻²

Exercise 4.1

1. (a) Answer the same questions (a)(i) to (a)(iv) in the above example for the system oscillating with SHM as described by the graph in the Figure below.
(b) Also state two values of t for when the magnitude of the velocity is a maximum and two values of t for when the magnitude of the acceleration is a maximum.

(Figure: displacement x / cm, from −7 to 7, against time t / s, from 0 to 3.5)

4.2.1 Describe the interchange between kinetic energy and potential energy during SHM
4.2.2 Apply the expression $E_K = \frac{1}{2} m \omega^2 (x_0^2 - x^2)$ for the kinetic energy of a particle undergoing simple harmonic motion, $E_T = \frac{1}{2} m \omega^2 x_0^2$ for the total energy and $E_P = \frac{1}{2} m \omega^2 x^2$ for the potential energy.
4.2.3 Solve problems, both graphically and by calculation, involving energy changes during simple harmonic motion.
IBO 2007
That is, for any system performing SHM, the energy of the system is proportional to the square of the amplitude. This is an important result and one that we shall return to when we discuss wave motion. At x = 0 the spring is at its equilibrium extension and the magnitude of the velocity v of the oscillating mass is a maximum v0. The energy is all kinetic and again is equal to ET. We can see that this is indeed the case as the expression for the maximum kinetic energy Emax in terms of v0 is

$E_{max} = \frac{1}{2} m v_0^2$    Equation 4.13
Clearly ET and Emax are equal such that

$E_T = \frac{1}{2} k x_0^2 = E_{max} = \frac{1}{2} m v_0^2$

From which

$v_0^2 = \dfrac{k}{m} x_0^2$ (as v0 ≥ 0)

Therefore

$v_0 = \sqrt{\dfrac{k}{m}}\, x_0 = \omega x_0$

which ties in with the velocity being equal to the gradient of the displacement-time graph (see 4.1.5). As the system oscillates there is a continual interchange between kinetic energy and potential energy such that the loss in kinetic energy equals the gain in potential energy and

$E_T = E_K + E_P$    Equation 4.14

Clearly, the potential energy EP at any displacement x is given by

$E_P = \frac{1}{2} k x^2$    Equation 4.15

At any displacement x, the kinetic energy EK is $E_K = \frac{1}{2} m v^2$. Hence remembering that $v = \pm \omega \sqrt{x_0^2 - x^2}$, we have

$E_K = \frac{1}{2} m \omega^2 (x_0^2 - x^2)$

Although we have derived these equations for a harmonic oscillator, they are valid for any system oscillating with SHM. The sketch graph in Figure 414 shows the variation with displacement x of EK and EP for one period.

Figure 414 (potential and kinetic energy against displacement)

Example

The amplitude of oscillation of a mass suspended by a vertical spring is 8.0 cm. The spring constant of the spring is 74 N m⁻¹. Determine
(a) the total energy of the oscillator
(b) the potential and the kinetic energy of the oscillator at a displacement of 4.8 cm from equilibrium.

Solution

(a) $E_T = \frac{1}{2} k x_0^2 = \frac{1}{2} \times 74 \times (8.0 \times 10^{-2})^2 = 0.24$ J
(b) $E_P = \frac{1}{2} k x^2 = \frac{1}{2} \times 74 \times (4.8 \times 10^{-2})^2 = 0.085$ J and $E_K = E_T - E_P = 0.24 - 0.085 = 0.15$ J

Exercise 4.2

In a simple atomic model of a solid, the atoms vibrate with a frequency of 2.0 × 10¹¹ Hz. The amplitude of vibration of the atoms is 5.5 × 10⁻¹⁰ m and the mass of each atom is 4.8 × 10⁻²⁶ kg. Calculate the total energy of the oscillations of an atom.
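The energy bookkeeping for the spring example above (amplitude 8.0 cm, spring constant 74 N m⁻¹, displacement 4.8 cm) can be sketched in Python as follows. It assumes the total energy is ½kx0² and the potential energy at displacement x is ½kx², with the kinetic energy as the difference; the variable names are illustrative only.

```python
k = 74.0        # spring constant / N m^-1
x0 = 8.0e-2     # amplitude / m
x = 4.8e-2      # displacement of interest / m

total_energy = 0.5 * k * x0**2                    # all potential at x = x0
potential_energy = 0.5 * k * x**2                 # stored in the spring at x
kinetic_energy = total_energy - potential_energy  # the remainder is kinetic

print(f"E_T = {total_energy:.2f} J")
print(f"E_P = {potential_energy:.3f} J")
print(f"E_K = {kinetic_energy:.2f} J")
```

Changing x between 0 and x0 shows the interchange directly: the kinetic energy falls from the total energy to zero as the potential energy rises by the same amount.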
4.3.3 State what is meant by natural frequency of vibration and forced oscillations.
4.3.4 Describe graphically the variation with forced frequency of the amplitude of vibration of an object close to its natural frequency of vibration.
4.3.5 State what is meant by resonance.
4.3.6 Describe examples of resonance where the effect is useful and where it should be avoided.
IBO 2007
4.3.1 DAMPING
In this section, we look at oscillations of real systems. In section 4.1.3, we described an arrangement by which the oscillations of a pendulum could be transcribed onto paper. Refer to Figure 415.
Figure 415 Damping (axes: displacement against time)

The amplitude of the oscillations gradually decreases with time, whereas for SHM, the amplitude stays at the same value forever. Clearly, the pendulum is losing energy as it oscillates. The reason for this is that dissipative forces are acting that oppose the motion of the pendulum. As mentioned earlier, these forces arise from air resistance and through friction at the support. Oscillations for which the amplitude decreases with time are called damped oscillations.

Figure 416 (axes: amplitude against time)

Consider a harmonic oscillator in which the mass is pulled down and, when released, comes to rest at its equilibrium position without oscillating. The friction forces acting are such that they prevent oscillations. However, if a very small reduction in the friction forces would result in heavily damped oscillation of the oscillator, then the oscillator is said to be critically damped. The graph in Figure 417 shows this special case of damping known as critical damping.
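The gradual decrease in amplitude of a lightly damped oscillation can be illustrated numerically. The text gives no formula for the decay, so the sketch below assumes a commonly used model in which the amplitude falls exponentially with time; the decay constant `gamma` and all numerical values are hypothetical.

```python
import math

x0 = 1.0                      # initial amplitude (arbitrary units)
period = 1.0                  # period of the oscillation / s (illustrative)
omega = 2 * math.pi / period  # angular frequency / rad s^-1
gamma = 0.2                   # assumed decay constant / s^-1 (light damping)

def damped_displacement(t):
    """Assumed model: exponentially decaying envelope times a cosine."""
    return x0 * math.exp(-gamma * t) * math.cos(omega * t)

# Displacement at the start of each of the first five periods (the peaks).
peaks = [damped_displacement(n * period) for n in range(5)]

for n, peak in enumerate(peaks):
    print(f"after {n} periods: amplitude = {peak:.3f}")
```

In this model each peak is the same fraction of the one before it, so a larger `gamma` corresponds to heavier damping and a faster loss of energy.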
Figure 417 Critical damping (axes: displacement / m against time / s)

Exercise 4.3

Identify which of the following oscillatory systems are likely to be lightly damped and which are likely to be heavily damped.
1. Atoms in a solid
2. Car suspension
3. Guitar string
4. Harmonic oscillator under water
5. Quartz crystal
6. A cantilever that is not firmly clamped
7. Oil in a U-tube
8. Water in a U-tube
Although not in the IB syllabus, a useful way of classifying oscillating systems is by a quantity known as the quality factor or Q-factor. The Q-factor does have a formal definition but it is approximately equal in value to the number of oscillations that occur before all the energy of the oscillator is dissipated. For example, a simple pendulum has a Q-factor of about 1000.

As mentioned in the introduction to this chapter, the oscillations (vibrations) made by certain oscillatory systems can produce undesirable and sometimes dangerous effects. Critical damping plays an important role in these situations. For example, when a ball strikes the strings of a tennis racquet, it sets the racquet vibrating and these vibrations will cause the player to lose some control over his or her shot. For this reason, some players fix a damper to the strings. If placed on the strings in the correct position, this has the effect of producing critically damped oscillations and as a result the struck tennis racquet moves smoothly back to equilibrium. The same effect can be achieved by making sure that the ball strikes the strings at a point known as the sweet spot, of which there are two, one of which is known as the centre of percussion (COP). Cricket and baseball bats likewise have two sweet spots.

Another example is one that involves vibrations that may be set up in buildings when there is an earthquake. For this reason, in regions prone to earthquakes, the foundations of some buildings are fitted with damping mechanisms. These mechanisms ensure that any oscillations set up in the building are critically damped.
Figure 418
A forced oscillation
The amplitude of the swing will get larger and larger and if you are not careful your little brother or sister, or whoever the small child might be, will end up looping the loop. The frequency with which you push the swing is exactly equal to the natural frequency of oscillation of the swing and, importantly, is also in phase with the oscillations of the swing. Since you are actually forcing the swing to oscillate,
(There are many very good computer simulations available that enable you to explore the relation between forced and natural oscillations in detail.) The driving force and system are in phase if, when the amplitude of the system is a maximum, it receives maximum energy input from the driver. Clearly this is when the amplitude of the driver is a maximum. What is of particular interest is when the forced frequency is close to, and when it equals, the natural frequency. This we look at in the next two sections.
Figure 420 (axes: amplitude A / cm, from 0 to 100, against forced frequency f / Hz, from 0 to 30)
We see that the maximum amplitude is now very large and also very sharply defined. Also, either side of f0, the amplitude drops off very rapidly.
Figure 419 Forced frequency (axes: amplitude against forced frequency f / Hz, from 0 to 30)
In the introduction we mentioned the use of quartz crystals as timing devices. If a crystal is set oscillating at its natural frequency, electric charge constantly builds up and dies away on its surface in time with the vibration of the crystal. (This is known as the piezoelectric effect.) This makes it easy to maintain the oscillations using an alternating voltage supply as the driving frequency. The vibrations of the crystal are then used to maintain the frequency of oscillation in a resonant circuit. It is the oscillations in the resonant circuit that control the hands of an analogue watch or the display of a digital watch. (The concept of analogue and digital signals is discussed in Topics 14.1 and C.1.) It is left as an exercise to you to think of other situations in which resonance can be useful or can be harmful.
4.3.5 RESONANCE
(Note: As well as the availability of a large number of computer simulations that demonstrate resonance, there are also many laboratory demonstrations and experiments that can be done to demonstrate it).
We have seen that when an oscillatory system is driven at a frequency equal to its natural frequency, the amplitude of oscillation is a maximum. This phenomenon is known as resonance. The frequency at which resonance occurs is often referred to as the resonant frequency.
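How the amplitude of a driven system peaks near its natural frequency, and how damping affects the height and sharpness of that peak, can be explored with a short sketch. The text treats resonance qualitatively only; the code below assumes the standard steady-state amplitude expression for a driven, damped harmonic oscillator, and the natural frequency of 15 Hz and the two damping values are hypothetical choices.

```python
import math

def driven_amplitude(f_drive, f_natural, damping, force_per_mass=1.0):
    """Steady-state amplitude of a driven damped oscillator (assumed model).

    damping is the damping coefficient divided by the mass, in s^-1.
    """
    w = 2 * math.pi * f_drive
    w0 = 2 * math.pi * f_natural
    return force_per_mass / math.sqrt((w0**2 - w**2) ** 2 + (damping * w) ** 2)

f0 = 15.0                                 # natural frequency / Hz (illustrative)
freqs = [0.5 * k for k in range(1, 61)]   # forced frequencies 0.5 Hz to 30 Hz

light = [driven_amplitude(f, f0, 2.0) for f in freqs]    # light damping
heavy = [driven_amplitude(f, f0, 40.0) for f in freqs]   # heavy damping

peak_light = freqs[light.index(max(light))]
peak_heavy = freqs[heavy.index(max(heavy))]

print(f"light damping: peak at {peak_light} Hz, amplitude {max(light):.5f}")
print(f"heavy damping: peak at {peak_heavy} Hz, amplitude {max(heavy):.5f}")
```

With light damping the maximum is much larger and falls away quickly either side of the natural frequency; with heavy damping the peak is lower, broader, and shifted slightly below the natural frequency.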
4.4.2 State that progressive (travelling) waves transfer energy. (Students should understand that there is no net motion of the medium through which the wave travels). 4.4.3 Describe and give examples of transverse and of longitudinal waves.
Perhaps one of the most familiar types of wave motion is a water wave. However, we can also set up waves in strings very easily. A simple demonstration is to take a length of rubber tubing. Hold one end of it and shake that end up and down. A wave will travel down the tube. If we give the end of the tube just one shake then we observe a pulse to travel down the tube. By this we can see that we can have either a continuous travelling wave or a travelling pulse. This is illustrated in Figure 421 in which we have taken an instantaneous snap shot of the tube.
Figure 421 Panels (a) and (b) (labels: hand movement, pulse, equilibrium position)
We can also set up another type of wave by using a slinky spring. In this demonstration we lay the spring along the floor. Hold one end of it and move our hand backwards and forwards in the direction of the spring. In this way we see a wave travelling down the spring as a series of compressions and expansions of the spring as illustrated in Figure 422. We can also set up a pulse in the spring by moving our hand backwards and forwards just once in the direction of the spring. Of course we can set up a wave in the spring that is similar to the one we set up in the rubber tube. We shake the spring in a direction that is at right angles to the spring as shown in Figure 422.
Figure 422 Slinky springs (panels 2(a) and 2(b); labels: expansions, compress, expand)

A very important property associated with all waves is their so-called periodicity. Waves in fact are periodic both in time and space and this sometimes makes it difficult to appreciate what actually is going on in wave motion. For example, in our demonstration of a wave in a rubber tube we actually drew a diagram that froze time: an instantaneous snapshot of the whole tube. Hence Figure 421 shows the periodicity of the wave in space. The diagram is repeated as a sketch graph in Figure 423. The y-axis shows the displacement of the tube from its equilibrium position. The graph is a displacement-space graph. We now look at one particle of the tube labelled P and unfreeze time. The diagram in Figure 424 shows how the position of P varies with time. This illustrates the periodicity of the wave in time.

The y-axis now shows the displacement of the point P from equilibrium. The graph is a displacement-time graph. The space diagram and the time diagram are both identical in shape and if we mentally combine them we have the whole wave moving both in space and time.

Figure 424 Displacement-Time graph (axes: displacement of particle P from equilibrium position against time)

For the longitudinal wave in the slinky spring, the displacement-space graph actually shows the displacement of the individual turns of the spring from their equilibrium position as a function of distance along the spring. However, it could equally show how the density of turns of the spring varies with length along the spring. The displacement-time graph shows the displacement of one turn of the spring from its equilibrium position as a function of time.
demonstrations, the tube and the spring do not end up in a different part of the laboratory. Water waves however, can be a bit disconcerting. Waves at sea do not transport water but the tides do. Similarly, a wave on a lake does not transport water but water can actually be blown along by the wind. However, if you set up a ripple tank you will see that water is not transported by the wave set up by the vibrating dipper.
4.4.4 Describe waves in two dimensions, including the concepts of wavefronts and rays.
4.4.5 Describe the terms crest, trough, compression and rarefaction.
4.4.6 Define the terms displacement, amplitude, frequency, period, wavelength, wave speed and intensity.
IBO 2007
Transverse waves
In these types of wave, the source that produces the wave vibrates at right angles to the direction of travel of the wave, i.e. the direction in which the energy carried by the wave is propagated. It also means that the particles of the medium through which the wave travels vibrate at right angles to the direction of travel of the wave (direction of energy propagation). Figure 421 illustrates an example of a transverse wave. Light is another example of a transverse wave, although this is a very special kind of wave. Light waves are discussed in more detail in Topic G.1. An important property of transverse waves is that they cannot propagate through fluids (liquids or gases). This is one reason why light waves are special; they are transverse and yet can propagate through fluids and through a vacuum.
Figure 425 A ripple tank (labels: ripple tank, vibrating dipper, white paper)

Again this is a snapshot. However, by using a stroboscope as the source of illumination it is possible to freeze the waves. Each bright area of illumination represents a trough or crest. (Light incident on the top or bottom of a crest will be transmitted through the water and not reflected by the surface.) Each bright line representing a crest can be thought of as a wavefront and this is a very good way of representing a travelling wave as shown in Figure 426.
Longitudinal waves
In these types of wave, the source that produces the wave vibrates in the same direction as the direction of travel of the wave i.e. the direction in which the energy carried by the wave is propagated. It also means that the particles of the medium through which the wave travels vibrate in the same direction of travel of the wave (direction of energy propagation). The wave in the slinky spring in Figure 422 is a longitudinal wave as is sound (see Section 4.4.5).
Figure 426 A wavefront (labels: wavefronts, direction of travel)
If the wave is a light wave then the arrow that shows the direction of travel of the wave is none other than what we call a light ray.
Amplitude (A, a)

This is the maximum displacement of a particle from its equilibrium position. (It is also equal to the maximum displacement of the source that produces the wave.) The energy that a wave transports per unit time across unit area of the medium through which it is travelling is called the intensity (I). From our knowledge of SHM we know that the energy of the oscillating system is proportional to the square of the amplitude (Equation 4.12). Hence for a wave of amplitude A, we have that

$I \propto A^2$

Period (T)

This is the time that it takes a particle to make one complete oscillation. (It is also equal to the time for the source of the wave to make one complete oscillation.)

Frequency (f)

This is the number of oscillations made per second by a particle. (It is also equal to the number of oscillations made per second by the source of the wave.) The SI unit of frequency is the hertz (Hz). Clearly then, $f = \dfrac{1}{T}$

Wavelength (λ)

This is the distance along the medium between two successive particles that have the same displacement
(In some circumstances the wave speed is a function of wavelength, a phenomenon known as dispersion) Figure 427 shows how the different terms and definitions associated with waves relate to both transverse and longitudinal waves. From these diagrams, we also see that the wavelength of a transverse wave is equal to the distance between successive crests and also between successive troughs. For a longitudinal wave the wavelength is equal to the distance between successive points of maximum compression and also between successive points of maximum rarefaction.
Figure 427 (TRANSVERSE: displacement of tube from equilibrium position against distance along tube, showing crest, wavelength and equilibrium position of tube at time t = 0; displacement of particle against time, showing period T. LONGITUDINAL: compression, maximum compression, rarefaction, wavelength)

4.4.7 DISPLACEMENT-TIME AND DISPLACEMENT-POSITION GRAPHS
4.4.8 THE RELATIONSHIP BETWEEN WAVE SPEED, WAVELENGTH AND FREQUENCY

4.4.7 Draw and explain displacement-time and displacement-position graphs for transverse and for longitudinal waves
4.4.8 Derive and apply the relationship between wave speed, wavelength and frequency
4.4.9 State that all electromagnetic waves travel with the same speed in free space and recall the orders of magnitude of the wavelengths of the principal radiations in the electromagnetic spectrum.
IBO 2007

Figure 428 shows an instantaneous snapshot of a medium through which a wave is travelling. A particle of the medium is labelled P.

Figure 428 Instantaneous snapshot of displacement of medium

If we take another photograph half a period later then the particle P will be in the position shown in Figure 429.

Figure 429 Particle P half a period later

In this time the wave will have moved forward a distance of half a wavelength, λ/2. We have therefore that the speed v of the wave

$v = \dfrac{\text{distance}}{\text{time}} = \dfrac{\lambda / 2}{T / 2} = \dfrac{\lambda}{T}$, but $f = \dfrac{1}{T}$

Hence we have

$v = f\lambda$
Example

Water waves of wavelength 5.0 cm are travelling with a speed of 1.0 m s⁻¹. Calculate the frequency of the source producing the waves. The waves travel into deeper water where their speed is now 2.0 m s⁻¹. Calculate the new wavelength λnew of the waves.

Solution

Using v = fλ we have that 1.0 = f × 0.05 and so f = 20 Hz. In the deeper water, using v = fλ, we now have that 2.0 = 20 × λnew and so λnew = 0.10 m = 10 cm.
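The water-wave example can be reproduced with a couple of helper functions based on v = fλ. The function names are invented for this sketch; the key physical assumption, stated in the example, is that the frequency is set by the source and does not change when the waves enter deeper water.

```python
def wave_frequency(speed, wavelength):
    """Rearranged v = f * lambda: frequency in Hz."""
    return speed / wavelength

def wave_length(speed, frequency):
    """Rearranged v = f * lambda: wavelength in m."""
    return speed / frequency

shallow_speed = 1.0         # m s^-1
shallow_wavelength = 0.050  # m (5.0 cm)
deep_speed = 2.0            # m s^-1

f = wave_frequency(shallow_speed, shallow_wavelength)
new_wavelength = wave_length(deep_speed, f)  # same frequency in deeper water

print(f"frequency      = {f:.0f} Hz")
print(f"new wavelength = {new_wavelength * 100:.0f} cm")
```

Doubling the speed at fixed frequency doubles the wavelength, which is the relation used again when refraction is discussed.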
Figure 430 (electromagnetic spectrum chart: frequency f / Hz from 10³ to 10¹⁸; wavelength λ / m from 10⁶ to 10⁻¹⁰; regions labelled include radio waves, microwaves, infra-red and ultraviolet; sources labelled include hot objects and gas discharge)
REFLECTION

Figure 431

When the pulse reaches the end of the string it is reflected. Figure 432 shows the reflected pulse; this pulse is the negative part of a sine curve. Some of the energy of the pulse will actually be absorbed at the support and as such, the amplitude of the reflected pulse will be less than that of the incident pulse.

The main thing to note is that the pulse keeps its shape except that now it is inverted, i.e. the pulse has undergone a 180° (π) phase change. The reason for this is a little tricky to understand but it is essentially because the end of the string that is fixed cannot move. As any part of the forward pulse reaches the fixed end the associated point on the string is moving upward and so, if the fixed point is not to move, the point on the string that is moving upward must be cancelled out by a point moving downwards. This is an example of the so-called principle of superposition that we look at in Section 4.5.5. If the string is not attached to a support then a pulse is still reflected from the end of the string but this time there is no phase change and so the reflected pulse is not inverted.

Reflection of wavefronts

We can use the idea of wavefronts to see what happens when a wave strikes a barrier. This is demonstrated with a ripple tank. Figure 433 shows the incident wavefronts and the reflected wavefronts.

Figure 433 Incident and reflected wavefronts (labels: incident waves, reflected waves)

By constructing the associated rays (see Figure 433), we see that the angle at which the waves are reflected from the barrier is equal to the angle at which they are incident on the barrier (the angles are measured to the normal to the barrier). That is, i = r. All waves, including light, sound and water waves, obey this rule, the so-called law of reflection. We can use the idea of reflection to make a very simple measurement of the speed of sound. If you stand about 100 m away from a tall wall and clap your hands once, a short time later you will hear an echo of the clap. The sound pulse produced by your hand clap travels to the
Refraction

We now look to see what happens to a wave when it is incident on the boundary between two media and passes from one medium to the other (transmission). As for the single pulse, some energy will be absorbed at the boundary. Also, as well as energy being transmitted by the wave, some energy will be reflected at the boundary. Here we will concentrate on the transmitted energy. We have discussed previously the idea that the speed of a wave depends only on the nature and properties of the medium through which the wave travels. This gives rise to the phenomenon of refraction, that is, the change in direction of travel of a wave resulting from a change in speed of the wave. This is easily demonstrated with a ripple tank by arranging two regions of different depth. To achieve this a piece of flat perspex or glass is placed a short distance from the source of the waves as shown in Figure 434.

Figure 434 Ripple tank setup to demonstrate refraction (labels: light source, source of plane waves, perspex sheet, ripple tank, water)

Figure 435 shows the result of a continuous plane wave going from deep to shallow water.

Figure 435 Wavefronts in water of different depth (labels: deep water, shallow water, deep water, perspex sheet; NB: λd > λs)

In this diagram the wavefronts are parallel to the boundary between the two regions. The frequency of the waves does not alter so, as we have mentioned before, the wavelength in the shallow water will be smaller. If the speed of the waves in the deep water is vd and the speed in the shallow water is vs, then

$v_d = f\lambda_d$ and $v_s = f\lambda_s$

such that

$\dfrac{v_d}{v_s} = \dfrac{\lambda_d}{\lambda_s}$

In Figure 436 the wavefronts are now incident at an angle to the boundary between the deep and shallow water.

Figure 436 Waves incident at an angle (labels: deep, shallow, barrier, A, B, C)

As well as the wavelength being smaller in the shallow water the direction of travel of the wavefronts also alters. We can understand this by looking at the wavefront drawn in bold. By the time that part A of this wavefront reaches the barrier at B, the refracted wave originating from the barrier will have only reached C since it is travelling more slowly.
Figure 437 shows a light ray travelling from one medium to another. The line AB (called the normal) is a line constructed such that it is at right angles to the surface between the two media. It is used as a reference to enable the measure of the incident angle, i, and the angle of refraction r. Snell discovered that for any two media

$\dfrac{\sin i}{\sin r} = \text{constant}$

Figure 437 (labels: normal AB, medium 1, medium 2)

This is known as Snell's law. In fact it enables us to define a property of a given optical medium by measuring the angles when medium 1 is a vacuum. (In the school laboratory air will suffice.) The constant is then a property of medium 2 alone called its refractive index n. We usually write

$n = \dfrac{\sin i}{\sin r}$

When Snell published his law it was essentially an empirical law and the nature of light was still a matter of debate. Although we have described Snell's law for light rays, we must remember that a ray is a line that is perpendicular to the wavefronts of a wave. In this respect Snell's law is true for all waves. In Figure 438 the wavefronts in medium 1 travel with speed v1 and in medium 2 with speed v2.

Figure 438 Snell's law (labels: X, Y, A, B; angles θ1 and θ2)

In the time that it takes point A on the wavefront XA to reach Y, point X will have travelled to point B. If we let this time be t then we have AY = v1t and XB = v2t. But

$AY = XY \sin \theta_1$ and $XB = XY \sin \theta_2$

that is

$\dfrac{\sin \theta_1}{\sin \theta_2} = \dfrac{v_1 t}{v_2 t} = \dfrac{v_1}{v_2}$

That is, the constant in Snell's law is the ratio of the speed of light in medium 1 to that in medium 2. The result will of course be valid for all types of waves. For light we have that the refractive index of a material is the ratio of the speed of light in vacuo (c) to that in the material (v). We can write this as

$n = \dfrac{c}{v}$

As mentioned above this result has been confirmed for light by a direct measurement of the speed of light in water and subsequently in other materials.

Example

The refractive index of a certain type of glass is 1.5. The speed of light in free space is 3.0 × 10⁸ m s⁻¹. Calculate the speed of light in glass.

Solution

From Snell's law, the speed of light in glass can be found using $n = \dfrac{c}{v}$. So, with n = 1.5 and c = 3.0 × 10⁸ m s⁻¹, we have that v = 2.0 × 10⁸ m s⁻¹.
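The glass example, and the way the refractive index fixes the angle of refraction, can be sketched numerically. The code assumes n = c/v and sin i / sin r = n for light passing from air (treated as a vacuum) into the glass; the 30° angle of incidence and the function name are hypothetical and used only for illustration.

```python
import math

c = 3.0e8        # speed of light in free space / m s^-1
n_glass = 1.5    # refractive index of the glass

v_glass = c / n_glass   # from n = c / v

def refraction_angle_deg(incidence_deg, n):
    """Angle of refraction in degrees, from sin i / sin r = n."""
    sin_r = math.sin(math.radians(incidence_deg)) / n
    return math.degrees(math.asin(sin_r))

theta_r = refraction_angle_deg(30.0, n_glass)  # assumed angle of incidence

print(f"speed of light in glass = {v_glass:.1e} m/s")
print(f"angle of refraction for 30 degrees incidence = {theta_r:.1f} degrees")
```

Because the light slows down on entering the glass, the refracted ray lies closer to the normal than the incident ray.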
4.5.3 Explain and discuss qualitatively the diffraction of waves at apertures and obstacles.
4.5.4 Describe examples of diffraction.
IBO 2007

4.5.3 DIFFRACTION

When waves pass through a slit or any aperture, or pass the edge of a barrier, they always spread out to some extent into the region that is not directly in the path of the waves. This phenomenon is called diffraction. This is clearly demonstrated in a ripple tank. Figure 439 (a) shows plane waves incident on a barrier in which there is a narrow slit, the width of which is similar in size to the wavelength of the incident waves. Figure 439 (b) shows plane waves passing the edge of a barrier.

Figure 439 (a) Diffraction at a slit (b) Diffraction at an edge

To understand this, we use an idea put forward by Christiaan Huygens (1629-1695). Huygens suggested that each point on any wavefront acts as a source of a secondary wave that produces waves with a wavelength equal to the wavelength associated with the wavefront. In Figure 440 we can see how this works in the case of plane wavefronts.

Figure 440 Planar waves (labels: wavefront, secondary wave, new wavefront)

Each point on the wavefront is the source of a secondary wave and the new wavefront is found by linking together the effect of all the secondary waves. Since there are, in this case, an infinite number of them, we end up with a wavefront parallel to the first wavefront. In the case of the plane waves travelling through the slit in Figure 439 (a), it is as if the slit becomes a secondary point source. If we look at the effect of plane waves incident on a slit whose width is much larger than the incident wavelength, as shown in Figure 441, then we see that diffraction effects are minimal. We can understand this from the fact that each point on the slit acts as a secondary source and we now have a situation where the waves from the secondary sources result in a wavefront that is nearly planar (see Figure 440).

Figure 441

Diffraction effects at edges can also be understood on the basis of Huygens' suggestion. By sketching the wavefronts of secondary sources at points on a wavefront close to the edge, it is easy to see that diffraction effects become more pronounced as the wavelength of the incident wave is increased.
The barrier would be expected to prevent the waves reaching points such as X. However, because of diffraction at the slit, the waves spread out in the region beyond the slit and the microphone will detect sound at points such as X.
barrier
Exercise
4.5 (a)
1.
(a)
Calculate the wavelengths of (i) FM radio waves of frequency 96 MHz (ii) long wave radio waves of frequency 200 kHz. (Speed of light in free space $c = 3.0 \times 10^{8}\ \mathrm{m\,s^{-1}}$)
Use your answers to (i) and (ii) to explain why, if your car radio is tuned to FM, it cuts out when you enter a tunnel but does not if you are tuned to long wave reception. (Hint: consider diffraction)
microphone
speaker
CRO
signal generator
(b)
Figure 442
You can also demonstrate the diffraction of light by shining laser light through a single narrow slit so that, after passing through the slit, the light is incident on a screen. Diffraction at the slit produces a pattern on the screen that consists of areas of illumination (bright fringes) separated by dark areas (dark fringes). In this situation, each point on the slit acts as a secondary source and the pattern of light and dark fringes is a result of the interference (see next section) of the waves from these sources. (The diffraction of light is considered in more detail in Option G and Topic 11.3.)
As mentioned above, diffraction effects at a slit really only become noticeable when the slit width is comparable to the wavelength. In this respect, if the laser is replaced by a point source, then as the slit is made wider, the diffraction pattern tends to disappear and the illumination on the screen becomes more like what one would expect if light consisted of rays rather than waves. Historically, the diffraction of light was strong evidence for believing that light did indeed consist of waves.
The diffraction of light can also be demonstrated by looking directly at a point source through a narrow slit. Unless the point source is monochromatic (a monochromatic source is one that emits light of a single colour, i.e. a single wavelength), you will see a series of different coloured fringes interspersed by dark fringes.
Suggest one reason why ships at sea use a very low frequency sound for their foghorn.
4.5.5 State the principle of superposition and explain what is meant by constructive interference and by destructive interference.
4.5.6 State and apply the conditions for constructive and for destructive interference in terms of path difference and phase difference.
4.5.7 Apply the principle of superposition to determine the resultant of two waves.
IBO 2007
pulse 1
pulse 2
Figure 443
There is a very important principle in physics that applies not only to waves but to other situations as well. This is the principle of superposition. What it effectively tells us is that if you want to find out the effect of two separate
pulse 1
pulse 2
2a a pulse 1 pulse 2
The dashed line shows where the crests from S1 meet the crests from S2 , creating a double crest, i.e., constructive interference.
S1
(antinodes) constructive interference (crest meets crest) (nodes) destructive interference (crest meets trough)
S2
Figure 444 (a) Constructive interference
1. In the first diagram of Figure 444 (a) the pulses do not fully overlap and in the second diagram they do. In the second diagram we have what we call full constructive interference. The two pulses add to give a single pulse of twice the amplitude of each separate pulse.
Figure 445 Two source interference
The sources are two dippers connected to a bar that is vibrated by an electric motor. The dippers just touch the surface of the water in a ripple tank. The sources therefore have the same frequency. They are also in phase. By this we mean that when a crest is created by one dipper a crest is created by the other dipper. Sources that have the same frequency and that are in phase are called coherent sources. The bold lines in the diagram show the places where a trough of one wave meets the crest of the other wave, producing complete destructive interference. The water at these points will not be displaced and such points are called nodal points or nodes. The bold lines are therefore called nodal lines. Points of maximum constructive interference are called antinodes. The overall pattern produced by the interfering waves is called an interference pattern.
pulse 1
pulse 2
pulse 1
pulse 2
pulse 1
pulse 2
Figure 444 (b) Destructive interference
2. We now consider two pulses as shown in Figure 444 (b).
To emphasise the idea of phase the diagrams in Figure 446 show snapshots of two waves in phase and two waves that are out of phase.
When these two pulses completely overlap the net displacement of the string will be zero. We now have what we call complete destructive interference.
wave B
wave B
Diagram 1
Diagram 2
Exercise
Figure 446 Phase difference
The idea of "out of phase" comes from the idea that if the space displacement of wave A in diagram 2 is represented by $y = A\sin\theta$, then the space displacement of wave B is $y = A\sin(\theta + \pi)$. At the instant shown the waves in diagram 1 will reinforce and produce constructive interference whereas the waves in diagram 2 will produce destructive interference. Let us now look at the interference between waves from two point sources in a little more detail. In Figure 447 we want to know what will be the condition for there to be a point of maximum or minimum interference at P.
P S1 S2 X
4.5 (b)
Sophia sounds two tuning forks A and B together and places them close to her ear. The graph in Figure 448 shows the variation with time t of the air pressure close to her ear over a short period of time.
[Graph: pressure / arbitrary units (scale from -1 to 1) against time t, with separate curves for tuning fork A and tuning fork B]
Figure 448
Figure 447 Interference
S1 and S2 are two coherent point sources. For there to be a maximum at P, a trough must meet a trough or a crest must meet a crest. The waves from the sources will have travelled different distances to P and therefore will not necessarily be in phase when they reach P. However, if a crest meets a crest (or a trough meets a trough) at P then they will be in phase. They will be in phase only if the difference in the distances travelled by the two waves is an integral number of wavelengths. This means that

path difference $= S_2P - S_1P = S_2X = n\lambda$, $n = 0, 1, 2, \ldots$

The waves will be out of phase if the difference in the distances travelled is an odd number of half-wavelengths. So for a minimum we have

path difference $= S_2P - S_1P = S_2X = \left(n + \tfrac{1}{2}\right)\lambda$, $n = 0, 1, 2, \ldots$

The interference of waves is discussed in further detail in Option G (Chapter 18).
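The two path-difference conditions can be tried out numerically. The following is a minimal sketch, assuming two coherent sources that are in phase; the function name, the tolerance and the sample distances are illustrative choices and do not come from the text.

```python
import math


def classify_interference(path_difference, wavelength, tol=1e-9):
    """Classify point P from the path difference S2P - S1P (same units as wavelength)."""
    ratio = abs(path_difference) / wavelength
    # Whole number of wavelengths: crest meets crest (or trough meets trough).
    if abs(ratio - round(ratio)) < tol:
        return "maximum"
    # Odd number of half-wavelengths: crest meets trough.
    if abs(ratio - (math.floor(ratio) + 0.5)) < tol:
        return "minimum"
    return "intermediate"


# Illustrative values only: wavelength 0.02 m, path differences in metres.
print(classify_interference(0.06, 0.02))
print(classify_interference(0.05, 0.02))
```

A path difference of three wavelengths satisfies the first condition, and one of two and a half wavelengths satisfies the second.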
On a separate piece of graph paper, use the principle of superposition to construct a graph that shows the variation with time t of the resultant pressure close to Sophia's ear. Use your graph to suggest the nature of the sound that Sophia will hear up until the time that the vibrations of the tuning forks die out.
ELECTRIC CURRENTS
5.1 Electric Potential Difference, Current and Resistance
5.2 Electric Circuits
INTRODUCTION
Figure 501 illustrates a typical model of a metal. Metals are good electrical conductors because of the mobility of electrons.
metal cations +
potential end as shown in Figure 502. Electrons entering at one end of the metal cause a similar number of electrons to be displaced from the other end, and the metal conducts. Even though they are accelerated along their path, it is estimated that the drift velocity is only a small fraction of a metre each second (about $10^{-4}\ \mathrm{m\,s^{-1}}$).
[Figure labels: electric field; metal rod; electrons leaving metal; low potential (-); high potential (+); drift direction]
Figure 501
Figure 502
Scientists have gathered evidence to suggest that the electrons in outer shells of metals move about freely, within a three-dimensional metal lattice of positively charged metal ions. Thus, metal structure consists of positive ions in a sea of delocalised electrons. Positive ions and free electrons have internal energy that depends on the temperature of the metal at that point in time, and the delocalised electrons move about at enormous speeds of about $10^{6}\ \mathrm{m\,s^{-1}}$, colliding with the positive ions in the metal's lattice. However, although this speed is high, there is no net movement of charge unless the conductor is connected to a source of potential difference (or voltage). When a dry cell or battery is connected across the ends of a metal wire, an electric field is produced in the wire. The electrons drift with a drift velocity to the high positive
An electric field is created when a potential difference is supplied to a metal wire. Thus, the average speed and average kinetic energy of the electrons increase. When they collide with positive ions in the lattice, they give up some of their energy to them. After this event, they are again accelerated by the electric field until the next collision occurs. These collisions generate heat that causes the temperature of the metal to increase. We say that a current produces a heating effect.
CHAPTER 5
2. The electrical potential difference between two points in a conductor carrying a current is defined as the amount of electrical energy that is changed to other forms of energy per unit charge as the charge moves from one point to the other.
In the expression V = P / I, P is the power dissipated in watts (W) and I is the current in amps (A).
The concept of electric fields will be explored more fully in Chapter 6. Here is a third way to define potential difference in terms of electric potential. 3. The electrical potential difference between two points in a conductor carrying a current is defined as the work done per unit charge in moving a positive charge from a point at higher potential to a point at lower potential.
$V = \dfrac{\Delta E_p}{q} = \dfrac{W}{q}$
where q is the amount of charge measured in coulombs (C). (If work is done in moving an electron, it would therefore move from a point at lower potential to a point at higher potential.) So as charge q moves through a potential difference V, it does work equal to Vq and this work is equal to the energy given to the load component in the circuit. Therefore, potential difference is a measure of the energy released by an electric charge or group of charges. The accepted unit of electric potential difference is the volt, V. It follows that it can also be measured in $\mathrm{J\,C^{-1}}$. Just as work is a scalar quantity, so too electrical potential difference is a scalar quantity. When we use the term voltage, we are loosely talking about electric potential difference. If the potential difference between the terminals of a dry cell is 1.5 V then the energy used in carrying a positive charge of +q from one terminal to the other is 1.5 J for each coulomb of charge ($1.5\ \mathrm{J\,C^{-1}}$). The work done in transferring a charge between two points does not depend on the path along which the charge is taken because the potential difference between two points is constant.
$W = Fs$ or $W = Fs\cos\theta$
Doing work on an object changes its energy. There are a number of different ways to define electric potential difference:
1. The electric potential difference between two points in a conductor carrying a current is defined as the power dissipated in a load per unit current in moving from one point to another.
$V = P / I$
I 6V A 6V Vi
x +q
V out
B 3V V out
Example
0V
Figure 503 Charge moving in a uniform electric field
Calculate the work done in moving 10.0 μC through a potential difference of 150 V.
Solution
$W = qV = (1.00 \times 10^{-5}\ \mathrm{C})(150\ \mathrm{V}) = 1.5 \times 10^{-3}\ \mathrm{J}$
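The same calculation can be checked with a short sketch. The function name is an illustrative choice, and the charge is taken to be 10.0 μC, the value used in the worked solution.

```python
def work_done(charge_c, potential_difference_v):
    """Work done (J) in moving a charge (C) through a potential difference (V): W = qV."""
    return charge_c * potential_difference_v


# 10.0 microcoulombs moved through 150 V.
w = work_done(10.0e-6, 150.0)
print(f"W = {w:.2e} J")
```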
Example
A 2.0 μC charge acquires $2.0 \times 10^{-4}$ J of kinetic energy when it is accelerated by an electric field between two points. Calculate the potential difference between the two points.
Solution
Using the formula $V = W / q = (2.0 \times 10^{-4}\ \mathrm{J}) / (2.0 \times 10^{-6}\ \mathrm{C}) = 100\ \mathrm{V}$
The potential difference is $1.0 \times 10^{2}$ V.

5.1.6 Define resistance.
5.1.7 Apply the equation for resistance in the form $R = \rho l / A$, where $\rho$ is the resistivity of the material of the resistor.
5.1.8 State Ohm's law.
5.1.9 Compare ohmic and non-ohmic behaviour.
5.1.10 Derive and apply expressions for electrical power dissipation in resistors.
5.1.11 Solve problems involving potential difference, current and resistance.
IBO 2007
Example
The work done by an external force to move a -8.0 μC charge from point a to point b is $8.0 \times 10^{-3}$ J. If the charge, initially at rest, had $4.0 \times 10^{-3}$ J of kinetic energy at point b, calculate the potential difference between a and b.
Solution
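Using the formula $V = W / q = (8.0 \times 10^{-3}\ \mathrm{J}) / (-8.0 \times 10^{-6}\ \mathrm{C}) = -1.0 \times 10^{3}\ \mathrm{V}$
The work done to produce the gain in kinetic energy corresponds to $(4.0 \times 10^{-3}\ \mathrm{J}) / (8.0 \times 10^{-6}\ \mathrm{C}) = 5.0 \times 10^{2}\ \mathrm{V}$.
$V = 1.0 \times 10^{3} - 5.0 \times 10^{2}\ \mathrm{V}$
The potential difference is $5.0 \times 10^{2}$ V.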
Solids: electrons in metals and graphite, and holes in semiconductors.
Liquids: positive and negative ions in molten and aqueous electrolytes.
Gases: electrons and positive ions formed by electrons stripped from gaseous molecules by large potential differences.
At this point, we will define the electric current as the rate at which charge flows past a given cross-section.
$I = \dfrac{\Delta q}{\Delta t}$
It makes sense that for an electric current to flow, there must be a complete circuit for it to flow through. The unit of current from the above equation is the coulomb per second ($\mathrm{C\,s^{-1}}$) and this unit is called the ampere (A). The ampere or amp is a rather large unit. Current is often expressed in milliamps (mA) and microamps (μA) or even nanoamps and picoamps. Some relevant currents for situations are given in Figure 504.

Situation                          Current
A computer chip                    $10^{-12}$ to $10^{-6}$ A
Electron beam of a television      $10^{-3}$ A
Current dangerous to a human       $10^{-2}$ to $10^{-1}$ A
Household light bulb               0.5 A
Car starter motor                  200 A
Lightning                          $10^{4}$ A

Figure 505 shows currents $I_1$ and $I_2$ flowing in the same direction in two wires. In part a, at point P on the right wire, the magnetic field $B_1$ is upwards (right-hand screw rule). Then using the right-hand palm rule, the direction of the force F is in an easterly direction. Similar analysis of the diagram in part b reveals that the force is in a westerly direction. Both forces are inwards, and the wires will attract each other. Check that you obtain the same result as the diagrams. The vector diagrams and lines of force or lines of magnetic flux for currents in the same direction and in opposite directions are shown in Figures 505 (a) and (b) respectively.
Figure 504 Some typical current situations.
When a current flows in the same direction around an electric circuit, the current is said to be a direct current (dc). Dry cell and wet cell batteries supply dc. When the direction of the current changes with time it is said to be an alternating current (ac). Household electrical (non-electronic) appliances are ac.
Attraction
Repulsion
EXTENSION ONLY
Electric current is a fundamental quantity in physics, and it is the primary quantity on which electrodynamics is based. Its SI unit is the ampere (A). The ampere is defined in terms of the force per unit length between parallel current-carrying conductors. This definition is all that is required for this course. This definition can be extended further. When two long parallel current-carrying wires are placed near each other, the force exerted by each will be a force of attraction or repulsion depending upon the direction of the current in each wire. The force is attractive if the two currents flow in the same direction, and the force is repulsive if the two currents flow in opposite directions.
B1 I1
P
Figure 506
The French physicist André-Marie Ampère showed that the quantitative relationship for the force F per unit length l between two parallel wires carrying currents $I_1$ and $I_2$ separated by a distance r in a vacuum was given by

$F \propto I_1 I_2 \dfrac{l}{r} \quad\Leftrightarrow\quad F = k I_1 I_2 \dfrac{l}{r}$

Therefore,

$F = \dfrac{\mu_0}{2\pi} I_1 I_2 \dfrac{l}{r} = \dfrac{\mu_0 I_1 I_2 l}{2\pi r}$

The constant $\mu_0$ is called the permeability of free space and is defined to equal $\mu_0 = 4\pi \times 10^{-7}\ \mathrm{T\,m\,A^{-1}}$. This last equation now allows us to finally define the fundamental unit, the ampere.
Figure 505 (a) and (b) Force between two parallel current-carrying wires.
Thus one ampere is defined as that current flowing in each of two indefinitely long parallel wires of negligible cross-sectional area separated by a distance of one metre in a vacuum that results in a force of exactly $2 \times 10^{-7}$ N per metre of length of each wire.
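The definition can be checked against the force equation for two parallel wires. This is a minimal sketch; the constant and function names are illustrative choices, and the value of the permeability of free space is the defined value quoted in the text.

```python
import math

# Permeability of free space in T m A^-1 (defined value quoted in the text).
MU_0 = 4 * math.pi * 1e-7


def force_per_metre(i1, i2, r):
    """Force per unit length (N per metre) between two long parallel wires
    carrying currents i1 and i2 (A) separated by a distance r (m)."""
    return MU_0 * i1 * i2 / (2 * math.pi * r)


# Two wires each carrying 1 A, separated by 1 m in a vacuum.
print(force_per_metre(1.0, 1.0, 1.0))
```

With 1 A in each wire and a separation of 1 m, the result agrees with the force per metre stated in the definition.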
resistor
Example
Figure 507 Conventional and electron currents
Calculate the current flowing through a hair drier if it takes $2.40 \times 10^{3}$ C of charge to dry a person's hair in 4.0 minutes.
Solution
Unfortunately, this convention has been kept. Figure 507 shows a simple circuit diagram of a 1.5 V dry cell connected to a resistor. When drawing and interpreting circuit diagrams, just remember that conventional current flows from the positive to the negative terminal unless you are specifically asked for the electron flow, which is from the negative to the positive terminal.
To determine the current, we are given that a charge of $2.40 \times 10^{3}$ C flows for a period of 4.0 minutes (= 240 seconds). This gives
$I = q / t = (2.40 \times 10^{3}\ \mathrm{C}) / (240\ \mathrm{s}) = 10\ \mathrm{A}$
5.1.6 RESISTANCE
Electrical resistance, R, is a measure of how strongly a material opposes the flow of charge. Conductors, semiconductors and insulators differ in their resistance to current flow. An ohmic material of significant resistance placed in an electric circuit to control current or potential difference is called a resistor. The electrical resistance of a piece of material is defined by the ratio of the potential difference across the material to the current that flows through it.
Current is a scalar quantity but it is useful to indicate the direction of flow of current. Before the electron was discovered, the direction of the charge-carriers was already defined by scientists and engineers to be from positive to negative. Benjamin Franklin stated that an excess of fluid produced one kind of electric charge which he termed positive, and a lack of the same fluid produced the other type of electric charge which he called negative. Franklin's designation of (+) being an excess of electric charge and of (-) being a deficiency of electric charge was unfortunate. When batteries and generators, the first sources of continuous current, were developed in the 1800s, it was assumed that electric current represented the flow of positive charge as defined by Franklin. This is partly the reason that electric fields are defined in terms of a positive test charge, and the lines of electric flux are explained as going outwards from positive charges. It is now known that in a metal an electric current is a flow of electrons from negative to positive.
$R = \dfrac{V}{I}$
The units of resistance are volts per ampere ($\mathrm{V\,A^{-1}}$). However, a separate SI unit called the ohm (Ω) is defined as the resistance through which a current of 1 A flows when a potential difference of 1 V is applied. Since the ohm is a small unit of resistance, it is common for a resistor to have kilo ohm (kΩ) and mega ohm (MΩ) values.
Nichrome is used as the heating element in many toasters and electric radiators. The semiconductors behave in a special manner, and pyrex glass is an obvious insulator.

Material                                         Resistivity / Ω m
Silver                                           $1.6 \times 10^{-8}$
Copper                                           $1.7 \times 10^{-8}$
Aluminium                                        $2.8 \times 10^{-8}$
Tungsten                                         $5.6 \times 10^{-8}$
Constantan (alloy of copper and nickel)          $49 \times 10^{-8}$
Nichrome (alloy of nickel, iron & chromium)      $100 \times 10^{-8}$
Graphite                                         $(3 - 60) \times 10^{-5}$
Silicon                                          0.1 - 60
Germanium                                        $(1 - 500) \times 10^{-3}$
Pyrex glass                                      $10^{12}$

Figure 508 Resistivities for certain materials at 20 °C
$R = \rho \dfrac{l}{A}$
where R is the resistance in Ω, $\rho$ is the resistivity in Ω m, l is the length of the conductor in m, and A is the cross-sectional area of the conductor in m². As the length of a conductor increases, the resistance increases proportionally: $R \propto l$. It seems logical that as the length increases so too does the resistance to the flow of charge across the conductor. As the cross-sectional area of a conductor increases, the resistance decreases in inverse proportion.
Example
Determine the resistance of a piece of copper wire that is 10.0 m long and 1.2 mm in diameter.
Solution
The resistance, R, is given by the formula $R = \rho l / A$, where $A = \pi r^{2}$. This means that
$R = \dfrac{(1.7 \times 10^{-8}\ \Omega\,\mathrm{m})(10.0\ \mathrm{m})}{\pi (6.0 \times 10^{-4})^{2}\ \mathrm{m^{2}}} = 0.150\ \Omega$
The resistance of the copper wire is 0.15 Ω.
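The calculation above can be reproduced with a short sketch. The function name is an illustrative choice; the resistivity of copper is the Figure 508 value, and the wire is assumed to have a uniform circular cross-section.

```python
import math


def wire_resistance(resistivity, length, diameter):
    """Resistance (ohm) of a uniform wire of circular cross-section.

    resistivity in ohm metres, length and diameter in metres.
    """
    area = math.pi * (diameter / 2) ** 2  # cross-sectional area A = pi r^2
    return resistivity * length / area


# Copper wire, 10.0 m long and 1.2 mm in diameter.
r = wire_resistance(1.7e-8, 10.0, 1.2e-3)
print(f"R = {r:.3f} ohm")
```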
With increased cross-sectional area, there is a greater surface through which the charge can drift. The resistivity is specific to the type of material being used as a resistance and is affected by the nature of the delocalised electrons and the positive ions within the material. The resistivity of a material is the resistance across opposite faces of a cube with sides of one metre. It can be found that by plotting a graph of R versus (l / A), the slope of the linear graph is the resistivity measured in Ω m. Figure 508 gives the resistivities of various materials at 20 °C. The first three values show that although silver has a low resistivity, it is expensive to use in electrical circuits. Copper is the preferred metal although aluminium is commonly used in electricity transmission cables because of its lower density. Constantan and nichrome are commonly used in wire form, and are termed high resistance wires.
The resistance of a material increases with temperature because of the thermal agitation of the atoms it contains, and this impedes the movement of electrons that make up the current. The increase in resistance can be shown as
$R_f = R_0 (1 + \alpha t)$
where $R_0$ equals the resistance at some reference temperature, say 0 °C, $R_f$ is the resistance at some temperature, t °C, above the reference temperature, and $\alpha$ is the temperature coefficient for the material being used.
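A minimal sketch of this relationship is given below. The function name and all the numerical values are illustrative assumptions, including the temperature coefficient, which is not a value taken from the text.

```python
def resistance_at(r0, alpha, t):
    """Resistance at t degrees above the reference temperature: R_f = R_0 (1 + alpha t)."""
    return r0 * (1 + alpha * t)


# Assumed values for illustration: a 10.0 ohm resistor at the reference
# temperature, with an assumed temperature coefficient of 4.0e-3 per degree Celsius.
print(resistance_at(10.0, 4.0e-3, 50.0))
```

With these assumed values the resistance rises by a fifth over a 50 °C increase, which shows why the heating of a wire matters in practical circuits.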
One interesting phenomenon of the effect of temperature on resistance is superconductivity. In 1911, H. Kamerlingh Onnes found that mercury loses all its resistance abruptly at a critical temperature of 4.1 K. When a material attains zero resistance at some critical temperature, it is called a superconductor. The possibility of having a material that has an induced electric current that lasts forever has become a topic for research physicists. Just think of the energy saving if a perfect superconductor is found that can give zero resistance at room temperature.
A typical apparatus used in the confirmation of Ohm's Law is shown using a circuit diagram in Figure 509.
$V \propto I \quad\Leftrightarrow\quad \dfrac{V}{I} = \text{constant}$
This is known as Ohm's Law. There are two relevant statements to be made here. Firstly, Ohm's Law is not really a law but rather an empirical statement of how materials behave. Many materials are non-ohmic and this law is only applicable to ohmic conductors. Secondly, the law should not be written as R = V / I as this statement defines resistance. The formula is commonly written as:
potential difference, V
V = IR
where V is the potential difference across the resistor (in volts, V), I is the current in the resistor (in amperes, A) and R is the resistance (in ohms, Ω). When written in this form R is understood to be independent of V. As charge moves through the resistance of a device, it loses electric potential energy. The potential energy of a charge is less upon leaving the resistor than it was upon entering. We say that there is a potential drop across the device.
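One way to decide whether a set of measurements shows ohmic behaviour is to check that the ratio V / I stays constant. The following is a minimal sketch; the function name, the 5% tolerance and the sample readings are all hypothetical and are not data from the text.

```python
def is_ohmic(voltages, currents, rel_tol=0.05):
    """Return True if V / I is constant to within rel_tol of its mean value."""
    ratios = [v / i for v, i in zip(voltages, currents) if i != 0]
    mean = sum(ratios) / len(ratios)
    return all(abs(r - mean) <= rel_tol * mean for r in ratios)


# Hypothetical readings (volts, amperes).
print(is_ohmic([2.0, 4.0, 6.0], [0.10, 0.20, 0.30]))  # constant ratio of 20 ohm
print(is_ohmic([2.0, 4.0, 6.0], [0.10, 0.15, 0.18]))  # ratio rises with V
```

The second set of readings is the kind of behaviour expected from a non-ohmic device, where the resistance changes as the potential difference is increased.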
Example
Solution
[Graphs a. to g.: each panel has axes V and I]
Figure 511
An iron draws 6.0 A of current when operating in a country with a mains supply of 120 V. Calculate the resistance of the iron.
Appliance          Power rating
Blow heater        2 kW
Kettle             1.5 kW
Toaster            1.2 kW
Iron               850 W
Vacuum cleaner     1.2 kW
Television         250 W

power is used for one hour. The consumer has to pay a certain cost per kilowatt-hour, say 14 cents per kW h.
The heating effect of a current was investigated in 1841 by James Joule. He was able to demonstrate that by supplying electrical energy to a high resistance coil of wire this energy could be converted to thermal energy.
$V I t = m c \Delta T$
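The energy balance between electrical energy supplied and thermal energy gained can be sketched as follows. This is only an illustration: it assumes that all the electrical energy heats the water with no losses, the function name is an illustrative choice, and the supply values and mass are made-up numbers. The specific heat capacity of water is taken as 4180 J kg⁻¹ K⁻¹.

```python
def temperature_rise(v, i, t, mass, specific_heat):
    """Temperature rise from V I t = m c dT, assuming no energy is lost."""
    return (v * i * t) / (mass * specific_heat)


# Assumed values: 12 V, 2.0 A for 300 s heating 0.10 kg of water.
rise = temperature_rise(12.0, 2.0, 300.0, 0.10, 4180.0)
print(f"Temperature rise = {rise:.1f} K")
```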
Example
$P = \dfrac{qV}{q / I} = IV$
The same result could have been obtained using the fact that the electrical energy,
W = q V = ItV
Solution
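(a) Given that $P = 2.5 \times 10^{3}$ W and V = 240 V, we use the formula P = IV. Now, $I = P / V = (2.5 \times 10^{3}\ \mathrm{W}) / (240\ \mathrm{V}) \approx 10.4\ \mathrm{A}$, so that the current drawn is $1.0 \times 10^{1}$ A.
(b) Next, we use the formula W = VIt, so that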
$P = I \times IR = I^{2} R$
We also could have used I = V / R , giving the result,
Example
$P = IV = \dfrac{V}{R} \times V = \dfrac{V^{2}}{R}$
Summary
$P = \dfrac{W}{t} = VI = I^{2} R = \dfrac{V^{2}}{R}$
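The three expressions for power give the same value for an ohmic resistor, which can be confirmed with a short sketch. The function name is an illustrative choice; the numbers are those of the iron in the earlier example (6.0 A from a 120 V supply).

```python
def power_three_ways(v, i):
    """Return P computed as VI, I^2 R and V^2 / R, with R = V / I."""
    r = v / i
    return v * i, i ** 2 * r, v ** 2 / r


p_vi, p_i2r, p_v2r = power_three_ways(120.0, 6.0)
print(p_vi, p_i2r, p_v2r)
```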
A 2.5 kW blow heater is used for eight hours. Calculate the cost if electricity is sold at 12 cents per kilowatt-hour.
The commercial unit of electrical energy is the kilowatt-hour (kW h). It is the energy consumed when 1 kW of
Solution
Exercise
5.1
Using the fact that energy consumed (E) = power time, we have
E = ( 2.5 kW ) ( 8 h ) = 20 kW h
1.
When a current is flowing through a wire attached to a dry cell
A. positive charges flow from negative to positive terminal
B. positive charges flow from positive to negative terminal
C. negative charges flow from negative to positive terminal
D. negative charges flow from positive to negative terminal
Therefore, the cost = (20 kW h) × $0.12 per kW h = $2.40
The cost to run the heater is two dollars forty cents ($2.40).
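The cost calculation is simply energy in kilowatt-hours multiplied by the price per kilowatt-hour. A minimal sketch, with an illustrative function name, is:

```python
def running_cost(power_kw, hours, price_per_kwh):
    """Cost of running an appliance: energy (kW h) multiplied by price per kW h."""
    energy_kwh = power_kw * hours
    return energy_kwh * price_per_kwh


# 2.5 kW blow heater used for 8 hours at $0.12 per kW h.
print(f"${running_cost(2.5, 8.0, 0.12):.2f}")
```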
Example
A 1.2 kW electric water heater that is made of aluminium is used to heat 2.5 L of water from 25 °C to the boil. If the mass of the aluminium water heater is 350 g and all the electrical energy is converted to heat energy, determine the time in minutes to bring the water to the boil. (Specific heat capacity of aluminium is $9.1 \times 10^{2}\ \mathrm{J\,kg^{-1}\,K^{-1}}$)

Solution
A 1.2 kW heater delivers 1200 joules each second. 2.5 L of water has a mass of 2.5 kg. The electrical energy is used to heat the heater and the water from 25 °C to 100 °C.
$Q = (mc\Delta T)_{\text{heater}} + (mc\Delta T)_{\text{water}}$
$= (0.35\ \mathrm{kg} \times 9.1 \times 10^{2}\ \mathrm{J\,kg^{-1}\,K^{-1}} \times 75\ \text{°C}) + (2.5\ \mathrm{kg} \times 4180\ \mathrm{J\,kg^{-1}\,K^{-1}} \times 75\ \text{°C}) = 8.08 \times 10^{5}\ \mathrm{J}$
Now if the heater delivers 1200 J per second, then to supply $8.08 \times 10^{5}$ J will take
$t = \dfrac{8.08 \times 10^{5}\ \mathrm{J}}{1200\ \mathrm{J\,s^{-1}}} = 673\ \mathrm{s} = \dfrac{673}{60}\ \mathrm{min} = 11.2\ \mathrm{min} \approx 11\ \mathrm{min}$

2. The total charge passing the same point in a conductor during 2.0 s of an electric current of 7.5 mA is
A. 15 nC
B. 15 mC
C. 15 C
D. $1.5 \times 10^{4}$ C
3. An electric heater raises the temperature of a measured quantity of water. $6.00 \times 10^{3}$ J of energy is absorbed by the water in 30.0 seconds. What is the minimum power rating of the heater?
A. $5.00 \times 10^{2}$ W
B. $2.00 \times 10^{2}$ W
C. $2.00 \times 10^{3}$ W
D. $1.80 \times 10^{5}$ W
4. If the power developed in an electric circuit is doubled, the energy used in one second is
A. halved
B. doubled
C. quartered
D. quadrupled
5. The electron volt is defined as
A. the energy acquired by an electron when it passes through a potential difference of 1.0 V.
B. the voltage of an electron.
C. a fraction of the ionisation of an electron.
D. a unit of energy exactly equal to $1.6 \times 10^{-19}$ J.
6. The definition of the unit of current, the ampere, is based on
A. the charge per unit time delivered by an emf of 1.0 V.
B. the force per unit length on parallel current-carrying wires.
C. the force per unit length on a conductor in a magnetic field.
D. the charge passing a point per unit time.
7. Identify the charge-carriers in
a. a length of copper wire.
b. an aqueous solution of sodium chloride.
c. the atmosphere during a lightning storm.
8. The speed with which electrons move through a copper wire is typically $10^{-4}\ \mathrm{m\,s^{-1}}$.
a. Explain why the electrons cannot travel faster in the conductor.
b. Explain why the electron drift produces heat.
9. Calculate the resistance of a wire if 0.5 V across it causes a current of 2.5 A to flow.
10. Calculate the current flow through a 20 MΩ resistor connected across a 100 kV power supply.
11. A thin copper wire 200 cm in length has a 9 V dry cell connected between its ends. Determine the voltage drop that occurs along 30 cm of this wire.
12. Determine the length of tungsten wire with a diameter of 1.0 mm that is used to make a 20.0 Ω resistor.
13. A nichrome wire has a diameter of 0.40 mm. Calculate the length of this wire needed to carry a current of 30 mA when there is a potential difference of 12 V across it.
14. Explain, in terms of atomic and electron movement, why resistance increases with temperature.
15. Determine how many coulombs there are when 2.0 A flows for 2.0 hours.
16. Distinguish between an ohmic and a non-ohmic material.
17. Copy out and complete the table

Appliance        Power (Watt)   p.d. (Volt)   Current (Ampere)   Fuse rating needed (3, 5, 10, 13 A)
Digital clock    4              240           ___                ___
Television       200            240           ___                ___
Hair dryer       ___            110           5                  ___
Iron             ___            230           4                  ___
Kettle           ___            240           10                 ___

18. Calculate the cost of heating the water to wash the dishes if the sink is 48 cm long, 25 cm wide and the water is 25 cm high. The tap water is at 14 °C and the final temperature before washing up is 62 °C. Power is sold to the consumer at 14 cents per kilowatt-hour.
19. The element of an electric jug has a resistance of 60 Ω and draws a current of 3.0 A. Determine by how much the temperature of 5.0 kg of water will rise if it is on for 6 minutes.
20. Calculate the cost to heat 200 kg of water from 12 °C to its boiling point if power costs 14 cents per kilowatt-hour.
21. Determine the work done in moving a charge of 10.0 nC through a potential difference of $1.50 \times 10^{2}$ V.
22. An electron in an electron gun of a picture tube is accelerated by a potential of $2.5 \times 10^{3}$ V. Calculate the kinetic energy gained by the electron in electronvolts.
The terms emf, potential difference and voltage are commonly interchanged in talking about electricity. The term electromotive force should be avoided as it is not a force at all but rather a potential energy difference. Again, there are historical reasons for the use of the term in the first place. In the true sense, electromotive force (emf) is the work per unit charge made available by an electrical source. For the simple circuit in Figure 507, the emf of the dry cell is 1.5 V. However, when a voltmeter is used to test the potential difference across the resistor, the reading on the voltmeter is found to be less than 1.5 V. This is because the cell is not an ideal dry cell: it has some internal resistance. For the moment, let us say that emf is the energy supplied per unit charge, and potential difference is the energy released (dissipated) per unit charge.
Example
$V = \dfrac{W}{\Delta q}$ or $W = (\Delta q) V$
A battery supplies 15.0 J of energy to 4.00 C of charge passing through it. Determine the emf of the battery.
$I = \dfrac{\Delta q}{\Delta t}$ or $\Delta q = I \Delta t$
Solution
Using the formula V = W / q, we have V = 15.0 J / 4.00 C = 3.75 V.
The emf of the battery is 3.75 V.
$W = I \Delta t V$ or $V = \dfrac{W}{I \Delta t}$
Therefore it can be stated that
Potential difference in external circuits is the power, (P), dissipated (released) per unit current. Either definition for potential difference is acceptable. However, the definition above ties voltage and current together.
Note: It is commonplace to use the expression V = W / It as opposed to $V = W / I\Delta t$, and q = It as opposed to $\Delta q = I\Delta t$. That is, to leave out the "change in" (or $\Delta$) notation when solving problems.
$IR = \text{emf} - Ir$
All sources of emf have some internal resistance and it is a factor in determining how useful a source of emf is. A dry cell, being a primary cell, releases its energy and becomes dead. When it is just about inoperable, the build-up of the products of the relevant oxidation-reduction reaction increases the internal resistance beyond its normal value. The emf available does not decrease from giving the maximum number of joules per coulomb of charge. However, when much of the emf is used up in overcoming the potential difference across the internal resistor, the terminal voltage will drop to zero. Batteries are generally regarded as being secondary cells because they can be recharged many times. If a series of potential difference readings measured with a voltmeter across a variable resistor R are taken concurrently with the current flowing in the circuit measured with an ammeter for each variable resistance, and a graph of voltage versus current is plotted, from the relationship
2. CHEMICAL: oxidation-reduction reactions transfer electrons between chemicals. Dry cells, fuel cells and batteries are examples.
3. PHOTOELECTRIC EFFECT: electrons are emitted from certain metal surfaces when high frequency light is shone on their surfaces. These photocells are used in watches, clocks and automatic doors.
4. PIEZOELECTRIC EFFECT: certain crystals can produce a charge on one side when placed under stress. If one side of the crystal is charged and the other not, a potential difference exists across the crystal. This is used in crystal microphones.
5. THERMOELECTRIC EFFECT: when two pieces of certain metals are wound together and one end is heated while the other end is cooled, a current is produced. Thermocouples can be used as temperature-measuring devices.
Figure 514 Internal resistance circuit.

In the circuit in Figure 514, the total energy supplied is determined by the value of the emf. The total energy released (per unit charge) is equal to the potential difference across resistor R plus the potential difference of the internal resistance r.

Example

A dry cell has an internal resistance of 1.50 Ω. A resistor of 12.0 Ω is connected in series with the dry cell. If the potential difference across the 12.0 Ω resistor is 1.20 V, calculate the emf of the cell.

Solution

We first need to determine the current flowing through the system. This is done by using the formula V = IR, from which we obtain I = V / R.
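Therefore, we have that

$I = \dfrac{1.20}{12.0} = 0.100\ \text{A}$

The emf is the sum of the potential differences across the external resistor and the internal resistance:

$\text{emf} = IR + Ir = 0.100 \times (12.0 + 1.50) = 1.35\ \text{V}$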
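The internal resistance example (a 1.50 Ω cell driving a 12.0 Ω resistor with 1.20 V across it) can be checked with a minimal Python sketch. The function name `emf_from_load` is hypothetical, and the sketch assumes the simple series model emf = IR + Ir used in this section.

```python
def emf_from_load(v_load, r_load, r_internal):
    """Return (current, emf) for a cell with internal resistance.

    The current follows from Ohm's law for the external resistor,
    and the emf from emf = I * (R + r).
    """
    current = v_load / r_load
    emf = current * (r_load + r_internal)
    return current, emf


# Values from the worked example in the text.
current, emf = emf_from_load(v_load=1.20, r_load=12.0, r_internal=1.50)
print(f"I = {current:.3f} A, emf = {emf:.2f} V")
```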
A current will always take the easiest path in the circuit. If for some reason the current finds a way back to the source without passing through the essential components, a short-circuit occurs. Fuses are used in appliances to stop damage due to short-circuits.

In the mid-nineteenth century, G.R. Kirchhoff (1824-1887) stated two simple rules, using the laws of conservation of energy and charge, to help in the analysis of direct current circuits.

In Figure 515, we have that I = I1 + I2 + I3 + … This is based on the law of conservation of charge.

Figure 516

This is a statement of the law of conservation of energy. Energy supplied equals the energy released in this closed path.

Series circuits

In a series circuit with one cell:

1. All the components have only one current pathway.
2. All components have the same current through them.
3. The sum of the potential drop across each component is equal to the emf of the cell.
From Kirchhoff's laws, we have that

I = I1 = I2 = I3 = … and V = V1 + V2 + V3 + …

Then, from Ohm's law, $IR_{eff} = IR_1 + IR_2 + IR_3 + \dots$, so that

$R_{eff} = R_1 + R_2 + R_3 + \dots$

The total or effective resistance Reff of a series circuit is equal to the sum of the separate resistances.

Example

From the diagram given in the figure below of a potential divider, calculate

(a) the effective resistance of the circuit
(b) the current flowing
(c) the potential difference across each resistor

[Circuit diagram: a 4 V supply in series with a 6 Ω resistor (potential difference V1) and a 2 Ω resistor (potential difference V2), carrying current I]

Solution

(a) Using our rule (from above), we have that the effective resistance is given by

$R_{eff} = R_1 + R_2 = 6\ \Omega + 2\ \Omega = 8\ \Omega$

That is, the effective resistance is 8 Ω.

(b) We can determine the current by making use of the formula I = V / Reff. So that in this instance, we have that I = 4 V / 8 Ω = 0.5 A. The current flowing is 0.5 A.

(c) The potential drop across each resistor, Ri, is given by Vi = IRi. So for the 6 ohm resistor, the potential difference (or potential drop) is given by V1 = IR1 = 0.5 × 6 = 3 V. Similarly, for the 2 ohm resistor, the potential difference (or potential drop) is given by V2 = IR2 = 0.5 × 2 = 1 V.

The potential differences in the 6 Ω and the 2 Ω resistors are 3 V and 1 V respectively.

Figure 518 Potential displacement graph.

Figure 518 shows the corresponding potential/displacement graph for the circuit in the example above. Notice how we refer to the potential difference, ΔV1, as opposed to simply V1.

Parallel circuits

In the parallel circuit in Figure 519:

1. There is more than one current pathway.
2. All components have the same potential difference across them.
3. The sum of the currents flowing into any point is equal to the sum of the currents flowing out at that point.
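Figure 519 A parallel circuit

For the circuit in Figure 519,

$I = I_1 + I_2 + I_3 + \dots$

and

$V = V_1 = V_2 = V_3 = \dots$

From Ohm's law, we have V = IR, so V / R = I or 1 / R = I / V. Meaning that,

$I = I_1 + I_2 + I_3 + \dots + I_n = \dfrac{V}{R_1} + \dfrac{V}{R_2} + \dfrac{V}{R_3} + \dots + \dfrac{V}{R_n}$

so that the effective resistance of a parallel circuit is given by

$\dfrac{1}{R_{eff}} = \dfrac{1}{R_1} + \dfrac{1}{R_2} + \dfrac{1}{R_3} + \dots + \dfrac{1}{R_n}$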
Example

For the circuit shown, calculate

(a) the effective resistance of the circuit,
(b) the current flowing in the main circuit,
(c) the current in each resistor.

[Circuit diagram: a 12 V supply connected to a 4 Ω resistor in series with a parallel network of a 3 Ω and a 6 Ω resistor]

Solution

(a) The 3 Ω and 6 Ω resistors in parallel have an effective resistance of 2 Ω (since 1/3 + 1/6 = 1/2), and this is in series with the 4 Ω resistor, giving 4 Ω + 2 Ω = 6 Ω. That is, the effective resistance is 6 Ω.

(b) To determine the current, once again we use V = IR, so that

$I = \dfrac{V}{R_T} = \dfrac{12}{6} = 2\ \text{A}$

The current flowing in the main circuit is 2 A.

(c) The current in the 4 Ω resistor is 2 A.
The potential difference across the 4 Ω resistor = IR = (2 A) × (4 Ω) = 8 V.
The potential difference across the parallel network = 12 V − 8 V = 4 V.
The current in the network resistors is given by

$I = I_1 + I_2 = \dfrac{V}{R_1} + \dfrac{V}{R_2} = \dfrac{4}{3}\ \text{A} + \dfrac{4}{6}\ \text{A} = 1.33\ \text{A} + 0.67\ \text{A}$

The current in the 4 Ω resistor is 2 A. The currents in the 3 Ω and the 6 Ω resistors are 1.33 A and 0.67 A respectively.

A voltmeter

1. is always connected across a device (in parallel).
2. has a very high resistance so that it takes very little current from the device whose potential difference is being measured.
3. has a high resistor (a multiplier) connected in series with a galvanometer.
4. would ideally have infinite resistance, with no current passing through it and no energy dissipated in it.
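The series and parallel rules used in the worked examples above (6 Ω and 2 Ω in series across 4 V; 4 Ω in series with a 3 Ω and 6 Ω parallel network across 12 V) can be checked numerically. The following Python sketch is illustrative only; the helper names `series` and `parallel` are hypothetical.

```python
def series(*resistances):
    """Effective resistance of resistors in series: R = R1 + R2 + ..."""
    return sum(resistances)


def parallel(*resistances):
    """Effective resistance of resistors in parallel: 1/R = 1/R1 + 1/R2 + ..."""
    return 1.0 / sum(1.0 / r for r in resistances)


# Series example: 6 ohm and 2 ohm across a 4 V supply.
r_series = series(6.0, 2.0)
i_series = 4.0 / r_series

# Combined example: 4 ohm in series with (3 ohm parallel 6 ohm), 12 V supply.
r_network = parallel(3.0, 6.0)
r_total = series(4.0, r_network)
i_main = 12.0 / r_total
v_network = 12.0 - i_main * 4.0   # what is left after the drop across 4 ohm
i_3 = v_network / 3.0
i_6 = v_network / 6.0

print(f"series: R = {r_series:.0f} ohm, I = {i_series:.1f} A")
print(f"combined: R = {r_total:.0f} ohm, I = {i_main:.0f} A")
print(f"branch currents: {i_3:.2f} A and {i_6:.2f} A")
```

The two branch currents add up to the main current, as required by conservation of charge.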
Table 522 Circuit symbols for: joined wires; wires crossing (not joined); cell; battery; lamp; a.c. supply; switch; ammeter (A); voltmeter (V); galvanometer; resistor; variable resistor; potentiometer; heating element; fuse; transformer; oscilloscope.
Example

A galvanometer has a resistance of 1.0 × 10^3 Ω (mainly due to the resistance of the coil), and gives a full-scale deflection (fsd) for 1.0 mA of current. Calculate the size of a multiplier resistor that would be needed to convert this to a voltmeter with a fsd of 10.0 volts.

Solution

Because the resistors Rg and Rs are in series, the potential difference across the pair of resistors will be 10 volts at full-scale deflection. Therefore,

I (Rg + Rs) = 10
0.001 (1000 + Rs) = 10
0.001 Rs = 9
Rs = 9000 Ω
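The multiplier calculation can be sketched in Python as follows. The function name `multiplier_resistance` is hypothetical; the sketch simply rearranges I(Rg + Rs) = V for Rs.

```python
def multiplier_resistance(v_fsd, i_fsd, r_galvanometer):
    """Series (multiplier) resistor that turns a galvanometer into a voltmeter.

    At full-scale deflection the current i_fsd flows through Rg and Rs in
    series, so v_fsd = i_fsd * (Rg + Rs).
    """
    return v_fsd / i_fsd - r_galvanometer


# Values from the worked example: Rg = 1.0e3 ohm, fsd current 1.0 mA, 10.0 V range.
r_multiplier = multiplier_resistance(v_fsd=10.0, i_fsd=1.0e-3, r_galvanometer=1.0e3)
print(f"Rs = {r_multiplier:.0f} ohm")
```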
An ammeter

1. is always connected in series with a circuit.
2. has a very low resistance compared with the resistance of the circuit so that it will not significantly alter the current flowing in the circuit.
3. has a low resistor (a shunt) connected in parallel with a galvanometer.
4. would ideally have no resistance, with no potential difference across it and no energy dissipated in it.

Figure 525 [Circuit diagram: resistors R1 and R2 in series across a supply V, with potential differences V1 and V2 across them]

Using Ohm's law, V1 = IR1 and V = I(R1 + R2), so that

$\dfrac{V_1}{V} = \dfrac{IR_1}{I(R_1 + R_2)}$

Therefore,

$V_1 = \dfrac{R_1}{R_1 + R_2}\,V$

This is known as the potential divider equation.
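As a quick check of the potential divider equation, the Python sketch below applies it to the 6 Ω and 2 Ω resistors across a 4 V supply from the earlier series-circuit example. The function name `divider_output` is hypothetical.

```python
def divider_output(v_supply, r1, r2):
    """Potential difference across R1 in a two-resistor divider: V1 = V * R1 / (R1 + R2)."""
    return v_supply * r1 / (r1 + r2)


v1 = divider_output(4.0, 6.0, 2.0)   # across the 6 ohm resistor
v2 = divider_output(4.0, 2.0, 6.0)   # across the 2 ohm resistor
print(f"V1 = {v1:.0f} V, V2 = {v2:.0f} V")
```

The two results add up to the supply voltage, in agreement with V = V1 + V2 for a series circuit.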
Example

[Circuit diagram: a 12 V supply with terminals numbered 1 to 7]

Calculate

(a) the total current in the circuit
(b) the potential difference across each resistor
(c) the voltmeter reading if it was connected between terminals 2 and 6.

Example

A galvanometer has a resistance of 1.0 × 10^3 Ω (mainly due to the resistance of the coil), and gives a full-scale deflection (fsd) for 1.0 mA of current. Calculate the size of a shunt resistor that would be needed to convert this to a meter with a fsd of 5.0 A.

Solution

Because the resistors Rg and Rs are in parallel, the potential difference across the resistors is the same. Therefore, with i the galvanometer current and I the total current,

$iR_g = (I - i)R_s$

$R_s = \dfrac{iR_g}{I - i} = \dfrac{(1.0 \times 10^{-3})(1.0 \times 10^{3})}{5.0 - 1.0 \times 10^{-3}} \approx 0.20\ \Omega$
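The shunt calculation for the galvanometer example can be sketched in Python. The function name `shunt_resistance` is hypothetical; it rearranges i Rg = (I − i) Rs for Rs.

```python
def shunt_resistance(i_total, i_fsd, r_galvanometer):
    """Parallel (shunt) resistor that turns a galvanometer into an ammeter.

    The galvanometer and shunt share the same potential difference,
    so i_fsd * Rg = (i_total - i_fsd) * Rs.
    """
    return i_fsd * r_galvanometer / (i_total - i_fsd)


# Values from the worked example: Rg = 1.0e3 ohm, fsd current 1.0 mA, 5.0 A range.
r_shunt = shunt_resistance(i_total=5.0, i_fsd=1.0e-3, r_galvanometer=1.0e3)
print(f"Rs = {r_shunt:.2f} ohm")
```

The shunt is far smaller than the coil resistance, so nearly all of the 5.0 A bypasses the galvanometer.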
Because resistance is directly proportional to the length of a resistor, a variable resistor (also known as a potentiometer, or colloquially as a pot) can also be used to determine the potential difference across an output transducer (a device for converting energy from one form to another) such as the filament lamp in Figure 527. If the pointer was at A then the potential difference would be zero, as there is no power dissipated per unit current in the potentiometer (the load), and there would be no output voltage. However, if the pointer is moved up to two-thirds the length of the potentiometer as in the figure, then the output voltage across the filament lamp would be ⅔ × 6 V = 4 V.
Figure 527 Laboratory potentiometer

Pots have a rotating wheel mounted in plastic and they are commonly used as volume and tone controls in sound systems. They can be made from wire, metal oxides or carbon compounds.

A light dependent resistor (LDR) is a photo-conductive cell whose resistance changes with the intensity of the incident light. Typically, it contains a grid of interlocking electrodes made of gold deposited on glass, over which is deposited a layer of the semiconductor cadmium sulfide. Its range of resistance is from over 10 MΩ in the dark to about 100 Ω in sunlight. A simple LDR and its circuit symbol are shown in Figure 528.

Figure 528 A light dependent resistor

There are only two ways to construct a voltage divider with an LDR sensor: with the LDR at the top, or with the LDR at the bottom, as shown in Figure 529.

Figure 529 (a) and (b) Two ways to construct a voltage divider with an LDR (each with V in, V out and 0 V rails)

Light dependent resistors have many uses in electronic circuits including smoke detectors, burglar alarms, camera light meters, camera aperture controls in automatic cameras and controls for switching street lights off and on.

Resistors that change resistance with temperature are called thermistors (derived from thermal resistors). They are made from ceramic materials containing a semiconductor, the main types being bead and rod thermistors. The NTC (negative temperature coefficient) thermistor contains a mixture of iron, nickel and cobalt oxides with small amounts of other substances. Thermistors may have a positive (PTC) or negative (NTC) temperature coefficient according to the equation:
$R_f = R_0(1 + \alpha t)$

where R0 equals the resistance at some reference temperature, say 0 °C, Rf is the resistance at some temperature, t °C, above the reference temperature, and α is the temperature coefficient for the material being used. The circuit symbol for a thermistor is shown in Figure 530.

Figure 530 The circuit symbol for a thermistor

For an NTC thermistor, the resistance decreases when the temperature rises and therefore it can pass more current. This current could be used to operate a galvanometer with a scale calibrated in degrees, as used in electronic thermometers or car coolant system gauges. Thermistors are also used in data-logging temperature probes, but the analogue signal has to be converted to a digital signal using an analogue to digital converter (ADC). With normal resistors the resistance becomes higher when the temperature increases and they therefore have a small positive temperature coefficient. Figures 531 and 532 demonstrate how the resistance changes with temperature for both types of thermistors.

[Figures 531 and 532: graphs of resistance against temperature / °C. Figure 532 An NTC thermistor]

An electronic thermometer can be made using an NTC thermistor as shown in Figure 533.
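The linear relation $R_f = R_0(1 + \alpha t)$ quoted above can be explored with a small Python sketch. The numerical values of R0 and α below are hypothetical, chosen only to show the effect of the sign of the temperature coefficient; they are not data for any real component.

```python
def resistance_at(r0, alpha, t):
    """Resistance at t degrees above the reference temperature: Rf = R0 * (1 + alpha * t)."""
    return r0 * (1.0 + alpha * t)


R0 = 100.0            # hypothetical resistance at the reference temperature, in ohms
ALPHA_PTC = 0.004     # hypothetical positive temperature coefficient, per degree C
ALPHA_NTC = -0.002    # hypothetical negative temperature coefficient, per degree C

r_ptc = resistance_at(R0, ALPHA_PTC, 50.0)
r_ntc = resistance_at(R0, ALPHA_NTC, 50.0)
print(f"PTC at +50 C: {r_ptc:.0f} ohm, NTC at +50 C: {r_ntc:.0f} ohm")
```

With a positive coefficient the resistance rises with temperature, and with a negative coefficient it falls, which is the behaviour described for PTC and NTC devices.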
Figure 533 An electronic thermometer

When a metal conducting wire is put under vertical strain, it will become longer and thinner and as a result its resistance will increase. An electrical strain gauge is a device that employs this principle. It can be used to obtain information about the size and distribution of strains in structures such as metal bridges and aircraft, to name but two. A simple gauge consists of very fine parallel threads of a continuous metal alloy wire cemented to a thin piece of paper and hooked up to a resistance-measuring device with thick connecting wires, as shown in Figure 534. When it is securely attached to the metal to be tested, it will experience the same strain as the test metal, and as this happens the strain gauge wire becomes longer and thinner and so its resistance increases.

Figure 534 [Diagram labels: structure under strain, thin paper, connecting lead]

Exercise 5.2

1. Three identical resistors of 3 Ω are connected in parallel in a circuit. The effective resistance would be
A. 1 Ω
B. 3 Ω
C. 6 Ω
D. 9 Ω

2. In a television tube the picture is produced in a fluorescent material at the front of the picture tube by
A. an electrical discharge
B. a beam of positive ions
C. an electrolytic deposition of metal atoms
D. a stream of electrons

3. A 100 W light globe gives out a brighter light than a 60 W globe mainly because
A. a larger potential difference is used to run it
B. its resistance wire filament is longer
C. more electric current flows through it
D. it has a higher amount of inert gas in it

4. In the circuit below, a heater with resistance R is connected in series with a 48 V supply and a resistor S.
[Circuit diagram: 48 V supply, heater R and resistor S in series]
If the potential difference across the heater is to be maintained at 12 V, the resistance of the resistor S should be
A. R/2
B. R/4
C. R/3
D. 3R

5. The diagrams below show circuits X, Y and Z of three resistors, each resistor having the same resistance.
[Circuit diagrams: circuit X, circuit Y, circuit Z]
Which one of the following shows the resistances of the circuits in increasing order of magnitude?
   lowest → highest
A. Y  Z  X
B. Z  X  Y
C. X  Y  Z
D. Z  Y  X

6. [Graph of current against voltage / V for a filament]
The resistance of the filament at 3.0 V is
A. 0.25 Ω
B. 250 Ω
C. 4000 Ω
D. 8000 Ω

7. Metal X has half the resistivity of metal Y and length three times that of Y. If both X and Y have the same surface area, the ratio of their resistance RX / RY is:
A. 3 : 4
B. 2 : 3
C. 1.5 : 1
D. 2 : 9

8. Three identical resistors of 3 Ω are connected in parallel in a circuit. The effective resistance would be
A. 1 Ω
B. 3 Ω
C. 6 Ω
D. 9 Ω

9. If 18 J of work must be done to move 2.0 C of charge from point A to point B in an electric field, the potential difference between points A and B is:
A. 0.1 V
B. 9.0 V
C. 12 V
D. 20 V

10. The fundamental SI unit the ampere is defined in terms of:
A. potential difference and resistance
B. the time rate of change of charge in a circuit
C. the force acting between two current carrying wires
D. the product of charge and time

11. A 100 W light globe gives out a brighter light than a 60 W globe mainly because
A. a larger potential difference is used to run it
B. its resistance wire filament is longer
C. more electric current flows through it
D. it has a higher amount of inert gas in it

12. Three identical lamps L1, L2 and L3 are connected as shown in the following diagram.
[Circuit diagram: lamps L1, L2, L3 and switch S]
When switch S is closed
A. L1 and L3 brighten and L2 goes out
B. all three lamps glow with the same brightness
C. L2 brightens and L1 and L3 remain unchanged
D. L1 and L3 go dimmer and L2 goes out

13. Two resistors are connected in parallel and have the currents I1 and I2 as shown in the diagram.
[Circuit diagram: R1 carrying I1 in parallel with R2 carrying I2]
If the effective resistance of the circuit is R then
A. R = (R1 + R2) / R1R2
B. I1 / R1 = I2 / R2
C. R = R1R2 / (R1 + R2)
D. R = I (R1 + R2) / R1R2

14. The following circuit was set up to determine the internal resistance r of a dry cell. The load resistor was varied from 100 Ω to 150 Ω, and the current in the circuit was measured using an ammeter. The two respective values of the current are given in the circuits.
[Circuit diagrams]
The internal resistance r of the dry cell is
A. 3.0 Ω
B. 7.1 Ω
C. 58.8 Ω
D. 250 Ω

15. Consider the circuit below that contains a 15 V battery with zero internal resistance.
[Circuit diagram: 15 V battery; resistor labels 3, 4 and 4; points X and Y]
A voltmeter connected between the points X and Y should read:
A. 0 V
B. 3 V
C. 6 V
D. 9 V

17. Determine the equivalent resistance when 12 Ω, 6 Ω and 4 Ω are placed in
(a) series
(b) parallel

18. Calculate the work done in moving a 12.0 C through a p.d. of 240 V.

The diagram shows resistances joined in a compound circuit.
[Circuit diagram; labels include I, I1, I2, 2, 4, 6, 20, 12 V and 12.2 V]
Determine the total resistance of the circuit.
Calculate the current that flows through the 2.0 Ω resistor.
Deduce the potential difference across the 20.0 Ω resistor.
Determine the potential difference across the 6.0 Ω resistor.
Calculate the current through the 4.0 Ω resistor.

20. A photo-electric cell draws a current of 0.12 A when driving a small load of resistance 2 Ω. If the emf of the cell is 0.8 V, determine the internal resistance of it under these conditions.

21. In terms of emf and internal resistance, explain why it is possible to re-charge a nickel-cadmium cell while normal dry cells have to be discarded once they are flat.

22. When a dry cell is connected to a circuit with a load resistor of 4.0 Ω, there is a terminal voltage of 1.3 V. When the load resistor is changed to 12 Ω, the terminal voltage is found to be 1.45 V. Calculate
(a) the emf of the cell.
(b) the internal resistance of the cell.

23. Calculate the current flowing through a hair drier if it takes 2.40 × 10^3 C of charge to dry a person's hair in 3.0 minutes.

24. An iron draws 6.0 A of current when operating in a country with a mains supply of 240 V. Calculate the resistance of the iron.

25. An electrical appliance is rated as 2.5 kW, 240 V.
(a) Calculate the current it needs to draw in order to operate.
(b) Determine how much energy would be consumed in 2 hours.

26. A 2.5 kW blow heater is used for eight hours. Calculate the cost of running the blow heater if the electricity is sold at 15 cents per kilowatt-hour.

27. The circuit below refers to the following questions:
[Circuit diagram; labels: 24.0, 4.0, R, V, I, 1.0, 3.0, 100.0 V, 35 A]
Determine the current flowing through R, and the value of resistor R.
Deduce the reading on the voltmeter V.

Describe the meaning of the 12 V on a 12 volt car battery.
A 14 V car battery drops to 12 V when supplying a current of 5.0 A. Determine the internal resistance of the battery.

29. A circuit was set up to investigate the relationship between the current I through a resistor and the magnitude of the resistance R while a constant electromotive force was supplied by a dry cell. The results of the investigation are given in the following table:

R ± 0.5 Ω    I ± 0.1 A    1 / I
2.0          5.0
6.0          1.7
12           0.83
16           0.63
18           0.56

(a) Complete the last column for the inverse of the current, giving the correct unit.
(b) Plot a graph of R against 1 / I.
(c) Describe the relationship that exists between the resistance and the current.
(d) Determine the electromotive force of the dry cell.

30. Starting from the laws of conservation of energy and conservation of charge, derive a formula for calculating the effective resistance of two resistors in parallel.

31. The diagram shows a typical circuit.
[Circuit diagram; labels: 1.0, B, 2.0, C, 0.5, D, 1.0, 1.0, 1.5 V]
(a) Determine the effective resistance of the whole circuit.
(b) Determine the currents flowing in each network resistor.
(c) Determine the potential differences VAB and VAD.
(d) Determine the potential difference between B and D.

32. Determine the resistance of the LDR in the diagram below if a current of 4.5 mA is flowing in the circuit.
[Circuit diagram: LDR, 1 kΩ resistor, 9 V supply]
Every material particle in the Universe attracts every other material particle with a force that is directly proportional to the product of the masses of the particles and that is inversely proportional to the square of the distance between them.

[Diagram: particles of mass m1 and m2 a distance r apart, with forces F12 and F21]

$\mathbf{F}_{12} = \dfrac{G m_1 m_2}{r^2}\,\hat{\mathbf{a}}_{12} = -\mathbf{F}_{21}$

F12 is the force that particle 1 exerts on particle 2 and F21 is the force that particle 2 exerts on particle 1. $\hat{\mathbf{a}}_{12}$ is a unit vector directed along the line joining the particles. m1 and m2 are the masses of the two particles respectively and r is their separation. G is a constant known as the Universal Gravitational Constant and its accepted present day value is 6.67 × 10^-11 N m^2 kg^-2.

There are several things to note about this equation. The forces between the particles obey Newton's third law as discussed in Section 2.7. That is, the forces are equal and opposite. The mass of the particles is in fact their gravitational mass as discussed in Section 2.3.3.
CHAPTER 6
Every particle in the Universe, according to Newton, obeys this law, and this is why the law is known as a universal law. This is the first time in the history of physics that we come across the idea of the universal application of a physical law. It is now an accepted fact that if a physical law is indeed to be a law and not just a rule then it must be universal.

Newton was also very careful to specify the word particle. Clearly any two objects will attract each other because of the attraction between the respective particles of each object. However, this will be a very complicated force and will depend on the respective shapes of the bodies. Do not be fooled into thinking that for objects we need only specify the distance r as the distance between their respective centres of mass. If this were the case it would be impossible to peel an orange. The centre of mass of the orange is at its centre and the centre of mass of the peel is also at this point. If we think that r in Newton's law refers to the distance between the centres of mass of objects, then the distance between the two centres of mass is zero, and the force between the peel and the orange would therefore be infinite.

You will almost invariably, in the IB course and elsewhere, come across the law written in its scalar form as

$F = \dfrac{G m_1 m_2}{r^2}$

However, do not forget its vector nature, nor that it is a force law between particles and not objects or masses.

In Figure 602 a particle P of mass m is placed at point X somewhere in the Universe.

Figure 602

The particle is observed to accelerate in the direction shown. We deduce that this acceleration is due to a gravitational field at X. We do not know the source of the field, but at this stage that does not matter. We are only concerned with the effect of the field. If the mass of P is small then it will not affect the field at X with its own field. We define the gravitational field strength I at X in terms of the force F that is exerted on P as follows:

$I = \dfrac{F}{m}$

That is, the gravitational field strength at a point is the force exerted per unit mass on a particle of small mass placed at that point. From Newton's second law, F = ma, we see that the field strength is actually equal to the acceleration of the particle. The gravitational field strength is often given the symbol g, so we can express the magnitude of the field strength in either N kg^-1 or m s^-2. However, if we are dealing explicitly with field strengths then we tend to use the unit N kg^-1.
6.1.3,4
For a particle of mass m at a distance r from a point mass M, the magnitude of the force between them is

$F = \dfrac{GMm}{r^2}$

So the magnitude of the gravitational field strength $I = \dfrac{F}{m}$ is given by

$I = \dfrac{GM}{r^2}$

If we wish to find the field strength at a point due to two or more point masses, then we use vector addition. (This is another example of the general principle of superposition, see 4.5.5.) In Figure 604 the magnitude of the field strength produced by the point mass M1 at point P is I1 and that of point mass M2 is I2.

Figure 604 Vector addition

If I1 and I2 are at right angles to each other, the resultant magnitude of the field strength I at P is given by:

$I = \sqrt{I_1^2 + I_2^2}$

If the particle of mass M is replaced with a sphere of mass M and radius R, as shown in Figure 605, then, relying on the fact that the sphere behaves as a point mass situated at its centre, the field strength at the surface of the sphere will be given by

$I = \dfrac{GM}{R^2}$

Figure 605 Field strength

For the Earth, of mass Me and radius Re, the field strength at the surface is therefore

$I = \dfrac{GM_e}{R_e^2}$

But the field strength is equal to the acceleration that is produced on a mass, hence the acceleration of free fall at the surface of the Earth, g0, is given by

$g_0 = \dfrac{GM_e}{R_e^2}$

This actually means that whenever you determine the acceleration of free fall g0 at any point on the Earth you are in fact measuring the gravitational field strength at that point. It can also be seen now why the value of g varies with height above the surface of the Earth, since at a height of h above the surface of the Earth the field strength, g, is given by

$g = \dfrac{GM_e}{(R_e + h)^2}$

It can be shown that if we have a hollow sphere then the field strength at all points within the sphere is zero. This fact can be used to deduce an expression for the field strength at points inside the Earth. It is left as an exercise to demonstrate (if desired, since this is not in the syllabus) that if ρ is the mean density of the Earth then at a point distance r from the centre of the Earth the value of g is given by

$g = \dfrac{4}{3}\pi r G \rho$
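The field-strength formulas above can be evaluated with a short Python sketch. It is illustrative only: the function names are hypothetical, G is the value quoted earlier in this chapter, and the Earth mass (6.0 × 10^24 kg) and radius (6.4 × 10^6 m) are the rounded values used in this chapter's examples.

```python
import math

G = 6.67e-11        # universal gravitational constant, N m^2 kg^-2
M_EARTH = 6.0e24    # rounded mass of the Earth, kg
R_EARTH = 6.4e6     # rounded mean radius of the Earth, m


def field_strength(mass_kg, distance_m):
    """Magnitude of the gravitational field of a point mass: I = G M / r^2."""
    return G * mass_kg / distance_m ** 2


def resultant_at_right_angles(i1, i2):
    """Resultant of two perpendicular field strengths: sqrt(I1^2 + I2^2)."""
    return math.hypot(i1, i2)


g_surface = field_strength(M_EARTH, R_EARTH)
# At a height h equal to one Earth radius, r = Re + h = 2 Re.
g_one_radius_up = field_strength(M_EARTH, R_EARTH + R_EARTH)

print(f"g0 = {g_surface:.2f} N/kg")
print(f"g at h = Re: {g_one_radius_up:.2f} N/kg")
```

Doubling the distance from the centre of the Earth reduces the field strength to one quarter, as the inverse square law requires.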
Example

Take the value of g0 = 10 N kg^-1 and the mean radius of the Earth to be 6.4 × 10^6 m to estimate a value for the mass of the Earth.

Solution

We have $g_0 = \dfrac{GM_e}{R_e^2}$, therefore

$M_e = \dfrac{g_0 R_e^2}{G} = \dfrac{10 \times (6.4 \times 10^6)^2}{6.67 \times 10^{-11}} \approx 6 \times 10^{24}$ kg.

Example

Assuming the Earth and Moon to be isolated from all other masses, use the following data to estimate the mass of the Moon.

mass of Earth = 6.0 × 10^24 kg
distance between centre of Earth and centre of Moon = 3.8 × 10^8 m
distance from centre of Earth at which gravitational field is zero = 3.42 × 10^8 m

Solution

Where the gravitational field is zero, the field strengths due to the Earth and the Moon are equal in magnitude, so

$\dfrac{M_e}{(3.42 \times 10^8)^2} = \dfrac{M_m}{(3.8 \times 10^8 - 3.42 \times 10^8)^2}$

Mm = 7.4 × 10^22 kg

6.2.2 State and apply the law of conservation of charge.
6.2.3 Describe and explain the difference in the electrical properties of conductors and insulators.
6.2.4 State Coulomb's law.
6.2.5 Define electric field strength.
6.2.6 Determine the electric field strength due to one or more point charges.
6.2.7 Draw the electric field patterns for different charge configurations.
6.2.8 Solve problems involving electric charges, forces and fields.
© IBO 2007

These properties will be outlined further in a later section of this chapter.
Like charges repel each other.
Unlike charges attract each other.
When a perspex rod is rubbed with a piece of silk, the perspex rod becomes positively charged and the silk becomes negatively charged as demonstrated in Figure 606.
Figure 606 [Perspex and silk before rubbing and after rubbing, showing electron transfer]
The perspex rod and the silk are initially electrically neutral as each material has the same number of positive and negative charges. The action of friction allows the less tightly held electrons of the perspex rod to be transferred to the silk. With time, the excess electrons on the silk will leak off its surface. It has been further found that:

ebonite (a certain black material) rubbed with fur becomes negatively charged.
polythene rubbed with a woollen cloth becomes negatively charged.
cellulose acetate rubbed with a woollen cloth becomes positively charged.
that forms on the conductor will be transferred from the earth through the body of the person holding the conductor. It is said that the conductor is earthed.

In an insulator, the electrons are held tightly by the atomic nuclei and are not as free to move through a material. They can accumulate on the surface of the insulator but they are not conducting.

According to the energy band theory that is used to explain the properties of conductors, semiconductors (such as germanium and silicon), and insulators, the valence or outer-shell electrons are held in the valence band, which is full or partially filled with electrons. When there are many atoms in close proximity (as there are with all materials), there also exists an upper energy band known as the conduction band. The conduction band is empty. A forbidden energy gap exists between the valence and conduction bands. For conductors such as metals, the valence and conduction bands overlap. However, in insulators, the energy gap between the valence band and the conduction band is large. Therefore, electrons cannot move across the forbidden energy gap. Insulators thus have a high electrical resistance, and when perspex or another insulating material is held, the electrons remain on the surface of the insulator and are not able to be conducted through the person. The charge on an insulator will remain for a short period of time until it leaks off the surface or is discharged.

Coulomb's torsion balance (Figure 608) consists of two conducting spheres fixed on an insulated rod that is suspended by a thin wire fibre connected to a suspension head. The whole apparatus is enclosed in a container to make sure that air currents do not disturb the degree of twist of the thin fibre when a test charge is lowered through a small opening in the apparatus component containing the spheres. The twist on the wire can be calibrated from the twist produced by small known forces and is read off the scale on the suspension head.
Using the apparatus, Coulomb could determine the relationship that exists between the magnitude of two charges and force, and between distance and force. The quantitative relationship that exists between electric point charges separated by a distance was first stated by Coulomb in 1785. Using the torsion balance in Figure 608 he measured the quantity of charge, the distance separating the point charges and the force acting on the charged spheres. On the basis of his experiments, he concluded that

1. the force F between two point charges q1 and q2 was directly proportional to the product of the two point charges:

$F \propto q_1 q_2$

2. the force between the two point charges was inversely proportional to the square of the distance r between them:

$F \propto \dfrac{1}{r^2}$

Therefore, it follows that

$F \propto \dfrac{q_1 q_2}{r^2}$

$F = \dfrac{k q_1 q_2}{r^2}$

When F is measured in newtons (N), q1 and q2 in coulombs (C), and r in metres (m), the quantitative statement of Coulomb's law can be expressed mathematically as

$F = \dfrac{q_1 q_2}{4\pi\varepsilon_0 r^2}$

where $k = 1 / 4\pi\varepsilon_0$ is the constant of proportionality. Its value is 9.0 × 10^9 N m^2 C^-2. The part of the constant ε0 is called the permittivity constant of free space. On its own, it has a value of 8.9 × 10^-12 C^2 N^-1 m^-2, and this value applies if the experiment is carried out in air or in a vacuum. If the experiment is carried out in another medium, the value of ε0 will need to be substituted with another value.
The electric field strength or electric field intensity at any point in space, E, is equal to the force per unit charge exerted on a positive test charge.

$E = \dfrac{F}{q}$ or $F = Eq$

Figure 609 Electric field around a positive point charge

Figure 610
Since the forces are the same, that is, F = k q1 q2 / r^2 and F = Eq1, we then have that

$\dfrac{k q_1 q_2}{r^2} = E q_1$

$E = \dfrac{k q_2}{r^2}$ or $E = \dfrac{q_2}{4\pi\varepsilon_0 r^2}$

If q2 is positive, the direction of the electric field is radially outwards from q2, as shown in Figure 611. If q2 is negative, then the direction of the electric field is radially inwards (towards q2).

Figure 611 [Electric field around a positive point charge +q2 and a negative point charge −q2]

Figure 612 [Diagram labels: weakest field, strongest field, E = 0 inside]

For the oppositely charged parallel plates, the electric field is approximately uniform, meaning that the electric field strength is the same at all points between the plates. Note the edge effect, where the electric field lines are now radial at the ends of the plates and thus the electric field strength changes.
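Example

Calculate the force acting between two point charges of +10.0 μC and −5.0 μC separated by a distance of 10.0 cm in a vacuum.

Solution

Using Coulomb's law,

$$F = \frac{k q_1 q_2}{r^2} = \frac{(9.0 \times 10^{9})(10.0 \times 10^{-6})(-5.0 \times 10^{-6})}{(0.100)^2} = -45\ \text{N}$$

As the answer is negative, it implies that the force is attractive. That is, there is a force of attraction of 45 N.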
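The calculation in this example can be reproduced with a short Python sketch. The function name `coulomb_force` is hypothetical; the charges are taken as microcoulombs, which is the reading consistent with the stated 45 N answer.

```python
K = 9.0e9  # Coulomb constant in N m^2 C^-2 (value used in the text)


def coulomb_force(q1, q2, r):
    """Signed force in newtons between point charges q1 and q2 (C) a distance r (m) apart.

    A negative result indicates attraction; a positive result indicates repulsion.
    """
    return K * q1 * q2 / r**2


force = coulomb_force(10.0e-6, -5.0e-6, 0.100)
kind = "attractive" if force < 0 else "repulsive"
print(f"F = {force:.1f} N ({kind})")
```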
Example

The force between two point charges is 20.0 N. If one charge is doubled, the other charge tripled and the distance between them is halved, calculate the resultant force between them.

Solution

Let the original charges be $Q_1$ and $Q_2$, and their separation be $R$, so that the force between these two charges is given by

$$F_1 = \frac{k Q_1 Q_2}{R^2}$$

Now, let the new charges be $q_1$ and $q_2$ and their separation be $r$, so that $q_1 = 2Q_1$, $q_2 = 3Q_2$, and $r = \tfrac{1}{2}R$. Then

$$F = \frac{k q_1 q_2}{r^2} = \frac{k (2Q_1)(3Q_2)}{(\tfrac{1}{2}R)^2} = \frac{6 k Q_1 Q_2}{0.25 R^2} = 24\,\frac{k Q_1 Q_2}{R^2} = 24 F_1$$

This means that the force is 24 times larger than it was originally (i.e., 24 × 20.0 N).

The resultant force is 480 N.

Example

Charges of +1 C are located at the corners of a 45° right-angled triangle as shown in the Figure below.

[Figure: three +1 C point charges at the corners of a right-angled triangle; the two sides meeting at the right angle are each 1 m long]

Determine the resultant force on the charge located at the right angle.

Solution

[Figure: the same triangle with the charges labelled $q_1$ (at the right angle), $q_2$ and $q_3$, and the forces $F_1$ and $F_2$ acting on $q_1$]

A rough vector diagram is shown above. If several point charges are present, the net force on any one of them will be the vector sum of the forces due to each of the others, as shown in the figure above. Since the three point charges are positive, there will be repulsion on the bottom point charge due to each of the other two point charges.

The force on the point charge on the right angle due to the top point charge is calculated as

$$F_1 = \frac{k q_1 q_2}{r^2} = \frac{(9.0 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2})(1\ \text{C})(1\ \text{C})}{(1\ \text{m})^2} = 9 \times 10^{9}\ \text{N}$$

The force on the point charge on the right angle due to the right point charge is calculated as

$$F_2 = \frac{k q_1 q_3}{r^2} = \frac{(9.0 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2})(1\ \text{C})(1\ \text{C})}{(1\ \text{m})^2} = 9 \times 10^{9}\ \text{N}$$

The resultant force is given by the vector addition of the two forces, which can be obtained using the Pythagorean theorem as in the Figure below.

[Figure: vector diagram of $F_1 = 9 \times 10^{9}$ N and $F_2 = 9 \times 10^{9}$ N with their resultant $F_R$]

$$(F_R)^2 = (F_1)^2 + (F_2)^2 = (9 \times 10^{9})^2 + (9 \times 10^{9})^2 = 2\,(9 \times 10^{9})^2$$

$$F_R = 12.7 \times 10^{9}\ \text{N}$$

The direction of the resultant force can be calculated using trigonometry: since $\tan\theta = F_2 / F_1 = 1$, the resultant makes an angle of 45° with each of the two forces.

Example

Now consider the same problem as the previous one, but this time, the charges are set up as shown in the figure below.

[Figure: three +1 C point charges at the corners of a right-angled triangle with two 1 m sides]

Determine the resultant force on the charge located at the right angle.

Solution

The force on the point charge on the right angle due to each of the other two point charges is still calculated as before, i.e.,

$$F_1 = \frac{k q_1 q_2}{r^2} = \frac{(9.0 \times 10^{9})(1)(1)}{(1\ \text{m})^2} = 9.0 \times 10^{9}\ \text{N}$$

However, this time the resultant force is as shown in the figure below.

[Figure: vector diagram of $F_1$ and $F_2$ with their resultant $F_R$]

Again, using Pythagoras' theorem, we have that

$$(F_R)^2 = (F_1)^2 + (F_2)^2 = (9 \times 10^{9})^2 + (9 \times 10^{9})^2 = 2\,(9 \times 10^{9})^2$$

$$F_R = 12.7 \times 10^{9}\ \text{N}$$

The direction of the resultant force can be calculated using trigonometry: the two forces are equal in magnitude and at right angles, so the resultant makes an angle of 45° with each of them.
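The vector addition used in the triangle examples can also be done component by component. The sketch below is a minimal illustration under stated assumptions: the coordinate system (right-angle charge at the origin, the other two charges 1 m away along the y- and x-axes) and the function name `net_force` are choices made for this sketch and are not given in the text.

```python
import math

K = 9.0e9  # Coulomb constant in N m^2 C^-2 (value used in the text)


def net_force(target, others):
    """Net force (Fx, Fy) in newtons on `target` due to each charge in `others`.

    Every charge is a tuple (q, x, y): charge in coulombs, position in metres.
    """
    q0, x0, y0 = target
    fx = fy = 0.0
    for q, x, y in others:
        dx, dy = x0 - x, y0 - y      # vector from the other charge to the target
        r = math.hypot(dx, dy)
        f = K * q0 * q / r**2        # signed: positive means repulsion
        fx += f * dx / r
        fy += f * dy / r
    return fx, fy


# Assumed layout: right-angle charge at the origin, others 1 m up and 1 m right.
corner = (1.0, 0.0, 0.0)
others = [(1.0, 0.0, 1.0), (1.0, 1.0, 0.0)]

fx, fy = net_force(corner, others)
magnitude = math.hypot(fx, fy)
angle = math.degrees(math.atan2(abs(fy), abs(fx)))
print(f"F_R = {magnitude:.3e} N at {angle:.0f} degrees to each component")
```

Summing components avoids relying on the forces being perpendicular, so the same function would also handle a triangle that is not right-angled.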
Example

A point charge of 25 μC experiences a force of $1.0 \times 10^{-4}$ N. Calculate the electric field strength producing this force.

Solution

Using the formula $E = F / q$, we have that

$$E = \frac{1.0 \times 10^{-4}\ \text{N}}{2.5 \times 10^{-5}\ \text{C}} = 4.0\ \text{N C}^{-1}$$

The electric field strength is 4.0 N C$^{-1}$ (in the direction of the force).

Example

Calculate the electric field strength 1.5 cm from a point charge of $1.00 \times 10^{2}$ pC in a vacuum.

Solution

Using the formula $E = q / (4 \pi \varepsilon_0 r^2)$,

$$E = \frac{(9 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2})(1.00 \times 10^{-10}\ \text{C})}{(1.5 \times 10^{-2}\ \text{m})^2} = 4.0 \times 10^{3}\ \text{N C}^{-1}$$

The electric field strength is $4.0 \times 10^{3}$ N C$^{-1}$ (radially outwards).

Example

The field at a particular place due to more than one point charge is the vector sum of the fields caused by each point charge on its own. Calculate the electric field strength at X due to the charges shown in the following Figure.

[Figure: a +1.2 C point charge 1.2 m from X and a −0.8 C point charge 1.0 m from X]

Solution

E due to the −0.80 C point charge is given by $kq / r^2$:

$$E_1 = \frac{(9 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2})(-0.8\ \text{C})}{(1.0\ \text{m})^2} = -7.2 \times 10^{9}\ \text{N C}^{-1}$$

i.e., the E field, $E_1$, has a magnitude of $7.2 \times 10^{9}$ N C$^{-1}$ (radially inwards). The approximate direction is north-east.

E due to the +1.2 C point charge is given by $kq / r^2$:

$$E_2 = \frac{(9 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2})(+1.2\ \text{C})}{(1.2\ \text{m})^2} = 7.5 \times 10^{9}\ \text{N C}^{-1}$$

i.e., the E field, $E_2$, has a magnitude of $7.5 \times 10^{9}$ N C$^{-1}$ (radially outwards). The approximate direction is south-east.

The Figure below shows the vectors of the two electric fields, and chooses an angle to measure as a reference for the direction of the resultant electric field. (Another reference angle could be chosen.)

[Figure: the two charges and point X, with the field vectors $E_1$ and $E_2$ drawn at X]

$$E_R^2 = (7.2 \times 10^{9})^2 + (7.5 \times 10^{9})^2$$

so that $E_R = 10.4 \times 10^{9}$ N C$^{-1}$. Next, we have that

$$\tan\theta = \frac{\text{opposite}}{\text{adjacent}} = \frac{7.2 \times 10^{9}}{7.5 \times 10^{9}} = 0.96, \qquad \theta = 43°\,50'$$

i.e. $\theta = 44°$, and $\alpha = \tan^{-1}(1/1.2) = 40°$. Resultant field at X is $1.0 \times 10^{10}$ N C$^{-1}$ at 1° below a horizontal line drawn through X.

Exercise 6.2

1. A metal sphere with an excess of 11 electrons touches an identical metal sphere with an excess of 15 electrons. After the spheres touch, the number of excess electrons on the second sphere is
A. 26
B. 2
C. 13
D. 4

2. When hair is combed with a plastic comb, the hair becomes positively charged because the comb
A. transfers electrons to the hair.
B. transfers protons to the hair.
C. removes protons from the hair.
D. removes electrons from the hair.

3. Two conducting spheres of charge $Q_1$ and $Q_2$ whose centres are separated by a distance d attract each other with an electrostatic force F. If the charge on each sphere is halved and their separation is reduced to one-quarter of its original value, the new force of attraction is given by:
A. F
B. 4F
C. 8F
D. 64F

4. [Figure: a row of positive charges with locations A, B and C marked]
What is the strength of the electric field relative to locations A, B and C?
A. greater at A than at B
B. greater at C than at A
C. greater at B than at C
D. the same at A, B and C

5. A negatively charged sphere of negligible mass is moving horizontally in an easterly direction. It enters two long parallel plates carrying opposite charges. Which one of the following figures best shows the path followed by the sphere?
[Figure: four candidate paths between the charged plates, labelled A to D]

6. Electric field strength may be defined as
A. the force per unit point charge
B. the force exerted on a test point charge
C. the force per unit charge exerted on a positive test charge
D. the force per unit positive charge

7. [Question text missing; surviving fragments of the answer table for options A to D: q, l; 2q, l; 2q, 2l; 2q, 2l; 2q, 3q; 3q, 6q]
8. [Question text missing; the surviving figure shows charges of +5 C, −5 C and +5 C, a point X and separations d, with answer options A to D]

9. If the magnitude of the charge on each of two negatively charged objects is halved, the electrostatic force between the objects will:
A. remain the same
B. decrease to one-half
C. decrease to one-quarter
D. decrease to one-sixteenth

10. Which of the following is a vector quantity?
A. potential difference
B. electric field intensity
C. electric charge
D. electric power

11. An electrostatic force F exists between two points with a separation of d metres. Which graph best represents the relationship between F and d?
[Figure: four graphs of F against d, labelled A to D]

12. In terms of electrostatic induction, explain why road petrol tankers have a length of chain attached to the rear of the truck that touches the road.

13. Explain why charging the nozzle of a spray painting device will use less paint.

14. Why should spare petrol for cars be carried in a metal rather than a plastic container?

15. Explain why it is not wise to play golf during a thunderstorm.

16. Explain why the inside of a car is a safe place during a thunderstorm.

17. Calculate the charge on $4.0 \times 10^{20}$ protons.

18. Two charges of −6.00 C and +8.00 C attract each other with a force of $3.0 \times 10^{3}$ N in a vacuum. Calculate the distance between the particles.

19. Calculate the electric field strength at a point 2.4 m from a point charge of 5.7 C in air.

20. Deduce the electric field strength at a point midway between charges of $+7.2 \times 10^{-6}$ C and $-3.4 \times 10^{-6}$ C that are 2.0 m apart in air.

21. Describe how the electric field strength at a point is similar to, and different from, the gravitational field strength at a point.
22. Sketch the electric field around two negatively charged point charges separated by distance d if one point charge has twice the charge of the other point charge.

23. Two point charges placed $2.5 \times 10^{-1}$ m apart in paraffin oil carry charges of +7.00 pC and +9.00 pC. Calculate the force on each point charge. ($\varepsilon = 4.18 \times 10^{-11}$ C$^2$ N$^{-1}$ m$^{-2}$).

24. Three identical $2.00 \times 10^{-5}$ C point charges are placed at the corners of an equilateral triangle of sides 1.0 m. The triangle has one apex C pointing up the page and 2 base angles A and B. Deduce the magnitude and direction of the force acting at A.

25. What is the force acting between two point charges of +10.0 μC and −5.0 μC when separated by a distance of 10.0 cm in a vacuum?

26. Point charges of +1 C are located at the corners of a 45° right-angled triangle as shown in the diagram.
[Figure: three +1 C point charges at the corners of a right-angled triangle with two 1 m sides]

30. [Figure: point charges X (+1.0 C) and Y (+4.0 C)]
Deduce at what point the magnitude of the electric field is equal to zero.
Figure 629 Magnetic field patterns of bar magnets

The Danish physicist Hans Christian Oersted (1777-1851) showed conclusively in 1819 that there existed a relationship between electricity and magnetism. He placed a magnetic needle on a freely rotating pivot point beneath and parallel to a conducting wire. He aligned the compass needle and wire so that they lay along the Earth's magnetic north-south orientation. When no current was flowing in the wire, there was no deflection in the needle. However, when the current was switched on, the needle swung to an east-west direction almost perpendicular to the wire. When he reversed the direction of the current, the needle swung in the opposite direction. This is shown in Figure 630.

Figure 630 Oersted's experiment. (diagram labels: N, E, current flow)

Up to this stage, all forces were believed to act along a line joining the sources, such as the force between two masses, the force between two charges or the force between two magnetic poles. With Oersted's findings, the force did not act along the line joining the sources but rather it acted perpendicular to the line of action. On closer examination and analysis, it was determined that the conducting wire produced its own magnetic field. The magnetic needle, upon interaction with the conducting wire's magnetic field, turns so that it is tangentially (not radially) perpendicular to the wire. Therefore, the magnetic field produced by the conducting wire is circular.

Figure 631 Magnetic field around wire carrying a current

A force is also experienced when a moving charge or a beam of moving charges is placed in a magnetic field. This is what happens in a television set. We will look more closely at moving charges in the next chapter on Atomic and Nuclear physics. A charged particle can be accelerated by an electric field or by a magnetic field because it experiences a force when
The direction of the magnetic field for a straight conducting wire can be obtained using the right-hand grip rule demonstrated in Figure 632.

Figure 632 (diagram labels: current flow; fingers curl around the conductor, indicating the direction of the magnetic field)

When the thumb of the right hand points in the direction of the conventional current, the fingers curl in the direction of the magnetic field.
A more convenient two-dimensional representation of currents and magnetic fields is often used, as shown in Figure 633. A cross (×) indicates that the current is into the page and a dot (•) indicates a current flow out of the page.

Figure 633 The magnetic field around a conductor (diagram labels: I out of page, I into page)

The magnetic field due to current in a flat coil (single loop) is shown in Figure 634. Note that the lines of magnetic flux are dense in the middle of the coil and towards the left and right current-carrying wires. This is similar to what happens with a U-shaped magnet (horseshoe magnet). The strength of the magnetic field increases in a coil.

Figure 634 The magnetic field due to current in a single loop (diagram labels: current flow in coil, battery source)

A useful method for determining the polarity of the flat coil is shown in Figure 635.

Figure 635 (diagram labels: North end of a solenoid, South end of a solenoid, I)

If the conventional current is moving anti-clockwise, that end of the loop is a north pole. In this case the left side of the loop is a north pole. If the current flows clockwise, as on the right side of the loop, it is a south pole.

A solenoid consists of many coils of a single long wire, and when a current flows in it, a magnetic field similar to that of a bar magnet is produced. By using plotting compasses as shown in Figure 636, the direction of the magnetic field can be determined.

Figure 636

The field inside the coil is very strong and uniform, and this makes solenoids useful devices in science and technology. The polarity of each end of the solenoid can be determined using the same method as shown in Figure 635. If the conventional current, when viewed head-on, is moving anti-clockwise, that end of the solenoid is a north pole. If the current flows clockwise when viewed head-on, it is a south pole. The north and south poles are shown in Figure 636. Check them for yourself.

The strength of the magnetic field inside a solenoid can be increased by:

1. Increasing the current flowing.
2. Increasing the number of coils per unit length.
3. Inserting a soft iron core in the coil.

When a soft-iron core is inserted into a solenoid and the current is switched on, an electromagnet is produced. If the current is switched off, the solenoid loses its magnetic properties. We say it is a temporary magnet in this case. However, electromagnets can be left on for long periods of time and most magnets in science and industry are of this sort. Electromagnets have many practical uses in scrap metal yards, in electric bells, in particle accelerators and maglev trains. A relay is an electromagnet switch using a small current to switch on a larger current. This can be employed to switch on motors or electronic components commonly used in security systems.
Exercise 6.3 (a)

1. Explain why steel ships tend to become magnetised during construction.

2. Magnets are often fitted to the doors of refrigerators to keep them closed. Use the concept of magnetic induction to explain this practical application.

3. Draw a diagram to show the magnetic field pattern round two magnets with their unlike poles close together where the strength of the field of one magnet is twice the strength of the other field.

4. If a solenoid is viewed from one end, and the current travels in an anti-clockwise direction, what is the polarity of that end?

Figure 637 Conductor in a magnetic field (diagram labels: supply, switch, wood, N, S, current, movement)

The movement is due to the interaction of the two magnetic fields: that of the magnet and that produced by the current-carrying wire. Figure 638 shows the resultant magnetic field in this case. If the current was reversed, then the wire would be catapulted inwards.

Figure 638 The motor effect (diagram labels: magnetic field, current)

The direction of the force experienced by moving currents in a magnetic field can be determined by the vector addition of the two fields. However, an easier way of determining the direction of the force is to use a right-hand palm rule or Fleming's left-hand rule. There are a variety of hand rules in use, and which rules are given depends very much on the textbook you use. It is really up to you to use the hand rule that you prefer. Figure 639 shows three hand rules commonly used.

Figure 639 Hand rules used to show the direction of force. (Figure (a): fingers point along lines of magnetic field. Figure (c): left hand, with field B along the first finger, current along the second finger and force F along the thumb.)

In Figure 639 (a), if the fingers of your right hand point in the direction of the magnetic field B, and your thumb points in the direction of the conventional current, then your palm points in the direction of the force. This rule is called the right-hand palm rule. An alternative to this is shown in Figure 639 (b). (This is the rule preferred by the author because the fingers give a sense of flow of conventional current, the palm points north-south like a bar magnet, and the thumb is the direction of movement or force.) Figure 639 (c) is Fleming's left-hand rule. The first finger gives the direction of the magnetic field, the second finger gives the direction of conventional current, and the thumb gives the movement or force direction. Try these rules for the examples in Figures 637 and 638 to see which one you prefer. Note that these rules are for conventional current and not true electron flow. If electron flow is to be determined, apply a rule of choice and find the force for conventional flow, say north, then state your answer as the opposite direction, in this case south.
The magnitude of the force on a current-carrying conductor in a magnetic field depends on:

1. the strength of the magnetic field B, measured in teslas (T)
2. the current flowing in the wire I, measured in amperes (A)
3. the length of the conductor in the magnetic field l, measured in metres (m).

So that

$$F = IlB$$

This force is greatest when the magnetic field is perpendicular to the conductor. Sometimes the wire in the magnetic field is at an angle $\theta$ to the magnetic field. In this case

$$F = IlB \sin\theta$$

Therefore, as $\theta$ decreases, so too does the force. When $\theta = 0°$ the current in the conductor is moving parallel to the magnetic field and no force on the conductor occurs. The force experienced can be increased if the number of turns of wire carrying the current is increased. In this case the force is given by $F = IlBn$, where n is the number of turns of wire. When there are a number of turns of wire suspended between a magnetic field, the device is commonly called a wire toroid.

In order to determine the magnitude of the force experienced by a single point charge q, we will follow through the following derivation.

[Figure: a moving positive charge q in a magnetic field B, with the force F and the direction of charge movement marked]

The velocity of the particle is given by

$$v = \frac{l}{t} \quad \Rightarrow \quad l = vt$$

The current is the charge passing per unit time, $I = q / t$, so substituting into $F = IlB$ gives

$$F = \frac{q}{t}\,(vt)\,B$$

That is,

$$F = qvB$$

If a charged particle enters a uniform magnetic field at an angle $\theta$ other than 90°, the force it experiences is given by

$$F = qvB \sin\theta$$

When $\theta = 90°$, and the magnetic field is uniform, the particle will undergo uniform circular motion as the force it experiences is at right angles to its motion. The radius of its circular motion is given by:

$$qvB = \frac{mv^2}{r} \quad \Rightarrow \quad r = \frac{mv}{qB}$$

When the particle enters the field at an angle other than a right angle, it will follow a helical path.

Example

A wire that is carrying a current of 3.50 A east has 2.00 m of its length in a uniform magnetic field of magnetic flux density $5.00 \times 10^{-7}$ T directed vertically downwards into the paper. Determine the magnitude and direction of the force it experiences.

Solution

Using the formula for the force on a wire in a magnetic field, we have:

$$F = IlB = (3.5\ \text{A})(2.00\ \text{m})(5.00 \times 10^{-7}\ \text{T}) = 3.50 \times 10^{-6}\ \text{N}$$

The force on the conductor is $3.50 \times 10^{-6}$ N north.

Example

An electron is moving with a speed of $3.0 \times 10^{5}$ m s$^{-1}$ in a direction that is at right angles to a uniform magnetic field of $3.0 \times 10^{-3}$ T. Calculate

a. the force exerted on the electron.
b. the radius of the path of the electron.

Solution

a.

$$F = qvB = (1.6 \times 10^{-19}\ \text{C})(3.0 \times 10^{5}\ \text{m s}^{-1})(3.0 \times 10^{-3}\ \text{T}) = 1.44 \times 10^{-16}\ \text{N}$$

The force exerted on the electron is $1.4 \times 10^{-16}$ N at right angles to the magnetic field and the path of its motion.

b. The force on the charge is given by either of the formulas $F = qvB$ and $F = mv^2 / r$. Equating these two expressions we can determine the radius of the path. That is,

$$r = \frac{mv}{qB} = \frac{(9.11 \times 10^{-31}\ \text{kg})(3.0 \times 10^{5}\ \text{m s}^{-1})}{(1.6 \times 10^{-19}\ \text{C})(3.0 \times 10^{-3}\ \text{T})} = 5.69 \times 10^{-4}\ \text{m}$$

The radius of the path is $5.7 \times 10^{-4}$ m.
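The three magnetic-force relations used in this section can be gathered into a small Python sketch and checked against the two worked examples. The function names are hypothetical, and the exponents ($10^{-7}$ T for the wire, $10^{-3}$ T for the electron) are the readings consistent with the stated answers.

```python
import math


def force_on_wire(current, length, field, angle_deg=90.0):
    """F = I l B sin(theta) in newtons; angle is between the wire and the field."""
    return current * length * field * math.sin(math.radians(angle_deg))


def force_on_charge(charge, speed, field, angle_deg=90.0):
    """F = q v B sin(theta) in newtons; angle is between the velocity and the field."""
    return charge * speed * field * math.sin(math.radians(angle_deg))


def orbit_radius(mass, speed, charge, field):
    """r = m v / (q B) in metres, for motion at right angles to a uniform field."""
    return mass * speed / (charge * field)


# Wire example: 3.50 A, 2.00 m, 5.00e-7 T, wire at right angles to the field.
print(f"wire:     F = {force_on_wire(3.50, 2.00, 5.00e-7):.2e} N")

# Electron example: 3.0e5 m/s at right angles to a 3.0e-3 T field.
print(f"electron: F = {force_on_charge(1.6e-19, 3.0e5, 3.0e-3):.2e} N")
print(f"electron: r = {orbit_radius(9.11e-31, 3.0e5, 1.6e-19, 3.0e-3):.2e} m")
```

Setting `angle_deg=0` in either force function returns zero, which matches the statement that no force acts when the current or velocity is parallel to the field.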
Exercise 6.3 (b)

1. A suitable unit of magnetic field strength is
A. A N$^{-1}$ m$^{-1}$
B. kg s$^{-2}$ A$^{-1}$
C. A m N$^{-1}$
D. kg A s$^{2}$

2. An electron enters a uniform magnetic field that is at right angles to its original direction of movement. The path of the electron is:
A. an arc of a circle.
B. helical.
C. part of a parabola.
D. a straight line.

3. Two long straight wires with currents flowing in opposite directions experience a force because:
A. the current in both wires increases
B. the current in both wires decreases
C. the current in the wires produces an attraction
D. the current in the wires produces a repulsion

4. [Question text missing; the surviving figure shows a current in a wire between the S and N poles of a magnet]
A. outwards
B. inwards
C. it does not move
D. sideways

5. An electron passes through a uniform magnetic field of 0.050 T at right angles to the direction of the field at a velocity of $2.5 \times 10^{6}$ m s$^{-1}$. The magnitude of the force on the electron in newtons is:
A. $2.0 \times 10^{-14}$
B. $4.0 \times 10^{-14}$
C. $8.0 \times 10^{-14}$
D. zero

6. Two parallel wires carry currents I of equal magnitude in opposite directions as shown in the diagram.
[Figure: two parallel wires with the lines X, Y and Z marked]
The line along which the magnetic fields cancel is
A. X
B. Y
C. Z
D. the magnetic fields do not cancel

7. A beam of protons enters a uniform magnetic field directed into the page as shown.
[Figure: protons entering the field]
The protons will experience a force that pushes them
A. into the page
B. out of the page
C. upwards
D. downwards

8. [Figure: a coil in a circuit, parts (a) and (b)]
When an electric current flows in the circuit, the end of the coil labelled X will be:
A. a south pole
B. a north pole
C. either a north or a south pole
D. neither a north nor a south pole

9. An ion carrying a charge of $3.2 \times 10^{-19}$ C enters a field of magnetic flux density 1.5 T with a velocity of $2.5 \times 10^{5}$ m s$^{-1}$ perpendicular to the field. Calculate the force on the ion.

10. A straight wire of length 50 cm carries a current of 50 A. The wire is at right angles to a magnetic field of 0.3 T. Calculate the force on the wire.

11. A straight wire of length 1.4 m carries a current of 2.5 A. If the wire is in a direction of 25° to a magnetic field of 0.7 T, calculate the force on the wire.

12. A beam of electrons enters a pair of crossed electric and magnetic fields in which the electric field strength is $3.0 \times 10^{4}$ V m$^{-1}$ and the magnetic flux density is $1.0 \times 10^{-2}$ T. If the beam is not deflected from its path by the fields, what must be the speed of the electrons?

13. An electron in one of the electron guns of a television picture tube is accelerated by a potential difference of $1.2 \times 10^{4}$ V. It is then deflected by a magnetic field of $6.0 \times 10^{-4}$ T. Determine
i. the velocity of the electron when it enters the magnetic field.
ii. the radius of curvature of the electron while it is in the magnetic field.

14. A point charge of 15 C moving due north at $1.0 \times 10^{3}$ m s$^{-1}$ enters a uniform magnetic field of $1.2 \times 10^{-4}$ T directed into the page. Determine the magnitude and direction of the force on the charge.
charge on the proton. In 1910, the American physicist Robert Millikan made the first precise determination of the charge on an electron as $1.602 \times 10^{-19}$ C (the current value is $1.60217733 \times 10^{-19}$ C). Earlier, in 1897, the English physicist J. J. Thomson had measured the ratio of the electron charge to its mass, $e / m_e$. From the value of $e / m_e$ found by Thomson, Millikan's determination of $e$ enabled the mass of the electron to be determined. The current value for the electron mass $m_e$ is $9.10938188 \times 10^{-31}$ kg. The current value for the mass of the proton $m_p$ is $1.67262158 \times 10^{-27}$ kg.
CHAPTER 7
Rutherford studied how α-particles were absorbed by matter and found that they were readily absorbed by thin sheets of metal. However, he found that they were able to penetrate gold-foil which, due to the malleable nature of gold, can be made very thin. Figure 701 illustrates the principle of the experiment carried out by Geiger and Marsden.

Figure 701 Geiger and Marsden's experiment (diagram labels: lead container, radium, evacuated region)

A piece of radium is placed in a lead casket such that a narrow beam of α-particles emerges from the tunnel in the casket. The particle beam is incident on a piece of gold-foil behind which is placed a fluorescent screen. The whole apparatus is sealed in a vacuum. The result is rather surprising. Most of the α-particles go straight through, but a few are scattered through quite large angles whilst some are even turned back on themselves. Rutherford was led to the conclusion that most of the gold-foil was empty space. However, to account for the scattering, some of the α-particles must encounter a relatively massive object which deflects them from their path. Imagine firing a stream of bullets at a bale of hay in which are embedded a few stones. Most of the bullets go straight through, but the ones which strike a stone will ricochet at varying angles depending on how they hit the stone. Rutherford suggested that the atom consisted of a positively charged centre (the stones) about which there was a mist of electrons (the straw). He neglected the interaction of the α-particles with the electrons because of the latter's tiny mass and diffuse distribution. The significant reaction was between the massive positively charged centre of the gold atoms and the incoming α-particles. Rutherford was in fact quoted as saying "It was quite the most incredible event that has ever happened to me in my life. It was almost as incredible as if you had fired a 15-inch shell at a piece of tissue paper and it came back and hit you." Figure 702 illustrates the paths that might be followed as the α-particles pass through the gold-foil.

Figure 702 The paths of alpha particles through gold foil (diagram labels: gold nucleus, (c) miss)

It is the results of the Geiger-Marsden experiment that led to the atomic model outlined in 7.1.1 above. Detailed analysis of their results indicated that the gold atoms had a nuclear diameter of the order of $10^{-14}$ m, which meant that the radius of a proton is of the order of $10^{-15}$ m. Work contemporary to that of Geiger and Marsden, using X-rays, had shown the atomic radius to be of the order of $10^{-10}$ m. To give some understanding of the meaning of the expression "an atom is mainly empty space", consider the nucleus of the hydrogen atom to be the size of a tennis ball; then the radial orbit of the electron would be about 2 km.
Emission spectra
If a sufficiently high potential difference is applied between the ends of a glass tube that is evacuated apart from the presence of a small amount of mercury vapour, the tube will glow. To study the radiation emitted by the tube, the emitted radiation could be passed through a slit and then through a dispersive medium such as a prism. The prism splits the radiation into its component wavelengths. If the light emerging from the prism is brought to a focus, an image of the slit will be formed for each wavelength present in the radiation. Whereas the radiation from an incandescent solid (e.g. the filament of a lit lamp) produces a continuous spectrum of colours, the mercury source produces a line spectrum. Each line in this spectrum is an image of the slit and, in the visible region, mercury gives rise to three distinct lines: yellow, green and blue.

The study of line spectra is of great interest as it is found that all the elements in the gaseous phase give rise to a line spectrum that is characteristic of the particular element. In fact, elements can be identified by their characteristic spectrum, and this is one way that astronomers are able to determine the elements present in the surface of a star (see Option E). Also, the spectrum of an element provides clues as to the atomic structure of the atoms of the element.

In 1905, based on the work of Max Planck, Einstein proposed that light is made of small packets of energy called photons. Each photon has an energy $E$ given by $E = hf$, where $h$ is a constant known as the Planck constant and has a value of $6.6 \times 10^{-34}$ J s. The photon model of light suggests an atomic model that accounts for the existence of the line spectra of the elements.
If it is assumed that the electrons in atoms can only have certain discrete energies or, looking at it another way, can only occupy certain allowed energy levels within an atom, then when an electron moves from one energy level to a lower energy level, it emits a photon whose energy is equal to the difference in the energy of the two levels. The situation is somewhat analogous to a ball bouncing down a flight of stairs; instead of the ball losing
Figure 703 Two of the allowed energy levels of atomic hydrogen.

When the electron makes the transition as shown in Figure 703, it emits a photon whose frequency is given by the Planck formula, i.e.

$$f = \frac{\Delta E}{h}$$

where $\Delta E$ is the difference in energy between the two levels.
This is in fact the measured value of the wavelength of the red line in the visible spectrum of atomic hydrogen. In most situations, the electrons in an atom will occupy the lowest possible energy states. Electrons will only move to higher energy levels if they obtain energy from somewhere such as when the element is heated or, as mentioned above, when subjected to an electrical discharge. When the electrons are in their lowest allowed levels, the atom is said to be unexcited and when electrons are in higher energy levels, the atom is said to be excited. To move from a lower level to a higher energy level, an electron must absorb an amount of energy exactly equal to the difference in the energy between the levels. (For IBO reference, carrying out calculations using the Planck relationship will not be expected at SL; the above is just to try and help explain how the existence of line spectra strongly supports the existence of atomic energy levels).
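The Planck relationship can be illustrated with a short Python sketch. The energy levels used here are an assumption: the numerical values from Figure 703 are not reproduced in this text, so the sketch takes the commonly quoted hydrogen levels of −1.51 eV (n = 3) and −3.40 eV (n = 2), whose transition gives the red line. The function names are hypothetical.

```python
H = 6.6e-34    # Planck constant in J s (value used in the text)
C = 3.0e8      # speed of light in m s^-1
EV = 1.6e-19   # joules per electronvolt


def photon_frequency(delta_e):
    """Frequency in Hz of a photon carrying energy delta_e (J), from E = hf."""
    return delta_e / H


def photon_wavelength(delta_e):
    """Wavelength in m of a photon carrying energy delta_e (J), from lambda = hc / E."""
    return H * C / delta_e


# Assumed levels (not read from Figure 703): n = 3 at -1.51 eV, n = 2 at -3.40 eV.
upper, lower = -1.51, -3.40
delta_e = (upper - lower) * EV

print(f"f      = {photon_frequency(delta_e):.2e} Hz")
print(f"lambda = {photon_wavelength(delta_e) * 1e9:.0f} nm")  # red end of the visible spectrum
```

With these assumed levels the wavelength comes out close to 650 nm, in the red part of the visible spectrum.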
Absorption spectra
Atomic line spectra can be obtained in another way. If the radiation from a filament lamp passes through a slit and then a tube containing unexcited mercury vapour and is then focussed after passing through a prism, the resulting spectrum is continuous but is crossed with dark lines. These lines correspond exactly to the lines in the emission
spectrum of mercury. To understand this, suppose the difference in energy between the lowest energy level and the next highest level in mercury atoms is $E$; then to make the transition between these levels, an electron must absorb a photon of energy $E$. There are many such photons of this energy present in the radiation from the filament. On absorbing one of these photons, the electron will move to the higher level but then almost immediately fall back to the lower level and, in doing so, will emit a photon also of energy $E$. However, the direction in which this photon is emitted will not necessarily be in the direction of the incident radiation. The result of this absorption and re-emission is, therefore, a sharp drop in intensity in the incident radiation that has a wavelength determined by the photon energy $E$. The phenomenon of absorption spectra is of great importance in the study of molecular structure since excitation of molecules will often cause them to dissociate before they reach excitation energies.

NUCLEAR STRUCTURE

7.1.5 Explain the terms nuclide, isotope and nucleon.
7.1.6 Define nucleon number A, proton number Z and neutron number N.
7.1.7 Describe the interactions in a nucleus.
IBO 2007

$${}^{A}_{Z}\mathrm{X}$$

where A is the nucleon number and Z is the proton number.

We also define the neutron number N as $N = A - Z$. It should be mentioned that it is the proton number that identifies a particular element and hence the electronic configuration of the atoms of the element. And it is the electronic configuration that determines the chemical properties of the element and also many of the element's physical properties such as electrical conductivity and tensile strength.

The neutron

The explanation for the existence of isotopes did not come until 1932 when James Chadwick, an English physicist, isolated an uncharged particle that has a mass very nearly the same as the proton mass. Since 1920, both Rutherford and Chadwick had believed that an electrically neutral particle existed. An uncharged particle will not interact with the electric fields of the nuclei of matter through which it is passing and will therefore have considerable penetrating ability. In 1930 Walther Bothe and Herbert Becker found that when beryllium is bombarded with α-particles a very penetrating radiation was produced. It is this radiation that Chadwick showed to consist of identical uncharged particles. These particles he called neutrons, and the current value of the neutron mass $m_n$ is $1.67492716 \times 10^{-27}$ kg.

The neutron explains the existence of isotopes in that a nucleus is regarded as being made up of protons and neutrons. The nuclei of the different isotopes of an element have the same number of protons but have different numbers of neutrons. For example, there are three stable isotopes of oxygen; each nucleus has eight protons but the nuclei of the three isotopes have eight, nine and ten neutrons respectively. In the study of particle physics, the proton and neutron are regarded as different charge states of the same particle called the nucleon.

(Particle physics is studied in depth in Option D-SL and Option J-HL.)
Nuclear interactions
Shortly after the discovery of the neutron, Hideki Yukawa, a Japanese physicist, postulated a strong force of attraction between nucleons that overcomes the Coulomb repulsion between protons. The existence of the force postulated by Yukawa is now well established and is known as the strong nuclear interaction. The force is independent of whether the particles involved are protons or neutrons and, at nucleon separations of about 1.3 fm, the force is some 100 times stronger than the Coulomb force between protons. At separations greater than 1.3 fm, the force falls rapidly to zero. At smaller separations the force is strongly repulsive, thereby keeping the nucleons at an average separation of about 1.3 fm. (1 femtometre = $10^{-15}$ m)
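To get a feel for the size of the forces involved, the Coulomb repulsion between two protons at a separation of 1.3 fm can be estimated from Coulomb's law. The following is a minimal sketch; the constant values and the names `K`, `E_CHARGE` and `coulomb_force` are chosen here for illustration and are not from the text, while the factor of 100 for the strong force is the rough figure quoted above.

```python
# Rough estimate of the forces between two protons 1.3 fm apart.
K = 8.99e9            # Coulomb constant in N m^2 C^-2 (standard value, assumed here)
E_CHARGE = 1.602e-19  # elementary charge in C (standard value, assumed here)


def coulomb_force(separation_m):
    """Magnitude of the electrostatic force between two protons, in newtons."""
    return K * E_CHARGE ** 2 / separation_m ** 2


separation = 1.3e-15                    # 1.3 fm, the separation quoted in the text
f_coulomb = coulomb_force(separation)
f_strong = 100 * f_coulomb              # "some 100 times stronger", as quoted in the text

print(f"Coulomb repulsion at 1.3 fm: {f_coulomb:.0f} N")
print(f"Strong force estimate:       {f_strong:.1e} N")
```

The Coulomb force comes out at a little over 100 N, which is very large for a particle of mass about $10^{-27}$ kg, so the strong interaction has to be larger still to hold the nucleus together.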
Name | Identification | Charge | Kinetic energy range/MeV | Rest mass/kg | Penetration range
alpha, α | helium nucleus | +2e | 2 to 10 | $6.70 \times 10^{-27}$ | 4 cm of air; sheet of thin paper
beta, β⁻, β⁺ | electron/positron | −e, +e | 0.1 to 1.0 | $9.11 \times 10^{-31}$ | 1-3 m in air; thin aluminium sheet
gamma, γ | high frequency em radiation (photon) | zero | $10^{-3}$ to 3 | zero | several cm of lead
Figure 704 A summary of radiations and their properties (table)

very penetrating; in fact a considerable thickness of lead is required to stop them. The energies are given in MeV (1 eV = $1.6 \times 10^{-19}$ J; see 5.1.3).
α-decay
A nucleus of a radioactive element that emits an α-particle must transform into a nucleus of another element. The nucleus of the so-called parent element loses two neutrons and two protons. Therefore the nucleon number (A) decreases by 4 and the proton number (Z) decreases by 2. The nucleus formed by this decay is called the daughter nucleus. We may express such a nuclear decay by the nuclear reaction equation

$^{A}_{Z}\mathrm{X} \rightarrow {}^{A-4}_{Z-2}\mathrm{Y} + {}^{4}_{2}\mathrm{He}$
(parent) (daughter) (α-particle)

For example, the isotope uranium-238 is radioactive and decays by emitting α radiation to form the isotope thorium-234, the nuclear reaction equation being:

$^{238}_{92}\mathrm{U} \rightarrow {}^{234}_{90}\mathrm{Th} + {}^{4}_{2}\mathrm{He}$

Or simply

$^{238}_{92}\mathrm{U} \rightarrow {}^{234}_{90}\mathrm{Th} + \alpha$

Clearly, in any nuclear radioactive decay equation, the nucleon number and proton number of the left-hand side of the equation must equal the nucleon number and proton number of the right-hand side of the equation.

so travels at the speed of light; it is uncharged and rarely reacts with matter (millions upon millions of neutrinos pass through the human body every second). However, in 1956, the neutrino was finally detected. It turns out, though, that in β⁻-decay, in order to conserve other quantities, it must be an antineutrino and not a neutrino that is involved. (Conservation laws, along with particles and their antiparticles, are discussed in detail in Option J.) The decay equation of a free neutron is

$^{1}_{0}\mathrm{n} \rightarrow {}^{1}_{1}\mathrm{p} + {}^{0}_{-1}\mathrm{e} + \bar{\nu}$

Or simply

$\mathrm{n}^{0} \rightarrow \mathrm{p}^{+} + \mathrm{e}^{-} + \bar{\nu}$

where $\bar{\nu}$ is the symbol for the antineutrino. The origin of the β⁻ particle is the decay of a neutron within a nucleus into a proton. The nucleon number of a daughter nucleus formed by β⁻ decay will therefore remain the same as the nucleon number of the parent nucleus. However, its proton number will increase by 1. Hence we can write in general that

$^{A}_{Z}\mathrm{X} \rightarrow {}^{A}_{Z+1}\mathrm{Y} + {}^{0}_{-1}\mathrm{e} + \bar{\nu}$
(parent) (daughter) (electron) (antineutrino)

For example, a nucleus of the isotope thorium-234, formed by the decay of uranium-238, undergoes β⁻ decay to form a nucleus of the isotope protactinium-234. The nuclear reaction equation for this decay is

$^{234}_{90}\mathrm{Th} \rightarrow {}^{234}_{91}\mathrm{Pa} + {}^{0}_{-1}\mathrm{e} + \bar{\nu}$

Or simply

$^{234}_{90}\mathrm{Th} \rightarrow {}^{234}_{91}\mathrm{Pa} + \mathrm{e}^{-} + \bar{\nu}$

The origin of β⁺ particles is the decay of a proton within a nucleus into a neutron. The decay equation of the proton is

$^{1}_{1}\mathrm{p} \rightarrow {}^{1}_{0}\mathrm{n} + {}^{0}_{+1}\mathrm{e} + \nu$

For example, a nucleus of the isotope ruthenium-90 decays to a nucleus of the isotope technetium-90. The nuclear reaction equation for this decay is

$^{90}_{44}\mathrm{Ru} \rightarrow {}^{90}_{43}\mathrm{Tc} + {}^{0}_{+1}\mathrm{e} + \nu$

Or simply

$^{90}_{44}\mathrm{Ru} \rightarrow {}^{90}_{43}\mathrm{Tc} + \mathrm{e}^{+} + \nu$

Unlike free neutrons, free protons are stable (although current theory suggests that they have an average life of $10^{30}$ years). This probably explains why most of the observable matter in the universe is hydrogen.

γ radiation
The source of γ radiation in radioactive decay arises from the fact that the nucleus, just like the atom, possesses energy levels. In α and β decay, the parent nuclide often decays to an excited state of the daughter nuclide. The daughter nuclide then drops to its ground state by emitting a photon. Nuclear energy levels are of the order of MeV, hence the high energy of the emitted photon. (Nuclear energy levels are discussed in more detail in topic 11.)
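The rule that the nucleon number and proton number must be the same on both sides of a decay equation can be checked mechanically. The sketch below represents each particle as a pair (A, Z); the function names and this representation are illustrative choices, not part of the text. The three decays tested are the ones quoted in this section.

```python
# Each particle is written as (nucleon number A, proton number Z).
ALPHA = (4, 2)       # helium nucleus
ELECTRON = (0, -1)   # beta-minus particle
POSITRON = (0, 1)    # beta-plus particle
NEUTRINO = (0, 0)    # neutrino or antineutrino: no nucleons, no charge


def totals(side):
    """Return (total nucleon number, total proton number) for one side."""
    return (sum(a for a, _ in side), sum(z for _, z in side))


def is_balanced(left, right):
    """True if A and Z are both conserved between the two sides."""
    return totals(left) == totals(right)


# Uranium-238 alpha decay to thorium-234
print(is_balanced([(238, 92)], [(234, 90), ALPHA]))
# Thorium-234 beta-minus decay to protactinium-234
print(is_balanced([(234, 90)], [(234, 91), ELECTRON, NEUTRINO]))
# Ruthenium-90 beta-plus decay to technetium-90
print(is_balanced([(90, 44)], [(90, 43), POSITRON, NEUTRINO]))
```

A wrongly written equation, for instance uranium-238 decaying to a daughter with proton number 91 plus an α-particle, fails the same check.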
On the plus side, the controlled use of the radiations associated with radioactivity is of great benefit in the treatment of cancerous tumours (see Option I).

is because it is very nearly the same distance from each of the other protons. However, the strong nuclear force is very short range and is only really effective between adjacent neighbours. So, as the size of the nucleus increases, proportionally more and more neutrons must be added. Each time protons and neutrons are added, they have to go into a higher energy state, and eventually a nuclear size is reached at which the nucleus becomes unstable (a bit like piling bricks on top of one another). The nucleus then tries to reach a more stable state by emitting a nuclear subgroup consisting of two protons and two neutrons, i.e. a helium nucleus (α particle).

Consider now a nucleus of the isotope $^{65}_{28}\mathrm{Ni}$. This nucleus is unstable because the neutron excess is too great, each neutron added having to go into a higher energy state. To become stable, one of the neutrons will change into a proton by emitting an electron, i.e. a β⁻ particle. On the other hand, a nucleus of the isotope $^{54}_{25}\mathrm{Mn}$ does not contain enough neutrons to be stable. To become stable, a proton changes into a neutron by emitting a positron, i.e. a β⁺ particle.
[Figure: graph with horizontal axis "Proton number (Z)", scale 10 to 90]

HALF-LIFE
7.2.6 State that radioactive decay is a random and spontaneous process and that the rate of decay decreases exponentially with time.
7.2.7 Define the term radioactive half-life.
7.2.8 Determine the half-life of a nuclide from a decay curve.
7.2.9 Solve radioactive decay problems involving integral numbers of half-lives.
IBO 2007
Exponential decay
There are many examples in nature where the rate of change of a quantity at a particular instant is proportional to the quantity at that instant. A very good example of this is the volume rate of flow of water from a hole in the bottom of a can. Here the volume rate of flow is proportional to the volume of water in the can at any instant. Rates of change such as this all possess a very important property, namely that the quantity halves in value in equal intervals of time. For example, if the quantity Q in question has a value of 120 at time zero and a value of 60 twenty seconds later, then it will have a value of 30 a further 20 seconds later and a value of 15 another 20 seconds after that. If the quantity Q is plotted against time t, we get the graph shown in Figure 706.
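The halving property described above can be reproduced with a few lines of code. This is a minimal sketch using the numbers from the example (Q = 120 at time zero, halving every 20 s); the function name `quantity` is an illustrative choice.

```python
def quantity(q0, halving_time, t):
    """Value at time t of a quantity that starts at q0 and halves every halving_time."""
    return q0 * 0.5 ** (t / halving_time)


# Values from the text: Q = 120 at t = 0, halving every 20 s.
for t in (0, 20, 40, 60):
    print(f"t = {t:2d} s   Q = {quantity(120, 20, t):.0f}")
```

The same function also gives values between the halving points, for example at t = 10 s, which is what the smooth curve of the graph represents.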
[Figure 706 Exponential decay: graph of Q (scale 0 to 120) against t/s (scale 0 to 100)]

This type of decay is called an exponential decay. The time it takes for the quantity to reach half its initial value is called the half-life. Clearly the half-life is independent of the initial value of the quantity and depends only on the physical nature of the system to which the quantity refers. For instance, in the case of water flowing from a can, the half-life will depend on the size of the hole in the can, and we might expect it to depend on the temperature of the water and the amount and type of impurities in the water. (Perhaps here is a good idea for an experiment to assess the Design criterion in IA.) For radioactive elements, the half-life depends only on the particular nuclide and nothing else.

A freshly prepared sample of the isotope iodine-131 has an initial activity of $2.0 \times 10^{5}$ Bq. After 40 days the activity of the sample is $6.3 \times 10^{3}$ Bq. Estimate the half-life of iodine-131. By plotting a suitable graph, estimate the activity of the sample after 12 days.

Solution
If we keep halving the activity $2.0 \times 10^{5}$ Bq, we get $1.0 \times 10^{5}$, $0.5 \times 10^{5}$, $0.25 \times 10^{5}$, $0.125 \times 10^{5}$, $0.0625 \times 10^{5}$ ($\approx 6.3 \times 10^{3}$) Bq. So 5 half-lives = 40 days. Hence 1 half-life = 8 days. Another way of looking at this is to note that the activity of a sample after n half-lives is $\frac{A_0}{2^n}$, where $A_0$ is the initial activity. For this situation we have

$2^n = \frac{2.0 \times 10^{5}}{6.3 \times 10^{3}} \approx 32$, giving n = 5.
The data points for a graph showing the variation with time of the activity are shown below:

time/days | 0 | 8 | 16 | 24
activity/$\times 10^{5}$ Bq | 2.0 | 1.0 | 0.50 | 0.25

So the graph is as follows:

[Graph: activity/$\times 10^{5}$ Bq (scale 0 to 2.5) against time/days (scale 0 to 30)]

From which we see that after 12 days the activity is $7.0 \times 10^{4}$ Bq.
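The iodine-131 example can also be checked numerically rather than graphically. The sketch below assumes the activity follows $A = A_0 / 2^{n}$ with n = t / (half-life), as stated in the solution; the function names `half_life` and `activity` are illustrative.

```python
import math


def half_life(a0, a, elapsed):
    """Half-life from the initial activity a0 and the activity a after time `elapsed`."""
    n = math.log2(a0 / a)        # number of half-lives that have passed
    return elapsed / n


def activity(a0, t_half, t):
    """Activity after time t for a sample with initial activity a0."""
    return a0 * 0.5 ** (t / t_half)


t_half = half_life(2.0e5, 6.3e3, 40)     # activities in Bq, time in days
print(f"half-life = {t_half:.1f} days")
print(f"activity after 12 days = {activity(2.0e5, 8, 12):.2e} Bq")
```

The half-life comes out very close to 8 days, and the activity after 12 days (one and a half half-lives) agrees with the value read from the graph to two significant figures.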
Exercise 7.2
1. The initial activity of a sample of a radioactive isotope decreases to $\frac{1}{16}$ of its initial value after 90 hours. Calculate the half-life of the isotope.
2. The graph below shows the variation with time t of the activity A of a sample of the isotope xenon-114. Use the graph to determine the half-life of xenon-114.

[Graph: activity A/MBq (scale 0 to 9) against time t/s (scale 0 to 70)]

7.3.1,2 ARTIFICIAL (INDUCED) TRANSMUTATION
So far, the only transmutation of elements discussed, i.e. the transformation of one element into another, has been that which takes place through natural radioactivity. In 1919 Rutherford discovered that when nitrogen gas is bombarded with α-particles, oxygen and protons are produced. He surmised that the following reaction takes place:

$^{4}_{2}\mathrm{He} + {}^{14}_{7}\mathrm{N} \rightarrow {}^{17}_{8}\mathrm{O} + {}^{1}_{1}\mathrm{H}$

After the discovery of this induced transformation, Rutherford, working in conjunction with Chadwick, succeeded in producing artificial transmutation of all the elements from boron to potassium (excluding carbon and oxygen) by bombarding them with α-particles.
Following the work of Curie and Joliot, extensive work was carried out into the production of artificial isotopes. It was found that neutrons are particularly effective in inducing artificial transmutation to produce artificial radioactive isotopes. Neutrons may be produced from the bombardment of beryllium with α-particles. The nuclear reaction equation for this is

$^{4}_{2}\mathrm{He} + {}^{9}_{4}\mathrm{Be} \rightarrow {}^{12}_{6}\mathrm{C} + {}^{1}_{0}\mathrm{n}$

A typical neutron reaction is the bombardment of lithium to produce the radioactive isotope of hydrogen called tritium. The nuclear reaction equation for this is

$^{6}_{3}\mathrm{Li} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{3}_{1}\mathrm{H} + {}^{4}_{2}\mathrm{He}$

For those of you studying Option I (Medical Physics), the importance of artificial isotopes in both therapy and diagnosis is discussed in detail there. It is left as an exercise for you to identify the bombarding particle in the following nuclear reactions:
1. $^{14}_{7}\mathrm{N} + \,?\, \rightarrow {}^{1}_{1}\mathrm{H} + {}^{14}_{6}\mathrm{C}$
2. $^{6}_{3}\mathrm{Li} + \,?\, \rightarrow {}^{4}_{2}\mathrm{He} + {}^{4}_{2}\mathrm{He}$
3. $^{205}_{81}\mathrm{Tl} + \,?\, \rightarrow {}^{206}_{82}\mathrm{Pb} + {}^{1}_{0}\mathrm{n}$

a mass of exactly 12 u. We know that 12 g of carbon (1 mole) has $6.02 \times 10^{23}$ nuclei. Therefore 1 u is equivalent to $\frac{1}{12}$th of $\frac{12}{6.02 \times 10^{23}}$ g $= 1.66 \times 10^{-24}$ g $= 1.66 \times 10^{-27}$ kg.

One consequence of Einstein's Special Theory of Relativity is that, in order for momentum to be conserved for observers in relative motion, the observer who considers him or herself to be at rest will observe that the mass of an object in the moving reference frame increases with the relative speed of the frames. One of the frames of reference might be a moving object. For example, if we measure the mass of an electron moving relative to us, the mass will be measured as being greater than the mass of the electron when it is at rest relative to us, the rest mass. However, an observer sitting on the electron will measure the rest mass. This leads to the famous Einstein equation

$E_{tot} = mc^2$

where $E_{tot}$ (often just written as E) is the total energy of the body, m is the measured mass and c is the speed of light in a vacuum. The total energy $E_{tot}$ consists of two parts, the rest-mass energy and the kinetic energy $E_K$ of the object. So in general it is written:

$E_{tot} = m_0c^2 + E_K$

where $m_0$ is the rest mass of the object. Nuclear reactions are often only concerned with the rest mass.

Essentially, the Einstein equation tells us that energy and mass are interchangeable. If, for example, it were possible to convert 1 kg of matter completely to energy, we would get $9 \times 10^{16}$ J of energy ($1 \times c^2$). Looking at it another way, if a coal-fired power station produces, say, 9 TJ of energy a day, then if you were to measure the mass of the coal used per day and then measure the mass of all the ash and fumes produced per day, you would find that the two masses would differ by about 0.1 g ($\frac{9 \times 10^{12}}{c^2}$ kg).

In table 701 we have seen that it is usual to express particle energies in electronvolt rather than joule. Using the Einstein relation, we can express particle mass in derived energy units.
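The two numerical illustrations of the Einstein relation given above (1 kg of matter, and a power station producing 9 TJ per day) can be checked directly. This is a small sketch; the function names are illustrative and the value of c is the standard one, assumed here to four significant figures.

```python
C = 2.998e8  # speed of light in a vacuum, in m/s (standard value, assumed here)


def energy_from_mass(mass_kg):
    """Energy in joules equivalent to a given mass, E = m c^2."""
    return mass_kg * C ** 2


def mass_from_energy(energy_j):
    """Mass in kilograms equivalent to a given energy, m = E / c^2."""
    return energy_j / C ** 2


e_one_kg = energy_from_mass(1.0)
coal_mass_g = mass_from_energy(9e12) * 1000   # 9 TJ, converted from kg to g

print(f"1 kg of matter  -> {e_one_kg:.1e} J")
print(f"9 TJ of energy  -> {coal_mass_g:.2f} g")
```

Both results agree with the figures in the text: about $9 \times 10^{16}$ J for 1 kg, and a mass difference of about 0.1 g for 9 TJ.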
For example, the atomic mass unit = $1.661 \times 10^{-27}$ kg, which is equivalent to $1.661 \times 10^{-27} \times (2.998 \times 10^{8})^2$ J or

$\frac{1.661 \times 10^{-27} \times (2.998 \times 10^{8})^2}{1.602 \times 10^{-19}}$ eV $\approx$ 931.5 MeV

Particle | kg | MeV c⁻²
electron ($m_e$) | $9.109 \times 10^{-31}$ | 0.5110
proton ($m_p$) | $1.673 \times 10^{-27}$ | 938.2
neutron ($m_n$) | $1.675 \times 10^{-27}$ | 939.6
Figure 710

Mass defect Q
Let us now examine a nuclear reaction using the idea of mass-energy conversion. For example, consider the decay of a nucleus of radium-226 into a nucleus of radon-222. The reaction equation is

$^{226}_{88}\mathrm{Ra} \rightarrow {}^{222}_{86}\mathrm{Rn} + {}^{4}_{2}\mathrm{He}$

mass of $^{226}_{88}\mathrm{Ra}$ = 226.0254 u
mass of $^{222}_{86}\mathrm{Rn}$ = 222.0175 u
mass of $^{4}_{2}\mathrm{He}$ = 4.0026 u

The right-hand side of the reaction equation differs in mass from the left-hand side by +0.0053 u. This mass deficiency, or mass defect as it is usually referred to, just as in the coal-fired power station mentioned above, represents the energy released in the reaction. Ignoring any recoil energy of the radon nucleus, this energy is the kinetic energy of the α-particle emitted in the decay. Using the conversion of units, we see that 0.0053 u is equivalent to about 4.94 MeV c⁻². This means that the kinetic energy of the α-particle is about 4.94 MeV.

The fact that the mass defect is positive indicates that energy is released in the reaction and the reaction will take place spontaneously. If the mass defect is negative in a reaction, then this means that energy must be supplied for the reaction to take place. For example, let us postulate the following reaction:

$^{23}_{11}\mathrm{Na} \rightarrow {}^{19}_{9}\mathrm{F} + {}^{4}_{2}\mathrm{He} + Q$

where Q is the mass defect. Factoring in the rest masses, the equation becomes

22.9897 u → 18.9984 u + 4.0026 u + Q

This gives Q = −0.0113 u ≈ −10.5 MeV c⁻². In other words, for such a reaction to take place about 10.5 MeV of energy must be supplied. $^{23}_{11}\mathrm{Na}$ is therefore not radioactive but a stable nuclide.

Binding energy
A very important quantity associated with a nuclear reaction is the nuclear binding energy. To understand this concept, suppose we add up the individual masses of the individual nucleons that comprise the helium nucleus. We then find that this sum does not equal the mass of the nucleus as a whole. This is shown below:

$2m_p + 2m_n \rightarrow {}^{4}_{2}\mathrm{He} + Q$

$(2 \times 938.2 + 2 \times 939.6)$ MeV c⁻² → 3728 MeV c⁻² + Q

To give Q = 27.6 MeV c⁻², or about 28 MeV c⁻². This effectively means that when a helium nucleus is assembled from nucleons, about 28 MeV of energy is released. Or, looking at it another way, about 28 MeV of energy is required to separate the nucleus into its individual nucleons, since if we postulate, as we did above for the decay of $^{23}_{11}\mathrm{Na}$, this reaction

$^{4}_{2}\mathrm{He} \rightarrow 2m_p + 2m_n + Q$

then Q ≈ −28 MeV c⁻².

The definition of nuclear binding energy is therefore either the energy required to separate the nucleus into its individual nucleons or the energy that would be released in assembling a nucleus from its individual nucleons. Since the potential energy of a nucleus is less than the potential energy of its separate nucleons, some texts take the binding energy to be a negative quantity. In this book, however, we will regard it as a positive quantity, on the basis that the greater the energy required to separate a nucleus into its nucleons, the greater the difference between the potential energy of the nucleus and its individual nucleons.
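The mass-defect arithmetic for the radium-226 decay, the postulated sodium-23 decay and the helium binding energy can be reproduced as follows. This is a sketch only: the masses and the conversion factor of 931.5 MeV c⁻² per u are those quoted in the text, while the function and variable names are illustrative.

```python
U_TO_MEV = 931.5  # 1 u expressed in MeV c^-2, as quoted in the text


def mass_defect(before, after):
    """Mass defect Q: total mass before the reaction minus total mass after it."""
    return sum(before) - sum(after)


# Radium-226 -> radon-222 + helium-4 (masses in u)
q_ra = mass_defect([226.0254], [222.0175, 4.0026])
# Postulated sodium-23 -> fluorine-19 + helium-4 (masses in u)
q_na = mass_defect([22.9897], [18.9984, 4.0026])
# Two protons + two neutrons -> helium nucleus (masses already in MeV c^-2)
q_he = mass_defect([938.2, 938.2, 939.6, 939.6], [3728])

print(f"Ra-226 decay: Q = {q_ra:+.4f} u = {q_ra * U_TO_MEV:+.2f} MeV")
print(f"Na-23 decay:  Q = {q_na:+.4f} u = {q_na * U_TO_MEV:+.1f} MeV")
print(f"He-4 binding energy: {q_he:.1f} MeV")
```

A positive Q means that energy is released and the reaction can proceed spontaneously; a negative Q means that energy would have to be supplied, which is why the sodium-23 decay does not occur.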
[Figure: binding energy graph, vertical scale 1 to 9, plotted against mass number (number of nucleons), horizontal scale 20 to 240, with $^{1}$H, $^{32}$S, $^{56}$Fe, $^{138}$Ba and $^{239}$U marked and the fusion and fission regions labelled]

1. Calculate the kinetic energy in MeV of the tritium plus the helium nucleus in the following nuclear reaction.

$^{6}_{3}\mathrm{Li} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{3}_{1}\mathrm{H} + {}^{4}_{2}\mathrm{He}$

Solution
Adding the masses of the left-hand and right-hand side of reaction equation gives:

Exercise 7.3 (a)
1. Calculate the energy required to separate a nucleus of lithium-6 into its constituent nucleons. Hence find the binding energy per nucleon of lithium-6.
2. Calculate the binding energy per nucleon of an α-particle.
3. Deduce whether the following reaction may take place spontaneously.

$^{212}_{83}\mathrm{Bi} \rightarrow {}^{208}_{81}\mathrm{Tl} + \alpha$

mass of $^{212}_{83}\mathrm{Bi}$ = 211.99127 u
mass of $^{208}_{81}\mathrm{Tl}$ = 207.98201 u

Fission
Nuclear reactions produce very much more energy per particle than do chemical reactions. For example, the oxidization of one carbon atom produces about 4 eV of energy whereas the decay of a uranium atom produces about 4 MeV. However, natural radioactive isotopes do not occur in sufficient quantity to be a practical source of energy. It was not until the discovery of nuclear fission that nuclear reactions became a possible cheap and abundant source of energy. In 1934 Fermi discovered that when uranium was bombarded with neutrons, radioactive products were produced. Then in 1939 Hahn and Strassmann showed that one of the radioactive products was barium (Z = 56). It is now understood that a nucleus of uranium may capture a neutron to form an unstable isotope. Either of the following reactions may occur:

$^{238}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{239}_{92}\mathrm{U} \rightarrow {}^{239}_{93}\mathrm{Np} + \mathrm{e}^{-}$

$^{238}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow \mathrm{X} + \mathrm{Y} + x\,{}^{1}_{0}\mathrm{n}$

where X and Y are two fission elements and x is the number of neutrons produced. Which reaction takes place is dependent on the energy of the bombarding neutron.
A typical fission reaction might be

$^{238}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{90}_{38}\mathrm{Sr} + {}^{146}_{54}\mathrm{Xe} + 3\,{}^{1}_{0}\mathrm{n}$

Given the following data, it is left as an exercise for you to show that the energy released in this reaction is about 160 MeV:

mass of $^{238}$U = 238.050788 u
mass of $^{90}$Sr = 89.907737 u
mass of $^{146}$Xe = 145.947750 u

The energy released appears in the form of kinetic energy of the fission nuclei and neutrons. The three neutrons produced are the key to using fission as a sustainable energy source, as discussed in topic 8.4. Both the strontium isotope and the xenon isotope produced are radioactive. Strontium-90 has a half-life of about 30 years, and therein lies the main problem (as well as the large amounts of γ-radiation also produced) with nuclear fission as a sustainable energy source: the fact that the fission nuclei are radioactive, often with relatively long half-lives. The isotope uranium-235 also undergoes fission, and much more readily than uranium-238. A typical fission reaction might be

$^{235}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{103}_{38}\mathrm{Sr} + {}^{131}_{54}\mathrm{Xe} + 2\,{}^{1}_{0}\mathrm{n}$

Fusion
Energy can also be obtained from nuclear reactions by arranging for two nuclei to fuse together, as we alluded to when we discussed nuclear binding energy above. To produce nuclear fusion, very high temperatures and pressures are needed so that nuclei can overcome the Coulomb repulsion force between them and thereby come under the influence of the strong nuclear force. A typical nuclear reaction might be

$^{2}_{1}\mathrm{H} + {}^{3}_{1}\mathrm{H} \rightarrow {}^{4}_{2}\mathrm{He} + {}^{1}_{0}\mathrm{n}$

In this reaction a nucleus of deuterium combines with a nucleus of tritium to form a nucleus of helium and a free neutron. Given the following data, it is left as an exercise for you to show that the energy released in this reaction is about 18 MeV:

mass of $^{2}$H = 2.014102 u
mass of $^{3}$H = 3.016049 u
mass of $^{4}$He = 4.002604 u

The energy released appears in the form of kinetic energy of the helium nucleus and neutron. The advantage that fusion has compared to fission as a source of sustainable energy is that no radioactive elements are produced. The disadvantage is obtaining and maintaining the high temperature and pressure needed to initiate fusion. Again, this is discussed in more detail in topic 8.4.

From the graph (Figure 711) we have that
total binding energy of $^{238}$U = 7.6 × 238
total binding energy of $^{90}$Sr = 8.7 × 90
total binding energy of $^{146}$Xe = … × 146

Hence the sum of the total binding energies of the fission nuclei is greater than the total binding energy of the uranium-238 nucleus. Effectively the system has become more stable by losing energy.

Similarly, for the fusion reaction $^{2}_{1}\mathrm{H} + {}^{3}_{1}\mathrm{H} \rightarrow {}^{4}_{2}\mathrm{He} + {}^{1}_{0}\mathrm{n}$, the total binding energy of the helium nucleus is greater than the sum of the binding energies of the tritium and deuterium nuclei. So again, as for fission, the system has effectively become more stable by losing energy.
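The two "left as an exercise" energy estimates in this section can be checked with the same mass-defect method. The sketch below makes two assumptions that are not stated in the text: the neutron mass is taken as 1.008665 u (a standard value), and the uranium-238 fission is taken to be U-238 + n → Sr-90 + Xe-146 + 3n, which is what the listed masses and the three neutrons mentioned above imply. Variable names are illustrative.

```python
U_TO_MEV = 931.5       # 1 u in MeV c^-2, as quoted in the text
M_NEUTRON = 1.008665   # neutron mass in u (standard value; an assumption, not given here)

# Fission: U-238 + n -> Sr-90 + Xe-146 + 3n (assumed form of the reaction)
fission_before = 238.050788 + M_NEUTRON
fission_after = 89.907737 + 145.947750 + 3 * M_NEUTRON
fission_mev = (fission_before - fission_after) * U_TO_MEV

# Fusion: H-2 + H-3 -> He-4 + n
fusion_before = 2.014102 + 3.016049
fusion_after = 4.002604 + M_NEUTRON
fusion_mev = (fusion_before - fusion_after) * U_TO_MEV

print(f"fission energy released: {fission_mev:.0f} MeV")
print(f"fusion energy released:  {fusion_mev:.1f} MeV")
```

Under these assumptions the fission value is of the order of 160 MeV and the fusion value is close to 18 MeV, consistent with the figures quoted. Note that the fusion reaction releases far more energy per nucleon, since only five nucleons are involved.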
For the complete cycle, the first two reactions must occur twice, and the final result is one helium nucleus, two positrons, two protons and two neutrinos. The protons are available for further fusion. In stars that are much more massive than our Sun, fusion of elements with higher atomic numbers takes place as they age, until finally iron is reached and no further fusion can take place, as seen from the binding energy graph. This evolution of stars is discussed in detail in Option E.

Exercise 7.3 (b)
1. Determine the number x of neutrons produced and calculate the energy released in the following fission reaction

$^{235}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{144}_{56}\mathrm{Ba} + {}^{90}_{36}\mathrm{Kr} + x\,{}^{1}_{0}\mathrm{n}$

$^{235}$U = 235.043929 u
$^{144}$Ba = 143.922952 u
$^{90}$Kr = 89.919516 u

2. Show that in the fusion cycle given in 7.3.10, the energy released is about 30 MeV.
INTRODUCTION
In Chapter 2, energy was defined as the capacity to do work. The various forms of energy can be classified as:
Mechanical (kinetic and potential)
Heat
Radiant (electromagnetic)
Chemical (potential)
Sound
Electrical/magnetic
Nuclear

Mechanical energy includes both kinetic and potential energy. Friction is regarded as mechanical energy as it is caused by the kinetic energy and potential energy of a body as a force is applied through a distance.

Heat energy is the energy a body possesses because of its internal energy due to the motion of the particles it contains.

Radiant energy is the source of all life on the Earth and is the greatest potential energy resource available for the future. When radiant energy in the form of light is absorbed by plants, it is converted into stored chemical energy. This energy is available as food or biomass. Biomass is used to produce fuels. Radiant energy is electromagnetic in nature and is possessed by all components of the electromagnetic spectrum (γ-rays, X-rays, UV radiation, visible light, IR radiation, microwaves and radio waves). Infra-red radiation falling on a body is converted into thermal energy. Solar energy is a form of radiant energy.

Chemical energy is the energy locked up in fuels and other chemicals. The energy obtained by the combustion of fuels represents a major source of energy in current use. All food that we eat is a store of chemical energy. It can be considered to be latent potential energy that a body possesses.

Sound energy is produced by longitudinal waves that have an organised and periodic pattern that causes the vibration of the particles in the same direction as the transfer of energy.

Electrical energy is the energy carried by moving charges, and these moving charges produce a magnetic field. Through the process of electromagnetic induction, electricity has become the most widely used form of energy in everyday life.

Nuclear energy is the potential binding energy released during a nuclear reaction when mass is converted to thermal and perhaps light energy. Nuclear fission reactors are starting to gain a renewed acceptance, and nuclear fusion has potential for the future when the technology becomes available.
A thermodynamic cycle is a process in which the system is returned to the same state from which it started. That is, the initial and final states are the same in the cyclic process. Figure 801 shows a series of schematic diagrams for the cycle of an internal combustion engine as used in most automobiles. With the exhaust valve closed, a mixture of petrol vapour and air is drawn into the combustion chamber through the inlet valve as the piston moves down during the intake stroke. Both valves are then closed and the piston moves up to squeeze the mixture of petrol vapour and air to a small fraction of its original volume during the compression stroke. With both valves closed, the mixture is ignited by a spark from the spark plug. The mixture burns rapidly and the hot gases then expand against the piston in the power stroke. The exhaust valve is opened as the piston moves upwards during the exhaust stroke, and the cycle begins again.
[Figure 801 The Internal Combustion Engine: four panels showing the intake stroke (intake valve open, exhaust valve closed, gas vapour and air mixture drawn in), the compression stroke (both valves closed), ignition and the power stroke (both valves closed) and the exhaust stroke (intake valve closed, exhaust valve open, spent fuel gases expelled), with the piston and crankshaft labelled]

For a cycle to do net work, thermal contact with the original heat reservoir must be broken, and temperatures other than that of the original heat reservoir must play a part in the process. In the above example, if the piston is returned

[Figure 802 The Otto cycle: cycle ABCD between volumes $V_2$ and $V_1$, showing adiabatic compression from A (minimum temperature) to B, heat input $Q_H$ at constant volume ($V_2$) from B to C (maximum temperature), adiabatic expansion from C to D, and heat output $Q_L$]

The fuel-air mixture enters the cylinder at point A. The compression AB is carried out rapidly with no heat exchange, making it an adiabatic compression. The ignition and combustion of the gases introduces a heat input $Q_H$ that raises the temperature at constant volume from B to C. The power stroke is an adiabatic expansion from C to D. Thermal energy $Q_L$ leaves the system during the exhaust stroke, and cooling occurs at constant volume from D to A. The net work is represented by the enclosed area ABCD.

Diesel engines use diesel instead of petrol, and there is no spark from a spark plug to cause ignition. They do not have a carburettor, which is used to produce a spray of the fuel-air mixture. Rather, the air is sucked in and the diesel is introduced through a valve when the piston is at the top of the compression stroke. During the adiabatic compression, the air is squeezed to one-sixteenth of its volume. This makes the air so hot that the fuel ignites of its own accord and explodes as soon as it enters through the valve. Diesel engines, with 40% efficiency, are amongst the most efficient engines used today. Figure 803 demonstrates the diesel cycle patented by Rudolf Diesel in 1892.

[Figure 803: cycle ABCD between volumes $V_2$ and $V_1$, showing adiabatic compression from A (minimum temperature) to B, heat input $Q_H$ at constant pressure from B to C (maximum temperature), adiabatic expansion from C to D, and heat output $Q_L$ at constant volume ($V_1$) from D to A]

Jet engines burn fuel continuously. They suck air in at the front of the engine and this air is compressed by the compressor fans. The air becomes so hot that the continuously supplied fuel burns in it. Exhaust gases are blown out of the back of the engine, propelling the engine forward. These gases also turn a turbine that supplies electricity to the jet and keeps the compressor fans turning.
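The statement that squeezing air to one-sixteenth of its volume makes it hot enough to ignite diesel can be illustrated with the adiabatic relation $TV^{\gamma - 1}$ = constant for an ideal gas. The sketch below is an estimate under stated assumptions: air is treated as an ideal gas with γ = 1.4, and the intake temperature of 300 K is a hypothetical value, not taken from the text.

```python
def adiabatic_temperature(t_initial_k, compression_ratio, gamma=1.4):
    """Final temperature (K) after adiabatic compression of an ideal gas.

    compression_ratio is initial volume / final volume; gamma = 1.4 is the
    usual value for air and is an assumption here.
    """
    return t_initial_k * compression_ratio ** (gamma - 1)


t_intake = 300.0                                 # hypothetical intake temperature in K
t_final = adiabatic_temperature(t_intake, 16)    # air squeezed to 1/16 of its volume
print(f"temperature after compression: {t_final:.0f} K")
```

With these assumptions the air reaches roughly three times its initial absolute temperature, which is why no spark plug is needed. A real engine loses some heat to the cylinder walls, so the actual temperature is somewhat lower.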
When energy is transferred from one form to other forms, the energy before the transformation is equal to the energy after (Law of conservation of energy). However, some of the energy after the transformation may be in a less useful form. We say that the energy has been degraded. For example, in a simple battery-operated flashlight, an energy input of 100 units of chemical potential energy will give a 10 unit output of light energy, and the light is enhanced by placing a curved mirror behind the lamp to concentrate it into a beam. The other 90 units of output are used in heating up the filament of the light bulb and in heating the battery and the surroundings. These 90 units of energy output have become degraded. The thermal energy that is transferred to the surroundings, the filament and the battery is no longer available to perform useful work.

The Second Law of Thermodynamics in one form states that engines cannot, even in theory, convert all of their energy input into useful work. The efficiency of an energy conversion process is the ratio of the useful energy output to the total energy input, usually expressed as a percentage. In practice, the efficiency is even lower than the theoretical value. Figure 804 gives examples of the efficiency attainable by some devices in their energy conversion process.

Chemical energy and electrical energy are considered to be high-grade energy because they can be converted to other forms of energy. However, there is a gradual degradation of the high-grade energy to low-grade energy in the operation of machines as the entropy (amount of disorder in a system) increases. It has become increasingly important that we explore renewable energy sources so that the energy demands of the future can be met with new high-grade energy.

From \ To | Mechanical | Electric | Radiant | Chemical | Thermal
Mechanical | 55% (wind power) | 99% (electric generator) | | | 100% (brakes)
Gravitational potential | 85% (water turbine) | 90% (hydro-electricity) | | |
Electric | 93% (electric motor) | | 40% (gas laser), 5% (fluorescent tube) | 72% (wet cell battery) |
Radiant | | 27% (solar cell) | | 0.6% (photosynthesis) |
Chemical | 45% (animal muscle) | 10% (dry cell battery) | 15% (chemical laser) | |
Thermal | 52% (steam turbine) | 7% (thermocouple) | | |
Figure 804 Efficiency of some energy conversion devices (table)

The efficiency of the simple torch shown in the Sankey diagram of Figure 805 is 5%. The efficiency of any system can be determined by using the relationship:

$\text{Efficiency} = \frac{\text{useful energy output}}{\text{total energy input}} \times 100\%$
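The efficiency relationship is simple enough to apply by hand, but a short sketch makes it easy to compare cases and to combine stages. The function names below are illustrative. The two-stage example (a steam turbine at 52% driving an electric generator at 99%) uses figures from the table and is an illustration of how efficiencies multiply, not a result stated in the text.

```python
def efficiency_percent(useful_output, total_input):
    """Efficiency = useful energy output / total energy input x 100%."""
    if total_input <= 0:
        raise ValueError("total energy input must be positive")
    return useful_output / total_input * 100.0


def chain_efficiency_percent(stage_percentages):
    """Overall efficiency of conversion stages connected in series."""
    overall = 1.0
    for percent in stage_percentages:
        overall *= percent / 100.0
    return overall * 100.0


flashlight = efficiency_percent(10, 100)          # 10 units of light from 100 units input
torch = efficiency_percent(5, 100)                # 5 J of light, 95 J of thermal energy
steam_plant = chain_efficiency_percent([52, 99])  # steam turbine, then generator

print(f"flashlight: {flashlight:.0f}%")
print(f"torch (Sankey diagram): {torch:.0f}%")
print(f"steam turbine + generator: {steam_plant:.1f}%")
```

Because stage efficiencies multiply, the overall efficiency of a chain can never exceed that of its least efficient stage.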
5 J light energy
95 J thermal energy
Figure 805 Sankey diagram for a torch In a Sankey diagram, the thickness of each arrow gives an indication of the scale of each energy transformation. The total energy before the energy transfer is equal to the total energy after the transfer otherwise the Law of conservation of energy would be violated. The problem is that once the thermal energy is transferred to the surroundings, it cannot be used to do useful work. Scientists are becoming more aware of this waste and there are many innovations being made in building designs to use some of this energy for heating purposes.
Figure 806 Sankey diagram for a coal-fired power station

Another useful energy transfer diagram is shown in Figure 807. The rectangles contain the different forms of energy, the circles show the conversion process, and the
areas, hydro-electricity is common as water stored in dams can be used to rotate turbines. By referring to the section on electromagnetic induction in Chapter 12 it can be deduced that a changing magnetic flux produces an induced electromotive force (e.m.f.). The rotating turbines contain coils of a conducting wire. When the coils are rotated in a magnetic field, the alternating current generator converts the kinetic energy into electrical energy. Alternating current is at present preferred to direct current because transformers can be used to step up and step down the voltage in the power grid. However, this is slowly changing in some countries. The turbines drive the alternators that produce three-phase electricity. Most generators use stationary electromagnets to provide the magnetic fields. They have a rotating armature with hundreds of coils of copper wire wound around an iron core. With more coils a greater induced e.m.f. can be produced. These coils are arranged in sets. By having a number of electromagnets and three sets of armature coils for each magnetic pole of the electromagnets, separated by an angle of 120°, three e.m.f.s can be produced during each revolution of the alternator. These three-phase generators are more energy efficient than a single-phase generator. Most power stations will have a number of generators with power ratings between 300 and 1000 MW. Each alternator can produce voltages as high as 25 kV. Step-up transformers increase the voltage to as high as 700 kV. This increased voltage results in a decreased current that reduces the heating losses in the power transmission. Additional transformers in the power grid further reduce energy losses and gradually lower the voltage to the required domestic or industrial level. Useful energy is lost due to eddy currents in the transformers in the form of thermal and sound energy. Thermal energy is also lost due to the current in the transmission cables.
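The effect of stepping up the voltage can be illustrated with a rough calculation of the heating loss in a cable, P = I²R with I = P/V. This is a simplified sketch only: it treats the line as a single conductor, and the 500 MW power and the 1 Ω cable resistance are hypothetical values chosen for illustration. Only the 25 kV and 700 kV voltages come from the text.

```python
def line_loss_watts(power_w, voltage_v, resistance_ohm):
    """Heating loss I^2 R in a cable carrying power_w at voltage_v."""
    current_a = power_w / voltage_v  # I = P / V
    return current_a ** 2 * resistance_ohm


POWER_W = 500e6            # hypothetical generator output (within 300 to 1000 MW)
LINE_RESISTANCE_OHM = 1.0  # hypothetical cable resistance

loss_25kv = line_loss_watts(POWER_W, 25e3, LINE_RESISTANCE_OHM)
loss_700kv = line_loss_watts(POWER_W, 700e3, LINE_RESISTANCE_OHM)
ratio = loss_25kv / loss_700kv

print(f"Loss at 25 kV:  {loss_25kv / 1e6:.0f} MW")
print(f"Loss at 700 kV: {loss_700kv / 1e6:.2f} MW")
print(f"Loss is reduced by a factor of about {ratio:.0f}")
```

Because the loss depends on the square of the current, raising the voltage by a factor of 28 reduces the heating loss by a factor of 28², whatever cable resistance is assumed.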
Alternating current is the preferred transmission type. However, more and more high voltage direct current (HVDC) transmission is occurring. There are advantages in both systems. For example, direct currents travel through more of the cross-sectional area of a conducting cable, whereas alternating currents tend to travel through the outer portion of the cable, a phenomenon known as the skin effect. Furthermore, three-phase ac requires multiples of 3 cables whereas dc only requires sets of 2. With ac there are also problems synchronising generating stations to run at the same frequency. With increased globalisation and the selling of commodities such as electric power between countries, dc does not have these problems.
Figure 807 Energy flow diagram for a coal-fired power station (diagram labels: 40 units electrical energy, AC generator, kinetic energy, steam turbine, friction, friction)

Heat energy is produced by the combustion of coal in a furnace. Liquid water absorbs the heat energy in a heat exchanger under pressure, and it is turned into steam. The steam contains latent potential energy as it has been converted from a liquid to a gas. Steam under pressure is capable of doing mechanical work to supply the rotational kinetic energy to turn the steam turbines. The turbine is coupled to the generator that produces electrical energy. Energy is lost to the surroundings at many stages. For example, if 100 units of energy are supplied from the primary energy source then only 40 units of useful energy are available. Most of the lost energy goes to water in the cooling towers as heat is evolved in the condensation component of the heat exchanger cycle. Other forms of energy loss are shown on the arrowed parts of the diagram. An oil-fired power station has a similar energy flow and efficiency to a coal-fired power station. However, a natural gas-fired power station is more efficient as it uses combined cycle gas turbines (CCGT). A jet engine is used in place of the turbine to turn the generator. Natural gas is used to power the jet engine, and the exhaust gases from the jet engine are used to produce steam, which is also used to turn a generator. These power stations can be up to 55% efficient. The production of the majority of electrical power involves the combustion of coal, natural gas and oil or the fission of uranium-235. It depends on the energy sources available to countries. For example, most coal-fired power stations are found close to the coal source. Countries like Japan rely heavily on the importation of coal and natural gas as there are no reserves of fossil fuels available. In mountainous
(Figure 807 diagram labels, continued: furnace, thermal energy, steam generator)
Exercise 8.1

1. The most efficient energy conversion occurs in
   A. tidal power stations
   B. diesel engines
   C. solar panels
   D. hydro-electric power stations

2. The original source of tidal power is the
   A. Moon
   B. Earth
   C. Sun
   D. water

3. Heat engines
   A. produce more work output than energy input
   B. take in thermal energy at a low temperature and exhaust it at high temperature
   C. convert heat into mechanical energy
   D. can be close to 100% efficient

4. All the following statements are correct EXCEPT
   A. generators convert mechanical energy into electrical energy
   B. nuclear reactors convert mass into energy
   C. chemical energy is a form of potential energy
   D. thermal energy and solar energy are the same

5. Two different objects that have different temperatures are in thermal contact with one another. It is the temperatures of the two objects that determine
   A. the amount of internal energy in each object
   B. the process by which thermal energy is transferred
   C. the specific heat capacity of each object
   D. the direction of transfer of thermal energy between the objects

6. A generator takes in an amount Ek of kinetic energy. An amount W of useful electrical energy is produced. An amount Q of thermal energy is lost due to the moving parts of the generator. The law of conservation of energy and the efficiency of the generator are given by which of the following?
   Column headings: Law of conservation of energy; Efficiency
   Entries for options A. B. C. D.: W; Ek = W + Q; W / Ek; Ek = W + Q; Ek = W Q; W/Q; W / (Ek Q); Ek = W - Q

7. The diagram below is a Sankey diagram for a typical oil-fired power station. It shows the useful electrical output after the energy has been transmitted to your home. Analyse the energy that is lost from the energy input to the final energy output. Determine the overall efficiency of the system.

8. Copy and complete the following table to show the energy conversions for various devices. (There could be more than one type of energy produced.)

   Device                  From    To
   Cigarette lighter
   Human body
   Microphone
   Car engine
   Light bulb
   Light emitting diode
   Refrigerator
   Stereo speaker
   Thermocouple
   Atomic bomb
there is little doubt that man's use of fossil fuels has contributed to the increased concentration of carbon dioxide in the atmosphere. There has been increased anxiety among many scientists, world leaders, environmentalists and world citizens as to the long term effects of the carbon dioxide increase. Many believe that this increase in concentration is increasing the temperature of the atmosphere and the oceans because of the enhanced greenhouse effect. The primary source of world energy has directly or indirectly had its origin in the radiant energy from the Sun. The Sun is a medium-size hydrogen/helium star that uses nuclear fusion to convert millions of tonnes of mass into energy each second. It has been in existence for about 4.6 billion years and it has enough hydrogen to last for about another 5 billion years. It has been providing 90% of the thermal energy needed to heat the Earth to a temperature comfortable enough for a diversity of living things to exist. For all of recorded history, the Sun has allowed plants and certain bacteria to convert solar energy into chemical energy stored in sugars and other carbon compounds. First-order consumers eat plants, and higher-order consumers eat both plants and other animals, to obtain the chemical energy necessary to survive. However, even today, up to 40% of the world's population does not get the minimum requirement of 8500 kJ of energy through food per day. Apart from the fact that the Sun produces 90% of the thermal energy needed to heat the Earth, it is also indirectly responsible for wind, ocean currents, wave action, water evaporation and precipitation, food, wood, biomass and the fossil fuels. Some of these are now being used for alternative energy sources such as hydroelectricity, wind power, biogas, passive solar energy panels and photovoltaic cells. Nuclear fission is the source of nuclear energy in reactors and geothermal plants.
Nuclear fusion reactors are being developed. The Moon's gravitation is the cause of tides and there are a few tidal power stations in operation. Chemical energy is also used for the supply of energy in batteries and fuel cells.
the coal strata were then subjected to folding which increases the heat and pressure, a metamorphic rock called anthracite is formed. At each stage in the rank advance, the coal has a higher carbon content and a higher energy content per unit mass. Figure 811 demonstrates the formation of different coals.
Figure 811 (diagram of the formation of different coals; label: Anthracite)
Swamps would have been good environments for the beginning of coal formation because in stagnant water very little oxygen is present. In a swampy environment, anaerobic bacteria attack plant matter and partially decompose it. Carbon becomes concentrated in the remains. The bacteria gradually decrease, as they are killed by the poisonous acids produced in the decomposition process. At this stage, the plant matter is converted to peat. Peat is a brownish material that looks like wood. Although it can be burnt as a fuel, it contains a lot of water, and is very smoky when burnt. As the peat became buried beneath more plant matter, the pressure and temperature increased and the water was squeezed out of it. At a later stage in geological history, the swamp was covered rapidly with a sedimentary layer of rock. As the material became compacted the carbon content increased and the peat converted to lignite, then to sub-bituminous coal and finally bituminous coal. If
Crude oil and natural gas are products of the decomposition of marine plants and animals that were rapidly buried in sedimentary basins where there was a lack of oxygen. The organic material was laid down in a body of stagnant water where the presence of the depositing organic matter created an acid environment. The organic matter was quickly buried beneath mud. The source rocks were buried under a sufficient cover of overlying strata so that the conditions were right for the conversion of the organic matter in the source rock into hydrocarbons and other organic compounds. When the hydrocarbons and other organic compounds were generated, they were dispersed as individual molecules in the source rock. With increased heat, pressure and Earth movement, these molecules migrated into other porous, permeable reservoir rocks. With further heat and pressure, these reservoir rocks became trapped between impervious (non-porous) cap rocks. The oil and gas in the reservoir rocks also became trapped, with the less dense gas rising to the top and the more dense liquid and solid crude oil sinking to the bottom of the trapped reserve. Figure 812 shows the accumulation in a reservoir rock.
Figure 812 Accumulation of oil in reservoir rock (diagram labels: oil flow, shale or limestone, gas, oil, sandstone reservoir rocks, cap rock)

A borehole is drilled into the rock containing hydrocarbons using diamond core bits, and the reservoir is opened to the atmosphere. After the release of the natural gas, the crude oil is brought to the surface. If the pressure is not great enough or the liquid is too viscous to flow, the crude oil can be pumped to the surface. The crude oil is separated into fractions in a fractionating column using fractional distillation. The crude oil is heated to about 400 °C to produce a hot liquid/vapour mixture. The lower boiling point components rise higher up in the fractionating column where they are condensed. The higher boiling point components are removed lower in the fractionating column. Crude oil is found as a liquid or a vapour although some oil can exist as a solid. The solid/liquid components are known as crude oil and the gaseous component is known as natural gas. Although hydrocarbons have been found in rocks formed more than a billion years ago, it is thought that most reserves were formed less than 500 million years ago. Some formed as recently as 10 million years ago, and it is believed that some is being formed today. Oil shale and oil tar deposits make up less than 2% of the world's fossil reserves. Oil shale contains a complex solid hydrocarbon material called kerogen. It is found in a fine-grained sedimentary rock called marl. Exploration and refining of this material began during the 1980s when oil prices escalated. It requires a high capital outlay to produce the fuel from the kerogen and at this stage it is not considered viable for economic reasons. Tar sands are deposits that contain a tar-like material called bitumen. They are found in the same geographic areas as oil. Bitumen can be pyrolysed to produce crude oil. The extraction and refining
Figure 813 (diagram labels: fuel, biomass)
Energy density of a fuel = chemical potential energy / mass

It is measured in joules per gram (J g⁻¹). Bomb calorimetry is used to determine the value and this technique requires only small masses of a sample. However, the joule is a small quantity and therefore it is more common to use bigger units such as megajoules per kilogram (MJ kg⁻¹). The energy density of coal is not the only consideration that has to be taken into account. Its rank, chemical composition and heating ability are other important factors (refer to Figure 815). Coals of different rank will produce differing amounts of heat for a given mass. The grade of a sample of coal does not always indicate the chemical composition of that coal. However, with rank advance the carbon content generally increases: peat → lignite (brown coal) → sub-bituminous coal → bituminous coal → anthracite. The chemical composition of the coal is defined in terms of the analysis of its moisture content, volatile matter (the percentage of the coal that is lost as vapours when heated in the absence of air), ash and fixed carbon. Figure 815 lists some typical moisture values of some ranks of coal.

Rank of coal       Percentage moisture content
Peat               75-80
Lignite            50-70
Bituminous coal    5-10
Anthracite         2-5

(Other values in the extracted table, with no recoverable column heading: 20 27 22 10 33 20)

Figure 815 Different coals
Figure 814 gives some approximate energy density values for some fuels. When solving examination questions please consult the table that is provided in the IB Data Booklet.

Type of fuel                      Gravimetric energy density (MJ kg⁻¹)    Volumetric energy density (MJ dm⁻³)
Wood                              15
Coal                              24
Crude oil                         42
LPG (liquefied petroleum gas)     34.5
Compressed natural gas            55-56
Aviation fuel                     43
Ethanol                           29.6
Plant biomass                     18
Lead-acid battery                 0.1
Nuclear fission of U-235          300 000 000
Nuclear fusion                    90 000 000
If the coal is dried and analysed then the percentage composition of carbon, hydrogen, oxygen and volatile matter changes. Figure 816 lists the approximate percentage composition by mass of different ranks of coal after they have been dried.

Percentage by mass    Peat     Lignite    Bituminous coal    Anthracite
Carbon                50-60    60-75      80-90              90-95
Oxygen                35-40    20-30      10-15              2-3
Hydrogen              5-6      5-6        4-5                2-3
Volatile matter       60-65    45-55      20-40              5-7

Figure 816 Composition of coal
Because of the high proportion of volatile matter in peat and lignite, they cannot be transported over long distances because of safety risks. Usually, a power station will be placed where the mine is located. Elemental analysis of the nitrogen and sulfur within the coal will also determine the choice between various coals. The coal in some countries has a high sulfur content and this greatly affects air quality in some areas. On the other hand, Australian bituminous coal and anthracite are low in sulfur and have become a top export fuel to Asia.
Example 1

A sample of lignite has a moisture content of 65%.

(i) Determine how much water is in a 10 tonne sample of this coal before crushing and drying.

(ii) Explain how the moisture content will reduce the amount of heat that can be obtained from combustion of the coal.

Solution

(i) There is 65% moisture content. Therefore, the amount of water will be 0.65 × 10 000 kg = 6 500 kg.

(ii) The more water present, the less coal there is to burn and some of the heat released has to be used to heat and evaporate the water.

Example 2

A sample of anthracite has a moisture content of 5% and when dried it has an energy density of 35 kJ g⁻¹. Assuming that during coal combustion the temperature of the water in the coal is raised from 20 °C to 100 °C and then vaporised at 100 °C, estimate the energy density of the coal as it is mined.

Solution

Energy required to convert the water to steam = mcΔT + mLv

Since there is 5% moisture content, there is 5 g of water per 100 g of coal.

For 5 grams, Q = 5 g × 4.18 J g⁻¹ K⁻¹ × (100 °C − 20 °C) + 5 g × (22.5 × 10² J g⁻¹) = 12 922 J ≈ 13 kJ

The energy in the other 95 grams of anthracite is 35 kJ g⁻¹ × 95 g = 3 325 kJ

The energy available from 100 g of coal as mined = 3 325 kJ − 13 kJ = 3 312 kJ

For one gram this would be 33.12 kJ.

Figure 817 (bar chart; vertical axis: Percentage, 0 to 35; categories: Biomass and others, Coal, Natural Gas, Nuclear, Oil, Hydroelectric)

The 1% for biomass and others includes biomass, solar, geothermal, wind power and other alternatives. The percentage use represents all uses of the fuels. Much of the crude oil and natural gas is used by the petrochemical industry for the manufacture of plastics, pharmaceuticals and many more synthetic materials. These two fuels are also used as transportation fuels. The next biggest use is for the production of electricity. Each country in the world has different resources and as such these percentages will differ from country to country.
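Returning to Example 2 above, the estimate of the energy density of anthracite as mined can be reproduced numerically. The sketch below uses the same values as the worked solution (4.18 J g⁻¹ K⁻¹ and 22.5 × 10² J g⁻¹); the function and variable names are assumptions made here for illustration.

```python
SPECIFIC_HEAT_WATER = 4.18         # J g^-1 K^-1, value used in Example 2
LATENT_HEAT_VAPORISATION = 22.5e2  # J g^-1, value used in Example 2


def mined_energy_density(dry_kj_per_g, moisture_fraction,
                         t_start_c=20.0, t_boil_c=100.0, sample_g=100.0):
    """Energy density (kJ per gram) of coal as mined, allowing for its moisture."""
    water_g = moisture_fraction * sample_g
    coal_g = sample_g - water_g
    # Energy used to heat the water to boiling and then vaporise it (in joules).
    heat_water_j = (water_g * SPECIFIC_HEAT_WATER * (t_boil_c - t_start_c)
                    + water_g * LATENT_HEAT_VAPORISATION)
    net_kj = coal_g * dry_kj_per_g - heat_water_j / 1000.0
    return net_kj / sample_g


anthracite = mined_energy_density(dry_kj_per_g=35.0, moisture_fraction=0.05)
print(f"Anthracite as mined: {anthracite:.2f} kJ per gram")
```

The small difference from the dried value of 35 kJ g⁻¹ reflects the low moisture content of anthracite. Calling the function with a larger moisture fraction, such as the 65% quoted for lignite in Example 1, shows how strongly water reduces the heat obtained.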
Chile. In many places the underground temperature is greater than 150 °C at depths of less than 3 kilometres and there is commonly circulation of underground water. The steam that is produced is piped under pressure to the surface to drive turbines and thus generate electricity. Geothermal power has great potential for many countries that have hot beds of rocks and an artesian water supply. The disadvantages of geothermal power include the release of polluting gases such as sulfur dioxide and hydrogen sulfide into the atmosphere, and groundwater pollution by chemicals including heavy metals.

The main advantage of tidal power is that there is no pollution and the energy is renewable. The main disadvantage is that there are few areas in the world that have the necessary tidal range. The construction costs are high. Because high tides occur approximately 50 minutes later each day, it is difficult to meet electricity demand during some peak times. There are also long construction times and high capital intensity, and these factors are likely to rule out significant cost reductions in the near term.

Solar energy was neglected as an energy resource but today there is renewed interest because of its many advantages. It is a means of using a free, renewable energy resource. It is available to some extent everywhere in the world, unlike fossil fuels and nuclear power. It is exempt from rising energy prices. Few environmental problems are created. It can be used in a variety of energy transformations for heating, cooling, electricity, transportation, lighting and mechanical power. Its disadvantages are also evident. It produces a small energy output per unit surface area of the cell being used. For large-scale production, it requires thousands of mirrors or cells that take up a large area of land. It is intermittent, with its output being interrupted by night and clouds. It is relatively expensive to set up and thus requires many years before the investment is returned.

Wind power is cheap, clean, renewable and infinite. It can be used to provide electricity to remote areas of the world. However, winds resulting from the heating of the Earth are somewhat unpredictable. The initial set-up costs are high, the structures suffer from metal fatigue and they are noisy. The better option being considered today is to use wind turbines in association with another power source. One possibility is to combine solar power and wind power. Usually, when it is sunny there is little wind but when it is overcast there is more wind. Another possibility is to use the wind to pump water into high dams associated with hydro-electric power stations and then to run the water downhill during periods of high electricity demand.
Figure 818 Past and predicted world population growth (graph; horizontal axis: Year)

Consumption of natural gas, crude oil and coal has increased dramatically to meet the demands of the population. The consumption of oil since 1970 has been more than the total consumption of the previous 200 years, and half the consumption of all fossil fuels has occurred in the past 50 years. Again, the industrialised countries are the biggest consumers. For example, the USA with 5% of the world's population uses 33% of the world's total energy whereas India with 15% of the world's population only consumes 1% of its total energy. More energy is used in the USA for air-conditioning than is used by 1 billion Chinese for all their energy purposes. Coal is the most abundant fossil fuel in the world and at this stage, we have consumed less than 10% of the original supply. Coal will last another 400 to 600 years at the present rate of consumption. Figure 819 demonstrates two estimated coal production/consumption curves at a production rate given in billions of tonnes per year. Each tonne of coal is equivalent to 10¹⁰ J of energy.
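The conversion from a coal production rate to an energy rate can be sketched as follows. The figure of 10¹⁰ J per tonne is taken from the text; the production rate of 10 × 10⁹ tonnes per year is only an illustrative value of the size marked on the production rate axis, not a reading from the curves, and the function names are assumptions.

```python
JOULES_PER_TONNE_COAL = 1e10             # energy equivalent quoted in the text
SECONDS_PER_YEAR = 365.25 * 24 * 3600    # length of a year in seconds


def coal_energy_joules(tonnes):
    """Energy equivalent of a mass of coal given in tonnes."""
    return tonnes * JOULES_PER_TONNE_COAL


def average_power_watts(tonnes_per_year):
    """Average power corresponding to a steady yearly production rate."""
    return coal_energy_joules(tonnes_per_year) / SECONDS_PER_YEAR


illustrative_rate = 10e9  # tonnes per year, hypothetical value for illustration
annual_energy_j = coal_energy_joules(illustrative_rate)
average_power_w = average_power_watts(illustrative_rate)

print(f"Energy per year: {annual_energy_j:.1e} J")
print(f"Average power:   {average_power_w:.2e} W")
```

Expressing the rate as an average power makes it easier to compare coal production with the output of individual power stations, which is usually quoted in megawatts.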
Figure 819 (graph; vertical axis: production rate in 10⁹ metric tonnes per year, 10 to 30; horizontal axis: year, 1800 to 2800)
The proven recoverable reserves of coal in the world were estimated to be 1 078 734 million tonnes in 1990. These reserves are found on every continent and this is the reason for its wide use since early times. Figure 820 gives an estimate of the coal reserves in various countries.

Country           Reserve percentage
USA               24
Former USSR*      22
China             15
Australia         8
Germany           7
India             6
South Africa      5
Poland            4
Canada            2
Others            7

(* Countries in the former USSR include Armenia, Azerbaijan, Belarus, Estonia, Georgia, Kazakhstan, Kyrgyzstan, Latvia, Lithuania, Moldova, Russia, Tajikistan, Turkmenistan, Ukraine and Uzbekistan.)

Figure 820 Coal reserves in the world (table)

We have already consumed 30% of the original supply of oil and natural gas and it is expected that the supply of these non-renewable sources will only last another 40 to 80 years. Figure 821 shows two estimated oil production/consumption curves at a production rate of 10⁹ barrels per year. (We do not get oil in barrels but this is the commodity unit of sale in business markets.) Each barrel of oil is equivalent to 5.9 × 10⁹ J of energy.

Figure 821 Estimated world production of crude oil (graph; vertical axis ticks: 10 to 40; horizontal axis: year, 1900 to 2100; annotation: 80% in 58 years)

The proven recoverable reserves of oil in the world were estimated to be 136 500 million tonnes in 1990. These reserves are found mainly in the Middle East and South America. Figure 822 gives an estimate of the oil reserves in various countries.

Country           Reserve percentage
Saudi Arabia      25
Iraq              10
UAE               10
Kuwait            10
Iran              9
Former USSR*      6
Venezuela         6
Mexico            5
USA               3
China             2
Libya             2
Nigeria           2
Others            10

Figure 822 Oil reserves in the world

The proven recoverable reserves of gas are more widely spread throughout the world and natural gas has become the preferred fuel in many of the modern power stations. Figure 823 gives a percentage estimate of the gas reserves in various countries.

Country           Reserve percentage
Former USSR*      38
Iran              14
UAE               4
Saudi Arabia      4
USA               4
Qatar             4
Algeria           3
Venezuela         3
Canada            2
Iraq              2
Nigeria           2
Indonesia         2
Australia         2
Others            16

Figure 823 Gas reserves in the world

With the advent of the Industrial Revolution in the 1750s, the steam engine and later the internal combustion engine became the principal sources of power. At their best steam engines were only 8% efficient. Large amounts of energy were needed to make and run machinery, to make roads and railway lines, and to service cities as the rural population flocked to the cities for employment. In the steam engine, water is heated in a boiler heat reservoir (high temperature) to produce steam. The steam passes into an open intake valve where it expands, causing a piston to move outwards. As the piston returns to its original position, the intake valve is closed and the exhaust
(Diagram labels: piston valve, heat)
With the steam engine came the opportunity to transport agricultural products from the country to cities and industrial products to the country and other cities. The Industrial Revolution also saw the production of large amounts of steel from iron ore and coke (a strong porous solid composed mainly of carbon). Therefore, the advent of industrialisation saw a marked increase in the rate of energy usage, which led to industries being established close to the source of the fuel in order to cut down on transportation costs. Coal is a versatile fuel that can be used directly as a solid fuel or modified to coke, processed to liquid and gaseous fuels or manufactured into a wide range of chemicals. Every tonne of a 60% iron ore requires one tonne of coke. Before electricity was produced following the discovery of electromagnetic induction in 1831, town gas was produced for industry and homes to supply heat energy for cooking, for warming houses and for lighting. Those countries that were successful in modernising usually centred their manufacturing industries close to coal mines. This is especially true of the Ruhrgebiet in Germany and the coal mines of the USA, China, Australia, the United Kingdom and Russia. The steam engine and the steam turbine are examples of external combustion engines. The fuel is burnt outside the engine and the thermal energy is transferred to a piston or a turbine chamber by means of steam.
Figure 825 lists typical energy density values of the major fuels used in fossil-fuelled power stations as well as petrol.

Energy density (coals when dried)

Fuel               Solid or liquid (MJ kg⁻¹)    Solid or liquid (MJ dm⁻³)    Gas (MJ dm⁻³)
natural gas        55-56                        23-24                        0.038-0.039
propane            50.0                         25.4                         0.093
butane             49.5                         28.7                         0.124
gasoline           56.5                         45-55
peat               25
lignite            25-30
bituminous coal    30-35
anthracite         35-38

Figure 825 Typical energy densities for dried coals and other fuels (table)

Because of the higher energy density of natural gas, it is becoming the preferred choice of the new power stations that are coming into production. Natural gas is also cleaner and less polluting than coals and oils.

Example

A coal-fired power station burns coal with 50% moisture content. The composition of the dried sample is found on analysis to be 72% carbon, 5% hydrogen and 23% oxygen. If 500 tonnes is burnt hourly:

(a) Estimate the mass of water vapour emitted from the cooling towers each hour.

(b) Estimate the mass of water vapour produced in a week.

(c) Estimate the volume of condensed water vapour produced in a week.

Solution

(a) Assuming the hydrogen and oxygen are converted to steam, the total amount of steam = 50% + 28% of the remaining 50% = 64%. 64% of 500 tonnes = 320 tonnes.

(b) 320 tonnes per hour × 24 hours × 7 days = 53 760 tonnes ≈ 5.4 × 10⁴ tonnes.

(c) 1 tonne = 1000 kg and 1 dm³ of water has a mass of 1 kg, so 5.4 × 10⁴ tonnes × 1000 = 5.4 × 10⁷ dm³.

Example

A 250 MW coal-fired power station burns coal with an energy density of 35 MJ kg⁻¹. Water enters the cooling tower at a temperature of 293 K and leaves at a temperature of 350 K, and the water flows through the cooling tower at the rate of 4200 kg s⁻¹.

(a) Calculate the energy removed by the water each second.

(b) Calculate the energy produced by the combustion of coal each second.

(c) Calculate the overall efficiency of the power station.

(d) Calculate the mass of coal burnt each second.

Solution

(a) Q = mcΔT = 4200 kg × 4180 J kg⁻¹ K⁻¹ × (350 − 293) K = 1.000692 × 10⁹ J ≈ 1 × 10⁹ J per second = 1000 MW.

(b) The answer is 1250 MW. 1000 MW is degraded in the cooling tower and there is 250 MW output as electrical energy. Therefore, the energy produced by combustion of coal = 1000 + 250 = 1250 MW.

(c) Efficiency = (useful energy output / total energy input) × 100% = (250 / 1250) × 100% = 20%

(d) Mass of coal burnt each second = (1250 × 10⁶) / (35 × 10⁶) = 35.7 kg
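The 250 MW power station example above can be checked with a few lines of code. This is a sketch using the values given in the example; the variable names are assumptions, and unlike the worked solution it does not round the cooling power to 1000 MW, so the results differ slightly in the later digits.

```python
SPECIFIC_HEAT_WATER = 4180.0  # J kg^-1 K^-1, value used in the example


def cooling_power_watts(flow_kg_per_s, t_in_k, t_out_k):
    """Energy removed by the cooling water each second (Q = m c dT per second)."""
    return flow_kg_per_s * SPECIFIC_HEAT_WATER * (t_out_k - t_in_k)


electrical_output_w = 250e6    # useful electrical output
coal_energy_density = 35e6     # J kg^-1

cooling_w = cooling_power_watts(4200.0, 293.0, 350.0)        # part (a)
combustion_w = cooling_w + electrical_output_w               # part (b)
efficiency = electrical_output_w / combustion_w * 100.0      # part (c)
coal_rate_kg_per_s = combustion_w / coal_energy_density      # part (d)

print(f"(a) Energy removed by water: {cooling_w / 1e6:.0f} MW")
print(f"(b) Energy from combustion:  {combustion_w / 1e6:.0f} MW")
print(f"(c) Overall efficiency:      {efficiency:.0f}%")
print(f"(d) Coal burnt each second:  {coal_rate_kg_per_s:.1f} kg")
```

Part (b) relies on the assumption made in the worked solution that all of the energy not converted to electricity is carried away by the cooling water.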
Transportation and storage of fossil fuels can be undertaken using pipelines, railroads, trucks and ships. Natural gas is usually transported and stored in pipelines although exports of LNG are shipped in pressurised containers. Pipelines are a cost-effective means of distributing natural gas and there are many agreements between countries that pipe their gas to other countries, as is the case with the vast reserves of the Russian Federation. The disadvantages of piping gas include unsightly pipelines, possible leakage, explosions, governments holding other countries to ransom when they do not conform on political issues, and possible terrorist activities. Many oil refineries are located near the sea close to large cities so that the labour force and social infrastructure are available, and so that crude oil and its fractions can be imported or exported by ship to destinations throughout the world. The biggest disadvantage of shipping has proven to be the oil slicks that have threatened wildlife due to sinking and leaking ships. At the refineries, great care has to be taken in storing the crude oil fractions in tanks that have containment walls to protect against leakage or explosion. Pipelines are common in many countries. However, they are unsightly and they are open to terrorist attack, as has been the case in Iraq, Nigeria and Kuwait in recent times. Likewise, the transportation of petrol and diesel from the cities to the outlying provinces also has its associated hazards of leakages, transport accidents and explosions.
Power stations are becoming more efficient and many conventional ones have improved their efficiency by 5 to 10 percent over the last 30 years. This is mainly due to a better cooling water source allowing for a greater temperature difference between the hot and cold reservoirs. A power station in Denmark obtains 45% efficiency, the world's highest for an operational, single cycle, large coal-fired plant. Conventional coal-fired power stations can obtain efficiencies in the 33-39 percent range. Gas-fired power stations are in the 33-46 percent efficiency range. The load factor is an indication of electricity used by consumers and it is calculated by dividing the average load by the peak load over a certain period of time. Residential homes tend to have low load factors because people only use electricity during certain hours of the day. Industrial consumers will have high load factors because they operate throughout the day and night. The highest efficiencies are being obtained in combined cycle plants and in cogeneration or CHP (combined heat and power) plants. In a combined cycle plant, surplus heat from a gas turbine is used to produce steam which in turn drives a steam turbine. Efficiencies of over 50% can be achieved. Furthermore, if the plant's waste heat is utilised for the heating of houses or an industrial process, efficiencies of 80% are achievable.
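The load factor defined above (average load divided by peak load over a period) can be illustrated with a short calculation. The hourly load profiles below are hypothetical values invented for illustration, not measured data, and the function name is an assumption.

```python
def load_factor(loads):
    """Average load divided by peak load over the period covered by loads."""
    if not loads:
        raise ValueError("at least one load reading is required")
    peak = max(loads)
    if peak <= 0:
        raise ValueError("peak load must be positive")
    average = sum(loads) / len(loads)
    return average / peak


# Hypothetical hourly loads (kW) over one day.
household_kw = [0.5] * 18 + [3.0] * 6     # low use most of the day, evening peak
factory_kw = [90.0] * 12 + [100.0] * 12   # near-constant use day and night

household_lf = load_factor(household_kw)
factory_lf = load_factor(factory_kw)

print(f"Household load factor: {household_lf:.3f}")
print(f"Factory load factor:   {factory_lf:.2f}")
```

With these invented profiles the household has a much lower load factor than the factory, which matches the qualitative statement in the text about residential and industrial consumers.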
CORE
CHAPTER 8
surveys, core sampling for botanical microorganisms, drill prospecting, onshore and offshore engineering and construction of production facilities before a substantial sedimentary basin reserve comes into production, at a cost of around 10 000 million dollars. If the sedimentary basin looks promising then a drilling program begins. Oil formations are usually drilled using mud-rotary and diamond core drilling. The mud-rotary method uses a tri-cone bit with circulating water and clay, which stops the hole in the relatively soft sedimentary rock from collapsing. The geologists get information from these holes by examining the rock chips, called cuttings, or by lowering geophysical instruments down the hole. Some mud-rotary holes for oil exploration can be as deep as 9 kilometres and may cost in the vicinity of 2 million dollars per hole. In diamond core drilling, a drill bit set with small diamonds can drill through very hard rocks. A cylindrical core of the rock formations is brought to the surface and information can be gathered from it. There are onshore and offshore reserves in basins, usually at a depth of between 2 and 6 kilometres underground. Offshore drilling is very expensive, and therefore once a drilling platform has been anchored in the sea, between 10 and 30 wells are drilled using a technique known as directional drilling. The holes reach out from the platform to the target areas. The platforms have to be a safe work environment for up to 80 workers whose job it is to drill holes, supervise production, maintain the platform, and provide food and accommodation. The platforms also need space for a helipad, cranes, communication dishes and towers, and safety capsules. The gas and crude oil are brought to the surface by natural water drive or by artificial means. The water/oil mixture is passed through a separator, and the water, sand and gases in solution are removed. The oil is pumped through pipelines under high pressure to sea terminals and refineries.
Natural gas is mainly methane but it also contains ethane, propane, butane, pentane and impurities of sulfur and nitrogen compounds. The impurities have to be removed to give the natural gas a marketable value. Offshore exploration has little effect on the environment. Dynamite is no longer used to cause shock waves for seismic exploration, and modern air guns have little harmful effect on marine life. The greatest environmental damage can be caused by oil blowouts, which occur when the pressure in the reservoir exceeds the pressure of the mud column in the drill hole.
The recovery of coal from underground mines is dangerous and more expensive than open pit and strip mining. The mines have to be ventilated so that poisonous and flammable gases and particulate matter are removed, and all the water has to be pumped to the surface. There are many accidents due to fires and collapsing rocks. There is also a risk of black lung disease to the miners due to particulate matter accumulating in their lungs. Not all of the coal can be removed, as coal pillars are left to support the roof of the coal seam and stop the mine collapsing. Further support is provided by timber, steel, concrete or roof bolts as shown in Figure 826. In many discontinued mines, roofs have collapsed, causing land subsidence at the surface, and houses and other buildings can become structurally unsafe.
[Figure 826 labels: upper rock strata; ceiling bolts; coal pillar; coal seam]
Figure 826
When coal seams are close to the surface, the topsoil is removed and the coal is mined in open cut pits. In strip mining, the overburden of rock strata and soil is removed and piled on the previously mined area, and mining proceeds in a series of strips across the coal seam. Most of the coal can be extracted and the cost is lower than for underground mining. Lower grade coals can be used. If the land is not reclaimed within a short period then water erosion can occur, and sulfur deposits can react with the water to form acid. Reclamation of the land can be undertaken by covering the ugly area with topsoil and planting trees and grass over the area. Some power plants use the ash from coal combustion to fill the area. However, the ash combines with water to form alkalis. It has also become popular to use compacted garbage to fill the land. Crude oil and natural gas recovery is very expensive when compared to coal mining. It can take 30 years of geological mapping, geophysical seismic exploration, geochemical
[Figure 827 labels: 8200 tonnes of air (75.6% $\mathrm{N_2}$, 23.1% $\mathrm{O_2}$, 1.2% Ar, 0.1% $\mathrm{CO_2}$); 1000 tonnes of coal; furnace]
Oxides of nitrogen are formed either from the combustion of carbon compounds containing nitrogen or from the reaction at high temperature between nitrogen and oxygen in the air. These oxides of nitrogen can also be an environmental hazard. The reactions of oxides of nitrogen and hydrocarbons can produce photochemical smog. In the presence of ultraviolet radiation, nitrogen dioxide decomposes to give an oxygen free radical that can react with oxygen molecules to form ozone, which is harmful to plants and animals. Power stations and factories produce particulate matter (small dust particles or ash) in the form of silica and metallic oxides, silicates and sulfates. By the use of electrostatics, much of this matter can be removed so that it does not enter the atmosphere, and the removed matter can be recycled to produce construction materials such as bricks. Figure 828 shows the basics of a typical electrostatic precipitator.
[Figure 828 labels: chimney; positively charged mesh; metal plates]
Figure 827 Hourly use of reactants and products of a coal power station
Pollutants are substances that have undesirable effects on living things and property. Air pollution occurs when these pollutants are introduced into the atmosphere. Carbon dioxide and carbon monoxide are the major pollutants introduced into the air by fossil-fuelled power stations. Carbon monoxide is a poisonous gas that reduces the ability of the blood to transport oxygen from the lungs to the cells in the body. If the carbon monoxide is in a high enough concentration, the haemoglobin reacts with the carbon monoxide rather than the oxygen, and poisoning or death can occur. Luckily, most of the carbon monoxide is quickly removed from the atmosphere by soil bacteria. Carbon dioxide, as already noted, is increasing in the atmosphere, and it is believed that more infra-red radiation is being trapped in the atmosphere as a result. Through the enhanced greenhouse effect, the temperature of the lower atmosphere could increase by several degrees, and this will contribute to global warming. Sulfur dioxide results from the combustion of carbon compounds containing sulfur and the oxidation of metal sulfides in the coal. Attempts are made to minimise sulfur gases in a scrubber. Sulfur dioxide can combine with water in the atmosphere to form sulfurous acid ($\mathrm{H_2SO_3}$), a mildly acidic solution that falls as acid rain. Many plants are sensitive to sulfur dioxide as it reduces the production of chlorophyll and their leaves turn yellow. At higher concentrations it can cause plants and trees to dry out, bleach and die. Humans who suffer from respiratory
[Figure 828 labels: smoke particles; smoke and dust particles; metal plates are earthed]
Figure 828 Schematic diagram of an electrostatic precipitator
A metal grid made of mesh is charged positively to about 50 000 V to cause the smoke surrounding it to be ionised to produce electrons and positive smoke ions. The positive ions are repelled from the mesh and attach themselves to some of the dust particles in the smoke. The charged dust particles are attracted to earthed plates where they stick. A mechanical device hits the plates periodically and the ash falls into collecting bins.
problems can have difficulties when the gas reacts with moisture above the larynx. The effects are increased further if particles are present, as the sulfur dioxide adheres to them and they are carried into the bronchi and alveoli. Sulfur dioxide can also reduce the growth of nitrogen-fixing soil bacteria. Petroleum refining and the production of coke also contribute to atmospheric sulfur dioxide.
Exercise 8.3

1. Which one of the following is not considered to be a fossil fuel?
A. wood
B. uranium
C. coal
D. crude oil

2. The correct rank advance of coal is:
A. lignite, peat, bituminous coal, anthracite
B. bituminous coal, peat, anthracite, lignite
C. peat, lignite, bituminous coal, anthracite
D. anthracite, lignite, bituminous coal, peat

3. The fuel below with the highest energy density value is:
A. coal
B. crude oil
C. ethanol
D. compressed natural gas

4. Why are energy density values of fuels usually expressed in $\mathrm{J\,g^{-1}}$ rather than $\mathrm{kJ\,mol^{-1}}$?

5. A schematic diagram of a typical coal-fired power station is shown in the figure below.
[Figure labels: coal delivery; coal storage; crusher; coal bin; pulveriser; boiler; water; steam; turbine; combustion gases; electrostatic precipitator; top ash; bottom ash]
Suggest a reason why coal is ground to a fine powder before combustion.

6. A sample of lignite has a moisture content of 65% and when dried it has an energy density of 28 $\mathrm{kJ\,g^{-1}}$. Assuming that during coal combustion the temperature of the water in the coal is raised from 20 °C to 100 °C and then vaporised at 100 °C, estimate the energy density of the coal as it is mined.

7. A coal-fired power station burns coal with 30% moisture content. On analysis, the dried sample is found to contain 70% carbon, 5% hydrogen and 25% oxygen. If 1000 tonnes is burnt hourly:
(a) Estimate the mass of water vapour emitted from the cooling towers each hour.
(b) Estimate the mass of water vapour produced in a week.
(c) Estimate the volume of condensed water vapour produced in a week.

8. It has been suggested that crude oil should be used for other purposes rather than as a transportation fuel. Deduce the reasoning behind this statement.

9. Assume that a sample of coal has an empirical formula $\mathrm{C_5H_4}$ and that a coal-fired power station burns 1000 tonnes of coal per hour.
(a) Write an equation for the complete combustion of the coal.
(b) Calculate the mass of oxygen required for this combustion each hour.
(c) If 25 $\mathrm{dm^3}$ of oxygen is required per mole, calculate the volume of oxygen that is required each hour for this combustion.
(d) Air contains approximately 20% oxygen. What volume of air is required hourly?

10. A coal-fired power station has a power output of 500 MW and operates at an efficiency of 35%. The energy density of the coal being consumed during combustion is 31.5 $\mathrm{MJ\,kg^{-1}}$.
(a) Determine the rate at which heat is being produced by the burning coal.
(b) Determine the rate at which coal is being burned.
(c) The heat is discarded into the cooling towers of the power plant and is then stored in containment reservoirs. Determine the water flow rate needed to maintain the water temperature in the towers at 10 °C.
[Figure 831 labels: bombarding neutron (n); U235; U236; fast neutrons; neutrinos; beta rays; gamma rays; fission fragments M1 and M2; light fission fragment; heavier fission fragment]
An analogy to help explain this could be a drop of water when it breaks up as it falls from a dripping tap. The water droplet can be seen to be stretching and wobbling before it breaks apart into smaller fragments. The counter forces of surface tension and intermolecular forces between the water molecules are holding the water droplet intact before it breaks up.
[Figure 831 labels: initial state; bombarding neutron; U235]
Figure 831
In a similar manner, the balance between the electrostatic forces between protons and the short-range strong nuclear force holds the nucleus together. If a fast neutron were fired at the nucleus, it would pass straight through, leaving the nucleus unchanged. However, when a slow neutron is absorbed, the unstable isotope U236 is momentarily formed and the nucleus deforms because the balance of forces is disturbed. The nucleus oscillates and then breaks up into two smaller fragments, because the long-range coulombic forces of repulsion between the protons now dominate over the short-range nuclear forces of attraction.
In any nuclear reaction, certain conservation laws govern the reaction:
1. momentum is conserved (relativistically)
2. total charge is conserved (charge of products = charge of reactants)
3. the number of nucleons remains constant
4. mass-energy is conserved ($E = mc^2$)
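The charge and nucleon-number rules in the list above can be checked mechanically for any proposed nuclear equation. The sketch below does this for a commonly quoted fission of uranium-235 into barium-144 and krypton-89 with three neutrons. The data layout and function names are illustrative choices, and the check deliberately ignores momentum and mass-energy, which need measured masses and velocities.

```python
from typing import List, Tuple

# Each particle is written as (nucleon number A, charge number Z).
Particle = Tuple[int, int]

NEUTRON: Particle = (1, 0)
U235: Particle = (235, 92)
BA144: Particle = (144, 56)
KR89: Particle = (89, 36)


def totals(particles: List[Particle]) -> Tuple[int, int]:
    """Return (total nucleon number, total charge number) for a list of particles."""
    total_a = sum(a for a, _ in particles)
    total_z = sum(z for _, z in particles)
    return total_a, total_z


def is_balanced(reactants: List[Particle], products: List[Particle]) -> bool:
    """True when both nucleon number and charge are the same on each side."""
    return totals(reactants) == totals(products)


reactants = [U235, NEUTRON]
products = [BA144, KR89, NEUTRON, NEUTRON, NEUTRON]

print("reactants (A, Z):", totals(reactants))
print("products  (A, Z):", totals(products))
print("balanced:", is_balanced(reactants, products))
```

Leaving the bombarding neutron out of the reactants makes the check fail, which is a quick way to spot a mis-copied equation.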
Since the parent nuclide is stationary, the kinetic energy before the reaction is due to the kinetic energy of the fired neutron. However, the kinetic energy after the reaction is greater than the kinetic energy of the initial neutron, because the mass defect m has been changed into energy E.
The smallest possible amount of fissionable material that will sustain a chain reaction is called the critical mass. The critical mass is reached when, on average, one of the neutrons released by each fission causes another fission. Apart from the amount of fissionable material, the shape of the material is also important. The preferred fuel for thermal reactors is solid uranium pellets contained in cylindrical fuel rods made of a zirconium alloy. This alloy is capable of withstanding high temperatures without distorting or becoming stuck in the fuel rod cavities. Some of the neutrons produced will leak from the reactor core, and this is taken into consideration when determining the critical mass. Typically in a small thermal reactor, the critical mass (contained as pellets in the fuel rods) is a few kilograms. There are about 150 fuel assemblies (see Figure 832), each containing about 60 fuel rods, placed into the core of the reactor. In the Chernobyl nuclear accident of 1986, it was determined that the zirconium alloy cladding of the fuel rods overheated and led to the release of the fission products, causing a meltdown in the reactor core.
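In other words, not every nucleus of U235 produces the same two daughter nuclides, although some daughter fragments are more probable than others. In general, with M1 and M2 denoting the two fission fragments, we can write:
$$^{235}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{236}_{92}\mathrm{U} \rightarrow \mathrm{M_1} + \mathrm{M_2} + x\,{}^{1}_{0}\mathrm{n}$$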
where x is a positive integer.
The implications of this research would have far-reaching consequences in the history of our civilisation. In 1939, Albert Einstein wrote to the then US president Franklin Roosevelt discussing the military applications of fission research. Within a short time, the Manhattan Project, the precursor to the design of the bombs that destroyed Hiroshima and Nagasaki in Japan, had begun. Today, the threat of military use of nuclear fission is still real and the United Nations is concerned that some countries may have nuclear weapon ambitions. However, the fission process has valid uses in medical research and power generation.
When fission takes place, additional neutrons (2 or 3) are released, which leads to the possibility of producing a self-sustaining chain reaction as demonstrated in Figure 833.
[Figure 832 labels: control rods; fuel rods]
Figure 832
energy conversions in a power station using an energy flow diagram as shown in Figure 834. The rectangles contain the different forms of energy, the circles show the conversion process, and the arrows show energy changes and energy outputs. The linked forms of energy can be said to form an energy chain.
[Figure 834 labels: uranium; reactor core; thermal energy; steam generator; 100 units; 50 units lost as heat to the cooling towers]
Figure 833
If there are too few neutrons, the nuclear chain reaction will cease and the reactor will shut down. If there are too many neutrons, a runaway reaction will cause an explosion. The reaction becomes uncontrolled, as in a typical atomic bomb, where enough fissionable material is held in a sufficiently small space. In a nuclear reactor, if an uncontrolled chain reaction were to occur, the large amounts of energy released would cause the fuel to melt and set fire to the reactor in what is called a meltdown.
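The difference between a reaction that dies out, one that is steady and one that runs away can be seen by following the neutron count from one generation of fissions to the next. This is a minimal sketch under a simplifying assumption: every generation multiplies the neutron count by a fixed factor `k`, the average number of neutrons from one fission that go on to cause another fission. The symbol `k`, the starting count and the number of generations are illustrative and are not taken from the text.

```python
def neutron_population(initial: float, k: float, generations: int) -> list:
    """Return the neutron count in every generation, starting from `initial`.

    `k` is the assumed average number of neutrons per fission that cause
    a further fission; it is held constant for simplicity.
    """
    counts = [float(initial)]
    for _ in range(generations):
        counts.append(counts[-1] * k)
    return counts


for k in (0.9, 1.0, 1.1):
    final = neutron_population(1000, k, 50)[-1]
    print(f"k = {k}: {final:.1f} neutrons after 50 generations")
```

With `k` below 1 the population collapses, with `k` equal to 1 it stays constant, and with `k` above 1 it grows rapidly, which corresponds to the three cases described in the paragraph above.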
[Figure 834 labels: friction; friction]
Figure 834 Principal mechanisms in a nuclear power station
The efficiency can be represented by the Sankey diagram as in Figure 835.
[Figure 835 labels: 100 units input; 15 units lost in the reactor core; 50 units lost as heat to the cooling towers; 30 units electrical output]
Figure 835
Heat energy is produced by the fission of uranium nuclei in the core of the nuclear reactor. Heavy water absorbs the heat energy in a heat exchanger under pressure, and it is turned into steam. The steam contains latent potential energy as it has been converted from a liquid to a gas. Steam under pressure is capable of doing mechanical work to supply the rotational kinetic energy to turn the steam turbines. The turbine is coupled to the generator that produces electrical energy. Energy is lost to the surroundings at many stages. For example, if 100 units of energy are supplied from the primary energy source then only 30 units of useful energy are available. The majority of the energy is lost to water in the cooling towers, as heat is given out in the condensation part of the heat exchanger cycle. Other forms of energy loss are shown on the arrowed parts of the diagram.
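The Sankey figures quoted for the nuclear power station can be turned into an efficiency and an energy balance. The sketch below uses the unit values that appear with Figure 835 (100 in, 15 lost in the reactor core, 50 lost to the cooling towers, 30 electrical out). Treating the remainder as other losses, such as friction, is an assumption made here for the bookkeeping.

```python
# Energy units read from the Sankey diagram values given with Figure 835.
energy_in = 100.0
lost_in_reactor_core = 15.0
lost_to_cooling_towers = 50.0
electrical_out = 30.0


def efficiency(useful_out: float, total_in: float) -> float:
    """Efficiency is useful energy output divided by total energy input."""
    return useful_out / total_in


# Energy conservation: whatever is not listed explicitly must be lost elsewhere
# (assumed here to be friction and similar losses in the turbine and generator).
other_losses = energy_in - (lost_in_reactor_core + lost_to_cooling_towers + electrical_out)

print(f"efficiency   = {efficiency(electrical_out, energy_in):.0%}")
print(f"other losses = {other_losses:.0f} units")
```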
reactor core, it can become radioactive and contaminated. Therefore, the coolant must exchange its heat with a secondary cooling circuit through a heat exchanger. The steam produced in the secondary loop drives a turbine to produce electricity. The low pressure steam that has passed through the turbine is passed into cooling ponds or cooling towers where the excess heat is dissipated. Common coolants include air, helium gas, heavy water, liquid sodium or certain liquid organic compounds. In many reactors, the coolant is also the moderator.
The radiation shielding protects personnel working inside and around the reactor from the ill effects of radiation exposure. There are usually two shields: several metres of high-density concrete to protect the walls of the reactor core from radiation leakage and to help reflect neutrons back into the core, and a biological shield made of several centimetres of high-density concrete to protect personnel. A typical schematic diagram of a thermal reactor is shown in Figure 836.
[Figure 836 labels: steam turbine; core (fuel and moderator); high pressure steam; electric generator]
• the fuel
• a moderator
• the control rods
• the coolant
• radiation shielding
The moderator is a material that slows down the fast neutrons to the speed of the slow thermal neutrons needed for a self-sustained reaction, without absorbing the neutrons when they collide with the moderator material. The moderator material is placed around the reactor core and in between each of the fuel assemblies. In order to be effective, the nuclei of the moderator must have a mass very close to the mass of a neutron so that a fast neutron can lose the maximum energy in a single collision. Moderators include ordinary water, heavy water ($\mathrm{D_2O}$), graphite, beryllium or liquid sodium. In the Chernobyl accident, the graphite core caught fire. Graphite fires are almost impossible to fully extinguish. The reactor core sank to the bottom of the reactor building and a theory called the China Syndrome developed that the core could continue to burn out of control until it would eventually penetrate the Earth's surface. If this had occurred, the core would have reached the water table and there would have been a massive radioactive cloud of steam with many times the radioactivity of the original cloud. Fortunately, this did not happen.
The rate of nuclear fission in the reactor core can be controlled by inserting or removing the control rods. The control rods are constructed of materials that absorb neutrons. The rods are usually steel rods containing boron or cadmium, which are said to have a high neutron-capture cross-section. Most reactors have two sets of control rods: one set of regulating rods for routine control of the fission rate, and another set that can be lowered into the core as a safety measure during an emergency shut-down. The regulating rods can be partially or fully inserted into the core, or withdrawn, as needed. There can be a large number of control rods in or out of a reactor.
The coolant circulates through the reactor core and removes thermal energy, transferring it to where it can do useful work by converting water into steam.
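To illustrate the statement that a moderator nucleus should have a mass close to that of a neutron, the sketch below uses the standard result for a head-on elastic collision between a neutron (mass taken as 1 u) and a stationary nucleus of mass number A: the fraction of the neutron's kinetic energy transferred is 4A/(1 + A)^2. The formula comes from classical mechanics rather than from this text, the mass numbers are rounded, and the function name is illustrative.

```python
def max_energy_fraction_lost(mass_number: float) -> float:
    """Fraction of a neutron's kinetic energy lost in one head-on elastic
    collision with a stationary nucleus of the given mass number."""
    a = float(mass_number)
    return 4.0 * a / (1.0 + a) ** 2


targets = [
    ("hydrogen-1", 1),
    ("deuterium", 2),
    ("beryllium-9", 9),
    ("carbon-12", 12),
    ("uranium-238", 238),
]

for name, a in targets:
    print(f"{name:12s} A = {a:3d}  fraction lost = {max_energy_fraction_lost(a):.3f}")
```

The fraction is largest for the lightest nuclei and very small for uranium-238, which is why collisions with the fuel itself do little to slow the neutrons.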
The energy released in a single fission reaction is about 200 MeV or $3.2 \times 10^{-11}$ J. Because the coolant is in direct contact with the
[Figure labels: secondary loop; control rods; turbine; generator; sprinkler system; baffles]
Figure 839 Design features of the heat exchanger and cooling tower
The temperature of the reactor is limited to 570 K, as higher temperatures tend to damage the fuel rods. Typically, the water in the secondary loop is returned to the boiler after condensation at a temperature of 310 K. It can be shown that the maximum possible efficiency of a nuclear power plant is:
$$\eta_{max} = 1 - \frac{T_L}{T_H} = 1 - \frac{310\ \mathrm{K}}{570\ \mathrm{K}} \approx 0.46 \text{ or } 46\%$$
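The limit can also be explored numerically. This is a minimal sketch, assuming the maximum possible efficiency meant here is the Carnot value $1 - T_L/T_H$ evaluated with the 570 K and 310 K temperatures quoted in the text. The 290 K condenser temperature is a hypothetical value added only to show how a colder cooling water source raises the limit.

```python
def carnot_efficiency(t_hot_k: float, t_cold_k: float) -> float:
    """Maximum (Carnot) efficiency for reservoirs at the given kelvin temperatures."""
    if t_cold_k <= 0 or t_hot_k <= t_cold_k:
        raise ValueError("need t_hot_k > t_cold_k > 0 (temperatures in kelvin)")
    return 1.0 - t_cold_k / t_hot_k


# Temperatures from the text: reactor limited to 570 K, water returned at 310 K.
print(f"570 K / 310 K: {carnot_efficiency(570.0, 310.0):.3f}")

# Hypothetical colder condenser, to show the effect of better cooling water.
print(f"570 K / 290 K: {carnot_efficiency(570.0, 290.0):.3f}")
```

A real plant falls well short of this limit because of the losses shown in the energy flow and Sankey diagrams.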
[Figure 837 labels: $Q_H$; $W$; $Q_L$; low temperature reservoir at $T_L$]
Figure 837
When the steam leaves the turbine, it is in the gaseous state at a higher temperature than the water supplied to the boiler. If the steam had returned to the original liquid state all the energy acquired in the boiler would have been extracted as work and the Second Law of Thermodynamics would be violated. To complete the cycle and to use the steam again, the steam is run through a condenser. The condenser is a coil of pipes in contact with a large volume of water, and carries the steam back to the boiler as cool water. The basic design features are shown in Figure 839.
8.4.7 Describe how neutron capture by a nucleus of uranium-238 ($^{238}\mathrm{U}$) results in the production of a nucleus of plutonium-239 ($^{239}\mathrm{Pu}$).
8.4.8 Describe the importance of plutonium-239 ($^{239}\mathrm{Pu}$) as a nuclear fuel.
8.4.9 Discuss safety issues and risks associated with the production of nuclear power.
8.4.10 Outline the problems associated with producing nuclear power using nuclear fusion.
8.4.11 Solve problems on the production of nuclear power.
© IBO 2007
[Figure label: condenser]
$$^{238}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{239}_{92}\mathrm{U}$$
This isotope undergoes $\beta^-$-decay to produce fissionable plutonium Pu239 according to the following nuclear equations:
$$^{239}_{92}\mathrm{U} \rightarrow {}^{239}_{93}\mathrm{Np} + {}^{0}_{-1}\mathrm{e} + \bar{\nu}$$
Then
$$^{239}_{93}\mathrm{Np} \rightarrow {}^{239}_{94}\mathrm{Pu} + {}^{0}_{-1}\mathrm{e} + \bar{\nu}$$
where $\bar{\nu}$ is an antineutrino.
The Pu239 is fissionable, as can be shown in the following equation, and large amounts of energy are released:
$$^{239}_{94}\mathrm{Pu} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{147}_{56}\mathrm{Ba} + {}^{90}_{38}\mathrm{Sr} + 3\,{}^{1}_{0}\mathrm{n}$$

8.4.8 PLUTONIUM-239 AS A NUCLEAR FUEL

This type of fission reaction is used in slow and fast breeder reactors (nuclear fission reactors that create or breed more fissionable material than they consume). As an isotope, uranium-238 is 140 times more abundant than uranium-235. The neutrons given off in a uranium-235 fission can breed more fuel if the non-fissionable uranium-238 is placed in a blanket around control rods containing plutonium-239 and uranium-235, as shown in Figure 840. In a fast breeder reactor, the fast neutrons produced by uranium-235 fission are used to produce Pu-239. On average 2.4 neutrons are produced in a U-235 fission, with one neutron required for the next fission and 1.4 left for neutron capture by U-238.
[Figure 840 labels: blanket of uranium-238; uranium-238 blanket]
Figure 840
Suppose there were 100 fissions of U-235 and 240 neutrons were produced. 100 neutrons will be needed for the fission of U-235, leaving 140 neutrons available. Suppose some neutrons are lost and that 110 are available for capture by non-fissionable U-238, producing 110 Pu-239 nuclei. Therefore, 100 U-235 fissions would produce fuel for 110 Pu-239 fissions, which is a 10% increase in fuel. No moderator is used as the neutrons do not have to be slowed down. Liquid sodium is used as the coolant.
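The neutron bookkeeping in the breeder example (100 fissions, 240 neutrons, 110 captured by U-238) can be laid out step by step. In this sketch the figure of 30 lost neutrons is not stated in the text; it is inferred from the 140 spare neutrons and the 110 captures. The function and parameter names are illustrative.

```python
def breeder_fuel_gain(fissions: int, neutrons_produced: int, neutrons_lost: int):
    """Return (Pu-239 nuclei bred, fractional gain in fuel).

    One neutron per fission is reserved to keep the chain reaction going;
    every remaining neutron that is not lost is assumed to be captured by
    U-238 and to yield one Pu-239 nucleus.
    """
    needed_for_chain = fissions
    spare = neutrons_produced - needed_for_chain
    captured = spare - neutrons_lost
    gain = (captured - fissions) / fissions
    return captured, gain


captured, gain = breeder_fuel_gain(fissions=100, neutrons_produced=240, neutrons_lost=30)
print(f"Pu-239 nuclei bred: {captured}")
print(f"fuel gain: {gain:.0%}")
```

If more than 40 of the 140 spare neutrons were lost, the gain would become negative and the reactor would consume more fuel than it breeds.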
Uranium ore can be mined by open pit and underground mining, or by solution mining, where solutions are pumped underground to leach the uranium-bearing minerals from sand. Extraction of uranium from sea water has also been undertaken. Uranium mining is considerably more dangerous than other mining. The biggest risk is the exposure of miners and the environment to radon-222 gas and other highly radioactive daughter products, as well as to seepage water containing radioactive and toxic materials. In the 1950s, a significant number of American uranium miners developed small-cell lung cancer, and radon was shown to be the cancer-causing agent.
The technology needed to build and operate fission reactors is significant. However, the main concern seems to be the effective disposal of the low-level waste (radioactive cooling water, laboratory equipment and protective clothing), intermediate-level waste (coolant) and high-level waste (fuel rods). The products of fission, called ash, include isotopes of the elements strontium, caesium and krypton, and these are highly radioactive with half-lives of 30 years or less. Perhaps the biggest concern is that plutonium-239, another highly radioactive product, has a half-life of about 24 000 years. This isotope is also a threat as it is used in nuclear warheads. Even though uranium-235 is only mildly radioactive, it becomes contaminated with the other highly radioactive isotopes within the reactor. Presently, the disposal methods include storage in deep underground storage areas. If the present disposal methods fail, then the danger to the environment would be catastrophic. Radioactive waste would find its way into the food chain and underground water would be contaminated. A new method of disposal, in which the waste is ground into a powder and then made into a synthetic rock, is having some success.
Provided the reactors are built and maintained to standard, no obvious pollutants escape into the atmosphere that would contribute to the greenhouse effect.
However, even with expensive cooling towers and cooling ponds, thermal pollution from the heat produced by the exchanger process could contribute to global warming. Opposition to nuclear fission has grown extensively, especially since the serious accident at Chernobyl in the Ukraine, where an explosion sent a cloud of radioactive dust and gases across Northern Europe. The engineers were carrying out some tests on the coolant and the
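order to overcome the electrostatic forces of repulsion between the hydrogen isotopes, they need to be heated to an extremely high temperature, around 100 million °C, so that they are ionised into electrons and positively charged ions in a plasma that expands in all directions. In order for fusion to occur, the plasma has to be confined for 1 second with a density of about 500 trillion atoms per cubic centimetre. Because fusion is not a chain reaction, these temperature and density conditions have to be maintained for further fusions to occur. Because the plasma is electrically charged, a magnetic field surrounding the plasma (a magnetic bottle) could lead to a confined and controlled plasma that can undergo nuclear fusion.

Example

Suppose that the average power consumption for a household is 500 W. Estimate the amount of uranium-235 that would have to undergo fission to supply the household with electrical energy for a year. Assume that for each fission, 200 MeV is released.

Solution

$200\ \mathrm{MeV} = 200 \times 10^{6}\ \mathrm{eV} \times 1.6 \times 10^{-19}\ \mathrm{J\,eV^{-1}} = 3.2 \times 10^{-11}\ \mathrm{J}$.
$500\ \mathrm{W} = 500\ \mathrm{J\,s^{-1}}$.
The total number of seconds in a year $= 60 \times 60 \times 24 \times 365.25 = 3.16 \times 10^{7}\ \mathrm{s}$.
Therefore, the total electrical energy per year $= 3.16 \times 10^{7}\ \mathrm{s} \times 500\ \mathrm{J\,s^{-1}} = 1.58 \times 10^{10}\ \mathrm{J\,yr^{-1}}$.
1 fission produces $3.2 \times 10^{-11}\ \mathrm{J}$. So for $1.58 \times 10^{10}\ \mathrm{J}$ there would be
$$\frac{1.58 \times 10^{10}\ \mathrm{J}}{3.2 \times 10^{-11}\ \mathrm{J}} = 4.9375 \times 10^{20}\ \text{fissions.}$$
Recall that $1\,\mathrm{u} = 1.661 \times 10^{-27}\ \mathrm{kg}$.
Mass of uranium-235 $= 235 \times 1.661 \times 10^{-27}\ \mathrm{kg} = 3.90335 \times 10^{-25}\ \mathrm{kg}$ per fission.
Mass of uranium-235 needed $= 3.90335 \times 10^{-25}\ \mathrm{kg} \times 4.9375 \times 10^{20}$ fissions $= 1.93 \times 10^{-4}\ \mathrm{kg}$ or 0.193 g.

Example

$$^{235}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{144}_{56}\mathrm{Ba} + {}^{89}_{36}\mathrm{Kr} + 3\,{}^{1}_{0}\mathrm{n}$$
Estimate the initial amount of uranium-235 needed to operate a 600 MW reactor for one year, assuming 40% efficiency and that for each fission, 200 MeV is released.

Solution

$200\ \mathrm{MeV} = 200 \times 10^{6}\ \mathrm{eV} \times 1.6 \times 10^{-19}\ \mathrm{J\,eV^{-1}} = 3.2 \times 10^{-11}\ \mathrm{J}$.
$600\ \mathrm{MW} = 600 \times 10^{6}\ \mathrm{J\,s^{-1}}$.
The total number of seconds in a year $= 60 \times 60 \times 24 \times 365.25 = 3.16 \times 10^{7}\ \mathrm{s}$.
Per year, the total electrical energy $= 3.16 \times 10^{7}\ \mathrm{s} \times 600 \times 10^{6}\ \mathrm{J\,s^{-1}} = 1.896 \times 10^{16}\ \mathrm{J\,yr^{-1}}$.
Since the reactor is 40% efficient, the total energy needed is
$$\frac{1.896 \times 10^{16}\ \mathrm{J\,yr^{-1}}}{0.4} = 4.74 \times 10^{16}\ \mathrm{J\,yr^{-1}}.$$
1 fission produces $3.2 \times 10^{-11}\ \mathrm{J}$. So for $4.74 \times 10^{16}\ \mathrm{J}$ there would be
$$\frac{4.74 \times 10^{16}\ \mathrm{J}}{3.2 \times 10^{-11}\ \mathrm{J}} = 1.48125 \times 10^{27}\ \text{fissions.}$$
Mass of uranium-235 $= 235 \times 1.661 \times 10^{-27}\ \mathrm{kg} = 3.90335 \times 10^{-25}\ \mathrm{kg}$ per fission.
Mass of uranium-235 needed $= 3.90335 \times 10^{-25}\ \mathrm{kg} \times 1.48125 \times 10^{27}$ fissions $\approx 5.78 \times 10^{2}\ \mathrm{kg}$, or about 578 kg.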
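Both worked examples follow the same chain of steps, so they can be wrapped in one function. The sketch below is an illustration only: it uses the constants quoted in the solutions (1.6 × 10⁻¹⁹ J per eV, 1.661 × 10⁻²⁷ kg per u, 200 MeV per fission), while the function name and its parameters are hypothetical. Because it does not round the number of seconds in a year to 3.16 × 10⁷, its results come out slightly lower than the hand-calculated values.

```python
EV_TO_J = 1.6e-19    # joules per electronvolt, as used in the worked solutions
U_TO_KG = 1.661e-27  # kilograms per atomic mass unit, as used in the worked solutions
SECONDS_PER_YEAR = 60 * 60 * 24 * 365.25


def uranium_mass_kg(electrical_power_w: float,
                    efficiency: float = 1.0,
                    energy_per_fission_mev: float = 200.0,
                    years: float = 1.0) -> float:
    """Mass of uranium-235 that must undergo fission to supply the given
    electrical power for the given time at the given overall efficiency."""
    energy_per_fission_j = energy_per_fission_mev * 1e6 * EV_TO_J
    electrical_energy_j = electrical_power_w * SECONDS_PER_YEAR * years
    energy_from_fission_j = electrical_energy_j / efficiency
    fissions = energy_from_fission_j / energy_per_fission_j
    return fissions * 235 * U_TO_KG


household_g = uranium_mass_kg(500.0) * 1000.0
reactor_kg = uranium_mass_kg(600e6, efficiency=0.4)

print(f"household at 500 W:   {household_g:.3f} g of U-235 per year")
print(f"600 MW reactor at 40%: {reactor_kg:.0f} kg of U-235 per year")
```

The same function can be reused for the later exercise on a 500 MW reactor at 35% efficiency by changing the two arguments.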
Exercise 8.4

1. [Question stem and option order not recoverable. Surviving fragments of the nuclear equations offered as options A to D:]
… $+\ {}^{231}_{90}\mathrm{Th}$
$^{12}_{6}\mathrm{C} + {}^{1}_{1}\mathrm{H} \rightarrow {}^{13}_{7}\mathrm{N} +$ …
… ${}_{50}\mathrm{Sn} +$ … $+ 2\,{}^{1}_{0}\mathrm{n}$
$^{4}_{2}\mathrm{He} + {}^{14}_{7}\mathrm{N} \rightarrow {}^{17}_{8}\mathrm{O} + {}^{1}_{1}\mathrm{H}$

2. Fission is the process by which
A. two light nuclei combine to form a heavier nucleus.
B. a heavy nucleus splits to form two lighter nuclei.
C. a heavy nucleus splits to form an alpha particle and another nucleus.
D. a light nucleus splits to form an electron and another nucleus.

3. The term critical mass refers to
A. the mass defect when a fissile nucleus decays.
B. the mass of a fissile nucleus.
C. the mass required for a self-sustaining fission reaction.
D. the mass of uranium-235 required to fuel a nuclear reactor.

4. The purpose of the control rods in a nuclear reactor is to:
A. absorb excess neutrons
B. slow down the neutrons
C. provide a container for the fuel
D. reduce the radioactivity of the fissile materials

5. [Question not recoverable from the source.]

6. Why is a $^{238}_{92}\mathrm{U}$ nucleus more likely to undergo alpha decay than fission as a means of attaining stability?

7. (a) Explain how fission reactions, once started, are considered to be self-sustaining.
(b) How is the chain reaction in nuclear reactors controlled?

8. The thermal power from the reactor is 2400 MW and this is used to operate the steam generator and turbine. The mechanical power output of the steam generator and turbine is used to drive a generator. The generator is 60% efficient and produces 600 MW of electrical power. This is represented by the energy flow diagram below.
[Energy flow diagram labels: uranium; reactor core; thermal energy; steam generator; steam turbine; kinetic energy; AC generator; electrical energy (600 MW); heat to the cooling towers; friction; friction]
(a) Calculate the power input to the generator.
(b) Calculate the power lost from the generator.
(c) Calculate the power lost by the heat engine.

9. (a) What are the strongest arguments in favour of pursuing nuclear fission as a source of energy?
(b) What are the strongest arguments against using nuclear fission as a source of energy?

10. Determine the number of fissions that will occur per second in a 500 MW nuclear reactor. Assume that 200 MeV is released per fission.

11. State three essential differences between chemical bond breaking and nuclear fission.

12. Estimate the initial amount of uranium-235 needed to operate a 500 MW reactor for one year, assuming 35% efficiency and that for each fission, 200 MeV is released.

[Figure 842 labels: to house; heat exchange; thin copper tubing; pump; water inlet; cold water in]
Solar power
8.4.12 Distinguish between a photovoltaic cell and a solar heating panel.
8.4.13 Outline reasons for seasonal and regional variations in the solar power incident per unit area of the Earth's surface.
8.4.14 Solve problems involving specific applications of photovoltaic cells and solar heating panels.
© IBO 2007
[Figure 842 label: insulation]
Figure 842 A solar collector
Parabolic dish collectors or solar furnaces are under construction in a number of countries. The Sun's rays are converged to a point by a parabolic mirror and temperatures greater than 3000 °C can be produced. If a boiler is placed at the focus, the steam that is generated can drive a turbine. A solar furnace that can generate 1 MW of thermal power has been constructed in the French Pyrenees. It consists of a 45 m reflector made of 20 000 small mirrors moulded into a parabolic shape. Sixty large computer-controlled mirrors that follow the Sun reflect light onto the parabolic mirror.
Two common methods used for the production of electrical energy from solar energy are:
• photovoltaic solar cells
• thermoelectric devices
Photovoltaic devices use the photoelectric effect. Photons from radiant energy excite electrons in a doped semiconducting material such as silicon or germanium, and the material becomes conducting, allowing electrons to flow in an external circuit to produce electrical energy. The photons must have enough energy to cause electrons to move, and this energy is available in the entire visible region of the electromagnetic spectrum.

Modern solar cells consist of thin circular wafers made of p-type and n-type silicon (silicon has 4 valence electrons). Doping with a Group 5 element (5 valence electrons) such as arsenic (As) produces an electron-rich layer, an n-type semiconductor, in which the extra electron is free to move about. Doping with a Group 3 element (3 valence electrons) such as gallium (Ga) produces an electron-deficient layer, a p-type semiconductor, in which there are not enough electrons to form a covalent bond with a neighbouring atom, leaving a hole. An electron from the n-type semiconductor can move into the hole. The electrons can move from hole to hole and produce a potential difference. The wafers are about 1 mm thick and
[Figure 843 lattice diagrams: silicon (Si) doped with arsenic (As) and with gallium (Ga)]
the solar constant
the Earth's distance from the Sun
the altitude of the Sun in the sky
the length of night and day
Figure 843 Types of semiconductors

Photovoltaic devices are a source of non-polluting, renewable energy that can be used in some areas of the world. Unfortunately, photovoltaic cells produce a very small voltage and provide very little current. They can be used to run electronic devices such as televisions and sound systems but cannot be used for high power rated appliances such as washing machines, refrigerators and electric stoves. Connecting cells in series increases the net voltage, and connecting them in parallel increases the current. The initial establishment costs are high and their efficiency at this stage is only 30%.

Thermoelectric converters appear to be a better option for the future. They use not only the visible region of the electromagnetic spectrum but also the infra-red region, the heating region of the spectrum. Bars of doped silicon are again used to create an emf between the hot end and the cold end. By connecting p-type and n-type bars in series, higher voltages can be obtained.

The disadvantages of solar power are also evident. It produces a small energy output per unit surface area of the cell being used. For large-scale production, it requires thousands of mirrors or cells that take up a large area of land. It is intermittent, with its output interrupted by night and clouds. It is relatively expensive to set up and thus requires many years before the investment is returned.
The average radiant power per unit area arriving at a surface placed perpendicular to the Sun's rays at the outer edge of the Earth's atmosphere, while the Earth is at its mean distance from the Sun, defines the irradiance or solar constant. This varies with sunspot activity, which has a cycle of around 11 years. Since the Earth-to-Sun distance varies over the course of a year from perihelion (nearest, on 3 January) to aphelion (furthest, on 3 July) due to the elliptical orbit of the Earth, the solar constant varies about 6% from 1038 W m⁻² to 1398 W m⁻². Furthermore, the energy radiated by the Sun has changed over the course of its stellar evolution. The solar constant applies perpendicular to the top of the atmosphere, and the atmosphere reduces this flux considerably: on a clear day the value at the surface is about 1 kW m⁻², and on an overcast day it could be as low as a few watts per square metre.

Solar radiation reaching the Earth differs between regions at different latitudes because of the Sun's altitude in the sky. At the equator, solar radiation has to travel through a smaller depth of atmosphere than at the poles, so there is less reflection and absorption of radiation. Each bundle of solar insolation (the energy received by the Earth as incoming short-wave radiation) has twice the area to heat at latitude 60° as it has at the equator.

Seasonal variations affect the amount of received radiation because the seasons determine how spread out the rays become. Because the Earth's axis is tilted by 23°, there is no insolation at the poles for several months of the year.

In terms of the actual power that reaches an object on the Earth, many other factors have to be taken into account. The albedo (the fraction of incident light diffusely reflected from a surface) of the Earth is about 30%.
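The statement that a bundle of insolation is spread over twice the area at latitude 60° can be checked with a short sketch. It assumes the Sun is directly overhead at the equator and ignores the atmosphere entirely; the function names are illustrative and are not from the text.

```python
import math

def ground_area_per_beam(latitude_deg):
    """Ground area covered by a beam of unit cross-section.

    Assumes the Sun is overhead at the equator and ignores
    absorption and scattering in the atmosphere.
    """
    return 1.0 / math.cos(math.radians(latitude_deg))

def relative_intensity(latitude_deg):
    """Power per unit ground area relative to the equatorial value."""
    return math.cos(math.radians(latitude_deg))

for latitude in (0, 30, 60):
    print(latitude, round(ground_area_per_beam(latitude), 2),
          round(relative_intensity(latitude), 2))
```

At latitude 60° the area factor is 2 and the intensity per unit ground area is halved, which matches the figure quoted in the paragraph.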
On its way through the atmosphere, solar radiation is absorbed and scattered to a different degree depending on the altitude of the Sun at a particular place on Earth. The lower the Sun's altitude, the greater the zenith distance and thus the greater the degree of absorption and scattering. Other factors that affect climate include changes in the Earth's orbit every 100 000 and 400 000 years, changes in the tilt of the Earth's axis, volcanic emissions, continental drift affecting the ocean currents and winds, and human activity such as burning fossil fuels and deforestation.

Example

A solar panel with dimensions 2 m by 4 m is placed at an angle of 30° to the incoming solar radiation. On a clear day, 1000 W m⁻² reaches the Earth's surface. Determine how much energy an ideal solar panel can generate in a day.

Solution

Area of the solar panel = 8 m².
Area in radiation terms = 8 cos 30° = 6.93 m².
1000 W m⁻² = 1000 J s⁻¹ m⁻²
Energy produced per day = (1000 J s⁻¹ m⁻²)(6.93 m²)(24 h day⁻¹)(60 min h⁻¹)(60 s min⁻¹) = 5.98 × 10⁸ J
The energy produced per day = 6.0 × 10⁸ J

Example

An active solar heater of volume 1.4 m³ is to provide the energy to heat water from 20 °C to 50 °C. The average power received from the Sun is 0.90 kW m⁻².
(a) Deduce that 1.8 × 10⁸ J of energy is required to heat the volume of water in the tank from 20 °C to 50 °C.
(b) Estimate the minimum area of the solar panel needed to provide 1.8 × 10⁸ J of energy in 2.0 hours.

Solution

(a) Mass of water = 1.4 × 10³ kg; energy required = 1.4 × 10³ kg × 4.18 × 10³ J kg⁻¹ °C⁻¹ × 30 °C = 1.8 × 10⁸ J.
(b) Energy provided in 2 hours = 7200 × 900 × A, therefore A = (1.8 × 10⁸ J) / (7200 s × 900 J s⁻¹ m⁻²) = 27.8 m².

Hydroelectric power

8.4.15 Distinguish between different hydroelectric schemes.
8.4.16 Describe the main energy transformations that take place in hydroelectric schemes.
8.4.17 Solve problems involving hydroelectric schemes.
IBO 2007
Hydro-electric power stations are widely used in mountainous areas of countries throughout the world. The energy is ultimately derived from the radiant energy of the Sun through the water cycle. Water that has fallen in the mountains is piped from rivers and stored in large artificial lakes that have been dammed. The water retained
Figure 844
The amount of energy available is directly proportional to the rate of flow of water and the height through which the water falls. Some dams rely on a small rate of flow with a large fall, and others have a large water flow through a smaller fall. With some hydro-electric stations, electricity is used to pump the water uphill to reservoirs during off-peak electricity use.
Example

If water from a pumped storage dam fell through a pipe 150 m at a rate of 500 kg per second, calculate the power that could be produced if the power plant is 60% efficient. Assume the density of water is 1000 kg m⁻³.

Solution

The flow rate is already given as a mass per second, so the density is not needed:
Power = mass per second × g × h = 500 kg s⁻¹ × 9.8 m s⁻² × 150 m = 7.35 × 10⁵ J s⁻¹
If 60% efficient, then the power produced = 0.6 × 7.35 × 10⁵ J s⁻¹
Total power = 4.41 × 10⁵ W = 441 kW.

For tidal power, consider an area A of water held at a height R. The mass of the water is given by
$m = \rho A R$
The water level rises and falls between each tidal surge, so the centre of mass of the water is at a height R/2. Therefore, the change in potential energy as the water runs out is
$\Delta E_p = (\rho A R)\, g\, \dfrac{R}{2} = \dfrac{\rho A g R^2}{2}$

Example

A barrage is placed across the mouth of a river as shown in the diagram of a tidal power station. If the barrage height is 15 m and water flows through 5 turbines at a rate of 1.0 × 10² kg per second in each turbine, calculate the power that could be produced if the power plant is 60% efficient. Assume the density of seawater is 1030 kg m⁻³.
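The two relations used in this section, power from a falling mass flow and the potential energy stored behind a tidal barrage, can be sketched as follows. This is a minimal illustration only: the function names and the default value g = 9.8 m s⁻² are assumptions made for the sketch.

```python
G = 9.8  # gravitational field strength in m s^-2 (assumed value)

def hydro_power(mass_flow_kg_per_s, height_m, efficiency=1.0, g=G):
    """Electrical power in watts from water falling through height_m."""
    return efficiency * mass_flow_kg_per_s * g * height_m

def tidal_energy(density_kg_m3, area_m2, tidal_range_m, g=G):
    """Potential energy in joules released as the basin empties.

    Uses E = rho * A * g * R**2 / 2, with the centre of mass at R/2.
    """
    return 0.5 * density_kg_m3 * area_m2 * g * tidal_range_m ** 2

# Pumped-storage figures from the worked example: 500 kg/s, 150 m, 60%.
print(hydro_power(500, 150, efficiency=0.6))
```

With a mass flow in kg s⁻¹ the result comes out in watts directly; a volume flow in m³ s⁻¹ would first have to be multiplied by the density.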
Wind power
8.4.19 Determine the power that may be delivered by a wind generator, assuming that the wind kinetic energy is completely converted into mechanical kinetic energy, and explain why this is impossible. 8.4.20 Solve problems involving wind power.
Solution
IBO 2007
GENERATORS
Winds are produced by the uneven heating of the Earth's surface. The Sun's rays strike the equatorial regions at right angles but approach the polar regions at an angle, so large-scale convection currents are set up in the Earth's atmosphere. The inconsistency in the wind patterns is further compounded by the Earth's axial spin and the differences in local surface conditions (mountains, deserts, oceans, lakes, forests).
Figure 846 Horizontal axis wind turbine

The vertical axis wind turbine has the advantage that it does not have to be steered into the wind and, as a result, the generator can be placed at the bottom of the system. However, if the weight of the blades becomes too great, a lot of stress is put on the pivot to the generator.

Consider a wind turbine that has a blade radius r as shown in Figure 847. The area A swept out by the blades is $A = \pi r^2$.

Wind speed v = distance d ÷ time t, so d = vt. In one second the volume of air passing the turbine = vA. Therefore, the mass of air m passing the turbine in 1 second = ρvA, where ρ is the air density.

The kinetic energy available each second $= \tfrac{1}{2} m v^2 = \tfrac{1}{2}(\rho v A)v^2 = \tfrac{1}{2}\rho A v^3$

Power available $= \tfrac{1}{2}\rho A v^3$

[Figure 847 labels: blade radius r, rotor diameter, slip ring, power cable, air density ρ, wind speed v, distance d, centre]

Figure 847 Power output of a wind generator

In reality, the power produced cannot strictly obey this equation, as there are great fluctuations in wind speed and air density throughout the days and months of the year. The electricity from wind power can be stored in batteries for later use, but this storage method is very expensive. The most practical system is to set up a wind farm consisting of a large number of wind turbines interconnected to feed a power grid.

Wind power is cheap, clean, renewable and infinite. It can be used to provide electricity to remote areas of the world. However, winds resulting from the heating of the Earth are somewhat unpredictable. The initial set-up costs are high, the structures suffer from metal fatigue and they are noisy.

The better option being considered today is to use wind turbines in association with another power source. One possibility is to combine solar power and wind power: usually, when it is sunny there is little wind, but when it is overcast there is more wind. Another possibility is to use the wind to pump water into high dams associated with hydro-electric power stations and then to run the water downhill during periods of high electricity demand.
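The relation Power = ½ρAv³ with A = πr² can be sketched in a few lines. The function names and the optional efficiency factor are assumptions of this sketch; the text itself derives only the ideal case.

```python
import math

def wind_power(air_density, blade_radius, wind_speed, efficiency=1.0):
    """Power in watts for a turbine of swept area pi * r**2."""
    swept_area = math.pi * blade_radius ** 2
    return efficiency * 0.5 * air_density * swept_area * wind_speed ** 3

def blade_radius_for(power, air_density, wind_speed, efficiency=1.0):
    """Blade radius in metres needed for a given power output."""
    return math.sqrt(
        2 * power / (efficiency * math.pi * air_density * wind_speed ** 3)
    )

# A 0.5 kW load with wind at 8.0 m/s and air density 1.1 kg/m^3.
print(round(blade_radius_for(500, 1.1, 8.0), 2))

# The cubic dependence on wind speed: doubling v gives eight times the power.
print(wind_power(1.3, 10, 20) / wind_power(1.3, 10, 10))
```

The second print shows why the output of a real turbine fluctuates so strongly: a modest change in wind speed produces a much larger change in available power.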
Example

A wind turbine has blades 20 m long and the speed of the wind is 25 m s⁻¹ on a day when the air density is 1.3 kg m⁻³. Calculate the power that could be produced if the turbine is 30% efficient.

Solution

Power = ½ρAv³ and A = πr²
Power = 0.3 × 0.5 × π × 20² m² × 1.3 kg m⁻³ × 25³ m³ s⁻³
= 3.83 × 10⁶ W
= 3.8 MW

Example

A wind generator is being used to power a solar heater pump. If the power of the solar heater pump is 0.5 kW, the average local wind speed is 8.0 m s⁻¹ and the average density of air is 1.1 kg m⁻³, deduce whether it would be possible to power the pump using the wind generator.

Solution

Power = ½ρAv³ and A = πr²
500 J s⁻¹ = 0.5 × πr² m² × 1.1 kg m⁻³ × 8.0³ m³ s⁻³
$r = \sqrt{\dfrac{2P}{\pi \rho v^3}} = \sqrt{\dfrac{1000}{\pi \times 1.1\ \mathrm{kg\,m^{-3}} \times 512\ \mathrm{m^3\,s^{-3}}}}$
= 0.75 m
This is a small radius, so it could be feasible provided the wind speed was always present.

Wave power

8.4.21 Describe the principle of operation of an oscillating water column (OWC) ocean-wave energy converter.
8.4.22 Determine the power per unit length of a wave front, assuming a rectangular profile for the wave.
8.4.23 Solve problems involving wave power.
IBO 2007

The buoyant moored devices float above or below the water and are moored to the sea floor with cables. The Salter Duck, shown in Figure 848, is an example. As it bobs backwards and forwards matching the wave motion, it turns a generator.

[Figure 848 labels: waves, Salter duck, cable, sea floor]

Figure 848 Salter duck

Hinged contour devices consist of a series of floating mattresses that are hinged together. As the sections move with the waves, the motion is resisted at the joints by hydraulic pumps that push high-pressure oil through hydraulic motors that produce electric power.
Figure 849
Figure 850 Onshore oscillating water column

The best places in the world for capturing wave power are the north and south temperate zones, where the prevailing westerly winds are strongest in the winter. There are OWC wave power stations in England (average power of 7.5 kW) and Japan (average power of 6 kW), and there is a promising project on the island of Islay off the west coast of Scotland, conducted by private enterprise, in which the OWC feeds a pair of counter-rotating turbines, each of which drives a 250 kW generator.

ω = wave frequency = 2π / T (rad s⁻¹)
T = wave period (s)

If the wave potential energy over one period is to be calculated, this equation after differentiation becomes
$PE = \tfrac{1}{4}\rho g A^2$
The total kinetic energy over one period is equal to the total potential energy:
$KE = \tfrac{1}{4}\rho g A^2$
The total energy over one period will be:
$E_T = \tfrac{1}{2}\rho g A^2$
The power generated, work / time, will equal:
$P = \tfrac{1}{2}\rho g A^2 \lambda / T$
Power per wavelength $= \tfrac{1}{2}\rho g A^2 f$
This can more importantly be expressed as:
Power per metre $= \tfrac{1}{2}\rho g A^2 v$ (where v is the speed of the wave).
The density of seawater at the surface of the ocean varies from 1020 to 1029 kilograms per cubic metre.

Example

If a wave is 3 m high and has a wavelength of 100 m and a frequency of 0.1 s⁻¹, estimate the power for each metre of the wave.

Solution

The amplitude is half the wave height, A = 1.5 m, and the wave speed is v = fλ = 0.1 s⁻¹ × 100 m = 10 m s⁻¹.
Power per metre = ½ρgA²v = 0.5 × 1020 kg m⁻³ × 10 m s⁻² × (1.5)² m² × 10 m s⁻¹
= 1.15 × 10⁵ kg m s⁻³, or about 115 kW per metre.

Example

If a wave is 3 m high and has a wavelength of 100 m and a period of 8 s, estimate the power over each metre of wavefront and calculate the wave speed.

Solution

Power = ½ρgA²λ / T = 0.5 × 1020 kg m⁻³ × 10 m s⁻² × (1.5)² m² × 100 m × 1 m / 8 s
= 143 × 10³ kg m² s⁻³ = 143 kW per metre.
Wave speed = wavelength / period = 100 m / 8 s = 12.5 m s⁻¹

Figure 851 summarises some of the advantages and disadvantages of the use of non-renewable and renewable energy sources.

Exercise 8.4 (b)

1. Doping a semiconductor to improve its conductivity means:
A. adding elements with 3 valence electrons
B. adding silicon or germanium to the semiconductor
C. adding group 3 and group 5 elements
D. adding elements with 5 valence electrons

2. Solar energy:
A. is converted completely into electricity in a photovoltaic cell
B. is not able to be stored
C. is a renewable energy source
D. is suitable only for heating water

3. All the following statements are correct EXCEPT
A. generators convert mechanical energy into electrical energy
B. nuclear reactors convert mass into energy
C. chemical energy is a form of potential energy
D. thermal energy and solar energy are the same

4. Photovoltaic cells can operate when the incident photons have
A. frequencies above visible light
B. infra-red frequencies
C. microwave frequencies
D. frequencies below visible light
Oil
Natural gas
Nuclear
High energy density Cleaner and more efficient than other fossil fuels Can be used in engines High power output Reserves available
Passive solar
No fuel costs Renewable Non-polluting Photovoltaic solar No fuel costs Renewable Non-polluting
Hydro-electric Tidal
Wind
Wave
Figure 851 Comparison of energy resources

5. In terms of energy transformations, distinguish between a solar panel and a solar cell.

6. A wind turbine farm is being designed for a town with a total required energy of 150 TJ per year. There is available space for 25 turbines and the average annual wind speed is 15 m s⁻¹.
(a) Deduce that the average required output from one turbine is 0.19 MW.
(b) Estimate the blade radius of the wind turbine that will give a power output of 0.19 MW. (Density of air = 1.3 kg m⁻³)

7. An active solar heater of volume 2.4 m³ is to provide the energy to heat water from 20 °C to 60 °C. The average power received from the Sun is 1000 W m⁻².
(a) Deduce that 4.0 × 10⁸ J of energy is required to heat the volume of water in the tank from 20 °C to 60 °C.
(b) Estimate the minimum area of the solar panel needed to provide 4.0 × 10⁸ J of energy in 2.0 hours.

8. If a wave is 12 m high and has a wavelength of 30 m and a frequency of 0.1 s⁻¹, estimate the power for each metre of the wave.
9. If a wave is 12 m high and has a wavelength of 25 m and a period of 8 s, estimate the power over each metre of wavefront and calculate the wave speed.

10. In a hydro-electric power station, water falls through a 75 m pipe at the rate of 1500 kg s⁻¹. How many megawatts of electric power could be produced by the power plant if it is 80% efficient?

11. A photovoltaic cell can produce an average 40 W m⁻² of electrical energy if it is directly facing the Sun at the equator. If a house has an electrical consumption of 75 kW, what would be the required surface area of cells needed to provide the power requirements of the household?

12. The following table shows the power generated by a small wind turbine as a function of wind speed and radius of the blade. Plot graphs to show the linear relationships that exist between the power generated and these variables.
Blade radius / m: 0.5, 0.7, 0.8, 0.9, 1.0, 1.1
Power / W: 20, 80, 200, 370, 580, 610, 1020
Wind speed 104 / km h⁻¹: 12.6, 15.9, 20, 25.2, 29.2, 30.4, 35
Example

How much solar radiation does one square metre of the Earth's surface receive per day?

Solution

For the land area of the USA, the solar radiation available over the total land surface is over 10¹⁷ kW h annually. This is about 600 times greater than the total energy consumption of the USA.

Example

Given that the mean Sun–Earth distance is 1.5 × 10⁸ km and that the power received at the top of the Earth's atmosphere is given by the solar constant, determine the total power generated by the Sun.

Solution

Every square metre at the Earth–Sun mean distance receives 1.35 kW.
The surface area of a sphere = 4πr²
Total power = 4π × (1.5 × 10¹¹ m)² × (1.35 × 10³ W m⁻²) = 3.8 × 10²⁶ W.

8.5.2 ALBEDO

The albedo (α) (Latin for white) of a surface is the ratio of the radiation reflected from the surface to the incoming radiation, expressed as a coefficient or as a percentage.

Solar radiation is mainly radiated in the visible region of the electromagnetic spectrum (0.4 µm to 0.7 µm) and as incoming short-wave infra-red radiation, called insolation. This radiation can pass through the atmosphere to warm the land and sea by the so-called natural greenhouse effect. Water vapour and clouds can absorb radiation in the 0.4 µm to 0.7 µm range and carbon dioxide can absorb radiation in the 4 µm to 7 µm range. In the range between 7 µm and 13 µm, more than 70% of the radiation escapes into space. About 7% is radiated in the short-wave ultraviolet region around 0.5 µm. The Earth is cooled by outgoing longer-wave infra-red radiation in the night. The Earth receives approximately 1 kW m⁻² on a clear day at the rate of 1.7 × 10⁷ W s⁻¹.

The incoming solar radiation is insolated, reflected and retransmitted in various ways. Figure 853 demonstrates how 100 units of input solar radiation are distributed. It can be seen that:

30% is reflected back into space, mostly by the polar icecaps and particulate matter in the atmosphere. This reflected radiant energy mainly consists of short wavelength electromagnetic radiation such as ultra-violet radiation.

51% is absorbed by the Earth during the day as thermal energy, which is then radiated back into space during the night as radiant thermal energy consisting of long wavelength electromagnetic infra-red radiation.
23% of the 51% is used in the water cycle. The radiant energy is absorbed by water and evaporation occurs as enough energy is supplied to overcome the latent heat of vaporisation. The gaseous water vapour is carried by convection currents higher into the atmosphere, and clouds with high potential energy are formed.
0.25% of the radiant energy is consumed in supplying the energy that drives the convection currents of the oceans and atmosphere, and only 0.025% is stored by photosynthesis in plants as chemical potential energy. This has been the main source of fossil fuels.

Solar or radiant energy can be converted indirectly to electrical energy by:
biomass conversion
wind power
wave energy
geothermal energy

[Figure 853 labels: Space; Outgoing shortwaves 6%, 26%; Outgoing longwaves 38%; 4%; Atmosphere; Water vapour emission, CO₂; Cloud emission; 15% water vapour absorption, CO₂; 51% absorbed]

Figure 853

Surface                  Albedo / %
Oceans                   10
Dark soils               10
Pine forests             15
Urban areas              15
Light coloured deserts   40
Deciduous forests        25
Fresh snow               85
Ice                      90
Whole planet             31

8.5.4 Describe the greenhouse effect.
8.5.5 Identify the main greenhouse gases and their sources.
8.5.6 Explain the molecular mechanisms by which greenhouse gases absorb infrared radiation.
8.5.7 Analyse absorption graphs to compare the relative effects of different greenhouse gases.
8.5.8 Outline the nature of black-body radiation.
8.5.9 Draw and annotate a graph of the emission spectra of black bodies at different temperatures.
8.5.10 State the Stefan–Boltzmann law and apply it to compare emission rates from different surfaces.
8.5.11 Apply the concept of emissivity to compare the emission rates from the different surfaces.
8.5.12 Define surface heat capacity Cs.
8.5.13 Solve problems on the greenhouse effect and the heating of planets using a simple energy balance climate model.
IBO 2007
The concentration of these natural greenhouse gases has been affected by human activity in what is known as the enhanced greenhouse effect, and this increase will be examined in Section 8.6 concerning global warming.

Carbon dioxide has always been the largest contributor to greenhouse gas concentration, and in the 1700s it is thought to have had a concentration of about 280 parts per million (ppm). The natural production is caused by respiration, organic decay of plants and animals, natural forest fires, dissolved carbon dioxide and volcanic activity.

Methane concentrations in the 1700s are believed to have been around 0.7 parts per million. The main natural source is decaying vegetation, which is found in agriculture and in wetland peat bogs. Even when we burp and pass wind we are releasing methane and other gases produced by the fermentation of plants in our digestive system.

Water vapour is found in the atmosphere due to the water cycle, in which water is evaporated mainly from the oceans and in the transpiration of plants (losing water from their leaves).

Nitrous oxide exists in parts per billion, and the concentration in the 1700s is thought to have been about 250 parts per billion (ppb). Natural sources include forests, grasslands, oceans and soil cultivation.

Chlorofluorocarbons (CFCs) and ozone (O₃) are also greenhouse gases. CFCs were found in refrigerants, aerosol sprays, solvents and foams, but the realisation that they were responsible for ozone depletion in the stratosphere has led to them being phased out and replaced with more suitable compounds. Ozone is produced by the action of sunlight on oxygen molecules, and it is also produced in the lower atmosphere as a component of photochemical smog. Its contribution to the greenhouse effect is not fully understood at this stage.
Because ultraviolet radiation is more energetic than infrared radiation, it tends to break bonds between atoms joined together. Infrared radiation, being less energetic, tends instead to cause the atoms to vibrate in various ways. When the frequency of the infrared radiation is equal to the natural frequency of vibration of two atoms bonded together, resonance occurs. It just so happens that the natural frequency of vibration of the molecules of the greenhouse gases is in the infrared region. If resonance occurs and the molecular dipole moment undergoes a change, then the greenhouse gas will absorb energy from the infrared radiation coming from a surface. Only certain energies for the system are allowed, and only photons with certain energies will excite molecular vibrations. Vibrational motion is therefore quantized, and transitions can occur between different vibrational energy levels. The absorbed energy can then be re-radiated back into the biosphere.

The vibrations of greenhouse gases and other molecules and compounds are examined by infrared spectrophotometry. In a digital IR spectrometer, a glowing filament produces infrared radiation in the form of heat, and this is passed through an unknown sample held in a small transparent container. A detecting device then measures the amount of radiation at various wavelengths that is transmitted by the sample. This information is recorded as a spectrum showing the percentage transmission against the wavelength in micrometres (microns) (µm) or against the frequency. We have already learnt that energy is directly proportional to frequency and inversely proportional to wavelength (E = hf or E = hc / λ). If the wave number is the number of waves per centimetre (cm⁻¹), we have a variable that is directly proportional to energy.
When the energy of the infrared radiation from the instrument matches the energy of vibration of a molecule in the sample, radiation is absorbed, and the frequency given in wave numbers (cm-1) of the infrared radiation matches the frequency of the vibration. Each sample examined has its own individual spectrum and therefore a blueprint of the sample just like the DNA of an individual.
Figure 855
The top two vibrations represent the stretching of the C=O bonds. One is a symmetric mode, with both C=O bonds lengthening and contracting in phase. This symmetric stretch is infrared inactive because there is no change in the molecular dipole moment, and so this vibration is not seen in the infrared spectrum of CO₂. The top right diagram shows an asymmetric mode, with one bond shortening while the other lengthens. The asymmetric stretch is infrared active due to a change in the molecular dipole moment. Infrared radiation at 2349 cm⁻¹ (4.26 µm) excites this particular vibration. The two bottom diagrams show vibrations of equal energy, with one mode being in the plane of the paper and the other out of the plane of the paper. Infrared radiation at 667 cm⁻¹ (15.00 µm) excites these vibrations. Figure 856 shows the IR spectrum for carbon dioxide at 4 kPa pressure with the two peaks clearly visible.
[Figure 856: IR spectrum of carbon dioxide; vertical axis percentage transmittance, horizontal axis running from 4000 to 900]

Figure 856
Methane, another greenhouse gas, has even more modes of vibration. It has a tetrahedral shape. For a non-linear molecule of n atoms there are 3n − 6 modes of vibration, so methane (CH₄) has 3 × 5 − 6 = 9 modes of vibration. There are ways that vibrations can occur in methane that were not evident in carbon dioxide. Apart from stretching, methane can have bending (1300 to 1500 cm⁻¹) and rocking (600 to 900 cm⁻¹) C–H bond vibrations. The IR spectrum of methane is shown in Figure 857.
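The mode count and the wave number figures quoted for carbon dioxide and methane can be reproduced with a short sketch. The 3n − 5 rule for linear molecules is standard but is not stated in the text, and the function names are illustrative only.

```python
def vibrational_modes(n_atoms, linear):
    """Number of vibrational modes of a molecule with n_atoms atoms.

    3n - 6 for a non-linear molecule (as in the text); 3n - 5 for a
    linear molecule is a standard result added here as an assumption.
    """
    return 3 * n_atoms - 5 if linear else 3 * n_atoms - 6

def wavenumber_to_micrometres(wavenumber_per_cm):
    """Convert a wave number in cm^-1 to a wavelength in micrometres."""
    return 1.0e4 / wavenumber_per_cm  # 1 cm = 10^4 micrometres

print(vibrational_modes(5, linear=False))         # methane, CH4
print(vibrational_modes(3, linear=True))          # carbon dioxide, CO2
print(round(wavenumber_to_micrometres(2349), 2))  # asymmetric stretch
print(round(wavenumber_to_micrometres(667), 2))   # bending modes
```

The four modes found for carbon dioxide correspond to the two stretches and the two equal-energy bends described for Figure 855.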
Figure 858 A black-body with multiple reflections

If the porcelain is heated to a given temperature, black-body radiation emerges from the hole. Depending on the temperature, the emerging radiation may appear red, yellow, blue or even white if the temperature is high enough. A black body emits at every wavelength because it is able to absorb every wavelength of the incoming radiation. By using a suitable spectrometer, radiation intensity can be measured in the infrared, visible and ultra-violet regions of the electromagnetic spectrum.
[Figure 857: IR spectrum of methane; vertical axis percentage transmittance, horizontal axis running from 4000 to 900]
[Figure 859: emission curves for black bodies at different temperatures (one labelled 1500 K) plotted against wavelength / µm, with the UV, visible and infra-red regions marked]

Figure 859 Black body radiation at different temperatures.

The temperature of the Sun is about 6000 K. Figure 860 shows emission spectra of black bodies at even higher temperatures. Note that sunlight has its peak at 500 nm and that all colours of the visible spectrum are present in the emission spectrum, which accounts for the Sun appearing white.

[Figure 860: power density (10¹³ W m⁻³) plotted against wavelength / nm from 0 to 2500 nm, with the visible region marked]

$P = \sigma A T^4$

Example

The tungsten filament of a pyrometer (an instrument for measuring high-temperature thermal radiation) has a length of 0.50 m and a diameter of 5.0 × 10⁻⁵ m. The power rating is 60 W. Estimate the steady temperature of the filament. Assume that the radiation from the filament is the same as that from a perfect black-body radiator at that steady temperature.

Solution

60 W = P = σAT⁴
The radius of the filament is half the diameter, r = 2.5 × 10⁻⁵ m.
The surface area of the tungsten filament (cylinder) = 2πrh = 2π × 2.5 × 10⁻⁵ m × 0.50 m = 7.85 × 10⁻⁵ m²
P = 7.85 × 10⁻⁵ m² × 5.67 × 10⁻⁸ W m⁻² K⁻⁴ × T⁴
T = [(60 W) ÷ (7.85 × 10⁻⁵ m² × 5.67 × 10⁻⁸ W m⁻² K⁻⁴)]^(1/4)
= 1916 K
≈ 1900 K
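The Stefan–Boltzmann relation P = σAT⁴ can be sketched numerically as below, using the Sun figures that appear in this chapter (radius 7 × 10⁸ m, surface temperature 6000 K, σ = 5.7 × 10⁻⁸ W m⁻² K⁻⁴). The function names and the optional emissivity argument are assumptions of this sketch.

```python
import math

SIGMA = 5.67e-8  # Stefan-Boltzmann constant in W m^-2 K^-4

def blackbody_power(area_m2, temperature_k, emissivity=1.0, sigma=SIGMA):
    """Radiated power in watts, P = e * sigma * A * T**4."""
    return emissivity * sigma * area_m2 * temperature_k ** 4

def blackbody_temperature(power_w, area_m2, emissivity=1.0, sigma=SIGMA):
    """Temperature in kelvin of a surface radiating power_w."""
    return (power_w / (emissivity * sigma * area_m2)) ** 0.25

# The Sun treated as a perfect black body, with the rounded sigma
# used in the chapter's own example.
sun_area = 4 * math.pi * (7e8) ** 2
print(blackbody_power(sun_area, 6000, sigma=5.7e-8))  # roughly 4.5e26 W
```

Because the temperature enters as the fourth power, a 10% error in T changes the calculated power by almost 50%, so estimates of this kind are sensitive to the temperature chosen.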
Example
Therefore, the power received per square metre on the Earth will be a fraction of that radiated by the Sun. Power radiated by the Sun = (4rE 2 4rS2) 1400 Wm-2
(Assume the radius of the Sun = 7 108 m and = 5.7 108 W m2 K4)
Solution
The surface area of a sphere = 4r2 Energy per second = P = A T 4 = 4r2 T 4 P = 4 (7 108)2 m2 5.7 10 8 W m2 K4 60004 K4 P = 4.55 1012 Wm2
P = eA T 4 The factor e is called the emissivity of a material. Emissivity is the ratio of the amount of energy radiated from a material at a certain temperature and the energy that would come from a blackbody at the same temperature and as such would be a number between 0 and 1. Black surfaces will have a value close to 1 and shiny surfaces will have a value close to 0. Most materials are coloured and they reflect some wavelengths better than others. For example, a blue object will reflect blue and absorb the other colours of the visible spectrum and a black object will absorb nearly all spectral colours. Therefore effective emissivity is also affected by the surface emissivity and wavelength dependence. Some approximate values are given in Figure 861. Material mercury tungsten rusted iron water soil Emissivity 0.05 0.15 0.1 0.6 0.6 0.9 0.6 0.7 0.4 0.95 Material snow ice plate glass coal black paint Emissivity 0.9 0.98 0.85 0.95 0.92
Example
The solar power received on the surface of the Earth at normal incidence is about 1400 Wm2. Deduce that the power output per square centimetre of the Suns surface is about 7.5 107 Wm2. Comment on some assumptions that have been made in determining this answer. (Take the Suns radius as 6.5 108 m and the radius of the Earths orbit around the Sun as 1.5 1011m).
Solution
Some assumptions are: the Sun and the Earth act as perfect black bodies, that the Earths orbit around the Sun is circular rather than elliptical or all of the Suns radiation falls on a sphere of this radius, that the Sun and the Earth are uniform spheres, and that the Earths atmosphere absorbs no energy. The surface area of a sphere = 4 r2 The total energy radiated by the Sun per second (power) = A T 4 = 4rS2 T 4 This energy falls around a circular sphere equivalent to the Earths orbit around the Sun equal to 4 rE 2.
Example
The Sun is at 50° to the horizontal on a clear day. Estimate how much radiation from the Sun is absorbed per hour by an animal that has a total area exposed to the Sun of 2.0 m$^2$. (Assume $\sigma = 5.7 \times 10^{-8}$ W m$^{-2}$ K$^{-4}$ and the emissivity to be 0.8.)
If one assumes that the Sun is a perfect black body with a surface temperature of 6000 K, calculate the energy per second radiated from its surface.
Solution

For the animal exposed to the Sun at 50° to the horizontal, take the average temperature to be 300 K.

Energy per second $= P = e\sigma A T^4 \cos 50^\circ$

$$P = 0.8 \times 2.0\ \mathrm{m^2} \times 5.7 \times 10^{-8}\ \mathrm{W\,m^{-2}\,K^{-4}} \times 300^4\ \mathrm{K^4} \times \cos 50^\circ = 4.75 \times 10^{2}\ \mathrm{W}$$

This is the energy absorbed per second. Multiplying by 3600 s gives about $1.7 \times 10^{6}$ J absorbed per hour.

$$C_S = f \rho c h$$

where $f = 0.7$ (the fraction of the Earth covered by water), $\rho$ = the density of sea water, 1023 kg m$^{-3}$, $c$ = the specific heat capacity of water, 4186 J kg$^{-1}$ K$^{-1}$, and $h$ = the depth of seawater that stores thermal energy.

In other words, to change the temperature of 1 m$^2$ of the Earth by 1 K will take on average $2.1 \times 10^{8}$ J of energy.

Figure 863 gives some approximate specific heat capacity values for some environmental materials.
Substance           Specific heat capacity (J kg$^{-1}$ K$^{-1}$)
Water               4180
Ice                 2100
Average rock        2000
Wet sand (20%)      1500
Snow                880
Vegetated land      830
Air                 700

Figure 863 Approximate specific heat capacity values for some environmental materials (table)
Example
Estimate the effective heat capacity of the land surface if the specific heat capacity of rock and 20% wet sand are 2000 J kg$^{-1}$ K$^{-1}$ and 1500 J kg$^{-1}$ K$^{-1}$ respectively and the thermal energy is captured in the top 2 m. (Make the density equal to 2000 kg m$^{-3}$.)
Solution
$C_S = f \rho c h$, where $f = 0.3$ (the fraction of the Earth covered by land) and $\rho$ = the average density, 2000 kg m$^{-3}$.
Example
It takes $2 \times 10^{11}$ J of thermal energy to heat 50 m$^2$ of the Earth by 10 K. Determine the surface heat capacity of the Earth.
Solution
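$$Q = A\, C_s\, \Delta T$$

$$C_s = \frac{Q}{A\,\Delta T} = \frac{2 \times 10^{11}\ \mathrm{J}}{50\ \mathrm{m^2} \times 10\ \mathrm{K}} = 4 \times 10^{8}\ \mathrm{J\,m^{-2}\,K^{-1}}$$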
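The two routes to the surface heat capacity used in this section, $C_S = f\rho c h$ and $C_s = Q/(A\,\Delta T)$, can be sketched as below. The function names are illustrative only, and the numbers are those of the example above.

```python
def surface_heat_capacity(fraction, density, specific_heat, depth):
    """C_S = f * rho * c * h, in J m^-2 K^-1."""
    return fraction * density * specific_heat * depth

def surface_heat_capacity_from_heating(energy_j, area_m2, delta_t_k):
    """Rearranged Q = A * C_s * dT, giving C_s = Q / (A * dT)."""
    return energy_j / (area_m2 * delta_t_k)

# Example from the text: 2e11 J heats 50 m^2 of the Earth by 10 K
cs_example = surface_heat_capacity_from_heating(2e11, 50.0, 10.0)
print(f"C_s = {cs_example:.1e} J m^-2 K^-1")
```

The first function needs a depth $h$, which has to be supplied by the user since it depends on the surface being modelled.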
Remember that the solar constant at a particular surface was previously defined as the amount of solar energy per second that falls on an area of 1 m$^2$ of the upper atmosphere perpendicular to the Sun's rays, and its value is equal to $1.35 \times 10^{3}$ W m$^{-2}$. Of course this is not the power that arrives on 1 m$^2$ of the Earth's surface, because the planet's average incoming radiation (insolation) is reflected, scattered and absorbed. Of every 100 units of insolation, 28 units are reflected off the clouds and aerosols, 18 units are absorbed in the atmosphere and 4 units are reflected off land surfaces. In fact the fraction of power from the solar constant at the Earth's surface is 343 W m$^{-2}$.

The temperature of the Earth can be determined by finding the Earth's average incoming insolation, subtracting the amount reflected back into space by the global albedo, and adding the energy that is supplied to the surface by the greenhouse effect. The incoming radiation only falls on an area equal to $\pi R_E^2$ because only one side of the Earth is facing the Sun to receive the incoming radiation. Now each 1 m$^2$ will have its own albedo and its own surface temperature, and adjustments to the above value of the incoming radiation have to be made to account for these factors. It can be proposed in the balance climate model that the incoming radiation will therefore be equal to:

$$\pi R_E^2 (1 - \alpha)\ (\mathrm{m^2}) \times \text{the solar constant}\ (\mathrm{W\,m^{-2}})$$

where $\alpha$ is the albedo. Assuming that the Earth is radiating over its entire surface area, the outgoing radiation can be obtained using Stefan's law and is determined to be equal to:

$$4\pi R_E^2 \sigma T_E^4$$

Therefore, assuming a balance in thermal equilibrium:

$$\pi R_E^2 (1 - \alpha) \times \text{solar constant} = 4\pi R_E^2 \sigma T_E^4$$

$$\frac{(1 - \alpha) \times \text{solar constant}}{4} = \sigma T_E^4$$

Let us see if this equilibrium state can match some known quantities in the following example.
Example
If the long-wave radiation flux from the surface of the Earth has an average value of 240 W m$^{-2}$ and the average temperature somewhere in the atmosphere is 255 K, determine the incoming and outgoing radiation of the Earth. Assume the global albedo is 0.3.
Solution
$$\text{Incoming radiation} = \frac{(1 - \alpha) \times \text{solar constant}}{4} = \frac{(1 - 0.3) \times 1.35 \times 10^{3}\ \mathrm{W\,m^{-2}}}{4} = 236.25\ \mathrm{W\,m^{-2}}$$

$$\text{Outgoing radiation} = \sigma T_E^4 = 5.7 \times 10^{-8}\ \mathrm{W\,m^{-2}\,K^{-4}} \times (255)^4\ \mathrm{K^4} = 241\ \mathrm{W\,m^{-2}}$$

These values are nearly equal to the average radiation flux value of 240 W m$^{-2}$.

Now let us examine the long-wave radiation flux for some different latitudes that have different land fractions, ocean fractions and sea ice fractions. Table 864 gives some values and their respective albedos. It may be worthwhile to look at an atlas so that the values given make some sense. The zonal surface albedo for each latitude is determined by taking the various surfaces and their albedo into account.

Latitude    Zonal surface albedo
85° N       0.44
55° N       0.13
25° N       0.13
5° N        0.08
5° S        0.08
25° S       0.10
55° S       0.09

Figure 864 Zonal surface albedo by latitude (the table also has columns for ocean fraction, land albedo $\alpha_S$ and ocean albedo)
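The balance between incoming and outgoing radiation in this model can be sketched as follows. This is a minimal illustration using the rounded constants from the text; the function names are assumptions made for the sketch.

```python
SIGMA = 5.7e-8           # Stefan-Boltzmann constant as rounded in the text (W m^-2 K^-4)
SOLAR_CONSTANT = 1.35e3  # solar constant used in the text (W m^-2)

def incoming_radiation(albedo, solar_constant=SOLAR_CONSTANT):
    """Average absorbed flux: (1 - albedo) * solar constant / 4."""
    return (1.0 - albedo) * solar_constant / 4.0

def outgoing_radiation(temperature_k):
    """Flux radiated by a black body at the given temperature: sigma * T^4."""
    return SIGMA * temperature_k ** 4

def equilibrium_temperature(albedo, solar_constant=SOLAR_CONSTANT):
    """Temperature at which the outgoing flux equals the incoming flux."""
    return (incoming_radiation(albedo, solar_constant) / SIGMA) ** 0.25

print(f"Incoming: {incoming_radiation(0.3):.2f} W m^-2")
print(f"Outgoing at 255 K: {outgoing_radiation(255.0):.1f} W m^-2")
print(f"Equilibrium temperature: {equilibrium_temperature(0.3):.1f} K")
```

The equilibrium temperature found this way is close to the 255 K quoted for the atmosphere, which is why the incoming and outgoing values nearly agree.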
The surface temperature can now be obtained from the model:

$$\frac{(1 - \alpha_S) \times \text{solar constant}}{4} = \sigma T_E^4$$

Example

Determine the average zonal surface temperatures for the latitude belts 85° N and 25° S.

Solution

For 85° N,

$$T = \sqrt[4]{\frac{(1 - 0.44) \times 1.35 \times 10^{3}\ \mathrm{W\,m^{-2}}}{4 \times 5.7 \times 10^{-8}\ \mathrm{W\,m^{-2}\,K^{-4}}}} = 240\ \mathrm{K}$$

For 25° S,

$$T = \sqrt[4]{\frac{(1 - 0.10) \times 1.35 \times 10^{3}\ \mathrm{W\,m^{-2}}}{4 \times 5.7 \times 10^{-8}\ \mathrm{W\,m^{-2}\,K^{-4}}}} \approx 270\ \mathrm{K}$$

There appears to be something wrong with these temperatures using the zonal surface albedos. They are too low, because the insolation absorbed in the atmosphere has not been taken into account. These temperature values are not the surface temperatures but rather the low atmosphere temperatures, because the atmospheric absorption and scattering has to be included. If the insolation and cloud cover for the latitude belts is included, a planetary albedo value $\alpha_P$ can be found. Table 865 provides this information for some latitude belts.

$$(1 - \alpha_P) \times \text{insolation} + F = \sigma T_S^4$$

$F$ is the zonal long-wave radiation flux and is equal to 240 W m$^{-2}$ as seen earlier in this section.

Example

Determine the average zonal surface temperatures for the latitude belts 55° N and 25° S. Are these temperature values in agreement with the actual average temperatures?

Solution

$$(1 - \alpha_P) \times \text{insolation} + F = \sigma T_E^4 \quad \text{so} \quad T = \sqrt[4]{\frac{(1 - \alpha_P) \times \text{insolation} + F}{\sigma}}$$

For 55° N,

$$T = \sqrt[4]{\frac{(1 - 0.365) \times 285.5\ \mathrm{W\,m^{-2}} + 240\ \mathrm{W\,m^{-2}}}{5.7 \times 10^{-8}\ \mathrm{W\,m^{-2}\,K^{-4}}}} \approx 290\ \mathrm{K}\ (293.2\ \mathrm{K}\ \text{before rounding})$$

For 25° S,

$$T = \sqrt[4]{\frac{(1 - 0.241) \times 394.8\ \mathrm{W\,m^{-2}} + 240\ \mathrm{W\,m^{-2}}}{5.7 \times 10^{-8}\ \mathrm{W\,m^{-2}\,K^{-4}}}} \approx 310\ \mathrm{K}\ (311.9\ \mathrm{K}\ \text{before rounding})$$
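The two zonal temperature estimates above can be reproduced with a short sketch. The function names are illustrative, and the albedo and insolation values are the ones quoted in the worked solutions.

```python
SIGMA = 5.7e-8           # Stefan-Boltzmann constant as rounded in the text (W m^-2 K^-4)
SOLAR_CONSTANT = 1.35e3  # W m^-2

def temperature_from_surface_albedo(surface_albedo):
    """First model: (1 - a_S) * solar constant / 4 = sigma * T^4."""
    return ((1.0 - surface_albedo) * SOLAR_CONSTANT / (4.0 * SIGMA)) ** 0.25

def temperature_from_planetary_albedo(planetary_albedo, insolation, flux=240.0):
    """Second model: (1 - a_P) * insolation + F = sigma * T^4."""
    return (((1.0 - planetary_albedo) * insolation + flux) / SIGMA) ** 0.25

t_85n = temperature_from_surface_albedo(0.44)
t_55n = temperature_from_planetary_albedo(0.365, 285.5)
t_25s = temperature_from_planetary_albedo(0.241, 394.8)
print(f"85 N (surface albedo):   {t_85n:.1f} K")
print(f"55 N (planetary albedo): {t_55n:.1f} K")
print(f"25 S (planetary albedo): {t_25s:.1f} K")
```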
These temperatures are still not in agreement with the actual average temperatures of the two latitude belts. There appears to be a difference of 293.2 − 278.3 = 14.9 K for the 55° north belt and 311.9 − 296.4 = 15.5 K for the 25° south belt. The final column in Table 865 gives the temperature difference from the actual average Earth temperature at each latitude using the present balance climate model. As can be seen, the values at the equator are higher and the value in the northern polar region is lower. It has been suggested that the problem is to do with the transfer of heat due to convection in the atmosphere. There exists a temperature gradient between different zones of the Earth and it takes time for equilibrium to be established. The rate at which heat is transferred between the equator and the polar region is a function of the temperature gradient. It has been found that the transport of heat from the equator to the polar regions is a function of the zonal temperature, the global temperature, and the dynamical energy transport coefficient.
[Table 865: planetary albedo ($\alpha_P$) and temperature difference from the actual average temperature for the latitude belts 85° N, 55° N, 25° N, 5° N, 5° S, 25° S and 55° S]
The globally averaged temperature can be taken as 288 K at present. The equation now becomes:

$$(1 - \alpha_P) \times \text{insolation} - F = (\text{transport coefficient}) \times (T_S - \bar{T}_S)$$

where $T_S$ is the average latitude belt temperature, $\bar{T}_S$ is the globally averaged temperature and the dynamical energy transport coefficient has a value of 3.4 W m$^{-2}$ K$^{-1}$.

It has also been shown that clouds enhance warming in the polar regions and cause cooling in the tropics. This factor is called the cloud top feedback: the adjusted polar transport coefficient is equal to the dynamical energy transport coefficient plus $C$, where $C$ is the cloud feedback for a given latitude. This final adjustment makes the original equation become:

$$(1 - \alpha_P) \times \text{insolation} - F = (\text{adjusted transport coefficient}) \times (T_S - \bar{T}_S)$$

Figure 866 provides this information for some latitude belts.

[Figure 866 Adjustment values: table with columns for latitude (85° N, 55° N, 25° N, 5° N, 5° S, 25° S, 55° S), actual temperature (K), combined insolation (W m$^{-2}$), planetary albedo ($\alpha_P$) and the adjusted transport coefficient]

There are many energy balance climate models and they are only as good as the mathematics. If the mathematics is wrong or the wrong data is collected, then the model is flawed from the very beginning. This model is highly oversimplified because it is not global but rather restricted to a certain small latitudinal region, and it does not account for the flow of energy from one zone to the next or the fact that each latitude zone will receive different amounts of incoming radiation depending on whether it is close to the equator or at the north and south extremes. Furthermore, it does not account for cloud cover. The model could be greatly improved by taking these factors into consideration or by using a more complex model such as a three-dimensional general circulation model.

Now suppose there was an increase in the amount of incoming radiation from the Sun. According to our energy balance climate model, there would also be an increase in the outgoing radiation so that equilibrium could be re-established. This increase would cause the planetary temperature to increase. Let us suppose that the solar constant at the 25° S latitude belt increased by 10% for one year when the albedo was 0.3 and the temperature somewhere above in the atmosphere was 255 K. The temperature change will be given by:

$\Delta T$ = new temperature − old temperature

It can be shown that the change in this region's temperature over a period of time is given by:

$$\Delta T = \frac{(\text{incoming radiation intensity} - \text{outgoing radiation intensity}) \times \text{time}}{\text{surface heat capacity}}$$

$$\Delta T = \frac{(I_{in} - I_{out})\,\Delta t}{C_s}$$

Let us go back to our basic model. Using the equations:

$$\text{Incoming radiation} = \frac{(1 - \alpha) \times 1.1 \times \text{solar constant}}{4} = \frac{(1 - 0.3) \times 1.1 \times 1.35 \times 10^{3}\ \mathrm{W\,m^{-2}}}{4} = 259.875\ \mathrm{W\,m^{-2}}$$

$$\text{Outgoing radiation} = \sigma T_E^4 = 5.7 \times 10^{-8}\ \mathrm{W\,m^{-2}\,K^{-4}} \times (255)^4\ \mathrm{K^4} = 241\ \mathrm{W\,m^{-2}}$$

The change in temperature $\Delta T$ would equal:

$$\frac{(259.875\ \mathrm{W\,m^{-2}} - 241\ \mathrm{W\,m^{-2}}) \times 60 \times 60 \times 24 \times 365\ \mathrm{s}}{4 \times 10^{8}\ \mathrm{J\,m^{-2}\,K^{-1}}} = 1.49\ \mathrm{K}$$
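The temperature change for a 10% increase in the solar constant over one year can be sketched as below. This is a minimal illustration of $\Delta T = (I_{in} - I_{out})\,\Delta t / C_s$ using the values from the text; the function name is hypothetical.

```python
SIGMA = 5.7e-8                       # W m^-2 K^-4, as rounded in the text
SOLAR_CONSTANT = 1.35e3              # W m^-2
SECONDS_PER_YEAR = 60 * 60 * 24 * 365

def temperature_change(intensity_in, intensity_out, seconds, surface_heat_capacity):
    """dT = (I_in - I_out) * dt / C_s."""
    return (intensity_in - intensity_out) * seconds / surface_heat_capacity

# Solar constant raised by 10%, albedo 0.3, atmosphere at 255 K, C_s = 4e8 J m^-2 K^-1
i_in = (1.0 - 0.3) * 1.1 * SOLAR_CONSTANT / 4.0
i_out = SIGMA * 255.0 ** 4
delta_t = temperature_change(i_in, i_out, SECONDS_PER_YEAR, 4e8)
print(f"I_in = {i_in:.3f} W m^-2, I_out = {i_out:.1f} W m^-2, dT = {delta_t:.2f} K")
```

Changing the factor 1.1 or the number of seconds lets the same function be reused for other increases and time periods.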
Exercise 8.5

1. All of the following are natural greenhouse gases EXCEPT:
A. methane
B. nitrogen
C. water vapour
D. nitrous oxide

The following information is about questions 2-5.
A perfectly black body sphere is at a steady temperature of 473 K and is enclosed in a container at absolute zero temperature. It radiates thermal energy at a rate of 300 J s$^{-1}$.

2. If the temperature of the sphere is increased to 946 K it radiates heat at a rate of:
A. 300 W
B. 1200 W
C. 3200 W
D. 4800 W

3. If the radius of the sphere is doubled it radiates heat at a rate of:
A. 300 W
B. 1200 W
C. 3200 W
D. 4800 W

4. If the temperature of the enclosure is raised to 500 K it radiates heat at a rate of:
A. 300 W
B. 1200 W
C. 3200 W
D. 4800 W

5. If the enclosure is at 473 K the net rate of heat loss would be:
A. 0 W
B. 300 W
C. 1200 W
D. 100 000 W

By referring to Figures 859 and 860 answer the following questions.
(a) What is the difference between a black body radiator and a non-black body radiator?
(b) Explain why a body at 1500 K is red hot whereas a body at 3000 K is white hot.
(c) How can you use the information from the graphs to attempt to explain Stefan's law?
(d) As the temperature increases, what changes take place to the energy distribution among the wavelengths radiated?

8. A very long thin-walled glass tube of diameter 2.0 cm carries oil at a temperature 40 °C above that of the surrounding air, which is at a temperature of 27 °C. Estimate the energy lost per unit length.

9. (a) Estimate the mean surface temperature of the Earth if the Sun's rays are normally incident on the Earth. Assume the Earth is in radiative equilibrium with the Sun. The Sun's temperature is 6000 K and its radius is $6.5 \times 10^{8}$ m. The distance of the Earth from the Sun is $1.5 \times 10^{11}$ m.
(b) What assumptions have been made about the temperature obtained?

10. Estimate the effective heat capacity of the oceans if the specific heat capacity of water is 4200 J kg$^{-1}$ K$^{-1}$ and the thermal energy is captured in the top 50 m. (Make the density equal to 1030 kg m$^{-3}$.)

11. Suppose that the solar constant at the 25° S latitude belt increased by 20% for two years when the albedo was 0.3 and the temperature somewhere above in the atmosphere was 255 K. Determine the change in temperature for this time period.
A word of caution is offered to the reader before beginning this section concerning global warming. Very few scientists who specialise in this subject would disagree with the fact that human activity is responsible for an enhancement of greenhouse gas emissions and that this may contribute to global warming. However, the scientific community is in serious debate as to whether nature can deal with the enhanced greenhouse effect. In the 1970s, some science teachers taught students through various curricula that there was a serious energy crisis and that most of the crude oil would be used up by the beginning of the 21st century, that an ice age was imminent and that the world population was increasing exponentially. We know that these predictions did not happen. As individuals we are questioning the real meaning of truth as we witness the sad course of events that are occurring in the world as portrayed in the media.

There is a split in the science community concerning the pros and cons of global warming, and any such polarisation tends to create disharmony and self-doubt. Global warming is big business: many individuals and organisations are making hefty profits from nature documentaries and climate modelling projects, and governments are collecting taxes to fund research and conventions on this subject. Take the time to do a web search on global warming, climate models and the greenhouse effect and it will become evident. As much as there is evidence that ocean and sea levels are rising and that the ice caps are melting, there is the counter-evidence that sea levels have not risen or have in fact fallen in some geographical areas and that there is an actual increase in the Arctic ice cap and the Antarctic sea ice cover. During December 2006, New York had one of its highest temperatures for a winter while Melbourne had snow in the hinterlands during summer. As students it is up to you to gather the evidence and come to your own conclusions.
Greenhouse gas emissions from the burning of fossil fuels have increased the concentration of carbon dioxide, nitrous oxides and aerosols in the atmosphere, and the melting of permafrost regions in the world and deforestation have increased the concentrations of methane. These greenhouse gases capture the infra-red radiation and cause resonance at the natural frequency of oscillation of the greenhouse gases. This energy is then re-radiated to the Earth causing climate changes. The Sun, being a variable star, has a major influence on the long-term and short-term changes in climate of the solar system. The centre of mass of the solar system varies in an eleven year Sunspot cycle and this causes variations in the centre of mass of the nucleus of the Sun that leads to a
If global warming increased, the ice and snow cover of the Earth would decrease. Ice and snow have a high albedo, which means the ratio of the radiation reflected to the incoming radiation is high. However, the land which would be exposed or the water that would be formed has a low albedo and, as a result, more radiation would be captured, resulting in an increase in the rate of heat absorption of the Earth.
Deforestation reduces carbon fixation in the carbon cycle. By the process of photosynthesis, plants take carbon dioxide from the air to make sugars and cellulose, thus locking up or fixing the carbon dioxide. With increased deforestation, there are fewer plants to carry out this process and CO$_2$ concentrations would increase.
8.6.7 State that one possible effect of the enhanced greenhouse effect is a rise in mean sea-level.
8.6.8 Outline possible reasons for a predicted rise in mean sea-level.
8.6.9 Identify climate change as an outcome of the enhanced greenhouse effect.
8.6.10 Solve problems related to the enhanced greenhouse effect.
8.6.11 Identify some possible solutions to reduce the enhanced greenhouse effect.
8.6.12 Discuss international efforts to reduce the enhanced greenhouse effect.
IBO 2007
Example

(a) Calculate the total energy needed to convert 10 tonnes of ice at −40 °C to water at 16 °C.
(b) If the ice has an area of 100 m$^2$, estimate the depth of the ice.
(c) Now suppose this volume of water was increased in temperature by 4 °C. Estimate the increase in volume of the water.

Solution

(a) Latent heat of fusion of ice = $3.34 \times 10^{5}$ J kg$^{-1}$
Specific heat capacity of ice = $2.1 \times 10^{3}$ J kg$^{-1}$ K$^{-1}$
Specific heat capacity of water = 4180 J kg$^{-1}$ K$^{-1}$

Using

$$Q = mc\Delta T_{ICE} + mL_f + mc\Delta T_{WATER} = m\,[c\Delta T_{ICE} + L_f + c\Delta T_{WATER}]$$

$$Q = 1 \times 10^{4}\ \mathrm{kg} \times [(2.1 \times 10^{3}\ \mathrm{J\,kg^{-1}\,K^{-1}} \times 40\ \mathrm{K}) + 3.34 \times 10^{5}\ \mathrm{J\,kg^{-1}} + (4180\ \mathrm{J\,kg^{-1}\,K^{-1}} \times 16\ \mathrm{K})]$$

$$Q = 4.85 \times 10^{9}\ \mathrm{J} \approx 5 \times 10^{9}\ \mathrm{J}$$

(b) 10 tonnes = 10 000 kg. If we assume that 1 kg of water has a volume of 1 dm$^3$, then the total volume would be 10 000 dm$^3$ or 10 m$^3$. Therefore, the depth of the ice would be about 0.1 m because 100 m$^2$ × 0.1 m = 10 m$^3$.

(c) $\Delta V = \beta V_0 \Delta T = 210 \times 10^{-6}\ \mathrm{K^{-1}} \times 10\ \mathrm{m^3} \times 4\ \mathrm{K} = 8.4 \times 10^{-3}\ \mathrm{m^3} = 8.4$ L.

Example

If the Sun supplied 1 kW m$^{-2}$ to the ice in the previous question and the ice had an albedo of 0.90, estimate the time it would take for the ice to melt and reach 16 °C.

Solution

(a) Energy required = $4.85 \times 10^{9}$ J and the Sun supplies 1000 J s$^{-1}$ m$^{-2}$. The albedo is 0.90, so 90% of the radiation is reflected. The energy absorbed from the Sun is therefore 100 J s$^{-1}$ m$^{-2}$, which over the 100 m$^2$ area of the ice is $1 \times 10^{4}$ J s$^{-1}$.

If this energy goes to heating the ice, then it will take:

$$\frac{4.85 \times 10^{9}\ \mathrm{J}}{1 \times 10^{4}\ \mathrm{J\,s^{-1}}} = 4.85 \times 10^{5}\ \mathrm{s} \approx 5.6\ \text{days}$$

(b) We have assumed that no heat was lost to the surroundings and that the ice is not 0.1 m thick but rather infinitely thin.
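The ice calculations can be checked with a short sketch. It assumes that the absorbed power is the absorbed intensity multiplied by the full 100 m$^2$ area of the ice, and that 1 kg of water occupies 1 dm$^3$; the function and variable names are illustrative only.

```python
def energy_to_melt_and_warm(mass_kg, c_ice, dt_ice, latent_fusion, c_water, dt_water):
    """Q = m * (c_ice * dT_ice + L_f + c_water * dT_water)."""
    return mass_kg * (c_ice * dt_ice + latent_fusion + c_water * dt_water)

def heating_time(energy_j, intensity, albedo, area_m2):
    """Time for the absorbed solar power, (1 - albedo) * I * A, to supply the energy."""
    absorbed_power = (1.0 - albedo) * intensity * area_m2
    return energy_j / absorbed_power

# 10 tonnes of ice from -40 C to water at 16 C
q = energy_to_melt_and_warm(1e4, 2.1e3, 40.0, 3.34e5, 4180.0, 16.0)

# Volume (assuming 1000 kg per m^3), depth over 100 m^2, and expansion for a 4 K rise
volume = 1e4 / 1000.0
depth = volume / 100.0
expansion = 210e-6 * volume * 4.0

# 1 kW m^-2 falling on ice of albedo 0.90 and area 100 m^2
seconds = heating_time(q, 1000.0, 0.90, 100.0)
print(f"Q = {q:.3g} J, depth = {depth} m, dV = {expansion * 1000:.1f} L")
print(f"time = {seconds:.3g} s = {seconds / 86400:.1f} days")
```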
There are a number of ways that we can use to reduce the enhanced greenhouse effect. These include:
greater efficiency of power production
replacing the use of coal and oil with natural gas
use of combined heating and power systems (CHP)
increased use of renewable energy sources and nuclear power
carbon dioxide capture and storage
use of hybrid vehicles.

As already mentioned, energy is lost to the surroundings at many stages in a fossil fuel power plant, and the efficiency of coal and oil-fired power plants can be as low as 40%. The majority of the useful energy is lost to water in the cooling towers as heat is evolved in the condensation component of the heat exchanger cycle. However, a natural gas-fired power station is more efficient as it uses combined cycle gas turbines (CCGT). A jet engine is used in place of the turbine to turn the generator. Natural gas is used to power the jet engine and the exhaust fumes from the jet engine are used to produce steam which turns the generator. These power stations can be up to 55% efficient.

The highest efficiencies are being obtained in combined cycle plants such as the cogeneration or CHP (combined heat and power) plants. In a combined cycle plant, surplus heat from a gas turbine is used to produce steam which in turn drives a steam turbine. The combined cycle plants are designed in a variety of configurations composed of the number of gas turbines followed by the steam turbine. For example, a 3-1 combined cycle facility has three gas turbines tied to one steam turbine. Efficiencies of over 50% can be achieved. Furthermore, if the plant's waste heat is utilized for the heating of houses or an industrial process, efficiencies of 80% are achievable.

A number of advanced technologies are being developed, including PFBC (pressurized fluidized bed combustion), IGCC (integrated gasification combined cycle) and IDGCC (integrated drying gasification combined cycle). These technologies generate electricity at a higher efficiency.

China is presently building coal-fired power stations at the rate of one per day. Surely it would be better to use renewable energy resources that are cleaner and less polluting. However, China has coal resources and the demand for energy is unprecedented in this country. Nuclear power is back and some countries believe that it is the best alternative to fossil fuels because it is clean and 1 kg of uranium can produce as much energy as 2 000 000 kg of coal.

Revegetation of large areas of land has been proposed in the Kyoto Protocol that came into force on 16 February 2005. Countries can trade their emission quotas by being given carbon points for reforestation. This will store and capture carbon and reduce the amount of carbon dioxide in the atmosphere.

Hybrid vehicles will play a big part in the reduction of carbon dioxide emissions. Many taxis and other vehicles have converted from gasoline to LPG over the past 20 years. Many local government vehicles are battery driven. Although many people talk about the hydrogen vehicle, the tank would have to be very heavy and would have to be filled up more often than that of a conventional vehicle.
In the 1980s, the United Nations Environment Programme in conjunction with the World Meteorological Organization set up a panel of government representatives and scientists to determine the factors that may contribute to climate change. The panel was known as the Intergovernmental Panel on Climate Change (IPCC). This body has published many extensive reports that formed the basis for many discussions and decision-making about the enhanced greenhouse effect.

In 1992, the Earth Summit 1 was held in Rio de Janeiro. Some 150 other countries signed the UN Framework Convention on Climate Change. The stated objective of the Framework Convention is to achieve:

"...stabilisation of the greenhouse gas concentrations in the atmosphere at a level that would prevent dangerous anthropogenic interference with the climate system. Such a level should be achieved within a time-frame sufficient to allow ecosystems to adapt naturally to climate change, to ensure that food production is not threatened and to enable economic development to proceed in a sustainable manner."

Each country agreed to report its greenhouse gas emissions and the strategies and measures it has adopted to reduce them.
Exercise 8.6

1. Define the following terms
(a) energy
(b) energy density
(c) efficiency of an energy conversion
(d) albedo
(e) resonance
(f) emissivity
(g) surface heat capacity
(h) the coefficient of volume expansion.

2. Describe the meaning of the following terms and give an example of each
(a) a thermodynamic cycle
(b) energy degradation
(c) a fossil fuel
(d) a renewable energy source
(e) a pump storage system
(f) a combined cycle gas turbine
(g) an oscillating water column
(h) blackbody radiation

3. Outline the findings of the following international bodies
(a) IPCC
(b) The Kyoto Protocol
(c) APPCDC
4.
9 MOTION IN FIELDS

9.1 Projectile motion
9.2 Gravitational field, potential and energy
9.3 Electric field, potential and energy
AHL
[Figure 901 The path of a horizontal projectile: the particle leaves the top of a cliff of height $h$ with horizontal velocity $v_h$, accelerates downwards at $g$ m s$^{-2}$ and lands a distance $d$ from the base of the cliff]

Since there is no force acting in the horizontal direction, the horizontal velocity will remain unchanged throughout the flight of the particle. However, the vertical acceleration of the projectile will be equal to $g$. We can find the time of flight $t$ by finding the time it takes the particle to fall a height $h$. To start with, we consider only the vertical motion of the object:

$u = 0$, $a = g$, $s = h$, $t = ?$, $v = v_v$

Time of flight
This is calculated from the definition of acceleration, i.e., using $v = u + at$, we have that

$$v_v = 0 + gt \Rightarrow t = \frac{v_v}{g}$$

where $v_v$ is the vertical velocity with which the object strikes the ground. To find $v_v$ we use the equation $v^2 = u^2 + 2as$, so that, as the initial vertical velocity ($u$) is zero, the acceleration $a = g$ and $s = h$. Then

$$v_v^2 = 0 + 2gh \Rightarrow v_v = \sqrt{2gh}$$

From this we have that the time of flight is

$$t = \frac{\sqrt{2gh}}{g} = \sqrt{\frac{2h}{g}}$$

Since the horizontal velocity is constant, the horizontal distance $d$ that the particle travels before striking the ground is $v_h t$ (i.e., using $s = ut + \frac{1}{2}at^2 = ut$, where in the horizontal direction we have that $a = 0$ and $u = v_h$ = constant). This gives

$$d = v_h \sqrt{\frac{2h}{g}}$$

This is the general solution to the problem and it is not expected that you should remember the formula for this general result. You should always work from first principles with such problems. An interesting point to note is that, since there is no horizontal acceleration, if you were to drop a projectile from the top of the cliff vertically down at the moment that the other projectile is fired horizontally, then both would reach the ground at the same time. This is illustrated by the copy of a multiflash photograph, as shown in Figure 903.

Figure 903 Using multi-flash photography

This is irrespective of the speed with which the particle is fired horizontally. The greater the horizontal speed, the further this projectile will travel from the base of the cliff. It is also possible to show that the path of the particle is parabolic. To find the velocity with which the particle strikes the ground we must remember that velocity is a vector quantity. So, using Pythagoras' theorem at the point of impact (to take into account both the vertical component of velocity and the horizontal component of velocity) we have that the velocity has a magnitude of

$$V = \sqrt{v_v^2 + v_h^2}$$

and a direction given by

$$\theta = \arctan\left(\frac{v_h}{v_v}\right)$$

where, with the components in this order, $\theta$ is the angle between the velocity and the vertical.
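The results for a horizontally launched projectile can be collected in a short sketch. The function name and the sample numbers (a 25 m s$^{-1}$ launch from an 80 m cliff with $g$ = 10 m s$^{-2}$) are chosen for illustration.

```python
import math

def horizontal_launch(speed_h, height, g=10.0):
    """Return (time of flight, horizontal distance, impact speed) for a
    projectile fired horizontally from a given height above level ground."""
    time_of_flight = math.sqrt(2.0 * height / g)  # t = sqrt(2h/g)
    distance = speed_h * time_of_flight           # d = v_h * t
    speed_v = g * time_of_flight                  # v_v = g*t = sqrt(2gh)
    impact_speed = math.hypot(speed_h, speed_v)   # V = sqrt(v_v^2 + v_h^2)
    return time_of_flight, distance, impact_speed

t, d, v = horizontal_launch(speed_h=25.0, height=80.0)
print(f"t = {t:.2f} s, d = {d:.1f} m, V = {v:.1f} m/s")
```

Note that the time of flight depends only on the height and $g$, so changing `speed_h` alters the distance and impact speed but not `t`.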
Notice that at impact the velocity vector is tangential to the path of motion. The velocity vector is always tangential to the path of motion and is made up of the horizontal and vertical components of the velocities of the object.

Figure 905 Projectile launched at an angle

The vertical component of the velocity, $v_v$, is

$$v_v = v \sin\theta$$

The horizontal component of the velocity, $v_h$, is

$$v_h = v \cos\theta$$

As in the case of the projectile launched horizontally, there is no acceleration in the horizontal direction and the acceleration in the vertical direction is $g$. If we refer the motion of the projectile to a Cartesian coordinate system, then after a time $t$, the horizontal distance travelled will be given by

$$x = v_h t = (v \cos\theta)\,t$$

and the vertical distance can be found by using the equation

$$s = ut + \tfrac{1}{2}at^2$$

so that

$$y = v_v t + \tfrac{1}{2}(-g)t^2 \Rightarrow y = (v \sin\theta)\,t - \tfrac{1}{2}gt^2$$

If we substitute

$$t = \frac{x}{v \cos\theta}$$

into this equation we get

$$y = (v \sin\theta)\frac{x}{v \cos\theta} - \frac{1}{2}g\left(\frac{x}{v \cos\theta}\right)^2 = x\,\frac{\sin\theta}{\cos\theta} - \frac{1}{2}g\left(\frac{x}{v}\right)^2\frac{1}{\cos^2\theta} = x \tan\theta - \frac{1}{2}g\left(\frac{x}{v}\right)^2\sec^2\theta$$

This is the general equation of the motion of the projectile that relates the vertical and horizontal distances. This equation is plotted below for a projectile that is launched with an initial speed of 20 m s$^{-1}$ at 60° to the horizontal. The path followed by the projectile is a parabola.

[Figure 906 The parabolic path: launch speed 20 m s$^{-1}$ at 60° to the horizontal, with components 20 cos 60° = 10 m s$^{-1}$ and 20 sin 60° = 17.32 m s$^{-1}$]

The maximum height $H$ that the projectile reaches can be found from the equation

$$v^2 = u^2 + 2as$$

where $u$ is the initial vertical component of the velocity and $v$ the final (vertical component) of the velocity at the highest point, where the vertical component is zero. So that,

$$0 = (v \sin\theta)^2 + 2(-g)H \Rightarrow H = \frac{v^2 \sin^2\theta}{2g}$$
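The trajectory equation and the maximum height can be evaluated numerically. This is a minimal sketch using the 20 m s$^{-1}$, 60° launch from the text and $g$ = 10 m s$^{-2}$; the function names are illustrative.

```python
import math

def trajectory_height(x, speed, angle_deg, g=10.0):
    """y = x*tan(theta) - (1/2)*g*(x/v)^2 * sec^2(theta)."""
    theta = math.radians(angle_deg)
    return x * math.tan(theta) - 0.5 * g * (x / speed) ** 2 / math.cos(theta) ** 2

def max_height(speed, angle_deg, g=10.0):
    """H = v^2 * sin^2(theta) / (2g)."""
    theta = math.radians(angle_deg)
    return (speed * math.sin(theta)) ** 2 / (2.0 * g)

# Tabulate the parabolic path for the launch in the text
for x in (0.0, 10.0, 20.0, 30.0):
    print(f"x = {x:5.1f} m, y = {trajectory_height(x, 20.0, 60.0):6.2f} m")
print(f"H = {max_height(20.0, 60.0):.1f} m")
```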
The time $T$ to reach the maximum height is found using $v = u + at$, such that $v = 0$, $u = v \sin\theta$ and $a = -g$, to give

$$0 = v \sin\theta - gT \Rightarrow gT = v \sin\theta$$

Hence,

$$T = \frac{v \sin\theta}{g}$$

For the example above the value of $T$ is 1.73 s. This means (using symmetry) that the projectile will strike the ground 3.46 s after the launch. The horizontal range $R$ is given by $R = (v \cos\theta) \times 2T$, which for the example gives $R$ = 34.6 m. (We could also find the time for the projectile to strike the ground by putting $y = 0$ in the equation $y = (v \sin\theta)\,t - \frac{1}{2}gt^2$.)

Although we have established a general solution, when solving projectile problems, remember that the horizontal velocity does not change and that when using the equations of uniform motion you must use the component values of the respective velocities. Do not try to remember the formulae.

Experiment shows that both the horizontal and vertical drag forces depend on the speed of the projectile. The effect of the horizontal drag will be to foreshorten the range of the projectile and the effect of the vertical drag will be to reduce the maximum height reached by the projectile. However, the presence of air resistance also means that the mass of the projectile will now affect the path followed by the projectile. In the absence of air resistance there is no acceleration in the horizontal direction and the acceleration in the vertical direction is $g$, the acceleration of free fall. With air resistance present, to find the horizontal ($a_H$) and vertical ($a_V$) accelerations we have to apply Newton's second law to both directions. If we let the horizontal drag equal $kv_H$ and the vertical drag equal $Kv_V$, where $k$ and $K$ are constants and $v_H$ and $v_V$ are the horizontal and vertical speeds respectively at any instant, then Newton's second law can be written separately for the horizontal direction (drag only) and the vertical direction (drag and weight).

Example

A particle is fired horizontally with a speed of 25 m s$^{-1}$ from the top of a vertical cliff of height 80 m. Determine
(a) the time of flight
(b) the distance from the base of the cliff where it strikes the ground
(c) the velocity with which it strikes the ground
MOTION IN FIELDS
The magnitude of this velocity is
Solution
40 + 25 = 47 m s1
Horizontal: u = 25 m s -1 a =0
+ve
(a)
The vertical velocity with which it strikes the ground can be found using the equation
2gh =
2 10 80
vv 40 --- = 4. t = --- = 10 g
That is, 4 seconds. (b) The distance travelled from the base of the cliff using
2 s = ut + 1 -- at , with u = 25, 2
a = 0 and t = 4 is given by
s = 25 4 = 100.
That is, the range is 100 m. (c) The velocity with which it strikes the ground is given by the resultant of the vertical and horizontal velocities as shown.
Ground level
v h = 25
Figure 909
Energy problem
v v = 40
V
253
19 april 09 Physics Ch 9 final.i253 253 22/05/2009 11:56:13 AM
AHL
CHAPTER 9
Example

A ball is projected at 50 m s⁻¹ at an angle of 40° above the horizontal. The ball is released 2.00 m above ground level. Taking g = 10 m s⁻², determine
(a) the maximum height reached by the ball
(b) the speed of the ball as it hits the ground

Solution

[Diagram (Figure 909): the ball leaves A, 2 m above the ground, at 50 m s⁻¹ and 40° to the horizontal, reaches its maximum height H at B and lands at C. R = range.]

(a) The total energy at A is given by

$$E_k + E_p = \tfrac{1}{2}m(50.0)^2 + mg \times 2.00 = 1250m + 20m = 1270m$$

Next, to find the total energy at B we need to first determine the speed at B, which is given by the horizontal component of the speed at A.

Horizontal component: 50.0 cos 40° = 38.3 m s⁻¹.

Therefore, we have that

$$E_k + E_p = \tfrac{1}{2}m(38.3)^2 + mgH = 733.53m + 10mH$$

Equating, we have

$$1270m = 733.53m + 10mH$$
$$1270 = 733.53 + 10H$$
$$H = 53.6$$

That is, the maximum height reached is 53.6 m.

(b) At C, the total energy is given by

$$E_k + E_p = \tfrac{1}{2}mv_C^2 + mg \times 0 = \tfrac{1}{2}mv_C^2$$

Since the total energy is conserved,

$$1270m = \tfrac{1}{2}mv_C^2 \Rightarrow v_C^2 = 2540$$
$$v_C = \sqrt{2540} = 50.4$$

That is, the ball hits the ground with a speed of 50.4 m s⁻¹.

Exercise 9.1

1. A projectile is fired from the edge of a vertical cliff with a speed of 30 m s⁻¹ at an angle of 30° to the horizontal. The height of the cliff above the surface of the sea is 100 m.

(a) If g = 10 m s⁻² and air resistance is ignored, show that at any time t after the launch the vertical displacement y of the projectile, as measured from the top of the cliff, is given by

$$y = 15t - 5t^2$$

Hence show that the projectile will hit the surface of the sea about 6 s after it is launched.

(b) Suggest the significance of the negative value of t that can be obtained in solving the equation.

(c) Determine the maximum height reached by the projectile and the horizontal distance to where it strikes the sea as measured from the base of the cliff.
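The energy method used in the worked example for the ball projected at 50 m s⁻¹ can be sketched in a few lines. This is an illustrative check only; the function name `energy_method` is hypothetical, and the mass is divided out because it cancels from every term.

```python
import math

def energy_method(u, angle_deg, h0, g=10.0):
    """Return (maximum height above ground, speed at ground) using energy conservation.

    u: launch speed, angle_deg: launch angle above the horizontal,
    h0: launch height above the ground. All energies are per unit mass.
    """
    e_total = 0.5 * u**2 + g * h0                   # E_k + E_p at the launch point A
    v_top = u * math.cos(math.radians(angle_deg))   # only the horizontal component remains at B
    h_max = (e_total - 0.5 * v_top**2) / g          # from e_total = 0.5 v_top^2 + g H
    v_ground = math.sqrt(2 * e_total)               # at C all the energy is kinetic
    return h_max, v_ground

h_max, v_ground = energy_method(50.0, 40.0, 2.00)
print(round(h_max, 1), round(v_ground, 1))
```

Because the speed at the ground depends only on the total energy, it is the same whatever the launch angle.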
9.2 GRAVITATIONAL FIELD, POTENTIAL AND ENERGY

9.2.1 Define gravitational potential and gravitational potential energy.
9.2.2 State and apply the expression for gravitational potential due to a point mass.
9.2.3 State and apply the formula relating gravitational field strength to gravitational potential gradient.
© IBO 2007

[Diagram: a particle of mass m at A, a distance r from the centre of the Earth (mass $M_e$), is moved to B, a distance r + δr from the centre. The field g points towards the Earth.]

Figure 911 Gravitational forces

In the diagram we consider the work necessary to move the particle of mass m a distance δr in the gravitational field of the Earth.

If the particle is moved to B, then since δr is very small, we can assume that the field remains constant over the distance AB. The work δW done against the gravitational field of the Earth in moving the distance AB is

$$\delta W = -\frac{GM_e m}{r^2}\,\delta r$$

(remember that work done against a force is negative)

To find the total work done, W, in going from the surface of the Earth to infinity we have to add all these little bits of work. This is done mathematically by using integral calculus.

$$W = \int_R^\infty -\frac{GM_e m}{r^2}\,dr = -GM_e m\left[-\frac{1}{r}\right]_R^\infty = -GM_e m\left[0 - \left(-\frac{1}{R}\right)\right] = -\frac{GM_e m}{R}$$

Hence we have, where R is the radius of the Earth, that the work done by the gravitational field in moving an object of mass m from R (surface of the Earth) to infinity, is given by

$$W = -\frac{GM_e m}{R}$$

We can generalise the result by calculating the work necessary per unit mass to take a small mass from the surface of the Earth to infinity. This we call the gravitational potential, V, i.e.,

$$V = \frac{W}{m}$$

We would get exactly the same result if we calculated the work done by the field to bring the point mass from infinity to the surface of Earth. In this respect the formal definition of gravitational potential at a point in a gravitational field is the work done per unit mass in bringing a point mass from infinity to that point.

Clearly then, the gravitational potential at any point in the Earth's field a distance r from the centre of the Earth (providing r > R) is

$$V = -\frac{GM_e}{r}$$
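The statement that the total work is found by adding all the little bits of work can be illustrated numerically. The sketch below is an illustration under stated assumptions: it takes $GM_e = g_0R_e^2$ with $g_0 = 10$ N kg⁻¹ and $R_e = 6.4 \times 10^6$ m (values used elsewhere in this chapter), works per unit mass, and replaces infinity by a distance of one million Earth radii. The function name `work_per_unit_mass` is hypothetical.

```python
def work_per_unit_mass(gm, r_start, r_end, steps=100_000):
    """Add up the small pieces of work (gm / r^2) * dr needed to carry unit mass
    outwards from r_start to r_end, using geometrically spaced steps."""
    ratio = (r_end / r_start) ** (1.0 / steps)
    total = 0.0
    r = r_start
    for _ in range(steps):
        r_next = r * ratio
        r_mid = 0.5 * (r + r_next)              # field treated as constant over the small step
        total += gm / r_mid**2 * (r_next - r)   # force per unit mass times distance
        r = r_next
    return total

g0 = 10.0            # N/kg, assumed surface field strength
R_e = 6.4e6          # m, assumed radius of the Earth
GM = g0 * R_e**2     # since g0 = GM / R_e^2

w = work_per_unit_mass(GM, R_e, 1.0e6 * R_e)
V_surface = -GM / R_e    # potential at the surface from V = -GM/r
print(w, V_surface)
```

The summed work per unit mass agrees in magnitude with $GM_e/R$, so the potential at the surface is about $-6.4 \times 10^7$ J kg⁻¹ under these assumed values.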
The potential is therefore a measure of the amount of work that has to be done to move particles between points in a gravitational field and its unit is the J kg⁻¹. We also note that the potential is negative so that the potential energy as we move away from the Earth's surface increases until it reaches the value of zero at infinity.

If the gravitational field is due to a point mass of mass m, then we have the same expression as above except that $M_e$ is replaced by m, and we must also exclude the value of the potential at the point mass itself, i.e. at r = 0.

We can express the gravitational potential due to the Earth (or due to any spherical mass) in terms of the gravitational field strength at its surface. At the surface of the Earth we have

$$-g_0R_e = -\frac{GM_e}{R_e}$$

So that,

$$g_0R_e^2 = GM_e$$

Hence at a distance r from the centre of the Earth the gravitational potential V can be written as

$$V = -\frac{GM_e}{r} = -\frac{g_0R_e^2}{r}$$

The potential at the surface of the Earth (r = $R_e$) is therefore $-g_0R_e$.

It is interesting to see how the expression for the gravitational potential ties in with the expression mgh. The potential at the surface of the Earth is $-g_0R_e$ (see above) and at a height h will be approximately $-g_0(R_e - h)$ if we assume that $g_0$ does not change over the distance h, that is, if h is much smaller than $R_e$. The difference in potential between the surface and the height h is therefore $g_0h$. So the work needed to raise an object of mass m to a height h is mgh, i.e., m × difference in gravitational potential.

This we have referred to as the gain in gravitational potential energy (see 2.3.5). However, this expression can be extended to any two points in any gravitational field such that if an object of mass m moves between two points whose potentials are $V_1$ and $V_2$ respectively, then the change in gravitational potential energy of the object is $m(V_1 - V_2)$.

Figure 912 The gravitational potential gradient

The gravitational field is of strength I and is in the direction shown. The gravitational potential at A is V and at B is V + ΔV. The work done in taking a point mass m from A to B is FΔx = mIΔx. However, by definition this work is also equal to −mΔV. Therefore

$$mI\Delta x = -m\Delta V$$

or

$$I = -\frac{\Delta V}{\Delta x}$$

Effectively this says that the magnitude of the gravitational field strength is equal to the negative gradient of the potential. If I is constant then V is a linear function of x and I is equal to the negative gradient of the straight line graph formed by plotting V against x. If I is not constant (as is usually the case), then the magnitude of I at any point in the field can be found by finding the gradient of the V-x graph at that point. An example of such a calculation can be found in Section 9.2.9.

For those of you who do HL maths the relationship between field and potential is seen to follow from the expression for the potential of a point mass viz:

$$V = -G\frac{m}{r}$$

$$\frac{dV}{dr} = +G\frac{m}{r^2}$$

so that $I = -\frac{dV}{dr}$ has magnitude $G\frac{m}{r^2}$ and is directed towards the mass.
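The link between field strength and potential gradient can also be seen numerically. The following sketch is illustrative only: it assumes $GM_e = g_0R_e^2$ with $g_0 = 10$ N kg⁻¹ and $R_e = 6.4 \times 10^6$ m, and the function names are hypothetical.

```python
def potential(gm, r):
    """Gravitational potential V = -GM/r at distance r from the centre."""
    return -gm / r

def field_from_gradient(gm, r, dr=100.0):
    """Estimate I = -dV/dr by a central difference over a small distance dr.
    A negative value means the field points towards the mass."""
    return -(potential(gm, r + dr) - potential(gm, r - dr)) / (2.0 * dr)

g0 = 10.0          # N/kg, assumed surface field strength
R_e = 6.4e6        # m, assumed radius of the Earth
GM = g0 * R_e**2

field_at_surface = field_from_gradient(GM, R_e)
# Potential difference between the surface and a height h; compare with g0 * h
h = 100.0
delta_V = potential(GM, R_e + h) - potential(GM, R_e)
print(field_at_surface, delta_V)
```

The gradient of the potential at the surface has magnitude $g_0$, and the potential difference over a small height h is very close to $g_0h$, which is the mgh result per unit mass.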
9.2.4 Determine the potential due to one or more point masses.
9.2.5 Describe and sketch the pattern of equipotential surfaces due to one and two point masses.
9.2.6 State the relation between equipotential surfaces and gravitational field lines.
9.2.7 Explain the concept of escape speed from a planet.
9.2.8 Derive an expression for the escape speed of an object from the surface of a planet.
© IBO 2007

Figure 913 shows the field lines and equipotentials for two point masses m.

Figure 913 Equipotentials for two point masses

It is worth noting that we would get exactly the same pattern if we were to replace the point masses with two equal point charges. (See 9.3.5)

$$V = -G\left(\frac{M_E}{x} + \frac{M_M}{r - x}\right)$$

where $M_E$ = mass of Earth, $M_M$ = mass of Moon, r = distance between centre of Earth and Moon, and x = distance of the point from the centre of the Earth.
But what does it actually mean to take something to infinity? When the particle is on the surface of the Earth we can think of it as sitting at the bottom of a potential well as in Figure 914.

[Diagram: the particle sits at the surface of the Earth at the bottom of a well of depth $\frac{GM}{R}$; the top of the well corresponds to infinity.]

Figure 914 A potential well

The depth of the well is $\frac{GM}{R}$ and if the particle gains an amount of kinetic energy equal to $\frac{GMm}{R}$, where m is its mass, then it will have just enough energy to lift it out of the well. In reality it doesn't actually go to infinity; it just means that the particle is effectively free of the gravitational attraction of the Earth. We say that it has escaped the Earth's gravitational pull. We meet this idea in connection with molecular forces. Two molecules in a solid will sit at their
equilibrium position, the separation where the repulsive force is equal to the attractive force. If we supply just enough energy to increase the separation of the molecules such that they are an infinite distance apart then the molecules are no longer affected by intermolecular forces and the solid will have become a liquid. There is no increase in the kinetic energy of the molecules and so the solid melts at constant temperature.

We can calculate the escape speed of an object very easily by equating the kinetic energy to the potential energy such that

$$\tfrac{1}{2}mv_{escape}^2 = \frac{GM_e m}{R_e}$$

$$v_{escape} = \sqrt{\frac{2GM_e}{R_e}} = \sqrt{2g_0R_e}$$

Substituting for $g_0$ and $R_e$ gives a value for $v_{escape}$ of about 11 km s⁻¹ from the surface of the Earth. You will note that the escape speed does not depend on the mass of the object since both kinetic energy and potential energy are proportional to the mass.

In theory, if you want to get a rocket to the Moon it can be done without reaching the escape speed. However, this would necessitate an enormous amount of fuel and it is likely that the rocket plus fuel would be so heavy that it would never get off the ground. It is much more practical to accelerate the rocket to the escape speed from Earth orbit and then, in theory, just launch it to the Moon.

Example

Use the following data to determine the potential at the surface of Mars and the magnitude of the acceleration of free fall.

mass of Mars = 6.4 × 10²³ kg
radius of Mars = 3.4 × 10⁶ m

Determine also the gravitational field strength at a distance of 6.8 × 10⁶ m above the surface of Mars.

Solution

$$V = -G\frac{M}{R} = -6.7 \times 10^{-11} \times \frac{6.4 \times 10^{23}}{3.4 \times 10^6} = -1.3 \times 10^7\ \mathrm{J\,kg^{-1}}$$

The acceleration of free fall at the surface is

$$g_0 = G\frac{M}{R^2} = \frac{6.7 \times 10^{-11} \times 6.4 \times 10^{23}}{(3.4 \times 10^6)^2} = 3.7\ \mathrm{m\,s^{-2}}$$

To determine the field strength $g_h$ at 6.8 × 10⁶ m above the surface, we use the fact that $g_0 = G\frac{M}{R^2}$ such that $GM = g_0R^2$. Therefore

$$g_h = \frac{GM}{R_h^2} = \frac{g_0R^2}{R_h^2} = \frac{g_0}{9} = 0.41\ \mathrm{N\,kg^{-1}}$$

(the distance from the centre is $R_h$ = 3.4 × 10⁶ + 6.8 × 10⁶ = 10.2 × 10⁶ m, which is 3R)

Exercise 9.2

1. The graph below shows how the gravitational potential outside of the Earth varies with distance from the centre.

[Graph: gravitational potential V / J kg⁻¹ (vertical axis) against distance r / m × 10⁶ (horizontal axis, 0 to 70).]

(a) Use the graph to determine the gain in gravitational potential energy of a satellite of mass 200 kg as it moves from the surface of the Earth to a height of 3.0 × 10⁷ m above the Earth's surface.
(b) Calculate the energy required to take it to infinity.
(c) Determine the slope of the graph at the surface of the Earth. Comment on your answer.
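The escape speed expression and the Mars worked example earlier in this section can be checked with a short calculation. This is a sketch only; the function names are hypothetical, G = 6.7 × 10⁻¹¹ N m² kg⁻² is the rounded value used in the worked example, and $g_0$ = 10 N kg⁻¹ with $R_e$ = 6.4 × 10⁶ m are assumed for the Earth.

```python
import math

G = 6.7e-11  # N m^2 kg^-2, rounded value used in the worked example

def escape_speed(g0, radius):
    """Escape speed from the surface, v = sqrt(2 * g0 * R)."""
    return math.sqrt(2.0 * g0 * radius)

def surface_potential(mass, radius):
    """Gravitational potential at the surface, V = -GM/R."""
    return -G * mass / radius

def field_strength(mass, r):
    """Magnitude of the gravitational field strength at distance r from the centre."""
    return G * mass / r**2

# Earth: assumed g0 = 10 N/kg and R_e = 6.4e6 m
v_earth = escape_speed(10.0, 6.4e6)

# Mars: data from the worked example
M_mars, R_mars = 6.4e23, 3.4e6
V_mars = surface_potential(M_mars, R_mars)
g0_mars = field_strength(M_mars, R_mars)
g_h = field_strength(M_mars, R_mars + 6.8e6)

print(v_earth, V_mars, g0_mars, g_h)
```

Since the point 6.8 × 10⁶ m above the surface is three Mars radii from the centre, the field strength there is one ninth of the surface value, as the inverse square law requires.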
[Diagram: two positive particles BEFORE release and AFTER release, moving apart with velocities $v_1$ and $v_2$.]

Figure 916 Interaction of two positive particles

The electric potential energy between two point charges can be found by simply adding up the energy associated with each pair of point charges. For a pair of interacting charges, the electric potential energy is given by:

$$\Delta U = \Delta E_p + \Delta E_k = \Delta W = Fr = \frac{kqQ}{r^2} \times r = \frac{kqQ}{r}$$

Because no external force is acting on the system, the energy and momentum must be conserved. Initially, $E_k$ = 0 and $E_p$ = kqQ / r = 9 × 10⁹ × 1 × 10⁻¹² ÷ 0.1 m = 0.09 J. When they are a great distance from each other, $E_p$ will be negligible. The final energy will be equal to ½mv₁² + ½mv₂² = 0.09 J. Momentum is also conserved and the velocities would be the same magnitude but in opposite directions.

Electric potential energy is more often defined in terms of a point charge moving in an electric field as:

the electric potential energy between any two points in an electric field is defined as the negative of the work done by an electric field in moving a point electric charge between two locations in the electric field.

$$\Delta U = \Delta E_p = -W = -Fd = qEx$$
where x is the distance moved along (or opposite to) the direction of the electric field. Electric potential energy is measured in joule (J). Just as work is a scalar quantity, so too electrical potential energy is a scalar quantity. The negative of the work done by an electric field in moving a unit electric charge between two points is independent of the path taken. In physics, we say the electric field is a conservative field. Suppose an external force such as your hand moves a small positive point test charge in the direction of a uniform electric field. As it is moving it must be gaining kinetic energy. If this occurs, then the electric potential energy of the unit charge is changing.
In Figure 917 a point charge +q is moved between points A and B through a distance x in a uniform electric field.

[Diagram: a charge +q moved from A to B through a distance x in a uniform field.]

Figure 917 Movement of a positive point charge in a uniform field

In order to move a positive point charge from point A to point B, an external force must be applied to the charge equal to qE (F = qE). Since the force is applied through a distance x, negative work is done by the field as the charge is moved because energy is gained, meaning there is an increase in electric potential energy between the two points. Remember that the work done is equivalent to the energy gained or lost in moving the charge through the electric field. The concept of electric potential energy is only meaningful as the electric field which generates the force in question is conservative.

$$W = F \times x = Eq \times x$$

[Diagram: a displacement x at an angle θ to the field, with component x cos θ parallel to the field.]

Figure 918 Charge moved at an angle to the field

If a charge moves at an angle θ to an electric field, the component of the displacement parallel to the electric field is used as shown in Figure 918.

$$W = F \times x = Eq \times x\cos\theta$$

The electric potential energy is stored in the electric field, and the electric field will return the energy to the point charge when required so as not to violate the Law of conservation of energy.

Electric potential

The electric potential at a point in an electric field is defined as being the work done per unit charge in bringing a small positive point charge from infinity to that point.

$$\Delta V = V_\infty - V_f = \frac{W}{q}$$

If we designate the potential energy to be zero at infinity then it follows that electric potential must also be zero at infinity and the electric potential at any point in an electric field will be:

$$V = \frac{W}{q}$$

Now suppose we apply an external force to a small positive test charge as it is moved towards an isolated positive charge. The external force must do work on the positive test charge to move it towards the isolated positive charge and the work must be positive while the work done by the electric field must therefore be negative. So the electric potential at that point must be positive according to the above equation. If a negative isolated charge is used, the electric potential at the position of the positive test charge would be negative. Positive point charges, of their own accord, move from a place of high electric potential to a place of low electric potential. Negative point charges move the other way, from low potential to high potential. In moving from point A to point B in the diagram, the positive charge +q is moving from a low electric potential to a high electric potential.

In the definition given, the term work per unit charge has significance. If the test charge is +1.6 × 10⁻¹⁹ C where the charge has a potential energy of 3.2 × 10⁻¹⁷ J, then the potential would be 3.2 × 10⁻¹⁷ J ÷ +1.6 × 10⁻¹⁹ C = 200 J C⁻¹. Now if the charge was doubled, the potential energy would become 6.4 × 10⁻¹⁷ J. However, the potential energy per unit charge, that is the potential, would be the same.

Electric potential is a scalar quantity and it has units J C⁻¹ or volts, where 1 volt equals one joule per coulomb. The volt allows us to adopt a unit for the electric field in terms of the volt. Previously, the unit for the electric field was N C⁻¹.

$$W = qV \text{ and } F = qE, \text{ so } \frac{W}{V} = \frac{F}{E}$$

$$E = \frac{FV}{W} = \frac{F\,\mathrm{V}}{F\,\mathrm{m}} = \mathrm{V\,m^{-1}}$$

That is, the units of the electric field, E, can also be expressed as V m⁻¹.
The work done per unit charge in moving a point charge between two points in an electric field is again independent of the path taken.

For a point charge q, the electric potential at a distance r from the charge is

$$V = \frac{q}{4\pi\varepsilon_0 r}$$

Or, simply

$$V = k\frac{q}{r}$$

Example

Determine how much work is done by the electric field of point charge 15.0 µC when a charge of 2.00 µC is moved from infinity to a point 0.400 m from the point charge. (Assume no acceleration of the charges).

Solution

The work done by the electric field is

$$W = -q\,\Delta V = -\frac{1}{4\pi\varepsilon_0}\,q\left(\frac{Q}{r_{0.400}} - \frac{Q}{r_\infty}\right)$$

W = −(2.00 × 10⁻⁶ C × 9.00 × 10⁹ N m² C⁻² × 15.0 × 10⁻⁶ C) ÷ 0.400 m = −0.675 J

An external force would have to do +0.675 J of work.

The force F and the electric field E are oppositely directed, and we know that:

F = −qE and W = qΔV

Therefore, the work done can be given as:

qΔV = −qEΔx

Therefore

$$E = -\frac{\Delta V}{\Delta x}$$

The rate of change of potential ΔV at a point with respect to distance Δx in the direction in which the change is maximum is called the potential gradient. We say that the electric field = − the potential gradient and the units are V m⁻¹. From the equation we can see that in a graph of electric potential versus distance, the gradient of the straight line equals the electric field strength.

In reality, if a charged particle enters a uniform electric field, it will be accelerated uniformly by the field and its kinetic energy will increase. This is why we had to assume no acceleration in the last worked example.

$$E_k = \tfrac{1}{2}mv^2 = qEx = q\frac{V}{x}x = qV$$

Example

Determine how far apart two parallel plates must be situated so that a potential difference of 1.50 × 10² V produces an electric field strength of 1.00 × 10³ N C⁻¹.
Solution

Using $E = \frac{\Delta V}{\Delta x}$,

$$\Delta x = \frac{\Delta V}{E} = \frac{1.50 \times 10^2\ \mathrm{V}}{1.00 \times 10^3\ \mathrm{N\,C^{-1}}} = 1.50 \times 10^{-1}$$

The plates are 1.50 × 10⁻¹ m apart.

The electric field and the electric potential at a point due to an evenly distributed charge +q on a sphere can be represented graphically as in Figure 919.

[Graphs: electric field E against r and electric potential V against r for a charged sphere of radius $r_0$.]

Figure 919 Electric field and potential due to a charged sphere

When the sphere becomes charged, we know that the charge distributes itself evenly over the surface. Therefore every part of the material of the conductor is at the same potential. As the electric potential at a point is defined as being numerically equal to the work done in bringing a unit positive charge from infinity to that point, it has a constant value in every part of the material of the conductor. Since the potential is the same at all points on the conducting surface, then ΔV / Δx is zero. But E = −ΔV / Δx. Therefore, the electric field inside the conductor is zero. There is no electric field inside the conductor.

Some further observations of the graphs in Figure 919 are:

• Outside the sphere, the graphs obey the relationships given as E ∝ 1 / r² and V ∝ 1 / r.
• At the surface, r = $r_0$. Therefore, the electric field and potential have the minimum value for r at this point and this implies a maximum field and potential.
• Inside the sphere, the electric field is zero.
• Inside the sphere, no work is done to move a charge from a point inside to the surface. Therefore, there is no potential difference and the potential is the same as it is when r = $r_0$.

Similar graphs can be drawn for the electric field intensity and the electric potential as a function of distance from conducting parallel plates and surfaces, and these are given in Figure 920.

[Plots: potential against x and E field against x for charged surfaces.]

Figure 920 Electric field and electric potential at a distance from a charged surface
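The two worked examples above (the work done by the field of a point charge, and the separation of parallel plates) use V = kq / r and E = ΔV / Δx. A minimal sketch of both calculations follows; the function names are hypothetical and k = 9.00 × 10⁹ N m² C⁻² is taken from the worked solution.

```python
K = 9.00e9  # N m^2 C^-2, Coulomb constant as used in the worked solution

def potential_point_charge(q, r):
    """Electric potential V = kq/r at distance r from a point charge q."""
    return K * q / r

def work_by_field_from_infinity(source_charge, moved_charge, r):
    """Work done BY the field when moved_charge is brought from infinity
    (where V = 0) to distance r from source_charge: W = -q * V."""
    return -moved_charge * potential_point_charge(source_charge, r)

def plate_separation(potential_difference, field_strength):
    """Separation of parallel plates from E = (delta V) / (delta x)."""
    return potential_difference / field_strength

w_field = work_by_field_from_infinity(15.0e-6, 2.00e-6, 0.400)
gap = plate_separation(1.50e2, 1.00e3)
print(w_field, gap)
```

The work done by the field is negative because two like charges are being pushed together; an external force does the corresponding positive work.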
Example

Determine the electric potential at a point 2.0 × 10⁻¹ m from the centre of an isolated conducting sphere with a point charge of 4.0 pC in air.

Solution

Using the formula V = kq / r, we have

$$V = \frac{9.0 \times 10^9 \times 4.0 \times 10^{-12}}{2.0 \times 10^{-1}} = 0.18\ \mathrm{V}$$

Example

Three point charges are placed at the vertices of a right-angled triangle as shown in the diagram below. Determine the absolute potential at the +2.0 µC charge, due to the two other charges.

[Diagram: right-angled triangle with the −6 µC charge 4 m from the +3 µC charge, and the +3 µC charge 3 m from the +2 µC charge. The right angle is at the +3 µC charge.]

Solution

The electric potential of the +2 µC charge due to the −6 µC charge is:

V = (9 × 10⁹ N m² C⁻² × −6 × 10⁻⁶ C) ÷ (√(3² + 4²)) m = −1.08 × 10⁴ V

The electric potential of the +2 µC charge due to the +3 µC charge is:

V = (9 × 10⁹ N m² C⁻² × 3 × 10⁻⁶ C) ÷ 3 m = 9 × 10³ V

The net absolute potential is the sum of the 2 potentials

−1.08 × 10⁴ V + 9 × 10³ V = −1.8 × 10³ V

The absolute potential at the point is −1.8 × 10³ V.

Regions in space where the electric potential of a charge distribution has a constant value are called equipotentials. The places where the potential is constant in three dimensions are called equipotential surfaces, and where they are constant in two dimensions they are called equipotential lines.

They are in some ways analogous to the contour lines on topographic maps. In this case, the gravitational potential energy is constant as a mass moves around the contour lines because the mass remains at the same elevation above the Earth's surface. The gravitational field strength acts in a direction perpendicular to a contour line.

Similarly, because the electric potential on an equipotential line has the same value, no work can be done by an electric force when a test charge moves on an equipotential. Therefore, the electric field cannot have a component along an equipotential, and thus it must be everywhere perpendicular to the equipotential surface or equipotential line. This fact makes it easy to plot equipotentials if the lines of force or lines of electric flux of an electric field are known.

For example, there are a series of equipotential lines between two parallel plate conductors that are perpendicular to the electric field. There will be a series of concentric circles (each circle further apart than the previous one) that map out the equipotentials around an
isolated positive sphere. The lines of force and some equipotential lines for an isolated positive sphere are shown in Figure 922.

Figure 922 Equipotentials around an isolated positive sphere

In summary, we can conclude that

• No work is done to move a charge along an equipotential.
• Equipotentials are always perpendicular to the electric lines of force.

Figures 923 and 924 show some equipotential lines for two oppositely charged and two identically positive spheres separated by a distance.

[Diagram: equipotential lines between a positive (+ve) and a negative (−ve) charge.]

Figure 924 Equipotential lines between two charges which are the same

[Diagram: lines of equipotential at 50 V, 40 V, 30 V, 20 V and 10 V between charged parallel plates.]

Figure 925 Equipotential lines between charged parallel plates

Gravitational fields and electric fields

Throughout this chapter the similarities and differences between gravitational fields and electric fields have been discussed. The relationships that exist between gravitational and electric quantities and the effects of point masses and charges are summarised in Table 926.

| | Gravitational quantity | Electrical quantity |
|---|---|---|
| Quantities | $V = \frac{W}{m}$ | $V = \frac{W}{q}$ |
| | $g = \frac{F}{m}$ | $E = \frac{F}{q}$ |
| | $g = -\frac{\Delta V}{\Delta x}$ | $E = -\frac{\Delta V}{\Delta x}$ |
| Point masses and charges | $V = -G\frac{m}{r}$ | $V = \frac{1}{4\pi\varepsilon_0}\frac{q}{r}$ |
| | $g = -G\frac{m}{r^2}$ | $E = \frac{1}{4\pi\varepsilon_0}\frac{q}{r^2}$ |
| | $F = G\frac{m_1m_2}{r^2}$ | $F = \frac{1}{4\pi\varepsilon_0}\frac{q_1q_2}{r^2}$ |

Table 926
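Because potential is a scalar, the point-charge expression V = kq / r can be applied to several charges simply by adding the contributions. The sketch below applies this to the three-charge example earlier in this section. The coordinates are an assumption made here to represent the triangle (the +3 µC charge at the origin, the −6 µC charge 4 m away along one axis and the +2 µC charge 3 m away along the other), and the function name `net_potential` is hypothetical.

```python
import math

K = 9e9  # N m^2 C^-2, Coulomb constant as used in the worked examples

def net_potential(charges, point):
    """Sum the scalar potentials kq/r at `point` due to a list of
    (charge, (x, y)) pairs. Positions are in metres, charges in coulombs."""
    return sum(K * q / math.dist(position, point) for q, position in charges)

# Assumed layout of the right-angled triangle (right angle at the +3 uC charge)
other_charges = [
    (-6e-6, (0.0, 4.0)),   # -6 uC, 5 m from the +2 uC charge
    (3e-6, (0.0, 0.0)),    # +3 uC, 3 m from the +2 uC charge
]
position_of_2uC = (3.0, 0.0)

V = net_potential(other_charges, position_of_2uC)
print(V)
```

No directions need to be resolved, unlike the corresponding field calculation, because potentials add as ordinary numbers.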
Example

Deduce the electric potential on the surface of a gold nucleus that has a radius of 6.2 fm.

Solution

Using the formula V = kq / r, and knowing the atomic number of gold is 79, we will assume the nucleus is spherical and that it behaves as if it were a point charge at its centre (relative to outside points).

V = 9.0 × 10⁹ N m² C⁻² × 79 × 1.6 × 10⁻¹⁹ C ÷ 6.2 × 10⁻¹⁵ m = 1.8 × 10⁷ V

The potential at the point is 18 MV.

Example

Deduce the ionisation energy in electron-volts of the electron in the hydrogen atom if the electron is in its ground state and it is in a circular orbit at a distance of 5.3 × 10⁻¹¹ m from the proton.

Solution

This problem is an energy, coulombic, circular motion question based on Bohr's model of the atom (not the accepted quantum mechanics model). The ionisation energy is the energy required to remove the electron from the ground state to infinity. The electron travels in a circular orbit and therefore has a centripetal acceleration. The ionisation energy will counteract the coulombic force and the movement of the electron will be in the opposite direction to the centripetal force.

Total energy = $E_k$ of the electron + $E_p$ due to the proton-electron interaction

F = kqQ / r² = mv² / r and as such mv² = kqQ / r.

Therefore, $E_k$ of the electron = ½mv² = ½ kqQ / r.

$E_p$ due to the proton-electron interaction = −kqQ / r.

Total energy = ½ kqQ / r + (−kqQ / r) = −½ kqQ / r

= −9.0 × 10⁹ N m² C⁻² × (1.6 × 10⁻¹⁹ C)² ÷ (2 × 5.3 × 10⁻¹¹ m) = −2.17 × 10⁻¹⁸ J

= −2.17 × 10⁻¹⁸ J ÷ 1.6 × 10⁻¹⁹ J eV⁻¹ = −13.6 eV.

The ionisation energy is 13.6 eV.

Exercise 9.3

1. A point charge P is placed midway between two identical negative charges. Which one of the following is correct with regards to electric field and electric potential at point P?

| | Electric field | Electric potential |
|---|---|---|
| A | non-zero | zero |
| B | zero | non-zero |
| C | non-zero | non-zero |
| D | zero | zero |

2. Two positive charged spheres are tied together in a vacuum somewhere in space where there are no external forces. A has a mass of 25 g and a charge of 2.0 µC and B has a mass of 15 g and a charge of 3.0 µC. The distance between them is 4.0 cm. They are then released as shown in the diagram.

[Diagram: spheres A and B BEFORE release, and AFTER release moving apart with velocities $v_1$ and $v_2$.]

(a) Determine their initial electric potential energy in the before situation.
(b) Determine the speed of sphere B after release.

3. The diagram below represents two equipotential lines separated by a distance of 5 cm in a uniform electric field.

[Diagram: a positively charged plate with equipotential lines at 40 V and 20 V, 5 cm apart.]

Determine the strength of the electric field.
4. This question is about the electric field due to a charged sphere and the motion of electrons in that field. The diagram below shows an isolated, metal sphere in a vacuum that carries a negative electric charge of 6.0 µC.

(a) On the diagram draw the conventional way to represent the electric field pattern due to the charged sphere and lines to represent three equipotential surfaces in the region outside the sphere.
(b) Explain how the lines representing the equipotential surfaces that you have sketched indicate that the strength of the electric field is decreasing with distance from the centre of the sphere.
(c) The electric field strength at the surface of the sphere and at points outside the sphere can be determined by assuming that the sphere acts as a point charge of magnitude 6.0 µC at its centre. The radius of the sphere is 2.5 × 10⁻² m. Deduce that the magnitude of the field strength at the surface of the sphere is 8.6 × 10⁷ V m⁻¹.

An electron is initially at rest on the surface of the sphere.

(d) (i) Describe the path followed by the electron as it leaves the surface of the sphere.
(ii) Calculate the initial acceleration of the electron.

5. Determine the amount of work that is done in moving a charge of 10.0 nC through a potential difference of 1.50 × 10² V.

6. Determine how far apart two parallel plates must be situated so that a potential difference of 2.50 × 10² V produces an electric field strength of 2.00 × 10³ N C⁻¹.

7. During a flash of lightning, the potential difference between a cloud and the ground was 1.2 × 10⁹ V and the amount of transferred charge was 32 C.
(a) Determine the change in energy of the transferred charge.
(b) If the energy released was all used to accelerate a 1 tonne car, deduce its final speed.
(c) If the energy released could be used to melt ice at 0 °C, deduce the amount of ice that could be melted.

8. The gap between two parallel plates is 1.0 × 10⁻³ m, and there is a potential difference of 1.0 × 10⁴ V between the plates. Calculate
i. the work done by an electron in moving from one plate to the other
ii. the speed with which the electron reaches the second plate if released from rest.
iii. the electric field intensity between the plates.

9. An electron in the electron gun of a picture tube is accelerated by a potential difference of 2.5 × 10³ V. Determine the kinetic energy gained by the electron in electron-volts.

10. Determine the electric potential 2.0 × 10⁻² m from a charge of −1.0 × 10⁻⁵ C.

11. Determine the electric potential at a point midway between a charge of 20 pC and another of +5 pC on the line joining their centres if the charges are 10 cm apart.

12. During a thunderstorm the electric potential difference between a cloud and the ground is 1.0 × 10⁹ V. Determine the magnitude of the change in electric potential energy of an electron that moves between these points in electron-volts.

13. A charge of 1.5 µC is placed in a uniform electric field of two oppositely charged parallel plates with a magnitude of 1.4 × 10³ N C⁻¹.
(a) Determine the work that must be done against the field to move the point charge a distance of 5.5 cm.
(b) Calculate the potential difference between the final and initial positions of the charge.
(c) Determine the potential difference between the plates if their separation distance is 15 cm.

14. Three identical 2.0 µC conducting spheres are placed at the corners of an equilateral triangle of sides 25 cm. The triangle has one apex C pointing up the page and 2 base angles A and B. Determine the absolute potential at B.
15. Suppose that when an electron moved from A to B in the diagram along an electric field line, the electric field does 3.6 × 10⁻¹⁹ J of work on it. Determine the differences in electric potential:
(a) $V_B - V_A$
(b) $V_C - V_A$
(c) $V_C - V_B$

[Diagram showing points A, B and C in the electric field.]

16. Determine the potential at point P that is located at the centre of the square as shown in the diagram below.

[Diagram: a square of side 1 m with charges −6 µC, 5 µC, +3 µC and +2 µC at its corners.]

9.4.1 State that gravitation provides the centripetal force for circular orbital motion.
9.4.2 Derive Kepler's third law.
9.4.3 Derive expressions for the kinetic energy, potential energy and total energy of an orbiting satellite.
9.4.4 Sketch graphs showing the variation with orbital radius of the kinetic energy, gravitational potential energy and total energy of a satellite.
9.4.5 Discuss the concept of weightlessness in orbital motion, in free fall and in deep space.
9.4.6 Solve problems involving orbital motion.
© IBO 2007

9.4.1 SATELLITES

The Moon orbits the Earth and in this sense it is often referred to as a satellite of the Earth. Before 1957 it was the only Earth satellite. However, in 1957 the Russians launched the first man-made satellite, Sputnik 1. Since this date many more satellites have been launched and there are now literally thousands of them orbiting the Earth. Some are used to monitor the weather, some are used to enable people to find their position on the surface of the Earth accurately, many are used in communications, and no doubt some are used to spy on other countries.

Figure 932 shows how, in principle, a satellite can be put into orbit. The person (whose size is greatly exaggerated with respect to Earth) standing on the surface of the Earth throws some stones. The greater the speed with which a stone is thrown the further it will land from her. The paths followed by the thrown stones are parabolas. By a stretch of the imagination we can visualise a situation in which a stone is thrown with such a speed that, because of the curvature of the Earth, it will not land on the surface of the Earth but go into orbit. (Path 4 on Figure 932).
1 2 3
E arth
Satellite orbit E arth
4
Figure 933 Getting a satellite into orbit
(This work of Kepler and Newtons synthesis of the work is an excellent example of the scientific method and makes for a good TOK discussion) In 1627 Johannes Kepler (1571-1630) published his laws of planetary motion. The laws are empirical in nature and were deduced from the observations of the Danish astronomer Tycho de Brahe (1546-1601). The third law gives a relationship between the radius of orbit R of a planet and its period T of revolution about the Sun. The law is expressed mathematically as
T2 = constant R3
AHL
We shall now use Newtons Law of Gravitation to show how it is that the planets move in accordance with Keplers third law. In essence Newton was able to use his law of gravity to predict the motion of the planets since all he had to do was factor the F given by this law into his second law, F = ma, to find their accelerations and hence their future positions. In Figure 934 the Earth is shown orbiting the Sun and the distance between their centres is R.
where $R_E$ is the radius of the Earth and $M_E$ is the mass of the Earth. Bearing in mind that $g_0 = \frac{G M_E}{R_E^2}$, then

$v = \sqrt{g_0 R_E} = \sqrt{10 \times 6.4 \times 10^6} = 8 \times 10^3$.
That is, the stone must be thrown at 8 × 10³ m s⁻¹. Clearly we are not going to get a satellite into orbit so close to the surface of the Earth. Moving at this speed the friction due to air resistance would melt the satellite before it had travelled a couple of kilometres. In reality therefore a satellite is put into orbit about the Earth by sending it, attached to a rocket, beyond the Earth's atmosphere and then giving it a component of velocity perpendicular to a radial vector from the Earth. See Figure 933.
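The estimate above can be checked numerically. The following is a minimal sketch, assuming the rounded values used in the text (g0 = 10 m s⁻² and an Earth radius of 6.4 × 10⁶ m); the function name is illustrative only.

```python
import math

def surface_orbit_speed(g0, radius):
    """Speed for a circular orbit just above the surface: v = sqrt(g0 * R)."""
    return math.sqrt(g0 * radius)

# Rounded values taken from the text (assumed, not precise data)
G0 = 10.0          # m s^-2
R_EARTH = 6.4e6    # m

v = surface_orbit_speed(G0, R_EARTH)
print(f"orbital speed near the surface: {v:.1e} m/s")
```

With these rounded inputs the result agrees with the 8 × 10³ m s⁻¹ quoted above.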
Figure 934 Planets move according to Kepler's third law (diagram labels: Earth, Sun, separation R, forces Fes and Fse)

Fes is the force that the Earth exerts on the Sun and Fse is the force that the Sun exerts on the Earth. The forces are equal and opposite and the Sun and the Earth will actually orbit about a common centre. However, since the Sun is so very much more massive than the Earth this common centre will be close to the centre of the Sun and so we can regard the Earth as orbiting about the centre of the Sun. The other
thing that we shall assume is that we can ignore the forces that the other planets exert on the Earth. (This would not be a wise thing to do if you were planning to send a spaceship to the Moon, for example.) We shall also assume that we have followed Newton's example and indeed proved that a sphere will act as a point mass situated at the centre of the sphere. Kepler had postulated that the orbits of the planets are elliptical but since the eccentricity of the Earth's orbit is small we shall assume a circular orbit. The acceleration of the Earth towards the Sun is $a = R\omega^2$, where $\omega = \frac{2\pi}{T}$.

Hence,

$a = R\left(\frac{2\pi}{T}\right)^2 = \frac{4\pi^2 R}{T^2}$

But the acceleration is given by Newton's Second Law, F = ma, where F is now given by the Law of Gravitation. So in this situation

$F = M_e a = \frac{G M_s M_e}{R^2}$, but we also have that $a = \frac{4\pi^2 R}{T^2}$, so that

$\frac{R^3}{T^2} = \frac{G M_s}{4\pi^2}$

The quantity $\frac{G M_s}{4\pi^2}$ is a constant that has the same value for each of the planets, so we have for all the planets, not just Earth, that

$\frac{R^3}{T^2} = \text{constant}$

… the 18th Century had reached a degree of perfection in design that enabled astronomers to actually measure the orbital perturbations of the planets. Their measurements were always in agreement with the predictions made by Newton's law. However, in 1781 a new planet, Uranus, was discovered and the orbit of this planet did not fit with the orbit predicted by Universal Gravitation. Such was the physicists' faith in the Newtonian method that they suspected that the discrepancy was due to the presence of a yet undetected planet. Using the Law of Gravitation the French astronomer J. Leverrier and the English astronomer J. C. Adams were able to calculate just how massive this new planet must be and also where it should be. In 1846 the planet Neptune was discovered just where they had predicted. In a similar way, discrepancies in the orbit of Neptune led to the prediction and subsequent discovery in 1930 of the planet Pluto. Newton's Law of Gravitation had passed the ultimate test of any theory; it is not only able to explain existing data but also to make predictions.
The gravitational potential energy of the satellite $V_{sat}$ is therefore $-\frac{G M_e m}{r}$.

That is, $V_{sat} = -\frac{G M_e m}{r}$.

Since $g_0 = \frac{G M_e}{R_e^2}$, we can write

$V_{sat} = -\frac{g_0 R_e^2 m}{r}$
The kinetic energy of the satellite $K_{sat}$ is equal to $\frac{1}{2}mv^2$, where v is its orbital speed.
$a = R\left(\frac{2\pi}{T}\right)^2 = \frac{4\pi^2 R}{T^2}$
as such the book cannot catch up with the floor of the elevator. Furthermore, if you happened to be standing on a set of bathroom scales, the scales would now read zero: you would be apparently weightless. It is this idea of free fall that explains the apparent weightlessness of astronauts in an orbiting satellite. These astronauts are in free fall in the sense that they are accelerating towards the centre of the Earth. It is actually possible to define the weight of a body in several different ways. We can define it, for example, as the gravitational force exerted on the body by a specified object such as the Earth. This is what we do in the many situations where we define the weight as being equal to mg. If we use this definition, then an object in free fall cannot by definition be weightless since it is still in a gravitational field. However, if we define the weight of an object in terms of a weighing process such as the reading on a set of bathroom scales, which in effect measures the contact force between the object and the scales, then clearly objects in free fall are weightless. One now has to ask the question whether or not it is possible, for example, to measure the gravitational force acting on an astronaut in orbit about the Earth. We shall return to this idea of bodies in free fall when we look at Einstein's General Theory of Relativity in Chapter 19. We can also define weight in terms of the net gravitational force acting on a body due to several different objects. For example, for an object out in space, its weight could be defined in terms of the resultant of the forces exerted on it by the Sun, the Moon, the Earth and all the other planets in the Solar System. If this resultant is zero at a particular point then the body is weightless at this point.

By equating the gravitational force acting on the satellite to the product of its mass and its centripetal acceleration we have

$\frac{G M_e m}{r^2} = \frac{m v^2}{r}$, so that $K_{sat} = \frac{1}{2}mv^2 = \frac{1}{2}\frac{G M_e m}{r} = \frac{g_0 R_e^2 m}{2r}$
Which is actually quite interesting since it shows that, irrespective of the orbital radius, the KE is numerically equal to half the PE. Also, the total energy $E_{tot}$ of the satellite is always negative since

$E_{tot} = K_{sat} + V_{sat} = \frac{1}{2}\frac{G M_e m}{r} + \left(-\frac{G M_e m}{r}\right) = -\frac{1}{2}\frac{G M_e m}{r}$
The energies of an orbiting satellite as a function of radial distance from the centre of a planet are shown plotted in Figure 935.
Figure 935 Energy of an orbiting satellite as a function of distance from the centre of a planet (vertical axis: energy / arbitrary units; horizontal axis: distance / R; curves: kinetic energy, total energy, potential energy)
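The relationships between the three energies can be illustrated with a short calculation. This is a sketch only: the value of G is the standard rounded value (an assumption, since it is not quoted in this section), the Earth data are the rounded figures used in the text, and the function name and the 500 kg satellite at twice the Earth's radius are chosen purely for illustration.

```python
G = 6.67e-11  # N m^2 kg^-2, standard rounded value (assumed)

def satellite_energies(m_planet, m_sat, r):
    """Return (kinetic, potential, total) energy for a circular orbit of radius r."""
    potential = -G * m_planet * m_sat / r       # V = -G M m / r
    kinetic = 0.5 * G * m_planet * m_sat / r    # K = (1/2) G M m / r
    total = kinetic + potential                 # E = -(1/2) G M m / r
    return kinetic, potential, total

# Illustrative case: 500 kg satellite at r = 2 R_e
k_sat, v_sat, e_tot = satellite_energies(6.0e24, 500.0, 2 * 6.4e6)
print(f"K = {k_sat:.3e} J, V = {v_sat:.3e} J, E = {e_tot:.3e} J")
```

Whatever radius is chosen, the kinetic energy comes out as half the magnitude of the potential energy and the total energy is negative, which is the pattern shown in Figure 935.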
In view of the various definitions of weight that are available to us it is important that when we use the word weight we are aware of the context in which it is being used.
9.4.5 WEIGHTLESSNESS
Suppose that you are in an elevator (lift) which is descending at constant speed and you let go of a book that you are holding in your hand. The book will fall to the floor with acceleration equal to the acceleration due to gravity. If the cable that supports the elevator were to snap (a situation that I trust will never happen to any of you) and you now let go of the book that you are holding in your other hand, this book will not fall to the floor: it will stay exactly in line with your hand. This is because the book is now falling with the same acceleration as the elevator and
Calculate the height above the surface of the Earth at which a geo-stationary satellite orbits.
Solution
A geo-stationary satellite is one that orbits the Earth in such a way that it is stationary with respect to a point on the surface of the Earth. This means that its orbital period must be the same as the time for the Earth to spin once on its axis i.e. 24 hours.
From Kepler's third law we have $\frac{R^3}{T^2} = \frac{G M_e}{4\pi^2}$, where the central mass is now the mass of the Earth, $M_e$.
We have seen that when dealing with gravitational fields and potential it is useful to remember that

$g_0 = \frac{GM}{R_e^2}$

That is,

$g_0 R_e = \frac{GM}{R_e}$.
The gravitational potential at a distance R
using the fact that the force of attraction between the satellite and the Earth is given by $F = \frac{G M_e m}{R^2}$ and that F = ma.

Now, the mass of the Earth is 6.0 × 10²⁴ kg and the period, T, measured in seconds is given by T = 86 400 s. So substitution gives R = 42 × 10⁶ m. The radius of the Earth is 6.4 × 10⁶ m so that the orbital height, h, is about 3.6 × 10⁷ m.
Example
Calculate the minimum energy required to put a satellite of mass 500 kg into an orbit that is at a height equal to the Earth's radius above the surface of the Earth.
The difference in potential between the surface of the Earth and a point distance R from the centre is therefore
Exercises 9.4

1. The speed needed to put a satellite in orbit does not depend on
A. the radius of the orbit.
B. the shape of the orbit.
C. the value of g at the orbit.
D. the mass of the satellite.

2. Estimate the speed of an Earth satellite whose orbit is 400 km above the Earth's surface. Also determine the period of the orbit.

3. Calculate the speed of a 200 kg satellite, orbiting the Earth at a height of 7.0 × 10⁶ m. Assume that g = 8.2 m s⁻² for this orbit.

4. The radii of two satellites, X and Y, orbiting the Earth are 2r and 8r where r is the radius of the Earth. Calculate the ratio of the periods of revolution of X to Y.

5. A satellite of mass m kg is sent from Earth's surface into an orbit of radius 5R, where R is the radius of the Earth. Write down an expression for
(a) the potential energy of the satellite in orbit.
(b) the kinetic energy of the satellite in orbit.
(c) the minimum work required to send the satellite from rest at the Earth's surface into its orbit.

6. A satellite in an orbit of 10r falls back to Earth (radius r) after a malfunction. Determine the speed with which it will hit the Earth's surface.

7. The radius of the Moon is … that of the Earth. Assuming Earth and the Moon to have the same density, compare the accelerations of free fall at the surface of Earth to that at the surface of the Moon.

8. Use the following data to determine the gravitational field strength at the surface of the Moon and hence determine the escape speed from the surface of the Moon. Mass of the Moon = 7.3 × 10²² kg, Radius of the Moon = 1.7 × 10⁶ m
THERMAL PHYSICS
10.1 Thermodynamics
10.2 Processes
10.3 The second law of thermodynamics and entropy
10
From the combined gas laws, we determined that:

$\frac{PV}{T} = k$ or $PV = kT$

If the value of the constant k is compared for different masses of different gases, it can be demonstrated that the constant depends not on the size of the atoms but rather on the number of particles present (the number of moles). Thus for n moles of any ideal gas:

$\frac{PV}{nT} = R$ or $PV = nRT$
10.1 THERMODYNAMICS
10.1.1 State the equation of state for an ideal gas.
10.1.2 Describe the difference between an ideal gas and a real gas.
10.1.3 Describe the concept of the absolute zero of temperature and the Kelvin scale of temperature.
10.1.4 Solve problems using the equation of state of an ideal gas.
IBO 2007
This is called the equation of state of an ideal gas, where R is the universal gas constant and is equal to 8.31 J mol⁻¹ K⁻¹. The equation of state of an ideal gas is determined from the gas laws and Avogadro's law.
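As an illustration of how the equation of state is used, the sketch below rearranges PV = nRT for the number of moles. The input values are those of the weather balloon example in this section (1.0 m³ of helium at 1.01 × 10⁵ N m⁻² and 35 °C); the function name and the use of 273 for the Celsius to kelvin offset are assumptions made to match the rounding in the text.

```python
R_GAS = 8.31  # J mol^-1 K^-1, value quoted in the text

def moles_of_gas(pressure, volume, temp_celsius):
    """Number of moles from PV = nRT, with the temperature converted to kelvin."""
    temp_kelvin = temp_celsius + 273.0
    return pressure * volume / (R_GAS * temp_kelvin)

n = moles_of_gas(1.01e5, 1.0, 35.0)   # Pa, m^3, degrees Celsius
mass = n * 4.003e-3                   # kg, using the molar mass of helium
print(f"n = {n:.2f} mol, mass = {mass:.3f} kg")
```

Note that the temperature must always be converted to kelvin before it is substituted into the equation.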
Most gases, at temperatures well above their boiling points and pressures that are not too high, behave like an ideal gas. In other words, real gases vary from ideal gas behaviour at high pressures and low temperatures.
Figure 1001 Variation of volume with temperature (horizontal axis: T in °C and K, marked at −273 and 0)

Note that, from the extrapolation of the straight line, the volume of a gas would theoretically be zero at −273.15 °C, called absolute zero. The scale chosen is called the Kelvin scale, and the kelvin (K) is the fundamental unit of thermodynamic temperature. Charles' (Gay-Lussac) Law of gases states that: The volume of a fixed mass of gas at constant pressure is directly proportional to its absolute (Kelvin) temperature. This can also be stated as: The volume of a fixed mass of gas increases by 1 / 273.15 of its volume at 0 °C for every degree Celsius rise in temperature provided the pressure is constant. When the variation in pressure as a function of temperature is plotted for an ideal gas, a graph similar to Figure 1002 is obtained.

Figure 1002 Variation of pressure with temperature (vertical axis: pressure, P / kPa; horizontal axis: T in °C and K, marked at −273 and 0)

The Pressure (Amontons') Law of gases states that: The pressure of a fixed mass of gas at constant volume is directly proportional to its absolute (Kelvin) temperature.

$P \propto T \iff P = kT \iff \frac{P_1}{T_1} = k$

Therefore,

$\frac{P_1}{T_1} = \frac{P_2}{T_2}$
When a pressure versus volume graph is drawn for the collected data, a hyperbola shape is obtained, and when pressure is plotted against the reciprocal of volume a straight line is obtained. See Figure 1003.
Figure 1003 Pressure-volume graphs (left: pressure, P / mm Hg against volume, V / cm³; right: pressure, P / mm Hg against 1/V / cm⁻³)
Boyle's Law for gases states that the pressure of a fixed mass of gas is inversely proportional to its volume at constant temperature.

$P \propto \frac{1}{V} \iff PV = \text{constant}$

When the conditions are changed, with the temperature still constant,
$P_1 V_1 = P_2 V_2$

Although the pressure and the reciprocal of volume have a directly proportional linear plot, it is the first volume-temperature graph that is used to define absolute zero. Although different samples of an ideal gas have different straight-line variations, they still extrapolate back to absolute zero.

Example

A weather balloon of volume 1.0 m³ contains helium at a pressure of 1.01 × 10⁵ N m⁻² and a temperature of 35 °C. What is the mass of the helium in the balloon if one mole of helium has a mass of 4.003 × 10⁻³ kg?

Solution

Using the equation PV = nRT, we have
(1.01 × 10⁵ N m⁻²) × (1.0 m³) = n × (8.31 J mol⁻¹ K⁻¹) × (35 + 273 K)
So that n = 39.46 mol.
Then, the mass of helium = (39.46 mol) × (4.003 × 10⁻³ kg mol⁻¹) = 0.158 kg.
The mass of helium in the balloon is 0.16 kg.

Example

An ideal gas has a density of 1.25 kg m⁻³ at STP. Determine the molar mass of the ideal gas.

Solution

For one mole of the gas the volume is V = molar mass / density, so that PV = nRT gives
(1.01 × 10⁵ N m⁻²) × (molar mass / 1.25 kg m⁻³) = 1 × (8.31 J mol⁻¹ K⁻¹) × (0 + 273 K)
Molar mass = (1 × 8.31 J mol⁻¹ K⁻¹ × 273 K × 1.25 kg m⁻³) / (1.01 × 10⁵ N m⁻²) = 28 × 10⁻³ kg mol⁻¹
The ideal gas has a molar mass of 2.8 × 10⁻² kg mol⁻¹, which is consistent with nitrogen (N₂).

Exercise 10.1

1. If the average translational kinetic energy of helium (molar mass 4 g mol⁻¹) at a temperature T is EK, then the average translational kinetic energy of neon (molar mass 20 g mol⁻¹) at the same temperature would be:
A. 1/5 EK
B. 5 EK
C. 5 EK
D. EK

2. A sample of gas is contained in a vessel at 20 °C at a pressure P. If the pressure of the gas is to be doubled and the volume remain constant, the gas has to be heated to:
A. 40 °C
B. 293 °C
C. 586 °C
D. 313 °C

3. Real gases behave most like ideal gases at
A. low temperatures and high pressures
B. high temperatures and low pressures
C. low temperatures and low pressures
D. high temperatures and high pressures

4. The Kelvin temperature of an ideal gas is a measure of:
A. the average potential energy of the gas molecules
B. the average speed of the gas molecules
C. the average pressure of the gas molecules
D. the average kinetic energy of the gas molecules
5. Two identical containers A and B contain an ideal gas under the different conditions as shown below.

6. The internal volume of a gas cylinder is 3.0 × 10⁻² m³. An ideal gas is pumped into the cylinder until the pressure is 15 MPa at a temperature of 25 °C.
(a) Determine the number of moles of the gas in the cylinder.
(b) Determine the number of gas atoms in the cylinder.
(c) Determine the average volume occupied by one atom of the gas.
(d) Estimate the average separation of the gas atoms.

7. A cylinder of an ideal gas with a volume of 0.2 m³ and a temperature of 25 °C contains 1.202 × 10²⁴ molecules. Determine the pressure in the cylinder.

8. (a) State what is meant by the term ideal gas.
(b) In terms of the kinetic theory of gases, state what is meant by an ideal gas.
(c) Explain why the internal energy of an ideal gas is kinetic energy only.

10.2 PROCESSES

10.2.1 Deduce an expression for the work involved in a volume change of a gas at constant pressure.
10.2.2 State the first law of thermodynamics.
10.2.3 Identify the first law of thermodynamics as a statement of the principle of energy conservation.
10.2.4 Describe the isochoric (isovolumetric), isobaric, isothermal and adiabatic changes of state of an ideal gas.
10.2.5 Draw and annotate thermodynamic processes and cycles on P-V diagrams.
10.2.6 Calculate from a P-V diagram the work done in a thermodynamic cycle.
10.2.7 Solve problems involving state changes of a gas.
IBO 2007
Figure 1005 Expansion of a gas at constant pressure (diagram labels: cylinder; piston displacement Δl)
volume, temperature and change in internal energy in determining the state of a system.

Heat can be transferred between a system and its environment because of a temperature difference. Another way of transferring energy between a system and its environment is to do work on the system or allow work to be done by the system on the surroundings. In order to distinguish between thermal energy (heat) and work in thermodynamic processes:
• If a system and its surroundings are at different temperatures and the system undergoes a process, the energy transferred by non-mechanical means is referred to as thermal energy (heat).
• Work is defined as the process in which energy is transferred by means that are independent of a temperature difference.

The pressure, p, on the piston = force per unit area, so that

$p = \frac{F}{A}$
Therefore, the force on the piston, F, is given by F = pA.

Suppose the piston is moved a distance l when the gas expands. Normally, if the gas expands, the volume increases and the pressure decreases, as was determined from Boyle's Law for ideal gases in the previous section. However, if the distance l is a small distance Δl, then the pressure can be considered constant. If the pressure is constant then the force F will be constant. The work done by the gas is:

$W = F\,\Delta l = pA\,\Delta l$ (since pressure p = force F / area A)
$= p\,\Delta V$ (since volume change ΔV = A Δl)

That is, (work done / J) = (pressure / N m⁻²) × (volume change / m³)

So that,

$W = p\,\Delta V = p(V_2 - V_1)$
The sign of the work done by the gas depends on whether the volume change is positive or negative. When a gas expands, as is the case for Figure 1005, then work is done by the gas, and the volume increases. As ΔV is positive, then W is positive. This equation is also valid if the gas is compressed. In the compression, work is done on the gas and the volume is decreased. Therefore, ΔV is negative which means that W will be negative. From the first law of thermodynamics this means that positive work is done on the gas.
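The sign convention for work at constant pressure can be illustrated with a small sketch. The pressures and volumes used here are hypothetical values chosen only to show an expansion and a compression; the function name is likewise illustrative.

```python
def work_done_by_gas(pressure, v_initial, v_final):
    """Work done BY a gas at constant pressure: W = p * (V2 - V1), in joules."""
    return pressure * (v_final - v_initial)

# Hypothetical values: 1.0e5 Pa, volumes in m^3
w_expansion = work_done_by_gas(1.0e5, 1.0e-3, 3.0e-3)    # volume increases
w_compression = work_done_by_gas(1.0e5, 3.0e-3, 1.0e-3)  # volume decreases

print(f"expansion:   W = {w_expansion:+.0f} J (work done by the gas)")
print(f"compression: W = {w_compression:+.0f} J (work done on the gas)")
```

A positive result means the gas has done work on its surroundings; a negative result means work has been done on the gas.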
In thermodynamics the word system is used often. A system is any object or set of objects that is being investigated. The surroundings will then be everything in the Universe apart from the system. For example, when a volume of gas in a cylinder is compressed with a piston, then the system is the cylinder-gas-piston apparatus and the surroundings is everything else in the Universe. A closed system is one in which no mass enters or leaves the system. It is an isolated system if no energy of any kind enters or leaves the system. Most systems are open systems because of the natural dynamic processes that occur in the Universe.

In Chapter 3, internal energy U was defined as the sum total of the potential energy and kinetic energy of the particles making up the system. From a microscopic viewpoint, the internal energy of an ideal gas is due to the kinetic energy of the thermal motion of its molecules. There are no intermolecular forces and thus there cannot be any increase in potential energy. Therefore a change in the temperature of the gas will change the internal energy of the gas. From the macroscopic point of view of thermodynamics, one would expect that the internal energy of the system would be changed if:
• work is done on the system
• work is done by the system
• thermal energy is added to the system
• thermal energy is removed from the system
Internal energy is a property of the system that depends on the state of the system. In thermodynamics, a change of
state of an ideal gas occurs if some macroscopic property of the system has changed, e.g. phase, temperature, pressure, volume, mass or internal energy. Heat and work can change the state of the system but they are not properties of the system. They are not characteristic of the state itself but rather they are involved in the thermodynamic process that can change the system from one state to another. The absolute value for internal energy is not known. This does not cause a problem as one is mainly concerned with changes in internal energy, denoted by ΔU, in thermodynamic processes. The first law of thermodynamics is a statement of the Law of Conservation of Energy in which the equivalence of work and thermal energy transfer is taken into account. It can be stated as:

the heat added to a closed system equals the change in the internal energy of the system plus the work done by the system. That is,

$Q = \Delta U + W = \Delta U + p\,\Delta V$

or,

$\Delta U = Q - W$

where +Q is the thermal energy added to the system and +W is the work done by the system. If thermal energy leaves the system, then Q is negative. If work is done on the system, then W is negative. For an isolated system, W = Q = 0 and ΔU = 0.

IDEAL GAS

Isobaric processes

A graph of pressure as a function of volume change when the pressure is kept constant is shown in Figure 1006. Such a process is said to be isobaric. Note that the work done by the gas is equal to the area under the curve.

Figure 1006 Work done by a gas expanding at constant pressure (p-V diagram: horizontal line at p = pconst from V1 to V2; area under the line = work done = p(V2 − V1))

An isobaric transformation requires a volume change at constant pressure, and for this to occur, the temperature needs to change to keep the pressure constant. p = constant, or V / T = constant. For an isobaric expansion, work is done by the system so W is positive. Thermal energy is added to cause the expansion so Q is positive. This means that ΔU must be positive. For an isobaric compression, all terms would be negative.
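The first law and its sign convention (Q positive when thermal energy is added to the system, W positive when work is done by the system) can be sketched in code. The numerical values below are hypothetical and serve only to illustrate the signs; the function name is an assumption.

```python
def change_in_internal_energy(heat_added, work_by_system):
    """First law of thermodynamics: delta U = Q - W."""
    return heat_added - work_by_system

# Hypothetical isobaric expansion: 500 J of heat added, gas does 200 J of work
du_expansion = change_in_internal_energy(500.0, 200.0)

# Hypothetical adiabatic compression: Q = 0, 150 J of work done ON the gas (W = -150 J)
du_adiabatic = change_in_internal_energy(0.0, -150.0)

# Isolated system: Q = W = 0
du_isolated = change_in_internal_energy(0.0, 0.0)

print(du_expansion, du_adiabatic, du_isolated)
```

In the second case the work done on the gas appears as an increase in internal energy, since no thermal energy is exchanged.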
Isochoric processes

Note that the work done by the gas is equal to zero as ΔV = 0. There is zero area under the curve on a p-V diagram. However, the temperature and pressure can both change and so such a transformation will be accompanied by a thermal energy change. V = constant, or p / T = constant. For an isochoric process, no work is done by the system so W is zero. If thermal energy leaves the system, Q is negative, which means that ΔU must be negative. For an isochoric process in which thermal energy is added to the system, W is zero, and Q and ΔU are positive.
Isothermal processes
A thermodynamic process in which the pressure and the volume are varied while the temperature is kept constant is called an isothermal process. In other words, when an ideal gas expands or is compressed at constant temperature, then the gas is said to undergo an isothermal expansion or compression. Figure 1008 shows three isotherms for an ideal gas at different temperatures where T1 < T2 < T3.
Consider an ideal gas enclosed in a thin conducting vessel that is in contact with a heat reservoir, and is fitted with a light, frictionless, movable piston. If an amount of heat Q is added to the system which is at point A of Figure 1008, then the system will move to another point on the graph, B. The heat taken in will cause the gas to expand isothermally and will be equivalent to the mechanical work done by the gas. Because the temperature is constant, there is no change in internal energy of the gas. That is,
$\Delta T = 0$ and $\Delta U = 0$, so that $Q = W$
If the gas expands isothermally from A to B and then returns from B to A following exactly the same path during compression, then the isothermal change is said to be reversible. The conditions described above would follow this criterion.

Figure 1008 Isotherms for an ideal gas (p-V diagram: three isotherms labelled T1, T2 and T3; points A and B joined by an isothermal process)

The curve of an isothermal process represents a Boyle's Law relation:

T = constant, or pV = constant = nRT

The moles of gas n, the molar gas constant R, and the absolute temperature T are constant. For an isothermal expansion, temperature is constant so ΔU is zero. Work is done by the system so W is positive. This means that Q must be positive. In order to keep the temperature constant during an isothermal process:
• the gas is assumed to be held in a thin container with a high thermal conductivity that is in contact with a heat reservoir, an ideal body of large mass whose temperature remains constant when heat is exchanged with it, e.g. a constant-temperature water bath.
• the expansion or compression should be done slowly so that no eddies are produced to create hot spots that would disrupt the energy equilibrium of the gas.
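Because pV is constant along an isotherm, the pressure at any other volume on the same isotherm follows directly. The sketch below uses hypothetical values for the starting state; the function name is illustrative only.

```python
def isothermal_pressure(p_initial, v_initial, v_final):
    """Pressure after an isothermal change of an ideal gas, from p1 * V1 = p2 * V2."""
    return p_initial * v_initial / v_final

# Hypothetical state A: 2.0e5 Pa at 1.0 m^3, expanded isothermally to 2.0 m^3 (state B)
p_a, v_a, v_b = 2.0e5, 1.0, 2.0
p_b = isothermal_pressure(p_a, v_a, v_b)

print(f"p_B = {p_b:.1e} Pa")
print(f"pV at A = {p_a * v_a:.1e} J, pV at B = {p_b * v_b:.1e} J")
```

Doubling the volume at constant temperature halves the pressure, while the product pV (equal to nRT) is unchanged.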
Adiabatic processes
An adiabatic expansion or contraction is one in which no heat Q is allowed to flow into or out of the system. For the entire adiabatic process, Q = 0. To ensure that no heat enters or leaves the system during an adiabatic process it is important to:
• make sure that the system is extremely well insulated.
• carry out the process rapidly so that the heat does not have the time to leave the system.
The compression stroke of an automobile engine is essentially an adiabatic compression of the air-fuel mixture. The compression occurs too rapidly for appreciable heat transfer to take place. In an adiabatic compression the work done on the gas will lead to an increase in the internal energy resulting in an increase in temperature.
$\Delta U = Q - W$, but $Q = 0$, so $\Delta U = -W$
In an adiabatic expansion the work done by the gas will lead to a decrease in the internal energy resulting in a decrease in temperature. Figure 1009 shows the relationship that exists between an adiabatic and three isothermals. Note that the adiabatic curve is steeper than the isotherm AB because the adiabatic process has to occur rapidly so that the heat does not have
time to leave the system. The gas expands isothermally from point A to point B, and then it is compressed adiabatically from B to C. The temperature increases as a result of the adiabatic process from T1 to T3. If the gas is then compressed at constant pressure from the point C to A, the net amount of work done on the gas will equal the area enclosed by ABC.
Figure 1009 (p-V diagram labels: isotherms T1, T2 and T3; point A; isothermal process; isobaric process; adiabatic process)

Figure 1011 (panels: (a) work done by gas in expanding; (b) work done on gas as it is compressed; (c) net work done by gas; horizontal axes: V)
For an adiabatic compression, no heat enters or leaves the system so Q is zero. Work is done on the system so W is negative. Since ΔU = −W, this means that ΔU must be positive. For an adiabatic expansion, Q is zero, W is positive and ΔU is negative. In Figure 1010 the area ABDE = work done by the gas during isothermal expansion. The area ACDE = work done by the gas during an adiabatic expansion.
Figure 1010 (p-V diagram labels: A, B, C, D, E; isothermal; adiabatic)

The work done by a gas and the work done on a gas can be seen using the following graphical representation of a pressure-volume diagram (Figure 1011).

A thermodynamic cycle is a process in which the system is returned to the same state from which it started. That is, the initial and final states are the same in the cyclic process. A cycle for a simple engine was shown in Figure 1009. The net work done in the cycle is equal to the area enclosed by the cycle. Suppose a piston was placed on a heat reservoir, such as the hot plate of a stove. Thermal energy is supplied by the thermal reservoir, and work is done by the gas inside the piston as it expands. But this is not an engine as it only operates in one direction. The gas cannot expand indefinitely, because as the volume of the gas increases, the pressure decreases (Boyle's Law). Some point will be reached when the expanding gas will not be able to move the piston. For this simple engine to function, the piston must eventually be compressed to restore the system to its original position ready to do work. For a cycle to do net work, thermal contact with the original heat reservoir must be broken, and temperatures other than that of the original heat reservoir must play a part in the process. In the above example, if the piston is returned to its original position while in contact with the hot plate, then all the work that the gas did in the expansion will have to be used in the compression. On a p-V diagram, one would draw an isotherm for the expansion and an isotherm for the compression lying on top of the expansion isotherm but in the opposite direction. Therefore, the area enclosed by the cycle would be zero. However, if the gas is compressed at a lower temperature the internal pressure of the system will be lower than during the expansion. Less work will be needed for the compression than was produced in the expansion, and there will be net work available for transformation to mechanical energy.
Figure 1012 (energy flow diagram labels: QH; engine does useful work, W; QL; cold reservoir at TL)
The heat input QH is represented as coming from the high temperature reservoir TH which is maintained at a constant temperature. Thermal energy QH is taken from the hot reservoir. This thermal energy is used to do work in the heat engine. Then thermal energy QL can be given to the low temperature reservoir TL without increasing its temperature. If a perfect engine completed a cycle, the change in internal energy ΔU would be zero because all the heat would be converted to work. However, there is no perfect heat engine and the flow diagram in Figure 1012 is more the reality. At this stage, we will assume that the change in internal energy is zero. From the First Law of Thermodynamics
$\Delta U = 0 = Q - W$, so that $W = Q$

That is,

$Q_H - Q_L = W$
Thus for a cycle, the heat added to the system equals the work done by the system plus the heat that flows out at lower temperature. An ideal gas can be used as a heat engine as in the simple cycle in Figure 1013.
Figure 1013 (p-V diagram: p / kPa against V / m³; cycle through points A, B, C and D, with axis marks at 6 kPa, 4 m³ and 10 m³)
From A to B, the gas is compressed (volume decreases) while the pressure is kept constant: an isobaric compression. The amount of work done by the gas is given by the area under the 2 kPa isobar. Using the fact that W = pΔV, we have that

W = 2 kPa × (4 − 10) m³ = −1.2 × 10⁴ J

From B to C, the volume is kept constant as the pressure increases: an isochoric increase in pressure. This can be achieved by heating the gas. Since ΔV = 0, no work is done by the gas, W = 0.

From C to D, the gas expands (volume increases) while the pressure is kept constant: an isobaric expansion. The amount of work done by the gas is given by the area under the 6 kPa isobar. Now, we have that W = pΔV, so that

W = 6 kPa × (10 − 4) m³ = 3.6 × 10⁴ J

From D to A, the gas is cooled at constant volume so that the pressure decreases: an isochoric decrease in pressure. Again ΔV = 0 and no work is done by the gas, W = 0.

That is, the net work done by the gas is therefore 3.6 × 10⁴ J − 1.2 × 10⁴ J = 2.4 × 10⁴ J.

Figure 1014 Four-stroke internal combustion engine

Motor cars usually have four or six cylinders, but five and eight cylinders are also common. The pistons are connected by a crankshaft to a flywheel which keeps the engine turning over during the power stroke. Automobiles are about 25% efficient.

Any device that can pump heat from a low-temperature reservoir to a high-temperature reservoir is called a heat pump. Examples of heat pumps include the refrigerator and reverse cycle air-conditioning devices used for space heating and cooling. In the summer component of Figure 1015, the evaporator heat exchanger on the inside extracts heat from the surroundings. In the winter component, the evaporator heat exchanger is outside the room, and it exhausts heat to the inside air. In both cases, thermal energy is pumped from a low-temperature reservoir to a high-temperature reservoir.
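Returning to the cycle ABCD of Figure 1013, the net work can be checked by summing the work done on each leg. This is a minimal sketch: the vertex coordinates are read from the description of the cycle in the text (A at 10 m³ and 2 kPa, B at 4 m³ and 2 kPa, C at 4 m³ and 6 kPa, D at 10 m³ and 6 kPa), and the function name is an assumption. Each leg is treated as a straight line on the p-V diagram, so the work is the average pressure multiplied by the volume change, which reduces to pΔV on the isobars and to zero on the isochores.

```python
def cycle_net_work(vertices):
    """Net work done BY the gas around a closed cycle of straight-line legs.

    vertices: list of (volume in m^3, pressure in Pa) in the order traversed.
    """
    net = 0.0
    for i, (v1, p1) in enumerate(vertices):
        v2, p2 = vertices[(i + 1) % len(vertices)]  # wrap around to close the cycle
        net += 0.5 * (p1 + p2) * (v2 - v1)          # area under a straight-line leg
    return net

# Vertices A, B, C, D taken from the description of Figure 1013
cycle = [(10.0, 2.0e3), (4.0, 2.0e3), (4.0, 6.0e3), (10.0, 6.0e3)]

net_work = cycle_net_work(cycle)
print(f"net work done by the gas = {net_work:.1e} J")
```

The result equals the area enclosed by the cycle, in agreement with the 2.4 × 10⁴ J found above.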
[Figure 1014 labels: intake stroke, compression stroke, power stroke and exhaust stroke, showing the piston, crankshaft, intake valve, exhaust valve, gas vapor, ignition and spent fuel gases.]

The mixture burns rapidly and the hot gases then expand against the piston in the power stroke. The exhaust valve is opened as the piston moves upwards during the exhaust stroke, and the cycle begins again.

[Figure 1015: a room in WINTER and in SUMMER, showing the positions of the condenser (TH) and the evaporator (TL) in each case.]
Figure 1016 shows the energy flow that occurs in a heat pump cycle. By doing work on the system, heat QL is added from the low temperature TL reservoir, being the inside of the refrigerator. A greater amount of heat QH is exhausted to the high temperature TH reservoir.
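[Figure 1016: energy flow in a heat pump, showing the work W done on the system, the heat QL taken from the low temperature reservoir at TL and the heat QH exhausted to the high temperature reservoir.]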
An ideal gas can be used as a heat pump as in the simple cycle in Figure 1017.
[Figure 1017: pV diagram of a simple heat pump cycle; pressure p (kPa) against volume V (m$^3$).]
Because the cycle is traced in an anticlockwise direction, the net work done on the surroundings is negative.
[Figure 1018 The typical small refrigerator. Labels: freezer compartment, cold food, HFC gas, condenser pipes, cooling fins, vapour at very high temperature, compressor pump, HEAT OUT.]

On the high-pressure, high-temperature side of the throttling valve, thermal energy is removed from the system. The vaporised HFC in the compressor pipes is compressed by the compressor pump, and gives up its
latent heat of vaporisation to the air surrounding the compressor pipes. The heat fins act as a heat sink to radiate the thermal energy to the surroundings at a faster rate. The fins are painted black and they have a relatively large surface area for their size.

The net work is the area enclosed by ABCDA. In the case given, the Carnot engine is working in a clockwise cycle ABCDA. Thermal energy is absorbed by the system at the high temperature reservoir TH and is expelled at the low temperature reservoir TL. Work is done by the system as it expands along the top isotherm from A to B, and along the adiabat from B to C. Work is then done on the system to compress it along the bottom isotherm from C to D and along the left adiabat from D to A.

The efficiency of the Carnot cycle depends only on the absolute temperatures of the high and low temperature reservoirs. The greater the temperature difference, the greater the efficiency will be.

As a result of the Carnot efficiency, many scientists list a Third Law of Thermodynamics which states: It is impossible to reach the absolute zero of temperature, 0 K. The efficiency of the Carnot cycle would be 100% if the low temperature reservoir were at absolute zero. Therefore absolute zero is unattainable.
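The dependence of the Carnot efficiency on the two reservoir temperatures can be illustrated with the standard relation $\eta = 1 - T_L/T_H$, with both temperatures in kelvin. The sketch below is illustrative only: the function name and the sample temperatures (500 K and 300 K) are assumptions and are not taken from the text.

```python
def carnot_efficiency(t_hot_k, t_cold_k):
    """Carnot efficiency for reservoirs at absolute temperatures t_hot_k > t_cold_k >= 0."""
    if t_cold_k < 0 or t_hot_k <= t_cold_k:
        raise ValueError("need t_hot_k > t_cold_k >= 0 (temperatures in kelvin)")
    return 1.0 - t_cold_k / t_hot_k


# Hypothetical reservoirs: a larger temperature difference gives a larger efficiency.
eta_small_gap = carnot_efficiency(500.0, 400.0)
eta_large_gap = carnot_efficiency(500.0, 300.0)

# Limiting case discussed in the text: a cold reservoir at absolute zero.
eta_limit = carnot_efficiency(500.0, 0.0)

print(f"500 K / 400 K: {eta_small_gap:.2f}")
print(f"500 K / 300 K: {eta_large_gap:.2f}")
print(f"500 K / 0 K:   {eta_limit:.2f}")
```

The last case shows why an efficiency of 100% would require a reservoir at 0 K.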
Carnot argued that if thermal energy does flow from a cold body to a hot body then work must be done. Therefore, no engine can be more efficient than an ideal reversible one and that all such engines have the same efficiency. This means that if all engines have the same efficiency then only a simple engine was needed to calculate the efficiency of any engine. Consider an ideal perfectly insulated, frictionless engine that can work backwards as well as forwards. The pV diagram would have the form of that shown in Figure 1019.
[Figure 1019: pV diagram of the Carnot cycle ABCDA, showing the isothermal expansion from A to B (QH absorbed at TH), the adiabatic expansion from B to C (Q = 0), the isothermal compression from C to D (QL expelled at TC) and the adiabatic compression from D to A (Q = 0).]

Example

If 22 J of work is done on a system and $3.4 \times 10^2$ J of heat is added, determine the change in internal energy of the system.

Solution

Using the formula $Q = \Delta U + W$, we have that

$340\ \text{J} = \Delta U + (-22)\ \text{J}$

so that

$\Delta U = 340\ \text{J} + 22\ \text{J} = 362\ \text{J}$

That is, the change in internal energy of the system is $3.6 \times 10^2$ J.
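The sign convention in the example above, where $W$ is the work done by the system, can be made explicit in a few lines. This is a minimal sketch; the function and argument names are illustrative assumptions.

```python
def change_in_internal_energy(heat_added_j, work_done_by_system_j):
    """First law in the form Q = dU + W, rearranged to dU = Q - W.

    Work done ON the system is entered as a negative value of W.
    """
    return heat_added_j - work_done_by_system_j


# Values from the worked example: 340 J of heat added, 22 J of work done ON the system.
delta_u = change_in_internal_energy(340.0, -22.0)
print(f"Change in internal energy: {delta_u:.0f} J")
```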
Example

6.0 dm$^3$ of an ideal gas is at a pressure of 202.6 kPa. It is heated so that it expands at constant pressure until its volume is 12 dm$^3$. Determine the work done by the gas.

Solution

Using the formula $W = p\Delta V$, we have that

$W = 202.6\ \text{kPa} \times (12 - 6.0)\ \text{dm}^3 = 202.6 \times 10^3\ \text{Pa} \times (12 - 6.0) \times 10^{-3}\ \text{m}^3 = 1.216 \times 10^3\ \text{J}$

That is, the work done by the gas in the expansion is $1.2 \times 10^3$ J.

Example

A thermal system containing a gas is taken around a cycle of a heat engine as shown in the Figure below.

(a) Starting at point A, describe the cycle.
(b) Label the diagram fully showing the maximum and minimum temperature reservoirs.
(c) Estimate the amount of work done in each cycle.

[Figure: pV diagram of the cycle ABCD, between volumes V1 and V2.]

Solution

(a) The fuel-air mixture enters the piston at point A. The compression AB is carried out rapidly with no heat exchange, making it an adiabatic compression. The ignition and combustion of the gases introduces a heat input QH that raises the temperature at constant volume from B to C. The power stroke is an adiabatic expansion from C to D. Thermal energy QL leaves the system during the exhaust stroke, and cooling occurs at constant volume from D to A.

(b) The Figure below shows the changes that occur for each process in the cycle.

[Figure: labelled pV diagram of the cycle, showing the adiabatic compression, the heat input QH up to C at the maximum temperature and the heat output QL down to A at the minimum temperature, between volumes V1 and V2.]

(c) The net work is represented by the enclosed area ABCD. If we assume that the area is approximately a rectangle with sides of $4 \times 10^5$ Pa and 200 cm$^3$, we have:

$W \approx 4 \times 10^5\ \text{Pa} \times 200 \times 10^{-6}\ \text{m}^3 = 80\ \text{J}$
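Both estimates above depend on converting the volumes to cubic metres before multiplying by a pressure in pascals. The sketch below repeats the two calculations with the conversions written out; the constant and function names are illustrative assumptions.

```python
# Unit conversion factors to SI units.
KPA_TO_PA = 1.0e3    # 1 kPa  = 1000 Pa
DM3_TO_M3 = 1.0e-3   # 1 dm^3 = 0.001 m^3
CM3_TO_M3 = 1.0e-6   # 1 cm^3 = 0.000001 m^3


def isobaric_work(pressure_pa, v_initial_m3, v_final_m3):
    """Work done by a gas at constant pressure, W = p * (V_final - V_initial)."""
    return pressure_pa * (v_final_m3 - v_initial_m3)


# First example: expansion from 6.0 dm^3 to 12 dm^3 at 202.6 kPa.
w_expansion = isobaric_work(202.6 * KPA_TO_PA, 6.0 * DM3_TO_M3, 12.0 * DM3_TO_M3)

# Second example: the enclosed area treated as a rectangle of
# height 4e5 Pa and width 200 cm^3.
w_cycle = 4.0e5 * (200.0 * CM3_TO_M3)

print(f"Isobaric expansion: {w_expansion:.4g} J")
print(f"Cycle estimate:     {w_cycle:.3g} J")
```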
Example

For the compression stroke of an experimental diesel engine, the air is rapidly decreased in volume by a factor of 15, the compression ratio. The work done on the air-fuel mixture for this compression is measured to be 550 J.

(a) What type of thermodynamic process is likely to have occurred?
(b) What is the change in internal energy of the air-fuel mixture?
(c) Is the temperature likely to increase or decrease?
Solution

(a) Because the compression occurs rapidly, appreciable heat transfer does not take place, and the process can be considered to be adiabatic, $Q = 0$.

(b) $\Delta U = Q - W = 0 - (-550)\ \text{J}$. Therefore, the change in internal energy is 550 J.

(c) The temperature rise will be very large, resulting in the spontaneous ignition of the air-fuel mixture.

Exercise 10.2

1. An ideal gas was slowly compressed at constant temperature to one quarter of its original volume. In the process, $1.5 \times 10^3$ J of heat was given off. The change in internal energy of the gas was
A. $1.5 \times 10^3$ J
B. 0 J
C. $-1.5 \times 10^3$ J
D. $6.0 \times 10^3$ J

2. When an ideal gas in a cylinder is compressed at constant temperature by a piston, the pressure of the gas increases. Which of the following statement(s) best explain the reason for the pressure increase?
I. the mean speed of the molecules increases
II. the molecules collide with each other more frequently
III. the rate of collision with the sides of the cylinder increases.
A. II only
B. III only
C. I and II only
D. II and III only

3. An ideal gas in a thermally insulated cylinder is compressed rapidly. The change in state would be:
A. isochoric
B. isothermal
C. adiabatic
D. isobaric

4. The Figure below shows the variation of pressure p with volume V during one complete cycle of a simple heat engine.
[Figure: pV diagram of the cycle, with regions labelled X and Y.]
A. X + Y
B. X − Y
C. X
D. Y

5. The Figure below shows the variation of the pressure p with volume V of a gas during one cycle of the Otto engine.
[Figure: pV diagram of the Otto cycle ABCD, between volumes V1 and V2, with the heat flows Q marked.]
During which process does the gas do external work?
A. AB
B. CD
C. BC and CD
D. AB and CD

6. A system absorbs 100 J of thermal energy and in the process does 40 J of work. The change in internal energy is:
A. 60 J
B. 40 J
C. 100 J
D. 140 J
7. Work is done when the volume of an ideal gas increases. During which of the following state processes would the work done be the greatest?
A. isochoric
B. isothermal
C. isobaric
D. adiabatic

8. If $1.68 \times 10^5$ J of heat is added to a gas that expands and does $8.1 \times 10^5$ J of work, what is the change in internal energy of the gas?

9. 6.0 m$^3$ of an ideal gas is cooled at constant normal atmospheric pressure until its volume is one sixth of its original volume. It is then allowed to expand isothermally back to its original volume. Draw the thermodynamic process on a pV diagram.

10. A system consists of 3.0 kg of water at 75 °C. Stirring the system with a paddlewheel does $2.5 \times 10^4$ J of work on it while $6.3 \times 10^4$ J of heat is removed. Calculate the change in internal energy of the system, and the final temperature of the system.

11. A gas is allowed to expand adiabatically to four times its original volume. In doing so the gas does 1750 J of work.
(a) How much heat flowed into the gas?
(b) Will the temperature rise or fall?
(c) What is the change in internal energy of the gas?

12. How much heat energy must be added at atmospheric pressure to 0.50 kg of ice at 0 °C to convert it to steam at 100 °C?

13. For each of the processes listed in the following table, supply the symbol +, −, or 0 for each missing entry.

Process | Q | W | ΔU
Isobaric compression of an ideal gas | | |
Isothermal compression of an ideal gas | | |
Adiabatic expansion | | |
Isochoric pressure drop | | |
Free expansion of a gas | | |

14. Helium gas at 312 K is contained in a cylinder fitted with a movable piston. The gas is initially at 2 atmospheres pressure and occupies a volume of 48.8 L. The gas expands isothermally until the volume is 106 L. Then the gas is compressed isobarically at that final pressure back to the original volume of 48.8 L. It then isochorically returns back to its original pressure. Assuming that the helium gas behaves like an ideal gas
(a) Calculate the number of moles of helium gas in the system.
(b) Determine the pressure after the isothermal expansion.
(c) Draw a diagram of the thermodynamic cycle.
(d) Assuming that the isotherm is a diagonal line rather than a curve, estimate the work done during the isothermal expansion.
(e) Determine the work done during the isobaric compression.
(f) Determine the work done during the isochoric part of the cycle.
(g) Calculate the net work done by the gas.
(h) Calculate the final temperature of the helium.

15. (a) Distinguish between an isothermal process and an adiabatic process as applied to an ideal gas.
(b) A fixed mass of an ideal gas is held in a cylinder by a moveable piston and thermal energy is supplied to the gas causing it to expand at a constant pressure of $1.5 \times 10^2$ kPa as shown in the Figure below.
[Figure: gas held in a cylinder by a piston, with thermal energy supplied.]
The initial volume of the gas in the container is 0.040 m$^3$ and after expansion the volume is 0.10 m$^3$. The total energy supplied to the gas during the process is 7.0 kJ.
(i) State whether this process is isothermal, adiabatic or neither of these processes.
(ii) Determine the work done by the gas.
(iii) Calculate the change in internal energy of the gas.
16. This question is about a diesel engine cycle as shown in the Figure below. Mark on the diagram each of the state changes that occur at AB, BC, CD and DA. Identify the maximum and minimum temperature reservoirs and label QH and QL.
[Figure: pV diagram of the diesel engine cycle ABCD, between volumes V1 and V2, with the heat flows Q marked.]
Introduction
We are always told to conserve energy. But according to the First Law of Thermodynamics, in a closed system, energy is conserved, and the total amount of energy in the Universe does not change no matter what we do. Although the First Law of Thermodynamics is correct, it does not tell the whole story.

How often have you seen a videotape played in reverse sequence, with views of water flowing uphill, demolished buildings rising from the rubble and people walking backwards? In none of the natural Laws of Physics studied so far have we encountered time reversal. If all of these Laws are obeyed, why then does the time-reversed sequence seem improbable? To explain this reversal paradox, scientists in the latter half of the nineteenth century came to formulate a new principle called the Second Law of Thermodynamics. This Law allows us to determine which processes will occur in nature, and which will not.

There are many different but equivalent ways of stating the Second Law of Thermodynamics. Much of the language used for the definitions had its origins with the physicists
who formulated the Law, and their desire to improve the efficiency of steam engines. These statements of the Second Law of Thermodynamics will be developed within this section.
10.3.2 ENTROPY
Recall that in thermodynamics, a system in an equilibrium state is characterised by its state variables (p, V, T, U, n ). The change in a state variable for a complete cycle is zero. In contrast, the net thermal energy and net work factors for a cycle are not equal to zero. In the latter half of the nineteenth century, Rudolf Clausius proposed a general statement of the Second Law in terms of a quantity called entropy. Entropy is a thermodynamic function of the state of the system and can be interpreted as the amount of order or disorder of a system. As with internal energy, it is the change in entropy that is important and not its absolute value.
It is possible to convert heat into work in a non-cyclic process. An ideal gas undergoing an isothermal expansion does just that. But after the expansion, the gas is not in its original state. In order to bring the gas back to its original state, an amount of work will have to be done on the gas and some thermal energy will be exhausted. The Kelvin Planck statement formulates that if energy is to be extracted from a reservoir to do work, a colder reservoir must be available in which to exhaust a part of the energy.
$\Delta S = \dfrac{Q}{T}$

The units of the change in entropy are J K$^{-1}$.
Example

A heat engine removes 100 J each cycle from a heat reservoir at 400 K and exhausts 85 J of thermal energy to a reservoir at 300 K. Compute the change in entropy for each reservoir.

Solution

$\Delta S_H = \dfrac{-100\ \text{J}}{400\ \text{K}} = -0.25\ \text{J K}^{-1}$ and $\Delta S_L = \dfrac{85\ \text{J}}{300\ \text{K}} = 0.28\ \text{J K}^{-1}$

The change in entropy of the hot reservoir is −0.25 J K$^{-1}$ and the change in entropy of the cold reservoir is 0.28 J K$^{-1}$. The increase in entropy of the cold reservoir is greater than the decrease for the hot reservoir. The total change in entropy of the whole system equals 0.033 J K$^{-1}$. That is,

$\Delta S = \Delta S_H + \Delta S_L = -0.250\ \text{J K}^{-1} + 0.283\ \text{J K}^{-1} = 0.033\ \text{J K}^{-1}$

In this example and all other cases, it has been found that the total entropy increases. (For an ideal Carnot reversible cycle it can equal zero. The Carnot cycle was discussed earlier.) This implies that total entropy increases in all natural systems. The entropy of a given system can increase or decrease but the change in entropy of the system $\Delta S_S$ plus the change in entropy of the environment $\Delta S_{env}$ must be greater than or equal to zero, i.e.,

$\Delta S = \Delta S_S + \Delta S_{env} \geq 0$

In terms of entropy, the Second Law of Thermodynamics can be stated as

The total entropy of any system plus that of its environment increases as a result of all natural processes.

or

Natural processes tend to move toward a state of greater disorder.

or

The entropy of the Universe increases.
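The entropy bookkeeping of the heat engine example (100 J removed at 400 K, 85 J exhausted at 300 K) can be reproduced in a few lines. This is a minimal sketch assuming each reservoir stays at constant temperature, so that its entropy change is the heat entering it divided by its absolute temperature; the function and variable names are illustrative.

```python
def reservoir_entropy_change(heat_into_reservoir_j, temperature_k):
    """Entropy change of a constant-temperature reservoir, dS = Q / T.

    Heat LEAVING the reservoir is entered as a negative value.
    """
    return heat_into_reservoir_j / temperature_k


ds_hot = reservoir_entropy_change(-100.0, 400.0)   # heat leaves the hot reservoir
ds_cold = reservoir_entropy_change(85.0, 300.0)    # heat enters the cold reservoir
ds_total = ds_hot + ds_cold

print(f"hot reservoir:  {ds_hot:+.3f} J/K")
print(f"cold reservoir: {ds_cold:+.3f} J/K")
print(f"total:          {ds_total:+.3f} J/K")
```

A positive total is consistent with the entropy statement of the Second Law.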
Ludwig Boltzmann, an Austrian physicist, was also concerned with the heat death of the Universe and irreversibility. He concluded that the tendency toward dissipation of heat is not an absolute Law of Physics but rather a Statistical Law.

Consider $10^{22}$ air molecules in a container. At any one instant, there would be a large number of possibilities for the position and velocity of each molecule (its microstate), and the molecules would be disordered. Even if there is some momentary order in a group of molecules due to chance, the order would become less after collision with other molecules. Boltzmann argued that probability is directly related to disorder and hence to entropy. In terms of the Second Law of Thermodynamics, probability does not forbid a decrease in entropy but rather its probability of occurring is extremely low.

If a coin is flipped 100 times, it is not impossible for the one hundred coins to land heads up, but it is highly improbable. The probability of rolling 100 sixes from 100 dice is even smaller. A small sample of a gas contains billions of molecules and the molecules have many possible microstates. It is impossible to know the position and velocity of each molecule at a given point in time. The probability of these microstates suddenly coming together into some improbable arrangement is infinitesimal. In reality, the macrostate is the only measurable part of the system. The Second Law in terms of probability does not imply that a decrease in entropy is not allowed, but it suggests that the probability of this occurring is low.

A final consequence of the Second Law is the heat degradation of the Universe. It can be reasoned that in any natural process, some energy becomes unavailable to do useful work. An outcome of this suggests that the Universe will eventually reach a state of maximum disorder. An equilibrium temperature will be reached and no work will be able to be done.
All change of state will cease as all the energy in the Universe becomes degraded to thermal energy. This point in time is often referred to as the heat death of the Universe.
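The coin and dice probabilities mentioned in the discussion of Boltzmann's argument can be put into numbers. The sketch below assumes fair coins and fair dice; the function names and the four-coin illustration are assumptions and are not taken from the text.

```python
from fractions import Fraction
from math import comb


def microstates(n_coins, n_heads):
    """Number of distinct head/tail arrangements (microstates) with n_heads heads."""
    return comb(n_coins, n_heads)


def macrostate_probability(n_coins, n_heads):
    """Probability of the macrostate 'n_heads heads' for n_coins fair coins."""
    return Fraction(microstates(n_coins, n_heads), 2 ** n_coins)


# 100 coins all landing heads up: one microstate out of 2**100.
p_all_heads = macrostate_probability(100, 100)

# 100 fair dice all showing a six.
p_all_sixes = Fraction(1, 6) ** 100

# Hypothetical small system of four coins: the evenly mixed macrostate
# has the most microstates and is therefore the most probable.
four_coin_counts = [microstates(4, heads) for heads in range(5)]

print(f"P(100 heads) = {float(p_all_heads):.3e}")
print(f"P(100 sixes) = {float(p_all_sixes):.3e}")
print(f"microstates for 0 to 4 heads: {four_coin_counts}")
```

Neither outcome is forbidden, but both probabilities are so small that the ordered outcome is never expected in practice.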
Exercise 10.3

1. The efficiency of a heat engine is the ratio of
A. the thermal energy input to the thermal energy output
B. the thermal energy output to the thermal energy input
C. the work output to the thermal energy input
D. the work output to the thermal energy output

2. A heat engine is most efficient when it works between objects that have a
A. large volume
B. large temperature difference
C. large surface area
D. small temperature difference

3. The four-stroke engine is often said to consist of the suck, squeeze, bang and blow strokes. Describe what these terms relate to.

4. Explain the difference between internal and external combustion engines, and give an example of each.

5. A car engine operates with an efficiency of 34% and it does $8.00 \times 10^3$ J of work each cycle. Calculate
(a) the amount of thermal energy absorbed per cycle at the high-temperature reservoir.
(b) the amount of exhaust thermal energy supplied to the surroundings during each cycle.

6. On a hot day, a person closed all the doors and windows of the kitchen and decided to leave the door of the refrigerator open to cool the kitchen down. What will happen to the temperature of the room over a period of several hours? Give a full qualitative answer.

7. Modern coal-fired power plants operate at a temperature of 520 °C while nuclear reactors operate at a temperature of 320 °C. If the waste heat of the two plants is delivered to a cooling reservoir at 21 °C, calculate the Carnot efficiency of each type of plant. (optional question)
8. It takes $7.80 \times 10^5$ J of thermal energy to melt a given sample of a solid. In the process, the entropy of the system increases by 1740 J K$^{-1}$. Find the melting point of the solid in °C.

9. If 2.00 kg of pure water at 100 °C is poured into 2.00 kg of water at 0 °C in a perfectly insulated calorimeter, what is the net change in entropy? (Assume there is 4.00 kg of water at a final temperature of 50 °C.)

10. Use the concepts of entropy and the arrow of time to explain the biological growth of an organism.

11. You are given six coins which you shake and then throw onto a table. Construct a table showing the number of microstates for each macrostate.

12. Describe the concept of energy degradation in terms of entropy.

13. Using an example, explain the meaning of the term reversal paradox.

14. What is meant by the heat death of the universe?
11 WAVE PHENOMENA

11.1 (SL Option A2) Standing (Stationary) Waves
11.2 (SL Option A3) Doppler Effect
11.3 (SL Option A4) Diffraction
11.4 (SL Option A5) Resolution
11.5 (SL Option A6) Polarization
A very interesting situation arises when considering a travelling wave that is reflected and the reflected wave interferes with the forward moving wave. It can be demonstrated either with a rubber tube, stretched string or a slinky spring attached to a rigid support. In the diagram the tube is attached at A and you set up a wave by moving your hand back and forth at B. If you get the frequency of your hand movement just right then the tube appears to take the shape as shown in Figure 1101. When you move your hand faster you can get the tube to take the shape shown in Figure 1102. All points other than A and B on the tube are of course oscillating but the wave is not moving forward. You can actually get the wave to appear to stand still by illuminating it with a strobe light that flashes at the same frequency of vibration as your hand. The fact that the wave is not progressing is the reason why such waves are called standing or stationary waves.

[Figures 1101 and 1102: diagram 1 and diagram 2, each showing the tube of length L between A and B.]

There are several things to note about standing waves. The fact that the wave is not moving forward means that no energy is being propagated. If you increase the amplitude with which you shake your hand, this increase in energy input to the wave will result in a greater maximum displacement of the tube. It can also be seen that points along the wave oscillate with different amplitudes. In this respect, the amplitude of a standing wave clearly varies along its length. To illustrate this, Figure 1103 shows a standing wave set up in a string of length 30 cm at a particular instant in time. Figure 1104 shows the variation with time t of the amplitude x0 of the string at a point 13 cm along the string.
[Figure 1103: A standing wave.]
A standing wave arises from the interference of two waves moving in opposite directions. To understand this, let us look at the situation of a standing wave in a string or tube as described above. Initially when you start moving the free end of the tube up and down, a wave travels along the tube. When it reaches the fixed end, it is reflected and, as described in Topic 4, the reflected wave is out of phase with the forward (incident) wave. The forward wave and reflected wave interfere and the resultant displacement of the tube is found from the principle of superposition. This is illustrated in Figure 1105 which shows, at a particular instant of time, the displacement of the tube due to the incident wave, the displacement of the tube due to the reflected wave and the resultant displacement due to the interference of the two waves.
[Figure 1104: amplitude x0 vs time t; x0 /m against t /s, for t from 0 to 0.25 s.]
Since energy is not propagated by a standing wave, it doesn't really make a lot of sense to talk about the speed of a standing wave. This speed is the speed of a travelling wave in the string. As we have seen, the speed of a travelling wave is determined by the nature and properties of the material through which it travels. For the string therefore, the speed of the travelling wave in the string determines the frequency with which you have to oscillate the string to produce the standing wave. (From Figure 1104 we see that for this situation the frequency of oscillation of the string is $f = 1/T = 1/0.25 = 4.0$ Hz.) Also, since at any one time all the particles in a standing wave are either moving up or moving down, it follows that all the particles are either in phase or in anti-phase with each other.
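[Figure 1105: displacement d /cm against position x /cm (0 to 30 cm) for the incident wave, the reflected wave and their resultant, with nodes labelled N and antinodes labelled A.]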
If at the fixed end, due to the incident wave, the tube is moving upwards, then due to the reflected wave it will be moving downwards such that the net displacement is always zero. Similarly, at the mid-point of the string, the displacements of the tube due to each wave are always in anti-phase. Hence the net displacement at this point is always zero. Points on a standing wave at which the displacement is always zero are called nodes or nodal points. These are labelled N in Figure 1105. The points at which a standing wave reaches maximum displacement are called antinodes. In Figure 1105, the antinodes are at points one quarter and three-quarters of the length of the tube (7.5 cm and 22.5 cm) and are labelled A. The amplitude of the forward wave and the reflected wave is 10 cm, hence the maximum displacement at an antinode in the tube is 20 cm. The maximum displacement at the antinodes will occur at the times when the forward and reflected waves are in phase.
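The superposition described above can be checked numerically. The sketch below adds a forward and a reflected sinusoidal wave, each of amplitude 10 cm, and finds the largest resultant displacement at several points. The sinusoidal form of the waves and the wavelength of 30 cm (inferred from nodes at 0, 15 and 30 cm) are assumptions made for illustration; the frequency of 4.0 Hz is the value read from Figure 1104.

```python
import math

AMPLITUDE_CM = 10.0    # amplitude of each travelling wave (from the text)
WAVELENGTH_CM = 30.0   # assumed: nodes at 0, 15 and 30 cm imply a 30 cm wavelength
FREQUENCY_HZ = 4.0     # f = 1 / 0.25 s

k = 2 * math.pi / WAVELENGTH_CM     # wave number
omega = 2 * math.pi * FREQUENCY_HZ  # angular frequency


def forward(x_cm, t_s):
    """Displacement due to the forward (incident) wave."""
    return AMPLITUDE_CM * math.sin(k * x_cm - omega * t_s)


def reflected(x_cm, t_s):
    """Displacement due to the reflected wave, travelling the other way."""
    return AMPLITUDE_CM * math.sin(k * x_cm + omega * t_s)


def resultant(x_cm, t_s):
    """Principle of superposition: add the two displacements."""
    return forward(x_cm, t_s) + reflected(x_cm, t_s)


def max_displacement(x_cm, steps=1000):
    """Largest |resultant| at position x_cm over one period, sampled in time."""
    period = 1.0 / FREQUENCY_HZ
    return max(abs(resultant(x_cm, i * period / steps)) for i in range(steps))


for x in (0.0, 7.5, 15.0, 22.5, 30.0):
    print(f"x = {x:4.1f} cm: max displacement = {max_displacement(x):.1f} cm")
```

With these choices the two waves always cancel at the fixed end and at the mid-point (nodes), and reinforce to twice the single-wave amplitude at the quarter points (antinodes).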
Figure 1106 shows an instant in time when the interference of the forward and reflected waves produces an overall displacement of zero in the standing wave.

[Figure 1106: displacement d /cm against position x /cm (0 to 30 cm).]

Exercise 11.1 (a)

Sketch the shape of the forward and reflected wave in a string at an instant in time that results in the antinodes of the standing wave having maximum displacement.

[Figure 1107: the fundamental, 2nd harmonic, 3rd harmonic and 4th harmonic of a stretched string.]

Figure 1107 shows the first four modes of vibration, i.e. standing waves, in the string. The modes of vibration are called harmonics. The first harmonic is called the fundamental. This is the dominant vibration and will in fact be the one that the ear will hear above all the others. This is what enables you to sing in tune with the note emitted by the plucked string.

If we were to vibrate the stretched string at a frequency equal to the fundamental or to one of its harmonics rather than just pluck it, then we set up a standing wave as earlier described. We have used the phenomenon of resonance to produce a single standing wave. In the case of the stretched string it has an infinite number of natural frequencies of oscillations, each corresponding to a standing wave. Hence, when plucked, we obtain an infinite number of harmonics.

Figure 1107 shows part of what is called a harmonic series. Different fundamentals can be obtained by pinching the string along its length and then plucking it or by altering its tension. In a violin for example the four strings are of the same length, but under different tensions so that each produces a different fundamental. The notes of the harmonic series associated with each fundamental are obtained by holding the string down at different places and then bowing it.

The harmonics essentially affect the quality of the note that you hear. The presence of harmonics is one reason why different types of musical instruments sounding a note of the same frequency actually sound different. It is not the only reason that musical instruments have different sound qualities. An A string on a guitar sounds different from the A string of a violin, because they are also produced in different ways and the sound box of each instrument is very different in construction. The actual construction of the violin for example distinguishes the quality of the notes produced by a Stradivarius from those produced by a plastic replica.

Figure 1107 enables us to derive a relationship between the wavelengths of the harmonics and length of the string. If the length of the string is L then clearly, the wavelength of the fundamental (first harmonic) is $\lambda = 2L$, for the second harmonic $\lambda = L$ and for the third harmonic $\lambda = \dfrac{2L}{3}$. From this sequence, we see that the wavelength $\lambda_n$ of the nth harmonic is given by

$\lambda_n = \dfrac{2L}{n}$

Resonance and standing waves also play their part in the production of sound from pipes. If you take a pipe that is open at one end and blow across the top it will produce a sound. By blowing faster you can produce a different sound. In this situation you have used resonance to set up a standing wave in the pipe. It is now the air molecules inside the pipe that are set vibrating. The sound wave that
you create at the open end travels to the bottom of the pipe, is reflected back and then again reflected when it reaches the open end. The waves interfere to produce a standing wave. However, when waves are reflected from an open boundary they do not undergo a phase change so that there is always an antinode at the open end. The fundamental and the first three harmonics for a pipe open at one end are shown in Figure 1108.

[Figure 1108: the 1st harmonic (fundamental), 3rd harmonic and 5th harmonic for a pipe open at one end, with nodes N and antinodes A marked.]

Travelling wave | Standing wave
Single amplitude | Variable amplitude
All phase differences between 0 and 2π | Only 0, 2π and π phase difference

Figure 1110 Comparing travelling and standing waves

Since travelling waves have a single amplitude, it follows that there are no nodal or anti-nodal points in a travelling wave.
A pipe that is open at both ends, and that has the same dimensions as the previous pipe, will produce a different fundamental note and a different harmonic series. This is shown in Figure 1109.

[Figure 1109: the fundamental, 2nd harmonic and 3rd harmonic for a pipe open at both ends, with nodes N and antinodes A marked.]
We see that, whereas a pipe open at both ends produces all the odd and even harmonics of the fundamental $\lambda = 2L$ ($\lambda_n = \dfrac{2L}{n}$), a pipe closed at one end can produce only the odd harmonics of the fundamental $\lambda = 4L$. With open and closed pipes we are essentially looking at the way in which organs, brass instruments and woodwind instruments produce musical sounds.
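The two harmonic series can be compared side by side. The following sketch lists the wavelengths for a pipe open at both ends ($\lambda_n = 2L/n$, n = 1, 2, 3, ...) and for a pipe closed at one end ($\lambda_n = 4L/n$, n = 1, 3, 5, ...). The pipe length of 0.50 m and the function names are illustrative assumptions.

```python
def open_pipe_wavelengths(length_m, count):
    """First `count` harmonic wavelengths for a pipe open at both ends
    (the same series applies to a stretched string): 2L/n, n = 1, 2, 3, ..."""
    return [2.0 * length_m / n for n in range(1, count + 1)]


def closed_pipe_wavelengths(length_m, count):
    """First `count` harmonic wavelengths for a pipe closed at one end:
    4L/n for odd n only (n = 1, 3, 5, ...)."""
    return [4.0 * length_m / n for n in range(1, 2 * count, 2)]


PIPE_LENGTH_M = 0.50  # hypothetical pipe length

open_series = open_pipe_wavelengths(PIPE_LENGTH_M, 3)
closed_series = closed_pipe_wavelengths(PIPE_LENGTH_M, 3)

print("open at both ends:", [round(w, 3) for w in open_series])
print("closed at one end:", [round(w, 3) for w in closed_series])
```

For the same length, the closed pipe has a fundamental wavelength twice that of the open pipe, so its fundamental frequency is half as large.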
$v = \sqrt{\dfrac{T}{\mu}}$

where T is the tension in the string and $\mu$ is the mass per unit length of the string.

(a) Deduce an expression for the frequency f of the fundamental in a string of length L.
(b) Use your answer to (a) to estimate the tension in the A string of a violin (frequency = 440 Hz).
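Solution

(a) For the fundamental, $\lambda = 2L$. Using $f = \dfrac{v}{\lambda}$ we have

$f = \dfrac{1}{2L}\sqrt{\dfrac{T}{\mu}}$

(b) As this is an estimate, we are not looking for an exact value but we do need to make some sensible estimates of L and $\mu$.

L = 0.5 m say and $\mu = 2.0 \times 10^{-3}$ kg m$^{-1}$, i.e. about 2 g per metre. From $f = \dfrac{1}{2L}\sqrt{\dfrac{T}{\mu}}$ we have that

$T = 4L^2 f^2 \mu = 4 \times (0.5)^2 \times (440)^2 \times 2.0 \times 10^{-3} \approx 390\ \text{N}$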
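The estimate can be repeated with other guesses for the string's length and mass per unit length. The sketch below uses the estimated values from the solution (L = 0.5 m and 2.0 × 10⁻³ kg m⁻¹); the function names are illustrative assumptions.

```python
import math


def fundamental_frequency(length_m, tension_n, mass_per_length_kg_m):
    """f = (1 / 2L) * sqrt(T / mu) for the fundamental of a stretched string."""
    return math.sqrt(tension_n / mass_per_length_kg_m) / (2.0 * length_m)


def tension_for_frequency(length_m, frequency_hz, mass_per_length_kg_m):
    """Rearranged form: T = 4 * L**2 * f**2 * mu."""
    return 4.0 * length_m ** 2 * frequency_hz ** 2 * mass_per_length_kg_m


# Estimated values used in the solution above.
tension = tension_for_frequency(0.5, 440.0, 2.0e-3)

# Consistency check: this tension should give back the 440 Hz fundamental.
frequency = fundamental_frequency(0.5, tension, 2.0e-3)

print(f"Estimated tension: {tension:.0f} N")
print(f"Fundamental at that tension: {frequency:.0f} Hz")
```

Because the tension depends on the square of the length and of the frequency, a modest change in the estimated length changes the answer noticeably, which is why only an order-of-magnitude value is expected.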
Exercise 11.1 (b)

1. An organ pipe is closed at one end and produces a fundamental note of frequency 128 Hz.
(a) Calculate
(i) the frequencies of the next two harmonics in the harmonic series of the pipe.
(ii) the frequencies of the corresponding harmonics for an open pipe whose fundamental is 128 Hz.
(iii) the ratio of the length of the closed pipe to that of the open pipe.
(b) Suggest why organ pipes that emit notes at the lower end of the organ's frequency range are usually open pipes.

11.2.6 Outline an example in which the Doppler effect is used to measure speed.
IBO 2007
Figure 1111 Sound waves from a stationary source

Suppose now that the source moves towards A with constant speed v. Figure 1112 (a) shows a snapshot of the new wave pattern.

[Figure 1112 (a): wavefronts from the moving source, with a smaller wavelength on the side towards A and a larger wavelength on the opposite side]

The wavefronts are now crowded together in the direction of travel of the source and stretched out in the opposite direction. This is why the two observers will now hear notes of different frequencies. How much the waves bunch together and how much they stretch out will depend on the speed v. Essentially, $f_A = \frac{c}{\lambda_A}$ and $f_B = \frac{c}{\lambda_B}$, where $\lambda_A < \lambda_B$ and c is the speed of sound.

If the source is stationary and A is moving towards it, then the waves from the source incident on A will be bunched up. If A is moving away from the stationary source then the waves from the source incident on A will be stretched out.

Christian Doppler (1803-1853) actually applied the principle (incorrectly as it happens) to try to explain the colour of stars. However, the Doppler effect does apply to light as well as to sound. If a light source emits light of frequency f, then if it is moving away from an observer the observer will measure the light emitted as having a lower frequency than f. Since the sensation of colour vision is related to the frequency of light (blue light is of a higher frequency than red light), light emitted by objects moving away from an observer is often referred to as being red-shifted, whereas if the object is moving toward the observer it is referred to as blue-shifted. This idea is used in Option E (Chapter 16).

We do not need to consider here the situations where either the source or the observer is accelerating. In a situation, for example, where an accelerating source is approaching a stationary observer, the observer will hear a sound of ever increasing frequency. This sometimes leads to confusion in describing what is heard when a source approaches, passes and then recedes from a stationary observer. Suppose for example that you are standing on a station platform and a train sounding its whistle is approaching at constant speed. What will you hear as the train approaches and then passes through the station? As the train approaches you will hear a sound of constant pitch but increasing loudness. The pitch of the sound will be higher than if the train were stationary. As the train passes through the station you will hear the pitch change, at the moment the train passes you, to a sound again of constant pitch. The pitch of this sound will be lower than the sound of the approaching train and its intensity will decrease as the train recedes from you. What you do not hear is a sound of increasing pitch and then decreasing pitch.
[Figure 1112 (b): the source S moves with speed $v_s$ towards the observer O; S′ marks the position of the source a time Δt later]

In Figure 1112 (b) the observer O is at rest and a source of sound S is moving with constant speed $v_s$ directly towards O. The source is emitting a note of constant frequency f and the speed of the emitted sound is v. S′ shows the position of the source a time Δt later. When the source is at rest, then in a time Δt the observer will receive fΔt waves and these waves will occupy a distance vΔt, i.e.

$$\lambda = \frac{v\Delta t}{f\Delta t} = \frac{v}{f}$$

Because of the motion of the source this number of waves will now occupy a distance $(v\Delta t - v_s\Delta t)$. The new wavelength is therefore

$$\lambda' = \frac{v\Delta t - v_s\Delta t}{f\Delta t} = \frac{v - v_s}{f}$$

If f′ is the frequency heard by O then

$$f' = \frac{v}{\lambda'} \quad\text{or}\quad \lambda' = \frac{v}{f'} = \frac{v - v_s}{f}$$

From which

$$f' = f\,\frac{1}{1 - \dfrac{v_s}{v}}$$
Equation 11.1

If the source is moving away from the observer then

$$f' = f\,\frac{v}{v + v_s} = f\,\frac{1}{1 + \dfrac{v_s}{v}}$$
Equation 11.2

We now consider the case where the source is stationary and the observer is moving towards the source with speed $v_o$. In this situation the speed of the sound waves as measured by the observer will be $v_o + v$. We therefore have that

$$v_o + v = f'\lambda = f'\,\frac{v}{f}$$

From which

$$f' = \left(1 + \frac{v_o}{v}\right) f$$
Equation 11.3

If the observer is moving away from the source then

$$f' = \left(1 - \frac{v_o}{v}\right) f$$

For the observer moving towards the source, the change in frequency is therefore

$$\Delta f = f' - f = \left(1 + \frac{v_o}{v}\right) f - f = \frac{v_o}{v} f$$

The velocities that we refer to in the above equations are the velocities with respect to the medium in which the waves from the source travel. However, when we are dealing with a light source it is the relative velocity between the source and the observer that we must consider. The reason for this is that light is unique in the respect that the speed of the light waves does not depend on the speed of the source. All observers, irrespective of their speed or the speed of the source, will measure the same value for the speed of light. This is one of the cornerstones of the Special Theory of Relativity, which is discussed in more detail in Option H (Chapter 18). When applying the Doppler effect to light we are mainly concerned with the motion of the source. We look here only at the situation where the speed of the source v is much smaller than the speed of light c in free space (v << c). Under these circumstances, when the source is moving towards the observer, equation 11.1 becomes

$$f' - f = \Delta f = \frac{v}{c} f$$
Equation 11.5

and when the source is moving away from the observer, equation 11.2 becomes

$$f' - f = \Delta f = -\frac{v}{c} f$$

Provided that v << c, these same equations apply for a stationary source and moving observer.

Example
A source emits a sound of frequency 440 Hz. It moves in a straight line towards a stationary observer with a speed of 30 m s⁻¹. The observer hears a sound of frequency 484 Hz. Calculate the speed of sound in air.

Solution
We use equation 11.1 and substitute f′ = 484 Hz, f = 440 Hz and $v_s$ = 30 m s⁻¹, such that

$$1 - \frac{30}{v} = \frac{440}{484}$$

to give v = 330 m s⁻¹.

Exercise
11.2 (a)
Judy is standing on the platform of a station. A high speed train is approaching the station in a straight line at constant speed and is sounding its whistle. As the train passes by Judy, the frequency of the sound emitted by the whistle, as heard by Judy, changes from 640 Hz to 430 Hz. Determine
(a) the speed of the train
(b) the frequency of the sound emitted by the whistle as heard by a person on the train.
(Speed of sound = 330 m s⁻¹)
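The moving-source and moving-observer relations (equations 11.1 to 11.3) lend themselves to a quick numerical check. The following Python sketch is illustrative only: the function names are hypothetical and not part of the text, and it assumes motion along the line joining source and observer at speeds below the speed of sound.

```python
def moving_source(f, v_sound, v_source, approaching=True):
    """Frequency heard by a stationary observer (equations 11.1 and 11.2)."""
    # Approaching source: f' = f / (1 - v_s/v); receding source: f' = f / (1 + v_s/v)
    sign = -1.0 if approaching else 1.0
    return f / (1.0 + sign * v_source / v_sound)


def moving_observer(f, v_sound, v_observer, approaching=True):
    """Frequency heard by a moving observer from a stationary source (equation 11.3)."""
    # Approaching observer: f' = (1 + v_o/v) f; receding observer: f' = (1 - v_o/v) f
    sign = 1.0 if approaching else -1.0
    return f * (1.0 + sign * v_observer / v_sound)


def sound_speed_from_shift(f, f_observed, v_source):
    """Rearrange equation 11.1, 1 - v_s/v = f/f', for the speed of sound v."""
    return v_source / (1.0 - f / f_observed)


if __name__ == "__main__":
    # Worked example: 440 Hz source approaching at 30 m/s, heard as 484 Hz
    print(round(sound_speed_from_shift(440.0, 484.0, 30.0)))  # 330
```

Keeping the two cases as separate functions mirrors the text: for sound, a moving source and a moving observer give different shifts even at the same relative speed.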
Example
A particular radio signal from a galaxy is measured as having a frequency of 1.39 × 10⁹ Hz. The same signal from a source in a laboratory has a frequency of 1.42 × 10⁹ Hz. Suggest why the galaxy is moving away from Earth and calculate its recession speed (i.e. the speed with which it is moving away from Earth).

Solution
The fact that the frequency from the moving source is less than that when it is stationary indicates that it is moving away from the stationary observer, i.e. Earth. Using $\Delta f = \frac{v}{c} f$ we have

$$v = \frac{c\,\Delta f}{f} = \frac{3.0\times10^{8} \times 0.03\times10^{9}}{1.42\times10^{9}} = 6.3\times10^{6}\ \text{m s}^{-1}$$

It is usual when dealing with the Doppler effect of light to express speeds as a fraction of c. So in this instance we have v = 0.021 c.

Exercise
11.2 (b)
A galaxy is moving away from Earth with a speed of 0.0500c. The wavelength of a particular spectral line in light emitted by atomic hydrogen in a laboratory is 6.56 × 10⁻⁷ m. Calculate the value of the wavelength of this line, measured in the laboratory, in light emitted from a source of atomic hydrogen in the galaxy.

Figure 1113 Using the Doppler effect to measure speed
[Diagram: a transmitter emits waves of frequency f towards a reflector moving with speed v; the reflector receives frequency f′ and the receiver detects frequency f′′]

We shall consider the situation where v << c, where c is the speed of the waves from the transmitter. For the reflector receiving waves from the transmitter, it is effectively an observer moving towards a stationary source. From equation (11.4), it therefore receives waves that have been Doppler shifted by an amount

$$f' - f = \frac{v}{c} f$$
Equation 11.6

For the receiver receiving waves from the reflector, it is effectively a stationary observer receiving waves from a moving source. From equation (11.5), it therefore receives waves that have been Doppler shifted by an amount

$$f'' - f' = \frac{v}{c} f'$$
Equation 11.7

If we add equations (11.6) and (11.7) we get that the total Doppler shift at the receiver Δf is

$$\Delta f = f'' - f = \frac{v}{c} f' + \frac{v}{c} f$$

But $f' = \left(1 + \frac{v}{c}\right) f$, hence

$$\Delta f = f\left(1 + \frac{v}{c}\right)\frac{v}{c} + \frac{v}{c} f$$

But since v << c, we can ignore the term $\frac{v^2}{c^2}$ when we expand the bracket in the above equation. Therefore we have

$$\Delta f = \frac{2v}{c} f$$
Equation 11.8

If v ≈ c then we must use the full Doppler equations. However, for em radiation we will always only consider situations in which v << c.

Example
The speed of sound in blood is 1.500 × 10³ m s⁻¹. Ultrasound of frequency 1.00 MHz is reflected from blood flowing in an artery. The frequency of the reflected waves received back at the transmitter is 1.05 MHz. Estimate the speed of the blood flow in the artery.

Solution
Using equation 11.8,

$$0.05\times10^{6} = \frac{2v}{1.5\times10^{3}} \times 10^{6}$$

to give v ≈ 38 m s⁻¹. (We have assumed that the ultrasound is incident along the direction of the blood flow.)

for the position of the first minimum of the diffraction pattern produced at a single slit.
11.3.3 Solve problems involving single-slit diffraction.
IBO 2007

When plane wavefronts pass through a small aperture they spread out as discussed in Topics 4.5.3 and 4.5.4. This is an example of the phenomenon called diffraction. Light waves are no exception to this and ways of observing the diffraction of light have also been discussed in the aforementioned topics. However, when we look at the diffraction pattern produced by light we observe a fringe pattern, that is, on the screen there is a bright central maximum with secondary maxima either side of it. There are also regions where there is no illumination and these minima separate the maxima. If we were to actually plot how the intensity of illumination varies along the screen then we would obtain a graph similar to that in Figure 1114.

Figure 1114 Intensity distribution for single-slit diffraction

We would get the same intensity distribution if we were to plot the intensity against the angle of diffraction θ. (See next section). This intensity pattern arises from the fact that each point on the slit acts, in accordance with Huygens' principle, as a source of secondary wavefronts. It is the interference between these secondary wavefronts that produces the typical diffraction pattern.
infinite distance away from the slit. This can be achieved with the set up shown in Figure 1115.

Figure 1115 Apparatus for viewing Fraunhofer diffraction
[Diagram: source, lens 1, single slit, lens 2 and screen arranged in line]

The source is placed at the principal focus of lens 1 and the screen is placed at the principal focus of lens 2. Lens 1 ensures that parallel wavefronts fall on the single slit and lens 2 ensures that the parallel rays are brought to a focus on the screen. The same effect can be achieved using a laser and placing the screen some distance from the slit. If the light and screen are not an infinite distance from the slit then we are dealing with a phenomenon called Fresnel diffraction and such diffraction is very difficult to analyse mathematically. To obtain a good idea of how the single slit pattern comes about we consider the diagram in Figure 1116.

Figure 1116
[Diagram: slit XY of width b, a point P on the screen a distance d from the centre of the pattern, the angle θ₁, and the focal length f of the lens]

In particular we consider the light from one edge of the slit to the point P, where this point is just one wavelength further from the lower edge of the slit than it is from the upper edge. The secondary wavefront from a point at the centre of the slit will travel a distance λ/2 further than a secondary wavefront from the upper edge. Hence when these wavefronts arrive at P they will be out of phase and will interfere destructively. The wavefronts from the next point below the upper edge will similarly interfere destructively with the wavefront from the next point below the centre of the slit. In this way we can pair the sources across the whole width of the slit. If the screen is a long way from the slit then the angles θ₁ and θ₂ become nearly equal. (If the screen is at infinity then they are equal and the two lines PX and XY are at right angles to each other). From Figure 1116 we see therefore that there will be a minimum at P if

$$\lambda = b\sin\theta_1$$

where b is the width of the slit.

However, both angles are very small, equal to θ say, where θ is the angle of diffraction. So it can be written

$$\theta = \frac{\lambda}{b}$$

This actually gives us the half-angular width of the central maximum. We can calculate the actual width of the maximum along the screen if we know the focal length of the lens focussing the light onto the screen. If this is f then we have that

$$d = f\theta$$

such that

$$d = \frac{f\lambda}{b}$$

To obtain the position of the next maximum in the pattern we note that the path difference is $\frac{3\lambda}{2}$. We therefore divide the slit into three equal parts, two of which will produce wavefronts that will cancel and the other producing wavefronts that reinforce. The intensity of the second maximum is therefore much less than the intensity of the central maximum. (Much less than one third in fact, since the wavefronts that reinforce will have differing phases).

We can also see now how diffraction effects become more and more noticeable the narrower the slit becomes. If light of wavelength 430 nm were to pass through a slit of width, say, 10 cm and fall on a screen 3.0 m away, then the half-angular width of the central maximum would be 4.3 × 10⁻⁶ rad, corresponding to a half-width on the screen of only about 13 μm. There will be lots of maxima of nearly the same intensity and the maxima will be packed very closely together. (The first minimum occurs at a distance of about 13 μm from the centre of the central maximum and the next occurs effectively at a distance of about 26 μm). We effectively observe the geometric pattern. Refer to Example 1.

We also see now how, for diffraction effects to be noticeable, the wavelength must be of the order of the slit width. The width of the pattern increases in proportion to the wavelength and decreases inversely with the width of the slit. If the slit width is much greater than the wavelength then the width of the central maximum is very small.
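The single-slit relations θ = λ/b and d = fλ/b can be evaluated directly. The short Python sketch below is an illustration under the small-angle assumption used in the text; the function names are hypothetical. It compares a wide slit (10 cm) with a narrow one (0.10 mm) for light of wavelength 430 nm and a screen 3.0 m away.

```python
def half_angle(wavelength, slit_width):
    """Half-angular width of the central maximum, theta = lambda / b (radians)."""
    return wavelength / slit_width


def half_width_on_screen(wavelength, slit_width, distance):
    """Distance d = f * lambda / b from the centre of the pattern to the first minimum."""
    return distance * half_angle(wavelength, slit_width)


if __name__ == "__main__":
    wavelength = 430e-9  # m
    for slit_width in (0.10, 1.0e-4):  # 10 cm and 0.10 mm, in metres
        theta = half_angle(wavelength, slit_width)
        d = half_width_on_screen(wavelength, slit_width, 3.0)
        print(f"b = {slit_width:.1e} m: theta = {theta:.2e} rad, d = {d:.2e} m")
```

Narrowing the slit by a factor of 1000 widens the central maximum by the same factor, which is the inverse dependence on slit width described above.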
Our discussion thus far has been for rectangular slits. What is the half-angular width of the central maximum of the diffraction pattern formed by a circular aperture? This is not easy to calculate since it involves some advanced mathematics. The problem was first solved by the English Astronomer Royal, George Airy, in 1835, who showed that for circular apertures $\theta = \frac{1.22\lambda}{b}$, where b is the diameter of the aperture.

Example
Light from a laser is used to form a single slit diffraction pattern. The width of the slit is 0.10 mm and the screen is placed 3.0 m from the slit. The width of the central maximum is measured as 2.6 cm. Calculate the wavelength of the laser light.

Solution
Since the screen is a long way from the slit we can use the small angle approximation such that f is equal to 3.0 m. The half-width of the central maximum is 1.3 cm so we have

$$\lambda = \frac{bd}{f} = \frac{1.0\times10^{-4} \times 1.3\times10^{-2}}{3.0} = 4.3\times10^{-7}\ \text{m}$$

Example
In the following diagram, parallel light from a distant point source (such as a star) is brought to focus on the screen S by a converging lens (the lens is shown as a vertical arrow). The focal length (distance from lens to screen) is 25 cm and the diameter of the lens is 3.0 cm. The wavelength of the light from the star is 560 nm. Calculate the diameter of the image on the screen.

[Diagram: lens and screen S, 25 cm apart]

Solution
The lens actually acts as a circular aperture of diameter 3.0 cm. The half-angular width of the central maximum of the diffraction pattern that it forms on the screen is

$$\theta = \frac{1.22\lambda}{b} = \frac{1.22 \times 5.6\times10^{-7}}{3.0\times10^{-2}} = 2.28\times10^{-5}\ \text{rad}$$

The half-width of the central maximum on the screen is therefore fθ = 0.25 × 2.28 × 10⁻⁵ = 5.7 × 10⁻⁶ m, so the diameter of the image is about 1.1 × 10⁻⁵ m. Although this is small, it is still finite and is effectively the image of the star, as the intensity of the secondary maxima is small compared to that of the central maximum.

Exercise
11.3
1. A parallel beam of light of wavelength 500 nm is incident on a slit of width 0.25 mm. The light is brought to focus on a screen placed 1.50 m from the slit. Calculate the angular width and the linear width of the central diffraction maximum.
2. Light from a laser is used to form a single slit diffraction pattern on a screen. The width of the slit is 0.10 mm and the screen is 3.0 m from the slit. The width of the central diffraction maximum is 2.6 cm. Calculate the wavelength of the laser light.
11.4.3 Describe the significance of resolution in the development of devices such as CDs and DVDs, the electron microscope and radio telescopes.
11.4.4 Solve problems involving resolution. Problems could involve the human eye and optical instruments.
IBO 2007

Figure 1117
[Diagram: light from the sources S1 and S2 passes through the pupil and forms central maxima at P1 and P2 on the retina]

Light from the source S1 enters the eye and is diffracted by the pupil such that the central maximum of the diffraction pattern is formed on the retina at P1. Similarly, light from S2 produces a maximum at P2. If the two central maxima are well separated then there is a fair chance that we will see the two sources as separate sources. If they overlap then we will not be able to distinguish one source from another. From the diagram we see that as the sources are moved closer to the eye, the angle θ increases and so does the separation of the central maxima. Figures 1118, 1119, 1120 and 1121 show the different diffraction patterns, and the intensity distributions, that might result on the retina as a result of light from two point sources.

Figure 1118 Well resolved
Figure 1119 Well resolved
Figure 1120 Just resolved
Figure 1121 Not resolved

where b is the width of the slit through which the light from the sources passes. However, we see from Figure 1117 that θ is the angle that the two sources subtend at the slit. Hence we conclude that two sources will be resolved by a slit if the angle that they subtend at the slit is greater than or equal to

$$\theta = \frac{\lambda}{b}$$

So far we have been assuming that the eye is a rectangular slit whereas clearly it is a circular aperture and so we must use the formula

$$\theta = \frac{1.22\lambda}{b}$$

As mentioned above the angle θ is sometimes called the resolving power but should more accurately be called the minimum angle of resolution ($\theta_{min}$). Clearly the smaller θ, the greater the resolving power.

Radio telescopes
The average diameter of the pupil of the human eye is about 2.5 mm. This means that two point sources emitting light of wavelength 500 nm will just be resolved by the eye if their angular separation at the eye is

$$\theta = 1.22 \times \frac{5.0\times10^{-7}}{2.5\times10^{-3}} = 2.4\times10^{-4}\ \text{rad}$$

If the eye were to be able to detect radio waves of wavelength 0.15 m, then to have the same resolving power the pupil would have to have a diameter of about 750 m. Clearly this is nonsense, but it does illustrate a problem facing astronomers who wish to view very distant objects such as quasars and galaxies (see Option E) that emit radio waves. Conventional radio telescopes consist of a large dish, typically 25 m in diameter. Even with such a large diameter, the radio wavelength resolving power of the telescope is much less than the optical resolving power of the human eye. Let us look at an example.
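The estimates for the eye and for radio waves follow from the circular-aperture formula θ = 1.22λ/b. The Python sketch below simply repeats that arithmetic; the function names are hypothetical and the numbers are those quoted in the text.

```python
def min_angle_circular(wavelength, diameter):
    """Minimum angle of resolution of a circular aperture, theta = 1.22 * lambda / b."""
    return 1.22 * wavelength / diameter


def diameter_for_angle(wavelength, angle):
    """Aperture diameter needed to reach a given minimum angle of resolution."""
    return 1.22 * wavelength / angle


if __name__ == "__main__":
    # Human eye: pupil diameter 2.5 mm, light of wavelength 500 nm
    theta_eye = min_angle_circular(500e-9, 2.5e-3)
    print(f"eye: {theta_eye:.2e} rad")
    # Aperture needed for the same resolution with 0.15 m radio waves
    print(f"radio aperture: {diameter_for_angle(0.15, theta_eye):.0f} m")
    # A typical 25 m dish at the same radio wavelength
    print(f"25 m dish: {min_angle_circular(0.15, 25.0):.2e} rad")
```

The last line shows why a single 25 m dish resolves far less detail at radio wavelengths than the eye does at optical wavelengths.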
Example
The galaxy Cygnus A can be resolved optically as an elliptically shaped galaxy. However, it is also a strong emitter of radio waves of wavelength 0.15 m. The galaxy is estimated to be 5.0 × 10²⁴ m from Earth. Use of a radio telescope shows that the radio emission is from two sources separated by a distance of 3.0 × 10²¹ m. Estimate the diameter of the dish required to just resolve the sources.

Solution
The angle that the two sources subtend at the telescope is

$$\theta = \frac{3.0\times10^{21}}{5.0\times10^{24}} = 6.0\times10^{-4}\ \text{rad}$$

and

$$d = \frac{1.22\lambda}{\theta} = \frac{1.22 \times 0.15}{6.0\times10^{-4}} \approx 3.1\times10^{2}\ \text{m}$$

A steerable radio telescope dish of this size would be impractical to make, let alone support. This shows that a single dish type radio telescope cannot be used to resolve the sources and yet they were resolved. To get round the problem, astronomers use two radio telescopes separated by a large distance. The telescopes view the same objects at the same time and the signals that each receives from the objects are simultaneously superimposed. The result of the superposition of the two signals is a two-slit interference pattern (see section 4.5.6). The pattern has much narrower fringe spacing than that of the diffraction pattern produced by either telescope on its own, hence producing a much higher resolving power. When telescopes are used like this, they are called a stellar interferometer.

In Socorro in New Mexico there is a stellar interferometer that consists of 27 parabolic dishes each of diameter 25 m, arranged in a Y-shape that covers an area of 570 km². This is a so-called Very Large Array (VLA). Even higher resolution can be obtained by using an array of radio telescopes in observatories thousands of kilometres apart. A system that uses this so-called technique of very-long-baseline interferometry (VLBI) is known as a very-long-baseline array (VLBA). With VLBA, a radio wavelength resolving power can be achieved that is 100 times greater than that of the best optical telescopes. Even higher resolving power can be achieved by using a telescope that is in a satellite orbiting Earth. Such a system was launched in February 1997 by Japan's Institute of Space and Astronautical Science (ISAS). This project is backed by the National Astronomical Observatory of Japan; the National Science Foundation's National Radio Astronomy Observatory (NRAO); the Canadian Space Agency; the Australia Telescope National Facility; the European VLBI Network and the Joint Institute for Very Long Baseline Interferometry in Europe. This project is a very good example of how internationalism can operate in Physics.

Exercise
11.4 (a)
It is suggested that using the ISAS, VLBA, it would be possible to see a grain of rice at a distance of 5000 km. Estimate the resolving power of the VLBA.

Electron microscope
Telescopes are used to look at very distant objects that are very large but, because of their distance from us, appear very small. Microscopes, on the other hand, are used to look at objects that are close to us but are physically very small. As we have seen, just magnifying objects, that is making them appear larger, is not sufficient on its own to gain detail about the object; for detail, high resolution is needed. Figure 1122 is a schematic of how an optical microscope is used to view an object and Figure 1123 is a schematic of a transmission electron microscope (TEM).

Figure 1122
[Diagram: bright light source, glass slide containing specimen (object), optical lens system, eye]

Figure 1123
In the optical microscope, the resolving power is determined by the particular lens system employed and the wavelength λ of the light used. For example, two points in the sample separated by a distance d will just be resolved if

$$d = \frac{\lambda}{2m}$$

where m is a property of the lens system known as the numerical aperture. In practice the largest value of m obtainable is about 1.6. Hence, if the microscope slide is illuminated with light of wavelength 480 nm, a good microscope will resolve two points separated by a distance d ≈ 1.5 × 10⁻⁷ m ≈ 0.15 μm. Points closer together than this will not be resolved. However, this is good enough to distinguish some viruses such as the Ebola virus. Clearly, the smaller λ the higher the resolving power, and this is where the electron microscope comes to the fore. The electron microscope makes use of the wave nature of electrons (see 13.1.5). In the TEM, electrons pass through a wafer thin sample and are then focussed by a magnetic field onto a fluorescent screen or CCD (charge coupled device, see 14.2). Electrons used in an electron microscope have wavelengths typically of about 5 × 10⁻¹² m. However, the numerical aperture of electron microscopes is considerably smaller than that of an optical microscope, typically about 0.02. Nonetheless, this means that a TEM can resolve two points that are about 0.13 nm apart. This resolving power is certainly high enough to make out the shape of large molecules. Another type of electron microscope uses a technique by which electrons are scattered from the surface of the sample. The scattered electrons are then focussed as in the TEM to form an image of the surface. These so-called scanning electron microscopes (SEM) have a lower resolving power than TEMs but give very good three dimensional images.

The eye
We saw in the last section that the resolving power of the human eye is about 2 × 10⁻⁴ rad. Suppose that you are looking at car headlights on a dark night and the car is a distance D away. If the separation of the headlights is, say, 1.5 m then the headlights will subtend an angle $\theta = \frac{1.5}{D}$ at your eye. Assuming an average wavelength of 500 nm, your eye will resolve the headlights into two separate sources if this angle equals 2 × 10⁻⁴ rad. This gives D = 7.5 km. In other words, if the car is approaching you on a straight road then you will be able to distinguish the two headlights as separate sources when the car is 7.5 km away from you. Actually, because of the structure of the retina and optical defects, the resolving power of the average eye is about 6 × 10⁻⁴ rad. This means that the car is more likely to be 2.5 km away before you resolve the headlights.

Exercise
11.4 (b)
The distance from the eye lens to the retina is 20 mm. The light receptors in the central part of the retina are about 5 × 10⁻⁶ m apart. Determine whether the spacing of the receptors will allow for the eye to resolve the headlights in the above discussion when they are 2.5 km from the eye.

Astronomical telescope
Let us return to the example of the binary stars discussed at the beginning of this section on resolution. The stars Kruger A and B form a binary system. The average separation of the stars is 1.4 × 10¹² m and their average distance from Earth is 1.2 × 10¹⁷ m. When viewed through a telescope on Earth, the system will therefore subtend an angle

$$\theta = \frac{1.4\times10^{12}}{1.2\times10^{17}} = 1.2\times10^{-5}\ \text{rad}$$

at the objective lens of the telescope. Assuming that the average wavelength of the light emitted by the stars is 500 nm, then if the telescope is to resolve the system into two separate images it must have a minimum diameter D where

$$1.2\times10^{-5} = \frac{1.22 \times 5.00\times10^{-7}}{D}$$

This gives D = 0.050 m, which is about 5 cm. So this particular system is easily resolved with a small astronomical telescope.

Exercise
11.4 (c)
The diameter of Pluto is 2.3 × 10⁶ m and its average distance from Earth is 6.0 × 10¹² m. Estimate the minimum diameter of the objective of a telescope that will enable Pluto to be seen as a disc as opposed to a point source.
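The worked estimates in this section (microscope, car headlights and the binary star system) all reduce to one-line calculations. The Python sketch below collects them; it is only an illustration using the values quoted in the text, and the function names are hypothetical.

```python
def microscope_limit(wavelength, numerical_aperture):
    """Smallest resolvable separation for a microscope, d = lambda / (2 m)."""
    return wavelength / (2.0 * numerical_aperture)


def distance_when_resolved(separation, min_angle):
    """Distance at which two sources a given separation apart subtend min_angle."""
    return separation / min_angle


def min_objective_diameter(wavelength, separation, distance):
    """Smallest circular objective that resolves two sources, D = 1.22 * lambda / theta."""
    theta = separation / distance  # small-angle approximation
    return 1.22 * wavelength / theta


if __name__ == "__main__":
    print(microscope_limit(480e-9, 1.6))     # optical microscope, about 1.5e-7 m
    print(microscope_limit(5e-12, 0.02))     # TEM, about 1.3e-10 m
    print(distance_when_resolved(1.5, 2e-4)) # headlights, ideal eye: 7500 m
    print(distance_when_resolved(1.5, 6e-4)) # headlights, average eye: 2500 m
    print(min_objective_diameter(500e-9, 1.4e12, 1.2e17))  # binary stars, about 0.05 m
```

The telescope result differs slightly from the 0.050 m in the text because the code does not round θ to two significant figures before dividing.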
11.5.4 Explain the terms polarizer and analyser.
11.5.5 Calculate the intensity of a transmitted beam of polarized light using Malus' law.
11.5.6 Describe what is meant by an optically active substance.
11.5.7 Describe the use of polarization in the determination of the concentration of certain solutions.
11.5.8 Outline qualitatively how polarization may be used in stress analysis.
11.5.9 Outline qualitatively the action of liquid-crystal displays (LCDs).
11.5.10 Solve problems involving the polarization of light.
IBO 2007

Figure 1124
[Diagram: the electric field components EV and EH]

Light in which the plane of vibration of the electric vector is continually changing is said to be unpolarized. When light passes through natural crystals of tourmaline and of calcite, and through certain synthetic materials, only the EV or the EH component (depending on one's viewpoint) is transmitted, a process called preferential absorption. Because of this, the emergent light is said to be plane polarized.

Figure 1125
[Diagram: unpolarised light entering, and polarised light leaving, a sheet of polaroid]

Figure 1125 shows how unpolarized light, upon entering and leaving a sheet of a synthetic material called a polaroid, is polarized. Through the process of preferential absorption, the vibrations that are perpendicular to the transmission plane of the polaroid (the EH components) are removed and the light emerges polarized in the vertical plane. If the polaroid is rotated through 90° the EV components of the unpolarized light are removed and the EH components are transmitted.
that polarized sunglasses cut down glare from the surface of water. Figure 1126 shows a ray of unpolarized light being reflected and transmitted (refracted) at the surface of water. The components of the electric field vector are shown by the dots and arrows.

Figure 1126 Partial polarization of reflected light at the surface of water
[Diagram: unpolarized light incident from air on water, partially polarized reflected light and refracted light]

Figure 1127 Complete polarization of reflected light at the surface of water
[Diagram: unpolarized light incident from air on water at angle i, completely plane polarized reflected light and light refracted at angle r]

In Figure 1126 the unpolarized beam strikes the water at a certain angle of incidence and is partially polarized on reflection (the refracted beam is also partially polarized but to a much lesser extent than the reflected light). In Figure 1127, at a particular angle where the reflected ray is perpendicular to the refracted ray, the reflected ray is completely plane-polarized. The angle to the normal at which this occurs is called the Brewster angle or the polarising angle after its discoverer David Brewster, a Scottish physicist (1781-1868).

Figure 1128 Illustrating Brewster's angle

In Figure 1128 the light is incident at the Brewster angle φ. From the diagram we see that the refracted angle r = (90° − φ) such that sin r = sin (90° − φ) = cos φ. The incident angle i = φ. From the definition of refractive index n there is therefore

$$n = \frac{\sin i}{\sin r} = \frac{\sin\phi}{\cos\phi} \quad\text{or}\quad n = \tan\phi$$

This is known as Brewster's law. For water of refractive index 1.3, the Brewster angle φ = tan⁻¹(1.3) = 52°. This means that if you look at the surface of water at an angle of 38° to the surface, the light reflected from the surface to your eyes will be plane polarized. If you are wearing Polaroid sunglasses then the only light entering your eyes will be light originating from below the surface of the water. This phenomenon is put to good use by anglers. We have assumed in deriving Brewster's law that the incident light is in air. If the incident light is in a medium of refractive index n₁ and is incident on the surface of a medium of refractive index n₂, then Brewster's law becomes

$$\tan\phi = \frac{n_2}{n_1}$$

Since the refractive index is different for different wavelengths (Topic G.1.3), the Brewster angle will vary with wavelength. However, for substances such as glass and water, the angle does not change very much over the visible spectrum, as the example below shows.
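Brewster's law, tan φ = n₂/n₁, is easily evaluated numerically. The Python sketch below is a minimal illustration; the function name is hypothetical, and the default n₁ = 1.0 assumes that the incident light is in air.

```python
import math


def brewster_angle_deg(n2, n1=1.0):
    """Brewster (polarising) angle in degrees for light in medium n1 meeting medium n2."""
    return math.degrees(math.atan(n2 / n1))


if __name__ == "__main__":
    phi = brewster_angle_deg(1.3)  # air to water, refractive index 1.3
    print(f"Brewster angle: {phi:.1f} degrees from the normal")
    print(f"Angle to the surface: {90.0 - phi:.1f} degrees")
```

Using `math.atan` on the ratio of refractive indices keeps the air-to-medium case (n₁ = 1) and the general two-media case in one function.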
Example
The refractive index for crown glass for red light of wavelength 660 nm is 1.52 and for violet light of wavelength 480 nm is 1.54. Calculate the difference in the Brewster angle for these two wavelengths.

Solution
For red light φ = tan⁻¹(1.52) = 56.7° and for violet light φ = tan⁻¹(1.54) = 57.0°. The difference in the Brewster angle for the two wavelengths is therefore only about 0.3°.

When light is incident on a surface, the electric field vector of the light sets the electrons in the surface into oscillation. The radiation from these oscillating electrons is the origin of the reflected light. If the reflected light is observed at 90° to the refracted light, only the vibrations of the electric field vector that are perpendicular to the plane of incidence will be in the reflected beam. This is because the components of electric field in the plane of incidence cannot have a component at 90° to this plane. Hence at the Brewster angle, the reflected light is plane polarized. Clearly, longitudinal waves cannot be polarized. However, remember that all other electromagnetic waves such as radio and microwaves can be polarized.

Figure 1129 Crossed polaroids
[Diagram: unpolarised light of initial intensity I = I₀ is incident on the polariser; the intensity between polariser and analyser is I = ½I₀; the intensity after the analyser is I = 0]

The analyser can be rotated from 0° to 90° to reduce the emerging light intensity from a maximum to zero. When the intensity is zero the polarizer and analyser are said to be crossed. Polaroid is a material composed of sheets of nitro-cellulose containing crystals of iodosulfate arranged as long molecules. These long molecules reflect and preferentially absorb electric vector components along their length. As mentioned above, this material is used in sunglasses to reduce glare.

Malus' law
In the situation shown in Figure 1129, the intensity of the light incident on the polarizer is I₀ and on the analyser is ½I₀. The intensity of the light after transmission through the analyser is zero. Let us now consider the case where the transmission axis of the analyser is inclined at an angle θ to the direction of the field vector of the light incident on the analyser. The electric field vector of the light incident on the analyser may be resolved into a component parallel to the transmission plane of the analyser and one at right angles to it, as shown in Figure 1130. In this Figure the amplitude of the light is considered to be proportional to the amplitude of the electric field.

Figure 1130
[Diagram: incident light of amplitude A₀ at an angle θ to the transmission plane of the analyser]

Clearly the component that is at right angles to the transmission plane of the analyser will not be transmitted. The amplitude A₁ of the component parallel to the transmission plane is given by

$$A_1 = A_0\cos\theta$$
But the intensity of the light is proportional to the square of the amplitude, therefore we can write that

$$I \propto A_0^2\cos^2\theta$$

We have then that

$$I = I_0\cos^2\theta$$

where I₀ is the intensity of the polarized light (which is half the intensity of the unpolarized light) and I the intensity of the transmitted polarized light. This is Malus' law, named after Etienne Malus (1775-1812) who discovered, by accident, polarization of light by reflection.
polaroid
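As a quick numerical check of Malus' law, the following is a minimal sketch in Python. The function name `malus_intensity` and the value of 3.0 W m⁻² for the polarized intensity are illustrative assumptions, not values taken from the figures.

```python
import math

def malus_intensity(i0, angle_deg):
    """Intensity transmitted by the analyser, I = I0 * cos^2(theta)."""
    return i0 * math.cos(math.radians(angle_deg)) ** 2

# Assumed polarized intensity incident on the analyser, in W m^-2.
i0 = 3.0
for angle in (0, 30, 60, 90):
    print(f"theta = {angle:2d} deg, I = {malus_intensity(i0, angle):.2f} W m^-2")
```

The transmitted intensity falls from its maximum at 0° to zero at 90°, which is the crossed arrangement described above.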
Figure 1131 The principle of a polarimeter

Monochromatic light is polarized by the polaroid A and after passing through the sugar solution, the plane of polarization has been rotated by the sugar molecules. The degree of rotation is determined by the number of sugar molecules present and the length of the path traversed by the polarized light. The polaroids A and B are initially crossed such that when no solution is present, no light is transmitted by B. When a solution of sugar is added, light will now be transmitted by polaroid B since the solution has rotated the plane of polarization. The polaroid B may be rotated and is also provided with an angular scale. By rotating B until no light is transmitted, the degree of rotation of the plane of polarization can be measured. If a polarimeter of length l with a concentration of the sugar solution C produces an angle of rotation θ, then we can define a specific angle of rotation S from

$S = \frac{\theta}{lC}$

The angle S is often referred to as the optical activity. Because it also depends on the temperature of the solution and wavelength of light, measurements of optical activity are usually standardised at 20 °C and 589 nm (the wavelength of the D-line in the line emission spectrum of sodium). The phenomenon of optical activity is used in the sugar industry to measure the concentrations of syrups and it is also being developed to measure blood sugar levels in people with diabetes. Chemists also use it to identify the presence of certain substances present in solutions.

Plane polarized light is incident on a polaroid at right angles to the plane of the polaroid. The transmission axis of the polaroid is parallel to the plane of polarization of the incident light. Sketch a graph to show the variation with angle of the intensity of the transmitted light as the plane of the polaroid is rotated through 360°.
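The relation between rotation, path length and concentration can be sketched numerically. This is a hedged illustration in Python: the function names, the tube length of 2.0 dm, the calibration concentration and both rotation angles are hypothetical values chosen for the example, not data from the text.

```python
def specific_rotation(theta_deg, length, concentration):
    """Specific angle of rotation, S = theta / (l * C)."""
    return theta_deg / (length * concentration)

def concentration_from_rotation(theta_deg, length, s):
    """Rearranged form, C = theta / (S * l), for an unknown solution."""
    return theta_deg / (s * length)

# Hypothetical calibration: a 0.10 g cm^-3 solution in a 2.0 dm tube rotates 13.3 degrees.
s = specific_rotation(13.3, 2.0, 0.10)

# Hypothetical measurement: an unknown solution in the same tube rotates 8.0 degrees.
c_unknown = concentration_from_rotation(8.0, 2.0, s)
print(f"S = {s:.1f} deg dm^-1 (g cm^-3)^-1, C = {c_unknown:.4f} g cm^-3")
```

A calibration of this kind is only meaningful if temperature and wavelength are the same for both measurements, as the standard conditions above require.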
Figure 1132
When the stress is increased, the concentration of the optical activity increases. A series of light and dark coloured bands are seen that can be analysed. Plastic models of objects are made to determine any places that might cause mechanical breakdown.

Figure 1133 The principle of a liquid crystal display (LCD) (labels: P1, LC, P2)

Incident light is polarized by the polarizer P1. E is an indium-tin oxide electrode and G is a piece of glass upon which there is another electrode in the shape of the final display (e.g. a letter or a number). P2 is the analyser and M is a mirror. In the absence of an electric field, the LC rotates the plane of polarization such that the light is transmitted by P2 and reflected back to the observer by the mirror M. When a potential difference is applied to the electrodes, the resulting field untwists the nematic crystal such that the plane of polarization of the light is no longer rotated. However, only those parts of the LC outlined by the shape on the electrode will be affected such that the field of view will now contain a black area corresponding to the shape of the electrode on G. By varying the strength of the applied field and hence the degree of twist of the LC, the displayed shape can be varied from different shades of grey through to black. Whole pictures can be displayed by breaking the picture down into small areas called picture elements or pixels and using a LC for each pixel.

Example

Unpolarized light of intensity 6.0 W m⁻² is incident on a polarizer. The light transmitted by the polarizer is then incident on an analyser. The angle between the transmission axes of the polarizer and analyser is 60°. Calculate the intensity of the light transmitted by the analyser.

Solution

The polarizer transmits half of the unpolarized intensity, so I0 = 3.0 W m⁻². From Malus' law, I = I0 cos²θ = 3.0 × cos² 60° = 0.75 W m⁻².

1. Light reflected from the surface of a material in air at an angle of 56.5° to the normal is completely polarized. Calculate the angle of refraction of the glass.
2. Outline how the concentration of sugar solutions may be determined using polarized light.
3. Describe qualitatively, the operation of a liquid crystal display and suggest one reason why there has been such a world-wide proliferation of LCDs.
12 ELECTROMAGNETIC INDUCTION

12.1 Induced electromotive force (emf)
12.2 Alternating current
12.3 Transmission of electric power
TOK Introduction

In the chapter on electrostatics, and in this chapter on electromagnetism, mention is made of Michael Faraday. The laws of electricity and magnetism owe more perhaps to the experimental work of Faraday than any other person. There were great theoreticians like Ampère and Maxwell but Faraday was a real experimenter. He invented the first dynamo, electric motor and transformer. It was Faraday who originated the use of electric field lines, which he called lines of force, even before the concept of the electric field was clearly understood. He along with Joseph Henry discovered electromagnetic induction, and this concept will be expanded on in this chapter. Electromagnetic induction has revolutionised the way we live. This phenomenon has had a huge impact on society and it has become the basis for the generation of electric power that we so often take for granted in our everyday life.

Michael Faraday (1791-1867), the son of a blacksmith, was born in Newington, Surrey. He had little formal education as a child and at the age of 14, he took up an apprenticeship as a bookbinder. While rebinding a copy of the Encyclopaedia Britannica, he happened to read an article on electricity, and by his own admission, this article gave him a lifelong fascination with science.

He started to attend lectures given by Sir Humphry Davy, a famous electrochemist and publicist. Faraday became interested in electrolysis and he prepared a set of lecture notes that greatly impressed Davy. By good fortune or misfortune, when Davy was temporarily blinded in a laboratory accident at the Royal Institution in 1812, he needed a laboratory assistant and he requested that Faraday be given the position. During this time, Faraday discovered and described the organic compound benzene as well as some other chloro-carbon compounds. He did research on steel, optical glass and the liquefaction of gases. Faraday's work was impressive, and he eventually became director of the Royal Institution.

Faraday had a great talent for explaining his ideas to both children and adults. He gave many whizz-bang lectures to the young, and his book addressed at their level called The Chemical History of a Candle is still in print. He introduced the Friday Evening Discourses and the Christmas lectures for children at the Royal Institution, and these lectures still continue to this day. In 1865, he retired from the Royal Institution after 50 years' service.

In the 1830s, Faraday became interested in electrochemistry and he was the first to use the term electrolysis in 1832. Furthermore, he introduced the use of the terms electrolyte, cell, electrodes and electrochemical reaction so commonly used in the subject of electrochemistry. He subjected electrolysis to the first quantitative experimentation and in 1834 was able to establish that the amount of chemical compound decomposed at the electrodes was proportional to the amount of electricity used: Faraday's First and Second Law of Electrolysis. He devised the terminology, ions, for the part of the compound discharged at the electrodes.

Once Oersted had discovered that a current flowing in a conductor produced a magnetic field in 1820, scientists were convinced that a moving magnetic field should be able to produce a current in the conductor. It took eleven years before, in 1831, the American, Joseph Henry (1797-1878), and the Englishman, Michael Faraday (1791-1867), while working independently, explained the cause and effect of an induced current/emf being produced by a changing magnetic field. Henry is credited with the discovery but Faraday was the first to publish, introducing the concept of line of magnetic flux in his explanations.

In his notebooks in the early 1830s, Faraday described how he placed wires near magnets looking for current in the wire but without success. However, as he moved the apparatus he noticed a brief pulse of current but the current immediately fell back to zero. Perhaps the missing ingredient was motion. The solution came in 1831 when he set up an apparatus similar to that in Figure 1201 that he called an induction ring. The apparatus may look familiar to us with its battery, coils and galvanometer (a meter to detect current). We also know that a soft iron core increases the strength of a magnetic field.

Figure 1201 (label: switch)

To his initial disappointment, when he closed the switch to allow steady current to flow, only a slight twitch was observed in the galvanometer before the needle fell back to zero. This twitch could have been due to mechanical vibration. However, using his intuition, he noticed that when he slowly opened and closed the switch, a current was produced in one direction, then fell to zero, then a current was produced in the opposite direction. He called the current produced by a changing magnetic flux an induced current, and he called the general phenomenon electromagnetic induction.

He assumed that the magnetic flux must be changing, but how? Was the iron ring really necessary to produce induction or did it merely strengthen an effect? Was it necessary to have two coils or could an induced current be produced simply by moving a magnet in and out of a single coil of wire? Remembering back to his earlier experiment where he considered that motion could be a factor, he quickly set up experiments using a coil and a magnet, and proved that the iron ring was not essential, and that motion inside one coil could produce an induced current. Furthermore, he found that a rotating copper disc inserted between the poles of a magnet could be used (instead of a coil) to produce an induced current. It didn't matter whether the magnet was in motion or the coil (or disc) was in motion, an induced current was produced provided there was a change in magnetic flux.

Faraday presented his findings to the Royal Society in November 1831 and January 1832 in his Experimental researches into electricity in which he gave his Law which governs the evolution of electricity by magneto-electric induction: a change in magnetic flux through any surface bounded by closed lines causes an e.m.f. around the lines. Within no time, the dynamo, the generator and the transformer were invented by this brilliant experimental scientist. We will expand on the principles of electromagnetic induction in the remainder of this chapter.
of the galvanometer in one direction. After a very short period of time, the needle returns to zero on the scale.
The simple apparatus in Figure 1202 can detect the induced current, but the readings on the galvanometer are small (a zero-centred micro-ammeter is better). Faraday improved the apparatus by moving different magnetic flux densities into and out of different sized solenoids at different speeds. He found that the strength of the induced emf was dependent on

1. The speed of the movement
2. The strength of the magnetic flux density
3. The number of turns on the coil
4. The area of the coil

Figure 1202 (labels: motion; N; sensitive galvanometer)
The current produced is called an induced current. As work is done in moving the current from one end of the conductor to the other, an electrical potential difference exists, and an induced emf is produced. If the conductor is then moved in the opposite direction, the needle of the galvanometer deflects in the opposite direction before then falling to zero again. If the conductor is moved in the same direction as the magnetic field then no deflection occurs.

The direction of the induced current can be obtained by using the left-hand palm rule (refer to the palm rules discussed for the motor effect in Chapter 6). Using the magnetic field and direction of movement of the wire, if the palm of your left hand points in the direction of motion of the conductor, and your fingers point in the direction of the magnetic field, then the thumb gives the direction of the induced current. In this case, the current is in an anticlockwise direction. Alternatively, you can continue to use the right-hand palm rule BUT your palm points in the opposite direction to the applied force. Fleming's right-hand rule can also be used. The right-hand palm rule for the direction of an induced current and Fleming's right-hand rule are shown in Figures 1203 (a) and (b) respectively.

Figure 1203 labels: (a) thumb points in direction of induced current; fingers point in direction of magnetic field. (b) first finger points in direction of field; thumb points in direction of movement.
Example
Determine the direction of the induced current for each situation given below.
a. b. c. d.
Solution
Using a hand rule, the direction of the induced current for each situation is indicated by the arrows as shown in the diagram below:
a. b. c. d.
Figure 1203 (a) & (b) Palm rules for electromagnetic induction
Faraday realised that the magnitude of the induced e.m.f. was not proportional to the rate of change of the magnetic field B but rather proportional to the rate of change of magnetic flux Φ for a straight conductor or flux linkage NΦ. This will be discussed further in section 12.1.4.
Figure 1206 Cause of an induced emf. (labels: B into page; l; v)

Figure 1208 (a) and (b) Flux through a small, plane surface (labels: B; B sin θ; B cos θ)
When the wire conductor moves in the magnetic field, the free electrons experience a force because they are caused to move with velocity v as the conductor moves in the field.

$F = evB$

This force causes the electrons to drift from one end of the conductor to the other, and one end builds up an excess of electrons and the other a deficiency of electrons. This means that there is a potential difference or emf between the ends. Eventually, the electric field E set up by this emf becomes large enough to balance the magnetic force and thus stop electrons from moving.

$evB = eE \Rightarrow E = Bv$

If the potential difference (emf) between the ends of the conductor is ε then

$\varepsilon = El$

By substitution, we have,

$\varepsilon = Blv$

If the conducting wire was a tightly wound coil of N turns of wire the equation becomes:

$\varepsilon = NBlv$

The magnetic flux Φ through a small plane surface is the product of the flux density normal to the surface and the area of the surface.

$\Phi = BA$

The unit of magnetic flux is the weber Wb. Rearranging this equation it can be seen that:

$B = \Phi / A$

which helps us understand why B can be called the flux density. So the unit for flux density can be the tesla T, or the weber per square metre Wb m⁻². So, 1 T = 1 Wb m⁻².

If the normal shown by the dotted line in Figure 1208 (b) to the area makes an angle θ with B, then the magnetic flux is given by:

$\Phi = BA\cos\theta$

where A is the area of the region and θ is the angle of movement between the magnetic field and a line drawn perpendicular to the area swept out. (Be careful that you choose the correct vector component and angle because questions on past IB examinations give the correct answers of BA sin θ or BA cos θ depending on components supplied in the diagrams.)

If Φ is the flux through a cross-sectional area of a conductor with N coils, the total flux linking the coils will be given by:

$N\Phi = NBA\cos\theta$
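The three relations above (emf of a moving conductor, flux through a tilted surface, and flux linkage) can be evaluated with a short Python sketch. The function names are illustrative assumptions; the numbers are taken from the conductor example and the coil exercise that appear later in this chapter.

```python
import math

def motional_emf(b, length, speed, turns=1):
    """emf of a conductor moving at right angles to a field, N * B * l * v."""
    return turns * b * length * speed

def flux(b, area, angle_deg=0.0):
    """Flux B * A * cos(theta); theta is the angle between B and the normal to the area."""
    return b * area * math.cos(math.radians(angle_deg))

def flux_linkage(turns, b, area, angle_deg=0.0):
    """Flux linkage N * B * A * cos(theta) for a coil of N turns."""
    return turns * flux(b, area, angle_deg)

# 2.5 m conductor moving at 35 m/s across a 4.0e-3 T field.
print(f"emf = {motional_emf(4.0e-3, 2.5, 35):.2f} V")

# Coil of area 5 cm^2 in a 0.2 T field, normal at 0, 60 and 90 degrees to the field.
for angle in (0, 60, 90):
    print(f"theta = {angle:2d} deg, flux = {flux(0.2, 5e-4, angle):.2e} Wb")
```

Note that the angle is measured from the normal to the surface, so a coil lying parallel to the field corresponds to 90° and links no flux.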
This is called the flux linkage. So it should now be obvious that we can increase the magnetic flux by:

• Increasing the conductor area
• Increasing the magnetic flux density B
• Keeping the flux density normal to the surface of the conductor
Figure 1209 Rate of area swept out. (labels: N; S; current flow; v; area swept out by wire in one second; position of wire 1 second later)

We have already derived that ε = Blv. The area swept out in a given time is given by (l × d) / t. But v = d / t, so the area swept out per unit time is ΔA/Δt = lv. Therefore

$\varepsilon = B\frac{\Delta A}{\Delta t}$

where A is the area in m². For a single conductor in the magnetic flux density, it can be seen that:

$\varepsilon = -\frac{\Delta\Phi}{\Delta t}$

where the constant of proportionality equals 1. The negative sign will be explained in the next section. If there are N number of coils, then:

$\varepsilon = -N\frac{\Delta\Phi}{\Delta t}$

(Figure: bar magnet and solenoid. Labels: N, current flow; N, no movement, no current flow; N, current flow)

When the north pole is moved toward the core of the solenoid, an induced current flows in the external circuit as indicated by a zero-centred galvanometer or a microammeter. The pointer moves to the right meaning that the conventional induced current is flowing anti-clockwise at the end of the solenoid nearest the magnet. This end is acting like a north pole. When the magnet is stationary the meter reads zero. This suggests that the induced current is dependent on the speed of the movement. When the bar magnet is removed from the solenoid, the induced current flows in the opposite direction, and a south pole is created in the end that was previously a north pole.

In 1834, a Russian physicist Heinrich Lenz (1804-1865) applied the Law of Conservation of Energy to determine the direction of the induced emf for all types of conductors. It is known as the Second Law of Electromagnetic Induction and it can be stated as:
The direction of the induced emf is such that the current it causes to flow opposes the change producing it.

In the above case, the current induced in the coil creates a north pole to oppose the incoming north pole of the magnet. Similarly, when the magnet is withdrawn its north pole creates a south pole in the solenoid to oppose the change.

It can be reasoned that the Law of Conservation of Energy must apply. If the solenoid in Figure 1211 had an induced south pole when the north pole of the magnet was moved towards it, the magnet would accelerate as it would experience a force of attraction. More induced current would be produced creating more acceleration. The kinetic energy would increase indefinitely, so energy would be created. As this is impossible, it makes sense that the induced current must oppose the change producing it.

Lenz's Law can be applied to straight conductors as well as solenoids. Figure 1211 shows the magnetic lines of force for a bar magnet and a current-carrying wire directed into the page before and during interaction. Suppose the conductor is carrying an induced current initially.
Figure 1211 (labels: before interaction, magnetic field due to conductor; after interaction, magnetic field interaction produces an opposing force; N; S; interaction of the two magnetic fields causes tension on lower side; force due to hand pushing down)

The straight conductor is then pushed downwards, say with your hand. Your energy source induces the current but the combined magnetic fields tend to push the conductor upwards (a force is applied in the direction from the region of most flux density to the region of least flux density). Therefore, the induced current will be in such a direction that it tries to stop the conductor moving through the field. If we now combine Faraday's and Lenz's Laws of electromagnetic induction into the equation, we can now understand the significance of the negative sign.

$\varepsilon = -N\frac{\Delta\Phi}{\Delta t}$

Example

A metal conductor 2.5 m long moves at right angles to a magnetic field of 4.0 × 10⁻³ T with a velocity of 35 m s⁻¹. Calculate the emf of the conductor.

Solution

$\varepsilon = Blv = (4.0\times10^{-3}\ \mathrm{T})(2.5\ \mathrm{m})(35\ \mathrm{m\,s^{-1}}) = 0.35\ \mathrm{V}$

Example

A square solenoid with 120 turns and sides of 5.0 cm is placed in air with each turn perpendicular to a uniform magnetic flux density of 0.60 T. Calculate the induced emf if the field decreases to zero in 3.0 s.

Solution

The initial flux through each turn is Φ = BA = 0.60 T × (5.0 × 10⁻² m)² = 1.5 × 10⁻³ Wb, and the final flux is zero, so ΔΦ = −1.5 × 10⁻³ Wb.

$\varepsilon = -N\frac{\Delta\Phi}{\Delta t} = -120 \times \frac{-1.5\times10^{-3}\ \mathrm{Wb}}{3.0\ \mathrm{s}} = 0.060\ \mathrm{V}$
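The combined Faraday and Lenz relation can be checked numerically. The following Python sketch is an illustration only: the function name `average_emf` is an assumption, and the values are those of the square solenoid example (120 turns, sides of 5.0 cm, 0.60 T falling to zero in 3.0 s).

```python
def average_emf(turns, flux_initial, flux_final, time_interval):
    """Average induced emf, -N * (change in flux) / (time taken)."""
    return -turns * (flux_final - flux_initial) / time_interval

side = 5.0e-2                      # side of the square solenoid, in m
flux_start = 0.60 * side ** 2      # flux through one turn, B * A, in Wb
emf = average_emf(120, flux_start, 0.0, 3.0)
print(f"flux per turn = {flux_start:.2e} Wb, emf = {emf:.3f} V")
```

The sign of the result follows from the negative sign in the law: a decreasing flux gives a positive emf, which drives a current that tries to maintain the flux.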
Example

(a) A coil with 20 turns has an area of 2.0 × 10⁻¹ m². It is placed in a uniform magnetic field of flux density 1.0 × 10⁻¹ T so that the flux links the turns normally. Calculate the average induced emf in the coil if it is removed from the field in 0.75 s.

(b) The same coil is turned from its normal position through an angle of 30° in 0.3 s in the field. Calculate the average induced emf.

Solution

(a) The flux linkage is NΦ = NBA = 20 turns × 2.0 × 10⁻¹ m² × 1.0 × 10⁻¹ T = 4.0 × 10⁻¹ Wb. Next, we make use of the formula

emf = ΔΦ / Δt = 4.0 × 10⁻¹ Wb / 0.75 s = 0.533 V

The induced emf is 0.53 V.

(b) The flux change through the coil = NBA − NBA cos θ = 4.0 × 10⁻¹ Wb − 4.0 × 10⁻¹ Wb × cos 30° = 0.054 Wb

Average induced emf = 0.054 Wb / 0.3 s = 0.179 V

The induced emf is 0.18 V.

Exercise 12.1

1. Consider a coil of length l, cross-sectional area A, number of turns n, in which a current I is flowing. The magnetic flux density of the coil depends on
A. I, l, n but not A
B. I, n, A but not l
C. I, A, l but not n
D. A, l, n but not I

2. The magnetic flux Φ through a coil having 200 turns varies with time t as shown below.

(Graph: Φ / 10⁻² Wb, from 0.0 to 5.0, against t / 10⁻² s, from 0 to 2.5)

The magnitude of the emf induced in the coil is:
A. 0.5 V
B. 2 V
C. 100 V
D. 400 V

3. (Figure: magnet with poles S and N)

The induced current is directed
A. always opposite to the direction of the arrow.
B. always in the same direction as the arrow.
C. first in the opposite direction to the arrow, then as shown by the arrow.
D. first as shown by the arrow, then in the opposite direction to the arrow.
4. The magnitude of an induced emf produced by the relative motion between a solenoid and a magnetic field is dependent upon:
A. the strength of the magnetic flux density
B. the number of turns on the coil
C. the area of the coil
D. all of the above

5. Which of the following is a suitable unit to measure magnetic flux density?
A. A m N⁻¹
B. kg A⁻¹ s⁻²
C. A N⁻¹ m⁻¹
D. T m⁻¹

6. Faraday's law of electromagnetic induction states that the induced emf is
A. equal to the change in magnetic flux
B. equal to the change in magnetic flux linkage
C. proportional to the change in magnetic flux linkage
D. proportional to the rate of change of magnetic flux linkage

7. A uniform magnetic field of strength B completely links a coil of area A. The field makes an angle θ to the plane of the coil.

(Figure: coil of area A)

The magnetic flux linking the coil is
A. BA cos θ
B. BA
C. BA sin θ
D. BA tan θ

8. What factors determine the magnitude of an induced emf?

9. Refer to Figure 1204. Use Lenz's Law to explain what would happen if the solenoid was moved rather than the magnet.

10. What effect would the following have on the magnitude of the induced emf in a conductor moving perpendicular to a magnetic field?
(a) Doubling the velocity of movement of the conductor.
(b) Halving the magnetic flux density and velocity.
(c) Changing the conductor from copper to iron.

11. Explain in detail the difference between magnetic flux density and magnetic flux.

12. The magnetic flux through a coil of wire containing 5 loops changes from −25 Wb to +15 Wb in 0.12 s. What is the induced emf in the coil?

13. The wing of a Jumbo jet is 9.8 m long. It is flying at 840 km h⁻¹. If it is flying in a region where the earth's magnetic field has a vertical component of 7.2 × 10⁻⁴ T, what potential difference could be produced across the wing?

14. Find the total flux through an area of 0.04 m² perpendicular to a uniform magnetic flux density of 1.25 T.

15. If the total flux threading an area of 25 cm² is 1.74 × 10⁻² Wb, what would be the magnetic flux density?

16. A coil of area 5 cm² is in a uniform magnetic field of flux density 0.2 T. Determine the magnetic flux in the coil when:
(a) The coil is normal to the magnetic field
(b) The coil is parallel to the magnetic field
(c) The normal to the coil and the field have an angle of 60°

17. A metal conductor 2.5 m long moves at right angles to a magnetic field of 4.0 × 10⁻³ T with a velocity of 35 m s⁻¹. Calculate the emf of the conductor.

18. A square solenoid with 120 turns and sides of 5.0 cm is placed in air with each turn perpendicular to a uniform magnetic flux density of 0.60 T. Calculate the induced emf if the field decreases to zero in 3.0 s.
19. A coil with 1500 turns and a mean area of 45 cm² is placed in air with each turn perpendicular to a uniform magnetic field of 0.65 T. Calculate the induced emf if the field decreases to zero in 5.0 s.

20. The radius of the copper ring is 0.15 m and its resistance is 2.0 × 10⁻² Ω. A magnetic field strength is increasing at rate of 1.8 × 10⁻³ T s⁻¹. Calculate the value of the induced current in the copper ring.
Example

In Figure 1215, if the potentiometer is set on 2 V/division and the time base is set at 5 ms/cm, what is the voltage and frequency of the ac generator?

Figure 1215 CRO trace

Solution

The amplitude of the wave is 3 divisions and each division is 2 V. Therefore, the emf would be 6 V.

Between the two dots there are 6 divisions. Therefore, one complete cycle is equivalent to 12 divisions. Now there are 5 milliseconds/cm. So the period of the wave is 60 ms, that is, T = 60 ms. Therefore, the frequency of the source is given by,

$f = \frac{1}{T} = \frac{1}{60\times10^{-3}\ \mathrm{s}} \approx 17\ \mathrm{Hz}$

The magnitude of the emf and current varies with time as shown in Figure 1217. Consider a coil ABCD rotating clockwise initially in the horizontal position. From the graph of current versus time, you can see that the current reaches a maximum when the coil is horizontal and a minimum when the coil is vertical. If more lines of magnetic flux are being cut, then the induced current will be greater. This occurs to the greatest extent when the coil is moving at right angles to the magnetic field. When the coil moves parallel to the field, no current flows.
Figure 1216 AC generator (labels: N; S; I; F; slip rings; brush contacts)

To determine the direction of the induced current produced as the coil rotates, we must apply Lenz's Law. As the left-hand side of the coil (nearest the north pole of the magnet) moves upward, a downward magnetic force must be exerted to oppose the rotation. By applying the right-hand palm rule for electromagnetic induction, you can determine the direction of the induced current on that side of the coil. The direction of the current in the right side of the coil can also be determined.

Figure 1217 (labels: current; coil positions A, B, C, D)

Each complete cycle of the sinusoidal graph corresponds to one complete revolution of the generator.

$\Phi = NAB\cos\theta$

The emf varies sinusoidally (sin and cos graphs have the same shape) with time and can be calculated using

$\varepsilon = -\frac{\Delta\Phi}{\Delta t}$
Remember from your knowledge of rotational motion that

$\omega = \frac{\theta}{t}$ = the angular velocity in rad s⁻¹ $= 2\pi f$

Also

$\theta = \omega t = 2\pi f t$

so that

$\varepsilon = \omega NAB \sin(\omega t)$

So that,

$\varepsilon = 2\pi f NAB \sin(2\pi f t)$

We can see that the maximum emf will occur when sin ωt = 1, so that,

$\varepsilon_{max} = \omega NAB$

Writing this maximum value as ε0, we have

$\varepsilon_0 = \omega NBA$

Therefore:

$\varepsilon = \varepsilon_0 \sin\omega t$

The frequency of rotation used in North America is 60 Hz but the mains frequency used by many other countries is 50 Hz.

Example

Calculate the peak voltage of a simple generator if the square armature has sides of 5.40 cm and it contains 120 loops. It rotates in a magnetic field of 0.80 T at the rate of 110 revolutions per second.

Solution

The area of the armature is A = (5.40 × 10⁻² m)² = 2.92 × 10⁻³ m². But ω = 2πf, so that

$\varepsilon_0 = 2\pi f NAB = (2\pi)(110\ \mathrm{Hz})(120\ \mathrm{turns})(2.92\times10^{-3}\ \mathrm{m^2})(0.80\ \mathrm{T}) \approx 1.9\times10^{2}\ \mathrm{V}$

That is, the peak output voltage is about 190 V.
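The peak emf formula lends itself to a direct numerical check. The sketch below is a hedged illustration in Python: the function names are assumptions, and the armature area is taken to be the square of the stated 5.40 cm side, that is (0.054 m)², which is itself an assumption about how the example should be read.

```python
import math

def peak_emf(frequency, turns, area, flux_density):
    """Peak emf of a rotating coil, 2 * pi * f * N * A * B."""
    return 2 * math.pi * frequency * turns * area * flux_density

def emf_at(t, frequency, turns, area, flux_density):
    """Instantaneous emf, peak value times sin(2 * pi * f * t)."""
    return peak_emf(frequency, turns, area, flux_density) * math.sin(2 * math.pi * frequency * t)

side = 5.40e-2       # side of the square armature, in m
area = side ** 2     # assumed area of the armature, in m^2
eps0 = peak_emf(110.0, 120, area, 0.80)
print(f"area = {area:.3e} m^2, peak emf = {eps0:.1f} V")
```

Because the peak emf is proportional to the area, a different reading of the armature dimensions changes the answer by the same factor as the area.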
Note that if the speed of the coil is doubled then the frequency and the magnitude of the emf will both increase as shown in Figures 1218 and 1219 respectively.

Figure 1218 Normal frequency (axes: ε / V against t / ms; peak value 4 V)

Figure 1219 Doubled frequency (axes: ε / V against t / ms; peak value 8 V)

Example

Suppose a coil with 1200 turns has an area of 2.0 × 10⁻² m² and is rotating at 50 revolutions per second in a magnetic field of magnitude 0.50 T. Draw graphs to show how the magnetic flux, the emf and the current change as a function of time. (Assume the current flows in a circuit with a resistance of 25 Ω.)

Solution

The magnetic flux in the coil changes over time as shown in Figure 1220.

Φ = NBA = 1200 turns × 0.5 T × 2 × 10⁻² m² = 12 Wb

50 revolutions per second would have 1 revolution in 0.02 seconds = 20 ms.
Figure 1220 (axes: Φ / Wb, from −12 to 12, against t / ms, from 0 to 40)

When the plane of the coil is parallel to the magnetic field, sin ωt will have its maximum value as ωt = 90°, so sin ωt = 1. This maximum value for the emf ε0 is called the peak voltage, and is given by ε0 = ωNBA.
We can see that the maximum emf will occur when sin ωt = 1, so that,

$\varepsilon_{max} = \omega NAB$

But ω = 2πf, so that

$\varepsilon_0 = (2\pi)(50\ \mathrm{Hz})(1200\ \mathrm{turns})(2\times10^{-2}\ \mathrm{m^2})(0.50\ \mathrm{T}) = 3770\ \mathrm{V} \approx 3.8\times10^{3}\ \mathrm{V}$

Figure 1221 Induced emf over time (axes: ε / V against t / ms, from 0 to 40)

The current flows in a circuit with a resistance of 25 Ω.

$I = \frac{\varepsilon}{R} = \frac{\varepsilon_0 \sin\omega t}{R} = I_0 \sin\omega t$ where $I_0 = \frac{3770\ \mathrm{V}}{25\ \Omega} \approx 1.5\times10^{2}\ \mathrm{A}$

The appropriate graph is shown in Figure 1222.

Figure 1222 Induced current over time (axes: I / A against t / ms, from 0 to 40)

Figure 1223 Peak current and current over time

In commercial practice, alternating currents are expressed in terms of their root-mean-square (r.m.s.) value.

Consider 2 identical resistors each of resistance R, one carrying d.c. and the other a.c. in an external circuit. Suppose they are both dissipating the same power as thermal energy. The r.m.s. value of the alternating current that produces the power is equal to the d.c. value of the direct current. For an alternating current of maximum value I0, the power dissipated is given by

$P = I^2 R = I_0^2 R \sin^2\omega t$

Because the current is squared, the value for the power dissipated is always positive as shown in Figure 1224.
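The three graphs requested in the example can be tabulated from the sinusoidal relations. The following Python sketch is an illustration under stated assumptions: the function names are hypothetical, the flux linkage is taken as NBA cos ωt starting from its maximum at t = 0, and the peak values are obtained by evaluating 2πfNBA directly from the example's data (1200 turns, 2.0 × 10⁻² m², 0.50 T, 50 revolutions per second, 25 Ω).

```python
import math

N, A, B, f, R = 1200, 2.0e-2, 0.50, 50.0, 25.0
omega = 2 * math.pi * f          # angular velocity in rad/s

def flux_linkage(t):
    """Flux linkage N * B * A * cos(omega * t), in Wb."""
    return N * B * A * math.cos(omega * t)

def emf(t):
    """Induced emf omega * N * B * A * sin(omega * t), in V."""
    return omega * N * B * A * math.sin(omega * t)

def current(t):
    """Current in the resistive circuit, emf / R, in A."""
    return emf(t) / R

for t_ms in (0, 5, 10, 15, 20):
    t = t_ms * 1e-3
    print(f"t = {t_ms:2d} ms, flux = {flux_linkage(t):7.2f} Wb, "
          f"emf = {emf(t):8.1f} V, I = {current(t):7.1f} A")
```

The table shows the quarter-period shift between the curves: the emf and current peak at the instants where the flux linkage passes through zero.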
Figure 1224 Power delivered to a resistor in an alternating current circuit. (axes: power against time; levels marked at I0²R and ½I0²R)

The value of sin²ωt will therefore vary between 0 and 1. Therefore its average value = (0 + 1) / 2 = ½. Therefore the average power that dissipates in the resistor equals:

$P_{ave} = \frac{1}{2} I_0^2 R = \left(\frac{I_0}{\sqrt{2}}\right)^2 R$

or

$P_{ave} = \frac{V_0}{\sqrt{2}} \times \frac{V_0}{\sqrt{2}} \times \frac{1}{R} = \frac{V_0^2}{2R}$

So the power dissipated in a resistor by an alternating current that varies between I0 and −I0 is equal to the power dissipated by a direct current of I0/√2. This d.c. current is known as the r.m.s. equivalent current to the alternating current.

It can be shown that

$I_{r.m.s.} = \frac{I_0}{\sqrt{2}}$ and $V_{r.m.s.} = \frac{V_0}{\sqrt{2}}$

Provided a circuit with alternating current only contains resistance components, it can be treated like a direct current circuit.

Example

In the USA, the r.m.s. value of the standard line voltage is 110 V and in some parts of Europe, it is 230 V. Calculate the peak voltage for each region.

Solution

$V_0 = \sqrt{2}\, V_{rms}$

In the USA: V0 = √2 × 110 V = 156 V. In Europe: V0 = √2 × 230 V = 325 V.

Example

The domestic standard line voltage in Australia is 240 V. Calculate the current and resistance in a 1200 W electric jug and compare these values with the same electric jug used in the USA.

Solution

$I_{rms} = \frac{P_{ave}}{V_{rms}}$ and $R = \frac{V_{rms}}{I_{rms}}$

In Australia: I = 1200 W / 240 V = 5.0 A and R = 240 V / 5.0 A = 48 Ω

In the USA: I = 1200 W / 110 V = 11 A and R = 110 V / 10.9 A = 10 Ω

The current would have a greater heating effect in the USA than Australia but the element in the electric jug would need to be made of a conductor with a greater cross-sectional area.
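The r.m.s. relations can be checked with a few lines of Python. This is a sketch only: the function names are illustrative assumptions, and the numbers are the line voltages and the 1200 W jug from the examples above.

```python
import math

def peak_from_rms(v_rms):
    """Peak value of a sinusoidal quantity, sqrt(2) times its r.m.s. value."""
    return math.sqrt(2) * v_rms

def jug_current_and_resistance(power, v_rms):
    """r.m.s. current P / V and resistance V / I for a purely resistive load."""
    i_rms = power / v_rms
    return i_rms, v_rms / i_rms

for region, v_rms in (("USA", 110.0), ("Europe", 230.0)):
    print(f"{region}: peak voltage = {peak_from_rms(v_rms):.0f} V")

for region, v_rms in (("Australia", 240.0), ("USA", 110.0)):
    i_rms, r = jug_current_and_resistance(1200.0, v_rms)
    print(f"{region}: I = {i_rms:.1f} A, R = {r:.0f} ohm")
```

Treating the jug as a pure resistance is what allows the d.c. formulas to be reused with r.m.s. values.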
Example
Figure 1225 The V0 value for a circuit containing a 35 resistor is 45 V. Calculate the current and the power dissipated in the resistor. A simple transformer
When an ac voltage is applied to the primary coil, an ac voltage of the same frequency is induced in the secondary coil. This frequency in most countries is 50 Hz. When a current flows in the primary coil, a magnetic field is produced around the coil. It grows quickly and cuts the secondary coil to induce a current and thus to induce a magnetic field also. When the current falls in the primary coil due to the alternating current, the magnetic field collapses in the primary coil and cuts the secondary coil producing an induced current in the opposite direction. The size of the voltage input/output depends on the number of turns on each coil. It is found that
Solution
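One possible way to work the example with the 35 Ω resistor and $V_0$ = 45 V is sketched below. It assumes that "the current" may mean either the peak or the r.m.s. value, so both are computed, and it uses $P_{\text{ave}} = V_0^2 / 2R$ for the average power. The function name is illustrative only.

```python
import math


def resistor_in_ac_circuit(v_peak, resistance):
    """Return (I_peak, I_rms, P_ave) for a resistor on an a.c. supply."""
    i_peak = v_peak / resistance            # I0 = V0 / R
    i_rms = i_peak / math.sqrt(2)           # I_rms = I0 / sqrt(2)
    p_ave = v_peak ** 2 / (2 * resistance)  # P_ave = V0^2 / (2R)
    return i_peak, i_rms, p_ave


i_peak, i_rms, p_ave = resistor_in_ac_circuit(45.0, 35.0)
print(f"I0 = {i_peak:.2f} A, I_rms = {i_rms:.2f} A, P_ave = {p_ave:.0f} W")
```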
$\dfrac{V_p}{V_s} = \dfrac{N_p}{N_s} = \dfrac{I_s}{I_p}$

where N is the number of turns on a designated coil and I is the current in each coil.
It can be seen that if $N_s$ is greater than $N_p$ then the transformer is a step-up transformer. If the reverse occurs and $N_s$ is less than $N_p$ it will be a step-down transformer. If a transformer were 100% efficient, the power produced in the secondary coil would equal the power input to the primary coil. In practice the efficiency is closer to 98% because of eddy currents.
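The turns relationship for an ideal transformer can be expressed in a few lines of code. This is a minimal sketch that assumes 100% efficiency; the function names are illustrative and not taken from the text.

```python
def secondary_voltage(v_primary, n_primary, n_secondary):
    """Ideal transformer: Vp / Vs = Np / Ns."""
    return v_primary * n_secondary / n_primary


def primary_current(i_secondary, n_primary, n_secondary):
    """Ideal transformer: Is / Ip = Np / Ns."""
    return i_secondary * n_secondary / n_primary


def transformer_type(n_primary, n_secondary):
    """Classify the transformer from its turns."""
    if n_secondary > n_primary:
        return "step-up"
    if n_secondary < n_primary:
        return "step-down"
    return "one-to-one"


# A 240 V supply on 1000 primary turns with 50 secondary turns
print(transformer_type(1000, 50), secondary_voltage(240.0, 1000, 50))
```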
ELECTROMAGNETIC INDUCTION
The figure below shows a step-down transformer that is used to light a filament globe of resistance 4.0 Ω under operating conditions.

[Figure: step-down transformer with a 240 V ac supply, 1000 turns on the primary coil, 50 turns on the secondary coil, switch S and a 4.0 Ω globe]

Calculate
(a) the reading on the voltmeter with S open
(b) the current in the secondary coil with an effective resistance of 0.2 Ω with S closed
(c) the power dissipated in the lamp
(d) the power taken from the supply if the primary current is 150 mA
(e) the efficiency of the transformer.

Solution

(a) Using the formula $V_p / V_s = N_p / N_s$, with $V_p$ = 240 V, $N_p$ = 1000 turns and $N_s$ = 50 turns we have $V_s$ = 12 V.

(b) Total resistance = 0.2 + 4.0 = 4.2 Ω. From the formula I = V / R, we have $I = \dfrac{12\ \text{V}}{4.2\ \Omega} = 2.86$ A.

(c) Using $P = I^2R$, we have $P = (2.86\ \text{A})^2 \times 4.0\ \Omega = 33$ W.

(d) Using P = VI, we have $P = 240 \times 150 \times 10^{-3} = 36$ W.

(e) Efficiency = power dissipated in the lamp / power taken from the supply = 33 / 36 = 0.91, or about 91%.

12.3.5 Discuss some of the possible risks involved in living and working near high-voltage power lines.
IBO 2007
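The step-down transformer example (240 V supply, 1000 and 50 turns, a 4.0 Ω globe, 0.2 Ω effective secondary resistance and 150 mA primary current) can be checked numerically. The sketch below assumes that the efficiency is the power dissipated in the lamp divided by the power taken from the supply; the variable names are illustrative.

```python
# Data taken from the worked example
V_PRIMARY = 240.0       # V
N_PRIMARY = 1000        # turns
N_SECONDARY = 50        # turns
R_LAMP = 4.0            # ohm
R_SECONDARY = 0.2       # ohm, effective resistance of the secondary coil
I_PRIMARY = 150e-3      # A

v_secondary = V_PRIMARY * N_SECONDARY / N_PRIMARY    # (a) Vs = Vp * Ns / Np
i_secondary = v_secondary / (R_LAMP + R_SECONDARY)   # (b) I = V / R
p_lamp = i_secondary ** 2 * R_LAMP                   # (c) P = I^2 R
p_supply = V_PRIMARY * I_PRIMARY                     # (d) P = V I
efficiency = p_lamp / p_supply                       # (e) assumed definition

print(f"(a) {v_secondary:.0f} V  (b) {i_secondary:.2f} A  (c) {p_lamp:.1f} W")
print(f"(d) {p_supply:.0f} W  (e) {efficiency:.0%}")
```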
The main heat loss is due to the heating effect of a current. By keeping the current as low as possible, the heating effect can be reduced. The resistance in a wire due to the flow of electrons over long distances also has a heating effect. If the thickness of the copper wire used in the core of the transmission line is increased, then the resistance can be decreased. However, there are practical considerations
such as weight and the mechanical and tensional strength that have to be taken into account. The copper wire is usually braided (lots of copper wires wound together) and these individual wires are insulated. The insulation material has a dielectric value which can cause some power loss. Finally, the changing electric and magnetic fields of the electrons can encircle other electrons and retard their movement on the outer surface of the wire through self-inductance. This is known as the skin effect. The size of the power loss depends on the magnitude of the transmission voltage, and power losses of the order of magnitude of $10^5$ watts per kilometre are common.

Power losses in real transformers are due to factors such as: eddy currents; resistance of the wire used for the windings; hysteresis; flux leakage; physical vibration and noise of the core and windings; electromagnetic radiation; and dielectric loss in materials used to insulate the core and windings.

The capacity for the primary coil to carry current is limited by the insulation and air gaps between the turns of the copper wire and this leads to flux leakage. This can be up to 50% of the total space in some cases. Because the power is being delivered to the transformer at 50 Hz, you can often hear them making a humming noise. Minimal energy is lost in the physical vibration and noise of the core and windings. Modern transformers are up to 99% efficient.

Long-distance alternating current (AC) transmission is affected by a transmission line's reactive power that is actually 90 degrees out of phase with the flow of real current to a load at the other end of the line. For short transmission lines, the effect is not as significant. Direct current transmission does not have reactive power once the voltage has been raised to the normal level. The power losses are considerably less than for alternating current.
Hysteresis is derived from the Greek word that means lagging behind and it becomes an important factor in the changes in flux density as a magnetic field changes in ferromagnetic materials. Transformer coils are subject to many changes in flux density. As the magnetic field strength increases in the positive direction, the flux density increases. If the field strength is reduced to zero, the iron remains strongly magnetised due to the retained flux density. When the magnetic field is reversed the flux density is reduced to zero. So in one cycle the magnetisation lags behind the magnetising field and we have another iron loss that produces heat. Hysteresis is reduced again by using silicon iron cores.
As already mentioned, any conductor that moves in a magnetic field has an emf induced in it, and as such currents, called eddy currents, will also be induced in the conductor. This current has a heating effect in the soft iron core of the transformer which causes a power loss termed an iron loss. There is also a magnetic effect in that the created magnetic fields will oppose the flux change that produces them according to Lenz's Law. This means that eddy currents will move in the opposite direction to the induced current causing a braking effect. Eddy currents are considerably reduced by alloying the iron with 3% silicon that increases the resistivity of the core. To reduce the heating effect due to eddy currents, the soft-iron core is made of sheets of iron called laminations that are insulated from each other by an oxide layer on each lamination. This insulation prevents currents from moving from one lamination to the next. Copper wire is used as the windings on the soft-iron core because of its low resistivity and good electrical conductivity. Real transformers used for power transmission reach temperatures well above room temperature and are cooled down by transformer oil. This oil circulates through the transformer and serves not only as a cooling fluid but also as a cleaning and anticorrosive agent. However, power is lost due to resistance and temperature, commonly referred to as copper loss.
An average of 120 kW of power is delivered to a suburb from a power plant 10 km away. The transmission lines have a total resistance of 0.40 Ω. Calculate the power loss if the transmission voltage is
(a) 240 V
(b) 24 000 V
Solution
(a)
Once the voltage has been stepped-up, it is transmitted into a national supergrid system from a range of power stations. As it nears a city or town it is stepped-down into a smaller grid. As it approaches heavy industry, it is stepped down to around 33 to 132 kV in the UK, and when it arrives at light industry it is stepped-down to 11 to 33 kV. Finally, cities and farms use a range of values down to 240 V from a range of power stations. When the current flows in the cables, some energy is lost to the surroundings as heat. Even good conductors such as copper still have a substantial resistance because of the significant length of wire needed for the distribution of power via the transmission cables. To minimise energy losses the current must be kept low.
$P = I^2R = (5.0\ \text{A})^2 \times (0.40\ \Omega) = 10$ W
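The effect of the transmission voltage on the power loss in this example can be seen by computing both cases side by side. The sketch below assumes the full 120 kW is carried by the line current I = P / V and that the loss is $I^2R$ in the 0.40 Ω line; the function name is an illustrative choice.

```python
def line_power_loss(power_w, voltage_v, line_resistance_ohm):
    """Return (current, power lost) for power delivered at a given voltage."""
    current = power_w / voltage_v                 # I = P / V
    loss = current ** 2 * line_resistance_ohm     # P_loss = I^2 R
    return current, loss


for voltage in (240.0, 24000.0):
    current, loss = line_power_loss(120e3, voltage, 0.40)
    print(f"V = {voltage:.0f} V: I = {current:.1f} A, loss = {loss:.0f} W")
```

Raising the voltage by a factor of 100 lowers the current by the same factor and the loss by a factor of $10^4$, which is the reason for transmitting at high voltage.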
2. An alternating current with a root-mean-square value of 2 A is compared with the direct current I flowing through a given resistor. If both currents generate heat at the same rate, the value of I would be
A. 4 A
B. 2√2 A
C. 2 A
D. √2 A
3.
12.3.4 EXTRA-LOW-FREQUENCY ELECTROMAGNETIC FIELDS
We have all seen through the media patients being given shock treatment through 2 electrodes to try and get the heart to beat at its natural frequency. The human body is a conducting medium so any alternating magnetic field produced at the extra-low frequency will induce an electric field which in turn produces a very small induced current in the body. Using a model calculation in a human of body radius 0.2 m and a conductivity of 0.2 S m$^{-1}$ (siemens per metre), it has been shown that a magnetic field of 160 µT can induce a body surface current density of 1 mA m$^{-2}$. It is currently recommended that current densities to the head, neck and body trunk should not be greater than 10 mA m$^{-2}$.
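The order of magnitude of the induced current density can be reproduced with a simple loop model. The sketch below is an assumption about what such a model calculation might look like, not the calculation referred to in the text: it treats the body as a circular loop of radius r in a uniform sinusoidal field of peak value B and frequency f (taken here as the 50 Hz mains frequency), so that Faraday's law gives a peak electric field $E = \pi f r B$ and a current density $J = \sigma E$.

```python
import math


def induced_current_density(b_peak_t, freq_hz, radius_m, conductivity_s_per_m):
    """Peak induced current density (A m^-2) at the edge of a circular loop.

    Assumed model: E * 2*pi*r = pi*r^2 * dB/dt with dB/dt(peak) = 2*pi*f*B,
    which gives E = pi * f * r * B, and then J = sigma * E.
    """
    e_field = math.pi * freq_hz * radius_m * b_peak_t
    return conductivity_s_per_m * e_field


# Values quoted in the text, with an assumed frequency of 50 Hz
j = induced_current_density(160e-6, 50.0, 0.2, 0.2)
print(f"J = {j * 1e3:.2f} mA m^-2")
```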
An ideal transformer has a primary coil of 5000 turns and a secondary coil of 250 turns. The primary voltage produced is 240 V. If a 24 W lamp connected to the secondary coil operates at this power rating, the current in the primary coil is:
A. 0.05 A
B. 0.1 A
C. 12 A
D. 20 A
If an alternating e.m.f. has a peak value of 12 V then the root-mean-square value of this alternating e.m.f. would be:
A. 0 V
B. 6 V
C. 3.5 V
D. 6√2 V
Exercise 12.3

1. Which of the following could correctly describe a step-up transformer?

   power supply   core    primary coil   secondary coil
A. dc             Steel   10 turns       100 turns
B. dc             Iron    100 turns      10 turns
C. ac             Steel   10 turns       100 turns
D. ac             Iron    100 turns      10 turns

5. The figure below shows the variation with time t of the emf $\varepsilon$ generated in a coil rotating in a uniform magnetic field.

[Figure: emf $\varepsilon$/V against time t, with peak values $\varepsilon_0$ and $-\varepsilon_0$ and the time axis marked at T/2]

What is the root-mean-square value $\varepsilon_{\text{rms}}$ of the emf and also the frequency f of rotation of the coil?

   $\varepsilon_{\text{rms}}$   f
A. $\varepsilon_0$              2/T
B. $\varepsilon_0$              1/T
C. $\varepsilon_0/\sqrt{2}$     2/T
D. $\varepsilon_0/\sqrt{2}$     1/T
6. A load resistor is connected in series with an alternating current supply of negligible internal resistance. If the peak value of the supply voltage is $V_0$ and the peak value of the current in the resistor is $I_0$, the average power dissipation in the resistor would be:
A. $V_0 I_0 / 2$
B. $V_0 I_0 / \sqrt{2}$
C. $2 V_0 I_0$
D. $V_0 I_0$

7. Explain why a soft-iron core is used in the construction of a transformer.

8. Explain why a transformer will not work with direct current.

9. If there are 1200 turns in the primary coil of an ac transformer with a primary voltage of 240 V, calculate the secondary voltage if the secondary coil has
(a) 300 turns
(b) 900 turns
(c) 1800 turns

10. The armature of a 30 Hz a.c. generator contains 120 loops. The area of each loop is $2.0 \times 10^{-2}$ m$^2$. It produces a peak output voltage of 120 V when it rotates in a magnetic field. Calculate the strength of the magnetic field.

11. Calculate the r.m.s. value of the following currents: 2 A, 4 A, 6 A, 3 A, -5 A, 1 A, 6 A, 8 A, -9 A and 10 A.

12. Calculate the peak current in a $2.4 \times 10^3$ Ω resistor connected to a 230 V domestic a.c. source.

13. This question deals with the production and transmission of electric power, electricity costs and efficiency, and fuse systems.
(a) Is it feasible to transmit power from a power station over long distances using direct current rather than alternating current? Justify your answer.
(b) An aluminium transmission cable has a resistance of 5.0 Ω when 10 kW of power is transmitted in the cable. Justify why it is better to transmit the power at 100 000 V rather than 1000 V by comparing the power that would be wasted in the transmission at both of these voltages.
(c) Many step-up and step-down transformers are used in the electricity transmission from the power station to the home. In order to increase the efficiency of the transformers, eddy currents have to be minimised. Describe how this is achieved in the transformer design.
(d) If the fuse controlling the maximum power for lighting in your house is rated at 8 A, calculate the maximum number of 60 W light bulbs that can be operated in parallel with a 110 V power supply so as not to blow the fuse.
(e) A stainless steel calorimeter with a mass of 720 g was used to heat 2.5 kg of water. If the current / voltage in a heating element supplying the power was 30.2 A / 110 V, and it took 2.5 minutes to heat the water from 25 °C to 98 °C, determine the specific heat capacity of the steel. (Assume no heat loss to the surroundings. The specific heat capacity of water is $4.18 \times 10^3$ J kg$^{-1}$ K$^{-1}$.)
(f) How much does it cost to run the following appliances at the same time for one hour if electricity costs 10.5 cents per kilowatt-hour: a 6 kW oven, two 300 W colour televisions and five 100 W light globes?

14. Name four factors that affect the magnitude of an induced emf in a generator.

15. If there are 4 laminations in a transformer core, what fraction of the flux is in each lamination and what fraction of the power is dissipated in each?
13
ultra-violet light and that the surface was always positively charged. He concluded therefore that the ultra-violet light caused negative charge to be ejected from the surface in some manner. In 1899 Lenard showed that the negative charge involved in the photo-electric effect consisted of particles identical in every respect to those isolated by J. J. Thomson two years previously, namely, electrons. Figure 1301 (a) shows schematically the sort of arrangement that might be used to investigate the photo-electric effect in more detail. The tube B is highly evacuated, and a potential difference of about 10 V is applied between anode and cathode. The cathode consists of a small zinc plate, and a quartz window is arranged in the side of the tube such that the cathode may be illuminated with ultraviolet light. The current measured by the micro-ammeter gives a direct measure of the number of electrons emitted at the cathode. When the tube is dark no electrons are emitted at the cathode and therefore no current is recorded. When ultraviolet light is allowed to fall on the cathode electrons are ejected and traverse the tube to the anode, under the influence of the anode-cathode potential. A small current is recorded by the micro-ammeter. Figure 1301 (b) shows a plot of photoelectric current against light intensity for a constant anode-cathode potential. As you would
expect, the graph is a straight line and doubling the light intensity doubles the number of electrons ejected at the cathode. The graph of photoelectric current against light frequency, Figure 1301 (c), is not quite so obvious. The graph shows clearly that there is a frequency of light below which no electrons are emitted. This frequency is called the threshold frequency. Further experiment shows that the value of the threshold frequency is independent of the intensity of the light and also that its value depends on the nature of the material of the cathode.
U.V. source
In terms of wave theory we would expect photo-emission to occur for light of any frequency. For example: consider a very small portion of the cathode, so small in fact that it contains only one electron for photo-emission. If the incident light is a wave motion, the energy absorbed by this small portion of the cathode and consequently by the electron will increase uniformly with time. The amount of energy absorbed in a given time will depend on the intensity of the incident light and not on the frequency. If the light of a given frequency is made very very feeble there should be an appreciable time lag during which the electron absorbs sufficient energy to escape from within the metal. No time lag is ever observed.
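Figure 1301 (b) Photoelectric (P.E.) current against light intensity

Figure 1301 (c) Photoelectric (P.E.) current against frequency, with the threshold frequency $f_0$ marked on the frequency axis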
$E_k = hf - \phi$
determined using measurements from the spectra associated with hot objects. The results of Millikan's experiment yielded the same value and the photoelectric effect is regarded as the method by which the value of the Planck constant is measured. The modern accepted value is $6.6260693 \times 10^{-34}$ J s. The intercept on the frequency axis is the threshold frequency and the intercept on the $V_s$ axis is numerically equal to the work function measured in electron-volt. Figure 1302 shows the typical results of Millikan's experiment, which shows the variation with frequency f of the maximum kinetic energy $E_k$. It is left to you as an exercise, using data from this graph, to determine the Planck constant, and the threshold frequency and work function of the metal used for the cathode. (Ans $6.6 \times 10^{-34}$ J s, 4.5 eV)
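The way a Planck constant is extracted from a graph of stopping potential against frequency can be sketched in code. The example below is illustrative only: it uses the stopping-potential data tabulated in the exercise later in this section, assumes the relation $eV_s = hf - \phi$ so that the gradient is h/e and the intercept on the $V_s$ axis is $-\phi/e$, and fits the line by ordinary least squares. The helper name `fit_line` is an assumption, not something from the text.

```python
E_CHARGE = 1.6e-19  # C

# Data from the exercise table: frequency in Hz, stopping potential in V
freq_hz = [6.0e14, 7.1e14, 8.0e14, 8.8e14, 9.7e14]
v_stop = [0.33, 0.79, 1.2, 1.49, 1.82]


def fit_line(xs, ys):
    """Least-squares straight line; returns (gradient, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    gradient = sxy / sxx
    return gradient, mean_y - gradient * mean_x


gradient, intercept = fit_line(freq_hz, v_stop)
planck_estimate = gradient * E_CHARGE      # h = e * gradient
work_function_ev = -intercept              # phi in eV = -(Vs-axis intercept)
threshold_hz = -intercept / gradient       # intercept on the frequency axis

print(f"h = {planck_estimate:.2e} J s, phi = {work_function_ev:.2f} eV, "
      f"f0 = {threshold_hz:.2e} Hz")
```

A hand-drawn line of best fit will give slightly different values, which is a useful reminder that the result carries an uncertainty.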
$E_k = hf - hf_0$
Where f0 is the threshold frequency. Either of the above equations is referred to as the Einstein Photoelectric Equation. It is worth noting that Einstein received the Nobel prize for Physics in 1921 for his contributions to mathematical physics and especially for his discovery of the law of the photoelectric effect.
Figure 1302 Maximum kinetic energy $E_k$/eV against frequency $f$/$10^{15}$ Hz
In summary, Figure 1303 shows the observations associated with the photoelectric effect and why the Classical theory, that is the wave theory, of electromagnetic radiation is unable to explain the observations i.e. makes predictions inconsistent with the observations.

Observation: Emission of electrons is instantaneous no matter what the intensity of the incident radiation.
Classical theory prediction: Energy should be absorbed by the electron continuously until it has sufficient energy to break free from the metal surface. The less the intensity of the incident radiation, the less energy incident on the surface per unit time, so the longer it takes the electron to be ejected.

Observation: The existence of a threshold frequency.
Classical theory prediction: The intensity of the radiation is independent of frequency. Emission of electrons should occur for all frequencies.
CHAPTER 13
The fact that the photoelectric effect gives convincing evidence for the particle nature of light raises the question as to whether light consists of waves or particles. If particulate in nature, how do we explain such phenomena as interference and diffraction? This is an interesting area of discussion for TOK and it is worth bearing in mind that Newton wrote in his introduction to his book Opticks "It seems to me that the nature of light be particulate."

Calculate the energy of a photon in light of wavelength 120 nm.

Solution

$f = \dfrac{c}{\lambda} = \dfrac{3.0 \times 10^{8}}{120 \times 10^{-9}} = 2.5 \times 10^{15}$ Hz, so $E = hf = 6.6 \times 10^{-34} \times 2.5 \times 10^{15} \approx 1.7 \times 10^{-18}$ J.

Solution

$f_0 = \dfrac{\phi}{h} = \dfrac{2.0 \times 1.6 \times 10^{-19}}{6.6 \times 10^{-34}} = 4.8 \times 10^{14}$ Hz

Exercise 13.1 (a)

1. Use data from example 2 to calculate the maximum kinetic energy in electron-volts of electrons emitted from the surface of potassium when illuminated with light of wavelength 120 nm.

2. State and explain two observations associated with the photoelectric effect that cannot be explained by the Classical theory of electromagnetic radiation.

3. In an experiment to measure the Planck constant, light of different frequencies f was shone on to the surface of silver and the stopping potential $V_s$ for the emitted electrons was measured. The results are shown below. Uncertainties in the data are not shown.

$V_s$/V:          0.33   0.79   1.2   1.49   1.82
f/$10^{14}$ Hz:   6.0    7.1    8.0   8.8    9.7

Plot a graph to show the variation of $V_s$ with f. Draw a line of best-fit for the data points. Use the graph to determine
(i) a value of the Planck constant
(ii) the work function of silver in electron-volt.
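The two worked calculations in this part of the chapter, the energy of a 120 nm photon and the threshold frequency for a 2.0 eV work function, follow directly from E = hf. The sketch below uses the rounded constants that appear in the text; the function names are illustrative, and the final call uses a hypothetical frequency below threshold simply to show that no emission is predicted there.

```python
H = 6.6e-34         # Planck constant, J s (rounded value used in the text)
C = 3.0e8           # speed of light, m s^-1
E_CHARGE = 1.6e-19  # electron charge, C


def photon_energy_j(wavelength_m):
    """E = hf = hc / lambda, in joules."""
    return H * C / wavelength_m


def threshold_frequency_hz(work_function_ev):
    """f0 = phi / h, with phi converted from eV to joules."""
    return work_function_ev * E_CHARGE / H


def max_kinetic_energy_ev(frequency_hz, work_function_ev):
    """Einstein photoelectric equation; zero below the threshold frequency."""
    energy_ev = H * frequency_hz / E_CHARGE - work_function_ev
    return max(energy_ev, 0.0)


print(f"E(120 nm) = {photon_energy_j(120e-9):.2e} J")
print(f"f0(2.0 eV) = {threshold_frequency_hz(2.0):.2e} Hz")
print(max_kinetic_energy_ev(4.0e14, 2.0))  # hypothetical sub-threshold light
```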
$E = hf = mc^2$

from which

$\dfrac{hc}{\lambda} = mc^2$, that is, $mc = \dfrac{h}{\lambda}$

But mc is the momentum p of the photon, so that

$p = \dfrac{h}{\lambda}$

Based on this result, the de Broglie hypothesis is that any particle will have an associated wavelength given by $p = \dfrac{h}{\lambda}$. The waves to which the wavelength relates are called matter waves. For a person of 70 kg running with a speed of 5 m s$^{-1}$, the wavelength associated with the person is given by

$\lambda = \dfrac{h}{p} = \dfrac{6.6 \times 10^{-34}}{70 \times 5} \approx 2 \times 10^{-36}$ m

This wavelength is minute to say the least. However, consider an electron moving with a speed of $10^7$ m s$^{-1}$; then its associated wavelength is

$\lambda = \dfrac{h}{p} = \dfrac{6.6 \times 10^{-34}}{9.1 \times 10^{-31} \times 10^{7}} \approx 7 \times 10^{-11}$ m

Figure 1305 The scattering of electrons by a nickel crystal

Their vacuum system broke down and the crystal oxidized. To remove the oxidization, Davisson and Germer heated the crystal to a high temperature. On continuing the experiment they found that the intensity of the scattered electrons went through a series of maxima and minima: the electrons were being diffracted. The heating of the nickel crystal had changed it into a single crystal and the electrons were now behaving just as scattered X-rays do. (See Chapter 18 Topic G.6). Effectively, the lattice ions of the crystal act as a diffraction grating whose slit width is equal to the spacing of the lattice ions. Davisson and Germer were able to calculate the de Broglie wavelength of the electrons from the potential difference V through which they had been accelerated. Using the relationship between kinetic energy and momentum, we have

$E_k = Ve = \dfrac{p^2}{2m}$

Therefore

$p = \sqrt{2mVe}$

Using the de Broglie hypothesis $p = \dfrac{h}{\lambda}$, we have

$\lambda = \dfrac{h}{\sqrt{2mVe}}$

They knew the spacing of the lattice ions from X-ray measurements and so were able to calculate the predicted diffraction angles for a wavelength equal to the de Broglie wavelength of the electrons. The predicted angles were in close agreement with the measured angles and the de Broglie hypothesis was verified: particles behave as waves. Of course we now have a real dilemma; waves behave like particles and particles behave like waves. How can this be? This so-called wave-particle duality paradox was not resolved until the advent of Quantum Mechanics in
1926-27. There are plenty of physicists today who argue that it has still not really been resolved. To paraphrase what the late Richard Feynman once said, "If someone tells you that they understand Quantum Mechanics, they are fooling themselves."
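The contrast between the matter wavelength of an everyday object and that of an electron can be checked with a few lines of code. This is a minimal sketch of $\lambda = h/p$ with p = mv, using the rounded constants quoted in the chapter; the function name is illustrative.

```python
H = 6.6e-34            # Planck constant, J s
ELECTRON_MASS = 9.1e-31  # kg


def de_broglie_wavelength(mass_kg, speed_m_per_s):
    """lambda = h / p with non-relativistic momentum p = m v."""
    return H / (mass_kg * speed_m_per_s)


person = de_broglie_wavelength(70.0, 5.0)            # 70 kg runner at 5 m/s
electron = de_broglie_wavelength(ELECTRON_MASS, 1e7)  # electron at 10^7 m/s

print(f"person:   {person:.1e} m")
print(f"electron: {electron:.1e} m")
```

The electron wavelength is comparable with the spacing of ions in a crystal lattice, which is why a crystal can act as a diffraction grating for electrons but not for people.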
Calculate the de Broglie wavelength of an electron after acceleration through a potential difference of 75 V.
Solution

Use $\lambda = \dfrac{h}{\sqrt{2mVe}} = \dfrac{6.6 \times 10^{-34}}{\sqrt{2 \times 9.1 \times 10^{-31} \times 75 \times 1.6 \times 10^{-19}}} = 0.14$ nm.
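The same calculation can be written as a small function, which makes it easy to try other accelerating potentials or other particles. This is a sketch assuming non-relativistic motion, so that $Ve = p^2/2m$; the function name and default arguments are illustrative.

```python
import math

H = 6.6e-34  # Planck constant, J s


def wavelength_after_acceleration(potential_v, mass_kg=9.1e-31,
                                  charge_c=1.6e-19):
    """de Broglie wavelength h / sqrt(2 m V e) after acceleration from rest."""
    momentum = math.sqrt(2 * mass_kg * potential_v * charge_c)
    return H / momentum


wavelength = wavelength_after_acceleration(75.0)
print(f"{wavelength * 1e9:.2f} nm")
```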
IBO 2007
Exercise 13.1 (b)

1. Repeat the example above but for a proton.

2. Determine the ratio of the de Broglie wavelength of an electron to that of a proton accelerated through the same magnitude of potential difference.
Example
The diagram below shows some of the energy levels of the hydrogen atom.
[Energy level diagram: levels at -0.85 eV, -1.50 eV and -3.4 eV, with the electron transitions A and B marked]
Calculate the frequency associated with the photon emitted in each of the electron transitions A and B.
Solution

Use $\Delta E = hf$ to give $f_A$ and $f_B$.

The corresponding wavelengths are $\lambda_A$ = 470 nm and $\lambda_B$ = 1900 nm. Transition A gives rise to the blue line in the visible spectrum of atomic hydrogen and B to a line in the infrared region of the spectrum.
$\lambda_n = \dfrac{2L}{n}$ where n = 1, 2, 3 …

$p_n = \dfrac{h}{\lambda_n} = \dfrac{nh}{2L}$
To understand how this links in with the de Broglie hypothesis and wave functions, consider a situation in which the momentum of a particle is known precisely. In this situation, the wavelength is given by $\lambda = \dfrac{h}{p}$ and is completely defined. But for a wave to have a single wavelength it must be infinite in time and space. For example if you switch on a sine-wave signal generator and observe the waveform produced on an oscilloscope, you will indeed see a single frequency/wavelength looking wave. However, when you switch off the generator, the wave amplitude decays to zero and in this decay there will be lots of other wavelengths present. So if you want a pure sine wave, don't ever switch the generator off and, conversely, never switch it on. For our particle what this means is that if its momentum is defined precisely, then its associated probability wave is infinite in extent and we have no idea where it is. Effectively, the more precisely we know the momentum of a particle, the less precisely we know its position and vice versa.

From the argument above, we see that in the real world waves are always made up of a range of wavelengths and form what is called a wave group. If this group is of length $\Delta x$, then classical wave theory predicts that the wavelength spread in the group is given by

$\Delta\left(\dfrac{1}{\lambda}\right) \approx \dfrac{1}{\Delta x}$

If we consider the wave group to be associated with a particle i.e. a measure of the particle momentum, then

$\dfrac{1}{\lambda} = \dfrac{p}{h}$

that is

$\dfrac{\Delta p}{h} \approx \dfrac{1}{\Delta x}$

or

$\Delta p\,\Delta x \approx h$

We have in fact used the classical idea of a wave but we have interpreted the term $\dfrac{1}{\lambda}$ as a measure of the
Figure 1307
This is the reason why spectral lines have finite width. For a spectral line to have a single wavelength, there must be no uncertainty in the difference of energy between the associated energy levels. This would imply that the electron must make the transition between the levels in zero time.
The kinetic energy of the α-particle when it is a long way from the nucleus is $E_k$. As it approaches the nucleus, due to the Coulomb force, its kinetic energy is converted into electrostatic potential energy. At the distance of closest approach d all the kinetic energy will have become potential energy and the α-particle will be momentarily at rest. Hence we have that

$E_k = \dfrac{Ze \times 2e}{4\pi\varepsilon_0 d} = \dfrac{Ze^2}{2\pi\varepsilon_0 d}$

where Z is the proton number of gold such that the charge of the nucleus is Ze. The charge of the α-particle is 2e. For an α-particle with kinetic energy 4.0 MeV (that is, $4.0 \times 10^6 \times e$ joules, so that one factor of e cancels) we have that

$d = \dfrac{Ze^2}{2\pi\varepsilon_0 E_k} = \dfrac{79 \times 1.6 \times 10^{-19}}{2 \times 3.14 \times 8.85 \times 10^{-12} \times 4.0 \times 10^{6}} \approx 5.7 \times 10^{-14}$ m

The distance of closest approach will of course depend on the initial kinetic energy of the α-particle. However, as the energy is increased a point is reached where Coulomb scattering no longer takes place. The above calculation is therefore only an estimate. It has been demonstrated that at separations of the order of $10^{-15}$ m, the Coulomb force is overtaken by the strong nuclear force.
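The estimate of the distance of closest approach can be reproduced numerically. The sketch below assumes a head-on approach in which all of the kinetic energy becomes electrostatic potential energy, and converts the kinetic energy from electron-volts to joules explicitly; the function name is illustrative.

```python
import math

EPSILON_0 = 8.85e-12  # permittivity of free space, F m^-1
E_CHARGE = 1.6e-19    # elementary charge, C


def closest_approach_m(proton_number, kinetic_energy_ev):
    """Distance d at which E_k = (Ze)(2e) / (4 pi eps0 d) for an alpha-particle."""
    kinetic_energy_j = kinetic_energy_ev * E_CHARGE
    numerator = (proton_number * E_CHARGE) * (2 * E_CHARGE)
    return numerator / (4 * math.pi * EPSILON_0 * kinetic_energy_j)


# Gold nucleus (Z = 79) and a 4.0 MeV alpha-particle
d = closest_approach_m(79, 4.0e6)
print(f"d = {d:.1e} m")
```

Doubling the kinetic energy halves the distance, which shows why higher-energy α-particles eventually probe separations at which the strong nuclear force matters.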
[Diagram labels: slits S1, S2 and S3; plates P1 and P2; plate B; regions X and Y]
electron transitions in the hydrogen atom which are only of the order of several eV. The existence of nuclear energy levels receives complete experimental verification from the fact that γ-rays from radioactive decay have discrete energies consistent with the energies of the α-particles emitted by the parent nucleus. Not all radioactive transformations give rise to γ-emission and in this case the emitted α-particles all have the same energies.
Figure 1308 Measuring nuclear masses

The electric field is produced by the plates P1 and P2 and the magnetic field by a Helmholtz coil arrangement. The region X acts as a velocity selector. If the magnitude of the electric field strength in this region is E and that of the magnetic field strength is B (and the magnitude of the charge on an ion is e) only those ions which have a velocity v given by the expression Ee = Bev will pass through the slit S3 and so enter the main body, Y, of the instrument. A uniform magnetic field, B, exists in this region and in such a direction as to make the ions describe circular orbits. From Sections 2.4 and 6.3 we see that for a particular ion the radius r of the orbit is given by,
Radioactive decay
13.2.4 Describe $\beta^+$ decay, including the existence of the neutrino.
13.2.5 State the radioactive decay law as an exponential function and define the decay constant.
13.2.6 Derive the relationship between decay constant and half-life.
13.2.7 Outline methods for measuring the half-life of an isotope.
13.2.8 Solve problems involving radioactive half-life.
IBO 2007
${}^{1}_{1}\text{p} \rightarrow {}^{1}_{0}\text{n} + {}^{0}_{+1}\text{e} + \nu$

It is found that the energy spectrum of the β-particles is continuous whereas that of any γ-rays involved is discrete. This was one of the reasons that the existence of the neutrino was postulated, otherwise there is a problem with the conservation of energy. α-decay clearly indicates the existence of nuclear energy levels so something in β-decay has to account for any energy difference between the maximum β-particle energy and the sum of the γ-ray plus intermediate β-particle energies. We can illustrate how the neutrino accounts for this discrepancy by referring
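$\dfrac{\Delta N}{\Delta t} = -\lambda N$

where λ is the constant of proportionality called the decay constant and is defined as the probability of decay of a nucleus per unit time. The above equation should be written as a differential equation i.e.

$\dfrac{dN}{dt} = -\lambda N$

such that

$\dfrac{dN}{N} = -\lambda\,dt$

Integrating gives

$\ln N = -\lambda t + \text{constant}$

If $N = N_0$ at $t = 0$, then

$\ln N_0 = \text{constant}$

therefore

$\ln N - \ln N_0 = -\lambda t$

or

$N = N_0 e^{-\lambda t}$

This is the radioactive decay law and verifies mathematically the exponential nature of radioactive decay that we introduced in 7.2.6.

After one half-life $T_{1/2}$ the number of nuclei has fallen to $N_0/2$, so

$\dfrac{N_0}{2} = N_0 e^{-\lambda T_{1/2}}$

$e^{-\lambda T_{1/2}} = \dfrac{1}{2}$, that is, $T_{1/2} = \dfrac{\ln 2}{\lambda}$

Figure 1309 (label: excited level of daughter nucleus)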
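The decay law and the half-life relationship derived here translate directly into code. The following is a minimal sketch, with illustrative function names, that assumes time and half-life are given in the same units.

```python
import math


def decay_constant(half_life):
    """lambda = ln 2 / T_half (units: 1 / unit of half_life)."""
    return math.log(2) / half_life


def remaining_fraction(time, half_life):
    """N / N0 = exp(-lambda * t) from the radioactive decay law."""
    return math.exp(-decay_constant(half_life) * time)


def activity(number_of_nuclei, half_life_s):
    """Activity in Bq, A = lambda * N, with the half-life in seconds."""
    return decay_constant(half_life_s) * number_of_nuclei


# After one and two half-lives the sample falls to 1/2 and 1/4
print(remaining_fraction(1.0, 1.0), remaining_fraction(2.0, 1.0))
```

Because only the ratio of time to half-life enters the exponent, the same functions work for half-lives quoted in seconds, days or years.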
The figure shows how the neutrino accounts for the continuous spectrum without sacrificing the conservation of energy. An equivalent diagram can of course be drawn for $\beta^-$ decay with the neutrino being replaced by an anti-neutrino.
A sample of the isotope uranium-234 has a mass of 2.0 g. Its activity is measured as 3.0 103 Bq. The number of atoms in the sample is
2.0 10 6 2.0 10 6 = 3.3 10-16 NA 6.0 10 23
Using $\dfrac{\Delta N}{\Delta t} = -\lambda N$ we have
1.
The isotope radium-223 has a half-life of 11.2 days. Determine
(i) the decay constant for radium-223
(ii) the fraction of a given sample that will have decayed after 3 days.
Solution
(i) (ii)
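One possible way to work the radium-223 example is sketched below. It assumes the decay law $N = N_0 e^{-\lambda t}$ with $\lambda = \ln 2 / T_{1/2}$ and keeps time in days throughout; the variable names are illustrative.

```python
import math

HALF_LIFE_DAYS = 11.2  # half-life of radium-223 from the example

# (i) decay constant in day^-1
decay_constant_per_day = math.log(2) / HALF_LIFE_DAYS

# (ii) fraction of the sample that has decayed after 3 days
fraction_remaining = math.exp(-decay_constant_per_day * 3.0)
fraction_decayed = 1.0 - fraction_remaining

print(f"(i) lambda = {decay_constant_per_day:.3f} per day")
print(f"(ii) fraction decayed = {fraction_decayed:.2f}")
```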
Exercise 13.2

1. The isotope technetium-99 has a half-life of 6.02 hours. A freshly prepared sample of the isotope has an activity of 640 Bq. Calculate the activity of the sample after 8.00 hours.

2. A radioactive isotope has a half-life of 18 days. Calculate the time it takes for 2/5 of the atoms in a sample of the isotope to decay.

3. A nucleus of potassium-40 decays to a stable nucleus of argon-40. The half-life of potassium-40 is $1.3 \times 10^9$ yr. In a certain lump of rock, the amount of potassium-40 is 2.1 g and the amount of trapped argon-40 is 1.7 g. Estimate the age of the rock.
DIGITAL TECHNOLOGY
14.1 (SL Option C.1) Analogue and digital signals
14.2 (SL Option C.2) Data capture; digital imaging using CCDs
14
AHL
In effect, the central processing chip of your computer consists of millions of transistors acting as electronic switches. So if computers work with binary numbers and we work with decimal numbers, we need to be able to convert decimal to binary and vice versa. Converting from decimal to binary is a little tedious. Essentially, you have to find the largest power of 2 that is not greater than the decimal number, subtract it and put a 1 against that power. You then repeat this with the number that is left, putting a 0 against any power of 2 that is too large to subtract, until you reach $2^0$. Once the number left is zero, the remaining powers of 2 all get zeros. We can see how this works for the number 236 by looking at Table 1401.

power of 2            $2^7$  $2^6$  $2^5$  $2^4$  $2^3$  $2^2$  $2^1$  $2^0$
subtractions          128    64     32     16     8      4      2      1
denary number left    108    44     12     12     4      0      0      0
binary                1      1      1      0      1      1      0      0
Figure 1401 Converting from binary to decimal

The conversion therefore gives 236 in binary as 11101100 = $1 \times 2^7 + 1 \times 2^6 + 1 \times 2^5 + 0 \times 2^4 + 1 \times 2^3 + 1 \times 2^2 + 0 \times 2^1 + 0 \times 2^0$ = 128 + 64 + 32 + 8 + 4 = 236.
CHAPTER 14
Clearly, as seen from this example, converting binary into denary is a lot simpler. To emphasise the point the binary number 10011 = $1 \times 2^4 + 0 \times 2^3 + 0 \times 2^2 + 1 \times 2^1 + 1 \times 2^0$ = 16 + 2 + 1 = 19. NB. In the IB examination problems will be limited to 5 digit binary numbers.
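Both conversions can be written as short functions that follow the procedure described in the text: repeated subtraction of powers of 2 for denary to binary, and a weighted sum of powers of 2 for binary to denary. This is a sketch; the function names and the fixed width of 8 bits are illustrative assumptions.

```python
def denary_to_binary(number, bits=8):
    """Convert by subtracting powers of 2 from the largest downwards."""
    if not 0 <= number < 2 ** bits:
        raise ValueError("number does not fit in the given number of bits")
    digits = []
    for power in range(bits - 1, -1, -1):
        value = 2 ** power
        if number >= value:
            digits.append("1")   # this power of 2 can be subtracted
            number -= value
        else:
            digits.append("0")   # too large to subtract, so record a zero
    return "".join(digits)


def binary_to_denary(binary_string):
    """Sum each digit multiplied by its power of 2."""
    return sum(int(digit) * 2 ** power
               for power, digit in enumerate(reversed(binary_string)))


print(denary_to_binary(236))      # the number used in Table 1401
print(binary_to_denary("10011"))  # the 5 digit example in the text
```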
Exercise
14.1 (a)
1. 2.
Figure 1402 A binary pulse (label: reference pulse)
If a microphone is attached to an oscilloscope and you sing into the microphone then the sound of your voice is converted into an analogue voltage signal that will be displayed on the screen of the oscilloscope. We can now see, from Figure 1402, the principle behind converting your voice into a digital signal.
Example
Laser light of wavelength 780 nm is used to read the data stored on a CD. Calculate the minimum height of the bumps (depth of the pits) that must be etched onto the CD in order that the stored data can be read.
Solution
The light reflected from the flat, that is the bottom of the pit created by the bumps, travels a distance 2d further to the receiver than the light reflected from the bump (see Figure 1404). For destructive interference the path difference between the light equals $\dfrac{\lambda}{2}$.

Hence $2d = \dfrac{\lambda}{2}$, so $d = \dfrac{\lambda}{4} = \dfrac{780\ \text{nm}}{4} = 195$ nm.

[Figure 1403: CD structure and CD micro-structure. Labels: plastic; depth 150 nm; 0.5 µm; 1.6 µm]
Information is encoded onto the surface of the aluminium as a series of bumps and flats. The bumps represent a binary zero and the flats a binary 1. The dimensions of the bumps and flats are very small indeed, about 0.5 m wide, 0.83 m long and about 150 nm deep. The bumps and flats are arranged along tracks which are separated by about 1.6 m as is shown in Figure 1403 (b). It is this that enables such a great deal of data to be stored in such a relatively small area. To read the data, light from a laser is shone onto the aluminium. The bumps appear as pits on the aluminium side, but on the side that the laser reads, they are indeed bumps. For this reason you will often come across the term pit with reference to CDs and DVDs. The laser light reflected from the flat will be read as a binary one. However, if the depth of the bumps and the wavelength of the light is just right, light reflected from the edge of the bump and the flat will interfere destructively and the absence of reflected light will be read as a zero. We can illustrate this by means of the following example.
Figure 1404
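The pit-depth estimate in the example above can be checked numerically. The short sketch below assumes, as the example does, that the path difference is simply 2d and ignores any change of wavelength inside the plastic; the function name is chosen only for illustration.

```python
def minimum_pit_depth(wavelength_m):
    """Smallest depth d for destructive interference: 2d = wavelength / 2."""
    return wavelength_m / 4


cd_wavelength = 780e-9  # wavelength used in the example, in metres
depth = minimum_pit_depth(cd_wavelength)
print(f"minimum depth = {depth * 1e9:.0f} nm")  # minimum depth = 195 nm
```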
The essential difference between a CD and a DVD is in terms of the amount of data that they can store. Typically a CD can store 650 MB and a DVD 4.7 GB. The reason for this is that in a DVD the pits are 0.44 µm in diameter and the tracks are only 0.74 µm apart. The laser light used to read data from a CD or DVD is focussed by a lens onto the pits and flats. The lens acts as a circular aperture and produces a diffraction pattern on the disk. If more than one pit falls inside the central maximum of the diffraction pattern, the pits cannot be resolved and information is lost. The Rayleigh criterion (see 11.4) gives the condition for resolution, namely
$\theta = 1.22\dfrac{\lambda}{b}$

where $\theta$ is the angular radius of the central maximum of the diffraction pattern, $\lambda$ is the wavelength of the light and b is the diameter of the lens.

If d is the distance of the focussing lens from the CD surface and r is the radius of the principal maximum of the diffraction pattern formed on the surface, then

$r = 1.22\dfrac{\lambda d}{b}$

Since, for a DVD, r is smaller than that for a CD, the above equation shows that the wavelength of the laser light used to read the data must be less than that used for a CD (provided all other quantities remain constant).

ANALOGUE FORM

The advantages of storing data in digital form as opposed to analogue form can be summarised as follows:

• quality and corruption
• reproducibility (accuracy)
• portability and high capacity
• manipulation
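Returning briefly to the resolution condition for reading a disk, $r = 1.22\lambda d / b$, the following sketch compares the radius of the central maximum for two laser wavelengths. The lens distance and lens diameter used here are hypothetical round numbers chosen only for illustration; the wavelengths of 780 nm and 640 nm are typical CD and DVD values.

```python
def spot_radius(wavelength_m, lens_distance_m, lens_diameter_m):
    """Radius of the central diffraction maximum, r = 1.22 * wavelength * d / b."""
    return 1.22 * wavelength_m * lens_distance_m / lens_diameter_m


LENS_DISTANCE = 1.0e-3  # hypothetical value of d, in metres
LENS_DIAMETER = 1.0e-3  # hypothetical value of b, in metres

r_cd = spot_radius(780e-9, LENS_DISTANCE, LENS_DIAMETER)
r_dvd = spot_radius(640e-9, LENS_DISTANCE, LENS_DIAMETER)

# With d and b fixed, the radius scales directly with wavelength.
print(f"ratio of DVD radius to CD radius = {r_dvd / r_cd:.2f}")
```

Whatever values are chosen for d and b, the ratio depends only on the two wavelengths, which is why a shorter wavelength is needed to resolve the smaller pits of a DVD.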
14.1.5 Solve problems on CDs and DVDs related to data storage capacity.
14.1.6 Discuss the advantage of the storage of information in digital rather than analogue form.
14.1.7 Discuss the implications for society of ever-increasing capability of data storage.
IBO 2007
Reproducibility (accuracy)
When data is stored we often need to know where it is stored in the storage device. Digital data enables an address to be given to each part of the storage system. Also, numeric data can be stored to a much higher degree of accuracy than analogue data (think about the thermometer mentioned in 14.1.2). We have also seen that the same data can be retrieved over and over again without it being corrupted. Storing alphabetic data in analogue form is fraught with problems, not least in reproducing the data accurately. With digital storage, each letter of the alphabet and each punctuation symbol can be assigned a specific binary number. The digital code used is called the ASCII code - A(merican) S(tandard) C(ode for) I(nformation) I(nterchange).
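As an illustration of how each character is assigned a binary number, the sketch below lists the 7-bit ASCII codes for a short piece of text. It relies on Python's built-in `ord` and `chr`; the helper names are chosen only for this example.

```python
def to_ascii_bits(text):
    """Return the 7-bit ASCII code of each character as a string of 0s and 1s."""
    codes = []
    for character in text:
        number = ord(character)
        if number > 127:
            raise ValueError("character is outside the ASCII range")
        codes.append(format(number, "07b"))
    return codes


def from_ascii_bits(codes):
    """Rebuild the text from a list of binary ASCII codes."""
    return "".join(chr(int(code, 2)) for code in codes)


print(to_ascii_bits("CD"))                      # ['1000011', '1000100']
print(from_ascii_bits(["1000100", "1010110"]))  # DV
```

Because every character maps to one fixed number, the stored text can be read back exactly as many times as required.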
1. A typical wavelength of light used to read data from a CD is 780 nm and from a DVD is 640 nm. Assuming that the physical dimensions of the read-out mechanism for a CD and DVD are approximately the same, estimate the ratio of the depth of pit on a CD compared to the depth of pit on a DVD.

2. The wavelength used to read data from a DVD in a particular DVD player is 635 nm. Estimate the depth of the pits on the DVD.
Manipulation
The fact that numeric information can be stored accurately in a digital form, in electronic calculators for example, means not only that it is much easier to manipulate and process the data but also that the results of calculations should be less prone to error. Also, the fact that alphabetic data is easily stored in digital form means that the data can readily be sorted and manipulated, as in a database.
The SI unit of capacitance, C V$^{-1}$, is called the farad (F). A device that is manufactured to have a specific capacitance is called a capacitor. A farad is an enormous unit and typical values of capacitance of commercial capacitors range from pF to several µF. To show just how large a unit the farad is, the capacitance of Earth is only about $10^{-3}$ F ($4\pi \times 8.85 \times 10^{-12} \times 6.4 \times 10^{6}$).
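The figure quoted for the Earth comes from the capacitance of an isolated sphere, $C = 4\pi\varepsilon_0 R$. A minimal check, using the same values of $\varepsilon_0$ and the Earth's radius as in the bracket above, might look like this (the function name is illustrative only):

```python
import math

EPSILON_0 = 8.85e-12   # permittivity of free space, in F per metre
EARTH_RADIUS = 6.4e6   # radius of the Earth, in metres


def sphere_capacitance(radius_m):
    """Capacitance of an isolated conducting sphere, C = 4 * pi * epsilon_0 * R."""
    return 4 * math.pi * EPSILON_0 * radius_m


print(f"{sphere_capacitance(EARTH_RADIUS):.1e} F")  # 7.1e-04 F
```

The result is a little under one thousandth of a farad, which is the order of magnitude stated in the text.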
generated by light liberating electrons from the valence band of a semiconductor (Topic 8.1.12). Figure 1405 shows the basic structure of a CCD.

[Figure 1405 labels: electrodes (+ + + +), silicon oxide, pixels, silicon]

Figure 1405

A layer of silicon dioxide about 5 µm thick is placed on the surface of a silicon substrate about 500 µm thick. The silicon dioxide is divided into regions called pixels. Each pixel contains three electrodes. Each pixel essentially acts as a capacitor.

Example

Suppose that a pixel has a capacitance of 40 pF and, as the result of light incident on the pixel for a period of 30 ms, the change in potential across the pixel is 0.24 mV. Calculate the rate at which photons are incident on the pixel.

Solution

From $C = \dfrac{Q}{V}$ we have $Q = CV = 0.24 \times 10^{-3} \times 4.0 \times 10^{-11} = 9.6 \times 10^{-15}$ C.

Number of photons = number of electrons produced = $\dfrac{9.6 \times 10^{-15}}{1.6 \times 10^{-19}} = 6.0 \times 10^{4}$

Rate at which photons are incident = $\dfrac{6.0 \times 10^{4}}{30 \times 10^{-3}} = 2.0 \times 10^{6}$ s$^{-1}$

14.2.5 Define quantum efficiency of a pixel.
14.2.6 Define magnification.
14.2.7 State that two points on an object may be just resolved on a CCD if the images of the points are at least two pixels apart.
14.2.8 Discuss the effects of quantum efficiency, magnification and resolution on the quality of the processed image.
14.2.9 Describe a range of practical uses of a CCD, and list some advantages compared with the use of film.
14.2.10 Outline how the image stored in a CCD is retrieved.
14.2.11 Solve problems involving the use of CCDs.
IBO 2007

Quantum efficiency

In the foregoing example, we assumed that each photon that is incident on a pixel gives rise to an electron-hole pair. However, in practice some photons will be scattered by the surface of the CCD and some might reach the
silicon substrate without interaction with a valence electron in the silicon oxide. The percentage of photons in the incident light that produce electron-hole pairs is called the quantum efficiency and it is usually in the range 70-80%.

A word of warning here. Many digital cameras use what is called interpolated resolution. What this means is that software is used to add pixels to the final image, for example adding more blue pixels to the parts of the image that are blue. Although this process increases the overall file size, it does not add any further information to the image. This process is similar to that employed by the many photo-editing programs that are available. However, remember that the only way that you can get more detail from a CCD image is to physically increase the number of pixels on the CCD.
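To see what quantum efficiency means numerically, the sketch below takes the pixel of the earlier example (40 pF, a potential change of 0.24 mV, an exposure of 30 ms) and works out the photon arrival rate, first assuming every photon produces an electron and then assuming a quantum efficiency of 75%. The 75% figure is simply an illustrative value from the 70-80% range quoted above, and the function names are hypothetical.

```python
ELECTRON_CHARGE = 1.6e-19  # magnitude of the electron charge, in coulombs


def electrons_released(capacitance_f, potential_change_v):
    """Number of electrons corresponding to the charge Q = C * V."""
    return capacitance_f * potential_change_v / ELECTRON_CHARGE


def photon_arrival_rate(capacitance_f, potential_change_v, exposure_s,
                        quantum_efficiency=1.0):
    """Photons arriving per second, allowing for photons that free no electron."""
    electrons = electrons_released(capacitance_f, potential_change_v)
    photons = electrons / quantum_efficiency
    return photons / exposure_s


ideal = photon_arrival_rate(40e-12, 0.24e-3, 30e-3)
realistic = photon_arrival_rate(40e-12, 0.24e-3, 30e-3, quantum_efficiency=0.75)
print(f"{ideal:.1e} photons per second")      # 2.0e+06 photons per second
print(f"{realistic:.1e} photons per second")  # 2.7e+06 photons per second
```

A lower quantum efficiency means that more photons must have arrived to produce the same change in potential.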
Magnification
The magnification of a CCD is defined as the ratio of the length of the image on the CCD to the length of the object.
Resolution
The resolution of a CCD refers to the total number of pixels that there are in the image collecting area of the CCD. However, usually the resolution is defined in terms of the pixel array. So, for example, a CCD might be said to have a resolution of 1800 × 1600, that is, the image collecting area is 1800 pixels wide and 1600 pixels long. Alternatively it could be said to have a resolution of 2.88 megapixels (1800 × 1600). It is worth mentioning that the human eye collects digital data and in this respect the retina has a resolution of about 11000 × 11000 or about 120 megapixels.

Although the number of pixels defines the resolution of a CCD, we still need to know how good a particular device is in actually resolving an image optically. For example, if we use a digital camera to photograph a distant binary star system, will the CCD be able to resolve the two stars into separate images? Essentially, the answer is yes, provided that the separation of the images of the stars formed on the CCD is at least two pixels. If, say, for a particular CCD, the pixel length is $1.5 \times 10^{-5}$ m then, for the binary star system, the two images on the CCD will just be resolved if they are separated by $3.0 \times 10^{-5}$ m.
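The ideas of pixel count, magnification and the two-pixel condition can be put together in a short sketch. The function names are chosen for illustration, and the two-pixel rule is applied exactly as stated above.

```python
def megapixels(width_px, height_px):
    """Total number of pixels in the array, expressed in megapixels."""
    return width_px * height_px / 1e6


def magnification(image_length_m, object_length_m):
    """Ratio of the length of the image on the CCD to the length of the object."""
    return image_length_m / object_length_m


def just_resolved(image_separation_m, pixel_length_m):
    """Two image points are resolved if they are at least two pixels apart."""
    return image_separation_m >= 2 * pixel_length_m


print(megapixels(1800, 1600))          # 2.88
print(just_resolved(3.0e-5, 1.5e-5))   # True
print(just_resolved(2.0e-5, 1.5e-5))   # False
```

To test two points on an object, multiply their separation by the magnification to get the separation of the images before applying the two-pixel condition.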
• light from an object is brought to a focus on the collection area
• the light incident on the collection area varies in intensity and wavelength
• the number of electrons ejected from each pixel will vary from pixel to pixel
• the potential change associated with each pixel varies from pixel to pixel
• the potential changes across the collection area are a map of the image of the object on the collection area
• each change of pd associated with a given pixel is converted to a digital signal
• the position of each pixel is recorded digitally
• these digital signals are converted to an image on an LCD screen (see 11.5.9/A.6.9)
In our discussion of CCDs we should bear in mind that we have not looked in any detail at the other parts of the imaging device. For example, the device will have electronics that are used for image retrieval and also an optical system for focussing light on to the CCD. Furthermore, the pixels do not produce colour. In digital cameras, an array of colour filters is used to register the intensity of a single colour at each pixel. Software then uses the intensities of colours at neighbouring pixels to estimate the intensity of the colour at a particular pixel. Finally it has to be mentioned that CCDs are likely to be soon supplanted by CMOS devices. The latter are much cheaper to produce and are also manufactured in much the same way as the integrated circuits used in modern microprocessors. However, the basic principles of image formation and retrieval are the same as those of a CCD.

Example

The collection area of a CCD used in a particular digital camera has an area of 30 mm × 30 mm. Each pixel of the CCD has an area of $2.2 \times 10^{-10}$ m$^2$. Estimate the resolution of the digital camera.

Solution

number of pixels = $\dfrac{(3.0 \times 10^{-2})^2}{2.2 \times 10^{-10}} = 4.1 \times 10^{6}$

Example

Light of wavelength 430 nm and intensity 1.4 mW m$^{-2}$ is incident on a pixel of area $2.2 \times 10^{-10}$ m$^2$ for 20 ms. The capacitance of the pixel is 25 pF. Calculate the change in potential difference across the pixel if the quantum efficiency of the CCD is 70%.

Solution

energy of photon = $\dfrac{hc}{\lambda} = \dfrac{6.6 \times 10^{-34} \times 3 \times 10^{8}}{4.3 \times 10^{-7}} = 4.6 \times 10^{-19}$ J

number of photons incident per second per m$^2$ = $\dfrac{1.4 \times 10^{-3}}{4.6 \times 10^{-19}} = 3.0 \times 10^{15}$

number incident on pixel in 20 ms = $3.0 \times 10^{15} \times 2.2 \times 10^{-10} \times 2.0 \times 10^{-2} = 1.3 \times 10^{4}$

number of electron-hole pairs produced = $0.7 \times 1.3 \times 10^{4} = 9.1 \times 10^{3}$

charge produced = $9.1 \times 10^{3} \times 1.6 \times 10^{-19} = 1.5 \times 10^{-15}$ C

using $C = \dfrac{Q}{V}$, then $V = \dfrac{Q}{C} = \dfrac{1.5 \times 10^{-15}}{2.5 \times 10^{-11}}$ = 0.060 mV

Exercise 14.2

1. A pixel has a capacitance of 40 pF. When illuminated for a short period of time, a change of potential of 0.20 mV is produced across the pixel. Estimate the number of electrons liberated from the valence band.

2. A CCD of a digital camera has an image collection area of 25 mm × 25 mm and a resolution of 5.0 megapixels. An object that is photographed by the camera has an area of $4.6 \times 10^{-3}$ m$^2$. The image area formed on the CCD is $1.0 \times 10^{-4}$ m$^2$.
(a) Calculate the magnification.
(b) Estimate the length of a pixel on the CCD.
(c) Two small dots on the object are separated by a distance of 0.20 mm. Discuss whether the images of the dots will be resolved.

3. Outline why, when the image area of a CCD is illuminated, the change in potential will vary from pixel to pixel.
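The two worked examples of this section (the number of pixels on a 30 mm × 30 mm collection area, and the change in potential across a pixel illuminated with 430 nm light) can be retraced step by step in code. The sketch below uses the rounded constants that appear in the solutions; the function names are illustrative and not part of any standard library.

```python
PLANCK = 6.6e-34           # Planck constant, in J s
LIGHT_SPEED = 3.0e8        # speed of light, in m per second
ELECTRON_CHARGE = 1.6e-19  # magnitude of the electron charge, in coulombs


def number_of_pixels(side_m, pixel_area_m2):
    """Pixels on a square collection area of the given side length."""
    return side_m ** 2 / pixel_area_m2


def pixel_potential_change(wavelength_m, intensity_w_m2, pixel_area_m2,
                           exposure_s, capacitance_f, quantum_efficiency):
    """Change in potential across a pixel after the exposure."""
    photon_energy = PLANCK * LIGHT_SPEED / wavelength_m
    photons = intensity_w_m2 * pixel_area_m2 * exposure_s / photon_energy
    electrons = quantum_efficiency * photons
    charge = electrons * ELECTRON_CHARGE
    return charge / capacitance_f


pixels = number_of_pixels(30e-3, 2.2e-10)
change = pixel_potential_change(430e-9, 1.4e-3, 2.2e-10, 20e-3, 25e-12, 0.70)
print(f"{pixels:.1e} pixels")    # 4.1e+06 pixels
print(f"{change * 1e3:.3f} mV")  # 0.060 mV
```

Keeping each physical step on its own line makes it easy to see how the answer scales: doubling the exposure time, for instance, doubles the change in potential.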
CHAPTER 15 (OPTION A)

A.1.1 Describe the basic structure of the human eye.
A.1.2 State and explain the process of depth of vision and accommodation.
A.1.3 State that the retina contains rods and cones, and describe the variation in density across the surface of the retina.
A.1.4 Describe the function of the rods and of the cones in photopic and scotopic vision.
A.1.5 Describe colour mixing of light by addition and subtraction.
A.1.6 Discuss the effect of light and dark, and colour, on the perception of objects.
IBO 2007

[Figure 1501 labels: Vitreous humour, Retina, Cornea, Sclera, Iris, Pupil, Lens, Optic nerve, Blood vessels, Conjunctiva, Blind spot]

Figure 1501 Horizontal section through the human eye

The eyeball lies in a special cavity in the skull that contains fatty tissue to protect the eye. The wall of the eye has 3 layers: the outer wall consisting of the sclera and cornea, the middle wall consisting of the choroid, ciliary muscle and iris, and the inner layer consisting of the retina. The white of the eye is moved about by six muscles, and the white fibrous coat at the front of the eye contains the cornea, which acts like a window of the eye. Inside the white fibrous coat is a black layer that makes the eye light-tight, and further in towards the centre is the colour-pigmented part called the iris, which contains muscles that adjust the amount of light entering the hole in its centre called the pupil. Behind the pupil is a converging lens that is connected to the ciliary muscles. The pupil is very small in bright light and relatively big in dim light. The shape of the lens is controlled by the ciliary muscles: the lens becomes rounder and shortens its focal length to view close objects and, for distant objects, it becomes less round and increases its focal length. In a normal eye, a real, diminished and inverted image of an object comes into focus on the retina after refraction in the lens. Figure 1502 lists the main similarities of the components of the eye and the camera that were briefly mentioned in the beginning of this section.

Component of the Eye    Component of the Camera
Cornea                  Aperture for admitting light
Iris                    Aperture diaphragm
Pupil                   Hole in the diaphragm
Lens                    Camera lens
Retina                  Film
Choroid                 Black lining
Sclera                  Camera case

Figure 1502

The range over which an eye can sharply focus an image is determined by what are known as the near point and far point of the eye. The near point is the position of the closest object that can be brought into focus by the unaided eye. The near point varies from person to person but it has been given an arbitrary value of 25 cm. The far point is the position of the furthest object that can be brought into focus by the unaided eye. The far point of a normal eye is at infinity. The ability of the eye to focus over this range is called accommodation and this is controlled by the ciliary muscles pulling or relaxing in order to change the focal length of the flexible eye lens. The eye is most relaxed, and so best suited to prolonged viewing, when viewing at the far point. The apparent size of an object can be increased by using a converging lens to allow the object to be brought closer to the eye, thus increasing the size of the image on the retina. This is the basis behind the simple magnifier.

[Figure 1503: scale marked 0, 20, 40, 60, 80; label: Blind spot]

Figure 1503
[Figure 1504: curves labelled Cone density and Rod density; horizontal scale from -80 to 80, vertical scale from 0 to 200]

Figure 1504
[Figure 1506: Absorbance (in %) against a scale from 380 to 780; labels: (575 nm), (705 nm)]

Figure 1506
Colour blindness or colour vision deficiency is most commonly a hereditary condition that can affect up to 12% of males and about 1% of females. About 99% of cases are red-green colour blindness, although blue-yellow colour deficiency also exists. It cannot be cured at this stage. It occurs either when there is a reduction of the pigment in the cones or when one of the types of cone is completely missing.

[Figure 1505: curve labelled Scotopic vision (dark adapted); horizontal scale from 400 to 700]

Figure 1505

Cones are responsible for photopic vision or high light-level vision, that is, colour vision under normal light
(a light blue) and red and green are mixed to form yellow. When the 3 primary colours are mixed, white light is obtained. In colour subtraction, cyan and magenta are mixed to form blue, magenta and yellow are mixed to obtain red, and yellow and cyan are mixed to form green. When all the secondary colours are mixed, black is obtained.
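One way to keep track of these mixing rules is to describe each colour by which of the three primaries (red, green, blue) it contains. The sketch below is a simplified model, assuming each primary is either fully present (1) or absent (0): adding lights keeps any primary present in either colour, while subtraction keeps only the primaries that every filter or pigment passes on. The names used are purely illustrative.

```python
# Each colour is written as (red, green, blue), with 1 = present and 0 = absent.
RED, GREEN, BLUE = (1, 0, 0), (0, 1, 0), (0, 0, 1)
CYAN, MAGENTA, YELLOW = (0, 1, 1), (1, 0, 1), (1, 1, 0)
WHITE, BLACK = (1, 1, 1), (0, 0, 0)


def add_colours(*colours):
    """Colour addition: a primary is present if any of the lights contains it."""
    return tuple(max(channel) for channel in zip(*colours))


def subtract_colours(*colours):
    """Colour subtraction: a primary survives only if every colour passes it."""
    return tuple(min(channel) for channel in zip(*colours))


print(add_colours(RED, GREEN) == YELLOW)              # True
print(add_colours(RED, GREEN, BLUE) == WHITE)         # True
print(subtract_colours(CYAN, MAGENTA) == BLUE)        # True
print(subtract_colours(CYAN, MAGENTA, YELLOW) == BLACK)  # True
```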
Perception is a process of acquiring, interpreting, selecting and organising sensory information. The eyes, with a reaction time of around 190 ms, can be used in combination to sense aspects of depth, colour and form. Depth is important in scenery background and in architecture. The further you look into the distance of scenery such as the Grand Canyon, the more blurred it becomes due to the scattering of light. Shadows give objects depth and scope depending on the direction that the sunlight is coming from. In architecture, deep shadow gives the impression of massiveness. Colour is a way of creating a feeling within an environment and perceiving what an object looks like against different colours. For example, if the eye concentrates on the central dot of four coloured dots set against a different coloured matrix, and the three other dots are made to rotate, the three dots disappear. The colour green evokes an impression of calmness while blue evokes a sense of depression. Red glow can give an impression of warmth while blue glow gives an impression of cold. Size is another aspect of how the brain perceives things. For example, light-coloured ceilings seem to heighten a room whereas softer colours tend to make a room seem smaller.

Exercise 15.1

1. Match the component of the eye that is similar to the component of the camera in the table below.

Component of the Eye    Component of the Camera
____________            Aperture for admitting light
____________            Aperture diaphragm
____________            Hole in the diaphragm
____________            Camera lens
____________            Film
____________            Black lining
____________            Camera case

2. (a) Describe the properties of an image formed in the eye.
(b) State the component of the eye where the image is formed.
(c) Name the coloured part of the eye.
(d) Describe the function of the ciliary muscles.

3. Define the terms near point and far point and state the arbitrary value of each term.

4. Distinguish between the 2 types of photoreceptors that are found in the retina.

5. Describe the function of the rods and of the cones in photopic and scotopic vision.
ASTROPHYSICS
E.1 (SL and HL) Introduction to the universe
E.2 (SL and HL) Stellar radiation and stellar types
E.3 (SL and HL) Stellar distances
E.4 (SL and HL) Cosmology
E.5 (HL only) Stellar processes and stellar evolution
E.6 (HL only) Galaxies and the expanding universe
16
Our planet, Earth, is an insignificant object orbiting an insignificant star, the Sun. The Sun is situated in one arm of an insignificant galaxy, the Milky Way, which contains around 200 billion stars. The Milky Way measures about $10^{5}$ light years from one end to the other, yet this enormous distance is tiny compared to the whole Universe which is about $10^{10}$ light years in diameter. A light year is the distance that light travels in a year. There are about $3.2 \times 10^{7}$ seconds in a year.

The Milky Way is one of about 25 galaxies that make up a so-called local cluster. Some 50 million light-years from our local cluster is another cluster of galaxies, the Virgo cluster, which contains about a thousand galaxies. There are other clusters that can contain as many as ten thousand galaxies. Amazingly, all these different clusters are grouped into a so-called supercluster. Between these superclusters are vast voids of empty space. However, interstellar and intergalactic space is not completely empty. It actually contains gas and microscopic dust particles, although the density is not very great. The density of interstellar space is estimated to be about $10^{-20}$ kg m$^{-3}$ and that of intergalactic space $10^{-25}$ kg m$^{-3}$.

Astrophysics is the science that tries to make sense of the Universe by providing a description of the Universe (Astronomy) and by trying to understand its structure and origin (Cosmology). It is a daunting subject, for not only does it encompass the whole historical grandeur of physics but it also embraces all of physics as we understand it today, the microscopic and the macroscopic. We cannot understand the structure of stars, their birth and their death unless we understand the very nature of matter itself and the laws that govern its behaviour. It even takes us beyond the realm of physics for, as did our earliest ancestors, we still look up at the stars and ask "who am I and what's it all about?". This Option can but scrape the surface of this truly vast topic.
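To put the light year into metres, the distance light travels in a year is the speed of light multiplied by the number of seconds in a year. The sketch below uses the rounded values $3.0 \times 10^{8}$ m s$^{-1}$ for the speed of light and $3.2 \times 10^{7}$ s for a year; the names are chosen only for this example.

```python
LIGHT_SPEED = 3.0e8        # speed of light, in m per second
SECONDS_PER_YEAR = 3.2e7   # approximate number of seconds in a year

LIGHT_YEAR = LIGHT_SPEED * SECONDS_PER_YEAR  # one light year, in metres


def light_years_to_metres(distance_ly):
    """Convert a distance in light years to metres."""
    return distance_ly * LIGHT_YEAR


print(f"{LIGHT_YEAR:.1e} m")                  # 9.6e+15 m
print(f"{light_years_to_metres(1e5):.1e} m")  # 9.6e+20 m
```

On this estimate the Milky Way, about $10^{5}$ light years across, is of the order of $10^{21}$ m from one end to the other.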
Galaxies
As mentioned, galaxies are vast collections of stars. There are essentially three types of galaxy and these are discussed in more detail in the AHL section.

Quasars
Quasars were first discovered in 1960 and their exact nature still remains a mystery. They are extremely bright objects having a luminosity equivalent to that of 1000 galaxies. They are also very distant (Quasar 3C273 is some 3 billion light years away) and they are also much smaller than any known galaxy.

Nebulae
Nebulae was the name originally given to misty type patterns in the night sky. Many such patterns are now recognised as being galaxies. Others are recognised as being the debris of a supernova, such as the famous Crab Nebula that was first recorded by the ancient Chinese astronomers. Other so-called dark nebulae, such as the Horsehead nebula, contain a large amount of gas and dust particles and are considered to be the birth places of the stars.

To conclude this introductory section, Figure 1601 shows the distances as orders of magnitude from Earth of various astronomical objects.

Object                            Distance from Earth / m
Quasar                            $10^{25}$
Nearest galaxy (Andromeda)        $10^{22}$
Centre of the Milky Way           $10^{20}$
North Star (Polaris)              $10^{19}$
Nearest star (Alpha Centauri)     $10^{17}$
Sun                               $10^{11}$
Moon                              $10^{8}$
last visible to the naked eye in 1986 and will make another appearance in 2061, which is considerably sooner than a comet called Hyakutake that has an orbital period of about 30,000 years. Figure 1602 summarises some of the details of the planets.

Name of planet   Number of moons   Average distance from the Sun / $10^{6}$ km   Equatorial diameter / $10^{3}$ km   Mass / $10^{23}$ kg
Mercury          None              57.9                                          4.9                                 3.3
Venus            None              108                                           12.0                                49
Earth            1                 150                                           12.8                                60
Mars             2                 228                                           6.8                                 6.4
Jupiter          16                778                                           143.0                               19000
Saturn           18                1430                                          120.5                               5700
Uranus           15                2900                                          51.2                                866
Neptune          8                 4500                                          49.5                                1030
Pluto            1                 5920                                          2.3                                 0.13
Stellar cluster
This is a number of stars that is held together in a group by gravitational attraction. The stars in the group were all created at about the same time and there can be many thousands of stars in a group.
Constellation
A constellation is a collection of stars that form a recognisable group as viewed from Earth. For example, there is a constellation called the Andromeda constellation which contains the galaxy called Andromeda. The ancient Greeks named many of the constellations and perhaps two of the most easily recognisable are the Big Dipper and the Great Bear. Constellations are useful landmarks for finding one's way around the night sky.
The position of the Sun relative to the fixed stars varies slowly over the course of a year. On the first day of spring, the vernal equinox, and the first day of autumn, the autumnal equinox, the Sun rises in the East, traverses the sky, sets in the West and day and night are of equal duration. During summer in the Northern Hemisphere the Sun rises in the Northeast and sets in the Northwest. Daylight hours are longer than the night-time hours and the further North one goes the longer the daylight hours. In fact, north of the Arctic Circle the Sun never sets. During winter in the Northern Hemisphere the Sun rises in the Southeast, moves close to the southern horizon and sets in the Southwest. Daylight hours are shorter than the night-time hours and north of the Arctic Circle the Sun never rises above the horizon. The Sun takes about a year to make a journey from West to East around the celestial sphere.

Against the background of the canopy of fixed stars, certain celestial objects do not move in circles. These objects wander back and forth against the backdrop, exhibiting what is called retrograde motion. These are the planets, and the word planet indeed comes from the Greek word for wanderer.

The Moon, like the Sun and the stars, traverses the night sky from East to West. The path of the Moon relative to the fixed stars is close to that followed by the Sun. However, it takes only about four weeks to complete a journey round the celestial sphere. The Moon's path around the celestial sphere varies from month to month but remains within about 8° either side of the path followed by the Sun. During the Moon's trip around the celestial sphere it exhibits different phases as seen from Earth, waxing from a new Moon crescent to a full Moon and then waning from the full Moon to a crescent new Moon.
E.2.10 Discuss the characteristics of spectroscopic and eclipsing binary stars.
E.2.11 Identify the general regions of star types on a Hertzsprung-Russell (HR) diagram.
IBO 2007
[Figure labels: star, L, Earth]

Exercise 16.2

The Sun is $1.5 \times 10^{11}$ m from the Earth. Estimate how much energy falls on a surface area of 1 m$^2$ in a year. State any assumptions that you have made.
Topic 8.5.10 introduced the Stefan-Boltzmann law for a black body. If we regard stars to be black body radiators, then the luminosity L of a star is given by the expression

$L = 4\pi R^{2} \sigma T^{4}$

where R is the radius of the star, T its surface temperature and $\sigma$ is the Stefan-Boltzmann constant. If we know the surface temperature and the radius of two stars then we can use the above equation to compare their luminosity. However, in practice we usually use the law to compare stellar radii as explained below.
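As a rough illustration of the Stefan-Boltzmann expression, the sketch below evaluates the luminosity of a star from its radius and surface temperature and compares two stars by ratio. The value used for the radius of the Sun, $6.96 \times 10^{8}$ m, and the value of $\sigma$ are standard figures assumed for the example rather than taken from this section; the surface temperature of 5800 K is the value used for the Sun elsewhere in this Option.

```python
import math

STEFAN_BOLTZMANN = 5.67e-8  # Stefan-Boltzmann constant, in W m^-2 K^-4
SUN_RADIUS = 6.96e8         # assumed radius of the Sun, in metres


def luminosity(radius_m, temperature_k):
    """Luminosity of a black-body star, L = 4 * pi * R^2 * sigma * T^4."""
    return 4 * math.pi * radius_m ** 2 * STEFAN_BOLTZMANN * temperature_k ** 4


def luminosity_ratio(radius_ratio, temperature_ratio):
    """Ratio of two luminosities from the ratios of radii and temperatures."""
    return radius_ratio ** 2 * temperature_ratio ** 4


print(f"{luminosity(SUN_RADIUS, 5800):.1e} W")  # 3.9e+26 W
print(luminosity_ratio(1, 2))                   # 16
```

The second line of output shows how sensitive luminosity is to temperature: doubling the surface temperature at fixed radius increases the luminosity sixteen-fold.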
$b = \dfrac{L}{4\pi d^{2}}$

where b is the apparent brightness of a star of luminosity L at a distance d from the Earth.

The apparent brightness of a star can be measured by attaching a radiation-sensitive instrument known as a bolometer to a telescope. If d can be measured then the luminosity of the star can be determined. This is a very important property to know as it gives clues to the internal structure of the star, its age and its future evolution. If all stars were equally bright, then the further away from the Earth a star is, the less its apparent brightness would be. For example, suppose that star A is at a distance d, measured by the parallax method (see below). A star of the same luminosity at a distance 2d would have a quarter of the apparent brightness of star A, and one at a distance 4d would have one sixteenth the apparent brightness of star A. The apparent brightness falls off as the inverse square of distance. However, stars are not all of the same brightness, so unless we know a star's luminosity we cannot use a measurement of its apparent brightness to find its distance from the Earth.
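The inverse-square behaviour of apparent brightness can be checked with a few lines of code. In the sketch below the luminosity and distance of the star are hypothetical round numbers, used only to show the factor of four and sixteen; the function names are illustrative.

```python
import math


def apparent_brightness(luminosity_w, distance_m):
    """Apparent brightness b = L / (4 * pi * d^2), in W per square metre."""
    return luminosity_w / (4 * math.pi * distance_m ** 2)


def distance_from_brightness(luminosity_w, brightness_w_m2):
    """Distance of a star of known luminosity from its apparent brightness."""
    return math.sqrt(luminosity_w / (4 * math.pi * brightness_w_m2))


STAR_LUMINOSITY = 1.0e28  # hypothetical luminosity, in watts
STAR_DISTANCE = 1.0e17    # hypothetical distance d, in metres

b_near = apparent_brightness(STAR_LUMINOSITY, STAR_DISTANCE)
b_twice = apparent_brightness(STAR_LUMINOSITY, 2 * STAR_DISTANCE)
b_four = apparent_brightness(STAR_LUMINOSITY, 4 * STAR_DISTANCE)

print(f"{b_twice / b_near:.4f}")  # 0.2500
print(f"{b_four / b_near:.4f}")   # 0.0625
```

The second function shows why the luminosity must be known: only then can a measured apparent brightness be turned into a distance.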
Law to find the temperature of a star from its spectrum. If we know its temperature and its luminosity then its radius can be found from the Stefan law. This is shown in the example below.

Example

The wavelength maximum in the spectrum of Betelgeuse is $9.6 \times 10^{-7}$ m. The luminosity of Betelgeuse is $10^{4}$ times the luminosity of the Sun. Estimate the surface temperature of Betelgeuse and also its radius in terms of the radius of the Sun.

Solution

From Wien's law, $T = \dfrac{2.9 \times 10^{-3}}{\lambda_{max}} = \dfrac{2.9 \times 10^{-3}}{9.6 \times 10^{-7}} \approx 3000$ K.

From the Stefan law, $\dfrac{L_{B}}{L_{S}} = \dfrac{R_{B}^{2} T_{B}^{4}}{R_{S}^{2} T_{S}^{4}} = 10^{4}$. Given that the surface temperature of the Sun is 5800 K, substitution gives the radius of Betelgeuse to be 370 times that of the Sun.

Stellar spectra

If a sufficiently high potential is applied between the anode and cathode of a discharge tube that contains a small amount of mercury vapour, the tube will glow. This is the basis of fluorescent lighting tubes. We can arrange for the radiation emitted from the tube to pass through a slit and hence onto a dispersive medium such as a prism or diffraction grating. The radiation can then be focused onto a screen. Images of the slit will be formed on the screen for every wavelength present in the radiation from the tube. Unlike an incandescent source, the mercury source produces a discrete line spectrum and a continuous spectrum in the ultraviolet region. In the visible region, mercury produces three distinct lines: red, green and blue. The wavelength of these lines is unique to mercury. In fact every element can be identified by its characteristic line spectrum. See Figure 1604.

[Figure 1604: line spectrum with a wavelength scale marked 350, 400, 450; lines labelled blue, green and red]

Figure 1604 An example of atomic emission

We can alter the arrangement described above such that we now shine radiation from an incandescent source through the tube containing the mercury vapour when there is no potential difference across the tube. The radiation that has passed through the tube is analysed as above. On the screen we will now observe a continuous spectrum which has three dark lines crossing it. Each line will correspond to the blue, red and green emission lines of mercury. This is what is called an absorption spectrum.

Absorption spectra can be used to find out the chemical composition of the atmosphere of a star. The continuous spectrum from a star will be found to contain absorption lines. These lines are formed as the radiation from the surface of the star passes through the cooler, less dense upper atmosphere of the star. The absorption lines will correspond to the emission lines of the elements in the upper atmosphere of the star. Different stars have different spectra. In some stars the lines corresponding to the visible hydrogen spectrum are prominent. Other stars, like our Sun, have lines corresponding to the emission lines of elements such as iron, sodium and calcium.
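Returning to the Betelgeuse example, the two steps of the solution (Wien's law for the temperature, then the Stefan law for the radius) can be written out as a short calculation. This is a sketch using the Wien constant $2.9 \times 10^{-3}$ m K and a solar surface temperature of 5800 K; the function names are chosen only for illustration.

```python
import math

WIEN_CONSTANT = 2.9e-3  # Wien displacement constant, in m K


def surface_temperature(peak_wavelength_m):
    """Surface temperature from the wavelength of maximum intensity."""
    return WIEN_CONSTANT / peak_wavelength_m


def radius_ratio(luminosity_ratio, temperature_k, reference_temperature_k):
    """R / R_ref from L / L_ref = (R / R_ref)^2 * (T / T_ref)^4."""
    return math.sqrt(luminosity_ratio) * (reference_temperature_k / temperature_k) ** 2


temperature = surface_temperature(9.6e-7)
ratio = radius_ratio(1e4, temperature, 5800)
print(f"T = {temperature:.0f} K")      # T = 3021 K
print(f"R / R_sun = {ratio:.0f}")      # R / R_sun = 369
```

To two significant figures these are the values quoted in the solution, about 3000 K and about 370 solar radii.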
However, if the temperature is about 9000 K the photons will cause excitation in the hydrogen atoms rather than ionisation. Hence, in these stars the spectrum will show strong hydrogen absorption lines. At temperatures above 30,000 K the photons have sufficient energy to produce singly ionised helium and the spectrum of singly ionised helium is different from that of neutral helium. Hence, if a stellar absorption spectrum has lines corresponding to the emission lines of singly ionised helium, we know that the star has a surface temperature in excess of 30,000 K. For every element there is a characteristic temperature range of the source which produces strong absorption lines. When the effects of temperature are taken into account we find that the composition of all stars is essentially the same, about 74% hydrogen, 25% helium and 1% other elements. It should also be noted that stellar absorption spectra are another check as to the value for the surface temperature of a star which can be deduced from Wien's law. Figure 1605 below summarises the spectral classes.

Spectral class   Approximate temperature range / K   Colour         Main absorption lines   Example
O                30000-50000                         Blue violet    Ionised helium          Mintaka
B                10000-30000                         Blue white     Neutral helium          Rigel
A                7500-10000                          White          Hydrogen                Sirius A
F                6000-7500                           Yellow white   Ionised metals          Canopus
G                5000-6000                           Yellow         Ionised calcium         Sun
K                3500-5000                           Orange         Neutral metals          Aldebaran
M                2500-3500                           Red orange     Titanium oxide          Betelgeuse

Figure 1605

White Dwarfs
These are stars that are much smaller than the Sun (typically about the volume of the Earth) and have a much higher surface temperature. The white dwarf, Sirius B, has a surface temperature of 20,000 K.

Neutron stars
These are stars which have undergone gravitational collapse to such an extent that their core is effectively made up of just neutrons.

Supernovae
When the core of a star can collapse no further, the outer layers, which are still falling rapidly inwards, will be reflected back causing an enormous shock wave. This shock wave will in turn tear much of the surface of the core away in a colossal explosion. The star has become a supernova. In 1987 the star SK -69 202 in the Large Magellanic Cloud went supernova and for a brief instant of time its brilliance was greater than that of the whole Universe by a factor of 100.

Pulsars
These are rotating neutron stars. As they rotate they emit beams of electromagnetic radiation (usually of radio frequencies) essentially from the poles of the star. Each time a pole lines up with the Earth a pulse of radiation will be detected at the Earth. In 1968 a pulsar was detected in the Crab nebula which pulses about 30 times per second (a period of 33 ms).
Black Holes
It has been suggested that certain stars that undergo gravitational collapse will reach a density and radius such that the gravitational field at the surface of the star is strong enough to prevent electromagnetic radiation from escaping from the surface. Such stars will not emit any light and are therefore said to be black holes.
Red Giants
These are stars that are considerably larger than our Sun and have a much lower surface temperature than our Sun. The red supergiant Betelgeuse has a diameter about equal to the distance of Jupiter from the Sun and a surface temperature of about 3000 K.
Binary stars
Many stars that appear to the naked eye to be a single point of light actually turn out to be two stars rotating about a common centre. Sirius, the brightest star as seen from Earth, is in fact a binary star consisting of Sirius A and Sirius B. Sirius A is a main sequence star (see below) and Sirius B is a white dwarf (see above).
Eclipsing binaries
Some binary stars cannot be resolved visually as two separate stars. However, the binary nature of the system can be deduced from the fact that the stars periodically eclipse each other. The orientation of the orbit of the stars with respect to the Earth is such that as the stars orbit each other, one will block light from the other. As seen from Earth the brightness of the system will vary periodically. This variation in brightness yields information as to the ratio of the surface temperature of the stars and also the relative size of the stars and the size of their orbit.
Cepheid variables
These are stars whose luminosities vary regularly, generally with a period of several days.
Spectroscopic binaries
The binary nature of a system can, in many cases, be deduced from its spectrum. Figure 1606 shows the observed spectrum of a possible binary system taken at different times.
[Figure 1606: The spectrum of a binary system. Spectra A, B and C were taken on day 1, day 12 and day 23 respectively.]
The two spectra A and C are identical. However, corresponding to each line in these spectra there are two lines in spectrum B. One of the lines is of a slightly longer wavelength than the corresponding line in A and C and the other is of a slightly shorter wavelength. They are red-shifted and blue-shifted respectively. We can see how this comes about by looking at Figure 1607.
[Figure 1607: The positions of stars X and Y in their orbit, as seen from Earth, on day 1, day 12 and day 23.]
The two stars X and Y are orbiting each other as shown. The stars are of the same spectral class and have identical spectra. On day 1 the orbital plane is aligned with respect to the Earth in such a way that star Y completely blocks star X and so only the spectrum of star Y is observed. On day 12 star Y is moving away from the Earth and star X is moving towards the Earth. Because of the Doppler effect (see Chapter 6) the light from Y as observed on Earth will be red-shifted. Similarly, because X is moving towards the Earth, light from this star will be observed to be blue-shifted. Hence spectrum B above will be observed. On day 23 star X completely blocks star Y so only the spectrum from star X is observed.
This is an ideal situation and, although it does occur, it is more the exception than the rule. In many systems the stars are not of the same spectral class and so the spectra A and C will not be the same. However, spectrum B will still show red and blue shift. In other systems one of the stars might be so dim that its spectrum cannot be detected on Earth. However, the single spectrum will shift back and forth as the two stars orbit each other. Such systems are called single-line spectroscopic binaries.
A spectroscopic binary might also be an eclipsing binary. These systems, although not common, are very useful since it is possible to calculate the mass and radius of each star from the information that such systems give.
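The size of the red and blue shifts tells us how fast each star is moving along the line of sight. The sketch below is only an illustration: it assumes the non-relativistic Doppler relation (shift in wavelength divided by rest wavelength equals v/c), and the wavelengths used are hypothetical values, not measurements from the text.

```python
C = 3.00e8  # speed of light in m/s


def radial_speed(rest_wavelength, observed_wavelength):
    """Line-of-sight speed in m/s from the non-relativistic Doppler formula.

    Positive means the source is receding (red-shift),
    negative means it is approaching (blue-shift).
    """
    return C * (observed_wavelength - rest_wavelength) / rest_wavelength


# Hypothetical example: a line at 656.30 nm in spectra A and C appears as
# two lines at 656.35 nm (star Y) and 656.25 nm (star X) in spectrum B.
v_y = radial_speed(656.30e-9, 656.35e-9)
v_x = radial_speed(656.30e-9, 656.25e-9)
print(f"Star Y: {v_y:.0f} m/s, star X: {v_x:.0f} m/s")
```

With these made-up numbers each star moves at roughly 23 km/s, one away from the Earth and one towards it.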
[Figure 1608: A typical Hertzsprung-Russell diagram. Vertical axes: absolute magnitude (-10 to +15) and luminosity relative to the Sun ($3.9 \times 10^{26}$ W), from 0.001 to 100 000. Horizontal axis: spectral class (O, B, A, F, G, K, M) with surface temperature falling from 50 000 K on the left. Regions labelled: supergiants, giants, main sequence, white dwarfs.]
E.2.11 IDENTIFY THE GENERAL REGIONS OF STAR TYPES ON A HERTZSPRUNG RUSSELL (HR) DIAGRAM
The Hertzsprung-Russell Diagram
In 1911 the Danish astronomer Ejnar Hertzsprung noticed that a regular pattern is produced if the absolute magnitude (or luminosity) of stars is plotted against their colour (surface temperature). Absolute magnitude is defined in the next section. Two years later the American astronomer Henry Russell discovered a similar pattern if the luminosity is plotted against spectral class (effectively another plot against temperature). In recognition of the work of these two men, such diagrams are called Hertzsprung-Russell diagrams. A typical H-R diagram is shown in Figure 1608. You should note that neither the luminosity scale nor the temperature scale is linear; both are in fact log scales (absolute magnitude is itself a logarithmic measure of luminosity). For historical reasons the temperature scale is plotted from high to low.
The striking feature of this diagram is that the stars are grouped in several distinct regions, with a main diagonal band that contains the majority of stars. For this reason stars which lie in this band are called main sequence stars. The sequence runs from large luminosity and high surface temperature (top left-hand corner) to small luminosity and low surface temperature (bottom right-hand corner). All stars in the main sequence derive their energy from hydrogen burning (fusion) in the core of the star.
There is another grouping of stars towards the top right-hand corner that have a large luminosity and relatively low surface temperature. To have such a large luminosity at low surface temperatures means that these stars must be huge. For this reason they are called giant stars. Cooler members of this class have a distinctive red appearance and are therefore called red giants. A few stars at low surface temperatures have a very large luminosity (we have already met one such star, Betelgeuse, in the constellation of Orion) and these are called supergiants.
There is another grouping of stars towards the bottom left-hand corner that have a low luminosity but very high surface temperatures. This means these stars are relatively small (typically the size of the Earth) and because of the low luminosity they are called white dwarfs.
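The argument that a cool but luminous star must be huge follows from the Stefan-Boltzmann law, $L = 4\pi R^2 \sigma T^4$. Dividing by the same expression for the Sun gives $R/R_\odot = \sqrt{L/L_\odot}\,(T_\odot/T)^2$. The sketch below is a rough illustration of this ratio; the Sun's surface temperature of 5800 K is an assumed value, and the function name is made up for the example.

```python
import math


def radius_in_solar_radii(luminosity_ratio, temperature, sun_temperature=5800.0):
    """Radius relative to the Sun from L = 4*pi*R^2*sigma*T^4.

    luminosity_ratio is L / L_sun; temperatures are in kelvin.
    The default solar temperature (5800 K) is an assumption.
    """
    return math.sqrt(luminosity_ratio) * (sun_temperature / temperature) ** 2


# A supergiant like Betelgeuse: about 10 000 times the Sun's luminosity
# at a surface temperature of about 3000 K.
giant = radius_in_solar_radii(1.0e4, 3000.0)
print(f"Supergiant radius: about {giant:.0f} solar radii")
```

With these values the star comes out several hundred times larger than the Sun, which is why the top right-hand corner of the diagram is populated by giants and supergiants.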
Parallax method
E.3.1 Define the parsec.
E.3.2 Describe the stellar parallax method of determining the distance to a star.
E.3.3 Explain why the method of stellar parallax is limited to measuring stellar distances less than several hundred parsecs.
E.3.4 Solve problems involving stellar parallax.
IBO 2007

Introduction
Astronomical distances are vast. For example, it was quoted that the Sun is 25,000 light years (ly) from the centre of our galaxy. This is of the order of $10^{20}$ m. The distance to our nearest star is of the order of $10^{17}$ m and the distance to the quasar 3C273 is of the order of $10^{25}$ m. How are such vast distances measured? Let us first of all look at some of the units that are used in astronomical measurements.

The parsec
A useful astronomical distance is the average distance between Earth and the Sun, the astronomical unit (AU). 1 AU = $1.50 \times 10^{11}$ m. We define the parsec in terms of the AU. A line of length 1 AU subtends an angle of 1 arcsecond (one second of arc) at a distance of one parsec. In Figure 1609 the object P is 1 pc away from the line AB. Since the angle is so small the lines AP, BP and CP can all be considered to be of the same length.
[Figure 1609: A line of length 1 AU subtending an angle of 1 arcsecond at a distance of 1 parsec.]
Figure 1610 shows the relationship between the units of distance.
1 AU = $1.496 \times 10^{11}$ m
1 ly = $9.46 \times 10^{15}$ m
1 ly = 63 240 AU
1 pc = $3.086 \times 10^{16}$ m
1 pc = 3.26 ly
1 pc = 206 265 AU
Figure 1610 The relationships between units of distance

Exercise 16.3 (a)
1. Using the value for 1 AU given above and taking 1 year = $3.2 \times 10^7$ s, verify the conversions between parsecs and light years given in the above table.
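The definition of the parsec can be turned into a short numerical check. The sketch below is an illustration only: it uses the small-angle approximation (distance = 1 AU divided by the angle in radians) and assumes $c = 3.00 \times 10^8$ m/s and a year of $3.156 \times 10^7$ s, neither of which is taken from the table.

```python
import math

AU = 1.496e11                        # astronomical unit in metres
ARCSECOND = math.radians(1 / 3600)   # one second of arc in radians

# Small-angle approximation: 1 AU subtends 1 arcsecond at 1 parsec.
PARSEC = AU / ARCSECOND              # metres

# Assumed values for the light year (not taken from the table).
SPEED_OF_LIGHT = 3.00e8              # m/s
SECONDS_PER_YEAR = 3.156e7           # s
LIGHT_YEAR = SPEED_OF_LIGHT * SECONDS_PER_YEAR

print(f"1 pc = {PARSEC:.3e} m")
print(f"1 pc = {PARSEC / AU:.0f} AU")
print(f"1 pc = {PARSEC / LIGHT_YEAR:.2f} ly")
```

Changing the assumed length of the year changes the last figure slightly, which is worth bearing in mind when comparing with tabulated values.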
In using the method of parallax to measure stellar distances we must recognise the fact that the Earth is moving through space. When you look out of the window of a moving train or car, objects close to you move past you very rapidly but it takes a long time for the distant landscape to change. In the same way, the stars which are a great distance from the Earth (in astronomical terms) appear to keep their position, whereas the nearer stars (and the planets) appear to move against this background of the so-called fixed stars as the Earth orbits the Sun. For this reason, astronomers regard the fixed stars as a reference against which to measure the direction of stars that are closer to the Earth. The position of these nearer stars relative to the fixed stars will depend on the point in the Earth's orbit about the Sun from which they are observed. (Incidentally, if we regard the Earth as fixed in space then the fixed stars appear to rotate about the Earth.) Figure 1611 shows how we can use the parallax method to measure the distance from Earth of near stars.
[Figure 1611: The parallax method. The Earth is shown in June and in December, 1 AU either side of the Sun. The star is at a distance d and p is the parallax angle.]
The Earth is shown at two different points in its orbit about the Sun separated by a time period of six months. The angular position (p) of the star is measured against the fixed stars both in December and June. Clearly we have
$\tan p = \frac{1\ \text{AU}}{d}$
However, d is very much larger than 1 AU so the angle p is very small and therefore $\tan p \approx p$ (with p in radians). Therefore, we have
$p \approx \frac{1\ \text{AU}}{d}$, or $d \approx \frac{1\ \text{AU}}{p}$

Example
The parallax angle for the star Sirius A is 0.37 arcsecond. Calculate the distance of the star from Earth in
(a) metres
(b) parsecs
(c) astronomical units
(d) light years

Solution
(a) 1 arcsecond = $\frac{1}{3600}$ degrees of arc. Hence p = 0.37 arcsecond = $\frac{0.37}{3600} = 1.03 \times 10^{-4}$ degrees = $1.03 \times 10^{-4} \times \frac{2\pi}{360}$ rad = $1.8 \times 10^{-6}$ rad. 1 AU = $1.5 \times 10^{11}$ m, hence $d = \frac{1.5 \times 10^{11}}{1.8 \times 10^{-6}} = 8.36 \times 10^{16}$ m (using the unrounded value of p).
(b) If the parallax angle of a star is 1 arcsecond then it is said to be at a distance of 1 pc and hence: $d = \frac{1}{p\ (\text{measured in arcseconds})} = \frac{1}{0.37} = 2.7$ pc.
(c) 1 pc = $2.06 \times 10^5$ AU. Therefore $d = 2.7 \times 2.06 \times 10^5 = 5.6 \times 10^5$ AU.
(d) 1 pc = 3.26 ly. Therefore $d = 2.7 \times 3.26 = 8.8$ ly.
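The parallax relation is simple enough to wrap in a small helper. This is a minimal sketch, assuming the angle is given in arcseconds and using the conversion factors from Figure 1610; the function name is hypothetical.

```python
PC_IN_METRES = 3.086e16   # 1 pc in metres (Figure 1610)
PC_IN_AU = 206265         # 1 pc in astronomical units
PC_IN_LY = 3.26           # 1 pc in light years


def parallax_distance_pc(p_arcsec):
    """Distance in parsecs for a parallax angle given in arcseconds."""
    if p_arcsec <= 0:
        raise ValueError("parallax angle must be positive")
    return 1.0 / p_arcsec


# Sirius A, with a parallax angle of 0.37 arcsecond.
d_pc = parallax_distance_pc(0.37)
print(f"{d_pc:.1f} pc")
print(f"{d_pc * PC_IN_METRES:.2e} m")
print(f"{d_pc * PC_IN_AU:.1e} AU")
print(f"{d_pc * PC_IN_LY:.1f} ly")
```

Because the distance is the reciprocal of the angle, very small angles (distant stars) give distances that are extremely sensitive to measurement error, which is why the method is limited to relatively nearby stars.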
Our nearest star, Alpha Centauri, is 4.3 light years from Earth. Calculate the value of the parallax angle that gives this distance.

Hence a magnitude 1 star is 2.512 times as bright as a magnitude 2 star. That is, if the difference between the apparent magnitudes of two stars is 1, the ratio of their apparent brightness is 2.512. Hence a star that has an apparent magnitude of zero will be 2.512 times as bright as a star of apparent magnitude 1. If the difference between the apparent magnitudes is 2 then the ratio of the apparent brightness will be $(2.512)^2 = 6.31$, and if the difference between the apparent magnitudes is 3 the ratio of the brightness will be $(2.512)^3 = 15.85$. In general, therefore, if we have two stars, one of apparent magnitude $m_1$ and the other of magnitude $m_2$, the ratio $\frac{b_1}{b_2}$ of their apparent brightness is given by:
$\frac{b_1}{b_2} = 2.512^{\,m_2 - m_1}$

E.3.8
or $m - M = 5 \log d - 5$ (with d measured in parsecs)
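The rule that each step of one magnitude corresponds to a factor of 2.512 in brightness can be sketched in a few lines. The function name below is hypothetical and the example is only an illustration of the relation described above.

```python
def brightness_ratio(m1, m2):
    """Ratio b1/b2 of apparent brightness for apparent magnitudes m1 and m2.

    A smaller (or more negative) magnitude means a brighter star.
    """
    return 2.512 ** (m2 - m1)


print(brightness_ratio(1, 2))   # one magnitude apart
print(brightness_ratio(0, 5))   # five magnitudes apart, very nearly 100
```

A difference of five magnitudes corresponds to a factor of very nearly 100 in brightness, which is where the number 2.512 (the fifth root of 100) comes from.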
The nearest star to the Sun is Alpha Centauri at a distance of 1.3 pc. With an apparent magnitude of 0.1, Alpha Centauri has an absolute magnitude of 4.5. This means that the Sun and Alpha Centauri have very nearly the same luminosity. Betelgeuse, on the other hand, at a distance of 130 pc has an apparent magnitude of +0.50, which gives an absolute magnitude of -5.14. This means that if Betelgeuse were at 10 pc from the Earth its brightness would be $(2.512)^{9.96}$ times that of the Sun at the same distance. (The difference in the absolute magnitudes is 9.96.) This means that Betelgeuse has a luminosity some 10 000 times that of the Sun.
Example
Solution
The apparent magnitude of the Sun is -26.7.
The distance of the Sun from Earth = 1 AU = $4.9 \times 10^{-6}$ pc.
Therefore $m - M = 5 \log(4.9 \times 10^{-6}) - 5$, or $m - M = -26.5 - 5$,
to give $M = +4.8$.
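The relation $m - M = 5 \log d - 5$ can be rearranged to give the absolute magnitude directly. The following is a minimal sketch, assuming d is in parsecs and the logarithm is to base 10; the function name is made up for the example.

```python
import math


def absolute_magnitude(apparent_magnitude, distance_pc):
    """Absolute magnitude M from m - M = 5 log10(d) - 5, with d in parsecs."""
    return apparent_magnitude - 5 * math.log10(distance_pc) + 5


# The Sun: m = -26.7 at a distance of 1 AU = 4.9e-6 pc.
print(f"Sun: M = {absolute_magnitude(-26.7, 4.9e-6):+.1f}")

# Alpha Centauri: m = 0.1 at a distance of 1.3 pc.
print(f"Alpha Centauri: M = {absolute_magnitude(0.1, 1.3):+.1f}")
```

A star at exactly 10 pc has equal apparent and absolute magnitudes, which is a quick sanity check on the formula.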
Example
The apparent magnitude of the Sun is -26.7 and that of Betelgeuse 0.50. Calculate how much brighter the Sun is than Betelgeuse.

Solution
We have $m_2$ for Betelgeuse = 0.50 and $m_1$ for the Sun = -26.7, so the difference in the apparent magnitudes is $m_2 - m_1 = 27.2$.
We therefore have that the ratio $\frac{b_1}{b_2}$ of the brightness of the Sun to that of Betelgeuse is $2.512^{27.2} = 7.6 \times 10^{10}$. That is, the Sun is approximately 80 000 million times brighter than Betelgeuse (if both of them are viewed from the Earth).

Exercise 16.3 (c)
The apparent magnitude of the Andromeda galaxy is 4.8 and that of the Crab Nebula is 8.4. Determine which of these is the brighter and by how much.

Spectroscopic parallax
E.3.9 State that the luminosity of a star may be estimated from its spectrum.
E.3.10 Explain how stellar distance may be determined using apparent brightness and luminosity.
E.3.11 State that the method of spectroscopic parallax is limited to measuring stellar distances less than about 10 Mpc.
E.3.12 Solve problems involving stellar distances, apparent brightness and luminosity.
IBO 2007
E.3.9,10
The star Regulus in the constellation of Leo has an apparent brightness of $5.2 \times 10^{-12}$ that of the Sun and a luminosity 140 times that of the Sun. If the distance from the Earth to the Sun is $4.9 \times 10^{-6}$ pc, how far from the Earth is Regulus?
Solution
From the inverse square law we know:
$b = \frac{L}{4\pi d^2}$
$\frac{b_{sun}}{b_{reg}} = \frac{L_{sun}}{L_{reg}} \times \frac{d_{reg}^2}{d_{sun}^2}$
From which:
$d_{reg}^2 = \frac{L_{reg}}{L_{sun}} \times \frac{b_{sun}}{b_{reg}} \times d_{sun}^2 = 140 \times \frac{1}{5.2 \times 10^{-12}} \times (4.9 \times 10^{-6})^2$
To give $d_{reg} = 25$ pc.
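The same inverse square law calculation can be written as a short function that works in ratios, so that no absolute values of luminosity or brightness are needed. This is a sketch only; the function and parameter names are hypothetical.

```python
import math


def distance_from_ratios(luminosity_ratio, brightness_ratio, reference_distance):
    """Distance of a star from b = L / (4*pi*d^2), using a reference star.

    luminosity_ratio   = L_star / L_ref
    brightness_ratio   = b_star / b_ref
    reference_distance = distance of the reference star (any unit);
                         the result is in the same unit.
    """
    return reference_distance * math.sqrt(luminosity_ratio / brightness_ratio)


# Regulus compared with the Sun (distance to the Sun = 4.9e-6 pc).
d_regulus = distance_from_ratios(140, 5.2e-12, 4.9e-6)
print(f"Regulus is about {d_regulus:.0f} pc away")
```

Working in ratios means the factor of $4\pi$ cancels, just as it does in the worked solution.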
Cepheid variables
E.3.13 Outline the nature of a Cepheid variable.
E.3.14 State the relationship between period and absolute magnitude for Cepheid variables.
E.3.15 Explain how Cepheid variables may be used as standard candles.
E.3.16 Determine the distance to a Cepheid variable using the luminosity-period relationship.
IBO 2007
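[Figure 1612: The position of the Cepheids on the HR diagram. Vertical axis: luminosity / $L_\odot$ (100 to 100,000). Horizontal axis: surface temperature / K (40,000 to 5,000). The Cepheids lie in the instability strip, above the main sequence.]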
CEPHEID VARIABLES
In 1784 the amateur astronomer John Goodricke noted that the luminosity of the star Delta Cephei varied regularly. He recorded the apparent magnitude as reaching a maximum of 4.4 and then falling to a minimum of 3.5 in four days, rising to the maximum again in the following 1.5 days. We now know that this periodic change in luminosity is due to the outer layers of the star undergoing periodic contractions and expansions. Many other such variable stars have since been discovered and they are given the general name Cepheid variables. There are, however, two types of Cepheid, imaginatively called Type-I and Type-II. The position of the Cepheids on the HR diagram is shown in Figure 1612.
STANDARD CANDLES
Cepheid variables are extremely important in determining distances to galaxies. This is because they are extremely luminous (typically $10^4\ L_\odot$) and therefore relatively easily located, and also because of the so-called period-luminosity relationship. The American astronomer Henrietta Leavitt showed that there is actually a linear relationship between the luminosity and period of Cepheid variables. The graph in Figure 1613 shows the period-luminosity relationship for Type-II Cepheids.
[Figure 1613: The period-luminosity relation for Cepheids. Vertical axis: luminosity as a power of 10 (Sun = 1).]
When a Cepheid is located then, by measuring the period, we can determine its luminosity from the period-luminosity relationship. If the apparent brightness of the star is measured then its distance can be computed from the inverse square law. Cepheid variables act as a sort of standard candle inasmuch as they can be used to check distance measurements made using parallax and/or spectroscopic parallax. However, beyond about 60 Mpc Cepheids are too faint and measurement of their period becomes unreliable. Other techniques have to be used to measure distances greater than 60 Mpc. A summary is given at the end of this chapter outlining these techniques.
Summary
The two flow charts in Figures 1614 and 1615 summarise the ideas that we have met so far in this chapter as to how we gain information about the nature of stars and their distance from Earth.
[Figure 1614: Flow chart. Labels recovered: distance measurement by parallax ($d = \frac{1}{p}$); apparent brightness $b$; spectrum; chemical composition of corona; luminosity ($L = 4\pi b d^2$); Stefan-Boltzmann law ($L = 4\pi R^2 \sigma T^4$); radius.]
[Figure 1615: Flow chart. Labels recovered: Cepheid variable; period; luminosity class; spectral type; chemical composition; HR diagram; luminosity ($L$); $b = \frac{L}{4\pi d^2}$; distance ($d$); Stefan-Boltzmann law ($L = 4\pi R^2 \sigma T^4$); radius ($R$).]

Example
The star δ-Cephei is 300 pc from the Earth (this distance can be determined by parallax from the Hubble telescope). Another variable star is detected in a distant galaxy that has the same period as δ-Cephei but with an apparent brightness of $10^{-9}$ of that of δ-Cephei. How far is the galaxy from Earth?

Solution
Since the two stars have the same period they have the same luminosity, so from the inverse square law:
$\frac{d_{star}}{d_{\delta}} = \sqrt{\frac{b_{\delta}}{b_{star}}}$
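The standard candle idea can be illustrated numerically for the example above. The sketch assumes that two Cepheids with the same period have the same luminosity, so that apparent brightness falls off purely as the inverse square of distance; the function name is hypothetical.

```python
import math


def standard_candle_distance(known_distance, brightness_fraction):
    """Distance of a star with the same luminosity as a reference star.

    known_distance      = distance of the reference star
    brightness_fraction = b_star / b_reference
    The result is in the same unit as known_distance.
    """
    return known_distance / math.sqrt(brightness_fraction)


# Reference Cepheid at 300 pc; the distant Cepheid appears 1e-9 times as bright.
d_galaxy = standard_candle_distance(300.0, 1e-9)
print(f"Distance to galaxy: about {d_galaxy:.1e} pc")
```

With these numbers the galaxy comes out at roughly 9.5 Mpc, well within the range over which Cepheids can be used.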
E.4.4 DESCRIBE BOTH SPACE AND TIME AS ORIGINATING WITH THE BIG BANG
If all the galaxies are rushing away from each other then it is reasonable to assume that in the past they were much closer together. It is possible to imagine that sometime long ago all the matter in the Universe was concentrated into a smaller volume. An explosion then occurred that threw the matter apart. The prevailing view of the creation of the Universe is that some 10-20 billion years ago all the matter of the Universe was concentrated into a point of infinite density. Then a cataclysmic explosion initiated the expansion of the Universe. The explosion is called the Big Bang. However, we must not think of this as being like an exploding bomb. When a bomb explodes the shrapnel flies off into space. When the Big Bang happened matter did not fly off into space; space and time themselves were created. Before the Big Bang there was no time and there was no space. As the Universe expands more and more space is created. In this sense it is wrong to think of the galaxies rushing away from each other. It is the space in which the galaxies are situated that is expanding.
There is a simple analogy that helps in understanding this and one that you can easily demonstrate for yourself. Partially inflate a toy balloon and then stick little bits of paper on to it at different places. The surface of the balloon represents space and the bits of paper represent galaxies. As you now further inflate the balloon you will see the galaxies move away from each other. The galaxies move even though they are fixed in space because space is expanding. We must not take the balloon analogy too literally since the balloon is expanding into space whereas the expanding Universe is creating space. There is no outside of the Universe into which it expands. So to ask the question "what is outside the Universe?" is just as meaningless as the question "what was there before the Big Bang?" However, the analogy does help us appreciate that there is no centre to the Universe. Any one of the bits of paper can be chosen as a reference point and all other bits of paper will appear to be receding from this reference point. This is exactly the same for the galaxies in our Universe.
The expanding Universe also helps us understand the origin of the red-shift observed from distant galaxies. Strictly speaking this is not due to the Doppler effect since the galaxies themselves are not receding from us; it is the space between them that is expanding. When a photon leaves a galaxy and travels through space, the space through which it travels expands and the wavelength associated with the photon increases. The longer the photon spends on its journey through space the more its wavelength will increase. Hence when the photon reaches our eyes it will be red-shifted, and the further away its origin, the greater will be the red-shift.
produced. The photons would have a black body spectrum corresponding to the then temperature of the Universe. As the Universe expanded and cooled the photon spectrum would also change, with the wavelength of maximum intensity shifting in accordance with Wien's law. It is estimated that, at the present time, the photons should have a maximum wavelength corresponding to a black body spectrum of 3 K.
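To get a feel for the numbers, Wien's law, $\lambda_{max} T = \text{constant}$, can be applied to a 3 K black body. The sketch below assumes the usual value of the Wien displacement constant, $2.90 \times 10^{-3}$ m K, which is not quoted in this passage, and an assumed solar surface temperature of 5800 K for comparison.

```python
WIEN_CONSTANT = 2.90e-3  # m K (assumed standard value)


def peak_wavelength(temperature):
    """Wavelength of maximum intensity (m) for a black body at temperature (K)."""
    return WIEN_CONSTANT / temperature


print(f"3 K background: {peak_wavelength(3.0) * 1e3:.2f} mm")
print(f"5800 K surface (assumed for the Sun): {peak_wavelength(5800.0) * 1e9:.0f} nm")
```

A 3 K spectrum therefore peaks at about a millimetre, in the microwave region, whereas a star like the Sun peaks in the visible.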
E.4.9
E.4.10 Discuss how the density of the universe determines the development of the universe.
E.4.11 Discuss problems associated with determining the density of the universe.
E.4.12 State that current scientific evidence suggests that the universe is open.
E.4.13 Discuss an example of the international nature of recent astrophysics research.
E.4.14 Evaluate arguments related to investing significant resources into researching the nature of the universe.
IBO 2007
consists mainly of hydrogen gas (and some helium) other nebulae can be found that contain dust particles. These particles scatter any starlight and so appear dark against the background of emission nebulae. The most celebrated of the dark nebulae is the Horsehead nebula, so-called because of its distinctive shape. This can be seen against the emission nebula of Orion. (The nebulae associated with Orion's sword are in fact some 450 pc beyond the other two stars in the sword.) A typical dark nebula has a temperature of about 100 K and contains between $10^{10}$ and $10^{19}$ particles. The particles consist of hydrogen (75%), helium (24%) and dust (1%). The dust consists of atoms and molecules of many different elements.
It would seem that the dark nebulae are the birthplaces of the stars. Their temperature is low enough and their density high enough for gravity to pull the individual particles together. As the particles move together under their mutual gravitational attraction they lose gravitational potential energy and gain kinetic energy. In other words the temperature of the system increases, and as the temperature increases ionisation of the molecules will take place and the system will acquire its own luminosity. At this point the so-called protostar is still very large and might have a surface temperature of some 3000 K and therefore has considerable luminosity. A protostar of mass equal to that of the Sun can have a surface area some 5000 times greater than the Sun and be 100 times as luminous. As the gravitational contraction continues the temperature of the core of the protostar continues to rise until it is at a sufficiently high temperature for all the electrons to be stripped from the atoms making up the core.
The core has now become a plasma and nuclear fusion takes place in which hydrogen is converted into helium (sometimes referred to by astronomers as hydrogen burning). The protostar has become a main sequence star on the Hertzsprung-Russell diagram. The nuclear fusion process will eventually stop any further gravitational contraction and the star will have reached hydrostatic equilibrium, in which gravitational pressure is balanced by the pressure created by the nuclear fusion processes. Whereabouts a protostar lands on the main sequence is determined by its initial mass. The greater the initial mass the higher will be the final surface temperature and the greater will be its luminosity. This is illustrated in Figure 1618.
[Figure 1618: The relationship of mass and luminosity. Luminosity plotted against surface temperature, with labels for protostars of mass 0.5, 1, 2, 3, 5, 9 and 15 $M_\odot$ and for the main sequence.]
The more massive a protostar (more than about 4 $M_\odot$) the more quickly its core will reach a temperature at which fusion takes place. (Protostars with a mass of about 15 $M_\odot$ will reach the main sequence in about $10^4$ years whereas protostars with a mass of about 1 $M_\odot$ will take about $10^7$ years.) Its luminosity quickly stabilises but its surface temperature will continue to increase as it further contracts. For protostars with less than this mass, the outer layers are relatively opaque (due to the presence of a large number of negative hydrogen ions) so little energy is lost from the core by radiation. Energy is actually transferred from the core to the surface by convection, ensuring that the surface temperature stays reasonably constant. The luminosity will therefore decrease as the protostar contracts.
Gravitational collapse puts a lower and upper limit on the mass of matter that can form a star. As we saw in a previous section, a protostar with a mass less than about 0.08 $M_\odot$ will not develop the pressure and temperature necessary to initiate nuclear fusion and will contract to a brown dwarf. If a protostar has a mass greater than about 100 $M_\odot$ then the internal pressure created by contraction will overcome the gravitational pressure and vast amounts of matter will be ejected from the outer layer of the protostar, thereby disrupting the evolution of the star.
of the carbon in living tissue originated in the core of stars like the Sun in their death throes. When all the helium in the core has been used up, the core further contracts and its temperature rises such that the energy radiated from the core will now cause helium burning in the outer layers. The Sun has entered a second red giant phase. When it enters this phase its outer layers will reach out and engulf Earth, and it will have a luminosity some 10,000 times that of its present luminosity. When it enters this phase it undergoes bursts of luminosity in which a shell of its outer layers is ejected into space. As the Sun ejects its outer layers its very hot core will be exposed. This core will have a surface temperature of about 100,000 K and the radiation that it emits will ionise the outer gas layers causing them to emit visible radiation producing an, inappropriately named, planetary nebula. The radius of the core will be about that of the Earth and with no fusion reaction taking place within the core it will just simply cool down. The Sun has become a white dwarf star, and as it continues to cool it will eventually fade from sight.
$L \propto M^{3.5}$
However, it must be borne in mind that this is an average relationship and that the power n to which M is raised is to some extent mass dependent. Generally n is greater than 3 and less than 4. For example, a star that is 5 times more massive than the Sun will be $5^{3.5}$ (= 280) times more luminous.
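The effect of the exponent n on the predicted luminosity can be explored with a short sketch. The function below is illustrative only; it works in solar units and its name is hypothetical.

```python
def luminosity_in_solar_units(mass_in_solar_masses, n=3.5):
    """Luminosity relative to the Sun from the mass-luminosity relation L ~ M^n."""
    return mass_in_solar_masses ** n


# A star of 5 solar masses, for the average exponent and for the two limits.
for n in (3.0, 3.5, 4.0):
    print(f"n = {n}: L = {luminosity_in_solar_units(5, n):.0f} L_sun")
```

The spread between n = 3 and n = 4 shows why the relation should be treated as an average rather than an exact law.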
[Figure 1619: Labels recovered: luminosity increases; red giant; carbon burning; neon burning; white dwarf.]
Whereas the Chandrasekhar limit applies to main sequence stars, the so-called Oppenheimer-Volkoff limit applies to neutron stars.
Figure 1620
Because of a very large Coulomb repulsion, elements with a proton number of 26 or greater cannot undergo fusion (except if there is an enormous energy input). Iron has a proton number of 26, so when the entire core is iron, fusion within the core must cease. The star has reached a critical state. When the entire inner core is iron it contracts very rapidly and reaches an enormously high temperature (about $6 \times 10^9$ K). The high energy gamma photons emitted at this temperature collide with the iron nuclei, breaking the nuclei into alpha-particles. This takes place in a very short time and in the next fraction of a second the core becomes so dense that negative electrons combine with positive protons, producing neutrons and a vast flux of neutrinos. This flux of neutrinos carries a large amount of energy from the core, causing it to cool and further contract. The rapid contraction produces an outward moving pressure wave. At this point, because of the contraction of the inner core, material from the other shells is collapsing inward. When this material meets the outward moving pressure wave, it is forced back. The pressure wave continues to accelerate as it moves outward until it is moving faster than the speed of sound and becomes a colossal shock wave that rips the material of the star's outer layers apart. The inner core is now exposed and a vast amount of radiation floods out into space. The star has become a supernova.
It has been estimated that some $10^{46}$ J of energy is liberated in such an event and the star loses about 96% of its mass. The energy produced when a star becomes a supernova is sufficiently high to produce all the elements with atomic numbers higher than that of iron. The material that is flung out into space will eventually form dark nebulae from which new stars may be formed. And so the process repeats itself. The core material that is left is thought to contract to form a neutron star or a black hole (see next section).
(On February 23rd, 1987 a supernova was detected in the Large Magellanic Cloud and was so bright that it could be seen in the Southern Hemisphere with the naked eye.) The sequence of the birth and death of stars is summarised in Figure 1621.
[Figure 1621: Labels recovered: super red giant; red giant (carbon-oxygen core); red giant (oxygen-neon core); iron core.]
University detected rapidly varying radio pulses from one particular location in the sky. Since then many more such sources have been discovered, with pulse rates ranging from about 30 pulses per second to about 1 pulse every 1.5 seconds. This is far faster than the pulses from an eclipsing binary or variable star. Nor could the source be a rotating white dwarf since, at such speeds of rotation, the white dwarf would tear itself apart. It is now thought that these so-called pulsars are in fact rotating neutron stars. Neutron stars are by necessity small and therefore, to conserve angular momentum, they must rotate rapidly. Also, supergiants have a magnetic field and as they shrink to a neutron star the magnetic field strength will become very large. As this field rotates with the star it will generate radio waves. Strong electric fields created by the rotating magnetic field could also create electron-positron pairs (see Chapter 20 Option J) and the acceleration of these charges would also be a source of radio waves. The neutron star model is now firmly accepted by astronomers. The Crab nebula in the constellation Taurus has a pulsar close to its centre that pulses about 30 times per second (a period of about 33 ms). It is pretty certain that the nebula is the remains of a supernova.
In the 1960s even stranger objects than quasars were detected which emit strong bursts of gamma radiation. It would seem that these objects are even more distant and more luminous than quasars.
Black holes
We have seen that neutron stars have enormous densities and small radii. This means that the gravitational potential at the surface of such a star will also be enormous. The escape velocity from the surface of a planet or star is related to the gravitational potential at its surface. So what would happen if the potential was such that the escape velocity was equal to the speed of light? It would in fact mean that no electromagnetic radiation could leave the surface of the star; it would have become a black hole. The correct way in which to think of the formation of a black hole is in terms of the General Theory of Relativity (refer to Chapter 19, Option H). This theory predicts that space is warped by the presence of mass. Hence the path of light travelling close to large masses will be curved. Near to a black hole the space is so severely warped that the path of any light leaving the surface will be bent back in on itself. The General Theory also predicts that time slows in a gravitational field. At a point close to a black hole, where the escape velocity just equals the speed of light, time will cease. This point is known as the event horizon of the black hole. When a dying star contracts within its event horizon the entire mass of the star will shrink to a mathematical point at which its density will be infinite. Such a point is known as a singularity. A black hole therefore consists of an event horizon and a singularity. So do black holes exist? We cannot see a black hole; however, its existence might be inferred from the effect that its gravitational field would have on its surroundings. Some stellar objects, such as Cygnus X-1, have been detected as sources of X-ray radiation. Spectroscopic observations revealed that close to the location of Cygnus X-1 is a supergiant with a mass of about 30 solar masses. This star itself cannot be the source of X-ray radiation and it has been surmised that the system Cygnus X-1 is in fact a binary, the companion star being a black hole.
The intense gravitational field of the black hole draws material from the supergiant and as this material spirals into the black hole it reaches a temperature at which it emits X-rays. Other potential candidates for black holes have been found and some theorists think that there is a black hole at the centre of all galaxies including our own Milky Way.
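The escape-velocity argument above can be made numerical. The sketch below is only a Newtonian estimate: it sets the escape speed $\sqrt{2GM/R}$ equal to the speed of light, which gives $R = 2GM/c^{2}$. The function names and the rounded constants are illustrative choices, not values taken from the text.

```python
# Newtonian estimate of the radius at which the escape speed equals c.
# Constants are rounded textbook values (assumed for illustration).
G = 6.67e-11      # gravitational constant, N m^2 kg^-2
C = 3.00e8        # speed of light in free space, m s^-1
M_SUN = 1.99e30   # mass of the Sun, kg


def escape_speed(mass_kg, radius_m):
    """Escape speed from the surface of a body of given mass and radius."""
    return (2 * G * mass_kg / radius_m) ** 0.5


def radius_for_light_speed_escape(mass_kg):
    """Radius at which the escape speed would equal the speed of light."""
    return 2 * G * mass_kg / C**2


r_sun = radius_for_light_speed_escape(M_SUN)
print(f"A one solar mass object: radius of about {r_sun / 1000:.1f} km")
```

On this estimate an object with the mass of the Sun would have to be squeezed to a radius of roughly 3 km before light could no longer escape from it.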
Quasars
In 1944 Grote Reber, an amateur astronomer, detected strong radio signals from the constellations Sagittarius, Cassiopeia and Cygnus. The first two of these sources were found to lie within the Milky Way. However, in 1951 an odd looking galaxy was found to be the source of the Cygnus signals. The galaxy was subsequently called Cygnus A. One of the extraordinary things about this galaxy is that, unlike all hitherto known galaxies, it exhibits an emission spectrum. Furthermore the emission spectrum showed a very large red-shift indicating that the galaxy was some 220 Mpc from the Earth. This was the furthest known object in the Universe at this time, meaning that Cygnus A must be one of the most luminous radio sources in the Universe. Because of its star-like appearance and strong radio emission, Cygnus A was called a quasar (quasi-stellar radio source). To date about 10,000 quasars have been detected and the most distant is some 3600 Mpc from the Earth. Not only are quasars strong radio emitters but they also have enormous luminosities, with the most luminous being some 10,000 times more luminous than the Milky Way.
There are essentially three types of galaxies.

Spiral galaxies

There are many spiral galaxies in the universe and the Milky Way is one of them. All such galaxies are characterised by three main components: a thin disc, a central bulge and a halo. The sketches in Figure 1622 show a side view and plan view of the Milky Way. Andromeda is the nearest spiral galaxy to the Milky Way (about 2.2 million light years away) and is just about visible to the naked eye. If it is viewed with binoculars then its spiral shape can be distinguished. The halo of the Milky Way contains many globular clusters of stars, each containing up to a million stars and seemingly very little interstellar dust or very bright stars. This suggests that the halo contains some of the oldest stars in our galaxy. Their age has been estimated at between 10 and 15 billion years. The bulge at the centre of the Milky Way has the greatest density of stars and many of these would seem to be very hot, young stars. However, direct visible evidence is difficult to collect since the galactic centre contains many dust clouds. Such evidence that we have comes from observations made in the infrared and radio regions of the electromagnetic spectrum. See Figure 1622.

Figure 1622 The Milky Way in side view and plan view (labels: Perseus arm, Cygnus arm, Centaurus arm, Orion arm, Sagittarius arm, Sun, globular clusters, direction of rotation, 25 000 ly)
Irregular galaxies
These galaxies seem to have no specific structure. Our two nearest galactic neighbours are irregular, the Large and Small Magellanic Clouds. The Large Magellanic Cloud lies at a distance of about 160,000 light years from the Milky Way. All galaxies rotate. If they did not, they would collapse under gravitational attraction. Our Sun is about 25,000 light years from the centre of the Milky Way and Doppler shift measurements show it to be moving through space with a speed of 230 km s$^{-1}$. You can use these values to show that the orbital period of rotation of the Milky Way is about $2.0 \times 10^{8}$ years.
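The period quoted above can be checked with a few lines of arithmetic. This is a minimal sketch that assumes the Sun moves in a circle of radius 25,000 light years at a constant 230 km s$^{-1}$; the conversion factors are rounded values and the function name is illustrative.

```python
import math

LIGHT_YEAR_M = 9.46e15      # metres in one light year (rounded)
SECONDS_PER_YEAR = 3.16e7   # seconds in one year (rounded)


def orbital_period_years(radius_ly, speed_km_s):
    """Period of a circular orbit: circumference divided by speed."""
    circumference_m = 2 * math.pi * radius_ly * LIGHT_YEAR_M
    period_s = circumference_m / (speed_km_s * 1e3)
    return period_s / SECONDS_PER_YEAR


period = orbital_period_years(25_000, 230)
print(f"Orbital period of about {period:.1e} years")
```

The result is close to $2 \times 10^{8}$ years, in agreement with the figure given in the text.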
Elliptical galaxies
As the name implies, these galaxies have an elliptical cross-section and no spiral arms. Some are highly elliptical and some are nearly circular.
$$\frac{\lambda'}{\lambda} = \left(\frac{1 + \dfrac{v}{c}}{1 - \dfrac{v}{c}}\right)^{1/2}$$

where $\lambda$ is the emitted wavelength, $\lambda'$ is the observed wavelength, $v$ is the recession speed and $c$ is the free space velocity of light. The expression in parentheses can be written as

$$\frac{1 + \dfrac{v}{c}}{1 - \dfrac{v}{c}} = \frac{\left(1 + \dfrac{v}{c}\right)^{2}}{1 - \dfrac{v^{2}}{c^{2}}}$$

If we consider the situation when $v \ll c$ then we can expand this expression by the binomial theorem and ignore second order and higher terms to get

$$\frac{\lambda'}{\lambda} = 1 + \frac{v}{c}$$

or

$$\frac{\lambda' - \lambda}{\lambda} = \frac{v}{c}$$

such that if the spectral line has been shifted by an amount $\Delta\lambda = \lambda' - \lambda$ then

$$\frac{\Delta\lambda}{\lambda} = \frac{v}{c}$$

Example

The wavelength of the blue line in the spectrum of atomic hydrogen as measured in a laboratory on Earth is 486 nm. In the spectrum from a distant galaxy the wavelength of this line is measured as 498 nm. Estimate the recession speed of the galaxy and its distance from Earth.

Solution

From the red-shift we can calculate the recession speed of the galaxy.

$\Delta\lambda = 498 - 486$ nm $= 12$ nm

Therefore

$$v = \frac{\Delta\lambda}{\lambda}\,c = \frac{12}{486} \times 3.0 \times 10^{8} \approx 7.4 \times 10^{6} \text{ m s}^{-1}$$

The distance then follows from Hubble's law, $v = Hd$. With $H \approx 65$ km s$^{-1}$ Mpc$^{-1}$ this gives $d \approx 110$ Mpc.

The diagram below shows this for a very simple arrangement of five beads on an elastic string. In Figure 1623 (a) the beads are 10 cm apart. In Figure 1623 (b) the string is stretched at a steady rate until the beads are 15 cm apart. Bead E will now be 60 cm from bead A instead of 40 cm and if the stretching of the string took 1 second, the speed of E relative to A will be 20 cm s$^{-1}$. On the other hand bead C will now be 30 cm from bead A instead of 20 cm and its speed relative to A will be 10 cm s$^{-1}$. Doubling the separation between any two beads doubles their relative speed; speed is proportional to separation, which is just Hubble's law.

Figure 1623 Five beads on an elastic string: (a) beads 10 cm apart (b) after stretching
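The bead analogy can be checked with a short calculation. The sketch below takes the five beads at their initial distances from bead A, stretches the string by a factor of 1.5 in 1 second, as in the text, and works out the speed of each bead relative to A. The function name is illustrative.

```python
def relative_speeds(separations_cm, stretch_factor, duration_s):
    """Speed of each bead relative to bead A after a uniform stretch."""
    return [(stretch_factor * d - d) / duration_s for d in separations_cm]


# Distances of beads A, B, C, D, E from bead A before stretching (cm).
initial = [0, 10, 20, 30, 40]
speeds = relative_speeds(initial, 1.5, 1.0)

for d, v in zip(initial, speeds):
    print(f"initial separation {d} cm, relative speed {v} cm/s")
```

Each speed is the same fraction (0.5 per second) of the initial separation, which is the proportionality that the analogy is meant to show.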
Hubble published his discovery in 1929 and Hubble's law can be written $v = Hd$ where $d$ is the distance to the galaxy and $v$ is its recession speed. $H$ is known as Hubble's constant. An accurate value for Hubble's constant is difficult to measure (as will be shown) but an average value is about 65 km s$^{-1}$ Mpc$^{-1}$. For example a galaxy at a distance of 50 Mpc from the Earth will be rushing away from the Earth with a speed of 3250 km s$^{-1}$ (about 0.01 the speed of light). The further away from the Earth, the greater will be the recession speed of a galaxy. The expanding balloon analogy of the expanding universe fits in with the data that enabled Hubble to arrive at his law. If you concentrate on one galaxy you will see that galaxies that are further away move faster than those nearer to the galaxy that you have chosen. If the surface of the balloon is expanding at a constant rate then the speed with which a galaxy moves relative to another galaxy will be proportional to how far away it is from the other galaxy.
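Hubble's law and the red-shift relation for $v \ll c$ lend themselves to a quick numerical check. The sketch below uses the average value of 65 km s$^{-1}$ Mpc$^{-1}$ quoted above; the function names are illustrative. It reproduces the 50 Mpc figure and applies the same steps to the hydrogen line example (486 nm shifted to 498 nm).

```python
C_KM_S = 3.0e5   # speed of light, km s^-1
H0 = 65.0        # Hubble's constant, km s^-1 Mpc^-1 (average value from the text)


def recession_speed(distance_mpc, hubble=H0):
    """Hubble's law: v = H d, with v in km s^-1 and d in Mpc."""
    return hubble * distance_mpc


def distance_from_speed(speed_km_s, hubble=H0):
    """Hubble's law rearranged: d = v / H."""
    return speed_km_s / hubble


def speed_from_redshift(lab_nm, observed_nm):
    """Low-speed Doppler relation: v = c * (change in wavelength) / wavelength."""
    return C_KM_S * (observed_nm - lab_nm) / lab_nm


v50 = recession_speed(50)
v_gal = speed_from_redshift(486, 498)
d_gal = distance_from_speed(v_gal)
print(f"50 Mpc: {v50:.0f} km/s; example galaxy: {v_gal:.0f} km/s, {d_gal:.0f} Mpc")
```

Because the result scales inversely with $H$, any uncertainty in Hubble's constant carries straight through to the estimated distance.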
Hubble was the first astronomer to settle a long standing debate as to the nature of galaxies. It had been known since 1845 (thanks to the then most powerful telescope in the world, built by William Parsons) that some of the nebulae showed a spiral structure. Parsons himself suggested that such nebulae could be island universes far beyond the Milky Way. In the 1920s opinions were still divided as to their nature, with some astronomers of the opinion that they were relatively small objects scattered about the Milky Way. In 1923 Hubble took a photograph of the Andromeda nebula which on close examination showed a bright object that he recognised as a Cepheid variable. From measurements of the variable's luminosity he was able to show that the Andromeda nebula is 900 kpc from the Earth and that it has a diameter of some 70 kpc, a diameter much greater than that of the Milky Way. The debate was settled. The universe was far greater in size than had previously been thought and contained many galaxies, some much larger than our own Milky Way.
Bearing in mind that 1 Mpc = $3 \times 10^{19}$ km and 1 year = $3 \times 10^{7}$ s this gives an age between 10 and 20 billion years.
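The conversion can be sketched in code. The estimate below assumes that the age of the Universe is approximately $1/H$ (an assumption stated here because the preceding derivation is not reproduced in this excerpt) and uses the two rounded conversion factors quoted above. The function name is illustrative.

```python
KM_PER_MPC = 3e19          # kilometres in one megaparsec (value from the text)
SECONDS_PER_YEAR = 3e7     # seconds in one year (value from the text)


def hubble_time_years(hubble_km_s_mpc):
    """Estimate of the age of the Universe as 1/H, converted to years."""
    seconds = KM_PER_MPC / hubble_km_s_mpc
    return seconds / SECONDS_PER_YEAR


age_65 = hubble_time_years(65)
print(f"H = 65 km/s/Mpc gives an age of about {age_65:.1e} years")
```

With $H = 65$ km s$^{-1}$ Mpc$^{-1}$ the estimate falls inside the range of 10 to 20 billion years quoted in the text.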
Exercise
16.6 (a)
Estimate the age of the Universe for a value of the Hubble constant of 80 km s$^{-1}$ Mpc$^{-1}$.
At the beginning of this chapter it was mentioned that astrophysics contains all of physics. This is certainly apparent when trying to understand the beginnings of the Universe. Attempts to understand the first few moments of creation sometimes seem akin to mediaeval mysticism: how many angels can be put on a pinhead? Nonetheless attempts are made to understand the origins of the Universe in terms of the fundamental forces that govern the behaviour of matter. But much is conjecture and open to debate. In light of the scientific method many of the theories put forward are difficult and sometimes, as yet, impossible to verify experimentally. We stand, as it were, on the frontiers of ignorance.

THIS THEN IS AN OUTLINE OF THE FIRST FEW MOMENTS OF CREATION AS PORTRAYED BY PHYSICISTS OF TODAY

In the first $10^{-43}$ s after the Big Bang the four fundamental interactions, gravity, weak force, electromagnetic force and strong force were all unified. At $10^{-43}$ s (the temperature being $10^{32}$ K) gravity appeared as a separate force and at $10^{-35}$ s (temperature $10^{27}$ K) the strong nuclear interaction separated from the electromagnetic and weak interaction. Between $10^{-35}$ s and $10^{-24}$ s the young Universe underwent a rapid expansion, increasing its size by a factor of $10^{50}$. This is known as the Inflationary Epoch. This rapid inflation allowed matter and anti-matter to become sufficiently separated to stop mutual annihilation. During this period a so-called symmetry breaking took place in which the number of particles present outnumbered the number of anti-particles present. At $10^{-12}$ s after the Big Bang and at a temperature of $10^{12}$ K, the electromagnetic force separated from the weak interaction. At $10^{-6}$ s the temperature had dropped sufficiently ($10^{13}$ K) for individual neutrons and protons to exist. Prior to this the temperature was too high for even these particles to be stable, for as soon as they came into existence collisions would fragment them into quarks. At about 2 s and a temperature of $10^{10}$ K neutrinos ceased to interact with protons and neutrons and by 3 minutes after the Big Bang all the primordial helium had been produced. After some 300,000 years the temperature of the Universe had cooled sufficiently such that the energy of photons no longer caused ionisation and electrons could now combine with nuclei to form atomic hydrogen and helium. High energy photons no longer interacted with atoms. The Universe had become transparent to photons and it is these photons which now give rise to the 3 K background radiation.

There are no definitive answers to the question as to how the galaxies came into existence but it is thought that their evolution is connected with the emergence of gas clouds. If the Universe had continued to expand uniformly after the Big Bang then all the matter in the universe would now be distributed uniformly and it is difficult to see how regions of higher density such as gas clouds came into existence. Without these gas clouds it is difficult to see how the galaxies came into existence. During the early expansion of the Universe some sort of wrinkles in space must have been formed which provided the nucleus about which gas clouds could form. This is rather like the way that liquid droplets form along the track of a charged particle as it moves through a super-cooled vapour.
Exercise
16.6 (b)
General Exercises
1.
The star Alpha Centauri B is 1.21 pc from Earth. Calculate
(a) this distance in AU.
(b) its parallax angle.
1.
Calculate the wavelength at which the Sun emits most of its energy. ($T_{Sun}$ = 5800 K)

Stars can be assigned to certain spectral classes. The classes are given in the table below.

Spectral class    Colour          Temperature (K)
M                 Red-orange      2500-3500
K                 Orange          3500-5000
G                 Yellow          5000-6000
F                 Yellow-white    6000-7500
A                 White           7500-10,000
B                 Blue-white      10,000-28,000
O                 Blue-white      28,000-50,000
2.
2.
Two stars A and B are respectively at distances 50 pc and 500 pc from the Earth. Both have equal brightness. Determine which star is the more luminous and by how much.

The diagram below shows the apparent magnitude scale used by astronomers.

Figure: apparent magnitude scale running from dim (+25) to bright (−25), with Pluto between +15 and +10, Sirius between 0 and −5 and the Sun beyond −25

3.
The table below gives the spectral class and absolute magnitude of some well-known stars.

Name of star      Spectral class    Absolute magnitude (approximate)
Rigel             A                 −7.0
Vega              A                 0
Sun               G                 5
Aldebaran         K                 0
Pollux            K                 +2
Sirius B          B                 +12
Procyon B         F                 +14
Barnard's star    M                 +13

(a)
(a)
(b) (c)
4.
Explain the difference between an eclipsing binary and a spectroscopic binary.
Outline the evidence on which the idea of an expanding Universe is based.
A certain line in the spectrum of atomic hydrogen has a wavelength of 121.6 nm as measured in the laboratory. The same line as detected in a distant galaxy has a wavelength of 147.9 nm. Determine the recession speed of the galaxy.
State the property of a main sequence star that determines its final outcome.
Describe the evolution of a main sequence star to a neutron star.
5.
(b) (c)
6.
Use this table to place the stars on a Hertzsprung-Russell diagram in which absolute magnitude is plotted against spectral class (temperature). For each of the stars identify to which category it could belong. Identify a star that is hotter and more luminous than the Sun and a star that is cooler and less luminous than the Sun.
3.
Describe how Olbers' paradox is inconsistent with Newton's model of the Universe. Explain the terms open, flat and closed as applied to the Universe.
7.
4.
Explain whether the Sun would be visible if it were at a distance of 10 pc from the Earth.
Estimate by how much brighter the Sun is than Sirius A.
The absolute magnitude of Sirius A is less than that of the Sun. Explain whether it is more or less luminous than the Sun.
(Sirius A: m = −0.7, Sun: m = −26.7)
17 COMMUNICATIONS

F.1 (SL and HL) Radio communication
F.2 (SL and HL) Digital signals
F.3 (SL and HL) Optic fibre transmission
F.4 (SL and HL) Channels of communication
F.5 (HL) C.3 (SL) Electronics
F.6 (HL) C.4 (SL) The mobile phone system
Introduction
This topic looks at how information is transmitted over large distances. For example, how can we hear a live concert in our home when it is being performed on the other side of the world or how can we have a telephone conversation with someone who lives in a different country? We have learnt in Topic 4 that a wave carries energy but it also carries information about the source that generated it. If the wave is a sound wave of audible frequency, the information that it carries is interpreted by the ear as the pitch of the source. The source might be a single tuning fork or it might be a full concert orchestra. However, we are well aware that sound waves lose energy quite rapidly; stand a few metres from a sounded tuning fork and you will not hear the sound it emits. You will have more success in hearing a full orchestra at some distance away, but not if you are several kilometres away from it. So sound information is soon lost if it has to travel any distance.
Clearly, the majority of the information that we wish to transmit to other people is primarily audio and/or visual information. To transmit this information, we need some means by which it can be carried between two points. If we think of the example of sound waves, information about the source is encoded in the wave but the wave, as mentioned above, just does not carry very far. On the other hand, experience tells us that electromagnetic waves can, and do, travel vast distances. So, without going into the history of communication theory, it is sufficient to say that electromagnetic (em) waves are, in the main, the carriers used in the present day to transmit information. However, we need to look in some detail at how information is actually carried by an em wave and this is what we look at in the next sections. Of course, we exclude here the obvious case where visual information needs no encoding, in the respect that it is carried directly to the eye by waves whose frequencies lie in the visible region of the em spectrum.
CHAPTER 17 (OPTION F)
Figure 1701 Variations in amplitude (axes: amplitude / mm against time / ms)
The variation of the amplitude with time and the frequency of the wave gives us the information about the source. In this situation, the information stays constant. If the information changes then the wave must change in some way. When this happens, the wave is said to be modulated. For example, suppose that the amplitude now varies with time as shown in Figure 1702.
Figure 1702 A modulated wave (axes: amplitude / mm against time / ms)

The original wave is still there but it has been modulated by the superposition of another wave of a different frequency. The modulation of the original wave gives information regarding the changes that have taken place at the source.
Figure 1703 A schematic communication system

It is no secret that it is the radio wave region of the em spectrum that is most used for the transmission of information. The reason for this is that radio waves may be produced at accurately maintained frequencies and also that they travel long distances with little attenuation (loss of energy). The radio spectrum also covers a wide range of frequencies varying from about 3 kHz to 300 GHz. In a radio communication system, the carrier wave in Figure 1703 would be a radio frequency wave and if audio data is being transmitted, the signal wave would be an em wave that is in the audio frequency range (the sound waves having been converted to an electrical signal).

Amplitude Modulation

In amplitude modulation, the frequency of the carrier wave is constant and the signal wave is used to vary the amplitude of the carrier wave. For example, a violinist can produce a so-called tremolo effect by varying the amplitude of the vibrating string by suitable movement of the bow. We can further illustrate AM with the simple situation shown in Figure 1704 in which a microphone is used to vary the amplitude of a direct electric current of constant value.

Figure 1704 (labels: microphone, loudspeaker, V+, V−; inserts show carrier wave voltage and AM voltage against time)

In the absence of any sound at the microphone, the current in the circuit is constant, resulting in a constant voltage across the loudspeaker. The constant voltage is the carrier wave. If you now, for example, speak into the microphone, the sound waves that you produce are converted into electrical signals that produce a variation in the amplitude of the current in the circuit. The variations in amplitude of the current in turn produce a variation of amplitude in the voltage (the signal wave) across the speaker as shown in the insert diagrams in Figure 1704. The speaker converts these voltage variations into sound waves.

Amplitude modulation is very different to the example we gave in F.1.1 in which we just added two sine waves. If we consider just the time variations of a wave, i.e. how the displacement of the medium at a particular point varies with time, then we can write the displacement $y$ as

$$y = A\sin(\omega t)$$

where $\omega = 2\pi f$. If we add two waves of different amplitudes $A$ and $M$ and different frequencies $f_1$ and $f_2$ respectively, then the resultant displacement $Y$ at the point is given by

$$Y = A\sin(2\pi f_1 t) + M\sin(2\pi f_2 t)$$

If instead a carrier wave of amplitude $A$ and frequency $f_c$ is amplitude modulated by a signal wave of amplitude $M$ and frequency $f_s$, the amplitude of the carrier itself varies with the signal and the resultant displacement takes the standard form

$$Y = \left[A + M\sin(2\pi f_s t)\right]\sin(2\pi f_c t)$$

which is a very different expression from just adding the two waves and gives a very different result as is shown in Figure 1705 in which $f_c = 18 f_s$.
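The difference between adding two waves and amplitude modulation can be seen numerically. The sketch below assumes the standard AM form $Y = [A + M\sin(2\pi f_s t)]\sin(2\pi f_c t)$ and, as illustrative values only, $A = 1$, $M = 0.5$, $f_s = 1$ Hz and $f_c = 18 f_s$. The function names are hypothetical.

```python
import math


def added_waves(t, A, M, f1, f2):
    """Simple superposition of two sine waves."""
    return A * math.sin(2 * math.pi * f1 * t) + M * math.sin(2 * math.pi * f2 * t)


def amplitude_modulated(t, A, M, fs, fc):
    """Carrier of frequency fc whose amplitude is varied by a signal of frequency fs."""
    return (A + M * math.sin(2 * math.pi * fs * t)) * math.sin(2 * math.pi * fc * t)


A, M, FS, FC = 1.0, 0.5, 1.0, 18.0

# Compare the two at a carrier trough that falls near the signal maximum.
t_trough = 19 / 72
y_sum = added_waves(t_trough, A, M, FC, FS)
y_am = amplitude_modulated(t_trough, A, M, FS, FC)
print(f"sum of waves: {y_sum:.3f}, amplitude modulated: {y_am:.3f}")
```

At this instant the modulated wave reaches almost $-(A + M)$ while the simple sum is only about $-(A - M)$: in AM the envelope is symmetrical about zero, whereas in a simple sum the whole carrier is shifted up and down by the signal. The same two functions can be tabulated in a spreadsheet or plotted to reproduce the shape described for Figure 1705.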
Spreadsheets can be used to demonstrate amplitude modulation using the above equation. This is why it is given here even though you will not be required to know this equation for the IB examination.
Figure 1705 (a) Signal wave, frequency = $f_s$ (c) Amplitude modulation

In Figure 1705 (c) the dotted curve shows the envelope of the modulated wave. Note that there are 18 complete oscillations of the wave inside one complete period of the envelope.

Frequency Modulation

In frequency modulation the amplitude of the carrier wave is kept constant and the signal wave is used to vary the frequency of the carrier wave. With analogue signals (see below) the carrier wave frequency is varied in direct proportion to changes in the amplitude of the signal wave. With digital signals (see below) the carrier wave frequency is shifted among a set of discrete values of frequencies. To understand the principles of FM, we will just consider analogue signals here as shown in Figure 1706.

Figure 1706 Frequency modulation (regions of the signal wave and carrier wave labelled A to F)

Figure 1707 shows what is happening to the amplitude of the signal wave in the labelled regions and the corresponding effect on the frequency of the carrier wave.

region    amplitude of signal wave    frequency of carrier wave
A to B    zero                        constant
B to C    increasing                  increasing with maximum at C
C to D    decreasing                  decreasing with minimum at D
D to E    increasing                  increasing to constant value at E
E to F    constant                    constant

Figure 1707
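For an analogue signal, frequency modulation can be sketched as follows. The example assumes a sinusoidal signal wave and introduces a hypothetical parameter `deviation`, the largest amount by which the carrier frequency is shifted; neither the parameter name nor the numerical values come from the text.

```python
import math


def instantaneous_frequency(t, fc, fs, deviation):
    """Carrier frequency at time t: shifted in proportion to the signal amplitude."""
    return fc + deviation * math.sin(2 * math.pi * fs * t)


def fm_displacement(t, amplitude, fc, fs, deviation):
    """Frequency modulated wave of constant amplitude.

    The phase is the time integral of the instantaneous frequency above.
    """
    phase = 2 * math.pi * fc * t + (deviation / fs) * (1 - math.cos(2 * math.pi * fs * t))
    return amplitude * math.sin(phase)


FC, FS, DEV = 18.0, 1.0, 4.0
f_at_signal_max = instantaneous_frequency(0.25, FC, FS, DEV)
f_at_signal_min = instantaneous_frequency(0.75, FC, FS, DEV)
samples = [fm_displacement(n / 1000, 1.0, FC, FS, DEV) for n in range(1000)]
print(f_at_signal_max, f_at_signal_min, max(samples))
```

The carrier frequency is greatest where the signal amplitude is greatest and least where it is least, while the amplitude of the modulated wave never exceeds that of the carrier.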
Example

A carrier wave is modulated by a single signal wave. As a result of the modulation, a maximum amplitude of the carrier wave occurs every 2.3 ms. Between each maximum there are $2.1 \times 10^{5}$ complete oscillations of the carrier wave. Determine the frequency of the signal wave and of the carrier wave.

Solution

For the signal wave

$$\frac{1}{f_s} = 2.3 \times 10^{-3} \text{ s}$$

to give $f_s = 435$ Hz.

For the carrier wave

$$f_c = \frac{2.1 \times 10^{5}}{2.3 \times 10^{-3}} = 9.1 \times 10^{7} \text{ Hz} = 91 \text{ MHz}$$
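The arithmetic in this solution can be checked directly. In the sketch below the function name is illustrative; it simply takes the time between successive amplitude maxima as one period of the signal wave and counts the carrier oscillations in that time.

```python
def modulation_frequencies(envelope_period_s, carrier_cycles_per_period):
    """Signal and carrier frequencies from the period of the envelope."""
    fs = 1 / envelope_period_s
    fc = carrier_cycles_per_period / envelope_period_s
    return fs, fc


fs, fc = modulation_frequencies(2.3e-3, 2.1e5)
print(f"signal: {fs:.0f} Hz, carrier: {fc / 1e6:.0f} MHz")
```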
Exercise
17.1 (a)
If the carrier wave in the above example is frequency modulated by the same signal wave as above, determine the time interval between an oscillation of the carrier wave of maximum frequency and one of minimum frequency.
The transmitted power is concentrated in the carrier signal, and as this carries no useful information this means that amplitude modulation is inefficient in terms of power usage. The remaining power is split between two identical sidebands. Since each band contains the same information only one is actually needed so again there is wasted power. AM can be made to be more power efficient by eliminating one of the sidebands (single sideband transmission) although the electronic circuitry needed to do this is complicated and adds to the expense of the transmitter.
Figure 1708 AM amplitude spectrum (axes: amplitude against frequency, with $f_c$ marked)
However, if the carrier wave is modulated by a single signal wave of frequency fs, the frequency spectrum will look like that shown in Figure 1709.
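The positions of the two sidebands are easy to compute. The sketch below uses hypothetical values (a 1000 kHz carrier and a 4.5 kHz signal) and assumes the usual result that, for a range of signal frequencies, the bandwidth is twice the highest signal frequency. The function names are illustrative.

```python
def sidebands(fc_hz, fs_hz):
    """Lower and upper sideband frequencies for a single signal frequency."""
    return fc_hz - fs_hz, fc_hz + fs_hz


def bandwidth(fs_max_hz):
    """Width of the band occupied when the highest signal frequency is fs_max."""
    return 2 * fs_max_hz


lower, upper = sidebands(1_000_000, 4_500)
print(f"sidebands at {lower} Hz and {upper} Hz, bandwidth {bandwidth(4_500)} Hz")
```

With these illustrative numbers the transmission occupies 9 kHz, centred on the carrier frequency.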
Figure 1709 Frequency spectrum of single frequency modulation (axes: amplitude against frequency, with lines at $f_c - f_s$, $f_c$ and $f_c + f_s$)

Bearing in mind that power is proportional to amplitude squared, Figure 1709 is translated into a power spectrum as is shown in Figure 1710 (not to scale).

Figure 1710 (axes: power against frequency, with lines at $f_c - f_s$, $f_c$ and $f_c + f_s$)

Exercise 17.1 (b)

1. The allowed frequency range for broadcasting in the medium frequency range is 535 kHz to 1615 kHz. Calculate the number of 9 kHz broadcasting channels that are available.

2. A carrier wave of frequency 535 kHz is modulated by frequencies in the range 50 Hz to 5.0 kHz. Determine the frequencies transmitted and the bandwidth.
One of the problems with AM transmission is the fact that the amplitude is not likely to stay constant during transmission. This is because of interference with other electrical signals (unwanted signals are known as noise) and also due to energy loss. Because of change of amplitude during transmission, when the transmitted signal is decoded, the encoded signals will be distorted. FM does not suffer from this problem since any change in amplitude does not affect the encoded signals.

Another problem with AM is the limitation posed by bandwidth. The relatively narrow bandwidth does not lead to much distortion of speech but the higher harmonics of musical sounds will not be transmitted and this leads to a lack of quality in the reproduced sound. For reasons that are too complex to go into here, audio frequencies up to 15 kHz can be encoded using FM but in order to do this successfully a total bandwidth of about 200 kHz is needed. It is for this reason that FM signals occupy the high frequency end of the radio wave spectrum.

Historically, there were two main advantages of AM transmission. The circuitry involved was cheaper and simpler, but the advent of the silicon chip means that this is no longer true. The range of AM transmission is also much greater than that of FM transmission. Strictly speaking, to receive FM you need to be able to see the transmitter; AM on the other hand is reflected by the ionosphere so can be received effectively anywhere on Earth irrespective of the position of the transmitter. However, communication satellites now effectively act as the ionosphere for FM signals, reflecting them to all points on the surface of Earth and, although AM is still in use, it is soon likely to be a thing of the past. AM is also much more subject to noise than FM.

Figure 1703 in F.1.2 shows the basic components of a radio transmitter. In Figure 1711, we show the basic components of a radio receiver.

Figure 1711 (labels, in order: aerial, tuning circuit, r.f. amplifier, demodulator, a.f. amplifier, loudspeaker)

A brief description of each component is given below.

Aerial: this detects the transmitted signal by the electric field in the transmitted signal oscillating the electrons in the aerial.
Tuning circuit: this is a circuit that can be tuned to resonate (see Topic 4) with a particular frequency of a carrier wave.
r.f. (radio frequency) amplifier: this amplifies the selected modulated carrier wave.
Demodulator: this essentially removes the carrier wave leaving only the signal waves.
a.f. (audio frequency) amplifier: this amplifies the signal waves.
Loudspeaker: this converts the signal waves into sound waves.
Please refer to Chapter 14, Topics C.1.1 and C.1.2.
Clock
This provides the reference pulses.
Figure: block diagram (labels: clock, ADC, to serial converter, modulator, carrier wave, transmission)
     A  B  C  D
0    0  0  0  0
1    0  0  0  1
2    0  0  1  0
3    0  0  1  1
4    0  1  0  0
Modulator
The digital data modulates the carrier wave for transmission. Of course digital data transmission does not just occur in broadcasting (in fact the use here is a relatively new phenomenon) but in transmission along optical fibres (see below) and electrical cable in general. An example of the latter is transmission of data from the hard drive of your computer to your printer and even computer generated data to a word processor. For example in Figure 1714, four bit generated binary words are converted to a series of binary pulses along a single cable. The serial pulses are then converted back to 4-bit words to be used by the word processor.
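The conversion of 4-bit words to a serial stream and back again can be sketched in a few lines. The function names below are illustrative; the words used are the binary codes for 0 to 4 given in the table above.

```python
def to_word(value, width=4):
    """Binary word for a non-negative integer, most significant bit first."""
    return [int(bit) for bit in format(value, f"0{width}b")]


def parallel_to_serial(words):
    """Send the bits of each word one after another along a single line."""
    return [bit for word in words for bit in word]


def serial_to_parallel(bits, width=4):
    """Regroup a serial stream of bits into words of the given width."""
    return [bits[i:i + width] for i in range(0, len(bits), width)]


words = [to_word(n) for n in range(5)]
stream = parallel_to_serial(words)
recovered = serial_to_parallel(stream)
print(stream)
print(recovered == words)
```

The receiver needs to know the word length (and, in practice, where each word starts) in order to regroup the pulses correctly; this sketch simply assumes that the stream starts at a word boundary.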
Figure 1714 (labels: word generator, serial to parallel converter, word processor)

So, for example, encoding in 8-bit binary and sampling at a frequency of 16 kHz will result in a bit rate of (8 × 16) = 128 kb s$^{-1}$ (kilobits per second). The greater the bit rate, the higher the quality of the reproduced transmitted data. Quite often, there is a trade off between the total amount of space that the data takes up and the bit rate, as in MP3 and JPEG files. In general, though, the sampling frequency necessary to ensure accurate reproduction of the signal must be equal to, or greater than, twice the highest signal frequency. (This is known as the Nyquist theorem or the sampling theorem.)
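The two relations in this paragraph can be written as short functions. This is a minimal sketch with illustrative function names; the 8 kHz signal frequency in the second call is a hypothetical value chosen so that the minimum sampling frequency comes out at the 16 kHz used in the text.

```python
def bit_rate(bits_per_sample, sampling_hz):
    """Bit rate in bits per second: bits per sample times samples per second."""
    return bits_per_sample * sampling_hz


def minimum_sampling_frequency(highest_signal_hz):
    """Sampling theorem: sample at no less than twice the highest signal frequency."""
    return 2 * highest_signal_hz


rate = bit_rate(8, 16_000)
print(f"bit rate: {rate / 1000:.0f} kb/s")
print(f"minimum sampling frequency for 8 kHz: {minimum_sampling_frequency(8_000)} Hz")
```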
Figure 1715 A two input multiplexer (labels: input S1, input S2, control pulse, output)

When the control pulse is, say, 1, then the switch is in the position shown and the input signal S1 will appear at the output. When the control pulse is 0, then the switch changes position and S2 now appears at the output. A four input multiplexer can be made out of three two input multiplexers and two controls and so on.

To see how multiplexing works in practice, consider a situation in which sampling is done at the rate of 16 kHz, that is one sample every 62.5 μs. If each sample takes 8 μs to convert to an 8-bit word, then there is a vacant time slot of length 54.5 μs before the next word is transmitted. In theory, this slot can be filled by words from 6 different channels (54.5 ÷ 8). Figure 1716 illustrates how a multiplexer can be used to transmit data from three separate channels.

Figure 1716 (labels: channels A, B and C; time division multiplexer; transmitted sequence A B C A B C A B C; time division multiplexer in reverse operation; channels A, B and C)

Exercise 17.2

The graph shows the variation with time of the output of an electrical signal.

Graph (axes: output / V from 0 to 9 against time / 100 μs from 0 to 40)

(a) The signal is sampled every 125 μs and is converted into a 3-bit digital signal. The possible outputs of the ADC are given below.

Analogue signal / V    ADC binary output
0 to <1                000
1 to <2                001
2 to <3                010
3 to <4                011
4 to <5                100
5 to <6                101
6 to <7                110
7 to <8                111

Determine:
(i) the sampling rate.
(ii) the digital output of the signal for the 8th sample.
(iii) the bit rate.

(b) Assuming a binary pulse is produced every 1 μs, calculate the maximum number of channels that time division multiplexing can produce at this sampling frequency for sending digital data as two four bit bytes.
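The time-slot arithmetic in the multiplexing paragraph above (16 kHz sampling, 8 μs per word) and the interleaving of words from several channels can be sketched as follows. The function names and the sample words are illustrative.

```python
def extra_channels(sampling_hz, word_time_s):
    """Number of further words that fit in the gap between two samples."""
    sample_interval = 1 / sampling_hz
    vacant = sample_interval - word_time_s
    return int(vacant // word_time_s)


def multiplex(channels):
    """Time division multiplexing: take one word from each channel in turn."""
    return [word for frame in zip(*channels) for word in frame]


def demultiplex(stream, n_channels):
    """Reverse operation: every n-th word belongs to the same channel."""
    return [stream[i::n_channels] for i in range(n_channels)]


slots = extra_channels(16_000, 8e-6)
channels = [["A1", "A2"], ["B1", "B2"], ["C1", "C2"]]
stream = multiplex(channels)
print(slots, stream)
print(demultiplex(stream, 3) == channels)
```

The vacant slot of 54.5 μs holds 6 further 8 μs words, as stated in the text, and the interleaved stream has the A B C A B C pattern of Figure 1716.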
Digital communication and the development of multiplexing have had a tremendous impact on worldwide communications. The amount of information and the speed with which it now can be transmitted across the world means that news is almost instantaneous. One only has to compare, for example, how long it took for news of the Battle of Trafalgar or news of the Boston Tea Party to reach Europe (and just how sparse the information was), with how long it took for news of the 2004 tsunami to reach different parts of the world. Also, just think how easy it is now to be in contact with distant friends and relatives by the use of email as compared to snail mail. Indeed, digital communication has made the world a much smaller place.

F.2.9 DISCUSS THE MORAL, ETHICAL, ECONOMIC AND ENVIRONMENTAL ISSUES ARISING FROM ACCESS TO THE INTERNET

The development of the Internet is a prime example of digital communication, the information highway. However, its use raises many issues that you might like to discuss with your teacher. Some of the issues might include the following.

The use of the internet to spread political or religious propaganda.
The use of the internet to sell illegal goods.
The availability of inappropriate material such as pornography for minors.
The advent of internet shopping and its effect economically and socially.
The effect on the world's energy consumption.

F.3 Optic fibre transmission
[Figure 1717 Refraction: (a) air to glass transmission (b) glass to air transmission]

In Figure 1717(b) the light follows the same path as in Figure 1717(a) but in the opposite direction. However, whereas any angle of incidence between 0° and 90° is possible when the light is travelling from air to glass, this is not the case when travelling from glass to air. When angle i = 90° in (a), this corresponds to the maximum possible angle of refraction when the light is travelling from glass to air, as shown in Figure 1718.

[Figure 1718: light in glass meeting the glass-air boundary, with i(glass) = c and r(air) = 90°]

For light travelling from air to glass, the refractive index n of the glass is

$$n = \frac{\sin i}{\sin r}$$

With $i = 90^\circ$ in the air and $r = c$ in the glass, this becomes

$$n = \frac{1}{\sin c}$$

where c is the angle of incidence in the glass that corresponds to an angle of refraction in air of 90°. The dotted rays in Figure 1718 show what happens to light that is incident on the glass-air boundary at angles greater than c; the light is totally internally reflected. The reflection is total since no light is transmitted into the air. The angle of incidence at which total reflection just occurs is the angle c and for this reason c is called the critical angle.

Example

The critical angle for a certain type of glass is 40.5°. Determine the refractive index of the glass.

Solution

$$n = \frac{1}{\sin c}, \quad \text{therefore} \quad n = \frac{1}{\sin 40.5^\circ} = 1.54$$

Exercise 17.3 (a)

Determine the critical angle for light travelling from water of refractive index 1.33 to air.
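The relation between critical angle and refractive index used in the worked example (critical angle 40.5°, n = 1.54) can be checked numerically. This is a minimal sketch, assuming the second medium is air with a refractive index of 1; the function names are illustrative.

```python
import math

def refractive_index(critical_angle_deg: float) -> float:
    """n = 1 / sin(c) for a boundary between the medium and air."""
    return 1.0 / math.sin(math.radians(critical_angle_deg))

def critical_angle_deg(n: float) -> float:
    """Inverse relation: c = arcsin(1 / n), in degrees."""
    return math.degrees(math.asin(1.0 / n))

print(round(refractive_index(40.5), 2))    # worked example value
print(round(critical_angle_deg(1.54), 1))  # recovers the critical angle
```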
single step fibre and of light transmission along the fibre by total internal reflection.
[Figure 1719 Optical fibre: (a) structure, showing the core (50 μm) and the cladding (125 μm); (b) transmission of light by TIR]

Figure 1719 (a) shows that the fibre has a two-layer structure, the core and the cladding. The diameter of the core is constant, at approximately 50 to 60 μm, and a typical refractive index would be 1.440. The surface of the core is kept as smooth as possible. The outer layer, the cladding, is bonded at all points to the surface of the core and a typical refractive index would be 1.411. The cladding ensures that the refractive index of the outside of the core is always less than that of the inside. The cladding is usually surrounded by a layer of plastic that protects and strengthens the fibre.

Material dispersion

Although all electromagnetic waves have the same speed in a vacuum, the speed of a wave in a medium depends on its wavelength, a phenomenon known as dispersion. Another way to regard dispersion is to recognise that the refractive index of a medium depends on wavelength (see Topic G.1.4). The implication of this for optical fibres is that light of different wavelengths travels along a fibre at different speeds and so takes different times to cover the same length of fibre. This means that the pulses in the fibre will spread out as they travel along the fibre and the information carried by the waves will be distorted. Figure 1720 shows how a square wave pulse might be distorted by this so-called material dispersion.

[Figure 1720 Material dispersion: a pulse before transmission and after transmission]

Using laser light as the source of the carrier wave greatly reduces material dispersion since the bandwidth of the light emitted by a laser is only about 5 nm. Even using light emitted by light emitting diodes (LEDs) reduces material dispersion since the light emitted still has a bandwidth of only about 20 nm.

Attenuation

The intensity of the carrier wave in a fibre will decrease with the distance travelled along the fibre. This phenomenon is called attenuation and is due to the energy carried by the wave being lost. The energy loss has a variety of causes, such as scattering and absorption within the core. The attenuation is often measured in decibels per kilometre. The decibel scale is a logarithmic scale (see Topic I.1.6 to see how it applies to hearing loss) and in optic fibres the attenuated power is defined below.

$$\text{loss of power (attenuation) in decibels} = 10 \log \frac{\text{initial power}}{\text{output power}} = 10 \log \frac{I_1}{I_2}$$
We see that attenuation decreases with increasing wavelength and that there are two distinct minima at about 1300 nm and 1500 nm. Monomode fibres using carrier waves at these wavelengths are the best choice for long-distance communication such as in telecommunication.
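Example

The power loss between source and detector in a particular optic fibre of length 1.5 km is 50%. Calculate the power loss in dB km⁻¹.

Solution

$$\text{power loss in dB} = 10 \log_{10}\left(\frac{1}{2}\right) = -3.0\ \text{dB}$$

That is, a loss of 3.0 dB over the whole fibre. Since the fibre is 1.5 km long, the power loss is 3.0 ÷ 1.5 = 2.0 dB km⁻¹.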
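The decibel calculation in this example can be reproduced in a few lines. The sketch below is illustrative only: the function names are assumptions, and the loss is expressed as a positive number of decibels using the ratio of initial power to output power.

```python
import math

def loss_db(power_in: float, power_out: float) -> float:
    """Attenuation in decibels: 10 log10(initial power / output power)."""
    return 10.0 * math.log10(power_in / power_out)

def loss_db_per_km(power_in: float, power_out: float, length_km: float) -> float:
    """Attenuation per kilometre, assuming the loss is uniform along the fibre."""
    return loss_db(power_in, power_out) / length_km

# Example from the text: half the power is lost over 1.5 km.
print(round(loss_db(1.0, 0.5), 1))              # total loss in dB
print(round(loss_db_per_km(1.0, 0.5, 1.5), 1))  # loss in dB per km
```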
Exercise 17.3 (b)

Calculate the length of an optical fibre with an attenuation of 2.0 dB km⁻¹ that will result in a power loss of 80%.

One great advantage of using optic fibres for transmitting data is that they are not very susceptible to noise. Any noise that does affect the signals arises from stray light entering the fibre at the transmitting and receiving ends of the fibre. At the receiving end, photodiodes are used to convert the light pulses into electrical pulses and photodiodes are subject to random noise. The noise to signal power ratio for an optic fibre is typically in the range $10^{-17}$ to $10^{-18}$.
Reshapers (regenerators)
Monomode fibres effectively eliminate modal dispersion and, although lasers reduce material dispersion, the latter is still present. Suppose, for example, that data is transmitted at a rate of 1 Gb s⁻¹; then, to ensure that the pulses remain distinct from each other, they need to be separated by at least 0.5 ns. Over long distances the pulses can become so spread out that, even with laser light, individual pulses will be starting to merge together after about 50 km. So every 40 to 60 km, the pulses are detected and then reshaped. The reshaped pulses are then encoded onto a new laser beam for continued transmission. This is the function of the reshaper.
[Graph: attenuation (axis from 0 to 5) against wavelength / × 10⁻⁷ m (axis from 7 to 16)]
Amplifier
Even if pulses have been re-shaped, the carrier wave and signal still undergo attenuation. In 1987 David Payne and his co-workers at the University of Southampton in England developed the first practical optical amplifier suitable for optic fibre communication systems.
These amplifiers enable the attenuated carrier and pulses to be amplified at various points along the fibre. The combination of reshapers and amplifiers means that data can be transmitted along optic fibres for vast distances. The longest optic fibre links are those that span the world's oceans, including several across the Atlantic from North America to Europe and several from the United States to Japan. The transatlantic cable laid in 1988 contains eight fibres and each fibre has a bit rate of 560 Mb s⁻¹, enabling the cable to carry 40 000 telephone calls at one time. The next generation of cables, which came into service in 1992, is able to carry double this number of calls.
1. The cladding of an optic fibre has a refractive index of 1.46 and the core a refractive index of 1.48. The fibre is $1.80 \times 10^{2}$ m long.
(a) Calculate the critical angle between the core and cladding.
(b) Show that the difference between the transmission time for an axial signal and a signal that is incident to the cladding at an angle that is just greater than the critical angle is about 9 ns. (HINT: consider the geometry of the situation.)

2. The input power to an optic fibre is 10 mW and the signal noise is $1.0 \times 10^{-20}$ W. The attenuation loss in the fibre is 2.5 dB km⁻¹.
(a) Calculate the ratio of input power to signal noise in decibels.
(b) The input signal needs to be amplified when its power is attenuated to $1.0 \times 10^{-18}$ W. Determine the maximum separation of the amplifiers in the cable.
In this section we look briefly at the different communication channels that are available and also their relative advantages and disadvantages.
F.4.1 OUTLINE DIFFERENT CHANNELS OF COMMUNICATION, INCLUDING WIRE PAIRS, COAXIAL CABLES, OPTIC FIBRES, RADIO WAVES AND SATELLITE COMMUNICATION

Wire pairs

The first real electronic communication system was the telegraph system that used electrical pulses transmitted along wires. This system led to the development of the so-called Morse code. One type of telegraph system that has survived is the Telex system. This system has the advantage over the conventional telephone system that written documents can be sent. Although the development of scanners used along with email, Fax and Teletex has to some extent superseded Telex, more than three million telex lines remain in use worldwide. For instance, it is still widely used in some developing countries because of its low cost and reliability. However, wire pairs that are twisted have one great advantage, namely that the twisting together results in the cancelling out of electromagnetic interference from other sources, particularly from adjacent wires. Essentially, the wires carry equal and opposite signals such that the noise in one cancels out the noise in the other. Also, because of their relatively low cost compared to optic fibres and coaxial cable, twisted wires are often used in data networks and computer systems where there are short and medium length connections between components. A modern variant of twisted wires is the so-called Ethernet connection.

Coaxial cables

A coaxial cable consists of a thin copper wire surrounded by an insulator which in turn is surrounded by a copper grid. This grid is also surrounded by an insulator. Coaxial cable will probably be familiar to you as the cable that often connects the aerial to your television set. Although optic fibres have replaced coaxial cable in many situations, it is still used to carry the majority of cable television signals and also for DSL (digital subscriber line) internet connections. The main disadvantage of coaxial cable compared to optic fibre is that coaxial cable is not suitable for the transmission of digital signals and also that it has an upper limit bit rate of about 140 Mb s⁻¹.

Optic fibres

Some of the advantages of optic fibres have been referred to in Topic F.3.

ADVANTAGES
• highly suitable for transmitting digital data
• much higher bit rate
• much smaller in diameter
• low noise
• good security

DISADVANTAGES
• often difficult to access and repair
• finite life of amplifiers and reshapers

Radio waves

Clearly, the main advantage of radio communication is that it does not require any wires or cables. The main disadvantage was that it could not be used to transmit data contained in documents. However, the advent of wireless connection for the internet means that this is no longer the case.
ADVANTAGES AND DISADVANTAGES OF THE USE OF GEOSTATIONARY AND OF POLAR-ORBITING SATELLITES FOR COMMUNICATION

Another type of communication satellite is the polar orbiting satellite. Clearly, a polar satellite is not geostationary and, as such, its orbital height above the surface of Earth is much less than that of a geostationary satellite. It is useful to compare the advantages and disadvantages of each type of satellite in their use as communication satellites.

The fact that they keep the same position means that, unlike polar satellites, geostationary satellites need no tracking system. Also, geostationary satellites are always available to those points that are in the field of view of the satellite. This is not true for a polar satellite as different points will move out of the field of view at different times.

A disadvantage of geostationary satellites is that their great height above the surface of Earth means that more power is needed for transmitting to and from the satellites than for a polar satellite. An even bigger disadvantage is that, because of the curvature of Earth, geostationary satellites are not able to communicate with points further than about 60° to 70° North or South. On the other hand, polar satellites will be able to communicate with all points in both the Northern and Southern hemispheres at some time during their orbit. Geostationary satellites, because of their large orbital height, are much more costly to put into orbit than polar satellites.

It should be borne in mind that satellites are also used for remote imaging and that both types have advantages and disadvantages in this respect.
(OP-AMP)

The operational amplifier is a remarkable device that has many uses in electronics and communications. It was originally called a direct current (d.c.) amplifier to distinguish it from amplifiers that are used to amplify analogue signals, such as audio amplifiers. Figure 1722 shows the circuit symbol for an operational amplifier (op-amp).

[Figure 1722: circuit symbol for an op-amp, showing the inverting (−) and non-inverting (+) inputs, the output, and the +V and −V supply lines]

The power supply, the +V and −V in the diagram, of op-amps may vary from 15 V to 30 V. It is important to realise that, for example, a ±15 V supply is not the same as a 30 V supply. The +15 V is measured with reference to 0 V and so is the −15 V, whereas in a 30 V supply the two supply lines are 30 V and 0 V. (Note that it is usual to omit the supply lines when drawing circuits involving op-amps.) The amplifier has one output terminal and the output voltage can take any value between ±V, although in practice the output will have a slightly smaller range than this. There are two input terminals to the amplifier. One is an inverting input (marked −) and the other is a non-inverting input (marked +).

The two properties that make the amplifier such a useful device are:
(i) its very high open loop gain (A0), typically about $10^{5}$, meaning that without any other electrical components connected to the amplifier, the output voltage is $10^{5}$ × the input voltage.
(ii) its very high input resistance, anything between $10^{6}$ Ω and $10^{12}$ Ω, meaning that the input current to the amplifier is negligible.

The amplifier also has very low output resistance and this is of importance if the output of the amplifier is to be connected to low resistance components. In Figure 1722, if the inverting input is V1, the non-inverting input V2, and the output Vout, then

$$V_{out} = A_0 (V_2 - V_1)$$

That is, the amplifier amplifies the difference between the inputs and for this reason it is sometimes called a differential amplifier. In the open loop mode of operation, because of the high gain, the amplifier is very unstable as small changes in the input produce very large changes at the output.
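The open loop behaviour described above, amplifying the difference between the inputs up to the limit set by the supply, can be sketched as follows. This is an idealised illustration under stated assumptions: the output is clamped at exactly ±V, whereas the text notes the real range is slightly smaller, and the function name and default values are hypothetical.

```python
def open_loop_output(v_plus: float, v_minus: float,
                     gain: float = 1e5, v_supply: float = 15.0) -> float:
    """Ideal open-loop op-amp: Vout = A0 (V2 - V1), limited to +/- the supply."""
    v_out = gain * (v_plus - v_minus)           # V2 is the non-inverting input
    return max(-v_supply, min(v_supply, v_out)) # saturate at the supply rails

# A 1 mV difference is already enough to saturate the output.
print(open_loop_output(0.001, 0.0))
print(open_loop_output(0.0, 0.001))
```

The second call shows the sign reversal when the inverting input is the larger one, which is the basis of the comparator and Schmitt trigger circuits.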
F.5.2 DRAW CIRCUIT DIAGRAMS FOR BOTH INVERTING AND NONINVERTING AMPLIFIERS

[Figure 1723: inverting amplifier circuit, with labels Rin, Vin, P, Rf, Vout and 0 V]

In this amplifier, the non-inverting input is earthed and a resistor of resistance Rin is connected to the inverting input. Another resistor of resistance Rf is connected between the output and the input. Rf is called a feedback resistance since the resistor is effectively feeding back the output voltage to the input.

Suppose that the potential difference between the point P on the diagram and the earth-line (0 V) is Vp. Then we have

$$V_{in} - V_p = IR_{in}$$

where Vin is the input voltage and I is the input current. The current through Rf will also be I because of the very high input resistance of the amplifier, such that

$$V_p - V_0 = IR_f$$

where V0 is the output voltage. However, we also have that

$$V_0 = -A_0 V_p \quad \text{or} \quad V_p = -\frac{V_0}{A_0}$$

But A0, the open-loop gain, is very high, so this means that Vp ≈ zero. Therefore Vin = IRin and −V0 = IRf, so that the gain is

$$G = \frac{V_0}{V_{in}} = -\frac{R_f}{R_{in}}$$

Virtual earth

The point P in the diagram is called a virtual earth because it is effectively at zero potential with respect to the earth-line. If, in Figure 1723, we were to replace the amplifier with a piece of plastic, the very high resistance of the plastic would mean that the current in Rf would be the same as that in Rin but, and this is the important point, the potential at point P would not be zero. It is the other important property, the high gain of the amplifier, that ensures that P will effectively be at earth potential. The inverting amplifier could therefore be replaced with the circuit shown in Figure 1724.

[Figure 1724: equivalent circuit with Rin, Rf and the 0 V line]
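The virtual earth argument can be tested numerically by solving the three relations of the derivation (Vin − Vp = IRin, Vp − V0 = IRf and V0 = −A0Vp) without assuming that Vp is zero. The sketch below does this; the closed-form expression is obtained by eliminating I and Vp from those relations, and the function name and resistor values are illustrative assumptions.

```python
def inverting_gain(r_f: float, r_in: float, a0: float = float("inf")) -> float:
    """Gain V0/Vin of the inverting amplifier for open-loop gain a0."""
    ideal = -r_f / r_in                       # result when Vp is taken as zero
    if a0 == float("inf"):
        return ideal
    # Eliminating I and Vp from the three relations gives this correction factor.
    return ideal / (1.0 + (1.0 + r_f / r_in) / a0)

g = inverting_gain(100e3, 10e3, a0=1e5)       # hypothetical 100 kOhm and 10 kOhm resistors
v_p = -(g * 1.0) / 1e5                        # Vp = -V0/A0 for a 1 V input
print(g, v_p)
```

For an open-loop gain of about 10⁵ the gain differs from −Rf/Rin by roughly one part in ten thousand, and Vp is of the order of 0.1 mV for a 1 V input, which is why P can be treated as being at earth potential.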
[Figure 1725: non-inverting amplifier circuit, with labels Vin, Vout, Rf, R and 0 V]

The potential difference across the resistor of resistance R equals the input potential Vin (the difference in the input voltages is zero) such that, if the current in the resistors R and Rf is I (remember that the current at the inputs is zero), then we have that

$$I = \frac{V_{in}}{R} \quad \text{and} \quad I = \frac{V_{out} - V_{in}}{R_f}$$

so that

$$\frac{V_{in}}{R} = \frac{V_{out} - V_{in}}{R_f}$$

Remembering that the gain $G = \frac{V_{out}}{V_{in}}$, dividing by Vin and re-arranging, we have that

$$G = 1 + \frac{R_f}{R}$$

[Figure 1726]
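The result G = 1 + Rf/R relies on the difference between the input voltages being taken as zero. A short check, assuming a finite open-loop gain A0 and the relation Vout = A0(V2 − V1) with V1 set by the potential divider formed by R and Rf, shows how good that approximation is. The function name and component values are illustrative only.

```python
def non_inverting_gain(r_f: float, r: float, a0: float = float("inf")) -> float:
    """Gain Vout/Vin of the non-inverting amplifier for open-loop gain a0."""
    beta = r / (r + r_f)          # fraction of Vout fed back to the inverting input
    if a0 == float("inf"):
        return 1.0 / beta         # equals 1 + Rf/R
    # From Vout = a0 * (Vin - beta * Vout), solved for Vout/Vin.
    return a0 / (1.0 + a0 * beta)

print(non_inverting_gain(90e3, 10e3))           # ideal value
print(non_inverting_gain(90e3, 10e3, a0=1e5))   # with a finite open-loop gain
```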
The resistance of the variable resistor R is adjusted so that the non-inverting input is at a greater potential than the inverting input. In this situation Vout = +V and activates a green light emitting diode (LED) at the output (not shown). T is a thermistor and, in the situation shown, its temperature is low. When the temperature increases, the resistance of T decreases and reaches a value where the potential at the inverting input is greater than that at the non-inverting input. In this situation the output Vout = −V; this switches off the green LED but activates a red LED, giving warning of the rise in temperature.
[Figure 1727 A non-inverting Schmitt trigger, with labels Vin, R1, P and R2]

Suppose that, in the situation shown, the values of the potential at the two inputs are such that the output voltage is at its minimum value −Vo. This means that the potential at point P, Vp (the potential at the non-inverting input), is less than the potential V′ at the inverting input. If Vp increases above V′, the output will switch to its maximum value +Vo. Remembering that the amplifier draws no current, let us calculate by how much Vin has to rise in order for the output to switch to its maximum value. To illustrate this, we will consider a particular circuit in which Vp = V′ = 1.0 V (i.e. the input signals are equal), R1 = 22 kΩ, R2 = 100 kΩ and Vo = 5.0 V. In the situation where the output is −5.0 V, the potential across R1 is (with resistances in kΩ and the current I in mA)

$$V_{in} - V_p = IR_1 = 22I$$

The potential across R2 is

$$V_p + V_o = 6.0 = IR_2 = 100I$$

Therefore I = 0.060 mA, such that Vin = 22 × 0.060 + 1.0 = 2.3 V. We could, of course, have done the calculation by considering R1 and R2 as a potential divider across which the potential difference is Vin + 5.0.

Figure 1728 shows the output of a Schmitt trigger reshaping misshapen input pulses.

[Figure 1728 Reshaped pulse output of a Schmitt trigger: misshapen input pulse and reshaped output pulse against time, with levels +Vo, V′ and −Vo]

The output of the Schmitt trigger is at −Vo until the input reaches V′, at which point the output switches to +Vo and remains at this value until the input drops to V′ again, when it switches back to −Vo. In this way the pulses are re-shaped. In order for the reshaped output pulses to have only a positive value, a diode is connected between R2 (Figure 1727) and the output.
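The worked calculation for the non-inverting Schmitt trigger can be written as a single expression: since no current enters the amplifier, the current in R1 equals the current in R2, and switching occurs when Vp reaches V′. The sketch below applies this to the values in the text (V′ = 1.0 V, R1 = 22 kΩ, R2 = 100 kΩ, Vo = 5.0 V). The function name is illustrative, and using the same argument with the output at +Vo is an extension that the text does not work through.

```python
def switching_input(v_ref: float, v_out: float, r1: float, r2: float) -> float:
    """Input voltage at which Vp equals v_ref for a given output level.

    Equal currents in R1 and R2: (Vin - Vp) / R1 = (Vp - Vout) / R2, with Vp = v_ref.
    """
    return v_ref + (v_ref - v_out) * r1 / r2

# Output at -5.0 V: the input level needed to switch the output high.
print(round(switching_input(1.0, -5.0, 22e3, 100e3), 2))
# Output at +5.0 V: the same method applied to the return switch (an extension).
print(round(switching_input(1.0, +5.0, 22e3, 100e3), 2))
```

The first value reproduces the 2.3 V found in the text to two significant figures.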
Exercise 17.5

1. This exercise shows how an op-amp can be used as a summing amplifier. In the circuit below each resistor has the same value of resistance R. For the circuit, show that the output voltage is the sum of the three input voltages, i.e. VA + VB + VC = Vout.

[Circuit diagram: inputs VA, VB and VC, output Vout and 0 V line]

2. The two Schmitt triggers shown below each have a maximum output voltage of +12 V. Calculate the switching potential for each trigger.
(a) [Circuit diagram: Schmitt trigger with labels 0 V, 33 kΩ and 100 kΩ]
(b) [Circuit diagram: Schmitt trigger with labels 5.0 V, 47 kΩ and 470 kΩ]

Introduction

There is little doubt that the development of the mobile phone system has revolutionised electronic communication, and in a relatively short space of time. Twenty years ago, mobile phones were rare and expensive. In the present day, mobile phones outnumber land-line telephones and, in the UK for example, also outnumber the population. In 2006, the estimated number of mobile phone users was $2.5 \times 10^{9}$ and the number is still growing. It is expected that over 90% of the world's population will have access to mobile phone networks by 2010.
the phone can be reduced whereas the different operating frequencies minimise interference between neighbouring cells. The use of a cellular network is the reason that mobile phones are sometimes referred to as cell phones.

Like all innovations, there are benefits and there are downsides associated with mobile phone use. Here we outline only some of the possible downsides that your teacher might like to discuss with you in more detail; there are of course many others.

Apart from the invasion of other people's privacy, there is the very difficult problem of the disposal of old phones. You might like to try, as an exercise, to think of some economic and international benefits and downsides connected with the use of mobile phones. As mentioned at the beginning of this Option Topic, the revolution in modern telecommunication is making the world a much smaller place and mobile phones are part of that revolution.
Environmental
18 ELECTROMAGNETIC WAVES

G.1 (SL and HL) The nature of EM waves and light sources
G.2 (SL and HL) Optical instruments
G.3 (SL and HL) Two-source interference of waves
G.4 (SL and HL) Diffraction grating
G.5 (HL only) X-rays
G.6 (HL only) Thin-film interference
TOK The nature of light

In the 17th century there were two opposing views expressed regarding the nature of light by Sir Isaac Newton (1642-1727) and Christian Huygens (1629-1695). Newton was of the opinion that light consisted of particles and Huygens was of the opinion that light consisted of waves. They both believed, however, that a medium, the ether, was necessary for the propagation of light.

In Galileo's book Two New Sciences, published in 1638, he pointed out that the flash from an artillery gun was seen before the sound of the blast was heard. He concluded that the flash of light appeared instantaneously. However, he stated that we would not know whether it was instantaneous unless there was some accurate way to measure its speed. He suggested an experiment could be performed where a person with a lantern stood on one hill and an observer stood on another hill, a known distance away. By flashing the lantern on one hill, the observer on the distant hill could time how long it took the flash to reach him. This was highly unlikely to produce a result, and there is only anecdotal evidence to suggest that the experiment was actually performed.

In 1676, the Danish astronomer Ole Römer made the first recognised prediction of the speed of light. Through observation of Jupiter's moon Io, he noticed that the period between eclipses of Io (when Io disappears behind Jupiter) is about 42.5 hours. However, he observed that there was an irregularity in the times between successive eclipse periods as the Earth orbits the Sun. He reasoned that because of the Earth's motion about the Sun, it spends about six months of the year moving away from Jupiter, and the remaining time moving towards it. Therefore, if the eclipsing at the close distance and the far distance was compared, he estimated that the extra time for the light to reach Earth in the extreme was about 22 minutes. A short time later, the Dutch physicist Christian Huygens used Römer's value of 22 minutes and his estimate of the Earth's orbit to obtain a value equivalent in SI units to $2 \times 10^{8}$ m s⁻¹, about two-thirds the presently accepted value (we now know that the actual value is 16 minutes).

In 1849, the French physicist A.H.L. Fizeau made the first non-astronomical measurement of the speed of light. He realised that the use of a rotating toothed wheel would make it possible to measure very short time intervals. He placed a light source and a rotating toothed wheel on a hill in Paris and a reflecting mirror on another hill 8.63 km away, as shown in Figure 1801.

[Figure 1801: light source, lenses, observer and a reflecting plane mirror 8.63 km away]
CHAPTER 18 (OPTION G)
Using a system of lenses and a half-silvered mirror, he was able to focus a beam of light onto a gap in the toothed wheel placed on one hill. The beam then travelled to the other hill where it was reflected off a plane mirror and returned to the rotating wheel. The returning light then passed through the transparent part of the half-silvered mirror where it was observed. At low speeds of rotation of the toothed wheel, he found that little light was visible because the path of the reflected light was obstructed by the teeth of the rotating wheel. He then increased the rotation of the wheel until the reflected light was visible. He reasoned that this would occur when the rotation speed was such that the reflected light passed through the next gap between teeth. The wheel had 720 teeth and the light was bright when the wheel rotated at 25.2 revolutions per second.

In 1860, Jean Foucault improved Fizeau's method by replacing the toothed wheel with an eight-sided rotating mirror. He used the improved apparatus to measure the speed of light through air and water, and discovered that the speed of light in water was less than in air.

In the 1860s, James Clerk Maxwell (1831-1879) put Faraday's theory of electricity and magnetism into a mathematical form that could be generalised and applied to electric and magnetic fields in conductors, insulators and space free of matter. According to Maxwell's theory, a changing electric field in space produces a magnetic field, and a changing magnetic field in space produces an electric field. He predicted that by mutual induction, a sequence of time-varying and space-changing electric and magnetic fields would be propagated from an original source such as an oscillating charge. He proposed that these fluctuating, interlocked electric and magnetic fields would propagate through space in the form of an electromagnetic wave. He still believed that the ether was the medium for propagation. Following on from other earlier attempts to determine the speed of light, Maxwell proposed that the speed of electromagnetic waves was about 311 000 000 m s⁻¹.

In 1887 Heinrich Hertz (1857-1894) demonstrated experimental evidence for the existence of electromagnetic waves as proposed by Maxwell in 1865. Hertz's apparatus is shown schematically in Figure 1802.
[Figure 1802 Hertz's apparatus: transmitter, spark gap and detector loop]

The induction coil produced a large potential difference across the gap in the source loop, and sparks were produced. When the detector loop was brought near the source loop, sparks were also noticed jumping across the air gap in the detector loop. Hertz hypothesised that the sparks in the source loop set up changing electric and magnetic fields that propagated as an electromagnetic wave, as postulated by Maxwell. These waves then reached the air gap of the detector loop, setting up electric and magnetic fields there. These induced a spark in the air gap.

Between 1880 and 1930, the American physicist Albert A. Michelson made a series of precise measurements to determine the speed of light. Using a rotating octagonal steel prism, he constructed the apparatus shown in Figure 1803.

[Figure 1803 Michelson's method for determining the speed of light: light from a bright source S, the rotating prism and fixed telescope on Mt. Wilson, and a concave mirror $3.4 \times 10^{4}$ m away]

His apparatus consisted of the main set-up on the right of Figure 1803, placed on Mount Wilson in the USA, and a concave mirror to the left of the diagram placed on Mount San Antonio $3.4 \times 10^{4}$ m away. When the octagonal steel prism is stationary, the light from the source S follows the path SYABCDEDCFGX, and an image can be seen through the fixed telescope. When the prism is rotated slowly, the image disappears because the faces of the prism are not in a suitable position. However, if the prism is rotated so that it turns through one-eighth of a revolution in the time it takes light to travel from X to Y, then an image is seen in the fixed telescope. Michelson found that this occurred when the prism was rotated at 530 revolutions per second. Michelson then corrected for the fact that the light was travelling through air rather than a vacuum, and obtained a value that is today considered to be accurate to one part in a hundred thousand.
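Both the Fizeau and the Michelson measurements rest on the same idea: the light makes a round trip in the time the rotating element advances by one step (one tooth spacing, or one face of the prism). The sketch below applies that idea to the rounded figures quoted in the text. It is an illustration under stated assumptions: the function name is hypothetical, the round trip is taken as twice the quoted separation, and no correction for air is made.

```python
def speed_from_rotation(distance_m: float, steps_per_rev: int, rev_per_s: float) -> float:
    """Speed of light from a rotating shutter or mirror measurement."""
    step_time = 1.0 / (steps_per_rev * rev_per_s)  # time to advance by one step
    return 2.0 * distance_m / step_time            # round trip covered in that time

fizeau = speed_from_rotation(8.63e3, 720, 25.2)    # 720 teeth, 25.2 rev per second
michelson = speed_from_rotation(3.4e4, 8, 530)     # 8 faces, 530 rev per second
print(f"{fizeau:.3g} m/s, {michelson:.3g} m/s")
```

With these rounded inputs the estimates come out at about 3.13 × 10⁸ m s⁻¹ and 2.88 × 10⁸ m s⁻¹ respectively; the precision of the historical results depended on far more carefully measured distances and rotation rates.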
Another method to determine the speed of light involves the use of the electrical constants ε₀ and μ₀ rather than a direct measurement of the speed of light. According to Maxwell's electromagnetic theory, the speed of light is given by

$$c = \frac{1}{\sqrt{\varepsilon_0 \mu_0}}$$

where ε₀ is the permittivity of free space and is equal to $8.85 \times 10^{-12}$ C² N⁻¹ m⁻², and μ₀ is the permeability of free space and is equal to $4\pi \times 10^{-7}$ T m A⁻¹. When we substitute these values into the equation, the value obtained for the speed of light is 299 863 380.5 m s⁻¹.

With modern electronic devices being used, physicists are continuing to improve their methods of measurement. The most accurate measurements indicate that the speed of light in a vacuum is 299 792 458 m s⁻¹ with an uncertainty of 1 m s⁻¹. The standard unit of length, the metre, is now defined in terms of this speed. In 1960, the standard metre was defined as the length equal to 1 650 763.73 wavelengths of the orange-red line of the isotope krypton-86 undergoing electrical discharge. Since 1983, the metre has been defined as the length of path travelled by light in a vacuum during a time interval of 1/299 792 458 s.

The speed of all electromagnetic waves is the same as the speed of light. Further investigation showed that electromagnetic waves exhibit the properties of reflection, refraction, interference, diffraction and polarisation. Furthermore, they travel at the speed of light. Since these properties are also exhibited by light, Hertz's experiment had shown that light is a form of transverse electromagnetic wave. In 1905, Albert Einstein in his special theory of relativity proposed that the ether did not exist, and that the speed of light is constant in all frames of reference.
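The substitution into Maxwell's expression can be checked directly. This is a minimal sketch using the rounded permittivity quoted in the text (8.85 × 10⁻¹²) and the standard value 4π × 10⁻⁷ T m A⁻¹ for the permeability of free space; the function and constant names are illustrative.

```python
import math

EPSILON_0 = 8.85e-12          # permittivity of free space, C^2 N^-1 m^-2 (rounded)
MU_0 = 4 * math.pi * 1e-7     # permeability of free space, T m A^-1

def speed_of_light(epsilon_0: float, mu_0: float) -> float:
    """c = 1 / sqrt(epsilon_0 * mu_0)."""
    return 1.0 / math.sqrt(epsilon_0 * mu_0)

c = speed_of_light(EPSILON_0, MU_0)
print(f"{c:.4e} m/s")
```

The small difference from 299 792 458 m s⁻¹ comes from rounding the permittivity to three significant figures.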
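Example

The diameter of the Earth's orbit about the Sun is about $3.00 \times 10^{11}$ m. If light takes 16 minutes to travel this distance, what is the speed of light?

Solution

$$c = \frac{d}{t} = \frac{3.00 \times 10^{11}\ \text{m}}{16\ \text{min} \times 60\ \text{s min}^{-1}} = 3.1 \times 10^{8}\ \text{m s}^{-1}$$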
Example
Solution
Using,
Recall from Chapter 4 the wave terminology concerning wave speed, wavelength, frequency and period. Because speed equals the time rate of change of distance, in electromagnetic wave terms:
$$c = f\lambda$$

where c is the wave speed, f is the frequency and λ is the wavelength.
TOK
Students should know that an oscillating electric charge produces varying electric and magnetic fields. Students should know that electromagnetic waves are transverse waves and all have the same speed in a vacuum. Students could consider the possible health hazards associated with transmission lines.
IBO 2007
Figure 1804
All electromagnetic waves travel at the same speed in a vacuum. This speed is known as the free space velocity of light and is given the symbol c. It is a fundamental constant of nature and has a value of 2.998 × 10⁸ m s⁻¹ (≈ 3 × 10⁸ m s⁻¹). Electromagnetic waves can also, of course, propagate in matter. However, when they pass into a different medium such as glass, their speed changes and they are refracted. An electromagnetic wave also carries momentum and energy, and the energy aspect will be expanded on shortly.
ELECTROMAGNETIC WAVES
Although electromagnetic waves can be produced and detected in different ways, all waves in the electromagnetic spectrum behave as predicted by Maxwell's theory of electromagnetic waves.

| Wave type | Order of magnitude of the wavelength (m) |
|---|---|
| Radio | 10⁴ to 10⁻² |
| Microwave | 10⁻² to 10⁻⁴ |
| Infrared | 10⁻⁴ to 10⁻⁶ |
| Visible | 7 × 10⁻⁷ to 3 × 10⁻⁷ |
| Ultra-violet | 3 × 10⁻⁷ to 10⁻⁹ |
| X-rays | 10⁻⁹ to 10⁻¹¹ |
| Gamma rays | 10⁻¹¹ to 10⁻¹³ |

Wavelengths and wave types
[Chart of the electromagnetic spectrum. Scales: wavelength (metres) and frequency (hertz). Labelled regions include millimetre waves, infrared light, visible light (about 4.3 × 10¹⁴ Hz to 7.5 × 10¹⁴ Hz), ultraviolet light and X-rays.]
Electromagnetic radiation has particular properties in the different regions of the EM spectrum, and they are generated and detected in different manners. The regions will now be discussed in some detail.
Figure 1805
You should become familiar with the order of magnitude of the frequencies (and wavelengths) of the different regions. Figure 1806 indicates the orders of magnitude of the wavelengths for the regions of the spectrum. If you become familiar with these, you can determine the order of magnitude of the frequencies using the equation f = c / λ. Note that the range of gamma rays extends beyond the range of instrumentation that is presently available. Because the speed of electromagnetic waves in a vacuum is constant (≈ 3 × 10⁸ m s⁻¹), as the wavelength increases, the frequency decreases. Similarly, if the wavelength decreases, the frequency increases. Therefore, there is a range of wavelength values (and frequency values) that electromagnetic waves can have. The entire possible range is called the electromagnetic spectrum. A range of wavelengths from about 10⁸ m to 10⁻¹⁷ m, corresponding to a frequency range of about 1 Hz to 10²⁵ Hz, has been studied by scientists. Refer to Figure 1806 (a).
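The conversion from wavelength to frequency described above can be sketched in a few lines of Python. The function name and the choice of sample wavelengths are illustrative assumptions; the speed is the rounded value 3 × 10⁸ m s⁻¹ used in the text.

```python
C = 3.0e8  # approximate speed of electromagnetic waves in a vacuum, m s^-1


def frequency_from_wavelength(wavelength_m: float) -> float:
    """Return the frequency in Hz of a wave with the given wavelength in m (f = c / lambda)."""
    return C / wavelength_m


# Sample wavelengths in metres (illustrative picks taken from values in the text).
samples = {
    "longest wavelength studied": 1e8,
    "red end of visible light": 7e-7,
    "violet end of visible light": 4e-7,
    "shortest wavelength studied": 1e-17,
}

for label, wavelength in samples.items():
    print(f"{label}: {frequency_from_wavelength(wavelength):.1e} Hz")
```

Only the order of magnitude matters here: a longer wavelength always gives a lower frequency because the product fλ is fixed.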
Radio waves are generated by an electric circuit called an oscillator and are radiated from an aerial. A tuned oscillatory electric circuit that is part of a radio/television receiver detects the radio waves. As they do not penetrate solid materials, radio waves are relatively easily reflected off surfaces and this makes them ideal for communication technology. Refer to Figure 1806 (b).
Note that there are no sharp divisions or rapid changes of properties between the various regions; rather, they merge gradually into each other. The names used for each region are for convenience only, and the classification has developed according to the way in which the radiation is produced. For example, the ranges of gamma radiation and X-radiation overlap. X-rays can be produced with wavelengths similar to those of gamma radiation emitted by radioactive substances, but they are called X-rays because they are produced when electrons hit a metal target.
Radio waves
Radio waves have the longest wavelengths, and lie in the frequency range from 30 Hz to greater than 3000 MHz. They have specialised uses in radio communication including AM (amplitude modulated) and FM (frequency modulated) radio, television, CB radio, radio microphones and scanning devices in MRI (magnetic resonance imaging). Because of their many uses governments must regulate the bandwidth that can be used in communication devices in order to avoid congestion of the airwaves. The internationally agreed frequency bands used for carrier waves, and their uses are given in Figure 1806 (b).
| Band | Abbreviation | Frequency | Wavelength (m) | Use |
|---|---|---|---|---|
| Extremely high and super high | EHF, SHF | 300 to 3 GHz | < 0.1 | Space satellite link |
| Ultra-high | UHF | 3 to 0.3 GHz | 1 to 0.1 | Television |
| Very high | VHF | 300 to 30 MHz | 10 to 1 | FM radio |
| High | HF | 30 to 3 MHz | 100 to 10 (short wave) | AM radio |
| Medium | MF | 3 to 0.3 MHz | 1000 to 100 (medium wave) | AM radio |
| Low and very low | LF, VLF | 300 to 3 kHz | 100 000 to 1000 | Defence use |

Figure 1806 (b) Radio frequencies

Long and medium wavelength radio waves easily diffract around obstacles such as small mountains and buildings, and they can be reflected by the Earth's ionosphere. Therefore, there does not have to be a direct line of sight between the antenna and the receiver, and they can be broadcast over large distances provided the transmitter used is powerful. Television and FM broadcasting stations use wavelengths from 1 m to 10 m, which are not easily diffracted around objects. Therefore, coaxial cables or relay stations are necessary to transmit signals between points more than 80 km apart, even if there is a direct line free of obstacles between them.

Radar consists of short pulses of microwaves. They are used by police to detect the speed of vehicles, and to find distances to aeroplanes and ships. It is a microwave system that guides large aircraft into airports. Microwaves can interact with matter and this is the basis of the microwave oven. Water readily absorbs radiation (energy) with a wavelength of about 10 cm, and this absorbed energy increases the vibration of the water molecules, which produces thermal energy. The heat is therefore generated in the substance itself rather than conducted in from the outside, and this allows food to be cooked rapidly. At short distances from the source, microwave radiation can damage living tissue.
Infra-red radiation
In heated bodies, the outer electrons in atoms and molecules give off electromagnetic waves with wavelengths shorter than 10⁻⁴ m due to a change in the rotational and vibrational kinetic energy of these particles. Because the radiation given off has a wavelength slightly longer than the red end of the visible spectrum, it is called infrared radiation. Infrared radiation allows us to receive warmth from the Sun and other heat sources. In fact, most of the radiation emitted by any hot object is infrared. Infrared radiation can be detected by our skin, by thermometers, thermistors, photoconductive cells and special photographic film. Special photographic film is used to identify heat sources such as human beings trying to hide from the scene of a crime or soldiers moving in a war situation. It can also identify environmental problems. Because this radiation is scattered less than visible light by small particles in the atmosphere, it can be used in haze photography. It is also employed in the identification of the molecular structure of many organic compounds.
Waves in the UHF, SHF and EHF bands are not reflected off the ionosphere in the upper atmosphere but rather pass into space. For this reason, the SHF and EHF bands are used for outer space and satellite communications. These higher-frequency bands overlap with the microwave region of the EM spectrum.
Microwaves
Microwaves have many applications including mobile phones, satellite communication, radar (radio detection and ranging) and cooking. They are also used in the analysis of fine details concerning atomic and molecular structure. Microwaves are the main carriers of communication between repeater stations. They use line-of-sight technology, in which relay stations are placed in high positions about 50 km apart. Microwaves are produced by special electronic semiconductor devices called Gunn diodes, or by vacuum tube devices such as klystrons and magnetrons. They are detected by point contact diodes, thermistor bolometers and valve circuits.
Visible light
As already mentioned, the receptors in the human eye are sensitive to electromagnetic radiation with wavelengths between about 400 nm and 700 nm, and radiation in this region is referred to as visible light. Visible light is detected by stimulating nerve endings of the retina of the eye, or by photographic film and photocells. The eye is most sensitive to the green and yellow parts of the visible spectrum. Visible light can be generated by the rearrangement of outer orbital electrons in atoms and molecules. These excited electrons emit light and other electromagnetic radiation of a certain frequency when they lose energy, as happens in gas discharge tubes.
Visible light can cause photochemical reactions in which radiant energy is converted into chemical energy.

hydrogen and oxygen, but are less able to penetrate dense material such as bone, which contains the heavier element calcium. Because X-rays are detected by photography, a photographic plate placed beneath the body can be used to identify possible bone fractures. X-rays are ideal for identifying flaws in metals. They are also used in CAT scans (computerised axial tomography). Because tissues absorb X-rays differently, when a body is scanned, the internal organs and tissues can be identified from the analysis of the images produced by the computer.

X-radiation can ionise gases and cause fluorescence. Because X-rays produce interference patterns when they interact with crystals in rocks and salts, the regular arrangement of atoms and molecules in these crystals can be determined by this process of X-ray diffraction.

X-rays can damage living cells, and continued use of and exposure to X-rays is discouraged. Radiologists who work in X-ray departments always stand behind lead-lined walls when an X-ray is being taken. On the other hand, some types of diseased cells are damaged more easily than healthy cells. Therefore, if X-rays are carefully controlled, they can be used to destroy cancerous cells, as is the case with the use of certain lasers in radiation therapy.
Ultra-Violet radiation
Ultra-violet radiation consists of EM waves with wavelengths between about 10⁻⁷ m and 10⁻⁹ m. It is generated by the orbital electrons of atoms in the Sun, and in sources such as high-voltage discharge tubes and mercury vapour lamps. Like visible light, UV radiation can cause photochemical reactions in which radiant energy is converted into chemical energy, as in the production of ozone in the atmosphere and the production of the dark pigment (melanin) that causes tanning in the skin. It also helps to produce vitamin D in our skin. However, too much UV radiation can cause melanoma cancers. UV radiation has the ability to ionise atoms and this is the reason why ozone is produced in the atmosphere. This ozone is capable of killing bacteria and therefore it can be put to good use in the sterilisation of many objects. The atoms of many elements emit UV radiation that is characteristic of those elements, and this quality allows many unknown substances to be identified. UV radiation can be detected by photography and the photoelectric effect. Furthermore, certain crystals fluoresce when they absorb UV radiation, and this is put to use in washing powders to make the whites look whiter.
Gamma radiation
The gamma radiation region of the EM spectrum overlaps with the X-ray region, and the use of gamma rays in cancer therapy overlaps with the use of X-rays in radiotherapy. These high frequency rays are highly penetrating and are produced by natural and artificial radioactive materials. As such, gamma rays will pass through metres of air and need large thicknesses of concrete or lead to absorb them in order to protect humans from danger. Gamma radiation can be detected by an ionisation chamber as found in a Geiger-Müller counter.
X-Radiation
X-radiation consists of EM waves with wavelengths between about 10⁻⁸ m and 10⁻¹⁸ m. It can be generated by the rapid deceleration (stopping) or deflection of fast-moving electrons when they strike a metal target or other hard objects. It can also be generated by the sudden change in energy of innermost orbital electrons in atoms. The maximum frequency produced is determined by the energy with which the fast-moving electrons from the source strike a target. This energy is in turn determined by the accelerating voltage of an X-ray machine. X-rays can penetrate different materials to different degrees. Hard X-rays (near the gamma end) are more penetrating than soft X-rays (near the ultra-violet end). Apart from the penetration increasing with increased frequency of the X-rays, the penetrating power also depends on the nature of the material being penetrated. The higher the atomic mass of a material, the less is the penetration. For example, X-rays can easily penetrate skin and flesh containing mainly the lighter atoms carbon,
the spectrum are combined together and they appear as an off-white colour.
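[Figure 1808: white light is dispersed by prism 1 into the colours red (R), orange (O), yellow (Y), green (G), blue (B), indigo (I) and violet (V), and recombined into white light by prism 2.]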
When a narrow beam of white light undergoes refraction on entering a prism, the light spreads out into a spectrum of colours. The colours range from red at one side of the band, through orange, yellow, green, blue, indigo (light purple), to violet at the other side of the band. We say that the spectrum of white light is continuous because the colour bands gradually change from one colour to the next without there being any gaps. The separation of the white light into its component colours is due to dispersion as shown in Figure 1807. For clarity their angular separation is exaggerated.
[Diagram: white light passes through a prism and is dispersed onto a screen as red, orange, yellow, green, blue, indigo and violet.]
Figure 1807
This form of dispersion was first explained by Sir Isaac Newton. When he isolated a particular spectrum colour produced when white light was passed through a prism, and then passed it through a second prism, he found there was no further colour change. He concluded that the colours produced had not been introduced by the prism, but rather were components of the white light. The red and violet light, being components, are incident on the prism at the same angle. However, upon entering the prism the violet ray is refracted through a greater angle than the red ray. As well as separating the spectrum into its different colours, Newton also showed that the colours could be recombined or synthesised to produce white light. Two methods to show this synthesis are shown in Figure 1808. In the first method two prisms are aligned as shown to produce white light. In the second method, a disc containing coloured sectors of the spectrum is rotated rapidly. The colours of
Radiation can refer to the transmission of electromagnetic energy through space. All regions of the electromagnetic spectrum can transmit radiation to the surroundings. For example, all hot bodies transfer thermal energy, mainly in the infrared region, between two substances which are not in contact with each other. All the incoming radiation I from one object can be absorbed (A), transmitted (T) or scattered (S) (reflected) by another object as follows:

I = T + A + S

Because electromagnetic waves are produced as a result of the movement of electric charges, they carry energy in the form of electromagnetic radiation. Electromagnetic radiation, whatever its frequency (or wavelength), can only be emitted if energy is supplied to the source of radiation, that is, the charge that is undergoing acceleration. The energy radiated can be absorbed and detected in various ways, such as the heating of food in a microwave oven. Some objects can then re-radiate the absorbed energy and transmit that energy to another place. This was discussed in Chapter 8 when the Greenhouse Effect was considered.

If the radiation is not absorbed or scattered by an object, it will continue on its transmission path at the speed of light. In terms of this chapter, radiation scattering is the deflection of EM radiation from its original path due to its collisions with particles in a medium. The scattered radiation after a single collision with a single molecule could be in many directions with different frequencies. It may cause a change in polarisation or it may interact at the atomic or molecular level.
Quantum physics began in 1900 with the proposals of Max Planck. He found that the radiation absorbed or emitted by a black body could not be explained using classical physics. Planck proposed that electromagnetic radiation could not be emitted or absorbed continuously, as proposed by classical physics, but rather that it is emitted or absorbed in little bursts or packets of energy. We now call these photons and we say that radiation is quantised. This statement is given mathematically as:

E = hf

where E is the energy of the photon in J (often converted to eV), f is the frequency of the radiation and h is a constant called Planck's constant. It has a value of 6.6 × 10⁻³⁴ J s.
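A photon of blue light has a wavelength of 450 nm. Calculate the
(a) photon's frequency
(b) photon's energy

Solution

(a) $$c = f\lambda \;\Rightarrow\; f = \frac{c}{\lambda} = \frac{3 \times 10^{8}\ \text{m s}^{-1}}{450 \times 10^{-9}\ \text{m}} = 6.7 \times 10^{14}\ \text{Hz}$$

(b) $$E = hf = (6.6 \times 10^{-34}\ \text{J s}) \times (6.7 \times 10^{14}\ \text{Hz}) = 4.4 \times 10^{-19}\ \text{J}$$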
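The two steps of this worked example can be reproduced with a short Python sketch. The function names are illustrative, the constants are the rounded values used in the text, and the elementary charge used for the conversion to electronvolts is an added assumption (1.6 × 10⁻¹⁹ C).

```python
H = 6.6e-34                   # Planck's constant as quoted in the text, J s
C = 3.0e8                     # speed of light (rounded), m s^-1
ELEMENTARY_CHARGE = 1.6e-19   # assumed value for converting J to eV, C


def photon_frequency(wavelength_m: float) -> float:
    """Return the photon frequency in Hz from its wavelength in m (f = c / lambda)."""
    return C / wavelength_m


def photon_energy(frequency_hz: float) -> float:
    """Return the photon energy in J from its frequency in Hz (E = h f)."""
    return H * frequency_hz


wavelength = 450e-9  # blue light, m
frequency = photon_frequency(wavelength)
energy_joule = photon_energy(frequency)
energy_ev = energy_joule / ELEMENTARY_CHARGE

print(f"f = {frequency:.2e} Hz")
print(f"E = {energy_joule:.2e} J = {energy_ev:.2f} eV")
```

The same two functions can be reused for any wavelength, for example the 700 nm red photon in the exercises.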
When electromagnetic radiation in the visible region is transmitted from the Sun and enters the Earth's atmosphere, most of the short-wavelength radiation in the indigo-violet region is absorbed in the higher atmosphere. However, the continuous spectrum produced by a spectroscope on the Earth's surface still has large amounts of indigo and violet. So why doesn't the sky appear violet? The answer lies in the way in which the human eye is more sensitive to some colours than to others. Our eye contains three types of colour receptors, called cones, in the retina: red, blue and green cones that respond most strongly to colours within these wavelength ranges.
A further factor known as Rayleigh scattering needs to be examined. Lord Rayleigh found that, for small particles, the amount of light scattered is inversely proportional to the fourth power of the wavelength.

$$\text{Amount of scattering} = k \times \frac{1}{\lambda^{4}}$$

Let us compare the scattering of red light with a wavelength of 660 nm with that of blue light with a wavelength of 470 nm.

$$\text{Amount of scattering of red} = \frac{k \times 1}{(660 \times 10^{-9})^{4}} = \frac{k \times 1}{1.9 \times 10^{-25}} = k \times 5.3 \times 10^{24}$$

$$\text{Amount of scattering of blue} = \frac{k \times 1}{(470 \times 10^{-9})^{4}} = \frac{k \times 1}{4.9 \times 10^{-26}} = k \times 2.0 \times 10^{25}$$
G.1.7 Explain the terms monochromatic and coherent. G.1.8 Identify laser light as a source of coherent light. G.1.9 Outline the mechanism for the production of laser light. G.1.10 Outline an application of the use of a laser.
IBO 2007
The ratio of blue scattering to red scattering is

$$\frac{k \times 2.0 \times 10^{25}}{k \times 5.3 \times 10^{24}} \approx 3.9$$

when the unrounded values are used, so the blue light is scattered almost four times as much as the red light. A clear, cloudless day-time sky therefore appears blue because molecules in the air scatter blue light from the Sun more than they scatter red, orange and yellow light. When we look towards the Sun at sunset, we see red and orange colours because the blue light has been scattered out and away from the line of sight.

It is a misconception to think that light is scattered by particulate matter in the atmosphere. The scattering is mainly due to the transmitted radiation being scattered off nitrogen and oxygen molecules. Remember from Chapter 8 that it is the molecular dipole moments of the greenhouse gases that absorb radiation in the low infrared region. Therefore, it is believed that water vapour is not responsible for the scattering effect.

When the Sun rises or sets, the Sun's rays have to travel through more atmosphere and most of the blue light has been scattered. The red cone receptors are more sensitive to red light and we see what appears to be red, orange and yellow light.
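The comparison of blue and red scattering can be checked with a minimal Python sketch. The function name is an illustrative assumption, and the constant k is set to 1 because it cancels when the ratio is taken.

```python
def relative_scattering(wavelength_m: float, k: float = 1.0) -> float:
    """Return the Rayleigh scattering amount k / lambda^4 (arbitrary units)."""
    return k / wavelength_m ** 4


RED = 660e-9    # wavelength of red light used in the text, m
BLUE = 470e-9   # wavelength of blue light used in the text, m

ratio = relative_scattering(BLUE) / relative_scattering(RED)
print(f"Blue light is scattered about {ratio:.1f} times as much as red light")
```

Because only the ratio of the wavelengths matters, the same result follows directly from (660 / 470)⁴.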
wave. The intensity is in fact increased due to the emitted waves being coherent. To gather information about the energy levels of an atom or a molecule, scientists investigate the radiation absorbed or emitted when the atom or molecule (more specifically, its electrons) undergoes a transition. Recall the continuous, absorption and emission spectra that were introduced in earlier chapters. In Figure 1810, an incident photon is absorbed and the system makes a transition to an excited state. Later, the system makes a transition to the ground state (or a lower state) with the emission of a photon.
Figure 1811 Stimulated emission

In spontaneous emission, the phase of the light from one atom is unrelated to that from another atom, and the resultant radiation is incoherent. With stimulated emission, the phase of radiation emitted by an atom is the same as the phase of every other atom, and the resultant radiation is coherent. Stimulated emission has important applications in modern optics.

Masers and lasers
MASER: Microwave Amplification by Stimulated Emission of Radiation
LASER: Light Amplification by Stimulated Emission of Radiation
Both apply the same principles, and only differ in the frequency range in which they operate. A ruby laser consists of a transparent crystal rod of aluminium oxide containing 0.05% chromium with a flash tube wrapped around it as shown in Figure 1812. Both ends of the crystal rod are silvered, one end such that it is totally reflecting and the other partially silvered so that some of the light can be transmitted as a beam.
Figure 1810 Spontaneous emission
This is a two-step process - the first step is called resonance absorption, and the second step is called spontaneous emission. The incident photon is not the same photon as the emitted photon, and the photons are not correlated. However, if an atom or molecule is initially in an excited state, and the energy of the incident photon is equal to the difference in the energy between the excited state and the ground state of the atom or molecule (E2 - E1), then a photon is emitted in the same direction, and in phase, as the original photon. Therefore, the resulting radiation is coherent and correlated. This process is called stimulated emission. See Figure 1811.
Figure 1812
Figure 1813 shows the energy levels of the chromium atom. In the ruby laser, light of energy equivalent to 2.25 eV is absorbed from the flash tube, and this raises the electrons of chromium from the ground state E1 to an excited state E3. These electrons quickly undergo spontaneous emission and fall to level E2 known as the metastable energy state.
If the incident radiation from the flash tube is intense enough, more electrons are transferred to the E2 energy level than remain in the ground state, a condition known as population inversion. The metastable state is an unusual excited state of chromium and other atoms and molecules because the electrons can remain in this state for a longer period before they decay to the ground state.
Once this is achieved, the neon atoms have a metastable state at 18.70 eV, and population inversion allows stimulated emission to occur.
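[Energy-level diagram for chromium: absorption of 2.25 eV raises electrons from E1 to E3; spontaneous emission takes them to the metastable level E2; stimulated emission of 1.79 eV returns them to E1.]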
Figure 1813
When some electrons in E2 decay to the ground state by stimulated emission, they emit photons of energy equivalent to 1.79 eV and wavelength of 694.3 nm.

Population inversion is achieved slightly differently in the helium-neon laser. The ruby laser requires three energy states and the helium-neon laser requires four energy states. See Figure 1814. A mixture of about 85% helium gas and 15% neon gas is enclosed in a discharge tube. The helium electrons are excited to a state equivalent to 20.61 eV. The neon atoms require 20.66 eV to excite the electrons to their required excited state. This is 0.05 eV above the energy of the excited helium atoms. In order to achieve the required 0.05 eV for excitation, the neon atoms collide with the excited helium atoms and their kinetic energies supply the extra 0.05 eV needed.
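The link between a transition energy in electronvolts and the wavelength of the emitted photon follows from E = hf and c = fλ, so that λ = hc / E. The Python sketch below applies this to the ruby laser transition and to the neon transition between the 20.66 eV and 18.70 eV levels. The function name is illustrative, and the constants are standard values to four significant figures, which are assumptions not quoted in the text.

```python
H = 6.626e-34                  # Planck's constant, J s (assumed standard value)
C = 2.998e8                    # speed of light, m s^-1
ELEMENTARY_CHARGE = 1.602e-19  # charge used to convert eV to J, C (assumed standard value)


def wavelength_nm(energy_ev: float) -> float:
    """Return the photon wavelength in nm for a transition energy in eV (lambda = h c / E)."""
    energy_joule = energy_ev * ELEMENTARY_CHARGE
    return H * C / energy_joule * 1e9


ruby_nm = wavelength_nm(1.79)                  # ruby laser, E2 to E1
helium_neon_nm = wavelength_nm(20.66 - 18.70)  # neon upper level to metastable level

print(f"Ruby laser: about {ruby_nm:.0f} nm")
print(f"Helium-neon laser: about {helium_neon_nm:.0f} nm")
```

The ruby result is within about 2 nm of the quoted 694.3 nm; the small difference arises because 1.79 eV is rounded to three significant figures.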
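[Energy-level diagram for the helium-neon laser: helium is raised by absorption to its excited state EH; collisions transfer this energy to neon, exciting it to the 20.66 eV level; stimulated emission of 1.96 eV takes neon to the 18.70 eV level, followed by spontaneous emission towards the ground state.]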
Figure 1814
Of special interest is the laser used to read the data on Compact Discs (CDs), Compact Disc Read-Only Memory (CD-ROMs) and Digital Video (or Versatile) Discs (DVDs). As a laser beam scans the disc, the light is either reflected from the metal surface or scattered by a pit or hole in the disc. Optical sensors convert the information back into digital data. This data can then be processed to produce audio, text and pictures.

A compact disc consists of a hard plastic disc, 120 mm in diameter, that is coated with a highly reflective metal (usually aluminium) with a coating of plastic to protect the metal surface. To burn a master CD, a recording laser is focussed on the disc to produce a series of up to 3 billion pits on one side only. The pits, or the lack of them, represent the digital data. This series of pits can form a spiral up to 5 km long and the CD can store up to 650 MB (megabytes) of data. The master disc is copied, and the copies are read by the scanning laser in CD players. The main use of CDs is to store audio. CD-ROMs have become popular recently. These store data in the form of text, still and moving pictures and audio. The newest optical disc storage is the DVD. It has a bigger capacity and it can run faster. It can store up to 17 GB (gigabytes) of data on one or on both sides. DVDs need a different type of player, and these have proliferated in recent years.

Scientists use lasers to produce a superheated gas that we call plasma. It is hoped that research in this area may one day make nuclear fusion reactors commercially viable.

Viewing a laser beam directly may do permanent damage to the retina. Lasers with a power between 3 and 4 mW are considered a probable hazard. Helium-neon lasers used for education have a power output of a fraction of a milliwatt and should still be treated with respect.
Exercise 18.1

1. Describe how electromagnetic waves are produced.
2. List five characteristics of electromagnetic waves.
3. A photon of red light has a wavelength of 700 nm. Calculate the:
   (a) photon's frequency
   (b) photon's energy.
4. The wavelength of the electromagnetic waves detected in the air loop in the Hertz experiment was about 1 m. In which region of the EM spectrum are they found?
5. Describe how long and medium radio waves pass around obstacles such as buildings and mountains.
6. Draw a table showing the order of magnitude of the frequencies for the different regions of the electromagnetic spectrum.
7. Describe how the frequency of X-rays is related to their penetration of matter.
8. Identify a possible source of the radiation in each region of the EM spectrum. Give one way of detecting each source.
9. Describe how the SI unit, the metre, is defined in terms of the speed of light.
10. (a) With the aid of a diagram, describe the dispersion of white light by a prism.
    (b) Explain how a factor of the prism causes the dispersion to occur.
11. What does the term LASER stand for? In simple terms, how is laser radiation produced? Give one application of the use of lasers in each of technology, industry and medicine.
CENTRE OF CURVATURE C
the centre of the sphere of which the lens is made.
Introduction
A lens is a transparent object with at least one curved surface but more commonly two curved faces. The amount of refraction is determined by the refractive index. Most lenses are made of glass but perspex (lucite) and quartz lenses are common. They are used to correct defects of vision using spectacles and in optical instruments such as cameras, microscopes and refracting telescopes. The curved surfaces of lenses may be spherical, parabolic or cylindrical but we will restrict our discussion to thin lenses with spherical surfaces. Lenses are either convex (converging) or concave (diverging) as shown by the ray diagrams that locate the focus in Figure 1815.
RADIUS OF CURVATURE R
the radius of the sphere from which the lens is made.
POLE P
central point of the refracting surface.
PRINCIPAL AXIS
line that passes through the centre of curvature and the centre of the refracting surface.
PRINCIPAL FOCUS F
point through which rays parallel and close to the principal axis pass after refraction if the lens is convex, or appear to come from if the lens is concave.
FOCAL LENGTH f
the distance between the principal focus and the centre of the refracting surface. From the definition it can be seen that

$$f = \frac{1}{2}R$$

[Ray diagrams locating the principal focus of a converging lens and of a diverging lens, each drawn with its lens axis.]
Figure 1815
APERTURE
the length of the refracting surface on which the incident rays can be refracted.
FOCAL PLANE
the plane that passes through the principal focus and is perpendicular to the principal axis.
Figure 1816 Some lenses
1. A ray passes through the optical centre without any deviation.
2. A ray parallel to the principal axis refracts so that it passes through the principal focus if the lens is convex, or appears to diverge from the principal focus if the lens is concave.
3. A ray passing through, or appearing to come from, the principal focus refracts so that it travels parallel to the principal axis.
The unit of lens power is the dioptre D, which is equivalent to m⁻¹. So if a lens has a focal length of 40 cm, the power P = 1/f = 1/(0.40 m) = 2.5 D. The power of a converging lens is positive and the power of a diverging lens is negative, as we will find out later in this section.
[Ray diagrams showing an object O, its image O', the principal foci F and the principal axis.]
The linear or lateral magnification m of a lens is given by the ratio of the height of an image to the height of its object, or by the negative of the ratio of the image distance to the object distance:

$$m = -\frac{d_i}{d_o} = \frac{h_i}{h_o}$$
Figure 1817
A negative magnification when both do and di are positive indicates that the image is inverted.
Note that a concave lens gives only one type of image, so only one ray diagram need be drawn. A concave lens gives an erect, diminished virtual image. A virtual image is an image that appears to come from a single point when rays are extrapolated to that point as shown by the dashed lines in the first and last case of Figure 1818. A real image is an image that can be seen on a screen that has been put at the point where the rays intersect at a single point.
G.2.4 Construct ray diagrams to locate the image formed by a convex lens. G.2.5 Distinguish between a real image and a virtual image.
IBO 2007
G.2.6 Apply the convention real is positive, virtual is negative to the thin lens formula. G.2.7 Solve problems for a single convex lens using the thin lens formula.
IBO 2007
where di and do are the image and object distances respectively and hi and ho the image and object heights respectively. Linear magnification has no units.
Figure 1817 shows the ray diagrams for an object placed at different positions along the principal axis, and gives examples of the uses of the lens for each situation.
[Table of ray diagrams for a convex lens; the diagrams are not reproduced. Columns: position of object, diagram, properties of image. Object positions shown: 1. between F and L; 2. at F (parallel rays emerge); 3. between F and 2F; 4. at 2F; 5. beyond 2F; 6. at infinity (parallel rays arrive). Image properties that survive: position 1: a. virtual, b. erect, c. same side as object (but further away), d. magnified; position 3: a. real, d. magnified; position 4: a. real.]

Figure 1818

The lens equation can be derived using geometry and algebra. Consider the ray diagram in Figure 1819.

Figure 1819 A ray diagram

AO is the height of the object and IE is the height of the image. From the ray that passes undeviated through the optical centre L, the heights are in the same ratio as the object and image distances:

$$\frac{AO}{IE} = \frac{d_o}{d_i} \qquad (1)$$

From the ray that travels parallel to the principal axis, meets the lens at C and refracts through F,

$$\frac{CL}{IE} = \frac{LF}{FI}$$

Because CL = AO,

$$\frac{AO}{IE} = \frac{LF}{FI}$$

but LF = f and FI = LI − LF = d_i − f, so

$$\frac{AO}{IE} = \frac{f}{d_i - f} \qquad (2)$$

From equations (1) and (2), we get

$$\frac{d_o}{d_i} = \frac{f}{d_i - f} \quad \text{and} \quad d_i f = d_o (d_i - f)$$

Divide both sides by d_o d_i f:

$$\frac{d_i f}{d_o d_i f} = \frac{d_o (d_i - f)}{d_o d_i f} \;\Rightarrow\; \frac{1}{d_o} = \frac{1}{f} - \frac{1}{d_i}$$

So that,

$$\frac{1}{d_o} + \frac{1}{d_i} = \frac{1}{f}$$

This equation is quite often written in textbooks as 1/f = 1/u + 1/v.

Many optical devices contain more than one thin lens. The final image produced by a system of lenses is found by first finding the image distance for the first lens, and then using this value, together with the distance between the lenses, to find the object distance for the second lens.
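The two-lens procedure described above can be sketched in a few lines of Python. This is only an illustration under the "real is positive" convention; the function names `image_distance` and `two_lens_image` are hypothetical, and the numbers are those of the two-converging-lens example in this section (focal lengths 10 cm, lenses 15 cm apart, object 15 cm from the first lens).

```python
def image_distance(f, d_o):
    """Thin lens formula 1/d_o + 1/d_i = 1/f solved for d_i (same units as inputs)."""
    if d_o == f:
        return float("inf")  # object at the principal focus: image at infinity
    return 1.0 / (1.0 / f - 1.0 / d_o)


def two_lens_image(f1, f2, separation, d_o1):
    """Return (d_i1, d_o2, d_i2) for two thin lenses on a common axis."""
    d_i1 = image_distance(f1, d_o1)
    # The image of lens 1 is the object of lens 2. A negative d_o2 means the
    # first image would form beyond lens 2, so it acts as a virtual object.
    d_o2 = separation - d_i1
    d_i2 = image_distance(f2, d_o2)
    return d_i1, d_o2, d_i2


# Assumed illustrative call, all lengths in cm
d_i1, d_o2, d_i2 = two_lens_image(f1=10.0, f2=10.0, separation=15.0, d_o1=15.0)
print(round(d_i1, 2), round(d_o2, 2), round(d_i2, 2))
```

A positive final value of `d_i2` places the image on the transmission side of the second lens; a negative value would indicate a virtual image.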
Example

A small object is 15.0 cm from a concave lens with a focal length of 10.0 cm. Locate the image and determine its magnification.

Solution

Using the formula 1/d_o + 1/d_i = 1/f, with f = −10.0 cm because the lens is diverging, we have that

$$\frac{1}{d_i} = \frac{1}{f} - \frac{1}{d_o} = \frac{1}{-10.0} - \frac{1}{15.0} = -\frac{1}{6.0}\ \text{cm}^{-1} \;\Rightarrow\; d_i = -6.0\ \text{cm}$$

So that the image is a virtual image located 6.0 cm in front of the lens.

The magnification is given by

$$m = -\frac{d_i}{d_o} = -\frac{(-6.0\ \text{cm})}{15.0\ \text{cm}} = +0.40$$

The virtual image is erect and has a magnification of 0.40.

Example

Two converging lenses each of focal length 10 cm are 15 cm apart. Find the final image of an object that is 15 cm from one of the lenses.

Solution

This problem can be solved either graphically or algebraically. In any case, a diagram as in the figure below can help to roughly see the position of the final image.

[Figure: the two lenses with principal foci F1, F1', F2 and F2'; distances of 15 cm, 15 cm and 30 cm are marked.]

For the first lens, 1/d_i = 1/10 − 1/15 = 1/30 cm⁻¹, so the first image would form 30 cm from the first lens. This is 15 cm beyond the second lens, so it acts as a virtual object for the second lens with d_o = −15 cm.

From the formula 1/d_o + 1/d_i = 1/f, we have,

$$\frac{1}{-15} + \frac{1}{d_i} = \frac{1}{10} \;\Rightarrow\; d_i = 6\ \text{cm}$$

The image is 6 cm on the transmission side of the second lens.

Example

An object 1.2 cm high is placed 6.0 cm from a double convex lens with a focal length of 12.0 cm. Locate the image and determine its magnification.
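The single-lens examples above can be checked numerically. The following is a minimal sketch, assuming the "real is positive" convention and the magnification m = −d_i/d_o; the helper names `thin_lens` and `describe` are hypothetical, and the helper does not handle an object placed exactly at the principal focus.

```python
def thin_lens(f, d_o, h_o=1.0):
    """Return image distance, magnification and image height for a thin lens.

    Sign convention (real is positive): f > 0 for a convex lens, f < 0 for a
    concave lens; d_i > 0 for a real image, d_i < 0 for a virtual image.
    Not valid for d_o == f, where the image is at infinity.
    """
    d_i = 1.0 / (1.0 / f - 1.0 / d_o)   # from 1/d_o + 1/d_i = 1/f
    m = -d_i / d_o                       # linear magnification
    return d_i, m, m * h_o


def describe(d_i, m):
    """Nature of the image in words (the special case |m| = 1 is ignored)."""
    nature = "real" if d_i > 0 else "virtual"
    orientation = "erect" if m > 0 else "inverted"
    size = "magnified" if abs(m) > 1 else "diminished"
    return f"{nature}, {orientation}, {size}"


# Concave lens example: f = -10.0 cm, object at 15.0 cm
d_i, m, _ = thin_lens(-10.0, 15.0)
print(round(d_i, 2), round(m, 2), describe(d_i, m))

# Convex lens example: f = 12.0 cm, object 1.2 cm high at 6.0 cm
d_i2, m2, h_i2 = thin_lens(12.0, 6.0, h_o=1.2)
print(round(d_i2, 2), round(m2, 2), round(h_i2, 2), describe(d_i2, m2))
```

For the last example the sketch places the image on the same side of the lens as the object, which is what the ray diagram for an object between F and L leads us to expect.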
Linear magnification was previously defined as the ratio of the height of an image to the height of its object, or the ratio of the image distance to the object distance. In some circumstances this way of describing magnification can be misleading. Sometimes the image is bigger than the object but appears smaller because it is further away (it makes a smaller angle at the eye than does the object). In this case, linear magnification does not give a measure of the apparent size of the image. Therefore, we need a more useful term for these occasions, and the concept of angular magnification is used.

The size of any image formed on the retina of the eye depends on the angle subtended by the object at the eye. The closer the object is to the eye, the greater will be the angle and thus the angular magnification. However, if an object is too close to the eye then there is difficulty focusing the image. The range over which an eye can sharply focus an image is determined by what are known as the near point and far point of the eye. The near point is the position of the closest object that can be brought into focus by the unaided eye. The near point varies from person to person but it has been given an arbitrary value of 25 cm. The far point is the position of the furthest object that can be brought into focus by the unaided eye. The far point of a normal eye is at infinity.

The ability of the eye to focus over this range is called accommodation, and this is controlled by the ciliary muscles pulling or relaxing in order to change the focal length of the flexible eye lens. The eye is most relaxed, and so most comfortable for prolonged viewing, when viewing at the far point. The apparent size of an object can be increased by using a converging lens to allow the object to be brought closer to the eye, thus increasing the size of the image on the retina. This is the basis behind the simple magnifier.
Exercises 18.2 (a)

1. An object 1.0 cm high is placed 5.0 cm from a double convex lens with a focal length of 12.0 cm. Locate the image and describe its nature and its magnification.

2. Using geometric construction rules, draw the following ray diagrams to show the position and properties of the image formed when:
(a) An object is placed outside the principal focus using a convex lens
(b) An object is placed outside the principal focus using a convex lens

3. A small object is 16.0 cm from a concave lens with a focal length of 12.0 cm. Locate the image and determine its magnification.

4. An object 1.2 cm high is placed 6.0 cm from a double convex lens with a focal length of 12.0 cm. Locate the image and determine its magnification.
Figure 1821 The simple magnifier

Figure 1822

In case (b), a converging lens of focal length f is placed in front of the eye. The lens allows the eye to be closer than the near point to the object. Therefore, the focal length of the double convex lens can be less than the near point distance. This results in a greater angular magnification at the retina. When the object is at the focal point of the lens, parallel rays emerge from the lens and enter the eye as if they came from an object at infinity. This is the most comfortable state for prolonged viewing. The ratio θ/θ₀ is called the angular magnification M or magnifying power of the lens:

$$M = \frac{\theta}{\theta_0}$$

where θ = the angle subtended at the eye by the image and θ₀ = the angle subtended at the unaided eye by the object when it is at the near point. The difference between the angular magnification and the linear magnification is that the angular magnification is the ratio of the apparent sizes of the image and the object, whereas the linear magnification is the ratio of their real sizes.

So, in the case of a magnifying glass, M (angular magnification) = m (linear magnification). This means that

$$M = -\frac{d_i}{d_o}$$

If we multiply 1/d_o + 1/d_i = 1/f by d_i we get

$$\frac{d_i}{d_o} + 1 = \frac{d_i}{f}$$

Therefore,

$$M = 1 - \frac{d_i}{f}$$

So if the focal length is 3.0 cm and the image is at the near point of the eye, then the image distance will be negative (d_i = −25 cm). So:

$$M = \frac{25}{3.0} + 1 = 8.33 + 1 = 9.3$$

Therefore, the angular magnification or the magnifying power of the lens is 9.3. So it should be clear that a lens of small focal length is required, and this is why a simple magnifying glass has smooth, curved faces. However, there is an upper limit to the angular magnification since images become distorted if the curvature of the faces becomes too great. Simple magnifiers are used as the eyepiece in compound microscopes and in both reflecting and refracting telescopes to view the image formed by another lens system.
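The magnifying-power result can be tried for other focal lengths with a short Python sketch. It assumes the 25 cm near point used in the text; the function name `magnifier_power` and the `image_at` argument are hypothetical, and the "infinity" branch uses the standard relaxed-eye result M = 25/f, which is not derived in this section.

```python
NEAR_POINT_CM = 25.0  # conventional near-point distance used in the text


def magnifier_power(f_cm, image_at="near point"):
    """Angular magnification of a simple magnifier of focal length f_cm."""
    if image_at == "near point":
        d_i = -NEAR_POINT_CM          # virtual image, so d_i is negative
        return 1.0 - d_i / f_cm       # M = 1 - d_i/f = 1 + 25/f
    if image_at == "infinity":
        # Standard result for a relaxed eye (assumption, not derived above).
        return NEAR_POINT_CM / f_cm
    raise ValueError("image_at must be 'near point' or 'infinity'")


print(round(magnifier_power(3.0), 1))
print(round(magnifier_power(3.0, image_at="infinity"), 1))
```

The comparison shows why the image is usually placed at the near point when the largest magnifying power is wanted, at the cost of less comfortable viewing.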
Example

A double convex lens has a focal length of 8.0 cm. An object is placed 5.0 cm from the lens.

(a) Draw a ray diagram to show the position of the image and describe the nature of the image.
(b) Determine the magnifying power of the lens if the image is at the near point of the eye.

Solution

(a) The following figure shows the correct construction. The image is virtual, erect and magnified.

[Ray diagram labels: image, L, F, obj]

(b)

$$M = \frac{25}{8.0} + 1 = 3.125 + 1 = 4.125$$

The magnifying power of the lens is 4.1.

Example

(a) Draw a ray diagram for an object that is inside the focal length of a double convex lens. For this lens, the focal length is 8.0 cm and the distance of the object from the lens forms an image at the near point of the eye.
(b) If the distance of the lens from the eye is 5.0 cm, calculate the distance of the object from the lens.

Solution

(a) [Ray diagram labels: image, F, d0, 5 cm, 25 cm]

(b) The image is 25 cm from the eye and the lens is 5.0 cm from the eye, so the image is 20 cm from the lens and d_i = −20 cm. Then

$$\frac{1}{d_o} = \frac{1}{f} - \frac{1}{d_i} = \frac{1}{8.0} + \frac{1}{20} = 0.175\ \text{cm}^{-1} \;\Rightarrow\; d_o = 5.7\ \text{cm}$$
Figure 1825

In a compound microscope, the objective lens has a short focal length. The object is placed just outside the focal length of the objective. The image produced by the objective is real, magnified and inverted. (This image is slightly coloured due to chromatic aberration, as will be explained in the next section, G.2.15.) The image acts as a real object for the eyepiece. The eyepiece is placed close to the eye and has a longer focal length. It acts as a simple magnifier so that the final image is an inverted, magnified and virtual image that is positioned at the near point. If the eyepiece is placed so that the image formed by the objective falls at the first principal focus of the eyepiece, then the image can be viewed at infinity. However, to gain a greater angular magnification, the eyepiece is placed so that this image falls a little inside the first principal focus of the eyepiece and the final image is at the near point of the viewer. This virtual image is fairly free of colour.

The overall magnification is given by the product of the angular magnification of the eyepiece and the linear magnification of the objective:

$$M = \frac{h_i}{h} \times \frac{h}{h_o}$$

where h_o is the height of the object, h is the height of the intermediate image formed by the objective and h_i is the height of the final image. These values are normally printed on the microscope by the manufacturer. For example, if a microscope has a 20× eyepiece and the objective being used is 40×, then the magnification is 800 times.

TELESCOPE

Telescopes are used to view objects that are often large and that are far away. There are three basic types of telescope:

1. The reflecting telescope
2. The astronomical refracting telescope
3. The terrestrial refracting telescope.

A refracting astronomical telescope, shown in Figure 1826, uses a combination of two converging lenses: the objective lens, and the eyepiece that acts as a simple magnifier.

Figure 1826 A refracting astronomical telescope (labels: Fo and FE, image, rays from virtual image at infinity)

The objective lens has a long focal length, and a large diameter so that large quantities of light from a distant object can enter the telescope. The object distance, being very far away, is much larger than the focal length of the lens, and this produces an image that is very small (diminished), real and inverted at the focal length of the objective Fo. This real image is placed just inside the focal length of the eyepiece FE. The eyepiece has a short focal length. It is placed in position to produce an inverted, virtual image at infinity. As a rough estimate (allowing for accommodation), the diagram shows that the objective and the eyepiece need to be separated by a distance equivalent to the sum of their focal lengths, Fo + FE.

The angular magnification is given by:

$$M = \frac{\theta}{\theta_0} = -\frac{F_O}{F_E}$$

The negative sign for the focal length ratio indicates that the image is inverted. The equation also indicates that the angular magnification is optimum when an objective of
large focal length and an eyepiece of small focal length are used. When specifying a refracting telescope, the diameter of the objective lens is frequently quoted. The bigger the diameter, the more light is collected thus allowing for greater resolution of what is being viewed. For distant objects such as the Moon, it does not cause major problems when viewing the inverted image. The terrestrial refracting telescope as shown in Figure 1827 incorporates a third converging lens called the inverting lens so that the image is upright.
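The two relations given for the astronomical refracting telescope, an angular magnification of −F_O/F_E and a lens separation of F_O + F_E, can be sketched as follows. The function names `telescope` and `focal_lengths` are hypothetical, the numbers are illustrative assumptions rather than values from the text, and the sketch applies only when the final image is at infinity.

```python
def telescope(f_objective, f_eyepiece):
    """Astronomical refracting telescope with the final image at infinity.

    Returns (angular magnification, lens separation); both focal lengths
    must be in the same unit.
    """
    M = -f_objective / f_eyepiece          # negative sign: inverted image
    separation = f_objective + f_eyepiece  # lenses one focal-length sum apart
    return M, separation


def focal_lengths(magnitude_M, separation):
    """Invert the two relations: given |M| and the separation, return (f_O, f_E)."""
    f_e = separation / (magnitude_M + 1.0)
    return magnitude_M * f_e, f_e


# Illustrative values in cm (assumptions, not taken from the text)
M, L = telescope(f_objective=120.0, f_eyepiece=4.0)
print(M, L)
f_o, f_e = focal_lengths(magnitude_M=20.0, separation=105.0)
print(f_o, f_e)
```

The second helper shows that quoting the magnification and the tube length together is enough to fix both focal lengths.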
[Diagram labels: objective lens, Fo, image, inverting lens]

The image that forms the object for the inverter is at twice the focal length of the inverter. Therefore, the image formed by the inverter is real, upright and very diminished, and is formed at twice the focal length (on the transmission side) of the inverter. This image is just inside the focal length of the eyepiece as before and is viewed as an enlarged, virtual and upright image. Overall, it should be understood that the magnifying powers of telescopes are not as crucial as their light-gathering power. Furthermore, there is a limit to the size of the objective lens before aberrations are produced, and it is difficult to support large, heavy lenses by their edges.

Example

A compound microscope consists of an objective lens with a focal length of 1.50 cm and an eyepiece with a focal length of 10.0 cm. The lenses are separated by a distance of 15.0 cm as shown in the Figure below (not to scale). An object of height 0.3 cm is placed at a distance of 2.00 cm from the objective lens.

[Figure labels: objective, f = 1.50 cm; eyepiece, f = 10.0 cm]

(a) Determine the image distance and the magnification of the objective lens.
(b) Determine the object distance of the eyepiece.
(c) Calculate the image distance of the eyepiece.
(d) Calculate the magnification of the microscope.

Solution

(a)

$$\frac{1}{d_i} = \frac{1}{f} - \frac{1}{d_o} = \frac{1}{1.50} - \frac{1}{2.00} = 0.167\ \text{cm}^{-1} \;\Rightarrow\; d_i = 6.00\ \text{cm}$$

Magnification = −(6 cm)/(2 cm) = −3 (the negative sign shows us that the image is inverted).

(b) d_o of the eyepiece = 15.0 − 6.00 = 9.0 cm.

(c)

$$\frac{1}{d_i} = \frac{1}{f} - \frac{1}{d_o} = \frac{1}{10.0} - \frac{1}{9.00} = -0.011\ \text{cm}^{-1} \;\Rightarrow\; d_i = -9.00 \times 10^{1}\ \text{cm}$$

(the negative sign indicates the image is virtual).

(d) Magnification = magnification of the objective lens × magnification of the eyepiece = −3 × (90/9) = −30 times.
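The steps of the compound microscope example can be traced in code. This is a sketch only, assuming the thin lens formula and m = −d_i/d_o for each lens in turn; the function names `image_distance` and `compound_microscope` are hypothetical.

```python
def image_distance(f, d_o):
    """Solve 1/d_o + 1/d_i = 1/f for d_i."""
    return 1.0 / (1.0 / f - 1.0 / d_o)


def compound_microscope(f_obj, f_eye, separation, d_o):
    """Trace an object through the objective, then the eyepiece (lengths in cm)."""
    d_i_obj = image_distance(f_obj, d_o)
    m_obj = -d_i_obj / d_o             # negative: inverted intermediate image
    d_o_eye = separation - d_i_obj     # intermediate image is the eyepiece's object
    d_i_eye = image_distance(f_eye, d_o_eye)
    m_eye = -d_i_eye / d_o_eye         # positive: eyepiece image is erect
    return d_i_obj, m_obj, d_o_eye, d_i_eye, m_obj * m_eye


d_i_obj, m_obj, d_o_eye, d_i_eye, m_total = compound_microscope(
    f_obj=1.50, f_eye=10.0, separation=15.0, d_o=2.00
)
print(round(d_i_obj, 2), round(m_obj, 2))
print(round(d_o_eye, 2), round(d_i_eye, 1), round(m_total, 1))
```

Changing `separation` shows how sensitive the final image position is to the tube length: the intermediate image must stay inside the focal length of the eyepiece for the final image to remain virtual.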
Example
The following Figure shows 3 rays of light coming from a distant star and passing through the objective lens of a telescope. The focal length of the objective lens and the eyepiece are fO and fE.
[Figure labels: objective, f0, fE, image]

(a) Complete the ray diagram to show the formation of the final image. Label the principal focus of the eyepiece lens fE and the image of the star formed by the objective lens.
(b) State where the final image is formed by the telescope.
(c) The telescope has a magnification of 65.0 and the lenses are 70.0 cm apart. Determine the focal length of the two lenses.

Solution

[Completed figure labels: objective, f0, fE, image, image at infinity]

(a) and (b) The Figure above has now been completed. The image formed is at infinity.

(c) M = −fO / fE = −65, so fO = 65 fE (1)
Also, fO + fE = 70.0 cm (2)
Substituting (1) into (2) we have: 66 fE = 70.0 cm, so fE = 1.06 cm and fO = 68.9 cm.

Aberrations

G.2.15 Explain the meaning of spherical aberration and of chromatic aberration as produced by a single lens. G.2.16 Describe how spherical aberration in a lens may be reduced. G.2.17 Describe how chromatic aberration in a lens may be reduced.
IBO 2007

Spherical aberration occurs because the rays that refract at the outer edges of a lens will have a different focal length to those rays that refract near the principal axis. To put it another way, spherical aberration occurs because the rays incident near the edges of a converging lens are refracted more than the paraxial rays, as shown in Figure 1831.

Figure 1831

This produces an area of illumination, called the circle of least confusion, rather than a point image, even when monochromatic light is used. Spherical aberration causes curving of the image at its edges. So if the object was a series of square grids as shown in Figure 1832 then the image would be distorted at the edges.
Figure 1832 Spherical aberration (labels: OBJECT, IMAGE)

To reduce spherical aberration, the lens can be ground to be slightly non-spherical to adjust for the circle of least confusion, or different combinations of lenses can be put together. Alternatively, a stop (an opaque disc with a hole in it) is inserted before the lens so that the aperture size can be adjusted to allow only paraxial rays to enter. However, this reduces the light intensity and introduces diffraction of light.

Recall from your studies that refractive index is a function of wavelength:

$${}_1n_2 = \frac{\sin i}{\sin r} = \frac{\lambda_1}{\lambda_2}$$

Because visible light is a mixture of wavelengths, the refractive index of the lens is different for each wavelength or colour of white light. Consequently, different wavelengths are refracted by different amounts as they are transmitted in the medium of the lens. For example, blue light is refracted more than red light as shown in Figure 1833.

Figure 1833 (left: red (R) and blue (B) rays with a red focus and a blue focus; right: achromatic doublet made from a converging crown glass lens and a diverging flint glass lens joined with Canada balsam)

Each colour must therefore have a different focal length, and it further follows that focal length is a function of wavelength. Chromatic aberration produces coloured edges around an image. It can be minimised by using an achromatic doublet that is made from a converging crown glass lens and a diverging flint glass lens that are adhered together by Canada balsam, as drawn in the right of Figure 1833. Since the chromatic aberration of converging and diverging lenses is opposite, a combination of these two lenses will minimise this effect.

Exercise 18.2 (b)

1. A simple optical device is placed inside a box as shown in the diagram. An object O, placed as shown, produces a real magnified image I. Refer to the following Figure.

[Figure labels: object, optical device, image]

The optical device is
A. a convex lens
B. a concave lens
C. a convex mirror
D. a concave mirror

2. Which of the following is an incorrect statement?
A. the magnifying power of an astronomical telescope can be increased by substituting an eyepiece of greater focal length
B. with a simple astronomical telescope things look upside down
C. the eye lens of a human produces a real, diminished inverted image on the retina
D. refracting telescopes produce less chromatic aberration than reflecting telescopes.

3. An object 4.0 cm high is placed 15.0 cm from a convex lens of focal length 5.0 cm. On graph paper draw a ray diagram to determine the position and nature of the image.

4. An object 4.0 cm high is placed 15.0 cm from a concave lens of focal length 5.0 cm. On graph paper draw a ray diagram to find the position and nature of the image.

5. Place the following into the convex lens or concave lens category: magnifying glass, eye lens, camera lens, the objective lens of a microscope, a lens to correct short-sightedness, spotlight lens.

6. Determine the position, nature and magnification of the image of an object placed 15 cm from a convex lens of focal length 10 cm.
7. A convex lens with a focal length of 8 cm is to be used to form a virtual image that is four times the size of the object. Where must the lens be placed?

8. A slide projector is placed 5.0 m from a screen. Determine the focal length of a lens that would be used to produce an image that is five times as large as the object.

9. A convex lens with a focal length of 4.0 cm is placed 20.0 cm from a concave lens with a focal length of 5.0 cm. Find the position of the image when the object is placed 12.0 cm in front of the convex lens.

10. A double concave lens has a refractive index of 1.5 and radii of curvature of 10.0 cm and 15 cm. Determine its focal length.

11. A refracting telescope has an objective lens with a diameter of 102 cm and a focal length of 19.5 m. If the focal length of the eyepiece is 10.0 cm, calculate the angular magnification of the telescope.

12. Determine the angle of minimum deviation for a prism with an apex angle of 60° and a refractive index of 1.6.

13. Can a rectangular prism be used to disperse white light? Explain your answer.

14. Describe the meaning of spherical and chromatic aberration, and a method to reduce the effect of each.

15. Refer to the following table. Choose the correct combination of lenses that are used in either the compound microscope or the refracting astronomical telescope.

     optical instrument                   objective lens        eyepiece
A    compound microscope                  long focal length     long focal length
B    compound microscope                  long focal length     short focal length
C    refracting astronomical telescope    short focal length    long focal length
D    refracting astronomical telescope

We can obtain evidence for the wave nature of sound by showing that sound produces an interference pattern. Figure 1836 shows the set up for demonstrating this.

Figure 1836 (labels: speaker, microphone, X)
The two speakers are connected to the same output of the signal generator and placed about 50 cm apart. The signal generator frequency is set to about 600 Hz and the microphone is moved along the line XY which is about one metre from the speakers. As the microphone is moved along XY the trace on the cathode ray oscilloscope is seen to go through a series of maxima and minima corresponding to points of constructive and destructive interference of the sound waves. An interesting investigation is to find how the separation of the points of maximum interference depends on the frequency of the source and also the distance apart of the speakers.
Figure 1837 (labels: coloured filter, single slit, double slit S1 and S2, screen)
Young allowed sunlight to fall onto a narrow single slit. A few centimetres from the single slit he placed a double slit. The slits are very narrow and separated by a distance equal to about fourteen slit widths. A screen is placed about a metre from the double slits. Young observed a pattern of multi-coloured fringes on the screen. When he placed a coloured filter between the single slit and double slit he obtained a pattern which consisted of bright coloured fringes separated by darkness.

The single slit essentially ensures that the light falling on the double slit is coherent. The two slits then act as the two speakers in the sound experiment or the two dippers in the ripple tank. The light waves from each slit interfere and produce the interference pattern on the screen. Without the filter a pattern is formed for each wavelength present in the sunlight, hence the multi-coloured fringe pattern.

You can demonstrate optical interference for yourself. A double slit can be made by smoking a small piece of glass and then drawing two parallel lines on it. If you then look through the double slit at a single tungsten filament lamp you will see the fringe pattern. By placing filters between the lamp and the slits you will see the monochromatic fringe pattern. You can also see the effects of optical interference by looking at net curtains. Each hole in the net acts as a point source and the light from all these separate sources interferes and produces quite a complicated interference pattern.

A laser can also be used to demonstrate optical interference. Since the light from the laser is coherent it is very easy to demonstrate interference. Just point the laser at a screen and place a double slit in the path of the laser beam.

Let us now look at Young's double slit experiment in more detail. The geometry of the situation is shown in Figure 1838.
Figure 1838 (labels: S1, S2, d, X, D, P, y, screen)

S1 and S2 are the two narrow slits that we shall regard as two coherent, monochromatic point sources. The distance from the sources to the screen is D and the distance between the slits is d. The waves from the two sources will be in phase at Q and there will be a bright fringe here. We wish to find the condition for there to be a bright fringe at P, a distance y from Q.

We note that D (≈ 1 metre) is very much greater than either y or d (≈ a few millimetres). This means that both θ and φ are very small angles and for all intents and purposes equal. From the diagram we have that

$$\theta = \frac{y}{D}$$

and

$$\phi = \frac{S_2X}{d}$$

Since θ and φ are equal, S2X/d = y/D. But S2X is the path difference between the waves as they travel to P. We have therefore that

$$\text{path difference} = \frac{yd}{D}$$

There will be a bright fringe at P if the path difference is a whole number n of wavelengths λ, so that

$$\frac{yd}{D} = n\lambda$$

Suppose that there is a bright fringe at y = y1 corresponding to n = n1; then

$$\frac{y_1 d}{D} = n_1 \lambda$$

If the next bright fringe occurs at y = y2 this will correspond to n = n1 + 1. Hence

$$\frac{y_2 d}{D} = (n_1 + 1)\lambda$$

This means that the spacing between the fringes y2 − y1 is given by

$$y_2 - y_1 = \frac{\lambda D}{d}$$

Young actually used this expression to measure the wavelength of the light he used, and it is a method still used today. We see for instance that if in a given set up we move the slits closer together then the spacing between the fringes will get greater. Effectively our interference pattern spreads out, that is, there will be fewer fringes in a given distance. We can also increase the fringe spacing by increasing the distance between the slits and the screen. You will also note that for a given set up using light of different wavelengths, red fringes will be spaced further apart than blue fringes.

In this analysis we have assumed that the slits act as point sources, and as such the fringes will be uniformly spaced and of equal intensity. A more thorough analysis should take into account the finite width of the slits. This has the effect of modifying the interference pattern, as is discussed in Option H.
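The dependence of the fringe spacing on wavelength, slit separation and screen distance can be explored with a short Python sketch of the expression y2 − y1 = λD/d. The function names and the numerical values (650 nm for red, 450 nm for blue, slits 0.50 mm apart, screen 1.0 m away) are assumptions chosen for illustration only.

```python
def fringe_spacing(wavelength, slit_separation, screen_distance):
    """Spacing between adjacent bright fringes, y2 - y1 = lambda * D / d (SI units)."""
    return wavelength * screen_distance / slit_separation


def bright_fringe_positions(wavelength, slit_separation, screen_distance, orders):
    """Positions y_n = n * lambda * D / d measured from the central fringe."""
    dy = fringe_spacing(wavelength, slit_separation, screen_distance)
    return [n * dy for n in orders]


# Illustrative values (assumptions): d = 0.50 mm, D = 1.0 m
red = fringe_spacing(650e-9, 0.50e-3, 1.0)
blue = fringe_spacing(450e-9, 0.50e-3, 1.0)
print(f"red: {red * 1e3:.2f} mm, blue: {blue * 1e3:.2f} mm")

# Moving the slits closer together spreads the pattern out
closer = fringe_spacing(650e-9, 0.25e-3, 1.0)
print(f"red, slits at half the separation: {closer * 1e3:.2f} mm")
```

Both trends described in the text appear directly: the red fringes are further apart than the blue ones, and halving the slit separation doubles the spacing.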
Returning to Figure 1838 we see that we can write the path difference S2X as

$$S_2X = d\tan\theta$$

But since θ is a small angle the sine and tangent will be nearly equal so that

$$S_2X = d\sin\theta$$

The condition for a bright fringe to be found at a point on the screen can therefore be written as

$$d\sin\theta = n\lambda$$

Figure 1839 shows the intensity distribution of the fringes on the screen when the separation of the slits is large compared to their width. The fringes are of equal intensity and of equal separation.

Figure 1839 The intensity distribution of the fringes

It is worth noting that if the slits are close together, the intensity of the fringes is modulated by the intensity distribution of the diffraction pattern of one of the slits.

Example

Light of wavelength 500 nm is incident on two small parallel slits separated by 1.0 mm. Determine the angle where the first maximum is formed. If after passing through the slits the light is brought to a focus on a screen 1.5 m from the slits, calculate the observed fringe spacing on the screen.

Solution

Using the small angle approximation,

$$\theta = \frac{\lambda}{d} = \frac{5 \times 10^{-7}}{10^{-3}} = 5 \times 10^{-4}\ \text{rad}$$

The fringe spacing is given by

$$y = \frac{\lambda D}{d} = \frac{1.5 \times 5 \times 10^{-7}}{10^{-3}} = 0.75\ \text{mm}$$

Exercise 18.3

In Figure 1836, the distance between the speakers is 0.50 m and the distance between the line of the speakers and the line XY is 2.0 m. As the microphone is moved along the line XY, the distance between successive points of maximum sound intensity is 0.30 m. The frequency of the sound waves is 4.4 × 10³ Hz. Calculate a value for the speed of sound.
Figure 1840 A parallel beam passing through several slits (labels: waves 1 and 2, telescope)

The slits are very small so that they can be considered to act as point sources. They are also very close together such that d is small (≈ 10⁻⁶ m). Each slit becomes a source of circular wave fronts and the waves from each slit will interfere. Let us consider the light that leaves the slits at an angle θ as shown. The path difference between wave 1 and wave 2 is d sin θ, and if this is equal to an integral number of wavelengths then the two waves will interfere constructively in this direction. Similarly wave 2 will interfere constructively with wave 3 at this angle, and wave 3 with 4 etc., across the whole grating. Hence if we look at the light through a telescope, that is bring it to a focus, then when the telescope makes an angle θ to the grating a bright fringe will be observed. The condition for observing a bright fringe is therefore

$$d\sin\theta = n\lambda$$

Suppose we use light of wavelength 500 nm and suppose that d = 1.6 × 10⁻⁶ m. Obviously we will see a bright fringe in the straight on position θ = 0 (the zero order). The next position will be when n = 1 (the first order), and substitution in the above equation gives θ = 18°.

The next position will be when n = 2 (the second order) and this gives θ = 39°. For n = 3, sin θ = 0.94 and θ = 70°. For n = 4, sin θ is greater than 1, so with this set up we obtain only 7 fringes: the zero order and three either side of the zero order. The calculation shows that the separation of the orders is relatively large. At any angles other than these the light leaving the slits interferes destructively. We can see that the fringes will be sharp, since if we move just a small angle away from 18° the light from the slits will interfere destructively.

An array of narrow slits such as described above is usually made by cutting narrow transparent lines very close together into the emulsion on a photographic plate (typically 200 lines per millimetre). Such an arrangement is called a diffraction grating. The diffraction grating is of great use in examining the spectral characteristics of light sources.
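The order-by-order calculation can be repeated for any wavelength and slit spacing with a short Python sketch of d sin θ = nλ. The function name `grating_orders` is hypothetical; the values passed in are the ones used in the text (λ = 500 nm, d = 1.6 × 10⁻⁶ m).

```python
import math


def grating_orders(wavelength, slit_spacing):
    """Return {n: angle in degrees} for every order with n * lambda / d <= 1."""
    angles = {}
    n = 0
    while n * wavelength / slit_spacing <= 1.0:
        sin_theta = n * wavelength / slit_spacing   # from d sin(theta) = n lambda
        angles[n] = math.degrees(math.asin(sin_theta))
        n += 1
    return angles


orders = grating_orders(wavelength=500e-9, slit_spacing=1.6e-6)
for n, theta in orders.items():
    print(f"n = {n}: theta = {theta:.1f} degrees")
print("orders on each side of the centre:", max(orders))
```

The loop stops at the first order for which sin θ would exceed 1, which is the same test used in the text to count the visible fringes.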
the wavelength of the different wavelengths present in the white light. At any given point in the continuous spectrum the light will be very nearly monochromatic because of the narrowness of the images of the slit formed by the grating. This is in contrast to the double slit where, if white light is used, the images are broad and the spectral colours are not separated.

Light from a laser is shone through a diffraction grating on to a screen. The screen is a distance of 2.0 m from the laser. The distance between the central diffraction maximum and the first principal maximum formed on the screen is 0.94 m. The diffraction grating has 680 lines per mm. Estimate the wavelength of the light emitted by the laser.

G.5.2 Draw and annotate a typical X-ray spectrum. G.5.3 Explain the origins of the features of a characteristic X-ray spectrum. G.5.4 Solve problems involving accelerating potential difference and minimum wavelength.
IBO 2007

emanating from the discharge tube and which gave rise to the fluorescence at the screen. He concluded that the rays originated from the point where the electrons in the discharge tube (at this time the electron had not been discovered and in his published paper Röntgen referred to cathode rays) struck the side of the tube or the anode. Furthermore, the rays travelled in straight lines from their point of production and were capable of great penetrating power, quite a thick sheet of aluminium being necessary to stop them entirely; they could pass through a 1000 page book without any noticeable decrease in intensity. Perhaps more striking was their ability to photograph the bone structure of the hand and other parts of the body.

In 1912, seventeen years after their discovery, Von Laue demonstrated that X-rays are very short wavelength electromagnetic radiation (≈ 10⁻¹⁰ to 10⁻¹¹ m). So in fact X-rays are high energy photons. Figure 1841 is a schematic representation of a modern X-ray (Coolidge) tube.

Figure 1841 (labels: electron beam, evacuated glass tube)

Electrons are produced by the heated cathode. The potential difference between cathode and anode may range from about 10 kV to 50 kV. The anode, which is often oil cooled because of the large amount of thermal energy produced, is faced with a heavy metal such as tungsten or molybdenum. As a result of the electrons striking the metal anode, X-rays are ejected from the metal. The production of X-rays is sometimes referred to as the inverse photoelectric effect.
in Figure 1842 The electrons have been accelerated through 25 kV and also through 15 kV. There are several features of these curves which are immediately apparent. For both the spectra produced by electrons of 25 keV and 15 keV energies there is a minimum wavelength min produced. The 25 keV curve also shows two distinct peaks called the K and the K lines . Let us see if we can understand these curves.
[Graph: intensity / relative units plotted against wavelength for accelerating potentials of 15 kV and 25 kV, with the $K_\alpha$ and $K_\beta$ peaks marked]
There is, however, another very important mechanism for the production of X-rays. If an electron has a sufficiently high energy it can ionize an atom of the target not by removing one of the electrons in an outer shell but by removing an electron from one of the inner electron energy levels. The ground state energy level is often referred to as the K-shell (n = 1) and the next energy level as the L-shell (n = 2). For example, suppose an incident electron removes a K-shell electron; the vacancy in this shell can now be filled by an electron of the L-shell, or the M-shell or other shells, making a transition to the K-shell. A transition from the L-shell to the K-shell gives rise to the $K_\alpha$ peak shown in Figure 1842. An electron which makes a transition from the M-shell to a vacancy created in the K-shell gives rise to the $K_\beta$ line. If an incident electron ionizes a target atom by ejecting an L-shell electron then electron transitions from the M-shell and N-shell to fill the vacancy give rise to X-ray lines called the $L_\alpha$ and $L_\beta$ respectively.

It is apparent that the wavelength of the lines in an X-ray spectrum will be characteristic of a particular element. In fact in 1913 Moseley, who measured the X-ray spectra produced by many different elements, showed that the frequency of a given line (say the $K_\alpha$ line) was related to the proton number Z. This was of vital importance since at this time Z was just regarded as a number that referred to the position of the element in the periodic table. Moseley was in fact able to show that the position of certain elements should be reversed. He was also able to fill in several gaps in the table by predicting the existence of new elements, such as technetium and promethium. Unfortunately, Moseley was killed in the ill-fated Dardanelles expedition of World War I in August 1915.
Figure 1842
When electrons strike a target most of them usually lose their energy gradually by making glancing collisions with the atoms of the target material. The effect of this is to increase the average kinetic energy of the atoms and so increase the temperature of the target. It is for this reason that the target in a high energy X-ray tube must be cooled, usually by a flow of oil. If it were not cooled the temperature rise could be sufficient to melt the target. About 99 per cent of the energy of the electron beam goes into heating the target. However, each glancing collision will result in the electron emitting radiation due to its acceleration. It is these glancing collisions that result in the continuous part of the spectrum. The continuous spectrum is sometimes referred to as bremsstrahlung, the German for braking radiation, a very descriptive name indeed.

A few electrons will lose all their energy in one collision and this rapid acceleration of the electron results in an energetic pulse of electromagnetic radiation, i.e. a high energy photon. If an electron is accelerated from rest through a potential difference $V$ then the maximum energy of the photon that is produced when it is brought to rest is $Ve$ such that
$$Ve = hf = \frac{hc}{\lambda_{\min}}$$

so that

$$\lambda_{\min} = \frac{hc}{Ve}$$

1. Calculate the minimum wavelength of X-ray photons produced when electrons that have been accelerated through a potential difference of 25 kV strike a heavy metal target.

2. The ground state energy level of a fictitious element is 20 keV and that of the next state is 2.0 keV. Calculate the wavelength of the $K_\alpha$ line associated with this element.
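The relation $\lambda_{\min} = hc/(Ve)$ can be checked numerically. The sketch below is only illustrative: the function name `min_wavelength` and the rounded constants are assumptions, and the two potentials are the 10 kV and 50 kV limits quoted earlier for a typical tube.

```python
# Rounded physical constants (assumed values, three to four significant figures)
PLANCK_H = 6.626e-34    # Planck constant, J s
LIGHT_C = 2.998e8       # speed of light, m/s
ELECTRON_E = 1.602e-19  # elementary charge, C


def min_wavelength(potential_volts):
    """Return lambda_min = hc / (Ve) in metres for an accelerating potential V."""
    return PLANCK_H * LIGHT_C / (potential_volts * ELECTRON_E)


for kilovolts in (10, 50):
    lam = min_wavelength(kilovolts * 1e3)
    print(f"{kilovolts} kV: lambda_min = {lam:.2e} m")
```

Both results fall in the range $10^{-10}$ to $10^{-11}$ m quoted for X-ray wavelengths, and a larger accelerating potential gives a shorter cut-off wavelength.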
G.5.5 Explain how X-ray diffraction arises from the scattering of X-rays in a crystal.
G.5.6 Derive the Bragg scattering equation.
G.5.7 Outline how cubic crystals may be used to measure the wavelength of X-rays.
G.5.8 Outline how X-rays may be used to determine the structure of crystals.
G.5.9 Solve problems involving the Bragg equation.
IBO 2007

X-Ray Diffraction

As mentioned above, in 1912, Von Laue showed that X-rays are very short wavelength electromagnetic waves. In order to show this, he had the idea that the apparent symmetry of the spacing and position of the atoms in a crystal might act as a kind of diffraction grating. The resulting diffraction pattern is very complicated since we are dealing in effect with a three-dimensional grating. Shortly after Von Laue's work, William Bragg showed that the diffraction pattern produced by X-rays interacting with a crystal actually arose from the scattering of the X-rays by the crystal lattice planes. Figure 1843 illustrates the principle of Bragg's interpretation of X-ray diffraction by crystals. It also shows the constructions necessary to find the condition for the scattered rays to interfere constructively.

Each lattice ion (represented by a dot) acts as a source of secondary waves. In general as these waves overlap they will tend to interfere in a random manner. However, those waves that are scattered at angles equal to the angle at which the X-rays are incident on the ion will stand a chance of reinforcing constructively with another scattered wave. The directions of two such waves are shown in Figure 1843 by the ray labelled 1 that is scattered by the first layer and by the ray labelled 2 that is scattered by the second layer. Ray 2 travels an extra path difference AB + BC where AB = BC = $d\sin\theta$, $d$ being the spacing between adjacent crystal layers and $\theta$ the angle between the incident X-ray and the crystal layer. The two waves will interfere constructively if the path difference between them is an integral number of wavelengths, i.e.

$$2d\sin\theta = n\lambda \quad \text{where } n = 1, 2, \ldots$$

This equation is known as the Bragg scattering equation. However, there are many more planes from which the X-rays can be scattered. For example the planes such as those labelled XY and X′Y′ in Figure 1843 must also be considered. By considering a perfect cubic crystal lattice array it is possible to predict the diffraction pattern that results from X-ray scattering. We consider this in the next section.
[Figure 1843: rays 1 and 2 scattered from the first plane and the second plane of the crystal, with points A and C and the planes XY and X′Y′ marked]

G.5.7

By assuming a perfect cubic lattice array it is possible to predict the resulting interference pattern produced. Such a pattern is produced by a zinc sulphide crystal and a sodium chloride crystal. Once it is known that a crystal has perfect cubic symmetry the wavelength of the incident X-rays may be measured. Consider a crystal which consists of two elements combined to form a cubic lattice. Let the compound have a density $\rho$ and molar mass $M$; then the mass of one molecule is $M/N_A$, where $N_A$ is Avogadro's number. The number of molecules $n'$ per unit volume is given by

$$n' = \frac{\rho}{M/N_A} = \frac{\rho N_A}{M}$$
and since the crystal under study is diatomic the number of atoms $N$ per unit volume is given by

$$N = \frac{2\rho N_A}{M}$$

Now let $n$ equal the number of atoms along the edge of a unit cube of the crystal and $d$ equal the spacing of the individual atoms. The length of the edge of the unit cube is therefore $nd$ and the volume $n^3d^3$. But $n^3$ is just the number $N$ of atoms in the unit cube, therefore

$$n^3 d^3 = N d^3 = \frac{2\rho N_A}{M}\,d^3 = 1$$

so that

$$d^3 = \frac{M}{2\rho N_A}$$

Bragg was able to check his explanation experimentally by constructing an X-ray spectrometer. The principle of such an instrument is shown in Figure 1844. A collimated beam of X-rays is incident on a crystal of sodium chloride which may be rotated to be at any angle $\theta$ to the direction of the incident beam. The intensity of the scattered beam is measured at different angles by the detector shown. The results of such an experiment are shown in the graph in Figure 1845.

The three maxima a1, b1, c1 correspond to first-order maximum interference for three distinct wavelengths present in the X-ray beam and a2, b2, c2 to second-order interference. The detector measures the first-order maxima as occurring at an angle $2\theta$, where $\theta$ is the angle that the crystal face makes with the incident beam. The wavelengths of the X-rays can now be calculated from the Bragg equation and the value of $d$ as calculated. X-ray spectrometers are used extensively at the present day for measuring X-ray wavelengths, the only modification to the Bragg spectrometer being the use of a fluorite crystal rather than a sodium chloride crystal.
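The two steps described above (finding $d$ from the density and molar mass, then using the Bragg equation to obtain a wavelength) can be sketched in a few lines. The function names, the handbook-style values for sodium chloride (molar mass 58.44 g mol⁻¹, density 2165 kg m⁻³) and the 15° angle are assumptions made for illustration; they are not data taken from the text.

```python
import math

AVOGADRO = 6.022e23  # Avogadro's number, mol^-1 (rounded)


def lattice_spacing(molar_mass_kg, density_kg_m3):
    """Spacing d of a diatomic cubic crystal: d^3 = M / (2 rho N_A)."""
    return (molar_mass_kg / (2 * density_kg_m3 * AVOGADRO)) ** (1 / 3)


def bragg_wavelength(d, theta_deg, order=1):
    """Wavelength from the Bragg equation 2 d sin(theta) = n lambda."""
    return 2 * d * math.sin(math.radians(theta_deg)) / order


# Assumed values for sodium chloride: M = 58.44 g/mol, rho = 2165 kg/m^3
d_nacl = lattice_spacing(58.44e-3, 2165.0)
print(f"d = {d_nacl:.3e} m")

# Hypothetical first-order Bragg angle of 15 degrees
print(f"lambda = {bragg_wavelength(d_nacl, 15.0):.3e} m")
```

With these assumed inputs the spacing comes out at about 0.28 nm, the same order of magnitude as the X-ray wavelengths themselves, which is why a crystal can act as a diffraction grating for X-rays.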
[Figure 1844: principle of the X-ray spectrometer, showing the incident X-ray beam, the crystal and the detector]

[Figure 1845: A typical X-ray crystal spectrum of e.g. sodium chloride. X-ray intensity plotted against Bragg angle, with maxima labelled b1, a1, c1, a2, b2, c2]

Exercise 18.5 (b)

A beam of monochromatic X-rays is scattered from a crystal of sodium chloride. Use the following data to determine the wavelength of the X-rays in the beam.
Bragg angle at which first maximum occurs = 18°
molecular mass of sodium chloride = 58
Parallel films

G.6.5 State the condition for light to undergo either a phase change of π, or no phase change, on reflection from an interface.
G.6.6 Describe how a source of light gives rise to an interference pattern when the light is reflected at both surfaces of a parallel film.
G.6.7 State the conditions for constructive and destructive interference.
G.6.8 Explain the formation of coloured fringes when white light is reflected from thin films, such as oil and soap films.
G.6.9 Describe the difference between fringes formed by a parallel film and a wedge film.
G.6.10 Describe applications of parallel thin films.

[Figure 1846 (labels: rays 1, 2, 3 and 4; air; points A and C)]
[Figure 1847: The geometry of interference (points A, B, C, D, E and F)]

The film is of thickness $d$ and refractive index $n$ and the light has wavelength $\lambda$. If the line BF is perpendicular to ray 1 then the optical path difference (opd) between ray 1 and ray 2 when brought to a focus is

$$\text{opd} = n(AC + CB) - AF$$

We have to multiply by the refractive index for the path travelled by the light in the film since the light travels more slowly in the film. If the light travels say a distance $x$ in a material of refractive index $n$ then, in the time that it takes to travel this distance, the light would travel a distance $nx$ in air. If the line CE is at right angles to ray 2 then we see that

$$AF = n\,BE$$

From the diagram AC = CD so we can write

$$\text{opd} = n(CD + CB) - n\,BE = n\,DE$$

Also from the diagram we see that, where $\phi$ is the angle of refraction,

$$DE = 2d\cos\phi$$

From which

$$\text{opd} = 2nd\cos\phi$$

Bearing in mind the change in phase of ray 1 on reflection, we have therefore that the condition for constructive interference is

$$2nd\cos\phi = \left(m + \tfrac{1}{2}\right)\lambda, \quad m = 1, 2, \ldots$$

And for destructive interference

$$2nd\cos\phi = m\lambda$$
Each fringe corresponds to a particular opd for a particular value of the integer $m$, and for any fringe the value of the angle $\phi$ is fixed. This means that it will be in the form of an arc of a circle with the centre of the circle at the point where the perpendicular drawn from the eye meets the surface of the film. Such fringes are called fringes of equal inclination. Since the eye has a small aperture these fringes, unless viewed at near to normal incidence ($\phi = 0$), will only be observed if the film is very thin. This is because as the thickness of the film increases the reflected rays will get further and further apart and so very few will enter the eye.
Non-reflecting films

A very important but simple application of thin film interference is in the production of non-reflecting surfaces. A thin film of thickness $d$ and refractive index $n_1$ is coated onto glass of refractive index $n$ where $n_1 < n$. Light of wavelength $\lambda$ that is reflected at normal incidence will undergo destructive interference if $2n_1 d = \lambda/2$, that is

$$d = \frac{\lambda}{4n_1}$$

(remember that both reflections are now rare to dense, so each undergoes the same phase change of π and there is no relative phase change between the two reflected rays).

The use of such films can greatly reduce the loss of light by reflection at the various surfaces of a system of lenses or prisms. Optical parts of high quality systems are usually all coated with non-reflecting films in order to reduce stray reflections. The films are usually made by evaporating calcium or magnesium fluoride onto the surfaces in vacuum, or by chemical treatment with acids which leave a thin layer of silica on the surface of the glass. The coated surfaces have a purplish hue by reflected light. This is because the condition for destructive interference from a particular film thickness can only be obtained for one wavelength. The wavelength chosen is one that has a value corresponding to light near the middle of the visible spectrum. This means that reflection of red and violet light is greater, combining to give the purple colour. Because of the factor $\cos\phi$, at angles other than normal incidence the path difference will change, but not significantly until say about 30° (e.g. cos 25° = 0.90). It should be borne in mind that no light is actually lost by a non-reflecting film; the decrease of reflected intensity is compensated by an increase of transmitted intensity.

Non-reflecting films can be painted onto aircraft to suppress reflection of radar. The thickness of the film is determined by $nd = \lambda/4$ where $\lambda$ is the wavelength of the radar waves and $n$ the refractive index of the film at this wavelength.

Example

Solution

We assume that the fringes are formed by light incident at all angles from normal to grazing incidence. At normal incidence we have $2nd = m\lambda$. From which,

$$m = \frac{2 \times 1.5 \times 2 \times 10^{-3}}{6 \times 10^{-7}} = 10\,000$$

At grazing incidence the angle of refraction is in fact the critical angle. Therefore,

$$\phi = \arcsin\left(\frac{1}{1.5}\right) = 42^\circ$$

i.e. $\cos\phi = 0.75$. At grazing incidence $2nd\cos\phi = m'\lambda$. From which (and using $\cos(\sin^{-1}(1/1.5)) = 0.75$),
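The arithmetic in this solution can be reproduced with a short sketch. The values $n = 1.5$, $d = 2 \times 10^{-3}$ m and $\lambda = 6 \times 10^{-7}$ m are read off the worked equation for $m$ rather than from a stated question, so they should be treated as inferred; the function name `fringe_order` is an assumption.

```python
import math


def fringe_order(n, d, wavelength, cos_phi=1.0):
    """Order m from the condition 2 n d cos(phi) = m * wavelength."""
    return 2 * n * d * cos_phi / wavelength


# Values inferred from the worked equation in the solution
n, d, wavelength = 1.5, 2e-3, 6e-7

# Normal incidence: phi = 0, so cos(phi) = 1
m_normal = fringe_order(n, d, wavelength)

# Grazing incidence: the angle of refraction is the critical angle arcsin(1/n)
cos_phi = math.cos(math.asin(1 / n))
m_grazing = fringe_order(n, d, wavelength, cos_phi)

print(f"normal incidence:  m  = {m_normal:.0f}")
print(f"grazing incidence: m' = {m_grazing:.0f} (cos phi = {cos_phi:.3f})")
```

The unrounded cosine is slightly below the 0.75 used in the text, so the order computed at grazing incidence is a little under the value that the rounded figure would give.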
Exercise 18.6

When viewed from above, an oil film on water appears red in colour. Use the data below to estimate the minimum thickness of the oil film.
average wavelength of red light = 630 nm
refractive index of oil for red light = 1.5
refractive index of water for red light = 1.3
G.6.1 Explain the production of interference fringes by a thin air wedge.
G.6.2 Explain how wedge fringes can be used to measure very small separations.
G.6.3 Describe how thin-film interference is used to test optical flats.
G.6.4 Solve problems involving wedge films.
IBO 2007

[Figure 1848: An arrangement to observe the fringes (labels: monochromatic light, glass plate, slides, wire)]

When the monochromatic light strikes the glass plate some of it will be reflected down onto the wedge. Some of the light reflected from the wedge will be transmitted through the glass plate to the travelling microscope. A system of equally spaced parallel fringes (fringes of equal thickness) is observed. The travelling microscope enables the fringe spacing to be measured. The fringes can also be observed by the naked eye. At normal incidence the condition for a bright fringe to be formed is:

$$2nd = \left(m + \tfrac{1}{2}\right)\lambda$$

In going from one fringe to another $m$ changes by 1 and this means that the thickness $d$ of the wedge will change by $\lambda/2$ (for an air wedge, $n = 1$). Hence the fringes are of equal thickness and parallel to each other. By measuring the fringe spacing for a given wedge the angle of the wedge can be determined, which means that the thickness of the spacer forming the wedge can also be determined.

Example

A wedge film is made by wrapping a single turn of sticky-tape about one end of a microscope slide and placing another slide on top of it. The microscope slides are 4.8 cm long and the wavelength of the light is 500 nm. The fringe spacing is measured as 0.8 mm. Determine the thickness of the sticky-tape.

Solution

In a length of 4.8 cm there will be $\frac{4.8}{0.08} = 60$ fringes.
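The 60th fringe is formed where the wedge has thickness $d$, the thickness of the sticky-tape. Since the wedge encloses air ($n = 1$) and the thickness changes by $\lambda/2$ from one fringe to the next, we have $2d = 60\lambda = 60 \times 500 \times 10^{-9}$, so that $d = 30 \times 5 \times 10^{-7}$, to give $d = 1.5 \times 10^{-5}$ m.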
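The same calculation can be written as a small function, which makes it easy to try other fringe spacings. This is a sketch only: the name `wedge_thickness` and its parameters are assumptions, and it uses the statement that the thickness changes by $\lambda/(2n)$ per fringe.

```python
def wedge_thickness(length, fringe_spacing, wavelength, n=1.0):
    """Thickness at the thick end of a wedge film.

    Each fringe spacing corresponds to a change in thickness of
    wavelength / (2 n); n = 1 for an air wedge.
    """
    number_of_fringes = length / fringe_spacing
    return number_of_fringes * wavelength / (2 * n)


# Data from the example: 4.8 cm slides, 0.8 mm fringe spacing, 500 nm light
d = wedge_thickness(4.8e-2, 0.8e-3, 500e-9)
print(f"thickness of sticky-tape = {d:.2e} m")
```

Halving the fringe spacing for the same slide length would double the number of fringes and hence double the spacer thickness.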
COMPARISON BETWEEN FRINGES FORMED BY A PARALLEL FILM AND THOSE FORMED BY A WEDGE FILM.
For a parallel film, the fringes are of equal inclination, that is they form arcs of a circle whose centre is located at the end of a perpendicular drawn from the eye to the surface of the film. For a wedge film, the fringes are parallel and of equal thickness.
19 RELATIVITY

H.1 (HL), D.1 (SL) Introduction to Relativity
H.2 (HL), D.2 (SL) Concepts and Postulates of Special Relativity
H.3 (HL), D.3 (SL) Relativistic Kinematics
H.4 Some Consequences of Special Relativity
H.5 Evidence to Support Special Relativity
H.6 Relativistic Momentum and Energy
H.7 General relativity
H.8 Evidence to support general relativity
H.1.1 Describe what is meant by a frame of reference.
H.1.2 Describe what is meant by a Galilean transformation.
H.1.3 Solve problems involving relative velocities using the Galilean transformation equations.
IBO 2007
A frame of reference
All measurement must be made relative to some frame of reference. Usually this frame of reference in physics experiments will be your school laboratory and in mathematics, the conventional Cartesian reference frame consists of three mutually perpendicular axes x, y and z. To understand why the concept of a frame of reference is so important, suppose that you were asked to carry out an experiment to measure the acceleration due to gravity at the Earth's surface by timing the period of oscillation of a simple pendulum whilst on a fairground merry-go-round. You would certainly expect to get some very unusual results. Your overall perspective of the world would in fact be very different from the world observed by somebody not on the ride. You might well expect the laws of physics to be different. Yet we all live on a merry-go-round. The Earth spins on its axis as it orbits the Sun. Fortunately the Earth spins relatively slowly compared to the merry-go-round so most of the time we can ignore the effects. However, because of the Earth's rotation, the acceleration due to gravity has a different value at the poles than at the equator. Also you certainly cannot ignore the rotation of the Earth when making astronomical observations or measurements. Newton was well aware of the complications produced by making measurements relative to the Earth. Furthermore, he felt that, for the laws of physics to be precisely valid, all observations must be made in a reference system that is at rest or in a reference system that is moving with
CHAPTER 19 (OPTION H)
uniform speed. Based on the work of Galileo he stated a theory of relativity as follows: "The motions of bodies included in a given space are the same among themselves, whether the space is at rest or moves uniformly forward in a straight line." The question of course arises: at rest with respect to what, or at uniform speed with respect to what? In this option we shall see how the search for a reference system that truly is at rest led to a radical re-think about the nature of space and time, culminating in the two relativity theories of Einstein. We shall in fact see that the Laws of Physics are always true even if you live on a fairground merry-go-round.

A frame of reference which is moving with uniform velocity or which is at rest is known as an inertial reference frame. When your maths teacher draws the conventional x and y axes on the board he or she is in fact expecting you to regard this as an inertial reference frame and ignore any effects of the Earth's motion. Following in your maths teacher's footsteps we will for the time being assume in this section on Special Relativity that we are dealing with truly inertial reference frames. (A space ship far away from any gravitational effects drifting along with constant velocity is a pretty good approximation of an inertial reference frame.)

[Figure 1901: Reference systems]

Mary measures the point P to be at the point $x'$. Paul on the other hand will measure the point to be at the point $x$ as measured in his reference system, where $x = x' + vt$ and $t$ is the time that has elapsed from the moment when the two reference systems were together. We have therefore that $x' = x - vt$. (Note that in all that follows we will only consider motion in the x-direction.)

Suppose now that Mary observes an object in her reference system to be moving with speed $u'$ in the x-direction; then clearly Paul will observe the object to be moving with speed $u = u' + v$, hence we have $u' = u - v$. It is not difficult to show that if Mary were to measure the acceleration of an object as $a'$ then Paul would measure the acceleration of the object as $a = a'$. In this respect they would both interpret Newton's Second Law (in its basic form, $F = ma$) identically. This means that there is no mechanics experiment that Mary or Paul could carry out to determine whether they were at rest or whether they were moving at constant speed in a straight line.

1. The diagram below shows three buoys A, B and C in a river. The river flows in the direction shown and the speed of the current is 2.0 m s⁻¹.

[Diagram: buoys A, B and C with AB = AC; direction of current at 2.0 m s⁻¹; swimmers X and Y, each with v = 3.0 m s⁻¹]

The distance AB is equal to the distance AC. A swimmer X swims from A to B and back with a steady speed of 3.0 m s⁻¹ relative to the water. At the same time as X leaves A, a swimmer Y sets off to C and back with the same steady speed relative to the water.

(a) Determine, for an observer on the bank, the speed of
i. X as she swims towards B.
ii. X as she swims back to A.
iii. Y as he swims towards C.
iv. Y as he swims back to A.

(b) Calculate the ratio of the times for the two journeys.
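The Galilean transformation equations $x' = x - vt$ and $u' = u - v$ used for Mary and Paul can be sketched as two one-line functions. All the numbers below are hypothetical and chosen only to illustrate the point that the two observers agree on acceleration; the function names are assumptions.

```python
def galilean_position(x, v, t):
    """Mary's coordinate x' = x - v t for a point that Paul places at x."""
    return x - v * t


def galilean_velocity(u, v):
    """Speed u' = u - v measured by Mary for an object Paul sees moving at u."""
    return u - v


v = 10.0  # hypothetical relative speed of the two reference systems, m/s

print(galilean_position(50.0, v, 3.0))  # x' after 3.0 s
print(galilean_velocity(25.0, v))       # u' for an object Paul measures at 25 m/s

# An object speeds up from 5.0 m/s to 9.0 m/s in 2.0 s according to Paul
u_start, u_end, duration = 5.0, 9.0, 2.0
a_paul = (u_end - u_start) / duration
a_mary = (galilean_velocity(u_end, v) - galilean_velocity(u_start, v)) / duration
print(a_paul, a_mary)  # the constant v cancels, so a = a'
```

Because $v$ is constant it cancels when velocities are subtracted, which is the reason both observers interpret $F = ma$ identically.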
H.2.2
H.2.3

Introduction

In 1864 Clerk Maxwell published his electromagnetic theory of light in which he united the separate studies of Optics and Electromagnetism. His theory was one of the great unification points in physics, comparable with the Galileo/Newton idea that the laws of celestial and terrestrial mechanics were the same. The source of the electric and magnetic fields is electric charge. If an electric charge is stationary with respect to you then you will measure an electrostatic field. If the charge is moving with constant velocity relative to you then you will measure a magnetic field as well. However, if the charge is accelerated then the fields are no longer static and according to Maxwell they radiate out from the charge, moving through space with the speed of light.

Suppose that the charge in question is a small charged sphere suspended by a string. If it oscillates with a frequency of 1000 Hz then it is a source of longwave radio waves; at a frequency of $10^{9}$ Hz it becomes a source of television signals. If it oscillates with a frequency of $10^{12}$ Hz it is a source of infrared radiation, at about $10^{15}$ Hz it would look yellow and at $10^{18}$ Hz it would be emitting X-rays. Of course all this is a bit absurd but it does illustrate the fact that the source of all radiation in the electromagnetic spectrum is the accelerated motion of electric charge.

Maxwell's theory also showed that the speed with which electromagnetic waves travel depends only on the electric and magnetic constants of the medium through which they travel. In a vacuum this means that the speed depends only on $\varepsilon_0$, the permittivity of free space, and $\mu_0$, the permeability of free space (see Chapters 7 & 9). This in fact means that the speed of light (or any other electromagnetic wave) is independent of the speed of the observer. This has far reaching consequences.

Let us return to our two observers Paul and Mary above. Suppose that Mary were to measure the speed of a light pulse as $3 \times 10^{8}$ m s⁻¹ in her reference system and that the relative velocity between Paul and Mary is $2 \times 10^{8}$ m s⁻¹. This means that Paul would measure the speed of the pulse to be $5 \times 10^{8}$ m s⁻¹. This, according to Maxwell, is not possible. Also it would seem that by measuring the speed of light pulses Mary and Paul have a means of determining whether they are at rest or not. This seems odd.

We have also seen that the laws of mechanics are the same for all inertial observers so we would expect the laws of electromagnetism to be the same for all inertial observers. If, however, we apply the rules for a Galilean transformation (something that is beyond the scope of this book) we find that the laws of electromagnetism are different for different observers. Clearly something is wrong.

Furthermore, if light is an electromagnetic wave, what is the nature of the medium that supports its motion? To answer this question, physicists of the time proposed that an invisible substance called the ether permeates all of space. It was also proposed that this medium is at rest absolutely and is therefore the reference system with respect to which the laws of physics would hold exactly for all inertial observers. Putting it another way, an inertial observer should be able to determine his or her speed with respect to the ether. We shall return to the question of the ether later on in the chapter. Meanwhile we shall see how Einstein's Special Theory of Relativity resolves the problem we seem to have with Maxwell's theory and the speed of light.
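Maxwell's result that the speed of electromagnetic waves in a vacuum depends only on $\varepsilon_0$ and $\mu_0$ is usually written $c = 1/\sqrt{\varepsilon_0 \mu_0}$. That formula is not written out in the text, so it is brought in here as an assumption, along with the rounded constants and the function name. The sketch also repeats the Galilean addition for Paul and Mary to show the conflict.

```python
import math

EPSILON_0 = 8.854e-12      # permittivity of free space, F/m (rounded)
MU_0 = 4 * math.pi * 1e-7  # permeability of free space, H/m (classical value)


def speed_of_light():
    """Speed of electromagnetic waves in a vacuum, c = 1 / sqrt(eps0 * mu0)."""
    return 1.0 / math.sqrt(EPSILON_0 * MU_0)


c = speed_of_light()
print(f"c = {c:.4e} m/s")

# Galilean addition for Paul and Mary: u = u' + v
galilean_sum = 3e8 + 2e8
print(f"Galilean prediction for Paul: {galilean_sum:.1e} m/s")
print("Exceeds c:", galilean_sum > c)
```

Nothing in the expression for $c$ refers to the motion of the observer, which is why the Galilean prediction for Paul cannot be reconciled with Maxwell's theory.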
Clearly, if a reference frame is accelerating, then objects in the reference frame will not remain at rest or continue to move with uniform motion in a straight line.

2.

Another way of looking at the second postulate is to recognise that it means that the speed of light is independent of the speed of the source. We have seen that the laws of mechanics are the same for all inertial observers. However, the laws of electromagnetism do not appear to be so. As we have already seen, if they were not, then we would have a means of finding an absolute reference system. The laws of electromagnetism had been verified by careful experiment and were certainly not in error. Einstein realised that in fact what was in error was not the laws of electromagnetism but the nature of the Galilean transformation itself. A Galilean transformation does not take into account the second postulate and it also assumes that time is an absolute quantity. It assumes in fact that all inertial observers will measure the same value for given time intervals. But if, as we shall see, the second postulate of relativity is correct then this cannot be the case.

Consider observer Y, who is at the mid-point of the train. Just as the train reaches a point where she is opposite X, lightning strikes both ends of the train. X sees these two events take place simultaneously. Refer to Figure 1903.

[Figure 1903: Situation B (train moving with speed v, observers X and Y)]

But this will not be the case for Y. Since the speed of light is independent of the speed of the source, by the time the light from each of the strokes reaches Y the train will have moved forward, Figure 1904.

[Figure 1904: Situation C (train moving with speed v; distances a and b from the two strikes to Y)]

So in effect the light from the strike at the front of the train will reach Y before the light from the strike at the rear of the train. That is, Y has moved forward and, in doing so, has moved closer to where the lightning struck the front of the train, so that the light travels a shorter distance in getting to Y. The light from the back of the train, on the other hand, needs to travel further (a > b) and so takes more time to get to Y. Y will not see the two events as occurring simultaneously.
We might ask which observer is correct? Are the two events simultaneous? In fact both observers are correct. What is simultaneous for one observer is not simultaneous for the other; there is no preferred reference frame. The interpretation of any sequence of events will depend on an individual's frame of reference. Einstein proposed that the three dimensions of space and the one dimension of time describe a four dimensional space-time continuum and that different observers will describe the same event with different space-time coordinates. We shall see later on in the chapter how this idea is developed further.

H.3.5

Introduction

Shortly after Maxwell's equations of electromagnetism were published it was found that, unlike Newton's Laws, they did not keep the same form under a Galilean transformation. Lorentz found that, for them to keep the same form, the transformations in the following table have to be applied.

Galilean: $x' = x - vt$, $t' = t$
Lorentz: $x' = \gamma(x - vt)$, $\Delta t' = \gamma\Delta t$

where $c$ is the free space velocity of light,

$$\gamma = \frac{1}{\sqrt{1 - \dfrac{v^2}{c^2}}}$$

and $\Delta t$ and $\Delta t'$ refer to a time interval in the respective reference systems.

The Lorentz transformation equations are embedded in the Maxwell equations, the equations that express the behaviour of electric and magnetic fields. There is a certain amount of irony here. Newton was well aware of the concept of relativity and, as we have seen, took steps to address the issue in terms of the Galilean transformations. Maxwell on the other hand put his equations together without addressing the relativity issue. When the issue was addressed by Lorentz, this led to a complete reassessment of how we think of time and space.

The major contribution that Einstein made was to realise that the Lorentz transformation equations can also be derived from the second postulate of Special Relativity. This is not difficult to do but we shall not do so here. According to the Special Theory all the laws of physics must transform according to the Lorentz transformation equations. The constancy of the speed of light is contained within the laws of electromagnetism but not within Newton's laws under a Galilean transformation. Hence Newton's laws must transform according to the Lorentz equations.
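To get a feel for how far the Lorentz transformation departs from the Galilean one, the factor $\gamma = 1/\sqrt{1 - v^2/c^2}$ can be evaluated for a few speeds. This is a minimal sketch; the function names and the sample speeds, position and time are assumptions chosen for illustration.

```python
import math

C = 2.998e8  # free space speed of light, m/s (rounded)


def lorentz_factor(v):
    """gamma = 1 / sqrt(1 - v^2 / c^2) for a relative speed v < c."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


def galilean_x(x, v, t):
    """Galilean transformation x' = x - v t."""
    return x - v * t


def lorentz_x(x, v, t):
    """Lorentz transformation x' = gamma (x - v t)."""
    return lorentz_factor(v) * (x - v * t)


# Hypothetical event at x = 1.0e9 m, t = 1.0 s, for a range of relative speeds
for fraction in (0.001, 0.1, 0.6, 0.9):
    v = fraction * C
    print(f"v = {fraction}c: gamma = {lorentz_factor(v):.4f}, "
          f"Galilean x' = {galilean_x(1.0e9, v, 1.0):.3e} m, "
          f"Lorentz x' = {lorentz_x(1.0e9, v, 1.0):.3e} m")
```

At everyday speeds $\gamma$ is indistinguishable from 1, so the two transformations agree; the difference only becomes noticeable when $v$ is an appreciable fraction of $c$.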
To understand the Lorentz equations and to see the effect that they have on our conventional understanding of space and time, let us first look at how our understanding of time is affected. Let us return to the observer in the moving train. The observer has set up an experiment in which she times how long it takes for a light pulse to bounce back and forth between two mirrors separated by a vertical distance d as shown in Figure 1906. This set up is effectively a light clock.
[Figure 1906: A simple light clock]

As measured by Y, the pulse leaves the top mirror at time $t'_1$ and reaches the bottom mirror at time $t'_2$. The time interval $t'_2 - t'_1$ is given by:

$$t'_2 - t'_1 = \frac{d}{c}$$

Suppose that at the time that the light pulse leaves the top mirror, X and Y are directly opposite each other. X measures this time as $t_1$ and the time that the pulse reaches the bottom mirror as $t_2$.

[Figure 1907: the train moves forward a distance $v(t_2 - t_1)$]

By the time that the light pulse reaches the bottom mirror, the train will have moved forward a distance $v(t_2 - t_1)$ as measured by X and he will see the pulse follow the path shown in Figures 1907 and 1908.

[Figure 1908: the triangle ABC formed by the path of the light pulse, with the train moving at speed v]

If we let $t_2 - t_1 = \Delta t$ and $t'_2 - t'_1 = \Delta t'$, then the distance that the train travels is $BC = v\Delta t$. The distance that X measures for the path of the pulse is $AC = c\Delta t$. The distance that Y measures for the path of the pulse is $AB = c(t'_2 - t'_1) = c\Delta t'$. Applying Pythagoras's theorem to the triangle, we have

$$AC^2 = AB^2 + BC^2$$

$$[c\Delta t]^2 = [c\Delta t']^2 + [v\Delta t]^2$$

$$c^2(\Delta t)^2 = c^2(\Delta t')^2 + v^2(\Delta t)^2$$

$$(\Delta t)^2 = \frac{c^2}{c^2 - v^2}(\Delta t')^2$$

so that

$$\Delta t = \frac{1}{\sqrt{1 - \dfrac{v^2}{c^2}}}\,\Delta t'$$

Or, letting $\gamma = \dfrac{1}{\sqrt{1 - \dfrac{v^2}{c^2}}}$, we have $\Delta t = \gamma\Delta t'$.
The equation $\Delta t = \gamma\Delta t'$ is stated in the IB data booklet as $\Delta t = \gamma\Delta t_0$ where $\Delta t_0$ is the proper time. What this effectively means is that to observer X, the light pulse will take a longer time to traverse the distance between the mirrors compared to the time measured by observer Y. This phenomenon is known as time dilation.
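The light clock argument can be checked numerically by computing the two time intervals directly from the geometry and comparing them with $\gamma$. The mirror separation of 1.0 m and the train speed of 0.8c are hypothetical values, and the function names are assumptions.

```python
import math

C = 2.998e8  # speed of light, m/s (rounded)


def interval_for_y(d):
    """Proper time for Y: the pulse travels straight down a distance d."""
    return d / C


def interval_for_x(d, v):
    """Time for X, from (c dt)^2 = d^2 + (v dt)^2, i.e. dt = d / sqrt(c^2 - v^2)."""
    return d / math.sqrt(C ** 2 - v ** 2)


def lorentz_factor(v):
    """gamma = 1 / sqrt(1 - v^2 / c^2)."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


d = 1.0      # hypothetical mirror separation, m
v = 0.8 * C  # hypothetical train speed

dt_proper = interval_for_y(d)
dt_dilated = interval_for_x(d, v)
print(f"Y measures {dt_proper:.3e} s, X measures {dt_dilated:.3e} s")
print(f"ratio = {dt_dilated / dt_proper:.4f}, gamma = {lorentz_factor(v):.4f}")
```

The ratio of the two intervals equals $\gamma$ whatever mirror separation is chosen, since $d$ cancels when the two times are divided.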
As stated above, the bouncing light pulse can effectively be regarded as a clock. We reach the conclusion therefore that any type of clock in an inertial reference system which is moving relative to an observer in another inertial reference system will appear to run slower as measured by this observer. This effectively means that time is slowed for the moving observer. It has been said, tongue in cheek, that if you want to live longer, then keep running. If there were a clock on the train then to X it would not tell the same time as a clock on the ground, nor would the clock on the ground tell the same time to Y as the clock in the train. Time can no longer be regarded as absolute. We must also bear in mind that the situation is symmetric and to observer Y, a clock in observer X's system will appear to run slower than a clock in her own system.

You might also like to ponder the following: is the speed of light the same for all inertial observers because time is not absolute, or is time not absolute because the speed of light is the same for all inertial observers? This is the sort of question that keeps the philosophers happy for ages. From the physicist's point of view, what is of importance is that the Special Theory has been verified experimentally and that all the predictions that it makes have also been verified experimentally. The Special Theory of Relativity is part of the accepted framework of Physics.
Figure 1909 The $\gamma$ function plotted against velocity as a fraction of the speed of light.
From this graph you can see that, for two observers with a relative speed of about 0.98c, a time interval of one second as measured by one observer will be measured as a time interval of about 5 seconds by the other observer.
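The values read from the graph can be checked numerically. The following is a minimal sketch, assuming the standard definition $\gamma = 1/\sqrt{1 - v^2/c^2}$; the function names `lorentz_factor` and `dilated_interval` are illustrative only and do not come from the text.

```python
import math


def lorentz_factor(beta):
    """Return gamma = 1 / sqrt(1 - beta^2), where beta = v / c."""
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must satisfy 0 <= beta < 1")
    return 1.0 / math.sqrt(1.0 - beta * beta)


def dilated_interval(proper_time, beta):
    """Time interval measured by an observer moving at beta*c relative to the clock."""
    return lorentz_factor(beta) * proper_time


# Tabulate gamma for a few relative speeds, including the 0.98c case from the graph.
for beta in (0.5, 0.8, 0.95, 0.98):
    print(f"v = {beta:.2f}c   gamma = {lorentz_factor(beta):.3f}")
```

At 0.98c the factor comes out very close to 5, in line with the statement that one second for one observer corresponds to about five seconds for the other.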
Example
An observer sets up an experiment to measure the time of oscillation of a mass suspended from a vertical spring. He measures the time period as 2.00 s. To another observer this time period is measured as 2.66 s. Calculate the relative velocity between the two observers.

Solution
Since the measured event occurs at the same place in the frame in which the time period is measured as 2.00 s, this is the proper time, so $2.66 = \gamma \times 2.00$, or

$\gamma = 1.33 = \dfrac{1}{\sqrt{1 - \dfrac{v^2}{c^2}}}$

such that

$1 - \dfrac{v^2}{c^2} = \dfrac{1}{1.33^2} \; (= 0.57)$.

Therefore $\dfrac{v^2}{c^2} = 0.43$ so that $v = 0.66c$.
This is a very high speed to say the least, and in one time period the observers will have moved about $4 \times 10^8$ m apart, which makes the whole thing strange. However, we will look at a more realistic example later in the chapter.

Length contraction

H.3.6 H.3.7
The derivation of the length contraction formula is not required.
H.3.8 Solve problems involving length contraction.
© IBO 2007

We define the proper length of the rod as the length of the rod measured by the observer at rest with respect to the rod. Suppose that end A of the rod is at $x'_1$ and B is at $x'_2$ as measured by Mary. The rod is at rest in Mary's system, therefore the proper length of the rod is $(x'_2 - x'_1) = L'$ (say). If we apply the Lorentz transformation then $x'_1$ and $x'_2$ are given by

$x'_1 = \gamma(x_1 - vt_1)$ and $x'_2 = \gamma(x_2 - vt_2)$

where $x_1$ and $x_2$ are the respective ends of the rod as measured by Paul. The length of the rod as measured by Paul can therefore have many different values depending on the choice of $t_1$ and $t_2$, the times when the ends of the rod are measured. To make any sense, the length of the moving rod (from Paul's point of view) is defined as the length when the ends are measured simultaneously, i.e. $t_1 = t_2$. Hence the proper length of the rod is

$L' = (x'_2 - x'_1) = \gamma(x_2 - x_1)$.

Hence the length $L = (x_2 - x_1)$ of the rod measured by Paul will be given by

$L = \dfrac{L'}{\gamma}$

This equation is written as $L = \dfrac{L_0}{\gamma}$ in the data booklet, where $L_0$ is the proper length. (You will not be expected to derive this equation in an examination.)
Since $\gamma$ is always greater than unity, $L$ will always be less than the proper length. To Paul the rod will appear contracted in the direction of motion.
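The length contraction relation $L = L_0/\gamma$ can be sketched in a few lines of code. This is an illustrative sketch only; the helper names `contracted_length` and `beta_from_lengths` are assumptions introduced here, and speeds are expressed as fractions of c.

```python
import math


def contracted_length(proper_length, beta):
    """Length measured by an observer moving at beta*c relative to the object."""
    # L = L0 / gamma = L0 * sqrt(1 - beta^2)
    return proper_length * math.sqrt(1.0 - beta * beta)


def beta_from_lengths(proper_length, measured_length):
    """Relative speed (as a fraction of c) that contracts L0 to the measured L."""
    ratio = measured_length / proper_length  # this ratio equals 1 / gamma
    return math.sqrt(1.0 - ratio * ratio)


# A 100 m ship (proper length) measured as 98 m by another inertial observer.
beta = beta_from_lengths(100.0, 98.0)
print(f"relative speed = {beta:.3f}c")

# The same relative speed applied to a ship of proper length 150 m.
print(f"contracted length = {contracted_length(150.0, beta):.1f} m")

# Speed needed for a 50% reduction in the proper length.
print(f"speed for 50% contraction = {beta_from_lengths(1.0, 0.5):.3f}c")
```

The numbers used here are those of the spaceship examples in this section, so the sketch can be used to check the worked answers.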
Figure 1910 (diagram: rod AB lying along the x-axis with its ends at $x_1$ and $x_2$)
In terms of Galilean relativity Mary and Paul would both obtain the same value for the length AB of the rod. But this is not the case if we apply the Lorentz transformations.

Exercise 19.3 (a)

1. Use the definition of the $\gamma$ function to confirm its value as shown by the previous graph when the velocity is 0.5c, 0.8c and 0.95c (Figure 1909).
2. Mary measures the time between two events as being separated by an interval of 2.0 ms. Paul measures the interval to be 2.5 ms. Calculate the relative velocity of Mary and Paul.
Example
Mary is travelling in a space ship, which is not accelerating. To her the length of the space ship is 100 m. To Paul, who is travelling in another space ship which is also not accelerating, Mary's space ship has a length of 98 m. Calculate the relative velocity of Paul and Mary.

Solution
The proper length is 100 m, therefore $\gamma = \dfrac{100}{98} = 1.02$

so that

$\gamma = 1.02 = \dfrac{1}{\sqrt{1 - \dfrac{v^2}{c^2}}}$

giving $v^2 = 0.04c^2$, so that $v = 0.2c$.

Example
To Paul, his space ship measures 150 m. Determine the length that it will appear to have to Mary.

Solution
The $\gamma$ factor is the same for both Paul and Mary, but this time 150 m is the proper length, so Mary will see the ship contracted by a factor $\dfrac{1}{\gamma}$, i.e. to 147 m. Since both observers are inertial observers, both will observe the length contraction and both observers will be correct.

Exercise 19.3 (b)

1. A spaceship is travelling away from the Earth with a speed of 0.6c as measured by an observer on the Earth. The rocket sends a light pulse back to Earth every 10 minutes as measured by a clock on the spaceship.
(a) Calculate the distance that the rocket travels between light pulses as measured by
i. the observer on Earth.
ii. somebody on the spaceship.
(b) If the Earth observer measures the length of the spaceship as 60 m, determine the proper length of the spaceship.

2. Calculate the relative velocity between two inertial observers which produces a 50% reduction in the proper length.
appear as about 5 seconds for Paul. So when Mary returns to Earth she will be about 39 years old and Paul will be about 77 years old.

But if all motion is relative, why can't we regard Paul as being the person in motion and Mary at rest? As we shall see, it has been verified experimentally that time does go slower in moving reference frames. So this is the paradox. By symmetry the only possible result is that Paul and Mary must be the same age when they meet up again on Earth.

However, for Mary to come back to Earth from the star, she has to turn round and this involves an acceleration, so this is no longer a symmetrical situation. She also has to accelerate away from the Earth and slow down to land back on Earth. This, therefore, is the way round the paradox: there is no paradox. The observer that experiences the acceleration is the person that is moving. If both observers are in inertial reference frames then they can never meet to compare their ages or their clocks unless one of them slows down, and the observer that slows down is no longer in an inertial reference system.
Velocity addition

Figure 1911 (a) and (b)
The object P is moving horizontally with a speed u relative to Paul. According to a Galilean transformation, Mary would measure the speed of the object as $u' = u - v$. If the object is moving in the other direction, Figure 1911 (b), then the speed as measured by Mary would be given by $u' = u + v$. If P were a light beam, then Mary would measure the speed of the beam to be $c + v$. But if we are to believe the Special Theory then this cannot be so, since all inertial observers must measure the same value for the speed of light. The Galilean transformations for velocity cannot therefore be correct. It is not difficult to show that, if we apply the Lorentz transformations for displacement and time, then the velocity transformation equations become

$u' = \dfrac{u - v}{1 - \dfrac{uv}{c^2}}$ and $u = \dfrac{u' + v}{1 + \dfrac{u'v}{c^2}}$

Exercise 19.4 (a)

1. Use the Lorentz velocity transformation equations to show that any two inertial observers will always measure the same value for the velocity of light.

H.4.6 Distinguish between the energy of a body at rest and its total energy when moving.
H.4.7 Explain why no object can ever attain the speed of light in a vacuum.
© IBO 2007
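The contrast between the Galilean and Lorentz velocity transformations discussed in the velocity addition section can be illustrated numerically. This is a minimal sketch, assuming the standard one-dimensional form $u' = (u - v)/(1 - uv/c^2)$ and working in units where c = 1; the function names are hypothetical.

```python
def galilean_velocity(u, v):
    """Velocity of the object in Mary's frame according to Galilean relativity."""
    return u - v


def lorentz_velocity(u, v, c=1.0):
    """Velocity of the object in Mary's frame according to Special Relativity."""
    return (u - v) / (1.0 - u * v / c ** 2)


v = 0.6  # Mary's speed relative to Paul, as a fraction of c

# A light beam travelling towards Mary (u = -c) and away from her (u = +c).
for u in (-1.0, 1.0):
    print(f"u = {u:+.1f}c  Galilean: {galilean_velocity(u, v):+.2f}c  "
          f"Lorentz: {lorentz_velocity(u, v):+.2f}c")

# Two objects approaching at 0.5c each do not have a relative speed of c.
print(f"relative speed = {lorentz_velocity(0.5, -0.5):.2f}c")
```

The Galilean result for the light beam depends on v, whereas the Lorentz result always has magnitude c, which is the property that the exercise asks you to show algebraically.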
If $m_0$ is the mass of the body when it is at rest with respect to the observer, the so-called rest mass, then its mass $m$ when moving at a speed $v$ relative to the observer is given by

$m = \gamma m_0$

In classical mechanics the work done in accelerating an object from rest is equal to the kinetic energy $\frac{1}{2}mv^2$ gained by the object. This equation will not apply in Special Relativity. In fact, if we are still to believe in the conservation of energy we must look for a different relationship between the work done and the energy transferred. It might be tempting to substitute $m$ as $\gamma m_0$ but this does not in fact lead to the correct physical interpretation. In thinking along these lines Einstein was led to the idea of mass and energy being interchangeable, such that the gain in mass of an accelerated body could be equated to a gain in energy. This led him to the celebrated equation

$E = mc^2$

In this equation, $E$ is the total energy of the object and $m$ is its relativistic mass. If the object is at rest then it has a rest mass energy given by

$E_0 = m_0c^2$

If we combine these two equations in terms of the work done when a force accelerates an object from rest then, at a certain speed $v$, the object will have a total mass-energy $mc^2$. Its mass-energy will have changed from rest by an amount $E_k$, where

$E_k = E - m_0c^2 = mc^2 - m_0c^2$

It is this change in energy which is the gain in the kinetic energy of the object and is equal to the work done on the object. It is important to understand the significance of these equations, namely that energy and mass are entirely equivalent.

A word on units

In the equation $E = mc^2$, if $m$ is measured in kilograms and $c$ in m s$^{-1}$, then the unit of $E$ is clearly joules. However, the theory of relativity only becomes significant for speeds close to that of $c$, and this usually means we are dealing with the acceleration or movement of atomic or sub-atomic particles. For example, an electron accelerated from rest through a potential difference of $10^6$ volts will attain an energy of 1 MeV as measured in electron-volts. It is much more convenient to express the energy of particles in eV (or multiples thereof) such as MeV. Similarly, it is much more convenient to express their mass in units of MeV c$^{-2}$. So, for example, the rest mass of a proton is 938 MeV c$^{-2}$ which equals

$\dfrac{938 \times 10^6 \times 1.6 \times 10^{-19}}{9 \times 10^{16}}$ kg $= 1.67 \times 10^{-27}$ kg.

Example
A coal fired power station has a power output of 100 MW. Calculate the mass of coal that is converted into energy in one year ($3.15 \times 10^7$ s).

Solution
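The unit conversion for the proton and the coal-fired power station example can both be checked with $E = mc^2$. The sketch below is an illustration only: it assumes the rounded constants $c = 3.0 \times 10^8$ m s$^{-1}$ and $e = 1.6 \times 10^{-19}$ C used in the text, and the function names are not from the source.

```python
C = 3.0e8          # speed of light in m/s (rounded value used in the text)
E_CHARGE = 1.6e-19  # elementary charge in C, so 1 eV = 1.6e-19 J


def mev_per_c2_to_kg(mass_mev):
    """Convert a mass given in MeV c^-2 to kilograms."""
    energy_joules = mass_mev * 1.0e6 * E_CHARGE
    return energy_joules / C ** 2


def mass_from_energy(energy_joules):
    """Mass equivalent (kg) of an energy in joules, from E = m c^2."""
    return energy_joules / C ** 2


# Proton rest mass quoted in the text as 938 MeV c^-2.
print(f"proton mass = {mev_per_c2_to_kg(938):.3e} kg")

# Power station: 100 MW for one year (3.15e7 s).
energy = 100e6 * 3.15e7  # energy output in joules
print(f"mass converted = {mass_from_energy(energy):.3f} kg")
```

Under these assumptions the power station converts only a few tens of grams of mass into energy in a year, even though it burns an enormous mass of coal.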
In view of the equivalence of mass and energy we can no longer talk about the conservation of mass and the conservation of energy as two separate laws of physics; instead we have just one law, the conservation of mass-energy. However, for many chemical reactions we can use the separate laws since the mass deficiency involved is usually very small. For example, when one gram mole of carbon combines with two gram moles of oxygen, the mass deficiency is only about $10^{-9}$ g, which is far too small to detect. However, as we have learned in Chapter 11, the mass-energy equation plays a very important role in nuclear reactions.
Example
An electron is accelerated through a potential difference of $2.0 \times 10^6$ V. Calculate, after acceleration, the velocity of the electron applying
(a) Newtonian mechanics
(b) relativistic mechanics.

Solution
(a) Using Newtonian mechanics, the work done $Ve$ is equal to the gain in kinetic energy $\frac{1}{2}m_0v^2$, so that

$v = \sqrt{\dfrac{2Ve}{m_0}} \approx 8.4 \times 10^8$ m s$^{-1}$

A value which is clearly greater than the speed of light.

(b) Using relativistic mechanics we have that the energy supplied, i.e. the work done, $= Ve$ and this equals the gain in KE of the electron such that

$Ve = mc^2 - m_0c^2 = \gamma m_0c^2 - m_0c^2$ (1)

The rest mass energy of the electron is

$m_0c^2 = \dfrac{9.1 \times 10^{-31} \times 9 \times 10^{16}}{1.6 \times 10^{-19}}$ eV $= 0.51$ MeV.

(We can therefore express the mass of the electron as 0.5 MeV c$^{-2}$.)

So substituting 0.5 MeV into equation (1) we have $2.0 = 0.5\gamma - 0.5$, so that $\gamma = 5$.

Using the equation for the gamma factor, a value of $\gamma = 5$ gives the velocity of the electron as 0.98c.

Exercise 19.4 (b)

1. Calculate the speed of a particle relative to a laboratory observer when its kinetic energy is equal to its rest-mass energy.
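The difference between the Newtonian and relativistic answers for the accelerated electron can be seen in a short calculation. This sketch assumes the potential difference is $2.0 \times 10^6$ V (the value consistent with $\gamma = 5$ in the worked solution) and uses the rounded constants from the text; the function names are illustrative.

```python
import math

C = 3.0e8                 # speed of light, m/s
E_CHARGE = 1.6e-19        # elementary charge, C
ELECTRON_MASS = 9.1e-31   # electron rest mass, kg
REST_ENERGY_MEV = 0.51    # electron rest-mass energy, MeV


def classical_speed(pd_volts):
    """Speed from (1/2) m v^2 = V e, ignoring relativity."""
    return math.sqrt(2.0 * E_CHARGE * pd_volts / ELECTRON_MASS)


def relativistic_beta(pd_volts):
    """Speed as a fraction of c from V e = (gamma - 1) m0 c^2."""
    kinetic_mev = pd_volts * 1.0e-6  # an electron gains 1 eV per volt
    gamma = 1.0 + kinetic_mev / REST_ENERGY_MEV
    return math.sqrt(1.0 - 1.0 / gamma ** 2)


pd = 2.0e6  # assumed potential difference in volts
print(f"Newtonian:    v = {classical_speed(pd):.2e} m/s "
      f"({classical_speed(pd) / C:.2f}c)")
print(f"Relativistic: v = {relativistic_beta(pd):.3f}c")
```

The Newtonian value exceeds c, which is impossible, while the relativistic value stays just below c however large the potential difference becomes.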
H.5.2
H.5.3 H.5.4
H.5.5
Figure 1912 (diagram labels: diffuse monochromatic light source, beam splitter A, compensator plate B, fixed mirror C, moveable mirror D, observer O)
Light from a diffuse monochromatic source is incident on the half silvered mirror A. This acts as a beam splitter such that some of the light goes on to the moveable mirror D and some on to the fixed mirror C. On reflection from these mirrors, light from both mirrors arrives at the observer O. B is a compensator plate equal in thickness to the beam splitter. If the path difference of the two rays on reaching the observer is a whole number of wavelengths, then a bright spot of light will be observed at O, and if it is an odd number of half wavelengths, a dark spot will be observed at O. However, since a diffuse source is used, a great many paths of slightly differing lengths will occur and the overall effect will be to observe a series of light and dark fringes at O: an interference pattern, in fact. If the mirror D is moved backwards or forwards the interference pattern will be shifted, and the amount that it is shifted will depend on the amount that the mirror D is moved.

The actual apparatus used by Michelson and Morley was very large. The effective length of the two arms was about 10 m. One arm was aligned to be parallel to the direction of motion of the Earth in its orbit such that the other arm was at right angles. If the ether through which the light travels is at rest and the Earth is moving through it, then the light will take slightly different times to traverse the two paths. Hence there will be an observed shift in the interference pattern.

The situation is analogous with two swimmers in a river both setting off from the same point. One swims parallel to the direction of the current and one swims at right angles to the direction of the current. They each swim the same distance from the starting point and then return. A little thought will show that the trip for the swimmer who swims parallel to the current is going to take longer than the trip for the swimmer who sets off at right angles.
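The swimmer analogy can be made quantitative with a short sketch. It assumes a swimmer of speed c in still water, a current of speed v and an out-and-back distance L each way; these symbols and function names are illustrative and are not taken from the text.

```python
import math


def parallel_trip_time(distance, c, v):
    """Out-and-back time along the current: downstream at c + v, upstream at c - v."""
    return distance / (c + v) + distance / (c - v)


def perpendicular_trip_time(distance, c, v):
    """Out-and-back time across the current.

    The swimmer must aim slightly upstream, so the speed straight across
    the river is sqrt(c^2 - v^2) in both directions.
    """
    return 2.0 * distance / math.sqrt(c ** 2 - v ** 2)


L, c, v = 100.0, 1.0, 0.5  # metres, m/s, m/s (arbitrary illustrative values)
t_par = parallel_trip_time(L, c, v)
t_perp = perpendicular_trip_time(L, c, v)
print(f"parallel trip:      {t_par:.1f} s")
print(f"perpendicular trip: {t_perp:.1f} s")
print(f"ratio:              {t_par / t_perp:.3f}")
```

The parallel trip always takes longer whenever v is not zero, and the ratio of the two times is $1/\sqrt{1 - v^2/c^2}$. It is this difference in travel times that the Michelson-Morley experiment was designed to detect as a fringe shift.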
The result of the Michelson-Morley experiment was spectacular inasmuch as no shift in the interference pattern was observed. Many attempts were made to explain this null result before Einstein recognised that there was no ether and therefore no absolute reference frame, and that all inertial observers will measure the same value for the speed of light.
H.6.1 Apply the relation for the relativistic momentum $p = \gamma m_0u$ of particles.
H.6.2 Apply the formula $E_K = (\gamma - 1)m_0c^2$ for the kinetic energy of a particle.
H.6.3 Solve problems involving relativistic momentum and energy.
© IBO 2007

Note: Derivation of the relativistic momentum and energy formulae will not be examined.

In Classical Physics, we saw in Topic 2.3 that there is a useful relationship between the momentum $p$ and the kinetic energy $E_k$ of a particle, namely

$E_k = \dfrac{p^2}{2m}$

where $m$ is the mass of the particle.
In Special Relativity, we can find an equally useful relationship between momentum and energy, but in this instance the energy $E$ is the total energy of the particle. We have that

$m = \dfrac{m_0}{\sqrt{1 - \dfrac{v^2}{c^2}}}$

such that if we square both sides and rearrange we have

$m^2c^2 = m_0^2c^2 + m^2v^2$

and hence,

$(mc^2)^2 = m_0^2c^4 + (mv)^2c^2$

But $mv$ is the momentum $p$ of the particle and $mc^2$ is equal to $E$, the total energy. Hence

$E^2 = m_0^2c^4 + p^2c^2$

Example
Find the momentum and speed of an electron after it has been accelerated through a potential difference of $1.8 \times 10^5$ V.

Solution
The total energy of the electron is

$E = Ve + m_0c^2$.

If we bear in mind that $m_0c^2$ is equal to 0.51 MeV, and $Ve$ = 0.18 MeV, then in this instance we have that

$E = (0.18 + 0.51) \times 10^6$ eV, so that $E^2 = 0.4761 \times 10^{12}$ eV$^2$

and $m_0^2c^4 = 0.2601 \times 10^{12}$ eV$^2$.

Also, from the relativistic relation between total energy and momentum, we have

$p^2 = \dfrac{E^2 - m_0^2c^4}{c^2}$

hence,

$p^2 = \dfrac{0.216 \times 10^{12}\ \text{eV}^2}{c^2}$, giving $p = 0.46$ MeV c$^{-1}$.

Or, $p = 2.5 \times 10^{-22}$ N s.

Since particle physicists are often dealing with energies measured in electron volts, they often express momentum in the units MeV c$^{-1}$ (energy/speed).

To find the speed, we need to find the mass of the electron after acceleration. We have that

$E_{Total} = mc^2 = Ve + m_0c^2$

Therefore $m = (Ve + m_0c^2)c^{-2} = (0.18 + 0.51)$ MeV c$^{-2}$ = 0.69 MeV c$^{-2}$

and $v = p/m$ = 0.46 MeV c$^{-1}$ / 0.69 MeV c$^{-2}$ = 0.67c.

This demonstrates how much easier relativistic dynamic calculations are when we deal in the units MeV for energy, MeV c$^{-2}$ for mass and MeV c$^{-1}$ for momentum.

Example
A gamma photon that is travelling close to a lead atom materialises into an electron-positron pair, i.e.

$\gamma \rightarrow e^- + e^+$

The initial energy of the photon is 3.20 MeV. Neglecting the recoil of the lead atom, calculate the speed and mass of the electron and positron.

Solution
For one particle

$E_{tot} = 1.60$ MeV $= E_K + m_0c^2 = E_K + 0.511$ MeV

$E_K = 1.09$ MeV $= (\gamma - 1)m_0c^2$

to give $\gamma = 3.13$, which gives $v = 0.948c$. The mass of each particle is $m = \gamma m_0 = 1.60$ MeV c$^{-2}$.
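The energy-momentum relation $E^2 = m_0^2c^4 + p^2c^2$ is convenient to evaluate when energies are in MeV, masses in MeV c$^{-2}$ and momenta in MeV c$^{-1}$, because the factors of c then drop out. The following sketch works in those units; the function name `energy_momentum_speed` and its parameters are assumptions made for illustration.

```python
import math


def energy_momentum_speed(kinetic_mev, rest_energy_mev=0.51):
    """Return (total energy in MeV, momentum in MeV/c, speed as a fraction of c).

    Uses E = Ek + m0 c^2, p c = sqrt(E^2 - (m0 c^2)^2) and v / c = p c / E.
    """
    total = kinetic_mev + rest_energy_mev
    momentum = math.sqrt(total ** 2 - rest_energy_mev ** 2)
    beta = momentum / total
    return total, momentum, beta


# Electron accelerated through 1.8e5 V, so Ek = 0.18 MeV.
total, p, beta = energy_momentum_speed(0.18)
print(f"E = {total:.2f} MeV, p = {p:.2f} MeV/c, v = {beta:.2f}c")

# Each particle from a 3.20 MeV photon: total energy 1.60 MeV, rest energy 0.511 MeV.
total, p, beta = energy_momentum_speed(1.60 - 0.511, rest_energy_mev=0.511)
print(f"E = {total:.2f} MeV, p = {p:.2f} MeV/c, v = {beta:.3f}c")
```

The speed follows from $v/c = pc/E$, which is the same as dividing the momentum in MeV c$^{-1}$ by the mass in MeV c$^{-2}$.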
because High School physics generally makes no attempt to distinguish between inertial and gravitational mass. As we have seen (Chapter 2 and Chapter 9), the concept of mass arises in two very different ways in Physics. You met it for the first time in connection with the property of inertia: all objects are reluctant to change their state of motion. This reluctance is measured by a property of the object called its inertial mass. The concept of inertial mass is quantified in Newton's Second law, $F = ma$. But the concept of mass also arises in connection with Newton's gravitational law, in which the force between two point masses $m_1$ and $m_2$ separated by a distance $r$ is given by

$F = \dfrac{Gm_1m_2}{r^2}$

In this respect, mass can be thought of as the property of an object which gives rise to the gravitational force of attraction between all objects, and is therefore called gravitational mass. Since gravitational and inertial mass measure entirely different properties there is no reason why we should consider them to be identical quantities. However, consider an object close to the surface of the Earth which has a gravitational mass $m_g$ and an inertial mass $m_I$. If the gravitational mass of the Earth is $M_g$ then the magnitude of the gravitational force exerted on the object is given by

$F = \dfrac{GM_gm_g}{R^2}$

where $R$ is the radius of the Earth. The object will accelerate according to Newton's Second law such that

$F = \dfrac{GM_gm_g}{R^2} = m_Ia$
Figure 1913
If he releases an object as shown he will observe that it falls to the ground with acceleration g. An outside observer would say that the ball stays where it is but the lift floor is accelerating upwards towards the ball with acceleration g.

Figure 1914
In Figure 1914, the lift is at the surface of the Earth and again the person drops the ball and observes it to accelerate downwards with an acceleration g. An outside observer would say that this is because of the gravitational attraction of the Earth.

The two results (Figures 1913 and 1914) are identical and according to Einstein there is no physical experiment that an observer can carry out to determine whether the force acting on the object arises from inertial effects due to the acceleration of the observer's frame of reference or whether it arises because of the gravitational effects of a nearby mass. This is the so-called Einstein Principle of Equivalence and it can be stated as follows: there is no way in which gravitational effects can be distinguished from inertial effects.

In this respect Einstein concluded that all motion is relative. If we consider the situation in Figure 1913 we can choose the lift to be the fixed reference system and it is the rest of the Universe that must be considered to be accelerating. It is this acceleration of the Universe, in Einstein's interpretation, that generates what Newton called a gravitational field. According to Einstein there is no absolute choice of a reference system; only relative motion can be considered.

Another interesting situation arises here in which we can consider a lift in free fall close to the surface of the Earth. When the person in the lift releases the object in this situation then the object will stay where it is. The object is in fact weightless. This is the reason that astronauts in orbit around the Earth are weightless: they are in free fall and, although they are in a gravitational field, because of their acceleration they will feel no gravitational force.

Figure 1915 (diagram labels: light ray entering at A and striking the opposite wall at B; direction of acceleration)
The General Theory predicts that light will be bent by gravity. In Figure 1915 a person is in an accelerating space ship far away from any mass. A ray of light enters through a window at A. Because of the acceleration of the ship the light will strike the opposite wall at point B, which is below A. To the person in the ship the path of the light ray will therefore appear to be bent.
Figure 1916 The space ship is stationary (diagram labels: light ray from A to B; Earth)
In Figure 1916 a space ship is at rest on the surface of the Earth. If the Einstein principle of equivalence is correct then this situation cannot be distinguished from the situation described in Figure 1915. The path of a light beam entering a window of this space ship will therefore also appear to be bent. The prediction is therefore that light that passes close to large masses will have its path altered.

there is nothing intrinsically wrong in having four lines mutually at right angles describing a space-time continuum.

In Figure 1917, space is represented by the conventional x-axis and time t by an axis perpendicular to the x-axis.

Figure 1917 Time space graph (lines A, B and C plotted against the x-axis)
The line A represents a stationary particle and the line B represents a particle that starts from the point where A is at rest and moves with constant velocity away from this point. The line C represents a particle that has a certain velocity at some different point in space and is slowing down as it moves away from this point.
Gravitational attraction
The General Theory of Relativity essentially does away with the concepts of gravitational mass and gravitational force. How then do we account for the gravitational force of attraction between objects? We have seen that, in Special Relativity, space and time are intimately linked and an event is specified by four coordinates of spacetime. Einstein proposed that space-time is curved by the presence of mass. An analogy is to think of a stretched elastic membrane onto which is placed a heavy object. In the vicinity of the object, the membrane will no longer be flat but will be curved, see Figure 1918. The curvature will be greatest close to the object and the general curvature will also increase as the mass of the object increases.
Figure 1918
We can explain gravitational attraction in terms of this warping of space.

Figure 1920
Consider two objects moving in the direction shown in Figure 1920. Each object curves its local space and will therefore move towards the other object. As the objects get closer, the local space-time will become more curved and they will behave as though they were experiencing an ever increasing force of attraction. If we choose the appropriate geometry to describe the curvature of space, then the objects will move just as if there were an inverse square force between them. You can think of the analogy of two ships at the equator sailing due North. Because of the curvature of the Earth's surface they will get closer and closer together even though they are following a straight line path. However, do not take this analogy too literally since, in Einstein's theory, it is the objects themselves that curve the space.

Einstein proposed that all objects will take the shortest possible distance between two events in space-time. Such a distance is known as a geodesic. The geodesic for a plane surface is a straight line and for a sphere, a great circle. In this sense, the planets are actually following geodesics in the particular geometry of the space-time produced by the mass of the Sun.

Black holes

In 1939 Oppenheimer and Snyder pointed out that the General Theory predicts the existence of black holes. During the life-time of some stars, there is a period when they undergo collapse. As they collapse, their density increases and therefore their surface gravity increases. Radiation leaving the surface will not only be redshifted by this gravitational field but, as the gravitational field increases and the surrounding space-time becomes more and more warped, the path of the radiation will become more and more curved. If the surface gravity increases sufficiently there will come a point when the path of the radiation is so curved that none of the radiation will leave the surface of the star. The star has effectively become a black hole.

From a classical point of view we can think of a black hole in terms of escape velocity. A star becomes a black hole when the escape velocity at the surface becomes equal to the speed of light. The radius at which a star would become a black hole is known as the Schwarzschild radius, after the person who first derived the expression for its value. General Relativity enables this radius to be derived for a particular star, but the derivation is beyond the scope of HL physics. The Schwarzschild radius $R_{sch}$ is given by

$R_{sch} = \dfrac{2GM}{c^2}$

(Coincidentally, Newtonian mechanics gives the same value.) The surface of a black hole as defined by the Schwarzschild radius is called the event horizon, since inside the surface all information is lost. It is left as an exercise for you to show that if our Sun were to shrink until its radius was 3000 m then it would become a black hole.

Of course, if no radiation can leave a black hole and all radiation falling on it will also be trapped, we have to ask how such things can be detected, should they exist. One possibility is to observe a black hole as a companion in a binary star system. Another way is to observe the effect that a black hole has on high frequency gamma radiation as it passes close to the black hole. Suffice it to say at this point that astronomers do not doubt the existence of black holes.
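The exercise about the Sun can be checked with a direct evaluation of $R_{sch} = 2GM/c^2$. This is a rough sketch: the solar mass of $1.99 \times 10^{30}$ kg is an assumed value that is not given in the text, and `schwarzschild_radius` is an illustrative name.

```python
G = 6.67e-11  # gravitational constant, N m^2 kg^-2
C = 3.0e8     # speed of light, m/s


def schwarzschild_radius(mass_kg):
    """Radius (m) below which a body of the given mass becomes a black hole."""
    return 2.0 * G * mass_kg / C ** 2


SOLAR_MASS = 1.99e30  # kg, assumed value for the mass of the Sun

print(f"Sun: R_sch = {schwarzschild_radius(SOLAR_MASS):.0f} m")
```

With these values the result is close to 3000 m, as stated in the text. Because the radius is directly proportional to the mass, a star of ten solar masses would have a Schwarzschild radius of roughly 30 km.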
H.7.8 Describe black holes.
H.7.9 Define the term Schwarzschild radius.
H.7.10 Calculate the Schwarzschild radius.
H.7.11 Solve problems involving time dilation close to a black hole.
© IBO 2007
$\Delta t = \dfrac{\Delta t_0}{\sqrt{1 - \dfrac{R_S}{r}}}$

where $R_S$ is the Schwarzschild radius of the black hole. This effectively means that, if the person were to observe a clock approaching a black hole, the motion of the hands of the clock would appear to get slower and slower the nearer the clock gets to the event horizon of the black hole. At the event horizon, they would stop moving and time would stand still.
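The behaviour of the gravitational time dilation formula near the event horizon can be explored numerically. The sketch below assumes that r in the formula is measured from the centre of the black hole, so an observer a distance $3R_S$ from the event horizon is taken to be at $r = 4R_S$; that reading, and the function name, are assumptions rather than statements from the text.

```python
import math


def distant_interval(local_interval, r_over_rs):
    """Interval measured far from the black hole for an event lasting
    local_interval at a distance r = r_over_rs * R_S from its centre."""
    if r_over_rs <= 1.0:
        raise ValueError("the formula only applies outside the event horizon")
    return local_interval / math.sqrt(1.0 - 1.0 / r_over_rs)


# An event lasting 4.0 s for an observer assumed to be at r = 4 R_S.
print(f"at r = 4 R_S:   {distant_interval(4.0, 4.0):.2f} s")

# The dilation grows without limit as the clock approaches the event horizon.
for ratio in (2.0, 1.1, 1.01, 1.001):
    print(f"at r = {ratio} R_S: {distant_interval(4.0, ratio):.1f} s")
```

As r approaches $R_S$ the interval seen by the distant observer grows without limit, which is the sense in which time appears to stand still at the event horizon.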
Example
A person a distance of 3 $R_S$ from the event horizon of a black hole measures an event to last 4.0 s. Calculate how long the event would appear to last for a person outside the field of the black hole.

Solution

Gravitational red-shift

H.7.12 Describe the concept of gravitational red-shift.
H.7.13 Solve problems involving frequency shifts between different points in a uniform gravitational field.
H.7.14 Solve problems using the gravitational time dilation formula.
© IBO 2007

Figure 1921 (a) and (b) (diagram labels: (a) accelerating space ship with sources 1 and 2, v = at; (b) space ship at rest on the Earth)
If we consider the space ship to be at rest on the surface of the Earth as shown in Figure 1921 (b), then because of the principle of equivalence, the same effects will be observed as in diagram (a). This means that the observed frequency of light emitted from a source depends upon the position of the source in a gravitational field. For example, light emitted from the surface of a star will be red-shifted as seen by an observer on Earth. Light emitted from atoms in the star's corona will not be red-shifted as much.

Since frequency is essentially a measure of time, a consequence of General Relativity is that, to an observer on the top floor of a building, clocks on the ground floor will appear to run more slowly. The conclusion is that time slows in the presence of a gravitational field. As mentioned above, a remarkable consequence of this is that, to an outside observer, time stops at the event horizon of a black hole.
Figure 1922 (diagram labels: sources 1 and 2 separated by a distance h; v = gt)
Let the acceleration a = g and let the distance between the two sources be h. The time t for the light from source 1 to reach the observer is therefore

$t = \dfrac{h}{c}$

In this time the speed gained by the observer O is

$v = gt = \dfrac{gh}{c}$

Because of the effective Doppler shift of the light from source 2 (remember, to the observer O this source will effectively be moving away from him at speed v = gt when the light from the source reaches him), the frequency $f'$ measured by O will be given by the Doppler shift equation, i.e.,

$f' = f\left(1 - \dfrac{v}{c}\right)$

If we now substitute for v then

$f - f' = \Delta f = \dfrac{fgh}{c^2}$

so that, writing the height difference h as $\Delta h$,

$\dfrac{\Delta f}{f} = \dfrac{g\Delta h}{c^2}$

This equation can also be derived on the basis of Einstein's principle of equivalence by considering the loss in energy of a photon from source 1 as it moves to source 2 in the gravitational field of the Earth. The two separate proofs of the gravitational redshift equation again show that there is no difference between inertia and gravity.

Example
Calculate the redshift that is observed between radiation emitted from the surface of a neutron star and radiation emitted from the centre of the star if the mass of the star is $10^{31}$ kg and its radius is 10 km.

Solution
We need to calculate g for the star, which we do from

$g = \dfrac{GM}{r^2}$

and then substitute in the equation $\dfrac{\Delta f}{f} = \dfrac{g\Delta h}{c^2}$ with $\Delta h$ = 10 km. You can do this to show that

$\dfrac{\Delta f}{f} = 0.74$.

Which is indeed an enormous redshift. Note that we have assumed that g is constant.
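The gravitational redshift relation $\Delta f/f = g\Delta h/c^2$ can be evaluated for both a neutron star and a signal leaving the Earth. This sketch keeps the assumption of a constant g over the height difference and uses rounded constants; the function names are illustrative only.

```python
G = 6.67e-11  # gravitational constant, N m^2 kg^-2
C = 3.0e8     # speed of light, m/s


def surface_gravity(mass_kg, radius_m):
    """Gravitational field strength g = G M / r^2 at the surface."""
    return G * mass_kg / radius_m ** 2


def fractional_shift(g, delta_h):
    """Fractional frequency shift, assuming g is constant over delta_h."""
    return g * delta_h / C ** 2


# Neutron star: mass 1e31 kg, radius 10 km, height difference 10 km.
g_star = surface_gravity(1.0e31, 1.0e4)
print(f"neutron star: delta f / f = {fractional_shift(g_star, 1.0e4):.2f}")

# Earth: a 100 MHz signal received 200 km above the surface, with g = 9.8 m/s^2.
shift = fractional_shift(9.8, 200e3)
print(f"Earth: delta f / f = {shift:.2e}, delta f = {shift * 100e6:.2e} Hz")
```

The shift for the Earth is about eleven orders of magnitude smaller than for the neutron star, which is why very precise frequency measurements are needed to detect the effect near the Earth.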
A satellite communication signal has a frequency of 100 MHz at the surface of the Earth. What frequency will be measured by an astronaut in a satellite which is in orbit 200 km above the surface of the Earth?
474
070817 Physics Chapt 19 for Paul474 474 22/05/2009 12:08:58 PM
RELATIVITY
The Special Theory introduced a completely new way of thinking about time and it was able to account for the Lorentz transformations encountered in Maxwells theory. It also made several predictions all of which have been born out by experiment. We therefore accept that the Special Theory tells us the correct way in which to think about time. The General Theory introduces a completely new way of thinking about space, time and gravity and if the theory is to be accepted then it too must account for known phenomena and make predictions that can be verified by experiment. In this section we look at some of the evidence that supports the predictions made by General Relativity.
Solution
Again, assuming that g is constant we can use the red shift equation in the form
f g h --- = ------2 f c
g h
9.8 200 10 3
Gravitational lensing
Einstein suggested a method by which the effect of gravity on the path of light could be detected. The position of a star can be measured very accurately relative to the position of other stars. Einstein suggested measuring the position of a particular star, say in June, and then again six months later when the Earth is on the other side of the Sun relative to the star.
apparent position of star
star
December
Earth position in
Sun
Figure 1923 Gravitational lensing The path of the light from the star reaching the Earth according to Einstein should now be bent as it passes close to the Sun. This will cause an apparent shift in the position of the star. See Figure 1923. Einstein predicted that the path of starlight should be deflected by 1.75 seconds of arc as it passes by the Sun. To observe starlight that passes close to the Sun, then the stars must be observed during the day and the only way that this can be done is during a total eclipse of the Sun. The General Theory was published in 1917 and by good fortune a total eclipse of the Sun was predicted for 29th March 1919 near the Gulf of Guinea and Northern Brazil. Expeditions were mounted to both destinations and scientists were able to collect enough photographs of suitable stars to test Einsteins prediction. The location
INTRODUCTION
For any physical theory to be accepted it must not only explain known phenomena but also make predictions that can be verified experimentally.
CHAPTER 19 (OPTION H)
of the stars indicated that the path of the light had been deviated by 1.64 seconds of arc, a result that compared very favourably with the Einstein prediction. However, doubt has since been cast on Eddington's interpretation of the data. More recently, observations have been made of quasar images indicating that the light from the quasars has been deflected by galaxies or even clusters of galaxies on its passage to Earth. Even more accurate work has been carried out on radio signals transmitted from Earth and reflected from the planets. Solar space-time curvature will affect the delay times for the echo signals. The experimental results are in excellent agreement with the General Theory. This bending of light by large gravitational masses is often referred to as gravitational lensing, in analogy with the bending of light by optical lenses.
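The predicted deflection of 1.75 seconds of arc can be reproduced with a short calculation. The sketch below uses the standard General Relativity result for light grazing a mass M at distance R, α = 4GM/(c²R); this formula is not quoted in the text and is brought in here as an assumption, as are the rounded solar constants.

```python
import math

G = 6.674e-11     # gravitational constant, N m^2 kg^-2
C = 2.998e8       # speed of light, m/s
M_SUN = 1.989e30  # mass of the Sun, kg (rounded)
R_SUN = 6.957e8   # radius of the Sun, m (rounded)


def deflection_arcsec(mass, closest_approach):
    """Deflection of a light ray passing a mass, alpha = 4GM/(c^2 R), in seconds of arc."""
    alpha_rad = 4.0 * G * mass / (C**2 * closest_approach)
    return math.degrees(alpha_rad) * 3600.0


alpha = deflection_arcsec(M_SUN, R_SUN)
print(f"deflection at the Sun's limb = {alpha:.2f} seconds of arc")
```

With these constants the result agrees with the 1.75 seconds of arc quoted for starlight passing close to the Sun.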
Conclusion
The General Theory of Relativity is now accepted as being the correct interpretation of gravity and as such the correct model for our view of space and time. General Relativity and Quantum Theory form the two great theories upon which the whole of Physics rests. To date, all attempts to unify them into one complete theory have been unsuccessful.

TIME DILATION
$$\Delta t = \gamma\,\Delta t_0, \qquad \gamma = \left(1 - \frac{v^2}{c^2}\right)^{-1/2}$$
LENGTH CONTRACTION
$$L = \frac{L_0}{\gamma}$$
MASS INCREASE
$$m = \gamma m_0$$
REST MASS ENERGY
$$E_0 = m_0 c^2$$
TOTAL ENERGY
$$E = mc^2$$
VELOCITY TRANSFORMATION
$$u_x' = \frac{u_x - v}{1 - \dfrac{u_x v}{c^2}}$$
RELATIVISTIC MOMENTUM
$$p = \gamma m_0 u$$
TOTAL ENERGY
$$E = \gamma m_0 c^2 = E_k + m_0 c^2$$
ENERGY-MOMENTUM EQUATION
$$E^2 = p^2 c^2 + m_0^2 c^4$$

Figure 1924 The graph shows how the γ function varies with velocity (expressed as a fraction of the speed of light, from 0.1 to 1.0) and should be referred to in problems in order to verify that a particular value of the function has been calculated correctly.

General Relativity
POSTULATES
I. Mach's principle: inertial and gravitational forces are indistinguishable.
II. Four-dimensional space-time is curved as a result of the presence of mass.
III. Objects take the shortest path between two points in space-time.
GRAVITATIONAL TIME DILATION EQUATION
$$\Delta t = \frac{\Delta t_0}{\sqrt{1 - \dfrac{R_s}{r}}}$$

Experimental evidence
SPECIAL THEORY
The Special Theory is well supported by experimental evidence:
The invariance of the velocity of light with respect to the relative motion of source and observer.
The measured mass increase of accelerated electrons.
The arrival of muons at the surface of the Earth and the measurement of their half-lives, which give evidence of time dilation.
Pair production, which gives evidence of the conservation of mass-energy.
Nuclear binding energy and nuclear processes, which all verify the conservation of mass-energy.
GENERAL THEORY
Except for some minor perturbations the orbits of the planets are ellipses and the major axis of the elliptical orbit is fixed. However, it was observed that the major axis of the orbit of Mercury precesses by some 575 seconds of arc per century. By considering the gravitational effects of all the other planets on the orbit of Mercury, Newton's theory accounted for all but 43 seconds of arc of the precession of the major axis. However, the General Theory accounts for the entire precession.
The apparent displacement of the measured position of stars gives evidence of the bending of the path of a ray of light by a gravitational field.
The Pound-Rebka experiment gives evidence of gravitational redshift.
The existence of black holes gives further evidence of the warping of space by the presence of matter.
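Since values of the γ function are normally read from Figure 1924, it can help to have a way of checking a reading. The following is a minimal sketch that tabulates γ = (1 − v²/c²)^(−1/2) for velocities expressed as a fraction of the speed of light; the function name `lorentz_gamma` is illustrative only.

```python
import math


def lorentz_gamma(beta):
    """Return gamma = 1 / sqrt(1 - beta**2), where beta = v / c and 0 <= beta < 1."""
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must satisfy 0 <= beta < 1")
    return 1.0 / math.sqrt(1.0 - beta**2)


# Tabulate gamma for v = 0.1c, 0.2c, ..., 0.9c
for tenth in range(1, 10):
    beta = tenth / 10.0
    print(f"v = {beta:.1f}c   gamma = {lorentz_gamma(beta):.3f}")
```

Multiplying a rest mass or a proper time interval by the tabulated γ then gives the relativistic mass or the dilated time interval directly.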
Miscellaneous Exercises
In the following exercises, should a particular value of γ be required, then refer to the graph, Figure 1924.
1. State the two postulates of the Special Theory of Relativity and explain, with the use of appropriate diagrams, how two events that are simultaneous to one observer need not necessarily be simultaneous to another observer in a different reference frame.
2. Show that 1 atomic mass unit is equivalent to about 930 MeV.
3. An electron is moving at a constant velocity of 0.90c with respect to a laboratory observer X.
(a) Determine the mass of the electron as measured by X.
(b) Another observer Y is moving at a constant velocity 0.50c with respect to X in a direction opposite to that of the electron in X's reference frame. Determine the mass of the electron as measured by Y.
4. Use the time dilation graph (Figure 1924) to find the γ function when v = 0.5c and 0.8c and hence calculate the relativistic mass increase of an electron when travelling at these speeds.
5. A proton is accelerated from rest through a potential difference of 8.00 × 10⁸ V. Calculate, as measured in the laboratory frame of reference after acceleration, the
i. proton mass.
ii. velocity of the proton.
iii. momentum of the proton (HL only).
iv. total energy of the proton (HL only).
6. Estimate how far you would have to push a ball of mass 2.0 kg with a force of 50 N until its mass was 4.0 kg.
7. If another beam of protons is accelerated at the same time through the same potential difference as in question 5, but in the opposite direction, calculate, after acceleration, the relative velocity of a proton in one beam with respect to a proton in the other beam.
8. Calculate the de Broglie wavelength of electrons that have been accelerated through a potential difference of 1.8 × 10⁵ V.
9. Explain what is meant by Einstein's principle of equivalence.
10. Describe how Einstein's description of the gravitational attraction between two particles differs from that offered by Newton.
11. Summarise the evidence that supports the General Theory of Relativity.
12. γ-rays are emitted from a source placed in a ground floor laboratory. They are measured to have a wavelength of 0.05 nm. If the source is moved to a laboratory on the top floor of the building, they are measured to have a frequency shift of 3.3 × 10⁴ Hz. Estimate the height of the building.
13. Use a spreadsheet to plot a graph that shows the variation with distance r from a black hole of the time dilation Δt of an event.
20 MEDICAL PHYSICS
I.1 The ear and hearing
I.2 Medical imaging
I.3 Radiation in medicine
The outer ear consists of the pinna, the external auditory canal and the ear-drum (tympanic membrane). Sound waves reaching the ear are collected by the pinna, which has an important spatial focussing role in hearing. The sound waves are directed down the external auditory canal, which is about 2.5 cm long and about 7 mm in diameter. It is closed at one end by the ear-drum, which consists of a combination of radial and concentric fibres about 0.1 mm thick with an area of about 60 mm² that vibrate with a small amplitude. The ear canal is like a closed organ pipe: it is a shaped tube enclosing a resonating column of air that vibrates with an optimum resonant frequency around 3 kHz. The ear-drum acts as an interface between the external and middle ear. As sound travels down the ear canal, air pressure waves from the sound set up sympathetic vibrations in the taut membrane of the ear-drum, which passes these vibrations on to the middle ear structure.
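The closed-pipe picture of the ear canal can be checked with a one-line estimate. The sketch below assumes the simplest model, a uniform pipe closed at one end whose fundamental has a quarter of a wavelength along its length (L = λ/4), and a speed of sound in air of 330 m s⁻¹; both are idealisations.

```python
def closed_pipe_fundamental(length_m, speed_of_sound=330.0):
    """Fundamental frequency of a pipe closed at one end: L = lambda / 4, so f = v / (4 L)."""
    wavelength = 4.0 * length_m
    return speed_of_sound / wavelength


ear_canal_length = 0.025  # m, the 2.5 cm length of the external auditory canal
f_resonant = closed_pipe_fundamental(ear_canal_length)
print(f"estimated resonant frequency = {f_resonant:.0f} Hz")
```

The estimate comes out a little above 3 kHz, consistent with the optimum resonant frequency of around 3 kHz given for the ear canal.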
[Figure: the ear (diagram labels: hammer, anvil, stirrup, Eustachian tube)]
CHAPTER 20 (OPTION I)
The middle ear consists of a small (around 6 cm³), irregular, air-filled cavity in which, suspended by ligaments, are the ossicles, a chain of three bones called the malleus, incus and stapes, more commonly known as the hammer, anvil and stirrup. They act as a series of levers with a combined mechanical advantage of 1.3. Because of their combined inertia as a result of the ossicle orientation, size and attachments, they cannot vibrate at frequencies much greater than 20 kHz. The malleus is attached to the inner wall of the tympanic membrane and the flat end of the stapes comes up against the oval window (a membrane called the fenestra ovalis).
The pressure at the oval window will be amplified, and this higher pressure will be able to put in motion the denser fluid of the inner ear. This also matches the acoustic impedance between the inner ear and the outer ear to a greater degree.
As well as providing the necessary acoustic impedance matching, the middle ear helps to protect the delicate cochlea of the inner ear from sudden increases in pressure, through the linking of the middle ear to the back of the throat by a tube called the Eustachian tube. This tube, which is usually shut, equalises the air pressure on each side of the ear-drum. If the pressures were not equal, the ear-drum would not vibrate efficiently, and if there were a sudden large change in pressure, the ear-drum or the cochlea could rupture. The Eustachian tube can be opened by swallowing, yawning or chewing. We are all aware of the relief this brings when we experience sudden changes in pressure such as in air flights.
The inner ear is a complicated bony chamber filled with fluid and embedded in the bone of the skull. It is divided into two parts:
1. The central part together with the semi-circular canals (three fluid-filled canals) that are concerned with maintaining balance and the detection of movement and the position of the body. They do not contribute to the process of hearing.
2. A spirally coiled, fluid-filled tube about 3 mm in total diameter with a volume of 100 mm³ called the cochlea. The cochlea is connected to the brain by way of the auditory nerve.
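The pressure amplification produced by the middle ear, described above, can be estimated by combining the lever action of the ossicles with the difference in area between the ear-drum and the oval window. The sketch below is only an illustration: it uses the mechanical advantage of 1.3 and the ear-drum area of 60 mm² given in the text, and assumes an oval window area of 3.0 mm² (the value used in the chapter exercises).

```python
def pressure_gain(area_eardrum, area_oval_window, mechanical_advantage):
    """Ratio of pressure at the oval window to pressure at the ear-drum.

    The force is scaled by the lever advantage and then acts on a smaller area,
    so the gain is mechanical_advantage * (area_eardrum / area_oval_window).
    """
    return mechanical_advantage * area_eardrum / area_oval_window


# Areas in mm^2 (only their ratio matters); 3.0 mm^2 is an assumed value
gain = pressure_gain(area_eardrum=60.0, area_oval_window=3.0, mechanical_advantage=1.3)
print(f"pressure amplification is about {gain:.0f} times")
```

Most of the gain comes from the area ratio of 20, with the lever action of the ossicles contributing the remaining factor of 1.3.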
The cochlea is the most delicate organ in the hearing process and it contains many intricate structures that will not be fully investigated at this level. It consists of three canals: two outer canals, the upper (scala vestibuli) and the lower (scala tympani), and a middle canal (scala media), as shown in Figure 2002.
Figure 2002 (diagram labels: upper canal, middle canal, lower canal, organ of Corti, basilar membrane, auditory nerve)
The pressure wave from the oval window passes through a fluid called perilymph down the spiral of the scala vestibuli (upper) to its end, and returns via the scala tympani (lower). The pressure variation is absorbed at the return end by the round window membrane. The two chambers meet at the helicotrema. Between these two chambers is the membranous scala media that contains a different fluid called endolymph. It is bounded by the vestibular membrane (top) and the basilar membrane (bottom), and it terminates at the base of the cochlea. The scala media contains the sensors in which pressure waves generate electrical signals. These signals arise from a potential difference between the two different fluids and the membrane between them, and are carried to the brain via the auditory nerve.
The basilar membrane is embedded with about 20 000 noncellular fibres. Located on top of the basilar membrane is a set of hair cells that are suspended by fibres embedded in another membrane called the tectorial membrane. These components make up the complex structure called the organ of Corti. Although not fully understood, it is believed that when the basilar membrane moves, the hairlike fibres move back and forth and stimulate the hair cells of the organ of Corti to initiate neural impulses.
The pressure transformation can be dealt with in terms of the different areas of the eardrum and oval window, together with the lever action of the ossicles. Although the concept of impedance matching is not formally required, students should appreciate that, without a mechanism for pressure transformation between media of different densities (air and fluid), most sound would be reflected, rather than transmitted into the cochlear fluid.
The characteristics of a musical note are its pitch, its loudness and its quality, or timbre. The pitch is determined by the note's frequency. This has already been defined as the number of vibrations that occur in one second, measured in Hz or s⁻¹.
The quality, or timbre, of a note is very much determined by the source or instrument producing the note. It is determined by both the frequency and the relative amplitude of the note/s being produced. The timbre of noise is poor because it consists of a random mixture of unrelated frequencies. The timbre of a violin is good because the notes produced are mixtures of fundamental frequencies and harmonics. Just as the resonance of two vibrating tuning forks mounted in two sound boards is the selective reinforcement of the natural frequency of vibration of the sound boards, so too the response of the ear to sound is essentially one of resonance due to the vibrations of the sound matching the natural frequencies of the vibration of parts of the ear. The external auditory canal is like a closed pipe and exhibits slight resonance at approximately 3000 Hz. The middle ear displays a slight but broad resonance between about 700 Hz and 1400 Hz, and is greatest at about 1200 Hz. The cochlea of the inner ear displays excellent transmission between about 600 Hz and 6000 Hz.
RESPONSE
The human perception of loudness at different frequencies is not constant and varies considerably over the audible frequency range as shown in the intensity-logarithmic frequency diagram of Figure 2004. A logarithmic scale for frequency is preferred so that the wide range of audible frequencies can be examined on the same graph.
Figure 2004 (diagram axes: sound intensity / W m⁻² from 10⁻¹² to 10² and intensity level / dB from 0 to 140, against frequency / Hz on a logarithmic scale from 10 to 10 000; sounds marked: whispers, quiet talking, shouting, thunder, jet taking off)
The minimum detectable intensity for a given frequency is called the threshold intensity of hearing and it is the envelope of the curve in the above diagram. For example, the threshold of hearing between 2 kHz and 3 kHz is 10⁻¹² W m⁻² and at 90 Hz the threshold of hearing is about 10⁻⁸ W m⁻². From the graph, we can see that the range of frequencies that can be detected widens as the intensity level of the sound increases, up to an intensity level of about 100 dB where the full audio range of 20 Hz to 20 kHz is reached.
The ear is most sensitive to sounds at a frequency around 3 kHz. This is no accident, as the ear canal is 2.5 cm long and acts like a closed pipe with a standing wave of λ/4 along its length (this is approximate because it is more a travelling wave due to changing speed and wavelength). This would give us λ = 10 cm. If the speed of sound is 330 m s⁻¹, then the frequency would be about 3300 Hz, that is, roughly 3000 Hz. Why do you think alarms and the human scream are around 3000 Hz?
At the normal conversation level of 60 dB, the approximate frequency range is between 50 Hz and 14 kHz. The audible frequency range for the different intensity levels is shown by the threshold of hearing envelope in the diagram.
The ability of the ear to distinguish different frequencies is called frequency discrimination. At frequencies below 50 Hz, it is hard for the ear to discriminate the individual pitch. From 60 Hz to 1000 Hz, pitches differing by 2 to 3 Hz can be discriminated when listened to separately. Above 1000 Hz, the ability to discriminate close frequencies decreases by about 2 Hz per 1000 Hz. Above this value, frequency discrimination is acute.
The ability to discriminate frequencies is controlled by the cochlea. Refer to Figures 2001 and 2002 again. As the foot of the stirrup ossicle pushes inward onto the oval membrane, a pressure wave is produced in the perilymph fluid. The basilar membrane is forced to bulge towards the round window, and this in turn bulges outwards. The elastic tension causes a travelling wave to move along the basilar membrane. Depending on the frequency of the stimulation to the oval membrane, the travelling wave will produce a maximum amplitude at some well-defined point along the length of the basilar membrane:
1. Low frequencies produce an amplitude peak far from the oval window, where the hair fibres already mentioned are longer. Therefore, there is a greater membrane mass found here, and the amount of perilymph set in vibration is large.
2. High frequencies produce amplitude peaks close to the oval window, where the hair fibres are shorter and thinner. Therefore, there is a smaller membrane mass found here, and the amount of perilymph set in vibration is smaller.
3. Medium frequencies produce an amplitude peak around 2000 Hz and the hair fibres have a length and thickness in between cases 1 and 2.
Therefore, the cochlea can distinguish between different frequencies due to the complex fibres and hair cells, and there are corresponding neurones along the length for each frequency. The specific neural impulses can then travel along the auditory nerve to the brain. The travelling waves produced are not standing waves because the travelling wave will lose energy as it moves along the length of the basilar membrane and it will, as such, have a changing velocity and wavelength.
Loudness
The loudness of a note is a subjective quantity. It is not a measure of intensity level. Although intensity level takes into account the logarithmic measure of the ear's response to intensity (dB), it does not take into account the hearing of the listener or the frequency response of the listener's ear to different frequencies. Therefore, loudness is determined by both the intensity and the frequency of a sound. Loudness is measured in phons. A change in the observed loudness of a sound is the response of the ear to a change in intensity. Just as the audible frequency range depends on the sound intensity level, so too does loudness. However, it is also dependent on the energy transfer mechanisms of the ear. The basilar membrane and the hair cells are affected, and, as the sound intensity is increased, the nerve fibres become more stimulated and this creates a greater impulse frequency to the brain. The brain registers this as an increase in loudness. A graph of loudness perception is shown in Figure 2005. Equal changes in sound intensity are not perceived as equal changes in loudness.
Figure 2005 Curves of equal loudness perception for the human ear (vertical axis: sound intensity / W m⁻²)
Each curve is of equal loudness perception and is measured in units called phons. The relationship between sound intensity and loudness is logarithmic: a tenfold increase in intensity raises the perceived loudness by a factor of about two, so a hundredfold increase in intensity is perceived as roughly a fourfold increase in loudness. It can be shown that the increase in loudness is proportional to the intensity increase divided by the initial intensity, i.e.
$$dL = k\,\frac{dI}{I}$$
If we integrate this expression we get
$$L = k \ln I + C$$
where C is a constant. When I = I₀, L = 0. Therefore C = −k ln I₀, so
$$L = k \ln I - k \ln I_0 = k \ln\frac{I}{I_0}$$
This is further evidence of the logarithmic response to a change in intensity.

I.1.6 Define intensity and also intensity level (IL).
I.1.7 State the approximate magnitude of the intensity level at which discomfort is experienced by a person with normal hearing.
I.1.8 Solve problems involving intensity levels.
I.1.9 Describe the effects on hearing of short-term and long-term exposure to noise.
I.1.10 Analyse and give a simple interpretation of graphs where IL is plotted against the logarithm of frequency for normal and for defective hearing.
© IBO 2007
In a sound wave, the medium particles, such as air molecules, vibrate in simple harmonic motion parallel to the direction of wave propagation. Just as the energy in simple harmonic motion is proportional to the square of the amplitude (A) and the square of the frequency, so too sound intensity is directly proportional to the square of the amplitude and the square of the frequency of the particle vibrations:
$$I \propto A^2$$
The ear-brain combination can accommodate a range of sound intensities. The just detectable sound intensity, known as the threshold of hearing I₀, is taken to be 10⁻¹² W m⁻². The sound intensity that produces a sensation of pain, called the pain threshold, is taken to be 1 W m⁻².
A sound wave can also be considered as a pressure wave because the particle vibrations cause harmonic variations in the density of the medium. The pressure variations corresponding to the hearing threshold and the pain threshold are 3 × 10⁻⁵ Pa and 30 Pa respectively, superimposed on atmospheric pressure of about 101 kPa.
Because of the extreme range of sound intensities to which the ear is sensitive, a factor of 10¹² (curiously about the same factor as the range of light intensities detectable by the human eye), and because loudness (a very subjective quantity) varies with intensity in a non-linear manner, a logarithmic scale is used to describe the intensity level of a sound wave.
The intensity level of sound, IL, is defined as:
$$\beta = \log_{10}\frac{I}{I_0}$$
where I is the intensity corresponding to the level β and I₀ is the threshold intensity or threshold of hearing, taken as 10⁻¹² W m⁻². β is measured in bels (B), named after Alexander Graham Bell, one of the inventors of the telephone. Because the bel is a large unit, it is more convenient to use the decibel, dB (one-tenth of a bel):
$$\beta = 10 \log_{10}\frac{I}{I_0}\ \text{dB}$$
Using this scale, the threshold of hearing is:
$$\beta = 10 \log_{10}\frac{10^{-12}}{10^{-12}} = 0\ \text{dB}$$
Figure 2003 Intensity levels of some common sounds (table rows: whisper, library, normal office, normal conversation at 2 m, machine shop, rock concert)
At very high dB levels, a number of physiological effects have been noticed. Above 130 dB, nausea, vomiting and dizziness can occur. Noise greater than 140 dB can cause temporary deafness. Noise around 190 dB can cause major permanent damage to the ear in a short time.
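The decibel definition can be evaluated directly. The sketch below is a minimal illustration that applies β = 10 log₁₀(I/I₀) to the two reference intensities quoted in the text, the threshold of hearing (10⁻¹² W m⁻²) and the pain threshold (1 W m⁻²); the function name is illustrative only.

```python
import math

I0 = 1e-12  # threshold of hearing in W m^-2


def intensity_level_db(intensity, reference=I0):
    """Intensity level in dB: beta = 10 * log10(I / I0)."""
    return 10.0 * math.log10(intensity / reference)


threshold_level = intensity_level_db(1e-12)  # threshold of hearing
pain_level = intensity_level_db(1.0)         # pain threshold

print(f"threshold of hearing: {threshold_level:.0f} dB")
print(f"pain threshold:       {pain_level:.0f} dB")
```

The factor of 10¹² in intensity between the two thresholds is compressed into a range of 120 dB, which is the reason for using a logarithmic scale.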
Example
The sound intensity at a distance of 20 m from a fire alarm is 5.0 × 10⁻³ W m⁻². Calculate the sound intensity at a distance of 50 m.
Solution
Given that $I \propto \dfrac{1}{d^2}$, $I d^2 = k$ (a constant).
Therefore, $I_1 d_1^2 = I_2 d_2^2$,
so that (5.0 × 10⁻³ W m⁻²)(20 m)² = I₂ (50 m)²
$$I_2 = \frac{(5.0 \times 10^{-3}\ \mathrm{W\,m^{-2}})(20\ \mathrm{m})^2}{(50\ \mathrm{m})^2} = 8.0 \times 10^{-4}\ \mathrm{W\,m^{-2}}$$
Example
Calculate the sound intensity level in dB of a sound from a loudspeaker with a sound intensity of 8.0 × 10⁻⁴ W m⁻².
Solution
$$\beta = 10 \log_{10}\frac{I}{I_0} = 10 \log_{10}\frac{8.0 \times 10^{-4}}{10^{-12}} \approx 89\ \text{dB}$$
Example
A sound level meter placed near a circular saw registers a value of 92 dB. What is the intensity in W m⁻² corresponding to this intensity level?
Solution
We have that:
$$92 = 10 \log_{10}\frac{I}{10^{-12}} \quad\Rightarrow\quad I = 10^{-12} \times 10^{9.2} \approx 1.6 \times 10^{-3}\ \mathrm{W\,m^{-2}}$$
Physically, every 6 dB increase corresponds to a doubling of the sound pressure. For our perception, every 10 dB increase sounds about twice as loud. For adult subjects there are established ranges of sound intensity level that classify the degree of hearing loss at each frequency interval, as shown in Figure 2006.
Figure 2006 (diagram axes: intensity level from 10 (soft) to 120 (loud) against frequency (logarithmic scale) / Hz from 125 (low pitch) to 4000 (high pitch); bands marked: normal hearing, mild hearing loss, moderate hearing loss, moderately severe hearing loss, severe hearing loss, profound deafness)
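The three worked examples above (fire alarm, loudspeaker and circular saw) can be cross-checked numerically. The sketch below is one possible way of doing so, assuming a threshold of hearing of 10⁻¹² W m⁻² and the inverse square law for a point source; the function names are illustrative.

```python
import math

I0 = 1e-12  # threshold of hearing in W m^-2


def intensity_at(i1, d1, d2):
    """Inverse square law: I1 * d1**2 = I2 * d2**2, solved for I2."""
    return i1 * (d1 / d2) ** 2


def level_db(intensity):
    """Intensity level in dB for a given intensity in W m^-2."""
    return 10.0 * math.log10(intensity / I0)


def intensity_from_level(level):
    """Invert the decibel definition: I = I0 * 10**(level / 10)."""
    return I0 * 10.0 ** (level / 10.0)


alarm_at_50m = intensity_at(5.0e-3, 20.0, 50.0)  # fire alarm example
speaker_level = level_db(8.0e-4)                 # loudspeaker example
saw_intensity = intensity_from_level(92.0)       # circular saw example

print(f"fire alarm at 50 m: {alarm_at_50m:.1e} W m^-2")
print(f"loudspeaker level:  {speaker_level:.0f} dB")
print(f"circular saw:       {saw_intensity:.1e} W m^-2")
```

The same three functions can be reused for the intensity and intensity level problems in the exercises by changing the input values.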
The subject will have difficulty hearing soft speech or normal conversations but would manage fine if the speech is clear. Mild hearing loss in the high frequencies in both ears suggests noise damage. The subject would miss some consonant sounds such as ss, zs, ths, vs, even under perfect listening conditions. Under noisy conditions, the subject would experience more difficulty with hearing these sounds. The subject does not need a hearing aid, but should protect their remaining hearing by avoiding loud noise and using effective ear protection.
46 dB to 60 dB (moderate hearing loss)
The subject will experience difficulty with multiple conversations, especially if there is background noise. Much of the loudness of speech will be lost and there will be confusion amongst heard words due to the misinterpretation of consonant sounds. The volume for TV and radio would need to be increased. A hearing aid would help provided that the speech discrimination was good and the background noise minimised.
61 dB to 75 dB (moderately severe hearing loss)
This is a common result amongst the old due to the effects of an ageing inner ear. The subject has moderately severe hearing loss in both ears, usually in the higher frequencies, meaning that it would be hard to distinguish one word from another. A hearing aid is needed to amplify soft sounds considerably, moderate sounds a little and loud sounds not at all, but the aid must also leave the low frequencies unchanged.
76 dB to 90 dB (severe hearing loss)
Normal conversational speech is inaudible. A hearing aid will only help a little. The subject is likely to lip-read.
91 dB and above (profound hearing loss)
It is unlikely that a hearing aid will help.
Hearing losses can be conductive, sensory or neural. A conductive hearing loss occurs due to an abnormality in the outer and/or middle ears, and as a result, the sound energy cannot be conducted to the inner ear and then to the brain as it should be. It can be caused by allergies or infections in the throat and it causes severe earaches. It is quite common in children.
Otosclerosis is a condition found in adults in which the stapes bone begins to grow a spongy mass that prevents it from vibrating correctly in the oval window. Surgery can sometimes improve the hearing loss.
A heavy blow to the auricle bone or the head can cause perforations in the eardrum or damage to any of the ossicles. The conductive mechanism is disrupted and there may be no surgical solution.
It is not only the degree of hearing loss but also the range of audible sounds that affects a hearing-impaired person, and this selective frequency loss can lead to a loss of speech discrimination. Not only is the volume or quantity of sound affected but also the quality of the sound can be distorted. The lack of discrimination can be measured using speech discrimination scores that the audiologist then shades onto an audiogram. A speech discrimination score is shown by the shading on the audiogram in Figure 2007.
Figure 2007 An audiogram (diagram axes: intensity / dB from 10 to 100 against frequency (logarithmic scale) / Hz; regions marked for vowel sounds such as OO and EE and consonant sounds such as Th)
Generally, vowel sounds (a, e, i, o, u) are recognised at frequencies lower than 1000 Hz, and consonant sounds (s, z, th, v) are found above this frequency. Speech discrimination can greatly affect the quality of sound. For example, missing many consonants would mean that the subject may experience difficulty distinguishing one word from another.
... using a vibrating tuning fork placed on the mastoid bone behind the ear (bone conduction). Once the audiologist does this initial test, the subject is asked to sit in an acoustic booth and listen to a number of pure tones (pure tone audiometry), and indicate when the tones can be heard. The tones are reduced in intensity until they can just be heard. The hearing threshold is then marked on an audiogram.
For air conduction, hearing is measured with pure tones through a set of headphones placed on the ears. The subject is asked to push a button when they hear certain frequencies. This is known as the air threshold method as sound must travel through the air of the ear canal, through the middle ear, and through the inner ear cochlea.
When the sensitivity of the inner ear needs to be directly measured, the bone conduction method is used. This method allows for the outer ear and the middle ear to be bypassed. A vibrator is placed on the mastoid bone behind the ear, and this is held in place by a small metal band stretching over the top of the head. This method transmits sound by direct vibration of the bone and these vibrations are carried by the skull bones, tissues and fluids directly to the cochlea in the inner ear.
The standard audiogram reads:
1. the frequency in hertz, from the lowest to the highest pitch within the audible hearing range of the average person, on the horizontal axis. The range used is 125 Hz or 250 Hz on the left side and 8000 Hz on the right side. The scale is based upon octave intervals of a particular note, like on a piano keyboard. Lows are to the left of 1750 Hz and highs are to the right of this imaginary vertical line.
2. the loudness (intensity) in decibels on the vertical axis, from −10 dB to 0 dB at the bottom, and 110 dB at the top. 0 dB does not mean the recognition of no sound but rather it is the softest sound that a person will hear 50% of the time. Normal hearing is in the 0 to 20 dB range at all frequencies. Soft sounds are below 35 dB, moderate sounds are from 35 dB to 70 dB, and loud sounds are above 70 dB.
3. × represents the left ear and ○ represents the right ear in air threshold hearing curves detected using headphones. If the bone-conduction vibrator technique is used to determine the threshold, a > is used for the left ear, and a < is used for the right ear.
A typical audiogram is illustrated in Figure 2008 for a normal hearing range: all frequencies are in the 0 to 20 dB band. If all the × and ○ readings fall above the 20 dB line, then you have normal hearing. If anything is below this line, then you have a hearing loss. Sometimes, frequency intervals at 3000 Hz and 6000 Hz are included, making the audiogram longer by two intervals. Figure 2008 shows a normal audiogram obtained using the air or bone conduction method.
100 90 80 70 60 50 40 30 20 10 250 500 1000 2000 4000 8000 frequency (logarithmic scale) / Hz
(b)
(c)
(d)
(e)
For each of the parts listed below, label their position on the diagram, and explain their function: (i) ear drum (ii) ossicles (iii) semi-circular canals If there was no mechanism for pressure transformation between the media (air, bones and fluids) of the ear what would happen to most of the sound entering the ear? Label the cochlea. Using arrows, show the approximate position within the cochlea where high frequency and low frequency sounds are processed to produce neural impulses. Label the remaining arrows that are given in the figure.
2.
Which of the following are good estimates of the appropriate sound intensity level for some sounds? A. B. C. D. E. normal speech the hearing threshold thunder the pain threshold a noisy factory 60 dB 100 dB 60 dB 120 dB 100 dB
Figure 2008
A normal audiogram
OPTION
A comparison of the air and bone hearing curves can determine whether the hearing defect is conductive (when the air conduction curve shows a hearing loss but the bone conduction curve is normal), sensorineural (when the air conduction and bone conduction curves show the same amount of hearing loss), or a combination of both.
3.
Calculate the intensity level in dB of a sound having an intensity of magnitude 1.0 10-5 W m-2. By how many dB does a sound level increase if its intensity is trebled? A circular saw and a grinder when operated alone produce sound levels of 90 dB and 70 dB respectively. Calculate the sound level intensity when both machines are operated together. The intensity of sound 30 m from a busy street is 1.0 10-4 W m-2. Calculate the intensity at a distance of 120m from the street. Distinguish the difference between the terms sound intensity and sound intensity level as applied to sound and hearing. The sound intensity at a distance of 20m from a fire alarm is 5.0 10-3 Wm-2. Calculate the sound intensity at a distance of 40 m.
4.
Exercises
20.1
5.
1.
7.
8.
(a)
Draw 3 vertical lines to differentiate between the outer, middle and inner ear.
9. Calculate the intensity level in dB of a sound from a loudspeaker with a sound intensity of 7.0 × 10⁻⁴ W m⁻².
A sound level meter placed near a circular saw registers a value of 96 dB. Determine the intensity in W m⁻² corresponding to this intensity level.
By how many dB does a sound level increase if its intensity is doubled?
The figure below shows how the typical threshold of hearing varies with frequency for the average ear.
(b) If the cross-sectional area of the eardrum is 60 mm² and the cross-sectional area of the oval window is 3.0 mm², calculate the pressure amplification of the oval window.
10.
14.
11.
12.
pain discomfort
(a) (b)
(c)
10
(d)
Outline how the data for this audiogram was obtained from the patient.
Describe a possible interpretation of the audiogram in terms of the possible cause of the hearing disorder.
Determine the sound intensity in W m⁻² that is just audible at a frequency of 1000 Hz.
Would a hearing aid help the patient? Explain your answer.
Use the graph to find
(a) the frequency range over which a sound of intensity 10⁻⁶ W m⁻² can just be heard.
(b) the frequency at which the ear is most sensitive.
(c) how much less intense a sound of 250 Hz must be than a sound of 10 000 Hz if it is to be just heard.
13.
The figure below shows a schematic diagram of the ossicles positioned as levers between the eardrum and the oval window.
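[Figure: schematic of the ossicles as a lever between the ear drum and the oval window. Labels: ear drum (area A1, force F1), pivot, oval window (area A2, force F2), lever arms l and 2l/3, outer ear, middle ear, inner ear]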
(a)
With reference to the diagram, explain how pressure amplification to the inner ear occurs.
I.2 MEDICAL IMAGING
I.2.1 Define the terms attenuation coefficient and half-value thickness.
I.2.2 Derive the relation between attenuation coefficient and half-value thickness.
I.2.3 Solve problems using the equation $I = I_0 e^{-\mu x}$.
I.2.4 Describe X-ray detection, recording and display techniques.
I.2.5 Explain standard X-ray imaging techniques used in medicine.
I.2.6 Outline the principles of computed tomography (CT).
IBO 2007

Introduction
In Chapter 18 (Option G), the production of X-rays and their nature in terms of bremsstrahlung radiation (continuous spectrum) and characteristic line spectrum were discussed, and you might want to look at that section as back-up material.

There are a number of X-ray tubes used for medical purposes. The tubes can be classified as either diagnostic (medical imaging) or therapeutic (radiation therapy). We will concentrate in this section on diagnostic details because we are interested in medical imaging at the moment. Most diagnostic X-ray machines use a rotating anode X-ray tube as shown in Figure 2012.

[Figure 2012: rotating anode X-ray tube. Labels: lead-lined steel compartment, vacuum tube, filament, released electrons, tungsten anode, rotor, motor windings, accelerating voltage V, X-rays]

Electrons are accelerated through a very high potential difference (typically around 15 000 V in hospital machines) between the cathode and the anode. A focusing cup, usually made of molybdenum, contains a tungsten filament.

Electrons are released from the tungsten filament by thermionic emission. Because the large number of electrons released experience forces of repulsion, the electron stream tends to spread out. To prevent this spreading out, so that only a small area of the anode target material is bombarded, the cathode-focusing cup produces electrical forces that cause the electron stream to converge onto the anode target at a focal spot. Most of the energy of the electrons is converted to heat in this collision due to their sudden deceleration, with less than 1% being converted to X-radiation. Because large amounts of heat are produced at the focal spot of the electron beam, the tungsten disc anode is made to rotate using an induction motor so that the heat loading on any particular point on the disc is reduced.

Figure 2013 The mechanisms of attenuation [panels (a) to (d); labels: X-ray photon, electron, light photon, recoil electron, very high energy X-ray photon]

Simple coherent scattering (see Figure 2013(a)) occurs when the energy of the incoming X-ray photon is smaller than the energy required to remove inner-shell electrons from an atom. When the incident X-ray photon interacts
with an atom, it is scattered in a new direction without a loss of energy. It is the dominant mechanism in soft tissue in the 1-30 keV range.

In the photoelectric effect mechanism (see Figure 2013(b)), the incoming X-ray photon has an energy greater than the energy required to remove inner-shell electrons, and photoelectrons and positive ions are produced. As other electrons in the atom fill the vacant spots of the ejected photoelectrons, characteristic lower-energy photon emission occurs. It is the dominant mechanism in soft tissue in the 1-100 keV range. The optimum photon energy for diagnostic radiography is around 30 keV, where the photoelectric effect predominates, because this gives the maximum contrast between body tissues and bones.

Compton scattering (see Figure 2013(c)) occurs when the X-ray photon ejects outer-shell recoil electrons and the X-ray photon moves off in a different direction with a slightly lower energy. It is the dominant mechanism in soft tissue in the 0.5-5 MeV range. Compton scattering is used in therapeutic radiology where higher energies are preferred.

High energy X-ray photons can produce electron-positron pairs (pair production). It is the dominant mechanism in soft tissue above 5 MeV. When matter (such as an electron) collides with its corresponding antimatter (such as a positron), both particles are annihilated, and two gamma rays with the same energy, travelling at 180 degrees to each other, are produced. The direction of the gamma rays produced is in accordance with the law of conservation of momentum, and the electron-positron annihilation gives energy equal to E = mc² (0.51 MeV each). This is depicted in Figure 2013(d).

The attenuation (reduction in intensity) of X-rays occurs in two ways:
1. the intensity of the X-ray beam may decrease with distance from the source (tungsten target) as the X-rays diverge or spread out in spherical wavefronts.
2. the intensity of the X-ray beam decreases as the X-ray photons are scattered or absorbed by a material.

A beam of homogeneous, monoenergetic X-rays contains photons of only one energy and thus only one wavelength. When a beam of monoenergetic X-rays of intensity I0 passes through a medium with a thickness x, the transmitted intensity I is given by:

$I = I_0 e^{-\mu x}$

where μ is the constant of proportionality called the linear attenuation coefficient. Its value depends on the X-ray energy concerned and the nature of the absorbing material. It has units m⁻¹. The intensity of the monoenergetic beam decreases exponentially with absorber thickness. The value of the attenuation coefficient increases as the X-ray energy decreases, and higher absorption results. Figure 2014 shows a small thickness of lead absorbing X-rays.
Figure 2014 Attenuation in a slab of lead. [Two graphs of intensity, starting at I0, against thickness x / nm in lead: one for soft X-rays and one for hard X-rays]

The most penetrating radiation, with short wavelengths (~0.01 nm), is termed hard X-rays. Very little absorption occurs when they pass through the lead slab. Long wavelength (~1 nm) X-rays are easily absorbed by the lead slab and these are called soft X-rays. They are less penetrating and more readily absorbed than hard X-rays.

The quality (penetrating power) of a monoenergetic beam of X-rays can be described in terms of the half-value thickness (HVT) in a given material. The half-value thickness is the thickness of a material that reduces the intensity of a monoenergetic X-ray beam to half its original value. Figure 2015 shows the exponential decrease in transmitted intensity and the corresponding half-value thickness of an absorbing material.
The radiation emitted by an X-ray tube is heterogeneous because it is made up of photons with a range of energies. The filters already mentioned filter out the low energy photons to improve the X-ray quality. These photons would only be absorbed by the skin or surface tissue of a person being X-rayed, and one aim is to minimise any excess radiation dose because of the invasive characteristics of X-radiation.

[Figure 2015: % transmission (100 at zero thickness, 50 at the half-value thickness) against thickness of absorber, x / m, following $I/I_0 = e^{-\mu x}$]
Since the half-value thickness is the thickness of a material that reduces the intensity of a monoenergetic X-ray beam to half its original value, then in this instance:
If $x = x_{1/2}$ then $I = \tfrac{1}{2} I_0$.

Using $I = I_0 e^{-\mu x}$, we have:

$\tfrac{1}{2} I_0 = I_0 e^{-\mu x_{1/2}}$
$\tfrac{1}{2} = e^{-\mu x_{1/2}}$
$\ln(0.5) = -\mu x_{1/2}$
$0.6931 = \mu x_{1/2}$

That is,

$x_{1/2} = \dfrac{0.6931}{\mu}$

We can also determine a value for the linear attenuation coefficient for a monoenergetic beam by plotting a graph of ln I against thickness. Take log_e of both sides of $I = I_0 e^{-\mu x}$:

$\ln I = \ln I_0 + \ln e^{-\mu x}$
$\ln I = -\mu x + \ln I_0$

Figure 2016 The linear attenuation coefficient for a monoenergetic beam [graph of ln I against thickness: a straight line with gradient = -μ and y-intercept = ln I0]

From Figure 2016 we can see that the gradient of the straight line is equal to -μ and the y-intercept is equal to ln I0.

Example
The half-value thickness of 30 keV X-ray photons in aluminium is 2.4 mm. The initial intensity of the X-ray beam is 4.0 × 10² kW m⁻².
(a) What is the intensity after passing through 9.6 mm of aluminium?
(b) Calculate the linear attenuation coefficient of the aluminium.
(c) What is the intensity of the beam after passing through 1.5 mm of aluminium?

Solution
(a) Intensity after passing through 2.4 mm would be half the initial intensity. Intensity after passing through 4.8 mm would be a quarter of the initial intensity. Intensity after passing through 7.2 mm would be an eighth of the initial intensity. Intensity after passing through 9.6 mm would be one-sixteenth of the initial intensity.
New intensity = (1/16) × (4.0 × 10² kW m⁻²) = 25 kW m⁻²
(b) $\mu = \dfrac{0.6931}{x_{1/2}} = \dfrac{0.6931}{2.4} = 0.29$
Therefore, the linear attenuation coefficient is 0.29 mm⁻¹ or 2.9 × 10² m⁻¹. Be careful of the units here because if the value is 0.29 per mm then it is 290 per m.
(c) $I = I_0 e^{-\mu x} = 4.0 \times 10^2 \times e^{-(290 \times 0.0015)} = 2.59 \times 10^2$
The intensity is 2.59 × 10² kW m⁻².
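The half-value thickness calculation above can be checked numerically. The following is a minimal Python sketch, not part of the original text; the function names are illustrative choices, and it uses the unrounded value of μ, so part (c) comes out very slightly higher than the figure obtained with μ rounded to 290 m⁻¹.

```python
import math

def attenuation_coefficient(half_value_thickness):
    """Linear attenuation coefficient from x_half, using mu = ln 2 / x_half."""
    return math.log(2) / half_value_thickness

def transmitted_intensity(i0, mu, x):
    """Intensity after a thickness x of absorber: I = I0 * exp(-mu * x)."""
    return i0 * math.exp(-mu * x)

# Values from the aluminium example (SI units for lengths, kW m^-2 for intensity).
x_half = 2.4e-3      # half-value thickness in m
i0 = 4.0e2           # initial intensity in kW m^-2

mu = attenuation_coefficient(x_half)               # in m^-1
i_after_9_6_mm = transmitted_intensity(i0, mu, 9.6e-3)
i_after_1_5_mm = transmitted_intensity(i0, mu, 1.5e-3)

print(f"mu = {mu:.0f} per metre")
print(f"I after 9.6 mm = {i_after_9_6_mm:.1f} kW m^-2")
print(f"I after 1.5 mm = {i_after_1_5_mm:.0f} kW m^-2")
```

Because 9.6 mm is exactly four half-value thicknesses, the first intensity is one-sixteenth of the initial value, which agrees with the step-by-step halving used in part (a).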
Example
(a) What is meant by the term attenuation when referring to X-rays?
(b) Name two mechanisms responsible for the attenuation of X-rays by matter.
(c) Name two ways in which the attenuation of X-rays occurs.
(d) Define the term attenuation coefficient.
(e) State the two factors upon which the value of the attenuation coefficient depends.
(f) The transmission of X-rays by matter can depend upon the thickness of the material in the path of the X-rays.
(i) Sketch a graph of percentage transmission versus the thickness of the absorbing material.
(ii) By using your graph, explain the meaning of the term half-value thickness.

Solution
(a) The attenuation of an X-ray beam is the reduction in its intensity due to its passage through matter. (When a beam of X-rays passes through a material such as the soft tissue of the body or bone, some of the X-rays will be absorbed.)
(b) There are four attenuation mechanisms where energy can be lost due to absorption in matter:
1. simple coherent scattering
2. the photoelectric effect
3. Compton scattering
4. pair production
(c) The attenuation (reduction in intensity) of X-rays occurs in two ways:
1. the intensity of the X-ray beam may decrease with distance from the source (tungsten target) as the X-rays diverge or spread out in spherical wavefronts.
2. the intensity of the X-ray beam decreases as the X-ray photons are scattered or absorbed by a material.
(d) A beam of homogeneous, monoenergetic X-rays contains photons of only one energy and thus only one wavelength. When a beam of monoenergetic X-rays of intensity I0 passes through a medium with a thickness x, the transmitted intensity I is given by $I = I_0 e^{-\mu x}$, where μ is the constant of proportionality called the linear attenuation coefficient. It has units m⁻¹.
(e) Its value depends on the energy of the X-ray photons and the nature of the absorbing material.
(f) (i) See the graph below, which shows the exponential attenuation by an absorbing material.
[Graph: % transmission (100 at zero thickness, 50 at the half-value thickness) against thickness of absorber, x / m, following $I/I_0 = e^{-\mu x}$]
(ii) The quality (penetrating power) of a monoenergetic beam of X-rays can be described in terms of the half-value thickness (HVT) in a given material. The half-value thickness is the thickness of a material that reduces the intensity of a monoenergetic X-ray beam to half its original value.
The X-ray beam passes through the glass wall of the X-ray tube, a layer of oil, then a 3 mm thick aluminium plate to filter out low energy radiation. It is then collimated by lead plates. The aim is to produce a narrow beam because any random scatter increases the blur of the radiographic image. The amount of exposure time the patient experiences is strictly controlled. The X-rays enter the patient where they are either scattered or absorbed. In order to decrease blurring on the radiograph due to scattering, a lead grid system is inserted before the photographic film. Direct X-rays pass between the grid while the scattered X-rays are absorbed by the lead plates, see Figure 2018 (b). The direct X-rays then fall on an intensifying screen cassette containing double-sided film sandwiched between two fluorescent screens, see Figure 2018 (c).

[Figure 2018: labels X-ray beam, patient, X-ray film. Panel (c) Image intensifying screen: X-rays from patient, plastic front, double-sided film, felt pad]

The standard X-ray machine used in radiography can produce images of some of the internal organs of the body and the bones. Air pockets, fat and soft tissues can be differentiated from each other because they attenuate the X-ray beam in different ways. Bones produce a white image because they contain heavier body elements such as calcium and phosphorus in a dense matrix that attenuate the X-ray beam more than the softer tissues. Tissues that contain lighter elements such as hydrogen, carbon, nitrogen and oxygen produce a grey image on the radiograph. It is easy to distinguish a black lung image because it contains lower density air when compared with the more dense water that is present in abundance in tissues.

Certain parts of the body are difficult to image against the background of other body parts. In order to improve the contrast of the image, solutions of heavy elements with a large attenuation coefficient can be introduced into the body. These materials are known as contrast-enhancing media. Barium and bismuth can be introduced through the mouth or the rectum for the imaging of the alimentary canal or the appendix. It is common for people with stomach pain or possible gastro-intestinal ulcers to be asked to drink a barium sulfate meal before an X-ray is taken. An iodine solution can be introduced intravenously to enhance the image of the cardiovascular system, the kidney and the brain.

However, the contrast of the image produced in soft tissue anatomy is not very clear in many situations. Over the past 20 years, this clarity has been greatly improved by using X-rays and electronic detection and display together in computed tomography imaging.
1. Increasing the tube voltage.
2. Increasing the tube current.
3. Using a target material with a relatively high atomic number Z.
4. Using filters.

Tube voltage
When the accelerating voltage between the cathode and anode of an X-ray tube is increased, the frequency of the X-radiation increases. Therefore, the radiation has more energy and the penetration increases. As the intensity per unit area increases due to the higher potential, so too does the spectral spread as shown in Figure 2019.

[Figure 2019: relative intensity against photon energy (axis marked 10 to 100) for low voltage and high voltage, showing the characteristic X-rays and Emax]

The following effects can be observed:
(a) Emax increases.
(b) λmin decreases.
(c) the peak of the continuous spectrum moves towards higher energies.
(d) the total intensity given by the area under the curve increases, and is proportional to V².
(e) more characteristic line spectrum may appear.

Tube current
Increasing the tube current will increase the rate of thermionic emission from the cathode. Because there are more electrons available to produce X-ray photons, the overall intensity increases. Figure 2020 demonstrates the effects observed when the tube current is increased.

[Figure 2020: relative intensity (0.5, 1.0) for tube currents of 10 mA and 20 mA]

The following effects can be observed:
(a) the spectral shape remains the same.
(b) Emax remains the same as the voltage is constant.
(c) the total intensity (given by the area under the spectrum) increases as the area under the curve is proportional to I.

Target material
The target material must have a high melting point so that it will not melt with the large heat generated by the accelerated electrons bombarding it. Furthermore, the target material must have a relatively high atomic number so that the mass, size and number of protons in the atoms ensure a greater probability that the bombarding electrons make the necessary collisions to produce X-rays. Common target materials include tungsten (Z = 74) and platinum (Z = 78). Tungsten is more widely used because of its high melting point (3370 °C). See Figure 2021.

[Figure 2021: relative intensity against photon energy, showing the K lines for a high Z target]
Figure 2021 shows the effects observed when the atomic number is increased. We then note that:
(a) Emax remains constant.
(b) the characteristic line spectra are shifted to higher photon energies.
(c) the X-ray intensity increases as the area under the curve is directly proportional to Z.

Filters
A thin sheet of material is placed in the path of the X-ray beam, and selectively absorbs more lower-energy photons than high-energy photons. The effect of selective filtration is shown in Figure 2022.

[Figure 2022: relative intensity with no filter and with filter, showing the part of the spectrum removed by the filter]

The following effects can be observed:
(a) Emax does not change.
(b) There is a shift in Emin towards higher energies.
(c) There is a reduction in X-ray output.

Although the intensity is reduced, the beam is more penetrating because of the removal of lower energy photons. The X-rays are said to be harder.

Example
An X-ray tube has a beam current of 35 mA and it is operated at a voltage of 30 kV.
(a) At what rate does the machine transform energy?
(b) How many electrons reach the target each second?
(c) What is the maximum energy of the X-rays produced? (Assume no thermal energy loss.)
(d) What is the minimum wavelength of the X-rays produced?

Solution
(a) P = VI = (3 × 10⁴ V) × (3.5 × 10⁻² A) = 1.05 kW
(b) q = It = (3.5 × 10⁻² A) × (1 s) = 3.5 × 10⁻² C
1 C is the charge on 6.25 × 10¹⁸ electrons. Thus the number of electrons reaching the target = (6.25 × 10¹⁸ e C⁻¹) × (3.5 × 10⁻² C) = 2.2 × 10¹⁷ electrons per second
(c) E = eV = (1.6 × 10⁻¹⁹ C) × (3 × 10⁴ V) = 4.8 × 10⁻¹⁵ J
(d) $E = \dfrac{hc}{\lambda}$, so $\lambda = \dfrac{hc}{E}$
= (6.63 × 10⁻³⁴ J s × 3 × 10⁸ m s⁻¹) / (4.8 × 10⁻¹⁵ J)
= 4.143 × 10⁻¹¹ m
= 0.0414 nm
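The X-ray tube example (35 mA beam current at 30 kV) can be reproduced with a short Python sketch. This is an illustration only: the function name is hypothetical and the constants are rounded to the values used in the worked solution. It assumes, as the example does, that the full electron energy eV can appear as a single photon.

```python
# Rounded constants, matching the precision of the worked solution.
PLANCK = 6.63e-34           # J s
LIGHT_SPEED = 3.0e8         # m s^-1
ELECTRON_CHARGE = 1.6e-19   # C

def tube_quantities(voltage, current):
    """Return power (W), electrons per second, E_max (J) and lambda_min (m)."""
    power = voltage * current                        # P = VI
    electrons_per_second = current / ELECTRON_CHARGE
    e_max = ELECTRON_CHARGE * voltage                # E = eV
    lambda_min = PLANCK * LIGHT_SPEED / e_max        # lambda = hc / E
    return power, electrons_per_second, e_max, lambda_min

power, rate, e_max, lambda_min = tube_quantities(voltage=3.0e4, current=3.5e-2)

print(f"power = {power / 1e3:.2f} kW")
print(f"electrons per second = {rate:.2e}")
print(f"E_max = {e_max:.2e} J")
print(f"lambda_min = {lambda_min * 1e9:.4f} nm")
```

Changing the `current` argument alters only the power and the electron rate, while changing `voltage` also changes E_max and the minimum wavelength, which mirrors the effects listed for tube current and tube voltage.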
A patient lies on a table that passes through a circular scanning machine about 60-70 cm in diameter called a gantry. The gantry can be tilted, and the table can be moved in the horizontal and vertical directions. X-rays from the gantry are fired at the organ being scanned and attenuation occurs dependent on the type of tissue being investigated. The image produced on the computer monitor is a series of sections or slices of an organ built up to create a three-dimensional image. A schematic diagram of one section is shown in Figure 2023.

[Figure 2023: X-ray tube mounted on gantry, X-rays, lead collimators, computer, monitor]

A fan beam of around 100 X-ray pulses is produced as the X-ray tube and the photomultiplier detectors around the patient make a 360° rotation. A cross-section or slice of an organ from 1 mm to 10 mm in thickness is obtained with each rotation. The slice thickness is controlled by the lead collimators. About 1000 profiles or pictures are obtained in each rotation. A series of slices can be made to produce a 3-dimensional picture of an entire organ. The time required for the complete scan of an entire organ is normally from 3-5 seconds. However, short scanning times of 500 ms can be used when the anatomical region being investigated is affected by the patient's motion and breathing.

The detectors send the information to a series of computers and a host computer oversees the entire operation. The plane of the tomographic image is divided into small pixel areas of about 1 mm², each of which can be given a grey shade value from 1 (black) to 256 (white). The thickness of each slice is simultaneously built into a volume pixel called a voxel. The image is produced on a computer monitor, and this image can be manipulated and reconstructed to get rid of interference by subtracting the background. The required well-contrasted image of the organ being investigated is then obtained.

CAT scans provide detailed cross-sectional images for nearly every part of the body including the brain and vessels, the heart and vessels, the spine, and abdominal organs such as the liver and kidneys. They are being used in many diagnostic applications including the detection of cancerous tumours and blood clots.

I.2.7 Describe the principles of the generation and the detection of ultrasound using piezoelectric crystals.
I.2.8 Define acoustic impedance as the product of the density of a substance and the speed of sound in that substance.
I.2.9 Solve problems involving acoustic impedance.
I.2.10 Outline the differences between A-scans and B-scans.
I.2.11 Identify factors that affect the choice of diagnostic frequency.
IBO 2007
to mechanical deformation, a tiny electric potential difference is produced between the faces of the crystal, and conversely the application of an electric potential would deform the crystal and make it vibrate. This is known as the piezoelectric effect. Today, the piezoelectric transducers used in ultrasound have certain ceramic materials in place of crystals. These transducers operate over the entire range of ultrasound frequencies.

When ultrasound meets an interface between two media, the ultrasound wave can undergo reflection, transmission, absorption and scattering. This is similar to when light from air enters glass. The same laws of reflection and refraction apply. For example, with refraction (transmission) the frequency of the ultrasound source will remain constant but a change in the wave's velocity as it crosses the boundary will change the wavelength.

In a typical ultrasound scan, a piezoelectric transducer is placed in close contact with the skin. To minimise the acoustic energy lost due to air being trapped between the transducer and the skin, a gel is applied between the transducer and the skin. The pulse produced by the transducer reflects off various tissue interfaces. The pulse is again detected by the same transducer as a reflected wave or echo. The electronic representation of the data generated from the repetition of this process is displayed on an oscilloscope as an ultrasonic image. Thus the distance, size and location of hard and soft tissue structures can be determined.

Medium                    Velocity      Density       Acoustic impedance
                          / m s⁻¹       / kg m⁻³      / × 10⁶ kg m⁻² s⁻¹
Air (20 °C, 101.3 kPa)    344           1.21          0.0004
Water (20 °C)             1482          998           1.48
Whole blood (37 °C)       1570          1060          1.66
Brain                     1541          1025          1.60
Liver                     1549          1065          1.65
Kidney                    1561          1038          1.62
Skull bone                4080          1912          7.80
Muscle                    1580          1075          1.70
Soft tissue (37 °C)       1540          1060          1.66
Lens of eye               1620          1136          1.84

Figure 2024 The approximate speed of sound in some biological materials

The greater the difference in acoustic impedance between two materials, the greater will be the proportion of the pulse reflected. If I0 is the initial intensity and Ir is the reflected intensity for normal incidence, it can be shown that:

$\dfrac{I_r}{I_0} = \dfrac{(Z_2 - Z_1)^2}{(Z_2 + Z_1)^2}$

where Z1 is the acoustic impedance of material 1 and Z2 is the acoustic impedance of material 2.
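To see what the reflection relation implies for real boundaries, the Python sketch below evaluates it for three pairs of media using the acoustic impedances listed in Figure 2024. The function and dictionary names are illustrative, and normal incidence is assumed, as in the formula.

```python
def reflected_fraction(z1, z2):
    """Fraction of incident intensity reflected at normal incidence:
    Ir / I0 = (Z2 - Z1)^2 / (Z2 + Z1)^2."""
    return ((z2 - z1) / (z2 + z1)) ** 2

# Acoustic impedances in kg m^-2 s^-1, taken from Figure 2024.
impedance = {
    "air": 0.0004e6,
    "soft tissue": 1.66e6,
    "muscle": 1.70e6,
    "skull bone": 7.80e6,
}

air_to_tissue = reflected_fraction(impedance["air"], impedance["soft tissue"])
tissue_to_muscle = reflected_fraction(impedance["soft tissue"], impedance["muscle"])
muscle_to_bone = reflected_fraction(impedance["muscle"], impedance["skull bone"])

print(f"air / soft tissue:    {air_to_tissue:.4f}")
print(f"soft tissue / muscle: {tissue_to_muscle:.6f}")
print(f"muscle / skull bone:  {muscle_to_bone:.3f}")
```

Almost all of the intensity is reflected at an air and soft tissue boundary, which is why a gel is used to exclude air between the transducer and the skin, whereas very little is reflected between two similar soft tissues.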
Example
(a) The speed of ultrasound in blood is 1570 m s⁻¹ and the density of the blood is 1060 kg m⁻³. Calculate the acoustic impedance of blood.
(b) Calculate the thickness of a slice of muscle tissue if its fundamental resonant frequency is 1.5 MHz.
(c) The time delay for a pulse going through fat is 0.133 ms and the speed of ultrasound in the fat is 1450 m s⁻¹. Determine the depth of the fat.
This mode is seldom used, but when it is, it measures the size and distance to internal organs and other organs such as the eye.
Solution
(a) Z = ρv = 1060 kg m⁻³ × 1570 m s⁻¹ = 1.66 × 10⁶ kg m⁻² s⁻¹.
(b) The wavelength at the resonant frequency = v / f = 1580 m s⁻¹ / (1.5 × 10⁶ Hz) = 1.05 × 10⁻³ m. Now the ultrasound has to go down and back. Therefore, the thickness will be half of this wavelength = 0.53 mm.
(c) The pulse travels to the boundary and back, so the time taken to reach the boundary of the fat = (1.33 × 10⁻⁴ s) / 2 = 6.65 × 10⁻⁵ s.
Distance = speed × time = 1450 m s⁻¹ × 6.65 × 10⁻⁵ s = 9.6 cm
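The three parts of this ultrasound example can be checked with the following Python sketch. The helper names are hypothetical; the speed of sound in muscle (1580 m s⁻¹) is taken from Figure 2024, and part (b) follows the solution's assumption that the slice thickness is half a wavelength at the fundamental resonance.

```python
def acoustic_impedance(density, speed):
    """Acoustic impedance Z = density * speed of sound (kg m^-2 s^-1)."""
    return density * speed

def half_wavelength(speed, frequency):
    """Half of the wavelength v / f, used here as the resonant slice thickness."""
    return speed / frequency / 2

def echo_depth(speed, round_trip_time):
    """Depth of a boundary when the echo returns after round_trip_time."""
    return speed * round_trip_time / 2

z_blood = acoustic_impedance(density=1060, speed=1570)
muscle_thickness = half_wavelength(speed=1580, frequency=1.5e6)
fat_depth = echo_depth(speed=1450, round_trip_time=0.133e-3)

print(f"Z (blood) = {z_blood:.3g} kg m^-2 s^-1")
print(f"muscle slice thickness = {muscle_thickness * 1e3:.2f} mm")
print(f"depth of fat = {fat_depth * 1e2:.1f} cm")
```

The division by two in `echo_depth` is the step that is easiest to overlook: the measured delay covers the journey to the boundary and back.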
B-Scan imaging
In the B-scan mode (brightness-modulated scan), an array of transducers scans a slice in the body. Each echo is represented by a spot of a different shade of grey on an oscilloscope. The brightness of the spot is proportional to the amplitude of the echo as shown in Figure 2026.

Figure 2026 B-scan mode [labels: transducer, skin, organ]
The scan head containing many transducers is arrayed so that the individual B-scans can be built up to produce a two-dimensional image. The scan head is rocked back and forth mechanically to increase the probability that the pulse will strike irregular interfaces.
Although the details of nuclear magnetic resonance are beyond the scope of this course, the basic principles of this phenomenon will be outlined in the next few paragraphs.

Recall that when a current is passed through a coiled wire (solenoid), the magnetic field produced is similar to that produced by a simple bar magnet. At the microscopic level, it is known that a charged particle such as a proton or an electron acts like a tiny current loop. As a result, the nuclei of certain atoms and molecules also behave like small magnets due to the rotation or spin of their nuclear protons or neutrons. Spin is in two directions and when nuclei have equal numbers of protons and neutrons, the spin is equal in both directions and there is no net spin. However, if there are different numbers of protons and neutrons, the spins do not cancel and there is a net spin. This happens with hydrogen nuclei.

If hydrogen nuclei are placed in a strong external magnetic field, they will tend to align their rotation axes with the external field direction. However, the laws of quantum mechanics allow only certain alignment angles and, as a result, the nuclear magnets cannot come into perfect alignment with the external field. Some will align with the magnetic field and others align themselves in the opposite direction to the magnetic field. In fact, they precess like small magnetic tops wobbling at fixed angles around the magnetic field direction.

Now when a weak oscillating magnetic field in the form of pulses of radio waves is superimposed on the strong magnetic field, the oscillating field rotates at right angles to the strong field. If the radio frequency is not a certain frequency, known as the Larmor frequency, the axis of the rotating particle will wobble as described previously. If the applied frequency is equal to the Larmor frequency of precession, the charged particles resonate and absorb energy from the varying radio wave magnetic field.

The magnetisation of the material is changed and this is detected by a radio-frequency signal emitted from the sample. The strength and duration of the radio signals absorbed and emitted are dependent on the properties of the tissue being examined. The proton in the hydrogen atom has a strong resonance signal and it is abundant in body fluids due to the presence of water. Bone shows no MRI signal. MRI is ideal for detecting brain and pituitary tumours, infections in the brain, spine and joints and in diagnosing strokes.
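The text names the Larmor frequency without giving a value. As a rough illustration that goes slightly beyond the course material, the Python sketch below uses the standard relation f = (γ/2π)B for hydrogen nuclei. The value γ/2π ≈ 42.58 MHz per tesla and the 1.5 T field strength are assumptions brought in for the example; neither is taken from this chapter.

```python
# Assumption (not from the text): gyromagnetic ratio of the proton divided
# by 2*pi, in hertz per tesla.
PROTON_GAMMA_OVER_2PI = 42.58e6

def larmor_frequency(field_strength):
    """Precession frequency in Hz of hydrogen nuclei in a field of
    field_strength tesla, using f = (gamma / 2 pi) * B."""
    return PROTON_GAMMA_OVER_2PI * field_strength

# Hypothetical field strength chosen for illustration.
f_resonance = larmor_frequency(1.5)
print(f"Larmor frequency = {f_resonance / 1e6:.1f} MHz")
```

A frequency of tens of megahertz lies in the radio part of the spectrum, which is consistent with the use of pulses of radio waves to excite the nuclei.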
I.2.12 Outline the basic principles of nuclear magnetic resonance (NMR) imaging.
I.2.13 Describe examples of the use of lasers in clinical diagnosis and therapy.
IBO 2007
[Figure: MRI scanner coils. Labels: saddle coil (producing horizontal field gradient), saddle coil (producing vertical field gradient)]
Lasers are commonly used in conjunction with endoscopy, which uses optical fibres to look inside the body. An endoscope consists of two basic parts: a flexible tube fitted with a camera and a bright light that can be inserted through a small incision, and a viewing monitor that magnifies the transmitted image of the part of the body being examined. The endoscope is used as the viewing device for the surgical instruments being used. Because lasers are highly collimated they can be used for transmission along optical fibres. For example, the neodymium YAG laser can be used with the endoscope to help to destroy cancers in parts of the gut, the colon and airways. It can also be used to treat blood clots in the legs and coagulate bleeding ulcers of the stomach. Recently, the Nd-YAG laser has been used to remove fatty deposits in blood vessels.

Lasers are also used in pulse oximetry, which is a non-invasive technique used to monitor the oxygen content of hemoglobin. It is critical that intensive care patients and patients being taken to hospital in an ambulance after a trauma are monitored. A small clip is attached to a part of the body that is transparent to laser light. Laser light of two different wavelengths, one in the infrared and one in the red region, is partially absorbed by the hemoglobin by different amounts dependent upon the amount of saturated or unsaturated oxygen that is present. A processor connected to the laser probe can determine the proportion of the hemoglobin that is oxygenated.

Carbon dioxide lasers cannot be used in optical fibres because they operate in the far infra-red region and their energy is absorbed by the fibre. However, they have become a major laser in operations for cutting the skin and surface tissue, cauterizing bleeding blood vessels, doing fine surgery of the brain and fallopian tubes and treating cervical cancer. Argon lasers that produce blue-green light are used to remove red birthmarks and some tattoos. They are also used to treat eye diseases in diabetic patients and for re-attaching detached retinas. The excimer laser is commonly used to correct eye defects by vaporising excess tissue in the reshaping of the corneal surface.

The advantages and disadvantages of diagnostic techniques are summarised with the other imaging techniques in Figure 2029.
Carbon dioxide Neodymium 1064 0.5100 W yttrium aluminium garnet Nd - YAG Argon 488 -514 110 W Dye 550 -700 0.055 W depending on the dye used Nd- YAG 1064 0.11 J per (Q-switched) nanosecond Nd- YAG 1064 0.11 J per (pulsed) microsecond Dye (pulsed) 550 -700 0.010.1 depending on J per the dye used microsecond Excimer 193 0.00.J per (pulsed) nanosecond
Yes Yes
No Yes Yes
Yes
Ultrasound
Advantages: Relatively cheap to use. Abundant ultrasound machines. No ionising radiation. Non-invasive. Good for soft-tissue diagnosis. Can break down gallstones and kidney stones. Good for measuring bone density.
Disadvantages: Highly reflective boundaries between bone/tissue and air/tissue prevent effective imaging. High frequency ultrasound has low penetrative ability. A limit to the size of objects that can be detected.

X-rays
Advantages: Simple to use. Cheapest alternative. Abundant X-ray machines. Good for certain structures.
Disadvantages: Poor at body-function diagnosis. Not good for differentiating one structure from another. Resolution not as good as others. Sometimes enhancing materials need to be ingested. Radiation dangerous to health.

CAT scans
Advantages: Good for 3D images showing structure. Creates cross-sections. Resolution better than basic X-ray radiography. Good for tumours and other lesions. Good for stroke detection.
Disadvantages: Not good for organ function. Radiation not good for the health. More expensive than standard X-ray.

MRI
Advantages: No ionising radiation. Clearest images of the brain. Best image of the central nervous system.
Disadvantages: Most expensive. Scan time up to 40 minutes. Cannot be used with heart pacemakers and metal prostheses.

Exercise 20.2

1. Describe the function of the following parts of an X-ray tube:
(a) The filament
(b) The potential difference across the electrodes
(c) The tungsten target

2. Calculate the wavelength and frequency of an X-ray photon with an energy of 30 000 eV.

3. Explain why the target in an X-ray tube is mounted on a disc that rotates at 3600 revolutions per minute.

4. Explain why you would not use hard X-rays for imaging tissues.

5. An X-ray tube has a beam current of 40 mA and it is operated at a voltage of 100 kV.
(a) Calculate the rate at which the machine transforms energy.
(b) Determine how many electrons reach the target each second.
(c) Calculate the maximum energy of the X-rays produced. (Assume no thermal energy loss).
(d) Calculate the minimum wavelength of the X-rays produced.

6. The half-value thickness of 30 keV X-ray photons in aluminium is 4.8 mm. The initial intensity of the X-ray beam is 2.59 × 10² kW m⁻².
(a) Determine the intensity of the beam after passing through 9.6 mm of aluminium.
(b) Calculate the linear attenuation coefficient of the aluminium.
(c) Determine the intensity of the beam after passing through 1.2 mm of aluminium.

7. Describe what is meant by the term X-ray quality, and name two ways in which the quality can be increased.
MEDICAL PHYSICS
8. The half-value thickness of 30 keV X-ray photons in aluminium is 2.4 mm. The initial intensity of the X-ray beam is 4.0 × 10² kW m⁻².
(a) Determine the intensity of the beam after passing through 9.6 mm of aluminium.
(b) Calculate the linear attenuation coefficient of the aluminium.
(c) Determine the intensity of the beam after passing through 1.5 mm of aluminium.

9. CAT and MRI scanners produce tomographic images of parts of the body in diagnostic tests.
(a) Describe what is meant by the term tomography.
(b) Outline the method by which CAT scans are collected.
(c) Give two diagnostic applications that CAT scans are used for.
(d) Discuss the advantages that a CAT scan has when compared to conventional X-ray techniques.

10. MRI is proving to be an extremely useful technique for imaging blood flow and soft tissue in the body. It is the preferred diagnostic imaging technique for studying the brain and the central nervous system.
(a) Describe the basic principles employed to collect an MRI scan of body tissues.
(b) State the property of the hydrogen atom that makes it such a useful atom for MRI diagnosis.
(c) Give two diagnostic applications that MRI scans are used for.
(d) Discuss the advantages and disadvantages that an MRI scan has when compared to other diagnostic techniques.

11. Ultrasound is a useful tool in medical diagnosis and imaging.
(a) Describe how ultrasound is different to other types of radiation used in medical diagnosis.
(b) Explain the SONAR principle used in medical ultrasonic diagnosis.
(c) Below is data relating to ultrasound transmission in various media.

Medium | Velocity / m s⁻¹ | Density / kg m⁻³ | Acoustic impedance / 10⁶ kg m⁻² s⁻¹
Air (20 °C, 101.3 kPa) | 344 | 1.21 | 0.0004
Water (20 °C) | 1482 | 998 | 1.48
Whole blood (37 °C) | 1570 | 1060 | 1.66
Brain | 1541 | 1025 | 1.60
Liver | 1549 | 1065 |
Kidney | 1561 | 1038 | 1.62
Skull bone | 4080 | 1912 |
Muscle | 1580 | 1075 | 1.70

(i) Calculate the acoustic impedance for the liver and the skull bone.
(ii) Predict whether ultrasound could be used to obtain images of the lung. Explain your prediction.
(d) What is the function of the gel used in ultrasound?
(e) Identify the factors that affect the choice of the diagnostic frequency used in ultrasound.
(f) Distinguish between A-scans and B-scans used in ultrasound diagnosis.
(g) Discuss some of the advantages and disadvantages of ultrasound in medical diagnosis.

12. (a) examine the growth of a foetus.
(b) detect a broken bone.
(c) detect a tumour in the brain.

13. The following figure shows the variation in intensity I of a parallel beam of X-rays after it has been transmitted through a thickness x of lead.

[Graph: intensity I (vertical axis, 0 to 20) against thickness x / mm (horizontal axis, 0 to 10)]

(a) Define half-value thickness.
(b) Estimate the half-value thickness for this beam in lead from the graph.
(c) Determine the thickness of lead that is required to reduce the intensity by 40% of the initial value.
(d) Another sample of lead has a half-value thickness of 4 mm. Determine the thickness of this lead that would reduce the radiation intensity by 80%.
(e) The linear attenuation coefficient was 8 m⁻¹ for 1.2 MeV radiation incident on a tissue. Calculate the thickness of the tissue that is required to reduce the intensity of the radiation by half.

14. (a) State a typical value for the frequency that is used in ultrasound imaging.
(b) The figure below shows an ultrasound transmitter/receiver placed in contact with the skin.

[Figure: transmitter/receiver on the skin, a fat layer beneath the skin, and an organ X of length l at distance d]

There is a layer of fat and an organ X at distance d from the fat layer. The organ's length is l. On the following graph, the pulse strength of the reflected pulses is plotted against time t, where t is the time elapsed between the pulse being transmitted and the time that the pulse is received.

[Graph: pulse strength against t, showing reflected pulses A, B, C and D; time axis marked 50, 100, 150, 200, 250]

(i) Indicate on the figure the origin of the reflected pulses shown on the graph.
(ii) The mean speed in tissue and muscle of the ultrasound used in this scan is 2.0 × 10³ m s⁻¹. Using data from the graph above, estimate the depth d of the organ beneath the skin and the length l of the organ X.
(c) Is the scan above an A-scan or a B-scan? Explain the difference between these types of scan.

I.3 RADIATION IN MEDICINE

I.3.1 State the meanings of the terms exposure, absorbed dose, quality factor (relative biological effectiveness) and dose equivalent as used in radiation dosimetry.
I.3.2 Discuss the precautions taken in situations involving different types of radiation.
I.3.3 Discuss the concept of balanced risk. (Students should appreciate that codes of practice have been developed for conduct involving the use of radiations).
I.3.4 Distinguish between physical half-life, biological half-life and effective half-life.
I.3.5 Solve problems involving radiation dosimetry.
I.3.6 Outline the basis of radiation therapy for cancer.
I.3.7 Solve problems involving the choice of radio-isotope suitable for a particular diagnostic or therapeutic application.
I.3.8 Solve problems involving particular diagnostic applications.
© IBO 2007
Dosimetry is the measurement of the dose delivered by radiation. Recall that radiation can be transmitted in the form of electromagnetic waves or as energetic particles, and that, when sufficient energy is absorbed by an atom, it can cause the release of electrons and the formation of positive ions. When radiation causes ions to form it is called ionising radiation. Ionising
radiation is produced by X-rays, CAT, radioactive tracers and radiopharmaceuticals, as well as by many other natural and artificial means. When ionising radiation penetrates living cells at the surface or within the body, it may transfer its energy to atoms and molecules through a series of random collisions. The most acute damage is caused when a large functioning molecule such as DNA is ionised, leading to changes or mutations in its chemical structure. If the DNA is damaged it can cause premature cell death, prevention or delay of cell division, or permanent genetic modification. If genetic modification occurs, the mutated genes pass the information on to daughter cells. If genetic modification occurs in sperm or egg cells, the mutated genes may be passed on to offspring.

Since the body is 65% water by weight, most of the radiation energy is absorbed by the water content. This energy can produce ions (H⁺, OH⁻, H₃O⁺) and electrically neutral free radicals of water. These ions and free radicals can cause chemical reactions with other chemical constituents of the cell. For example, OH⁻ ions and OH free radicals can form the strong oxidising reagent hydrogen peroxide (H₂O₂), which can interfere with the carbon-carbon double bonds within the DNA molecule, causing rupture of the double helical strands. Free radicals may also cause damage to enzymes that are required for the metabolism of the cell, or they can affect the membranes that are vital for the transport of materials within the cell.

Ionising radiation appears to affect different cells in different ways. Cells of the reproductive organs are very radiation-sensitive and sterility is a common outcome after radiation exposure. Bone and nerve cells are relatively radiation-resistant. However, radiation of bone marrow leads to a rapid depletion of stem cells that can then induce anaemia or even leukemia.
Exposure to radiation results in a range of symptoms including skin burns, radiation sickness (nausea, vomiting, diarrhoea, loss of hair, loss of taste, fever), cancer, leukemia and death. External exposure to α-particles is fairly harmless as they will be absorbed by a few microns of skin. Internal exposure after ingestion is very damaging as the particles are very ionising and they can interact with body fluids and gases. β-particles are more penetrating but because of their irregular paths upon entering body tissue they are considered to have low ionising ability. X-rays and γ-radiation have high ionising ability (but not as high as ingested α-particles), and are a common cause of disruption to normal cellular metabolism and function. The relative sensitivity of different types of cells to ionising radiation can be summarised using the following four characteristics known as the Laws of Bergonie and Tribondeau:
1. AGE

2. DIFFERENTIATION

3. METABOLIC RATE
Cells that use energy rapidly are more sensitive than those with a slower metabolism.

4. MITOTIC RATE
Cells that divide and multiply rapidly are more sensitive than those that replicate slowly.

All living things are being exposed to cosmic radiation from the Sun and space, and terrestrial radiation from the lithosphere (uranium and thorium series and isotopes of radon). It is important that we have measures in place to monitor both this background and artificial radiation produced in medicine and the nuclear industry. In 1928, the body now known as the International Commission on Radiological Protection (ICRP) was set up to make recommendations as to the maximum amounts of radiation which people could safely receive. Radiation dosimetry deals with the measurement of the absorbed dose or dose rate resulting from the interaction of ionising radiation with matter. The ways of measuring and monitoring radiation will now be discussed.
Exposure
It has been found that when X-rays and γ-radiation pass through matter, a measurement of their ionising ability gives a good indication of the total energy absorbed. It is also known that the energy absorbed per kilogram in air and tissue is similar. The term exposure X is defined for X-radiation and γ-radiation as the total charge, Q, of ions of one sign (either electrons or positive ions) produced in air when all the β-particles liberated by photons in a volume of air of mass m are completely stopped by the air.
$$X = \frac{Q}{m}$$

It can be seen that the units for exposure are C kg⁻¹.
The measurement of exposure has limited applications because it only applies to X-rays and γ-radiation, and it only refers to ionisation in air and not to absorption in body materials. It would be better to have a monitoring quantity that applies to all forms of radiation in all materials.

Absorbed dose

A more useful quantity is absorbed dose D. It is defined as the amount of energy E transferred per unit mass m of the absorbing material.

$$D = \frac{E}{m}$$

The SI unit of absorbed dose is J kg⁻¹, otherwise known as the gray (Gy). The absorbed dose is difficult to measure directly and, as a result, it is usually calculated from measurements of exposure. The relationship between exposure and absorbed dose can be determined for X-rays and γ-radiation. On average, the energy required to release one electron (producing one ion pair) in air is found to be 34 eV. Now 1 eV = 1.6 × 10⁻¹⁹ J, and 1 C is the charge on 6.25 × 10¹⁸ electrons.

Therefore, an exposure of 1 C kg⁻¹ releases 6.25 × 10¹⁸ electrons per kilogram and corresponds to a dose of

(34 eV) × (1.6 × 10⁻¹⁹ J eV⁻¹) × (6.25 × 10¹⁸ kg⁻¹) = 34 J kg⁻¹ = 34 Gy

Therefore, in air the absorbed dose in gray is 34 times the exposure in C kg⁻¹:

$$D = 34X$$

where X is the exposure. This is the value for the energy absorbed in air. The energy absorbed per kilogram for other biological materials will depend on the type of material and the photon interactions (scattering, photoelectric effect and Compton scattering). Therefore, each medium will have its own absorption property, and identical exposure will lead to different absorbed doses in different materials. For other media,

$$D = fX$$

where f is a factor dependent on the material involved and the types of photon interactions involved.
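The conversion from exposure to absorbed dose can be checked numerically. The short Python sketch below repeats the arithmetic using the values quoted in the text (34 eV per electron released in air and 1.6 × 10⁻¹⁹ C per electron). The function names are illustrative only, and the factor f for other media has to be supplied by the user because no values for it are given here.

```python
# Values quoted in the text.
ENERGY_PER_ION_PAIR_EV = 34.0    # average energy to release one electron in air, in eV
ELEMENTARY_CHARGE_C = 1.6e-19    # charge on one electron in C (numerically also J per eV)


def absorbed_dose_in_air(exposure_c_per_kg):
    """Return the absorbed dose in air (Gy) for an exposure given in C/kg."""
    # Number of electrons released per kilogram of air.
    electrons_per_kg = exposure_c_per_kg / ELEMENTARY_CHARGE_C
    # Energy in joules needed to release one electron.
    energy_per_electron_j = ENERGY_PER_ION_PAIR_EV * ELEMENTARY_CHARGE_C
    return electrons_per_kg * energy_per_electron_j


def absorbed_dose(exposure_c_per_kg, f):
    """Return D = f X for a medium with conversion factor f (Gy per C/kg).

    f is an assumed input: the text does not list values for other media.
    """
    return f * exposure_c_per_kg
```

Calling absorbed_dose_in_air(1.0) reproduces the 34 Gy obtained above for an exposure of 1 C kg⁻¹, and absorbed_dose(X, 34.0) gives the same result as the air formula.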
The β-dose is monitored by comparing the blackening through the open window and different thicknesses of plastic. The dural (an alloy of aluminium) window absorbs beta particles and low-voltage X-rays and gamma rays. The lead/tin window absorbs all but the highest energy X-rays and gamma rays. The cadmium/lead window absorbs most of the X-rays and gamma rays, but thermal neutrons interact with the cadmium to produce gamma radiation, which blackens the film. The badges are assessed every two to four weeks and the TLD every 3 months. The ICRP recommends that the effective dose to industry workers should not exceed an average of 20 mSv per year over any five-year period, with the proviso that the effective dose should not exceed 50 mSv in any one year. Members of the public should not receive more than 1 mSv per year from non-medical, artificial sources. It is considered that a dose of 250 mSv of whole body irradiation, or 750 mSv to a part of the body, is an overdose.
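As an illustration of how the occupational limits quoted above combine, the following Python sketch checks a record of yearly effective doses against both conditions. It is only a sketch: the function name and the list-of-yearly-totals format are assumptions made for the example, and a record shorter than five years is tested against the single-year limit only.

```python
SINGLE_YEAR_LIMIT_MSV = 50.0     # maximum effective dose in any one year
FIVE_YEAR_AVERAGE_MSV = 20.0     # maximum average over any five-year period


def within_occupational_limits(annual_doses_msv):
    """Return True if a list of yearly effective doses (mSv) meets both limits."""
    # Condition 1: no single year may exceed 50 mSv.
    if any(dose > SINGLE_YEAR_LIMIT_MSV for dose in annual_doses_msv):
        return False
    # Condition 2: every run of five consecutive years must average
    # no more than 20 mSv per year.
    for start in range(len(annual_doses_msv) - 4):
        window = annual_doses_msv[start:start + 5]
        if sum(window) / 5.0 > FIVE_YEAR_AVERAGE_MSV:
            return False
    return True
```

For example, a worker who receives 45 mSv in one year and 10 mSv in each of the next four years stays within the limits (the five-year average is 17 mSv), whereas 30 mSv every year does not, even though no single year exceeds 50 mSv.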
badge casing
$$\text{RBE} = \frac{\text{absorbed dose of 250 keV X-rays for a given effect}}{\text{absorbed dose of another radiation to produce the same magnitude of effect}}$$
In 1962, the ICRU replaced the RBE with the dose equivalent quantity (the RBE factor is conceptually related to the quality factor). The reason for this was because the biological effect of radiation exposure varies according to the type and energy of radiation, and because equal doses of various types of radiation do not necessarily result in equal biological effects. Therefore, the terms quality factor and relative biological effectiveness can be taken to mean the same thing.
Figure 2035
For the patient, it is known that radiation doses to the bone marrow in the order of 3 000 to 4 000 mSv have lethal effects within a month in about half of the exposed people in the absence of specialised medical treatment. Single doses over 2 000 mSv absorbed by the testes or 3 000 mSv absorbed by the ovaries can cause permanent sterility. The specialised medical treatment in the case of doses up to 10 000 mSv would include isolation of the patient in a sterile environment, selective treatment with antibiotics and stimulation of leukocyte production in order to offset damage to white blood cells. Bone marrow transplant may also be necessary. So you can see that protection is paramount in any medical diagnosis or therapy. Some typical radiation doses used in medicine are given in Figure 2036.
open window
photographic film
Medical source | Dose / mSv
Annual cosmic radiation reference at sea level | 0.3
Chest X-ray | 0.03
Dental X-ray | 0.14
Mammography | 0.4
Head CT | 1.8
Abdomen CT | 7.2
Thyroid radiopharmaceutical | 2.0
Heart scan radiopharmaceutical | 17
Soft tumour radiopharmaceutical | 22
Thyroid therapy | 8 000

Figure 2036 Radiation doses

As already mentioned, film badges or thermoluminescent badges are worn for monitoring purposes. The typical universal trefoil radiation symbol should be clearly signposted in any area where a radioactive source is being used. In addition, all workers at risk of being exposed to radiation should obey the following basic rules:

1. Wear laboratory coats or other protective clothing at all times in areas where radioactive materials are used.
2. Wear disposable gloves at all times while handling radioactive materials.
3. Either after each procedure or before leaving any area, monitor your hands for contamination in a low-background area.
4. Do not eat, drink, smoke, or apply cosmetics in any area where radioactive material is stored or used.
5. Do not store food, drink, or personal effects in areas where radioactive material is stored or used.
6. Wear required personnel monitoring devices at all times while in areas where radioactive materials are used or stored.
7. Dispose of radioactive waste only in designated, labeled and properly shielded receptacles.
8. Never pipette by mouth.
9. Confine radioactive solutions in CLEARLY labeled containers.
10. Secure all radioactive material when not under the constant surveillance and immediate control of the authorized users and their staff.

1. No practice shall be adopted unless its introduction produces a positive net benefit.
2. All exposures shall be kept As Low As Reasonably Achievable (ALARA), with economic and social factors being taken into account.
3. The dose equivalent to individuals shall not exceed the limits recommended for the appropriate circumstances by the Commission.

This system has set guidelines to establish procedures and principles to minimise radiation exposure of patients and workers. The main components involve the way in which the source is isolated and the protection offered to the individual. In industry, full protective clothing, including coats, gloves, overshoes and in some cases respirators, is worn to prevent contamination through the skin and wounds.
Solution

Average ionisation energy in air = 34 eV
Exposure of one unit = 1 C kg⁻¹
Number of electrons released per kilogram = 1 C kg⁻¹ / e
Energy absorbed = (34 × e) J × (1 / e) kg⁻¹ = 34 J kg⁻¹
Absorbed dose = E / m = 34 J / 1 kg = 34 Gy

Example

How much energy is absorbed when a person of mass 70 kg receives an effective dose equivalent of 30 mSv, half the dose equivalent being acquired from radiation of quality factor 1 and half from radiation of quality factor 3?

Solution

H₁ = D₁ × 1 = 15 mSv, so D₁ = 15 mJ kg⁻¹
H₂ = D₂ × 3 = 15 mSv, so D₂ = 5 mJ kg⁻¹
Total absorbed dose = D₁ + D₂ = 20 mJ kg⁻¹
Therefore, energy absorbed by 70 kg = 70 kg × 20 mJ kg⁻¹ = 1400 mJ = 1.4 J

Example

The isotope iodine-131 can be used to treat malignant growths in the thyroid gland. The isotope has a physical half-life of 8 days and a biological half-life of 21 days. Calculate its effective half-life and determine the time it would take the activity to decrease to 1/8th of its initial activity.

Solution

1 / T_E = 1 / 8 + 1 / 21
1 / T_E = 0.1726
T_E = 5.8 days
An activity of 1/8 of the initial value is reached after 3 effective half-lives. So the time is 3 × 5.8 days = 17.4 days.

Example

α-radiation and γ-radiation have different quality factors. State which type of radiation has the largest quality factor, and explain why the radiations have different effects for the same absorbed dose.

Solution

α-particles have the largest quality factor because they cause more ionisation per unit length of their track.

Example

(a) Describe what is meant by the term quality factor (relative biological effectiveness).

A person of mass 75 kg has his whole body exposed to monochromatic X-rays of energy 250 keV. Because of this exposure, he receives a dose equivalent of 0.50 mSv in 2.0 minutes.

(b) Deduce that the person absorbs approximately 10¹⁰ photons per second.

Solution

(a) Different radiations of the same intensity cause different amounts of ionization in the body. The quality factor compares the relative biological effectiveness of different types of radiation to that of X-rays, which are given a quality factor of 1.

(b) D = E / m. Since the quality factor of X-rays is 1, D = 0.50 mGy = 5.0 × 10⁻⁴ J kg⁻¹.
E = m D = 75 kg × 5.0 × 10⁻⁴ J kg⁻¹ = 3.75 × 10⁻² J.
Energy of one photon = 250 × 10³ eV × 1.6 × 10⁻¹⁹ J eV⁻¹ = 4.0 × 10⁻¹⁴ J.
So the number of photons in 1 second = 3.75 × 10⁻² J / (120 s × 4.0 × 10⁻¹⁴ J) = 7.8 × 10⁹ ≈ 10¹⁰ photons per second.
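The arithmetic in the worked examples above can be repeated in a few lines of Python. This is a sketch for checking calculations of this kind, not part of the syllabus; the function names are illustrative, and the value 1.6 × 10⁻¹⁹ J per eV is the one used in the text.

```python
ELECTRON_VOLT_J = 1.6e-19   # joules per electron-volt, as used in the text


def effective_half_life(physical, biological):
    """Combine half-lives using 1/T_E = 1/T_P + 1/T_B (same time unit for all)."""
    return 1.0 / (1.0 / physical + 1.0 / biological)


def absorbed_energy(mass_kg, contributions):
    """Return the total energy (J) absorbed by a body of the given mass.

    contributions is a list of (dose_equivalent_in_Sv, quality_factor) pairs.
    Each absorbed dose is D = H / Q, in J/kg.
    """
    total_dose_gy = sum(h / q for h, q in contributions)
    return mass_kg * total_dose_gy


def photons_per_second(mass_kg, dose_equivalent_sv, quality_factor,
                       photon_energy_ev, time_s):
    """Return the number of photons absorbed per second for a whole-body exposure."""
    energy_j = absorbed_energy(mass_kg, [(dose_equivalent_sv, quality_factor)])
    photon_energy_j = photon_energy_ev * ELECTRON_VOLT_J
    return energy_j / (photon_energy_j * time_s)


# Iodine-131: physical half-life 8 days, biological half-life 21 days.
t_e = effective_half_life(8.0, 21.0)
print(round(t_e, 1), round(3 * t_e, 1))   # 5.8 17.4
```

Here absorbed_energy(70, [(0.015, 1), (0.015, 3)]) repeats the 70 kg example and returns 1.4 J, and photons_per_second(75, 0.50e-3, 1, 250e3, 120) evaluates the photon rate for the X-ray exposure in the last example.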
With internal radiotherapy, the radioactive isotope administered is taken up by that part of the body that is specific to the biological path of the disease site. Alternatively, the radioactive source may be attached to a biological compound such as immunoglobulin molecules (monoclonal antibodies) that, after insertion, lodges at the disease site. β-emitters are mainly used in internal radiotherapy as they deposit their energy close to the site being treated and, as a result, do little damage to the surrounding tissue. Iodine-131, iodine-123, phosphorus-32, samarium-153 and yttrium-90 are some radioisotopes used in internal radiotherapy.
Iodine-131 is used either as a diagnostic or therapeutic radiation source. It has a half-life of 8 days and emits β-particles as well as γ-radiation. It is used to treat the thyroid gland for cancers and other abnormal conditions such as hyperthyroidism (over-active thyroid). It is given to the patient orally as a sodium iodide solution and is absorbed by the gastrointestinal tract, where it passes into the bloodstream. It then concentrates in the thyroid, where it is used to make hormones specific to that gland. Iodine-123, a γ-emitter, is also used.

Phosphorus-32 is used to control a disease called polycythemia vera. In this disease, an excess of red blood cells is produced in the bone marrow. Phosphorus-32 is used to control this excess. Samarium-153 ethylene diamine tetramethylene phosphonate, known commercially as Quadramet, is used internally to reduce pain associated with primary tumours of the breast, prostate and some other cancers. Yttrium-90 is used for liver cancer therapy.

External radiotherapy, called teletherapy, commonly uses the isotope cobalt-60 as a source of γ-radiation. It is produced by neutron bombardment of the common isotope cobalt-59 in a nuclear reactor. It produces penetrating gamma rays of sufficiently high energy, around 1.25 MeV. This is equivalent to X-rays generated at 3 MV. The equipment requirements for teletherapy are simpler than those for X-ray therapy, and it does not have the high voltage hazards associated with X-rays. The tumour to be irradiated is pinpointed using laser beams. The cobalt-60 source is located near the centre of a lead-filled steel container known as a head. During therapy, a shutter is opened by a motor and the emerging gamma rays are collimated before striking the patient. In order to minimise the impact on healthy tissue, multiple-beam and rotational therapy are used for deep tumours. Either the radioactive source is rotated or the patient is rotated. Unfortunately, the radioactivity of cobalt-60 cannot be switched off like an X-ray machine.
Cobalt-60 therapy is losing favour these days, with preference in developed countries for accelerators (betatrons or linacs) that use X-rays or high-energy protons. The results for cancers of the pelvis, cervix, larynx and pituitary gland have been more successful with X-rays than with cobalt-60. Iridium-192 implants (brachytherapy) that emit β-particles and low-energy γ-radiation are now commonly used to treat breast cancer and cancers of the mouth. These are produced in wire form and are introduced through a catheter to the target area, usually in the head or breast. After a time period calculated to give the correct dose, the implant wire is removed to shielded storage. This procedure gives less overall radiation to the body, is more localised to the target tumour and is cost effective.
One of the most popular radiopharmaceuticals is technetium-99m. It has a short half-life of 6 hours and it is considered to be non-toxic. It can be manufactured in major hospitals from its parent nuclide molybdenum-99, which has a half-life of 66 hours. This means a molybdenum-99 source can produce 99mTc for over a week before it needs to be replenished. It is an excellent tracer for many diagnostic purposes. Figure 2037 describes this radioisotope together with others in terms of their main applications.

Radioisotope | Half-life | Applications
Au-198 | 2.7 days | Liver imaging
Ga-67 | 72 hours | Soft-tissue tumour detection
Sr-85 | 64 days | Bone imaging
Tl-201 | 74 hours | Coronary artery disease diagnosis
I-122 | 3.6 minutes | Brain blood flow
I-123 | 13.1 hours | Brain, kidney, heart studies
I-131 | 8.04 days | Thyroid imaging
Tc-99m | 6 hours | Imaging of brain, thyroid, lungs, liver, spleen, kidney, gall bladder
Xe-127 | 36.4 days | Imaging of brain disorders such as dementia

Exercise 20.3

1. Calculate the absorbed dose in air when the exposure is 2 units.

2. Determine how much energy is absorbed when a person of mass 50 kg receives an effective dose equivalent of 20 mSv, half from radiation of quality factor 1 and half from radiation of quality factor 2.

3. A certain source produces an exposure rate of 500 C kg⁻¹ h⁻¹ at 1.0 m from the source. Determine the distance at which a barrier must be placed if the exposure rate at the barrier is not to exceed 30 C kg⁻¹ h⁻¹.

4. Iodine-131 can be used to label albumen that is found in blood serum. It has a physical half-life of 8 days and a biological half-life of 21 days. Calculate the effective half-life when iodine-131 is used for this purpose.
5. Radioactive substances have been used since the 1960s to assist in the diagnosis of body metabolism (chemical reactions within the cells of organs and tissues) and diseases.
(a) Discuss the use of radioactive tracers in medical diagnosis and in the study of body function.
(b) Explain why radioactive tracers of the gamma-emitter type are preferred.
(c) Certain stable isotopes tend to accumulate in particular parts of the body. Give an example of such a stable isotope and the body part where accumulation occurs.
(d) Describe why it is important that any radioactive tracer used has a short half-life.
(e) One of the most popular radioisotopes is technetium-99m. It has a short half-life of 6 hours, a gamma ray emission of 140 keV, and it is considered to be non-toxic. It can be manufactured in major hospitals from its parent nuclide molybdenum-99 with a half-life of 66 hours. State two reasons why these physical properties of 99mTc make it an excellent radioisotope for nuclear scanning.

6. Explain why, when using radioactive tracer elements in the treatment of cancer, it is better to use radioactive isotopes that have a long physical half-life and a short biological half-life.

7. A beam of protons with a quality factor 11 is directed at a tumour of mass 0.10 kg. Each proton has an energy of 4.2 MeV and 1.9 × 10¹⁰ are incident on the tumour each second. A dose equivalent of 250 J kg⁻¹ is required to destroy the tumour. If all the incident protons are absorbed within the tumour, determine the exposure time needed to destroy the tumour.

8. (a) Outline the biological effects that can occur when a person is subjected to ionising radiation.
(b) Define the terms exposure, absorbed dose, quality factor and dose equivalent as used in radiation dosimetry.
(c) Explain, with reference to α and γ radiation, the distinction between absorbed dose and dose equivalent.
(d) Calculate the absorbed dose per kilogram in air when a person is subjected to an exposure of 1 unit (the average ionisation energy in air is 34 eV).
(e) Determine how much energy is absorbed when a person of mass 50 kg receives an effective dose equivalent of 30 mSv, half the dose equivalent being acquired from radiation of quality factor 1, and half from radiation of quality factor 3.
(f) Discuss the precautions required in situations involving radiation and the types of protection that may be used.
(g) Explain the difference between biological half-life, physical half-life and effective half-life.
(h) Iodine-131 can be used to label albumen that is found in blood serum. It has a physical half-life of 8 days and a biological half-life of 20 days.
(i) Calculate the effective half-life when iodine-131 is used for this purpose.
(ii) What percentage activity will remain after 40 days?
(i) Discuss the different mechanisms and different types of sources that can be used in radiation therapy for cancer.
PARTICLE PHYSICS
J.1 Particles and interactions
J.2 Particle accelerators and detectors
J.3 Quarks
J.4 Leptons and the standard model
J.5 Experimental evidence for the quark and standard model
J.6 Cosmology and strings
21
CHAPTER 21 (OPTION J)
In Figure 2101, a voltage of 50 kV is applied to a pointed conductor at the bottom so that electrons are pulled off the moving insulating belt and the positive charges produced are carried to the top of the belt, where they are transferred to the dome. Since there is no electric field inside a hollow conductor, the charges move to the outside of the dome because electrons are taken to earth by the belt. The conducting dome is hollow so as to allow a large charge build-up on its outside. The Van de Graaff generator is connected to an evacuated accelerator tube containing hydrogen or helium ions. These are repelled by the high positive voltage and are accelerated towards an earthed target. When these ions crash into a target material, new particles of different masses and sizes can be produced. Several hundred composite and elementary particles, with corresponding antiparticles, have now been identified.

In 1930, Ernest Lawrence developed a small cyclotron, and between the 1980s and 1990s we saw the development of the Stanford Linear Accelerator Centre (SLAC, California) electron-positron linear collider (3 km long), the CERN (Conseil Européen pour la Recherche Nucléaire) large electron-positron circular collider in Geneva (27 km circumference) and the Fermilab Tevatron (6.4 km circumference, near Chicago), which collides protons and anti-protons inside electronic detectors. CERN presently has 2 accelerators that are being continually upgraded. These accelerators and others in Germany, Japan, Russia and China will be studied fully in Section J.2.

Particle physicists have discovered hundreds of new varieties of particles with many weird names such as quarks, neutrinos, gauge bosons and muons, some of which carry a colour charge. Particle physicists talk about the flavor of some elementary particles. The up, down and strange quarks were originally called vanilla, chocolate and strawberry, and thus the term flavor has stuck.
Murray Gell-Mann was quite a comedian, and when he came up with the term quark to describe a group of elementary particles, scientists asked him for the origins of the word. During various lectures he was known to call them quirks and quorks, but the word quark stuck; it has its origins in James Joyce's novel Finnegans Wake: "Three quarks for Muster Mark". Such is the menagerie of terms that some have coined the term sub-atomic zoo to describe the variety of particles.

As has been mentioned a number of times already in this textbook, there are 4 fundamental forces: the strong force, the electromagnetic force, the weak force and gravity, and all particles are governed by these forces. In the past 50 years, particle theorists have organised what has been found by particle experimenters into a standard model of elementary particles, and they hope to unite the forces into a Grand Unified Theory (GUT). Maxwell was able to show in the 1860s that the electric force and the magnetic force could be unified into a single electromagnetic force, and his theory has been further refined into quantum electrodynamics (QED). In the 1960s, electromagnetism and the weak force were unified into the electroweak theory, which predicted the existence of the Z boson as one of the exchange particles that mediate the weak force. Particle experimenters later found the Z boson, confirming the theorists' prediction. In 1973, the discovery of asymptotic freedom established quantum chromodynamics (QCD) as the correct theory to explain the nature of the strong force. The calculations done by Gross, Wilczek and Politzer showed that quarks are held together very strongly at distances comparable to the size of a proton, and this explained the concept of quark confinement. Now the search is on to find the link between the strong force, the electromagnetic force, the weak force and gravity.
These major breakthroughs in particle physics have spurred on the developments in astrophysics and cosmology that hope to answer the questions concerning the origin and evolution of the universe - why does it have its shape and form and will it reach a point where it will stop expanding. Perhaps one of the best outcomes of particle physics is that it has brought scientists from many nationalities into a collaborative working environment. Accelerators are very expensive to construct and operate, and the CERN accelerator that passes underground into Switzerland and France is funded by 19 European countries and employs over 100 research physicists, over 800 applied physicists and engineers, over 1000 technicians and 1000+ office and administration staff and craftsmen. This option offers the TOK student an opportunity to reflect on the nature of observation, the meaning of measurement and the meaning of evidence among other things.
PARTICLE PHYSICS
1. Leptons
Leptons are particles that can travel on their own, meaning that they are not trapped inside larger particles. Six distinct types, called flavors, have been identified along with their antiparticles. The first generation, which makes up ordinary matter, comprises the electron (with a size of less than 10⁻¹⁸ m) with its antiparticle the positron, and the neutrino with its antiparticle the antineutrino. Electrons have a negative charge, while positrons have a positive charge. Neutrinos are neutral in charge. Leptons interact via the weak nuclear force and the gravitational force and, where a lepton is charged, the electromagnetic force, but not via the strong force.

There are other leptons which are believed to have existed in the early moments of the Big Bang and may be found in cosmic rays and particle accelerators. They are the second generation muon and muon neutrino and their antiparticles, which are heavier than the electron, and the third generation tau and tau neutrino (found at the Stanford collider but not found in nature) and their respective antiparticles, which are heavier still than the muon. Leptons have an electric charge of −1, +1 or 0, as can be seen in Figure 2102. The Roman numerals give the generation of each lepton, and the last number gives the rest mass of each lepton relative to the rest mass of a proton, mp.

Figure 2101 (diagram of the Van de Graaff generator; label: connecting wire)

Leptons
Generation  Name                   Symbol  Charge  Rest mass (relative to mp)
I           electron               e⁻      −1      0.0005
I           electron neutrino      νe      0       ~0
II          muon                   μ⁻      −1      0.1
II          muon neutrino          νμ      0       ~0
III         tau                    τ⁻      −1      1.9
III         tau neutrino           ντ      0       ~0

Antileptons
Generation  Name                   Symbol  Charge  Rest mass (relative to mp)
I           positron               e⁺      +1      0.0005
I           electron antineutrino  ν̄e      0       ~0
II          antimuon               μ⁺      +1      0.1
II          muon antineutrino      ν̄μ      0       ~0
III         antitau                τ⁺      +1      1.9
III         tau antineutrino       ν̄τ      0       ~0

Figure 2102
In spite of the different masses of each flavor of lepton, they all have an identical spin of ½ and the same angular momentum. The muon is unstable and decays, on average after 2.2 μs, into an electron, an electron antineutrino and a muon neutrino through the weak interaction. The tau lepton decays on average after 3 × 10⁻¹³ s. Leptons are found in many environments. The electron is the charge-carrier in conductors and semi-conductors. Electron antineutrinos are found in the beta decay of a neutron into a proton. The remaining flavors are found in nuclear reactors, particle accelerators and cosmic rays.

2. Quarks

Quarks, with a size of less than 10⁻¹⁸ m, can never be found in isolation as they are trapped inside other composite particles called hadrons, of which the proton, the neutron and mesons are examples. All six flavors of quarks have been identified. Quarks can experience weak interactions that can change them from one flavor to another.

Quarks
Generation  Name     Symbol  Charge  Rest mass (relative to mp)
I           UP       u       +2/3    1/3
I           DOWN     d       −1/3    1/3
II          CHARM    c       +2/3    1.7
II          STRANGE  s       −1/3    0.5
III         TOP      t       +2/3    186
III         BOTTOM   b       −1/3    4.9

Antiquarks
Generation  Name          Symbol  Charge  Rest mass (relative to mp)
I           ANTI UP       ū       −2/3    1/3
I           ANTI DOWN     d̄       +1/3    1/3
II          ANTI CHARM    c̄       −2/3    1.7
II          ANTI STRANGE  s̄       +1/3    0.5
III         ANTI TOP      t̄       −2/3    186
III         ANTI BOTTOM   b̄       +1/3    4.9

Figure 2103 Standard model for the quarks

Hadrons are classified as:
(A) MESONS
Examples are the pions (π⁺, π⁻ and π⁰), the kaons (K⁺, K⁻ and K⁰), the J/psi (J/ψ) and the eta (η⁰). Mesons can mediate the strong nuclear force and this will be discussed further in a later section. Like the second and third generation leptons, mesons only exist for a short time and they are thus very unstable.
(B) BARYONS
Baryons are the heavyweights amongst the particles that make up matter, and include the proton and the neutron. Other baryons include the lambda (Λ⁰), sigma (Σ⁺, Σ⁰ and Σ⁻), cascade (Ξ⁰ and Ξ⁻) and omega (Ω⁻) particles, to name but a few. Hadrons are not elementary particles because they are composed of quarks. Mesons consist of a quark and an antiquark. Baryons have three quarks. A proton has 2 up quarks and 1 down quark (uud), and the neutron has 2 down quarks and 1 up quark (udd). Hadrons interact predominantly via the strong nuclear force, although they can also interact via the other forces. Figure 2103 shows the generation, names, symbols, charges and rest mass (relative to the rest mass of the proton) of the flavors of quarks.
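The quark content of a hadron fixes its charge, and this can be checked with a little arithmetic. The sketch below is only an illustration: the quark charges are those listed in Figure 2103, while the function name and the trailing `~` used to mark an antiquark are notation invented here.

```python
from fractions import Fraction

# Quark charges in units of the elementary charge e (values as in Figure 2103).
QUARK_CHARGE = {
    "u": Fraction(2, 3), "c": Fraction(2, 3), "t": Fraction(2, 3),
    "d": Fraction(-1, 3), "s": Fraction(-1, 3), "b": Fraction(-1, 3),
}


def hadron_charge(quarks):
    """Return the total charge of a hadron from its quark content.

    An antiquark is written with a trailing '~' (hypothetical notation for
    this sketch), for example 'd~' for the antidown quark.
    """
    total = Fraction(0)
    for q in quarks:
        if q.endswith("~"):
            total -= QUARK_CHARGE[q[:-1]]  # an antiquark has the opposite charge
        else:
            total += QUARK_CHARGE[q]
    return total


proton = hadron_charge(["u", "u", "d"])    # baryon: three quarks
neutron = hadron_charge(["u", "d", "d"])   # baryon: three quarks
pi_plus = hadron_charge(["u", "d~"])       # meson: quark and antiquark
print(proton, neutron, pi_plus)
```

Exact fractions are used rather than floating-point numbers so that the thirds add up to whole charges without rounding error.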
Interaction      Exchange particle
strong           gluons (g)
weak             W⁺, W⁻, Z⁰
electromagnetic  photons
gravity          graviton
Higgs boson (hypothetical)

Figure 2104
Recall that when an electron transition occurs from a high energy level to a lower energy level, the energy difference is emitted in the form of a quantum of electromagnetic energy called a photon. The other exchange particles are similarly associated with quanta of energy. However, when we talk of a classical force we define it in terms of the rate of change of momentum. When looking at subatomic particles, quantum mechanics and relativity replace classical Newtonian mechanics, and a more sophisticated notion of force is required, one that is described in terms of an interaction. It has become evident that the transmission of force in a particular interaction is related to the energy and momentum carried by a quantum of the force field. However, one thing that is obvious when looking at the standard model for exchange particles is that the weak force exchange particles have mass whereas the others do not. Why do they have a mass nearly 100 times the mass of the proton while the other exchange particles have zero mass? This new force model will be explained more fully in section J.1.7.

Name                            Symbol    Mass / c²   Charge       Spin  Lifespan
Leptons
electron                        e⁻        0.511 MeV   −1           ½     stable
positron                        e⁺        0.511 MeV   +1           ½     stable
muon and antimuon               μ⁻, μ⁺    105.6 MeV   −1, +1       ½     2 × 10⁻⁶ s
tau and antitau                 τ⁻, τ⁺    1.784 GeV   −1, +1       ½     3 × 10⁻¹³ s
electron neutrino/antineutrino  νe, ν̄e    < 50 eV     0            ½     stable
muon neutrino/antineutrino      νμ, ν̄μ    < 0.5 MeV   0            ½     stable
tau neutrino/antineutrino       ντ, ν̄τ    < 50 MeV    0            ½     stable
Quarks
up and anti-up                  u, ū      ~ 5 MeV     +2/3, −2/3   ½     stable
down and anti-down              d, d̄      ~ 10 MeV    −1/3, +1/3   ½     variable
strange and anti-strange        s, s̄      ~ 100 MeV   −1/3, +1/3   ½     variable
charm and anti-charm            c, c̄      ~ 1.5 GeV   +2/3, −2/3   ½     variable
bottom and anti-bottom          b, b̄      ~ 4.7 GeV   −1/3, +1/3   ½     variable
top and anti-top                t, t̄      > 30 GeV    +2/3, −2/3   ½     variable
Gauge Bosons
photon                          γ         0           0            1     stable
W-plus and W-minus              W⁺, W⁻    81 GeV      +1, −1       1     10⁻²⁵ s
Z                               Z⁰        93 GeV      0            1     10⁻²⁵ s
gluon                           g         0           0            1     stable

Figure 2105 Properties of leptons, quarks and exchange bosons

Name                            Symbol    Mass / c²   Charge       Spin  Lifespan
Mesons
pion (pi-zero)                  π⁰        135 MeV     0            0     0.8 × 10⁻¹⁶ s
pion (pi-plus), (pi-minus)      π⁺, π⁻    140 MeV     +1, −1       0     2.6 × 10⁻⁸ s
kaon (K-zero)                   K⁰        498 MeV     0            0     short: 0.9 × 10⁻¹⁰ s; long: 5 × 10⁻⁸ s
kaon (K-plus), (K-minus)        K⁺, K⁻    494 MeV     +1, −1       0     1.2 × 10⁻⁸ s
J/psi                           J/ψ       3.1 GeV     0            1     0.8 × 10⁻²⁰ s
Baryons
proton                          p         938.8 MeV   +1           ½     stable
antiproton                      p̄         938.8 MeV   −1           ½     stable
neutron, anti-neutron           n, n̄      939.6 MeV   0            ½     in nuclei: stable
lambda, anti-lambda             Λ, Λ̄      1.115 GeV   0            ½     2.6 × 10⁻¹⁰ s
sigma (sigma-zero)              Σ⁰        1.192 GeV   0            ½     6 × 10⁻²⁰ s
sigma (sigma-plus)              Σ⁺        1.189 GeV   +1           ½     0.8 × 10⁻¹⁰ s
sigma (sigma-minus)             Σ⁻        1.197 GeV   −1           ½     1.5 × 10⁻¹⁰ s
xi cascade (xi-minus)           Ξ⁻        1.321 GeV   −1           ½     1.6 × 10⁻¹⁰ s
xi cascade (xi-zero)            Ξ⁰        1.315 GeV   0            ½     3 × 10⁻¹⁰ s
omega minus                     Ω⁻        1.672 GeV   −1           3/2

Figure 2106
Mass
All the classes of particles have distinct masses. Figure 2105 and Figure 2106 introduce some common classes of particles, showing their masses, charge, spin and life-spans. Because the kilogram is a very large unit in which to measure the mass of a particle, the preferred unit for mass in particle physics is the electron-volt/c² (eV/c²).
However, one electron-volt is too small and so we usually talk in mega and giga electron-volts. The rest energy E₀ of a particle can be defined as the energy associated with its rest mass m₀. The Theory of Relativity demonstrated that mass and energy are equivalent, as given by the equation $E_0 = m_0 c^2$, so energy can be measured in electron-volts and rest mass can be measured in eV/c². It is often convenient to take the c² as understood and simply quote a particle's mass in MeV or GeV.

Quantum numbers

The Quantum Mechanics model of the atom was proposed in 1925 and 1926 to overcome some of the inadequacies of the Bohr model of the atom. Remember that the Bohr model had its limitations because, although it could account for the wavelengths of light absorbed and emitted by the hydrogen atom, it could not be applied to any other atom. Furthermore, it could not account for the hyperfine lines that exist due to the tiny splitting of energy levels. While quantum mechanics retained some aspects of the Bohr model, it predicted that the electrons were spread out in space in electron clouds of negative charge as a result of the wave nature of the electrons. These electron clouds can have various three-dimensional shapes such as spheres, dumbbells and donuts. The different states in which an electron can exist are determined by four quantum numbers:
• the principal quantum number n, where n = 1, 2, 3, ...
• the orbital quantum number l, where l = 0 to n − 1
• the magnetic quantum number ml, where ml = −l to +l
• the spin quantum number ms, where ms = +½ or −½

Without going into too much depth here, the principal quantum number is similar to the number n of the energy levels in the Bohr model, and it applies not only to electrons but to all leptons and baryons. These particles are called fermions (particles that have a half-integer spin). It defines the distinct energy levels or shells in which fermions can be located. The maximum number of fermions that the principal energy level can accommodate is 2n². So, when n = 1, there can be two fermions, when n = 2, there can be 8 fermions, and so on. Fermions move in orbitals, which can be depicted as three-dimensional probability regions. For each value of n, there are n² orbitals. So, for the third principal energy level, n = 3, and therefore there will be 9 orbitals.

The orbital quantum number is related to the orbital angular momentum. In the Bohr model:

$L = m v r_n = n \frac{h}{2\pi}$ where n = 1, 2, 3, ...

Angular momentum is a vector and as such the magnetic quantum number relates to the direction of the angular momentum. Spin will be discussed soon. Just as leptons can exist in lepton energy levels, so too can other elementary particles and their composites, the hadrons. This has already been discussed in section 13.2.3. Therefore, each elementary particle or composite of elementary particles can be specified in terms of its mass and various quantum numbers.

Conservation of energy

In particle physics, the total energy of the particles before a reaction or decay must be equal to the total energy of the particles after the reaction or decay. The law of conservation of energy involves 2 forms of energy in particle physics:
• the kinetic energy, which is dependent on the velocity of the particles
• the mass energy, which is given by $E = mc^2$.

It can be seen that the greater the mass of a particle, the greater is its mass energy. It follows that if a particle decays into other particles, then the mass of the decaying particle has to be greater than or equal to the total mass of the products. Let us say that particle X decays and forms Y and Z:

$X \rightarrow Y + Z$

However, if energy is to be conserved, the kinetic energies of particles Y and Z must be taken into account:

$X \rightarrow Y + E_{KY} + Z + E_{KZ}$

Therefore, the mass of X > the mass of Y + the mass of Z.

On the other hand, in a reaction between 2 particles the initial particles can have less mass than the total mass of the products, because the initial particles can bring kinetic energy into the reaction so that energy is still conserved.
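The mass condition for a decay can be checked numerically. The following is a minimal sketch, assuming rest energies in MeV as quoted in Figure 2105 and Figure 2106 and treating the neutrino masses as zero; the function names are invented for the illustration, and the test only covers energy conservation, not charge or the other conserved quantities.

```python
def decay_q_value(parent_mev, product_mevs):
    """Rest energy (MeV) left over as kinetic energy of the decay products."""
    return parent_mev - sum(product_mevs)


def energy_allows_decay(parent_mev, product_mevs):
    """True if the parent is at least as massive as all the products together."""
    return decay_q_value(parent_mev, product_mevs) >= 0


# Rest energies in MeV (values as quoted in Figure 2105 and Figure 2106).
MUON, ELECTRON = 105.6, 0.511
LAMBDA, PROTON, NEUTRON, PION_CHARGED = 1115.0, 938.8, 939.6, 140.0

# Muon decay into an electron and two neutrinos (neutrino masses taken as zero).
q_muon = decay_q_value(MUON, [ELECTRON, 0.0, 0.0])

# A lambda decaying into a proton and a negative pion.
q_lambda = decay_q_value(LAMBDA, [PROTON, PION_CHARGED])

# A free proton cannot decay into a neutron and a pion: the products are heavier.
proton_to_neutron = energy_allows_decay(PROTON, [NEUTRON, PION_CHARGED])

print(f"muon decay releases about {q_muon:.1f} MeV")
print(f"lambda decay releases about {q_lambda:.1f} MeV")
print("proton -> neutron + pion allowed by energy:", proton_to_neutron)
```

A positive value is shared among the products as kinetic energy, which is the role played by E_KY and E_KZ in the decay of X above.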
strangeness, colour, baryon number and lepton number. Spin will be discussed shortly and the other attributes will be studied in detail in sections J3 and J4.
Example
Determine whether the following reactions can occur due to charge conservation: (a) (b) (c) (d) e- + p + n - + p - + + p + n p + n + + e+ + e- + +
Solution
(a)
e- + p + n Q -1 + 1 = 0 + 0 charge is conserved.
(b)
- + p - + + Q -1 + 1 = -1 + 1 charge is conserved.
(c)
(d)
J.1.5 ANTIPARTICLES
All particles have antiparticles which are identical to the particle in mass and spin but are opposite in charge to their corresponding particle. In 1928, Paul Dirac, while trying to combine special relativity and quantum theory mathematically, came to the conclusion that particles with the same mass but opposite charge might exist somewhere in the Universe. In 1932, Carl Anderson found that cosmic radiation travelling through a cloud chamber during pair production split up into an electron path and the path of a particle with the same mass as an electron but curving in the opposite direction. Thus the positron had been discovered. In Figure 2107, incoming gamma radiation (from the bottom) produces two circular tracks, of a positron (to the left) and an electron (to the right), and a secondary electron-positron track.

Figure 2107

J.1.6 THE PAULI EXCLUSION PRINCIPLE

The Pauli exclusion principle states that an orbital can only contain a maximum of two electrons, and when 2 electrons occupy an orbital they have opposite spin. This principle is also extended to the shell model that explains nuclear energy states. Physicists were perplexed as to how a dense collection of strongly interacting nucleons could collide in the nucleus without losing energy during scattering. However, the Pauli exclusion principle explains why there is no loss of energy: only one nucleon can occupy a given energy state. The lower energy states fill up first, and particles in a higher energy state cannot lose their energy by dropping to a lower energy state because these energy levels are already full. Of course scattering can raise fermions to a higher energy level, but scattering which would lower the energy is not possible according to the Pauli exclusion principle.

Fundamental interactions

J.1.7 List the fundamental interactions.
J.1.8 Describe the fundamental interactions in terms of exchange particles.
J.1.9 Discuss the uncertainty principle for time and energy in the context of particle creation.

IBO 2007

INTERACTIONS
There are four fundamental interactions and some of their properties are shown in Figure 2109. As can be seen, the fundamental interactions are:
• Gravitational
• Weak
• Electromagnetic
• Strong
Since the early 1970s the electromagnetic and weak interactions have been shown to be two aspects of the same interaction, the electroweak interaction. The electromagnetic force is the cause of basic collisions between charged particles such as:

p + p → p + p

The strong force is mainly responsible for reactions between hadrons. For example,

p + p → p + p + π⁰

The weak force is the only force in lepton reactions that produce neutrinos, as they are electrically neutral and so will not be affected by the electromagnetic force. For example,

νe + μ⁻ → e⁻ + νμ

Interaction                    Acts on          On what particles               Exchange particle  Relative strength  Range
Gravitational                  Mass-energy      All                             Graviton?          10⁻³⁸              ∞
Weak (electroweak)             Flavour          Quarks, leptons                 W⁺, W⁻, Z⁰         10⁻⁵               ~10⁻¹⁸ m
Electromagnetic (electroweak)  Electric charge  Electrically charged particles  Photon             10⁻²               ∞
Strong (fundamental)           Colour charge    Quarks, gluons                  Gluons             1                  ~10⁻¹⁵ m
Strong (composite)                              Hadrons                         Mesons             Not applicable

Figure 2109 Fundamental interactions and some properties

The electromagnetic force is transmitted almost instantaneously, since photons travel at the speed of light. In 1935, based on the earlier theory of electromagnetic forces, the Japanese physicist Hideki Yukawa (1907-1981) suggested that the interchange of certain particles could also explain the strong nuclear force. These particles, members of the meson family, came to be known as pions (short for pi mesons). Yukawa suggested that the composite nucleons continuously exchange mesons with nearby nucleons without being altered themselves. Of course, this interaction must take place in a short enough time frame so as not to violate the law of conservation of mass-energy. Yukawa suggested that the mesons mediate the strong nuclear force in the same way that photons mediate the electromagnetic force, and that the strong nuclear force is responsible for holding nucleons together in the nucleus.

Yukawa's theory started a search for his predicted particle, and in 1947 pions were discovered in cosmic rays by Cecil Frank Powell (1903-1969). In 1949, Yukawa received the Nobel Prize in Physics for his prediction of the existence of pions. One year later, Powell also received the Nobel Prize for his discovery of mesons and for developing the method that enabled their discovery.

The strong force due to gluons only occurs within hadrons. The force that holds the nucleus together is caused by leakage from the gluon exchange. For example, in interactions between protons and neutrons there is an exchange of pions. Gluons are the exchange particles that are responsible for quark colour.
Just as positive and negative charges are associated with the electromagnetic force, three colour charges are associated with quarks and with the gluons that bind the quarks together. The linking between quarks and antiquarks is done by gluon clumps called glueballs. Colour will be discussed in detail in section J3.
Since the photon can transfer some or all of its energy to the particle, the uncertainty in the energy ΔE could be given by the equation $\Delta E \approx \frac{hc}{\lambda}$. So if we multiply the time and energy equations, the result is $\Delta E \, \Delta t \approx h$. Heisenberg's calculations arrived at the equation:

$\Delta E \, \Delta t \geq \frac{h}{4\pi}$

This form of the uncertainty principle says that the energy of a particle can be uncertain, or non-conserved, by an amount ΔE for a time $\Delta t \approx \frac{h}{4\pi \, \Delta E}$. This fact will become important when we look at Feynman diagrams.
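To get a feel for the time scale involved, the relation Δt ≈ h/(4πΔE) can be evaluated for a chosen energy. The sketch below is illustrative only: the rounded constants and the choice of 140 MeV (the rest energy of a charged pion) as the borrowed energy are assumptions made for the example.

```python
import math

H = 6.63e-34    # Planck constant in J s (rounded)
EV = 1.60e-19   # joules per electron-volt (rounded)


def max_borrow_time(delta_e_mev):
    """Time (s) for which an energy delta_e_mev (in MeV) can stay 'uncertain'."""
    delta_e_joule = delta_e_mev * 1e6 * EV
    return H / (4 * math.pi * delta_e_joule)


# Assumed example: an energy equal to the rest energy of a charged pion.
t_pion = max_borrow_time(140.0)
print(f"Energy of 140 MeV can be borrowed for about {t_pion:.2e} s")
```

The larger the energy that is borrowed, the shorter the time for which it can exist, which is why a heavy virtual particle cannot travel far.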
J.1.10 Describe what is meant by a Feynman diagram.
J.1.11 Discuss how a Feynman diagram may be used to calculate probabilities for fundamental processes.
J.1.12 Describe what is meant by virtual particles.
J.1.13 Apply the formula for the range R for interactions involving the exchange of a particle.
J.1.14 Describe pair annihilation and pair production through Feynman diagrams.
J.1.15 Predict particle processes using Feynman diagrams.
IBO 2007
Figure 2110 Feynman diagram for the electromagnetic force between 2 electrons (line labels: e, virtual photon)

Each point at which lines come together is called a vertex. Lines with arrows represent particles and the direction of their travel, while wavy lines represent the virtual particle. Backward arrows represent antiparticles. Feynman diagrams are space-time diagrams, with the vertical direction representing time (in this case plotted as ct) and the horizontal direction representing space. At each vertex, the conservation laws of charge, lepton number and baryon number must be obeyed. Feynman diagrams are a shorthand method for studying the probability of particle interactions. Some interactions drawn as Feynman diagrams are shown in Figure 2111.
Figure 2111 (Feynman diagrams; panel titles: Electromagnetic, Weak, between nucleons; line labels include e, νe, W, n, p and virtual photon, with colour labels blue and green)
photons that are exchanged during the interaction. The whole process begins when the two electrons are an infinite distance apart. They then move to their point of closest approach, where virtual photons are emitted and absorbed in a very short period of time during which the energy of the state becomes uncertain. The system is allowed to pass through this state, with some probability, without energy conservation being violated. The electrons then move an infinite distance apart.
A PARTICLE
The electromagnetic force has an infinite range and so it only needs a zero mass exchange particle, but the weak and strong forces have ranges of 10⁻¹⁸ m and 10⁻¹⁵ m respectively. To understand their behaviour we give them exchange particles of large mass. To create an exchange particle of large mass requires an amount of energy at least as large as $\Delta E = m_0 c^2$, where $m_0$ is the rest mass of the particle. According to the uncertainty principle, this state cannot last longer than:

$\Delta t \approx \frac{h}{4\pi \, \Delta E} = \frac{h}{4\pi m_0 c^2}$

Now speed = distance / time, so distance = speed × time. The furthest distance that the particle can move, called the range R, is equal to cΔt. So $R \approx c \, \Delta t$. Therefore, upon substitution we get:

$R \approx \frac{h}{4\pi m_0 c}$

From this relationship it can be seen that the range of an interaction is inversely proportional to the rest mass of the virtual exchange particle. It is now possible, by using this equation, to determine the mass of the virtual exchange particles for the strong and weak forces.
Example
Calculate the approximate masses of the virtual exchange particles for the strong and weak forces.
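An estimate of this kind can be sketched numerically by rearranging the range relation to $m_0 \approx \frac{h}{4\pi R c}$. The code below is a rough illustration, not a worked solution from the text: it assumes the ranges of 10⁻¹⁵ m and 10⁻¹⁸ m quoted for the strong and weak forces, uses rounded constants, and the function name is invented here.

```python
import math

H = 6.63e-34    # Planck constant in J s (rounded)
C = 3.00e8      # speed of light in m/s (rounded)
EV = 1.60e-19   # joules per electron-volt (rounded)


def exchange_particle_rest_energy_mev(range_m):
    """Rest energy (MeV) of an exchange particle, from R ~ h / (4 pi m0 c)."""
    m0 = H / (4 * math.pi * range_m * C)   # rest mass in kg
    return m0 * C**2 / EV / 1e6            # convert m0 c^2 from J to MeV


strong = exchange_particle_rest_energy_mev(1e-15)   # assumed strong range
weak = exchange_particle_rest_energy_mev(1e-18)     # assumed weak range
print(f"strong force exchange particle: about {strong:.0f} MeV/c^2")
print(f"weak force exchange particle:   about {weak / 1000:.0f} GeV/c^2")
```

These order-of-magnitude values can be compared with the pion mass and with the W and Z masses listed in Figure 2105 and Figure 2106.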
When particles and antiparticles collide, they can annihilate one another, releasing energy in the form of gamma rays. The gamma rays result from the conversion of matter to energy, according to Einstein's famous equation, E = mc². For example, an electron and positron annihilating one another at rest release an amount of energy equal to:

$E = mc^2 = (2 \times 9.11 \times 10^{-31}\ \mathrm{kg}) \times (3 \times 10^{8}\ \mathrm{m\,s^{-1}})^2 = 1.64 \times 10^{-13}\ \mathrm{J} \approx 1.02\ \mathrm{MeV}$

The energy released is higher if the particles annihilate in a collision where one or both contribute kinetic energy to the process. Particle-antiparticle pairs can also be produced when a gamma ray with sufficient energy passes close by a nucleus. The process is the reverse of annihilation and is called pair production. The law of conservation of mass-energy requires particles and antiparticles to be produced in pairs. Figure 2114 gives the Feynman diagrams for pair annihilation and pair production.

Figure 2114 (panels: electron-positron annihilation; electron-positron pair production)
Note that the backward arrow is an antiparticle, in this case a positron. Also remember the space-time concept of the Feynman diagram with time progressing upwards. Even though the arrow of the antiparticle is downwards, the antiparticle is still progressing upwards in time. Providing sufficient energy is available, particles other than photons can be produced.
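The annihilation energy, and the matching threshold for pair production, can be checked with a short calculation. This is a sketch with rounded constants; the threshold frequency follows from assuming that a single photon of energy hf must supply the rest energy of both particles, and the function name is invented for the illustration.

```python
H = 6.63e-34     # Planck constant in J s (rounded)
C = 3.00e8       # speed of light in m/s (rounded)
EV = 1.60e-19    # joules per electron-volt (rounded)
M_E = 9.11e-31   # electron (and positron) rest mass in kg


def pair_rest_energy_joule(mass_kg):
    """Rest energy of a particle-antiparticle pair, E = 2 m c^2, in joules."""
    return 2 * mass_kg * C**2


energy_j = pair_rest_energy_joule(M_E)
energy_mev = energy_j / EV / 1e6

# Minimum photon frequency for electron-positron pair production: h f = 2 m c^2.
threshold_frequency = energy_j / H

print(f"annihilation at rest releases about {energy_mev:.2f} MeV")
print(f"pair production threshold frequency is about {threshold_frequency:.2e} Hz")
```

The same function applies to any particle-antiparticle pair once the appropriate rest mass is supplied.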
A number of particle processes have already been introduced in the preceding sections. Let us look at some more examples as shown in Figure 2115.
Figure 2115 Some examples of Feynman diagrams (four diagrams, labelled (a) to (d))

In (a), a muon neutrino interacts with a photon exchange particle to become an electron. In (b), a strange quark emits an exchange particle and becomes an up quark. This is an example of a flavour change as it transforms into a member of another generation. In (c), a negative muon emits a W⁻ particle and becomes a muon neutrino. The W⁻ particle changes to a particle-antiparticle pair in the form of an electron and an electron antineutrino. In (d), a positive pion decays into a positive muon and a muon neutrino. The up quark and the antidown quark annihilate to produce a W⁺ particle. Note the backward direction of the antidown quark. The W⁺ then decays into a positive muon and a muon neutrino.

More complicated interactions can be demonstrated. For example, the electromagnetic interaction leads to photon-photon scattering (that is, scattering of light by light). The particles in the loop are electrons or positrons and this interaction is shown in Figure 2116.

Figure 2116 (photon-photon scattering; the external lines are labelled photon)

Exercise 21.1

1. Describe the difference between an elementary particle and a composite particle.

2. Neutrons and protons are classified as hadrons, whereas electrons are classified as leptons. Describe the concept underlying this type of classification.

3. (a) Describe the properties of the neutrino.
   (b) Explain the reasons that led Enrico Fermi to predict the existence of the neutrino.

4. The neutrino belongs to the same family as the
   A. neutron.
   B. proton.
   C. electron.
   D. baryon.

5. Hadrons differ from the other two families in that
   A. they mainly interact via the electromagnetic force.
   B. they mainly interact via the gravitational force.
   C. they mainly interact via the strong nuclear force.
   D. they mainly interact via the weak interaction force.

6. State the name and give the charge of each of the symbols given, and classify the particles as either quarks, baryons, mesons, leptons or gauge bosons:
   1. e+ 7. + 13. + 2. d 8. 0 14. Z 3. + 9. K 15. 4. e 10. g 16. 5. 6. + 11. 12. 17. - 18. c

7. Determine whether the following reactions can occur:
   (a) e- + p n + n
   (b) - + p - + +
   (c) p + n p + n + + K0
   (d) 0 + 0 ++

8. (a) State the Pauli exclusion principle.
   (b) Does the principle only apply to leptons?

9. State the spin number for each of the following particles:
   1. e+ 4. e 7. + 10. g 13. + 2. n 5. 8. 0 11. 14. Z 3. + 6. + 9. K 12. 15.

10. Antiparticles differ from their corresponding particle in their
   A. charge.
   B. rest mass.
   C. family.
   D. rest energy.

11. Define the terms antiparticle and antimass.

12. Describe how baryons differ from mesons.

13. Of the proton, neutron, and electron, state which would
   a. travel in a straight line through a magnetic field.
   b. travel in a curved path in a magnetic field.
   c. travel in a curved path of greatest radius in a magnetic field.

14. Describe the quarks that make up a proton and a neutron in terms of the type of quark and their overall charge.

15. (a) Describe what is meant by pair annihilation.
   (b) Calculate the energy of each photon produced when an electron and positron, initially at rest, are annihilated.
   (c) Describe the direction of travel of the photons.
   (d) Determine the threshold photon frequency for electron-positron pair production.

16. Pair production results when
   A. particles and antiparticles annihilate one another.
   B. particles of sufficient energy pass close to a nucleus.
   C. gamma rays of sufficient energy pass close to a nucleus.
   D. gamma rays and particles annihilate one another.

17. Calculate the energy released when a proton and antiproton annihilate one another
   a. at rest.
   b. in a collision where each has kinetic energy of 25 MeV (assume no energy is lost).

18. If a γ-ray is to produce a neutron-antineutron pair, determine the minimum energy, in MeV, that it must have.

19. Explain how photon exchange mediates the force between two electrons.

20. Predict the Feynman diagram particle processes as shown in the figure below.

(Figure: four Feynman diagrams, labelled (a) to (d))
Let us take the situation of a proton of mass M with energy E equal to 50Mc² colliding with another, stationary proton of mass M. For an incoming particle of mass m and total energy E striking a stationary particle of mass M, the total available energy is given by:

$E_a = \left(2Mc^2E + (Mc^2)^2 + (mc^2)^2\right)^{1/2}$

This can be written as:

$E_a^2 = 2Mc^2E + (Mc^2)^2 + (mc^2)^2$

The collision energy would then be close to:

$(100M^2c^4)^{1/2} = 10Mc^2$

This is only 20% of the original energy. So the collision energy is much less than E/2 if the initial energy is much bigger than mc².

However, if two particle beams travelling in opposite directions collide head on with each other, the total momentum of the combined system will be zero. Therefore, all the energy of the two particles becomes available as collision energy. This is a large amount of energy, and new particles that have mass greater than the original particles can be formed. For example, 2 protons can produce 2 protons, a K⁺ and a K⁻, provided the original protons are accelerated to a speed close to the speed of light.
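The difference between the two arrangements can be made concrete with a short calculation. The sketch below assumes the available-energy relation $E_a^2 = 2Mc^2E + (Mc^2)^2 + (mc^2)^2$ for a stationary target and, for the colliding-beam case, two identical beams with equal and opposite momenta so that the available energy is simply the sum of the two beam energies. The function names are invented for the illustration, and energies are expressed in units of the proton rest energy Mc².

```python
import math


def available_energy_fixed_target(e_beam, target_rest, beam_rest):
    """Available energy when a beam particle hits a stationary target.

    All arguments are energies in the same unit: e_beam is the total energy
    of the incoming particle, target_rest is M c^2 and beam_rest is m c^2.
    """
    return math.sqrt(2 * target_rest * e_beam + target_rest**2 + beam_rest**2)


def available_energy_colliding_beams(e_each):
    """Available energy for two identical beams meeting head on (assumed symmetric)."""
    return 2 * e_each


# Work in units of the proton rest energy M c^2, so M c^2 = m c^2 = 1.
E_BEAM = 50.0
fixed = available_energy_fixed_target(E_BEAM, 1.0, 1.0)
collider = available_energy_colliding_beams(E_BEAM)

print(f"stationary target: about {fixed:.1f} M c^2 available")
print(f"colliding beams:   {collider:.0f} M c^2 available")
```

For the 50Mc² proton of the example, the stationary-target value comes out close to the 10Mc² quoted in the text, whereas two such beams meeting head on make the full 100Mc² available.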
J.2.2
J.2.3
J.2.4
J.2.5
J.2.6
J.2.7
the proton. In effect, the electrons can diffract off quarks inside the protons. If the probe particle, such as an electron, can be considered one source and the particle being probed as another source, then according to the Rayleigh criterion the two sources will just be resolved if the central maximum of one diffraction pattern coincides with the first minimum of the other diffraction pattern. So as the energy of the particles increases in the accelerator, the momentum of the particles increases and therefore the wavelength decreases.
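How quickly the wavelength shrinks with energy can be seen from the de Broglie relation λ = h/p. The sketch below is an illustration under stated assumptions: it takes the momentum from the relativistic relation E² = (pc)² + (mc²)², uses the rounded value hc ≈ 1239.8 MeV fm, and the function name is invented here.

```python
import math

HC_MEV_FM = 1239.8          # h c in MeV fm (rounded)
ELECTRON_REST_MEV = 0.511   # electron rest energy in MeV


def de_broglie_wavelength_fm(total_energy_mev, rest_energy_mev=ELECTRON_REST_MEV):
    """de Broglie wavelength (fm) of a particle with the given total energy."""
    # Relativistic momentum times c, from E^2 = (pc)^2 + (m c^2)^2.
    pc = math.sqrt(total_energy_mev**2 - rest_energy_mev**2)
    return HC_MEV_FM / pc


for energy_gev in (1, 10, 50):
    wavelength = de_broglie_wavelength_fm(energy_gev * 1000.0)
    print(f"{energy_gev:>2} GeV electron: wavelength about {wavelength:.3g} fm")
```

Since a proton is of the order of 1 fm (10⁻¹⁵ m) across, electrons with energies of tens of GeV have wavelengths small enough to probe structure inside it.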
high frequency supply
to target
drift tubes
Figure 2118 Drift tube accelerator As the protons enter a drift tube, they travel with a constant velocity. At the gaps, the protons are accelerated and this is why the tubes have to be progessively longer in size. Knowing that W = qV = mv2, then making v the subject of the equation, we get: 2qV v = ____ m
____
By knowing the alternating potential difference and the frequency of the electric source, the time that a particle is in each tube can be calculated by taking half the period of the alternating potential difference. Therefore, the length of each drift tube can be calculated. The largest travelling-wave accelerator is at Stanford University in the USA. It is electron-positron collider that is 3.2 km long. It is capable of accelerating electrons to an energy of 50 GeV in an evacuated system of storage rings and the linear accelerator. The SLAC unit has been responsible for the the discovery of the tau lepton and the J/Psi meson as well as the up, down and strange quarks and antiquarks directly observed due to electron scattering. The electrons and positrons gain their energy by giving them a kick with microwave electromagnetic radiation. The basic components of the Stanford linear accelerator collider are: the electron gun the positron producer unit 2 storage rings the linear accelerator the klystrons the Stanford positron-electron accelerating ring (SPEAR) particle detectors inside the Stanford synchrotron research laboratory (SSRL)
Linear accelerators
A linear accelerator (LINAC) is a device that accelerates charged particles in a straight line inside a long evacuated tube. There are 2 types of linacs: drift tube accelerators, as at Berkeley, and travelling-wave accelerators, as at Stanford.
In the drift tube accelerator at Berkeley, protons are accelerated to 31 MeV through a series of hollow cylindrical electrodes of increasing length called drift tubes as shown in Figure 2118. These electrodes are connected alternately to opposite terminals of a high-frequency alternating potential difference produced by a magnetron.
At SLAC, the electrons are produced by an electron gun by thermionic emission and then their speed is regulated inside a klystron so that they arrive in bunches at the output cavity with the required microwave wavelength before they enter the accelerator. Some of the electrons are sent
CHAPTER 21 (OPTION J)
to a damping ring and some are sent down the linac, where they are diverted and collided with tungsten to produce positron bunches, which are then diverted by magnets back to a second damping ring. So why do we need damping rings in a linear accelerator? The reason is that the bunches of electrons and positrons tend to spread out if introduced directly into the linear accelerator. Therefore, the particles are sent to the small storage rings, where the bunches lose energy due to synchrotron radiation (X-rays). The bunches are re-accelerated by an electric field (F = qE) as they pass through a special cavity that sorts the bunches so that only those with the required direction are fed into the two linear accelerator lines. Upon returning to the linear accelerator, the bunches reach 99.9% of the speed of light within the first few metres. The two lines are made of 80 000 small copper discs, about 2 cm thick with a small aperture (hole), that are joined together over the 3.2 km journey. Microwave klystrons placed along the track produce currents in the copper that in turn produce oscillating electromagnetic fields, arranged so that all the microwaves are in phase. The electron and positron bunches must arrive at the copper discs at just the right moment to be accelerated by the electric field. The positrons, having opposite charge to the electrons, must arrive when the oscillating electric field points in the opposite direction to that experienced by the electrons, so that they are accelerated in the same direction as the electrons down the accelerator.
Cyclotrons
Linacs are used to make radioisotopes for medical diagnosis and therapy. The X-rays produced by synchrotron radiation can be used to study the structure of matter at the molecular level. A machine that is more commonly used for medical purposes is the cyclotron, which was invented by Ernest O. Lawrence in the 1930s. A schematic diagram of a cyclotron is shown in Figure 2120. The cyclotron is basically like a linac that has been wrapped into a tight spiral. It has the following important components:
a source of charged particles, usually protons, deuterons or helium nuclei
2 semi-circular boxes called dees
a uniform magnetic field
an evacuated chamber
a high-frequency alternating potential difference
Protons are injected into the first D-sector near the centre of the cyclotron and they move in a circular path according to $r = mv / qB$. If the proton takes time $t$ to move through this dee, it covers a semicircle of length $\pi r$, so $t = \pi r / v$. Therefore, from these equations:

$$t = \frac{\pi m}{qB}$$

This demonstrates that the time to travel around a dee is constant for a constant magnetic field intensity, and that the time is independent of the velocity and radius.
After reaching the end of the linear accelerator, the particle bunches are diverted into the SPEAR storage ring, which is 80 m in diameter. Dipole magnets separated by the required distance keep the particle bunches circulating in the ring with energies up to 4 GeV. Again the particles lose energy due to synchrotron radiation and they have to be re-accelerated by electric fields at certain cavity points. When they have the required energies, the electron and positron bunches are collided and the products are analysed by detectors at the Stanford synchrotron research laboratory. Figure 2119 shows a schematic layout of the SLAC.
Figure 2119 Schematic layout of SLAC (labels: Positron Source, Linac, Beam Switch Yard, End Station A, End Station B, Final Focus Test Beam, NLCTA, PEP II Low Energy Ring, PEP II High Energy Ring, SLD, SSRL, SPEAR)

Figure 2120 Schematic diagram of a cyclotron (label: Alternating P.D.)
There is a high-frequency alternating potential difference between the dees that accelerates the protons into the second D-sector. Since the velocity of a proton has increased, it will now travel in a path of larger radius because $r = mv / qB$. After it leaves the second D-sector, the polarity of the potential difference is reversed and it is again accelerated into the first D-sector. The protons follow a spiral path
PARTICLE PHYSICS
that can consist of hundreds of loops. Upon reaching the maximum radius loop, the protons are deflected by a charged plate and are incident on a target material with energies of up to 25 MeV. The energy gained, E, can be calculated for any radius according to:

$$E = \tfrac{1}{2}mv^2 = \tfrac{1}{2}m\left(\frac{qBr}{m}\right)^2$$

Since the potential difference must reverse twice each cycle, the period T of each cycle is given by:

$$T = \frac{2\pi m}{qB} \quad \text{and therefore} \quad f = \frac{qB}{2\pi m}$$

This frequency is known as the cyclotron frequency or the resonance frequency. It is typically about 10 MHz, which is in the radio frequency range. When the particles have energies of about 20 MeV, they become appreciably more massive according to the theory of relativity. This slows them down and they become unsynchronised with the alternating potential difference when they travel across the gaps between the dees. The solution is to use a synchrocyclotron, in which the oscillator (radio-frequency generator) that accelerates the particles around the dees is automatically adjusted to stay in step with the accelerated particles.
Figure 2121 (labels: linear accelerator, small synchrotron, accumulator ring, experimental hall)
A SYNCHROTRON
Two of the most famous synchrotrons are the large hadron collider at CERN and the proton-antiproton Tevatron collider at Fermilab just outside Batavia, Illinois. Schematic diagrams for CERN's electron-positron collider and Fermilab's Tevatron are shown in Figures 2121 and 2122. Synchrotrons are the most powerful members of the accelerator family, the main components being:
a source of particles and antiparticles
radio-frequency accelerating cavities
bending magnets
focusing magnets
detector
Figure 2122 (labels: proton synchrotron, protons, antiprotons, antiproton storage ring, Tevatron; scale marking of about 4 miles)
Firstly, beams of particles are injected into smaller synchrotron rings. They consist of two evacuated tubes of large radius through which batches of particles can travel, one for a batch of particles and one for a batch of antiparticles. The batches are accelerated by an alternating potential difference of radio frequency as they move from cavity to cavity. At CERN, there are 14 acceleration points. The tubes are surrounded by bending magnets that keep the particles moving in the same circular path, and focusing magnets that keep the particles travelling through the centre of the tube in a focused beam. As the beams gain momentum, the magnetic flux density has to be increased to keep them travelling in a circular path. When the particle beams have been accelerated to many million electron volts, they are injected into a much larger synchrotron ring, and computers are used to maintain the relationship between the magnetic field and the oscillator frequency of the electric field to compensate for the relativistic increase in mass. In a few seconds, the particles reach energies greater than 1 GeV and are ejected, either directly into experiments or toward targets that produce a variety of elementary particles upon collision with the accelerated particles.
Synchrotrons have a big radius because, according to the equation $r = mv / qB$, for a given magnetic flux density a larger radius allows a greater momentum and thus a greater kinetic energy of the particles. Remember that it is not the magnetic flux density that allows them to gain momentum but rather the alternating potential difference of radio frequency.
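The relation $r = mv/qB$ can be turned into a quick numerical estimate. The sketch below is illustrative only: it assumes the beam is ultra-relativistic so that the momentum is approximately $E/c$, and the function name `bending_field` and the rounded constants are choices made here, not part of the text.

```python
# Rough estimate of the bending field in a synchrotron.
# Assumption: ultra-relativistic beam, so momentum p ~ E / c.
E_CHARGE = 1.6e-19  # elementary charge in C (rounded, as in the text)
C_LIGHT = 3.0e8     # speed of light in m/s (rounded)


def bending_field(energy_eV, radius_m, charge=E_CHARGE):
    """Return B in tesla from r = p / (qB), using p ~ E / c."""
    momentum = energy_eV * E_CHARGE / C_LIGHT  # kg m/s
    return momentum / (charge * radius_m)


# A 350 GeV proton beam in a ring of radius 1000 m
print(f"B is about {bending_field(350e9, 1000.0):.2f} T")
```

The result scales as $1/r$, which is why doubling the ring radius halves the field needed for the same beam energy.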
Calculate the wavelength and comment on the resolution of a beam of 1.5 GeV electrons
Solution
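$$\lambda = \frac{h}{p} = \frac{h}{mv} \approx \frac{h}{mc} = \frac{hc}{mc^2}, \quad \text{where } mc^2 = 1.5\ \text{GeV}.$$

Therefore,

$$\lambda = \frac{6.63 \times 10^{-34}\ \text{J s} \times 3 \times 10^{8}\ \text{m s}^{-1}}{1.5 \times 10^{9}\ \text{eV} \times 1.6 \times 10^{-19}\ \text{J eV}^{-1}} = 8.3 \times 10^{-16}\ \text{m}$$

This is less than the size of the nucleus and thus the resolution should be good.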
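The same estimate can be checked numerically. This is a minimal sketch that assumes the ultra-relativistic approximation used in the solution (total energy taken as $pc$); the function name and the rounded constants are illustrative choices.

```python
# de Broglie wavelength of an ultra-relativistic particle: lambda = hc / E.
H_PLANCK = 6.63e-34  # J s (rounded, as in the worked solution)
C_LIGHT = 3.0e8      # m/s
EV = 1.6e-19         # J per eV


def wavelength_ultrarelativistic(energy_eV):
    """Return the wavelength in metres, assuming E ~ pc."""
    return H_PLANCK * C_LIGHT / (energy_eV * EV)


lam = wavelength_ultrarelativistic(1.5e9)
print(f"wavelength = {lam:.2e} m")
```

Because the wavelength is inversely proportional to the energy, a tenfold increase in beam energy resolves detail ten times smaller.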
Example
A drift tube accelerator has an alternating potential difference of 50 kV, with a frequency of 10 MHz applied to a row of tubular electrodes. Calculate the length of the first drift tube if electrons are to arrive at the right time to be accelerated in the next gap.
Solution
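Knowing that $W = qV = \tfrac{1}{2}mv^2$ and making $v$ the subject of the equation, we get:

$$v = \sqrt{\frac{2qV}{m}} = \sqrt{\frac{2 \times 1.6 \times 10^{-19}\ \text{C} \times 50 \times 10^{3}\ \text{V}}{9.11 \times 10^{-31}\ \text{kg}}} \approx 1.3 \times 10^{8}\ \text{m s}^{-1}$$

Figure 2123 (labels: cyclotron, linear accelerator, synchrotron)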
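$f = 10 \times 10^{6}$ Hz and so $T = 1 \times 10^{-7}$ s. The time for the polarity to change while the electron is inside a tube is $T/2 = 0.5 \times 10^{-7}$ s. Therefore, the length of the tube $= v \times T/2$, which with $v \approx 1.3 \times 10^{8}$ m s$^{-1}$ from the expression above is about 6.6 m.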
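The drift tube example above can be reproduced with a few lines of code. This is a sketch under the same non-relativistic assumption as the worked solution ($qV = \tfrac{1}{2}mv^2$); the function names are illustrative.

```python
import math

# Non-relativistic drift tube estimate, following the worked example.
E_CHARGE = 1.6e-19          # C
ELECTRON_MASS = 9.11e-31    # kg


def speed_after_gap(charge, potential_difference, mass):
    """Speed gained from rest across one gap, from qV = (1/2) m v^2."""
    return math.sqrt(2 * charge * potential_difference / mass)


def drift_tube_length(speed, frequency):
    """Length travelled in half a period of the alternating supply."""
    return speed * (1.0 / (2.0 * frequency))


v = speed_after_gap(E_CHARGE, 50e3, ELECTRON_MASS)
length = drift_tube_length(v, 10e6)
print(f"v = {v:.2e} m/s, tube length = {length:.1f} m")
```

At 50 kV the electron speed is already a large fraction of the speed of light, so this classical estimate is only a first approximation.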
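(b)
$mv^2/r = qvB$ and therefore $v = rqB / m$. So,

$$v = \frac{0.5\ \text{m} \times 1.6 \times 10^{-19}\ \text{C} \times 0.99\ \text{T}}{1.673 \times 10^{-27}\ \text{kg}} = 4.7 \times 10^{7}\ \text{m s}^{-1}$$

$$E_K = \tfrac{1}{2}mv^2 = 0.5 \times 1.673 \times 10^{-27}\ \text{kg} \times (4.7 \times 10^{7}\ \text{m s}^{-1})^2 = 1.84 \times 10^{-12}\ \text{J} \approx 11.5\ \text{MeV}$$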
Example
The synchrotron at Fermilab has a diameter of 2.0 km. Estimate the magnetic flux density needed to move a proton beam of 350 GeV.

Solution
$mv^2 / r = qvB$ where $r = 1000$ m. If we assume that the proton beam is travelling at approximately the speed of light, then $mv^2 / r \approx mc^2 / r$, where $mc^2 = 350$ GeV. Therefore,

$$B \approx \frac{mc^2}{qcr} = \frac{350 \times 10^{9}\ \text{eV} \times 1.6 \times 10^{-19}\ \text{J eV}^{-1}}{1.6 \times 10^{-19}\ \text{C} \times 3 \times 10^{8}\ \text{m s}^{-1} \times 1000\ \text{m}} \approx 1.2\ \text{T}$$

Particle detectors
J.2.8 Outline the structure and operation of the bubble chamber, the photomultiplier and the wire chamber.
J.2.9 Outline international aspects of research into high-energy particle physics.
J.2.10 Discuss the economic and ethical implications of high-energy particle physics research.
IBO 2007
Example
A cyclotron is operated at an oscillator frequency of 15 MHz and has a dee radius of 0.50 m.
(a) Calculate the magnetic flux density needed to accelerate protons in the cyclotron.
(b) Determine the kinetic energy of the protons in MeV.
Solution
(a)
Since the potential difference must reverse twice each cycle, the period T of each cycle is given by $T = 2\pi m / qB$, and therefore $f = qB / 2\pi m$ and $B = 2\pi mf / q$. So,

$$B = \frac{2\pi \times 1.673 \times 10^{-27}\ \text{kg} \times 15 \times 10^{6}\ \text{s}^{-1}}{1.6 \times 10^{-19}\ \text{C}} = 0.99\ \text{T}$$
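Both parts of this cyclotron example can be checked together. The sketch below assumes non-relativistic protons and uses the same rounded constants as the solution; the function names are illustrative. Because it does not round B to 0.99 T before continuing, the kinetic energy comes out slightly higher than the hand calculation.

```python
import math

# Cyclotron resonance: f = qB / (2 pi m), and v = r q B / m at the dee radius.
E_CHARGE = 1.6e-19        # C
PROTON_MASS = 1.673e-27   # kg


def resonance_field(frequency, mass=PROTON_MASS, charge=E_CHARGE):
    """Magnetic flux density (T) for a given oscillator frequency."""
    return 2 * math.pi * mass * frequency / charge


def exit_kinetic_energy_MeV(radius, field, mass=PROTON_MASS, charge=E_CHARGE):
    """Non-relativistic kinetic energy at the given orbit radius, in MeV."""
    speed = radius * charge * field / mass
    energy_joule = 0.5 * mass * speed ** 2
    return energy_joule / (charge * 1e6)  # 1 MeV = 1.6e-13 J


B = resonance_field(15e6)
print(f"B = {B:.2f} T, E_K = {exit_kinetic_energy_MeV(0.50, B):.1f} MeV")
```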
period provided no disturbance happens in the liquid. But if a disturbance occurs, such as charged particles moving through the liquid, bubbles form along the paths of the charged particles as they ionise the molecules of the liquid, and these paths can be photographed. By repeating cycles of lowering and rapidly increasing the pressure, many paths of charged particles can be obtained. Glaser chose diethyl ether as the liquid because of its low vapour pressure but this was replaced with liquid hydrogen at the suggestion of L.W. Alvarez. Bubble chambers are easy to handle and the particle's path can be viewed from all angles to give a three-dimensional picture. However, the data acquisition is slow and the bubble chamber has been phased out as a particle detector.

several more electrons. These are then accelerated to the next dynode and so on. An avalanche of electrons builds up down the tube, with perhaps $10^{6}$ electrons arriving at the anode 1 nanosecond later. The voltage pulse at the final resistor can be counted by an electronic system. The height of each pulse carries information about the number of ion-pairs created in the scintillator by the emission. The received signals from the photomultiplier are measured and digitised, and the information is transmitted to computers that reconstruct the events of a collision.
PHOTOMULTIPLIERS
The main instrument used for studying radioactivity and X-rays at the beginning of the 1900s was the electroscope. A more convenient and accurate instrument is the scintillator or scintillation counter. This was the method used by Geiger and Marsden and Rutherford to count alpha-particles as they were scattered from thin gold foil. The alpha-particles emitted by the radium source have kinetic energy which can be converted to tiny flashes of light called scintillations. A scintillation counter consists of an appropriate phosphor such as zinc sulfide combined with a photomultiplier tube as shown in Figure 2124. Not only alpha-particles but also the weak flashes of beta-particles and gamma radiation can be detected.
Figure 2125 Schematic diagram of the Geiger-Müller tube (labels: mica window, wire anode at +400 V, metal cathode, pulse output)

The tube is filled with argon gas. The mica window allows various types of radiation to enter. A potential difference of around 400 V is maintained between the anode and the cathode and, because the anode is very thin, an intense electric field is created near it. When radiation enters the tube and produces a few ions, the tube dramatically increases the number of ions to produce a pulse of charge. As the electrons accelerate towards the wire anode, they gain sufficient energy to produce further ion-pairs. The electrons released by this secondary ionisation create more ion-pairs in an electron avalanche. This process is known as gas amplification and as many as $10^{8}$ electrons can be produced in a single ionising event. All the electrons are absorbed by the wire to produce a large pulse of anode current, and the resistance of the gas is said to have broken down. The positive ions, being more massive, are slower to move towards the cathode, and after a short time there are so many positive ions near the anode that the electric field around the wire is cancelled out, which prevents more ionisation. The electron avalanche and the associated anode current therefore stop.
Figure 2124 A photomultiplier unit (labels: scintillator, photoelectron, vacuum, anode, signal; voltage markings 0V, 2V, 4V, 10V)

It produces an electrical pulse for each ionising emission it detects. Each flash of light in the scintillator can knock an electron out of the surface of the photocathode. This is at negative voltage so the electron is accelerated towards the first dynode, gaining enough energy to knock out
The wire chamber differs from the Geiger-Müller tube in that it has planes of thin wires spaced millimetres apart inside the chamber. The data collected give the arrival time of a particle as well as its track. In many particle physics experiments, beams of particles are maintained by focusing and bending magnets in what is known as a beam line. In this way, particles of interest can be kept circulating until needed. Wire chambers are placed along the beam line to identify individual particles and their momentum.
As already mentioned in the introduction to this chapter, the number of employees at the facilities mentioned above is quite high and researchers and engineers from all countries are selected for their expertise to work at these facilities. CERN and DESY (Germany) are funded by the European Economic Community (EEC). The various facilities share the information they gather in a joint collaboration. This is necessary because some operate at the lepton level and others at the hadron level. The researchers would love to be the first to make predictions and then track down the remaining enigmas of the grand unified theory (GUT) such as the Higgs boson, and there has been some fierce rivalry in the early days of particle physics. However, the cooperation needed to solve the mysteries of cosmology and particles has continued throughout the last century or so.
Exercise 21.2

1. Calculate the total energy and wavelength of a proton that has kinetic energy of 30 GeV.
2. Calculate the strength of a magnetic field used in a cyclotron in which a deuteron makes $1.5 \times 10^{7}$ revolutions per second.
3. Calculate the wavelength and comment on the resolution of a beam of 3.2 GeV protons.
4. A drift tube accelerator has an alternating potential difference of 30 kV, with a frequency of 15 MHz applied to a row of tubular electrodes. Calculate the length of the first drift tube if electrons are to arrive at the right time to be accelerated in the next gap.
5. The synchrotron at Fermilab has a diameter of 2.0 km. Estimate the magnetic field intensity needed to move a proton beam of 300 GeV.
6. A cyclotron is operated at an oscillator frequency of 15 MHz and has a dee radius of 0.50 m.
(a) Calculate the magnetic flux density needed to accelerate deuterons in the cyclotron.
(b) Determine the kinetic energy of the protons in MeV.
7. Estimate the maximum resolving power that is attainable using 370 GeV protons.
8. The hadron track at CERN has a diameter of 8.5 km. Estimate the time it would take a high-energy proton to make one revolution in the collider.
9. The following figure is a sketch of the path of the pair production of a proton and antiproton. There is a magnetic field pointing out of the page.
[Figure: two tracks labelled A and B]
Explain which of the tracks is due to the antiproton, A or B. Deduce whether the particles have the same energy. Calculate the minimum energy required for the pair production in GeV.
10. An ion gun in an evacuated container consists of 2 parallel conducting plates separated by a distance of 2.0 cm. A potential difference of 50 kV is applied across the plates. Protons enter between the plates and drift with negligible speed into a region between the plates. The negative plate has a small hole where protons can be ejected.
(a) Calculate the electric field strength between the plates.
(b) Determine the energy of the protons that are ejected through the hole of the negative plate.
(c) Deduce that the speed of the protons ejected through the hole is approximately $3.1 \times 10^{6}$ m s$^{-1}$.
(d) Describe why the apparatus is evacuated.
11. At CERN, protons are injected into the 200 m diameter, 28 GeV synchrotron ring with an energy of 50 MeV. The tube is filled with protons which are injected with a proton current of 100 mA for 6 s. There are 14 acceleration points spaced evenly around the ring with a potential difference between the electrodes of each accelerator of 4 kV. The final energy of the proton is 28 GeV. If relativistic effects are ignored:
(a) calculate the speed of the proton at injection
(b) determine the time it takes to go around the ring at this speed
(c) calculate the momentum of the proton at injection
(d) determine the number of protons that were injected
(e) deduce by how much the energy of a proton increases in each revolution of the synchrotron
(f) estimate the number of times a proton must go around the accelerator to obtain its final maximum energy.

J.3.3
J.3.4
J.3.5
J.3.6
J.3.10 Discuss the interaction that binds nucleons in terms of the colour force between quarks.
IBO 2007
The three quarks initially proposed were named Up (symbol u), Down (symbol d), and Strange (symbol s). Up and down have charges of $+\tfrac{2}{3}e$ and $-\tfrac{1}{3}e$ respectively, and zero strangeness, while strange has a charge of $-\tfrac{1}{3}e$ and a strangeness of $-1$. Baryons are built by grouping quarks in threes. For example, the proton is uud, while the neutron is ddu. Note that these groupings support the law of conservation of charge. Once the idea of quarks had been proposed, scientists began to hunt for evidence of their existence. Physicists at the Stanford Linear Accelerator Centre used electron beams to bombard liquid hydrogen (protons), and soon discovered tiny, point-like charge concentrations inside the proton. These charge points were determined to have the predicted charges, verifying that quarks do indeed exist. Scientists then proceeded to try to make quarks, using colliding beams of electrons and positrons, such as the ones used in the LEP Collider at the European Laboratory for Particle Physics. In 1974, a new quark was discovered that had a charge of $+\tfrac{2}{3}e$. The new quark was named charm (symbol c) because of its magical ability to solve certain theoretical problems. Another new quark, named bottom or beauty (symbol b), was discovered in 1977. Bottom carries a charge of $-\tfrac{1}{3}e$. Because quarks must occur in pairs, a sixth quark was proposed, with the suggested names top or truth (symbol t) and a predicted charge of $+\tfrac{2}{3}e$. This was confirmed in 1995. Some properties of quarks are given in Figure 2127.

| Quark | Spin | Charge Q (in units of e) | Baryon number B | Strangeness S | Charm C | Bottomness B | Topness T |
|---|---|---|---|---|---|---|---|
| u | 1/2 | +2/3 | 1/3 | 0 | 0 | 0 | 0 |
| d | 1/2 | -1/3 | 1/3 | 0 | 0 | 0 | 0 |
| s | 1/2 | -1/3 | 1/3 | -1 | 0 | 0 | 0 |
| c | 1/2 | +2/3 | 1/3 | 0 | +1 | 0 | 0 |
| b | 1/2 | -1/3 | 1/3 | 0 | 0 | -1 | 0 |
| t | 1/2 | +2/3 | 1/3 | 0 | 0 | 0 | +1 |

Figure 2127 Properties of quarks

The corresponding symbols for the antiquarks are: Up, $\bar{u}$; Down, $\bar{d}$; Strange, $\bar{s}$; Charm, $\bar{c}$; Bottom, $\bar{b}$; Top, $\bar{t}$. Their properties are the same as those for quarks except that they have the opposite signs. For example, a strange quark has a charge of $-\tfrac{1}{3}e$, a baryon number of $\tfrac{1}{3}$ and a strangeness of $-1$, meaning that the corresponding antiquark has a charge of $+\tfrac{1}{3}e$, a baryon number of $-\tfrac{1}{3}$ and a strangeness of $+1$.
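The quark properties above can be added up to give the charge, baryon number and strangeness of any hadron. The sketch below is illustrative: the `"anti-"` prefix for antiquarks and the function name `hadron_numbers` are conventions chosen here, and exact fractions are used so that the thirds add up without rounding.

```python
from fractions import Fraction

# (charge in units of e, baryon number, strangeness) for each quark flavour
QUARKS = {
    "u": (Fraction(2, 3), Fraction(1, 3), 0),
    "d": (Fraction(-1, 3), Fraction(1, 3), 0),
    "s": (Fraction(-1, 3), Fraction(1, 3), -1),
    "c": (Fraction(2, 3), Fraction(1, 3), 0),
    "b": (Fraction(-1, 3), Fraction(1, 3), 0),
    "t": (Fraction(2, 3), Fraction(1, 3), 0),
}


def hadron_numbers(content):
    """Sum (Q, B, S) over a quark list such as ["u", "u", "d"].

    An antiquark is written with an "anti-" prefix, e.g. "anti-s",
    and contributes the opposite sign of every quantum number.
    """
    charge = baryon = strangeness = Fraction(0)
    for name in content:
        sign = 1
        if name.startswith("anti-"):
            sign = -1
            name = name[len("anti-"):]
        q, b, s = QUARKS[name]
        charge += sign * q
        baryon += sign * b
        strangeness += sign * s
    return charge, baryon, strangeness


print("proton uud:", hadron_numbers(["u", "u", "d"]))
print("K+ (u, anti-s):", hadron_numbers(["u", "anti-s"]))
```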
If we apply a couple of simple rules then we can decide whether baryon number is conserved. The rules are:
the total number of baryons must remain constant
all baryons are assigned a baryon number of 1 (for example p and n)
all non-baryons (leptons and mesons) are assigned a baryon number of 0 (for example e)
an antiparticle has the opposite baryon number (-1) from its particle.
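| Particle | Quark content | Q | B | S |
|---|---|---|---|---|
| proton | $uud$ | +1 | +1 | 0 |
| anti-proton | $\bar{u}\bar{u}\bar{d}$ | -1 | -1 | 0 |
| neutron | $ddu$ | 0 | +1 | 0 |
| anti-neutron | $\bar{d}\bar{d}\bar{u}$ | 0 | -1 | 0 |
| lambda | $uds$ | 0 | +1 | -1 |
| anti-lambda | $\bar{u}\bar{d}\bar{s}$ | 0 | -1 | +1 |
| sigma-zero | $uds$ | 0 | +1 | -1 |
| anti-sigma zero | $\bar{u}\bar{d}\bar{s}$ | 0 | -1 | +1 |
| sigma-plus | $uus$ | +1 | +1 | -1 |
| anti-sigma minus | $\bar{u}\bar{u}\bar{s}$ | -1 | -1 | +1 |
| sigma-minus | $dds$ | -1 | +1 | -1 |
| anti-sigma plus | $\bar{d}\bar{d}\bar{s}$ | +1 | -1 | +1 |
| xi-minus | $dss$ | -1 | +1 | -2 |
| anti-xi plus | $\bar{d}\bar{s}\bar{s}$ | +1 | -1 | +2 |
| xi-zero | $uss$ | 0 | +1 | -2 |
| anti-xi zero | $\bar{u}\bar{s}\bar{s}$ | 0 | -1 | +2 |
| omega minus | $sss$ (spin 3/2) | -1 | +1 | -3 |
| omega plus | $\bar{s}\bar{s}\bar{s}$ (spin 3/2) | +1 | -1 | +3 |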
If we take the original reaction and look at the baryon numbers we have: p + p p + + Q1+1=1+1 B1+11+0 The reaction does not occur because the baryon number is not conserved. Now we will look at an earlier equation: - + p - + + Q -1 + 1 = -1 + 1 B0+1=1+0 charge is conserved. baryon number is conserved.
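Figure 2128 Baryons and antibaryons listed: neutron $n$, anti-neutron $\bar{n}$, lambda $\Lambda^0$, anti-lambda $\bar{\Lambda}^0$, sigma-zero $\Sigma^0$, anti-sigma zero $\bar{\Sigma}^0$, sigma-plus $\Sigma^+$, anti-sigma minus $\bar{\Sigma}^-$, sigma-minus $\Sigma^-$, anti-sigma plus $\bar{\Sigma}^+$, xi-minus $\Xi^-$, anti-xi plus $\bar{\Xi}^+$, xi-zero $\Xi^0$, anti-xi zero $\bar{\Xi}^0$, omega minus $\Omega^-$, omega plus $\bar{\Omega}^+$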
The reaction does occur because electric charge and baryon number are both conserved. Finally, let us take an example that examines the 3 conservation laws introduced so far:
$n \rightarrow \pi^+ + \pi^-$
energy: $1 > \tfrac{1}{7} + \tfrac{1}{7}$, the mass is greater in the initial particle
Q: $0 = 1 - 1$, charge is conserved

As can be seen, the $\Lambda^0$ baryon has an up and a down quark spinning in opposite directions and the strange quark spinning in the same direction as the up quark. The total spin is $\tfrac{1}{2} + \tfrac{1}{2} - \tfrac{1}{2} = \tfrac{1}{2}$. For the $\Sigma^0$ baryon, however, the up and down quarks are spinning in the same direction and the strange quark spins in the opposite direction to the up and down quarks. The total spin is again $\tfrac{1}{2} + \tfrac{1}{2} - \tfrac{1}{2} = \tfrac{1}{2}$.
Spin is an important characteristic of some baryons that contain three quarks. For example, the baryons $\Lambda^0$ and $\Sigma^0$ both have the same three quarks uds, as shown in Figure 2129.

Figure 2129 (labels: $\Lambda^0$, $\Sigma^0$)
The quantum number rules for particles with a spin of 1/2, 3/2, 5/2, ... require that they must be distinguishable from each other in at least one quantum number. These are the particles that are the building blocks of matter. The particles that have a spin number of 0, 1, 2, ... can all have the same quantum numbers, including position, and these are the particles that are associated with forces.
of the colour force which would make the colour force a fundamental force. Just as electrically-charged particles exchange photons in an electromagnetic interaction, coloured particles exchange gluons during strong interactions, causing the particles to be glued together. Unlike the photon, which carries no charge, gluons carry combinations of a colour and an anti-colour and therefore change the colour of the quarks when they pass from one to another. These colours can be mixed, such as a red/anti-blue combination. The changing of colour is different to what happens in other particle exchanges, where there is no change to the particles doing the exchanging. This feature of gluons is important because when quarks move apart, the strong force actually gets stronger, rather than weaker. This accounts for the fact that it is highly unlikely that an individual quark will be observed, because gluons are normally produced in quark-antiquark pairs. There are 8 combinations of colour/anti-colour for gluons.

However, this reaction does not take place. Here are some rules for strangeness:
the strangeness of leptons is zero
protons, neutrons and pions are assigned a strangeness of 0
$K^+$ and $K^0$ mesons are assigned a strangeness of +1
$K^-$ mesons, and $\Lambda$ and $\Sigma$ baryons, are assigned a strangeness of -1
$\Xi$ baryons are assigned a strangeness of -2
$\Omega$ baryons are assigned a strangeness of -3
all antiparticles have the opposite strangeness to their particles.
So let us assign strangeness to the above example: +p-++ Q -1 + 1 = -1 + 1 B 0+1=0+1 charge is conserved. baryon number is conserved.
S: $0 + 0 \neq 0 + 1$, strangeness is not conserved. Therefore, the reaction does not occur. Now let us examine a further nuclear reaction.

$p + p \rightarrow p + p + K^+ + K^-$
Q: $1 + 1 = 1 + 1 + 1 + (-1)$, charge is conserved.
B: $1 + 1 = 1 + 1 + 0 + 0$, baryon number is conserved and the number of baryons is the same.
S: $0 + 0 = 0 + 0 + 1 + (-1)$, strangeness is conserved.

Therefore, this reaction can occur. Now let us examine the quark content of the particles that exhibit strangeness. The $K^+$ meson consists of an up quark and an anti-strange quark (strangeness of +1), and the $K^-$ consists of a strange quark and an anti-up quark (strangeness of -1). Another example is the $\Lambda^0$ baryon, which contains uds quarks and has a strangeness of -1. Finally, the $\Xi^-$ contains dss quarks and is assigned a strangeness of -2. Therefore, it can be seen that the strangeness is -1 for each strange quark present and +1 for each anti-strange quark present.
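The bookkeeping for charge, baryon number and strangeness can be automated. The following is a minimal sketch: the particle names, the small lookup table and the function `check_reaction` are illustrative choices, and the table holds only a handful of particles with their standard quantum numbers.

```python
# (charge Q, baryon number B, strangeness S) for a few particles
PARTICLES = {
    "p": (1, 1, 0),
    "n": (0, 1, 0),
    "pi+": (1, 0, 0),
    "pi-": (-1, 0, 0),
    "K+": (1, 0, 1),
    "K-": (-1, 0, -1),
    "Lambda0": (0, 1, -1),
    "Sigma+": (1, 1, -1),
}


def totals(names):
    """Add up (Q, B, S) for a list of particle names."""
    return tuple(sum(PARTICLES[name][i] for name in names) for i in range(3))


def check_reaction(reactants, products):
    """Report which of Q, B and S balance between the two sides."""
    before, after = totals(reactants), totals(products)
    return {label: before[i] == after[i] for i, label in enumerate("QBS")}


# p + p -> p + p + K+ + K-  (all three quantities balance)
print(check_reaction(["p", "p"], ["p", "p", "K+", "K-"]))
# p + p -> p + pi+  (baryon number does not balance)
print(check_reaction(["p", "p"], ["p", "pi+"]))
```

A reaction that fails the charge or baryon number test cannot occur; a strangeness mismatch rules it out as a strong interaction.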
Figure 2130 The legend for the elementary particles (legend entries shown: Name, RGB, ~ 310)

The standard model is the presently accepted theory describing the strong, electromagnetic and weak interactions of quarks and leptons.
J.4.1 State the three-family structure of quarks and leptons in the standard model.
J.4.2 State the lepton number of the leptons in each family.
J.4.3 Solve problems involving conservation laws in particle reactions.
J.4.4 Evaluate the significance of the Higgs particle (boson).
IBO 2007

The lepton number is given the symbol L. As already mentioned, leptons react via the weak and electromagnetic forces but not the strong force. They have partner neutrinos and they are split into 3 generations, with lepton numbers $L_e$, $L_\mu$ and $L_\tau$. Neutrinos must accompany their partner leptons. But how does one know if a neutrino or an antineutrino is involved in a reaction or a decay? Basically, the rules are:

If the lepton and neutrino are on the same side of an equation:
electrons, negative muons and negative tau must be accompanied by an antineutrino
positrons, positive muons and positive tau must be accompanied by a neutrino.
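| Generation | Lepton | Charge | Rest mass (MeV c$^{-2}$) |
|---|---|---|---|
| II | muon | -1 | 106.6 |
| II | muon neutrino | 0 | ~0 |

| Generation | Antilepton | Charge | Rest mass (MeV c$^{-2}$) |
|---|---|---|---|
| II | antimuon | +1 | 106.6 |
| II | muon antineutrino | 0 | ~0 |

Figure 2231 (a) Leptons and Antileptons

| Generation | Quark | Charge | Symbol | Colour | Rest mass (MeV c$^{-2}$) |
|---|---|---|---|---|---|
| II | Charm | +2/3 | $c$ | RGB | 1500 |
| II | Strange | -1/3 | $s$ | RGB | 505 |
| III | Top | +2/3 | $t$ | RGB | > 22500 |
| III | Bottom | -1/3 | $b$ | RGB | ~ 5000 |

| Generation | Antiquark | Charge | Symbol | Colour | Rest mass (MeV c$^{-2}$) |
|---|---|---|---|---|---|
| II | Anti Charm | -2/3 | $\bar{c}$ | CMY | 1500 |
| II | Anti Strange | +1/3 | $\bar{s}$ | CMY | 505 |
| III | Anti Top | -2/3 | $\bar{t}$ | CMY | > 22500 |
| III | Anti Bottom | +1/3 | $\bar{b}$ | CMY | ~ 5000 |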
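Figure 2231 (b) Quarks and Antiquarks

| Force | Exchange particle | Rest mass (GeV c$^{-2}$) | Charge | Spin |
|---|---|---|---|---|
| strong | gluons $g$ | 0 | 0 | 1 |
| weak | $W^+$ | 81 | +1 | 1 |
| weak | $W^-$ | 81 | -1 | 1 |
| weak | $Z^0$ | 93 | 0 | 1 |
| | Higgs boson (hypothetical) | > 83 | 0 | 0 |
| electromagnetic | photons | 0 | 0 | 1 |
| gravity | graviton | 0 | 0 | 2 |

Relative strengths listed: electromagnetic $10^{-2}$, gravity $10^{-30}$, and one further entry of $10^{-19}$ whose row is not recoverable. Range listed: infinite for both the electromagnetic force and gravity.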
If the lepton and neutrino are on opposite sides of an equation:
electrons, negative muons and negative tau must be accompanied by a neutrino
positrons, positive muons and positive tau must be accompanied by an antineutrino.

Again, this can be understood in terms of a new conservation rule: the conservation of lepton number. The rules for lepton number conservation are:

the total number in each generation must always remain the same
the electron and electron-neutrino are assigned a lepton electron number of 1
the negative muon and muon-neutrino are assigned a lepton muon number of 1
the negative tau and tau-neutrino are assigned a lepton tau number of 1
all other particles are assigned a lepton number of 0
an antiparticle has the opposite lepton number (-1) from its particle.
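The lepton number rules above can be applied mechanically. The sketch below is illustrative: the particle names, the lookup table and the function names are choices made here, and any particle not in the table (such as a proton or neutron) is treated as having lepton number 0 in every generation.

```python
# (L_e, L_mu, L_tau) for each lepton; antiparticles carry the opposite sign.
LEPTON_NUMBERS = {
    "e-": (1, 0, 0), "nu_e": (1, 0, 0),
    "e+": (-1, 0, 0), "anti-nu_e": (-1, 0, 0),
    "mu-": (0, 1, 0), "nu_mu": (0, 1, 0),
    "mu+": (0, -1, 0), "anti-nu_mu": (0, -1, 0),
    "tau-": (0, 0, 1), "nu_tau": (0, 0, 1),
    "tau+": (0, 0, -1), "anti-nu_tau": (0, 0, -1),
}


def lepton_totals(names):
    """Sum (L_e, L_mu, L_tau); unlisted particles count as (0, 0, 0)."""
    totals = [0, 0, 0]
    for name in names:
        for i, value in enumerate(LEPTON_NUMBERS.get(name, (0, 0, 0))):
            totals[i] += value
    return tuple(totals)


def lepton_number_conserved(before, after):
    """True only if every generation's lepton number balances."""
    return lepton_totals(before) == lepton_totals(after)


# Muon decay: mu- -> e- + anti-nu_e + nu_mu
print(lepton_number_conserved(["mu-"], ["e-", "anti-nu_e", "nu_mu"]))
# Beta decay: n -> p + e- + anti-nu_e
print(lepton_number_conserved(["n"], ["p", "e-", "anti-nu_e"]))
```

Both decays illustrate the same-side rule: the electron appears together with an antineutrino of its own generation.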
For example, consider the following reaction:
Solution
e + e e + Le 1 + 0 = 1 + 0 first generation lepton number is conserved second generation lepton number is conserved. (b) (a) Q -1 +1 + (0). Charge is not conserved. Le 0 0 + 1. Lepton number is not conserved. L 0 + 1 = 0 + 1 L 0 1 + 0. Muon number is not conserved. Q 0 = 0 + 0. Charge is conserved. B 1 = 1 + 0 Baryon number is conserved and the number of baryons is the same. (c) Le 0 + 1 0 + 0 first generation lepton number is not conserved (d) L 0 + 0 0 + 1 second generation lepton number is not conserved. (e) Therefore the reaction cannot take place. Although charge is conserved, the lepton generation number is not conserved. Although charge is conserved, baryon number is not conserved. This reaction occurs. Charge is conserved and there is a pair of strange hadrons. Q1+1=1+1+0 B1+1=1+1+0
-
Therefore the reaction can take place. Now let us look at a second equation: n + e p + -
S 0 + 0 = 0 + -1 + 1
BOSON
Scientists have gathered a lot of evidence about the structure and constituents of the atom but, as yet, it is not known how particles get their masses. Furthermore, particle theorists have wondered why the W and Z bosons have large masses rather than being massless like the photon. Peter Higgs proposed that particles can acquire mass as a result of interactions with a hypothetical extra electroweak force field called the Higgs field. Higgs reasoned that if we start out with a particle H that has mass but no other conserved characteristics and bring it close to another particle, say a proton, then H can interact with the proton because there is a force between them. If H and the proton interact, then H must be a boson. When particles are created and annihilated in accelerators, particles are said to arise from fields that are spread out in space and time. By using the mathematics of quantum mechanics, Higgs found that if H was in the lowest energy state of its field (empty space), the field would not be zero. Therefore, the
Example
Indicate the validity of each of the following decay processes. State the reason for your choice. (a) (b) (c) (d) (e) - + + e 0 0 + 0 - e- + e + p+pp+p+n p+pp+ +K
+ 0
OPTION
CHAPTER 21 (OPTION J)
Higgs particle (boson) that interacts with other particles can gain mass as a result of the interaction. The search is now on to find the Higgs particle, and it is hoped that once the hadron collider at CERN is commissioned at the end of 2007, this elusive boson will be found. It is important to find it because it plays an important role in the unification of different forces. If it is not found, particle physics is back to the drawing board, and a new theory will have to be proposed to replace the Grand Unifying Theory.
5. No particles of fractional charge have been confirmed thus far. Does this mean that quarks do not exist? Explain.
Explain why it has been impossible thus far to detect single quarks.
List 3 quarks that will produce a baryon with a charge of: (a) +1 (b) -1 (c) 0
8. 1. Given the following particles, and their quark composition, determine the charge of each particle: a. b. c. d. 2. ; uds ; uus + ; ud K-; us
9. The weak force is (a) the only force affecting neutrons (b) responsible for radioactive decay (c) the only force affecting protons (d) responsible for stability of the nucleus
Strangeness must be conserved in: (a) weak interactions (b) electromagnetic interactions only (c) strong interactions only (d) both strong and electromagnetic interactions.
By using the conservation rules, determine the missing particle in the following reactions. (a) (b) (c) (d) (e) K+ + + ... - + p n + ... - + p + ... 0 + e + ... K+ + ...
6.
7.
Exercise
21.4
Determine whether the following nuclear reactions will occur. (a) (b) (c) (d) (e) (f) (g) (h) (i) p + p p + K+ + 0 n + e - + p p + n energy + + e- energy p + e e+ + 0 + K0 p + + + n p + e e- + + + K+ K+ + + 0 0 p + K-
3.
10.
State the quark content of the following particles and name the particle/antiparticle pairs. (a) (b) (c) (d) (e) 0 + 0 K+
4.
State the name of the force carrier in the Feynman diagram shown in the Figure below. Explain why you have chosen this force carrier.
11.
evirtual photon
Identify the equation for the decay of a neutron by beta-decay. Explain this decay using a Feynman diagram.
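For the questions on quark composition, the charge of a hadron is simply the sum of the charges of its quarks. The sketch below assumes the standard quark charges (+2/3 for u, -1/3 for d and s, in units of e); the `~` prefix for an antiquark and the function name are hypothetical notation chosen for this example.

```python
from fractions import Fraction

# Quark charges in units of the elementary charge e.
QUARK_CHARGE = {
    "u": Fraction(2, 3),
    "d": Fraction(-1, 3),
    "s": Fraction(-1, 3),
}


def hadron_charge(quarks):
    """Total charge of a hadron; a leading '~' marks an antiquark."""
    total = Fraction(0)
    for q in quarks:
        if q.startswith("~"):
            total -= QUARK_CHARGE[q[1:]]  # antiquark: opposite charge
        else:
            total += QUARK_CHARGE[q]
    return total


print(hadron_charge(["u", "u", "d"]))  # proton: 1
print(hadron_charge(["u", "d", "d"]))  # neutron: 0
print(hadron_charge(["u", "~d"]))      # positive pion: 1
```

Exact fractions are used so that the thirds add up to whole numbers without rounding error.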
e-
PARTICLE PHYSICS
J.5.2
J.5.3
approach zero for very close confinement. On the other hand, if the quarks move apart, the force of attraction becomes stronger. This discovery is known as asymptotic freedom, and it established quantum chromodynamics (QCD) as the correct theory of the strong force. The calculations done by Gross, Wilczek and Politzer showed that quarks are held together very strongly at distances comparable to the size of a proton, which explains the concept of quark confinement mentioned previously in section J3. Quark confinement can be visualized using what is known as the bag model, shown in Figure 2134.
J.5.4 J.5.5
J.5.1 J.5.2
DEEP INELASTIC SCATTERING
Electrons can be used in collisions to identify protons inside the nucleus indirectly, and we now know that at high energies the de Broglie wavelength of an electron is small enough to resolve particles inside the proton. Before the late 1960s, particle accelerators could not produce the energies needed to probe inside protons, but when the SLAC linac came on line, electrons could be accelerated up to 20 GeV. Low energy electrons tend to be scattered away by protons. If the electron has sufficient energy, however, it can probe deep inside the proton. The collision is inelastic because the proton is disrupted and new particles are produced. In effect, the electrons diffract off quarks inside the proton, causing one quark to move away from the other two, and this shattering produces a stream of hadrons. These and later experiments have provided evidence for the existence of quarks, gluons and colour.
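To see why 20 GeV electrons can resolve structure inside the proton, their de Broglie wavelength can be estimated. The sketch below assumes the ultra-relativistic approximation p ≈ E/c (reasonable because 20 GeV is far above the electron rest energy of 0.511 MeV) and the value hc ≈ 1239.84 MeV fm; neither is given in the text.

```python
HC_MEV_FM = 1239.84  # h*c in MeV fm (assumed constant, not from the text)


def de_broglie_wavelength_fm(energy_mev):
    """Wavelength in fm of an ultra-relativistic particle, using p = E/c."""
    return HC_MEV_FM / energy_mev


wavelength = de_broglie_wavelength_fm(20_000)  # 20 GeV electron
print(f"{wavelength:.3f} fm")  # about 0.062 fm
```

A wavelength of roughly 0.06 fm is well below the femtometre scale of the proton, which is consistent with the claim that such electrons can probe inside it.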
Figure 2134 The bag model of quark confinement
In a normal proton, the quarks are close together and are free to move within the proton bag. However, when energy of the order of 1 GeV per femtometre ($10^{-15}$ m) is supplied, the bag stretches like a balloon. The energy needed to remove a quark is much larger than that needed to produce a quark-antiquark pair. So instead of a free quark being removed, a shower of mesons is produced.
J.5.4 J.5.5
BOSON
NEUTRAL CURRENTS
Recall that the weak interaction is mediated by the W+, the W- and the Z0 intermediate bosons. In the 1960s, it was thought that the weak force that was responsible for radioactive decay was carried by the W+ and W- intermediate bosons. However, when the electroweak theory that unified the electromagnetic and weak force was proposed by Sheldon Glashow and Steven Weinberg at Harvard and Abdus Salam at Imperial College, London,
it became obvious that the weak force should also have a neutral carrier. If this was the case, there must be reactions in which no charge is transferred between the interacting particles. They termed such a reaction a neutral current reaction. Neutral currents were first observed at CERN in 1973, and the Z0 boson itself was detected there in 1983. The discovery of the neutral current Z0 interaction was of great significance in establishing the validity of the electroweak theory. Electroweak forces can be classified as:
The electromagnetic interaction mediated by the photon.
The charged-current weak interaction mediated by the W+ and W- intermediate bosons.
The neutral-current weak interaction mediated by the Z0 intermediate boson.
Let us examine these interactions and their exchange particles to see how they are unique to certain reactions. The photon carries no electric charge and therefore there is no charge change between the incoming and outgoing particles. The W+ and W- cause an exchange of charge, such as when a neutral muon-neutrino is changed into a charged muon. The W is involved in processes such as the decay of the neutron, beta decay and pion decay. The Z0 carries no electric charge and so an incident particle such as the neutrino remains as an outgoing neutrino. These neutral particles can also be exchanged via the photon, and thus the link between the electromagnetic force and the weak force can be seen in what we term electroweak unification. Just as an interaction of the Z0 with a lepton does not change its charge, the interaction with a quark does not change the colour of the quark. This fits perfectly with the lepton-quark standard model that predicts weak interaction processes involving the exchange of a massive, neutral particle (the Z0 boson).
J.6.2 Solve problems involving particle interactions in the early universe.
J.6.3 State that the early universe contained almost equal numbers of particles and antiparticles.
J.6.4 Suggest a mechanism by which the predominance of matter over antimatter has occurred.
J.6.5 Describe qualitatively the theory of strings.
IBO 2007
radiation, they would have been scattered and absorbed by electrons and positrons. This energy would have spread out in all space dimensions and the temperature would have dropped.
photons, W and Z plus gluons. The gravitational force condensed out. There was a slight imbalance between the matter and anti-matter already occurring. At about $10^{27}$ K and $10^{-35}$ s, it is thought that the strong force separated out and, because the quarks were too close to each other, the strong force could not bind them to form hadrons. There was a sea of quarks, gluons and leptons. However, as the quarks started to separate, quark confinement occurred and hadrons began to form. During the hadron era, the excess of matter over antimatter of the GUT era meant that there was a slight excess of quarks versus antiquarks, creating an excess of baryons versus anti-baryons. At about $10^{15}$ K and $10^{-6}$ s, it is believed the weak force separated from the electromagnetic force.
At about $10^{13}$ K and $10^{-4}$ s, it is thought that hadrons with an energy of about 1 GeV started to annihilate each other and there existed pair-production/annihilation equilibrium. However, once the energy had dropped below 1 GeV, the pair production of nucleons could not take place. There was a slight excess of energy from the annihilations, enough to form some leftover nucleons, and it is this mass that is in the universe today. Light particles such as the photon and lighter leptons thus dominated the universe in equal numbers, and thus began the lepton era.
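The link between the temperatures and the particle energies quoted for each era can be checked with the approximation E ≈ kT. The sketch below uses the rounded constants k = 1.38 × 10^-23 J K^-1 and 1.6 × 10^-19 J per eV; the function name and the era labels are illustrative assumptions.

```python
K_B = 1.38e-23        # Boltzmann constant, J/K
J_PER_EV = 1.6e-19    # joules per electron-volt


def thermal_energy_ev(temperature_k):
    """Typical particle energy in eV at a given temperature, using E = kT."""
    return K_B * temperature_k / J_PER_EV


for label, temperature in [
    ("hadron annihilation", 1e13),
    ("lepton era", 1e10),
]:
    energy = thermal_energy_ev(temperature)
    print(f"{label}: T = {temperature:.0e} K, E of order {energy:.1e} eV")
```

The results come out at just under 1 GeV for $10^{13}$ K and just under 1 MeV for $10^{10}$ K, in line with the energies quoted for those eras.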
MATTER
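[Figure: temperature (K) plotted against time, from $10^{-43}$ s to about $10^{10}$ years, with the lepton era, the radiation era, nucleosynthesis, and stars and galaxies marked.]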
Figure 2135 Variation of temperature with time of the universe.
The average kinetic energy of a particle is given by the expression $E = \tfrac{3}{2}kT$, where $k$ is Boltzmann's constant, which has a value of $1.38 \times 10^{-23}$ J K$^{-1}$. Because we are dealing with such large energies, this equation can be stated as $E = kT$. Prior to $10^{-43}$ s, it is speculated that the four forces were unified into only one force, and it is believed that the temperature was around $10^{32}$ K. This means that the energy of a particle would be approximately:
$E = (1.38 \times 10^{-23}\ \text{J K}^{-1} \times 10^{32}\ \text{K}) \div (1.6 \times 10^{-19}\ \text{J eV}^{-1}) \approx 10^{19}$ GeV.
The particles had so much energy that the symmetry of the four forces was disrupted and the strong, electromagnetic and weak forces created an array of quarks, leptons,
At about $10^{10}$ K and 1 s, it is believed that the lighter leptons with energy about 1 MeV were still able to create electrons, positrons and photons in equal numbers, and equilibrium between pair-production and annihilation still existed. However, within a few seconds, electrons and positrons started to annihilate each other in larger numbers, their numbers dropped, and there was a slight excess of electrons over positrons. At about 10 s, there was a large excess of photons and neutrinos and the radiation era began. At about $10^{9}$ K and 3.2 s, crucial events began to occur as nuclei of hydrogen, helium, deuterium and lithium started to form in what is known as nucleosynthesis. Let us do a quick calculation of the temperature for an average kinetic energy of particles of around 500 keV. Since $E = \tfrac{3}{2}kT$,
$T = \frac{2}{3}\frac{E}{k} = \frac{0.67 \times 500 \times 10^{3}\ \text{eV} \times 1.6 \times 10^{-19}\ \text{J eV}^{-1}}{1.38 \times 10^{-23}\ \text{J K}^{-1}} = 3.9 \times 10^{9}$ K
The universe was cooling too quickly for heavier nuclei to form and nucleosynthesis stopped.
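The temperature calculation above can be reproduced numerically. This is a minimal sketch using the same rounded constants as the text; the function name is an assumption made for illustration.

```python
K_B = 1.38e-23        # Boltzmann constant, J/K
J_PER_EV = 1.6e-19    # joules per electron-volt


def temperature_for_energy(energy_ev):
    """Temperature in K at which the average kinetic energy (3/2)kT equals energy_ev."""
    return (2.0 / 3.0) * energy_ev * J_PER_EV / K_B


temperature = temperature_for_energy(500e3)  # 500 keV
print(f"{temperature:.2e} K")
```

With the exact factor 2/3 the result is a little under 3.9 × 10^9 K; the text rounds 2/3 to 0.67 and quotes 3.9 × 10^9 K.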
At about 3000 K and 300 000 years, free electrons with energies of a few electron-volts became bound to nuclei to form atoms. The photons that had previously been ionizing atoms were now free to spread throughout the universe, and the universe became transparent. This photon radiation has since been red-shifted and has cooled to 2.7 K, forming the cosmic background radiation that we detect from every part of the universe. From this point the universe became matter-dominated.
J.6.3 J.6.4
PARTICLES AND ANTIPARTICLES IN THE UNIVERSE
The early universe contained almost equal numbers of particles and antiparticles with a small imbalance favouring matter over anti-matter. However, once photons had reached energies at which they could not undergo pair production from their interaction with particles and antiparticles, matter dominated over anti-matter.
GLOSSARY
A
a.f (audio frequency) amplifier an amplifier that amplifies signals in the approximate range 10 Hz to 20 kHz.
aberration an image defect, of which blurring and distortion are the most common. Aberrations can occur with the use of both lenses and mirrors.
absolute magnitude (M) the apparent magnitude of a star if it were at a distance of 10 pc from Earth.
absolute zero the point where molecular motion becomes a minimum; the molecules have minimum kinetic energy but molecular motion does not cease.
absorbed dose (D) the amount of energy E transferred to a particular unit mass m. The SI unit of absorbed dose is J kg$^{-1}$, otherwise known as the gray (Gy).
absorption spectrum occurs when white light passes through a substance in the gaseous phase. Dark lines in the white light correspond to the wavelengths characteristic of the emission spectrum of the particular substance.
AC transformer a device that can be used for increasing or decreasing ac voltages and currents.
acceleration see average acceleration and instantaneous acceleration.
accommodation the ability of the eye to focus over a range of distances, controlled by the ciliary muscles pulling or relaxing in order to change the focal length of the flexible eye lens. The eye has most accommodation for prolonged viewing when viewing at the far point.
accuracy an indication of how close a measurement is to the accepted value, indicated by the relative or percentage error in the measurement. An accurate experiment has a low systematic error.
acoustic impedance a measure of how easy it is to transmit sound through a particular medium. The unit of acoustic impedance is the rayl.
active solar heating the use of solar collectors to convert solar energy into heat energy.
adiabatic expansion or contraction is one in which no thermal energy Q is allowed to flow into or out of the system. For the entire adiabatic process, Q = 0.
aerial a conductor designed to detect a transmitted EM signal.
air resistance a term that refers to the drag force exerted on objects as they move through the atmosphere.
albedo ($\alpha$) at a surface, the ratio of the radiation reflected to the incoming radiation, expressed as a coefficient or as a percentage. (Latin for white)
alpha-particle a doubly ionised helium atom, that is, a helium nucleus.
AM see amplitude modulation.
ammeter an instrument used to measure the current flowing in an electric circuit; it is always connected in series.
A-mode scan measures the time elapsed between when the pulse is sent and when the echo is received. The first echo is from the skin, the second and third echoes are from either side of the first organ, and the fourth and fifth echoes are from either side of the second organ. The pulse intensity decreases due to attenuation.
ampere defined in terms of the force per unit length between parallel current-carrying conductors.
amplifier any device that amplifies a signal.
amplitude the maximum displacement of a particle from its equilibrium position when executing SHM. For wave motion it is the maximum displacement of the medium through which the wave travels.
amplitude modulation (AM) the encoding of information on to a carrier wave by producing variations in the amplitude of the carrier wave.
angle of incidence the angle between the direction of travel of the incident wave and the normal to the boundary.
angle of refraction the angle between the direction of travel of the refracted wave and the normal to the boundary.
angular frequency ($\omega$) $2\pi$ times the linear frequency.
angular magnification the ratio $\theta/\theta_0$ is called the angular magnification M or magnifying power of the lens.
antineutrino a particle with zero rest mass and zero charge that results from beta-minus decay and decay of a free neutron.
antinode a point on a stationary wave where the displacement is a maximum.
antiparticles all particles have antiparticles which are identical to the particle in mass and half-integral spin but are opposite in charge to their corresponding particle. Although antiparticles have the same mass as their particle pair, they have opposite charge, lepton number, baryon number and strangeness. Some electrically neutral bosons and mesons are their own antiparticle.
aperture the length of the refracting surface on which the incident rays can be refracted.
apparent brightness the apparent brightness of a star (b) is the energy received from the star per unit time per unit area of the Earth's surface.
apparent magnitude (m) a measure of how bright a star appears. The scale is defined such that a difference in apparent magnitude of 5 corresponds to a factor of 100 in brightness. This means that 100 stars of magnitude 6 will produce as much power per unit area at the surface of the Earth as a single star of apparent magnitude 1. The higher the value of m, the less bright is the star.
artificial transmutation a process by which nuclei of an element can be induced to form nuclei of a different element, often by bombardment with neutrons.
APPCDC Asia-Pacific Partnership for Clean Development and Climate, an organisation that proposed that, rather than imposing compulsory emission cuts, it would work in partnership to complement the Kyoto protocol. The six countries involved were Australia, China, India, Japan, South Korea and the USA.
astronomical unit (AU) the average distance between Earth and the Sun. 1 AU = $1.50 \times 10^{11}$ m.
atomic mass unit (u) this is 1/12th of the mass of an atom of carbon-12.
attenuation of an X-ray beam is the reduction in its intensity due to its passage through matter.
average acceleration change in velocity over an interval of time divided by the time interval.
average speed change in distance over an interval of time divided by the time interval.
average velocity change in displacement over an interval of time divided by the time interval.
Avogadro's number one mole of a substance at 0 °C and 101.3 kPa pressure (STP) contains $6.02 \times 10^{23}$ particles.
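The apparent magnitude entry above defines a difference of 5 magnitudes as a factor of 100 in brightness. A minimal sketch of that scale follows; the function and parameter names are assumptions made for illustration.

```python
def brightness_ratio(m_faint, m_bright):
    """How many times brighter the star of magnitude m_bright appears
    than the star of magnitude m_faint (5 magnitudes = factor of 100)."""
    return 100 ** ((m_faint - m_bright) / 5)


print(brightness_ratio(6, 1))  # 100.0
```

A magnitude 1 star therefore appears 100 times brighter than a magnitude 6 star, which is the same statement as 100 stars of magnitude 6 matching a single star of magnitude 1.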
B
bandwidth the frequency range covered by the sideband frequencies.
baryons the heavyweights amongst particles that make up matter, including the proton and the neutron. Other baryons include Lambda 0, Sigma +, 0 and -, Cascade 0 and - and Omega - particles, to name but a few.
becquerel this is 1 nuclear disintegration per second.
beta particle a negative or a positive electron associated with radioactive decay.
Big Bang Theory postulates that the Universe emerged from an enormously dense and hot state about 14 billion years ago. The size of the universe at its beginning was assumed to be extremely small with enormous temperature and pressure. It is assumed that a gigantic explosion occurred that created space, time and matter.
binary stars two stars that orbit a common centre of gravity.
biological half-life (TB) of a material is the time taken for half the radioactive substance to be removed from the body by biological processes.
black hole an object whose gravitational field strength at its surface is large enough to prevent light escaping from its surface; alternatively, an object whose escape velocity at its surface is equal to or greater than the free space speed of light.
black-body radiation the radiation emitted by a perfect emitter. The radiation is sometimes called temperature radiation because the relative intensities of the emitted wavelengths are dependent only on the temperature of the black body.
breeder reactor a nuclear fission reactor that creates or breeds more fissionable material than it consumes.
bremsstrahlung when a fast-moving particle is rapidly decelerated or deflected by another target particle, it radiates most of its energy in the form of photons in what is known as bremsstrahlung or braking radiation in the X-ray region of the electromagnetic spectrum.
Brewster angle ($\phi$) the angle to the normal at which reflected light is completely plane polarized.
Brewster's law the refractive index n of a substance is related to the Brewster angle ($\phi$) by $n = \tan\phi$.
Brownian motion the random, zig-zag motion observed when larger molecules or particles in motion collide with smaller molecules.
B-scan mode (brightness-modulated scan) an array of transducers scans a slice in the body. Each echo is represented by a spot of a different shade of grey on an oscilloscope.
centripetal acceleration the acceleration of a particle travelling in a circle.
centripetal force the general name given to the force causing a particle to travel in a circle.
cepheid variables stars whose luminosity varies with a regular frequency.
Chandrasekhar limit the maximum mass of a star for it to become a white dwarf (1.4 Msun).
change of state (of an ideal gas) if some macroscopic property of the system has changed, e.g. phase, temperature, pressure, volume, mass, internal energy.
chemical energy energy associated with chemical reactions.
chromatic aberration produces coloured edges around an image. It can be minimised by using an achromatic doublet, which is made from a converging crown glass lens and a diverging flint glass lens that are adhered together by Canada balsam.
coal an organic material made up primarily of carbon, along with varying amounts of hydrogen, oxygen, nitrogen and sulfur. It is a sedimentary rock.
coaxial cable consists of a thin copper wire surrounded by an insulator which in turn is surrounded by a copper grid. This grid is also surrounded by an insulator.
cochlea the most delicate organ in the hearing process; it contains many intricate structures that will not be fully investigated at this level. It consists of three chambers: two outer chambers, the scala vestibuli (top) and the scala tympani (bottom), and an inner chamber called the scala media.
coefficient of volume (or cubical expansion) () the fractional change in volume per degree change in temperature.
coherent when the filament of a light globe emits light, the atoms on the filament do not maintain a constant phase relationship because the filament atoms act independently from each other. The light emitted is incoherent. However, in a laser, each photon of light is in phase with all the other photons. Laser light is coherent.
C
carrier wave the name given to the wave that is altered by the superposition of the signal wave.
cell phones another name for mobile phones.
centre of curvature C the centre of the sphere of which the lens is made.
combined cycle gas turbines (CCGT) a jet engine is used in place of the turbine to turn the generator. Natural gas is used to power the jet engine and the exhaust fumes from the jet engine are used to produce steam which turns the generator.
compression digital data can be compressed, enabling the same bandwidth to be used by several different broadcasting channels.
computed tomography (CT) imaging also called computed axial tomography (CAT) imaging, uses X-rays, scintillation detectors and computer technology to build up an axial scan of a section of an organ or part of the body with 256 grey shades.
conduction the process by which a temperature difference causes the transfer of thermal energy from the hotter region of the body to the colder region by particle collision without there being any net movement of the substance itself.
conductor conductors have a low electrical resistance and are therefore able to carry an electric current without much energy dissipation as heat.
cones photoreceptors that have slow response rates and are insensitive at low light levels, but are sensitive to particular wavelengths of light and give us our colour vision. There are around 6.5 million of them. It is thought that the cones can be divided into three colour groups: red cones (64%), green cones (32%), and blue cones (2%).
conservation of energy states that energy cannot be created or destroyed but only transformed into different forms. (See conservation of mass-energy and first law of thermodynamics)
conservation of mass-energy states that mass and energy are interchangeable and in any interaction mass-energy is conserved.
constellation a collection of stars that form a recognisable group as viewed from Earth (e.g. the Plough).
constructive interference occurs when two or more waves overlap and their individual displacements add to give a displacement that is greater than any of the individual displacements.
control rods the rate of nuclear fission in the reactor core can be controlled by inserting or removing the control rods. The control rods are constructed of materials that absorb neutrons.
convection the process in which a temperature difference causes the mass movement of fluid particles from areas of high thermal energy to areas of low thermal energy (the colder region).
conventional current flows from the positive to the negative terminal.
coolant a material that circulates through the reactor core and removes thermal energy, transferring it to where it can do useful work by converting water into steam.
Coulomb's Law the force F between two point charges q1 and q2 is directly proportional to the product of the two point charges and inversely proportional to the square of the distance r between them.
crest the maximum displacement of a medium through which a wave travels.
critical angle the angle, measured to the normal, at which a ray incident on a boundary between two media will undergo total internal reflection in the more dense medium.
critical mass the smallest possible amount of fissionable material that will sustain a chain reaction.
crude oil a product of the decomposition of marine plants and animals that were rapidly buried in sedimentary basins where there was a lack of oxygen.
cyclotron basically like a linac that has been wrapped into a tight spiral.
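The proportionality stated in the Coulomb's Law entry can be illustrated numerically. The sketch below assumes the usual constant of proportionality in a vacuum, k ≈ 8.99 × 10^9 N m^2 C^-2, which is not given in the glossary entry; the function name is also an assumption.

```python
COULOMB_K = 8.99e9  # N m^2 C^-2 (assumed vacuum value, not from the entry)


def coulomb_force(q1, q2, r):
    """Magnitude-and-sign of the force in newtons between point charges
    q1 and q2 (in coulombs) separated by r metres."""
    return COULOMB_K * q1 * q2 / r**2


near = coulomb_force(1e-6, 1e-6, 1.0)  # two 1 microcoulomb charges, 1 m apart
far = coulomb_force(1e-6, 1e-6, 2.0)   # same charges, 2 m apart
print(near, far, near / far)
```

Doubling the separation reduces the force to one quarter, which is the inverse-square dependence described in the entry.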
D
damping the decrease with time of the amplitude of oscillations.
data transfer rate the number of bits transmitted per second, also called bit rate.
DC amplifier another name for an operational amplifier.
de Broglie hypothesis any particle with momentum p can exhibit wave-like properties, and its wavelength is given by the de Broglie formula $\lambda = h/p$.
degree of uncertainty of a measurement is equal to half the limit of reading.
demodulator removes the carrier wave leaving only the signal waves.
derived quantity a quantity involving the measurement of two or more fundamental quantities.
destructive interference occurs when two or more waves overlap and their individual displacements add to give a displacement that is less than any of the individual displacements.
differential amplifier another term for an operational amplifier.
diffraction the bending and/or spreading of waves when they meet an obstruction or pass through an aperture.
diffusion a property observed in solids, liquids and gases as something spreads out.
dioptre the unit for the lens power is the dioptre D, with the unit m$^{-1}$.
dispersion when a narrow beam of white light undergoes refraction on entering a prism, the light spreads out into a spectrum of colours. The colours range from red at one side of the band, through orange, yellow, green, blue and indigo, to violet at the other side of the band. The separation of the white light into its component colours is due to dispersion.
displacement distance travelled in a specified direction.
Doppler Effect the phenomenon of the change in frequency that arises from the relative motion between a source and observer.
dosimetry the study of radiation.
drag force see air resistance.
drift velocity electrons entering at one end of the metal cause a similar number of electrons to be displaced from the other end, and the metal conducts. Even though they are accelerated along their path, it is estimated that the drift velocity is only a small fraction of a metre each second (about $10^{-4}$ m s$^{-1}$).
E
eccentricity the Earth's orbit around the Sun is not circular but rather elliptical, and this will affect its orbit every 100 000 and 400 000 years, which in turn leads to climate change.
eddy currents any conductor that moves in a magnetic field has an emf induced in it, and as such a current, called an eddy current, will also be induced in the conductor. This current has a heating effect in the soft iron core of the transformer which causes a power loss termed an iron loss.
effective half-life (TE) of the radioactive substance will be less than the physical half-life due to the biological half-life component.
efficiency of an energy conversion process is the ratio of the useful energy output to the total energy input, usually expressed as a percentage.
Einstein photoelectric equation $E_{\max} = hf - hf_0$, where $E_{\max}$ is the maximum kinetic energy of the emitted electrons, f is the frequency of the incident light, f0 is the threshold frequency and h is the Planck constant.
Einstein Principle of Equivalence states that it is impossible to distinguish between gravitational and inertial effects.
elastic potential energy the energy associated with a system subject to stress, e.g. a stretched spring.
electric current the rate at which charge flows past a given cross-section.
electric field strength (electric field intensity) at any point in space, E is equal to the force per unit charge exerted on a positive test charge. It is a vector quantity.
electric potential difference between two points in a conductor is defined as the power dissipated per unit current in moving from one point to another.
electric potential defined in terms of a point charge moving in an electric field: the electric potential at a point in an electric field is the work done per unit charge in bringing a small positive point charge from infinity to that point.
electric potential energy the energy associated with a particle due to its position in an electric field.
electrical energy this is energy that is usually associated with an electric current and that is sometimes referred to incorrectly as electricity.
electrical resistance the ratio of the potential difference across the material to the current that flows through it. The units of resistance are volts per ampere (V A$^{-1}$). However, a separate SI unit called the ohm is defined as the resistance through which a current of 1 A flows when a potential difference of 1 V is applied.
electrical strain gauge when a metal conducting wire is put under vertical strain, it will become longer and thinner and as a result its resistance will increase. An electrical strain gauge is a device that employs this principle.
electromagnetic waves waves that consist of oscillating electric and magnetic fields. They are produced by the accelerated motion of electric charge.
electromotive force (emf) the work per unit charge made available by an electrical source.
electron flow flows from the negative to the positive terminal.
electron microscope a microscope that utilizes the wave properties of electrons.
electron-volt (eV) the energy acquired by an electron as a result of moving through a potential difference of one volt.
electrostatics the study of stationary electric charges.
elementary particles particles that have no internal structure, that is, they are not made out of any smaller constituents. The elementary particles are the leptons, quarks and exchange particles.
emission spectra the spectra produced by excited gaseous atoms or molecules.
emissivity the ratio of the amount of energy radiated from a material at a certain temperature to the energy that would come from a blackbody at the same temperature; as such it is a number between 0 and 1.
energy the capacity to do work.
energy balance climate model the word balance implies that the system is in equilibrium with no energy being accumulated in the Earth's surface and atmosphere. This model attempts to account for the difference between the incoming radiation intensity and the outgoing radiation intensity, and the simplest energy balance model chooses temperature as the only variable to be considered.
energy degradation when energy is transferred from one form to other forms, the energy before the transformation is equal to the energy after (Law of conservation of energy). However, some of the energy after the transformation may be in a less useful form, usually heat. We say that the energy has been degraded.
energy density the amount of potential energy stored in a fuel per unit mass, or per unit volume, depending on the fuel being discussed.
entropy a thermodynamic function of the state of the system that can be interpreted as the amount of order or disorder of a system.
equipotential lines lines that join points of equal potential in a gravitational or electric field.
equipotential surface all points on an equipotential surface are at the same potential.
equipotentials regions in space where the electric potential of a charge distribution has a constant value.
ether a substance that was thought to permeate the whole of space and that was at absolute rest.
evaporation a change from the liquid state to the gaseous state that occurs at a temperature below the boiling point.
GLOSSARY
evaporative cooling: as a substance evaporates, it needs a thermal energy input to replace its lost latent heat of vaporisation, and this thermal energy can be obtained from the remaining liquid and its surroundings.
exchange particles: elementary particles that transmit the forces of nature.
exponential decay: when a quantity continuously halves in value in equal intervals of time, the quantity is said to decay exponentially.
exposure: defined for X-radiation and γ-radiation as the total charge (Q) of ions of one sign (either electrons or positrons) produced in air when all the β-particles liberated by photons in a volume of air of mass m are completely stopped in air, divided by that mass (X = Q/m).
extrapolation: extending the line of best fit outside the plotted points of a graph.
first law of thermodynamics: a statement of the Law of Conservation of Energy in which the equivalence of work and thermal energy transfer is taken into account. It can be stated as: the heat added to a closed system equals the change in the internal energy of the system plus the work done by the system.
flux linkage: if B is the flux density through a cross-sectional area A of a conductor with N coils, the flux linkage is NΦ = NBA.
focal length (f): the distance between the principal focus and the centre of the refracting surface.
forced oscillations: oscillations resulting from the application of an external, usually periodic, force.
fossil fuels: naturally occurring fuels that have been formed from the remains of plants and animals over millions of years. The common fossil fuels are peat, coal, crude oil, oil shale, oil tar and natural gas.
fractional uncertainty: see relative uncertainty.
frame of reference: a set of coordinates used to define position.
Fraunhofer diffraction: diffraction resulting from the source of light and the screen on which the diffraction pattern is produced being an infinite distance from the diffracting aperture.
frequency: linear frequency (f) is the number of complete oscillations a system makes in unit time.
frequency modulation (FM): the encoding of information on to a carrier wave by producing variations in the frequency of the carrier wave.
Fresnel diffraction: diffraction resulting from either or both the source of light and the screen on which the diffraction pattern is produced being a finite distance from the diffracting aperture.
frictional force: the force that arises between two bodies in contact.
fundamental: see first harmonic.
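The exponential decay entry above can be illustrated numerically. The following is a minimal sketch, assuming a quantity that halves every `half_life`; the function names and the starting values (800 units, half-life of 5 time units) are hypothetical and chosen only for illustration.

```python
import math


def remaining(initial: float, elapsed: float, half_life: float) -> float:
    """Amount left after `elapsed` time for a quantity that halves every `half_life`."""
    return initial * 0.5 ** (elapsed / half_life)


def decay_constant(half_life: float) -> float:
    """Decay constant of the equivalent form N = N0 * exp(-constant * t)."""
    return math.log(2) / half_life


# Illustrative values only (arbitrary units).
n0, t_half = 800.0, 5.0
for k in range(4):
    t = k * t_half
    print(f"t = {t:4.1f}  N = {remaining(n0, t, t_half):6.1f}")
```

Each equal time step of one half-life halves the printed value, which is the defining property stated in the glossary entry.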
F
far point: the position of the furthest object that can be brought into focus by the unaided eye. The far point of a normal eye is at infinity.
Faraday's Law: can be stated as: the magnitude of the induced emf in a circuit is directly proportional to the rate of change of magnetic flux or flux linkage.
feedback resistance: the value of the resistance that feeds the output signal of an operational amplifier back to the input.
Feynman diagrams: so named for their inventor, the American physicist Richard Feynman (1918-1988). They were developed by Feynman as a graphical tool to examine the conservation laws that govern particle interactions according to quantum electrodynamic theory.
film badge: a double emulsion photographic film that is placed inside a holder with an area of 3 cm by 5 cm that contains different thicknesses of plastic, an open window and 3 different metal plates. It is pinned to clothing and, over a period of time, exposure to radiation results in a darkening of specific areas of the photographic film.
first harmonic (also fundamental): the first possible mode of vibration of a stationary wave.
fundamental interactions/forces: all forces that appear in nature may be identified as one of four fundamental interactions: the gravitational, weak, electromagnetic or strong interaction.
fundamental units: kilogram, metre, second, ampere, mole and kelvin.
gravitational red-shift: the observed frequency of light emitted from a source depends upon the position of the source in a gravitational field.
gravitational time dilation: the slowing of time due to a gravitational field.
graviton: the exchange particle for the gravitational force, which is an inverse square force with an infinite range that affects all particles and acts on all mass/energy. The graviton has a rest mass of zero.
G
galaxies: collections of stars held together by gravity.
gamma ray bursters: astronomical objects that emit intense bursts of gamma radiation, thought to be due to the collapse of a rapidly rotating neutron star.
gamma ray(s): high frequency electromagnetic radiation, that is, high energy photons.
generator: essentially a device for producing electrical energy from mechanical energy.
geodesic: the shortest path followed by an object moving in spacetime.
geostationary satellite: a satellite that orbits Earth in a circular orbit above the equator and has an orbital period of one sidereal day.
gluons: the exchange particles that are responsible for quark colour. Just as positive and negative charges are associated with the electromagnetic force, three colour charges are associated with quarks and with the gluons that bind the quarks together.
gravitational lensing: the bending of light by a gravitational field.
gravitational mass: the mass that gives rise to the gravitational attraction between bodies as defined by Newton's law of gravity.
gravitational potential: the gravitational potential at a point in a gravitational field is defined as the work done per unit mass in moving a point mass from infinity to the point.
gravitational potential energy: the energy associated with a particle due to its position in a gravitational field.
H
hadrons: are not elementary particles because they are composed of quarks. Mesons consist of a quark and an antiquark. Baryons have three quarks. A proton has 2 up quarks and 1 down quark (uud), and the neutron has 2 down quarks and 1 up quark (ddu). Hadrons interact predominantly via the strong nuclear force, although they can also interact via the other forces.
half-life: see radioactive half-life.
half-value thickness: the thickness of a material that reduces the intensity of a monoenergetic X-ray beam to half its original value.
harmonic series: a series of musical notes arising from a particular fundamental frequency.
harmonics: the different possible modes of vibration of a stationary wave.
heat: the thermal energy that is absorbed, given up or transferred from one object to another.
heat capacity: see thermal capacity.
heat engine: any device that converts thermal energy into work.
heat exchanger: a system basically acting as a heat engine driven by chemical reactions (the combustion of fossil fuels) or by nuclear reactions. The working fluid is water heated in a boiler that is converted to steam at high pressure.
heat pump: any device that can pump heat from a low-temperature reservoir to a high-temperature reservoir.
heat: the non-mechanical transfer of energy between a system and its surroundings.
Heisenberg Uncertainty Principle: the Uncertainty Principle was proposed by Werner Heisenberg in 1927, as explained in the text.
Hertzsprung-Russell diagrams: a plot of the luminosity (or absolute magnitude) against temperature (or spectral class).
Hubble's law: states that the relative recession speed between galaxies is proportional to their separation.
instantaneous velocity: the rate of change of displacement with time.
insulator: a material in which the electrons are held tightly by the atomic nuclei and are not free to move through the material. Charges can accumulate on the surface of the insulator but they are not conducted.
intensity: the energy that a wave transports per unit time across unit area of the medium through which it is travelling.
interference pattern: the overall pattern produced by interfering waves.
Intergovernmental Panel on Climate Change (IPCC): in the 1980s, the United Nations Environment Programme in conjunction with the World Meteorological Organization set up a panel of government representatives and scientists to determine the factors that may contribute to climate change. The panel was known as the Intergovernmental Panel on Climate Change (IPCC).
internal energy: the sum total of the potential energy and the random kinetic energy of the molecules of the substance making up the system.
internal resistance: the resistance inside a source of electrical energy.
interpolation: drawing the line of best fit between the plotted points of a graph.
inverting amplifier: an operational amplifier in which the non-inverting input is connected to earth.
ionising radiation: radiation that causes ions to form.
ionization current: the current in a gas that results from the ionization of the atoms or molecules of the gas.
ionization: the removal of an electron or electrons from an atom.
isobaric: describes a process in which the pressure is kept constant while the volume changes. On a graph of pressure as a function of volume, such a process is a horizontal line, and the work done by the gas is equal to the area under the curve.
I
ideal gas: a theoretical gas that obeys the equation of state of an ideal gas exactly. Ideal gases obey the equation pV = nRT at all pressures, volumes and temperatures, and there are no forces between their molecules.
induced current: if a conductor connected to a galvanometer is moved across a magnetic field, the needle of the galvanometer deflects in one direction. After a very short period of time, the needle returns to zero on the scale. The current produced is called an induced current.
inertia: a body's reluctance to change its state of motion.
inertial mass: the mass referred to in Newton's second law.
inertial reference frame: a reference frame in which Newton's first law holds true.
insolation: incoming solar radiation. It is mainly in the visible region of the electromagnetic spectrum (0.4 μm to 0.7 μm) and the short-wave infra-red region.
instantaneous acceleration: the rate of change of velocity with time.
instantaneous speed: the rate of change of distance with time.
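The ideal gas equation pV = nRT quoted in the ideal gas entry can be checked with a short calculation. This is a minimal sketch: the function name is hypothetical, the value of R is rounded, and the sample state (1 mol at 273.15 K in 0.0224 m³) is an illustrative assumption.

```python
R = 8.314  # molar gas constant in J mol^-1 K^-1 (rounded)


def pressure(n_mol: float, temperature_k: float, volume_m3: float) -> float:
    """Pressure in pascals of an ideal gas, from pV = nRT."""
    return n_mol * R * temperature_k / volume_m3


# One mole at 273.15 K in 0.0224 m^3 should come out close to atmospheric pressure.
p = pressure(1.0, 273.15, 0.0224)
print(f"p = {p:.0f} Pa")
```

The result is close to 1.0 × 10⁵ Pa, roughly one standard atmosphere.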
isochoric: describes a process in which the volume is kept constant while the pressure changes. On a graph of pressure as a function of volume, the curve of such a transformation is called an isochore.
isolated system: a system where no energy of any kind enters or leaves the system.
isothermal process: a thermodynamic process in which the pressure and the volume are varied while the temperature is kept constant. In other words, when an ideal gas expands or is compressed at constant temperature, the gas is said to undergo an isothermal expansion or compression.
isotopes: atoms of the same element with different numbers of neutrons in their nuclei.
Kyoto Protocol: this agreement required industrialized countries to reduce their emissions by 2012 to an average of 5 percent below 1990 levels. A system was developed to allow countries that had met this target to sell or trade their extra quota to countries having difficulty meeting their reduction deadlines.
L
laminations: to reduce the heating effect due to eddy currents, the soft-iron core is made of sheets of iron called laminations that are insulated from each other by an oxide layer on each lamination. This insulation prevents currents from moving from one lamination to the next.
laser: an acronym for light amplification by stimulated emission of radiation. A laser is an instrument that has a power source and a light-amplifying substance. There are a variety of solid, liquid and gas lasers available on the market. The common laser used in the laboratory uses a helium-neon gas mixture as the light-amplifying substance.
latent heat of fusion: the quantity of thermal energy required to change a substance from a solid at its melting point completely to a liquid at its melting point.
latent heat of vaporisation: the quantity of thermal energy required to change a substance from a liquid at its boiling point completely to a gas at its boiling point.
Law of conservation of electric charge: in a closed system, the amount of charge is constant.
laws of reflection: the angle at which waves are reflected from a barrier is equal to the angle at which they are incident on the barrier (the angles are measured to the normal to the barrier). All waves, including light, sound and water waves, obey this rule. The incident ray, the reflected ray and the normal all lie in the same plane.
lens: a transparent object with at least one curved surface but more commonly two curved faces. Most lenses are made of glass but perspex (lucite) and quartz lenses are common. They are used to correct defects of vision using spectacles and in optical instruments such as cameras, microscopes and refracting telescopes.
K
Kelvin temperature: a fundamental quantity. The kelvin, the SI unit of thermodynamic temperature, is the fraction 1/273.16 of the thermodynamic temperature of the triple point of water. A temperature of 1 °C corresponds to 1 + 273 = 274 K (more precisely, 274.15 K).
Kepler's third law: the law of periods. It states that the average orbital radius R of a planet about the Sun is related to the period T of revolution of the planet by R³ = kT², where k is a constant.
kilogram: the mass of a particular piece of platinum-iridium alloy that is kept in Sèvres, France.
kilowatt-hour (kW h): the energy consumed when 1 kW of power is used for one hour.
kinetic energy: energy associated with motion.
kinetic theory of a gas: when the moving particle theory is applied to gases it is generally called the kinetic theory of gases.
Kirchhoff's current law (junction rule): the sum of the currents flowing into a point in a circuit equals the sum of the currents flowing out at that point.
Kirchhoff's voltage law (loop rule): in a closed loop the sum of the emfs equals the sum of the potential drops.
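Kepler's third law, R³ = kT², implies that R³/T² has the same value for every planet orbiting the Sun. The sketch below checks this with approximate orbital data; the planet values are rounded figures supplied here for illustration (not taken from this glossary), and the function name is hypothetical.

```python
# Approximate data: (mean orbital radius in AU, orbital period in years).
PLANETS = {
    "Earth": (1.000, 1.000),
    "Mars": (1.524, 1.881),
    "Jupiter": (5.203, 11.86),
}


def kepler_ratio(radius_au: float, period_yr: float) -> float:
    """Return R^3 / T^2, which Kepler's third law says is the same constant k for all planets."""
    return radius_au**3 / period_yr**2


for name, (radius, period) in PLANETS.items():
    print(f"{name:8s} R^3/T^2 = {kepler_ratio(radius, period):.3f}")
```

With radius in astronomical units and period in years, the constant k is close to 1 for each planet.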
Lenz's Law: also known as the Second Law of Electromagnetic Induction. It can be stated as: the direction of the induced emf is such that the current it causes to flow opposes the change producing it.
leptons: particles that can travel on their own, meaning that they are not trapped inside larger particles. Six distinct types, called flavors, have been identified along with their antiparticles.
light dependent resistor (LDR): a photo-conductive cell whose resistance changes with the intensity of the incident light.
light year: the distance that light travels in one year. 1 light year (ly) = 9.46 × 10¹⁵ m.
limit of reading: the limit of reading of a measurement is equal to the smallest graduation of the scale of an instrument.
line spectrum: produced when the light emitted by excited gaseous atoms or molecules is passed through a slit and then through a dispersive medium such as a prism or diffraction grating and then brought to a focus on a screen.
linear accelerator (linac): a device that accelerates charged particles in a straight line inside a long evacuated tube.
linear attenuation coefficient: a beam of homogeneous, monoenergetic X-rays contains photons of only one energy and thus only one wavelength.
linear or lateral magnification m (of a lens): given by the ratio of the height of an image to the height of its object, or the ratio of the image distance to the object distance. Linear magnification has no units.
longitudinal waves: in these types of wave, the source that produces the wave vibrates in the same direction as the direction of travel of the wave, i.e. the direction in which the energy carried by the wave is propagated. The particles of the medium through which the wave travels vibrate in the same direction as the direction of travel of the wave (direction of energy propagation).
loudspeaker: a transducer that converts an amplified electrical signal into sound.
luminosity (L): the total power radiated by a star.
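The value quoted for the light year can be reproduced from the speed of light and the length of a year. This is a minimal sketch; the constant names are hypothetical, and the year is assumed to be 365.25 days.

```python
SPEED_OF_LIGHT = 299_792_458            # m s^-1
SECONDS_PER_YEAR = 365.25 * 24 * 3600   # assumes a year of 365.25 days

# Distance travelled by light in one year, in metres.
light_year_m = SPEED_OF_LIGHT * SECONDS_PER_YEAR
print(f"1 ly = {light_year_m:.3e} m")
```

The result agrees with the glossary value of 9.46 × 10¹⁵ m to three significant figures.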
M
Mach's Principle: states that inertial and gravitational mass are identical.
macroscopic property: a property that can be observed. Physical properties such as melting point, boiling point, density, thermal conductivity, thermal expansion and electrical conductivity can be observed and measured.
magnetic flux (Φ): the magnetic flux through a small plane surface is the product of the flux density normal to the surface and the area of the surface. The unit of magnetic flux is the weber (Wb).
magnetic force: a force experienced when a moving charge or a beam of moving charges is placed in a magnetic field.
magnifying power: see angular magnification.
main sequence stars: a grouping of stars on a Hertzsprung-Russell diagram that extends diagonally across the graph from high temperature, high luminosity to low temperature, low luminosity. Stars on the main sequence derive their energy from hydrogen burning in the core of the star.
Malus' law: when light of intensity I₀ is incident on an analyzer whose transmission axis makes an angle θ to the electric field vector, the intensity I of the transmitted light is given by I = I₀ cos²θ.
mass: see gravitational mass and inertial mass.
mass defect: the difference in mass between a nucleus and the sum of the masses of its constituent nucleons. The mass of a nucleus is always less than the sum of the masses of its constituent nucleons.
material dispersion: the spreading out of pulses as they travel along an optic fibre.
matter waves: see de Broglie hypothesis.
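Malus' law, I = I₀ cos²θ, is easy to evaluate for a few analyzer angles. The following is a minimal sketch; the function name and the incident intensity of 10 (arbitrary units) are assumptions made for illustration.

```python
import math


def transmitted_intensity(i0: float, angle_deg: float) -> float:
    """Intensity transmitted by an analyzer at angle_deg to the electric field vector (Malus' law)."""
    return i0 * math.cos(math.radians(angle_deg)) ** 2


# Illustrative incident intensity in arbitrary units.
for angle in (0, 30, 45, 60, 90):
    print(f"theta = {angle:2d} deg  I = {transmitted_intensity(10.0, angle):.2f}")
```

The transmitted intensity falls from the full incident value at 0° to essentially zero at 90°, passing through half the incident value at 45°.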
Maxwell's theory: states that electromagnetic radiation consists of oscillating electric and magnetic fields.
mesons: hadrons that can mediate the strong nuclear force. Like the second and third generation leptons, mesons only exist for a short time and they are thus very unstable.
metal structure: positive ions in a sea of delocalised electrons.
method of mixtures: a common indirect method to determine the specific heat capacity of solids and liquids.
metre: the length of path travelled by light in a vacuum during a time interval of 1/299 792 458 second.
minimum angle of resolution: see Rayleigh criterion.
mobile phone: a phone that is not connected by a landline to a telephone exchange.
modal dispersion: a situation in which pulses associated with different waves in an optic fibre arrive at the detector at different times.
moderator: a material that will slow down the fast neutrons to the speed of the slow thermal neutrons needed for a self-sustained reaction, without absorbing the neutrons when they collide with the moderator material.
modes: the name given to the different paths followed by different waves in an optic fibre.
modulation: the alteration of a wave form.
mole: the amount of substance that contains as many elementary particles as there are in 0.012 kg of carbon-12. The mole is a fundamental unit.
momentum: the product of mass and velocity.
monochromatic: a monochromatic source of radiation is one that has an extremely narrow band of frequencies or an extremely narrow wavelength band (or colour in the case of visible light). Most sources of light emit many different wavelengths. Laser light is monochromatic.
monomode fibres: fibres in which there is only one transmission axis, thereby eliminating modal dispersion.
Morse code: an electronic communication system that used individual groups of electrical pulses to represent letters, which were transmitted along wires.
multiplexing: a means of increasing the bit rate by sending different sets of data apparently simultaneously.
N
natural frequency: the frequency of oscillation of a system that is not subjected to a periodic external force.
natural gas: a product of the decomposition of marine plants and animals that were rapidly buried in sedimentary basins where there was a lack of oxygen.
natural greenhouse effect: a phenomenon in which the natural greenhouse gases absorb the outgoing long wave radiation from the earth and re-radiate some of it back to the earth.
natural radioactivity: a property associated with certain naturally occurring elements in which they emit ionizing radiations.
near point: the position of the closest object that can be brought into focus by the unaided eye. The near point varies from person to person but it has been given an arbitrary value of 25 cm.
nebulae: clouds of interstellar dust and gas.
nematic liquid crystal: a liquid crystal whose molecules are in the shape of a twisted helix.
neutron number: the number of neutrons in a nucleus.
neutrons: uncharged nucleons.
nibble: a 4-bit binary word.
node: a point on a stationary wave where the displacement is always zero.
non-renewable source: one that is considered to be a temporary source that is depleted when it is used.
NTC thermistor (negative temperature coefficient): a thermistor whose resistance decreases when the temperature rises, so that it passes more current.
nuclear binding energy: the energy required to separate the nucleus into its individual nucleons, or the energy that would be released in assembling a nucleus from its individual nucleons.
nuclear energy: energy associated with nuclear reactions.
nuclear fission: the splitting of a nucleus into two other nuclei.
nuclear fusion: the combining of two nuclei into a single nucleus.
nuclear magnetic resonance: the basis of the diagnostic tool known as magnetic resonance imaging (MRI). It is a technique used for imaging blood flow and soft tissue in the body and is the preferred diagnostic imaging technique for studying the brain and the central nervous system. Rather than using X-rays as the source of radiation, it uses radiation in the radio region of the electromagnetic spectrum and magnetic energy to create cross-sectional slices of the body.
nucleon: a proton or a neutron.
nucleon number: the number of nucleons in a nucleus.
nucleosynthesis: the different nuclear processes that take place in stars.
nuclide: the general term for a unique nucleus.
numerical aperture: is related to the resolution of a lens and the wavelength of the light (see text for formula).
Nyquist Theorem: states that the sampling frequency must be equal to or greater than twice the signal frequency.
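The Nyquist Theorem entry can be made concrete with a small calculation. The sketch below is illustrative only: the function names are hypothetical, and the second function uses the standard frequency-folding result to show what a sine wave appears to be when it is sampled too slowly.

```python
def min_sampling_frequency(f_signal_max: float) -> float:
    """Smallest sampling frequency allowed by the Nyquist theorem: twice the signal frequency."""
    return 2.0 * f_signal_max


def apparent_frequency(f_signal: float, f_sample: float) -> float:
    """Frequency a sampled sine wave appears to have after folding into the range 0 to f_sample / 2."""
    return abs(f_signal - f_sample * round(f_signal / f_sample))


# A 20 kHz audio signal needs at least 40 kHz sampling.
print(min_sampling_frequency(20_000.0))

# Sampling at 4 kHz: a 1 kHz tone is captured correctly, but a 3 kHz tone
# (above half the sampling frequency) is indistinguishable from a 1 kHz tone.
print(apparent_frequency(1_000.0, 4_000.0))
print(apparent_frequency(3_000.0, 4_000.0))
```

The last two lines print the same value, which is why the sampling frequency has to be at least twice the highest signal frequency.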
O
Ohm's Law: provided the physical conditions such as temperature are kept constant, the resistance is constant over a wide range of applied potential differences, and therefore the potential difference is directly proportional to the current flowing.
Olbers' paradox: if Newton's model of a uniform, infinite Universe were correct, then the sky would always be bright. This paradox was first proposed by Heinrich Olbers in 1823.
operational amplifier: an amplifier with two inputs, very high input impedance and very high gain.
Oppenheimer-Volkoff limit: the maximum mass of a neutron star beyond which it will collapse to a black hole.
optic fibres: fibres in which the carrier wave is light.
optical microscope: a microscope using visible light and lenses to magnify small objects (usually used in biology and medicine).
order of magnitude: the power of ten closest to a number.
oscillating water column (OWC): wave energy devices that convert wave energy to electrical energy. These can be moored to the ocean floor or built into cliffs or ocean retainer walls.
oscillations: another word for vibrations.
ossicles: a chain of three bones in the ear that transmit vibration from the ear drum to the cochlea. They are called the malleus, incus and stapes, more commonly known as the hammer, anvil and stirrup.
P
pair annihilation: when matter (such as an electron) collides with its corresponding antimatter (such as a positron), both particles are annihilated, and 2 gamma rays with the same energy but with directions at 180° to each other are produced. This is called pair annihilation.
pair production: particle-antiparticle pairs can also be produced when a gamma ray with sufficient energy passes close by a nucleus. The process is the reverse of annihilation and is called pair production.
parallax: the apparent displacement of an object due to the motion of the observer.
parsec: a line of length 1 AU subtends an angle of 1 arcsecond (one second of arc = 4.8 × 10⁻⁶ rad) at a distance of one parsec.
Pauli exclusion principle: states that an orbital can only contain a maximum of two electrons, and when 2 electrons occupy an orbital they have opposite spin.
peak current: an alternating current varies sinusoidally and the maximum current is called the peak current.
peat: a brownish material that looks like wood. Although it can be burnt as a fuel, it contains a lot of water and is very smoky when burnt. Under pressure and over time it will be converted to forms of coal.
percentage uncertainty: the relative uncertainty multiplied by 100 to produce a percentage.
period: the time taken for an oscillating system to make one complete oscillation.
periodicity: repetition of motion both in space and in time.
phase change: a substance can undergo changes of state, or phase changes, at different temperatures. Pure substances (elements and compounds) have definite melting and boiling points which are characteristic of the particular pure substance being examined.
phase difference: the time interval or phase angle by which one wave leads or lags another.
photo-electric effect: the emission of electrons from a metal surface that is illuminated with light above a certain frequency.
photoelectric work function: the minimum energy required to remove an electron from the surface of a metal by photo-emission. It is related to the threshold frequency by φ = hf₀.
photon: the existence of the photon was postulated by Einstein in 1905 as being a quantum of electromagnetic energy, regarded as a discrete particle having zero mass, no electric charge, and an indefinitely long lifetime. The energy E of a photon associated with light of frequency f is given by the Planck equation E = hf.
photopic vision: cones are responsible for photopic vision or high light-level vision, that is, colour vision under normal light conditions during the day. The pigments of the cones are of three types: long wavelength (red), medium wavelength (green) and short wavelength (blue).
photovoltaic devices: use the photoelectric effect. Photons from radiant energy excite electrons in a doped semi-conducting material such as silicon or germanium, and the element becomes conducting, allowing electrons to flow in an external circuit to produce electrical energy.
physical half-life (TR): the physical half-life of a radioactive nuclide is the time taken for half the nuclei present to disintegrate radioactively.
pixels: the smallest elements of an image on an LCD or CCD.
Planck constant: Max Planck postulated that the energy associated with oscillating atoms is proportional to the frequency of oscillation of the atom. The constant (h) relates the energy (E) of a photon to its associated frequency (f): E = hf, with h = 6.6260693 × 10⁻³⁴ J s.
plasma: a super heated gas.
plasma confinement: plasma has to be confined for 1 second with a density of about 500 trillion atoms per cubic centimetre. Because fusion is not a chain reaction, these temperature and density conditions have to be maintained for future fusions to occur.
polarimeter: essentially a tube that is bounded at both ends with polarizing materials.
polarization: the rotation of the plane of vibration of the electric vector of an electromagnetic wave.
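The Planck equation E = hf and the work function relation φ = hf₀ from the entries above can be evaluated directly. This is a minimal sketch using rounded constants; the function names, the sample frequency of 6.0 × 10¹⁴ Hz and the work function of 2.3 eV are hypothetical values chosen for illustration.

```python
H = 6.626e-34   # Planck constant in J s (rounded)
EV = 1.602e-19  # joules per electron-volt (rounded)


def photon_energy_ev(frequency_hz: float) -> float:
    """Photon energy in electron-volts from E = hf."""
    return H * frequency_hz / EV


def threshold_frequency(work_function_ev: float) -> float:
    """Threshold frequency f0 in hertz from the work function, using phi = h * f0."""
    return work_function_ev * EV / H


# Visible light of frequency 6.0e14 Hz (illustrative value).
print(f"E = {photon_energy_ev(6.0e14):.2f} eV")

# A metal with an assumed work function of 2.3 eV.
print(f"f0 = {threshold_frequency(2.3):.2e} Hz")
```

Light below the threshold frequency carries photons with less energy than the work function, so no photo-emission occurs however intense the light is.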
pole (P): central point of the refracting surface.
pollutants: substances that have undesirable effects on living things and property. Air pollution occurs when these pollutants are introduced into the atmosphere.
population inversion: in the ruby laser, light of energy equivalent to 2.25 eV is absorbed from the flash tube, and this raises the electrons of chromium from the ground state E₁ to an excited state E₃. These electrons quickly undergo spontaneous emission and fall to level E₂, known as the metastable energy state. If the incident radiation from the flash tube is intense enough, more electrons are transferred to the E₂ energy level than remain in the ground state, a condition known as population inversion.
positron: a positively charged electron.
potential divider: a device that produces the required voltage for a component from a larger voltage.
potential energy: see elastic potential energy, electric potential energy and gravitational potential energy.
potential gradient: the rate of change of potential V at a point with respect to distance x in the direction in which the change is a maximum.
power: the rate of doing work.
power of a convex lens (P): the reciprocal of the focal length. It is a measure of the strength of a lens as used by optometrists and ophthalmologists.
power stations: usually rely on thermal energy, gravitational potential energy or wind power to supply the kinetic energy to rotate a turbine. The turbine contains blades that are made to rotate by the force of water, gas, steam or wind. As the turbine rotates, it turns the shaft of a generator. The electrical energy can be produced by rotating coils in a magnetic field.
precision: an indication of the agreement among a number of measurements made in the same way, as indicated by the absolute error. A precise experiment has a low random error.
preferential absorption: the phenomenon in which certain crystals only transmit the vertical or horizontal component of the electric vector of an electromagnetic wave.
pressure: the force exerted per unit area. The SI unit of pressure is the pascal (Pa).
principal axis: the line that passes through the centre of curvature and the centre of the refracting surface.
principal focal plane: the plane that passes through the principal focus and is perpendicular to the principal axis.
principal focus (F): the point through which rays parallel and close to the principal axis pass after refraction if the lens is convex, or appear to come from if the lens is concave.
principle of superposition: as applied to wave motion, states that the displacement at a point where two or more waves meet is the vector sum of the individual displacements of each wave at that point.
proper length: the length of an object as measured by an observer at rest with respect to the object.
proper time: the time interval between two events as measured by an observer who sees the events take place at the same point in space.
proton number: the number of protons in a nucleus.
protostar: a stage in the formation of a star in which the star is self-luminous but in which nuclear fusion has not yet started.
public switched telephone network (PSTN): land based telephone exchange.
pulsar(s): a pulsating radio source believed to be a rapidly rotating neutron star.
pulse oximetry: a non-invasive technique used to monitor the oxygen content of haemoglobin.
pump storage systems: used in off-peak electricity demand periods. The water is pumped from low reservoirs to higher reservoirs during this period.
Q
quality: the quality of an X-ray beam is a term used to describe its penetrating power.
quality factor: approximately equal in value to the number of oscillations that occur before all the energy of an oscillator is dissipated.
quantum: a discrete packet of energy associated with electromagnetic radiation (see photon). Literally from the Latin for "how much".
quantum mechanics: the theory proposed in 1926/7 that replaced Newtonian physics.
quantum numbers: the different states in which an electron can exist are determined by four quantum numbers: principal, orbital, magnetic and spin.
quark confinement: the property that quarks are always found in groups that are colourless.
quarks: particles with a size of less than 10⁻¹⁸ m that can never be found in isolation, as they are trapped inside other composite particles called hadrons, of which the proton, the neutron and mesons are examples.
quasars: very distant and very luminous stellar-like objects.
radiation shielding: ensures the safety of personnel working inside and around the reactor from suffering the ill effects of radiation exposure. There are usually two shields: several metres of high-density concrete to protect the walls of the reactor core from radiation leakage and to help reflect neutrons back into the core, and a biological shield, made of several centimetres of high-density concrete, to protect personnel.
radioactive decay: the spontaneous emission by the nuclei of certain atoms of radiation in the form of alpha particles or beta particles and/or gamma radiation. The decay process cannot be controlled by chemical and physical means.
radioactivity: see natural radioactivity.
radius of curvature (R): the radius of the sphere from which the lens is made.
random uncertainties: are due to variations in the performance of the instrument and the operator. Even when systematic errors have been allowed for, some error remains.
rank advance: as peat became buried beneath more plant matter, the pressure and temperature increased and the water was squeezed out of it. As the material became compacted, the peat was converted to lignite, then to sub-bituminous coal and finally to bituminous coal. At each stage in the rank advance, the coal has a higher carbon content and a higher energy content per unit mass.
rarefaction: in a sound wave this refers to regions of minimum pressure.
Rayleigh criterion: the images of two sources will just be resolved by an image forming system if the central maximum of one diffraction pattern image coincides with the first minimum of the other diffraction pattern image.
real image: an image that can be seen on a screen that has been put at the point where the rays intersect at a single point.
red giant star: an evolutionary phase of main sequence stars, usually with mass less than about 4 solar masses, characterized by low temperature and high luminosity.
red-shift: the Doppler shift of light observed from receding objects.
R
r.f (radio frequency) amplifier an amplifier that amplifies signals in the radio frequency range (several kHz to about 100 Mhz) radiation the energy produced by a source because of its temperature that travels as electromagnetic waves. It does not need the presence of matter for its transfer.
reflection occurs when a wave is incident at a boundary between two different media and results in some of the energy of the wave being returned into the medium in which it was travelling before incidence.
refraction occurs when a wave is incident at a boundary between two different media and results in some of the energy of the incident wave being transmitted across the boundary. If the wavefronts are not parallel to the boundary, the direction of travel of the wave is changed.
refractive index (n) defined as the ratio of the sine of the angle of incidence of light in a vacuum to the sine of the angle of refraction in the medium whose refractive index is n.
relative uncertainty equals the absolute uncertainty divided by the measurement. It has no units.
renewable energy source one that is permanent or one that can be replenished as it is used. Renewable sources being developed for commercial use include solar energy, biomass, wind energy, tidal energy, wave energy, hydro-electric energy and geothermal energy.
reshapers devices used to reshape pulses in an optic fibre.
resolving power the minimum angle of resolution.
resonance this occurs when the frequency of forced oscillations is equal to the natural frequency of the system that is being forced.
rest mass-energy the energy that is equivalent to a body's rest mass.
rest mass the mass of an object as measured by an observer at rest with respect to the object.
rods photoreceptors that have fast response rates and are sensitive at low light levels, but are insensitive to colour. There are around 120 million of them.
root-mean-square (r.m.s.) value the power dissipated in a resistor by an alternating current that varies sinusoidally between $I_0$ and $-I_0$ is equal to the power dissipated by a direct current of $I_0/\sqrt{2}$. This d.c. current is known as the r.m.s. equivalent current of the alternating current.
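The r.m.s. entry above can be checked numerically. The following is a minimal Python sketch, assuming a sinusoidal current with an illustrative peak value of 2.0 A; the function name `rms` and the number of samples are choices made for this example and do not come from the text.

```python
import math

def rms(samples):
    """Return the root-mean-square value of a sequence of numbers."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))

# One full cycle of a sinusoidal current with peak value I0 (assumed value).
I0 = 2.0   # peak current in amperes, chosen for illustration
N = 1000   # number of equally spaced samples over one cycle
cycle = [I0 * math.sin(2 * math.pi * k / N) for k in range(N)]

i_rms = rms(cycle)
# The sampled r.m.s. value should agree with I0 / sqrt(2).
print(i_rms, I0 / math.sqrt(2))
```

Averaging the square of the current, rather than the current itself, is what makes the result relate to the power dissipated in a resistor.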
S
Sankey diagram in a Sankey diagram, the thickness of each arrow gives an indication of the scale of each energy transformation. The total energy before the energy transfer is equal to the total energy after the transfer, otherwise the law of conservation of energy would be violated.
scalar a quantity that has only magnitude.
scattering the deflection of EM radiation from its original path due to its collisions with particles in a medium.
Schmitt trigger a circuit designed to reshape digital electrical signals.
scientific notation expressing numbers using powers of ten.
scotopic vision the ability to see at low light levels (below about 0.034 candela per square metre), for which the rods are responsible. Rods do not mediate colour and are sometimes termed colour blind. They are also said to have low spatial resolution (acuity).
second the time for 9 192 631 770 vibrations of the cesium-133 atom.
second law of thermodynamics implies that thermal energy cannot spontaneously transfer from a region of low temperature to a region of high temperature.
sensors input transducers that allow for the transfer of energy from one form to another.
SI unit an international system of units including the metric system. SI units are those of Le Système International d'Unités, adopted in 1960 by the Conférence Générale des Poids et Mesures.
sideband frequencies a modulated wave consists of the carrier wave plus two waves, one of frequency $(f_c - f_s)$ and the other of frequency $(f_c + f_s)$. These two frequencies are called the sideband frequencies.
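As a small illustration of the sideband frequencies entry, the sketch below computes the two sidebands for a carrier and a signal frequency. The function name and the 1 MHz and 5 kHz values are hypothetical and are used only for this example.

```python
def sideband_frequencies(f_carrier, f_signal):
    """Return the (lower, upper) sideband frequencies, in the units of the inputs."""
    return f_carrier - f_signal, f_carrier + f_signal

# Hypothetical values: a 1 MHz carrier modulated by a 5 kHz signal wave.
lower, upper = sideband_frequencies(1_000_000, 5_000)
print(lower, upper)
```

The two results sit symmetrically either side of the carrier frequency, separated from it by the signal frequency.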
signal wave the name given to the wave that carries information.
significant figures/digits (sf/sd) are those digits that are known with certainty followed by the first digit which is uncertain.
simple harmonic motion occurs when the force acting on a system is directed towards the equilibrium position of the system and is proportional to the displacement of the system from equilibrium.
Snell's law is usually applied to light waves and states that when light travels from one medium into another, the ratio of the sine of the angle of incidence to the sine of the angle of refraction is a constant.
solar constant the average radiant power radiated to an area placed perpendicular to the outer surface of the Earth's atmosphere while the Earth is at its mean distance from the Sun.
SONAR (sound navigation and ranging) the use of sound waves to detect and estimate the range of submerged objects. In the 1930s it had its applications in medical therapy.
sound intensity the average power per unit area of a sound wave that is incident perpendicular to the direction of propagation. The units of sound intensity are watts per square metre, W m$^{-2}$. As the sound spreads out from its source, the intensity I is reduced as the inverse square of the distance d from the source.
source independence the name given to the phenomenon in which audio and visual digital data can be transmitted using the same channel.
space-time a coordinate system consisting of three dimensions of space and one of time.
spacetime diagram the representation of the motion of an object in spacetime.
specific heat see specific heat capacity.
specific heat capacity the heat capacity per unit mass. It is defined as the quantity of thermal energy required to raise the temperature of one kilogram of a substance by one kelvin.
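The definition of specific heat capacity above leads directly to the relation Q = m c ΔT. The following is a minimal sketch of that calculation; the function name is made up for this example, and the value used for water (about 4180 J kg⁻¹ K⁻¹) is an approximate figure assumed here, not a value quoted in this glossary.

```python
def thermal_energy(mass_kg, specific_heat, delta_t):
    """Thermal energy in joules needed to change the temperature of a sample.

    mass_kg       -- mass of the substance in kilograms
    specific_heat -- specific heat capacity in J kg^-1 K^-1
    delta_t       -- temperature change in kelvin (or degrees Celsius)
    """
    return mass_kg * specific_heat * delta_t

# Assumed approximate specific heat capacity of water.
C_WATER = 4180.0  # J kg^-1 K^-1

# Energy needed to warm 0.5 kg of water by 20 K.
q = thermal_energy(0.5, C_WATER, 20.0)
print(q)
```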
spectral classes a classification of stars according to their observed spectrum.
speed see average speed and instantaneous speed.
spherical aberration occurs because the rays that refract at the outer edges of a lens will have a different focal length to those rays that refract near the principal axis. To put it another way, spherical aberration occurs because the rays incident near the edges of a converging lens are refracted more than the paraxial rays.
spring constant the constant k relating the extension x of a spring to the force F causing the extension: F = kx.
standard form see scientific notation.
standard notation see scientific notation.
stationary waves also referred to as standing waves; waves in which there is no propagation of energy between points along the wave. The amplitude of a stationary wave varies with position along the wave.
steam engine an example of an external combustion engine. The fuel is burnt outside the engine and the thermal energy is transferred to a piston or a turbine chamber by means of steam.
Stefan's law the total area under a spectral emission curve for a given temperature T represents the total energy E radiated per square metre per unit time; it has been found to be directly proportional to the fourth power of the temperature, $T^4$.
Stefan-Boltzmann law a law that relates the luminosity of an object to its absolute temperature and area.
stellar cluster a number of stars that were all created at about the same time and that are held together in a group by gravitational attraction.
stellar interferometer a radio telescope that consists of two or more parabolic receiving dishes.
step-down transformer a transformer in which the number of turns on the secondary coil, Ns, is less than the number of turns on the primary coil, Np.
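The step-down transformer entry compares the secondary turns Ns with the primary turns Np. The sketch below classifies a transformer from its turns and estimates the secondary voltage. It assumes an ideal (lossless) transformer obeying Vs/Vp = Ns/Np, which is a standard idealisation rather than something stated in this glossary, and the function names and numbers are illustrative only.

```python
def transformer_type(n_primary, n_secondary):
    """Classify a transformer by comparing secondary turns with primary turns."""
    if n_secondary < n_primary:
        return "step-down"
    if n_secondary > n_primary:
        return "step-up"
    return "1:1"

def secondary_voltage(v_primary, n_primary, n_secondary):
    """Secondary voltage of an ideal transformer, using Vs / Vp = Ns / Np."""
    return v_primary * n_secondary / n_primary

# Illustrative values: 240 V supply, 1000 primary turns, 50 secondary turns.
kind = transformer_type(1000, 50)
v_s = secondary_voltage(240.0, 1000, 50)
print(kind, v_s)
```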
step-index fibre an optic fibre in which the refractive index of the different materials comprising the fibre changes by discrete amounts.
step-up transformer a transformer in which the number of turns on the secondary coil, Ns, is greater than the number of turns on the primary coil, Np.
strain viewer a device that uses polarized light to view the stress produced in materials subject to strain. It consists of two polaroids with the material under strain placed between them.
string theory an alternative to quantum theory that proposes that each fundamental particle consists of an oscillating string of a small size compared with the proton. Rather than talking about mathematical particles, string theory talks about oscillating strings that are lines or loops of about $10^{-35}$ m, and membranes in small dimensions other than the three dimensions that we presently use.
strong nuclear interaction the short range force of attraction between nucleons.
super red-giant star an evolutionary phase of main sequence stars, usually with mass greater than about 8 $M_{\text{Sun}}$, characterized by low temperature and very high luminosity.
surface heat capacity $C_s$ the energy required to raise the temperature of a unit area of a planet's surface by one kelvin. It is measured in J m$^{-2}$ K$^{-1}$.
synchrotrons the most powerful members of the accelerator family.
system any object or set of objects that is being investigated. The surroundings will then be everything in the Universe apart from the system.
systematic error causes a random set of measurements to be spread about a value other than the accepted value. It is a system or instrument error.
T
temperature a scalar quantity that gives an indication of the degree of hotness or coldness of a body. Alternatively, temperature is a macroscopic property that measures the average kinetic energy of particles on a defined scale such as the Celsius or Kelvin scales. At the microscopic level, temperature is regarded as the measure of the average random kinetic energy per molecule associated with its movements.
tension force this arises when a system is subjected to two equal and opposite forces.
terminal velocity the velocity reached when the magnitude of the frictional force acting on a body is equal to the magnitude of the driving force.
thermal (heat) capacity the change in thermal energy for a given change in temperature.
thermal energy (heat) if a system and its surroundings are at different temperatures and the system undergoes a process, the energy transferred by non-mechanical means is referred to as thermal energy (heat). It is measured in joules.
thermistors resistors that change resistance with temperature (the word is derived from "thermal resistors").
thermodynamic cycle a process in which the system is returned to the same state from which it started. That is, the initial and final states are the same in the cyclic process.
thermodynamic engine a device that transforms thermal energy to mechanical energy (work), as in an engine, or mechanical energy to thermal energy, as in refrigeration and air-conditioning systems.
thermodynamics the name given to the study of processes in which thermal energy is transferred as heat and as work.
three phase power there are 3 conductors on a transmission line to maximize the amount of power that can be generated. Each high voltage circuit has three phases. The generators at the power station supplying the power system have their coils connected through terminals at 120° to each other. When each generator at the power station rotates through a full rotation, the voltages and the currents rise and fall in each terminal in a synchronized manner.
threshold frequency the frequency below which photoelectric emission will not take place.
threshold intensity of hearing the minimum detectable intensity for a given frequency.
time dilation the slowing of time as observed by an inertial observer who is assumed to be at rest with respect to another, moving inertial reference system.
total internal reflection reflection in which all the light incident at a boundary between two media undergoes reflection.
transmission rate another name for bit-rate.
transmutation see artificial transmutation.
transverse waves in these types of wave the source that produces the wave vibrates at right angles to the direction of travel of the wave, i.e. the direction in which the energy carried by the wave is propagated. The particles of the medium through which the wave travels vibrate at right angles to the direction of travel of the wave (direction of energy propagation).
travelling wave a wave that propagates energy.
trough the minimum displacement of a medium through which a wave travels.
tuning circuit a circuit designed to respond to signals of a certain frequency.
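The time dilation entry above can be made quantitative with the Lorentz factor. The sketch below uses the standard special relativity result Δt = γ Δt₀ with γ = 1/√(1 − v²/c²); this formula is not stated in the glossary itself, and the function names and the speed of 0.6c are assumptions made for illustration.

```python
import math

C = 299_792_458.0  # speed of light in a vacuum, m/s

def lorentz_factor(speed):
    """Return gamma = 1 / sqrt(1 - v^2 / c^2) for a speed in m/s."""
    if not 0 <= speed < C:
        raise ValueError("speed must satisfy 0 <= speed < c")
    return 1.0 / math.sqrt(1.0 - (speed / C) ** 2)

def dilated_time(proper_time, speed):
    """Time interval measured by the observer who sees the clock moving."""
    return lorentz_factor(speed) * proper_time

# Illustrative case: a clock moving at 0.6c relative to the observer.
gamma = lorentz_factor(0.6 * C)
print(gamma, dilated_time(1.0, 0.6 * C))
```

For everyday speeds the factor is so close to 1 that the effect is not noticeable, which is why the example uses a large fraction of the speed of light.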
U
Uncertainty principle see Heisenberg Uncertainty Principle.
unit of current the coulomb per second (C s$^{-1}$); this unit is called the ampere (A).
V
variable a quantity that varies when another quantity is changed. A variable can be an independent variable, a dependent variable or a controlled variable. An independent variable is altered while the dependent variable is measured. Controlled variables are the other variables that may be present but are kept constant.
vector a quantity that has both magnitude and direction.
vector resolution giving the x and y components of a vector.
velocity see average velocity and instantaneous velocity.
virtual earth a point in a circuit that is effectively at earth potential (zero volts).
virtual image an image that appears to come from a single point when rays are extrapolated to that point.
virtual particle a particle that cannot be observed during an interaction. A virtual photon is said to be the carrier of the electromagnetic force.
voltmeter is used to measure the voltage drop across part of an electric circuit and is always connected in parallel.
W
W$^+$, W$^-$ and Z$^0$ the exchange particles involved in the weak nuclear interaction.
wave number the number of waves per centimetre (cm$^{-1}$).
wave speed the speed with which energy is carried in the medium by the wave. A very important fact is that wave speed depends only on the nature and properties of the medium.
wavelength the distance along the medium between two successive particles that have the same displacement.
wave-mechanics another name for quantum mechanics.
weight another term for the force of gravity acting on an object.
weightlessness if the weight of an object is defined in terms of a weighing process such as the reading on a set of bathroom scales, which in effect measures the contact force between the object and the scales, then objects in free fall are weightless.
Wien Displacement Law a law that relates the wavelength of maximum intensity in the blackbody spectrum of an object to the absolute temperature of the object.
work the product of force and displacement in the direction of the force.
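The Wien Displacement Law entry above can be illustrated with a short calculation. The sketch assumes the usual form of the law, λ_max T = constant, with the constant taken as approximately 2.898 × 10⁻³ m K; neither the formula nor the constant is given in this glossary, and the function name and the 5800 K temperature are illustrative assumptions.

```python
WIEN_CONSTANT = 2.898e-3  # m K, approximate value assumed for this sketch

def peak_wavelength(temperature_k):
    """Wavelength (in metres) of maximum intensity for a blackbody at temperature_k."""
    if temperature_k <= 0:
        raise ValueError("absolute temperature must be positive")
    return WIEN_CONSTANT / temperature_k

# Illustrative case: a star with a surface temperature of about 5800 K.
lam = peak_wavelength(5800.0)
print(lam)  # a value in the visible range, of the order of 5e-7 m
```

Doubling the absolute temperature halves the peak wavelength, which is why hotter stars appear bluer.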
INDEX
A
a.f (audio frequency) amplifier 397
aberration 437
absolute magnitude 369
absolute temperature 289
absolute zero 77, 274
absorbed dose 506
absorption spectrum 363
acceleration 33
accommodation 354, 432
accuracy 7
achromatic doublet 438
acoustic impedance 480, 498
adiabatic 279
aerial 397
Airy George 303
air resistance 38
albedo 229
AM 396
ampere 129
Ampère André-Marie 129
amplifier 404
amplitude 115
amplitude modulation 393
analogue 346
Anderson Carl 177
angular magnification 433
antineutrino 178
antinodes 123, 294
antiparticles 521
aperture 428
apparent brightness 362
apparent magnitude 369
Aristotle 73
ASCII code 348
astronomical unit 367
asymptotic freedom 547
atomic mass unit 183
attenuation 403, 490, 499
audiogram 487
audiology 487
auditory canal 479
Avogadro's hypothesis 273
Avogadro Amedeo 80
Avogadro number 80
B
bandwidth 395
baryon number 540
baryons 518
beam line 537
becquerel 181
Bell Jocelyn 382
big bang 375, 515
binary stars 365
binary system 345
biological half-life 509
black-body radiation 233
black hole 383
black holes 472
Bohr Niels 175
Boltzmann's constant 549
Boltzmann Ludwig 290
Boyle Robert 97
brachytherapy 510
Bragg scattering equation 446
Bragg William 446
breeder reactors 214
bremsstrahlung radiation 490, 534
Brewster's law 309
Brewster angle 309
Brewster David 309
Broglie Louis de 336
Brownian motion 88
C
capacitance 349
capacitor 349
Carnot engine 284
Carnot Nicolas Léonard Sadi 284
carrier wave 392
cell phones 413
Celsius Anders 77
centre of curvature (C) 428
centrifugal force 67
centripetal acceleration 67
centripetal force 67
cepheid variables 372
Chadwick James 176
Chandrasekhar limit 381
charge-coupled device (CCD) 349
Charles Jacques 97
chemical energy 62
chemical potential energy 78
Chernobyl 210
chromatic aberration 438
circle of least confusion 437
circular motion 66
coal 196
coaxial cable 406
cochlea 480
coherent sources 123
collisions 62
colour 541
colour blindness 355
colour charge 541
colour force 541
compact disc 346
compact discs (CDs) 427
complementary metal oxide semiconductor (CMOS) 349
compression 115, 398
computed tomography 496
conduction 79
conductive hearing loss 486
conductor 155
conservation of energy 253
conservation of mass-energy 465
constellation 359
constructive interference 123
contrast-enhancing media 494
convection 79
conventional current 165
convex lens 429
cosmic rays 515
coulomb 157
Coulomb's Law 157
Coulomb Charles Augustin 156
crest 115
Crick Francis 447
critical angle 402
critical mass 210
crossed fields 531
crude oil 196
Curie Pierre and Marie 177
cyclotron 532
cyclotron frequency 533
D
damping 109
data transfer rate 399
Davisson Clinton 337
Davy Humphry 313
de Brahe Tycho 268
de Broglie equation 530
demodulator 397
derived quantity 5
destructive interference 123
Diesel Rudolf 191
differential amplifier 408
diffraction 121, 301
diffraction grating 443
diffusion 79
digital information 346
digital versatile disc 346
digital video discs (DVDs) 427
dioptre 429
Dirac Paul 177, 521
dispersion 403, 422
displacement 33
Doppler C. J. 297
Doppler effect 297
dose equivalent 506
dosimetry 504
drag force 38
drift tube accelerator 531
drift tubes 531
drift velocity 125
E
ear-drum 479
Eddington Arthur 47
effective half-life 509
effective resistance 140
efficiency 63, 191
Einstein Albert 210, 417
Einstein photoelectric equation 335
Einstein principle of equivalence 470
electrical energy 62
electrical resistance 130
electrical strain gauge 146
electric current 128
electric field strength 157
electric potential energy 259, 260
electrolysis 313
electromagnetic induction 314
electromagnetic spectrum 418
electromagnetic waves 117, 391
electron-volt 127
electron microscope 306
electrostatics 155
electroweak 516
electroweak theory 548
elementary particles 517
emission spectra 175
emissivity 235
EM spectrum 418
endoscopy 501
energy 56, 113, 189
energy balance climate model 237
energy band theory 156
energy density 197
enhanced greenhouse effect 231
entropy 289
equipotential lines 263
equipotential surface 257
equipotential surfaces 263
error bar 13
ether 455
ethernet 406
event horizon 472
exchange particle 523
exchange particles 518
exponential decay 181
exposure 505
eye 353
F
Faraday's Law 317
Faraday Michael 157, 313
far point 432
feedback resistance 409
Fermi Enrico 178
Feynman diagrams 524
Feynman Richard 524
film badge 507
filters 495
first dynode 536
first law of thermodynamics 278
Fizeau A.H.L. 416
flavors 517
Fleming's left-hand rule 167
flux density 228
flux linkage 317
FM 396
focal length 428
forced oscillations 111
fossil fuels 195
Foucault Jean 416
fovea 355
frame of reference 453
Fraunhofer diffraction 301
free-body diagrams 45
frequency 115
frequency modulation 393
Fresnel diffraction 302
fuels 200
fundamental 295
fundamental forces 45
fundamental interactions 522
G
galaxies 358, 384
Galilei Galileo 37, 73
gamma radiation 421
gamma ray bursters 467
Gay-Lussac 97
Geiger-Müller counter 421
Gell-Mann Murray 516, 539
generator 322
geodesic 472
geostationary satellite 407
Germer Lester 337
Glaser Donald 535
Glashow Sheldon 547
global warming 242
gluons 523
Goddard Robert 56
Goodricke John 372
graphs 14
gravitational field 152
gravitational lensing 475
gravitational mass 469
gravitational potential 255
graviton 523
Greek symbols 30
Gross David 547
H
hadrons 518
Hahn Otto 210
half-life 181, 343
half-value thickness 492
harmonics 295
harmonic series 295
heat engine 281
heat pump 282
heavily damped 109
Heisenberg uncertainty principle 524
Heisenberg Werner 340, 524
Henry Joseph 313
Hertz Heinrich 416
Hertzsprung-Russell diagrams 366
Hertzsprung Ejnar 366
Higgs field 545
Higgs particle (boson) 546
Hooke Robert 44
Hubble's law 386
Hubble Edwin 375
Humason Milton 375
Huygens Christian 415
hydro-electric power 220
I
ideal gas 273
induced current 314
induced emf 315
inertia 43
inertial mass 469
inertial reference frame 454
infra-red radiation 420
insolation 229
instantaneous speed 35
insulator 156
intensity 115
interference 449
interference pattern two point sources 123
internal combustion engine 190, 282
internal energy 79, 277
internal resistance 138
interpolated resolution 351
inverse photoelectric effect 444
inverting amplifier 409
ionising radiation 504
ionization current 179
isobaric 278
isochoric 278
isolated system 277
isothermal 279
isotopes 176
J
Joliot Frederic 177
Joule James Prescott 57, 74
K
Kelvin-Planck statement 289
Kepler's third law 268
Kepler Johannes 268
kinetic energy 59, 78
kinetic theory 88
Kirchhoff G.R. 139
L
laser light 424
lasers 501
latent heat 92
latent heat of transformation 92
Lavoisier Antoine 73
Law of conservation of electric charge 155
law of conservation of energy 278
law of gravitation 151
law of reflection 118
Lawrence Ernest 516
laws of Bergonie and Tribondeau 505
laws of mechanics 455
least significant bit 346
Leavitt Henrietta 372
left-hand palm rule 315
lens 428
Lenz's Law 318
Lenz Heinrich 317
lepton number 543
leptons 517
light dependent resistor 145
light year 360
linear attenuation coefficient 491
line spectrum 175
liquid drop model 209
load factor 205
longitudinal waves 114
loudness 483
loudspeaker 397
luminosity 361
M
Mach's Principle 469
macroscopic 79
magnetic field 164, 165
magnetic flux 316
magnetic resonance imaging (MRI) 500
magnetron 531
magnification 429
magnifiers 433
magnifying power 433
Malus' law 311
Malus E.L. 308
Malus Etienne 311
mass 43
mass defect 184
material dispersion 403
matter waves 337
Max Planck 423
Maxwell's theory 416, 455
Maxwell James Clerk 416, 455
meson 523
Michelson Albert A. 417
microwaves 420
middle ear 480
Millikan Robert 173
minimum angle of resolution 305
mobile phone 412
modal dispersion 403
moderator 212
modes 403
modulation 392
mole 80
molecular dipole moment 232
momentum 51
monochromatic 424
monomode fibres 404
Morse code 406
most significant bit 346
motor effect 167
moving particle theory 88
multiplexing 399
N
natural frequency 110
natural gas 196
Ne'eman Yuval 539
near point 354, 432
nebulae 358
nematic liquid crystal 312
neutral current reaction 548
neutron activation 209
neutron number 176
neutrons 176
Newton Sir Isaac 33, 73, 415
nibble 398
nodes 123, 294
non-renewable source 196
nuclear binding energy 184
nuclear energy 62
nuclear fission 186, 215
nuclear fusion 187, 195
nuclear magnetic resonance 500
nuclear power 214
nucleon 176
nucleon number 176
nucleosynthesis 549
nuclide 176
numerical aperture 307
Nyquist theorem 399
O
Oersted 313
Oersted Hans Christian 165
Ohm's Law 132
Ohm Georg Simon 132
oil shale 197
oil tar 197
Olbers' paradox 374
Olbers Heinrich 374
Onnes H. Kamerlingh 132
operational amplifier 408
Oppenheimer-Volkoff limit 381
optical microscope 306
optical path difference (opd) 449
optic fibres 406
order of magnitude 1
organ of Corti 481
oscillations 109
ossicles 480
Otto Nikolaus 191
outer ear 479
oval window 480
P
pair annihilation 527
pair production 527
parallax 367
parsec 367
Pauli exclusion principle 522
Payne David 404
peat 196
Penzias Arno 548
percentage error 10
period 115
periodicity 113
permeability of free space 129
permittivity constant 156
phase difference 101
phons 483
photo-electric effect 333
photomultiplier tube 536
photon 334, 523
photopic vision 355
photovoltaic devices 218
physical half-life 509
piezoelectric crystal 498
pinna 479
pixels 312, 350
Planck constant 175, 334, 335, 423
Planck Max 175
plasma 216
polarimeter 311
polarization 308
polaroid 308
pole (P) 428
Politzer David 547
pollutants 207
population inversion 426
positron 177
potential difference 125
potential energy 60
Powell Cecil Frank 523
power 62
power rating 133
power stations 192, 205
precision 7
preferential absorption 308
principal axis 428
principal focal plane 428
principal focus 428
principle of superposition 118, 122
projectiles 251
proper length 460
proper time 458
proton number 176
protostar 379
public switched telephone network 413
pulsars 383
pulse oximetry 501
Q
quality factor 110, 506
quantum 334
quantum chromodynamics 516, 541
quantum efficiency 351
quantum mechanics model 520
quantum numbers 520
quark confinement 543
quarks 518, 539
quasars 358, 383
R
r.f (radio frequency) amplifier 397
radiant power 228
radiation 79
radiation scattering 423
radioactive decay 173, 342
radioactive tracers 511
radioactivity 177
radio-isotopes 511
radio-pharmaceuticals 512
radiotherapy 510
radio waves 419
radius of curvature (R) 428
random errors 7
rarefaction 115
Rayleigh criterion 305
Rayleigh Lord 424
Rayleigh scattering 424
real image 429
Reber Grote 383
red-shift 375, 473
reflection 118
refraction 119
refractive index 120
refractive indexes 422
refrigerator 283
relative biological effectiveness 507
relative error 10
renewable energy source 197
reshapers 405
resistance 131
resistivity 131
resistor 130
resolution 304, 499
resonance 112
resonance frequency 533
rest mass 464
rest mass energy 464
retina 354
right-hand palm rule 167, 315
Rømer Ole 415
Röntgen Wilhelm 444
Roosevelt Franklin 210
Rumford Count 73
Russell Henry 366
Rutherford Ernest 173
S
Salam Abdus 547
Salter Duck 224
Sankey diagram 192
scalars 22
Schmitt trigger 411
Schrödinger Erwin 339
scientific notation 5
scintillation counter 536
scotopic vision 355
seismic exploration 206
self-inductance 328
semi-circular canals 480
sensors 145
sideband 395
signal wave 392
significant figures 8
simple harmonic motion 101, 107
singularity 548
SI units 4
Slipher Vesto 375
Snell's law 120
Snell Willebrord 120
solar constant 219, 238
solar heating 218
SONAR 497
sound intensity 483
source independence 398
space-time 471
spacetime diagram 471
Special Theory 456
specific heat capacity 84
speed of light 416
spherical aberration 437
spin quantum number 521
spring constant 44
standard model 548
stars
  G class 363
  M class 363
  O class 363
  spectral classes 364
stationary waves 293
steam engine 202
Stefan-Boltzmann law 362
Stefan's law 234
stellar cluster 359
stellar interferometer 306
step-index 403
strain viewer 311
strangeness 542
string theory 550
strong nuclear interaction 177
surface heat capacity 236
synchrotron 533
systematic errors 7
T
target material 495
teletherapy 511
tension force 44
terminal velocity 38
thermal capacity 83
thermal energy 61, 76, 79
thermal reactors 211
thermistors 146
thermodynamic cycle 190, 280
thermodynamics 277
thermoelectric converters 219
thermometric 76
Third Law of Thermodynamics 284
Thomson J. J. 154, 173
threshold frequency 333
time dilation 458, 459
total internal reflection 402
transformer 326
transmission rate 398
transmutation 182, 209
transverse waves 114
travelling-wave accelerator 531
travelling wave 294
trough 115
tube current 495
tube voltage 495
tuning circuit 397
twin paradox 462
types of star 364
U
ultra-violet radiation 420
ultrasound 497
uncertainties 4, 10
uncertainty principle 340
universal gas constant 273
V
Van de Graaff Robert 515
vectors 22, 33
velocity 33
vertex 524
virtual earth 409
virtual image 429
virtual particle 525
visible light 420
W
Watson James 447
Watt James 63
wave-mechanics 339
wavelength 115
wave speed 115
wedge films 451
weight 43
weightlessness 270
Weinberg Steven 547
Wien Displacement Law 362
Wilczek Frank 547
Wilson Robert 548
wind turbines 223
wire chamber 536
work 56, 277
work-function 334
working fluid 281
X
X-radiation 421
X-ray beam 494
X-ray crystallography 447
Y
Young's double slit experiment 440
Young Thomas 440
Yukawa Hideki 523